Query gi|255764474|ref|YP_003064854.2| glycosyl transferase group 1 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 352 No_of_seqs 143 out of 18416 Neff 9.7 Searched_HMMs 39220 Date Sun May 29 17:09:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764474.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 cd03796 GT1_PIG-A_like This fa 100.0 0 0 403.2 31.5 330 5-351 1-371 (398) 2 cd03800 GT1_Sucrose_synthase T 100.0 0 0 379.5 31.4 337 5-343 1-398 (398) 3 TIGR03449 mycothiol_MshA UDP-N 100.0 0 0 379.7 31.1 331 17-351 19-405 (405) 4 cd04962 GT1_like_5 This family 100.0 0 0 377.4 31.1 332 5-348 2-371 (371) 5 cd03814 GT1_like_2 This family 100.0 0 0 370.6 31.3 326 5-346 1-364 (364) 6 cd03817 GT1_UGDG_like This fam 100.0 0 0 366.8 31.8 331 5-348 1-374 (374) 7 cd03799 GT1_amsK_like This is 100.0 0 0 370.1 29.0 319 5-341 1-355 (355) 8 TIGR03088 stp2 sugar transfera 100.0 0 0 366.9 31.4 335 5-349 3-374 (374) 9 cd03801 GT1_YqgM_like This fam 100.0 0 0 363.6 32.0 333 5-346 1-374 (374) 10 cd04951 GT1_WbdM_like This fam 100.0 0 0 368.5 27.2 330 5-346 1-359 (360) 11 cd03807 GT1_WbnK_like This fam 100.0 0 0 362.5 31.6 331 5-346 1-365 (365) 12 cd03819 GT1_WavL_like This fam 100.0 0 0 365.8 28.5 322 9-337 3-355 (355) 13 cd03798 GT1_wlbH_like This fam 100.0 0 0 351.9 31.0 332 6-348 1-377 (377) 14 TIGR02149 glgA_Coryne glycogen 100.0 0 0 362.6 21.1 339 5-349 2-416 (416) 15 cd03820 GT1_amsD_like This fam 100.0 0 0 349.7 30.0 318 5-343 1-348 (348) 16 cd03795 GT1_like_4 This family 100.0 0 0 350.4 29.2 322 5-338 1-357 (357) 17 cd03808 GT1_cap1E_like This fa 100.0 0 0 349.3 29.0 321 5-343 1-359 (359) 18 cd03811 GT1_WabH_like This fam 100.0 0 0 347.4 29.7 317 5-333 1-352 (353) 19 cd05844 GT1_like_7 Glycosyltra 100.0 0 0 351.4 26.3 320 5-344 1-367 (367) 20 cd03822 GT1_ecORF704_like This 100.0 0 0 343.4 31.6 333 5-346 1-366 (366) 21 cd03812 GT1_CapH_like This fam 100.0 0 0 345.2 29.1 323 5-341 1-355 (358) 22 cd03821 GT1_Bme6_like This fam 100.0 0 0 342.4 30.4 331 5-343 1-375 (375) 23 cd03805 GT1_ALG2_like This fam 100.0 0 0 344.1 29.1 329 5-341 2-392 (392) 24 PRK10307 predicted glycosyl tr 100.0 0 0 338.9 31.8 337 5-349 2-415 (415) 25 cd03823 GT1_ExpE7_like This fa 100.0 0 0 344.1 27.4 317 5-347 1-359 (359) 26 cd03825 GT1_wcfI_like This fam 100.0 0 0 342.3 26.9 315 5-348 2-365 (365) 27 cd03794 GT1_wbuB_like This fam 100.0 0 0 332.7 28.9 328 5-342 1-394 (394) 28 cd03813 GT1_like_3 This family 100.0 0 0 340.1 21.4 271 67-346 172-475 (475) 29 cd03809 GT1_mtfB_like This fam 100.0 2.8E-45 0 324.6 28.6 313 15-343 12-365 (365) 30 cd04955 GT1_like_6 This family 100.0 3.8E-44 0 316.3 30.0 325 5-346 1-363 (363) 31 cd03792 GT1_Trehalose_phosphor 100.0 1.5E-44 0 318.9 27.3 336 8-348 2-372 (372) 32 cd03802 GT1_AviGT4_like This f 100.0 2.9E-43 0 310.3 27.9 307 5-346 2-335 (335) 33 cd03818 GT1_ExpC_like This fam 100.0 6.1E-43 0 308.2 27.2 327 5-342 1-395 (396) 34 PRK09922 UDP-D-galactose:(gluc 100.0 4.7E-42 0 302.2 28.1 322 5-348 4-359 (361) 35 cd03791 GT1_Glycogen_synthase_ 100.0 2.5E-41 0 297.3 27.6 332 5-347 1-476 (476) 36 PRK00654 glgA glycogen synthas 100.0 2.6E-41 0 297.1 26.6 336 5-350 2-476 (476) 37 cd04946 GT1_AmsK_like This fam 100.0 1.8E-40 4.2E-45 291.5 26.9 316 6-341 2-405 (407) 38 cd04949 GT1_gtfA_like This fam 100.0 1E-38 2.6E-43 279.8 20.6 211 114-341 155-372 (372) 39 KOG1111 consensus 100.0 1.2E-37 3E-42 272.5 17.2 327 5-351 2-370 (426) 40 cd03804 GT1_wbaZ_like This fam 100.0 7.2E-36 1.8E-40 260.5 20.6 297 5-341 1-350 (351) 41 TIGR03087 stp1 sugar transfera 100.0 8.1E-34 2.1E-38 246.7 27.4 320 6-347 1-396 (397) 42 TIGR02472 sucr_P_syn_N sucrose 100.0 3.2E-35 8.1E-40 256.2 19.6 325 18-345 26-444 (445) 43 TIGR02095 glgA glycogen/starch 100.0 3E-33 7.7E-38 242.9 19.4 278 63-348 140-517 (517) 44 PRK10125 predicted glycosyl tr 100.0 6.2E-29 1.6E-33 213.9 22.2 314 8-348 3-405 (405) 45 cd03806 GT1_ALG11_like This fa 100.0 1.7E-27 4.4E-32 204.1 24.3 322 5-339 2-418 (419) 46 pfam00534 Glycos_transf_1 Glyc 100.0 5.4E-29 1.4E-33 214.3 12.0 167 153-326 2-171 (172) 47 cd01635 Glycosyltransferase_GT 99.9 5E-26 1.3E-30 194.3 16.6 225 6-295 1-229 (229) 48 COG0297 GlgA Glycogen synthase 99.9 1.6E-25 4.1E-30 190.9 17.5 276 67-352 129-482 (487) 49 KOG0853 consensus 99.9 9.9E-26 2.5E-30 192.3 14.9 233 114-350 207-470 (495) 50 TIGR02470 sucr_synth sucrose s 99.9 1.6E-24 4E-29 184.2 12.5 277 64-344 387-750 (790) 51 COG0438 RfaG Glycosyltransfera 99.9 1.4E-21 3.6E-26 164.3 18.5 222 116-351 150-380 (381) 52 TIGR02468 sucrsPsyn_pln sucros 99.9 1.2E-22 3E-27 171.6 8.9 286 55-344 291-671 (1072) 53 cd04950 GT1_like_1 Glycosyltra 99.9 1.7E-20 4.4E-25 157.0 14.0 216 109-347 146-371 (373) 54 KOG1387 consensus 99.8 2.8E-17 7E-22 135.5 19.3 225 114-351 220-462 (465) 55 TIGR02918 TIGR02918 conserved 99.8 5.2E-19 1.3E-23 147.1 8.3 219 111-348 271-511 (511) 56 PRK13608 diacylglycerol glucos 99.8 7E-17 1.8E-21 132.8 17.4 261 62-346 98-370 (391) 57 cd03793 GT1_Glycogen_synthase_ 99.8 6.2E-16 1.6E-20 126.4 21.4 284 66-351 146-590 (590) 58 PRK13609 diacylglycerol glucos 99.7 4.1E-16 1E-20 127.6 15.6 260 62-345 98-369 (388) 59 pfam05693 Glycogen_syn Glycoge 99.7 1.8E-14 4.5E-19 116.6 20.1 283 68-352 143-586 (633) 60 cd03816 GT1_ALG1_like This fam 99.7 1.1E-13 2.7E-18 111.4 24.0 261 68-341 95-410 (415) 61 PRK10117 trehalose-6-phosphate 99.6 9.8E-13 2.5E-17 104.9 20.6 276 67-351 122-457 (474) 62 cd03788 GT1_TPS Trehalose-6-Ph 99.6 2.7E-12 6.9E-17 101.9 21.7 271 67-343 130-457 (460) 63 PRK00726 murG N-acetylglucosam 99.5 2.1E-11 5.3E-16 95.9 23.6 312 4-345 2-357 (359) 64 pfam00982 Glyco_transf_20 Glyc 99.5 1.6E-11 4.1E-16 96.6 21.3 271 68-345 138-468 (470) 65 PHA01630 putative group 1 glyc 99.5 2.1E-12 5.2E-17 102.7 16.2 286 23-347 17-332 (333) 66 PRK05749 3-deoxy-D-manno-octul 99.5 4.1E-11 1.1E-15 93.9 22.6 245 88-349 143-419 (423) 67 cd03785 GT1_MurG MurG is an N- 99.5 5.2E-11 1.3E-15 93.3 21.9 283 24-329 16-340 (350) 68 TIGR02400 trehalose_OtsA alpha 99.5 8.4E-13 2.2E-17 105.3 12.3 270 65-344 137-473 (476) 69 PHA01633 putative glycosyl tra 99.5 8.9E-12 2.3E-16 98.4 15.6 289 17-349 10-334 (335) 70 KOG2941 consensus 99.4 4.8E-10 1.2E-14 86.7 23.1 309 23-343 28-437 (444) 71 COG1519 KdtA 3-deoxy-D-manno-o 99.4 7.5E-10 1.9E-14 85.4 23.2 244 87-346 142-416 (419) 72 COG0707 MurG UDP-N-acetylgluco 99.4 2E-10 5E-15 89.3 19.3 261 57-343 80-353 (357) 73 COG0380 OtsA Trehalose-6-phosp 99.3 4.4E-10 1.1E-14 87.0 18.6 204 134-345 245-477 (486) 74 PRK12446 N-acetylglucosaminyl 99.3 1.6E-08 4E-13 76.5 23.2 308 3-342 1-350 (352) 75 PRK00025 lpxB lipid-A-disaccha 99.2 1.5E-08 3.7E-13 76.7 18.8 309 23-345 16-378 (382) 76 COG3914 Spy Predicted O-linked 98.9 8.4E-09 2.1E-13 78.4 9.1 187 155-351 419-617 (620) 77 TIGR01133 murG undecaprenyldip 98.9 4.8E-07 1.2E-11 66.5 17.4 304 18-338 15-365 (368) 78 TIGR02398 gluc_glyc_Psyn gluco 98.9 1.9E-08 4.9E-13 76.0 9.8 201 138-346 257-488 (495) 79 TIGR03492 conserved hypothetic 98.8 1.9E-06 4.9E-11 62.5 18.0 242 82-342 104-393 (396) 80 COG0763 LpxB Lipid A disacchar 98.8 1.3E-06 3.4E-11 63.6 17.0 251 62-326 79-357 (381) 81 pfam02684 LpxB Lipid-A-disacch 98.7 3.4E-06 8.6E-11 60.8 17.1 293 23-329 13-356 (373) 82 KOG3742 consensus 98.7 1.4E-07 3.7E-12 70.0 9.1 112 239-352 494-617 (692) 83 PRK09814 hypothetical protein; 98.6 3.1E-05 7.8E-10 54.4 19.4 270 19-329 16-318 (337) 84 cd04299 GT1_Glycogen_Phosphory 98.5 1.6E-06 4.1E-11 63.0 9.8 183 161-346 473-688 (778) 85 TIGR02094 more_P_ylases alpha- 98.3 3.6E-06 9.1E-11 60.7 7.8 183 161-346 394-616 (618) 86 PRK01021 lpxB lipid-A-disaccha 98.3 0.0002 5.1E-09 48.9 16.5 291 23-329 241-586 (607) 87 pfam04101 Glyco_tran_28_C Glyc 98.3 1.1E-05 2.8E-10 57.4 9.6 146 169-328 2-159 (167) 88 COG0381 WecB UDP-N-acetylgluco 98.2 0.00043 1.1E-08 46.7 16.8 268 57-346 81-369 (383) 89 COG4641 Uncharacterized protei 98.2 3.6E-05 9.2E-10 53.9 10.8 303 24-351 20-365 (373) 90 COG0058 GlgP Glucan phosphoryl 98.1 3.5E-05 8.9E-10 54.0 9.9 137 161-298 482-631 (750) 91 pfam04007 DUF354 Protein of un 98.0 0.00099 2.5E-08 44.2 14.7 256 25-312 17-309 (335) 92 TIGR03590 PseG pseudaminic aci 97.9 0.0013 3.3E-08 43.5 14.4 97 165-276 171-268 (280) 93 pfam02350 Epimerase_2 UDP-N-ac 97.9 0.0026 6.7E-08 41.4 17.3 246 57-320 56-325 (346) 94 cd03784 GT1_Gtf_like This fami 97.9 9.6E-05 2.4E-09 51.1 8.1 155 166-343 240-400 (401) 95 KOG1050 consensus 97.8 0.00048 1.2E-08 46.4 10.9 185 134-322 240-449 (732) 96 TIGR03568 NeuC_NnaA UDP-N-acet 97.7 0.0059 1.5E-07 39.0 17.5 241 59-320 84-345 (365) 97 COG1819 Glycosyl transferases, 97.7 0.00074 1.9E-08 45.1 9.8 161 163-347 234-401 (406) 98 TIGR01426 MGT glycosyltransfer 97.6 0.001 2.6E-08 44.1 10.6 166 160-345 253-427 (429) 99 pfam00343 Phosphorylase Carboh 97.4 0.0024 6E-08 41.7 9.4 133 161-295 439-594 (712) 100 pfam09314 DUF1972 Domain of un 97.3 0.018 4.6E-07 35.8 15.0 141 3-143 1-185 (185) 101 pfam04464 Glyphos_transf CDP-G 97.2 0.0039 9.8E-08 40.3 9.0 147 157-322 2-162 (186) 102 pfam08323 Glyco_transf_5 Starc 97.2 0.02 5.1E-07 35.4 12.5 30 16-45 13-42 (229) 103 pfam00201 UDPGT UDP-glucoronos 97.2 0.019 4.9E-07 35.5 12.2 80 257-338 349-434 (501) 104 COG3980 spsG Spore coat polysa 97.1 0.02 5E-07 35.5 12.0 288 5-326 2-306 (318) 105 cd03786 GT1_UDP-GlcNAc_2-Epime 97.1 0.027 6.9E-07 34.5 16.1 241 58-318 78-342 (363) 106 TIGR03609 S_layer_CsaB polysac 96.8 0.045 1.1E-06 33.1 11.5 257 8-283 2-284 (298) 107 PRK10017 putative pyruvyl tran 96.8 0.048 1.2E-06 32.9 21.6 231 85-328 142-407 (426) 108 KOG4626 consensus 96.7 0.015 3.9E-07 36.2 8.6 188 155-350 748-943 (966) 109 PRK02797 4-alpha-L-fucosyltran 96.5 0.069 1.8E-06 31.8 15.7 257 68-350 78-356 (358) 110 pfam06258 DUF1022 Protein of u 96.3 0.094 2.4E-06 30.9 11.5 236 19-279 2-256 (308) 111 pfam01075 Glyco_transf_9 Glyco 96.2 0.02 5.2E-07 35.4 6.7 115 155-281 97-216 (249) 112 pfam10933 DUF2827 Protein of u 96.1 0.12 3E-06 30.3 10.9 217 115-350 128-364 (364) 113 COG1817 Uncharacterized protei 94.7 0.34 8.7E-06 27.1 13.7 259 25-316 17-317 (346) 114 TIGR02919 TIGR02919 conserved 93.8 0.097 2.5E-06 30.8 4.2 130 185-328 299-436 (448) 115 pfam07429 Fuc4NAc_transf 4-alp 93.2 0.66 1.7E-05 25.2 13.8 217 68-299 80-317 (361) 116 PRK10422 lipopolysaccharide co 90.9 1.2 3.1E-05 23.5 16.7 250 2-275 4-287 (352) 117 PRK10964 ADP-heptose:LPS hepto 89.9 1.5 3.7E-05 22.9 7.2 141 157-312 170-321 (322) 118 PRK10916 ADP-heptose:LPS hepto 89.8 1.5 3.8E-05 22.8 8.6 115 153-276 167-287 (348) 119 pfam03016 Exostosin Exostosin 89.1 0.15 3.9E-06 29.5 1.0 68 238-307 216-286 (289) 120 KOG1021 consensus 88.7 1.8 4.5E-05 22.3 6.5 94 238-336 336-434 (464) 121 pfam05686 DUF821 Arabidopsis t 84.1 3 7.7E-05 20.8 7.3 53 298-350 269-324 (396) 122 KOG1192 consensus 83.0 3.3 8.5E-05 20.5 10.2 64 261-325 366-434 (496) 123 KOG0832 consensus 82.5 3.5 8.9E-05 20.4 6.7 140 115-299 78-220 (251) 124 smart00672 CAP10 Putative lipo 82.2 3.6 9.2E-05 20.3 8.2 82 265-349 164-253 (256) 125 PRK04020 rps2P 30S ribosomal p 80.2 4 0.0001 20.0 4.8 71 259-343 127-200 (204) 126 pfam12000 DUF3495 Domain of un 79.9 4.3 0.00011 19.8 7.5 81 65-147 63-171 (172) 127 COG4671 Predicted glycosyl tra 78.7 4.5 0.00012 19.6 4.7 140 163-314 217-366 (400) 128 pfam05159 Capsule_synth Capsul 78.2 1.4 3.5E-05 23.1 1.9 87 179-277 140-226 (268) 129 cd04300 GT1_Glycogen_Phosphory 78.0 4.9 0.00012 19.4 9.8 125 170-296 534-681 (797) 130 COG0859 RfaF ADP-heptose:LPS h 76.4 5.4 0.00014 19.1 8.1 202 54-276 67-277 (334) 131 TIGR01123 ilvE_II branched-cha 75.5 2.9 7.5E-05 20.9 3.0 46 262-313 267-313 (329) 132 COG2984 ABC-type uncharacteriz 74.0 2 5E-05 22.1 1.8 123 182-312 146-278 (322) 133 COG1927 Mtd Coenzyme F420-depe 73.0 6.5 0.00017 18.5 7.2 12 197-208 87-98 (277) 134 COG3660 Predicted nucleoside-d 73.0 6.5 0.00017 18.5 12.8 246 5-276 2-272 (329) 135 pfam08288 PIGA PIGA (GPI ancho 72.3 6.8 0.00017 18.4 4.7 47 55-103 37-85 (90) 136 cd01425 RPS2 Ribosomal protein 71.3 7.1 0.00018 18.3 4.8 112 165-285 55-167 (193) 137 COG2204 AtoC Response regulato 69.6 7.7 0.0002 18.1 4.0 15 28-42 21-35 (464) 138 pfam11997 DUF3492 Domain of un 69.4 7.8 0.0002 18.0 7.6 38 5-42 2-40 (268) 139 pfam10087 DUF2325 Uncharacteri 69.2 2.5 6.3E-05 21.4 1.4 70 214-283 12-87 (96) 140 PTZ00254 40S ribosomal protein 68.8 8.1 0.00021 17.9 6.0 27 259-285 131-158 (242) 141 pfam03401 Bug Tripartite trica 65.8 9.2 0.00023 17.5 9.5 177 166-352 78-272 (274) 142 KOG2884 consensus 65.5 9.3 0.00024 17.5 7.4 57 262-318 178-234 (259) 143 PRK10494 hypothetical protein; 64.5 9.7 0.00025 17.4 3.7 98 179-278 104-210 (259) 144 TIGR01011 rpsB_bact ribosomal 64.5 5.4 0.00014 19.1 2.4 45 249-303 162-207 (227) 145 TIGR01012 Sa_S2_E_A ribosomal 64.4 7.7 0.0002 18.0 3.2 93 236-342 100-194 (197) 146 cd06305 PBP1_methylthioribose_ 61.5 11 0.00028 17.0 4.4 36 5-42 1-36 (273) 147 cd03789 GT1_LPS_heptosyltransf 61.3 11 0.00028 17.0 14.0 99 168-278 124-226 (279) 148 TIGR00236 wecB UDP-N-acetylglu 59.8 12 0.0003 16.8 7.6 152 164-325 206-361 (380) 149 TIGR02360 pbenz_hydroxyl 4-hyd 59.2 11 0.00028 17.0 3.2 104 210-336 239-349 (393) 150 COG3563 KpsC Capsule polysacch 58.3 9.1 0.00023 17.6 2.6 12 266-277 489-500 (671) 151 PRK07726 DNA topoisomerase III 57.6 13 0.00032 16.6 4.3 83 267-351 514-599 (716) 152 COG0052 RpsB Ribosomal protein 55.8 14 0.00035 16.4 3.2 34 259-299 169-203 (252) 153 TIGR01163 rpe ribulose-phospha 55.7 10 0.00026 17.2 2.6 12 138-149 137-148 (216) 154 PRK05282 peptidase E; Validate 55.3 14 0.00035 16.4 3.1 23 22-44 18-40 (233) 155 TIGR01768 GGGP-family geranylg 55.2 13 0.00032 16.6 2.9 13 181-193 176-188 (242) 156 COG2327 WcaK Polysaccharide py 55.0 14 0.00036 16.3 20.1 224 84-319 117-357 (385) 157 TIGR01343 hacA_fam homoaconita 54.8 9.3 0.00024 17.5 2.2 62 142-203 271-344 (432) 158 PRK07220 DNA topoisomerase I; 52.0 16 0.0004 16.0 4.3 94 255-350 470-566 (740) 159 TIGR02195 heptsyl_trn_II lipop 52.0 16 0.0004 16.0 4.7 304 5-344 1-359 (361) 160 cd06313 PBP1_ABC_sugar_binding 51.1 16 0.00041 15.9 4.6 14 30-43 24-37 (272) 161 pfam04392 ABC_sub_bind ABC tra 50.0 17 0.00043 15.8 4.3 61 246-312 184-249 (292) 162 PRK02395 hypothetical protein; 49.3 17 0.00044 15.7 5.1 144 165-314 100-260 (284) 163 PRK10360 DNA-binding transcrip 49.1 17 0.00044 15.7 4.9 25 288-312 93-117 (196) 164 COG0550 TopA Topoisomerase IA 49.1 17 0.00044 15.7 4.7 99 252-352 450-554 (570) 165 PRK10610 chemotaxis regulatory 48.8 17 0.00044 15.7 4.6 45 268-312 79-125 (129) 166 pfam08660 Alg14 Oligosaccharid 48.7 18 0.00045 15.6 8.8 79 56-134 76-160 (166) 167 KOG3349 consensus 47.6 18 0.00045 15.6 2.7 94 168-275 6-106 (170) 168 COG4370 Uncharacterized protei 46.6 19 0.00048 15.4 4.7 154 176-342 238-408 (412) 169 TIGR02493 PFLA pyruvate format 44.9 20 0.00051 15.3 3.7 174 39-237 6-207 (243) 170 KOG2619 consensus 44.6 20 0.00051 15.2 6.6 59 265-327 273-333 (372) 171 PRK07219 DNA topoisomerase I; 44.2 20 0.00052 15.2 4.5 94 255-350 468-564 (769) 172 pfam09949 DUF2183 Uncharacteri 43.9 21 0.00053 15.2 2.8 32 185-216 52-83 (100) 173 TIGR02093 P_ylase glycogen/sta 43.7 21 0.00053 15.1 7.4 186 159-350 539-756 (822) 174 TIGR02201 heptsyl_trn_III lipo 43.5 21 0.00053 15.1 6.7 195 54-274 69-287 (347) 175 COG0062 Uncharacterized conser 43.0 21 0.00054 15.1 6.1 114 157-280 41-162 (203) 176 COG3959 Transketolase, N-termi 43.0 21 0.00054 15.1 3.2 69 192-264 137-210 (243) 177 TIGR01769 GGGP geranylgeranylg 42.7 21 0.00053 15.1 2.5 22 23-44 14-35 (212) 178 TIGR02030 BchI-ChlI magnesium 42.7 13 0.00033 16.6 1.4 108 169-281 137-264 (340) 179 pfam00290 Trp_syntA Tryptophan 42.2 22 0.00056 15.0 3.3 23 23-45 24-46 (258) 180 cd05566 PTS_IIB_galactitol PTS 41.9 15 0.00038 16.1 1.6 56 199-255 3-58 (89) 181 PRK00124 hypothetical protein; 41.9 18 0.00045 15.6 2.0 84 262-349 59-146 (149) 182 COG1887 TagB Putative glycosyl 41.5 22 0.00057 14.9 8.1 169 133-318 170-358 (388) 183 pfam02504 FA_synthesis Fatty a 40.6 23 0.00059 14.8 3.7 50 19-77 13-63 (322) 184 COG3414 SgaB Phosphotransferas 40.4 23 0.0006 14.8 2.8 55 199-254 4-58 (93) 185 TIGR02478 6PF1K_euk 6-phosphof 38.1 25 0.00064 14.6 3.7 144 182-338 227-385 (777) 186 PRK10643 DNA-binding transcrip 36.8 26 0.00067 14.4 4.7 18 25-42 14-31 (222) 187 cd06302 PBP1_LsrB_Quorum_Sensi 36.1 27 0.00069 14.4 2.7 88 181-273 169-262 (298) 188 PRK05927 hypothetical protein; 35.4 25 0.00063 14.6 1.9 135 158-315 195-337 (350) 189 PRK01909 pdxA 4-hydroxythreoni 35.0 28 0.00072 14.3 2.8 63 241-312 251-321 (329) 190 PRK10365 transcriptional regul 33.8 29 0.00075 14.1 5.8 51 298-349 317-369 (441) 191 PRK10310 galactitol-specific P 33.7 25 0.00064 14.6 1.7 55 199-255 5-60 (94) 192 cd05099 PTKc_FGFR4 Catalytic D 33.6 16 0.0004 16.0 0.6 90 244-350 215-306 (314) 193 TIGR02995 ectoine_ehuB ectoine 33.3 18 0.00046 15.6 0.9 122 178-317 127-260 (285) 194 PRK08445 hypothetical protein; 33.2 29 0.00074 14.1 2.0 40 158-200 192-232 (348) 195 cd05115 PTKc_Zap-70 Catalytic 33.0 12 0.00031 16.7 0.0 71 242-322 175-247 (257) 196 PRK11361 acetoacetate metaboli 32.2 31 0.0008 13.9 5.2 32 3-42 4-35 (457) 197 PRK09567 nirA ferredoxin-nitri 31.9 28 0.00072 14.2 1.7 48 171-227 379-434 (591) 198 smart00072 GuKc Guanylate kina 31.8 32 0.00081 13.9 2.7 62 257-318 80-144 (184) 199 PRK10834 hypothetical protein; 31.6 32 0.00081 13.9 2.8 92 185-276 70-168 (239) 200 PRK10710 DNA-binding transcrip 31.2 32 0.00082 13.8 5.2 19 25-43 24-42 (240) 201 PRK05889 putative acetyl-CoA c 31.2 22 0.00055 15.0 1.0 42 254-299 24-65 (71) 202 PRK10430 DNA-binding transcrip 30.8 33 0.00084 13.8 2.0 10 263-272 212-221 (239) 203 TIGR01250 pro_imino_pep_2 prol 30.2 34 0.00086 13.7 2.0 36 5-42 25-61 (302) 204 PRK08174 DNA topoisomerase III 30.1 34 0.00086 13.7 4.5 86 264-351 515-602 (670) 205 COG2949 SanA Uncharacterized m 30.1 34 0.00086 13.7 3.3 99 182-281 79-185 (235) 206 pfam01113 DapB_N Dihydrodipico 30.0 23 0.00059 14.8 1.0 45 237-281 57-101 (122) 207 PRK05654 acetyl-CoA carboxylas 29.8 28 0.00071 14.3 1.4 21 154-176 102-122 (288) 208 TIGR01658 EYA-cons_domain EYA 29.5 35 0.00088 13.7 2.9 53 186-248 262-316 (319) 209 KOG1588 consensus 29.1 33 0.00084 13.8 1.7 59 168-232 105-163 (259) 210 cd05066 PTKc_EphR_A Catalytic 29.1 25 0.00064 14.6 1.1 68 244-322 188-258 (267) 211 COG1618 Predicted nucleotide k 29.0 31 0.0008 13.9 1.5 73 239-313 94-177 (179) 212 TIGR00016 ackA acetate kinase; 28.6 36 0.00091 13.5 2.6 87 253-349 237-334 (416) 213 pfam00318 Ribosomal_S2 Ribosom 28.6 36 0.00091 13.5 3.5 27 259-285 150-177 (205) 214 cd06611 STKc_SLK_like Serine/t 28.4 19 0.00049 15.3 0.4 89 245-342 188-278 (280) 215 cd06325 PBP1_ABC_uncharacteriz 28.1 37 0.00093 13.5 7.0 67 210-276 145-218 (281) 216 cd06319 PBP1_ABC_sugar_binding 28.0 28 0.0007 14.3 1.1 55 172-235 164-218 (277) 217 cd01018 ZntC Metal binding pro 28.0 37 0.00093 13.5 4.6 21 288-308 216-236 (266) 218 TIGR02041 CysI sulfite reducta 28.0 37 0.00093 13.5 2.0 98 169-281 351-472 (550) 219 PRK05472 redox-sensing transcr 27.9 37 0.00094 13.5 2.2 40 169-208 56-95 (211) 220 cd05091 PTKc_Ror2 Catalytic Do 27.7 14 0.00036 16.2 -0.4 71 243-323 204-276 (283) 221 PRK13119 consensus 27.3 38 0.00096 13.4 3.7 13 215-227 135-147 (261) 222 cd01540 PBP1_arabinose_binding 27.0 19 0.00047 15.5 0.1 36 5-42 1-36 (289) 223 PRK06857 consensus 26.8 38 0.00098 13.3 1.9 19 234-252 92-110 (209) 224 cd06308 PBP1_sensor_kinase_lik 26.8 38 0.00098 13.3 2.7 44 186-234 171-214 (270) 225 cd05084 PTKc_Fes Catalytic Dom 26.6 16 0.00042 15.8 -0.3 70 243-322 174-245 (252) 226 TIGR02858 spore_III_AA stage I 26.5 20 0.00052 15.2 0.2 17 262-278 224-240 (282) 227 TIGR00644 recJ single-stranded 26.4 39 0.00099 13.3 4.9 28 114-143 229-256 (705) 228 cd05063 PTKc_EphR_A2 Catalytic 26.2 23 0.00059 14.8 0.4 70 242-322 187-259 (268) 229 COG3181 Uncharacterized protei 25.9 40 0.001 13.2 9.1 59 291-351 249-314 (319) 230 TIGR01949 AroFGH_arch predicte 25.5 40 0.001 13.2 3.0 46 266-313 190-250 (259) 231 PRK08225 acetyl-CoA carboxylas 25.5 27 0.00068 14.4 0.7 42 254-299 23-64 (70) 232 PRK05331 putative glycerol-3-p 25.1 41 0.0011 13.1 3.8 51 19-77 14-65 (317) 233 TIGR01825 gly_Cac_T_rel pyrido 25.0 41 0.0011 13.1 2.8 31 295-325 271-301 (392) 234 PRK10355 xylF D-xylose transpo 24.9 18 0.00045 15.6 -0.3 36 5-42 27-62 (330) 235 cd06311 PBP1_ABC_sugar_binding 24.7 42 0.0011 13.1 2.7 53 173-234 165-217 (274) 236 cd05060 PTKc_Syk_like Catalyti 24.7 18 0.00045 15.6 -0.4 70 244-323 177-248 (257) 237 cd05064 PTKc_EphR_A10 Catalyti 24.7 21 0.00055 15.0 0.1 69 244-322 187-257 (266) 238 KOG4131 consensus 24.5 42 0.0011 13.1 2.3 112 139-278 123-236 (272) 239 CHL00067 rps2 ribosomal protei 24.5 42 0.0011 13.1 2.7 34 259-299 170-204 (227) 240 cd05058 PTKc_Met_Ron Catalytic 24.2 21 0.00054 15.1 -0.0 69 244-322 181-251 (262) 241 PRK13435 response regulator; P 24.2 43 0.0011 13.0 3.6 44 268-313 73-117 (141) 242 cd05116 PTKc_Syk Catalytic Dom 24.2 19 0.00047 15.5 -0.3 72 242-323 175-248 (257) 243 cd05578 STKc_Yank1 STKc_Yank1: 23.9 25 0.00063 14.6 0.3 65 244-317 178-244 (258) 244 cd05071 PTKc_Src Catalytic Dom 23.9 38 0.00097 13.4 1.2 78 244-333 182-262 (262) 245 cd06324 PBP1_ABC_sugar_binding 23.9 28 0.00072 14.2 0.6 67 172-247 181-251 (305) 246 cd05044 PTKc_c-ros Catalytic D 23.7 25 0.00064 14.6 0.3 71 242-322 191-263 (270) 247 cd06316 PBP1_ABC_sugar_binding 23.7 44 0.0011 13.0 1.6 85 5-97 1-87 (294) 248 cd05074 PTKc_Tyro3 Catalytic D 23.6 26 0.00065 14.5 0.3 69 244-322 194-264 (273) 249 cd05050 PTKc_Musk Catalytic Do 23.6 23 0.00058 14.9 0.0 70 244-323 211-282 (288) 250 PRK05776 DNA topoisomerase III 23.6 44 0.0011 13.0 5.3 94 256-351 482-580 (675) 251 pfam04230 PS_pyruv_trans Polys 23.5 44 0.0011 12.9 8.4 36 236-276 220-255 (258) 252 pfam03949 Malic_M Malic enzyme 23.5 44 0.0011 12.9 2.3 17 291-307 229-246 (255) 253 cd06300 PBP1_ABC_sugar_binding 23.4 44 0.0011 12.9 4.5 30 5-36 1-30 (272) 254 PRK00192 mannosyl-3-phosphogly 23.1 45 0.0011 12.9 3.6 43 268-313 228-271 (275) 255 PRK07455 keto-hydroxyglutarate 23.0 41 0.001 13.2 1.2 19 234-252 93-111 (210) 256 COG2344 AT-rich DNA-binding pr 22.9 45 0.0012 12.9 1.9 40 169-208 56-95 (211) 257 cd05047 PTKc_Tie Catalytic Dom 22.8 25 0.00063 14.6 0.1 72 241-322 187-260 (270) 258 TIGR00700 GABAtrnsam 4-aminobu 22.7 46 0.0012 12.8 3.6 122 212-342 221-350 (427) 259 cd00133 PTS_IIB PTS_IIB: subun 22.5 46 0.0012 12.8 1.9 53 200-254 3-55 (84) 260 cd03146 GAT1_Peptidase_E Type 22.5 46 0.0012 12.8 3.9 18 262-279 171-190 (212) 261 PRK08444 hypothetical protein; 22.1 47 0.0012 12.8 2.3 38 159-199 200-238 (353) 262 PRK05299 rpsB 30S ribosomal pr 22.0 47 0.0012 12.7 5.9 34 259-299 170-204 (255) 263 PRK06846 deaminase; Validated 22.0 47 0.0012 12.7 3.1 93 181-276 204-311 (410) 264 cd06317 PBP1_ABC_sugar_binding 21.9 26 0.00066 14.5 0.0 16 27-42 22-37 (275) 265 COG1671 Uncharacterized protei 21.7 48 0.0012 12.7 4.0 20 266-285 64-83 (150) 266 pfam00308 Bac_DnaA Bacterial d 21.7 48 0.0012 12.7 5.6 85 261-349 122-213 (219) 267 TIGR01527 arch_NMN_Atrans nico 21.4 48 0.0012 12.7 1.6 71 170-252 3-76 (171) 268 cd05085 PTKc_Fer Catalytic Dom 21.3 24 0.0006 14.8 -0.3 72 242-323 171-244 (250) 269 TIGR00314 cdhA CO dehydrogenas 21.2 49 0.0012 12.6 4.1 72 212-283 251-345 (795) 270 CHL00174 accD acetyl-CoA carbo 21.2 49 0.0012 12.6 1.5 21 154-176 121-141 (305) 271 PRK09219 xanthine phosphoribos 20.9 36 0.00092 13.5 0.6 35 63-97 45-79 (189) 272 pfam10649 DUF2478 Protein of u 20.7 18 0.00047 15.5 -0.9 56 245-302 93-155 (159) 273 pfam01866 Diphthamide_syn Puta 20.5 50 0.0013 12.5 2.9 74 197-275 204-280 (300) 274 PRK08243 4-hydroxybenzoate 3-m 20.4 51 0.0013 12.5 2.5 71 262-336 272-346 (392) 275 cd05033 PTKc_EphR Catalytic Do 20.2 30 0.00075 14.1 0.0 69 242-321 185-256 (266) 276 cd05090 PTKc_Ror1 Catalytic Do 20.2 32 0.00081 13.9 0.2 70 244-323 205-276 (283) 277 PRK08782 consensus 20.2 51 0.0013 12.5 1.9 19 234-252 97-115 (219) 278 cd05046 PTK_CCK4 Catalytic Dom 20.1 28 0.00072 14.3 -0.1 70 244-322 197-268 (275) No 1 >cd03796 GT1_PIG-A_like This family is most closely related to the GT1 family of glycosyltransferases. Phosphatidylinositol glycan-class A (PIG-A), an X-linked gene in humans, is necessary for the synthesis of N-acetylglucosaminyl-phosphatidylinositol, a very early intermediate in glycosyl phosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is an important cellular structure that facilitates the attachment of many proteins to cell surfaces. Somatic mutations in PIG-A have been associated with Paroxysmal Nocturnal Hemoglobinuria (PNH), an acquired hematological disorder. Probab=100.00 E-value=0 Score=403.18 Aligned_cols=330 Identities=20% Similarity=0.270 Sum_probs=267.7 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCC--------------CCC------CHHH----HHHH Q ss_conf 458970787878770778888999999971985999805886656--------------234------7899----8998 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNI--------------PSI------GISS----LLTC 60 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~------~~~~----~~~~ 60 (352) +|.++.+.|.+..+|++.++.++++.+.+.||++.++++..+... |.. .+.. +..+ T Consensus 1 RI~ivt~~f~P~iGG~e~~v~~La~~L~~~Gh~V~Vit~~~~~~~~~~~~~~g~~V~~~p~~~~~~~~~~~~~~~~~~~~ 80 (398) T cd03796 1 RICMVSDFFYPNLGGVETHIYQLSQCLIKRGHKVVVITHAYGNRVGIRYLTNGLKVYYLPFVVFYNQSTLPTFFGTFPLL 80 (398) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEECCCCEEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 98999488799999779999999999997699899996899988774685388469975663345631167798889999 Q ss_pred HHHHHHCCCEEEEECCCHHHHH--HHHHHHHHCCCEEEEECCHH------HHHHHHHHHHHHHHCCEEEECCHHHHHHC- Q ss_conf 8754325970899628546899--99999970998199975745------62011347989962668998898999743- Q gi|255764474|r 61 WKKPIGQNSRIWHARRNNEMLL--GVMMRDVLRMPLKLVFTSPS------QRNHSRWTRYLISRMDEVITTSQKSARFI- 131 (352) Q Consensus 61 ~~~~~~~~~~ivh~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~d~ii~~s~~~~~~~- 131 (352) .+.....++||+|+|.....+. +++.+...+.|+.++.|+.. ......+.++.++.+|+++++|+..++.. T Consensus 81 r~~~~~~~~DIIH~H~~~~~l~~~~~~~ar~~g~~~V~T~H~~~~~~~~~~~~~~~~~~~~l~~~d~vIavS~~~~e~~~ 160 (398) T cd03796 81 RNILIRERITIVHGHQAFSALAHEALLHARTMGLKTVFTDHSLFGFADASSIHTNKLLRFSLADVDHVICVSHTSKENTV 160 (398) T ss_pred HHHHHHCCCCEEEECCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHH T ss_conf 99997679988998962688999999998755997899834432446314999999999999857999999779999999 Q ss_pred ------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf ------68839974882777823784326878872889887279995122644512366654565300477405886313 Q gi|255764474|r 132 ------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGK 205 (352) Q Consensus 132 ------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~ 205 (352) +.++.|||||||.+.|.|.+... ..+...|+|+||+.++||++.+++|++.+.+++|+++|+|+|+ T Consensus 161 ~~~~~~~~ki~vIpNGVd~~~f~p~~~~~--------~~~~~~il~vGRL~~~KG~d~Li~A~~~l~~~~p~~~lvIvGd 232 (398) T cd03796 161 LRASLDPERVSVIPNAVDSSDFTPDPSKR--------DNDKITIVVISRLVYRKGIDLLVGIIPEICKKHPNVRFIIGGD 232 (398) T ss_pred HHHCCCCCCEEEECCCCCHHHCCCCCCCC--------CCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 98489941099988957476448872215--------8898699997067503009999999999996589959999378 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHH Q ss_conf 3321000146677644311233221222222--33100000000000123332222221000001001110278841010 Q gi|255764474|r 206 TTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSEL 283 (352) Q Consensus 206 g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~ 283 (352) |+.+ ++|++++++++++++|.|+|++ +++..+|+.||+|++||++||||++++||||||+|||+|++||++|+ T Consensus 233 Gp~~-----~~L~~l~~~~~l~~~V~flG~v~~~~l~~~~~~aDvfv~PS~~Egfglv~lEAmA~G~PVVat~vgG~~Ev 307 (398) T cd03796 233 GPKR-----ILLEEMREKYNLQDRVELLGAVPHERVRDVLVQGHIFLNTSLTEAFCIAIVEAASCGLLVVSTRVGGIPEV 307 (398) T ss_pred CCCH-----HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHEEECCCCCCCCCHHHHHHHHCCCCEEECCCCCCCCE T ss_conf 7118-----99999998723367289758885677788887744212765424666799999983998998889986113 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 02348842499659998999999999986989999999999999998299899999999999998860 Q gi|255764474|r 284 LDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 284 i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) |. +|.+ ++.+.|++++|++|.++++|++.+++|+++||+++.++|||++++++++++|+++|+. T Consensus 308 v~--~~~~--~~~~~d~~~la~~l~~ll~~~~~~~~~~~~~r~~v~~~fsw~~ia~~~~~vY~~vl~~ 371 (398) T cd03796 308 LP--PDMI--LLAEPDVESIVRKLEEAISILRTGKHDPWSFHNRVKKMYSWEDVAKRTEKVYDRILQT 371 (398) T ss_pred EE--CCCE--EECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCC T ss_conf 41--8936--8748999999999999976999999999999999999699999999999999999818 No 2 >cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light. Probab=100.00 E-value=0 Score=379.50 Aligned_cols=337 Identities=25% Similarity=0.353 Sum_probs=273.0 Q ss_pred HEEEECCCCCCC-------CCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC---------------CCC-----CCH--- Q ss_conf 458970787878-------77077888899999997198599980588665---------------623-----478--- Q gi|255764474|r 5 NIDVIAPNMKFR-------HTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN---------------IPS-----IGI--- 54 (352) Q Consensus 5 ~i~~i~~~~~~~-------~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~-----~~~--- 54 (352) ||.+|.....+- .+|++.++.++++.+.+.||++.+++...... ++. +.. T Consensus 1 ~~a~~~~~~~p~~~~g~~d~GG~e~~v~~La~~L~~~GH~V~V~t~~~~~~~~~~~~~~~gv~v~rl~~~~~~~~~~~~l 80 (398) T cd03800 1 RIALISLHGSPLAQPGGADTGGQNVYVLELARALARLGHEVDIFTRRIDDALPPIVELAPGVRVVRVPAGPAEYLPKEEL 80 (398) T ss_pred CEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEECCCEEEEEECCCCCCCCCHHHH T ss_conf 97999777686546899996887999999999999869969999724777888806824986999955788543327777 Q ss_pred H----HH-HHHHHH--HHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHH--HH-------HH-----HHHHHH Q ss_conf 9----98-998875--4325970899628546899999999709981999757456--20-------11-----347989 Q gi|255764474|r 55 S----SL-LTCWKK--PIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQ--RN-------HS-----RWTRYL 113 (352) Q Consensus 55 ~----~~-~~~~~~--~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-------~~-----~~~~~l 113 (352) + .+ ..+.+. ....++||+|+|.+...+.+..++...+.|+++++|.... +. .. ...++. T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~pDvIH~h~~~~~~~~~~~~~~~~ip~V~t~H~l~~~~~~~~~~~~~~~~~~r~~~e~~~ 160 (398) T cd03800 81 WPYLDEFADDLLRFLRREGGRPDLIHAHYWDSGLVALLLARRLGIPLVHTFHSLGAVKRRHLGAADTYEPARRIEAEERL 160 (398) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 87899999999999998389988899888407899999999719999996375114443320235542347899999999 Q ss_pred HHHCCEEEECCHHHHHHC-------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHH Q ss_conf 962668998898999743-------6883997488277782378432687887288988727999512264451236665 Q gi|255764474|r 114 ISRMDEVITTSQKSARFI-------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVD 186 (352) Q Consensus 114 ~~~~d~ii~~s~~~~~~~-------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~ 186 (352) ++++|.++++|+..++.+ ..++.|||||||.+.|.|.......+..++++.+.++|+|+||+.++||++.+++ T Consensus 161 ~~~ad~via~S~~~~~~~~~~~~~~~~~i~vI~nGvd~~~f~p~~~~~~~~~~~~~~~~~~~i~~vGRl~~~Kg~~~li~ 240 (398) T cd03800 161 LRAADRVIASTPQEAEELYSLYGAYPRRIRVVPPGVDLERFTPYGRAEARRARLLRDPDKPRILAVGRLDPRKGIDTLIR 240 (398) T ss_pred HHHCCEEEECCHHHHHHHHHHHCCCCCCEEEECCCCCHHHCCCCCCHHHHHHHHCCCCCCCEEEEECCCCHHCCHHHHHH T ss_conf 98499999879999999999729990228997698677443889805899986089989948999828960209899999 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCC-CCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCC Q ss_conf 456530047740588631333210-00146677644311233221222222--331000000000001233322222210 Q gi|255764474|r 187 CMINILPHHPGWTAVVVGKTTLKH-YLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPL 263 (352) Q Consensus 187 a~~~l~~~~~~~~l~i~G~g~~~~-~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~l 263 (352) |++.+.+..++++++|+|+|+... ..+.++++++++++++.++|.|+|++ +++..+|+.||+|++||++||||++++ T Consensus 241 A~~~l~~~~~~~~l~ivGg~~~~~~~~~~~~~~~~~~~~~l~~~V~f~G~~~~~~~~~~~~~adv~v~pS~~E~fgl~~l 320 (398) T cd03800 241 AYAELPELRERANLVIVGGPRDDILAMDEEELRELARELGVIDRVDFPGRVSREDLPALYRAADVFVNPALYEPFGLTAL 320 (398) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCEEEECCCCCCHHHHHH T ss_conf 99998877899689999688765313459999999997599874998898998999999985788875451332214899 Q ss_pred CCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 00001001110278841010023488424996599989999999999869899999999999999982998999999999 Q gi|255764474|r 264 EAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 264 EAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) ||||||+|||+|++||++|+|. +|++|++++++|+++|+++|.++++|++++++|+++||+++.++|||+++++++++ T Consensus 321 EAma~G~PvIas~~gg~~e~v~--~g~~G~l~~~~d~~~la~ai~~ll~d~~~~~~~g~~ar~~~~~~fsw~~~a~~~le 398 (398) T cd03800 321 EAMACGLPVVATAVGGPRDIVV--DGVTGLLVDPRDPEALAAALRRLLTDPALRRRLSRAGLRRARARYTWERVAARLLE 398 (398) T ss_pred HHHHCCCCEEECCCCCCHHHHC--CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf 9998299999938998077741--79718997899999999999999779999999999999999986899999999749 No 3 >TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species. Probab=100.00 E-value=0 Score=379.74 Aligned_cols=331 Identities=21% Similarity=0.336 Sum_probs=264.7 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC----------CC--C--------CC---HHH--------HHHHHHHHH Q ss_conf 77077888899999997198599980588665----------62--3--------47---899--------899887543 Q gi|255764474|r 17 HTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN----------IP--S--------IG---ISS--------LLTCWKKPI 65 (352) Q Consensus 17 ~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~----------~~--~--------~~---~~~--------~~~~~~~~~ 65 (352) .+|++.++.++++.+.+.||++.+++...... +. . +. ... ++..+.... T Consensus 19 ~GG~e~~v~~La~~La~rGheV~V~t~~~~~~~~~~~~~~~gv~v~~~~~~p~~~~~~~~l~~~l~~~~~~~l~~~~~~~ 98 (405) T TIGR03449 19 AGGMNVYILETATELARRGIEVDIFTRATRPSQPPVVEVAPGVRVRNVVAGPYEGLDKEDLPTQLCAFTGGVLRAEARHE 98 (405) T ss_pred CCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 58869999999999997899699993588877888467049849998257862324566769999999999999999856 Q ss_pred HCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHH--------------HHHHHHHHHHHCCEEEECCHHHHHHC Q ss_conf 2597089962854689999999970998199975745620--------------11347989962668998898999743 Q gi|255764474|r 66 GQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRN--------------HSRWTRYLISRMDEVITTSQKSARFI 131 (352) Q Consensus 66 ~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~l~~~~d~ii~~s~~~~~~~ 131 (352) ...+|++|+|.+.....+...+...+.|+..+.|...... .....+++++.+|.++++|+..++.+ T Consensus 99 ~~~~DvIH~h~~~~~~~~~~~~~~~~iP~V~t~H~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ad~ii~~s~~~~~~l 178 (405) T TIGR03449 99 PGYYDLIHSHYWLSGQVGWLLRDRWGVPLVHTAHTLAAVKNAALADGDTPEPEARRIGEQQLVDNADRLIANTDEEARDL 178 (405) T ss_pred CCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCHHHHCCHHHHCCCCCCHHHHHHHHHHHHHHCCEEEEECHHHHHHH T ss_conf 89976899887107899999998649998998144143131244326664419999999999974899999578999999 Q ss_pred -------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCE--EEEE Q ss_conf -------6883997488277782378432687887288988727999512264451236665456530047740--5886 Q gi|255764474|r 132 -------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGW--TAVV 202 (352) Q Consensus 132 -------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~--~l~i 202 (352) ..++.|||||+|.+.|.|. ++...+.+++++.+.++|+|+||+.++||++.+++|+.++.++.|+. .+++ T Consensus 179 ~~~~~~~~~ki~vi~nGvd~~~f~p~-~~~~~r~~~g~~~~~~~il~vGRl~~~Kg~~~li~A~~~l~~~~p~~~l~~~v 257 (405) T TIGR03449 179 VRHYDADPDRIDVVAPGADLERFRPG-DRATERARLGLPLDTKVVAFVGRIQPLKAPDVLLRAVAELLDRDPDRNLRVIV 257 (405) T ss_pred HHHHCCCHHHEEEECCCCCCCEECCC-CHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 99849886788997799770306888-85899997198989818999558850114899999999999868998789999 Q ss_pred ECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC Q ss_conf 3133321000146677644311233221222222--33100000000000123332222221000001001110278841 Q gi|255764474|r 203 VGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF 280 (352) Q Consensus 203 ~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~ 280 (352) +|..........+.++++++++++.++|.|+|.+ +++..+|++||+|++||++||||++++||||||+|||+|++||+ T Consensus 258 ~Gg~~g~~~~~~~~l~~~~~~lgl~~~V~f~G~~~~~~~~~~~~~adv~v~PS~~E~fg~~~lEAma~G~PVVas~~gg~ 337 (405) T TIGR03449 258 VGGPSGSGLATPDALIELAAELGIADRVRFLPPRPPEELVHVYRAADVVAVPSYNESFGLVAMEAQACGTPVVAARVGGL 337 (405) T ss_pred EECCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCEEECCCCCCCCCHHHHHHHHCCCCEEEECCCCC T ss_conf 83888753656999999999828887598679988999999999578763556667888479999986999999179986 Q ss_pred CHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 01002348842499659998999999999986989999999999999998299899999999999998860 Q gi|255764474|r 281 SELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 281 ~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) +|+|. +|.+|++++++|+++||++|.++++|++++++|+++|++++ ++|||+.++++++++|+++|+. T Consensus 338 ~e~v~--~g~~G~lv~~~d~~~la~ai~~ll~d~~~~~~l~~~~~~~~-~~fsw~~~a~~~~~vY~~~L~a 405 (405) T TIGR03449 338 PVAVA--DGETGLLVDGHDPADWADALARLLDDPRTRIRMGAAAVEHA-AGFSWAATADGLLSSYRDALAA 405 (405) T ss_pred CEEEE--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHHHHHCC T ss_conf 11253--79738997989999999999999759999999999999999-9699999999999999999666 No 4 >cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=100.00 E-value=0 Score=377.45 Aligned_cols=332 Identities=21% Similarity=0.339 Sum_probs=269.8 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC--------------C--------CCCCHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665--------------6--------2347899899887 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN--------------I--------PSIGISSLLTCWK 62 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~--------------~--------~~~~~~~~~~~~~ 62 (352) +|.++. .+..+|++.++.++++.+.+.||++.+++...|.. . +...+.....+.+ T Consensus 2 kI~i~~---~P~~GG~e~~v~~La~~L~~~GHeV~vit~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~l~~ 78 (371) T cd04962 2 KIGIVC---YPTYGGSGVVATELGKALARRGHEVHFITSSRPFRLDEYSPNIFFHEVEVPQYPLFQYPPYDLALASKIAE 78 (371) T ss_pred EEEEEC---CCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 799989---99998699999999999997599999995689876555689737998468776534467213789999999 Q ss_pred HHHHCCCEEEEECCCHH-HHHHHHHHH---HHCCCEEEEECCHH------HHHHHHHHHHHHHHCCEEEECCHHHHHHC- Q ss_conf 54325970899628546-899999999---70998199975745------62011347989962668998898999743- Q gi|255764474|r 63 KPIGQNSRIWHARRNNE-MLLGVMMRD---VLRMPLKLVFTSPS------QRNHSRWTRYLISRMDEVITTSQKSARFI- 131 (352) Q Consensus 63 ~~~~~~~~ivh~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~d~ii~~s~~~~~~~- 131 (352) .....++||+|+|...+ .+.+++... ..+.|+..++|... ......+.+|.+.++|.++++|+..++++ T Consensus 79 ~~~~~~~DvvH~h~~~p~~~~~~l~~~~~~~~~~~~v~t~H~~~~~~~~~~~~~~~~~~~~~~~a~~vi~~S~~~~~~~~ 158 (371) T cd04962 79 VAKRYKLDLLHVHYAVPHAVAAYLAREILGKKDLPVVTTLHGTDITLVGQDPSFQPATRFSIEKSDGVTAVSESLRQETY 158 (371) T ss_pred HHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHCCCHHHHHHHHHHHHHCCEEEECCHHHHHHHH T ss_conf 99973998899725542679999999986447997899937985564214745668999999758989989999999999 Q ss_pred -----CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf -----688399748827778237843268788728898872799951226445123666545653004774058863133 Q gi|255764474|r 132 -----ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKT 206 (352) Q Consensus 132 -----~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g 206 (352) ..++.+||||+|.+.|.|.+ ..+.++.++.+.+.++++++||+.++||++.+++|+.++.++.| .+++++|+| T Consensus 159 ~~~~~~~~i~vI~Ngvd~~~f~~~~-~~~~~~~~~~~~~~~~i~~vgrl~~~Kg~~~ll~a~~~l~~~~~-~~l~ivG~G 236 (371) T cd04962 159 ELFDITKEIEVIPNFVDEDRFRPKP-DEALKRRLGAPEGEKVLIHISNFRPVKRIDDVIRIFAKVRKEVP-ARLLLVGDG 236 (371) T ss_pred HHCCCCCCEEEECCCCCHHHCCCCC-CHHHHHHHCCCCCCEEEEEECCCCHHCCHHHHHHHHHHHHHCCC-EEEEEECCC T ss_conf 9609998889987975732148885-07899970999898599994265021496999999998630576-599998264 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 32100014667764431123322122222233100000000000123332222221000001001110278841010023 Q gi|255764474|r 207 TLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 207 ~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) +. .++++++++++++.++|.|+|.++++..+|+.||++++||++||||++++||||||+|||+||+||++|+|. T Consensus 237 ~~-----~~~~~~~~~~~~l~~~V~f~G~~~~~~~~~~~adi~v~pS~~E~fg~vilEAma~G~PvIat~~gg~~e~v~- 310 (371) T cd04962 237 PE-----RSPAERLARELGLQDDVLFLGKQDHVEELLSIADLFLLPSEKESFGLAALEAMACGVPVVASNAGGIPEVVK- 310 (371) T ss_pred HH-----HHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHC- T ss_conf 03-----778888887621031366327365599999855111387324432025999995699499868998389970- Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 48842499659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 287 ENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 287 ~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) +|.+|++++++|+++++++|.++++||+++++|+++|++++.++|||++++++++++|++| T Consensus 311 -~g~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~iY~~~ 371 (371) T cd04962 311 -HGETGFLVDVGDVEAMAEYALSLLEDDELWQEFSRAARNRAAERFDSERIVPQYEALYRRL 371 (371) T ss_pred -CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf -8971899789999999999999976999999999999999998689999999999998379 No 5 >cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=100.00 E-value=0 Score=370.58 Aligned_cols=326 Identities=22% Similarity=0.319 Sum_probs=263.0 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC--------------CC---C--CCHHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665--------------62---3--47899899887543 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN--------------IP---S--IGISSLLTCWKKPI 65 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~--------------~~---~--~~~~~~~~~~~~~~ 65 (352) ||.+|.+.|.+..+|++.++.++++.+.+.||++.++++..... .+ . ........+.+... T Consensus 1 kIl~i~~~f~P~~GG~e~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (364) T cd03814 1 RIAIVTDTFLPQVNGVVRTLQRLVEHLRARGHEVLVIAPGPFRESEGPARVVPVPSVPLPGYPEIRLALPPRRRVRRLLD 80 (364) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCEEEEEEEECCCCCCCEECCCHHHHHHHHHH T ss_conf 98999388899988499999999999997799899997899765555663467866746888763002032999999998 Q ss_pred HCCCEEEEECCCHH-HHHHHHHHHHHCCCEEEEECCHHH------------HHHHHHHHHHHHHCCEEEECCHHHHHHC- Q ss_conf 25970899628546-899999999709981999757456------------2011347989962668998898999743- Q gi|255764474|r 66 GQNSRIWHARRNNE-MLLGVMMRDVLRMPLKLVFTSPSQ------------RNHSRWTRYLISRMDEVITTSQKSARFI- 131 (352) Q Consensus 66 ~~~~~ivh~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~l~~~~d~ii~~s~~~~~~~- 131 (352) ..++|++|+|.... .+.+...+...+.|+..++|.... .....+.+|+++.+|.++++|+..++.+ T Consensus 81 ~~~pDiIh~~~~~~~~~~a~~~~~~~~ip~i~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~ 160 (364) T cd03814 81 AFAPDVVHIATPGPLGLAALRAARRLGIPVVTSYHTDFPEYLRYYGLGPLSWLAWAYLRWFHNRADRVLVPSPSLADELR 160 (364) T ss_pred HCCCCEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHH T ss_conf 65999999878416789999999975997899974764888776032056899999999998507999989999999998 Q ss_pred ---CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf ---68839974882777823784326878872889887279995122644512366654565300477405886313332 Q gi|255764474|r 132 ---ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTL 208 (352) Q Consensus 132 ---~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~ 208 (352) ..++.+||||+|.+.|.|.+.....+...+ ..+.+.++|+||+.++||++.+++|+..+.+ .++++++++|+|+. T Consensus 161 ~~~~~~~~vi~nGvd~~~f~p~~~~~~~~~~~~-~~~~~~i~~vGrl~~~Kg~~~ll~a~~~l~~-~~~~~l~ivG~G~~ 238 (364) T cd03814 161 ARGFRRVRLWPRGVDTELFHPRRRDEALRARLG-PPDRPVLLYVGRLAPEKNLEALLDADLPLRR-RPPVRLVIVGDGPA 238 (364) T ss_pred HHCCCCEEEECCCCCHHHCCCCCCCHHHHHHCC-CCCCCEEEEEECCCCCCCHHHHHHHHHHHHC-CCCEEEEEECCCCC T ss_conf 509988899689616988487543266653026-8998389996457555789999999997300-58859999847633 Q ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 10001466776443112332212222223--3100000000000123332222221000001001110278841010023 Q gi|255764474|r 209 KHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 209 ~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) + +++++. .++|.|+|.++ ++..+|++||+|++||.+||||++++||||||+|||+|++||++|+|. T Consensus 239 ~-----~~l~~~------~~~v~f~G~~~~~el~~~~~~adi~v~pS~~E~fg~~~lEAma~G~PvI~s~~gg~~Eiv~- 306 (364) T cd03814 239 R-----ARLEAR------YPNVHFLGFLDGEELAAAYASADVFVFPSRTETFGLVVLEAMASGLPVVAPDAGGPADIVT- 306 (364) T ss_pred H-----HHHHHC------CCCEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCHHHHHC- T ss_conf 9-----999851------8987990789989999999824756788653457657999998399899958997488831- Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 287 ENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 287 ~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) ++++|++++++|+++|+++|.++++||+++++||++||+++ ++|||++++++++++|+ T Consensus 307 -~~~~G~l~~~~d~~~la~~i~~l~~~~~~~~~mg~~ar~~~-~~~~w~~~~~~~~~~Y~ 364 (364) T cd03814 307 -DGENGLLVEPGDAEAFAAALAALLADPELRRRMAARARAEA-ERRSWEAFLDNLLEAYR 364 (364) T ss_pred -CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHCC T ss_conf -79828997999999999999999769999999999999999-96899999999999759 No 6 >cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol. Probab=100.00 E-value=0 Score=366.78 Aligned_cols=331 Identities=26% Similarity=0.430 Sum_probs=267.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCC--------------------CCHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665623--------------------4789989988754 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPS--------------------IGISSLLTCWKKP 64 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~ 64 (352) +|.+|++.+.+..+|++.++.++++.+.+.||++.++++..+..... ........+.+.. T Consensus 1 kIaivt~~y~P~~GG~~~~~~~La~~L~~~GheV~Vit~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (374) T cd03817 1 KIGIFTDTYLPQVNGVATSIRRLAEELEKRGHEVYVVAPSYPGAPEEEEVVVVRPFRVPTFKYPDFRLPLPIPRALIIIL 80 (374) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98999589899998099999999999997799899997279887754357628984367776521345555799999999 Q ss_pred HHCCCEEEEECCCHH-HHHHHHHHHHHCCCEEEEECCHHH---------------HHHHHHHHHHHHHCCEEEECCHHHH Q ss_conf 325970899628546-899999999709981999757456---------------2011347989962668998898999 Q gi|255764474|r 65 IGQNSRIWHARRNNE-MLLGVMMRDVLRMPLKLVFTSPSQ---------------RNHSRWTRYLISRMDEVITTSQKSA 128 (352) Q Consensus 65 ~~~~~~ivh~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~l~~~~d~ii~~s~~~~ 128 (352) ...++|++|+|.... ...+...+...+.|++.++|.... .....+.+++++++|.++++|+..+ T Consensus 81 ~~~~~DvIh~~~~~~~~~~a~~~~~~~~ip~V~t~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~ 160 (374) T cd03817 81 KELGPDIVHTHTPFSLGLLGLRVARKLGIPVVATYHTMYEDYTHYVPLGRLLARAVVRRKLSRRFYNRCDAVIAPSEKIA 160 (374) T ss_pred HHCCCCEEEECCHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCEEEECHHHHH T ss_conf 86699999987825889999999997499599995577799887631110135678999999999985999997809999 Q ss_pred HHC-----CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 743-----688399748827778237843268788728898872799951226445123666545653004774058863 Q gi|255764474|r 129 RFI-----ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVV 203 (352) Q Consensus 129 ~~~-----~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~ 203 (352) +++ ..++.+||||+|.+.|.+.+ ....+...+++++.+.++|+||+.++||++.+|+|+.++.++.|+++|+++ T Consensus 161 ~~l~~~~~~~~i~vI~ngvd~~~f~~~~-~~~~~~~~~~~~~~~~i~~~Grl~~~Kg~~~li~a~~~l~~~~~~~~l~iv 239 (374) T cd03817 161 DLLREYGVKRPIEVIPTGIDLDRFEPVD-GDDERRKLGIPEDEPVLLYVGRLAKEKNIDFLIRAFARLLKEEPDVKLVIV 239 (374) T ss_pred HHHHHCCCCCCEEEECCCCCHHHCCCCC-HHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 9999708999889986960666439864-178999818999984999970575421078999999988741898489998 Q ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 133321000146677644311233221222222--331000000000001233322222210000010011102788410 Q gi|255764474|r 204 GKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS 281 (352) Q Consensus 204 G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~ 281 (352) |+|+.+ ++++++++++++.++|.|+|++ +++..+|++||++++||.+||||++++||||||+|||+|++||++ T Consensus 240 G~G~~~-----~~l~~~~~~~~l~~~V~f~G~v~~~~~~~~l~~adi~v~pS~~E~fg~~~~EAma~G~PvI~s~~gg~~ 314 (374) T cd03817 240 GDGPER-----EELEELARELGLADRVIFTGFVPREELPDYYKAADLFVFASTTETQGLVLLEAMAAGLPVVAVDAPGLP 314 (374) T ss_pred CCCHHH-----HHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCEEEECCCCHH T ss_conf 774476-----556788887424662443588756677876442475447776657759999999819989991799759 Q ss_pred HHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 1002348842499659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 282 ELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 282 e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) |+|. +|.+|++++++| ++++++|.++++|++++++|+++|++++ ++|||++ ..+++|+++ T Consensus 315 e~i~--~g~~G~l~~~~~-~~l~~~i~~l~~~~~~~~~~~~~a~~~a-~~f~~~~---~~~~~~~~~ 374 (374) T cd03817 315 DLVA--DGENGFLFPPGD-EALAEALLRLLQDPELRRRLSKNAEESA-EKFSFAK---KVEKLYEEV 374 (374) T ss_pred HHHH--CCCEEEEECCCH-HHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHH---HHHHHHHHC T ss_conf 9964--798599969786-9999999999759999999999999999-9789999---999998539 No 7 >cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor. Probab=100.00 E-value=0 Score=370.09 Aligned_cols=319 Identities=20% Similarity=0.344 Sum_probs=262.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCC-------------CC-----CHHHHHHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058866562-------------34-----78998998875432 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIP-------------SI-----GISSLLTCWKKPIG 66 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~-----~~~~~~~~~~~~~~ 66 (352) +|.+|.+++.+ .+ +.++.++++.+.+.||++.+++...+.... .. .......+.+.... T Consensus 1 ki~~v~~~~P~-~~--etfv~~la~~L~~~GHeV~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~ 77 (355) T cd03799 1 KIAYLVKEFPR-LS--ETFILREILALEAAGHEVEIFSLRPPEDTLVHPEDRAELARTRYLARSLALLAQALVLARELRR 77 (355) T ss_pred CEEEECCCCCC-CC--CHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98999898989-96--1799999999996798499995348877730643021215521547779999999999999977 Q ss_pred CCCEEEEECC-CHHHHHHHHHHHHHCCCEEEEECCHHH--HHHHHHHHHHHHHCCEEEECCHHHHHHC-------CCCEE Q ss_conf 5970899628-546899999999709981999757456--2011347989962668998898999743-------68839 Q gi|255764474|r 67 QNSRIWHARR-NNEMLLGVMMRDVLRMPLKLVFTSPSQ--RNHSRWTRYLISRMDEVITTSQKSARFI-------ERPST 136 (352) Q Consensus 67 ~~~~ivh~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~d~ii~~s~~~~~~~-------~~~~~ 136 (352) .++|++|+|+ ......+.+++...+.|+.+++|.+.. .......+..++++|.++++|+..++.+ ..++. T Consensus 78 ~~~DiIH~H~~~~~~~~~~~~~~~~~ip~v~t~Hg~d~~~~~~~~~l~~~~~~ad~ii~vS~~~~~~l~~~~~~~~~ki~ 157 (355) T cd03799 78 LGIDHIHAHFGTTPATVAMLASRLGGIPYSFTAHGKDIFRSPDAIDLDEKLARADFVVAISEYNRQQLIRLLGCDPDKIH 157 (355) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCHHHEE T ss_conf 79989997688337999999999749999999816765567368999999983999999899999999986099914689 Q ss_pred EECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHH Q ss_conf 97488277782378432687887288988727999512264451236665456530047740588631333210001466 Q gi|255764474|r 137 VIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKN 216 (352) Q Consensus 137 vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~ 216 (352) +||||+|++.|.+.+... +.+.+.|+|+||+.++||++.+++|+.++.++.|+++++++|+|+.. ++ T Consensus 158 vi~ngvd~~~f~~~~~~~--------~~~~~~il~vGrl~~~Kg~~~li~A~~~l~~~~~~~~l~ivG~G~~~-----~~ 224 (355) T cd03799 158 VVHCGVDLERFPPRPPPP--------PGEPLRILSVGRLVEKKGLDYLLEALALLKDRGIDFRLDIVGDGPLR-----DE 224 (355) T ss_pred EECCCCCHHHCCCCCCCC--------CCCCEEEEEEECCCHHCCCHHHHHHHHHHHHHCCCEEEEEECCCHHH-----HH T ss_conf 989964888879987557--------78986999981580211909999999999864999799996676048-----89 Q ss_pred HHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCC------CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCC Q ss_conf 77644311233221222222--3310000000000012333------222222100000100111027884101002348 Q gi|255764474|r 217 LQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLY------EGFGLTPLEAMASGIPVVASNTGVFSELLDPEN 288 (352) Q Consensus 217 l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~------Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~ 288 (352) +++++++++++++|+|+|++ +++..+|+.||+|++||++ ||||++++||||||+|||+|++||++|+|. + T Consensus 225 l~~~~~~l~l~~~V~f~G~v~~~~l~~~~~~adv~v~pS~~~~~~~~Eg~p~~~lEAma~G~PvVas~~~g~~e~v~--~ 302 (355) T cd03799 225 LEALIAELGLEDRVTLLGAKSQEEVRELLRAADLFVLPSVTAADGDREGLPVVLMEAMAMGLPVISTDVSGIPELVE--D 302 (355) T ss_pred HHHHHHHCCCCCCEEECCCCCCHHHHHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCCHHHHC--C T ss_conf 99999974998550764444647679999749899984523356776687779999996699899917998576622--8 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 84249965999899999999998698999999999999999829989999999 Q gi|255764474|r 289 AKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 289 g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) |.+|++++++|+++|+++|.++++|++++++|+++|++++.++|||+++++++ T Consensus 303 g~~G~l~~~~d~~~la~~i~~ll~d~~~~~~~~~~ar~~v~~~fs~~~~a~rL 355 (355) T cd03799 303 GETGLLVPPGDPEALADAIERLLDDPELRREMGEAGRARVEEEFDIRKQAARL 355 (355) T ss_pred CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHCC T ss_conf 98589979999999999999998799999999999999999869959999409 No 8 >TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Probab=100.00 E-value=0 Score=366.89 Aligned_cols=335 Identities=23% Similarity=0.341 Sum_probs=270.9 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC---------C--------CCCCCHHHHHHHHHHHHHC Q ss_conf 4589707878787707788889999999719859998058866---------5--------6234789989988754325 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK---------N--------IPSIGISSLLTCWKKPIGQ 67 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~---------~--------~~~~~~~~~~~~~~~~~~~ 67 (352) .|.-|.+.+.. +|++..+.++++.+.+.++...++...... + .+...+..+..+++..... T Consensus 3 ~IlhiI~~l~~--GGaE~~~~~l~~~l~~~~~~~~vi~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~~~~l~~li~~~ 80 (374) T TIGR03088 3 LIVHVVYRFDV--GGLENGLVNLINHLPADRYRHAVVALTEVSAFRKRIQRPDVAFYALHKQPGKDVAVYPQLYRLLRQL 80 (374) T ss_pred EEEEEECCCCC--CHHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHHH T ss_conf 89999089999--8279999999997887599399999789845789998689889990788764799999999999983 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHH-------H-HHHHHHHHHHHHCCEEEECCHHHHHHCC------- Q ss_conf 970899628546899999999709981999757456-------2-0113479899626689988989997436------- Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQ-------R-NHSRWTRYLISRMDEVITTSQKSARFIE------- 132 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~-~~~~~~~~l~~~~d~ii~~s~~~~~~~~------- 132 (352) ++||+|+|.... +.+.+.+...+.|..+...+... . ......+.+...++.++++|+..++++. T Consensus 81 kpDiIh~~~~~~-~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~k~~~~~~~~~i~vs~~~~~~~~~~~~~~~ 159 (374) T TIGR03088 81 RPDIVHTRNLAA-LEAQLPAALAGVPARIHGEHGRDVFDLDGSNWKYRWLRRLYRPLIHHYVAVSRDLEDWLRGPVKVPP 159 (374) T ss_pred CCCEEEECCHHH-HHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHCCCH T ss_conf 984898636116-9999999984998899960787543743105899999999998568899915899999998709987 Q ss_pred CCEEEECCCCCHHHHCCCC-CHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCC----CEEEEEECCCC Q ss_conf 8839974882777823784-326878872889887279995122644512366654565300477----40588631333 Q gi|255764474|r 133 RPSTVIMHGVDTERFRPTS-NKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP----GWTAVVVGKTT 207 (352) Q Consensus 133 ~~~~vi~~gid~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~----~~~l~i~G~g~ 207 (352) .++.+|+||+|++.|.|.. .+...+...+..++..+++++||+.++||++.+++|+..+.++.| +++++++|+|+ T Consensus 160 ~ki~~I~Ngid~~~f~~~~~~~~~~~~~~~~~~~~~~i~~vgRl~~~Kg~~~li~a~~~l~~~~~~~~~~~~l~i~G~G~ 239 (374) T TIGR03088 160 AKIHQIYNGVDTERFHPSRGDRSPILPPDFFADESVVVGTVGRLQAVKDQPTLVRAFALLVRQLPEGAERLRLVIVGDGP 239 (374) T ss_pred HHEEEECCCCCHHHCCCCCCCHHHHHHHCCCCCCCEEEEEEEECEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCC T ss_conf 89899668776521587731067765431589877699996634030787999999999998677766888999981776 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCC Q ss_conf 21000146677644311233221222222331000000000001233322222210000010011102788410100234 Q gi|255764474|r 208 LKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPE 287 (352) Q Consensus 208 ~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~ 287 (352) .+ +++++++++++++++|.|+|+++++..+|+.+|+||+||++||||++++||||||+|||+||+||.+|+|. T Consensus 240 ~~-----~~l~~~i~~~~l~~~v~f~G~~~~~~~~~~~~Di~v~~S~~EGf~~~llEAma~g~PvIasdvgg~~eii~-- 312 (374) T TIGR03088 240 AR-----GACEQMVRAAGLAHLVWLPGERDDVPALMQALDLFVLPSLAEGISNTILEAMASGLPVIATAVGGNPELVQ-- 312 (374) T ss_pred CH-----HHHHHHHHHCCCCCEEECCCCCCCHHHHHHHCCCEEECCCCCCCCHHHHHHHHCCCCEEECCCCCHHHHHC-- T ss_conf 59-----99999999718777585378746899999963900313443446779999997599799918998189861-- Q ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 88424996599989999999999869899999999999999982998999999999999988 Q gi|255764474|r 288 NAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 288 ~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l 349 (352) +|.+|++++++|+++++++|.++++|++.+.+|+++|++++.++|||++++++++++|+++| T Consensus 313 ~~~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~v~~~Fs~~~~~~~~~~lY~~ll 374 (374) T TIGR03088 313 HGVTGALVPPGDAVALARALQPYVSDPAARRAHGAAGRARAEQQFSINAMVAAYAGLYDQLL 374 (374) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf 79868997899999999999999779999999999999999987899999999999999969 No 9 >cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. Probab=100.00 E-value=0 Score=363.62 Aligned_cols=333 Identities=29% Similarity=0.442 Sum_probs=276.5 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCC------------------C---CCHHHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058866562------------------3---478998998875 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIP------------------S---IGISSLLTCWKK 63 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~---~~~~~~~~~~~~ 63 (352) +|.+|.+.+.+..+|++..+.+++..+.+.|+++.+++........ . ..........+. T Consensus 1 kIlii~~~~~P~~GG~e~~~~~la~~L~~~G~~V~vit~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (374) T cd03801 1 KILLVTPEYPPSVGGAERHVLELARALAARGHEVTVLTPGDGGLPDEEEVGGIVVVRPPPLLRVRRLLLLLLLALRLRRL 80 (374) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98999487799988199999999999997799899996079887503423771699567665420024567899999999 Q ss_pred HHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHH-----------HHHHHHHHHHHHCCEEEECCHHHHHHCC Q ss_conf 43259708996285468999999997099819997574562-----------0113479899626689988989997436 Q gi|255764474|r 64 PIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQR-----------NHSRWTRYLISRMDEVITTSQKSARFIE 132 (352) Q Consensus 64 ~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~l~~~~d~ii~~s~~~~~~~~ 132 (352) ....++|++|+|..........++...+.|+.+..|..... ....+.+++++.+|.++++|+..++.+. T Consensus 81 ~~~~~~Dii~~~~~~~~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~~~l~ 160 (374) T cd03801 81 LRRERFDVVHAHDWLALLAAALAARLLGIPLVLTVHGLEFGRPGNELGLLLKLARALERRALRRADRIIAVSEATREELR 160 (374) T ss_pred HHHCCCCEEEECCCHHHHHHHHHHHHCCCCEEEEECCCCHHCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHH T ss_conf 98559989997883178999999986699789996786210022100256899999999999838999998999999999 Q ss_pred -------CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf -------8839974882777823784326878872889887279995122644512366654565300477405886313 Q gi|255764474|r 133 -------RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGK 205 (352) Q Consensus 133 -------~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~ 205 (352) .++.+||||+|.+.|.+.. ...+...+.+.+.++++|+||+.++||++.+|+|+.++.++.++++++++|+ T Consensus 161 ~~~~~~~~ki~vI~ngid~~~~~~~~--~~~~~~~~~~~~~~~il~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~ 238 (374) T cd03801 161 ELGGVPPEKITVIPNGVDTERFRPAP--RAARRRLGIPEDEPVILFVGRLVPRKGVDLLLEALAKLRKEYPDVRLVIVGD 238 (374) T ss_pred HHCCCCHHHEEEECCCCCHHHCCCCC--HHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 86199856899988976755417541--7789871899998299999533200283578999999985288729999568 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHH Q ss_conf 3321000146677644311233221222222--33100000000000123332222221000001001110278841010 Q gi|255764474|r 206 TTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSEL 283 (352) Q Consensus 206 g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~ 283 (352) |+.. +++++.+++++++++|.|+|++ +++..+|+.||++++||++||||++++|||+||+|||+|++||++|+ T Consensus 239 g~~~-----~~~~~~~~~~~l~~~v~f~g~v~~~~~~~~~~~adi~v~pS~~E~~~~~~lEAma~G~PvI~t~~gg~~e~ 313 (374) T cd03801 239 GPLR-----EELEALAAELGLGDRVTFLGFVPDEDLPALYAAADVFVLPSLYEGFGLVLLEAMAAGLPVVASDVGGIPEV 313 (374) T ss_pred CCHH-----HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCHHHH T ss_conf 8136-----69999999739985599758642113778876540365873554321589999976998999789975888 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 023488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 284 LDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 284 i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) |. ++.+|++++++|+++++++|.++++|++++.+|+++|++++.++|||++++++++++|. T Consensus 314 i~--~~~~G~l~~~~d~~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~fs~~~~~~~~~~~y~ 374 (374) T cd03801 314 VE--DGETGLLVPPGDPEALAEAILRLLDDPELRRRLGEAARERVAERFSWDRVAARTEEVYY 374 (374) T ss_pred HC--CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf 51--89718997899999999999999779999999999999999986899999999997666 No 10 >cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have Probab=100.00 E-value=0 Score=368.52 Aligned_cols=330 Identities=24% Similarity=0.335 Sum_probs=264.9 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC---------CCC-------C-CHHHHHHHHHHHHHC Q ss_conf 45897078787877077888899999997198599980588665---------623-------4-789989988754325 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN---------IPS-------I-GISSLLTCWKKPIGQ 67 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~---------~~~-------~-~~~~~~~~~~~~~~~ 67 (352) ||..|.+++. .+|++.++.++++.+.+.||++.++....+.. ... . .+..+..+.+..... T Consensus 1 KIl~vi~~l~--~GG~e~~~~~la~~L~~~G~~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 78 (360) T cd04951 1 KILYVITGLG--LGGAEKQVVDLADQFVAKGHQVAIISLTGESEVKPPIDATIILNLNMSKNPLSFLLALWKLRKILRQF 78 (360) T ss_pred CEEEEECCCC--CCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9899969999--81379999999999997699899998179854443345733786376667678999999999999982 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHH----HHCCEEEECCHHHHHHC-------CCCEE Q ss_conf 97089962854689999999970998199975745620113479899----62668998898999743-------68839 Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLI----SRMDEVITTSQKSARFI-------ERPST 136 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~----~~~d~ii~~s~~~~~~~-------~~~~~ 136 (352) ++|++|+|.....+.+.+.+..... .+++.+.|..........+.+ +.++.++.+|+....+. ..++. T Consensus 79 ~pDvIh~h~~~~~~~~~~~~~~~~~-~~~i~t~h~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~~~~~~~~ki~ 157 (360) T cd04951 79 KPDVVHAHMFHANIFARLLRLFLPS-PPLICTAHSKNEGGRLRMLAYRLTDFLSDLTTNVSKEALDYFIASKAFNANKSF 157 (360) T ss_pred CCCEEEECCCCHHHHHHHHHHHCCC-CCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHEE T ss_conf 9999998663078999999985799-819998588875417999999999888786523339999999985588844489 Q ss_pred EECCCCCHHHHCCCCC-HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHH Q ss_conf 9748827778237843-268788728898872799951226445123666545653004774058863133321000146 Q gi|255764474|r 137 VIMHGVDTERFRPTSN-KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKK 215 (352) Q Consensus 137 vi~~gid~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~ 215 (352) +||||+|.+.|.|.+. +...+..++++.+.++++++||+.++||++.+|+|+.++.++.++++++++|+|+.+ + T Consensus 158 vI~ngvd~~~f~~~~~~~~~~r~~~~~~~~~~~il~vgRl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-----~ 232 (360) T cd04951 158 VVYNGIDTDRFRKDPARRLKIRNALGVKNDTFVILAVGRLVEAKDYPNLLKAFAKLLSDYLDIKLLIAGDGPLR-----A 232 (360) T ss_pred EECCCCCHHHCCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCHHH-----H T ss_conf 96687344421876156788998619998987999984066331157899999999864899799996782567-----8 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEE Q ss_conf 67764431123322122222233100000000000123332222221000001001110278841010023488424996 Q gi|255764474|r 216 NLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIV 295 (352) Q Consensus 216 ~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~ 295 (352) ++++++++++++++|.|+|+++++.++|+.||+|++||++||||++++|||+||+|||+||+||++|+|. .+|+++ T Consensus 233 ~l~~~~~~~~l~~~v~f~G~~~d~~~~~~~adi~v~pS~~Egfg~~~lEAma~G~PvI~s~~gg~~eii~----~~G~lv 308 (360) T cd04951 233 TLERLIKALGLSNRVKLLGLRDDIAAYYNAADLFVLSSAWEGFGLVVAEAMACELPVVATDAGGVREVVG----DSGLIV 308 (360) T ss_pred HHHHHHHHHCCCCCEEECCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCHHHEEC----CCEEEE T ss_conf 8876677617776154247510268987621425588655788708999998599999878997265574----864998 Q ss_pred CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 599989999999999869899999999999999982998999999999999 Q gi|255764474|r 296 PPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 296 ~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) +++|+++++++|.++++|++++.+|+.++++++.++|||++++++++++|+ T Consensus 309 ~~~d~~~la~~i~~ll~~~~~~~~~~~~~~~~v~~~fs~~~~~~~~~~lY~ 359 (360) T cd04951 309 PISDPEALANKIDEILKMSGEERDIIGARRERIVKKFSINSIVQQWLTLYT 359 (360) T ss_pred CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf 399999999999999879199999999999999986999999999999739 No 11 >cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases. WbnK in Shigella dysenteriae has been shown to be involved in the type 7 O-antigen biosynthesis. Probab=100.00 E-value=0 Score=362.48 Aligned_cols=331 Identities=25% Similarity=0.370 Sum_probs=267.1 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC---------CCC---------CCCHHHHHHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058866---------562---------3478998998875432 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK---------NIP---------SIGISSLLTCWKKPIG 66 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~---------~~~---------~~~~~~~~~~~~~~~~ 66 (352) ||..|.+++.. +|++..+.+++..+.+.|+.+.++...... +++ ...+..+..+++.... T Consensus 1 KIl~v~~~l~~--GG~e~~~~~la~~L~~~g~~v~vi~~~~~~~~~~~~~~~~i~v~~l~~~~~~~~~~~~~~l~~~i~~ 78 (365) T cd03807 1 KVLHVITGLDV--GGAERMLVRLLKGLDRDRFEHVVISLTDRGELGEELEEAGVPVYCLGKRPGRPDPGALLRLYKLIRR 78 (365) T ss_pred CEEEEECCCCC--CCHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHHH T ss_conf 98999698999--4189999999999997799499999579985578987489569992787665688999999999998 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCC-CEEEEECCHHHH---H----HHHHHHHHHHHCCEEEECCHHHHHHC------C Q ss_conf 59708996285468999999997099-819997574562---0----11347989962668998898999743------6 Q gi|255764474|r 67 QNSRIWHARRNNEMLLGVMMRDVLRM-PLKLVFTSPSQR---N----HSRWTRYLISRMDEVITTSQKSARFI------E 132 (352) Q Consensus 67 ~~~~ivh~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~----~~~~~~~l~~~~d~ii~~s~~~~~~~------~ 132 (352) .++|++|+|.....+.+.+.....+. ++....+..... . .....+++...+|.++++|+..+..+ . T Consensus 79 ~~~DiIh~~~~~~~~~~~l~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~~~~~~~ 158 (365) T cd03807 79 LRPDVVHTWMYHADLYGGLAARLAGVPPVIWGIRHSDLDLGKKSTRLVARLRRLLSSFIPLIVANSAAAAEYHQAIGYPP 158 (365) T ss_pred HCCCEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCH T ss_conf 39999998774267999999997599828999568853210105799999999998429999994999999999819984 Q ss_pred CCEEEECCCCCHHHHCCCCC-HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 88399748827778237843-26878872889887279995122644512366654565300477405886313332100 Q gi|255764474|r 133 RPSTVIMHGVDTERFRPTSN-KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHY 211 (352) Q Consensus 133 ~~~~vi~~gid~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~ 211 (352) .+..+||||+|.+.|.|... ....+..++++++.++++++||+.++||++.+|+|+..+.++.|+++|+++|+|+... T Consensus 159 ~~~~vI~ngid~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~- 237 (365) T cd03807 159 KKIVVIPNGVDTERFSPDLDARARLREELGLPEDTFLIGIVARLHPQKDHATLLRAAALLLKKFPNARLLLVGDGPDRA- 237 (365) T ss_pred HHEEEECCCCCHHHCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCHH- T ss_conf 5689989986788669870367999998299988869999504653101567889999988758882799983785588- Q ss_pred CHHHHHHH-HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCC Q ss_conf 01466776-44311233221222222331000000000001233322222210000010011102788410100234884 Q gi|255764474|r 212 LFKKNLQR-RIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAK 290 (352) Q Consensus 212 ~~~~~l~~-~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~ 290 (352) +++. ..+++++.++|.|+|+++++..+|+.+|+|++||++||||++++|||+||+|||+||+||++|++. . T Consensus 238 ----~~~~~~~~~~~l~~~v~f~G~~~d~~~~~~~adi~v~pS~~Egf~~~~lEAma~G~PvI~s~~gg~~eii~----~ 309 (365) T cd03807 238 ----NLELLALKELGLEDKVILLGERSDVPALLNALDVFVLSSLSEGFPNVLLEAMACGLPVVATDVGDNAELVG----D 309 (365) T ss_pred ----HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCCHHEEC----C T ss_conf ----99998999759987399936633189999871603388700053327999998599999867998411451----7 Q ss_pred EEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 24996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 291 AGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 291 ~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) +|++++++|+++++++|.+++.|++++++|+++|++++.++|||++++++++++|+ T Consensus 310 ~G~l~~~~d~~~l~~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~~~lY~ 365 (365) T cd03807 310 TGFLVPPGDPEALAEAIEALLADPALRQALGEAARERIEENFSIEAMVEAYEELYR 365 (365) T ss_pred CEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf 67997799999999999999779999999999999999986899999999999768 No 12 >cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core. Probab=100.00 E-value=0 Score=365.83 Aligned_cols=322 Identities=23% Similarity=0.285 Sum_probs=264.5 Q ss_pred ECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC------------CCC------CCHHHHHHHHHHHHHCCCE Q ss_conf 7078787877077888899999997198599980588665------------623------4789989988754325970 Q gi|255764474|r 9 IAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN------------IPS------IGISSLLTCWKKPIGQNSR 70 (352) Q Consensus 9 i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~------------~~~------~~~~~~~~~~~~~~~~~~~ 70 (352) |.|.+.. +|++..+.++++.+.+.|+++.+++...+.. ++. ..+.....+.+.....++| T Consensus 3 ii~sl~~--GGaE~~~~~La~~L~~~Gh~V~vi~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~D 80 (355) T cd03819 3 VLPALES--GGVERGTLELARALVERGHRSLVASAGGRLVAELEAEGSRHIKLPFISKNPLRILLNVARLRRLIREEKVD 80 (355) T ss_pred ECCCCCC--CHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 7899999--41799999999999987998999968998715566349669991787788289999999999999996998 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHC-------CCCEEEECCCCC Q ss_conf 8996285468999999997099819997574562011347989962668998898999743-------688399748827 Q gi|255764474|r 71 IWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFI-------ERPSTVIMHGVD 143 (352) Q Consensus 71 ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~-------~~~~~vi~~gid 143 (352) ++|+|.....+.+.++....+.|+..++|. ......+.+.+..+.|.++++|+..++++ ..++.+||||+| T Consensus 81 ivh~h~~~~~~~~~~a~~~~~~~~i~t~H~--~~~~~~~~~~~~~~~~~~i~~S~~~~~~~~~~~~~~~~ki~vI~ngid 158 (355) T cd03819 81 IVHARSRAPAWSAYLAARRTRPPFVTTVHG--FYSVNFRYNAIMARGDRVIAVSNFIADHIRENYGVDPDRIRVIPRGVD 158 (355) T ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEECC--CCCHHHHHHHHHHHCCEEEEECHHHHHHHHHHHCCCHHHEEEECCCCC T ss_conf 999778644999999998539978999678--774067999999727989994589999999973998789999788756 Q ss_pred HHHHCCCCC----HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHH Q ss_conf 778237843----2687887288988727999512264451236665456530047740588631333210001466776 Q gi|255764474|r 144 TERFRPTSN----KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQR 219 (352) Q Consensus 144 ~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~ 219 (352) .+.|.|... ....+..++.+.+.++++++||+.++||++.+++|+..+.++.|+++++++|+|+.+ ..+.+++++ T Consensus 159 ~~~f~~~~~~~~~~~~~~~~~~~~~~~~~i~~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~-~~~~~~~~~ 237 (355) T cd03819 159 LDRFDPGAVPPERILALAREWPLPKGKPVILLPGRLTRWKGQEVFIEALARLKKDDPDVHLLIVGDAQGR-RFYYAELLE 237 (355) T ss_pred HHHCCCCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCC-HHHHHHHHH T ss_conf 5423877787788999998628999987999961665445769999999999864899799997078641-689999999 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCC-CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCC Q ss_conf 443112332212222223310000000000012333-2222221000001001110278841010023488424996599 Q gi|255764474|r 220 RIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLY-EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPR 298 (352) Q Consensus 220 ~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~-Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~ 298 (352) ++++++++++|.|+|+++++..+|+.+|+|++||.+ ||||++++||||||+|||+||+||++|+|. +|++|++++++ T Consensus 238 ~~~~~~l~~~v~f~G~~~d~~~~~~~adi~v~pS~~~Egf~~vllEAma~G~PvV~s~~gg~~eii~--~~~~G~l~~~~ 315 (355) T cd03819 238 LIKRLGLQDRVTFVGHCSDMPAAYALADIVVSASTEPEAFGRTAVEAQAMGRPVIASDHGGARETVR--PGETGLLVPPG 315 (355) T ss_pred HHHHCCCCCCEEECCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCHHHCC--CCCCEEEECCC T ss_conf 9998199762886576214689987403255887771000078999998699899908999476615--89978998899 Q ss_pred CHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 98999999999986-9899999999999999982998999 Q gi|255764474|r 299 NLHALEKAVLYFMN-SKKIMSDTGNRGRERAVKHFSIVKE 337 (352) Q Consensus 299 d~~~la~~i~~l~~-~~~~~~~~~~~a~~~~~~~fs~~~~ 337 (352) |+++|+++|.++++ +++++.+|+++||++++++|||++| T Consensus 316 d~~~l~~~i~~~l~~~~~~r~~~~~~ar~~~~~~fs~~~m 355 (355) T cd03819 316 DAEALAQALDQILSLLPEGRAKMFAKARMCVETLFSYDRM 355 (355) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHC T ss_conf 9999999999998699999999999999999984898339 No 13 >cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases. wlbH in Bordetella parapertussis has been shown to be required for the biosynthesis of a trisaccharide that, when attached to the B. pertussis lipopolysaccharide (LPS) core (band B), generates band A LPS. Probab=100.00 E-value=0 Score=351.85 Aligned_cols=332 Identities=29% Similarity=0.435 Sum_probs=265.8 Q ss_pred EEEECCCCCC-CCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCC-------------------C---C-CC----HHHH Q ss_conf 5897078787-8770778888999999971985999805886656-------------------2---3-47----8998 Q gi|255764474|r 6 IDVIAPNMKF-RHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNI-------------------P---S-IG----ISSL 57 (352) Q Consensus 6 i~~i~~~~~~-~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~---~-~~----~~~~ 57 (352) |.+|++.|.+ ..+|++..+.++++.+.+.|+++.++++...... . . .. .... T Consensus 1 ILiit~~fpp~~~GG~e~~~~~la~~L~~~G~~V~Vit~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (377) T cd03798 1 ILVISSLYPPPNNGGGGIFVKELARALAKRGVEVTVLAPGPWGPKLLDLLKGRLVGVERLPVLLPVVPLLKGPLLYLLAA 80 (377) T ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 29993776899889499999999999997799699995379875312312575200034454554443310466778999 Q ss_pred HHHHH--HHHHCCCEEEEECCCH-HHHHHHHHHHHHCCCEEEEECCHH------HHHHHHHHHHHHHHCCEEEECCHHHH Q ss_conf 99887--5432597089962854-689999999970998199975745------62011347989962668998898999 Q gi|255764474|r 58 LTCWK--KPIGQNSRIWHARRNN-EMLLGVMMRDVLRMPLKLVFTSPS------QRNHSRWTRYLISRMDEVITTSQKSA 128 (352) Q Consensus 58 ~~~~~--~~~~~~~~ivh~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~d~ii~~s~~~~ 128 (352) ..+++ .....++|++|+|... ....+..+....+.|+.+..|... ......+.++.++++|.++++|+..+ T Consensus 81 ~~l~~~~~~~~~~~DvI~~~~~~~~~~~~~~~~~~~~~~~v~~~h~~~~~~~~~~~~~~~~~~~~~~~ad~ii~~S~~~~ 160 (377) T cd03798 81 RALLKLLKLKRFRPDLIHAHFAYPDGFAAALLKRKLGIPLVVTLHGSDVNLLPRKRLLRALLRRALRRADAVIAVSEALA 160 (377) T ss_pred HHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHCCCCHHHHHHHHHHHHHCCEEEECCHHHH T ss_conf 99999999746998689978840679999999997399889996774143102316899999999985899998898999 Q ss_pred HHC------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 743------68839974882777823784326878872889887279995122644512366654565300477405886 Q gi|255764474|r 129 RFI------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVV 202 (352) Q Consensus 129 ~~~------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i 202 (352) +++ ..++.+||||+|.+.|.|.... .......+.+.+.|+|+||+.++||++.+++|+.++.++.++++|++ T Consensus 161 ~~~~~~~~~~~~i~vi~ngid~~~f~~~~~~--~~~~~~~~~~~~~i~~~Grl~~~Kg~~~li~a~~~l~~~~~~~~l~i 238 (377) T cd03798 161 DELKALGIDPEKVTVIPNGVDTERFSPADRA--EARKLGLPEDKKVILFVGRLVPRKGIDYLIEALARLLKKRPDVHLVI 238 (377) T ss_pred HHHHHHCCCCCCEEEECCCCCHHHCCCCCHH--HHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9999858996559998897578754988777--89860899998599998245200182899999999887488852243 Q ss_pred ECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC Q ss_conf 3133321000146677644311233221222222--33100000000000123332222221000001001110278841 Q gi|255764474|r 203 VGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF 280 (352) Q Consensus 203 ~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~ 280 (352) +|+|+.. +.++++++++++.++|.|+|++ +++..+|+.||++++||++||||++++||||||+|||+|++||. T Consensus 239 ~G~g~~~-----~~l~~~~~~~~l~~~v~~~g~v~~~~~~~~~~~adv~v~pS~~E~~~~~~lEama~G~PvI~~~~gg~ 313 (377) T cd03798 239 VGDGPLR-----EALEALAAELGLEDRVTFLGAVPHEEVPAYYAAADVFVLPSLREGFGLVLLEAMACGLPVVATDVGGI 313 (377) T ss_pred CCCCHHH-----HHHHHHHHHCCCCCEEEECCCEEHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCH T ss_conf 2682788-----89999988618873698605210010101333377413785576512558999975997999589986 Q ss_pred CHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 01002348842499659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 281 SELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 281 ~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) +|++. ++.+|++++++|+++++++|.++++|+++ +|+++||+.+.++|||+.++++|+++|++| T Consensus 314 ~e~i~--~~~~G~~~~~~~~~~l~~~i~~l~~~~~~--~~~~~~~~~~~~~fs~~~~~~~~~~~y~~~ 377 (377) T cd03798 314 PEIIT--DGENGLLVPPGDPEALAEAILRLLADPWL--RLGRAARRRVAERFSWENVAERLLELYREV 377 (377) T ss_pred HHHHC--CCCEEEEECCCCHHHHHHHHHHHHCCHHH--HHHHHHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 99851--79749997999999999999999879999--999999999998699999999999998569 No 14 >TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family; InterPro: IPR011875 This entry describes Corynebacterium glutamicum GlgA and closely related proteins in other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related IPR011835 from INTERPRO, family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor .. Probab=100.00 E-value=0 Score=362.59 Aligned_cols=339 Identities=22% Similarity=0.360 Sum_probs=270.4 Q ss_pred HEEEECCCCCCC-CCCHHHHHHHHHHHHHHH-CCEEEEEECCC-CCCCCC-------------CCHHHHHHHHH------ Q ss_conf 458970787878-770778888999999971-98599980588-665623-------------47899899887------ Q gi|255764474|r 5 NIDVIAPNMKFR-HTGVTSTVFGLCPIQRKL-GQRLVVFGYCL-PKNIPS-------------IGISSLLTCWK------ 62 (352) Q Consensus 5 ~i~~i~~~~~~~-~~Gv~~~v~~~~~~~~~~-~~~~~~~~~~~-~~~~~~-------------~~~~~~~~~~~------ 62 (352) ++.+++--+++. ++|...++..|+++|+++ ...+..++... ..++|. ..+...-.+-. T Consensus 2 kV~~LtrEYPP~vYGGAGVHv~~L~~~L~~l~~vdVr~fG~~rteadiPaiPna~~~~gsL~v~~Y~~~~~L~~GldPra 81 (416) T TIGR02149 2 KVTLLTREYPPEVYGGAGVHVEELARELARLVDVDVRCFGDKRTEADIPAIPNAFFSEGSLKVLGYRPWSELKEGLDPRA 81 (416) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCCHHHCCCCCHHH T ss_conf 01242467878867887411888999998653400464388744233456662216898328874078743225677225 Q ss_pred -------------HHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECC-----HHHHH-------HHHH-HHHHHHH Q ss_conf -------------54325970899628546899999999709981999757-----45620-------1134-7989962 Q gi|255764474|r 63 -------------KPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTS-----PSQRN-------HSRW-TRYLISR 116 (352) Q Consensus 63 -------------~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-------~~~~-~~~l~~~ 116 (352) .......+|+|+|+|...+.|.+++.+.+.|.+++-|| ++++. .+.| .|--+.. T Consensus 82 n~aL~tfSvDL~m~~d~~~~~vvHsHTWYa~LAG~LAk~Lyd~PlVvTaHSLEPLRPWK~EQLGgGY~lSsW~EktA~~a 161 (416) T TIGR02149 82 NKALKTFSVDLAMANDVEDADVVHSHTWYAALAGHLAKLLYDVPLVVTAHSLEPLRPWKKEQLGGGYKLSSWAEKTAIEA 161 (416) T ss_pred HHHHCCHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHCCCCCCCCHHHHHHHHHH T ss_conf 75402311578876110027155320788878999999966998399730378887131756589742024788889985 Q ss_pred CCEEEECCHHHHHHC--------CCCEEEECCCCCHHHHCCC--CCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHH Q ss_conf 668998898999743--------6883997488277782378--432687887288988727999512264451236665 Q gi|255764474|r 117 MDEVITTSQKSARFI--------ERPSTVIMHGVDTERFRPT--SNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVD 186 (352) Q Consensus 117 ~d~ii~~s~~~~~~~--------~~~~~vi~~gid~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~ 186 (352) +|+|||+|..+++-+ ++++.||+||||+..|.+. .+..+.+.+.|++.+++.++|+||++++||+..||+ T Consensus 162 Ad~vIAVS~amr~DiL~~YP~lD~~kv~Vv~NGId~~~y~~~~~~~~~~v~~~~Gid~~rP~~lFVGRItRQKGv~~L~~ 241 (416) T TIGR02149 162 ADRVIAVSGAMREDILKVYPDLDPEKVHVVYNGIDTKEYKPAADDDGNKVLDRYGIDRSRPYVLFVGRITRQKGVPHLLK 241 (416) T ss_pred CCCEEEHHHHCHHHHHCCCCCCCCCCEEEEECCCCHHHHCCCCCCCHHHHHHHHCCCCCCCEEEEEEEEECCCCHHHHHH T ss_conf 04065311103355831586888464688864764576068888741134663267998887898520203165589999 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC-CCCCCCCCC---CCCCCCCCHHHHHHCCCCCCCCCCCCC Q ss_conf 4565300477405886313332100014667764431123-322122222---233100000000000123332222221 Q gi|255764474|r 187 CMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGL-KKRILFIDE---QSSIEDWYRALNIFVAPPLYEGFGLTP 262 (352) Q Consensus 187 a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l-~~~V~~~g~---~~~~~~~~~~adi~i~pS~~Eg~gl~~ 262 (352) |...+. +++++++|.+. .+..+-.+|.++.++.+.- -+.|+|... ++++.+++.+|++||+||.||+||++= T Consensus 242 A~~~~~---~dvqvVLCAga-pDTPEv~~Ev~~~~a~l~~~R~gv~WI~~ml~~~~~~~L~~~A~vFvCPSvYEPLGIvN 317 (416) T TIGR02149 242 AVHLIS---KDVQVVLCAGA-PDTPEVAEEVRQAVAKLDREREGVIWIEKMLPKEELVELLSNAEVFVCPSVYEPLGIVN 317 (416) T ss_pred HHHHCC---CCCEEEEECCC-CCCCCHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHCCCEEECCCCCCCCCHHH T ss_conf 996255---20359870676-78720689999999988761698386356588789999984694786484425420556 Q ss_pred CCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCC-------------H-HHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 0000010011102788410100234884249965999-------------8-9999999999869899999999999999 Q gi|255764474|r 263 LEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRN-------------L-HALEKAVLYFMNSKKIMSDTGNRGRERA 328 (352) Q Consensus 263 lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d-------------~-~~la~~i~~l~~~~~~~~~~~~~a~~~~ 328 (352) |||||||+|||+|++||++|+|- +|+||++|++++ . +.||++|..++.||+..++||++||+++ T Consensus 318 LEAMAC~tpVVAS~~GGIpEVV~--dg~TG~LV~~~~lhdGtGtP~d~d~f~~~LA~ai~~ll~dp~~A~k~G~aGr~R~ 395 (416) T TIGR02149 318 LEAMACGTPVVASAVGGIPEVVV--DGETGFLVPIDDLHDGTGTPDDDDKFEADLAKAIDALLADPELAKKMGEAGRKRA 395 (416) T ss_pred HHHHHCCCCEEECCCCCCCCEEE--CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHCHHHHHHH T ss_conf 87885078634403689552683--3744312470145577888887405689999999997429578988344346554 Q ss_pred HHHCCHHHHHHHHHHHHHHHH Q ss_conf 982998999999999999988 Q gi|255764474|r 329 VKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 329 ~~~fs~~~~a~~~~~iY~~~l 349 (352) .+.|||+.+|++..++|++|+ T Consensus 396 ~~~FSW~~iA~kT~~~Y~~v~ 416 (416) T TIGR02149 396 VEEFSWESIAKKTVELYRKVL 416 (416) T ss_pred HHCCCHHHHHHHHHHHHHHCC T ss_conf 212575789999999987419 No 15 >cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran. Probab=100.00 E-value=0 Score=349.69 Aligned_cols=318 Identities=22% Similarity=0.339 Sum_probs=250.8 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC--------CCCC------------CCHHHHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058866--------5623------------4789989988754 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK--------NIPS------------IGISSLLTCWKKP 64 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~--------~~~~------------~~~~~~~~~~~~~ 64 (352) ||..|.|.+.. .+|++..+.++++.+.+.|+.+.+++...+. .+.. ..+..+..+.+.. T Consensus 1 KIl~v~~~l~~-~GG~e~~~~~la~~L~~~G~~V~vit~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~ 79 (348) T cd03820 1 KILFVIPSLGN-AGGAERVLSNLANALAEKGHEVTIISLDKGEPPFYELDPKIKVIDLGDKRDSKLLARFKKLRRLRKLL 79 (348) T ss_pred CEEEEECCCCC-CCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98999797999-98789999999999987799899999669998644058974999888765420567899999999999 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHH-----HHHHHHHHHHCCEEEECCHHHHHH----CCCCE Q ss_conf 325970899628546899999999709981999757456201-----134798996266899889899974----36883 Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNH-----SRWTRYLISRMDEVITTSQKSARF----IERPS 135 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~l~~~~d~ii~~s~~~~~~----~~~~~ 135 (352) ...++|++|++......... .......|+..+.|....... ....++.++.+|.++++|+..... ...+. T Consensus 80 ~~~~~Dvi~~~~~~~~~~~~-~~~~~~~~~i~~~H~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~~~~~~k~ 158 (348) T cd03820 80 KNNKPDVVISFLTSLLTFLA-SLGLKIVKLIVSEHNSPDAYKKRLRRLLLRRLLYRRADAVVVLTEEDRALYYKKFNKNV 158 (348) T ss_pred HHCCCCEEEECCCCHHHHHH-HHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCE T ss_conf 97599999998963699999-99759982899975785663013679999999998689999969999998752378898 Q ss_pred EEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHH Q ss_conf 99748827778237843268788728898872799951226445123666545653004774058863133321000146 Q gi|255764474|r 136 TVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKK 215 (352) Q Consensus 136 ~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~ 215 (352) .+||||+|...+.+... .+.+.|+++||+.++||++.+++|+..+.++.|+++|+++|+|+.+ + T Consensus 159 ~vI~N~v~~~~~~~~~~-----------~~~~~il~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-----~ 222 (348) T cd03820 159 VVIPNPLPFPPEEPSSD-----------LKSKRILAVGRLVPQKGFDLLIEAWAKIAKKHPDWKLRIVGDGPER-----E 222 (348) T ss_pred EEECCCCCHHHCCCCCC-----------CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCC-----C T ss_conf 99889988232265446-----------6798899993786324949998888988864898599999468753-----2 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC-CCCCCHHCCCCCCCEEEE Q ss_conf 6776443112332212222223310000000000012333222222100000100111027-884101002348842499 Q gi|255764474|r 216 NLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN-TGVFSELLDPENAKAGVI 294 (352) Q Consensus 216 ~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~-~~~~~e~i~~~~g~~G~~ 294 (352) +++++++++++.++|.|+|.++++..+|+.||++++||++||||++++||||||+|||+|| .||.+|+|. +|.+|++ T Consensus 223 ~l~~~i~~~~l~~~v~~~G~~~~~~~~~~~adv~v~pS~~Egfgl~~lEAma~G~PvIas~~~gg~~e~v~--~~~~G~l 300 (348) T cd03820 223 ALEALIKELGLEDRVILLGFTKNIEEYYAKASIFVLTSRFEGFPMVLLEAMAFGLPVISFDCPTGPSEIIE--DGVNGLL 300 (348) T ss_pred CHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCHHHHHC--CCCEEEE T ss_conf 01567776335773642475222233221357531464124587089999986999999679988499953--8962999 Q ss_pred ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 6599989999999999869899999999999999982998999999999 Q gi|255764474|r 295 VPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 295 ~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) ++++|+++|+++|.++++|++++++|+++|++++ ++|||++++++||+ T Consensus 301 ~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~-~~fsw~~i~~~ye~ 348 (348) T cd03820 301 VPNGDVEALAEALLRLMEDEELRKRMGANARESA-ERFSIENIIKQWEE 348 (348) T ss_pred ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHHCC T ss_conf 8899999999999999779999999999999999-96999999999659 No 16 >cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=100.00 E-value=0 Score=350.39 Aligned_cols=322 Identities=21% Similarity=0.290 Sum_probs=250.0 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-------------------CCCCCHHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665-------------------62347899899887543 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-------------------IPSIGISSLLTCWKKPI 65 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 65 (352) +|.+|...|.+..+|++.++.++++.+.+.||++.+++...... .....+........... T Consensus 1 kIL~i~~~f~P~~GG~e~~~~~L~~~L~~~Gh~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (357) T cd03795 1 RVLHVGKFYPPDRGGIEQVIRDLAEGLAARGIEVAVLCASPEPKGRDEERNGHRVIRAPSLLNVASTPFSPSFFKQLKKL 80 (357) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHH T ss_conf 99999382899898299999999999997799899998279887765025884799877433334420469999999997 Q ss_pred HCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHH------HHHHHHHHHHHHHCCEEEECCHHHHHHC------CC Q ss_conf 25970899628546899999999709981999757456------2011347989962668998898999743------68 Q gi|255764474|r 66 GQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQ------RNHSRWTRYLISRMDEVITTSQKSARFI------ER 133 (352) Q Consensus 66 ~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~l~~~~d~ii~~s~~~~~~~------~~ 133 (352) ..++|++|+|...............+.|+.+++|+... ....++.+++++++|.++++|+..++.. .. T Consensus 81 ~~~~Diih~h~~~~~~~~~~~~~~~~~~~v~t~H~~~~~~~~~~~~~~~~~~~~~~~ad~ii~~S~~~~~~~~~~~~~~~ 160 (357) T cd03795 81 AKKADVIHLHFPNPLADLALLLLPRKKPVVVHWHSDIVKQKLLLKLYRPLQRRFLRRADAIVATSPNYAETSPVLRRFRD 160 (357) T ss_pred CCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCC T ss_conf 25999999947635999999998579979999878853205679999999999998489999889999999998447767 Q ss_pred CEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCH Q ss_conf 83997488277782378432687887288988727999512264451236665456530047740588631333210001 Q gi|255764474|r 134 PSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLF 213 (352) Q Consensus 134 ~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~ 213 (352) ++.+||||+|...|.+.. ...+.......+.+.|+++||+.++||++.+|+|+.++ |+++++++|+|+.. T Consensus 161 k~~vIpngvd~~~~~~~~--~~~~~~~~~~~~~~~i~~vGrl~~~Kg~~~li~a~~~l----~~~~l~i~G~G~~~---- 230 (357) T cd03795 161 KVRVIPLGLDPARYPRPD--ALEEAIWRRAAGRPFFLFVGRLVYYKGLDVLLEAAAAL----PDAPLVIVGEGPLE---- 230 (357) T ss_pred CEEEECCCCCCCCCCCCC--HHHHHHHHCCCCCCEEEEEECCCHHCCCHHHHHHHHHC----CCEEEEEEECCCCC---- T ss_conf 689988976623368873--67888740358997899980780430957899898769----89099999567542---- Q ss_pred HHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCC--CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCC Q ss_conf 46677644311233221222222--331000000000001233--32222221000001001110278841010023488 Q gi|255764474|r 214 KKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPL--YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENA 289 (352) Q Consensus 214 ~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~--~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g 289 (352) +++++++++++++++|+|+|++ +++..+|++||++++||. .||||++++||||||+|||+|++||.+|.|. .+| T Consensus 231 -~~l~~~~~~~~~~~~V~f~G~~~~~~~~~~~~~adi~v~pS~~~~Egfg~~~lEAma~G~PVVat~~gg~~~~i~-~~~ 308 (357) T cd03795 231 -AELEALAAALGLLDRVRFLGRLDDEEKAALLAACDVFVFPSVERSEAFGIVLLEAMAFGKPVISTEIGTGGSYVN-LHG 308 (357) T ss_pred -CCCCCHHHHCCCCCEEEECCCCCHHHHHHHHHCCCEEEEECCHHCCCCCHHHHHHHHCCCCEEEECCCCCHHHHE-ECC T ss_conf -221000555187514752586514557988626878999464021356667999998799899935999815605-569 Q ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 4249965999899999999998698999999999999999829989999 Q gi|255764474|r 290 KAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEA 338 (352) Q Consensus 290 ~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a 338 (352) .+|++++++|+++++++|.++++|++++++|+++||++++++|||++++ T Consensus 309 ~~G~l~~~~d~~~l~~~i~~ll~~~~~~~~m~~~ar~~~~~~fs~~~mv 357 (357) T cd03795 309 VTGLVVPPGDPAALAEAIRRLLEDPELRERLGEAARERAEEEFTADRMV 357 (357) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHCC T ss_conf 5799978999999999999997799999999999999999857998519 No 17 >cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides. Probab=100.00 E-value=0 Score=349.31 Aligned_cols=321 Identities=22% Similarity=0.292 Sum_probs=251.9 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC------------CC----CC----CHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665------------62----34----789989988754 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN------------IP----SI----GISSLLTCWKKP 64 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~------------~~----~~----~~~~~~~~~~~~ 64 (352) ||..|.. ..+|++.++.++++.+.+.||++.++++..+.. ++ .. .+..+..+.+.. T Consensus 1 kil~i~~----~~GG~e~~~~~La~~L~~~Gh~V~vit~~~~~~~~~~~~gv~~~~~~~~~~~~~~~~~~~~~~~l~~~i 76 (359) T cd03808 1 KILHIVT----VDGGLYSFRLPLIKALRAAGYEVHVVAPPGDELEELEALGVKVIPIPLDRRGINPFKDLKALLRLYRLL 76 (359) T ss_pred CEEEEEC----CCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9899975----876599999999999997699999997079874336757988999278777788699999999999999 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHCCC-EEEEECCHHH-H--------HHHHHHHHHHHHCCEEEECCHHHHHHCC-- Q ss_conf 32597089962854689999999970998-1999757456-2--------0113479899626689988989997436-- Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVLRMP-LKLVFTSPSQ-R--------NHSRWTRYLISRMDEVITTSQKSARFIE-- 132 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~--------~~~~~~~~l~~~~d~ii~~s~~~~~~~~-- 132 (352) ...++|++|+|...+.+.+.+++...+.+ +..+.|.... . ......++.++.+|.++++|+..++.+. T Consensus 77 ~~~~pDvIh~~~~~~~~~~~la~~~~~~~~~v~~~h~~~~~~~~~~~~~~~~~~~~k~~~~~~~~ii~~S~~~~~~~~~~ 156 (359) T cd03808 77 RKERPDIVHTHTPKPGILGRLAARLAGVPKVIYTVHGLGFVFTSGGLKRRLYLLLERLALRFTDKVIFQNEDDRDLALKL 156 (359) T ss_pred HHHCCCEEEEECCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHH T ss_conf 98499899990651357899999864998699995677401245477899999999999964999999498999999983 Q ss_pred -----CCEEE-ECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf -----88399-748827778237843268788728898872799951226445123666545653004774058863133 Q gi|255764474|r 133 -----RPSTV-IMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKT 206 (352) Q Consensus 133 -----~~~~v-i~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g 206 (352) ....+ .+||+|.+.|.+.+.. .+.+.+.++++||+.++||++.+++|++.+.+++|+++++++|+| T Consensus 157 ~~~~~~~~~~i~~~gvd~~~~~~~~~~--------~~~~~~~i~~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~g 228 (359) T cd03808 157 GIIKKKKTVLIPGSGVDLDRFSPSPEP--------IPEDDPVFLFVARLLKDKGIDELLEAARILKAKGPNVRLLLVGDG 228 (359) T ss_pred CCCCCCCEEEECCCCCCHHHCCCCCCC--------CCCCCEEEEEECCCCHHCCHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 799746089977997686653866546--------898984999980463220739999999999863998089997688 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 32100014667764431123322122222233100000000000123332222221000001001110278841010023 Q gi|255764474|r 207 TLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 207 ~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) +..+ +..+..++++++.++|.|+|+++++..+|+.+|+|++||++||||++++||||||+|||+|++||++|+|. T Consensus 229 ~~~~----~~~~~~~~~~~~~~~v~f~G~~~~~~~~~~~~di~v~pS~~Egf~~~~lEAma~G~PvI~s~~gg~~e~i~- 303 (359) T cd03808 229 DEEN----PAAILEIEKLGLEGRVEFLGFRDDVPELLAAADVFVLPSYREGLPRVLLEAMAMGRPVIATDVPGCREAVI- 303 (359) T ss_pred CCHH----HHHHHHHHHCCCCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCEEECCCCCHHHHHH- T ss_conf 7258----99999999718898699807577899999960215787521357842899986699899948997288860- Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 488424996599989999999999869899999999999999982998999999999 Q gi|255764474|r 287 ENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 287 ~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) +|.+|++++++|+++++++|.++++|++++.+|+++|++++.++|||+.+++++++ T Consensus 304 -~~~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~~~~fs~~~~~~~~le 359 (359) T cd03808 304 -DGVNGFLVPPGDAEALADAIERLIEDPELRARMGQAARKRAEEEFDEEIVVKKLLE 359 (359) T ss_pred -CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf -79818998999999999999999889999999999999999987799999999749 No 18 >cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core. Probab=100.00 E-value=0 Score=347.38 Aligned_cols=317 Identities=22% Similarity=0.284 Sum_probs=255.7 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC--------C-----------CCCCHHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665--------6-----------2347899899887543 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN--------I-----------PSIGISSLLTCWKKPI 65 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~--------~-----------~~~~~~~~~~~~~~~~ 65 (352) +|..|.|.+. .+|++..+.++++.+.+.|+++.+++...... . ..........+.+... T Consensus 1 KIl~v~~~~~--~GG~e~~~~~la~~L~~~G~~V~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~ 78 (353) T cd03811 1 KILFVIPSLG--GGGAERVLLNLANGLDKRGYDVTLVVLRDEGDYLELLPSNVKLIPVRVLKLKSLRDLLAILRLRRLLR 78 (353) T ss_pred CEEEEECCCC--CCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 9899969999--91599999999999997799799999779985133305673388613556553325999999999999 Q ss_pred HCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHH------HHHHHHHHHHHHCCEEEECCHHHHHHCC------- Q ss_conf 259708996285468999999997099819997574562------0113479899626689988989997436------- Q gi|255764474|r 66 GQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQR------NHSRWTRYLISRMDEVITTSQKSARFIE------- 132 (352) Q Consensus 66 ~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~l~~~~d~ii~~s~~~~~~~~------- 132 (352) ..++|++|+|...............+.|...+.|..... ......+++++.+|.++++|+..++.+. T Consensus 79 ~~~~DiI~~~~~~~~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~~~~~~~~~~~~ 158 (353) T cd03811 79 KEKPDVVISHLTTTPNVLALLAARLGTKLIVWEHNSLSLELKRKLRLLLLIRKLYRRADKIVAVSEGVKEDLLKLLGIPP 158 (353) T ss_pred HCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCH T ss_conf 74998999988627899999997479978999798704432334669999999998689999959999999998619985 Q ss_pred CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC Q ss_conf 88399748827778237843268788728898872799951226445123666545653004774058863133321000 Q gi|255764474|r 133 RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYL 212 (352) Q Consensus 133 ~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~ 212 (352) .++.+||||+|.+.|.+. ...+.....+.+.+.++++||+.++||++.+|+|+..+.++.|+++++++|+|+.+ T Consensus 159 ~~i~vI~Ngvd~~~~~~~---~~~~~~~~~~~~~~~il~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~--- 232 (353) T cd03811 159 DKIEVIYNPIDIEEIRAL---AEEPLELGIPPDGPVILAVGRLSPQKGFDTLIRAFALLRKEGPDARLVILGDGPLR--- 232 (353) T ss_pred HHEEEECCCCCHHHHCHH---HHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCHHH--- T ss_conf 689997675686762324---56545306889986999982076642299999999976641873799974786039--- Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEE Q ss_conf 14667764431123322122222233100000000000123332222221000001001110278841010023488424 Q gi|255764474|r 213 FKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAG 292 (352) Q Consensus 213 ~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G 292 (352) +++++++++++++++|.|+|+++++.++|+.||++++||.+||||++++||||||+|||+||+||++|+|. +|.+| T Consensus 233 --~~l~~~~~~~~l~~~V~~~G~~~d~~~~~~~~di~v~pS~~Egfg~~~lEAma~G~pvI~s~~gg~~e~i~--~g~~G 308 (353) T cd03811 233 --EELEALAKELGLADRVHFLGFQSNPYPYLKAADLFVLSSRYEGFPNVLLEAMALGTPVVATDCPGPREILE--DGENG 308 (353) T ss_pred --HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHCCEEECCCCCCCCCHHHHHHHHCCCCEEECCCCCHHHHHC--CCCCE T ss_conf --99997788659860687548664302324208688715434688538999998099899948998489844--89838 Q ss_pred EEECCCCHHHHHHHHHHH---HCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 996599989999999999---86989999999999999998299 Q gi|255764474|r 293 VIVPPRNLHALEKAVLYF---MNSKKIMSDTGNRGRERAVKHFS 333 (352) Q Consensus 293 ~~~~~~d~~~la~~i~~l---~~~~~~~~~~~~~a~~~~~~~fs 333 (352) ++++++|++++++++..+ +.|++++++|+++|++++.++|| T Consensus 309 ~l~~~~d~~~la~~i~~l~~l~~~~~~~~~~g~~~~~~~~~~f~ 352 (353) T cd03811 309 LLVPVGDEAALAAAALALLDLLLDPELRERLAAAARERVAREYS 352 (353) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 99789999999999999985149999999999999999998679 No 19 >cd05844 GT1_like_7 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center Probab=100.00 E-value=0 Score=351.36 Aligned_cols=320 Identities=23% Similarity=0.322 Sum_probs=252.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECC--CCCCCCC----------CCH---------HHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058--8665623----------478---------998998875 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYC--LPKNIPS----------IGI---------SSLLTCWKK 63 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~--~~~~~~~----------~~~---------~~~~~~~~~ 63 (352) +|.++.+++... -+..+.+-+..+.+....+...... .+...+. ... +....+.+. T Consensus 1 ~v~i~~~~~~~~---sETFI~~q~~~l~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~r~ 77 (367) T cd05844 1 RVLIFRPLLLAP---SETFVRNQAEALRRFRPVYVGGRRLGPAPLGALAVRLADLAGGKAGLRLGALRLLTGSAPQLRRL 77 (367) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCHHHHHH T ss_conf 989992888998---53889999972778776799956237787776525766425536678999997426874899999 Q ss_pred HHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHH--H-H---------HHHHHHHHHHHCCEEEECCHHHHHHC Q ss_conf 4325970899628546899999999709981999757456--2-0---------11347989962668998898999743 Q gi|255764474|r 64 PIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQ--R-N---------HSRWTRYLISRMDEVITTSQKSARFI 131 (352) Q Consensus 64 ~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~---------~~~~~~~l~~~~d~ii~~s~~~~~~~ 131 (352) .+..++|++|+|+......+..++..++.|+.+++|.... . . .....+.+++++|.+|++|+..++.+ T Consensus 78 lr~~~pDlIHaH~~~~g~~~~~~a~~~~iP~V~T~Hg~d~~~~~~~~~~~~~~~~~~~~~~l~~~a~~iI~vS~~~~~~l 157 (367) T cd05844 78 LRRHRPDLVHAHFGFDGVYALPLARRLGVPLVVTFHGFDATTSLALLLRSRWALYARRRRRLARRAALFIAVSQFIRDRL 157 (367) T ss_pred HHHCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHCCCHHHHHHHHHHHHHHHCCEEEECCHHHHHHH T ss_conf 99769999997686068999999999699999998136410141010011046789999999972699996999999999 Q ss_pred ------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf ------68839974882777823784326878872889887279995122644512366654565300477405886313 Q gi|255764474|r 132 ------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGK 205 (352) Q Consensus 132 ------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~ 205 (352) +.++.|||||+|.+.|.|.+.. ...+.++|+||+.++||++.+|+|++.+.+++|+++++++|+ T Consensus 158 ~~~G~~~~ki~vi~~GvD~~~f~~~~~~----------~~~~~il~vGRl~~~KG~~~li~A~~~l~~~~p~~~l~ivG~ 227 (367) T cd05844 158 LALGFPPEKVHVHPIGVDTAKFTPATPA----------RRPPRILFVGRFVEKKGPLLLLEAFARLARRVPEVRLVIIGD 227 (367) T ss_pred HHCCCCHHHEEEECCCCCHHHCCCCCCC----------CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 9859897899997786367646999877----------789689999357300076999999999796686979999988 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCC------CCCCCCCCCCCCCCEEEEEECCC Q ss_conf 3321000146677644311233221222222--331000000000001233------32222221000001001110278 Q gi|255764474|r 206 TTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPL------YEGFGLTPLEAMASGIPVVASNT 277 (352) Q Consensus 206 g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~------~Eg~gl~~lEAma~G~PvI~s~~ 277 (352) |+.. ++++++++++++.++|+|+|.+ +++..+|+.+|+|++||. +||||++++||||||+|||+|++ T Consensus 228 Gp~~-----~~l~~~~~~l~l~~~V~f~G~~~~~~v~~~l~~adv~v~PS~~~~~g~~Eg~~~~~lEAmA~G~PVVat~~ 302 (367) T cd05844 228 GPLL-----AALEALARALGLGGRVTFLGAQPHAEVRELMRRARIFLQPSVTAPSGDAEGLPVVLLEAQASGVPVVATRH 302 (367) T ss_pred CCCH-----HHHHHHHHHCCCCCCEEECCCCCCHHHHHHHHHCCEEEEEEEECCCCCCCCCCHHHHHHHHCCCCEEEECC T ss_conf 8378-----99999999709876378778898188999998578799600203778856763799999984997899279 Q ss_pred CCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 8410100234884249965999899999999998698999999999999999829989999999999 Q gi|255764474|r 278 GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 278 ~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) ||++|+|. +|++|+++++.|+++++++|.++++|++++++|+++||++++++|||++++++++++ T Consensus 303 ggi~e~V~--~g~~G~lv~~~d~~~La~ai~~Ll~d~~~~~~m~~~gr~~v~~~f~~~~~~~~Le~l 367 (367) T cd05844 303 GGIPEAVE--DGETGLLVPEGDVAALAAALGRLLADPDLRARMGAAGRRRVEERFDLRRQTAKLEAL 367 (367) T ss_pred CCCHHHHC--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCC T ss_conf 98687720--799689978999999999999998499999999999999999819999999998339 No 20 >cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides. Probab=100.00 E-value=0 Score=343.39 Aligned_cols=333 Identities=20% Similarity=0.261 Sum_probs=260.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-------------CCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 45897078787877077888899999997198599980588665-------------62347899899887543259708 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-------------IPSIGISSLLTCWKKPIGQNSRI 71 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~i 71 (352) ||.+|.| +.+..+|++.++.++++.+.+.||.+.+++...... ........+....+.....++|+ T Consensus 1 kI~~i~~-ypP~~GGi~~~~~~La~~L~~~Gh~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~Dv 79 (366) T cd03822 1 RIALVSP-YPPRKCGIATFTTDLVNALSARGPDVLVVSVAALYPSLLYGGEQEVVRVIVLDNPLDYRRAARAIRLSGPDV 79 (366) T ss_pred CEEEEEC-CCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCCE T ss_conf 9899918-999998389999999999986799899995888887544467764167603667325999999998539999 Q ss_pred EEECCCHH------HHHHHHHHHHHCCCEEEEECCHHHHH----HHHHHHHHHHHCCEEEECCHHHHHH-----CCCCEE Q ss_conf 99628546------89999999970998199975745620----1134798996266899889899974-----368839 Q gi|255764474|r 72 WHARRNNE------MLLGVMMRDVLRMPLKLVFTSPSQRN----HSRWTRYLISRMDEVITTSQKSARF-----IERPST 136 (352) Q Consensus 72 vh~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~d~ii~~s~~~~~~-----~~~~~~ 136 (352) +|.+.+.. ...........+.|+..++|...... ...+.+.+++++|.+++.|...++. ...++. T Consensus 80 vh~~~~~~~~~~~~~~~~~~~~~~~~~p~v~t~H~~~~~~~~~~~~~~~~~~~~~ad~vi~~s~~~~~~~~~~~~~~~i~ 159 (366) T cd03822 80 VVIQHEYGIFGGEAGLYLLLLLRGLGIPVVVTLHTVLLHEPRPGDRALLRLLLRRADAVIVMSSELLRALLLRAYPEKIA 159 (366) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCEE T ss_conf 99936533210668999999998559989999777765542277999999999867999995799999998646987399 Q ss_pred EECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHH Q ss_conf 97488277782378432687887288988727999512264451236665456530047740588631333210001466 Q gi|255764474|r 137 VIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKN 216 (352) Q Consensus 137 vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~ 216 (352) +||||++...+.+. ..++.....++.++|+|+||+.++||++.+|+|++++.++.|+++++|+|++......+..+ T Consensus 160 vIpngv~~~~~~~~----~~~~~~~~~~~~~~il~~Grl~~~Kg~~~li~A~~~l~~~~~~~~l~ivG~~~~~~~~~~~~ 235 (366) T cd03822 160 VIPHGVPDPPAEPP----ESLKALGGLDGRPVLLTFGLLRPYKGLELLLEALPLLVAKHPDVRLLVAGETHPDLERYRGE 235 (366) T ss_pred EECCCCCCCCCCCH----HHHHHHCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHH T ss_conf 96799875455886----77887458999859999853405548599999999988768985999995898742667899 Q ss_pred HHHHHHHCCCCCCCCCC-CCC--CCCCCCCHHHHHHCCCCCCCCCCC--CCCCCCCCEEEEEECCCCCCCHHCCCCCCCE Q ss_conf 77644311233221222-222--331000000000001233322222--2100000100111027884101002348842 Q gi|255764474|r 217 LQRRIYANGLKKRILFI-DEQ--SSIEDWYRALNIFVAPPLYEGFGL--TPLEAMASGIPVVASNTGVFSELLDPENAKA 291 (352) Q Consensus 217 l~~~i~~~~l~~~V~~~-g~~--~~~~~~~~~adi~i~pS~~Eg~gl--~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~ 291 (352) ++.+++++|++++|.|. |++ +++..+|+.||+|++||.+|+++. +++||||||+|||+|++|+.+|++ ++++ T Consensus 236 ~~~~~~~lgl~~~V~f~~g~v~~~~~~~~~~~adv~v~Ps~~e~~~~s~v~~EAma~G~PvVat~~gg~~ev~---~~~~ 312 (366) T cd03822 236 AYALAERLGLADRVIFINRYLPDEELPELFSAADVVVLPYRSADQTQSGVLAYAIGFGKPVISTPVGHAEEVL---DGGT 312 (366) T ss_pred HHHHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCHHEEE---CCCC T ss_conf 9999997399765532478889999999999557030555466544569999999749989990899744088---3996 Q ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 4996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 292 GVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 292 G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) |++++++|+++++++|.++++|++++++|+++|++++ ++|||++++++++++|. T Consensus 313 G~lv~~~d~~~la~ai~~ll~d~~~r~~l~~~a~~~a-~~~sW~~ia~~~~~ly~ 366 (366) T cd03822 313 GLLVPPGDPAALAEAIRRLLADPELAQALRARAREYA-RAMSWERVAERYLRLLA 366 (366) T ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHC T ss_conf 8998999999999999999879999999999999999-97999999999999859 No 21 >cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases. capH in Staphylococcus aureus has been shown to be required for the biosynthesis of the type 1 capsular polysaccharide (CP1). Probab=100.00 E-value=0 Score=345.17 Aligned_cols=323 Identities=17% Similarity=0.289 Sum_probs=258.5 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC----------C--------CCCCHHHHHHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665----------6--------23478998998875432 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN----------I--------PSIGISSLLTCWKKPIG 66 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~----------~--------~~~~~~~~~~~~~~~~~ 66 (352) +|..|.+.+.. +|++..+.++++.+.+.|+.+.++....... . ....+..+..+.+.... T Consensus 1 KIL~vi~~l~~--GGaE~~~~~L~~~L~~~g~~v~v~~~~~~~~~~~~~~~~~g~~v~~~~~~~~~~~~~~~~l~~~i~~ 78 (358) T cd03812 1 KILHIVGTMNR--GGIETFIMNYYRNLDRSKIQFDFLVTSKEEGDYDDEIEKLGGKIYYIPARKKNPLKYFKKLYKLIKK 78 (358) T ss_pred CEEEECCCCCC--CCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 98999499999--1189999999999987699899999879876368999975997999389764289999999999998 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCEEEEE-CCHHHH-------HHHHHHHH-HHHHCCEEEECCHHHHHHC-----C Q ss_conf 59708996285468999999997099819997-574562-------01134798-9962668998898999743-----6 Q gi|255764474|r 67 QNSRIWHARRNNEMLLGVMMRDVLRMPLKLVF-TSPSQR-------NHSRWTRY-LISRMDEVITTSQKSARFI-----E 132 (352) Q Consensus 67 ~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-------~~~~~~~~-l~~~~d~ii~~s~~~~~~~-----~ 132 (352) .++|++|+|.........+++...+.|..+.+ |..... ......++ +.+.+|.++++|+..++++ . T Consensus 79 ~~~DiIh~h~~~~~~~~~~~~~~~~~~~~I~h~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iavS~~~~~~l~~~~~~ 158 (358) T cd03812 79 NKYDIVHVHGSSASGFILLAAKKAGVKVRIAHSHNTSDSHDKKKKILKYKVLRKLINRLATDYLACSEEAGKWLFGKVKN 158 (358) T ss_pred CCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECHHHHHHHHHHCCC T ss_conf 39999998575068999999998599989998578744543167899999999999986999999588999999731687 Q ss_pred CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC Q ss_conf 88399748827778237843268788728898872799951226445123666545653004774058863133321000 Q gi|255764474|r 133 RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYL 212 (352) Q Consensus 133 ~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~ 212 (352) .++.+||||||.+.|.+.+.....++..+.+++.++++++||+.++||++.+|+|++.+.++.|+++++++|+|+.+ T Consensus 159 ~ki~vI~NgId~~~~~~~~~~~~~~~~~~~~~~~~vi~~vGRl~~~Kg~~~Li~A~~~l~~~~~~~~l~ivG~G~~~--- 235 (358) T cd03812 159 KKFKVIPNGIDLEKFIFNEEIRKKRRELGILEDKFVIGHVGRFSEQKNHEFLIEIFAELLKKNPNAKLLLVGDGELE--- 235 (358) T ss_pred CCEEEECCCCCHHHCCCCHHHHHHHHHHCCCCCCEEEEEECCCHHHHCCHHHHHHHHHHHHHCCCCEEEEECCCHHH--- T ss_conf 87899869807442387546689999838998986999960476651718899999986510888239996275278--- Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEE Q ss_conf 14667764431123322122222233100000000000123332222221000001001110278841010023488424 Q gi|255764474|r 213 FKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAG 292 (352) Q Consensus 213 ~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G 292 (352) +++++++++++++++|.|+|+++++..+|+.||+|++||++||||++++||||||+|||+||+||.++++ ++.+| T Consensus 236 --~~l~~~i~~~~l~~~V~f~G~~~~~~~~~~~aDi~v~pS~~Egf~~v~lEAma~G~PVVasd~gg~~~ii---~~~~~ 310 (358) T cd03812 236 --EEIKKKVKELGLEDKVIFLGVRNDVPELLQAMDVFLFPSLYEGLPLVLIEAQASGLPCILSDTITKEVDL---TDLVK 310 (358) T ss_pred --HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHCCEEEECCCCCCCCHHHHHHHHHCCCEEEECCCCCCEEE---ECCCC T ss_conf --7899999982987249974670137899973989997487678847999999949989996599974697---29955 Q ss_pred EEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9965999899999999998698999999999999999829989999999 Q gi|255764474|r 293 VIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 293 ~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) ++++++|+++||++|.+|++||+ +++++++++. +.+.++..+..+ T Consensus 311 ~l~~~~~~~~~a~~I~~l~~~~~-~~~~~~~~~~---~~~~~~~~~~~~ 355 (358) T cd03812 311 FLSLDESPEIWAEEILKLKSEDR-RERSSESIKK---KGLDADDEANTL 355 (358) T ss_pred EEECCCCHHHHHHHHHHHHCCHH-HHHHHHHHHH---CCCCHHHHHHHH T ss_conf 79689999999999999865836-7999999998---599879999998 No 22 >cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide. Probab=100.00 E-value=0 Score=342.43 Aligned_cols=331 Identities=23% Similarity=0.325 Sum_probs=260.0 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCC------------------CC-----CCHHHHHHHH Q ss_conf 458970787878770778888999999971985999805886656------------------23-----4789989988 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNI------------------PS-----IGISSLLTCW 61 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~-----~~~~~~~~~~ 61 (352) ||.+|.+.|.+..+|++.++.++++.+.+.||++.+++...+... +. ..+......+ T Consensus 1 KIl~i~~~~~P~~GG~e~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (375) T cd03821 1 KILHVIPSFDPKYGGPVRVVLNLSKALAKLGHEVTVATTDAGGDPLLVALNGVPVKLFSINVAYGLNLARYLFPPSLLAW 80 (375) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHH T ss_conf 98999598899999899999999999997799899997079876431024675167416655542013332220689999 Q ss_pred HHHHHCCCEEEEECCC--HHHHHHHHHHHHHCCCEEEEECCH---HHHHHHHH---------HHHHHHHCCEEEECCHHH Q ss_conf 7543259708996285--468999999997099819997574---56201134---------798996266899889899 Q gi|255764474|r 62 KKPIGQNSRIWHARRN--NEMLLGVMMRDVLRMPLKLVFTSP---SQRNHSRW---------TRYLISRMDEVITTSQKS 127 (352) Q Consensus 62 ~~~~~~~~~ivh~~~~--~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~---------~~~l~~~~d~ii~~s~~~ 127 (352) ......++|++|+|.. .....+.......+.|+.+..|.. .......+ .++..+.++.++++|+.. T Consensus 81 ~~~~~~~~Dvi~~~~~~~~~~~~~~~~~~~~~~p~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~s~~~ 160 (375) T cd03821 81 LRLNIREADIVHVHGLWSYPSLAAARAARKYGIPYVVSPHGMLDPWALPHKALKKRLAWFLFERRLLQAAAAVHATSEQE 160 (375) T ss_pred HHHHCCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCHHH T ss_conf 99854899999986862566999999999849969999799860344443467888999999999874086999765799 Q ss_pred HHHC-----CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 9743-----68839974882777823784326878872889887279995122644512366654565300477405886 Q gi|255764474|r 128 ARFI-----ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVV 202 (352) Q Consensus 128 ~~~~-----~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i 202 (352) ..+. ..++.+||||+|.+.|.+.+. ...+...+.+.+.++++|+||+.++||++.+++|+.++.++.+++++++ T Consensus 161 ~~~~~~~~~~~k~~vI~ngid~~~~~~~~~-~~~~~~~~~~~~~~~il~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~i 239 (375) T cd03821 161 AAEIRRLGLKAPIAVIPNGVDIPPFAALPS-RGRRRKFPILPDKRIILFLGRLHPKKGLDLLIEAFAKLAERFPDWHLVI 239 (375) T ss_pred HHHHHHHCCCCCEEEECCCCCCCCCCCCCH-HHHHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 999996289888899869727201488623-7889854899898389998713432147789999999997679859999 Q ss_pred ECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC Q ss_conf 31333210001466776443112332212222223--3100000000000123332222221000001001110278841 Q gi|255764474|r 203 VGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF 280 (352) Q Consensus 203 ~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~ 280 (352) +|.|+.. +.+.++.++++++++++|.|+|+++ ++..+|+.||++++||++||||++++||||||+|||+|+.||+ T Consensus 240 vG~g~~~---~~~~~~~~~~~~~l~~~V~f~G~~~~~~~~~~~~~adi~v~pS~~Egf~~~~lEAma~G~PvIas~~gg~ 316 (375) T cd03821 240 AGPDEGG---YRAELKQIAAALGLEDRVTFTGMLYGEDKAAALADADLFVLPSHSENFGIVVAEALACGTPVVTTDKVPW 316 (375) T ss_pred EECCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCEECCCCHHHHHHHHHHHHHHCCCCEEECCCCCC T ss_conf 9789826---7899999999826785585347768311098985150014684776645899999985999999279980 Q ss_pred CHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 010023488424996599989999999999869899999999999999982998999999999 Q gi|255764474|r 281 SELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 281 ~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) +|++. ++ +|+++++ |+++++++|.++++|++++.+|+++||+++.++|||++++++|++ T Consensus 317 ~eii~--~~-~g~~~~~-~~~~l~~~i~~l~~d~~~~~~~~~~ar~~~~~~fsw~~~~~~~~~ 375 (375) T cd03821 317 QELIE--YG-CGWVVDD-DVDALAAALRRALELPQRLKAMGENGRALVEERFSWTAIAQQLLE 375 (375) T ss_pred HHEEE--CC-CEEEECC-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf 77287--88-4899492-999999999999769999999999999999995899999999749 No 23 >cd03805 GT1_ALG2_like This family is most closely related to the GT1 family of glycosyltransferases. ALG2, a 1,3-mannosyltransferase, in yeast catalyzes the mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate and Man(1)GlcNAc(2)-dolichol diphosphate to form Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of this enzyme causes an abnormal accumulation of Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol, which is associated with a type of congenital disorders of glycosylation (CDG), designated CDG-Ii, in humans. Probab=100.00 E-value=0 Score=344.13 Aligned_cols=329 Identities=19% Similarity=0.277 Sum_probs=248.9 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-------------------CCCCC---HHHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665-------------------62347---899899887 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-------------------IPSIG---ISSLLTCWK 62 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~---~~~~~~~~~ 62 (352) +|.+|.|++. .+|++..+.++++.+.+.||++.+++...... +|... ...+...++ T Consensus 2 kI~fi~p~l~--~GGaEr~v~~la~~L~~~Gh~V~v~t~~~d~~~~~~~~~~~~~~v~~~~~~~p~~~~~~~~~~~~~lr 79 (392) T cd03805 2 RVAFIHPDLG--IGGAERLVVDAALALQSRGHEVTIYTSHHDPSHCFEETKDGTLPVRVRGDWLPRSIFGRFHILCAYLR 79 (392) T ss_pred EEEEECCCCC--CCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHCCCCEEEEEECCCCCCHHCCHHHHHHHHHH T ss_conf 8999869999--98699999999999997699399997268833240555178548999267467012123789999999 Q ss_pred ---------HHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHH---HHHH-----------HHHHHHHHHHCCE Q ss_conf ---------5432597089962854689999999970998199975745---6201-----------1347989962668 Q gi|255764474|r 63 ---------KPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPS---QRNH-----------SRWTRYLISRMDE 119 (352) Q Consensus 63 ---------~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-----------~~~~~~l~~~~d~ 119 (352) ......+|++|++....... +++...+.++.+..|.+. ..+. ..+.++.++++|. T Consensus 80 ~~~~~~~~~~~~~~~~Dvi~~~~~~~~~~--~~~~~~~~~ii~~~H~p~~~l~~~~~~~~~~y~~~~~~le~~~~~~~d~ 157 (392) T cd03805 80 MLYLALYLLLLPDEKYDVFIVDQVSACVP--LLKLFSPSKILFYCHFPDQLLAQRGSLLKRLYRKPFDWLEEFTTGMADK 157 (392) T ss_pred HHHHHHHHHHHCCCCCEEEEECCCCHHHH--HHHHCCCCCEEEEEECCCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCE T ss_conf 99999999863347980999888534799--9997469967999607840210266689999999999999998850889 Q ss_pred EEECCHHHHHHC--------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHH Q ss_conf 998898999743--------688399748827778237843268788728898872799951226445123666545653 Q gi|255764474|r 120 VITTSQKSARFI--------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINI 191 (352) Q Consensus 120 ii~~s~~~~~~~--------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l 191 (352) ++++|+.+++.+ ..+..||+||+|.+.|.+.......+ ......+.++++++||+.++||++.+|+|++.+ T Consensus 158 ii~~S~~~~~~~~~~~~~~~~~~~~vi~ngid~~~~~~~~~~~~~~-~~~~~~~~~~il~vgRl~~~Kg~~~lI~A~~~l 236 (392) T cd03805 158 IVVNSNFTASVFKKTFPSLAKNPREVVYPCVDTDSFESTSEDPDPG-LLIPKSGKKTFLSINRFERKKNIALAIEAFAIL 236 (392) T ss_pred EEECCHHHHHHHHHHHHHHCCCCCEEECCCCCHHHCCCCCCCHHHH-HHCCCCCCEEEEEEECCCCCCCHHHHHHHHHHH T ss_conf 9995678999999983020658857977984777648764104455-532467873999984445346689999999999 Q ss_pred HCCCC---CEEEEEECCCCCC---CCCHHHHHHHHHHHC-CCCCCCCCCCCCCCC--CCCCHHHHHHCCCCCCCCCCCCC Q ss_conf 00477---4058863133321---000146677644311-233221222222331--00000000000123332222221 Q gi|255764474|r 192 LPHHP---GWTAVVVGKTTLK---HYLFKKNLQRRIYAN-GLKKRILFIDEQSSI--EDWYRALNIFVAPPLYEGFGLTP 262 (352) Q Consensus 192 ~~~~~---~~~l~i~G~g~~~---~~~~~~~l~~~i~~~-~l~~~V~~~g~~~~~--~~~~~~adi~i~pS~~Eg~gl~~ 262 (352) .++.| +++|+++|+++.+ +..|.++|+++++++ +++++|.|+|.+++. ..+|++||+|++||.+||||+++ T Consensus 237 ~~~~~~~~~~~Lvi~Gg~~~r~~e~~~y~~eL~~l~~~~~~l~~~V~Flg~~~~~~~~~l~~~ad~~v~~s~~Egfg~v~ 316 (392) T cd03805 237 KDKLAEFKNVRLVIAGGYDPRVAENVEYLEELQRLAEELLLLEDQVIFLPSISDSQKELLLSSARALLYTPSNEHFGIVP 316 (392) T ss_pred HHHCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCEEEECCCCCCCCHHH T ss_conf 98567668859999818755551018999999999998259878599988899699999998597999887434666079 Q ss_pred CCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 0000010011102788410100234884249965999899999999998698999999999999999829989999999 Q gi|255764474|r 263 LEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 263 lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) +||||||+|||+||+||.+|+|. +|.+|+++++ |++++|++|.++++|++++++||++||+++.++|||+++++++ T Consensus 317 lEAma~G~PVVasd~gG~~E~I~--~g~~G~Lv~~-d~~~la~~i~~ll~d~~lr~~mg~~ar~~v~~~Fs~e~~a~~l 392 (392) T cd03805 317 LEAMYAGKPVIACNSGGPLETVV--DGETGFLCEP-TPEEFAEAMLKLANDPDLADRMGAAGRKRVKEKFSTEAFAERL 392 (392) T ss_pred HHHHHCCCCEEEECCCCCHHHEE--CCCEEEEECC-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHCC T ss_conf 99997799999948998676645--7966999595-9999999999997899999999999999999866999998409 No 24 >PRK10307 predicted glycosyl transferase; Provisional Probab=100.00 E-value=0 Score=338.89 Aligned_cols=337 Identities=19% Similarity=0.201 Sum_probs=265.5 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC--CCCC------------------C---------CCCCC-- Q ss_conf 458970787878770778888999999971985999805--8866------------------5---------62347-- Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGY--CLPK------------------N---------IPSIG-- 53 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~--~~~~------------------~---------~~~~~-- 53 (352) +|.++..+|.+..+|+..++.++++.+.+.||++.+++. ..|. + .+... T Consensus 2 rIl~vs~~y~P~~~G~~~~~~~La~~L~~~GheV~Vit~~p~~p~~~~~~~~~~~~~~~e~~~gv~v~R~p~~~~~~~~~ 81 (415) T PRK10307 2 KILVYGINYAPELTGIGKYTGEMAEWLAARGHEVRVITAPPYYPQWQVGEGYSAWRYRREERGGVTVWRCPLYVPKQPSG 81 (415) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCEEEEEEECCCCCCCHH T ss_conf 89998584899788799999999999997899899997799887665577766654311367888999830456788408 Q ss_pred HHHHH----------HHHHHHHHCCCEEEEECCCHH--HHHHHHHHHHHCCCEEEEECCH------------------HH Q ss_conf 89989----------988754325970899628546--8999999997099819997574------------------56 Q gi|255764474|r 54 ISSLL----------TCWKKPIGQNSRIWHARRNNE--MLLGVMMRDVLRMPLKLVFTSP------------------SQ 103 (352) Q Consensus 54 ~~~~~----------~~~~~~~~~~~~ivh~~~~~~--~~~~~~~~~~~~~~~~~~~~~~------------------~~ 103 (352) ...+. .........++|++|.+.+.. ...+..++...+.+..+..+.. .. T Consensus 82 ~~r~~~~~~f~~~~~~~~~~~~~~~pD~v~~~~p~~~~~~~~~~~~~~~~~~~v~~~~d~~~~~~~~~g~l~~~~~~~~~ 161 (415) T PRK10307 82 LKRLLHLGSFALSSFPPLMAQRRWRPDRVIGVAPTLFCAPGARLLARLSGARTWLHIQDYEVDAAFGLGLLGKGKGGKVA 161 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEHHHHHHHCCCCCCCCHHHH T ss_conf 99999999999999999999847699989992877889999999999629888999900211456651401124456999 Q ss_pred HHHHHHHHHHHHHCCEEEECCHHHHHHC------CCCEEEECCCCCHHHHCCCCC--HHHHHHHCCCCCCCCEEEEEEEE Q ss_conf 2011347989962668998898999743------688399748827778237843--26878872889887279995122 Q gi|255764474|r 104 RNHSRWTRYLISRMDEVITTSQKSARFI------ERPSTVIMHGVDTERFRPTSN--KQEARRHLKISEDAKLIGCFGRI 175 (352) Q Consensus 104 ~~~~~~~~~l~~~~d~ii~~s~~~~~~~------~~~~~vi~~gid~~~~~~~~~--~~~~~~~~~~~~~~~~i~~~G~~ 175 (352) +....+.+|+++++|.++++|+..++++ ..++.+||||+|.+.|.|.+. ....|.+++++++.++++|+||+ T Consensus 162 ~~~~~~e~~~~~~ad~v~~~S~~~~~~l~~~g~~~~ki~vipNgvd~~~f~p~~~~~~~~~r~~~g~~~~~~vvly~Grl 241 (415) T PRK10307 162 RLATAFERSLLRRFDNVSTISRSMMNKAREKGVAAEKVIFFPNWSDVARFQDVADADVDALRAQLGLPDGKKIVLYSGNI 241 (415) T ss_pred HHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCC T ss_conf 99999999999858989977999999999828998709997681510003787852068999970999998799994776 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCC Q ss_conf 6445123666545653004774058863133321000146677644311233221222222--33100000000000123 Q gi|255764474|r 176 RKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPP 253 (352) Q Consensus 176 ~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS 253 (352) .+.||++.+++|+..+. ..|+++|+|+|+|+.+ ++|+++++++|+. +|.|+|.+ +++..+|++||++++|| T Consensus 242 ~~~kg~~~li~A~~~l~-~~~~~~lvivG~G~~~-----~~L~~~a~~~gl~-~V~f~g~~~~e~l~~~~~~aDv~v~ps 314 (415) T PRK10307 242 GEKQGLELVIDAAARLR-DRPDLIFVICGQGGGK-----ARLEKMARCRGLR-NVHFLPLQPYDALPALLKMADCHLLPQ 314 (415) T ss_pred HHHCCHHHHHHHHHHHH-CCCCEEEEEECCCCHH-----HHHHHHHHHCCCC-CEEECCCCCHHHHHHHHHHCCEEEEEC T ss_conf 01118799999999831-2898699996887408-----9999999970998-389818878899999998474999704 Q ss_pred CCCCCCCC----CCCCCCCEEEEEECCCCCC--CHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 33222222----1000001001110278841--01002348842499659998999999999986989999999999999 Q gi|255764474|r 254 LYEGFGLT----PLEAMASGIPVVASNTGVF--SELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRER 327 (352) Q Consensus 254 ~~Eg~gl~----~lEAma~G~PvI~s~~~~~--~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~ 327 (352) +.|+++++ +.||||||+|||+|+.+|. .+.+. .++++|++++++|+++|+++|.++++|++++++||++||++ T Consensus 315 ~~e~~~~v~Pskl~~~mA~G~PVva~~~~g~~~~~~v~-~~~~~G~~v~p~d~~~La~ai~~ll~d~~~~~~mg~~gr~~ 393 (415) T PRK10307 315 KAGAADLVLPSKLTNMLASGRNVVATAEPGTELGQLCE-TFPGIGVCVPPEDVEALVAAIRQLLALPKRRTALGAAAREY 393 (415) T ss_pred CCCCCCCCCHHHHHHHHHCCCCEEEEECCCCCCCHHEE-EECCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 41112345757999998669968999258876520127-62780899789999999999999977999999999999999 Q ss_pred HHHHCCHHHHHHHHHHHHHHHH Q ss_conf 9982998999999999999988 Q gi|255764474|r 328 AVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 328 ~~~~fs~~~~a~~~~~iY~~~l 349 (352) ++++|||++++++|+++|+.|| T Consensus 394 ~~~~f~~e~~~~~~~~~~~~ll 415 (415) T PRK10307 394 AERTLDRENVLRQFIADIRGLL 415 (415) T ss_pred HHHHCCHHHHHHHHHHHHHHHC T ss_conf 9997799999999999999759 No 25 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=100.00 E-value=0 Score=344.13 Aligned_cols=317 Identities=19% Similarity=0.239 Sum_probs=242.4 Q ss_pred HEEEECCCCCCC-CCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCC--------------------C----------- Q ss_conf 458970787878-77077888899999997198599980588665623--------------------4----------- Q gi|255764474|r 5 NIDVIAPNMKFR-HTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPS--------------------I----------- 52 (352) Q Consensus 5 ~i~~i~~~~~~~-~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~----------- 52 (352) +|.+|...|.+. .+|++.++.++++.+.+.||++.+++......... . T Consensus 1 rIl~vt~~~pP~~~GG~e~~~~~la~~L~~~Gh~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (359) T cd03823 1 RILVVNHLYPPRSVGGAEVVAHDLAEALAKRGHEVAVLTAGEDPPRQDKEVIGVVVYGRPIDEVLRSALPRDLFHLSDYD 80 (359) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHC T ss_conf 99999486489997649999999999999779989999557987664213576179704764200354310156776422 Q ss_pred CHHHHHHHHHHHHHCCCEEEEECCCHHHH-HHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHC Q ss_conf 78998998875432597089962854689-99999997099819997574562011347989962668998898999743 Q gi|255764474|r 53 GISSLLTCWKKPIGQNSRIWHARRNNEML-LGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFI 131 (352) Q Consensus 53 ~~~~~~~~~~~~~~~~~~ivh~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~ 131 (352) .......+.+.....++|++|+|...... ..+......+.|+..+.|... ........+.+.+|.++++|+..++.+ T Consensus 81 ~~~~~~~~~~~~~~~~pDivh~h~~~~~~~~~~~~~~~~~~p~v~t~h~~~--~~~~~~~~~~~~~d~vi~~S~~~~~~~ 158 (359) T cd03823 81 NPAVVAEFARLLEDFRPDVVHFHHLQGLGVSILRAARDRGIPIVLTLHDYW--LICPRQGLFKKGGDAVIAPSRFLLDRY 158 (359) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCEEEEECCHH--HCCCHHHHHHCCCCEEEECCHHHHHHH T ss_conf 789999999999874999999888317679999999984998999972221--106177887458999999999999999 Q ss_pred ------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf ------68839974882777823784326878872889887279995122644512366654565300477405886313 Q gi|255764474|r 132 ------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGK 205 (352) Q Consensus 132 ------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~ 205 (352) ..++.+||||+|.+.|.+..... +.+...|+|+||+.++||++.+|+|++++. .++++|+|+|+ T Consensus 159 ~~~~~~~~~i~vI~ngvd~~~~~~~~~~~--------~~~~~~i~~vGRl~~~Kg~~~li~a~~~l~--~~~~~l~i~G~ 228 (359) T cd03823 159 VANGLFAEKISVIRNGIDLDRAKRPRRAP--------PGGRLRFGFIGQLTPHKGVDLLLEAFKRLP--RGDIELVIVGN 228 (359) T ss_pred HHHCCCCCCEEEECCCCCHHHCCCCCCCC--------CCCCEEEEEECCCCCCCCHHHHHHHHHHCC--CCCCEEEEECC T ss_conf 98089932389988986845427433345--------678749999588976259999999998555--57828999778 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCC-CCCCCCCCCCCCCCEEEEEECCCCCCCH Q ss_conf 3321000146677644311233221222222--331000000000001233-3222222100000100111027884101 Q gi|255764474|r 206 TTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPL-YEGFGLTPLEAMASGIPVVASNTGVFSE 282 (352) Q Consensus 206 g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~-~Eg~gl~~lEAma~G~PvI~s~~~~~~e 282 (352) |+... .+ ....+..++|.|+|.+ +++..+|++||+|++||. .||||++++||||||+|||+|++||++| T Consensus 229 g~~~~-----~~---~~~~~~~~~v~f~G~~~~~~~~~~~~~adi~v~pS~~~E~fg~~~lEAma~G~PvIas~~gg~~e 300 (359) T cd03823 229 GLELE-----EE---SYELEGDPRVEFLGAYPQEEIDDFYAEIDVLVVPSIWPENFPLVIREALAAGVPVIASDIGGMAE 300 (359) T ss_pred CCHHH-----HH---HHHHHCCCCCEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCCHH T ss_conf 60568-----99---99972577648806567899999998657365677565777479999998299899888998175 Q ss_pred HCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 00234884249965999899999999998698999999999999999829989999999999999 Q gi|255764474|r 283 LLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDR 347 (352) Q Consensus 283 ~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~ 347 (352) +|. +|.+|++++++|+++|+++|.++++|++++++|++++ .++|||+.++++++++|++ T Consensus 301 ~i~--~g~~G~lv~~~d~~~la~ai~~ll~d~~~~~~~~~~~----~~~~s~~~~a~~~~~~Y~s 359 (359) T cd03823 301 LVR--DGVNGLLFPPGDAEDLAAALERLIDDPDLLERLRAGI----EPPRSIEDQAEEYLKLYRS 359 (359) T ss_pred HHC--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHH----HHCCCHHHHHHHHHHHHCC T ss_conf 603--7986799899999999999999984999999999999----9747999999999998098 No 26 >cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis. Probab=100.00 E-value=0 Score=342.29 Aligned_cols=315 Identities=22% Similarity=0.296 Sum_probs=239.4 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHH-H Q ss_conf 458970787878770778888999999971985999805886656234789989988754325970899628546899-9 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRNNEMLL-G 83 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~~~~~~-~ 83 (352) ||..|.+. ...+|++..+.++++.+.+.|+++.++...... +.+......+|++|.|+...... . T Consensus 2 KIL~v~~~--~~~GGae~~~~~L~~~L~~~Gh~v~v~~~~~~~------------~~~~~~~~~~dvvh~h~~~~~~~~~ 67 (365) T cd03825 2 KVLHLNTS--DISGGAARAAYRLHRALQAAGVDSTMLVQEKKA------------LISKIEIINADIVHLHWIHGGFLSI 67 (365) T ss_pred EEEEEECC--CCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCC------------HHHHHHHCCCCEEEECCCCCCHHCH T ss_conf 59999389--999238999999999999779908999926984------------7776655289989984776102059 Q ss_pred HHHHH-HHCCCEEEEECCHH--------H------------------HHHHHHHHHH--------HHHCCEEEECCHHHH Q ss_conf 99999-70998199975745--------6------------------2011347989--------962668998898999 Q gi|255764474|r 84 VMMRD-VLRMPLKLVFTSPS--------Q------------------RNHSRWTRYL--------ISRMDEVITTSQKSA 128 (352) Q Consensus 84 ~~~~~-~~~~~~~~~~~~~~--------~------------------~~~~~~~~~l--------~~~~d~ii~~s~~~~ 128 (352) ..++. ..+.|+..+.|..+ . .....+.++. ....+.++++|+..+ T Consensus 68 ~~l~~l~~~~p~v~t~Hd~~~~tg~~~~~~~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~~~~~~~~iv~~S~~~~ 147 (365) T cd03825 68 EDLSKLLDRKPVVWTLHDMWPFTGGCHYPGGCDRYKTECGNCPQLGSYPEKDLSRWIWRRKRKAWADLNLTIVAPSRWLA 147 (365) T ss_pred HHHHHHHCCCCEEEEECCCCHHHCCCHHCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCHHHH T ss_conf 99999970899899963565221510100011343020465777566553466799999999998525998998698999 Q ss_pred HHC-------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEE--EEEECHHHHHHHHHHHHHHHHCC-CCCE Q ss_conf 743-------6883997488277782378432687887288988727999--51226445123666545653004-7740 Q gi|255764474|r 129 RFI-------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGC--FGRIRKLKGTDLFVDCMINILPH-HPGW 198 (352) Q Consensus 129 ~~~-------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~--~G~~~~~Kg~~~li~a~~~l~~~-~~~~ 198 (352) +.. ..++.+||||||.+.|.|. ++...|..++++.+.+++++ +|++.++||++.+++|++.+.++ .+++ T Consensus 148 ~~~~~~~~~~~~ki~vI~Ngid~~~f~p~-~~~~~r~~~~~~~~~~vi~~~~~~~~~~~Kg~~~li~A~~~l~~~~~~~~ 226 (365) T cd03825 148 DCARSSSLFKGIPIEVIPNGIDTTIFRPR-DKREARKRLGLPADKKIILFGAVGGTDPRKGFDELIEALKRLAERWKDDI 226 (365) T ss_pred HHHHHCCCCCCCCEEEECCCCCHHHCCCC-CHHHHHHHHCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCE T ss_conf 99997248898978998997364644986-88999998397988858999530015643247999999998765068988 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCC---CCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 5886313332100014667764431123322122222233---1000000000001233322222210000010011102 Q gi|255764474|r 199 TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSS---IEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS 275 (352) Q Consensus 199 ~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~---~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s 275 (352) .++++|++..+ .+ ..++.+|.|+|++++ +..+|++||+|++||++||||++++||||||+|||+| T Consensus 227 ~lvi~G~~~~~-------~~-----~~l~~~v~flG~~~~~~~l~~~~~~aDi~v~pS~~Egfg~v~lEAma~G~PVVas 294 (365) T cd03825 227 ELVVFGASDPE-------IP-----PDLPFPVHYLGSLNDDESLALIYSAADVFVVPSLQENFPNTAIEALACGTPVVAF 294 (365) T ss_pred EEEEECCCCHH-------HH-----HHCCCCEEEECCCCCHHHHHHHHHHCCEEECCCCCCCCCHHHHHHHHCCCCEEEC T ss_conf 99993798588-------89-----6689979992687998999999972729951677688859999999719989973 Q ss_pred CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 7884101002348842499659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 276 NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 276 ~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) |+||++|+|. +|.+|++++++|+++|+++|.++++|++++++|+++||+++.++|||++++++|+++|++| T Consensus 295 d~gg~~eiv~--~~~~G~lv~~~d~~~la~ai~~ll~d~~~~~~~~~~ar~~~~~~fs~~~~~~~~~~lY~~l 365 (365) T cd03825 295 DVGGIPDIVD--HGVTGYLAKPGDPEDLAEGIEWLLADPDEREELGEAARELAENEFDSRVQAKRYLSLYEEL 365 (365) T ss_pred CCCCHHHHHC--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 8998599960--7982799799999999999999986999999999999999998689999999999998687 No 27 >cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase. Probab=100.00 E-value=0 Score=332.66 Aligned_cols=328 Identities=22% Similarity=0.273 Sum_probs=256.4 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCC---------------CCC-----CCC-----C-HHHH- Q ss_conf 458970787878770778888999999971985999805886---------------656-----234-----7-8998- Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLP---------------KNI-----PSI-----G-ISSL- 57 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~---------------~~~-----~~~-----~-~~~~- 57 (352) +|.+|++.|.+..+|++.++.++++.+.+.||++.+++.... .++ +.. . .... T Consensus 1 KIlii~~~fpP~~gG~~~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~~~~~~~~~~~gi~v~r~~~~~~~~~~~~~~~~ 80 (394) T cd03794 1 KILILSQYFPPELGGGAFRTTELAEELVKRGHEVTVITGSPNYPSGKIYKGYKREEVDGVRVHRVPLPPYKKNGLLKRLL 80 (394) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCEEEEEECCCCCCCCCHHHHHH T ss_conf 98999177789898299999999999997799799995478776432356664466488599993377667752788999 Q ss_pred --------HHHHHHHHHCCCEEEEECCCH--HHHHHHHHHHHHCCCEEEEECCHH----------------HHHHHHHHH Q ss_conf --------998875432597089962854--689999999970998199975745----------------620113479 Q gi|255764474|r 58 --------LTCWKKPIGQNSRIWHARRNN--EMLLGVMMRDVLRMPLKLVFTSPS----------------QRNHSRWTR 111 (352) Q Consensus 58 --------~~~~~~~~~~~~~ivh~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~ 111 (352) ...+......++|++|+|.+. ....+..++...+.|+.++.|... .+....+.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~Div~~~~~~~~~~~~~~~~~~~~~~p~v~~~hd~~p~~~~~~~~~~~~~~~~~~~~~~~~ 160 (394) T cd03794 81 NYLSFALSALLALLKRRRRPDVIIATSPPLLIALAALLLARLKGAPFVLEVRDLWPESAVALGLLKNGSLLYRLLRKLER 160 (394) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 99999999999999855899889991784788999999998639969999687446789983674444489999999999 Q ss_pred HHHHHCCEEEECCHHHHHHC------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHH Q ss_conf 89962668998898999743------688399748827778237843268788728898872799951226445123666 Q gi|255764474|r 112 YLISRMDEVITTSQKSARFI------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFV 185 (352) Q Consensus 112 ~l~~~~d~ii~~s~~~~~~~------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li 185 (352) ++++++|.++++|+..++++ ..++.+||||+|.+.|.+.+.... ........+.+.++|+||+.+.||++.++ T Consensus 161 ~~~~~ad~vi~~S~~~~~~~~~~~~~~~~i~vipngvd~~~~~~~~~~~~-~~~~~~~~~~~~i~~~Grl~~~kg~~~li 239 (394) T cd03794 161 LIYRRADAIVVISPGMREYLVRRGVPPEKISVIPNGVDLELFKPPPADES-LRKELGLDDKFVVLYAGNIGRAQGLDTLL 239 (394) T ss_pred HHHHHCCEEEECCHHHHHHHHHHCCCCCCEEEEECCCCHHHCCCCCCHHH-HHHHCCCCCCEEEEEEEECCCCCCCHHHH T ss_conf 99984899997729999999984899230999947625765277750477-78742689985999961110003637999 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCC-----CC Q ss_conf 545653004774058863133321000146677644311233221222222--33100000000000123332-----22 Q gi|255764474|r 186 DCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYE-----GF 258 (352) Q Consensus 186 ~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~E-----g~ 258 (352) +|+..+ ++.|+++++++|+|+.. +++++.++++++ ++|.|+|++ +++..+|+.||++++|+..+ ++ T Consensus 240 ~a~~~l-~~~~~~~l~ivG~G~~~-----~~l~~~~~~~~~-~~V~~~G~v~~~~~~~~~~~adi~v~p~~~~~~~~~~~ 312 (394) T cd03794 240 EAAALL-KDRPDIRFLIVGDGPEK-----EELKELAKALGL-DNVTFLGRVPKEELPELLAAADVGLVPLKPGPAFEGVS 312 (394) T ss_pred HHHHHH-CCCCCEEEEEECCCHHH-----HHHHHHHHHCCC-CEEEECCCCCCCHHHHHHHCCCEEEEECCCCCCCCCCC T ss_conf 999974-55898599995685167-----899999998199-94998163046136778742969999277754457735 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 22210000010011102788410100234884249965999899999999998698999999999999999829989999 Q gi|255764474|r 259 GLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEA 338 (352) Q Consensus 259 gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a 338 (352) |++++||||||+|||+|++||++|++. +|.+|++++++|+++++++|.++++|++++.+|+++||+++.++|||++++ T Consensus 313 P~kllEAma~G~PVV~s~~gg~~e~i~--~~~~G~l~~~~d~~~la~~i~~ll~d~~~~~~~~~~ar~~~~~~fsw~~~a 390 (394) T cd03794 313 PSKLFEYMAAGKPVLASVDGESAELVE--EAGAGLVVPPGDPEALAAAILELLDDPEERAEMGENGRRYVEEKFSREKLA 390 (394) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHEE--CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 689999998499799958998077612--188089977999999999999997799999999999999999858999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|255764474|r 339 SDIG 342 (352) Q Consensus 339 ~~~~ 342 (352) ++|+ T Consensus 391 ~~~l 394 (394) T cd03794 391 ERLL 394 (394) T ss_pred HHHC T ss_conf 8319 No 28 >cd03813 GT1_like_3 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=100.00 E-value=0 Score=340.06 Aligned_cols=271 Identities=24% Similarity=0.427 Sum_probs=233.8 Q ss_pred CCCEEEEEC-CCHHHHHHHHHHHHHCCCEEEEECCHHHHHH---------------HHH-------HHHHHHHCCEEEEC Q ss_conf 597089962-8546899999999709981999757456201---------------134-------79899626689988 Q gi|255764474|r 67 QNSRIWHAR-RNNEMLLGVMMRDVLRMPLKLVFTSPSQRNH---------------SRW-------TRYLISRMDEVITT 123 (352) Q Consensus 67 ~~~~ivh~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~-------~~~l~~~~d~ii~~ 123 (352) .+.+++|+. .+.+.+.+...++..+.|+.++-|.-..+.. ..| .+..|..+|.|++. T Consensus 172 P~a~vyHavStGyAgllg~l~~~~~g~P~llTEHGIYtRER~i~l~~a~w~~~~~r~~wi~ff~~l~~~~Y~~Ad~I~~l 251 (475) T cd03813 172 PKADVYHAVSTGYAGLLGALAKARRGTPFLLTEHGIYTRERKIELLQADWEMSYFRRLWIRFFESLGRLAYQAADRITTL 251 (475) T ss_pred CCCCEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCEEECC T ss_conf 85657851455377899999988758987985067228999988864754358999999999999999999851700013 Q ss_pred CHHHHHHC------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCC Q ss_conf 98999743------688399748827778237843268788728898872799951226445123666545653004774 Q gi|255764474|r 124 SQKSARFI------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPG 197 (352) Q Consensus 124 s~~~~~~~------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~ 197 (352) ++...++- +.++.|||||||.++|.|...... ....++++++||+.|.||++.+|+|++.+.++.|+ T Consensus 252 ~~~nr~~qi~~Ga~~~k~~vIpNGId~~~F~~~~~~~~-------~~~~~~v~~vgRv~p~Kdi~tlI~A~~~v~~~~p~ 324 (475) T cd03813 252 YEGNRERQIEDGADPEKIRVIPNGIDPERFAPARRARP-------EKEPPVVGLIGRVVPIKDIKTFIRAAAIVRKKIPD 324 (475) T ss_pred CHHHHHHHHHHCCCHHHCEECCCCCCHHHCCCCCCCCC-------CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 42448999981989999441179832988488765666-------78998899997011166999999999999986898 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCC Q ss_conf 05886313332100014667764431123322122222233100000000000123332222221000001001110278 Q gi|255764474|r 198 WTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNT 277 (352) Q Consensus 198 ~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~ 277 (352) ++++|+|.++ ++..|.++++.+++++|++++|.|+|.+ ++.++|+++|++++||.+||||++++||||||+|||+||+ T Consensus 325 ~rl~I~Gp~d-~~~~y~~ec~~lv~~lgL~~~V~F~G~~-dv~~~l~~~Dv~vl~S~~Eg~plvllEAmA~G~PvVaTdV 402 (475) T cd03813 325 AEGWVIGPTD-EDPEYAEECRELVESLGLEDNVKFTGFQ-NVKEYLPKLDVLVLTSISEGQPLVILEAMAAGIPVVATDV 402 (475) T ss_pred EEEEEECCCC-CCHHHHHHHHHHHHHHCCCCCEEECCCC-CHHHHHHHCCEEEECCCCCCCCHHHHHHHHCCCCEEECCC T ss_conf 3999977998-8858999999999982998727983878-9899998579999657334675799999976998897269 Q ss_pred CCCCHHCCCC----CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 8410100234----88424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 278 GVFSELLDPE----NAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 278 ~~~~e~i~~~----~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) ||++|+|... .|.+|+++++.|++++|+++.+|++|++++++|+++|+++++++||+++++++|.++|. T Consensus 403 Gg~~e~v~~~~~~~~G~~G~lvp~~d~~~LA~ai~~Ll~d~~~r~~~g~~ar~rv~~~f~~e~m~~~Y~~lY~ 475 (475) T cd03813 403 GSCRELIEGADDEALGPAGEVVPPADPEALARAILRLLKDPELRRAMGEAGRKRVERYYTLERMIDSYRRLYL 475 (475) T ss_pred CCCHHHHCCCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC T ss_conf 9818875386656778854896999999999999999739999999999999999996899999999998539 No 29 >cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide. Probab=100.00 E-value=2.8e-45 Score=324.65 Aligned_cols=313 Identities=26% Similarity=0.292 Sum_probs=239.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCC--------------------CCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 7877077888899999997198599980588665623--------------------47899899887543259708996 Q gi|255764474|r 15 FRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPS--------------------IGISSLLTCWKKPIGQNSRIWHA 74 (352) Q Consensus 15 ~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~ivh~ 74 (352) .+.+|+..++.++++.+.+.++.+.++.......... .................+|++|+ T Consensus 12 ~~~gGi~ry~~~L~~~L~~~g~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dv~h~ 91 (365) T cd03809 12 RRPTGIGRYARELLRALLKLDPEEVLLLLPGAPGLLLLPLRAALRLLLRLPRRLLWGLLFLLRAGDRLLLLLLGLDLLHS 91 (365) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99996899999999999745997799993586422111112354101356653113343577888899998559999998 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCHHH------------HHHHHHHHHHHHHCCEEEECCHHHHHHC-------CCCE Q ss_conf 28546899999999709981999757456------------2011347989962668998898999743-------6883 Q gi|255764474|r 75 RRNNEMLLGVMMRDVLRMPLKLVFTSPSQ------------RNHSRWTRYLISRMDEVITTSQKSARFI-------ERPS 135 (352) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~l~~~~d~ii~~s~~~~~~~-------~~~~ 135 (352) +.....+ ....+.|..+++|.-.. .....+.+..++++|.++++|+.+++.+ ..++ T Consensus 92 ~~~~~~~-----~~~~~~~~V~t~HD~~~~~~~~~~~~~~~~~~~~~~~~~~~~ad~ii~vS~~~~~~~~~~~~~~~~~i 166 (365) T cd03809 92 PHNTAPL-----LRLRGVPVVVTIHDLIPLRFPEYFSPGFRRYFRRLLRRALRRADAIITVSEATKRDLLRYLGVPPDKI 166 (365) T ss_pred CCCHHHH-----HHHCCCCEEEEECCCHHHHCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHCCCHHHE T ss_conf 9832655-----64359989999788506538200797789999999999999699999979999999999849885898 Q ss_pred EEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHH Q ss_conf 99748827778237843268788728898872799951226445123666545653004774058863133321000146 Q gi|255764474|r 136 TVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKK 215 (352) Q Consensus 136 ~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~ 215 (352) .|||||+|...+.+.......+.... .+.++|+|+|++.++||++.+++|+..+.++.++++|+++|..... .+ T Consensus 167 ~vi~~gv~~~~~~~~~~~~~~~~~~~--~~~~~il~vg~~~~~K~~~~li~a~~~l~~~~~~~~lvivG~~~~~----~~ 240 (365) T cd03809 167 VVIPLGVDPRFRPPPAEAEVLRALYL--LPRPYFLYVGTIEPRKNLERLLEAFARLPAKGPDPKLVIVGKRGWL----NE 240 (365) T ss_pred EEECCCCCHHCCCCCCHHHHHHHHCC--CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCC----HH T ss_conf 99815555110588742678887438--9998899995364557999999999998876899389997789741----79 Q ss_pred HHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEE Q ss_conf 6776443112332212222223--31000000000001233322222210000010011102788410100234884249 Q gi|255764474|r 216 NLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGV 293 (352) Q Consensus 216 ~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~ 293 (352) ++.+.+++.++.++|+|+|+++ ++..+|++||+|++||.+||||++++|||+||+|||+|++|+++|++ |.+|+ T Consensus 241 ~~~~~~~~~~~~~~v~~~g~~~~~~l~~~y~~ad~~v~PS~~EgfGl~~lEAma~G~Pvi~s~~~~~~Ei~----g~~g~ 316 (365) T cd03809 241 ELLARLRELGLGDRVRFLGYVSDEELAALYRGARAFVFPSLYEGFGLPVLEAMACGTPVIASNISSLPEVA----GDAAL 316 (365) T ss_pred HHHHHHHHCCCCCCEEEECCCCHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHCCCCEEEECCCCCCEEE----CCCEE T ss_conf 99999996599985899368798999999971774354143357896899999859989990799873085----78379 Q ss_pred EECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 96599989999999999869899999999999999982998999999999 Q gi|255764474|r 294 IVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 294 ~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) +++|+|+++++++|.++++|++++++|+++|++++ ++|||+++++++.+ T Consensus 317 ~~~p~d~~~la~~i~~ll~d~~~~~~~~~~~~~~~-~~fsW~~~a~~~l~ 365 (365) T cd03809 317 YFDPLDPEALAAAIERLLEDPALREELRERGLARA-KRFSWEKTARRTLD 365 (365) T ss_pred EECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHHHC T ss_conf 98999999999999999879999999999999999-96999999999859 No 30 >cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=100.00 E-value=3.8e-44 Score=316.25 Aligned_cols=325 Identities=19% Similarity=0.218 Sum_probs=234.3 Q ss_pred HEEEE-CCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-------------C--CCC-CHHHH---HHHHHHH Q ss_conf 45897-078787877077888899999997198599980588665-------------6--234-78998---9988754 Q gi|255764474|r 5 NIDVI-APNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-------------I--PSI-GISSL---LTCWKKP 64 (352) Q Consensus 5 ~i~~i-~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-------------~--~~~-~~~~~---~~~~~~~ 64 (352) +|.+| ...+++..+|++.++.++++.+.+.||++.+++...... + +.. ....+ +...... T Consensus 1 ~ia~i~~~~~Pp~~GG~e~~v~~La~~L~~~Gh~V~v~t~~~~~~~~~~~~~gv~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (363) T cd04955 1 KIAIIGTRGIPAKYGGFETFVEELAPRLVARGHEVTVYCRSPYPKQKETEYNGVRLIHIPAPEIGGLGTIIYDILAILHA 80 (363) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEECCEEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 98999168489999878999999999999779979999878988887513477799992764445167789899999999 Q ss_pred HHCCCEEEEECCC-HHHHHHHHHHHHHCCCEEEEECCHHHHHH------HHHH----HHHHHHCCEEEECCHHHHHHCC- Q ss_conf 3259708996285-46899999999709981999757456201------1347----9899626689988989997436- Q gi|255764474|r 65 IGQNSRIWHARRN-NEMLLGVMMRDVLRMPLKLVFTSPSQRNH------SRWT----RYLISRMDEVITTSQKSARFIE- 132 (352) Q Consensus 65 ~~~~~~ivh~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~----~~l~~~~d~ii~~s~~~~~~~~- 132 (352) ...++++.|.|.. .............+.|+.+++|...+... ..+. +...+.+|.++++|+..++++. T Consensus 81 ~~~~~d~~~~h~~~~~~~~~~~~~~~~~~~~v~t~Hg~~~~~~~~~~~~~~~~~~~~~~~~~~ad~vi~~S~~~~~~l~~ 160 (363) T cd04955 81 LFVKRDIDHVHALGPAIAPFLPLLRLKGKKVVVNMDGLEWKRAKWGRPAKRYLKFGEKLAVKFADRLIADSPGIKEYLKE 160 (363) T ss_pred HHHCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHH T ss_conf 86089989999778168999999973398399993674011322017899999999999986089999998899999998 Q ss_pred ---CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf ---88399748827778237843268788728898872799951226445123666545653004774058863133321 Q gi|255764474|r 133 ---RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLK 209 (352) Q Consensus 133 ---~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~ 209 (352) .+..+||||+|...+ ......+..+++.+.. .++++||+.++||++.+|+|+.++. +++.|+++|+|+.. T Consensus 161 ~~~~~~~vIpnGvd~~~~---~~~~~~~~~~~~~~~~-~il~vgRl~~~Kg~~~ll~A~~~l~---~~~~l~iiG~g~~~ 233 (363) T cd04955 161 KYGRDSTYIPYGADHVVS---SEEDEILKKYGLEPGR-YYLLVGRIVPENNIDDLIEAFSKSN---SGKKLVIVGNADHN 233 (363) T ss_pred HCCCCCEEECCCCCCCCC---CCCHHHHHHCCCCCCC-EEEEEECCCCCCCHHHHHHHHHHHC---CCCEEEEECCCCCC T ss_conf 649983997898754677---7506679870899898-8999944753047999999998526---35619997777763 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCC-CCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 0001466776443112332212222223--3100000000000123-332222221000001001110278841010023 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPP-LYEGFGLTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS-~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) +. +.+ .+.+.++..++|.|+|.++ ++..+|+.+|++++|| ..||||++++||||||+|||+||+|+.+|++. T Consensus 234 ~~-~~~---~l~~~~~~~~~V~flG~~~~~~~~~~~~~ad~~v~pS~~~Eg~~~~~lEAma~G~PVVas~~~~~~ev~~- 308 (363) T cd04955 234 TP-YGK---LLKEKAAADPRIIFVGPIYDQELLELLRYAALFYLHGHSVGGTNPSLLEAMAYGCPVLASDNPFNREVLG- 308 (363) T ss_pred CH-HHH---HHHHHHCCCCCEEEECCCCCHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHCCCEEECCCCCCCEEEC- T ss_conf 08-999---9999734699379707888477898631354464345666787689999998199999917998706975- Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 287 ENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 287 ~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) + +|++++ |+++++++|.++++|++++.+|+++|++++.++|||++++++++++|+ T Consensus 309 -~--~~~~~~--~~~~la~~i~~ll~d~~~~~~~g~~ar~~v~~~fsw~~~a~~~~~lY~ 363 (363) T cd04955 309 -D--KAIYFK--VGDDLASLLEELEADPEEVSAMAKAARERIREKYTWEKIADQYEELYK 363 (363) T ss_pred -C--CEEECC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf -8--847779--989999999999759999999999999999985899999999999739 No 31 >cd03792 GT1_Trehalose_phosphorylase Trehalose phosphorylase (TP) reversibly catalyzes trehalose synthesis and degradation from alpha-glucose-1-phosphate (alpha-Glc-1-P) and glucose. The catalyzing activity includes the phosphorolysis of trehalose, which produce alpha-Glc-1-P and glucose, and the subsequent synthesis of trehalose. This family is most closely related to the GT1 family of glycosyltransferases. Probab=100.00 E-value=1.5e-44 Score=318.88 Aligned_cols=336 Identities=16% Similarity=0.162 Sum_probs=249.0 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC---------------CCCCCHH---HHHHHH-----HHH Q ss_conf 97078787877077888899999997198599980588665---------------6234789---989988-----754 Q gi|255764474|r 8 VIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN---------------IPSIGIS---SLLTCW-----KKP 64 (352) Q Consensus 8 ~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~---~~~~~~-----~~~ 64 (352) |++-|-...++||...+..+++.++.+|+.+.+.-..-... -..+... .+.... +.. T Consensus 2 VlhvnsT~~gGGVa~~l~~Lv~~~~~lGv~~~w~V~~~~~~ff~~tk~~hn~Lqg~~~~ls~~~~~~y~~~~~~na~~~~ 81 (372) T cd03792 2 VLHVNSTPYGGGVAEILHSLVPLMRDLGVDTRWEVIKGDPEFFNVTKKFHNALQGADIELSEEEKEIYLEWNEENAERPL 81 (372) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHCCHHHCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 79997899887699999999999996698169999459835789887500654199976798899999999999873131 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHH---HHHHHHHHHHHHHHCCEEEECCHH-HHHHCCCCEEEECC Q ss_conf 32597089962854689999999970998199975745---620113479899626689988989-99743688399748 Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPS---QRNHSRWTRYLISRMDEVITTSQK-SARFIERPSTVIMH 140 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~d~ii~~s~~-~~~~~~~~~~vi~~ 140 (352) .....|++|.|.+.++....+. ...+.|+....|... ++....+.+..++.+|.++..++. ....+..+..+||+ T Consensus 82 ~~~~~DvV~iHdpqp~~l~~~~-~~~~~~~I~r~Hid~~~~~~~~w~fl~~~i~~~d~~V~~~~~~~~~~~~~~~~~ip~ 160 (372) T cd03792 82 LDLDADVVVIHDPQPLALPLFK-KKRGRPWIWRCHIDLSSPNRRVWDFLQPYIEDYDAAVFHLPEYVPPQVPPRKVIIPP 160 (372) T ss_pred CCCCCCEEEECCCCHHHHHHHH-HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCEEEEECHHHHCCCCCCCEEEECC T ss_conf 0279998998793667899986-368995899968866885388999999999857999997357504368876478167 Q ss_pred CCCHH-HHC---CCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHH Q ss_conf 82777-823---78432687887288988727999512264451236665456530047740588631333210001466 Q gi|255764474|r 141 GVDTE-RFR---PTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKN 216 (352) Q Consensus 141 gid~~-~~~---~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~ 216 (352) +||+- ..+ +..+....++++++++++++|+++||++++||++.+|+|+++++++.|+++|+++|+|+.++.+...- T Consensus 161 ~IDpl~~kn~~l~~~~~~~~~~~~gi~~d~piIl~VgRl~~~Kg~~~li~A~~~~~~~~~d~~LvivG~g~~ddpe~~~~ 240 (372) T cd03792 161 SIDPLSGKNRELSPADIEYILEKYGIDPERPYITQVSRFDPWKDPFGVIDAYRKVKERVPDPQLVLVGSGATDDPEGWIV 240 (372) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHH T ss_conf 10667743455898999999998298989958999872565468699999999999768997899989987778147899 Q ss_pred HHHHHHHCCCCCCCCCCC--CCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEE Q ss_conf 776443112332212222--223--3100000000000123332222221000001001110278841010023488424 Q gi|255764474|r 217 LQRRIYANGLKKRILFID--EQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAG 292 (352) Q Consensus 217 l~~~i~~~~l~~~V~~~g--~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G 292 (352) +++..+..+..+.|++++ ..+ ++..+|+.||++++||.+||||++++|||+||+|||+|++||++|.|. +|++| T Consensus 241 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~adv~v~~S~~Egfgl~~lEAm~~G~PVVas~vgGi~e~v~--dg~~G 318 (372) T cd03792 241 YEEVLEYAEGDPDIHVLTLPPVSDLEVNALQRASTVVLQKSIREGFGLTVTEALWKGKPVIAGPVGGIPLQIE--DGETG 318 (372) T ss_pred HHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHCCCCEEECCCCCCHHHCC--CCCCE T ss_conf 9999997188996699936888678999999539799957642344469999998699899837998377604--89857 Q ss_pred EEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 99659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 293 VIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 293 ~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) ++++ +++++|++|.++++|++++.+||++|++++.++|++++++++|..+|.++ T Consensus 319 ~Lv~--~~d~~A~~i~~ll~d~~l~~~mg~~ar~~v~~~f~~~~~~~~yl~L~~~l 372 (372) T cd03792 319 FLVD--TVEEAAVRILYLLRDPELRRKMGANAREHVRENFLITRHLKDYLYLISKL 372 (372) T ss_pred EECC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 9889--86999999999974999999999999999998789999999999999839 No 32 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=100.00 E-value=2.9e-43 Score=310.34 Aligned_cols=307 Identities=19% Similarity=0.216 Sum_probs=229.3 Q ss_pred HEEEECCCCC----CCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-----------C-------CCCCHHHHHHHHH Q ss_conf 4589707878----7877077888899999997198599980588665-----------6-------2347899899887 Q gi|255764474|r 5 NIDVIAPNMK----FRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-----------I-------PSIGISSLLTCWK 62 (352) Q Consensus 5 ~i~~i~~~~~----~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-----------~-------~~~~~~~~~~~~~ 62 (352) ||.+|.+.+. +..+|++.++.++++.+.+.||++.+++...... . ..........+.+ T Consensus 2 kI~~v~~~~~p~pP~~~GG~e~~~~~La~~L~~~Gh~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (335) T cd03802 2 RIALVAPPREPVPPPAYGGTERVVAALTEGLVARGHEVTLFASGDSKTAAPLVPVVPEPLRLDAPGRDRAEAEALALAER 81 (335) T ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 86998884003699998979999999999999769989999628987788500456766544544221245689999999 Q ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC--CCEEEECC Q ss_conf 5432597089962854689999999970998199975745620113479899626689988989997436--88399748 Q gi|255764474|r 63 KPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE--RPSTVIMH 140 (352) Q Consensus 63 ~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~--~~~~vi~~ 140 (352) .....++|++|+|...... ......+.|+.++.|....... ............++++|+..+.... .+..+||| T Consensus 82 ~~~~~~~Dvvh~~~~~~~~---~~~~~~~~p~v~t~H~~~~~~~-~~~~~~~~~~~~~i~vS~~~~~~~~~~~~~~vI~n 157 (335) T cd03802 82 ALAAGDFDIVHNHSLHLPL---PFARPLPVPVVTTLHGPPDPEL-LKLYYAARPDVPFVSISDAQRRPWPPLPWVATVHN 157 (335) T ss_pred HHHHCCCCEEEECCCHHHH---HHHHHCCCCEEEEECCCCCHHH-HHHHHHHCCCCEEEECCHHHHHHCCCCCCEEEECC T ss_conf 9975798589989717899---9986279978999899970677-99997523586899945999954577677799879 Q ss_pred CCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHH Q ss_conf 82777823784326878872889887279995122644512366654565300477405886313332100014667764 Q gi|255764474|r 141 GVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRR 220 (352) Q Consensus 141 gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~ 220 (352) |||.+.|.+... ....++|+||+.++||++.+|+|+.. ++++++++|+++... +..+.. T Consensus 158 gvd~~~f~~~~~------------~~~~~l~vGRl~~~KG~~~li~a~~~-----~~~~L~i~G~~~~~~----~~~~~~ 216 (335) T cd03802 158 GIDLDDYPFRGP------------KGDYLLFLGRISPEKGPHLAIRAARR-----AGIPLKLAGPVSDPD----YFYREI 216 (335) T ss_pred CCCHHHCCCCCC------------CCCEEEEEEECCCCCCHHHHHHHHHH-----CCCEEEEEECCCCHH----HHHHHH T ss_conf 988876798889------------99789999933633476999999874-----598089994767379----999999 Q ss_pred HHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCC-CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECC Q ss_conf 43112332212222223--310000000000012333-222222100000100111027884101002348842499659 Q gi|255764474|r 221 IYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLY-EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPP 297 (352) Q Consensus 221 i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~-Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~ 297 (352) ...+++.++|+|+|.++ ++..+|+.||++++||.+ ||||++++||||||+|||+|++||++|+|. +|.||++++ T Consensus 217 ~~~~~~~~~V~f~G~v~~~~~~~~~~~a~~~v~pS~~~E~fglv~lEAma~G~PVVat~~gG~~E~v~--~g~~G~lv~- 293 (335) T cd03802 217 APELLDGPDIEYLGEVGGAEKAELLGNARALLFPILWEEPFGLVMIEAMACGTPVIAFRRGAVPEVVE--DGVTGFLVD- 293 (335) T ss_pred HHHCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCEEECCCCCHHHHHC--CCCEEEECC- T ss_conf 99617899589950468399999999741245676532467479999998499899928998145423--896189819- Q ss_pred CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 9989999999999869899999999999999982998999999999999 Q gi|255764474|r 298 RNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 298 ~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) |++++|++|.++...+ .+++|++++++|||++++++++++|+ T Consensus 294 -~~~~la~ai~~~~~~~------~~~~r~~~~~~fs~~~~~~~~~~~Yr 335 (335) T cd03802 294 -SVEELAAAVARADRLD------RAACRRRAERRFSAARMVDDYLALYR 335 (335) T ss_pred -CHHHHHHHHHHHHHCC------HHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf -9999999999875289------99999999986899999999999719 No 33 >cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II). Probab=100.00 E-value=6.1e-43 Score=308.17 Aligned_cols=327 Identities=17% Similarity=0.190 Sum_probs=236.5 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCC----C--------------CCHHH---------- Q ss_conf 4589707878787707788889999999719859998058866562----3--------------47899---------- Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIP----S--------------IGISS---------- 56 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~----~--------------~~~~~---------- 56 (352) +|..|+.||.- +...+++.+.+.||++.+++...+...+ . ..... T Consensus 1 ~~~~~~~~~p~-------q~r~LA~~La~rGHeV~Vit~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (396) T cd03818 1 RILFVHQNFPG-------QFRHLAPALAAQGHEVVFLTEPNAAPPPGGVRVVRYRPPRGPTSGTHPYLREFEEAVLRGQA 73 (396) T ss_pred CEEEECCCCCH-------HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 98998899881-------69999999997899899996899988999726888533688877777323447888887899 Q ss_pred HHHHHHH--HHHCCCEEEEECCCHHHHHHHHHHHHH-CCCEEEEE----CCHHHHH------HHHH------------HH Q ss_conf 8998875--432597089962854689999999970-99819997----5745620------1134------------79 Q gi|255764474|r 57 LLTCWKK--PIGQNSRIWHARRNNEMLLGVMMRDVL-RMPLKLVF----TSPSQRN------HSRW------------TR 111 (352) Q Consensus 57 ~~~~~~~--~~~~~~~ivh~~~~~~~~~~~~~~~~~-~~~~~~~~----~~~~~~~------~~~~------------~~ 111 (352) ....... ....++|++|+|.... .+++++..+ ..+....+ +.+.... ...+ .. T Consensus 74 ~~~~~~~~~~~~~~PDvVh~H~~~~--~~l~l~~~~p~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (396) T cd03818 74 VARALLALRAKGFRPDVIVAHPGWG--ETLFLKDVWPDAPLIGYFEFYYRAEGADVGFDPEFPPSLDDALRLRNRNALIL 151 (396) T ss_pred HHHHHHHHHHCCCCCCEEEECCCHH--HHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9999999997189998899887058--99999985656878876412410365446778333655668999999888888 Q ss_pred HHHHHCCEEEECCHHHHHHC----CCCEEEECCCCCHHHHCCCCCHH-HHHHHCCCCCCCCEEEEEEE-ECHHHHHHHHH Q ss_conf 89962668998898999743----68839974882777823784326-87887288988727999512-26445123666 Q gi|255764474|r 112 YLISRMDEVITTSQKSARFI----ERPSTVIMHGVDTERFRPTSNKQ-EARRHLKISEDAKLIGCFGR-IRKLKGTDLFV 185 (352) Q Consensus 112 ~l~~~~d~ii~~s~~~~~~~----~~~~~vi~~gid~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~G~-~~~~Kg~~~li 185 (352) .....+|.+++.|+..+..+ ..++.|||||||++.|.|.+... ..+....++.+.++|+|+|| +.++||++.+| T Consensus 152 ~~~~~ad~~v~~s~~~~~~~~~~~~~~i~VipnGVD~~~f~P~~~a~~~~~~~~~~~~~~~vvl~vGR~l~~~KG~~~Ll 231 (396) T cd03818 152 LALAQADAGVSPTRWQRSTFPAELRSRISVIHDGIDTDRLRPDPQARLRLPNGRVLTPGDEVITFVARNLEPYRGFHVFM 231 (396) T ss_pred HHHHHCCEEECCCHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCHHHHH T ss_conf 88984878871889999752676248379982687713338880455544211468999869999776513044899999 Q ss_pred HHHHHHHCCCCCEEEEEECCC-----CCCC--CCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCC Q ss_conf 545653004774058863133-----3210--00146677644311233221222222--33100000000000123332 Q gi|255764474|r 186 DCMINILPHHPGWTAVVVGKT-----TLKH--YLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYE 256 (352) Q Consensus 186 ~a~~~l~~~~~~~~l~i~G~g-----~~~~--~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~E 256 (352) +|++.+.++.|+++|+|+|++ .... ..+.+.+.+.....+..++|+|+|.+ +++..+|++||+|++||..| T Consensus 232 ~A~~~l~~~~p~~~lvivG~~~~~~g~~~~~~~~~~~~ll~~l~~~~~~~rV~F~G~v~~~~l~~~l~~adv~v~PS~~~ 311 (396) T cd03818 232 RALPRLLRARPDARVVIVGGDGVSYGAPPPDGESWKQHMLDELGGRLDLSRVHFLGRVPYDQYLALLQVSDVHVYLTYPF 311 (396) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHEEEEEEECCEEC T ss_conf 99999998789968999926874456667654379999998632236763689708985899988751003999531404 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 22222100000100111027884101002348842499659998999999999986989999999999999998299899 Q gi|255764474|r 257 GFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVK 336 (352) Q Consensus 257 g~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~ 336 (352) ++|++++||||||+|||+|++||++|+|. +|.+|++++++|+++||++|.++++||+++++|+++||+.+++.|.|+. T Consensus 312 ~~~~~llEAMA~G~PVVas~~gg~~e~V~--dg~~G~Lvpp~d~~~LA~ai~~lL~dp~~r~~lg~aaR~~~~~~~~~~~ 389 (396) T cd03818 312 VLSWSLLEAMACGCLVVGSDTAPVREVIT--DGENGLLVDFFDPDALAAAVIELLDDPARRARLRRAARRTALRYDLLSV 389 (396) T ss_pred CCCCHHHHHHHCCCCEEEECCCCCHHHHC--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 55760899997799899927998265525--9987899699999999999999975999999999999999999844532 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|255764474|r 337 EASDIG 342 (352) Q Consensus 337 ~a~~~~ 342 (352) ++..+. T Consensus 390 ~~~~~~ 395 (396) T cd03818 390 CLPRQL 395 (396) T ss_pred CHHHHH T ss_conf 064753 No 34 >PRK09922 UDP-D-galactose:(glucosyl)lipopolysaccharide-1,6-D-galactosyltransferase; Provisional Probab=100.00 E-value=4.7e-42 Score=302.16 Aligned_cols=322 Identities=14% Similarity=0.187 Sum_probs=246.5 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC--------------CCCCC------CHHHHHHHHHHH Q ss_conf 4589707878787707788889999999719859998058866--------------56234------789989988754 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK--------------NIPSI------GISSLLTCWKKP 64 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~------~~~~~~~~~~~~ 64 (352) +|..|.+++. ..+|++..+.++++.+.+.++.+.++...... ..+.. .......+.+.. T Consensus 4 KIlfi~~~l~-~~GGaErvl~~La~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i 82 (361) T PRK09922 4 KIAFIGEAVS-GFGGMETVISNVINTFENSKINCEMFFFCRNDKMDKAWLKKIKYAQSFSNIKLSFLRRAKHVYNFSQWL 82 (361) T ss_pred EEEEECCCCC-CCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 9999999999-988049999999999987199879999934985415576447722433665520245789999999999 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHH-HHHHCCEEEECCHHHHHHC------CCCEEE Q ss_conf 325970899628546899999999709981999757456201134798-9962668998898999743------688399 Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRY-LISRMDEVITTSQKSARFI------ERPSTV 137 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~d~ii~~s~~~~~~~------~~~~~v 137 (352) ....+|++|++.......+...+...+.++++..+.|.......+.+. .+..+|.++++|...++.+ ..++.+ T Consensus 83 k~~~~Dii~~~~~~~~~~~~~~~~~~~~~~~ii~~~h~~~~~~~~~~~~~~~~~d~~i~vS~~~~~~~~~~~~~~~ki~v 162 (361) T PRK09922 83 KETSPDIVICIDVISCLYANKARKKSGKQFKIFSWPHFSLDHKKHAECITLVYADYHLAISSGIKEQMIARGISAQRISV 162 (361) T ss_pred HHHCCCEEEECCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHCCEEEEECHHHHHHHHHCCCCHHCEEE T ss_conf 97099999998806899999999982999589997556534267899899985885699578999999970997542999 Q ss_pred ECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEEC--HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHH Q ss_conf 748827778237843268788728898872799951226--445123666545653004774058863133321000146 Q gi|255764474|r 138 IMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIR--KLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKK 215 (352) Q Consensus 138 i~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~--~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~ 215 (352) |+||+|.+.+...... ......++++||+. ++||++.+++|++++ .++|+|+|+|+|+.+ + T Consensus 163 I~N~i~~~~~~~~~~~---------~~~~~~il~vGRl~~~~qK~~~~li~a~~~~---~~~~~L~IvG~G~~~-----~ 225 (361) T PRK09922 163 IYNPVEIKTIIIPPPE---------RDKPAVFLYVGRLKFEGQKNVKELFDGLSRT---TGEWQLHIIGDGSDF-----E 225 (361) T ss_pred EECCCCHHHCCCCCHH---------HCCCCEEEEEECCCCCCCCCHHHHHHHHHHH---CCCCEEEEEEEECCH-----H T ss_conf 9599173540467503---------1578779999544452568999999999854---899489999843889-----9 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCC----CCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCC-CCCCHHCCCCCCC Q ss_conf 67764431123322122222233----100000000000123332222221000001001110278-8410100234884 Q gi|255764474|r 216 NLQRRIYANGLKKRILFIDEQSS----IEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNT-GVFSELLDPENAK 290 (352) Q Consensus 216 ~l~~~i~~~~l~~~V~~~g~~~~----~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~-~~~~e~i~~~~g~ 290 (352) +|++++++++++++|.|+|++++ +..+|..||+|++||++||||++++||||||+|||+||+ +|.+|+|. +|. T Consensus 226 ~L~~~i~~l~l~~~V~flG~~~np~~~l~~~~~~adifVl~S~~EGfp~vllEAma~G~PvIatd~~~G~~EiI~--dg~ 303 (361) T PRK09922 226 KCKAYSRELGIEQRIIWHGWQSNPWEVVQQKIKNVSALLLTSKFEGFPMTLLEAMSYGIPCISSDCMSGPRDIIK--PGL 303 (361) T ss_pred HHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHCCEEEECCCCCCCCCHHHHHHHHCCCEEEECCCCCCHHHHC--CCC T ss_conf 999999983898738990675987999999985134999647556887289999995998999759999088715--898 Q ss_pred EEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 2499659998999999999986989999999999999998299899999999999998 Q gi|255764474|r 291 AGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 291 ~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) +|++++++|+++++++|.++++|+..... .+++..+++-|+...+++-+.++|-.+ T Consensus 304 nG~Lv~~~d~~~la~~i~~li~~e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (361) T PRK09922 304 NGELYTPGNIDEFVGKLNKVISGEVKYQH--DIIPGSIEKFYDVLYFKNLNNALFSKL 359 (361) T ss_pred CEEEECCCCHHHHHHHHHHHHHCHHHCCH--HHHHHHHHHHCHHHHHHHHHHHHHHHH T ss_conf 37997799999999999999848221399--999999986468999999999999984 No 35 >cd03791 GT1_Glycogen_synthase_DULL1_like This family is most closely related to the GT1 family of glycosyltransferases. Glycogen synthase catalyzes the formation and elongation of the alpha-1,4-glucose backbone using ADP-glucose, the second and key step of glycogen biosynthesis. This family includes starch synthases of plants, such as DULL1 in Zea mays and glycogen synthases of various organisms. Probab=100.00 E-value=2.5e-41 Score=297.30 Aligned_cols=332 Identities=20% Similarity=0.287 Sum_probs=239.6 Q ss_pred HEEEECCCCCC--CCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC---------------------------------- Q ss_conf 45897078787--877077888899999997198599980588665---------------------------------- Q gi|255764474|r 5 NIDVIAPNMKF--RHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN---------------------------------- 48 (352) Q Consensus 5 ~i~~i~~~~~~--~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------- 48 (352) +|..|++.+.+ +-+|+...+..+.+.+.+.||.+.++.|.++.. T Consensus 1 kIl~vs~E~~P~~kvGGLg~vv~~L~~aLa~~Gh~V~Vi~P~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 80 (476) T cd03791 1 KVLFVASEVAPFAKTGGLGDVVGALPKALAKLGHDVRVIMPKYGRILDELRGQLLVLRLFGVPVGGRPEYVGVFELPVDG 80 (476) T ss_pred CEEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHCCCEEEEEEEEECCCCCEEEEEEEEEECCC T ss_conf 97999863486656267999999999999977996999957986610341356346678655147840699999985499 Q ss_pred CCCC--C---------------------H-------HHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHH------CC Q ss_conf 6234--7---------------------8-------998998875432597089962854689999999970------99 Q gi|255764474|r 49 IPSI--G---------------------I-------SSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVL------RM 92 (352) Q Consensus 49 ~~~~--~---------------------~-------~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~------~~ 92 (352) ++.+ . . .......+ .....+||+|+|.|...+.+..++... +. T Consensus 81 v~~~~l~~~~~~~~~~~~~~~~~~~~d~~~rf~~f~~a~l~~~~-~~~~~pDVIH~HDW~tal~~~~lk~~~~~~~~~~~ 159 (476) T cd03791 81 VPVYFLDNPDYFDRPGLYDDSGYDYEDNAERFALFSRAALELLR-RLGWKPDIIHCHDWHTGLVPALLKEKYADPFFKNI 159 (476) T ss_pred CEEEEECCHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH-HCCCCCCEEEECCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 55999708676167888776578875279999999999999998-63999887997653799999999996530024699 Q ss_pred CEEEEECCHHHHH---H-------HH-----------------HHHHHHHHCCEEEECCHHHHHHC-------------- Q ss_conf 8199975745620---1-------13-----------------47989962668998898999743-------------- Q gi|255764474|r 93 PLKLVFTSPSQRN---H-------SR-----------------WTRYLISRMDEVITTSQKSARFI-------------- 131 (352) Q Consensus 93 ~~~~~~~~~~~~~---~-------~~-----------------~~~~l~~~~d~ii~~s~~~~~~~-------------- 131 (352) |++++.|+-.+.. . .. +.+.-+..+|.|+++|+..++.+ T Consensus 160 p~V~TIHnl~~qG~~~~~~~~~~~~~~~~~~~~~~~~~~~~in~lk~gi~~ad~V~tVS~~ya~Ei~~~~~g~~l~~~l~ 239 (476) T cd03791 160 KTVFTIHNLAYQGVFPLEALEDLGLPWEELFHIDGLEFYGQVNFLKAGIVYADAVTTVSPTYAREILTPEFGEGLDGLLR 239 (476) T ss_pred CEEEEECCCHHCCCCCHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHHHHHHCCEEEEECHHHHHHHCCCCCCCCHHHHHH T ss_conf 88999643010687874358761897066530001002450359999987478779807778877647213666666753 Q ss_pred --CCCEEEECCCCCHHHHCCCCC------------------HHHHHHHCCCC--CCCCEEEEEEEECHHHHHHHHHHHHH Q ss_conf --688399748827778237843------------------26878872889--88727999512264451236665456 Q gi|255764474|r 132 --ERPSTVIMHGVDTERFRPTSN------------------KQEARRHLKIS--EDAKLIGCFGRIRKLKGTDLFVDCMI 189 (352) Q Consensus 132 --~~~~~vi~~gid~~~~~~~~~------------------~~~~~~~~~~~--~~~~~i~~~G~~~~~Kg~~~li~a~~ 189 (352) ..+..+|+||||.+.|.|..+ +..++++++++ ++.++++|+||+.++||++++++|+. T Consensus 240 ~~~~kl~gI~NGiD~~~~~P~~d~~l~~~~~~~~~~~k~~~k~~l~~~~gl~~d~~~pl~~~vgRl~~qKG~dll~~a~~ 319 (476) T cd03791 240 ARAGKLSGILNGIDYDVWNPATDPHLPANYSADDLEGKAENKAALQEELGLPVDPDAPLFGFVGRLTEQKGIDLLLEALP 319 (476) T ss_pred HCCCCEEEEECCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHH T ss_conf 04687699858988643388777531056671135517999999999959897899868999423520248999999999 Q ss_pred HHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCCCCC-CCCCHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 5300477405886313332100014667764431123322122-2222331-0000000000012333222222100000 Q gi|255764474|r 190 NILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF-IDEQSSI-EDWYRALNIFVAPPLYEGFGLTPLEAMA 267 (352) Q Consensus 190 ~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~-~g~~~~~-~~~~~~adi~i~pS~~Eg~gl~~lEAma 267 (352) .+.++ +.+++++|.|+.. +.+.++++...+ .+++.+ .|+.+++ ..+|++||++++||++|+||++.+|||+ T Consensus 320 ~~~~~--~~~~vi~G~G~~~---~e~~~~~l~~~~--~~~~~~~~~~~e~l~~~lya~aD~~l~PS~~EP~Gl~qleAm~ 392 (476) T cd03791 320 ELLEL--GGQLVILGSGDPE---YEEALRELAARY--PGRVAVLIGYDEALAHLIYAGADFFLMPSRFEPCGLTQMYAMR 392 (476) T ss_pred HHHHC--CCEEEEEECCCHH---HHHHHHHHHHHC--CCCEEEEEECCHHHHHHHHHHCCEEECCCCCCCCCHHHHHHHH T ss_conf 99963--9889999469778---999999999768--9959999806878899999849997425545785489999986 Q ss_pred CEEEEEECCCCCCCHHCCCCC----CCEEEEECCCCHHHHHHHHHHHH---CCHHHHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 100111027884101002348----84249965999899999999998---69899999999999999982998999999 Q gi|255764474|r 268 SGIPVVASNTGVFSELLDPEN----AKAGVIVPPRNLHALEKAVLYFM---NSKKIMSDTGNRGRERAVKHFSIVKEASD 340 (352) Q Consensus 268 ~G~PvI~s~~~~~~e~i~~~~----g~~G~~~~~~d~~~la~~i~~l~---~~~~~~~~~~~~a~~~~~~~fs~~~~a~~ 340 (352) ||+|+|++++||++|+|...+ ..||+++++.|+++++++|.+++ .||+.+.++.++|. +.+|||++.|++ T Consensus 393 ~GtppIa~~tGGL~dtV~d~~~~~~~~tGf~f~~~~~~~l~~ai~~al~~~~~~~~~~~l~~~am---~~~fsW~~~A~~ 469 (476) T cd03791 393 YGTVPIVRATGGLADTVIDYNEDTGEGTGFVFEGYNADALLAALRRALALYRDPEAWRKLQRNAM---AQDFSWDRSAKE 469 (476) T ss_pred CCCCEEECCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH---HCCCCCHHHHHH T ss_conf 69975980699986510036667777745886799999999999999998579999999999886---428992999999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|255764474|r 341 IGKVYDR 347 (352) Q Consensus 341 ~~~iY~~ 347 (352) |+++|++ T Consensus 470 y~~~Y~~ 476 (476) T cd03791 470 YLELYRS 476 (476) T ss_pred HHHHHCC T ss_conf 9998549 No 36 >PRK00654 glgA glycogen synthase; Provisional Probab=100.00 E-value=2.6e-41 Score=297.12 Aligned_cols=336 Identities=22% Similarity=0.280 Sum_probs=239.3 Q ss_pred HEEEECCCCCC--CCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC---------------------------------C Q ss_conf 45897078787--877077888899999997198599980588665---------------------------------6 Q gi|255764474|r 5 NIDVIAPNMKF--RHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN---------------------------------I 49 (352) Q Consensus 5 ~i~~i~~~~~~--~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------~ 49 (352) +|..|++-+.+ +-+|+...+..+.+.+.+.|+++.++.|.++.. + T Consensus 2 kIl~vs~E~~P~~KvGGLgdvv~~L~~aL~~~G~~V~Vi~P~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv 81 (476) T PRK00654 2 KILFVASECAPLIKTGGLGDVVGALPKALAKLGHDVRVLLPGYPAIRGGLKDLQVVSVPLGWRDQFAGHVALLYGPYDGV 81 (476) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCCC T ss_conf 79999874387557574999999999999976995999947983344506664698886144664305889864125896 Q ss_pred CC--CC---------------------------H-HHHHH-HHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 23--47---------------------------8-99899-887543259708996285468999999997099819997 Q gi|255764474|r 50 PS--IG---------------------------I-SSLLT-CWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVF 98 (352) Q Consensus 50 ~~--~~---------------------------~-~~~~~-~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (352) +. +. + ...+. +........+||+|+|.|...+....++......+++++ T Consensus 82 ~~~~i~~~~~f~r~~~~~~~~~~y~Dn~~Rfa~f~~a~le~~~~l~~~~~pDIIH~HDW~t~lvp~~lk~~~~~~~~~V~ 161 (476) T PRK00654 82 PVYLIDAPHLFDRPGSPDESGYGYEDNGERFAFFSWAAAEFAEGLDPGWRPDIVHAHDWHTGLVPALLKERGRPPIKSVF 161 (476) T ss_pred EEEEECCHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCCEEE T ss_conf 69997388872788887655778763699999999999999975144788886785464778999999974378997799 Q ss_pred CCHHHHH-------H-----HHH----------------HHHHHHHCCEEEECCHHHHHHC----------------CCC Q ss_conf 5745620-------1-----134----------------7989962668998898999743----------------688 Q gi|255764474|r 99 TSPSQRN-------H-----SRW----------------TRYLISRMDEVITTSQKSARFI----------------ERP 134 (352) Q Consensus 99 ~~~~~~~-------~-----~~~----------------~~~l~~~~d~ii~~s~~~~~~~----------------~~~ 134 (352) |-|.... . ... .+.-+..+|.|+++|+..++.+ ..+ T Consensus 162 TIHnl~yQG~~~~~~~~~~gl~~~~~~~~~~e~~~~~n~lk~gi~~aD~v~tVS~~Ya~Ei~~~~~g~gL~~~l~~~~~k 241 (476) T PRK00654 162 TIHNLAYQGLFPAEHLGALGLPDERFHLEGLEFYGQISFLKAGLYYADRVTTVSPTYAREITTPEFGYGLEGLLRARQGK 241 (476) T ss_pred EECCCCCCCCCCHHHHHHCCCCHHHHCCCHHHHCCCHHHHHHHHHHCCEEEECCHHHHHHHCCCCCCCCHHHHHHHCCCC T ss_conf 97342016766887896659956762430001045027888988757874873788887661633466667887635666 Q ss_pred EEEECCCCCHHHHCCCCC------------------HHHHHHHCCCCCC--CCEEEEEEEECHHHHHHHHHHHHHHHHCC Q ss_conf 399748827778237843------------------2687887288988--72799951226445123666545653004 Q gi|255764474|r 135 STVIMHGVDTERFRPTSN------------------KQEARRHLKISED--AKLIGCFGRIRKLKGTDLFVDCMINILPH 194 (352) Q Consensus 135 ~~vi~~gid~~~~~~~~~------------------~~~~~~~~~~~~~--~~~i~~~G~~~~~Kg~~~li~a~~~l~~~ 194 (352) ...|+||||.+.|+|..+ +..+++.++++.+ .++++++||+..+||++.+++|+.++.++ T Consensus 242 l~gI~NGiD~~~wnP~tD~~l~~~y~~~~~~~K~~~k~~l~~~~gl~~~~~~pl~~~vgRl~~qKG~~ll~~a~~~~~~~ 321 (476) T PRK00654 242 LSGILNGIDYDIWNPETDPYLPANYSADDLEGKAENKRALQERFGLPVDDDAPLFAMVSRLTEQKGLDLVLEALDEILEQ 321 (476) T ss_pred EEEECCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 66844887710039987865116668100331698999999994989789974899985164567889999999999970 Q ss_pred CCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCCCCCC-CCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 77405886313332100014667764431123322122-22223310-00000000001233322222210000010011 Q gi|255764474|r 195 HPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF-IDEQSSIE-DWYRALNIFVAPPLYEGFGLTPLEAMASGIPV 272 (352) Q Consensus 195 ~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~-~g~~~~~~-~~~~~adi~i~pS~~Eg~gl~~lEAma~G~Pv 272 (352) +.+++++|.|+..- .+.++++..++ .+++.+ .|+-+.+. .+|++||++++||++|+||++.+|||++|+|+ T Consensus 322 --~~~~vi~G~G~~~~---~~~l~~l~~~~--~~~~~~~~gf~e~l~~~iya~aD~~lmPS~~EP~Gl~qleAm~~Gt~P 394 (476) T PRK00654 322 --GGQLVLLGTGDPEL---EEAFRALAARY--PGRVGVQIGYDEALAHRIYAGADFFLMPSRFEPCGLTQLYALRYGTLP 394 (476) T ss_pred --CCEEEEEECCCHHH---HHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHHCCEEECCCCCCCCCHHHHHHHHCCCCC T ss_conf --99899994597899---99999999877--988899957886898999872887864561136776899998769985 Q ss_pred EECCCCCCCHHCCC----CCCCEEEEECCCCHHHHHHHHHH---HHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 10278841010023----48842499659998999999999---986989999999999999998299899999999999 Q gi|255764474|r 273 VASNTGVFSELLDP----ENAKAGVIVPPRNLHALEKAVLY---FMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVY 345 (352) Q Consensus 273 I~s~~~~~~e~i~~----~~g~~G~~~~~~d~~~la~~i~~---l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY 345 (352) |++.+||++|+|.. .++.||+++++.|+++++.+|.+ +..+|+.+.++.++|. .++|||++.|++|+++| T Consensus 395 vv~~tGGL~dtV~d~~~~~~~~tGf~f~~~~~~~l~~ai~~al~~~~~~~~~~~l~~~~m---~~~fsW~~~A~~y~~lY 471 (476) T PRK00654 395 IVRRTGGLADTVIDYDPEDGGATGFVFDDFNAEDLLTALRRALELYRQPDLWRALQRQAM---AQDFSWDKSAEEYLELY 471 (476) T ss_pred EEECCCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH---HCCCCCHHHHHHHHHHH T ss_conf 881799975514566667787634873799999999999999988569999999999985---23799599999999999 Q ss_pred HHHHH Q ss_conf 99886 Q gi|255764474|r 346 DRLLR 350 (352) Q Consensus 346 ~~~l~ 350 (352) +++|+ T Consensus 472 ~~llg 476 (476) T PRK00654 472 RRLLG 476 (476) T ss_pred HHHHC T ss_conf 99739 No 37 >cd04946 GT1_AmsK_like This family is most closely related to the GT1 family of glycosyltransferases. AmsK is involved in the biosynthesis of amylovoran, which functions as a virulence factor. It functions as a glycosyl transferase which transfers galactose from UDP-galactose to a lipid-linked amylovoran-subunit precursor. The members of this family are found mainly in bacteria and Archaea. Probab=100.00 E-value=1.8e-40 Score=291.53 Aligned_cols=316 Identities=17% Similarity=0.243 Sum_probs=236.1 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCC----------------CCC-CH----HH-------- Q ss_conf 58970787878770778888999999971985999805886656----------------234-78----99-------- Q gi|255764474|r 6 IDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNI----------------PSI-GI----SS-------- 56 (352) Q Consensus 6 i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~-~~----~~-------- 56 (352) |.+++.++++ ..| +..+.+.+..+.+.|+.+.++....+... +.. .. .. T Consensus 2 Ll~lT~~yP~-~~s-ETFI~~EI~~L~~~G~~V~I~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 79 (407) T cd04946 2 LILLTNTFPG-AKG-ESFIEPEIKYLSKSFDKIIILPTNVGKEREKVRPNGVSNIIISNYRQDKSRAKLIFLALSVFSLP 79 (407) T ss_pred EEEEECCCCC-CCC-CCHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999387999-998-87799999999967997999978888644436775402322200011022677899878765357 Q ss_pred -HHHHH-----------------------------------HHHHHCCCEEEEECCCHHHH-HHHHHHHHHCCCEEEEEC Q ss_conf -89988-----------------------------------75432597089962854689-999999970998199975 Q gi|255764474|r 57 -LLTCW-----------------------------------KKPIGQNSRIWHARRNNEML-LGVMMRDVLRMPLKLVFT 99 (352) Q Consensus 57 -~~~~~-----------------------------------~~~~~~~~~ivh~~~~~~~~-~~~~~~~~~~~~~~~~~~ 99 (352) +...| ........+++|+||..... .+..++.....+ +++++ T Consensus 80 f~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ii~ahw~~~~~~~~all~~~~~~~-~~~~t 158 (407) T cd04946 80 FYKELLKKLKRRRKNIKYFLLLLYFIKRSILLKLKYLHLLIYNSIDGQGTVFYSYWLHETAYALALLKKEYLRK-RVISR 158 (407) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCHHHHHHHHHHHCCCC-CEEEE T ss_conf 89999973311364145799999999999998889999877402466666799874473389999999863898-29997 Q ss_pred CHHHH--------HHHHHHHHHHHHCCEEEECCHHHHHHCC-------CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCC Q ss_conf 74562--------0113479899626689988989997436-------88399748827778237843268788728898 Q gi|255764474|r 100 SPSQR--------NHSRWTRYLISRMDEVITTSQKSARFIE-------RPSTVIMHGVDTERFRPTSNKQEARRHLKISE 164 (352) Q Consensus 100 ~~~~~--------~~~~~~~~l~~~~d~ii~~s~~~~~~~~-------~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~ 164 (352) .|.+- ......+++++++|.++++|+....++. .++.++++|+|...|...+. +. T Consensus 159 aHg~Diy~~~~~~~~~~~~~~~~~~~d~v~~vS~~~~~~l~~~~~~~~~ki~v~~~Gv~~~~~~~~~~----------~~ 228 (407) T cd04946 159 AHGYDLYEDRYPSGYIPLRRYLLSSLDAVFPCSEQGRNYLQKRYPAYKEKIKVSYLGVSDPGIISKPS----------KD 228 (407) T ss_pred EECHHHCCCCCCCCCHHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCC----------CC T ss_conf 30303033302353047899998526879997778899998736997575899968967433467888----------78 Q ss_pred CCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEE--EECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCC Q ss_conf 8727999512264451236665456530047740588--63133321000146677644311233221222222--3310 Q gi|255764474|r 165 DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAV--VVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIE 240 (352) Q Consensus 165 ~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~--i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~ 240 (352) +...|+++||+.+.||++.+|+|++.+.+++|++++. ++|+|+.+ ++|++++++++++++|.|+|.+ +++. T Consensus 229 ~~~~i~svgrlv~~Kg~~~li~A~~~l~~~~~~~~~~~~iiG~G~~~-----~~l~~~~~~l~l~~~v~f~G~~~~~~v~ 303 (407) T cd04946 229 DTLRIVSCSYLVPVKRVDLIIKALAALAKARPSIKIKWTHIGGGPLE-----DTLKELAESKPENISVNFTGELSNSEVY 303 (407) T ss_pred CCEEEEEEECCCHHCCHHHHHHHHHHHHHHCCCCEEEEEEEECCHHH-----HHHHHHHHHCCCCCEEEEECCCCHHHHH T ss_conf 96599996177300388999999999986589938999998268128-----9999999977998879992789969999 Q ss_pred CCCH--HHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCC-CHHHHHHHHHHHHCCHHHH Q ss_conf 0000--000000123332222221000001001110278841010023488424996599-9899999999998698999 Q gi|255764474|r 241 DWYR--ALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPR-NLHALEKAVLYFMNSKKIM 317 (352) Q Consensus 241 ~~~~--~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~-d~~~la~~i~~l~~~~~~~ 317 (352) .+|+ .+|+|+.||..||+|++++||||||+|||+|++||++|+|. +|.+|++++++ ++++++++|.++++|++.+ T Consensus 304 ~~~~~~~~d~~~~~s~~eg~p~~~~Eama~g~pvi~t~~~g~~e~v~--~~~~g~l~~~~~~~~~la~~i~~l~~~~~~~ 381 (407) T cd04946 304 KLYKENPVDVFVNLSESEGLPVSIMEAMSFGIPVIATNVGGTPEIVD--NGGNGLLLSKDPTPNELVSSLSKFIDNEEEY 381 (407) T ss_pred HHHHHCCCCEEEECCCCCCCCHHHHHHHHCCCCEEECCCCCCHHHEE--CCCCEEEECCCCCHHHHHHHHHHHHHCHHHH T ss_conf 99985696199956633464179999997499999847999841230--6982799689999999999999998099999 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 999999999999829989999999 Q gi|255764474|r 318 SDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 318 ~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) .+|+++||+.++++|++++.-.++ T Consensus 382 ~~~~~~ar~~v~~~f~~~~ny~~f 405 (407) T cd04946 382 QTMREKAREKWEENFNASKNYREF 405 (407) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 999999999999867988779997 No 38 >cd04949 GT1_gtfA_like This family is most closely related to the GT1 family of glycosyltransferases and is named after gtfA in Streptococcus gordonii, where it plays a role in the O-linked glycosylation of GspB, a cell surface glycoprotein involved in platelet binding. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltra Probab=100.00 E-value=1e-38 Score=279.75 Aligned_cols=211 Identities=20% Similarity=0.328 Sum_probs=184.3 Q ss_pred HHHCCEEEECCHHHHHHC------CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHH Q ss_conf 962668998898999743------68839974882777823784326878872889887279995122644512366654 Q gi|255764474|r 114 ISRMDEVITTSQKSARFI------ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDC 187 (352) Q Consensus 114 ~~~~d~ii~~s~~~~~~~------~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a 187 (352) ..+.|.+|+.++.-+..+ ..++.+||+|+..+.-.|.+. .......|+++||+.++||++.||+| T Consensus 155 ~~~~d~~Iv~T~~Q~~di~~~f~~~~~i~~IP~~~~~~~~~~~~~---------~~r~~~~ii~vgRL~~eK~~d~LI~A 225 (372) T cd04949 155 LDKVDGVIVATEQQKQDLQKQFGNYNPIYTIPVGSIDPLKLPAQF---------KQRKPHKIITVARLAPEKQLDQLIKA 225 (372) T ss_pred HHHCCEEEECCHHHHHHHHHHHCCCCCEEEECCCCCCHHCCCCCC---------CCCCCCEEEEEECCCCCCCHHHHHHH T ss_conf 431788997879999999997178885899678242031166664---------35898979999677740285999999 Q ss_pred HHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 56530047740588631333210001466776443112332212222223310000000000012333222222100000 Q gi|255764474|r 188 MINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMA 267 (352) Q Consensus 188 ~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma 267 (352) ++++.++.|+++|.|+|+|+++ ++|++++++++++++|.|.|+++++..+|+.+|+|++||.+||||++++|||+ T Consensus 226 ~~~v~~~~P~~~L~I~G~G~~~-----~~L~~~i~~l~l~~~V~f~G~~~~~~~~y~~a~~~v~~S~~EGfgl~llEAma 300 (372) T cd04949 226 FAKVVKQVPDATLDIYGYGDEE-----EKLKELIEELGLEDYVFLKGYTRDLDEVYQKAQLSLLTSQSEGFGLSLMEALS 300 (372) T ss_pred HHHHHHHCCCCEEEEEECCHHH-----HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHCCEEEECCCCCCCCCHHHHHHH T ss_conf 9999987899299999734778-----99999999829998799889988989999757999980200367658999998 Q ss_pred CEEEEEECCCC-CCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 10011102788-410100234884249965999899999999998698999999999999999829989999999 Q gi|255764474|r 268 SGIPVVASNTG-VFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 268 ~G~PvI~s~~~-~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) ||+|||++|++ |.+|+|. +|.+|++++++|+++||++|.+++.|++++.+|+++|++.+ ++||++++++++ T Consensus 301 ~GlPvIa~d~~yG~~eiI~--~g~nG~Lv~~~d~~~la~~i~~ll~~~~~~~~~s~~a~~~a-~~fs~~~i~~~W 372 (372) T cd04949 301 HGLPVISYDVNYGPSEIIE--DGENGYLVPKGDIEALAEAIIELLNDPKLLQKFSEAAYENA-ERYSEENVWEKW 372 (372) T ss_pred CCCCEEEECCCCCCHHHHC--CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHCCHHHHHHHC T ss_conf 5999998059999688845--89847996899999999999999869999999999999999-955989999409 No 39 >KOG1111 consensus Probab=100.00 E-value=1.2e-37 Score=272.54 Aligned_cols=327 Identities=21% Similarity=0.263 Sum_probs=254.8 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCC--------------C----------CHHHHHHH Q ss_conf 45897078787877077888899999997198599980588665623--------------4----------78998998 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPS--------------I----------GISSLLTC 60 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~----------~~~~~~~~ 60 (352) .|.+|.++|-+..+|++.+++.+...+.++||.+.+.++.+++..+- . .+..+..+ T Consensus 2 ~i~mVsdff~P~~ggveshiy~lSq~li~lghkVvvithayg~r~girylt~glkVyylp~~v~~n~tT~ptv~~~~Pll 81 (426) T KOG1111 2 RILMVSDFFYPSTGGVESHIYALSQCLIRLGHKVVVITHAYGNRVGIRYLTNGLKVYYLPAVVGYNQTTFPTVFSDFPLL 81 (426) T ss_pred CCEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEECCCCEEEEEEEEEEECCCCCHHHHCCCHHH T ss_conf 52001642156888711368887511665187699984323671250565477269998537655464120210467012 Q ss_pred HHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHH--------HHHHHHCCEEEECCHHHHHH-- Q ss_conf 87543259708996285468999999997099819997574562011347--------98996266899889899974-- Q gi|255764474|r 61 WKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWT--------RYLISRMDEVITTSQKSARF-- 130 (352) Q Consensus 61 ~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~l~~~~d~ii~~s~~~~~~-- 130 (352) ......+++.|+|.|...+.+..-.+.+..-..++.+|+.|..+.+.... ..-+...|++||+|..+++- T Consensus 82 r~i~lrE~I~ivhghs~fS~lahe~l~hartMGlktVfTdHSlfGfad~~si~~n~ll~~sL~~id~~IcVshtskentv 161 (426) T KOG1111 82 RPILLRERIEIVHGHSPFSYLAHEALMHARTMGLKTVFTDHSLFGFADIGSILTNKLLPLSLANIDRIICVSHTSKENTV 161 (426) T ss_pred HHHHHHHCEEEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHCCEEEEEECCCCCEEEEEECCCCCEE T ss_conf 26765411689965770888999999988745825998524311423312442111213672378807998613788617 Q ss_pred -----CCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCC-CEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf -----368839974882777823784326878872889887-27999512264451236665456530047740588631 Q gi|255764474|r 131 -----IERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDA-KLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVG 204 (352) Q Consensus 131 -----~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G 204 (352) .+.++.+|||.++.+.|.|.+... +.++ ..|+.++|+.++||+|++++.+.++++++|+++++|+| T Consensus 162 lr~~L~p~kvsvIPnAv~~~~f~P~~~~~--------~S~~i~~ivv~sRLvyrKGiDll~~iIp~vc~~~p~vrfii~G 233 (426) T KOG1111 162 LRGALAPAKVSVIPNAVVTHTFTPDAADK--------PSADIITIVVASRLVYRKGIDLLLEIIPSVCDKHPEVRFIIIG 233 (426) T ss_pred EEECCCHHHEEECCCEEECCCCCCCCCCC--------CCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 97214776755335235403346584346--------8887069999741111242678999999997359873699956 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCH Q ss_conf 33321000146677644311233221222222--3310000000000012333222222100000100111027884101 Q gi|255764474|r 205 KTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSE 282 (352) Q Consensus 205 ~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e 282 (352) +||.+. .+++..+++.+.+||.++|.+ +++.+.|...|+|+.||.+|+|+++++|||+||+|||+|++||++| T Consensus 234 DGPk~i-----~lee~lEk~~l~~rV~~lG~v~h~~Vr~vl~~G~IFlntSlTEafc~~ivEAaScGL~VVsTrVGGIpe 308 (426) T KOG1111 234 DGPKRI-----DLEEMLEKLFLQDRVVMLGTVPHDRVRDVLVRGDIFLNTSLTEAFCMVIVEAASCGLPVVSTRVGGIPE 308 (426) T ss_pred CCCCCC-----HHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCEEECCHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 886502-----199999985004805886146617888887638579620788888999999870797799751488665 Q ss_pred HCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 002348842499659998999999999986989999999999999998299899999999999998860 Q gi|255764474|r 283 LLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 283 ~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) ++..+ .- .....+++++++++++.+..-... -+.-.+.+.+.|+|+..|++.+++|.++..+ T Consensus 309 VLP~d---~i-~~~~~~~~dl~~~v~~ai~~~~~~---p~~~h~~v~~~y~w~dVa~rTekvy~r~~~t 370 (426) T KOG1111 309 VLPED---MI-TLGEPGPDDLVGAVEKAITKLRTL---PLEFHDRVKKMYSWKDVAERTEKVYDRAATT 370 (426) T ss_pred CCCCC---CE-ECCCCCHHHHHHHHHHHHHHHCCC---CHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC T ss_conf 48701---02-236898577788999999874158---0457788777613898998889999887631 No 40 >cd03804 GT1_wbaZ_like This family is most closely related to the GT1 family of glycosyltransferases. wbaZ in Salmonella enterica has been shown to possess the mannosyl transferase activity. The members of this family are found in certain bacteria and Archaea. Probab=100.00 E-value=7.2e-36 Score=260.54 Aligned_cols=297 Identities=22% Similarity=0.271 Sum_probs=205.0 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCC-------------CCCC-----CHHHHHH----HHH Q ss_conf 45897078787877077888899999997198599980588665-------------6234-----7899899----887 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKN-------------IPSI-----GISSLLT----CWK 62 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~-----~~~~~~~----~~~ 62 (352) ||.+++|.+. .++|+++.+..+++.+... .+.+..+..... .... .+..++. .+. T Consensus 1 KIAi~Hp~~~-~~GG~ERVv~~la~~~~~~--~i~T~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~l~~~a~~ 77 (351) T cd03804 1 KVAIVHDWLV-NIGGGEKVVEALARLFPDA--DIFTLVDDPDKLPRLLRLKKIRTSFIQKLPFARRRYRKYLPLMPLAIE 77 (351) T ss_pred CEEEECCCCC-CCCCHHHHHHHHHHHCCCC--CEEEEECCCCCCCHHHCCCCCEEECHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 9899948888-9977999999999768999--899995678758565507852464010250345368899989999999 Q ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHH---------HHHH----------------HHHHHHHHHHC Q ss_conf 5432597089962854689999999970998199975745---------6201----------------13479899626 Q gi|255764474|r 63 KPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPS---------QRNH----------------SRWTRYLISRM 117 (352) Q Consensus 63 ~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~----------------~~~~~~l~~~~ 117 (352) ......+|+++.+...... ......+.+.....|.+. +... ..|.++..+++ T Consensus 78 ~~~l~~yDiiI~s~~~~~~---~~~~~~~~~~i~Y~H~P~R~~~d~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~ 154 (351) T cd03804 78 QFDLSGYDLVISSSHAVAK---GVITRPDQLHICYCHTPMRYAWDLYHDYLKESGLGKRLALRLLLHYLRIWDRRSAARV 154 (351) T ss_pred HHCCCCCCEEEECCCHHHH---HHHCCCCCCEEEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 8254689989987841436---5502789858999605516654226888765335778899999999999999887438 Q ss_pred CEEEECCHHHHHH----CCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHC Q ss_conf 6899889899974----368839974882777823784326878872889887279995122644512366654565300 Q gi|255764474|r 118 DEVITTSQKSARF----IERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILP 193 (352) Q Consensus 118 d~ii~~s~~~~~~----~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~ 193 (352) |.++++|+.+++. ...+..||++++|.+.|.+.+. ....++++||+.++||++++|+|+.++ T Consensus 155 d~iianS~~t~~~i~~~~~~~~~Viyppvd~~~~~~~~~------------~~~~~l~vgRl~~~K~~~~~i~a~~~~-- 220 (351) T cd03804 155 DYFIANSRFVARRIKKYYGRDATVIYPPVDTDRFTPAEE------------KEDYYLSVGRLVPYKRIDLAIEAFNKL-- 220 (351) T ss_pred CEEEECCHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCC------------CCCCEEEEECCCHHCCCHHHHHHHHHC-- T ss_conf 899987989999999985899735582983543575766------------677248855150421808999999847-- Q ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 47740588631333210001466776443112332212222223--3100000000000123332222221000001001 Q gi|255764474|r 194 HHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIP 271 (352) Q Consensus 194 ~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~P 271 (352) +.+|+|+|+|+.. ++|++++ .++|.|+|+++ ++..+|+.|+++++||. |+||++++||||||+| T Consensus 221 ---~~~L~i~G~g~~~-----~~l~~~~-----~~~V~f~g~~~~~~~~~~~~~a~~~v~ps~-E~FGi~~vEAma~G~P 286 (351) T cd03804 221 ---GKRLVVIGDGPEL-----DRLRAKA-----GPNVTFLGRVSDEELRDLYARARAFLFPAE-EDFGIVPVEAMASGTP 286 (351) T ss_pred ---CCCEEEEECCHHH-----HHHHHHC-----CCCEEEEECCCHHHHHHHHHHCCEEEECCH-HCCCCHHHHHHHCCCC T ss_conf ---9978999847379-----9999667-----898799803898999999983886995164-2079759999976998 Q ss_pred EEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 1102788410100234884249965999899999999998698999999999999999829989999999 Q gi|255764474|r 272 VVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 272 vI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) ||+++.||.+|+|. +|+||++++++|+++++++|.+++.+++.+.+ .+|+.+ ++|||+...+++ T Consensus 287 vIa~~~gG~~e~v~--~g~tG~l~~~~~~~~la~ai~~~~~~~~~~~~---~~r~~a-~~Fs~~~F~~~~ 350 (351) T cd03804 287 VIAYGKGGALETVI--DGVTGILFEEQTVESLAAAVERFEKNEDFDPQ---AIRAHA-ERFSESRFREKI 350 (351) T ss_pred EEEECCCCCCCEEC--CCCCEEEECCCCHHHHHHHHHHHHHCCCCCHH---HHHHHH-HHCCHHHHHHHH T ss_conf 89828999755015--89978995989999999999999859501599---999999-967999999860 No 41 >TIGR03087 stp1 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Probab=100.00 E-value=8.1e-34 Score=246.73 Aligned_cols=320 Identities=15% Similarity=0.159 Sum_probs=225.9 Q ss_pred EEEECCCCC-CCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC--------------CCCCCC---H------------- Q ss_conf 589707878-787707788889999999719859998058866--------------562347---8------------- Q gi|255764474|r 6 IDVIAPNMK-FRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK--------------NIPSIG---I------------- 54 (352) Q Consensus 6 i~~i~~~~~-~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~---~------------- 54 (352) |.+|.+.+. +..+|....++++++.+.+. |++++++..... .+..+. . T Consensus 1 IL~lt~~~PyP~~~G~~ir~~~llk~L~~~-h~V~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~l~~~ 79 (397) T TIGR03087 1 ILYLVHRIPYPPNKGDKIRSFHLLRHLAAR-HRVHLGTFVDDPEDWQYAAALRPLCEEVCVVPLDPRVARLRSLLGLLTG 79 (397) T ss_pred CEEECCCCCCCCCCCHHHHHHHHHHHHHHC-CCEEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHCCC T ss_conf 979788899798742889999999999828-9389998269866767889998644437999678417777999875248 Q ss_pred ----------HHH-HHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCH-H--HH---------------- Q ss_conf ----------998-99887543259708996285468999999997099819997574-5--62---------------- Q gi|255764474|r 55 ----------SSL-LTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSP-S--QR---------------- 104 (352) Q Consensus 55 ----------~~~-~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~---------------- 104 (352) ..+ ..+........+|+++++... +...+....-+.|..+.++.. + +. T Consensus 80 ~p~~~~~~~s~~~~~~i~~~~~~~~~D~i~~~~~~--~a~yl~~~~~~~p~ild~hdv~s~~~~~~a~~~~~~~~~~~~~ 157 (397) T TIGR03087 80 EPLSLPYYRSRRLARWVNALLAAEPVDAIVVFSSA--MAQYVTPHVRGVPRIVDFVDVDSDKWLQYARTKRWPLRWIYRR 157 (397) T ss_pred CCCCCHHHCCHHHHHHHHHHHHCCCCCEEEEECHH--HHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 98721553499999999999606998689991657--6775344523799899985343489999987345467789999 Q ss_pred ---HHHHHHHHHHHHCCEEEECCHHHHHHCC-------CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEE Q ss_conf ---0113479899626689988989997436-------883997488277782378432687887288988727999512 Q gi|255764474|r 105 ---NHSRWTRYLISRMDEVITTSQKSARFIE-------RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGR 174 (352) Q Consensus 105 ---~~~~~~~~l~~~~d~ii~~s~~~~~~~~-------~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~ 174 (352) ....+.+.+++++|.++++|+..++++. .++.++|||+|++.|.|...... ..+.+.+.++|+|+ T Consensus 158 e~~~l~~~E~~~~~~~d~~~~vS~~d~~~~~~~~~~~~~~i~vipnGvd~~~f~p~~~~~~-----~~~~~~~~i~f~G~ 232 (397) T TIGR03087 158 EGRLLLAYERAIAARFDAATFVSRAEAELFRRLAPEAAGRITAFPNGVDADFFSPDRDYPN-----PYPPGKRVLVFTGA 232 (397) T ss_pred HHHHHHHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCC-----CCCCCCCEEEEEEE T ss_conf 9999999999999966999997799999998747776772776578755123687644457-----66788987999971 Q ss_pred ECHHHHHHHHH----HHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 26445123666----54565300477405886313332100014667764431123322122222233100000000000 Q gi|255764474|r 175 IRKLKGTDLFV----DCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFV 250 (352) Q Consensus 175 ~~~~Kg~~~li----~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i 250 (352) ++...|++-++ +++..+.++.|++++.|+|.++. ++++.+.+ ...|.++|+++|+..+|+.+++++ T Consensus 233 ~~~~pN~da~~~f~~~v~p~l~~~~p~~~~~ivG~~p~------~~~~~l~~----~~~V~~~G~V~d~~~~~~~a~v~v 302 (397) T TIGR03087 233 MDYWPNIDAVVWFAERVFPAVRARRPAAEFYIVGAKPS------PAVRALAA----LPGVTVTGSVADVRPYLAHAAVAV 302 (397) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC------HHHHHHCC----CCCEEEEEECCCHHHHHHCCCEEE T ss_conf 78723099999999999999998789987999908962------99998517----999799765498699996198999 Q ss_pred CCCCC-CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 12333-22222210000010011102788410100234884249965999899999999998698999999999999999 Q gi|255764474|r 251 APPLY-EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAV 329 (352) Q Consensus 251 ~pS~~-Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~ 329 (352) .|.+. +|+.++++||||||+|||+|..|. |-+...+|+..+ .++|++++|++|.+|++|++++++|+++||+++. T Consensus 303 ~Pl~~g~G~~~KilEama~g~PvVst~~g~--egl~~~~g~~~l--ia~~~~~fa~~i~~Ll~d~~~~~~l~~~~r~~v~ 378 (397) T TIGR03087 303 APLRIARGIQNKVLEAMAMAKPVVASPEAA--EGIDALPGAELL--VAADPADFAAAILALLANPAEREELGQAARRRVL 378 (397) T ss_pred EECCCCCCCCHHHHHHHHCCCCEEECCCCC--CCCCCCCCCEEE--ECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 946544575357999997699899777430--244367897059--5799999999999998199999999999999999 Q ss_pred HHCCHHHHHHHHHHHHHH Q ss_conf 829989999999999999 Q gi|255764474|r 330 KHFSIVKEASDIGKVYDR 347 (352) Q Consensus 330 ~~fs~~~~a~~~~~iY~~ 347 (352) ++|||++.+++++++|++ T Consensus 379 ~~ysW~~~~~~le~~~~~ 396 (397) T TIGR03087 379 QHYHWPRNLARLDALLEQ 396 (397) T ss_pred HHCCHHHHHHHHHHHHCC T ss_conf 829999999999998649 No 42 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=100.00 E-value=3.2e-35 Score=256.21 Aligned_cols=325 Identities=20% Similarity=0.284 Sum_probs=238.0 Q ss_pred CCHHHHHHHHHHHHHHHCC--EEEEEECCCCC-----------------------------------CCCCCCH--HHHH Q ss_conf 7077888899999997198--59998058866-----------------------------------5623478--9989 Q gi|255764474|r 18 TGVTSTVFGLCPIQRKLGQ--RLVVFGYCLPK-----------------------------------NIPSIGI--SSLL 58 (352) Q Consensus 18 ~Gv~~~v~~~~~~~~~~~~--~~~~~~~~~~~-----------------------------------~~~~~~~--~~~~ 58 (352) +|-+.||+++++.+.+.-. ++.++|--+.+ .||..+- -..+ T Consensus 26 GGQ~~YVlELar~La~~~ev~qVdlvTR~I~D~~vspDYA~~~E~i~p~ArIvRl~fGP~rYLRKE~LWPyLD~~~D~~l 105 (445) T TIGR02472 26 GGQTKYVLELARALARRSEVEQVDLVTRLIKDAKVSPDYAQPIERIAPKARIVRLPFGPRRYLRKELLWPYLDELADQLL 105 (445) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 43377888889986158975147677503431776802334776408995599745788855740156600789999999 Q ss_pred HHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHH----------------HH------HH------H Q ss_conf 98875432597089962854689999999970998199975745620----------------11------34------7 Q gi|255764474|r 59 TCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRN----------------HS------RW------T 110 (352) Q Consensus 59 ~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~------~~------~ 110 (352) . .-+-..+.||+||+|-..+.-.|.-+...+++|++++.||-+-.. .. .+ . T Consensus 106 ~-ylr~~g~lPdlIH~HYADAGYVG~~ls~~L~vPlvfTGHSLGR~Kr~RLLa~G~~skaPkP~~~IE~~f~is~RI~AE 184 (445) T TIGR02472 106 S-YLRQQGRLPDLIHAHYADAGYVGARLSRLLGVPLVFTGHSLGREKRRRLLAAGLKSKAPKPPQEIEKQFNISRRIEAE 184 (445) T ss_pred H-HHHHCCCCCCCCHHHHCCHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCHHHH T ss_conf 9-997358888731010101558999998625898488375357789999984340026788778999861226414788 Q ss_pred HHHHHHCCEEEECCHHH--------HHHCCCCEEEECCCCCHHHHCCCCCHH------HHHHHCCCCCCCCEEEEEEEEC Q ss_conf 98996266899889899--------974368839974882777823784326------8788728898872799951226 Q gi|255764474|r 111 RYLISRMDEVITTSQKS--------ARFIERPSTVIMHGVDTERFRPTSNKQ------EARRHLKISEDAKLIGCFGRIR 176 (352) Q Consensus 111 ~~l~~~~d~ii~~s~~~--------~~~~~~~~~vi~~gid~~~~~~~~~~~------~~~~~~~~~~~~~~i~~~G~~~ 176 (352) +..+..|+-||+.+... ..|-+.++.|||.|+|+++|.|....+ +....|=-.+++++|+.++|-. T Consensus 185 E~tL~~AslvitST~QEi~~QY~~Y~~y~P~r~~VIPPGvD~~rFyp~~~~~~~~~i~~~L~rFL~~p~KP~ilaisRpd 264 (445) T TIGR02472 185 EETLAHASLVITSTHQEIEEQYALYDSYDPERMEVIPPGVDLSRFYPPQSSEEDSEIDKLLARFLKDPEKPPILAISRPD 264 (445) T ss_pred HHHHHHCCEEEEECCCEEHHHHCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCC T ss_conf 99985147458614510321210147867021135178888754347888888758887522311478878388722788 Q ss_pred HHHHHHHHHHHHHHH--HCCCCCEEEEEECCCCCCC-----CCHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCHHH- Q ss_conf 445123666545653--0047740588631333210-----0014667764431123322122222--23310000000- Q gi|255764474|r 177 KLKGTDLFVDCMINI--LPHHPGWTAVVVGKTTLKH-----YLFKKNLQRRIYANGLKKRILFIDE--QSSIEDWYRAL- 246 (352) Q Consensus 177 ~~Kg~~~li~a~~~l--~~~~~~~~l~i~G~g~~~~-----~~~~~~l~~~i~~~~l~~~V~~~g~--~~~~~~~~~~a- 246 (352) ++||+..|++|+.+= +++.-|..+++...-+.+. ..-+.++-..+..+.|-++|=++-. .+|++++|+-| T Consensus 265 ~RKNi~~Lv~aYG~~p~L~~~aNLVlvlG~RdD~r~me~~qR~Vl~~vl~~iD~YDLYGkvAyPK~H~~~dvP~lYRLAA 344 (445) T TIGR02472 265 RRKNIPALVEAYGRSPKLQEMANLVLVLGSRDDIRKMESSQREVLTEVLLLIDRYDLYGKVAYPKHHRPDDVPELYRLAA 344 (445) T ss_pred CCCCHHHHHHHHCCCHHHHHHCCEEEEECCCCCHHHCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 76674555620078866765208088752778853121578999999987630002456402688888112326789998 Q ss_pred ---HHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf ---00001233322222210000010011102788410100234884249965999899999999998698999999999 Q gi|255764474|r 247 ---NIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNR 323 (352) Q Consensus 247 ---di~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~ 323 (352) .|||-|..+|+||||++||.|||+|+|+|+-||-.||+. +.+||++|++-|.++|++||.+.++|+++++.+++| T Consensus 345 ~~rGiFVNPALTEPFGLTLlEAAAcGLPivAT~DGGP~dI~~--~C~NGLLvd~ld~e~i~~AL~~alsd~~QW~~Ws~N 422 (445) T TIGR02472 345 RRRGIFVNPALTEPFGLTLLEAAACGLPIVATEDGGPRDIIA--NCRNGLLVDVLDLEAIASALEQALSDSSQWQTWSDN 422 (445) T ss_pred HCCCEEECCCCCCCCHHHHHHHHHCCCCEEECCCCCCHHHHH--HCCCCCEECCCCHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 659867627212530168999997699721078648668884--288875005789899999999733890667899985 Q ss_pred HHHHHHHHCCHHHHHHHHHHHH Q ss_conf 9999998299899999999999 Q gi|255764474|r 324 GRERAVKHFSIVKEASDIGKVY 345 (352) Q Consensus 324 a~~~~~~~fs~~~~a~~~~~iY 345 (352) |.+-+..||||++-+++|++.. T Consensus 423 Gi~gV~~HYSW~aHV~~YL~~~ 444 (445) T TIGR02472 423 GIEGVRRHYSWDAHVEKYLRLL 444 (445) T ss_pred HHHHHCCCCCHHHHHHHHHHHC T ss_conf 5776401555467999999732 No 43 >TIGR02095 glgA glycogen/starch synthases, ADP-glucose type; InterPro: IPR011835 This family consists of glycogen (or starch) synthases that use ADP-glucose (2.4.1.21 from EC), rather than UDP-glucose (2.4.1.11 from EC) as in animals, as the glucose donor. This enzyme is found in bacteria and plants. Whether the name given is glycogen synthase or starch synthase depends on context, and therefore on substrate.; GO: 0009011 starch synthase activity, 0009250 glucan biosynthetic process. Probab=100.00 E-value=3e-33 Score=242.89 Aligned_cols=278 Identities=23% Similarity=0.328 Sum_probs=214.7 Q ss_pred HHHHCCCE-EEEECCCHHHHHHHHHHHHHC----CCEEEEECCH--HHHH------------------------------ Q ss_conf 54325970-899628546899999999709----9819997574--5620------------------------------ Q gi|255764474|r 63 KPIGQNSR-IWHARRNNEMLLGVMMRDVLR----MPLKLVFTSP--SQRN------------------------------ 105 (352) Q Consensus 63 ~~~~~~~~-ivh~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--~~~~------------------------------ 105 (352) ......+| |||+|.|-..+....++...+ .+++.+||-| .+.. T Consensus 140 ~~~~~~PDN~vH~HDWhtaL~P~llk~~~~~~~f~~~~~v~TIHNl~yQG~fp~~~~~~~~glp~~~~~~~~~e~~D~p~ 219 (517) T TIGR02095 140 LPLGWQPDNVVHAHDWHTALVPALLKEVYRNNGFKRIKTVFTIHNLAYQGVFPAGDLFSLLGLPPELFHMEGLEFRDNPV 219 (517) T ss_pred CCCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEEEECHHHCCCCCHHHHHHHHCCCHHHCCCCCHHCCCCCC T ss_conf 33446888079965768999999999740568883454678740321266788789987627886681741011056885 Q ss_pred ---HHHHHHHHHHHCCEEEECCHHHHHHCCC-------------------CEEEECCCCCHHHHCCC------------- Q ss_conf ---1134798996266899889899974368-------------------83997488277782378------------- Q gi|255764474|r 106 ---HSRWTRYLISRMDEVITTSQKSARFIER-------------------PSTVIMHGVDTERFRPT------------- 150 (352) Q Consensus 106 ---~~~~~~~l~~~~d~ii~~s~~~~~~~~~-------------------~~~vi~~gid~~~~~~~------------- 150 (352) .-.|.+--+..+|.|.++|+.-|+.+.. +..=|.||||.+.++|. T Consensus 220 ~~~~~nflKgGi~~ad~vtTVSPtYA~EI~t~pe~G~gL~g~l~~~~~~~~l~GIlNGID~~~WNP~tD~~L~~~Ys~~~ 299 (517) T TIGR02095 220 NYGRLNFLKGGIVYADRVTTVSPTYAREILTPPEFGCGLDGVLKARVRSGKLRGILNGIDTEVWNPATDPYLKANYSADD 299 (517) T ss_pred CCCHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 78617777678734881037860689972688444524699984032057731133233434468532543343578432 Q ss_pred -------CC-HHHHHHHCCCC---CCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHH Q ss_conf -------43-26878872889---88727999512264451236665456530047740588631333210001466776 Q gi|255764474|r 151 -------SN-KQEARRHLKIS---EDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQR 219 (352) Q Consensus 151 -------~~-~~~~~~~~~~~---~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~ 219 (352) .+ +..+.+++|++ ++.++++++|||+.+||+|++++|+.++..+..-.+|+++|.|+++- .++++. T Consensus 300 ~D~~~K~~ncK~aLq~~lGL~~~Y~~~Pl~~~isRL~~QKG~Dl~~~a~~~ll~~~~~~Qlv~lG~Gdp~l---e~~l~~ 376 (517) T TIGR02095 300 LDLSGKAKNCKEALQEELGLPVDYDDVPLFGVISRLVEQKGVDLLLAALPELLELGDFGQLVVLGTGDPEL---EEALRE 376 (517) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCHHH---HHHHHH T ss_conf 15666788758999998198878888537999822562442789999999997117966899970488799---999999 Q ss_pred HH--HHCCCCCCCCC-CCCCCCC-CCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCC-------- Q ss_conf 44--31123322122-2222331-000000000001233322222210000010011102788410100234-------- Q gi|255764474|r 220 RI--YANGLKKRILF-IDEQSSI-EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPE-------- 287 (352) Q Consensus 220 ~i--~~~~l~~~V~~-~g~~~~~-~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~-------- 287 (352) .. +++ .++|.+ .++-+.+ +.+|+.||+||+||+.|++||+-+.||.+|++.|++.+||+.|+|... T Consensus 377 la~~~~~--p~~~~~~~~yde~LAh~iyAgaD~~lmPSrFEPCGL~Ql~amRYGt~PiVr~tGGL~DTV~d~~~~~~~aP 454 (517) T TIGR02095 377 LADHERY--PGKVRVIIGYDEALAHRIYAGADFFLMPSRFEPCGLTQLYAMRYGTVPIVRRTGGLADTVVDADPENLAAP 454 (517) T ss_pred HHHHHHC--CCCEEEEEECCHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHCCCCEEEECCCCCCEEEECCCCCCCCCC T ss_conf 9999637--89489996258799998972377688078557312579989734995387158895201003877644477 Q ss_pred -CCCEEEEECCCCHHHHHHHHHHHH---C-CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf -884249965999899999999998---6-989999999999999998299899999999999998 Q gi|255764474|r 288 -NAKAGVIVPPRNLHALEKAVLYFM---N-SKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 288 -~g~~G~~~~~~d~~~la~~i~~l~---~-~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) ...|||.++..|+++|..+|.+.+ . +++.+++|.++| +.++|||++.|++|+++|+++ T Consensus 455 ~~~~tGF~F~~~~~~~L~~a~~rAl~lY~~~~~~w~~l~~~a---M~~DfSW~~sA~~Y~~lY~~~ 517 (517) T TIGR02095 455 AGSGTGFLFEEYDPEALLAALSRALRLYRQDPELWKALQKNA---MSQDFSWDKSAKQYVELYRSL 517 (517) T ss_pred CCCCCCEEECCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHH---HCCCCCHHHHHHHHHHHHHCC T ss_conf 877654172368889999999999998723978999999985---136876105799999999609 No 44 >PRK10125 predicted glycosyl transferase; Provisional Probab=99.97 E-value=6.2e-29 Score=213.87 Aligned_cols=314 Identities=14% Similarity=0.117 Sum_probs=200.6 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE-CC------C-----CCCCCCCCH----HH--------------H Q ss_conf 97078787877077888899999997198599980-58------8-----665623478----99--------------8 Q gi|255764474|r 8 VIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG-YC------L-----PKNIPSIGI----SS--------------L 57 (352) Q Consensus 8 ~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~-~~------~-----~~~~~~~~~----~~--------------~ 57 (352) |+.-|-.-..+|....+.++...+.+.|++..... .. . +..+..... .. . T Consensus 3 VL~INts~~~GGAaraA~rLh~~L~~~G~~s~mlv~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~ 82 (405) T PRK10125 3 ILQFNVRLAEGGAAGVALDLHQRALQQGLASHFVYGYGKGGKESVSHQNYPQVIKHTPRMTAMANIALFRLFNRDLFGNF 82 (405) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99997366988299999999999985799669999735789851135776620212404677765778761363224562 Q ss_pred HHHH-HHHHHCCCEEEEECCC-H-----HHHHHHHH-HH--HHCCCEEEEECC--------HHHHHHH------------ Q ss_conf 9988-7543259708996285-4-----68999999-99--709981999757--------4562011------------ Q gi|255764474|r 58 LTCW-KKPIGQNSRIWHARRN-N-----EMLLGVMM-RD--VLRMPLKLVFTS--------PSQRNHS------------ 107 (352) Q Consensus 58 ~~~~-~~~~~~~~~ivh~~~~-~-----~~~~~~~~-~~--~~~~~~~~~~~~--------~~~~~~~------------ 107 (352) ..+. .......+||+|.|.. . ..+..+.. .. ..+.|++.+.|. |...... T Consensus 83 ~~~~~~i~~~~~pDIihLH~ih~~~ln~~~l~~~~~~~ki~k~~kPvVWTLHDmW~fTG~Chy~~~C~~wk~gC~~CP~L 162 (405) T PRK10125 83 NELYRTITRTPGPVVLHFHVLHSYWLNLKSVVRFCEKVKNHKPDVTLVWTLHDHWSVTGRCAFTDGCEGWKTGCQKCPTL 162 (405) T ss_pred HHHHHHHHCCCCCCEEEEEECCCCEECHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHCCCCCCCC T ss_conf 67888762157898899740254543099998786554430479978999436765567778878437676255656445 Q ss_pred -------------HH--HHHHHHH----CCEEEECCHHHH-----HHCCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCC Q ss_conf -------------34--7989962----668998898999-----74368839974882777823784326878872889 Q gi|255764474|r 108 -------------RW--TRYLISR----MDEVITTSQKSA-----RFIERPSTVIMHGVDTERFRPTSNKQEARRHLKIS 163 (352) Q Consensus 108 -------------~~--~~~l~~~----~d~ii~~s~~~~-----~~~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~ 163 (352) .| .+.+++. ...+++.|...+ .+...++.+||||||+..+.+..+....+ .. T Consensus 163 ~~yp~~~~D~s~~~~~~K~~~~~~~~~~~~~~V~ps~~~~~~~~sl~~~~~v~vIpNgID~~~~~~~~~~~~~~----~~ 238 (405) T PRK10125 163 NNYPPVRVDRAHQLVAGKRQLFREMLRLGCQFISPSQHVADAFNSLYGPGRCRIINNGIDMATEAILAELPPVR----ET 238 (405) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCHHHHHHC----CC T ss_conf 67885335748999999999987535217719978899999987561799867838997854344430455404----37 Q ss_pred CCCCEEEEEEE--ECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC---CC Q ss_conf 88727999512--26445123666545653004774058863133321000146677644311233221222222---33 Q gi|255764474|r 164 EDAKLIGCFGR--IRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ---SS 238 (352) Q Consensus 164 ~~~~~i~~~G~--~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~---~~ 238 (352) .+.+.++++.. -.+.|+...+++++..+. .+..++++|...... ...+..+|+. .+ T Consensus 239 ~~~~~i~~~a~~~~~~~k~~~~ll~~l~~l~---~~~~l~~~G~~~~~~----------------~~~v~~lg~~~d~~~ 299 (405) T PRK10125 239 QGKPRIAVVAHDLRYDGKTDQQLVREMMALG---DKIELHTFGKFSPFT----------------AGNVVNHGFETDKRK 299 (405) T ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHCC---CCEEEEEEECCCCCC----------------CCCEEECCCCCCHHH T ss_conf 8997699995454456333899999998448---970899972576446----------------887464687589999 Q ss_pred CCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 10000000000012333222222100000100111027884101002348842499659998999999999986989999 Q gi|255764474|r 239 IEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMS 318 (352) Q Consensus 239 ~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~ 318 (352) +..+|++||+|++||+.|+||++++||||||+|||++++||++|+++. .+|+++++.|.+++|+.+...+.+..... T Consensus 300 La~~YsaAd~~v~ps~~e~~~~~~~Ea~acg~pvv~~~~~g~~~~~~~---~~g~~~~~~d~~~la~~i~~~~~~~~~~~ 376 (405) T PRK10125 300 LMSALNQMDALVFSSRVDNYPLILCEALSIGVPVIATHSDAAREVLQK---SGGKTVSEEEVLQLAQLSKPEIAQAVFGT 376 (405) T ss_pred HHHHHHHCCEEEECCHHHCCCHHHHHHHHCCCCEEEECCCCCHHHEEC---CCCEEECCCCHHHHHHHHHHHHHCCHHHH T ss_conf 999996378897274675465089999974998898359997365115---87459668899999998999984604677 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 999999999998299899999999999998 Q gi|255764474|r 319 DTGNRGRERAVKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 319 ~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~ 348 (352) ..+++++.+.++||++.++++|.++|+++ T Consensus 377 -~~~~~~~~~~~~fs~~~~a~~Y~~lY~~l 405 (405) T PRK10125 377 -TLAEFSQRSRAAYSGQQMLEEYVNFYQNL 405 (405) T ss_pred -HHHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf -89999999998679999999999998619 No 45 >cd03806 GT1_ALG11_like This family is most closely related to the GT1 family of glycosyltransferases. ALG11 in yeast is involved in adding the final 1,2-linked Man to the Man5GlcNAc2-PP-Dol synthesized on the cytosolic face of the ER. The deletion analysis of ALG11 was shown to block the early steps of core biosynthesis that takes place on the cytoplasmic face of the ER and lead to a defect in the assembly of lipid-linked oligosaccharides. Probab=99.97 E-value=1.7e-27 Score=204.14 Aligned_cols=322 Identities=15% Similarity=0.139 Sum_probs=222.7 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHC--CEEEEEECCC-CC------------------CCC---------CCC- Q ss_conf 4589707878787707788889999999719--8599980588-66------------------562---------347- Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLG--QRLVVFGYCL-PK------------------NIP---------SIG- 53 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~--~~~~~~~~~~-~~------------------~~~---------~~~- 53 (352) -|..++|...-. +|.++.++..+..+.+.+ +.+++++... +. ... .+. T Consensus 2 ~VgFfHPycn~G-GGgERVLw~Av~alq~~~~~~~vvIYT~d~d~~~~~il~~v~~~F~i~l~~~~~~~~~~~~~~~i~~ 80 (419) T cd03806 2 TVGFFHPYCNAG-GGGERVLWCAVRALQKRYPNNIVVIYTGDLDATPEEILEKVESRFNIELDRPRIVFFLLKYRKLVEA 80 (419) T ss_pred CEEEECCCCCCC-CCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHCCEEECCCCEEEEEEEECCEECH T ss_conf 389978998998-4148999999999985599965999899989977898998885077065698349999301235212 Q ss_pred ------------HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHH----------H-------- Q ss_conf ------------8998998875432597089962854689999999970998199975745----------6-------- Q gi|255764474|r 54 ------------ISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPS----------Q-------- 103 (352) Q Consensus 54 ------------~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~-------- 103 (352) +.++...+.......+||+--....+.. --+++.+.+.|+....|.+. . T Consensus 81 ~~y~~ftll~q~lgs~~l~~eal~~~~pdvfiDt~g~af~-~pl~k~~~~~~V~~Y~HyP~is~Dml~~v~~~~~~~nn~ 159 (419) T cd03806 81 STYPRFTLLGQALGSMILGLEALLKLVPDIFIDTMGYPFT-YPLVRLLGGCPVGAYVHYPTISTDMLQKVRSREASYNNS 159 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCHHH-HHHHHHCCCCCEEEEECCCCCCHHHHHHHHHCCCCCCCH T ss_conf 1306889999999999999999964699889967863216-789997379846999728876277999886123223420 Q ss_pred -----------------HHHHHHHHHHHHHCCEEEECCHHHHHHCC------CCEEEECCCCCHHHHCCCCCHHHHHHHC Q ss_conf -----------------20113479899626689988989997436------8839974882777823784326878872 Q gi|255764474|r 104 -----------------RNHSRWTRYLISRMDEVITTSQKSARFIE------RPSTVIMHGVDTERFRPTSNKQEARRHL 160 (352) Q Consensus 104 -----------------~~~~~~~~~l~~~~d~ii~~s~~~~~~~~------~~~~vi~~gid~~~~~~~~~~~~~~~~~ 160 (352) +-+..+.+|..+.+|.|++.|.+++.... ....|++..+|++.|...+... T Consensus 160 ~~ia~~~~~s~~K~lY~~~f~~ly~~~~~~ad~v~vNS~~T~~~i~~~w~~~~~~~VvYPP~d~~~~~~~~~~~------ 233 (419) T cd03806 160 ATIARSPVLSKAKLLYYRLFAFLYGLAGSFADVVMVNSTWTRNHIRSLWKRNTKPSIVYPPCDVEELLKLPLDE------ 233 (419) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHCCCCCCC------ T ss_conf 24322327789999999999999999725774999857869999999838888864147997767745366542------ Q ss_pred CCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCC-----CEEEEEECCCC-CCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 889887279995122644512366654565300477-----40588631333-210001466776443112332212222 Q gi|255764474|r 161 KISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP-----GWTAVVVGKTT-LKHYLFKKNLQRRIYANGLKKRILFID 234 (352) Q Consensus 161 ~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~-----~~~l~i~G~g~-~~~~~~~~~l~~~i~~~~l~~~V~~~g 234 (352) ......+++++|+.++|++++.|+|++++.++.+ +++|+++|+.. .++..+.++|+.++++++++++|.|.+ T Consensus 234 --~~~~~~ilSi~rfrpeKn~~L~i~af~~l~~~~~~~~~~~~~Lvi~Gg~R~~ed~~~~~~L~~la~~l~l~~~V~f~~ 311 (419) T cd03806 234 --KTRENQILSIAQFRPEKNHPLQLRAFAKLLKRLPEEIKEKIKLVLIGSCRNEDDEKRVEDLKLLAKELGLEDKVEFVV 311 (419) T ss_pred --CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEEE T ss_conf --357667999864554568699999999998746301056855999948876556999999999999729988769981 Q ss_pred CC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCC-HHCC-CCCCCEEEEECCCCHHHHHHHHHHH Q ss_conf 22--331000000000001233322222210000010011102788410-1002-3488424996599989999999999 Q gi|255764474|r 235 EQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS-ELLD-PENAKAGVIVPPRNLHALEKAVLYF 310 (352) Q Consensus 235 ~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~-e~i~-~~~g~~G~~~~~~d~~~la~~i~~l 310 (352) .. +++..+++.|.+++++-..|.||++++||||+|+|+|+-+.||-. ++|. ..++.+||+++ +++++++++.++ T Consensus 312 ~~s~~e~~~lL~~a~~~l~T~~nEHFGI~pVEaMaaG~pvvA~nSGGP~edIV~~~~~~~tGfL~~--~~~e~a~a~~~~ 389 (419) T cd03806 312 NAPFEELLEELSTASIGLHTMWNEHFGIGVVEYMAAGLIPLAHASGGPLLDIVVPWDGGPTGFLAS--TAEEYAEAIEKI 389 (419) T ss_pred CCCHHHHHHHHHHCEEEEECCCCCCCCCHHHHHHHCCCCEEEECCCCCCCCEEEECCCCCCCCCCC--CHHHHHHHHHHH T ss_conf 599899999997397988557325668589999866995799788897530776058998511279--879999999999 Q ss_pred HC-CHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 86-989999999999999998299899999 Q gi|255764474|r 311 MN-SKKIMSDTGNRGRERAVKHFSIVKEAS 339 (352) Q Consensus 311 ~~-~~~~~~~~~~~a~~~~~~~fs~~~~a~ 339 (352) ++ .++.+.+|.+++|..+ ++||-+...+ T Consensus 390 l~~~~~~~~~~~~~ar~~~-~rFS~~~F~~ 418 (419) T cd03806 390 LSLSEEERLRIRRAARSSV-KRFSDEEFER 418 (419) T ss_pred HCCCHHHHHHHHHHHHHHH-HHCCHHHHCC T ss_conf 8199878999999999999-8468987565 No 46 >pfam00534 Glycos_transf_1 Glycosyl transferases group 1. Mutations in this domain of subunit A of phosphatidylinositol N-acetylglucosaminyltransferase lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharides. Members of this family transfer UDP, ADP, GDP or CMP linked sugars. The eukaryotic glycogen synthases may be distant members of this family. Probab=99.96 E-value=5.4e-29 Score=214.30 Aligned_cols=167 Identities=32% Similarity=0.602 Sum_probs=153.9 Q ss_pred HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCC-CCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 268788728898872799951226445123666545653004-7740588631333210001466776443112332212 Q gi|255764474|r 153 KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPH-HPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRIL 231 (352) Q Consensus 153 ~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~-~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~ 231 (352) +...|..++++++.++++|+|++.++||++.+++|+.++.++ .+++.++++|.++.. ..++..+...++.+++. T Consensus 2 ~~~~r~~~~i~~~~~vi~~~G~~~~~Kg~~~~i~a~~~l~~~~~~~~~l~i~G~~~~~-----~~~~~~~~~~~~~~~i~ 76 (172) T pfam00534 2 REEERKKLGIPEDKPIILFVGRLVPEKGLDLLLEAFALLKEQLHPNLKLVIVGDGEEE-----KKLKKLALKLGLEDNVI 76 (172) T ss_pred HHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCH-----HHHHHHHHHCCCCCCEE T ss_conf 7899998799999959999654855449899999899888740898599998378326-----78999999839998689 Q ss_pred CCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHH Q ss_conf 222223--310000000000012333222222100000100111027884101002348842499659998999999999 Q gi|255764474|r 232 FIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLY 309 (352) Q Consensus 232 ~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~ 309 (352) |.|+.+ ++..+|..+|++++||..|+||++++|||++|+|+|+|+.|+.+|++. ++.+|++++++|+++++++|.+ T Consensus 77 ~~~~~~~~~~~~~l~~sdi~i~ps~~E~~~~~~~Eam~~G~pvI~s~~~~~~eii~--~~~~G~~~~~~~~~~l~~~i~~ 154 (172) T pfam00534 77 FVGFVPREDLIELYRIADLFVLPSRYEGFGLVLLEAMAAGVPVIASDVGGPAEIVK--DGETGLLVDPGDAEALAEAIEK 154 (172) T ss_pred EECCCCHHHHHHHHHHHHHCCCCCHHHCCCCHHHHHHHCCCEEEEECCCCHHHHHH--CCCEEEEECCCCHHHHHHHHHH T ss_conf 95788989999999972410477366515711899996797199956997299971--8983999789999999999999 Q ss_pred HHCCHHHHHHHHHHHHH Q ss_conf 98698999999999999 Q gi|255764474|r 310 FMNSKKIMSDTGNRGRE 326 (352) Q Consensus 310 l~~~~~~~~~~~~~a~~ 326 (352) +++|++.+.+|+++|++ T Consensus 155 li~n~~~~~~m~~n~~~ 171 (172) T pfam00534 155 LLKDEELRERLGENARK 171 (172) T ss_pred HHCCHHHHHHHHHHHCC T ss_conf 98799999999998448 No 47 >cd01635 Glycosyltransferase_GTB_type Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. The structures of the formed glycoconjugates are extremely diverse, reflecting a wide range of biological functions. The members of this family share a common GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=99.95 E-value=5e-26 Score=194.32 Aligned_cols=225 Identities=24% Similarity=0.345 Sum_probs=163.6 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHH- Q ss_conf 5897078787877077888899999997198599980588665623478998998875432597089962854689999- Q gi|255764474|r 6 IDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRNNEMLLGV- 84 (352) Q Consensus 6 i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~- 84 (352) |.++...+.+..+|++..+.++.+.+.+.|+.+.+.... .....+.....++|++|+|......... T Consensus 1 Il~v~~~~~p~~GG~~~~~~~l~~~l~~~gh~v~v~~~~------------~~~~~~~~~~~~pDiIH~h~~~~~~~~~~ 68 (229) T cd01635 1 ILLVSTPLLPGGGGVELVLLDLAKALARRGHEVEVVALL------------LLLLLRILRGFKPDVVHAHGYYPAPLALL 68 (229) T ss_pred CEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECC------------HHHHHHHHHHCCCCEEEECCCCHHHHHHH T ss_conf 989939989999959999999999999869999999388------------49999999809988899899727999999 Q ss_pred HHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCC Q ss_conf 99997099819997574562011347989962668998898999743688399748827778237843268788728898 Q gi|255764474|r 85 MMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIERPSTVIMHGVDTERFRPTSNKQEARRHLKISE 164 (352) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~ 164 (352) ..+...+.|+..++|...........+.. ..... T Consensus 69 ~~~~~~~~~~v~t~H~~~~~~~~~~~~~~----------------------------------------------~~~~~ 102 (229) T cd01635 69 LAARLLGIPLVLTVHGVNRSLLEGVPLSL----------------------------------------------LALSI 102 (229) T ss_pred HHHHHCCCCEEEEECCCCHHHHHHHHHHH----------------------------------------------HHHCC T ss_conf 99984899199998999803424455555----------------------------------------------54211 Q ss_pred CCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC---CCCCCC Q ss_conf 87279995122644512366654565300477405886313332100014667764431123322122222---233100 Q gi|255764474|r 165 DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE---QSSIED 241 (352) Q Consensus 165 ~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~---~~~~~~ 241 (352) .....+|+||+.++||++.+++|+..+.++.++++++++|+|+.. +.++......+..+++.+.|. .+++.. T Consensus 103 ~~~~~~~vgrl~~~K~~~~li~a~~~l~~~~~~~~l~i~G~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (229) T cd01635 103 GLADKVFVGRLAPEKGLDDLIEAFALLKERGPDLKLVIAGDGPER-----EYLEELLAALLLLDRVIFLGGLDPEELLAL 177 (229) T ss_pred CCEEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHH-----HHHHHHHHHCCCCCCEEECCCCCCHHHHHH T ss_conf 361899999546212999999999998867899489999688068-----899999997288774632362210678999 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEE Q ss_conf 000000000123332222221000001001110278841010023488424996 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIV 295 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~ 295 (352) +++.+|++++||.+||||++++|||+||+|||+|++|+.+|++. ++.+|+++ T Consensus 178 ~~~~~d~~v~pS~~E~~~~~~~EA~a~G~pvi~~~~gg~~e~i~--~~~~G~lv 229 (229) T cd01635 178 LLAAADVFVLPSLREGFGLVVLEAMACGLPVIATDVGGPPEIVE--DGLTGLLV 229 (229) T ss_pred HHHHCCEEEECCCCCCCCHHHHHHHHCCCCEEECCCCCCHHHCC--CCCCEEEC T ss_conf 99706806605666788889999998299899878998377626--99956869 No 48 >COG0297 GlgA Glycogen synthase [Carbohydrate transport and metabolism] Probab=99.94 E-value=1.6e-25 Score=190.87 Aligned_cols=276 Identities=20% Similarity=0.271 Sum_probs=203.4 Q ss_pred CCCEEEEECCCHHHHHHHHHHHH----HCCCEEEEECCHHHHHH--------H------------------HHHHHHHHH Q ss_conf 59708996285468999999997----09981999757456201--------1------------------347989962 Q gi|255764474|r 67 QNSRIWHARRNNEMLLGVMMRDV----LRMPLKLVFTSPSQRNH--------S------------------RWTRYLISR 116 (352) Q Consensus 67 ~~~~ivh~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~--------~------------------~~~~~l~~~ 116 (352) ..+||+|+|.+-..+....++.. ...|.+++.|+-.+... . .+.+--+.. T Consensus 129 ~~pDIvH~hDWqt~L~~~~lk~~~~~~~~i~tVfTIHNl~~qG~~~~~~~~~lgLp~~~~~~~~l~~~~~~~~lK~gi~~ 208 (487) T COG0297 129 WLPDIVHAHDWQTGLLPAYLKQRYRSGYIIPTVFTIHNLAYQGLFRLQYLEELGLPFEAYASFGLEFYGQISFLKGGLYY 208 (487) T ss_pred CCCCEEEEECHHHHHHHHHHHHCCCCCCCCCEEEEEEECEEECCCCHHHHHHHCCCHHHHHHCEEEECCCCHHHHHHHEE T ss_conf 77888885018877789998512335666775999941002301642567772698888621104664760266543042 Q ss_pred CCEEEECCHHHHHHCC----------------CCEEEECCCCCHHHHCCCCC------------------HHHHHHHCCC Q ss_conf 6689988989997436----------------88399748827778237843------------------2687887288 Q gi|255764474|r 117 MDEVITTSQKSARFIE----------------RPSTVIMHGVDTERFRPTSN------------------KQEARRHLKI 162 (352) Q Consensus 117 ~d~ii~~s~~~~~~~~----------------~~~~vi~~gid~~~~~~~~~------------------~~~~~~~~~~ 162 (352) +|++.++|+..++.+- .+..-|-||+|.+.++|..+ +..+...+++ T Consensus 209 ad~vttVSptYa~Ei~t~~~g~gl~g~l~~~~~~l~GI~NgiD~~~wnp~~d~~~~~~y~~~~~~~k~~nk~~L~~~~gL 288 (487) T COG0297 209 ADAVTTVSPTYAGEIYTPEYGEGLEGLLSWRSGKLSGILNGIDYDLWNPETDPYIAANYSAEVLPAKAENKVALQERLGL 288 (487) T ss_pred CCEEEEECHHHHHHHCCCCCCCCCHHHHHHCCCCEEEEEEEEEECCCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 22899988778776526002555210555104518998744773231866443300147701033467899999998098 Q ss_pred CC--CCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCCCCC Q ss_conf 98--87279995122644512366654565300477405886313332100014667764431123322122-2222331 Q gi|255764474|r 163 SE--DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF-IDEQSSI 239 (352) Q Consensus 163 ~~--~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~-~g~~~~~ 239 (352) +. +.+.++++||+..+||+|++++++..+..+. +++++.|.|+. .+...+..+.+.+.. ++.. .|+.+.+ T Consensus 289 ~~~~~~pl~~~vsRl~~QKG~dl~~~~i~~~l~~~--~~~vilG~gd~---~le~~~~~la~~~~~--~~~~~i~~~~~l 361 (487) T COG0297 289 DVDLPGPLFGFVSRLTAQKGLDLLLEAIDELLEQG--WQLVLLGTGDP---ELEEALRALASRHPG--RVLVVIGYDEPL 361 (487) T ss_pred CCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHC--CEEEEEECCCH---HHHHHHHHHHHHCCC--EEEEEEEECHHH T ss_conf 77999758999422420134558999999999708--36999826827---899999999986676--099996336899 Q ss_pred -CCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC------CCCCEEEEECCCCHHHHHHHHHHHH- Q ss_conf -00000000000123332222221000001001110278841010023------4884249965999899999999998- Q gi|255764474|r 240 -EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP------ENAKAGVIVPPRNLHALEKAVLYFM- 311 (352) Q Consensus 240 -~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~------~~g~~G~~~~~~d~~~la~~i~~l~- 311 (352) +..|+.+|++++||+.|++|++-++||.+|++.|++.+||+.|.|.. ....+|+++.+.|+++++.+|.+.+ T Consensus 362 a~~i~agaD~~lmPSrfEPcGL~ql~amryGtvpIv~~tGGLadTV~~~~~~~~~~~gtGf~f~~~~~~~l~~al~rA~~ 441 (487) T COG0297 362 AHLIYAGADVILMPSRFEPCGLTQLYAMRYGTLPIVRETGGLADTVVDRNEWLIQGVGTGFLFLQTNPDHLANALRRALV 441 (487) T ss_pred HHHHHHCCCEEEECCCCCCCCHHHHHHHHCCCCCEECCCCCCCCEECCCCCHHCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 99998449899957767687589999987188446754688654103766110037503799826998999999999999 Q ss_pred --CCHHH-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC Q ss_conf --69899-999999999999982998999999999999988609 Q gi|255764474|r 312 --NSKKI-MSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRTA 352 (352) Q Consensus 312 --~~~~~-~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~a 352 (352) .++.. .+.+..++- ..+|+|++.+++|.++|+.++..+ T Consensus 442 ~y~~~~~~w~~~~~~~m---~~d~sw~~sa~~y~~lY~~~~~~~ 482 (487) T COG0297 442 LYRAPPLLWRKVQPNAM---GADFSWDLSAKEYVELYKPLLSKP 482 (487) T ss_pred HHHCCHHHHHHHHHHHC---CCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 96087899999998642---466680357888999999986466 No 49 >KOG0853 consensus Probab=99.94 E-value=9.9e-26 Score=192.30 Aligned_cols=233 Identities=21% Similarity=0.289 Sum_probs=187.4 Q ss_pred HHHCCEEEECCHHHHHHCC-------C-CEEEECCCCCHHHHCC------CCCHHHHHHHCCCCCCCCEEEEEEEECHHH Q ss_conf 9626689988989997436-------8-8399748827778237------843268788728898872799951226445 Q gi|255764474|r 114 ISRMDEVITTSQKSARFIE-------R-PSTVIMHGVDTERFRP------TSNKQEARRHLKISEDAKLIGCFGRIRKLK 179 (352) Q Consensus 114 ~~~~d~ii~~s~~~~~~~~-------~-~~~vi~~gid~~~~~~------~~~~~~~~~~~~~~~~~~~i~~~G~~~~~K 179 (352) ....|.+++.|..++.... . ...+.+-.+|.+.+.+ ..-+...|...++...+.+...+.++.|.| T Consensus 207 ~~~~~~~~~ns~~~~~~f~~~~~~L~~~d~~~~y~ei~~s~~~~~~~~~~~~~~~~~r~~~~v~~~d~~~~siN~~~pgk 286 (495) T KOG0853 207 TGLAWKILVNSYFTKRQFKATFVSLSNSDITSTYPEIDGSWFTYGQYESHLELRLPVRLYRGVSGIDRFFPSINRFEPGK 286 (495) T ss_pred HHHCCEEECCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHCCCCCCCCCHHCCCCCCEEEECCCCCEEEEEEEECCCCC T ss_conf 31015172214666654453553057888614632424011255542111010255422430046646754566417999 Q ss_pred HHHHHHHHHHHHHCCC-----CCEEEEEECCC-----CCCCCCHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCHHHHH Q ss_conf 1236665456530047-----74058863133-----32100014667764431123-3221222222331000000000 Q gi|255764474|r 180 GTDLFVDCMINILPHH-----PGWTAVVVGKT-----TLKHYLFKKNLQRRIYANGL-KKRILFIDEQSSIEDWYRALNI 248 (352) Q Consensus 180 g~~~li~a~~~l~~~~-----~~~~l~i~G~g-----~~~~~~~~~~l~~~i~~~~l-~~~V~~~g~~~~~~~~~~~adi 248 (352) +.+.+++|+.++.... +..+++++|+- ..+...+.+++.++++++++ .+.|.|+....+...+..++|+ T Consensus 287 d~~l~l~a~~~~~~~i~~~~~~~~hl~~~g~~G~d~~~sen~~~~~el~~lie~~~l~g~~v~f~~s~~~~~~yrl~adt 366 (495) T KOG0853 287 DQDLALPAFTLLHDSIPEPSISSEHLVVAGSRGYDERDSENVEYLKELLSLIEEYDLLGQFVWFLPSTTRVAKYRLAADT 366 (495) T ss_pred CCEEEHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHC T ss_conf 84664444776400357888771169994378764455355899999999999727666569984577638889998744 Q ss_pred H---CCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHH---HHHHHHHHHHCCHHHHHHHHH Q ss_conf 0---0123332222221000001001110278841010023488424996599989---999999999869899999999 Q gi|255764474|r 249 F---VAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLH---ALEKAVLYFMNSKKIMSDTGN 322 (352) Q Consensus 249 ~---i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~---~la~~i~~l~~~~~~~~~~~~ 322 (352) . ..|.. |.||++++|||+||+||++|+.||..|+|. ++.+|++++| +.+ .+|+++.++..||+++.+|++ T Consensus 367 ~~v~~qPa~-E~FGiv~IEAMa~glPvvAt~~GGP~EiV~--~~~tG~l~dp-~~e~~~~~a~~~~kl~~~p~l~~~~~~ 442 (495) T KOG0853 367 KGVLYQPAN-EHFGIVPIEAMACGLPVVATNNGGPAEIVV--HGVTGLLIDP-GQEAVAELADALLKLRRDPELWARMGK 442 (495) T ss_pred CEEEECCCC-CCCCCEEHHHHHCCCCEEEECCCCCEEEEE--CCCCCEEECC-CHHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 357726888-775633398785599889966999657898--4885044577-457789999999998139899999988 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999829989999999999999886 Q gi|255764474|r 323 RGRERAVKHFSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 323 ~a~~~~~~~fs~~~~a~~~~~iY~~~l~ 350 (352) +|++++.+.|||+.+.+++.++-.+.+. T Consensus 443 ~G~krV~e~fs~~~~~~ri~~~~~~~~~ 470 (495) T KOG0853 443 NGLKRVKEMFSWQHYSERIASVLGKYLQ 470 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 7889999987077799999998775077 No 50 >TIGR02470 sucr_synth sucrose synthase; InterPro: IPR012820 This entry represents sucrose synthase an enzyme that despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyses the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria.; GO: 0016157 sucrose synthase activity. Probab=99.92 E-value=1.6e-24 Score=184.22 Aligned_cols=277 Identities=19% Similarity=0.274 Sum_probs=192.7 Q ss_pred HHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHH------------------HHHHHH-HHHHHH--HHCCEEEE Q ss_conf 432597089962854689999999970998199975745------------------620113-479899--62668998 Q gi|255764474|r 64 PIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPS------------------QRNHSR-WTRYLI--SRMDEVIT 122 (352) Q Consensus 64 ~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~-~~~~l~--~~~d~ii~ 122 (352) -+...||.+-..-...=+-+.++++.++.--..+ .|+ ++|++. |+.-++ +.+|.||+ T Consensus 387 el~g~PDLIIGNYSDGNLVA~LLA~kLgVTQC~I--AHALEKtKY~~Sd~yW~~~e~kYHFS~QFTADLIAMN~aDFIIT 464 (790) T TIGR02470 387 ELQGKPDLIIGNYSDGNLVASLLARKLGVTQCTI--AHALEKTKYPDSDIYWQELEDKYHFSCQFTADLIAMNAADFIIT 464 (790) T ss_pred HHCCCCCEEEECCCCHHHHHHHHHHHCCCCCCCH--HHHCCCCCCCCCCCHHHHCCCCCCCCHHHHHHHHHHHCCCCEEE T ss_conf 8468997067325635899999985336222203--33133436766454042025765643057899996415896587 Q ss_pred CCHH--------HHHHC--------------------CCCEEEECCCCCHHHHCCCCCHH----HHH------------- Q ss_conf 8989--------99743--------------------68839974882777823784326----878------------- Q gi|255764474|r 123 TSQK--------SARFI--------------------ERPSTVIMHGVDTERFRPTSNKQ----EAR------------- 157 (352) Q Consensus 123 ~s~~--------~~~~~--------------------~~~~~vi~~gid~~~~~~~~~~~----~~~------------- 157 (352) .|-. ..+|- ..|-.+|+.|+|.+-|-|=.+.. ..+ T Consensus 465 STYQEIaGt~~~~GQYESh~aFTMP~LYrVV~GIDvFdPKFNiVsPGad~~iYFPY~~~~~Rl~~l~~~ie~LLf~~~~~ 544 (790) T TIGR02470 465 STYQEIAGTKDSVGQYESHQAFTMPGLYRVVHGIDVFDPKFNIVSPGADESIYFPYSDKEKRLTSLHPEIEELLFSDEDN 544 (790) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEECCCCCHHHCCHHHHHHHHHHCCCCCCC T ss_conf 11220146887788421221105887504761032258775546878878200376770104623689999861784587 Q ss_pred -HHCC--CCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC-------CCCCCHHHHHHHHHHHCCCC Q ss_conf -8728--8988727999512264451236665456530047740588631333-------21000146677644311233 Q gi|255764474|r 158 -RHLK--ISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTT-------LKHYLFKKNLQRRIYANGLK 227 (352) Q Consensus 158 -~~~~--~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~-------~~~~~~~~~l~~~i~~~~l~ 227 (352) .+.| -+++++.|-.++|++.-||+--|+|.+++=.+-..-+-|++||+-. .++....+++=+++++++|. T Consensus 545 ~~~~G~L~d~~KPiiFsMARLDrvKNlTGLvE~yg~s~~Lr~l~NLv~VaG~~~~~~S~DrEE~~Ei~kmh~lid~Y~L~ 624 (790) T TIGR02470 545 DEHIGYLKDPNKPIIFSMARLDRVKNLTGLVELYGRSKKLRELANLVVVAGKLDAKESKDREEQAEIEKMHNLIDQYQLH 624 (790) T ss_pred CCEEEEECCCCCCCEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 30078720768782663063001013477884850484478166888880656864467878999999999988864669 Q ss_pred CCCCCCCCCC---CCCCCCH-HH---HHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCH Q ss_conf 2212222223---3100000-00---000012333222222100000100111027884101002348842499659998 Q gi|255764474|r 228 KRILFIDEQS---SIEDWYR-AL---NIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNL 300 (352) Q Consensus 228 ~~V~~~g~~~---~~~~~~~-~a---di~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~ 300 (352) ++|.|+|-+- +--.+|+ -| .+||.|..+|.||++++|||.||+|..+|..||-.|+|. ||.+|+.++|.+. T Consensus 625 G~iRWi~~q~~~~~~GElYR~IAD~~GiFVQPA~yEAFGLTv~EAM~cGLPTFAT~~GGP~EII~--dgvsGFhIdP~~~ 702 (790) T TIGR02470 625 GQIRWIGAQLNRVRNGELYRYIADTKGIFVQPALYEAFGLTVVEAMTCGLPTFATRFGGPLEIIQ--DGVSGFHIDPYHG 702 (790) T ss_pred CCEEEEEECCCCCCCCEEEEEEECCCCCEECCCHHHHHHHHHHHHHHCCCHHHHHHCCCCEEEEE--ECCCCCEECCCCH T ss_conf 71677000068531464676442577503333044453257999863011246762689636877--2883542267558 Q ss_pred HHHHHHHHHHH----CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 99999999998----698999999999999999829989999999999 Q gi|255764474|r 301 HALEKAVLYFM----NSKKIMSDTGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 301 ~~la~~i~~l~----~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) ++.|+.|..-+ .||..++++++.|-+++.++|||+..++++..+ T Consensus 703 ~~~A~~l~~Ff~kC~~dP~~W~~iS~~gl~Ri~~~YTWk~y~~RLltL 750 (790) T TIGR02470 703 EEAAEKLVDFFEKCDEDPSYWQKISQGGLQRIYEKYTWKIYSERLLTL 750 (790) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 999999999998620383689999998867666405537899998874 No 51 >COG0438 RfaG Glycosyltransferase [Cell envelope biogenesis, outer membrane] Probab=99.90 E-value=1.4e-21 Score=164.34 Aligned_cols=222 Identities=28% Similarity=0.528 Sum_probs=180.3 Q ss_pred HCCEEEECCHHHHHHC-----CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCC--CCEEEEEEEECHHHHHHHHHHHH Q ss_conf 2668998898999743-----6883997488277782378432687887288988--72799951226445123666545 Q gi|255764474|r 116 RMDEVITTSQKSARFI-----ERPSTVIMHGVDTERFRPTSNKQEARRHLKISED--AKLIGCFGRIRKLKGTDLFVDCM 188 (352) Q Consensus 116 ~~d~ii~~s~~~~~~~-----~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~--~~~i~~~G~~~~~Kg~~~li~a~ 188 (352) ..+.++..++...... .....+++++++.+.+.+. ...+... ...++++|++.+.||++.+++++ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~i~~~g~~~~~k~~~~~i~~~ 221 (381) T COG0438 150 LADRVIAVSPALKELLEALGVPNKIVVIPNGIDTEKFAPA--------RIGLLPEGGKFVVLYVGRLDPEKGLDLLIEAA 221 (381) T ss_pred CCEEEEEECHHHHHHHHHHCCCCCEEEECCCCCHHHCCCC--------CCCCCCCCCCEEEEEECCCCCCCCHHHHHHHH T ss_conf 4459999458889999984677887997588587874700--------01445356987999964886547999999999 Q ss_pred HHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 6530047740588631333210001466776443112332212222223--31000000000001233322222210000 Q gi|255764474|r 189 INILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAM 266 (352) Q Consensus 189 ~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAm 266 (352) ..+....++..+.++|.++.. ...+...+...+..+.|.|.|..+ ++..+++.+|++++||..|+||++++||| T Consensus 222 ~~~~~~~~~~~~~~~g~~~~~----~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~v~ps~~e~~~~~~~Ea~ 297 (381) T COG0438 222 AKLKKRGPDIKLVIVGDGPER----REELEKLAKKLGLEDNVKFLGYVPDEELAELLASADVFVLPSLSEGFGLVLLEAM 297 (381) T ss_pred HHHHHHCCCEEEEEEEECCCH----HHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHHCCEEEECCCCCCCCHHHHHHH T ss_conf 985321586489999953412----8899999997088887899177898999999972809991786445588999999 Q ss_pred CCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 01001110278841010023488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 267 ASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 267 a~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) ++|+|+|+++.++.++++. ++.+|+++++.|.+++++++..++.+++.++.++..+++.+.+.|+|+..++++.++|. T Consensus 298 a~g~pvi~~~~~~~~~~~~--~~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (381) T COG0438 298 AAGTPVIASDVGGIPEVVE--DGETGLLVPPGDVEELADALEQLLEDPELREELGEAARERVEEEFSWERIAEQLLELYE 375 (381) T ss_pred HHCCEEEECCCCCHHHHHC--CCCEEEEECCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 8498699958998688430--69706998899999999999999869799999999999999986699999999999999 Q ss_pred HHHHH Q ss_conf 98860 Q gi|255764474|r 347 RLLRT 351 (352) Q Consensus 347 ~~l~~ 351 (352) ++... T Consensus 376 ~~~~~ 380 (381) T COG0438 376 ELLAE 380 (381) T ss_pred HHHHC T ss_conf 98615 No 52 >TIGR02468 sucrsPsyn_pln sucrose phosphate synthase; InterPro: IPR012819 Sucrose occupies a central position in the metabolic pathways of all plants and plays a variety of roles including transport sugar, storage reserve, compatible solute, and signal compound . This compound is produced by the combined action of two enzymes, sucrose-phosphate synthase (2.4.1.14 from EC) and sucrose-phosphate phosphatase (3.1.3.24 from EC), via the intermediate sucrose 6-phosphate. Several studies have shown a positive correlation between sucrose-phosphate synthase activity and plant growth rate and yield in agronomically important plants, though direct proof of a causal link is lacking. This entry represents sucrose-phosphate synthase from plants, which is known to exist in multigene families in several species of both monocots and dicots. The enzyme contains an N-terminal domain glucosyltransferase domain, a variable linker region, and a C-terminal domain similar to that of sucrose-phosphate phosphatase, the next and final enzyme of sucrose biosynthesis. The C-terminal domain is likely to serve a binding - not a catalytic - function, as sucrose-phosphate phosphatase is always encoded by a distinct protein. ; GO: 0046524 sucrose-phosphate synthase activity, 0005985 sucrose metabolic process. Probab=99.88 E-value=1.2e-22 Score=171.62 Aligned_cols=286 Identities=20% Similarity=0.274 Sum_probs=202.0 Q ss_pred HHHHHHHHHH-------HHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHH-------------HHHHH Q ss_conf 9989988754-------325970899628546899999999709981999757456201134-------------79899 Q gi|255764474|r 55 SSLLTCWKKP-------IGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRW-------------TRYLI 114 (352) Q Consensus 55 ~~~~~~~~~~-------~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~l~ 114 (352) .++..+.+.. ..-.|.+||.|...+.-.+..+...+..|..++.|+-......-+ +..+. T Consensus 291 ~Hi~~mSk~LGEqiG~G~pVWP~~IHGHYADAGd~AAlLSGALNVPMvlTGHsLGRdKLEQLLkqGR~s~eeI~~tYKI~ 370 (1072) T TIGR02468 291 SHIVNMSKVLGEQIGSGKPVWPYVIHGHYADAGDSAALLSGALNVPMVLTGHSLGRDKLEQLLKQGRLSKEEINSTYKIM 370 (1072) T ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHCCHHHHHHCCCCCHHHHHHHHHHCCCCCHHHHCHHHHH T ss_conf 99999999875520788421562650630046899998741102125761543342369999971467813442100121 Q ss_pred HH---------CCEEEECCHH-H---------------HHHCC--------------CCEEEECCCCCHHHHCCCCCHH- Q ss_conf 62---------6689988989-9---------------97436--------------8839974882777823784326- Q gi|255764474|r 115 SR---------MDEVITTSQK-S---------------ARFIE--------------RPSTVIMHGVDTERFRPTSNKQ- 154 (352) Q Consensus 115 ~~---------~d~ii~~s~~-~---------------~~~~~--------------~~~~vi~~gid~~~~~~~~~~~- 154 (352) +| +.-++.+|.+ . .+.++ .+..|||.|+|-....+..+.. T Consensus 371 RRIEAEEL~LDasE~ViTSTrQEIeEQW~LYdGFD~~LerkLRaR~rRgVsC~GRfMPRM~vIPPGmeF~~i~~~~~~d~ 450 (1072) T TIGR02468 371 RRIEAEELSLDASEIVITSTRQEIEEQWGLYDGFDVVLERKLRARARRGVSCYGRFMPRMAVIPPGMEFSHIVPQEDHDG 450 (1072) T ss_pred EEHHHHHHHCCCCEEEEECCCCCHHHHCCCCCCCCCEEEHHHHHCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCC T ss_conf 00003443201100010044421375156547752011010221112686424652773200487965441023577755 Q ss_pred HHHH---------------------HCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCC--CCEEEEEECCCCCCC- Q ss_conf 8788---------------------7288988727999512264451236665456530047--740588631333210- Q gi|255764474|r 155 EARR---------------------HLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHH--PGWTAVVVGKTTLKH- 210 (352) Q Consensus 155 ~~~~---------------------~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~--~~~~l~i~G~g~~~~- 210 (352) +... .|=-.+.++.||=++|-+|.||+-.|++||.+.++-+ -|.+| |+|+=+.-| T Consensus 451 d~~~~~~~~~~~~~~~~PpIWseimRFftnp~KPmILALaRPDPkKNiTTLvKAFGECRpLRELANLtL-ImGNRD~IDe 529 (1072) T TIGR02468 451 DMDTETEGNEESSAKADPPIWSEIMRFFTNPRKPMILALARPDPKKNITTLVKAFGECRPLRELANLTL-IMGNRDDIDE 529 (1072) T ss_pred CHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEECCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH-HHCCCCCHHH T ss_conf 313443213356777878417889986318898779732687873014778876337861467765774-4147522354 Q ss_pred -----CCHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCHHH----HHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC Q ss_conf -----0014667764431123322122222--23310000000----000012333222222100000100111027884 Q gi|255764474|r 211 -----YLFKKNLQRRIYANGLKKRILFIDE--QSSIEDWYRAL----NIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV 279 (352) Q Consensus 211 -----~~~~~~l~~~i~~~~l~~~V~~~g~--~~~~~~~~~~a----di~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~ 279 (352) ..-+-..-++|.++.|-.+|=|+-. +.|++++|+-| .+||-|..-|+||++++||.|.|+|+|+|..|| T Consensus 530 ms~~~~sVL~svLkLID~YDLYGqVAYPKHHkqsDVP~IYRLAAktKGVFINPA~~EPFGLTLIEAAa~GLP~VATKNGG 609 (1072) T TIGR02468 530 MSSGSASVLTSVLKLIDKYDLYGQVAYPKHHKQSDVPDIYRLAAKTKGVFINPALIEPFGLTLIEAAAYGLPMVATKNGG 609 (1072) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHCCCCEEEECCCC T ss_conf 51555589999998862003565546776788888724889974279657523210456436899986399779835868 Q ss_pred CCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 10100234884249965999899999999998698999999999999999829989999999999 Q gi|255764474|r 280 FSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 280 ~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) --|+.. .-.||++|+|.|.+++|+||.+|++|..++.+..+||-+.+. .|||..-++.|+.= T Consensus 610 PVDI~~--vL~NGLLvDPHdq~aIa~ALLkLvadK~LW~~CR~NGLkNIH-~FSWPeHCk~YLsr 671 (1072) T TIGR02468 610 PVDIHR--VLDNGLLVDPHDQEAIADALLKLVADKQLWAKCRQNGLKNIH-LFSWPEHCKTYLSR 671 (1072) T ss_pred CHHHHH--HHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCCCHHHHHHHHHH T ss_conf 133887--731787336776688999999986215778999736450245-67675577878888 No 53 >cd04950 GT1_like_1 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center Probab=99.86 E-value=1.7e-20 Score=157.04 Aligned_cols=216 Identities=19% Similarity=0.244 Sum_probs=150.8 Q ss_pred HHHHHHHHCCEEEECCHHHHHHC---CCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHH Q ss_conf 47989962668998898999743---688399748827778237843268788728898872799951226445123666 Q gi|255764474|r 109 WTRYLISRMDEVITTSQKSARFI---ERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFV 185 (352) Q Consensus 109 ~~~~l~~~~d~ii~~s~~~~~~~---~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li 185 (352) ..+-+++++|.++++|+...+.. ..++.+|+||+|.+.|.+.......... -.....++|+|+|.++.+-+.+++. T Consensus 146 ~e~~l~~~ad~v~~ts~~L~~~~~~~~~~~~~i~nGvD~~~F~~~~~~~~~~~~-~~~~~~p~igy~G~i~~~~D~~ll~ 224 (373) T cd04950 146 AERRLLKRADLVFTTSPSLYEAKRRLNPNVVLVPNGVDYEHFAAARDPPPPPAD-LAALPRPVIGYYGAIAEWLDLELLE 224 (373) T ss_pred HHHHHHHHCCEEEECCHHHHHHHHHCCCCEEEECCEECHHHHHHHCCCCCCCHH-HCCCCCCEEEEEECCCCCCCHHHHH T ss_conf 999999977999985999999887469998998882178884100157688045-5047998899992575214899999 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCC-C----CC Q ss_conf 5456530047740588631333210001466776443112332212222223--310000000000012333-2----22 Q gi|255764474|r 186 DCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLY-E----GF 258 (352) Q Consensus 186 ~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~-E----g~ 258 (352) + +.+.+|++.++++|.+.... ....+.+ ..+|+|+|.++ ++..+++.+|+.++|-.. | .+ T Consensus 225 ~----~a~~~p~~~~~liGp~~~~~--~~~~l~~-------~~Nv~~lG~~~~~~lp~~l~~~Dv~l~P~~~~~~t~~~~ 291 (373) T cd04950 225 A----LAKARPDWSFVLIGPVDVSI--DPSALLR-------LPNVHYLGPKPYKELPAYLAGFDVAILPFRLNELTRATS 291 (373) T ss_pred H----HHHHCCCCEEEEEECCCCCC--CHHHHHC-------CCCEEECCCCCHHHHHHHHHHCCEEEEECCCCHHHHCCC T ss_conf 9----99988996899994388755--8345625-------998798489899999999985787774120554542468 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 22210000010011102788410100234884249965999899999999998698999999999999999829989999 Q gi|255764474|r 259 GLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEA 338 (352) Q Consensus 259 gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a 338 (352) |+++.|+||+|+|||+|+.+.+.++ .+ . .+.-.+|++++.++|.+.+....... .++.++.+. ..||++.+ T Consensus 292 P~Kl~EYlA~G~PVVst~~~~~~~~----~~-~-~~~~~~~~~~F~~~i~~al~~~~~~~--~~~~~~~a~-~~sW~~r~ 362 (373) T cd04950 292 PLKLFEYLAAGKPVVATPLPEVRRY----ED-E-VVLIADDPEEFVAAIEKALLEDGPAR--ERRRLRLAA-QNSWDARA 362 (373) T ss_pred CCHHHHHHHCCCCEEECCCHHHHHC----CC-C-EEEECCCHHHHHHHHHHHHHCCCHHH--HHHHHHHHH-HCCHHHHH T ss_conf 6379999866998896584887731----68-6-79977999999999999984389288--999999999-79999999 Q ss_pred HHHHHHHHH Q ss_conf 999999999 Q gi|255764474|r 339 SDIGKVYDR 347 (352) Q Consensus 339 ~~~~~iY~~ 347 (352) +++++..++ T Consensus 363 ~~~~~~i~~ 371 (373) T cd04950 363 AEMLEALQE 371 (373) T ss_pred HHHHHHHHH T ss_conf 999999975 No 54 >KOG1387 consensus Probab=99.80 E-value=2.8e-17 Score=135.47 Aligned_cols=225 Identities=18% Similarity=0.227 Sum_probs=174.9 Q ss_pred HHHCCEEEECCHHHHHHC-----CCCEEEECCCCCHHHHCCCCCHHHHHHHCC-CCCCCCEEEEEEEECHHHHHHHHHHH Q ss_conf 962668998898999743-----688399748827778237843268788728-89887279995122644512366654 Q gi|255764474|r 114 ISRMDEVITTSQKSARFI-----ERPSTVIMHGVDTERFRPTSNKQEARRHLK-ISEDAKLIGCFGRIRKLKGTDLFVDC 187 (352) Q Consensus 114 ~~~~d~ii~~s~~~~~~~-----~~~~~vi~~gid~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~G~~~~~Kg~~~li~a 187 (352) -+.+|-+.+.|..+...+ ...+.+|+...+++... ...+ ...+.+.++++|.+.|+|++. +++. T Consensus 220 G~~ad~vm~NssWT~nHI~qiW~~~~~~iVyPPC~~e~lk---------s~~~te~~r~~~ll~l~Q~RPEKnH~-~Lql 289 (465) T KOG1387 220 GSKADIVMTNSSWTNNHIKQIWQSNTCSIVYPPCSTEDLK---------SKFGTEGERENQLLSLAQFRPEKNHK-ILQL 289 (465) T ss_pred CCCCEEEEECCHHHHHHHHHHHHCCCEEEECCCCCHHHHH---------HHHCCCCCCCEEEEEEEECCCCCCCH-HHHH T ss_conf 4642299962665677899986026214872898878888---------77424577604789876037655558-8999 Q ss_pred HHHHHCCC------CCEEEEEECCCC-CCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCC Q ss_conf 56530047------740588631333-21000146677644311233221222222--3310000000000012333222 Q gi|255764474|r 188 MINILPHH------PGWTAVVVGKTT-LKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGF 258 (352) Q Consensus 188 ~~~l~~~~------~~~~l~i~G~g~-~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~ 258 (352) ++....+. ++++|+++|+-. ++|+.+.+.|++..+++.++++|.|.-.. +++..+++.|.+.++.-+.|.| T Consensus 290 ~Al~~~~~pl~a~~~~iKL~ivGScRneeD~ervk~Lkd~a~~L~i~~~v~F~~N~Py~~lv~lL~~a~iGvh~MwNEHF 369 (465) T KOG1387 290 FALYLKNEPLEASVSPIKLIIVGSCRNEEDEERVKSLKDLAEELKIPKHVQFEKNVPYEKLVELLGKATIGVHTMWNEHF 369 (465) T ss_pred HHHHHHCCCHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHCCCEEEHHHHHHHHC T ss_conf 99997518201046882599970547811399999888789862875453899559879999986115554524455203 Q ss_pred CCCCCCCCCCEEEEEECC-CCCCCHHCCCCCCC-EEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 222100000100111027-88410100234884-2499659998999999999986-98999999999999999829989 Q gi|255764474|r 259 GLTPLEAMASGIPVVASN-TGVFSELLDPENAK-AGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRERAVKHFSIV 335 (352) Q Consensus 259 gl~~lEAma~G~PvI~s~-~~~~~e~i~~~~g~-~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~~~~~~fs~~ 335 (352) |+.+.|+||+|+-.|+-+ .|..=|+|.+.+|+ +|++.+ +.++.++++.++.. +++.+..|+++||... .+|+-. T Consensus 370 GIsVVEyMAAGlIpi~h~SgGP~lDIV~~~~G~~tGFla~--t~~EYaE~iLkIv~~~~~~r~~~r~~AR~s~-~RFsE~ 446 (465) T KOG1387 370 GISVVEYMAAGLIPIVHNSGGPLLDIVTPWDGETTGFLAP--TDEEYAEAILKIVKLNYDERNMMRRNARKSL-ARFGEL 446 (465) T ss_pred CHHHHHHHHCCCEEEEECCCCCCEEEEECCCCCCCEEECC--CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-HHHHHH T ss_conf 5568998755726887078997323640457860101158--7289999999999719888888899999999-886688 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999998860 Q gi|255764474|r 336 KEASDIGKVYDRLLRT 351 (352) Q Consensus 336 ~~a~~~~~iY~~~l~~ 351 (352) +.-+.+...+.++|++ T Consensus 447 ~F~kd~~~~i~kll~e 462 (465) T KOG1387 447 KFDKDWENPICKLLEE 462 (465) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 8877676777775333 No 55 >TIGR02918 TIGR02918 conserved hypothetical protein TIGR02918; InterPro: IPR014267 Members of this entry are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus.. Probab=99.78 E-value=5.2e-19 Score=147.06 Aligned_cols=219 Identities=19% Similarity=0.285 Sum_probs=170.0 Q ss_pred HHHHHHC---CEEEECCHHH--------HHHCC--CCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECH Q ss_conf 9899626---6899889899--------97436--883997488277782378432687887288988727999512264 Q gi|255764474|r 111 RYLISRM---DEVITTSQKS--------ARFIE--RPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRK 177 (352) Q Consensus 111 ~~l~~~~---d~ii~~s~~~--------~~~~~--~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~ 177 (352) .+.-.++ |.-|+.++.- ++|.+ +++.+||=|= .+....+.. .-| ..+-|..++||-. T Consensus 271 EYqF~nA~~iDFFItATd~Q~~~l~~QF~kY~~~~P~i~TIPVGS-l~~L~~p~W--~~R-------kp~SiiTASRLA~ 340 (511) T TIGR02918 271 EYQFSNADYIDFFITATDIQNQILLEQFKKYKNIEPKIYTIPVGS-LDELQYPEW--QER-------KPFSIITASRLAK 340 (511) T ss_pred HHHCCCCCEEEEEEECHHHHHHHHHHHHHHHCCCCCEEEEEECCC-HHHCCCCCC--CCC-------CCHHHHHHHHHCC T ss_conf 553156866511340057889999998887479987788850575-433578863--464-------5215677734137 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCC Q ss_conf 45123666545653004774058863133321000146677644311233221222222331000000000001233322 Q gi|255764474|r 178 LKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEG 257 (352) Q Consensus 178 ~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg 257 (352) +|.+|=|++|.-+..+..|+++|-|.|.|.++ ..|++.|++..-+|-|++.|.+ ++...|+.=.+|++-|.+|| T Consensus 341 EKHiDWLV~AVv~Ak~~~P~l~FDIYG~GgE~-----~~L~~iI~~n~A~DYI~LkGH~-~L~~vY~~YelyLsaStSEG 414 (511) T TIGR02918 341 EKHIDWLVKAVVKAKKSVPELTFDIYGEGGEK-----SKLKKIINENQAEDYIRLKGHK-NLSEVYKDYELYLSASTSEG 414 (511) T ss_pred CCCHHHHHHHHHHHCCCCCCEEEEECCCCHHH-----HHHHHHHHHHCCCCHHHHCCCC-CHHHHHCCCCEEEEECCCCC T ss_conf 67126888999995133885110003563788-----9999987631200124311543-35666232322345212144 Q ss_pred CCCCCCCCCCCEEEEEECCCC-CCCHHCCCCCCCEEEEECCC-C-------HHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 222210000010011102788-41010023488424996599-9-------89999999999869899999999999999 Q gi|255764474|r 258 FGLTPLEAMASGIPVVASNTG-VFSELLDPENAKAGVIVPPR-N-------LHALEKAVLYFMNSKKIMSDTGNRGRERA 328 (352) Q Consensus 258 ~gl~~lEAma~G~PvI~s~~~-~~~e~i~~~~g~~G~~~~~~-d-------~~~la~~i~~l~~~~~~~~~~~~~a~~~~ 328 (352) |||+++||.-+|+|.|.-|++ |.+.+|+ |+.||+++|.+ + .+.+|++|.++.-+.....++.+...+.| T Consensus 415 FGLTLmEAvGSGLgmIGFDV~YGN~TFI~--d~~NGYLIP~~~~~~~~~~I~~~lA~~Iv~~Fv~~~~~~~~~~~SY~IA 492 (511) T TIGR02918 415 FGLTLMEAVGSGLGMIGFDVNYGNPTFIK--DNKNGYLIPIDEEEDDEDEIITALAEKIVEYFVNENDIDAFHEVSYQIA 492 (511) T ss_pred CHHHHHHHHHCCCCCCCCCCCCCCCCEEE--CCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 11579999750433236618743887024--0888404334578788789999999998986126786002656317888 Q ss_pred HHHCCHHHHHHHHHHHHHHH Q ss_conf 98299899999999999998 Q gi|255764474|r 329 VKHFSIVKEASDIGKVYDRL 348 (352) Q Consensus 329 ~~~fs~~~~a~~~~~iY~~~ 348 (352) +.|.-+.+++++.++-++| T Consensus 493 -~~Flt~~i~~kW~~Lv~ev 511 (511) T TIGR02918 493 -EGFLTEEIVEKWKKLVEEV 511 (511) T ss_pred -HHHHHHHHHHHHHHHHCCC T ss_conf -6631489999999985039 No 56 >PRK13608 diacylglycerol glucosyltransferase; Provisional Probab=99.77 E-value=7e-17 Score=132.75 Aligned_cols=261 Identities=14% Similarity=0.186 Sum_probs=178.5 Q ss_pred HHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC----CCEEE Q ss_conf 75432597089962854689999999970998199975745620113479899626689988989997436----88399 Q gi|255764474|r 62 KKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE----RPSTV 137 (352) Q Consensus 62 ~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~----~~~~v 137 (352) +.....+||++-+..+...+..+..+.....|+..+.|.... -..|+...+|..++.++..++.+. .+..+ T Consensus 98 ~~L~~~kPDvII~T~P~~~~s~lk~~~~~~iP~~tViTD~~~-----H~~W~~~~~D~y~Va~~~~~~~l~~~Gi~~~kI 172 (391) T PRK13608 98 NLLIKEKPDLILLTFPTPVMSVLTEQFNINIPVATVMTDYRL-----HKNWITPYSTRYYVATKETKQDFIDVGIDPSTV 172 (391) T ss_pred HHHHHHCCCEEEECCHHHHHHHHHHHCCCCCCEEEEECCHHH-----HHHHCCCCCCEEEECCHHHHHHHHHCCCCHHHE T ss_conf 999984929999998289999999824999988999587133-----230368999979969999999999849997688 Q ss_pred ECCCCCH-HHHCCCCCHHHHHHHCCCCCCCCEEEEEE-EECHHHHHHHHHHHHHHHHCCCCCEEEE-EECCCCCCCCCHH Q ss_conf 7488277-78237843268788728898872799951-2264451236665456530047740588-6313332100014 Q gi|255764474|r 138 IMHGVDT-ERFRPTSNKQEARRHLKISEDAKLIGCFG-RIRKLKGTDLFVDCMINILPHHPGWTAV-VVGKTTLKHYLFK 214 (352) Q Consensus 138 i~~gid~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~G-~~~~~Kg~~~li~a~~~l~~~~~~~~l~-i~G~g~~~~~~~~ 214 (352) ...||+. ..|....++.+.+++.+++++.+.++..| .+.-.++.+.+++ ++....++++++ ++|.. .... T Consensus 173 ~vtGIPV~~~F~~~~~~~~~~~~~~l~~~~~~iLv~gG~~G~~~~~~~~i~---~ll~~~~~~qivvvcGrN----~~L~ 245 (391) T PRK13608 173 KVTGIPIDNKFETPINQKQWLIDNNLDPDKQTILMSAGAFGVSKGFDTMIT---DILAKSANAQVVMICGKS----KELK 245 (391) T ss_pred EEEEEECCHHHCCCCCHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHH---HHHHCCCCCEEEEEECCC----HHHH T ss_conf 998343586773755678999971899777689996886310246999999---997159996599990899----9999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCH----HCCCCCC Q ss_conf 6677644311233221222222331000000000001233322222210000010011102-7884101----0023488 Q gi|255764474|r 215 KNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSE----LLDPENA 289 (352) Q Consensus 215 ~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e----~i~~~~g 289 (352) +.|+.... -..+|..+|+++++..+|++||+++.- +-|+++.||+++|+|+|.. .++|=.+ ++. T Consensus 246 ~~L~~~~~---~~~~v~vlG~t~~m~~lM~asDllITK----pGGlT~sEAla~~lPmii~~piPGQEe~Na~~l~---- 314 (391) T PRK13608 246 RSLTAKFK---SNENVLILGYTKHMNEWMASSQLMITK----PGGITITEGFARCIPMIFLNPAPGQELENALYFE---- 314 (391) T ss_pred HHHHHHHC---CCCCEEEECCCCCHHHHHHHCCEEEEC----CCHHHHHHHHHHCCCEEECCCCCCCHHHHHHHHH---- T ss_conf 99997624---599769970705199999865299967----8667999999958998975799974466799999---- Q ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 290 KAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 290 ~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) ++|.-+-..+++++++.+.+|++|++...+|.+++++.. .-++...+++.+.++.. T Consensus 315 ~~G~a~~~~~~~~~~~~v~~l~~~~~~l~~m~~~~~~~~-~p~a~~~I~~~~~~l~~ 370 (391) T PRK13608 315 EKGFGKIADTPEEAIKIVASLTNGNEQLTNMISTMEQDK-IKYATQTICRDLLDLIG 370 (391) T ss_pred HCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHH T ss_conf 689768859999999999998559999999999999717-99629999999999872 No 57 >cd03793 GT1_Glycogen_synthase_GSY2_like Glycogen synthase, which is most closely related to the GT1 family of glycosyltransferases, catalyzes the transfer of a glucose molecule from UDP-glucose to a terminal branch of a glycogen molecule, a rate-limit step of glycogen biosynthesis. GSY2, the member of this family in S. cerevisiae, has been shown to possess glycogen synthase activity. Probab=99.76 E-value=6.2e-16 Score=126.38 Aligned_cols=284 Identities=19% Similarity=0.264 Sum_probs=168.8 Q ss_pred HCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHH--------------HHH-----------------HHHHHH Q ss_conf 2597089962854689999999970998199975745620--------------113-----------------479899 Q gi|255764474|r 66 GQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRN--------------HSR-----------------WTRYLI 114 (352) Q Consensus 66 ~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~-----------------~~~~l~ 114 (352) .....+.|.|.+........++ ....++..+|+-|+... ... ..|-.. T Consensus 146 ~~~~vvaHfHEWm~g~gll~lr-~~~~~vaTvFTTHATllGR~laagn~d~Yn~L~~~n~d~eA~~~~i~~k~~iEraaA 224 (590) T cd03793 146 DEPAVVAHFHEWQAGVGLPLLR-KRKVDVSTIFTTHATLLGRYLCAGNVDFYNNLDYFDVDKEAGKRGIYHRYCIERAAA 224 (590) T ss_pred CCCCEEEEEEHHHHHHHHHHHH-HHCCCEEEEEEECHHHHHHHHHCCCCHHHHHHHHCCCHHHHCCCCCCHHHHHHHHHH T ss_conf 8986899821676708999997-627984788550121353687648935765240078043400359801788999887 Q ss_pred HHCCEEEECCHHHHH----HCC-CCEEEECCCCCHHHHCCCCCHH--------H----HHHHC----CCCCC-CCEEEEE Q ss_conf 626689988989997----436-8839974882777823784326--------8----78872----88988-7279995 Q gi|255764474|r 115 SRMDEVITTSQKSAR----FIE-RPSTVIMHGVDTERFRPTSNKQ--------E----ARRHL----KISED-AKLIGCF 172 (352) Q Consensus 115 ~~~d~ii~~s~~~~~----~~~-~~~~vi~~gid~~~~~~~~~~~--------~----~~~~~----~~~~~-~~~i~~~ 172 (352) ..+|-..++|+-++. .+. +|..|.|||+|..+|....+-. . .+.++ ..+.+ ...+... T Consensus 225 h~AdvFTTVSeITa~Ea~~lL~r~pDvVtpNGlnv~~~~a~~efqnlh~~~k~ki~~fv~ghf~g~~dfdldkTl~~fts 304 (590) T cd03793 225 HCAHVFTTVSEITAYEAEHLLKRKPDVVLPNGLNVKKFSALHEFQNLHAQSKEKINEFVRGHFYGHYDFDLDKTLYFFTA 304 (590) T ss_pred HCCCEEEECHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEE T ss_conf 41988885406449999998289989877898774345207888888999999999999976257667784635899975 Q ss_pred EEEC-HHHHHHHHHHHHHHHH----CCCCCE---EEEEECCCC------------------------------------- Q ss_conf 1226-4451236665456530----047740---588631333------------------------------------- Q gi|255764474|r 173 GRIR-KLKGTDLFVDCMINIL----PHHPGW---TAVVVGKTT------------------------------------- 207 (352) Q Consensus 173 G~~~-~~Kg~~~li~a~~~l~----~~~~~~---~l~i~G~g~------------------------------------- 207 (352) ||.. ..||.|.+|+|+++|. ....+. -++++=.+. T Consensus 305 GRYEf~NKG~DvfIeaL~rLn~~lk~~~~~~tVvAFii~Pa~t~~~nv~~l~g~a~~k~l~~~~~~i~~~~g~r~~e~~l 384 (590) T cd03793 305 GRYEFSNKGADMFLEALARLNYLLKVEGSDTTVVAFFIMPAKTNNFNVESLKGQAVRKQLRDTVNSVKEKIGKRLFEAAL 384 (590) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 41143267766899999999899875479973999999416657757223212578888998999998876367888875 Q ss_pred -----CCCCCH----HHHHH----------------------------HHHHHCCCC----CC--CCCCCC----CC--- Q ss_conf -----210001----46677----------------------------644311233----22--122222----23--- Q gi|255764474|r 208 -----LKHYLF----KKNLQ----------------------------RRIYANGLK----KR--ILFIDE----QS--- 237 (352) Q Consensus 208 -----~~~~~~----~~~l~----------------------------~~i~~~~l~----~~--V~~~g~----~~--- 237 (352) ..++.. .-.|| +.++..+|. ++ |+|.+. .+ T Consensus 385 ~g~~p~~~~~l~~~d~v~lkr~i~~~~~~~lpPi~TH~l~~~~~D~iL~~~rr~~L~N~~~drVKvIF~P~fLs~~dgl~ 464 (590) T cd03793 385 KGKLPDLEELLDKEDKVMLKRRIFALQRHSLPPVVTHNMVDDANDPILNHIRRIQLFNSPEDRVKVVFHPEFLSSTNPLL 464 (590) T ss_pred CCCCCCHHHHCCHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECHHCCCCCCCC T ss_conf 05789856742816677888888763036899814623655666789999887067788778368998010036899877 Q ss_pred --CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC---CCCCEEEEE-CCC--C----HHHHHH Q ss_conf --3100000000000123332222221000001001110278841010023---488424996-599--9----899999 Q gi|255764474|r 238 --SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP---ENAKAGVIV-PPR--N----LHALEK 305 (352) Q Consensus 238 --~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~---~~g~~G~~~-~~~--d----~~~la~ 305 (352) +..++++.||+.|+||++|++|.+.+|+.++|+|.|+||..|...++.. .....|..| +.. + .+++++ T Consensus 465 ~m~Y~efv~Gc~LgVFPSyYEPWGYTPlE~~a~gvPsITTdLsGFG~~~~~~~~~~~~~GV~VvdR~~~~~~Esv~~l~~ 544 (590) T cd03793 465 GLDYEEFVRGCHLGVFPSYYEPWGYTPAECTVMGIPSITTNLSGFGCFMEEHIEDPESYGIYIVDRRFKSPDESVQQLTQ 544 (590) T ss_pred CCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHHCCCCEEECCCHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHH T ss_conf 87699996457655576534887888698876279802216603779999864164237479997898997999999999 Q ss_pred HHHHHHC-CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999986-989999999999999998299899999999999998860 Q gi|255764474|r 306 AVLYFMN-SKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 306 ~i~~l~~-~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) .+..... ++.++.....++.+ ..+.++|++...-|.+-|+-.|+| T Consensus 545 ~~~~f~~~s~~qr~~~R~rae~-ls~~~dW~~~~~yY~~A~~lAL~r 590 (590) T cd03793 545 YMYEFCQLSRRQRIIQRNRTER-LSDLLDWRNLGRYYRKARQLALSR 590 (590) T ss_pred HHHHHHCCCHHHHHHHHHHHHH-HHHHCCHHHHHHHHHHHHHHHHCC T ss_conf 9999974999999999999999-998668999999999999998449 No 58 >PRK13609 diacylglycerol glucosyltransferase; Provisional Probab=99.73 E-value=4.1e-16 Score=127.61 Aligned_cols=260 Identities=12% Similarity=0.104 Sum_probs=171.0 Q ss_pred HHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC----CCEEE Q ss_conf 75432597089962854689999999970998199975745620113479899626689988989997436----88399 Q gi|255764474|r 62 KKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE----RPSTV 137 (352) Q Consensus 62 ~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~----~~~~v 137 (352) +.....+||++-+..+...+..+..+.....|+..+.+... .-..|+....|..++.|+..+..+. .+..+ T Consensus 98 ~li~~~kPDvII~T~P~~~l~~lk~~~~~~iP~~tViTD~~-----~H~~Wi~~~~D~y~Vase~~k~~l~~~Gv~~~kI 172 (388) T PRK13609 98 LLLQAEKPDIVINTFPIIAVPELKKQTGISIPVYNVLTDFC-----VHKIWIHREVDRYFVATDHVKEVMVDIGVPAEQI 172 (388) T ss_pred HHHHHCCCCEEEECCCHHHHHHHHHHCCCCCCEEEEECCCC-----CHHHHCCCCCCEEEECCHHHHHHHHHHCCCHHHE T ss_conf 99998295999988878999999984599998899947852-----0464557899979939899999999809988899 Q ss_pred ECCCCCH-HHHCCCCCHHHHHHHCCCCCCCCEEEEEEE-ECHHHHHHHHHHHHHHHHCCCCCEEEE-EECCCCCCCCCHH Q ss_conf 7488277-782378432687887288988727999512-264451236665456530047740588-6313332100014 Q gi|255764474|r 138 IMHGVDT-ERFRPTSNKQEARRHLKISEDAKLIGCFGR-IRKLKGTDLFVDCMINILPHHPGWTAV-VVGKTTLKHYLFK 214 (352) Q Consensus 138 i~~gid~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~-~~~~Kg~~~li~a~~~l~~~~~~~~l~-i~G~g~~~~~~~~ 214 (352) ...|++. ..|....++...+++++++++.++++.+|. ..-.+.+..+++.+ . ..++++++ ++|... ... T Consensus 173 ~vtGiPVr~~F~~~~~k~~~r~~lgL~~d~~~vLimgGg~G~~g~i~~l~~~L---~-~~~~~qiiVVcGrN~----~L~ 244 (388) T PRK13609 173 VETGIPIRSSFELKINPDIIYNKYQLCKNKKILLIVAGAHGVLGNVKELCQSF---M-SVPDLQVVVVCGKNE----ALK 244 (388) T ss_pred EEECCCCCHHHCCCCCHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHHHHH---H-CCCCCEEEEEECCCH----HHH T ss_conf 98898438787275887899998289987847999766012114799999997---4-589924999908998----999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCH----HCCCCCC Q ss_conf 6677644311233221222222331000000000001233322222210000010011102-7884101----0023488 Q gi|255764474|r 215 KNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSE----LLDPENA 289 (352) Q Consensus 215 ~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e----~i~~~~g 289 (352) ++|++.... ..++|...|+.+++..+|++||+++.- +-|+++.||++||+|+|.. ..+|-.+ ++. +. T Consensus 245 ~~L~~~~~~--~~~~v~vlGf~~~~~~~~~~~d~~i~k----~Gg~t~~E~~~~~~P~i~~~~~pgqe~~N~~~~~-~~- 316 (388) T PRK13609 245 QDLVGLQET--NPDALKVFGYVENIDELFRVTSCMITK----PGGITLSEAAALQVPVILYKPVPGQENENAMYFE-RK- 316 (388) T ss_pred HHHHHHHHC--CCCCEEEEECHHHHHHHHHHCCEEEEC----CCHHHHHHHHHHCCCEEECCCCCCHHHHHHHHHH-HC- T ss_conf 999988750--799469950452099999857599957----8645899999948998970689961677799999-78- Q ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 42499659998999999999986989999999999999998299899999999999 Q gi|255764474|r 290 KAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVY 345 (352) Q Consensus 290 ~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY 345 (352) ..|..+ .|++.+++.+..|++|++..++|++++++.. .-...+.+++.+++-+ T Consensus 317 g~~~~~--~~~~~~~~~~~~ll~~~~~l~~m~~~~~~~~-~p~aa~~I~~~il~~~ 369 (388) T PRK13609 317 GAAVVI--RDDSEVFAKTEALLQDDMKLLQMKEAMKSIY-LPEPADHIVDTILAEN 369 (388) T ss_pred CCEEEE--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHH T ss_conf 987997--9999999999999769999999999998627-9858999999999863 No 59 >pfam05693 Glycogen_syn Glycogen synthase. This family consists of the eukaryotic glycogen synthase proteins GYS1, GYS2 and GYS3. Glycogen synthase (GS) is the enzyme responsible for the synthesis of -1,4-linked glucose chains in glycogen. It is the rate limiting enzyme in the synthesis of the polysaccharide, and its activity is highly regulated through phosphorylation at multiple sites and also by allosteric effectors, mainly glucose 6-phosphate (G6P). Probab=99.69 E-value=1.8e-14 Score=116.57 Aligned_cols=283 Identities=20% Similarity=0.318 Sum_probs=168.1 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHH--------------HHH------------HHHH-----HHHH Q ss_conf 97089962854689999999970998199975745620--------------113------------4798-----9962 Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRN--------------HSR------------WTRY-----LISR 116 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~------------~~~~-----l~~~ 116 (352) ...|.|.|.|......+.++ .....+..+|+-|+... ... +.|+ .... T Consensus 143 ~~VvahfHEWmaG~gll~lr-~~~~~VaTvFTTHAT~LGR~l~~~n~d~Yn~L~~fn~D~eA~~~~I~~k~~iEraaAh~ 221 (633) T pfam05693 143 PAVIAHFHEWQAGVGLILCR-KRKLDVATVFTTHATLLGRYLCAGGVDFYNNLDKFDVDAEAGKRQIYHRYCIERAAAHT 221 (633) T ss_pred CCEEEEHHHHHHHHHHHHHH-HCCCCEEEEEEHHHHHHHHHHHCCCCHHHHHHCCCCHHHHHHHCCCHHHHHHHHHHHHC T ss_conf 82898715767616799986-41797137743307888789874895377655038847678870981157699888722 Q ss_pred CCEEEECCHHHHH----HCC-CCEEEECCCCCHHHHCCCCC--------HHH----HHHHC----CCC-CCCCEEEEEEE Q ss_conf 6689988989997----436-88399748827778237843--------268----78872----889-88727999512 Q gi|255764474|r 117 MDEVITTSQKSAR----FIE-RPSTVIMHGVDTERFRPTSN--------KQE----ARRHL----KIS-EDAKLIGCFGR 174 (352) Q Consensus 117 ~d~ii~~s~~~~~----~~~-~~~~vi~~gid~~~~~~~~~--------~~~----~~~~~----~~~-~~~~~i~~~G~ 174 (352) +|-..++|+-++. .+. +|..|.|||+|..+|....+ +.. .+.++ ..+ ++..++...|| T Consensus 222 advFTTVSeITa~Eae~lL~r~pDvVtPNGlnv~~f~a~~efqnlh~~ak~ki~~fv~ghf~g~~dfdld~Tl~~ftsGR 301 (633) T pfam05693 222 AHVFTTVSEITALEAEHLLKRKPDIITPNGLNVKKFSAVHEFQNLHAQNKEKINDFVRGHFYGHLDFDLDKTLYFFIAGR 301 (633) T ss_pred CCEEEECCHHHHHHHHHHCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECC T ss_conf 88546521102698999628998888888767543520578889999999999999998732546788563489996122 Q ss_pred EC-HHHHHHHHHHHHHHHH----CCCCC---EEEEEECCCCCC-------C----------------------------- Q ss_conf 26-4451236665456530----04774---058863133321-------0----------------------------- Q gi|255764474|r 175 IR-KLKGTDLFVDCMINIL----PHHPG---WTAVVVGKTTLK-------H----------------------------- 210 (352) Q Consensus 175 ~~-~~Kg~~~li~a~~~l~----~~~~~---~~l~i~G~g~~~-------~----------------------------- 210 (352) .. ..||+|.+|+|+++|. ....+ +-|+++-.+... . T Consensus 302 YEF~NKG~DvFIEALarLn~~lk~~~~~~tVvAFii~Pa~t~~~nve~l~g~av~k~l~d~~~~i~~~ig~r~~e~~l~g 381 (633) T pfam05693 302 YEFSNKGADMFIESLARLNHRLKTSGSKVTVVAFLIMPAKTNSFNVESLKGQAVIKQLRDTVNEIKEKVGKRIFESCLRG 381 (633) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 01227756779999999989998538998499999932776874766631236787898899999987657788888516 Q ss_pred ----------CC----------------------------HHHHHHHHHHHCCC----CCC--CCCCCC----CC----- Q ss_conf ----------00----------------------------14667764431123----322--122222----23----- Q gi|255764474|r 211 ----------YL----------------------------FKKNLQRRIYANGL----KKR--ILFIDE----QS----- 237 (352) Q Consensus 211 ----------~~----------------------------~~~~l~~~i~~~~l----~~~--V~~~g~----~~----- 237 (352) .. ....+-+.++..+| +++ |+|.+. .+ T Consensus 382 ~~p~~~~~l~~~d~~~lkr~i~~~~r~~lPPi~THnl~dd~~DpIL~~~rr~~L~N~~~drVKVIF~P~fL~~~d~lf~l 461 (633) T pfam05693 382 HLPEMDELLDSADLVLLKRCIFALQRDSLPPVCTHNMIDDANDPILNQIRRVGLFNSPSDRVKVIFHPEFLSSTSPLLGL 461 (633) T ss_pred CCCCHHHHCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCHHHHHHHHCCCCCCCCCCEEEEECCHHCCCCCCCCCC T ss_conf 78984552384788899999886403689971333577676688999998737868877735899620012688987787 Q ss_pred CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCC--C-CCEEEEE-CCC--C----HHHHHHHH Q ss_conf 31000000000001233322222210000010011102788410100234--8-8424996-599--9----89999999 Q gi|255764474|r 238 SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPE--N-AKAGVIV-PPR--N----LHALEKAV 307 (352) Q Consensus 238 ~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~--~-g~~G~~~-~~~--d----~~~la~~i 307 (352) +..++.+.||+.++||++|++|-+.+|+.++|+|.|+||..|.--++... + ...|+.| +.. + .+++++.| T Consensus 462 ~Y~efv~Gc~LgVFPSYYEPWGYTPlE~~a~gVPsITTdLSGFG~~~~~~~~~~~~~GI~VvdR~~~n~~Esv~ql~~~m 541 (633) T pfam05693 462 DYEEFVRGCHLGVFPSYYEPWGYTPAECTVMGIPSVTTNLSGFGCFMEEHIEDPKDYGIYIVDRRFKSPDESVQQLAQFM 541 (633) T ss_pred CHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEECCCHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 79998524664446543589888859986517880113751578999987306434757999788898799999999999 Q ss_pred HHHHC-CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC Q ss_conf 99986-9899999999999999982998999999999999988609 Q gi|255764474|r 308 LYFMN-SKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRTA 352 (352) Q Consensus 308 ~~l~~-~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~a 352 (352) ..... .+.++.....++.. ..+.++|++...-|.+-|+-.|++| T Consensus 542 ~~f~~~s~rqri~~Rnrter-LS~~~dW~~~~~yY~~A~~lAL~ra 586 (633) T pfam05693 542 YEFCQQSRRQRIIQRNRTER-LSDLLDWKRLGRYYRKARQLALRRA 586 (633) T ss_pred HHHHCCCHHHHHHHHHHHHH-HHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 99974999999999999999-9886489999999999999999762 No 60 >cd03816 GT1_ALG1_like This family is most closely related to the GT1 family of glycosyltransferases. The yeast gene ALG1 has been shown to function as a mannosyltransferase that catalyzes the formation of dolichol pyrophosphate (Dol-PP)-GlcNAc2Man from GDP-Man and Dol-PP-Glc-NAc2, and participates in the formation of the lipid-linked precursor oligosaccharide for N-glycosylation. In humans ALG1 has been associated with the congenital disorders of glycosylation (CDG) designated as subtype CDG-Ik. Probab=99.69 E-value=1.1e-13 Score=111.37 Aligned_cols=261 Identities=13% Similarity=0.211 Sum_probs=177.2 Q ss_pred CCEEEEECCC---HHHHHHHHHHHHHCCCEEEEECCHHHHH--------------HHHHHHHHHHHCCEEEECCHHHHHH Q ss_conf 9708996285---4689999999970998199975745620--------------1134798996266899889899974 Q gi|255764474|r 68 NSRIWHARRN---NEMLLGVMMRDVLRMPLKLVFTSPSQRN--------------HSRWTRYLISRMDEVITTSQKSARF 130 (352) Q Consensus 68 ~~~ivh~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~l~~~~d~ii~~s~~~~~~ 130 (352) .++.+-.+.+ +......+...+.+.++++++|+..+.- ...+.+.+.+.+|.-+|+|+.++++ T Consensus 95 ~~~~iLvQNPPsiPtl~v~~l~~~lr~~klIIDWHN~gyTIL~l~~g~~h~lV~~~k~~E~~fgr~a~~~lcVT~AM~~~ 174 (415) T cd03816 95 PADYILIQNPPSIPTLLIAWLYCLLRRTKLIIDWHNYGYTILALKLGENHPLVRLAKWYEKLFGRLADYNLCVTKAMKED 174 (415) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCEEEEEECCCCCHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCHHHHHH T ss_conf 87579841887643699999999982882999905877512205788887389999999987550364605406899999 Q ss_pred CC------CCEEEECCCCCHHHHCCCCCHHHHHH--HC--------------CCCCCCC-EEEEEEEECHHHHHHHHHHH Q ss_conf 36------88399748827778237843268788--72--------------8898872-79995122644512366654 Q gi|255764474|r 131 IE------RPSTVIMHGVDTERFRPTSNKQEARR--HL--------------KISEDAK-LIGCFGRIRKLKGTDLFVDC 187 (352) Q Consensus 131 ~~------~~~~vi~~gid~~~~~~~~~~~~~~~--~~--------------~~~~~~~-~i~~~G~~~~~Kg~~~li~a 187 (352) +. .+..|++. -+.+.|.|......... +. ...++++ +++.--.+++-.+...|++| T Consensus 175 L~~~~~~~~~~~vlyD-rP~~~F~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~llVSSTSWTpDEDF~iLl~A 253 (415) T cd03816 175 LQQFNNWKIRATVLYD-RPPEQFRPLPLEEKHELFLKLAKTFLTRELRIGAVQLSEERPALLVSSTSWTPDEDFGILLDA 253 (415) T ss_pred HHHHHCCCCCEEEEEC-CCHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHH T ss_conf 9860266888799822-888975988678999999743554232334665311357773389951677776556789999 Q ss_pred HHHHHC------CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC---CCCCCCCCCHHHHHHCCCCCC-CC Q ss_conf 565300------47740588631333210001466776443112332212222---223310000000000012333-22 Q gi|255764474|r 188 MINILP------HHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID---EQSSIEDWYRALNIFVAPPLY-EG 257 (352) Q Consensus 188 ~~~l~~------~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g---~~~~~~~~~~~adi~i~pS~~-Eg 257 (352) +...-. ..|++.++|-|.||.+ +...+.+++..+. +|.+.. ..+|.+.+++.||+.|+...+ -| T Consensus 254 L~~Yd~~~~~~~~~p~ll~iITGKGP~K-----~~y~~~I~~~~l~-~V~i~t~wL~~eDYP~lL~~ADLGVsLHtSSSG 327 (415) T cd03816 254 LVAYEKSAATGPKLPKLLCIITGKGPLK-----EKYLERIKELKLK-KVTIRTPWLSAEDYPKLLASADLGVSLHTSSSG 327 (415) T ss_pred HHHHHHHHHCCCCCCCEEEEEECCCHHH-----HHHHHHHHHCCCC-CEEEEECCCCHHHHHHHHHCCCCCEEECCCCCC T ss_conf 9999765321478998799996885308-----9999999862888-219972578878899987415347242126876 Q ss_pred --CCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHHHHC Q ss_conf --22221000001001110278841010023488424996599989999999999869---8999999999999999829 Q gi|255764474|r 258 --FGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNS---KKIMSDTGNRGRERAVKHF 332 (352) Q Consensus 258 --~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~---~~~~~~~~~~a~~~~~~~f 332 (352) .|+++++...||+||++-+-+.++|+|+ +|+||+++. |.+++++.|..++.+ .+...++.+++.+. +.. T Consensus 328 lDLPMKVVDMfG~GlPV~A~~f~~i~ELVk--~~~NG~~F~--~~~eL~~~l~~l~~~~p~~~~l~~lk~~a~~~--~~~ 401 (415) T cd03816 328 LDLPMKVVDMFGCGLPVCALDFKCIDELVK--HGENGLVFG--DSEELAEQLIDLLSNFPNRGKLNSLKKGAQEE--SEL 401 (415) T ss_pred CCCCHHHHHHCCCCCCEEEECCCCHHHHHC--CCCCCCEEC--CHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--CCC T ss_conf 677702101026887579833751787722--687876657--89999999999985599668999999777774--149 Q ss_pred CHHHHHHHH Q ss_conf 989999999 Q gi|255764474|r 333 SIVKEASDI 341 (352) Q Consensus 333 s~~~~a~~~ 341 (352) +|+..-++. T Consensus 402 rWd~~W~~~ 410 (415) T cd03816 402 RWDENWDRV 410 (415) T ss_pred CHHHHHHHH T ss_conf 856878864 No 61 >PRK10117 trehalose-6-phosphate synthase; Provisional Probab=99.60 E-value=9.8e-13 Score=104.85 Aligned_cols=276 Identities=13% Similarity=0.106 Sum_probs=174.0 Q ss_pred CCCEEEEECCCHHHHHHHHHHHH-HCCCEEEEECCHHH----HHHHHHHHHHHH---HCCEEEECCHHHHH--------H Q ss_conf 59708996285468999999997-09981999757456----201134798996---26689988989997--------4 Q gi|255764474|r 67 QNSRIWHARRNNEMLLGVMMRDV-LRMPLKLVFTSPSQ----RNHSRWTRYLIS---RMDEVITTSQKSAR--------F 130 (352) Q Consensus 67 ~~~~ivh~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~l~~---~~d~ii~~s~~~~~--------~ 130 (352) ..-|+|+.|..--++....++.. ...++-+..|.++- ...-++.+.+++ .+|-|-..++..++ . T Consensus 122 ~~~D~VWVHDYHL~llp~~LR~~~~~~~IgfFlHiPFPs~eifr~LP~r~ell~glL~aDlIGFqt~~~~~~Fl~~~~~~ 201 (474) T PRK10117 122 QDDDIIWIHDYHLLPFASELRKRGVNNRIGFFLHIPFPTPEIFNALPPHDELLEQLCDYDLLGFQTENDRLAFLDCLSNQ 201 (474) T ss_pred CCCCEEEECCCHHHCCHHHHHHHCCCCCEEEEECCCCCCHHHHHHCCCHHHHHHHHHCCCEEEECCHHHHHHHHHHHHHH T ss_conf 88987998154022132999961899827988568999979983399669999987507867875789999999999986 Q ss_pred C-------------C--CCEEEECCCCCHHHHCCCCC------HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHH Q ss_conf 3-------------6--88399748827778237843------2687887288988727999512264451236665456 Q gi|255764474|r 131 I-------------E--RPSTVIMHGVDTERFRPTSN------KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMI 189 (352) Q Consensus 131 ~-------------~--~~~~vi~~gid~~~~~~~~~------~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~ 189 (352) + . ..+.+.|-|||.+.|..... ..++++.+ .+..+|+-+.|+++-||+..=++|+. T Consensus 202 ~~~~~~~~~~~~~~Gr~v~v~~~PigID~~~~~~~a~~~~~~~~~~l~~~~---~~~~lIlgVDRLDytKGi~~rl~Afe 278 (474) T PRK10117 202 TRVTTRSGKSHTAWGKAFRTEVYPIGIEPDEIAKQAAGPLPPKLAQLKAEL---KNVQNIFSVERLDYSKGLPERFLAYE 278 (474) T ss_pred CCCCCCCCCEEEECCEEEEEEEEECEECHHHHHHHHCCHHHHHHHHHHHHH---CCCEEEEEECCCCCCCCHHHHHHHHH T ss_conf 186404797599668089999995657599999984142789999999985---79849999356423258899999999 Q ss_pred HHHCCCCCE----EEEEECCCCCCCCCHHHHHHHHHHH--------CCCCCC--CCCCC---CCCCCCCCCHHHHHHCCC Q ss_conf 530047740----5886313332100014667764431--------123322--12222---223310000000000012 Q gi|255764474|r 190 NILPHHPGW----TAVVVGKTTLKHYLFKKNLQRRIYA--------NGLKKR--ILFID---EQSSIEDWYRALNIFVAP 252 (352) Q Consensus 190 ~l~~~~~~~----~l~i~G~g~~~~~~~~~~l~~~i~~--------~~l~~~--V~~~g---~~~~~~~~~~~adi~i~p 252 (352) .+++++|++ .++.++..+..+....+++++.+++ +|-.+. |.+.. .++++..+|+.||+.+.+ T Consensus 279 ~fL~~~Pe~~gkvvlvQia~psR~~v~~Y~~l~~ev~~~VgrINg~fg~~~w~Pi~y~~~~~~~~~l~alyr~ADv~lVT 358 (474) T PRK10117 279 ALLEKYPQHHGKIRYTQIAPTSRGDVQAYQDIRHQLETEAGRINGKYGQLGWTPLYYLNQHFDRKLLMKIFRYSDVGLVT 358 (474) T ss_pred HHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHCCEEEEC T ss_conf 99985900158679999827886553999999999999999985645889971499982689999999999746689964 Q ss_pred CCCCCCCCCCCCCCCCEE---E--EEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHH Q ss_conf 333222222100000100---1--11027884101002348842499659998999999999986-98999999999999 Q gi|255764474|r 253 PLYEGFGLTPLEAMASGI---P--VVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRE 326 (352) Q Consensus 253 S~~Eg~gl~~lEAma~G~---P--vI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~ 326 (352) |..+|..++..|+.||-. | +|.|...|..+.+. .+++|+|.|.+++|++|.+.++ .++++.+--++-++ T Consensus 359 plRDGMNLVAkEyVa~q~~~~~GvLILSeFaGaa~~L~-----~AllVNP~d~~~~A~ai~~AL~Mp~~Er~~R~~~l~~ 433 (474) T PRK10117 359 PLRDGMNLVAKEYVAAQDPANPGVLVLSQFAGAANELT-----SALIVNPYDRDEVAAALDRALTMPLAERISRHAEMLD 433 (474) T ss_pred CCCCCCCCCCHHEEEEECCCCCEEEEEECCCCCHHHHC-----CCEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 54342355530105785589980399853413266747-----6868799997999999999976999999999999999 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 9998299899999999999998860 Q gi|255764474|r 327 RAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 327 ~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) .+. +++....++.+++-.+++-.+ T Consensus 434 ~v~-~~dv~~W~~~Fl~~L~~~~~~ 457 (474) T PRK10117 434 VIV-KNDINHWQECFISDLKQITPR 457 (474) T ss_pred HHH-HCCHHHHHHHHHHHHHHHHHH T ss_conf 998-678999999999999864154 No 62 >cd03788 GT1_TPS Trehalose-6-Phosphate Synthase (TPS) is a glycosyltransferase that catalyses the synthesis of alpha,alpha-1,1-trehalose-6-phosphate from glucose-6-phosphate using a UDP-glucose donor. It is a key enzyme in the trehalose synthesis pathway. Trehalose is a nonreducing disaccharide present in a wide variety of organisms and may serve as a source of energy and carbon. It is characterized most notably in insect, plant, and microbial cells. Its production is often associated with a variety of stress conditions, including desiccation, dehydration, heat, cold, and oxidation. This family represents the catalytic domain of the TPS. Some members of this domain family coexist with a C-terminal trehalose phosphatase domain. Probab=99.58 E-value=2.7e-12 Score=101.87 Aligned_cols=271 Identities=17% Similarity=0.203 Sum_probs=169.9 Q ss_pred CCCEEEEECCCHHHHHHHHHHHH-HCCCEEEEECCHHH----HHHHHHHHHHHH---HCCEEEECCHHHHH--------H Q ss_conf 59708996285468999999997-09981999757456----201134798996---26689988989997--------4 Q gi|255764474|r 67 QNSRIWHARRNNEMLLGVMMRDV-LRMPLKLVFTSPSQ----RNHSRWTRYLIS---RMDEVITTSQKSAR--------F 130 (352) Q Consensus 67 ~~~~ivh~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~l~~---~~d~ii~~s~~~~~--------~ 130 (352) ...|+|+.|..--++....++.. ...++-+..|.++- ...-++.+.+++ .+|-|-..++..++ . T Consensus 130 ~~~D~VWVHDYhL~llP~~LR~~~~~~~igfFlHiPFPs~eifr~LP~r~eil~glL~~DlIGF~t~~y~r~Fl~~~~r~ 209 (460) T cd03788 130 RPGDLVWVHDYHLLLLPQMLRERGPDARIGFFLHIPFPSSEIFRCLPWREELLRGLLGADLIGFQTERYARNFLSCCSRL 209 (460) T ss_pred CCCCEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHCCCHHHHHHHHHHCCEEEECCHHHHHHHHHHHHHH T ss_conf 89987999641677689999985899848988707999989997697679999998747766646889999999999997 Q ss_pred CC---------------CCEEEECCCCCHHHHCCCCC----HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHH Q ss_conf 36---------------88399748827778237843----268788728898872799951226445123666545653 Q gi|255764474|r 131 IE---------------RPSTVIMHGVDTERFRPTSN----KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINI 191 (352) Q Consensus 131 ~~---------------~~~~vi~~gid~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l 191 (352) +. ..+.+.|-|||.+.|..... ............+.++|+-+.|+++-||+..-++|+..+ T Consensus 210 l~~~~~~~~~v~~~gr~v~v~~~PigId~~~~~~~~~~~~~~~~~~~l~~~~~~~~li~gvDRlDy~KGi~~kl~Afe~f 289 (460) T cd03788 210 LGLEVTDDGGVEYGGRRVRVGAFPIGIDPDAFRKLAASPEVQERAAELRERLGGRKLIVGVDRLDYSKGIPERLLAFERL 289 (460) T ss_pred CCCCCCCCCEEEECCEEEEEEEEECCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHH T ss_conf 09852479859999989999898040188999998628324799999998737971999625321114878899999999 Q ss_pred HCCCCC----EEEEEECCCCCCCCCHHHHHHHHHHHC--------CCCC--CCCCCC---CCCCCCCCCHHHHHHCCCCC Q ss_conf 004774----058863133321000146677644311--------2332--212222---22331000000000001233 Q gi|255764474|r 192 LPHHPG----WTAVVVGKTTLKHYLFKKNLQRRIYAN--------GLKK--RILFID---EQSSIEDWYRALNIFVAPPL 254 (352) Q Consensus 192 ~~~~~~----~~l~i~G~g~~~~~~~~~~l~~~i~~~--------~l~~--~V~~~g---~~~~~~~~~~~adi~i~pS~ 254 (352) ++++|+ +.|+.++..+..+.....++++.+++. |-.+ -|+|.. ..+++..+|+.||+++.+|. T Consensus 290 L~~~Pe~~~kvvlvQia~psr~~v~~y~~l~~~i~~~v~~IN~~fg~~~w~PI~y~~~~~~~~el~aly~~ADv~lVT~l 369 (460) T cd03788 290 LERYPEWRGKVVLVQIAVPSRTDVPEYQELRREVEELVGRINGKFGTLDWTPVRYLYRSLPREELAALYRAADVALVTPL 369 (460) T ss_pred HHHCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHEEEECCC T ss_conf 98691433777999992587767557899999999999998614378997029999179999999999986105785534 Q ss_pred CCCCCCCCCCCCCCEEE----EEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHH Q ss_conf 32222221000001001----11027884101002348842499659998999999999986-98999999999999999 Q gi|255764474|r 255 YEGFGLTPLEAMASGIP----VVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRERAV 329 (352) Q Consensus 255 ~Eg~gl~~lEAma~G~P----vI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~~~~ 329 (352) .+|..++..|+.+|..+ .|.|...|..+.+. .+++|+|.|.+++|++|.+.++ .++++.+-.++-++.+. T Consensus 370 rDGMNLvakEyva~q~~~~GvLILSefaGaa~~L~-----~Al~VNP~d~~~~a~ai~~AL~M~~~Er~~R~~~l~~~v~ 444 (460) T cd03788 370 RDGMNLVAKEYVACQDDDPGVLILSEFAGAAEELS-----GALLVNPYDIDEVADAIHRALTMPLEERRERHRKLREYVR 444 (460) T ss_pred CCCCCCCCHHHEEEECCCCCEEEEECCCCCHHHHC-----CCEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 23325341311367559995599965524366728-----7879799998999999999975999999999999999988 Q ss_pred HHCCHHHHHHHHHH Q ss_conf 82998999999999 Q gi|255764474|r 330 KHFSIVKEASDIGK 343 (352) Q Consensus 330 ~~fs~~~~a~~~~~ 343 (352) .++....++.+.+ T Consensus 445 -~~~~~~W~~~fl~ 457 (460) T cd03788 445 -THDVQAWANSFLD 457 (460) T ss_pred -HCCHHHHHHHHHH T ss_conf -5799999999999 No 63 >PRK00726 murG N-acetylglucosaminyl transferase; Provisional Probab=99.55 E-value=2.1e-11 Score=95.89 Aligned_cols=312 Identities=14% Similarity=0.118 Sum_probs=184.7 Q ss_pred HHEEEECCCCCCCCCCHHHH-HHHHHHHHHHHCCEEEEEECCCC-------C-C-----CCC-----C-C---------- Q ss_conf 44589707878787707788-88999999971985999805886-------6-5-----623-----4-7---------- Q gi|255764474|r 4 NNIDVIAPNMKFRHTGVTST-VFGLCPIQRKLGQRLVVFGYCLP-------K-N-----IPS-----I-G---------- 53 (352) Q Consensus 4 ~~i~~i~~~~~~~~~Gv~~~-v~~~~~~~~~~~~~~~~~~~~~~-------~-~-----~~~-----~-~---------- 53 (352) ++|.+.+.. +|...+ ...++..+++.|+++..++...+ . . ++. . . T Consensus 2 kkI~i~~GG-----TGGHi~Palala~~L~~~g~ev~~ig~~~g~E~~~v~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l 76 (359) T PRK00726 2 KKILLAGGG-----TGGHVFPALALAEELKKRGWEVLWLGTKRGMEARLVPKAGIEFHFIPIGGLRRKGSLANLKAPFKL 76 (359) T ss_pred CEEEEEECC-----CHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHH T ss_conf 889999588-----689999999999999838798999978826865404414983899777888987879999999999 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCCC Q ss_conf 89989988754325970899628546899999999709981999757456201134798996266899889899974368 Q gi|255764474|r 54 ISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIER 133 (352) Q Consensus 54 ~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~ 133 (352) ...+...++.....+||++-..........++.+.+++.|+.+.-.+. ....-.|++.+.++.+++.-+....+.+. T Consensus 77 ~~~~~~~~~il~~~kPd~Vig~GGY~s~P~~laA~l~~iP~iiHEqN~---v~G~aNr~l~~~a~~i~~~f~~~~~~~~~ 153 (359) T PRK00726 77 LKGVLQARKILKRFKPDVVVGFGGYVSGPAGLAARLLGIPLVIHEQNA---VPGLANKLLARFAKKVATAFEETPKAFPK 153 (359) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCEEEECCCC---CCCCCHHHHHHHCCEEEECCHHHHHCCCC T ss_conf 999999999999749999997897412899999998299869974542---35623378888509789977555403762 Q ss_pred -CEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEE-ECHHHHH-HHHHHHHHHHHCCCCCEEEE-EECCCCCC Q ss_conf -83997488277782378432687887288988727999512-2644512-36665456530047740588-63133321 Q gi|255764474|r 134 -PSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGR-IRKLKGT-DLFVDCMINILPHHPGWTAV-VVGKTTLK 209 (352) Q Consensus 134 -~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~-~~~~Kg~-~~li~a~~~l~~~~~~~~l~-i~G~g~~~ 209 (352) +..++.|.+-.+.+... ....+. ...+.+.|+.+|. ... +.+ +.+.+++..+.++ .+++++ ++|... T Consensus 154 ~k~~~~G~PvR~~~~~~~--~~~~~~---~~~~~~~iLV~GGSqGa-~~~N~~v~~~l~~l~~~-~~~~i~~~~G~~~-- 224 (359) T PRK00726 154 AKAVVTGNPVREEILALP--APAFRL---AGRGPPTLLVVGGSQGA-RVLNEAVPEALALLPEE-LRIQVIHQTGKKD-- 224 (359) T ss_pred CCEEEECCCCCHHHHCCC--HHHHHH---CCCCCCEEEEECCCCHH-HHHHHHHHHHHHHHHHC-CCEEEEEECCCCH-- T ss_conf 455996784027766143--333321---04788579997685204-78999999999987652-5908999828403-- Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC--------C Q ss_conf 00014667764431123322122222233100000000000123332222221000001001110278841--------0 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF--------S 281 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~--------~ 281 (352) .+++++...+.+ .++...++.+++..+|++||+.++-+ -..++.|.+++|+|.|.-+.+.. . T Consensus 225 ----~~~~~~~~~~~~--~~~~v~~f~~~m~~~~~~aDlvIsRa----Ga~Ti~E~~~~g~P~IlIP~p~a~~~HQ~~NA 294 (359) T PRK00726 225 ----LEEVRAAYAELG--VNAEVVPFIDDMAAAYAAADLVICRA----GASTVAELAAAGLPAILVPLPHAADDHQTANA 294 (359) T ss_pred ----HHHHHHHHHHCC--CCEEECCCCCHHHHHHHCCCEEEECC----CCCHHHHHHHHCCCEEEEECCCCCCCHHHHHH T ss_conf ----999999998659--97697575231899874088999889----83269999982898699836877753899999 Q ss_pred HHCCCCCCCEEEEECCC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 10023488424996599--98999999999986989999999999999998299899999999999 Q gi|255764474|r 282 ELLDPENAKAGVIVPPR--NLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVY 345 (352) Q Consensus 282 e~i~~~~g~~G~~~~~~--d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY 345 (352) +.+. +...|++++.. +++.+.+.|.+++.|++.+.+|++++++....+ +.+.+++.++++- T Consensus 295 ~~l~--~~gaa~~i~e~~~~~~~L~~~i~~ll~d~~~l~~m~~~~~~~~~~~-a~~~i~~~i~~~~ 357 (359) T PRK00726 295 RALV--DAGAAKLIPQSDLTPERLAEALLELLSDRERLEAMAEAARALGIPD-AAERLADLIEKLA 357 (359) T ss_pred HHHH--HCCCEEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHH T ss_conf 9999--7899999531469999999999999869999999999997248978-9999999999985 No 64 >pfam00982 Glyco_transf_20 Glycosyltransferase family 20. Members of this family belong to glycosyl transferase family 20. OtsA (Trehalose-6-phosphate synthase) is homologous to regions in the subunits of yeast trehalose-6-phosphate synthase/phosphate complex,. Probab=99.52 E-value=1.6e-11 Score=96.65 Aligned_cols=271 Identities=14% Similarity=0.170 Sum_probs=169.7 Q ss_pred CCEEEEECCCHHHHHHHHHHHH-HCCCEEEEECCHHH----HHHHHHHHHHHH---HCCEEEECCHHHHH--------HC Q ss_conf 9708996285468999999997-09981999757456----201134798996---26689988989997--------43 Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDV-LRMPLKLVFTSPSQ----RNHSRWTRYLIS---RMDEVITTSQKSAR--------FI 131 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~l~~---~~d~ii~~s~~~~~--------~~ 131 (352) .-|+|+.|..--++....++.. ...++-+..|.++- ...-++.+.+++ .+|-|-..++..++ .+ T Consensus 138 ~~D~VWVHDYHL~llp~~LR~~~~~~~IgfFlHiPFPs~eifr~lP~r~eil~glL~~DliGFqt~~y~r~Fl~~~~rll 217 (470) T pfam00982 138 DGDLIWVHDYHLMLLPQMLRKRLPDAKIGFFLHIPFPSSEIFRCLPVREEILRGLLGADLIGFHTYDYARHFLSCCSRLL 217 (470) T ss_pred CCCEEEECCCHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHCCCHHHHHHHHHHCCEEEECCHHHHHHHHHHHHHHH T ss_conf 89989982633666899999848998389886189999899976987699999987588889678899999999999971 Q ss_pred C---------------CCEEEECCCCCHHHHCCCCC-------HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHH Q ss_conf 6---------------88399748827778237843-------2687887288988727999512264451236665456 Q gi|255764474|r 132 E---------------RPSTVIMHGVDTERFRPTSN-------KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMI 189 (352) Q Consensus 132 ~---------------~~~~vi~~gid~~~~~~~~~-------~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~ 189 (352) . ..+.+.|-|||.+.|..... ...++..++ .+.++|+-+.|+++-||+..-+.|+. T Consensus 218 g~~~~~~~~v~~~gr~v~v~~~PigId~~~~~~~~~~~~~~~~~~~lr~~~~--~~~kli~gvDRlDytKGi~~kl~Afe 295 (470) T pfam00982 218 GLETTSDGGVEYGGRTVSVGAFPIGIDPGRIESGLKSPSVQEKVKELKERFG--NKKKLILGVDRLDYIKGIPQKLLAFE 295 (470) T ss_pred CCCCCCCCEEEECCEEEEEEEEECEECHHHHHHHHCCCHHHHHHHHHHHHHC--CCCEEEEECCCCCCCCCHHHHHHHHH T ss_conf 8740589739999989999998133188999998639146899999999837--99669997366402258899999999 Q ss_pred HHHCCCCCE----EEEEECCCCCCCCCHHHHHHHHHHHC--------CCCCC--CCCCC---CCCCCCCCCHHHHHHCCC Q ss_conf 530047740----58863133321000146677644311--------23322--12222---223310000000000012 Q gi|255764474|r 190 NILPHHPGW----TAVVVGKTTLKHYLFKKNLQRRIYAN--------GLKKR--ILFID---EQSSIEDWYRALNIFVAP 252 (352) Q Consensus 190 ~l~~~~~~~----~l~i~G~g~~~~~~~~~~l~~~i~~~--------~l~~~--V~~~g---~~~~~~~~~~~adi~i~p 252 (352) .+++++|++ .|+.++..+..+.....++++.+++. |-.+. |.+.- ..+++..+|+.||+++.+ T Consensus 296 ~fL~~~Pe~~gkv~lvQia~psr~~v~~y~~l~~~v~~~v~rIN~~fg~~~w~Pv~y~~~~~~~~el~aly~~adv~lVT 375 (470) T pfam00982 296 RFLEEYPEWRGKVVLVQIAVPSRGDVEEYQNLRSQVEELVGRINGEFGTLDYTPVHHLHRSLDFDELIALYAIADVCLVT 375 (470) T ss_pred HHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHEEEEC T ss_conf 99986914468779999815887562789999999999999996144789974299990799999999999851235745 Q ss_pred CCCCCCCCCCCCCCCCEE----EEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHHH Q ss_conf 333222222100000100----111027884101002348842499659998999999999986-989999999999999 Q gi|255764474|r 253 PLYEGFGLTPLEAMASGI----PVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRER 327 (352) Q Consensus 253 S~~Eg~gl~~lEAma~G~----PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~~ 327 (352) |..+|+.|+..|+.+|-. ++|.|...|..+.+. + .+++|+|.|.+++|++|.+.+. .++++++-.++-++. T Consensus 376 plrDGMNLvakEyva~q~~~~GvLILSefaGaa~~L~--~--gAllVNP~d~~~~a~ai~~AL~M~~~Er~~R~~~l~~~ 451 (470) T pfam00982 376 SLRDGMNLVAYEYVACQQDRKGVLILSEFAGAAQSLN--D--GAILVNPWDIEEVAEAINEALTMSEEERQKRHRKLFKY 451 (470) T ss_pred CCCCCCCCCCHHHEEEECCCCCEEEEECCCCHHHHHC--C--CCEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 3334335330221366559995499952403176755--9--70898999989999999999759999999999999999 Q ss_pred HHHHCCHHHHHHHHHHHH Q ss_conf 998299899999999999 Q gi|255764474|r 328 AVKHFSIVKEASDIGKVY 345 (352) Q Consensus 328 ~~~~fs~~~~a~~~~~iY 345 (352) + ..++....++++.+-. T Consensus 452 v-~~~d~~~W~~~fL~~L 468 (470) T pfam00982 452 I-SKHDVQYWAESFLSDL 468 (470) T ss_pred H-HHCCHHHHHHHHHHHH T ss_conf 8-8579999999999984 No 65 >PHA01630 putative group 1 glycosyl transferase Probab=99.51 E-value=2.1e-12 Score=102.68 Aligned_cols=286 Identities=18% Similarity=0.221 Sum_probs=176.4 Q ss_pred HHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEEC-CH Q ss_conf 88899999997198599980588665623478998998875432597089962854689999999970998199975-74 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFT-SP 101 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 101 (352) .-.-+...++.+++.+..+.- +......+. -.....|+|.-++......- +-+.+-.+.+- .. T Consensus 17 ~k~l~~e~lK~L~~k~~vfe~------~~l~~~~~~------P~~~~~i~~~~f~sm~~Wk~----i~~~~~~i~~EVaD 80 (333) T PHA01630 17 QKKLLEEHLKMLGHKVTVFEK------PTLTKYQLP------PGYPIYIYYTIFNSMLFWKG----IPHVGKNIVFEVAD 80 (333) T ss_pred HHHHHHHHHHHHCCEEEEEEC------CCCCHHHCC------CCCCEEEEEEECCCEEEECC----CCCCCCEEEEEEEC T ss_conf 899999999985866688644------754310069------99960799750024135325----75335347999606 Q ss_pred HHHHHHHHHHHH-HHHCCEEEECCHHHHHH-------CCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEE Q ss_conf 562011347989-96266899889899974-------3688399748827778237843268788728898872799951 Q gi|255764474|r 102 SQRNHSRWTRYL-ISRMDEVITTSQKSARF-------IERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFG 173 (352) Q Consensus 102 ~~~~~~~~~~~l-~~~~d~ii~~s~~~~~~-------~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G 173 (352) ..+-...+..+. -.-+|.++..|+.+..- ++.|+.+|||..|+..|+-.+... +..-++...= T Consensus 81 TD~Is~~~l~~~~n~~~D~ivv~SqWS~naf~~sgl~I~~PiY~IpHn~nprm~~~~~kek---------~~~~Vl~~l~ 151 (333) T PHA01630 81 TDAISHTALYFFRNQPVDEIVVPSQWSKNAFYTSGLKIPQPIYVIPHNLNPRMFEYKPKEK---------PHPCVLAILP 151 (333) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHCCCCCCCCEEECCCCCCHHHHHCCHHHC---------CCCEEEEECC T ss_conf 1677799999986087540674244436578752899998627645679935640763218---------8756999856 Q ss_pred EECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCC Q ss_conf 22644512366654565300477405886313332100014667764431123322122222233100000000000123 Q gi|255764474|r 174 RIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPP 253 (352) Q Consensus 174 ~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS 253 (352) .-..+||-|.+.+++.++.+.+.++.+++--.. .. ++ +++.-.|+. -+-.-+|+.++|+.||++..|| T Consensus 152 HS~~RKG~Di~~~v~~elqke~~d~Y~Lvkssn-~~------d~-Rl~~l~gvk----~plp~dd~~~lf~~~Di~f~p~ 219 (333) T PHA01630 152 HSWDRKGGDIVVKIFHELQNEGYDFYFLIKSSN-ML------DP-RLFGLNGVK----TPLPDDDIYSLFAGCDILFYPV 219 (333) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECC-CC------CC-EEECCCCCC----CCCCCHHHHHHHHHCCEEEEEC T ss_conf 654546568899999999845785699998415-56------73-021233544----7898167899874063799841 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCC------------------CEEEEECCCCHHHHHHHHHHHHCCH- Q ss_conf 332222221000001001110278841010023488------------------4249965999899999999998698- Q gi|255764474|r 254 LYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENA------------------KAGVIVPPRNLHALEKAVLYFMNSK- 314 (352) Q Consensus 254 ~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g------------------~~G~~~~~~d~~~la~~i~~l~~~~- 314 (352) +.-.|-+..+||++.|+|+|+|.-|...|.+..... .-|..+++ |.++..+++...++|- T Consensus 220 RGGaFEi~~iEAl~~gl~~v~te~GaWsE~~~~~~~~~~ik~~~~~k~~~~NpiHvG~~le~-~~eda~qKll~~L~n~~ 298 (333) T PHA01630 220 RGGAFEIPVIEALALGLDVVVTEKGAWSEWVLSNLDVYWIKSGRKPKLWYTNPIHVGYFLDP-DIEDAYQKLLEALANWT 298 (333) T ss_pred CCCEEECCHHHHHHCCCCEEECCCCCHHHHCCCCCHHHHHHHCCCCEECCCCCEEECCCCCC-CHHHHHHHHHHHHHCCC T ss_conf 58603431799987079767627864065247521002212068724426797322266678-76899999999986258 Q ss_pred -H-HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf -9-99999999999999829989999999999999 Q gi|255764474|r 315 -K-IMSDTGNRGRERAVKHFSIVKEASDIGKVYDR 347 (352) Q Consensus 315 -~-~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~ 347 (352) + ..+.+..++. ..+++||..+++++++++.++ T Consensus 299 ~ek~Ke~le~~~~-~~rEnYsy~ai~k~~~kIlek 332 (333) T PHA01630 299 PEKKKENLEGRAI-LYRENYSYNAIAKMWEKILEK 332 (333) T ss_pred HHHHHHHHHHHHH-HHHHHCCHHHHHHHHHHHHHC T ss_conf 3777765553121-124534789999999999850 No 66 >PRK05749 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed Probab=99.51 E-value=4.1e-11 Score=93.89 Aligned_cols=245 Identities=21% Similarity=0.263 Sum_probs=153.8 Q ss_pred HHHCCCEEEEECCHH------HHHHHHHHHHHHHHCCEEEECCHHHHHHCC-----CCEEEECCCCCHHHHCCC--C--- Q ss_conf 970998199975745------620113479899626689988989997436-----883997488277782378--4--- Q gi|255764474|r 88 DVLRMPLKLVFTSPS------QRNHSRWTRYLISRMDEVITTSQKSARFIE-----RPSTVIMHGVDTERFRPT--S--- 151 (352) Q Consensus 88 ~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~-----~~~~vi~~gid~~~~~~~--~--- 151 (352) ...+.|+.++--+-+ ++....+.+.++++.|.+.+.|+..++.+. +.+.+..| -+|... + T Consensus 143 ~~~~Ip~~liNaR~s~~S~~~y~~~~~~~~~~l~~~~~i~~qs~~~~~r~~~lG~~~~v~v~Gn----lKfd~~~~~~~~ 218 (423) T PRK05749 143 KKRGIPLVLANARLSERSFKRYAKFKRFYRLLFKNIDLVLAQSEEDAERFLALGAKNEVTVTGN----LKFDIEIPPELA 218 (423) T ss_pred HHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCCEEECCC----CCCCCCCCHHHH T ss_conf 6278866765251163367666766999999997427665269999999997599974387686----322457871107 Q ss_pred -CHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCC--- Q ss_conf -3268788728898872799951226445123666545653004774058863133321000146677644311233--- Q gi|255764474|r 152 -NKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLK--- 227 (352) Q Consensus 152 -~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~--- 227 (352) ....++..+ .++++++. |...+ .--..+++++.++.+..|+..++++= ++.+..+++.+.+++.|+. T Consensus 219 ~~~~~~~~~~---~~~~v~va-gSth~-~EE~iil~~~~~l~~~~~~~~lIiaP----RHpeR~~~i~~~l~~~gl~~~~ 289 (423) T PRK05749 219 ARAAALRRQL---PNRPVWIA-ASTHE-GEEELVLDAHQALLKQFPNLLLILVP----RHPERFKEVEELLKKAGLSYVR 289 (423) T ss_pred HHHHHHHHHH---CCCCEEEE-EECCH-HHHHHHHHHHHHHHHCCCCCEEEEEC----CCHHHHHHHHHHHHHCCCCEEE T ss_conf 8899999981---89968999-92887-69999999999997408782899947----8776799999999967997798 Q ss_pred ----------CCCCCCCCCCCCCCCCHHHHHH-CCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEE Q ss_conf ----------2212222223310000000000-01233322222210000010011102-78841010023488424996 Q gi|255764474|r 228 ----------KRILFIDEQSSIEDWYRALNIF-VAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIV 295 (352) Q Consensus 228 ----------~~V~~~g~~~~~~~~~~~adi~-i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~ 295 (352) .+|.+.+..-++..+|+.||+. +--|....-|--++||.++|+|||.- .....+|+.+.- ...|... T Consensus 290 ~S~~~~~~~~~~Vli~Dt~GeL~~~Y~~a~iafvGGsf~~~GGHN~lEpa~~g~pvi~GP~~~nf~e~~~~L-~~~g~~~ 368 (423) T PRK05749 290 RSQGEAPSADTDVLLGDTMGELGLLYAIADIAFVGGSLVKIGGHNPLEPAAFGVPVISGPHTFNFKEIFERL-LQAGAAI 368 (423) T ss_pred ECCCCCCCCCCEEEEECCCCHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHCCCCEEECCCCCCHHHHHHHH-HHCCCEE T ss_conf 279999998872999888875889998578789827768889959799998399889993832779999999-9789958 Q ss_pred CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 599989999999999869899999999999999982998999999999999988 Q gi|255764474|r 296 PPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 296 ~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l 349 (352) .-.|.+++++.+.++++|++.+.+|+.+|++.+.++ .-..+++.+..++.| T Consensus 369 ~v~~~~eL~~~~~~ll~~~~~~~~~~~~a~~~v~~~---~Gat~r~~~~i~~~L 419 (423) T PRK05749 369 QVEDAEDLAKAVSSLLTDPDAREAMGEAGVAFLKQN---RGALQRTLQLLKPYL 419 (423) T ss_pred EECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC---CCHHHHHHHHHHHHC T ss_conf 968999999999999769999999999999999978---479999999999763 No 67 >cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis. It transfers N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the C4 hydroxyl of a lipid-linked N-acetylmuramoyl pentapeptide (NAM). The resulting disaccharide is then transported across the cell membrane, where it is polymerized into NAG-NAM cell-wall repeat structure. MurG belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains, each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=99.49 E-value=5.2e-11 Score=93.26 Aligned_cols=283 Identities=16% Similarity=0.156 Sum_probs=174.9 Q ss_pred HHHHHHHHHHHCCEEEEEECCCC-------C-CC-----CC------CCH----------HHHHHHHHHHHHCCCEEEEE Q ss_conf 88999999971985999805886-------6-56-----23------478----------99899887543259708996 Q gi|255764474|r 24 VFGLCPIQRKLGQRLVVFGYCLP-------K-NI-----PS------IGI----------SSLLTCWKKPIGQNSRIWHA 74 (352) Q Consensus 24 v~~~~~~~~~~~~~~~~~~~~~~-------~-~~-----~~------~~~----------~~~~~~~~~~~~~~~~ivh~ 74 (352) ...++.++++.|+++..++...+ . .+ +. ... ..+....+.....+||++-+ T Consensus 16 alala~~L~~~g~~V~~i~~~~g~e~~~~~~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kPd~vi~ 95 (350) T cd03785 16 ALALAEELRERGAEVLFLGTKRGLEARLVPKAGIPLHTIPVGGLRRKGSLKKLKAPFKLLKGVLQARKILKKFKPDVVVG 95 (350) T ss_pred HHHHHHHHHHCCCEEEEEECCCHHHHCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99999999978798999987836864234413994899768887888739999999999999999999999649999998 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC-CCEEEECCCCCHHHHCCCCCH Q ss_conf 2854689999999970998199975745620113479899626689988989997436-883997488277782378432 Q gi|255764474|r 75 RRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE-RPSTVIMHGVDTERFRPTSNK 153 (352) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~-~~~~vi~~gid~~~~~~~~~~ 153 (352) -.........+.+++++.|+.+.-.+.. ...-.|++.+.++.+++..+....... .+..++.|.+..+.+. . T Consensus 96 ~GGY~s~P~~laA~~~~iP~~ihEqN~v---~G~anr~l~~~a~~i~~~f~~~~~~~~~~k~~~vG~PvR~~~~~----~ 168 (350) T cd03785 96 FGGYVSGPVGLAAKLLGIPLVIHEQNAV---PGLANRLLARFADRVALSFPETAKYFPKDKAVVTGNPVREEILA----L 168 (350) T ss_pred CCCCHHHHHHHHHHHCCCCCEEECCCCC---CCHHHHHHCCCCCEEEECCHHHHHCCCCCCEEEECCCCCHHHHC----C T ss_conf 8981038999999972998556567722---57132332100398998575654124667779968852266641----4 Q ss_pred HHHHHHCCCCCCCCEEEEEEEECHHHHH-HHHHHHHHHHHCCCCCEEEE-EECCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 6878872889887279995122644512-36665456530047740588-631333210001466776443112332212 Q gi|255764474|r 154 QEARRHLKISEDAKLIGCFGRIRKLKGT-DLFVDCMINILPHHPGWTAV-VVGKTTLKHYLFKKNLQRRIYANGLKKRIL 231 (352) Q Consensus 154 ~~~~~~~~~~~~~~~i~~~G~~~~~Kg~-~~li~a~~~l~~~~~~~~l~-i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~ 231 (352) ...+..++.+++.+.|+.+|.-.-.+.+ +.+.+++..+.. .+++++ ++|... .++++...... ..++. T Consensus 169 ~~~~~~~~~~~~~~~iLv~GGSqGa~~ln~~v~~~~~~l~~--~~~~ii~~~G~~~------~~~~~~~~~~~--~~~~~ 238 (350) T cd03785 169 DRERARLGLRPGKPTLLVFGGSQGARAINEAVPEALAELLR--KRLQVIHQTGKGD------LEEVKKAYEEL--GVNYE 238 (350) T ss_pred CHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHH--CCCEEEEECCCCH------HHHHHHHHHHC--CCCEE T ss_conf 34467527898973999984872047899999999998764--4968999838400------89999999866--99889 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC--------CHHCCCCCCCEEEEECCC--CHH Q ss_conf 2222233100000000000123332222221000001001110278841--------010023488424996599--989 Q gi|255764474|r 232 FIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF--------SELLDPENAKAGVIVPPR--NLH 301 (352) Q Consensus 232 ~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~--------~e~i~~~~g~~G~~~~~~--d~~ 301 (352) ..++.+++..+|+.||++++-+ -..++.|.+++|+|.|.-+.+.. .+.+. +...|++++.. +++ T Consensus 239 v~~f~~~m~~~l~~aDlvIsra----Ga~Ti~E~~~~g~P~IlIP~p~a~d~hQ~~NA~~l~--~~g~a~~i~e~~~~~~ 312 (350) T cd03785 239 VFPFIDDMAAAYAAADLVISRA----GASTVAELAALGLPAILIPLPYAADDHQTANARALV--KAGAAVLIPQEELTPE 312 (350) T ss_pred EECHHHHHHHHHHHCCEEEECC----CCCHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHH--HCCCEEEEECCCCCHH T ss_conf 9251889999986198899779----842599999819986998458777665999999999--8899999500249999 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 9999999998698999999999999999 Q gi|255764474|r 302 ALEKAVLYFMNSKKIMSDTGNRGRERAV 329 (352) Q Consensus 302 ~la~~i~~l~~~~~~~~~~~~~a~~~~~ 329 (352) .+.++|.+++.|++.+.+|++++++... T Consensus 313 ~L~~~i~~ll~d~~~l~~m~~~~~~~~~ 340 (350) T cd03785 313 RLAAALLELLSDPERLKAMAEAARSLAR 340 (350) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCC T ss_conf 9999999998799999999999874589 No 68 >TIGR02400 trehalose_OtsA alpha,alpha-trehalose-phosphate synthase [UDP-forming]; InterPro: IPR012766 This enzyme catalyzes the key, penultimate step in biosynthesis of trehalose, a compatible solute made as an osmoprotectant in some species in all three domains of life. The gene symbol OtsA stands for osmotically regulated trehalose synthesis A. Trehalose helps protect against both osmotic and thermal stresses, and is made from two glucose subunits. This entry excludes glucosylglycerol-phosphate synthase, an enzyme of an analogous osmoprotectant system in many cyanobacterial strains. This entry does not identify archaeal examples, as they are more divergent than glucosylglycerol-phosphate synthase. Sequences that score in the gray zone between the trusted and noise cut offs include a number of yeast multidomain proteins in which the N-terminal domain may be functionally equivalent to this family. The gray zone also includes the OtsA of Cornyebacterium glutamicum (and related species), shown to be responsible for synthesis of only trace amounts of trehalose while the majority is synthesized by the TreYZ pathway; the significance of OtsA in this species is unclear (see ).; GO: 0003825 alphaalpha-trehalose-phosphate synthase (UDP-forming) activity, 0005992 trehalose biosynthetic process. Probab=99.48 E-value=8.4e-13 Score=105.28 Aligned_cols=270 Identities=20% Similarity=0.222 Sum_probs=180.1 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHH-CC-CEEEEECCHH----HHHHHH-HHHHHHHH---CCEEEECCHHHH------ Q ss_conf 32597089962854689999999970-99-8199975745----620113-47989962---668998898999------ Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVL-RM-PLKLVFTSPS----QRNHSR-WTRYLISR---MDEVITTSQKSA------ 128 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~----~~~~~~-~~~~l~~~---~d~ii~~s~~~~------ 128 (352) ....-|+++.|..--++.+..++... .. ++-+..|-++ ....-+ +.+-+++. +|-|=.-++..+ T Consensus 137 ~~~~gD~~WvHDYHL~~lp~~LRe~~~~n~~iGFFLHiPfPS~ei~~~lP~~~~~ll~gll~yDLvGFqT~~d~~nF~~~ 216 (476) T TIGR02400 137 LLQPGDIVWVHDYHLMLLPAMLRELGVENVKIGFFLHIPFPSSEIYRTLPEWRRELLEGLLAYDLVGFQTYEDARNFLSA 216 (476) T ss_pred HCCCCCEEEEECCHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 63899899984414568999999717785300104178998579985180579999999871583024568899999999 Q ss_pred --------------HHCC----C--CEEEECCCCCHHHHCCCCCHH-------HHHHHCCCCCCCCEEEEEEEECHHHHH Q ss_conf --------------7436----8--839974882777823784326-------878872889887279995122644512 Q gi|255764474|r 129 --------------RFIE----R--PSTVIMHGVDTERFRPTSNKQ-------EARRHLKISEDAKLIGCFGRIRKLKGT 181 (352) Q Consensus 129 --------------~~~~----~--~~~vi~~gid~~~~~~~~~~~-------~~~~~~~~~~~~~~i~~~G~~~~~Kg~ 181 (352) .+-. + .+.+-|-|||++.|......+ ++++.+ ....+|+-|-||++-||+ T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~R~~~v~aFPIGID~~~f~~~a~~~~~~~~~~~l~~~l---~~~k~i~GVDRLDYsKG~ 293 (476) T TIGR02400 217 VSRELGYETLENGVESGGGAGGRTVRVGAFPIGIDVDRFAEQAKKPSVQKRIAELRESL---RGRKLIIGVDRLDYSKGL 293 (476) T ss_pred HHHHHHHHHCCCCEECCCCCCCCEEEEEEEEECCCHHHHHHHHHCHHHHHHHHHHHHHH---CCCEEEEEEEECHHHCCH T ss_conf 99985111126620117756433699988742007799998731400389999999983---797499984202011227 Q ss_pred HHHHHHHHHHHCCCCC----EEEEEECCCCCCCCCHHHHHHHHHHHCC-----CCCCCCCCC--------CCCCCCCCCH Q ss_conf 3666545653004774----0588631333210001466776443112-----332212222--------2233100000 Q gi|255764474|r 182 DLFVDCMINILPHHPG----WTAVVVGKTTLKHYLFKKNLQRRIYANG-----LKKRILFID--------EQSSIEDWYR 244 (352) Q Consensus 182 ~~li~a~~~l~~~~~~----~~l~i~G~g~~~~~~~~~~l~~~i~~~~-----l~~~V~~~g--------~~~~~~~~~~ 244 (352) ..=++||.++++++|+ ++++.+-..+-.+....++|++.+++.- --..+.|++ .++.+..+|+ T Consensus 294 ~~R~~AFe~fL~~~P~~rgkV~lvQIA~PSR~~V~~Y~~Lr~~~~~~VGrING~fg~~~WtPi~Y~~~~~~~~~L~aLYr 373 (476) T TIGR02400 294 PERLKAFERFLEEHPEWRGKVTLVQIAVPSRGDVEEYQQLRRQVEELVGRINGRFGTLDWTPIRYLNRSYPREELMALYR 373 (476) T ss_pred HHHHHHHHHHHHHCHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHH T ss_conf 78999999998747342072799998506688878999999999998645420217888536787238888689999974 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCEEEE-----EECCC-CCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCH-HHH Q ss_conf 0000001233322222210000010011-----10278-8410100234884249965999899999999998698-999 Q gi|255764474|r 245 ALNIFVAPPLYEGFGLTPLEAMASGIPV-----VASNT-GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSK-KIM 317 (352) Q Consensus 245 ~adi~i~pS~~Eg~gl~~lEAma~G~Pv-----I~s~~-~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~-~~~ 317 (352) .|||.+.+|..+|.-||.-|+.||-.|. |.|.. |...|+ + .+++|+|.|.+++|+||.+.++-| ++| T Consensus 374 ~adV~LVTpLRDGMNLVAKEYVAaQ~~~~PGVLiLSefAGAA~eL-~-----~ALlVNP~d~~g~a~Ai~~AL~Mp~~eR 447 (476) T TIGR02400 374 AADVGLVTPLRDGMNLVAKEYVAAQDPEDPGVLILSEFAGAAQEL-K-----GALLVNPYDIEGMADAIARALTMPLEER 447 (476) T ss_pred HCCEEEECCCCCCCCHHHHHHHHCCCCCCCCCEECCHHHHHHHHH-C-----CCCCCCCCCHHHHHHHHHHHHCCCHHHH T ss_conf 103266545656751353587852898847854300266699885-0-----1562189788999999999852998899 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999999999829989999999999 Q gi|255764474|r 318 SDTGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 318 ~~~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) ++--++=.+++. .++.....+.+++- T Consensus 448 ~~R~~~l~~~l~-~~d~~~W~~~Fl~~ 473 (476) T TIGR02400 448 EERHRALMKKLR-KNDVQRWREDFLSD 473 (476) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHH T ss_conf 999999999999-87799999999997 No 69 >PHA01633 putative glycosyl transferase group 1 Probab=99.45 E-value=8.9e-12 Score=98.40 Aligned_cols=289 Identities=18% Similarity=0.275 Sum_probs=169.9 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEE Q ss_conf 77077888899999997198599980588665623478998998875432597089962854689999999970998199 Q gi|255764474|r 17 HTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKL 96 (352) Q Consensus 17 ~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~ 96 (352) ++.+......++.-++..|..+++++... .++.. .-.|+...+.++++...+..+..-..-++ T Consensus 10 yssi~nvaediaevlr~n~e~v~~~t~~~--~i~~a---------------eklivfvpf~pp~ln~yl~~y~~fkg~ky 72 (335) T PHA01633 10 YSSISNVSEDIAEVLRENGEIVTITKNPF--YIPKA---------------EKLIVFIPFHPPSLNPYLYAYYQFKGKKY 72 (335) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCC--CCCCC---------------CEEEEEEECCCCCCCHHHHHHHEECCCEE T ss_conf 05554178899999972895899824875--46764---------------55899961587321426764210047626 Q ss_pred EE-CCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCCCC----EEEECCCCCHHHHCCCCC-HHHHHHH--CCCCCCCCE Q ss_conf 97-574562011347989962668998898999743688----399748827778237843-2687887--288988727 Q gi|255764474|r 97 VF-TSPSQRNHSRWTRYLISRMDEVITTSQKSARFIERP----STVIMHGVDTERFRPTSN-KQEARRH--LKISEDAKL 168 (352) Q Consensus 97 ~~-~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~~----~~vi~~gid~~~~~~~~~-~~~~~~~--~~~~~~~~~ 168 (352) .+ +.....+.....+++++.. ..|..|..+++-+... .--+-|||+-........ .+++..+ ...| +..- T Consensus 73 fytt~dgvpn~~~~n~~l~~dv-tfipnsk~sa~nlq~agl~vdlpvfhginfk~vekae~lvpqlkqkl~kdfp-dt~k 150 (335) T PHA01633 73 FYTTCDGIPNIEIVNKYLLQDV-KFIPNSKFSAENLQEVGLQVDLPVFHGINFKIVENAEKLVPQLKQKLDKDFP-DTIK 150 (335) T ss_pred EEEECCCCCCHHHHHHHHHHHC-EECCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC-CCEE T ss_conf 8873488766789999987433-7577876435558653731154203255523245477661899877533488-6445 Q ss_pred EEEEEEECHHHHHHHHHHHHHHHHCCCCC----EEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CC--CCCC Q ss_conf 99951226445123666545653004774----058863133321000146677644311233221222---22--2331 Q gi|255764474|r 169 IGCFGRIRKLKGTDLFVDCMINILPHHPG----WTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFI---DE--QSSI 239 (352) Q Consensus 169 i~~~G~~~~~Kg~~~li~a~~~l~~~~~~----~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~---g~--~~~~ 239 (352) ++.++.+..+|++|++++++..+..+.|+ +++.++.. ++ ..++.+..+|+|. |+ ++++ T Consensus 151 fgvvsg~tkrknidl~lq~~n~lntkypd~ak~ihffvish---ed----------f~k~evpanvhfva~fg~q~~e~i 217 (335) T PHA01633 151 FGIVSGLTKRKNMDLMLQVFNELNTKYPDIAKKIHFFVISH---KQ----------FTQLEVPANVHFVAEFGHNSREYI 217 (335) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCCHHHCEEEEEEEH---HH----------HHHCCCCCCEEEHHHHCCCCHHHH T ss_conf 54664443234456999999886301887667147999626---77----------654248740310676477848999 Q ss_pred CCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCC-----------------CCCCCEEEEECCCCHHH Q ss_conf 0000000000012333222222100000100111027884101002-----------------34884249965999899 Q gi|255764474|r 240 EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLD-----------------PENAKAGVIVPPRNLHA 302 (352) Q Consensus 240 ~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~-----------------~~~g~~G~~~~~~d~~~ 302 (352) -.+|.+.|..++||-.||||+.++|+||+|+|||.--.+.+.|+-. .++++ .|-+-.-++++ T Consensus 218 ~afy~amdf~~vpsg~egfglpvlesmamgtpvihq~ipp~~eftswq~n~l~k~~kveey~~kehaq-kwki~rfq~ed 296 (335) T PHA01633 218 FAFYGAMDFTIVPSGTEGFGMPVLESMAMGTPVIHQLMPPLDEFTSWQWNLLIKSSKVEEYYDKEHGQ-KWKIHKFQIED 296 (335) T ss_pred HHHHHHCEEEECCCCCCCCCCHHHHHHHCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HEEEEECCHHH T ss_conf 99860030687358866458267877603763376407980032034225421030788877566543-10567248889 Q ss_pred HHHHHHHH--HCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 99999999--869899999999999999982998999999999999988 Q gi|255764474|r 303 LEKAVLYF--MNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 303 la~~i~~l--~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l 349 (352) ++.+|.-. .+|+++| +.+-++.+ +.|+.. .+|.+.| T Consensus 297 ~a~ail~a~~~qdreer---s~~l~ela-k~ydi~-------~lykrfl 334 (335) T PHA01633 297 MANAIILAFELQDREER---SMKLKELA-KKYDIR-------NLYTRFL 334 (335) T ss_pred HHHHHHHHHHHCCHHHH---HHHHHHHH-HHHHHH-------HHHHHHC T ss_conf 99999998760357888---89999999-873578-------9999740 No 70 >KOG2941 consensus Probab=99.43 E-value=4.8e-10 Score=86.73 Aligned_cols=309 Identities=11% Similarity=0.153 Sum_probs=195.0 Q ss_pred HHHHHHHHHHHHCCEEEEEECCCCCC-----------------------CCCC---C---HHHH-HHHHHHHHHCCCEEE Q ss_conf 88899999997198599980588665-----------------------6234---7---8998-998875432597089 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCLPKN-----------------------IPSI---G---ISSL-LTCWKKPIGQNSRIW 72 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~---~---~~~~-~~~~~~~~~~~~~iv 72 (352) .+-.-+..+.+.|..+...++..+.. .|.+ . ++++ ..+|.......++++ T Consensus 28 RMqYHA~Sla~~gf~VdliGy~~s~p~e~l~~hprI~ih~m~~l~~~~~~p~~~~l~lKvf~Qfl~Ll~aL~~~~~~~~i 107 (444) T KOG2941 28 RMQYHALSLAKLGFQVDLIGYVESIPLEELLNHPRIRIHGMPNLPFLQGGPRVLFLPLKVFWQFLSLLWALFVLRPPDII 107 (444) T ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 77899999997277278987317887688715896389727987525778622444899999999999999824688589 Q ss_pred EECCCH---HHHHHHHHHHHHCCCEEEEECCHHHHHH---------------HHHHHHHHHHCCEEEECCHHHHHHCC-- Q ss_conf 962854---6899999999709981999757456201---------------13479899626689988989997436-- Q gi|255764474|r 73 HARRNN---EMLLGVMMRDVLRMPLKLVFTSPSQRNH---------------SRWTRYLISRMDEVITTSQKSARFIE-- 132 (352) Q Consensus 73 h~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~l~~~~d~ii~~s~~~~~~~~-- 132 (352) -.+-++ .++...+...+.+.++.+++|+..+... .++.+.+-+.+|.-+|++..+++++. T Consensus 108 lvQNPP~iPtliv~~~~~~l~~~KfiIDWHNy~Ysl~l~~~~g~~h~lV~l~~~~E~~fgk~a~~nLcVT~AMr~dL~qn 187 (444) T KOG2941 108 LVQNPPSIPTLIVCVLYSILTGAKFIIDWHNYGYSLQLKLKLGFQHPLVRLVRWLEKYFGKLADYNLCVTKAMREDLIQN 187 (444) T ss_pred EEECCCCCCHHHHHHHHHHHHCCEEEEEEHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 97289987347999999987366589970225789998864288870588999999986133432224588899999873 Q ss_pred ---CCEEEECC-----CCCHHH----HCCCCC----------------HHHHHHH------CCCCCCCCEEEEEEEECHH Q ss_conf ---88399748-----827778----237843----------------2687887------2889887279995122644 Q gi|255764474|r 133 ---RPSTVIMH-----GVDTER----FRPTSN----------------KQEARRH------LKISEDAKLIGCFGRIRKL 178 (352) Q Consensus 133 ---~~~~vi~~-----gid~~~----~~~~~~----------------~~~~~~~------~~~~~~~~~i~~~G~~~~~ 178 (352) .+..|++- +.+.+. |.+... ...+..+ .-.+....+++.--.+++- T Consensus 188 Wgi~ra~v~YDrPps~~~~l~~~H~lf~~l~~d~~~f~ar~~q~~~~~~taf~~k~~s~~v~~~~~~pallvsSTswTpD 267 (444) T KOG2941 188 WGINRAKVLYDRPPSKPTPLDEQHELFMKLAGDHSPFRAREPQDKALERTAFTKKDASGDVQLLPERPALLVSSTSWTPD 267 (444) T ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHCCCCHHHHCCCCCEEEEECCCCCCC T ss_conf 38740489822798889855677888865335554244224333012245676513441233146787589953787876 Q ss_pred HHHHHHHHHHHHH-----HC--CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC-C--CCCCCCCCCCHHHHH Q ss_conf 5123666545653-----00--477405886313332100014667764431123322122-2--222331000000000 Q gi|255764474|r 179 KGTDLFVDCMINI-----LP--HHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF-I--DEQSSIEDWYRALNI 248 (352) Q Consensus 179 Kg~~~li~a~~~l-----~~--~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~-~--g~~~~~~~~~~~adi 248 (352) .+..-|++|+... .+ ..|...++|-|.||.+ +...+.|+++++.. |.+ + ...+|.+.+++.||+ T Consensus 268 Edf~ILL~AL~~y~~~~~~~~~~lP~llciITGKGPlk-----E~Y~~~I~~~~~~~-v~~~tpWL~aEDYP~ll~saDl 341 (444) T KOG2941 268 EDFGILLEALVIYEEQLYDKTHNLPSLLCIITGKGPLK-----EKYSQEIHEKNLQH-VQVCTPWLEAEDYPKLLASADL 341 (444) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHH-----HHHHHHHHHHCCCC-EEEEECCCCCCCCHHHHHCCCC T ss_conf 32789999998645554013579973799992788316-----89999987715321-0365022321100667631431 Q ss_pred HCCCCCC---CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC----CHHHHHHHH Q ss_conf 0012333---222222100000100111027884101002348842499659998999999999986----989999999 Q gi|255764474|r 249 FVAPPLY---EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN----SKKIMSDTG 321 (352) Q Consensus 249 ~i~pS~~---Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~----~~~~~~~~~ 321 (352) .|+...+ =..|++++....||+||++-+...+.|+|+ +|+||++++ |.+++++.+..+.. +.+...++. T Consensus 342 GVcLHtSSSGLDLPMKVVDMFGcglPvcA~~fkcl~ELVk--h~eNGlvF~--Ds~eLa~ql~~lf~~fp~~a~~l~~lk 417 (444) T KOG2941 342 GVCLHTSSSGLDLPMKVVDMFGCGLPVCAVNFKCLDELVK--HGENGLVFE--DSEELAEQLQMLFKNFPDNADELNQLK 417 (444) T ss_pred CEEEEECCCCCCCCHHHHHHHCCCCCEEEECCHHHHHHHH--CCCCCEEEC--CHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 3476304766676466777625787436645653899985--677853750--599999999999854999878999999 Q ss_pred HHHHHHHHHHCCHHHHHHHHHH Q ss_conf 9999999982998999999999 Q gi|255764474|r 322 NRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 322 ~~a~~~~~~~fs~~~~a~~~~~ 343 (352) ++.++. +...|+..-++... T Consensus 418 kn~~e~--~e~RW~~~W~~~~~ 437 (444) T KOG2941 418 KNLREE--QELRWDESWERTAL 437 (444) T ss_pred HHHHHH--HHHHHHHHHHHHHH T ss_conf 865787--76417888998643 No 71 >COG1519 KdtA 3-deoxy-D-manno-octulosonic-acid transferase [Cell envelope biogenesis, outer membrane] Probab=99.41 E-value=7.5e-10 Score=85.42 Aligned_cols=244 Identities=21% Similarity=0.229 Sum_probs=155.8 Q ss_pred HHHHCCCEEEEECCHH------HHHHHHHHHHHHHHCCEEEECCHHHHHHCC----CCEEEECCCCCHHHH-CCCC---- Q ss_conf 9970998199975745------620113479899626689988989997436----883997488277782-3784---- Q gi|255764474|r 87 RDVLRMPLKLVFTSPS------QRNHSRWTRYLISRMDEVITTSQKSARFIE----RPSTVIMHGVDTERF-RPTS---- 151 (352) Q Consensus 87 ~~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~----~~~~vi~~gid~~~~-~~~~---- 151 (352) ....+.|..++--+-+ +.....+.+.+.++.|.|++.|+..++... +++.+..| .... .+.+ T Consensus 142 ~~~~~~p~~LvNaRLS~rS~~~y~k~~~~~~~~~~~i~li~aQse~D~~Rf~~LGa~~v~v~GN---lKfd~~~~~~~~~ 218 (419) T COG1519 142 LKRRGIPLVLVNARLSDRSFARYAKLKFLARLLFKNIDLILAQSEEDAQRFRSLGAKPVVVTGN---LKFDIEPPPQLAA 218 (419) T ss_pred HHHCCCCEEEEEEEECHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCEEEECC---EEECCCCCHHHHH T ss_conf 9876998999942302325777987789999999742333454888899999649861386333---2423778732489 Q ss_pred CHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC--- Q ss_conf 32687887288988727999512264451236665456530047740588631333210001466776443112332--- Q gi|255764474|r 152 NKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK--- 228 (352) Q Consensus 152 ~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~--- 228 (352) .....|.+++.+ ++++...+. ...--+.++++++.+.+++|+..++++= ++.+..++..++++..|+.- T Consensus 219 ~~~~~r~~l~~~--r~v~iaaST--H~GEeei~l~~~~~l~~~~~~~llIlVP----RHpERf~~v~~l~~~~gl~~~~r 290 (419) T COG1519 219 ELAALRRQLGGH--RPVWVAAST--HEGEEEIILDAHQALKKQFPNLLLILVP----RHPERFKAVENLLKRKGLSVTRR 290 (419) T ss_pred HHHHHHHHCCCC--CCEEEEECC--CCCHHHHHHHHHHHHHHHCCCCEEEEEC----CCHHHHHHHHHHHHHCCCEEEEE T ss_conf 999999850888--855999547--7863889999999999638995699915----87556799999999759818861 Q ss_pred ----------CCCCCCCCCCCCCCCHHHHH-HCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCC-CCCCEEEEE Q ss_conf ----------21222222331000000000-001233322222210000010011102-78841010023-488424996 Q gi|255764474|r 229 ----------RILFIDEQSSIEDWYRALNI-FVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDP-ENAKAGVIV 295 (352) Q Consensus 229 ----------~V~~~g~~~~~~~~~~~adi-~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~-~~g~~G~~~ 295 (352) +|...+.--++..+|+.+|+ ||--|..+--|--++|+.++|+|+|.- .+-..+|+-.. ...+.|+.| T Consensus 291 S~~~~~~~~tdV~l~DtmGEL~l~y~~adiAFVGGSlv~~GGHN~LEpa~~~~pvi~Gp~~~Nf~ei~~~l~~~ga~~~v 370 (419) T COG1519 291 SQGDPPFSDTDVLLGDTMGELGLLYGIADIAFVGGSLVPIGGHNPLEPAAFGTPVIFGPYTFNFSDIAERLLQAGAGLQV 370 (419) T ss_pred CCCCCCCCCCCEEEEECHHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCCEEEE T ss_conf 37899988886899962868999973432799877446778988223787089788777513589999999866986997 Q ss_pred CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 599989999999999869899999999999999982998999999999999 Q gi|255764474|r 296 PPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 296 ~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) + |.+.+++++..+++|++.+++|++++.+.+.++ +...+++.+..+ T Consensus 371 ~--~~~~l~~~v~~l~~~~~~r~~~~~~~~~~v~~~---~gal~r~l~~l~ 416 (419) T COG1519 371 E--DADLLAKAVELLLADEDKREAYGRAGLEFLAQN---RGALARTLEALK 416 (419) T ss_pred C--CHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH T ss_conf 7--889999999997078899999999899999986---779999999755 No 72 >COG0707 MurG UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase [Cell envelope biogenesis, outer membrane] Probab=99.40 E-value=2e-10 Score=89.33 Aligned_cols=261 Identities=14% Similarity=0.106 Sum_probs=162.3 Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC-CCE Q ss_conf 8998875432597089962854689999999970998199975745620113479899626689988989997436-883 Q gi|255764474|r 57 LLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE-RPS 135 (352) Q Consensus 57 ~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~-~~~ 135 (352) .....+.....++|++-...+.......+++...+.|+.+.-+.... ....+++.+.++.+...-+....+.. ... T Consensus 80 ~~~a~~il~~~kPd~vig~Ggyvs~P~~~Aa~~~~iPv~ihEqn~~~---G~ank~~~~~a~~V~~~f~~~~~~~~~~~~ 156 (357) T COG0707 80 VLQARKILKKLKPDVVIGTGGYVSGPVGIAAKLLGIPVIIHEQNAVP---GLANKILSKFAKKVASAFPKLEAGVKPENV 156 (357) T ss_pred HHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHCCCCEEEEEECCCC---CHHHHHHHHHEEEEEECCCHHHCCCCCCCE T ss_conf 99999999970998999579863464999986169987999734664---265645323012577125112115786643 Q ss_pred EEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCC-CEEEE-EECCCCCCCCCH Q ss_conf 9974882777823784326878872889887279995122644512366654565300477-40588-631333210001 Q gi|255764474|r 136 TVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP-GWTAV-VVGKTTLKHYLF 213 (352) Q Consensus 136 ~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~-~~~l~-i~G~g~~~~~~~ 213 (352) .++.+.+..+.+. . .....+..... +.++|+.+|.-.-.+.+..++. ++..... ++.++ ++|... T Consensus 157 ~~tG~Pvr~~~~~-~-~~~~~~~~~~~--~~~~ilV~GGS~Ga~~ln~~v~---~~~~~l~~~~~v~~~~G~~~------ 223 (357) T COG0707 157 VVTGIPVRPEFEE-L-PAAEVRKDGRL--DKKTILVTGGSQGAKALNDLVP---EALAKLANRIQVIHQTGKND------ 223 (357) T ss_pred EEECCCCCHHHHC-C-CHHHHHHHCCC--CCCEEEEECCCHHHHHHHHHHH---HHHHHHCCCEEEEEECCCCH------ T ss_conf 7857846366521-6-35544320378--9848999888242799999999---99872121669999769736------ Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC---C-----HHCC Q ss_conf 4667764431123322122222233100000000000123332222221000001001110278841---0-----1002 Q gi|255764474|r 214 KKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF---S-----ELLD 285 (352) Q Consensus 214 ~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~---~-----e~i~ 285 (352) .++++...++++. +...++.+++..+|++||+.++-+ -++++-|..++|+|+|.-+-+.. . ..+. T Consensus 224 ~~~~~~~~~~~~~---~~v~~f~~dm~~~~~~ADLvIsRa----Ga~Ti~E~~a~g~P~IliP~p~~~~~~Q~~NA~~l~ 296 (357) T COG0707 224 LEELKSAYNELGV---VRVLPFIDDMAALLAAADLVISRA----GALTIAELLALGVPAILVPYPPGADGHQEYNAKFLE 296 (357) T ss_pred HHHHHHHHHCCCC---EEEEEHHHHHHHHHHHCCEEEECC----CHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHH T ss_conf 9999998720681---899766753999998645898678----664999999958988996589876441899999999 Q ss_pred CCCCCEEEEECCCC--HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 34884249965999--89999999999869899999999999999982998999999999 Q gi|255764474|r 286 PENAKAGVIVPPRN--LHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 286 ~~~g~~G~~~~~~d--~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) +...|.+++..+ ++.+.+.|.+++++++...+|.+++++....+ ..+..++.+++ T Consensus 297 --~~gaa~~i~~~~lt~~~l~~~i~~l~~~~~~l~~m~~~a~~~~~p~-aa~~i~~~~~~ 353 (357) T COG0707 297 --KAGAALVIRQSELTPEKLAELILRLLSNPEKLKAMAENAKKLGKPD-AAERIADLLLA 353 (357) T ss_pred --HCCCEEEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-HHHHHHHHHHH T ss_conf --6797699425547999999999999659899999999998717987-89999999999 No 73 >COG0380 OtsA Trehalose-6-phosphate synthase [Carbohydrate transport and metabolism] Probab=99.35 E-value=4.4e-10 Score=87.01 Aligned_cols=204 Identities=17% Similarity=0.221 Sum_probs=141.7 Q ss_pred CEEEECCCCCHHHHCCCCC-------HHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCE----EEEE Q ss_conf 8399748827778237843-------2687887288988727999512264451236665456530047740----5886 Q gi|255764474|r 134 PSTVIMHGVDTERFRPTSN-------KQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGW----TAVV 202 (352) Q Consensus 134 ~~~vi~~gid~~~~~~~~~-------~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~----~l~i 202 (352) .+...|-|||...|..... ..++++..+ .+..+|+.+.|+++-||+..-+.|+.+++.++|++ .++. T Consensus 245 ~v~a~PIgID~~~~~~~~~~~~v~~~~~el~~~~~--~~~kiivgvDRlDy~kGi~~rl~Afe~lL~~~Pe~~~kvvliQ 322 (486) T COG0380 245 KVGAFPIGIDPEEFERALKSPSVQEKVLELKAELG--RNKKLIVGVDRLDYSKGIPQRLLAFERLLEEYPEWRGKVVLLQ 322 (486) T ss_pred EEEEEEEECCHHHHHHHHCCCCHHHHHHHHHHHHC--CCCEEEEEEHHCCCCCCCHHHHHHHHHHHHHCHHHHCCEEEEE T ss_conf 89899401488999876168731567999999846--8866999740101246838999999999985856527468998 Q ss_pred ECCCCCCCCCHHHHHHHHHHHC-----CCCCCCCCCC--------CCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCE Q ss_conf 3133321000146677644311-----2332212222--------22331000000000001233322222210000010 Q gi|255764474|r 203 VGKTTLKHYLFKKNLQRRIYAN-----GLKKRILFID--------EQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASG 269 (352) Q Consensus 203 ~G~g~~~~~~~~~~l~~~i~~~-----~l~~~V~~~g--------~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G 269 (352) ++..+..+......+++.++++ +.-..+.|++ ..+++..+|..+|+++..|..||..++..|+.||- T Consensus 323 i~~pSr~~v~~y~~~~~~i~~~V~rIN~~fG~~~~~Pv~~l~~~~~~~~l~al~~~aDv~lVtplrDGMNLvakEyVa~q 402 (486) T COG0380 323 IAPPSREDVEEYQALRLQIEELVGRINGEFGSLSWTPVHYLHRDLDRNELLALYRAADVMLVTPLRDGMNLVAKEYVAAQ 402 (486) T ss_pred ECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHCEEEECCCCCCCCHHHHHHHHHH T ss_conf 41777556388999999999999998750078886416888625898899999963034453233454207788899861 Q ss_pred E----EEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 0----111027884101002348842499659998999999999986-98999999999999999829989999999999 Q gi|255764474|r 270 I----PVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 270 ~----PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) - +.|.|...|....+. + +++|+|.|.++++++|.+.++ .++++.+.-++-.+.+.+ ++....++.+++- T Consensus 403 ~~~~G~LiLSeFaGaa~~L~---~--AliVNP~d~~~va~ai~~AL~m~~eEr~~r~~~~~~~v~~-~d~~~W~~~fl~~ 476 (486) T COG0380 403 RDKPGVLILSEFAGAASELR---D--ALIVNPWDTKEVADAIKRALTMSLEERKERHEKLLKQVLT-HDVARWANSFLDD 476 (486) T ss_pred CCCCCCEEEECCCCCHHHHC---C--CEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH T ss_conf 47897289852656555643---3--8767888769999999998459999999999999999986-1399999999999 Q ss_pred H Q ss_conf 9 Q gi|255764474|r 345 Y 345 (352) Q Consensus 345 Y 345 (352) . T Consensus 477 l 477 (486) T COG0380 477 L 477 (486) T ss_pred H T ss_conf 8 No 74 >PRK12446 N-acetylglucosaminyl transferase; Reviewed Probab=99.28 E-value=1.6e-08 Score=76.51 Aligned_cols=308 Identities=13% Similarity=0.111 Sum_probs=181.0 Q ss_pred HHHEEEECCCCCCCCCCHHHH-HHHHHHHHHHHCCEEEEEECCCCC--------CCCC-----------CC--------- Q ss_conf 044589707878787707788-889999999719859998058866--------5623-----------47--------- Q gi|255764474|r 3 MNNIDVIAPNMKFRHTGVTST-VFGLCPIQRKLGQRLVVFGYCLPK--------NIPS-----------IG--------- 53 (352) Q Consensus 3 ~~~i~~i~~~~~~~~~Gv~~~-v~~~~~~~~~~~~~~~~~~~~~~~--------~~~~-----------~~--------- 53 (352) |++|.+.+.. +|...+ ...++..+++.++++..++...+- +++. .. T Consensus 1 MkkIii~~GG-----TGGHi~Palala~~L~~~~~~v~~ig~~~g~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 75 (352) T PRK12446 1 MKKIVFTGGG-----SAGHVTPNLAIIPKLIEDNWDISYIGSHQGIEKTIIEKENIPYYSISSGKLRRYFDLKNIKDPFL 75 (352) T ss_pred CCEEEEEECC-----CHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHH T ss_conf 9879999587-----58889999999999984899599998896054304450499689954477278552999999999 Q ss_pred -HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC Q ss_conf -8998998875432597089962854689999999970998199975745620113479899626689988989997436 Q gi|255764474|r 54 -ISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE 132 (352) Q Consensus 54 -~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~ 132 (352) ...++..++..+..+||++-.-.........+.+.+++.|+.+.-.+.. ...-.|++.+.++.++..-+.+..+++ T Consensus 76 l~~~~~~s~~il~~~kPd~Vig~GGY~S~P~~lAA~ll~iP~~ihEqNav---~G~aNr~l~~~a~~i~~~f~~~~~~~~ 152 (352) T PRK12446 76 VMKGVMDAYVRIRKLKPDVIFSKGGFVSVPVVIGGWLNRVPVLLHESDMT---PGLANKIALRFASKIFVTFEEAAKHLP 152 (352) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCEEEECCCHH---HHHHHHHHHHHHCCEEECCHHHHHCCC T ss_conf 99999999999996399999974987779999999985999699887467---778999999870712899624552088 Q ss_pred C-CEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHH-HHHHHHHHCCCCCEE-EEEECCCCCC Q ss_conf 8-839974882777823784326878872889887279995122644512366-654565300477405-8863133321 Q gi|255764474|r 133 R-PSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLF-VDCMINILPHHPGWT-AVVVGKTTLK 209 (352) Q Consensus 133 ~-~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~l-i~a~~~l~~~~~~~~-l~i~G~g~~~ 209 (352) + ++.++.|.+..+.. ..+....+..++++.+.++|+.+|.-.-.+.+..+ .+++.++.. ++. ++.+|.+..+ T Consensus 153 ~~k~~~tGnPvR~~i~--~~~~~~~~~~~~~~~~~~~iLV~GGSqGA~~iN~~v~~~l~~l~~---~~~iih~~g~~~~~ 227 (352) T PRK12446 153 KEKVIYTGSPVREEVL--KGNREKGLAFLGFSRKKPVITIMGGSLGAKKINETVREALPELLL---KYQIVHLCGKGNLD 227 (352) T ss_pred CCCEEECCCCCCHHHH--CCCHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHH---CCEEEEECCCCCHH T ss_conf 6736874886207654--035566787548887785799973751179999999999999851---97799992877156 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCC-CCCH---HCC Q ss_conf 000146677644311233221222222331000000000001233322222210000010011102788-4101---002 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG-VFSE---LLD 285 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~-~~~e---~i~ 285 (352) +.++.. +....+.-..+++..+|+.||+.++-| -..++.|-+++|+|.|.-+.+ ...+ .-+ T Consensus 228 -----~~~~~~------~~~~~~~~~~~~m~~~~~~aDlvIsRA----GAsTiaEl~~~g~PsIlIP~p~~a~~nHQ~~N 292 (352) T PRK12446 228 -----DSLQNK------EGYRQFEYVHGELPDILAMTDFVISRA----GSNAIFEFLTLQKPMLLIPLSKFASRGDQILN 292 (352) T ss_pred -----HHHHCC------CCCEECCCHHHHHHHHHHHCCEEEECC----CCHHHHHHHHHCCCEEEECCCCCCCCCHHHHH T ss_conf -----898501------360765724554999998498899778----70289999982998899628987775759999 Q ss_pred ---CCCCCEEEEECCC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf ---3488424996599--98999999999986989999999999999998299899999999 Q gi|255764474|r 286 ---PENAKAGVIVPPR--NLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 286 ---~~~g~~G~~~~~~--d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~ 342 (352) -++...+++++.. +++.+.+.|.+++.|++. |.+++++... .-.++++++.++ T Consensus 293 A~~l~~~gaa~vi~e~~l~~~~L~~~i~~l~~n~~~---~~~~~kk~~~-p~aa~~I~d~i~ 350 (352) T PRK12446 293 AESFERQGYASVLYEEDVTVNSLIKHVEELSHNNEK---YKTALKKYNG-KEAIQTIIDHIS 350 (352) T ss_pred HHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCHHH---HHHHHHHCCC-CCHHHHHHHHHH T ss_conf 999997798899641469999999999999849999---9999985079-559999999985 No 75 >PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed Probab=99.17 E-value=1.5e-08 Score=76.75 Aligned_cols=309 Identities=13% Similarity=0.160 Sum_probs=177.3 Q ss_pred HHHHHHHHHHHHCCEEEEEECCCC-------------CCCCCCCHHH-----------HHHHHHHHHHCCCEEEEECCCH Q ss_conf 888999999971985999805886-------------6562347899-----------8998875432597089962854 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCLP-------------KNIPSIGISS-----------LLTCWKKPIGQNSRIWHARRNN 78 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~-----------~~~~~~~~~~~~~~ivh~~~~~ 78 (352) +..++.+.+++....+..+|-+-+ ..+..+++.+ +..+.+.....++|++-.=..+ T Consensus 16 ~ga~li~~Lk~~~~~~~~~GiGG~~M~~~G~~~l~d~~~l~vmG~~evl~~~~~~~~~~~~~~~~i~~~~Pd~vi~ID~p 95 (382) T PRK00025 16 LGAGLIRALKAQAPNLEFVGIGGPRMEAAGFESLFDMEELAVMGLVEVLPRLPRLLKIRKELKRLLLAEPPDVFILIDAP 95 (382) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 99999999983198967999882999976995447757831301999997799999999999999986499999997783 Q ss_pred H--HHHHHHHHHHHCCCEEEEECCHHHH--HHHHHHHHHHHHCCEEEECCHHHHHHCCC---CEEEECCCCCHHHHCCCC Q ss_conf 6--8999999997099819997574562--01134798996266899889899974368---839974882777823784 Q gi|255764474|r 79 E--MLLGVMMRDVLRMPLKLVFTSPSQR--NHSRWTRYLISRMDEVITTSQKSARFIER---PSTVIMHGVDTERFRPTS 151 (352) Q Consensus 79 ~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~d~ii~~s~~~~~~~~~---~~~vi~~gid~~~~~~~~ 151 (352) + ...+..++.. ...++++++...+- ...+-.+.+-+.+|++++.=++..++..+ ++..|.|.+ .+.+.+.. T Consensus 96 gFnlrlak~lkk~-~~~ipvi~yv~PqvWAWr~~R~k~~~~~~D~ll~ifPFE~~~f~~~g~~~~fVGHPl-~d~~~~~~ 173 (382) T PRK00025 96 DFNLRLAKKLKKA-LPGIPIIHYVSPSVWAWRPGRAKKIAKATDHVLAILPFEAAFYDKLGVPVTYVGHPL-ADEIPLEP 173 (382) T ss_pred HHHHHHHHHHHHH-CCCCCEEEEECCHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCEECCCCH-HHHCCCCC T ss_conf 0659999999971-699988999471565406418999999998761087656899986599813569815-64322356 Q ss_pred CHHHHHHHCCCCCCCCEEEEE-E-EECHH-HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 326878872889887279995-1-22644-51236665456530047740588631333210001466776443112332 Q gi|255764474|r 152 NKQEARRHLKISEDAKLIGCF-G-RIRKL-KGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK 228 (352) Q Consensus 152 ~~~~~~~~~~~~~~~~~i~~~-G-~~~~~-Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~ 228 (352) ++...+..++++++.++|+.+ | |-++- +.+..+++++..+.+++|++++++.-..+. ..+.+++.++..+..- T Consensus 174 ~~~~~~~~~~~~~~~~~i~lLPGSR~~EI~~~lPi~l~a~~~l~~~~p~~~fvip~~~~~----~~~~i~~~~~~~~~~~ 249 (382) T PRK00025 174 DRAAARAELGLDPDARVLALLPGSRGSEIKRLLPTFLEAAQLLQQRYPDLRFVVPLANPK----RREQFEQLLAEYAPDL 249 (382) T ss_pred CHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHH----HHHHHHHHHHHCCCCC T ss_conf 879999873998556617870588589999978999999999998789939999558877----8999999998479998 Q ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-ECCCCCCCHHCCCCCCCE---------------- Q ss_conf 212222223310000000000012333222222100000100111-027884101002348842---------------- Q gi|255764474|r 229 RILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVV-ASNTGVFSELLDPENAKA---------------- 291 (352) Q Consensus 229 ~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI-~s~~~~~~e~i~~~~g~~---------------- 291 (352) .+.. ..++....|+.||+.+..| |.+.+|+..+|+|.| +-.+..+..++-..--.. T Consensus 250 ~i~~--~~~~~~~~l~~sd~ai~~S-----GTaTLE~al~~~P~Vv~Yk~~~lt~~i~k~lvkv~~isL~Nii~~k~ivP 322 (382) T PRK00025 250 SVTL--LDGQAREAMAAADAALAAS-----GTVTLEAALLGVPMVVGYKVKPLTFWIAKRLVKVPYISLPNLLAGEELVP 322 (382) T ss_pred CEEE--ECCCCHHHHHHCCEEEECC-----CHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCEEEHHHHHCCCCCCH T ss_conf 3898--2684177887388876537-----77999999971985899807899999999965699765248754997661 Q ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH---HHHHHHHHHHH Q ss_conf 4996599989999999999869899999999999999982998---99999999999 Q gi|255764474|r 292 GVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSI---VKEASDIGKVY 345 (352) Q Consensus 292 G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~---~~~a~~~~~iY 345 (352) -++-+..+++.+++++.++++|++.++++.+..++. .+.... ++.|+.+.++. T Consensus 323 EllQ~~~~~~~i~~~~~~ll~d~~~~~~~~~~~~~l-r~~L~~gas~raA~~Il~~l 378 (382) T PRK00025 323 ELLQEEATPEKLARALLELLADGARRQALLEGFTEL-HQYLRCGAAERAAQAVLELL 378 (382) T ss_pred HHHCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HHHHCCCHHHHHHHHHHHHH T ss_conf 340566999999999999966999999999999999-99857899999999999999 No 76 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=98.93 E-value=8.4e-09 Score=78.36 Aligned_cols=187 Identities=16% Similarity=0.221 Sum_probs=134.0 Q ss_pred HHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC-CCCCCCC Q ss_conf 878872889887279995122644512366654565300477405886313332100014667764431123-3221222 Q gi|255764474|r 155 EARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGL-KKRILFI 233 (352) Q Consensus 155 ~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l-~~~V~~~ 233 (352) ..|...|+|++..++++.++ ..|-...+.+-|.++++.-|+-.|++.|.|+.. .....++...++.|+ .+|..|+ T Consensus 419 ~sR~~lglp~~avVf~c~~n--~~K~~pev~~~wmqIL~~vP~Svl~L~~~~~~~--~~~~~l~~la~~~Gv~~eRL~f~ 494 (620) T COG3914 419 PSRAQLGLPEDAVVFCCFNN--YFKITPEVFALWMQILSAVPNSVLLLKAGGDDA--EINARLRDLAEREGVDSERLRFL 494 (620) T ss_pred CCHHHCCCCCCEEEEEECCC--CCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCH--HHHHHHHHHHHHCCCCHHHEEEC T ss_conf 43210599998089996688--644787999999999984898579982689868--89999999999708981334626 Q ss_pred CCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-------CCCCCCHHCCCCCCCEEEEECCCCHHHHH Q ss_conf 222--331000000000001233322222210000010011102-------78841010023488424996599989999 Q gi|255764474|r 234 DEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-------NTGVFSELLDPENAKAGVIVPPRNLHALE 304 (352) Q Consensus 234 g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-------~~~~~~e~i~~~~g~~G~~~~~~d~~~la 304 (352) +.. ++....|..||+++=+--+ |-..+..||+.+|+||++- +.|+ .++ ...|..-+++. +.++.. T Consensus 495 p~~~~~~h~a~~~iADlvLDTyPY-~g~TTa~daLwm~vPVlT~~G~~FasR~~~--si~-~~agi~e~vA~--s~~dYV 568 (620) T COG3914 495 PPAPNEDHRARYGIADLVLDTYPY-GGHTTASDALWMGVPVLTRVGEQFASRNGA--SIA-TNAGIPELVAD--SRADYV 568 (620) T ss_pred CCCCCHHHHHHHCHHHEEEECCCC-CCCCCHHHHHHHCCCEEEECCHHHHHHHHH--HHH-HHCCCCHHHCC--CHHHHH T ss_conf 999988999862313246524667-886426777873584465111778876059--999-86698024218--888999 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHH--HCCHHHHHHHHHHHHHHHHHH Q ss_conf 99999986989999999999999998--299899999999999998860 Q gi|255764474|r 305 KAVLYFMNSKKIMSDTGNRGRERAVK--HFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 305 ~~i~~l~~~~~~~~~~~~~a~~~~~~--~fs~~~~a~~~~~iY~~~l~~ 351 (352) +.-..+-+|...+++.+..-+..... -|+.+..+++++.+|..+.++ T Consensus 569 ~~av~~g~dral~q~~r~~l~~~r~tspL~d~~~far~le~~y~~M~~~ 617 (620) T COG3914 569 EKAVAFGSDRALRQQVRAELKRSRQTSPLFDPKAFARKLETLYWGMWSE 617 (620) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9999853417778755799984032674568899999999999999986 No 77 >TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; InterPro: IPR006009 The murG gene of Escherichia coli encodes the N-acetylglucosaminyltransferase, UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase, responsible for the final step in the formation of the lipid-linked disaccharide-pentapeptide subunit of peptidoglycan. The enzyme is peripherally associated with the inner face of the cytoplasmic membrane. Therefore, the peptidoglycan subunit is completely assembled before it traverses the cytoplasmic membrane .; GO: 0050511 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase activity, 0019277 UDP-N-acetylgalactosamine biosynthetic process, 0009276 1-2nm peptidoglycan-based cell wall. Probab=98.90 E-value=4.8e-07 Score=66.52 Aligned_cols=304 Identities=16% Similarity=0.141 Sum_probs=191.3 Q ss_pred CCHHHHHHHHHHHHHHHC--CEEEEEECCC-------C-C-CCCC----------------------CCHHHHHHHHHHH Q ss_conf 707788889999999719--8599980588-------6-6-5623----------------------4789989988754 Q gi|255764474|r 18 TGVTSTVFGLCPIQRKLG--QRLVVFGYCL-------P-K-NIPS----------------------IGISSLLTCWKKP 64 (352) Q Consensus 18 ~Gv~~~v~~~~~~~~~~~--~~~~~~~~~~-------~-~-~~~~----------------------~~~~~~~~~~~~~ 64 (352) +|---=...+...+.+.+ ..+.+.|... + . .++. .-....+...+.. T Consensus 15 TGG~fPAlA~a~~l~~~~~~~~v~~lG~~~g~e~~lv~~~~~~~~~~i~~~gl~~~~~~~~~~~~~~~~~~~~~~a~~~l 94 (368) T TIGR01133 15 TGGIFPALAVAEELIKRGPEVEVVWLGTKRGLEERLVPSKEGIEFLTIPVGGLRRKGSKKLLKLPLLKLLKAVLQARRIL 94 (368) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 30268999999999974893699985067750000343215741777740100365510146788999999999999998 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHC-CCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC--CCEEEECCC Q ss_conf 325970899628546899999999709-98199975745620113479899626689988989997436--883997488 Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVLR-MPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE--RPSTVIMHG 141 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~--~~~~vi~~g 141 (352) +..+++++-.-.......+.+++...+ +|+.+= .+. ....-.||+-+.++.|...-+.+..+.+ +++.++-|- T Consensus 95 ~~~~p~~v~G~GGY~s~P~~~AA~l~g~iP~~~E-QN~---~pG~~Nk~ls~~A~~V~~~f~~~~~~~~~~~~~~~~g~p 170 (368) T TIGR01133 95 KKFKPDVVVGFGGYVSGPAGLAAKLLGRIPLILE-QNA---VPGLTNKLLSRFAKRVLVSFPGAKKFFPAAEKVVVVGNP 170 (368) T ss_pred HHCCCCEEEECCCHHHHHHHHHHHHCCCCCEEEE-ECC---HHHHHHHHHHHHHCEEEEECHHHHCCCCCCCCEEEECCH T ss_conf 6008747987473678999999876679948986-154---125788887887443111051332267666756870141 Q ss_pred CCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHH-HHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHH Q ss_conf 27778237843268788728898872799951226445123-66654565300477405886313332100014667764 Q gi|255764474|r 142 VDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTD-LFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRR 220 (352) Q Consensus 142 id~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~-~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~ 220 (352) +-.+.+.... ...+..+..+++...|+.+|.=--.+-+- .+.+++..|.++.....+...|..+ .++.++. T Consensus 171 vr~~~~~~~~--~~~~~~~~~~~~~~~ilv~GGSQGA~~lN~~vp~~~~~L~~~~~~~~~~~~g~~~------~~~~~~~ 242 (368) T TIGR01133 171 VREEIRSLPA--ARARKRFKLRPGKLRILVLGGSQGAKILNELVPKALAKLAEKGLILVIIQGGKGD------LEKVKNV 242 (368) T ss_pred HHHHHHCCCC--HHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHH------HHHHHHH T ss_conf 3454303782--5688862168998279996273768999999999998864016525888766377------9999998 Q ss_pred HHHCC-CCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCC-CC--CHHCC----CCCCCEE Q ss_conf 43112-33221222222331000000000001233322222210000010011102788-41--01002----3488424 Q gi|255764474|r 221 IYANG-LKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG-VF--SELLD----PENAKAG 292 (352) Q Consensus 221 i~~~~-l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~-~~--~e~i~----~~~g~~G 292 (352) ..+.+ ....+...-+..|+...|+.||+.|+=+ --.++-|=++.|+|.|-=+-+ .. .+-.. .++...| T Consensus 243 y~~~~l~~~~~~~f~~~~dm~~~y~~ADLvIsRA----GA~T~~El~a~G~PaIliPyP~a~~r~~Q~~NA~~l~~~gag 318 (368) T TIGR01133 243 YSELGLVAAKITPFIDNEDMAAAYAAADLVISRA----GASTVAELAAAGVPAILIPYPYAAKRDDQYYNAKFLEDAGAG 318 (368) T ss_pred HHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 5213710222103778757999998740400233----378999999717773762587546817899999999734465 Q ss_pred EEECCC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 996599--9899999999998698999999999999999829989999 Q gi|255764474|r 293 VIVPPR--NLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEA 338 (352) Q Consensus 293 ~~~~~~--d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a 338 (352) ++++.. .++.+.+.+..+..++..+.+|++++++....+ +.+.++ T Consensus 319 ~~~~q~~~~~e~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~-a~~~~~ 365 (368) T TIGR01133 319 IVIEQKELLTEKLVSALLKLLKDPARLEEMAEAARKLAKPD-AAKRIA 365 (368) T ss_pred EEEHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCC-HHHHHH T ss_conf 46402204768999998741610899999999999861572-779999 No 78 >TIGR02398 gluc_glyc_Psyn glucosylglycerol-phosphate synthase; InterPro: IPR012764 Glucosylglycerol-phosphate synthase catalyzes the key step in the biosynthesis of the osmolyte glucosylglycerol. It is known in several cyanobacteria and in Pseudomonas anguilliseptica. The enzyme is closely related to the alpha,alpha-trehalose-phosphate synthase, likewise involved in osmolyte biosynthesis, of Escherichia coli and many other bacteria. A close homolog from Xanthomonas campestris is excluded from this family.; GO: 0016758 transferase activity transferring hexosyl groups, 0051473 glucosylglycerol biosynthetic process. Probab=98.89 E-value=1.9e-08 Score=75.97 Aligned_cols=201 Identities=20% Similarity=0.274 Sum_probs=144.4 Q ss_pred ECCCCCHHHHCCCCC----H---HHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCC----EEEEEECCC Q ss_conf 748827778237843----2---68788728898872799951226445123666545653004774----058863133 Q gi|255764474|r 138 IMHGVDTERFRPTSN----K---QEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPG----WTAVVVGKT 206 (352) Q Consensus 138 i~~gid~~~~~~~~~----~---~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~----~~l~i~G~g 206 (352) -|=|.|++.+..... + ..++.. +.++.+.||.++|+++-||...+|-|+-+|+.++|+ +.|+++.-. T Consensus 257 ~PVG~~p~~ir~~~~~a~~~e~~e~ir~e--l~kg~KLILSa~RvDYtKG~~E~L~ayERLLErrpeL~gkV~Lv~acv~ 334 (495) T TIGR02398 257 FPVGTDPERIREAVAAASVRERIEKIREE--LAKGRKLILSAERVDYTKGIKELLVAYERLLERRPELLGKVVLVVACVK 334 (495) T ss_pred CCCCCCHHHHHHHHHCCCHHHHHHHHHHH--HCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEEHH T ss_conf 35578889999987345478999999998--6048526885010011044157788888752258230541340244426 Q ss_pred CCCCCCHHHHHHHHHHHCC--CCC---CCCCCC--------CCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCC----- Q ss_conf 3210001466776443112--332---212222--------2233100000000000123332222221000001----- Q gi|255764474|r 207 TLKHYLFKKNLQRRIYANG--LKK---RILFID--------EQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMAS----- 268 (352) Q Consensus 207 ~~~~~~~~~~l~~~i~~~~--l~~---~V~~~g--------~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~----- 268 (352) ..+..--.++++..|+++- +.+ +|-|++ ..+++..+|..|||.=.+-..+|+-||.=|+.++ T Consensus 335 AA~gM~iY~e~Q~~IE~~vGRINGRFa~~~W~P~~fftr~lpyEE~~awf~~ADv~WITPLRDGLNLVAKEyvaA~g~~d 414 (495) T TIGR02398 335 AASGMKIYEELQSEIERLVGRINGRFARVDWTPVRFFTRSLPYEELVAWFAAADVAWITPLRDGLNLVAKEYVAARGLRD 414 (495) T ss_pred HCCCCEEHHHHHHHHHHHHCCCCCEECCCCCEEHHHHHHCCCHHHHHHHHHCCCCEECCCCCCCHHHHHHHHHHHCCCCC T ss_conf 60677726876688998606424403134654010442116678999997414311206763421478899999515889 Q ss_pred E--EEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 0--01110278841010023488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 269 G--IPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 269 G--~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) | +-|++-=.|..-++ + .+++-+|.++..+.++|...|+-|+.-++-.-+.-..+.+.|+.+..++.+.+-.. T Consensus 415 GrGVLvLSEFaGaaveL-~-----gA~LTNPy~~~rMde~i~~ALaMP~~Eqq~Rm~~m~~av~~ydvq~Wa~e~la~~~ 488 (495) T TIGR02398 415 GRGVLVLSEFAGAAVEL-K-----GAVLTNPYDAKRMDEAIEVALAMPKEEQQRRMKEMFDAVEAYDVQRWADEFLAEVK 488 (495) T ss_pred CCEEEEEECCCCCCEEE-C-----CCEECCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 96689851112540121-6-----64312888755357999998506388999999999999854206778999985078 No 79 >TIGR03492 conserved hypothetical protein. This protein family is restricted to the Cyanobacteria, in one or two copies, save for instances in the genus Deinococcus. This protein shows some sequence similarity, especially toward the C-terminus, to lipid-A-disaccharide synthase (TIGR00215 or pfam02684). The function is unknown. Probab=98.82 E-value=1.9e-06 Score=62.50 Aligned_cols=242 Identities=16% Similarity=0.165 Sum_probs=135.9 Q ss_pred HHHHHHHHHCCCEEEEECCH--HH------------------HHHHHHHHHHHH--HCCEEEECCHHHHHHCCC---CEE Q ss_conf 99999997099819997574--56------------------201134798996--266899889899974368---839 Q gi|255764474|r 82 LGVMMRDVLRMPLKLVFTSP--SQ------------------RNHSRWTRYLIS--RMDEVITTSQKSARFIER---PST 136 (352) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~--~~------------------~~~~~~~~~l~~--~~d~ii~~s~~~~~~~~~---~~~ 136 (352) ..+.+++..+.|+.++.+.. .+ ....+|.+|+.+ +|..|++-=+.+++.+.. +.. T Consensus 104 ~pl~~A~~sg~p~~~~~~~~S~yy~~~~~~~~~~~~~~~~~g~~~~Pwe~~lm~~~rc~~Vf~RD~lTA~~L~~~gi~a~ 183 (396) T TIGR03492 104 VPLLFAWLSGKPYAFVGTAKSDYYWESGPRRSPSDEYHRLEGSLYLPWERWLMRSRRCLAVFVRDRLTARDLRRQGVRAS 183 (396) T ss_pred HHHHHHHHCCCCCEEEEECCCCEEEECCCCCCCHHHHHCCCEEECCHHHHHHHCCCCCCEEECCCHHHHHHHHHCCCCEE T ss_conf 88899998189816997045323660687753012332155178167799974096652995055887999997799649 Q ss_pred EECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEE-EEEE-EE-CHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCH Q ss_conf 974882777823784326878872889887279-9951-22-64451236665456530047740588631333210001 Q gi|255764474|r 137 VIMHGVDTERFRPTSNKQEARRHLKISEDAKLI-GCFG-RI-RKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLF 213 (352) Q Consensus 137 vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i-~~~G-~~-~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~ 213 (352) .+-|.+ ++.+.+... ..++++.+.| +.-| |. +-.++...+++++..+.+..| +.+++.-.... . T Consensus 184 f~GnPm-MD~l~~~~~-------~~~~~~~~~I~LLPGSR~pEa~~nl~~~L~a~~~l~~~~~-~~f~~alap~l----~ 250 (396) T TIGR03492 184 YLGNPM-MDGLEPPER-------KPLLTGRFRIALLPGSRPPEAYRNLKLLLRALEALPDSQP-FVFLAAIVPSL----S 250 (396) T ss_pred EECCCH-HCCCCCCCC-------CCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCC-EEEEEEECCCC----C T ss_conf 608734-137888887-------6678878679995898859999879999999996341488-69999868999----9 Q ss_pred HHHHHHHHHHCCCCCC-------CCCC-------CCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC Q ss_conf 4667764431123322-------1222-------2223310000000000012333222222100000100111027884 Q gi|255764474|r 214 KKNLQRRIYANGLKKR-------ILFI-------DEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV 279 (352) Q Consensus 214 ~~~l~~~i~~~~l~~~-------V~~~-------g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~ 279 (352) .+.++..+...|.... ..|. =..+...++++.||+.+..| |.+..|+...|+|+|+-.-.| T Consensus 251 ~~~l~~~l~~~Gw~~~~~~~~~~~~~~~~~~~v~~~~~~f~~~l~~adl~ia~A-----GTAteQ~vgLG~Pvv~l~g~G 325 (396) T TIGR03492 251 LEKLQAILEDLGWQLEGSSEDQTSLFQKGTLEVLLGRGAFAEILHWADLGIAMA-----GTATEQAVGLGKPVIQLPGKG 325 (396) T ss_pred HHHHHHHHHHCCCEECCCCCCCCHHEECCCEEEEEECCCHHHHHHHCHHHHHCC-----CCHHHHHHHCCCCEEEECCCC T ss_conf 899999999659700578654200010487689997384899998551144437-----709999987189879972787 Q ss_pred C------CHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 1------01002348842499659998999999999986989999999999999998299899999999 Q gi|255764474|r 280 F------SELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 280 ~------~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~ 342 (352) - .+.-..--|.. ......+++.+++.+..++.|++.+++++++++++.=+-=-.+++|+.+. T Consensus 326 PQfT~~fA~~Q~RLLG~s-v~~~~~~p~~ia~~~~~lL~d~~~~~~~~~~gr~RlG~~Gas~RiA~~il 393 (396) T TIGR03492 326 PQFTYGFAEAQSRLLGGS-VFLASKNPEQAAQVVRQLLADPELLERCRRNGQERMGPPGASARIAESIL 393 (396) T ss_pred CHHHHHHHHHHHHHCCCE-EEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 277799999999862753-35268999999999999854999999999999985588658899999999 No 80 >COG0763 LpxB Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane] Probab=98.81 E-value=1.3e-06 Score=63.56 Aligned_cols=251 Identities=14% Similarity=0.188 Sum_probs=141.4 Q ss_pred HHHHHCCCEEEEECCCHH--HHHHHHHHHHHCCCEEEEECCHHHHHH--HHHHHHHHHHCCEEEECCHHHHHHCCC---C Q ss_conf 754325970899628546--899999999709981999757456201--134798996266899889899974368---8 Q gi|255764474|r 62 KKPIGQNSRIWHARRNNE--MLLGVMMRDVLRMPLKLVFTSPSQRNH--SRWTRYLISRMDEVITTSQKSARFIER---P 134 (352) Q Consensus 62 ~~~~~~~~~ivh~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~l~~~~d~ii~~s~~~~~~~~~---~ 134 (352) +.....++|++-.=..++ ......++.. +..++++++...+-.. ..-..-+.+.+|++++.-++...+..+ + T Consensus 79 ~~i~~~kpD~~i~IDsPdFnl~vak~lrk~-~p~i~iihYV~PsVWAWr~~Ra~~i~~~~D~lLailPFE~~~y~k~g~~ 157 (381) T COG0763 79 RYILANKPDVLILIDSPDFNLRVAKKLRKA-GPKIKIIHYVSPSVWAWRPKRAVKIAKYVDHLLAILPFEPAFYDKFGLP 157 (381) T ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHHHH-CCCCCEEEEECCCEEEECHHHHHHHHHHHHHEEEECCCCHHHHHHCCCC T ss_conf 999855998899967898864999999970-8999869997853056552168999997617214367788999855997 Q ss_pred EEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEE--EEECHH-HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 39974882777823784326878872889887279995--122644-512366654565300477405886313332100 Q gi|255764474|r 135 STVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCF--GRIRKL-KGTDLFVDCMINILPHHPGWTAVVVGKTTLKHY 211 (352) Q Consensus 135 ~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~--G~~~~~-Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~ 211 (352) +..|.|.+- +...-.+++.++|..++++.+.+++... +|-++- +-...+.+|+..+..++|+.++++--..... T Consensus 158 ~~yVGHpl~-d~i~~~~~r~~ar~~l~~~~~~~~lalLPGSR~sEI~rl~~~f~~a~~~l~~~~~~~~~vlp~~~~~~-- 234 (381) T COG0763 158 CTYVGHPLA-DEIPLLPDREAAREKLGIDADEKTLALLPGSRRSEIRRLLPPFVQAAQELKARYPDLKFVLPLVNAKY-- 234 (381) T ss_pred EEEECCHHH-HHCCCCCCHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHH-- T ss_conf 089678056-42344345789999828997787699816885889998778999999999865899659995684788-- Q ss_pred CHHHHHHHHHHHCCCC-CCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCC Q ss_conf 0146677644311233-221222222331000000000001233322222210000010011102-78841010023488 Q gi|255764474|r 212 LFKKNLQRRIYANGLK-KRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENA 289 (352) Q Consensus 212 ~~~~~l~~~i~~~~l~-~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g 289 (352) +.++......... ..++.. -.+-.+.|.+||+.+..| |.+.+|+|.+|+|.|++ .+..+.-++-..-. T Consensus 235 ---~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~aD~al~aS-----GT~tLE~aL~g~P~Vv~Yk~~~it~~iak~lv 304 (381) T COG0763 235 ---RRIIEEALKWEVAGLSLILI--DGEKRKAFAAADAALAAS-----GTATLEAALAGTPMVVAYKVKPITYFIAKRLV 304 (381) T ss_pred ---HHHHHHHHHCCCCCCEEEEE--CCHHHHHHHHHHHHHHHC-----CHHHHHHHHHCCCEEEEEECCHHHHHHHHHHC T ss_conf ---99999986233457607840--745789999841899844-----67999999829997999944588999999861 Q ss_pred CEEE-----------EE-----CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 4249-----------96-----5999899999999998698999999999999 Q gi|255764474|r 290 KAGV-----------IV-----PPRNLHALEKAVLYFMNSKKIMSDTGNRGRE 326 (352) Q Consensus 290 ~~G~-----------~~-----~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~ 326 (352) ...+ ++ +...++.+++++..++.|...+.++.+..++ T Consensus 305 k~~yisLpNIi~~~~ivPEliq~~~~pe~la~~l~~ll~~~~~~~~~~~~~~~ 357 (381) T COG0763 305 KLPYVSLPNILAGREIVPELIQEDCTPENLARALEELLLNGDRREALKEKFRE 357 (381) T ss_pred CCCCCCCHHHHCCCCCCHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 67724436886187304677753169999999999983486769999999999 No 81 >pfam02684 LpxB Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases, EC:2.4.2.128. These enzymes catalyse the reaction: UDP-2,3-bis(3-hydroxytetradecanoyl) glucosamine + 2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate <= UDP + 2,3-bis(3-hydroxytetradecanoyl)-D-glucosaminyl-1,6 -beta-D-2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate. These enzymes catalyse the fist disaccharide step in the synthesis of lipid-A-disaccharide. Probab=98.73 E-value=3.4e-06 Score=60.85 Aligned_cols=293 Identities=13% Similarity=0.110 Sum_probs=158.8 Q ss_pred HHHHHHHHHHHHCCEEEEEECCCC------C-------CCCCCCHHH-----------HHHHHHHHHHCCCEEEEECCCH Q ss_conf 888999999971985999805886------6-------562347899-----------8998875432597089962854 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCLP------K-------NIPSIGISS-----------LLTCWKKPIGQNSRIWHARRNN 78 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~~------~-------~~~~~~~~~-----------~~~~~~~~~~~~~~ivh~~~~~ 78 (352) +..++.+.+++....+..+|-+-+ . .+..+++.. ...+.+.....++|++-.=..+ T Consensus 13 ~ga~Li~~Lk~~~p~i~~~GvGG~~M~~~G~~~l~d~~~lsvmG~~evl~~l~~l~~~~~~~~~~i~~~~PD~vIlID~p 92 (373) T pfam02684 13 LGGELIQNLKEHYPNLRFIGVGGPKMEAEGFESLANMEEISVMGFIEVLGRLFKLLKIYQKVVKNILKKKPDTLILIDAP 92 (373) T ss_pred HHHHHHHHHHHCCCCCEEEEECCHHHHHCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 99999999983098948999880899977995347727841401999998999999999999998742699889997176 Q ss_pred --HHHHHHHHHHHHCCCEEEEECCHHHHH--HHHHHHHHHHHCCEEEECCHHHHHHCCC---CEEEECCCCCHHHHCCCC Q ss_conf --689999999970998199975745620--1134798996266899889899974368---839974882777823784 Q gi|255764474|r 79 --EMLLGVMMRDVLRMPLKLVFTSPSQRN--HSRWTRYLISRMDEVITTSQKSARFIER---PSTVIMHGVDTERFRPTS 151 (352) Q Consensus 79 --~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~~d~ii~~s~~~~~~~~~---~~~vi~~gid~~~~~~~~ 151 (352) ....+..++. .+.+++++++...+-. ..+-.+.+-+.+|+++|.=++..++..+ ++..|.|.+ .+...+.. T Consensus 93 gFNlrlak~lkk-~g~~ipvi~yV~PqvWAWr~~R~k~i~~~~D~ll~IfPFE~~~y~~~gv~~~fVGHPl-~d~~~~~~ 170 (373) T pfam02684 93 DFNLRLAKKLRK-LGPKLKIIHYVSPSVWAWRPKRAKIIAKYTDLLLAILPFEKGFFQKFGLDCRYVGHPL-LDEIKLYK 170 (373) T ss_pred CHHHHHHHHHHH-CCCCCCEEEEECCCEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCEEECCCCH-HHHHCCCC T ss_conf 155999999997-1999878999688422127158999999998731289887899986099715759811-65401377 Q ss_pred CHHHHHHHCCCCCCCCEEEE-EE-EECHHH-HHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 32687887288988727999-51-226445-1236665456530047740588631333210001466776443112332 Q gi|255764474|r 152 NKQEARRHLKISEDAKLIGC-FG-RIRKLK-GTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK 228 (352) Q Consensus 152 ~~~~~~~~~~~~~~~~~i~~-~G-~~~~~K-g~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~ 228 (352) .....++..+.+ +...|+. -| |-++-| ....+++++.++.++.|++++++.-..+.. .+..+......+..- T Consensus 171 ~~~~~~~~~~~~-~~~~i~lLPGSR~~EI~~~lPi~l~aa~~l~~~~~~~~~~ip~~~~~~----~~~~~~~~~~~~~~~ 245 (373) T pfam02684 171 KRASAKELLIDH-DEPILALLPGSRKGEIRRLLPPIQVAAQQLSSQFPNLKLLVPLVNKKY----EEQIEDTLALENCDV 245 (373) T ss_pred CHHHHHHHCCCC-CCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHH----HHHHHHHHHHCCCCC T ss_conf 658999746898-775577678886999999999999999999976899189996588789----999999998649998 Q ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCC----------------CCCCE Q ss_conf 21222222331000000000001233322222210000010011102-78841010023----------------48842 Q gi|255764474|r 229 RILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDP----------------ENAKA 291 (352) Q Consensus 229 ~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~----------------~~g~~ 291 (352) .+.. ...+..+.|++||+.+..| |.+.+|++.+|+|.|+. .+..++.++-. ..-.- T Consensus 246 ~i~~--~~~~~~~~~~~sd~al~~S-----GTaTLE~al~~~P~vV~Yk~~~lty~i~k~lvkv~~isL~Nii~~k~ivP 318 (373) T pfam02684 246 SLLE--ISGERYEAMFAADAALAKS-----GTATLEAALAGTPMVVAYRVKPLTFFLAKRLVKIDYISLPNIILNREIVP 318 (373) T ss_pred CEEE--CCCCHHHHHHHCCEEEECC-----CHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHCCCEEEHHHHHCCCCCCH T ss_conf 7898--0572499998486501216-----76999999819998999957789999999998389544348866998673 Q ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 49965999899999999998698999999999999999 Q gi|255764474|r 292 GVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAV 329 (352) Q Consensus 292 G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~ 329 (352) -++-+..+++.+++++.+++.|++.++++.+..++.-+ T Consensus 319 EllQ~~~t~~~ia~~~~~lL~d~~~~~~~~~~~~~~~~ 356 (373) T pfam02684 319 EFIQGECDAHLEAIALLLLLLNGLKAKKQKDRCRKFKQ 356 (373) T ss_pred HHHCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 56475178999999999996799999999999999999 No 82 >KOG3742 consensus Probab=98.70 E-value=1.4e-07 Score=70.05 Aligned_cols=112 Identities=25% Similarity=0.356 Sum_probs=76.4 Q ss_pred CCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC----CHHCCCCCCCEEEEE-------CCCCHHHHHHHH Q ss_conf 100000000000123332222221000001001110278841----010023488424996-------599989999999 Q gi|255764474|r 239 IEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVF----SELLDPENAKAGVIV-------PPRNLHALEKAV 307 (352) Q Consensus 239 ~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~----~e~i~~~~g~~G~~~-------~~~d~~~la~~i 307 (352) ...+.+.|++.++||++|++|-+..|--.+|+|.|+|+..|. .|.|. .+...|+++ +.++.+++++-+ T Consensus 494 YeeFVRGCHLGVFPSYYEPWGYTPAECTVMGiPSvtTNlSGFGcfMeehi~-d~~ayGIYIvDRRfks~deSv~qL~~~m 572 (692) T KOG3742 494 YEEFVRGCHLGVFPSYYEPWGYTPAECTVMGIPSVTTNLSGFGCFMEEHIE-DPQAYGIYIVDRRFKSPDESVQQLASFM 572 (692) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCHHEEEECCCCCCCCCCCHHHHHHHHHC-CCHHCEEEEEECCCCCHHHHHHHHHHHH T ss_conf 899852343344633247777883240784366301056303464898735-7123417998432478266899999999 Q ss_pred HHHHCCHHHHHHHHHHHH-HHHHHHCCHHHHHHHHHHHHHHHHHHC Q ss_conf 999869899999999999-999982998999999999999988609 Q gi|255764474|r 308 LYFMNSKKIMSDTGNRGR-ERAVKHFSIVKEASDIGKVYDRLLRTA 352 (352) Q Consensus 308 ~~l~~~~~~~~~~~~~a~-~~~~~~fs~~~~a~~~~~iY~~~l~~a 352 (352) .+... ...|+++.++.| ++..+-.+|+.+..-|.+.=+-.|++| T Consensus 573 ~~F~~-qsRRQRIiqRNrtErLSdLLDWk~lG~~Y~~aR~laL~r~ 617 (692) T KOG3742 573 YEFCK-QSRRQRIIQRNRTERLSDLLDWKYLGRYYRKARHLALSRA 617 (692) T ss_pred HHHHH-HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99988-8789999871232567787728877589999899998753 No 83 >PRK09814 hypothetical protein; Provisional Probab=98.63 E-value=3.1e-05 Score=54.40 Aligned_cols=270 Identities=12% Similarity=0.086 Sum_probs=137.9 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCC--HHHHH-HHHHHHHHCCCEE Q ss_conf 07788889999999719859998058866562347899899887543259708996285--46899-9999997099819 Q gi|255764474|r 19 GVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRN--NEMLL-GVMMRDVLRMPLK 95 (352) Q Consensus 19 Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~--~~~~~-~~~~~~~~~~~~~ 95 (352) +....-........+.|..-..+....-...........+..+.. ....-|++-.+.+ ..... ..+...+....++ T Consensus 16 ag~KAk~Dv~~i~~~~G~~~l~~~~~~~~~~~~~~~~~~~~~i~~-~l~~gDivi~QyP~~~~~~~~~~l~~~lk~~~~K 94 (337) T PRK09814 16 AGLKAKNDVTDIAKQLGFKELGIYFYNDKSDSLAERSQRIDGILA-SLNRGDIVVFQYPTYNGFEFDLLFIDKLKKKQIK 94 (337) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHH-CCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCE T ss_conf 678889999999997799685665046630029999999999995-7799999999789870788999999999975987 Q ss_pred EEE--C-CHHHHHH--HH--H--HHHHHHHCCEEEECCHHHHHHCCCC---E-EEECCCC-CHHHHCCCCCHHHHHHHCC Q ss_conf 997--5-7456201--13--4--7989962668998898999743688---3-9974882-7778237843268788728 Q gi|255764474|r 96 LVF--T-SPSQRNH--SR--W--TRYLISRMDEVITTSQKSARFIERP---S-TVIMHGV-DTERFRPTSNKQEARRHLK 161 (352) Q Consensus 96 ~~~--~-~~~~~~~--~~--~--~~~l~~~~d~ii~~s~~~~~~~~~~---~-~vi~~gi-d~~~~~~~~~~~~~~~~~~ 161 (352) +++ | -..-|.. .. . .--+++.+|.+|+.|+.+.+++... + .++.-++ |- ..+.+.. T Consensus 95 ~i~~IHDie~LR~~~~~~~~~~~ei~~ln~aD~iIvhn~~M~~~L~~~G~~~~kiv~l~ifDY--l~~~~~~-------- 164 (337) T PRK09814 95 IIILIHDIEPLRFGKSNYYLMKEEIDMLNLSDVVIVHNKKMTDWLVEEGVTVDKIIDQEIFDY--LVDIDLE-------- 164 (337) T ss_pred EEEEECCCHHHHCCCCCHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCCCEEEECCCCC--CCCCCCC-------- T ss_conf 999977658986567770367999999852899997999999999977998686478578775--8875556-------- Q ss_pred CCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCC Q ss_conf 898872799951226445123666545653004774058863133321000146677644311233221222222--331 Q gi|255764474|r 162 ISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSI 239 (352) Q Consensus 162 ~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~ 239 (352) -+.....|.|+|.++ |-. .+. +..+++.+.+.|.+.... ...+.|.+.|.. +++ T Consensus 165 ~~~~~k~I~fAGNL~--Ks~--fL~------~~~~~~~l~lYG~~~~~~--------------~~~~nv~Y~G~~~pdeL 220 (337) T PRK09814 165 TPSFQKKIIFAGNLS--KSP--FLK------KWSENIKLNVFGPGPSSL--------------ESSANVIYKGSFDPEEL 220 (337) T ss_pred CCCCCCEEEEECCHH--HHH--HHH------HCCCCCCEEEECCCCCCC--------------CCCCCEEEECCCCHHHH T ss_conf 656774399816724--346--788------476688779978885455--------------66566379476681888 Q ss_pred CCCCHHHHHHCC-C-CCCC------------CCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHH Q ss_conf 000000000001-2-3332------------2222210000010011102788410100234884249965999899999 Q gi|255764474|r 240 EDWYRALNIFVA-P-PLYE------------GFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEK 305 (352) Q Consensus 240 ~~~~~~adi~i~-p-S~~E------------g~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~ 305 (352) ...+...+..+. - ...| ..|-++--++|||+|||+-+-.++.++|.. ...|+.|+ +.+++.+ T Consensus 221 ~~~l~~g~FGLVWDgds~~tc~G~~g~Ylk~NnPHK~SLYLaaGiPVIVW~~aAla~fV~~--n~lGi~V~--sL~ei~~ 296 (337) T PRK09814 221 PLELSKGGFGLVWDGDSSETNDGEYGEYYKYNNPHKLSLYLAAGLPVIVWSKAAIADFIVE--NNLGFVVD--SLEELPE 296 (337) T ss_pred HHHHHCCCEEEEECCCCCCCCCCCCCCHHEEECCHHHHHHHHCCCCEEECCCHHHHHHHHH--CCCEEEEC--CHHHHHH T ss_conf 9987149856897698653467862110013150578899874998898273156789997--69769987--8999999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 999998698999999999999999 Q gi|255764474|r 306 AVLYFMNSKKIMSDTGNRGRERAV 329 (352) Q Consensus 306 ~i~~l~~~~~~~~~~~~~a~~~~~ 329 (352) .+..+ ..++..+|.+|++.... T Consensus 297 ~l~~i--t~~eY~~m~~Nv~~i~~ 318 (337) T PRK09814 297 KINNM--TEEEYNEMVENVKKIAQ 318 (337) T ss_pred HHHHC--CHHHHHHHHHHHHHHHH T ss_conf 99858--99999999999999999 No 84 >cd04299 GT1_Glycogen_Phosphorylase_like This family is most closely related to the oligosaccharide phosphorylase domain family and other unidentified sequences. Oligosaccharide phosphorylase catalyzes the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The members of this family are found in bacteria and Archaea. Probab=98.49 E-value=1.6e-06 Score=63.04 Aligned_cols=183 Identities=13% Similarity=0.139 Sum_probs=131.3 Q ss_pred CCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCC--CC--CEEEEEECCCCCCCCCHHH---HHHHHHHHCCCCCCCCCC Q ss_conf 8898872799951226445123666545653004--77--4058863133321000146---677644311233221222 Q gi|255764474|r 161 KISEDAKLIGCFGRIRKLKGTDLFVDCMINILPH--HP--GWTAVVVGKTTLKHYLFKK---NLQRRIYANGLKKRILFI 233 (352) Q Consensus 161 ~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~--~~--~~~l~i~G~g~~~~~~~~~---~l~~~i~~~~l~~~V~~~ 233 (352) .+.++..+|+|.=|+...|..++++.-...+.+- +| .++++++|..-+.|..-.+ ++-+..+...+.++|.|+ T Consensus 473 ~ldp~~LTIGFARRfAtYKRa~Llf~D~eRL~~il~~~~rPVQiIfAGKAHP~D~~Gk~lI~~i~~~s~~~~~~~rivFL 552 (778) T cd04299 473 VLDPNVLTIGFARRFATYKRATLLLRDPERLKRLLNDPERPVQFIFAGKAHPADEPGKELIQEIVEFSRRPEFRGRIVFL 552 (778) T ss_pred CCCCCCCEEEEHHHCCCCCCHHHHCCCHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 26865213411220566676022208999999986399976599993788998766799999999986380557758996 Q ss_pred CCCC-CCC-CCCHHHHHHCCCCC--CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCC---------- Q ss_conf 2223-310-00000000001233--322222210000010011102788410100234884249965999---------- Q gi|255764474|r 234 DEQS-SIE-DWYRALNIFVAPPL--YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRN---------- 299 (352) Q Consensus 234 g~~~-~~~-~~~~~adi~i~pS~--~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d---------- 299 (352) ...+ .+. .+.+.+|+-+...+ .|..|.+-.-|...|++-++.--|-..|- .+|.|||.+.+.+ T Consensus 553 edYDm~lAr~Lv~GvDVWLNnP~rPlEASGTSGMKAa~NGvlNlSvlDGWW~Eg---y~g~nGWaIg~~~~~~d~~~qD~ 629 (778) T cd04299 553 EDYDMALARHLVQGVDVWLNTPRRPLEASGTSGMKAALNGGLNLSVLDGWWDEG---YDGENGWAIGDGDEYEDDEYQDA 629 (778) T ss_pred CCCCHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCHHHCCCCCCCCCCCEEEEC---CCCCCCEECCCCCCCCCHHHCCH T ss_conf 798869998750664351379999743676320313334873333047767503---15988624798877788455039 Q ss_pred --HHHHHHHHHHH----H-C-C----HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf --89999999999----8-6-9----899999999999999982998999999999999 Q gi|255764474|r 300 --LHALEKAVLYF----M-N-S----KKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 300 --~~~la~~i~~l----~-~-~----~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) .++|-+.|+.- . + + |..+.+|.+++.+.+...|+..+|+++|.+-|. T Consensus 630 ~da~~LY~lLE~eViP~yY~r~~~g~p~~Wv~~mk~si~~~~p~F~t~RMv~eY~~~~Y 688 (778) T cd04299 630 EEAEALYDLLENEVIPLFYDRDEGGYPPGWVAMMKHSMATLGPRFSAERMVREYVERFY 688 (778) T ss_pred HHHHHHHHHHHHHHCCHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 99999999988511113112799999889999999999973675574468999999982 No 85 >TIGR02094 more_P_ylases alpha-glucan phosphorylases; InterPro: IPR011834 This family consists of known phosphorylases, and homologues believed to share the function of using inorganic phosphate to cleave an alpha 1,4 linkage between the terminal glucose residue and the rest of the polymer (maltodextrin, glycogen, etc.). The name of the glucose storage polymer substrate, and therefore the name of this enzyme, depends on the chain lengths and branching patterns. A number of the members of this family have been shown to operate on small maltodextrins, as may be obtained by utilization of exogenous sources. This family represents a distinct clade from the related families: Glycogen/starch/alpha-glucan phosphorylase and Glycosyl transferase, family 35.; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=98.30 E-value=3.6e-06 Score=60.68 Aligned_cols=183 Identities=15% Similarity=0.193 Sum_probs=136.0 Q ss_pred CCC---CCCCEEEEEEEECHHHHHHHHHH---HHHHHHCCCC--CEEEEEECCCCCCC---CCHHHHHHHHHHHCCCCCC Q ss_conf 889---88727999512264451236665---4565300477--40588631333210---0014667764431123322 Q gi|255764474|r 161 KIS---EDAKLIGCFGRIRKLKGTDLFVD---CMINILPHHP--GWTAVVVGKTTLKH---YLFKKNLQRRIYANGLKKR 229 (352) Q Consensus 161 ~~~---~~~~~i~~~G~~~~~Kg~~~li~---a~~~l~~~~~--~~~l~i~G~g~~~~---~~~~~~l~~~i~~~~l~~~ 229 (352) -++ ++..+|||+=|+.++|+.+++++ .+.++....+ .++++|+|.+-+.| +.+.+++-+.+++-++.++ T Consensus 394 ~lD~~~p~~LtIGFARRfAtYKRa~Llf~D~eRL~~il~~n~~RPVQ~vFAGKAHP~D~~GK~~Iq~i~~~s~~Pef~g~ 473 (618) T TIGR02094 394 LLDAKSPDVLTIGFARRFATYKRATLLFRDLERLKRILNNNPERPVQIVFAGKAHPADEAGKELIQRIVEFSKRPEFRGK 473 (618) T ss_pred HCCCCCCCCCEECHHHCCCCCCCHHHHCCCHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCE T ss_conf 14645788525221222760026132207778999983568888568897155679975458999999998627785762 Q ss_pred CCCCCCCC-CCC-CCCHHHHHHCC-CCC-CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCE-EEEECCC------ Q ss_conf 12222223-310-00000000001-233-3222222100000100111027884101002348842-4996599------ Q gi|255764474|r 230 ILFIDEQS-SIE-DWYRALNIFVA-PPL-YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKA-GVIVPPR------ 298 (352) Q Consensus 230 V~~~g~~~-~~~-~~~~~adi~i~-pS~-~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~-G~~~~~~------ 298 (352) |.|+...+ .+. .+.+.+||=+- |-+ .|..|++-.=|...|++=++.=-|-..|-- ++.+ ||.+.+. T Consensus 474 ivFlEnYD~~~Ar~LV~GVDVWLNNP~RPlEASGTSGMKA~mNG~LNLS~lDGWW~Egy---~~~~nGW~ig~~~e~~~~ 550 (618) T TIGR02094 474 IVFLENYDINLARYLVSGVDVWLNNPRRPLEASGTSGMKAAMNGVLNLSILDGWWDEGY---NGDNNGWAIGDGEEVYPD 550 (618) T ss_pred EEEEECCCHHHHHHHHCCCEEECCCCCCCHHHCCCCHHHHHHCCCCCCCEECCCCCCCC---CCCCCCCCCCCCCCCCCC T ss_conf 78850863788877515550421767970532586036775548753513337412267---888687437872346787 Q ss_pred -------CHHHHHHHHHHH-----HC-CHHH-----HHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf -------989999999999-----86-9899-----999999999999982998999999999999 Q gi|255764474|r 299 -------NLHALEKAVLYF-----MN-SKKI-----MSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 299 -------d~~~la~~i~~l-----~~-~~~~-----~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) |.++|-+-|+.= .+ +.+. +.+|.+++.+.+.-.||..+|++.|..-|. T Consensus 551 ~~~qD~~DA~~LY~lLEneviPlYY~h~~~~G~p~~W~~~mk~SI~~~~P~F~~~RMv~eY~~~~Y 616 (618) T TIGR02094 551 EEEQDRLDAEALYDLLENEVIPLYYDHRDEKGIPARWVKMMKESIKTIAPRFSTDRMVREYVQKFY 616 (618) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC T ss_conf 214559999999788674105655543257888888999999999871311384105899988626 No 86 >PRK01021 lpxB lipid-A-disaccharide synthase; Reviewed Probab=98.29 E-value=0.0002 Score=48.92 Aligned_cols=291 Identities=11% Similarity=0.055 Sum_probs=143.2 Q ss_pred HHHHHHHHHHHHCCEEEEEECCCCC-------------CCCCCCHHH-----------HHHHHHHHHHCCCEEEEECCCH Q ss_conf 8889999999719859998058866-------------562347899-----------8998875432597089962854 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCLPK-------------NIPSIGISS-----------LLTCWKKPIGQNSRIWHARRNN 78 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~-----------~~~~~~~~~~~~~~ivh~~~~~ 78 (352) .-.++.+.+++...++...|-+-+. .+..+++.+ ...+.+.....++|++-.=+.+ T Consensus 241 lGa~Li~~Lk~~~P~i~F~GVGGp~M~~eGl~slf~me~lsVMG~~EVL~~lp~L~~~~~~l~~~i~~~~PD~~I~ID~P 320 (607) T PRK01021 241 LGGNLLKEIKALYPDIRCFGVGGPQMRQEGLEPLFNMEEFQVSGFWEVLLSLFKLYYRYRKLYKTILKENPETVICIDFP 320 (607) T ss_pred HHHHHHHHHHHHCCCCEEEECCHHHHHHCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 79999999985498988996377999876886124467710114899998899999999999999986199999995899 Q ss_pred H--HHHHHHHHHHHCCCEEEEECCHHH--HHHHHHHHHHHHHCCEEEECCHHHHHHCCC---CEEEECCCCCHHHHCCCC Q ss_conf 6--899999999709981999757456--201134798996266899889899974368---839974882777823784 Q gi|255764474|r 79 E--MLLGVMMRDVLRMPLKLVFTSPSQ--RNHSRWTRYLISRMDEVITTSQKSARFIER---PSTVIMHGVDTERFRPTS 151 (352) Q Consensus 79 ~--~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~---~~~vi~~gid~~~~~~~~ 151 (352) + ...+..++. .+.+.+++++...+ -....-.+.+-+.+|++++.=+...++..+ ++..|.|.+- +...... T Consensus 321 dFNlrlak~lkk-~gi~ik~vhYVsPsVWAWr~~R~k~i~~~vD~~l~lfPFE~~~~~~~g~~~~yVGHPl~-e~i~~~~ 398 (607) T PRK01021 321 DFHFLLIKKLRK-CGYKGKIIHYVCPSIWAWRPKRKTILEKYLDLLLLILPFEQGLFKNSPLRTVYLGHPLV-ETISNFQ 398 (607) T ss_pred CCCHHHHHHHHH-CCCCCCEEEEECCCEEEECCCHHHHHHHHHHHHEECCCCCHHHHHCCCCCCEEECCCCH-HHCCCCC T ss_conf 878899999997-28999868997883688662179999998867305267788999507998379789601-2022347 Q ss_pred CHHHHHHHCCCCCCCCEE-EEEE-EECHHHH-HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 326878872889887279-9951-2264451-236665456530047740588631333210001466776443112332 Q gi|255764474|r 152 NKQEARRHLKISEDAKLI-GCFG-RIRKLKG-TDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK 228 (352) Q Consensus 152 ~~~~~~~~~~~~~~~~~i-~~~G-~~~~~Kg-~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~ 228 (352) +..+.+.+... .+++++ +.-| |-++-+. ...+++++... .......+++....+.. .+.+.+.++..+... T Consensus 399 ~~~~~r~~l~~-~~k~IIALLPGSR~SEI~RlLPI~~~ai~~~-~l~~~~~~lV~~a~p~~----~~~i~e~l~~~~~~~ 472 (607) T PRK01021 399 PCASWKEQLLP-SDKPIVAAFPGSRRGDILRNLTIQVQAFLAS-SLAQTHQLLVSSANPKY----DHLILDVLQQEGCLH 472 (607) T ss_pred CCHHHHHHCCC-CCCCEEEEECCCCHHHHHHHHHHHHHHHHHH-HCCCCCEEEEECCCHHH----HHHHHHHHHHCCCCC T ss_conf 74569986177-8898899908997899998749999999987-24467649995687668----899999998648988 Q ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCC------------------CCHHCCCCCC Q ss_conf 21222222331000000000001233322222210000010011102-7884------------------1010023488 Q gi|255764474|r 229 RILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGV------------------FSELLDPENA 289 (352) Q Consensus 229 ~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~------------------~~e~i~~~~g 289 (352) .....+ +...+.|++||+.+..| |.+.+|++.+|+|.|++ +... ++.++..+.- T Consensus 473 ~~II~~--~~kyeam~aSDaALaAS-----GTATLE~ALagvPmVVaYKlnpLTyfIAK~LvKI~lp~vsLPNILagr~V 545 (607) T PRK01021 473 SKIVPS--QFRYELMRECDCALAKC-----GTIVLETALNQTPTIVTCQLRPFDTFLAKYIFKILLPAYSLPNIILGSVI 545 (607) T ss_pred CEECCC--HHHHHHHHHCCEEEECC-----CHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCHHHCCCCC T ss_conf 457173--25899998588998887-----78999999838898999967827999999999725751211301169986 Q ss_pred CEEEE--ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 42499--65999899999999998698999999999999999 Q gi|255764474|r 290 KAGVI--VPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAV 329 (352) Q Consensus 290 ~~G~~--~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~ 329 (352) .=-++ -...+||.+|.++. ++.||+.+++--..+++... T Consensus 546 VPElI~gQ~D~~PE~iAaAl~-lL~~p~~~ekq~~~c~~~~~ 586 (607) T PRK01021 546 FPEFIGGKKDFHPEEVAAALD-ILKTSQSKEKQKEACRDLYK 586 (607) T ss_pred CCCCCCCCCCCCHHHHHHHHH-HHHCHHHHHHHHHHHHHHHH T ss_conf 666367766589899999999-87191567999999999999 No 87 >pfam04101 Glyco_tran_28_C Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contains the UDP-GlcNAc binding site. Probab=98.26 E-value=1.1e-05 Score=57.40 Aligned_cols=146 Identities=11% Similarity=0.067 Sum_probs=91.7 Q ss_pred EEEEEEECHHHHHHH-HHHHHHHHHCCCCCEEE-EEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 999512264451236-66545653004774058-8631333210001466776443112332212222223310000000 Q gi|255764474|r 169 IGCFGRIRKLKGTDL-FVDCMINILPHHPGWTA-VVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRAL 246 (352) Q Consensus 169 i~~~G~~~~~Kg~~~-li~a~~~l~~~~~~~~l-~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~a 246 (352) |+.+|.-...+.+.. +.+++..+.....++.+ +.+|.... +.++...... ..++...++.+++..+|+.| T Consensus 2 iLV~GGSqGa~~lN~~v~~~~~~~~~~~~~~~vihq~G~~~~------~~~~~~~~~~--~~~~~~~~f~~~m~~~~~~a 73 (167) T pfam04101 2 IFVTGGSQGAQALNRLVLEVDPLLELKGIEYQVLHQTGKSDY------EPVNCKYSKF--GINVEVFPFIDNMAEYIKAA 73 (167) T ss_pred EEEEECHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHH------HHHHHHHHHC--CCCEEEEEHHHHHHHHHHHC T ss_conf 899954488999999999999998753998299998597358------9999998605--99889971255599999966 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCC--------HHCCCCCCCEEEEECC--CCHHHHHHHHHHHHCCHHH Q ss_conf 00001233322222210000010011102788410--------1002348842499659--9989999999999869899 Q gi|255764474|r 247 NIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS--------ELLDPENAKAGVIVPP--RNLHALEKAVLYFMNSKKI 316 (352) Q Consensus 247 di~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~--------e~i~~~~g~~G~~~~~--~d~~~la~~i~~l~~~~~~ 316 (352) |+.++-+ -+.++.|.++.|+|.|.-+.+... ..+. ....|++++. .+++.+.+.|.+++.|++. T Consensus 74 dlvIsRa----Ga~Ti~E~~~~g~P~IliP~p~~~~~hQ~~NA~~l~--~~gaa~~i~e~~~~~~~L~~~i~~l~~~~~~ 147 (167) T pfam04101 74 DLVISRA----GAGTIAELLALGKPAILVPRPKAAGEHQDNNALELV--KAGAALVLLQKELTPEKLVEALLKLLLKPLR 147 (167) T ss_pred CHHHHCC----CCHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHH--HCCCEEEEECCCCCHHHHHHHHHHHHCCHHH T ss_conf 0688657----622799999948998997076556563999999999--8799899642679999999999999869999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|255764474|r 317 MSDTGNRGRERA 328 (352) Q Consensus 317 ~~~~~~~a~~~~ 328 (352) +.+|++++++.. T Consensus 148 l~~m~~~a~~~~ 159 (167) T pfam04101 148 LYEMNKAAKGSR 159 (167) T ss_pred HHHHHHHHHHCC T ss_conf 999999998448 No 88 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=98.21 E-value=0.00043 Score=46.70 Aligned_cols=268 Identities=12% Similarity=0.111 Sum_probs=148.5 Q ss_pred HHHHHHHHHHCCCEEEEECCCH-HHHHHHHHHHHHCCCEEEEECCHHHHHHHH-----HHHHHHHHCCE-EEECCHHHHH Q ss_conf 8998875432597089962854-689999999970998199975745620113-----47989962668-9988989997 Q gi|255764474|r 57 LLTCWKKPIGQNSRIWHARRNN-EMLLGVMMRDVLRMPLKLVFTSPSQRNHSR-----WTRYLISRMDE-VITTSQKSAR 129 (352) Q Consensus 57 ~~~~~~~~~~~~~~ivh~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~l~~~~d~-ii~~s~~~~~ 129 (352) +..+.+.....+||++-.|.-+ ..+.+.+.+...++|+ .+.-.+.|.... ..|.+...... -++.++..++ T Consensus 81 i~~~~~vl~~~kPD~VlVhGDT~t~lA~alaa~~~~IpV--~HvEAGlRt~~~~~PEE~NR~l~~~~S~~hfapte~ar~ 158 (383) T COG0381 81 IEGLSKVLEEEKPDLVLVHGDTNTTLAGALAAFYLKIPV--GHVEAGLRTGDLYFPEEINRRLTSHLSDLHFAPTEIARK 158 (383) T ss_pred HHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCE--EEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 999999998629998999178536889999999868936--887425444787783798787887765230377199999 Q ss_pred HCC------CCEEEECCCC-CHHHHCC--CCCHHHHHHH-CCCCCCCCEEEEEEEECHH-HHHHHHHHHHHHHHCCCCCE Q ss_conf 436------8839974882-7778237--8432687887-2889887279995122644-51236665456530047740 Q gi|255764474|r 130 FIE------RPSTVIMHGV-DTERFRP--TSNKQEARRH-LKISEDAKLIGCFGRIRKL-KGTDLFVDCMINILPHHPGW 198 (352) Q Consensus 130 ~~~------~~~~vi~~gi-d~~~~~~--~~~~~~~~~~-~~~~~~~~~i~~~G~~~~~-Kg~~~li~a~~~l~~~~~~~ 198 (352) ++. +.+.++.|.+ |.-.... .......... +..+..+.+++..=|-.-. ++...+.+++.++..++++. T Consensus 159 nLl~EG~~~~~IfvtGnt~iDal~~~~~~~~~~~~~~~~~~~~~~~~~iLvT~HRreN~~~~~~~i~~al~~i~~~~~~~ 238 (383) T COG0381 159 NLLREGVPEKRIFVTGNTVIDALLNTRDRVLEDSKILAKGLDDKDKKYILVTAHRRENVGEPLEEICEALREIAEEYPDV 238 (383) T ss_pred HHHHCCCCCCCEEEECCHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEECCHHHCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 99976999551688597399999987764100046677663245673899970555403642999999999999867895 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC- Q ss_conf 588631333210001466776443112332212222223--31000000000001233322222210000010011102- Q gi|255764474|r 199 TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS--SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS- 275 (352) Q Consensus 199 ~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~--~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s- 275 (352) .++.--. .. ...+++. ...++-.++|++....+ +...++..|-+.+.-| |-..=||-..|+||++- T Consensus 239 ~viyp~H--~~--~~v~e~~--~~~L~~~~~v~li~pl~~~~f~~L~~~a~~iltDS-----GgiqEEAp~lg~Pvl~lR 307 (383) T COG0381 239 IVIYPVH--PR--PRVRELV--LKRLKNVERVKLIDPLGYLDFHNLMKNAFLILTDS-----GGIQEEAPSLGKPVLVLR 307 (383) T ss_pred EEEEECC--CC--HHHHHHH--HHHHCCCCCEEEECCCCHHHHHHHHHHCEEEEECC-----CCHHHHHHHCCCCEEEEC T ss_conf 6997479--97--6666889--99838987679868836698999997450999548-----713544776199277613 Q ss_pred CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 78841010023488424996599989999999999869899999999999999982998999999999999 Q gi|255764474|r 276 NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYD 346 (352) Q Consensus 276 ~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~ 346 (352) +...-||.+. .| +-.++.. |.+.+.+++..++.+++.+++|+... ..|.-.+..+++.++.+ T Consensus 308 ~~TERPE~v~--ag-t~~lvg~-~~~~i~~~~~~ll~~~~~~~~m~~~~-----npYgdg~as~rIv~~l~ 369 (383) T COG0381 308 DTTERPEGVE--AG-TNILVGT-DEENILDAATELLEDEEFYERMSNAK-----NPYGDGNASERIVEILL 369 (383) T ss_pred CCCCCCCCEE--CC-CEEEECC-CHHHHHHHHHHHHHCHHHHHHHHCCC-----CCCCCCCHHHHHHHHHH T ss_conf 6777841000--37-0487176-58999999999862958899874255-----88867505799999999 No 89 >COG4641 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.18 E-value=3.6e-05 Score=53.92 Aligned_cols=303 Identities=11% Similarity=0.059 Sum_probs=165.1 Q ss_pred HHHHHHHHHHHCCEEEEEECCCCCCCCCC--------------CHHHHHHHHHHHHHCCCEEEEECCCH----HHHHHHH Q ss_conf 88999999971985999805886656234--------------78998998875432597089962854----6899999 Q gi|255764474|r 24 VFGLCPIQRKLGQRLVVFGYCLPKNIPSI--------------GISSLLTCWKKPIGQNSRIWHARRNN----EMLLGVM 85 (352) Q Consensus 24 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~ivh~~~~~----~~~~~~~ 85 (352) -..+.+.+...|+.+.-+-.+.+...... +... ..........+++++-.-... ......+ T Consensus 20 ~~~~~~~l~~~g~kvlflE~~~~~~~k~rd~~~~~~~~~~~~~~~~e-~~~~~~i~~fk~d~iv~~~~~~~~~~~~~~~~ 98 (373) T COG4641 20 YRGLLRALKMDGMKVLFLESGDFWDYKNRDIDAEDGCTEAFYKDQPE-LESLLYIREFKPDIIVNMSGDDQPDEESTIDL 98 (373) T ss_pred HHHHHHHHHHCCCEEEEEECCCHHHHHCCCCCCCCCHHHEEECCCHH-HHHHHHHHHCCCCEEEEECCCCCCCCEEHHHH T ss_conf 99999999856554899832627754064456765600100057478-99999898628747999515666441001799 Q ss_pred HHHHHCCCE--EEEECCHHHH--HHHHHHHHHH-HH----CCEEEECCHHH--HHHC----CCCEEEECCCCCHHHHCCC Q ss_conf 999709981--9997574562--0113479899-62----66899889899--9743----6883997488277782378 Q gi|255764474|r 86 MRDVLRMPL--KLVFTSPSQR--NHSRWTRYLI-SR----MDEVITTSQKS--ARFI----ERPSTVIMHGVDTERFRPT 150 (352) Q Consensus 86 ~~~~~~~~~--~~~~~~~~~~--~~~~~~~~l~-~~----~d~ii~~s~~~--~~~~----~~~~~vi~~gid~~~~~~~ 150 (352) ..++.+..+ .+.++..... ....+....+ ++ .|-+++.++.. ..|. ...+.-++.++|.+.|.|. T Consensus 99 ~a~l~~~~l~~~~w~te~p~~~~~~~~~~~~~~~~~~l~~fd~v~~~g~~l~~~~yyq~~~~~~~~~~~~a~d~~~~~~i 178 (373) T COG4641 99 WAWLKRKCLPVIVWYTEDPYDTDIFSQVAEEQLARRPLFIFDNVLSFGGGLVANKYYQEGGARNCYYLPWAVDDSLFHPI 178 (373) T ss_pred HHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHHHHCCCCEECCCCCCCCHHCCCC T ss_conf 99852678626999714600246665301777610555133423312640788999876235511305756782320669 Q ss_pred CCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCC-CEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 4326878872889887279995122644512366654565300477-405886313332100014667764431123322 Q gi|255764474|r 151 SNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHP-GWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKR 229 (352) Q Consensus 151 ~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~-~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~ 229 (352) +...+.. --+.++|...+. ....+-+.+.+-..+.. +-.+...|..-.. .+... ..+++ T Consensus 179 ~~da~~~---------~dL~~ign~~pD-r~e~~ke~~~~ps~kl~v~rr~~~~g~~y~~------~~~~~----~~~~~ 238 (373) T COG4641 179 PPDASYD---------VDLNLIGNPYPD-RVEEIKEFFVEPSFKLMVDRRFYVLGPRYPD------DIWGR----TWEPN 238 (373) T ss_pred CCCCCCE---------EEEEEECCCCCC-HHHHHHHHHHCCCHHHHCCCEEEECCCCCCH------HHHCC----CCCCH T ss_conf 8541301---------336773588855-7899999861520111006534550776523------44211----33656 Q ss_pred CCCCC---CCCCCCCCCHHHHHHCCCCCC---CC---CCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCH Q ss_conf 12222---223310000000000012333---22---2222100000100111027884101002348842499659998 Q gi|255764474|r 230 ILFID---EQSSIEDWYRALNIFVAPPLY---EG---FGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNL 300 (352) Q Consensus 230 V~~~g---~~~~~~~~~~~adi~i~pS~~---Eg---~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~ 300 (352) +...| ....+...+...|+++.-++. ++ +.+-+.|+++||.|.|++...++.-++. +|.. ++-..|. T Consensus 239 ~~yIg~~~~~~~v~~~~~~~~~~~n~~r~~~~~~l~~~~~RvFeiagc~~~liT~~~~~~e~~f~--pgk~--~iv~~d~ 314 (373) T COG4641 239 VQYIGYYNPKDGVPNAFKRDDVTLNINRASIANALFSPTNRVFEIAGCGGFLITDYWKDLEKFFK--PGKD--IIVYQDS 314 (373) T ss_pred HHHHHCCCCCCHHHHCCCCCCEEEEECHHHHHHHCCCCHHHHHHHHHCCCCCCCCCHHHHHHHCC--CCHH--EEEECCH T ss_conf 66633268500000003554435641378887614785056888761587501542788987259--8602--5896378 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 999999999986989999999999999998299899999999999998860 Q gi|255764474|r 301 HALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 301 ~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~~ 351 (352) .++.+.+.+++..++.++++++.|.+.+...++-+..+.++.+.-.++..+ T Consensus 315 kdl~~~~~yll~h~~erkeiae~~ye~V~~~ht~~~r~~~~~~~i~sI~~r 365 (373) T COG4641 315 KDLKEKLKYLLNHPDERKEIAECAYERVLARHTYEERIFKLLNEIASINIR 365 (373) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 999999999844830689999866999987432788899999999999886 No 90 >COG0058 GlgP Glucan phosphorylase [Carbohydrate transport and metabolism] Probab=98.14 E-value=3.5e-05 Score=54.02 Aligned_cols=137 Identities=18% Similarity=0.202 Sum_probs=96.6 Q ss_pred CCCCCCCEEEEEEEECHHHHHHHHHHHH----HHHH-CCCCCEEEEEECCCCCCCCCHHHHHHHHHHH----CCCCCCCC Q ss_conf 8898872799951226445123666545----6530-0477405886313332100014667764431----12332212 Q gi|255764474|r 161 KISEDAKLIGCFGRIRKLKGTDLFVDCM----INIL-PHHPGWTAVVVGKTTLKHYLFKKNLQRRIYA----NGLKKRIL 231 (352) Q Consensus 161 ~~~~~~~~i~~~G~~~~~Kg~~~li~a~----~~l~-~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~----~~l~~~V~ 231 (352) .+.++..+++++-|+...|+..+.+.-. ..+. ...|.+.+++.|...+.+.. .+++.++|.. -+...+|. T Consensus 482 ~~~p~~lfd~~~kRiheYKRq~Lnl~~i~~ly~~i~~d~~prv~~iFaGKAhP~y~~-aK~iIk~I~~~a~~in~~lkVv 560 (750) T COG0058 482 EVDPNALFDGQARRIHEYKRQLLNLLDIERLYRILKEDWVPRVQIIFAGKAHPADYA-AKEIIKLINDVADVINNKLKVV 560 (750) T ss_pred CCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHH-HHHHHHHHHHHHHHHCCCCEEE T ss_conf 458885356320231454366776651999999873177886699984467876257-9999999999998632345299 Q ss_pred CCCCCC--CCCCCCHHHHHHCCCCCC--CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCC Q ss_conf 222223--310000000000012333--2222221000001001110278841010023488424996599 Q gi|255764474|r 232 FIDEQS--SIEDWYRALNIFVAPPLY--EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPR 298 (352) Q Consensus 232 ~~g~~~--~~~~~~~~adi~i~pS~~--Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~ 298 (352) |+...+ -...++.++|+-...|+. |..|.+-+-+|..|.+.|+|--|...|+.....++|||++-.. T Consensus 561 Fl~nYdvslA~~iipa~Dvweqis~a~~EASGTsnMK~alNGaltigtlDGanvEi~e~vg~~N~~~fG~~ 631 (750) T COG0058 561 FLPNYDVSLAELLIPAADVWEQIPTAGKEASGTSNMKAALNGALTLGTLDGANVEIYEHVGGENGWIFGET 631 (750) T ss_pred EECCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCEEECCCCHHHHHHHHCCCCCEEEECCC T ss_conf 96898736897640012201238998745567675457765886231355567888986498734783786 No 91 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=97.97 E-value=0.00099 Score=44.24 Aligned_cols=256 Identities=11% Similarity=0.099 Sum_probs=136.9 Q ss_pred HHHHHHHHHHCCEEEEEECCCC--------------------CCCCC---CCHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 8999999971985999805886--------------------65623---478998998875432597089962854689 Q gi|255764474|r 25 FGLCPIQRKLGQRLVVFGYCLP--------------------KNIPS---IGISSLLTCWKKPIGQNSRIWHARRNNEML 81 (352) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~---~~~~~~~~~~~~~~~~~~~ivh~~~~~~~~ 81 (352) -.+.+.+.+.||.+.+.+.... ..... ........+.+.....++|+.-++...+. T Consensus 17 k~iI~eL~k~GheV~iTaR~~~~~~~LL~~y~i~~~~iG~~g~s~~~Kl~~~~~R~~~L~~~~~~~~PDv~is~~S~~a- 95 (335) T pfam04007 17 KPIISELEKEGYEVLLTCRKFGELPELLRSLGFQVKSIGKHGATLIKKLLSSAERVYLLTKLIPEKKPDVAIMKNSMEL- 95 (335) T ss_pred HHHHHHHHHCCCEEEEEEECCCHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCHHH- T ss_conf 9999999868988999996135199999976997699758888889999999999999999988629978994488019- Q ss_pred HHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCH----HHHHHCCCCEEEE-CCCCCHHH-HCCCCCHHH Q ss_conf 99999997099819997574562011347989962668998898----9997436883997-48827778-237843268 Q gi|255764474|r 82 LGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQ----KSARFIERPSTVI-MHGVDTER-FRPTSNKQE 155 (352) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~----~~~~~~~~~~~vi-~~gid~~~-~~~~~~~~~ 155 (352) ...+..++.|............. .+-.+..+|.+++..- ...++-.+ ..++ ++|++..- ..+...... T Consensus 96 --~~va~~LgipsI~f~Dteha~~~---~~Lt~Pf~~~i~~P~~~~~~~~~~~G~~-~~i~~y~g~~E~a~l~~F~Pd~~ 169 (335) T pfam04007 96 --PRVAFGLRIPSIIVLDNEHALAA---NKLTFPLADYILVPEIIDDEFLRFFGAD-NRIRTYPGIKEIANISDYVPDPE 169 (335) T ss_pred --HHHHHHCCCCEEEEECCHHHCCC---CEEEEECCCEEECCCCCCHHHHHHHCCC-CCEEEECCCCEEEECCCCCCCHH T ss_conf --99998829987999477554123---3023123868881244677899860877-85676668441432166689865 Q ss_pred HHHHCCCCCCCCEEEEEEEECHH-----HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 78872889887279995122644-----5123666545653004774058863133321000146677644311233221 Q gi|255764474|r 156 ARRHLKISEDAKLIGCFGRIRKL-----KGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRI 230 (352) Q Consensus 156 ~~~~~~~~~~~~~i~~~G~~~~~-----Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V 230 (352) ....+++..+. +|+. |..++ +|...++..+.+.+++..+ ..++....... +...+.. ++ T Consensus 170 vl~~lgl~~~~-yIvv--R~~~~~A~y~~g~~~i~~~ii~~l~~~~~-~iv~~pr~~~q--------~~~~~~~----~v 233 (335) T pfam04007 170 ILKKLGLEFEE-YIVM--RPEPLASSYVNGHESILPEIIEMLTKEGV-NIIYFPRNKEQ--------REIFRGF----DV 233 (335) T ss_pred HHHHCCCCCCC-EEEE--EECCCCCEECCCCCCHHHHHHHHHHHCCC-CEEEECCCCHH--------HHHHCCC----CE T ss_conf 78764998798-8999--61645560011442159999999987598-19997587036--------6775047----70 Q ss_pred CCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC---CCHHCCCCCCCEEEEECCCCHHHHHHHH Q ss_conf 2222223310000000000012333222222100000100111027884---1010023488424996599989999999 Q gi|255764474|r 231 LFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV---FSELLDPENAKAGVIVPPRNLHALEKAV 307 (352) Q Consensus 231 ~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~---~~e~i~~~~g~~G~~~~~~d~~~la~~i 307 (352) ..+...-+..+++.-||+++- +-|....||...|+|+|++.-|- ..+.+ -+.|.+.-..|++++.+.+ T Consensus 234 ~i~~~~vd~~~Lly~adl~Ig-----~GgTMa~EAAlLGtPaIs~~p~~~~~vd~~l----~~~gl~~~~~d~~~i~~~v 304 (335) T pfam04007 234 IIPKKPVDTLSLLYYSDLVIG-----AGGTMNREAALLGTPAVSCYPGKLLAVDKYL----IEKGEMYHSTDPREIVNYV 304 (335) T ss_pred ECCCCCCCHHHHHHHCCEEEC-----CCHHHHHHHHHHCCCEEEECCCCCCHHHHHH----HHCCCEEEECCHHHHHHHH T ss_conf 367888777888865468972-----7568999999828987984388521367999----8679879618989999999 Q ss_pred HHHHC Q ss_conf 99986 Q gi|255764474|r 308 LYFMN 312 (352) Q Consensus 308 ~~l~~ 312 (352) .+.+. T Consensus 305 ~~~~~ 309 (335) T pfam04007 305 ISNLK 309 (335) T ss_pred HHHHH T ss_conf 99860 No 92 >TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases. Probab=97.91 E-value=0.0013 Score=43.49 Aligned_cols=97 Identities=12% Similarity=0.185 Sum_probs=65.3 Q ss_pred CCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEE-EEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 872799951226445123666545653004774058-8631333210001466776443112332212222223310000 Q gi|255764474|r 165 DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTA-VVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWY 243 (352) Q Consensus 165 ~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l-~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~ 243 (352) ...++++.|.-++..=...+++++.+. .++..+ +++|.+... .+++++.++.. .++...-+.+++..+| T Consensus 171 ~~~Ili~~GGsD~~~lt~~il~~l~~~---~~~~~i~vvig~~~~~----~~~i~~~~~~~---~~~~~~~~~~~m~~~m 240 (280) T TIGR03590 171 LRRVLVSFGGADPDNLTLKLLSALAES---QINISITLVTGSSNPN----LDELKKFAKEY---PNIILFIDVENMAELM 240 (280) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHH---CCCCCEEEEECCCCCC----HHHHHHHHHHC---CCEEEECCHHHHHHHH T ss_conf 532899977877000899999999851---6685679998679876----69999999728---9969965988999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 000000012333222222100000100111027 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~ 276 (352) +.||+.+. +.|.++.|++++|+|+|+-. T Consensus 241 ~~aDlaI~-----agG~t~~El~~~gvP~i~i~ 268 (280) T TIGR03590 241 NEADLAIG-----AAGSTSWERCCLGLPSLSIV 268 (280) T ss_pred HHCCEEEE-----CCCHHHHHHHHHCCCEEEEE T ss_conf 97799998-----59658999999499989999 No 93 >pfam02350 Epimerase_2 UDP-N-acetylglucosamine 2-epimerase. This family consists of UDP-N-acetylglucosamine 2-epimerases EC:5.1.3.14 this enzyme catalyses the production of UDP-ManNAc from UDP-GlcNAc. Note that some of the enzymes is this family are bifunctional, in this instance Pfam matches only the N-terminal half of the protein suggesting that the additional C-terminal part (when compared to mono-functional members of this family) is responsible for the UPD-N-acetylmannosamine kinase activity of these enzymes. This hypothesis is further supported by the assumption that the C-terminal part of rat bifunctional UDP-N-acetylglucosamine 2-epimerase/N-acetylmannosamine kinase is the kinase domain. Probab=97.89 E-value=0.0026 Score=41.40 Aligned_cols=246 Identities=15% Similarity=0.154 Sum_probs=129.5 Q ss_pred HHHHHHHHHHCCCEEEEECC-CHHHHHHHHHHHHHCCCEEEEECCHHHHH-------HHHHHHHHHHHC-CEEEECCHHH Q ss_conf 89988754325970899628-54689999999970998199975745620-------113479899626-6899889899 Q gi|255764474|r 57 LLTCWKKPIGQNSRIWHARR-NNEMLLGVMMRDVLRMPLKLVFTSPSQRN-------HSRWTRYLISRM-DEVITTSQKS 127 (352) Q Consensus 57 ~~~~~~~~~~~~~~ivh~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~l~~~~-d~ii~~s~~~ 127 (352) +..+.+.....+||++-.+. ..+++.+...+...+.|+..+ ....|. ..-..|..+.+. |-.+|.++.. T Consensus 56 i~~~~~~l~~~~PD~vlv~GDr~e~la~aiaa~~~~ipi~Hi--egG~RS~d~t~g~~de~~R~~isklS~~hf~~t~~~ 133 (346) T pfam02350 56 LIGLEDVLEEEKPDLVLVLGDTNETLAGALAAFYLRIPVAHV--EAGLRSFDLTEGMPEEINRHLIDKLSDLHFAPTEEA 133 (346) T ss_pred HHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEEE--ECCCCCCCCCCCCCHHHHHCCCCCCCCEEECCCHHH T ss_conf 999999999829999999689715889999999819848995--268744556799930543114665413772464999 Q ss_pred HHHC------CCCEEEECC-CCCHH-HHCCCCCHHHHHHHCCCCCC-CCEEEEEEE---ECHHHHHHHHHHHHHHHHCCC Q ss_conf 9743------688399748-82777-82378432687887288988-727999512---264451236665456530047 Q gi|255764474|r 128 ARFI------ERPSTVIMH-GVDTE-RFRPTSNKQEARRHLKISED-AKLIGCFGR---IRKLKGTDLFVDCMINILPHH 195 (352) Q Consensus 128 ~~~~------~~~~~vi~~-gid~~-~~~~~~~~~~~~~~~~~~~~-~~~i~~~G~---~~~~Kg~~~li~a~~~l~~~~ 195 (352) ++.+ +..+.++.+ ++|.- .+.+................ .++++..=| ....+....+++++..+. +. T Consensus 134 ~~~L~~~G~~~~~If~vG~~~iD~i~~~~~~~~~~~~~~~~~~~~~~~~iLvt~Hr~en~~~~~~~~~i~~~l~~l~-~~ 212 (346) T pfam02350 134 RENLLQEGEPPERIFVTGNTVIDALLLSREEIEESLEISGILADLGKRYILVTFHRRENEDDPERLEQILEALKALA-ER 212 (346) T ss_pred HHHHHHCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHH-HC T ss_conf 99999819994728997971999999999873101556664034568779999677534476448999999999998-35 Q ss_pred CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 74058863133321000146677644311233221222222--3310000000000012333222222100000100111 Q gi|255764474|r 196 PGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVV 273 (352) Q Consensus 196 ~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI 273 (352) ++..+++.....+.. .+.+.+.++.+ +++.+.... .+...++..|++++-=| |-...||-+.|+|+| T Consensus 213 ~~~~~i~~~~n~d~~---~~~i~~~l~~~---~ni~~~~~l~~~~fl~ll~~s~~vigdS-----s~~~~Ea~~l~~P~i 281 (346) T pfam02350 213 PDVPVVFPVHNNPRT---RKRINELLEEL---PNIRLIEPLGYLDFLSLLKNADLVITDS-----GGIQEEAPSLGKPVL 281 (346) T ss_pred CCCCEEEEECCCCHH---HHHHHHHHHCC---CCEEEECCCCHHHHHHHHHHCCEEEECC-----CCHHHHHHHHCCCEE T ss_conf 686099983799207---78999998347---9879965689999999998518898368-----621666665089689 Q ss_pred E-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHH Q ss_conf 0-2788410100234884249965999899999999998698999999 Q gi|255764474|r 274 A-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDT 320 (352) Q Consensus 274 ~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~ 320 (352) . .+.|.-++-+. .|.+ .+++ .|.+++.+++.++++++..+..+ T Consensus 282 niR~~geRqegr~--~g~n-vlv~-~~~~~I~~ai~~~l~~~~~~~~~ 325 (346) T pfam02350 282 NLRDTTERPEGRE--AGTN-VLVG-TDKEAILAAIEKLLDDEEEYEKM 325 (346) T ss_pred EECCCCCCHHHHH--HCEE-EEEC-CCHHHHHHHHHHHHHCHHHHHHH T ss_conf 8278888875695--3846-9978-99999999999997196777641 No 94 >cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics. Probab=97.87 E-value=9.6e-05 Score=51.07 Aligned_cols=155 Identities=14% Similarity=0.121 Sum_probs=86.1 Q ss_pred CCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 72799951226445123666545653004774058863133321000146677644311233221222222331000000 Q gi|255764474|r 166 AKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRA 245 (352) Q Consensus 166 ~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~ 245 (352) ..+++..|.+.. ...+.+.+.+...+...+.-.++.+|...... ....+++.+.++.+. ..++.. T Consensus 240 ~vVyvs~GS~~~-~~~~~~~~~~~~~l~~~~~~~i~~~~~~~~~~-------------~~~~~nv~i~~~~pq-~~iL~~ 304 (401) T cd03784 240 PPVYVGFGSMVV-RDPEALARLDVEAVATLGQRAILSLGWGGLGA-------------EDLPDNVRVVDFVPH-DWLLPR 304 (401) T ss_pred CEEEEECCCCHH-CCHHHHHHHHHHHHHHCCCEEEEEECCCCCCC-------------CCCCCCEEEECCCCH-HHHHHH T ss_conf 769997883010-28999999999999966984999967876665-------------568997899567898-999743 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC----CCHHCCCCCCCEEEEECCC--CHHHHHHHHHHHHCCHHHHHH Q ss_conf 0000012333222222100000100111027884----1010023488424996599--989999999999869899999 Q gi|255764474|r 246 LNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV----FSELLDPENAKAGVIVPPR--NLHALEKAVLYFMNSKKIMSD 319 (352) Q Consensus 246 adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~----~~e~i~~~~g~~G~~~~~~--d~~~la~~i~~l~~~~~~~~~ 319 (352) +|+|++- |--.++.||+++|+|.|+-...+ ....+. .-+.|..++.. +++.+++++.++++|+..+ + T Consensus 305 ~~~~ItH----gG~~s~~Eal~~GvP~v~~P~~~DQ~~nA~rv~--~~G~G~~l~~~~~t~e~l~~av~~lL~~~~~~-~ 377 (401) T cd03784 305 CAAVVHH----GGAGTTAAALRAGVPQLVVPFFGDQPFWAARVA--ELGAGPALDPRELTAERLAAALRRLLDPPSRR-R 377 (401) T ss_pred CCEEEEC----CCHHHHHHHHHCCCCEEECCCCCCHHHHHHHHH--HCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHH-H T ss_conf 7999966----875899999981999895377556899999999--87971277835699999999999994899999-9 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 999999999982998999999999 Q gi|255764474|r 320 TGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 320 ~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) ..+.+++.. ..=..++.++-+++ T Consensus 378 a~~~~~~~~-~~~g~~~aa~~ie~ 400 (401) T cd03784 378 AAALLRRIR-EEDGVPSAADVIER 400 (401) T ss_pred HHHHHHHHH-HCCCHHHHHHHHHC T ss_conf 999999987-55888999999843 No 95 >KOG1050 consensus Probab=97.82 E-value=0.00048 Score=46.38 Aligned_cols=185 Identities=15% Similarity=0.172 Sum_probs=120.4 Q ss_pred CEEEECCCCCHHHHCCCCCHH---HHHHHCCCC-CCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCE----EEEEECC Q ss_conf 839974882777823784326---878872889-88727999512264451236665456530047740----5886313 Q gi|255764474|r 134 PSTVIMHGVDTERFRPTSNKQ---EARRHLKIS-EDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGW----TAVVVGK 205 (352) Q Consensus 134 ~~~vi~~gid~~~~~~~~~~~---~~~~~~~~~-~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~----~l~i~G~ 205 (352) .+...|-|+|..+|....+.. .......-+ .+..+++.+-+++.-||...=+.|+.+.+.++|++ .++.+.. T Consensus 240 ~v~~~pigid~~r~v~~~~~~~~~~~~~ei~~~~~g~klilgvD~~d~~kg~~~Kl~a~e~~L~~~pe~~~kVvliqi~~ 319 (732) T KOG1050 240 SVKALPIGIDVQRFVKLLELPYVGSKGMEIKEPFKGKKLILGVDRLDSIKGIQLKLLAFEQFLEEYPEWIDKVVLIQIEN 319 (732) T ss_pred EEEECCCCCCHHHHHCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCHHHHCEEEEEEEEC T ss_conf 54201223556776300266267888987750314973673355322126832889999999986755423089999844 Q ss_pred CCCCCCCHHHHHHHHHHHC--------C-C-CCCCCCCC-C--CCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCE--- Q ss_conf 3321000146677644311--------2-3-32212222-2--2331000000000001233322222210000010--- Q gi|255764474|r 206 TTLKHYLFKKNLQRRIYAN--------G-L-KKRILFID-E--QSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASG--- 269 (352) Q Consensus 206 g~~~~~~~~~~l~~~i~~~--------~-l-~~~V~~~g-~--~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G--- 269 (352) +...+....++++..+..+ + . ...|++.- . ..++..++..+|+.+..|+.+|..++.+|+..|. T Consensus 320 ~~~~~~~~v~~~k~~v~~~v~rIn~~f~~~~~~pV~~~~~~~~~~~l~a~~~Vaev~~v~s~rdGmnl~~~e~i~~~~~~ 399 (732) T KOG1050 320 PKRTDGKEVEELKFCVSVHVRRINEKFGSASYQPVHSLLKDLPFLELLALYKVAEVCPVTSWRDGMNLVFLEYILCQENK 399 (732) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHEEECCCCCCCCHHHHHHHHHHCCC T ss_conf 77666067788777757555010011387554437876234888887433776542100001341133556888750346 Q ss_pred -EEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf -01110278841010023488424996599989999999999869899999999 Q gi|255764474|r 270 -IPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGN 322 (352) Q Consensus 270 -~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~ 322 (352) .+.|.+..-|..+... + ...+++|.|.++++.++...+..++...++.. T Consensus 400 ~~~lVlsef~G~~~tl~--d--~aivvnpw~~~~~~~~i~~al~~s~~e~~~r~ 449 (732) T KOG1050 400 KSVLVLSEFIGDDTTLE--D--AAIVVNPWDGDEFAILISKALTMSDEERELRE 449 (732) T ss_pred CCCEEEEEECCCCCCCC--C--CCEEECCCCHHHHHHHHHHHHHCCHHHHHHCC T ss_conf 78648766325665451--5--57797776517899999887616788873133 No 96 >TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing. This family of enzymes catalyzes the combined epimerization and UDP-hydrolysis of UDP-N-acetylglucosamine to N-acetylmannosamine. This is in contrast to the related enzyme WecB (TIGR00236) which retains the UDP moiety. NeuC acts in concert with NeuA and NeuB to synthesize CMP-N5-acetyl-neuraminate. Probab=97.66 E-value=0.0059 Score=39.02 Aligned_cols=241 Identities=10% Similarity=0.062 Sum_probs=125.4 Q ss_pred HHHHHHHHCCCEEEEECCC-HHHHHHHHHHHHHCCCEEEEECCHHHHH---HHHHHHHHHHH-CCEEEECCHHHHHHC-- Q ss_conf 9887543259708996285-4689999999970998199975745620---11347989962-668998898999743-- Q gi|255764474|r 59 TCWKKPIGQNSRIWHARRN-NEMLLGVMMRDVLRMPLKLVFTSPSQRN---HSRWTRYLISR-MDEVITTSQKSARFI-- 131 (352) Q Consensus 59 ~~~~~~~~~~~~ivh~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~l~~~-~d~ii~~s~~~~~~~-- 131 (352) .+-......+||++-.+.- ...+.+.+.+...+.|+..+ ....+. ..-..|.++.+ +|-.+|.++..++.+ T Consensus 84 ~~~~~l~~~kPD~VlV~GDt~stla~alaA~~~~Ipv~Hv--eaGlrs~~~~dE~~R~~i~~lS~~hf~~t~~a~~nL~~ 161 (365) T TIGR03568 84 GFSDAFERLKPDLVVVLGDRFEMLAAAIAAALLNIPIAHI--HGGEVTEGAIDESIRHAITKLSHLHFVATEEYRQRVIQ 161 (365) T ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCEEEEE--ECCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 9999985439989999489860779999999819818999--67864589886588889899877773233578899986 Q ss_pred ----CCCEEEECC-CCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEE------ECHHHHHHHHHHHHHHHHCCCCCEEE Q ss_conf ----688399748-8277782378432687887288988727999512------26445123666545653004774058 Q gi|255764474|r 132 ----ERPSTVIMH-GVDTERFRPTSNKQEARRHLKISEDAKLIGCFGR------IRKLKGTDLFVDCMINILPHHPGWTA 200 (352) Q Consensus 132 ----~~~~~vi~~-gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~------~~~~Kg~~~li~a~~~l~~~~~~~~l 200 (352) +..+.++.| ++|.-.......+......++++.+..+++..-+ -...+.+..+++++.++. .++ T Consensus 162 eG~~~~~I~~vGn~~iD~l~~~~~~~~~~~~~~~~~~~~~~~~LvT~Hp~~~~~~~~~~~l~~il~al~~~~---~~~-- 236 (365) T TIGR03568 162 MGEDPDRVFNVGSPGLDNILSLDLLSKEELEEKLGIDLDKPYALVTFHPVTLEKESAEEQIKELLKALDELN---KNY-- 236 (365) T ss_pred CCCCCCEEEEECCHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHC---CCC-- T ss_conf 247867089827718999862221378899987412136876999953532566568999999999997208---881-- Q ss_pred EEECCCCCCCCCHHHHHHHHHHHC-CCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCC Q ss_conf 863133321000146677644311-233221222222--33100000000000123332222221000001001110278 Q gi|255764474|r 201 VVVGKTTLKHYLFKKNLQRRIYAN-GLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNT 277 (352) Q Consensus 201 ~i~G~g~~~~~~~~~~l~~~i~~~-~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~ 277 (352) +++-..+. ...+.+...++++ ...+++.+.... .+...+++.|++++--| +-|+ .||-+.|+|||.. T Consensus 237 ~~i~Pn~d---~~~~~i~~~i~~~~~~~~ni~~i~pl~y~~fl~ll~~a~~vitdS---sggi--~Ea~~l~~P~i~i-- 306 (365) T TIGR03568 237 IFTYPNAD---AGSRIINEAIEEYVNEHPNFRLFKSLGQERYLSLLKNADAVIGNS---SSGI--IEAPSFGVPTINI-- 306 (365) T ss_pred EEEECCCC---HHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHCCEEEECC---CCCE--EEECCCCCCEEEE-- T ss_conf 79826986---027889999999970799889966788899999998701999858---8654--6701049867883-- Q ss_pred CCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHH Q ss_conf 8410100234884249965999899999999998698999999 Q gi|255764474|r 278 GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDT 320 (352) Q Consensus 278 ~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~ 320 (352) |.-++-. ..+.+-+.+ ..|++.+.+++..+++ ++....+ T Consensus 307 ~~Rq~~r--~~~~nvi~v-~~~~~~I~~ai~~~~~-~~~~~~~ 345 (365) T TIGR03568 307 GTRQKGR--LRADSVIDV-DPDKEEIVKAIEKLLD-PAFKKSL 345 (365) T ss_pred CCCCCCC--CCCCEEEEE-CCCHHHHHHHHHHHHC-HHHHHHH T ss_conf 7885555--247608981-7999999999999748-7898755 No 97 >COG1819 Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms] Probab=97.65 E-value=0.00074 Score=45.09 Aligned_cols=161 Identities=14% Similarity=0.156 Sum_probs=97.3 Q ss_pred CCCCC-EEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 98872-79995122644512366654565300477405886313332100014667764431123322122222233100 Q gi|255764474|r 163 SEDAK-LIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIED 241 (352) Q Consensus 163 ~~~~~-~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~ 241 (352) +.+.+ +....|..... ..+++.+.+.+... +.++++...+ .+. ..-++.+++...++.+.. . T Consensus 234 ~~d~~~vyvslGt~~~~---~~l~~~~~~a~~~l-~~~vi~~~~~-~~~-----------~~~~~p~n~~v~~~~p~~-~ 296 (406) T COG1819 234 PADRPIVYVSLGTVGNA---VELLAIVLEALADL-DVRVIVSLGG-ARD-----------TLVNVPDNVIVADYVPQL-E 296 (406) T ss_pred CCCCCEEEEECCCCCCH---HHHHHHHHHHHHCC-CCEEEEECCC-CCC-----------CCCCCCCCEEEECCCCHH-H T ss_conf 37996399955787537---88999999998549-9649997367-642-----------346888774786157589-8 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC----CCHHCCCCCCCEEEEEC--CCCHHHHHHHHHHHHCCHH Q ss_conf 00000000012333222222100000100111027884----10100234884249965--9998999999999986989 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV----FSELLDPENAKAGVIVP--PRNLHALEKAVLYFMNSKK 315 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~----~~e~i~~~~g~~G~~~~--~~d~~~la~~i~~l~~~~~ 315 (352) ++..||++++. |-..++.||+.+|+|+|+-..+. ..+.+. .-.+|.... ..+++.+++++.++++|+. T Consensus 297 ~l~~ad~vI~h----GG~gtt~eaL~~gvP~vv~P~~~DQ~~nA~rve--~~G~G~~l~~~~l~~~~l~~av~~vL~~~~ 370 (406) T COG1819 297 LLPRADAVIHH----GGAGTTSEALYAGVPLVVIPDGADQPLNAERVE--ELGAGIALPFEELTEERLRAAVNEVLADDS 370 (406) T ss_pred HHHCCCEEEEC----CCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH--HCCCCEECCCCCCCHHHHHHHHHHHHCCHH T ss_conf 87407989917----985799999973999898278730787999999--749883127565888999999999967399 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 99999999999999829989999999999999 Q gi|255764474|r 316 IMSDTGNRGRERAVKHFSIVKEASDIGKVYDR 347 (352) Q Consensus 316 ~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~ 347 (352) .++...+-.+. +.+.-..++.++.+++..++ T Consensus 371 ~~~~~~~~~~~-~~~~~g~~~~a~~le~~~~~ 401 (406) T COG1819 371 YRRAAERLAEE-FKEEDGPAKAADLLEEFARE 401 (406) T ss_pred HHHHHHHHHHH-HHHCCCHHHHHHHHHHHHHC T ss_conf 99999999999-76555379999999999832 No 98 >TIGR01426 MGT glycosyltransferase, MGT family; InterPro: IPR006326 These sequences belong to the MGT (macroside glycosyltransferase) subfamily of the UDP-glucuronosyltransferase family. Members include a number of glucosyl transferases for macrolide antibiotic inactivation, but also include transferases of glucose-related sugars for macrolide antibiotic production. ; GO: 0016758 transferase activity transferring hexosyl groups, 0016999 antibiotic metabolic process. Probab=97.65 E-value=0.001 Score=44.09 Aligned_cols=166 Identities=20% Similarity=0.303 Sum_probs=102.3 Q ss_pred CCCCCCCCEE--EEEEEECHHHHHHHHHHHHHHHHCCCCCEEEE-EECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 2889887279--99512264451236665456530047740588-63133321000146677644311233221222222 Q gi|255764474|r 160 LKISEDAKLI--GCFGRIRKLKGTDLFVDCMINILPHHPGWTAV-VVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ 236 (352) Q Consensus 160 ~~~~~~~~~i--~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~-i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~ 236 (352) +..+.+..-+ ..+|...- +....+.+.|.+.....|.|+.+ .+|.+.+- ..|. .+.++|....++ T Consensus 253 w~~~~~~~pV~liSLGTvFn-~~p~~fyr~f~~AF~~~~GW~vV~~~g~~vDp-----~~L~------~~P~Nv~VR~~V 320 (429) T TIGR01426 253 WAEPEKGRPVVLISLGTVFN-AQPSKFYRTFVEAFRDLPGWHVVLSVGKGVDP-----ADLG------ELPKNVEVRRWV 320 (429) T ss_pred CCCCCCCCCEEEEECCHHHC-CCHHHHHHHHHHHCCCCCCCEEEEEECCCCCH-----HHHC------CCCCCEEEECCC T ss_conf 78888888469997561441-24479999999860899870799972670264-----6616------798877885465 Q ss_pred CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCC----CCCHHCCCCCCCEEEEECCCC--HHHHHHHHHHH Q ss_conf 331000000000001233322222210000010011102788----410100234884249965999--89999999999 Q gi|255764474|r 237 SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG----VFSELLDPENAKAGVIVPPRN--LHALEKAVLYF 310 (352) Q Consensus 237 ~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~----~~~e~i~~~~g~~G~~~~~~d--~~~la~~i~~l 310 (352) ++. .++..||+||.- |--.+..|||++|+|+|+-+.+ .+.+-|. .=+.|..++++. ++.|.++..++ T Consensus 321 Pq~-evL~~A~lfvTH----gGmnSt~EaL~~gVP~va~P~~adQ~~~A~R~~--ELGlg~~l~~e~vTa~~LR~~v~~v 393 (429) T TIGR01426 321 PQL-EVLEKADLFVTH----GGMNSTMEALAAGVPLVAVPQGADQPMTARRIA--ELGLGRVLPKEEVTAEKLREAVLAV 393 (429) T ss_pred CHH-HHHHHHHHHHHC----CCCHHHHHHHHCCCCEEEECCCCCCHHHHHHHH--HCCCEEECCCCCCCHHHHHHHHHHH T ss_conf 627-789888888631----660158999964996898517888013765751--3562111376552789999999986 Q ss_pred HCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 86989999999999999998299899999999999 Q gi|255764474|r 311 MNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVY 345 (352) Q Consensus 311 ~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY 345 (352) ++|++.+.++.+=.++ +.+-=-.++.|+.++++. T Consensus 394 ~~D~~~~~~~~~~r~~-~~eAGG~~rAAdeiE~~l 427 (429) T TIGR01426 394 LSDDEYLERLKKIRAE-IREAGGARRAADEIEGFL 427 (429) T ss_pred HCCHHHHHHHHHHHHH-HHHCCCHHHHHHHHHHHH T ss_conf 0588899999999999-985045338999999974 No 99 >pfam00343 Phosphorylase Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses; glycogen, starch, glucan or maltodextrin. Probab=97.38 E-value=0.0024 Score=41.70 Aligned_cols=133 Identities=17% Similarity=0.170 Sum_probs=89.5 Q ss_pred CCCCCCCEEEEEEEECHHHHHHH-H---HHHHHHHHCCCC-----CEEEEEECCCCCCCCCHHHHHHHHHHHC------- Q ss_conf 88988727999512264451236-6---654565300477-----4058863133321000146677644311------- Q gi|255764474|r 161 KISEDAKLIGCFGRIRKLKGTDL-F---VDCMINILPHHP-----GWTAVVVGKTTLKHYLFKKNLQRRIYAN------- 224 (352) Q Consensus 161 ~~~~~~~~i~~~G~~~~~Kg~~~-l---i~a~~~l~~~~~-----~~~l~i~G~g~~~~~~~~~~l~~~i~~~------- 224 (352) .+.++..+.+++-|+...|+..+ + +.-+.+++. .| ..++++.|...+.+. .-+++.++|... T Consensus 439 ~ldp~slfdvq~kR~heYKRq~Ln~l~ii~~y~rik~-~~~~~~~P~tfIFaGKAaP~y~-~aK~iIklI~~va~~in~D 516 (712) T pfam00343 439 VVNPNALFDVQVKRIHEYKRQLLNVLHVIYRYNRIKE-DPPKDVVPRVVIFGGKAAPGYY-MAKRIIKLINSVADVVNND 516 (712) T ss_pred EECCCCCHHHHHHHHHHHCCHHCCHHHHHHHHHHHHH-CCCCCCCCEEEEECCCCCCCCH-HHHHHHHHHHHHHHHHHCC T ss_conf 0085532025532264404000015369999999983-8877878659998277798748-9999999999999997078 Q ss_pred -CCCC--CCCCCCCC--CCCCCCCHHHHHHCCCCCC--CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEE Q ss_conf -2332--21222222--3310000000000012333--2222221000001001110278841010023488424996 Q gi|255764474|r 225 -GLKK--RILFIDEQ--SSIEDWYRALNIFVAPPLY--EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIV 295 (352) Q Consensus 225 -~l~~--~V~~~g~~--~~~~~~~~~adi~i~pS~~--Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~ 295 (352) .+.+ +|.|+... +-.+.++.+||+-...|+. |..|..-+-+|..|...|+|--|..-|+.+....++-+++ T Consensus 517 p~v~~~lkVVFlenY~VslAe~lipa~Dvseqis~a~~EASGTsnMK~~lNGaltlgtlDGanvEi~e~vG~eN~fiF 594 (712) T pfam00343 517 PAVGDKLKVVFLPNYRVSLAEKIIPASDISEQISTAGTEASGTSNMKFALNGALTIGTLDGANVEIAEEVGEENIFIF 594 (712) T ss_pred HHHCCEEEEEECCCCCHHHHHHHCCCCCHHHHCCCCCCCCCCCCCHHHHHCCCEEEECCCCHHHHHHHHCCCCCEEEE T ss_conf 311782699983899869998753420046437888734578620689976872673366526788986285536870 No 100 >pfam09314 DUF1972 Domain of unknown function (DUF1972). Members of this family of functionally uncharacterized domains are found in bacterial glycosyltransferases and rhamnosyltransferases. Probab=97.27 E-value=0.018 Score=35.78 Aligned_cols=141 Identities=16% Similarity=0.189 Sum_probs=79.6 Q ss_pred HHHEEEECC-CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCC--CC---C----------CCCCC-H----HHHHHHH Q ss_conf 044589707-8787877077888899999997198599980588--66---5----------62347-8----9989988 Q gi|255764474|r 3 MNNIDVIAP-NMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCL--PK---N----------IPSIG-I----SSLLTCW 61 (352) Q Consensus 3 ~~~i~~i~~-~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~--~~---~----------~~~~~-~----~~~~~~~ 61 (352) +++|.+|-. .++.+++|.+..+.++++.+...++++.++-... +. . .+... . .++..++ T Consensus 1 mkKIaiiGtRGiPa~yGGfEt~ve~L~~~l~~~~~~v~V~c~~~~~~~~~~~y~gv~~~~i~~~~~g~~~~i~~d~~s~~ 80 (185) T pfam09314 1 MQHVFIIGSRGLPAKYGGFETFVEKLVEHQQSKNIKYHVACLSENSAKSHFEYKGADCFYIKVPKIGPARVIAYDIMAIN 80 (185) T ss_pred CCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEECCEEEEEECCCCCCCHHHHHHHHHHHH T ss_conf 95699995888885347299999999988853895699998179887888567888999947866673779999999999 Q ss_pred HH---H--HHCCCEEEEECCCHH-HHHHHHHHHH--HCCCEEEEECCHHHHHHH---------HHHHHH-HHHCCEEEEC Q ss_conf 75---4--325970899628546-8999999997--099819997574562011---------347989-9626689988 Q gi|255764474|r 62 KK---P--IGQNSRIWHARRNNE-MLLGVMMRDV--LRMPLKLVFTSPSQRNHS---------RWTRYL-ISRMDEVITT 123 (352) Q Consensus 62 ~~---~--~~~~~~ivh~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~~~~~---------~~~~~l-~~~~d~ii~~ 123 (352) .. . .....+++|.+.... .....+++.+ ++.++.+..+..-+++.+ ++.+++ .+.+|.+|+. T Consensus 81 ~a~~~~~~~~~~~~i~~~lg~~~g~~~~~~~~~~~~~g~kvvvn~dGlEWkR~KW~~~ak~ylk~~E~~avk~ad~lI~D 160 (185) T pfam09314 81 YALKIIKENNIQNPIFYILGNTIGPFIAPFAKKIHSVGGKLFVNPDGLEWKRAKWSRPVRKYLKISEKLMVKYADLVISD 160 (185) T ss_pred HHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCCCHHCHHHHHHHHHHHHHHHHHHHHHHCCEEEEC T ss_conf 99987465035688899976762788899999887508819995897101756563799999999999999818978984 Q ss_pred CHHHHHHCC----C-CEEEECCCCC Q ss_conf 989997436----8-8399748827 Q gi|255764474|r 124 SQKSARFIE----R-PSTVIMHGVD 143 (352) Q Consensus 124 s~~~~~~~~----~-~~~vi~~gid 143 (352) |+..++|+. + ...+||+|.| T Consensus 161 s~~I~~y~~~~y~~~~~~~I~YG~d 185 (185) T pfam09314 161 NKGIEKYIQSEYGAPKTTFIAYGTD 185 (185) T ss_pred CHHHHHHHHHHCCCCCCEEECCCCC T ss_conf 8899999999809988689217999 No 101 >pfam04464 Glyphos_transf CDP-Glycerol:Poly(glycerophosphate) glycerophosphotransferase. Wall-associated teichoic acids are a heterogeneous class of phosphate-rich polymers that are covalently linked to the cell wall peptidoglycan of gram-positive bacteria. They consist of a main chain of phosphodiester-linked polyols and/or sugar moieties attached to peptidoglycan via a linkage unit. CDP-glycerol:poly(glycerophosphate) glycerophosphotransferase is responsible for the polymerisation of the main chain of the teichoic acid by sequential transfer of glycerol-phosphate units from CDP-glycerol to the linkage unit lipid. Probab=97.21 E-value=0.0039 Score=40.25 Aligned_cols=147 Identities=16% Similarity=0.236 Sum_probs=85.5 Q ss_pred HHHCCCCCCCCEEEEEEEECHHHHH-------HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 8872889887279995122644512-------366654565300477405886313332100014667764431123322 Q gi|255764474|r 157 RRHLKISEDAKLIGCFGRIRKLKGT-------DLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKR 229 (352) Q Consensus 157 ~~~~~~~~~~~~i~~~G~~~~~Kg~-------~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~ 229 (352) ++.+|+++++++|+|+=.+...... ...++.+.+.++ .++.+++-.... ............+ T Consensus 2 k~~lgl~~~kkvILYaPT~R~~~~~~~~~~~~~~~~~~l~~~l~--~n~~liik~Hp~---------~~~~~~~~~~~~~ 70 (186) T pfam04464 2 REELGIPKDKKVILYAPTFRDDGYYSAGSILFNLDLEKLLEKLG--ENYVILVKLHPL---------VSNSIINKRYDSD 70 (186) T ss_pred HHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHC--CCEEEEEEECHH---------HHCCHHHHCCCCC T ss_conf 26419999897999869733886655444323013999998727--683999972667---------6400022025776 Q ss_pred CCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCC-------CCCEEEEECCCCHHH Q ss_conf 1222222331000000000001233322222210000010011102788410100234-------884249965999899 Q gi|255764474|r 230 ILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPE-------NAKAGVIVPPRNLHA 302 (352) Q Consensus 230 V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~-------~g~~G~~~~~~d~~~ 302 (352) +.......++..++..||+++. -++-+..|+|.+++|||.- .....+..... .-.-|-.+ .+.++ T Consensus 71 ~~~~~~~~di~~ll~~aDiLIT-----DYSSi~fD~lll~kPii~~-~~D~~~Y~~~rg~~~d~~~~~~g~~v--~~~~e 142 (186) T pfam04464 71 VIDVSDYSDIQDLFLASDILIT-----DYSSVFFDFALLDKPIIFY-APDLEEYRELRGFYFDYEKEAPGPVV--KTFEE 142 (186) T ss_pred EEECCCCCCHHHHHHHHCEEEE-----EHHHHHHHHHHHCCCEEEE-ECCHHHHHHCCCCCCCHHHCCCCCEE--CCHHH T ss_conf 7978898589999998436776-----4688999999879978998-18789997525861058780787625--98999 Q ss_pred HHHHHHHHHCCHHHHHHHHH Q ss_conf 99999999869899999999 Q gi|255764474|r 303 LEKAVLYFMNSKKIMSDTGN 322 (352) Q Consensus 303 la~~i~~l~~~~~~~~~~~~ 322 (352) |.++|...+.++....+-.+ T Consensus 143 L~~~i~~~~~~~~~~~~~~~ 162 (186) T pfam04464 143 LLDALKNYMENDEEYAEKRR 162 (186) T ss_pred HHHHHHHHHHCCHHHHHHHH T ss_conf 99999998757767799999 No 102 >pfam08323 Glyco_transf_5 Starch synthase catalytic domain. Probab=97.20 E-value=0.02 Score=35.45 Aligned_cols=30 Identities=23% Similarity=0.180 Sum_probs=26.0 Q ss_pred CCCCHHHHHHHHHHHHHHHCCEEEEEECCC Q ss_conf 877077888899999997198599980588 Q gi|255764474|r 16 RHTGVTSTVFGLCPIQRKLGQRLVVFGYCL 45 (352) Q Consensus 16 ~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~ 45 (352) +-+|....+..+.+.+.+.|+++.++.|.. T Consensus 13 KvGGLgdvv~~Lp~aL~~~G~~V~vi~P~Y 42 (229) T pfam08323 13 KTGGLADVVGALPKALAKLGHDVRVILPRY 42 (229) T ss_pred CCCCHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 466589999999999997699499993289 No 103 >pfam00201 UDPGT UDP-glucoronosyl and UDP-glucosyl transferase. Probab=97.16 E-value=0.019 Score=35.54 Aligned_cols=80 Identities=14% Similarity=0.073 Sum_probs=51.9 Q ss_pred CCCCCCCCCCCCEEEEEECCC----CCCCHHCCCCCCCEEEEECCC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 222221000001001110278----841010023488424996599--98999999999986989999999999999998 Q gi|255764474|r 257 GFGLTPLEAMASGIPVVASNT----GVFSELLDPENAKAGVIVPPR--NLHALEKAVLYFMNSKKIMSDTGNRGRERAVK 330 (352) Q Consensus 257 g~gl~~lEAma~G~PvI~s~~----~~~~e~i~~~~g~~G~~~~~~--d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~ 330 (352) |--+++.||+.+|+|+|+-+. ......+. +.| .|..++.. +.+.+.++|.++++|+.+++.+.+-++-.-.+ T Consensus 349 gG~~S~~Eai~~GVP~v~iP~f~DQ~~Na~~~~-~~G-~g~~l~~~~lt~~~l~~ai~~vl~n~~Y~~na~~~s~~~~d~ 426 (501) T pfam00201 349 AGSNGVYEAICHGVPMVGMPLFGDQMDNAKHME-AKG-AAVTLNVLTMTSEDLLNALKTVINDPSYKENIMRLSSIHHDQ 426 (501) T ss_pred CCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH-HCC-CEEEEECCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCC T ss_conf 873069999985989897156344699999999-779-789963211999999999999970988999999999998659 Q ss_pred HCCHHHHH Q ss_conf 29989999 Q gi|255764474|r 331 HFSIVKEA 338 (352) Q Consensus 331 ~fs~~~~a 338 (352) -.++...| T Consensus 427 P~~p~~~a 434 (501) T pfam00201 427 PVKPLDRA 434 (501) T ss_pred CCCHHHHH T ss_conf 99989999 No 104 >COG3980 spsG Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase [Cell envelope biogenesis, outer membrane] Probab=97.13 E-value=0.02 Score=35.50 Aligned_cols=288 Identities=11% Similarity=0.078 Sum_probs=134.0 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHH-----HHHHHCCCEEEEE-CCCH Q ss_conf 458970787878770778888999999971985999805886656234789989988-----7543259708996-2854 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCW-----KKPIGQNSRIWHA-RRNN 78 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~ivh~-~~~~ 78 (352) +|.++.......+.|=......+++.+.+.+..+.-++......++.- .++.+... ..+...++|++-. +... T Consensus 2 ~V~i~~Dgg~~iGmGHV~R~l~LA~~l~k~~~~~~fl~k~~~e~~~~~-~~~~f~~~~~~~~n~ik~~k~d~lI~Dsygl 80 (318) T COG3980 2 KVLIRCDGGLEIGMGHVMRTLTLARELEKRGFACLFLTKQDIEAIIHK-VYEGFKVLEGRGNNLIKEEKFDLLIFDSYGL 80 (318) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHH-HHHHCCCEEEECCCCCCCCCCCEEEEECCCC T ss_conf 579992687555751345599999999851746888406625642156-6651043002336410036677899942688 Q ss_pred HHHHHHHHHHHHCCCEEEEECCHHHHHHHH---HHHHHHHHCCEEEECCHHHHHHCCCCEEEECCCCCHHHHCCC-CCHH Q ss_conf 689999999970998199975745620113---479899626689988989997436883997488277782378-4326 Q gi|255764474|r 79 EMLLGVMMRDVLRMPLKLVFTSPSQRNHSR---WTRYLISRMDEVITTSQKSARFIERPSTVIMHGVDTERFRPT-SNKQ 154 (352) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~d~ii~~s~~~~~~~~~~~~vi~~gid~~~~~~~-~~~~ 154 (352) ..-.....+.-.+.+.. .|.....+.+.. .....++..|.. .+.+.++. +..|.+ |.|. +.-. T Consensus 81 ~~dd~k~ik~e~~~k~l-~fDd~~~~~~~d~d~ivN~~~~a~~~y--------~~v~~k~~-~~lGp~---y~~lr~eF~ 147 (318) T COG3980 81 NADDFKLIKEEAGSKIL-IFDDENAKSFKDNDLIVNAILNANDYY--------GLVPNKTR-YYLGPG---YAPLRPEFY 147 (318) T ss_pred CHHHHHHHHHHHCCCEE-EECCCCCCCHHHHHHHHHHHHCCHHHC--------CCCCCCEE-EEECCC---CEECCHHHH T ss_conf 87899998897388179-964777642256673545553511220--------53676637-996587---114169999 Q ss_pred HHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 87887288988727999512264451236665456530047740588631333210001466776443112332212222 Q gi|255764474|r 155 EARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID 234 (352) Q Consensus 155 ~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g 234 (352) +.|+..-..+.+.++...|.-++ |++. .+.+..+.+..-++++ ++|++.+. .+++.+.++. ..++.+.- T Consensus 148 ~~r~~~~~r~~r~ilI~lGGsDp-k~lt--~kvl~~L~~~~~nl~i-V~gs~~p~----l~~l~k~~~~---~~~i~~~~ 216 (318) T COG3980 148 ALREENTERPKRDILITLGGSDP-KNLT--LKVLAELEQKNVNLHI-VVGSSNPT----LKNLRKRAEK---YPNINLYI 216 (318) T ss_pred HHHHHHHHCCHHEEEEECCCCCH-HHHH--HHHHHHHHCCCEEEEE-EECCCCCC----HHHHHHHHHH---CCCEEEEE T ss_conf 86899863531128997168872-4459--9999984035704999-94688854----6678888865---78802686 Q ss_pred CCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE----CCCCCCCHHCCCCCCCEEEEECC---CCHHHHHHHH Q ss_conf 2233100000000000123332222221000001001110----27884101002348842499659---9989999999 Q gi|255764474|r 235 EQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA----SNTGVFSELLDPENAKAGVIVPP---RNLHALEKAV 307 (352) Q Consensus 235 ~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~----s~~~~~~e~i~~~~g~~G~~~~~---~d~~~la~~i 307 (352) +.+++..+|..||+.+. .-|.++.||...|+|.++ .+--....... ..|...+. .........+ T Consensus 217 ~~~dma~LMke~d~aI~-----AaGstlyEa~~lgvP~l~l~~a~NQ~~~a~~f~----~lg~~~~l~~~l~~~~~~~~~ 287 (318) T COG3980 217 DTNDMAELMKEADLAIS-----AAGSTLYEALLLGVPSLVLPLAENQIATAKEFE----ALGIIKQLGYHLKDLAKDYEI 287 (318) T ss_pred CCHHHHHHHHHCCHHEE-----CCCHHHHHHHHHCCCCEEEEEECCHHHHHHHHH----HCCCHHHCCCCCCHHHHHHHH T ss_conf 22458999986033314-----463579999982698258763301788877898----668600226777618789999 Q ss_pred HHHHCCHHHHHHHHHHHHH Q ss_conf 9998698999999999999 Q gi|255764474|r 308 LYFMNSKKIMSDTGNRGRE 326 (352) Q Consensus 308 ~~l~~~~~~~~~~~~~a~~ 326 (352) .++..|+.++...+...+. T Consensus 288 ~~i~~d~~~rk~l~~~~~~ 306 (318) T COG3980 288 LQIQKDYARRKNLSFGSKL 306 (318) T ss_pred HHHHHCHHHHHHHHHCCCE T ss_conf 9864077776422211102 No 105 >cd03786 GT1_UDP-GlcNAc_2-Epimerase Bacterial members of the UDP-N-Acetylglucosamine (GlcNAc) 2-Epimerase family are known to catalyze the reversible interconversion of UDP-GlcNAc and UDP-N-acetylmannosamine (UDP-ManNAc). The enzyme serves to produce an activated form of ManNAc residues (UDP-ManNAc) for use in the biosynthesis of a variety of cell surface polysaccharides; The mammalian enzyme is bifunctional, catalyzing both the inversion of stereochemistry at C-2 and the hydrolysis of the UDP-sugar linkage to generate free ManNAc. It also catalyzes the phosphorylation of ManNAc to generate ManNAc 6-phosphate, a precursor to salic acids. In mammals, sialic acids are found at the termini of oligosaccharides in a large variety of cell surface glycoconjugates and are key mediators of cell-cell recognition events. Mutations in human members of this family have been associated with Sialuria, a rare disease caused by the disorders of sialic acid metabolism. This family belongs to the GT-B st Probab=97.08 E-value=0.027 Score=34.54 Aligned_cols=241 Identities=13% Similarity=0.105 Sum_probs=119.2 Q ss_pred HHHHHHHHHCCCEEEEECC-CHHHHHHHHHHHHHCCCEEEEECCHHHH-----HHHHHHHHHHHHCCE-EEECCHHHHHH Q ss_conf 9988754325970899628-5468999999997099819997574562-----011347989962668-99889899974 Q gi|255764474|r 58 LTCWKKPIGQNSRIWHARR-NNEMLLGVMMRDVLRMPLKLVFTSPSQR-----NHSRWTRYLISRMDE-VITTSQKSARF 130 (352) Q Consensus 58 ~~~~~~~~~~~~~ivh~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~l~~~~d~-ii~~s~~~~~~ 130 (352) ..+.+.....+||++-.+. ..+++.+.+++...+.|+..+ ....+ ......|..+.+... .++.++..++. T Consensus 78 ~~~~~~l~~~kPD~VlV~GDr~e~la~Alaa~~~~Ipi~Hi--egG~rs~~~~~~de~~R~~i~kls~lhf~~t~~~~~~ 155 (363) T cd03786 78 IGLEAVLLEEKPDLVLVLGDTNETLAAALAAFKLGIPVAHV--EAGLRSFDRGMPDEENRHAIDKLSDLHFAPTEEARRN 155 (363) T ss_pred HHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEEE--ECCCCCCCCCCCCHHHHHHHHHCCCEEECCCHHHHHH T ss_conf 99999999729999999488842879999999819818996--2643347679987798755221012561461999999 Q ss_pred C------CCCEEEECC-CCCHHHHCCCCCHHHHHHHCCC-CCCCCEEEEEEEE---CHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 3------688399748-8277782378432687887288-9887279995122---644512366654565300477405 Q gi|255764474|r 131 I------ERPSTVIMH-GVDTERFRPTSNKQEARRHLKI-SEDAKLIGCFGRI---RKLKGTDLFVDCMINILPHHPGWT 199 (352) Q Consensus 131 ~------~~~~~vi~~-gid~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~G~~---~~~Kg~~~li~a~~~l~~~~~~~~ 199 (352) + +..+.++.| ++|.-................. ....++++..=|. ...+.+..+++++.++... ++. T Consensus 156 L~~~G~~~~~I~~vG~~~iD~l~~~~~~~~~~~~~~~~~~~~~~~~lvt~Hr~~n~~~~~~~~~i~~al~~~~~~--~~~ 233 (363) T cd03786 156 LLQEGEPPERIFVVGNTMIDALLRLLELAKKELILELLGLLPKKYILVTLHRVENVDDGEQLEEILEALAELAEE--DVP 233 (363) T ss_pred HHHHCCCCCCEEECCCCHHHHHHHHHHHHHCCHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHC--CCE T ss_conf 986154755257738619999998876410326677445455877999964523335689999999999998743--968 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHCC-CCCCCCCCCC--CCCCCCCCHHHHHHCCCCCCCCCCCCCCC-CCCCEEEEEE- Q ss_conf 88631333210001466776443112-3322122222--23310000000000012333222222100-0001001110- Q gi|255764474|r 200 AVVVGKTTLKHYLFKKNLQRRIYANG-LKKRILFIDE--QSSIEDWYRALNIFVAPPLYEGFGLTPLE-AMASGIPVVA- 274 (352) Q Consensus 200 l~i~G~g~~~~~~~~~~l~~~i~~~~-l~~~V~~~g~--~~~~~~~~~~adi~i~pS~~Eg~gl~~lE-Ama~G~PvI~- 274 (352) +++ ...+. ..+.+++.+.+.- -.+++.+... ..+...++..|++++-=| | .+.| |-..|+|+|. T Consensus 234 v~~--pn~d~---~~~~i~~~~~~~~~~~~~i~~~~~l~~~~~~~ll~~a~~vigdS-----s-Gi~Eea~~l~~P~i~i 302 (363) T cd03786 234 VVF--PNHPR---TRPRIREAGLEFLGHHPNVLLISPLGYLYFLLLLKNADLVLTDS-----G-GIQEEASFLGVPVLNL 302 (363) T ss_pred EEE--ECCCC---HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCEEEEECC-----C-CCEEEECCCCCCEEEE T ss_conf 999--77972---57789999999855787799978877499999995073998258-----8-8688502069878982 Q ss_pred CCCCCCCHHCCCCCCCEEEEEC-CCCHHHHHHHHHHHHCCHHHHH Q ss_conf 2788410100234884249965-9998999999999986989999 Q gi|255764474|r 275 SNTGVFSELLDPENAKAGVIVP-PRNLHALEKAVLYFMNSKKIMS 318 (352) Q Consensus 275 s~~~~~~e~i~~~~g~~G~~~~-~~d~~~la~~i~~l~~~~~~~~ 318 (352) .+..--+|.++ +|.-+. ..|.+++.+++..++++..... T Consensus 303 r~rqe~re~~~-----~~~~~~v~~~~~~I~~~i~~~l~~~~~~~ 342 (363) T cd03786 303 RDRTERPETVE-----SGTNVLVGTDPEAILAAIEKLLSDEFAYS 342 (363) T ss_pred CCCCCCCEEHH-----CCEEEECCCCHHHHHHHHHHHHCCHHHHH T ss_conf 68776734210-----05488658999999999999975703453 No 106 >TIGR03609 S_layer_CsaB polysaccharide pyruvyl transferase CsaB. The CsaB protein (cell surface anchoring B) of Bacillus anthracis adds a pyruvoyl group to peptidoglycan-associated polysaccharide. This addition is required for proteins with an S-layer homology domain (pfam00395) to bind. Within the larger group of proteins described by pfam04230, this model represents a distinct clade that nearly exactly follows the phylogenetic distribution of the S-layer homology domain (pfam00395). Probab=96.81 E-value=0.045 Score=33.10 Aligned_cols=257 Identities=13% Similarity=0.109 Sum_probs=122.1 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC------CCCCCCHHHHHHHHHHHHHCCCEEEEE------- Q ss_conf 9707878787707788889999999719859998058866------562347899899887543259708996------- Q gi|255764474|r 8 VIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPK------NIPSIGISSLLTCWKKPIGQNSRIWHA------- 74 (352) Q Consensus 8 ~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ivh~------- 74 (352) +|..-+.+...|-+.....+++.++. +..+.+... .|. .+..+..+++..+++....-+..|+=. T Consensus 2 ~i~GyyG~~N~GDea~l~a~l~~l~~-~~~i~vls~-~p~~t~~~~~v~av~r~~~~~~~~al~~~d~~I~GGG~llqD~ 79 (298) T TIGR03609 2 LLCGYYGFGNLGDEALLAALLRELPP-GVEPTVLSN-DPAETAKLYGVEAVNRRSLLAVLRALRRADVVIWGGGSLLQDV 79 (298) T ss_pred EEEECCCCCCCCHHHHHHHHHHHCCC-CCCEEEECC-CHHHHHHHCCCEEECCCCHHHHHHHHHHCCEEEECCCCCCCCC T ss_conf 59951589975439999999985589-981899529-8699998639466755699999999987799998585414588 Q ss_pred CCC---HHHHHHHHHHHHHCCCEEEEECCHH--HH-HHHHHHHHHHHHCCEEEECCHHHHHHCC---CCEEEECCCCCHH Q ss_conf 285---4689999999970998199975745--62-0113479899626689988989997436---8839974882777 Q gi|255764474|r 75 RRN---NEMLLGVMMRDVLRMPLKLVFTSPS--QR-NHSRWTRYLISRMDEVITTSQKSARFIE---RPSTVIMHGVDTE 145 (352) Q Consensus 75 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~l~~~~d~ii~~s~~~~~~~~---~~~~vi~~gid~~ 145 (352) ... ...+.-..++.+++.|+.+.-.+.. .+ ......+++++++|.+.+-=+.+.+++. .++.+. -|+- T Consensus 80 ts~~s~~yy~~~~~la~~~gkpv~~~gqgiGP~~~~~~r~l~r~~l~~~~~i~vRD~~S~~~l~~lGv~~~l~---~D~a 156 (298) T TIGR03609 80 TSFRSLLYYLGLMRLARLFGKPVILWGQGIGPLRRRLSRWLVRRVLRGCRAISVRDAASYRLLKRLGIPAELA---ADPV 156 (298) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCEEEE---CCEE T ss_conf 7543478999999999982998899942688767878999999998419999977888899999749984896---7713 Q ss_pred HHCCCCCHHHHHHHCCCCCCCCEEEEEEEEC---HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 8237843268788728898872799951226---4451236665456530047740588631333210001466776443 Q gi|255764474|r 146 RFRPTSNKQEARRHLKISEDAKLIGCFGRIR---KLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIY 222 (352) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~---~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~ 222 (352) ..-+...... ....++..+++.-|-. ..+....+.+++.++.++.. ...+++--....|....+++..... T Consensus 157 f~l~~~~~~~-----~~~~~~~~i~v~~r~~~~~~~~~~~~~~~~l~~l~~~~g-~~V~~lp~~~~~D~~~~~~l~~~~~ 230 (298) T TIGR03609 157 WLLPPEPWPG-----GEPLPEPVIVVSLRPWPLLDVSRLLRLLRALDRLQRDTG-AFVLFLPFQQPQDLPLARALRDQLL 230 (298) T ss_pred ECCCCCCCCC-----CCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCC-CEEEEEECCCCCHHHHHHHHHHHCC T ss_conf 2057755444-----445679989999788887899999999999999998359-8699996887854999999997578 Q ss_pred HCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHH Q ss_conf 1123322122222233100000000000123332222221000001001110-278841010 Q gi|255764474|r 223 ANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSEL 283 (352) Q Consensus 223 ~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~ 283 (352) ....+......+++..+++.+|+++..-.+ .++-|+.+|+|+|+ +..+=+..+ T Consensus 231 ---~~~~i~~~~~~~e~~~~i~~~~~vI~~RlH-----~~I~A~~~gvP~i~isY~~Kv~~f 284 (298) T TIGR03609 231 ---GPAEVLSPLDPEELLGLFASARLVIGMRLH-----ALILAAAAGVPFVALSYDPKVRAF 284 (298) T ss_pred ---CCCEEECCCCHHHHHHHHHHCCEEEEECCH-----HHHHHHHCCCCEEEECCHHHHHHH T ss_conf ---863653789999999999609989980708-----999999779998995307999999 No 107 >PRK10017 putative pyruvyl transferase; Provisional Probab=96.78 E-value=0.048 Score=32.87 Aligned_cols=231 Identities=14% Similarity=0.139 Sum_probs=114.5 Q ss_pred HHHHHHCCCEEEEECC--HHHH-HHHHHHHHHHHHCCEEEECCHHHHHHCCC------CEEEECCCCCHHHH-CCC---- Q ss_conf 9999709981999757--4562-01134798996266899889899974368------83997488277782-378---- Q gi|255764474|r 85 MMRDVLRMPLKLVFTS--PSQR-NHSRWTRYLISRMDEVITTSQKSARFIER------PSTVIMHGVDTERF-RPT---- 150 (352) Q Consensus 85 ~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~l~~~~d~ii~~s~~~~~~~~~------~~~vi~~gid~~~~-~~~---- 150 (352) +.+.+.+.|+.+.-++ +... ......+|.++++|.|.+-=+.+.+++.+ ++.+. .|+-.. .+. T Consensus 142 L~a~l~kKpv~~~aQgIGP~~~~~~~~l~~~vl~~~d~ItvRD~~S~~~L~~lGv~~~~i~~t---aDpAF~l~~~~~~~ 218 (426) T PRK10017 142 LCTFMAKKPLFMIGHSVGPFQDEQFNQLANYVFGHCDALILRESVSLDLMKRSNITTAKVEHG---VDTAWLVDHHTEDF 218 (426) T ss_pred HHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCEEEECCHHHHHHHHHCCCCCCCEEEE---CCCCEECCCCCCCC T ss_conf 999973996899904468808788999999998419789976587899999859997862894---58211025654332 Q ss_pred CCHHHHHHHCCCCCCCCEEEEEEE-ECHH---------HHHHHHHHHHHHHHCCCCCEEEEE--ECC-C-CCCCCCHHHH Q ss_conf 432687887288988727999512-2644---------512366654565300477405886--313-3-3210001466 Q gi|255764474|r 151 SNKQEARRHLKISEDAKLIGCFGR-IRKL---------KGTDLFVDCMINILPHHPGWTAVV--VGK-T-TLKHYLFKKN 216 (352) Q Consensus 151 ~~~~~~~~~~~~~~~~~~i~~~G~-~~~~---------Kg~~~li~a~~~l~~~~~~~~l~i--~G~-g-~~~~~~~~~~ 216 (352) ......++..++....+.|+..=| +.+. +-...+.+.+..+..+.-.+.++- .|. . ...|.....+ T Consensus 219 ~~~~~~~~~l~~~~~~~~VgisVr~~~~~~~~~~~~~~~y~~a~a~~~d~l~~~G~~Vv~lp~~~~i~~~~~dD~~~~~~ 298 (426) T PRK10017 219 TASYAVQHWLDVAAQQKTVAITLRELAPFDKRLGTTQQAYEKAFAGVVNRILDEGYQVIALSTCTGIDSYNKDDRMVALN 298 (426) T ss_pred CCCHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHH T ss_conf 21235564136566687799997036630112441089999999999999997798799960566877778025999999 Q ss_pred HHHHHHHCCCCCCCCCCCC---CCCCCCCCHHHHHHCCCCCCCCCCC-CCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCE Q ss_conf 7764431123322122222---2331000000000001233322222-21000001001110-27884101002348842 Q gi|255764474|r 217 LQRRIYANGLKKRILFIDE---QSSIEDWYRALNIFVAPPLYEGFGL-TPLEAMASGIPVVA-SNTGVFSELLDPENAKA 291 (352) Q Consensus 217 l~~~i~~~~l~~~V~~~g~---~~~~~~~~~~adi~i~pS~~Eg~gl-~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~ 291 (352) +. +..+-.++++++.. ..++..+++.||+++- .-+ +++=|+.+|+|+|+ +..+=...+.. .-|.. T Consensus 299 i~---~~m~~~~~~~il~~~~~~~E~~~ii~~~dl~IG------~RLHslIfA~~~gvP~i~IsYd~K~~g~~~-~lGl~ 368 (426) T PRK10017 299 LR---QHISDPARYHVVMDELNDLEMGKILGACELTVG------TRLHSAIISMNFATPAIAINYEHKSAGIMQ-QLGLP 368 (426) T ss_pred HH---HHHCCCCCEEEECCCCCHHHHHHHHHHCHHHHH------HHHHHHHHHHHCCCCEEEEEECHHHHHHHH-HCCCC T ss_conf 99---972687636983899998999999973922998------889999999975999698402287899999-75993 Q ss_pred EEE--ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 499--6599989999999999869899999999999999 Q gi|255764474|r 292 GVI--VPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERA 328 (352) Q Consensus 292 G~~--~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~ 328 (352) .+. ++.-+++++..++.+++++.+...+...++-+.. T Consensus 369 ~~~~di~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (426) T PRK10017 369 EMAIDIRHLLDGSLQAMVADTLGQLPALNARLAEAVSRE 407 (426) T ss_pred HHEECHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 003037766927899999999976999999999999999 No 108 >KOG4626 consensus Probab=96.72 E-value=0.015 Score=36.21 Aligned_cols=188 Identities=15% Similarity=0.249 Sum_probs=118.3 Q ss_pred HHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCC-CCCCCC Q ss_conf 8788728898872799951226445123666545653004774058863133321000146677644311233-221222 Q gi|255764474|r 155 EARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLK-KRILFI 233 (352) Q Consensus 155 ~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~-~~V~~~ 233 (352) ..|...++|++..+++....+ .|=-...++.|..++++-|+-.+++.-..... ....+..+++.|++ +||.|. T Consensus 748 P~r~~y~Lp~d~vvf~~FNqL--yKidP~~l~~W~~ILk~VPnS~LwllrfPa~g----e~rf~ty~~~~Gl~p~riifs 821 (966) T KOG4626 748 PTRSQYGLPEDAVVFCNFNQL--YKIDPSTLQMWANILKRVPNSVLWLLRFPAVG----EQRFRTYAEQLGLEPDRIIFS 821 (966) T ss_pred CCCCCCCCCCCEEEEEECHHH--HCCCHHHHHHHHHHHHHCCCCEEEEEECCCCC----HHHHHHHHHHHCCCCCCEEEC T ss_conf 877777899870798530445--42898999999999985886416877245335----088999999819994525403 Q ss_pred CC--CCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC---CCCCEEEEECCCCHHHHHHHHH Q ss_conf 22--233100000000000123332222221000001001110278841010023---4884249965999899999999 Q gi|255764474|r 234 DE--QSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP---ENAKAGVIVPPRNLHALEKAVL 308 (352) Q Consensus 234 g~--~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~---~~g~~G~~~~~~d~~~la~~i~ 308 (352) +- .++-..-++-+|+.+=+....| -.+-.|.+.+|+|+|+-....+..-|.. ..-+.|-++ ..+-++..+.-. T Consensus 822 ~va~k~eHvrr~~LaDv~LDTplcnG-hTTg~dvLw~GvPmVTmpge~lAsrVa~Sll~~~Gl~hli-ak~~eEY~~iaV 899 (966) T KOG4626 822 PVAAKEEHVRRGQLADVCLDTPLCNG-HTTGMDVLWAGVPMVTMPGETLASRVAASLLTALGLGHLI-AKNREEYVQIAV 899 (966) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCC-CCCCHHHHCCCCCEEECCCHHHHHHHHHHHHHHCCCHHHH-HHHHHHHHHHHH T ss_conf 43104888876556652015767588-6655233103774263350888989899999981628887-512899999999 Q ss_pred HHHCCHHHHHHHHHHHHHHHH--HHCCHHHHHHHHHHHHHHHHH Q ss_conf 998698999999999999999--829989999999999999886 Q gi|255764474|r 309 YFMNSKKIMSDTGNRGRERAV--KHFSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 309 ~l~~~~~~~~~~~~~a~~~~~--~~fs~~~~a~~~~~iY~~~l~ 350 (352) +|-.|.+....+..+-+.... --|+-...+..+++.|.+.-+ T Consensus 900 ~Latd~~~L~~lr~~l~~~r~~splfd~~q~~~~LE~~y~~MW~ 943 (966) T KOG4626 900 RLATDKEYLKKLRAKLRKARASSPLFDTKQYAKGLERLYLQMWK 943 (966) T ss_pred HHHCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 86057799999999999985279766846888889999999999 No 109 >PRK02797 4-alpha-L-fucosyltransferase; Provisional Probab=96.54 E-value=0.069 Score=31.82 Aligned_cols=257 Identities=11% Similarity=0.102 Sum_probs=131.8 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCE-EEEECCHHHHH--------HHHHHHHHHHHCCEEEECCHH---HHH-HCCCC Q ss_conf 970899628546899999999709981-99975745620--------113479899626689988989---997-43688 Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDVLRMPL-KLVFTSPSQRN--------HSRWTRYLISRMDEVITTSQK---SAR-FIERP 134 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--------~~~~~~~l~~~~d~ii~~s~~---~~~-~~~~~ 134 (352) .--++|..++...+.+++.......+. ..++-...+.. ...+.|...+++.+++++.-. .++ +...+ T Consensus 78 ~kfflHGqFn~~LwLaLl~g~~~~~q~yWhiWGaDLYe~~~~~k~rlfy~lRR~a~~rvg~v~atrGDl~~~~q~~~~~~ 157 (358) T PRK02797 78 QRFFLHGQFNPTLWLALLSGGIKLKQFYWHIWGADLYEDSKGLKFKLFYPLRRLAQKRVGHVFATRGDLSYAAQRHPRVP 157 (358) T ss_pred CEEEEECCCCHHHHHHHHHCCCCCHHEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHCCCCC T ss_conf 65999567887999999958614021499998817877677589998899999997402459996421999998737778 Q ss_pred EEEE--CCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEE-ECHHHHHHHHHHHHHHHHCCCCCEEEE-EECCCCCCC Q ss_conf 3997--488277782378432687887288988727999512-264451236665456530047740588-631333210 Q gi|255764474|r 135 STVI--MHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGR-IRKLKGTDLFVDCMINILPHHPGWTAV-VVGKTTLKH 210 (352) Q Consensus 135 ~~vi--~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~-~~~~Kg~~~li~a~~~l~~~~~~~~l~-i~G~g~~~~ 210 (352) ...+ |.-+|+ .+..... .....+...| -+|+ -++.-++...++++.+.. ..+++.+ -+|-+ ..+ T Consensus 158 ~~~lyfpt~m~~-~l~~~~~-------~~~~~~~~tI-lvGNSgD~sN~Hie~L~~l~~~~--~~~v~i~~Pl~Yp-agn 225 (358) T PRK02797 158 GELLYFPTRMDP-SLNAMAN-------DRKRAGKMTI-LVGNSGDPSNRHIEALRALHQQF--GDNVKIIVPMGYP-ANN 225 (358) T ss_pred CEEEEECCCCCH-HHHCCCC-------CCCCCCCEEE-EEECCCCCCCCHHHHHHHHHHHH--CCCEEEEEECCCC-CCC T ss_conf 549994777886-7733566-------6567885599-97078985400899999999973--6886999977757-765 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCC---CCCCCCCCHHHHHHCCC-CCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCC Q ss_conf 0014667764431123322122222---23310000000000012-3332222221000001001110-27884101002 Q gi|255764474|r 211 YLFKKNLQRRIYANGLKKRILFIDE---QSSIEDWYRALNIFVAP-PLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLD 285 (352) Q Consensus 211 ~~~~~~l~~~i~~~~l~~~V~~~g~---~~~~~~~~~~adi~i~p-S~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~ 285 (352) +.|.++..+...++=-.+++..+.. -++...+++.||+.++. -+..|+|..++ -+..|+||+. ++.+-..++.. T Consensus 226 ~~Yi~~V~~~g~~lFg~~~~~~L~e~l~f~~Yl~lL~~~Dl~~F~~~RQQgiGnl~l-Li~lG~~~~l~r~n~fwqdl~e 304 (358) T PRK02797 226 QAYIEEVRQAGLALFGEENLQILTEKLPFDDYLALLRQCDLGYFIFARQQGIGTLCL-LIQLGKPVVLNRDNPFWQDMTE 304 (358) T ss_pred HHHHHHHHHHHHHHCCCCCEEEHHHHCCHHHHHHHHHHCCEEEEEHHHHHHHHHHHH-HHHCCCEEEEECCCCHHHHHHH T ss_conf 999999999999865866677644218989999999738973443124666608999-9977980899568947889986 Q ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 34884249965999899999999998698999999999999999829989999999999999886 Q gi|255764474|r 286 PENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 286 ~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~ 350 (352) .+. -+++..++.+ ++.. .+..+++....++.+. |+.++..+.+.++++...+ T Consensus 305 --q~v-pVlf~~d~L~---~~~i-----~e~qrqLa~~dk~~I~--Ff~pny~~~W~~~l~~~~g 356 (358) T PRK02797 305 --QGL-PVLFTGDDLD---EDLV-----REAQRQLASVDKNIIA--FFSPNYLQGWRNALAIAAG 356 (358) T ss_pred --CCC-CEEECCCCCC---HHHH-----HHHHHHHHHCCHHHEE--EECCCHHHHHHHHHHHHCC T ss_conf --698-2895388788---8999-----9999999851743242--1187637999999988626 No 110 >pfam06258 DUF1022 Protein of unknown function (DUF1022). This family consists of several hypothetical eukaryotic and prokaryotic proteins. The function of this family is unknown. Probab=96.31 E-value=0.094 Score=30.93 Aligned_cols=236 Identities=14% Similarity=0.106 Sum_probs=119.0 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCC----CCHHHHHHHH-HHHHHCCCEEEEECCCHHHHHHHHHHHHHCCC Q ss_conf 077888899999997198599980588665623----4789989988-75432597089962854689999999970998 Q gi|255764474|r 19 GVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPS----IGISSLLTCW-KKPIGQNSRIWHARRNNEMLLGVMMRDVLRMP 93 (352) Q Consensus 19 Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~-~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~ 93 (352) |...++..++..+.. ......+.+..+..+.. ...+.+.... .......+|++-+-..........++...+.+ T Consensus 2 G~~~Q~~gLAeaL~~-~~~~~~i~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~p~PdliIs~Gr~t~~~~~~lkr~~~~~ 80 (308) T pfam06258 2 GHENQALGLAEALGL-PYEIREVRVRKPWRWLPRRLPAPLWAILGPFAPALEPPWPDLVIGAGRQTHPLLRLLRRLSGGK 80 (308) T ss_pred CCHHHHHHHHHHCCC-CCEEEEEECCCHHHHCCCCCCCCHHHHHCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCC T ss_conf 638899999997099-8717999528677756765785045542345411158999889978814799999999974999 Q ss_pred EEEEEC-CHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCCCCEEEE-----CCCCCHHHHCCCCCHHHHHHHCCCCCCCC Q ss_conf 199975-745620113479899626689988989997436883997-----48827778237843268788728898872 Q gi|255764474|r 94 LKLVFT-SPSQRNHSRWTRYLISRMDEVITTSQKSARFIERPSTVI-----MHGVDTERFRPTSNKQEARRHLKISEDAK 167 (352) Q Consensus 94 ~~~~~~-~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~~~~vi-----~~gid~~~~~~~~~~~~~~~~~~~~~~~~ 167 (352) ...++. .+.. . .+..|.+++..+.- +.....|| +|.++.+..... ...+ ..+.-.+... T Consensus 81 ~~~I~i~~P~~-~--------~~~FDliv~P~HD~---~~~g~NVi~t~gal~~i~~~~l~~~--~~~~-~~~~~~~~p~ 145 (308) T pfam06258 81 TKTVQIMDPRL-P--------LGRFDLVIAPEHDG---VPPGPNVLLTVGALHRVTPQRLAEA--AAAW-PELAALPRPR 145 (308) T ss_pred CEEEEEECCCC-C--------CCCCCCEECCCCCC---CCCCCCEEEECCCCCCCCHHHHHHH--HHHH-HHHCCCCCCE T ss_conf 67999818998-8--------13477102574558---8899978962575554787788777--7665-5402477876 Q ss_pred EEEEEEEECHHHHHH-----HHHHHHHHHHCCCCCEEEEEECCCCCCCC-CHHHHHHHHHHHCCCCCCC-CCCCCCCC-C Q ss_conf 799951226445123-----66654565300477405886313332100-0146677644311233221-22222233-1 Q gi|255764474|r 168 LIGCFGRIRKLKGTD-----LFVDCMINILPHHPGWTAVVVGKTTLKHY-LFKKNLQRRIYANGLKKRI-LFIDEQSS-I 239 (352) Q Consensus 168 ~i~~~G~~~~~Kg~~-----~li~a~~~l~~~~~~~~l~i~G~g~~~~~-~~~~~l~~~i~~~~l~~~V-~~~g~~~~-~ 239 (352) +.+.+|.-+..-..+ .+++.+..+.+.. +..+.+..+- +.. ...+.|++.. +-...+ .|-+.-++ . T Consensus 146 i~vLIGG~sk~~~~~~~~~~~l~~~i~~l~~~~-~~~l~it~SR--RTP~~~~~~l~~~~---~~~~~~~~~~~~~~Npy 219 (308) T pfam06258 146 VAVLVGGPSKHFRWDADAARRLLEQLQALLEAY-GGSLLITTSR--RTPEAAEAALRKLL---GPRPGLYVWDGTGPNPY 219 (308) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEEECC--CCCHHHHHHHHHHH---CCCCEEEECCCCCCCHH T ss_conf 999965578788889999999999999999877-9729999468--89699999999860---89972898279886458 Q ss_pred CCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCC Q ss_conf 0000000000012333222222100000100111027884 Q gi|255764474|r 240 EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGV 279 (352) Q Consensus 240 ~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~ 279 (352) ..+++.||.++.++.+-.+ +-||.+.|+||-+-+..+ T Consensus 220 ~~~L~~Ad~iiVT~DSvSM---isEA~~tGkPV~i~~l~~ 256 (308) T pfam06258 220 FGFLAWADAVVVTADSVSM---VSEAAATGAPVGVLPLEG 256 (308) T ss_pred HHHHHHCCEEEEECCHHHH---HHHHHHCCCCEEEEECCC T ss_conf 9999858868990671889---999986499779996776 No 111 >pfam01075 Glyco_transf_9 Glycosyltransferase family 9 (heptosyltransferase). Members of this family belong to glycosyltransferase family 9. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. All of these enzymes transfer heptose to the lipopolysaccharide core. Probab=96.23 E-value=0.02 Score=35.39 Aligned_cols=115 Identities=17% Similarity=0.167 Sum_probs=60.5 Q ss_pred HHHHHCCCCCCCCEEEEEEEECHHHHHH--HHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCC-CC Q ss_conf 8788728898872799951226445123--66654565300477405886313332100014667764431123322-12 Q gi|255764474|r 155 EARRHLKISEDAKLIGCFGRIRKLKGTD--LFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKR-IL 231 (352) Q Consensus 155 ~~~~~~~~~~~~~~i~~~G~~~~~Kg~~--~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~-V~ 231 (352) ......+......+++..|.-.+.|... ...+.+..+.++ +...++.|..++.+.. ..+.+.+ +...+ +. T Consensus 97 ~~~~~~~~~~~~~i~i~pga~~~~K~Wp~e~f~~L~~~l~~~--~~~vvl~gg~~~~e~~----~~~~i~~-~~~~~~~~ 169 (249) T pfam01075 97 AALDKAGNGGRPYIAFCPGASRPAKRWPAEHFAELAQALQER--GWQVVLFGGAEDREEE----IAKRLAA-GLRTPCVN 169 (249) T ss_pred HHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHC--CCCEEEEECCCHHHHH----HHHHHHH-HCCCCEEE T ss_conf 999862025999899973878856779999999999999966--9956997386789999----9999986-38998686 Q ss_pred CCCC--CCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 2222--2331000000000001233322222210000010011102788410 Q gi|255764474|r 232 FIDE--QSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS 281 (352) Q Consensus 232 ~~g~--~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~ 281 (352) +.|. ..++..+++.||++|.... -.+-=|.|.|+|+|+--.+..+ T Consensus 170 l~g~~sL~el~ali~~a~l~I~nDS-----Gp~HiAaA~g~Pti~ifGpT~p 216 (249) T pfam01075 170 LLGKTSLEQAAALLAGADLVVGNDS-----GLMHLAAALDRPVIGLYGPTDP 216 (249) T ss_pred CCCCCCHHHHHHHHHHCCEEEECCC-----HHHHHHHHCCCCEEEEECCCCH T ss_conf 2699999999999985066885798-----5999999839988999788987 No 112 >pfam10933 DUF2827 Protein of unknown function (DUF2827). This is a family of uncharacterized proteins found in Burkholderia. Probab=96.12 E-value=0.12 Score=30.29 Aligned_cols=217 Identities=20% Similarity=0.189 Sum_probs=117.1 Q ss_pred HHCCEEEECCHHH---HHH----CCCCEEEECCCCCHHHHCCCCC-HHHHHHHCCCCCC---CCEEEEEEEECHHHHH-- Q ss_conf 6266899889899---974----3688399748827778237843-2687887288988---7279995122644512-- Q gi|255764474|r 115 SRMDEVITTSQKS---ARF----IERPSTVIMHGVDTERFRPTSN-KQEARRHLKISED---AKLIGCFGRIRKLKGT-- 181 (352) Q Consensus 115 ~~~d~ii~~s~~~---~~~----~~~~~~vi~~gid~~~~~~~~~-~~~~~~~~~~~~~---~~~i~~~G~~~~~Kg~-- 181 (352) .+.|.|-..-+.. +.+ .+.|+.++|+==|+..+...-. ....-..++..++ .++-.+=-+++-.|-- T Consensus 128 ~~~DeIW~~Pq~~~~~~~~l~t~~r~pv~~vP~IW~p~fl~~~~~~l~~~g~~fgy~pg~~~~riaifEPNi~~vK~~~i 207 (364) T pfam10933 128 TPYDEVWTIPQFERTCAPYLQTLRRAPVTVVPHIWDPMFIERRTAALPEAGLRFGYRPGRPAWRVAVFEPNICVVKTCVI 207 (364) T ss_pred CCCCEEEECCCHHHHCCHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCEEEHHCCC T ss_conf 98651784500453325798898629805636114789999998753533766673678875069997687402012242 Q ss_pred -HHHHHHHHHHHCCCCCE-EEEEECCCC-CCCCCHHHHHHHHHHHCCC--CCCCCCCCCCCCCCCCCH-HHHHHCCCCCC Q ss_conf -36665456530047740-588631333-2100014667764431123--322122222233100000-00000012333 Q gi|255764474|r 182 -DLFVDCMINILPHHPGW-TAVVVGKTT-LKHYLFKKNLQRRIYANGL--KKRILFIDEQSSIEDWYR-ALNIFVAPPLY 255 (352) Q Consensus 182 -~~li~a~~~l~~~~~~~-~l~i~G~g~-~~~~~~~~~l~~~i~~~~l--~~~V~~~g~~~~~~~~~~-~adi~i~pS~~ 255 (352) -++++ ..-++.|+. ..+.+.++. .++. +..+..+..+.+ .+++.|.|..+. ..+++ .+|+.++--+. T Consensus 208 P~lI~e---~~yR~~p~~i~~v~v~N~~~l~~~---~~F~~~~~~ldlv~~gk~~f~~R~~~-~~~l~~~~d~vvshq~~ 280 (364) T pfam10933 208 PMLVAD---EAYRAQPDAVEFVHVTNTLHLKEH---PEFVHFANSLDLVRHGKATFEGRFDT-PEFLAQHTDAVVSHQWE 280 (364) T ss_pred HHHHHH---HHHHCCHHHHHEEEEECHHHHHCC---HHHHHHHHHHHHHHCCCEEEEEECCH-HHHHHHCCCEEEECCCC T ss_conf 499999---998718545004788244765028---89999997646775484389742373-88897579779982424 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHCCH Q ss_conf 222222100000100111027884101002348842499659998999999999986-9899999999999999982998 Q gi|255764474|r 256 EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTGNRGRERAVKHFSI 334 (352) Q Consensus 256 Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~~a~~~~~~~fs~ 334 (352) -+.--..+|++..|-|+|-.. +-++ ..|+..+..|.++=+++|.+.+. +.+...+..+++++. ..+||+ T Consensus 281 n~lNYlyld~ly~gyPliHNs-~~~~--------d~GYYY~~~Di~~ga~~L~~a~~~Hd~~~~~Y~~~a~~~-l~~~~~ 350 (364) T pfam10933 281 NAQNYLYYDVLYGGYPLVHNS-PLLG--------DAGYYYPDFDIQAGARALLRAFRHHDADLDDYRARARSF-LASVSI 350 (364) T ss_pred CCCCHHHHHHHHCCCCCCCCC-HHHC--------CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHCC T ss_conf 643016898987287620274-1331--------176156865899999999999998764399999999999-987389 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999886 Q gi|255764474|r 335 VKEASDIGKVYDRLLR 350 (352) Q Consensus 335 ~~~a~~~~~iY~~~l~ 350 (352) .+ ....+.|.++|. T Consensus 351 ~n--p~nv~~Y~~ll~ 364 (364) T pfam10933 351 YN--PANVAAYDALLA 364 (364) T ss_pred CC--HHHHHHHHHHHC T ss_conf 98--789999999749 No 113 >COG1817 Uncharacterized protein conserved in archaea [Function unknown] Probab=94.72 E-value=0.34 Score=27.14 Aligned_cols=259 Identities=14% Similarity=0.165 Sum_probs=123.1 Q ss_pred HHHHHHHHHHCCEEEEEECCCC---------------------CCCC-C--CCHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 8999999971985999805886---------------------6562-3--47899899887543259708996285468 Q gi|255764474|r 25 FGLCPIQRKLGQRLVVFGYCLP---------------------KNIP-S--IGISSLLTCWKKPIGQNSRIWHARRNNEM 80 (352) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~---------------------~~~~-~--~~~~~~~~~~~~~~~~~~~ivh~~~~~~~ 80 (352) -++...+.+.|+++.+...... ..+. . ........+.+.....++|+.-...+... T Consensus 17 k~lI~elekkG~ev~iT~rd~~~v~~LLd~ygf~~~~Igk~g~~tl~~Kl~~~~eR~~~L~ki~~~~kpdv~i~~~s~~l 96 (346) T COG1817 17 KNLIWELEKKGHEVLITCRDFGVVTELLDLYGFPYKSIGKHGGVTLKEKLLESAERVYKLSKIIAEFKPDVAIGKHSPEL 96 (346) T ss_pred HHHHHHHHHCCEEEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCH T ss_conf 99999998578499999851275889999839970764045774478999999999999999875229855752278105 Q ss_pred HHHHHHHHHHCCCEEEEE-CCHHHHHHHHHHHHHHHHCCEEEECCHHHH----HHCCCCEEEE-CCCC----CHHHHCCC Q ss_conf 999999997099819997-574562011347989962668998898999----7436883997-4882----77782378 Q gi|255764474|r 81 LLGVMMRDVLRMPLKLVF-TSPSQRNHSRWTRYLISRMDEVITTSQKSA----RFIERPSTVI-MHGV----DTERFRPT 150 (352) Q Consensus 81 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~----~~~~~~~~vi-~~gi----d~~~~~~~ 150 (352) -.....++.|..+.. +-|+... .+.....++.++..+.... ..-..|...+ .||+ +...|.| T Consensus 97 ---~rvafgLg~psIi~~D~ehA~~q----nkl~~Pla~~ii~P~~~~~~~~~~~G~~p~~i~~~~giae~~~v~~f~p- 168 (346) T COG1817 97 ---PRVAFGLGIPSIIFVDNEHAEAQ----NKLTLPLADVIITPEAIDEEELLDFGADPNKISGYNGIAELANVYGFVP- 168 (346) T ss_pred ---HHHHHHCCCCEEEECCCHHHHHH----HHCCHHHHHHEECCCCCCHHHHHHHCCCCCCEECCCCEEEEEECCCCCC- T ss_conf ---56776528863896487547778----6300024421506434435778870889552113566267731026798- Q ss_pred CCHHHHHHHCCCCCCCCEEEEEEEECH--------HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 432687887288988727999512264--------451236665456530047740588631333210001466776443 Q gi|255764474|r 151 SNKQEARRHLKISEDAKLIGCFGRIRK--------LKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIY 222 (352) Q Consensus 151 ~~~~~~~~~~~~~~~~~~i~~~G~~~~--------~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~ 222 (352) .++..+++|+..+..+|++ |..+ .++.+.+-+.+.++ ++. -.+++-. + ++.++..+ T Consensus 169 --d~evlkeLgl~~~~~yIVm--Rpe~~~A~y~~g~~~~~~~~~li~~l-~k~---giV~ipr----~----~~~~eife 232 (346) T COG1817 169 --DPEVLKELGLEEGETYIVM--RPEPWGAHYDNGDRGISVLPDLIKEL-KKY---GIVLIPR----E----KEQAEIFE 232 (346) T ss_pred --CHHHHHHCCCCCCCCEEEE--EECCCCCEEECCCCCHHHHHHHHHHH-HHC---CEEEECC----C----HHHHHHHH T ss_conf --8789987588879866999--64344542343322256688899988-757---2899557----5----56899874 Q ss_pred HCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHH Q ss_conf 11233221222222331000000000001233322222210000010011102788410100234884249965999899 Q gi|255764474|r 223 ANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHA 302 (352) Q Consensus 223 ~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~ 302 (352) +... +......-|..+++--|++++- +-|.-.-||...|+|+|++.-| -.--++..--+.|.+....|+.+ T Consensus 233 --~~~n-~i~pk~~vD~l~Llyya~lvig-----~ggTMarEaAlLGtpaIs~~pG-kll~vdk~lie~G~~~~s~~~~~ 303 (346) T COG1817 233 --GYRN-IIIPKKAVDTLSLLYYATLVIG-----AGGTMAREAALLGTPAISCYPG-KLLAVDKYLIEKGLLYHSTDEIA 303 (346) T ss_pred --HHCC-CCCCCCCCCHHHHHHHHHEEEC-----CCCHHHHHHHHHCCCEEEECCC-CCCCCCHHHHHCCCEEECCCHHH T ss_conf --1011-0588555227878865415641-----7703788888728834785388-53322389886684343178889 Q ss_pred HHHHHHHHHCCHHH Q ss_conf 99999999869899 Q gi|255764474|r 303 LEKAVLYFMNSKKI 316 (352) Q Consensus 303 la~~i~~l~~~~~~ 316 (352) ..+...+++.++.. T Consensus 304 ~~~~a~~~l~~~~~ 317 (346) T COG1817 304 IVEYAVRNLKYRRL 317 (346) T ss_pred HHHHHHHHHHCHHH T ss_conf 99999998425000 No 114 >TIGR02919 TIGR02919 conserved hypothetical protein TIGR02919; InterPro: IPR014268 Members of this entry are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus.. Probab=93.78 E-value=0.097 Score=30.83 Aligned_cols=130 Identities=12% Similarity=0.199 Sum_probs=90.3 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCHHHHHHCCCCCCCCCCCCC Q ss_conf 654565300477405886313332100014667764431123322122222--233100000000000123332222221 Q gi|255764474|r 185 VDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE--QSSIEDWYRALNIFVAPPLYEGFGLTP 262 (352) Q Consensus 185 i~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~--~~~~~~~~~~adi~i~pS~~Eg~gl~~ 262 (352) |+.+..+-..+||++|.|+.--.+ ....=.++-.++|.+=+. ...+..+|+.||+++=--+.-=-.-++ T Consensus 299 I~~le~iV~~~pd~~FhIAA~TEM---------S~KLm~l~~Y~NV~LYP~i~~~~i~~Lyq~cDIYLDIN~GNEil~Av 369 (448) T TIGR02919 299 IEHLEEIVQALPDYEFHIAALTEM---------SSKLMSLSKYDNVKLYPNISRQKIQELYQLCDIYLDINHGNEILNAV 369 (448) T ss_pred HHHHHHHHHHCCCCCEEEEEECCC---------CHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCEEEECCCCCCHHHHH T ss_conf 357999998578970465435013---------17761578835524376777778988873142344123457378899 Q ss_pred CCCCCCEEEEEECC-CCCCCHHCCCCCCCEEEEECCCC-----HHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 00000100111027-88410100234884249965999-----89999999999869899999999999999 Q gi|255764474|r 263 LEAMASGIPVVASN-TGVFSELLDPENAKAGVIVPPRN-----LHALEKAVLYFMNSKKIMSDTGNRGRERA 328 (352) Q Consensus 263 lEAma~G~PvI~s~-~~~~~e~i~~~~g~~G~~~~~~d-----~~~la~~i~~l~~~~~~~~~~~~~a~~~~ 328 (352) -.|+-.-.-+++-+ +.=-+++|..++ +++..| .+.|++.|..+++|+....+.=++.++++ T Consensus 370 R~AF~ynlLIlgf~eT~Hn~~~i~~~N-----~F~~~~eitme~~~Li~~l~~l~~~~~~~~~~L~~Q~~~A 436 (448) T TIGR02919 370 RRAFEYNLLILGFEETAHNRDFIASEN-----IFEHNEEITMEVEQLISILKELLNDPNQFRELLEQQREHA 436 (448) T ss_pred HHHHHHHHHHHHHHHCCCCCEEECCCC-----CCCCCCCEEEHHHHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 887564477777762135532206012-----7456541221189999999997318667999999999864 No 115 >pfam07429 Fuc4NAc_transf 4-alpha-L-fucosyltransferase (Fuc4NAc transferase). This family contains the bacterial enzyme 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) (EC 2.4.1.-) (approximately 360 residues long). This catalyses the synthesis of Fuc4NAc-ManNAcA-GlcNAc-PP-Und (lipid III) as part of the biosynthetic pathway of enterobacterial common antigen (ECA), a polysaccharide comprised of the trisaccharide repeat unit Fuc4NAc-ManNAcA-GlcNAc. Probab=93.17 E-value=0.66 Score=25.21 Aligned_cols=217 Identities=12% Similarity=0.120 Sum_probs=109.8 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEE-EEECCHHHH--------HHHHHHHHHHHHCCEEEECCHH---H-HHHCCCC Q ss_conf 9708996285468999999997099819-997574562--------0113479899626689988989---9-9743688 Q gi|255764474|r 68 NSRIWHARRNNEMLLGVMMRDVLRMPLK-LVFTSPSQR--------NHSRWTRYLISRMDEVITTSQK---S-ARFIERP 134 (352) Q Consensus 68 ~~~ivh~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--------~~~~~~~~l~~~~d~ii~~s~~---~-~~~~~~~ 134 (352) .--++|..++...+.+++...+....+- .++-...+. .+....|...+++.+|+++.-. . ..+...+ T Consensus 80 ~rFf~HGQFN~~LWLALl~Gki~~~q~~WhIWGADLYe~s~~lKfRLfy~lRRlAQ~rvghVfatrGDl~~~~q~~p~v~ 159 (361) T pfam07429 80 ARFFLHGQFNPWLWLALLSGKIKPAQFSWHIWGADLYEDSRSLKFRLFYPLRRLAQGRVGHVFATRGDLAFFAQRHPRVP 159 (361) T ss_pred CEEEEECCCCHHHHHHHHHCCCCHHCEEEEEECHHHHHCCCCHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHCCCCC T ss_conf 53999556888999999809755020699998726631231115888899999987440259996514999998668998 Q ss_pred EEEEC--CCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEE-CHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 39974--882777823784326878872889887279995122-644512366654565300477405886313332100 Q gi|255764474|r 135 STVIM--HGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRI-RKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHY 211 (352) Q Consensus 135 ~~vi~--~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~-~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~ 211 (352) ..+.+ +-+|+. +...+. .. ..+.+.-.-+|+- ++--++...++++.+-. ..+++.++-=+-|..++ T Consensus 160 ~~lLYFPTrM~~~-l~~~~~-------~~-~~~~~ltIlvGNSgD~sN~HieaL~~i~~~f--g~~vkiivPmgYPannq 228 (361) T pfam07429 160 GSLLYFPTRMDPA-LNIAPP-------DK-PRAGKLTILVGNSGDRSNRHIAALKAIHQQF--GDNVRIIVPMGYPANNQ 228 (361) T ss_pred CCEEEECCCCCHH-HHCCCC-------CC-CCCCCEEEEEECCCCCCHHHHHHHHHHHHHH--CCCEEEEEECCCCCCCH T ss_conf 6178826767645-404567-------77-7889538997378884404999999999973--79848999788678769 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCC---CCCCCCCCHHHHHHCCC-CCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCC Q ss_conf 014667764431123322122222---23310000000000012-33322222210000010011102-78841010023 Q gi|255764474|r 212 LFKKNLQRRIYANGLKKRILFIDE---QSSIEDWYRALNIFVAP-PLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDP 286 (352) Q Consensus 212 ~~~~~l~~~i~~~~l~~~V~~~g~---~~~~~~~~~~adi~i~p-S~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~ 286 (352) .|.++.++...++=-.+++..+.+ -++...+++.||+..+. -+.-|.|..++ =+..|+|++.+ ..+-..++.. T Consensus 229 ~Yi~~V~~~g~~lF~~~~~~iL~e~m~Fd~YL~lL~~cDlgyF~f~RQQgiGtlcl-Liq~GiP~vlsr~NpFwqDl~e- 306 (361) T pfam07429 229 AYIEQVRQAGLALFPAENLQILTEKLAFDDYLALLRQCDLGYFIFNRQQGIGTLCL-LIQAGVPCVLSRENPFWQDMQE- 306 (361) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHCCHHHHHHHHHHCCEEEEEECCHHHHHHHHH-HHHCCCCEEEECCCCHHHHHHH- T ss_conf 99999999999764842111665318999999999748877885300234609999-9847993899379955664887- Q ss_pred CCCCEEEEECCCC Q ss_conf 4884249965999 Q gi|255764474|r 287 ENAKAGVIVPPRN 299 (352) Q Consensus 287 ~~g~~G~~~~~~d 299 (352) ..--+++..++ T Consensus 307 --q~vPvlf~~d~ 317 (361) T pfam07429 307 --QQVPVLFTGDD 317 (361) T ss_pred --CCCCEEECCCC T ss_conf --28977754664 No 116 >PRK10422 lipopolysaccharide core biosynthesis protein; Provisional Probab=90.87 E-value=1.2 Score=23.45 Aligned_cols=250 Identities=10% Similarity=0.099 Sum_probs=105.5 Q ss_pred CHHHEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCE--EEEEECC--------CCC--CCCC---------CCHHHHHHH Q ss_conf 804458970787878770778888999999971985--9998058--------866--5623---------478998998 Q gi|255764474|r 2 DMNNIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQR--LVVFGYC--------LPK--NIPS---------IGISSLLTC 60 (352) Q Consensus 2 ~~~~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~--~~~~~~~--------~~~--~~~~---------~~~~~~~~~ 60 (352) +.++|.+|-++. . |=.....-+.+.+++.... +.+.... .|. .+.. .....++.+ T Consensus 4 ~~kkILIir~~~---i-GD~il~tP~i~~Lk~~~P~a~I~~l~~~~~~~ll~~~P~id~i~~~~~k~~~~~~~~~~~~~l 79 (352) T PRK10422 4 PFRRILIIKMRF---H-GDMLLTTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGIKNKKAGASEKIKNFFSL 79 (352) T ss_pred CCCEEEEECCCC---H-HHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 997799975886---0-499999999999999889988999978047999833999627988667554456779999999 Q ss_pred HHHHHHCCCEEEE-ECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCCCCEEEEC Q ss_conf 8754325970899-628546899999999709981999757456201134798996266899889899974368839974 Q gi|255764474|r 61 WKKPIGQNSRIWH-ARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIERPSTVIM 139 (352) Q Consensus 61 ~~~~~~~~~~ivh-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~~~~~vi~ 139 (352) ++..+..++|++- .+... ..+ ++..+.+.+.++.+.... +....|..++..... .. ..+...+++.- +-+ T Consensus 80 ~~~Lr~~~yD~vi~l~~~~--~~~-~l~~~~~~~~~ig~~~~~-~~~~~~~~~~~~~~~-~~-~~h~v~~~l~l---l~~ 150 (352) T PRK10422 80 IKVLRANKYDLIVNLTDQW--MVA-LLVRLLNARVKISQDYHH-RQSAFWRKSFTHLVP-LQ-GGHVVESNLSV---LTP 150 (352) T ss_pred HHHHHHCCCCEEEECCCCH--HHH-HHHHHHCCCEEECCCCCC-CHHHHHHHHHCCCCC-CC-CHHHHHHHHHH---HHH T ss_conf 9998554887788667664--999-999983898586566652-101455655314688-75-41599999999---864 Q ss_pred CCCCHHHHC-----CCCCHHHH---HHHCCCCCCCCEEEEEEEECHHHHH--HHHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 882777823-----78432687---8872889887279995122644512--3666545653004774058863133321 Q gi|255764474|r 140 HGVDTERFR-----PTSNKQEA---RRHLKISEDAKLIGCFGRIRKLKGT--DLFVDCMINILPHHPGWTAVVVGKTTLK 209 (352) Q Consensus 140 ~gid~~~~~-----~~~~~~~~---~~~~~~~~~~~~i~~~G~~~~~Kg~--~~li~a~~~l~~~~~~~~l~i~G~g~~~ 209 (352) .|++..... +..+.... ....++. ...++++.|.-.+.|.. +...+.+..+.++ ++..++.|.+..+ T Consensus 151 l~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~-~~~ivi~pga~~~~K~Wp~e~~a~l~~~L~~~--g~~vvl~ggp~~~ 227 (352) T PRK10422 151 LGLSSLVKETTMSYRPESWKRMRRQLDHLGVT-QNYVVIQPTARQIFKCWDNDKFSAVIDALQAR--GYEVVLTSGPAKD 227 (352) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC-CCEEEEECCCCCCCCCCCHHHHHHHHHHHHHC--CCEEEEECCCCHH T ss_conf 69986675556678878999999876744888-98799967899856779999999999999847--9919997289889 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 000146677644311233221222222--331000000000001233322222210000010011102 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS 275 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s 275 (352) +....+++..... -..-+.+.|.. .++..+++.||++|... +-.+-=|.|.|+|+|+- T Consensus 228 e~~~~~~i~~~~~---~~~v~~l~G~tsL~el~ali~~a~l~I~nD-----SGpmHlAaAlg~P~Val 287 (352) T PRK10422 228 DLACVNEIAQGCQ---TPPVTALAGKTTFPELGALIDHAQLFIGVD-----SAPAHIAAAVNTPLICL 287 (352) T ss_pred HHHHHHHHHHHCC---CCCCEECCCCCCHHHHHHHHHHCCEEEECC-----CHHHHHHHHCCCCEEEE T ss_conf 9999999997467---987042357888999999998178756059-----81899999829998999 No 117 >PRK10964 ADP-heptose:LPS heptosyl transferase I; Provisional Probab=89.92 E-value=1.5 Score=22.92 Aligned_cols=141 Identities=9% Similarity=-0.025 Sum_probs=71.2 Q ss_pred HHHCCCCCCCCEEEEEEEECHHHHHH--HHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 88728898872799951226445123--6665456530047740588631333210001466776443112332212222 Q gi|255764474|r 157 RRHLKISEDAKLIGCFGRIRKLKGTD--LFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID 234 (352) Q Consensus 157 ~~~~~~~~~~~~i~~~G~~~~~Kg~~--~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g 234 (352) ...........+++..|.-.+.|... ...+....+.++ +++.++.+.++ .+....+++.. + .+.+...| T Consensus 170 ~~~~~~~~~~~vv~~~~~s~~~K~Wp~e~f~~La~~L~~~--g~~v~l~~G~~-~e~~~~~~i~~-----~-~~~v~~~g 240 (322) T PRK10964 170 LTNLPADAGPYLVFLHATTRDDKHWPEAHWRELIGLLADS--GLRIKLPWGAP-HEEARAKRLAE-----G-FDYVEVLP 240 (322) T ss_pred HHHCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHC--CCEEEEECCCH-HHHHHHHHHHH-----C-CCCCEECC T ss_conf 7412125698499973787412589989999999999967--99799947898-99999999980-----6-99612458 Q ss_pred CC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEEC-------CCCHHHHHH Q ss_conf 22--331000000000001233322222210000010011102788410100234884249965-------999899999 Q gi|255764474|r 235 EQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVP-------PRNLHALEK 305 (352) Q Consensus 235 ~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~-------~~d~~~la~ 305 (352) .. .++..+++.||++|..- +-.+-=|.|+|+|+|+=-.+.-|+...+ .|.+-..+. .=+++..-+ T Consensus 241 ~~sL~elaall~~a~l~I~nD-----SG~mHlAaAlg~P~v~LFGpT~P~~~gP-~g~~~~~~~~~~~~~~~~~~~~v~~ 314 (322) T PRK10964 241 KMSLEEVARVLAGAKAVVSVD-----TGLSHLTAALDRPNITLYGPTDPGLIGG-YGKNQHACRSEGKSMANLSAETVFQ 314 (322) T ss_pred CCCHHHHHHHHHHCCEEEECC-----CHHHHHHHHCCCCEEEEECCCCCCCCCC-CCCCCEEEECCCCCCCCCCHHHHHH T ss_conf 999999999997099999669-----7599999983999899988899403078-8888248968998702199999999 Q ss_pred HHHHHHC Q ss_conf 9999986 Q gi|255764474|r 306 AVLYFMN 312 (352) Q Consensus 306 ~i~~l~~ 312 (352) +++++++ T Consensus 315 ~~~~~~~ 321 (322) T PRK10964 315 KLETLIS 321 (322) T ss_pred HHHHHHC T ss_conf 9999745 No 118 >PRK10916 ADP-heptose:LPS heptosyltransferase II; Provisional Probab=89.77 E-value=1.5 Score=22.84 Aligned_cols=115 Identities=15% Similarity=0.108 Sum_probs=62.0 Q ss_pred HHHHHHHCCCCCCCCEEEEE-EE-ECHHHHHH--HHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 26878872889887279995-12-26445123--6665456530047740588631333210001466776443112332 Q gi|255764474|r 153 KQEARRHLKISEDAKLIGCF-GR-IRKLKGTD--LFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK 228 (352) Q Consensus 153 ~~~~~~~~~~~~~~~~i~~~-G~-~~~~Kg~~--~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~ 228 (352) .......+++..++++|++. |. +.+.|... .+.+....+.++ +++.++.|...+ ....+++...+....... T Consensus 167 ~~~~~~~~~~~~~~~~i~i~pGa~~~~~KrWp~e~fa~la~~L~~~--g~~vvl~G~~~e--~~~~~~i~~~l~~~~~~~ 242 (348) T PRK10916 167 KSYTCNQFSLSSERPIIGFCPGAEFGPAKRWPHYHYAELAKQLIDE--GYQVVLFGSAKD--HEAGNEILAALNTEQQAW 242 (348) T ss_pred HHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHC--CCEEEEEECHHH--HHHHHHHHHHHHHHCCCC T ss_conf 9999986487779977998168766656779889999999999968--997999817236--999999998510331565 Q ss_pred CCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 21222222--3310000000000012333222222100000100111027 Q gi|255764474|r 229 RILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 229 ~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~ 276 (352) -+-+.|.. .++..+++.||++|.. ++- .+-=|.|.|+|+|+=- T Consensus 243 ~~nl~GktsL~el~ali~~a~l~I~n---DSG--pmHlAaA~g~P~valF 287 (348) T PRK10916 243 CRNLAGETQLEQAVILIAACKAIVTN---DSG--LMHVAAALNRPLVALY 287 (348) T ss_pred EECCCCCCCHHHHHHHHHHCCEEEEC---CCH--HHHHHHHCCCCEEEEE T ss_conf 14167889999999999855987844---882--8999998099889998 No 119 >pfam03016 Exostosin Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the most common known skeletal dysplasia. The chromosomal locations of other EXT genes suggest association with other forms of neoplasia. EXT1 and EXT2 have both been shown to encode a heparan sulphate polymerase with both D-glucuronyl (GlcA) and N-acetyl-D-glucosaminoglycan (GlcNAC) transferase activities. The nature of the defect in heparan sulphate biosynthesis in HME is unclear. Probab=89.07 E-value=0.15 Score=29.47 Aligned_cols=68 Identities=22% Similarity=0.364 Sum_probs=42.7 Q ss_pred CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEE-EEEECCCC--CCCHHCCCCCCCEEEEECCCCHHHHHHHH Q ss_conf 310000000000012333222222100000100-11102788--41010023488424996599989999999 Q gi|255764474|r 238 SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGI-PVVASNTG--VFSELLDPENAKAGVIVPPRNLHALEKAV 307 (352) Q Consensus 238 ~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~-PvI~s~~~--~~~e~i~~~~g~~G~~~~~~d~~~la~~i 307 (352) +..+.|+.+..-++|.-....+..+.|||.+|+ |||.+|-= ...++++ ...-++.++..|...+.+.| T Consensus 216 ~y~~~~~~s~Fcl~p~G~~~~~~r~~~a~~~gciPV~~~d~~~lpf~~~~d--~~~~sv~i~~~~~~~l~~~L 286 (289) T pfam03016 216 DYMELLRSSKFCLVPPGDTPTSPRLFDALRAGCIPVIISDGWELPFEDVID--WSKFSVRVPENDIPELKEIL 286 (289) T ss_pred HHHHHHHCCEEEEECCCCCCCCCCHHHHHHCCCEEEEEECCCCCCCCCCCC--HHHEEEEECHHHHHHHHHHH T ss_conf 799987637688867989867661999997098679975782478645477--65789996789977999998 No 120 >KOG1021 consensus Probab=88.67 E-value=1.8 Score=22.33 Aligned_cols=94 Identities=17% Similarity=0.226 Sum_probs=56.8 Q ss_pred CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEE-EEEECCC--CCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHH-C- Q ss_conf 310000000000012333222222100000100-1110278--8410100234884249965999899999999998-6- Q gi|255764474|r 238 SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGI-PVVASNT--GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFM-N- 312 (352) Q Consensus 238 ~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~-PvI~s~~--~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~-~- 312 (352) ...+.|+.+-.-++|+-++...-.+.||+..|| |||.+|. ....++++ ...-+..++. +++-+.+.+++ + T Consensus 336 ~y~~~m~~S~FCL~p~Gd~~ts~R~fdai~~gCvPViisd~~~lpf~~~~d--~~~fSV~v~~---~~v~~~~~~iL~~i 410 (464) T KOG1021 336 NYMEGMQDSKFCLCPPGDTPTSPRLFDAIVSGCVPVIISDGIQLPFGDVLD--WTEFSVFVPE---KDVPELIKNILLSI 410 (464) T ss_pred HHHHHHHHCEEEECCCCCCCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCC--CEEEEEEEEH---HHHHHHHHHHHHHC T ss_conf 799866627587668999865440899987178667970785336567554--2068999658---97327898886525 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 989999999999999998299899 Q gi|255764474|r 313 SKKIMSDTGNRGRERAVKHFSIVK 336 (352) Q Consensus 313 ~~~~~~~~~~~a~~~~~~~fs~~~ 336 (352) -.+...+|.++-...+.+.|.+.. T Consensus 411 ~~~~~~~m~~~v~~~v~r~~~~~~ 434 (464) T KOG1021 411 PEEEVLRMRENVIRLVPRHFLKKP 434 (464) T ss_pred CHHHHHHHHHHHHHHHHHHEECCC T ss_conf 699999999999999766367068 No 121 >pfam05686 DUF821 Arabidopsis thaliana protein of unknown function (DUF821). This family consists of a group of Arabidopsis thaliana proteins with no known function. Probab=84.14 E-value=3 Score=20.77 Aligned_cols=53 Identities=15% Similarity=0.208 Sum_probs=43.3 Q ss_pred CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHH---HHHHHHHHHHH Q ss_conf 998999999999986989999999999999998299899999---99999999886 Q gi|255764474|r 298 RNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEAS---DIGKVYDRLLR 350 (352) Q Consensus 298 ~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~---~~~~iY~~~l~ 350 (352) ++-+++..++...-+++++.++++++|++++.++...+.+-. .++.-|.++|+ T Consensus 269 ~~~~dl~~~v~W~~~hd~~Aq~Ia~~g~~Fi~e~L~m~~VycY~~~LL~eYaKLlk 324 (396) T pfam05686 269 DDCRDLKEAVDWGNAHDDKAQEIAERGSQFIRENLKMDDVYDYMFHLLTEYAKLLK 324 (396) T ss_pred CCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHC T ss_conf 52778999999988688999999999999999852702789999999999998735 No 122 >KOG1192 consensus Probab=83.05 E-value=3.3 Score=20.48 Aligned_cols=64 Identities=19% Similarity=0.257 Sum_probs=36.9 Q ss_pred CCCCCCCCEEEEEECCC----CCCCHHCCCCCCCEEEEECCC-CHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 21000001001110278----841010023488424996599-989999999999869899999999999 Q gi|255764474|r 261 TPLEAMASGIPVVASNT----GVFSELLDPENAKAGVIVPPR-NLHALEKAVLYFMNSKKIMSDTGNRGR 325 (352) Q Consensus 261 ~~lEAma~G~PvI~s~~----~~~~e~i~~~~g~~G~~~~~~-d~~~la~~i~~l~~~~~~~~~~~~~a~ 325 (352) +++|+..+|+|+|+.+. ......+. +.|..+.+...+ ....+.+++.+++.+++......+-.. T Consensus 366 St~E~~~~GvP~v~~Plf~DQ~~Na~~i~-~~g~~~v~~~~~~~~~~~~~~~~~il~~~~y~~~~~~l~~ 434 (496) T KOG1192 366 STLESIYSGVPMVCVPLFGDQPLNARLLV-RHGGGGVLDKRDLVSEELLEAIKEILENEEYKEAAKRLSE 434 (496) T ss_pred HHHHHHHCCCCEECCCCCCCCHHHHHHHH-HCCCEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 89999865998434786335388999999-7275489962123312567777765076999999999999 No 123 >KOG0832 consensus Probab=82.54 E-value=3.5 Score=20.35 Aligned_cols=140 Identities=19% Similarity=0.203 Sum_probs=71.9 Q ss_pred HHCCEEEECCHHHHHHCCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCC Q ss_conf 62668998898999743688399748827778237843268788728898872799951226445123666545653004 Q gi|255764474|r 115 SRMDEVITTSQKSARFIERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPH 194 (352) Q Consensus 115 ~~~d~ii~~s~~~~~~~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~ 194 (352) .+....|..++.++.|+.....++.+ +......|+|+|+= .+...+++..++=... T Consensus 78 ~R~Gi~IIdLdqT~~~Lr~A~~fVa~---------------------vA~r~GiILFv~tn---~~~~~~ve~aA~r~~g 133 (251) T KOG0832 78 KRLGIHIIDLDQTASYLRRALNFVAH---------------------VAHRGGIILFVGTN---NGFKDLVERAARRAGG 133 (251) T ss_pred CCCCCEEEECHHHHHHHHHHHHHHHH---------------------HHHCCCEEEEEECC---CCHHHHHHHHHHHHCC T ss_conf 32672798428899999999999998---------------------98459849999458---4267899999998467 Q ss_pred CCCEEEEEECCCCCCCCCHHHHHHHHH--HHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 774058863133321000146677644--311233221222222331000000000001233322222210000010011 Q gi|255764474|r 195 HPGWTAVVVGKTTLKHYLFKKNLQRRI--YANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPV 272 (352) Q Consensus 195 ~~~~~l~i~G~g~~~~~~~~~~l~~~i--~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~Pv 272 (352) +.-.+.+..|--+.. .++.... +.+.+.+...|..... -|++|...-.|. ..+++||.-+++|+ T Consensus 134 y~~~~~w~~G~lTN~-----~~l~g~~~~~~~~~pd~~~f~~t~~--------~D~vvvln~~e~-~sAilEA~K~~IPT 199 (251) T KOG0832 134 YSHNRKWLGGLLTNA-----RELFGALVRKFLSLPDALCFLPTLT--------PDLVVVLNPEEN-HSAILEAAKMAIPT 199 (251) T ss_pred CEEEEEECCCEEECC-----HHHCCCCCCCCCCCCCCEEECCCCC--------CCEEEECCCCCC-CHHHHHHHHHCCCE T ss_conf 214201215523240-----6650642455457875202535578--------662676275443-18999999707985 Q ss_pred EE-CCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 10-2788410100234884249965999 Q gi|255764474|r 273 VA-SNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 273 I~-s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) |+ -|....++++. +-++.+| T Consensus 200 IgIVDtN~~P~liT-------YpVPaND 220 (251) T KOG0832 200 IGIVDTNCNPELIT-------YPVPAND 220 (251) T ss_pred EEEECCCCCCCCEE-------ECCCCCC T ss_conf 88866999810053-------1158888 No 124 >smart00672 CAP10 Putative lipopolysaccharide-modifying enzyme. Probab=82.18 E-value=3.6 Score=20.26 Aligned_cols=82 Identities=21% Similarity=0.237 Sum_probs=51.6 Q ss_pred CCCCEEEEEECCCCCCCHHCCCC--CCCEEEEECC-CCH--HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 00010011102788410100234--8842499659-998--999999999986989999999999999998299899999 Q gi|255764474|r 265 AMASGIPVVASNTGVFSELLDPE--NAKAGVIVPP-RNL--HALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEAS 339 (352) Q Consensus 265 Ama~G~PvI~s~~~~~~e~i~~~--~g~~G~~~~~-~d~--~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~ 339 (352) -|+||-.|+..+.. ..|..... ... .+++- .|. ++|.+++..+.++++..++++++|+++++++.+.+.+.. T Consensus 164 ll~~~Svvl~~~~~-~~e~f~~~L~P~~--HYvPv~~d~sd~~l~~~i~w~~~~d~~a~~Ia~~~~~f~~~~L~~~~~~~ 240 (256) T smart00672 164 ILACDSVVLKVKPE-YYEFFSRGLQPWV--HYWPIKSDLSCRELKEAVDWGNEHDKKAQEIGKRGSEFIQQNLSMEDVYD 240 (256) T ss_pred HHHCCCEEEEECCH-HHHHHHCCCCCCC--CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 87268469962841-8999870126542--17886668756669999999860989999999999999999749789999 Q ss_pred H---HHHHHHHHH Q ss_conf 9---999999988 Q gi|255764474|r 340 D---IGKVYDRLL 349 (352) Q Consensus 340 ~---~~~iY~~~l 349 (352) = +..-|.+++ T Consensus 241 Y~~~Ll~eyakl~ 253 (256) T smart00672 241 YMFHLLQEYAKLL 253 (256) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999974 No 125 >PRK04020 rps2P 30S ribosomal protein S2; Provisional Probab=80.23 E-value=4 Score=19.97 Aligned_cols=71 Identities=23% Similarity=0.357 Sum_probs=39.4 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCC--HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 2221000001001110-2788410100234884249965999--899999999998698999999999999999829989 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRN--LHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIV 335 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d--~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~ 335 (352) -.++.||.-+|+|+|+ .|...-++.|+ +.++.+| ..++ ..+..++++.-++ ++-...+.-.|+ T Consensus 127 ~~AV~EA~~l~IPvIaivDTn~dp~~Vd-------y~IP~NDds~~SI-~Li~~lLa~ail~------~kg~~~~~~~~~ 192 (204) T PRK04020 127 AQAVKEAIEIGIPVVALCDTDNLTSNVD-------LVIPTNNKGRKAL-ALIYWLLAREILR------ERGEIKPDEDLP 192 (204) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHHCC-------EEEECCCCCHHHH-HHHHHHHHHHHHH------HCCCCCCCCCCC T ss_conf 4899999876999899827998911157-------3651688307389-9999999999999------739777666887 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|255764474|r 336 KEASDIGK 343 (352) Q Consensus 336 ~~a~~~~~ 343 (352) ...+.++. T Consensus 193 v~~~~f~~ 200 (204) T PRK04020 193 VPVEDFET 200 (204) T ss_pred CCHHHHHH T ss_conf 88889654 No 126 >pfam12000 DUF3495 Domain of unknown function (DUF3495). This presumed domain is functionally uncharacterized. This domain is found in bacteria. This domain is about 170 amino acids in length. This domain is found associated with pfam00534. This domain has a single completely conserved residue G that may be functionally important. Probab=79.90 E-value=4.3 Score=19.75 Aligned_cols=81 Identities=20% Similarity=0.218 Sum_probs=48.1 Q ss_pred HHCCCEEEEECCCHHHHHHHHHHHHH-CCCEEEE---ECCHHH--HHH---------HHHHH---------HHHHHCCEE Q ss_conf 32597089962854689999999970-9981999---757456--201---------13479---------899626689 Q gi|255764474|r 65 IGQNSRIWHARRNNEMLLGVMMRDVL-RMPLKLV---FTSPSQ--RNH---------SRWTR---------YLISRMDEV 120 (352) Q Consensus 65 ~~~~~~ivh~~~~~~~~~~~~~~~~~-~~~~~~~---~~~~~~--~~~---------~~~~~---------~l~~~~d~i 120 (352) ....||+|-+|....- +++++.++ ..++... |++... ..+ ....+ --+..+|.. T Consensus 63 ~Gf~PDvIi~H~GWGe--~LfLkdv~P~a~li~Y~E~~y~~~g~d~gFDpe~~~~~~~~~~~~r~rN~~~~~~l~~ad~~ 140 (172) T pfam12000 63 QGFRPDVIVAHPGWGE--TLFLKDVWPDARLIGYFEFYYRASGADVGFDPEFDPPDLDDRARLRLRNAVILLALEQADAG 140 (172) T ss_pred CCCCCCEEEECCCCCC--HHCHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 4999998987587640--10577776588757888778748887678585779989889999998667999999965656 Q ss_pred EECCHHHHHHC----CCCEEEECCCCCHHHH Q ss_conf 98898999743----6883997488277782 Q gi|255764474|r 121 ITTSQKSARFI----ERPSTVIMHGVDTERF 147 (352) Q Consensus 121 i~~s~~~~~~~----~~~~~vi~~gid~~~~ 147 (352) ++.|...+.-. ..++.||+-|||++.+ T Consensus 141 i~PT~wQ~~~fP~~~~~ri~VihdGiDt~~~ 171 (172) T pfam12000 141 ISPTAWQRSTFPAPLRDRITVIHDGIDTDRL 171 (172) T ss_pred CCCCHHHHHHCCHHHHCCCEEECCCCCHHHC T ss_conf 3877898875999888184782156152322 No 127 >COG4671 Predicted glycosyl transferase [General function prediction only] Probab=78.65 E-value=4.5 Score=19.60 Aligned_cols=140 Identities=16% Similarity=0.212 Sum_probs=75.0 Q ss_pred CCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEE---EEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 9887279995122644512366654565300477405---8863133321000146677644311233221222222331 Q gi|255764474|r 163 SEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWT---AVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSI 239 (352) Q Consensus 163 ~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~---l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~ 239 (352) ++...+++.+|.-. -|-++ ++++.......|++. +++.| |.-......++...+ +-.+.|....+.+++ T Consensus 217 pE~~~Ilvs~GGG~--dG~eL-i~~~l~A~~~l~~l~~~~~ivtG--P~MP~~~r~~l~~~A---~~~p~i~I~~f~~~~ 288 (400) T COG4671 217 PEGFDILVSVGGGA--DGAEL-IETALAAAQLLAGLNHKWLIVTG--PFMPEAQRQKLLASA---PKRPHISIFEFRNDF 288 (400) T ss_pred CCCCEEEEECCCCH--HHHHH-HHHHHHHHHHCCCCCCCEEEEEC--CCCCHHHHHHHHHHC---CCCCCEEEEEHHHHH T ss_conf 76333999548872--05999-99999875507788743389848--999889999998742---569972899733039 Q ss_pred CCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCH-HCCC----CCCCEEEEEC-CCCHHHHHHHHHHHHC Q ss_conf 00000000000123332222221000001001110-27884101-0023----4884249965-9998999999999986 Q gi|255764474|r 240 EDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSE-LLDP----ENAKAGVIVP-PRNLHALEKAVLYFMN 312 (352) Q Consensus 240 ~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e-~i~~----~~g~~G~~~~-~~d~~~la~~i~~l~~ 312 (352) ..++..|+..|+-+-+ +++-|-+++|+|.+. ...-+-.| .+.. +=|-...+-+ .-+++.+|++|...++ T Consensus 289 ~~ll~gA~~vVSm~GY----NTvCeILs~~k~aLivPr~~p~eEQliRA~Rl~~LGL~dvL~pe~lt~~~La~al~~~l~ 364 (400) T COG4671 289 ESLLAGARLVVSMGGY----NTVCEILSFGKPALIVPRAAPREEQLIRAQRLEELGLVDVLLPENLTPQNLADALKAALA 364 (400) T ss_pred HHHHHHHHEEEECCCC----HHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCC T ss_conf 9998764403520462----226688737995598436787389999999998669510307355896899999985426 Q ss_pred CH Q ss_conf 98 Q gi|255764474|r 313 SK 314 (352) Q Consensus 313 ~~ 314 (352) .| T Consensus 365 ~P 366 (400) T COG4671 365 RP 366 (400) T ss_pred CC T ss_conf 89 No 128 >pfam05159 Capsule_synth Capsule polysaccharide biosynthesis protein. This family includes export proteins involved in capsule polysaccharide biosynthesis, such as KpsS and LipB. Probab=78.16 E-value=1.4 Score=23.09 Aligned_cols=87 Identities=11% Similarity=0.087 Sum_probs=47.8 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCC Q ss_conf 51236665456530047740588631333210001466776443112332212222223310000000000012333222 Q gi|255764474|r 179 KGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGF 258 (352) Q Consensus 179 Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~ 258 (352) +-...+++++ ....|+..+++-=. |.. .. ........+..+.++++.+.. --++..++..++-++.- - T Consensus 140 ~~~~~vl~sf---a~~~Pda~lv~K~H-P~~-~g-~~~~~~~~~~~~~~~~~~~~~-~~~l~~Ll~~~~~Vvtv-----n 207 (268) T pfam05159 140 DFFLEVLESF---AEENPDAQIIIKPH-PDV-LG-GNKYGLITELPGLERVDVIVH-DVNLISLLRHVDEVVTI-----T 207 (268) T ss_pred HHHHHHHHHH---HHHCCCCEEEEECC-CCC-CC-CCCCCCCCHHHCCCCEEEEEC-CCCHHHHHHHCCEEEEE-----C T ss_conf 9999999999---98788983999689-875-46-776454575455688199936-99999999857999996-----5 Q ss_pred CCCCCCCCCCEEEEEECCC Q ss_conf 2221000001001110278 Q gi|255764474|r 259 GLTPLEAMASGIPVVASNT 277 (352) Q Consensus 259 gl~~lEAma~G~PvI~s~~ 277 (352) +.+-+|||..|+||+|-.. T Consensus 208 St~G~eALl~gkpV~~~G~ 226 (268) T pfam05159 208 STVGFEALLLGKPVITLGN 226 (268) T ss_pred CHHHHHHHHCCCCEEEEEE T ss_conf 5689999985997389401 No 129 >cd04300 GT1_Glycogen_Phosphorylase This is a family of oligosaccharide phosphorylases. It includes yeast and mammalian glycogen phosphorylases, plant starch/glucan phosphorylase, as well as the maltodextrin phosphorylases of bacteria. The members of this family catalyze the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The allosteric control mechanisms of yeast and mammalian members of this family are different from that of bacterial members. The members of this family belong to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=78.00 E-value=4.9 Score=19.37 Aligned_cols=125 Identities=18% Similarity=0.162 Sum_probs=77.8 Q ss_pred EEEEEECHHHHHHH-HHHHH---HHHHCCCC-----CEEEEEECCCCCCCCCHHHHHHHHHHHCC--------CCC--CC Q ss_conf 99512264451236-66545---65300477-----40588631333210001466776443112--------332--21 Q gi|255764474|r 170 GCFGRIRKLKGTDL-FVDCM---INILPHHP-----GWTAVVVGKTTLKHYLFKKNLQRRIYANG--------LKK--RI 230 (352) Q Consensus 170 ~~~G~~~~~Kg~~~-li~a~---~~l~~~~~-----~~~l~i~G~g~~~~~~~~~~l~~~i~~~~--------l~~--~V 230 (352) +-+-|+...|+..+ ++..+ .+++ ..| .-++++.|...+.. ...+.+.++|...+ +.+ +| T Consensus 534 vQvKRiHEYKRQlLN~L~ii~~Y~~ik-~~~~~~~~Prt~IFggKAAPgY-~~AK~iIklIn~va~~IN~Dp~v~~~LkV 611 (797) T cd04300 534 VQVKRIHEYKRQLLNVLHIIHLYNRIK-ENPNADIVPRTFIFGGKAAPGY-YMAKLIIKLINAVADVVNNDPDVGDKLKV 611 (797) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCCCCCCCEEEEECCCCCCCH-HHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 123454687899998999999999998-4899887874899557688655-99999999999999997278200773799 Q ss_pred CCCCC--CCCCCCCCHHHHHHCCCC--CCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEEC Q ss_conf 22222--233100000000000123--3322222210000010011102788410100234884249965 Q gi|255764474|r 231 LFIDE--QSSIEDWYRALNIFVAPP--LYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVP 296 (352) Q Consensus 231 ~~~g~--~~~~~~~~~~adi~i~pS--~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~ 296 (352) .|++. ++-.+.++.+||+--.-| -.|..|..=+-.|..|...|.|--|..-|+.+....+|-+++- T Consensus 612 VFl~NY~VSlAE~iiPAaDlSeQIStAg~EASGTgNMKfmlNGALtiGTlDGANVEI~e~VG~eN~fiFG 681 (797) T cd04300 612 VFLPNYNVSLAEKIIPAADLSEQISTAGKEASGTGNMKFMLNGALTIGTLDGANVEIAEEVGEENIFIFG 681 (797) T ss_pred EECCCCCHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHHHCCCEEECCCCCCHHHHHHHHCCHHEEECC T ss_conf 9838985578765255110000157775335788762688648635435677316798771812336457 No 130 >COG0859 RfaF ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane] Probab=76.43 E-value=5.4 Score=19.09 Aligned_cols=202 Identities=11% Similarity=0.120 Sum_probs=85.9 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHH-HH-- Q ss_conf 899899887543259708996285468999999997099819997574562011347989962668998898999-74-- Q gi|255764474|r 54 ISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSA-RF-- 130 (352) Q Consensus 54 ~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~-~~-- 130 (352) +.....+++..+..++|.+-.-.. .+...++....+.+.++-+.....+. .+....+.+. -....+... ++ T Consensus 67 ~~~~~~l~~~lr~~~yD~vidl~~--~~ksa~l~~~~~~~~r~g~~~~~~r~--~~~~~~~~~~--~~~~~~~~~~~~~~ 140 (334) T COG0859 67 LKERLALLRTLRKERYDAVIDLQG--LLKSALLALLLGIPFRIGFDKKSARE--LLLNKFYPRL--DKPEGQHVVERYLA 140 (334) T ss_pred HHHHHHHHHHHCCCCCCEEEECHH--HHHHHHHHHHHCCCCEECCCCHHHHH--HHHHHCCCCC--CCCCHHHHHHHHHH T ss_conf 899999999831468678997402--58899999973898363145077777--7776302445--65402459999999 Q ss_pred -CCCCEEEECCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEE-EECHHHHHH--HHHHHHHHHHCCCCCEEEEEECCC Q ss_conf -3688399748827778237843268788728898872799951-226445123--666545653004774058863133 Q gi|255764474|r 131 -IERPSTVIMHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFG-RIRKLKGTD--LFVDCMINILPHHPGWTAVVVGKT 206 (352) Q Consensus 131 -~~~~~~vi~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G-~~~~~Kg~~--~li~a~~~l~~~~~~~~l~i~G~g 206 (352) ....-...+. .....+.-.....+............++++.| .-...|... ...+....+.++. +..++.|.. T Consensus 141 l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~pg~s~~~~K~wp~e~~~~l~~~l~~~~--~~Vvl~g~~ 217 (334) T COG0859 141 LLEDLGLYPPP-EPQLDFPLPRPPIELAKNLAKFDRPYIVINPGASRGSAKRWPLEHYAELAELLIAKG--YQVVLFGGP 217 (334) T ss_pred HHHHHCCCCCC-CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCC--CEEEEEECH T ss_conf 98760798766-775554555478889865420379879996473466778899999999999999769--989994087 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 32100014667764431123322122222--23310000000000012333222222100000100111027 Q gi|255764474|r 207 TLKHYLFKKNLQRRIYANGLKKRILFIDE--QSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 207 ~~~~~~~~~~l~~~i~~~~l~~~V~~~g~--~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~ 276 (352) ++....+++.+.. + +-+.+.|. .+++..+...||+++.+... .+-=|.|.|+|+|+-. T Consensus 218 --~e~e~~~~i~~~~---~--~~~~l~~k~sL~e~~~li~~a~l~I~~DSg-----~~HlAaA~~~P~I~iy 277 (334) T COG0859 218 --DEEERAEEIAKGL---P--NAVILAGKTSLEELAALIAGADLVIGNDSG-----PMHLAAALGTPTIALY 277 (334) T ss_pred --HHHHHHHHHHHHC---C--CCCCCCCCCCHHHHHHHHHCCCEEECCCCH-----HHHHHHHCCCCEEEEE T ss_conf --8999999999736---7--661217999999999999669899914887-----9999987399889998 No 131 >TIGR01123 ilvE_II branched-chain amino acid aminotransferase; InterPro: IPR005786 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped into subfamilies. One of these, called class-IV, currently consists of proteins of about 270 to 415 amino-acid residues that share a few regions of sequence similarity. Surprisingly, the best conserved region does not include the lysine residue to which the pyridoxal-phosphate group is known to be attached, in ilvE, but is located some 40 residues at the C-terminus side of the PlP-lysine. Among the class IV aminotransferases are two phylogenetically separable groups of branched-chain amino acid aminotransferase (IlvE) (2.6.1.42 from EC). The last common ancestor of the two lineages appears also to have given rise to a family of D-amino acid aminotransferases (DAAT). This model represents the IlvE family less similar to the DAAT family. ; GO: 0004084 branched-chain-amino-acid transaminase activity, 0009081 branched chain family amino acid metabolic process. Probab=75.53 E-value=2.9 Score=20.86 Aligned_cols=46 Identities=20% Similarity=0.239 Sum_probs=35.2 Q ss_pred CCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCC-CHHHHHHHHHHHHCC Q ss_conf 1000001001110278841010023488424996599-989999999999869 Q gi|255764474|r 262 PLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPR-NLHALEKAVLYFMNS 313 (352) Q Consensus 262 ~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~-d~~~la~~i~~l~~~ 313 (352) +-|+.+||+..|.|+++.+.. +| .-+.+... .+-++..+|.+.+.| T Consensus 267 ~tEvfacGTAAVITPv~~~~~-----~g-~~~~~~~~~~~G~~t~~Ly~~ltd 313 (329) T TIGR01123 267 ITEVFACGTAAVITPVGEIQH-----GG-KEVVVKSESQPGEVTKALYDELTD 313 (329) T ss_pred CEEEECCCCCEEECCCCEEEE-----CC-EEEEECCCCCCCHHHHHHHHHHHH T ss_conf 101001186148857422787-----77-689962688757344999997620 No 132 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=74.02 E-value=2 Score=22.06 Aligned_cols=123 Identities=15% Similarity=0.162 Sum_probs=71.4 Q ss_pred HHHHHHHHHHHCCCCCEEE--EEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC----CHHHHHHCCCCCC Q ss_conf 3666545653004774058--863133321000146677644311233221222222331000----0000000012333 Q gi|255764474|r 182 DLFVDCMINILPHHPGWTA--VVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDW----YRALNIFVAPPLY 255 (352) Q Consensus 182 ~~li~a~~~l~~~~~~~~l--~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~----~~~adi~i~pS~~ 255 (352) +.-++.+.++ .|+++- +++..+........+++++.++..|++=-........|+... ....|++..|-.. T Consensus 146 ~q~i~lik~~---~Pnak~Igv~Y~p~E~ns~~l~eelk~~A~~~Gl~vve~~v~~~ndi~~a~~~l~g~~d~i~~p~dn 222 (322) T COG2984 146 AQQIELIKAL---LPNAKSIGVLYNPGEANSVSLVEELKKEARKAGLEVVEAAVTSVNDIPRAVQALLGKVDVIYIPTDN 222 (322) T ss_pred HHHHHHHHHH---CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHCCCCCEEEEECCH T ss_conf 9999999986---7887069999579886608999999999987798899983476320089999734787679986606 Q ss_pred ---CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHH-HHHHHHHHHHC Q ss_conf ---2222221000001001110278841010023488424996599989-99999999986 Q gi|255764474|r 256 ---EGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLH-ALEKAVLYFMN 312 (352) Q Consensus 256 ---Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~-~la~~i~~l~~ 312 (352) -++...+.+|...++|+++++.+...+ ..-..+.++..+.. +.++-..+++. T Consensus 223 ~i~s~~~~l~~~a~~~kiPli~sd~~~V~~-----Ga~aA~gvdy~~~G~qtg~~v~~ILk 278 (322) T COG2984 223 LIVSAIESLLQVANKAKIPLIASDTSSVKE-----GALAALGVDYKDLGKQTGEMVVKILK 278 (322) T ss_pred HHHHHHHHHHHHHHHHCCCEECCCHHHHHC-----CCCEEECCCHHHHHHHHHHHHHHHHC T ss_conf 778889999999887089735477788766-----86036606799988999999999976 No 133 >COG1927 Mtd Coenzyme F420-dependent N(5),N(10)-methenyltetrahydromethanopterin dehydrogenase [Energy production and conversion] Probab=73.04 E-value=6.5 Score=18.53 Aligned_cols=12 Identities=17% Similarity=0.354 Sum_probs=5.7 Q ss_pred CEEEEEECCCCC Q ss_conf 405886313332 Q gi|255764474|r 197 GWTAVVVGKTTL 208 (352) Q Consensus 197 ~~~l~i~G~g~~ 208 (352) ++..+++|+++. T Consensus 87 ~~PaiiigDaPg 98 (277) T COG1927 87 DVPAIIIGDAPG 98 (277) T ss_pred CCCEEEECCCCC T ss_conf 997799668863 No 134 >COG3660 Predicted nucleoside-diphosphate-sugar epimerase [Cell envelope biogenesis, outer membrane] Probab=72.98 E-value=6.5 Score=18.52 Aligned_cols=246 Identities=15% Similarity=0.123 Sum_probs=106.8 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECC----CCCCCCCCCHH----HHH--HHHHHHHHCCCEEEEE Q ss_conf 4589707878787707788889999999719859998058----86656234789----989--9887543259708996 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYC----LPKNIPSIGIS----SLL--TCWKKPIGQNSRIWHA 74 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~----~~~--~~~~~~~~~~~~ivh~ 74 (352) +|-+++.. -+|.+..+..++..+.+-.+.+....+. +|..+...... .++ .+.+. ...++|.|-+ T Consensus 2 ki~aisD~----RtGnt~QaiaLa~~l~r~eyttk~l~~~~l~~lP~~wl~~yp~~~~~~l~~~~~~r~-p~~~Pdl~I~ 76 (329) T COG3660 2 KIWAISDG----RTGNTHQAIALAEQLTRSEYTTKLLEYNNLAKLPNFWLAYYPIHILRELFGPRLSRK-PEQRPDLIIT 76 (329) T ss_pred CEEEEECC----CCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHCCCCHHHHHHHCCCCCCC-CCCCCCEEEE T ss_conf 34896158----776389999999986046437899512200127446651276276787636700017-5579855886 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHCC----C--CEEEECCCCCHHHHC Q ss_conf 2854689999999970998199975745620113479899626689988989997436----8--839974882777823 Q gi|255764474|r 75 RRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSARFIE----R--PSTVIMHGVDTERFR 148 (352) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~~~~----~--~~~vi~~gid~~~~~ 148 (352) -..........++..++.+..+....+-.. ++..|.+|..-+.-.+... + ++.-.+|-|...+.. T Consensus 77 aGrrta~l~~~lkk~~~~~~vVqI~~Prlp---------~~~fDlvivp~HD~~~~~s~~~~Nilpi~Gs~h~Vt~~~lA 147 (329) T COG3660 77 AGRRTAPLAFYLKKKFGGIKVVQIQDPRLP---------YNHFDLVIVPYHDWREELSDQGPNILPINGSPHNVTSQRLA 147 (329) T ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCCC---------CCCCEEEECCCHHHHHHHHCCCCCEEECCCCCCCCCHHHHH T ss_conf 152100789999986189538995079998---------53021784260246665311578455426887756577756 Q ss_pred CCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHH-----HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 784326878872889887279995122644512-----366654565300477405886313332100014667764431 Q gi|255764474|r 149 PTSNKQEARRHLKISEDAKLIGCFGRIRKLKGT-----DLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYA 223 (352) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~-----~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~ 223 (352) +..+..+... -....++-+++|.-++.-.. ..+..++.+...+. .+.+++--+ .+.. +..+..++. T Consensus 148 --a~~e~~~~~~-p~~rq~vAVlVGg~nk~f~~~~d~a~q~~~~l~k~l~~~-g~~~lisfS--RRTp---~~~~s~l~~ 218 (329) T COG3660 148 --ALREAFKHLL-PLPRQRVAVLVGGNNKAFVFQEDKAHQFASLLVKILENQ-GGSFLISFS--RRTP---DTVKSILKN 218 (329) T ss_pred --HHHHHHHHHC-CCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHC-CCEEEEEEE--CCCC---HHHHHHHHH T ss_conf --4588878637-787744999966788777667789999999999998747-851899960--6891---789999971 Q ss_pred -CCCCCCCCCCCCC--C-CCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf -1233221222222--3-310000000000012333222222100000100111027 Q gi|255764474|r 224 -NGLKKRILFIDEQ--S-SIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 224 -~~l~~~V~~~g~~--~-~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~ 276 (352) +.-..-+.|.+.- . -...++++||.++....+-. -+-||.+.|+||-+-. T Consensus 219 ~l~s~~~i~w~~~d~g~NPY~~~La~Adyii~TaDSin---M~sEAasTgkPv~~~~ 272 (329) T COG3660 219 NLNSSPGIVWNNEDTGYNPYIDMLAAADYIISTADSIN---MCSEAASTGKPVFILE 272 (329) T ss_pred CCCCCCEEEECCCCCCCCCHHHHHHHCCEEEEECCHHH---HHHHHHCCCCCEEEEE T ss_conf 35668445737987898816888852133787043012---4578760499759980 No 135 >pfam08288 PIGA PIGA (GPI anchor biosynthesis). This domain is found on phosphatidylinositol n-acetylglucosaminyltransferase proteins. These proteins are involved in GPI anchor biosynthesis and are associated with disease the paroxysmal nocturnal haemoglobinuria. Probab=72.33 E-value=6.8 Score=18.42 Aligned_cols=47 Identities=21% Similarity=0.222 Sum_probs=28.7 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHHH--HHHHHHHHCCCEEEEECCHHH Q ss_conf 9989988754325970899628546899--999999709981999757456 Q gi|255764474|r 55 SSLLTCWKKPIGQNSRIWHARRNNEMLL--GVMMRDVLRMPLKLVFTSPSQ 103 (352) Q Consensus 55 ~~~~~~~~~~~~~~~~ivh~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 103 (352) ..+..+......++++|+|.|.....+. +.+.+...+ ++.+|+.|+. T Consensus 37 ~~~Pl~R~IliRE~I~IVHgH~a~S~l~hEailhA~tmG--lktVfTDHSL 85 (90) T pfam08288 37 GTFPLFRNILIRERIDIVHGHQAFSTLAHEAILHARTMG--LKTVFTDHSL 85 (90) T ss_pred HHHHHHHHHHHHHCCEEEEEEHHHHHHHHHHHHHHHHCC--CCEEEECCCC T ss_conf 745999999987262199721777777889999998669--8178524532 No 136 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=71.29 E-value=7.1 Score=18.27 Aligned_cols=112 Identities=19% Similarity=0.132 Sum_probs=49.5 Q ss_pred CCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCH Q ss_conf 87279995122644512366654565300477405886313332100014667764431123322122222233100000 Q gi|255764474|r 165 DAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYR 244 (352) Q Consensus 165 ~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~ 244 (352) ....|+|+|. .+....+++.+++.... +-+.-..+|+- .-. ....+..++.+..-+.-.+.-.-..+..... T Consensus 55 ~~g~ILfVgt---k~~~~~~v~~~a~~~~~-~yv~~rWigG~-LTN---~~~~~~~i~~~~~~~~~k~~~~~~g~~~~~~ 126 (193) T cd01425 55 KGGKILFVGT---KPQAQRAVKKFAERTGS-FYVNGRWLGGT-LTN---WKTIRKSIKRLKKLEKEKLEKNLGGIKDMFR 126 (193) T ss_pred CCCEEEEEEC---CHHHHHHHHHHHHHCCC-EEECCEECCCC-CCC---HHHHHHHHHHHHHHHHHHHHHHCCCHHHCCC T ss_conf 5995999975---78999999999998399-45448575877-248---8888668998887542256652034010166 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCC Q ss_conf 000000123332222221000001001110-27884101002 Q gi|255764474|r 245 ALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLD 285 (352) Q Consensus 245 ~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~ 285 (352) .-|+.+...-.+ -..++.||-.+|.|+|+ .|....+..|+ T Consensus 127 ~Pdliiv~d~~~-~~~ai~Ea~~l~IPvI~i~Dtn~~p~~id 167 (193) T cd01425 127 LPDLVIVLDPRK-EHQAIREASKLGIPVIAIVDTNCDPDLID 167 (193) T ss_pred CCCEEEEECCCC-CHHHHHHHHHHCCCEEEEECCCCCCCCCC T ss_conf 999899988984-16899999861875578850899966587 No 137 >COG2204 AtoC Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms] Probab=69.55 E-value=7.7 Score=18.05 Aligned_cols=15 Identities=13% Similarity=-0.022 Sum_probs=6.5 Q ss_pred HHHHHHHCCEEEEEE Q ss_conf 999997198599980 Q gi|255764474|r 28 CPIQRKLGQRLVVFG 42 (352) Q Consensus 28 ~~~~~~~~~~~~~~~ 42 (352) ...+...|+.+..+. T Consensus 21 ~~~L~~~G~~v~~a~ 35 (464) T COG2204 21 EQALELAGYEVVTAE 35 (464) T ss_pred HHHHHHCCCEEEEEC T ss_conf 999997697489858 No 138 >pfam11997 DUF3492 Domain of unknown function (DUF3492). This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes. This domain is typically between 259 to 282 amino acids in length. This domain is found associated with pfam00534. This domain has two conserved sequence motifs: GGVS and EHGIY. Probab=69.41 E-value=7.8 Score=18.00 Aligned_cols=38 Identities=18% Similarity=0.244 Sum_probs=25.6 Q ss_pred HEEEEC-CCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 458970-78787877077888899999997198599980 Q gi|255764474|r 5 NIDVIA-PNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 5 ~i~~i~-~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~ 42 (352) ||-+|+ .++..-.+||++++-++++.+-.....+..++ T Consensus 2 dI~Ll~EGTYPYv~GGVSsWv~~LI~~lPe~~F~v~~i~ 40 (268) T pfam11997 2 DVCLLLEGTYPYVRGGVSSWVHQLITGLPEVEFAIVFIG 40 (268) T ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEE T ss_conf 689995168888178514999999852999659999981 No 139 >pfam10087 DUF2325 Uncharacterized protein conserved in bacteria (DUF2325). Members of this family of hypothetical bacterial proteins have no known function. Probab=69.21 E-value=2.5 Score=21.38 Aligned_cols=70 Identities=17% Similarity=0.106 Sum_probs=43.4 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCC---CCCCCCHHHHHHCCCCCCC---CCCCCCCCCCCCEEEEEECCCCCCCHH Q ss_conf 466776443112332212222223---3100000000000123332---222221000001001110278841010 Q gi|255764474|r 214 KKNLQRRIYANGLKKRILFIDEQS---SIEDWYRALNIFVAPPLYE---GFGLTPLEAMASGIPVVASNTGVFSEL 283 (352) Q Consensus 214 ~~~l~~~i~~~~l~~~V~~~g~~~---~~~~~~~~adi~i~pS~~E---g~gl~~lEAma~G~PvI~s~~~~~~e~ 283 (352) ...+++.+++.|.+...+..|..+ .+...+..+|++++|...= ..-.+--+|-..|+|++.++..++..+ T Consensus 12 ~~~~~~~~~~~G~~~~~h~~g~~~~~~~l~~~i~~aDlVI~~td~vsH~~~~~vK~~akk~~~p~v~~~s~s~~~l 87 (96) T pfam10087 12 LGHYRKLLEKYGGEFIVHDGGREKKKKKIPALLKKADLVIVFTDCVSHDAMWRVKEEAKKRGIPVVFSRSRSLSAL 87 (96) T ss_pred HHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCEEEECCCCHHHH T ss_conf 8999999998399899965898733556775058988899971766879999999999984997899768749999 No 140 >PTZ00254 40S ribosomal protein SA; Provisional Probab=68.75 E-value=8.1 Score=17.91 Aligned_cols=27 Identities=33% Similarity=0.302 Sum_probs=21.2 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCC Q ss_conf 2221000001001110-27884101002 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLD 285 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~ 285 (352) -.++.||..+|+|+|+ .|...-+++|+ T Consensus 131 ~~AV~EA~~~nIPvIal~DTds~p~~VD 158 (242) T PTZ00254 131 HQAIREASYVNIPVIALCDTDAPLEYVD 158 (242) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCC T ss_conf 0899999985998898616999953265 No 141 >pfam03401 Bug Tripartite tricarboxylate transporter family receptor. These probable extra-cytoplasmic solute receptors are strongly overrepresented in several beta-proteobacteria. This family, formerly known as Bug - Bordetella uptake gene (bug) product - is a family of bacterial tripartite tricarboxylate receptors of the extracytoplasmic solute binding receptor-dependent transporter group of families, distinct from the ABC and TRAP-T families. The TctABC system has been characterized in S. typhimurium, and TctC is the extracytoplasmic tricarboxylate-binding receptor which binds the transporters TctA and TctB, two integral membrane proteins. Complete three-component systems are found only in bacteria. Probab=65.76 E-value=9.2 Score=17.52 Aligned_cols=177 Identities=15% Similarity=0.134 Sum_probs=78.6 Q ss_pred CCEEEEEEEECHHHHHHHHHHHHHHHHCCCC-CEEEEEECCCCCCCCCHHHHHHHHHHHCCCC-CCCCCCCCCCCCCCCC Q ss_conf 7279995122644512366654565300477-4058863133321000146677644311233-2212222223310000 Q gi|255764474|r 166 AKLIGCFGRIRKLKGTDLFVDCMINILPHHP-GWTAVVVGKTTLKHYLFKKNLQRRIYANGLK-KRILFIDEQSSIEDWY 243 (352) Q Consensus 166 ~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~-~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~-~~V~~~g~~~~~~~~~ 243 (352) .+.++.+..=++.|.++.++++. +.+| .+.+--.|.|...+ .....+ .+..|++ ..|-|-|.-+.+..++ T Consensus 78 ~~~~l~v~~dsp~~t~~dli~~a----k~~Pg~~~~g~~G~gs~~h-l~~~~l---~~~~G~~~~~Vpy~G~~~~~~all 149 (274) T pfam03401 78 SPMVLVVPADSPFKTLQELVAYA----KANPGKLTFASAGIGTSNH-LAGELL---ASKAGVQLSHVPYKGSSPALQDLL 149 (274) T ss_pred CCEEEEECCCCCCCCHHHHHHHH----HHCCCCEEEECCCCCCCCH-HHHHHH---HHHCCCCEEEECCCCCHHHHHHHH T ss_conf 63588977999756899999999----8489965873278886318-999999---997099648834678326689986 Q ss_pred H-HHHHHCC-CCC----CCCCCCCCCCCCCCEEEEEECCCCCCCHHCC-C--CCCCEEEEECCCCHHH----HHHHHHHH Q ss_conf 0-0000001-233----3222222100000100111027884101002-3--4884249965999899----99999999 Q gi|255764474|r 244 R-ALNIFVA-PPL----YEGFGLTPLEAMASGIPVVASNTGVFSELLD-P--ENAKAGVIVPPRNLHA----LEKAVLYF 310 (352) Q Consensus 244 ~-~adi~i~-pS~----~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~-~--~~g~~G~~~~~~d~~~----la~~i~~l 310 (352) . ..|+.+. ++. .+.--+.++=-++...--...+++-+.|.-- . .....|+.++++-|++ |.+++.+. T Consensus 150 gG~vd~~~~~~~~~~~~v~~G~lr~Lav~~~~R~~~~PdVPT~~E~G~~~~~~~~w~g~~ap~gtP~~~~~~l~~a~~~~ 229 (274) T pfam03401 150 GGRVDMMIDSLTSTAPYIKAGKLRALAVTSPKRSPQLPDVPTVAELGLKGFEAGVWFGLVAPKGTPPAVVEKLNDAIKKA 229 (274) T ss_pred CCEEEEEEECCHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 78445998351320445417982599997575675448999768848787147888887634988999999999999999 Q ss_pred HCCHHHHHHHHHHHHHHHHHHCCHHHHH---HHHHHHHHHHHHHC Q ss_conf 8698999999999999999829989999---99999999988609 Q gi|255764474|r 311 MNSKKIMSDTGNRGRERAVKHFSIVKEA---SDIGKVYDRLLRTA 352 (352) Q Consensus 311 ~~~~~~~~~~~~~a~~~~~~~fs~~~~a---~~~~~iY~~~l~~a 352 (352) +++|+..+.+.+.+.. ....+.+... ++-.+.|+++++++ T Consensus 230 ~~dpe~~~~~~~~g~~--~~~~~~~e~~~~~~~e~~~~~~l~k~~ 272 (274) T pfam03401 230 LKDPEVVERLANLGME--PVPGTPQQLAQFIQAEVKRWGPLIQEL 272 (274) T ss_pred HCCHHHHHHHHHCCCC--CCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 7499999999977895--788999999999999999999999982 No 142 >KOG2884 consensus Probab=65.51 E-value=9.3 Score=17.49 Aligned_cols=57 Identities=16% Similarity=0.135 Sum_probs=24.3 Q ss_pred CCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 100000100111027884101002348842499659998999999999986989999 Q gi|255764474|r 262 PLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMS 318 (352) Q Consensus 262 ~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~ 318 (352) +++-.-.-.|+|.-+-|+..--.....+..-|-++|++-.+||-+|.--+.....|+ T Consensus 178 ~L~d~l~ssPii~ge~g~a~~~~~a~g~~f~fgvdp~~DPELAlALRlSMEEer~rQ 234 (259) T KOG2884 178 LLSDALLSSPIIQGEDGGAAAGLGANGMDFEFGVDPEDDPELALALRLSMEEERARQ 234 (259) T ss_pred CHHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 077776428603167654322445555532148786659889999985089999889 No 143 >PRK10494 hypothetical protein; Provisional Probab=64.53 E-value=9.7 Score=17.37 Aligned_cols=98 Identities=8% Similarity=0.099 Sum_probs=55.4 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCC-CCCCCCHHHHHHHHHHHCCC-CCCCCCCCCC-C---C---CCCCCHHHHHH Q ss_conf 5123666545653004774058863133-32100014667764431123-3221222222-3---3---10000000000 Q gi|255764474|r 179 KGTDLFVDCMINILPHHPGWTAVVVGKT-TLKHYLFKKNLQRRIYANGL-KKRILFIDEQ-S---S---IEDWYRALNIF 249 (352) Q Consensus 179 Kg~~~li~a~~~l~~~~~~~~l~i~G~g-~~~~~~~~~~l~~~i~~~~l-~~~V~~~g~~-~---~---~~~~~~~adi~ 249 (352) ...+++++++ ++.+++|+.++++-|.. ..+.....+..++...++|+ .++|.+.+.. + + +.+.+....++ T Consensus 104 ~a~~Rl~~g~-~L~r~~~~~~li~SGG~~~~~~~sea~~~~~~~~~lGv~~~~I~~e~~srnT~EnA~~~~~ll~~~~~l 182 (259) T PRK10494 104 NSLPRLNEGI-RLWRANPGAKLIFTGGVAKTNTVSTAEVGARVAQSLGVPREDIITLDLPKDTEEEAAAVKQAIGDAPFL 182 (259) T ss_pred CHHHHHHHHH-HHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHEEECCCCCCHHHHHHHHHHHHCCCCEE T ss_conf 4899999999-999838998399968878889998999999999983999899152567778899999999984889789 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 01233322222210000010011102788 Q gi|255764474|r 250 VAPPLYEGFGLTPLEAMASGIPVVASNTG 278 (352) Q Consensus 250 i~pS~~Eg~gl~~lEAma~G~PvI~s~~~ 278 (352) +.+|-. ..+.++.-.-..|.-||...++ T Consensus 183 LVTSA~-HMpRA~~~F~~~G~~viP~P~d 210 (259) T PRK10494 183 LVTSAS-HLPRAMIFFQQEGLNPLPAPAN 210 (259) T ss_pred EECCHH-HHHHHHHHHHHCCCEEEEECCC T ss_conf 983733-5389999999879922430465 No 144 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=64.46 E-value=5.4 Score=19.09 Aligned_cols=45 Identities=36% Similarity=0.510 Sum_probs=29.8 Q ss_pred HCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHH Q ss_conf 00123332222221000001001110-27884101002348842499659998999 Q gi|255764474|r 249 FVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHAL 303 (352) Q Consensus 249 ~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~l 303 (352) ||.=..+| -+++.||--.|.|||+ -|+=.-||.|+ +.+|-+| +++ T Consensus 162 ~viD~~~E--~IAv~EA~kLgIPvvA~vDTNCdPd~vD-------~~IPgND-Dai 207 (227) T TIGR01011 162 FVIDPRKE--KIAVAEARKLGIPVVAIVDTNCDPDVVD-------YPIPGND-DAI 207 (227) T ss_pred EEECCCCC--HHHHHHHHHCCCCEEEEECCCCCCCCCC-------CCCCCCC-CHH T ss_conf 99748210--6899997337997898704788888665-------0257973-058 No 145 >TIGR01012 Sa_S2_E_A ribosomal protein S2; InterPro: IPR005707 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the ribosomal protein of the eukaryotic cytosol and of Archaea, homologous to S2 of bacteria. It is designated typically as Sa in eukaryotes and Sa or S2 in the archaea. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=64.39 E-value=7.7 Score=18.04 Aligned_cols=93 Identities=20% Similarity=0.303 Sum_probs=56.5 Q ss_pred CCCCCCCCHHHHHHCCC-CCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCC Q ss_conf 23310000000000012-3332222221000001001110-278841010023488424996599989999999999869 Q gi|255764474|r 236 QSSIEDWYRALNIFVAP-PLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNS 313 (352) Q Consensus 236 ~~~~~~~~~~adi~i~p-S~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~ 313 (352) .+....-|..=++++.. -+.+ --++-||..-|+|+|+ .|+....+.|+ +.+|.+|-..=|=+|.+|+-- T Consensus 100 TNp~~~~f~EP~vv~vtDPR~D--~Qa~~EA~~vgiP~vALcDTd~~~~~vD-------l~IP~NNKG~~s~al~yWlLa 170 (197) T TIGR01012 100 TNPALKEFREPEVVVVTDPRAD--HQALKEASEVGIPIVALCDTDNSLRYVD-------LVIPTNNKGRRSLALIYWLLA 170 (197) T ss_pred CCHHHHCCCCCEEEEECCCCCC--CCHHHHHHCCCCCEEEEECCCCCCCCCC-------EEECCCCCHHHHHHHHHHHHH T ss_conf 4023313678737997588766--2121200015896689875887500024-------010798613689999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 89999999999999998299899999999 Q gi|255764474|r 314 KKIMSDTGNRGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 314 ~~~~~~~~~~a~~~~~~~fs~~~~a~~~~ 342 (352) ++..+ .+| .+.+.-.|+.+.+.++ T Consensus 171 Re~L~---~RG--~i~~~~~~~~~~e~F~ 194 (197) T TIGR01012 171 REILR---MRG--VISADEDLDVSVEEFF 194 (197) T ss_pred HHHHH---HCC--CCCCCCCCCCCHHHHH T ss_conf 99997---537--4047987433667561 No 146 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=61.46 E-value=11 Score=17.01 Aligned_cols=36 Identities=19% Similarity=0.157 Sum_probs=19.8 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 45897078787877077888899999997198599980 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~ 42 (352) ||.++..++...+. .......-..-+++|..+.+.. T Consensus 1 kIa~~~~~~~~~f~--~~~~~G~~~~A~~~G~~~~v~d 36 (273) T cd06305 1 RIAVVRYGGSGDFD--QAYLAGTKAEAEALGGDLRVYD 36 (273) T ss_pred CEEEEECCCCCHHH--HHHHHHHHHHHHHCCCEEEEEC T ss_conf 98999717889799--9999999999997499899973 No 147 >cd03789 GT1_LPS_heptosyltransferase Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS). Lipopolysaccharide (LPS) is a major component of the outer membrane of gram-negative bacteria. LPS heptosyltransferase transfers heptose molecules from ADP-heptose to 3-deoxy-D-manno-octulosonic acid (KDO), a part of the inner core component of LPS. This family belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=61.27 E-value=11 Score=16.98 Aligned_cols=99 Identities=16% Similarity=0.097 Sum_probs=52.6 Q ss_pred EEEEEEEECHHHHH--HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCC Q ss_conf 79995122644512--366654565300477405886313332100014667764431123322122222--23310000 Q gi|255764474|r 168 LIGCFGRIRKLKGT--DLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE--QSSIEDWY 243 (352) Q Consensus 168 ~i~~~G~~~~~Kg~--~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~--~~~~~~~~ 243 (352) ++...|.-.+.|.. +...+.+..+.++ ++..+++|...+. ...+++. +..+....+.+.|. ..++..++ T Consensus 124 i~i~~ga~~~~K~wp~~~~~~l~~~l~~~--~~~ivl~g~~~e~--~~~~~i~---~~~~~~~~~~l~g~~sl~el~~li 196 (279) T cd03789 124 VVLPPGASGPAKRWPAERFAALADRLLAR--GARVVLTGGPAER--ELAEEIA---AALGGPRVVNLAGKTSLRELAALL 196 (279) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHC--CCEEEEECCHHHH--HHHHHHH---HHCCCCCEEECCCCCCHHHHHHHH T ss_conf 99958989734579899999999999858--9959993486689--9999999---967999758368999999999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 00000001233322222210000010011102788 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTG 278 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~ 278 (352) +.||+++.+.. -.+-=|.|+|+|+|+=-.+ T Consensus 197 ~~a~l~I~~DT-----g~~HlAaa~~~p~i~ifG~ 226 (279) T cd03789 197 ARADLVVTNDS-----GPMHLAAALGTPTVALFGP 226 (279) T ss_pred HHCCEEEECCC-----HHHHHHHHCCCCEEEEECC T ss_conf 84683375687-----7999999849998999899 No 148 >TIGR00236 wecB UDP-N-acetylglucosamine 2-epimerase; InterPro: IPR003331 UDP-N-acetylglucosamine 2-epimerase 5.1.3.14 from EC catalyses the production of UDP-ManNAc from UDP-GlcNAc. Some of the enzymes is this family are bifunctional. In microorganisms the epimerase is involved in in the synthesis of the capsule precursor UDP-ManNAcA , . The protein from rat liver displays both epimerase and kinase activity .; GO: 0008761 UDP-N-acetylglucosamine 2-epimerase activity, 0006047 UDP-N-acetylglucosamine metabolic process, 0009103 lipopolysaccharide biosynthetic process. Probab=59.78 E-value=12 Score=16.81 Aligned_cols=152 Identities=17% Similarity=0.178 Sum_probs=87.4 Q ss_pred CCCCEEEEEEE---ECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 88727999512---264451236665456530047740588631333210001466776443112332212222223310 Q gi|255764474|r 164 EDAKLIGCFGR---IRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIE 240 (352) Q Consensus 164 ~~~~~i~~~G~---~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~ 240 (352) ++..+++..-| +....+...+++++.++...+++..+++--.....- ...--..-+.++-..++.+....+.+. T Consensus 206 ~~~~~~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~h~~~~~---~~~~~~~~~~~~~~~~~~~~~p~~~~~ 282 (380) T TIGR00236 206 DDRVLLLTLHRRENVGNPEPLENILEAILEILEEFPDVAIVFPVHPNPKV---REPEFGLYKALGDAKRVFLIEPLGYLD 282 (380) T ss_pred CCCEEEEEEHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCC---CHHHHHHHHHHCCCCCEEEECCHHHHH T ss_conf 45444654001104676014689999999987412430356621456421---002310466632566357755134788 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHH Q ss_conf 0000000000123332222221000001001110-278841010023488424996599989999999999869899999 Q gi|255764474|r 241 DWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSD 319 (352) Q Consensus 241 ~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~ 319 (352) -.......++..+ ..|-.--||-..|+|+++ .+...-+|.+. .| +..++ -.|.+.+.+...+++.+|+.+.+ T Consensus 283 ~~~~~~~~~~~l~---d~gg~~~~~~~~~~p~~~~~~~~~~p~~~~--~g-~~~l~-g~~~~~~~~~~~~~l~~p~~~~~ 355 (380) T TIGR00236 283 FLLLLSNAYLILT---DSGGLQEEAPSLGKPVLVLRDTTERPEAVE--AG-TNKLV-GTDKEKITKAAGRLLSDPDEYRK 355 (380) T ss_pred HHHHHHCCEEEEE---CCCCCCHHHHHCCCCEEEEECCCCCCCHHH--CC-CEEEE-CCCHHHHHHHHHHHHCCHHHHHH T ss_conf 8887532416873---477630011212774377503566642010--03-20010-46678899999987317267888 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|255764474|r 320 TGNRGR 325 (352) Q Consensus 320 ~~~~a~ 325 (352) +..... T Consensus 356 ~~~~~~ 361 (380) T TIGR00236 356 MSNAEN 361 (380) T ss_pred HHHHCC T ss_conf 864126 No 149 >TIGR02360 pbenz_hydroxyl 4-hydroxybenzoate 3-monooxygenase; InterPro: IPR012733 4-hydroxybenzoate 3-monooxygenase is a flavoprotein that converts its substrate to 3,4-dihydroxybenzoate, which subsequently enters the beta-ketioadipate pathway of aromatic degradation, using molecular oxygen and NADPH as shown below .4-hydroxybenzoate + NADPH + O(2) = 3,4-dihydroxybenzoate + NADP(+) + H(2)O 4-hydroxybenzoate is an intermediate in the degradation of lignin and other aromatic plant compounds, and this enzyme is found extensively in soil bacteria. This enzyme is a homodimer where each subunit is composed of three distinct domains: an N-terminal flavin-binding domain with a beta-alpha-beta fold, a small substrate-binding domain composed of a single alpha helix and beta-sheet, and a C-terminal helical domain . The active site is found at the interface of all three domains. Catalysis occurs by a two-step reaction. In the first step, flavin is reduced by NADPH. Subsequently, the reduced flavin is oxygenated to a hydroperoxide which transfers the hydroxyl group to the substrate, forming 3,4-dihydroxybenzoate.; GO: 0018659 4-hydroxybenzoate 3-monooxygenase activity, 0050660 FAD binding, 0043639 benzoate catabolic process. Probab=59.20 E-value=11 Score=16.97 Aligned_cols=104 Identities=17% Similarity=0.259 Sum_probs=66.3 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCC---CCCCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 00014667764431123322122222233100000000000123332---222221000001001110278841010023 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYE---GFGLTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~E---g~gl~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) |..+++|||+++.+.- .+ .++--||..- ++.-=+.|-|..|.--+|-| ...||++ T Consensus 239 Dd~FW~ELK~Rlp~e~-A~------------------~LvTGPSiEKSIAPLRSFV~EPM~yGrLFLaGD---AAHIVPP 296 (393) T TIGR02360 239 DDRFWEELKRRLPEEA-AE------------------RLVTGPSIEKSIAPLRSFVAEPMRYGRLFLAGD---AAHIVPP 296 (393) T ss_pred HHHHHHHHHHCCCHHH-HH------------------HCCCCCCCHHHCCCCCCCCCCCCCCCCHHHHCC---CCCCCCC T ss_conf 0778999861378668-75------------------405676412211678774338652465111046---4435888 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHC--CHHHHHHHHHHHHHHH--HHHCCHHH Q ss_conf 48842499659998999999999986--9899999999999999--98299899 Q gi|255764474|r 287 ENAKAGVIVPPRNLHALEKAVLYFMN--SKKIMSDTGNRGRERA--VKHFSIVK 336 (352) Q Consensus 287 ~~g~~G~~~~~~d~~~la~~i~~l~~--~~~~~~~~~~~a~~~~--~~~fs~~~ 336 (352) .|=-|+=....|+.-|.++|.+-.. +.+-..+.+++|-+++ .++|||-. T Consensus 297 -TGAKGLN~AASD~~yL~~aL~~~Y~~~~~~gi~~YSa~aLaRvWKA~RFSWWm 349 (393) T TIGR02360 297 -TGAKGLNLAASDVHYLYEALLEHYQEGDSAGIDGYSAKALARVWKAERFSWWM 349 (393) T ss_pred -CCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf -76221034888999999999997302216577889999963332200433466 No 150 >COG3563 KpsC Capsule polysaccharide export protein [Cell envelope biogenesis, outer membrane] Probab=58.29 E-value=9.1 Score=17.56 Aligned_cols=12 Identities=25% Similarity=0.564 Sum_probs=4.2 Q ss_pred CCCEEEEEECCC Q ss_conf 001001110278 Q gi|255764474|r 266 MASGIPVVASNT 277 (352) Q Consensus 266 ma~G~PvI~s~~ 277 (352) +.+|+|-+.|+. T Consensus 489 i~YG~~~~mtN~ 500 (671) T COG3563 489 IRYGCPSIMTNN 500 (671) T ss_pred HCCCCCCCCCHH T ss_conf 004884003447 No 151 >PRK07726 DNA topoisomerase III; Provisional Probab=57.58 E-value=13 Score=16.57 Aligned_cols=83 Identities=16% Similarity=0.144 Sum_probs=53.6 Q ss_pred CCEEEEEECCCCCCCHHCCCCC--CCEEEEECCCCHHHHHHHH-HHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 0100111027884101002348--8424996599989999999-999869899999999999999982998999999999 Q gi|255764474|r 267 ASGIPVVASNTGVFSELLDPEN--AKAGVIVPPRNLHALEKAV-LYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIGK 343 (352) Q Consensus 267 a~G~PvI~s~~~~~~e~i~~~~--g~~G~~~~~~d~~~la~~i-~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~ 343 (352) .-|+| +|+.+-+..+++... ...+-+++..--..+.+++ ...+.+|+.-.++-..-.+......+++...+.+.+ T Consensus 514 GIGTp--ATRA~IIe~L~~r~Yi~~~~k~l~pT~~G~~li~~l~~~~l~~p~~Ta~wE~~L~~I~~G~~~~~~f~~~i~~ 591 (716) T PRK07726 514 GLGTE--ATRAGIITMLKDRKYIDVKKNQVYATDKGKVLIEAIGDKILASPEMTAKWEQRLAEIGEGTASPATFMEQTKK 591 (716) T ss_pred CCCCC--CCHHHHHHHHHHCCCEEECCCEEEECHHHHHHHHHHCHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 88884--1089999998745714410987517689999998629754389478999999999998699899999999999 Q ss_pred HHHHHHHH Q ss_conf 99998860 Q gi|255764474|r 344 VYDRLLRT 351 (352) Q Consensus 344 iY~~~l~~ 351 (352) ..+++++. T Consensus 592 ~~~~~v~~ 599 (716) T PRK07726 592 LSAKIIED 599 (716) T ss_pred HHHHHHHH T ss_conf 99999999 No 152 >COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis] Probab=55.76 E-value=14 Score=16.38 Aligned_cols=34 Identities=32% Similarity=0.447 Sum_probs=25.1 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 2221000001001110-2788410100234884249965999 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) -+++.||--.|+|||+ .|+-.-|+.|+ +.+|.+| T Consensus 169 ~iAv~EA~klgIPVvAlvDTn~dpd~VD-------~~IP~Nd 203 (252) T COG0052 169 KIAVKEANKLGIPVVALVDTNCDPDGVD-------YVIPGND 203 (252) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCC-------EEECCCC T ss_conf 8999999975999899841899976576-------5522887 No 153 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=55.68 E-value=10 Score=17.19 Aligned_cols=12 Identities=25% Similarity=0.614 Sum_probs=5.2 Q ss_pred ECCCCCHHHHCC Q ss_conf 748827778237 Q gi|255764474|r 138 IMHGVDTERFRP 149 (352) Q Consensus 138 i~~gid~~~~~~ 149 (352) |..|+-=|+|=| T Consensus 137 VnPGFgGQkFIP 148 (216) T TIGR01163 137 VNPGFGGQKFIP 148 (216) T ss_pred EECCCCCCCCHH T ss_conf 607998841105 No 154 >PRK05282 peptidase E; Validated Probab=55.25 E-value=14 Score=16.36 Aligned_cols=23 Identities=13% Similarity=0.082 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHCCEEEEEECC Q ss_conf 88889999999719859998058 Q gi|255764474|r 22 STVFGLCPIQRKLGQRLVVFGYC 44 (352) Q Consensus 22 ~~v~~~~~~~~~~~~~~~~~~~~ 44 (352) .+.......+-+....+.-+.+. T Consensus 18 ~~~~~~i~~~~~~~k~ilFIPyA 40 (233) T PRK05282 18 EHALPLIAELLAGRRKAVFIPYA 40 (233) T ss_pred HHHHHHHHHHHHCCCEEEEECCC T ss_conf 99999999998479659997588 No 155 >TIGR01768 GGGP-family geranylgeranylglyceryl phosphate synthase family protein; InterPro: IPR008205 This family contains prokaryotic proteins that are related to pcrB. Staphylococcus aureus chromosomal gene pcrA encodes a protein with significant similarity (40 0dentity) to two Escherichia coli helicases: the helicase II encoded by the uvrD gene and the Rep helicase. PcrB gene seems to belong to an operon containing at least one other gene, pcrBA, downstream from pcrB . The PcrB proteins often contain an FMN binding site although the function of these proteins is still unknown.. Probab=55.20 E-value=13 Score=16.61 Aligned_cols=13 Identities=15% Similarity=0.140 Sum_probs=5.4 Q ss_pred HHHHHHHHHHHHC Q ss_conf 2366654565300 Q gi|255764474|r 181 TDLFVDCMINILP 193 (352) Q Consensus 181 ~~~li~a~~~l~~ 193 (352) ...++...+++.. T Consensus 176 pPE~va~vk~v~~ 188 (242) T TIGR01768 176 PPELVAEVKKVLD 188 (242) T ss_pred CHHHHHHHHHHHH T ss_conf 4589999998741 No 156 >COG2327 WcaK Polysaccharide pyruvyl transferase family protein [Cell wall/membrane/envelope biogenesis] Probab=55.00 E-value=14 Score=16.29 Aligned_cols=224 Identities=14% Similarity=0.200 Sum_probs=94.6 Q ss_pred HHHHHHHCCCEEEEECCHHHHHH--HH-HHHHHHHHCCEEEECCHHHHHHCC---CCEEEECCCCCHHHHCCCCCHHHHH Q ss_conf 99999709981999757456201--13-479899626689988989997436---8839974882777823784326878 Q gi|255764474|r 84 VMMRDVLRMPLKLVFTSPSQRNH--SR-WTRYLISRMDEVITTSQKSARFIE---RPSTVIMHGVDTERFRPTSNKQEAR 157 (352) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~l~~~~d~ii~~s~~~~~~~~---~~~~vi~~gid~~~~~~~~~~~~~~ 157 (352) ..++.+.+.|+.+.-++.....+ .. +.+...++++.++.-=+.+-+++. .+..... |+-..-+.....+-. T Consensus 117 ~~la~l~~kp~~~~g~svGP~~~~~s~~~~~~~~~~~s~i~vRD~~S~~llk~~gi~a~l~~---D~Af~L~~~~~~~~~ 193 (385) T COG2327 117 ILLARLAGKPTFFFGQSVGPLKHPLSRQLLNYVLGGCSAISVRDPVSYELLKQLGINARLVT---DPAFLLPASSQNATA 193 (385) T ss_pred HHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEEECHHHHHHHHHCCCCEEEEC---CCCEECCCCCCCCCC T ss_conf 99999759987999556787667779999998856774899836875999998099837605---840223454446552 Q ss_pred HHCCCCCCCCEEEEEE-EECHHHHH---HHHHHHHHHH-HCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 8728898872799951-22644512---3666545653-00477405886313332100014667764431123322122 Q gi|255764474|r 158 RHLKISEDAKLIGCFG-RIRKLKGT---DLFVDCMINI-LPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF 232 (352) Q Consensus 158 ~~~~~~~~~~~i~~~G-~~~~~Kg~---~~li~a~~~l-~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~ 232 (352) .....+.....+-+-| .-...+.. ..+.+++..+ ......+++...-.+...+..-.+.+..++.. .+++++ T Consensus 194 ~~~~~~~~~~~i~lr~~~~~~t~~~~~~~~~~~~l~~~~~~~~~~~~i~~~~~~~s~d~~va~~i~~~~~~---~~~i~~ 270 (385) T COG2327 194 SDVEAREKTVAITLRGLHPDNTAQRSILKYVNEALDLVERQVKALWRITLIDYGASDDLAVADAIAQLVLD---SAEILV 270 (385) T ss_pred CCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCHHHHHHHHHHHCCC---CCCEEE T ss_conf 11003455379994146775156678999899999999986410448873204554216789998764387---200686 Q ss_pred CC-CC-CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC-CCCCCHHCCCCCCCEEEEEC--CCCHHHHHHHH Q ss_conf 22-22-3310000000000012333222222100000100111027-88410100234884249965--99989999999 Q gi|255764474|r 233 ID-EQ-SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN-TGVFSELLDPENAKAGVIVP--PRNLHALEKAV 307 (352) Q Consensus 233 ~g-~~-~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~-~~~~~e~i~~~~g~~G~~~~--~~d~~~la~~i 307 (352) .. .. +++...++++|+.|-.-.+ .++=||+.|+|+|+=. .+-...+. ..-|.-++..+ +.|.+.+.... T Consensus 271 ~~d~~~~~~~~~l~~~dl~Vg~R~H-----saI~al~~g~p~i~i~Y~~K~~~l~-~~~gl~~~~~~i~~~~~~~l~~~~ 344 (385) T COG2327 271 SSDEYAEELGGILAACDLIVGMRLH-----SAIMALAFGVPAIAIAYDPKVRGLM-QDLGLPGFAIDIDPLDAEILSAVV 344 (385) T ss_pred ECCHHHHHHHHHHCCCCEEEEEHHH-----HHHHHHHCCCCEEEEEECHHHHHHH-HHCCCCCCCCCCCCCCHHHHHHHH T ss_conf 1432788988775157459862168-----9999986599758986057778999-972997522247787567789999 Q ss_pred HHH-HCCHHHHHH Q ss_conf 999-869899999 Q gi|255764474|r 308 LYF-MNSKKIMSD 319 (352) Q Consensus 308 ~~l-~~~~~~~~~ 319 (352) .+. ..+++.+++ T Consensus 345 ~e~~~~~~~~~~~ 357 (385) T COG2327 345 LERLTKLDELRER 357 (385) T ss_pred HHHHHCCHHHHHH T ss_conf 9997345877765 No 157 >TIGR01343 hacA_fam homoaconitate hydratase family protein; InterPro: IPR006251 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry represents the large subunit of 3-isopropylmalate dehydratase (LeuC), as well as homoaconitase, certain aconitases and uncharacterised proteins, and is a subgroup of those represented in IPR012095. Homoaconitase, aconitase and 3-isopropylmalate dehydratase have similar overall structures and domain organisation . All are dehydratases that bind a [4Fe-4S]-cluster. The majority of the members of this family have been designated as 3-isopropylmalate dehydratase large subunit (LeuC) in microbial genome annotation, but the only characterised member is T. thermophilus homoaconitase, an enzyme of a non-aspartate pathway of lysine biosynthesis. Information about related proteins can be found at Protein of the Month: Aconitase .; GO: 0016836 hydro-lyase activity, 0051539 4 iron 4 sulfur cluster binding, 0008652 amino acid biosynthetic process. Probab=54.79 E-value=9.3 Score=17.48 Aligned_cols=62 Identities=19% Similarity=0.325 Sum_probs=43.0 Q ss_pred CCHHHHCCC-------CCHHHHHHHCC-CCCCCCEEEEE--EEECHHHHHHHHHH-HHHHHHC-CCCCEEEEEE Q ss_conf 277782378-------43268788728-89887279995--12264451236665-4565300-4774058863 Q gi|255764474|r 142 VDTERFRPT-------SNKQEARRHLK-ISEDAKLIGCF--GRIRKLKGTDLFVD-CMINILP-HHPGWTAVVV 203 (352) Q Consensus 142 id~~~~~~~-------~~~~~~~~~~~-~~~~~~~i~~~--G~~~~~Kg~~~li~-a~~~l~~-~~~~~~l~i~ 203 (352) +|.+...|. .|-.+..+-.| ++-++-+|+.+ |||+.-+-.-.+|+ |+.+=++ -+++++++++ T Consensus 271 iD~~~~eP~va~Ph~vdNV~~vse~~G~~~vdQvfIGsCTNGRl~DLr~AAkiLkaalkqGr~GV~~~VrliVi 344 (432) T TIGR01343 271 IDASQLEPVVAVPHNVDNVKPVSEVEGRIEVDQVFIGSCTNGRLEDLRVAAKILKAALKQGRKGVAPDVRLIVI 344 (432) T ss_pred EEEECCCCEECCCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE T ss_conf 87104684322377798873135567860660588840237746789999999999975367552788439998 No 158 >PRK07220 DNA topoisomerase I; Validated Probab=52.03 E-value=16 Score=15.98 Aligned_cols=94 Identities=11% Similarity=0.077 Sum_probs=58.7 Q ss_pred CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHH---HHCCHHHHHHHHHHHHHHHHHH Q ss_conf 3222222100000100111027884101002348842499659998999999999---9869899999999999999982 Q gi|255764474|r 255 YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLY---FMNSKKIMSDTGNRGRERAVKH 331 (352) Q Consensus 255 ~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~---l~~~~~~~~~~~~~a~~~~~~~ 331 (352) +|+-=+..+|-..-|.|. |..+-+..+++...-+..-+.+..--..+.+++.. -+.+|+...+|-+.-...+..+ T Consensus 470 TEasLIk~ME~~GIGtpA--TrA~iIe~L~~R~YVe~~~l~PT~~G~~v~~~L~~~~~~l~~p~~Ta~~E~~Ld~Ia~Gk 547 (740) T PRK07220 470 GQGRLIKLMEDLGLGTKA--TRHEIISKLYSRAYIHGNPIQPTNTSFAVVDALEKYAPTITKPDMTKLLEEDMDLIAEGK 547 (740) T ss_pred CHHHHHHHHHHCCCCCCC--CHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCHHHCCHHHHHHHHHHHHHHHCCC T ss_conf 989999999738998500--499999999861883288652547999999999872345359127999999999998699 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 9989999999999999886 Q gi|255764474|r 332 FSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 332 fs~~~~a~~~~~iY~~~l~ 350 (352) .+|+.+.++..+++++++. T Consensus 548 ~~~~~vl~e~~~~~~~~~~ 566 (740) T PRK07220 548 IKEDAVLEESREMLEQVFD 566 (740) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 9899999999999999999 No 159 >TIGR02195 heptsyl_trn_II lipopolysaccharide heptosyltransferase II; InterPro: IPR011910 This family consists of examples of ADP-heptose:LPS heptosyltransferase II, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria.; GO: 0016757 transferase activity transferring glycosyl groups, 0009103 lipopolysaccharide biosynthetic process. Probab=52.02 E-value=16 Score=15.98 Aligned_cols=304 Identities=16% Similarity=0.169 Sum_probs=143.4 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCE--EEEEECCCCCCC-----------------CCCCHHHHHHHHHHHH Q ss_conf 458970787878770778888999999971985--999805886656-----------------2347899899887543 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQR--LVVFGYCLPKNI-----------------PSIGISSLLTCWKKPI 65 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~--~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~ 65 (352) +|.||.|+.- |=..-...+.+.+++...+ +++.++....-+ ..+.+..-.++-+..+ T Consensus 1 kILvIGPsWV----GDmvMaQ~Lf~~Lk~~yP~~~IDV~APaW~~PlL~RMPEi~~~~~~PlgHGaL~l~~R~rlg~~LR 76 (361) T TIGR02195 1 KILVIGPSWV----GDMVMAQSLFRLLKKRYPDAVIDVLAPAWCKPLLERMPEIRQAIDMPLGHGALELTERYRLGKSLR 76 (361) T ss_pred CEEEECCCCH----HHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHH T ss_conf 9357438736----555676899999998589838972075330133312701578745887876212676789999997 Q ss_pred HCCCE--EEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHCCE-EEECCHHH-HHHC--CCCEEEEC Q ss_conf 25970--8996285468999999997099819997574562011347989962668-99889899-9743--68839974 Q gi|255764474|r 66 GQNSR--IWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRMDE-VITTSQKS-ARFI--ERPSTVIM 139 (352) Q Consensus 66 ~~~~~--ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~-ii~~s~~~-~~~~--~~~~~vi~ 139 (352) ..++| ||--.+.. +-+...+.++|.+.=|..-.-+. +..- ++..|. .=..-+.+ .+|. -....+|. T Consensus 77 ~~~YD~AiVLPNSlK----SALiPfFA~IP~RtGw~GEmRYG---LLND-~r~LdGkak~~~P~m~~rY~ALAy~K~~i~ 148 (361) T TIGR02195 77 EERYDQAIVLPNSLK----SALIPFFAGIPLRTGWRGEMRYG---LLND-IRALDGKAKERLPLMIERYIALAYDKDVIQ 148 (361) T ss_pred HCCCCEEEECCCCHH----HHHHHHHCCCCCCCCCCCCHHHH---HHHH-HHCCCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 558988998677216----67636761886123556323566---6644-220677111433579999998864034467 Q ss_pred CCCCHH-HHCCCC---------CHHHHHHHCC----CCC-CCCEEEEE-E-EECHHHHHHH--HHHHHHHHHCCCCCEEE Q ss_conf 882777-823784---------3268788728----898-87279995-1-2264451236--66545653004774058 Q gi|255764474|r 140 HGVDTE-RFRPTS---------NKQEARRHLK----ISE-DAKLIGCF-G-RIRKLKGTDL--FVDCMINILPHHPGWTA 200 (352) Q Consensus 140 ~gid~~-~~~~~~---------~~~~~~~~~~----~~~-~~~~i~~~-G-~~~~~Kg~~~--li~a~~~l~~~~~~~~l 200 (352) +|-|.. .|.|.+ ++.....+++ +.. ++++|+++ | -+.+.|+..- ..+.+.++.++. .++. T Consensus 149 ~~~~LP~sf~plp~P~L~~~~~~~~~~~~kF~k~taL~~P~rP~ialCPGAEfGpAKRWP~~HyA~LA~~~~~~G-gy~V 227 (361) T TIGR02195 149 KGADLPESFQPLPRPQLQIDPAEQAAALAKFGKQTALDTPERPIIALCPGAEFGPAKRWPAEHYAELAKKLIAQG-GYQV 227 (361) T ss_pred CCCCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCC-CCEE T ss_conf 765576101137774525478899999998512212667788768758776677556785389999999998706-9189 Q ss_pred EEECCCCCCCCCHHHHHHHH-HHHCCCCCCC----CCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 86313332100014667764-4311233221----222222--3310000000000012333222222100000100111 Q gi|255764474|r 201 VVVGKTTLKHYLFKKNLQRR-IYANGLKKRI----LFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVV 273 (352) Q Consensus 201 ~i~G~g~~~~~~~~~~l~~~-i~~~~l~~~V----~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI 273 (352) ++.|+.. |. .+-+. |++.-..... =+.|.. +|-..+++.|...|. -.-| | +-=|.|.+.|+| T Consensus 228 ~lFGS~k--D~----~~~~~fI~~~~~~~~~~~~~NLaG~T~L~EAvdLia~a~avV~--NDSG--L-MHVAAAL~rPlV 296 (361) T TIGR02195 228 VLFGSKK--DK----PVGDEFIEALAPGELREYCDNLAGETSLDEAVDLIALAKAVVS--NDSG--L-MHVAAALNRPLV 296 (361) T ss_pred EEECCCC--CH----HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEE--CCHH--H-HHHHHHCCCCEE T ss_conf 9842845--57----8999999986344688997410478888899999871560121--5606--7-999996279889 Q ss_pred ECCCCCCCHHCCCCCCCEEEEECCCCHH-HHHH--HHHHHHCCHHHHHH-HHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 0278841010023488424996599989-9999--99999869899999-9999999999829989999999999 Q gi|255764474|r 274 ASNTGVFSELLDPENAKAGVIVPPRNLH-ALEK--AVLYFMNSKKIMSD-TGNRGRERAVKHFSIVKEASDIGKV 344 (352) Q Consensus 274 ~s~~~~~~e~i~~~~g~~G~~~~~~d~~-~la~--~i~~l~~~~~~~~~-~~~~a~~~~~~~fs~~~~a~~~~~i 344 (352) +=.-+.-+++ -+|-+.. ++.. ++..+--.|- .++ -...|.....++.+++.+.+.+.++ T Consensus 297 AlYGsTsP~f-----------TPPLs~ka~~~~Gnal~~~~cspc-~~rk~c~~Gh~~cL~~l~P~~V~~~l~~L 359 (361) T TIGR02195 297 ALYGSTSPDF-----------TPPLSEKAEVVRGNALLNLECSPC-FKRKECPYGHHQCLIDLSPEQVLEALEEL 359 (361) T ss_pred EEECCCCCCC-----------CCCHHHHCEEEECCCEECCCCCCC-CCCCCCCCCHHHHHHHCCHHHHHHHHHHH T ss_conf 9736856888-----------887444110013101006732454-35653651147788606988999999974 No 160 >cd06313 PBP1_ABC_sugar_binding_like_5 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=51.06 E-value=16 Score=15.88 Aligned_cols=14 Identities=21% Similarity=0.347 Sum_probs=8.2 Q ss_pred HHHHHCCEEEEEEC Q ss_conf 99971985999805 Q gi|255764474|r 30 IQRKLGQRLVVFGY 43 (352) Q Consensus 30 ~~~~~~~~~~~~~~ 43 (352) .-+++|..+.+... T Consensus 24 aA~e~Gv~v~~~~a 37 (272) T cd06313 24 AGKLLGVDVTWYGG 37 (272) T ss_pred HHHHCCCEEEEECC T ss_conf 99981998999869 No 161 >pfam04392 ABC_sub_bind ABC transporter substrate binding protein. This family contains many hypothetical proteins and some ABC transporter substrate binding proteins. Probab=49.99 E-value=17 Score=15.77 Aligned_cols=61 Identities=21% Similarity=0.250 Sum_probs=30.2 Q ss_pred HHHHCCCCC---CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCC-CEEEEECCCCHHHH-HHHHHHHHC Q ss_conf 000001233---32222221000001001110278841010023488-42499659998999-999999986 Q gi|255764474|r 246 LNIFVAPPL---YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENA-KAGVIVPPRNLHAL-EKAVLYFMN 312 (352) Q Consensus 246 adi~i~pS~---~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g-~~G~~~~~~d~~~l-a~~i~~l~~ 312 (352) .|.++.|.. ...+...+..++.+++|++++.. +.+. +| -.|+.++..+.... ++...+++. T Consensus 184 ~Dal~i~~d~~v~s~~~~i~~~a~~~kiPv~~~~~----~~V~--~Gal~s~g~~~~~~G~~aa~~a~rIL~ 249 (292) T pfam04392 184 VDAIFIPTDNLIASAFTAVLQEANKAKIPVITSDT----SSVE--RGACAAVGVDYKQIGVQAANIVAKILK 249 (292) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHCCCCEEECCH----HHHH--CCCEEEEECCHHHHHHHHHHHHHHHHC T ss_conf 98899937810788999999999974999895778----8884--796799946999999999999999978 No 162 >PRK02395 hypothetical protein; Provisional Probab=49.30 E-value=17 Score=15.70 Aligned_cols=144 Identities=13% Similarity=0.095 Sum_probs=72.6 Q ss_pred CCCEEEEEEEECHHHHHH-HHHHHHHHHHCC---CCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCC-----CCCCCCC Q ss_conf 872799951226445123-666545653004---7740588631333210001466776443112332-----2122222 Q gi|255764474|r 165 DAKLIGCFGRIRKLKGTD-LFVDCMINILPH---HPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKK-----RILFIDE 235 (352) Q Consensus 165 ~~~~i~~~G~~~~~Kg~~-~li~a~~~l~~~---~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~-----~V~~~g~ 235 (352) ..+.|.|.+-+...-.+. .++.-...+..+ ..+.-|.++|.|+.++..-.+.++..++.+.-.+ ...|+.. T Consensus 100 ~g~~i~Y~~PvGthp~m~dvi~~RA~~v~~~~~~~~~t~L~lvgHGT~rn~ns~~ai~~~a~~ire~~~FaEV~~~Fmde 179 (284) T PRK02395 100 GGKRVHYTGPVGTHPGMADVIAQRARDVTGDPEDADDTALLLVGHGTERNENSAKAIYYHADRLRERGRFAEVEAVFLDE 179 (284) T ss_pred CCCEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCC T ss_conf 89338965677888237899999999852788888773599984577888653788999999999748718888876305 Q ss_pred C---CCCCCCCHHHHHHCCCC-CCCCCCC--CCCCCCCCE-EEEEECCCCCCCHHCCCCCCCEEEEEC-CCCHHHHHHHH Q ss_conf 2---33100000000000123-3322222--210000010-011102788410100234884249965-99989999999 Q gi|255764474|r 236 Q---SSIEDWYRALNIFVAPP-LYEGFGL--TPLEAMASG-IPVVASNTGVFSELLDPENAKAGVIVP-PRNLHALEKAV 307 (352) Q Consensus 236 ~---~~~~~~~~~adi~i~pS-~~Eg~gl--~~lEAma~G-~PvI~s~~~~~~e~i~~~~g~~G~~~~-~~d~~~la~~i 307 (352) . ++..+...+-|+++.|- ..+|+-. -+-|.|..- -+.-.-+ .|.- .+|..=|+.. -+....+++-| T Consensus 180 eP~v~~~~e~~~~~~VVvVPfFiAdG~Ht~edIP~~lGlt~~~~~~~~---~P~~---~~g~~i~ya~~vGt~p~~advi 253 (284) T PRK02395 180 EPEVDDWPDLVEADDVVVVPLFVSEGWHTQEDIPEDMGLTDDYRTGPD---SPTA---VDGHTIWYAGPVGTEPLMADVI 253 (284) T ss_pred CCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCHHHCCCCCCCCCCCC---CCCC---CCCEEEEEECCCCCCHHHHHHH T ss_conf 875232677617886799963752675334457465188878754788---9866---3670589713667753489999 Q ss_pred HHHHCCH Q ss_conf 9998698 Q gi|255764474|r 308 LYFMNSK 314 (352) Q Consensus 308 ~~l~~~~ 314 (352) .++..+. T Consensus 254 le~a~~a 260 (284) T PRK02395 254 LELAADA 260 (284) T ss_pred HHHHHHC T ss_conf 9999881 No 163 >PRK10360 DNA-binding transcriptional activator UhpA; Provisional Probab=49.14 E-value=17 Score=15.69 Aligned_cols=25 Identities=16% Similarity=0.080 Sum_probs=15.8 Q ss_pred CCCEEEEECCCCHHHHHHHHHHHHC Q ss_conf 8842499659998999999999986 Q gi|255764474|r 288 NAKAGVIVPPRNLHALEKAVLYFMN 312 (352) Q Consensus 288 ~g~~G~~~~~~d~~~la~~i~~l~~ 312 (352) .|..||+....+++++.++|..+.+ T Consensus 93 ~Ga~g~l~K~~~~~el~~aI~~v~~ 117 (196) T PRK10360 93 AGARGFLSKRCSPDELIAAVHTVAT 117 (196) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHC T ss_conf 5998899778999999999999986 No 164 >COG0550 TopA Topoisomerase IA [DNA replication, recombination, and repair] Probab=49.09 E-value=17 Score=15.68 Aligned_cols=99 Identities=18% Similarity=0.261 Sum_probs=66.3 Q ss_pred CCCC-CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCC--CCEEEEECCCCHHHHHHHHHHH---HCCHHHHHHHHHHHH Q ss_conf 2333-222222100000100111027884101002348--8424996599989999999999---869899999999999 Q gi|255764474|r 252 PPLY-EGFGLTPLEAMASGIPVVASNTGVFSELLDPEN--AKAGVIVPPRNLHALEKAVLYF---MNSKKIMSDTGNRGR 325 (352) Q Consensus 252 pS~~-Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~--g~~G~~~~~~d~~~la~~i~~l---~~~~~~~~~~~~~a~ 325 (352) |+++ |+-=+..+|-.--|.|. |..+-+..+.+-.. .....+.+..--..+.+.+... +.+++...+|-+.-- T Consensus 450 P~rytEasLvk~mE~~GIGrpS--TyA~iI~~L~~RgYv~~~~~~~~pT~~G~~v~~~L~~~f~~l~~~~~Ta~mE~~Ld 527 (570) T COG0550 450 PPRYTEASLVKAMEKLGIGTPS--TYASIIETLQKRGYVEKKGKRLVPTELGEAVIELLEEYFPELVDPDFTAKMEEKLD 527 (570) T ss_pred CCCCCHHHHHHHHHHCCCCCCC--CHHHHHHHHHCCCCEEECCCEEEECHHHHHHHHHHHHHCHHHCCHHHHHHHHHHHH T ss_conf 6998989999999847999700--18999998862773880297567757899999999874524269878999999899 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHC Q ss_conf 999982998999999999999988609 Q gi|255764474|r 326 ERAVKHFSIVKEASDIGKVYDRLLRTA 352 (352) Q Consensus 326 ~~~~~~fs~~~~a~~~~~iY~~~l~~a 352 (352) +.+..+-+|....+++...+.++++.+ T Consensus 528 ~I~~gk~~~~~~l~e~~~~~~~~~~~~ 554 (570) T COG0550 528 EIAEGKLEWKDVLDEFKKKFSKLLEEA 554 (570) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 998599428999999999889999999 No 165 >PRK10610 chemotaxis regulatory protein CheY; Provisional Probab=48.80 E-value=17 Score=15.65 Aligned_cols=45 Identities=18% Similarity=0.349 Sum_probs=31.1 Q ss_pred CEEEEEE-CCCCCCCHHCC-CCCCCEEEEECCCCHHHHHHHHHHHHC Q ss_conf 1001110-27884101002-348842499659998999999999986 Q gi|255764474|r 268 SGIPVVA-SNTGVFSELLD-PENAKAGVIVPPRNLHALEKAVLYFMN 312 (352) Q Consensus 268 ~G~PvI~-s~~~~~~e~i~-~~~g~~G~~~~~~d~~~la~~i~~l~~ 312 (352) ..+|+|. |..+.-.++.. .+.|-.+|+..|-+++++.+.+.+++. T Consensus 79 ~~~Pii~~T~~~~~~~~~~a~~~Ga~~yl~KP~~~~~L~~~i~~vl~ 125 (129) T PRK10610 79 SALPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLNKIFE 125 (129) T ss_pred CCCCEEEEECCCCHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHH T ss_conf 89968999886899999999986998899898999999999999997 No 166 >pfam08660 Alg14 Oligosaccharide biosynthesis protein Alg14 like. Alg14 is involved dolichol-linked oligosaccharide biosynthesis and anchors the catalytic subunit Alg13 to the ER membrane. Probab=48.66 E-value=18 Score=15.64 Aligned_cols=79 Identities=10% Similarity=0.040 Sum_probs=42.4 Q ss_pred HHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHH------CCCEEEEECCHHHHHHHHHHHHHHHHCCEEEECCHHHHH Q ss_conf 98998875432597089962854689999999970------998199975745620113479899626689988989997 Q gi|255764474|r 56 SLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVL------RMPLKLVFTSPSQRNHSRWTRYLISRMDEVITTSQKSAR 129 (352) Q Consensus 56 ~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~l~~~~d~ii~~s~~~~~ 129 (352) .++...+...+.+||++-+..........+..+++ +.++.++-.-.-....+.-.+.++..+|..++.-+..++ T Consensus 76 ~~~~s~~il~k~kPdvii~tG~g~~vp~~~~a~ll~~~~~~~~k~i~iES~~r~~~~sltgkll~~~ad~f~vqW~~l~~ 155 (166) T pfam08660 76 NLLSALKLLRRERPDVILCNGPGTCVPFCLAAKLLKILGLKGTKIVYIESFARVKTLSLSGKLLYPLADRFIVQWPELKK 155 (166) T ss_pred HHHHHHHHHHHHCCCEEEECCCCEEEHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCHHHHHHHHHCCEEEEECHHHHH T ss_conf 99999999985399899977996030999999999864015885899984421378647777699868988983799994 Q ss_pred HCCCC Q ss_conf 43688 Q gi|255764474|r 130 FIERP 134 (352) Q Consensus 130 ~~~~~ 134 (352) ..++. T Consensus 156 ~yp~a 160 (166) T pfam08660 156 KYPKA 160 (166) T ss_pred HCCCC T ss_conf 68997 No 167 >KOG3349 consensus Probab=47.60 E-value=18 Score=15.63 Aligned_cols=94 Identities=16% Similarity=0.163 Sum_probs=48.7 Q ss_pred EEEEEEEECHHHHHHHHHHH------HHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHH-HHCCCCCCCCCCCCCCCCC Q ss_conf 79995122644512366654------5653004774058863133321000146677644-3112332212222223310 Q gi|255764474|r 168 LIGCFGRIRKLKGTDLFVDC------MINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRI-YANGLKKRILFIDEQSSIE 240 (352) Q Consensus 168 ~i~~~G~~~~~Kg~~~li~a------~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i-~~~~l~~~V~~~g~~~~~~ 240 (352) +++.+|.-.. |.||.+ ..++.+..-.--.+-.|.|..--. .. .+.+ +..++. |..-.+.+.+. T Consensus 6 vFVTVGtT~F----d~LI~~Vl~~~~~~~L~k~G~~kLiiQ~Grg~~~~~---d~-~~~~~k~~gl~--id~y~f~psl~ 75 (170) T KOG3349 6 VFVTVGTTSF----DDLISCVLSEEFLQELQKRGFTKLIIQIGRGQPFFG---DP-IDLIRKNGGLT--IDGYDFSPSLT 75 (170) T ss_pred EEEEECCCCH----HHHHHHHCCHHHHHHHHHCCCCEEEEEECCCCCCCC---CH-HHHHCCCCCEE--EEEEECCCCHH T ss_conf 9999566538----999999738999999997396389987358864789---87-77501248758--98773384178 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 00000000001233322222210000010011102 Q gi|255764474|r 241 DWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS 275 (352) Q Consensus 241 ~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s 275 (352) +.++.||+++.-. -.-+++|-+..|+|.|+- T Consensus 76 e~I~~AdlVIsHA----GaGS~letL~l~KPlivV 106 (170) T KOG3349 76 EDIRSADLVISHA----GAGSCLETLRLGKPLIVV 106 (170) T ss_pred HHHHHCCEEEECC----CCCHHHHHHHCCCCEEEE T ss_conf 8875345887458----742099999749977999 No 168 >COG4370 Uncharacterized protein conserved in bacteria [Function unknown] Probab=46.62 E-value=19 Score=15.43 Aligned_cols=154 Identities=14% Similarity=0.092 Sum_probs=71.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCC---C-------CCCCCCCCCCCCCCCHH Q ss_conf 6445123666545653004774058863133321000146677644311233---2-------21222222331000000 Q gi|255764474|r 176 RKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLK---K-------RILFIDEQSSIEDWYRA 245 (352) Q Consensus 176 ~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~---~-------~V~~~g~~~~~~~~~~~ 245 (352) +...+...++.++..+......+.+. +.. .+..-..-+.+..++.|.. + ++++ .+..-.+.+.. T Consensus 238 ea~~nl~~il~slcal~~~~a~vvfw--~ai--~~~lpl~~l~~l~e~~gWq~~ad~~~kdnc~l~l--sqqsfadiLH~ 311 (412) T COG4370 238 EAQTNLAVILGSLCALPAMFALVVFW--AAI--APELPLLLLWTLEERQGWQPLADRFGKDNCSLWL--SQQSFADILHA 311 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HCC--CCCCCHHHHHHHHHHCCCCHHHHHHCCCCEEEEE--EHHHHHHHHHH T ss_conf 78763999999986427777888987--603--7679878999999853862256664667638997--58889999988 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC----C---CCCEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 00000123332222221000001001110278841010023----4---8842499659998999999999986989999 Q gi|255764474|r 246 LNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP----E---NAKAGVIVPPRNLHALEKAVLYFMNSKKIMS 318 (352) Q Consensus 246 adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~----~---~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~ 318 (352) +|+.+-. -|.+.-.+...|+|||.....|- .+... + -|..=.++.+. ++.-+....+++.|++..+ T Consensus 312 adaalgm-----AGTAtEQavGLGkPvi~fPg~GP-Qy~pgFA~rQ~rLLG~sltlv~~~-aq~a~~~~q~ll~dp~r~~ 384 (412) T COG4370 312 ADAALGM-----AGTATEQAVGLGKPVIGFPGQGP-QYNPGFAERQQRLLGASLTLVRPE-AQAAAQAVQELLGDPQRLT 384 (412) T ss_pred HHHHHHH-----CCCHHHHHHCCCCCEEECCCCCC-CCCHHHHHHHHHHHCCEEEECCCC-HHHHHHHHHHHHCCHHHHH T ss_conf 9999875-----44167776336986243689898-758179999999852534541775-4568999999844817778 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 999999999998299899999999 Q gi|255764474|r 319 DTGNRGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 319 ~~~~~a~~~~~~~fs~~~~a~~~~ 342 (352) .+..++++++-+-=-..++++.+. T Consensus 385 air~nGqrRiGqaGaa~rIAe~l~ 408 (412) T COG4370 385 AIRHNGQRRIGQAGAARRIAEELG 408 (412) T ss_pred HHHHCCHHHCCCCCHHHHHHHHHH T ss_conf 887534433167623789999998 No 169 >TIGR02493 PFLA pyruvate formate-lyase 1-activating enzyme; InterPro: IPR012838 Members of this family are iron-sulphur proteins with a radical-SAM domain. A single glycine residue in 2.3.1.54 from EC, formate C-acetyltransferase (formate-pyruvate lyase), is oxidized to the corresponding radical by transfer of a proton from its CH2 to AdoMet with concomitant cleavage of the latter. The reaction requires Fe2+. The first stage is reduction of the AdoMet to give methionine and the 5'-deoxyadenosin-5-yl radical, which then abstracts a hydrogen radical from the glycine residue .; GO: 0043365 [formate-C-acetyltransferase]-activating enzyme activity. Probab=44.93 E-value=20 Score=15.26 Aligned_cols=174 Identities=13% Similarity=0.075 Sum_probs=82.7 Q ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHC Q ss_conf 998058866562347899899887543259708996-2854689999999970998199975745620113479899626 Q gi|255764474|r 39 VVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHA-RRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLISRM 117 (352) Q Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 117 (352) ...+.-.++++..+-+.+-..++-+ .-+++|-|-. +.+.+.-..-+.+.+-+. +-+++.+ T Consensus 6 Es~G~vDGPG~RFvvFmqGC~lRC~-YChNPDTW~~~~~G~~~t~~el~~e~~~y------------------k~f~~~s 66 (243) T TIGR02493 6 ESMGTVDGPGIRFVVFMQGCPLRCQ-YCHNPDTWDLKKGGTEVTVEELIKEVKSY------------------KDFFKAS 66 (243) T ss_pred EECCCCCCCCCEEEEEECCHHHHHH-HCCCCCCCCCCCCCCEECHHHHHHHHHHH------------------HHHHHCC T ss_conf 5345223885068775435367753-05898743358878120789999999989------------------9887207 Q ss_pred CEEEECC--------HHHHHHCC-CCE-EEECCCCCHHH----HCC--CCCHHHHHHHCCCCCCCCEEEEEEEECHHH-- Q ss_conf 6899889--------89997436-883-99748827778----237--843268788728898872799951226445-- Q gi|255764474|r 118 DEVITTS--------QKSARFIE-RPS-TVIMHGVDTER----FRP--TSNKQEARRHLKISEDAKLIGCFGRIRKLK-- 179 (352) Q Consensus 118 d~ii~~s--------~~~~~~~~-~~~-~vi~~gid~~~----~~~--~~~~~~~~~~~~~~~~~~~i~~~G~~~~~K-- 179 (352) -.-|++| ++..+.+. -+. .=|+..+|++- |.+ ....+++.. ..+.+++-+=.+++.| T Consensus 67 GGGvT~SGGEPllQ~~F~~~~f~~cK~~~GiHTclDT~GGCftf~~~~~~~~~~lLe-----~TDLvLLDiK~~~~~~y~ 141 (243) T TIGR02493 67 GGGVTVSGGEPLLQPEFVLELFKACKKELGIHTCLDTSGGCFTFLGGFTEAADELLE-----VTDLVLLDIKHIDPEKYK 141 (243) T ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHCCHHHHHHHHHH-----CCCEEEEECCCCCHHHCC T ss_conf 995898689502016999999999998569838874488343341212489997510-----058788623436812400 Q ss_pred ------HHHHHHHHHHHHHCCC-CCE-EEEEECCCCCCCCCHHHHHHHHHHHCCC-CCCCCCCCCCC Q ss_conf ------1236665456530047-740-5886313332100014667764431123-32212222223 Q gi|255764474|r 180 ------GTDLFVDCMINILPHH-PGW-TAVVVGKTTLKHYLFKKNLQRRIYANGL-KKRILFIDEQS 237 (352) Q Consensus 180 ------g~~~li~a~~~l~~~~-~~~-~l~i~G~g~~~~~~~~~~l~~~i~~~~l-~~~V~~~g~~~ 237 (352) -.+.-|+=++.|.+.. |-| +.|+| .|--++.+-.+.|.+.++.++. -+||.++||.. T Consensus 142 ~LTg~~~~~ptl~Fa~~L~~~~kP~WiRYVlV-PGyTD~~eDi~~l~~fv~~~~~averVe~LPYH~ 207 (243) T TIGR02493 142 KLTGVKSLEPTLEFAKYLAKRNKPIWIRYVLV-PGYTDSEEDIEALAEFVKTLPNAVERVEVLPYHQ 207 (243) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEEEEC-CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 01456775245899999996589889999865-8877998999999999974699279986568860 No 170 >KOG2619 consensus Probab=44.64 E-value=20 Score=15.23 Aligned_cols=59 Identities=14% Similarity=0.183 Sum_probs=41.5 Q ss_pred CCCCE-EEEEECCCCCCCHHCCCCCCCEEEEE-CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 00010-01110278841010023488424996-59998999999999986989999999999999 Q gi|255764474|r 265 AMASG-IPVVASNTGVFSELLDPENAKAGVIV-PPRNLHALEKAVLYFMNSKKIMSDTGNRGRER 327 (352) Q Consensus 265 Ama~G-~PvI~s~~~~~~e~i~~~~g~~G~~~-~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~ 327 (352) |+-+| +||+... +.+.+++... .-+.+ +-.+++++|+.|.+|-+|+....+.-+.-..+ T Consensus 273 al~~gsVPVvlg~-~n~e~fvP~~---SfI~vdDF~s~~ela~ylk~L~~n~~~Y~~Yf~WR~~~ 333 (372) T KOG2619 273 ALDAGSVPVVLGP-PNYENFVPPD---SFIHVDDFQSPQELAAYLKKLDKNPAAYLSYFEWRKDY 333 (372) T ss_pred HHHCCCCCEEECC-CCCCCCCCCC---CEEEHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 4115730179787-5201468970---13763435999999999998632889999999988652 No 171 >PRK07219 DNA topoisomerase I; Validated Probab=44.16 E-value=20 Score=15.18 Aligned_cols=94 Identities=12% Similarity=0.101 Sum_probs=56.4 Q ss_pred CCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHH---HCCHHHHHHHHHHHHHHHHHH Q ss_conf 32222221000001001110278841010023488424996599989999999999---869899999999999999982 Q gi|255764474|r 255 YEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYF---MNSKKIMSDTGNRGRERAVKH 331 (352) Q Consensus 255 ~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l---~~~~~~~~~~~~~a~~~~~~~ 331 (352) +|+-=+..+|-..-|.|. |..+-+..+++...-+..-+.+..--..+.+.+... +.+++...+|-+.-.+...-+ T Consensus 468 TEasLIk~ME~~GIGrpS--TyA~iI~tL~~R~YVe~~~l~PT~lG~~v~~~l~~~~~~l~~~~~Ta~~E~~Ld~I~~G~ 545 (769) T PRK07219 468 DMSSLLKKMESLNLGTKS--TRHDIIGKLISRGFIEGNPVKPTPLGEAFIDAVISVNSRIADPEMTAKLEMDMDKIESGT 545 (769) T ss_pred CHHHHHHHHHHCCCCCCC--CHHHHHHHHHHCCCCCCCCCEECHHHHHHHHHHHHHCHHHCCHHHHHHHHHHHHHHHCCC T ss_conf 999999988617998501--599999999863875089750768899999999873545339267899999999998489 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 9989999999999999886 Q gi|255764474|r 332 FSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 332 fs~~~~a~~~~~iY~~~l~ 350 (352) .+|+.+.+...+.+++++. T Consensus 546 ~~~~~vl~e~~~~~~~~~~ 564 (769) T PRK07219 546 MSKDDVVNESKKMLHSVLN 564 (769) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 8799999999999999999 No 172 >pfam09949 DUF2183 Uncharacterized conserved protein (DUF2183). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=43.86 E-value=21 Score=15.15 Aligned_cols=32 Identities=16% Similarity=0.241 Sum_probs=22.1 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCHHHH Q ss_conf 65456530047740588631333210001466 Q gi|255764474|r 185 VDCMINILPHHPGWTAVVVGKTTLKHYLFKKN 216 (352) Q Consensus 185 i~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~ 216 (352) ...+.++++..|+.+++++|+-.+.|.+-..+ T Consensus 52 ~~~i~~il~~fP~~kfiLiGDsgq~DpeiY~~ 83 (100) T pfam09949 52 RDAIERILRDFPDRKFILIGDSGQHDPEIYAE 83 (100) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHH T ss_conf 99999999978897089956887769999999 No 173 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=43.70 E-value=21 Score=15.14 Aligned_cols=186 Identities=15% Similarity=0.130 Sum_probs=103.4 Q ss_pred HCCCCC--CCCEEEEEEEECHHHHHHH----HHHHHHHHHCC-CCC-E---EEEEECCCCCCCCCHHHHHHHHHHHCC-- Q ss_conf 728898--8727999512264451236----66545653004-774-0---588631333210001466776443112-- Q gi|255764474|r 159 HLKISE--DAKLIGCFGRIRKLKGTDL----FVDCMINILPH-HPG-W---TAVVVGKTTLKHYLFKKNLQRRIYANG-- 225 (352) Q Consensus 159 ~~~~~~--~~~~i~~~G~~~~~Kg~~~----li~a~~~l~~~-~~~-~---~l~i~G~g~~~~~~~~~~l~~~i~~~~-- 225 (352) +.|+.- +..+=+.+=|+..+|+..+ +|.-..+++.. ..+ + +++|.|...+. .-+.+.+-++|+.-+ T Consensus 539 ~~GV~~dP~siFDVQvKRlHEYKRQLLN~L~vi~lY~rIK~~~~~~r~vPRtviFGgKAAPG-Y~~AK~IIKLIN~VA~v 617 (822) T TIGR02093 539 ETGVEVDPNSIFDVQVKRLHEYKRQLLNVLHVIYLYNRIKEDPSKDRIVPRTVIFGGKAAPG-YHMAKLIIKLINSVAEV 617 (822) T ss_pred HCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH-HHHHHHHHHHHHHHHHH T ss_conf 77927686655321123656778999899999998999845878882034477714613355-78888999999878997 Q ss_pred ------CCC--CCCCCCC--CCCCCCCCHHHHHH--CCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEE Q ss_conf ------332--2122222--23310000000000--01233322222210000010011102788410100234884249 Q gi|255764474|r 226 ------LKK--RILFIDE--QSSIEDWYRALNIF--VAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGV 293 (352) Q Consensus 226 ------l~~--~V~~~g~--~~~~~~~~~~adi~--i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~ 293 (352) +.+ +|+|+.. |+-.+.++.+||+= |++.-.|..|..=+=-|..|...|.|--|..-|+.+..-.+|-| T Consensus 618 VNnDP~V~~~LKVVF~pNYnVS~AE~iiPAaDlSEQISTAGKEASGTGNMKFmlNGALTiGTlDGANVEI~E~vG~EN~F 697 (822) T TIGR02093 618 VNNDPAVGDKLKVVFVPNYNVSLAELIIPAADLSEQISTAGKEASGTGNMKFMLNGALTIGTLDGANVEIREEVGAENIF 697 (822) T ss_pred HCCCHHHCCCEEEEEECCCCCCHHHHHCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCC T ss_conf 13784067911589832787574633156244512143004534566217888755557740476116889602854543 Q ss_pred EECC--CCHHHHHHHH---H-HHH-CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 9659--9989999999---9-998-698999999999999999829989999999999999886 Q gi|255764474|r 294 IVPP--RNLHALEKAV---L-YFM-NSKKIMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 294 ~~~~--~d~~~la~~i---~-~l~-~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l~ 350 (352) ++=- ++++++.+.= . .+. .||++.+-+ .....-.|++ ..-+.+..+|++||. T Consensus 698 iFGl~~eEV~~l~~~GY~p~~~~y~~dp~L~~~~----d~~~~G~F~p-g~~~~f~~l~~sL~~ 756 (822) T TIGR02093 698 IFGLTVEEVEELREKGYNPSREYYEADPELKRVL----DLISSGTFSP-GDPDLFRPLVDSLLN 756 (822) T ss_pred CCCCCHHHHHHHHHCCCCCCHHHHHCCHHHHHHH----HHHHCCCCCC-CCCCCHHHHHHHHCC T ss_conf 4789888899998601771014441483689999----9885361389-885301688985323 No 174 >TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative; InterPro: IPR011916 This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.. Probab=43.54 E-value=21 Score=15.12 Aligned_cols=195 Identities=16% Similarity=0.238 Sum_probs=101.4 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHH-----HHCCEEEECCHHHH Q ss_conf 8998998875432597089962854689999999970998199975745620113479899-----62668998898999 Q gi|255764474|r 54 ISSLLTCWKKPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLVFTSPSQRNHSRWTRYLI-----SRMDEVITTSQKSA 128 (352) Q Consensus 54 ~~~~~~~~~~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-----~~~d~ii~~s~~~~ 128 (352) ....+.+.+..+..++|.|- |....+..+ ++.++++...++-|.-| .|.+..|.+.+. ...|.+.++++... T Consensus 69 l~~q~~Li~~LR~~rYd~v~-nL~DQw~~A-~~~kLl~ar~~IgfDyp-kRr~~~Wr~~fTaLaplq~~n~LHtV~~NLs 145 (347) T TIGR02201 69 LASQLRLIKVLRRRRYDLVV-NLADQWQSA-VLAKLLAARVKIGFDYP-KRRKAVWRKSFTALAPLQQSNTLHTVEQNLS 145 (347) T ss_pred HHHHHHHHHHHHHCHHHEEE-EHHHHHHHH-HHHHHHHHHHHCCCCCC-CCCCCCCCCHHHCCCCCCCCCEEEEEEHHHH T ss_conf 99999999998732000464-232344599-99999863543057855-4678611012210252588872342101246 Q ss_pred HHCCCCEEEECCCCCHH----H--H-CCCCCHHHHHH---HCCCCCCCCEEEEEEEECHHHHH-----HHHHHHHHHHHC Q ss_conf 74368839974882777----8--2-37843268788---72889887279995122644512-----366654565300 Q gi|255764474|r 129 RFIERPSTVIMHGVDTE----R--F-RPTSNKQEARR---HLKISEDAKLIGCFGRIRKLKGT-----DLFVDCMINILP 193 (352) Q Consensus 129 ~~~~~~~~vi~~gid~~----~--~-~~~~~~~~~~~---~~~~~~~~~~i~~~G~~~~~Kg~-----~~li~a~~~l~~ 193 (352) - .-|-|++.. + . .+..+.++.++ +-|+.++..+|=-.+|.-. |.. --+++|++ . T Consensus 146 ~-------L~PLGl~~~~~~~rCvm~y~~adw~a~ralL~eaGv~~~YiVIqPtsRwfF-KCW~~drfsali~AL~---a 214 (347) T TIGR02201 146 V-------LAPLGLDSLAKEARCVMSYSEADWKAVRALLDEAGVSENYIVIQPTSRWFF-KCWEDDRFSALIDALA---A 214 (347) T ss_pred H-------HCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHCCCCCCCEEECCCCCCCC-CCCHHHHHHHHHHHHH---H T ss_conf 6-------424688856576602137886459999985232579998178736736310-1662135899999885---1 Q ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCC-CCCCC--CCCCCCCHHHHHHCCCCCCCCC-CCCCCCCCCCE Q ss_conf 47740588631333210001466776443112332212-22222--3310000000000012333222-22210000010 Q gi|255764474|r 194 HHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRIL-FIDEQ--SSIEDWYRALNIFVAPPLYEGF-GLTPLEAMASG 269 (352) Q Consensus 194 ~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~-~~g~~--~~~~~~~~~adi~i~pS~~Eg~-gl~~lEAma~G 269 (352) + .+..|+-...+.++.....++...+.+ ++|+ +.|.. .++..+.-.|-+|+ |. +...-=|.|.| T Consensus 215 ~--Gy~vVLTsgPD~~e~~mV~~I~~g~~~----~rv~slAG~lTl~~LaA~IDhArlFI------GVDSvPmH~AAAlg 282 (347) T TIGR02201 215 E--GYEVVLTSGPDKDELKMVDEIAAGCRS----DRVVSLAGKLTLRQLAAVIDHARLFI------GVDSVPMHMAAALG 282 (347) T ss_pred C--CCEEEECCCCCHHHHHHHHHHHHCCCC----CCCEEEECCCCHHHHHHHHCCCEEEE------CCCCHHHHHHHHCC T ss_conf 8--956988289898679999999711689----87056422235566655731110342------23776789998618 Q ss_pred EEEEE Q ss_conf 01110 Q gi|255764474|r 270 IPVVA 274 (352) Q Consensus 270 ~PvI~ 274 (352) +|.|+ T Consensus 283 ~plva 287 (347) T TIGR02201 283 TPLVA 287 (347) T ss_pred CCEEE T ss_conf 87175 No 175 >COG0062 Uncharacterized conserved protein [Function unknown] Probab=42.99 E-value=21 Score=15.07 Aligned_cols=114 Identities=18% Similarity=0.188 Sum_probs=56.1 Q ss_pred HHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 88728898872799951226445123666545653004774058863133321000146677644311233221222222 Q gi|255764474|r 157 RRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ 236 (352) Q Consensus 157 ~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~ 236 (352) .+.+......++++++|.- ..|=|-++ |...|....-+++....|+..... .+......+.++...-+... T Consensus 41 ~~~~~~~~~~~v~vlcG~G--nNGGDG~V-aAR~L~~~G~~V~v~~~~~~~~~~---~~~a~~~~~~l~~~~~v~~~--- 111 (203) T COG0062 41 LREYPLGRARRVLVLCGPG--NNGGDGLV-AARHLKAAGYAVTVLLLGDPKKLK---TEAARANLKSLGIGGVVKIK--- 111 (203) T ss_pred HHHCCCCCCCEEEEEECCC--CCCHHHHH-HHHHHHHCCCCEEEEEECCCCCCC---HHHHHHHHHHHCCCCCEEEC--- T ss_conf 9876855687799998899--86278999-999998679846999707987760---89999988851688413403--- Q ss_pred CCCCCCCHHHHHHCC--------CCCCCCCCCCCCCCCCCEEEEEECCCCCC Q ss_conf 331000000000001--------23332222221000001001110278841 Q gi|255764474|r 237 SSIEDWYRALNIFVA--------PPLYEGFGLTPLEAMASGIPVVASNTGVF 280 (352) Q Consensus 237 ~~~~~~~~~adi~i~--------pS~~Eg~gl~~lEAma~G~PvI~s~~~~~ 280 (352) +.......+|+.|= ....|.+...+-..-..|+|+|+-|+|.- T Consensus 112 -~~~~~~~~~dvIVDalfG~G~~g~lrep~a~~Ie~iN~~~~pivAVDiPSG 162 (203) T COG0062 112 -ELEDEPESADVIVDALFGTGLSGPLREPFASLIEAINASGKPIVAVDIPSG 162 (203) T ss_pred -CCCCCCCCCCEEEEECEECCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCC T ss_conf -344666537899980420687888860899999999734990699868988 No 176 >COG3959 Transketolase, N-terminal subunit [Carbohydrate transport and metabolism] Probab=42.99 E-value=21 Score=15.07 Aligned_cols=69 Identities=20% Similarity=0.210 Sum_probs=40.5 Q ss_pred HCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC---C-C-CCCCCCHHHHHHCCCCCCCCCCCCCCC Q ss_conf 00477405886313332100014667764431123322122222---2-3-310000000000012333222222100 Q gi|255764474|r 192 LPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE---Q-S-SIEDWYRALNIFVAPPLYEGFGLTPLE 264 (352) Q Consensus 192 ~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~---~-~-~~~~~~~~adi~i~pS~~Eg~gl~~lE 264 (352) .+..+...++++|+|...+..-++.... +..++|.+-|.|.+. | + .....+...+ +...+|.||.-++| T Consensus 137 l~~~~~~VyvilGDGEl~EG~~WEAam~-Aah~~L~NLiaivD~N~~QldG~t~~i~~~~p---L~~k~eAFGw~V~e 210 (243) T COG3959 137 LKGSPYRVYVILGDGELDEGQVWEAAMT-AAHYKLDNLIAIVDRNKLQLDGETEEIMPKEP---LADKWEAFGWEVIE 210 (243) T ss_pred HCCCCCEEEEEECCCCCCCCCHHHHHHH-HHHHCCCCEEEEEECCCCCCCCCHHHCCCCCH---HHHHHHHCCCEEEE T ss_conf 5289844999955754366527999999-98705376799994587424776543268520---07888843806899 No 177 >TIGR01769 GGGP geranylgeranylglyceryl phosphate synthase; InterPro: IPR010946 This entry represents geranylgeranylglyceryl phosphate synthase which catalyses the first committed step in the synthesis of ether-linked membrane lipids in archaea .. Probab=42.73 E-value=21 Score=15.12 Aligned_cols=22 Identities=9% Similarity=0.191 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHCCEEEEEECC Q ss_conf 8889999999719859998058 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYC 44 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~ 44 (352) .....+++....|-+...++.. T Consensus 14 ~~~~IAk~~~~aGtD~ilvGGs 35 (212) T TIGR01769 14 ELKKIAKNAKEAGTDAILVGGS 35 (212) T ss_pred HHHHHHHHHHHCCCCEEEECCC T ss_conf 6999999998528988983663 No 178 >TIGR02030 BchI-ChlI magnesium chelatase ATPase subunit I; InterPro: IPR011775 This entry represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll . This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. It is also found in certain archaea not known to make chlorins.; GO: 0005524 ATP binding, 0016851 magnesium chelatase activity, 0006779 porphyrin biosynthetic process. Probab=42.73 E-value=13 Score=16.56 Aligned_cols=108 Identities=18% Similarity=0.163 Sum_probs=56.2 Q ss_pred EEEEEE--ECHHHHHHHHHHHHHHHH---------CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 999512--264451236665456530---------047740588631333210001466776443112332212222223 Q gi|255764474|r 169 IGCFGR--IRKLKGTDLFVDCMINIL---------PHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS 237 (352) Q Consensus 169 i~~~G~--~~~~Kg~~~li~a~~~l~---------~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~ 237 (352) ++|+-- +-.-.=+|.|++.++.=. =+|| ++|+++|+|.+++.+.-.+| ....|+.-+|.-.-.++ T Consensus 137 ~LYiDEVNLLeDHlVDvLLDvAasG~NvVEREG~SiRHP-ARFVLVGSGNPEEGeLRPQL---LDRFGlhaeirt~rdve 212 (340) T TIGR02030 137 YLYIDEVNLLEDHLVDVLLDVAASGVNVVEREGVSIRHP-ARFVLVGSGNPEEGELRPQL---LDRFGLHAEIRTVRDVE 212 (340) T ss_pred CEEEEEEHHHHHHHHHHHHHHCCCCCEEEEECCEEEECC-CCEEEECCCCCCCCCCCCHH---HHHCCCEEEEECCCCHH T ss_conf 425521000124566566522047830586356023037-63587157888766667401---52446527872267722 Q ss_pred C-C-----C-CCCHHHHHHCCCCCCCC--CCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 3-1-----0-00000000001233322--222210000010011102788410 Q gi|255764474|r 238 S-I-----E-DWYRALNIFVAPPLYEG--FGLTPLEAMASGIPVVASNTGVFS 281 (352) Q Consensus 238 ~-~-----~-~~~~~adi~i~pS~~Eg--~gl~~lEAma~G~PvI~s~~~~~~ 281 (352) . + . .|=+.-+.|+=-=-.|. +--.+..|.. -+|=+.++-+... T Consensus 213 ~RVeiv~rR~eyDadp~aF~ekw~~E~~~Lq~kI~~Aq~-~LPqv~~p~~vl~ 264 (340) T TIGR02030 213 ERVEIVKRRDEYDADPFAFVEKWAKEDEKLQAKIVNAQA-LLPQVKLPDDVLY 264 (340) T ss_pred HHHHHEEEHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH-HCCCCCCCCCHHH T ss_conf 201220100331778853455578889999999999997-4880118861021 No 179 >pfam00290 Trp_syntA Tryptophan synthase alpha chain. Probab=42.19 E-value=22 Score=14.99 Aligned_cols=23 Identities=13% Similarity=0.225 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHCCEEEEEECCC Q ss_conf 88899999997198599980588 Q gi|255764474|r 23 TVFGLCPIQRKLGQRLVVFGYCL 45 (352) Q Consensus 23 ~v~~~~~~~~~~~~~~~~~~~~~ 45 (352) +..++...+.+.|.++.=.+-+. T Consensus 24 ~~~~~i~~l~~~GaDiiEiGiPF 46 (258) T pfam00290 24 TTLEILEALEEAGADAIELGIPF 46 (258) T ss_pred HHHHHHHHHHHCCCCEEEECCCC T ss_conf 99999999997699999978998 No 180 >cd05566 PTS_IIB_galactitol PTS_IIB_galactitol: subunit IIB of enzyme II (EII) of the galactitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a galactitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain that are expressed on three distinct polypeptide chains, in contrast to other PTS sugar transporters. The three genes encoding these subunits (gatA, gatB, and gatC) comprise the gatCBA operon. Galactitol PTS permease takes up exogenous galactitol, releasing the phosphate ester into the cytoplasm in preparation for oxidation and further metabolism via a modified glycolytic pathway called the tagatose-6-phosphate glycolytic pathway. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include galactitol, chitobiose/lichenan, ascorbate, lactose, mannitol, fructose, and Probab=41.92 E-value=15 Score=16.09 Aligned_cols=56 Identities=9% Similarity=0.164 Sum_probs=38.6 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCC Q ss_conf 588631333210001466776443112332212222223310000000000012333 Q gi|255764474|r 199 TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLY 255 (352) Q Consensus 199 ~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~ 255 (352) -++.+|.|-.-+..-..++++..++.|++..+. .....++.......|+++..+.. T Consensus 3 IlVaCGsGiaTSt~v~~ki~~~l~e~gi~~~i~-~~~i~e~~~~~~~~Dliv~tt~~ 58 (89) T cd05566 3 ILVACGTGVATSTVVASKVKELLKENGIDVKVE-QCKIAEVPSLLDDADLIVSTTKV 58 (89) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEE-EECHHHHHHCCCCCCEEEEECCC T ss_conf 999989970399999999999999869915999-97599987427998789994426 No 181 >PRK00124 hypothetical protein; Validated Probab=41.89 E-value=18 Score=15.60 Aligned_cols=84 Identities=15% Similarity=0.273 Sum_probs=39.8 Q ss_pred CCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HHHHCC---HHHH Q ss_conf 100000100111027884101002348842499659998999999999986989999999999999-998299---8999 Q gi|255764474|r 262 PLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRER-AVKHFS---IVKE 337 (352) Q Consensus 262 ~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~-~~~~fs---~~~~ 337 (352) ++|-+..|=-||+.|.|-...++. .| ++.+.|.--.==.+-|..+++.+...+++...|... --..|+ -.+. T Consensus 59 I~~~~~~gDiVIT~DipLAa~~l~--Kg--a~vi~prG~~yt~~NI~~~L~~R~~~~~lR~~G~~t~Gp~~f~~~Dr~~F 134 (149) T PRK00124 59 IVERAEKGDIVITQDYPLAAEALE--KG--AIVLNPRGEIYTNDNIDMLLAMRDLMATLRRSGGRTGGPKPFTPEDRSRF 134 (149) T ss_pred HHHHCCCCCEEEECCHHHHHHHHH--CC--CEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH T ss_conf 998388999999698899999998--79--89999998689887899999988888999973998999999999999999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999988 Q gi|255764474|r 338 ASDIGKVYDRLL 349 (352) Q Consensus 338 a~~~~~iY~~~l 349 (352) +++++.+.+++. T Consensus 135 ~~~L~~~l~~~~ 146 (149) T PRK00124 135 EAELDKLIRRVK 146 (149) T ss_pred HHHHHHHHHHHH T ss_conf 999999999986 No 182 >COG1887 TagB Putative glycosyl/glycerophosphate transferases involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC [Cell envelope biogenesis, outer membrane] Probab=41.54 E-value=22 Score=14.92 Aligned_cols=169 Identities=14% Similarity=0.181 Sum_probs=84.2 Q ss_pred CCEEEECCCCCHHH-HC-CCCC---HHHHHHHCCCCCCCCEEEEEEEECHHH--------HHHHHHHHHHHHHCCCCCEE Q ss_conf 88399748827778-23-7843---268788728898872799951226445--------12366654565300477405 Q gi|255764474|r 133 RPSTVIMHGVDTER-FR-PTSN---KQEARRHLKISEDAKLIGCFGRIRKLK--------GTDLFVDCMINILPHHPGWT 199 (352) Q Consensus 133 ~~~~vi~~gid~~~-~~-~~~~---~~~~~~~~~~~~~~~~i~~~G~~~~~K--------g~~~li~a~~~l~~~~~~~~ 199 (352) ....+...|.+... +. .... ........+++.+..+|+|.-.+.+.. ..+.-++++.+...+ +++. T Consensus 170 ~~~~i~~~G~Pr~D~~~~~~~~~~~~~~~~~~~~~~~~k~vIlyaPTfr~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~ 248 (388) T COG1887 170 DKENILETGYPRNDKLFDEAGKTEDILLIQLALPLPQDKKVILYAPTFRDNDVLIGTQFFNLDIDIEKLKEKLGE-NEYV 248 (388) T ss_pred CCCCEEECCCCCCHHHHHHCCHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCHHHCCHHHHHHHHHHHCC-CCEE T ss_conf 300044336585056544122146678877604775557769965765788554330110001459999876166-8769 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE--CC Q ss_conf 8863133321000146677644311-23322122222233100000000000123332222221000001001110--27 Q gi|255764474|r 200 AVVVGKTTLKHYLFKKNLQRRIYAN-GLKKRILFIDEQSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA--SN 276 (352) Q Consensus 200 l~i~G~g~~~~~~~~~~l~~~i~~~-~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~--s~ 276 (352) +++-=.... ++.+... ...+.+...-...++..+|..+|++|. -++.+..|+|...+|+|- .| T Consensus 249 ii~k~Hp~i---------s~~~~~~~~~~~~~~~vs~~~di~dll~~sDiLIT-----DySSv~fdf~~l~KPiify~~D 314 (388) T COG1887 249 IIVKPHPLI---------SDKIDKRYALDDFVLDVSDNADINDLLLVSDILIT-----DYSSVIFDFMLLDKPIIFYTYD 314 (388) T ss_pred EEEECCHHH---------CCCCHHHHCCCCCEEECCCCHHHHHHHHHCCEEEE-----CCCCHHHHHHHHCCCEEEEECC T ss_conf 999558534---------04210002036604654663449999875488885-----1630046688655967999647 Q ss_pred CCCCCHH---CCC-CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 8841010---023-48842499659998999999999986989999 Q gi|255764474|r 277 TGVFSEL---LDP-ENAKAGVIVPPRNLHALEKAVLYFMNSKKIMS 318 (352) Q Consensus 277 ~~~~~e~---i~~-~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~ 318 (352) .....+. ... ....-|-+++ +.+++.++|.....+++... T Consensus 315 ~~~y~~~rg~~~d~~~~~Pg~~~~--~~~~li~ai~~~~~~~~~~~ 358 (388) T COG1887 315 LEQYDELRGFYLDYKFEAPGEVVE--TQEELIDAIKPYDEDGNYDL 358 (388) T ss_pred CHHHHHCCCHHHHHHHCCCCCHHC--CHHHHHHHHHHHHCCCHHHH T ss_conf 047554011244577619951103--68999998775310221568 No 183 >pfam02504 FA_synthesis Fatty acid synthesis protein. The plsX gene is part of the bacterial fab gene cluster which encodes several key fatty acid biosynthetic enzymes. The exact function of the plsX protein in fatty acid synthesis is unknown. Probab=40.64 E-value=23 Score=14.83 Aligned_cols=50 Identities=20% Similarity=0.220 Sum_probs=23.3 Q ss_pred CHHHHHHHHHHHHHHH-CCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 0778888999999971-9859998058866562347899899887543259708996285 Q gi|255764474|r 19 GVTSTVFGLCPIQRKL-GQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRN 77 (352) Q Consensus 19 Gv~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~ 77 (352) |....+......+.+. ...+..++... .....+. ......+..++|+... T Consensus 13 ~P~~vv~Ga~~al~~~~~~~i~LvG~e~-------~i~~~l~--~~~~~~~i~iv~a~~~ 63 (322) T pfam02504 13 GPLEVVKGVLLAAKSAQDLDIVLVGDKD-------AIDPLVK--EAELNSKLTIVHAEGV 63 (322) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECHH-------HHHHHHH--HCCCCCCCEEEECCCC T ss_conf 8488999999999878897699992999-------9999985--1676568189968880 No 184 >COG3414 SgaB Phosphotransferase system, galactitol-specific IIB component [Carbohydrate transport and metabolism] Probab=40.35 E-value=23 Score=14.80 Aligned_cols=55 Identities=11% Similarity=0.104 Sum_probs=34.5 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCC Q ss_conf 58863133321000146677644311233221222222331000000000001233 Q gi|255764474|r 199 TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPL 254 (352) Q Consensus 199 ~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~ 254 (352) -+..+|.|-.-+.--..++++..+++|+.-.+.- -..++.......+|+++...+ T Consensus 4 IL~aCG~GvgSS~~ik~kve~~l~~~gi~~~~~~-~~v~~~~~~~~~aDiiv~s~~ 58 (93) T COG3414 4 ILAACGNGVGSSTMIKMKVEEVLKELGIDVDVEQ-CAVDEIKALTDGADIIVTSTK 58 (93) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEE-EEECCCCCCCCCCCEEEEEHH T ss_conf 9998789802899999999999998599842666-771111457875529997467 No 185 >TIGR02478 6PF1K_euk 6-phosphofructokinase; InterPro: IPR009161 Members of this family are eukaryotic (with one exception) ATP-dependent 6-phosphofructokinases (2.7.1.11 from EC) in which two tandem copies of the phosphofructokinase are found . Members are found, often including several isozymes, in animals and fungi and in the bacterium Propionibacterium acnes KPA171202 (a human skin commensal). Phosphofructokinase (PFK) catalyses the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate, which then enters the Embden-Meyerhof pathway. PFK is a key regulatory enzyme in glycolysis. For additional information please see , , .; GO: 0003872 6-phosphofructokinase activity, 0006096 glycolysis, 0005737 cytoplasm. Probab=38.10 E-value=25 Score=14.57 Aligned_cols=144 Identities=15% Similarity=0.104 Sum_probs=83.7 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCC---CCHHHHHHHHH-HHCCCCCCCCCCCCCC--CCCCC-------CHHHHH Q ss_conf 36665456530047740588631333210---00146677644-3112332212222223--31000-------000000 Q gi|255764474|r 182 DLFVDCMINILPHHPGWTAVVVGKTTLKH---YLFKKNLQRRI-YANGLKKRILFIDEQS--SIEDW-------YRALNI 248 (352) Q Consensus 182 ~~li~a~~~l~~~~~~~~l~i~G~g~~~~---~~~~~~l~~~i-~~~~l~~~V~~~g~~~--~~~~~-------~~~adi 248 (352) |.|...+.+-+++...-..+|++.|.-+. ..--+..|+.+ ++++++-||..+|++. -.+.. .+..+. T Consensus 227 d~lc~~l~~~R~~GkR~~iViVAEGA~D~~~~pItse~VK~vl~~~L~lDtRiT~LGHVQRGG~PsA~DR~Lat~~GvEA 306 (777) T TIGR02478 227 DQLCAKLKRNRKAGKRKNIVIVAEGAIDRELNPITSEDVKDVLVERLGLDTRITVLGHVQRGGAPSAYDRILATLQGVEA 306 (777) T ss_pred HHHHHHHHHHHHCCCCCEEEEEEECEECCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 89989988877368964079984210546777968898988864401840247660522468872289999998644999 Q ss_pred --HCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf --001233322222210000010011102788410100234884249965999899999999998698999999999999 Q gi|255764474|r 249 --FVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRE 326 (352) Q Consensus 249 --~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~ 326 (352) -|+-+.-|.-|-++. ..-.=-|||+.+ + | .-.=.|=-+.-+-.+.+...+.+.+.-..|..++++ T Consensus 307 V~avL~~tpe~~Gavvt-Gqk~ps~VI~l~--g--------N--kI~R~PL~~~V~~T~~V~~~i~~~~f~~A~~LRg~e 373 (777) T TIGR02478 307 VLAVLESTPETEGAVVT-GQKTPSPVISLR--G--------N--KIVRKPLVEAVRQTKTVAKAIKERRFDEAVRLRGRE 373 (777) T ss_pred HHHHHCCCCCCCCCEEE-CCCCCCEEEEEC--C--------C--EEEECHHHHHHHHHHHHHHHHHCCCCHHHHHCCCHH T ss_conf 99997268776874761-543555147506--8--------7--022131889999899989987317754587235723 Q ss_pred HHHHHCCHHHHH Q ss_conf 999829989999 Q gi|255764474|r 327 RAVKHFSIVKEA 338 (352) Q Consensus 327 ~~~~~fs~~~~a 338 (352) +++...++.... T Consensus 374 F~~~~~~~~~~s 385 (777) T TIGR02478 374 FVENLATFLELS 385 (777) T ss_pred HHHHHHHHHHHC T ss_conf 577544477620 No 186 >PRK10643 DNA-binding transcriptional regulator BasR; Provisional Probab=36.82 E-value=26 Score=14.44 Aligned_cols=18 Identities=17% Similarity=-0.007 Sum_probs=11.3 Q ss_pred HHHHHHHHHHCCEEEEEE Q ss_conf 899999997198599980 Q gi|255764474|r 25 FGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 25 ~~~~~~~~~~~~~~~~~~ 42 (352) ..+...+...|+.+..++ T Consensus 14 ~~l~~~L~~~g~~V~~a~ 31 (222) T PRK10643 14 QGLILAAQTEGYACDGVS 31 (222) T ss_pred HHHHHHHHHCCCEEEEEC T ss_conf 999999997899999989 No 187 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=36.14 E-value=27 Score=14.37 Aligned_cols=88 Identities=13% Similarity=0.157 Sum_probs=47.7 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHC--C-CCCC Q ss_conf 23666545653004774058863133321000146677644311233221222222--33100000000000--1-2333 Q gi|255764474|r 181 TDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFV--A-PPLY 255 (352) Q Consensus 181 ~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i--~-pS~~ 255 (352) .+...+++..++..+|++..+++.+... .....+-+++.|+.+.|...|+- ++..+.++.-.+.. . .... T Consensus 169 ~~~a~~~~~~~L~~~Pdi~~I~~~nd~~-----a~Ga~~Al~~aG~~~~V~vvG~D~~~~~~~~I~~G~i~atv~Q~P~~ 243 (298) T cd06302 169 ADKSYQTAQELLKAYPDLKGIIGPTSVG-----IPGAARAVEEAGLKGKVAVTGLGLPNQMAPYVKSGAVKEFALWNPAD 243 (298) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECCCHH-----HHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCCCEEEEECCHHH T ss_conf 8999999999997599912999789816-----99999999967999985999968879999999859826999849999 Q ss_pred CCC-CCCCCCCCCCEEEEE Q ss_conf 222-222100000100111 Q gi|255764474|r 256 EGF-GLTPLEAMASGIPVV 273 (352) Q Consensus 256 Eg~-gl~~lEAma~G~PvI 273 (352) .|+ +.-..=.+..|.+|- T Consensus 244 ~G~~av~~a~~~~~Ge~V~ 262 (298) T cd06302 244 LGYAAVYVAKALLEGKEIG 262 (298) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999999999997799078 No 188 >PRK05927 hypothetical protein; Provisional Probab=35.43 E-value=25 Score=14.61 Aligned_cols=135 Identities=13% Similarity=0.116 Sum_probs=52.1 Q ss_pred HHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 87288988727999512264451236665456530047740588631333210001466776443112332212222223 Q gi|255764474|r 158 RHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQS 237 (352) Q Consensus 158 ~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~ 237 (352) ...|++.+... ..|.+......-.=+..+.++.++...++-.|. ..-......|.+.+. -....+ T Consensus 195 H~lGi~ttaTm--lyGhiEt~e~ri~HL~~lR~lQdeTgGF~~FIp----l~F~p~nt~l~~~~~---------~~~~~~ 259 (350) T PRK05927 195 HRLGFRSTATM--MFGHVENPEDILLHLQTLRDAQDENPGFYSFIP----WSYKPGNTALGRRVP---------QQASPE 259 (350) T ss_pred HHCCCCCCCEE--ECCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEE----CCCCCCCCHHHHCCC---------CCCCHH T ss_conf 98599752024--636879999999999999998765098799994----676548874654278---------899845 Q ss_pred CCCCCCHHHHHHCC--C---CCCCCCCC-CCCCCCCCEEEEEECCCCC--CCHHCCCCCCCEEEEECCCCHHHHHHHHHH Q ss_conf 31000000000001--2---33322222-2100000100111027884--101002348842499659998999999999 Q gi|255764474|r 238 SIEDWYRALNIFVA--P---PLYEGFGL-TPLEAMASGIPVVASNTGV--FSELLDPENAKAGVIVPPRNLHALEKAVLY 309 (352) Q Consensus 238 ~~~~~~~~adi~i~--p---S~~Eg~gl-~~lEAma~G~PvI~s~~~~--~~e~i~~~~g~~G~~~~~~d~~~la~~i~~ 309 (352) +....++-|-+|+- | +.|-..|. ....|+.+|. .|.|| +.|-|... .|.-.. .+++++.+.|.. T Consensus 260 ~~Lr~~AvaRl~Ldn~~hIqa~Wv~~G~~~aq~aL~~Ga----nDlgGT~~eE~I~~a---aG~~~~-~~~~el~~~I~~ 331 (350) T PRK05927 260 LYYRILALARIFLDNFDHIAASWFGEGKEEGAKGLHYGA----DDFGGTILDESVHKC---TGWDLQ-SSEEEICAMIRS 331 (350) T ss_pred HHHHHHHHHHHHCCCCCCEEECEEECCHHHHHHHHHCCC----CCCCCCCEEEEEECC---CCCCCC-CCHHHHHHHHHH T ss_conf 759999999997069886372405257999999996699----767655300046432---289989-999999999998 Q ss_pred HHCCHH Q ss_conf 986989 Q gi|255764474|r 310 FMNSKK 315 (352) Q Consensus 310 l~~~~~ 315 (352) .=.-|. T Consensus 332 aG~~P~ 337 (350) T PRK05927 332 EGFIPV 337 (350) T ss_pred CCCCCC T ss_conf 599730 No 189 >PRK01909 pdxA 4-hydroxythreonine-4-phosphate dehydrogenase; Validated Probab=35.03 E-value=28 Score=14.25 Aligned_cols=63 Identities=14% Similarity=0.184 Sum_probs=34.0 Q ss_pred CCCHHHHHHCCCCCCCCC-CCCCCC-----CCCCEEEEEE--CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC Q ss_conf 000000000012333222-222100-----0001001110--27884101002348842499659998999999999986 Q gi|255764474|r 241 DWYRALNIFVAPPLYEGF-GLTPLE-----AMASGIPVVA--SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN 312 (352) Q Consensus 241 ~~~~~adi~i~pS~~Eg~-gl~~lE-----Ama~G~PvI~--s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~ 312 (352) ......|+++.--+.-|. |++++. -...|+|+|- .|.|-.-|+. |. | .-|+.++.+|+....+ T Consensus 251 ~~~~~~D~vlaMYHDQglip~K~l~f~~~vn~TlGLp~iRtS~dHGTA~diA----Gk-g----~A~~~s~~~Ai~~A~~ 321 (329) T PRK01909 251 RHLEDADCVLAMFHDQGLPVLKYATFGEGINVTLGLPIIRTSVDHGTALDLA----GT-G----RADPGSMIAAIDTAVT 321 (329) T ss_pred HHCCCCCEEEECCCCCCHHHHHHCCCCCCEEEECCCCCEEECCCCCHHHHHH----CC-C----CCCHHHHHHHHHHHHH T ss_conf 7577989999855012358787535787379934899106789987156662----78-9----8996999999999999 No 190 >PRK10365 transcriptional regulatory protein ZraR; Provisional Probab=33.85 E-value=29 Score=14.13 Aligned_cols=51 Identities=8% Similarity=0.172 Sum_probs=27.8 Q ss_pred CCHHHHHHHHHHHHCCHH--HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 998999999999986989--9999999999999982998999999999999988 Q gi|255764474|r 298 RNLHALEKAVLYFMNSKK--IMSDTGNRGRERAVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 298 ~d~~~la~~i~~l~~~~~--~~~~~~~~a~~~~~~~fs~~~~a~~~~~iY~~~l 349 (352) +|...|++.+..-++... ....+...+.+. ..+|+|.-.++++.++.++.+ T Consensus 317 eDI~~L~~~fl~~~~~~~~~~~~~~s~~a~~~-L~~y~WPGNvREL~n~iera~ 369 (441) T PRK10365 317 EDIPLLAGHFLQRFAERNRKAVKGFTPQAMDL-LIHYDWPGNIRELENAVERAV 369 (441) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH-HHCCCCCCHHHHHHHHHHHHH T ss_conf 00999999999999998499988889999999-970999989999999999999 No 191 >PRK10310 galactitol-specific PTS system component IIB; Provisional Probab=33.67 E-value=25 Score=14.59 Aligned_cols=55 Identities=9% Similarity=0.215 Sum_probs=37.9 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC-CCCCCCCCCHHHHHHCCCCCC Q ss_conf 588631333210001466776443112332212222-223310000000000012333 Q gi|255764474|r 199 TAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID-EQSSIEDWYRALNIFVAPPLY 255 (352) Q Consensus 199 ~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g-~~~~~~~~~~~adi~i~pS~~ 255 (352) -++-+|.|-.-+.--.+++++..++.|++ +.+.. ...++......+|+++..... T Consensus 5 IlVACGsGIATSTvva~kv~~~~~e~gi~--v~i~Q~~i~ev~~~~~~~DlivtTt~~ 60 (94) T PRK10310 5 IIVACGGAVATSTMAAEEIKELCQSHNIP--VELIQCRVNEIETYMDGVHLICTTARV 60 (94) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHCCCE--EEEEEEEHHHHHHCCCCCCEEEEECCC T ss_conf 99985883759999999999999985980--689997478755416897789992253 No 192 >cd05099 PTKc_FGFR4 Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to Probab=33.61 E-value=16 Score=15.98 Aligned_cols=90 Identities=14% Similarity=0.120 Sum_probs=48.8 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 0000000123332222221000001001110-27884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-+.+|.+.-. .....+-+.+. .|. -.-.+++-++++.+-+.+-++ +|+.|..+. T Consensus 215 ~ksDVw-------SfGvvl~El~t~G~~P~~~~~~~~~~~~i~--~g~-r~~~P~~~~~~l~~li~~Cw~~~P~~RPsf~ 284 (314) T cd05099 215 HQSDVW-------SFGILMWEIFTLGGSPYPGIPVEELFKLLR--EGH-RMDKPSNCTHELYMLMRECWHAVPTQRPTFK 284 (314) T ss_pred CCCCCC-------HHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCC-CCCCCCCCHHHHHHHHHHHHCCCHHHCCCHH T ss_conf 655533-------299999999847999999989999999997--589-8999977759999999998087865791989 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 99999999829989999999999999886 Q gi|255764474|r 322 NRGRERAVKHFSIVKEASDIGKVYDRLLR 350 (352) Q Consensus 322 ~~a~~~~~~~fs~~~~a~~~~~iY~~~l~ 350 (352) += .+ ..+.+...+.+-|.+++. T Consensus 285 eI-----~~--~Le~il~~~~~~~~~~~~ 306 (314) T cd05099 285 QL-----VE--ALDKVLAAVSEEYLDLSM 306 (314) T ss_pred HH-----HH--HHHHHHHHCCHHHHCCCC T ss_conf 99-----99--999999868886746057 No 193 >TIGR02995 ectoine_ehuB ectoine/hydroxyectoine ABC transporter solute-binding protein; InterPro: IPR014337 Members of this entry are the extracellular solute-binding proteins of ABC transporters and they closely resemble amino acid transporters. The member from Sinorhizobium meliloti (Rhizobium meliloti) is involved in ectoine uptake, both for osmoprotection and for catabolism. All other members of the seed alignment are found associated with ectoine catabolic genes.. Probab=33.32 E-value=18 Score=15.58 Aligned_cols=122 Identities=17% Similarity=0.256 Sum_probs=70.8 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC-CCCCCCCCC-CCCCCCCCH-HHHHHCCCCC Q ss_conf 4512366654565300477405886313332100014667764431123-322122222-233100000-0000001233 Q gi|255764474|r 178 LKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGL-KKRILFIDE-QSSIEDWYR-ALNIFVAPPL 254 (352) Q Consensus 178 ~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l-~~~V~~~g~-~~~~~~~~~-~adi~i~pS~ 254 (352) .||=..=|..+..+ .++|++++-++|.+.+ .+.+.+.|+ +++|.+.+. .+.+..+-. ++|++..|.. T Consensus 127 kKGNPk~L~sY~Di-AknP~~K~a~~~G~~e---------~~~ar~~GVk~eqi~~vPd~~~~l~~vq~~RaDay~~t~L 196 (285) T TIGR02995 127 KKGNPKELKSYKDI-AKNPEVKVAVVAGAVE---------VKYAREAGVKEEQIVVVPDTASALKAVQDKRADAYAATEL 196 (285) T ss_pred CCCCCCCCCCCCCH-HCCCCCEEEECCCCHH---------HHHHHHCCCCCCEEEEECCCHHHHHHHHHHCCCEECCCCC T ss_conf 58788887752020-0682312761247237---------8999865898660788248716788877403422003323 Q ss_pred C-CCC----CCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHH----HHHCCHHHH Q ss_conf 3-222----22210000010011102788410100234884249965999899999999----998698999 Q gi|255764474|r 255 Y-EGF----GLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVL----YFMNSKKIM 317 (352) Q Consensus 255 ~-Eg~----gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~----~l~~~~~~~ 317 (352) + .++ |-.-.|.|.==+-.++-|.|. -| +-.|+-+.++| ++|.+++. ++....+.. T Consensus 197 t~~~La~K~gd~nVE~~~pfvdPvvkdaPv---~i----~ygg~AFRpeD-keLRDafn~~L~k~~~s~e~~ 260 (285) T TIGR02995 197 TVKDLAKKAGDANVEVLAPFVDPVVKDAPV---RI----NYGGLAFRPED-KELRDAFNVELAKLKESDEFK 260 (285) T ss_pred CHHHHHHHCCCCCCEECCCCCCCCCCCCCC---EE----CCCCCCCCCCC-HHHHHHHHHHHHHHHCCCCCH T ss_conf 478898533897620004321531058873---41----16720116666-777879989999862465401 No 194 >PRK08445 hypothetical protein; Provisional Probab=33.25 E-value=29 Score=14.15 Aligned_cols=40 Identities=13% Similarity=0.292 Sum_probs=17.8 Q ss_pred HHCCCCCCCCEEEEEEEECHH-HHHHHHHHHHHHHHCCCCCEEE Q ss_conf 872889887279995122644-5123666545653004774058 Q gi|255764474|r 158 RHLKISEDAKLIGCFGRIRKL-KGTDLFVDCMINILPHHPGWTA 200 (352) Q Consensus 158 ~~~~~~~~~~~i~~~G~~~~~-Kg~~~li~a~~~l~~~~~~~~l 200 (352) ...|++.+... ..|.+... -.++-|. .+.++.++...++- T Consensus 192 H~lGi~ttaTM--lyGhiEt~e~rv~HL~-~lR~lQdeTgGF~~ 232 (348) T PRK08445 192 HKIGMKSTATM--MFGTVENDEEIIEHWE-HIRDLQDETGGFRA 232 (348) T ss_pred HHCCCCCCCEE--ECCCCCCHHHHHHHHH-HHHHHHHHCCCEEE T ss_conf 98699641213--6267799999999999-99999986199789 No 195 >cd05115 PTKc_Zap-70 Catalytic Domain of the Protein Tyrosine Kinase, Zeta-chain-associated protein of 70kDa. Protein Tyrosine Kinase (PTK) family; Zeta-chain-associated protein of 70kDa (Zap-70); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Zap-70 is a member of the Syk subfamily of kinases, which are cytoplasmic (or nonreceptor) tyr kinases containing two Src homology 2 (SH2) domains N-terminal to the catalytic tyr kinase domain. Zap-70 is primarily expressed in T-cells and NK cells, and is a crucial component in T-cell receptor (TCR) signaling. Zap-70 binds the phosphorylated ITAM (immunoreceptor tyr activation motif) sequences of the activated TCR zeta-chain through its SH2 domains, leading to its pho Probab=32.96 E-value=12 Score=16.67 Aligned_cols=71 Identities=10% Similarity=0.103 Sum_probs=42.7 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHH Q ss_conf 0000000001233322222210000010011102-7884101002348842499659998999999999986-9899999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSD 319 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~ 319 (352) +-..+|++ +||+++.|.++.|.+.-.. +...+-..+. .|. -.-.+++-|+++.+-+..-++ ||+.|.. T Consensus 175 ~t~ksDVw-------SfGv~l~E~~~~g~~P~~~~~~~~~~~~i~--~g~-r~~~p~~~p~~l~~li~~Cw~~~P~~RPt 244 (257) T cd05115 175 FSSRSDVW-------SYGITMWEAFSYGQKPYKKMKGPEVMSFIE--QGK-RLDCPAECPPEMYALMKDCWIYKWEDRPN 244 (257) T ss_pred CCCCCCEE-------HHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCC-CCCCCCCCCHHHHHHHHHHHCCCHHHCCC T ss_conf 88521174-------468999999847999999999899999997--589-99999877699999999982789667909 Q ss_pred HHH Q ss_conf 999 Q gi|255764474|r 320 TGN 322 (352) Q Consensus 320 ~~~ 322 (352) +.+ T Consensus 245 ~~~ 247 (257) T cd05115 245 FAK 247 (257) T ss_pred HHH T ss_conf 999 No 196 >PRK11361 acetoacetate metabolism regulatory protein AtoC; Provisional Probab=32.16 E-value=31 Score=13.95 Aligned_cols=32 Identities=13% Similarity=-0.023 Sum_probs=15.8 Q ss_pred HHHEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 0445897078787877077888899999997198599980 Q gi|255764474|r 3 MNNIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 3 ~~~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~ 42 (352) +++|.+|-..-. ....+...+.+.|+.+.... T Consensus 4 ~~rILIVDDd~~--------ir~~l~~~L~~~G~~V~~a~ 35 (457) T PRK11361 4 INRILIVDDEDN--------VRRMLSTAFALQGFETHCAN 35 (457) T ss_pred CCCEEEECCCHH--------HHHHHHHHHHHCCCEEEEEC T ss_conf 992899839999--------99999999997799899989 No 197 >PRK09567 nirA ferredoxin-nitrite reductase; Reviewed Probab=31.88 E-value=28 Score=14.24 Aligned_cols=48 Identities=13% Similarity=0.039 Sum_probs=22.4 Q ss_pred EEEEECHHHHHHHHHHHHHHHHCCCC--CE------EEEEECCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 95122644512366654565300477--40------58863133321000146677644311233 Q gi|255764474|r 171 CFGRIRKLKGTDLFVDCMINILPHHP--GW------TAVVVGKTTLKHYLFKKNLQRRIYANGLK 227 (352) Q Consensus 171 ~~G~~~~~Kg~~~li~a~~~l~~~~~--~~------~l~i~G~g~~~~~~~~~~l~~~i~~~~l~ 227 (352) -.||+.... +++++++.+++. ++ .++|. +.+.+ ...++...+.++|+. T Consensus 379 p~Grit~~q-----l~~LA~iA~~yG~geiRlT~~QNlil~-~V~~~---~l~~l~~~l~~~gl~ 434 (591) T PRK09567 379 PVGKMTTDQ-----MRGLAKIAAECGDGDIRLTVWQNLLLS-GVPDA---KVAEACAAIEALGLA 434 (591) T ss_pred CCCCCCHHH-----HHHHHHHHHHHCCCCEEECCCCCEEEC-CCCHH---HHHHHHHHHHHCCCC T ss_conf 788888899-----999999999858998998898887988-88999---999999999976999 No 198 >smart00072 GuKc Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions. Probab=31.79 E-value=32 Score=13.91 Aligned_cols=62 Identities=11% Similarity=0.091 Sum_probs=32.6 Q ss_pred CCCC-CCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHH-HCCHHHHH Q ss_conf 2222-21000001001110-278841010023488424996599989999999999-86989999 Q gi|255764474|r 257 GFGL-TPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYF-MNSKKIMS 318 (352) Q Consensus 257 g~gl-~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l-~~~~~~~~ 318 (352) |.+. .+-+++..|..||. .+..|...+-.........++.|.|.+.+.+.+..- ..+++... T Consensus 80 Gt~~~~I~~~~~~g~~~ildi~~~g~~~l~~~~~~~~~Ifi~pps~e~L~~RL~~Rg~~~~~~i~ 144 (184) T smart00072 80 GTSKETIRQVAEQGKHCLLDIDPQGVKQLRKAQLYPIVIFIAPPSSEELERRLRGRGTETAERIQ 144 (184) T ss_pred CCHHHHHHHHHCCCCEEEEEECHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCCHHHHH T ss_conf 41067899987269869999629999999985888079999389999999999716999999999 No 199 >PRK10834 hypothetical protein; Provisional Probab=31.61 E-value=32 Score=13.89 Aligned_cols=92 Identities=14% Similarity=0.157 Sum_probs=37.2 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCC-----CCCC Q ss_conf 6545653004774058863133321000146677644311233221222222--33100000000000123-----3322 Q gi|255764474|r 185 VDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPP-----LYEG 257 (352) Q Consensus 185 i~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS-----~~Eg 257 (352) ++|..+|-+...--.+++=|+....+......+++...+.|+.+...++++- ....+.+++-++|=.-+ -.=. T Consensus 70 ldaA~~LY~~GKv~~iLvSGDn~~~~YnEp~~Mk~~Li~~GVP~e~I~~D~AGfrT~DS~vRAk~VF~~~~~iIVTQ~FH 149 (239) T PRK10834 70 IQGAINAYNSGKVNYLLLSGDNALQSYNEPMTMRKDLIAAGVDPADIVLDYAGFRTLDSIVRTRKVFDTNDFIIITQRFH 149 (239) T ss_pred HHHHHHHHHCCCCCEEEECCCCCCCCCCCHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHCCCCEEEEECCHH T ss_conf 99999999869974898668999888982899999999859989995056676548999999999809982899946013 Q ss_pred CCCCCCCCCCCEEEEEECC Q ss_conf 2222100000100111027 Q gi|255764474|r 258 FGLTPLEAMASGIPVVASN 276 (352) Q Consensus 258 ~gl~~lEAma~G~PvI~s~ 276 (352) ...++.=|-..|+-.++-+ T Consensus 150 ~~RAlfiAr~~GidA~g~~ 168 (239) T PRK10834 150 CERALFIALHMGIQAQCYA 168 (239) T ss_pred HHHHHHHHHHCCCCEEEEE T ss_conf 8999999997498369971 No 200 >PRK10710 DNA-binding transcriptional regulator BaeR; Provisional Probab=31.24 E-value=32 Score=13.85 Aligned_cols=19 Identities=11% Similarity=-0.123 Sum_probs=12.4 Q ss_pred HHHHHHHHHHCCEEEEEEC Q ss_conf 8999999971985999805 Q gi|255764474|r 25 FGLCPIQRKLGQRLVVFGY 43 (352) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~ 43 (352) ..+...+...|+.+..... T Consensus 24 ~~l~~~L~~~g~~V~~a~~ 42 (240) T PRK10710 24 QLLIDYLRAASYAPTLISH 42 (240) T ss_pred HHHHHHHHHCCCEEEEECC T ss_conf 9999999988999999899 No 201 >PRK05889 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Provisional Probab=31.16 E-value=22 Score=15.02 Aligned_cols=42 Identities=29% Similarity=0.372 Sum_probs=29.9 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 3322222210000010011102788410100234884249965999 Q gi|255764474|r 254 LYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 254 ~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) ..+|-++.++|||=+-.|+.+...|-+.++.- ..|. .++.+| T Consensus 24 V~~Gd~l~vlEAMKME~~v~A~~~G~V~~i~v-~~Gd---~V~~Gd 65 (71) T PRK05889 24 IGEGDTLVLLESMKMEIPVLAEVAGTVSKVSV-SVGD---VIQAGD 65 (71) T ss_pred ECCCCEEEEEEHHCCCCEEECCCCCEEEEEEE-CCCC---EECCCC T ss_conf 88999999998200457784799909979997-8999---988999 No 202 >PRK10430 DNA-binding transcriptional activator DcuR; Provisional Probab=30.78 E-value=33 Score=13.80 Aligned_cols=10 Identities=30% Similarity=0.477 Sum_probs=3.5 Q ss_pred CCCCCCEEEE Q ss_conf 0000010011 Q gi|255764474|r 263 LEAMASGIPV 272 (352) Q Consensus 263 lEAma~G~Pv 272 (352) +++...|.|. T Consensus 212 ~~YG~~GRP~ 221 (239) T PRK10430 212 IHYGVTGRPV 221 (239) T ss_pred ECCCCCCCCC T ss_conf 3578999986 No 203 >TIGR01250 pro_imino_pep_2 proline-specific peptidases; InterPro: IPR005945 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. One of the members of this family is the tricorn protease (TRI) interacting factor 1 from Thermoplasma acidophilum. Factor 1 (F1) is a 33.5 kDa serine peptidase of the alpha/beta-hydrolase family. Tricorn generates small peptides, which are cleaved by F1 to yield single amino acids , . ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=30.21 E-value=34 Score=13.73 Aligned_cols=36 Identities=11% Similarity=0.234 Sum_probs=20.1 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHH-HCCEEEEEE Q ss_conf 45897078787877077888899999997-198599980 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRK-LGQRLVVFG 42 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~-~~~~~~~~~ 42 (352) ++.++.-+=.+ ++-..+...+-...++ .|..+..+. T Consensus 25 ~~~Ll~lHGGP--G~~H~Yl~~l~~~~~~~~G~~V~~YD 61 (302) T TIGR01250 25 KIKLLVLHGGP--GSSHEYLENLKELLKEDLGREVIMYD 61 (302) T ss_pred CCEEEEEECCC--CCCHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 92189970889--88742379999997640782799983 No 204 >PRK08174 DNA topoisomerase III; Validated Probab=30.15 E-value=34 Score=13.73 Aligned_cols=86 Identities=13% Similarity=0.073 Sum_probs=54.9 Q ss_pred CCCCCEEEEEECCCCCCCHHCCCCC--CCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 0000100111027884101002348--84249965999899999999998698999999999999999829989999999 Q gi|255764474|r 264 EAMASGIPVVASNTGVFSELLDPEN--AKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDI 341 (352) Q Consensus 264 EAma~G~PvI~s~~~~~~e~i~~~~--g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~ 341 (352) |...-|+|. |..+-+..+++... .....+++..--..+.+.+...+.+++.-..|-..-.+....+.+|+....++ T Consensus 515 E~~GIGtpa--TrA~II~~L~~R~Yv~~~~k~l~pT~~G~~li~~l~~~l~~~~~Ta~~E~~Ld~I~~G~~~~~~f~~~~ 592 (670) T PRK08174 515 DTTGIGTEA--TRAGIIQGLLDRGYLVKKGKALRATDAAFTLIDAVPAAIADPGTTALWEQALDMIAAGQMTLDVFMAKQ 592 (670) T ss_pred CCCCCCCCC--CHHHHHHHHHHCCCEEECCCEEEECHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 057878503--699999999867989820987602489999999768762796789999999999985998899999999 Q ss_pred HHHHHHHHHH Q ss_conf 9999998860 Q gi|255764474|r 342 GKVYDRLLRT 351 (352) Q Consensus 342 ~~iY~~~l~~ 351 (352) .....++++. T Consensus 593 ~~~~~~~v~~ 602 (670) T PRK08174 593 AAWVSQLVAQ 602 (670) T ss_pred HHHHHHHHHH T ss_conf 9999999998 No 205 >COG2949 SanA Uncharacterized membrane protein [Function unknown] Probab=30.10 E-value=34 Score=13.72 Aligned_cols=99 Identities=14% Similarity=0.171 Sum_probs=58.6 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHH------CCCC Q ss_conf 3666545653004774058863133321000146677644311233221222222--3310000000000------0123 Q gi|255764474|r 182 DLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIF------VAPP 253 (352) Q Consensus 182 ~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~------i~pS 253 (352) ..=|+|..++-+...--.|.+-|+....+...-..+++-..+.|+.....++++- ..+.+.+++..+| +. | T Consensus 79 ~~Ri~aA~~ly~~gKV~~LLlSGDN~~~sYnEp~tM~kdL~~~GVp~~~i~lDyAGFrTLDSvvRA~kVF~~~~ftII-t 157 (235) T COG2949 79 TYRIDAAIALYKAGKVNYLLLSGDNATVSYNEPRTMRKDLIAAGVPAKNIFLDYAGFRTLDSVVRARKVFGTNDFTII-T 157 (235) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCHHHEEECCCCCCHHHHHHHHHHHCCCCCEEEE-E T ss_conf 999999999986477049998168775346545899999998399888903200374178989989987076727998-2 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 3322222210000010011102788410 Q gi|255764474|r 254 LYEGFGLTPLEAMASGIPVVASNTGVFS 281 (352) Q Consensus 254 ~~Eg~gl~~lEAma~G~PvI~s~~~~~~ 281 (352) -.=....++.-|+..|+-.||-.+++.. T Consensus 158 Q~FHceRAlfiA~~~gIdAic~~ap~p~ 185 (235) T COG2949 158 QRFHCERALFIARQMGIDAICFAAPDPE 185 (235) T ss_pred CCCCCHHHHHHHHHHCCCEEEECCCCCC T ss_conf 0123488999999829754786589944 No 206 >pfam01113 DapB_N Dihydrodipicolinate reductase, N-terminus. Dihydrodipicolinate reductase (DapB) reduces the alpha,beta-unsaturated cyclic imine, dihydro-dipicolinate. This reaction is the second committed step in the biosynthesis of L-lysine and its precursor meso-diaminopimelate, which are critical for both protein and cell wall biosynthesis. The N-terminal domain of DapB binds the dinucleotide NADPH. Probab=29.96 E-value=23 Score=14.84 Aligned_cols=45 Identities=18% Similarity=0.179 Sum_probs=29.8 Q ss_pred CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 331000000000001233322222210000010011102788410 Q gi|255764474|r 237 SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFS 281 (352) Q Consensus 237 ~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~ 281 (352) ++.......+|+++=-|.-++.--.+-.+...|+|+|.-.+|-.. T Consensus 57 ~~~~~~~~~~DVvIDFS~p~~~~~~~~~~~~~~~~~ViGTTG~s~ 101 (122) T pfam01113 57 DDLEEVLADADVLIDFTTPEATLENLELALKHGKPLVIGTTGFTE 101 (122) T ss_pred CCHHHHCCCCCEEEEECCHHHHHHHHHHHHHCCCCEEEECCCCCH T ss_conf 447775157888999068789999999999679988998999999 No 207 >PRK05654 acetyl-CoA carboxylase subunit beta; Validated Probab=29.83 E-value=28 Score=14.27 Aligned_cols=21 Identities=14% Similarity=0.332 Sum_probs=8.6 Q ss_pred HHHHHHCCCCCCCCEEEEEEEEC Q ss_conf 68788728898872799951226 Q gi|255764474|r 154 QEARRHLKISEDAKLIGCFGRIR 176 (352) Q Consensus 154 ~~~~~~~~~~~~~~~i~~~G~~~ 176 (352) .+.+++.+..+ -++...|.+. T Consensus 102 ~~a~kkTg~~d--av~~g~G~I~ 122 (288) T PRK05654 102 KAAQKKTGLKD--AVVTGKGTIE 122 (288) T ss_pred HHHHHHCCCCC--EEEEEEEEEC T ss_conf 99998709750--7999899999 No 208 >TIGR01658 EYA-cons_domain EYA conserved domain; InterPro: IPR006545 The Eyes absent proteins are members of a conserved regulatory network implicated in the development of the eye, muscle, kidney and ear. Eyes absent is a nuclear transcription factor, acting through interaction with homeodomain-containing Sine oculis (also known as Six) proteins. Eyes absent is also a protein tyrosine phosphatase , , It does not resemble the classical tyrosine phosphatases that use cysteine as a nucleophile and proceed by means of a thiol-phosphate intermediate. Rather, Eyes absent is the prototype for a class of protein tyrosine phosphatases that use a nucleophilic aspartic acid in a metal-dependent reaction. Furthermore, the phosphatase activity of Eyes absent contributes to its ability to induce eye formation in drosophila. Thus, Eyes absent belongs to the phosphatase subgroup of the haloacid dehalogenase (HAD) superfamily and appears to act as a nuclear transcriptional coactivator with intrinsic phosphatase activity. The Eyes absent proteins contain a divergent 200300-residue, amino-terminal region and a conserved carboxy-terminal domain of 271274 residues (EYA domain), which is critical for activity and believed to participate in proteinprotein interactions ; GO: 0007275 multicellular organismal development. Probab=29.49 E-value=35 Score=13.65 Aligned_cols=53 Identities=17% Similarity=0.325 Sum_probs=29.8 Q ss_pred HHHHHHHCC--CCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 545653004--774058863133321000146677644311233221222222331000000000 Q gi|255764474|r 186 DCMINILPH--HPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNI 248 (352) Q Consensus 186 ~a~~~l~~~--~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi 248 (352) +.|..+..+ +|+++++.+|+|.+ |+...++-+.+.- +| ..++|+...|.+-++ T Consensus 262 ~CF~rI~~RF~~Pk~~yv~IGDG~e------Ee~AA~~~~~PF~-kI---~~h~D~~~~f~aL~l 316 (319) T TIGR01658 262 QCFKRIKERFGKPKVRYVVIGDGRE------EETAAKAMNWPFV-KI---ELHPDLSHLFPALEL 316 (319) T ss_pred HHHHHHHHHCCCCCEEEEEEECCHH------HHHHHHHHCCCEE-EE---ECCCCHHHHCCCCCC T ss_conf 2102444204879158998607818------9999998359838-98---247883542267888 No 209 >KOG1588 consensus Probab=29.12 E-value=33 Score=13.79 Aligned_cols=59 Identities=14% Similarity=0.231 Sum_probs=36.3 Q ss_pred EEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 79995122644512366654565300477405886313332100014667764431123322122 Q gi|255764474|r 168 LIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILF 232 (352) Q Consensus 168 ~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~ 232 (352) -+=|+||+---||-- +.+|.+. -.+++.|-|.|..+|....++++....-..|.+..+. T Consensus 105 ~fNFVGRILGPrGnS-----lkrLe~e-TgCki~IrGrgSmrD~~KEE~lR~~p~yeHL~epLHV 163 (259) T KOG1588 105 KFNFVGRILGPRGNS-----LKRLEEE-TGCKIMIRGRGSMRDKAKEEELRGDPGYEHLNEPLHV 163 (259) T ss_pred CCCCCCCCCCCCCCH-----HHHHHHH-HCCEEEEECCCCCCCHHHHHHHHCCCCHHHHCCCCEE T ss_conf 876113110687531-----9999998-7976999668766665779985148466774887279 No 210 >cd05066 PTKc_EphR_A Catalytic Domain of the Protein Tyrosine Kinases, Class EphA Ephrin Receptors. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; most class EphA receptors including EphA3, EphA4, EphA5, and EphA7, but excluding EphA1, EphA2 and EphA10; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. One exception is EphA4, which also binds ephrins-B2/B3. EphRs contain an ephrin-binding domain and two fibronectin repeats extracellul Probab=29.09 E-value=25 Score=14.59 Aligned_cols=68 Identities=15% Similarity=0.182 Sum_probs=36.8 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEE--CCCCHHHHHHHHHHHHC-CHHHHHHH Q ss_conf 0000000123332222221000001001110278841010023488424996--59998999999999986-98999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIV--PPRNLHALEKAVLYFMN-SKKIMSDT 320 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~--~~~d~~~la~~i~~l~~-~~~~~~~~ 320 (352) ..+|++ +||.++.|-++.|.+.-... ...+++..- ..|.-. +.+-++++.+-+..-++ ||+.|..+ T Consensus 188 ~~sDVw-------S~Gv~l~Ellt~G~~P~~~~--~~~~~~~~i--~~g~r~p~p~~~p~~l~~li~~C~~~dP~~RPs~ 256 (267) T cd05066 188 SASDVW-------SYGIVMWEVMSYGERPYWEM--SNQDVIKAI--EEGYRLPAPMDCPAALHQLMLDCWQKERNERPKF 256 (267) T ss_pred CCCCCC-------CCHHHHHHHHCCCCCCCCCC--CHHHHHHHH--HCCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCH T ss_conf 655777-------62200538864899998889--989999999--7058799975578999999999818796579299 Q ss_pred HH Q ss_conf 99 Q gi|255764474|r 321 GN 322 (352) Q Consensus 321 ~~ 322 (352) .+ T Consensus 257 ~e 258 (267) T cd05066 257 EQ 258 (267) T ss_pred HH T ss_conf 99 No 211 >COG1618 Predicted nucleotide kinase [Nucleotide transport and metabolism] Probab=29.04 E-value=31 Score=13.94 Aligned_cols=73 Identities=14% Similarity=0.224 Sum_probs=40.1 Q ss_pred CCCCCHHHHHHCCC------CCCCCCCCCCCCCCCCEEEEEEC-----CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHH Q ss_conf 10000000000012------33322222210000010011102-----78841010023488424996599989999999 Q gi|255764474|r 239 IEDWYRALNIFVAP------PLYEGFGLTPLEAMASGIPVVAS-----NTGVFSELLDPENAKAGVIVPPRNLHALEKAV 307 (352) Q Consensus 239 ~~~~~~~adi~i~p------S~~Eg~gl~~lEAma~G~PvI~s-----~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i 307 (352) +...+..||+.+.- -.+-.|.-.+-|.|-|+.|+|+| ..|...++- ...+. =+++.+.|-+.+...+ T Consensus 94 l~rA~~~aDvIIIDEIGpMElks~~f~~~ve~vl~~~kpliatlHrrsr~P~v~~ik-~~~~v-~v~lt~~NR~~i~~~I 171 (179) T COG1618 94 LRRALEEADVIIIDEIGPMELKSKKFREAVEEVLKSGKPLIATLHRRSRHPLVQRIK-KLGGV-YVFLTPENRNRILNEI 171 (179) T ss_pred HHHHHHCCCEEEEECCCCHHHCCHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHH-HCCCE-EEEECCCHHHHHHHHH T ss_conf 998863499899943363302008899999999658993799996256775899864-24877-9997512156799999 Q ss_pred HHHHCC Q ss_conf 999869 Q gi|255764474|r 308 LYFMNS 313 (352) Q Consensus 308 ~~l~~~ 313 (352) ..++.. T Consensus 172 l~~L~~ 177 (179) T COG1618 172 LSVLKG 177 (179) T ss_pred HHHHCC T ss_conf 998524 No 212 >TIGR00016 ackA acetate kinase; InterPro: IPR004372 Acetate kinase is involved in the activation of acetate to acetyl CoA and in the secretion of acetate. It catalyzes the reaction ATP + acetate = ADP + acetyl phosphate.; GO: 0016301 kinase activity, 0016774 phosphotransferase activity carboxyl group as acceptor, 0006082 organic acid metabolic process. Probab=28.55 E-value=36 Score=13.55 Aligned_cols=87 Identities=16% Similarity=0.188 Sum_probs=42.3 Q ss_pred CCCCCCCCCCCCCCCCEEEE------EECC-----CCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 33322222210000010011------1027-----884101002348842499659998999999999986989999999 Q gi|255764474|r 253 PLYEGFGLTPLEAMASGIPV------VASN-----TGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNSKKIMSDTG 321 (352) Q Consensus 253 S~~Eg~gl~~lEAma~G~Pv------I~s~-----~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~~~~~~~~~ 321 (352) |..-.+|+++||-++||+-. |.+- --.+.|+...=+-+.|++- ..+++.-+..+.+.+++-.+-+ T Consensus 237 S~DTSMG~TPLEGL~MGTRSGdIDPAii~y~~~~~~~s~~~i~~~LNKkSGlLG----isg~sSDlR~i~~~~~e~~e~a 312 (416) T TIGR00016 237 SIDTSMGFTPLEGLVMGTRSGDIDPAIISYLAETLGMSADDIENTLNKKSGLLG----ISGLSSDLRDIEDAYEEGNEQA 312 (416) T ss_pred EECCCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCHHHHHH----HHHHHHHHHHHHHHHHCCCHHH T ss_conf 462578873146501342447401799999998728988899987500221345----6543467899997363267778 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 9999999982998999999999999988 Q gi|255764474|r 322 NRGRERAVKHFSIVKEASDIGKVYDRLL 349 (352) Q Consensus 322 ~~a~~~~~~~fs~~~~a~~~~~iY~~~l 349 (352) +.|.+....+ ..+|..-|-..| T Consensus 313 ~~A~~~y~~R------~~~YIg~Y~A~L 334 (416) T TIGR00016 313 KLAIKVYVHR------IAKYIGSYIASL 334 (416) T ss_pred HHHHHHHHHH------HHHHHHHHHHHC T ss_conf 9999999999------999999999722 No 213 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=28.55 E-value=36 Score=13.55 Aligned_cols=27 Identities=37% Similarity=0.503 Sum_probs=19.9 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCC Q ss_conf 2221000001001110-27884101002 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLD 285 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~ 285 (352) ..++.||..+|.|+|+ .|...-+..|+ T Consensus 150 ~~ai~Ea~~l~IP~I~ivDTn~~p~~id 177 (205) T pfam00318 150 HIAIKEASKLGIPVIAIVDTNCDPDLID 177 (205) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCC T ss_conf 3899999875997565405999954387 No 214 >cd06611 STKc_SLK_like Serine/threonine kinases (STKs), Ste20-like kinase (SLK)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The SLK-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of the subfamily include SLK, STK10 (also called LOK for lymphocyte-oriented kinase), SmSLK (Schistosoma mansoni SLK), and related proteins. SLK promotes apoptosis through apoptosis signal-regulating kinase 1 (ASK1) and the mitogen-activated protein kinase (MAPK) p38. It also plays a role in mediating actin reorganization. STK10 is responsible in regulating the CD28 responsive element in T cells, as well as leukocyte function associated antigen (LFA-1)-mediated lymphocyte adhesion. SmSLK is capable of activating the Probab=28.39 E-value=19 Score=15.35 Aligned_cols=89 Identities=15% Similarity=0.091 Sum_probs=45.4 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEE-EEECCCCHHHHHHHHHHHHC-CHHHHHHHHH Q ss_conf 000000123332222221000001001110278841010023488424-99659998999999999986-9899999999 Q gi|255764474|r 245 ALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAG-VIVPPRNLHALEKAVLYFMN-SKKIMSDTGN 322 (352) Q Consensus 245 ~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G-~~~~~~d~~~la~~i~~l~~-~~~~~~~~~~ 322 (352) .+|++ ++|.++.| |+.|.|.-... .....+.....+... +..+..-+.++.+-|.+++. ||+.|-.+.+ T Consensus 188 ~~DIW-------SlGv~lye-l~~g~~Pf~~~-~~~~~~~~i~~~~~~~~~~~~~~s~~~~dli~~~l~~dP~~R~t~~e 258 (280) T cd06611 188 KADIW-------SLGITLIE-LAQMEPPHHEL-NPMRVLLKILKSEPPTLDQPSKWSSSFNDFLKSCLVKDPDDRPTAAE 258 (280) T ss_pred CCCCH-------HHHHHHHH-HHHCCCCCCCC-CHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHCCCHHH T ss_conf 45161-------57999999-98698999999-99999999974999876776558999999999986799568909999 Q ss_pred HHHHHHHHHCCHHHHHHHHH Q ss_conf 99999998299899999999 Q gi|255764474|r 323 RGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 323 ~a~~~~~~~fs~~~~a~~~~ 342 (352) --.--..++..+.+..+.++ T Consensus 259 ll~Hp~~~~~~~~~~~~~l~ 278 (280) T cd06611 259 LLKHPFVSDQSDNKAIKDLL 278 (280) T ss_pred HHCCHHHCCCCCCHHHHHHH T ss_conf 96496137799841688874 No 215 >cd06325 PBP1_ABC_uncharacterized_transporter Type I periplasmic ligand-binding domain of uncharacterized ABC-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); its ligand specificity has not been determined experimentally. Probab=28.06 E-value=37 Score=13.49 Aligned_cols=67 Identities=15% Similarity=0.172 Sum_probs=30.4 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC----CCHHHHHHCCCCCC---CCCCCCCCCCCCCEEEEEECC Q ss_conf 00014667764431123322122222233100----00000000012333---222222100000100111027 Q gi|255764474|r 210 HYLFKKNLQRRIYANGLKKRILFIDEQSSIED----WYRALNIFVAPPLY---EGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 210 ~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~----~~~~adi~i~pS~~---Eg~gl~~lEAma~G~PvI~s~ 276 (352) .....+.+++..+++|++=...-.....|+.. +-...|++..|+.. .........+...++|++++. T Consensus 145 s~~~~~~~~~~a~~~Gi~v~~~~v~~~~ei~~a~~~l~~~~Dal~~~~d~~v~s~~~~i~~~a~~~~iPv~~~~ 218 (281) T cd06325 145 SVVQVKELKKAAAKLGIEVVEATVSSSNDVQQAAQSLAGKVDAIYVPTDNTVASAMEAVVKVANEAKIPVIASD 218 (281) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHCCCCEEECC T ss_conf 69999999999997698899994598889999999756258999991881277799999999987499889367 No 216 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=28.04 E-value=28 Score=14.32 Aligned_cols=55 Identities=15% Similarity=0.119 Sum_probs=30.3 Q ss_pred EEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 5122644512366654565300477405886313332100014667764431123322122222 Q gi|255764474|r 172 FGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE 235 (352) Q Consensus 172 ~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~ 235 (352) .+.++..++. +++..++..+|++..+++.+... ....-+.+++.|..+.|.+.|+ T Consensus 164 ~~~~~~~~~~----~~~~~~L~~~p~i~ai~~~~d~~-----a~Ga~~a~~~~G~~~~i~vvg~ 218 (277) T cd06319 164 QKDFSYQETF----DYTNDLLTANPDIRAIWLQGSDR-----YQGALDAIATAGKTGKVLLICF 218 (277) T ss_pred ECCCCHHHHH----HHHHHHHHHCCCCCEEEECCCHH-----HHHHHHHHHHCCCCCCEEEEEC T ss_conf 0899999999----99999997389976899669889-----9999999997599998589963 No 217 >cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=28.04 E-value=37 Score=13.49 Aligned_cols=21 Identities=19% Similarity=0.159 Sum_probs=9.5 Q ss_pred CCCEEEEECCCCHHHHHHHHH Q ss_conf 884249965999899999999 Q gi|255764474|r 288 NAKAGVIVPPRNLHALEKAVL 308 (352) Q Consensus 288 ~g~~G~~~~~~d~~~la~~i~ 308 (352) ++...+++++.-....++.|. T Consensus 216 ~~v~~If~E~~~~~k~~~~ia 236 (266) T cd01018 216 KGVRVVFVQPQFSTKSAEAIA 236 (266) T ss_pred CCCCEEEEECCCCCHHHHHHH T ss_conf 399899970899909999999 No 218 >TIGR02041 CysI sulfite reductase (NADPH) hemoprotein, beta-component; InterPro: IPR011786 Sulphite reductase (NADPH) catalyzes a six electron reduction of sulphite to sulphide in prokaryotic organisms. It is a complex oligomeric enzyme composed of two different peptides with a subunit composition of alpha(8)-beta(4). The alpha component, encoded by cysJ, is a flavoprotein containing both FMN and FAD, while the beta component, encoded by cysI, is a siroheme, iron-sulphur protein.; GO: 0004783 sulfite reductase (NADPH) activity, 0050661 NADP binding, 0051539 4 iron 4 sulfur cluster binding, 0008652 amino acid biosynthetic process, 0009337 sulfite reductase complex (NADPH). Probab=27.97 E-value=37 Score=13.48 Aligned_cols=98 Identities=14% Similarity=0.101 Sum_probs=55.6 Q ss_pred EEEEEEECHHHHHHHHHHHHHHHHCCC-------CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 999512264451236665456530047-------7405886313332100014667764431123322122222233100 Q gi|255764474|r 169 IGCFGRIRKLKGTDLFVDCMINILPHH-------PGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIED 241 (352) Q Consensus 169 i~~~G~~~~~Kg~~~li~a~~~l~~~~-------~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~ 241 (352) .+--||+....+-.++ .-+.++++-| ||=-++|++ .++.. ..++..++.++||.+ | +..... T Consensus 351 fie~GR~~D~~~~~l~-~GlreIA~iH~GdfRiT~NQNlIiA~-V~e~~---K~~Ie~l~~~yGL~~-----g-~~~t~L 419 (550) T TIGR02041 351 FIESGRIIDYPDKPLL-TGLREIAKIHKGDFRITSNQNLIIAN-VPEGG---KAKIEALARQYGLID-----G-KKVTAL 419 (550) T ss_pred EECCCEEEECCCCHHH-HHHHHHHHCCCCCEEECCCCCEEEEC-CCCHH---HHHHHHHHHHHCCCC-----C-CCCHHH T ss_conf 8427524526863467-88999972289975344776378842-78423---688999999818767-----8-641156 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCC---------------CEE--EEEECCCCCCC Q ss_conf 00000000012333222222100000---------------100--11102788410 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMA---------------SGI--PVVASNTGVFS 281 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma---------------~G~--PvI~s~~~~~~ 281 (352) --++..|+.+| ++||++.||=- .|+ -.|++++.|+| T Consensus 420 R~~smaCValP----TCpLAMAEAER~LP~Fi~~~~~i~~k~Gl~~~~I~~R~tGCP 472 (550) T TIGR02041 420 RRNSMACVALP----TCPLAMAEAERYLPDFIDKLDNIMEKHGLADEEIVLRITGCP 472 (550) T ss_pred HHHCCCEECCC----CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCC T ss_conf 76277444467----752369999745589999999999960888760789852798 No 219 >PRK05472 redox-sensing transcriptional repressor Rex; Provisional Probab=27.94 E-value=37 Score=13.48 Aligned_cols=40 Identities=20% Similarity=0.218 Sum_probs=30.0 Q ss_pred EEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf 9995122644512366654565300477405886313332 Q gi|255764474|r 169 IGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTL 208 (352) Q Consensus 169 i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~ 208 (352) ++..|+-..--.++.|++.+.+++.....|..+++|-|.. T Consensus 56 fG~fG~~G~GY~V~~L~~~i~~iLG~~~~~~v~lvGaGnL 95 (211) T PRK05472 56 FGELGKRGVGYNVKELLDFIEKILGLDKTTNVALVGAGNL 95 (211) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCHH T ss_conf 6457999988129999999999968999750899888779 No 220 >cd05091 PTKc_Ror2 Catalytic Domain of the Protein Tyrosine Kinase, Receptor tyrosine kinase-like Orphan Receptor 2. Protein Tyrosine Kinase (PTK) family; Receptor tyrosine kinase-like orphan receptor 2 (Ror2); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ror proteins are orphan receptor tyr kinases (RTKs) containing an extracellular region with immunoglobulin-like, cysteine-rich, and kringle domains, a transmembrane segment, and an intracellular catalytic domain. Ror RTKs are unrelated to the nuclear receptor subfamily called retinoid-related orphan receptors (RORs). RTKs are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase cataly Probab=27.72 E-value=14 Score=16.24 Aligned_cols=71 Identities=13% Similarity=0.107 Sum_probs=40.7 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-ECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHH Q ss_conf 0000000012333222222100000100111-027884101002348842499659998999999999986-98999999 Q gi|255764474|r 243 YRALNIFVAPPLYEGFGLTPLEAMASGIPVV-ASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDT 320 (352) Q Consensus 243 ~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI-~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~ 320 (352) =..+|++ .||.++.|-++.|.+.- ......+.+.+. ++.. .-.+.+-+..+.+-+.+-++ ||+.|..+ T Consensus 204 ~~~sDV~-------SfGvvl~El~t~g~~P~~~~~~~~~~~~i~--~~~~-~~~p~~~p~~l~~li~~C~~~dP~~RPt~ 273 (283) T cd05091 204 SIDSDIW-------SYGVVLWEIFSYGLQPYCGYSNQDVIEMIR--NRQV-LPCPDDCPAWIYTLMLECWNEFPSRRPRF 273 (283) T ss_pred CCCCCCH-------HHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCCHHHHHHHHHHCCCCHHHCCCH T ss_conf 8403310-------179999999948999999989899999997--2898-99976571999999999817796579099 Q ss_pred HHH Q ss_conf 999 Q gi|255764474|r 321 GNR 323 (352) Q Consensus 321 ~~~ 323 (352) .+= T Consensus 274 ~~I 276 (283) T cd05091 274 KDI 276 (283) T ss_pred HHH T ss_conf 999 No 221 >PRK13119 consensus Probab=27.31 E-value=38 Score=13.40 Aligned_cols=13 Identities=31% Similarity=0.516 Sum_probs=5.3 Q ss_pred HHHHHHHHHCCCC Q ss_conf 6677644311233 Q gi|255764474|r 215 KNLQRRIYANGLK 227 (352) Q Consensus 215 ~~l~~~i~~~~l~ 227 (352) +++.+.+++.|+. T Consensus 135 ~~~~~~~~~~gl~ 147 (261) T PRK13119 135 DPLYRELKDNGVD 147 (261) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999975997 No 222 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=26.98 E-value=19 Score=15.47 Aligned_cols=36 Identities=11% Similarity=-0.066 Sum_probs=18.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 45897078787877077888899999997198599980 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~ 42 (352) ||-++.+++...+.-. .....-..-+++|..+.... T Consensus 1 KIg~~~~tl~npff~~--~~~g~~~~A~e~G~~l~~~~ 36 (289) T cd01540 1 KIGFIVKQPEEPWFQT--EWKFAKKAAKEKGFTVVKID 36 (289) T ss_pred CEEEEECCCCCHHHHH--HHHHHHHHHHHHCCEEEEEC T ss_conf 9899958998839999--99999999998498799976 No 223 >PRK06857 consensus Probab=26.80 E-value=38 Score=13.34 Aligned_cols=19 Identities=21% Similarity=0.350 Sum_probs=8.6 Q ss_pred CCCCCCCCCCHHHHHHCCC Q ss_conf 2223310000000000012 Q gi|255764474|r 234 DEQSSIEDWYRALNIFVAP 252 (352) Q Consensus 234 g~~~~~~~~~~~adi~i~p 252 (352) +..+++..+-...++...| T Consensus 92 ~~~~~v~~~a~~~~i~~iP 110 (209) T PRK06857 92 GFNPNTVKYCQQLNIPIVP 110 (209) T ss_pred CCCHHHHHHHHHCCCCEEC T ss_conf 9999999999974996547 No 224 >cd06308 PBP1_sensor_kinase_like Periplasmic binding domain of two-component sensor kinase signaling systems. Periplasmic binding domain of two-component sensor kinase signaling systems, some of which are fused with a C-terminal histidine kinase A domain (HisK) and/or a signal receiver domain (REC). Members of this group share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily and are predicted to be involved in sensing of environmental stimuli; their substrate specificities, however, are not known in detail. Probab=26.79 E-value=38 Score=13.34 Aligned_cols=44 Identities=20% Similarity=0.225 Sum_probs=18.0 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 5456530047740588631333210001466776443112332212222 Q gi|255764474|r 186 DCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID 234 (352) Q Consensus 186 ~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g 234 (352) +++..++..+|++..+++.+... .-...+-+++.|..+.|.+.| T Consensus 171 ~~~~~~L~~~pdi~aI~a~nD~~-----A~Ga~~Al~~aG~~~~i~vvg 214 (270) T cd06308 171 EKMEELLQANPDIDLVYAHNDPM-----ALGAYLAAKRAGREKEIKFIG 214 (270) T ss_pred HHHHHHHHHCCCCCEEEECCCHH-----HHHHHHHHHHCCCCCCCEEEE T ss_conf 99999985088888899778658-----999999999769999958998 No 225 >cd05084 PTKc_Fes Catalytic Domain of the Protein Tyrosine Kinase, Fes. Protein Tyrosine Kinase (PTK) family; Fes (or Fps) kinase subfamily; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Fes subfamily proteins are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal region with FCH (Fes/Fer/CIP4 homology) and coiled-coil domains, followed by a SH2 domain, and a C-terminal catalytic domain. The genes for Fes (feline sarcoma) and Fps (Fujinami poultry sarcoma) were first isolated from tumor-causing retroviruses. The viral oncogenes encode chimeric Fes proteins consisting of Gag sequences at the N-termini, resulting in unregulated tyr kinase activity. Fes kinase is expressed in myeloid, vascular Probab=26.55 E-value=16 Score=15.84 Aligned_cols=70 Identities=10% Similarity=0.122 Sum_probs=41.3 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHH Q ss_conf 00000000123332222221000001001110-27884101002348842499659998999999999986-98999999 Q gi|255764474|r 243 YRALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDT 320 (352) Q Consensus 243 ~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~ 320 (352) -..+|++ .||.++.|-+++|.+.-. .......+.+. .+.. .-.+..-++++.+-+.+.++ ||+.|..+ T Consensus 174 ~~~~DVw-------S~Gi~l~E~~~~g~~P~~~~~~~~~~~~i~--~~~~-~~~p~~~~~~l~~li~~c~~~dP~~RPt~ 243 (252) T cd05084 174 SSESDVW-------SFGILLWETFSLGAVPYANLTNQQTREAVE--QGVR-LPAPENCPDEVYRLMCQCWEYDPRKRPSF 243 (252) T ss_pred CCCCCCC-------CHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCHHHHHHHHHHHCCCCHHHCCCH T ss_conf 9622001-------209999999937999998999899999998--2788-99964465999999999808896679399 Q ss_pred HH Q ss_conf 99 Q gi|255764474|r 321 GN 322 (352) Q Consensus 321 ~~ 322 (352) .+ T Consensus 244 ~~ 245 (252) T cd05084 244 SK 245 (252) T ss_pred HH T ss_conf 99 No 226 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=26.52 E-value=20 Score=15.21 Aligned_cols=17 Identities=41% Similarity=0.741 Sum_probs=12.5 Q ss_pred CCCCCCCEEEEEECCCC Q ss_conf 10000010011102788 Q gi|255764474|r 262 PLEAMASGIPVVASNTG 278 (352) Q Consensus 262 ~lEAma~G~PvI~s~~~ 278 (352) ++||+-+|+-+|+|-+| T Consensus 224 l~eA~naGV~~I~TaHg 240 (282) T TIGR02858 224 LLEALNAGVSVIATAHG 240 (282) T ss_pred HHHHHCCCCEEEEEECC T ss_conf 99986167568876404 No 227 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=26.41 E-value=39 Score=13.30 Aligned_cols=28 Identities=18% Similarity=0.159 Sum_probs=12.6 Q ss_pred HHHCCEEEECCHHHHHHCCCCEEEECCCCC Q ss_conf 962668998898999743688399748827 Q gi|255764474|r 114 ISRMDEVITTSQKSARFIERPSTVIMHGVD 143 (352) Q Consensus 114 ~~~~d~ii~~s~~~~~~~~~~~~vi~~gid 143 (352) .......+-.|.... ...+ +.|+-+-.+ T Consensus 229 ~~~~~~~L~~S~~~~-~~~~-~~~~~~~~~ 256 (705) T TIGR00644 229 LTVKNDMLLNSNKLL-AVLE-VIVIDEKVV 256 (705) T ss_pred HHCCCCCCCCHHHHH-HHHE-EEEEECHHH T ss_conf 624777666603464-3210-115644157 No 228 >cd05063 PTKc_EphR_A2 Catalytic Domain of the Protein Tyrosine Kinase, Ephrin Receptor A2. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; EphA2 receptor; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. EphRs contain an ephrin binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchored Probab=26.23 E-value=23 Score=14.85 Aligned_cols=70 Identities=17% Similarity=0.185 Sum_probs=38.9 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEE--EECCCCHHHHHHHHHHHHC-CHHHHH Q ss_conf 0000000001233322222210000010011102788410100234884249--9659998999999999986-989999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGV--IVPPRNLHALEKAVLYFMN-SKKIMS 318 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~--~~~~~d~~~la~~i~~l~~-~~~~~~ 318 (352) +-..+|++ +||.++.|-|++|.+.-.... ..+++..-. .|. -.+.+-++++.+-+.+-++ ||..|. T Consensus 187 ~~~~~Dvy-------SfGvil~El~t~g~~P~~~~~--~~~~~~~i~--~~~~~p~~~~~p~~l~~li~~C~~~~p~~RP 255 (268) T cd05063 187 FTSASDVW-------SFGIVMWEVMSFGERPYWDMS--NHEVMKAIN--DGFRLPAPMDCPSAVYQLMLQCWQQDRARRP 255 (268) T ss_pred CCCCCCCC-------CCHHHHHHHHCCCCCCCCCCC--HHHHHHHHH--CCCCCCCCCCCHHHHHHHHHHHHCCCHHHCC T ss_conf 99887678-------419999999826999987899--899999987--2587998755789999999998188965791 Q ss_pred HHHH Q ss_conf 9999 Q gi|255764474|r 319 DTGN 322 (352) Q Consensus 319 ~~~~ 322 (352) .+.+ T Consensus 256 ~~~e 259 (268) T cd05063 256 RFVD 259 (268) T ss_pred CHHH T ss_conf 9999 No 229 >COG3181 Uncharacterized protein conserved in bacteria [Function unknown] Probab=25.94 E-value=40 Score=13.24 Aligned_cols=59 Identities=15% Similarity=0.128 Sum_probs=35.0 Q ss_pred EEEEECCCCHH----HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHH---HHHHHHHHH Q ss_conf 24996599989----99999999986989999999999999998299899999999---999998860 Q gi|255764474|r 291 AGVIVPPRNLH----ALEKAVLYFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIG---KVYDRLLRT 351 (352) Q Consensus 291 ~G~~~~~~d~~----~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~---~iY~~~l~~ 351 (352) .|+..+++-++ .+.+++++++.+++....+.+.+.. ...-+.+.....++ +-|+++++. T Consensus 249 rgvfap~g~~~e~~~~~~~a~kk~l~s~e~~~~~~~~~~~--~~~~~~eel~a~i~~~~~~~~~~~k~ 314 (319) T COG3181 249 RGVFAPAGTPDEIIAKLSAALKKALASPEWQKRLKELGLV--PSYLTGEELKAYIEKEIARWGELVKA 314 (319) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCC--CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 5898079999999999999999986398899999855888--75579899999999999999999986 No 230 >TIGR01949 AroFGH_arch predicted phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR010210 This entry represents a clade of primarily archaeal sequences related to fructose-bisphosphate aldolase. The members appear to be phospho-2-dehydro-3-deoxyheptonate aldolases. This enzyme is the first step of the chorismate biosynthesis pathway. Evidence for this assignment is based on gene clustering and phylogenetic profiling. A group of species lack members of the three other types of phospho-2-dehydro-3-deoxyheptonate aldolase (represented by IPR006219 from INTERPRO, IPR002480 from INTERPRO and IPR006268 from INTERPRO), and also apparently lack the well-known forms of step 2 (3-dehydroquinate synthase), but contain all other steps of the pathway: Aquifex, Archaeoglobus, Halobacterium, Methanococcus, Methanopyrus and Methanobacterium. In Aquifex, Archaeoglobus, Halobacterium, Methanopyrus and Methanosarcina the genes are clustered with other genes from the chorismate, phenylalanine, tyrosine and tryptophan biosynthesis pathways. In addition, these genes in Archaeoglobus, Halobacterium, and Methanosarcina are adjacent to 3-dehydroquinate synthase (IPR002812 from INTERPRO) which also has the property of having members only in those species which lack steps 1 and 2. Together these two genes appear to perform the synthesis of 3-dehydroquinate. It is presumed that the substrates and the chemical transformations involved are identical, but this has not yet been proven experimentally.. Probab=25.50 E-value=40 Score=13.19 Aligned_cols=46 Identities=22% Similarity=0.252 Sum_probs=29.3 Q ss_pred CCCEEEEEE---CCCCCCCHHCCC------------CCCCEEEEECCCCHHHHHHHHHHHHCC Q ss_conf 001001110---278841010023------------488424996599989999999999869 Q gi|255764474|r 266 MASGIPVVA---SNTGVFSELLDP------------ENAKAGVIVPPRNLHALEKAVLYFMNS 313 (352) Q Consensus 266 ma~G~PvI~---s~~~~~~e~i~~------------~~g~~G~~~~~~d~~~la~~i~~l~~~ 313 (352) .+|-+|||. +...+-+|+++. .-|+| +|+..+|..+++|+..+..+ T Consensus 190 ~~c~~PVvvAGG~k~~s~~efLq~v~DA~~aGAAGvs~GRn--vFqh~~p~~~~~Av~~iVH~ 250 (259) T TIGR01949 190 KACAVPVVVAGGPKLASDREFLQMVKDAMEAGAAGVSVGRN--VFQHDTPVAITKAVSAIVHE 250 (259) T ss_pred CCCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEECCCC--CCCCCCHHHHHHHHHHHCCC T ss_conf 21787778737779887467778899999818782310563--01468878999999873469 No 231 >PRK08225 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=25.49 E-value=27 Score=14.40 Aligned_cols=42 Identities=26% Similarity=0.381 Sum_probs=30.2 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 3322222210000010011102788410100234884249965999 Q gi|255764474|r 254 LYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 254 ~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) ..+|-++.++|||=+-.++.+...|-+.++.- ..| =.++.+| T Consensus 23 V~~G~~l~vlEAMKME~~i~A~~~G~V~~i~v-~~G---d~V~~G~ 64 (70) T PRK08225 23 VEEGQDVVILESMKMEIPIVAEEAGTVKKINV-QEG---DFVNEGD 64 (70) T ss_pred ECCCCEEEEEEECCCCCEEECCCCEEEEEEEE-CCC---CEECCCC T ss_conf 89999999997112657684799909999995-898---9989999 No 232 >PRK05331 putative glycerol-3-phosphate acyltransferase PlsX; Provisional Probab=25.08 E-value=41 Score=13.14 Aligned_cols=51 Identities=20% Similarity=0.219 Sum_probs=21.6 Q ss_pred CHHHHHHHHHHHHHHH-CCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 0778888999999971-9859998058866562347899899887543259708996285 Q gi|255764474|r 19 GVTSTVFGLCPIQRKL-GQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWHARRN 77 (352) Q Consensus 19 Gv~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh~~~~ 77 (352) |....+......+.+. ...+..++..- .+...+.-. .....+..|+|+... T Consensus 14 ~P~~~v~Ga~~al~~~~d~~~~LvG~e~-------~i~~~l~~~-~~~~~~i~iiha~~~ 65 (317) T PRK05331 14 GPEVVVPGALQALKEHPDLEILLVGDEE-------KIKPLLAKY-PALRERIEIVHASEV 65 (317) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECHH-------HHHHHHHHC-CCCCCCEEEEECCCC T ss_conf 7288999999999878897699995899-------999999857-776357589989965 No 233 >TIGR01825 gly_Cac_T_rel pyridoxal phosphate-dependent acyltransferase, putative; InterPro: IPR010962 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . This entry contains a number of enzyme families: Serine palmitoyltransferase Glycine C-acetyltransferase (2-amino-3-ketobutyrate coenzyme A ligase) 5-aminolevulinic acid synthase 8-amino-7-oxononanoate synthase All transfer the R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from coenzyme A to an amino acid (Gly, Gly, Ala, respectively), with release of CO2 for the latter two reactions.; GO: 0016740 transferase activity, 0030170 pyridoxal phosphate binding. Probab=24.99 E-value=41 Score=13.13 Aligned_cols=31 Identities=29% Similarity=0.368 Sum_probs=27.4 Q ss_pred ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 6599989999999999869899999999999 Q gi|255764474|r 295 VPPRNLHALEKAVLYFMNSKKIMSDTGNRGR 325 (352) Q Consensus 295 ~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~ 325 (352) .+|.|..++.+|+..|+.+|++.+++=+|++ T Consensus 271 ~~Pa~~~A~~~Av~~l~~~~~l~~kLW~Nt~ 301 (392) T TIGR01825 271 LPPAVVAALIEAVDVLQESPELMEKLWDNTR 301 (392) T ss_pred CCHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 8807899999999987079314777678899 No 234 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=24.93 E-value=18 Score=15.60 Aligned_cols=36 Identities=11% Similarity=0.153 Sum_probs=15.3 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 45897078787877077888899999997198599980 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFG 42 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~ 42 (352) +|-+..+++...+- ......+....++.|..+.+.. T Consensus 27 ~IGv~~~~~~~~~~--~~~~~~~~~~Ake~G~~v~v~d 62 (330) T PRK10355 27 KIGMAIDDLRLERW--QKDRDIFVKKAESLGAKVFVQS 62 (330) T ss_pred EEEEEECCCCCHHH--HHHHHHHHHHHHHCCCEEEEEC T ss_conf 79999068888689--9999999999997599899971 No 235 >cd06311 PBP1_ABC_sugar_binding_like_3 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=24.73 E-value=42 Score=13.10 Aligned_cols=53 Identities=13% Similarity=0.166 Sum_probs=25.3 Q ss_pred EEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 12264451236665456530047740588631333210001466776443112332212222 Q gi|255764474|r 173 GRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFID 234 (352) Q Consensus 173 G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g 234 (352) |.+++.++ .+++..++..+|++..+++.+.... -...+-+++.|..+.+...| T Consensus 165 ~~~~~~~a----~~~~~~~L~~~pdi~~Ifa~nD~ma-----~Ga~~Al~~aG~~~~~~vvG 217 (274) T cd06311 165 ANWNRDDA----FSVMQDLLTKFPKIDAVWAHDDDMA-----VGVLAAIKQAGRTDIKFVVG 217 (274) T ss_pred CCCCHHHH----HHHHHHHHHCCCCCCEEEECCCCHH-----HHHHHHHHHCCCCCCCEEEE T ss_conf 87568999----9999999843888888998898058-----99999999769999978999 No 236 >cd05060 PTKc_Syk_like Catalytic Domain of Spleen Tyrosine Kinase-like Protein Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family; Spleen Tyrosine Kinase (Syk) subfamily; catalytic (c) domain. The Syk subfamily is composed of Syk, ZAP-70, Shark, and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Syk subfamily kinases are cytoplasmic (or nonreceptor) tyr kinases containing two Src homology 2 (SH2) domains N-terminal to the catalytic tyr kinase domain. They are involved in the signaling downstream of activated receptors (including B-cell, T-cell, and Fc receptors) that contain ITAMs (immunoreceptor tyr activation motifs), leading to processes such as cell proliferation, differentiation, survival, adhesion, mi Probab=24.70 E-value=18 Score=15.63 Aligned_cols=70 Identities=13% Similarity=0.147 Sum_probs=41.3 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC-CCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 0000000123332222221000001001110278841010023-48842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP-ENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~-~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-++.|.|.-.... ..+++.. ..|.- .-.++.-++.+.+-|.+.++ ||+.|..+. T Consensus 177 ~ksDvw-------SfGv~l~E~~t~g~~P~~~~~--~~~i~~~i~~~~~-~~~p~~~~~~l~~li~~c~~~~P~~RPs~~ 246 (257) T cd05060 177 SKSDVW-------SYGVTLWEAFSYGAKPYGEMK--GAEVIAMLESGER-LPRPEECPQEIYSIMLSCWKYRPEDRPTFS 246 (257) T ss_pred CCCCCC-------CHHHHHHHHHHCCCCCCCCCC--HHHHHHHHHCCCC-CCCCCCCHHHHHHHHHHHCCCCHHHCCCHH T ss_conf 777776-------149999999957999999999--9999999980899-998620359999999998388966891999 Q ss_pred HH Q ss_conf 99 Q gi|255764474|r 322 NR 323 (352) Q Consensus 322 ~~ 323 (352) += T Consensus 247 ei 248 (257) T cd05060 247 EL 248 (257) T ss_pred HH T ss_conf 99 No 237 >cd05064 PTKc_EphR_A10 Catalytic Domain of the Protein Tyrosine Kinase, Ephrin Receptor A10. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; EphA10 receptor; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. EphRs contain an ephrin binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchor Probab=24.69 E-value=21 Score=15.05 Aligned_cols=69 Identities=13% Similarity=0.151 Sum_probs=38.2 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-ECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 000000012333222222100000100111-027884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVV-ASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI-~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-|+.|.+.- ..+...+.+.+. +|.. .-.+.+-++.+.+-+..-++ ||+.|..+. T Consensus 187 ~~sDV~-------S~Gv~l~Ei~t~g~~P~~~~~~~~~~~~v~--~g~~-~~~p~~~p~~l~~li~~C~~~dP~~RPs~~ 256 (266) T cd05064 187 SASDVW-------SFGIVMWEVMSYGERPYWDMSGQDVIKAVE--DGFR-LPAPRNCPNLLHQLMLDCWQKERGERPRFS 256 (266) T ss_pred CCCCCC-------CHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCHHHHHHHHHHHCCCCHHHCCCHH T ss_conf 514015-------517999999827999998999999999997--3687-999644549999999998086965792999 Q ss_pred H Q ss_conf 9 Q gi|255764474|r 322 N 322 (352) Q Consensus 322 ~ 322 (352) + T Consensus 257 e 257 (266) T cd05064 257 Q 257 (266) T ss_pred H T ss_conf 9 No 238 >KOG4131 consensus Probab=24.54 E-value=42 Score=13.07 Aligned_cols=112 Identities=18% Similarity=0.210 Sum_probs=55.8 Q ss_pred CCCCCHHHHCCCCCHHHHHHHCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHH Q ss_conf 48827778237843268788728898872799951226445123666545653004774058863133321000146677 Q gi|255764474|r 139 MHGVDTERFRPTSNKQEARRHLKISEDAKLIGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQ 218 (352) Q Consensus 139 ~~gid~~~~~~~~~~~~~~~~~~~~~~~~~i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~ 218 (352) .+||+...+.|..... + .+....++| ||...-|--..+++.+++++...+-++.-. +.|.. ++ T Consensus 123 s~gIN~~~~~p~~~t~------~-gp~~~~~G~-gr~~e~~~~~~~~~~l~~ik~~l~~v~val-~~g~~--------~~ 185 (272) T KOG4131 123 SKGINANLSRPNDPTK------N-GPMEETIGY-GREEETKINLNVVEILKRIKRGLSSVRVAL-AVGHT--------LE 185 (272) T ss_pred HHHHHHHHCCCCCCCC------C-CCCCCCCCC-CCEEECCCCCCHHHHHHHHHHCCCEEEEEE-CCCCC--------CC T ss_conf 9765453057765012------6-877644345-611310376309999999870687189761-13776--------50 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCC--CCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 644311233221222222331000000000001--233322222210000010011102788 Q gi|255764474|r 219 RRIYANGLKKRILFIDEQSSIEDWYRALNIFVA--PPLYEGFGLTPLEAMASGIPVVASNTG 278 (352) Q Consensus 219 ~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~--pS~~Eg~gl~~lEAma~G~PvI~s~~~ 278 (352) .++..-++- .|.-.-+.+- -.+|+++- -|+.| ++++++.|.-||..... T Consensus 186 ~~i~~V~vc-----Agsg~svlk~-~~adly~TGEmSHH~-----vL~~~~~g~sVilc~HS 236 (272) T KOG4131 186 SQIKKVAVC-----AGSGSSVLKG-VDADLYITGEMSHHD-----VLDAAANGISVILCEHS 236 (272) T ss_pred CCEEEEEEE-----ECCCCCEECC-CCCCEEEECCCCHHH-----HHHHHHCCCEEEEECCC T ss_conf 020689883-----0567501105-543379943552888-----99998769769994688 No 239 >CHL00067 rps2 ribosomal protein S2 Probab=24.51 E-value=42 Score=13.07 Aligned_cols=34 Identities=24% Similarity=0.408 Sum_probs=23.6 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 2221000001001110-2788410100234884249965999 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) ..++.||.-.|.|+|+ .|.-.-|+.|+ +-+|.+| T Consensus 170 ~~ai~Ea~kL~IPvIaivDTn~~p~~id-------ypIP~ND 204 (227) T CHL00067 170 YTALRECIKLGIPTISIVDTNCDPDLAD-------IPIPAND 204 (227) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCC-------EEEECCC T ss_conf 7999999875999899963899977577-------6753786 No 240 >cd05058 PTKc_Met_Ron Catalytic Domain of the Protein Tyrosine Kinases, Met and Ron. Protein Tyrosine Kinase (PTK) family; Met and Ron; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Met and Ron are receptor tyr kinases (RTKs) composed of an alpha-beta heterodimer. The extracellular alpha chain is disulfide linked to the beta chain, which contains an extracellular ligand-binding region with a sema domain, a PSI domain and four IPT repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Met binds to the ligand, hepatocyte growth factor/scatter factor (HGF/SF), and is also ca Probab=24.25 E-value=21 Score=15.07 Aligned_cols=69 Identities=14% Similarity=0.228 Sum_probs=36.5 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 00000001233322222210000010011102-7884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-++-|.|.-.. +...+...+. .|. -...+..-++.+.+-+.+-++ ||+.|..+. T Consensus 181 ~~~DVw-------SfGi~l~El~t~g~~p~~~~~~~~~~~~i~--~g~-~~~~p~~~p~~l~~li~~C~~~~P~~RPs~~ 250 (262) T cd05058 181 TKSDVW-------SFGVLLWELMTRGAPPYPDVDSFDITVYLL--QGR-RLLQPEYCPDPLYEVMLSCWHPKPEMRPTFS 250 (262) T ss_pred CCCCEE-------HHHHHHHHHHCCCCCCCCCCCHHHHHHHHH--CCC-CCCCCCCCHHHHHHHHHHHCCCCHHHCCCHH T ss_conf 554400-------189999999818999999989999999998--089-8899765759999999998388965792999 Q ss_pred H Q ss_conf 9 Q gi|255764474|r 322 N 322 (352) Q Consensus 322 ~ 322 (352) + T Consensus 251 e 251 (262) T cd05058 251 E 251 (262) T ss_pred H T ss_conf 9 No 241 >PRK13435 response regulator; Provisional Probab=24.17 E-value=43 Score=13.03 Aligned_cols=44 Identities=23% Similarity=0.346 Sum_probs=28.4 Q ss_pred CEEEEEECCCCCCCHHCC-CCCCCEEEEECCCCHHHHHHHHHHHHCC Q ss_conf 100111027884101002-3488424996599989999999999869 Q gi|255764474|r 268 SGIPVVASNTGVFSELLD-PENAKAGVIVPPRNLHALEKAVLYFMNS 313 (352) Q Consensus 268 ~G~PvI~s~~~~~~e~i~-~~~g~~G~~~~~~d~~~la~~i~~l~~~ 313 (352) +++|+|... +..|.+. ...|-.+|+..|-+++++.+++.++++. T Consensus 73 ~~ipvI~lT--a~~d~~~~~~aGA~dyl~KPf~~~eL~~ai~~~~~r 117 (141) T PRK13435 73 GGVEVIFMT--ANPERVGHDFAGALGVIAKPYSLRQVARALSYVAAR 117 (141) T ss_pred CCCCEEEEE--CCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHHHH T ss_conf 998389997--779986013459985898999989999999999971 No 242 >cd05116 PTKc_Syk Catalytic Domain of the Protein Tyrosine Kinase, Spleen tyrosine kinase. Protein Tyrosine Kinase (PTK) family; Spleen tyrosine kinase (Syk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Syk, together with Zap-70, form the Syk subfamily of kinases which are cytoplasmic (or nonreceptor) tyr kinases containing two Src homology 2 (SH2) domains N-terminal to the catalytic tyr kinase domain. Syk was first cloned from the spleen, and its function in hematopoietic cells is well-established. Syk is involved in the signaling downstream of activated receptors (including B-cell and Fc receptors) that contain ITAMs (immunoreceptor tyr activation motifs), leading to processes such as cell proliferatio Probab=24.16 E-value=19 Score=15.46 Aligned_cols=72 Identities=10% Similarity=0.150 Sum_probs=42.0 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHH Q ss_conf 0000000001233322222210000010011102-7884101002348842499659998999999999986-9899999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSD 319 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~ 319 (352) +-..+|++ +||.++.|.++.|.+.-.. +...+...+. .|.. .-.++.-+.++.+-|..-++ ||+.|.. T Consensus 175 ~~~ksDVw-------SfGv~l~E~~t~G~~P~~~~~~~~~~~~i~--~g~r-~~~p~~~~~~l~~li~~Cw~~~P~~RPs 244 (257) T cd05116 175 FSSKSDVW-------SFGVLMWEAFSYGQKPYKGMKGNEVTQMIE--SGER-MECPQRCPPEMYDLMKLCWTYGVDERPG 244 (257) T ss_pred CCCCCCCC-------CHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCCHHHHHHHHHHHCCCHHHCCC T ss_conf 98301431-------279999999808999899989999999998--4899-9999767899999999981789657909 Q ss_pred HHHH Q ss_conf 9999 Q gi|255764474|r 320 TGNR 323 (352) Q Consensus 320 ~~~~ 323 (352) +.+= T Consensus 245 f~~i 248 (257) T cd05116 245 FAVV 248 (257) T ss_pred HHHH T ss_conf 9999 No 243 >cd05578 STKc_Yank1 STKc_Yank1: Serine/Threonine Kinases (STKs), Yank1 or STK32A subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The Yank1 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily contains uncharacterized STKs with similarity to the human protein designated Yank1 or STK32A. Probab=23.92 E-value=25 Score=14.62 Aligned_cols=65 Identities=18% Similarity=0.185 Sum_probs=34.9 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHH Q ss_conf 0000000123332222221000001001110-27884101002348842499659998999999999986-98999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIM 317 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~ 317 (352) ..+|+. ++|.++.| |++|.+.-. .......++...... .....++.-++++.+-|.+++. ||+.| T Consensus 178 ~~~DiW-------SlGv~lye-l~~G~~Pf~~~~~~~~~~~~~~~~~-~~~~~~~~~s~~~~dli~~~L~~dP~~R 244 (258) T cd05578 178 FAVDWW-------SLGVTAYE-MLRGKRPYRGHSRTPREEILAKFET-ADVLYPAGWSSEAIDAINKLLERDPQKR 244 (258) T ss_pred CHHHHH-------HHHHHHHH-HHHCCCCCCCCCCHHHHHHHHHHCC-CCCCCCCCCCHHHHHHHHHHCCCCHHHC T ss_conf 666598-------79999999-9888599999983399999987426-9998987799999999999858798786 No 244 >cd05071 PTKc_Src Catalytic Domain of the Protein Tyrosine Kinase, Src. Protein Tyrosine Kinase (PTK) family; Src kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Src (or c-Src) is a cytoplasmic (or non-receptor) tyr kinase, containing an N-terminal SH4 domain with a myristoylation site, followed by SH3 and SH2 domains, a tyr kinase domain, and a regulatory C-terminal region with a conserved tyr. It is activated by autophosphorylation at the tyr kinase domain, and is negatively regulated by phosphorylation at the C-terminal tyr by Csk (C-terminal Src Kinase). c-Src is the vertebrate homolog of the oncogenic protein (v-Src) from Rous sarcoma virus. Together with other Src subfamily proteins, it is invo Probab=23.90 E-value=38 Score=13.37 Aligned_cols=78 Identities=17% Similarity=0.154 Sum_probs=42.5 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEE--ECCCCHHHHHHHHHHHHC-CHHHHHHH Q ss_conf 000000012333222222100000100111027884101002348842499--659998999999999986-98999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVI--VPPRNLHALEKAVLYFMN-SKKIMSDT 320 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~--~~~~d~~~la~~i~~l~~-~~~~~~~~ 320 (352) ..+|++ +||.++.|-+..|.+.-... ...+++..- ..|.- .+++-++++.+-+...++ ||+.|..+ T Consensus 182 ~ksDIw-------SfGvil~El~t~g~~Pf~~~--~~~~~~~~i--~~~~~~p~p~~~~~~l~~li~~C~~~dP~~RPt~ 250 (262) T cd05071 182 IKSDVW-------SFGILLTELTTKGRVPYPGM--VNREVLDQV--ERGYRMPCPPECPESLHDLMCQCWRKEPEERPTF 250 (262) T ss_pred CCCCEE-------CCHHHHHHHHHCCCCCCCCC--CHHHHHHHH--HCCCCCCCCCCCCHHHHHHHHHHHCCCHHHCCCH T ss_conf 758446-------53899999995899998998--999999999--7179899997668999999999808796679199 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999998299 Q gi|255764474|r 321 GNRGRERAVKHFS 333 (352) Q Consensus 321 ~~~a~~~~~~~fs 333 (352) .+=. +..++.|| T Consensus 251 ~~l~-~~L~~~~~ 262 (262) T cd05071 251 EYLQ-AFLEDYFT 262 (262) T ss_pred HHHH-HHHHHHCC T ss_conf 9999-99862369 No 245 >cd06324 PBP1_ABC_sugar_binding_like_13 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=23.87 E-value=28 Score=14.23 Aligned_cols=67 Identities=12% Similarity=0.086 Sum_probs=33.4 Q ss_pred EEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCC--CCCCCCCC--CCCCCCCCHHHH Q ss_conf 51226445123666545653004774058863133321000146677644311233--22122222--233100000000 Q gi|255764474|r 172 FGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLK--KRILFIDE--QSSIEDWYRALN 247 (352) Q Consensus 172 ~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~--~~V~~~g~--~~~~~~~~~~ad 247 (352) .+.++..|+. +++..+++++|++..+++.+.... -..-+-+++.|+. +.|.+.|+ .++....++.-. T Consensus 181 ~~~~~~~~a~----~~~~~lL~~~Pdi~aI~~~ND~mA-----lGAv~Al~~aG~~~~~dv~IvG~Dg~~~~~~~I~~G~ 251 (305) T cd06324 181 YAGWSEDEAY----EQAENLLKRYPDVRLIWAANDQMA-----FGALRAAKEAGRKPGRDVLFGGVNWSPEALRAIKDGR 251 (305) T ss_pred CCCCCHHHHH----HHHHHHHHHCCCCCEEEECCCHHH-----HHHHHHHHHCCCCCCCCEEEEEECCCHHHHHHHHCCC T ss_conf 1898789999----999999973999988997885789-----9999999974998799889997069899999998498 No 246 >cd05044 PTKc_c-ros Catalytic Domain of the Protein Tyrosine Kinase, C-ros. Protein Tyrosine Kinases (PTK) family; C-ros and Drosophila Sevenless proteins; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. The proto-oncogene c-ros encodes an orphan receptor tyr kinase (RTK) with an unknown ligand. RTKs contain an extracellular ligand-binding domain, a transmembrane region, and an intracellular tyr kinase domain. RTKs are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase catalytic domain. C-ros is expressed in embryonic cells of the kidney, intestine and lung, but disappears soon after birth. It persists only in the adult epididymis. Male Probab=23.74 E-value=25 Score=14.59 Aligned_cols=71 Identities=11% Similarity=0.114 Sum_probs=39.8 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC-CCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHH Q ss_conf 000000000123332222221000001001110278841010023-48842499659998999999999986-9899999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP-ENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSD 319 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~-~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~ 319 (352) +-..+|++ +||.++.|-+++|.+.-.... ..+++.. ..+.. .-.++.-++++.+-+.+.++ ||+.|.. T Consensus 191 ~~~~sDiw-------S~Gvil~Ellt~G~~Pf~~~~--~~~~~~~i~~~~~-~~~p~~~~~~l~~li~~cl~~dP~~RPs 260 (270) T cd05044 191 FTTQSDVW-------AFGVLMWEILTLGQQPYPALN--NQEVLQHVTTGGR-LQSPENCPDKLYNLMTNCWAKDPSERPT 260 (270) T ss_pred CCCCCCCC-------CHHHHHHHHHCCCCCCCCCCC--HHHHHHHHHHCCC-CCCCCCCCHHHHHHHHHHHCCCHHHCCC T ss_conf 99766511-------249999999857999889989--9999999983898-8997667099999999982879667909 Q ss_pred HHH Q ss_conf 999 Q gi|255764474|r 320 TGN 322 (352) Q Consensus 320 ~~~ 322 (352) +.+ T Consensus 261 ~~~ 263 (270) T cd05044 261 FDR 263 (270) T ss_pred HHH T ss_conf 999 No 247 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=23.70 E-value=44 Score=12.97 Aligned_cols=85 Identities=12% Similarity=0.019 Sum_probs=33.2 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEE-ECCC-HHHHH Q ss_conf 458970787878770778888999999971985999805886656234789989988754325970899-6285-46899 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQRLVVFGYCLPKNIPSIGISSLLTCWKKPIGQNSRIWH-ARRN-NEMLL 82 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivh-~~~~-~~~~~ 82 (352) +|.+|.......+- .......-..-+++|..+......- .+...-...........+|.+- .... ..... T Consensus 1 kia~v~~~~~n~f~--~~~~~Ga~~aA~~lG~~v~~~~da~------~d~~~Q~~~i~~~I~~gvDaIii~p~d~~a~~~ 72 (294) T cd06316 1 KAAIVMHTSGSDWS--NAQVRGAKDEFAKLGIEVVATTDAQ------FDPAKQVADIETTISQKPDIIISIPVDPVSTAA 72 (294) T ss_pred CEEEEECCCCCHHH--HHHHHHHHHHHHHHCCEEEEECCCC------CCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH T ss_conf 98999578899899--9999999999998399899966998------899999999999998599999993888678799 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 999999709981999 Q gi|255764474|r 83 GVMMRDVLRMPLKLV 97 (352) Q Consensus 83 ~~~~~~~~~~~~~~~ 97 (352) .+-...-.++|+.+. T Consensus 73 ~v~~A~~aGIpVv~~ 87 (294) T cd06316 73 AYKKVAEAGIKLVFM 87 (294) T ss_pred HHHHHHHCCCCEEEE T ss_conf 999999819967983 No 248 >cd05074 PTKc_Tyro3 Catalytic Domain of the Protein Tyrosine Kinase, Tyro3. Protein Tyrosine Kinase (PTK) family; Tyro3; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Tyro3 (or Sky) is a member of the Axl subfamily, which is composed of receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with two immunoglobulin-like domains followed by two fibronectin type III repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands, Gas6 and protein S, leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Tyro3 is predominantly expressed in the central nervous system and the brain, and functions as a neurotrophic fac Probab=23.61 E-value=26 Score=14.54 Aligned_cols=69 Identities=17% Similarity=0.221 Sum_probs=38.5 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 00000001233322222210000010011102-7884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-++.|.+.-.. +...+-+.+. .+. -.-.+++-++++.+-+...++ ||+.|..+. T Consensus 194 ~~sDIw-------SfGvil~El~t~g~~P~~~~~~~~~~~~~~--~~~-~~~~p~~~~~~l~~li~~C~~~dP~~RPs~~ 263 (273) T cd05074 194 THSDVW-------AFGVTMWEIMTRGQTPYAGVENSEIYNYLI--KGN-RLKQPPDCLEDVYELMCQCWSPEPKCRPSFQ 263 (273) T ss_pred CCCCCC-------CCHHHHHHHHCCCCCCCCCCCHHHHHHHHH--HCC-CCCCCCCCHHHHHHHHHHHCCCCHHHCCCHH T ss_conf 788866-------429999999808999889989999999998--289-8899975649999999998187966891999 Q ss_pred H Q ss_conf 9 Q gi|255764474|r 322 N 322 (352) Q Consensus 322 ~ 322 (352) + T Consensus 264 ~ 264 (273) T cd05074 264 H 264 (273) T ss_pred H T ss_conf 9 No 249 >cd05050 PTKc_Musk Catalytic Domain of the Protein Tyrosine Kinase, Muscle-specific kinase. Protein Tyrosine Kinase (PTK) family; Muscle-specific kinase (Musk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Musk is a receptor tyr kinase (RTK) containing an extracellular region with four immunoglobulin-like domains and a cysteine-rich cluster, a transmembrane segment, and an intracellular catalytic domain. Musk is expressed and concentrated in the postsynaptic membrane in skeletal muscle. It is essential for the establishment of the neuromuscular junction (NMJ), a peripheral synapse that conveys signals from motor neurons to muscle cells. Agrin, a large proteoglycan released from motor neurons, stimulates M Probab=23.58 E-value=23 Score=14.86 Aligned_cols=70 Identities=9% Similarity=0.109 Sum_probs=40.0 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEE-EEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 000000012333222222100000100-111027884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGI-PVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~-PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-++.|. |--......+...+. ++.. .-++++-++++.+-+...++ ||+.|..+. T Consensus 211 ~~sDVw-------SfGv~l~El~t~g~~P~~~~~~~~~~~~i~--~~~~-~~~p~~~p~~l~~li~~c~~~dP~~RPs~~ 280 (288) T cd05050 211 TESDVW-------AYGVVLWEIFSYGMQPYYGMAHEEVIYYVR--DGNV-LSCPDNCPLELYNLMRLCWSKLPSDRPSFA 280 (288) T ss_pred CCCCEE-------ECCHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCHHHHHHHHHHHHCCCHHHCCCHH T ss_conf 607478-------343899999837999999989999999997--0897-889876729999999998485965792999 Q ss_pred HH Q ss_conf 99 Q gi|255764474|r 322 NR 323 (352) Q Consensus 322 ~~ 323 (352) += T Consensus 281 ei 282 (288) T cd05050 281 SI 282 (288) T ss_pred HH T ss_conf 99 No 250 >PRK05776 DNA topoisomerase III; Provisional Probab=23.57 E-value=44 Score=12.95 Aligned_cols=94 Identities=11% Similarity=0.111 Sum_probs=60.1 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCC--CCEEEEECCCCHHHHHHHHHHH---HCCHHHHHHHHHHHHHHHHH Q ss_conf 222222100000100111027884101002348--8424996599989999999999---86989999999999999998 Q gi|255764474|r 256 EGFGLTPLEAMASGIPVVASNTGVFSELLDPEN--AKAGVIVPPRNLHALEKAVLYF---MNSKKIMSDTGNRGRERAVK 330 (352) Q Consensus 256 Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~--g~~G~~~~~~d~~~la~~i~~l---~~~~~~~~~~~~~a~~~~~~ 330 (352) |+-=+..+|-..-|+|. |..+-+..+++... ...+-+++..--..+.+.+... +.+|+...+|-+.-.+.... T Consensus 482 EatLi~~ME~~GIGTpA--Tra~IIe~L~~R~Yi~~~~~~l~pT~~G~~li~~l~~~~~~l~~p~~Ta~~E~~L~~I~~G 559 (675) T PRK05776 482 KASLLKWMESVNIGTEA--TRARIIETLFKRKYLRSKGGKVYATPLGIGVAEILQKYFPDLTSVELTRRFEEKLEKIRSG 559 (675) T ss_pred HHHHHHHHHCCCCCCCC--CHHHHHHHHHHCCCEEEECCEEEECHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC T ss_conf 99999998658999501--6899999896489699059999874459999999997436525914899999998999879 Q ss_pred HCCHHHHHHHHHHHHHHHHHH Q ss_conf 299899999999999998860 Q gi|255764474|r 331 HFSIVKEASDIGKVYDRLLRT 351 (352) Q Consensus 331 ~fs~~~~a~~~~~iY~~~l~~ 351 (352) +.+++.+.+...+..+++++. T Consensus 560 k~~~~~f~~e~~~~~~~~v~~ 580 (675) T PRK05776 560 KERREEVVEEAKKTLRKLLEE 580 (675) T ss_pred CCCHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999 No 251 >pfam04230 PS_pyruv_trans Polysaccharide pyruvyl transferase. Pyruvyl-transferases involved in peptidoglycan-associated polymer biosynthesis. CsaB in Bacillus anthracis is necessary for the non-covalent anchoring of proteins containing an SLH (S-layer homology) domain to peptidoglycan-associated pyruvylated polysaccharides. WcaK and AmsJ are involved in the biosynthesis of colanic acid in Escherichia coli and of amylovoran in Erwinia amylovora. Probab=23.51 E-value=44 Score=12.94 Aligned_cols=36 Identities=11% Similarity=0.194 Sum_probs=25.3 Q ss_pred CCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECC Q ss_conf 23310000000000012333222222100000100111027 Q gi|255764474|r 236 QSSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASN 276 (352) Q Consensus 236 ~~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~ 276 (352) .++...+++.+|+++....+ ..+-|+++|+|+|+-. T Consensus 220 ~~~~~~~i~~~~~vi~~RlH-----~~I~a~~~gvP~i~i~ 255 (258) T pfam04230 220 PEELLALISQADFVITTRLH-----GLIFALLLGVPFIVIA 255 (258) T ss_pred HHHHHHHHHCCCEEEEECHH-----HHHHHHHCCCCEEEEE T ss_conf 99999999709899983737-----9999997599989971 No 252 >pfam03949 Malic_M Malic enzyme, NAD binding domain. Probab=23.50 E-value=44 Score=12.94 Aligned_cols=17 Identities=29% Similarity=0.675 Sum_probs=6.6 Q ss_pred EEEEECC-CCHHHHHHHH Q ss_conf 2499659-9989999999 Q gi|255764474|r 291 AGVIVPP-RNLHALEKAV 307 (352) Q Consensus 291 ~G~~~~~-~d~~~la~~i 307 (352) .|.++++ .|+.+++..+ T Consensus 229 ~~~i~P~~~~~r~vs~~V 246 (255) T pfam03949 229 EGYIIPPLFDIREVSPRV 246 (255) T ss_pred CCCCCCCCCCHHHHHHHH T ss_conf 874649987487879999 No 253 >cd06300 PBP1_ABC_sugar_binding_like_1 Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their substrate specificity is not known in detail. Probab=23.36 E-value=44 Score=12.92 Aligned_cols=30 Identities=7% Similarity=0.000 Sum_probs=13.1 Q ss_pred HEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 45897078787877077888899999997198 Q gi|255764474|r 5 NIDVIAPNMKFRHTGVTSTVFGLCPIQRKLGQ 36 (352) Q Consensus 5 ~i~~i~~~~~~~~~Gv~~~v~~~~~~~~~~~~ 36 (352) ||-++.+++...+.. ......-...++++. T Consensus 1 kIg~~~~~~~n~f~~--~~~~~~~~~ak~~~~ 30 (272) T cd06300 1 KIGLSNSYAGNTWRA--QMLDEFKAQAKELKK 30 (272) T ss_pred CEEEEECCCCCHHHH--HHHHHHHHHHHHCCC T ss_conf 989993688997999--999999999986376 No 254 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=23.07 E-value=45 Score=12.89 Aligned_cols=43 Identities=21% Similarity=0.179 Sum_probs=28.9 Q ss_pred CEEEEEECCC-CCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHCC Q ss_conf 1001110278-841010023488424996599989999999999869 Q gi|255764474|r 268 SGIPVVASNT-GVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMNS 313 (352) Q Consensus 268 ~G~PvI~s~~-~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~~ 313 (352) +|.+|..-+. +..++.++.. +-++-.+++++++|++|++++.. T Consensus 228 ag~gVAMgNa~~~~~~~~k~~---~~~~t~~n~~~Gva~ai~~~~~~ 271 (275) T PRK00192 228 ADIAVVIPGPDGPNPPLLRGI---EFTLASAPGPEGWAEAVNKILNK 271 (275) T ss_pred CCCEEEEECCCCCCCCCCCCC---CEEECCCCCCHHHHHHHHHHHHH T ss_conf 897599638988983412324---65987899968999999999998 No 255 >PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=23.05 E-value=41 Score=13.18 Aligned_cols=19 Identities=16% Similarity=0.076 Sum_probs=8.5 Q ss_pred CCCCCCCCCCHHHHHHCCC Q ss_conf 2223310000000000012 Q gi|255764474|r 234 DEQSSIEDWYRALNIFVAP 252 (352) Q Consensus 234 g~~~~~~~~~~~adi~i~p 252 (352) +..+++...-...++.+.| T Consensus 93 ~~~~~vi~~a~~~~i~~iP 111 (210) T PRK07455 93 HVDLELIQAAVAADIPIIP 111 (210) T ss_pred CCCHHHHHHHHHCCCCEEC T ss_conf 8889999999982997658 No 256 >COG2344 AT-rich DNA-binding protein [General function prediction only] Probab=22.94 E-value=45 Score=12.87 Aligned_cols=40 Identities=18% Similarity=0.242 Sum_probs=31.1 Q ss_pred EEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf 9995122644512366654565300477405886313332 Q gi|255764474|r 169 IGCFGRIRKLKGTDLFVDCMINILPHHPGWTAVVVGKTTL 208 (352) Q Consensus 169 i~~~G~~~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~ 208 (352) ++..|+-...-+++.|.+.+.+++.....+..+++|-|.. T Consensus 56 FG~lGkrG~GYnV~~L~~ff~~~Lg~~~~tnviiVG~Gnl 95 (211) T COG2344 56 FGELGKRGYGYNVKYLRDFFDDLLGQDKTTNVIIVGVGNL 95 (211) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHH T ss_conf 7752788778439999999999838774034899905708 No 257 >cd05047 PTKc_Tie Catalytic Domain of Tie Protein Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family; Tie subfamily; catalytic (c) domain. The Tie subfamily consists of Tie1 and Tie2. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Tie proteins are receptor tyr kinases (RTKs) containing an extracellular region, a transmembrane segment, and an intracellular catalytic domain. The extracellular region contains an immunoglobulin (Ig)-like domain, three epidermal growth factor (EGF)-like domains, a second Ig-like domain, and three fibronectin type III repeats. Tie receptors are specifically expressed in endothelial cells and hematopoietic stem cells. The angiopoietins (Ang-1 to Ang-4) serve as ligands for Tie2, while no specific l Probab=22.85 E-value=25 Score=14.62 Aligned_cols=72 Identities=8% Similarity=0.020 Sum_probs=39.3 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHH Q ss_conf 0000000000123332222221000001001110-27884101002348842499659998999999999986-989999 Q gi|255764474|r 241 DWYRALNIFVAPPLYEGFGLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMS 318 (352) Q Consensus 241 ~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~ 318 (352) .+-..+|++ +||.++.|-+++|.+.-. .....+-+.+. .|. -.-.+..-++++.+-+...++ ||+.|. T Consensus 187 ~y~~k~DV~-------S~Gv~l~El~t~g~~P~~~~~~~~~~~~i~--~~~-~~~~p~~~~~~~~~li~~c~~~dP~~RP 256 (270) T cd05047 187 VYTTNSDVW-------SYGVLLWEIVSLGGTPYCGMTCAELYEKLP--QGY-RLEKPLNCDDEVYDLMRQCWREKPYERP 256 (270) T ss_pred CCCCCCCCH-------HHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCC-CCCCCCCCCHHHHHHHHHHCCCCHHHCC T ss_conf 878545531-------279999999908999999989999999997--258-8898888779999999998088965792 Q ss_pred HHHH Q ss_conf 9999 Q gi|255764474|r 319 DTGN 322 (352) Q Consensus 319 ~~~~ 322 (352) .+.+ T Consensus 257 t~~e 260 (270) T cd05047 257 SFAQ 260 (270) T ss_pred CHHH T ss_conf 9999 No 258 >TIGR00700 GABAtrnsam 4-aminobutyrate transaminase; InterPro: IPR004632 Eukaryotic 4-aminobutyrate aminotransferase (2.6.1.19 from EC) is a class III pyridoxal-phosphate-dependent aminotransferase. The enzyme catalyses the conversion of 4-aminobutanoate and 2-oxoglutarate into succinate semialdehyde and L-glutamate. The degree of sequence difference between this set and known bacterial examples is greater than the distance between, in either set, the most similar enzyme with a distinct function, and so the prokaryotic and eukaryotic sets have been placed into separate families. This family describes known bacterial examples of the enzyme. The best archaeal matches are presumed but not trusted to have the equivalent function. Escherichia coli has two isozymes. Alternate names include GABA transaminase, gamma-amino-N-butyrate transaminase, and beta-alanine--oxoglutarate aminotransferase.; GO: 0003867 4-aminobutyrate transaminase activity, 0009448 gamma-aminobutyric acid metabolic process. Probab=22.74 E-value=46 Score=12.84 Aligned_cols=122 Identities=19% Similarity=0.269 Sum_probs=67.1 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCC--C-C--CCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 0146677644311233221222222331000000000001233322--2-2--221000001001110278841010023 Q gi|255764474|r 212 LFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPLYEG--F-G--LTPLEAMASGIPVVASNTGVFSELLDP 286 (352) Q Consensus 212 ~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~~Eg--~-g--l~~lEAma~G~PvI~s~~~~~~e~i~~ 286 (352) .+...+.+..++++ |.|. -|+++.=+++.--+..--++.. + | +++--.+|.|+|. |.+-|-.|+++. T Consensus 221 ~fv~a~~~~C~ehg----iVfi--ADEvQtGFARTG~~FA~eH~~~dGl~PDLit~aKslA~GlPL--SgV~GrAeimdA 292 (427) T TIGR00700 221 EFVAAVLDLCKEHG----IVFI--ADEVQTGFARTGKLFAIEHYDDDGLKPDLITVAKSLADGLPL--SGVVGRAEIMDA 292 (427) T ss_pred HHHHHHHHHHHHCC----EEEE--ECCCCCCCCHHCCCEEEEECCCCCCCCCHHHHHHHHHCCCCC--CHHHCCHHHCCC T ss_conf 89999999754387----5899--725456800002112541006788875188786677356862--111044121168 Q ss_pred -CCCCEEEEECCCCHHHHHHHHH--HHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf -4884249965999899999999--9986989999999999999998299899999999 Q gi|255764474|r 287 -ENAKAGVIVPPRNLHALEKAVL--YFMNSKKIMSDTGNRGRERAVKHFSIVKEASDIG 342 (352) Q Consensus 287 -~~g~~G~~~~~~d~~~la~~i~--~l~~~~~~~~~~~~~a~~~~~~~fs~~~~a~~~~ 342 (352) ..|+.|-.+ .+||-+.|.|+. .+++.+.+.++..+=+|-.-.+-++|+.....+- T Consensus 293 papGgLGGTy-aGnPlacAAAlAvl~iIe~egL~ERa~qig~~~~d~l~~~k~~~p~Ig 350 (427) T TIGR00700 293 PAPGGLGGTY-AGNPLACAAALAVLDIIEEEGLIERAEQIGKLVKDKLIELKEEVPAIG 350 (427) T ss_pred CCCCCCCCCC-CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 8888887755-688799999999998631131247899987889888888876067211 No 259 >cd00133 PTS_IIB PTS_IIB: subunit IIB of enzyme II (EII) is the central energy-coupling domain of the phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In the multienzyme PTS complex, EII is a carbohydrate-specific permease consisting of two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include chitobiose/lichenan, ascorbate, lactose, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. The PTS is found only in bacteria, where it catalyzes the transport and phosphorylation of numerous monosaccharides, disaccharides, polyols, amino sugars, and other sugar derivatives. The four proteins (domains) forming the PTS phosphorylation cascade (EI, HPr, EIIA, and EIIB), can phosphorylate or interact with numerous non-PTS proteins thereby r Probab=22.48 E-value=46 Score=12.81 Aligned_cols=53 Identities=9% Similarity=0.103 Sum_probs=27.3 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCC Q ss_conf 8863133321000146677644311233221222222331000000000001233 Q gi|255764474|r 200 AVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDEQSSIEDWYRALNIFVAPPL 254 (352) Q Consensus 200 l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~~~~~~~~~~adi~i~pS~ 254 (352) ++++|.|..-......++++...+.+....+...+..+ . ......|+++.... T Consensus 3 lvvC~~G~~tS~ll~~~l~~~~~~~~~~~~v~~~~~~~-~-~~~~~~DlIist~~ 55 (84) T cd00133 3 LVVCGSGIGSSSMLAEKLEKAAKELGIEVKVEAQGLSE-V-IDLADADLIISTVP 55 (84) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCEEEEEEEHHHHH-C-CCCCCCCEEEEEEC T ss_conf 99999988999999999999988688289998612331-4-14359888999002 No 260 >cd03146 GAT1_Peptidase_E Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. This group contains proteins similar to the aspartyl dipeptidases Salmonella typhimurium peptidase E and Xenopus laevis peptidase E. In bacteria peptidase E is believed to play a role in degrading peptides generated by intracellular protein breakdown or imported into the cell as nutrient sources. Peptidase E uniquely hydrolyses only Asp-X dipeptides (where X is any amino acid), and one tripeptide Asp-Gly-Gly. Peptidase E is believed to be a serine peptidase having a Ser-His-Glu catalytic triad which differs from the Cys-His-Glu catalytic triad typical of GATase1 domains by having a Ser in place of the reactive Cys at the nucleophile elbow. Xenopus PepE is developmentally regulated in response to thyroid hormone and, it is thought to play a role in apoptosis during tail reabsorption. Probab=22.48 E-value=46 Score=12.81 Aligned_cols=18 Identities=33% Similarity=0.493 Sum_probs=7.5 Q ss_pred CCCCCCCE--EEEEECCCCC Q ss_conf 10000010--0111027884 Q gi|255764474|r 262 PLEAMASG--IPVVASNTGV 279 (352) Q Consensus 262 ~lEAma~G--~PvI~s~~~~ 279 (352) +.|+++.+ .|+|+-|-+. T Consensus 171 l~~~~~~~~~~~gigidegt 190 (212) T cd03146 171 FHEFLEAGPTEPVIALDEGA 190 (212) T ss_pred HHHHHHHCCCCEEEEEECCE T ss_conf 99999829997089983898 No 261 >PRK08444 hypothetical protein; Provisional Probab=22.09 E-value=47 Score=12.76 Aligned_cols=38 Identities=16% Similarity=0.258 Sum_probs=15.5 Q ss_pred HCCCCCCCCEEEEEEEECHHH-HHHHHHHHHHHHHCCCCCEE Q ss_conf 728898872799951226445-12366654565300477405 Q gi|255764474|r 159 HLKISEDAKLIGCFGRIRKLK-GTDLFVDCMINILPHHPGWT 199 (352) Q Consensus 159 ~~~~~~~~~~i~~~G~~~~~K-g~~~li~a~~~l~~~~~~~~ 199 (352) ..|++.+... ..|.+.... .++.| ..+.++..+...+. T Consensus 200 ~lGi~ttaTm--myGhvEt~e~rv~HL-~~lR~lQd~tgGF~ 238 (353) T PRK08444 200 KKGKQSNATM--LFGHIENREHRIDHM-LRLRDLQDKTGGFN 238 (353) T ss_pred HCCCCCCEEE--ECCCCCCHHHHHHHH-HHHHHHCCCCCCEE T ss_conf 8299664146--778879999999999-99998365579835 No 262 >PRK05299 rpsB 30S ribosomal protein S2; Provisional Probab=22.02 E-value=47 Score=12.75 Aligned_cols=34 Identities=32% Similarity=0.393 Sum_probs=24.0 Q ss_pred CCCCCCCCCCEEEEEE-CCCCCCCHHCCCCCCCEEEEECCCC Q ss_conf 2221000001001110-2788410100234884249965999 Q gi|255764474|r 259 GLTPLEAMASGIPVVA-SNTGVFSELLDPENAKAGVIVPPRN 299 (352) Q Consensus 259 gl~~lEAma~G~PvI~-s~~~~~~e~i~~~~g~~G~~~~~~d 299 (352) ..++.||.-.|+|+|+ .|...-|+.|+ +.+|.+| T Consensus 170 ~~AV~EA~kl~IPvI~ivDTn~dP~~id-------ypIP~ND 204 (255) T PRK05299 170 HIAVKEARKLGIPVVAIVDTNCDPDGVD-------YPIPGND 204 (255) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCC-------EEECCCC T ss_conf 7999999975998887624899966586-------5533775 No 263 >PRK06846 deaminase; Validated Probab=21.99 E-value=47 Score=12.75 Aligned_cols=93 Identities=13% Similarity=0.040 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHCCC---CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCC-----CC-----CCCCCCCCHHHH Q ss_conf 236665456530047---74058863133321000146677644311233221222-----22-----233100000000 Q gi|255764474|r 181 TDLFVDCMINILPHH---PGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFI-----DE-----QSSIEDWYRALN 247 (352) Q Consensus 181 ~~~li~a~~~l~~~~---~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~-----g~-----~~~~~~~~~~ad 247 (352) ++.-++.+-++..++ -|+++.-.|+ .....++.+.+...+.|+..+|... +. ..++...++.++ T Consensus 204 ~~~~L~~~f~LA~~~~~~vD~HldE~~d---~~~~~l~~la~~t~~~g~~grVt~sH~~sL~~~~~~~~~~~~~~la~ag 280 (410) T PRK06846 204 IEKSLDTMFQIAVEFNKGVDIHLHDTNP---LGVATIKKLVETTEEAQWKGRVTISHAFALGDLNEEEVEELAERLAAQQ 280 (410) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECCCCC---HHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 9999999999999809992363158898---4778999999999982998988837344464399999999999999859 Q ss_pred HHCCCCCCCCCCC-CCCCCCCCEEEEE-ECC Q ss_conf 0001233322222-2100000100111-027 Q gi|255764474|r 248 IFVAPPLYEGFGL-TPLEAMASGIPVV-ASN 276 (352) Q Consensus 248 i~i~pS~~Eg~gl-~~lEAma~G~PvI-~s~ 276 (352) +-+..+.--+-|. .+-|-.++|++|- .+| T Consensus 281 I~vvst~P~~r~v~pv~eL~~aGV~V~~g~D 311 (410) T PRK06846 281 ISITSTVPIGRLHMPLPLLHEKGVKVSLGTD 311 (410) T ss_pred CEEEEECCCCCCCCCHHHHHHCCCEEEEECC T ss_conf 9399858998997879999976991998268 No 264 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=21.94 E-value=26 Score=14.48 Aligned_cols=16 Identities=13% Similarity=0.260 Sum_probs=9.1 Q ss_pred HHHHHHHHCCEEEEEE Q ss_conf 9999997198599980 Q gi|255764474|r 27 LCPIQRKLGQRLVVFG 42 (352) Q Consensus 27 ~~~~~~~~~~~~~~~~ 42 (352) +-..-+++|..+.+.. T Consensus 22 ~e~~A~~~G~~~~v~~ 37 (275) T cd06317 22 FQAAAEEDGVEVIVLD 37 (275) T ss_pred HHHHHHHCCCEEEEEC T ss_conf 9999997699899977 No 265 >COG1671 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.75 E-value=48 Score=12.71 Aligned_cols=20 Identities=35% Similarity=0.335 Sum_probs=13.0 Q ss_pred CCCEEEEEECCCCCCCHHCC Q ss_conf 00100111027884101002 Q gi|255764474|r 266 MASGIPVVASNTGVFSELLD 285 (352) Q Consensus 266 ma~G~PvI~s~~~~~~e~i~ 285 (352) .-.|=-||+.|+|-...++. T Consensus 64 a~~gDlVVT~Di~LA~~ll~ 83 (150) T COG1671 64 AEKGDLVVTADIPLASLLLD 83 (150) T ss_pred CCCCCEEEECCHHHHHHHHH T ss_conf 77899899776478999996 No 266 >pfam00308 Bac_DnaA Bacterial dnaA protein. Probab=21.74 E-value=48 Score=12.71 Aligned_cols=85 Identities=16% Similarity=0.155 Sum_probs=39.7 Q ss_pred CCCCCCCCEEEEEECCCCCCCHHCCCCC-----CCEEEE--ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 2100000100111027884101002348-----842499--659998999999999986989999999999999998299 Q gi|255764474|r 261 TPLEAMASGIPVVASNTGVFSELLDPEN-----AKAGVI--VPPRNLHALEKAVLYFMNSKKIMSDTGNRGRERAVKHFS 333 (352) Q Consensus 261 ~~lEAma~G~PvI~s~~~~~~e~i~~~~-----g~~G~~--~~~~d~~~la~~i~~l~~~~~~~~~~~~~a~~~~~~~fs 333 (352) .+-+.+..|.++|.|-.....++-.... =..|.. +++.|.+...+.|.....+... .+.+...+++.++++ T Consensus 122 l~N~~~~~~~~lllts~~~p~~l~~~~~dL~SRL~~g~~~~i~~pdd~~~~~iL~~~a~~r~l--~l~~~v~~yl~~r~~ 199 (219) T pfam00308 122 TFNALHENNKQIVLTSDRPPKELEGFEDRLRSRFEWGLIIAIEPPDLETRLAILRKKAEEENI--NIPNEVLNFIAQRIT 199 (219) T ss_pred HHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHCCEEEECCCCCHHHHHHHHHHHHHHCCC--CCCHHHHHHHHHHCC T ss_conf 999999729869997799810024532779999868756611699999999999999998499--999999999998427 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8999999999999988 Q gi|255764474|r 334 IVKEASDIGKVYDRLL 349 (352) Q Consensus 334 ~~~~a~~~~~iY~~~l 349 (352) +-.++++.+..++- T Consensus 200 --R~~r~L~~~L~~L~ 213 (219) T pfam00308 200 --DNVRELEGALNRLL 213 (219) T ss_pred --CCHHHHHHHHHHHH T ss_conf --98999999999999 No 267 >TIGR01527 arch_NMN_Atrans nicotinamide-nucleotide adenylyltransferase; InterPro: IPR006418 This family of archaeal proteins exhibits NAD salvage biosynthesis enzyme nicotinamide-nucleotide adenylyltransferase (2.7.7.1 from EC) activity. In some cases, the enzyme was tested and found also to have the activity of nicotinate-nucleotide adenylyltransferase (2.7.7.18 from EC), an enzyme of NAD de novo biosynthesis, although with a higher Km. In some archaeal species, a number of proteins which are uncharacterised with respect to activity, are also present.; GO: 0000309 nicotinamide-nucleotide adenylyltransferase activity, 0009435 NAD biosynthetic process, 0005737 cytoplasm. Probab=21.42 E-value=48 Score=12.67 Aligned_cols=71 Identities=15% Similarity=0.309 Sum_probs=36.2 Q ss_pred EEEEEECHH-HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC-CCCCC-CCCHHH Q ss_conf 995122644-512366654565300477405886313332100014667764431123322122222-23310-000000 Q gi|255764474|r 170 GCFGRIRKL-KGTDLFVDCMINILPHHPGWTAVVVGKTTLKHYLFKKNLQRRIYANGLKKRILFIDE-QSSIE-DWYRAL 246 (352) Q Consensus 170 ~~~G~~~~~-Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~-~~~~~-~~~~~a 246 (352) +|+||+-|. +|+..+|+...+ +- |=-.+.+|+..+.|. +++ -.--.+||.+... .++.. .++-.. T Consensus 3 fy~GRFQPFH~GH~~Vik~~l~---~V-DELiiGiGSAq~SHt-----~~N---PFTAGERv~MI~~sl~~~~rel~~d~ 70 (171) T TIGR01527 3 FYIGRFQPFHLGHLEVIKKILE---EV-DELIIGIGSAQESHT-----LEN---PFTAGERVLMITKSLKEVKRELMDDL 70 (171) T ss_pred CCCCCCCCCCCCHHHHHHHHHH---HC-CEEEEEEECCCCCCC-----CCC---CCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 3467768755213799998873---54-777987404886202-----228---97613078999999998655442213 Q ss_pred HHHCCC Q ss_conf 000012 Q gi|255764474|r 247 NIFVAP 252 (352) Q Consensus 247 di~i~p 252 (352) .+|+.| T Consensus 71 ~~Y~iP 76 (171) T TIGR01527 71 KVYIIP 76 (171) T ss_pred CEECCC T ss_conf 221114 No 268 >cd05085 PTKc_Fer Catalytic Domain of the Protein Tyrosine Kinase, Fer. Protein Tyrosine Kinase (PTK) family; Fer kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Fer kinase is a member of the Fes subfamily of proteins which are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal region with FCH (Fes/Fer/CIP4 homology) and coiled-coil domains, followed by a SH2 domain, and a C-terminal catalytic domain. Fer kinase is expressed in a wide variety of tissues, and is found to reside in both the cytoplasm and the nucleus. It plays important roles in neuronal polarization and neurite development, cytoskeletal reorganization, cell migration, growth factor signaling, and the regulation of cell-c Probab=21.34 E-value=24 Score=14.76 Aligned_cols=72 Identities=8% Similarity=0.136 Sum_probs=42.8 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-ECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHH Q ss_conf 00000000012333222222100000100111-027884101002348842499659998999999999986-9899999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVV-ASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSD 319 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI-~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~ 319 (352) +-..+|++ .||.++.|-++.|.+.- ........+.|. .+.. .-.+..-++++.+-+.+.++ ||+.|.. T Consensus 171 ~~~~sDV~-------SfGv~l~Ei~t~g~~P~~~~~~~~~~~~i~--~~~~-~~~p~~~~~~l~~li~~c~~~dP~~RPs 240 (250) T cd05085 171 YSSESDVW-------SYGILLWETFSLGVCPYPGMTNQQAREQVE--KGYR-MSCPQKCPDDVYKVMQRCWDYKPENRPK 240 (250) T ss_pred CCCCCCEE-------CHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCCHHHHHHHHHHHCCCHHHCCC T ss_conf 88303341-------149999999957999999999899999997--4788-9998656099999999980889667929 Q ss_pred HHHH Q ss_conf 9999 Q gi|255764474|r 320 TGNR 323 (352) Q Consensus 320 ~~~~ 323 (352) +.+= T Consensus 241 ~~ei 244 (250) T cd05085 241 FSEL 244 (250) T ss_pred HHHH T ss_conf 9999 No 269 >TIGR00314 cdhA CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR004460 Acetyl-CoA decarbonylase/synthase (ACDS) is a multienzyme complex (carbon monoxide dehydrogenase is a synonym). The ACDS complex carries out an unusual reaction involving the reversible cleavage and synthesis of acetyl-CoA in methanogens. The family contains the 4Fe-4S ferredoxin domain [C-x(2)-C-x(2)-C-x(3)-C-[PEG]] between residues 448-462 of the model.; GO: 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0006084 acetyl-CoA metabolic process. Probab=21.21 E-value=49 Score=12.64 Aligned_cols=72 Identities=14% Similarity=0.202 Sum_probs=49.0 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCC---CCCCCCCCCHHHHHHCCCCCC-----CCCCCC------------CCCCCCCEEE Q ss_conf 01466776443112332212222---223310000000000012333-----222222------------1000001001 Q gi|255764474|r 212 LFKKNLQRRIYANGLKKRILFID---EQSSIEDWYRALNIFVAPPLY-----EGFGLT------------PLEAMASGIP 271 (352) Q Consensus 212 ~~~~~l~~~i~~~~l~~~V~~~g---~~~~~~~~~~~adi~i~pS~~-----Eg~gl~------------~lEAma~G~P 271 (352) .+...+.+.+++.+++|.|.+-| .-.|+..+++.|-++--.|++ -|.+=| +-|+.=.|.| T Consensus 251 ~p~~~i~dYleEn~~eD~ve~~GiCCtA~D~TRy~~kaKvvgpls~QL~~iRaG~aDV~v~DEQCiRaD~Le~~~K~g~~ 330 (795) T TIGR00314 251 LPGADIADYLEENGMEDEVELCGICCTAIDMTRYNEKAKVVGPLSRQLRVIRAGVADVIVVDEQCIRADILEEVKKLGIP 330 (795) T ss_pred CCHHHHHHHHHHCCCCCCEEEECCCCCCCCHHHHCCCCCEECCCHHCEEEEECCCCCEEEECCCCHHHHHHHHHHHCCCC T ss_conf 45125788875168403045303000431010010478345320101044351787379974730127899999865860 Q ss_pred EEECCC---CCCCHH Q ss_conf 110278---841010 Q gi|255764474|r 272 VVASNT---GVFSEL 283 (352) Q Consensus 272 vI~s~~---~~~~e~ 283 (352) +|+|+- -|++|+ T Consensus 331 ~IaT~~k~~~gLpD~ 345 (795) T TIGR00314 331 LIATNDKACLGLPDV 345 (795) T ss_pred EEECCHHHHHCCCCC T ss_conf 673417885138777 No 270 >CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed Probab=21.18 E-value=49 Score=12.64 Aligned_cols=21 Identities=5% Similarity=0.270 Sum_probs=9.5 Q ss_pred HHHHHHCCCCCCCCEEEEEEEEC Q ss_conf 68788728898872799951226 Q gi|255764474|r 154 QEARRHLKISEDAKLIGCFGRIR 176 (352) Q Consensus 154 ~~~~~~~~~~~~~~~i~~~G~~~ 176 (352) .+.+++.+..+ -++...|.+. T Consensus 121 ~~arkkTg~~d--Avv~g~G~I~ 141 (305) T CHL00174 121 DSYQKKTGLTD--AVQTGIGQLN 141 (305) T ss_pred HHHHHHHCCCE--EEEEEEEEEC T ss_conf 99999719980--7999999999 No 271 >PRK09219 xanthine phosphoribosyltransferase; Validated Probab=20.89 E-value=36 Score=13.53 Aligned_cols=35 Identities=11% Similarity=0.041 Sum_probs=18.5 Q ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 54325970899628546899999999709981999 Q gi|255764474|r 63 KPIGQNSRIWHARRNNEMLLGVMMRDVLRMPLKLV 97 (352) Q Consensus 63 ~~~~~~~~ivh~~~~~~~~~~~~~~~~~~~~~~~~ 97 (352) ......+|-+-.=.......+...+..++.|+.+. T Consensus 45 ~F~~~~IdkIvTiEasGI~~A~~~A~~l~vPlV~a 79 (189) T PRK09219 45 RFKDAGITKILTIEASGIAPAVMAALALGVPVVFA 79 (189) T ss_pred HHCCCCCCEEEEEECCCCHHHHHHHHHHCCCEEEE T ss_conf 84578998899984158578999999859999999 No 272 >pfam10649 DUF2478 Protein of unknown function (DUF2478). This is a family of hypothetical bacterial proteins found in the vicinity of Molybdenum ABC transporter ATP-binding gene-products MobA MobB and MobC. However the function could not be confirmed. This family appears to belong to the P-loop superfamily by alignment to pfam03266. However, the characteristic P-loop sequence motif appears to have diverged beyond recognition in this family. Probab=20.66 E-value=18 Score=15.48 Aligned_cols=56 Identities=23% Similarity=0.322 Sum_probs=29.4 Q ss_pred HHHHHCCC--CCC--CCCC--CCCCCCCCCEEEEEEC-CCCCCCHHCCCCCCCEEEEECCCCHHH Q ss_conf 00000012--333--2222--2210000010011102-788410100234884249965999899 Q gi|255764474|r 245 ALNIFVAP--PLY--EGFG--LTPLEAMASGIPVVAS-NTGVFSELLDPENAKAGVIVPPRNLHA 302 (352) Q Consensus 245 ~adi~i~p--S~~--Eg~g--l~~lEAma~G~PvI~s-~~~~~~e~i~~~~g~~G~~~~~~d~~~ 302 (352) .+|+++.- ... ||-| -.+.||++.|+||++. +-...+.+... .|..+...++ |.++ T Consensus 93 g~dLlIvNkFGk~Ea~G~Glr~~i~~Al~~giPvLt~V~~~~~~~W~~F-agg~a~~L~~-d~~a 155 (159) T pfam10649 93 GADLLIVNKFGKQEAEGRGLRAEIAEALAAGIPVLTAVPPRNLEAWRAF-AGGLAEELPP-DREA 155 (159) T ss_pred CCCEEEECCCHHHHHCCCCHHHHHHHHHHCCCCEEEEECHHHHHHHHHH-CCCCCCCCCC-CHHH T ss_conf 9989998142466614997699999999769988998898888999985-2776765799-9888 No 273 >pfam01866 Diphthamide_syn Putative diphthamide synthesis protein. Human DPHL2 is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein synthesis by ADP-ribosylating diphthamide, a posttranslationally modified histidine residue present in EF2. The exact function of the members of this family is unknown. Probab=20.51 E-value=50 Score=12.55 Aligned_cols=74 Identities=15% Similarity=0.275 Sum_probs=45.4 Q ss_pred CEEEEEECC-CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 405886313-3321000146677644311233221222222--3310000000000012333222222100000100111 Q gi|255764474|r 197 GWTAVVVGK-TTLKHYLFKKNLQRRIYANGLKKRILFIDEQ--SSIEDWYRALNIFVAPPLYEGFGLTPLEAMASGIPVV 273 (352) Q Consensus 197 ~~~l~i~G~-g~~~~~~~~~~l~~~i~~~~l~~~V~~~g~~--~~~~~~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI 273 (352) +.--+++|. +-+......+.+++++++.|-.-.+..+|.. +.+..+ ..|+||..+--+ +++.+.--..+|+| T Consensus 204 ~~~GIlv~tl~~q~~~~i~~~l~~ll~~~gkk~y~~~v~~in~~kL~nf--~iD~fV~vaCPr---~sidd~~~f~kPil 278 (300) T pfam01866 204 KSFGIIVGTKGGQGRPDLARRLKKLLKERGKRAYLILLDEINPDKLANF--DVDAFVQTACPR---LSIDDGYAFPKPLL 278 (300) T ss_pred CEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHCC--CCCEEEEECCCC---CCCCCHHHCCCCCC T ss_conf 8479999677656879999999999998599089998388699998379--978899965997---42156766896556 Q ss_pred EC Q ss_conf 02 Q gi|255764474|r 274 AS 275 (352) Q Consensus 274 ~s 275 (352) +. T Consensus 279 TP 280 (300) T pfam01866 279 TP 280 (300) T ss_pred CH T ss_conf 79 No 274 >PRK08243 4-hydroxybenzoate 3-monooxygenase; Validated Probab=20.42 E-value=51 Score=12.53 Aligned_cols=71 Identities=17% Similarity=0.259 Sum_probs=47.1 Q ss_pred CCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC--CHHHHHHHHHHHHHHH--HHHCCHHH Q ss_conf 100000100111027884101002348842499659998999999999986--9899999999999999--98299899 Q gi|255764474|r 262 PLEAMASGIPVVASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN--SKKIMSDTGNRGRERA--VKHFSIVK 336 (352) Q Consensus 262 ~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~--~~~~~~~~~~~a~~~~--~~~fs~~~ 336 (352) +.|.|..|.-.++-| ...++.+. |.-|+=.-..|...|+.+|...+. +++.....++.+-.++ .++|||-. T Consensus 272 vae~~r~GRvfLaGD---AAHi~pP~-GgqGmN~gi~DA~nLawkLaa~l~~~~~~~ld~Ys~~~l~rvw~~~~fs~~~ 346 (392) T PRK08243 272 VAEPMQFGRLFLAGD---AAHIVPPT-GAKGLNLAASDVRYLSRALVEFYREGDTDGLDAYSATALARVWKAERFSWWM 346 (392) T ss_pred HHHHHCCCCEEEEEE---CCCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 008665388999961---00327840-3446758999999999999999842986777787799999999999988999 No 275 >cd05033 PTKc_EphR Catalytic Domain of Ephrin Receptor Protein Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). They can be classified into two classes (EphA and EphB), according to their extracellular sequences, which largely correspond to binding preferences for either GPI-anchored ephrin-A ligands or transmembrane ephrin-B ligands. Vertebrates have ten EphA and six EhpB receptors, which display promiscuous ligand interactions within each class. EphRs contain an ephrin binding domain and two fibronectin repeats extracellularly, a transmembrane segment Probab=20.25 E-value=30 Score=14.11 Aligned_cols=69 Identities=16% Similarity=0.200 Sum_probs=35.8 Q ss_pred CCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCCCCCCEEEEE--CCCCHHHHHHHHHHHHC-CHHHHH Q ss_conf 000000000123332222221000001001110278841010023488424996--59998999999999986-989999 Q gi|255764474|r 242 WYRALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDPENAKAGVIV--PPRNLHALEKAVLYFMN-SKKIMS 318 (352) Q Consensus 242 ~~~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~~~g~~G~~~--~~~d~~~la~~i~~l~~-~~~~~~ 318 (352) +-..+|++ .||.++.|-|+.|.+..... ...+++.. -..|.-. +.+-++++.+-+.+-++ ||+.|. T Consensus 185 ~~~~~DV~-------SfGv~l~El~t~g~~P~~~~--~~~~~~~~--i~~~~r~p~p~~~~~~~~~li~~C~~~~P~~RP 253 (266) T cd05033 185 FTSASDVW-------SFGIVMWEVMSYGERPYWDM--SNQDVIKA--VEDGYRLPPPMDCPSALYQLMLDCWQKDRNERP 253 (266) T ss_pred CCCCCCCC-------CCHHHHHHHHHCCCCCCCCC--CHHHHHHH--HHHCCCCCCCCCCHHHHHHHHHHHCCCCHHHCC T ss_conf 88647676-------24999999993799998889--99999999--971687999643689999999998087965791 Q ss_pred HHH Q ss_conf 999 Q gi|255764474|r 319 DTG 321 (352) Q Consensus 319 ~~~ 321 (352) .+. T Consensus 254 s~~ 256 (266) T cd05033 254 TFS 256 (266) T ss_pred CHH T ss_conf 999 No 276 >cd05090 PTKc_Ror1 Catalytic Domain of the Protein Tyrosine Kinase, Receptor tyrosine kinase-like Orphan Receptor 1. Protein Tyrosine Kinase (PTK) family; Receptor tyrosine kinase-like orphan receptor 1 (Ror1); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ror proteins are orphan receptor tyr kinases (RTKs) containing an extracellular region with immunoglobulin-like, cysteine-rich, and kringle domains, a transmembrane segment, and an intracellular catalytic domain. Ror RTKs are unrelated to the nuclear receptor subfamily called retinoid-related orphan receptors (RORs). RTKs are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase cataly Probab=20.24 E-value=32 Score=13.90 Aligned_cols=70 Identities=9% Similarity=0.063 Sum_probs=39.8 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-ECCCCCCCHHCCCCCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 000000012333222222100000100111-027884101002348842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVV-ASNTGVFSELLDPENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI-~s~~~~~~e~i~~~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-+++|.+.- ......+-+.+. .+.. .-.+.+-+..+.+-|..-++ ||+.|..+. T Consensus 205 ~~sDV~-------SfGiil~Eilt~g~~P~~~~~~~~~~~~v~--~~~~-~p~~~~~~~~l~~li~~C~~~dP~~RPs~~ 274 (283) T cd05090 205 SDSDIW-------SFGVVLWEIFSFGLQPYYGFSNQEVIEMVR--KRQL-LPCSEDCPPRMYSLMTECWQEGPSRRPRFK 274 (283) T ss_pred CCCCEE-------CHHHHHHHHHHCCCCCCCCCCHHHHHHHHH--CCCC-CCCCCCCHHHHHHHHHHHHCCCHHHCCCHH T ss_conf 535450-------078999999808999999979999999997--1898-999865639999999998176965790999 Q ss_pred HH Q ss_conf 99 Q gi|255764474|r 322 NR 323 (352) Q Consensus 322 ~~ 323 (352) += T Consensus 275 dI 276 (283) T cd05090 275 DI 276 (283) T ss_pred HH T ss_conf 99 No 277 >PRK08782 consensus Probab=20.16 E-value=51 Score=12.50 Aligned_cols=19 Identities=11% Similarity=0.144 Sum_probs=8.5 Q ss_pred CCCCCCCCCCHHHHHHCCC Q ss_conf 2223310000000000012 Q gi|255764474|r 234 DEQSSIEDWYRALNIFVAP 252 (352) Q Consensus 234 g~~~~~~~~~~~adi~i~p 252 (352) +..+++..+-...++...| T Consensus 97 ~~~~~v~~~a~~~~i~~iP 115 (219) T PRK08782 97 GTPAPLARLLADAPIPAVP 115 (219) T ss_pred CCCHHHHHHHHHCCCCEEC T ss_conf 9979999999981997647 No 278 >cd05046 PTK_CCK4 Catalytic Domain of the Protein Tyrosine Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine Kinase (PTK) family; Colon carcinoma kinase 4 (CCK4); pseudokinase domain. The PTKc (catalytic domain) family, to which this subfamily belongs, includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CCK4, also called protein tyrosine kinase 7 (PTK7), is an orphan receptor tyr kinase (RTK) containing an extracellular region with seven immunoglobulin domains, a transmembrane segment, and an intracellular inactive pseudokinase domain. Studies in mice reveal that CCK4 is essential for neural development. Mouse embryos containing a truncated CCK4 die perinatally and display craniorachischisis, a severe form of neural tube defect. The mechanism of action of the CCK4 pseudokinase is stil Probab=20.06 E-value=28 Score=14.26 Aligned_cols=70 Identities=17% Similarity=0.146 Sum_probs=38.9 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHCCC-CCCCEEEEECCCCHHHHHHHHHHHHC-CHHHHHHHH Q ss_conf 0000000123332222221000001001110278841010023-48842499659998999999999986-989999999 Q gi|255764474|r 244 RALNIFVAPPLYEGFGLTPLEAMASGIPVVASNTGVFSELLDP-ENAKAGVIVPPRNLHALEKAVLYFMN-SKKIMSDTG 321 (352) Q Consensus 244 ~~adi~i~pS~~Eg~gl~~lEAma~G~PvI~s~~~~~~e~i~~-~~g~~G~~~~~~d~~~la~~i~~l~~-~~~~~~~~~ 321 (352) ..+|++ +||.++.|-++.|.+.-... . ..+++.. ..|..-+-.+..-++++.+-+...++ ||+.|..+. T Consensus 197 ~ksDIw-------S~Gvil~El~t~g~~Pf~~~-~-~~~~~~~i~~~~~~~~~~~~~~~~l~~li~~cl~~dP~~RPs~~ 267 (275) T cd05046 197 TKSDVW-------SFGVLMWEVFTQGELPFYGL-S-DEEVLNRLQAGKLELPVPEGCPSRLYKLMTRCWAVNPKDRPSFS 267 (275) T ss_pred CCCCCC-------CHHHHHHHHHHCCCCCCCCC-C-HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHCCCHH T ss_conf 756765-------59999999983899999998-9-99999999728978888865679999999998488955793999 Q ss_pred H Q ss_conf 9 Q gi|255764474|r 322 N 322 (352) Q Consensus 322 ~ 322 (352) + T Consensus 268 e 268 (275) T cd05046 268 E 268 (275) T ss_pred H T ss_conf 9 Done!