Query gi|254780989|ref|YP_003065402.1| GCN5-related N-acetyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 185 No_of_seqs 157 out of 1797 Neff 7.7 Searched_HMMs 39220 Date Mon May 30 03:54:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780989.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3153 Predicted acetyltransf 100.0 1.8E-36 4.7E-41 240.0 17.1 166 12-185 2-169 (171) 2 PRK03624 putative acetyltransf 99.9 4.5E-21 1.1E-25 147.2 14.8 124 12-146 1-130 (140) 3 PRK09491 rimI ribosomal-protei 99.8 1.9E-19 4.9E-24 137.4 14.9 137 15-166 1-144 (144) 4 PRK10146 aminoalkylphosphonic 99.8 3.2E-19 8.3E-24 136.0 14.0 130 11-146 1-138 (144) 5 PTZ00330 acetyltransferase; Pr 99.8 1.3E-18 3.3E-23 132.5 15.6 131 12-146 5-141 (147) 6 PRK07757 acetyltransferase; Pr 99.8 8E-19 2E-23 133.7 14.0 140 14-166 2-151 (152) 7 PRK10140 putative acetyltransf 99.8 3.5E-18 8.9E-23 129.8 12.2 150 11-164 1-160 (162) 8 PRK07922 N-acetylglutamate syn 99.8 1.8E-17 4.7E-22 125.5 12.5 149 12-173 5-163 (170) 9 TIGR01575 rimI ribosomal-prote 99.7 5.6E-17 1.4E-21 122.6 11.8 133 23-162 1-140 (140) 10 TIGR03448 mycothiol_MshD mycot 99.7 3.2E-16 8.3E-21 118.0 13.2 134 11-147 147-289 (292) 11 PRK10975 TDP-fucosamine acetyl 99.7 7.4E-16 1.9E-20 115.8 12.5 129 12-146 81-224 (231) 12 PRK10562 hypothetical protein; 99.7 1.8E-15 4.6E-20 113.5 11.3 132 16-163 2-139 (145) 13 PRK01346 hypothetical protein; 99.6 2.2E-15 5.7E-20 112.9 11.2 129 10-147 3-137 (411) 14 PRK10514 hypothetical protein; 99.6 2E-15 5E-20 113.3 9.8 124 14-150 2-130 (146) 15 COG1246 ArgA N-acetylglutamate 99.6 5.9E-15 1.5E-19 110.4 10.9 124 14-149 1-126 (153) 16 COG1247 Sortase and related ac 99.6 1.9E-14 4.7E-19 107.4 12.7 158 13-170 1-168 (169) 17 PRK12308 bifunctional arginino 99.6 2.5E-15 6.5E-20 112.6 7.7 125 11-148 461-586 (614) 18 PRK05279 N-acetylglutamate syn 99.6 2.6E-14 6.7E-19 106.5 12.2 122 15-148 296-419 (441) 19 PRK09831 hypothetical protein; 99.6 1.7E-14 4.2E-19 107.7 9.1 138 14-165 1-144 (147) 20 KOG3139 consensus 99.5 2E-13 5.1E-18 101.2 13.1 155 2-166 2-164 (165) 21 COG0456 RimI Acetyltransferase 99.5 1E-13 2.6E-18 102.9 10.9 149 7-165 5-172 (177) 22 cd04301 GNAT GCN5-related N-ac 99.5 1.2E-13 3.1E-18 102.5 10.8 76 66-142 2-83 (83) 23 KOG3216 consensus 99.4 5E-12 1.3E-16 92.8 10.8 131 11-146 1-146 (163) 24 pfam00583 Acetyltransf_1 Acety 99.4 3E-12 7.6E-17 94.1 9.2 74 68-142 1-80 (80) 25 PRK10314 hypothetical protein; 99.3 8.8E-11 2.3E-15 85.2 12.7 139 15-167 8-151 (153) 26 TIGR03103 trio_acet_GNAT GNAT- 99.2 1.8E-10 4.7E-15 83.3 10.5 141 11-162 80-232 (547) 27 TIGR01890 N-Ac-Glu-synth amino 99.1 5.5E-11 1.4E-15 86.5 3.0 126 13-150 291-418 (439) 28 pfam08445 FR47 FR47-like prote 99.0 2.7E-09 6.8E-14 76.3 8.4 72 67-145 5-81 (86) 29 COG2153 ElaA Predicted acyltra 99.0 2E-09 5.2E-14 77.0 7.8 80 66-146 52-136 (155) 30 KOG3235 consensus 99.0 2.1E-09 5.4E-14 76.9 7.1 143 13-168 1-155 (193) 31 PRK10809 ribosomal-protein-S5- 98.9 5.3E-08 1.3E-12 68.5 13.2 143 4-147 8-167 (194) 32 KOG3397 consensus 98.9 5.3E-09 1.3E-13 74.5 7.5 127 10-145 9-140 (225) 33 KOG4144 consensus 98.9 1E-09 2.6E-14 78.8 2.6 140 12-152 10-167 (190) 34 TIGR03448 mycothiol_MshD mycot 98.8 4.3E-08 1.1E-12 69.1 10.5 119 19-147 6-129 (292) 35 COG4552 Eis Predicted acetyltr 98.8 1.2E-08 2.9E-13 72.5 6.9 115 23-147 8-128 (389) 36 KOG3396 consensus 98.8 4.3E-08 1.1E-12 69.1 9.8 132 12-147 5-145 (150) 37 KOG3138 consensus 98.8 1.4E-08 3.6E-13 71.9 6.9 147 13-170 16-176 (187) 38 TIGR02382 wecD_rffC TDP-D-fuco 98.8 7.8E-09 2E-13 73.5 5.5 125 11-141 51-190 (201) 39 TIGR02406 ectoine_EctA L-2,4-d 98.8 3.9E-08 9.9E-13 69.3 8.8 119 16-144 1-130 (162) 40 COG1444 Predicted P-loop ATPas 98.8 2.6E-07 6.5E-12 64.4 12.3 80 90-171 531-615 (758) 41 PRK10151 ribosomal-protein-L7/ 98.7 6.5E-07 1.7E-11 61.9 13.1 153 12-167 9-177 (179) 42 KOG2488 consensus 98.7 1.9E-07 5E-12 65.1 9.7 122 22-147 54-183 (202) 43 TIGR03585 PseH pseudaminic aci 98.6 9.7E-07 2.5E-11 60.9 12.1 125 16-147 3-139 (156) 44 COG3393 Predicted acetyltransf 98.6 1E-07 2.6E-12 66.7 6.9 128 10-147 130-263 (268) 45 COG3981 Predicted acetyltransf 98.6 5.1E-07 1.3E-11 62.6 9.5 123 11-136 1-143 (174) 46 TIGR01211 ELP3 histone acetylt 98.4 3.6E-07 9.1E-12 63.5 3.7 85 63-147 459-568 (573) 47 KOG3234 consensus 98.2 4.5E-06 1.2E-10 56.9 7.2 145 15-171 3-156 (173) 48 COG1670 RimL Acetyltransferase 98.2 6.8E-05 1.7E-09 49.8 12.1 139 11-150 7-162 (187) 49 PHA00673 acetyltransferase dom 98.1 1.3E-05 3.4E-10 54.0 8.0 84 63-146 55-146 (154) 50 cd02169 Citrate_lyase_ligase C 98.1 1.5E-05 3.8E-10 53.7 8.1 70 69-146 12-84 (296) 51 pfam04958 AstA Arginine N-succ 98.1 3.9E-05 1E-09 51.2 8.8 127 14-140 1-181 (338) 52 COG2388 Predicted acetyltransf 98.0 4E-05 1E-09 51.1 7.5 66 63-131 15-80 (99) 53 PRK10456 arginine succinyltran 97.9 6.8E-05 1.7E-09 49.7 8.4 127 14-140 2-180 (344) 54 TIGR03244 arg_catab_AstA argin 97.9 8.5E-05 2.2E-09 49.2 8.8 127 15-141 1-179 (336) 55 COG3818 Predicted acetyltransf 97.9 1.3E-05 3.2E-10 54.2 3.7 129 7-150 4-152 (167) 56 COG3053 CitC Citrate lyase syn 97.8 0.00044 1.1E-08 44.9 11.2 108 69-184 43-157 (352) 57 TIGR03243 arg_catab_AOST argin 97.8 8.2E-05 2.1E-09 49.3 7.2 127 15-141 1-179 (335) 58 TIGR03245 arg_AOST_alph argini 97.7 0.0003 7.6E-09 45.9 8.4 126 15-140 1-179 (336) 59 COG1243 ELP3 Histone acetyltra 97.5 0.00012 3E-09 48.4 4.0 78 70-147 414-510 (515) 60 pfam00765 Autoind_synth Autoin 97.4 0.006 1.5E-07 38.1 11.9 124 17-144 3-153 (182) 61 COG3375 Uncharacterized conser 97.4 0.0024 6E-08 40.5 9.3 121 15-142 4-133 (266) 62 COG3882 FkbH Predicted enzyme 97.2 0.0023 5.7E-08 40.6 7.9 131 13-152 413-556 (574) 63 PRK13688 hypothetical protein; 97.1 0.00031 7.8E-09 45.8 2.4 91 50-146 34-133 (157) 64 COG0454 WecD Histone acetyltra 97.1 0.00043 1.1E-08 44.9 3.0 44 96-141 87-130 (156) 65 pfam02474 NodA Nodulation prot 97.0 0.0031 7.8E-08 39.8 6.5 115 20-140 16-137 (196) 66 PRK00756 acyltransferase NodA; 96.8 0.0053 1.4E-07 38.4 6.9 115 20-140 16-137 (196) 67 KOG3698 consensus 96.6 0.031 7.8E-07 33.8 9.5 140 2-145 672-877 (891) 68 pfam08444 Gly_acyl_tr_C Aralky 96.4 0.015 3.8E-07 35.7 7.0 68 70-144 6-78 (89) 69 pfam09390 DUF1999 Protein of u 96.4 0.015 3.9E-07 35.6 6.9 135 14-149 1-144 (161) 70 COG3138 AstA Arginine/ornithin 96.4 0.0068 1.7E-07 37.7 5.1 101 14-114 2-143 (336) 71 pfam01233 NMT Myristoyl-CoA:pr 96.2 0.11 2.8E-06 30.5 10.6 111 19-129 32-150 (162) 72 TIGR01686 FkbH FkbH domain; In 95.9 0.017 4.4E-07 35.2 5.2 119 13-142 197-334 (337) 73 KOG2036 consensus 95.9 0.022 5.5E-07 34.7 5.7 82 89-171 613-749 (1011) 74 PHA01807 hypothetical protein 95.9 0.071 1.8E-06 31.6 8.2 66 62-130 49-118 (145) 75 pfam02799 NMT_C Myristoyl-CoA: 95.8 0.16 4E-06 29.5 10.6 118 15-144 30-163 (190) 76 PTZ00064 histone acetyltransfe 95.8 0.019 4.9E-07 35.0 4.9 58 63-125 270-331 (442) 77 PRK13834 putative autoinducer 95.7 0.14 3.6E-06 29.7 9.2 128 24-156 18-174 (207) 78 pfam01853 MOZ_SAS MOZ/SAS fami 95.6 0.03 7.7E-07 33.8 5.3 59 64-125 53-115 (189) 79 KOG2535 consensus 95.2 0.018 4.7E-07 35.1 3.2 48 100-147 497-548 (554) 80 KOG2747 consensus 95.1 0.056 1.4E-06 32.2 5.5 61 62-125 233-295 (396) 81 pfam05301 Mec-17 Touch recepto 94.9 0.1 2.6E-06 30.6 6.2 60 72-134 18-88 (120) 82 COG5628 Predicted acetyltransf 94.8 0.069 1.8E-06 31.6 5.2 98 66-167 40-142 (143) 83 COG3916 LasI N-acyl-L-homoseri 94.6 0.37 9.6E-06 27.2 10.9 152 16-173 9-188 (209) 84 COG2401 ABC-type ATPase fused 93.6 0.064 1.6E-06 31.8 2.9 132 17-148 102-310 (593) 85 TIGR00124 cit_ly_ligase [citra 93.3 0.68 1.7E-05 25.7 8.0 78 63-148 32-115 (342) 86 TIGR03019 pepcterm_femAB FemAB 91.4 1.2 3E-05 24.2 12.0 128 10-146 148-281 (330) 87 PHA01733 hypothetical protein 91.1 1.1 2.8E-05 24.4 6.7 122 16-147 5-133 (153) 88 COG5027 SAS2 Histone acetyltra 89.4 0.16 4E-06 29.5 1.2 58 63-123 234-295 (395) 89 PRK01305 arginyl-tRNA-protein 89.1 1.9 4.8E-05 23.0 8.7 74 63-142 144-219 (241) 90 KOG2779 consensus 88.2 2.1 5.5E-05 22.7 8.2 52 72-123 145-200 (421) 91 KOG4135 consensus 87.1 1.3 3.3E-05 24.0 4.7 51 96-146 113-170 (185) 92 pfam04377 ATE_C Arginine-tRNA- 86.7 2.6 6.7E-05 22.1 7.2 62 63-130 39-102 (128) 93 KOG2696 consensus 85.8 1 2.6E-05 24.6 3.6 42 92-133 219-260 (403) 94 KOG4601 consensus 82.8 1.6 4E-05 23.5 3.5 60 72-134 81-150 (264) 95 pfam09924 DUF2156 Uncharacteri 80.2 4.9 0.00012 20.5 8.3 58 63-123 179-236 (250) 96 pfam06852 DUF1248 Protein of u 77.6 5.9 0.00015 20.0 6.3 78 63-140 46-131 (180) 97 pfam02388 FemAB FemAB family. 72.2 8.1 0.00021 19.2 10.4 35 97-131 324-358 (407) 98 cd03173 DUF619-like DUF619-lik 71.0 8.7 0.00022 19.0 6.0 48 69-118 15-62 (99) 99 cd04266 DUF619-NAGS-FABP DUF61 69.1 9.5 0.00024 18.8 5.3 53 89-144 38-97 (108) 100 pfam07395 Mig-14 Mig-14. This 65.2 11 0.00029 18.3 4.3 70 64-135 174-249 (264) 101 pfam12261 T_hemolysin Thermost 64.9 12 0.00029 18.3 7.5 50 91-144 89-141 (180) 102 TIGR02109 PQQ_syn_pqqE coenzym 62.5 10 0.00026 18.6 3.5 44 96-139 193-236 (363) 103 PRK12568 glycogen branching en 62.1 6.5 0.00016 19.8 2.4 59 109-167 269-329 (730) 104 PRK12313 glycogen branching en 61.0 5.5 0.00014 20.2 1.9 59 109-167 169-229 (632) 105 COG2898 Uncharacterized conser 60.5 14 0.00035 17.8 8.5 92 67-162 397-510 (538) 106 TIGR01490 HAD-SF-IB-hyp1 HAD-s 59.5 13 0.00032 18.0 3.5 35 112-146 94-133 (204) 107 PRK05402 glycogen branching en 59.2 8.5 0.00022 19.0 2.6 59 107-165 268-328 (730) 108 COG5630 ARG2 Acetylglutamate s 57.9 15 0.00039 17.5 6.4 86 22-120 345-431 (495) 109 TIGR00455 apsK adenylylsulfate 57.3 13 0.00034 17.9 3.4 46 106-151 29-79 (187) 110 pfam04768 DUF619 Protein of un 55.6 17 0.00043 17.3 7.3 110 23-150 33-148 (168) 111 COG5653 Protein involved in ce 52.0 19 0.00049 16.9 11.8 136 13-152 214-358 (406) 112 cd04265 DUF619-NAGS-U DUF619-N 45.9 24 0.00061 16.3 5.2 32 88-119 28-59 (95) 113 KOG0470 consensus 45.4 13 0.00033 17.9 1.7 40 108-147 253-295 (757) 114 PRK09936 hypothetical protein; 45.2 25 0.00063 16.3 3.6 31 109-139 71-102 (294) 115 PRK02186 argininosuccinate lya 43.7 25 0.00065 16.2 3.0 17 109-125 237-253 (892) 116 cd04736 MDH_FMN Mandelate dehy 41.8 16 0.0004 17.4 1.7 27 110-136 128-154 (361) 117 TIGR02277 PaaX_trns_reg phenyl 41.0 14 0.00037 17.7 1.4 78 87-167 14-96 (287) 118 COG4671 Predicted glycosyl tra 38.9 31 0.00079 15.7 3.9 18 119-136 163-180 (400) 119 smart00475 53EXOc 5'-3' exonuc 38.7 23 0.00058 16.5 2.1 70 108-177 108-191 (259) 120 TIGR01885 Orn_aminotrans ornit 37.6 20 0.00051 16.8 1.7 10 134-143 161-171 (426) 121 COG2935 Putative arginyl-tRNA: 37.6 32 0.00083 15.6 6.2 67 71-143 159-227 (253) 122 TIGR01323 nitrile_alph nitrile 36.9 33 0.00085 15.5 3.6 64 98-166 46-142 (186) 123 PRK13210 putative L-xylulose 5 36.2 33 0.00085 15.5 2.6 33 97-131 83-115 (284) 124 COG2243 CobF Precorrin-2 methy 35.6 35 0.00089 15.4 2.8 23 122-144 89-112 (234) 125 cd04264 DUF619-NAGS DUF619-NAG 34.7 36 0.00092 15.3 4.7 31 88-118 32-62 (100) 126 TIGR00222 panB 3-methyl-2-oxob 33.4 29 0.00075 15.8 2.0 51 73-133 135-187 (267) 127 TIGR00099 Cof-subfamily Cof-li 32.1 40 0.001 15.0 3.4 32 104-135 85-119 (270) 128 TIGR01488 HAD-SF-IB HAD-superf 31.5 41 0.001 14.9 3.3 33 111-143 92-129 (204) 129 TIGR02412 pepN_strep_liv amino 30.6 23 0.0006 16.4 1.1 75 100-184 218-294 (884) 130 TIGR01467 cobI_cbiL precorrin- 30.0 43 0.0011 14.8 2.5 38 110-147 88-125 (243) 131 pfam03928 DUF336 Domain of unk 29.8 44 0.0011 14.8 3.5 23 108-130 8-30 (132) 132 pfam02334 RTP Replication term 29.0 45 0.0011 14.7 2.7 24 102-125 28-51 (122) 133 KOG2509 consensus 27.8 47 0.0012 14.6 2.9 50 107-156 187-238 (455) 134 TIGR02768 TraA_Ti Ti-type conj 27.4 48 0.0012 14.5 3.4 37 107-144 513-562 (888) 135 PRK05576 cobalt-precorrin-2 C( 27.0 46 0.0012 14.6 2.1 23 121-143 88-111 (224) 136 TIGR02482 PFKA_ATP 6-phosphofr 27.0 49 0.0012 14.5 3.5 41 113-156 81-128 (302) 137 TIGR00542 hxl6Piso_put hexulos 27.0 49 0.0012 14.5 3.3 46 95-142 80-126 (290) 138 PRK05990 precorrin-2 C(20)-met 26.3 50 0.0013 14.4 2.7 23 121-143 96-119 (241) 139 PRK13795 hypothetical protein; 26.1 51 0.0013 14.4 5.9 60 16-82 45-104 (630) 140 TIGR01500 sepiapter_red sepiap 26.1 45 0.0011 14.7 1.9 14 105-118 10-23 (267) 141 PRK09482 xni exonuclease IX; P 26.0 51 0.0013 14.4 2.8 70 108-177 108-187 (256) 142 pfam06849 DUF1246 Protein of u 26.0 51 0.0013 14.4 2.9 29 114-142 11-41 (124) 143 TIGR02089 TTC tartrate dehydro 25.9 51 0.0013 14.4 2.8 29 102-131 162-190 (355) 144 TIGR01263 4HPPD 4-hydroxypheny 25.8 44 0.0011 14.8 1.8 15 134-148 187-202 (379) 145 COG5092 NMT1 N-myristoyl trans 25.6 52 0.0013 14.3 8.1 31 93-123 168-198 (451) 146 cd00008 53EXOc 5'-3' exonuclea 25.5 29 0.00075 15.8 0.9 70 108-177 109-188 (240) 147 COG3623 SgaU Putative L-xylulo 25.3 53 0.0013 14.3 2.9 39 98-138 86-124 (287) 148 TIGR03175 AllD ureidoglycolate 24.5 54 0.0014 14.2 4.5 33 107-139 89-126 (349) 149 PRK10513 sugar phosphatase; Pr 24.5 55 0.0014 14.2 2.6 15 19-33 21-35 (270) 150 COG0296 GlgB 1,4-alpha-glucan 24.0 56 0.0014 14.1 3.3 48 109-156 164-213 (628) 151 PRK05431 seryl-tRNA synthetase 23.6 57 0.0014 14.1 3.2 43 107-149 172-216 (422) 152 PRK10966 exonuclease subunit S 23.4 57 0.0015 14.1 4.0 11 161-171 261-271 (402) 153 cd05537 POLBc_Pol_II DNA polym 23.2 58 0.0015 14.0 3.4 27 106-134 125-151 (371) 154 pfam04330 consensus 23.2 58 0.0015 14.0 3.9 18 68-85 37-54 (70) 155 COG3327 PaaX Phenylacetic acid 22.6 59 0.0015 14.0 2.2 78 88-168 23-105 (291) 156 PRK07898 consensus 22.4 30 0.00078 15.7 0.5 22 8-32 266-287 (902) 157 TIGR02617 tnaA_trp_ase tryptop 22.3 60 0.0015 13.9 1.9 35 93-127 87-122 (468) 158 PRK02983 lysS lysyl-tRNA synth 22.0 61 0.0016 13.9 8.8 107 15-130 376-486 (1099) 159 PRK11197 lldD L-lactate dehydr 21.9 61 0.0016 13.9 3.5 34 95-135 127-160 (381) 160 cd05531 POLBc_B2 DNA polymeras 21.8 61 0.0016 13.9 3.3 27 106-134 129-155 (352) 161 TIGR02898 spore_YhcN_YlaJ spor 21.6 42 0.0011 14.9 1.1 39 103-141 118-158 (185) 162 TIGR02708 L_lactate_ox L-lacta 21.3 63 0.0016 13.8 2.5 30 113-142 149-184 (368) 163 cd02922 FCB2_FMN Flavocytochro 21.2 63 0.0016 13.8 3.6 37 109-145 130-172 (344) 164 TIGR02759 TraD_Ftype type IV c 20.9 64 0.0016 13.8 3.5 49 105-155 217-266 (613) 165 COG3640 CooC CO dehydrogenase 20.8 64 0.0016 13.8 3.1 33 96-132 161-193 (255) 166 PRK05929 consensus 20.7 30 0.00076 15.8 0.1 24 9-33 250-273 (870) 167 PRK05948 precorrin-2 methyltra 20.5 66 0.0017 13.7 2.5 21 122-142 91-112 (238) 168 PRK08786 consensus 20.4 37 0.00093 15.2 0.5 20 9-31 248-267 (927) 169 TIGR00619 sbcd nuclease SbcCD, 20.3 66 0.0017 13.7 3.5 68 104-171 64-133 (275) 170 TIGR02467 CbiE precorrin-6y C5 20.2 67 0.0017 13.7 3.0 13 121-133 141-153 (210) 171 COG3033 TnaA Tryptophanase [Am 20.1 67 0.0017 13.7 2.0 29 94-122 93-121 (471) No 1 >COG3153 Predicted acetyltransferase [General function prediction only] Probab=100.00 E-value=1.8e-36 Score=239.96 Aligned_cols=166 Identities=21% Similarity=0.303 Sum_probs=153.2 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEEC--CCC Q ss_conf 020751379757899999999972787666756152025412000257665489999898786479999998860--678 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIG--KYK 89 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~--~~~ 89 (185) ..|+||.++++|..+|.+|.++||++..+ ..+.+.+++...+...+++||++||++||||+||++++. ... T Consensus 2 ~~~~ir~e~~~d~~~i~~~~~~aF~~~~e-------~~~v~~lR~~~~~~~~LslVA~d~g~vvG~Il~s~v~~~g~~~~ 74 (171) T COG3153 2 MMMLIRTETPADIPAIEALTREAFGPGRE-------AKLVDKLREGGRPDLTLSLVAEDDGEVVGHILFSPVTVGGEELG 74 (171) T ss_pred CCCEEEECCHHHHHHHHHHHHHHHHCCHH-------HHHHHHHHHCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCCC T ss_conf 84278746834389999999987404368-------99999987467765533699820998999999868985686414 Q ss_pred EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECCCC Q ss_conf 48998677525461599899999999999998899799995083254313858705278267798888232265440579 Q gi|254780989|r 90 GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFKNN 169 (185) Q Consensus 90 ~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~~~ 169 (185) +..|+||+|+|+|||||||++||+.+++.++.+|+..|++.|+|.||.||||+++..+++..|.+ .+++.||+++|.++ T Consensus 75 ~~~LaPLaV~p~~qg~GIG~~Lvr~~le~a~~~G~~~v~vlGdp~YY~rfGF~~~~~~~l~~p~~-~~~~~fl~~~L~~~ 153 (171) T COG3153 75 WLGLAPLAVDPEYQGQGIGSALVREGLEALRLAGASAVVVLGDPTYYSRFGFEPAAGAKLYAPGP-VPDERFLALELGDG 153 (171) T ss_pred EEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCC-CCCCEEEEEECCCC T ss_conf 69997678762215786789999999999997799789990684424454748865553116899-89740899984587 Q ss_pred CCCCCEEEEEECCCCC Q ss_conf 7657706998066789 Q gi|254780989|r 170 IMENMIGKMSLRREYD 185 (185) Q Consensus 170 ~~~~~~g~~~~~~~f~ 185 (185) .+++.+|++.|+++|+ T Consensus 154 ~l~~~~G~v~~~~~f~ 169 (171) T COG3153 154 ALEGPSGTVHYHEAFD 169 (171) T ss_pred CCCCCCCEEEECCCCC T ss_conf 5567763388553112 No 2 >PRK03624 putative acetyltransferase; Provisional Probab=99.87 E-value=4.5e-21 Score=147.23 Aligned_cols=124 Identities=18% Similarity=0.214 Sum_probs=96.4 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEE Q ss_conf 02075137975789999999997278766675615202541200025766548999989878647999999886067848 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGF 91 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~ 91 (185) |.|+||+++++|.++|.+|+.+++.+.+.... ...........+..+|||+.+|+|||++.... ..... T Consensus 1 ~~m~IR~~~~~D~~av~~L~~~~~~~~pw~~~-------~~~~~~~l~~~~~~~lVae~~g~ivG~~~~~~----~~~~~ 69 (140) T PRK03624 1 DAMEIRVFRQADFEAVIALWERCDLTRPWNDP-------EMDIERKLAHDPSLFLVAEVGGRVVGTVMGGY----DGHRG 69 (140) T ss_pred CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCH-------HHHHHHHHHCCCCEEEEEEECCEEEEEEEEEE----ECCEE T ss_conf 96199859999999999999964999998863-------99999998479978999999999999999998----17789 Q ss_pred EEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCCEEHHH Q ss_conf 998677525461599899999999999998899799995083------2543138587052 Q gi|254780989|r 92 LRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGFKKPHQ 146 (185) Q Consensus 92 ~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~~~~~ 146 (185) .|..|+|+|+|||+|||++||+++++.++++|+..+.|...+ .||+++||+.... T Consensus 70 ~i~~laV~p~~Rg~GiG~~Ll~~~~~~~~~~G~~~i~L~v~~~N~~A~~fY~~~GF~~~~r 130 (140) T PRK03624 70 WLYYLAVHPDFRGRGIGRALLAHAEKKLIARGCPKINLQVREDNDAVLGFYEALGYEEEDR 130 (140) T ss_pred EEEEEEECHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEECCE T ss_conf 9999888876765389999999999999976999999999088599999999889938328 No 3 >PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional Probab=99.84 E-value=1.9e-19 Score=137.43 Aligned_cols=137 Identities=14% Similarity=0.186 Sum_probs=105.8 Q ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEE Q ss_conf 75137975789999999997278766675615202541200025766548999989878647999999886067848998 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRG 94 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~ 94 (185) |||+++++|.++|.+|.+++|...+. . +.+...+.. ..+.+|++.++++|||+.++.. .+...|. T Consensus 1 TIr~mt~~Dl~~v~~IE~~~f~~PWs--~----~~f~~~~~~-----~~~~~~~~~~~~ivGy~~~~~~----~de~~i~ 65 (144) T PRK09491 1 TISSLTTTDLPAAYHIEQRAHAFPWS--E----KTFASNQGE-----RYLNLKLTQNGQMAAFAITQVV----LDEATLF 65 (144) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCC--H----HHHHHHHHC-----CCEEEEEEECCEEEEEEEEEEE----CCCEEEE T ss_conf 93229899999999999986489999--9----999977556-----9179999989989999999981----7918999 Q ss_pred EEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCCEEHHHCCCCCCCC-CCCHHHEEEEEC Q ss_conf 677525461599899999999999998899799995083------254313858705278267798-888232265440 Q gi|254780989|r 95 PLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGFKKPHQHQIRYAHG-GGAATDWLVHIF 166 (185) Q Consensus 95 ~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~~~~~~~i~~~~~-~~~~~~~~~~~l 166 (185) .|+|+|+|||+|+|++||++.++.++.+|...++|+... .+|+|+||+......=.++.+ +..|+..|.+.| T Consensus 66 ~iaV~p~~r~~G~g~~Ll~~~~~~~~~~g~~~i~LEVr~sN~~A~~lY~k~GF~~~g~R~~YY~~~dg~EDAiiM~l~L 144 (144) T PRK09491 66 NIAVDPDYQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFNEVTIRRNYYPTADGREDAIIMALPL 144 (144) T ss_pred EEEECHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEEECEECCCCCCCCCCCCCEEEECCC T ss_conf 9999688864897999999999999987997999999578789999999889989178878568978990216886279 No 4 >PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional Probab=99.82 E-value=3.2e-19 Score=136.05 Aligned_cols=130 Identities=15% Similarity=0.105 Sum_probs=100.0 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEE-E-CCC Q ss_conf 00207513797578999999999727876667561520254120002576654899998987864799999988-6-067 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQ-I-GKY 88 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~-~-~~~ 88 (185) |..++||+++++|.++|.+|+.+....... . .............+....+||+.||+|||++.++-.. . ... T Consensus 1 Mp~v~IR~At~~D~~ai~~Ll~eL~~~~~d--~----~~~~~~~~~~~~~p~~~~~Vae~dG~vvG~~~l~~~~~l~~~~ 74 (144) T PRK10146 1 MPACELRPATQYDTDAVYALICELKQAEFD--H----QAFRVGFNANLRDPNMRYHLALLDGEVVGMIGLHLQFHLHHVN 74 (144) T ss_pred CCCCEEEECCHHHHHHHHHHHHHHHCCCCC--H----HHHHHHHHHHHCCCCCEEEEEEECCEEEEEEEEEEECCCCCCC T ss_conf 997440519888899999999986437678--7----9999999998549984899999899799999999963524688 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCCEEHHH Q ss_conf 848998677525461599899999999999998899799995083------2543138587052 Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGFKKPHQ 146 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~~~~~ 146 (185) ....|..++|+|++||+|||++|+++++++|+++|+..+.|..+. .||+|.||.+.+- T Consensus 75 ~~~~I~~l~V~~~~Rg~GiG~~Ll~~a~~~Ar~~G~~~~~L~s~~~r~~Ah~FY~r~Gf~~~~~ 138 (144) T PRK10146 75 WIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAGAEMTELSTNVKRHDAHRFYLREGYEQSHF 138 (144) T ss_pred CEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCEEECC T ss_conf 3699999999856417699999999999999986991899984996789999999869963354 No 5 >PTZ00330 acetyltransferase; Provisional Probab=99.82 E-value=1.3e-18 Score=132.46 Aligned_cols=131 Identities=22% Similarity=0.223 Sum_probs=98.5 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEE--E-EECCC Q ss_conf 02075137975789999999997278766675615202541200025766548999989878647999999--8-86067 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATP--I-QIGKY 88 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~--~-~~~~~ 88 (185) +.|+||.++++|.+++.+|.++...... .......... ............+|++.+++|||++.+.- - ..... T Consensus 5 ~tl~IR~a~~~Dl~~il~Ll~~L~~~~~-~~~e~~~~~~---~~~~~~~~~~~v~Vae~~g~iVg~~~l~i~~~~~~~g~ 80 (147) T PTZ00330 5 GSLELRDLEEGDLGSVLELLSHLTSAPA-LSQEELEQIA---ARRRLAGVVTRVFVHSPTQRIVGTASLFVEPKFTRGGK 80 (147) T ss_pred CCEEEEECCHHHHHHHHHHHHHHCCCCC-CCHHHHHHHH---HHHHCCCCCEEEEEEECCCEEEEEEEEEEEEEEEECCC T ss_conf 8789885988899999999998543899-7889999999---98734799439999952997999999999921331795 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHHH Q ss_conf 8489986775254615998999999999999988997999950832---543138587052 Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPHQ 146 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~~ 146 (185) ...+|..|+|+|+|||+|+|++||+++++.|+++|+..++|..+++ ||+|+||++... T Consensus 81 ~~~~iEdvvV~~~~RG~GiG~~L~~~l~~~a~~~gc~kv~L~~~~~~~~FY~k~Gf~~~~~ 141 (147) T PTZ00330 81 CVGHIEDVVVDPSYRGQGLGRALISDLCEIARSSGCYKVILDCTEDMVAFYKKLGFRACER 141 (147) T ss_pred CEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCEECCC T ss_conf 3699988999877658888999999999999987997999947951679999869997420 No 6 >PRK07757 acetyltransferase; Provisional Probab=99.81 E-value=8e-19 Score=133.67 Aligned_cols=140 Identities=15% Similarity=0.184 Sum_probs=102.9 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEE Q ss_conf 07513797578999999999727876667561520254120002576654899998987864799999988606784899 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLR 93 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l 93 (185) |+||+|+..|.++|.+|.+....... +.......+.+... + +||++.||+|||++.+.+. ..+...| T Consensus 2 ~~IR~A~~~Dv~~I~~Li~~~~~~g~------Ll~Rs~~~i~~~i~---~-f~Vae~dg~IvGc~aL~~~---~~~~aEi 68 (152) T PRK07757 2 MMIRKARLSDVKAIHALINEYAKEGL------MLPRSLDELYENIR---D-FYVAEEDGEIVGCCALHIL---WEDLAEI 68 (152) T ss_pred CEEEECCHHHHHHHHHHHHHHHHCCC------CCCCCHHHHHHHHC---C-EEEEEECCEEEEEEEEEEC---CCCCEEE T ss_conf 18985978889999999988876894------52689999997647---4-9999999999999998977---9991106 Q ss_pred EEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHCCCEEHHHCCCC---------CCCCCCCHHHEEE Q ss_conf 86775254615998999999999999988997999950-8325431385870527826---------7798888232265 Q gi|254780989|r 94 GPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA-KHDWHKHLGFKKPHQHQIR---------YAHGGGAATDWLV 163 (185) Q Consensus 94 ~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~-~~~~Y~r~GF~~~~~~~i~---------~~~~~~~~~~~~~ 163 (185) ..++|+|+|||+|+|++|+++.++.|+++|+..+++.+ .++||.++||...+...++ +|.=...|+.=|+ T Consensus 69 ~~l~V~p~~rg~GiG~~Ll~~l~~~Ar~~G~~~lf~LTt~~~fF~~~GF~~~~~~~lP~kvw~dc~~c~~~~~Cde~am~ 148 (152) T PRK07757 69 RSLAVSEEYRGKGIGRMLVEACLEEARELGVKRVFALTYQPEFFEKLGFREVDKEALPQKIWADCIHCPKFPNCDEIAMI 148 (152) T ss_pred EEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHCCCEECCHHHCCHHHHHHHCCCCCCCCCCHHHHH T ss_conf 77898376628888999999999999986999999905866789878998888355998999986079999998737843 Q ss_pred EEC Q ss_conf 440 Q gi|254780989|r 164 HIF 166 (185) Q Consensus 164 ~~l 166 (185) ++| T Consensus 149 ~~l 151 (152) T PRK07757 149 KEL 151 (152) T ss_pred HCC T ss_conf 103 No 7 >PRK10140 putative acetyltransferase YhhY; Provisional Probab=99.78 E-value=3.5e-18 Score=129.84 Aligned_cols=150 Identities=13% Similarity=0.164 Sum_probs=103.4 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHH--CCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCC Q ss_conf 002075137975789999999997--278766675615202541200025766548999989878647999999886067 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKA--FLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKY 88 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~a--F~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~ 88 (185) |+.++||+++++|+++|.+|.... ....-+........+ ...+.+ ......|||+.||++||++.+........ T Consensus 1 Ms~i~IR~~~~~D~~ai~~i~~~~~v~~~t~~~p~p~~~~~-~~~l~~---~~~~~~lVa~~dg~vvG~~~~~~~~~~r~ 76 (162) T PRK10140 1 MSEIVIRHAETRDYEAIRQIHAQPEVYHNTLQVPHPSDHMW-QERLTD---RPGIKQLVACIDGIVVGHLTIDVQQRPRR 76 (162) T ss_pred CCCEEEECCCHHHHHHHHHHHHCHHHHCCCCCCCCCCHHHH-HHHHHC---CCCCCEEEEEECCEEEEEEEEECCCCCCE T ss_conf 99569853997899999999849354416067999699999-999852---88984799999999999999872678722 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHH-CCCCEEEEECCHH------HHHHCCCEEHHHC-CCCCCCCCCCHHH Q ss_conf 848998677525461599899999999999998-8997999950832------5431385870527-8267798888232 Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKN-SGAEFVLIDAKHD------WHKHLGFKKPHQH-QIRYAHGGGAATD 160 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~-~G~~~v~l~~~~~------~Y~r~GF~~~~~~-~i~~~~~~~~~~~ 160 (185) .+..--.++|+|+|||+|||++||+.+++.+.+ .+...+.+.+.++ .|+||||+..... +.....|.+.|.+ T Consensus 77 ~h~a~~gi~V~~~~~g~GiG~~Ll~~~i~~a~~~~~l~ri~l~V~~~N~~Ai~Ly~k~GF~~eG~~~~~~~~~G~y~D~~ 156 (162) T PRK10140 77 SHVADFGICVDSRWKNRGVASALMREMIDMCDNWLRVDRIELTVFVDNAPAIKVYKKYGFEIEGTGKKYALRNGEYVDAY 156 (162) T ss_pred EEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEECCCHHHHHHHHHCCCEEEEEEHHEEEECCEEEEEE T ss_conf 26999999999888698999999999999997316869999999869999999999889989988701188989998989 Q ss_pred EEEE Q ss_conf 2654 Q gi|254780989|r 161 WLVH 164 (185) Q Consensus 161 ~~~~ 164 (185) +|.+ T Consensus 157 ~MaR 160 (162) T PRK10140 157 YMAR 160 (162) T ss_pred EEEE T ss_conf 9998 No 8 >PRK07922 N-acetylglutamate synthase; Validated Probab=99.76 E-value=1.8e-17 Score=125.46 Aligned_cols=149 Identities=14% Similarity=0.131 Sum_probs=105.4 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEEEEEECCCCE Q ss_conf 02075137975789999999997278766675615202541200025766548999989-87864799999988606784 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRATPIQIGKYKG 90 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~~~~~~~~~~ 90 (185) ++++||+|+.+|.++|.+|.+.-= ....+.......+.+... . ++|++. ||+||||+.+.+.. .+. T Consensus 5 ~~v~IR~A~~~DV~~I~~LI~p~a------~~G~LL~Rs~~~L~~~I~---~-F~Vae~~DG~IvGCaAL~~~~---~dl 71 (170) T PRK07922 5 PGIVVRRARTSDVPAIKRLVDTYA------DGRILLEKNLVTLYEAVQ---E-FWVAEHLDGEVVGCGALHVLW---EDL 71 (170) T ss_pred CCEEEECCCHHHHHHHHHHHHHHH------HCCEEECCCHHHHHHHHC---E-EEEEEECCCCEEEEEEEEECC---CCH T ss_conf 862887496777999999999998------689520489999998755---2-899994599689999746456---532 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHCCCEEHHHCCCCC--------CCCCCCHHHE Q ss_conf 89986775254615998999999999999988997999950-83254313858705278267--------7988882322 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA-KHDWHKHLGFKKPHQHQIRY--------AHGGGAATDW 161 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~-~~~~Y~r~GF~~~~~~~i~~--------~~~~~~~~~~ 161 (185) ..+..|+|+|+|||+|+|++|+++.+++|+++|...+++.+ .++|+.++||.......++. .+|..-++.+ T Consensus 72 AEIrsLAV~p~~rg~G~G~~Lv~~l~~~Ar~lGi~~vFvLT~~~~fF~k~GF~ev~k~~lp~kvw~ec~r~~~~~~~~~~ 151 (170) T PRK07922 72 AEVRTVAVDPAMRGHGVGHAIVERLLDVARELGLSRVFVLTFEVEFFARHGFVEIDGTPVTAEVYEEMLRSYDEGVAEFL 151 (170) T ss_pred HHHEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHCCCEECCCCCCCHHHHHHHHHCCCHHHHHHH T ss_conf 13044588787818984999999999999985998699997836899976998766884998999998768652268772 Q ss_pred EEEECCCCCCCC Q ss_conf 654405797657 Q gi|254780989|r 162 LVHIFKNNIMEN 173 (185) Q Consensus 162 ~~~~l~~~~~~~ 173 (185) =.--.|+|.|.+ T Consensus 152 ~~~~~~~~~~~~ 163 (170) T PRK07922 152 DLSRVKPNTLGN 163 (170) T ss_pred CCCCCCCCCCCC T ss_conf 532247776543 No 9 >TIGR01575 rimI ribosomal-protein-alanine acetyltransferase; InterPro: IPR006464 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones. Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. Members of this entry belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This entry covers prokaryotes and the archaea. It contains rimI, which catalyses the acetylation of the N-terminal alanine of ribosomal protein S18 .; GO: 0016407 acetyltransferase activity, 0006474 N-terminal protein amino acid acetylation. Probab=99.73 E-value=5.6e-17 Score=122.56 Aligned_cols=133 Identities=20% Similarity=0.187 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHH Q ss_conf 78999999999727876667561520254120002576654899998987864799999988606784899867752546 Q gi|254780989|r 23 DQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEY 102 (185) Q Consensus 23 D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~ 102 (185) |.++|.+|.++||..+.+.... .+..++.- ......+++..++++|||+..+++.--..+...|..|+|+|++ T Consensus 1 d~~~v~~ie~~af~~~~PW~~~----~f~~~l~i---~~~~~yl~a~~g~~~~Gy~~~~r~~~~~~~Ea~i~~iaV~pe~ 73 (140) T TIGR01575 1 DLKAVLEIEKAAFTVDSPWTEA----AFAEELAI---NYYTCYLLAEIGGKVVGYIGISRIVQIVLDEAHILNIAVKPEY 73 (140) T ss_pred CHHHHHHHHHHHHCCCCCCCHH----HHHHHHCC---CCCCEEEEEEECCEEEEEEEEEEEEEEECCCCEEEEEEECCCC T ss_conf 9479999989871699998878----98865101---3330144566287789877656788754377448999887120 Q ss_pred HCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH------HHHHCCCEEHHHCCCCCCCCC-CCHHHEE Q ss_conf 15998999999999999988997999950832------543138587052782677988-8823226 Q gi|254780989|r 103 RKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD------WHKHLGFKKPHQHQIRYAHGG-GAATDWL 162 (185) Q Consensus 103 qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~------~Y~r~GF~~~~~~~i~~~~~~-~~~~~~~ 162 (185) ||+|||++|+++.++.++++|+..++|+...+ +|+|+||.......=.++-+. -+|+..| T Consensus 74 Qg~GiG~~LL~~L~~~~~~~~~~~~~LEVR~SN~~A~~LY~k~GF~~~~~r~~YY~~~~GrEDA~~M 140 (140) T TIGR01575 74 QGQGIGRALLRELIKEAKGRGVNEIFLEVRVSNVAAQALYKKLGFNEIAIRKNYYPDPEGREDAIVM 140 (140) T ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCCCC T ss_conf 1663788999999999851898389984614068999987771850101043568887962223039 No 10 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=99.71 E-value=3.2e-16 Score=117.97 Aligned_cols=134 Identities=19% Similarity=0.128 Sum_probs=105.1 Q ss_pred CCCEEECCCCH-HHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE--CCEEEEEEEEEEEEECC Q ss_conf 00207513797-5789999999997278766675615202541200025766548999989--87864799999988606 Q gi|254780989|r 11 LEGLTFANEQR-VDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD--GDRIVGAIRATPIQIGK 87 (185) Q Consensus 11 ~~~l~IR~a~~-~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~--d~~iVG~i~~~~~~~~~ 87 (185) -+|++||...| .|..++.++..+||...++.......++....-..+ +.+.-+|++++ +++++||. .+++.-+. T Consensus 147 p~Gv~lr~f~~~~De~~wl~vnn~AFa~HPEqg~wt~~dl~~r~~e~W--FdP~g~lla~~~~~g~lvGF~-wtk~h~~~ 223 (292) T TIGR03448 147 PDGVTVRAYVGAPDDAEWLRVNNAAFAWHPEQGGWTRADLAERRAEPW--FDPAGLFLAFDDAPGELLGFH-WTKVHPDE 223 (292) T ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCC--CCCCCCEEEEECCCCEEEEEE-EEEECCCC T ss_conf 998688436787767999999999974598678875678998750777--484773899847998399999-99863788 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH------HHHHCCCEEHHHC Q ss_conf 78489986775254615998999999999999988997999950832------5431385870527 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD------WHKHLGFKKPHQH 147 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~------~Y~r~GF~~~~~~ 147 (185) ...-.+..++|+|++||+|+|+.|+..++.+++.+|...++|+.+.+ .|+++||.....+ T Consensus 224 ~~~GEvYvvGV~p~~rGrGLG~aL~~~gL~~l~~~G~~~v~LyVd~dN~~A~~lY~~LGF~~~~~d 289 (292) T TIGR03448 224 PALGEVYVVGVDPAAQGRGLGDALTLIGLHHLAARGLPAVMLYVEADNEAAVRTYEKLGFTVAEVD 289 (292) T ss_pred CCCCEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEEEEEE T ss_conf 885259999988342477678999999999999879988999837888899999985699584003 No 11 >PRK10975 TDP-fucosamine acetyltransferase; Provisional Probab=99.69 E-value=7.4e-16 Score=115.81 Aligned_cols=129 Identities=16% Similarity=0.179 Sum_probs=95.0 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCC--------EEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEEE Q ss_conf 0207513797578999999999727876667561--------5202541200025766548999989-878647999999 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDV--------LHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRATP 82 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~~ 82 (185) +.+.++.++++|.+++.++...+|..++.-.+.+ +..++++... ......|+|+++ +|+++||+.+-. T Consensus 81 ~~~~~~~A~~~Dip~L~~lA~~aF~~SRFr~P~~~~~~~~rfY~~Wie~aV~---G~fD~~cLv~~~~~g~i~GFvtlr~ 157 (231) T PRK10975 81 SDIGAEVATETDIPALRQLAAQAFAQSRFRAPWYAPDASGRFYAQWIENAVR---GTFDHQCLILRDATGQIRGFVSLRE 157 (231) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHC---CCCCCEEEEEECCCCCEEEEEEEEE T ss_conf 6676311777579999999986510212579877978987899999999846---6658568999868997789999985 Q ss_pred EEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCCEEHHH Q ss_conf 886067848998677525461599899999999999998899799995083------2543138587052 Q gi|254780989|r 83 IQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGFKKPHQ 146 (185) Q Consensus 83 ~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~~~~~ 146 (185) . ......++-++|+|++||||||++||+.++++++++|...+.|.+-. .+|+|+||+-.++ T Consensus 158 --l-~~~~a~IgLlAV~P~~rG~GiG~~Lm~aa~~wa~~~g~~~l~l~tq~~N~~AirLY~~~GF~ie~t 224 (231) T PRK10975 158 --L-NDTDARIGLLAVFPGAQGRGIGAELMQAALNWAQARGLTTLRVATQMGNTAALKLYIQSGANIEST 224 (231) T ss_pred --C-CCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEEEEE T ss_conf --2-798637889889830127988999999999999985997899998578999999999869827567 No 12 >PRK10562 hypothetical protein; Provisional Probab=99.65 E-value=1.8e-15 Score=113.46 Aligned_cols=132 Identities=14% Similarity=0.169 Sum_probs=85.1 Q ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEE Q ss_conf 51379757899999999972787666756152025412000257665489999898786479999998860678489986 Q gi|254780989|r 16 FANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGP 95 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~ 95 (185) ||+++|+|.++|.+|+.++......+............+++. ..+..-.||++++|+|+||+.+.. .-.|.. T Consensus 2 IR~~~~~D~~~l~~IW~~s~~~ah~Fl~~~~~~~~~~~vr~~-~lp~~~~~V~e~~g~ivGFi~~~~-------~~~i~~ 73 (145) T PRK10562 2 IREYQPSDLPAILQLWLESTIWAHPFIKEQYWRESAPLVRDV-YLPAAQTWVWEEDGKLLGFVSILE-------GRFVGA 73 (145) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH-HCCCCEEEEEEECCEEEEEEEECC-------CCEEEE T ss_conf 763876679999999999998667455989999869999986-466563999996996899999569-------997611 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC---CH---HHHHHCCCEEHHHCCCCCCCCCCCHHHEEE Q ss_conf 775254615998999999999999988997999950---83---254313858705278267798888232265 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA---KH---DWHKHLGFKKPHQHQIRYAHGGGAATDWLV 163 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~---~~---~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~ 163 (185) |+|+|++||+|+|++||+++.+.. ..+.|.+ |+ .||+|.||+..... ..- +...++..|. T Consensus 74 LfV~p~~~g~GiG~~Ll~~a~~~~-----~~l~L~v~~~N~~A~~FY~~~GF~~~~~~-~~~--~tg~p~~~m~ 139 (145) T PRK10562 74 LFVAPKAHRRGIGKALMQYVQQRY-----PHLSLEVYQKNQRAVNFYHAQGFRIVDCA-WQE--ETQHPTWIMS 139 (145) T ss_pred EEECHHHCCCCHHHHHHHHHHHHC-----CCEEEEECCCCHHHHHHHHHCCCEEEEEE-CCC--CCCCCEEEEE T ss_conf 688764337889999999999638-----93499982579789999998899996233-688--8899159999 No 13 >PRK01346 hypothetical protein; Provisional Probab=99.65 E-value=2.2e-15 Score=112.91 Aligned_cols=129 Identities=16% Similarity=0.129 Sum_probs=100.6 Q ss_pred CCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEEC--C Q ss_conf 00020751379757899999999972787666756152025412000257665489999898786479999998860--6 Q gi|254780989|r 10 ELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIG--K 87 (185) Q Consensus 10 ~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~--~ 87 (185) -|+.|+||+.+++|.+++.++...+|...... .... ..+.. ....-.+.+.+++++||.+...+.++. + T Consensus 3 ~~~~~~iR~l~~~d~~~~~~l~~~aF~~~~~~--e~~~-----~~~~~--~~~~~~~ga~d~~~lVg~~~~~~~~l~vpg 73 (411) T PRK01346 3 RMDTITIRVLTEDDWPAWFRAAARGFGDSPPD--EELE-----AWRTL--VEPDRTLGAFDGDDVVGTALAFDLELTVPG 73 (411) T ss_pred CCCCCEEECCCHHHHHHHHHHHHHHCCCCCCH--HHHH-----HHHHC--CCCCCEEEEEECCEEEEEEEEECEEEEEEC T ss_conf 75651654099899999999999873999986--8899-----88732--686746999999999999996144877308 Q ss_pred C---CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHCCCEEHHHC Q ss_conf 7---8489986775254615998999999999999988997999950-8325431385870527 Q gi|254780989|r 88 Y---KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA-KHDWHKHLGFKKPHQH 147 (185) Q Consensus 88 ~---~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~-~~~~Y~r~GF~~~~~~ 147 (185) . +...|+.|+|.|+|||+|+.++||++.++.++++|.....|.- ...||+||||-.+... T Consensus 74 G~~vp~agIt~Vav~PehRrrG~~~~Lm~~~L~~~r~~G~~ls~L~pse~~iY~RfGyg~at~~ 137 (411) T PRK01346 74 GAVLPAAAVTAVTVAPTHRRRGLLTALMREQLRRIAERGEPVAALTASEGGIYGRFGYGVATYS 137 (411) T ss_pred CCEEECCCEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCEEEEEE T ss_conf 8388716178999883106785899999999999997798399995067660146542130677 No 14 >PRK10514 hypothetical protein; Provisional Probab=99.63 E-value=2e-15 Score=113.27 Aligned_cols=124 Identities=14% Similarity=0.140 Sum_probs=81.2 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEE-ECCEEEEEEEEEEEEECCCCEEE Q ss_conf 07513797578999999999727876667561520254120002576654899998-98786479999998860678489 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCR-DGDRIVGAIRATPIQIGKYKGFL 92 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae-~d~~iVG~i~~~~~~~~~~~~~~ 92 (185) |+||+++++|.++|.+|+.++-...-.+.............++... ..-.||+. ++|+++||+.+. + -. T Consensus 2 ~~IR~~~~~D~~~l~~IW~~s~~~aH~Fl~~~~~~~~~~~v~~ylp--~~~~~v~~~~~~~~~GFi~l~-----~---~~ 71 (146) T PRK10514 2 ISIRRSRHEEGERLVAIWRRSVDATHDFLSAEDRAEIEELVRSFLP--EAPLWVAVDERDQPVGFMLLS-----G---GH 71 (146) T ss_pred CEEECCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH--HCCEEEEEECCCCEEEEEEEC-----C---CE T ss_conf 2524498566999999999999988614999999999999998752--485999996699569999978-----8---98 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE-CCH---HHHHHCCCEEHHHCCCC Q ss_conf 98677525461599899999999999998899799995-083---25431385870527826 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID-AKH---DWHKHLGFKKPHQHQIR 150 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~-~~~---~~Y~r~GF~~~~~~~i~ 150 (185) |..|+|+|++||+|||++|++++++... .-.+.|. .|+ .||+|+||+..+....- T Consensus 72 i~~LFV~p~~~g~GiG~~Ll~~a~~~~~---~l~L~V~~~N~~A~~FY~~~GF~~~gr~~~d 130 (146) T PRK10514 72 MEALFVDPDVRGCGVGRMLVEHALTLHP---ELTTDVNEQNEQAVGFYKKMGFKVTGRSEVD 130 (146) T ss_pred EEEEEECHHHCCCCHHHHHHHHHHHHCC---CCEEEEECCCHHHHHHHHHCCCEEECCCCCC T ss_conf 8789987877399979999999987385---4098500158689999997799897135679 No 15 >COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism] Probab=99.62 E-value=5.9e-15 Score=110.39 Aligned_cols=124 Identities=17% Similarity=0.236 Sum_probs=96.0 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEE Q ss_conf 07513797578999999999727876667561520254120002576654899998987864799999988606784899 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLR 93 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l 93 (185) +.||.|+.+|..+|.+|.+...... .+.......+.+... . ++|++.+|.++|++.+.| ....+...+ T Consensus 1 ~~iR~A~~~Di~~I~~Li~~~~~~g------il~~rs~~~le~~i~---d-F~i~E~~g~viGCaaL~~--~~~~~~gE~ 68 (153) T COG1246 1 EQIRKARISDIPAILELIRPLELQG------ILLRRSREQLEEEID---D-FTIIERDGKVIGCAALHP--VLEEDLGEL 68 (153) T ss_pred CCEEECCCCCHHHHHHHHHHHHHCC------CCCHHHHHHHHHHHH---H-HEEEEECCCEEEEEECCC--CCCCCEEEE T ss_conf 9100032024288999999974146------420315989999876---5-524322793899984035--675672567 Q ss_pred EEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC--CHHHHHHCCCEEHHHCCC Q ss_conf 86775254615998999999999999988997999950--832543138587052782 Q gi|254780989|r 94 GPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA--KHDWHKHLGFKKPHQHQI 149 (185) Q Consensus 94 ~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~--~~~~Y~r~GF~~~~~~~i 149 (185) ..|+|+|++|++|+|.+|+.+.+..|++.|.+.+++.+ .|+|+.++||+.+..-.. T Consensus 69 ~~laV~pd~r~~G~G~~Ll~~~~~~Ar~~gi~~~f~LTt~~~~~F~~~GF~~vd~~~L 126 (153) T COG1246 69 RSLAVHPDYRGSGRGERLLERLLADARELGIKELFVLTTRSPEFFAERGFTRVDKDEL 126 (153) T ss_pred EEEEECHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCEECCCCCC T ss_conf 7778778763887089999999999887497013433146688999809866763538 No 16 >COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.61 E-value=1.9e-14 Score=107.37 Aligned_cols=158 Identities=13% Similarity=0.132 Sum_probs=112.1 Q ss_pred CEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHC-HHCCCCCCCEEEEEEEEC-CEEEEEEEEEEEEE-CCCC Q ss_conf 20751379757899999999972787666756152025412-000257665489999898-78647999999886-0678 Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNN-VEEPLSFVRLMSFVCRDG-DRIVGAIRATPIQI-GKYK 89 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~Vae~d-~~iVG~i~~~~~~~-~~~~ 89 (185) .|+||.++.+|.++|.+|+..++.+................ .........-..+|++.+ |+++|++.+.+..- .... T Consensus 1 ~~~ir~~~~~Dl~~I~~IY~~~v~~~~a~~e~~~~~~~~~~~~~~~~~~~g~p~~V~~~~~g~v~G~a~~~~fr~r~ay~ 80 (169) T COG1247 1 EMEIRPATAADLEAILEIYNGAVENTAATFEEDPVSLEERAAWFSGRTRDGYPVVVAEEEDGKVLGYASAGPFRERPAYR 80 (169) T ss_pred CCEEECCHHHHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHCCCCCCEEEEEECCCCEEEEEEEEEECCCCCCCC T ss_conf 91783170776799999987765304479851699989999999833668854999974798099999851004755655 Q ss_pred EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCCEEHHHC-CCCCCCCCCCHHHEE Q ss_conf 48998677525461599899999999999998899799995083------25431385870527-826779888823226 Q gi|254780989|r 90 GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGFKKPHQH-QIRYAHGGGAATDWL 162 (185) Q Consensus 90 ~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~~~~~~-~i~~~~~~~~~~~~~ 162 (185) +..-.+++|+|+.||+|+|++|++.+++.++++|+...+-...+ .+.+++||+..... ++...++.+.|-+|| T Consensus 81 ~tve~SiYv~~~~~g~GiG~~Ll~~Li~~~~~~g~~~lva~I~~~n~aSi~lh~~~GF~~~G~~~~vg~k~g~wld~~~~ 160 (169) T COG1247 81 HTVELSIYLDPAARGKGLGKKLLQALITEARALGVRELVAGIESDNLASIALHEKLGFEEVGTFPEVGDKFGRWLDLVLM 160 (169) T ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEEECCCCCHHHHHHHHCCCEEECCCCCCCCCCCEEEEEEEE T ss_conf 38999999985102466789999999999885884899999737873769999977977736345345314668766644 Q ss_pred EEECCCCC Q ss_conf 54405797 Q gi|254780989|r 163 VHIFKNNI 170 (185) Q Consensus 163 ~~~l~~~~ 170 (185) =+.|+++. T Consensus 161 ~~~l~~~~ 168 (169) T COG1247 161 QLLLEEGR 168 (169) T ss_pred EHHHCCCC T ss_conf 02221368 No 17 >PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional Probab=99.60 E-value=2.5e-15 Score=112.58 Aligned_cols=125 Identities=14% Similarity=0.145 Sum_probs=93.8 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCE Q ss_conf 00207513797578999999999727876667561520254120002576654899998987864799999988606784 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKG 90 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~ 90 (185) .+||+||.|+-+|..+|..|... |...-+..+. .++.+.+... -++|+|.||+|+|++.+.-.+ ..- T Consensus 461 ~~~i~iR~Arl~Dv~~i~~lv~~-~A~~G~~LpR-----~~~~l~~~i~----~f~vaE~~g~v~g~~sl~i~~---~~L 527 (614) T PRK12308 461 TSGVKVRPARLTDIDAIEGMVAY-WAGLGENLPR-----TRNELVRDIG----SFAVAEHHGEVTGCASLYIYD---SGL 527 (614) T ss_pred CCCCEEECCCCCCHHHHHHHHHH-HHHHCCCCCC-----CHHHHHHHHH----HHEEEEECCEEEEEEEEEEEE---CCH T ss_conf 88736632567738999999999-8641245888-----7788999876----642665578188887898861---671 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHCCCEEHHHCC Q ss_conf 89986775254615998999999999999988997999950-83254313858705278 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA-KHDWHKHLGFKKPHQHQ 148 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~-~~~~Y~r~GF~~~~~~~ 148 (185) ..|.+|+|.|+||++|+|+.|+++.++.+++.+.+.|++.+ -|+|+.++||.....-. T Consensus 528 AEIrsl~v~~~~~~~G~G~~lV~~~l~~a~~~~~~rvfvLT~~p~fF~k~gf~~~~k~~ 586 (614) T PRK12308 528 AEIRSLGVEAGWQVQGQGKALVQYLVEKARQMAIKKVFVLTRVPEFFMKQGFSPTSKSL 586 (614) T ss_pred HHHHHHHCCHHHHHCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHCCCEECCHHH T ss_conf 99998616787774283289999999999983787589984371889975982078445 No 18 >PRK05279 N-acetylglutamate synthase; Validated Probab=99.59 E-value=2.6e-14 Score=106.48 Aligned_cols=122 Identities=18% Similarity=0.207 Sum_probs=94.0 Q ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEE Q ss_conf 75137975789999999997278766675615202541200025766548999989878647999999886067848998 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRG 94 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~ 94 (185) .||+|+..|...|.+|.+-.= ....+.+..+..+....+ .++|.|.||.||||+.+.|.. ......|+ T Consensus 296 ~iR~A~~~Dv~~Il~Li~Ple------~~G~Lv~Rsre~le~eI~----~f~V~e~Dg~iigCaALy~~~--~~~~aEla 363 (441) T PRK05279 296 QLRRATIDDVGGILELIRPLE------EQGILVRRSREQLEREID----QFTVIERDGVIIGCAALYPFP--EEKMGEMA 363 (441) T ss_pred HHCCCCHHHHHHHHHHHHHHH------HCCCCCCCCHHHHHHHHC----CEEEEEECCCEEEEEEEEECC--CCCEEEEE T ss_conf 835686767999999978898------679700189999997646----369999889599999600038--77806888 Q ss_pred EEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HHHHHCCCEEHHHCC Q ss_conf 677525461599899999999999998899799995083--254313858705278 Q gi|254780989|r 95 PLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH--DWHKHLGFKKPHQHQ 148 (185) Q Consensus 95 ~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~--~~Y~r~GF~~~~~~~ 148 (185) -++|+|+||++|+|++|++++.++|+++|...+++.+.. .|+...||+.+.... T Consensus 364 clav~~~y~~~g~G~~Ll~~~e~~A~~~g~~~lF~LTTqt~hwF~e~GF~~~~~~~ 419 (441) T PRK05279 364 CLAVHPDYRGSGRGERLLKRIEQRARQQGLSRLFVLTTRTAHWFLERGFVPADVDD 419 (441) T ss_pred EEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCCCCHHH T ss_conf 88987866189879999999999999869987999965638889975983089276 No 19 >PRK09831 hypothetical protein; Provisional Probab=99.57 E-value=1.7e-14 Score=107.67 Aligned_cols=138 Identities=17% Similarity=0.172 Sum_probs=90.9 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCC--CCCCCCCCEEHHHHHCHHCCCC-CCCEEEEEEEECCEEEEEEEEEEEEECCCCE Q ss_conf 075137975789999999997278--7666756152025412000257-6654899998987864799999988606784 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLS--TQEITDDVLHKYVRNNVEEPLS-FVRLMSFVCRDGDRIVGAIRATPIQIGKYKG 90 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~ 90 (185) |.||+++|+|..++.+|..+|-.. .....+....-+.......+.. ......|||+.++++|||+.+ . ++ T Consensus 1 m~IR~~~~~D~~~l~~lf~~aV~~~~~~~Ys~~Q~~aWa~~~~~~w~~~l~~~~~~VA~~~~~ivGF~~~-----~--~~ 73 (147) T PRK09831 1 IQIRNYQPGDFQQLCAIFIRAVTMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFITC-----I--EH 73 (147) T ss_pred CEEEECCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHCCEEEEEECCEEEEEEEC-----C--CC T ss_conf 9744277777999999999999863401299999998867998999998750956999999998898854-----8--88 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECC---HHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEE Q ss_conf 899867752546159989999999999999889979999508---325431385870527826779888823226544 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAK---HDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHI 165 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~---~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~ 165 (185) +|..++|+|++||+|||++|+++..+ |...+.+... -.||+|.||+......+.. -|-...++.|-|. T Consensus 74 -~id~lfV~p~~~g~Gvg~~Ll~~l~~-----~~~~l~~~aS~~A~pFFe~~Gf~vv~~q~v~~-~G~~l~n~~M~K~ 144 (147) T PRK09831 74 -YIDMLFVDPEYTRRGVASALLKPLIK-----SESELTVDASITAKPFFERYGFQTVKQQRVEC-RGAWFTNFYMRYK 144 (147) T ss_pred -EEEEEEECHHHCCCCHHHHHHHHHHH-----HCCCEEEEHHHHHHHHHHHCCCEEEEEEEEEE-CCEEECCCEEEEC T ss_conf -08999988665389899999999984-----17205863157638889976979987756856-9989707042747 No 20 >KOG3139 consensus Probab=99.55 E-value=2e-13 Score=101.18 Aligned_cols=155 Identities=20% Similarity=0.170 Sum_probs=98.5 Q ss_pred CCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCE-EEEEEEE Q ss_conf 787987210002075137975789999999997278766675615202541200025766548999989878-6479999 Q gi|254780989|r 2 PKQPNVNIELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDR-IVGAIRA 80 (185) Q Consensus 2 ~~~~~~~~~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~-iVG~i~~ 80 (185) |..++.+......+||...-.+...+..+.+-.+.+-.+........+.. ..-+.++|+|.+++. .||.+.. T Consensus 2 ~~~~~~~~~~~~~~i~~~~~~~~~~l~~im~Li~k~lsepyS~~tyrYf~-------~~wp~~~~~a~d~~~~~VGai~c 74 (165) T KOG3139 2 ELELESDEWVLAEVIRPSLYPAEEYLADIMRLIDKDLSEPYSIYTYRYFV-------PNWPCFCFLALDEKGDTVGAIVC 74 (165) T ss_pred CCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCHHHHHHHHHCC-------CCCCEEEEEEECCCCCEEEEEEE T ss_conf 75434664201002431001278877889988740148566777877502-------57866899999078866889998 Q ss_pred E-EEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH------HHHHCCCEEHHHCCCCCCC Q ss_conf 9-98860678489986775254615998999999999999988997999950832------5431385870527826779 Q gi|254780989|r 81 T-PIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD------WHKHLGFKKPHQHQIRYAH 153 (185) Q Consensus 81 ~-~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~------~Y~r~GF~~~~~~~i~~~~ 153 (185) . +...+ ....+|..|+|+++|||+|||++|++.+|+.++++|+..|+|.+.+. +|+++||++.... .++.. T Consensus 75 k~~~~r~-~~rgyi~mLaV~~e~Rg~GIg~aLvr~aId~m~~~g~~eVvLeTe~~n~~A~~LY~sLGF~r~~r~-~~YYl 152 (165) T KOG3139 75 KLDTHRN-TLRGYIAMLAVDSEYRGQGIGKALVRKAIDAMRSRGYSEVVLETEVTNLSALRLYESLGFKRDKRL-FRYYL 152 (165) T ss_pred ECCCCCC-CCEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCEEECCE-EEEEE T ss_conf 3445577-644899999961253345689999999999999779968998324334189999885494673312-47897 Q ss_pred CCCCHHHEEEEEC Q ss_conf 8888232265440 Q gi|254780989|r 154 GGGAATDWLVHIF 166 (185) Q Consensus 154 ~~~~~~~~~~~~l 166 (185) .+. ++.-|+..+ T Consensus 153 ng~-dA~rl~L~~ 164 (165) T KOG3139 153 NGM-DALRLKLFF 164 (165) T ss_pred CCC-CEEEEEEEC T ss_conf 785-148998524 No 21 >COG0456 RimI Acetyltransferases [General function prediction only] Probab=99.54 E-value=1e-13 Score=102.91 Aligned_cols=149 Identities=18% Similarity=0.269 Sum_probs=103.0 Q ss_pred CCCCCCCEEECCCCHHHHH--HHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEEC---C----EEEEE Q ss_conf 7210002075137975789--9999999972787666756152025412000257665489999898---7----86479 Q gi|254780989|r 7 VNIELEGLTFANEQRVDQV--AVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDG---D----RIVGA 77 (185) Q Consensus 7 ~~~~~~~l~IR~a~~~D~~--aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d---~----~iVG~ 77 (185) ..+....+.||..+..|.. ++..+...+|... .......+ .........+++|++.+ + +++|+ T Consensus 5 ~~~~~~~~~ir~~~~~d~~~~~~~~~~~~~~~~~----~~~~~~~~----~~~l~~~~~~~~v~~~~~~~~~~~~~~~G~ 76 (177) T COG0456 5 EELSEDKVTIREAINKDLLDVALAALEARTFDIR----LPWSREYF----EKDLTQAPELLLVAETGGLDGLLDGKVVGF 76 (177) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCC----CCCCHHHH----HHHHCCCCCEEEEEECCCCCCCCCCCEEEE T ss_conf 6667555653314076656789988877515767----64309999----998604873699996266666543104589 Q ss_pred EEEEEEEECC--CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCC-CEEEEECCH------HHHHHCCCEEHHHCC Q ss_conf 9999988606--7848998677525461599899999999999998899-799995083------254313858705278 Q gi|254780989|r 78 IRATPIQIGK--YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGA-EFVLIDAKH------DWHKHLGFKKPHQHQ 148 (185) Q Consensus 78 i~~~~~~~~~--~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~-~~v~l~~~~------~~Y~r~GF~~~~~~~ 148 (185) +..+...... .....+..++|+|+|||+|||++|++++++.++++|. ..+.|+..+ .+|+|+||+...... T Consensus 77 ~~~~~~~~~~~~~~~~~i~~iaV~p~~r~~Gig~~Ll~~~~~~~~~~~~~~~~~L~V~~~N~~Ai~lY~~~GF~~~~~~~ 156 (177) T COG0456 77 LLVRVVDGRPSADHEGHIYNLAVDPEYRGRGIGRALLDEALERLRERGLADKIVLEVRESNEAAIGLYRKLGFEVVKIRK 156 (177) T ss_pred EEEEEECCCCCCCCCEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHCCCEEEHHHH T ss_conf 99974047656676089999999978769989999999999999852767369999947869999999986994312137 Q ss_pred CCCCCCCCC-HHHEEEEE Q ss_conf 267798888-23226544 Q gi|254780989|r 149 IRYAHGGGA-ATDWLVHI 165 (185) Q Consensus 149 i~~~~~~~~-~~~~~~~~ 165 (185) -.+ .+.. +...|.+. T Consensus 157 ~yy--~~~~~~a~~~~~~ 172 (177) T COG0456 157 NYY--ADGNGDALLMLKM 172 (177) T ss_pred HHH--HCCCCCHHHHHHH T ss_conf 540--2455649999865 No 22 >cd04301 GNAT GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases. Probab=99.53 E-value=1.2e-13 Score=102.46 Aligned_cols=76 Identities=24% Similarity=0.268 Sum_probs=67.1 Q ss_pred EEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHC Q ss_conf 99989878647999999886067848998677525461599899999999999998899799995083------254313 Q gi|254780989|r 66 FVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHL 139 (185) Q Consensus 66 ~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~ 139 (185) |||++||+|||++.+.+... ..+...|..++|+|++||+|+|++|++++++++++.|+..+.+.+++ .||+|+ T Consensus 2 ~Vae~~g~ivG~~~~~~~~~-~~~~~~i~~i~V~~~~rg~GiG~~L~~~~~~~a~~~~~~~i~l~v~~~N~~a~~~y~k~ 80 (83) T cd04301 2 LVAEDDGELVGFASFFIYRP-APREAEIEGLAVDPSYRGQGLGSALLDALIDWALERGARRLYLDTSPDNEAAIAFYEKL 80 (83) T ss_pred EEEEECCEEEEEEEEEEECC-CCCEEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHC T ss_conf 99999998999999999738-99979999999999992999389999999999998799899999928989999999976 Q ss_pred CCE Q ss_conf 858 Q gi|254780989|r 140 GFK 142 (185) Q Consensus 140 GF~ 142 (185) ||+ T Consensus 81 Gft 83 (83) T cd04301 81 GFT 83 (83) T ss_pred CCC T ss_conf 999 No 23 >KOG3216 consensus Probab=99.39 E-value=5e-12 Score=92.76 Aligned_cols=131 Identities=15% Similarity=0.100 Sum_probs=94.8 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHH--CCCCCCCCCCCEEHHHHHCHHCC--CCCCCEEEEEEEE---CCEEEEEEEEEEE Q ss_conf 002075137975789999999997--27876667561520254120002--5766548999989---8786479999998 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKA--FLSTQEITDDVLHKYVRNNVEEP--LSFVRLMSFVCRD---GDRIVGAIRATPI 83 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~a--F~~~~~~~~~~~~~~~~~~~~~~--~~~~~~l~~Vae~---d~~iVG~i~~~~~ 83 (185) |+-+.||.++|+|.+.|.+|..+- |..-.... ...+.++.+. .+.+-.+++|+.. ++.++|++.+.+. T Consensus 1 m~~~~IR~at~~D~~~i~rLikela~Fek~~~~v-----~~te~~l~~~~F~d~~~~~~~v~~ie~~~~~~aGf~~yf~~ 75 (163) T KOG3216 1 MDNIRIRLATPKDCEDILRLIKELAEFEKLEDQV-----EATEENLARDGFIDPPFKHWLVAAIETSGEVVAGFALYFNN 75 (163) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCH-----HHCHHHHHHHHCCCCCCCEEEEEEEECCCCCEEEEEEEECC T ss_conf 9845999648435789999999998788743612-----20335666640468872089999982489814677655115 Q ss_pred --EECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECC------HHHHHHCCCEEHHH Q ss_conf --8606784899867752546159989999999999999889979999508------32543138587052 Q gi|254780989|r 84 --QIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAK------HDWHKHLGFKKPHQ 146 (185) Q Consensus 84 --~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~------~~~Y~r~GF~~~~~ 146 (185) +.....+++|..++|.|+|||+|+|+.|+...-+.|...|+..+--..+ ..+|++.|++.... T Consensus 76 ystW~~k~~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A~~~G~~rv~w~vldwN~rAi~lY~k~gaq~l~~ 146 (163) T KOG3216 76 YSTWLGKQGIYLEDLYVREQYRGKGIGSKLLKFVAEEADKLGTPRVEWVVLDWNHRAILLYEKVGAQDLKE 146 (163) T ss_pred CCCCCCCCEEEEEEEEECCHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHCCCCCCE T ss_conf 66300344078876472532203576899999999999973997079998124616899999857334340 No 24 >pfam00583 Acetyltransf_1 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins. Probab=99.38 E-value=3e-12 Score=94.08 Aligned_cols=74 Identities=23% Similarity=0.260 Sum_probs=64.3 Q ss_pred EEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHCCC Q ss_conf 989878647999999886067848998677525461599899999999999998899799995083------25431385 Q gi|254780989|r 68 CRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH------DWHKHLGF 141 (185) Q Consensus 68 ae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF 141 (185) ++.||++||++.+++.. ...+...+..++|+|+|||+|+|++||+.+++.+++.|+..+.+.+.+ .||+|+|| T Consensus 1 ~~~~g~ivG~~~~~~~~-~~~~~~~i~~l~V~p~~rg~GiG~~Ll~~~~~~~~~~g~~~i~l~~~~~n~~a~~~y~k~GF 79 (80) T pfam00583 1 AEEDGELVGFASLSIID-EEGNVAEIEGLAVDPEYRGKGIGTALLEALEEYARELGLKRIELEVLEDNEAAIALYEKLGF 79 (80) T ss_pred CCCCCEEEEEEEEEEEC-CCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCC T ss_conf 96789899999999964-89999999999999899599879999999999999879999999992787999999998697 Q ss_pred E Q ss_conf 8 Q gi|254780989|r 142 K 142 (185) Q Consensus 142 ~ 142 (185) + T Consensus 80 ~ 80 (80) T pfam00583 80 K 80 (80) T ss_pred C T ss_conf 9 No 25 >PRK10314 hypothetical protein; Provisional Probab=99.30 E-value=8.8e-11 Score=85.23 Aligned_cols=139 Identities=12% Similarity=0.125 Sum_probs=94.2 Q ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCC-CCEEEEEEEECCEEEEEEEEEEEEECCCCEEEE Q ss_conf 7513797578999999999727876667561520254120002576-654899998987864799999988606784899 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSF-VRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLR 93 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l 93 (185) .+..-+..++-+|.+|-.++|-..+.... .+ + +..+. .....+++.+++++||++++.+.. +......| T Consensus 8 ~f~eLs~~ely~il~lR~~VFV~EQ~~p~---~d-----~-D~~D~~~~~~H~~~~~~~~~va~~Ri~~~~-~~~~~~~I 77 (153) T PRK10314 8 HHSELSVSQLYALLQLRCAVFVVEQNCPY---QD-----I-DGDDLTGDNRHILGWKNDELVAYARILKSD-DDLEPVVI 77 (153) T ss_pred CHHHCCHHHHHHHHHHHHHHEEECCCCCC---HH-----H-HCCCCCCCCEEEEEEECCEEEEEEEEECCC-CCCCCEEE T ss_conf 61659999999999988873780179894---32-----2-156666761899999599388999872589-88773478 Q ss_pred EEEEECHHHHCCCHHHHHHHHHHHHHHHC-CCCEEEEECC---HHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECC Q ss_conf 86775254615998999999999999988-9979999508---32543138587052782677988882322654405 Q gi|254780989|r 94 GPLGVLSEYRKRGIGSRLAYMSFMAIKNS-GAEFVLIDAK---HDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFK 167 (185) Q Consensus 94 ~~laV~P~~qg~GiG~~Li~~~l~~a~~~-G~~~v~l~~~---~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~ 167 (185) +.|+|++++||+|+|++||+.+++.+++. +...+.+..- .+||++|||+..++- +. ...-+-..|+|+|. T Consensus 78 GRVaV~k~~Rg~glG~~Lm~~~l~~~~~~~~~~~i~l~AQ~~~~~FY~~~GF~~~g~~---f~-E~GIpHi~M~k~l~ 151 (153) T PRK10314 78 GRVIVSEALRGEKVGQQLMSKTLESCTRHWPDKPLYLGAQAHLQNFYASFGFIPVTDV---YE-EDGIPHIGMAREVI 151 (153) T ss_pred EEEEECHHHCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHCCCEECCCC---EE-ECCCCHHHHHHHHH T ss_conf 7899758784998999999999999999779997899628999999998799888984---55-38985798877753 No 26 >TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein. Probab=99.21 E-value=1.8e-10 Score=83.29 Aligned_cols=141 Identities=13% Similarity=0.151 Sum_probs=99.4 Q ss_pred CCCEEECCCC-HHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEEC--CEEEEEEEEEE---EE Q ss_conf 0020751379-757899999999972787666756152025412000257665489999898--78647999999---88 Q gi|254780989|r 11 LEGLTFANEQ-RVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDG--DRIVGAIRATP---IQ 84 (185) Q Consensus 11 ~~~l~IR~a~-~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d--~~iVG~i~~~~---~~ 84 (185) ..|++||..+ ++|.++|.+|+.+.-- -+...+.+. ..........|||+++ |.|||.++-.. +. T Consensus 80 ~~g~~ir~~~~~~d~~~in~iy~~~~M--vp~~~~~~~--------~~~~~~~~~y~vA~d~~tg~ivg~v~gvdH~~af 149 (547) T TIGR03103 80 PRGFTVRRLRGPADVDAINRLYAARGM--VPVRVDFVL--------DHRHSRAITYLVAEDEASGAIIGTVMGVDHRKAF 149 (547) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCC--CCCCHHHHH--------CCCCCCCCEEEEEEECCCCCEEEEEECCCHHHHC T ss_conf 897489967876679999999997698--479977731--------5566787259998607999579998654268652 Q ss_pred ECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE---CCH---HHHHHCCCEEHHHCCCCCCCCCCCH Q ss_conf 6067848998677525461599899999999999998899799995---083---2543138587052782677988882 Q gi|254780989|r 85 IGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID---AKH---DWHKHLGFKKPHQHQIRYAHGGGAA 158 (185) Q Consensus 85 ~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~---~~~---~~Y~r~GF~~~~~~~i~~~~~~~~~ 158 (185) .+...+..|+-|+|+|....-|+|.+|+++..++.+.+|...+=|. .|. ..|+|+||.+...+-|+...+ ++. T Consensus 150 ~dpe~gsSlW~LaVdpqa~~pgvGeaLvr~lae~~~~rg~~~~DLSV~hdN~~Ai~lY~kLgF~~~p~f~vKrkn~-iNe 228 (547) T TIGR03103 150 NDPEHGSSLWCLAVDPQAAHPGVGEALVRALAEHFQSRGCAYMDLSVMHDNEQAIALYEKLGFRRIPVFALKRKNA-INE 228 (547) T ss_pred CCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCEECCEEEEECCCC-CCC T ss_conf 8976783136886668889998129999999999987693105110104868899999971975564578860575-674 Q ss_pred HHEE Q ss_conf 3226 Q gi|254780989|r 159 TDWL 162 (185) Q Consensus 159 ~~~~ 162 (185) ..|. T Consensus 229 ~lf~ 232 (547) T TIGR03103 229 RLFS 232 (547) T ss_pred CCCC T ss_conf 3246 No 27 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=99.08 E-value=5.5e-11 Score=86.47 Aligned_cols=126 Identities=17% Similarity=0.194 Sum_probs=97.9 Q ss_pred CEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEE Q ss_conf 20751379757899999999972787666756152025412000257665489999898786479999998860678489 Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFL 92 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~ 92 (185) --.||+||-.|...|.+|.+= =.....+.+.-++.+.+.....+ |.|.||-|+||+.++|. .+.+... T Consensus 291 ~e~iR~At~~DvGGi~~LI~P------LEeqGiLv~RsRe~LE~~I~~FS----v~e~dG~~~GCAALypy--a~~~~GE 358 (439) T TIGR01890 291 FESIREATIDDVGGILELIRP------LEEQGILVRRSRELLEREIEEFS----VIEHDGLIIGCAALYPY--AEEDVGE 358 (439) T ss_pred HHHHHHHCCCCCCHHHHHCCC------HHHCCEECCHHHHHHHHHHHHCE----EEECCCCCHHHHHCCCC--CCCCCCC T ss_conf 898753112441008874351------34557310044787777530061----44017611212125887--6322330 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECC--HHHHHHCCCEEHHHCCCC Q ss_conf 9867752546159989999999999999889979999508--325431385870527826 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAK--HDWHKHLGFKKPHQHQIR 150 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~--~~~Y~r~GF~~~~~~~i~ 150 (185) +.=|+|+|+||+-|.|.+|+.|..++||..|....++.+. ..||.+-||.++..-..+ T Consensus 359 ~AClAv~P~~~~GG~G~~LL~h~~~~Ar~~Gl~~LFvLtT~t~hWF~erGF~~a~vd~LP 418 (439) T TIGR01890 359 MACLAVSPEYRDGGRGERLLKHIEDRARQMGLSRLFVLTTRTEHWFRERGFQEASVDELP 418 (439) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCEECCCCHHHHHHCCCCCCCHHCCH T ss_conf 121121767888887178999999986422513520002561347865489726642151 No 28 >pfam08445 FR47 FR47-like protein. The members of this family are similar to the C-terminal region of the D. melanogaster hypothetical protein FR47. This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. Probab=98.99 E-value=2.7e-09 Score=76.30 Aligned_cols=72 Identities=28% Similarity=0.324 Sum_probs=55.3 Q ss_pred EEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC--C-H--HHHHHCCC Q ss_conf 99898786479999998860678489986775254615998999999999999988997999950--8-3--25431385 Q gi|254780989|r 67 VCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA--K-H--DWHKHLGF 141 (185) Q Consensus 67 Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~--~-~--~~Y~r~GF 141 (185) .-++.|++++.+...+ .-.|+.|+|+|+|||||+|+.|++..++.+.++|........ | | .+|+|+|| T Consensus 5 f~~~~~el~aW~l~~~-------~G~i~~l~v~Pe~rgrG~G~~L~~~~~~~~~~~G~~~~l~V~~~N~~a~~lYeklGF 77 (86) T pfam08445 5 YRGDTGELAAWCLRLP-------GGELGALQTLPEHRRRGLGSRLVAALARGIAERGITPFAVVVAGNTPSRRLYEKLGF 77 (86) T ss_pred EECCCCCEEEEEEECC-------CCCEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCC T ss_conf 8789998999999889-------976899999989836988999999999999976994699998798689999998699 Q ss_pred EEHH Q ss_conf 8705 Q gi|254780989|r 142 KKPH 145 (185) Q Consensus 142 ~~~~ 145 (185) +... T Consensus 78 ~~v~ 81 (86) T pfam08445 78 RKID 81 (86) T ss_pred EEEE T ss_conf 7914 No 29 >COG2153 ElaA Predicted acyltransferase [General function prediction only] Probab=98.99 E-value=2e-09 Score=77.01 Aligned_cols=80 Identities=15% Similarity=0.164 Sum_probs=65.7 Q ss_pred EEEEE-CCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH----HHHHHCC Q ss_conf 99989-878647999999886067848998677525461599899999999999998899799995083----2543138 Q gi|254780989|r 66 FVCRD-GDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH----DWHKHLG 140 (185) Q Consensus 66 ~Vae~-d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~----~~Y~r~G 140 (185) +..+. +|++++++++.|...+... ..||.|.|.|++||+|+|++||..+|+.+.+..-+.-+..+.. +||.+|| T Consensus 52 l~~~~~~g~LvAyaRLlp~~~~~~~-~~iGRV~v~~~~RG~glG~~Lm~~AL~~~~~~~p~~~v~l~AQahLq~fYa~~G 130 (155) T COG2153 52 LLGWTPDGELVAYARLLPPGAEYEE-VSIGRVIVSPAARGQGLGQQLMEKALETAGREWPDKPVYLGAQAHLQDFYASFG 130 (155) T ss_pred EEEECCCCEEEEEEECCCCCCCCCC-EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHC T ss_conf 9998699969999860777777575-255469977756224545999999999988628999869841787999999709 Q ss_pred CEEHHH Q ss_conf 587052 Q gi|254780989|r 141 FKKPHQ 146 (185) Q Consensus 141 F~~~~~ 146 (185) |++.++ T Consensus 131 Fv~~~e 136 (155) T COG2153 131 FVRVGE 136 (155) T ss_pred CEECCC T ss_conf 677673 No 30 >KOG3235 consensus Probab=98.96 E-value=2.1e-09 Score=76.91 Aligned_cols=143 Identities=18% Similarity=0.206 Sum_probs=99.1 Q ss_pred CEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEEEEEE---CCC Q ss_conf 2075137975789999999997278766675615202541200025766548999989-878647999999886---067 Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRATPIQI---GKY 88 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~~~~~---~~~ 88 (185) +|.||.+++.|+-........+.+.+ +..++.. -+....+.++|||++ +|+|||++..- +.- +.. T Consensus 1 ~m~iR~ar~~DL~~mQ~~Nl~~lpEN------yqmkyyl----yh~lswp~lSyVA~D~~gkiVGYvlAk-mee~p~~~~ 69 (193) T KOG3235 1 GMNIRRARPDDLLEMQHCNLLNLPEN------YQMKYYL----YHGLSWPQLSYVAEDENGKIVGYVLAK-MEEDPDDEP 69 (193) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCCHH------HHHHHHH----HHHCCCCCCEEEEECCCCCEEEEEEEE-HHHCCCCCC T ss_conf 95634588888877664343368277------7478899----862266553478775899689876011-230656789 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHH-CCCCEEEEECCH------HHHH-HCCCEEHHHCCCCCCCCCCCHHH Q ss_conf 848998677525461599899999999999998-899799995083------2543-13858705278267798888232 Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKN-SGAEFVLIDAKH------DWHK-HLGFKKPHQHQIRYAHGGGAATD 160 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~-~G~~~v~l~~~~------~~Y~-r~GF~~~~~~~i~~~~~~~~~~~ 160 (185) ..-.|.+|+|.-.||+.|++++||..+++.+.+ .+.+.|.|.... ..|+ .+||+-... .+.+ |.+..|++ T Consensus 70 ~hGhItSlaV~rs~RrlGla~kLm~qa~rAm~E~~~A~yvsLHVR~SNraAl~LY~~tl~F~v~ev-e~kY-YadGedAy 147 (193) T KOG3235 70 PHGHITSLAVKRSYRRLGLAQKLMNQASRAMVEVYEAKYVSLHVRKSNRAALHLYKNTLGFVVCEV-EPKY-YADGEDAY 147 (193) T ss_pred CCCEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEECCCHHHHHHHHHCCCEEEEEC-CCCC-CCCCHHHH T ss_conf 887068765324688761899999999999998635338888641106888876661314188613-6531-03647899 Q ss_pred EEEEECCC Q ss_conf 26544057 Q gi|254780989|r 161 WLVHIFKN 168 (185) Q Consensus 161 ~~~~~l~~ 168 (185) -|-+.|+. T Consensus 148 aM~~~L~~ 155 (193) T KOG3235 148 AMRKDLSV 155 (193) T ss_pred HHHHHHHH T ss_conf 99988999 No 31 >PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional Probab=98.93 E-value=5.3e-08 Score=68.50 Aligned_cols=143 Identities=11% Similarity=0.146 Sum_probs=96.9 Q ss_pred CCCCCCCCCCEEECCCCHHHHHHHHHHHHHH---CCCCCCCCCCC--EEHHHH---HCHHCCCCCCCEEEEEEE--ECCE Q ss_conf 7987210002075137975789999999997---27876667561--520254---120002576654899998--9878 Q gi|254780989|r 4 QPNVNIELEGLTFANEQRVDQVAVDEVMVKA---FLSTQEITDDV--LHKYVR---NNVEEPLSFVRLMSFVCR--DGDR 73 (185) Q Consensus 4 ~~~~~~~~~~l~IR~a~~~D~~aI~~L~~~a---F~~~~~~~~~~--~~~~~~---~~~~~~~~~~~~l~~Vae--~d~~ 73 (185) .|.+.+|.+-|++|.-+++|+.++.+..... +.+-.+..+.. ...... ....+.......+.|+.. ++++ T Consensus 8 ~P~~~leT~RL~LR~~~~~Da~~l~~~~~~~r~~l~~w~p~~d~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~k~~~~ 87 (194) T PRK10809 8 VPKVRLTTDRLVVRLVHERDAWRLADYYAENRHFLKPWEPVRDESHCYPSGWQARLGMINEFHKQGSAFYFGLLDPEEKE 87 (194) T ss_pred CCCCEEECCCEEEECCCHHHHHHHHHHHHHCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCE T ss_conf 99868786978997899899999999997187555484766770215889999999999999866981799999989997 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHH-CCCCEEEEECCHH------HHHHCCCEEHHH Q ss_conf 647999999886067848998677525461599899999999999998-8997999950832------543138587052 Q gi|254780989|r 74 IVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKN-SGAEFVLIDAKHD------WHKHLGFKKPHQ 146 (185) Q Consensus 74 iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~-~G~~~v~l~~~~~------~Y~r~GF~~~~~ 146 (185) +||.+.++.+.-.......|| -.++|+|||||+++.-++.+++.+-+ .|...|....+|+ ..+|+||+.... T Consensus 88 liG~i~l~~i~~~~~~~~eIG-Y~l~~~~~GkGyatEA~~ali~~aF~~l~lhRIea~~~~~N~~S~rvleklGF~~EG~ 166 (194) T PRK10809 88 IIGVANFSNVVRGSFHACYLG-YSIGQKWQGQGLMFEALTAAIRYMQRTQHIHRIMANYMPHNKRSGDLLARLGFEKEGY 166 (194) T ss_pred EEEEEEEEECCCCCCCEEEEE-EEECHHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEEHHHCCHHHHHHHHHCCCEEEEE T ss_conf 999999874257765168899-9998889488889999999999998737953895643222877539999879968885 Q ss_pred C Q ss_conf 7 Q gi|254780989|r 147 H 147 (185) Q Consensus 147 ~ 147 (185) . T Consensus 167 ~ 167 (194) T PRK10809 167 A 167 (194) T ss_pred E T ss_conf 5 No 32 >KOG3397 consensus Probab=98.91 E-value=5.3e-09 Score=74.52 Aligned_cols=127 Identities=11% Similarity=0.160 Sum_probs=86.5 Q ss_pred CCCCEEECCC--CHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCC-EEEEEEEECCEEEEEEEEEEEEEC Q ss_conf 0002075137--9757899999999972787666756152025412000257665-489999898786479999998860 Q gi|254780989|r 10 ELEGLTFANE--QRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVR-LMSFVCRDGDRIVGAIRATPIQIG 86 (185) Q Consensus 10 ~~~~l~IR~a--~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~Vae~d~~iVG~i~~~~~~~~ 86 (185) ++..+++-+- .|+=..+-..|...-+.-++ ....+.+.+..+..+ .+.+.-|...++|||..++++- . T Consensus 9 S~~~l~~vPiH~rPELlk~~~~LIN~eWPRS~--------TsR~hSL~~ScDs~P~sL~Ll~E~~~~VigH~rLS~i~-n 79 (225) T KOG3397 9 SMPDLFFVPLHDRPELLKESMTLINSEWPRSD--------TSREHSLKKSCDSPPMSLLLLNEENDEVLGHSRLSHLP-N 79 (225) T ss_pred CCCCCEEEECCCCHHHHHHHHHHHHCCCCCCH--------HHHHHHHHCCCCCCCEEEEEECCCCCCEEEEECCCCCC-C T ss_conf 88862356546667899999998744387522--------57676665135777615555336652001110014567-8 Q ss_pred CCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HHHHHCCCEEHH Q ss_conf 67848998677525461599899999999999998899799995083--254313858705 Q gi|254780989|r 87 KYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH--DWHKHLGFKKPH 145 (185) Q Consensus 87 ~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~--~~Y~r~GF~~~~ 145 (185) ....++...|.|+-.+||+|.|+.||..+.+++|+.|...+.|.++. .||+++||+... T Consensus 80 ~~~al~VEsVVV~k~~RG~GFGk~lMk~~E~~~R~~gf~~~yLsT~DQ~~FYe~lGYe~c~ 140 (225) T KOG3397 80 RDHALWVESVVVKKDQRGLGFGKFLMKSTEKWMREKGFNEAYLSTDDQCRFYESLGYEKCD 140 (225) T ss_pred CCCEEEEEEEEEEHHHCCCCHHHHHHHHHHHHHHHHHHHHEEEECCCCCHHHHHHCCCCCC T ss_conf 8851688888874433155388999999999999853321464112310166651410248 No 33 >KOG4144 consensus Probab=98.87 E-value=1e-09 Score=78.84 Aligned_cols=140 Identities=14% Similarity=0.139 Sum_probs=90.1 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCC--CCCCEEEEEEEECCEEEEEEEEEEEEEC--- Q ss_conf 02075137975789999999997278766675615202541200025--7665489999898786479999998860--- Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPL--SFVRLMSFVCRDGDRIVGAIRATPIQIG--- 86 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~Vae~d~~iVG~i~~~~~~~~--- 86 (185) +.+.||+-.++|...+..|....|+.++.+.-......+- +..+.. -+.+...+.-...+.+|||++-+...-+ T Consensus 10 ~~~~irp~i~e~~q~~~~Lea~~FPe~erasfeii~~r~i-~~pevc~glf~~~~h~~~~~~~tLIghIigs~~~~E~lt 88 (190) T KOG4144 10 EAPRIRPGIPESCQRRHTLEASEFPEDERASFEIIRERFI-SVPEVCPGLFDEIRHFLTLCEGTLIGHIIGSLWDKERLT 88 (190) T ss_pred CCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH-CCHHHCCHHHHHHHHHHHHCCCCCEEHHHCCCCCCHHHH T ss_conf 4555798886899987426513398567778999999983-453326113366776454411301000000567601201 Q ss_pred ---------CCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCC--CEEEEECCH--HHHHHCCCEEHHHCCCCCC Q ss_conf ---------67848998677525461599899999999999998899--799995083--2543138587052782677 Q gi|254780989|r 87 ---------KYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGA--EFVLIDAKH--DWHKHLGFKKPHQHQIRYA 152 (185) Q Consensus 87 ---------~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~--~~v~l~~~~--~~Y~r~GF~~~~~~~i~~~ 152 (185) +.....|..|+|+|+||++|.|..|+..-++.+-.+-. +.+++.-+| .||+||||+......+... T Consensus 89 ~ESm~kh~s~g~ni~iHsl~Ihpa~rk~g~a~~Ll~~ylq~l~~q~i~~r~~Li~h~pLvPFYEr~gFk~vgp~~~~~~ 167 (190) T KOG4144 89 QESMTKHRSGGHNIHIHSLAIHPAFRKQGRAPILLWRYLQHLGSQPIVRRAALICHDPLVPFYERFGFKAVGPCAITVG 167 (190) T ss_pred HHHHHHHHCCCCCEEEEEEEECHHHHHCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHCCCEEECCCCCCCC T ss_conf 8777654058840367888745989866863168999999862675312101334577531667507403246422203 No 34 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=98.85 E-value=4.3e-08 Score=69.05 Aligned_cols=119 Identities=21% Similarity=0.165 Sum_probs=78.1 Q ss_pred CCHHHHHHHHHHHHHHCCCC-CCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 79757899999999972787-66675615202541200025766548999989878647999999886067848998677 Q gi|254780989|r 19 EQRVDQVAVDEVMVKAFLST-QEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLG 97 (185) Q Consensus 19 a~~~D~~aI~~L~~~aF~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~la 97 (185) -++....+|.+|..+|-..+ -++ +.+.....++... ......+|+.+++++|||+.+.+.. ...+ ....+. T Consensus 6 l~~~~~~~v~~l~~~a~~~DGv~p----lsE~~ll~Lr~~~-~~~~~hl~~~~~~~~vGya~l~~~~--~~~~-~~aElv 77 (292) T TIGR03448 6 LDADLRRDVRELLAAATAVDGVAP----VSEQVLRGLREPG-AGHTRHLVAVDSDPIVGYANLVPAR--GTDP-AMAELV 77 (292) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCC----CCHHHHHHCCCCC-CCCCEEEEEECCCEEEEEEEECCCC--CCCC-CEEEEE T ss_conf 899999999999999998429997----5579996443688-8860179994498689999962578--8877-647899 Q ss_pred ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH----HHHHHCCCEEHHHC Q ss_conf 525461599899999999999998899799995083----25431385870527 Q gi|254780989|r 98 VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH----DWHKHLGFKKPHQH 147 (185) Q Consensus 98 V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~----~~Y~r~GF~~~~~~ 147 (185) |+|++||+|||++|++.+++.. .+.-.++..|+. .+-+++||.++... T Consensus 78 VhP~~R~rGiG~aLl~a~~~~~--~~~l~~WahG~~p~A~alA~~~G~~~~R~l 129 (292) T TIGR03448 78 VHPAHRRRGIGRALIRALLAKG--GGRLRVWAHGDLPAARALASRLGLVPTREL 129 (292) T ss_pred ECCCCCCCCHHHHHHHHHHHHC--CCCCEEEECCCCHHHHHHHHHCCCEEEEHH T ss_conf 8900037987999999999743--877328865998899999997799001323 No 35 >COG4552 Eis Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only] Probab=98.83 E-value=1.2e-08 Score=72.48 Aligned_cols=115 Identities=15% Similarity=0.147 Sum_probs=81.0 Q ss_pred HHHHHHHHHHHHCCCC-CCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEE----CCCCEEEEEEEE Q ss_conf 7899999999972787-66675615202541200025766548999989878647999999886----067848998677 Q gi|254780989|r 23 DQVAVDEVMVKAFLST-QEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQI----GKYKGFLRGPLG 97 (185) Q Consensus 23 D~~aI~~L~~~aF~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~----~~~~~~~l~~la 97 (185) -..++..+...+|+.. .+......+.. +.. .=++|...+.++++.+.-.|..+ ...+...|+.|+ T Consensus 8 s~kE~i~~~s~~~~k~~~~~~~~~f~ki----l~~------~n~~vi~~nqkl~s~L~i~~f~~~f~~q~l~t~GIa~Va 77 (389) T COG4552 8 SFKEFIGLESYAFWKPLVPTDGAVFVKI----LAE------PNSYVIYMNQKLASRLHIPPFIFWFGNQVLPTAGIAGVA 77 (389) T ss_pred CHHHHHHHHHHHHCCCCCCCHHHHHHHH----CCC------CCCEEEEEHHHHHHCCCCCCHHEEECCEEEECCCEEEEE T ss_conf 3677875777876075540113566433----057------760688633445310335641201677363115347787 Q ss_pred ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHCCCEEHHHC Q ss_conf 5254615998999999999999988997999950-8325431385870527 Q gi|254780989|r 98 VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA-KHDWHKHLGFKKPHQH 147 (185) Q Consensus 98 V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~-~~~~Y~r~GF~~~~~~ 147 (185) +.|+|||+|.-++|+.++++..+++|+..++|.- ++.+|+||||..++.+ T Consensus 78 s~P~~R~~G~~~~Ll~~sLre~~~kG~p~s~L~P~s~~iYrKfGye~asn~ 128 (389) T COG4552 78 SAPTYRRRGALRALLAHSLREIARKGYPVSALHPFSGGIYRKFGYEYASNY 128 (389) T ss_pred ECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHCCCCCCCEE T ss_conf 265230471899999999999987697069963675445765063213348 No 36 >KOG3396 consensus Probab=98.83 E-value=4.3e-08 Score=69.06 Aligned_cols=132 Identities=15% Similarity=0.164 Sum_probs=86.4 Q ss_pred CCEEECCCCHHHHHH-HHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE--CCEEEEEEEEEEE--EEC Q ss_conf 020751379757899-99999997278766675615202541200025766548999989--8786479999998--860 Q gi|254780989|r 12 EGLTFANEQRVDQVA-VDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD--GDRIVGAIRATPI--QIG 86 (185) Q Consensus 12 ~~l~IR~a~~~D~~a-I~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~--d~~iVG~i~~~~~--~~~ 86 (185) +++.+|+...+|... ..+++.+--... ......+.+..+ .+....++ -.-.|+++ .++|+|.+.+.-- .+. T Consensus 5 ~~~~lR~L~~~D~~kGf~elL~qLT~vG-~vt~e~F~krf~-~mk~~~~~--Y~i~Vied~~s~~vigtatL~IE~KfIh 80 (150) T KOG3396 5 DGFKLRPLEEDDYGKGFIELLKQLTSVG-VVTREQFEKRFE-AMKKSGDW--YYIVVIEDKESEKVIGTATLFIERKFIH 80 (150) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHCC-CCCHHHHHHHHH-HHHHCCCC--EEEEEEEECCCCEEEEEEEEEEEHHHHH T ss_conf 7458861230320110999998775134-237999999999-99853896--7999999678672788877777666665 Q ss_pred C-CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHHHC Q ss_conf 6-78489986775254615998999999999999988997999950832---5431385870527 Q gi|254780989|r 87 K-YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPHQH 147 (185) Q Consensus 87 ~-~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~~~ 147 (185) . ...-.+..+.|++++||+++|+.|+...++.+++.|+=.+.|.++|+ ||+++||..+..+ T Consensus 81 ~~g~rGhiEDVVV~~~~rgk~LGkllv~~Lv~l~k~lgcYKi~LdC~~~nv~FYeKcG~s~~~~~ 145 (150) T KOG3396 81 GCGSRGHIEDVVVDSEYRGKQLGKLLVETLVDLAKSLGCYKIILDCDPKNVKFYEKCGYSNAGNE 145 (150) T ss_pred CCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCCCCHH T ss_conf 04445751479867545156786999999999987428279999636210148997396554202 No 37 >KOG3138 consensus Probab=98.81 E-value=1.4e-08 Score=71.93 Aligned_cols=147 Identities=14% Similarity=0.146 Sum_probs=98.9 Q ss_pred CEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECC----- Q ss_conf 207513797578999999999727876667561520254120002576654899998987864799999988606----- Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGK----- 87 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~----- 87 (185) .+.+|.-+|.|...+.+|.+..|+...... ...+.+.. ..+ ..+.+..+..+.|. ..-...... T Consensus 16 ~~~l~~it~~nl~~~~~l~~~~fP~~y~~k--fy~~~~~~-----~~~---~~~A~~~~~~v~a~-~~k~~~~~~~~~r~ 84 (187) T KOG3138 16 LIELRLITPNNLKQLKQLNEDIFPISYVDK--FYPDVLSN-----GDL---TQLAYYNEIAVGAV-ACKLIKFVQNAKRL 84 (187) T ss_pred CEEECCCCCCHHHHHHHHHCCCCCCCHHHH--HHHHHHHC-----CCH---HHHHHHCCCCCCCE-EEEEHHHHHHHHHH T ss_conf 035436783067899887202167635777--77888715-----777---77411101453001-33101224566654 Q ss_pred --CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC-CCEEEEEC----C--HHHHHHCCCEEHHHCCCCCCCCCCCH Q ss_conf --784899867752546159989999999999999889-97999950----8--32543138587052782677988882 Q gi|254780989|r 88 --YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG-AEFVLIDA----K--HDWHKHLGFKKPHQHQIRYAHGGGAA 158 (185) Q Consensus 88 --~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G-~~~v~l~~----~--~~~Y~r~GF~~~~~~~i~~~~~~~~~ 158 (185) .+..++-.++|+|.||.+|||+.|+.++.+.+.... ++.+.+.. + .+||+++||+....-...+.-....+ T Consensus 85 ~~~~~~yi~~Lgvl~~yR~~gIGs~Ll~~~~~~~~~~~~~~~v~lHv~~~n~~ai~~Y~~~gF~~~~~~~~~y~~~~~~~ 164 (187) T KOG3138 85 FGNRVIYILSLGVLPRYRNKGIGSKLLEFVKKYCSEAHQCRRVYLHVQAVNESAIEFYEKRGFEIVERLKNYYSILGPPD 164 (187) T ss_pred HCCCEEEEEEECCCHHHHHCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHCCCEEEECCCCCCCCCCCCC T ss_conf 03321477762243877742548788999998875056455179999717973889988349567402544210366720 Q ss_pred HHEEEEECCCCC Q ss_conf 322654405797 Q gi|254780989|r 159 TDWLVHIFKNNI 170 (185) Q Consensus 159 ~~~~~~~l~~~~ 170 (185) ..+|.+.|..+. T Consensus 165 ~~~l~~~~~~~~ 176 (187) T KOG3138 165 DSFLRKLLIHGS 176 (187) T ss_pred HHHHHHHEECCC T ss_conf 556532201488 No 38 >TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase; InterPro: IPR012752 This entry represents the WecD protein (formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetyltransferases.. Probab=98.81 E-value=7.8e-09 Score=73.49 Aligned_cols=125 Identities=18% Similarity=0.213 Sum_probs=94.5 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHHCCC--------CCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEE Q ss_conf 002075137975789999999997278--------766675615202541200025766548999989-87864799999 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKAFLS--------TQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRAT 81 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~aF~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~ 81 (185) |-....|.||.+|+++++++..++|.- .++....++..++|+...=+.++. ||+.++ .|.+.|||.+- T Consensus 51 mGay~~rvAte~DIPaLR~~As~aFAlSRFRaPWY~p~dSgRFYA~WvEnAV~GTFDHq---CL~~~da~g~p~G~VtLR 127 (201) T TIGR02382 51 MGAYSLRVATEADIPALRDVASAAFALSRFRAPWYDPEDSGRFYAQWVENAVLGTFDHQ---CLLLRDAAGDPRGYVTLR 127 (201) T ss_pred CCCCHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCH---HHHHHCCCCCCEEEEEEE T ss_conf 56202000033030689999999986410588888824232138888886505676401---352224477940468740 Q ss_pred EEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE---CCHH---HHHHCCC Q ss_conf 9886067848998677525461599899999999999998899799995---0832---5431385 Q gi|254780989|r 82 PIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID---AKHD---WHKHLGF 141 (185) Q Consensus 82 ~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~---~~~~---~Y~r~GF 141 (185) .. ++.+. -||-|+|-|.-|-||||.+||..+.++++.+|....-|- ||.. .|-|-|= T Consensus 128 ~L--~d~dA-RIGLLav~PG~~~rGiG~~LM~~A~~Wc~~~Gl~rLRVATQ~gN~AALrlYirsGA 190 (201) T TIGR02382 128 EL--DDSDA-RIGLLAVFPGVTIRGIGARLMALAKAWCRRRGLIRLRVATQMGNVAALRLYIRSGA 190 (201) T ss_pred EC--CCCCC-CEECCCCCCCCCEECHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHCCC T ss_conf 04--78765-33010037874120456899999999998759824403332467999999986389 No 39 >TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase; InterPro: IPR012772 This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analogous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.; GO: 0008415 acyltransferase activity, 0019491 ectoine biosynthetic process. Probab=98.80 E-value=3.9e-08 Score=69.31 Aligned_cols=119 Identities=14% Similarity=0.132 Sum_probs=80.2 Q ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECC-EEEEEEEEEEEEECCCCEEEEE Q ss_conf 513797578999999999727876667561520254120002576654899998987-8647999999886067848998 Q gi|254780989|r 16 FANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGD-RIVGAIRATPIQIGKYKGFLRG 94 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~-~iVG~i~~~~~~~~~~~~~~l~ 94 (185) ||+-+-+|-.+|.+|..++=.=+ .. +.+..++. ......-|.||+.+| +|+|||.-+ ..-+..+.+.++ T Consensus 1 ~R~P~~~DG~~I~~L~~~c~pLD--lN-S~Y~YlL~------~~~F~~TsiVae~egg~i~GFV~GY-~~P~~Pd~LFVW 70 (162) T TIGR02406 1 FRPPRIEDGAGIWELVKDCPPLD--LN-SSYAYLLL------CTDFADTSIVAEEEGGEIVGFVSGY-LKPDRPDVLFVW 70 (162) T ss_pred CCCCCHHHHHHHHHHHHHCCCCC--CC-HHHHHHHH------HHHHHHCEEEEECCCCEEEEEEEEE-ECCCCCCEEEEE T ss_conf 99985302789999997089985--64-47789998------7536642276537999033588621-267999788898 Q ss_pred EEEECHHHHCCCHHHHHHHHHHHH--HHHCCCCEEEEE--CCHH------HHHHCCCEEH Q ss_conf 677525461599899999999999--998899799995--0832------5431385870 Q gi|254780989|r 95 PLGVLSEYRKRGIGSRLAYMSFMA--IKNSGAEFVLID--AKHD------WHKHLGFKKP 144 (185) Q Consensus 95 ~laV~P~~qg~GiG~~Li~~~l~~--a~~~G~~~v~l~--~~~~------~Y~r~GF~~~ 144 (185) -|||++..||+|++++|+++.+++ +..-+.+...|+ --|+ ++++|+=++- T Consensus 71 QVAV~~~~RG~GLA~rll~all~R~~~~~v~~~I~~~EtTiTPdN~AS~aLF~~La~~~~ 130 (162) T TIGR02406 71 QVAVDPKARGKGLASRLLEALLERVALERVRHCIQAVETTITPDNEASRALFKKLARRRG 130 (162) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 754271314316899999997610333137982104662107666689999998765428 No 40 >COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only] Probab=98.77 E-value=2.6e-07 Score=64.37 Aligned_cols=80 Identities=21% Similarity=0.292 Sum_probs=64.7 Q ss_pred EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEE-EC-CH---HHHHHCCCEEHHHCCCCCCCCCCCHHHEEEE Q ss_conf 4899867752546159989999999999999889979999-50-83---2543138587052782677988882322654 Q gi|254780989|r 90 GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLI-DA-KH---DWHKHLGFKKPHQHQIRYAHGGGAATDWLVH 164 (185) Q Consensus 90 ~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l-~~-~~---~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~ 164 (185) ++-+..|||||++|++|||++|++..++.++ +|++.+-+ +| ++ .|+.|-||.+.+-...+.+ -..+..+.|+| T Consensus 531 G~RIvRIAvhPe~q~~GiGsrlL~~l~~~a~-~~~DwlgvsFG~t~~L~rFW~rnGF~pVhls~~rn~-~SGeys~i~lk 608 (758) T COG1444 531 GWRIVRIAVHPELQRMGIGSRLLALLIEEAR-KGLDWLGVSFGYTEELLRFWLRNGFVPVHLSPTRNA-SSGEYTAIVLK 608 (758) T ss_pred CEEEEEEEECHHHHHCCHHHHHHHHHHHHHH-CCCCEEEECCCCCHHHHHHHHHCCEEEEEECCCCCC-CCCCEEEEEEE T ss_conf 1268888708878635888999999999984-589889745678889999999759079873686676-77864689885 Q ss_pred ECCCCCC Q ss_conf 4057976 Q gi|254780989|r 165 IFKNNIM 171 (185) Q Consensus 165 ~l~~~~~ 171 (185) .|++... T Consensus 609 pLs~~~~ 615 (758) T COG1444 609 PLSDAGK 615 (758) T ss_pred CCCHHHH T ss_conf 4887799 No 41 >PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional Probab=98.72 E-value=6.5e-07 Score=61.91 Aligned_cols=153 Identities=16% Similarity=0.105 Sum_probs=94.2 Q ss_pred CCEEECCCCHHHHHHHHHHHHHHC---CCC--CCC---CCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEE Q ss_conf 020751379757899999999972---787--666---756152025412000257665489999898786479999998 Q gi|254780989|r 12 EGLTFANEQRVDQVAVDEVMVKAF---LST--QEI---TDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPI 83 (185) Q Consensus 12 ~~l~IR~a~~~D~~aI~~L~~~aF---~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~ 83 (185) |.+.+|+.+++|.+++.++..+.- ... .+. ................. ....+.|+...++++||.+.+..+ T Consensus 9 erl~LR~~~~~Da~~L~~li~~nr~~L~~wlpW~~~~~s~~d~~~~i~~~~~~~~-~g~~~~f~I~~~~~liG~i~l~~i 87 (179) T PRK10151 9 ESLELHAVDESHVTPLHQLVCKNKTWLQQSLNWPQYVQSEEDTRKTVQGNVMLHQ-RGYAKMFLIFKEDELIGVLSFNRI 87 (179) T ss_pred CCEEEEECCHHHHHHHHHHHHHCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHH-HCCCCEEEEEECCEEEEEEEEEEE T ss_conf 9979980888899999999996789863789999988999999999999999997-186748999999989999999997 Q ss_pred EECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHC-CCCEEEEECCHH------HHHHCCCEEHHHCC-CCCCCCC Q ss_conf 860678489986775254615998999999999999988-997999950832------54313858705278-2677988 Q gi|254780989|r 84 QIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNS-GAEFVLIDAKHD------WHKHLGFKKPHQHQ-IRYAHGG 155 (185) Q Consensus 84 ~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~-G~~~v~l~~~~~------~Y~r~GF~~~~~~~-i~~~~~~ 155 (185) ... .....|| --+.|+|||+|+++.-++..++.+-+. +...+.+.+.++ ..+|+||+..+... -....|. T Consensus 88 ~~~-~~~aeiG-Ywl~~~~~GkG~~tea~~~li~~~f~~l~l~Ri~~~~~~~N~~S~~v~ek~Gf~~EG~lr~~~~~~G~ 165 (179) T PRK10151 88 EPL-NKTAYIG-YWLDESHQGQGIISQSLQALIHHYAQSGELRRFVIKCRVDNPASNQVALRNGFILEGCLKQAEYLNGA 165 (179) T ss_pred ECC-CCEEEEE-EEECHHHCCCCHHHHHHHHHHHHHHHHCCEEEEEEEEECCCHHHHHHHHHCCCEEEEEEEEEEEECCE T ss_conf 056-6769999-99887774896899999999999997289179999991899899999998899999986422999899 Q ss_pred CCHHHEEEEECC Q ss_conf 882322654405 Q gi|254780989|r 156 GAATDWLVHIFK 167 (185) Q Consensus 156 ~~~~~~~~~~l~ 167 (185) +.|-...-+.++ T Consensus 166 ~~D~~~ys~i~d 177 (179) T PRK10151 166 YDDVNLYARIID 177 (179) T ss_pred EEEEEEEEEEEC T ss_conf 988130889863 No 42 >KOG2488 consensus Probab=98.69 E-value=1.9e-07 Score=65.09 Aligned_cols=122 Identities=14% Similarity=0.047 Sum_probs=78.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCC-CEEHHHHHCHHCCCCCCCEEEEEEEECC-EEEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 57899999999972787666756-1520254120002576654899998987-864799999988606784899867752 Q gi|254780989|r 22 VDQVAVDEVMVKAFLSTQEITDD-VLHKYVRNNVEEPLSFVRLMSFVCRDGD-RIVGAIRATPIQIGKYKGFLRGPLGVL 99 (185) Q Consensus 22 ~D~~aI~~L~~~aF~~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~Vae~d~-~iVG~i~~~~~~~~~~~~~~l~~laV~ 99 (185) +|.+....|...-.++....... ....--+..++.. ....++|.++. ++|||.+|--..-.+...+++..|-|. T Consensus 54 ~~ldw~f~L~k~nm~~~Y~qs~~Gw~~~~K~~El~~~----~~~Yi~a~~~~~~~vgf~~Frf~vd~g~~vlYcyEvqv~ 129 (202) T KOG2488 54 EDLDWCFSLFKKNMGAMYRQSSWGWDDNSKAKELRNR----KLRYICAWNNKSKLVGFTMFRFTVDTGDPVLYCYEVQVA 129 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHC----CCEEEEEECCCCCEEEEEEEEEECCCCCEEEEEEEEEEH T ss_conf 8899999999863177765162265725679998634----336999986888624688999870249869999987645 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC------CHHHHHHCCCEEHHHC Q ss_conf 54615998999999999999988997999950------8325431385870527 Q gi|254780989|r 100 SEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA------KHDWHKHLGFKKPHQH 147 (185) Q Consensus 100 P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~------~~~~Y~r~GF~~~~~~ 147 (185) ++|||+|||+.|+..+...+.......|+|+. .-.||+++||.+..+. T Consensus 130 ~~yR~kGiGk~LL~~l~~~a~~~~~~kVmLTVf~~N~~al~Fy~~~gf~~~~~s 183 (202) T KOG2488 130 SAYRGKGIGKFLLDTLEKLADSRHMRKVMLTVFSENIRALGFYHRLGFVVDEES 183 (202) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHHHHHHHHCCCCCCCCC T ss_conf 666526858999999999888887665114443146504778997596247889 No 43 >TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar. Probab=98.64 E-value=9.7e-07 Score=60.89 Aligned_cols=125 Identities=16% Similarity=0.150 Sum_probs=84.5 Q ss_pred ECCCCHHHHHHHHHHHHH-----HCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCE Q ss_conf 513797578999999999-----727876667561520254120002576654899998987864799999988606784 Q gi|254780989|r 16 FANEQRVDQVAVDEVMVK-----AFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKG 90 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~-----aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~ 90 (185) +|+-+++|++.|.+.... -+.+..+........+.+. .........|+.+.++++||.+.+....... .. T Consensus 3 LRpl~~~D~~~i~~WrNdpev~~~~~~~~~~s~ee~~~w~~~----~~~~~~~~~~ii~~~~~~IG~i~l~~id~~~-~~ 77 (156) T TIGR03585 3 FTPLNSEELELVLEWRNHPDVRANMYSDHLIDWEEHLHFIEA----LKQDPNRRYWIVCQESRPIGVISFTDINLVH-KS 77 (156) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHHH----HHCCCCCEEEEEEECCCEEEEEEEEECCCCC-CE T ss_conf 876999999999999589999975289887799999999999----7459973899999799289999887403003-98 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHH-HCCCCEEEEECCH------HHHHHCCCEEHHHC Q ss_conf 899867752546159989999999999999-8899799995083------25431385870527 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIK-NSGAEFVLIDAKH------DWHKHLGFKKPHQH 147 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~-~~G~~~v~l~~~~------~~Y~r~GF~~~~~~ 147 (185) ..++ +.+.| ++++|+|+.++..+++.+- ..+...+.+.... .+|+++||+..... T Consensus 78 ae~g-i~~~~-~~~~G~g~~~~~~~l~~aF~~l~l~rl~~~v~~~N~~si~l~ek~GF~~EG~l 139 (156) T TIGR03585 78 AFWG-IYANP-FCKPGVGSVLEEAALEYAFEHLGLHKLSLEVLEFNNKALKLYEKFGFEREGVF 139 (156) T ss_pred EEEE-EEECH-HHCCCHHHHHHHHHHHHHHHHCCEEEEEEEECCCCHHHHHHHHHCCCEEEEEE T ss_conf 9999-98222-10257799999999999987569269999962568999999997899999998 No 44 >COG3393 Predicted acetyltransferase [General function prediction only] Probab=98.63 E-value=1e-07 Score=66.74 Aligned_cols=128 Identities=17% Similarity=0.120 Sum_probs=79.6 Q ss_pred CCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCC-EEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCC Q ss_conf 000207513797578999999999727876667561-5202541200025766548999989878647999999886067 Q gi|254780989|r 10 ELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDV-LHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKY 88 (185) Q Consensus 10 ~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~ 88 (185) +-+.+.+|.++..|.. +..+..+++-+......+. ..-...+.+-+ ..-.| .+.||+||..+.-. .... T Consensus 130 ~~~~~~~r~a~~~D~~-i~~~~~~~~l~~~g~~~~~~~~~~~~~a~g~-----~~~~f-~~~d~~iVa~A~t~---a~~~ 199 (268) T COG3393 130 SPEELDVRLAAAKDMF-IPEVGLRATLDDFGRADSRKEAVAVLNALGR-----SRTYF-LEGDGKIVAKAETA---AENP 199 (268) T ss_pred CCCCCEEEEEECCCCC-CHHEEEEEEECCCCCCCCHHHHHHHHHHHHC-----EEEEE-ECCCCCEEEEEECC---CCCC T ss_conf 1554505530012342-3101321001566667316778899997421-----05799-82688389862005---6688 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE--CCH---HHHHHCCCEEHHHC Q ss_conf 848998677525461599899999999999998899799995--083---25431385870527 Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID--AKH---DWHKHLGFKKPHQH 147 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~--~~~---~~Y~r~GF~~~~~~ 147 (185) ....|..++|+|+|||||++++|+...-+.+-++|...++.. .|| .-|+|+||+..+++ T Consensus 200 ~~~~I~gV~T~peyR~kGyAt~lva~L~~~lL~eGk~~~L~~~~~N~~A~~iY~riGF~~~g~~ 263 (268) T COG3393 200 AYAQINGVYTHPEYRGKGYATALVATLAAKLLAEGKIPCLFVNSDNPVARRIYQRIGFREIGEF 263 (268) T ss_pred CCEEEEEEECCHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCCEECCEE T ss_conf 6168987871978726548999999999999857981489983478788999998097142417 No 45 >COG3981 Predicted acetyltransferase [General function prediction only] Probab=98.59 E-value=5.1e-07 Score=62.58 Aligned_cols=123 Identities=19% Similarity=0.287 Sum_probs=81.3 Q ss_pred CCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCC----------CCEEHHHHHCHH------CCCCCCCEEEEEEEE-CCE Q ss_conf 002075137975789999999997278766675----------615202541200------025766548999989-878 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVMVKAFLSTQEITD----------DVLHKYVRNNVE------EPLSFVRLMSFVCRD-GDR 73 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~----------~~~~~~~~~~~~------~~~~~~~~l~~Vae~-d~~ 73 (185) |+.|.+|..+-.|..++.+...+ |........ ..+.++++...+ -...+++...+.+.+ |++ T Consensus 1 me~~~l~~p~L~~k~a~le~~~e-~~~~~~~~~~~~~~~~~~~~~fed~L~~~~~~~~~~~~~~g~V~~~~y~~v~~d~~ 79 (174) T COG3981 1 MEEMKLRRPTLKDKDAFLEMKKE-FLTDGSTEAGAAWKADYEQEDFEDWLEDLTRQEPGNNLPEGWVPASTYWAVDEDGQ 79 (174) T ss_pred CCCCCCCCCCHHHHHHHHHHHHH-HHHCCCCCCCCEEECCCCCCCHHHHHHHHHCCCCCCCCCCCCEECEEEEEEECCCC T ss_conf 97540037751228999999873-64068855572012330030099999998602778689998030336899933880 Q ss_pred EEEEEEEEEEEECCCC---EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHH Q ss_conf 6479999998860678---48998677525461599899999999999998899799995083254 Q gi|254780989|r 74 IVGAIRATPIQIGKYK---GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWH 136 (185) Q Consensus 74 iVG~i~~~~~~~~~~~---~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y 136 (185) +||++.+- .++++.- +-.|| -+|.|..||+|+|+.+++.++++|++.|.+.|+|++|.+.- T Consensus 80 ivG~i~lR-h~Ln~~ll~~gGHIG-Y~VrPseR~KGYA~emLkl~L~~ar~lgi~~Vlvtcd~dN~ 143 (174) T COG3981 80 IVGFINLR-HQLNDFLLEEGGHIG-YSVRPSERRKGYAKEMLKLALEKARELGIKKVLVTCDKDNI 143 (174) T ss_pred EEEEEEEE-EECCHHHHHCCCCCC-CEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 89999862-100568886278564-11476352467899999999999998499769998479981 No 46 >TIGR01211 ELP3 histone acetyltransferase, ELP3 family; InterPro: IPR005910 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones . Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. The yeast GCN5 (yGCN5) transcriptional coactivator functions as a histone acetyltransferase (HAT) to promote transcriptional activation. The crystal structure of the yeast histone acetyltransferase Hat1-acetyl coenzyme A (AcCoA) shows that Hat1 has an elongated, curved structure, and the AcCoA molecule is bound in a cleft on the concave surface of the protein, marking the active site of the enzyme. A channel of variable width and depth that runs across the protein is probably the binding site for the histone substrate . The central protein core associated with AcCoA binding that appears to be structurally conserved among a superfamily of N-acetyltransferases, including yeast histone acetyltransferase 1 and Serratia marcescens aminoglycoside 3-N-acetyltransferase . The Saccharomyces cerevisiae member YPL086C has been characterised in vitro as an N-terminal acetyltransferase (2.3.1.48 from EC) for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. In vivo, ELP3 gene deletion confers typical ELP phenotypes such as slow growth adaptation, slow gene activation, and temperature sensitivity. This suggests a role for the proteins as novel, tightly RNAPII-associated histone acetyltransferases in transcription of DNA packaged in chromatin .. Probab=98.35 E-value=3.6e-07 Score=63.49 Aligned_cols=85 Identities=25% Similarity=0.435 Sum_probs=62.6 Q ss_pred EEEEEEEECCEEEEEEEEE----E-EE---ECCCCEEE---------EEEEEE---CHHHHCCCHHHHHHHHHHHHHHHC Q ss_conf 4899998987864799999----9-88---60678489---------986775---254615998999999999999988 Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRAT----P-IQ---IGKYKGFL---------RGPLGV---LSEYRKRGIGSRLAYMSFMAIKNS 122 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~----~-~~---~~~~~~~~---------l~~laV---~P~~qg~GiG~~Li~~~l~~a~~~ 122 (185) -+|+.-..+|.+|||++|= | +. ++...++. .-||+- .-+||.||+|++||++|.+.|++. T Consensus 459 FlS~Ed~~~D~LiGFlRLR~P~e~ya~r~e~d~~~TAlvRELHVYG~~vP~g~D~~~~~~QHrGyG~~Ll~EAEriA~Ee 538 (573) T TIGR01211 459 FLSYEDPKNDILIGFLRLRFPSEPYAHRKEVDAQRTALVRELHVYGSEVPIGEDRGEDEWQHRGYGRRLLEEAERIAKEE 538 (573) T ss_pred EEEEECCCCCEEEEEEECCCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 87731100040021000068887213455430576103210321057212567787776444425578999999985630 Q ss_pred -CCCEEEEE---CCHHHHHH-CCCEEHHHC Q ss_conf -99799995---08325431-385870527 Q gi|254780989|r 123 -GAEFVLID---AKHDWHKH-LGFKKPHQH 147 (185) Q Consensus 123 -G~~~v~l~---~~~~~Y~r-~GF~~~~~~ 147 (185) |.+.++|. |-=+||+| |||....-| T Consensus 539 fG~~KIlVISGiGVReYYrKqlGY~~~GPY 568 (573) T TIGR01211 539 FGLEKILVISGIGVREYYRKQLGYEKDGPY 568 (573) T ss_pred CCCEEEEEECCCCCCCHHHHHCCCCCCCCE T ss_conf 697016785141010000223263213771 No 47 >KOG3234 consensus Probab=98.24 E-value=4.5e-06 Score=56.85 Aligned_cols=145 Identities=17% Similarity=0.125 Sum_probs=92.7 Q ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEEEEEECCCC-EEE Q ss_conf 75137975789999999997278766675615202541200025766548999989-8786479999998860678-489 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRATPIQIGKYK-GFL 92 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~~~~~~~~~-~~~ 92 (185) +||+.++.|.-....+...-.... ..-.+.-.+ +..+ ++.+.|++. +++|-|+++- +++..... +.. T Consensus 3 t~r~f~~~Dlf~fNninLDpltEt--~~~~Fyl~y----l~~~----pe~~~~a~~p~~~imgyimg-k~Eg~~~~wh~H 71 (173) T KOG3234 3 TIRPFTPQDLFKFNNINLDPLTET--FPISFYLIY----LAIW----PEDFIVAEAPTGEIMGYIMG-KVEGKDTEWHGH 71 (173) T ss_pred CCCCCCHHHHHHHCCCCCCCCCCC--CCEEHHHHH----HHHC----HHHHEECCCCCCCEEEEEEE-ECCCCCCCEEEE T ss_conf 655366788775356214634134--660005999----8757----58727311799846899854-204667513667 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC---C---HHHHHHCCCEEHHHCCCCCCC-CCCCHHHEEEEE Q ss_conf 986775254615998999999999999988997999950---8---325431385870527826779-888823226544 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA---K---HDWHKHLGFKKPHQHQIRYAH-GGGAATDWLVHI 165 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~---~---~~~Y~r~GF~~~~~~~i~~~~-~~~~~~~~~~~~ 165 (185) +..|+|.|+||+.|+|+.||....+.....+.-++-|.. | .++|++|||...+. -|.+.. +...+++=|-|. T Consensus 72 vTAltVap~~Rrl~la~~lm~~led~~d~~~a~fvDLfVr~sN~iAI~mYkkLGY~~YR~-Vi~YY~~g~deda~dMRKa 150 (173) T KOG3234 72 VTALTVAPDYRRLGLAAKLMDTLEDVSDVDNAYFVDLFVRVSNQIAIDMYKKLGYSVYRT-VIEYYSVGPDEDAYDMRKA 150 (173) T ss_pred EEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEEECCCHHHHHHHHHCCCEEEEE-EEEEECCCCCCCHHHHHHH T ss_conf 898873546888779999999999988753212310320006545899998529368876-5466503887306766655 Q ss_pred CCCCCC Q ss_conf 057976 Q gi|254780989|r 166 FKNNIM 171 (185) Q Consensus 166 l~~~~~ 171 (185) |+-+.- T Consensus 151 lSrD~d 156 (173) T KOG3234 151 LSRDVD 156 (173) T ss_pred HCCCCC T ss_conf 503765 No 48 >COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis] Probab=98.18 E-value=6.8e-05 Score=49.76 Aligned_cols=139 Identities=13% Similarity=0.056 Sum_probs=82.3 Q ss_pred CCCEEECCCCHHHHHHHHHHH--HHH--CCCCCC-CCCCCEEHHHHHCHHCCCCCCCEEEEEEE--E--CCEEEEEEEEE Q ss_conf 002075137975789999999--997--278766-67561520254120002576654899998--9--87864799999 Q gi|254780989|r 11 LEGLTFANEQRVDQVAVDEVM--VKA--FLSTQE-ITDDVLHKYVRNNVEEPLSFVRLMSFVCR--D--GDRIVGAIRAT 81 (185) Q Consensus 11 ~~~l~IR~a~~~D~~aI~~L~--~~a--F~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~Vae--~--d~~iVG~i~~~ 81 (185) ..-+.+|..+..|...+.... ... +..... .............+...........|+.. . ++++||.+.++ T Consensus 7 ~~r~~lr~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iG~~~~~ 86 (187) T COG1670 7 TLRLLLREVDLEDLELLAEWANDPEVMLFWWLPPPLTPPTSDEELLRLLAEAWEDLGGGAFAIELKATGDGELIGVIGLS 86 (187) T ss_pred CCCCCCCCCCCCCHHHHHHHCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEEEEEE T ss_conf 55510155671379999875678777743057766666578899999999876315884599998027887089999978 Q ss_pred EEEE-CCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHH-CCCCEEEEECCHH------HHHHCCCEEHHHCCCC Q ss_conf 9886-067848998677525461599899999999999998-8997999950832------5431385870527826 Q gi|254780989|r 82 PIQI-GKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKN-SGAEFVLIDAKHD------WHKHLGFKKPHQHQIR 150 (185) Q Consensus 82 ~~~~-~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~-~G~~~v~l~~~~~------~Y~r~GF~~~~~~~i~ 150 (185) ...- .......++ ....|+|+|+|+|+..++..++.+-+ .|...+.+..+++ .++|+||+......-. T Consensus 87 ~~~~~~~~~~~~ig-~~l~~~~~g~G~~tea~~~~l~~~f~~~~~~rv~~~~~~~N~~S~rv~ek~Gf~~~g~~~~~ 162 (187) T COG1670 87 DIDRAANGDLAEIG-YWLDPEYWGKGYATEALRALLDYAFEELGLHRIEATVDPENEASIRVYEKLGFRLEGELRQH 162 (187) T ss_pred ECCCCCCCCCEEEE-EEECHHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEEECEEEEC T ss_conf 63465678857999-99996663892699999999999750079658999988899999999998799992524101 No 49 >PHA00673 acetyltransferase domain containing protein Probab=98.15 E-value=1.3e-05 Score=54.00 Aligned_cols=84 Identities=18% Similarity=0.239 Sum_probs=66.3 Q ss_pred EEEEEEEECCEEEEEEEEEEEEECCCCEEEEE---EEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH----- Q ss_conf 48999989878647999999886067848998---6775254615998999999999999988997999950832----- Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRATPIQIGKYKGFLRG---PLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD----- 134 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~---~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~----- 134 (185) .+.|-|..++++|||+.+.-....+..+..++ ++.|+++||+-|.|-+|++.+.+.+++.|+...++.+.++ T Consensus 55 ~~~l~a~q~d~LIGf~~llVT~~pHy~g~~i~t~es~fV~~eHR~ggaGlkLlr~te~~A~~LGa~~ll~sa~~~~~Ls~ 134 (154) T PHA00673 55 AHFLGVFRGEELVGFACLLVTPVPHFKGQLIGTTESIFVAAAHRPGGAGMALLRATEALARDLGATGLYVSGPTEGRLVQ 134 (154) T ss_pred EEEEEECCCCEEEEEEEEECCCCCCCCCCEEECCCEEEEEHHHCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHH T ss_conf 37875212871466544421367632477751000257746443886168999999999986087369995699981412 Q ss_pred HHHHCCCEEHHH Q ss_conf 543138587052 Q gi|254780989|r 135 WHKHLGFKKPHQ 146 (185) Q Consensus 135 ~Y~r~GF~~~~~ 146 (185) .-.|.||.++.+ T Consensus 135 ~Lp~~~y~~tN~ 146 (154) T PHA00673 135 LLPAAGYRETNR 146 (154) T ss_pred CCCCCCCCCCCC T ss_conf 125347764662 No 50 >cd02169 Citrate_lyase_ligase Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction. Probab=98.14 E-value=1.5e-05 Score=53.70 Aligned_cols=70 Identities=17% Similarity=0.239 Sum_probs=59.6 Q ss_pred EECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHH Q ss_conf 898786479999998860678489986775254615998999999999999988997999950832---54313858705 Q gi|254780989|r 69 RDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPH 145 (185) Q Consensus 69 e~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~ 145 (185) ++++++||+..+ .+-.|--+||+|++||.|+..+|+.+.++.+.++|....++++.|+ +++.+||.... T Consensus 12 d~~~~lvatGs~--------~gnvlKcvAV~~~~qg~gl~~~lvs~L~~~~~~~G~~h~FvfTKp~~~~~F~~lGF~~ia 83 (296) T cd02169 12 DDAGNLIATGSL--------AGNILKCVAVSPKYQGEGLLLKLVSELINRAFERGIFHLFVFTKPENVKFFSSLGFKELA 83 (296) T ss_pred ECCCCEEEEEEC--------CCCEEEEEEECHHCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHCCCEEEE T ss_conf 079978997302--------147879999872105864799999999999997699648999371057778758977988 Q ss_pred H Q ss_conf 2 Q gi|254780989|r 146 Q 146 (185) Q Consensus 146 ~ 146 (185) . T Consensus 84 ~ 84 (296) T cd02169 84 A 84 (296) T ss_pred E T ss_conf 7 No 51 >pfam04958 AstA Arginine N-succinyltransferase beta subunit. Arginine N-succinyltransferase EC:2.3.1.109 catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism by the arginine succinyltransferase pathway. Probab=98.05 E-value=3.9e-05 Score=51.20 Aligned_cols=127 Identities=19% Similarity=0.180 Sum_probs=77.0 Q ss_pred EEECCCCHHHHHHHHHHHHHHCC--CCCCCCCCCEEHHHHHCHHCCC------CCCCEEEEEEEE--CCEEEEEEEEEEE Q ss_conf 07513797578999999999727--8766675615202541200025------766548999989--8786479999998 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFL--STQEITDDVLHKYVRNNVEEPL------SFVRLMSFVCRD--GDRIVGAIRATPI 83 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~--~~~~~~~~~~~~~~~~~~~~~~------~~~~~l~~Vae~--d~~iVG~i~~~~~ 83 (185) |.||+++++|++++.+|...+=. ++-+...+.+...++....... .......||.|+ .++|||....... T Consensus 1 lviRpv~~~Dl~~L~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~Sf~~~~~~~~~~~~YlFVLEd~~tg~vvGts~I~a~ 80 (338) T pfam04958 1 LVIRPARASDLPALYRLAKESGHGFTSLPADRELLRAKIARSEASFAGEEVDQPGDEGYLFVLEDTETGEVVGTSGIEAA 80 (338) T ss_pred CEEEECCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCEEEEEEEEEEC T ss_conf 95721751039999999998099865599999999999999999864677799775217999971578848888766643 Q ss_pred ----------EE----------------------CCCC-EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC---CCEE Q ss_conf ----------86----------------------0678-4899867752546159989999999999999889---9799 Q gi|254780989|r 84 ----------QI----------------------GKYK-GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG---AEFV 127 (185) Q Consensus 84 ----------~~----------------------~~~~-~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G---~~~v 127 (185) ++ .... ...|+.|.++|+||+.|.|+.|-..-+--+.+.- .+.| T Consensus 81 vG~~~Pfy~yr~~~~~h~S~~L~v~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RfLfma~~~~rF~~~v 160 (338) T pfam04958 81 VGLDEPFYSYRLSTLVHASRELGVHNRVEVLTLCNDLTGCSELCTLFLDPEYRKGGNGRLLSRSRFLFIAAHRERFAERV 160 (338) T ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCEEEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 46787867899322332161248520101588750677870368887488881776315789999999876266643556 Q ss_pred EEE--------CCHHHHHHCC Q ss_conf 995--------0832543138 Q gi|254780989|r 128 LID--------AKHDWHKHLG 140 (185) Q Consensus 128 ~l~--------~~~~~Y~r~G 140 (185) +.+ |...||..+| T Consensus 161 iAEmRG~~De~G~SPFW~~lg 181 (338) T pfam04958 161 IAEMRGVSDEDGRSPFWDALG 181 (338) T ss_pred HHHHCCCCCCCCCCCHHHHHH T ss_conf 645158618899982378763 No 52 >COG2388 Predicted acetyltransferase [General function prediction only] Probab=97.96 E-value=4e-05 Score=51.14 Aligned_cols=66 Identities=15% Similarity=0.180 Sum_probs=54.6 Q ss_pred EEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 489999898786479999998860678489986775254615998999999999999988997999950 Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA 131 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~ 131 (185) ...++..++|..+|.+.+++. +.+.+.+.--+|.+++||||+|++|+.++++.+|+.|...+=+.. T Consensus 15 ~~~y~~~~~G~~~~e~~y~~~---~~~~i~i~HT~V~d~lrGqGia~~L~~~al~~ar~~g~kiiP~Cs 80 (99) T COG2388 15 NGRYVLTDEGEVIGEATYYDR---GENLIIIDHTYVPDELRGQGIAQKLVEKALEEAREAGLKIIPLCS 80 (99) T ss_pred CEEEEEECCCCEEEEEEEECC---CCCEEEEECCCCCHHHCCCCHHHHHHHHHHHHHHHCCCEECCCCH T ss_conf 518999049938878888417---998799853768988858747999999999999981985733663 No 53 >PRK10456 arginine succinyltransferase; Provisional Probab=97.95 E-value=6.8e-05 Score=49.75 Aligned_cols=127 Identities=16% Similarity=0.151 Sum_probs=74.1 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCC--CCCCCCCCEEHHHHHCHHC---C-CCCCCEEEEEEEE--CCEEEEEEEEEEE-- Q ss_conf 075137975789999999997278--7666756152025412000---2-5766548999989--8786479999998-- Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLS--TQEITDDVLHKYVRNNVEE---P-LSFVRLMSFVCRD--GDRIVGAIRATPI-- 83 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~--~~~~~~~~~~~~~~~~~~~---~-~~~~~~l~~Vae~--d~~iVG~i~~~~~-- 83 (185) |.||+++++|++++.+|...+-.. +-+...+.+...++..... . ......+.||.|+ .|+|||....... T Consensus 2 livRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG 81 (344) T PRK10456 2 MVIRPVERSDLAALMQLAGKTGGGLTSLPANEATLAARIERALKTWQGELPKSEQGYVFVLEDSETGTVAGICAIEVAVG 81 (344) T ss_pred EEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEEEEEECCC T ss_conf 48973760049999999997099976599999999999999999864789976431799998557880888875785035 Q ss_pred --------EE----------------------CC-CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC---CCEEEE Q ss_conf --------86----------------------06-784899867752546159989999999999999889---979999 Q gi|254780989|r 84 --------QI----------------------GK-YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG---AEFVLI 129 (185) Q Consensus 84 --------~~----------------------~~-~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G---~~~v~l 129 (185) ++ .+ .....|+.|.++|+||+.|.|+.|-..-+-.+.+.- .+.|+. T Consensus 82 ~~~Pfy~yr~~~~~h~S~~L~~~~~~~~L~l~~d~tG~sEl~tLfL~p~~R~~~~G~lLS~~RfLFia~~~~rF~~~viA 161 (344) T PRK10456 82 LNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNDKVVA 161 (344) T ss_pred CCCCCEEEEECCEEECCCCCCCCCCCEEEEEECCCCCCEEEEEEEECHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78784788822245305234852010158876167787036888757888167640478898999998717665344554 Q ss_pred E--------CCHHHHHHCC Q ss_conf 5--------0832543138 Q gi|254780989|r 130 D--------AKHDWHKHLG 140 (185) Q Consensus 130 ~--------~~~~~Y~r~G 140 (185) + |...||..+| T Consensus 162 EmRG~~D~~G~SPFW~~lg 180 (344) T PRK10456 162 EMRGVIDEHGYSPFWQSLG 180 (344) T ss_pred HHCCCCCCCCCCCHHHHHH T ss_conf 4158618999982278763 No 54 >TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST). Probab=97.94 E-value=8.5e-05 Score=49.18 Aligned_cols=127 Identities=15% Similarity=0.161 Sum_probs=75.2 Q ss_pred EECCCCHHHHHHHHHHHHHHCC--CCCCCCCCCEEHHHHHCHH---CC-CCCCCEEEEEEEE--CCEEEEEEEEEEE--- Q ss_conf 7513797578999999999727--8766675615202541200---02-5766548999989--8786479999998--- Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFL--STQEITDDVLHKYVRNNVE---EP-LSFVRLMSFVCRD--GDRIVGAIRATPI--- 83 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~--~~~~~~~~~~~~~~~~~~~---~~-~~~~~~l~~Vae~--d~~iVG~i~~~~~--- 83 (185) .||+++++|.+++.+|...+-. ++-+...+.+...++.... .. ........||.|+ .|+|||....... T Consensus 1 vvRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~~YlFVLEd~~~g~vvGts~I~a~vG~ 80 (336) T TIGR03244 1 IVRPVETSDLDALYQLAQSTGIGLTSLPANEDLLSARIERAEKTFSGELTRAEQGYLFVLEDTETGTVAGVSAIEAAVGL 80 (336) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEEEEEECCCC T ss_conf 94437422099999999973998655999999999999999998658788776318999987789939998746752258 Q ss_pred -------EE----------------------CCC-CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC---CCEEEEE Q ss_conf -------86----------------------067-84899867752546159989999999999999889---9799995 Q gi|254780989|r 84 -------QI----------------------GKY-KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG---AEFVLID 130 (185) Q Consensus 84 -------~~----------------------~~~-~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G---~~~v~l~ 130 (185) ++ ... ....|+.+.++|+||+.|.|+.|-+.-+--+...- .+.|+.+ T Consensus 81 ~~Pfy~yr~~~~~~~S~~L~~~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RfLFia~~~erF~~~viAE 160 (336) T TIGR03244 81 EEPFYNYRVGTVVHASKELGIYKALETLFLSNDLTGYSELCTLFLDPDYRKGGNGRLLSKSRFLFIAQFRERFSKKIIAE 160 (336) T ss_pred CCCCEEEEECCEEEECCCCCCCCCCCEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 98878899675566244579753111599873776872689999788982787235899999999987176512142202 Q ss_pred --------CCHHHHHHCCC Q ss_conf --------08325431385 Q gi|254780989|r 131 --------AKHDWHKHLGF 141 (185) Q Consensus 131 --------~~~~~Y~r~GF 141 (185) |...||..+|= T Consensus 161 mRG~~d~~G~SPFW~~lg~ 179 (336) T TIGR03244 161 MRGVSDEQGRSPFWNALGR 179 (336) T ss_pred HCCCCCCCCCCCCHHHHHC T ss_conf 0585489999831777624 No 55 >COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only] Probab=97.88 E-value=1.3e-05 Score=54.17 Aligned_cols=129 Identities=20% Similarity=0.252 Sum_probs=86.1 Q ss_pred CCCCCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEE--E Q ss_conf 72100020751379757899999999972787666756152025412000257665489999898786479999998--8 Q gi|254780989|r 7 VNIELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPI--Q 84 (185) Q Consensus 7 ~~~~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~--~ 84 (185) ++|++ .||.-...|..+|..|..+.-..-.+...+.+..++ .+ .|+|+.+|.+-|++.-.-. . T Consensus 4 vsmp~---~~~D~~apd~aavLaLNNeha~elswLe~erL~~l~----~e--------AF~ArR~G~l~afl~tFd~~a~ 68 (167) T COG3818 4 VSMPI---LIRDVRAPDLAAVLALNNEHALELSWLELERLYRLY----KE--------AFVARRDGNLAAFLVTFDSSAR 68 (167) T ss_pred CCCCE---EHHHHCCCCHHHHHHCCCHHHHHCCCCCHHHHHHHH----HH--------HHHHHHCCCHHHHEEECCCCCC T ss_conf 44312---335305874666772153532322624899999999----98--------8888632444440353366556 Q ss_pred E---------CCCC-EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECC--H------HHHHHCCCEEHHH Q ss_conf 6---------0678-4899867752546159989999999999999889979999508--3------2543138587052 Q gi|254780989|r 85 I---------GKYK-GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAK--H------DWHKHLGFKKPHQ 146 (185) Q Consensus 85 ~---------~~~~-~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~--~------~~Y~r~GF~~~~~ 146 (185) . +..+ -.++-.+.|....||+|+|++|.....+.++..|+..+..+.+ | .|...|||.+..+ T Consensus 69 ydSpNFlWFrErYe~F~YvDRvVVA~~aRGrG~aRalY~Dlf~~Ae~agy~~~tCEVn~DppnpasdaFHaalGF~eVG~ 148 (167) T COG3818 69 YDSPNFLWFRERYENFFYVDRVVVASRARGRGVARALYADLFSYAELAGYPYLTCEVNLDPPNPASDAFHAALGFHEVGQ 148 (167) T ss_pred CCCCCEEEHHHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHCCCEECCC T ss_conf 79986023565377427998999974134664389999999999986588569998607999827788766508647562 Q ss_pred CCCC Q ss_conf 7826 Q gi|254780989|r 147 HQIR 150 (185) Q Consensus 147 ~~i~ 150 (185) ..|+ T Consensus 149 a~ih 152 (167) T COG3818 149 ATIH 152 (167) T ss_pred EEEE T ss_conf 4786 No 56 >COG3053 CitC Citrate lyase synthetase [Energy production and conversion] Probab=97.85 E-value=0.00044 Score=44.87 Aligned_cols=108 Identities=12% Similarity=0.090 Sum_probs=71.3 Q ss_pred EECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHH Q ss_conf 898786479999998860678489986775254615998999999999999988997999950832---54313858705 Q gi|254780989|r 69 RDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPH 145 (185) Q Consensus 69 e~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~ 145 (185) .++++||++..+ -+-.|--|+|.|.+||.|+.-+|+.+.++.+-++|...+++++.|+ +++.+||.... T Consensus 43 ~~~~~iiacGsi--------aGnvikcvAvs~s~qGeGl~lkl~TeLin~ay~~g~~hLFiyTKp~~~~lFk~~GF~~i~ 114 (352) T COG3053 43 RDNEEIIACGSI--------AGNVIKCVAVSESLQGEGLALKLVTELINLAYERGRTHLFIYTKPEYAALFKQCGFSEIA 114 (352) T ss_pred CCCCCEEEECCC--------CCCEEEEEEECHHCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCEEEE T ss_conf 178868995340--------135668999642015630899999999999997498269999466679988738966864 Q ss_pred HCC-CCCCCCCC---CHHHEEEEECCCCCCCCCEEEEEECCCC Q ss_conf 278-26779888---8232265440579765770699806678 Q gi|254780989|r 146 QHQ-IRYAHGGG---AATDWLVHIFKNNIMENMIGKMSLRREY 184 (185) Q Consensus 146 ~~~-i~~~~~~~---~~~~~~~~~l~~~~~~~~~g~~~~~~~f 184 (185) .-. ...-..+. ...+.+-.......-+++.-+|-..++| T Consensus 115 ~~~~~ivlmENs~trl~~y~~~L~k~r~~gkkIgaIVMNANPF 157 (352) T COG3053 115 SAENVIVLMENSATRLKDYLSSLKKLRHPGKKIGAIVMNANPF 157 (352) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCC T ss_conf 2576489961684558999998777305777068999707986 No 57 >TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST). Probab=97.83 E-value=8.2e-05 Score=49.27 Aligned_cols=127 Identities=14% Similarity=0.128 Sum_probs=73.9 Q ss_pred EECCCCHHHHHHHHHHHHHHCCC--CCCCCCCCEEHHHHHCHH---CC-CCCCCEEEEEEEE--CCEEEEEEEEEEE--- Q ss_conf 75137975789999999997278--766675615202541200---02-5766548999989--8786479999998--- Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLS--TQEITDDVLHKYVRNNVE---EP-LSFVRLMSFVCRD--GDRIVGAIRATPI--- 83 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~--~~~~~~~~~~~~~~~~~~---~~-~~~~~~l~~Vae~--d~~iVG~i~~~~~--- 83 (185) .||+++++|++++.+|...+=.. +-+...+.+...++.... .. ........||.|+ .|+|||....... T Consensus 1 viRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~~YlFVLED~~tg~vvG~s~I~a~vG~ 80 (335) T TIGR03243 1 IVRPVRTSDLDALMQLARESGIGLTSLPADRAALGSRIARSEKSFAGESTRGEEGYLFVLEDTETGTVAGVSAIEAAVGL 80 (335) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCEEEEEEEEECCCC T ss_conf 94437323099999999973998655999999999999999998648778887348999987789918998746761458 Q ss_pred -------EE----------------------CCC-CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCC---CEEEE- Q ss_conf -------86----------------------067-848998677525461599899999999999998899---79999- Q gi|254780989|r 84 -------QI----------------------GKY-KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGA---EFVLI- 129 (185) Q Consensus 84 -------~~----------------------~~~-~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~---~~v~l- 129 (185) ++ ... ....|+.|.++|+||+.|.|+.|-..-+--+...-. +.|+. T Consensus 81 ~~Pfy~yr~~~~~h~S~~L~~~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RflFma~~~~rF~~~viAE 160 (335) T TIGR03243 81 DEPFYNYRVGTLVHASRELGVYNKIPTLTLSNDLTGSSELCTLFLDPDYRKGGNGRLLSRSRFLFIAAFRERFGDKIIAE 160 (335) T ss_pred CCCCEEEEECCEEEECCCCCCCCCCCEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88747999175565063468721211699762787870579999877873787245899999999987376643678854 Q ss_pred -------ECCHHHHHHCCC Q ss_conf -------508325431385 Q gi|254780989|r 130 -------DAKHDWHKHLGF 141 (185) Q Consensus 130 -------~~~~~~Y~r~GF 141 (185) .|...||..+|= T Consensus 161 mRG~~d~~G~SPFW~~lg~ 179 (335) T TIGR03243 161 MRGVSDEQGRSPFWEALGR 179 (335) T ss_pred CCCCCCCCCCCCCHHHHHC T ss_conf 1787389999830787634 No 58 >TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit. Probab=97.70 E-value=0.0003 Score=45.90 Aligned_cols=126 Identities=11% Similarity=0.134 Sum_probs=74.5 Q ss_pred EECCCCHHHHHHHHHHHHHHCC--CCCCCCCCCEEHHHHHCHHC---CCC--CCCEEEEEEEE--CCEEEEEEEEEEE-- Q ss_conf 7513797578999999999727--87666756152025412000---257--66548999989--8786479999998-- Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFL--STQEITDDVLHKYVRNNVEE---PLS--FVRLMSFVCRD--GDRIVGAIRATPI-- 83 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~--~~~~~~~~~~~~~~~~~~~~---~~~--~~~~l~~Vae~--d~~iVG~i~~~~~-- 83 (185) .||+++++|.+++.+|...+-. ++-+...+.+...++..... ... ....+.||.|+ .|+|||....... T Consensus 1 vvRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~e~YlFVLEd~~tg~vvG~s~I~a~vG 80 (336) T TIGR03245 1 IVRPSRFADLPAIERLANESAIGVTSLPADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIVASAG 80 (336) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCEEEEEEEEEEEEC T ss_conf 94447411199999999973999655999999999999999998635567888740899998568891899874786104 Q ss_pred --------EE----------------------CCC-CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC---CCEEEE Q ss_conf --------86----------------------067-84899867752546159989999999999999889---979999 Q gi|254780989|r 84 --------QI----------------------GKY-KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG---AEFVLI 129 (185) Q Consensus 84 --------~~----------------------~~~-~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G---~~~v~l 129 (185) ++ .+. ....|+.+.++|+||+.|.|+.|-..-+--+.+.- .+.|+. T Consensus 81 ~~~Pfy~yr~~~~vh~S~~L~~~~~~~~L~l~~d~tG~sEl~tLfL~p~~R~~~~G~LLS~~RfLFmA~~~~rF~~~viA 160 (336) T TIGR03245 81 YGEPFYSYRNDTLIHASRELKVNNKIHVLYMCHELTGSSLLCSFYVDPRLRKTEAAELLSRARLLFMAAHRERFQSRIIV 160 (336) T ss_pred CCCCCEEEEECCEEEECCCCCCCCCCEEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHCHHHEE T ss_conf 89872899926467525035974230059986168887057999978787478605689999999998717661203103 Q ss_pred E--------CCHHHHHHCC Q ss_conf 5--------0832543138 Q gi|254780989|r 130 D--------AKHDWHKHLG 140 (185) Q Consensus 130 ~--------~~~~~Y~r~G 140 (185) + |...||..+| T Consensus 161 EmRG~~d~~G~SPFW~~lg 179 (336) T TIGR03245 161 EIQGVQDDNGDSPFWDAIG 179 (336) T ss_pred HHCCCCCCCCCCCCHHHHH T ss_conf 0058548999983078763 No 59 >COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics] Probab=97.50 E-value=0.00012 Score=48.35 Aligned_cols=78 Identities=21% Similarity=0.375 Sum_probs=53.9 Q ss_pred ECCEEEEEEEEEEEEE-------CCCCEE-----EEE---EEEE-CHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE--- Q ss_conf 9878647999999886-------067848-----998---6775-25461599899999999999998899799995--- Q gi|254780989|r 70 DGDRIVGAIRATPIQI-------GKYKGF-----LRG---PLGV-LSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID--- 130 (185) Q Consensus 70 ~d~~iVG~i~~~~~~~-------~~~~~~-----~l~---~laV-~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~--- 130 (185) +.+-++|++.+-...- .+...+ ..+ |++- .-.||.+|+|+.||+++.+.|++.|...+.+. T Consensus 414 ~~d~lig~lrlR~p~e~~~r~e~~~~~aivrelhvyg~~vpig~~~~~~QH~G~G~~L~~~AE~ia~ee~~~ki~viSgi 493 (515) T COG1243 414 KNDILIGFLRLREPSEGAHREEIDDKTAIVRELHVYGSEVPIGKREDEWQHRGYGRELLEEAERIAREEGAKKILVISGI 493 (515) T ss_pred HHHHHHHEEEECCCCCCHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 23113312353244530010102663135543202021025666743010054789999999999876135617998333 Q ss_pred CCHHHHHHCCCEEHHHC Q ss_conf 08325431385870527 Q gi|254780989|r 131 AKHDWHKHLGFKKPHQH 147 (185) Q Consensus 131 ~~~~~Y~r~GF~~~~~~ 147 (185) |--+||.||||....-| T Consensus 494 G~ReYy~k~GY~~~gpY 510 (515) T COG1243 494 GVREYYRKLGYELDGPY 510 (515) T ss_pred CHHHHHHHHCCCCCCCC T ss_conf 28999997386334772 No 60 >pfam00765 Autoind_synth Autoinducer synthetase. Probab=97.42 E-value=0.006 Score=38.06 Aligned_cols=124 Identities=10% Similarity=-0.019 Sum_probs=84.2 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEE----------- Q ss_conf 137975789999999997278766675615202541200025766548999989878647999999886----------- Q gi|254780989|r 17 ANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQI----------- 85 (185) Q Consensus 17 R~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~----------- 85 (185) ....++...+..++-.+.|-+.-...-.... ..-.+..+......+++.++|+++|++++.|.+- T Consensus 3 ~~~p~~~l~em~rlR~~vFvdrlgW~v~~~~----g~E~DqyD~~~~~Yll~~~~g~v~g~~RLlpTt~p~ML~d~Fp~l 78 (182) T pfam00765 3 NLLPETKLKELFRLRKKTFKDRLDWDVNCSN----GMEFDEYDNLNTTYLLGIKDGRVVCSVRFIETKYPNMITGTFSSY 78 (182) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC----CCCCCCCCCCCCEEEEEEECCEEEEEECCCCCCCCCHHHCCCHHH T ss_conf 4267899999999999997462698787899----977557899998699999899899998137889955242101766 Q ss_pred -------CCCCEEEEEEEEECHHHHCC-C-----HHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEH Q ss_conf -------06784899867752546159-9-----8999999999999988997999950832---5431385870 Q gi|254780989|r 86 -------GKYKGFLRGPLGVLSEYRKR-G-----IGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKP 144 (185) Q Consensus 86 -------~~~~~~~l~~laV~P~~qg~-G-----iG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~ 144 (185) ...+.+.+..++|+++..+. + +...|+...++.+...|.+.++.+.++. .++|+|+... T Consensus 79 l~~~~~P~~~~iwE~SRf~Vd~~~~~~~~~~~~~~~~~L~~~~~~~a~~~Gi~~~v~V~~~~~~rilrr~G~~~~ 153 (182) T pfam00765 79 FDDISLPESGNYIESSRFFVDKARARGLLGNLAPISLMLFLSMINYARASGYDGIYTIVSHPMLRILKRSGWPIS 153 (182) T ss_pred CCCCCCCCCCCEEEECEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCEE T ss_conf 289999999998997403796755544444527799999999999999879988999976899999998699559 No 61 >COG3375 Uncharacterized conserved protein [Function unknown] Probab=97.36 E-value=0.0024 Score=40.48 Aligned_cols=121 Identities=17% Similarity=0.123 Sum_probs=80.5 Q ss_pred EEC-CCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEEC-CEEEEEEEEEEEEECCCCEEE Q ss_conf 751-379757899999999972787666756152025412000257665489999898-786479999998860678489 Q gi|254780989|r 15 TFA-NEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDG-DRIVGAIRATPIQIGKYKGFL 92 (185) Q Consensus 15 ~IR-~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d-~~iVG~i~~~~~~~~~~~~~~ 92 (185) .+| ---|++..+..++..+|++... .+....+..+ .+ .....+.+-|..+ +++||...-+|..-++.-..+ T Consensus 4 vvrrl~dp~el~~~~dV~~~aWg~~d--~~~~~~d~i~-al----~~~GGlvlgAf~~dg~lVGls~G~pg~r~g~~y~y 76 (266) T COG3375 4 VVRRLTDPAELDEAEDVQASAWGSED--RDGAPADTIR-AL----RYHGGLVLGAFSADGRLVGLSYGYPGGRGGSLYLY 76 (266) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCC--CCCCHHHHHH-HH----HHCCCEEEEEECCCCCEEEEEECCCCCCCCCEEEE T ss_conf 68753798998888889998717653--3544388999-99----86188699987389837878852577678755655 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH-------HHHHCCCE Q ss_conf 986775254615998999999999999988997999950832-------54313858 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD-------WHKHLGFK 142 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~-------~Y~r~GF~ 142 (185) -..++|+|+++++|+|-+|=..--+++.++|++.+-=+=||- ...|+|=. T Consensus 77 SH~~gV~e~~k~sglg~aLK~~Qre~a~~~G~tli~WTfDPl~alNA~fNi~KLGa~ 133 (266) T COG3375 77 SHMLGVREEVKGSGLGVALKMKQRERALSMGYTLIAWTFDPLNALNARFNISKLGAI 133 (266) T ss_pred EEEHHCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHCCHHHHCEE T ss_conf 300214501034665565577778899865803678760654354431135551336 No 62 >COG3882 FkbH Predicted enzyme involved in methoxymalonyl-ACP biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=97.22 E-value=0.0023 Score=40.60 Aligned_cols=131 Identities=15% Similarity=0.150 Sum_probs=85.6 Q ss_pred CEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEE--E---CCEEEEEEEEEEEEECC Q ss_conf 207513797578999999999727876667561520254120002576654899998--9---87864799999988606 Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCR--D---GDRIVGAIRATPIQIGK 87 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae--~---d~~iVG~i~~~~~~~~~ 87 (185) -|+++.....+.+.|.+|.+..=--. -...++.+...++.......+.|-.. + |+.+||.+.+.+ . T Consensus 413 ~l~vs~~de~~i~RIsQLtqkTNQFn-----lTtkRy~e~dV~~~~~~~~~li~sv~l~DKfgDnGiigvviv~k----k 483 (574) T COG3882 413 RLTVSKFDEVNIPRISQLTQKTNQFN-----LTTKRYNEEDVRQMQEDPNFLIFSVSLKDKFGDNGIIGVVIVEK----K 483 (574) T ss_pred EEEEEECCCCCCHHHHHHHHHCCCEE-----ECHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCEEEEEEEEE----C T ss_conf 99984145558488999862043005-----22544148889987318771799998612346785589999993----1 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--------HHHHHCCCEEHHHCCCCCC Q ss_conf 7848998677525461599899999999999998899799995083--------2543138587052782677 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH--------DWHKHLGFKKPHQHQIRYA 152 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~--------~~Y~r~GF~~~~~~~i~~~ 152 (185) ...+.+-.+...=.--||++-.+||...++.|++.|...+..+=.| +||+++||+-..+.+..+. T Consensus 484 ~~~w~IDt~lmSCRVlgRkvE~~l~~~~~e~A~~~gi~tir~~Y~pt~kN~pv~~FyE~mgf~l~~eng~~f~ 556 (574) T COG3882 484 ESEWFIDTFLMSCRVLGRKVEQRLMNSLEEQALSEGINTIRGYYIPTEKNAPVSDFYERMGFKLKGENGNKFY 556 (574) T ss_pred CCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHCHHHCCCCCCCCC T ss_conf 8727767889989998667899999999999986463222358646435783899998735000135687534 No 63 >PRK13688 hypothetical protein; Provisional Probab=97.10 E-value=0.00031 Score=45.83 Aligned_cols=91 Identities=22% Similarity=0.188 Sum_probs=57.6 Q ss_pred HHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEE------E-CCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHC Q ss_conf 54120002576654899998987864799999988------6-0678489986775254615998999999999999988 Q gi|254780989|r 50 VRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQ------I-GKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNS 122 (185) Q Consensus 50 ~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~------~-~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~ 122 (185) ++.++.+..+..+ ++-.-.++++|+-..+.... + ...+-+.|+.|-|+|.||++|+|++|+ +.||+. T Consensus 34 L~~niIen~SeSP--FYGIy~G~kLvARMsLy~~~~~~~~yF~~~~dYleLWKLEVLp~yQ~~GyG~~LV----dfAKsf 107 (157) T PRK13688 34 LQGNMIENDSESP--FYGIYYGDSLVARMSLYKKGDVEEPYFPIFGDYLELWKLEVLPKYQNRGYGEMLV----DFAKSF 107 (157) T ss_pred HHHHHHHCCCCCC--CEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHH----HHHHHC T ss_conf 8752320678998--2258876408888888860786545679841576764421062202366579999----999876 Q ss_pred CCCEEEE--ECCHHHHHHCCCEEHHH Q ss_conf 9979999--50832543138587052 Q gi|254780989|r 123 GAEFVLI--DAKHDWHKHLGFKKPHQ 146 (185) Q Consensus 123 G~~~v~l--~~~~~~Y~r~GF~~~~~ 146 (185) |.+.-.. ....+|+.|+||.+... T Consensus 108 ~lPIKt~aR~~S~~Fw~k~gF~~v~~ 133 (157) T PRK13688 108 NMPIKTIARNDSKDFWDKLGFTPVEA 133 (157) T ss_pred CCCEEECCCCCHHHHHHHCCCEECCC T ss_conf 99704101214077999749954223 No 64 >COG0454 WecD Histone acetyltransferase HPA2 and related acetyltransferases [Transcription / General function prediction only] Probab=97.08 E-value=0.00043 Score=44.94 Aligned_cols=44 Identities=34% Similarity=0.436 Sum_probs=36.6 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCC Q ss_conf 7752546159989999999999999889979999508325431385 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGF 141 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF 141 (185) ++|+|++||+|+|++|++++++.++..|.. .......+|.++|| T Consensus 87 l~v~~~~rg~Gig~~Ll~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 130 (156) T COG0454 87 LYVLPEYRGKGIGSALLEAALEWARKRGIS--LNRLALEVYEKNGF 130 (156) T ss_pred EEECHHHHCCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHCC T ss_conf 788698855892899999999999980950--67888989987510 No 65 >pfam02474 NodA Nodulation protein A (NodA). Rhizobia nodulation (nod) genes control the biosynthesis of Nod factors required for infection and nodulation of their legume hosts. Nodulation protein A (NodA) is a N-acetyltransferase involved in production of Nod factors that stimulate mitosis in various plant protoplasts. Probab=96.96 E-value=0.0031 Score=39.79 Aligned_cols=115 Identities=15% Similarity=0.192 Sum_probs=85.0 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEE--EEECCCCE--EEEEE Q ss_conf 975789999999997278766675615202541200025766548999989878647999999--88606784--89986 Q gi|254780989|r 20 QRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATP--IQIGKYKG--FLRGP 95 (185) Q Consensus 20 ~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~--~~~~~~~~--~~l~~ 95 (185) +.+|..++.+..+.+|+..-.+.+..+. .-+.|..-.+++-.++.+...|++|+.+-+ +.++..+. ..|+- T Consensus 16 ~l~dH~eLaeFfrktYgptgafnakpFe-----G~RSWAGARPElR~IgyD~~GVaAH~G~LRRFIkVg~vDlLVaELGL 90 (196) T pfam02474 16 QLADHIELSEFFRKTYGPTGAFNAKPFE-----GGRSWAGARPELRAIAYDSRGVAAHMGLLRRFIKVGEVDLLVAELGL 90 (196) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCC-----CCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHEECCEEEEEEEEEE T ss_conf 5111699999999842887643466567-----77564557851378876573088988888775124761013443001 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCC Q ss_conf 775254615998999999999999988997999950832---543138 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLG 140 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~G 140 (185) .+|+|+.+|.||+.. |+...-.+++.|+.+.+-...+. .++||| T Consensus 91 ygVRpDlEGlGi~~s-~~~m~PvLq~LgVPF~FGtVR~al~~Hv~R~~ 137 (196) T pfam02474 91 YGVRPDLEGLGISHS-MRVMYPVLQELGVPFAFGTVRHALRKHVERLC 137 (196) T ss_pred EEECCCCCCCCHHHH-HHHHHHHHHHCCCCEECCCCHHHHHHHHHHHH T ss_conf 476565133534677-98998888862997452352598999999982 No 66 >PRK00756 acyltransferase NodA; Provisional Probab=96.84 E-value=0.0053 Score=38.36 Aligned_cols=115 Identities=14% Similarity=0.170 Sum_probs=85.5 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEE--EEECCCCE--EEEEE Q ss_conf 975789999999997278766675615202541200025766548999989878647999999--88606784--89986 Q gi|254780989|r 20 QRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATP--IQIGKYKG--FLRGP 95 (185) Q Consensus 20 ~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~--~~~~~~~~--~~l~~ 95 (185) +.+|..++.+..+.+|++.-.+.+..+. .-+.|....+++-.++.+...|++|+.+-+ +.++..+. ..|+- T Consensus 16 ~l~dH~eLaeFfrktYgptGafnakpFe-----G~RSWAGARPE~R~IgyD~~GVaAH~G~LRRFIkVg~vdlLVaElGL 90 (196) T PRK00756 16 ELSDHAELAEFFRKTYGPTGAFNAKPFE-----GGRSWAGARPELRAIAYDSHGVAAHMGLLRRFIKVGEVDLLVAELGL 90 (196) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCC-----CCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHEEECCEEEEEEEEEE T ss_conf 5012799999999832887642464567-----77565667873378865673088887777530255664057773024 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCC Q ss_conf 775254615998999999999999988997999950832---543138 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLG 140 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~G 140 (185) .+|.|+.+|.||+.. |+...-.+++.|+.+.+-...+. .++||+ T Consensus 91 ygVRpDlEGlGi~~s-~~~m~PvLq~LgVPF~FGtVR~al~~Hv~R~~ 137 (196) T PRK00756 91 YGVRPDLEGLGIAHS-IRAMYPVLQELGVPFAFGTVRHALRNHVERLC 137 (196) T ss_pred EEECCCCCCCCHHHH-HHHHHHHHHHCCCCEECCCCHHHHHHHHHHHH T ss_conf 787577443551355-88887888863997342342598999999997 No 67 >KOG3698 consensus Probab=96.62 E-value=0.031 Score=33.78 Aligned_cols=140 Identities=16% Similarity=0.174 Sum_probs=86.1 Q ss_pred CCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCC-EEHHHHHCHH-CCCCCCCEEEEEEEE-CCEEEEEE Q ss_conf 78798721000207513797578999999999727876667561-5202541200-025766548999989-87864799 Q gi|254780989|r 2 PKQPNVNIELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDV-LHKYVRNNVE-EPLSFVRLMSFVCRD-GDRIVGAI 78 (185) Q Consensus 2 ~~~~~~~~~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~l~~Vae~-d~~iVG~i 78 (185) |.+|-.+. .+||+.+++|...|..+-+..+.++.+..+.. ..+...+++. -..+..+.+|+|+++ +++||||+ T Consensus 672 p~~Pt~~~----y~iRPy~~~De~~v~~~ct~my~d~g~~lpf~n~pn~~~d~liggllsls~~lC~v~~de~~~i~gYa 747 (891) T KOG3698 672 PFLPTCMF----YDIRPYTIADEEYVSGMCTVMYTDNGELLPFRNAPNFADDNLIGGLLSLSEHLCEVVDDEGHKIVGYA 747 (891) T ss_pred CCCCCCEE----EEECCCCCCCHHHHHHHHHHEECCCCEECCCCCCCCCCCCCCHHHEECCCHHHEEEEECCCCCEEEEE T ss_conf 77863105----76216766458999756661342586033578887500135422112257556365524799666776 Q ss_pred EEEE--------EEE-------CCC-----------------------------CEEE-------EE------EEEECHH Q ss_conf 9999--------886-------067-----------------------------8489-------98------6775254 Q gi|254780989|r 79 RATP--------IQI-------GKY-----------------------------KGFL-------RG------PLGVLSE 101 (185) Q Consensus 79 ~~~~--------~~~-------~~~-----------------------------~~~~-------l~------~laV~P~ 101 (185) ...- ..+ ++. ++++ ++ ..+.+-+ T Consensus 748 ~a~~Dvt~F~rn~~i~w~~~l~EKY~~~i~p~~~g~~~~~~~e~i~~S~h~~~~~~~~~~~P~~~~~nfPa~v~~~~~~~ 827 (891) T KOG3698 748 SAHFDVTLFSRNFLITWKEKLKEKYRGLIEPIGSGKLTDEYIEFIQNSQHPMDIEEWYPKIPDQIFENFPAWVETYFGMD 827 (891) T ss_pred EEECCCCHHHHCEEEEEHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHCCCCCCHHHHHCCHHHHHHCCCCC T ss_conf 42001221000315654798899864455666786007999999987058430000131380888703708773213534 Q ss_pred HHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH------HHHHCCCEEHH Q ss_conf 615998999999999999988997999950832------54313858705 Q gi|254780989|r 102 YRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD------WHKHLGFKKPH 145 (185) Q Consensus 102 ~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~------~Y~r~GF~~~~ 145 (185) .---|+.+++|+-.+..++..|....++...++ ||.++||.-.. T Consensus 828 a~D~~~~k~m~~vll~tL~aNGsrGaf~~V~~dD~~~~~fys~lG~~d~~ 877 (891) T KOG3698 828 ASDAHPMKKMIQVLLVTLAANGSRGAFLTVAIDDIERQKFYSELGLTDLG 877 (891) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHCHHHHH T ss_conf 45650789999999999873587511587551378999999871558776 No 68 >pfam08444 Gly_acyl_tr_C Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region. This family features the C-terminal region of several mammalian specific aralkyl acyl-CoA:amino acid N-acyltransferase (glycine N-acyltransferase) proteins EC:2.3.1.13. Probab=96.44 E-value=0.015 Score=35.67 Aligned_cols=68 Identities=18% Similarity=0.112 Sum_probs=49.0 Q ss_pred ECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH-----HHHHHCCCEEH Q ss_conf 9878647999999886067848998677525461599899999999999998899799995083-----25431385870 Q gi|254780989|r 70 DGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH-----DWHKHLGFKKP 144 (185) Q Consensus 70 ~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~-----~~Y~r~GF~~~ 144 (185) .+|..|..+.. + ....++.-++.|+|||||+.+.++.+.++.+.++|+..-.-+.+. ..-+++||... T Consensus 6 peG~PVSW~Lm-----d--qtge~rmgyTlPeyR~kG~~~~~~~~~~~~L~~~g~P~Y~hv~~~N~~~~k~~~~lg~~~~ 78 (89) T pfam08444 6 PEGTPVSWSLM-----D--QTGELRMAGTLPKYRRQGLMSHVIYHQAQYLEKLGFPVYSHVDKANEISQKMSGNLGHVPM 78 (89) T ss_pred CCCCEEEEEEE-----C--CCCEEEECCCCHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEEHHCCHHHHHHHHHCCCEEC T ss_conf 99987479985-----3--7421100133777875586999999999999867999468641157889999997798257 No 69 >pfam09390 DUF1999 Protein of unknown function (DUF1999). This family contains a putative Fe-S binding reductase whose structure adopts an alpha and beta fold. Probab=96.42 E-value=0.015 Score=35.61 Aligned_cols=135 Identities=10% Similarity=-0.008 Sum_probs=86.7 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCC-----CCCCCEEEEEEEE-CCEEEEEEEEEEEEECC Q ss_conf 07513797578999999999727876667561520254120002-----5766548999989-87864799999988606 Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEP-----LSFVRLMSFVCRD-GDRIVGAIRATPIQIGK 87 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~l~~Vae~-d~~iVG~i~~~~~~~~~ 87 (185) |.+|+.+..|...+.++-.+.-....+..+.....-.+..+..+ ....+..+|||++ ++.+.||+....+.-+. T Consensus 1 MryR~F~e~D~~~lqaldla~qr~~~P~yd~lPere~~grl~~SL~alrfyeRsgHSF~Ae~e~e~~~Gf~lAQavWqGd 80 (161) T pfam09390 1 MRYRPFTEPDLERLQALDLAEQRRTAPEYDVLPEREQAGRLSVSLGALRFYERSGHSFVAEGEGEEPMGFALAQAVWQGD 80 (161) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCCEEEEEEHHHHHCCC T ss_conf 97555675009999987888742479633446114541645554245543431476035446887645564313565488 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHH---HHHCCCEEHHHCCC Q ss_conf 784899867752546159989999999999999889979999508325---43138587052782 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDW---HKHLGFKKPHQHQI 149 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~---Y~r~GF~~~~~~~i 149 (185) .....+..|..+| -+..+.-+-|++...+.+-..++=.|-+.-+|+. -+.-||.-...+.+ T Consensus 81 rptVl~~~ivl~~-~~~e~~~~gLL~AvvKSAYDaaVYEvhlpl~P~L~~Aa~~e~a~l~g~yaV 144 (161) T pfam09390 81 RPTVLVRAIVLDE-GRSEDALRGLLRAVVKSAYDAAVYEVHLPLDPELEAAARAEEAALTGVYAV 144 (161) T ss_pred CCEEEEEEEECCC-CCCHHHHHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHCCCCCCCEEEE T ss_conf 7359999874278-986899999999999875212257874258989999998502343653799 No 70 >COG3138 AstA Arginine/ornithine N-succinyltransferase beta subunit [Amino acid transport and metabolism] Probab=96.41 E-value=0.0068 Score=37.72 Aligned_cols=101 Identities=19% Similarity=0.155 Sum_probs=60.2 Q ss_pred EEECCCCHHHHHHHHHHHHHHCCC--CCCCCCCCEEHHHHHCHHC----CCCCCCEEEEEEEE--CCEEEEEEEEEE--- Q ss_conf 075137975789999999997278--7666756152025412000----25766548999989--878647999999--- Q gi|254780989|r 14 LTFANEQRVDQVAVDEVMVKAFLS--TQEITDDVLHKYVRNNVEE----PLSFVRLMSFVCRD--GDRIVGAIRATP--- 82 (185) Q Consensus 14 l~IR~a~~~D~~aI~~L~~~aF~~--~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~Vae~--d~~iVG~i~~~~--- 82 (185) |.+|+++.+|++++.+|...+=.. +-+..+..+...++..... .........||.|+ .|+++|.....- T Consensus 2 lvvRP~~~aDl~al~~LA~~sg~G~TsLP~de~~L~~Ri~~se~sf~~~~~~ge~~Y~fVLEDsetG~VvG~saI~a~vG 81 (336) T COG3138 2 LVVRPVERADLEALMELAVKTGVGLTSLPADEATLRARIERSEKSFQGELPPGEAGYLFVLEDSETGTVVGISAIEAAVG 81 (336) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCEEEEEEEEEEECC T ss_conf 61322320378999999986599853587778999999998888773556787762899998457772876777887403 Q ss_pred ---------E-EE-------------------CCC-CEEEEEEEEECHHHHCCCHHHHHHHH Q ss_conf ---------8-86-------------------067-84899867752546159989999999 Q gi|254780989|r 83 ---------I-QI-------------------GKY-KGFLRGPLGVLSEYRKRGIGSRLAYM 114 (185) Q Consensus 83 ---------~-~~-------------------~~~-~~~~l~~laV~P~~qg~GiG~~Li~~ 114 (185) + ++ ++. ....+..+.++|++|+-|.|+-|-+. T Consensus 82 l~~PfYsyRv~tlvhaS~~L~v~~~i~~L~L~Nd~TG~SEl~sLFl~pd~Rkg~nG~Llsr~ 143 (336) T COG3138 82 LNDPFYSYRVGTLVHASPELNVYNEIPTLFLSNDLTGNSELCTLFLDPDWRKGGNGRLLSKS 143 (336) T ss_pred CCCCCCEEEEEEEEECCCCCCCCCCCEEEEEECCCCCCHHHHHEEECHHHHCCCCHHHHHHH T ss_conf 57765111453335327433641021179985266684245411506767335450024430 No 71 >pfam01233 NMT Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. Probab=96.18 E-value=0.11 Score=30.45 Aligned_cols=111 Identities=12% Similarity=0.059 Sum_probs=72.2 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCC-CCCCEEHHHHHCHHCCCCCCCEEEEE--EEECCEEEEEEEEEEEEECC----CCEE Q ss_conf 79757899999999972787666-75615202541200025766548999--98987864799999988606----7848 Q gi|254780989|r 19 EQRVDQVAVDEVMVKAFLSTQEI-TDDVLHKYVRNNVEEPLSFVRLMSFV--CRDGDRIVGAIRATPIQIGK----YKGF 91 (185) Q Consensus 19 a~~~D~~aI~~L~~~aF~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~V--ae~d~~iVG~i~~~~~~~~~----~~~~ 91 (185) ..+.+..++..++.+-|-.+... ....+..-+.....+...+.+..... ....+++||||...|+.+.- .... T Consensus 32 ~d~~~l~Evy~lL~~nYVEDdd~~FRF~YS~efL~Wal~~Pg~~~~whvGVR~~~s~kLvgfIs~iP~~irv~~~~~~~~ 111 (162) T pfam01233 32 DDEEQLKELYVLLNENYVEDDDAMFRFNYSPEFLNWALKPPGWRPDWHCGVRVKSTGKLVAFISAIPATIRVRDKTVKMV 111 (162) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHCCCCCCCCEEEEEEECCCCEEEEEEECCEEEEEEECCEEEEE T ss_conf 99899999999998615148776598867999998763498988645899998779909999704148999926484578 Q ss_pred EEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCC-CEEEE Q ss_conf 998677525461599899999999999998899-79999 Q gi|254780989|r 92 LRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGA-EFVLI 129 (185) Q Consensus 92 ~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~-~~v~l 129 (185) .+.-|.||..+|.++++--|+.+.-+++...|. .++.. T Consensus 112 eINFLCVhkklRskrLAPvLIkEItRRvn~~gIwQAvYT 150 (162) T pfam01233 112 EINFLCVHKKLRSKRLAPVLIKEITRRVNLTGIWQALYT 150 (162) T ss_pred EEEEEEECHHHHHCCCCHHHHHHHHHHHHCCCEEEEEEE T ss_conf 899988704455357743899998887200675699986 No 72 >TIGR01686 FkbH FkbH domain; InterPro: IPR010037 This entry describes a domain of unknown function. One of proteins with this domain is a modular polyketide synthase, which is 4800 amino acids in length from Streptomyces avermilitis, where the domain is the C-terminal segment. By contrast, the FkbH protein from Streptomyces hygroscopicus (Q84G18 from SWISSPROT) apparently contains only this domain. The remaining members of the family all contain an additional N-terminal domain of between 200 and 275 amino acids, which show less than 20 0dentity to one another. It seems likely then that these proteins are involved in disparate functions, probably the biosynthesis of different natural products. For instance, the FkbH gene is found in a gene cluster believed to be responsible for the biosynthesis of unusual 'PKS extender units' in the ascomycin pathway . This domain is composed of two parts, the first of which is a member of subfamily IIIC of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this domain. The C-terminal portion of this domain is unique to this family (by BLAST).. Probab=95.94 E-value=0.017 Score=35.25 Aligned_cols=119 Identities=18% Similarity=0.213 Sum_probs=81.7 Q ss_pred CEEECCCCHHHHHHHHHHHHHH--CCCCCCCCCCCEEHHHHHCHHCCCCCC-CEEEEEEE--E---CCEEEEEEEEEEEE Q ss_conf 2075137975789999999997--278766675615202541200025766-54899998--9---87864799999988 Q gi|254780989|r 13 GLTFANEQRVDQVAVDEVMVKA--FLSTQEITDDVLHKYVRNNVEEPLSFV-RLMSFVCR--D---GDRIVGAIRATPIQ 84 (185) Q Consensus 13 ~l~IR~a~~~D~~aI~~L~~~a--F~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~Vae--~---d~~iVG~i~~~~~~ 84 (185) .|.|+....+|...|.+|..+. |--. ..++-...+.++.... ....|++- | |.-+||++.+.+ T Consensus 197 ~l~~~~~~~~~v~Rv~~L~~rTnQf~~~-------~~~~~~~dl~~~~~~~s~~~v~~~~~~DrfgD~G~vG~~~~~~-- 267 (337) T TIGR01686 197 SLDISKLKEEEVQRVEELVRRTNQFNLT-------AERLDEEDLKQTMEEESAEEVLTVAMSDRFGDSGLVGILLLEK-- 267 (337) T ss_pred EEEHHHCCCCCCHHHHHHHHHHHHHHHH-------HCCCCHHHHHHHHCCCCCEEEEEEEEECCCCCCCCEEEEEEEC-- T ss_conf 5421013621360488873113323121-------0136777899984489853899998641458876089999852-- Q ss_pred ECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHH---HCCCCEEEEECCH--------HHHHHCCCE Q ss_conf 606784899867752546159989999999999999---8899799995083--------254313858 Q gi|254780989|r 85 IGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIK---NSGAEFVLIDAKH--------DWHKHLGFK 142 (185) Q Consensus 85 ~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~---~~G~~~v~l~~~~--------~~Y~r~GF~ 142 (185) +...+.|..+.+.=.-=|+|+++++++.+.++|+ +.|...+.+.-.+ .||+..||. T Consensus 268 --~e~~l~i~~l~~SCR~l~r~~e~~~L~~l~d~A~hff~~g~~~~~~~y~~T~rN~~~~~~~~~~g~~ 334 (337) T TIGR01686 268 --KEGQLVIKDLVMSCRVLGRGVEERLLRWLVDQAKHFFELGAHKVRLIYRRTERNVPVESFYEELGFE 334 (337) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHCCC T ss_conf --8751528767664014101167999999999889999841354031006898776899999985312 No 73 >KOG2036 consensus Probab=95.93 E-value=0.022 Score=34.67 Aligned_cols=82 Identities=20% Similarity=0.157 Sum_probs=57.2 Q ss_pred CEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEE----------------------------------EEE---- Q ss_conf 848998677525461599899999999999998899799----------------------------------995---- Q gi|254780989|r 89 KGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFV----------------------------------LID---- 130 (185) Q Consensus 89 ~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v----------------------------------~l~---- 130 (185) .+.-|..+||||+||+.|+|++-++-..+...-++...- +|. T Consensus 613 ~GaRIVRIAvhP~y~~MGYGsrAvqLL~~y~eG~~~~i~e~~~~~~~~~k~v~e~~~vsllee~i~pR~~lppLL~~L~e 692 (1011) T KOG2036 613 SGARIVRIAVHPEYQKMGYGSRAVQLLTDYFEGKFTSISEDVLAVDHSIKRVEEAEKVSLLEEQIKPRKDLPPLLLKLSE 692 (1011) T ss_pred CCCEEEEEEECCCHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEECCC T ss_conf 57317999856103113740889999999971558774544345675442034155445666533654579861557023 Q ss_pred -------------CC-H---HHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECCCCCC Q ss_conf -------------08-3---25431385870527826779888823226544057976 Q gi|254780989|r 131 -------------AK-H---DWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFKNNIM 171 (185) Q Consensus 131 -------------~~-~---~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~~~~~ 171 (185) |- + .|++|-||.+..-.|.... -..+-.|.|++.|+.+.. T Consensus 693 r~perldylGvSfGLT~~L~kFWk~~gF~PvylrQt~n~-lTGEHtcimLk~L~~~e~ 749 (1011) T KOG2036 693 RPPERLDYLGVSFGLTPSLLKFWKKNGFVPVYLRQTSND-LTGEHTCIMLKTLEGDES 749 (1011) T ss_pred CCCCCCCEEEECCCCCHHHHHHHHHCCCEEEEEECCCCC-CCCCEEEEEEECCCCCCC T ss_conf 897655323103367889999998669604776306540-246445899825887654 No 74 >PHA01807 hypothetical protein Probab=95.89 E-value=0.071 Score=31.56 Aligned_cols=66 Identities=12% Similarity=0.063 Sum_probs=50.1 Q ss_pred CEEEEEEEECCEEEEEEEEEEEEECCCC----EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 5489999898786479999998860678----48998677525461599899999999999998899799995 Q gi|254780989|r 62 RLMSFVCRDGDRIVGAIRATPIQIGKYK----GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID 130 (185) Q Consensus 62 ~~l~~Vae~d~~iVG~i~~~~~~~~~~~----~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~ 130 (185) ....+.+..||+++|++... .++.. .+....-+|.|+|+.-|+...+++.+++.|..-.++.+.-. T Consensus 49 dr~e~Lv~rdG~laG~Av~~---~dDD~HVG~cl~vQWqyVlPe~rnaGv~~~f~R~l~~lA~~~n~p~va~t 118 (145) T PHA01807 49 DRTELLVFRDGQLAGGAVIV---FDDDPHVGPCLTVQWQYVLPEYRNAGVAREFIRELHRLAGWGNIPLVAWT 118 (145) T ss_pred CCEEEEEEECCEEEEEEEEE---ECCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEE T ss_conf 72589999778863238998---45998743315788888523445664249999999997503687258987 No 75 >pfam02799 NMT_C Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. Probab=95.85 E-value=0.16 Score=29.53 Aligned_cols=118 Identities=14% Similarity=0.099 Sum_probs=72.2 Q ss_pred EECCCCHHHHHHHHHHHHHHCCC---CCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEE--EEEE-CC Q ss_conf 75137975789999999997278---766675615202541200025766548999989-87864799999--9886-06 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKAFLS---TQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRAT--PIQI-GK 87 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~aF~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~--~~~~-~~ 87 (185) -+|++++.|.++|.+|+.+-... .+....+.+ ++.+.- ...-..++|.|+ +++|-.++.|. |.++ +. T Consensus 30 GlR~M~~kDvp~V~~LL~~yl~~f~l~~~f~~eev----~Hw~lp--r~~VI~sYVve~~~g~ITDf~SFY~lpstvi~~ 103 (190) T pfam02799 30 GLRPMKKKDVPAVHKLLNKYLKRFDLAPVFSKEEV----AHWFLP--RENVVYSYVVENEDGKITDFFSFYSLPSTVIGN 103 (190) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCEEEECCHHHH----HHHCCC--CCCEEEEEEEECCCCCEEEEEEEEECCCCCCCC T ss_conf 86618555189999999999974870665689997----763126--778789999978999471489997554100389 Q ss_pred C-----CEEEE-EEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEE---EECCHHHHHHCCCEEH Q ss_conf 7-----84899-86775254615998999999999999988997999---9508325431385870 Q gi|254780989|r 88 Y-----KGFLR-GPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVL---IDAKHDWHKHLGFKKP 144 (185) Q Consensus 88 ~-----~~~~l-~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~---l~~~~~~Y~r~GF~~~ 144 (185) . ...++ .-++.. --=..||+.++=.||..|++..- +..++.|-+.+-|.+- T Consensus 104 ~~h~~l~aAY~fY~v~~~------~~l~~L~~Dali~Ak~~~fDVFNaLd~mdN~~fL~~lKFg~G 163 (190) T pfam02799 104 PKHKTLKAAYSFYYASTS------TRLKQLMNDALILAKNLGFDVFNALDLMDNKLFLEDLKFGPG 163 (190) T ss_pred CCCCEEEEEEEEEEEECC------CCHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHCCCCCC T ss_conf 886613677899996558------529999999999998759978845002158878987586669 No 76 >PTZ00064 histone acetyltransferase; Provisional Probab=95.79 E-value=0.019 Score=35.02 Aligned_cols=58 Identities=16% Similarity=0.258 Sum_probs=39.9 Q ss_pred EEEEE-EEECC---EEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 48999-98987---86479999998860678489986775254615998999999999999988997 Q gi|254780989|r 63 LMSFV-CRDGD---RIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAE 125 (185) Q Consensus 63 ~l~~V-ae~d~---~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~ 125 (185) -+++| .+.|+ .+||| |++-.....+ |+=|-+.|.||++|+|+.||..+-+..+.+|.. T Consensus 270 FlFYVl~e~D~~G~HivGY--FSKEK~s~~N---laCIltLP~yQrkGyGklLI~fSYeLSk~Egk~ 331 (442) T PTZ00064 270 FIFYVMTEVDENGYHITGY--FSKEKHSHNN---VSCILSLPQHQRKGYGKLLTAFSYLLSLKEGKT 331 (442) T ss_pred EEEEEEEEECCCCCEEEEE--ECHHHCCCCC---EEEEEECCHHHHCCCCHHHHHHHHHHHHCCCCC T ss_conf 5799998855998468888--4443235666---258895477885150011244464564315768 No 77 >PRK13834 putative autoinducer synthesis protein; Provisional Probab=95.72 E-value=0.14 Score=29.74 Aligned_cols=128 Identities=9% Similarity=0.016 Sum_probs=74.7 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEE-CCEEEEEEEEEEEEE----------------- Q ss_conf 89999999997278766675615202541200025766548999989-878647999999886----------------- Q gi|254780989|r 24 QVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRD-GDRIVGAIRATPIQI----------------- 85 (185) Q Consensus 24 ~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~-d~~iVG~i~~~~~~~----------------- 85 (185) ..+..++-.+.|.+.-...-.... ....+..+......+++.+ +|+++|++++.|.+- T Consensus 18 l~~m~rlR~~vF~drLgW~v~~~~----g~E~D~fD~~~~~Yll~~~~~g~v~G~~RLlpTt~p~ML~~~Fp~ll~~~~~ 93 (207) T PRK13834 18 LHQMHRLRARVFGGRLGWDVEIID----GCESDQFDDLKPTYILAISDSGRVAGCARLLPALGPTMLADVFPQLLPDGRL 93 (207) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCC----CCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCC T ss_conf 999999999997660586876899----9883268999986999990898489899633789976334705776389999 Q ss_pred -CCCCEEEEEEEEECHHHH---CCC----HHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHHHCCCCCCCC Q ss_conf -067848998677525461---599----8999999999999988997999950832---54313858705278267798 Q gi|254780989|r 86 -GKYKGFLRGPLGVLSEYR---KRG----IGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPHQHQIRYAHG 154 (185) Q Consensus 86 -~~~~~~~l~~laV~P~~q---g~G----iG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~~~~i~~~~~ 154 (185) ...+.+.+...+|+++.. +.+ +...|+...++.+..+|++.++.+.++. .+.|.|+.... -|...+.+ T Consensus 94 p~~~~i~E~SRF~Vd~~~~~~~~~~~~~~~t~~L~~a~ve~a~~~Gi~~ivtV~~~~~eRiL~R~G~~~~r-LG~p~~ig 172 (207) T PRK13834 94 NAHPAMIESSRFCVDTTLAEGRGGGSLHEATLTMFAGIIEWSMANGYTEIVTATDLRFERILARAGWPMQR-LGEPKAIG 172 (207) T ss_pred CCCCCEEEECCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCEEE-CCCCEEEC T ss_conf 99999899783489776665556887277899999999999998799889999438999999985995596-76887778 Q ss_pred CC Q ss_conf 88 Q gi|254780989|r 155 GG 156 (185) Q Consensus 155 ~~ 156 (185) +. T Consensus 173 ~~ 174 (207) T PRK13834 173 NT 174 (207) T ss_pred CE T ss_conf 87 No 78 >pfam01853 MOZ_SAS MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases. Probab=95.60 E-value=0.03 Score=33.80 Aligned_cols=59 Identities=17% Similarity=0.307 Sum_probs=39.9 Q ss_pred EEEE-EEECC---EEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 8999-98987---86479999998860678489986775254615998999999999999988997 Q gi|254780989|r 64 MSFV-CRDGD---RIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAE 125 (185) Q Consensus 64 l~~V-ae~d~---~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~ 125 (185) +++| .+.|+ .+||| |++-.. ..+..-|+=+-|.|.||++|+|+-||..+-+..|.+|.. T Consensus 53 lFYVl~e~d~~g~h~vGy--FSKEk~-s~~~~NLaCIltlP~~QrkGyG~~LI~fSY~LSr~E~~~ 115 (189) T pfam01853 53 LFYILTETDETGCHIVGY--FSKEKE-SSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGKI 115 (189) T ss_pred EEEEEEEECCCCCEEEEE--ECHHHC-CCCCCCEEEEEECCHHHHCCHHHHHHHHHHHHHHHCCCC T ss_conf 999999855998588766--311003-555673478996387885252332265550265653888 No 79 >KOG2535 consensus Probab=95.22 E-value=0.018 Score=35.13 Aligned_cols=48 Identities=13% Similarity=0.320 Sum_probs=37.5 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHC-CCCEEEE---ECCHHHHHHCCCEEHHHC Q ss_conf 54615998999999999999988-9979999---508325431385870527 Q gi|254780989|r 100 SEYRKRGIGSRLAYMSFMAIKNS-GAEFVLI---DAKHDWHKHLGFKKPHQH 147 (185) Q Consensus 100 P~~qg~GiG~~Li~~~l~~a~~~-G~~~v~l---~~~~~~Y~r~GF~~~~~~ 147 (185) ..||.||+|..||+++.+.|++. |-..+.+ .|.-.||.|+||.-..-| T Consensus 497 ~KfQHQG~GtLLmeEAERIAr~EHgS~KiavISGVGtR~YY~klGY~LdGPY 548 (554) T KOG2535 497 TKFQHQGFGTLLMEEAERIAREEHGSGKIAVISGVGTRNYYRKLGYELDGPY 548 (554) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHCCEECCCH T ss_conf 3432134347899999999887507863689943534888987180423705 No 80 >KOG2747 consensus Probab=95.14 E-value=0.056 Score=32.21 Aligned_cols=61 Identities=16% Similarity=0.232 Sum_probs=39.1 Q ss_pred CEEEEEEEE-C-CEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 548999989-8-786479999998860678489986775254615998999999999999988997 Q gi|254780989|r 62 RLMSFVCRD-G-DRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAE 125 (185) Q Consensus 62 ~~l~~Vae~-d-~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~ 125 (185) +-+++|... | ...||| |++- -+..+..-++=|-|.|.||++|+|+-||+.+=+..|.+|.. T Consensus 233 pFlFYVLte~d~~G~VGY--FSKE-K~s~~~yNlaCILtLPpyQRkGYGklLIdFSYeLSr~E~~~ 295 (396) T KOG2747 233 PFLFYVLTECDSYGCVGY--FSKE-KESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRREGKI 295 (396) T ss_pred CEEEEEEEECCCCEEEEE--ECCC-CCCCCCCCEEEEEECCHHHHCCCCHHHHHHHHHHHHCCCCC T ss_conf 638999996288406677--2245-56744564478995574662551121898887987334768 No 81 >pfam05301 Mec-17 Touch receptor neuron protein Mec-17. Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor. This family is conserved to higher eukaryotes. Probab=94.87 E-value=0.1 Score=30.63 Aligned_cols=60 Identities=10% Similarity=-0.006 Sum_probs=38.3 Q ss_pred CEEEEEEEEEEEE--E--------CCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEE-EEECCHH Q ss_conf 7864799999988--6--------067848998677525461599899999999999998899799-9950832 Q gi|254780989|r 72 DRIVGAIRATPIQ--I--------GKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFV-LIDAKHD 134 (185) Q Consensus 72 ~~iVG~i~~~~~~--~--------~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v-~l~~~~~ 134 (185) +.|+|++-.-+=. + +-....++-..+|++..|++|+|++|+++.++. ++.... +.+..|+ T Consensus 18 g~v~G~LKvG~K~Lfl~d~~g~~~e~~~~~CvLDFYVhes~QR~G~Gk~Lf~~ML~~---e~~~p~~la~DrPS 88 (120) T pfam05301 18 SALKGLLKVGYKKLFLFDNEGNTYEVEQTPCILDFYVHESRQRSGNGKKLFDEMLKK---ENVEPHQCAVDRPS 88 (120) T ss_pred CEEEEEEEEEEEEEEEECCCCCEEEECCCCEEEEEEECHHHHCCCCHHHHHHHHHHH---CCCCHHHCCCCCCC T ss_conf 169999987403588998999988851675899999802462457579999999987---19981346105898 No 82 >COG5628 Predicted acetyltransferase [General function prediction only] Probab=94.83 E-value=0.069 Score=31.64 Aligned_cols=98 Identities=13% Similarity=0.061 Sum_probs=59.2 Q ss_pred EEEEECCEEEEEEEEE-EEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HHHHHCCCE Q ss_conf 9998987864799999-9886067848998677525461599899999999999998899799995083--254313858 Q gi|254780989|r 66 FVCRDGDRIVGAIRAT-PIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH--DWHKHLGFK 142 (185) Q Consensus 66 ~Vae~d~~iVG~i~~~-~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~--~~Y~r~GF~ 142 (185) |....+|-+||++.+- ++......-..++.+.+.-.||++|+||+-+.......+..-.-+++.+..| .|++|+-.. T Consensus 40 ~~~~~~~~~igf~l~L~~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~~~g~w~Va~i~EN~PA~~fwK~~~~t 119 (143) T COG5628 40 WLFRIGGLPVGFALVLDLAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGSAWGVWQVATVRENTPARAFWKRVAET 119 (143) T ss_pred EEEEECCCCEEEEEEECCCCCCCCCCCCCCEEEEEEHHHCCCHHHHHHHHHHHHHHCEEEEEEECCCCCHHHHHHHCCCC T ss_conf 59998894145534432568887432362006876043046653999999998761458999952788068899862022 Q ss_pred EHHHCCCCCC--CCCCCHHHEEEEECC Q ss_conf 7052782677--988882322654405 Q gi|254780989|r 143 KPHQHQIRYA--HGGGAATDWLVHIFK 167 (185) Q Consensus 143 ~~~~~~i~~~--~~~~~~~~~~~~~l~ 167 (185) +.+... -+-..|.-..++.++ T Consensus 120 ----~~i~~E~r~d~~~d~iy~i~TF~ 142 (143) T COG5628 120 ----YPVVEEDRQDARWDGIYAVLTFR 142 (143) T ss_pred ----CCCCHHHHHCCCCCCCEEEEEEE T ss_conf ----45524666422688735889985 No 83 >COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism] Probab=94.64 E-value=0.37 Score=27.22 Aligned_cols=152 Identities=11% Similarity=0.041 Sum_probs=90.6 Q ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEE-EECCEEEEEEEEEEEEE--------- Q ss_conf 51379757899999999972787666756152025412000257665489999-89878647999999886--------- Q gi|254780989|r 16 FANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVC-RDGDRIVGAIRATPIQI--------- 85 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Va-e~d~~iVG~i~~~~~~~--------- 85 (185) .+...+.-..++..|-.+.|.+.-......... .+. +..+......+++ ..+|+|+|++++-|.+- T Consensus 9 ~~~~~~~~l~em~rlR~~vF~erL~W~v~~~~g---~E~-DqyD~~~t~Yll~~~~~g~I~G~~RlLptt~P~mL~~vF~ 84 (209) T COG3916 9 RRELFPKALEEMHRLRYQVFKERLGWDVVCIDG---FEI-DQYDNLDTVYLLALTSDGRIVGCVRLLPTTGPYMLTDVFP 84 (209) T ss_pred CCHHCHHHHHHHHHHHHHHHHHHCCCCEECCCC---CCC-CCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCHHHHHHH T ss_conf 320157999999888999988754771031688---440-1467888528999858996898998636778525550568 Q ss_pred ---C------CCCEEEEEEEEECH--HHHCCC----HHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEHHHC Q ss_conf ---0------67848998677525--461599----8999999999999988997999950832---5431385870527 Q gi|254780989|r 86 ---G------KYKGFLRGPLGVLS--EYRKRG----IGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKPHQH 147 (185) Q Consensus 86 ---~------~~~~~~l~~laV~P--~~qg~G----iG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~~~~ 147 (185) + +...+.++..+|++ .-++.| ++..|+.-.++.+...|+..++.+++.- ...|.|+.-..-. T Consensus 85 ~Ll~~~~~P~~p~vwEsSRF~vd~~~a~~~~g~~~~a~~el~~g~ie~a~~~G~~~IvtVt~~~meril~r~Gw~~~riG 164 (209) T COG3916 85 ALLEGGPPPSSPGVWESSRFAVDKPSARRAAGGVSPAAYELFAGMIEYALARGITGIVTVTDTGMERILRRAGWPLTRIG 164 (209) T ss_pred HHHCCCCCCCCCCEEEEEEEEECCCCCHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCEEECC T ss_conf 98469999999882787645515642533428954889999999999999769965999970089999998499708758 Q ss_pred CCCCCCCCCCHHHEEEEECCCCCCCC Q ss_conf 82677988882322654405797657 Q gi|254780989|r 148 QIRYAHGGGAATDWLVHIFKNNIMEN 173 (185) Q Consensus 148 ~i~~~~~~~~~~~~~~~~l~~~~~~~ 173 (185) . ...+++. ...-+-.+++.+.++. T Consensus 165 ~-~~~ig~~-~~VA~~l~i~~~s~~~ 188 (209) T COG3916 165 P-PLTIGNE-RAVALLLDIDRESLEQ 188 (209) T ss_pred C-CEEECCE-EEEEEEEECCHHHHHH T ss_conf 8-6366880-6999996258888998 No 84 >COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only] Probab=93.57 E-value=0.064 Score=31.84 Aligned_cols=132 Identities=15% Similarity=0.102 Sum_probs=78.8 Q ss_pred CCCCH-HHHHHHHHHHHHHCCCCCCCCCCC----EEHHHHHCHH------------CC-CCCCCEEEEEEEE------CC Q ss_conf 13797-578999999999727876667561----5202541200------------02-5766548999989------87 Q gi|254780989|r 17 ANEQR-VDQVAVDEVMVKAFLSTQEITDDV----LHKYVRNNVE------------EP-LSFVRLMSFVCRD------GD 72 (185) Q Consensus 17 R~a~~-~D~~aI~~L~~~aF~~~~~~~~~~----~~~~~~~~~~------------~~-~~~~~~l~~Vae~------d~ 72 (185) |.|+- +|...|.+|.+--|.+..+..+-. --...+.++. +. .+.+.+-++|.|. +- T Consensus 102 rea~ye~dfe~i~~leqyhyas~k~~va~w~c~~cg~~iean~kp~c~cg~~~~~~ei~gs~pasrf~i~el~~r~~yep 181 (593) T COG2401 102 REATYEEDFEFIAELEQYHYASQKEKVALWRCEKCGTIIEANTKPECKCGSHVHILEIKGSTPASRFLIVELVKREEYEP 181 (593) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHCHHHHHCCCCCCCCCCCEEEEEEECCCCCHHEEEEEHHCCCCCCC T ss_conf 34357777999987777631256545899805021100240488555789844899963688523313222211786886 Q ss_pred EEEEEEEEEEE-EE-----C------------------------------------CCCEEEEEEEEECHHHHCCCHHHH Q ss_conf 86479999998-86-----0------------------------------------678489986775254615998999 Q gi|254780989|r 73 RIVGAIRATPI-QI-----G------------------------------------KYKGFLRGPLGVLSEYRKRGIGSR 110 (185) Q Consensus 73 ~iVG~i~~~~~-~~-----~------------------------------------~~~~~~l~~laV~P~~qg~GiG~~ 110 (185) +|++|+..-|- .. + ...+.-|..+.|||+||+-|+|.. T Consensus 182 ~i~ayvrvdppiplmhr~l~g~ve~nirekvfpk~wf~p~f~p~r~l~~e~al~~~ntaaariarvvvhpdyr~dglg~~ 261 (593) T COG2401 182 RILAYVRVDPPIPLMHRKLDGKVERNIREKVFPKDWFHPTFWPERELRKEEALRECNTAAARIARVVVHPDYRADGLGQL 261 (593) T ss_pred CEEEEEECCCCCHHHHHHCCCHHHHHHHHHCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCHH T ss_conf 34789964898444554326507766665408265437766620344489999873225443067885665456761063 Q ss_pred HHHHHHHHHHHCCCCEEEEE-----------CCHHHHHHCCCEEHHHCC Q ss_conf 99999999998899799995-----------083254313858705278 Q gi|254780989|r 111 LAYMSFMAIKNSGAEFVLID-----------AKHDWHKHLGFKKPHQHQ 148 (185) Q Consensus 111 Li~~~l~~a~~~G~~~v~l~-----------~~~~~Y~r~GF~~~~~~~ 148 (185) -|..++++.+++-.+-..-- .-..|+++.||+-..... T Consensus 262 sv~~a~ewI~eRriPEmr~rkHlvetiaqmarynpffe~~gfkylwdta 310 (593) T COG2401 262 SVIAALEWIIERRIPEMRPRKHLVETIAQMARYNPFFEKVGFKYLWDTA 310 (593) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCHHHHHCEEEEEECC T ss_conf 7999999999851802324565999999987507455430303665337 No 85 >TIGR00124 cit_ly_ligase [citrate (pro-3S)-lyase] ligase; InterPro: IPR005216 [Citrate (pro-3S)-lyase] ligase (6.2.1.22 from EC), also known as citrate lyase ligase, is responsible for acetylation of the prosthetic group (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) of the gamma subunit of citrate lyase converting the inactive thiol form of the enzyme to the active form. ; GO: 0008771 [citrate (pro-3S)-lyase] ligase activity. Probab=93.31 E-value=0.68 Score=25.66 Aligned_cols=78 Identities=14% Similarity=0.146 Sum_probs=63.7 Q ss_pred EEEEEEEEC-CEEEEEEEEEEEEECCCCEEEEEEEEECHHHH-CCC-HHHHHHHHHHHHHHHCCCCEEEEECCHH---HH Q ss_conf 489999898-78647999999886067848998677525461-599-8999999999999988997999950832---54 Q gi|254780989|r 63 LMSFVCRDG-DRIVGAIRATPIQIGKYKGFLRGPLGVLSEYR-KRG-IGSRLAYMSFMAIKNSGAEFVLIDAKHD---WH 136 (185) Q Consensus 63 ~l~~Vae~d-~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~q-g~G-iG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y 136 (185) ....++..+ .+++|+..+ .+-.+.-+++++..+ |.| +...|+...+..+...|....+++..|. .+ T Consensus 32 ~~~~~~~~~~~~~~~~g~~--------~g~~~~~~~~~~~~~~g~g~~~~~l~~~l~~~~~~~~~~~~~~~~~p~~~~~~ 103 (342) T TIGR00124 32 EVFIVVYEDNEELVGCGGI--------AGNVIKCVAIDESLRSGEGTLALQLLTELLNLAYELGRPHLFLFTKPEYAALF 103 (342) T ss_pred EEEEEEEECCCEEEEECCC--------CCCCEEEEEECCHHCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHH T ss_conf 2688875057504650320--------00000012303001034215789999999988775156303565122567776 Q ss_pred HHCCCEEHHHCC Q ss_conf 313858705278 Q gi|254780989|r 137 KHLGFKKPHQHQ 148 (185) Q Consensus 137 ~r~GF~~~~~~~ 148 (185) +..||....+-+ T Consensus 104 ~~~g~~~~~~~~ 115 (342) T TIGR00124 104 EGCGFWTLAEAG 115 (342) T ss_pred HHCCCCEECCCC T ss_conf 432640010378 No 86 >TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members Probab=91.39 E-value=1.2 Score=24.20 Aligned_cols=128 Identities=14% Similarity=0.075 Sum_probs=79.6 Q ss_pred CCCCEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCC Q ss_conf 00020751379757899999999972787666756152025412000257665489999898786479999998860678 Q gi|254780989|r 10 ELEGLTFANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYK 89 (185) Q Consensus 10 ~~~~l~IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~ 89 (185) +-.|+++|... |.+...+|+.......-. +-....+++ .+.+.......+..|+..+++++|-+.+.. ..+ T Consensus 148 ~K~Gl~v~~~~--dl~~FY~v~~~n~rrlG~--Pv~~~~~F~-~l~~~f~~~~~il~v~~~~g~~~ag~l~~~----~~~ 218 (330) T TIGR03019 148 IKAGLTVTVDG--DLDRFYDVYAENMRDLGT--PVFSRRYFR-LLKDVFGEDCEVLTVRLGDGVVASAVLSFY----FRD 218 (330) T ss_pred HHCCCEEEECC--CHHHHHHHHHHHHHHCCC--CCCCHHHHH-HHHHHCCCCCEEEEEEECCCCEEEEEEEEE----CCC T ss_conf 87597456478--889999999999985799--999999999-999861667379999927996899999995----298 Q ss_pred EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE------CCHHHHHHCCCEEHHH Q ss_conf 48998677525461599899999999999998899799995------0832543138587052 Q gi|254780989|r 90 GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID------AKHDWHKHLGFKKPHQ 146 (185) Q Consensus 90 ~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~------~~~~~Y~r~GF~~~~~ 146 (185) ...-.-.+..+++|.-+---.|.-++|++|.++|++.-=.- |--.|=++.||++.-- T Consensus 219 ~v~~~y~as~~~~r~~~~n~llyW~~i~~A~e~G~~~fDFGrS~~~~G~~~FKk~WG~~p~pl 281 (330) T TIGR03019 219 EVLPYYAGGLREARDVAANDLMYWELMRRACERGLRVFDFGRSKRGTGPFKFKKNWGFEPQPL 281 (330) T ss_pred EEEEEEECCCHHHHHCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHCCCCEEEEE T ss_conf 899965345198874581488999999999987996883799999996867785379833201 No 87 >PHA01733 hypothetical protein Probab=91.11 E-value=1.1 Score=24.39 Aligned_cols=122 Identities=8% Similarity=-0.128 Sum_probs=69.6 Q ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEE Q ss_conf 51379757899999999972787666756152025412000257665489999898786479999998860678489986 Q gi|254780989|r 16 FANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGP 95 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~ 95 (185) ||++|.+|+..+..=++.. +..|....-.... ..+....... ...+....||+++|.+++.|...+ ..-.++. T Consensus 5 iRpaT~eDa~~vA~nLR~e--D~~Eieg~G~~P~--~l~~~~~~s~-~~~~~~~~dG~~aGmaGV~p~~~~--~~G~iWm 77 (153) T PHA01733 5 NRPATQADATEVAQNLRQE--DREEIEGLGHSPL--ALHLSLDVSE-NVVAFVAPDGSLAGVAGLVEDMGN--RVGEIWM 77 (153) T ss_pred CCCCCHHHHHHHHHHCCHH--HHHHHHHCCCCHH--HHHHHHHCCC-CEEEEEECCCCEEEEEEEECCCCC--CCCEEEE T ss_conf 2648799999999757898--9999986699838--9877876068-749999569968768775067657--7762689 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHH-HCCCCEEEEECCHH------HHHHCCCEEHHHC Q ss_conf 7752546159989999999999999-88997999950832------5431385870527 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIK-NSGAEFVLIDAKHD------WHKHLGFKKPHQH 147 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~-~~G~~~v~l~~~~~------~Y~r~GF~~~~~~ 147 (185) |++..-+ -+-...++++-++.. .+.++...-+.|.. +-+-+||+-.... T Consensus 78 L~T~~i~---k~~~~FlR~~k~~v~~~~~y~~L~N~vd~rN~~hikwLk~lGF~f~~~~ 133 (153) T PHA01733 78 VCTPAIE---KNPIALLRGAKWWLPKSRNYDLLWNIVDKRNLVHRKLLRKLGFKGLRYV 133 (153) T ss_pred ECCHHHH---HCHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCEEEEE T ss_conf 7076887---5739999853999985379874788624575999999997697077641 No 88 >COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics] Probab=89.37 E-value=0.16 Score=29.48 Aligned_cols=58 Identities=19% Similarity=0.339 Sum_probs=34.6 Q ss_pred EEEEEEEECC----EEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC Q ss_conf 4899998987----864799999988606784899867752546159989999999999999889 Q gi|254780989|r 63 LMSFVCRDGD----RIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG 123 (185) Q Consensus 63 ~l~~Vae~d~----~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G 123 (185) -+++|....| .+||| |+.- -......-|+=+-+.|.||++|+|+-||+.+-...+.+| T Consensus 234 flFYvl~~~~~~~~h~vGy--FSKE-K~S~~~yNLaCILtLP~yQRrGYG~lLIdFSY~Ls~~E~ 295 (395) T COG5027 234 FLFYVLTERGDTGCHLVGY--FSKE-KESEQDYNLACILTLPPYQRRGYGKLLIDFSYLLSQKEG 295 (395) T ss_pred EEEEEEEECCCCCEEEEEE--ECHH-HCCCCCCCEEEEEECCHHHHCCCCEEEEEEEEECCCCCC T ss_conf 5999999738754035666--3321-135456755799965756706644575554220101456 No 89 >PRK01305 arginyl-tRNA-protein transferase; Provisional Probab=89.08 E-value=1.9 Score=23.02 Aligned_cols=74 Identities=14% Similarity=0.113 Sum_probs=49.0 Q ss_pred EEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE--ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCC Q ss_conf 48999989878647999999886067848998677--5254615998999999999999988997999950832543138 Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLG--VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLG 140 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~la--V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~G 140 (185) ...+--..+|++||.+..-. ..++ |+.|+ -+|++..+++|+-.+=.-|+.+++.|.+.+.|-=...==.|+. T Consensus 144 t~~~e~~~~~~Li~v~v~D~----l~dg--lSAVYtFyDP~~~~~SLGt~~IL~~I~~a~~~~l~y~YLGYwI~~c~kM~ 217 (241) T PRK01305 144 TRFIEFRLDGRLVAVAVTDV----LDDG--LSAVYTFYDPDEEHRSLGTFAILWQIEQAKRLGLPYVYLGYWIKGSRKMN 217 (241) T ss_pred CEEEEEEECCCEEEEEEEEC----CCCC--CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCC T ss_conf 68999980894799999835----8864--32360120756566897599999999999985998695226878887777 Q ss_pred CE Q ss_conf 58 Q gi|254780989|r 141 FK 142 (185) Q Consensus 141 F~ 142 (185) || T Consensus 218 YK 219 (241) T PRK01305 218 YK 219 (241) T ss_pred CC T ss_conf 75 No 90 >KOG2779 consensus Probab=88.21 E-value=2.1 Score=22.67 Aligned_cols=52 Identities=19% Similarity=0.142 Sum_probs=20.6 Q ss_pred CEEEEEEEEEEEEECCCC----EEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC Q ss_conf 786479999998860678----4899867752546159989999999999999889 Q gi|254780989|r 72 DRIVGAIRATPIQIGKYK----GFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG 123 (185) Q Consensus 72 ~~iVG~i~~~~~~~~~~~----~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G 123 (185) .++||||..-|..+.-++ .+.+.-|.||...|+++++--||++.-++..-.| T Consensus 145 ~kLVaFIsaiP~~irvrdk~vk~veINFLCVHKkLRSKRlaPvLIrEITRRvnl~g 200 (421) T KOG2779 145 KKLVAFISAIPATIRVRDKVVKMVEINFLCVHKKLRSKRLAPVLIREITRRVNLEG 200 (421) T ss_pred CCEEEEEECCCCEEEECCCEEEEEEEEEEEEEHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 72589985253178871426556457889973333016653077888877740442 No 91 >KOG4135 consensus Probab=87.07 E-value=1.3 Score=23.99 Aligned_cols=51 Identities=14% Similarity=0.063 Sum_probs=37.8 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHC----CCCEEEEECCH---HHHHHCCCEEHHH Q ss_conf 775254615998999999999999988----99799995083---2543138587052 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNS----GAEFVLIDAKH---DWHKHLGFKKPHQ 146 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~----G~~~v~l~~~~---~~Y~r~GF~~~~~ 146 (185) +--.|.-||+|+|+.-+...+..+.+. .+.+-+-.++. -++++|+|..... T Consensus 113 MIAEP~~RgKG~G~eav~~ml~y~~s~l~l~Ky~vkig~~nk~sl~lFkk~~f~q~~~ 170 (185) T KOG4135 113 MIAEPRGRGKGIGTEAVRAMLAYAYSVLKLDKYEVKIGMDNKPSLRLFKKFLFTQVFY 170 (185) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHEEEEE T ss_conf 9805555677731889999999999874020489972588862789999753404532 No 92 >pfam04377 ATE_C Arginine-tRNA-protein transferase, C terminus. This family represents the C terminal region of the enzyme arginine-tRNA-protein transferase (EC 2.3.2.8), which catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N-end rule pathway of protein degradation by conjugating a destabilising amino acid to the amino terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N terminal cysteine is sometimes modified. Probab=86.66 E-value=2.6 Score=22.12 Aligned_cols=62 Identities=15% Similarity=0.173 Sum_probs=44.8 Q ss_pred EEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE--ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 48999989878647999999886067848998677--525461599899999999999998899799995 Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLG--VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID 130 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~la--V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~ 130 (185) ...+--.++|++||.+..-. ..++ ++.|+ -+|++..+++|+-.+=.-|+.+++.|.+.+.|- T Consensus 39 t~~~eyr~~~~Liav~v~D~----l~dg--lSaVY~ffdP~~~~~SlG~~~iL~~I~~ak~~~~~y~YLG 102 (128) T pfam04377 39 TRFLEYRLDGKLIAVAVTDI----LPDG--LSAVYTFYDPDYSKRSLGTYSILWQIELAKELGLPYVYLG 102 (128) T ss_pred CEEEEEEECCEEEEEEEEEC----CCCC--CHHEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECE T ss_conf 78999982896899999842----7742--0104565372200036318999999999998299738640 No 93 >KOG2696 consensus Probab=85.83 E-value=1 Score=24.61 Aligned_cols=42 Identities=12% Similarity=0.116 Sum_probs=30.5 Q ss_pred EEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 998677525461599899999999999998899799995083 Q gi|254780989|r 92 LRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH 133 (185) Q Consensus 92 ~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~ 133 (185) -+..+-+.|.||++|+|+.|++.......+.--..=+.+-+| T Consensus 219 RiSQmlilpPfq~~Glgs~l~E~i~r~~~~~p~v~DiTVEdP 260 (403) T KOG2696 219 RISQMLILPPFQGKGLGSQLYEAIARDYLEEPTVLDITVEDP 260 (403) T ss_pred HHHEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 341257736556775478999999886405983258873382 No 94 >KOG4601 consensus Probab=82.83 E-value=1.6 Score=23.46 Aligned_cols=60 Identities=10% Similarity=0.032 Sum_probs=39.9 Q ss_pred CEEEEEEEEEE--EEE-C------CCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCE-EEEECCHH Q ss_conf 78647999999--886-0------6784899867752546159989999999999999889979-99950832 Q gi|254780989|r 72 DRIVGAIRATP--IQI-G------KYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEF-VLIDAKHD 134 (185) Q Consensus 72 ~~iVG~i~~~~--~~~-~------~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~-v~l~~~~~ 134 (185) +.|.|++.... ..+ + ..+.++|-..+||+.-|++|.|+.|+++.+++ ++++. -+...+|. T Consensus 81 s~l~GllKVG~KkLfl~D~~~~~ye~e~lcILDFyVheS~QR~G~G~~lfdyMl~k---E~vephQ~a~DrPS 150 (264) T KOG4601 81 SILKGLLKVGYKKLFLTDNEQNQYEEEALCILDFYVHESEQRSGNGFKLFDYMLKK---ENVEPHQCAFDRPS 150 (264) T ss_pred HHEEEEEHCCCEEEEEECCHHHHHCCCCCEEEEEEEEHHHHHCCCHHHHHHHHHHH---CCCCCHHEECCCHH T ss_conf 30311000042158885307666304774478888614665147458999999985---28871120036837 No 95 >pfam09924 DUF2156 Uncharacterized conserved protein (DUF2156). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=80.23 E-value=4.9 Score=20.50 Aligned_cols=58 Identities=16% Similarity=0.075 Sum_probs=38.1 Q ss_pred EEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCC Q ss_conf 4899998987864799999988606784899867752546159989999999999999889 Q gi|254780989|r 63 LMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG 123 (185) Q Consensus 63 ~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G 123 (185) .-.+++.++|+|+||+.+.|+. ..+++.+.-.=-+|+. =+|+-..|+.+.++-+++.. T Consensus 179 ~~~~~~~~~g~i~aF~~~~p~~--~~~~~~idl~r~~~da-~~G~~~~l~~~~~~gLr~~K 236 (250) T pfam09924 179 LRGAVVRVDGKIVAFASGEPLN--GRDTLSLDLMRRDPDA-PNGIMEFLNAELAEGLRRFK 236 (250) T ss_pred CCEEEEEECCEEEEEEEEEEEC--CCCEEEEEEEECCCCC-CCCHHHHHHHHHHHHHHHCC T ss_conf 9589999999999999999957--9997999998659999-88699999999999999747 No 96 >pfam06852 DUF1248 Protein of unknown function (DUF1248). This family represents a conserved region within a number of proteins of unknown function that seem to be specific to C. elegans. Note that some family members contain more than one copy of this region. Probab=77.59 E-value=5.9 Score=20.01 Aligned_cols=78 Identities=14% Similarity=0.218 Sum_probs=51.9 Q ss_pred EEEEEEEEC-CEEEEEEE---EEEEEEC-CCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HH Q ss_conf 489999898-78647999---9998860-67848998677525461599899999999999998899799995083--25 Q gi|254780989|r 63 LMSFVCRDG-DRIVGAIR---ATPIQIG-KYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH--DW 135 (185) Q Consensus 63 ~l~~Vae~d-~~iVG~i~---~~~~~~~-~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~--~~ 135 (185) .+.+++-.+ +++|+... |-|..-. ..+...+|-.-++|+|||+.+.+-.-..+.+..+..+...+.-...+ .| T Consensus 46 ~l~l~clKgT~rvi~~~h~i~f~pL~~~~d~~~~f~G~~Wi~PdyR~~~~~~l~~~~~~~~~~~~~~N~~~~~~~~~~~f 125 (180) T pfam06852 46 RLKLTCLKGTNRVIATFHTITFRPLPSSKDKPFVFLGFGWIDPDYRGKDIMKLTDDIAKEEERRKSDNAVAQNNPPSMNF 125 (180) T ss_pred EEEEEEECCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 79999976998078874079853677789986289860346855464168998999998763375555313268489999 Q ss_pred HHH-CC Q ss_conf 431-38 Q gi|254780989|r 136 HKH-LG 140 (185) Q Consensus 136 Y~r-~G 140 (185) |++ .| T Consensus 126 W~k~~G 131 (180) T pfam06852 126 WKKLTG 131 (180) T ss_pred HHHHHC T ss_conf 999509 No 97 >pfam02388 FemAB FemAB family. The femAB operon codes for two nearly identical approximately 50-kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan. These proteins are also considered as a factor influencing the level of methicillin resistance. Probab=72.25 E-value=8.1 Score=19.18 Aligned_cols=35 Identities=23% Similarity=0.146 Sum_probs=20.4 Q ss_pred EECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 75254615998999999999999988997999950 Q gi|254780989|r 97 GVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA 131 (185) Q Consensus 97 aV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~ 131 (185) +..++|+.-.-.-.|.-+++++|+++|+..-=..| T Consensus 324 gs~~~~~~~~~~y~l~w~~i~~a~~~G~~~ydf~G 358 (407) T pfam02388 324 GSDPEYRKFYAPYLLQWEMIQEAKERGIDRYNFYG 358 (407) T ss_pred CCCHHHHHCCCHHHHHHHHHHHHHHCCCCEEECCC T ss_conf 72076564476189999999999987998896418 No 98 >cd03173 DUF619-like DUF619-like: This CD includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain seen in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. This CD also includes the DUF619 domain of the FABP N-acetylglutamate kinase (NAGK). The nuclear-encoded mitochondrial polyprotein precursor (ARG5,6) consists of an N-terminal NAGK (ArgB) domain, a central DUF619 domain, and a C-ter Probab=70.96 E-value=8.7 Score=19.01 Aligned_cols=48 Identities=17% Similarity=0.054 Sum_probs=34.6 Q ss_pred EECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHH Q ss_conf 89878647999999886067848998677525461599899999999999 Q gi|254780989|r 69 RDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMA 118 (185) Q Consensus 69 e~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~ 118 (185) -.++...|.+.+++- ......+|..++|.++.||.|+|..+.....+. T Consensus 15 y~~~~y~g~AIvt~~--~~~~~~yLdKFavs~~~~g~gv~d~v~~~l~~d 62 (99) T cd03173 15 YVDEPYEGLAIVTPE--GNGGTAYLDKFAVSSSGWGNGVGDNIFEALRKD 62 (99) T ss_pred EEECCCCEEEEEECC--CCCCCEEEEEEEECHHHHCCCHHHHHHHHHHHH T ss_conf 994897589999448--999965777799963454049899999999854 No 99 >cd04266 DUF619-NAGS-FABP DUF619-NAGS-FABP: DUF619 domain of N-acetylglutamate synthase (NAGS) of the fungal arginine-biosynthetic pathway (FABP). This NAGS (ARG2) consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain, yet to be characterized, is predicted to function in NAG synthase association in fungi. Probab=69.11 E-value=9.5 Score=18.76 Aligned_cols=53 Identities=17% Similarity=0.117 Sum_probs=34.7 Q ss_pred CEEEEEEEEECHHHHC-CCHHHHHHHHHHHHHHHCCCCEEEEE--CCH---HHHHH-CCCEEH Q ss_conf 8489986775254615-99899999999999998899799995--083---25431-385870 Q gi|254780989|r 89 KGFLRGPLGVLSEYRK-RGIGSRLAYMSFMAIKNSGAEFVLID--AKH---DWHKH-LGFKKP 144 (185) Q Consensus 89 ~~~~l~~laV~P~~qg-~GiG~~Li~~~l~~a~~~G~~~v~l~--~~~---~~Y~r-~GF~~~ 144 (185) ...+|-..||.|+.|| .|+|..|-+...+. ..-..++-. .+| =|++| -|+.+. T Consensus 38 ~vpYLDKFAV~~~aqG~~Gl~d~iw~~m~~~---~p~~lfWRSR~~NpvNkwYf~rs~G~~k~ 97 (108) T cd04266 38 KIAYLDKFAVLPKAQGSDGIADILFNAMLDE---FPNELIWRSRKDNPVNKWYFERSVGVLKL 97 (108) T ss_pred CCCEEEEEEECHHHCCCCCHHHHHHHHHHHH---CCCCEEEECCCCCCCCHHHHHHCCEEEEC T ss_conf 6505520454322248632899999999986---78521886068998557788644104754 No 100 >pfam07395 Mig-14 Mig-14. This family contains a number of bacterial mig-14 proteins (approximately 270 residues long). In Salmonella, mig-14 contributes to resistance to antimicrobial peptides, although the mechanism is not fully understood. Probab=65.25 E-value=11 Score=18.29 Aligned_cols=70 Identities=16% Similarity=-0.009 Sum_probs=53.1 Q ss_pred EEEEEEECCEEEEEEEEEEEEECCCCEEEEE--EEEECHHHHCCCHHHHH----HHHHHHHHHHCCCCEEEEECCHHH Q ss_conf 8999989878647999999886067848998--67752546159989999----999999999889979999508325 Q gi|254780989|r 64 MSFVCRDGDRIVGAIRATPIQIGKYKGFLRG--PLGVLSEYRKRGIGSRL----AYMSFMAIKNSGAEFVLIDAKHDW 135 (185) Q Consensus 64 l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~--~laV~P~~qg~GiG~~L----i~~~l~~a~~~G~~~v~l~~~~~~ 135 (185) +.-|.-.+|+.+++=.+.++ +...+.++- --+|+|+++.--.|+-| ++.+-+.++++|-+..+-.|.|++ T Consensus 174 fG~VLfl~~~PcA~qlv~k~--esp~wv~~d~iNGG~Dpe~~~~SpGSIL~wlN~~~A~~~~~~~~K~lrfSfGr~~~ 249 (264) T pfam07395 174 FGSVLFLNGKPIAIQIVYRS--EAPCWVSVEYINGGVDPETRKFSPGSVLSFLNTQAAWEDARARGKPLRFSFGRADR 249 (264) T ss_pred EEEEEEECCCCEEEEEEEEE--CCCCEEEEECCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHCCCEEEEECCCCH T ss_conf 20089988963699999984--48870799536677381303479863898721799999999729926998078974 No 101 >pfam12261 T_hemolysin Thermostable hemolysin. This family of proteins is found in bacteria. Proteins in this family are typically between 200 and 228 amino acids in length. T_hemolysin is a pore-forming toxin of bacteria, able to lyse erythrocytes from a number of mammalian species. Probab=64.87 E-value=12 Score=18.25 Aligned_cols=50 Identities=20% Similarity=0.254 Sum_probs=41.3 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHH---HHHHCCCEEH Q ss_conf 89986775254615998999999999999988997999950832---5431385870 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHD---WHKHLGFKKP 144 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~---~Y~r~GF~~~ 144 (185) ..+|.++.. +.|..+.|+......+...|++.++.++.+. .+.|+|..+. T Consensus 89 vEvGnLAs~----~~g~s~~l~~~l~~~L~~~g~~w~vFTaT~~lr~~~~rlgl~~~ 141 (180) T pfam12261 89 AEVGNLASF----SAGAARLLFIALTQLLAARGFEWVVFTATDQLRNSFRRLGLEPT 141 (180) T ss_pred EEEEHHHHC----CCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCCE T ss_conf 776102326----85639999999999999779988999486899999997399966 No 102 >TIGR02109 PQQ_syn_pqqE coenzyme PQQ biosynthesis protein E; InterPro: IPR011843 This entry describes coenzyme PQQ biosynthesis protein E, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.; GO: 0051539 4 iron 4 sulfur cluster binding, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=62.54 E-value=10 Score=18.59 Aligned_cols=44 Identities=14% Similarity=0.097 Sum_probs=40.0 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHC Q ss_conf 77525461599899999999999998899799995083254313 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHL 139 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~ 139 (185) =+-.|.--.---.+++++++-++++..|....+++.-||||.+. T Consensus 193 ~aLlPt~~Ql~~a~r~V~~aRer~~g~~~~~~l~yV~PDYY~~R 236 (363) T TIGR02109 193 AALLPTREQLEEATRIVEEARERLKGQGNPLSLDYVVPDYYAER 236 (363) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHCC T ss_conf 42489889999999999999999860799823676348712106 No 103 >PRK12568 glycogen branching enzyme; Provisional Probab=62.07 E-value=6.5 Score=19.77 Aligned_cols=59 Identities=14% Similarity=0.103 Sum_probs=44.5 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECC--HHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECC Q ss_conf 999999999999889979999508--32543138587052782677988882322654405 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAEFVLIDAK--HDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFK 167 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~~v~l~~~--~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~ 167 (185) +.|....+..++++|+..|-|.-- ..||...||+...=|.+.+-||...+=..+|..+- T Consensus 269 ~ela~~Lipyvk~mGyThIELMPi~EhP~d~SWGYQvtg~fAptSRyGtPddfk~fVD~~H 329 (730) T PRK12568 269 PTLAEQLIPYVQQLGFTHIELLPITEHPFGGSWGYQPLGLYAPTARHGSPDGFAQFVDACH 329 (730) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999998498779973444568877656556556415467889899999999998 No 104 >PRK12313 glycogen branching enzyme; Provisional Probab=61.00 E-value=5.5 Score=20.19 Aligned_cols=59 Identities=14% Similarity=0.037 Sum_probs=43.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCH--HHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECC Q ss_conf 9999999999998899799995083--2543138587052782677988882322654405 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAEFVLIDAKH--DWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFK 167 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~~v~l~~~~--~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~ 167 (185) +.+.+..+..++++|+..|-|..-. .+|..+||....=+.+...|+...+=..||..+- T Consensus 169 ~e~a~~l~~yl~~lG~t~velmpv~e~p~~~swGY~~~~~fa~~sryGtp~dlk~lVd~~H 229 (632) T PRK12313 169 RELADELIPYVKEMGYTHVEFMPLMEHPLDGSWGYQLTGYFAPTSRYGTPEDFMYFVDACH 229 (632) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEECCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999998198779982044568988756666500146567799899999999999 No 105 >COG2898 Uncharacterized conserved protein [Function unknown] Probab=60.49 E-value=14 Score=17.77 Aligned_cols=92 Identities=10% Similarity=0.082 Sum_probs=59.0 Q ss_pred EEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCH----------H-- Q ss_conf 9989878647999999886067848998677525461599899999999999998899799995083----------2-- Q gi|254780989|r 67 VCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKH----------D-- 134 (185) Q Consensus 67 Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~----------~-- 134 (185) +...+|+||+++.++|.. ......+--+=-+|+ --+|+-..|+-+.+.++|++|+....+-..| . T Consensus 397 ~~~~~g~VvaFa~l~~~~--~~~~~SlDlMR~sp~-ap~g~mdfLf~~li~~aKe~G~~~fsLgmApls~~~~~~la~~~ 473 (538) T COG2898 397 AVDNEGEVVAFANLMPTG--GKEGYSLDLMRRSPD-APNGTMDFLFSELILWAKEEGYQRFSLGMAPLSGLFERQLAPLW 473 (538) T ss_pred EECCCCCEEEEEEECCCC--CCCEEEEEEEECCCC-CCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCHHH T ss_conf 783779769987621067--765247874313888-99517999999999999974976994377426776401131699 Q ss_pred ------HHHH----CCCEEHHHCCCCCCCCCCCHHHEE Q ss_conf ------5431----385870527826779888823226 Q gi|254780989|r 135 ------WHKH----LGFKKPHQHQIRYAHGGGAATDWL 162 (185) Q Consensus 135 ------~Y~r----~GF~~~~~~~i~~~~~~~~~~~~~ 162 (185) .|++ ++|+.-..++-++- |.+.+-|.. T Consensus 474 ~R~~r~l~~~~~~~Y~f~gLrrfK~Kf~-P~W~prYl~ 510 (538) T COG2898 474 NRFGRLLFRHGERFYNFEGLRRFKDKFH-PEWEPRYLA 510 (538) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHCCC-CCCCEEEEE T ss_conf 9999999995350106177898763469-766305999 No 106 >TIGR01490 HAD-SF-IB-hyp1 HAD-superfamily subfamily IB hydrolase, TIGR01490; InterPro: IPR006385 This group of sequences belong to the IB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The sequences are all bacterial. The IB subfamily includes the enzyme phosphoserine phosphatase. Due to this relationship, several of these sequences have been annotated as "phosphoserine phosphatase related proteins," or "phosphoserine phosphatase-family enzymes." There is presently no evidence that any of the proteins in this family possess PSPase activity. . Probab=59.55 E-value=13 Score=18.01 Aligned_cols=35 Identities=26% Similarity=0.299 Sum_probs=28.2 Q ss_pred HHHHHHHHHHCCCCEEEEECCHH-----HHHHCCCEEHHH Q ss_conf 99999999988997999950832-----543138587052 Q gi|254780989|r 112 AYMSFMAIKNSGAEFVLIDAKHD-----WHKHLGFKKPHQ 146 (185) Q Consensus 112 i~~~l~~a~~~G~~~v~l~~~~~-----~Y~r~GF~~~~~ 146 (185) +...+++-+++|...|++.+.++ +++++||..+-. T Consensus 94 a~~~~~~H~~~G~~~vlvSaS~~~~v~~~A~~~G~~~~~g 133 (204) T TIGR01490 94 ARDLIKAHKAEGHTIVLVSASLEILVKPLARKLGIDNAIG 133 (204) T ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHCCCCEEEE T ss_conf 9999999998089389981678899999998718660301 No 107 >PRK05402 glycogen branching enzyme; Provisional Probab=59.16 E-value=8.5 Score=19.05 Aligned_cols=59 Identities=15% Similarity=0.121 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEC--CHHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEE Q ss_conf 8999999999999988997999950--8325431385870527826779888823226544 Q gi|254780989|r 107 IGSRLAYMSFMAIKNSGAEFVLIDA--KHDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHI 165 (185) Q Consensus 107 iG~~Li~~~l~~a~~~G~~~v~l~~--~~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~ 165 (185) -=+.|....+..++++|+..|-+.- ...||...||+...=|.+.+-||...+-..+|.. T Consensus 268 sY~ela~~Lipyvk~mGythIElMPv~EhP~d~SWGYQ~tgyfAptSRyGtPddfk~fVD~ 328 (730) T PRK05402 268 SYRELADQLIPYVKEMGFTHVELLPVAEHPFDGSWGYQPTGYYAPTSRFGTPDDFRYFVDA 328 (730) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 8999999999999973987799644304788887666655401563246898999999999 No 108 >COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism] Probab=57.86 E-value=15 Score=17.49 Aligned_cols=86 Identities=17% Similarity=0.200 Sum_probs=51.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHH Q ss_conf 57899999999972787666756152025412000257665489999898786479999998860678489986775254 Q gi|254780989|r 22 VDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSE 101 (185) Q Consensus 22 ~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~ 101 (185) -|.+.+..|...+|.-. ...++.+.++...... .+|+ |.--|.+.++.-.-+..+..+|-.+||.+. T Consensus 345 Ldl~r~q~LI~~SFkRT------Ld~h~y~~r~~~~La~----~iVs---gdY~g~aIlTyegs~~~~vpYLDKfAVl~~ 411 (495) T COG5630 345 LDLPRLQHLIQSSFKRT------LDPHYYETRINTPLAR----AIVS---GDYRGAAILTYEGSGENNVPYLDKFAVLDD 411 (495) T ss_pred CCCHHHHHHHHHHHHHC------CCHHHHHHHCCCCCEE----EEEE---CCCEEEEEEEEECCCCCCCCCEEEEECCCC T ss_conf 27478899998887630------3879888750673115----7761---562246899860467778731010001566 Q ss_pred HHC-CCHHHHHHHHHHHHHH Q ss_conf 615-9989999999999999 Q gi|254780989|r 102 YRK-RGIGSRLAYMSFMAIK 120 (185) Q Consensus 102 ~qg-~GiG~~Li~~~l~~a~ 120 (185) .|| .|||..+..-.-+..- T Consensus 412 aQGs~gisd~vfniM~e~fP 431 (495) T COG5630 412 AQGSEGISDAVFNIMREEFP 431 (495) T ss_pred CCCCCHHHHHHHHHHHHHCC T ss_conf 54320488999999987670 No 109 >TIGR00455 apsK adenylylsulfate kinase; InterPro: IPR002891 Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulphate. This domain contains an ATP binding P-loop motif .; GO: 0005524 ATP binding, 0016301 kinase activity, 0016772 transferase activity transferring phosphorus-containing groups, 0000103 sulfate assimilation. Probab=57.34 E-value=13 Score=17.86 Aligned_cols=46 Identities=22% Similarity=0.429 Sum_probs=31.4 Q ss_pred CHHHHHHHHHH-HHHHHCCCCEEEEECCH---HHHHHCCCEEHHHC-CCCC Q ss_conf 98999999999-99998899799995083---25431385870527-8267 Q gi|254780989|r 106 GIGSRLAYMSF-MAIKNSGAEFVLIDAKH---DWHKHLGFKKPHQH-QIRY 151 (185) Q Consensus 106 GiG~~Li~~~l-~~a~~~G~~~v~l~~~~---~~Y~r~GF~~~~~~-~i~~ 151 (185) |=||.-+..++ +++.++|+...+|.||. ..++-|||-..--. +|+- T Consensus 29 GsGKsTiA~Al~~~L~~~G~~~~~LDGDnvR~gL~~dLGFS~~DR~eNIRR 79 (187) T TIGR00455 29 GSGKSTIANALEKKLEKKGYRVYVLDGDNVRHGLNKDLGFSEEDRKENIRR 79 (187) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEECCCEEECCCCCCCCCCHHHHCCCCCE T ss_conf 563579999999999966974999758634247788888885670568835 No 110 >pfam04768 DUF619 Protein of unknown function (DUF619). This region of unknown function is found at the C-terminus of Neurospora crassa acetylglutamate synthase (amino-acid acetyltransferase, EC: 2.3.1.1). It is also found C-terminal to the amino acid kinase region (pfam00696) in some fungal acetylglutamate kinase enzymes. Probab=55.64 E-value=17 Score=17.27 Aligned_cols=110 Identities=10% Similarity=0.044 Sum_probs=59.7 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECHHH Q ss_conf 78999999999727876667561520254120002576654899998987864799999988606784899867752546 Q gi|254780989|r 23 DQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQIGKYKGFLRGPLGVLSEY 102 (185) Q Consensus 23 D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~~~~~~~~~l~~laV~P~~ 102 (185) |.+.+.+|..++|... .....++ +++.... +-+-.++..=|.+.+++- -+.....+|-..+|.+.. T Consensus 33 d~~~L~~lle~sf~gk-----~~v~~y~-~~l~~~~-------~~iy~~~~~~~~AIVt~~-~~~~~vpyLdKFav~~~a 98 (168) T pfam04768 33 DLERLKNLIERSFDGK-----LSVADYL-DRLKGRL-------AKIYVDEPYEALAIVTPP-KEDGGVPYLDKFAVSKSA 98 (168) T ss_pred CHHHHHHHHHHCCCCC-----CCHHHHH-HHHCCCC-------CEEEECCCCCEEEEEECC-CCCCCCEEEEEEEECCCC T ss_conf 8999999998664775-----4499999-8733676-------569983798379999506-789987788777425301 Q ss_pred HCCCHHHHHHHHHHHHHHHCCCCEEEEE--CCHH---HHHH-CCCEEHHHCCCC Q ss_conf 1599899999999999998899799995--0832---5431-385870527826 Q gi|254780989|r 103 RKRGIGSRLAYMSFMAIKNSGAEFVLID--AKHD---WHKH-LGFKKPHQHQIR 150 (185) Q Consensus 103 qg~GiG~~Li~~~l~~a~~~G~~~v~l~--~~~~---~Y~r-~GF~~~~~~~i~ 150 (185) ||.|+|..+.....+... ..++.. .||- |++| =|+..-....+. T Consensus 99 ~g~gv~d~vw~~m~~d~p----~L~Wrsr~~n~~n~Wyf~~sdG~~~~~~w~lF 148 (168) T pfam04768 99 WGNGVSDNVFNALKKDFP----KLVWRSREDNPNNKWYFERSDGSLKKNGWVLF 148 (168) T ss_pred HHCCHHHHHHHHHHHHCC----CEEEEECCCCCCCCEEEECCEEEEECCCEEEE T ss_conf 210889999999986486----23999778997430033133157976980899 No 111 >COG5653 Protein involved in cellulose biosynthesis (CelD) [Cell envelope biogenesis, outer membrane] Probab=51.98 E-value=19 Score=16.91 Aligned_cols=136 Identities=11% Similarity=0.002 Sum_probs=80.3 Q ss_pred CEEECCCCHHH-----HHHHHHHHHHHCCC---CCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEEEE Q ss_conf 20751379757-----89999999997278---76667561520254120002576654899998987864799999988 Q gi|254780989|r 13 GLTFANEQRVD-----QVAVDEVMVKAFLS---TQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATPIQ 84 (185) Q Consensus 13 ~l~IR~a~~~D-----~~aI~~L~~~aF~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~~~ 84 (185) +..+..+...| ...+.+.-+.=|.. ..-..+....+.+++-...........-+..+.+|+||+--..++ T Consensus 214 ~~r~v~a~s~d~~e~~~~~l~~~Kr~rfa~~G~~Dlf~~~~t~~fl~dL~~~~~~d~~~rl~gL~~G~~lvAV~~~lr-- 291 (406) T COG5653 214 AVRFVAARSPDEVEALFATLFRWKRLRFARTGQFDLFRAGWTRDFLRDLFTQRAEDGSGRLFGLHAGGRLVAVHGLLR-- 291 (406) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEEECCEEEEEEEEEC-- T ss_conf 756885279725789999999999999997188501036258999999984527677458998742777899886312-- Q ss_pred ECCCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEE-EECCHHHHHHCCCEEHHHCCCCCC Q ss_conf 60678489986775254615998999999999999988997999-950832543138587052782677 Q gi|254780989|r 85 IGKYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVL-IDAKHDWHKHLGFKKPHQHQIRYA 152 (185) Q Consensus 85 ~~~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~-l~~~~~~Y~r~GF~~~~~~~i~~~ 152 (185) ....+...-.+++|++-.--=|-.|+-..|+++-.+|+...= .+|+..|=+++|=+...-+++..+ T Consensus 292 --~~~t~h~~l~a~dpe~~~~SPG~~lf~d~i~~~~~~g~~~~DfgvG~q~YKR~~~~~~tvl~~~~~~ 358 (406) T COG5653 292 --QGGTYHAWLGAIDPEFARASPGMLLFLDLIEWACGQGLARFDFGVGDQSYKRHWGDQSTVLFGLLGA 358 (406) T ss_pred --CCCEEEEEEECCCHHHHHCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCC T ss_conf --5877999750348778636954789999999986489717863589747877640378899876415 No 112 >cd04265 DUF619-NAGS-U DUF619-NAGS-U: This CD includes the DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea (U) cycle found in humans and fish, as well as the DUF619 domain present C-terminal of a NAG kinase-like domain seen in a limited number of bacterial and Dictyostelium predicted NAGSs. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain is yet to be characterized. Probab=45.92 E-value=24 Score=16.34 Aligned_cols=32 Identities=16% Similarity=-0.005 Sum_probs=26.9 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHH Q ss_conf 78489986775254615998999999999999 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAI 119 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a 119 (185) ....+|--.+|.++.||.|+|..+-....+.. T Consensus 28 ~~vpyLDKFaV~~~a~G~gv~d~vw~~m~~d~ 59 (95) T cd04265 28 PGTPYLDKFAVSSSAQGEGLGRALWRCLREDF 59 (95) T ss_pred CCCEEEEEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 99878865788652102487999999998558 No 113 >KOG0470 consensus Probab=45.38 E-value=13 Score=17.94 Aligned_cols=40 Identities=8% Similarity=0.079 Sum_probs=30.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHH---HHHCCCEEHHHC Q ss_conf 9999999999999889979999508325---431385870527 Q gi|254780989|r 108 GSRLAYMSFMAIKNSGAEFVLIDAKHDW---HKHLGFKKPHQH 147 (185) Q Consensus 108 G~~Li~~~l~~a~~~G~~~v~l~~~~~~---Y~r~GF~~~~~~ 147 (185) =+.-.+..+.++|.+|+.+|.|..-.+| |..+||....=+ T Consensus 253 Y~~FteKvlphlK~LG~NaiqLmpi~Ef~~~~~s~GY~~~nFF 295 (757) T KOG0470 253 YLGFTEKVLPHLKKLGYNAIQLMPIFEFGHYYASWGYQVTNFF 295 (757) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHCCCCCEEEEE T ss_conf 2333442246787728361577510433446541585015760 No 114 >PRK09936 hypothetical protein; Provisional Probab=45.17 E-value=25 Score=16.27 Aligned_cols=31 Identities=16% Similarity=0.344 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHCCCCEEE-EECCHHHHHHC Q ss_conf 99999999999988997999-95083254313 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAEFVL-IDAKHDWHKHL 139 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~~v~-l~~~~~~Y~r~ 139 (185) +-.+...++.+++.|...++ |+.||+||+|. T Consensus 71 ~gwLa~~l~~A~~aGl~lvlGLyaDP~yf~~~ 102 (294) T PRK09936 71 RGWLFKRLAAAQQAGLKLVVGLYADPEYFMHQ 102 (294) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHH T ss_conf 03999999999866962888631585886565 No 115 >PRK02186 argininosuccinate lyase; Provisional Probab=43.67 E-value=25 Score=16.18 Aligned_cols=17 Identities=18% Similarity=0.266 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999999988997 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAE 125 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~ 125 (185) ..+...+...+...|+. T Consensus 237 ~~I~d~v~raL~AlGl~ 253 (892) T PRK02186 237 ERIVRTVLRALDAVGYA 253 (892) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999970866 No 116 >cd04736 MDH_FMN Mandelate dehydrogenase (MDH)-like FMN-binding domain. MDH is part of a widespread family of homologous FMN-dependent a-hydroxy acid oxidizing enzymes that oxidizes (S)-mandelate to phenylglyoxalate. MDH is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=41.83 E-value=16 Score=17.43 Aligned_cols=27 Identities=11% Similarity=0.085 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHH Q ss_conf 999999999998899799995083254 Q gi|254780989|r 110 RLAYMSFMAIKNSGAEFVLIDAKHDWH 136 (185) Q Consensus 110 ~Li~~~l~~a~~~G~~~v~l~~~~~~Y 136 (185) .+.+..++++++.|++.++|+.|.... T Consensus 128 ~~~~~li~RA~~aG~~alvlTvD~pv~ 154 (361) T cd04736 128 ELAELLVKRALAAGYTTLVLTTDVAVN 154 (361) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 999999999998599868995078887 No 117 >TIGR02277 PaaX_trns_reg phenylacetic acid degradation operon negative regulatory protein PaaX; InterPro: IPR011965 This transcriptional regulator is always found in association with operons believed to be involved in the degradation of phenylacetic acid . The gene product has been shown to bind to the promoter sites and repress their transcription .. Probab=41.04 E-value=14 Score=17.69 Aligned_cols=78 Identities=13% Similarity=0.172 Sum_probs=56.4 Q ss_pred CCCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHH-----HCCCEEHHHCCCCCCCCCCCHHHE Q ss_conf 678489986775254615998999999999999988997999950832543-----138587052782677988882322 Q gi|254780989|r 87 KYKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHK-----HLGFKKPHQHQIRYAHGGGAATDW 161 (185) Q Consensus 87 ~~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~-----r~GF~~~~~~~i~~~~~~~~~~~~ 161 (185) ....++||.| ..=.++-||+.++++.++-++.++|.=..-=.|.-+||. +--|..|.. .|..+-....++.| T Consensus 14 rGG~IWlG~L--i~~l~~~Gi~E~~VRTAvsRL~a~GWL~~Er~GRrSfY~L~D~G~~~~~~A~~-~IY~~p~~~W~g~W 90 (287) T TIGR02277 14 RGGAIWLGSL--IELLEGLGINERLVRTAVSRLVAQGWLQSERKGRRSFYRLSDKGRREFAAAAR-RIYDPPAPAWDGKW 90 (287) T ss_pred CCCEEEHHHH--HHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH-HHCCCCCCCCCCCE T ss_conf 8970337669--99962258883379999999865100010245441457788999999999987-40578886704115 Q ss_pred EEEECC Q ss_conf 654405 Q gi|254780989|r 162 LVHIFK 167 (185) Q Consensus 162 ~~~~l~ 167 (185) ....+. T Consensus 91 ~Ll~~~ 96 (287) T TIGR02277 91 ELLLLE 96 (287) T ss_pred EEEEEE T ss_conf 665530 No 118 >COG4671 Predicted glycosyl transferase [General function prediction only] Probab=38.90 E-value=31 Score=15.68 Aligned_cols=18 Identities=22% Similarity=0.285 Sum_probs=10.8 Q ss_pred HHHCCCCEEEEECCHHHH Q ss_conf 998899799995083254 Q gi|254780989|r 119 IKNSGAEFVLIDAKHDWH 136 (185) Q Consensus 119 a~~~G~~~v~l~~~~~~Y 136 (185) +-++-++.|+++|+|+|| T Consensus 163 ~I~r~yD~V~v~GdP~f~ 180 (400) T COG4671 163 LINRFYDLVLVYGDPDFY 180 (400) T ss_pred HHHHHHEEEEEECCCCCC T ss_conf 998754079994695415 No 119 >smart00475 53EXOc 5'-3' exonuclease. Probab=38.68 E-value=23 Score=16.46 Aligned_cols=70 Identities=10% Similarity=0.059 Sum_probs=43.0 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCC--------------CEEHHHCCCCCCCCCCCHHHEEEEECCCCCCCC Q ss_conf 999999999999988997999950832543138--------------587052782677988882322654405797657 Q gi|254780989|r 108 GSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLG--------------FKKPHQHQIRYAHGGGAATDWLVHIFKNNIMEN 173 (185) Q Consensus 108 G~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~G--------------F~~~~~~~i~~~~~~~~~~~~~~~~l~~~~~~~ 173 (185) +--++...-.++...|...+++.+|.|+++=.. ++......+...++-.|.+.--.+.|..+.-++ T Consensus 108 ADDvIatla~~~~~~~~~v~IvS~DkDl~QLv~~~v~v~~~~~~~~~~~~~~~~~v~e~~gv~P~q~~d~~aL~GD~sDn 187 (259) T smart00475 108 ADDVIATLAKKAEAEGYEVRIVSGDKDLLQLVSDKVSVLDPTKGIKEFELYTPENVIEKYGLTPEQIIDYKALMGDSSDN 187 (259) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCCCCCCEEECHHHHHHHHCCCHHHHHHHHHCCCCCCCC T ss_conf 79999999999998699269950883788848887199981589965679659999988489989986376404645358 Q ss_pred CEEE Q ss_conf 7069 Q gi|254780989|r 174 MIGK 177 (185) Q Consensus 174 ~~g~ 177 (185) +.|+ T Consensus 188 IpGV 191 (259) T smart00475 188 IPGV 191 (259) T ss_pred CCCC T ss_conf 9999 No 120 >TIGR01885 Orn_aminotrans ornithine--oxo-acid transaminase; InterPro: IPR010164 Ornithine aminotransferase catalyses the conversion of L-ornithine and a 2-oxo acid to L-glutamate 5-semialdehyde and an L-amino acid. This enzyme is found in low-GC bacteria, where it is responsible for the fourth step in arginine biosynthesis, and in the mitochondrial matrix of eukaryotes, where it controls L-ornithine levels in tissues. In human hereditary ornithine aminotransferase deficiency, the elevated levels of intraocular concentrations of ornithine are responsible for gyrate atrophy, which affects the CNS and peripheral nervous system ; GO: 0008483 transaminase activity, 0030170 pyridoxal phosphate binding. Probab=37.61 E-value=20 Score=16.81 Aligned_cols=10 Identities=30% Similarity=0.401 Sum_probs=3.8 Q ss_pred HHHHHCC-CEE Q ss_conf 2543138-587 Q gi|254780989|r 134 DWHKHLG-FKK 143 (185) Q Consensus 134 ~~Y~r~G-F~~ 143 (185) +|+..|| |.| T Consensus 161 ~~k~~FGPflP 171 (426) T TIGR01885 161 TSKKGFGPFLP 171 (426) T ss_pred HHHCCCCCCCC T ss_conf 20067879677 No 121 >COG2935 Putative arginyl-tRNA:protein arginylyltransferase [Posttranslational modification, protein turnover, chaperones] Probab=37.60 E-value=32 Score=15.55 Aligned_cols=67 Identities=12% Similarity=0.122 Sum_probs=45.6 Q ss_pred CCEEEEEEEEEEEEECCCCEEEEEEEE--ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCEE Q ss_conf 878647999999886067848998677--5254615998999999999999988997999950832543138587 Q gi|254780989|r 71 GDRIVGAIRATPIQIGKYKGFLRGPLG--VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFKK 143 (185) Q Consensus 71 d~~iVG~i~~~~~~~~~~~~~~l~~la--V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~~ 143 (185) .|++|+.+. +.+.-++ |+.|+ -+|++..+.+|+-.+=.=|.+|++.|...+.|-=..+=-.|++||. T Consensus 159 ~G~LvAVav-tDvL~dG-----lSsVY~FydPd~s~~SLGt~~iL~~I~~aq~~~l~yvYLGYwI~~c~KM~YKa 227 (253) T COG2935 159 EGKLVAVAV-TDVLPDG-----LSSVYTFYDPDMSKRSLGTLSILDQIAIAQRLGLPYVYLGYWIKGCPKMNYKA 227 (253) T ss_pred CCCEEEEEE-EECCCCC-----CEEEEEEECCCHHHHCCHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCC T ss_conf 870799986-2116676-----30489973777333152388999999999984999698878977860137624 No 122 >TIGR01323 nitrile_alph nitrile hydratase, alpha subunit; InterPro: IPR004232 Nitrile hydratases (4.2.1.84 from EC) are bacterial enzymes that catalyse the hydration of nitrile compounds to the corresponding amides. They are used as biocatalysts in acrylamide production, one of the few commercial scale bioprocesses, as well as in environmental remediation for the removal of nitriles from waste streams. Nitrile hydratases are composed of two subunits, alpha and beta, and are normally active as a tetramer, alpha(2)beta(2). Nitrile hydratases contain either a non-haem iron or a non-corrinoid cobalt centre, both types sharing a highly conserved peptide sequence in the alpha subunit (CXLCSC) that provides all the residues involved in coordinating the metal ion. Each type of nitrile hydratase specifically incorporated its metal with the help of activator proteins encoded by flanking regions of the nitrile hydratase genes that are necessary for metal insertion. The Fe-containing enzyme is photo-regulated: in the dark the enzyme is inactivated due to the association of nitric oxide (NO) to the iron, while in the light the enzyme is active by photo-dissociation of NO. The NO is held in place by a claw setting formed through specific oxygen atoms in two modified cysteines and a serine residue in the active site , . The cobalt-containing enzyme is unaffected by NO, but was shown to undergo a similar effect with carbon monoxide , . Fe- and cobalt-containing enzymes also display different inhibition patterns with nitrophenols. This entry represents the alpha subunit, which is a duplication of two structural repeats, each consisting of 4 layers, alpha/beta/beta/alpha.; GO: 0018822 nitrile hydratase activity, 0046914 transition metal ion binding, 0006807 nitrogen compound metabolic process. Probab=36.86 E-value=33 Score=15.48 Aligned_cols=64 Identities=17% Similarity=0.251 Sum_probs=43.8 Q ss_pred ECHHHHCCCHHHHHHHHHHHHHHHCCCC------EEEEE----------------------CC-HHHHHHCCCEEHHH-- Q ss_conf 5254615998999999999999988997------99995----------------------08-32543138587052-- Q gi|254780989|r 98 VLSEYRKRGIGSRLAYMSFMAIKNSGAE------FVLID----------------------AK-HDWHKHLGFKKPHQ-- 146 (185) Q Consensus 98 V~P~~qg~GiG~~Li~~~l~~a~~~G~~------~v~l~----------------------~~-~~~Y~r~GF~~~~~-- 146 (185) |||+|| +.|.+.+-..+++.|+. .+.|+ |- |++|+.|-|+.--. T Consensus 46 VDPefr-----~~Ll~DaTaAca~~Gy~G~Qge~ivalent~~v~nvvVCtLCSCypWPVLGLPP~WYK~~eyRaR~Vrd 120 (186) T TIGR01323 46 VDPEFR-----KLLLKDATAACAELGYAGEQGEEIVALENTSGVHNVVVCTLCSCYPWPVLGLPPEWYKSFEYRARVVRD 120 (186) T ss_pred CCHHHH-----HHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCHHEEEEEC T ss_conf 697899-----988643789999718875461348898628963278884113788888888785200440010001126 Q ss_pred -CCCCC-CCCCCCHHHEEEEEC Q ss_conf -78267-798888232265440 Q gi|254780989|r 147 -HQIRY-AHGGGAATDWLVHIF 166 (185) Q Consensus 147 -~~i~~-~~~~~~~~~~~~~~l 166 (185) .++.. .+|...|..--||.= T Consensus 121 pRgVL~refG~~~p~dv~irvW 142 (186) T TIGR01323 121 PRGVLKREFGYELPDDVEIRVW 142 (186) T ss_pred CCCCHHHHCCCCCCCCCEEEEE T ss_conf 7600134527999797079884 No 123 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=36.16 E-value=33 Score=15.49 Aligned_cols=33 Identities=9% Similarity=0.145 Sum_probs=25.4 Q ss_pred EECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 75254615998999999999999988997999950 Q gi|254780989|r 97 GVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDA 131 (185) Q Consensus 97 aV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~ 131 (185) ..+|+.|.+++ .+++.+++.|...|.+.+.+.+ T Consensus 83 s~d~~~r~~~l--e~l~kaI~lA~~LGi~~I~l~g 115 (284) T PRK13210 83 SRDEATRERAL--EIMKKAIRLAQDLGIRTIQLAG 115 (284) T ss_pred CCCHHHHHHHH--HHHHHHHHHHHHCCCCEEEECC T ss_conf 98989999999--9999999999980997899688 No 124 >COG2243 CobF Precorrin-2 methylase [Coenzyme metabolism] Probab=35.55 E-value=35 Score=15.35 Aligned_cols=23 Identities=9% Similarity=0.041 Sum_probs=13.5 Q ss_pred CC-CCEEEEECCHHHHHHCCCEEH Q ss_conf 89-979999508325431385870 Q gi|254780989|r 122 SG-AEFVLIDAKHDWHKHLGFKKP 144 (185) Q Consensus 122 ~G-~~~v~l~~~~~~Y~r~GF~~~ 144 (185) .| .-+.+..|||.||+.|++... T Consensus 89 ~G~~VAf~~lGDP~~YsTf~~l~~ 112 (234) T COG2243 89 AGRDVAFLTLGDPTFYSTFMYLLE 112 (234) T ss_pred CCCEEEEEECCCCCHHHHHHHHHH T ss_conf 699289998148528887999999 No 125 >cd04264 DUF619-NAGS DUF619-NAGS: This CD includes the DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain seen in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain is yet to be characterized. Probab=34.69 E-value=36 Score=15.27 Aligned_cols=31 Identities=16% Similarity=-0.057 Sum_probs=26.6 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHH Q ss_conf 7848998677525461599899999999999 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMA 118 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~ 118 (185) ....+|-..+|.++.||.|+|..+.....+. T Consensus 32 ~~vpyLDKFaV~~~~~g~gv~d~vw~~m~~d 62 (100) T cd04264 32 EGIPYLDKFAVSSSAQGEGLGDILWRALRED 62 (100) T ss_pred CCCCEEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 9872665588822221468899999999713 No 126 >TIGR00222 panB 3-methyl-2-oxobutanoate hydroxymethyltransferase; InterPro: IPR003700 The panB gene from Escherichia coli encodes the first enzyme of the pantothenate biosynthesis pathway, ketopantoate hydroxymethyltransferase (KPHMT) 2.1.2.11 from EC. Fungal ketopantoate hydroxymethyltransferase is essential for the biosynthesis of coenzyme A, while the pathway intermediate 4'-phosphopantetheine is required for penicillin production .; GO: 0003864 3-methyl-2-oxobutanoate hydroxymethyltransferase activity, 0015940 pantothenate biosynthetic process. Probab=33.42 E-value=29 Score=15.81 Aligned_cols=51 Identities=14% Similarity=0.363 Sum_probs=34.5 Q ss_pred EEEEEEEEEEEEECCCCEEEEEEEEECHHHHCCC--HHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 8647999999886067848998677525461599--899999999999998899799995083 Q gi|254780989|r 73 RIVGAIRATPIQIGKYKGFLRGPLGVLSEYRKRG--IGSRLAYMSFMAIKNSGAEFVLIDAKH 133 (185) Q Consensus 73 ~iVG~i~~~~~~~~~~~~~~l~~laV~P~~qg~G--iG~~Li~~~l~~a~~~G~~~v~l~~~~ 133 (185) -||||+.|+|=.+.-.-++-+ |||+ -+++|++.+|. +.+.|...++|++-| T Consensus 135 pV~GH~GLtPQsV~~~GGYk~---------qGk~~~~~~kLL~dAl~-LeeAGv~lLVLE~vp 187 (267) T TIGR00222 135 PVVGHLGLTPQSVRLLGGYKV---------QGKDEEAAKKLLEDALA-LEEAGVQLLVLECVP 187 (267) T ss_pred EEEECCCCCCCEEEECCCEEE---------CCCCHHHHHHHHHHHHH-HHHHHHHHHHHCCCC T ss_conf 587414788733400574000---------27887888889999999-998766743532768 No 127 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=32.08 E-value=40 Score=15.01 Aligned_cols=32 Identities=16% Similarity=0.227 Sum_probs=21.4 Q ss_pred CCCHHHHHHHHHHHHHHHCC--CCEEEEEC-CHHH Q ss_conf 59989999999999999889--97999950-8325 Q gi|254780989|r 104 KRGIGSRLAYMSFMAIKNSG--AEFVLIDA-KHDW 135 (185) Q Consensus 104 g~GiG~~Li~~~l~~a~~~G--~~~v~l~~-~~~~ 135 (185) ..=+-..++++.++.+++.+ ....+... +.-| T Consensus 85 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (270) T TIGR00099 85 KKPLDLDLVEELLNFLKKHGQELDVILYSDSDSIY 119 (270) T ss_pred ECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCEEE T ss_conf 02478889999999997348925799981776256 No 128 >TIGR01488 HAD-SF-IB HAD-superfamily hydrolase, subfamily IB (PSPase-like); InterPro: IPR006383 This group represents a subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. Subfamily IA, B, C and D are distinguished from the rest of the superfamily by the presence of a variable domain between the first and second conserved catalytic motifs. In subfamilies IA and IB, this domain consists of an alpha-helical bundle. It was necessary to model these two subfamilies separately, breaking them at an apparent phylogenetic bifurcation, so that the resulting model(s) are not so broadly defined that members of subfamily III (which lack the variable domain) are included. ; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=31.48 E-value=41 Score=14.95 Aligned_cols=33 Identities=21% Similarity=0.250 Sum_probs=28.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCHH-----HHHHCCCEE Q ss_conf 999999999988997999950832-----543138587 Q gi|254780989|r 111 LAYMSFMAIKNSGAEFVLIDAKHD-----WHKHLGFKK 143 (185) Q Consensus 111 Li~~~l~~a~~~G~~~v~l~~~~~-----~Y~r~GF~~ 143 (185) -++..++++|++|...+++.|.++ +++++|+.. T Consensus 92 ~~~~~~~~~~~~G~~~~ivSgg~~~~~~~~~e~~g~~~ 129 (204) T TIGR01488 92 GARELIKWLKEKGIKTVIVSGGLDFLVEPLAEKLGVDE 129 (204) T ss_pred HHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCHHH T ss_conf 08999999984799299980886799999997528434 No 129 >TIGR02412 pepN_strep_liv aminopeptidase N; InterPro: IPR012778 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family is a subset of the members of the zinc metallopeptidases belonging to MEROPS peptidase family M1 (aminopeptidase N, clan MA), with a single member characterised in Streptomyces lividans 66: aminopeptidase G . The rest of the members of this family are identified as aminopeptidase N of the actinomycete-type. The spectrum of activity may differ somewhat from the aminopeptidase N clade of Escherichia coli and most other proteobacteria, which are well separated phylogenetically within the M1 family. . Probab=30.63 E-value=23 Score=16.41 Aligned_cols=75 Identities=19% Similarity=0.246 Sum_probs=45.7 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCEEHHH-CCCCCCCCCCCHHHEEEEECCCCCCCCCEEEE Q ss_conf 54615998999999999999988997999950832543138587052-78267798888232265440579765770699 Q gi|254780989|r 100 SEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFKKPHQ-HQIRYAHGGGAATDWLVHIFKNNIMENMIGKM 178 (185) Q Consensus 100 P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~~~~~-~~i~~~~~~~~~~~~~~~~l~~~~~~~~~g~~ 178 (185) .+.++-=+| ..-+.++..-=+.-.+-++- +-|=||.-.+. |++++|+. -+|+.|. =|+.-|++|| .|-| T Consensus 218 ~~~~s~PLG-iyaR~SlA~~ld~d~D~~F~------iTrqGl~ff~~~F~~PyPf~-KYDQ~FV-PEfn~GAMEN-~G~V 287 (884) T TIGR02412 218 KEARSIPLG-IYARRSLAEYLDADADDIFD------ITRQGLDFFHRKFGYPYPFK-KYDQVFV-PEFNAGAMEN-AGAV 287 (884) T ss_pred CCCCCCCHH-HHHHHHHHHHCCCCCCHHHH------HHHHHHHHHHHHCCCCCCCC-CCCEEEE-CCCCCCCCCC-CCCC T ss_conf 688867558-76776467543833111778------87755788996486777988-6652561-4227862114-6761 Q ss_pred EECCC-C Q ss_conf 80667-8 Q gi|254780989|r 179 SLRRE-Y 184 (185) Q Consensus 179 ~~~~~-f 184 (185) ++... | T Consensus 288 T~~e~hf 294 (884) T TIGR02412 288 TFAENHF 294 (884) T ss_pred EEECCCE T ss_conf 2104853 No 130 >TIGR01467 cobI_cbiL precorrin-2 C20-methyltransferase; InterPro: IPR006364 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. In the anaerobic pathway, cobalt is inserted into precorrin-2 by CbiK to generate cobalt-precorrin-2, which is the substrate for CbiL, a C20 methyltransferase . In Clostridium difficile, CbiK and CbiL are fused into a bifunctional enzyme. In the aerobic pathway, CobI catalyses the methylation of precorrin-2 to precorrin-3a. This entry represents CbiL and CobI precorrin-2 C20-methyltransferases (2.1.1.130 from EC), both as stand-alone enzymes and when CbiL forms part of a bifunctional enzyme. This entry also includes other closely related S-adenosylmethionine-dependent methyltransferases involved in Cbl biosynthesis. ; GO: 0008757 S-adenosylmethionine-dependent methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=30.00 E-value=43 Score=14.80 Aligned_cols=38 Identities=3% Similarity=-0.065 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCEEHHHC Q ss_conf 99999999999889979999508325431385870527 Q gi|254780989|r 110 RLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFKKPHQH 147 (185) Q Consensus 110 ~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~~~~~~ 147 (185) ...+...+.+++-+.=+.+.+|||.+|+-|.|....-. T Consensus 88 ~~A~~v~~~l~~G~~Vaf~TlGDP~~YSTF~yl~~~l~ 125 (243) T TIGR01467 88 EAAEAVAAELEEGRDVAFITLGDPSLYSTFSYLLQRLK 125 (243) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHH T ss_conf 99999999984698289998457202237999999998 No 131 >pfam03928 DUF336 Domain of unknown function (DUF336). This family contains uncharacterized sequences, including several GlcG proteins. The alignment contains many conserved motifs that are suggestive of cofactor binding and enzymatic activity. Probab=29.79 E-value=44 Score=14.77 Aligned_cols=23 Identities=13% Similarity=0.153 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 99999999999998899799995 Q gi|254780989|r 108 GSRLAYMSFMAIKNSGAEFVLID 130 (185) Q Consensus 108 G~~Li~~~l~~a~~~G~~~v~l~ 130 (185) ..+|++.+++.++++|+...+.. T Consensus 8 A~~l~~~a~~~a~~~g~~vsvaV 30 (132) T pfam03928 8 AWKLGAAAVAAARELGVRVTVAV 30 (132) T ss_pred HHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99999999999998199889999 No 132 >pfam02334 RTP Replication terminator protein. The bacterial replication terminator protein (RTP) plays a role in the termination of DNA replication by impeding replication fork movement. Two RTP dimers bind to the two inverted repeat regions at the termination site. Probab=28.98 E-value=45 Score=14.69 Aligned_cols=24 Identities=21% Similarity=0.149 Sum_probs=20.0 Q ss_pred HHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 615998999999999999988997 Q gi|254780989|r 102 YRKRGIGSRLAYMSFMAIKNSGAE 125 (185) Q Consensus 102 ~qg~GiG~~Li~~~l~~a~~~G~~ 125 (185) -|++|+|.+|++..-+..+..|+. T Consensus 28 eq~r~YG~qlL~~Lr~eFk~~Gy~ 51 (122) T pfam02334 28 EQERLYGLKLLEVLRSEFKEIGFK 51 (122) T ss_pred HHCCCHHHHHHHHHHHHHHHCCCC T ss_conf 502002799999999987852889 No 133 >KOG2509 consensus Probab=27.78 E-value=47 Score=14.56 Aligned_cols=50 Identities=12% Similarity=0.165 Sum_probs=36.4 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEE--CCHHHHHHCCCEEHHHCCCCCCCCCC Q ss_conf 899999999999998899799995--08325431385870527826779888 Q gi|254780989|r 107 IGSRLAYMSFMAIKNSGAEFVLID--AKHDWHKHLGFKKPHQHQIRYAHGGG 156 (185) Q Consensus 107 iG~~Li~~~l~~a~~~G~~~v~l~--~~~~~Y~r~GF~~~~~~~i~~~~~~~ 156 (185) +-.+|++++++.+.++|+.-+..- ..-+.-.+.||.+-.+....+..++. T Consensus 187 LeqALi~yal~~l~~kGy~pl~~P~i~rkeVm~~cg~~~~~d~~~~y~ld~~ 238 (455) T KOG2509 187 LEQALINYALDFLNAKGYTPLTTPDILRKEVMQKCGQLPRFDEEQYYVLDGG 238 (455) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEEECCC T ss_conf 9999999999999976982466814536999998565767883217860688 No 134 >TIGR02768 TraA_Ti Ti-type conjugative transfer relaxase TraA; InterPro: IPR014136 This entry represents the Ti-type conjugative transfer relaxase TraA. TraA contains domains distinctive of a single strand exonuclease (N-terminus, MobA/MobL, IPR005053 from INTERPRO) as well as a helicase domain (central region, homologous to the corresponding region of the F-type relaxase TraI, IPR014129 from INTERPRO). This protein likely fills the same role as TraI(F), nicking (at the oriT site) and unwinding the coiled plasmid prior to conjugative transfer .. Probab=27.40 E-value=48 Score=14.52 Aligned_cols=37 Identities=16% Similarity=0.355 Sum_probs=26.3 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCHH-------------HHHHCCCEEH Q ss_conf 8999999999999988997999950832-------------5431385870 Q gi|254780989|r 107 IGSRLAYMSFMAIKNSGAEFVLIDAKHD-------------WHKHLGFKKP 144 (185) Q Consensus 107 iG~~Li~~~l~~a~~~G~~~v~l~~~~~-------------~Y~r~GF~~~ 144 (185) +|++=|...|+.|.+.|.+. +|+|||+ -=+|.||... T Consensus 513 V~S~Qm~r~l~~A~~AGaKv-VLvGD~~QLqaI~AGaaFRaia~riG~~El 562 (888) T TIGR02768 513 VGSRQMARVLKEAEEAGAKV-VLVGDPEQLQAIEAGAAFRAIAERIGYAEL 562 (888) T ss_pred HHHHHHHHHHHHHHHCCCCE-EEECCHHHHCHHHCCCCCHHHHHHCCEEEE T ss_conf 14677889999998727605-984885782442447500123410140325 No 135 >PRK05576 cobalt-precorrin-2 C(20)-methyltransferase; Validated Probab=27.05 E-value=46 Score=14.64 Aligned_cols=23 Identities=9% Similarity=0.083 Sum_probs=12.9 Q ss_pred HCCCCEEEE-ECCHHHHHHCCCEE Q ss_conf 889979999-50832543138587 Q gi|254780989|r 121 NSGAEFVLI-DAKHDWHKHLGFKK 143 (185) Q Consensus 121 ~~G~~~v~l-~~~~~~Y~r~GF~~ 143 (185) ..|.+.++| .|||.+|.++++.. T Consensus 88 ~~G~~Va~l~~GDP~iyst~~~~~ 111 (224) T PRK05576 88 EEGKNVAFITLGDPNVYSTFSHLL 111 (224) T ss_pred HCCCEEEEEECCCCCHHCHHHHHH T ss_conf 779949999368962112099999 No 136 >TIGR02482 PFKA_ATP 6-phosphofructokinase; InterPro: IPR012828 6-phosphofructokinase (2.7.1.11 from EC) catalyses the addition of phosphate from ATP to fructose 6-phosphate to give fructose 1,6-bisphosphate. This represents a key control step in glycolysis. This entry contains bacterial ATP-dependent 6-phosphofructokinases, which lack a beta-hairpin loop present in IPR012829 from INTERPRO family members. IPR012829 from INTERPRO contains members that are ATP-dependent as well as members that are pyrophosphate-dependent. IPR011183 from INTERPRO represents the pyrophosphate-dependent phosphofructokinase, diphosphate--fructose-6-phosphate 1-phosphotransferase (2.7.1.90 from EC).; GO: 0003872 6-phosphofructokinase activity, 0005524 ATP binding, 0006002 fructose 6-phosphate metabolic process, 0006096 glycolysis. Probab=26.96 E-value=49 Score=14.47 Aligned_cols=41 Identities=10% Similarity=0.044 Sum_probs=32.4 Q ss_pred HHHHHHHHHCCCCEEEEEC-C------HHHHHHCCCEEHHHCCCCCCCCCC Q ss_conf 9999999988997999950-8------325431385870527826779888 Q gi|254780989|r 113 YMSFMAIKNSGAEFVLIDA-K------HDWHKHLGFKKPHQHQIRYAHGGG 156 (185) Q Consensus 113 ~~~l~~a~~~G~~~v~l~~-~------~~~Y~r~GF~~~~~~~i~~~~~~~ 156 (185) +.+.+.+|+.|.+..+|.| | .-+|+.+|++-. +|+.+.||. T Consensus 81 ~kA~~nLK~~GI~~LVViGGDGSy~GA~~L~~~gg~~~i---GlPGTIDND 128 (302) T TIGR02482 81 EKAVENLKKLGIEALVVIGGDGSYTGAQKLYEEGGIPVI---GLPGTIDND 128 (302) T ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCEE---EECCCCCCC T ss_conf 999999887488668998684406889999971798478---745850256 No 137 >TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative; InterPro: IPR004560 This family shows similarity to other isomerases. Putative identification as hexulose-6-phosphate isomerase has been reported. This family is conserved at better than 40 0dentity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.; GO: 0016861 intramolecular oxidoreductase activity interconverting aldoses and ketoses, 0005975 carbohydrate metabolic process. Probab=26.95 E-value=49 Score=14.47 Aligned_cols=46 Identities=13% Similarity=0.217 Sum_probs=33.0 Q ss_pred EEEECH-HHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCE Q ss_conf 677525-461599899999999999998899799995083254313858 Q gi|254780989|r 95 PLGVLS-EYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFK 142 (185) Q Consensus 95 ~laV~P-~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~ 142 (185) |++-.. .-|. -|...|+.+++.++..|.+.+.|-|-.=||+...=. T Consensus 80 PlGS~d~~~r~--~~~eIm~KAi~La~~lGIR~IQLAgYDVYYe~~d~e 126 (290) T TIGR00542 80 PLGSKDKAVRQ--QGLEIMEKAIQLARDLGIRIIQLAGYDVYYEEHDEE 126 (290) T ss_pred CCCCCCHHHHH--HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCH T ss_conf 89885178999--999999999999975796066223330033557604 No 138 >PRK05990 precorrin-2 C(20)-methyltransferase; Reviewed Probab=26.33 E-value=50 Score=14.40 Aligned_cols=23 Identities=9% Similarity=0.238 Sum_probs=13.8 Q ss_pred HCCCCEEEE-ECCHHHHHHCCCEE Q ss_conf 889979999-50832543138587 Q gi|254780989|r 121 NSGAEFVLI-DAKHDWHKHLGFKK 143 (185) Q Consensus 121 ~~G~~~v~l-~~~~~~Y~r~GF~~ 143 (185) +.|.+.+++ .|||.+|..|++.. T Consensus 96 ~~G~~Va~lt~GDp~~YsT~~yl~ 119 (241) T PRK05990 96 DAGRDVAVICEGDPFFYGSYMYLH 119 (241) T ss_pred HCCCEEEEEECCCCHHHHHHHHHH T ss_conf 769969999577943664699999 No 139 >PRK13795 hypothetical protein; Provisional Probab=26.14 E-value=51 Score=14.38 Aligned_cols=60 Identities=17% Similarity=0.135 Sum_probs=34.4 Q ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEEEEECCEEEEEEEEEE Q ss_conf 5137975789999999997278766675615202541200025766548999989878647999999 Q gi|254780989|r 16 FANEQRVDQVAVDEVMVKAFLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFVCRDGDRIVGAIRATP 82 (185) Q Consensus 16 IR~a~~~D~~aI~~L~~~aF~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~Vae~d~~iVG~i~~~~ 82 (185) +|++-|.|..-|.++..+.|+.......+.+. + +.+....+. ..=...||.++|++.|-+ T Consensus 45 ~Rpaf~~D~e~i~~~l~~~fG~~~~~~~~~~v--L---LNK~Pg~D~--~~EIivdG~v~g~l~yd~ 104 (630) T PRK13795 45 VRPAFPYDIEFIREVLLEEFGDAPLIPEDKLV--L---LNKIPGEDK--ADEVIVDGRVFGHLRFDL 104 (630) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEE--E---EECCCCCCC--CCEEEECCEEEEEEEECC T ss_conf 67578789999999999873985457788489--9---846898664--426887777999999713 No 140 >TIGR01500 sepiapter_red sepiapterin reductase; InterPro: IPR006393 These sequences represent sepiapterin reductase, a member of the short chain dehydrogenase/reductase family. The enzyme catalyzes the last step in the biosynthesis of tetrahydrobiopterin. A similar enzyme in Bacillus cereus was isolated for its ability to convert benzil to (S)-benzoin, a property sepiapterin reductase also shares. ; GO: 0004757 sepiapterin reductase activity, 0006729 tetrahydrobiopterin biosynthetic process. Probab=26.09 E-value=45 Score=14.70 Aligned_cols=14 Identities=36% Similarity=0.368 Sum_probs=4.9 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 99899999999999 Q gi|254780989|r 105 RGIGSRLAYMSFMA 118 (185) Q Consensus 105 ~GiG~~Li~~~l~~ 118 (185) ||+||.|.+...+. T Consensus 10 rGfGr~~Aq~lak~ 23 (267) T TIGR01500 10 RGFGRELAQELAKR 23 (267) T ss_pred CHHHHHHHHHHHHC T ss_conf 01568999998730 No 141 >PRK09482 xni exonuclease IX; Provisional Probab=25.99 E-value=51 Score=14.36 Aligned_cols=70 Identities=9% Similarity=0.036 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCC-------C---EEHHHCCCCCCCCCCCHHHEEEEECCCCCCCCCEEE Q ss_conf 999999999999988997999950832543138-------5---870527826779888823226544057976577069 Q gi|254780989|r 108 GSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLG-------F---KKPHQHQIRYAHGGGAATDWLVHIFKNNIMENMIGK 177 (185) Q Consensus 108 G~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~G-------F---~~~~~~~i~~~~~~~~~~~~~~~~l~~~~~~~~~g~ 177 (185) +--+|...-.++.+.|+..+++.+|-|+++=.. + +......+.-.|+-.|.+.-..+.|..+.-+++.|+ T Consensus 108 ADDiIatla~~~~~~~~~v~IvS~DkDl~QLv~~~v~v~~~~~~~~~~~~~v~ek~Gv~P~q~~d~~aL~GD~SDNIPGV 187 (256) T PRK09482 108 ADDLIATLAVKVAQAGHQATIVSTDKGYCQLLSPTIQIRDYFQKRWLDIPFIEQEFGVEPQQLPDYWGLAGISSSKIPGV 187 (256) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCCCCEEEEECCCCCCCCHHHHHHHCCCCHHHHHHEEEEECCCCCCCCCC T ss_conf 88999999999997799699996688899967598499978677131799987852828735323024746543689999 No 142 >pfam06849 DUF1246 Protein of unknown function (DUF1246). This family represents the N-terminus of a number of hypothetical archaeal proteins of unknown function. Probab=25.96 E-value=51 Score=14.36 Aligned_cols=29 Identities=14% Similarity=0.238 Sum_probs=20.2 Q ss_pred HHHHHHHHCCCCEEEEE--CCHHHHHHCCCE Q ss_conf 99999998899799995--083254313858 Q gi|254780989|r 114 MSFMAIKNSGAEFVLID--AKHDWHKHLGFK 142 (185) Q Consensus 114 ~~l~~a~~~G~~~v~l~--~~~~~Y~r~GF~ 142 (185) +.++=||+.|.+.+.+. +...+|.+|+|. T Consensus 11 qIl~GAK~EGF~Tv~vc~~gr~~~Y~~f~~~ 41 (124) T pfam06849 11 QILDGAKDEGFRTVAVCQKGREKFYRRFPFV 41 (124) T ss_pred HHHHHHHHCCCCEEEEEECCCCCHHHHCCCC T ss_conf 9972388739957899817975213326867 No 143 >TIGR02089 TTC tartrate dehydrogenase; InterPro: IPR011829 Tartrate dehydrogenase catalyzes the oxidation of both meso- and (+)-tartrate as well as a D-malate . These enzymes are closely related to the 3-isopropylmalate and isohomocitrate dehydrogenases found in IPR004429 from INTERPRO and IPR011828 from INTERPRO, respectively.; GO: 0009027 tartrate dehydrogenase activity, 0051287 NAD binding. Probab=25.88 E-value=51 Score=14.35 Aligned_cols=29 Identities=10% Similarity=0.351 Sum_probs=19.8 Q ss_pred HHCCCHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 615998999999999999988997999950 Q gi|254780989|r 102 YRKRGIGSRLAYMSFMAIKNSGAEFVLIDA 131 (185) Q Consensus 102 ~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~ 131 (185) +=|+|+ .+.|+.+.+.|+++-...+-..+ T Consensus 162 fTR~Gv-eRilrFAFeLA~~RPrk~lTs~T 190 (355) T TIGR02089 162 FTRKGV-ERILRFAFELAQKRPRKHLTSAT 190 (355) T ss_pred EECHHH-HHHHHHHHHHHHHCCCCCEEEEE T ss_conf 430014-68889999999757864412256 No 144 >TIGR01263 4HPPD 4-hydroxyphenylpyruvate dioxygenase; InterPro: IPR005956 4-hydroxyphenylpyruvate dioxygenase (1.13.11.27 from EC) oxidizes 4-hydroxyphenylpyruvate, a tyrosine and phenylalanine catabolite, to homogentisate. Homogentisate can undergo a further non-enzymatic oxidation and polymerization into brown pigments that protect some bacterial species from light. A similar process occurs spontaneously in blood and is hemolytic . In some bacterial species, this enzyme has been studied as a hemolysin.; GO: 0003868 4-hydroxyphenylpyruvate dioxygenase activity, 0009072 aromatic amino acid family metabolic process. Probab=25.83 E-value=44 Score=14.77 Aligned_cols=15 Identities=20% Similarity=0.541 Sum_probs=9.1 Q ss_pred HHHHH-CCCEEHHHCC Q ss_conf 25431-3858705278 Q gi|254780989|r 134 DWHKH-LGFKKPHQHQ 148 (185) Q Consensus 134 ~~Y~r-~GF~~~~~~~ 148 (185) +||++ |||+....+. T Consensus 187 ~fY~~~~gF~~~~~fd 202 (379) T TIGR01263 187 DFYEKIFGFHEIRSFD 202 (379) T ss_pred HHHHHHHCCEEEEEEE T ss_conf 9999983984799996 No 145 >COG5092 NMT1 N-myristoyl transferase [Lipid metabolism] Probab=25.55 E-value=52 Score=14.31 Aligned_cols=31 Identities=16% Similarity=0.065 Sum_probs=13.8 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHCC Q ss_conf 9867752546159989999999999999889 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNSG 123 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G 123 (185) +.-|.||.+.|++-+.--||.+.-+++...| T Consensus 168 vNFLCihk~lRsKRltPvLIkEiTRR~n~~~ 198 (451) T COG5092 168 VNFLCIHKELRSKRLTPVLIKEITRRANVDG 198 (451) T ss_pred EEEEEEEHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 7899974555307562089999987631442 No 146 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=25.49 E-value=29 Score=15.82 Aligned_cols=70 Identities=11% Similarity=0.057 Sum_probs=40.2 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHHHHHCC----------CEEHHHCCCCCCCCCCCHHHEEEEECCCCCCCCCEEE Q ss_conf 999999999999988997999950832543138----------5870527826779888823226544057976577069 Q gi|254780989|r 108 GSRLAYMSFMAIKNSGAEFVLIDAKHDWHKHLG----------FKKPHQHQIRYAHGGGAATDWLVHIFKNNIMENMIGK 177 (185) Q Consensus 108 G~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~G----------F~~~~~~~i~~~~~~~~~~~~~~~~l~~~~~~~~~g~ 177 (185) +--++...-.++...|...+++.+|-|+++=.. .+......+...++-.+......+.|..+.-+++.|+ T Consensus 109 ADD~ia~la~~~~~~~~~v~IvS~DkD~~QLv~~~v~~~~~~~~~~~~~~~v~~~~gv~p~q~~d~~aL~GD~sDnIpGV 188 (240) T cd00008 109 ADDVIGTLAKKAEAEGYKVVIVSGDKDLLQLVSDNVKVVSPMKKKLVTEENVIEKYGVTPAQIIDYKALMGDSSDNIPGV 188 (240) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCC T ss_conf 89999999999986697069962880688872433001355557657889999960989999987699728764688999 No 147 >COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism] Probab=25.26 E-value=53 Score=14.28 Aligned_cols=39 Identities=5% Similarity=0.047 Sum_probs=27.9 Q ss_pred ECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHH Q ss_conf 52546159989999999999999889979999508325431 Q gi|254780989|r 98 VLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKH 138 (185) Q Consensus 98 V~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r 138 (185) -+|.-+.+ +..+|+.+++.|+..|.+.+.|-|-.=||+. T Consensus 86 ~D~~~r~~--aleiM~KaI~LA~dLGIRtIQLAGYDVYYE~ 124 (287) T COG3623 86 KDEATRQQ--ALEIMEKAIQLAQDLGIRTIQLAGYDVYYEE 124 (287) T ss_pred CCHHHHHH--HHHHHHHHHHHHHHHCCEEEEECCCEEEECC T ss_conf 98789999--9999999999999708426763242344146 No 148 >TIGR03175 AllD ureidoglycolate dehydrogenase. This enzyme converts ureidoglycolate to oxalureate in the non-urea-forming catabolism of allantoin (GenProp0687). The pathway has been characterized in E. coli and is observed in the genomes of Entercoccus faecalis and Bacillus licheniformis. Probab=24.48 E-value=54 Score=14.19 Aligned_cols=33 Identities=15% Similarity=0.064 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEC-----CHHHHHHC Q ss_conf 8999999999999988997999950-----83254313 Q gi|254780989|r 107 IGSRLAYMSFMAIKNSGAEFVLIDA-----KHDWHKHL 139 (185) Q Consensus 107 iG~~Li~~~l~~a~~~G~~~v~l~~-----~~~~Y~r~ 139 (185) .+.+.|+.+++++|+.|...|.+-. .-.||.+. T Consensus 89 ~~~~a~~~aiekA~~~Gi~~v~v~ns~H~G~~g~y~~~ 126 (349) T TIGR03175 89 AAKMAMEHAIEIAKKSGVAVVGISRMSHSGALSYFVRQ 126 (349) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHH T ss_conf 99999999999999709279982688874305999999 No 149 >PRK10513 sugar phosphatase; Provisional Probab=24.45 E-value=55 Score=14.19 Aligned_cols=15 Identities=13% Similarity=0.003 Sum_probs=8.5 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 797578999999999 Q gi|254780989|r 19 EQRVDQVAVDEVMVK 33 (185) Q Consensus 19 a~~~D~~aI~~L~~~ 33 (185) -.|....+|.++..+ T Consensus 21 i~~~~~~ai~~l~~~ 35 (270) T PRK10513 21 ISPAVKNAIAAARAK 35 (270) T ss_pred CCHHHHHHHHHHHHC T ss_conf 899999999999978 No 150 >COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism] Probab=23.97 E-value=56 Score=14.13 Aligned_cols=48 Identities=15% Similarity=0.108 Sum_probs=40.2 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEC--CHHHHHHCCCEEHHHCCCCCCCCCC Q ss_conf 99999999999988997999950--8325431385870527826779888 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAEFVLIDA--KHDWHKHLGFKKPHQHQIRYAHGGG 156 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~~v~l~~--~~~~Y~r~GF~~~~~~~i~~~~~~~ 156 (185) ..+..+.|..++++|++.|-|.- .-.|+...||....=|.....||.- T Consensus 164 ~e~a~~llpYl~elG~T~IELMPv~e~p~~~sWGYq~~g~yAp~sryGtP 213 (628) T COG0296 164 FELAIELLPYLKELGITHIELMPVAEHPGDRSWGYQGTGYYAPTSRYGTP 213 (628) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCH T ss_conf 99999875899970987799714435798888777743001565567998 No 151 >PRK05431 seryl-tRNA synthetase; Provisional Probab=23.64 E-value=57 Score=14.09 Aligned_cols=43 Identities=9% Similarity=0.008 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEE--ECCHHHHHHCCCEEHHHCCC Q ss_conf 89999999999999889979999--50832543138587052782 Q gi|254780989|r 107 IGSRLAYMSFMAIKNSGAEFVLI--DAKHDWHKHLGFKKPHQHQI 149 (185) Q Consensus 107 iG~~Li~~~l~~a~~~G~~~v~l--~~~~~~Y~r~GF~~~~~~~i 149 (185) +-++|++.+++.+.+.|+..+.. ..+.+-...-||.+-.+-++ T Consensus 172 LerALi~f~ld~~~~~Gy~~v~~P~lv~~~~~~gtGqlPkfe~~~ 216 (422) T PRK05431 172 LERALIQFMLDLHTEEGYTEVIPPYLVNEESMYGTGQLPKFEEDL 216 (422) T ss_pred HHHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHHCCCCCCHHHH T ss_conf 999999999999998799799673763678874453797448775 No 152 >PRK10966 exonuclease subunit SbcD; Provisional Probab=23.38 E-value=57 Score=14.06 Aligned_cols=11 Identities=9% Similarity=0.060 Sum_probs=4.2 Q ss_pred EEEEECCCCCC Q ss_conf 26544057976 Q gi|254780989|r 161 WLVHIFKNNIM 171 (185) Q Consensus 161 ~~~~~l~~~~~ 171 (185) ..+-+++.+.. T Consensus 261 v~lVd~~~~~~ 271 (402) T PRK10966 261 VHLVEFDQGKL 271 (402) T ss_pred EEEEEECCCCC T ss_conf 99999779984 No 153 >cd05537 POLBc_Pol_II DNA polymerase type-II subfamily catalytic domain. Bacteria contain five DNA polymerases (I, II, III, IV and V). DNA polymerase II (Pol II) is a prototype for the B-family of polymerases. The role of Pol II in a variety of cellular activities, such as repair of DNA damaged by UV irradiation or oxidation has been proven by genetic studies. DNA polymerase III is the main enzyme responsible for replication of the bacterial chromosome; however, In vivo studies have also shown that Pol II is able to participate in chromosomal DNA replication with larger role in lagging-strand replication. Probab=23.23 E-value=58 Score=14.05 Aligned_cols=27 Identities=19% Similarity=0.170 Sum_probs=20.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 98999999999999988997999950832 Q gi|254780989|r 106 GIGSRLAYMSFMAIKNSGAEFVLIDAKHD 134 (185) Q Consensus 106 GiG~~Li~~~l~~a~~~G~~~v~l~~~~~ 134 (185) ..|+.+++.+.+.+++.|+. +++||.| T Consensus 125 ~~gr~~i~~~~~~~e~~g~~--VIYGDTD 151 (371) T cd05537 125 LRGHEIMKQTRAWIEQQGYQ--VIYGDTD 151 (371) T ss_pred HHHHHHHHHHHHHHHHCCCE--EEEECCC T ss_conf 88899999999999967986--8997265 No 154 >pfam04330 consensus Probab=23.16 E-value=58 Score=14.04 Aligned_cols=18 Identities=33% Similarity=0.335 Sum_probs=14.6 Q ss_pred EEECCEEEEEEEEEEEEE Q ss_conf 989878647999999886 Q gi|254780989|r 68 CRDGDRIVGAIRATPIQI 85 (185) Q Consensus 68 ae~d~~iVG~i~~~~~~~ 85 (185) |..+|+|+||+.++|..- T Consensus 37 a~~~g~ivaF~s~~p~~~ 54 (70) T pfam04330 37 ARDDGRIVAFASFMPTGG 54 (70) T ss_pred ECCCCCEEEEEEEEEECC T ss_conf 879997999999828669 No 155 >COG3327 PaaX Phenylacetic acid-responsive transcriptional repressor [Transcription] Probab=22.57 E-value=59 Score=13.97 Aligned_cols=78 Identities=10% Similarity=0.068 Sum_probs=45.0 Q ss_pred CCEEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHH-----CCCEEHHHCCCCCCCCCCCHHHEE Q ss_conf 784899867752546159989999999999999889979999508325431-----385870527826779888823226 Q gi|254780989|r 88 YKGFLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDWHKH-----LGFKKPHQHQIRYAHGGGAATDWL 162 (185) Q Consensus 88 ~~~~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r-----~GF~~~~~~~i~~~~~~~~~~~~~ 162 (185) ..+++++.|. --.++-|++...++.++.++...|.-...-.|...||+= .-+..+ +.-|.-+--...|..|. T Consensus 23 Gg~Iw~gsLI--~il~~fG~sE~~vRaal~Rm~kaG~l~~er~grks~Y~LSDkgl~r~~~a-~~riy~~~r~aWdgkW~ 99 (291) T COG3327 23 GGWIWIGSLI--QILAEFGISETTVRAALSRMVKAGWLVGEREGRKSFYRLSDKGLARQRRA-ARRIYPRTRPAWDGKWH 99 (291) T ss_pred CCEEEHHHHH--HHHHHCCCCHHHHHHHHHHHHHCCCHHEEECCCCCCEEECHHHHHHHHHH-HHHHCCCCCCCCCCCEE T ss_conf 8735487899--99997194278899999998742201000046633021058899999987-66642777756578458 Q ss_pred EEECCC Q ss_conf 544057 Q gi|254780989|r 163 VHIFKN 168 (185) Q Consensus 163 ~~~l~~ 168 (185) +...+. T Consensus 100 lli~~~ 105 (291) T COG3327 100 LLIYTI 105 (291) T ss_pred EEEEEC T ss_conf 999736 No 156 >PRK07898 consensus Probab=22.41 E-value=30 Score=15.72 Aligned_cols=22 Identities=18% Similarity=0.205 Sum_probs=9.9 Q ss_pred CCCCCCEEECCCCHHHHHHHHHHHH Q ss_conf 2100020751379757899999999 Q gi|254780989|r 8 NIELEGLTFANEQRVDQVAVDEVMV 32 (185) Q Consensus 8 ~~~~~~l~IR~a~~~D~~aI~~L~~ 32 (185) ..+++.+.+... |...+.++.. T Consensus 266 ~~~le~l~~~~~---d~~~L~~lf~ 287 (902) T PRK07898 266 PVTPDDLRLQPW---DRDAVHRLFD 287 (902) T ss_pred CCCHHHHCCCCC---CHHHHHHHHH T ss_conf 878887124899---9999999999 No 157 >TIGR02617 tnaA_trp_ase tryptophanase; InterPro: IPR013440 Proteins in this entry belong to the beta-eliminating lyase family and are thought to act as tryptophanases (4.1.99.1 from EC) (also known as L-tryptophan indole-lyases). The genes encoding these proteins are, as a rule, found with a tryptophanase leader peptide TnaC (IPR013439 from INTERPRO) encoded upstream. ; GO: 0009034 tryptophanase activity, 0006568 tryptophan metabolic process. Probab=22.26 E-value=60 Score=13.95 Aligned_cols=35 Identities=11% Similarity=0.011 Sum_probs=28.5 Q ss_pred EEEEEECHHHHCCCHHHHHHHHHHHHHHHC-CCCEE Q ss_conf 986775254615998999999999999988-99799 Q gi|254780989|r 93 RGPLGVLSEYRKRGIGSRLAYMSFMAIKNS-GAEFV 127 (185) Q Consensus 93 l~~laV~P~~qg~GiG~~Li~~~l~~a~~~-G~~~v 127 (185) +|==++-|.|||||-=..++-..|++=+++ |.+.. T Consensus 87 ~GY~YTIPTHQGRGAEQI~~P~LI~~~~~~~G~~~~ 122 (468) T TIGR02617 87 FGYQYTIPTHQGRGAEQIYIPVLIKKREAEKGLDRS 122 (468) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC T ss_conf 273113687578860445665788877662589855 No 158 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=21.97 E-value=61 Score=13.89 Aligned_cols=107 Identities=10% Similarity=0.138 Sum_probs=61.8 Q ss_pred EECCCCHHHHHHHHHHHHHH--CCCCCCCCCCCEEHHHHHCHHCCCCCCCEEEEE-EEE-CCEEEEEEEEEEEEECCCCE Q ss_conf 75137975789999999997--278766675615202541200025766548999-989-87864799999988606784 Q gi|254780989|r 15 TFANEQRVDQVAVDEVMVKA--FLSTQEITDDVLHKYVRNNVEEPLSFVRLMSFV-CRD-GDRIVGAIRATPIQIGKYKG 90 (185) Q Consensus 15 ~IR~a~~~D~~aI~~L~~~a--F~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~V-ae~-d~~iVG~i~~~~~~~~~~~~ 90 (185) .||....-+.++..+|...| +... +.+..+.+.+ .++-+..+ ..+.+| |.+ +|+++|...|.|. +.++ T Consensus 376 ~irr~~~~~~~e~~~~~~~a~~wr~~-~~erGfsMal--~r~gdp~d--~~~~~v~a~~~~g~~~~~l~fvpw---g~~g 447 (1099) T PRK02983 376 RIRRHRDLSAEEMAQTIDRADAWRDT-ETERGFSMAL--GRLGDPAD--GDCLLVEAVDPDGQVVAMLSLVPW---GTNG 447 (1099) T ss_pred EEEECCCCCHHHHHHHHHHHHHHCCC-CCCCCEEEEC--CCCCCCCC--CCEEEEEEECCCCCEEEEEEECCC---CCCC T ss_conf 99873439999999999999985169-9766502200--46688888--987999998899968899985476---8998 Q ss_pred EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 8998677525461599899999999999998899799995 Q gi|254780989|r 91 FLRGPLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLID 130 (185) Q Consensus 91 ~~l~~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~ 130 (185) +.|--+-=+|+ --.|+-..|+...++.+.+.|+..+.|- T Consensus 448 ~SLd~mrr~~~-a~nG~~e~mv~~~~~~a~~~gv~~iSln 486 (1099) T PRK02983 448 VSLDLMRRSPQ-SPNGTIELMVSELALEAEDLGITRISLN 486 (1099) T ss_pred CCEEECCCCCC-CCCCHHHHHHHHHHHHHHHCCCEEEEEH T ss_conf 52240257999-9885308999999986364494599752 No 159 >PRK11197 lldD L-lactate dehydrogenase; Provisional Probab=21.93 E-value=61 Score=13.89 Aligned_cols=34 Identities=15% Similarity=0.151 Sum_probs=25.6 Q ss_pred EEEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHH Q ss_conf 67752546159989999999999999889979999508325 Q gi|254780989|r 95 PLGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAKHDW 135 (185) Q Consensus 95 ~laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~~~~ 135 (185) .||+..+ +.++...++++++.|++.++|+.|... T Consensus 127 QLY~~~D-------r~~~~~ll~RA~~aG~~alvlTVD~pv 160 (381) T PRK11197 127 QLYVLRD-------RGFMRNALERAKAAGCSTLVFTVDMPV 160 (381) T ss_pred EEEECCC-------HHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 9841388-------899999999999849987998078887 No 160 >cd05531 POLBc_B2 DNA polymerase type-B B2 subfamily catalytic domain. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaeal members also possess multiple family B DNA polymerases (B1, B2 and B3). So far there is no specific function(s) has been assigned for different members of the archaea type B DNA polymerases. Phylogenetic analyses of eubacterial, archaeal, and eukaryotic family B DNA polymerases are support independent gene duplications during the evolution of archaeal and eukaryotic family B DNA polymerases. Probab=21.83 E-value=61 Score=13.87 Aligned_cols=27 Identities=11% Similarity=0.160 Sum_probs=20.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 98999999999999988997999950832 Q gi|254780989|r 106 GIGSRLAYMSFMAIKNSGAEFVLIDAKHD 134 (185) Q Consensus 106 GiG~~Li~~~l~~a~~~G~~~v~l~~~~~ 134 (185) -.||.++..+++.+++.|+.. ++||.| T Consensus 129 ~~GR~~l~~~~~~~e~~g~~V--IYGDTD 155 (352) T cd05531 129 AYGRKILLRAKEIAEEMGFRV--LHGIVD 155 (352) T ss_pred HHHHHHHHHHHHHHHHCCCEE--EEECCC T ss_conf 999999999999999759889--997375 No 161 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=21.65 E-value=42 Score=14.87 Aligned_cols=39 Identities=13% Similarity=0.050 Sum_probs=26.7 Q ss_pred HCCCHHHHHHHHHHHHHHH--CCCCEEEEECCHHHHHHCCC Q ss_conf 1599899999999999998--89979999508325431385 Q gi|254780989|r 103 RKRGIGSRLAYMSFMAIKN--SGAEFVLIDAKHDWHKHLGF 141 (185) Q Consensus 103 qg~GiG~~Li~~~l~~a~~--~G~~~v~l~~~~~~Y~r~GF 141 (185) -|+-.-..|=+..-+..|. .-..-|.|..|||||.|+-= T Consensus 118 sG~~~~~~lK~~vA~~vk~tD~~i~nVyV~adpD~veR~~~ 158 (185) T TIGR02898 118 SGGKVTDELKEKVAETVKSTDKRIKNVYVSADPDTVERIRE 158 (185) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHH T ss_conf 57504478899999865300337872689728628899999 No 162 >TIGR02708 L_lactate_ox L-lactate oxidase; InterPro: IPR014080 Members of this entry oxidize L-lactate to pyruvate, reducing molecular oxygen to hydrogen peroxide. The enzyme is known in Aerococcus viridans, Streptococcus iniae, and some strains of Streptococcus pyogenes where it appears to contribute to virulence.. Probab=21.26 E-value=63 Score=13.80 Aligned_cols=30 Identities=23% Similarity=0.326 Sum_probs=23.7 Q ss_pred HHHHHHHHHCCCCEEEEECCH------HHHHHCCCE Q ss_conf 999999998899799995083------254313858 Q gi|254780989|r 113 YMSFMAIKNSGAEFVLIDAKH------DWHKHLGFK 142 (185) Q Consensus 113 ~~~l~~a~~~G~~~v~l~~~~------~~Y~r~GF~ 142 (185) ++.++++|+.|+++++|+.|. +-=.|=||+ T Consensus 149 R~I~D~~Ka~G~~AIvLTADaTV~GNR~~D~~N~FV 184 (368) T TIGR02708 149 RDILDRVKADGAKAIVLTADATVGGNREVDKRNGFV 184 (368) T ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCEE T ss_conf 467888752785289972146335774413558736 No 163 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=21.25 E-value=63 Score=13.80 Aligned_cols=37 Identities=8% Similarity=0.058 Sum_probs=27.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCHHH------HHHCCCEEHH Q ss_conf 999999999999889979999508325------4313858705 Q gi|254780989|r 109 SRLAYMSFMAIKNSGAEFVLIDAKHDW------HKHLGFKKPH 145 (185) Q Consensus 109 ~~Li~~~l~~a~~~G~~~v~l~~~~~~------Y~r~GF~~~~ 145 (185) +.++...++++++.|++.++|+.|... -.|.||.... T Consensus 130 r~~~~~li~RA~~aG~~alvlTvD~p~~G~R~rd~r~~~~~~~ 172 (344) T cd02922 130 RTKTEELLKRAEKLGAKAIFLTVDAPVLGKRERDERLKAEEAV 172 (344) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHCCCCCCC T ss_conf 7999999999998699889995678887752266650777788 No 164 >TIGR02759 TraD_Ftype type IV conjugative transfer system coupling protein TraD; InterPro: IPR014128 The TraD protein performs an essential coupling function in conjugative type IV secretion systems. This protein sits at the inner membrane in contact with the assembled pilus and its scaffold as well as the relaxosome-plasmid DNA complex (through TraM) , .; GO: 0000746 conjugation. Probab=20.85 E-value=64 Score=13.75 Aligned_cols=49 Identities=14% Similarity=0.182 Sum_probs=33.4 Q ss_pred CCHHH-HHHHHHHHHHHHCCCCEEEEECCHHHHHHCCCEEHHHCCCCCCCCC Q ss_conf 99899-9999999999988997999950832543138587052782677988 Q gi|254780989|r 105 RGIGS-RLAYMSFMAIKNSGAEFVLIDAKHDWHKHLGFKKPHQHQIRYAHGG 155 (185) Q Consensus 105 ~GiG~-~Li~~~l~~a~~~G~~~v~l~~~~~~Y~r~GF~~~~~~~i~~~~~~ 155 (185) -|=|| .+|++.++.+|++|.++|+..-.-.|-+|| |.+. .--|..|+|. T Consensus 217 tG~GKs~~lr~LL~~iR~rGd~AIiYDkgC~f~~~f-yd~~-~DviLNP~D~ 266 (613) T TIGR02759 217 TGSGKSVALRKLLRWIRQRGDRAIIYDKGCTFVSRF-YDPS-QDVILNPLDE 266 (613) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC-CCCC-CCEEECCCCC T ss_conf 174389999999999986398589982574202132-6888-8746067443 No 165 >COG3640 CooC CO dehydrogenase maturation factor [Cell division and chromosome partitioning] Probab=20.84 E-value=64 Score=13.75 Aligned_cols=33 Identities=12% Similarity=0.153 Sum_probs=21.2 Q ss_pred EEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 7752546159989999999999999889979999508 Q gi|254780989|r 96 LGVLSEYRKRGIGSRLAYMSFMAIKNSGAEFVLIDAK 132 (185) Q Consensus 96 laV~P~~qg~GiG~~Li~~~l~~a~~~G~~~v~l~~~ 132 (185) +.|+|.+++-...++. -+.+.+.|...+++..| T Consensus 161 vVvDpS~~sl~taeri----~~L~~elg~k~i~~V~N 193 (255) T COG3640 161 VVVDPSYKSLRTAERI----KELAEELGIKRIFVVLN 193 (255) T ss_pred EEECCCHHHHHHHHHH----HHHHHHHCCCEEEEEEE T ss_conf 9957877888889999----99998718754999995 No 166 >PRK05929 consensus Probab=20.71 E-value=30 Score=15.77 Aligned_cols=24 Identities=21% Similarity=0.509 Sum_probs=10.5 Q ss_pred CCCCCEEECCCCHHHHHHHHHHHHH Q ss_conf 1000207513797578999999999 Q gi|254780989|r 9 IELEGLTFANEQRVDQVAVDEVMVK 33 (185) Q Consensus 9 ~~~~~l~IR~a~~~D~~aI~~L~~~ 33 (185) .+.+.+.+. ..+.|...+..+..+ T Consensus 250 ~~le~l~~~-~~~~d~~~L~~~~~~ 273 (870) T PRK05929 250 LPIEELAFP-QHPVDEEQLNTFYLQ 273 (870) T ss_pred CCHHHHHCC-CCCCCHHHHHHHHHH T ss_conf 997897157-678799999999997 No 167 >PRK05948 precorrin-2 methyltransferase; Provisional Probab=20.45 E-value=66 Score=13.70 Aligned_cols=21 Identities=5% Similarity=0.222 Sum_probs=13.0 Q ss_pred CCCCEEEE-ECCHHHHHHCCCE Q ss_conf 89979999-5083254313858 Q gi|254780989|r 122 SGAEFVLI-DAKHDWHKHLGFK 142 (185) Q Consensus 122 ~G~~~v~l-~~~~~~Y~r~GF~ 142 (185) .|.+.+++ .|||.+|..|++. T Consensus 91 ~G~~Va~l~~GDp~~Yst~~yl 112 (238) T PRK05948 91 QGEDVAFACEGDVSLYSTFTYL 112 (238) T ss_pred CCCCEEEEECCCCCHHHHHHHH T ss_conf 6993999944683066648999 No 168 >PRK08786 consensus Probab=20.37 E-value=37 Score=15.24 Aligned_cols=20 Identities=10% Similarity=0.097 Sum_probs=8.7 Q ss_pred CCCCCEEECCCCHHHHHHHHHHH Q ss_conf 10002075137975789999999 Q gi|254780989|r 9 IELEGLTFANEQRVDQVAVDEVM 31 (185) Q Consensus 9 ~~~~~l~IR~a~~~D~~aI~~L~ 31 (185) .+.+.+.++.. |...+..+. T Consensus 248 ~~le~l~~~~~---d~e~L~~~f 267 (927) T PRK08786 248 SGPRALDLREP---NTEALAVLY 267 (927) T ss_pred CCHHHHCCCCC---CHHHHHHHH T ss_conf 89899345899---999999999 No 169 >TIGR00619 sbcd nuclease SbcCD, D subunit; InterPro: IPR004593 All proteins in this family for which functions are known are double-stranded DNA exonuclease (as part of a complex with SbcC homologs). This complex functions in the initiation of recombination and recombinational repair and is particularly important in regulating the stability of DNA sections that can form secondary structures. This family is likely to be homologous to the MRE11 family.; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=20.30 E-value=66 Score=13.68 Aligned_cols=68 Identities=15% Similarity=0.163 Sum_probs=47.1 Q ss_pred CCCHHHHHHHHHHHHHHHCC--CCEEEEECCHHHHHHCCCEEHHHCCCCCCCCCCCHHHEEEEECCCCCC Q ss_conf 59989999999999999889--979999508325431385870527826779888823226544057976 Q gi|254780989|r 104 KRGIGSRLAYMSFMAIKNSG--AEFVLIDAKHDWHKHLGFKKPHQHQIRYAHGGGAATDWLVHIFKNNIM 171 (185) Q Consensus 104 g~GiG~~Li~~~l~~a~~~G--~~~v~l~~~~~~Y~r~GF~~~~~~~i~~~~~~~~~~~~~~~~l~~~~~ 171 (185) =-...++|+...+..+...+ ...|++.||.|-+.++-+-+.-.........+.+...=++..|+++.. T Consensus 64 P~~ea~~L~~~~l~~L~~~~~R~p~Vvi~GNHD~~~~Ls~~~~Ll~~~~~~v~~~~~~~~~~~~l~~~~~ 133 (275) T TIGR00619 64 PPAEAQELFYAFLLNLSDANPRLPIVVISGNHDSADRLSAAKKLLKELGVFVVGKPEEEPQILDLKDTAN 133 (275) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHCHHHHHHHHCCCEEEECCCCCCEEEECCCCCC T ss_conf 8799999999999999853896127870478887786111677897709544202477860243278887 No 170 >TIGR02467 CbiE precorrin-6y C5,15-methyltransferase (decarboxylating), CbiE subunit; InterPro: IPR012818 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CbiE subunit of precorrin-6Y C5,15-methyltransferase (2.1.1.132 from EC) from the anaerobic pathway, a bifunctional enzyme that catalyses two methylations (at C-5 and C-15) in precorrin-6Y, as well as the decarboxylation of the acetate side chain located in ring C, in order to generate precorrin-8X. In the anaerobic pathway, two enzymes are required to produce precorrin-8X: CbiE and CbiT, which can be fused as CbiET (sometimes called CobL) . In the aerobic pathway, the bifunctional enzyme is CobL .; GO: 0008276 protein methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=20.17 E-value=67 Score=13.66 Aligned_cols=13 Identities=23% Similarity=0.110 Sum_probs=5.7 Q ss_pred HCCCCEEEEECCH Q ss_conf 8899799995083 Q gi|254780989|r 121 NSGAEFVLIDAKH 133 (185) Q Consensus 121 ~~G~~~v~l~~~~ 133 (185) .++.+.+.+.+|+ T Consensus 141 ~~~~r~~~~lTd~ 153 (210) T TIGR02467 141 LRGARKVAVLTDP 153 (210) T ss_pred HCCCCEEEEECCC T ss_conf 5079879995287 No 171 >COG3033 TnaA Tryptophanase [Amino acid transport and metabolism] Probab=20.10 E-value=67 Score=13.65 Aligned_cols=29 Identities=14% Similarity=0.048 Sum_probs=22.1 Q ss_pred EEEEECHHHHCCCHHHHHHHHHHHHHHHC Q ss_conf 86775254615998999999999999988 Q gi|254780989|r 94 GPLGVLSEYRKRGIGSRLAYMSFMAIKNS 122 (185) Q Consensus 94 ~~laV~P~~qg~GiG~~Li~~~l~~a~~~ 122 (185) |-=++.|.+||||--+.|+...++..... T Consensus 93 Gfq~~iPthQGRgAE~Il~~i~ik~~~~~ 121 (471) T COG3033 93 GFQYTIPTHQGRGAENILIPILIKKGEQE 121 (471) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHCCCC T ss_conf 73465125577317778899984103206 Done!