Query gi|254781162|ref|YP_003065575.1| GCN5-related N-acetyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 192 No_of_seqs 206 out of 1693 Neff 5.8 Searched_HMMs 39220 Date Mon May 30 05:40:29 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781162.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3153 Predicted acetyltransf 100.0 1.4E-41 0 263.3 15.0 160 32-191 2-166 (171) 2 PRK03624 putative acetyltransf 99.7 4.1E-17 1.1E-21 120.0 12.8 120 33-156 2-130 (140) 3 PRK07757 acetyltransferase; Pr 99.7 2.3E-17 5.9E-22 121.5 10.9 123 34-159 2-125 (152) 4 PRK09491 rimI ribosomal-protei 99.7 6.9E-16 1.8E-20 112.9 12.3 133 35-175 1-144 (144) 5 PTZ00330 acetyltransferase; Pr 99.6 5.1E-15 1.3E-19 107.8 13.1 126 30-155 3-140 (147) 6 PRK07922 N-acetylglutamate syn 99.6 1.3E-15 3.3E-20 111.3 9.5 127 30-159 3-131 (170) 7 PRK10146 aminoalkylphosphonic 99.6 1.1E-14 2.8E-19 105.8 13.2 126 33-160 3-141 (144) 8 TIGR01575 rimI ribosomal-prote 99.6 8.6E-15 2.2E-19 106.5 11.0 129 41-171 2-140 (140) 9 PRK10140 putative acetyltransf 99.6 3.1E-14 7.8E-19 103.2 12.5 123 32-156 2-141 (162) 10 PRK05279 N-acetylglutamate syn 99.5 1.4E-13 3.7E-18 99.3 9.2 123 35-159 296-420 (441) 11 cd04301 GNAT GCN5-related N-ac 99.5 5.8E-13 1.5E-17 95.8 10.2 76 76-152 2-83 (83) 12 PRK01346 hypothetical protein; 99.5 8.8E-13 2.2E-17 94.7 10.7 123 32-156 5-136 (411) 13 COG1246 ArgA N-acetylglutamate 99.4 2.6E-13 6.7E-18 97.8 6.7 120 34-159 1-126 (153) 14 PRK10975 TDP-fucosamine acetyl 99.4 3.6E-12 9.2E-17 91.1 11.1 120 34-157 83-225 (231) 15 KOG3139 consensus 99.4 5.6E-12 1.4E-16 90.0 11.3 111 42-156 28-146 (165) 16 PRK12308 bifunctional arginino 99.4 1.3E-13 3.3E-18 99.5 2.7 121 32-159 462-587 (614) 17 PRK09831 hypothetical protein; 99.3 4.1E-12 1E-16 90.8 7.8 128 34-174 1-144 (147) 18 PRK10314 hypothetical protein; 99.3 2E-11 5.1E-16 86.8 10.9 130 41-176 17-151 (153) 19 pfam00583 Acetyltransf_1 Acety 99.3 1.2E-11 3E-16 88.1 9.2 74 78-152 1-80 (80) 20 PRK10562 hypothetical protein; 99.3 8E-12 2E-16 89.1 8.0 126 35-172 1-139 (145) 21 TIGR03448 mycothiol_MshD mycot 99.2 3.6E-10 9.3E-15 79.4 12.9 123 33-156 149-288 (292) 22 COG0456 RimI Acetyltransferase 99.2 9.1E-11 2.3E-15 82.9 9.0 143 28-172 6-170 (177) 23 TIGR03103 trio_acet_GNAT GNAT- 99.2 2E-10 5E-15 81.0 10.6 146 27-173 77-234 (547) 24 PRK10514 hypothetical protein; 99.2 1E-10 2.6E-15 82.7 8.8 115 34-160 2-130 (146) 25 COG2153 ElaA Predicted acyltra 99.2 1.4E-10 3.5E-15 81.9 8.9 96 76-175 52-152 (155) 26 COG1247 Sortase and related ac 99.0 1.2E-08 3.2E-13 70.4 13.3 126 33-158 1-145 (169) 27 KOG3397 consensus 99.0 2.6E-09 6.5E-14 74.4 7.8 117 36-155 20-140 (225) 28 KOG3216 consensus 98.9 2.2E-08 5.5E-13 69.0 11.6 123 32-156 2-146 (163) 29 TIGR01890 N-Ac-Glu-synth amino 98.9 1.4E-09 3.6E-14 75.9 3.7 122 33-159 291-417 (439) 30 pfam08445 FR47 FR47-like prote 98.9 1.4E-08 3.6E-13 70.1 8.8 74 76-156 4-82 (86) 31 TIGR03448 mycothiol_MshD mycot 98.8 1.3E-07 3.3E-12 64.5 11.6 133 40-178 10-158 (292) 32 COG4552 Eis Predicted acetyltr 98.7 3.7E-08 9.4E-13 67.6 5.6 120 52-175 21-145 (389) 33 KOG2488 consensus 98.5 8E-07 2E-11 59.8 8.2 81 76-156 95-182 (202) 34 KOG4144 consensus 98.5 6.8E-08 1.7E-12 66.1 2.1 124 35-160 13-165 (190) 35 COG3393 Predicted acetyltransf 98.4 1.4E-06 3.5E-11 58.5 7.8 80 77-159 181-265 (268) 36 KOG3396 consensus 98.3 3.8E-06 9.7E-11 55.9 8.2 81 74-155 54-143 (150) 37 TIGR01211 ELP3 histone acetylt 98.3 1.7E-06 4.3E-11 57.9 5.4 98 59-156 441-567 (573) 38 PRK10809 ribosomal-protein-S5- 98.2 6.8E-05 1.7E-09 48.5 13.3 121 34-155 18-165 (194) 39 KOG3235 consensus 98.2 8.2E-06 2.1E-10 53.9 7.1 120 34-157 2-136 (193) 40 COG1444 Predicted P-loop ATPas 98.1 7.3E-06 1.9E-10 54.2 6.4 80 100-180 531-615 (758) 41 PRK10151 ribosomal-protein-L7/ 98.1 4.1E-05 1.1E-09 49.8 9.8 122 32-155 9-154 (179) 42 TIGR02406 ectoine_EctA L-2,4-d 98.1 1.2E-05 3E-10 53.0 6.6 80 74-154 40-130 (162) 43 COG1670 RimL Acetyltransferase 98.0 0.00015 3.8E-09 46.6 11.3 86 72-158 65-160 (187) 44 KOG3234 consensus 98.0 2E-05 5.1E-10 51.6 6.8 142 36-178 4-154 (173) 45 TIGR03585 PseH pseudaminic aci 98.0 4.2E-05 1.1E-09 49.8 8.0 81 73-156 51-138 (156) 46 KOG3138 consensus 98.0 1.5E-05 3.7E-10 52.5 5.5 142 34-179 17-176 (187) 47 COG3981 Predicted acetyltransf 97.9 4.5E-05 1.1E-09 49.6 7.2 76 71-148 66-145 (174) 48 cd02169 Citrate_lyase_ligase C 97.9 0.00011 2.8E-09 47.3 8.2 69 79-155 12-83 (296) 49 COG2388 Predicted acetyltransf 97.8 7E-05 1.8E-09 48.5 6.9 67 72-141 14-80 (99) 50 pfam00765 Autoind_synth Autoin 97.6 0.0021 5.3E-08 39.8 11.0 108 65-175 37-171 (182) 51 PHA00673 acetyltransferase dom 97.5 0.00051 1.3E-08 43.4 7.3 83 73-155 55-145 (154) 52 TIGR02382 wecD_rffC TDP-D-fuco 97.5 0.00014 3.6E-09 46.7 4.0 118 34-155 54-194 (201) 53 COG1243 ELP3 Histone acetyltra 97.4 0.0002 5.2E-09 45.8 4.2 85 72-156 406-509 (515) 54 COG3053 CitC Citrate lyase syn 97.4 0.0026 6.5E-08 39.3 9.6 76 72-155 35-114 (352) 55 pfam04958 AstA Arginine N-succ 97.0 0.0061 1.6E-07 37.1 8.1 118 34-151 1-182 (338) 56 TIGR03245 arg_AOST_alph argini 96.9 0.011 2.7E-07 35.7 9.2 117 35-151 1-180 (336) 57 PRK13688 hypothetical protein; 96.9 0.00076 1.9E-08 42.4 3.2 77 76-156 48-133 (157) 58 COG0454 WecD Histone acetyltra 96.9 0.0006 1.5E-08 43.0 2.6 44 106-151 87-130 (156) 59 PRK00756 acyltransferase NodA; 96.9 0.0066 1.7E-07 36.9 7.6 112 38-150 17-137 (196) 60 PRK10456 arginine succinyltran 96.7 0.012 3E-07 35.4 7.6 117 34-151 2-181 (344) 61 PRK13834 putative autoinducer 96.5 0.032 8.2E-07 32.9 9.1 109 72-183 52-191 (207) 62 TIGR03244 arg_catab_AstA argin 96.5 0.021 5.3E-07 34.0 7.9 116 35-151 1-179 (336) 63 pfam02474 NodA Nodulation prot 96.4 0.011 2.9E-07 35.5 6.3 113 38-151 17-138 (196) 64 TIGR03243 arg_catab_AOST argin 96.3 0.029 7.3E-07 33.2 7.6 150 35-187 1-226 (335) 65 COG3818 Predicted acetyltransf 96.0 0.0053 1.3E-07 37.5 2.7 84 76-159 48-151 (167) 66 KOG2535 consensus 95.8 0.015 3.7E-07 34.9 4.3 77 80-156 450-547 (554) 67 PHA01807 hypothetical protein 95.5 0.086 2.2E-06 30.4 7.4 69 71-140 48-118 (145) 68 COG3375 Uncharacterized conser 95.4 0.089 2.3E-06 30.3 7.1 104 36-142 6-116 (266) 69 pfam08444 Gly_acyl_tr_C Aralky 95.3 0.042 1.1E-06 32.2 5.2 54 103-156 22-80 (89) 70 COG2401 ABC-type ATPase fused 95.2 0.015 3.7E-07 34.9 2.7 55 101-155 242-307 (593) 71 pfam01853 MOZ_SAS MOZ/SAS fami 94.3 0.09 2.3E-06 30.3 4.9 60 72-134 51-114 (189) 72 COG3916 LasI N-acyl-L-homoseri 93.5 0.59 1.5E-05 25.5 10.3 119 37-155 10-162 (209) 73 COG5628 Predicted acetyltransf 93.0 0.44 1.1E-05 26.2 6.5 75 76-152 40-119 (143) 74 KOG3698 consensus 93.0 0.7 1.8E-05 25.1 7.5 149 7-155 653-877 (891) 75 PTZ00064 histone acetyltransfe 92.8 0.24 6.1E-06 27.8 4.9 59 71-134 268-330 (442) 76 pfam01233 NMT Myristoyl-CoA:pr 92.7 0.77 2E-05 24.8 7.5 100 39-138 32-148 (162) 77 pfam09390 DUF1999 Protein of u 92.7 0.39 9.9E-06 26.5 5.9 121 34-155 1-140 (161) 78 KOG2747 consensus 92.5 0.43 1.1E-05 26.3 5.9 61 71-134 232-294 (396) 79 KOG4135 consensus 92.2 0.58 1.5E-05 25.5 6.2 94 64-157 57-171 (185) 80 TIGR01686 FkbH FkbH domain; In 90.8 0.65 1.7E-05 25.2 5.3 124 26-153 190-335 (337) 81 KOG2779 consensus 90.3 0.54 1.4E-05 25.7 4.5 99 37-135 87-202 (421) 82 COG3882 FkbH Predicted enzyme 89.9 0.78 2E-05 24.8 5.1 124 28-155 408-549 (574) 83 COG3138 AstA Arginine/ornithin 89.0 1.1 2.9E-05 23.9 5.3 90 34-124 2-143 (336) 84 TIGR00124 cit_ly_ligase [citra 87.7 2.2 5.5E-05 22.2 8.3 80 69-156 28-113 (342) 85 pfam06852 DUF1248 Protein of u 87.6 2.2 5.6E-05 22.1 6.6 90 64-153 37-134 (180) 86 COG5653 Protein involved in ce 86.4 2.6 6.6E-05 21.7 7.0 84 75-162 274-358 (406) 87 COG5027 SAS2 Histone acetyltra 85.4 0.51 1.3E-05 25.8 1.9 58 71-131 232-293 (395) 88 pfam05301 Mec-17 Touch recepto 83.5 1.4 3.5E-05 23.3 3.4 51 82-132 18-78 (120) 89 KOG2036 consensus 82.4 1.4 3.6E-05 23.3 3.1 77 102-178 616-747 (1011) 90 KOG2696 consensus 82.2 2.2 5.6E-05 22.1 4.0 41 103-143 220-260 (403) 91 TIGR03019 pepcterm_femAB FemAB 81.6 4.1 0.00011 20.5 9.4 90 63-156 186-281 (330) 92 PRK01305 arginyl-tRNA-protein 78.2 5.3 0.00014 19.9 8.2 68 66-139 137-206 (241) 93 COG5092 NMT1 N-myristoyl trans 77.9 5.5 0.00014 19.8 6.7 80 56-135 112-200 (451) 94 pfam04377 ATE_C Arginine-tRNA- 75.8 6.3 0.00016 19.5 8.2 65 69-139 35-101 (128) 95 COG4671 Predicted glycosyl tra 56.2 15 0.00037 17.3 3.4 29 113-141 227-258 (400) 96 KOG4601 consensus 55.4 13 0.00034 17.6 3.1 49 82-130 81-138 (264) 97 pfam02388 FemAB FemAB family. 54.2 17 0.00043 16.9 10.1 71 104-178 70-160 (407) 98 PRK12568 glycogen branching en 52.4 16 0.00042 17.0 3.2 48 119-166 269-318 (730) 99 TIGR02482 PFKA_ATP 6-phosphofr 52.2 18 0.00046 16.8 3.4 39 123-166 81-126 (302) 100 TIGR00542 hxl6Piso_put hexulos 51.7 15 0.00037 17.3 2.8 37 117-153 91-127 (290) 101 PRK05402 glycogen branching en 48.8 21 0.00053 16.4 3.2 46 118-163 269-316 (730) 102 cd03173 DUF619-like DUF619-lik 47.5 22 0.00056 16.3 5.6 49 78-128 14-62 (99) 103 COG3623 SgaU Putative L-xylulo 46.2 16 0.00042 17.0 2.4 33 118-150 94-126 (287) 104 TIGR02089 TTC tartrate dehydro 45.5 23 0.0006 16.1 3.4 14 114-127 11-24 (355) 105 TIGR00455 apsK adenylylsulfate 42.8 24 0.0006 16.1 2.7 39 116-154 29-71 (187) 106 TIGR02362 dhaK1b probable dihy 40.7 21 0.00053 16.5 2.2 47 106-152 134-189 (328) 107 COG2921 Uncharacterized conser 37.8 31 0.00079 15.4 3.7 43 8-50 36-79 (90) 108 PRK12313 glycogen branching en 36.8 32 0.00082 15.3 3.2 44 119-162 169-214 (632) 109 cd04266 DUF619-NAGS-FABP DUF61 36.1 33 0.00084 15.3 6.7 55 74-128 11-68 (108) 110 TIGR01490 HAD-SF-IB-hyp1 HAD-s 35.3 34 0.00087 15.2 3.7 35 122-156 94-133 (204) 111 PRK09936 hypothetical protein; 34.7 35 0.00089 15.1 3.6 33 118-150 70-103 (294) 112 PRK09331 Sep-tRNA:Cys-tRNA syn 33.8 15 0.00037 17.3 0.5 42 104-145 124-168 (386) 113 PRK13210 putative L-xylulose 5 32.8 20 0.0005 16.6 1.0 32 108-141 84-115 (284) 114 pfam06559 DCD 2'-deoxycytidine 32.1 38 0.00098 14.9 4.4 51 58-117 308-358 (366) 115 PRK07559 2'-deoxycytidine 5'-t 31.8 39 0.001 14.8 4.7 36 79-117 323-358 (366) 116 TIGR02277 PaaX_trns_reg phenyl 31.1 29 0.00073 15.6 1.6 73 100-174 17-94 (287) 117 PRK13876 conjugal transfer cou 30.9 39 0.00099 14.8 2.3 12 107-118 149-160 (662) 118 cd04265 DUF619-NAGS-U DUF619-N 29.0 44 0.0011 14.6 3.4 30 100-129 30-59 (95) 119 COG2243 CobF Precorrin-2 methy 27.8 46 0.0012 14.4 2.7 21 132-152 89-110 (234) 120 cd04736 MDH_FMN Mandelate dehy 27.0 47 0.0012 14.3 2.7 28 120-147 128-155 (361) 121 PRK05576 cobalt-precorrin-2 C( 26.5 48 0.0012 14.3 2.1 11 141-151 99-109 (224) 122 pfam04765 DUF616 Protein of un 25.4 38 0.00096 14.9 1.4 57 121-177 196-252 (306) 123 cd02922 FCB2_FMN Flavocytochro 24.8 52 0.0013 14.1 3.3 38 119-156 130-173 (344) 124 pfam09924 DUF2156 Uncharacteri 23.3 56 0.0014 13.9 8.6 55 75-132 181-235 (250) 125 pfam02457 DisA_N DisA bacteria 22.8 50 0.0013 14.2 1.6 48 76-139 59-106 (122) 126 COG0434 SgcQ Predicted TIM-bar 22.7 46 0.0012 14.4 1.4 19 133-151 47-67 (263) 127 pfam09837 DUF2064 Uncharacteri 22.0 59 0.0015 13.8 3.2 71 103-176 32-107 (121) 128 pfam06849 DUF1246 Protein of u 22.0 59 0.0015 13.8 2.7 29 124-152 11-41 (124) 129 TIGR02665 molyb_mobA molybdopt 21.1 51 0.0013 14.1 1.4 47 118-164 26-78 (202) 130 TIGR02708 L_lactate_ox L-lacta 20.8 63 0.0016 13.6 2.4 45 123-169 149-195 (368) 131 pfam01276 OKR_DC_1 Orn/Lys/Arg 20.3 59 0.0015 13.8 1.6 23 141-163 202-224 (417) No 1 >COG3153 Predicted acetyltransferase [General function prediction only] Probab=100.00 E-value=1.4e-41 Score=263.28 Aligned_cols=160 Identities=33% Similarity=0.576 Sum_probs=152.5 Q ss_pred CEEEEEECCHHHHHHHH---HHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEEC--CCCEEEEEEE Q ss_conf 20344207989999999---985389872100131103888876078999988989999998776118--8223899767 Q gi|254781162|r 32 AFFLERLEHDSSICAMH---ADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIE--KITGHLLGPI 106 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~---~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~--~~~~~~Lgpl 106 (192) ++|.||+|+.+|+.+|. +.+|+++++++.+..+|+...++.+|||||+++|++||||+|||+.++ ...++.|||| T Consensus 2 ~~~~ir~e~~~d~~~i~~~~~~aF~~~~e~~~v~~lR~~~~~~~~LslVA~d~g~vvG~Il~s~v~~~g~~~~~~~LaPL 81 (171) T COG3153 2 MMMLIRTETPADIPAIEALTREAFGPGREAKLVDKLREGGRPDLTLSLVAEDDGEVVGHILFSPVTVGGEELGWLGLAPL 81 (171) T ss_pred CCCEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCCCEEEEEEE T ss_conf 84278746834389999999987404368999999874677655336998209989999998689856864146999767 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHCCCEECCCCCEECCCCCCHHHEEEEECCCCCCCCCEEE Q ss_conf 88511138995899999999999613884799840865787429487463653668999956669843577810178167 Q gi|254781162|r 107 VVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIAYYSKLGFQAVPWKSLILPAPVDPNRVLFLPLVQNVAQNIKGI 186 (192) Q Consensus 107 aV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L~~~~l~~~~G~ 186 (192) +|+|+|||||||++||+.+++.++.+|+..|+++|+|+||.||||++++..++.+|+|+++++||+++|.++.+++.+|+ T Consensus 82 aV~p~~qg~GIG~~Lvr~~le~a~~~G~~~v~vlGdp~YY~rfGF~~~~~~~l~~p~~~~~~~fl~~~L~~~~l~~~~G~ 161 (171) T COG3153 82 AVDPEYQGQGIGSALVREGLEALRLAGASAVVVLGDPTYYSRFGFEPAAGAKLYAPGPVPDERFLALELGDGALEGPSGT 161 (171) T ss_pred EECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCE T ss_conf 87622157867899999999999977997899906844244547488655531168998974089998458755677633 Q ss_pred EEEEE Q ss_conf 79832 Q gi|254781162|r 187 VRCRE 191 (192) Q Consensus 187 v~~~~ 191 (192) |.+.+ T Consensus 162 v~~~~ 166 (171) T COG3153 162 VHYHE 166 (171) T ss_pred EEECC T ss_conf 88553 No 2 >PRK03624 putative acetyltransferase; Provisional Probab=99.74 E-value=4.1e-17 Score=120.04 Aligned_cols=120 Identities=23% Similarity=0.306 Sum_probs=89.8 Q ss_pred EEEEEECCHHHHHHHHHH---HCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 034420798999999998---53898721001311038888760789999889899999987761188223899767885 Q gi|254781162|r 33 FFLERLEHDSSICAMHAD---SFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVH 109 (192) Q Consensus 33 ~~~~R~e~~~di~ai~~~---~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~ 109 (192) .|.||+-+.+|++++.++ ++.+.-...+.............+.|||+.+|+|||++..+. ++..++ +..++|+ T Consensus 2 ~m~IR~~~~~D~~av~~L~~~~~~~~pw~~~~~~~~~~l~~~~~~~lVae~~g~ivG~~~~~~---~~~~~~-i~~laV~ 77 (140) T PRK03624 2 AMEIRVFRQADFEAVIALWERCDLTRPWNDPEMDIERKLAHDPSLFLVAEVGGRVVGTVMGGY---DGHRGW-LYYLAVH 77 (140) T ss_pred CEEEEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCEEEEEEEEEE---ECCEEE-EEEEEEC T ss_conf 619985999999999999996499999886399999998479978999999999999999998---177899-9998888 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE---C---HHHHHHCCCEECCC Q ss_conf 11138995899999999999613884799840---8---65787429487463 Q gi|254781162|r 110 PLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG---D---IAYYSKLGFQAVPW 156 (192) Q Consensus 110 P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g---~---~~yY~rfGF~~~~~ 156 (192) |+|||+|||++||+++++.+++.|+..+.|.- + ..||+++||+.... T Consensus 78 p~~Rg~GiG~~Ll~~~~~~~~~~G~~~i~L~v~~~N~~A~~fY~~~GF~~~~r 130 (140) T PRK03624 78 PDFRGRGIGRALLAHAEKKLIARGCPKINLQVREDNDAVLGFYEALGYEEEDR 130 (140) T ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEECCE T ss_conf 76765389999999999999976999999999088599999999889938328 No 3 >PRK07757 acetyltransferase; Provisional Probab=99.74 E-value=2.3e-17 Score=121.51 Aligned_cols=123 Identities=20% Similarity=0.258 Sum_probs=92.8 Q ss_pred EEEEECCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCC Q ss_conf 34420798999999998538987210013110388887607899998898999999877611882238997678851113 Q gi|254781162|r 34 FLERLEHDSSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQ 113 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~r 113 (192) |.||..+..|+.+|+++.-..-+.....-|-++.-.....--+||+.||+|||++.+.+. +.+...+..++|+|+|| T Consensus 2 ~~IR~A~~~Dv~~I~~Li~~~~~~g~Ll~Rs~~~i~~~i~~f~Vae~dg~IvGc~aL~~~---~~~~aEi~~l~V~p~~r 78 (152) T PRK07757 2 MMIRKARLSDVKAIHALINEYAKEGLMLPRSLDELYENIRDFYVAEEDGEIVGCCALHIL---WEDLAEIRSLAVSEEYR 78 (152) T ss_pred CEEEECCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCEEEEEECCEEEEEEEEEEC---CCCCEEEEEEEECHHHC T ss_conf 189859788899999999888768945268999999764749999999999999998977---99911067789837662 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEEE-CHHHHHHCCCEECCCCCE Q ss_conf 8995899999999999613884799840-865787429487463653 Q gi|254781162|r 114 NKGIGRKLISMSVDAAEKKGSQVIVLVG-DIAYYSKLGFQAVPWKSL 159 (192) Q Consensus 114 g~GiG~~Li~~~l~~a~~~g~~~v~l~g-~~~yY~rfGF~~~~~~~l 159 (192) |+|+|++|++++++.|+++|+..++++. .++||.+.||+.++...+ T Consensus 79 g~GiG~~Ll~~l~~~Ar~~G~~~lf~LTt~~~fF~~~GF~~~~~~~l 125 (152) T PRK07757 79 GKGIGRMLVEACLEEARELGVKRVFALTYQPEFFEKLGFREVDKEAL 125 (152) T ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHCCCEECCHHHC T ss_conf 88889999999999999869999999058667898789988883559 No 4 >PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional Probab=99.69 E-value=6.9e-16 Score=112.90 Aligned_cols=133 Identities=20% Similarity=0.310 Sum_probs=98.2 Q ss_pred EEEECCHHH---HHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCC Q ss_conf 442079899---99999985389872100131103888876078999988989999998776118822389976788511 Q gi|254781162|r 35 LERLEHDSS---ICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPL 111 (192) Q Consensus 35 ~~R~e~~~d---i~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~ 111 (192) +||+=+.+| |-+|+..+|.. -.++..+..-. ....+++|+..+++|||++.++.+. .++..+ .++|+|+ T Consensus 1 TIr~mt~~Dl~~v~~IE~~~f~~-PWs~~~f~~~~---~~~~~~~~~~~~~~ivGy~~~~~~~---de~~i~-~iaV~p~ 72 (144) T PRK09491 1 TISSLTTTDLPAAYHIEQRAHAF-PWSEKTFASNQ---GERYLNLKLTQNGQMAAFAITQVVL---DEATLF-NIAVDPD 72 (144) T ss_pred CCCCCCHHHHHHHHHHHHHHCCC-CCCHHHHHHHH---HCCCEEEEEEECCEEEEEEEEEEEC---CCEEEE-EEEECHH T ss_conf 93229899999999999986489-99999999775---5691799999899899999999817---918999-9999688 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEE------ECHHHHHHCCCEECCCCCEECCCC--CCHHHEEEEEC Q ss_conf 13899589999999999961388479984------086578742948746365366899--99566698435 Q gi|254781162|r 112 YQNKGIGRKLISMSVDAAEKKGSQVIVLV------GDIAYYSKLGFQAVPWKSLILPAP--VDPNRVLFLPL 175 (192) Q Consensus 112 ~rg~GiG~~Li~~~l~~a~~~g~~~v~l~------g~~~yY~rfGF~~~~~~~l~~p~p--v~~~~~l~l~L 175 (192) |||+|+|++||++.++.++..|...+.|- .+..+|+++||+.+....=..|.| -.+.-+|.++| T Consensus 73 ~r~~G~g~~Ll~~~~~~~~~~g~~~i~LEVr~sN~~A~~lY~k~GF~~~g~R~~YY~~~dg~EDAiiM~l~L 144 (144) T PRK09491 73 YQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFNEVTIRRNYYPTADGREDAIIMALPL 144 (144) T ss_pred HHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEEECEECCCCCCCCCCCCCEEEECCC T ss_conf 864897999999999999987997999999578789999999889989178878568978990216886279 No 5 >PTZ00330 acetyltransferase; Provisional Probab=99.65 E-value=5.1e-15 Score=107.79 Aligned_cols=126 Identities=23% Similarity=0.320 Sum_probs=92.3 Q ss_pred HCCEEEEEECCHHHHHHHHHHH----CCCCC--CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEE--EE-EECCCCE Q ss_conf 0120344207989999999985----38987--21001311038888760789999889899999987--76-1188223 Q gi|254781162|r 30 IGAFFLERLEHDSSICAMHADS----FGPGR--FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMT--PI-SIEKITG 100 (192) Q Consensus 30 ~~~~~~~R~e~~~di~ai~~~~----fgpgr--~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~--pv-~~~~~~~ 100 (192) +...|.||.-+.+|+.++.++- -.|.- .............+.....+||+.+++|||+..+. |- .-++.+. T Consensus 3 m~~tl~IR~a~~~Dl~~il~Ll~~L~~~~~~~~e~~~~~~~~~~~~~~~~~v~Vae~~g~iVg~~~l~i~~~~~~~g~~~ 82 (147) T PTZ00330 3 MSGSLELRDLEEGDLGSVLELLSHLTSAPALSQEELEQIAARRRLAGVVTRVFVHSPTQRIVGTASLFVEPKFTRGGKCV 82 (147) T ss_pred CCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCEEEEEEEEEEEEEEEECCCCE T ss_conf 98878988598889999999999854389978899999999873479943999995299799999999992133179536 Q ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC---HHHHHHCCCEECC Q ss_conf 899767885111389958999999999996138847998408---6578742948746 Q gi|254781162|r 101 HLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD---IAYYSKLGFQAVP 155 (192) Q Consensus 101 ~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~---~~yY~rfGF~~~~ 155 (192) ..+..|+|+|+|||+|||++||+++++.|+++|+..++|..+ .+||+++||+... T Consensus 83 ~~iEdvvV~~~~RG~GiG~~L~~~l~~~a~~~gc~kv~L~~~~~~~~FY~k~Gf~~~~ 140 (147) T PTZ00330 83 GHIEDVVVDPSYRGQGLGRALISDLCEIARSSGCYKVILDCTEDMVAFYKKLGFRACE 140 (147) T ss_pred EEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCEECC T ss_conf 9998899987765888899999999999998799799994795167999986999742 No 6 >PRK07922 N-acetylglutamate synthase; Validated Probab=99.64 E-value=1.3e-15 Score=111.32 Aligned_cols=127 Identities=11% Similarity=0.140 Sum_probs=93.1 Q ss_pred HCCEEEEEECCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEE-CCEEEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 012034420798999999998538987210013110388887607899998-8989999998776118822389976788 Q gi|254781162|r 30 IGAFFLERLEHDSSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAE-GKRIVGSVRMTPISIEKITGHLLGPIVV 108 (192) Q Consensus 30 ~~~~~~~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~-~~~IVG~i~~~pv~~~~~~~~~LgplaV 108 (192) +...+++|..+.+|+.+|+++.-.-...-...-|-++.-.....--+|++. +|+|||++.+.+.. .+...+..++| T Consensus 3 ~~~~v~IR~A~~~DV~~I~~LI~p~a~~G~LL~Rs~~~L~~~I~~F~Vae~~DG~IvGCaAL~~~~---~dlAEIrsLAV 79 (170) T PRK07922 3 MAPGIVVRRARTSDVPAIKRLVDTYADGRILLEKNLVTLYEAVQEFWVAEHLDGEVVGCGALHVLW---EDLAEVRTVAV 79 (170) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHHHHCCEEECCCHHHHHHHHCEEEEEEECCCCEEEEEEEEECC---CCHHHHEEEEE T ss_conf 788628874967779999999999986895204899999987552899994599689999746456---53213044588 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE-CHHHHHHCCCEECCCCCE Q ss_conf 511138995899999999999613884799840-865787429487463653 Q gi|254781162|r 109 HPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG-DIAYYSKLGFQAVPWKSL 159 (192) Q Consensus 109 ~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g-~~~yY~rfGF~~~~~~~l 159 (192) +|+|||+|+|++|+++.++.|+++|...++++. .++|+.+.||..+.+..+ T Consensus 80 ~p~~rg~G~G~~Lv~~l~~~Ar~lGi~~vFvLT~~~~fF~k~GF~ev~k~~l 131 (170) T PRK07922 80 DPAMRGHGVGHAIVERLLDVARELGLSRVFVLTFEVEFFARHGFVEIDGTPV 131 (170) T ss_pred CHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHCCCEECCCCCC T ss_conf 7878189849999999999999859986999978368999769987668849 No 7 >PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional Probab=99.63 E-value=1.1e-14 Score=105.81 Aligned_cols=126 Identities=15% Similarity=0.124 Sum_probs=91.0 Q ss_pred EEEEEECCHHHHHHHHHHH--CCCCCCCCEEEE--EE-CCCCCCCCEEEEEEECCEEEEEEEEEEEE--ECCCCEEEEEE Q ss_conf 0344207989999999985--389872100131--10-38888760789999889899999987761--18822389976 Q gi|254781162|r 33 FFLERLEHDSSICAMHADS--FGPGRFVRAAVL--LR-EQGMHDLSLSFLCAEGKRIVGSVRMTPIS--IEKITGHLLGP 105 (192) Q Consensus 33 ~~~~R~e~~~di~ai~~~~--fgpgr~~~~~~~--lR-~~~~~~~~lslVA~~~~~IVG~i~~~pv~--~~~~~~~~Lgp 105 (192) ...||+-+.+|+.|+.++. .++.......++ .. ...++. ...+||+.+|+|||++.++-.. ..+.....+.. T Consensus 3 ~v~IR~At~~D~~ai~~Ll~eL~~~~~d~~~~~~~~~~~~~~p~-~~~~Vae~dG~vvG~~~l~~~~~l~~~~~~~~I~~ 81 (144) T PRK10146 3 ACELRPATQYDTDAVYALICELKQAEFDHQAFRVGFNANLRDPN-MRYHLALLDGEVVGMIGLHLQFHLHHVNWIGEIQE 81 (144) T ss_pred CCEEEECCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCC-CEEEEEEECCEEEEEEEEEEECCCCCCCCEEEEEE T ss_conf 74405198888999999999864376787999999999854998-48999998997999999999635246883699999 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------HHHHHHCCCEECCCCCEE Q ss_conf 7885111389958999999999996138847998408------657874294874636536 Q gi|254781162|r 106 IVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD------IAYYSKLGFQAVPWKSLI 160 (192) Q Consensus 106 laV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rfGF~~~~~~~l~ 160 (192) ++|+|++||+|||++||.++++.|+++|+..+.|..+ ..||+|.||+... ++++ T Consensus 82 l~V~~~~Rg~GiG~~Ll~~a~~~Ar~~G~~~~~L~s~~~r~~Ah~FY~r~Gf~~~~-~~F~ 141 (144) T PRK10146 82 LVVMPQARGLNVGSKLLAWAEEEARQAGAEMTELSTNVKRHDAHRFYLREGYEQSH-FRFT 141 (144) T ss_pred EEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCEEEC-CCEE T ss_conf 99985641769999999999999998699189998499678999999986996335-4168 No 8 >TIGR01575 rimI ribosomal-protein-alanine acetyltransferase; InterPro: IPR006464 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones. Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. Members of this entry belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This entry covers prokaryotes and the archaea. It contains rimI, which catalyses the acetylation of the N-terminal alanine of ribosomal protein S18 .; GO: 0016407 acetyltransferase activity, 0006474 N-terminal protein amino acid acetylation. Probab=99.61 E-value=8.6e-15 Score=106.48 Aligned_cols=129 Identities=26% Similarity=0.404 Sum_probs=94.5 Q ss_pred HHHHHHHHHHHCCCCC-CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEE-EECCCCEEEEEEEEECCCCCCCCHH Q ss_conf 8999999998538987-2100131103888876078999988989999998776-1188223899767885111389958 Q gi|254781162|r 41 DSSICAMHADSFGPGR-FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPI-SIEKITGHLLGPIVVHPLYQNKGIG 118 (192) Q Consensus 41 ~~di~ai~~~~fgpgr-~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv-~~~~~~~~~LgplaV~P~~rg~GiG 118 (192) ..++.+|++.+|.-+. .+++++..-..-..-..+ ++|..|+++||++..+++ .+.-.++..+. +||+|++||+||| T Consensus 2 ~~~v~~ie~~af~~~~PW~~~~f~~~l~i~~~~~y-l~a~~g~~~~Gy~~~~r~~~~~~~Ea~i~~-iaV~pe~Qg~GiG 79 (140) T TIGR01575 2 LKAVLEIEKAAFTVDSPWTEAAFAEELAINYYTCY-LLAEIGGKVVGYIGISRIVQIVLDEAHILN-IAVKPEYQGQGIG 79 (140) T ss_pred HHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCEE-EEEEECCEEEEEEEEEEEEEEECCCCEEEE-EEECCCCCCCCHH T ss_conf 47999998987169999887898865101333014-456628778987765678875437744899-9887120166378 Q ss_pred HHHHHHHHHHHHHCCCCEEEEE---E---CHHHHHHCCCEECCCCCEECCCCC--CHHHEE Q ss_conf 9999999999961388479984---0---865787429487463653668999--956669 Q gi|254781162|r 119 RKLISMSVDAAEKKGSQVIVLV---G---DIAYYSKLGFQAVPWKSLILPAPV--DPNRVL 171 (192) Q Consensus 119 ~~Li~~~l~~a~~~g~~~v~l~---g---~~~yY~rfGF~~~~~~~l~~p~pv--~~~~~l 171 (192) ++|+++.++.++..|...++|- + +..+|+|+||+.+....=..|.|. .+..+| T Consensus 80 ~~LL~~L~~~~~~~~~~~~~LEVR~SN~~A~~LY~k~GF~~~~~r~~YY~~~~GrEDA~~M 140 (140) T TIGR01575 80 RALLRELIKEAKGRGVNEIFLEVRVSNVAAQALYKKLGFNEIAIRKNYYPDPEGREDAIVM 140 (140) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCCCC T ss_conf 8999999999851898389984614068999987771850101043568887962223039 No 9 >PRK10140 putative acetyltransferase YhhY; Provisional Probab=99.59 E-value=3.1e-14 Score=103.24 Aligned_cols=123 Identities=16% Similarity=0.156 Sum_probs=86.2 Q ss_pred CEEEEEECCHHHHHHHHHHHCCCC----C-----CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCC-EE Q ss_conf 203442079899999999853898----7-----2100131103888876078999988989999998776118822-38 Q gi|254781162|r 32 AFFLERLEHDSSICAMHADSFGPG----R-----FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKIT-GH 101 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~~~~fgpg----r-----~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~-~~ 101 (192) ++++||+-+++|++||.+.--.|. . .+.....-|....+ ....|||+.+|+|||++.+......... .. T Consensus 2 s~i~IR~~~~~D~~ai~~i~~~~~v~~~t~~~p~p~~~~~~~~l~~~~-~~~~lVa~~dg~vvG~~~~~~~~~~r~~h~a 80 (162) T PRK10140 2 SEIVIRHAETRDYEAIRQIHAQPEVYHNTLQVPHPSDHMWQERLTDRP-GIKQLVACIDGIVVGHLTIDVQQRPRRSHVA 80 (162) T ss_pred CCEEEECCCHHHHHHHHHHHHCHHHHCCCCCCCCCCHHHHHHHHHCCC-CCCEEEEEECCEEEEEEEEECCCCCCEEEEE T ss_conf 956985399789999999984935441606799969999999985288-9847999999999999998726787222699 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHH-CCCCEEEEE---EC---HHHHHHCCCEECCC Q ss_conf 997678851113899589999999999961-388479984---08---65787429487463 Q gi|254781162|r 102 LLGPIVVHPLYQNKGIGRKLISMSVDAAEK-KGSQVIVLV---GD---IAYYSKLGFQAVPW 156 (192) Q Consensus 102 ~LgplaV~P~~rg~GiG~~Li~~~l~~a~~-~g~~~v~l~---g~---~~yY~rfGF~~~~~ 156 (192) .+| ++|+|+|||+|||++||+.+++.|.+ .+...+.|. .+ ...|+|+||+.... T Consensus 81 ~~g-i~V~~~~~g~GiG~~Ll~~~i~~a~~~~~l~ri~l~V~~~N~~Ai~Ly~k~GF~~eG~ 141 (162) T PRK10140 81 DFG-ICVDSRWKNRGVASALMREMIDMCDNWLRVDRIELTVFVDNAPAIKVYKKYGFEIEGT 141 (162) T ss_pred EEE-EEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEECCCHHHHHHHHHCCCEEEEE T ss_conf 999-9999888698999999999999997316869999999869999999999889989988 No 10 >PRK05279 N-acetylglutamate synthase; Validated Probab=99.50 E-value=1.4e-13 Score=99.29 Aligned_cols=123 Identities=15% Similarity=0.162 Sum_probs=95.6 Q ss_pred EEEECCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCC Q ss_conf 44207989999999985389872100131103888876078999988989999998776118822389976788511138 Q gi|254781162|r 35 LERLEHDSSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQN 114 (192) Q Consensus 35 ~~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg 114 (192) .+|.-+..|+..|.++.-.-...--...|-|+.-....+--+|.|.||.||||+.+.|.. ......++.++|+|+||+ T Consensus 296 ~iR~A~~~Dv~~Il~Li~Ple~~G~Lv~Rsre~le~eI~~f~V~e~Dg~iigCaALy~~~--~~~~aElaclav~~~y~~ 373 (441) T PRK05279 296 QLRRATIDDVGGILELIRPLEEQGILVRRSREQLEREIDQFTVIERDGVIIGCAALYPFP--EEKMGEMACLAVHPDYRG 373 (441) T ss_pred HHCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCEEEEEECCCEEEEEEEEECC--CCCEEEEEEEEECHHHCC T ss_conf 835686767999999978898679700189999997646369999889599999600038--778068888898786618 Q ss_pred CCHHHHHHHHHHHHHHHCCCCEEEEEEC--HHHHHHCCCEECCCCCE Q ss_conf 9958999999999996138847998408--65787429487463653 Q gi|254781162|r 115 KGIGRKLISMSVDAAEKKGSQVIVLVGD--IAYYSKLGFQAVPWKSL 159 (192) Q Consensus 115 ~GiG~~Li~~~l~~a~~~g~~~v~l~g~--~~yY~rfGF~~~~~~~l 159 (192) +|+|.+|++++.+.|+++|...++++.. ..||...||+......+ T Consensus 374 ~g~G~~Ll~~~e~~A~~~g~~~lF~LTTqt~hwF~e~GF~~~~~~~L 420 (441) T PRK05279 374 SGRGERLLKRIEQRARQQGLSRLFVLTTRTAHWFLERGFVPADVDDL 420 (441) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCCCCHHHC T ss_conf 98799999999999998699879999656388899759830892769 No 11 >cd04301 GNAT GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases. Probab=99.46 E-value=5.8e-13 Score=95.76 Aligned_cols=76 Identities=25% Similarity=0.455 Sum_probs=65.7 Q ss_pred EEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------HHHHHHC Q ss_conf 9999889899999987761188223899767885111389958999999999996138847998408------6578742 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD------IAYYSKL 149 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rf 149 (192) |||+++|+|||++.+.+.... .+...+..++|+|++||+|+|++||+++++.|++.|+..+.+... ..||+|+ T Consensus 2 ~Vae~~g~ivG~~~~~~~~~~-~~~~~i~~i~V~~~~rg~GiG~~L~~~~~~~a~~~~~~~i~l~v~~~N~~a~~~y~k~ 80 (83) T cd04301 2 LVAEDDGELVGFASFFIYRPA-PREAEIEGLAVDPSYRGQGLGSALLDALIDWALERGARRLYLDTSPDNEAAIAFYEKL 80 (83) T ss_pred EEEEECCEEEEEEEEEEECCC-CCEEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHC T ss_conf 999999989999999997389-9979999999999992999389999999999998799899999928989999999976 Q ss_pred CCE Q ss_conf 948 Q gi|254781162|r 150 GFQ 152 (192) Q Consensus 150 GF~ 152 (192) ||+ T Consensus 81 Gft 83 (83) T cd04301 81 GFT 83 (83) T ss_pred CCC T ss_conf 999 No 12 >PRK01346 hypothetical protein; Provisional Probab=99.46 E-value=8.8e-13 Score=94.70 Aligned_cols=123 Identities=14% Similarity=0.229 Sum_probs=92.3 Q ss_pred CEEEEEECCHHHHHHHHHH---HCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEE--E-CCC--CEEEE Q ss_conf 2034420798999999998---53898721001311038888760789999889899999987761--1-882--23899 Q gi|254781162|r 32 AFFLERLEHDSSICAMHAD---SFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPIS--I-EKI--TGHLL 103 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~~~---~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~--~-~~~--~~~~L 103 (192) ..|.+|+-++.|.++..++ +|+.-...+.....|.. ....-.+.+.+++++||.+..+|.. + +|. +.-++ T Consensus 5 ~~~~iR~l~~~d~~~~~~l~~~aF~~~~~~e~~~~~~~~--~~~~~~~ga~d~~~lVg~~~~~~~~l~vpgG~~vp~agI 82 (411) T PRK01346 5 DTITIRVLTEDDWPAWFRAAARGFGDSPPDEELEAWRTL--VEPDRTLGAFDGDDVVGTALAFDLELTVPGGAVLPAAAV 82 (411) T ss_pred CCCEEECCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHC--CCCCCEEEEEECCEEEEEEEEECEEEEEECCCEEECCCE T ss_conf 651654099899999999999873999986889988732--686746999999999999996144877308838871617 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE-CHHHHHHCCCEECCC Q ss_conf 76788511138995899999999999613884799840-865787429487463 Q gi|254781162|r 104 GPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG-DIAYYSKLGFQAVPW 156 (192) Q Consensus 104 gplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g-~~~yY~rfGF~~~~~ 156 (192) +.|+|.|+|||+|+.++||++.++.++++|....+|.. ...||+||||..+.. T Consensus 83 t~Vav~PehRrrG~~~~Lm~~~L~~~r~~G~~ls~L~pse~~iY~RfGyg~at~ 136 (411) T PRK01346 83 TAVTVAPTHRRRGLLTALMREQLRRIAERGEPVAALTASEGGIYGRFGYGVATY 136 (411) T ss_pred EEEEECHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCEEEEE T ss_conf 899988310678589999999999999779839999506766014654213067 No 13 >COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism] Probab=99.43 E-value=2.6e-13 Score=97.77 Aligned_cols=120 Identities=20% Similarity=0.271 Sum_probs=88.9 Q ss_pred EEEEECCHHHHHHHHHHH--C-CCCC-CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 344207989999999985--3-8987-21001311038888760789999889899999987761188223899767885 Q gi|254781162|r 34 FLERLEHDSSICAMHADS--F-GPGR-FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVH 109 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~--f-gpgr-~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~ 109 (192) +.+|--+.+|+.+|.++. + .+|- -.++...+-+ ....+ .|++.+|.|||++.+.|. ...+.-.+..|+|+ T Consensus 1 ~~iR~A~~~Di~~I~~Li~~~~~~gil~~rs~~~le~---~i~dF-~i~E~~g~viGCaaL~~~--~~~~~gE~~~laV~ 74 (153) T COG1246 1 EQIRKARISDIPAILELIRPLELQGILLRRSREQLEE---EIDDF-TIIERDGKVIGCAALHPV--LEEDLGELRSLAVH 74 (153) T ss_pred CCEEECCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH---HHHHH-EEEEECCCEEEEEECCCC--CCCCEEEEEEEEEC T ss_conf 9100032024288999999974146420315989999---87655-243227938999840356--75672567777877 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CHHHHHHCCCEECCCCCE Q ss_conf 11138995899999999999613884799840--865787429487463653 Q gi|254781162|r 110 PLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG--DIAYYSKLGFQAVPWKSL 159 (192) Q Consensus 110 P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g--~~~yY~rfGF~~~~~~~l 159 (192) |+||++|+|..|+++++..|++.|...++++. .+.|++++||+.+....+ T Consensus 75 pd~r~~G~G~~Ll~~~~~~Ar~~gi~~~f~LTt~~~~~F~~~GF~~vd~~~L 126 (153) T COG1246 75 PDYRGSGRGERLLERLLADARELGIKELFVLTTRSPEFFAERGFTRVDKDEL 126 (153) T ss_pred HHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCEECCCCCC T ss_conf 8763887089999999999887497013433146688999809866763538 No 14 >PRK10975 TDP-fucosamine acetyltransferase; Provisional Probab=99.41 E-value=3.6e-12 Score=91.13 Aligned_cols=120 Identities=18% Similarity=0.277 Sum_probs=82.9 Q ss_pred EEEEECCHHHHHHHHHHH---CCCCCCCCEE---------EEEE----CCCCCCCCEEEEEEE-CCEEEEEEEEEEEEEC Q ss_conf 344207989999999985---3898721001---------3110----388887607899998-8989999998776118 Q gi|254781162|r 34 FLERLEHDSSICAMHADS---FGPGRFVRAA---------VLLR----EQGMHDLSLSFLCAE-GKRIVGSVRMTPISIE 96 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~---fgpgr~~~~~---------~~lR----~~~~~~~~lslVA~~-~~~IVG~i~~~pv~~~ 96 (192) +..+..++.||.+|..++ |.-.||--+- |... -... -...|+|+.. +|+|+|.|.+-....+ T Consensus 83 ~~~~~A~~~Dip~L~~lA~~aF~~SRFr~P~~~~~~~~rfY~~Wie~aV~G~-fD~~cLv~~~~~g~i~GFvtlr~l~~~ 161 (231) T PRK10975 83 IGAEVATETDIPALRQLAAQAFAQSRFRAPWYAPDASGRFYAQWIENAVRGT-FDHQCLILRDATGQIRGFVSLRELNDT 161 (231) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CCCEEEEEECCCCCEEEEEEEEECCCC T ss_conf 7631177757999999998651021257987797898789999999984666-585689998689977899999852798 Q ss_pred CCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE---EC---HHHHHHCCCEECCCC Q ss_conf 82238997678851113899589999999999961388479984---08---657874294874636 Q gi|254781162|r 97 KITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV---GD---IAYYSKLGFQAVPWK 157 (192) Q Consensus 97 ~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~---g~---~~yY~rfGF~~~~~~ 157 (192) . + -+|-++|+|++||+|||++||++++++|+++|...+.+. ++ ..+|.|+||+...+. T Consensus 162 ~--a-~IgLlAV~P~~rG~GiG~~Lm~aa~~wa~~~g~~~l~l~tq~~N~~AirLY~~~GF~ie~t~ 225 (231) T PRK10975 162 D--A-RIGLLAVFPGAQGRGIGAELMQAALNWAQARGLTTLRVATQMGNTAALKLYIQSGANIESTA 225 (231) T ss_pred C--C-EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCEEEEEE T ss_conf 6--3-78898898301279889999999999999859978999985789999999998698275679 No 15 >KOG3139 consensus Probab=99.40 E-value=5.6e-12 Score=90.01 Aligned_cols=111 Identities=20% Similarity=0.293 Sum_probs=85.4 Q ss_pred HHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCE-EEEEEEEE-EEEECCCCEEEEEEEEECCCCCCCCHHH Q ss_conf 999999998538987210013110388887607899998898-99999987-7611882238997678851113899589 Q gi|254781162|r 42 SSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKR-IVGSVRMT-PISIEKITGHLLGPIVVHPLYQNKGIGR 119 (192) Q Consensus 42 ~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~-IVG~i~~~-pv~~~~~~~~~LgplaV~P~~rg~GiG~ 119 (192) .+|-.+.+..+ ....+.+.-|-...+-+.++|+|.+++. -||.+... +...+ .-..-+..+||+++|||+|||+ T Consensus 28 ~~im~Li~k~l---sepyS~~tyrYf~~~wp~~~~~a~d~~~~~VGai~ck~~~~r~-~~rgyi~mLaV~~e~Rg~GIg~ 103 (165) T KOG3139 28 ADIMRLIDKDL---SEPYSIYTYRYFVPNWPCFCFLALDEKGDTVGAIVCKLDTHRN-TLRGYIAMLAVDSEYRGQGIGK 103 (165) T ss_pred HHHHHHHHHHC---CCHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEEEEECCCCCC-CCEEEEEEEEECHHHCCCCHHH T ss_conf 88998874014---8566777877502578668999990788668899983445577-6448999999612533456899 Q ss_pred HHHHHHHHHHHHCCCCEEEEEEC------HHHHHHCCCEECCC Q ss_conf 99999999996138847998408------65787429487463 Q gi|254781162|r 120 KLISMSVDAAEKKGSQVIVLVGD------IAYYSKLGFQAVPW 156 (192) Q Consensus 120 ~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rfGF~~~~~ 156 (192) +|++.+|+.++.+|+..|+|-.+ ..+|+++||+.... T Consensus 104 aLvr~aId~m~~~g~~eVvLeTe~~n~~A~~LY~sLGF~r~~r 146 (165) T KOG3139 104 ALVRKAIDAMRSRGYSEVVLETEVTNLSALRLYESLGFKRDKR 146 (165) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCEEECC T ss_conf 9999999999977996899832433418999988549467331 No 16 >PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional Probab=99.39 E-value=1.3e-13 Score=99.54 Aligned_cols=121 Identities=16% Similarity=0.165 Sum_probs=90.7 Q ss_pred CEEEEEECCHHHHHHHHHHH--CCC-CC-CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 20344207989999999985--389-87-210013110388887607899998898999999877611882238997678 Q gi|254781162|r 32 AFFLERLEHDSSICAMHADS--FGP-GR-FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIV 107 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~~~~--fgp-gr-~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~Lgpla 107 (192) .-|.||..+-+||++|+.++ |+. |- -+|+.--+-++ ..++ +|||.+|+|+|+..++-...+ -..+-.++ T Consensus 462 ~~i~iR~Arl~Dv~~i~~lv~~~A~~G~~LpR~~~~l~~~---i~~f-~vaE~~g~v~g~~sl~i~~~~---LAEIrsl~ 534 (614) T PRK12308 462 SGVKVRPARLTDIDAIEGMVAYWAGLGENLPRTRNELVRD---IGSF-AVAEHHGEVTGCASLYIYDSG---LAEIRSLG 534 (614) T ss_pred CCCEEECCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHH---HHHH-EEEEECCEEEEEEEEEEEECC---HHHHHHHH T ss_conf 8736632567738999999999864124588877889998---7664-266557818888789886167---19999861 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-HHHHHHCCCEECCCCCE Q ss_conf 85111389958999999999996138847998408-65787429487463653 Q gi|254781162|r 108 VHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD-IAYYSKLGFQAVPWKSL 159 (192) Q Consensus 108 V~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~-~~yY~rfGF~~~~~~~l 159 (192) |.|+||++|+|+.|+++.++.|++.+...|+++.. |+||.+.||..+++..+ T Consensus 535 v~~~~~~~G~G~~lV~~~l~~a~~~~~~rvfvLT~~p~fF~k~gf~~~~k~~l 587 (614) T PRK12308 535 VEAGWQVQGQGKALVQYLVEKARQMAIKKVFVLTRVPEFFMKQGFSPTSKSLL 587 (614) T ss_pred CCHHHHHCCCHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHCCCEECCHHHC T ss_conf 67877742832899999999999837875899843718899759820784457 No 17 >PRK09831 hypothetical protein; Provisional Probab=99.34 E-value=4.1e-12 Score=90.82 Aligned_cols=128 Identities=13% Similarity=0.157 Sum_probs=84.6 Q ss_pred EEEEECCHHHHHHHHHHHC------CCCCCCCEEEEEEC-------CCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCE Q ss_conf 3442079899999999853------89872100131103-------8888760789999889899999987761188223 Q gi|254781162|r 34 FLERLEHDSSICAMHADSF------GPGRFVRAAVLLRE-------QGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITG 100 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~f------gpgr~~~~~~~lR~-------~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~ 100 (192) |.||+=+++|..++.++-+ ++..++.....-.- ...-.....+||+.++++||++.+. + . T Consensus 1 m~IR~~~~~D~~~l~~lf~~aV~~~~~~~Ys~~Q~~aWa~~~~~~w~~~l~~~~~~VA~~~~~ivGF~~~~----~---~ 73 (147) T PRK09831 1 IQIRNYQPGDFQQLCAIFIRAVTMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFITCI----E---H 73 (147) T ss_pred CEEEECCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHCCEEEEEECCEEEEEEECC----C---C T ss_conf 97442777779999999999998634012999999988679989999987509569999999988988548----8---8 Q ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE---EECHHHHHHCCCEECCCCCEECCCCCCHHHEEEEE Q ss_conf 899767885111389958999999999996138847998---40865787429487463653668999956669843 Q gi|254781162|r 101 HLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL---VGDIAYYSKLGFQAVPWKSLILPAPVDPNRVLFLP 174 (192) Q Consensus 101 ~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l---~g~~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~ 174 (192) .|..++|+|+|||+|||++|+++.+..+ ..+.+ ..+-+||+|.||+.+....+..-|-.=+++.|-+. T Consensus 74 -~id~lfV~p~~~g~Gvg~~Ll~~l~~~~-----~~l~~~aS~~A~pFFe~~Gf~vv~~q~v~~~G~~l~n~~M~K~ 144 (147) T PRK09831 74 -YIDMLFVDPEYTRRGVASALLKPLIKSE-----SELTVDASITAKPFFERYGFQTVKQQRVECRGAWFTNFYMRYK 144 (147) T ss_pred -EEEEEEECHHHCCCCHHHHHHHHHHHHC-----CCEEEEHHHHHHHHHHHCCCEEEEEEEEEECCEEECCCEEEEC T ss_conf -0899998866538989999999998417-----2058631576388899769799877568569989707042747 No 18 >PRK10314 hypothetical protein; Provisional Probab=99.33 E-value=2e-11 Score=86.79 Aligned_cols=130 Identities=11% Similarity=0.161 Sum_probs=87.4 Q ss_pred HHHHHHHHHHHCCCCCCCCEEEEEECCCCC-CCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHH Q ss_conf 899999999853898721001311038888-7607899998898999999877611882238997678851113899589 Q gi|254781162|r 41 DSSICAMHADSFGPGRFVRAAVLLREQGMH-DLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGR 119 (192) Q Consensus 41 ~~di~ai~~~~fgpgr~~~~~~~lR~~~~~-~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~ 119 (192) --+|-.|...+|--.-. ..+..-++.+. ..++-+++.+++++||+.++.+.... .....+|.|+|++++||+|+|+ T Consensus 17 ly~il~lR~~VFV~EQ~--~p~~d~D~~D~~~~~~H~~~~~~~~~va~~Ri~~~~~~-~~~~~IGRVaV~k~~Rg~glG~ 93 (153) T PRK10314 17 LYALLQLRCAVFVVEQN--CPYQDIDGDDLTGDNRHILGWKNDELVAYARILKSDDD-LEPVVIGRVIVSEALRGEKVGQ 93 (153) T ss_pred HHHHHHHHHHHEEECCC--CCCHHHHCCCCCCCCEEEEEEECCEEEEEEEEECCCCC-CCCEEEEEEEECHHHCCCCHHH T ss_conf 99999988873780179--89432215666676189999959938899987258988-7734787899758784998999 Q ss_pred HHHHHHHHHHHHCC-CCEEEEEEC---HHHHHHCCCEECCCCCEECCCCCCHHHEEEEECC Q ss_conf 99999999996138-847998408---6578742948746365366899995666984357 Q gi|254781162|r 120 KLISMSVDAAEKKG-SQVIVLVGD---IAYYSKLGFQAVPWKSLILPAPVDPNRVLFLPLV 176 (192) Q Consensus 120 ~Li~~~l~~a~~~g-~~~v~l~g~---~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L~ 176 (192) +||+.+++.+++.. ...+.+... .+||++|||+++.. ..+...+| .--|.++|. T Consensus 94 ~Lm~~~l~~~~~~~~~~~i~l~AQ~~~~~FY~~~GF~~~g~--~f~E~GIp-Hi~M~k~l~ 151 (153) T PRK10314 94 QLMSKTLESCTRHWPDKPLYLGAQAHLQNFYASFGFIPVTD--VYEEDGIP-HIGMAREVI 151 (153) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHCCCEECCC--CEEECCCC-HHHHHHHHH T ss_conf 99999999999977999789962899999999879988898--45538985-798877753 No 19 >pfam00583 Acetyltransf_1 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins. Probab=99.32 E-value=1.2e-11 Score=88.09 Aligned_cols=74 Identities=32% Similarity=0.421 Sum_probs=63.1 Q ss_pred EEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------HHHHHHCCC Q ss_conf 99889899999987761188223899767885111389958999999999996138847998408------657874294 Q gi|254781162|r 78 CAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD------IAYYSKLGF 151 (192) Q Consensus 78 A~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rfGF 151 (192) ++++|+|||++.+++.... .+...+..++|+|+|||+|+|++||+++++.+++.|+..+.+... ..||+|+|| T Consensus 1 ~~~~g~ivG~~~~~~~~~~-~~~~~i~~l~V~p~~rg~GiG~~Ll~~~~~~~~~~g~~~i~l~~~~~n~~a~~~y~k~GF 79 (80) T pfam00583 1 AEEDGELVGFASLSIIDEE-GNVAEIEGLAVDPEYRGKGIGTALLEALEEYARELGLKRIELEVLEDNEAAIALYEKLGF 79 (80) T ss_pred CCCCCEEEEEEEEEEECCC-CCEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCC T ss_conf 9678989999999996489-999999999999899599879999999999999879999999992787999999998697 Q ss_pred E Q ss_conf 8 Q gi|254781162|r 152 Q 152 (192) Q Consensus 152 ~ 152 (192) + T Consensus 80 ~ 80 (80) T pfam00583 80 K 80 (80) T ss_pred C T ss_conf 9 No 20 >PRK10562 hypothetical protein; Provisional Probab=99.31 E-value=8e-12 Score=89.10 Aligned_cols=126 Identities=17% Similarity=0.293 Sum_probs=76.5 Q ss_pred EEEECCHHHHHHHHHHHCC---------CCC-CCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEE Q ss_conf 4420798999999998538---------987-210013110388887607899998898999999877611882238997 Q gi|254781162|r 35 LERLEHDSSICAMHADSFG---------PGR-FVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLG 104 (192) Q Consensus 35 ~~R~e~~~di~ai~~~~fg---------pgr-~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~Lg 104 (192) +||+-+++|+.+|-++=.. +.. .......+|+.--+.. -.+|++++|+|+|.+.+.. ++ .++ T Consensus 1 mIR~~~~~D~~~l~~IW~~s~~~ah~Fl~~~~~~~~~~~vr~~~lp~~-~~~V~e~~g~ivGFi~~~~---~~----~i~ 72 (145) T PRK10562 1 MIREYQPSDLPAILQLWLESTIWAHPFIKEQYWRESAPLVRDVYLPAA-QTWVWEEDGKLLGFVSILE---GR----FVG 72 (145) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCC-EEEEEEECCEEEEEEEECC---CC----EEE T ss_conf 976387667999999999999866745598999986999998646656-3999996996899999569---99----761 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC---HHHHHHCCCEECCCCCEECCCCCCHHHEEE Q ss_conf 67885111389958999999999996138847998408---657874294874636536689999566698 Q gi|254781162|r 105 PIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD---IAYYSKLGFQAVPWKSLILPAPVDPNRVLF 172 (192) Q Consensus 105 plaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~---~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~ 172 (192) .|+|+|++||+|||++||+++.+... ...+-+...+ ..||+|.||+.+....- +..-.|+..|. T Consensus 73 ~LfV~p~~~g~GiG~~Ll~~a~~~~~--~l~L~v~~~N~~A~~FY~~~GF~~~~~~~~--~~tg~p~~~m~ 139 (145) T PRK10562 73 ALFVAPKAHRRGIGKALMQYVQQRYP--HLSLEVYQKNQRAVNFYHAQGFRIVDCAWQ--EETQHPTWIMS 139 (145) T ss_pred EEEECHHHCCCCHHHHHHHHHHHHCC--CEEEEECCCCHHHHHHHHHCCCEEEEEECC--CCCCCCEEEEE T ss_conf 16887643378899999999996389--349998257978999999889999623368--88899159999 No 21 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=99.23 E-value=3.6e-10 Score=79.39 Aligned_cols=123 Identities=21% Similarity=0.298 Sum_probs=84.9 Q ss_pred EEEEEE----CCHHHHHHHHHHHCC--C--CCCCCEEEEEECCC-CCCCCEEEEEEE--CCEEEEEEEEEEEEECCCCEE Q ss_conf 034420----798999999998538--9--87210013110388-887607899998--898999999877611882238 Q gi|254781162|r 33 FFLERL----EHDSSICAMHADSFG--P--GRFVRAAVLLREQG-MHDLSLSFLCAE--GKRIVGSVRMTPISIEKITGH 101 (192) Q Consensus 33 ~~~~R~----e~~~di~ai~~~~fg--p--gr~~~~~~~lR~~~-~~~~~lslVA~~--~~~IVG~i~~~pv~~~~~~~~ 101 (192) -+.+|+ .++.+.-+++|.+|. | |+-+...+..|... -+++...|++++ +++|+|++ .+++.-.....- T Consensus 149 Gv~lr~f~~~~De~~wl~vnn~AFa~HPEqg~wt~~dl~~r~~e~WFdP~g~lla~~~~~g~lvGF~-wtk~h~~~~~~G 227 (292) T TIGR03448 149 GVTVRAYVGAPDDAEWLRVNNAAFAWHPEQGGWTRADLAERRAEPWFDPAGLFLAFDDAPGELLGFH-WTKVHPDEPALG 227 (292) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCCCEEEEECCCCEEEEEE-EEEECCCCCCCC T ss_conf 8688436787767999999999974598678875678998750777484773899847998399999-998637888852 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------HHHHHHCCCEECCC Q ss_conf 99767885111389958999999999996138847998408------65787429487463 Q gi|254781162|r 102 LLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD------IAYYSKLGFQAVPW 156 (192) Q Consensus 102 ~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rfGF~~~~~ 156 (192) .+..+.|+|++||+|+|+.|+..++..++.+|...+.|.-+ ...|+++||..... T Consensus 228 EvYvvGV~p~~rGrGLG~aL~~~gL~~l~~~G~~~v~LyVd~dN~~A~~lY~~LGF~~~~~ 288 (292) T TIGR03448 228 EVYVVGVDPAAQGRGLGDALTLIGLHHLAARGLPAVMLYVEADNEAAVRTYEKLGFTVAEV 288 (292) T ss_pred EEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEEEEE T ss_conf 5999998834247767899999999999987998899983788889999998569958400 No 22 >COG0456 RimI Acetyltransferases [General function prediction only] Probab=99.21 E-value=9.1e-11 Score=82.90 Aligned_cols=143 Identities=20% Similarity=0.317 Sum_probs=89.6 Q ss_pred HHHCCEEEEEECCHHHHH-----HHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEEC---C----EEEEEEEEEEEEE Q ss_conf 320120344207989999-----999985389872100131103888876078999988---9----8999999877611 Q gi|254781162|r 28 QKIGAFFLERLEHDSSIC-----AMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEG---K----RIVGSVRMTPISI 95 (192) Q Consensus 28 ~~~~~~~~~R~e~~~di~-----ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~---~----~IVG~i~~~pv~~ 95 (192) ......+.+|.....|+. .++..+|..- ...+.............+.+|+..+ + +++|++..+.... T Consensus 6 ~~~~~~~~ir~~~~~d~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~v~~~~~~~~~~~~~~~G~~~~~~~~~ 84 (177) T COG0456 6 ELSEDKVTIREAINKDLLDVALAALEARTFDIR-LPWSREYFEKDLTQAPELLLVAETGGLDGLLDGKVVGFLLVRVVDG 84 (177) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHCCCCCEEEEEECCCCCCCCCCCEEEEEEEEEECC T ss_conf 667555653314076656789988877515767-6430999999860487369999626666654310458999974047 Q ss_pred CC---CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC-CEEEEEE------CHHHHHHCCCEECCCCCEECCCCC Q ss_conf 88---2238997678851113899589999999999961388-4799840------865787429487463653668999 Q gi|254781162|r 96 EK---ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGS-QVIVLVG------DIAYYSKLGFQAVPWKSLILPAPV 165 (192) Q Consensus 96 ~~---~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~-~~v~l~g------~~~yY~rfGF~~~~~~~l~~p~pv 165 (192) .. ..++ +..++|+|+|||+|||++|++++++.+++.+. ..+.|.- +..+|+|+||+......-..+.+. T Consensus 85 ~~~~~~~~~-i~~iaV~p~~r~~Gig~~Ll~~~~~~~~~~~~~~~~~L~V~~~N~~Ai~lY~~~GF~~~~~~~~yy~~~~ 163 (177) T COG0456 85 RPSADHEGH-IYNLAVDPEYRGRGIGRALLDEALERLRERGLADKIVLEVRESNEAAIGLYRKLGFEVVKIRKNYYADGN 163 (177) T ss_pred CCCCCCCEE-EEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHCCCEEEHHHHHHHHCCC T ss_conf 656676089-9999999787699899999999999998527673699999478699999999869943121375402455 Q ss_pred CHHHEEE Q ss_conf 9566698 Q gi|254781162|r 166 DPNRVLF 172 (192) Q Consensus 166 ~~~~~l~ 172 (192) +..+.|. T Consensus 164 ~~a~~~~ 170 (177) T COG0456 164 GDALLML 170 (177) T ss_pred CCHHHHH T ss_conf 6499998 No 23 >TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein. Probab=99.21 E-value=2e-10 Score=80.97 Aligned_cols=146 Identities=20% Similarity=0.269 Sum_probs=104.9 Q ss_pred HHHHCCEEEEEECCHHHHHHHHHHHCCCC-CCCCEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEE---EEEECCCCE Q ss_conf 33201203442079899999999853898-7210013110388887607899998--89899999987---761188223 Q gi|254781162|r 27 WQKIGAFFLERLEHDSSICAMHADSFGPG-RFVRAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMT---PISIEKITG 100 (192) Q Consensus 27 ~~~~~~~~~~R~e~~~di~ai~~~~fgpg-r~~~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~---pv~~~~~~~ 100 (192) .+....|..-|....+|.+++..+--.-| -.......+..+......+ |||++ .|.|||+++-- ....+.+.+ T Consensus 77 ~~~~~g~~ir~~~~~~d~~~in~iy~~~~Mvp~~~~~~~~~~~~~~~~y-~vA~d~~tg~ivg~v~gvdH~~af~dpe~g 155 (547) T TIGR03103 77 ARTPRGFTVRRLRGPADVDAINRLYAARGMVPVRVDFVLDHRHSRAITY-LVAEDEASGAIIGTVMGVDHRKAFNDPEHG 155 (547) T ss_pred CCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCCCCEE-EEEEECCCCCEEEEEECCCHHHHCCCCCCC T ss_conf 7788974899678766799999999976984799777315566787259-998607999579998654268652897678 Q ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE---EEC---HHHHHHCCCEECCCCCEECCCCCCHHHEEEE Q ss_conf 899767885111389958999999999996138847998---408---6578742948746365366899995666984 Q gi|254781162|r 101 HLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL---VGD---IAYYSKLGFQAVPWKSLILPAPVDPNRVLFL 173 (192) Q Consensus 101 ~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l---~g~---~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l 173 (192) -.|..|+|+|....-|+|.+|+++..+.-+.+|...+-| ..+ ..+|+|+||+.+|...+.--.+++..-|..- T Consensus 156 sSlW~LaVdpqa~~pgvGeaLvr~lae~~~~rg~~~~DLSV~hdN~~Ai~lY~kLgF~~~p~f~vKrkn~iNe~lf~gp 234 (547) T TIGR03103 156 SSLWCLAVDPQAAHPGVGEALVRALAEHFQSRGCAYMDLSVMHDNEQAIALYEKLGFRRIPVFALKRKNAINERLFSGP 234 (547) T ss_pred CEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCEECCEEEEECCCCCCCCCCCCC T ss_conf 3136886668889998129999999999987693105110104868899999971975564578860575674324699 No 24 >PRK10514 hypothetical protein; Provisional Probab=99.20 E-value=1e-10 Score=82.65 Aligned_cols=115 Identities=18% Similarity=0.240 Sum_probs=70.9 Q ss_pred EEEEECCHHHHHHHHHHH---------CCCCCC-CCEEEEEECCCCCCCCEEEEE-EECCEEEEEEEEEEEEECCCCEEE Q ss_conf 344207989999999985---------389872-100131103888876078999-988989999998776118822389 Q gi|254781162|r 34 FLERLEHDSSICAMHADS---------FGPGRF-VRAAVLLREQGMHDLSLSFLC-AEGKRIVGSVRMTPISIEKITGHL 102 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~---------fgpgr~-~~~~~~lR~~~~~~~~lslVA-~~~~~IVG~i~~~pv~~~~~~~~~ 102 (192) |.||.-+.+|.++|.++= |=+..+ .+.....++ .-+...+ +|| +++|+++|.+.+.. . . T Consensus 2 ~~IR~~~~~D~~~l~~IW~~s~~~aH~Fl~~~~~~~~~~~v~~-ylp~~~~-~v~~~~~~~~~GFi~l~~-------~-~ 71 (146) T PRK10514 2 ISIRRSRHEEGERLVAIWRRSVDATHDFLSAEDRAEIEELVRS-FLPEAPL-WVAVDERDQPVGFMLLSG-------G-H 71 (146) T ss_pred CEEECCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-HHHHCCE-EEEEECCCCEEEEEEECC-------C-E T ss_conf 2524498566999999999999988614999999999999998-7524859-999966995699999788-------9-8 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE---ECHHHHHHCCCEECCCCCEE Q ss_conf 97678851113899589999999999961388479984---08657874294874636536 Q gi|254781162|r 103 LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV---GDIAYYSKLGFQAVPWKSLI 160 (192) Q Consensus 103 LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~---g~~~yY~rfGF~~~~~~~l~ 160 (192) |+.|.|+|++||+|||++||+++++.... ...-+.. .+..||+|.||+.+...... T Consensus 72 i~~LFV~p~~~g~GiG~~Ll~~a~~~~~~--l~L~V~~~N~~A~~FY~~~GF~~~gr~~~d 130 (146) T PRK10514 72 MEALFVDPDVRGCGVGRMLVEHALTLHPE--LTTDVNEQNEQAVGFYKKMGFKVTGRSEVD 130 (146) T ss_pred EEEEEECHHHCCCCHHHHHHHHHHHHCCC--CEEEEECCCHHHHHHHHHCCCEEECCCCCC T ss_conf 87899878773999799999999873854--098500158689999997799897135679 No 25 >COG2153 ElaA Predicted acyltransferase [General function prediction only] Probab=99.19 E-value=1.4e-10 Score=81.87 Aligned_cols=96 Identities=20% Similarity=0.342 Sum_probs=75.1 Q ss_pred EEEEE-CCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH----HHHHHCC Q ss_conf 99998-898999999877611882238997678851113899589999999999961388479984086----5787429 Q gi|254781162|r 76 FLCAE-GKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI----AYYSKLG 150 (192) Q Consensus 76 lVA~~-~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~----~yY~rfG 150 (192) +..++ +|+++++.++-|....... +.+|.|+|.|+.||+|+|++||+++|+.+....-..-+.+++. +||.+|| T Consensus 52 l~~~~~~g~LvAyaRLlp~~~~~~~-~~iGRV~v~~~~RG~glG~~Lm~~AL~~~~~~~p~~~v~l~AQahLq~fYa~~G 130 (155) T COG2153 52 LLGWTPDGELVAYARLLPPGAEYEE-VSIGRVIVSPAARGQGLGQQLMEKALETAGREWPDKPVYLGAQAHLQDFYASFG 130 (155) T ss_pred EEEECCCCEEEEEEECCCCCCCCCC-EEEEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHC T ss_conf 9998699969999860777777575-255469977756224545999999999988628999869841787999999709 Q ss_pred CEECCCCCEECCCCCCHHHEEEEEC Q ss_conf 4874636536689999566698435 Q gi|254781162|r 151 FQAVPWKSLILPAPVDPNRVLFLPL 175 (192) Q Consensus 151 F~~~~~~~l~~p~pv~~~~~l~l~L 175 (192) |.++.. ..+..++| .-=|.++. T Consensus 131 Fv~~~e--~yledGIp-Hv~M~r~~ 152 (155) T COG2153 131 FVRVGE--EYLEDGIP-HVGMIREV 152 (155) T ss_pred CEECCC--HHHCCCCC-CHHHHHCC T ss_conf 677673--22137997-42344045 No 26 >COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.03 E-value=1.2e-08 Score=70.42 Aligned_cols=126 Identities=20% Similarity=0.234 Sum_probs=84.2 Q ss_pred EEEEEECCHHHHHHHHHH---HCC--CCCCCCE--EEEEE----CCCCCCCCEEEEEEEC-CEEEEEEEEEEEEEC-CCC Q ss_conf 034420798999999998---538--9872100--13110----3888876078999988-989999998776118-822 Q gi|254781162|r 33 FFLERLEHDSSICAMHAD---SFG--PGRFVRA--AVLLR----EQGMHDLSLSFLCAEG-KRIVGSVRMTPISIE-KIT 99 (192) Q Consensus 33 ~~~~R~e~~~di~ai~~~---~fg--pgr~~~~--~~~lR----~~~~~~~~lslVA~~~-~~IVG~i~~~pv~~~-~~~ 99 (192) .|.||+-+.+|+++|.+. ... ...+.+. ....| .....+--.-+|++++ |+++|.+.+.+..-. ... T Consensus 1 ~~~ir~~~~~Dl~~I~~IY~~~v~~~~a~~e~~~~~~~~~~~~~~~~~~~g~p~~V~~~~~g~v~G~a~~~~fr~r~ay~ 80 (169) T COG1247 1 EMEIRPATAADLEAILEIYNGAVENTAATFEEDPVSLEERAAWFSGRTRDGYPVVVAEEEDGKVLGYASAGPFRERPAYR 80 (169) T ss_pred CCEEECCHHHHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHCCCCCCEEEEEECCCCEEEEEEEEEECCCCCCCC T ss_conf 91783170776799999987765304479851699989999999833668854999974798099999851004755655 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE-EE-----CHHHHHHCCCEECCCCC Q ss_conf 3899767885111389958999999999996138847998-40-----86578742948746365 Q gi|254781162|r 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL-VG-----DIAYYSKLGFQAVPWKS 158 (192) Q Consensus 100 ~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l-~g-----~~~yY~rfGF~~~~~~~ 158 (192) ...--+++|+|++||+|+|++|++++++.++++|+..++- +. ...+.+++||+.+.... T Consensus 81 ~tve~SiYv~~~~~g~GiG~~Ll~~Li~~~~~~g~~~lva~I~~~n~aSi~lh~~~GF~~~G~~~ 145 (169) T COG1247 81 HTVELSIYLDPAARGKGLGKKLLQALITEARALGVRELVAGIESDNLASIALHEKLGFEEVGTFP 145 (169) T ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEEECCCCCHHHHHHHHCCCEEECCCC T ss_conf 38999999985102466789999999999885884899999737873769999977977736345 No 27 >KOG3397 consensus Probab=98.97 E-value=2.6e-09 Score=74.43 Aligned_cols=117 Identities=16% Similarity=0.245 Sum_probs=81.2 Q ss_pred EEECCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEE--EEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCC Q ss_conf 420798999999998538987210013110388887607899--998898999999877611882238997678851113 Q gi|254781162|r 36 ERLEHDSSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFL--CAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQ 113 (192) Q Consensus 36 ~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslV--A~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~r 113 (192) -|+|--.|-.++-|...--.+.+ ..+--+..-....+|++ -|...++|||.+++++. +...+++...|.|+..+| T Consensus 20 ~rPELlk~~~~LIN~eWPRS~Ts--R~hSL~~ScDs~P~sL~Ll~E~~~~VigH~rLS~i~-n~~~al~VEsVVV~k~~R 96 (225) T KOG3397 20 DRPELLKESMTLINSEWPRSDTS--REHSLKKSCDSPPMSLLLLNEENDEVLGHSRLSHLP-NRDHALWVESVVVKKDQR 96 (225) T ss_pred CCHHHHHHHHHHHHCCCCCCHHH--HHHHHHCCCCCCCEEEEEECCCCCCEEEEECCCCCC-CCCCEEEEEEEEEEHHHC T ss_conf 66789999999874438752257--676665135777615555336652001110014567-888516888888744331 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEEEC--HHHHHHCCCEECC Q ss_conf 89958999999999996138847998408--6578742948746 Q gi|254781162|r 114 NKGIGRKLISMSVDAAEKKGSQVIVLVGD--IAYYSKLGFQAVP 155 (192) Q Consensus 114 g~GiG~~Li~~~l~~a~~~g~~~v~l~g~--~~yY~rfGF~~~~ 155 (192) |+|.|+.||+-+.+++|..|+..+.|..+ ..||+++||+... T Consensus 97 G~GFGk~lMk~~E~~~R~~gf~~~yLsT~DQ~~FYe~lGYe~c~ 140 (225) T KOG3397 97 GLGFGKFLMKSTEKWMREKGFNEAYLSTDDQCRFYESLGYEKCD 140 (225) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHEEEECCCCCHHHHHHCCCCCC T ss_conf 55388999999999999853321464112310166651410248 No 28 >KOG3216 consensus Probab=98.94 E-value=2.2e-08 Score=69.02 Aligned_cols=123 Identities=21% Similarity=0.185 Sum_probs=88.5 Q ss_pred CEEEEEECCHHHHHHHHHHH-----CC----CCCCCCEEEEEEC--CCCCCCCEEEEEEE---CCEEEEEEEEEEEEE-- Q ss_conf 20344207989999999985-----38----9872100131103--88887607899998---898999999877611-- Q gi|254781162|r 32 AFFLERLEHDSSICAMHADS-----FG----PGRFVRAAVLLRE--QGMHDLSLSFLCAE---GKRIVGSVRMTPISI-- 95 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~~~~-----fg----pgr~~~~~~~lR~--~~~~~~~lslVA~~---~~~IVG~i~~~pv~~-- 95 (192) .++.||+-++.|.+.++.+. |- |-.-++.. ++. -.++.--.++|+.. ++.++|.+.|.+... T Consensus 2 ~~~~IR~at~~D~~~i~rLikela~Fek~~~~v~~te~~--l~~~~F~d~~~~~~~v~~ie~~~~~~aGf~~yf~~ystW 79 (163) T KOG3216 2 DNIRIRLATPKDCEDILRLIKELAEFEKLEDQVEATEEN--LARDGFIDPPFKHWLVAAIETSGEVVAGFALYFNNYSTW 79 (163) T ss_pred CCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCHHHCHHH--HHHHHCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCC T ss_conf 845999648435789999999998788743612203356--666404688720899999824898146776551156630 Q ss_pred CCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE------ECHHHHHHCCCEECCC Q ss_conf 882238997678851113899589999999999961388479984------0865787429487463 Q gi|254781162|r 96 EKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV------GDIAYYSKLGFQAVPW 156 (192) Q Consensus 96 ~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~------g~~~yY~rfGF~~~~~ 156 (192) .+....-|..++|.|+|||+|+|+.|++..-+.|.+.|+..+-.+ -++.+|++.|.+.... T Consensus 80 ~~k~~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A~~~G~~rv~w~vldwN~rAi~lY~k~gaq~l~~ 146 (163) T KOG3216 80 LGKQGIYLEDLYVREQYRGKGIGSKLLKFVAEEADKLGTPRVEWVVLDWNHRAILLYEKVGAQDLKE 146 (163) T ss_pred CCCCEEEEEEEEECCHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHCCCCCCE T ss_conf 0344078876472532203576899999999999973997079998124616899999857334340 No 29 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=98.88 E-value=1.4e-09 Score=75.94 Aligned_cols=122 Identities=17% Similarity=0.224 Sum_probs=92.3 Q ss_pred EEEEEECCHHHHHHHHHHHCCCCCCCCEEEEEECC---CCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 03442079899999999853898721001311038---888760789999889899999987761188223899767885 Q gi|254781162|r 33 FFLERLEHDSSICAMHADSFGPGRFVRAAVLLREQ---GMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVH 109 (192) Q Consensus 33 ~~~~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~---~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~ 109 (192) +-.+|.-+-.||.-|.++.- | -+.-.-.+-|+| .+.+..+| |.+.||-|+||..+.|.. ..+.-.|+.|+|+ T Consensus 291 ~e~iR~At~~DvGGi~~LI~-P-LEeqGiLv~RsRe~LE~~I~~FS-v~e~dG~~~GCAALypya--~~~~GE~AClAv~ 365 (439) T TIGR01890 291 FESIREATIDDVGGILELIR-P-LEEQGILVRRSRELLEREIEEFS-VIEHDGLIIGCAALYPYA--EEDVGEMACLAVS 365 (439) T ss_pred HHHHHHHCCCCCCHHHHHCC-C-HHHCCEECCHHHHHHHHHHHHCE-EEECCCCCHHHHHCCCCC--CCCCCCEEEEECC T ss_conf 89875311244100887435-1-34557310044787777530061-440176112121258876--3223301211217 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC--HHHHHHCCCEECCCCCE Q ss_conf 111389958999999999996138847998408--65787429487463653 Q gi|254781162|r 110 PLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD--IAYYSKLGFQAVPWKSL 159 (192) Q Consensus 110 P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~--~~yY~rfGF~~~~~~~l 159 (192) |+||+-|.|..|+.+..++||+.|...++++.. ..||.+.||..+....| T Consensus 366 P~~~~GG~G~~LL~h~~~~Ar~~Gl~~LFvLtT~t~hWF~erGF~~a~vd~L 417 (439) T TIGR01890 366 PEYRDGGRGERLLKHIEDRARQMGLSRLFVLTTRTEHWFRERGFQEASVDEL 417 (439) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCHHHHCEECCCCHHHHHHCCCCCCCHHCC T ss_conf 6788888717899999998642251352000256134786548972664215 No 30 >pfam08445 FR47 FR47-like protein. The members of this family are similar to the C-terminal region of the D. melanogaster hypothetical protein FR47. This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. Probab=98.88 E-value=1.4e-08 Score=70.08 Aligned_cols=74 Identities=20% Similarity=0.371 Sum_probs=57.9 Q ss_pred EEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE--EC---HHHHHHCC Q ss_conf 99998898999999877611882238997678851113899589999999999961388479984--08---65787429 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV--GD---IAYYSKLG 150 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~--g~---~~yY~rfG 150 (192) +.-.+.|++++.+..++. + .|+.+.|+|+|||+|+|+.|+++..+.+.++|......+ .+ ..+|+|+| T Consensus 4 lf~~~~~el~aW~l~~~~---G----~i~~l~v~Pe~rgrG~G~~L~~~~~~~~~~~G~~~~l~V~~~N~~a~~lYeklG 76 (86) T pfam08445 4 IYRGDTGELAAWCLRLPG---G----ELGALQTLPEHRRRGLGSRLVAALARGIAERGITPFAVVVAGNTPSRRLYEKLG 76 (86) T ss_pred EEECCCCCEEEEEEECCC---C----CEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCC T ss_conf 787899989999998899---7----689999998983698899999999999997699469999879868999999869 Q ss_pred CEECCC Q ss_conf 487463 Q gi|254781162|r 151 FQAVPW 156 (192) Q Consensus 151 F~~~~~ 156 (192) |+.+.. T Consensus 77 F~~v~~ 82 (86) T pfam08445 77 FRKIDE 82 (86) T ss_pred CEEEEE T ss_conf 979146 No 31 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=98.80 E-value=1.3e-07 Score=64.47 Aligned_cols=133 Identities=26% Similarity=0.339 Sum_probs=80.8 Q ss_pred CHHHHHHHHHHHCC-CCCC--CCEE-EEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCC Q ss_conf 98999999998538-9872--1001-311038888760789999889899999987761188223899767885111389 Q gi|254781162|r 40 HDSSICAMHADSFG-PGRF--VRAA-VLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNK 115 (192) Q Consensus 40 ~~~di~ai~~~~fg-pgr~--~~~~-~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~ 115 (192) ...++.+|-+.+-. -|.. +|+. ..+|.... ...-.+++.++++|||+..+.+....... ..-++|+|+||++ T Consensus 10 ~~~~v~~l~~~a~~~DGv~plsE~~ll~Lr~~~~-~~~~hl~~~~~~~~vGya~l~~~~~~~~~---~aElvVhP~~R~r 85 (292) T TIGR03448 10 LRRDVRELLAAATAVDGVAPVSEQVLRGLREPGA-GHTRHLVAVDSDPIVGYANLVPARGTDPA---MAELVVHPAHRRR 85 (292) T ss_pred HHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCC-CCCEEEEEECCCEEEEEEEECCCCCCCCC---EEEEEECCCCCCC T ss_conf 9999999999999842999755799964436888-86017999449868999996257888776---4789989000379 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEEC----HHHHHHCCCEECCCC--------CEECCCCCCHHHEEEEECCCC Q ss_conf 958999999999996138847998408----657874294874636--------536689999566698435778 Q gi|254781162|r 116 GIGRKLISMSVDAAEKKGSQVIVLVGD----IAYYSKLGFQAVPWK--------SLILPAPVDPNRVLFLPLVQN 178 (192) Q Consensus 116 GiG~~Li~~~l~~a~~~g~~~v~l~g~----~~yY~rfGF~~~~~~--------~l~~p~pv~~~~~l~l~L~~~ 178 (192) |||++|++++++.+. +...+...|+ ..+.+++||+++... .+.+|.|..|+.+-..+..++ T Consensus 86 GiG~aLl~a~~~~~~--~~l~~WahG~~p~A~alA~~~G~~~~R~l~~Mr~~l~~~~~p~p~~p~Gv~lr~f~~~ 158 (292) T TIGR03448 86 GIGRALIRALLAKGG--GRLRVWAHGDLPAARALASRLGLVPTRELLQMRRPLRDLELPEPQVPDGVTVRAYVGA 158 (292) T ss_pred CHHHHHHHHHHHHCC--CCCEEEECCCCHHHHHHHHHCCCEEEEHHHHHHCCCCCCCCCCCCCCCCEEEECCCCC T ss_conf 879999999997438--7732886599889999999779900132344305765567788889998688436787 No 32 >COG4552 Eis Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only] Probab=98.67 E-value=3.7e-08 Score=67.65 Aligned_cols=120 Identities=18% Similarity=0.224 Sum_probs=86.4 Q ss_pred CCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEE--CC--CCEEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 38987210013110388887607899998898999999877611--88--223899767885111389958999999999 Q gi|254781162|r 52 FGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISI--EK--ITGHLLGPIVVHPLYQNKGIGRKLISMSVD 127 (192) Q Consensus 52 fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~--~~--~~~~~LgplaV~P~~rg~GiG~~Li~~~l~ 127 (192) +-|--..+..++.+.-+-+ =++|...+.++++.+...|... ++ .+.-+++-|++.|+|||+|.-++|+.+.+. T Consensus 21 ~k~~~~~~~~~f~kil~~~---n~~vi~~nqkl~s~L~i~~f~~~f~~q~l~t~GIa~Vas~P~~R~~G~~~~Ll~~sLr 97 (389) T COG4552 21 WKPLVPTDGAVFVKILAEP---NSYVIYMNQKLASRLHIPPFIFWFGNQVLPTAGIAGVASAPTYRRRGALRALLAHSLR 97 (389) T ss_pred CCCCCCCHHHHHHHHCCCC---CCEEEEEHHHHHHCCCCCCHHEEECCEEEECCCEEEEEECHHHCCCCHHHHHHHHHHH T ss_conf 0755401135664330577---6068863344531033564120167736311534778726523047189999999999 Q ss_pred HHHHCCCCEEEEEE-CHHHHHHCCCEECCCCCEECCCCCCHHHEEEEEC Q ss_conf 99613884799840-8657874294874636536689999566698435 Q gi|254781162|r 128 AAEKKGSQVIVLVG-DIAYYSKLGFQAVPWKSLILPAPVDPNRVLFLPL 175 (192) Q Consensus 128 ~a~~~g~~~v~l~g-~~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L 175 (192) ..+.+|+...+|.- .+.+|+||||+.+... ...+..++-..+.+.+- T Consensus 98 e~~~kG~p~s~L~P~s~~iYrKfGye~asn~-~~~~~d~~~~~~~a~~p 145 (389) T COG4552 98 EIARKGYPVSALHPFSGGIYRKFGYEYASNY-HELTFDVRKARFHADEP 145 (389) T ss_pred HHHHCCCEEEEECCCCHHHHHHCCCCCCCEE-EEEECCHHHHHHHCCCC T ss_conf 9987697069963675445765063213348-99505548887612587 No 33 >KOG2488 consensus Probab=98.49 E-value=8e-07 Score=59.83 Aligned_cols=81 Identities=20% Similarity=0.271 Sum_probs=63.7 Q ss_pred EEEEECC-EEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE------EECHHHHHH Q ss_conf 9999889-899999987761188223899767885111389958999999999996138847998------408657874 Q gi|254781162|r 76 FLCAEGK-RIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL------VGDIAYYSK 148 (192) Q Consensus 76 lVA~~~~-~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l------~g~~~yY~r 148 (192) ++|.+++ .+||+.+|-=...-+.+.+-+--|-|.+.|||+|||+.|++.+...+.......|.| .+...||.+ T Consensus 95 i~a~~~~~~~vgf~~Frf~vd~g~~vlYcyEvqv~~~yR~kGiGk~LL~~l~~~a~~~~~~kVmLTVf~~N~~al~Fy~~ 174 (202) T KOG2488 95 ICAWNNKSKLVGFTMFRFTVDTGDPVLYCYEVQVASAYRGKGIGKFLLDTLEKLADSRHMRKVMLTVFSENIRALGFYHR 174 (202) T ss_pred EEEECCCCCEEEEEEEEEECCCCCEEEEEEEEEEHHHHHCCCHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHHHHHHHH T ss_conf 99986888624688999870249869999987645666526858999999999888887665114443146504778997 Q ss_pred CCCEECCC Q ss_conf 29487463 Q gi|254781162|r 149 LGFQAVPW 156 (192) Q Consensus 149 fGF~~~~~ 156 (192) +||.+.+. T Consensus 175 ~gf~~~~~ 182 (202) T KOG2488 175 LGFVVDEE 182 (202) T ss_pred CCCCCCCC T ss_conf 59624788 No 34 >KOG4144 consensus Probab=98.46 E-value=6.8e-08 Score=66.11 Aligned_cols=124 Identities=15% Similarity=0.179 Sum_probs=76.2 Q ss_pred EEEECCHHHHHHHH---HHHCCCCCCCCEEEEEECCCCCCCCEE--EEEE-------ECCEEEEEEEEEEEEE------- Q ss_conf 44207989999999---985389872100131103888876078--9999-------8898999999877611------- Q gi|254781162|r 35 LERLEHDSSICAMH---ADSFGPGRFVRAAVLLREQGMHDLSLS--FLCA-------EGKRIVGSVRMTPISI------- 95 (192) Q Consensus 35 ~~R~e~~~di~ai~---~~~fgpgr~~~~~~~lR~~~~~~~~ls--lVA~-------~~~~IVG~i~~~pv~~------- 95 (192) -||+--.+|.+.++ +..|-+ ...-+-.+.|++.-..++++ +.++ ..+.+||||.-|.... T Consensus 13 ~irp~i~e~~q~~~~Lea~~FPe-~erasfeii~~r~i~~pevc~glf~~~~h~~~~~~~tLIghIigs~~~~E~lt~ES 91 (190) T KOG4144 13 RIRPGIPESCQRRHTLEASEFPE-DERASFEIIRERFISVPEVCPGLFDEIRHFLTLCEGTLIGHIIGSLWDKERLTQES 91 (190) T ss_pred CCCCCCHHHHHHHHCCCCCCCCH-HHHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHCCCCCEEHHHCCCCCCHHHHHHH T ss_conf 57988868999874265133985-67778999999983453326113366776454411301000000567601201877 Q ss_pred ------CCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCC-CCEEEEEE-CH--HHHHHCCCEECCCCCEE Q ss_conf ------88223899767885111389958999999999996138-84799840-86--57874294874636536 Q gi|254781162|r 96 ------EKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKG-SQVIVLVG-DI--AYYSKLGFQAVPWKSLI 160 (192) Q Consensus 96 ------~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g-~~~v~l~g-~~--~yY~rfGF~~~~~~~l~ 160 (192) ++.. ..+-.|+|+|+||.+|.|..|+..-++..-.+- ...++|.. +| +||+||||+.+....++ T Consensus 92 m~kh~s~g~n-i~iHsl~Ihpa~rk~g~a~~Ll~~ylq~l~~q~i~~r~~Li~h~pLvPFYEr~gFk~vgp~~~~ 165 (190) T KOG4144 92 MTKHRSGGHN-IHIHSLAIHPAFRKQGRAPILLWRYLQHLGSQPIVRRAALICHDPLVPFYERFGFKAVGPCAIT 165 (190) T ss_pred HHHHHCCCCC-EEEEEEEECHHHHHCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHCCCEEECCCCCC T ss_conf 7654058840-3678887459898668631689999998626753121013345775316675074032464222 No 35 >COG3393 Predicted acetyltransferase [General function prediction only] Probab=98.41 E-value=1.4e-06 Score=58.47 Aligned_cols=80 Identities=18% Similarity=0.280 Sum_probs=60.4 Q ss_pred EEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE--ECH---HHHHHCCC Q ss_conf 9998898999999877611882238997678851113899589999999999961388479984--086---57874294 Q gi|254781162|r 77 LCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV--GDI---AYYSKLGF 151 (192) Q Consensus 77 VA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~--g~~---~yY~rfGF 151 (192) --+++|+||...... ..+.....++-++++|+|||+|++++|+...-...-+.|...++.+ -+| .-|+|.|| T Consensus 181 f~~~d~~iVa~A~t~---a~~~~~~~I~gV~T~peyR~kGyAt~lva~L~~~lL~eGk~~~L~~~~~N~~A~~iY~riGF 257 (268) T COG3393 181 FLEGDGKIVAKAETA---AENPAYAQINGVYTHPEYRGKGYATALVATLAAKLLAEGKIPCLFVNSDNPVARRIYQRIGF 257 (268) T ss_pred EECCCCCEEEEEECC---CCCCCCEEEEEEECCHHHCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCC T ss_conf 982688389862005---66886168987871978726548999999999999857981489983478788999998097 Q ss_pred EECCCCCE Q ss_conf 87463653 Q gi|254781162|r 152 QAVPWKSL 159 (192) Q Consensus 152 ~~~~~~~l 159 (192) +....... T Consensus 258 ~~~g~~~~ 265 (268) T COG3393 258 REIGEFRE 265 (268) T ss_pred EECCEEEE T ss_conf 14241789 No 36 >KOG3396 consensus Probab=98.32 E-value=3.8e-06 Score=55.86 Aligned_cols=81 Identities=25% Similarity=0.441 Sum_probs=63.0 Q ss_pred EEEEEEE--CCEEEEEEEEEEE--EEC--CCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH---H Q ss_conf 7899998--8989999998776--118--82238997678851113899589999999999961388479984086---5 Q gi|254781162|r 74 LSFLCAE--GKRIVGSVRMTPI--SIE--KITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI---A 144 (192) Q Consensus 74 lslVA~~--~~~IVG~i~~~pv--~~~--~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~---~ 144 (192) .-.|.++ .++|||+..+.-- .+. +..+. +..++|+++|||+++|+.|+...++.++..|+--++|-.+| . T Consensus 54 ~i~Vied~~s~~vigtatL~IE~KfIh~~g~rGh-iEDVVV~~~~rgk~LGkllv~~Lv~l~k~lgcYKi~LdC~~~nv~ 132 (150) T KOG3396 54 YIVVIEDKESEKVIGTATLFIERKFIHGCGSRGH-IEDVVVDSEYRGKQLGKLLVETLVDLAKSLGCYKIILDCDPKNVK 132 (150) T ss_pred EEEEEEECCCCEEEEEEEEEEEHHHHHCCCCCCC-EEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 9999996786727888777776666650444575-147986754515678699999999998742827999963621014 Q ss_pred HHHHCCCEECC Q ss_conf 78742948746 Q gi|254781162|r 145 YYSKLGFQAVP 155 (192) Q Consensus 145 yY~rfGF~~~~ 155 (192) ||+++||...+ T Consensus 133 FYeKcG~s~~~ 143 (150) T KOG3396 133 FYEKCGYSNAG 143 (150) T ss_pred HHHHCCCCCCC T ss_conf 89973965542 No 37 >TIGR01211 ELP3 histone acetyltransferase, ELP3 family; InterPro: IPR005910 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones . Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. The yeast GCN5 (yGCN5) transcriptional coactivator functions as a histone acetyltransferase (HAT) to promote transcriptional activation. The crystal structure of the yeast histone acetyltransferase Hat1-acetyl coenzyme A (AcCoA) shows that Hat1 has an elongated, curved structure, and the AcCoA molecule is bound in a cleft on the concave surface of the protein, marking the active site of the enzyme. A channel of variable width and depth that runs across the protein is probably the binding site for the histone substrate . The central protein core associated with AcCoA binding that appears to be structurally conserved among a superfamily of N-acetyltransferases, including yeast histone acetyltransferase 1 and Serratia marcescens aminoglycoside 3-N-acetyltransferase . The Saccharomyces cerevisiae member YPL086C has been characterised in vitro as an N-terminal acetyltransferase (2.3.1.48 from EC) for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. In vivo, ELP3 gene deletion confers typical ELP phenotypes such as slow growth adaptation, slow gene activation, and temperature sensitivity. This suggests a role for the proteins as novel, tightly RNAPII-associated histone acetyltransferases in transcription of DNA packaged in chromatin .. Probab=98.26 E-value=1.7e-06 Score=57.93 Aligned_cols=98 Identities=28% Similarity=0.516 Sum_probs=70.2 Q ss_pred CEEEEEECC----CCCCCCEEEEEEECCEEEEEEEEE----E-EE---ECC-CCE-----EEEE---EEEE---CCCCCC Q ss_conf 001311038----888760789999889899999987----7-61---188-223-----8997---6788---511138 Q gi|254781162|r 59 RAAVLLREQ----GMHDLSLSFLCAEGKRIVGSVRMT----P-IS---IEK-ITG-----HLLG---PIVV---HPLYQN 114 (192) Q Consensus 59 ~~~~~lR~~----~~~~~~lslVA~~~~~IVG~i~~~----p-v~---~~~-~~~-----~~Lg---plaV---~P~~rg 114 (192) ++...+|+. .-....|||.-.++|.|||.+++- | .. .+. .++ +..| ||.= .-+||. T Consensus 441 e~v~L~ve~Y~A~GG~E~FlS~Ed~~~D~LiGFlRLR~P~e~ya~r~e~d~~~TAlvRELHVYG~~vP~g~D~~~~~~QH 520 (573) T TIGR01211 441 ENVELIVEEYAASGGREIFLSYEDPKNDILIGFLRLRFPSEPYAHRKEVDAQRTALVRELHVYGSEVPIGEDRGEDEWQH 520 (573) T ss_pred CCEEEEEEEEECCCCEEEEEEEECCCCCEEEEEEECCCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 01377763101488568987731100040021000068887213455430576103210321057212567787776444 Q ss_pred CCHHHHHHHHHHHHHHHC-CCCEEEEE---ECHHHHHH-CCCEECCC Q ss_conf 995899999999999613-88479984---08657874-29487463 Q gi|254781162|r 115 KGIGRKLISMSVDAAEKK-GSQVIVLV---GDIAYYSK-LGFQAVPW 156 (192) Q Consensus 115 ~GiG~~Li~~~l~~a~~~-g~~~v~l~---g~~~yY~r-fGF~~~~~ 156 (192) ||+|++||+.|...|++. |...|.|+ |.=+||.| |||+.... T Consensus 521 rGyG~~Ll~EAEriA~EefG~~KIlVISGiGVReYYrKqlGY~~~GP 567 (573) T TIGR01211 521 RGYGRRLLEEAERIAKEEFGLEKILVISGIGVREYYRKQLGYEKDGP 567 (573) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHCCCCCCCC T ss_conf 42557899999998563069701678514101000022326321377 No 38 >PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional Probab=98.24 E-value=6.8e-05 Score=48.53 Aligned_cols=121 Identities=16% Similarity=0.183 Sum_probs=80.9 Q ss_pred EEEEECCHHHHHHHHHHH------C---CCCCCC----CEEEEE-----ECCCCCCCCEEEEEE--ECCEEEEEEEEEEE Q ss_conf 344207989999999985------3---898721----001311-----038888760789999--88989999998776 Q gi|254781162|r 34 FLERLEHDSSICAMHADS------F---GPGRFV----RAAVLL-----REQGMHDLSLSFLCA--EGKRIVGSVRMTPI 93 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~------f---gpgr~~----~~~~~l-----R~~~~~~~~lslVA~--~~~~IVG~i~~~pv 93 (192) +.+|+=+++|.+++.+.. + -|.+.. ...... .+.......+.|+.. ++++|||.|.++.+ T Consensus 18 L~LR~~~~~Da~~l~~~~~~~r~~l~~w~p~~d~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~k~~~~liG~i~l~~i 97 (194) T PRK10809 18 LVVRLVHERDAWRLADYYAENRHFLKPWEPVRDESHCYPSGWQARLGMINEFHKQGSAFYFGLLDPEEKEIIGVANFSNV 97 (194) T ss_pred EEEECCCHHHHHHHHHHHHHCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEEEEEEC T ss_conf 89978998999999999971875554847667702158899999999999998669817999999899979999998742 Q ss_pred EECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHH-HCCCCEEEEEECH------HHHHHCCCEECC Q ss_conf 1188223899767885111389958999999999996-1388479984086------578742948746 Q gi|254781162|r 94 SIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAE-KKGSQVIVLVGDI------AYYSKLGFQAVP 155 (192) Q Consensus 94 ~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~-~~g~~~v~l~g~~------~yY~rfGF~~~~ 155 (192) ..+......+| -.++|+|||+|+++.-++.+++.+- ..+...|...-+| ...+|.||+... T Consensus 98 ~~~~~~~~eIG-Y~l~~~~~GkGyatEA~~ali~~aF~~l~lhRIea~~~~~N~~S~rvleklGF~~EG 165 (194) T PRK10809 98 VRGSFHACYLG-YSIGQKWQGQGLMFEALTAAIRYMQRTQHIHRIMANYMPHNKRSGDLLARLGFEKEG 165 (194) T ss_pred CCCCCCEEEEE-EEECHHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEEHHHCCHHHHHHHHHCCCEEEE T ss_conf 57765168899-999888948888999999999999873795389564322287753999987996888 No 39 >KOG3235 consensus Probab=98.16 E-value=8.2e-06 Score=53.92 Aligned_cols=120 Identities=21% Similarity=0.327 Sum_probs=74.5 Q ss_pred EEEEECCHHHHHHHHHHHC--CCCCCCCEEEEEECCCCCCCCEEEEEE-ECCEEEEEEEEEEEEE--CCCC--EEEEEEE Q ss_conf 3442079899999999853--898721001311038888760789999-8898999999877611--8822--3899767 Q gi|254781162|r 34 FLERLEHDSSICAMHADSF--GPGRFVRAAVLLREQGMHDLSLSFLCA-EGKRIVGSVRMTPISI--EKIT--GHLLGPI 106 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~f--gpgr~~~~~~~lR~~~~~~~~lslVA~-~~~~IVG~i~~~pv~~--~~~~--~~~Lgpl 106 (192) |-||.-+..|+-...+... -|..+--..+. -....-+.+||||+ .+|+|||++.- .+.. +..+ +. +-++ T Consensus 2 m~iR~ar~~DL~~mQ~~Nl~~lpENyqmkyyl--yh~lswp~lSyVA~D~~gkiVGYvlA-kmee~p~~~~~hGh-ItSl 77 (193) T KOG3235 2 MNIRRARPDDLLEMQHCNLLNLPENYQMKYYL--YHGLSWPQLSYVAEDENGKIVGYVLA-KMEEDPDDEPPHGH-ITSL 77 (193) T ss_pred CCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHH--HHHCCCCCCEEEEECCCCCEEEEEEE-EHHHCCCCCCCCCE-EEEE T ss_conf 56345888888776643433682777478899--86226655347877589968987601-12306567898870-6876 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHHH-CCCCEEEEE---E---CHHHHH-HCCCEECCCC Q ss_conf 8851113899589999999999961-388479984---0---865787-4294874636 Q gi|254781162|r 107 VVHPLYQNKGIGRKLISMSVDAAEK-KGSQVIVLV---G---DIAYYS-KLGFQAVPWK 157 (192) Q Consensus 107 aV~P~~rg~GiG~~Li~~~l~~a~~-~g~~~v~l~---g---~~~yY~-rfGF~~~~~~ 157 (192) +|.-+||+.||+++||..+..+..+ .+...+.|. + +..+|. .+||+..... T Consensus 78 aV~rs~RrlGla~kLm~qa~rAm~E~~~A~yvsLHVR~SNraAl~LY~~tl~F~v~eve 136 (193) T KOG3235 78 AVKRSYRRLGLAQKLMNQASRAMVEVYEAKYVSLHVRKSNRAALHLYKNTLGFVVCEVE 136 (193) T ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEECCCHHHHHHHHHCCCEEEEECC T ss_conf 53246887618999999999999986353388886411068888766613141886136 No 40 >COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only] Probab=98.13 E-value=7.3e-06 Score=54.21 Aligned_cols=80 Identities=25% Similarity=0.418 Sum_probs=58.2 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEE-EEEEC----HHHHHHCCCEECCCCCEECCCCCCHHHEEEEE Q ss_conf 38997678851113899589999999999961388479-98408----65787429487463653668999956669843 Q gi|254781162|r 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVI-VLVGD----IAYYSKLGFQAVPWKSLILPAPVDPNRVLFLP 174 (192) Q Consensus 100 ~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v-~l~g~----~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~ 174 (192) ++=+-.+||||++|++|||++|++..++.++ ++++.+ +-+|. ..|+.|-||.++.-....-+.--.-..+|.+| T Consensus 531 G~RIvRIAvhPe~q~~GiGsrlL~~l~~~a~-~~~DwlgvsFG~t~~L~rFW~rnGF~pVhls~~rn~~SGeys~i~lkp 609 (758) T COG1444 531 GWRIVRIAVHPELQRMGIGSRLLALLIEEAR-KGLDWLGVSFGYTEELLRFWLRNGFVPVHLSPTRNASSGEYTAIVLKP 609 (758) T ss_pred CEEEEEEEECHHHHHCCHHHHHHHHHHHHHH-CCCCEEEECCCCCHHHHHHHHHCCEEEEEECCCCCCCCCCEEEEEEEC T ss_conf 1268888708878635888999999999984-589889745678889999999759079873686676778646898854 Q ss_pred CCCCCC Q ss_conf 577810 Q gi|254781162|r 175 LVQNVA 180 (192) Q Consensus 175 L~~~~l 180 (192) |.+.+- T Consensus 610 Ls~~~~ 615 (758) T COG1444 610 LSDAGK 615 (758) T ss_pred CCHHHH T ss_conf 887799 No 41 >PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional Probab=98.10 E-value=4.1e-05 Score=49.80 Aligned_cols=122 Identities=12% Similarity=0.057 Sum_probs=79.5 Q ss_pred CEEEEEECCHHHHHHHHHHH------CCC------CCC----CCEEEE-EECCCCCCCCEEEEEEECCEEEEEEEEEEEE Q ss_conf 20344207989999999985------389------872----100131-1038888760789999889899999987761 Q gi|254781162|r 32 AFFLERLEHDSSICAMHADS------FGP------GRF----VRAAVL-LREQGMHDLSLSFLCAEGKRIVGSVRMTPIS 94 (192) Q Consensus 32 ~~~~~R~e~~~di~ai~~~~------fgp------gr~----~~~~~~-lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~ 94 (192) -.+..|+=+.+|.++++++. +.+ ... ++..+. ..........+.|+...++++||.|.+..+. T Consensus 9 erl~LR~~~~~Da~~L~~li~~nr~~L~~wlpW~~~~~s~~d~~~~i~~~~~~~~~g~~~~f~I~~~~~liG~i~l~~i~ 88 (179) T PRK10151 9 ESLELHAVDESHVTPLHQLVCKNKTWLQQSLNWPQYVQSEEDTRKTVQGNVMLHQRGYAKMFLIFKEDELIGVLSFNRIE 88 (179) T ss_pred CCEEEEECCHHHHHHHHHHHHHCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEEEE T ss_conf 99799808888999999999967898637899999889999999999999999971867489999999899999999970 Q ss_pred ECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHH-HCCCCEEEEEE---C---HHHHHHCCCEECC Q ss_conf 188223899767885111389958999999999996-13884799840---8---6578742948746 Q gi|254781162|r 95 IEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAE-KKGSQVIVLVG---D---IAYYSKLGFQAVP 155 (192) Q Consensus 95 ~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~-~~g~~~v~l~g---~---~~yY~rfGF~~~~ 155 (192) .. .....+| ..+.|+|||+|++++-++..++.+- +.+...+.... + ....+|.||+... T Consensus 89 ~~-~~~aeiG-Ywl~~~~~GkG~~tea~~~li~~~f~~l~l~Ri~~~~~~~N~~S~~v~ek~Gf~~EG 154 (179) T PRK10151 89 PL-NKTAYIG-YWLDESHQGQGIISQSLQALIHHYAQSGELRRFVIKCRVDNPASNQVALRNGFILEG 154 (179) T ss_pred CC-CCEEEEE-EEECHHHCCCCHHHHHHHHHHHHHHHHCCEEEEEEEEECCCHHHHHHHHHCCCEEEE T ss_conf 56-6769999-998877748968999999999999972891799999918998999999988999999 No 42 >TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase; InterPro: IPR012772 This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analogous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.; GO: 0008415 acyltransferase activity, 0019491 ectoine biosynthetic process. Probab=98.08 E-value=1.2e-05 Score=53.02 Aligned_cols=80 Identities=19% Similarity=0.233 Sum_probs=56.2 Q ss_pred EEEEEEE-CCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCE--EEEE-----EC--- Q ss_conf 7899998-89899999987761188223899767885111389958999999999996138847--9984-----08--- Q gi|254781162|r 74 LSFLCAE-GKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQV--IVLV-----GD--- 142 (192) Q Consensus 74 lslVA~~-~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~--v~l~-----g~--- 142 (192) =|.||+. ||+|+|.|.-.- .-+..+.+.+..|||++..||+|++++|+++.+++-....... -.|- ++ T Consensus 40 TsiVae~egg~i~GFV~GY~-~P~~Pd~LFVWQVAV~~~~RG~GLA~rll~all~R~~~~~v~~~I~~~EtTiTPdN~AS 118 (162) T TIGR02406 40 TSIVAEEEGGEIVGFVSGYL-KPDRPDVLFVWQVAVDPKARGKGLASRLLEALLERVALERVRHCIQAVETTITPDNEAS 118 (162) T ss_pred CEEEEECCCCEEEEEEEEEE-CCCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHH T ss_conf 22765379990335886212-67999788898754271314316899999997610333137982104662107666689 Q ss_pred HHHHHHCCCEEC Q ss_conf 657874294874 Q gi|254781162|r 143 IAYYSKLGFQAV 154 (192) Q Consensus 143 ~~yY~rfGF~~~ 154 (192) -.++++++=+.- T Consensus 119 ~aLF~~La~~~~ 130 (162) T TIGR02406 119 RALFKKLARRRG 130 (162) T ss_pred HHHHHHHHHHCC T ss_conf 999998765428 No 43 >COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis] Probab=98.02 E-value=0.00015 Score=46.56 Aligned_cols=86 Identities=21% Similarity=0.230 Sum_probs=61.3 Q ss_pred CCEEEEEEEC--CEEEEEEEEEEEEE-CCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHH-CCCCEEEEEEC----- Q ss_conf 6078999988--98999999877611-882238997678851113899589999999999961-38847998408----- Q gi|254781162|r 72 LSLSFLCAEG--KRIVGSVRMTPISI-EKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEK-KGSQVIVLVGD----- 142 (192) Q Consensus 72 ~~lslVA~~~--~~IVG~i~~~pv~~-~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~-~g~~~v~l~g~----- 142 (192) ..+.++...+ +++||.+.++.... .......+| ....|+|+|+|+|+..++..++.+-. .+...+...-+ T Consensus 65 ~~~~~~~~~~~~~~~iG~~~~~~~~~~~~~~~~~ig-~~l~~~~~g~G~~tea~~~~l~~~f~~~~~~rv~~~~~~~N~~ 143 (187) T COG1670 65 GAFAIELKATGDGELIGVIGLSDIDRAANGDLAEIG-YWLDPEYWGKGYATEALRALLDYAFEELGLHRIEATVDPENEA 143 (187) T ss_pred CEEEEEEECCCCCCEEEEEEEEECCCCCCCCCEEEE-EEECHHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHH T ss_conf 459999802788708999997863465678857999-9999666389269999999999975007965899998889999 Q ss_pred -HHHHHHCCCEECCCCC Q ss_conf -6578742948746365 Q gi|254781162|r 143 -IAYYSKLGFQAVPWKS 158 (192) Q Consensus 143 -~~yY~rfGF~~~~~~~ 158 (192) ...|+|.||+...... T Consensus 144 S~rv~ek~Gf~~~g~~~ 160 (187) T COG1670 144 SIRVYEKLGFRLEGELR 160 (187) T ss_pred HHHHHHHCCCEEECEEE T ss_conf 99999987999925241 No 44 >KOG3234 consensus Probab=98.02 E-value=2e-05 Score=51.65 Aligned_cols=142 Identities=15% Similarity=0.118 Sum_probs=85.3 Q ss_pred EEECCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEE-CCEEEEEEEEEEEEECCCCEEE-EEEEEECCCCC Q ss_conf 420798999999998538987210013110388887607899998-8989999998776118822389-97678851113 Q gi|254781162|r 36 ERLEHDSSICAMHADSFGPGRFVRAAVLLREQGMHDLSLSFLCAE-GKRIVGSVRMTPISIEKITGHL-LGPIVVHPLYQ 113 (192) Q Consensus 36 ~R~e~~~di~ai~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~-~~~IVG~i~~~pv~~~~~~~~~-LgplaV~P~~r 113 (192) +|+=...|+....+..+-|-.++-.--..-.--..-++++.||+. +++|-|+|+- ++.-....+++ ...|+|.|+|| T Consensus 4 ~r~f~~~Dlf~fNninLDpltEt~~~~Fyl~yl~~~pe~~~~a~~p~~~imgyimg-k~Eg~~~~wh~HvTAltVap~~R 82 (173) T KOG3234 4 IRPFTPQDLFKFNNINLDPLTETFPISFYLIYLAIWPEDFIVAEAPTGEIMGYIMG-KVEGKDTEWHGHVTALTVAPDYR 82 (173) T ss_pred CCCCCHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHCHHHHEECCCCCCCEEEEEEE-ECCCCCCCEEEEEEEEEECHHHH T ss_conf 55366788775356214634134660005999875758727311799846899854-20466751366789887354688 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEE---EEC---HHHHHHCCCEECCCC-CEECCCCCCHHHEEEEECCCC Q ss_conf 89958999999999996138847998---408---657874294874636-536689999566698435778 Q gi|254781162|r 114 NKGIGRKLISMSVDAAEKKGSQVIVL---VGD---IAYYSKLGFQAVPWK-SLILPAPVDPNRVLFLPLVQN 178 (192) Q Consensus 114 g~GiG~~Li~~~l~~a~~~g~~~v~l---~g~---~~yY~rfGF~~~~~~-~l~~p~pv~~~~~l~l~L~~~ 178 (192) +.|+|+.||....+-....+.-.+-| .++ +++|+++|+..-.+- +...-||-++.+=|.++|.-. T Consensus 83 rl~la~~lm~~led~~d~~~a~fvDLfVr~sN~iAI~mYkkLGY~~YR~Vi~YY~~g~deda~dMRKalSrD 154 (173) T KOG3234 83 RLGLAAKLMDTLEDVSDVDNAYFVDLFVRVSNQIAIDMYKKLGYSVYRTVIEYYSVGPDEDAYDMRKALSRD 154 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHHHEEEEEEECCCHHHHHHHHHCCCEEEEEEEEEECCCCCCCHHHHHHHHCCC T ss_conf 877999999999998875321231032000654589999852936887654665038873067666555037 No 45 >TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar. Probab=97.99 E-value=4.2e-05 Score=49.75 Aligned_cols=81 Identities=22% Similarity=0.410 Sum_probs=60.7 Q ss_pred CEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH-HHCCCCEEEEE---EC---HHH Q ss_conf 078999988989999998776118822389976788511138995899999999999-61388479984---08---657 Q gi|254781162|r 73 SLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA-EKKGSQVIVLV---GD---IAY 145 (192) Q Consensus 73 ~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a-~~~g~~~v~l~---g~---~~y 145 (192) ..-|+.+.+++.||.|.++.+..... ...+| +...| ++++|+|+.++..+++.| ...+...+.+. .+ ..+ T Consensus 51 ~~~~ii~~~~~~IG~i~l~~id~~~~-~ae~g-i~~~~-~~~~G~g~~~~~~~l~~aF~~l~l~rl~~~v~~~N~~si~l 127 (156) T TIGR03585 51 RRYWIVCQESRPIGVISFTDINLVHK-SAFWG-IYANP-FCKPGVGSVLEEAALEYAFEHLGLHKLSLEVLEFNNKALKL 127 (156) T ss_pred CEEEEEEECCCEEEEEEEEECCCCCC-EEEEE-EEECH-HHCCCHHHHHHHHHHHHHHHHCCEEEEEEEECCCCHHHHHH T ss_conf 38999997992899998874030039-89999-98222-10257799999999999987569269999962568999999 Q ss_pred HHHCCCEECCC Q ss_conf 87429487463 Q gi|254781162|r 146 YSKLGFQAVPW 156 (192) Q Consensus 146 Y~rfGF~~~~~ 156 (192) |+++||+.... T Consensus 128 ~ek~GF~~EG~ 138 (156) T TIGR03585 128 YEKFGFEREGV 138 (156) T ss_pred HHHCCCEEEEE T ss_conf 99789999999 No 46 >KOG3138 consensus Probab=97.98 E-value=1.5e-05 Score=52.46 Aligned_cols=142 Identities=21% Similarity=0.201 Sum_probs=81.3 Q ss_pred EEEEECCHHHHHHH---HHHHCCCCCCCCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCC-------CEEEE Q ss_conf 34420798999999---998538987210013110388887607899998898999999877611882-------23899 Q gi|254781162|r 34 FLERLEHDSSICAM---HADSFGPGRFVRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKI-------TGHLL 103 (192) Q Consensus 34 ~~~R~e~~~di~ai---~~~~fgpgr~~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~-------~~~~L 103 (192) +.+|.-++.++..+ +..+|- -.+...-+..-..+. .+.=.|.-++..||-++.-....... +...+ T Consensus 17 ~~l~~it~~nl~~~~~l~~~~fP-~~y~~kfy~~~~~~~---~~~~~A~~~~~~v~a~~~k~~~~~~~~~r~~~~~~~yi 92 (187) T KOG3138 17 IELRLITPNNLKQLKQLNEDIFP-ISYVDKFYPDVLSNG---DLTQLAYYNEIAVGAVACKLIKFVQNAKRLFGNRVIYI 92 (187) T ss_pred EEECCCCCCHHHHHHHHHCCCCC-CCHHHHHHHHHHHCC---CHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHCCCEEEE T ss_conf 35436783067899887202167-635777778887157---77774111014530013310122456665403321477 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCC-CCEEEEE----E--CHHHHHHCCCEECCCCC-EECCCCCCHHHEEEEEC Q ss_conf 767885111389958999999999996138-8479984----0--86578742948746365-36689999566698435 Q gi|254781162|r 104 GPIVVHPLYQNKGIGRKLISMSVDAAEKKG-SQVIVLV----G--DIAYYSKLGFQAVPWKS-LILPAPVDPNRVLFLPL 175 (192) Q Consensus 104 gplaV~P~~rg~GiG~~Li~~~l~~a~~~g-~~~v~l~----g--~~~yY~rfGF~~~~~~~-l~~p~pv~~~~~l~l~L 175 (192) -.+.|+|.||.+|||+.|+..+.+.+.... +..+.+. + ...||+++||+.+..-. ...+.-..+..+|.++| T Consensus 93 ~~Lgvl~~yR~~gIGs~Ll~~~~~~~~~~~~~~~v~lHv~~~n~~ai~~Y~~~gF~~~~~~~~~y~~~~~~~~~~l~~~~ 172 (187) T KOG3138 93 LSLGVLPRYRNKGIGSKLLEFVKKYCSEAHQCRRVYLHVQAVNESAIEFYEKRGFEIVERLKNYYSILGPPDDSFLRKLL 172 (187) T ss_pred EEECCCHHHHHCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHE T ss_conf 76224387774254878899999887505645517999971797388998834956740254421036672055653220 Q ss_pred CCCC Q ss_conf 7781 Q gi|254781162|r 176 VQNV 179 (192) Q Consensus 176 ~~~~ 179 (192) .-+. T Consensus 173 ~~~~ 176 (187) T KOG3138 173 IHGS 176 (187) T ss_pred ECCC T ss_conf 1488 No 47 >COG3981 Predicted acetyltransferase [General function prediction only] Probab=97.93 E-value=4.5e-05 Score=49.60 Aligned_cols=76 Identities=16% Similarity=0.362 Sum_probs=56.6 Q ss_pred CCCEEEEEEE-CCEEEEEEEEEEEEECCC---CEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHH Q ss_conf 7607899998-898999999877611882---238997678851113899589999999999961388479984086578 Q gi|254781162|r 71 DLSLSFLCAE-GKRIVGSVRMTPISIEKI---TGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIAYY 146 (192) Q Consensus 71 ~~~lslVA~~-~~~IVG~i~~~pv~~~~~---~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~yY 146 (192) .+...+.|.. ++++||.|.+-.. .+.. .+=-+| -+|.|+.||+|+++++++.+++.|++.|..-|+++.|.+.- T Consensus 66 V~~~~y~~v~~d~~ivG~i~lRh~-Ln~~ll~~gGHIG-Y~VrPseR~KGYA~emLkl~L~~ar~lgi~~Vlvtcd~dN~ 143 (174) T COG3981 66 VPASTYWAVDEDGQIVGFINLRHQ-LNDFLLEEGGHIG-YSVRPSERRKGYAKEMLKLALEKARELGIKKVLVTCDKDNI 143 (174) T ss_pred EECEEEEEEECCCCEEEEEEEEEE-CCHHHHHCCCCCC-CEECHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 303368999338808999986210-0568886278564-11476352467899999999999998499769998479981 Q ss_pred HH Q ss_conf 74 Q gi|254781162|r 147 SK 148 (192) Q Consensus 147 ~r 148 (192) .. T Consensus 144 AS 145 (174) T COG3981 144 AS 145 (174) T ss_pred HH T ss_conf 11 No 48 >cd02169 Citrate_lyase_ligase Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction. Probab=97.86 E-value=0.00011 Score=47.34 Aligned_cols=69 Identities=23% Similarity=0.473 Sum_probs=58.8 Q ss_pred EECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH---HHHHHCCCEECC Q ss_conf 98898999999877611882238997678851113899589999999999961388479984086---578742948746 Q gi|254781162|r 79 AEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI---AYYSKLGFQAVP 155 (192) Q Consensus 79 ~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~---~yY~rfGF~~~~ 155 (192) .++++|||+-.+. +-.|-.+||+|++||.|+..+|+.+.++.+.++|..-.+++..| .+++..||..+. T Consensus 12 d~~~~lvatGs~~--------gnvlKcvAV~~~~qg~gl~~~lvs~L~~~~~~~G~~h~FvfTKp~~~~~F~~lGF~~ia 83 (296) T cd02169 12 DDAGNLIATGSLA--------GNILKCVAVSPKYQGEGLLLKLVSELINRAFERGIFHLFVFTKPENVKFFSSLGFKELA 83 (296) T ss_pred ECCCCEEEEEECC--------CCEEEEEEECHHCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHCCCEEEE T ss_conf 0799789973021--------47879999872105864799999999999997699648999371057778758977988 No 49 >COG2388 Predicted acetyltransferase [General function prediction only] Probab=97.84 E-value=7e-05 Score=48.45 Aligned_cols=67 Identities=15% Similarity=0.251 Sum_probs=55.1 Q ss_pred CCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 6078999988989999998776118822389976788511138995899999999999613884799840 Q gi|254781162|r 72 LSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG 141 (192) Q Consensus 72 ~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g 141 (192) ....++...+|..+|.+.+++. +.+..-+---.|.+++||||+|++|++++++.||+.|+..+=+-. T Consensus 14 ~~~~y~~~~~G~~~~e~~y~~~---~~~~i~i~HT~V~d~lrGqGia~~L~~~al~~ar~~g~kiiP~Cs 80 (99) T COG2388 14 ENGRYVLTDEGEVIGEATYYDR---GENLIIIDHTYVPDELRGQGIAQKLVEKALEEAREAGLKIIPLCS 80 (99) T ss_pred CCEEEEEECCCCEEEEEEEECC---CCCEEEEECCCCCHHHCCCCHHHHHHHHHHHHHHHCCCEECCCCH T ss_conf 7518999049938878888417---998799853768988858747999999999999981985733663 No 50 >pfam00765 Autoind_synth Autoinducer synthetase. Probab=97.55 E-value=0.0021 Score=39.85 Aligned_cols=108 Identities=16% Similarity=0.051 Sum_probs=74.3 Q ss_pred ECCCCCCCCEEEEEEECCEEEEEEEEEEEEE------------C------CCCEEEEEEEEECCCCCCC-C-----HHHH Q ss_conf 0388887607899998898999999877611------------8------8223899767885111389-9-----5899 Q gi|254781162|r 65 REQGMHDLSLSFLCAEGKRIVGSVRMTPISI------------E------KITGHLLGPIVVHPLYQNK-G-----IGRK 120 (192) Q Consensus 65 R~~~~~~~~lslVA~~~~~IVG~i~~~pv~~------------~------~~~~~~LgplaV~P~~rg~-G-----iG~~ 120 (192) +++-+.....-+++.++|+++|+.++.|..- . +.+.+.+..++|+++.... + +... T Consensus 37 ~DqyD~~~~~Yll~~~~g~v~g~~RLlpTt~p~ML~d~Fp~ll~~~~~P~~~~iwE~SRf~Vd~~~~~~~~~~~~~~~~~ 116 (182) T pfam00765 37 FDEYDNLNTTYLLGIKDGRVVCSVRFIETKYPNMITGTFSSYFDDISLPESGNYIESSRFFVDKARARGLLGNLAPISLM 116 (182) T ss_pred CCCCCCCCCEEEEEEECCEEEEEECCCCCCCCCHHHCCCHHHCCCCCCCCCCCEEEECEEEECCCCCCCCCCCCCHHHHH T ss_conf 55789999869999989989999813788995524210176628999999999899740379675554444452779999 Q ss_pred HHHHHHHHHHHCCCCEEEEEECH---HHHHHCCCEECCCCCEECCCCCCHHHEEEEEC Q ss_conf 99999999961388479984086---57874294874636536689999566698435 Q gi|254781162|r 121 LISMSVDAAEKKGSQVIVLVGDI---AYYSKLGFQAVPWKSLILPAPVDPNRVLFLPL 175 (192) Q Consensus 121 Li~~~l~~a~~~g~~~v~l~g~~---~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L 175 (192) |+...++.|...|...++.+.++ ..|.|.||...+ +..|-.+..+...++-+ T Consensus 117 L~~~~~~~a~~~Gi~~~v~V~~~~~~rilrr~G~~~~~---lG~p~~~~g~~~vA~~i 171 (182) T pfam00765 117 LFLSMINYARASGYDGIYTIVSHPMLRILKRSGWPISV---LGEGLSEKNERIYLLHL 171 (182) T ss_pred HHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCEEE---CCCCEEECCEEEEEEEE T ss_conf 99999999998799889999768999999986995598---88886679848999997 No 51 >PHA00673 acetyltransferase domain containing protein Probab=97.50 E-value=0.00051 Score=43.44 Aligned_cols=83 Identities=17% Similarity=0.258 Sum_probs=61.2 Q ss_pred CEEEEEEECCEEEEEEEEEEEEECCCCEEE---EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH-----H Q ss_conf 078999988989999998776118822389---97678851113899589999999999961388479984086-----5 Q gi|254781162|r 73 SLSFLCAEGKRIVGSVRMTPISIEKITGHL---LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI-----A 144 (192) Q Consensus 73 ~lslVA~~~~~IVG~i~~~pv~~~~~~~~~---LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~-----~ 144 (192) -.-|-|.+|+++||++.+.-....-..+.. -..+.|+++||+-|.|-+|++++.+.+++.|+..+.+.+.+ . T Consensus 55 ~~~l~a~q~d~LIGf~~llVT~~pHy~g~~i~t~es~fV~~eHR~ggaGlkLlr~te~~A~~LGa~~ll~sa~~~~~Ls~ 134 (154) T PHA00673 55 AHFLGVFRGEELVGFACLLVTPVPHFKGQLIGTTESIFVAAAHRPGGAGMALLRATEALARDLGATGLYVSGPTEGRLVQ 134 (154) T ss_pred EEEEEECCCCEEEEEEEEECCCCCCCCCCEEECCCEEEEEHHHCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHH T ss_conf 37875212871466544421367632477751000257746443886168999999999986087369995699981412 Q ss_pred HHHHCCCEECC Q ss_conf 78742948746 Q gi|254781162|r 145 YYSKLGFQAVP 155 (192) Q Consensus 145 yY~rfGF~~~~ 155 (192) .-.+.||.+.. T Consensus 135 ~Lp~~~y~~tN 145 (154) T PHA00673 135 LLPAAGYRETN 145 (154) T ss_pred CCCCCCCCCCC T ss_conf 12534776466 No 52 >TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase; InterPro: IPR012752 This entry represents the WecD protein (formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetyltransferases.. Probab=97.46 E-value=0.00014 Score=46.68 Aligned_cols=118 Identities=19% Similarity=0.323 Sum_probs=80.8 Q ss_pred EEEEECCHHHHHHHHHHH---CCCCCCC------------CEEEE-EECCCCCCCCEEEEEEE-CCEEEEEEEEEEEEEC Q ss_conf 344207989999999985---3898721------------00131-10388887607899998-8989999998776118 Q gi|254781162|r 34 FLERLEHDSSICAMHADS---FGPGRFV------------RAAVL-LREQGMHDLSLSFLCAE-GKRIVGSVRMTPISIE 96 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~---fgpgr~~------------~~~~~-lR~~~~~~~~lslVA~~-~~~IVG~i~~~pv~~~ 96 (192) ...|.-++.||-+|.+++ |+--||. ++..+ .--...+|-. ||+.++ .|.+.|+|.+-..... T Consensus 54 y~~rvAte~DIPaLR~~As~aFAlSRFRaPWY~p~dSgRFYA~WvEnAV~GTFDHq-CL~~~da~g~p~G~VtLR~L~d~ 132 (201) T TIGR02382 54 YSLRVATEADIPALRDVASAAFALSRFRAPWYDPEDSGRFYAQWVENAVLGTFDHQ-CLLLRDAAGDPRGYVTLRELDDS 132 (201) T ss_pred CHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCH-HHHHHCCCCCCEEEEEEEECCCC T ss_conf 02000033030689999999986410588888824232138888886505676401-35222447794046874004787 Q ss_pred CCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE---EEC---HHHHHHCCCEECC Q ss_conf 8223899767885111389958999999999996138847998---408---6578742948746 Q gi|254781162|r 97 KITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL---VGD---IAYYSKLGFQAVP 155 (192) Q Consensus 97 ~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l---~g~---~~yY~rfGF~~~~ 155 (192) ... +|=|+|-|--+-||||.+||+.+.++|+.+|...+-+ +|+ ...|=|-|=..-+ T Consensus 133 dAR---IGLLav~PG~~~rGiG~~LM~~A~~Wc~~~Gl~rLRVATQ~gN~AALrlYirsGA~iES 194 (201) T TIGR02382 133 DAR---IGLLAVFPGVTIRGIGARLMALAKAWCRRRGLIRLRVATQMGNVAALRLYIRSGASIES 194 (201) T ss_pred CCC---EECCCCCCCCCEECHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHCCCCEEC T ss_conf 653---30100378741204568999999999987598244033324679999999863897000 No 53 >COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics] Probab=97.40 E-value=0.0002 Score=45.76 Aligned_cols=85 Identities=27% Similarity=0.443 Sum_probs=58.9 Q ss_pred CCEEEEEEECCEEEEEEEEEEEEE-------CCCC-----EEEEEE---EEE-CCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 607899998898999999877611-------8822-----389976---788-511138995899999999999613884 Q gi|254781162|r 72 LSLSFLCAEGKRIVGSVRMTPISI-------EKIT-----GHLLGP---IVV-HPLYQNKGIGRKLISMSVDAAEKKGSQ 135 (192) Q Consensus 72 ~~lslVA~~~~~IVG~i~~~pv~~-------~~~~-----~~~Lgp---laV-~P~~rg~GiG~~Li~~~l~~a~~~g~~ 135 (192) ..+|+.-.+++-+||.+++-.... .+.. ....|+ +.= .-.||.+|+|+.||+.|...|++.+.. T Consensus 406 ~Fls~ed~~~d~lig~lrlR~p~e~~~r~e~~~~~aivrelhvyg~~vpig~~~~~~QH~G~G~~L~~~AE~ia~ee~~~ 485 (515) T COG1243 406 IFLSYEDPKNDILIGFLRLREPSEGAHREEIDDKTAIVRELHVYGSEVPIGKREDEWQHRGYGRELLEEAERIAREEGAK 485 (515) T ss_pred EEEECCCCHHHHHHHEEEECCCCCCHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHCCC T ss_conf 87652462231133123532445300101026631355432020210256667430100547899999999998761356 Q ss_pred EEEEE---ECHHHHHHCCCEECCC Q ss_conf 79984---0865787429487463 Q gi|254781162|r 136 VIVLV---GDIAYYSKLGFQAVPW 156 (192) Q Consensus 136 ~v~l~---g~~~yY~rfGF~~~~~ 156 (192) .+.++ |.-+||.|+||+.... T Consensus 486 ki~viSgiG~ReYy~k~GY~~~gp 509 (515) T COG1243 486 KILVISGIGVREYYRKLGYELDGP 509 (515) T ss_pred CEEEEECCCHHHHHHHHCCCCCCC T ss_conf 179983332899999738633477 No 54 >COG3053 CitC Citrate lyase synthetase [Energy production and conversion] Probab=97.37 E-value=0.0026 Score=39.31 Aligned_cols=76 Identities=16% Similarity=0.336 Sum_probs=60.7 Q ss_pred CCEEEEEE-ECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHH---HHH Q ss_conf 60789999-88989999998776118822389976788511138995899999999999613884799840865---787 Q gi|254781162|r 72 LSLSFLCA-EGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIA---YYS 147 (192) Q Consensus 72 ~~lslVA~-~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~---yY~ 147 (192) .+...++. ++++||++- +..+-.|-.+||+|.+||.|+.-+|+.+.++.+-++|..-++++..|. +++ T Consensus 35 ve~~v~~~~~~~~iiacG--------siaGnvikcvAvs~s~qGeGl~lkl~TeLin~ay~~g~~hLFiyTKp~~~~lFk 106 (352) T COG3053 35 VEYFVAIYRDNEEIIACG--------SIAGNVIKCVAVSESLQGEGLALKLVTELINLAYERGRTHLFIYTKPEYAALFK 106 (352) T ss_pred CEEEEEEECCCCCEEEEC--------CCCCCEEEEEEECHHCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHH T ss_conf 208999981788689953--------401356689996420156308999999999999974982699994666799887 Q ss_pred HCCCEECC Q ss_conf 42948746 Q gi|254781162|r 148 KLGFQAVP 155 (192) Q Consensus 148 rfGF~~~~ 155 (192) ..||..+. T Consensus 107 ~~GF~~i~ 114 (352) T COG3053 107 QCGFSEIA 114 (352) T ss_pred HCCCEEEE T ss_conf 38966864 No 55 >pfam04958 AstA Arginine N-succinyltransferase beta subunit. Arginine N-succinyltransferase EC:2.3.1.109 catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism by the arginine succinyltransferase pathway. Probab=96.96 E-value=0.0061 Score=37.10 Aligned_cols=118 Identities=18% Similarity=0.213 Sum_probs=70.9 Q ss_pred EEEEECCHHHHHHHHHHHC--CCCCC---------------CCEEEEEECC-CCCCCCEEEEEEE--CCEEEEEEEEEEE Q ss_conf 3442079899999999853--89872---------------1001311038-8887607899998--8989999998776 Q gi|254781162|r 34 FLERLEHDSSICAMHADSF--GPGRF---------------VRAAVLLREQ-GMHDLSLSFLCAE--GKRIVGSVRMTPI 93 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~f--gpgr~---------------~~~~~~lR~~-~~~~~~lslVA~~--~~~IVG~i~~~pv 93 (192) |.+|+-+.+|+.+|..++- |+|-. ++.++.-... ...+..+-||.|+ .|+|||+...... T Consensus 1 lviRpv~~~Dl~~L~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~Sf~~~~~~~~~~~~YlFVLEd~~tg~vvGts~I~a~ 80 (338) T pfam04958 1 LVIRPARASDLPALYRLAKESGHGFTSLPADRELLRAKIARSEASFAGEEVDQPGDEGYLFVLEDTETGEVVGTSGIEAA 80 (338) T ss_pred CEEEECCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCEEEEEEEEEEC T ss_conf 95721751039999999998099865599999999999999999864677799775217999971578848888766643 Q ss_pred ----------EEC----------------------C-CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH---HHCCCCEE Q ss_conf ----------118----------------------8-22389976788511138995899999999999---61388479 Q gi|254781162|r 94 ----------SIE----------------------K-ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA---EKKGSQVI 137 (192) Q Consensus 94 ----------~~~----------------------~-~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a---~~~g~~~v 137 (192) ++. . ...-.|+.+.++|+||+.|.|+.|-+.-.-.+ +++=...+ T Consensus 81 vG~~~Pfy~yr~~~~~h~S~~L~v~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RfLfma~~~~rF~~~v 160 (338) T pfam04958 81 VGLDEPFYSYRLSTLVHASRELGVHNRVEVLTLCNDLTGCSELCTLFLDPEYRKGGNGRLLSRSRFLFIAAHRERFAERV 160 (338) T ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCEEEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 46787867899322332161248520101588750677870368887488881776315789999999876266643556 Q ss_pred EE--------EECHHHHHHCCC Q ss_conf 98--------408657874294 Q gi|254781162|r 138 VL--------VGDIAYYSKLGF 151 (192) Q Consensus 138 ~l--------~g~~~yY~rfGF 151 (192) +. -|..+||..+|= T Consensus 161 iAEmRG~~De~G~SPFW~~lg~ 182 (338) T pfam04958 161 IAEMRGVSDEDGRSPFWDALGR 182 (338) T ss_pred HHHHCCCCCCCCCCCHHHHHHH T ss_conf 6451586188999823787632 No 56 >TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit. Probab=96.94 E-value=0.011 Score=35.70 Aligned_cols=117 Identities=12% Similarity=0.149 Sum_probs=71.1 Q ss_pred EEEECCHHHHHHHHHHHC--CCCCCC---------------CEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEEEE-- Q ss_conf 442079899999999853--898721---------------0013110388887607899998--8989999998776-- Q gi|254781162|r 35 LERLEHDSSICAMHADSF--GPGRFV---------------RAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMTPI-- 93 (192) Q Consensus 35 ~~R~e~~~di~ai~~~~f--gpgr~~---------------~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~pv-- 93 (192) .+|+-+.+|..+|..++- |+|-.+ +.++.-....+....+-||.|+ .|+|||+...... T Consensus 1 vvRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~e~YlFVLEd~~tg~vvG~s~I~a~vG 80 (336) T TIGR03245 1 IVRPSRFADLPAIERLANESAIGVTSLPADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIVASAG 80 (336) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCEEEEEEEEEEEEC T ss_conf 94447411199999999973999655999999999999999998635567888740899998568891899874786104 Q ss_pred --------EEC----------------------C-CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH---HHCCCCEEEE Q ss_conf --------118----------------------8-22389976788511138995899999999999---6138847998 Q gi|254781162|r 94 --------SIE----------------------K-ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA---EKKGSQVIVL 139 (192) Q Consensus 94 --------~~~----------------------~-~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a---~~~g~~~v~l 139 (192) ++. . ...-.|+.+.++|+||+.|.|+.|-+.-.-.+ +++=...++. T Consensus 81 ~~~Pfy~yr~~~~vh~S~~L~~~~~~~~L~l~~d~tG~sEl~tLfL~p~~R~~~~G~LLS~~RfLFmA~~~~rF~~~viA 160 (336) T TIGR03245 81 YGEPFYSYRNDTLIHASRELKVNNKIHVLYMCHELTGSSLLCSFYVDPRLRKTEAAELLSRARLLFMAAHRERFQSRIIV 160 (336) T ss_pred CCCCCEEEEECCEEEECCCCCCCCCCEEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHCHHHEE T ss_conf 89872899926467525035974230059986168887057999978787478605689999999998717661203103 Q ss_pred --------EECHHHHHHCCC Q ss_conf --------408657874294 Q gi|254781162|r 140 --------VGDIAYYSKLGF 151 (192) Q Consensus 140 --------~g~~~yY~rfGF 151 (192) -|..+||..+|= T Consensus 161 EmRG~~d~~G~SPFW~~lg~ 180 (336) T TIGR03245 161 EIQGVQDDNGDSPFWDAIGR 180 (336) T ss_pred HHCCCCCCCCCCCCHHHHHC T ss_conf 00585489999830787635 No 57 >PRK13688 hypothetical protein; Provisional Probab=96.92 E-value=0.00076 Score=42.39 Aligned_cols=77 Identities=23% Similarity=0.233 Sum_probs=54.2 Q ss_pred EEEEECCEEEEEEEEEEEEE-------CCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCC--EEEEEECHHHH Q ss_conf 99998898999999877611-------8822389976788511138995899999999999613884--79984086578 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPISI-------EKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQ--VIVLVGDIAYY 146 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv~~-------~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~--~v~l~g~~~yY 146 (192) +-..-|+++|+-..+..... ...+-+.|..|-|+|.||++|+|++|+. .|+..+.+ .+.-....+|+ T Consensus 48 YGIy~G~kLvARMsLy~~~~~~~~yF~~~~dYleLWKLEVLp~yQ~~GyG~~LVd----fAKsf~lPIKt~aR~~S~~Fw 123 (157) T PRK13688 48 YGIYYGDSLVARMSLYKKGDVEEPYFPIFGDYLELWKLEVLPKYQNRGYGEMLVD----FAKSFNMPIKTIARNDSKDFW 123 (157) T ss_pred EEEEECCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHH----HHHHCCCCEEECCCCCHHHHH T ss_conf 2588764088888888607865456798415767644210622023665799999----998769970410121407799 Q ss_pred HHCCCEECCC Q ss_conf 7429487463 Q gi|254781162|r 147 SKLGFQAVPW 156 (192) Q Consensus 147 ~rfGF~~~~~ 156 (192) .+.||+.+.. T Consensus 124 ~k~gF~~v~~ 133 (157) T PRK13688 124 DKLGFTPVEA 133 (157) T ss_pred HHCCCEECCC T ss_conf 9749954223 No 58 >COG0454 WecD Histone acetyltransferase HPA2 and related acetyltransferases [Transcription / General function prediction only] Probab=96.91 E-value=0.0006 Score=43.00 Aligned_cols=44 Identities=34% Similarity=0.570 Sum_probs=35.9 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHCCC Q ss_conf 7885111389958999999999996138847998408657874294 Q gi|254781162|r 106 IVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIAYYSKLGF 151 (192) Q Consensus 106 laV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~yY~rfGF 151 (192) ++|+|++||+|+|++|++++++.++..|.. .......+|.++|| T Consensus 87 l~v~~~~rg~Gig~~Ll~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 130 (156) T COG0454 87 LYVLPEYRGKGIGSALLEAALEWARKRGIS--LNRLALEVYEKNGF 130 (156) T ss_pred EEECHHHHCCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHCC T ss_conf 788698855892899999999999980950--67888989987510 No 59 >PRK00756 acyltransferase NodA; Provisional Probab=96.87 E-value=0.0066 Score=36.92 Aligned_cols=112 Identities=15% Similarity=0.189 Sum_probs=78.8 Q ss_pred ECCHHHHHHHHHHHCCC-CCC-CCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEE--EEECCCCE--EEEEEEEECCC Q ss_conf 07989999999985389-872-10013110388887607899998898999999877--61188223--89976788511 Q gi|254781162|r 38 LEHDSSICAMHADSFGP-GRF-VRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTP--ISIEKITG--HLLGPIVVHPL 111 (192) Q Consensus 38 ~e~~~di~ai~~~~fgp-gr~-~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~p--v~~~~~~~--~~LgplaV~P~ 111 (192) .++..+....+..+||| |-| ++.-+--|.=...-+++-.++++..-|..|+-+.+ ++++..+. ..||-.+|.|+ T Consensus 17 l~dH~eLaeFfrktYgptGafnakpFeG~RSWAGARPE~R~IgyD~~GVaAH~G~LRRFIkVg~vdlLVaElGLygVRpD 96 (196) T PRK00756 17 LSDHAELAEFFRKTYGPTGAFNAKPFEGGRSWAGARPELRAIAYDSHGVAAHMGLLRRFIKVGEVDLLVAELGLYGVRPD 96 (196) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHEEECCEEEEEEEEEEEEECCC T ss_conf 01279999999983288764246456777565667873378865673088887777530255664057773024787577 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH---HHHHHCC Q ss_conf 13899589999999999961388479984086---5787429 Q gi|254781162|r 112 YQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI---AYYSKLG 150 (192) Q Consensus 112 ~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~---~yY~rfG 150 (192) ..|.||+.. |+...-.+++.|..+-+-.-.+ +.++||+ T Consensus 97 lEGlGi~~s-~~~m~PvLq~LgVPF~FGtVR~al~~Hv~R~~ 137 (196) T PRK00756 97 LEGLGIAHS-IRAMYPVLQELGVPFAFGTVRHALRNHVERLC 137 (196) T ss_pred CCCCCHHHH-HHHHHHHHHHCCCCEECCCCHHHHHHHHHHHH T ss_conf 443551355-88887888863997342342598999999997 No 60 >PRK10456 arginine succinyltransferase; Provisional Probab=96.65 E-value=0.012 Score=35.43 Aligned_cols=117 Identities=15% Similarity=0.138 Sum_probs=70.2 Q ss_pred EEEEECCHHHHHHHHHHHCC--CCCCC---------------CEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEEEE- Q ss_conf 34420798999999998538--98721---------------0013110388887607899998--8989999998776- Q gi|254781162|r 34 FLERLEHDSSICAMHADSFG--PGRFV---------------RAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMTPI- 93 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~fg--pgr~~---------------~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~pv- 93 (192) |.+|+-+.+|..+|..++-. +|-.+ +.++. .+-...+..+-||.|+ .|+|||+...... T Consensus 2 livRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~-~~~~~~~~~YlFVLED~~tg~vvGts~I~a~v 80 (344) T PRK10456 2 MVIRPVERSDLAALMQLAGKTGGGLTSLPANEATLAARIERALKTWQ-GELPKSEQGYVFVLEDSETGTVAGICAIEVAV 80 (344) T ss_pred EEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCCCCCCCEEEEEEEECCCCEEEEEEEEEECC T ss_conf 48973760049999999997099976599999999999999999864-78997643179999855788088887578503 Q ss_pred ---------EEC----------------------C-CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH---HHCCCCEEE Q ss_conf ---------118----------------------8-22389976788511138995899999999999---613884799 Q gi|254781162|r 94 ---------SIE----------------------K-ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA---EKKGSQVIV 138 (192) Q Consensus 94 ---------~~~----------------------~-~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a---~~~g~~~v~ 138 (192) ++. . ...-.|+.+.++|+||+.|.|+.|-+.-.-.+ +++-...++ T Consensus 81 G~~~Pfy~yr~~~~~h~S~~L~~~~~~~~L~l~~d~tG~sEl~tLfL~p~~R~~~~G~lLS~~RfLFia~~~~rF~~~vi 160 (344) T PRK10456 81 GLNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNDKVV 160 (344) T ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCEEEEEECCCCCCEEEEEEEECHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 57878478882224530523485201015887616778703688875788816764047889899999871766534455 Q ss_pred E--------EECHHHHHHCCC Q ss_conf 8--------408657874294 Q gi|254781162|r 139 L--------VGDIAYYSKLGF 151 (192) Q Consensus 139 l--------~g~~~yY~rfGF 151 (192) . -|..+||..+|= T Consensus 161 AEmRG~~D~~G~SPFW~~lg~ 181 (344) T PRK10456 161 AEMRGVIDEHGYSPFWQSLGK 181 (344) T ss_pred HHHCCCCCCCCCCCHHHHHHC T ss_conf 441586189999822787632 No 61 >PRK13834 putative autoinducer synthesis protein; Provisional Probab=96.52 E-value=0.032 Score=32.88 Aligned_cols=109 Identities=14% Similarity=0.143 Sum_probs=70.3 Q ss_pred CCEEEEEE-ECCEEEEEEEEEEEEEC------------------CCCEEEEEEEEECCCCCC---CC----HHHHHHHHH Q ss_conf 60789999-88989999998776118------------------822389976788511138---99----589999999 Q gi|254781162|r 72 LSLSFLCA-EGKRIVGSVRMTPISIE------------------KITGHLLGPIVVHPLYQN---KG----IGRKLISMS 125 (192) Q Consensus 72 ~~lslVA~-~~~~IVG~i~~~pv~~~------------------~~~~~~LgplaV~P~~rg---~G----iG~~Li~~~ 125 (192) ...-+++. ++|+|+|++++.|..-. +.....+...+|++.... .+ +...|+..+ T Consensus 52 ~~~Yll~~~~~g~v~G~~RLlpTt~p~ML~~~Fp~ll~~~~~p~~~~i~E~SRF~Vd~~~~~~~~~~~~~~~t~~L~~a~ 131 (207) T PRK13834 52 KPTYILAISDSGRVAGCARLLPALGPTMLADVFPQLLPDGRLNAHPAMIESSRFCVDTTLAEGRGGGSLHEATLTMFAGI 131 (207) T ss_pred CCEEEEEECCCCCEEEEEEECCCCCCCCCCCHHHHHCCCCCCCCCCCEEEECCEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 98699999089848989963378997633470577638999999999899783489776665556887277899999999 Q ss_pred HHHHHHCCCCEEEEEECHH---HHHHCCCEECCCCCEECCCCCCHHHEEEEEC--CCCCCCCC Q ss_conf 9999613884799840865---7874294874636536689999566698435--77810178 Q gi|254781162|r 126 VDAAEKKGSQVIVLVGDIA---YYSKLGFQAVPWKSLILPAPVDPNRVLFLPL--VQNVAQNI 183 (192) Q Consensus 126 l~~a~~~g~~~v~l~g~~~---yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L--~~~~l~~~ 183 (192) ++.+...|+..++.+.++. .+.|.||...+ +-.|-++.....++..+ .+..++.+ T Consensus 132 ve~a~~~Gi~~ivtV~~~~~eRiL~R~G~~~~r---LG~p~~ig~~~~vA~~l~vd~~tl~~l 191 (207) T PRK13834 132 IEWSMANGYTEIVTATDLRFERILARAGWPMQR---LGEPKAIGNTMAVAGTLPADADSFERV 191 (207) T ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHCCCCEEE---CCCCEEECCEEEEEEEEECCHHHHHHH T ss_conf 999998799889999438999999985995596---768877788799999987699999872 No 62 >TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST). Probab=96.48 E-value=0.021 Score=33.99 Aligned_cols=116 Identities=17% Similarity=0.186 Sum_probs=70.1 Q ss_pred EEEECCHHHHHHHHHHHC--CCCCC---------------CCEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEEEE-- Q ss_conf 442079899999999853--89872---------------10013110388887607899998--8989999998776-- Q gi|254781162|r 35 LERLEHDSSICAMHADSF--GPGRF---------------VRAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMTPI-- 93 (192) Q Consensus 35 ~~R~e~~~di~ai~~~~f--gpgr~---------------~~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~pv-- 93 (192) .+|+-+.+|..+|..++- |+|-. ++.++.- +..+....+-||.|+ .|+|||+...... T Consensus 1 vvRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~-~~~~~~~~YlFVLEd~~~g~vvGts~I~a~vG 79 (336) T TIGR03244 1 IVRPVETSDLDALYQLAQSTGIGLTSLPANEDLLSARIERAEKTFSG-ELTRAEQGYLFVLEDTETGTVAGVSAIEAAVG 79 (336) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCC-CCCCCCCEEEEEEEECCCCEEEEEEEEEECCC T ss_conf 94437422099999999973998655999999999999999998658-78877631899998778993999874675225 Q ss_pred --------EEC----------------------C-CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH---HHCCCCEEEE Q ss_conf --------118----------------------8-22389976788511138995899999999999---6138847998 Q gi|254781162|r 94 --------SIE----------------------K-ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA---EKKGSQVIVL 139 (192) Q Consensus 94 --------~~~----------------------~-~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a---~~~g~~~v~l 139 (192) ++. . ...-.|+.+.++|+||+.|.|+.|-+.-.-.+ +++=...++. T Consensus 80 ~~~Pfy~yr~~~~~~~S~~L~~~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RfLFia~~~erF~~~viA 159 (336) T TIGR03244 80 LEEPFYNYRVGTVVHASKELGIYKALETLFLSNDLTGYSELCTLFLDPDYRKGGNGRLLSKSRFLFIAQFRERFSKKIIA 159 (336) T ss_pred CCCCCEEEEECCEEEECCCCCCCCCCCEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 89887889967556624457975311159987377687268999978898278723589999999998717651214220 Q ss_pred --------EECHHHHHHCCC Q ss_conf --------408657874294 Q gi|254781162|r 140 --------VGDIAYYSKLGF 151 (192) Q Consensus 140 --------~g~~~yY~rfGF 151 (192) -|..+||..+|= T Consensus 160 EmRG~~d~~G~SPFW~~lg~ 179 (336) T TIGR03244 160 EMRGVSDEQGRSPFWNALGR 179 (336) T ss_pred HHCCCCCCCCCCCCHHHHHC T ss_conf 20585489999831777624 No 63 >pfam02474 NodA Nodulation protein A (NodA). Rhizobia nodulation (nod) genes control the biosynthesis of Nod factors required for infection and nodulation of their legume hosts. Nodulation protein A (NodA) is a N-acetyltransferase involved in production of Nod factors that stimulate mitosis in various plant protoplasts. Probab=96.43 E-value=0.011 Score=35.54 Aligned_cols=113 Identities=14% Similarity=0.175 Sum_probs=78.9 Q ss_pred ECCHHHHHHHHHHHCCC-CCC-CCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEE--EEECCCCE--EEEEEEEECCC Q ss_conf 07989999999985389-872-10013110388887607899998898999999877--61188223--89976788511 Q gi|254781162|r 38 LEHDSSICAMHADSFGP-GRF-VRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTP--ISIEKITG--HLLGPIVVHPL 111 (192) Q Consensus 38 ~e~~~di~ai~~~~fgp-gr~-~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~p--v~~~~~~~--~~LgplaV~P~ 111 (192) ..+..++...+..+||| |.| ++.-+=-|.=...-+++-.++++..-|..|+-+.+ +++++.+. ..||-.+|.|+ T Consensus 17 l~dH~eLaeFfrktYgptgafnakpFeG~RSWAGARPElR~IgyD~~GVaAH~G~LRRFIkVg~vDlLVaELGLygVRpD 96 (196) T pfam02474 17 LADHIELSEFFRKTYGPTGAFNAKPFEGGRSWAGARPELRAIAYDSRGVAAHMGLLRRFIKVGEVDLLVAELGLYGVRPD 96 (196) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHEECCEEEEEEEEEEEEECCC T ss_conf 11169999999984288764346656777564557851378876573088988888775124761013443001476565 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH---HHHHHCCC Q ss_conf 13899589999999999961388479984086---57874294 Q gi|254781162|r 112 YQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI---AYYSKLGF 151 (192) Q Consensus 112 ~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~---~yY~rfGF 151 (192) .+|.||+.. |+...-.+++.|.++-+-.-.+ +.++|||- T Consensus 97 lEGlGi~~s-~~~m~PvLq~LgVPF~FGtVR~al~~Hv~R~~r 138 (196) T pfam02474 97 LEGLGISHS-MRVMYPVLQELGVPFAFGTVRHALRKHVERLCR 138 (196) T ss_pred CCCCCHHHH-HHHHHHHHHHCCCCEECCCCHHHHHHHHHHHHC T ss_conf 133534677-989988888629974523525989999999821 No 64 >TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST). Probab=96.25 E-value=0.029 Score=33.17 Aligned_cols=150 Identities=19% Similarity=0.271 Sum_probs=82.6 Q ss_pred EEEECCHHHHHHHHHHHC--CCCCC---------------CCEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEEEE-- Q ss_conf 442079899999999853--89872---------------10013110388887607899998--8989999998776-- Q gi|254781162|r 35 LERLEHDSSICAMHADSF--GPGRF---------------VRAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMTPI-- 93 (192) Q Consensus 35 ~~R~e~~~di~ai~~~~f--gpgr~---------------~~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~pv-- 93 (192) .+|+-+.+|..+|..++- |+|-. ++.++.- +..+.+..+-||.|+ .|+|||+...... T Consensus 1 viRpv~~~Dl~~l~~LA~~sg~G~TsLP~d~~~L~~rI~~S~~sf~~-~~~~~~~~YlFVLED~~tg~vvG~s~I~a~vG 79 (335) T TIGR03243 1 IVRPVRTSDLDALMQLARESGIGLTSLPADRAALGSRIARSEKSFAG-ESTRGEEGYLFVLEDTETGTVAGVSAIEAAVG 79 (335) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCC-CCCCCCCEEEEEEEECCCCCEEEEEEEEECCC T ss_conf 94437323099999999973998655999999999999999998648-77888734899998778991899874676145 Q ss_pred --------EEC----------------------C-CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH---HHCCCCEEEE Q ss_conf --------118----------------------8-22389976788511138995899999999999---6138847998 Q gi|254781162|r 94 --------SIE----------------------K-ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAA---EKKGSQVIVL 139 (192) Q Consensus 94 --------~~~----------------------~-~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a---~~~g~~~v~l 139 (192) ++. . ...-.|+.+.++|+||+.|.|+.|-+.-.-.+ +++=...++. T Consensus 80 ~~~Pfy~yr~~~~~h~S~~L~~~~~~~~L~l~~d~tg~sEl~tLfl~p~~R~~~~G~lLS~~RflFma~~~~rF~~~viA 159 (335) T TIGR03243 80 LDEPFYNYRVGTLVHASRELGVYNKIPTLTLSNDLTGSSELCTLFLDPDYRKGGNGRLLSRSRFLFIAAFRERFGDKIIA 159 (335) T ss_pred CCCCCEEEEECCEEEECCCCCCCCCCCEEEEECCCCCCEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88874799917556506346872121169976278787057999987787378724589999999998737664367885 Q ss_pred --------EECHHHHHHCCCE-------ECCC------CCEECCCCCCHHHEEEEECCCCCCCCCEEEE Q ss_conf --------4086578742948-------7463------6536689999566698435778101781677 Q gi|254781162|r 140 --------VGDIAYYSKLGFQ-------AVPW------KSLILPAPVDPNRVLFLPLVQNVAQNIKGIV 187 (192) Q Consensus 140 --------~g~~~yY~rfGF~-------~~~~------~~l~~p~pv~~~~~l~l~L~~~~l~~~~G~v 187 (192) -|..+||..+|=. .+-. ..+... .=|.+-...+|-+...+.+-|.| T Consensus 160 EmRG~~d~~G~SPFW~~lg~~Ff~mdF~~AD~ls~~~~k~FIae--LmP~~PIYv~lLp~~Aq~vIG~v 226 (335) T TIGR03243 160 EMRGVSDEQGRSPFWEALGRHFFSMDFAQADYLSGIGSKTFIAE--LMPKYPIYVPLLPEEAQDVIGQV 226 (335) T ss_pred HCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHCCCCCHHHHH--HCCCCCEEEECCCHHHHHHHCCC T ss_conf 41787389999830787634235998778778872687244887--68899754205998999972984 No 65 >COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only] Probab=95.97 E-value=0.0053 Score=37.49 Aligned_cols=84 Identities=19% Similarity=0.218 Sum_probs=61.7 Q ss_pred EEEEECCEEEEEEEEEEEEE--CC----------CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE--EE Q ss_conf 99998898999999877611--88----------223899767885111389958999999999996138847998--40 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPISI--EK----------ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL--VG 141 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv~~--~~----------~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l--~g 141 (192) |||+.+|.+-|.++-.-... ++ .+...+-.+.|....||+|+|++|-+...+.|+..|+..+.- .. T Consensus 48 F~ArR~G~l~afl~tFd~~a~ydSpNFlWFrErYe~F~YvDRvVVA~~aRGrG~aRalY~Dlf~~Ae~agy~~~tCEVn~ 127 (167) T COG3818 48 FVARRDGNLAAFLVTFDSSARYDSPNFLWFRERYENFFYVDRVVVASRARGRGVARALYADLFSYAELAGYPYLTCEVNL 127 (167) T ss_pred HHHHHCCCHHHHEEECCCCCCCCCCCEEEHHHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 88863244444035336655679986023565377427998999974134664389999999999986588569998607 Q ss_pred CH------HHHHHCCCEECCCCCE Q ss_conf 86------5787429487463653 Q gi|254781162|r 142 DI------AYYSKLGFQAVPWKSL 159 (192) Q Consensus 142 ~~------~yY~rfGF~~~~~~~l 159 (192) || .|-..+||.++.+..+ T Consensus 128 DppnpasdaFHaalGF~eVG~a~i 151 (167) T COG3818 128 DPPNPASDAFHAALGFHEVGQATI 151 (167) T ss_pred CCCCHHHHHHHHHCCCEECCCEEE T ss_conf 999827788766508647562478 No 66 >KOG2535 consensus Probab=95.76 E-value=0.015 Score=34.88 Aligned_cols=77 Identities=29% Similarity=0.473 Sum_probs=50.2 Q ss_pred ECCEEEEEEEEEEEEECC--------C----CEEEEEE-EEEC----CCCCCCCHHHHHHHHHHHHHH-HCCCCEEEE-- Q ss_conf 889899999987761188--------2----2389976-7885----111389958999999999996-138847998-- Q gi|254781162|r 80 EGKRIVGSVRMTPISIEK--------I----TGHLLGP-IVVH----PLYQNKGIGRKLISMSVDAAE-KKGSQVIVL-- 139 (192) Q Consensus 80 ~~~~IVG~i~~~pv~~~~--------~----~~~~Lgp-laV~----P~~rg~GiG~~Li~~~l~~a~-~~g~~~v~l-- 139 (192) ..+-+||-.++.++.... . +.+..|+ +.|+ ..+|.||.|+.||+++...|+ +.|...+.+ T Consensus 450 kqDILiGLLRLRkcs~~~~~~el~g~~SivRELHVYGs~vpv~~rDp~KfQHQG~GtLLmeEAERIAr~EHgS~KiavIS 529 (554) T KOG2535 450 KQDILIGLLRLRKCSKKTTRPELFGSQSIVRELHVYGSVVPVHSRDPTKFQHQGFGTLLMEEAERIAREEHGSGKIAVIS 529 (554) T ss_pred CHHHHHHHHHHHHCCCCCCCHHHCCCHHHHEEEEECCEEEECCCCCCHHHHHCCHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 32378888988642654346001475001101112102532246883343213434789999999988750786368994 Q ss_pred -EECHHHHHHCCCEECCC Q ss_conf -40865787429487463 Q gi|254781162|r 140 -VGDIAYYSKLGFQAVPW 156 (192) Q Consensus 140 -~g~~~yY~rfGF~~~~~ 156 (192) +|.-+||.|+||+-... T Consensus 530 GVGtR~YY~klGY~LdGP 547 (554) T KOG2535 530 GVGTRNYYRKLGYELDGP 547 (554) T ss_pred CCCHHHHHHHHCCEECCC T ss_conf 353488898718042370 No 67 >PHA01807 hypothetical protein Probab=95.51 E-value=0.086 Score=30.39 Aligned_cols=69 Identities=14% Similarity=0.125 Sum_probs=51.6 Q ss_pred CCCEEEEEEECCEEEEEEEEEEEEECCC--CEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 7607899998898999999877611882--238997678851113899589999999999961388479984 Q gi|254781162|r 71 DLSLSFLCAEGKRIVGSVRMTPISIEKI--TGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV 140 (192) Q Consensus 71 ~~~lslVA~~~~~IVG~i~~~pv~~~~~--~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~ 140 (192) ...+-+.+..||+++|.....- ..+.. +.+...--+|.|+|+..|+..++++.++..|...+++.+.-. T Consensus 48 ~dr~e~Lv~rdG~laG~Av~~~-dDD~HVG~cl~vQWqyVlPe~rnaGv~~~f~R~l~~lA~~~n~p~va~t 118 (145) T PHA01807 48 NDRTELLVFRDGQLAGGAVIVF-DDDPHVGPCLTVQWQYVLPEYRNAGVAREFIRELHRLAGWGNIPLVAWT 118 (145) T ss_pred CCCEEEEEEECCEEEEEEEEEE-CCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEE T ss_conf 7725899997788632389984-5998743315788888523445664249999999997503687258987 No 68 >COG3375 Uncharacterized conserved protein [Function unknown] Probab=95.35 E-value=0.089 Score=30.29 Aligned_cols=104 Identities=15% Similarity=0.025 Sum_probs=68.1 Q ss_pred EEECCHHHHHHHHHHHCCC-C-CCC----CEEEEEECCCCCCCCEEEEEEECC-EEEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 4207989999999985389-8-721----001311038888760789999889-89999998776118822389976788 Q gi|254781162|r 36 ERLEHDSSICAMHADSFGP-G-RFV----RAAVLLREQGMHDLSLSFLCAEGK-RIVGSVRMTPISIEKITGHLLGPIVV 108 (192) Q Consensus 36 ~R~e~~~di~ai~~~~fgp-g-r~~----~~~~~lR~~~~~~~~lslVA~~~~-~IVG~i~~~pv~~~~~~~~~LgplaV 108 (192) -|..++.++.+.+++.+.. | +.. +..++ .-...-.|-+-|..++ ++||..--+|..-++.-.+---.++| T Consensus 6 rrl~dp~el~~~~dV~~~aWg~~d~~~~~~d~i~---al~~~GGlvlgAf~~dg~lVGls~G~pg~r~g~~y~ySH~~gV 82 (266) T COG3375 6 RRLTDPAELDEAEDVQASAWGSEDRDGAPADTIR---ALRYHGGLVLGAFSADGRLVGLSYGYPGGRGGSLYLYSHMLGV 82 (266) T ss_pred EECCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH---HHHHCCCEEEEEECCCCCEEEEEECCCCCCCCCEEEEEEEHHC T ss_conf 7537989988888899987176533544388999---9986188699987389837878852577678755655300214 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 5111389958999999999996138847998408 Q gi|254781162|r 109 HPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD 142 (192) Q Consensus 109 ~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~ 142 (192) .|+++++|+|-+|-..--+.+.++|+..+--.=| T Consensus 83 ~e~~k~sglg~aLK~~Qre~a~~~G~tli~WTfD 116 (266) T COG3375 83 REEVKGSGLGVALKMKQRERALSMGYTLIAWTFD 116 (266) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEECC T ss_conf 5010346655655777788998658036787606 No 69 >pfam08444 Gly_acyl_tr_C Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region. This family features the C-terminal region of several mammalian specific aralkyl acyl-CoA:amino acid N-acyltransferase (glycine N-acyltransferase) proteins EC:2.3.1.13. Probab=95.28 E-value=0.042 Score=32.21 Aligned_cols=54 Identities=24% Similarity=0.275 Sum_probs=45.4 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-----HHHHHHCCCEECCC Q ss_conf 9767885111389958999999999996138847998408-----65787429487463 Q gi|254781162|r 103 LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD-----IAYYSKLGFQAVPW 156 (192) Q Consensus 103 LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~-----~~yY~rfGF~~~~~ 156 (192) ++--+..|+||++|+.+.++...+..+.++|++.-.-+-+ ...-.++||...|- T Consensus 22 ~rmgyTlPeyR~kG~~~~~~~~~~~~L~~~g~P~Y~hv~~~N~~~~k~~~~lg~~~~pc 80 (89) T pfam08444 22 LRMAGTLPKYRRQGLMSHVIYHQAQYLEKLGFPVYSHVDKANEISQKMSGNLGHVPMPC 80 (89) T ss_pred EEECCCCHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEEHHCCHHHHHHHHHCCCEECCC T ss_conf 10013377787558699999999999986799946864115788999999779825798 No 70 >COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only] Probab=95.20 E-value=0.015 Score=34.87 Aligned_cols=55 Identities=18% Similarity=0.364 Sum_probs=43.3 Q ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE-------E----ECHHHHHHCCCEECC Q ss_conf 899767885111389958999999999996138847998-------4----086578742948746 Q gi|254781162|r 101 HLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL-------V----GDIAYYSKLGFQAVP 155 (192) Q Consensus 101 ~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l-------~----g~~~yY~rfGF~~~~ 155 (192) .-++.+.|||+||+.|+|..-|..++++..++-.+-..- . ..-+|+++.||+..- T Consensus 242 ariarvvvhpdyr~dglg~~sv~~a~ewI~eRriPEmr~rkHlvetiaqmarynpffe~~gfkylw 307 (593) T COG2401 242 ARIARVVVHPDYRADGLGQLSVIAALEWIIERRIPEMRPRKHLVETIAQMARYNPFFEKVGFKYLW 307 (593) T ss_pred HHEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCHHHHHCEEEEE T ss_conf 430678856654567610637999999999851802324565999999987507455430303665 No 71 >pfam01853 MOZ_SAS MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases. Probab=94.33 E-value=0.09 Score=30.26 Aligned_cols=60 Identities=32% Similarity=0.428 Sum_probs=41.8 Q ss_pred CCEEEEEEE----CCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 607899998----898999999877611882238997678851113899589999999999961388 Q gi|254781162|r 72 LSLSFLCAE----GKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGS 134 (192) Q Consensus 72 ~~lslVA~~----~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~ 134 (192) +-+-+|..+ |-.+||+ ||+-.. ..+.+-|+-+-|.|.||++|+|+-||.-+-..++..|. T Consensus 51 ~FlFYVl~e~d~~g~h~vGy--FSKEk~-s~~~~NLaCIltlP~~QrkGyG~~LI~fSY~LSr~E~~ 114 (189) T pfam01853 51 PFLFYILTETDETGCHIVGY--FSKEKE-SSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGK 114 (189) T ss_pred CEEEEEEEEECCCCCEEEEE--ECHHHC-CCCCCCEEEEEECCHHHHCCHHHHHHHHHHHHHHHCCC T ss_conf 55999999855998588766--311003-55567347899638788525233226555026565388 No 72 >COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism] Probab=93.47 E-value=0.59 Score=25.48 Aligned_cols=119 Identities=22% Similarity=0.189 Sum_probs=75.3 Q ss_pred EECCHHHHHHHHHH---HCCC--C-CCCCEEEEEECCCCCCCCEEEEE-EECCEEEEEEEEEEEEE------------C- Q ss_conf 20798999999998---5389--8-72100131103888876078999-98898999999877611------------8- Q gi|254781162|r 37 RLEHDSSICAMHAD---SFGP--G-RFVRAAVLLREQGMHDLSLSFLC-AEGKRIVGSVRMTPISI------------E- 96 (192) Q Consensus 37 R~e~~~di~ai~~~---~fgp--g-r~~~~~~~lR~~~~~~~~lslVA-~~~~~IVG~i~~~pv~~------------~- 96 (192) +.+.+..+++++.+ +|.- + .-.-...--.++-++..+.-+++ -.+|+|+|++++-|..- + T Consensus 10 ~~~~~~~l~em~rlR~~vF~erL~W~v~~~~g~E~DqyD~~~t~Yll~~~~~g~I~G~~RlLptt~P~mL~~vF~~Ll~~ 89 (209) T COG3916 10 RELFPKALEEMHRLRYQVFKERLGWDVVCIDGFEIDQYDNLDTVYLLALTSDGRIVGCVRLLPTTGPYMLTDVFPALLEG 89 (209) T ss_pred CHHCHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCCCCHHHHHHHHHHCC T ss_conf 20157999999888999988754771031688440146788852899985899689899863677852555056898469 Q ss_pred C-----CCEEEEEEEEECC--CCCCCC----HHHHHHHHHHHHHHHCCCCEEEEEECH---HHHHHCCCEECC Q ss_conf 8-----2238997678851--113899----589999999999961388479984086---578742948746 Q gi|254781162|r 97 K-----ITGHLLGPIVVHP--LYQNKG----IGRKLISMSVDAAEKKGSQVIVLVGDI---AYYSKLGFQAVP 155 (192) Q Consensus 97 ~-----~~~~~LgplaV~P--~~rg~G----iG~~Li~~~l~~a~~~g~~~v~l~g~~---~yY~rfGF~~~~ 155 (192) + .....+...+|++ .-+..| ++..|+.-.++.+.+.|+..++.+.+. ....|.||.... T Consensus 90 ~~~P~~p~vwEsSRF~vd~~~a~~~~g~~~~a~~el~~g~ie~a~~~G~~~IvtVt~~~meril~r~Gw~~~r 162 (209) T COG3916 90 GPPPSSPGVWESSRFAVDKPSARRAAGGVSPAAYELFAGMIEYALARGITGIVTVTDTGMERILRRAGWPLTR 162 (209) T ss_pred CCCCCCCCEEEEEEEEECCCCCHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCEEE T ss_conf 9999998827876455156425334289548899999999999997699659999700899999984997087 No 73 >COG5628 Predicted acetyltransferase [General function prediction only] Probab=93.02 E-value=0.44 Score=26.24 Aligned_cols=75 Identities=15% Similarity=0.192 Sum_probs=50.8 Q ss_pred EEEEECCEEEEEEEEEEE-EECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEE--EC--HHHHHHCC Q ss_conf 999988989999998776-11882238997678851113899589999999999961388479984--08--65787429 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPI-SIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLV--GD--IAYYSKLG 150 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv-~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~--g~--~~yY~rfG 150 (192) +....||.+||.+.+..- -....--..++...+...||++|+|++-.++.-..+ +|...|..+ .. .++++++- T Consensus 40 ~~~~~~~~~igf~l~L~~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~~--~g~w~Va~i~EN~PA~~fwK~~~ 117 (143) T COG5628 40 WLFRIGGLPVGFALVLDLAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGSA--WGVWQVATVRENTPARAFWKRVA 117 (143) T ss_pred EEEEECCCCEEEEEEECCCCCCCCCCCCCCEEEEEEHHHCCCHHHHHHHHHHHHH--HCEEEEEEECCCCCHHHHHHHCC T ss_conf 5999889414553443256888743236200687604304665399999999876--14589999527880688998620 Q ss_pred CE Q ss_conf 48 Q gi|254781162|r 151 FQ 152 (192) Q Consensus 151 F~ 152 (192) .. T Consensus 118 ~t 119 (143) T COG5628 118 ET 119 (143) T ss_pred CC T ss_conf 22 No 74 >KOG3698 consensus Probab=93.00 E-value=0.7 Score=25.05 Aligned_cols=149 Identities=15% Similarity=0.073 Sum_probs=93.0 Q ss_pred HHHHHCCCCCEEEEEEECCCHHHHCCEEEEEECCHHHHHHHHHHHCC----CCCC--CC--EEEEEECCC----CCCCCE Q ss_conf 24311034322467642150332012034420798999999998538----9872--10--013110388----887607 Q gi|254781162|r 7 ELRKFLSIAFFEFICWRRSRWQKIGAFFLERLEHDSSICAMHADSFG----PGRF--VR--AAVLLREQG----MHDLSL 74 (192) Q Consensus 7 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~R~e~~~di~ai~~~~fg----pgr~--~~--~~~~lR~~~----~~~~~l 74 (192) |.-|.++.|..-=+++.+....-..+++.||+=..+|-+.+..++-. -|+. -+ +.+..+--. .-.+.| T Consensus 653 e~qRllpvd~~ndlf~~~~p~~Pt~~~y~iRPy~~~De~~v~~~ct~my~d~g~~lpf~n~pn~~~d~liggllsls~~l 732 (891) T KOG3698 653 ETQRLLPVDDDNDLFLSNKPFLPTCMFYDIRPYTIADEEYVSGMCTVMYTDNGELLPFRNAPNFADDNLIGGLLSLSEHL 732 (891) T ss_pred HHHHCCCCCCCCCEEECCCCCCCCCEEEEECCCCCCCHHHHHHHHHHEECCCCEECCCCCCCCCCCCCCHHHEECCCHHH T ss_conf 66440345787630322787786310576216766458999756661342586033578887500135422112257556 Q ss_pred EEEEEE-CCEEEEEEEEE----EEE----E-----------------C-C-------------------CCEEEEEE--- Q ss_conf 899998-89899999987----761----1-----------------8-8-------------------22389976--- Q gi|254781162|r 75 SFLCAE-GKRIVGSVRMT----PIS----I-----------------E-K-------------------ITGHLLGP--- 105 (192) Q Consensus 75 slVA~~-~~~IVG~i~~~----pv~----~-----------------~-~-------------------~~~~~Lgp--- 105 (192) |+|+.+ +++|||+..-+ +.. + + + ..++---| T Consensus 733 C~v~~de~~~i~gYa~a~~Dvt~F~rn~~i~w~~~l~EKY~~~i~p~~~g~~~~~~~e~i~~S~h~~~~~~~~~~~P~~~ 812 (891) T KOG3698 733 CEVVDDEGHKIVGYASAHFDVTLFSRNFLITWKEKLKEKYRGLIEPIGSGKLTDEYIEFIQNSQHPMDIEEWYPKIPDQI 812 (891) T ss_pred EEEEECCCCCEEEEEEEECCCCHHHHCEEEEEHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHCCCCCCHHH T ss_conf 36552479966677642001221000315654798899864455666786007999999987058430000131380888 Q ss_pred ---------EEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------HHHHHHCCCEECC Q ss_conf ---------7885111389958999999999996138847998408------6578742948746 Q gi|254781162|r 106 ---------IVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD------IAYYSKLGFQAVP 155 (192) Q Consensus 106 ---------laV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~------~~yY~rfGF~~~~ 155 (192) ...+-+.-..++.+++|+-.+..+++.|....++.-. ..||.++||..+. T Consensus 813 ~~nfPa~v~~~~~~~a~D~~~~k~m~~vll~tL~aNGsrGaf~~V~~dD~~~~~fys~lG~~d~~ 877 (891) T KOG3698 813 FENFPAWVETYFGMDASDAHPMKKMIQVLLVTLAANGSRGAFLTVAIDDIERQKFYSELGLTDLG 877 (891) T ss_pred HHCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHCHHHHH T ss_conf 70370877321353445650789999999999873587511587551378999999871558776 No 75 >PTZ00064 histone acetyltransferase; Provisional Probab=92.80 E-value=0.24 Score=27.79 Aligned_cols=59 Identities=25% Similarity=0.343 Sum_probs=41.5 Q ss_pred CCCEEEEEEE----CCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 7607899998----898999999877611882238997678851113899589999999999961388 Q gi|254781162|r 71 DLSLSFLCAE----GKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGS 134 (192) Q Consensus 71 ~~~lslVA~~----~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~ 134 (192) ++.|-+|..+ |-.+||+ ||+-.. ..+-|+.+-+.|.||++|+|+.||.-+-..++..|. T Consensus 268 epFlFYVl~e~D~~G~HivGY--FSKEK~---s~~NlaCIltLP~yQrkGyGklLI~fSYeLSk~Egk 330 (442) T PTZ00064 268 ELFIFYVMTEVDENGYHITGY--FSKEKH---SHNNVSCILSLPQHQRKGYGKLLTAFSYLLSLKEGK 330 (442) T ss_pred CCEEEEEEEEECCCCCEEEEE--ECHHHC---CCCCEEEEEECCHHHHCCCCHHHHHHHHHHHHCCCC T ss_conf 755799998855998468888--444323---566625889547788515001124446456431576 No 76 >pfam01233 NMT Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. Probab=92.73 E-value=0.77 Score=24.82 Aligned_cols=100 Identities=12% Similarity=0.111 Sum_probs=64.0 Q ss_pred CCHHHHHHHHHHH---C-----CCCCCCCEEEEEE--CCCCC-CCCEE--EEEEECCEEEEEEEEEEEEE--CC--CCEE Q ss_conf 7989999999985---3-----8987210013110--38888-76078--99998898999999877611--88--2238 Q gi|254781162|r 39 EHDSSICAMHADS---F-----GPGRFVRAAVLLR--EQGMH-DLSLS--FLCAEGKRIVGSVRMTPISI--EK--ITGH 101 (192) Q Consensus 39 e~~~di~ai~~~~---f-----gpgr~~~~~~~lR--~~~~~-~~~ls--lVA~~~~~IVG~i~~~pv~~--~~--~~~~ 101 (192) +.+.+.+.++++- | +..||..+...++ +..+. ..+.. .-...++++||+|.-.|+.+ .+ .... T Consensus 32 ~d~~~l~Evy~lL~~nYVEDdd~~FRF~YS~efL~Wal~~Pg~~~~whvGVR~~~s~kLvgfIs~iP~~irv~~~~~~~~ 111 (162) T pfam01233 32 DDEEQLKELYVLLNENYVEDDDAMFRFNYSPEFLNWALKPPGWRPDWHCGVRVKSTGKLVAFISAIPATIRVRDKTVKMV 111 (162) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHCCCCCCCCEEEEEEECCCCEEEEEEECCEEEEEEECCEEEEE T ss_conf 99899999999998615148776598867999998763498988645899998779909999704148999926484578 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9976788511138995899999999999613884799 Q gi|254781162|r 102 LLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIV 138 (192) Q Consensus 102 ~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~ 138 (192) .+.-|+||+.+|.+++.--|+++.-.++..+|....+ T Consensus 112 eINFLCVhkklRskrLAPvLIkEItRRvn~~gIwQAv 148 (162) T pfam01233 112 EINFLCVHKKLRSKRLAPVLIKEITRRVNLTGIWQAL 148 (162) T ss_pred EEEEEEECHHHHHCCCCHHHHHHHHHHHHCCCEEEEE T ss_conf 8999887044553577438999988872006756999 No 77 >pfam09390 DUF1999 Protein of unknown function (DUF1999). This family contains a putative Fe-S binding reductase whose structure adopts an alpha and beta fold. Probab=92.70 E-value=0.39 Score=26.55 Aligned_cols=121 Identities=14% Similarity=0.030 Sum_probs=77.2 Q ss_pred EEEEECCHHHHHHHHHHHCCCCC-----CCCEEEEEECC----------CCCCCCEEEEEE-ECCEEEEEEEEEEEEECC Q ss_conf 34420798999999998538987-----21001311038----------888760789999-889899999987761188 Q gi|254781162|r 34 FLERLEHDSSICAMHADSFGPGR-----FVRAAVLLREQ----------GMHDLSLSFLCA-EGKRIVGSVRMTPISIEK 97 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~fgpgr-----~~~~~~~lR~~----------~~~~~~lslVA~-~~~~IVG~i~~~pv~~~~ 97 (192) |..|+=.+.|.+++.++-.+.-| |.-.-.+-|++ .-.-+..||||+ +++.+.|++.-.++..+. T Consensus 1 MryR~F~e~D~~~lqaldla~qr~~~P~yd~lPere~~grl~~SL~alrfyeRsgHSF~Ae~e~e~~~Gf~lAQavWqGd 80 (161) T pfam09390 1 MRYRPFTEPDLERLQALDLAEQRRTAPEYDVLPEREQAGRLSVSLGALRFYERSGHSFVAEGEGEEPMGFALAQAVWQGD 80 (161) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCCEEEEEEHHHHHCCC T ss_conf 97555675009999987888742479633446114541645554245543431476035446887645564313565488 Q ss_pred CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHH---HHHCCCEECC Q ss_conf 223899767885111389958999999999996138847998408657---8742948746 Q gi|254781162|r 98 ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIAY---YSKLGFQAVP 155 (192) Q Consensus 98 ~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~y---Y~rfGF~~~~ 155 (192) .+..++-.+.. .+-+..+.-+.|+++..+-|-..+.--+-+.=+|+. -..-||...+ T Consensus 81 rptVl~~~ivl-~~~~~e~~~~gLL~AvvKSAYDaaVYEvhlpl~P~L~~Aa~~e~a~l~g 140 (161) T pfam09390 81 RPTVLVRAIVL-DEGRSEDALRGLLRAVVKSAYDAAVYEVHLPLDPELEAAARAEEAALTG 140 (161) T ss_pred CCEEEEEEEEC-CCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHCCCCCCC T ss_conf 73599998742-7898689999999999987521225787425898999999850234365 No 78 >KOG2747 consensus Probab=92.46 E-value=0.43 Score=26.27 Aligned_cols=61 Identities=30% Similarity=0.411 Sum_probs=43.3 Q ss_pred CCCEEEEEEECC--EEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 760789999889--8999999877611882238997678851113899589999999999961388 Q gi|254781162|r 71 DLSLSFLCAEGK--RIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGS 134 (192) Q Consensus 71 ~~~lslVA~~~~--~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~ 134 (192) ++.|-+|..+.+ ..||+ ||+ .....+.+-|+-+-|.|.||++|+|+.||+-.-...|..|. T Consensus 232 dpFlFYVLte~d~~G~VGY--FSK-EK~s~~~yNlaCILtLPpyQRkGYGklLIdFSYeLSr~E~~ 294 (396) T KOG2747 232 DPFLFYVLTECDSYGCVGY--FSK-EKESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRREGK 294 (396) T ss_pred CCEEEEEEEECCCCEEEEE--ECC-CCCCCCCCCEEEEEECCHHHHCCCCHHHHHHHHHHHHCCCC T ss_conf 7638999996288406677--224-55674456447899557466255112189888798733476 No 79 >KOG4135 consensus Probab=92.19 E-value=0.58 Score=25.55 Aligned_cols=94 Identities=17% Similarity=0.231 Sum_probs=54.3 Q ss_pred EECCCCCCCCEEEEEEEC-------CEEEEEEEEEEEEE----CCCCEEEEEE---EEECCCCCCCCHHHHHHHHHHHHH Q ss_conf 103888876078999988-------98999999877611----8822389976---788511138995899999999999 Q gi|254781162|r 64 LREQGMHDLSLSFLCAEG-------KRIVGSVRMTPISI----EKITGHLLGP---IVVHPLYQNKGIGRKLISMSVDAA 129 (192) Q Consensus 64 lR~~~~~~~~lslVA~~~-------~~IVG~i~~~pv~~----~~~~~~~Lgp---laV~P~~rg~GiG~~Li~~~l~~a 129 (192) +|+..+....+.+-|+.+ +..||-+.+.-... +..+.+..|- +.-.|.-||+|+|+.-+...+.++ T Consensus 57 W~~DeDKlTFIVLdaE~~ea~~~ev~~MvGDvNlFlt~~~~~~n~s~~~~~gE~EvMIAEP~~RgKG~G~eav~~ml~y~ 136 (185) T KOG4135 57 WREDEDKLTFIVLDAEMNEAGEDEVDHMVGDVNLFLTTSPDTENPSDDVITGEVEVMIAEPRGRGKGIGTEAVRAMLAYA 136 (185) T ss_pred HCCCCCCEEEEEEECHHCCCCCHHHHHHCCCEEEEEECCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHH T ss_conf 20477614899983020235750576641330368752777688643425455899980555567773188999999999 Q ss_pred HHC-CCCEE-EEE--EC---HHHHHHCCCEECCCC Q ss_conf 613-88479-984--08---657874294874636 Q gi|254781162|r 130 EKK-GSQVI-VLV--GD---IAYYSKLGFQAVPWK 157 (192) Q Consensus 130 ~~~-g~~~v-~l~--g~---~~yY~rfGF~~~~~~ 157 (192) ... +..-- +-+ ++ ..+|.+|+|..+... T Consensus 137 ~s~l~l~Ky~vkig~~nk~sl~lFkk~~f~q~~~n 171 (185) T KOG4135 137 YSVLKLDKYEVKIGMDNKPSLRLFKKFLFTQVFYN 171 (185) T ss_pred HHHHHHHEEEEEECCCCCHHHHHHHHHHHEEEEEE T ss_conf 98740204899725888627899997534045320 No 80 >TIGR01686 FkbH FkbH domain; InterPro: IPR010037 This entry describes a domain of unknown function. One of proteins with this domain is a modular polyketide synthase, which is 4800 amino acids in length from Streptomyces avermilitis, where the domain is the C-terminal segment. By contrast, the FkbH protein from Streptomyces hygroscopicus (Q84G18 from SWISSPROT) apparently contains only this domain. The remaining members of the family all contain an additional N-terminal domain of between 200 and 275 amino acids, which show less than 20 0dentity to one another. It seems likely then that these proteins are involved in disparate functions, probably the biosynthesis of different natural products. For instance, the FkbH gene is found in a gene cluster believed to be responsible for the biosynthesis of unusual 'PKS extender units' in the ascomycin pathway . This domain is composed of two parts, the first of which is a member of subfamily IIIC of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this domain. The C-terminal portion of this domain is unique to this family (by BLAST).. Probab=90.81 E-value=0.65 Score=25.24 Aligned_cols=124 Identities=15% Similarity=0.222 Sum_probs=83.8 Q ss_pred CHHHHCCEEEEEECCHHHHHHHHHHHCCCCCCC-------CEEEE-EECCCCCCCCEEEEEEE---CCEEEEEEEEEEEE Q ss_conf 033201203442079899999999853898721-------00131-10388887607899998---89899999987761 Q gi|254781162|r 26 RWQKIGAFFLERLEHDSSICAMHADSFGPGRFV-------RAAVL-LREQGMHDLSLSFLCAE---GKRIVGSVRMTPIS 94 (192) Q Consensus 26 ~~~~~~~~~~~R~e~~~di~ai~~~~fgpgr~~-------~~~~~-lR~~~~~~~~lslVA~~---~~~IVG~i~~~pv~ 94 (192) ..+.+.+.|.++.-...+|..++.++-.-.-|. ..... .+.......-+++-+.+ +.-+||.+.+.+ T Consensus 190 yL~~Lel~l~~~~~~~~~v~Rv~~L~~rTnQf~~~~~~~~~~dl~~~~~~~s~~~v~~~~~~DrfgD~G~vG~~~~~~-- 267 (337) T TIGR01686 190 YLKKLELSLDISKLKEEEVQRVEELVRRTNQFNLTAERLDEEDLKQTMEEESAEEVLTVAMSDRFGDSGLVGILLLEK-- 267 (337) T ss_pred HHHHHCCEEEHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEEEEEEEECCCCCCCCEEEEEEEC-- T ss_conf 874010054210136213604888731133231210136777899984489853899998641458876089999852-- Q ss_pred ECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHH---HCCCCEEEEEEC--------HHHHHHCCCEE Q ss_conf 188223899767885111389958999999999996---138847998408--------65787429487 Q gi|254781162|r 95 IEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAE---KKGSQVIVLVGD--------IAYYSKLGFQA 153 (192) Q Consensus 95 ~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~---~~g~~~v~l~g~--------~~yY~rfGF~~ 153 (192) +...+.+--+++.=--=++|++..+++.+.++|. ..|...+.+.-. ..+|+..||.- T Consensus 268 --~e~~l~i~~l~~SCR~l~r~~e~~~L~~l~d~A~hff~~g~~~~~~~y~~T~rN~~~~~~~~~~g~~~ 335 (337) T TIGR01686 268 --KEGQLVIKDLVMSCRVLGRGVEERLLRWLVDQAKHFFELGAHKVRLIYRRTERNVPVESFYEELGFED 335 (337) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf --87515287676640141011679999999998899998413540310068987768999999853123 No 81 >KOG2779 consensus Probab=90.31 E-value=0.54 Score=25.71 Aligned_cols=99 Identities=13% Similarity=0.096 Sum_probs=63.9 Q ss_pred EECCHHHHHHHHHHH---C-----CCCCCCCEEEEEECCCCC---CCCEEEEE--EECCEEEEEEEEEEEEEC--C--CC Q ss_conf 207989999999985---3-----898721001311038888---76078999--988989999998776118--8--22 Q gi|254781162|r 37 RLEHDSSICAMHADS---F-----GPGRFVRAAVLLREQGMH---DLSLSFLC--AEGKRIVGSVRMTPISIE--K--IT 99 (192) Q Consensus 37 R~e~~~di~ai~~~~---f-----gpgr~~~~~~~lR~~~~~---~~~lslVA--~~~~~IVG~i~~~pv~~~--~--~~ 99 (192) -.++..|++.+.++- + .--||..+.+.++=...+ ..+..... ....++||.|.--|..+. + .. T Consensus 87 dv~~~~~l~el~~lL~enyVEd~~~m~rf~Ys~eFl~Wal~~pg~~~~WHiGVRv~~s~kLVaFIsaiP~~irvrdk~vk 166 (421) T KOG2779 87 DVSDFKDLEELYNLLNENYVEDDDSMFRFDYSPEFLKWALQPPGWKKEWHIGVRVKSSKKLVAFISAIPATIRVRDKVVK 166 (421) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCHHHHCCHHHHHHHHCCCCCCCCEEEEEEEECCCCEEEEEECCCCEEEECCCEEE T ss_conf 77507679999865214777766321122045898876404999762137999970377258998525317887142655 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 389976788511138995899999999999613884 Q gi|254781162|r 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQ 135 (192) Q Consensus 100 ~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~ 135 (192) .+.+.-|+||...|++++.--|+++.-.++...|.. T Consensus 167 ~veINFLCVHKkLRSKRlaPvLIrEITRRvnl~gIf 202 (421) T KOG2779 167 MVEINFLCVHKKLRSKRLAPVLIREITRRVNLEGIF 202 (421) T ss_pred EEEEEEEEEEHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 645788997333301665307788887774044266 No 82 >COG3882 FkbH Predicted enzyme involved in methoxymalonyl-ACP biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=89.94 E-value=0.78 Score=24.77 Aligned_cols=124 Identities=13% Similarity=0.150 Sum_probs=71.7 Q ss_pred HHHCCEEEEEECCHHHHHHHHHHH-----CC--CCCCCCEEEEEECCCCCCCCEEEEEEE---CCEEEEEEEEEEEEECC Q ss_conf 320120344207989999999985-----38--987210013110388887607899998---89899999987761188 Q gi|254781162|r 28 QKIGAFFLERLEHDSSICAMHADS-----FG--PGRFVRAAVLLREQGMHDLSLSFLCAE---GKRIVGSVRMTPISIEK 97 (192) Q Consensus 28 ~~~~~~~~~R~e~~~di~ai~~~~-----fg--pgr~~~~~~~lR~~~~~~~~lslVA~~---~~~IVG~i~~~pv~~~~ 97 (192) +.+.+-+..+.-+..+|..|-.++ |. .-|+++...+.--.......+|+-..+ ++-|||.+-+..- . T Consensus 408 K~Lem~l~vs~~de~~i~RIsQLtqkTNQFnlTtkRy~e~dV~~~~~~~~~li~sv~l~DKfgDnGiigvviv~kk---~ 484 (574) T COG3882 408 KNLEMRLTVSKFDEVNIPRISQLTQKTNQFNLTTKRYNEEDVRQMQEDPNFLIFSVSLKDKFGDNGIIGVVIVEKK---E 484 (574) T ss_pred HHHEEEEEEEECCCCCCHHHHHHHHHCCCEEECHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCEEEEEEEEEC---C T ss_conf 7233799984145558488999862043005225441488899873187717999986123467855899999931---8 Q ss_pred CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC--------HHHHHHCCCEECC Q ss_conf 223899767885111389958999999999996138847998408--------6578742948746 Q gi|254781162|r 98 ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD--------IAYYSKLGFQAVP 155 (192) Q Consensus 98 ~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~--------~~yY~rfGF~~~~ 155 (192) ..+ .+--+...=-.-|+++-.+||..+.+.|...|...+...=. .+||++.||+-.. T Consensus 485 ~~w-~IDt~lmSCRVlgRkvE~~l~~~~~e~A~~~gi~tir~~Y~pt~kN~pv~~FyE~mgf~l~~ 549 (574) T COG3882 485 SEW-FIDTFLMSCRVLGRKVEQRLMNSLEEQALSEGINTIRGYYIPTEKNAPVSDFYERMGFKLKG 549 (574) T ss_pred CEE-EHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHCHHHCC T ss_conf 727-76788998999866789999999999998646322235864643578389999873500013 No 83 >COG3138 AstA Arginine/ornithine N-succinyltransferase beta subunit [Amino acid transport and metabolism] Probab=88.97 E-value=1.1 Score=23.86 Aligned_cols=90 Identities=16% Similarity=0.198 Sum_probs=57.7 Q ss_pred EEEEECCHHHHHHHHHHHCCC--C------CC---------CCEEEEEECCCCCCCCEEEEEEE--CCEEEEEEEEE--- Q ss_conf 344207989999999985389--8------72---------10013110388887607899998--89899999987--- Q gi|254781162|r 34 FLERLEHDSSICAMHADSFGP--G------RF---------VRAAVLLREQGMHDLSLSFLCAE--GKRIVGSVRMT--- 91 (192) Q Consensus 34 ~~~R~e~~~di~ai~~~~fgp--g------r~---------~~~~~~lR~~~~~~~~lslVA~~--~~~IVG~i~~~--- 91 (192) +.+|+-..+|++++..++-.. | .. ++.++.- +-.++...+-||.|+ .|+++|....- T Consensus 2 lvvRP~~~aDl~al~~LA~~sg~G~TsLP~de~~L~~Ri~~se~sf~~-~~~~ge~~Y~fVLEDsetG~VvG~saI~a~v 80 (336) T COG3138 2 LVVRPVERADLEALMELAVKTGVGLTSLPADEATLRARIERSEKSFQG-ELPPGEAGYLFVLEDSETGTVVGISAIEAAV 80 (336) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCCCCCCEEEEEEECCCCEEEEEEEEEEEC T ss_conf 613223203789999999865998535877789999999988887735-5678776289999845777287677788740 Q ss_pred -------EEEE-------------CCCC----------EEEEEEEEECCCCCCCCHHHHHHHH Q ss_conf -------7611-------------8822----------3899767885111389958999999 Q gi|254781162|r 92 -------PISI-------------EKIT----------GHLLGPIVVHPLYQNKGIGRKLISM 124 (192) Q Consensus 92 -------pv~~-------------~~~~----------~~~LgplaV~P~~rg~GiG~~Li~~ 124 (192) ..++ +..+ .-.++.+.++|+||.-+.|+.|-+. T Consensus 81 Gl~~PfYsyRv~tlvhaS~~L~v~~~i~~L~L~Nd~TG~SEl~sLFl~pd~Rkg~nG~Llsr~ 143 (336) T COG3138 81 GLNDPFYSYRVGTLVHASPELNVYNEIPTLFLSNDLTGNSELCTLFLDPDWRKGGNGRLLSKS 143 (336) T ss_pred CCCCCCCEEEEEEEEECCCCCCCCCCCEEEEEECCCCCCHHHHHEEECHHHHCCCCHHHHHHH T ss_conf 357765111453335327433641021179985266684245411506767335450024430 No 84 >TIGR00124 cit_ly_ligase [citrate (pro-3S)-lyase] ligase; InterPro: IPR005216 [Citrate (pro-3S)-lyase] ligase (6.2.1.22 from EC), also known as citrate lyase ligase, is responsible for acetylation of the prosthetic group (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) of the gamma subunit of citrate lyase converting the inactive thiol form of the enzyme to the active form. ; GO: 0008771 [citrate (pro-3S)-lyase] ligase activity. Probab=87.71 E-value=2.2 Score=22.20 Aligned_cols=80 Identities=15% Similarity=0.280 Sum_probs=64.5 Q ss_pred CCCCCEEEEEEEC-CEEEEEEEEEEEEECCCCEEEEEEEEECCCCC-CCC-HHHHHHHHHHHHHHHCCCCEEEEEECHH- Q ss_conf 8876078999988-98999999877611882238997678851113-899-5899999999999613884799840865- Q gi|254781162|r 69 MHDLSLSFLCAEG-KRIVGSVRMTPISIEKITGHLLGPIVVHPLYQ-NKG-IGRKLISMSVDAAEKKGSQVIVLVGDIA- 144 (192) Q Consensus 69 ~~~~~lslVA~~~-~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~r-g~G-iG~~Li~~~l~~a~~~g~~~v~l~g~~~- 144 (192) ....++..++..+ .+++|+-.+ .+-.+-.+++.+..+ |.| +...|+...+..+...+..-.++...|. T Consensus 28 ~~~~~~~~~~~~~~~~~~~~g~~--------~g~~~~~~~~~~~~~~g~g~~~~~l~~~l~~~~~~~~~~~~~~~~~p~~ 99 (342) T TIGR00124 28 DAPLEVFIVVYEDNEELVGCGGI--------AGNVIKCVAIDESLRSGEGTLALQLLTELLNLAYELGRPHLFLFTKPEY 99 (342) T ss_pred CCCCEEEEEEEECCCEEEEECCC--------CCCCEEEEEECCHHCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECHHH T ss_conf 55412688875057504650320--------0000001230300103421578999999998877515630356512256 Q ss_pred --HHHHCCCEECCC Q ss_conf --787429487463 Q gi|254781162|r 145 --YYSKLGFQAVPW 156 (192) Q Consensus 145 --yY~rfGF~~~~~ 156 (192) .|...||..... T Consensus 100 ~~~~~~~g~~~~~~ 113 (342) T TIGR00124 100 AALFEGCGFWTLAE 113 (342) T ss_pred HHHHHHCCCCEECC T ss_conf 77764326400103 No 85 >pfam06852 DUF1248 Protein of unknown function (DUF1248). This family represents a conserved region within a number of proteins of unknown function that seem to be specific to C. elegans. Note that some family members contain more than one copy of this region. Probab=87.58 E-value=2.2 Score=22.15 Aligned_cols=90 Identities=17% Similarity=0.332 Sum_probs=58.2 Q ss_pred EECCCCCCCCEEEEEEEC-CEEEE---EEEEEEEEEC-CCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEE- Q ss_conf 103888876078999988-98999---9998776118-82238997678851113899589999999999961388479- Q gi|254781162|r 64 LREQGMHDLSLSFLCAEG-KRIVG---SVRMTPISIE-KITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVI- 137 (192) Q Consensus 64 lR~~~~~~~~lslVA~~~-~~IVG---~i~~~pv~~~-~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v- 137 (192) +++.-..+-.|-|++..| ++||. +|-|.|+... +.+...+|-.-++|+|||+.+....-..+.+..+..+-..+ T Consensus 37 wk~sf~d~Y~l~l~clKgT~rvi~~~h~i~f~pL~~~~d~~~~f~G~~Wi~PdyR~~~~~~l~~~~~~~~~~~~~~N~~~ 116 (180) T pfam06852 37 WKDSFPDDYRLKLTCLKGTNRVIATFHTITFRPLPSSKDKPFVFLGFGWIDPDYRGKDIMKLTDDIAKEEERRKSDNAVA 116 (180) T ss_pred HHHHCCCCCEEEEEEECCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99856756079999976998078874079853677789986289860346855464168998999998763375555313 Q ss_pred -EEEECHHHHHH-CCCEE Q ss_conf -98408657874-29487 Q gi|254781162|r 138 -VLVGDIAYYSK-LGFQA 153 (192) Q Consensus 138 -~l~g~~~yY~r-fGF~~ 153 (192) ...-...||.+ .|... T Consensus 117 ~~~~~~~~fW~k~~G~~d 134 (180) T pfam06852 117 QNNPPSMNFWKKLTGHSD 134 (180) T ss_pred CCCHHHHHHHHHHHCCCC T ss_conf 268489999999509876 No 86 >COG5653 Protein involved in cellulose biosynthesis (CelD) [Cell envelope biogenesis, outer membrane] Probab=86.35 E-value=2.6 Score=21.74 Aligned_cols=84 Identities=19% Similarity=0.103 Sum_probs=60.9 Q ss_pred EEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEE-EEEECHHHHHHCCCEE Q ss_conf 899998898999999877611882238997678851113899589999999999961388479-9840865787429487 Q gi|254781162|r 75 SFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVI-VLVGDIAYYSKLGFQA 153 (192) Q Consensus 75 slVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v-~l~g~~~yY~rfGF~~ 153 (192) -+..+.+|+||+.--.+. +...+..--.+.+|++-.--=|..|+-..|+++..+|+... +.+|+..|=..+|=+. T Consensus 274 l~gL~~G~~lvAV~~~lr----~~~t~h~~l~a~dpe~~~~SPG~~lf~d~i~~~~~~g~~~~DfgvG~q~YKR~~~~~~ 349 (406) T COG5653 274 LFGLHAGGRLVAVHGLLR----QGGTYHAWLGAIDPEFARASPGMLLFLDLIEWACGQGLARFDFGVGDQSYKRHWGDQS 349 (406) T ss_pred EEEEEECCEEEEEEEEEC----CCCEEEEEEECCCHHHHHCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHH T ss_conf 998742777899886312----5877999750348778636954789999999986489717863589747877640378 Q ss_pred CCCCCEECC Q ss_conf 463653668 Q gi|254781162|r 154 VPWKSLILP 162 (192) Q Consensus 154 ~~~~~l~~p 162 (192) .-..++..| T Consensus 350 tvl~~~~~~ 358 (406) T COG5653 350 TVLFGLLGA 358 (406) T ss_pred HHHHHHHCC T ss_conf 899876415 No 87 >COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics] Probab=85.43 E-value=0.51 Score=25.84 Aligned_cols=58 Identities=29% Similarity=0.374 Sum_probs=37.2 Q ss_pred CCCEEEEEEECC----EEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 760789999889----8999999877611882238997678851113899589999999999961 Q gi|254781162|r 71 DLSLSFLCAEGK----RIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEK 131 (192) Q Consensus 71 ~~~lslVA~~~~----~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~ 131 (192) ++-+-+|..+.| .+||+ ||+-. .+...+-|+.+-+.|.||++|+|..||+-.-...+. T Consensus 232 DpflFYvl~~~~~~~~h~vGy--FSKEK-~S~~~yNLaCILtLP~yQRrGYG~lLIdFSY~Ls~~ 293 (395) T COG5027 232 DPFLFYVLTERGDTGCHLVGY--FSKEK-ESEQDYNLACILTLPPYQRRGYGKLLIDFSYLLSQK 293 (395) T ss_pred CCEEEEEEEECCCCCEEEEEE--ECHHH-CCCCCCCEEEEEECCHHHHCCCCEEEEEEEEECCCC T ss_conf 655999999738754035666--33211-354567557999657567066445755542201014 No 88 >pfam05301 Mec-17 Touch receptor neuron protein Mec-17. Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor. This family is conserved to higher eukaryotes. Probab=83.48 E-value=1.4 Score=23.34 Aligned_cols=51 Identities=18% Similarity=0.157 Sum_probs=32.4 Q ss_pred CEEEEEEEEEEEE----E-CC-----CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 9899999987761----1-88-----22389976788511138995899999999999613 Q gi|254781162|r 82 KRIVGSVRMTPIS----I-EK-----ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKK 132 (192) Q Consensus 82 ~~IVG~i~~~pv~----~-~~-----~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~ 132 (192) +.|+|-.....=. . .+ .+++-+=..+|+++.|++|+|.+|.++.+..-.-. T Consensus 18 g~v~G~LKvG~K~Lfl~d~~g~~~e~~~~~CvLDFYVhes~QR~G~Gk~Lf~~ML~~e~~~ 78 (120) T pfam05301 18 SALKGLLKVGYKKLFLFDNEGNTYEVEQTPCILDFYVHESRQRSGNGKKLFDEMLKKENVE 78 (120) T ss_pred CEEEEEEEEEEEEEEEECCCCCEEEECCCCEEEEEEECHHHHCCCCHHHHHHHHHHHCCCC T ss_conf 1699999874035889989999888516758999998024624575799999999871998 No 89 >KOG2036 consensus Probab=82.42 E-value=1.4 Score=23.25 Aligned_cols=77 Identities=23% Similarity=0.345 Sum_probs=46.1 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEE----------------------------------EE-------- Q ss_conf 997678851113899589999999999961388479----------------------------------98-------- Q gi|254781162|r 102 LLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVI----------------------------------VL-------- 139 (192) Q Consensus 102 ~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v----------------------------------~l-------- 139 (192) -+-.+||||+||+.|+|++-++-..+.-..+....- .| T Consensus 616 RIVRIAvhP~y~~MGYGsrAvqLL~~y~eG~~~~i~e~~~~~~~~~k~v~e~~~vsllee~i~pR~~lppLL~~L~er~p 695 (1011) T KOG2036 616 RIVRIAVHPEYQKMGYGSRAVQLLTDYFEGKFTSISEDVLAVDHSIKRVEEAEKVSLLEEQIKPRKDLPPLLLKLSERPP 695 (1011) T ss_pred EEEEEEECCCHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEECCCCCC T ss_conf 17999856103113740889999999971558774544345675442034155445666533654579861557023897 Q ss_pred ---------EEC----HHHHHHCCCEECCCCCEECCCCCCHHHEEEEECCCC Q ss_conf ---------408----657874294874636536689999566698435778 Q gi|254781162|r 140 ---------VGD----IAYYSKLGFQAVPWKSLILPAPVDPNRVLFLPLVQN 178 (192) Q Consensus 140 ---------~g~----~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L~~~ 178 (192) +|- ..|+.|-||.++--.+....-.-...-+|...|+++ T Consensus 696 erldylGvSfGLT~~L~kFWk~~gF~PvylrQt~n~lTGEHtcimLk~L~~~ 747 (1011) T KOG2036 696 ERLDYLGVSFGLTPSLLKFWKKNGFVPVYLRQTSNDLTGEHTCIMLKTLEGD 747 (1011) T ss_pred CCCCEEEECCCCCHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEECCCCC T ss_conf 6553231033678899999986696047763065402464458998258876 No 90 >KOG2696 consensus Probab=82.22 E-value=2.2 Score=22.14 Aligned_cols=41 Identities=20% Similarity=0.353 Sum_probs=29.5 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECH Q ss_conf 97678851113899589999999999961388479984086 Q gi|254781162|r 103 LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDI 143 (192) Q Consensus 103 LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~ 143 (192) +..+-+.|.||++|+|+.|++.....-...--..-+.+-+| T Consensus 220 iSQmlilpPfq~~Glgs~l~E~i~r~~~~~p~v~DiTVEdP 260 (403) T KOG2696 220 ISQMLILPPFQGKGLGSQLYEAIARDYLEEPTVLDITVEDP 260 (403) T ss_pred HHEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 41257736556775478999999886405983258873382 No 91 >TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members Probab=81.57 E-value=4.1 Score=20.53 Aligned_cols=90 Identities=12% Similarity=0.043 Sum_probs=64.0 Q ss_pred EEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE--- Q ss_conf 11038888760789999889899999987761188223899767885111389958999999999996138847998--- Q gi|254781162|r 63 LLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL--- 139 (192) Q Consensus 63 ~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l--- 139 (192) .+.+.......+..|+..++.++|.+.+... +...+-. =-+..++++.-.---.|.-++|..|.++|+...-. T Consensus 186 ~l~~~f~~~~~il~v~~~~g~~~ag~l~~~~---~~~v~~~-y~as~~~~r~~~~n~llyW~~i~~A~e~G~~~fDFGrS 261 (330) T TIGR03019 186 LLKDVFGEDCEVLTVRLGDGVVASAVLSFYF---RDEVLPY-YAGGLREARDVAANDLMYWELMRRACERGLRVFDFGRS 261 (330) T ss_pred HHHHHCCCCCEEEEEEECCCCEEEEEEEEEC---CCEEEEE-EECCCHHHHHCCCHHHHHHHHHHHHHHCCCCEEECCCC T ss_conf 9998616673799999279968999999952---9889996-53451988745814889999999999879968837999 Q ss_pred ---EECHHHHHHCCCEECCC Q ss_conf ---40865787429487463 Q gi|254781162|r 140 ---VGDIAYYSKLGFQAVPW 156 (192) Q Consensus 140 ---~g~~~yY~rfGF~~~~~ 156 (192) .|.-.|=+++||++.+- T Consensus 262 ~~~~G~~~FKk~WG~~p~pl 281 (330) T TIGR03019 262 KRGTGPFKFKKNWGFEPQPL 281 (330) T ss_pred CCCCCCHHHHHCCCCEEEEE T ss_conf 99996867785379833201 No 92 >PRK01305 arginyl-tRNA-protein transferase; Provisional Probab=78.25 E-value=5.3 Score=19.89 Aligned_cols=68 Identities=13% Similarity=0.100 Sum_probs=49.4 Q ss_pred CCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE--ECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 388887607899998898999999877611882238997678--85111389958999999999996138847998 Q gi|254781162|r 66 EQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIV--VHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL 139 (192) Q Consensus 66 ~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~Lgpla--V~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l 139 (192) ....+..+..+-...+|++||....-. ..++ |..|+ =+|++..+++|+-.+=..|+.|++.|...+.| T Consensus 137 l~~s~~~t~~~e~~~~~~Li~v~v~D~-l~dg-----lSAVYtFyDP~~~~~SLGt~~IL~~I~~a~~~~l~y~YL 206 (241) T PRK01305 137 LEDSWVNTRFIEFRLDGRLVAVAVTDV-LDDG-----LSAVYTFYDPDEEHRSLGTFAILWQIEQAKRLGLPYVYL 206 (241) T ss_pred HHCCCCCCEEEEEEECCCEEEEEEEEC-CCCC-----CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 827898768999980894799999835-8864-----323601207565668975999999999999859986952 No 93 >COG5092 NMT1 N-myristoyl transferase [Lipid metabolism] Probab=77.88 E-value=5.5 Score=19.82 Aligned_cols=80 Identities=14% Similarity=0.075 Sum_probs=54.8 Q ss_pred CCCCEEEEEECC---CCCCCCEEEEEEECC--EEEEEEEEEEEEE--CC--CCEEEEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 721001311038---888760789999889--8999999877611--88--22389976788511138995899999999 Q gi|254781162|r 56 RFVRAAVLLREQ---GMHDLSLSFLCAEGK--RIVGSVRMTPISI--EK--ITGHLLGPIVVHPLYQNKGIGRKLISMSV 126 (192) Q Consensus 56 r~~~~~~~lR~~---~~~~~~lslVA~~~~--~IVG~i~~~pv~~--~~--~~~~~LgplaV~P~~rg~GiG~~Li~~~l 126 (192) |++.+.+.++=. .......+.+....+ ++||.|.--|..+ .+ ...+.+.-++|+.+.|++.+.-.|+++.- T Consensus 112 rf~Y~~EFl~Wal~~pg~kK~whigvRvk~t~klVaFIsa~p~~v~vRgK~~~~~evNFLCihk~lRsKRltPvLIkEiT 191 (451) T COG5092 112 RFRYSVEFLQWALDGPGGKKRWHIGVRVKGTQKLVAFISAKPHLVSVRGKRSSVLEVNFLCIHKELRSKRLTPVLIKEIT 191 (451) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCEEEEEEECCEEEEEECCCCCCCCEEEEEEEEHHHHHCCCCHHHHHHHH T ss_conf 79999999988633899742568999970453268897425058997263311003789997455530756208999998 Q ss_pred HHHHHCCCC Q ss_conf 999613884 Q gi|254781162|r 127 DAAEKKGSQ 135 (192) Q Consensus 127 ~~a~~~g~~ 135 (192) .++...|.. T Consensus 192 RR~n~~~iw 200 (451) T COG5092 192 RRANVDGIW 200 (451) T ss_pred HHHHHHHHH T ss_conf 763144267 No 94 >pfam04377 ATE_C Arginine-tRNA-protein transferase, C terminus. This family represents the C terminal region of the enzyme arginine-tRNA-protein transferase (EC 2.3.2.8), which catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N-end rule pathway of protein degradation by conjugating a destabilising amino acid to the amino terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N terminal cysteine is sometimes modified. Probab=75.82 E-value=6.3 Score=19.48 Aligned_cols=65 Identities=11% Similarity=0.144 Sum_probs=46.4 Q ss_pred CCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEE--ECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 887607899998898999999877611882238997678--85111389958999999999996138847998 Q gi|254781162|r 69 MHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIV--VHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL 139 (192) Q Consensus 69 ~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~Lgpla--V~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l 139 (192) .+..+.-+-...+|++||...+-.. .++- ..|+ =+|++..+++|+-.+=.-|+.|+++|...+.| T Consensus 35 s~~~t~~~eyr~~~~Liav~v~D~l-~dgl-----SaVY~ffdP~~~~~SlG~~~iL~~I~~ak~~~~~y~YL 101 (128) T pfam04377 35 SPLGTRFLEYRLDGKLIAVAVTDIL-PDGL-----SAVYTFYDPDYSKRSLGTYSILWQIELAKELGLPYVYL 101 (128) T ss_pred CCCCCEEEEEEECCEEEEEEEEECC-CCCC-----HHEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEC T ss_conf 8988789999828968999998427-7420-----10456537220003631899999999999829973864 No 95 >COG4671 Predicted glycosyl transferase [General function prediction only] Probab=56.21 E-value=15 Score=17.34 Aligned_cols=29 Identities=24% Similarity=0.384 Sum_probs=15.2 Q ss_pred CCCCHHHHHHHHHHHHHHH-CCCC--EEEEEE Q ss_conf 3899589999999999961-3884--799840 Q gi|254781162|r 113 QNKGIGRKLISMSVDAAEK-KGSQ--VIVLVG 141 (192) Q Consensus 113 rg~GiG~~Li~~~l~~a~~-~g~~--~v~l~g 141 (192) =|-+.|..|+..+++++.. .+.. .+++.| T Consensus 227 GGG~dG~eLi~~~l~A~~~l~~l~~~~~ivtG 258 (400) T COG4671 227 GGGADGAELIETALAAAQLLAGLNHKWLIVTG 258 (400) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 88720599999999875507788743389848 No 96 >KOG4601 consensus Probab=55.43 E-value=13 Score=17.59 Aligned_cols=49 Identities=16% Similarity=0.165 Sum_probs=32.9 Q ss_pred CEEEEEEEEEE--E--EECC-----CCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHH Q ss_conf 98999999877--6--1188-----223899767885111389958999999999996 Q gi|254781162|r 82 KRIVGSVRMTP--I--SIEK-----ITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAE 130 (192) Q Consensus 82 ~~IVG~i~~~p--v--~~~~-----~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~ 130 (192) +.|.|.+...- . ..++ ...+-+-..+|+++.|+.|.|.+|.++.++.-. T Consensus 81 s~l~GllKVG~KkLfl~D~~~~~ye~e~lcILDFyVheS~QR~G~G~~lfdyMl~kE~ 138 (264) T KOG4601 81 SILKGLLKVGYKKLFLTDNEQNQYEEEALCILDFYVHESEQRSGNGFKLFDYMLKKEN 138 (264) T ss_pred HHEEEEEHCCCEEEEEECCHHHHHCCCCCEEEEEEEEHHHHHCCCHHHHHHHHHHHCC T ss_conf 3031100004215888530766630477447888861466514745899999998528 No 97 >pfam02388 FemAB FemAB family. The femAB operon codes for two nearly identical approximately 50-kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan. These proteins are also considered as a factor influencing the level of methicillin resistance. Probab=54.18 E-value=17 Score=16.94 Aligned_cols=71 Identities=20% Similarity=0.245 Sum_probs=46.1 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC--------------------HHHHHHCCCEECCCCCEECCC Q ss_conf 767885111389958999999999996138847998408--------------------657874294874636536689 Q gi|254781162|r 104 GPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD--------------------IAYYSKLGFQAVPWKSLILPA 163 (192) Q Consensus 104 gplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~--------------------~~yY~rfGF~~~~~~~l~~p~ 163 (192) ||+. +|...-+-..+++.+.+.|++.++-.+-+--. .+.+.++||+... ....+.. T Consensus 70 GP~~---d~~d~~l~~~~l~~Lk~~akk~~a~~lridP~~~~~~~d~~~~~~~~~~~~~~~~l~~~G~~~~g-~~~~~~~ 145 (407) T pfam02388 70 GPVM---DYSNKELVEFFLKELKKYAKKKRALFLKIDPYIPYQLRDLDGEPISAENDALIDKLESLGYKHQG-FTTGFDD 145 (407) T ss_pred CCCC---CCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHCCCCCCCCCCCCHHHHHHHHHCCCEECC-CCCCCCC T ss_conf 9856---88898999999999999999869289997676454430355664675518899999975976037-7678787 Q ss_pred CCCHHHEEEEECCCC Q ss_conf 999566698435778 Q gi|254781162|r 164 PVDPNRVLFLPLVQN 178 (192) Q Consensus 164 pv~~~~~l~l~L~~~ 178 (192) .+.|...+++.|++. T Consensus 146 ~~qpr~~~vldL~~~ 160 (407) T pfam02388 146 TIQPRWQAVLDLEDK 160 (407) T ss_pred CCCCCEEEEEECCCC T ss_conf 667455999878999 No 98 >PRK12568 glycogen branching enzyme; Provisional Probab=52.43 E-value=16 Score=17.04 Aligned_cols=48 Identities=13% Similarity=0.108 Sum_probs=37.1 Q ss_pred HHHHHHHHHHHHHCCCCEEEEE--ECHHHHHHCCCEECCCCCEECCCCCC Q ss_conf 9999999999961388479984--08657874294874636536689999 Q gi|254781162|r 119 RKLISMSVDAAEKKGSQVIVLV--GDIAYYSKLGFQAVPWKSLILPAPVD 166 (192) Q Consensus 119 ~~Li~~~l~~a~~~g~~~v~l~--g~~~yY~rfGF~~~~~~~l~~p~pv~ 166 (192) +.|.+..+..+++.|+..|-|. .+.+||..+||++...+..+.-...| T Consensus 269 ~ela~~Lipyvk~mGyThIELMPi~EhP~d~SWGYQvtg~fAptSRyGtP 318 (730) T PRK12568 269 PTLAEQLIPYVQQLGFTHIELLPITEHPFGGSWGYQPLGLYAPTARHGSP 318 (730) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCCCCCH T ss_conf 99999999999984987799734445688776565565564154678898 No 99 >TIGR02482 PFKA_ATP 6-phosphofructokinase; InterPro: IPR012828 6-phosphofructokinase (2.7.1.11 from EC) catalyses the addition of phosphate from ATP to fructose 6-phosphate to give fructose 1,6-bisphosphate. This represents a key control step in glycolysis. This entry contains bacterial ATP-dependent 6-phosphofructokinases, which lack a beta-hairpin loop present in IPR012829 from INTERPRO family members. IPR012829 from INTERPRO contains members that are ATP-dependent as well as members that are pyrophosphate-dependent. IPR011183 from INTERPRO represents the pyrophosphate-dependent phosphofructokinase, diphosphate--fructose-6-phosphate 1-phosphotransferase (2.7.1.90 from EC).; GO: 0003872 6-phosphofructokinase activity, 0005524 ATP binding, 0006002 fructose 6-phosphate metabolic process, 0006096 glycolysis. Probab=52.19 E-value=18 Score=16.77 Aligned_cols=39 Identities=26% Similarity=0.533 Sum_probs=30.2 Q ss_pred HHHHHHHHHCCCCEEEEEE-------CHHHHHHCCCEECCCCCEECCCCCC Q ss_conf 9999999613884799840-------8657874294874636536689999 Q gi|254781162|r 123 SMSVDAAEKKGSQVIVLVG-------DIAYYSKLGFQAVPWKSLILPAPVD 166 (192) Q Consensus 123 ~~~l~~a~~~g~~~v~l~g-------~~~yY~rfGF~~~~~~~l~~p~pv~ 166 (192) +.|.+.+|+.|.+.++++| +..+|+.+|++.+. +|+.+| T Consensus 81 ~kA~~nLK~~GI~~LVViGGDGSy~GA~~L~~~gg~~~iG-----lPGTID 126 (302) T TIGR02482 81 EKAVENLKKLGIEALVVIGGDGSYTGAQKLYEEGGIPVIG-----LPGTID 126 (302) T ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCEEE-----ECCCCC T ss_conf 9999998874886689986844068899999717984787-----458502 No 100 >TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative; InterPro: IPR004560 This family shows similarity to other isomerases. Putative identification as hexulose-6-phosphate isomerase has been reported. This family is conserved at better than 40 0dentity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.; GO: 0016861 intramolecular oxidoreductase activity interconverting aldoses and ketoses, 0005975 carbohydrate metabolic process. Probab=51.73 E-value=15 Score=17.32 Aligned_cols=37 Identities=22% Similarity=0.385 Sum_probs=31.9 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHCCCEE Q ss_conf 5899999999999613884799840865787429487 Q gi|254781162|r 117 IGRKLISMSVDAAEKKGSQVIVLVGDIAYYSKLGFQA 153 (192) Q Consensus 117 iG~~Li~~~l~~a~~~g~~~v~l~g~~~yY~rfGF~~ 153 (192) -|...|++|+..|+..|...|-|-|..=||+..-=+. T Consensus 91 ~~~eIm~KAi~La~~lGIR~IQLAgYDVYYe~~d~eT 127 (290) T TIGR00542 91 QGLEIMEKAIQLARDLGIRIIQLAGYDVYYEEHDEET 127 (290) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHH T ss_conf 9999999999999757960662233300335576045 No 101 >PRK05402 glycogen branching enzyme; Provisional Probab=48.83 E-value=21 Score=16.44 Aligned_cols=46 Identities=13% Similarity=0.176 Sum_probs=36.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEE--ECHHHHHHCCCEECCCCCEECCC Q ss_conf 89999999999961388479984--08657874294874636536689 Q gi|254781162|r 118 GRKLISMSVDAAEKKGSQVIVLV--GDIAYYSKLGFQAVPWKSLILPA 163 (192) Q Consensus 118 G~~Li~~~l~~a~~~g~~~v~l~--g~~~yY~rfGF~~~~~~~l~~p~ 163 (192) =+.|.+..+..+++.|+..|-|. ...+||..+||++...+..+.-. T Consensus 269 Y~ela~~Lipyvk~mGythIElMPv~EhP~d~SWGYQ~tgyfAptSRy 316 (730) T PRK05402 269 YRELADQLIPYVKEMGFTHVELLPVAEHPFDGSWGYQPTGYYAPTSRF 316 (730) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 999999999999973987799644304788887666655401563246 No 102 >cd03173 DUF619-like DUF619-like: This CD includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain seen in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. This CD also includes the DUF619 domain of the FABP N-acetylglutamate kinase (NAGK). The nuclear-encoded mitochondrial polyprotein precursor (ARG5,6) consists of an N-terminal NAGK (ArgB) domain, a central DUF619 domain, and a C-ter Probab=47.52 E-value=22 Score=16.31 Aligned_cols=49 Identities=14% Similarity=0.086 Sum_probs=33.1 Q ss_pred EEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 998898999999877611882238997678851113899589999999999 Q gi|254781162|r 78 CAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDA 128 (192) Q Consensus 78 A~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~ 128 (192) +..++...|.+-.++-.. ....-|.-++|.++.||.|+|..+..+..+. T Consensus 14 ~y~~~~y~g~AIvt~~~~--~~~~yLdKFavs~~~~g~gv~d~v~~~l~~d 62 (99) T cd03173 14 AYVDEPYEGLAIVTPEGN--GGTAYLDKFAVSSSGWGNGVGDNIFEALRKD 62 (99) T ss_pred EEEECCCCEEEEEECCCC--CCCEEEEEEEECHHHHCCCHHHHHHHHHHHH T ss_conf 999489758999944899--9965777799963454049899999999854 No 103 >COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism] Probab=46.19 E-value=16 Score=17.04 Aligned_cols=33 Identities=21% Similarity=0.394 Sum_probs=27.0 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEECHHHHHHCC Q ss_conf 899999999999613884799840865787429 Q gi|254781162|r 118 GRKLISMSVDAAEKKGSQVIVLVGDIAYYSKLG 150 (192) Q Consensus 118 G~~Li~~~l~~a~~~g~~~v~l~g~~~yY~rfG 150 (192) +-.+|+.++..|+..|...+-|-|..-||+.-. T Consensus 94 aleiM~KaI~LA~dLGIRtIQLAGYDVYYE~~d 126 (287) T COG3623 94 ALEIMEKAIQLAQDLGIRTIQLAGYDVYYEEAD 126 (287) T ss_pred HHHHHHHHHHHHHHHCCEEEEECCCEEEECCCC T ss_conf 999999999999970842676324234414688 No 104 >TIGR02089 TTC tartrate dehydrogenase; InterPro: IPR011829 Tartrate dehydrogenase catalyzes the oxidation of both meso- and (+)-tartrate as well as a D-malate . These enzymes are closely related to the 3-isopropylmalate and isohomocitrate dehydrogenases found in IPR004429 from INTERPRO and IPR011828 from INTERPRO, respectively.; GO: 0009027 tartrate dehydrogenase activity, 0051287 NAD binding. Probab=45.50 E-value=23 Score=16.13 Aligned_cols=14 Identities=21% Similarity=0.741 Sum_probs=5.9 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 89958999999999 Q gi|254781162|r 114 NKGIGRKLISMSVD 127 (192) Q Consensus 114 g~GiG~~Li~~~l~ 127 (192) |-|||+..|.++++ T Consensus 11 GDGIG~eV~~aa~~ 24 (355) T TIGR02089 11 GDGIGKEVVAAALQ 24 (355) T ss_pred CCCCCHHHHHHHHH T ss_conf 88986537999999 No 105 >TIGR00455 apsK adenylylsulfate kinase; InterPro: IPR002891 Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulphate. This domain contains an ATP binding P-loop motif .; GO: 0005524 ATP binding, 0016301 kinase activity, 0016772 transferase activity transferring phosphorus-containing groups, 0000103 sulfate assimilation. Probab=42.81 E-value=24 Score=16.11 Aligned_cols=39 Identities=38% Similarity=0.581 Sum_probs=31.5 Q ss_pred CHHHHHHHHHHHH-HHHCCCCEEEEEEC---HHHHHHCCCEEC Q ss_conf 9589999999999-96138847998408---657874294874 Q gi|254781162|r 116 GIGRKLISMSVDA-AEKKGSQVIVLVGD---IAYYSKLGFQAV 154 (192) Q Consensus 116 GiG~~Li~~~l~~-a~~~g~~~v~l~g~---~~yY~rfGF~~~ 154 (192) |=|+--+..|+++ +.++|+...+|-|| ..+|.-+||... T Consensus 29 GsGKsTiA~Al~~~L~~~G~~~~~LDGDnvR~gL~~dLGFS~~ 71 (187) T TIGR00455 29 GSGKSTIANALEKKLEKKGYRVYVLDGDNVRHGLNKDLGFSEE 71 (187) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEECCCEEECCCCCCCCCCHH T ss_conf 5635799999999999669749997586342477888888856 No 106 >TIGR02362 dhaK1b probable dihydroxyacetone kinase DhaK1b subunit; InterPro: IPR012735 Two types of dihydroxyacetone kinase (glycerone kinase) are described. In yeast and a few bacteria, e.g. Citrobacter freundii, the enzyme is a single chain that uses ATP as phosphoryl donor and is designated 2.7.1.29 from EC. By contract, Escherichia coli and many other bacterial species have a multisubunit form with a phosphoprotein donor related to PTS transport proteins. This family represents a protein, unique to the firmicutes (low GC Gram-positives), that appears to be a divergent second copy of the K subunit of that complex; its gene is always found in operons with the other three proteins of the complex.. Probab=40.74 E-value=21 Score=16.45 Aligned_cols=47 Identities=26% Similarity=0.397 Sum_probs=33.0 Q ss_pred EEECCCC----CCCCH-HHHHHHHHHHHHHHCCCCEEEEE--EC--HHHHHHCCCE Q ss_conf 7885111----38995-89999999999961388479984--08--6578742948 Q gi|254781162|r 106 IVVHPLY----QNKGI-GRKLISMSVDAAEKKGSQVIVLV--GD--IAYYSKLGFQ 152 (192) Q Consensus 106 laV~P~~----rg~Gi-G~~Li~~~l~~a~~~g~~~v~l~--g~--~~yY~rfGF~ 152 (192) ++|.++- |+||+ |+-|+.+.|-+|.+.|+..--|. |. ..--.-.||- T Consensus 134 iSVe~~~~f~~R~RGvAGtvLvHKIlGaAA~~GaSL~~L~~la~~l~~~~aTlGvA 189 (328) T TIGR02362 134 ISVESESSFKKRRRGVAGTVLVHKILGAAAKEGASLDELEHLAAALVTNIATLGVA 189 (328) T ss_pred CCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 33176455633035632577998999998523766789999999986336767667 No 107 >COG2921 Uncharacterized conserved protein [Function unknown] Probab=37.76 E-value=31 Score=15.41 Aligned_cols=43 Identities=9% Similarity=0.155 Sum_probs=34.7 Q ss_pred HHHHCCCCCEEEEEEECCCH-HHHCCEEEEEECCHHHHHHHHHH Q ss_conf 43110343224676421503-32012034420798999999998 Q gi|254781162|r 8 LRKFLSIAFFEFICWRRSRW-QKIGAFFLERLEHDSSICAMHAD 50 (192) Q Consensus 8 ~~~~~~~~~~~~~~~r~~~~-~~~~~~~~~R~e~~~di~ai~~~ 50 (192) ++++.+.+..+=+.||.|.+ .-.+.-+.||+++-..+++++.. T Consensus 36 vqr~ap~~~~~~~~~k~SSkGnY~svsI~i~A~~~EQ~e~ly~e 79 (90) T COG2921 36 VQRHAPGDYTPRVSWKPSSKGNYLSVSITIRATNIEQVEALYRE 79 (90) T ss_pred HHHHCCCCCCCEEEECCCCCCCEEEEEEEEEECCHHHHHHHHHH T ss_conf 99878765576056314889846889999997778999999999 No 108 >PRK12313 glycogen branching enzyme; Provisional Probab=36.81 E-value=32 Score=15.32 Aligned_cols=44 Identities=11% Similarity=0.139 Sum_probs=34.1 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEE--CHHHHHHCCCEECCCCCEECC Q ss_conf 99999999999613884799840--865787429487463653668 Q gi|254781162|r 119 RKLISMSVDAAEKKGSQVIVLVG--DIAYYSKLGFQAVPWKSLILP 162 (192) Q Consensus 119 ~~Li~~~l~~a~~~g~~~v~l~g--~~~yY~rfGF~~~~~~~l~~p 162 (192) +.+.+..+..+++.|+..|-|.. +.++|..+||.+...+....- T Consensus 169 ~e~a~~l~~yl~~lG~t~velmpv~e~p~~~swGY~~~~~fa~~sr 214 (632) T PRK12313 169 RELADELIPYVKEMGYTHVEFMPLMEHPLDGSWGYQLTGYFAPTSR 214 (632) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEECCCCCC T ss_conf 9999999999998198779982044568988756666500146567 No 109 >cd04266 DUF619-NAGS-FABP DUF619-NAGS-FABP: DUF619 domain of N-acetylglutamate synthase (NAGS) of the fungal arginine-biosynthetic pathway (FABP). This NAGS (ARG2) consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain, yet to be characterized, is predicted to function in NAG synthase association in fungi. Probab=36.13 E-value=33 Score=15.25 Aligned_cols=55 Identities=18% Similarity=0.092 Sum_probs=33.9 Q ss_pred EEEEEEECCEEEEEEEEEEEEEC--CCCEEEEEEEEECCCCCC-CCHHHHHHHHHHHH Q ss_conf 78999988989999998776118--822389976788511138-99589999999999 Q gi|254781162|r 74 LSFLCAEGKRIVGSVRMTPISIE--KITGHLLGPIVVHPLYQN-KGIGRKLISMSVDA 128 (192) Q Consensus 74 lslVA~~~~~IVG~i~~~pv~~~--~~~~~~LgplaV~P~~rg-~GiG~~Li~~~l~~ 128 (192) |..|...++---+.|..+....+ .....-|-..||.|+.|| .|+|..|.....+. T Consensus 11 la~iii~gdY~g~AIlT~E~~~~~~~~~vpYLDKFAV~~~aqG~~Gl~d~iw~~m~~~ 68 (108) T cd04266 11 LATVIIAGDYEGAAILTWEGPDGSTPEKIAYLDKFAVLPKAQGSDGIADILFNAMLDE 68 (108) T ss_pred EEEEEEECCCEEEEEEEECCCCCCCCCCCCEEEEEEECHHHCCCCCHHHHHHHHHHHH T ss_conf 2489995574058999844789888886505520454322248632899999999986 No 110 >TIGR01490 HAD-SF-IB-hyp1 HAD-superfamily subfamily IB hydrolase, TIGR01490; InterPro: IPR006385 This group of sequences belong to the IB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The sequences are all bacterial. The IB subfamily includes the enzyme phosphoserine phosphatase. Due to this relationship, several of these sequences have been annotated as "phosphoserine phosphatase related proteins," or "phosphoserine phosphatase-family enzymes." There is presently no evidence that any of the proteins in this family possess PSPase activity. . Probab=35.30 E-value=34 Score=15.17 Aligned_cols=35 Identities=14% Similarity=0.258 Sum_probs=27.1 Q ss_pred HHHHHHHHHHCCCCEEEEEECH-----HHHHHCCCEECCC Q ss_conf 9999999961388479984086-----5787429487463 Q gi|254781162|r 122 ISMSVDAAEKKGSQVIVLVGDI-----AYYSKLGFQAVPW 156 (192) Q Consensus 122 i~~~l~~a~~~g~~~v~l~g~~-----~yY~rfGF~~~~~ 156 (192) +...+++-+++|+..+++.+.+ ++++++||..+-. T Consensus 94 a~~~~~~H~~~G~~~vlvSaS~~~~v~~~A~~~G~~~~~g 133 (204) T TIGR01490 94 ARDLIKAHKAEGHTIVLVSASLEILVKPLARKLGIDNAIG 133 (204) T ss_pred HHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHCCCCEEEE T ss_conf 9999999998089389981678899999998718660301 No 111 >PRK09936 hypothetical protein; Provisional Probab=34.74 E-value=35 Score=15.12 Aligned_cols=33 Identities=24% Similarity=0.391 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHCCCCEEE-EEECHHHHHHCC Q ss_conf 899999999999613884799-840865787429 Q gi|254781162|r 118 GRKLISMSVDAAEKKGSQVIV-LVGDIAYYSKLG 150 (192) Q Consensus 118 G~~Li~~~l~~a~~~g~~~v~-l~g~~~yY~rfG 150 (192) -+..+.+.++.|+.+|...++ |..||+||.+.- T Consensus 70 ~~gwLa~~l~~A~~aGl~lvlGLyaDP~yf~~~~ 103 (294) T PRK09936 70 QRGWLFKRLAAAQQAGLKLVVGLYADPEYFMHQK 103 (294) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHH T ss_conf 2039999999998669628886315858865656 No 112 >PRK09331 Sep-tRNA:Cys-tRNA synthetase; Provisional Probab=33.83 E-value=15 Score=17.34 Aligned_cols=42 Identities=12% Similarity=0.154 Sum_probs=16.8 Q ss_pred EEEEECCC-CCCCCHHHHHHHHHHHHHHHCC--CCEEEEEECHHH Q ss_conf 76788511-1389958999999999996138--847998408657 Q gi|254781162|r 104 GPIVVHPL-YQNKGIGRKLISMSVDAAEKKG--SQVIVLVGDIAY 145 (192) Q Consensus 104 gplaV~P~-~rg~GiG~~Li~~~l~~a~~~g--~~~v~l~g~~~y 145 (192) -|+.|.|+ +...||-..-....++.+.+.+ ...++++.+|+| T Consensus 124 ~~~~v~~~~~~~~~I~~~~~~~~Iee~~~e~gk~~aL~llt~pdy 168 (386) T PRK09331 124 NVREVPKTGYPDYRITPEAYAEKIEEVEDETGKPPALALLTHVDG 168 (386) T ss_pred EEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 589936877723450800579999999986189855999968886 No 113 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=32.83 E-value=20 Score=16.58 Aligned_cols=32 Identities=16% Similarity=0.346 Sum_probs=22.2 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 8511138995899999999999613884799840 Q gi|254781162|r 108 VHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVG 141 (192) Q Consensus 108 V~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g 141 (192) .+|+.|.+++ .+|+.+++.|...|...+.+.| T Consensus 84 ~d~~~r~~~l--e~l~kaI~lA~~LGi~~I~l~g 115 (284) T PRK13210 84 RDEATRERAL--EIMKKAIRLAQDLGIRTIQLAG 115 (284) T ss_pred CCHHHHHHHH--HHHHHHHHHHHHCCCCEEEECC T ss_conf 8989999999--9999999999980997899688 No 114 >pfam06559 DCD 2'-deoxycytidine 5'-triphosphate deaminase (DCD). This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (EC:3.5.4.13). Probab=32.14 E-value=38 Score=14.87 Aligned_cols=51 Identities=25% Similarity=0.369 Sum_probs=26.7 Q ss_pred CCEEEEEECCCCCCCCEEEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCH Q ss_conf 100131103888876078999988989999998776118822389976788511138995 Q gi|254781162|r 58 VRAAVLLREQGMHDLSLSFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGI 117 (192) Q Consensus 58 ~~~~~~lR~~~~~~~~lslVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~Gi 117 (192) +++..-.|-+..| |+ .++|++||-+.|-++.......| | -.+...||+||+ T Consensus 308 sraVLEVRs~evP-----F~-ledgQ~v~rL~yE~m~~~P~~lY--G-~~i~SNYq~QgL 358 (366) T pfam06559 308 SRAVLEVRSHEVP-----FI-LEHGQIVGRLVYERMAARPATLY--G-AGLGSNYQGQGL 358 (366) T ss_pred CCEEEEEECCCCC-----EE-EECCCEEEEEEEEECCCCCHHHC--C-CCCCCCCCCCCC T ss_conf 4079998348787-----67-66797889999877004861210--7-876654212376 No 115 >PRK07559 2'-deoxycytidine 5'-triphosphate deaminase; Provisional Probab=31.77 E-value=39 Score=14.83 Aligned_cols=36 Identities=19% Similarity=0.124 Sum_probs=20.3 Q ss_pred EECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCH Q ss_conf 988989999998776118822389976788511138995 Q gi|254781162|r 79 AEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGI 117 (192) Q Consensus 79 ~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~Gi 117 (192) .++|++||-+.|-++.......|+ -.+...||+||+ T Consensus 323 le~gQ~v~rl~ye~m~~~p~~lYG---~~i~SnYq~QgL 358 (366) T PRK07559 323 LEHGQIVGRLVYERMLERPDALYG---AGIGSNYQAQGL 358 (366) T ss_pred EECCCEEEEEEEEECCCCCCCCCC---CCCCCCCHHCCC T ss_conf 768978899998870148711116---775754110266 No 116 >TIGR02277 PaaX_trns_reg phenylacetic acid degradation operon negative regulatory protein PaaX; InterPro: IPR011965 This transcriptional regulator is always found in association with operons believed to be involved in the degradation of phenylacetic acid . The gene product has been shown to bind to the promoter sites and repress their transcription .. Probab=31.11 E-value=29 Score=15.62 Aligned_cols=73 Identities=22% Similarity=0.307 Sum_probs=47.6 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECHHHHH-----HCCCEECCCCCEECCCCCCHHHEEEEE Q ss_conf 389976788511138995899999999999613884799840865787-----429487463653668999956669843 Q gi|254781162|r 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGDIAYYS-----KLGFQAVPWKSLILPAPVDPNRVLFLP 174 (192) Q Consensus 100 ~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~~~yY~-----rfGF~~~~~~~l~~p~pv~~~~~l~l~ 174 (192) ...||.|. .=.++-||+.++++.++.++.++|+-..-=.|.-.||. +--|..+...=...|.+--...+.-+- T Consensus 17 ~IWlG~Li--~~l~~~Gi~E~~VRTAvsRL~a~GWL~~Er~GRrSfY~L~D~G~~~~~~A~~~IY~~p~~~W~g~W~Ll~ 94 (287) T TIGR02277 17 AIWLGSLI--ELLEGLGINERLVRTAVSRLVAQGWLQSERKGRRSFYRLSDKGRREFAAAARRIYDPPAPAWDGKWELLL 94 (287) T ss_pred EEEHHHHH--HHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEE T ss_conf 03376699--9962258883379999999865100010245441457788999999999987405788867041156655 No 117 >PRK13876 conjugal transfer coupling protein TraG; Provisional Probab=30.92 E-value=39 Score=14.84 Aligned_cols=12 Identities=33% Similarity=0.664 Sum_probs=5.1 Q ss_pred EECCCCCCCCHH Q ss_conf 885111389958 Q gi|254781162|r 107 VVHPLYQNKGIG 118 (192) Q Consensus 107 aV~P~~rg~GiG 118 (192) .+.|+-.|||+| T Consensus 149 ~~APTRSGKGVG 160 (662) T PRK13876 149 CFAPTRSGKGVG 160 (662) T ss_pred EECCCCCCCEEE T ss_conf 975799987015 No 118 >cd04265 DUF619-NAGS-U DUF619-NAGS-U: This CD includes the DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea (U) cycle found in humans and fish, as well as the DUF619 domain present C-terminal of a NAG kinase-like domain seen in a limited number of bacterial and Dictyostelium predicted NAGSs. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain is yet to be characterized. Probab=29.03 E-value=44 Score=14.55 Aligned_cols=30 Identities=23% Similarity=0.262 Sum_probs=25.2 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHH Q ss_conf 389976788511138995899999999999 Q gi|254781162|r 100 GHLLGPIVVHPLYQNKGIGRKLISMSVDAA 129 (192) Q Consensus 100 ~~~LgplaV~P~~rg~GiG~~Li~~~l~~a 129 (192) ..-|-..+|.++.||.|+|..+..+..+.- T Consensus 30 vpyLDKFaV~~~a~G~gv~d~vw~~m~~d~ 59 (95) T cd04265 30 TPYLDKFAVSSSAQGEGLGRALWRCLREDF 59 (95) T ss_pred CEEEEEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 878865788652102487999999998558 No 119 >COG2243 CobF Precorrin-2 methylase [Coenzyme metabolism] Probab=27.76 E-value=46 Score=14.42 Aligned_cols=21 Identities=24% Similarity=0.558 Sum_probs=14.1 Q ss_pred CC-CCEEEEEECHHHHHHCCCE Q ss_conf 38-8479984086578742948 Q gi|254781162|r 132 KG-SQVIVLVGDIAYYSKLGFQ 152 (192) Q Consensus 132 ~g-~~~v~l~g~~~yY~rfGF~ 152 (192) .| -.+.+.+|||.||..|++- T Consensus 89 ~G~~VAf~~lGDP~~YsTf~~l 110 (234) T COG2243 89 AGRDVAFLTLGDPTFYSTFMYL 110 (234) T ss_pred CCCEEEEEECCCCCHHHHHHHH T ss_conf 6992899981485288879999 No 120 >cd04736 MDH_FMN Mandelate dehydrogenase (MDH)-like FMN-binding domain. MDH is part of a widespread family of homologous FMN-dependent a-hydroxy acid oxidizing enzymes that oxidizes (S)-mandelate to phenylglyoxalate. MDH is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=26.99 E-value=47 Score=14.34 Aligned_cols=28 Identities=29% Similarity=0.350 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHH Q ss_conf 9999999999613884799840865787 Q gi|254781162|r 120 KLISMSVDAAEKKGSQVIVLVGDIAYYS 147 (192) Q Consensus 120 ~Li~~~l~~a~~~g~~~v~l~g~~~yY~ 147 (192) .+.+..+++|++.|+..++|.-|.+.+. T Consensus 128 ~~~~~li~RA~~aG~~alvlTvD~pv~G 155 (361) T cd04736 128 ELAELLVKRALAAGYTTLVLTTDVAVNG 155 (361) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 9999999999985998689950788878 No 121 >PRK05576 cobalt-precorrin-2 C(20)-methyltransferase; Validated Probab=26.52 E-value=48 Score=14.31 Aligned_cols=11 Identities=36% Similarity=0.597 Sum_probs=4.5 Q ss_pred ECHHHHHHCCC Q ss_conf 08657874294 Q gi|254781162|r 141 GDIAYYSKLGF 151 (192) Q Consensus 141 g~~~yY~rfGF 151 (192) |||.+|.++++ T Consensus 99 GDP~iyst~~~ 109 (224) T PRK05576 99 GDPNVYSTFSH 109 (224) T ss_pred CCCCHHCHHHH T ss_conf 89621120999 No 122 >pfam04765 DUF616 Protein of unknown function (DUF616). Family of uncharacterized proteins. Probab=25.42 E-value=38 Score=14.91 Aligned_cols=57 Identities=14% Similarity=0.123 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHCCCCEEEEEECHHHHHHCCCEECCCCCEECCCCCCHHHEEEEECCC Q ss_conf 999999999613884799840865787429487463653668999956669843577 Q gi|254781162|r 121 LISMSVDAAEKKGSQVIVLVGDIAYYSKLGFQAVPWKSLILPAPVDPNRVLFLPLVQ 177 (192) Q Consensus 121 Li~~~l~~a~~~g~~~v~l~g~~~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L~~ 177 (192) .++++.+-.+...+.--.+--..++|.+-|-.+-....+.+|.+||+..+.+.+.++ T Consensus 196 v~eEa~a~kr~~Ky~~~~id~Q~e~Y~~~GL~~~s~~k~pl~SdVPEg~vIvReHtp 252 (306) T pfam04765 196 VYEEAEANKRLKKYNHESIDEQMEFYRSDGLTPWSDPKLPLPSDVPEGSVIVREHTP 252 (306) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCCC T ss_conf 899999999876507589999999998659997766666776889863178640662 No 123 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=24.83 E-value=52 Score=14.10 Aligned_cols=38 Identities=24% Similarity=0.216 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECHHHH------HHCCCEECCC Q ss_conf 9999999999961388479984086578------7429487463 Q gi|254781162|r 119 RKLISMSVDAAEKKGSQVIVLVGDIAYY------SKLGFQAVPW 156 (192) Q Consensus 119 ~~Li~~~l~~a~~~g~~~v~l~g~~~yY------~rfGF~~~~~ 156 (192) +.+++..+++|++.|+..+++.-|.+.+ .|.||...+. T Consensus 130 r~~~~~li~RA~~aG~~alvlTvD~p~~G~R~rd~r~~~~~~~~ 173 (344) T cd02922 130 RTKTEELLKRAEKLGAKAIFLTVDAPVLGKRERDERLKAEEAVS 173 (344) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHCCCCCCCC T ss_conf 79999999999986998899956788877522666507777887 No 124 >pfam09924 DUF2156 Uncharacterized conserved protein (DUF2156). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=23.31 E-value=56 Score=13.92 Aligned_cols=55 Identities=15% Similarity=0.060 Sum_probs=38.1 Q ss_pred EEEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 8999988989999998776118822389976788511138995899999999999613 Q gi|254781162|r 75 SFLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKK 132 (192) Q Consensus 75 slVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~ 132 (192) .+++..+|+|+|.+.+.|+. +.+++.+--+=-+|+. -+|+-..|+.++++-+++. T Consensus 181 ~~~~~~~g~i~aF~~~~p~~--~~~~~~idl~r~~~da-~~G~~~~l~~~~~~gLr~~ 235 (250) T pfam09924 181 GAVVRVDGKIVAFASGEPLN--GRDTLSLDLMRRDPDA-PNGIMEFLNAELAEGLRRF 235 (250) T ss_pred EEEEEECCEEEEEEEEEEEC--CCCEEEEEEEECCCCC-CCCHHHHHHHHHHHHHHHC T ss_conf 89999999999999999957--9997999998659999-8869999999999999974 No 125 >pfam02457 DisA_N DisA bacterial checkpoint controller nucleotide-binding. The DisA protein is a bacterial checkpoint protein that dimerizes into an octameric complex. The protein consists of three distinct domains. This domain is the first and is a globular, nucleotide-binding region; the next 146-289 residues constitute the DisA-linker family, pfam10635, that consists of an elongated bundle of three alpha helices (alpha-6, alpha-10, and alpha-11), one side of which carries an additional three helices (alpha7-9), which thus forms a spine like-linker between domains 1 and 3. The C-terminal residues, of domain 3, are represented by family HHH, pfam00633, the specific DNA-binding domain. The octameric complex thus has structurally linked nucleotide-binding and DNA-binding HhH domains and the nucleotide-binding domains are bound to a cyclic di-adenosine phosphate such that DisA is a specific di-adenylate cyclase. The di-adenylate cyclase activity is strongly suppressed by binding to branch Probab=22.84 E-value=50 Score=14.22 Aligned_cols=48 Identities=27% Similarity=0.416 Sum_probs=22.3 Q ss_pred EEEEECCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9999889899999987761188223899767885111389958999999999996138847998 Q gi|254781162|r 76 FLCAEGKRIVGSVRMTPISIEKITGHLLGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVL 139 (192) Q Consensus 76 lVA~~~~~IVG~i~~~pv~~~~~~~~~LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l 139 (192) +|- .+++|+..-.+-|+.. .+. ..+++|++= ++++...++-+...+++ T Consensus 59 vII-~~~~i~aa~~~lpls~-------------~~~-l~~~~GtRH-RAA~gise~TdaivivV 106 (122) T pfam02457 59 VII-RGGRIVAAGCYLPLSP-------------DPS-IPKELGTRH-RAALGISEQTDALVIVV 106 (122) T ss_pred EEE-ECCCEEEEEEEEECCC-------------CCC-CCCCCCHHH-HHHHHHHHHCCCEEEEE T ss_conf 999-8993799999997487-------------777-772115567-88879887519889999 No 126 >COG0434 SgcQ Predicted TIM-barrel enzyme [General function prediction only] Probab=22.68 E-value=46 Score=14.42 Aligned_cols=19 Identities=21% Similarity=0.608 Sum_probs=7.3 Q ss_pred CCCEEEE--EECHHHHHHCCC Q ss_conf 8847998--408657874294 Q gi|254781162|r 133 GSQVIVL--VGDIAYYSKLGF 151 (192) Q Consensus 133 g~~~v~l--~g~~~yY~rfGF 151 (192) |++++++ +||.+||+..|+ T Consensus 47 G~DavivEN~gD~Pf~k~v~~ 67 (263) T COG0434 47 GVDAVIVENYGDAPFLKDVGP 67 (263) T ss_pred CCCEEEEECCCCCCCCCCCCH T ss_conf 976899713578877777974 No 127 >pfam09837 DUF2064 Uncharacterized protein conserved in bacteria (DUF2064). This domain, found in various prokaryotic proteins, has no known function. Probab=22.04 E-value=59 Score=13.77 Aligned_cols=71 Identities=21% Similarity=0.311 Sum_probs=41.8 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC--H---HHHHHCCCEECCCCCEECCCCCCHHHEEEEECC Q ss_conf 9767885111389958999999999996138847998408--6---578742948746365366899995666984357 Q gi|254781162|r 103 LGPIVVHPLYQNKGIGRKLISMSVDAAEKKGSQVIVLVGD--I---AYYSKLGFQAVPWKSLILPAPVDPNRVLFLPLV 176 (192) Q Consensus 103 LgplaV~P~~rg~GiG~~Li~~~l~~a~~~g~~~v~l~g~--~---~yY~rfGF~~~~~~~l~~p~pv~~~~~l~l~L~ 176 (192) .+.+.+. .-++..+|.+|- ++++.+-++|+..++++|. | +.|=.--|+......+++- |..+-.+-.+-+. T Consensus 32 ~~~~~~~-~Q~~gdLG~Rm~-~a~~~~~~~g~~~v~lIGsD~P~l~~~~l~~A~~~L~~~d~Vlg-Pa~DGGy~LiG~~ 107 (121) T pfam09837 32 LGGVTVI-PQGGGDLGERMA-RAFRQAFAAGYRPVLLIGTDCPDLTAELLAQAFEALERHDAVLG-PAEDGGYYLLGLR 107 (121) T ss_pred CCCCEEE-ECCCCCHHHHHH-HHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCEEEE-ECCCCCEEEEEEC T ss_conf 7996688-259999899999-99999984389858998488465899999999998621997999-7488999999747 No 128 >pfam06849 DUF1246 Protein of unknown function (DUF1246). This family represents the N-terminus of a number of hypothetical archaeal proteins of unknown function. Probab=21.99 E-value=59 Score=13.77 Aligned_cols=29 Identities=21% Similarity=0.510 Sum_probs=21.2 Q ss_pred HHHHHHHHCCCCEEEEE--ECHHHHHHCCCE Q ss_conf 99999961388479984--086578742948 Q gi|254781162|r 124 MSVDAAEKKGSQVIVLV--GDIAYYSKLGFQ 152 (192) Q Consensus 124 ~~l~~a~~~g~~~v~l~--g~~~yY~rfGF~ 152 (192) +.++-|+..|...+.+. |...+|.+|+|. T Consensus 11 qIl~GAK~EGF~Tv~vc~~gr~~~Y~~f~~~ 41 (124) T pfam06849 11 QILDGAKDEGFRTVAVCQKGREKFYRRFPFV 41 (124) T ss_pred HHHHHHHHCCCCEEEEEECCCCCHHHHCCCC T ss_conf 9972388739957899817975213326867 No 129 >TIGR02665 molyb_mobA molybdopterin-guanine dinucleotide biosynthesis protein A; InterPro: IPR013482 In many molybdopterin-containing enzymes, including nitrate reductase and dimethylsulphoxide reductase, the cofactor is molybdopterin-guanine dinucleotide. This entry describes proteobacterial MobA (molybdopterin-guanine dinucleotide biosynthesis protein A). MobA synthesises molybdopterin-guanine dinucleotide from molybdopterin and GTP. ; GO: 0003824 catalytic activity, 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=21.14 E-value=51 Score=14.15 Aligned_cols=47 Identities=23% Similarity=0.374 Sum_probs=34.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEE-EEECHHHHHHCCC--EECCCC---CEECCCC Q ss_conf 899999999999613884799-8408657874294--874636---5366899 Q gi|254781162|r 118 GRKLISMSVDAAEKKGSQVIV-LVGDIAYYSKLGF--QAVPWK---SLILPAP 164 (192) Q Consensus 118 G~~Li~~~l~~a~~~g~~~v~-l~g~~~yY~rfGF--~~~~~~---~l~~p~p 164 (192) |+-|++|++++.+=|-...++ .=++++=|+-+|| ..++.. .-..+|| T Consensus 26 g~PL~~hv~~rL~PQv~~~~IsANRn~~~Y~~~g~Gl~V~~D~~DA~~~F~GP 78 (202) T TIGR02665 26 GKPLIEHVLARLRPQVSDLAISANRNPERYAQAGFGLPVVPDDVDALADFPGP 78 (202) T ss_pred CCCHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHCCCCEECCCCCCCCCCCCCC T ss_conf 65289999998435076667413889778988608973127853435788887 No 130 >TIGR02708 L_lactate_ox L-lactate oxidase; InterPro: IPR014080 Members of this entry oxidize L-lactate to pyruvate, reducing molecular oxygen to hydrogen peroxide. The enzyme is known in Aerococcus viridans, Streptococcus iniae, and some strains of Streptococcus pyogenes where it appears to contribute to virulence.. Probab=20.82 E-value=63 Score=13.62 Aligned_cols=45 Identities=18% Similarity=0.283 Sum_probs=27.3 Q ss_pred HHHHHHHHHCCCCEEEEEECHHHHHHCCCEECCCCCEECC--CCCCHHH Q ss_conf 9999999613884799840865787429487463653668--9999566 Q gi|254781162|r 123 SMSVDAAEKKGSQVIVLVGDIAYYSKLGFQAVPWKSLILP--APVDPNR 169 (192) Q Consensus 123 ~~~l~~a~~~g~~~v~l~g~~~yY~rfGF~~~~~~~l~~p--~pv~~~~ 169 (192) ++.++++|..|+.+|+|+.|...=+..-- ..++++++| .|+-.+| T Consensus 149 R~I~D~~Ka~G~~AIvLTADaTV~GNR~~--D~~N~FVfP~GMPIV~~Y 195 (368) T TIGR02708 149 RDILDRVKADGAKAIVLTADATVGGNREV--DKRNGFVFPVGMPIVQEY 195 (368) T ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCCC--CCCCCEEECCCCCHHHHC T ss_conf 46788875278528997214633577441--355873611556033310 No 131 >pfam01276 OKR_DC_1 Orn/Lys/Arg decarboxylase, major domain. Probab=20.26 E-value=59 Score=13.77 Aligned_cols=23 Identities=13% Similarity=-0.026 Sum_probs=13.5 Q ss_pred ECHHHHHHCCCEECCCCCEECCC Q ss_conf 08657874294874636536689 Q gi|254781162|r 141 GDIAYYSKLGFQAVPWKSLILPA 163 (192) Q Consensus 141 g~~~yY~rfGF~~~~~~~l~~p~ 163 (192) -|.++...|+|.+.+...+.+.. T Consensus 202 vDEAhGah~~F~plp~~a~~~ga 224 (417) T pfam01276 202 FDSAWVGYEQFIPIYADASPMGG 224 (417) T ss_pred EECCCCCCCCCCCCCHHHHHCCC T ss_conf 97887520246878334665678 Done!