Query gi|254781133|ref|YP_003065546.1| hypothetical protein CLIBASIA_05175 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 159 No_of_seqs 116 out of 1701 Neff 7.6 Searched_HMMs 39220 Date Mon May 30 05:19:17 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781133.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10809 ribosomal-protein-S5- 100.0 9.1E-30 2.3E-34 191.0 15.0 117 32-148 70-194 (194) 2 PRK10151 ribosomal-protein-L7/ 100.0 4.2E-27 1.1E-31 175.7 15.3 134 8-143 37-178 (179) 3 TIGR03585 PseH pseudaminic aci 99.9 6E-24 1.5E-28 157.6 14.2 131 2-137 18-155 (156) 4 COG1670 RimL Acetyltransferase 99.9 6.4E-21 1.6E-25 140.1 14.0 131 15-148 45-186 (187) 5 PRK10140 putative acetyltransf 99.8 2.5E-19 6.4E-24 130.9 13.5 134 1-141 20-162 (162) 6 COG1247 Sortase and related ac 99.4 5.9E-12 1.5E-16 88.6 13.7 124 13-142 32-165 (169) 7 PRK09491 rimI ribosomal-protei 99.3 6.5E-11 1.6E-15 82.6 12.6 116 13-139 22-142 (144) 8 TIGR01575 rimI ribosomal-prote 99.0 5E-09 1.3E-13 71.7 11.3 107 13-128 16-127 (140) 9 cd04301 GNAT GCN5-related N-ac 99.0 5.2E-09 1.3E-13 71.6 9.5 78 42-120 2-83 (83) 10 PRK10975 TDP-fucosamine acetyl 99.0 2.1E-08 5.4E-13 68.1 12.6 103 19-125 121-225 (231) 11 COG3981 Predicted acetyltransf 98.9 1.8E-08 4.6E-13 68.5 10.8 111 18-130 42-165 (174) 12 pfam00583 Acetyltransf_1 Acety 98.9 6.8E-09 1.7E-13 71.0 7.7 75 45-120 2-80 (80) 13 PRK03624 putative acetyltransf 98.8 8.1E-08 2.1E-12 64.8 9.4 82 41-124 47-130 (140) 14 KOG4135 consensus 98.7 1E-07 2.6E-12 64.2 7.7 116 3-123 32-169 (185) 15 TIGR03448 mycothiol_MshD mycot 98.6 1.4E-06 3.5E-11 57.7 11.1 88 36-124 197-288 (292) 16 PRK10146 aminoalkylphosphonic 98.5 3.1E-06 7.9E-11 55.7 11.9 85 39-124 47-138 (144) 17 KOG3139 consensus 98.5 3.2E-06 8.1E-11 55.6 10.6 101 35-138 53-161 (165) 18 PTZ00330 acetyltransferase; Pr 98.3 1.9E-05 5E-10 51.1 12.3 105 12-124 29-141 (147) 19 pfam08445 FR47 FR47-like prote 98.2 9.7E-06 2.5E-10 52.8 7.9 74 42-123 4-81 (86) 20 COG0456 RimI Acetyltransferase 97.9 0.00017 4.4E-09 45.6 10.3 80 49-128 72-158 (177) 21 PRK10514 hypothetical protein; 97.8 0.00014 3.7E-09 46.1 8.4 77 42-128 53-130 (146) 22 PRK10562 hypothetical protein; 97.8 0.00029 7.4E-09 44.3 9.8 88 41-138 50-139 (145) 23 TIGR02382 wecD_rffC TDP-D-fuco 97.6 0.00012 3.1E-09 46.5 4.9 103 17-124 90-195 (201) 24 COG3393 Predicted acetyltransf 97.6 0.00075 1.9E-08 41.9 8.8 84 39-125 177-263 (268) 25 PRK07757 acetyltransferase; Pr 97.6 0.00061 1.6E-08 42.4 8.3 73 43-123 45-121 (152) 26 TIGR03103 trio_acet_GNAT GNAT- 97.6 0.00052 1.3E-08 42.8 7.9 93 35-128 121-224 (547) 27 TIGR02406 ectoine_EctA L-2,4-d 97.5 0.00044 1.1E-08 43.3 6.6 83 41-123 41-131 (162) 28 PRK09831 hypothetical protein; 97.2 0.0027 7E-08 38.7 8.3 109 12-134 26-138 (147) 29 KOG3216 consensus 97.2 0.0032 8.3E-08 38.3 8.6 85 38-123 53-145 (163) 30 PRK07922 N-acetylglutamate syn 97.2 0.0031 7.9E-08 38.4 8.1 75 40-122 48-126 (170) 31 KOG3396 consensus 97.1 0.0038 9.8E-08 37.8 8.0 102 16-124 34-144 (150) 32 PRK10314 hypothetical protein; 96.5 0.025 6.3E-07 33.2 8.5 81 40-123 49-133 (153) 33 KOG2488 consensus 96.4 0.017 4.3E-07 34.1 7.1 82 39-122 92-180 (202) 34 COG1246 ArgA N-acetylglutamate 96.2 0.034 8.6E-07 32.4 7.8 74 44-123 45-122 (153) 35 PRK12308 bifunctional arginino 96.0 0.037 9.5E-07 32.2 7.2 72 43-122 507-582 (614) 36 TIGR03448 mycothiol_MshD mycot 96.0 0.11 2.9E-06 29.4 9.5 80 41-125 48-129 (292) 37 PRK05279 N-acetylglutamate syn 95.3 0.14 3.5E-06 28.9 8.0 76 43-124 338-417 (441) 38 pfam08444 Gly_acyl_tr_C Aralky 93.8 0.3 7.6E-06 27.0 6.7 71 45-121 5-77 (89) 39 COG3818 Predicted acetyltransf 93.2 0.22 5.6E-06 27.7 5.2 56 74-130 97-154 (167) 40 KOG3235 consensus 91.7 0.99 2.5E-05 24.0 7.8 94 35-128 38-139 (193) 41 PRK01346 hypothetical protein; 90.2 1.4 3.5E-05 23.1 9.1 82 35-122 43-134 (411) 42 KOG3138 consensus 88.3 1.3 3.4E-05 23.2 5.3 82 46-127 62-155 (187) 43 PHA01733 hypothetical protein 88.3 1.9 4.9E-05 22.3 8.3 85 44-131 52-138 (153) 44 KOG3397 consensus 86.9 2.3 6E-05 21.8 7.1 100 15-123 37-140 (225) 45 KOG3234 consensus 86.9 2.3 6E-05 21.8 7.0 93 41-135 44-145 (173) 46 cd02169 Citrate_lyase_ligase C 84.9 3 7.6E-05 21.2 7.0 77 39-123 5-83 (296) 47 COG3882 FkbH Predicted enzyme 83.3 3.5 8.9E-05 20.8 5.4 83 38-123 459-549 (574) 48 COG3153 Predicted acetyltransf 82.4 3.8 9.7E-05 20.6 9.3 85 37-128 44-135 (171) 49 TIGR01890 N-Ac-Glu-synth amino 79.1 2 5E-05 22.2 2.9 91 15-124 320-414 (439) 50 TIGR01686 FkbH FkbH domain; In 75.2 4.7 0.00012 20.1 4.0 85 39-123 245-337 (337) 51 pfam02100 ODC_AZ Ornithine dec 63.8 11 0.00029 17.9 5.6 51 76-126 114-164 (187) 52 pfam11039 DUF2824 Protein of u 58.6 5.9 0.00015 19.5 1.7 100 39-145 38-144 (151) 53 PHA00771 head assembly protein 58.3 6 0.00015 19.5 1.7 100 39-145 38-144 (151) 54 TIGR00715 precor6x_red precorr 55.3 16 0.0004 17.1 3.4 135 11-154 73-209 (260) 55 COG2153 ElaA Predicted acyltra 51.9 18 0.00047 16.7 7.9 79 42-123 53-135 (155) 56 PHA00432 internal virion prote 48.9 20 0.00052 16.4 3.2 43 80-122 77-120 (138) 57 pfam11090 DUF2833 Protein of u 46.8 22 0.00057 16.2 3.7 42 80-121 39-81 (86) 58 TIGR01211 ELP3 histone acetylt 46.7 22 0.00057 16.2 4.5 88 33-123 453-566 (573) 59 COG5628 Predicted acetyltransf 45.7 23 0.00059 16.1 6.8 95 40-139 38-138 (143) 60 pfam04958 AstA Arginine N-succ 45.4 23 0.00059 16.1 6.1 46 12-57 26-78 (338) 61 pfam04377 ATE_C Arginine-tRNA- 44.5 24 0.00061 16.0 8.1 98 12-120 19-118 (128) 62 TIGR03244 arg_catab_AstA argin 43.8 25 0.00063 15.9 6.2 46 12-57 25-75 (336) 63 TIGR00538 hemN oxygen-independ 41.9 26 0.00068 15.8 5.9 79 7-117 108-186 (462) 64 TIGR03243 arg_catab_AOST argin 38.8 30 0.00076 15.5 6.1 30 95-124 218-247 (335) 65 TIGR00758 UDG_fam4 uracil-DNA 38.5 18 0.00045 16.8 1.5 54 80-133 84-137 (185) 66 TIGR03245 arg_AOST_alph argini 36.5 32 0.00083 15.3 6.3 47 11-57 24-76 (336) 67 cd02034 CooC The accessory pro 35.2 34 0.00087 15.1 3.8 40 71-110 3-42 (116) 68 PRK01305 arginyl-tRNA-protein 31.8 39 0.00099 14.8 8.0 97 14-121 126-224 (241) 69 COG1243 ELP3 Histone acetyltra 30.7 41 0.001 14.7 4.5 15 109-123 494-508 (515) 70 PRK10556 hypothetical protein; 30.0 42 0.0011 14.6 2.9 17 111-127 9-25 (111) 71 PRK10456 arginine succinyltran 26.6 48 0.0012 14.3 6.2 47 11-57 26-77 (344) 72 TIGR01263 4HPPD 4-hydroxypheny 25.4 51 0.0013 14.1 2.2 30 95-124 2-32 (379) 73 pfam09400 DUF2002 Protein of u 25.0 52 0.0013 14.1 2.8 17 111-127 9-25 (110) 74 COG3053 CitC Citrate lyase syn 24.6 52 0.0013 14.1 8.3 98 17-123 16-114 (352) 75 PRK13886 conjugal transfer pro 23.9 54 0.0014 14.0 4.2 12 112-123 170-181 (241) 76 KOG2239 consensus 23.3 49 0.0012 14.2 1.6 19 108-126 67-86 (209) 77 KOG4700 consensus 21.7 46 0.0012 14.4 1.3 28 2-30 72-99 (207) 78 KOG4387 consensus 21.6 60 0.0015 13.7 4.4 45 78-122 116-163 (191) 79 PRK13239 alkylmercury lyase; P 21.2 61 0.0016 13.7 2.5 85 11-109 32-123 (206) 80 TIGR02826 RNR_activ_nrdG3 anae 20.6 63 0.0016 13.6 2.5 25 8-32 39-64 (165) 81 TIGR02768 TraA_Ti Ti-type conj 20.4 39 0.00099 14.8 0.7 51 76-128 511-569 (888) No 1 >PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional Probab=99.97 E-value=9.1e-30 Score=191.03 Aligned_cols=117 Identities=10% Similarity=0.045 Sum_probs=103.4 Q ss_pred CCCCCCCEEEEEECCCCEEEEEEEEEEEEC-CCCCCCEEEECC---CCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCC Q ss_conf 114878549999825760767999999834-678311368637---7660359999999998624865079999982642 Q gi|254781133|r 32 CSSGWDRFVSMGILKSNFLVAGIIYHNYCP-ISRVIELSGASD---CKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDED 107 (159) Q Consensus 32 ~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~-~~~~~eig~~~~---~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N 107 (159) ...+....++|..++++++||.|+|+++.. ....+||||++. .++||++||++++++|+|+++++|||++.|.++| T Consensus 70 ~~~g~~~~~~i~~k~~~~liG~i~l~~i~~~~~~~~eIGY~l~~~~~GkGyatEA~~ali~~aF~~l~lhRIea~~~~~N 149 (194) T PRK10809 70 HKQGSAFYFGLLDPEEKEIIGVANFSNVVRGSFHACYLGYSIGQKWQGQGLMFEALTAAIRYMQRTQHIHRIMANYMPHN 149 (194) T ss_pred HHCCCCEEEEEEECCCCEEEEEEEEEECCCCCCCEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEEHHHCC T ss_conf 86698179999998999799999987425776516889999988894888899999999999987379538956432228 Q ss_pred CHHHHHHHHCCCEEEEEECCC----CCEEEEEEEEECHHHHHHHH Q ss_conf 158999987495896676128----94100799986288986101 Q gi|254781133|r 108 YPQHRMLTSLGGIRYRIPRLR----GRNAAENIYVITHEAWMHNK 148 (159) Q Consensus 108 ~~S~rv~eklGF~~egv~R~~----G~~~d~~v~~l~~~eW~~~~ 148 (159) .+|++|+||+||++||+.|++ |+|.|+++|+||++||.++| T Consensus 150 ~~S~rvleklGF~~EG~~r~~~~~~G~w~D~~~yall~~ew~~~r 194 (194) T PRK10809 150 KRSGDLLARLGFEKEGYAKDYLLIDGQWRDHVLTALTTPDWTPGR 194 (194) T ss_pred HHHHHHHHHCCCEEEEEEEEEEEECCEEEEEEEEEEEHHHCCCCC T ss_conf 775399998799688855100999999989889998877738998 No 2 >PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional Probab=99.95 E-value=4.2e-27 Score=175.71 Aligned_cols=134 Identities=10% Similarity=0.029 Sum_probs=115.0 Q ss_pred EECCCC-CCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECC---CCCCHHHHHH Q ss_conf 736898-799989999999997200114878549999825760767999999834678311368637---7660359999 Q gi|254781133|r 8 TWGGNA-KPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSGASD---CKSWLSRSVL 83 (159) Q Consensus 8 ~W~~~~-~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~---~~~~~~~ea~ 83 (159) .|.+.+ .+.+.+...+|+......++.+....+ .+..++++||.|+|+.+++.++++|||||+. ..+|++++|+ T Consensus 37 ~wlpW~~~~~s~~d~~~~i~~~~~~~~~g~~~~f--~I~~~~~liG~i~l~~i~~~~~~aeiGYwl~~~~~GkG~~tea~ 114 (179) T PRK10151 37 QSLNWPQYVQSEEDTRKTVQGNVMLHQRGYAKMF--LIFKEDELIGVLSFNRIEPLNKTAYIGYWLDESHQGQGIISQSL 114 (179) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEE--EEEECCEEEEEEEEEEEECCCCEEEEEEEECHHHCCCCHHHHHH T ss_conf 7899999889999999999999999971867489--99999989999999997056676999999887774896899999 Q ss_pred HHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCC----CCEEEEEEEEECHHH Q ss_conf 999998624865079999982642158999987495896676128----941007999862889 Q gi|254781133|r 84 KEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLR----GRNAAENIYVITHEA 143 (159) Q Consensus 84 ~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~----G~~~d~~v~~l~~~e 143 (159) +++++|+|+++++|||++.|.++|.+|++|+||+||+.||+.|++ |+++|+++|+++.++ T Consensus 115 ~~li~~~f~~l~l~Ri~~~~~~~N~~S~~v~ek~Gf~~EG~lr~~~~~~G~~~D~~~ys~i~d~ 178 (179) T PRK10151 115 QALIHHYAQSGELRRFVIKCRVDNPASNQVALRNGFILEGCLKQAEYLNGAYDDVNLYARIIDS 178 (179) T ss_pred HHHHHHHHHHCCEEEEEEEEECCCHHHHHHHHHCCCEEEEEEEEEEEECCEEEEEEEEEEEECC T ss_conf 9999999972891799999918998999999988999999864229998999881308898638 No 3 >TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar. Probab=99.92 E-value=6e-24 Score=157.56 Aligned_cols=131 Identities=9% Similarity=0.031 Sum_probs=114.0 Q ss_pred CCCCEE-EECCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECC--CCCCH Q ss_conf 776205-736898799989999999997200114878549999825760767999999834678311368637--76603 Q gi|254781133|r 2 YNTIEI-TWGGNAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSGASD--CKSWL 78 (159) Q Consensus 2 ~n~~~~-~W~~~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~--~~~~~ 78 (159) .|+.++ .|+.+..|.+.+....|++....+.. . ..+.+..+|++||.++|.++++.++.+|+|+... .+.|+ T Consensus 18 rNdpev~~~~~~~~~~s~ee~~~w~~~~~~~~~---~--~~~ii~~~~~~IG~i~l~~id~~~~~ae~gi~~~~~~~~G~ 92 (156) T TIGR03585 18 RNHPDVRANMYSDHLIDWEEHLHFIEALKQDPN---R--RYWIVCQESRPIGVISFTDINLVHKSAFWGIYANPFCKPGV 92 (156) T ss_pred HCCHHHHHHHCCCCCCCHHHHHHHHHHHHCCCC---C--EEEEEEECCCEEEEEEEEECCCCCCEEEEEEEECHHHCCCH T ss_conf 589999975289887799999999999745997---3--89999979928999988740300398999998222102577 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCC----CCEEEEEEE Q ss_conf 59999999998624865079999982642158999987495896676128----941007999 Q gi|254781133|r 79 SRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLR----GRNAAENIY 137 (159) Q Consensus 79 ~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~----G~~~d~~v~ 137 (159) +++++.++++|||++|+++||.+.|.++|.+|+++++|+||++||++|++ |++.|.++| T Consensus 93 g~~~~~~~l~~aF~~l~l~rl~~~v~~~N~~si~l~ek~GF~~EG~lr~~~~~~g~~~D~i~M 155 (156) T TIGR03585 93 GSVLEEAALEYAFEHLGLHKLSLEVLEFNNKALKLYEKFGFEREGVFRQGIFKEGEYYDVLLM 155 (156) T ss_pred HHHHHHHHHHHHHHHCCEEEEEEEECCCCHHHHHHHHHCCCEEEEEEEEEEEECCEEEEEEEE T ss_conf 999999999999875692699999625689999999978999999988249798999899997 No 4 >COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis] Probab=99.87 E-value=6.4e-21 Score=140.12 Aligned_cols=131 Identities=14% Similarity=0.157 Sum_probs=108.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCC--EEEEEEEEEEEEC--CCCCCCEEEECC---CCCCHHHHHHHHHH Q ss_conf 9998999999999720011487854999982576--0767999999834--678311368637---76603599999999 Q gi|254781133|r 15 PRINQIIADFVAKRIKDCSSGWDRFVSMGILKSN--FLVAGIIYHNYCP--ISRVIELSGASD---CKSWLSRSVLKEIY 87 (159) Q Consensus 15 P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g--~lvG~v~l~~~~~--~~~~~eig~~~~---~~~~~~~ea~~~l~ 87 (159) +.+.+....++..... ......+++....++ ++||.+++.+++. .....++||++. +++|++++|+.+++ T Consensus 45 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~iG~~~~~~~~~~~~~~~~~ig~~l~~~~~g~G~~tea~~~~l 121 (187) T COG1670 45 PTSDEELLRLLAEAWE---DLGGGAFAIELKATGDGELIGVIGLSDIDRAANGDLAEIGYWLDPEYWGKGYATEALRALL 121 (187) T ss_pred CCCHHHHHHHHHHHHH---CCCCCEEEEEEECCCCCCEEEEEEEEECCCCCCCCCEEEEEEECHHHCCCCHHHHHHHHHH T ss_conf 5788999999998763---1588459999802788708999997863465678857999999966638926999999999 Q ss_pred HHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCC----CCEEEEEEEEECHHHHHHHH Q ss_conf 98624865079999982642158999987495896676128----94100799986288986101 Q gi|254781133|r 88 SYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLR----GRNAAENIYVITHEAWMHNK 148 (159) Q Consensus 88 ~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~----G~~~d~~v~~l~~~eW~~~~ 148 (159) +|+|++++++||.+.|++.|.+|+++++|+||+.||..|.+ |.+.|.++|+|++++|...+ T Consensus 122 ~~~f~~~~~~rv~~~~~~~N~~S~rv~ek~Gf~~~g~~~~~~~~~g~~~d~~~~~l~~~~~~~~~ 186 (187) T COG1670 122 DYAFEELGLHRIEATVDPENEASIRVYEKLGFRLEGELRQHEFIKGRWRDTVLYSLLRDEWEAIR 186 (187) T ss_pred HHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEEECEEEECCCCCCEEEEEEEEEEEHHHHHCCC T ss_conf 99750079658999988899999999998799992524101010880689985533576552127 No 5 >PRK10140 putative acetyltransferase YhhY; Provisional Probab=99.82 E-value=2.5e-19 Score=130.95 Aligned_cols=134 Identities=10% Similarity=0.046 Sum_probs=98.3 Q ss_pred CCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEEC--CCCCCCEEEECC---CC Q ss_conf 9776205736898799989999999997200114878549999825760767999999834--678311368637---76 Q gi|254781133|r 1 MYNTIEITWGGNAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCP--ISRVIELSGASD---CK 75 (159) Q Consensus 1 ~~n~~~~~W~~~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~--~~~~~eig~~~~---~~ 75 (159) +||+-.+...+-..|+..+ ..|.++. .+. .+. ..+....||++||.+++...+. ....+++|+.+. ++ T Consensus 20 i~~~~~v~~~t~~~p~p~~--~~~~~~l-~~~-~~~---~~lVa~~dg~vvG~~~~~~~~~~r~~h~a~~gi~V~~~~~g 92 (162) T PRK10140 20 IHAQPEVYHNTLQVPHPSD--HMWQERL-TDR-PGI---KQLVACIDGIVVGHLTIDVQQRPRRSHVADFGICVDSRWKN 92 (162) T ss_pred HHHCHHHHCCCCCCCCCCH--HHHHHHH-HCC-CCC---CEEEEEECCEEEEEEEEECCCCCCEEEEEEEEEEECHHHCC T ss_conf 9849354416067999699--9999998-528-898---47999999999999998726787222699999999988869 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECC----CCCEEEEEEEEECH Q ss_conf 6035999999999862486507999998264215899998749589667612----89410079998628 Q gi|254781133|r 76 SWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRL----RGRNAAENIYVITH 141 (159) Q Consensus 76 ~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~----~G~~~d~~v~~l~~ 141 (159) +|++++++++++++||+.++++||++.|+++|.+|+++++|+||+.||+.++ .|++.|.+.|+.++ T Consensus 93 ~GiG~~Ll~~~i~~a~~~~~l~ri~l~V~~~N~~Ai~Ly~k~GF~~eG~~~~~~~~~G~y~D~~~MaRlr 162 (162) T PRK10140 93 RGVASALMREMIDMCDNWLRVDRIELTVFVDNAPAIKVYKKYGFEIEGTGKKYALRNGEYVDAYYMARVK 162 (162) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEEEEEECCCHHHHHHHHHCCCEEEEEEHHEEEECCEEEEEEEEEEEC T ss_conf 8999999999999997316869999999869999999999889989988701188989998989999839 No 6 >COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.44 E-value=5.9e-12 Score=88.55 Aligned_cols=124 Identities=15% Similarity=0.098 Sum_probs=100.3 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEEC---CCCCCCEEEECC---CCCCHHHHHHHHH Q ss_conf 8799989999999997200114878549999825760767999999834---678311368637---7660359999999 Q gi|254781133|r 13 AKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCP---ISRVIELSGASD---CKSWLSRSVLKEI 86 (159) Q Consensus 13 ~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~---~~~~~eig~~~~---~~~~~~~ea~~~l 86 (159) ..|.+.+.+.+|...+..+ + ..+.+...++|+++|..++..+.. -..++|.++.++ +.+++++..++++ T Consensus 32 ~~~~~~~~~~~~~~~~~~~---g--~p~~V~~~~~g~v~G~a~~~~fr~r~ay~~tve~SiYv~~~~~g~GiG~~Ll~~L 106 (169) T COG1247 32 EDPVSLEERAAWFSGRTRD---G--YPVVVAEEEDGKVLGYASAGPFRERPAYRHTVELSIYLDPAARGKGLGKKLLQAL 106 (169) T ss_pred CCCCCHHHHHHHHHHCCCC---C--CEEEEEECCCCEEEEEEEEEECCCCCCCCEEEEEEEEECCCCCCCCHHHHHHHHH T ss_conf 1699989999999833668---8--5499997479809999985100475565538999999985102466789999999 Q ss_pred HHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECC----CCCEEEEEEEEECHH Q ss_conf 99862486507999998264215899998749589667612----894100799986288 Q gi|254781133|r 87 YSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRL----RGRNAAENIYVITHE 142 (159) Q Consensus 87 ~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~----~G~~~d~~v~~l~~~ 142 (159) ++.+ ..+|++.+.+.+.++|.+|+++++++||++.|++++ +|+|.|.+++.+.-+ T Consensus 107 i~~~-~~~g~~~lva~I~~~n~aSi~lh~~~GF~~~G~~~~vg~k~g~wld~~~~~~~l~ 165 (169) T COG1247 107 ITEA-RALGVRELVAGIESDNLASIALHEKLGFEEVGTFPEVGDKFGRWLDLVLMQLLLE 165 (169) T ss_pred HHHH-HHCCEEEEEEEECCCCCHHHHHHHHCCCEEECCCCCCCCCCCEEEEEEEEEHHHC T ss_conf 9998-8588489999973787376999997797773634534531466876664402221 No 7 >PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional Probab=99.31 E-value=6.5e-11 Score=82.58 Aligned_cols=116 Identities=13% Similarity=0.116 Sum_probs=87.0 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEEEC-CCCCCHHHHHHHHHHHHHH Q ss_conf 879998999999999720011487854999982576076799999983467831136863-7766035999999999862 Q gi|254781133|r 13 AKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSGAS-DCKSWLSRSVLKEIYSYPW 91 (159) Q Consensus 13 ~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~-~~~~~~~~ea~~~l~~~aF 91 (159) +.|+|.+.+.. . +. .....+.+..++++||.+.+.-.........|++.. .++.|+|+..++.+++++ T Consensus 22 ~~PWs~~~f~~---~-~~------~~~~~~~~~~~~~ivGy~~~~~~~de~~i~~iaV~p~~r~~G~g~~Ll~~~~~~~- 90 (144) T PRK09491 22 AFPWSEKTFAS---N-QG------ERYLNLKLTQNGQMAAFAITQVVLDEATLFNIAVDPDYQRQGLGRALLEHLIDEL- 90 (144) T ss_pred CCCCCHHHHHH---H-HH------CCCEEEEEEECCEEEEEEEEEEECCCEEEEEEEECHHHHHCCHHHHHHHHHHHHH- T ss_conf 89999999997---7-55------6917999998998999999998179189999999688864897999999999999- Q ss_pred CCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCCC--C-E-EEEEEEEE Q ss_conf 48650799999826421589999874958966761289--4-1-00799986 Q gi|254781133|r 92 NQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLRG--R-N-AAENIYVI 139 (159) Q Consensus 92 ~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~G--~-~-~d~~v~~l 139 (159) .+.++.++.++|.++|.+++++|+|+||++.|+++.+- . . .|.++|++ T Consensus 91 ~~~g~~~i~LEVr~sN~~A~~lY~k~GF~~~g~R~~YY~~~dg~EDAiiM~l 142 (144) T PRK09491 91 EKRGVATLWLEVRASNAAAIALYESLGFNEVTIRRNYYPTADGREDAIIMAL 142 (144) T ss_pred HHCCCCEEEEEEECCCHHHHHHHHHCCCEEECEECCCCCCCCCCCCCEEEEC T ss_conf 9879979999995787899999998899891788785689789902168862 No 8 >TIGR01575 rimI ribosomal-protein-alanine acetyltransferase; InterPro: IPR006464 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones. Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. Members of this entry belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This entry covers prokaryotes and the archaea. It contains rimI, which catalyses the acetylation of the N-terminal alanine of ribosomal protein S18 .; GO: 0016407 acetyltransferase activity, 0006474 N-terminal protein amino acid acetylation. Probab=99.03 E-value=5e-09 Score=71.69 Aligned_cols=107 Identities=10% Similarity=0.045 Sum_probs=79.3 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEE---EEEEECCCCC-CCEEEECC-CCCCHHHHHHHHHH Q ss_conf 8799989999999997200114878549999825760767999---9998346783-11368637-76603599999999 Q gi|254781133|r 13 AKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGII---YHNYCPISRV-IELSGASD-CKSWLSRSVLKEIY 87 (159) Q Consensus 13 ~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~---l~~~~~~~~~-~eig~~~~-~~~~~~~ea~~~l~ 87 (159) ..|+|++.+..=+.. .. ...+ +....++++||.++ +.++....+. ..|++... ++.|+|+.-++.|+ T Consensus 16 ~~PW~~~~f~~~l~i---~~----~~~y-l~a~~g~~~~Gy~~~~r~~~~~~~Ea~i~~iaV~pe~Qg~GiG~~LL~~L~ 87 (140) T TIGR01575 16 DSPWTEAAFAEELAI---NY----YTCY-LLAEIGGKVVGYIGISRIVQIVLDEAHILNIAVKPEYQGQGIGRALLRELI 87 (140) T ss_pred CCCCCHHHHHHHHCC---CC----CCEE-EEEEECCEEEEEEEEEEEEEEECCCCEEEEEEECCCCCCCCHHHHHHHHHH T ss_conf 999887898865101---33----3014-456628778987765678875437744899988712016637889999999 Q ss_pred HHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCC Q ss_conf 98624865079999982642158999987495896676128 Q gi|254781133|r 88 SYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLR 128 (159) Q Consensus 88 ~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~ 128 (159) +++++. ++..|+++|.++|.+++++|+|+||.+.+++|.+ T Consensus 88 ~~~~~~-~~~~~~LEVR~SN~~A~~LY~k~GF~~~~~r~~Y 127 (140) T TIGR01575 88 KEAKGR-GVNEIFLEVRVSNVAAQALYKKLGFNEIAIRKNY 127 (140) T ss_pred HHHHCC-CCEEEEEEECCHHHHHHHHHHHHCCEEECCCCCC T ss_conf 998518-9838998461406899998777185010104356 No 9 >cd04301 GNAT GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases. Probab=98.97 E-value=5.2e-09 Score=71.62 Aligned_cols=78 Identities=14% Similarity=0.121 Sum_probs=65.3 Q ss_pred EEECCCCEEEEEEEEEEEECCCCCCCEE-EEC---CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHC Q ss_conf 9982576076799999983467831136-863---776603599999999986248650799999826421589999874 Q gi|254781133|r 42 MGILKSNFLVAGIIYHNYCPISRVIELS-GAS---DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSL 117 (159) Q Consensus 42 i~i~~~g~lvG~v~l~~~~~~~~~~eig-~~~---~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ekl 117 (159) +..+.+|++||.+.+....+....++|. +.+ .+++|+++..+..+.+++.+. ++++|.+.|.++|.+|+++++|+ T Consensus 2 ~Vae~~g~ivG~~~~~~~~~~~~~~~i~~i~V~~~~rg~GiG~~L~~~~~~~a~~~-~~~~i~l~v~~~N~~a~~~y~k~ 80 (83) T cd04301 2 LVAEDDGELVGFASFFIYRPAPREAEIEGLAVDPSYRGQGLGSALLDALIDWALER-GARRLYLDTSPDNEAAIAFYEKL 80 (83) T ss_pred EEEEECCEEEEEEEEEEECCCCCEEEEEEEEECHHHCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCHHHHHHHHHC T ss_conf 99999998999999999738999799999999999929993899999999999987-99899999928989999999976 Q ss_pred CCE Q ss_conf 958 Q gi|254781133|r 118 GGI 120 (159) Q Consensus 118 GF~ 120 (159) ||+ T Consensus 81 Gft 83 (83) T cd04301 81 GFT 83 (83) T ss_pred CCC T ss_conf 999 No 10 >PRK10975 TDP-fucosamine acetyltransferase; Provisional Probab=98.97 E-value=2.1e-08 Score=68.10 Aligned_cols=103 Identities=5% Similarity=0.010 Sum_probs=77.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCC-EEEEC-CCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 9999999997200114878549999825760767999999834678311-36863-776603599999999986248650 Q gi|254781133|r 19 QIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIE-LSGAS-DCKSWLSRSVLKEIYSYPWNQLCC 96 (159) Q Consensus 19 e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~e-ig~~~-~~~~~~~~ea~~~l~~~aF~~l~l 96 (159) +.-+.||+..+.+. .+...-+....+|+++|.|.+..++.....+. |++.. .+++|+|+..+.++..||..+ |+ T Consensus 121 rfY~~Wie~aV~G~---fD~~cLv~~~~~g~i~GFvtlr~l~~~~a~IgLlAV~P~~rG~GiG~~Lm~aa~~wa~~~-g~ 196 (231) T PRK10975 121 RFYAQWIENAVRGT---FDHQCLILRDATGQIRGFVSLRELNDTDARIGLLAVFPGAQGRGIGAELMQAALNWAQAR-GL 196 (231) T ss_pred HHHHHHHHHHHCCC---CCCEEEEEECCCCCEEEEEEEEECCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHC-CC T ss_conf 89999999984666---585689998689977899999852798637889889830127988999999999999985-99 Q ss_pred EEEEEEEECCCCHHHHHHHHCCCEEEEEE Q ss_conf 79999982642158999987495896676 Q gi|254781133|r 97 QAVIHRIPDEDYPQHRMLTSLGGIRYRIP 125 (159) Q Consensus 97 ~Ri~~~~~~~N~~S~rv~eklGF~~egv~ 125 (159) .+|.+.+...|.+++++|+|+||+-|.+. T Consensus 197 ~~l~l~tq~~N~~AirLY~~~GF~ie~t~ 225 (231) T PRK10975 197 TTLRVATQMGNTAALKLYIQSGANIESTA 225 (231) T ss_pred CEEEEEEECCCHHHHHHHHHCCCEEEEEE T ss_conf 78999985789999999998698275679 No 11 >COG3981 Predicted acetyltransferase [General function prediction only] Probab=98.93 E-value=1.8e-08 Score=68.51 Aligned_cols=111 Identities=11% Similarity=0.083 Sum_probs=80.7 Q ss_pred HHHHHHHHHHHHHCCCCC--C-----CCEEEEEECCCCEEEEEEEEEEE-ECC--CCCCCEEEECC---CCCCHHHHHHH Q ss_conf 899999999972001148--7-----85499998257607679999998-346--78311368637---76603599999 Q gi|254781133|r 18 NQIIADFVAKRIKDCSSG--W-----DRFVSMGILKSNFLVAGIIYHNY-CPI--SRVIELSGASD---CKSWLSRSVLK 84 (159) Q Consensus 18 ~e~~~~wi~~~~~~~~~~--~-----~~~~ai~i~~~g~lvG~v~l~~~-~~~--~~~~eig~~~~---~~~~~~~ea~~ 84 (159) .+.+++|+..- .+.+.+ . +....+.+..++++||.+.++.. +.. ....+|||.+. +++||++++++ T Consensus 42 ~~~fed~L~~~-~~~~~~~~~~~g~V~~~~y~~v~~d~~ivG~i~lRh~Ln~~ll~~gGHIGY~VrPseR~KGYA~emLk 120 (174) T COG3981 42 QEDFEDWLEDL-TRQEPGNNLPEGWVPASTYWAVDEDGQIVGFINLRHQLNDFLLEEGGHIGYSVRPSERRKGYAKEMLK 120 (174) T ss_pred CCCHHHHHHHH-HCCCCCCCCCCCCEECEEEEEEECCCCEEEEEEEEEECCHHHHHCCCCCCCEECHHHHCCCHHHHHHH T ss_conf 30099999998-60277868999803033689993388089999862100568886278564114763524678999999 Q ss_pred HHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCCCC Q ss_conf 9999862486507999998264215899998749589667612894 Q gi|254781133|r 85 EIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLRGR 130 (159) Q Consensus 85 ~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~G~ 130 (159) .++..| ..+++.+|...|+.+|.||+++.++.|=+.|..+..-|. T Consensus 121 l~L~~a-r~lgi~~Vlvtcd~dN~ASrkvI~~NGGile~~~~~~g~ 165 (174) T COG3981 121 LALEKA-RELGIKKVLVTCDKDNIASRKVIEANGGILENEFFGEGK 165 (174) T ss_pred HHHHHH-HHCCCCEEEEEECCCCCHHHHHHHHCCCEEEEEECCCCC T ss_conf 999999-984997699984799811167787538788677714885 No 12 >pfam00583 Acetyltransf_1 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins. Probab=98.90 E-value=6.8e-09 Score=70.95 Aligned_cols=75 Identities=13% Similarity=0.049 Sum_probs=64.7 Q ss_pred CCCCEEEEEEEEEEEECCCCCCCEE-EEC---CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCE Q ss_conf 2576076799999983467831136-863---776603599999999986248650799999826421589999874958 Q gi|254781133|r 45 LKSNFLVAGIIYHNYCPISRVIELS-GAS---DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGI 120 (159) Q Consensus 45 ~~~g~lvG~v~l~~~~~~~~~~eig-~~~---~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~ 120 (159) +.+|++||.+.+..++.....++|. +.+ .++.++++..+..+++++.+ .++++|.+.+.+.|.+|+++++|+||+ T Consensus 2 ~~~g~ivG~~~~~~~~~~~~~~~i~~l~V~p~~rg~GiG~~Ll~~~~~~~~~-~g~~~i~l~~~~~n~~a~~~y~k~GF~ 80 (80) T pfam00583 2 EEDGELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGIGTALLEALEEYARE-LGLKRIELEVLEDNEAAIALYEKLGFK 80 (80) T ss_pred CCCCEEEEEEEEEEECCCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHHHH-CCCCEEEEEECCCCHHHHHHHHHCCCC T ss_conf 6789899999999964899999999999998995998799999999999998-799999999927879999999986979 No 13 >PRK03624 putative acetyltransferase; Provisional Probab=98.76 E-value=8.1e-08 Score=64.76 Aligned_cols=82 Identities=10% Similarity=-0.031 Sum_probs=61.9 Q ss_pred EEEECCCCEEEEEEEEEEEECCCCC-CCEEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 9998257607679999998346783-1136863-7766035999999999862486507999998264215899998749 Q gi|254781133|r 41 SMGILKSNFLVAGIIYHNYCPISRV-IELSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 41 ai~i~~~g~lvG~v~l~~~~~~~~~-~eig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) .+....+|++||.+... .+...+. ..|++.. .+++|+|+..+..+.+++ .+.|+++|.+.|.++|.+++++|+|+| T Consensus 47 ~lVae~~g~ivG~~~~~-~~~~~~~i~~laV~p~~Rg~GiG~~Ll~~~~~~~-~~~G~~~i~L~v~~~N~~A~~fY~~~G 124 (140) T PRK03624 47 FLVAEVGGRVVGTVMGG-YDGHRGWLYYLAVHPDFRGRGIGRALLAHAEKKL-IARGCPKINLQVREDNDAVLGFYEALG 124 (140) T ss_pred EEEEEECCEEEEEEEEE-EECCEEEEEEEEECHHHHHCCHHHHHHHHHHHHH-HHCCCCEEEEEEECCCHHHHHHHHHCC T ss_conf 99999999999999999-8177899999888876765389999999999999-976999999999088599999999889 Q ss_pred CEEEEE Q ss_conf 589667 Q gi|254781133|r 119 GIRYRI 124 (159) Q Consensus 119 F~~egv 124 (159) |++++. T Consensus 125 F~~~~r 130 (140) T PRK03624 125 YEEEDR 130 (140) T ss_pred CEECCE T ss_conf 938328 No 14 >KOG4135 consensus Probab=98.67 E-value=1e-07 Score=64.17 Aligned_cols=116 Identities=11% Similarity=0.111 Sum_probs=77.3 Q ss_pred CCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEC-----C-----CCEEEEEEEEEEEE-CCC-------- Q ss_conf 7620573689879998999999999720011487854999982-----5-----76076799999983-467-------- Q gi|254781133|r 3 NTIEITWGGNAKPRINQIIADFVAKRIKDCSSGWDRFVSMGIL-----K-----SNFLVAGIIYHNYC-PIS-------- 63 (159) Q Consensus 3 n~~~~~W~~~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~-----~-----~g~lvG~v~l~~~~-~~~-------- 63 (159) .+.+++=++..+|.|-+.--+-- ..|... .+-+++.+. + -...||.+.++-.+ +.+ T Consensus 32 knEelr~LT~SE~LtLdeEyeMQ----~sW~~D-eDKlTFIVLdaE~~ea~~~ev~~MvGDvNlFlt~~~~~~n~s~~~~ 106 (185) T KOG4135 32 KNEELRRLTASEPLTLDEEYEMQ----KSWRED-EDKLTFIVLDAEMNEAGEDEVDHMVGDVNLFLTTSPDTENPSDDVI 106 (185) T ss_pred HHHHHHHHHCCCCCCHHHHHHHH----HHHCCC-CCCEEEEEEECHHCCCCCHHHHHHCCCEEEEEECCCCCCCCCCCEE T ss_conf 25888875427886522788866----312047-7614899983020235750576641330368752777688643425 Q ss_pred -CCCCEEEE--CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEE Q ss_conf -83113686--3776603599999999986248650799999826421589999874958966 Q gi|254781133|r 64 -RVIELSGA--SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYR 123 (159) Q Consensus 64 -~~~eig~~--~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~eg 123 (159) +-+|+-++ ..++.|+|++|+.+++.|+..++++.+.++.+..+|.||++++.|++|++.. T Consensus 107 ~gE~EvMIAEP~~RgKG~G~eav~~ml~y~~s~l~l~Ky~vkig~~nk~sl~lFkk~~f~q~~ 169 (185) T KOG4135 107 TGEVEVMIAEPRGRGKGIGTEAVRAMLAYAYSVLKLDKYEVKIGMDNKPSLRLFKKFLFTQVF 169 (185) T ss_pred EEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHEEEE T ss_conf 455899980555567773188999999999987402048997258886278999975340453 No 15 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=98.57 E-value=1.4e-06 Score=57.70 Aligned_cols=88 Identities=9% Similarity=-0.182 Sum_probs=61.4 Q ss_pred CCCEEEEEECCCCEEEEEEEEEEEECCCCCCCE---EEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHH Q ss_conf 785499998257607679999998346783113---686-3776603599999999986248650799999826421589 Q gi|254781133|r 36 WDRFVSMGILKSNFLVAGIIYHNYCPISRVIEL---SGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQH 111 (159) Q Consensus 36 ~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~ei---g~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~ 111 (159) ...++......+|+++|.+-..-........|| |+. .++.+|+++..+.+.+++.-. -|+.++.+.|+.+|+++. T Consensus 197 P~g~lla~~~~~g~lvGF~wtk~h~~~~~~GEvYvvGV~p~~rGrGLG~aL~~~gL~~l~~-~G~~~v~LyVd~dN~~A~ 275 (292) T TIGR03448 197 PAGLFLAFDDAPGELLGFHWTKVHPDEPALGEVYVVGVDPAAQGRGLGDALTLIGLHHLAA-RGLPAVMLYVEADNEAAV 275 (292) T ss_pred CCCCEEEEECCCCEEEEEEEEEECCCCCCCCEEEEEEECHHHCCCCHHHHHHHHHHHHHHH-CCCCEEEEEECCCCHHHH T ss_conf 4773899847998399999998637888852599999883424776789999999999998-799889998378888999 Q ss_pred HHHHHCCCEEEEE Q ss_conf 9998749589667 Q gi|254781133|r 112 RMLTSLGGIRYRI 124 (159) Q Consensus 112 rv~eklGF~~egv 124 (159) ++|++|||+...+ T Consensus 276 ~lY~~LGF~~~~~ 288 (292) T TIGR03448 276 RTYEKLGFTVAEV 288 (292) T ss_pred HHHHHCCCEEEEE T ss_conf 9998569958400 No 16 >PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional Probab=98.52 E-value=3.1e-06 Score=55.65 Aligned_cols=85 Identities=12% Similarity=-0.069 Sum_probs=63.0 Q ss_pred EEEEEECCCCEEEEEEEEEEEECCCC---CCCE---EEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHH Q ss_conf 49999825760767999999834678---3113---686-3776603599999999986248650799999826421589 Q gi|254781133|r 39 FVSMGILKSNFLVAGIIYHNYCPISR---VIEL---SGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQH 111 (159) Q Consensus 39 ~~ai~i~~~g~lvG~v~l~~~~~~~~---~~ei---g~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~ 111 (159) ...+....+|++||+++++-...-.. .++| -+. ..+..|+|+..+..+.++| .+.|+.+|++.+...|..++ T Consensus 47 ~~~~Vae~dG~vvG~~~l~~~~~l~~~~~~~~I~~l~V~~~~Rg~GiG~~Ll~~a~~~A-r~~G~~~~~L~s~~~r~~Ah 125 (144) T PRK10146 47 MRYHLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEA-RQAGAEMTELSTNVKRHDAH 125 (144) T ss_pred CEEEEEEECCEEEEEEEEEEECCCCCCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEECCCCCHHHH T ss_conf 48999998997999999999635246883699999999856417699999999999999-98699189998499678999 Q ss_pred HHHHHCCCEEEEE Q ss_conf 9998749589667 Q gi|254781133|r 112 RMLTSLGGIRYRI 124 (159) Q Consensus 112 rv~eklGF~~egv 124 (159) +.|+|+||++-+. T Consensus 126 ~FY~r~Gf~~~~~ 138 (144) T PRK10146 126 RFYLREGYEQSHF 138 (144) T ss_pred HHHHHCCCEEECC T ss_conf 9999869963354 No 17 >KOG3139 consensus Probab=98.46 E-value=3.2e-06 Score=55.57 Aligned_cols=101 Identities=12% Similarity=0.065 Sum_probs=71.9 Q ss_pred CCCCEEEEEECCCCEEEEEEEEEEEECCC-CCCCEE---EE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCH Q ss_conf 87854999982576076799999983467-831136---86-37766035999999999862486507999998264215 Q gi|254781133|r 35 GWDRFVSMGILKSNFLVAGIIYHNYCPIS-RVIELS---GA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYP 109 (159) Q Consensus 35 ~~~~~~ai~i~~~g~lvG~v~l~~~~~~~-~~~eig---~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~ 109 (159) .|+.++.+...+++..||.+.-..-...+ ..++|. +. -.++.|.+++.++.+++++ ...|+..|.+.+.+.|.+ T Consensus 53 ~wp~~~~~a~d~~~~~VGai~ck~~~~r~~~rgyi~mLaV~~e~Rg~GIg~aLvr~aId~m-~~~g~~eVvLeTe~~n~~ 131 (165) T KOG3139 53 NWPCFCFLALDEKGDTVGAIVCKLDTHRNTLRGYIAMLAVDSEYRGQGIGKALVRKAIDAM-RSRGYSEVVLETEVTNLS 131 (165) T ss_pred CCCEEEEEEECCCCCEEEEEEEECCCCCCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEECCCCCHH T ss_conf 7866899999078866889998344557764489999996125334568999999999999-977996899832433418 Q ss_pred HHHHHHHCCCEEEE-EECCC--CCEEEEEEEE Q ss_conf 89999874958966-76128--9410079998 Q gi|254781133|r 110 QHRMLTSLGGIRYR-IPRLR--GRNAAENIYV 138 (159) Q Consensus 110 S~rv~eklGF~~eg-v~R~~--G~~~d~~v~~ 138 (159) +.+++++|||++.+ .++.+ |. |.+-+- T Consensus 132 A~~LY~sLGF~r~~r~~~YYlng~--dA~rl~ 161 (165) T KOG3139 132 ALRLYESLGFKRDKRLFRYYLNGM--DALRLK 161 (165) T ss_pred HHHHHHHCCCEEECCEEEEEECCC--CEEEEE T ss_conf 999988549467331247897785--148998 No 18 >PTZ00330 acetyltransferase; Provisional Probab=98.34 E-value=1.9e-05 Score=51.05 Aligned_cols=105 Identities=14% Similarity=0.023 Sum_probs=67.7 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCC------CC-EEEE-CCCCCCHHHHHH Q ss_conf 987999899999999972001148785499998257607679999998346783------11-3686-377660359999 Q gi|254781133|r 12 NAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRV------IE-LSGA-SDCKSWLSRSVL 83 (159) Q Consensus 12 ~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~------~e-ig~~-~~~~~~~~~ea~ 83 (159) ..+|.+.+...++....... .+....+....+|++||+..+.-...-... +| +-+. -.+++|+|+..+ T Consensus 29 ~~~~~~~e~~~~~~~~~~~~----~~~~~v~Vae~~g~iVg~~~l~i~~~~~~~g~~~~~iEdvvV~~~~RG~GiG~~L~ 104 (147) T PTZ00330 29 SAPALSQEELEQIAARRRLA----GVVTRVFVHSPTQRIVGTASLFVEPKFTRGGKCVGHIEDVVVDPSYRGQGLGRALI 104 (147) T ss_pred CCCCCCHHHHHHHHHHHHCC----CCCEEEEEEECCCEEEEEEEEEEEEEEEECCCCEEEEEEEEECHHHCCCCHHHHHH T ss_conf 38997889999999987347----99439999952997999999999921331795369998899987765888899999 Q ss_pred HHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEE Q ss_conf 99999862486507999998264215899998749589667 Q gi|254781133|r 84 KEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRI 124 (159) Q Consensus 84 ~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv 124 (159) ..++++| .+.||.+|.+.| |...+..|+|+||++.++ T Consensus 105 ~~l~~~a-~~~gc~kv~L~~---~~~~~~FY~k~Gf~~~~~ 141 (147) T PTZ00330 105 SDLCEIA-RSSGCYKVILDC---TEDMVAFYKKLGFRACER 141 (147) T ss_pred HHHHHHH-HHCCCEEEEEEC---CCHHHHHHHHCCCEECCC T ss_conf 9999999-987997999947---951679999869997420 No 19 >pfam08445 FR47 FR47-like protein. The members of this family are similar to the C-terminal region of the D. melanogaster hypothetical protein FR47. This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. Probab=98.19 E-value=9.7e-06 Score=52.80 Aligned_cols=74 Identities=12% Similarity=-0.006 Sum_probs=55.1 Q ss_pred EEECCCCEEEEEEEEEEEECCCCCCCEEE----ECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHC Q ss_conf 99825760767999999834678311368----63776603599999999986248650799999826421589999874 Q gi|254781133|r 42 MGILKSNFLVAGIIYHNYCPISRVIELSG----ASDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSL 117 (159) Q Consensus 42 i~i~~~g~lvG~v~l~~~~~~~~~~eig~----~~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ekl 117 (159) +..+++|++++.+-..+. .+|+. +-++++|+++..++++++.+.+ .|. .+.+.+.++|.||++++||+ T Consensus 4 lf~~~~~el~aW~l~~~~------G~i~~l~v~Pe~rgrG~G~~L~~~~~~~~~~-~G~-~~~l~V~~~N~~a~~lYekl 75 (86) T pfam08445 4 IYRGDTGELAAWCLRLPG------GELGALQTLPEHRRRGLGSRLVAALARGIAE-RGI-TPFAVVVAGNTPSRRLYEKL 75 (86) T ss_pred EEECCCCCEEEEEEECCC------CCEEEEEECHHHCCCCHHHHHHHHHHHHHHH-CCC-CEEEEEECCCHHHHHHHHHC T ss_conf 787899989999998899------7689999998983698899999999999997-699-46999987986899999986 Q ss_pred CCEEEE Q ss_conf 958966 Q gi|254781133|r 118 GGIRYR 123 (159) Q Consensus 118 GF~~eg 123 (159) ||++.. T Consensus 76 GF~~v~ 81 (86) T pfam08445 76 GFRKID 81 (86) T ss_pred CCEEEE T ss_conf 997914 No 20 >COG0456 RimI Acetyltransferases [General function prediction only] Probab=97.93 E-value=0.00017 Score=45.60 Aligned_cols=80 Identities=10% Similarity=0.044 Sum_probs=58.8 Q ss_pred EEEEEEEEEEEECC------CCCCCEEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEE Q ss_conf 07679999998346------7831136863-7766035999999999862486507999998264215899998749589 Q gi|254781133|r 49 FLVAGIIYHNYCPI------SRVIELSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIR 121 (159) Q Consensus 49 ~lvG~v~l~~~~~~------~~~~eig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~ 121 (159) +++|.+........ .....|++.. .++.|+++..+..++..+.+......+.+.|.++|.+++++++|+||+. T Consensus 72 ~~~G~~~~~~~~~~~~~~~~~~i~~iaV~p~~r~~Gig~~Ll~~~~~~~~~~~~~~~~~L~V~~~N~~Ai~lY~~~GF~~ 151 (177) T COG0456 72 KVVGFLLVRVVDGRPSADHEGHIYNLAVDPEYRGRGIGRALLDEALERLRERGLADKIVLEVRESNEAAIGLYRKLGFEV 151 (177) T ss_pred CEEEEEEEEEECCCCCCCCCEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHCCCEE T ss_conf 04589999740476566760899999999787699899999999999998527673699999478699999999869943 Q ss_pred EEEECCC Q ss_conf 6676128 Q gi|254781133|r 122 YRIPRLR 128 (159) Q Consensus 122 egv~R~~ 128 (159) .+....+ T Consensus 152 ~~~~~~y 158 (177) T COG0456 152 VKIRKNY 158 (177) T ss_pred EHHHHHH T ss_conf 1213754 No 21 >PRK10514 hypothetical protein; Provisional Probab=97.82 E-value=0.00014 Score=46.06 Aligned_cols=77 Identities=5% Similarity=-0.033 Sum_probs=52.0 Q ss_pred EEECCCCEEEEEEEEEEEECCCCCCCEEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCE Q ss_conf 9982576076799999983467831136863-776603599999999986248650799999826421589999874958 Q gi|254781133|r 42 MGILKSNFLVAGIIYHNYCPISRVIELSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGI 120 (159) Q Consensus 42 i~i~~~g~lvG~v~l~~~~~~~~~~eig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~ 120 (159) +...++|+++|.+++.+ +....|-+.. .+++|+|+ +++++|....+ .+.+.|...|.++++.|+|.||+ T Consensus 53 v~~~~~~~~~GFi~l~~----~~i~~LFV~p~~~g~GiG~----~Ll~~a~~~~~--~l~L~V~~~N~~A~~FY~~~GF~ 122 (146) T PRK10514 53 VAVDERDQPVGFMLLSG----GHMEALFVDPDVRGCGVGR----MLVEHALTLHP--ELTTDVNEQNEQAVGFYKKMGFK 122 (146) T ss_pred EEEECCCCEEEEEEECC----CEEEEEEECHHHCCCCHHH----HHHHHHHHHCC--CCEEEEECCCHHHHHHHHHCCCE T ss_conf 99966995699999788----9887899878773999799----99999987385--40985001586899999977998 Q ss_pred EEEEECCC Q ss_conf 96676128 Q gi|254781133|r 121 RYRIPRLR 128 (159) Q Consensus 121 ~egv~R~~ 128 (159) ..|....- T Consensus 123 ~~gr~~~d 130 (146) T PRK10514 123 VTGRSEVD 130 (146) T ss_pred EECCCCCC T ss_conf 97135679 No 22 >PRK10562 hypothetical protein; Provisional Probab=97.81 E-value=0.00029 Score=44.31 Aligned_cols=88 Identities=8% Similarity=0.004 Sum_probs=55.8 Q ss_pred EEEECCCCEEEEEEEEEEEECCCCCCC-EEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 999825760767999999834678311-36863-7766035999999999862486507999998264215899998749 Q gi|254781133|r 41 SMGILKSNFLVAGIIYHNYCPISRVIE-LSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 41 ai~i~~~g~lvG~v~l~~~~~~~~~~e-ig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) .+...++|+++|.+++.+ ...++ |-+.. .+++|+|++ |++++-... ..+.+.|...|.++++.|+|.| T Consensus 50 ~~V~e~~g~ivGFi~~~~----~~~i~~LfV~p~~~g~GiG~~----Ll~~a~~~~--~~l~L~v~~~N~~A~~FY~~~G 119 (145) T PRK10562 50 TWVWEEDGKLLGFVSILE----GRFVGALFVAPKAHRRGIGKA----LMQYVQQRY--PHLSLEVYQKNQRAVNFYHAQG 119 (145) T ss_pred EEEEEECCEEEEEEEECC----CCEEEEEEECHHHCCCCHHHH----HHHHHHHHC--CCEEEEECCCCHHHHHHHHHCC T ss_conf 999996996899999569----997611688764337889999----999999638--9349998257978999999889 Q ss_pred CEEEEEECCCCCEEEEEEEE Q ss_conf 58966761289410079998 Q gi|254781133|r 119 GIRYRIPRLRGRNAAENIYV 138 (159) Q Consensus 119 F~~egv~R~~G~~~d~~v~~ 138 (159) |+..+.........-.+.|. T Consensus 120 F~~~~~~~~~~tg~p~~~m~ 139 (145) T PRK10562 120 FRIVDCAWQEETQHPTWIMS 139 (145) T ss_pred CEEEEEECCCCCCCCEEEEE T ss_conf 99962336888899159999 No 23 >TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase; InterPro: IPR012752 This entry represents the WecD protein (formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetyltransferases.. Probab=97.57 E-value=0.00012 Score=46.48 Aligned_cols=103 Identities=6% Similarity=-0.079 Sum_probs=79.0 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCEEEEEEC-CCCEEEEEEEEEEEECCCCCCC-EEEEC-CCCCCHHHHHHHHHHHHHHCC Q ss_conf 98999999999720011487854999982-5760767999999834678311-36863-776603599999999986248 Q gi|254781133|r 17 INQIIADFVAKRIKDCSSGWDRFVSMGIL-KSNFLVAGIIYHNYCPISRVIE-LSGAS-DCKSWLSRSVLKEIYSYPWNQ 93 (159) Q Consensus 17 t~e~~~~wi~~~~~~~~~~~~~~~ai~i~-~~g~lvG~v~l~~~~~~~~~~e-ig~~~-~~~~~~~~ea~~~l~~~aF~~ 93 (159) |-..-+.||+..+.+. .++-|+.++ .+|++.|.|.|++.+..+-.+. ++.+. -.-+|.|.+.|.+...|| .+ T Consensus 90 SgRFYA~WvEnAV~GT----FDHqCL~~~da~g~p~G~VtLR~L~d~dARIGLLav~PG~~~rGiG~~LM~~A~~Wc-~~ 164 (201) T TIGR02382 90 SGRFYAQWVENAVLGT----FDHQCLLLRDAAGDPRGYVTLRELDDSDARIGLLAVFPGVTIRGIGARLMALAKAWC-RR 164 (201) T ss_pred CCCHHHHHHHHHCCCC----CCCHHHHHHCCCCCCEEEEEEEECCCCCCCEECCCCCCCCCEECHHHHHHHHHHHHH-HH T ss_conf 3213888888650567----640135222447794046874004787653301003787412045689999999999-87 Q ss_pred CCEEEEEEEEECCCCHHHHHHHHCCCEEEEE Q ss_conf 6507999998264215899998749589667 Q gi|254781133|r 94 LCCQAVIHRIPDEDYPQHRMLTSLGGIRYRI 124 (159) Q Consensus 94 l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv 124 (159) -|++|+-.-+-.+|.++.|+|-+-|-.-|-+ T Consensus 165 ~Gl~rLRVATQ~gN~AALrlYirsGA~iEST 195 (201) T TIGR02382 165 RGLIRLRVATQMGNVAALRLYIRSGASIEST 195 (201) T ss_pred CCCEEEEHHHHHHHHHHHHHHHHCCCCEECH T ss_conf 5982440333246799999998638970000 No 24 >COG3393 Predicted acetyltransferase [General function prediction only] Probab=97.56 E-value=0.00075 Score=41.93 Aligned_cols=84 Identities=7% Similarity=-0.118 Sum_probs=59.3 Q ss_pred EEEEEECCCCEEEEEEEEEEEECCCCCCCEEEE---CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHH Q ss_conf 499998257607679999998346783113686---37766035999999999862486507999998264215899998 Q gi|254781133|r 39 FVSMGILKSNFLVAGIIYHNYCPISRVIELSGA---SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLT 115 (159) Q Consensus 39 ~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~---~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~e 115 (159) ...+-..++|++|...+.....+....+. |++ -++++||++..+.+|..-..++-..+ .+.+..+|..+.++|+ T Consensus 177 ~~~~f~~~d~~iVa~A~t~a~~~~~~~I~-gV~T~peyR~kGyAt~lva~L~~~lL~eGk~~--~L~~~~~N~~A~~iY~ 253 (268) T COG3393 177 SRTYFLEGDGKIVAKAETAAENPAYAQIN-GVYTHPEYRGKGYATALVATLAAKLLAEGKIP--CLFVNSDNPVARRIYQ 253 (268) T ss_pred EEEEEECCCCCEEEEEECCCCCCCCEEEE-EEECCHHHCCCCHHHHHHHHHHHHHHHCCCEE--EEEEECCCHHHHHHHH T ss_conf 05799826883898620056688616898-78719787265489999999999998579814--8998347878899999 Q ss_pred HCCCEEEEEE Q ss_conf 7495896676 Q gi|254781133|r 116 SLGGIRYRIP 125 (159) Q Consensus 116 klGF~~egv~ 125 (159) |+||+.-|-. T Consensus 254 riGF~~~g~~ 263 (268) T COG3393 254 RIGFREIGEF 263 (268) T ss_pred HHCCEECCEE T ss_conf 8097142417 No 25 >PRK07757 acetyltransferase; Provisional Probab=97.56 E-value=0.00061 Score=42.43 Aligned_cols=73 Identities=11% Similarity=0.052 Sum_probs=56.1 Q ss_pred EECCCCEEEEEEEEEEEECCCCCCCEEE---E-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 9825760767999999834678311368---6-37766035999999999862486507999998264215899998749 Q gi|254781133|r 43 GILKSNFLVAGIIYHNYCPISRVIELSG---A-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 43 ~i~~~g~lvG~v~l~~~~~~~~~~eig~---~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) ..+.||++||++.++.... ..+||.. . -.++.|+|+..+..+.+.| .+.|+.+|.+.+.. -..++++| T Consensus 45 Vae~dg~IvGc~aL~~~~~--~~aEi~~l~V~p~~rg~GiG~~Ll~~l~~~A-r~~G~~~lf~LTt~-----~~fF~~~G 116 (152) T PRK07757 45 VAEEDGEIVGCCALHILWE--DLAEIRSLAVSEEYRGKGIGRMLVEACLEEA-RELGVKRVFALTYQ-----PEFFEKLG 116 (152) T ss_pred EEEECCEEEEEEEEEECCC--CCEEEEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEEECC-----HHHHHHCC T ss_conf 9999999999999897799--9110677898376628888999999999999-98699999990586-----67898789 Q ss_pred CEEEE Q ss_conf 58966 Q gi|254781133|r 119 GIRYR 123 (159) Q Consensus 119 F~~eg 123 (159) |++.- T Consensus 117 F~~~~ 121 (152) T PRK07757 117 FREVD 121 (152) T ss_pred CEECC T ss_conf 98888 No 26 >TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein. Probab=97.55 E-value=0.00052 Score=42.83 Aligned_cols=93 Identities=9% Similarity=0.063 Sum_probs=63.0 Q ss_pred CCCCEEEEEECCCCEEEEEEEEEEE-----ECCCCCC--CEEEECC-CCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECC Q ss_conf 8785499998257607679999998-----3467831--1368637-766035999999999862486507999998264 Q gi|254781133|r 35 GWDRFVSMGILKSNFLVAGIIYHNY-----CPISRVI--ELSGASD-CKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDE 106 (159) Q Consensus 35 ~~~~~~ai~i~~~g~lvG~v~l~~~-----~~~~~~~--eig~~~~-~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~ 106 (159) +...++.-+..++|.+||++.=-+. |+.+++- .+.+..+ ..-|.+.+.++.+.++ |...|+.-+-+.|.-+ T Consensus 121 ~~~~y~vA~d~~tg~ivg~v~gvdH~~af~dpe~gsSlW~LaVdpqa~~pgvGeaLvr~lae~-~~~rg~~~~DLSV~hd 199 (547) T TIGR03103 121 RAITYLVAEDEASGAIIGTVMGVDHRKAFNDPEHGSSLWCLAVDPQAAHPGVGEALVRALAEH-FQSRGCAYMDLSVMHD 199 (547) T ss_pred CCCEEEEEEECCCCCEEEEEECCCHHHHCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHH-HHHCCCCEEEEEECCC T ss_conf 872599986079995799986542686528976783136886668889998129999999999-9876931051101048 Q ss_pred CCHHHHHHHHCCCEEEE---EECCC Q ss_conf 21589999874958966---76128 Q gi|254781133|r 107 DYPQHRMLTSLGGIRYR---IPRLR 128 (159) Q Consensus 107 N~~S~rv~eklGF~~eg---v~R~~ 128 (159) |..++++|+||||++.- ++|+. T Consensus 200 N~~Ai~lY~kLgF~~~p~f~vKrkn 224 (547) T TIGR03103 200 NEQAIALYEKLGFRRIPVFALKRKN 224 (547) T ss_pred CHHHHHHHHHHCCEECCEEEEECCC T ss_conf 6889999997197556457886057 No 27 >TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase; InterPro: IPR012772 This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analogous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.; GO: 0008415 acyltransferase activity, 0019491 ectoine biosynthetic process. Probab=97.46 E-value=0.00044 Score=43.26 Aligned_cols=83 Identities=11% Similarity=-0.031 Sum_probs=55.5 Q ss_pred EEEEC-CCCEEEEEEEEE-EEECCCCCCC--EEEEC-CCCCCHHHHHHHHHHHHHHCCCCEE---EEEEEEECCCCHHHH Q ss_conf 99982-576076799999-9834678311--36863-7766035999999999862486507---999998264215899 Q gi|254781133|r 41 SMGIL-KSNFLVAGIIYH-NYCPISRVIE--LSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQ---AVIHRIPDEDYPQHR 112 (159) Q Consensus 41 ai~i~-~~g~lvG~v~l~-~~~~~~~~~e--ig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~---Ri~~~~~~~N~~S~r 112 (159) +|..+ ++|+++|.|.=+ .-+..+..+- +++.. .+..|++++++.+++.=---+-..| .||+++-|+|.||.+ T Consensus 41 siVae~egg~i~GFV~GY~~P~~Pd~LFVWQVAV~~~~RG~GLA~rll~all~R~~~~~v~~~I~~~EtTiTPdN~AS~a 120 (162) T TIGR02406 41 SIVAEEEGGEIVGFVSGYLKPDRPDVLFVWQVAVDPKARGKGLASRLLEALLERVALERVRHCIQAVETTITPDNEASRA 120 (162) T ss_pred EEEEECCCCEEEEEEEEEECCCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHH T ss_conf 27653799903358862126799978889875427131431689999999761033313798210466210766668999 Q ss_pred HHHHCCCEEEE Q ss_conf 99874958966 Q gi|254781133|r 113 MLTSLGGIRYR 123 (159) Q Consensus 113 v~eklGF~~eg 123 (159) ++.+|+=+.+- T Consensus 121 LF~~La~~~~~ 131 (162) T TIGR02406 121 LFKKLARRRGV 131 (162) T ss_pred HHHHHHHHCCC T ss_conf 99987654287 No 28 >PRK09831 hypothetical protein; Provisional Probab=97.22 E-value=0.0027 Score=38.70 Aligned_cols=109 Identities=11% Similarity=0.023 Sum_probs=63.5 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEE-EC-CCCCCHHHHHHHHHHHH Q ss_conf 98799989999999997200114878549999825760767999999834678311368-63-77660359999999998 Q gi|254781133|r 12 NAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSG-AS-DCKSWLSRSVLKEIYSY 89 (159) Q Consensus 12 ~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~-~~-~~~~~~~~ea~~~l~~~ 89 (159) +...||++++++|.......|.........+....+|++||.+.+. ++.+..-| .. ..++|.++..+.++.. T Consensus 26 ~~~~Ys~~Q~~aWa~~~~~~w~~~l~~~~~~VA~~~~~ivGF~~~~-----~~~id~lfV~p~~~g~Gvg~~Ll~~l~~- 99 (147) T PRK09831 26 ASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFITCI-----EHYIDMLFVDPEYTRRGVASALLKPLIK- 99 (147) T ss_pred CCCCCCHHHHHHHHCCCHHHHHHHHHHCCEEEEEECCEEEEEEECC-----CCEEEEEEECHHHCCCCHHHHHHHHHHH- T ss_conf 4012999999988679989999987509569999999988988548-----8808999988665389899999999984- Q ss_pred HHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECC--CCCEEEE Q ss_conf 62486507999998264215899998749589667612--8941007 Q gi|254781133|r 90 PWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRL--RGRNAAE 134 (159) Q Consensus 90 aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~--~G~~~d~ 134 (159) +..++...+ +..+...++|.||+.+..... .|..-.- T Consensus 100 -----~~~~l~~~a---S~~A~pFFe~~Gf~vv~~q~v~~~G~~l~n 138 (147) T PRK09831 100 -----SESELTVDA---SITAKPFFERYGFQTVKQQRVECRGAWFTN 138 (147) T ss_pred -----HCCCEEEEH---HHHHHHHHHHCCCEEEEEEEEEECCEEECC T ss_conf -----172058631---576388899769799877568569989707 No 29 >KOG3216 consensus Probab=97.21 E-value=0.0032 Score=38.27 Aligned_cols=85 Identities=12% Similarity=0.081 Sum_probs=60.5 Q ss_pred CEEEEEECC-CCEEEEEEEEEE-EECCCCCCCEEE-----E-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCH Q ss_conf 549999825-760767999999-834678311368-----6-37766035999999999862486507999998264215 Q gi|254781133|r 38 RFVSMGILK-SNFLVAGIIYHN-YCPISRVIELSG-----A-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYP 109 (159) Q Consensus 38 ~~~ai~i~~-~g~lvG~v~l~~-~~~~~~~~eig~-----~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~ 109 (159) .+.++.+.. ++.++|.+.+.. .+.-.+.-.|.. . -.++.|+|+..++.+.+-| ..+|+.|+++.|..-|++ T Consensus 53 ~~~v~~ie~~~~~~aGf~~yf~~ystW~~k~~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A-~~~G~~rv~w~vldwN~r 131 (163) T KOG3216 53 HWLVAAIETSGEVVAGFALYFNNYSTWLGKQGIYLEDLYVREQYRGKGIGSKLLKFVAEEA-DKLGTPRVEWVVLDWNHR 131 (163) T ss_pred EEEEEEEECCCCCEEEEEEEECCCCCCCCCCEEEEEEEEECCHHCCCCHHHHHHHHHHHHH-HHCCCCCEEEEEECCCHH T ss_conf 8999998248981467765511566300344078876472532203576899999999999-973997079998124616 Q ss_pred HHHHHHHCCCEEEE Q ss_conf 89999874958966 Q gi|254781133|r 110 QHRMLTSLGGIRYR 123 (159) Q Consensus 110 S~rv~eklGF~~eg 123 (159) ++.+++++|.+.-. T Consensus 132 Ai~lY~k~gaq~l~ 145 (163) T KOG3216 132 AILLYEKVGAQDLK 145 (163) T ss_pred HHHHHHHHCCCCCC T ss_conf 89999985733434 No 30 >PRK07922 N-acetylglutamate synthase; Validated Probab=97.17 E-value=0.0031 Score=38.39 Aligned_cols=75 Identities=9% Similarity=0.020 Sum_probs=55.9 Q ss_pred EEEEECCCCEEEEEEEEEEEECCCCCCCEEE-EC---CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHH Q ss_conf 9999825760767999999834678311368-63---7766035999999999862486507999998264215899998 Q gi|254781133|r 40 VSMGILKSNFLVAGIIYHNYCPISRVIELSG-AS---DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLT 115 (159) Q Consensus 40 ~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~-~~---~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~e 115 (159) +.+....||++||++.|+-+.. ..+||.. +. .+++|.|+..+..+.+-| .++|+.||.+.+..- ..++ T Consensus 48 F~Vae~~DG~IvGCaAL~~~~~--dlAEIrsLAV~p~~rg~G~G~~Lv~~l~~~A-r~lGi~~vFvLT~~~-----~fF~ 119 (170) T PRK07922 48 FWVAEHLDGEVVGCGALHVLWE--DLAEVRTVAVDPAMRGHGVGHAIVERLLDVA-RELGLSRVFVLTFEV-----EFFA 119 (170) T ss_pred EEEEEECCCCEEEEEEEEECCC--CHHHHEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEEECCH-----HHHH T ss_conf 8999945996899997464565--3213044588787818984999999999999-985998699997836-----8999 Q ss_pred HCCCEEE Q ss_conf 7495896 Q gi|254781133|r 116 SLGGIRY 122 (159) Q Consensus 116 klGF~~e 122 (159) ++||.+. T Consensus 120 k~GF~ev 126 (170) T PRK07922 120 RHGFVEI 126 (170) T ss_pred HCCCEEC T ss_conf 7699876 No 31 >KOG3396 consensus Probab=97.09 E-value=0.0038 Score=37.84 Aligned_cols=102 Identities=12% Similarity=0.005 Sum_probs=65.9 Q ss_pred CCHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCEEEEEEEEEEEEC------CCCCCC-EEEEC-CCCCCHHHHHHHHH Q ss_conf 998999999999720011487-8549999825760767999999834------678311-36863-77660359999999 Q gi|254781133|r 16 RINQIIADFVAKRIKDCSSGW-DRFVSMGILKSNFLVAGIIYHNYCP------ISRVIE-LSGAS-DCKSWLSRSVLKEI 86 (159) Q Consensus 16 ~t~e~~~~wi~~~~~~~~~~~-~~~~ai~i~~~g~lvG~v~l~~~~~------~~~~~e-ig~~~-~~~~~~~~ea~~~l 86 (159) -++|++..-++.. ...+. .....++...++++||+..+.-... .-+.+| +-+.. .++.++++-.+..+ T Consensus 34 vt~e~F~krf~~m---k~~~~~Y~i~Vied~~s~~vigtatL~IE~KfIh~~g~rGhiEDVVV~~~~rgk~LGkllv~~L 110 (150) T KOG3396 34 VTREQFEKRFEAM---KKSGDWYYIVVIEDKESEKVIGTATLFIERKFIHGCGSRGHIEDVVVDSEYRGKQLGKLLVETL 110 (150) T ss_pred CCHHHHHHHHHHH---HHCCCCEEEEEEEECCCCEEEEEEEEEEEHHHHHCCCCCCCEEEEEECHHHHHHHHHHHHHHHH T ss_conf 3799999999999---8538967999999678672788877777666665044457514798675451567869999999 Q ss_pred HHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEE Q ss_conf 99862486507999998264215899998749589667 Q gi|254781133|r 87 YSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRI 124 (159) Q Consensus 87 ~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv 124 (159) ++.+|+ +|+.++.+.|.+.| ...++|+||+.-++ T Consensus 111 v~l~k~-lgcYKi~LdC~~~n---v~FYeKcG~s~~~~ 144 (150) T KOG3396 111 VDLAKS-LGCYKIILDCDPKN---VKFYEKCGYSNAGN 144 (150) T ss_pred HHHHHH-CCCEEEEEECCHHH---HHHHHHCCCCCCCH T ss_conf 999874-28279999636210---14899739655420 No 32 >PRK10314 hypothetical protein; Provisional Probab=96.53 E-value=0.025 Score=33.22 Aligned_cols=81 Identities=9% Similarity=-0.091 Sum_probs=59.0 Q ss_pred EEEEECCCCEEEEEEEEEEEECCCCCCCEEEE----CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHH Q ss_conf 99998257607679999998346783113686----37766035999999999862486507999998264215899998 Q gi|254781133|r 40 VSMGILKSNFLVAGIIYHNYCPISRVIELSGA----SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLT 115 (159) Q Consensus 40 ~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~----~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~e 115 (159) +.+....+|++||+.-+...+.....+.||-. ..+++++|+..+..+++++-.+.+...|.+. .-......|+ T Consensus 49 ~H~~~~~~~~~va~~Ri~~~~~~~~~~~IGRVaV~k~~Rg~glG~~Lm~~~l~~~~~~~~~~~i~l~---AQ~~~~~FY~ 125 (153) T PRK10314 49 RHILGWKNDELVAYARILKSDDDLEPVVIGRVIVSEALRGEKVGQQLMSKTLESCTRHWPDKPLYLG---AQAHLQNFYA 125 (153) T ss_pred EEEEEEECCEEEEEEEEECCCCCCCCEEEEEEEECHHHCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---EHHHHHHHHH T ss_conf 8999995993889998725898877347878997587849989999999999999997799978996---2899999999 Q ss_pred HCCCEEEE Q ss_conf 74958966 Q gi|254781133|r 116 SLGGIRYR 123 (159) Q Consensus 116 klGF~~eg 123 (159) |+||+.+| T Consensus 126 ~~GF~~~g 133 (153) T PRK10314 126 SFGFIPVT 133 (153) T ss_pred HCCCEECC T ss_conf 87998889 No 33 >KOG2488 consensus Probab=96.40 E-value=0.017 Score=34.13 Aligned_cols=82 Identities=11% Similarity=0.029 Sum_probs=56.2 Q ss_pred EEEEEECCCC-EEEEEEEEEEEECCCCCC-----CEEEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHH Q ss_conf 4999982576-076799999983467831-----13686-3776603599999999986248650799999826421589 Q gi|254781133|r 39 FVSMGILKSN-FLVAGIIYHNYCPISRVI-----ELSGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQH 111 (159) Q Consensus 39 ~~ai~i~~~g-~lvG~v~l~~~~~~~~~~-----eig~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~ 111 (159) ...|...++. ++||.+.|.. +-.++.. |+-+. ..++.|+|+-.+..+..-+ +-..+.+|.+.|+.+|.+|+ T Consensus 92 ~~Yi~a~~~~~~~vgf~~Frf-~vd~g~~vlYcyEvqv~~~yR~kGiGk~LL~~l~~~a-~~~~~~kVmLTVf~~N~~al 169 (202) T KOG2488 92 LRYICAWNNKSKLVGFTMFRF-TVDTGDPVLYCYEVQVASAYRGKGIGKFLLDTLEKLA-DSRHMRKVMLTVFSENIRAL 169 (202) T ss_pred CEEEEEECCCCCEEEEEEEEE-ECCCCCEEEEEEEEEEHHHHHCCCHHHHHHHHHHHHH-HHHHHHHHEEEEECCCCHHH T ss_conf 369999868886246889998-7024986999998764566652685899999999988-88876651144431465047 Q ss_pred HHHHHCCCEEE Q ss_conf 99987495896 Q gi|254781133|r 112 RMLTSLGGIRY 122 (159) Q Consensus 112 rv~eklGF~~e 122 (159) ..++++||... T Consensus 170 ~Fy~~~gf~~~ 180 (202) T KOG2488 170 GFYHRLGFVVD 180 (202) T ss_pred HHHHHCCCCCC T ss_conf 78997596247 No 34 >COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism] Probab=96.22 E-value=0.034 Score=32.41 Aligned_cols=74 Identities=12% Similarity=0.053 Sum_probs=55.2 Q ss_pred ECCCCEEEEEEEEEEEECCCCCCCE---EEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCC Q ss_conf 8257607679999998346783113---6863-77660359999999998624865079999982642158999987495 Q gi|254781133|r 44 ILKSNFLVAGIIYHNYCPISRVIEL---SGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGG 119 (159) Q Consensus 44 i~~~g~lvG~v~l~~~~~~~~~~ei---g~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF 119 (159) ++.+|.+||++.++.+ ......|+ ++.. .+.+|.+...+..++.-| .++|+.++.+.+. .|...++++|| T Consensus 45 ~E~~g~viGCaaL~~~-~~~~~gE~~~laV~pd~r~~G~G~~Ll~~~~~~A-r~~gi~~~f~LTt----~~~~~F~~~GF 118 (153) T COG1246 45 IERDGKVIGCAALHPV-LEEDLGELRSLAVHPDYRGSGRGERLLERLLADA-RELGIKELFVLTT----RSPEFFAERGF 118 (153) T ss_pred EEECCCEEEEEECCCC-CCCCEEEEEEEEECHHHCCCCCHHHHHHHHHHHH-HHCCCCEEEEEEC----CCHHHHHHCCC T ss_conf 3227938999840356-7567256777787787638870899999999998-8749701343314----66889998098 Q ss_pred EEEE Q ss_conf 8966 Q gi|254781133|r 120 IRYR 123 (159) Q Consensus 120 ~~eg 123 (159) ++.- T Consensus 119 ~~vd 122 (153) T COG1246 119 TRVD 122 (153) T ss_pred EECC T ss_conf 6676 No 35 >PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional Probab=96.00 E-value=0.037 Score=32.18 Aligned_cols=72 Identities=8% Similarity=-0.069 Sum_probs=55.2 Q ss_pred EECCCCEEEEEEEEEEEECCCCCCCE---EEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 98257607679999998346783113---686-37766035999999999862486507999998264215899998749 Q gi|254781133|r 43 GILKSNFLVAGIIYHNYCPISRVIEL---SGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 43 ~i~~~g~lvG~v~l~~~~~~~~~~ei---g~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) ..+.+|+++|+.+++-+ ..+.+|| ++. ...++|.|+..++.++..| .++++.||...+.. --.++|+| T Consensus 507 vaE~~g~v~g~~sl~i~--~~~LAEIrsl~v~~~~~~~G~G~~lV~~~l~~a-~~~~~~rvfvLT~~-----p~fF~k~g 578 (614) T PRK12308 507 VAEHHGEVTGCASLYIY--DSGLAEIRSLGVEAGWQVQGQGKALVQYLVEKA-RQMAIKKVFVLTRV-----PEFFMKQG 578 (614) T ss_pred EEEECCEEEEEEEEEEE--ECCHHHHHHHHCCHHHHHCCCHHHHHHHHHHHH-HHHCCCEEEEEECC-----CHHHHHCC T ss_conf 66557818888789886--167199998616787774283289999999999-98378758998437-----18899759 Q ss_pred CEEE Q ss_conf 5896 Q gi|254781133|r 119 GIRY 122 (159) Q Consensus 119 F~~e 122 (159) |... T Consensus 579 f~~~ 582 (614) T PRK12308 579 FSPT 582 (614) T ss_pred CEEC T ss_conf 8207 No 36 >TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species. Probab=95.96 E-value=0.11 Score=29.39 Aligned_cols=80 Identities=10% Similarity=0.008 Sum_probs=56.0 Q ss_pred EEEECCCCEEEEEEEEEEEE-CCCCCCCEEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 99982576076799999983-467831136863-7766035999999999862486507999998264215899998749 Q gi|254781133|r 41 SMGILKSNFLVAGIIYHNYC-PISRVIELSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 41 ai~i~~~g~lvG~v~l~~~~-~~~~~~eig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) .+...+++++||...+..-+ ....++|+-+.. ++++|++++.+.+++.-+ + .++..-+..+..++.++++++| T Consensus 48 hl~~~~~~~~vGya~l~~~~~~~~~~aElvVhP~~R~rGiG~aLl~a~~~~~----~-~~l~~WahG~~p~A~alA~~~G 122 (292) T TIGR03448 48 HLVAVDSDPIVGYANLVPARGTDPAMAELVVHPAHRRRGIGRALIRALLAKG----G-GRLRVWAHGDLPAARALASRLG 122 (292) T ss_pred EEEEECCCEEEEEEEECCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHC----C-CCCEEEECCCCHHHHHHHHHCC T ss_conf 7999449868999996257888776478998900037987999999999743----8-7732886599889999999779 Q ss_pred CEEEEEE Q ss_conf 5896676 Q gi|254781133|r 119 GIRYRIP 125 (159) Q Consensus 119 F~~egv~ 125 (159) |+..+.. T Consensus 123 ~~~~R~l 129 (292) T TIGR03448 123 LVPTREL 129 (292) T ss_pred CEEEEHH T ss_conf 9001323 No 37 >PRK05279 N-acetylglutamate synthase; Validated Probab=95.34 E-value=0.14 Score=28.88 Aligned_cols=76 Identities=12% Similarity=0.050 Sum_probs=53.7 Q ss_pred EECCCCEEEEEEEEEEEECCCCCCCEEEE-C---CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCC Q ss_conf 98257607679999998346783113686-3---7766035999999999862486507999998264215899998749 Q gi|254781133|r 43 GILKSNFLVAGIIYHNYCPISRVIELSGA-S---DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLG 118 (159) Q Consensus 43 ~i~~~g~lvG~v~l~~~~~~~~~~eig~~-~---~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklG 118 (159) .++.||.+|||+.++.+ +..+.+|++.- . .++++.+...++.+...| .+.|+.++.+.+- .+..-+..-| T Consensus 338 V~e~Dg~iigCaALy~~-~~~~~aElaclav~~~y~~~g~G~~Ll~~~e~~A-~~~g~~~lF~LTT----qt~hwF~e~G 411 (441) T PRK05279 338 VIERDGVIIGCAALYPF-PEEKMGEMACLAVHPDYRGSGRGERLLKRIEQRA-RQQGLSRLFVLTT----RTAHWFLERG 411 (441) T ss_pred EEEECCCEEEEEEEEEC-CCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEEEC----CCHHHHHHCC T ss_conf 99988959999960003-8778068888898786618987999999999999-9869987999965----6388899759 Q ss_pred CEEEEE Q ss_conf 589667 Q gi|254781133|r 119 GIRYRI 124 (159) Q Consensus 119 F~~egv 124 (159) |++..+ T Consensus 412 F~~~~~ 417 (441) T PRK05279 412 FVPADV 417 (441) T ss_pred CCCCCH T ss_conf 830892 No 38 >pfam08444 Gly_acyl_tr_C Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region. This family features the C-terminal region of several mammalian specific aralkyl acyl-CoA:amino acid N-acyltransferase (glycine N-acyltransferase) proteins EC:2.3.1.13. Probab=93.80 E-value=0.3 Score=26.98 Aligned_cols=71 Identities=14% Similarity=0.132 Sum_probs=47.6 Q ss_pred CCCCEEEEEEEEEEEECCCCCCCEEEEC--CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEE Q ss_conf 2576076799999983467831136863--7766035999999999862486507999998264215899998749589 Q gi|254781133|r 45 LKSNFLVAGIIYHNYCPISRVIELSGAS--DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIR 121 (159) Q Consensus 45 ~~~g~lvG~v~l~~~~~~~~~~eig~~~--~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~ 121 (159) -.+|.+|-.+-.. .++....||.+ .+++|+.+-++..++.+. .++|.- ++..|+.+|..|+++..++||.. T Consensus 5 gpeG~PVSW~Lmd----qtge~rmgyTlPeyR~kG~~~~~~~~~~~~L-~~~g~P-~Y~hv~~~N~~~~k~~~~lg~~~ 77 (89) T pfam08444 5 GPEGTPVSWSLMD----QTGELRMAGTLPKYRRQGLMSHVIYHQAQYL-EKLGFP-VYSHVDKANEISQKMSGNLGHVP 77 (89) T ss_pred CCCCCEEEEEEEC----CCCEEEECCCCHHHHCCCHHHHHHHHHHHHH-HHCCCC-CEEEEHHCCHHHHHHHHHCCCEE T ss_conf 8999874799853----7421100133777875586999999999999-867999-46864115788999999779825 No 39 >COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only] Probab=93.23 E-value=0.22 Score=27.73 Aligned_cols=56 Identities=11% Similarity=0.166 Sum_probs=43.2 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEEEEEE--CCCCHHHHHHHHCCCEEEEEECCCCC Q ss_conf 7660359999999998624865079999982--64215899998749589667612894 Q gi|254781133|r 74 CKSWLSRSVLKEIYSYPWNQLCCQAVIHRIP--DEDYPQHRMLTSLGGIRYRIPRLRGR 130 (159) Q Consensus 74 ~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~--~~N~~S~rv~eklGF~~egv~R~~G~ 130 (159) +.+|.+++.-.-+.+|| +--|.-+|.++|+ +.|++|-...-.+||.+.|....+|- T Consensus 97 RGrG~aRalY~Dlf~~A-e~agy~~~tCEVn~DppnpasdaFHaalGF~eVG~a~ihgg 154 (167) T COG3818 97 RGRGVARALYADLFSYA-ELAGYPYLTCEVNLDPPNPASDAFHAALGFHEVGQATIHGG 154 (167) T ss_pred CCCCHHHHHHHHHHHHH-HHCCCCEEEEEECCCCCCHHHHHHHHHCCCEECCCEEEECC T ss_conf 46643899999999999-86588569998607999827788766508647562478664 No 40 >KOG3235 consensus Probab=91.72 E-value=0.99 Score=23.97 Aligned_cols=94 Identities=11% Similarity=0.054 Sum_probs=60.1 Q ss_pred CCCCEEEEEECCCCEEEEEEEEEEEE-CCCC--CCCE-EEE---CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCC Q ss_conf 87854999982576076799999983-4678--3113-686---377660359999999998624865079999982642 Q gi|254781133|r 35 GWDRFVSMGILKSNFLVAGIIYHNYC-PISR--VIEL-SGA---SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDED 107 (159) Q Consensus 35 ~~~~~~ai~i~~~g~lvG~v~l~~~~-~~~~--~~ei-g~~---~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N 107 (159) .|+....+....+|++||.+----.. +.+. ...| +.+ ..++.|+++..+.....---+.....-|.+.|...| T Consensus 38 swp~lSyVA~D~~gkiVGYvlAkmee~p~~~~~hGhItSlaV~rs~RrlGla~kLm~qa~rAm~E~~~A~yvsLHVR~SN 117 (193) T KOG3235 38 SWPQLSYVAEDENGKIVGYVLAKMEEDPDDEPPHGHITSLAVKRSYRRLGLAQKLMNQASRAMVEVYEAKYVSLHVRKSN 117 (193) T ss_pred CCCCCEEEEECCCCCEEEEEEEEHHHCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEECCC T ss_conf 66553478775899689876011230656789887068765324688761899999999999998635338888641106 Q ss_pred CHHHHHHH-HCCCEEEEEECCC Q ss_conf 15899998-7495896676128 Q gi|254781133|r 108 YPQHRMLT-SLGGIRYRIPRLR 128 (159) Q Consensus 108 ~~S~rv~e-klGF~~egv~R~~ 128 (159) .+++.++. .+||+-..+.-++ T Consensus 118 raAl~LY~~tl~F~v~eve~kY 139 (193) T KOG3235 118 RAALHLYKNTLGFVVCEVEPKY 139 (193) T ss_pred HHHHHHHHHCCCEEEEECCCCC T ss_conf 8888766613141886136531 No 41 >PRK01346 hypothetical protein; Provisional Probab=90.22 E-value=1.4 Score=23.13 Aligned_cols=82 Identities=10% Similarity=-0.012 Sum_probs=47.5 Q ss_pred CCCCEEEEEECCCCEEEEEEEEEEEECC--CC-CC---C---EEEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEE Q ss_conf 8785499998257607679999998346--78-31---1---3686-377660359999999998624865079999982 Q gi|254781133|r 35 GWDRFVSMGILKSNFLVAGIIYHNYCPI--SR-VI---E---LSGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIP 104 (159) Q Consensus 35 ~~~~~~ai~i~~~g~lvG~v~l~~~~~~--~~-~~---e---ig~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~ 104 (159) .......++..+++++||++..+...-. .+ .+ . +++. .++++|+.++.++..++-+-+ -|.. +.. T Consensus 43 ~~~~~~~~ga~d~~~lVg~~~~~~~~l~vpgG~~vp~agIt~Vav~PehRrrG~~~~Lm~~~L~~~r~-~G~~-ls~--- 117 (411) T PRK01346 43 LVEPDRTLGAFDGDDVVGTALAFDLELTVPGGAVLPAAAVTAVTVAPTHRRRGLLTALMREQLRRIAE-RGEP-VAA--- 117 (411) T ss_pred CCCCCCEEEEEECCEEEEEEEEECEEEEEECCCEEECCCEEEEEECHHHCCCCHHHHHHHHHHHHHHH-CCCE-EEE--- T ss_conf 26867469999999999999961448773088388716178999883106785899999999999997-7983-999--- Q ss_pred CCCCHHHHHHHHCCCEEE Q ss_conf 642158999987495896 Q gi|254781133|r 105 DEDYPQHRMLTSLGGIRY 122 (159) Q Consensus 105 ~~N~~S~rv~eklGF~~e 122 (159) --..+..+|.|+||-.- T Consensus 118 -L~pse~~iY~RfGyg~a 134 (411) T PRK01346 118 -LTASEGGIYGRFGYGVA 134 (411) T ss_pred -EECCCCCCCCCCCCEEE T ss_conf -95067660146542130 No 42 >KOG3138 consensus Probab=88.34 E-value=1.3 Score=23.24 Aligned_cols=82 Identities=9% Similarity=0.022 Sum_probs=55.4 Q ss_pred CCCEEEEEEEEEEEECCCC-----------CCCEEEE-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHH Q ss_conf 5760767999999834678-----------3113686-377660359999999998624865079999982642158999 Q gi|254781133|r 46 KSNFLVAGIIYHNYCPISR-----------VIELSGA-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRM 113 (159) Q Consensus 46 ~~g~lvG~v~l~~~~~~~~-----------~~eig~~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv 113 (159) .++..||.+...-+..... ...+|+- -.+..+.|.-.+..+.+|+-..--++++.+.+...|...+.. T Consensus 62 ~~~~~v~a~~~k~~~~~~~~~r~~~~~~~yi~~Lgvl~~yR~~gIGs~Ll~~~~~~~~~~~~~~~v~lHv~~~n~~ai~~ 141 (187) T KOG3138 62 YNEIAVGAVACKLIKFVQNAKRLFGNRVIYILSLGVLPRYRNKGIGSKLLEFVKKYCSEAHQCRRVYLHVQAVNESAIEF 141 (187) T ss_pred HCCCCCCCEEEEEHHHHHHHHHHHCCCEEEEEEECCCHHHHHCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHH T ss_conf 10145300133101224566654033214777622438777425487889999988750564551799997179738899 Q ss_pred HHHCCCEEEEEECC Q ss_conf 98749589667612 Q gi|254781133|r 114 LTSLGGIRYRIPRL 127 (159) Q Consensus 114 ~eklGF~~egv~R~ 127 (159) +++.||+..+..-. T Consensus 142 Y~~~gF~~~~~~~~ 155 (187) T KOG3138 142 YEKRGFEIVERLKN 155 (187) T ss_pred HHHCCCEEEECCCC T ss_conf 88349567402544 No 43 >PHA01733 hypothetical protein Probab=88.27 E-value=1.9 Score=22.29 Aligned_cols=85 Identities=12% Similarity=-0.042 Sum_probs=50.7 Q ss_pred ECCCCEEEEEEEEEEEECCCCCCCEEEECCCC--CCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEE Q ss_conf 82576076799999983467831136863776--6035999999999862486507999998264215899998749589 Q gi|254781133|r 44 ILKSNFLVAGIIYHNYCPISRVIELSGASDCK--SWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIR 121 (159) Q Consensus 44 i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~--~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~ 121 (159) ...||+++|..|... +.......|.- +..+ .-+....++..-.|.-...+..-++-.|+..|..+++.++-|||+- T Consensus 52 ~~~dG~~aGmaGV~p-~~~~~~G~iWm-L~T~~i~k~~~~FlR~~k~~v~~~~~y~~L~N~vd~rN~~hikwLk~lGF~f 129 (153) T PHA01733 52 VAPDGSLAGVAGLVE-DMGNRVGEIWM-VCTPAIEKNPIALLRGAKWWLPKSRNYDLLWNIVDKRNLVHRKLLRKLGFKG 129 (153) T ss_pred EECCCCEEEEEEEEC-CCCCCCCEEEE-ECCHHHHHCHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCE T ss_conf 956996876877506-76577762689-7076887573999985399998537987478862457599999999769707 Q ss_pred EEEECCCCCE Q ss_conf 6676128941 Q gi|254781133|r 122 YRIPRLRGRN 131 (159) Q Consensus 122 egv~R~~G~~ 131 (159) ..-. .+|.. T Consensus 130 ~~~~-~~G~~ 138 (153) T PHA01733 130 LRYV-QPGPL 138 (153) T ss_pred EEEE-CCCCC T ss_conf 7641-23888 No 44 >KOG3397 consensus Probab=86.93 E-value=2.3 Score=21.83 Aligned_cols=100 Identities=10% Similarity=0.016 Sum_probs=64.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCC--CCEEE-EC-CCCCCHHHHHHHHHHHHH Q ss_conf 999899999999972001148785499998257607679999998346783--11368-63-776603599999999986 Q gi|254781133|r 15 PRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRV--IELSG-AS-DCKSWLSRSVLKEIYSYP 90 (159) Q Consensus 15 P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~--~eig~-~~-~~~~~~~~ea~~~l~~~a 90 (159) |+++..-. ..++......+..+.+..+...++||..-|..+...... +|--+ .- .+..|+|+-.++-+-+|+ T Consensus 37 PRS~TsR~----hSL~~ScDs~P~sL~Ll~E~~~~VigH~rLS~i~n~~~al~VEsVVV~k~~RG~GFGk~lMk~~E~~~ 112 (225) T KOG3397 37 PRSDTSRE----HSLKKSCDSPPMSLLLLNEENDEVLGHSRLSHLPNRDHALWVESVVVKKDQRGLGFGKFLMKSTEKWM 112 (225) T ss_pred CCCHHHHH----HHHHCCCCCCCEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEEEEEEHHHCCCCHHHHHHHHHHHHH T ss_conf 75225767----66651357776155553366520011100145678885168888887443315538899999999999 Q ss_pred HCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEE Q ss_conf 248650799999826421589999874958966 Q gi|254781133|r 91 WNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYR 123 (159) Q Consensus 91 F~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~eg 123 (159) -. .++..+.+.++.. .+.||++|+.... T Consensus 113 R~-~gf~~~yLsT~DQ----~~FYe~lGYe~c~ 140 (225) T KOG3397 113 RE-KGFNEAYLSTDDQ----CRFYESLGYEKCD 140 (225) T ss_pred HH-HHHHHEEEECCCC----CHHHHHHCCCCCC T ss_conf 98-5332146411231----0166651410248 No 45 >KOG3234 consensus Probab=86.91 E-value=2.3 Score=21.82 Aligned_cols=93 Identities=12% Similarity=-0.030 Sum_probs=54.7 Q ss_pred EEEECCCCEEEEEEEEEEEECCCCCCC-----EEEEC-CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHH Q ss_conf 999825760767999999834678311-----36863-776603599999999986248650799999826421589999 Q gi|254781133|r 41 SMGILKSNFLVAGIIYHNYCPISRVIE-----LSGAS-DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRML 114 (159) Q Consensus 41 ai~i~~~g~lvG~v~l~~~~~~~~~~e-----ig~~~-~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ 114 (159) -+....++++.|.+-= .........+ +.++. .++-++++-.+. .+.-.-++.+..=+.+.+...|+-+|.++ T Consensus 44 ~~a~~p~~~imgyimg-k~Eg~~~~wh~HvTAltVap~~Rrl~la~~lm~-~led~~d~~~a~fvDLfVr~sN~iAI~mY 121 (173) T KOG3234 44 IVAEAPTGEIMGYIMG-KVEGKDTEWHGHVTALTVAPDYRRLGLAAKLMD-TLEDVSDVDNAYFVDLFVRVSNQIAIDMY 121 (173) T ss_pred EECCCCCCCEEEEEEE-ECCCCCCCEEEEEEEEEECHHHHHHHHHHHHHH-HHHHHHHHHHHHEEEEEEECCCHHHHHHH T ss_conf 7311799846899854-204667513667898873546888779999999-99998875321231032000654589999 Q ss_pred HHCCCEEEE-EECCC--CCEEEEE Q ss_conf 874958966-76128--9410079 Q gi|254781133|r 115 TSLGGIRYR-IPRLR--GRNAAEN 135 (159) Q Consensus 115 eklGF~~eg-v~R~~--G~~~d~~ 135 (159) ++||+...+ |...+ |+..|.. T Consensus 122 kkLGY~~YR~Vi~YY~~g~deda~ 145 (173) T KOG3234 122 KKLGYSVYRTVIEYYSVGPDEDAY 145 (173) T ss_pred HHCCCEEEEEEEEEECCCCCCCHH T ss_conf 852936887654665038873067 No 46 >cd02169 Citrate_lyase_ligase Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction. Probab=84.87 E-value=3 Score=21.22 Aligned_cols=77 Identities=14% Similarity=0.102 Sum_probs=52.7 Q ss_pred EEEEEECC-CCEEEEEEEEEEEECCCCCCCEEEECC-CCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHH Q ss_conf 49999825-760767999999834678311368637-7660359999999998624865079999982642158999987 Q gi|254781133|r 39 FVSMGILK-SNFLVAGIIYHNYCPISRVIELSGASD-CKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTS 116 (159) Q Consensus 39 ~~ai~i~~-~g~lvG~v~l~~~~~~~~~~eig~~~~-~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ek 116 (159) -+++++.+ +|++||+-++..- .-+ .+++.-. +..++....+..+++++|+ .|...+...+-|.|. +.++. T Consensus 5 Dytv~i~d~~~~lvatGs~~gn--vlK--cvAV~~~~qg~gl~~~lvs~L~~~~~~-~G~~h~FvfTKp~~~---~~F~~ 76 (296) T cd02169 5 DYTVGIFDDAGNLIATGSLAGN--ILK--CVAVSPKYQGEGLLLKLVSELINRAFE-RGIFHLFVFTKPENV---KFFSS 76 (296) T ss_pred CEEEEEEECCCCEEEEEECCCC--EEE--EEEECHHCCCCCHHHHHHHHHHHHHHH-CCCCEEEEEECCCHH---HHHHH T ss_conf 4699998079978997302147--879--999872105864799999999999997-699648999371057---77875 Q ss_pred CCCEEEE Q ss_conf 4958966 Q gi|254781133|r 117 LGGIRYR 123 (159) Q Consensus 117 lGF~~eg 123 (159) |||.+-. T Consensus 77 lGF~~ia 83 (296) T cd02169 77 LGFKELA 83 (296) T ss_pred CCCEEEE T ss_conf 8977988 No 47 >COG3882 FkbH Predicted enzyme involved in methoxymalonyl-ACP biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=83.30 E-value=3.5 Score=20.84 Aligned_cols=83 Identities=11% Similarity=-0.027 Sum_probs=47.4 Q ss_pred CEEEEEECC---CCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHHHHHHHCC---CCEEEEEEEEEC--CCCH Q ss_conf 549999825---76076799999983467831136863776603599999999986248---650799999826--4215 Q gi|254781133|r 38 RFVSMGILK---SNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVLKEIYSYPWNQ---LCCQAVIHRIPD--EDYP 109 (159) Q Consensus 38 ~~~ai~i~~---~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l~~~aF~~---l~l~Ri~~~~~~--~N~~ 109 (159) .-+++.+.+ |+-+||.+.+ ....+...|--....=+-+++..-..++.+++++ .|.+.+-+...| .|.| T Consensus 459 li~sv~l~DKfgDnGiigvviv---~kk~~~w~IDt~lmSCRVlgRkvE~~l~~~~~e~A~~~gi~tir~~Y~pt~kN~p 535 (574) T COG3882 459 LIFSVSLKDKFGDNGIIGVVIV---EKKESEWFIDTFLMSCRVLGRKVEQRLMNSLEEQALSEGINTIRGYYIPTEKNAP 535 (574) T ss_pred EEEEEEECCCCCCCCEEEEEEE---EECCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCC T ss_conf 7999986123467855899999---9318727767889989998667899999999999986463222358646435783 Q ss_pred HHHHHHHCCCEEEE Q ss_conf 89999874958966 Q gi|254781133|r 110 QHRMLTSLGGIRYR 123 (159) Q Consensus 110 S~rv~eklGF~~eg 123 (159) -...+|++||..+| T Consensus 536 v~~FyE~mgf~l~~ 549 (574) T COG3882 536 VSDFYERMGFKLKG 549 (574) T ss_pred HHHHHHHHCHHHCC T ss_conf 89999873500013 No 48 >COG3153 Predicted acetyltransferase [General function prediction only] Probab=82.41 E-value=3.8 Score=20.62 Aligned_cols=85 Identities=12% Similarity=-0.005 Sum_probs=58.3 Q ss_pred CCEEEEEECCCCEEEEEEEEEEEECCCCC---CCEEE---E-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCH Q ss_conf 85499998257607679999998346783---11368---6-37766035999999999862486507999998264215 Q gi|254781133|r 37 DRFVSMGILKSNFLVAGIIYHNYCPISRV---IELSG---A-SDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYP 109 (159) Q Consensus 37 ~~~~ai~i~~~g~lvG~v~l~~~~~~~~~---~eig~---~-~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~ 109 (159) ...+.+...++|++||.|-|..++-.... .-++. . .+++++++.+.++..++.+ ...|...+...=++. T Consensus 44 ~~~LslVA~d~g~vvG~Il~s~v~~~g~~~~~~~LaPLaV~p~~qg~GIG~~Lvr~~le~a-~~~G~~~v~vlGdp~--- 119 (171) T COG3153 44 DLTLSLVAEDDGEVVGHILFSPVTVGGEELGWLGLAPLAVDPEYQGQGIGSALVREGLEAL-RLAGASAVVVLGDPT--- 119 (171) T ss_pred CCCEEEEEEECCEEEEEEEEEEEEECCCCCCEEEEEEEEECHHHCCCCHHHHHHHHHHHHH-HHCCCCEEEEECCCC--- T ss_conf 5533699820998999999868985686414699976787622157867899999999999-977997899906844--- Q ss_pred HHHHHHHCCCEEEEEECCC Q ss_conf 8999987495896676128 Q gi|254781133|r 110 QHRMLTSLGGIRYRIPRLR 128 (159) Q Consensus 110 S~rv~eklGF~~egv~R~~ 128 (159) -+.|+||+...-.... T Consensus 120 ---YY~rfGF~~~~~~~l~ 135 (171) T COG3153 120 ---YYSRFGFEPAAGAKLY 135 (171) T ss_pred ---CCCCCCCEECCCCCCC T ss_conf ---2445474886555311 No 49 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=79.11 E-value=2 Score=22.25 Aligned_cols=91 Identities=12% Similarity=0.100 Sum_probs=65.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEE---EC-CCCCCHHHHHHHHHHHHH Q ss_conf 99989999999997200114878549999825760767999999834678311368---63-776603599999999986 Q gi|254781133|r 15 PRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSG---AS-DCKSWLSRSVLKEIYSYP 90 (159) Q Consensus 15 P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~---~~-~~~~~~~~ea~~~l~~~a 90 (159) |+++|.++.-|.. |.| ++.||-++||..|+.+ ++....|++. .. .+.+++|.+.+..+.+-| T Consensus 320 ~RsRe~LE~~I~~------------FSv-~e~dG~~~GCAALypy-a~~~~GE~AClAv~P~~~~GG~G~~LL~h~~~~A 385 (439) T TIGR01890 320 RRSRELLEREIEE------------FSV-IEHDGLIIGCAALYPY-AEEDVGEMACLAVSPEYRDGGRGERLLKHIEDRA 385 (439) T ss_pred CHHHHHHHHHHHH------------CEE-EECCCCCHHHHHCCCC-CCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 0447877775300------------614-4017611212125887-6322330121121767888887178999999986 Q ss_pred HCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEE Q ss_conf 2486507999998264215899998749589667 Q gi|254781133|r 91 WNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRI 124 (159) Q Consensus 91 F~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv 124 (159) .++|+.|+.+-+ +.+-.=+.+=||..-.+ T Consensus 386 -r~~Gl~~LFvLt----T~t~hWF~erGF~~a~v 414 (439) T TIGR01890 386 -RQMGLSRLFVLT----TRTEHWFRERGFQEASV 414 (439) T ss_pred -HHCCHHHHCEEC----CCCHHHHHHCCCCCCCH T ss_conf -422513520002----56134786548972664 No 50 >TIGR01686 FkbH FkbH domain; InterPro: IPR010037 This entry describes a domain of unknown function. One of proteins with this domain is a modular polyketide synthase, which is 4800 amino acids in length from Streptomyces avermilitis, where the domain is the C-terminal segment. By contrast, the FkbH protein from Streptomyces hygroscopicus (Q84G18 from SWISSPROT) apparently contains only this domain. The remaining members of the family all contain an additional N-terminal domain of between 200 and 275 amino acids, which show less than 20 0dentity to one another. It seems likely then that these proteins are involved in disparate functions, probably the biosynthesis of different natural products. For instance, the FkbH gene is found in a gene cluster believed to be responsible for the biosynthesis of unusual 'PKS extender units' in the ascomycin pathway . This domain is composed of two parts, the first of which is a member of subfamily IIIC of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this domain. The C-terminal portion of this domain is unique to this family (by BLAST).. Probab=75.20 E-value=4.7 Score=20.07 Aligned_cols=85 Identities=8% Similarity=-0.001 Sum_probs=51.5 Q ss_pred EEEEEECCCCEEEEEEEEEEEECCCCCCCEEEE-CCCC---CCHHHHHHHHHHHHHH--CCCCEEEEEEEE--ECCCCHH Q ss_conf 499998257607679999998346783113686-3776---6035999999999862--486507999998--2642158 Q gi|254781133|r 39 FVSMGILKSNFLVAGIIYHNYCPISRVIELSGA-SDCK---SWLSRSVLKEIYSYPW--NQLCCQAVIHRI--PDEDYPQ 110 (159) Q Consensus 39 ~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~-~~~~---~~~~~ea~~~l~~~aF--~~l~l~Ri~~~~--~~~N~~S 110 (159) -++++..+-=-.-|.|++..+.+..+...|--+ +.+| ++...++++.+.+.|- -.+|+|.|.+.. -+.|.+. T Consensus 245 v~~~~~~DrfgD~G~vG~~~~~~~e~~l~i~~l~~SCR~l~r~~e~~~L~~l~d~A~hff~~g~~~~~~~y~~T~rN~~~ 324 (337) T TIGR01686 245 VLTVAMSDRFGDSGLVGILLLEKKEGQLVIKDLVMSCRVLGRGVEERLLRWLVDQAKHFFELGAHKVRLIYRRTERNVPV 324 (337) T ss_pred EEEEEEECCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCHHH T ss_conf 99998641458876089999852875152876766401410116799999999988999984135403100689877689 Q ss_pred HHHHHHCCCEEEE Q ss_conf 9999874958966 Q gi|254781133|r 111 HRMLTSLGGIRYR 123 (159) Q Consensus 111 ~rv~eklGF~~eg 123 (159) .++++.+||..++ T Consensus 325 ~~~~~~~g~~~~~ 337 (337) T TIGR01686 325 ESFYEELGFEDED 337 (337) T ss_pred HHHHHHHCCCCCC T ss_conf 9999985312369 No 51 >pfam02100 ODC_AZ Ornithine decarboxylase antizyme. This family consists of ornithine decarboxylase antizyme proteins. The polyamine biosynthetic enzyme ornithine decarboxylase (ODC) is degraded by the 26 S proteasome via a ubiquitin-independent pathway. Its degradation is greatly accelerated by association with the polyamine-induced regulatory protein antizyme 1 (AZ1). Probab=63.78 E-value=11 Score=17.86 Aligned_cols=51 Identities=12% Similarity=0.183 Sum_probs=39.0 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEEC Q ss_conf 603599999999986248650799999826421589999874958966761 Q gi|254781133|r 76 SWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPR 126 (159) Q Consensus 76 ~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R 126 (159) ++..++.+-++++||=+.|++.+|.+.+.-....-..+.+.++|.=--+.+ T Consensus 114 ~~~~K~~~~aLle~Aee~l~c~~vv~c~~k~~~d~~~l~r~~~~~GF~~~~ 164 (187) T pfam02100 114 GGDLKEGLVALLEFAEEKLECSHVVICFDKNREDRAALLRTLSWVGFELVT 164 (187) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHCEECCEEECC T ss_conf 677189999999974111588789999847863289887650002327558 No 52 >pfam11039 DUF2824 Protein of unknown function (DUF2824). This family of proteins has no known function. Some members in the family are annotated as the P22 head assembly protein gp14 however this cannot be confirmed. Probab=58.58 E-value=5.9 Score=19.50 Aligned_cols=100 Identities=11% Similarity=0.059 Sum_probs=58.8 Q ss_pred EEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHH----- Q ss_conf 499998257607679999998346783113686377660359999999998624865079999982642158999----- Q gi|254781133|r 39 FVSMGILKSNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRM----- 113 (159) Q Consensus 39 ~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv----- 113 (159) ...+++...|++=|.+.+..+.+-.-.++.-|-. .-++++++.-.+.-.|.....-++-|... ++++. T Consensus 38 ~~yfevnvhgqfggivyy~eiqp~tfdchamylp-e~rg~sk~iglafw~~il~~t~vqcvtsf------aarkfrhgq~ 110 (151) T pfam11039 38 CRYFEVNVHGQFGGIVYYNEIQPLTFDCHAMYLP-EIRGFSKEIGLAFWRYILTNTTVQCVTSF------AARKFRHGQM 110 (151) T ss_pred CEEEEEEECCEECCEEEEEECCCCEEEHHHEECH-HHCCCCHHHHHHHHHHHHCCCEEEEEHHH------HHHHHCCCEE T ss_conf 3489997403105489872004413410211065-44374055618999886607635643166------6764026607 Q ss_pred -HHHCCCEEEEEECCCCCEEEEE-EEEECHHHHH Q ss_conf -9874958966761289410079-9986288986 Q gi|254781133|r 114 -LTSLGGIRYRIPRLRGRNAAEN-IYVITHEAWM 145 (159) Q Consensus 114 -~eklGF~~egv~R~~G~~~d~~-v~~l~~~eW~ 145 (159) ..-+|..+.|..|.+-+..|++ -|+-+++|-. T Consensus 111 yc~miglkrvgti~kyfkgvddvtfysatreel~ 144 (151) T pfam11039 111 YCAMIGLKRVGTIKKYFKGVDDVTFYSATREELI 144 (151) T ss_pred EEEEECCHHHHHHHHHHCCCCCEEEEECCHHHHH T ss_conf 7755541112048887468752477600399999 No 53 >PHA00771 head assembly protein Probab=58.31 E-value=6 Score=19.47 Aligned_cols=100 Identities=11% Similarity=0.059 Sum_probs=58.7 Q ss_pred EEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHH----- Q ss_conf 499998257607679999998346783113686377660359999999998624865079999982642158999----- Q gi|254781133|r 39 FVSMGILKSNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRM----- 113 (159) Q Consensus 39 ~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv----- 113 (159) ...+++...|++=|.+.+..+.+-.-.++.-|-. .-++++++.-.+.-.|.....-++-|... ++++. T Consensus 38 ~~yfevnvhgqfggivyy~eiqp~tfdchamylp-e~rg~sk~iglafw~~il~~t~vqcvtsf------aarkfrhgq~ 110 (151) T PHA00771 38 CRYFEVNVHGQFGGIVYYNEIQPLTFDCHAMYLP-EIRGFSKEIGLAFWRYILTNTTVQCVTSF------AARKFRHGQM 110 (151) T ss_pred CEEEEEEECCEECCEEEEEECCCCEEEHHHEECH-HHCCCCHHHHHHHHHHHHCCCEEEEEHHH------HHHHHCCCEE T ss_conf 3489997403005489872004413410211065-44374055618999886607635643166------6764026607 Q ss_pred -HHHCCCEEEEEECCCCCEEEEE-EEEECHHHHH Q ss_conf -9874958966761289410079-9986288986 Q gi|254781133|r 114 -LTSLGGIRYRIPRLRGRNAAEN-IYVITHEAWM 145 (159) Q Consensus 114 -~eklGF~~egv~R~~G~~~d~~-v~~l~~~eW~ 145 (159) ..-+|..+.|..|.+-+..|++ -|+-+++|-. T Consensus 111 yc~miglkrvgti~kyfkgvddvtfysatreel~ 144 (151) T PHA00771 111 YCAMIGLKRVGTIKKYFKGVDDVTFYSATREELI 144 (151) T ss_pred EEEEECCHHHHHHHHHHCCCCCEEEEECCHHHHH T ss_conf 7755541112048887468752477600399999 No 54 >TIGR00715 precor6x_red precorrin-6x reductase; InterPro: IPR003723 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobK and CbiJ precorrin-6x reductase (1.3.1.54 from EC). In the aerobic pathway, CobK catalyses the reduction of the macrocycle of precorrin-6X to produce precorrin-6Y; while in the anaerobic pathway CbiJ catalyses the reduction of the macrocycle of cobalt-precorrin-6X into cobalt-precorrin-6Y , .; GO: 0016994 precorrin-6A reductase activity, 0009236 cobalamin biosynthetic process. Probab=55.30 E-value=16 Score=17.09 Aligned_cols=135 Identities=17% Similarity=0.118 Sum_probs=80.8 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEE--EEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHHHH Q ss_conf 898799989999999997200114878549999825760767--999999834678311368637766035999999999 Q gi|254781133|r 11 GNAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVA--GIIYHNYCPISRVIELSGASDCKSWLSRSVLKEIYS 88 (159) Q Consensus 11 ~~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG--~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l~~ 88 (159) ++++|+--+..+.=++- |....-.+..+ .+.....+| .+...||.......+=+|-. +.+-+-..-.+.+-+ T Consensus 73 DAtHPFA~~~t~~a~~v----c~E~~~~Yvrf-eRp~e~~~~~~ii~V~d~~~a~~~a~q~~~~-g~~V~l~~G~~~L~~ 146 (260) T TIGR00715 73 DATHPFAAQITKNALEV----CKELGIPYVRF-ERPEELVLGKNIIEVSDIEEAARVAEQGYLK-GKRVFLTAGVSTLAA 146 (260) T ss_pred ECCCHHHHHHHHHHHHH----HHHCCCEEEEE-CCCCCCCCCCCEEEECCHHHHHHHHHCCCCC-CCEEEEEECHHHHHH T ss_conf 57973589999999999----97619817996-0883536998668856778887765303445-312444011177899 Q ss_pred HHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHHH Q ss_conf 862486507999998264215899998749589667612894100799986288986101234566 Q gi|254781133|r 89 YPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLRGRNAAENIYVITHEAWMHNKINRQSS 154 (159) Q Consensus 89 ~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~G~~~d~~v~~l~~~eW~~~~~~~~~~ 154 (159) ++ +-.+-.+|.+.+.|+- -|+.=+-++||....+-=++|++-.+.-++|+++ |+..-+-=|.| T Consensus 147 ~~-~~~~~~~v~~rvlP~~-~al~~~~~~G~p~~~Iva~~G~~~~~~e~aLlr~-y~~~avvTK~S 209 (260) T TIGR00715 147 VV-NSQDKEKVIVRVLPDG-TALEKALKLGLPSDRIVAMRGPFSEELEKALLRE-YRIDAVVTKES 209 (260) T ss_pred HH-HHCCCCEEEEEECCCC-HHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHH-HCCCEEEECCC T ss_conf 97-4037727999988880-2379999658981127872789771367888886-20696898576 No 55 >COG2153 ElaA Predicted acyltransferase [General function prediction only] Probab=51.86 E-value=18 Score=16.67 Aligned_cols=79 Identities=11% Similarity=0.015 Sum_probs=53.7 Q ss_pred EEECCCCEEEEEEEEEEEECCCCCCCEEEEC----CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHC Q ss_conf 9982576076799999983467831136863----776603599999999986248650799999826421589999874 Q gi|254781133|r 42 MGILKSNFLVAGIIYHNYCPISRVIELSGAS----DCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSL 117 (159) Q Consensus 42 i~i~~~g~lvG~v~l~~~~~~~~~~eig~~~----~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ekl 117 (159) ++...+|++|++.-+.........+.||-.+ .++.+++...+...+..+-++..-.-++. .....-...+.+. T Consensus 53 ~~~~~~g~LvAyaRLlp~~~~~~~~~iGRV~v~~~~RG~glG~~Lm~~AL~~~~~~~p~~~v~l---~AQahLq~fYa~~ 129 (155) T COG2153 53 LGWTPDGELVAYARLLPPGAEYEEVSIGRVIVSPAARGQGLGQQLMEKALETAGREWPDKPVYL---GAQAHLQDFYASF 129 (155) T ss_pred EEECCCCEEEEEEECCCCCCCCCCEEEEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEE---EHHHHHHHHHHHH T ss_conf 9986999699998607777775752554699777562245459999999999886289998698---4178799999970 Q ss_pred CCEEEE Q ss_conf 958966 Q gi|254781133|r 118 GGIRYR 123 (159) Q Consensus 118 GF~~eg 123 (159) ||++.+ T Consensus 130 GFv~~~ 135 (155) T COG2153 130 GFVRVG 135 (155) T ss_pred CCEECC T ss_conf 967767 No 56 >PHA00432 internal virion protein A Probab=48.93 E-value=20 Score=16.40 Aligned_cols=43 Identities=21% Similarity=0.186 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHCCC-CEEEEEEEEECCCCHHHHHHHHCCCEEE Q ss_conf 999999999862486-5079999982642158999987495896 Q gi|254781133|r 80 RSVLKEIYSYPWNQL-CCQAVIHRIPDEDYPQHRMLTSLGGIRY 122 (159) Q Consensus 80 ~ea~~~l~~~aF~~l-~l~Ri~~~~~~~N~~S~rv~eklGF~~e 122 (159) +|.-+.++.|.-.-| ....++-.|.++|.+.||.++.+|++-. T Consensus 77 ~eFrk~I~e~~d~ll~~y~~lwNyVw~gN~shIrfLksiGa~Fh 120 (138) T PHA00432 77 REFRKLIMEYRDKMLDQYPTLWNYVWVGNKSHIRFLKSIGAVFH 120 (138) T ss_pred HHHHHHHHHHHHHHHHHCHHHEEEEEECCHHHHHHHHHHCCEEH T ss_conf 99999999999999874412036887088889999987352301 No 57 >pfam11090 DUF2833 Protein of unknown function (DUF2833). This family of proteins with unknown function are found in the bacteriophage T7. Some of the members of this family are annotated as gene 13 protein. Probab=46.76 E-value=22 Score=16.20 Aligned_cols=42 Identities=24% Similarity=0.222 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHCCC-CEEEEEEEEECCCCHHHHHHHHCCCEE Q ss_conf 999999999862486-507999998264215899998749589 Q gi|254781133|r 80 RSVLKEIYSYPWNQL-CCQAVIHRIPDEDYPQHRMLTSLGGIR 121 (159) Q Consensus 80 ~ea~~~l~~~aF~~l-~l~Ri~~~~~~~N~~S~rv~eklGF~~ 121 (159) ++..+.+..|.-.-| ....++-.|..+|..-||.++.+||+- T Consensus 39 ~efrk~i~eyrD~ll~~Y~~LwNyVw~gN~sHIRfLKsiGavF 81 (86) T pfam11090 39 REFRKLIHEYRDGMLDQYQTIWNYVWVGNKSHIRFLKSIGAVF 81 (86) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCEE T ss_conf 9999999999999987360003788709889999999716110 No 58 >TIGR01211 ELP3 histone acetyltransferase, ELP3 family; InterPro: IPR005910 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones . Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. The yeast GCN5 (yGCN5) transcriptional coactivator functions as a histone acetyltransferase (HAT) to promote transcriptional activation. The crystal structure of the yeast histone acetyltransferase Hat1-acetyl coenzyme A (AcCoA) shows that Hat1 has an elongated, curved structure, and the AcCoA molecule is bound in a cleft on the concave surface of the protein, marking the active site of the enzyme. A channel of variable width and depth that runs across the protein is probably the binding site for the histone substrate . The central protein core associated with AcCoA binding that appears to be structurally conserved among a superfamily of N-acetyltransferases, including yeast histone acetyltransferase 1 and Serratia marcescens aminoglycoside 3-N-acetyltransferase . The Saccharomyces cerevisiae member YPL086C has been characterised in vitro as an N-terminal acetyltransferase (2.3.1.48 from EC) for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. In vivo, ELP3 gene deletion confers typical ELP phenotypes such as slow growth adaptation, slow gene activation, and temperature sensitivity. This suggests a role for the proteins as novel, tightly RNAPII-associated histone acetyltransferases in transcription of DNA packaged in chromatin .. Probab=46.73 E-value=22 Score=16.20 Aligned_cols=88 Identities=15% Similarity=0.063 Sum_probs=63.5 Q ss_pred CCCCCCEEEEEECCCCEEEEEEEEEEEECC-CCCC-----------CEEE-----E--------CCCCCCHHHHHHHHHH Q ss_conf 148785499998257607679999998346-7831-----------1368-----6--------3776603599999999 Q gi|254781133|r 33 SSGWDRFVSMGILKSNFLVAGIIYHNYCPI-SRVI-----------ELSG-----A--------SDCKSWLSRSVLKEIY 87 (159) Q Consensus 33 ~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~-~~~~-----------eig~-----~--------~~~~~~~~~ea~~~l~ 87 (159) +.|...|+-+++.+.+-|||.+-|+--+.. ...- ||.+ + .+..+|||++.|.-+= T Consensus 453 ~GG~E~FlS~Ed~~~D~LiGFlRLR~P~e~ya~r~e~d~~~TAlvRELHVYG~~vP~g~D~~~~~~QHrGyG~~Ll~EAE 532 (573) T TIGR01211 453 SGGREIFLSYEDPKNDILIGFLRLRFPSEPYAHRKEVDAQRTALVRELHVYGSEVPIGEDRGEDEWQHRGYGRRLLEEAE 532 (573) T ss_pred CCCEEEEEEEECCCCCEEEEEEECCCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 88568987731100040021000068887213455430576103210321057212567787776444425578999999 Q ss_pred HHHHCCCCEEEEEEEEECCCCHHHHHHHH-CCCEEEE Q ss_conf 98624865079999982642158999987-4958966 Q gi|254781133|r 88 SYPWNQLCCQAVIHRIPDEDYPQHRMLTS-LGGIRYR 123 (159) Q Consensus 88 ~~aF~~l~l~Ri~~~~~~~N~~S~rv~ek-lGF~~eg 123 (159) +-|=++.|++||... .=...+-=|.| ||+.++| T Consensus 533 riA~EefG~~KIlVI---SGiGVReYYrKqlGY~~~G 566 (573) T TIGR01211 533 RIAKEEFGLEKILVI---SGIGVREYYRKQLGYEKDG 566 (573) T ss_pred HHHHHCCCCEEEEEE---CCCCCCCHHHHHCCCCCCC T ss_conf 985630697016785---1410100002232632137 No 59 >COG5628 Predicted acetyltransferase [General function prediction only] Probab=45.74 E-value=23 Score=16.11 Aligned_cols=95 Identities=11% Similarity=0.029 Sum_probs=54.3 Q ss_pred EEEEECCCCEEEEEEEEEEEECCCCC-----CCEEE-ECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHH Q ss_conf 99998257607679999998346783-----11368-6377660359999999998624865079999982642158999 Q gi|254781133|r 40 VSMGILKSNFLVAGIIYHNYCPISRV-----IELSG-ASDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRM 113 (159) Q Consensus 40 ~ai~i~~~g~lvG~v~l~~~~~~~~~-----~eig~-~~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv 113 (159) -+.-++-+|.+||.+-+-+......- .|+-+ .-..+.|+++.+.+++..- ..++ -+.-+.++|.|+++. T Consensus 38 ~~~~~~~~~~~igf~l~L~~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~---~~g~--w~Va~i~EN~PA~~f 112 (143) T COG5628 38 EAWLFRIGGLPVGFALVLDLAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGS---AWGV--WQVATVRENTPARAF 112 (143) T ss_pred CEEEEEECCCCEEEEEEECCCCCCCCCCCCCCEEEEEEHHHCCCHHHHHHHHHHHH---HHCE--EEEEEECCCCCHHHH T ss_conf 52599988941455344325688874323620068760430466539999999987---6145--899995278806889 Q ss_pred HHHCCCEEEEEECCCCCEEEEEEEEE Q ss_conf 98749589667612894100799986 Q gi|254781133|r 114 LTSLGGIRYRIPRLRGRNAAENIYVI 139 (159) Q Consensus 114 ~eklGF~~egv~R~~G~~~d~~v~~l 139 (159) ..++-.+.-=+....-.-++...|.+ T Consensus 113 wK~~~~t~~i~~E~r~d~~~d~iy~i 138 (143) T COG5628 113 WKRVAETYPVVEEDRQDARWDGIYAV 138 (143) T ss_pred HHHCCCCCCCCHHHHHCCCCCCCEEE T ss_conf 98620224552466642268873588 No 60 >pfam04958 AstA Arginine N-succinyltransferase beta subunit. Arginine N-succinyltransferase EC:2.3.1.109 catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism by the arginine succinyltransferase pathway. Probab=45.45 E-value=23 Score=16.08 Aligned_cols=46 Identities=9% Similarity=0.073 Sum_probs=31.0 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCC------CCC-CEEEEEECCCCEEEEEEEEE Q ss_conf 98799989999999997200114------878-54999982576076799999 Q gi|254781133|r 12 NAKPRINQIIADFVAKRIKDCSS------GWD-RFVSMGILKSNFLVAGIIYH 57 (159) Q Consensus 12 ~~~P~t~e~~~~wi~~~~~~~~~------~~~-~~~ai~i~~~g~lvG~v~l~ 57 (159) ++-|..++.+.+-|++....-.. +.. ..+.++..++|++||++++. T Consensus 26 TsLP~d~~~L~~rI~~S~~Sf~~~~~~~~~~~~YlFVLEd~~tg~vvGts~I~ 78 (338) T pfam04958 26 TSLPADRELLRAKIARSEASFAGEEVDQPGDEGYLFVLEDTETGEVVGTSGIE 78 (338) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCEEEEEEEEE T ss_conf 55999999999999999998646777997752179999715788488887666 No 61 >pfam04377 ATE_C Arginine-tRNA-protein transferase, C terminus. This family represents the C terminal region of the enzyme arginine-tRNA-protein transferase (EC 2.3.2.8), which catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N-end rule pathway of protein degradation by conjugating a destabilising amino acid to the amino terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N terminal cysteine is sometimes modified. Probab=44.49 E-value=24 Score=16.00 Aligned_cols=98 Identities=11% Similarity=-0.094 Sum_probs=60.3 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEE--ECCCCCCHHHHHHHHHHHH Q ss_conf 98799989999999997200114878549999825760767999999834678311368--6377660359999999998 Q gi|254781133|r 12 NAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSG--ASDCKSWLSRSVLKEIYSY 89 (159) Q Consensus 12 ~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~--~~~~~~~~~~ea~~~l~~~ 89 (159) .-.|.+.+....|+... +....-++.+.+|+|||...+ |+-+..-++...+ +...+..+|+-++...+++ T Consensus 19 ~m~~~~~~~y~~fl~~s-------~~~t~~~eyr~~~~Liav~v~-D~l~dglSaVY~ffdP~~~~~SlG~~~iL~~I~~ 90 (128) T pfam04377 19 GMDPPDREQYASFLEDS-------PLGTRFLEYRLDGKLIAVAVT-DILPDGLSAVYTFYDPDYSKRSLGTYSILWQIEL 90 (128) T ss_pred CCCCCCHHHHHHHHHCC-------CCCCEEEEEEECCEEEEEEEE-ECCCCCCHHEEEECCCCCCCCCCHHHHHHHHHHH T ss_conf 98988999999998468-------988789999828968999998-4277420104565372200036318999999999 Q ss_pred HHCCCCEEEEEEEEECCCCHHHHHHHHCCCE Q ss_conf 6248650799999826421589999874958 Q gi|254781133|r 90 PWNQLCCQAVIHRIPDEDYPQHRMLTSLGGI 120 (159) Q Consensus 90 aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~ 120 (159) | .++++.-+.+.-...|.+.+ .=|.-|+ T Consensus 91 a-k~~~~~y~YLGY~I~~c~kM--~YK~~f~ 118 (128) T pfam04377 91 A-KELGLPYVYLGYWIKGCPKM--NYKARFR 118 (128) T ss_pred H-HHCCCCEEECEEEECCCCCC--CCCCCCC T ss_conf 9-98299738640586688766--2645688 No 62 >TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST). Probab=43.80 E-value=25 Score=15.93 Aligned_cols=46 Identities=7% Similarity=0.044 Sum_probs=29.5 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCC----CC-CCEEEEEECCCCEEEEEEEEE Q ss_conf 98799989999999997200114----87-854999982576076799999 Q gi|254781133|r 12 NAKPRINQIIADFVAKRIKDCSS----GW-DRFVSMGILKSNFLVAGIIYH 57 (159) Q Consensus 12 ~~~P~t~e~~~~wi~~~~~~~~~----~~-~~~~ai~i~~~g~lvG~v~l~ 57 (159) ++-|..++.+.+-|++....... +. ...+.++..++|++||++++. T Consensus 25 TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~~YlFVLEd~~~g~vvGts~I~ 75 (336) T TIGR03244 25 TSLPANEDLLSARIERAEKTFSGELTRAEQGYLFVLEDTETGTVAGVSAIE 75 (336) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEEEEE T ss_conf 559999999999999999986587887763189999877899399987467 No 63 >TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase; InterPro: IPR004558 This family represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. HemN catalyses the anaerobic transformation of coproporhyrinogen-III into protoporphyrinogen-IX during porphyrin biosynthesis. Several species, including Escherichia coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity, IPR004559 from INTERPRO. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterised protein.; GO: 0004109 coproporphyrinogen oxidase activity, 0006779 porphyrin biosynthetic process, 0005737 cytoplasm. Probab=41.93 E-value=26 Score=15.76 Aligned_cols=79 Identities=18% Similarity=0.296 Sum_probs=52.8 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHH Q ss_conf 57368987999899999999972001148785499998257607679999998346783113686377660359999999 Q gi|254781133|r 7 ITWGGNAKPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVLKEI 86 (159) Q Consensus 7 ~~W~~~~~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l 86 (159) +=|||.++-|-...-.+|+...+.+. +..-+..+||++-+++++ ...+.+.++ T Consensus 108 LHwGGGTP~YL~~~Q~~~l~~~i~~~--------------------------F~nf~~daEiSiEidPR~-~~~e~~~~L 160 (462) T TIGR00538 108 LHWGGGTPTYLSPEQIEELMKEIREA--------------------------FPNFSEDAEISIEIDPRY-LTKEVIKAL 160 (462) T ss_pred EECCCCCCCCCCHHHHHHHHHHHHHH--------------------------HHHHCCCCEEEEEECCCC-CCHHHHHHH T ss_conf 76278983337889999999999987--------------------------320115844776523741-378899999 Q ss_pred HHHHHCCCCEEEEEEEEECCCCHHHHHHHHC Q ss_conf 9986248650799999826421589999874 Q gi|254781133|r 87 YSYPWNQLCCQAVIHRIPDEDYPQHRMLTSL 117 (159) Q Consensus 87 ~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ekl 117 (159) -+ .|.+||...|-..|..-+++.-|. T Consensus 161 ~~-----~GFNRlS~GvQDfd~~VQ~avnR~ 186 (462) T TIGR00538 161 RD-----EGFNRLSFGVQDFDKEVQQAVNRI 186 (462) T ss_pred HH-----CCCCEEECCEECCCHHHHHHHCCC T ss_conf 97-----589664235210785554443134 No 64 >TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST). Probab=38.77 E-value=30 Score=15.47 Aligned_cols=30 Identities=27% Similarity=0.206 Sum_probs=24.5 Q ss_pred CEEEEEEEEECCCCHHHHHHHHCCCEEEEE Q ss_conf 507999998264215899998749589667 Q gi|254781133|r 95 CCQAVIHRIPDEDYPQHRMLTSLGGIRYRI 124 (159) Q Consensus 95 ~l~Ri~~~~~~~N~~S~rv~eklGF~~egv 124 (159) ..+++...++++-.++.+++++-||+..|. T Consensus 218 ~Aq~vIG~vH~~t~pA~~lLe~EGF~~~gy 247 (335) T TIGR03243 218 EAQDVIGQVHPDTRPARAMLESEGFRYQGY 247 (335) T ss_pred HHHHHHCCCCCCCHHHHHHHHHCCCCCCCC T ss_conf 999972984846799999999739976896 No 65 >TIGR00758 UDG_fam4 uracil-DNA glycosylase, family 4; InterPro: IPR005273 This well-conserved family of proteins is about 200 residues in length and homologous to the N-terminus of the DNA polymerase of phage SPO1 of Bacillus subtilis. The function of these proteins is unknown. . Probab=38.45 E-value=18 Score=16.78 Aligned_cols=54 Identities=17% Similarity=0.023 Sum_probs=47.8 Q ss_pred HHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHCCCEEEEEECCCCCEEE Q ss_conf 999999999862486507999998264215899998749589667612894100 Q gi|254781133|r 80 RSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSLGGIRYRIPRLRGRNAA 133 (159) Q Consensus 80 ~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~egv~R~~G~~~d 133 (159) .+=+.+-..|.-.|+.++|=-+-|..+-.|+..++.+.||.-+++-+-+|++.. T Consensus 84 ~eE~~~C~pyL~~~i~lI~Pkvi~~lGr~Aa~~~l~~~gl~~~~I~k~rG~vf~ 137 (185) T TIGR00758 84 PEEVEACAPYLVKQIELIRPKVIICLGRTAAQSILGKAGLEFEGITKVRGRVFE 137 (185) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHCCCCCCCCCEEEEEEEE T ss_conf 899985157889999750980689960278898876528762673012115773 No 66 >TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit. Probab=36.51 E-value=32 Score=15.26 Aligned_cols=47 Identities=6% Similarity=-0.004 Sum_probs=30.7 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCC----CCC--CEEEEEECCCCEEEEEEEEE Q ss_conf 898799989999999997200114----878--54999982576076799999 Q gi|254781133|r 11 GNAKPRINQIIADFVAKRIKDCSS----GWD--RFVSMGILKSNFLVAGIIYH 57 (159) Q Consensus 11 ~~~~P~t~e~~~~wi~~~~~~~~~----~~~--~~~ai~i~~~g~lvG~v~l~ 57 (159) -++-|..++.+.+-|++....... ..+ ..+.++..++|++||++++. T Consensus 24 ~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~e~YlFVLEd~~tg~vvG~s~I~ 76 (336) T TIGR03245 24 VTSLPADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIV 76 (336) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCEEEEEEEEE T ss_conf 65599999999999999999863556788874089999856889189987478 No 67 >cd02034 CooC The accessory protein CooC, which contains a nucleotide-binding domain (P-loop) near the N-terminus, participates in the maturation of the nickel center of carbon monoxide dehydrogenase (CODH). CODH from Rhodospirillum rubrum catalyzes the reversible oxidation of CO to CO2. CODH contains a nickel-iron-sulfur cluster (C-center) and an iron-sulfur cluster (B-center). CO oxidation occurs at the C-center. Three accessory proteins encoded by cooCTJ genes are involved in nickel incorporation into a nickel site. CooC functions as a nickel insertase that mobilizes nickel to apoCODH using energy released from ATP hydrolysis. CooC is a homodimer and has NTPase activities. Mutation at the P-loop abolishs its function. Probab=35.19 E-value=34 Score=15.14 Aligned_cols=40 Identities=10% Similarity=0.108 Sum_probs=21.6 Q ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHH Q ss_conf 6377660359999999998624865079999982642158 Q gi|254781133|r 71 ASDCKSWLSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQ 110 (159) Q Consensus 71 ~~~~~~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S 110 (159) ++.+++|.++..+.+++...|.+.+..-+...++|+|.++ T Consensus 3 a~~GKGGvGKtt~~~~la~~l~~~g~~vl~iD~Dp~dlpe 42 (116) T cd02034 3 AITGKGGVGKTTIAALLARYLAEKGKPVLAIDADPDDLPE 42 (116) T ss_pred EEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHH T ss_conf 8988997749999999999999789969999898971235 No 68 >PRK01305 arginyl-tRNA-protein transferase; Provisional Probab=31.78 E-value=39 Score=14.81 Aligned_cols=97 Identities=11% Similarity=-0.060 Sum_probs=59.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEE-EEC-CCCCCHHHHHHHHHHHHHH Q ss_conf 79998999999999720011487854999982576076799999983467831136-863-7766035999999999862 Q gi|254781133|r 14 KPRINQIIADFVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELS-GAS-DCKSWLSRSVLKEIYSYPW 91 (159) Q Consensus 14 ~P~t~e~~~~wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig-~~~-~~~~~~~~ea~~~l~~~aF 91 (159) .|.+.+....|+... +....-++...+|+|||...+ |+-+..-++-.. +.. ..+..+|+-++...+++| T Consensus 126 ~~~~~~~y~~fl~~s-------~~~t~~~e~~~~~~Li~v~v~-D~l~dglSAVYtFyDP~~~~~SLGt~~IL~~I~~a- 196 (241) T PRK01305 126 DPPSRDQYAQFLEDS-------WVNTRFIEFRLDGRLVAVAVT-DVLDDGLSAVYTFYDPDEEHRSLGTFAILWQIEQA- 196 (241) T ss_pred CCCCHHHHHHHHHCC-------CCCCEEEEEEECCCEEEEEEE-ECCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHH- T ss_conf 864899999998278-------987689999808947999998-35886432360120756566897599999999999- Q ss_pred CCCCEEEEEEEEECCCCHHHHHHHHCCCEE Q ss_conf 486507999998264215899998749589 Q gi|254781133|r 92 NQLCCQAVIHRIPDEDYPQHRMLTSLGGIR 121 (159) Q Consensus 92 ~~l~l~Ri~~~~~~~N~~S~rv~eklGF~~ 121 (159) .++++.-+.+.-...+.+. +.=|..|+- T Consensus 197 ~~~~l~y~YLGYwI~~c~k--M~YK~~f~P 224 (241) T PRK01305 197 KRLGLPYVYLGYWIKGSRK--MNYKARFRP 224 (241) T ss_pred HHCCCCEEEEEEEECCCCC--CCCCCCCCC T ss_conf 9859986952268788877--777557883 No 69 >COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics] Probab=30.69 E-value=41 Score=14.70 Aligned_cols=15 Identities=13% Similarity=-0.097 Sum_probs=5.9 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 589999874958966 Q gi|254781133|r 109 PQHRMLTSLGGIRYR 123 (159) Q Consensus 109 ~S~rv~eklGF~~eg 123 (159) ..+.-+.|+|+..+| T Consensus 494 G~ReYy~k~GY~~~g 508 (515) T COG1243 494 GVREYYRKLGYELDG 508 (515) T ss_pred CHHHHHHHHCCCCCC T ss_conf 289999973863347 No 70 >PRK10556 hypothetical protein; Provisional Probab=29.98 E-value=42 Score=14.63 Aligned_cols=17 Identities=18% Similarity=0.098 Sum_probs=9.3 Q ss_pred HHHHHHCCCEEEEEECC Q ss_conf 99998749589667612 Q gi|254781133|r 111 HRMLTSLGGIRYRIPRL 127 (159) Q Consensus 111 ~rv~eklGF~~egv~R~ 127 (159) .+|+||.||+..-|--+ T Consensus 9 ArVLe~aGF~~d~vt~k 25 (111) T PRK10556 9 ARVLEKAGFTVDVVTQK 25 (111) T ss_pred HHHHHHCCCEEEEEEEE T ss_conf 99999669347886400 No 71 >PRK10456 arginine succinyltransferase; Provisional Probab=26.57 E-value=48 Score=14.27 Aligned_cols=47 Identities=6% Similarity=-0.010 Sum_probs=29.9 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCC----CC-CCEEEEEECCCCEEEEEEEEE Q ss_conf 898799989999999997200114----87-854999982576076799999 Q gi|254781133|r 11 GNAKPRINQIIADFVAKRIKDCSS----GW-DRFVSMGILKSNFLVAGIIYH 57 (159) Q Consensus 11 ~~~~P~t~e~~~~wi~~~~~~~~~----~~-~~~~ai~i~~~g~lvG~v~l~ 57 (159) -++-|..++.+.+-|++....... +. ...+.++..++|++||++++. T Consensus 26 ~TsLP~d~~~L~~rI~~S~~sf~~~~~~~~~~YlFVLED~~tg~vvGts~I~ 77 (344) T PRK10456 26 LTSLPANEATLAARIERALKTWQGELPKSEQGYVFVLEDSETGTVAGICAIE 77 (344) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCEEEEEEEEE T ss_conf 7659999999999999999986478997643179999855788088887578 No 72 >TIGR01263 4HPPD 4-hydroxyphenylpyruvate dioxygenase; InterPro: IPR005956 4-hydroxyphenylpyruvate dioxygenase (1.13.11.27 from EC) oxidizes 4-hydroxyphenylpyruvate, a tyrosine and phenylalanine catabolite, to homogentisate. Homogentisate can undergo a further non-enzymatic oxidation and polymerization into brown pigments that protect some bacterial species from light. A similar process occurs spontaneously in blood and is hemolytic . In some bacterial species, this enzyme has been studied as a hemolysin.; GO: 0003868 4-hydroxyphenylpyruvate dioxygenase activity, 0009072 aromatic amino acid family metabolic process. Probab=25.35 E-value=51 Score=14.14 Aligned_cols=30 Identities=20% Similarity=0.142 Sum_probs=17.4 Q ss_pred CEEEEEEEEECCC-CHHHHHHHHCCCEEEEE Q ss_conf 5079999982642-15899998749589667 Q gi|254781133|r 95 CCQAVIHRIPDED-YPQHRMLTSLGGIRYRI 124 (159) Q Consensus 95 ~l~Ri~~~~~~~N-~~S~rv~eklGF~~egv 124 (159) +.+-|+..|...+ .++..+..++||+..+. T Consensus 2 gfd~~Ef~v~d~~i~a~~~~~~~~GF~~vA~ 32 (379) T TIGR01263 2 GFDFVEFYVGDAKIQAAHYLFTRLGFEKVAK 32 (379) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHCCCHHHC T ss_conf 6038987438878998899999725301003 No 73 >pfam09400 DUF2002 Protein of unknown function (DUF2002). This is a family of putative cytoplasmic proteins. The structure of these proteins form an antiparallel beta and sheet and contain some alpha helical regions. Probab=24.96 E-value=52 Score=14.10 Aligned_cols=17 Identities=18% Similarity=0.075 Sum_probs=9.2 Q ss_pred HHHHHHCCCEEEEEECC Q ss_conf 99998749589667612 Q gi|254781133|r 111 HRMLTSLGGIRYRIPRL 127 (159) Q Consensus 111 ~rv~eklGF~~egv~R~ 127 (159) .+|+||.||+..-+--+ T Consensus 9 ArVLe~aGF~~d~vt~k 25 (110) T pfam09400 9 ARVLEKAGFTVDVVTNK 25 (110) T ss_pred HHHHHHCCCEEEEEECC T ss_conf 99999739447874021 No 74 >COG3053 CitC Citrate lyase synthetase [Energy production and conversion] Probab=24.58 E-value=52 Score=14.05 Aligned_cols=98 Identities=9% Similarity=0.012 Sum_probs=55.9 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCEEEEE-ECCCCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 989999999997200114878549999-8257607679999998346783113686377660359999999998624865 Q gi|254781133|r 17 INQIIADFVAKRIKDCSSGWDRFVSMG-ILKSNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVLKEIYSYPWNQLC 95 (159) Q Consensus 17 t~e~~~~wi~~~~~~~~~~~~~~~ai~-i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~~~l~~~aF~~l~ 95 (159) ....++.|....--.- ....-+.+. ..+++++|++=++..- .-+.+-+.-..++ .|.+-..+..+++++++ +| T Consensus 16 k~~~i~~fL~~~~l~~--d~~ve~~v~~~~~~~~iiacGsiaGn--vikcvAvs~s~qG-eGl~lkl~TeLin~ay~-~g 89 (352) T COG3053 16 KMAEIAEFLHQNDLRV--DTTVEYFVAIYRDNEEIIACGSIAGN--VIKCVAVSESLQG-EGLALKLVTELINLAYE-RG 89 (352) T ss_pred HHHHHHHHHHHCCCEE--CCCCEEEEEEECCCCCEEEECCCCCC--EEEEEEECHHCCC-CCHHHHHHHHHHHHHHH-CC T ss_conf 7999999886347511--36520899998178868995340135--6689996420156-30899999999999997-49 Q ss_pred EEEEEEEEECCCCHHHHHHHHCCCEEEE Q ss_conf 0799999826421589999874958966 Q gi|254781133|r 96 CQAVIHRIPDEDYPQHRMLTSLGGIRYR 123 (159) Q Consensus 96 l~Ri~~~~~~~N~~S~rv~eklGF~~eg 123 (159) ...+...+-+. +.+++..|||.+.. T Consensus 90 ~~hLFiyTKp~---~~~lFk~~GF~~i~ 114 (352) T COG3053 90 RTHLFIYTKPE---YAALFKQCGFSEIA 114 (352) T ss_pred CCEEEEEECHH---HHHHHHHCCCEEEE T ss_conf 82699994666---79988738966864 No 75 >PRK13886 conjugal transfer protein TraL; Provisional Probab=23.91 E-value=54 Score=13.98 Aligned_cols=12 Identities=8% Similarity=-0.133 Sum_probs=3.9 Q ss_pred HHHHHCCCEEEE Q ss_conf 999874958966 Q gi|254781133|r 112 RMLTSLGGIRYR 123 (159) Q Consensus 112 rv~eklGF~~eg 123 (159) +++++-..+-.| T Consensus 170 k~y~~~k~ri~~ 181 (241) T PRK13886 170 KAYTANKARVSA 181 (241) T ss_pred HHHHHHHHHEEE T ss_conf 999975442589 No 76 >KOG2239 consensus Probab=23.25 E-value=49 Score=14.25 Aligned_cols=19 Identities=16% Similarity=0.211 Sum_probs=11.8 Q ss_pred CHHHHHHHHCCCEE-EEEEC Q ss_conf 15899998749589-66761 Q gi|254781133|r 108 YPQHRMLTSLGGIR-YRIPR 126 (159) Q Consensus 108 ~~S~rv~eklGF~~-egv~R 126 (159) ..+++++.|||+.+ +||.| T Consensus 67 kKark~m~KLGLk~v~gV~R 86 (209) T KOG2239 67 KKARKAMLKLGLKQVTGVTR 86 (209) T ss_pred HHHHHHHHHCCCCCCCCEEE T ss_conf 99999998649724146247 No 77 >KOG4700 consensus Probab=21.74 E-value=46 Score=14.40 Aligned_cols=28 Identities=18% Similarity=0.466 Sum_probs=19.1 Q ss_pred CCCCEEEECCCCCCCCHHHHHHHHHHHHH Q ss_conf 77620573689879998999999999720 Q gi|254781133|r 2 YNTIEITWGGNAKPRINQIIADFVAKRIK 30 (159) Q Consensus 2 ~n~~~~~W~~~~~P~t~e~~~~wi~~~~~ 30 (159) |...+++|++...-. ..++++++.+... T Consensus 72 FS~~~vYWm~~~~ge-N~e~e~~L~rs~~ 99 (207) T KOG4700 72 FSQVSVYWMCRGDGE-NSEIEDFLERSKH 99 (207) T ss_pred HHHHEEEEEECCCCC-HHHHHHHHHHHHH T ss_conf 014235777358853-7899999999799 No 78 >KOG4387 consensus Probab=21.59 E-value=60 Score=13.70 Aligned_cols=45 Identities=13% Similarity=0.108 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHC---CCEEE Q ss_conf 3599999999986248650799999826421589999874---95896 Q gi|254781133|r 78 LSRSVLKEIYSYPWNQLCCQAVIHRIPDEDYPQHRMLTSL---GGIRY 122 (159) Q Consensus 78 ~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N~~S~rv~ekl---GF~~e 122 (159) ..++.+-++++||-++|.+.+|.+....+|.-=-.+.+.+ ||.-. T Consensus 116 gsKe~lvalLEfAEekl~~d~Vfi~F~K~R~dr~~LlrtfsyvGFEpv 163 (191) T KOG4387 116 GSKEGLVALLEFAEEKLHVDKVFICFDKNREDRAALLRTFSYVGFEPV 163 (191) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHEHCCEEEEC T ss_conf 225768999999888633536999985486670766441110102642 No 79 >PRK13239 alkylmercury lyase; Provisional Probab=21.24 E-value=61 Score=13.66 Aligned_cols=85 Identities=13% Similarity=0.108 Sum_probs=45.2 Q ss_pred CCCCCCCHHHHHH-------HHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEEEEEECCCCCCCEEEECCCCCCHHHHHH Q ss_conf 8987999899999-------999972001148785499998257607679999998346783113686377660359999 Q gi|254781133|r 11 GNAKPRINQIIAD-------FVAKRIKDCSSGWDRFVSMGILKSNFLVAGIIYHNYCPISRVIELSGASDCKSWLSRSVL 83 (159) Q Consensus 11 ~~~~P~t~e~~~~-------wi~~~~~~~~~~~~~~~ai~i~~~g~lvG~v~l~~~~~~~~~~eig~~~~~~~~~~~ea~ 83 (159) ..++|-+.+.++. -+...+.... .++..++|++||. +|.- .+....+++ .++..|+=.|+ T Consensus 32 a~G~PVs~~~LA~~~g~~~e~v~~~L~~~~-------~teyD~~G~Iig~-GLTl-rpT~Hrf~v----~G~~LyTWCAl 98 (206) T PRK13239 32 AKGRPVSRETLAGALGWPVEEVAAVLEQAP-------DTEYDEDGRIIGY-GLTL-RPTPHRFEV----DGRQLYTWCAL 98 (206) T ss_pred HCCCCCCHHHHHHHHCCCHHHHHHHHHHCC-------CEEECCCCCEEEE-ECCC-CCCCCEEEE----CCEEEEHHHHH T ss_conf 569999999999886999999999998577-------7258589858865-0411-666867898----99255578888 Q ss_pred HHHHHHHHCCCCEEEEEEEEECCCCH Q ss_conf 99999862486507999998264215 Q gi|254781133|r 84 KEIYSYPWNQLCCQAVIHRIPDEDYP 109 (159) Q Consensus 84 ~~l~~~aF~~l~l~Ri~~~~~~~N~~ 109 (159) .+++==+.- -.--+|...|...+.+ T Consensus 99 DaL~fPall-g~~a~V~S~Cp~tG~p 123 (206) T PRK13239 99 DTLMFPALI-GRTARVSSHCPATGAP 123 (206) T ss_pred HHHHHHHHH-CCCEEEEECCCCCCCE T ss_conf 876447862-9852798459999974 No 80 >TIGR02826 RNR_activ_nrdG3 anaerobic ribonucleoside-triphosphate reductase activating protein; InterPro: IPR014191 Members of this entry represent a set of proteins related to, yet architecturally different from, the activating protein for the glycine radical-containing, oxygen-sensitive ribonucleoside-triphosphate reductase (RNR, see IPR012837 from INTERPRO). Members of this entry are found paired with members of a similarly divergent set of anaerobic ribonucleoside-triphosphate reductases. Identification of these proteins as RNR activating proteins is partly from pairing with the candidate RNR and further supported by the finding that upstream of these operons are examples of a conserved regulatory element that is found in nearly all bacteria and that occurs specifically upstream of operons for all three classes of RNR genes .. Probab=20.60 E-value=63 Score=13.58 Aligned_cols=25 Identities=16% Similarity=0.320 Sum_probs=20.9 Q ss_pred EE-CCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 73-68987999899999999972001 Q gi|254781133|r 8 TW-GGNAKPRINQIIADFVAKRIKDC 32 (159) Q Consensus 8 ~W-~~~~~P~t~e~~~~wi~~~~~~~ 32 (159) +| .+.+.|.|.+.++.++++....- T Consensus 39 lW~~d~G~~Lt~e~~~~~l~~y~~~I 64 (165) T TIGR02826 39 LWHEDEGTPLTEEALAQLLDKYRSLI 64 (165) T ss_pred HCCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 11356798889899999998740001 No 81 >TIGR02768 TraA_Ti Ti-type conjugative transfer relaxase TraA; InterPro: IPR014136 This entry represents the Ti-type conjugative transfer relaxase TraA. TraA contains domains distinctive of a single strand exonuclease (N-terminus, MobA/MobL, IPR005053 from INTERPRO) as well as a helicase domain (central region, homologous to the corresponding region of the F-type relaxase TraI, IPR014129 from INTERPRO). This protein likely fills the same role as TraI(F), nicking (at the oriT site) and unwinding the coiled plasmid prior to conjugative transfer .. Probab=20.42 E-value=39 Score=14.79 Aligned_cols=51 Identities=10% Similarity=0.085 Sum_probs=27.3 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEEECCC-------CHHHHHHHHCCCEE-EEEECCC Q ss_conf 60359999999998624865079999982642-------15899998749589-6676128 Q gi|254781133|r 76 SWLSRSVLKEIYSYPWNQLCCQAVIHRIPDED-------YPQHRMLTSLGGIR-YRIPRLR 128 (159) Q Consensus 76 ~~~~~ea~~~l~~~aF~~l~l~Ri~~~~~~~N-------~~S~rv~eklGF~~-egv~R~~ 128 (159) |-.+..-|.-++.-| .+-|. ||.+-=++.- .+=+.+++++||.+ ++|+||+ T Consensus 511 GMV~S~Qm~r~l~~A-~~AGa-KvVLvGD~~QLqaI~AGaaFRaia~riG~~El~~vrRQr 569 (888) T TIGR02768 511 GMVGSRQMARVLKEA-EEAGA-KVVLVGDPEQLQAIEAGAAFRAIAERIGYAELETVRRQR 569 (888) T ss_pred CCHHHHHHHHHHHHH-HHCCC-CEEEECCHHHHCHHHCCCCCHHHHHHCCEEEECCCCCCH T ss_conf 441467788999999-87276-059848857824424475001234101403254641002 Done!