Query 537021.9.peg.888_1 Match_columns 217 No_of_seqs 126 out of 2919 Neff 6.8 Searched_HMMs 39220 Date Wed May 25 08:10:05 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_888.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00091 miaA tRNA delta(2)-is 100.0 0 0 458.7 21.4 213 2-216 82-295 (304) 2 pfam01715 IPPT IPP transferase 100.0 0 0 458.6 21.4 208 2-211 45-253 (253) 3 TIGR00174 miaA tRNA delta(2)-i 100.0 0 0 445.5 18.0 210 2-213 77-307 (307) 4 COG0324 MiaA tRNA delta(2)-iso 100.0 0 0 427.6 21.0 212 2-214 81-293 (308) 5 KOG1384 consensus 99.9 3.4E-27 8.6E-32 199.2 11.6 149 2-207 85-248 (348) 6 pfam01745 IPT Isopentenyl tran 93.1 0.86 2.2E-05 26.3 9.6 110 91-202 96-225 (232) 7 TIGR00381 cdhD CO dehydrogenas 87.7 0.38 9.6E-06 28.7 2.2 59 2-84 180-239 (401) 8 KOG1221 consensus 79.4 4.3 0.00011 21.8 4.7 80 6-89 6-111 (467) 9 TIGR02075 pyrH_bact uridylate 78.5 1.7 4.2E-05 24.5 2.4 22 3-24 32-54 (236) 10 TIGR02386 rpoC_TIGR DNA-direct 72.9 6.3 0.00016 20.7 4.1 126 45-176 228-384 (1552) 11 COG2069 CdhD CO dehydrogenase/ 72.3 3.1 7.9E-05 22.7 2.5 35 2-48 187-222 (403) 12 TIGR02440 FadJ fatty oxidation 68.1 4.6 0.00012 21.6 2.6 84 108-194 624-714 (732) 13 PRK12443 uridylate kinase; Rev 66.5 5 0.00013 21.4 2.5 143 3-169 34-191 (247) 14 pfam09862 DUF2089 Protein of u 64.8 9.4 0.00024 19.5 3.7 49 137-185 48-113 (113) 15 PRK08309 short chain dehydroge 62.0 3.3 8.5E-05 22.5 0.9 15 15-29 3-17 (182) 16 COG3877 Uncharacterized protei 61.1 10 0.00026 19.3 3.3 39 147-185 82-121 (122) 17 TIGR03333 salvage_mtnX 2-hydro 60.4 10 0.00026 19.3 3.2 27 6-32 79-107 (214) 18 PRK09552 mtnX 2-hydroxy-3-keto 60.2 11 0.00027 19.2 3.2 27 6-32 82-110 (219) 19 COG0528 PyrH Uridylate kinase 59.0 8.3 0.00021 19.9 2.5 19 153-171 175-193 (238) 20 COG2100 Predicted Fe-S oxidore 53.8 18 0.00046 17.7 3.6 119 42-172 202-322 (414) 21 cd04254 AAK_UMPK-PyrH-Ec UMP k 52.8 12 0.0003 18.9 2.5 23 2-24 29-52 (231) 22 PRK10542 glutathionine S-trans 49.6 23 0.00058 17.0 7.0 19 6-24 44-64 (201) 23 TIGR01036 pyrD_sub2 dihydrooro 49.4 12 0.00032 18.8 2.1 13 177-189 201-213 (370) 24 pfam06414 Zeta_toxin Zeta toxi 49.0 23 0.00059 17.0 5.5 104 59-172 56-169 (191) 25 cd02021 GntK Gluconate kinase 48.6 13 0.00034 18.6 2.2 39 110-148 97-135 (150) 26 pfam02679 ComA (2R)-phospho-3- 47.0 25 0.00064 16.8 6.9 24 2-25 57-80 (245) 27 pfam00491 Arginase Arginase fa 45.7 16 0.0004 18.1 2.2 27 6-32 73-99 (274) 28 pfam09823 DUF2357 Domain of un 45.4 25 0.00063 16.8 3.2 39 130-170 206-245 (246) 29 KOG3347 consensus 44.8 27 0.00069 16.6 3.4 62 65-132 51-114 (176) 30 COG3892 Uncharacterized protei 44.6 27 0.00069 16.5 10.9 137 36-193 142-297 (310) 31 PRK06153 hypothetical protein; 44.6 11 0.00027 19.2 1.2 23 7-29 172-195 (393) 32 TIGR02788 VirB11 P-type DNA tr 44.2 26 0.00066 16.7 3.1 24 9-32 155-182 (328) 33 KOG0150 consensus 43.8 25 0.00065 16.7 3.0 21 123-143 35-55 (336) 34 pfam01113 DapB_N Dihydrodipico 43.6 17 0.00044 17.8 2.1 21 73-93 70-90 (122) 35 cd04742 NPD_FabD 2-Nitropropan 43.6 28 0.00071 16.4 4.9 15 13-27 289-303 (418) 36 PRK00131 aroK shikimate kinase 42.6 24 0.00061 16.9 2.7 75 111-194 97-175 (175) 37 COG1735 Php Predicted metal-de 41.3 16 0.00041 18.0 1.7 31 156-187 271-306 (316) 38 COG4359 Uncharacterized conser 40.2 32 0.0008 16.1 3.2 27 7-33 83-111 (220) 39 TIGR02782 TrbB_P P-type conjug 39.3 16 0.00041 18.0 1.5 24 10-34 138-165 (315) 40 TIGR01694 MTAP methylthioadeno 38.8 11 0.00028 19.1 0.5 12 17-28 4-16 (263) 41 TIGR02667 moaB_proteo molybden 38.2 12 0.00031 18.8 0.7 29 56-84 25-54 (163) 42 TIGR02078 AspKin_pair Pyrococc 37.8 21 0.00054 17.2 1.9 22 127-148 152-173 (341) 43 PRK09825 idnK D-gluconate kina 37.4 26 0.00067 16.6 2.3 73 111-193 99-171 (176) 44 PRK13776 formimidoylglutamase; 37.0 26 0.00066 16.7 2.2 29 4-32 104-132 (311) 45 PRK13773 formimidoylglutamase; 36.7 26 0.00068 16.6 2.2 18 6-23 112-129 (320) 46 PRK13775 formimidoylglutamase; 36.6 31 0.00079 16.2 2.5 19 7-25 118-136 (328) 47 PRK00048 dihydrodipicolinate r 36.4 25 0.00063 16.8 2.0 24 4-27 84-107 (265) 48 PRK04195 replication factor C 36.1 36 0.00093 15.7 7.8 88 4-96 122-215 (403) 49 COG3311 AlpA Predicted transcr 35.6 37 0.00095 15.6 4.3 60 73-134 5-67 (70) 50 PRK11545 gntK gluconate kinase 35.0 30 0.00076 16.3 2.3 38 111-148 104-141 (177) 51 pfam01725 Ham1p_like Ham1 fami 32.9 41 0.001 15.4 4.7 26 4-29 49-74 (181) 52 TIGR01489 DKMTPPase-SF 2,3-dik 32.7 41 0.0011 15.3 3.3 30 5-34 81-114 (207) 53 pfam09074 Mer2 Mer2. Mer2 (Rec 32.7 41 0.0011 15.3 5.7 61 131-202 93-154 (193) 54 pfam07271 Cytadhesin_P30 Cytad 32.4 42 0.0011 15.3 3.9 50 13-65 69-119 (279) 55 TIGR01977 am_tr_V_EF2568 cyste 32.2 22 0.00056 17.1 1.2 115 13-139 226-342 (384) 56 TIGR00972 3a0107s01c2 phosphat 32.2 24 0.0006 16.9 1.3 93 101-194 89-208 (248) 57 TIGR00963 secA preprotein tran 31.1 35 0.00088 15.8 2.1 17 4-21 427-443 (904) 58 PRK13772 formimidoylglutamase; 30.9 37 0.00095 15.6 2.2 23 6-28 108-130 (313) 59 pfam11469 DUF3204 Protein of u 30.8 42 0.0011 15.3 2.4 21 167-187 67-87 (120) 60 PRK01722 formimidoylglutamase; 30.6 38 0.00098 15.5 2.2 27 6-32 111-137 (320) 61 TIGR01929 menB naphthoate synt 30.4 18 0.00047 17.6 0.6 87 62-148 91-205 (278) 62 TIGR01481 ccpA catabolite cont 29.6 40 0.001 15.4 2.2 31 5-42 131-161 (332) 63 PRK09875 putative hydrolase; P 27.2 25 0.00064 16.8 0.8 25 73-98 133-158 (292) 64 PRK02190 agmatinase; Provision 27.1 40 0.001 15.4 1.8 16 7-22 111-126 (305) 65 PRK09195 gatY tagatose-bisphos 27.0 52 0.0013 14.7 2.9 76 127-202 196-281 (284) 66 PRK13774 formimidoylglutamase; 26.8 49 0.0013 14.8 2.2 27 5-31 113-139 (311) 67 PRK11230 glycolate oxidase sub 26.6 52 0.0013 14.7 2.4 88 2-95 68-192 (499) 68 TIGR02133 RPI_actino ribose 5- 26.4 25 0.00065 16.7 0.7 16 13-28 66-81 (154) 69 PRK05699 fliP flagellar biosyn 26.3 53 0.0014 14.6 3.4 29 167-195 113-141 (245) 70 KOG2553 consensus 26.0 49 0.0012 14.9 2.1 40 9-55 34-75 (416) 71 COG1700 Uncharacterized conser 25.7 54 0.0014 14.6 2.8 42 128-171 257-299 (503) 72 TIGR00110 ilvD dihydroxy-acid 24.9 29 0.00074 16.3 0.8 20 10-29 118-139 (601) 73 pfam10007 DUF2250 Uncharacteri 24.6 35 0.00088 15.8 1.1 21 127-147 38-58 (93) 74 TIGR02076 pyrH_arch uridylate 24.2 58 0.0015 14.4 2.4 18 155-172 158-175 (232) 75 pfam05583 consensus 24.1 38 0.00098 15.6 1.2 29 139-167 4-32 (76) 76 TIGR02911 sulfite_red_B sulfit 24.0 36 0.00093 15.7 1.1 117 6-148 94-222 (261) 77 KOG1436 consensus 23.7 37 0.00095 15.6 1.1 16 39-54 56-71 (398) 78 TIGR01358 DAHP_synth_II 3-deox 23.5 44 0.0011 15.2 1.5 63 61-133 133-204 (450) 79 PRK08213 gluconate 5-dehydroge 23.4 48 0.0012 14.9 1.6 26 4-29 3-29 (259) 80 PRK08931 5'-methylthioadenosin 23.1 27 0.00068 16.6 0.3 16 174-189 219-234 (289) 81 PRK00440 rfc replication facto 22.7 62 0.0016 14.2 5.3 59 70-130 110-171 (318) 82 cd05637 SIS_PGI_PMI_2 The memb 22.6 62 0.0016 14.2 2.3 23 3-25 3-25 (132) 83 PRK07823 5'-methylthioadenosin 22.2 31 0.0008 16.1 0.5 25 175-199 215-242 (264) 84 COG0731 Fe-S oxidoreductases [ 22.1 64 0.0016 14.1 3.9 67 2-91 97-170 (296) 85 pfam04665 Pox_A32 Poxvirus A32 21.9 48 0.0012 14.9 1.4 16 16-31 17-36 (241) 86 KOG0331 consensus 21.9 44 0.0011 15.2 1.2 64 77-155 208-271 (519) 87 pfam05202 Flp_C Recombinase Fl 21.7 39 0.001 15.5 0.9 29 58-86 55-83 (244) 88 PRK07718 fliL flagellar basal 21.7 65 0.0017 14.1 2.5 64 62-144 71-134 (142) 89 pfam11451 DUF3202 Protein of u 21.1 61 0.0015 14.2 1.8 24 1-24 3-26 (67) 90 TIGR03589 PseB UDP-N-acetylglu 21.0 51 0.0013 14.7 1.4 19 11-29 3-21 (324) 91 PRK08185 hypothetical protein; 21.0 67 0.0017 14.0 2.9 79 126-204 191-279 (283) 92 PRK09401 reverse gyrase; Revie 20.9 50 0.0013 14.8 1.3 32 4-37 366-397 (1176) 93 pfam02126 PTE Phosphotriestera 20.8 42 0.0011 15.3 0.9 27 71-98 134-161 (308) 94 KOG2373 consensus 20.0 43 0.0011 15.2 0.8 21 2-22 263-283 (514) No 1 >PRK00091 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed Probab=100.00 E-value=0 Score=458.66 Aligned_cols=213 Identities=38% Similarity=0.618 Sum_probs=201.8 Q ss_pred HHHHH-HHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHH Q ss_conf 46899-99963994089789327999996172448899989999999999962235556655311623202357015699 Q 537021.9.peg.8 2 LSKKL-QRYKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQR 80 (217) Q Consensus 2 ~~k~~-e~~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 80 (217) +.+.+ +++++|++||||||||||++||++|+++.|++++++|++++...+..|++.||++|+++||++|.+||+||++| T Consensus 82 a~~~i~~i~~~~kiPIiVGGTglYl~aLl~g~~~~p~~~~~ir~~l~~~~~~~g~~~l~~~L~~~Dp~~a~~I~~nd~~R 161 (304) T PRK00091 82 ALAAIEDITARGKLPILVGGTGLYFKALLEGLSFLPPADPEVRAELEALAAEEGWEALHAELAEIDPVAAARIHPNDPQR 161 (304) T ss_pred HHHHHHHHHHCCCCCEEECCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHCHHHHHHCCCCCHHH T ss_conf 99999999976998789808389999997187778888989999999999975899999999875999996159861267 Q ss_pred HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 99999998534887467754154567770010000107867899999997776542113456777751138631013442 Q 537021.9.peg.8 81 IARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLDLPIMK 160 (217) Q Consensus 81 i~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~ 160 (217) |+|||||+..||+++|+|+...+.+ +.+++.+|+|.+||+.|++||+.||+.|+++||++||+.|++.+++.++++++ T Consensus 162 l~RAlEv~~~tGk~~s~~~~~~~~~--~~~~~~~i~L~~~r~~L~~rI~~R~~~Mi~~GlieEv~~L~~~~~~~~~~~~~ 239 (304) T PRK00091 162 IIRALEVYELTGKPLSELQKTPGKA--LPYDVLIIGLDPDREVLYERINQRFDQMLEEGLLEEVRALLAGGLHPDLPAMR 239 (304) T ss_pred HHHHHHHHHHHCCCHHHHHCCCCCC--CCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCHHHHH T ss_conf 9999999999799789985037889--87662899968999999999999999999868999999999769973046773 Q ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHCC Q ss_conf 06856778986789998999999999999888887688628887531169830104 Q 537021.9.peg.8 161 AIGVRDIIALLKGEINYDETLQRGIIATNKYAKRQKTWLCHQFQADWIRISSIDDL 216 (217) Q Consensus 161 ~IGYke~~~yL~g~~~~~e~~e~~~~~Tr~yAKRQ~TWfr~~~~~~w~~~~~~d~i 216 (217) ||||||+.+||+|+++++++++.++++||||||||+||||++++++|+++++.+++ T Consensus 240 aIGykE~~~yL~g~~s~~e~~~~i~~~Tr~yAKRQ~TWfr~~~~i~w~~~~~~~~~ 295 (304) T PRK00091 240 AIGYREVWEYLDGEISLEEAVEKIKQATRQYAKRQLTWFRRQPDIHWLDAEALDKV 295 (304) T ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCHHHH T ss_conf 43899999998699999999999999999999999997289999867799776899 No 2 >pfam01715 IPPT IPP transferase. This is a family of IPP transferases EC:2.5.1.8 also known as tRNA delta(2)-isopentenylpyrophosphate transferase. These enzymes modify both cytoplasmic and mitochondrial tRNAs at A(37) to give isopentenyl A(37). Probab=100.00 E-value=0 Score=458.58 Aligned_cols=208 Identities=40% Similarity=0.666 Sum_probs=197.3 Q ss_pred HHHHH-HHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHH Q ss_conf 46899-99963994089789327999996172448899989999999999962235556655311623202357015699 Q 537021.9.peg.8 2 LSKKL-QRYKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQR 80 (217) Q Consensus 2 ~~k~~-e~~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 80 (217) +.+.| +++++|++||||||||||++||++|++++|++|+++|++++..++..|.+.||++|+++||++|++|||||++| T Consensus 45 a~~~i~~i~~~~k~PIiVGGTglYl~all~g~~~~p~~~~~~r~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 124 (253) T pfam01715 45 ALEAIAEIRARGKIPLLVGGTGLYFKALLDGLSDTPSADPKVRAKLEEQLEELGNDYLHAELASVDPEAAAKIHPNDGRR 124 (253) T ss_pred HHHHHHHHHHCCCCCEEEECCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCCCHHH T ss_conf 99999999966997289838089999997698778887777999999999985199999999874969785479643667 Q ss_pred HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 99999998534887467754154567770010000107867899999997776542113456777751138631013442 Q 537021.9.peg.8 81 IARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLDLPIMK 160 (217) Q Consensus 81 i~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~ 160 (217) |+|||||+..||+++|+++..++.+ +.+++++|+|.+||+.||+||+.||+.|+++||++||+.|++.+++.++++++ T Consensus 125 i~RAlEi~~~tGk~~s~~~~~~~~~--~~~~~~~i~l~~~r~~L~~rI~~Rv~~Ml~~GlieEv~~L~~~~~~~~~~~~~ 202 (253) T pfam01715 125 IVRALEVFYATGKPISEFQKEQKNA--LPYDIVQIGLARDREVLHERIARRVDDMLESGLVEEVKALYAQGLNEDLPSIR 202 (253) T ss_pred HHHHHHHHHHHCCCHHHHHHHCCCC--CCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCHHHHH T ss_conf 8999999999798889987433799--99877378837988999999999999998768999999999758996448773 Q ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 068567789867899989999999999998888876886288875311698 Q 537021.9.peg.8 161 AIGVRDIIALLKGEINYDETLQRGIIATNKYAKRQKTWLCHQFQADWIRIS 211 (217) Q Consensus 161 ~IGYke~~~yL~g~~~~~e~~e~~~~~Tr~yAKRQ~TWfr~~~~~~w~~~~ 211 (217) ||||||+.+||+|++|+++|++.++++||||||||+||||++++++|+|+. T Consensus 203 aIGYkE~~~yl~g~~s~~e~~e~i~~~TrqyAKRQ~TWfr~~~~i~W~d~~ 253 (253) T pfam01715 203 AIGYKEFLLYLDGECTLEEAIERIIKATRQYAKRQMTWFRKDSQVHWLDIG 253 (253) T ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 125999999985999999999999999999998889971899998045186 No 3 >TIGR00174 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; InterPro: IPR002627 tRNA isopentenyltransferases 2.5.1.8 from EC also known as tRNA delta(2)-isopentenylpyrophosphate transferases or IPP transferases. These enzymes modify both cytoplasmic and mitochondrial tRNAs at A(37) to give isopentenyl A(37) .; GO: 0004811 tRNA isopentenyltransferase activity, 0005524 ATP binding, 0008033 tRNA processing. Probab=100.00 E-value=0 Score=445.49 Aligned_cols=210 Identities=33% Similarity=0.547 Sum_probs=196.8 Q ss_pred HHHHH-HHHHCCCCEEEECCCCHHHHHHHCCCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHH Q ss_conf 46899-99963994089789327999996172448899-98999999999996223555665531162320235701569 Q 537021.9.peg.8 2 LSKKL-QRYKKRFLPIIVGGTGLYFRALTGQLSIMPEI-PTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQ 79 (217) Q Consensus 2 ~~k~~-e~~~r~kiPIiVGGTglY~~all~g~~~~p~~-~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~ 79 (217) |.+.+ ++.+||+|||||||||||++|+++|++++|.+ ++.+|+.++...+..|++.+|+.|+++||++|++|||||.+ T Consensus 77 ~~~~~~~i~~~Gkipl~VGGT~lY~k~l~~gl~~~~~~~~~~~r~~~~~~~~~~g~~~ly~~L~~~DP~~a~~ihPND~~ 156 (307) T TIGR00174 77 ALNAIADITARGKIPLLVGGTGLYLKALLEGLSPTPSAEDKLIRESLEILAERQGKSFLYKELKKVDPVAAAKIHPNDTR 156 (307) T ss_pred HHHHHHHHHHCCCCEEEECCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHCCCCHH T ss_conf 99999999856983488685788999997147777555650579999999985491578887775076989721776147 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEEC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----- Q ss_conf 999999998534887467754154567770010000107-86789999999777654211345677775113863----- Q 537021.9.peg.8 80 RIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIIL-PERSALKERIRRRFTQMLESGAIDEIRSLMKMNLS----- 153 (217) Q Consensus 80 Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~-~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~----- 153 (217) |+.||||||+.||+++|++++.++.. ..++++.|+|. .+|+.|++||+.|++.||+.||++||+.|+.+|.. T Consensus 157 R~~RaLEvfY~TG~~~Se~~~~~~~~--L~~~~v~~~l~~~~~e~L~~RI~~R~~~Ml~sGl~~Evk~Ly~~~~~~kftP 234 (307) T TIGR00174 157 RVKRALEVFYATGKPISELFKEQKKE--LFYNIVQIGLAPLEREALHERIEQRVHEMLESGLLAEVKGLYAKYDKNKFTP 234 (307) T ss_pred HHHHHHHHHHHHCCCHHHHHHCCCCC--CHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 89899998875078656664036665--4146677763005864567789999999998638999999985175556575 Q ss_pred --CHHHHHHCCCHHHHHHHHCC-----CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC------CCCCCCCCH Q ss_conf --10134420685677898678-----9998999999999999888887688628887------531169830 Q 537021.9.peg.8 154 --LDLPIMKAIGVRDIIALLKG-----EINYDETLQRGIIATNKYAKRQKTWLCHQFQ------ADWIRISSI 213 (217) Q Consensus 154 --~~~~~~~~IGYke~~~yL~g-----~~~~~e~~e~~~~~Tr~yAKRQ~TWfr~~~~------~~w~~~~~~ 213 (217) .++|++++|||||+..||+| .++++|+++.++.+||||||||+||||+++. ++|+|+.++ T Consensus 235 eq~d~p~~~~iGy~E~~~yl~g~tddn~~~L~~~~e~~~~~tr~yAKRQ~Tw~r~~~~PdikGd~~wld~~~~ 307 (307) T TIGR00174 235 EQEDLPSIKAIGYKEFLLYLEGKTDDNRVTLEDAIERIKKNTRQYAKRQLTWFRKKSAPDIKGDVLWLDSEDP 307 (307) T ss_pred CHHCCCCCEEHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCCC T ss_conf 0004763000128888887368876421149999999988799988888888602237873430343388899 No 4 >COG0324 MiaA tRNA delta(2)-isopentenylpyrophosphate transferase [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=427.57 Aligned_cols=212 Identities=40% Similarity=0.671 Sum_probs=199.1 Q ss_pred HHHHH-HHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHH Q ss_conf 46899-99963994089789327999996172448899989999999999962235556655311623202357015699 Q 537021.9.peg.8 2 LSKKL-QRYKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQR 80 (217) Q Consensus 2 ~~k~~-e~~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 80 (217) +.+.| ++.+|||+||||||||||++||++|++..|..++.+|.+++.+++..|+++||+.|+++||++|++|||||++| T Consensus 81 a~~~i~~i~~rgk~pIlVGGTglY~~aL~~g~~~~p~~~~~~r~~~~~~~~~~g~~~L~~~L~~~Dp~~a~~i~pnD~~R 160 (308) T COG0324 81 ALAAIDDILARGKLPILVGGTGLYLKALLEGLSLLPEADPEVRRRLEAELAELGNDALHAELKKIDPEAAAKIHPNDPQR 160 (308) T ss_pred HHHHHHHHHHCCCCCEEECCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCHHHHHHCCCCCHHH T ss_conf 99999999968998799766799999997387678999989999999999861879999999864999998658886247 Q ss_pred HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 99999998534887467754154567770010000107867899999997776542113456777751138631013442 Q 537021.9.peg.8 81 IARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLDLPIMK 160 (217) Q Consensus 81 i~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~ 160 (217) ++||||||+.||+|+|+++... .+....+++..|+|.++|+.|++||+.||+.|+++||++||+.|+++|+..++++++ T Consensus 161 i~RALEv~~~tGk~~s~~~~~~-~~~~~~~~~~~~~l~~~r~~L~~rI~~R~d~Ml~~Gli~EV~~L~~~g~~~~~~~~~ 239 (308) T COG0324 161 IIRALEVYYLTGKPISELQKRS-RPILEPYDILIIALAADREVLYERINRRVDAMLEQGLIEEVKALYARGLHLDLPAMQ 239 (308) T ss_pred HHHHHHHHHHHCCCHHHHHHCC-CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHH T ss_conf 8999999999897978873113-677777663899985888889999999999999864999999998556786543777 Q ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHH Q ss_conf 068567789867899989999999999998888876886288875311698301 Q 537021.9.peg.8 161 AIGVRDIIALLKGEINYDETLQRGIIATNKYAKRQKTWLCHQFQADWIRISSID 214 (217) Q Consensus 161 ~IGYke~~~yL~g~~~~~e~~e~~~~~Tr~yAKRQ~TWfr~~~~~~w~~~~~~d 214 (217) +|||+|+.+||+|+++++|+++.++.+||||||||+||||+++.++|+++...+ T Consensus 240 ~iGy~e~~~yl~g~~~~~ea~~~~~~~TRqyAKRQ~TWfr~~~~~~w~~~~~~~ 293 (308) T COG0324 240 AIGYKEILAYLDGGISLEEAIERIKTATRQYAKRQLTWFRNQLGVHWLDSESPD 293 (308) T ss_pred HCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCC T ss_conf 647989999982877799999999999999999999997036645210168863 No 5 >KOG1384 consensus Probab=99.95 E-value=3.4e-27 Score=199.18 Aligned_cols=149 Identities=27% Similarity=0.365 Sum_probs=121.8 Q ss_pred HHHHH-HHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHH Q ss_conf 46899-99963994089789327999996172448899989999999999962235556655311623202357015699 Q 537021.9.peg.8 2 LSKKL-QRYKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQR 80 (217) Q Consensus 2 ~~k~~-e~~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 80 (217) +...| +|++|||+|||||||++|++||+++-.+ | +.||.++. T Consensus 85 a~~aie~I~~rgk~PIv~GGs~~yi~al~~~~~d-~---------------------------~~dp~~~~--------- 127 (348) T KOG1384 85 ASRAIEEIHSRGKLPIVVGGSNSYLQALLSKRFD-P---------------------------KIDPFSSN--------- 127 (348) T ss_pred HHHHHHHHHHCCCCCEEECCCHHHHHHHHHCCCC-C---------------------------CCCCCCCC--------- T ss_conf 9999999985799777967840668999606877-4---------------------------55854356--------- Q ss_pred HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CHHHHH Q ss_conf 9999999853488746775415456777001000010786789999999777654211345677775113863-101344 Q 537021.9.peg.8 81 IARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLS-LDLPIM 159 (217) Q Consensus 81 i~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~-~~~~~~ 159 (217) ||.++|++ .|+|+++|++.+...|++++++|||.|+++||+||++.++..... ...+.- T Consensus 128 ----------~g~~pS~l----------ryd~c~lWlda~~~VL~~~l~~RVD~Ml~~Gl~eE~~~f~~~~~s~~~~~i~ 187 (348) T KOG1384 128 ----------TGSIPSEL----------RYDCCFLWLDADQAVLFERLDKRVDDMLESGLLEELRDFYDPYNSSYRSGIR 187 (348) T ss_pred ----------CCCCCCCC----------CCCEEEEEEECCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCH T ss_conf ----------78887556----------6650799972666777888988899999733099999985554168654211 Q ss_pred HCCCHHHHHHHHCCCC------------CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-C Q ss_conf 2068567789867899------------9899999999999988888768862888753-1 Q 537021.9.peg.8 160 KAIGVRDIIALLKGEI------------NYDETLQRGIIATNKYAKRQKTWLCHQFQAD-W 207 (217) Q Consensus 160 ~~IGYke~~~yL~g~~------------~~~e~~e~~~~~Tr~yAKRQ~TWfr~~~~~~-w 207 (217) ++||..|+..|+.-.. -+++|++.++.+|++|||||.+|..+.+... | T Consensus 188 ~~iGv~e~d~f~~~~~~~~~k~d~~~~~~l~~aie~iK~nT~~lakrQ~~~I~~l~~~~~~ 248 (348) T KOG1384 188 KAIGVPEFDGFKEFYPWLTDKWDLARKELLEKAIEAIKENTRRLAKRQKRKIEKLFLPRKW 248 (348) T ss_pred HCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 2058278754300152000011277788899999999999999999999999876557876 No 6 >pfam01745 IPT Isopentenyl transferase. Isopentenyl transferase / dimethylallyl transferase synthesizes isopentenyladensosine 5'-monophosphate, a cytokinin that induces shoot formation on host plants infected with the Ti plasmid. Probab=93.15 E-value=0.86 Score=26.33 Aligned_cols=110 Identities=23% Similarity=0.289 Sum_probs=72.1 Q ss_pred CCCCHHHHHHHCCCC-C-CCCHHHEEEEECCCHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHCCCCCHHHHHHCC- Q ss_conf 488746775415456-7-77001000010786789999999777654211-----345677775113863101344206- Q 537021.9.peg.8 91 SGQSIIEFWKQAPNP-F-IPLESAHKIIILPERSALKERIRRRFTQMLES-----GAIDEIRSLMKMNLSLDLPIMKAI- 162 (217) Q Consensus 91 tGk~~s~~~~~~~~~-~-~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~-----GlieEv~~L~~~~~~~~~~~~~~I- 162 (217) -|-++|=+..-...+ | .++.=.....-.+|++..-.+-.+|+.+||.- ++++|...+... ....+.+..| T Consensus 96 EGGSISLl~~ma~~~~w~~~f~w~v~r~~l~d~~~f~~ra~~Rv~~ML~p~~~~~Sll~EL~~lW~~--p~~r~~ledId 173 (232) T pfam01745 96 EGGSISLLKRMAQSPYWNAGFPWHVKRMRLPDRDVFLAQAKARVRQMLRPDSGGPSLLDELAELWVL--PAARPILEDID 173 (232) T ss_pred ECCHHHHHHHHHHCHHHCCCCEEEEEEEECCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCC--CCCCCHHHHCC T ss_conf 2750999999975734148964899997479878999999999998628898997399999998708--22252143122 Q ss_pred CHHHHHHHHC-CCCC-----------HHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 8567789867-8999-----------89999999999998888876886288 Q 537021.9.peg.8 163 GVRDIIALLK-GEIN-----------YDETLQRGIIATNKYAKRQKTWLCHQ 202 (217) Q Consensus 163 GYke~~~yL~-g~~~-----------~~e~~e~~~~~Tr~yAKRQ~TWfr~~ 202 (217) ||+.++.|=. ..++ ++++++.+...=..+|..|-.=|-.. T Consensus 174 GYr~~i~~a~~~~l~~d~L~~~d~~~~~~Li~~ia~eY~~Ha~~QE~~fp~~ 225 (232) T pfam01745 174 GYRCAIDFARKHQLTIDQLTNIDADQLEELIEGIAQEYLEHALWQEQEFPLT 225 (232) T ss_pred CHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 3999999999849998998640679999999999999999999877755799 No 7 >TIGR00381 cdhD CO dehydrogenase/acetyl-CoA synthase, delta subunit; InterPro: IPR004486 This is the small subunit of a heterodimer which catalyzes the reaction CO + H2O + Acceptor = CO2 + Reduced acceptor and is involved in the synthesis of acetyl-CoA from CO2 and H2.; GO: 0006730 one-carbon compound metabolic process. Probab=87.70 E-value=0.38 Score=28.70 Aligned_cols=59 Identities=32% Similarity=0.494 Sum_probs=36.8 Q ss_pred HHHHHHH-HHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHH Q ss_conf 4689999-963994089789327999996172448899989999999999962235556655311623202357015699 Q 537021.9.peg.8 2 LSKKLQR-YKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQR 80 (217) Q Consensus 2 ~~k~~e~-~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~R 80 (217) +.|.+|. ++-=++|||||||| -|+-||.+-++..+-+| |---| |+. +.++ =|.+| T Consensus 180 AaK~~EdvLQAVdvP~viGGSG------------npeKDPlVLEkaAEvAE--GeR~l---LAS------AnLd-lDy~k 235 (401) T TIGR00381 180 AAKVLEDVLQAVDVPLVIGGSG------------NPEKDPLVLEKAAEVAE--GERCL---LAS------ANLD-LDYKK 235 (401) T ss_pred HHHHHHHHHHHCCCCEEECCCC------------CCCCCHHHHHHHHHHCC--CCHHH---HHH------CCCC-CCHHH T ss_conf 7889998763406775774788------------88667578998843113--62164---541------3513-45789 Q ss_pred HHHH Q ss_conf 9999 Q 537021.9.peg.8 81 IARA 84 (217) Q Consensus 81 i~Ra 84 (217) |.+| T Consensus 236 ia~A 239 (401) T TIGR00381 236 IANA 239 (401) T ss_pred HHHH T ss_conf 9999 No 8 >KOG1221 consensus Probab=79.35 E-value=4.3 Score=21.80 Aligned_cols=80 Identities=23% Similarity=0.368 Sum_probs=48.8 Q ss_pred HHHHHCCCCEEEECCCCHHHHHHHCCCCC-CCCC-----------CHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCC Q ss_conf 99996399408978932799999617244-8899-----------98999999999996223555665531162320235 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGLYFRALTGQLSI-MPEI-----------PTAIREKIREKLKQYGSHILHDELSSLDSIVARQI 73 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTglY~~all~g~~~-~p~~-----------~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i 73 (217) +..+=+||.-+|+|||||--+.+++-+-. .|++ ..+..++++.+. .+.+++.|++.-|+.-.++ T Consensus 6 i~~f~~~k~i~vTG~tGFlgKVliEklLr~~p~v~~IYlLiR~k~g~~~~~Rl~~~~----~~~lF~~l~~~~p~~l~Kv 81 (467) T KOG1221 6 IVQFYKNKTIFVTGATGFLGKVLIEKLLRTTPDVKRIYLLIRAKKGKAAQERLRTEL----KDPLFEVLKEKKPEALEKV 81 (467) T ss_pred HHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHH----HHHHHHHHHHHCCCCEECC T ss_conf 789957985999727634578999999850767656999983478987789999987----4469999986395210200 Q ss_pred CC--------------HHHHHHHHHHHHHH Q ss_conf 70--------------15699999999985 Q 537021.9.peg.8 74 HP--------------SDGQRIARALEIKL 89 (217) Q Consensus 74 ~~--------------nd~~Ri~RaLEi~~ 89 (217) -| .|.+.+..-.+|.. T Consensus 82 ~pi~GDi~~~~LGis~~D~~~l~~eV~ivi 111 (467) T KOG1221 82 VPIAGDISEPDLGISESDLRTLADEVNIVI 111 (467) T ss_pred EECCCCCCCCCCCCCHHHHHHHHHCCCEEE T ss_conf 001256668666888277888874577899 No 9 >TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the bacterial proteins.. Probab=78.51 E-value=1.7 Score=24.49 Aligned_cols=22 Identities=23% Similarity=0.437 Sum_probs=11.3 Q ss_pred HHHHHHHHCC-CCEEEECCCCHH Q ss_conf 6899999639-940897893279 Q 537021.9.peg.8 3 SKKLQRYKKR-FLPIIVGGTGLY 24 (217) Q Consensus 3 ~k~~e~~~r~-kiPIiVGGTglY 24 (217) ++.-++.+.| .|-|||||=-.+ T Consensus 32 ~eik~~~~~G~evgIViGGGNIf 54 (236) T TIGR02075 32 NEIKEVVKKGIEVGIVIGGGNIF 54 (236) T ss_pred HHHHHHHHCCCEEEEEECCHHHH T ss_conf 99999986794899997782788 No 10 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=72.87 E-value=6.3 Score=20.69 Aligned_cols=126 Identities=20% Similarity=0.230 Sum_probs=69.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHH-CCCCCHHHHHHHHHHHHHH---HCCCCHHHHHHHCCCCCCCCHHHEEEEEC-- Q ss_conf 99999996223555665531162320-2357015699999999985---34887467754154567770010000107-- Q 537021.9.peg.8 45 KIREKLKQYGSHILHDELSSLDSIVA-RQIHPSDGQRIARALEIKL---VSGQSIIEFWKQAPNPFIPLESAHKIIIL-- 118 (217) Q Consensus 45 ~~~~~~~~~g~~~l~~~L~~~DP~~a-~~i~~nd~~Ri~RaLEi~~---~tGk~~s~~~~~~~~~~~~~~~~~~i~l~-- 118 (217) .++++++..+.+...++|++-=-++- .....+++.|++|-|||.. .||-.|. |.=-..-|-.|+.--..+-|+ T Consensus 228 AI~~LL~~iDL~~e~~~l~~~l~~~~kd~~~~~~~kkllKRLe~~e~F~~Sgn~Pe-WMvl~~~PVIPPelRPmVQLDGG 306 (1552) T TIGR02386 228 AIKELLEKIDLDKEIEELKEELRESKKDSKSQQKRKKLLKRLEILEAFKNSGNRPE-WMVLDVIPVIPPELRPMVQLDGG 306 (1552) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC-CEECCCCCCCCCCCCCCCCCCCC T ss_conf 99987513786899999999986214345676688888898788999971378875-30353157588422677647775 Q ss_pred ----CCHHHHHHHH---HHHHHHHHHHH-----------HHHHH-HHHHHCC------CCCHHHHHHCCCHHHHHHHHCC Q ss_conf ----8678999999---97776542113-----------45677-7751138------6310134420685677898678 Q 537021.9.peg.8 119 ----PERSALKERI---RRRFTQMLESG-----------AIDEI-RSLMKMN------LSLDLPIMKAIGVRDIIALLKG 173 (217) Q Consensus 119 ----~~r~~L~~rI---~~R~~~Ml~~G-----------lieEv-~~L~~~~------~~~~~~~~~~IGYke~~~yL~g 173 (217) .|=..||.|+ |.|.++|++-| |++|+ =.|+++| .+.+...++|+ -+.|+| T Consensus 307 RFATSDLNDLYRRVINRNNRLKrlL~l~APeIIV~NEKRMLQEAVDALfDNgRR~kPV~G~~nRpLKSL-----SdmLKG 381 (1552) T TIGR02386 307 RFATSDLNDLYRRVINRNNRLKRLLELKAPEIIVRNEKRMLQEAVDALFDNGRRGKPVVGKNNRPLKSL-----SDMLKG 381 (1552) T ss_pred CHHHHHHHCCCCEEEECCHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCH-----HHHHCC T ss_conf 101223201023023010378898745888688715533567888887447888863107889775432-----444126 Q ss_pred CCC Q ss_conf 999 Q 537021.9.peg.8 174 EIN 176 (217) Q Consensus 174 ~~~ 176 (217) |.- T Consensus 382 KQG 384 (1552) T TIGR02386 382 KQG 384 (1552) T ss_pred CCC T ss_conf 677 No 11 >COG2069 CdhD CO dehydrogenase/acetyl-CoA synthase delta subunit (corrinoid Fe-S protein) [Energy production and conversion] Probab=72.28 E-value=3.1 Score=22.70 Aligned_cols=35 Identities=37% Similarity=0.583 Sum_probs=24.6 Q ss_pred HHHHHHH-HHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHH Q ss_conf 4689999-9639940897893279999961724488999899999999 Q 537021.9.peg.8 2 LSKKLQR-YKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIRE 48 (217) Q Consensus 2 ~~k~~e~-~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~ 48 (217) +.|.+|. ++-=++||++|||| -|+.||++-++..+ T Consensus 187 Aak~lEdvLqAVdvPiiiGGSG------------npeKDpeVlekaAE 222 (403) T COG2069 187 AAKTLEDVLQAVDVPIIIGGSG------------NPEKDPEVLEKAAE 222 (403) T ss_pred HHHHHHHHHHHCCCCEEECCCC------------CCCCCHHHHHHHHH T ss_conf 9999999997548688966899------------97649799999987 No 12 >TIGR02440 FadJ fatty oxidation complex, alpha subunit FadJ; InterPro: IPR012802 Members of this family represent the alpha subunit of the multifunctional enzyme complex of the fatty acid degradation cycle, which plays a minor role in aerobic beta-oxidation of fatty acids. The FadJI complex is necessary for anaerobic growth on short-chain acids with nitrate as an electron acceptor. Activities include: enoyl-CoA hydratase (4.2.1.17 from EC), 3-hydroxyacyl-CoA dehydrogenase (1.1.1.35 from EC), and 3-hydroxybutyryl-CoA epimerase (5.1.2.3 from EC). A representative is the Escherichia coli FadJ protein (also named YfcX) (P77399 from SWISSPROT). This entry excludes FadB of IPR012799 from INTERPRO.; GO: 0003857 3-hydroxyacyl-CoA dehydrogenase activity, 0004300 enoyl-CoA hydratase activity, 0008692 3-hydroxybutyryl-CoA epimerase activity, 0051287 NAD binding, 0006635 fatty acid beta-oxidation, 0016507 fatty acid beta-oxidation multienzyme complex. Probab=68.13 E-value=4.6 Score=21.59 Aligned_cols=84 Identities=13% Similarity=0.190 Sum_probs=57.3 Q ss_pred CCHHHEEEEECCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHCCCCCHHHHHHCCCHHHH----HHHHCCCCCHHHH Q ss_conf 7001000010786789999999777654211---34567777511386310134420685677----8986789998999 Q 537021.9.peg.8 108 PLESAHKIIILPERSALKERIRRRFTQMLES---GAIDEIRSLMKMNLSLDLPIMKAIGVRDI----IALLKGEINYDET 180 (217) Q Consensus 108 ~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~---GlieEv~~L~~~~~~~~~~~~~~IGYke~----~~yL~g~~~~~e~ 180 (217) +.--...+++.|....--..|..||--|+=| =.+|| +++..-=+-+-.+--+||+.=| ..|+|. +--++. T Consensus 624 D~~vY~llg~~P~~~~~~~~va~RC~~~MLNEAvRClDe--~vI~s~RDGDIGAvFGIGFPPFLGGPFrY~D~-LGA~~v 700 (732) T TIGR02440 624 DESVYKLLGVKPEAKLEASAVAERCVMLMLNEAVRCLDE--GVIRSPRDGDIGAVFGIGFPPFLGGPFRYIDT-LGADKV 700 (732) T ss_pred CHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCCCCEEEEECCCCCCCCCCCCHHHHH-CCHHHH T ss_conf 033442314587520004458788899887778875213--43157898860115067888888671014302-026899 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999988888 Q 537021.9.peg.8 181 LQRGIIATNKYAKR 194 (217) Q Consensus 181 ~e~~~~~Tr~yAKR 194 (217) ++.+...+.||--| T Consensus 701 V~~L~~~~~qYG~R 714 (732) T TIGR02440 701 VKILKRLATQYGER 714 (732) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999998733885 No 13 >PRK12443 uridylate kinase; Reviewed Probab=66.52 E-value=5 Score=21.37 Aligned_cols=143 Identities=15% Similarity=0.235 Sum_probs=60.2 Q ss_pred HHHHHHHHCC-CCEEEECCCCHHHHHHH--CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCHHHH------ Q ss_conf 6899999639-94089789327999996--17244889998999999999996223555665531---162320------ Q 537021.9.peg.8 3 SKKLQRYKKR-FLPIIVGGTGLYFRALT--GQLSIMPEIPTAIREKIREKLKQYGSHILHDELSS---LDSIVA------ 70 (217) Q Consensus 3 ~k~~e~~~r~-kiPIiVGGTglY~~all--~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~---~DP~~a------ 70 (217) .+..++.+.| .+.|||||-.++=-... .|++. +..+----+-...+. -.|.+.|.+ ++-... T Consensus 34 ~eI~~~~~~G~evaiVvGGGNi~RG~~~~~~gidr---~~aD~iGMLaTvmNa---l~L~~~l~~~~g~~~rv~sai~~~ 107 (247) T PRK12443 34 NEILSIVDLGIEVSIVIGGGNIFRGHLAEEWGIDR---VEADNIGTLGTIINS---LMLRGVLTSKTNKEVRVMTSIPFN 107 (247) T ss_pred HHHHHHHHCCCEEEEEECCCCCCCCHHHHHCCCCC---CHHHHHHHHHHHHHH---HHHHHHHHHHCCCCEEEEECCCCC T ss_conf 99999997798799997688502322365539873---103688899999999---999999997559946997323367 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q ss_conf 2357015699999999985348874677541545677700100001078678999999977765421134-567777511 Q 537021.9.peg.8 71 RQIHPSDGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGA-IDEIRSLMK 149 (217) Q Consensus 71 ~~i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~Gl-ieEv~~L~~ 149 (217) .-..|-+++|-+|.||= |+-.- +-.-.-.|+ ...|-.---.-++-.++.+|...- |+=| ++ T Consensus 108 ~v~e~y~~~rA~~~Lek----G~VVI-fagGTGnP~----------fTTDtaAaLrA~Ei~Ad~lL~at~~VDGV---Yd 169 (247) T PRK12443 108 AVAEPYIRLRAVHHLDN----GYIVI-FGGGNGQPF----------VTTDYPSVQRAIEMNSDAILVAKQGVDGV---FT 169 (247) T ss_pred CCCCCCCHHHHHHHHHC----CCEEE-EECCCCCCC----------CCCCHHHHHHHHHHCCHHHHCCCCCCCCC---CC T ss_conf 45677778999999736----97899-978878875----------20439999999882861332105665776---46 Q ss_pred CC--CCCHHHHHHCCCHHHHHH Q ss_conf 38--631013442068567789 Q 537021.9.peg.8 150 MN--LSLDLPIMKAIGVRDIIA 169 (217) Q Consensus 150 ~~--~~~~~~~~~~IGYke~~~ 169 (217) .. .+++..-+..|.|.|+++ T Consensus 170 ~DP~k~~dA~k~~~ls~~e~l~ 191 (247) T PRK12443 170 SDPKHNKSAKMYRKLNYNDVVR 191 (247) T ss_pred CCCCCCCCCEEEEECCHHHHHH T ss_conf 8988999754443337999998 No 14 >pfam09862 DUF2089 Protein of unknown function(DUF2089). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=64.79 E-value=9.4 Score=19.54 Aligned_cols=49 Identities=29% Similarity=0.411 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHCCCCCH----HHHHHCCCH------------HHHHHHH-CCCCCHHHHHHHHH Q ss_conf 1134567777511386310----134420685------------6778986-78999899999999 Q 537021.9.peg.8 137 ESGAIDEIRSLMKMNLSLD----LPIMKAIGV------------RDIIALL-KGEINYDETLQRGI 185 (217) Q Consensus 137 ~~GlieEv~~L~~~~~~~~----~~~~~~IGY------------ke~~~yL-~g~~~~~e~~e~~~ 185 (217) ..|=+.||...+.-.|..- ..+.+++|| +++++-| +|+||.+||++.++ T Consensus 48 ~~Gnlke~~~~lgiSYpTvR~rLd~ii~~lGy~~~~~~~~~~~~~eIL~~Le~GEIs~eEA~~~Lk 113 (113) T pfam09862 48 CRGNIKEVEKELGISYPTVRNRLDEIIAALGYEEEPEPEKRVDKKEILDKLEKGEISVEEALRLLK 113 (113) T ss_pred HCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHC T ss_conf 168899999997888188999999999980898777777762299999999849979999999749 No 15 >PRK08309 short chain dehydrogenase; Provisional Probab=62.04 E-value=3.3 Score=22.50 Aligned_cols=15 Identities=33% Similarity=0.722 Sum_probs=11.9 Q ss_pred EEEECCCCHHHHHHH Q ss_conf 089789327999996 Q 537021.9.peg.8 15 PIIVGGTGLYFRALT 29 (217) Q Consensus 15 PIiVGGTglY~~all 29 (217) .+|||||||--.... T Consensus 3 aLVIGGTGML~~vs~ 17 (182) T PRK08309 3 ALVIGGTGMLKRVSL 17 (182) T ss_pred EEEECCCHHHHHHHH T ss_conf 699724175599999 No 16 >COG3877 Uncharacterized protein conserved in bacteria [Function unknown] Probab=61.07 E-value=10 Score=19.27 Aligned_cols=39 Identities=33% Similarity=0.350 Sum_probs=28.4 Q ss_pred HHHCCCCCHHHHHHCCCHHHHHHHH-CCCCCHHHHHHHHH Q ss_conf 5113863101344206856778986-78999899999999 Q 537021.9.peg.8 147 LMKMNLSLDLPIMKAIGVRDIIALL-KGEINYDETLQRGI 185 (217) Q Consensus 147 L~~~~~~~~~~~~~~IGYke~~~yL-~g~~~~~e~~e~~~ 185 (217) |.++||.++.+--+-||=+-+++-| .|+++-|||++.+. T Consensus 82 lramgy~p~~e~~~~i~~~~i~~qle~Gei~peeA~~~L~ 121 (122) T COG3877 82 LRAMGYNPDSENSVNIGKKKIIDQLEKGEISPEEAIKMLN 121 (122) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHC T ss_conf 9980899899870455389999999817879999999851 No 17 >TIGR03333 salvage_mtnX 2-hydroxy-3-keto-5-methylthiopentenyl-1-phosphate phosphatase. Members of this family are the methionine salvage enzyme MnxX, a member of the HAD-superfamily hydrolases, subfamily IB (see TIGR01488). Members are found in Bacillus subtilis and related species, paired with MtnW (TIGR03332). In most species that recycle methionine from methylthioadenosine, the single protein MtnC replaces the MtnW/MtnX pair. In B. subtilis, mtnX was first known as ykrX. Probab=60.41 E-value=10 Score=19.30 Aligned_cols=27 Identities=19% Similarity=0.387 Sum_probs=21.0 Q ss_pred HHHHHCCCCEEEE--CCCCHHHHHHHCCC Q ss_conf 9999639940897--89327999996172 Q 537021.9.peg.8 6 LQRYKKRFLPIIV--GGTGLYFRALTGQL 32 (217) Q Consensus 6 ~e~~~r~kiPIiV--GGTglY~~all~g~ 32 (217) ++--+++.+|++| ||.-+|+++++.++ T Consensus 79 ~~~~~~~~ip~~IvS~G~d~~I~~lL~~~ 107 (214) T TIGR03333 79 VAFINEHGIPFYVISGGMDFFVYPLLEGI 107 (214) T ss_pred HHHHHHCCCCEEEECCCCHHHHHHHHHHH T ss_conf 99999859978998699338999999963 No 18 >PRK09552 mtnX 2-hydroxy-3-keto-5-methylthiopentenyl-1-phosphate phosphatase; Reviewed Probab=60.17 E-value=11 Score=19.23 Aligned_cols=27 Identities=22% Similarity=0.364 Sum_probs=21.6 Q ss_pred HHHHHCCCCEEEE--CCCCHHHHHHHCCC Q ss_conf 9999639940897--89327999996172 Q 537021.9.peg.8 6 LQRYKKRFLPIIV--GGTGLYFRALTGQL 32 (217) Q Consensus 6 ~e~~~r~kiPIiV--GGTglY~~all~g~ 32 (217) ++--+++.+|++| ||.-+|+++++.++ T Consensus 82 ~~~~~~~~ip~~IvS~G~d~~I~~~L~~~ 110 (219) T PRK09552 82 VQFVNENNISFYVISGGMDFFVYPLLQGL 110 (219) T ss_pred HHHHHHCCCCEEEECCCCHHHHHHHHHHC T ss_conf 99999859979998799469999999960 No 19 >COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism] Probab=58.98 E-value=8.3 Score=19.89 Aligned_cols=19 Identities=5% Similarity=0.121 Sum_probs=13.4 Q ss_pred CCHHHHHHCCCHHHHHHHH Q ss_conf 3101344206856778986 Q 537021.9.peg.8 153 SLDLPIMKAIGVRDIIALL 171 (217) Q Consensus 153 ~~~~~~~~~IGYke~~~yL 171 (217) .++..-+..|.|+|+++-. T Consensus 175 ~pdA~~~~~Lty~e~l~~~ 193 (238) T COG0528 175 DPDAKKYDTLTYDEVLKIG 193 (238) T ss_pred CCCCEECCCCCHHHHHHHC T ss_conf 9882232237999999724 No 20 >COG2100 Predicted Fe-S oxidoreductase [General function prediction only] Probab=53.76 E-value=18 Score=17.68 Aligned_cols=119 Identities=13% Similarity=0.165 Sum_probs=62.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCH Q ss_conf 99999999996223555665531162320235701569999999998534887467754154567770010000107867 Q 537021.9.peg.8 42 IREKIREKLKQYGSHILHDELSSLDSIVARQIHPSDGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPER 121 (217) Q Consensus 42 ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r 121 (217) +.+++-+++++-|.+.+---+..+||..|..++-.+.+-+.+.||+... ..+..-. .-..++...+++. + T Consensus 202 L~~~lv~eLeeAGLdRiNlSv~aLDpk~Ak~L~G~~dYdv~kvle~aE~--------i~~a~id-vlIaPv~lPG~ND-~ 271 (414) T COG2100 202 LSKKLVDELEEAGLDRINLSVDALDPKLAKMLAGRKDYDVKKVLEVAEY--------IANAGID-VLIAPVWLPGVND-D 271 (414) T ss_pred CCHHHHHHHHHHCCCEEEEECCCCCHHHHHHHCCCCCCCHHHHHHHHHH--------HHHCCCC-EEEEEEECCCCCH-H T ss_conf 4599999999708755886202379889877428401178999999999--------9867988-8983144278681-7 Q ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHC Q ss_conf 89999999--7776542113456777751138631013442068567789867 Q 537021.9.peg.8 122 SALKERIR--RRFTQMLESGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIALLK 172 (217) Q Consensus 122 ~~L~~rI~--~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~yL~ 172 (217) +.-+-|. .++..==..+.+- .+..+...++.....++.++++||..+|. T Consensus 272 -E~~~iIe~A~~iGaGkk~p~lg-iQkyipyk~GRkp~~~k~~~fkeFYrwLr 322 (414) T COG2100 272 -EMPKIIEWAREIGAGKKWPPLG-IQKYIPYKFGRKPVIAKVWPFKEFYRWLR 322 (414) T ss_pred -HHHHHHHHHHHHCCCCCCCCCC-EEEEEEECCCCCCCCCCCCCHHHHHHHHH T ss_conf -7899999999848887799853-07755402068863035575999999999 No 21 >cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be Probab=52.84 E-value=12 Score=18.93 Aligned_cols=23 Identities=17% Similarity=0.302 Sum_probs=14.2 Q ss_pred HHHHHHHHHCC-CCEEEECCCCHH Q ss_conf 46899999639-940897893279 Q 537021.9.peg.8 2 LSKKLQRYKKR-FLPIIVGGTGLY 24 (217) Q Consensus 2 ~~k~~e~~~r~-kiPIiVGGTglY 24 (217) +.+..+..++| .+-|||||..++ T Consensus 29 a~~I~~l~~~G~eVviVvggG~i~ 52 (231) T cd04254 29 AREIKEVVDLGVEVAIVVGGGNIF 52 (231) T ss_pred HHHHHHHHHCCCEEEEEECCCCHH T ss_conf 999999997798699998177301 No 22 >PRK10542 glutathionine S-transferase; Provisional Probab=49.58 E-value=23 Score=17.03 Aligned_cols=19 Identities=32% Similarity=0.336 Sum_probs=12.9 Q ss_pred HHHHHCCCCEEEE--CCCCHH Q ss_conf 9999639940897--893279 Q 537021.9.peg.8 6 LQRYKKRFLPIIV--GGTGLY 24 (217) Q Consensus 6 ~e~~~r~kiPIiV--GGTglY 24 (217) +.+.-.|+||.+| ||+.++ T Consensus 44 l~iNP~gkVP~L~~ddg~~l~ 64 (201) T PRK10542 44 LAINPKGQVPALLLDDGTLLT 64 (201) T ss_pred HHHCCCCCCCEEEECCCEEEE T ss_conf 973979998569966983897 No 23 >TIGR01036 pyrD_sub2 dihydroorotate oxidase; InterPro: IPR005719 Dihydroorotate dehydrogenase (1.3.3.1 from EC) (DHOdehase) catalyzes the fourth step in the de novo biosynthesis of pyrimidine, the conversion of dihydroorotate into orotate. DHOdehase is a ubiquitous FAD flavoprotein. In bacteria (gene pyrD), DHOdease is located on the inner side of the cytosolic membrane. In some yeasts, such as in Saccharomyces cerevisiae (gene URA1, subfamily 2), it is a cytosolic protein while in other eukaryotes it is found in the mitochondria . This model describes dihydroorotate dehydrogenase subfamily 2 and includes members from bacteria and eukaryotes. The subfamilies 1 and 2 share extensive homology, particularly toward the C-terminus, however subfamily 2 has a longer N-terminal region.; GO: 0004158 dihydroorotate oxidase activity, 0006207 'de novo' pyrimidine base biosynthetic process, 0016020 membrane. Probab=49.42 E-value=12 Score=18.77 Aligned_cols=13 Identities=15% Similarity=0.031 Sum_probs=4.7 Q ss_pred HHHHHHHHHHHHH Q ss_conf 8999999999999 Q 537021.9.peg.8 177 YDETLQRGIIATN 189 (217) Q Consensus 177 ~~e~~e~~~~~Tr 189 (217) +++++..++..|- T Consensus 201 ~~~LL~~~k~~~~ 213 (370) T TIGR01036 201 LRDLLTAVKEETD 213 (370) T ss_pred HHHHHHHHHHHHH T ss_conf 8999999999999 No 24 >pfam06414 Zeta_toxin Zeta toxin. This family consists of several bacterial zeta toxin proteins. Zeta toxin is thought to be part of a postregulational killing system in bacteria. It relies on antitoxin/toxin systems that secure stable inheritance of low and medium copy number plasmids during cell division and kill cells that have lost the plasmid. Probab=49.02 E-value=23 Score=16.98 Aligned_cols=104 Identities=15% Similarity=0.074 Sum_probs=57.1 Q ss_pred HHHHHHCCHHHHCCCCCHHHHHHHHHHH-HHH---------HCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHH Q ss_conf 6655311623202357015699999999-985---------348874677541545677700100001078678999999 Q 537021.9.peg.8 59 HDELSSLDSIVARQIHPSDGQRIARALE-IKL---------VSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERI 128 (217) Q Consensus 59 ~~~L~~~DP~~a~~i~~nd~~Ri~RaLE-i~~---------~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI 128 (217) |..+..-||..+...-..+..++...+- .+. .|+..+....+....-...=|.+..+++..+.+.-..++ T Consensus 56 y~~l~~~~~~~~~~~~~~~a~~~~~~~~~~a~~~r~n~iiegT~~~~~~~~~~~~~lk~~GY~v~v~~Va~~~e~S~~r~ 135 (191) T pfam06414 56 YDELQKADPKDASELTQPDASRWVEKLIDYAIERGYNIILEGTLRSPDVARKLARKLKAAGYEVEVYVVAVPPELSWLGV 135 (191) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHH T ss_conf 78655407677899989999999999999999759998985777897999999999997899799999988999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHC Q ss_conf 97776542113456777751138631013442068567789867 Q 537021.9.peg.8 129 RRRFTQMLESGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIALLK 172 (217) Q Consensus 129 ~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~yL~ 172 (217) ..|.+.+...| +..+.+..-...-|..+.++.|+ T Consensus 136 ~~Ry~~~~~~g----------R~v~~~~hd~~~~~~~~~~~~l~ 169 (191) T pfam06414 136 LDRYEEELAAG----------RRVPKEVHDAAYNGLPESVEALE 169 (191) T ss_pred HHHHHHHHCCC----------CCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99998510578----------74899999999985999999999 No 25 >cd02021 GntK Gluconate kinase (GntK) catalyzes the phosphoryl transfer from ATP to gluconate. The resulting product gluconate-6-phoshate is an important precursor of gluconate metabolism. GntK acts as a dimmer composed of two identical subunits. Probab=48.57 E-value=13 Score=18.57 Aligned_cols=39 Identities=15% Similarity=0.194 Sum_probs=30.3 Q ss_pred HHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 010000107867899999997776542113456777751 Q 537021.9.peg.8 110 ESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLM 148 (217) Q Consensus 110 ~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~ 148 (217) .++.+++|..+.++|.+|+.+|-..-+...|++---..+ T Consensus 97 ~~v~fi~L~~~~~~l~~Rl~~R~~hfm~~~ll~sQ~~~l 135 (150) T cd02021 97 PRVRFVHLDGPREVLAERLAARKGHFMPADLLDSQFETL 135 (150) T ss_pred CCEEEEEEECCHHHHHHHHHHCCCCCCCHHHHHHHHHHC T ss_conf 985899986999999999984635799989999999972 No 26 >pfam02679 ComA (2R)-phospho-3-sulfolactate synthase (ComA). In methanobacteria (2R)-phospho-3-sulfolactate synthase (ComA) catalyses the first step of the biosynthesis of coenzyme M from phosphoenolpyruvate (P-enolpyruvate). This novel enzyme catalyses the stereospecific Michael addition of sulfite to P-enolpyruvate, forming L-2-phospho-3-sulfolactate (PSL). It is suggested that the ComA-catalysed reaction is analogous to those reactions catalysed by beta-elimination enzymes that proceed through an enolate intermediate. Probab=46.99 E-value=25 Score=16.78 Aligned_cols=24 Identities=21% Similarity=0.247 Sum_probs=20.6 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHH Q ss_conf 468999996399408978932799 Q 537021.9.peg.8 2 LSKKLQRYKKRFLPIIVGGTGLYF 25 (217) Q Consensus 2 ~~k~~e~~~r~kiPIiVGGTglY~ 25 (217) ++++|+...+..|++..|||-|=+ T Consensus 57 l~eKI~l~~~~~V~v~~GGtlfE~ 80 (245) T pfam02679 57 LKEKIDLAHEHGVYVYTGGTLFEI 80 (245) T ss_pred HHHHHHHHHHCCCEEECCCHHHHH T ss_conf 999999999859948479699999 No 27 >pfam00491 Arginase Arginase family. Probab=45.71 E-value=16 Score=18.07 Aligned_cols=27 Identities=15% Similarity=0.172 Sum_probs=19.9 Q ss_pred HHHHHCCCCEEEECCCCHHHHHHHCCC Q ss_conf 999963994089789327999996172 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGLYFRALTGQL 32 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTglY~~all~g~ 32 (217) .++++.|++||++||-+----+.+.|+ T Consensus 73 ~~~~~~~~~pi~lGGdHsit~~~~~~l 99 (274) T pfam00491 73 AAILAAGKFPLVLGGDHSITLGALRAL 99 (274) T ss_pred HHHHHCCCEEEEECCCCCCCHHHHHHH T ss_conf 999977985799757864428899999 No 28 >pfam09823 DUF2357 Domain of unknown function (DUF2357). This entry was previously the N terminal portion of DUF524 (pfam04411) before it was split into two. This domain has no known function. It is predicted to adopt an all beta secondary structure pattern followed by mainly alpha-helical structures. Probab=45.41 E-value=25 Score=16.82 Aligned_cols=39 Identities=21% Similarity=0.359 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHH-HHHCCCHHHHHHH Q ss_conf 7776542113456777751138631013-4420685677898 Q 537021.9.peg.8 130 RRFTQMLESGAIDEIRSLMKMNLSLDLP-IMKAIGVRDIIAL 170 (217) Q Consensus 130 ~R~~~Ml~~GlieEv~~L~~~~~~~~~~-~~~~IGYke~~~y 170 (217) .+++..+..++..||-.|- +...+++ ..++-||+++..+ T Consensus 206 ~~L~~~l~~~f~~~VG~l~--~~p~~SlVLq~~~GYrevyr~ 245 (246) T pfam09823 206 RQLERLLRHPFFREVGELD--GLPSNSLVLQKRAGYREVYRI 245 (246) T ss_pred HHHHHHHCCCHHHHHHCCC--CCCCCCHHHHCCCCHHHHHHH T ss_conf 9999997483877764226--887528776245448999850 No 29 >KOG3347 consensus Probab=44.76 E-value=27 Score=16.55 Aligned_cols=62 Identities=19% Similarity=0.355 Sum_probs=45.2 Q ss_pred CCHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHC--CCCCCCCHHHEEEEECCCHHHHHHHHHHHH Q ss_conf 16232023570156999999999853488746775415--456777001000010786789999999777 Q 537021.9.peg.8 65 LDSIVARQIHPSDGQRIARALEIKLVSGQSIIEFWKQA--PNPFIPLESAHKIIILPERSALKERIRRRF 132 (217) Q Consensus 65 ~DP~~a~~i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~--~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~ 132 (217) .|-++-. |.-|-.+++-.||-....|-.+-+|+... |..|+. ++|+|.+|-+.||+|+..|. T Consensus 51 yDE~y~c--~i~DEdkv~D~Le~~m~~Gg~IVDyHgCd~Fperwfd----lVvVLr~~~s~LY~RL~sRg 114 (176) T KOG3347 51 YDEEYKC--HILDEDKVLDELEPLMIEGGNIVDYHGCDFFPERWFD----LVVVLRTPNSVLYDRLKSRG 114 (176) T ss_pred CCCCCCC--CCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHEE----EEEEEECCCHHHHHHHHHCC T ss_conf 2322367--5456789998888987568848860056765011355----79999468459999999758 No 30 >COG3892 Uncharacterized protein conserved in bacteria [Function unknown] Probab=44.65 E-value=27 Score=16.54 Aligned_cols=137 Identities=16% Similarity=0.176 Sum_probs=76.6 Q ss_pred CCCCHHHHHHHHHH----HHHHHHHHHHHHHHHCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCC----C- Q ss_conf 89998999999999----99622355566553116232023570156999999999853488746775415456----7- Q 537021.9.peg.8 36 PEIPTAIREKIREK----LKQYGSHILHDELSSLDSIVARQIHPSDGQRIARALEIKLVSGQSIIEFWKQAPNP----F- 106 (217) Q Consensus 36 p~~~~~ir~~~~~~----~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~----~- 106 (217) |..+.++|.+-+.. ++.. ...-++.|-++=|-. .-+.|-+-+.|++|-++.-|..+- |++-.|.. | T Consensus 142 PdD~a~lr~~Qe~k~~~l~eA~-r~~g~ElLlEiI~pk---~~~~dd~t~~Raier~Y~lGv~PD-WWKLePl~~~~aW~ 216 (310) T COG3892 142 PDDPAELRAEQEQKLLELFEAA-RKSGHELLLEIILPK---AGKSDDRTVPRAIERFYNLGVYPD-WWKLEPLASPDAWA 216 (310) T ss_pred CCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCCC---CCCCCHHHHHHHHHHHHHCCCCCC-CCCCCCCCCHHHHH T ss_conf 9998999899999999999999-863046655530454---688511589999999997377864-01278777867999 Q ss_pred ---------CC-CHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHCCCCC Q ss_conf ---------77-00100001078678999999977765421134567777511386310134420685677898678999 Q 537021.9.peg.8 107 ---------IP-LESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIALLKGEIN 176 (217) Q Consensus 107 ---------~~-~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~yL~g~~~ 176 (217) .| ..-+.+++|+.|-++|..--..-... -+-+...+-++|=+--...+|.|.++ T Consensus 217 ~i~~~I~~~DP~cRGiViLGLdAP~~eL~~sF~aaa~~----------------piVKGFAVGRTiF~daar~WL~g~i~ 280 (310) T COG3892 217 EIEAAIERRDPHCRGIVILGLDAPLEELEDSFAAAAGF----------------PIVKGFAVGRTIFADAARAWLGGDID 280 (310) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHCCC----------------CEEEEEECCHHHHHHHHHHHHCCCCC T ss_conf 99999980595402257854667199999998854489----------------70020000104558789999748887 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 89999999999998888 Q 537021.9.peg.8 177 YDETLQRGIIATNKYAK 193 (217) Q Consensus 177 ~~e~~e~~~~~Tr~yAK 193 (217) -++++..+..+-.+++- T Consensus 281 D~qaia~ma~~f~~L~d 297 (310) T COG3892 281 DEQAIADMAAKFKQLID 297 (310) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 79999999999999999 No 31 >PRK06153 hypothetical protein; Provisional Probab=44.61 E-value=11 Score=19.16 Aligned_cols=23 Identities=35% Similarity=0.624 Sum_probs=17.5 Q ss_pred HHHHCCCCEEE-ECCCCHHHHHHH Q ss_conf 99963994089-789327999996 Q 537021.9.peg.8 7 QRYKKRFLPII-VGGTGLYFRALT 29 (217) Q Consensus 7 e~~~r~kiPIi-VGGTglY~~all 29 (217) +.++-.+|.|| +||||-|+=-++ T Consensus 172 ~kL~~~~IAIIGLGGTGsYILD~v 195 (393) T PRK06153 172 AKLEGEKVAIIGLGGTGAYILDLV 195 (393) T ss_pred HHHCCCEEEEEECCCCCHHHHHHH T ss_conf 652477599995587512222242 No 32 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=44.15 E-value=26 Score=16.68 Aligned_cols=24 Identities=38% Similarity=0.589 Sum_probs=18.9 Q ss_pred HHCCCCEEEECCCC----HHHHHHHCCC Q ss_conf 96399408978932----7999996172 Q 537021.9.peg.8 9 YKKRFLPIIVGGTG----LYFRALTGQL 32 (217) Q Consensus 9 ~~r~kiPIiVGGTg----lY~~all~g~ 32 (217) ...+|-=||+|||| -|++|+++-+ T Consensus 155 i~~~knIii~GGTgSGKTTf~kal~~~I 182 (328) T TIGR02788 155 IASRKNIIISGGTGSGKTTFLKALVKEI 182 (328) T ss_pred HHCCCEEEEEECCCCHHHHHHHHHHHCC T ss_conf 8738919999068971899999997327 No 33 >KOG0150 consensus Probab=43.84 E-value=25 Score=16.73 Aligned_cols=21 Identities=24% Similarity=0.338 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999977765421134567 Q 537021.9.peg.8 123 ALKERIRRRFTQMLESGAIDE 143 (217) Q Consensus 123 ~L~~rI~~R~~~Ml~~GlieE 143 (217) .....+.+|+..|...++..| T Consensus 35 rHke~V~Kritdi~rks~~ke 55 (336) T KOG0150 35 RHKENVAKRITDIHRKSLKKE 55 (336) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 799999999999987657778 No 34 >pfam01113 DapB_N Dihydrodipicolinate reductase, N-terminus. Dihydrodipicolinate reductase (DapB) reduces the alpha,beta-unsaturated cyclic imine, dihydro-dipicolinate. This reaction is the second committed step in the biosynthesis of L-lysine and its precursor meso-diaminopimelate, which are critical for both protein and cell wall biosynthesis. The N-terminal domain of DapB binds the dinucleotide NADPH. Probab=43.64 E-value=17 Score=17.83 Aligned_cols=21 Identities=24% Similarity=0.320 Sum_probs=9.1 Q ss_pred CCCHHHHHHHHHHHHHHHCCC Q ss_conf 570156999999999853488 Q 537021.9.peg.8 73 IHPSDGQRIARALEIKLVSGQ 93 (217) Q Consensus 73 i~~nd~~Ri~RaLEi~~~tGk 93 (217) ||...+.=+...++.+...|. T Consensus 70 IDFS~p~~~~~~~~~~~~~~~ 90 (122) T pfam01113 70 IDFTTPEATLENLELALKHGK 90 (122) T ss_pred EEECCHHHHHHHHHHHHHCCC T ss_conf 990687899999999996799 No 35 >cd04742 NPD_FabD 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase FabD. NPD is part of the nitroalkaneoxidizing enzyme family, that catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDPs are members of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=43.63 E-value=28 Score=16.44 Aligned_cols=15 Identities=27% Similarity=0.222 Sum_probs=5.5 Q ss_pred CCEEEECCCCHHHHH Q ss_conf 940897893279999 Q 537021.9.peg.8 13 FLPIIVGGTGLYFRA 27 (217) Q Consensus 13 kiPIiVGGTglY~~a 27 (217) ++.++-=||.|=.+| T Consensus 289 kvQVLkrGt~Fp~RA 303 (418) T cd04742 289 KVQVLKRGTLFPARA 303 (418) T ss_pred EEEEECCCCCCHHHH T ss_conf 568860455448779 No 36 >PRK00131 aroK shikimate kinase; Reviewed Probab=42.64 E-value=24 Score=16.90 Aligned_cols=75 Identities=13% Similarity=0.241 Sum_probs=48.5 Q ss_pred HHEEEEECCCHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHH-HC-CCCCHHHHHHHHHH Q ss_conf 100001078678999999977765-421-134567777511386310134420685677898-67-89998999999999 Q 537021.9.peg.8 111 SAHKIIILPERSALKERIRRRFTQ-MLE-SGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIAL-LK-GEINYDETLQRGII 186 (217) Q Consensus 111 ~~~~i~l~~~r~~L~~rI~~R~~~-Ml~-~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~y-L~-g~~~~~e~~e~~~~ 186 (217) ...+++|..+-+.+.+|+..+-+. ++. .+..++++.|++.- .+. |.+..++ ++ +..|.++.++.+.. T Consensus 97 ~g~vV~L~~~~e~l~~Rl~~~~~RPll~~~~~~~~l~~l~~~R----~~~-----Y~~~Ad~~Idt~~~s~~ei~~~Ii~ 167 (175) T PRK00131 97 RGTVVYLDASFEELLRRLARDRNRPLLQTEDPKEKLRALYEER----DPL-----YEEVADLTIETDRRSPEEVVNEILE 167 (175) T ss_pred CCCEEEEECCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH----HHH-----HHHHCCEEEECCCCCHHHHHHHHHH T ss_conf 6937998579999999864899998788998699999999999----998-----9753899998999999999999999 Q ss_pred HHHHHHHH Q ss_conf 99988888 Q 537021.9.peg.8 187 ATNKYAKR 194 (217) Q Consensus 187 ~Tr~yAKR 194 (217) .-..|.+| T Consensus 168 ~L~~~~~~ 175 (175) T PRK00131 168 LLELLLRR 175 (175) T ss_pred HHHHHHCC T ss_conf 99998529 No 37 >COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=41.28 E-value=16 Score=17.99 Aligned_cols=31 Identities=23% Similarity=0.317 Sum_probs=18.6 Q ss_pred HHHHHCCCHHHHH----HHHC-CCCCHHHHHHHHHHH Q ss_conf 1344206856778----9867-899989999999999 Q 537021.9.peg.8 156 LPIMKAIGVRDII----ALLK-GEINYDETLQRGIIA 187 (217) Q Consensus 156 ~~~~~~IGYke~~----~yL~-g~~~~~e~~e~~~~~ 187 (217) .+.+...||..+. ++|+ ..++ ++.+..+.+. T Consensus 271 ~~~~~~~g~~~I~~~fIP~Lk~~Gvd-e~~i~~mlvd 306 (316) T COG1735 271 LKANGGWGYGYILNDFIPRLKRHGVD-EETIDTMLVD 306 (316) T ss_pred HHHCCCCCCCHHHHHHHHHHHHCCCC-HHHHHHHHHH T ss_conf 54058866142067567999874998-9999999861 No 38 >COG4359 Uncharacterized conserved protein [Function unknown] Probab=40.22 E-value=32 Score=16.10 Aligned_cols=27 Identities=22% Similarity=0.363 Sum_probs=19.0 Q ss_pred HHHHCCCCEEEE--CCCCHHHHHHHCCCC Q ss_conf 999639940897--893279999961724 Q 537021.9.peg.8 7 QRYKKRFLPIIV--GGTGLYFRALTGQLS 33 (217) Q Consensus 7 e~~~r~kiPIiV--GGTglY~~all~g~~ 33 (217) |-.+-+-+|++| ||+-.|+++|+.|+- T Consensus 83 e~ike~di~fiVvSsGm~~fI~~lfe~iv 111 (220) T COG4359 83 EWIKEHDIPFIVVSSGMDPFIYPLFEGIV 111 (220) T ss_pred HHHHHCCCCEEEEECCCCHHHHHHHHHHC T ss_conf 99997599889993897547999998643 No 39 >TIGR02782 TrbB_P P-type conjugative transfer ATPase TrbB; InterPro: IPR014149 This entry represents TrbB, a protein, which is encoded in the trb locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for plasmid conjugative transfer . TrbB is a homologue of the vir system VirB11 ATPase , and the Flp pilus system ATPase TadA .. Probab=39.31 E-value=16 Score=17.99 Aligned_cols=24 Identities=29% Similarity=0.585 Sum_probs=16.1 Q ss_pred HCCCCEEEECCCC----HHHHHHHCCCCC Q ss_conf 6399408978932----799999617244 Q 537021.9.peg.8 10 KKRFLPIIVGGTG----LYFRALTGQLSI 34 (217) Q Consensus 10 ~r~kiPIiVGGTg----lY~~all~g~~~ 34 (217) +|.+| +|+|||| -=.+||+..++. T Consensus 138 ar~NI-lv~GGTGSGKTTLaNAlla~I~~ 165 (315) T TIGR02782 138 ARKNI-LVVGGTGSGKTTLANALLAEIAK 165 (315) T ss_pred HCCCE-EEECCCCCCHHHHHHHHHHHHHH T ss_conf 12988-99814588579999999998852 No 40 >TIGR01694 MTAP methylthioadenosine phosphorylase; InterPro: IPR010044 This entry represents the methylthioadenosine phosphorylase found in metazoa, cyanobacteria and a limited number of archaea such as Sulfolobus, Aeropyrum, Pyrobaculum, Pyrococcus, and Thermoplasma. This enzyme is responsible for the first step in the methionine salvage pathway after the transfer of the amino acid moiety from S-adenosylmethionine. The enzyme from human is well-characterised and its crystal structure has been described , . A misleading characterisation is found for a Sulfolobus solfataricus enzyme which is called a MTAP. In fact, as uncovered by the genome sequence of S. solfataricus, there are at least two nucleotide phosphorylases and the one found in the MTAP clade is not the one annotated as such. The sequence in this clade has not been isolated but is likely to be the authentic SsMTAP as it displays all of the conserved active site residues found in the human enzyme . This explains the finding that the characterised enzyme has greater efficiency towards the purines inosine, guanosine and adenosine over MTA. In fact, this misnaming of this enzyme has been carried forward to several publications including a crystal structure . Related sequences include: 1) several archaeal sequences which appear to contain several residues characteristic of phosphorylases which act on guanosine or inosine (according to the crystal structure of MTAP and alignments). 2) sequences from Mycobacterium tuberculosis and Streptomyces coelicolor which have better, although not perfect retention of the active site residues, but considering the general observation that bacteria utilise the MTA/SAH nucleotidase enzyme and a kinase to do this reaction; and 3) a sequence from Drosophila which appears to be a recent divergence (long branch in neighbour-joining trees) and lacks some of the conserved active site residues. ; GO: 0016763 transferase activity transferring pentosyl groups. Probab=38.81 E-value=11 Score=19.13 Aligned_cols=12 Identities=58% Similarity=1.306 Sum_probs=5.5 Q ss_pred EECCCCHH-HHHH Q ss_conf 97893279-9999 Q 537021.9.peg.8 17 IVGGTGLY-FRAL 28 (217) Q Consensus 17 iVGGTglY-~~al 28 (217) |.|||||| +..+ T Consensus 4 iIGGsG~y~~~~~ 16 (263) T TIGR01694 4 IIGGSGLYDLEGL 16 (263) T ss_pred EECCCCCCCCCCC T ss_conf 7517320040011 No 41 >TIGR02667 moaB_proteo molybdenum cofactor biosynthesis protein B; InterPro: IPR013484 MoaB is thought to be involved in molybdopterin biosynthesis, though its exact role is not known. Structural studies of this polypeptide suggest that it may play a role in substrate-shuttling during biosynthesis . MoaB was capable of binding GTP, and it was suggested that the putative active site could also bind precursor Z and/or molybdenum. Potential protein interaction domains were also found, implying that MoaB may play a transport and/or storage role in molybdopterin biosynthesis. . Probab=38.16 E-value=12 Score=18.84 Aligned_cols=29 Identities=24% Similarity=0.284 Sum_probs=11.0 Q ss_pred HHHHHHHHHCCHHHH-CCCCCHHHHHHHHH Q ss_conf 555665531162320-23570156999999 Q 537021.9.peg.8 56 HILHDELSSLDSIVA-RQIHPSDGQRIARA 84 (217) Q Consensus 56 ~~l~~~L~~~DP~~a-~~i~~nd~~Ri~Ra 84 (217) ..|-+.|.+-=..-| ..|-+.|.|+|..- T Consensus 25 ~~Lv~~L~~AGH~la~R~iv~DD~y~iRA~ 54 (163) T TIGR02667 25 QVLVERLTEAGHRLAERAIVKDDIYQIRAQ 54 (163) T ss_pred HHHHHHHHHHCHHHHCCCCCHHHHHHHHHH T ss_conf 789999987150321143267448999999 No 42 >TIGR02078 AspKin_pair Pyrococcus aspartate kinase subunit, putative; InterPro: IPR011819 This family consists of proteins restricted to and found as paralogous pairs (typically close together) in species of Pyrococcus, a hyperthermophilic archaeal genus. Members are always found close to other genes of threonine biosynthesis and appear to represent the Pyrococcal form of aspartate kinase. Alignment to aspartokinase III from Escherichia coli shows that 300 N-terminal and 20 C-terminal amino acids are homologous, but the form in Pyrococcus lacks ~100 amino acids in between.. Probab=37.82 E-value=21 Score=17.22 Aligned_cols=22 Identities=32% Similarity=0.421 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997776542113456777751 Q 537021.9.peg.8 127 RIRRRFTQMLESGAIDEIRSLM 148 (217) Q Consensus 127 rI~~R~~~Ml~~GlieEv~~L~ 148 (217) |=..++-+|++.|.+==|-+++ T Consensus 152 ~~~k~~~~~l~~G~iPvipGF~ 173 (341) T TIGR02078 152 RNAKILYEVLESGKIPVIPGFY 173 (341) T ss_pred HHHHHHHHHHHCCCEEEECCCC T ss_conf 5777889888658302214620 No 43 >PRK09825 idnK D-gluconate kinase; Provisional Probab=37.40 E-value=26 Score=16.63 Aligned_cols=73 Identities=8% Similarity=0.129 Sum_probs=47.7 Q ss_pred HHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 10000107867899999997776542113456777751138631013442068567789867899989999999999998 Q 537021.9.peg.8 111 SAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIALLKGEINYDETLQRGIIATNK 190 (217) Q Consensus 111 ~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~yL~g~~~~~e~~e~~~~~Tr~ 190 (217) ++.+++|..+++.+.+|+..|-..-+...|++-=-..++.=...+..+. .++=.-+.++.++.+..+-+- T Consensus 99 ~v~fv~L~g~~~~i~~Rl~~R~gHFMp~~LL~SQf~tLE~P~~dE~~v~----------~idi~~~~e~iv~~~~~al~a 168 (176) T PRK09825 99 NVHFLWLDGDYETILARMQRRAGHFMPPDLLQSQFDALERPCADEHDIA----------RIDVNHDIENVTEQCRQAVQA 168 (176) T ss_pred CEEEEEEECCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCCEE----------EEECCCCHHHHHHHHHHHHHH T ss_conf 8799997189999999997460379997999989998179998889869----------997899999999999999999 Q ss_pred HHH Q ss_conf 888 Q 537021.9.peg.8 191 YAK 193 (217) Q Consensus 191 yAK 193 (217) |.| T Consensus 169 ~~~ 171 (176) T PRK09825 169 FRQ 171 (176) T ss_pred HHH T ss_conf 998 No 44 >PRK13776 formimidoylglutamase; Provisional Probab=36.97 E-value=26 Score=16.68 Aligned_cols=29 Identities=14% Similarity=0.012 Sum_probs=20.3 Q ss_pred HHHHHHHCCCCEEEECCCCHHHHHHHCCC Q ss_conf 89999963994089789327999996172 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGTGLYFRALTGQL 32 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGTglY~~all~g~ 32 (217) +.-++++.|+.||+.||-+---.+-+.|+ T Consensus 104 ~v~~ll~~g~~pIvLGGgH~iA~g~~~G~ 132 (311) T PRK13776 104 RVADLLARGHRVVGLGGGHEIAYASFAGL 132 (311) T ss_pred HHHHHHHCCCEEEEECCCCHHHHHHHHHH T ss_conf 99999976985799888532534306999 No 45 >PRK13773 formimidoylglutamase; Provisional Probab=36.66 E-value=26 Score=16.60 Aligned_cols=18 Identities=17% Similarity=0.318 Sum_probs=14.7 Q ss_pred HHHHHCCCCEEEECCCCH Q ss_conf 999963994089789327 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGL 23 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTgl 23 (217) -++++.|+.||+.||-+= T Consensus 112 ~~ll~~g~~PIvLGGgHs 129 (320) T PRK13773 112 AGLLDAGNFPVVLGGGHE 129 (320) T ss_pred HHHHHCCCEEEEECCCCC T ss_conf 999977990699778664 No 46 >PRK13775 formimidoylglutamase; Provisional Probab=36.55 E-value=31 Score=16.15 Aligned_cols=19 Identities=21% Similarity=0.319 Sum_probs=15.2 Q ss_pred HHHHCCCCEEEECCCCHHH Q ss_conf 9996399408978932799 Q 537021.9.peg.8 7 QRYKKRFLPIIVGGTGLYF 25 (217) Q Consensus 7 e~~~r~kiPIiVGGTglY~ 25 (217) ++.+.|.+||+.||-+--- T Consensus 118 ~il~~g~~PivlGGdHsia 136 (328) T PRK13775 118 RMCDLNLKPIVLGGGHETA 136 (328) T ss_pred HHHHCCCEEEEECCCCHHH T ss_conf 9997798789974872577 No 47 >PRK00048 dihydrodipicolinate reductase; Provisional Probab=36.41 E-value=25 Score=16.80 Aligned_cols=24 Identities=21% Similarity=0.311 Sum_probs=15.0 Q ss_pred HHHHHHHCCCCEEEECCCCHHHHH Q ss_conf 899999639940897893279999 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGTGLYFRA 27 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGTglY~~a 27 (217) ..++...+.++|+|+|=||+--+. T Consensus 84 ~~~~~~~~~~~~~ViGTTG~~~~~ 107 (265) T PRK00048 84 ENLEFALEHGKPLVIGTTGFTEEQ 107 (265) T ss_pred HHHHHHHHCCCCEEEEECCCCHHH T ss_conf 999999974997799608999899 No 48 >PRK04195 replication factor C large subunit; Provisional Probab=36.13 E-value=36 Score=15.69 Aligned_cols=88 Identities=17% Similarity=0.154 Sum_probs=49.5 Q ss_pred HHHHHHHCCCCEEEECCCCHH------HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCHH Q ss_conf 899999639940897893279------99996172448899989999999999962235556655311623202357015 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGTGLY------FRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHPSD 77 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGTglY------~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~nd 77 (217) ..++..++.+.||+|=-+-.| +++...-+.+.|..+..+...++.....+|.+.=-+.|..+ .....+| T Consensus 122 al~~~ik~s~~PiIli~Nd~~~~~~~~lrs~c~~i~F~~~~~~~I~~~L~~I~~~Egi~i~~~aL~~I-----a~~s~GD 196 (403) T PRK04195 122 AILEIIKKAKNPIILTANDPYDPSLRPLRNACLMIEFKRLSKRSIVPVLKRICRKEGIECEEEALREI-----AERSGGD 196 (403) T ss_pred HHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHH-----HHHCCCH T ss_conf 99999854887089982684556717799766122179949999999999999976999999999999-----9987973 Q ss_pred HHHHHHHHHHHHHCCCCHH Q ss_conf 6999999999853488746 Q 537021.9.peg.8 78 GQRIARALEIKLVSGQSII 96 (217) Q Consensus 78 ~~Ri~RaLEi~~~tGk~~s 96 (217) .|..+..|+....+++.++ T Consensus 197 lR~aIN~LQ~~~~~~~~~~ 215 (403) T PRK04195 197 LRSAINDLQAIAEGGGRIT 215 (403) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 9999999999974899775 No 49 >COG3311 AlpA Predicted transcriptional regulator [Transcription] Probab=35.56 E-value=37 Score=15.63 Aligned_cols=60 Identities=18% Similarity=0.215 Sum_probs=42.5 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEE---ECCCHHHHHHHHHHHHHH Q ss_conf 57015699999999985348874677541545677700100001---078678999999977765 Q 537021.9.peg.8 73 IHPSDGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKII---ILPERSALKERIRRRFTQ 134 (217) Q Consensus 73 i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~---l~~~r~~L~~rI~~R~~~ 134 (217) -+.+...|++|.=||...||.+-+.++..-+...+|. ...++ +-.+..+..+.|+.|+.. T Consensus 5 ~~~~~~~r~lrl~ev~~~~GlSrstiYr~i~~~~FPk--pvklG~r~v~W~~SEI~~Wi~~~~~~ 67 (70) T COG3311 5 VEMRHTDRLLRLPEVAQLTGLSRSTIYRLIKDGTFPK--PVKLGGRSVAWPESEIDEWIASRKAA 67 (70) T ss_pred CCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCCCCCC--CEECCCCCCCCCHHHHHHHHHHHHHC T ss_conf 5454421152089999997766899999980477999--86227641245599999999987750 No 50 >PRK11545 gntK gluconate kinase 1; Provisional Probab=35.01 E-value=30 Score=16.26 Aligned_cols=38 Identities=8% Similarity=0.134 Sum_probs=30.8 Q ss_pred HHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 10000107867899999997776542113456777751 Q 537021.9.peg.8 111 SAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLM 148 (217) Q Consensus 111 ~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~ 148 (217) ++.+++|..+++.+.+|+..|-..-+...|++-=-..+ T Consensus 104 ~~~fv~L~g~~~~i~~Rl~~R~~HFmp~~LL~SQf~tL 141 (177) T PRK11545 104 NLSFIYLKGDFDVIESRLKARKGHFFKTQMLVTQFETL 141 (177) T ss_pred CEEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 75999972999999999974646899878999899981 No 51 >pfam01725 Ham1p_like Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae HAM1. The HAM1 protein protects the cell from HAP, either on the level of deoxynucleoside triphosphate or the DNA level by a yet unidentified set of reactions. Probab=32.86 E-value=41 Score=15.35 Aligned_cols=26 Identities=27% Similarity=0.298 Sum_probs=20.1 Q ss_pred HHHHHHHCCCCEEEECCCCHHHHHHH Q ss_conf 89999963994089789327999996 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGTGLYFRALT 29 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGTglY~~all 29 (217) |....++.=..|++|--|||++.||- T Consensus 49 Ka~~~~~~~~~p~iaDDSGL~i~aL~ 74 (181) T pfam01725 49 KAKAAAKALGKPVLADDSGLCVDALN 74 (181) T ss_pred HHHHHHHHHCCCEEEECCHHHHHHHC T ss_conf 99999997089779824067899875 No 52 >TIGR01489 DKMTPPase-SF 2,3-diketo-5-methylthio-1-phosphopentane phosphatase; InterPro: IPR006384 This group of sequences belong to the IB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. With exceptions from Bacillus subtilis and Clostridium acetabutylicum, the members of this group are all eukaryotic, spanning metazoa, plants and fungi. . Probab=32.70 E-value=41 Score=15.34 Aligned_cols=30 Identities=17% Similarity=0.347 Sum_probs=20.0 Q ss_pred HHHHHHCCC--CEEEE--CCCCHHHHHHHCCCCC Q ss_conf 999996399--40897--8932799999617244 Q 537021.9.peg.8 5 KLQRYKKRF--LPIIV--GGTGLYFRALTGQLSI 34 (217) Q Consensus 5 ~~e~~~r~k--iPIiV--GGTglY~~all~g~~~ 34 (217) -++-.+... +|++| +|+-+|+++|++++.. T Consensus 81 f~~~~~~~~dr~d~~v~S~g~~~FI~~vlE~~~~ 114 (207) T TIGR01489 81 FIEFIKEKGDRIDFIVISDGMEFFIDKVLEQIGV 114 (207) T ss_pred HHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 9999983588113798858863678999987420 No 53 >pfam09074 Mer2 Mer2. Mer2 (Rec107) forms part of a complex that is required for meiotic double strand DNA break formation. Mer2 increases in abundance and is phosphorylated during the prophase phase of cell division. Blocking double strand break formation results in delayed dephosphorylation and dissociation of Mer2 from the chromosome. Probab=32.66 E-value=41 Score=15.33 Aligned_cols=61 Identities=15% Similarity=0.053 Sum_probs=39.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHH-HHHHHHHHHHHHCCC Q ss_conf 776542113456777751138631013442068567789867899989999999999-998888876886288 Q 537021.9.peg.8 131 RFTQMLESGAIDEIRSLMKMNLSLDLPIMKAIGVRDIIALLKGEINYDETLQRGIIA-TNKYAKRQKTWLCHQ 202 (217) Q Consensus 131 R~~~Ml~~GlieEv~~L~~~~~~~~~~~~~~IGYke~~~yL~g~~~~~e~~e~~~~~-Tr~yAKRQ~TWfr~~ 202 (217) -+..-+++++.+++.......-....+++ .||.+. +-+++++..-+ ||++-|||..|++.- T Consensus 93 dl~~~i~nql~~~~~s~~~q~~~~~~~~~--~~q~~~---------l~e~m~r~~~nI~r~~~~rq~e~~k~~ 154 (193) T pfam09074 93 DLDVNIENQLQDVKKSFSDQLKSINTNSN--LGQIKN---------LTECMARQEANILRDIIKSQEKLQKAV 154 (193) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCH--HHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88899988999999999988613567730--577775---------368999998889999999999999998 No 54 >pfam07271 Cytadhesin_P30 Cytadhesin P30/P32. This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. Probab=32.42 E-value=42 Score=15.31 Aligned_cols=50 Identities=24% Similarity=0.432 Sum_probs=28.6 Q ss_pred CCEEEECCCCHHHHHHHCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9408978932799999617244-8899989999999999962235556655311 Q 537021.9.peg.8 13 FLPIIVGGTGLYFRALTGQLSI-MPEIPTAIREKIREKLKQYGSHILHDELSSL 65 (217) Q Consensus 13 kiPIiVGGTglY~~all~g~~~-~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~ 65 (217) -+|.|.||-||-.-|++-|+.. +|-+. |++-+-+-+++-.+.+.+.|..+ T Consensus 69 F~P~V~~~~G~~alaiiLGL~IGlPIvK---rKEk~lleEkE~~eqmaEqL~~I 119 (279) T pfam07271 69 FIPTVAGGFGFSALAIILGLAIGLPIVK---RKEKRLLEEKERQEQMAEQLQRI 119 (279) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCHHHH---HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2221034201899999987421751233---68888788889999999999888 No 55 >TIGR01977 am_tr_V_EF2568 cysteine desulfurase family protein; InterPro: IPR010969 This entry describes a subfamily of probable pyridoxal phosphate-dependent enzymes in the aminotransferase class V family. Related families contain members active as cysteine desulphurases, selenocysteine lyases, or both. The members of this family form a distinct clade and all are shorter at the N terminus. The function of members of this family are unknown.. Probab=32.24 E-value=22 Score=17.13 Aligned_cols=115 Identities=14% Similarity=0.205 Sum_probs=61.1 Q ss_pred CCEEEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHH-HHHHHHHHHHHHHHHHCCHHHHCCCCCHHHHHHHHHHHHHHHC Q ss_conf 940897893279999961724488999899999999-9996223555665531162320235701569999999998534 Q 537021.9.peg.8 13 FLPIIVGGTGLYFRALTGQLSIMPEIPTAIREKIRE-KLKQYGSHILHDELSSLDSIVARQIHPSDGQRIARALEIKLVS 91 (217) Q Consensus 13 kiPIiVGGTglY~~all~g~~~~p~~~~~ir~~~~~-~~~~~g~~~l~~~L~~~DP~~a~~i~~nd~~Ri~RaLEi~~~t 91 (217) -.||..||||.+=..+-.+ +.+|+ +.+. -++..|.-.|-+=++.+.+..-++|......--.+.++.... T Consensus 226 l~pl~~GGTGs~S~~~~~P-~~~Pd-------rfE~GTLN~pGI~gL~agik~~~~~gi~~i~~kE~~l~~~l~~~L~~- 296 (384) T TIGR01977 226 LKPLKSGGTGSHSALIDQP-SELPD-------RFESGTLNTPGIAGLNAGIKFIEKIGIANIAKKELKLTEKLLNGLKE- 296 (384) T ss_pred HHHHCCCCCCCCCCCCCCC-CCCCC-------HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC- T ss_conf 4532068875453444577-60432-------01144003799999988878875644778999999999999874016- Q ss_pred CCCHHHHHHHCCCCCCCCHHHEEEEECC-CHHHHHHHHHHHHHHHHHHH Q ss_conf 8874677541545677700100001078-67899999997776542113 Q 537021.9.peg.8 92 GQSIIEFWKQAPNPFIPLESAHKIIILP-ERSALKERIRRRFTQMLESG 139 (217) Q Consensus 92 Gk~~s~~~~~~~~~~~~~~~~~~i~l~~-~r~~L~~rI~~R~~~Ml~~G 139 (217) ..++. .+ .+......-.++.|-++- |.+.+=.-++..++-|.-+| T Consensus 297 ~~~V~-iy--G~~d~~~~~~vvSf~~~g~d~~~va~~L~~~~~I~~RtG 342 (384) T TIGR01977 297 IKKVK-IY--GPKDPAKRVGVVSFTVEGIDSEEVADILDEKFDIAVRTG 342 (384) T ss_pred CCCEE-EE--CCCCHHHCCCEEEEEECCCCHHHHHHHHHHCCCCEEECC T ss_conf 78067-62--770343326758998348887789999986189189767 No 56 >TIGR00972 3a0107s01c2 phosphate ABC transporter, ATP-binding protein; InterPro: IPR005670 This is a family of phosphate transport system permease proteins.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0015114 phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016020 membrane. Probab=32.22 E-value=24 Score=16.94 Aligned_cols=93 Identities=22% Similarity=0.260 Sum_probs=49.6 Q ss_pred HCCCCCC-CCHHHEEEEECCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH-CCC--CCHHH----HHHCCCHH-HHHHH Q ss_conf 1545677-700100001078678999999977765421-134567777511-386--31013----44206856-77898 Q 537021.9.peg.8 101 QAPNPFI-PLESAHKIIILPERSALKERIRRRFTQMLE-SGAIDEIRSLMK-MNL--SLDLP----IMKAIGVR-DIIAL 170 (217) Q Consensus 101 ~~~~~~~-~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~-~GlieEv~~L~~-~~~--~~~~~----~~~~IGYk-e~~~y 170 (217) |+|+|+. ..|+...+++...--.=++.++.+|++-|. ++|=||||.=+. .++ +-.-+ +.+||... |++ - T Consensus 89 Q~PNPFpmSIydNiayG~r~~G~~~K~~L~e~Ve~sL~~AALWDEVKD~L~~sa~~LSGGQQQRLCIARalA~ePeVl-L 167 (248) T TIGR00972 89 QKPNPFPMSIYDNIAYGPRLHGIKDKKELDEIVEESLKKAALWDEVKDRLHDSALGLSGGQQQRLCIARALAVEPEVL-L 167 (248) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCEE-C T ss_conf 478978840556754524521633778999999999861687135524213588978726889999998752488521-0 Q ss_pred HCCCC---------CHHHHHHHH--------HHHHHHHHHH Q ss_conf 67899---------989999999--------9999988888 Q 537021.9.peg.8 171 LKGEI---------NYDETLQRG--------IIATNKYAKR 194 (217) Q Consensus 171 L~g~~---------~~~e~~e~~--------~~~Tr~yAKR 194 (217) +|--. ..||++..+ .+|..|-|.| T Consensus 168 lDEPTSALDPIaT~~IEeLi~eLk~~YTivIVTHnMqQA~R 208 (248) T TIGR00972 168 LDEPTSALDPIATGKIEELIQELKKKYTIVIVTHNMQQAAR 208 (248) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHH T ss_conf 57888757877899999999987652979988177567899 No 57 >TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport. Probab=31.13 E-value=35 Score=15.84 Aligned_cols=17 Identities=29% Similarity=0.358 Sum_probs=7.0 Q ss_pred HHHHHHHCCCCEEEECCC Q ss_conf 899999639940897893 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGT 21 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGT 21 (217) .+.++|.+|+ |||||=| T Consensus 427 e~~~~h~~Gq-PvLvGT~ 443 (904) T TIGR00963 427 EIKEIHAKGQ-PVLVGTT 443 (904) T ss_pred HHHHHHHCCC-CEEEEEC T ss_conf 9999874689-8777522 No 58 >PRK13772 formimidoylglutamase; Provisional Probab=30.89 E-value=37 Score=15.63 Aligned_cols=23 Identities=13% Similarity=0.156 Sum_probs=16.7 Q ss_pred HHHHHCCCCEEEECCCCHHHHHH Q ss_conf 99996399408978932799999 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGLYFRAL 28 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTglY~~al 28 (217) -++++.|+.||+.||-+---.+- T Consensus 108 ~~ll~~g~~pivLGGdH~iA~g~ 130 (313) T PRK13772 108 AALLARGARPLVLGGGHEVAWGT 130 (313) T ss_pred HHHHHCCCEEEEECCCCHHHHHH T ss_conf 99997798579988853255654 No 59 >pfam11469 DUF3204 Protein of unknown function (DUF3204). This archaeal family of proteins has no known function. Probab=30.76 E-value=42 Score=15.31 Aligned_cols=21 Identities=29% Similarity=0.328 Sum_probs=15.3 Q ss_pred HHHHHCCCCCHHHHHHHHHHH Q ss_conf 789867899989999999999 Q 537021.9.peg.8 167 IIALLKGEINYDETLQRGIIA 187 (217) Q Consensus 167 ~~~yL~g~~~~~e~~e~~~~~ 187 (217) ...||+|.+|.+|++|.++.+ T Consensus 67 AyAWleg~is~eEavEil~~n 87 (120) T pfam11469 67 AYAWLKGSISCEEAVEILYTN 87 (120) T ss_pred HHHHHHCCCCHHHHHHHHHHC T ss_conf 999980535199999999964 No 60 >PRK01722 formimidoylglutamase; Provisional Probab=30.62 E-value=38 Score=15.55 Aligned_cols=27 Identities=15% Similarity=0.010 Sum_probs=19.5 Q ss_pred HHHHHCCCCEEEECCCCHHHHHHHCCC Q ss_conf 999963994089789327999996172 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGLYFRALTGQL 32 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTglY~~all~g~ 32 (217) -++++.|..||+.||-+---...+.|+ T Consensus 111 ~~ll~~g~~pIvLGGdHsia~g~~~gl 137 (320) T PRK01722 111 SHCLKANPRTIVLGGGHEIAFGSFKGV 137 (320) T ss_pred HHHHHCCCEEEEECCCCHHHHHHHHHH T ss_conf 999967986899847642577879999 No 61 >TIGR01929 menB naphthoate synthase; InterPro: IPR010198 This entry represents an enzyme, naphthoate synthase, MenB or dihydroxynaphthoic acid synthetase), which is involved in the fifth step of the menaquinone biosynthesis pathway. Menaquinone (vitamin K2), is an essential quinone used in electron-transfer pathways serving as the major electron carrier during anaerobic growth. In the bacterium, the biosynthetic pathway in bacteria typically comprises 6-7 steps . Together with o-succinylbenzoate-CoA ligase (menE IPR010192 from INTERPRO),MenB takes 2-succinylbenzoate and converts it into 1,4-di-hydroxy-2-naphthoic acid (DHNA) , , , . The conversion of o-succinylbenzoate-CoA ligase to DHNA is mediated by the enzyme DHNA synthase, which is encoded by menB . The protein structure of naphthoate synthase has been expressed in Escherichia coli, purified and crystallized, and found to have fold characteristics of the crotonase family of enzymes, which is notable for the presence of several highly flexible regions around the active site. The C-terminal region of the protein may play a critical role both in completion of the binding pocket and in stabilisation of the hexamer, suggesting a link between oligomerization and catalytic activity . Included in this entry are two enzymes from Arabidopsis thaliana and one from Staphylococcus aureus, which are identified as putative enoyl-CoA hydratase/isomerases. These enzymes group with the naphthoate synthases when building a tree and when doing BLAST searches.; GO: 0008935 naphthoate synthase activity, 0009234 menaquinone biosynthetic process. Probab=30.45 E-value=18 Score=17.64 Aligned_cols=87 Identities=18% Similarity=0.143 Sum_probs=47.7 Q ss_pred HHHCCHHHHCCCCCHHHHHHHHHHH---HHHHCC--------------CCH-H-H--HHHHCCCCCC-------CCHHHE Q ss_conf 5311623202357015699999999---985348--------------874-6-7--7541545677-------700100 Q 537021.9.peg.8 62 LSSLDSIVARQIHPSDGQRIARALE---IKLVSG--------------QSI-I-E--FWKQAPNPFI-------PLESAH 113 (217) Q Consensus 62 L~~~DP~~a~~i~~nd~~Ri~RaLE---i~~~tG--------------k~~-s-~--~~~~~~~~~~-------~~~~~~ 113 (217) -..+||..|.++|--|.+|+||.+= |+..+| -++ | | .+.++.++-- .-|-.. T Consensus 91 adtvdPara~~LnvL~~qr~IR~~PKpVia~V~Gya~GGGh~Lh~~CDLTiAsre~A~fgQtgpkVGSFDgGyGsaYlAR 170 (278) T TIGR01929 91 ADTVDPARASRLNVLEVQRLIRTLPKPVIAMVNGYAIGGGHVLHVVCDLTIASREHARFGQTGPKVGSFDGGYGSAYLAR 170 (278) T ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEECCEEECCCEEEHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 13336312465634655458641898168875375325762013323022653213401478877356656345789986 Q ss_pred EEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00107867899999997776542113456777751 Q 537021.9.peg.8 114 KIIILPERSALKERIRRRFTQMLESGAIDEIRSLM 148 (217) Q Consensus 114 ~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~ 148 (217) +++=.--||+-.-.=.--..+|.+.|+|+.|=.+- T Consensus 171 ~vGQKkaREiwFlcrqY~A~~a~~MG~VN~VV~~a 205 (278) T TIGR01929 171 IVGQKKAREIWFLCRQYDAEEAEKMGLVNAVVPLA 205 (278) T ss_pred HHCCCHHHHEEECCCCCCHHHHHHCCCCEEECCHH T ss_conf 40550001100015632568897379530100354 No 62 >TIGR01481 ccpA catabolite control protein A; InterPro: IPR006377 Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [e.g. acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.; GO: 0030528 transcription regulator activity, 0045449 regulation of transcription. Probab=29.55 E-value=40 Score=15.43 Aligned_cols=31 Identities=13% Similarity=0.398 Sum_probs=18.3 Q ss_pred HHHHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 99999639940897893279999961724488999899 Q 537021.9.peg.8 5 KLQRYKKRFLPIIVGGTGLYFRALTGQLSIMPEIPTAI 42 (217) Q Consensus 5 ~~e~~~r~kiPIiVGGTglY~~all~g~~~~p~~~~~i 42 (217) .=|+|+|-++|||+-|| .+.-..+|.|+=+. T Consensus 131 ~re~F~rs~~PVVLAGt-------~D~~~~~pSVNIDy 161 (332) T TIGR01481 131 LREEFSRSPVPVVLAGT-------VDKENELPSVNIDY 161 (332) T ss_pred HHHHHHCCCCCEEEECC-------CCCCCCCCCCCCCH T ss_conf 99997128998899535-------48654677310167 No 63 >PRK09875 putative hydrolase; Provisional Probab=27.19 E-value=25 Score=16.76 Aligned_cols=25 Identities=20% Similarity=0.312 Sum_probs=15.0 Q ss_pred CCCHHHHHHHHHHHH-HHHCCCCHHHH Q ss_conf 570156999999999-85348874677 Q 537021.9.peg.8 73 IHPSDGQRIARALEI-KLVSGQSIIEF 98 (217) Q Consensus 73 i~~nd~~Ri~RaLEi-~~~tGk~~s~~ 98 (217) +++. -.|+.||--. -..||.|++.. T Consensus 133 it~~-E~k~lrAaA~A~~~TG~pI~~H 158 (292) T PRK09875 133 ITPL-EEKVFIAAALAHNQTGRPISTH 158 (292) T ss_pred CCHH-HHHHHHHHHHHHHHHCCCEEEC T ss_conf 8989-9999999999999879976866 No 64 >PRK02190 agmatinase; Provisional Probab=27.07 E-value=40 Score=15.44 Aligned_cols=16 Identities=13% Similarity=0.110 Sum_probs=12.7 Q ss_pred HHHHCCCCEEEECCCC Q ss_conf 9996399408978932 Q 537021.9.peg.8 7 QRYKKRFLPIIVGGTG 22 (217) Q Consensus 7 e~~~r~kiPIiVGGTg 22 (217) ++++.|++||+.||-+ T Consensus 111 ~i~~~g~~pi~lGGDH 126 (305) T PRK02190 111 KILAAGKRMLTLGGDH 126 (305) T ss_pred HHHHCCCEEEEECCCC T ss_conf 9985698688857776 No 65 >PRK09195 gatY tagatose-bisphosphate aldolase; Reviewed Probab=27.00 E-value=52 Score=14.71 Aligned_cols=76 Identities=16% Similarity=0.114 Sum_probs=38.1 Q ss_pred HHHHHHHHHHH----HHHH-HHHHHHHHCCCCC-HH-HHHHCCCHHHHHHHHC---CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99977765421----1345-6777751138631-01-3442068567789867---899989999999999998888876 Q 537021.9.peg.8 127 RIRRRFTQMLE----SGAI-DEIRSLMKMNLSL-DL-PIMKAIGVRDIIALLK---GEINYDETLQRGIIATNKYAKRQK 196 (217) Q Consensus 127 rI~~R~~~Ml~----~Gli-eEv~~L~~~~~~~-~~-~~~~~IGYke~~~yL~---g~~~~~e~~e~~~~~Tr~yAKRQ~ 196 (217) .|..+++.=|- +|+= ++++..++.|+.+ |. +-++---.+-+.++|. ...+.-..+.....+-...+|+-+ T Consensus 196 ~I~~~~~vPLVLHGgSG~~~e~i~~ai~~Gv~KiNi~T~l~~a~~~~~r~~l~~~p~~~dp~~~~~~~~~a~~~~v~~~i 275 (284) T PRK09195 196 NIRQWVNIPLVLHGASGLPTKDIQQTIKLGICKVNVATELKIAFSQALKNYLTEHPEANDPRHYLQPAKAAMKDVVSKVI 275 (284) T ss_pred HHHHHHCCCEEEECCCCCCHHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99997499989878999899999999984976998685899999999999998687968869999999999999999999 Q ss_pred HHHCCC Q ss_conf 886288 Q 537021.9.peg.8 197 TWLCHQ 202 (217) Q Consensus 197 TWfr~~ 202 (217) .+|-.. T Consensus 276 ~~~GS~ 281 (284) T PRK09195 276 ADCGCE 281 (284) T ss_pred HHHCCC T ss_conf 996889 No 66 >PRK13774 formimidoylglutamase; Provisional Probab=26.83 E-value=49 Score=14.84 Aligned_cols=27 Identities=11% Similarity=-0.033 Sum_probs=19.3 Q ss_pred HHHHHHCCCCEEEECCCCHHHHHHHCC Q ss_conf 999996399408978932799999617 Q 537021.9.peg.8 5 KLQRYKKRFLPIIVGGTGLYFRALTGQ 31 (217) Q Consensus 5 ~~e~~~r~kiPIiVGGTglY~~all~g 31 (217) .-++++.|..||+.||-+---.+.+.| T Consensus 113 v~~~l~~g~~pivLGGdH~ia~~~~~g 139 (311) T PRK13774 113 AAKSIANHRQTFLLGGGHDIAYAQYLA 139 (311) T ss_pred HHHHHHCCCCEEEECCCCHHHHHHHHH T ss_conf 999997699528988860788898999 No 67 >PRK11230 glycolate oxidase subunit GlcD; Provisional Probab=26.64 E-value=52 Score=14.67 Aligned_cols=88 Identities=17% Similarity=0.153 Sum_probs=48.2 Q ss_pred HHHHHHHHHCCCCEEEE--CCCCHHHHHHH-C-C-------CCCCCCCCH----------HHHHHHHHHHHHHHHHHHHH Q ss_conf 46899999639940897--89327999996-1-7-------244889998----------99999999999622355566 Q 537021.9.peg.8 2 LSKKLQRYKKRFLPIIV--GGTGLYFRALT-G-Q-------LSIMPEIPT----------AIREKIREKLKQYGSHILHD 60 (217) Q Consensus 2 ~~k~~e~~~r~kiPIiV--GGTglY~~all-~-g-------~~~~p~~~~----------~ir~~~~~~~~~~g~~~l~~ 60 (217) +++.+..-++.++||+. ||||+-=.|+- . | ++.+-++|+ .+-..+++.++..|. T Consensus 68 V~~iv~~a~~~~ipvvprGggTgL~Gga~p~~~gIvl~l~rmn~Ileid~~~~~v~VeaGv~~~~l~~~l~~~Gl----- 142 (499) T PRK11230 68 VQALLAVCHRLRVPVVARGAGTGLSGGALPLEKGVLLVMARFNRILDIDPVGRRARVQPGVRNLAISQAAAPHGL----- 142 (499) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCCEEEEECCCCHHHHHHHHHHCCC----- T ss_conf 999999999879909997898322658126898299980012770688268997999458107999998986698----- Q ss_pred HHHHCCHHHHC----------------CCCCHHHHHHHHHHHHHHHCCCCH Q ss_conf 55311623202----------------357015699999999985348874 Q 537021.9.peg.8 61 ELSSLDSIVAR----------------QIHPSDGQRIARALEIKLVSGQSI 95 (217) Q Consensus 61 ~L~~~DP~~a~----------------~i~~nd~~Ri~RaLEi~~~tGk~~ 95 (217) .--.||-+.. .+.-.+.+-.+.+|||...+|..+ T Consensus 143 -~~p~dP~S~~~~tIGG~iAtNAgG~~~~rYG~trd~VlglevVlpdG~v~ 192 (499) T PRK11230 143 -YYAPDPSSQIACSIGGNVAENAGGVHCLKYGLTVHNLLKVEILTLDGERL 192 (499) T ss_pred -CCCCCCCCCCCCCCCEEEECCCCCCCEEEECCHHHHHHCCEEEECCCCEE T ss_conf -54889872335525307730787863033156899751043893688621 No 68 >TIGR02133 RPI_actino ribose 5-phosphate isomerase; InterPro: IPR011860 Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes) . This entry represents the only candidates for ribose 5-phosphate isomerase in Actinobacteria.. Probab=26.35 E-value=25 Score=16.71 Aligned_cols=16 Identities=25% Similarity=0.351 Sum_probs=10.3 Q ss_pred CCEEEECCCCHHHHHH Q ss_conf 9408978932799999 Q 537021.9.peg.8 13 FLPIIVGGTGLYFRAL 28 (217) Q Consensus 13 kiPIiVGGTglY~~al 28 (217) ...|||||||+=.+-- T Consensus 66 ~~g~vigGsG~G~~IA 81 (154) T TIGR02133 66 DLGLVIGGSGNGEQIA 81 (154) T ss_pred CCEEEECCCCCHHHHH T ss_conf 2435771478602221 No 69 >PRK05699 fliP flagellar biosynthesis protein FliP; Reviewed Probab=26.32 E-value=53 Score=14.63 Aligned_cols=29 Identities=24% Similarity=0.417 Sum_probs=21.6 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 78986789998999999999999888887 Q 537021.9.peg.8 167 IIALLKGEINYDETLQRGIIATNKYAKRQ 195 (217) Q Consensus 167 ~~~yL~g~~~~~e~~e~~~~~Tr~yAKRQ 195 (217) ..+|++|+++.+++.++...--|.|-.|| T Consensus 113 ~~P~~~~~i~~~~a~~~~~~Plr~FM~~~ 141 (245) T PRK05699 113 IQPYLDEKITLEEALERAAAPLREFMLKQ 141 (245) T ss_pred HCHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 29020378789999999889999999985 No 70 >KOG2553 consensus Probab=26.03 E-value=49 Score=14.85 Aligned_cols=40 Identities=30% Similarity=0.501 Sum_probs=22.9 Q ss_pred HHCCCCEEEEC--CCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 96399408978--932799999617244889998999999999996223 Q 537021.9.peg.8 9 YKKRFLPIIVG--GTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGS 55 (217) Q Consensus 9 ~~r~kiPIiVG--GTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~ 55 (217) ..+-|+.|+|| |||+| |+..-|+. +.|..++.+-+...|. T Consensus 34 ~~KrKVail~gY~G~gY~------GmQ~N~~~-kTIEgeL~~al~~aGa 75 (416) T KOG2553 34 PRKRKVAILLGYCGTGYH------GMQYNPPL-KTIEGELFEALFKAGA 75 (416) T ss_pred CCCEEEEEEEEECCCCCC------CEECCCCC-CCHHHHHHHHHHHCCC T ss_conf 553179999986367753------21218998-7347899999997297 No 71 >COG1700 Uncharacterized conserved protein [Function unknown] Probab=25.70 E-value=54 Score=14.56 Aligned_cols=42 Identities=21% Similarity=0.363 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHH-HHHCCCHHHHHHHH Q ss_conf 997776542113456777751138631013-44206856778986 Q 537021.9.peg.8 128 IRRRFTQMLESGAIDEIRSLMKMNLSLDLP-IMKAIGVRDIIALL 171 (217) Q Consensus 128 I~~R~~~Ml~~GlieEv~~L~~~~~~~~~~-~~~~IGYke~~~yL 171 (217) |-.-+.-.+.+||..||-.|-- ...+++ .++.=||||++.|- T Consensus 257 i~eev~~lls~~~~~eVG~l~v--fP~nSQvL~K~eGYRell~~y 299 (503) T COG1700 257 ILEEVNLLLSAGWLMEVGELQV--FPLNSQVLQKREGYRELLRYY 299 (503) T ss_pred HHHHHHHHHCCCHHHHHCCEEE--ECCCHHHHHHCCCHHHHHHHH T ss_conf 9999988750547876154545--206148876200399999999 No 72 >TIGR00110 ilvD dihydroxy-acid dehydratase; InterPro: IPR004404 Two dehydratases, dihydroxy-acid dehydratase (gene ilvD or ILV3) and 6-phosphogluconate dehydratase (gene edd) have been shown to be evolutionary related . Dihydroxy-acid dehydratase catalyzes the fourth step in the biosynthesis of isoleucine and valine, the dehydratation of 2,3-dihydroxy-isovaleic acid into alpha-ketoisovaleric acid. 6-Phosphogluconate dehydratase catalyzes the first step in the Entner-Doudoroff pathway, the dehydratation of 6-phospho-D-gluconate into 6-phospho-2-dehydro-3-deoxy-D-gluconate. Another protein containing this signature is the Escherichia coli hypothetical protein yjhG. The N-terminal part of the proteins contains a cysteine that could be involved in the binding of a 2Fe-2S iron-sulphur cluster . This family represents dihydroxy-acid dehydratase (DAD). It contains a catalytically essential [4Fe-4S] cluster and catalyses the fourth step in valine and isoleucine biosynthesis.; GO: 0004160 dihydroxy-acid dehydratase activity, 0009082 branched chain family amino acid biosynthetic process. Probab=24.85 E-value=29 Score=16.34 Aligned_cols=20 Identities=35% Similarity=0.313 Sum_probs=13.2 Q ss_pred HCCCCEEE-E-CCCCHHHHHHH Q ss_conf 63994089-7-89327999996 Q 537021.9.peg.8 10 KKRFLPII-V-GGTGLYFRALT 29 (217) Q Consensus 10 ~r~kiPIi-V-GGTglY~~all 29 (217) -|-.||.+ | |||+.|=+.-. T Consensus 118 ~RlNiP~I~V~GGPm~pG~~~~ 139 (601) T TIGR00110 118 LRLNIPSIFVTGGPMLPGRTKL 139 (601) T ss_pred HHCCCCEEEEECCCCCCCCCCC T ss_conf 9718988997288787663236 No 73 >pfam10007 DUF2250 Uncharacterized protein conserved in archaea (DUF2250). Members of this family of hypothetical archaeal proteins have no known function. Probab=24.61 E-value=35 Score=15.85 Aligned_cols=21 Identities=14% Similarity=0.381 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999777654211345677775 Q 537021.9.peg.8 127 RIRRRFTQMLESGAIDEIRSL 147 (217) Q Consensus 127 rI~~R~~~Ml~~GlieEv~~L 147 (217) .+..+++.+.+.||+|+|.+= T Consensus 38 ev~~~l~~L~~~GLlEr~~g~ 58 (93) T pfam10007 38 EVRDLLDKLEEMGLLERVTGS 58 (93) T ss_pred HHHHHHHHHHHCCCEEEHHHH T ss_conf 999999999987680010014 No 74 >TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=24.18 E-value=58 Score=14.37 Aligned_cols=18 Identities=6% Similarity=0.348 Sum_probs=8.1 Q ss_pred HHHHHHCCCHHHHHHHHC Q ss_conf 013442068567789867 Q 537021.9.peg.8 155 DLPIMKAIGVRDIIALLK 172 (217) Q Consensus 155 ~~~~~~~IGYke~~~yL~ 172 (217) +..-+.-+.|+|+.+-+. T Consensus 158 ~A~~~~~l~~~eL~~i~~ 175 (232) T TIGR02076 158 DAKKFDKLTPEELVEIVR 175 (232) T ss_pred CCCCCCCCCHHHHHHHHC T ss_conf 840002589889999860 No 75 >pfam05583 consensus Probab=24.06 E-value=38 Score=15.55 Aligned_cols=29 Identities=24% Similarity=0.384 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHCCCHHHH Q ss_conf 34567777511386310134420685677 Q 537021.9.peg.8 139 GAIDEIRSLMKMNLSLDLPIMKAIGVRDI 167 (217) Q Consensus 139 GlieEv~~L~~~~~~~~~~~~~~IGYke~ 167 (217) .|+.||+.+++.|.+++++..+++..+-. T Consensus 4 ~Lva~vr~~m~~g~~p~s~~~q~LA~rW~ 32 (76) T pfam05583 4 ALVAEVRQAMDAGVPPDSPRAQALAARWM 32 (76) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 89999999998799999999999999999 No 76 >TIGR02911 sulfite_red_B sulfite reductase, subunit B; InterPro: IPR014260 This entry represents subunit B, one of the three subunits of the anaerobic sulphite reductase of Salmonella, and close homologues from various Clostridium species, where the three-gene neighbourhood is preserved. Two such gene clusters are found in Clostridium perfringens, but it may be that these sets of genes correspond to the distinct assimilatory and dissimilatory forms seen in Clostridium pasteurianum. Note that any one of these enzymes may have secondary substrates such as NH2OH, SeO3(2-), and SO3(2-). Heterologous expression of the anaerobic sulphite reductase of Salmonella confers on Escherichia coli the ability to produce hydrogen sulphide gas from sulphite.. Probab=24.04 E-value=36 Score=15.70 Aligned_cols=117 Identities=16% Similarity=0.229 Sum_probs=54.8 Q ss_pred HHHHHCCCCEEEECCCCHH-HHHHHCCCCCCCCCCHHHH-----------HHHHHHHHHHHHHHHHHHHHHCCHHHHCCC Q ss_conf 9999639940897893279-9999617244889998999-----------999999996223555665531162320235 Q 537021.9.peg.8 6 LQRYKKRFLPIIVGGTGLY-FRALTGQLSIMPEIPTAIR-----------EKIREKLKQYGSHILHDELSSLDSIVARQI 73 (217) Q Consensus 6 ~e~~~r~kiPIiVGGTglY-~~all~g~~~~p~~~~~ir-----------~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i 73 (217) ++.++..-+-|+.||||.= ++.++.=++.-|+ ++. +.+--.-+-..|+.-++....+|.--+. T Consensus 94 vd~yk~Kel~vvAGGTGvaPVkG~~~yF~~N~~---e~k~l~~i~GfK~~~~iLfk~d~~~W~~~~n~~lT~D~ae~d-- 168 (261) T TIGR02911 94 VDNYKDKELVVVAGGTGVAPVKGVVEYFVKNPK---EIKSLNLILGFKTPDDILFKEDIAEWKENINLVLTLDEAEED-- 168 (261) T ss_pred HHHCCCCCEEEEECCCCCCHHHHHHHHHHHCHH---HCCEEEEEEECCCCCCCCCHHHHHHHHHCCCEEEEEECCCCC-- T ss_conf 233068617998279776603477777530601---103477997225711100077899887226779998336777-- Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 701569999999998534887467754154567770010000107867899999997776542113456777751 Q 537021.9.peg.8 74 HPSDGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLM 148 (217) Q Consensus 74 ~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~ 148 (217) +.-+.=+|-. +..+ -...+......+++-||-- +.--+..+|+.|+-+|--++- T Consensus 169 ~~~~~G~vT~--------------~~~~--l~l~~~~~~~~ivVGPPiM-----~~F~v~~lL~~G~K~E~iWv~ 222 (261) T TIGR02911 169 YKGNIGLVTK--------------YIPE--LKLKDIEEVQAIVVGPPIM-----MKFTVQELLKKGLKEENIWVS 222 (261) T ss_pred CCCCEEEEEC--------------CCCC--CCCCCCCEEEEEEECCCHH-----HHHHHHHHHHHCCCCCCEEEC T ss_conf 6533301020--------------0023--5412333257999779846-----678899998706860111542 No 77 >KOG1436 consensus Probab=23.68 E-value=37 Score=15.63 Aligned_cols=16 Identities=6% Similarity=0.227 Sum_probs=6.7 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9899999999999622 Q 537021.9.peg.8 39 PTAIREKIREKLKQYG 54 (217) Q Consensus 39 ~~~ir~~~~~~~~~~g 54 (217) |++.-.++.-.+...| T Consensus 56 d~E~sHrlAv~aas~g 71 (398) T KOG1436 56 DPEFSHRLAVLAASWG 71 (398) T ss_pred CHHHHHHHHHHHHHHC T ss_conf 9777779999997717 No 78 >TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase; InterPro: IPR002480 Members of the 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthetase family (2.5.1.54 from EC) catalyse the first step in aromatic amino acid biosynthesis from chorismate. Class I (see IPR006218 from INTERPRO) includes bacterial and yeast enzymes; class II includes higher plants and various microorganisms . The first step in the common pathway leading to the biosynthesis of aromatic compounds is the stereospecific condensation of phosphoenolpyruvate (PEP) and D-erythrose-4-phosphate (E4P) giving rise to 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP). This reaction is catalyzed by DAHP synthase, a metal-activated enzyme, which in microorganisms is the target for negative-feedback regulation by pathway intermediates or by end products.; GO: 0003849 3-deoxy-7-phosphoheptulonate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=23.51 E-value=44 Score=15.17 Aligned_cols=63 Identities=14% Similarity=0.173 Sum_probs=36.8 Q ss_pred HHHHCCHHHHCCCCCHHHHHHHHHHHHHH---------HCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHH Q ss_conf 55311623202357015699999999985---------348874677541545677700100001078678999999977 Q 537021.9.peg.8 61 ELSSLDSIVARQIHPSDGQRIARALEIKL---------VSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRR 131 (217) Q Consensus 61 ~L~~~DP~~a~~i~~nd~~Ri~RaLEi~~---------~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R 131 (217) ...--|+..|++++ |+.|++||+.--- .||- +.+++ +...|. +-|+=..|--.-|+.+..+ T Consensus 133 ~iNG~~F~eaaR~P--DP~RL~~AY~~saatLNllRAl~~GG-yAdL~--~vh~WN-----~~FV~~SP~G~RY~~la~~ 202 (450) T TIGR01358 133 IINGPAFTEAARVP--DPERLVRAYHQSAATLNLLRALTSGG-YADLH--QVHEWN-----LEFVRSSPAGARYEKLAEE 202 (450) T ss_pred CCCCCCCCHHHCCC--CHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHC--CHHHHH-----HHHHCCCCHHHHHHHHHHH T ss_conf 11688888544679--85799999999999999999985221-46301--235653-----4552388435789999998 Q ss_pred HH Q ss_conf 76 Q 537021.9.peg.8 132 FT 133 (217) Q Consensus 132 ~~ 133 (217) ++ T Consensus 203 I~ 204 (450) T TIGR01358 203 ID 204 (450) T ss_pred HH T ss_conf 86 No 79 >PRK08213 gluconate 5-dehydrogenase; Provisional Probab=23.39 E-value=48 Score=14.93 Aligned_cols=26 Identities=12% Similarity=0.172 Sum_probs=18.6 Q ss_pred HHHHHHH-CCCCEEEECCCCHHHHHHH Q ss_conf 8999996-3994089789327999996 Q 537021.9.peg.8 4 KKLQRYK-KRFLPIIVGGTGLYFRALT 29 (217) Q Consensus 4 k~~e~~~-r~kiPIiVGGTglY~~all 29 (217) ...+.|+ +||+.||+||++=-=.++- T Consensus 3 ~i~~lf~L~gKvalVTG~s~GIG~aia 29 (259) T PRK08213 3 TVLELFDLTGKTALVTGGSRGLGLQIA 29 (259) T ss_pred CHHHHHCCCCCEEEEECCCCHHHHHHH T ss_conf 767873999998999487768999999 No 80 >PRK08931 5'-methylthioadenosine phosphorylase; Provisional Probab=23.06 E-value=27 Score=16.56 Aligned_cols=16 Identities=6% Similarity=0.167 Sum_probs=10.6 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 9998999999999999 Q 537021.9.peg.8 174 EINYDETLQRGIIATN 189 (217) Q Consensus 174 ~~~~~e~~e~~~~~Tr 189 (217) ..|.++.++.+..++- T Consensus 219 ~v~~~~v~~~~~~n~~ 234 (289) T PRK08931 219 AVTVDAVIAVLLANAD 234 (289) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 8389999999999999 No 81 >PRK00440 rfc replication factor C small subunit; Reviewed Probab=22.71 E-value=62 Score=14.19 Aligned_cols=59 Identities=19% Similarity=0.310 Sum_probs=24.9 Q ss_pred HCCCCCHHHHHHHHHHHHHHHCCCCHHH--HHHHCCCCCCCCHHHEEEEECC-CHHHHHHHHHH Q ss_conf 0235701569999999998534887467--7541545677700100001078-67899999997 Q 537021.9.peg.8 70 ARQIHPSDGQRIARALEIKLVSGQSIIE--FWKQAPNPFIPLESAHKIIILP-ERSALKERIRR 130 (217) Q Consensus 70 a~~i~~nd~~Ri~RaLEi~~~tGk~~s~--~~~~~~~~~~~~~~~~~i~l~~-~r~~L~~rI~~ 130 (217) |..+......-+.|.+|-+-.+-+-+-- ...+-.++. ...|..|-+.+ +.+.+.+++.. T Consensus 110 ~d~l~~~aq~aL~~~mE~~~~~~~fil~~n~~~kii~~i--~SRc~~i~f~~~~~~~i~~~L~~ 171 (318) T PRK00440 110 ADNLTSDAQQALRRTMEMYSQTTRFILSCNYSSKIIDPI--QSRCAVFRFSPLPKEAVIERLRY 171 (318) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCH--HHHHEEEECCCCCHHHHHHHHHH T ss_conf 553225567888764310566625886348833376155--65510111578999999999999 No 82 >cd05637 SIS_PGI_PMI_2 The members of this protein family contain the SIS (Sugar ISomerase) domain and have both the phosphoglucose isomerase (PGI) and the phosphomannose isomerase (PMI) functions. These functions catalyze the reversible reactions of glucose 6-phosphate to fructose 6-phosphate, and mannose 6-phosphate to fructose 6-phosphate, respectively at an equal rate. This protein contains two SIS domains. This alignment is based on the second SIS domain. Probab=22.55 E-value=62 Score=14.17 Aligned_cols=23 Identities=30% Similarity=0.199 Sum_probs=15.1 Q ss_pred HHHHHHHHCCCCEEEECCCCHHH Q ss_conf 68999996399408978932799 Q 537021.9.peg.8 3 SKKLQRYKKRFLPIIVGGTGLYF 25 (217) Q Consensus 3 ~k~~e~~~r~kiPIiVGGTglY~ 25 (217) .|.+-..-.|++||+.|.+.|.- T Consensus 3 Ak~LA~~l~~~iPvIy~~~~~~~ 25 (132) T cd05637 3 AKELALELAGRIPIIYGSTLYEP 25 (132) T ss_pred HHHHHHHHCCCCCEEEECCCCHH T ss_conf 89999997089878997863289 No 83 >PRK07823 5'-methylthioadenosine phosphorylase; Validated Probab=22.19 E-value=31 Score=16.12 Aligned_cols=25 Identities=0% Similarity=-0.087 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHH---HHHHHHHHH Q ss_conf 9989999999999998---888876886 Q 537021.9.peg.8 175 INYDETLQRGIIATNK---YAKRQKTWL 199 (217) Q Consensus 175 ~~~~e~~e~~~~~Tr~---yAKRQ~TWf 199 (217) +|.++.++.++.+... +.+.=+..+ T Consensus 215 vt~~eV~~~~~~n~~~~~~ll~~~i~~i 242 (264) T PRK07823 215 VKAVDVFAEFGRNIERLKKLVRDAIAAV 242 (264) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 5899999999998999999999999847 No 84 >COG0731 Fe-S oxidoreductases [Energy production and conversion] Probab=22.11 E-value=64 Score=14.11 Aligned_cols=67 Identities=19% Similarity=0.227 Sum_probs=38.4 Q ss_pred HHHHHHH-HHCCCCE--EEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCC--- Q ss_conf 4689999-9639940--897893279999961724488999899999999999622355566553116232023570--- Q 537021.9.peg.8 2 LSKKLQR-YKKRFLP--IIVGGTGLYFRALTGQLSIMPEIPTAIREKIREKLKQYGSHILHDELSSLDSIVARQIHP--- 75 (217) Q Consensus 2 ~~k~~e~-~~r~kiP--IiVGGTglY~~all~g~~~~p~~~~~ir~~~~~~~~~~g~~~l~~~L~~~DP~~a~~i~~--- 75 (217) +.+.|+. +++|++| ||+-||. +++++++. -.+.+|-.|...|.+...+|+. T Consensus 97 L~elI~~~k~~g~~~tflvTNgsl-----------------pdv~~~L~------~~dql~~sLdA~~~~~~~~InRP~~ 153 (296) T COG0731 97 LGELIEEIKKRGKKTTFLVTNGSL-----------------PDVLEELK------LPDQLYVSLDAPDEKTFRRINRPHK 153 (296) T ss_pred HHHHHHHHHHCCCCEEEEEECCCH-----------------HHHHHHHC------CCCEEEEEECCCCHHHHHHHCCCCC T ss_conf 899999998607950899938976-----------------99998740------5887999814688889998348887 Q ss_pred -HHHHHHHHHHHHHHHC Q ss_conf -1569999999998534 Q 537021.9.peg.8 76 -SDGQRIARALEIKLVS 91 (217) Q Consensus 76 -nd~~Ri~RaLEi~~~t 91 (217) .++.+|++.|+++... T Consensus 154 ~~~~e~ile~L~~~~~~ 170 (296) T COG0731 154 KDSWEKILEGLEIFRSE 170 (296) T ss_pred CCHHHHHHHHHHHHHHC T ss_conf 45299999999974012 No 85 >pfam04665 Pox_A32 Poxvirus A32 protein. The A32 protein is thought to be involved in viral DNA packaging. Probab=21.94 E-value=48 Score=14.92 Aligned_cols=16 Identities=44% Similarity=0.771 Sum_probs=12.6 Q ss_pred EEECCCC----HHHHHHHCC Q ss_conf 8978932----799999617 Q 537021.9.peg.8 16 IIVGGTG----LYFRALTGQ 31 (217) Q Consensus 16 IiVGGTg----lY~~all~g 31 (217) .+||||| .|+.+|+.. T Consensus 17 aivGgSGSGKT~yLlsLf~t 36 (241) T pfam04665 17 AIVGGSGSGKTTYLLSLLRT 36 (241) T ss_pred EEEECCCCCHHHHHHHHHHH T ss_conf 99815887566999999999 No 86 >KOG0331 consensus Probab=21.90 E-value=44 Score=15.17 Aligned_cols=64 Identities=14% Similarity=0.271 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 5699999999985348874677541545677700100001078678999999977765421134567777511386310 Q 537021.9.peg.8 77 DGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAIDEIRSLMKMNLSLD 155 (217) Q Consensus 77 d~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~GlieEv~~L~~~~~~~~ 155 (217) -.+.+.|..+|.-.|.-.+-++.....- +...+...+|+- +|.||+.|+...++.++..=..+. T Consensus 208 Q~~~l~~gvdiviaTPGRl~d~le~g~~---~l~~v~ylVLDE------------ADrMldmGFe~qI~~Il~~i~~~~ 271 (519) T KOG0331 208 QLRDLERGVDVVIATPGRLIDLLEEGSL---NLSRVTYLVLDE------------ADRMLDMGFEPQIRKILSQIPRPD 271 (519) T ss_pred HHHHHHCCCCEEEECCHHHHHHHHCCCC---CCCCEEEEEECC------------HHHHHCCCCHHHHHHHHHHCCCCC T ss_conf 8898755981898077178999974885---645303999634------------776631353799999987558975 No 87 >pfam05202 Flp_C Recombinase Flp protein. Probab=21.70 E-value=39 Score=15.49 Aligned_cols=29 Identities=28% Similarity=0.228 Sum_probs=25.4 Q ss_pred HHHHHHHCCHHHHCCCCCHHHHHHHHHHH Q ss_conf 56655311623202357015699999999 Q 537021.9.peg.8 58 LHDELSSLDSIVARQIHPSDGQRIARALE 86 (217) Q Consensus 58 l~~~L~~~DP~~a~~i~~nd~~Ri~RaLE 86 (217) -+.-|+..||.+.+-++..-.-||+||+- T Consensus 55 R~sDLkN~DP~TFeiv~nk~lG~ilra~V 83 (244) T pfam05202 55 RHSDLKNADPKTFEIVKNKFLGRILRALV 83 (244) T ss_pred CCHHCCCCCCCCEEEECCCHHHHHHHHHC T ss_conf 10001458943157624520545776420 No 88 >PRK07718 fliL flagellar basal body-associated protein FliL; Reviewed Probab=21.70 E-value=65 Score=14.06 Aligned_cols=64 Identities=9% Similarity=0.217 Sum_probs=40.6 Q ss_pred HHHCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHEEEEECCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 53116232023570156999999999853488746775415456777001000010786789999999777654211345 Q 537021.9.peg.8 62 LSSLDSIVARQIHPSDGQRIARALEIKLVSGQSIIEFWKQAPNPFIPLESAHKIIILPERSALKERIRRRFTQMLESGAI 141 (217) Q Consensus 62 L~~~DP~~a~~i~~nd~~Ri~RaLEi~~~tGk~~s~~~~~~~~~~~~~~~~~~i~l~~~r~~L~~rI~~R~~~Ml~~Gli 141 (217) |.--|+.....+...++ .+|..=+-..++++..++.. ...++.|.+.|..|+..++..|=+ T Consensus 71 le~~~~~~~~El~~r~p--qIrD~Ii~~Ls~kt~~dl~~-----------------~~Gk~~Lk~ei~~~iN~~l~~G~V 131 (142) T PRK07718 71 IETDSKKAKEELEKRDF--QVKDIIISELAEMNAEDLEG-----------------KKGLEKLKEQLKEKINNLMQEGKV 131 (142) T ss_pred EEECCHHHHHHHHHCCH--HHHHHHHHHHHCCCHHHHCC-----------------HHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 99889889977665142--56999999997499979459-----------------778999999999999864678946 Q ss_pred HHH Q ss_conf 677 Q 537021.9.peg.8 142 DEI 144 (217) Q Consensus 142 eEv 144 (217) ..| T Consensus 132 ~~V 134 (142) T PRK07718 132 KKV 134 (142) T ss_pred EEE T ss_conf 999 No 89 >pfam11451 DUF3202 Protein of unknown function (DUF3202). This archaeal family of proteins has no known function. Probab=21.06 E-value=61 Score=14.24 Aligned_cols=24 Identities=25% Similarity=0.540 Sum_probs=20.4 Q ss_pred CHHHHHHHHHCCCCEEEECCCCHH Q ss_conf 946899999639940897893279 Q 537021.9.peg.8 1 MLSKKLQRYKKRFLPIIVGGTGLY 24 (217) Q Consensus 1 ~~~k~~e~~~r~kiPIiVGGTglY 24 (217) ++.|.+|.|+-.|+.+-|||-.-+ T Consensus 3 LLdktL~~WKg~kvAv~vg~ehSF 26 (67) T pfam11451 3 LLDKTLEEWKGHKVAVGIGGDHSF 26 (67) T ss_pred HHHHHHHHHCCCEEEEEECCCCEE T ss_conf 688899984796799996685214 No 90 >TIGR03589 PseB UDP-N-acetylglucosamine 4,6-dehydratase. This enzyme catalyzes the first step in the biosynthesis of pseudaminic acid, the conversion of UDP-N-acetylglucosamine to UDP-4-keto-6-deoxy-N-acetylglucosamine. These sequences are members of the broader pfam01073 (3-beta hydroxysteroid dehydrogenase/isomerase family) family. Probab=21.01 E-value=51 Score=14.75 Aligned_cols=19 Identities=32% Similarity=0.466 Sum_probs=13.9 Q ss_pred CCCCEEEECCCCHHHHHHH Q ss_conf 3994089789327999996 Q 537021.9.peg.8 11 KRFLPIIVGGTGLYFRALT 29 (217) Q Consensus 11 r~kiPIiVGGTglY~~all 29 (217) +||.-+|+||+|+-=..|+ T Consensus 3 ~~K~ILVTGGaGfIGS~lv 21 (324) T TIGR03589 3 NNKSILITGGTGSFGKAFI 21 (324) T ss_pred CCCEEEEECCCCHHHHHHH T ss_conf 9399999079779999999 No 91 >PRK08185 hypothetical protein; Provisional Probab=20.99 E-value=67 Score=13.97 Aligned_cols=79 Identities=16% Similarity=0.119 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHH----HHHHHH-HHHHHHCCC-----CCHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 999977765421----134567-777511386-----3101344206856778986789998999999999999888887 Q 537021.9.peg.8 126 ERIRRRFTQMLE----SGAIDE-IRSLMKMNL-----SLDLPIMKAIGVRDIIALLKGEINYDETLQRGIIATNKYAKRQ 195 (217) Q Consensus 126 ~rI~~R~~~Ml~----~GlieE-v~~L~~~~~-----~~~~~~~~~IGYke~~~yL~g~~~~~e~~e~~~~~Tr~yAKRQ 195 (217) +.|..+++.=|- +|+-+| .+..++.|+ +.++...-.-+.++.+.-=.+..+....+.....+-.+-+++. T Consensus 191 ~~I~~~~~vPLVLHGgSG~~~e~i~~ai~~Gv~KiNi~T~l~~a~~~~~~~~l~~~p~~~d~~~~~~~~~~a~~~~v~~k 270 (283) T PRK08185 191 KEINERVDIPLVLHGGSANPDAEIAESVTLGVGKINISSDMKYAFFQKVREILVANPSWWEPNVIYPDAINAAKEVVRHK 270 (283) T ss_pred HHHHHHCCCCEEEECCCCCCHHHHHHHHHCCEEEEEECHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99998649998975899999999999998693799848679999999999999859775787999899999999999999 Q ss_pred HHHHCCCCC Q ss_conf 688628887 Q 537021.9.peg.8 196 KTWLCHQFQ 204 (217) Q Consensus 196 ~TWfr~~~~ 204 (217) +.+|...-. T Consensus 271 i~~fGS~gK 279 (283) T PRK08185 271 MKLFNSTGK 279 (283) T ss_pred HHHHCCCCC T ss_conf 999699878 No 92 >PRK09401 reverse gyrase; Reviewed Probab=20.85 E-value=50 Score=14.77 Aligned_cols=32 Identities=19% Similarity=0.527 Sum_probs=21.0 Q ss_pred HHHHHHHCCCCEEEECCCCHHHHHHHCCCCCCCC Q ss_conf 8999996399408978932799999617244889 Q 537021.9.peg.8 4 KKLQRYKKRFLPIIVGGTGLYFRALTGQLSIMPE 37 (217) Q Consensus 4 k~~e~~~r~kiPIiVGGTglY~~all~g~~~~p~ 37 (217) +.+|.|++|++.++||=+.+| ..|+-|++ +|. T Consensus 366 ~~le~f~~Ge~dvLvG~asyy-g~lvRGiD-lP~ 397 (1176) T PRK09401 366 EFLEKFEEGEIDVLIGVASYY-GVLVRGID-LPE 397 (1176) T ss_pred HHHHHHHCCCCCEEEEECCCC-CCCCCCCC-CCC T ss_conf 688897578864899970124-52101577-641 No 93 >pfam02126 PTE Phosphotriesterase family. Probab=20.77 E-value=42 Score=15.32 Aligned_cols=27 Identities=15% Similarity=0.288 Sum_probs=15.5 Q ss_pred CCCCCHHHHHHHHHHHH-HHHCCCCHHHH Q ss_conf 23570156999999999-85348874677 Q 537021.9.peg.8 71 RQIHPSDGQRIARALEI-KLVSGQSIIEF 98 (217) Q Consensus 71 ~~i~~nd~~Ri~RaLEi-~~~tGk~~s~~ 98 (217) ..+++... |+.||--. -..||-|++.. T Consensus 134 ~~it~~E~-k~lrAaA~A~~~TG~pI~~H 161 (308) T pfam02126 134 WKLTPSEE-KVLEAAAHAHAQTGCPVSTH 161 (308) T ss_pred CCCCHHHH-HHHHHHHHHHHHHCCCEEEC T ss_conf 99987899-99999999999769937970 No 94 >KOG2373 consensus Probab=20.01 E-value=43 Score=15.24 Aligned_cols=21 Identities=33% Similarity=0.471 Sum_probs=11.6 Q ss_pred HHHHHHHHHCCCCEEEECCCC Q ss_conf 468999996399408978932 Q 537021.9.peg.8 2 LSKKLQRYKKRFLPIIVGGTG 22 (217) Q Consensus 2 ~~k~~e~~~r~kiPIiVGGTg 22 (217) ++|.+.-|.+|-+-|++|||| T Consensus 263 LNk~LkGhR~GElTvlTGpTG 283 (514) T KOG2373 263 LNKYLKGHRPGELTVLTGPTG 283 (514) T ss_pred HHHHHCCCCCCCEEEEECCCC T ss_conf 899853678772699856888 Done!