Query gi|254781017|ref|YP_003065430.1| GHMP kinase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 324 No_of_seqs 133 out of 2174 Neff 8.7 Searched_HMMs 39220 Date Mon May 30 03:13:51 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781017.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK03926 mevalonate kinase; Pr 100.0 0 0 442.0 33.3 299 7-324 2-301 (302) 2 PTZ00298 mevalonate kinase; Pr 100.0 0 0 431.6 30.4 306 11-321 14-328 (328) 3 TIGR00549 mevalon_kin mevalona 100.0 0 0 438.3 24.3 271 12-287 1-345 (345) 4 COG1577 ERG12 Mevalonate kinas 100.0 0 0 416.3 27.0 298 8-314 1-302 (307) 5 PRK03817 galactokinase; Provis 100.0 0 0 393.6 32.1 314 8-324 1-351 (351) 6 PRK05322 galactokinase; Provis 100.0 0 0 386.2 30.7 319 2-322 15-386 (387) 7 PRK00555 galactokinase; Provis 100.0 0 0 381.8 31.6 316 6-324 1-363 (363) 8 KOG1511 consensus 100.0 0 0 381.5 28.9 313 5-322 1-379 (397) 9 PRK05101 galactokinase; Provis 100.0 0 0 369.4 30.6 319 2-322 16-381 (382) 10 COG0153 GalK Galactokinase [Ca 100.0 0 0 364.8 32.8 320 4-324 20-390 (390) 11 PTZ00290 galactokinase; Provis 100.0 0 0 339.5 29.5 321 2-323 27-456 (468) 12 TIGR01220 Pmev_kin_Gr_pos phos 100.0 0 0 332.9 19.1 305 8-319 1-366 (367) 13 TIGR00131 gal_kin galactokinas 100.0 0 0 315.2 22.3 299 5-303 15-472 (500) 14 COG2605 Predicted kinase relat 100.0 7.6E-42 0 284.0 29.4 298 9-316 3-322 (333) 15 PRK13412 fkp bifunctional fuco 100.0 7.2E-33 1.8E-37 227.2 25.8 302 5-316 606-964 (974) 16 KOG0631 consensus 100.0 4.5E-32 1.1E-36 222.1 22.4 303 7-310 39-464 (489) 17 COG3890 ERG8 Phosphomevalonate 100.0 1.2E-28 2.9E-33 200.6 24.7 302 5-322 1-332 (337) 18 PRK01212 homoserine kinase; Pr 100.0 2.5E-28 6.5E-33 198.4 23.4 284 5-321 1-292 (293) 19 TIGR01219 Pmev_kin_ERG8 phosph 100.0 6E-29 1.5E-33 202.4 16.9 308 9-318 1-487 (499) 20 PRK03188 4-diphosphocytidyl-2- 100.0 9E-26 2.3E-30 182.3 23.6 268 8-310 1-280 (299) 21 PRK04344 consensus 100.0 7.7E-26 2E-30 182.7 22.6 273 6-313 1-281 (284) 22 PRK02534 4-diphosphocytidyl-2- 100.0 5.8E-25 1.5E-29 177.1 26.5 288 5-322 1-306 (308) 23 PRK00128 ipk 4-diphosphocytidy 100.0 4E-25 1E-29 178.1 25.1 270 8-315 1-278 (281) 24 COG0083 ThrB Homoserine kinase 99.9 1.6E-24 4.1E-29 174.3 26.8 276 5-311 1-282 (299) 25 PTZ00299 homoserine kinase; Pr 99.9 4.9E-25 1.3E-29 177.6 20.1 293 1-322 1-321 (336) 26 KOG4519 consensus 99.9 4.4E-24 1.1E-28 171.6 23.6 310 7-318 2-444 (459) 27 TIGR00191 thrB homoserine kina 99.9 5.4E-25 1.4E-29 177.3 18.8 290 8-323 1-357 (359) 28 PRK01123 shikimate kinase; Pro 99.9 7.2E-23 1.8E-27 163.9 24.0 251 31-321 23-280 (283) 29 PRK03009 consensus 99.9 1.1E-20 2.8E-25 150.1 22.8 271 9-320 4-280 (287) 30 PRK01726 consensus 99.9 1E-20 2.7E-25 150.2 20.9 256 6-316 20-285 (312) 31 PRK00343 ipk 4-diphosphocytidy 99.9 1.1E-20 2.9E-25 150.0 20.6 257 7-318 2-268 (279) 32 COG1947 IspE 4-diphosphocytidy 99.9 7E-20 1.8E-24 145.0 23.8 259 5-298 1-266 (289) 33 PRK04943 consensus 99.9 1.8E-20 4.5E-25 148.7 20.5 262 4-316 2-269 (288) 34 PRK03288 consensus 99.9 3.3E-20 8.3E-25 147.1 17.6 236 10-296 5-256 (283) 35 PRK04648 consensus 99.8 4.6E-19 1.2E-23 139.8 19.4 257 11-316 18-280 (295) 36 PRK00650 4-diphosphocytidyl-2- 99.8 3.8E-18 9.8E-23 134.0 21.9 237 9-295 2-247 (288) 37 KOG4644 consensus 99.8 3.2E-17 8.2E-22 128.1 22.7 294 2-301 566-919 (948) 38 TIGR00154 ispE 4-diphosphocyti 99.8 1.2E-18 3E-23 137.2 11.2 265 5-295 1-293 (322) 39 PRK05905 hypothetical protein; 99.8 3.3E-17 8.3E-22 128.1 18.6 243 9-291 2-256 (258) 40 COG1685 Archaeal shikimate kin 99.8 1.5E-15 3.9E-20 117.5 23.9 266 10-323 5-276 (278) 41 TIGR01920 Shik_kin_archae shik 99.8 9.9E-16 2.5E-20 118.7 21.7 242 31-310 18-268 (273) 42 PRK04181 4-diphosphocytidyl-2- 99.8 8.8E-17 2.2E-21 125.4 16.2 236 10-289 1-253 (254) 43 COG3407 MVD1 Mevalonate pyroph 99.5 9.1E-12 2.3E-16 93.6 21.8 268 7-302 5-306 (329) 44 COG1907 Predicted archaeal sug 99.5 8.7E-11 2.2E-15 87.4 22.6 271 8-311 2-296 (312) 45 COG1829 Predicted archaeal kin 99.4 2.1E-09 5.2E-14 78.7 23.4 239 8-288 3-252 (283) 46 KOG2833 consensus 99.4 3.2E-09 8.2E-14 77.5 23.2 278 1-301 1-326 (395) 47 pfam00288 GHMP_kinases_N GHMP 99.3 1.6E-12 4E-17 98.5 5.6 67 90-158 1-67 (67) 48 COG4542 PduX Protein involved 99.3 1.5E-09 3.8E-14 79.6 17.5 246 3-291 9-264 (293) 49 TIGR01240 mevDPdecarb diphosph 99.2 2.9E-09 7.3E-14 77.8 17.7 269 9-301 2-321 (325) 50 pfam10509 GalKase_gal_bdg Gala 98.8 8.3E-09 2.1E-13 74.9 6.1 44 2-46 9-52 (52) 51 pfam08544 GHMP_kinases_C GHMP 98.8 1.7E-08 4.3E-13 73.0 7.3 66 225-292 2-67 (82) 52 KOG1537 consensus 98.5 1.8E-06 4.7E-11 60.1 10.2 215 88-317 94-346 (355) 53 TIGR00144 beta_RFAP_syn beta-r 98.3 0.00042 1.1E-08 45.2 17.6 290 10-316 3-326 (337) 54 TIGR00500 met_pdase_I methioni 65.0 9.7 0.00025 17.5 4.4 84 108-191 16-118 (265) 55 PRK00068 hypothetical protein; 63.5 10 0.00026 17.3 4.5 30 218-247 942-971 (978) 56 PRK05182 DNA-directed RNA poly 55.1 14 0.00037 16.5 5.9 90 178-268 193-301 (306) 57 PRK05456 ATP-dependent proteas 45.1 21 0.00053 15.5 5.1 125 7-139 4-156 (172) 58 cd01913 protease_HslV Protease 41.3 24 0.0006 15.1 5.8 125 7-139 3-155 (171) 59 PRK11302 DNA-binding transcrip 39.4 25 0.00064 14.9 6.9 42 277-320 172-213 (284) 60 TIGR02957 SigX4 RNA polymerase 37.8 24 0.00061 15.1 2.1 24 219-242 170-193 (287) 61 TIGR01418 PEP_synth phosphoeno 35.1 29 0.00075 14.5 4.5 157 150-322 647-868 (877) 62 PRK07201 short chain dehydroge 33.0 32 0.00081 14.3 6.5 73 243-317 331-433 (663) 63 TIGR02667 moaB_proteo molybden 30.6 23 0.00059 15.1 1.1 32 221-254 81-112 (163) 64 PRK07774 short chain dehydroge 29.8 36 0.00092 13.9 5.3 20 74-96 72-91 (250) 65 PRK06194 hypothetical protein; 28.5 38 0.00097 13.8 5.3 14 220-233 242-255 (301) 66 cd05563 PTS_IIB_ascorbate PTS_ 28.2 28 0.00072 14.6 1.2 21 100-120 7-27 (86) 67 PRK05855 short chain dehydroge 27.8 39 0.001 13.7 5.4 19 298-316 477-495 (582) 68 pfam11921 DUF3439 Domain of un 27.1 40 0.001 13.6 2.0 11 233-243 81-91 (122) 69 pfam03118 RNA_pol_A_CTD Bacter 26.8 41 0.001 13.6 3.3 46 219-264 14-60 (62) 70 PRK08507 prephenate dehydrogen 26.6 41 0.001 13.6 3.9 25 222-246 247-271 (275) 71 TIGR00555 panK_eukar pantothen 26.2 35 0.00089 14.0 1.4 33 100-139 207-239 (387) 72 COG0024 Map Methionine aminope 26.1 42 0.0011 13.5 5.3 19 123-141 33-51 (255) 73 PRK05876 short chain dehydroge 25.1 44 0.0011 13.4 5.0 14 220-233 229-242 (275) 74 PRK07417 arogenate dehydrogena 24.6 45 0.0011 13.3 5.4 27 221-247 249-275 (280) 75 PRK08655 prephenate dehydrogen 24.6 45 0.0011 13.3 6.0 59 197-259 225-287 (441) 76 CHL00013 rpoA RNA polymerase a 24.3 45 0.0012 13.3 3.6 48 219-266 278-326 (333) 77 pfam00392 GntR Bacterial regul 23.8 46 0.0012 13.2 3.3 58 220-280 5-62 (64) 78 TIGR02068 cya_phycin_syn cyano 23.5 47 0.0012 13.2 1.7 29 103-131 687-715 (876) 79 COG3424 BcsA Predicted naringe 23.1 48 0.0012 13.2 5.3 49 237-292 295-344 (356) 80 cd05567 PTS_IIB_mannitol PTS_I 23.0 38 0.00097 13.8 1.1 10 100-109 8-17 (87) 81 PRK06720 hypothetical protein; 22.8 48 0.0012 13.1 4.9 15 265-279 139-153 (169) 82 PRK11199 tyrA bifunctional cho 21.8 51 0.0013 13.0 6.0 63 199-265 306-369 (374) 83 PRK11337 DNA-binding transcrip 21.3 52 0.0013 12.9 8.5 42 278-320 186-227 (293) 84 COG0041 PurE Phosphoribosylcar 21.0 52 0.0013 12.9 5.0 16 126-141 41-56 (162) No 1 >PRK03926 mevalonate kinase; Provisional Probab=100.00 E-value=0 Score=442.00 Aligned_cols=299 Identities=32% Similarity=0.430 Sum_probs=260.4 Q ss_pred EEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHCCC Q ss_conf 89997170499974353308942898776772899999868984999953644446566120150899999999971877 Q gi|254781017|r 7 KICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIKPS 86 (324) Q Consensus 7 ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~ 86 (324) =|++||||||+|+|||+||||++||++|||++++++++++++. +.+.+...+ ...+.+++..++..++...+. T Consensus 2 mi~asAPGrv~LiGEH~v~~G~~aia~AI~~~~~v~~~~~~d~-i~~~~~~~~------~~~~~~~~~~~l~~~~~~~~~ 74 (302) T PRK03926 2 MVLCSAPGKIYLFGEHAVVYGKPAIACAIDLRTRVRAELNDDS-IYIQSDYGK------TGEVHPYVRAAIEKVREEADK 74 (302) T ss_pred EEEEEECEEEEEEECCEEECCCEEEEEEECCCEEEEEEECCCE-EEEECCCCC------CCCHHHHHHHHHHHHHHHCCC T ss_conf 7999807899999387698696089999378199999998990-699754776------542007999999999975689 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEECC Q ss_conf 87389998268753548874024566556777640456666014557888877652146887544776629858998527 Q gi|254781017|r 87 CGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQMPK 166 (324) Q Consensus 87 ~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~~~ 166 (324) .||++.|+|++|+|+|||||||++||++.+++.+ ++.++++++++++|+.+|+.+||.|||+||++++|||+++|+... T Consensus 75 ~G~~i~I~S~iP~gaGLgSSAAl~va~~~al~~~-~g~~ls~~~la~la~~~E~~~~G~~sGiD~~~s~~Gg~~~~~~~~ 153 (302) T PRK03926 75 DGVDLSITSQIPVGSGLGSSAAVTVATIGALNKL-LGLGLSLEEIAKMGHEVELLVQGAASPTDTYVSTIGGFVTIPDRK 153 (302) T ss_pred CCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCEEEECCCC T ss_conf 9837999656887778763899999999999985-489989999999999999874689978899999759989966789 Q ss_pred CEEEEEECCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 51567304777526765841798999988899886082348999999984033207889873048644568999998999 Q gi|254781017|r 167 YSIEKIDFIFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGL 246 (324) Q Consensus 167 ~~~~~~~~~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~ 246 (324) . ++ ....++++++||.+++|++++..|++.+.++|+ ..+.+++.++++++++.++++++|+..||++||++|.+ T Consensus 154 ~-l~--~~~~~l~l~~tg~~~~t~~~v~~v~~~~~~~~~---~~~~~~~~~~~i~~~a~~~l~~~d~~~lG~Lm~~~h~l 227 (302) T PRK03926 154 K-LP--FMDCGIVVGYTGSSGSTKELVANVRKLYEEYPE---LIEPILESIGKLSEKAEELVLSGDYVSIGELMNINQGL 227 (302) T ss_pred C-CC--CCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 7-77--789788999839746599999999999876699---99999999999999999877656499999999996887 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC-CCCCCCCC Q ss_conf 9861678779999999999719945997268997548999846653268999999999977978999414-54564459 Q gi|254781017|r 247 LETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP-SHSTSLYR 324 (324) Q Consensus 247 L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~ 324 (324) |++++||+|+||++++.++. .|++|+||||||||||+|+|+++|. .+++.+.|.+.|++.+.++| .+|.++.+ T Consensus 228 l~~lgVS~peld~lv~~a~~-~GalGAKltGaG~GGc~iaL~~~d~----~~~v~~al~~ag~~~~~~~~~~~g~~~~~ 301 (302) T PRK03926 228 LDALGVSTKELSELVYAART-AGAFGAKITGAGGGGCMVALTAPDK----CREVATAIGIAGGKPMITKPTEEGLRIEE 301 (302) T ss_pred HHHCCCCCHHHHHHHHHHHH-CCCCEEEECCCCCCCEEEEEECCCC----HHHHHHHHHHCCCEEEEEEECCCCEEEEC T ss_conf 87316898899999999996-7982888058987628999946131----99999999976996899567798237514 No 2 >PTZ00298 mevalonate kinase; Provisional Probab=100.00 E-value=0 Score=431.64 Aligned_cols=306 Identities=23% Similarity=0.276 Sum_probs=246.0 Q ss_pred ECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCC-CEEEEEECCCCCCCCCC--HHHCCCHHHHHHHHHHHHCCCC Q ss_conf 71704999743533089428987767728999998689-84999953644446566--1201508999999999718778 Q gi|254781017|r 11 SAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKD-RLINIDSSLGQYCGSLD--LAMFHPSFSFIIMAINHIKPSC 87 (324) Q Consensus 11 ~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~-~~i~i~~~~~~~~~~~~--~~~~~~~~~~i~~~l~~~~~~~ 87 (324) .|+|||+|||||+||||+|||++|||+|++++++..++ ..+.+..........+. ..........+...++...... T Consensus 14 ~a~GKvILfGEHaVVyG~pAIA~aI~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~ 93 (328) T PTZ00298 14 IGYGKVILFGEHFVVYGAEAIVAGIDEYTECRLELTKGVPGLQVVDQRPAVPGYIVEKREEQRKAHQLVLRHLNIDTSVD 93 (328) T ss_pred EEEEEEEEEECCEECCCCCEEEEECCCCEEEEEEECCCCCCCEEECCCCCCCCHHHHCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 47089999836831019735998834866999997579997166237855641233212667778999999854666677 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEECCC Q ss_conf 73899982687535488740245665567776404566660145578888776521468875447766298589985275 Q gi|254781017|r 88 GFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQMPKY 167 (324) Q Consensus 88 g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~~~~ 167 (324) |+++.++|++|+++|||||||++||+++||+.+ ++.+++.++++++|+.+|+.+||+|||+|+++++|||+++|++.+. T Consensus 94 g~~i~i~S~iP~gaGLGSSAAv~VA~~~Al~~~-~~~~ls~~~i~~~A~~~E~~~hG~pSGiD~~~s~~Gg~i~f~~~~~ 172 (328) T PTZ00298 94 GLKMHLGGPLVPSSGIGASASDVVSLSRALSEL-YQLNLTEEEVNLSAFVGEGGYHGTPSGADNTAATYGGLISYRRVNG 172 (328) T ss_pred CEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCEEEEEECCC T ss_conf 538997058886567350688999999999998-4799999999999999999855899727699994598799985488 Q ss_pred --EEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHH Q ss_conf --15673047--77526765841798999988899886082348999999984033207889873048644568999998 Q gi|254781017|r 168 --SIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQ 243 (324) Q Consensus 168 --~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~ 243 (324) ...++++. ..+++++||++++|+++++.|++++.++|+ ..+.+++.+++++.++.++|+++|++.||++||+| T Consensus 173 ~~~~~~i~~~~~~~lvi~dtg~~~~T~~~V~~V~~~~~~~p~---~~~~i~~~~~~iv~~a~~aL~~~d~~~lG~Lm~~n 249 (328) T PTZ00298 173 KSVFKRIAFQQPLYLVVCSTGITASTTKVVGDVRKLKENQPT---WFNRLLENYNACVSEAKEALQKGNLFRVGELMNAN 249 (328) T ss_pred CCCCEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCHH---HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 644202367897069998369865369999999999865869---99999999999999999998648999999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHH-HHHHHHHH-HHHCCCEEEEEECCCCCC Q ss_conf 9999861678779999999999719945997268997548999846653268-99999999-997797899941454564 Q gi|254781017|r 244 QGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSL-PYQSVNCH-MHAKGIDIVPITPSHSTS 321 (324) Q Consensus 244 ~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~-~~~~~~~~-~~~~g~~~~~~~~~~~~~ 321 (324) |.+|++||||+|+||++|+.++. .|++|+||||||+|||+|+|++++.+++ +.+.+.+. -.+.-.|.+.+.|+-.+| T Consensus 250 h~lL~~lgVS~p~Ld~lv~~a~~-~GAlGAKLTGAG~GGcvIaL~~~~~~a~~i~~al~~~~~~a~~~w~~~~~~~~~~~ 328 (328) T PTZ00298 250 HDLCQKLTVSCRELDSIVQTCRT-YGALGAKMSGTGRGGLVVALAASEDQRDAIAKAVRARCPEAKFVWRYTVQPSSAKN 328 (328) T ss_pred HHHHHHCCCCCHHHHHHHHHHHH-CCCCEEEECCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCC T ss_conf 99897468897899999999997-69965575146887589998386666999999999746677305999955666688 No 3 >TIGR00549 mevalon_kin mevalonate kinase; InterPro: IPR006205 Mevalonate kinase (2.7.1.36 from EC) is well-characterised among the Eukaryotes, where it plays a role in the synthesis of isopentanyl pyrophosphate, a common intermediate for a number of pathways including cholesterol biosynthesis. It is also involved in mevalonate catabolism. Close homologs are found in the Archaea. A bacterial example is found in the Lyme disease spirochete Borrelia burgdorferi, but is not necessarily the same biosynthetic enzyme.; GO: 0004496 mevalonate kinase activity, 0005524 ATP binding, 0008299 isoprenoid biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=438.33 Aligned_cols=271 Identities=30% Similarity=0.460 Sum_probs=230.4 Q ss_pred CCEEEEEECCCEEECCCEEEEEEECCEEEEEEEE-------------------------CCCCE--EE---EEECCCCCC Q ss_conf 1704999743533089428987767728999998-------------------------68984--99---995364444 Q gi|254781017|r 12 APGSLVLMGEHGVLHGHAALVFAINKRVILYLTL-------------------------RKDRL--IN---IDSSLGQYC 61 (324) Q Consensus 12 aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~-------------------------~~~~~--i~---i~~~~~~~~ 61 (324) |||||+|||||+||||.|||++||+.++++++.. ..+.. .. +++...... T Consensus 1 ApgK~IL~GEHAVVYG~PAiA~~i~~~~~~~~~~~~S~~~~~Le~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (345) T TIGR00549 1 APGKIILFGEHAVVYGEPAIAAAIDLRTTVTVIELDSAVAFRLEALAKDIILTLEDLARNSDSILYSLSSKIESELGRGS 80 (345) T ss_pred CCCEEEEECCCEEECCCCCEEEEECCEEEEEEEECCCHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCC T ss_conf 97405762360011278504666336068998636732456665405644443387620254034442012110015676 Q ss_pred CCCCHHH-------------------CCCHHHHHHH-HHHHHCC---C--CCEEEEEECCCCCCCCCCCCCHHHHHHHHH Q ss_conf 6566120-------------------1508999999-9997187---7--873899982687535488740245665567 Q gi|254781017|r 62 GSLDLAM-------------------FHPSFSFIIM-AINHIKP---S--CGFDLKVISQLDSQLGLGSSAAITVAITAA 116 (324) Q Consensus 62 ~~~~~~~-------------------~~~~~~~i~~-~l~~~~~---~--~g~~i~i~s~iP~g~GLGSSaa~~va~~~a 116 (324) .....+. ....+.+|.. .+.++.. . .|++|+|+|+||+|+|||||||++||+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~p~~~~i~S~iP~g~GLGSSAAv~vA~~~a 160 (345) T TIGR00549 81 LDDAPQELDGLVSELVEPADKQLTPSLSESLSYIAAAALSKLSELNEPPKTPLEIEIESEIPPGRGLGSSAAVAVALIRA 160 (345) T ss_pred CCCHHHHHCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 44013765122443166556530389999999999999999750267554872799850536655563479999999999 Q ss_pred HHHHHCC---------CCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEECCC--EEEEEECC-CCCCEEECC Q ss_conf 7764045---------66660145578888776521468875447766298589985275--15673047-775267658 Q gi|254781017|r 117 LLTLQYH---------KEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQMPKY--SIEKIDFI-FPIHLIYSG 184 (324) Q Consensus 117 l~~~~~~---------~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~~~~--~~~~~~~~-~~l~l~~tg 184 (324) ++.+ ++ .+++.++++++|+++|+.+||+|||+|.+++.|||+++|+++.. ........ ..|+++||| T Consensus 161 l~~~-~g~~~n~~s~L~~~~~~~~~~~a~~~E~~~HG~pSG~D~~~~~~g~~~~f~k~~~~~~~~~~~~~~~~~~i~dtg 239 (345) T TIGR00549 161 LADY-FGIRSNDLSKLSELSKEELAKLANEAEKIVHGKPSGIDTATSTYGGPVLFEKGEGVEETKLISLDALYLVIADTG 239 (345) T ss_pred HHHH-HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCEEEEECCCCCCEEEECCCCCCEEEEEECC T ss_conf 9987-044668621024689889999998877651799853122123448668987688873144027899608998348 Q ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCC-------HHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 417989999888998860823489999999840332078898730486-------4456899999899998616787799 Q gi|254781017|r 185 YKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKN-------LKVLAQAMNRQQGLLETLGVSDSKL 257 (324) Q Consensus 185 ~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d-------~~~l~~lm~~~~~~L~~lgvs~~~l 257 (324) ++++|+++|+.|+++..++| +..+.+++.+++++.++.++|+++| .+.||++||.||.+|++||||+|+| T Consensus 240 ~~~~T~~~V~~V~~l~~~~p---~~~~~~~~~~g~l~~~a~~a~~~~d~~~~~ek~~~lg~lm~~nq~lL~~lGVS~~~l 316 (345) T TIGR00549 240 VSGSTKEAVAKVRQLLERFP---ELIDAIMDAIGELTKEAKAALEDGDEVEDSEKVESLGELMNINQGLLKALGVSTPKL 316 (345) T ss_pred CCCCHHHHHHHHHHHHHHCH---HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 77748999999999875171---689999999999999999999718831035589999999999998888707985799 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCEEEEE Q ss_conf 999999997199459972689975489998 Q gi|254781017|r 258 SEIVWKLREQPHIMASKISGSGLGDCVIAL 287 (324) Q Consensus 258 ~~li~~~~~~~g~~~aklsGAG~GG~~i~l 287 (324) |+||+.+++ .|++|+||||||+|||+|+| T Consensus 317 d~l~~~~~~-~GAlGaKlTGaG~GGc~iaL 345 (345) T TIGR00549 317 DQLVELARK-AGALGAKLTGAGGGGCMIAL 345 (345) T ss_pred HHHHHHHHH-CCCCEEEECCCCCCCEEEEC T ss_conf 999999986-79746310157776536539 No 4 >COG1577 ERG12 Mevalonate kinase [Lipid metabolism] Probab=100.00 E-value=0 Score=416.28 Aligned_cols=298 Identities=31% Similarity=0.444 Sum_probs=255.8 Q ss_pred EEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHCC-- Q ss_conf 999717049997435330894289877677289999986898499995364444656612015089999999997187-- Q gi|254781017|r 8 ICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIKP-- 85 (324) Q Consensus 8 i~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~-- 85 (324) +.++||||++|||||+||||+|||++||+++.++++++++++.+.+++...... ....+..++.....+..+..+.+ T Consensus 1 ~~~~aPgKliL~GEHAVVyG~pAI~~aI~~~~~v~~~~s~~~~~~i~~~~~~~~-~~~~~~~~~~~~~~v~~~~e~~~~~ 79 (307) T COG1577 1 ISVSAPGKLILFGEHAVVYGYPAIAAAIDLRVTVTISESDSNKIVIESSDLKSS-TLERDEDEGYIQAAVRLASELLNQS 79 (307) T ss_pred CCCCCCCCEEEEECCEEEECCCHHHEEEEEEEEEEEEECCCCCEEEECCCCCCC-CCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 932235308998113455187315135010089999966888579841677765-5456663248999999999873325 Q ss_pred -CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEE Q ss_conf -7873899982687535488740245665567776404566660145578888776521468875447766298589985 Q gi|254781017|r 86 -SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQM 164 (324) Q Consensus 86 -~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~ 164 (324) ..+++++|+|++|+|+|||||||++|++++|++.+ ++.++++++++++|+.+|..+||+|||+|.++|+|||+++|+. T Consensus 80 ~~~~~~l~I~S~iP~g~GLGSSAAVsva~i~al~~~-~g~~ls~~~l~~la~~~e~~vqG~~Sg~D~a~~~~gg~v~~~~ 158 (307) T COG1577 80 SLKPFSLEIDSEIPIGAGLGSSAAVSVAVIKALSAY-FGVELSPEELAKLANKVELIVQGKASGIDIATITYGGLVAFKK 158 (307) T ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCEEEEEC T ss_conf 789847999627998878627999999999999986-2898898999999999999875898761403886188899845 Q ss_pred C-CCEEEEEECCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 2-751567304777526765841798999988899886082348999999984033207889873048644568999998 Q gi|254781017|r 165 P-KYSIEKIDFIFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQ 243 (324) Q Consensus 165 ~-~~~~~~~~~~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~ 243 (324) . +.....+++...+++.+||++.+|+++|+.|+.++++.| +....+++.++++++++..+++++|.+.|+++||.| T Consensus 159 ~~~~~~l~~~~~~~~~I~~tg~~~sT~e~V~~V~~l~~~~~---~~~~~~~~~ig~~~~~a~~al~~~d~e~lgelm~~n 235 (307) T COG1577 159 GFDFEKLEIELLGTLVIGDTGVPGSTKELVAGVAKLLEEEP---EVIDPILDAIGELVQEAEAALQTGDFEELGELMNIN 235 (307) T ss_pred CCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 88752013566771899974886608999999999998646---778999999999999999998616499999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99998616787799999999997199459972689975489998466532689999999999779789994 Q gi|254781017|r 244 QGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPI 314 (324) Q Consensus 244 ~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~ 314 (324) |.+|++||||+|+++++++.++. .|++|+|+||||+|||+|+|++++.. .+.+...|.+.|...+.. T Consensus 236 q~LL~~LgVs~~~L~~lv~~a~~-~Ga~gaKlTGAGgGGc~IaL~~~~~~---~~~l~~~~~~~~i~~~~~ 302 (307) T COG1577 236 QGLLKALGVSTPELDELVEAARS-LGALGAKLTGAGGGGCIIALAKNEEI---AETLSNRLEKAGIALLWI 302 (307) T ss_pred HHHHHHCCCCCHHHHHHHHHHHH-CCCCCCCCCCCCCCCEEEEEECCCHH---HHHHHHHHHHCCCCHHCC T ss_conf 98898658486899999999986-69510021468888669998355158---999998888658630000 No 5 >PRK03817 galactokinase; Provisional Probab=100.00 E-value=0 Score=393.64 Aligned_cols=314 Identities=22% Similarity=0.210 Sum_probs=242.3 Q ss_pred EEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCH----HHCCCHHHHHHHHHHHH Q ss_conf 99971704999743533089428987767728999998689849999536444465661----20150899999999971 Q gi|254781017|r 8 ICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDL----AMFHPSFSFIIMAINHI 83 (324) Q Consensus 8 i~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~----~~~~~~~~~i~~~l~~~ 83 (324) |.|||||||+|||||+||+|++|+++|||+++.+++++++ .+.+.+...+....++. ..|..++..++..+... T Consensus 1 i~v~APGRV~LiGEH~Dy~gg~vl~~AI~~~~~i~~~~~~--~~~~~s~~~~~~~~~~~~~~~~~w~~yv~gvi~~l~~~ 78 (351) T PRK03817 1 MKVKSPGRVNLIGEHTDYNDGYVLPFAIDRYTVLEIEKSD--KFIFYSENFNETFELDKLEKTNSWADYIKGVIWVLEKE 78 (351) T ss_pred CEEECCCCEEEECCCEEECCCCEEEEEECCEEEEEEEECC--CEEEEECCCCCCEEECCCCCCCCHHHHHHHHHHHHHHC T ss_conf 9696871379863887788986795882124999999789--18999888887234045566797688999999999970 Q ss_pred -CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-HHHHHHHCCC--- Q ss_conf -87787389998268753548874024566556777640456666014557888877652146887-5447766298--- Q gi|254781017|r 84 -KPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-IDLAASIHGG--- 158 (324) Q Consensus 84 -~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg--- 158 (324) ....|+++.|+|+||+++|||||||++||++.|+..+ ++.++++.+++++++.+|+.++|.||| |||+++++|+ T Consensus 79 g~~~~G~~i~i~s~iP~gsGLsSSAAl~va~~~al~~~-~~~~l~~~~la~~~~~aE~~~~G~~~G~mDq~as~~g~~~~ 157 (351) T PRK03817 79 GYEVGPVKGKVSSNLPLGAGLSSSAALEVAVAYALNEY-YNLNLSKLELALIARKAENEFVGVPCGIMDQFAVAFGKKDH 157 (351) T ss_pred CCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCE T ss_conf 99988749999778889888764899999999999998-48999999999999999998615885088899998545773 Q ss_pred EEEEEECCCEEEEEECC---CCCCEEECCCCCCHHH---------HHHHHHHHHHHC------CCHHHHHHHHHHHH--- Q ss_conf 58998527515673047---7752676584179899---------998889988608------23489999999840--- Q gi|254781017|r 159 LICYQMPKYSIEKIDFI---FPIHLIYSGYKTPTAQ---------VLKKISYIEIEY------PEINEINQKIYALM--- 217 (324) Q Consensus 159 ~i~~~~~~~~~~~~~~~---~~l~l~~tg~~~~T~~---------~v~~v~~~~~~~------~~~~~~~~~~~~~~--- 217 (324) +++++..+...+.++++ ..+++++||.++.+.. +...++.+.++. .+..+......+++ T Consensus 158 ~~~~d~~~~~~~~~~~~~~~~~~vv~~s~~~~~l~~~~yn~r~~~~~~a~~~l~~~~~~~~~~~~l~~l~~~~~~r~~hv 237 (351) T PRK03817 158 AIFLDTMTLEYEYIPFPLKDYEILVFDSGVKHELASSEYNERRKECEEALKILGKKSFREVTEEDLEKLPPLLRKRAGHV 237 (351) T ss_pred EEEECCCCCCEEEECCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHCCHHHHHHCCHHHHHHHHHH T ss_conf 89821688844763368877589999779754345106789999999999996600044316999985768899999999 Q ss_pred ---CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCC Q ss_conf ---332078898730486445689999989999861-6787799999999997199459972689975489998466532 Q gi|254781017|r 218 ---GKLSQISCQALRNKNLKVLAQAMNRQQGLLETL-GVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLN 293 (324) Q Consensus 218 ---~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l-gvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~ 293 (324) ...+.++.+||+++|++.||++||++|..|+++ +||+|++|.+++.++...|++|+||||||||||+++|++++.. T Consensus 238 ~~E~~rv~~~~~al~~~d~~~lG~Lm~~sh~sLr~~~~vS~~eld~lv~~a~~~~Ga~GakltGaG~GGc~vaLv~~~~~ 317 (351) T PRK03817 238 LRENERVLKVRDALKEGDVETLGKLLYESHESLADLYEVSCEELDFIVEFLLGKLGIYGARLVGGGFGGSVIVLVKKGAF 317 (351) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECEEECCCCCEEEEEEECHHHH T ss_conf 98889999999998738999999999998899988664897789999999986089867533148875589999754679 Q ss_pred HHHHHHHHHHHHH-CCCE--EEEEECCCCCCCCC Q ss_conf 6899999999997-7978--99941454564459 Q gi|254781017|r 294 SLPYQSVNCHMHA-KGID--IVPITPSHSTSLYR 324 (324) Q Consensus 294 ~~~~~~~~~~~~~-~g~~--~~~~~~~~~~~~~~ 324 (324) +.+++.+.+++.+ .|.. .+.+.|..|+++.+ T Consensus 318 e~~~~~v~~~Y~~~~~~~~~~~~~~~~~Ga~~~~ 351 (351) T PRK03817 318 ESIGEELLEEYTKKFGITPEYFKVESSDGVQKIE 351 (351) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCCEECC T ss_conf 9999999999999639998199998488956187 No 6 >PRK05322 galactokinase; Provisional Probab=100.00 E-value=0 Score=386.17 Aligned_cols=319 Identities=19% Similarity=0.230 Sum_probs=250.1 Q ss_pred CCCCCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCC----CCCC------HHHCCC Q ss_conf 988878999717049997435330894289877677289999986898499995364444----6566------120150 Q gi|254781017|r 2 GQCLHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYC----GSLD------LAMFHP 71 (324) Q Consensus 2 ~~~~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~----~~~~------~~~~~~ 71 (324) |..|. +.++|||||+|+|||+||+|++|+++|||+++++.+.+++++.+++.+...+.. .+++ ...|.+ T Consensus 15 g~~p~-~~~~APGRVnLiGeHtDY~gg~Vlp~AI~~~~~ia~~~~~d~~i~i~s~~~~~~~~~~~~~~~~~~~~~~~W~~ 93 (387) T PRK05322 15 GEEED-RVFFSPGRINLIGEHTDYNGGHVFPAAITLGTYGAARKRDDKKVRLYSANFEDLGIIEFDLDDLSYDKEDDWAN 93 (387) T ss_pred CCCCC-EEEECCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEECCCCEEEEEECCCCCCCEEEEECCCCCCCCCCCHHH T ss_conf 89989-89976476365614430599817958800374999998899879999898887754998733258788766677 Q ss_pred HHHHHHHHHHHHCC--CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC- Q ss_conf 89999999997187--787389998268753548874024566556777640456666014557888877652146887- Q gi|254781017|r 72 SFSFIIMAINHIKP--SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG- 148 (324) Q Consensus 72 ~~~~i~~~l~~~~~--~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG- 148 (324) +++.++..++.... ..||++.|.|+||.|+||+||||+.|+++.++..+ ++.++++.+++++|+.+|+.+.|.+|| T Consensus 94 Yv~Gvi~~l~~~g~~~~~G~d~~I~s~IP~gaGLSSSAAl~va~~~al~~~-~~~~l~~~ela~la~~aE~~~~G~~~Gi 172 (387) T PRK05322 94 YPKGVLKYLQEAGYKIDHGFDLYIYGNIPNGAGLSSSASIELLTGVILKDL-FNLDLDRLDLVKLGQKVENEFIGVNSGI 172 (387) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999999999973998778738999578888777530889999999999997-3899999999999999988871698742 Q ss_pred HHHHHHHCCC---EEEEEECCCEEEEEECCC---CCCEEECCCCCCH---------HHHHHHHHHHHHHCC--CHHH--- Q ss_conf 5447766298---589985275156730477---7526765841798---------999988899886082--3489--- Q gi|254781017|r 149 IDLAASIHGG---LICYQMPKYSIEKIDFIF---PIHLIYSGYKTPT---------AQVLKKISYIEIEYP--EINE--- 208 (324) Q Consensus 149 ~D~~a~~~Gg---~i~~~~~~~~~~~~~~~~---~l~l~~tg~~~~T---------~~~v~~v~~~~~~~~--~~~~--- 208 (324) |||+++++|. .+++++.+...+.++++. .+++++|+.++.. .++.+.++.+++... ...+ T Consensus 173 MDQ~as~~g~~~~al~iD~~~l~~~~~p~~~~~~~~vi~~s~~~~~l~~s~Yn~R~~ec~~a~~~l~~~~~i~~l~~~~~ 252 (387) T PRK05322 173 MDQFAIGMGKKDHAILLDTNTLEYEYVPLDLGDYVIVIMNTNKRRELADSKYNERRAECEKALKELQKKLDIKSLGELTE 252 (387) T ss_pred HHHHHHHHCCCCEEEEECCCCCCEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCH T ss_conf 34677774257748862278884366125788827999857976434632125689999999999876249423552999 Q ss_pred --------------HHHHHHHHH--CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHCCCCE Q ss_conf --------------999999840--33207889873048644568999998999986-1678779999999999719945 Q gi|254781017|r 209 --------------INQKIYALM--GKLSQISCQALRNKNLKVLAQAMNRQQGLLET-LGVSDSKLSEIVWKLREQPHIM 271 (324) Q Consensus 209 --------------~~~~~~~~~--~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~-lgvs~~~l~~li~~~~~~~g~~ 271 (324) ..++....+ .+.+.++.+||+++|++.||++|+++|.+|++ ++||||+||.|++.+...+|++ T Consensus 253 ~~~~~~~~~l~~~~~~~Ra~Hvv~En~Rv~~a~~aL~~~d~~~~G~Lm~~sH~sLrd~yevS~~elD~Lv~~a~~~~Ga~ 332 (387) T PRK05322 253 EEFDEYSYLIKDETLLKRARHAVTENQRTLKAVKALKAGDLEKFGRLMNASHVSLRDDYEVTGLELDTLVEAAWKQPGVL 332 (387) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCE T ss_conf 99998875425688899999999877999999999866999999999999899999866279888999999987638973 Q ss_pred EEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH-CCCE--EEEEECCCCCCC Q ss_conf 99726899754899984665326899999999997-7978--999414545644 Q gi|254781017|r 272 ASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHA-KGID--IVPITPSHSTSL 322 (324) Q Consensus 272 ~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~-~g~~--~~~~~~~~~~~~ 322 (324) |+||||||||||+++|++.+..+.+++.+.+.+.+ .|++ .+...|+.|.++ T Consensus 333 GARmtG~GfGGc~vaLv~~~~~~~~~~~v~~~y~~~~g~~~~~~~~~~~~Ga~~ 386 (387) T PRK05322 333 GARMTGAGFGGCAIAIVKKDEVEAFKENVGKAYEEKIGYAASFYVAEIGDGARE 386 (387) T ss_pred ECEEECCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCC T ss_conf 773624888226999962778999999999999985299987899861898200 No 7 >PRK00555 galactokinase; Provisional Probab=100.00 E-value=0 Score=381.76 Aligned_cols=316 Identities=18% Similarity=0.210 Sum_probs=248.3 Q ss_pred CEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCH-------HHCCCHHHHHHH Q ss_conf 7899971704999743533089428987767728999998689849999536444465661-------201508999999 Q gi|254781017|r 6 HKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDL-------AMFHPSFSFIIM 78 (324) Q Consensus 6 ~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~-------~~~~~~~~~i~~ 78 (324) .+++.+|||||+|+|||+||+|+.++++|||+++++.+.+++++.+.+.+...+....++. ..|..++..++. T Consensus 1 ~~~~~sAPGRVnliGEH~DY~gG~VLp~AI~~~~~v~~~~~~~~~i~~~s~~~~~~~~~~~~~~~~~~~~W~~y~~gv~~ 80 (363) T PRK00555 1 MTVSYAAPGRINLIGEHTDYNLGFALPIALPQRTVVTFTPEHTGAITARSDRADGSVRIPLDTTPGQVTGWAAYAAGVIW 80 (363) T ss_pred CCEEEECCCCEEEEECCEEECCCEEEEEEHHHCEEEEEEECCCCEEEEEECCCCCCEEEECCCCCCCCCCHHHHHHHHHH T ss_conf 92699766658976287578997469478201289999988998699997688986798677687766227899999999 Q ss_pred HHHHHC-CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-HHHHHHHC Q ss_conf 999718-7787389998268753548874024566556777640456666014557888877652146887-54477662 Q gi|254781017|r 79 AINHIK-PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-IDLAASIH 156 (324) Q Consensus 79 ~l~~~~-~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~D~~a~~~ 156 (324) .++... +..|+++.|.|+||+|+||+||||+.||++.++..+ ++..+++.+++++|+.+|+.++|.+|| |||+++++ T Consensus 81 ~l~~~g~~~~g~~~~I~s~IP~gaGLSSSAAl~va~~~al~~~-~~~~~~~~ela~la~~aE~~~~G~~~GiMDQ~as~~ 159 (363) T PRK00555 81 ALRGAGHPVPGGAMSITSDVEIGSGLSSSAALECAVLGAVGAA-TGTRIDRLEQARLAQRAENDYVGAPTGLLDQLAALF 159 (363) T ss_pred HHHHHCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHC T ss_conf 9997189988852899778887667441899999999999997-089999999999999999751788875054788862 Q ss_pred CC---EEEEEECCCEEEEEECCC-----CCCEEECCCCCCHHH-----H---HHHHHHHHHHCCCHHHH----------- Q ss_conf 98---589985275156730477-----752676584179899-----9---98889988608234899----------- Q gi|254781017|r 157 GG---LICYQMPKYSIEKIDFIF-----PIHLIYSGYKTPTAQ-----V---LKKISYIEIEYPEINEI----------- 209 (324) Q Consensus 157 Gg---~i~~~~~~~~~~~~~~~~-----~l~l~~tg~~~~T~~-----~---v~~v~~~~~~~~~~~~~----------- 209 (324) |. .+++++.+..+..++++. .+++++|+.++.+.. . .+.+..... .+...+. T Consensus 160 g~~g~al~id~~~l~~~~v~~~~~~~~~~~vi~dS~~~~~~~~~~y~~R~~ec~~aa~~l~-~~~lr~~~~~~l~~l~~~ 238 (363) T PRK00555 160 GAPKTALLIDFRDLTVRPVAFDPDACGVVLLLMDSRARHRHAGGEYAARRASCERAAADLG-VSSLRAVQDRGLAALGAI 238 (363) T ss_pred CCCCEEEEECCCCCCEECCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHH T ss_conf 7888398732678853214447776776999976676443453235899999999999859-123544207689998750 Q ss_pred -----HHHHHHHHC--CCCHHHHHHHCCCCHHHHHHHHHHHHHHHH-HHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf -----999998403--320788987304864456899999899998-616787799999999997199459972689975 Q gi|254781017|r 210 -----NQKIYALMG--KLSQISCQALRNKNLKVLAQAMNRQQGLLE-TLGVSDSKLSEIVWKLREQPHIMASKISGSGLG 281 (324) Q Consensus 210 -----~~~~~~~~~--~i~~~~~~al~~~d~~~l~~lm~~~~~~L~-~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~G 281 (324) .+.....+. +.+.++.++++++|++.||++||++|.+|+ .+++|||++|.+++.+++ .|++|+|||||||| T Consensus 239 ~~~~~~~Ra~hv~~E~~RV~~~~~al~~~d~~~~G~Lm~~SH~Slrd~yevS~~elD~lv~~a~~-~Ga~GARltGaGfG 317 (363) T PRK00555 239 ADPIDARRARHVLTENQRVLDFVAALADSDFTAAGQILTASHASMRDDFAITTERIDLIADSAVR-AGALGARMTGGGFG 317 (363) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH-CCCCEEEEECCCCC T ss_conf 58888888888888888999999998739999999999999999998651796879999999997-69927889737887 Q ss_pred CEEEEEECCCCCHHHHHHHHHHHHHCCCE---EEEEECCCCCCCCC Q ss_conf 48999846653268999999999977978---99941454564459 Q gi|254781017|r 282 DCVIALGKGDLNSLPYQSVNCHMHAKGID---IVPITPSHSTSLYR 324 (324) Q Consensus 282 G~~i~l~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~~~~ 324 (324) ||+++|++.+..+.+++.+.++..++++. ++...|+.|...-| T Consensus 318 Gc~vaLv~~~~~~~~~~~v~~~y~~~~~~~p~~~~~~p~~Ga~~~~ 363 (363) T PRK00555 318 GCVIALVPADRARDVADTVRRAAVTAGYDEPAVSRTYAAPGAGECR 363 (363) T ss_pred EEEEEEECHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCC T ss_conf 4899996687899999999999997379997499967777877789 No 8 >KOG1511 consensus Probab=100.00 E-value=0 Score=381.49 Aligned_cols=313 Identities=28% Similarity=0.341 Sum_probs=252.9 Q ss_pred CCEE-EEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHC-------------- Q ss_conf 8789-9971704999743533089428987767728999998689849999536444465661201-------------- Q gi|254781017|r 5 LHKI-CVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMF-------------- 69 (324) Q Consensus 5 ~~ki-~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~-------------- 69 (324) |.++ .+||||||+|||||+||||.+|++.+||+|+|+.+.+..++.+.+...+......+..+.+ T Consensus 1 ms~~l~vsaPGKvILfGEHAVVyg~~AlAaai~LrTyl~l~~san~~i~l~l~di~~~~~w~l~~~~~~l~~~~~~~~~~ 80 (397) T KOG1511 1 MSKVLLVSAPGKVILFGEHAVVYGRTALAAAIDLRTYLRLQTSANDRILLQLPDISIEKAWSLADFNGALPEQRSTYESV 80 (397) T ss_pred CCCEEEECCCCCEEEECCCEEEECCCEEEEEEECCEEEEEEECCCCEEEEECCCCCCEEEEEHHHHHHHHHHHHHHHHCC T ss_conf 97302405896279952310577770258896041269998458973899636677347887234332314665543213 Q ss_pred -C---------------------------CHHHHHHHHHHHHCCCCC----EEEEEECCCCCCCCCCCCCHHHHHHHHHH Q ss_conf -5---------------------------089999999997187787----38999826875354887402456655677 Q gi|254781017|r 70 -H---------------------------PSFSFIIMAINHIKPSCG----FDLKVISQLDSQLGLGSSAAITVAITAAL 117 (324) Q Consensus 70 -~---------------------------~~~~~i~~~l~~~~~~~g----~~i~i~s~iP~g~GLGSSaa~~va~~~al 117 (324) + +.+.+++..+....+..| +.+.++|++|+|+|||||||++|+++.++ T Consensus 81 q~p~~~~~~e~~k~l~~l~~~~~~~~~~~a~~~~lYlf~~l~~~~~g~lp~~~v~v~SelP~GaGLGSSAa~sv~lAtal 160 (397) T KOG1511 81 QTPASEVRVELLKQLGGLLENQEKVKEHLAGLSFLYLFLGLCLRAPGTLPALTVVVDSELPLGAGLGSSAAISVALATAL 160 (397) T ss_pred CCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH T ss_conf 78410033899997543332011356788999999999876404668886569998446787677650689999999999 Q ss_pred HHHHCCCCCC----------CHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEEC-CCEEEEEECCCCCCEEECCCC Q ss_conf 7640456666----------01455788887765214688754477662985899852-751567304777526765841 Q gi|254781017|r 118 LTLQYHKEPS----------PDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQMP-KYSIEKIDFIFPIHLIYSGYK 186 (324) Q Consensus 118 ~~~~~~~~~~----------~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~~-~~~~~~~~~~~~l~l~~tg~~ 186 (324) +.+....+.+ .+-+.+||++.|+.+||+|||+|++.|+|||++.|..+ +....+...+.+++|++|.++ T Consensus 161 l~~~g~i~~p~~~~~~~e~~l~Li~~WAf~gE~~iHGtpSGiDnaV~t~Gg~i~f~kg~~~~~Lk~~~~L~illtnTrv~ 240 (397) T KOG1511 161 LRLAGLIPPPGSNLSLAENDLALINKWAFEGEKCIHGTPSGIDNAVCTYGGLISFKKGVEIESLKHLPPLRILLTNTRVP 240 (397) T ss_pred HHHCCCCCCCCCHHCCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCEEEEECCCCCEECCCCCCCEEEEECCCCC T ss_conf 99735689985300013226999998875056011488765421221147647861476502036688733898716567 Q ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCC----CHH---HHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 798999988899886082348999999984033207889873048----644---5689999989999861678779999 Q gi|254781017|r 187 TPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNK----NLK---VLAQAMNRQQGLLETLGVSDSKLSE 259 (324) Q Consensus 187 ~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~----d~~---~l~~lm~~~~~~L~~lgvs~~~l~~ 259 (324) ++|+.+|+.|+.+..++| +..+.+++.+++++.++...+.++ +.. +|.++|..||.+|..||||+|+++. T Consensus 241 RnTk~lVa~Vr~~~~kfP---evi~~i~~aid~is~ea~~il~~e~~~~~~~~Eq~L~eLi~iNq~LL~alGVsH~~le~ 317 (397) T KOG1511 241 RNTKALVAGVRELLEKFP---EVIKAIFDAIDEISLEAVWILQRENDEFSSPKEQKLEELIRINQDLLDALGVSHPSLEL 317 (397) T ss_pred CCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 519999999999877535---88999999999999999999742445678827889999999717889875779488999 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC-CCCCCC Q ss_conf 999999719945997268997548999846653268999999999977978999414-545644 Q gi|254781017|r 260 IVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP-SHSTSL 322 (324) Q Consensus 260 li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~ 322 (324) ++..++.. | +.+|+||||+|||+++|.+++.+.+.++.+++.+...||+++...- ..|.++ T Consensus 318 v~~~t~k~-g-i~sKLTGAGgGGc~itlL~~~~~qe~i~~~ke~L~s~gf~v~~t~lGG~G~~v 379 (397) T KOG1511 318 VCTTTRKL-G-IHSKLTGAGGGGCVITLLKPGTEQEQIDKWKEELESHGFEVFETELGGPGVSV 379 (397) T ss_pred HHHHHHHH-C-CCEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCEEE T ss_conf 99999982-8-60001237878659999789986478888999998568625502588882588 No 9 >PRK05101 galactokinase; Provisional Probab=100.00 E-value=0 Score=369.38 Aligned_cols=319 Identities=20% Similarity=0.262 Sum_probs=245.3 Q ss_pred CCCCCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCC---CCCCC-------HHHCCC Q ss_conf 98887899971704999743533089428987767728999998689849999536444---46566-------120150 Q gi|254781017|r 2 GQCLHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQY---CGSLD-------LAMFHP 71 (324) Q Consensus 2 ~~~~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~---~~~~~-------~~~~~~ 71 (324) |..|. +.++|||||+|+|||+||+|+.|+++||++++++.+.+++++.+++.+...+. ..+++ ...|.+ T Consensus 16 G~~P~-~~~~APGRVnliGEHtDY~gG~Vlp~AI~~~~~ia~~~~~d~~~~v~s~~~~~~~~~~~l~~~~~~~~~~~W~~ 94 (382) T PRK05101 16 GYPPT-HTIQAPGRVNLIGEHTDYNDGFVLPCAINYQTVISCAKRDDRIVRVIAADYDNQQDEFSLDAPIVPHENYQWAN 94 (382) T ss_pred CCCCC-EEEECCCCCCCCCHHHHCCCCCEEEEECCCCEEEEEEECCCCEEEEEECCCCCCEEEEECCCCCCCCCCCCHHH T ss_conf 89985-89977674033203234289857868811257999998899859999888898525885687667787766799 Q ss_pred HHHHHHHHHHHHCC-CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-H Q ss_conf 89999999997187-787389998268753548874024566556777640456666014557888877652146887-5 Q gi|254781017|r 72 SFSFIIMAINHIKP-SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-I 149 (324) Q Consensus 72 ~~~~i~~~l~~~~~-~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~ 149 (324) ++..++..++...+ ..|+++.|.|+||+|+||+||||+.||++.++..+ ++..+++.+++++++.+|+.+.|.+|| | T Consensus 95 Yv~Gv~~~l~~~g~~~~G~d~~I~s~IP~GaGLSSSAAl~va~~~al~~l-~~~~~~~~ela~l~~~aE~~~~G~~~GiM 173 (382) T PRK05101 95 YVRGVVKHLQERNPDFGGADLVISGNVPQGAGLSSSASLEVAVGQTFQQL-YHLPLSGAEIALNGQEAENQFVGCNCGIM 173 (382) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHCCCCCCCCH T ss_conf 99999999997389988738999678888777541789999999999997-38998999999999999984017888716 Q ss_pred HHHHHHCCC---EEEEEECCCEEEEEECCC--CCCEEECCCCCCHH---------HHHHHHHH-----HH--------HH Q ss_conf 447766298---589985275156730477--75267658417989---------99988899-----88--------60 Q gi|254781017|r 150 DLAASIHGG---LICYQMPKYSIEKIDFIF--PIHLIYSGYKTPTA---------QVLKKISY-----IE--------IE 202 (324) Q Consensus 150 D~~a~~~Gg---~i~~~~~~~~~~~~~~~~--~l~l~~tg~~~~T~---------~~v~~v~~-----~~--------~~ 202 (324) ||+++++|+ .+++++.+...+.++++. .+++++|+.++... ++....+. ++ .. T Consensus 174 DQ~as~~g~~~~al~id~~~l~~~~v~lp~~~~~vi~dS~v~~~l~~~~Yn~R~~ec~~aa~~l~~~~Lrd~~~~~l~~~ 253 (382) T PRK05101 174 DQLISALGKKDHALLIDCRSLETKAVSMPEGVAVVIINSNVKRGLVDSEYNTRREQCETAARFFGVKALRDVTLEQFNAV 253 (382) T ss_pred HHHHHHHCCCCEEEECCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCHHHHHCCCHHHHHHH T ss_conf 78899852678288503678843665469996799953886543454505789999999999969241332899999987 Q ss_pred CCCHHH-HHHHHHHHHC--CCCHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHCCCCE-EEECCC Q ss_conf 823489-9999998403--3207889873048644568999998999986-1678779999999999719945-997268 Q gi|254781017|r 203 YPEINE-INQKIYALMG--KLSQISCQALRNKNLKVLAQAMNRQQGLLET-LGVSDSKLSEIVWKLREQPHIM-ASKISG 277 (324) Q Consensus 203 ~~~~~~-~~~~~~~~~~--~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~-lgvs~~~l~~li~~~~~~~g~~-~aklsG 277 (324) .+...+ ..+.....+. +.+.++.+||+++|++.||++||++|.+|++ ++||||++|.+++.+++..|.. |+|||| T Consensus 254 ~~~l~~~~~~Ra~Hvi~E~~Rv~~a~~aL~~~d~~~lG~Lm~~SH~Slr~~yevS~~elD~lv~~a~~~~g~~gGARmtG 333 (382) T PRK05101 254 AAELDPVVAKRARHVITENARTLEAASALAQGDLKRMGELMAESHASMRDDFEITVPQIDTLVEIVKAVIGDQGGVRMTG 333 (382) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 71089999999999999999999999998628999999999862798998770798899999999997618775468963 Q ss_pred CCCCCEEEEEECCCCCHHHHHHHHHH-HHHCCCE--EEEEECCCCCCC Q ss_conf 99754899984665326899999999-9977978--999414545644 Q gi|254781017|r 278 SGLGDCVIALGKGDLNSLPYQSVNCH-MHAKGID--IVPITPSHSTSL 322 (324) Q Consensus 278 AG~GG~~i~l~~~~~~~~~~~~~~~~-~~~~g~~--~~~~~~~~~~~~ 322 (324) ||||||+++|++.+..+.+++.+.+. ..+.|.+ .+.++|+.|..+ T Consensus 334 ~GfGGc~vaLv~~~~~~~~~~~v~~~Y~~~~g~~~~~~~~~ps~GA~~ 381 (382) T PRK05101 334 GGFGGCIVALVPEELVEAVRQAVAEQYEAKTGLKETFYVCKASQGAGL 381 (382) T ss_pred CCCCEEEEEEECHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC T ss_conf 676002899974778999999999999996499987899605688758 No 10 >COG0153 GalK Galactokinase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=364.84 Aligned_cols=320 Identities=22% Similarity=0.283 Sum_probs=256.4 Q ss_pred CCCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCC---C-------CHHHCCCHH Q ss_conf 887899971704999743533089428987767728999998689849999536444465---6-------612015089 Q gi|254781017|r 4 CLHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGS---L-------DLAMFHPSF 73 (324) Q Consensus 4 ~~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~---~-------~~~~~~~~~ 73 (324) ++..+.+.|||||+|+|||+||+|+.++++|||.++++.+++++|..+.+.+..++.... . ..+.|.+++ T Consensus 20 ~~~~~~~~aPGRvNLIGEHtDYn~G~VlP~Ain~~t~v~v~~r~d~~v~l~s~n~~~~~~~~~~~~d~~~~~~~~W~nYv 99 (390) T COG0153 20 VEPTVTAFAPGRVNLIGEHTDYNGGFVLPCAINYGTYVAVAKRDDGKVRLYSANFGNAGDIFFLLLDIAKEKIDDWANYV 99 (390) T ss_pred CCCCEEECCCCEEEEECCCEECCCCEEEEEEEECCEEEEEEECCCCEEEEEECCCCCCCCEEECCHHHCCCCCCHHHHHH T ss_conf 37645761785588601222126866788896054289999736855999967876334243044331445311033247 Q ss_pred HHHHHHHHHHCC-CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-HHH Q ss_conf 999999997187-787389998268753548874024566556777640456666014557888877652146887-544 Q gi|254781017|r 74 SFIIMAINHIKP-SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-IDL 151 (324) Q Consensus 74 ~~i~~~l~~~~~-~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~D~ 151 (324) +.+...++.... ..|+++.+.||||.|+||+||||+.||+..++.++ ++.++++.+++++++.+|+.+-|.+|| ||| T Consensus 100 kgvi~~l~~~g~~~~G~~i~i~gnIP~GaGLSSSAAleva~~~al~~l-~~~~~~k~~la~i~q~AEn~fvGvn~G~mDQ 178 (390) T COG0153 100 KGVIKALQKRGYAFTGLDIVISGNIPIGAGLSSSAALEVAVALALQRL-FNLPLDKAELAKIAQVAENQFVGVNCGIMDQ 178 (390) T ss_pred HHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 899999986587767705999667888888672689999999999997-3899988899999999875134776750889 Q ss_pred HHHHCC---CEEEEEECCCEEEEEECCCC---CCEEECCCCCCH---------HHHHHHHHHHHHHCCCHHHHHHH---- Q ss_conf 776629---85899852751567304777---526765841798---------99998889988608234899999---- Q gi|254781017|r 152 AASIHG---GLICYQMPKYSIEKIDFIFP---IHLIYSGYKTPT---------AQVLKKISYIEIEYPEINEINQK---- 212 (324) Q Consensus 152 ~a~~~G---g~i~~~~~~~~~~~~~~~~~---l~l~~tg~~~~T---------~~~v~~v~~~~~~~~~~~~~~~~---- 212 (324) .++.+| ..++.++.+...++.+++.. +++++|+.++.+ .++...++.+....+.+.+.... T Consensus 179 ~~s~~G~~~~al~ld~~~l~~~~~~~p~~~~~ivI~ns~vkr~la~seYn~Rr~ece~A~~~l~~~~~~L~d~~~~~~~~ 258 (390) T COG0153 179 LASAFGKKDHALLLDCRTLEYEPVPFPVGGVSIVIVNSNVKRELADSEYNERRAECEEAAEFLGVSIKSLRDVTDEEFAA 258 (390) T ss_pred HHHHHCCCCCEEEEECCCCCEEEECCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 99986877848999736662687304766459999637976654234789999999999999987550354337999996 Q ss_pred ----------HHHHH------CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHCCCCEEEEC Q ss_conf ----------99840------33207889873048644568999998999986-16787799999999997199459972 Q gi|254781017|r 213 ----------IYALM------GKLSQISCQALRNKNLKVLAQAMNRQQGLLET-LGVSDSKLSEIVWKLREQPHIMASKI 275 (324) Q Consensus 213 ----------~~~~~------~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~-lgvs~~~l~~li~~~~~~~g~~~akl 275 (324) ..++. .+.+.++.+||+++|+.+||++|+++|.+|++ ++||+|++|.+++.+....|++|+|| T Consensus 259 ~~~~i~~~~~~~rRa~hvv~En~Rvl~a~~Al~~~dl~~fG~Lm~~SH~slrddyevt~pElD~lve~a~~~~G~~GaRm 338 (390) T COG0153 259 LQAEIEVDPKIARRARHVVTENQRVLEAAKALRSGDLTEFGELMNESHESLRDDYEVTCPELDTLVEIALAAGGAYGARM 338 (390) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCEE T ss_conf 45241446688899999986777899999998718899999999998998874356656628999999987278650120 Q ss_pred CCCCCCCEEEEEECCCCCHHHHHHHH-HHHHHCCCE--EEEEECCCCCCCCC Q ss_conf 68997548999846653268999999-999977978--99941454564459 Q gi|254781017|r 276 SGSGLGDCVIALGKGDLNSLPYQSVN-CHMHAKGID--IVPITPSHSTSLYR 324 (324) Q Consensus 276 sGAG~GG~~i~l~~~~~~~~~~~~~~-~~~~~~g~~--~~~~~~~~~~~~~~ 324 (324) ||||||||+|+|++++.-..+.+.+. .|....|++ .+...+++|..+.+ T Consensus 339 TGaGfGGc~IaLv~~~~v~~~~e~v~~~y~~~~g~k~~~yv~~~~~G~~~~~ 390 (390) T COG0153 339 TGAGFGGCVIALVPNDDVEAVAEAVAEEYEKVTGLKAAFYVVEASQGAGVCD 390 (390) T ss_pred CCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCC T ss_conf 3788785599993220279999999876776407655389985067742369 No 11 >PTZ00290 galactokinase; Provisional Probab=100.00 E-value=0 Score=339.50 Aligned_cols=321 Identities=16% Similarity=0.168 Sum_probs=222.6 Q ss_pred CCCCCE-----EEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEEC---CCCEEEEEECCCCC-CC-----CCCHH Q ss_conf 988878-----999717049997435330894289877677289999986---89849999536444-46-----56612 Q gi|254781017|r 2 GQCLHK-----ICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLR---KDRLINIDSSLGQY-CG-----SLDLA 67 (324) Q Consensus 2 ~~~~~k-----i~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~---~~~~i~i~~~~~~~-~~-----~~~~~ 67 (324) |+-|.+ |.++|||||||+|||+||+|+.|++|||++++++.+.+. .++.+++.+...+. .. ..... T Consensus 27 ~~~~~~~~~~~~farAPGRVNLIGEHtDYngg~VLP~AI~~~t~i~va~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (468) T PTZ00290 27 KVENDADVEWLLFTFAPGRVNFIGEHVDYMGGYVCPAAVLEGCHILVGRVKHFCDHKLRFATETDEHFVLDHLGGAKHNK 106 (468) T ss_pred CCCCCCCCCCEEEEECCCEEEEECCCCCCCCCEEECEEECCCEEEEEEECCCCCCCEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 79986542124799778726540231022898546747016809999842578774699850677652157667788898 Q ss_pred HCCCHHHHHHH-HHHHHCC------CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC--------------- Q ss_conf 01508999999-9997187------7873899982687535488740245665567776404566--------------- Q gi|254781017|r 68 MFHPSFSFIIM-AINHIKP------SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKE--------------- 125 (324) Q Consensus 68 ~~~~~~~~i~~-~l~~~~~------~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~--------------- 125 (324) .|.+++..+.. .+++.+. ..|+++.++|+||.|+||+||||+.||++.++..+. ... T Consensus 107 ~W~nY~~Gv~~~~l~~~g~~~~~~~~~G~~~~v~g~VP~GsGLSSSAAleva~~~al~~~~-~~~~~~~~~~~~~~~~i~ 185 (468) T PTZ00290 107 AWTTFVRGAATLRLNRLGVAIDAPSLQGVCMVVHGTLPMGAGMSASASFGVALLNAINTVV-TRRYKGCPTSPGRRYSIL 185 (468) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCC T ss_conf 7631789999999997198767788860599998988774157899999999999999987-422344333432111234 Q ss_pred --CCCH---HHHHHHHHHHHHHCCCCCC-HHHHHHHCCC---EEEEEECCCEEEEEECC------CCCCEEECCCCCCH- Q ss_conf --6601---4557888877652146887-5447766298---58998527515673047------77526765841798- Q gi|254781017|r 126 --PSPD---EILTTAHAIVLKVQGISSG-IDLAASIHGG---LICYQMPKYSIEKIDFI------FPIHLIYSGYKTPT- 189 (324) Q Consensus 126 --~~~~---~i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg---~i~~~~~~~~~~~~~~~------~~l~l~~tg~~~~T- 189 (324) +++. +++.+|+.+|+.+-|.+|| |||+++++|. .+++++.+...+.++++ ..+++++|+.++.- T Consensus 186 ~~~sk~e~~~la~~aqraEn~yvGv~~GiMDQ~aS~~g~~g~al~iDc~~l~~e~v~~~p~~~~~~~~vi~nS~v~h~L~ 265 (468) T PTZ00290 186 PPMSKEELIELAKQARRIETEFCGVNVGIMDQFISAFAEEDKFMFLDCKSLTFESHDMTPLLGDGACFLLIDSMIKHDLL 265 (468) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCEEEEECCCCCEEECCCCCCCCCCCEEEEEECCCHHCCC T ss_conf 55567789999999999987105788640778899755579589997277753431578666787189998377302046 Q ss_pred -----------HHHHHHHHHHHHHC----C----------C-------HHHHHHHH--------HHHHCCCCH------H Q ss_conf -----------99998889988608----2----------3-------48999999--------984033207------8 Q gi|254781017|r 190 -----------AQVLKKISYIEIEY----P----------E-------INEINQKI--------YALMGKLSQ------I 223 (324) Q Consensus 190 -----------~~~v~~v~~~~~~~----~----------~-------~~~~~~~~--------~~~~~~i~~------~ 223 (324) .++....+.+.+.. | . ..+..... +++..++.. + T Consensus 266 ~sa~~~Yn~Rr~ece~a~~~l~~~~l~~~~~~~~~l~~~~~~~~~~~d~~~~l~~~~~~~~~~~~~Ra~HVv~E~~RVl~ 345 (468) T PTZ00290 266 GGTAGMYNTVRSDQEGAQKKIGKHRYRGKPFTFSDLVRNPKKYTFDGDVVAFMESCKPLMTPGEFERGTYNIMEQIRTLE 345 (468) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHEEECHHHHHHH T ss_conf 66423678999999999999818752145310777751624302113589999875642486663000046328899999 Q ss_pred H-------HHHHCCCCHHHHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHH Q ss_conf 8-------9873048644568999998999986-1678779999999999719945997268997548999846653268 Q gi|254781017|r 224 S-------CQALRNKNLKVLAQAMNRQQGLLET-LGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSL 295 (324) Q Consensus 224 ~-------~~al~~~d~~~l~~lm~~~~~~L~~-lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~ 295 (324) + ....+++++++||+|||++|.+|++ ++||||+||.|++.++..+|++|+||||||||||+|+|++.+..+. T Consensus 346 ~~~~~~~~~~l~~~~~~~~lG~LMneSH~Slrd~YEvScpELD~Lve~~~~~~Ga~GARmTGaGFGGC~VaLV~~~~ve~ 425 (468) T PTZ00290 346 FIKLNDPELPLSREERFRKAGEILNAGHQGMRDLMKITTPELDFIHELINEEKGVAGGRMMGGGFGGCIILLLKKNAVDR 425 (468) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEEECHHHHHH T ss_conf 99866655544122169999999997278898575898188999999997616985700037764021799955878999 Q ss_pred HHHHHHHHHHH-CCCE--EEEEECCCCCCCC Q ss_conf 99999999997-7978--9994145456445 Q gi|254781017|r 296 PYQSVNCHMHA-KGID--IVPITPSHSTSLY 323 (324) Q Consensus 296 ~~~~~~~~~~~-~g~~--~~~~~~~~~~~~~ 323 (324) +++.+.+++.+ .|+. ++...|..|..|+ T Consensus 426 f~~~v~~~Y~~~~g~~~~~~~~~~~~GA~V~ 456 (468) T PTZ00290 426 VVAHVREKFKARFGVENDVYPVVAGDGAFVV 456 (468) T ss_pred HHHHHHHHHHHHHCCCCCEEEEECCCCCEEE T ss_conf 9999999999975999858997227993899 No 12 >TIGR01220 Pmev_kin_Gr_pos phosphomevalonate kinase; InterPro: IPR005917 Protein kinases are a group of enzymes that possess a catalytic subunit which transfers the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. The enzymes fall into two broad classes, characterised with respect to substrate specificity: serine/threonine specific and tyrosine specific . Protein kinase function has been evolutionarily conserved from Escherichia coli to Homo sapiens. Protein kinases play a role in a mulititude of cellular processes, including division, proliferation, apoptosis, and differentiation . Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved , leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases . Phosphomevalonate kinase (2.7.4.2 from EC)catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This pathway starts with three molecules of acetyl-CoA, which, in a series of six different enzyme reactions, are converted to isopentanyl pyrophosphate (IPP), the basic C5 isoprene unit and a common intermediate for a number of pathways including isoprenoid and cholesterol biosynthesis. There are two unrelated types of PMK. The first type belongs to the GHMP kinase and includes a eukaryotic group which is typified by Saccharomyces cerevisiae ERG8 and a bacterial group typified by by Streptococcus pneumoniae PMK (MvaK2) . The second type includes animal PMKs, typified by human PMK . PMKs of the first type are present in eubacteria, fungi, and plants, while the second type is found only in animals, indicative of a nonorthologous gene displacement early in animal evolution . PMKs of the GHMP kinase type are closely related to mevalonate kinases. The two types of PMK have different consensus ATP-binding motifs: a protein kinase motif in the ERG8 orthologs versus a P-loop or Walker A motif in the animal orthologs. The fact that ERG8 orthologs are found in pathogenic eubacteria and fungi but not in humans makes them attractive targets for the development of antibacterial and/or antifungal drugs . This group of enzymes belongs to the GHMP kinase domain superfamily. GHMP kinases are a unique class of ATP-dependent enzymes (the abbreviation of which refers to the original members: galactokinase, homoserine kinase, mevalonate kinase, and phosphomevalonate kinase) . Enzymes belonging to this superfamily contain three well-conserved motifs, the second of which has the typical sequence Pro-X-X-X-Gly-Leu-X-Ser-Ser-Ala and is involved in ATP binding . The phosphate binding loop in GHMP kinases is distinct from the classical P-loops found in many ATP/GTP binding proteins. The bound ADP molecule adopts a rare syn conformation and is in the opposite orientation from those bound to the P-loop-containing proteins . GHMP kinases display a distinctly bilobal appearance with their N-terminal subdomains dominated by a mixed beta-sheet flanked on one side by alpha-helices and their C-terminal subdomains containing a four stranded anti-parallel beta-sheet , , , .. Probab=100.00 E-value=0 Score=332.87 Aligned_cols=305 Identities=28% Similarity=0.349 Sum_probs=248.6 Q ss_pred EEEECCEEEEEECCCEEEC-CCEEEEEEECCEEEEEEEECC---CCEEEEEECCC--------CCCCCCCHHHC------ Q ss_conf 9997170499974353308-942898776772899999868---98499995364--------44465661201------ Q gi|254781017|r 8 ICVSAPGSLVLMGEHGVLH-GHAALVFAINKRVILYLTLRK---DRLINIDSSLG--------QYCGSLDLAMF------ 69 (324) Q Consensus 8 i~~~aPGKv~L~GEh~v~~-G~~ai~~aI~~~~~~~~~~~~---~~~i~i~~~~~--------~~~~~~~~~~~------ 69 (324) |.++||||+++.|||+|+. |+|+|.+|+|+++.++++..+ +..-.|.|... ....++...+. T Consensus 1 I~v~aPGKLfvAGEYAVvePG~pailvAVDrfvtvtve~~~adt~~~~~i~S~~l~~~pv~w~r~~g~lv~~d~~dGqqA 80 (367) T TIGR01220 1 IEVKAPGKLFVAGEYAVVEPGYPAILVAVDRFVTVTVEDADADTDAADVIQSSDLSSEPVRWRRRDGKLVVDDADDGQQA 80 (367) T ss_pred CEEECCCCEEEEEEEEEECCCCCEEEEEECCEEEEEEECCCCCCCCCCCEEECCCCCCCCEEEEECCEEEEECCCCCHHH T ss_conf 90208963066201344377795599997565899986356754344514521147776034642685787237743556 Q ss_pred CCHHHHHHHHHHHHCC--------CCCEEEEEECCCC----CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 5089999999997187--------7873899982687----535488740245665567776404566660145578888 Q gi|254781017|r 70 HPSFSFIIMAINHIKP--------SCGFDLKVISQLD----SQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHA 137 (324) Q Consensus 70 ~~~~~~i~~~l~~~~~--------~~g~~i~i~s~iP----~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~ 137 (324) ...++||+.++..+.. ..-|++.|+|++. .+.|||||+|++|++++|++.| ++.+++.+++++||.. T Consensus 81 ~s~l~yVv~AIe~~e~ya~e~n~kl~~~hL~v~S~Ld~~~GrKyGLGSSgAVtV~tVkAln~F-ydl~Lsn~eifkLa~l 159 (367) T TIGR01220 81 ESALKYVVAAIEVVEKYARERNKKLKAYHLEVSSELDEADGRKYGLGSSGAVTVATVKALNAF-YDLELSNEEIFKLAVL 159 (367) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEECCCCHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHH T ss_conf 645787667899999997650971102676687641057884504662078999999999986-2877575678899898 Q ss_pred HHHHHCCCCCCHHHHHHHCCCEEEEE-----------------------ECCCEEEEEECCC--CCCEEECCCCCCHHHH Q ss_conf 77652146887544776629858998-----------------------5275156730477--7526765841798999 Q gi|254781017|r 138 IVLKVQGISSGIDLAASIHGGLICYQ-----------------------MPKYSIEKIDFIF--PIHLIYSGYKTPTAQV 192 (324) Q Consensus 138 ~E~~~~g~~sG~D~~a~~~Gg~i~~~-----------------------~~~~~~~~~~~~~--~l~l~~tg~~~~T~~~ 192 (324) ++...++++|+.|.++|+|||+|-|+ +|.+.+.+++.|. .+++.|||.|.||+++ T Consensus 160 a~~~lq~~gScGDiAaS~ygGWiAystfDhdwV~q~~~~~~v~~~L~~~WPgl~i~pL~aPk~l~lliGWTGsPASt~~L 239 (367) T TIGR01220 160 ATLELQSKGSCGDIAASVYGGWIAYSTFDHDWVKQKVEEVSVDEVLKKDWPGLSIEPLKAPKNLELLIGWTGSPASTADL 239 (367) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCHHHHHHCCCCCCEEECCCCCCCCEEEEEECCCCCCCHHH T ss_conf 64220689975411110111033305766688875541303588861477885022067998677988631788777678 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH------CCCCHHHHHHHHHHHH Q ss_conf 9888998860823489999999840332078898730486445689999989999861------6787799999999997 Q gi|254781017|r 193 LKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLLETL------GVSDSKLSEIVWKLRE 266 (324) Q Consensus 193 v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l------gvs~~~l~~li~~~~~ 266 (324) |..|++.+.+.. ...+++++.-..+++.++.||+.+|...+-+.+++||..|.+| ++.+|.|..||+.+.. T Consensus 240 Vs~vhr~k~~~~---a~y~rFl~~s~~CVe~~i~af~tg~i~~~~k~Ir~nR~~la~ld~e~gv~IeTe~Lk~LCD~aE~ 316 (367) T TIGR01220 240 VSEVHRRKEEKS---AAYERFLEKSRDCVESLITAFETGDIKSVQKEIRKNRQLLARLDDEVGVDIETEKLKALCDLAEA 316 (367) T ss_pred HHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHH T ss_conf 888874046762---36888887678999999864315773257888899999998510114202344436655426431 Q ss_pred CCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 19945997268997548999846653268999999999977978999414545 Q gi|254781017|r 267 QPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHS 319 (324) Q Consensus 267 ~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 319 (324) .+| +||-||||||||.|+|...+. ...+..+...|++.|+.++|+++-.+ T Consensus 317 ygg--AAK~SGAGGGDCGIai~d~~~-srdi~~~~~~We~~Gi~PLpl~~y~~ 366 (367) T TIGR01220 317 YGG--AAKSSGAGGGDCGIAILDAKA-SRDILRVRQEWEKDGIKPLPLKVYLE 366 (367) T ss_pred CCC--CCCCCCCCCCCHHHHEEHHHH-HCCCHHHHHHHHHCCCCCCCCEECCC T ss_conf 112--237788887441121220023-13605788888866887564300268 No 13 >TIGR00131 gal_kin galactokinase; InterPro: IPR000705 Galactokinase catalyses the first reaction in the galactose metabolism pathway, the ATP-dependent phosphorylation of galactose, yielding galactose-1-phosphate , . Deficiency in this enzyme results in the disease galactosemia, which is responsible for the formation of cataracts in newborn babies, and is possibly responsible for presenile cataracts in adults . In yeast, the GAL3 gene product is required for the GAL4-mediated induction of other enzymes involved in galactose metabolism. The induction of GAL1 production then reinforces this process, increasing the expression of other galactose-inducible genes. GAL3 has been shown to be similar to the GAL1 protein .; GO: 0004335 galactokinase activity, 0005524 ATP binding, 0006012 galactose metabolic process, 0046835 carbohydrate phosphorylation. Probab=100.00 E-value=0 Score=315.21 Aligned_cols=299 Identities=19% Similarity=0.271 Sum_probs=218.2 Q ss_pred CCEEEEE-CCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCC-CC---CC-------------CH Q ss_conf 8789997-1704999743533089428987767728999998689849999536444-46---56-------------61 Q gi|254781017|r 5 LHKICVS-APGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQY-CG---SL-------------DL 66 (324) Q Consensus 5 ~~ki~~~-aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~-~~---~~-------------~~ 66 (324) ...++++ |||||||+|||+||+++.|+++|||..+.+.+.+++|..+++.+...+- .. ++ .. T Consensus 15 ~~~~~~~faPGRVNLIGEHtDYnDg~VlPcAid~~t~~av~~rdd~~~~~~~~n~~na~~~e~slD~~~~~~e~~iDPSv 94 (500) T TIGR00131 15 KPDFTARFAPGRVNLIGEHTDYNDGFVLPCAIDFGTLVAVAERDDKNVRVYAANADNAKFDEFSLDLPLDGSEVAIDPSV 94 (500) T ss_pred CCCEEEEECCCCEEEEEEEEEECCCEECCCHHCCCCEEEEEEECCCCEEEEEECCCCCCCCCEECCCCCCCCCEEECCCH T ss_conf 97669997788462213124226857555010547168754304886068862588644310003763235530227770 Q ss_pred HHCCCHHHHHHHHHH----HHCC------C-CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-HHCCCCCCCHHHHHH Q ss_conf 201508999999999----7187------7-873899982687535488740245665567776-404566660145578 Q gi|254781017|r 67 AMFHPSFSFIIMAIN----HIKP------S-CGFDLKVISQLDSQLGLGSSAAITVAITAALLT-LQYHKEPSPDEILTT 134 (324) Q Consensus 67 ~~~~~~~~~i~~~l~----~~~~------~-~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~-~~~~~~~~~~~i~~l 134 (324) ..|.++++.++..++ ...| . .|++|.+.+|||.|+||+||||+.||+..++++ -.++...+..+++.. T Consensus 95 ~~WaNY~kgv~~~~qS~LKK~AP~R~~~~p~~G~~~v~~GnvP~G~GLSSSAa~~~avgL~v~qA~~g~~~~~~~~~~~~ 174 (500) T TIGR00131 95 SDWANYVKGVVKVAQSYLKKLAPERFNSFPLVGADIVISGNVPTGAGLSSSAALEVAVGLAVLQAQLGHLPLDSKQILLR 174 (500) T ss_pred HHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 00135443047888888887344445445755347899657788741338999999999999886158740247899886 Q ss_pred HHHHHHHHCCCCCC-HHHHHHHCCC---EEEEEEC-CCEEEEEECCC------CCCEEECCCCCC---H---------HH Q ss_conf 88877652146887-5447766298---5899852-75156730477------752676584179---8---------99 Q gi|254781017|r 135 AHAIVLKVQGISSG-IDLAASIHGG---LICYQMP-KYSIEKIDFIF------PIHLIYSGYKTP---T---------AQ 191 (324) Q Consensus 135 A~~~E~~~~g~~sG-~D~~a~~~Gg---~i~~~~~-~~~~~~~~~~~------~l~l~~tg~~~~---T---------~~ 191 (324) ++.+|..+-|..|| |||.++++|. .+.++++ .....+..++. -++|++|+.++. + .+ T Consensus 175 ~q~aenhfvGvNcG~MDQ~~s~~G~~dHaL~~~c~P~L~~~~~~~P~~KNH~~a~VI~Ntn~~~~KFEla~~~YNtRr~e 254 (500) T TIGR00131 175 GQKAENHFVGVNCGIMDQLISALGKEDHALLIDCRPSLKTTPFKLPQLKNHGVAVVIANTNVKRTKFELAPSEYNTRRQE 254 (500) T ss_pred HHHHHHCEEECCCCHHHHHHHHCCCCCCEEEEECCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEECCCCCCCHHHH T ss_conf 55564452510477378899652752533555237777844100786225645899961476455010148874622888 Q ss_pred HHHHHHHHHHHCC---------------------------------CHH------------------------------- Q ss_conf 9988899886082---------------------------------348------------------------------- Q gi|254781017|r 192 VLKKISYIEIEYP---------------------------------EIN------------------------------- 207 (324) Q Consensus 192 ~v~~v~~~~~~~~---------------------------------~~~------------------------------- 207 (324) +-...+.+...|. .|. T Consensus 255 c~~Aa~~l~~~Y~v~L~S~K~~S~~~~gaLRD~~~~~f~r~~N~~~PWNGDi~~Gia~~~~l~~lv~~S~~~KK~GF~v~ 334 (500) T TIGR00131 255 CETAANFLAKKYSVVLLSGKENSSLDKGALRDVMVEYFARYENIATPWNGDIESGIARLTKLLPLVAKSLARKKSGFSVD 334 (500) T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCHHCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHH T ss_conf 99999973377207882266765332000278876677654146789787455789999887899999985046885188 Q ss_pred -----------HHH-------------HHHHHHHCCC------CHHHHHHHC------CCC-HHHHHHHHHHHHHHHHH- Q ss_conf -----------999-------------9999840332------078898730------486-44568999998999986- Q gi|254781017|r 208 -----------EIN-------------QKIYALMGKL------SQISCQALR------NKN-LKVLAQAMNRQQGLLET- 249 (324) Q Consensus 208 -----------~~~-------------~~~~~~~~~i------~~~~~~al~------~~d-~~~l~~lm~~~~~~L~~- 249 (324) |+. -+++++..++ +-++.++++ ++| +..||+||+|+|.++++ T Consensus 335 ~~~~~LNCSREEFtRDYLtt~PvRFqvLKLY~RAkHv~~En~Rvl~A~~~l~~A~F~~~~dF~~~~G~LM~eSh~S~~dd 414 (500) T TIGR00131 335 EVATALNCSREEFTRDYLTTSPVRFQVLKLYQRAKHVVTENLRVLKAVKMLKSASFTADNDFLKELGKLMNESHASMRDD 414 (500) T ss_pred HHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHEECCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 88764203123431122104745666666753423110015578999998741133233048999989899998645207 Q ss_pred HCCCCHHHHHHHHHHHHCCC-CEEEECCCCCCCCEEEEEEC---CCCCHHHHHHHHHH Q ss_conf 16787799999999997199-45997268997548999846---65326899999999 Q gi|254781017|r 250 LGVSDSKLSEIVWKLREQPH-IMASKISGSGLGDCVIALGK---GDLNSLPYQSVNCH 303 (324) Q Consensus 250 lgvs~~~l~~li~~~~~~~g-~~~aklsGAG~GG~~i~l~~---~~~~~~~~~~~~~~ 303 (324) +++|+|++|++|..+....| .+|+||||||||||+|+|+| ++.-+.+.+.+.+. T Consensus 415 yE~tcPeiD~~v~~~~~~~G~~~GsRmTGaGfGGC~v~Lvp~G~~~~V~~v~~~~a~~ 472 (500) T TIGR00131 415 YEITCPEIDELVESIALVIGDSGGSRMTGAGFGGCTVALVPSGANENVDKVRKAVADR 472 (500) T ss_pred CEECCHHHHHHHHHHHHHCCCEECEEECCCCCCCEEEEEECCCCCCCHHHHHHHHHHH T ss_conf 1433511423877877643854110313787566078850488987538999998653 No 14 >COG2605 Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only] Probab=100.00 E-value=7.6e-42 Score=283.98 Aligned_cols=298 Identities=19% Similarity=0.180 Sum_probs=231.7 Q ss_pred EEECCEEEEEECCCEEE------CCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHH-HH Q ss_conf 99717049997435330------894289877677289999986898499995364444656612015089999999-99 Q gi|254781017|r 9 CVSAPGSLVLMGEHGVL------HGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMA-IN 81 (324) Q Consensus 9 ~~~aPGKv~L~GEh~v~------~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~-l~ 81 (324) ..+||-|+.+.|.-+|+ ||+.+++++||+|+|+++.+..++.|.+..+..+...+.......|.+..++.. +. T Consensus 3 i~raPLRItfgGGGTDvepy~~k~GGaVlnatIdky~y~~i~~~~d~~I~~~~~~~~~v~~~~~~~h~~~~~~~l~r~~l 82 (333) T COG2605 3 ISRAPLRITFGGGGTDVEPYCSKHGGAVLNATIDKYIYVTIEKGFDDEIRVRYDRTEFVKSYLENEHKPLVVESLKRDFL 82 (333) T ss_pred CCCCCEEEEECCCCCCCHHHHHHCCCEEEEEEEEEEEEEEECCCCCCEEEEECCHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 02055389852787676077875387777755310799997168885389952357765126765376189999999987 Q ss_pred HHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-HHHHHHHCCCEE Q ss_conf 7187787389998268753548874024566556777640456666014557888877652146887-544776629858 Q gi|254781017|r 82 HIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-IDLAASIHGGLI 160 (324) Q Consensus 82 ~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg~i 160 (324) .+.....+++...+|+|+|+|||||||+.||++.|+..+. +..+++++++++|+.+|+...+...| +||++++|||+. T Consensus 83 ~~~g~~~~el~~~~D~P~GSGLGSSSa~vvaLl~a~~~~k-g~~~~~~~LA~eAy~IER~~l~~~gG~QDqYaaA~GGFn 161 (333) T COG2605 83 EFNGGTPIELHTQSDAPPGSGLGSSSAFVVALLNALHAWK-GESLGPYELAREAYEIEREDLKIVGGKQDQYAAAFGGFN 161 (333) T ss_pred HHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCE T ss_conf 3149974699984378999987734799999999999985-787898999999999889985454666237899847921 Q ss_pred EEEECC---CEEEEEE--------CCCCCCEEECCCCCCHHHHHHH-HHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 998527---5156730--------4777526765841798999988-899886082348999999984033207889873 Q gi|254781017|r 161 CYQMPK---YSIEKID--------FIFPIHLIYSGYKTPTAQVLKK-ISYIEIEYPEINEINQKIYALMGKLSQISCQAL 228 (324) Q Consensus 161 ~~~~~~---~~~~~~~--------~~~~l~l~~tg~~~~T~~~v~~-v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al 228 (324) +.++.. ..+.++. +..+++|+|||..+.+++.+++ ++.....+.+..+ .+..|.+++.+..+++ T Consensus 162 fMEf~~~~~V~v~pL~i~~e~~~Ele~~~lL~yTGi~R~Ss~V~~dQ~~~~~~~~~~~~e----~~~~mk~~A~~~~~al 237 (333) T COG2605 162 FMEFRGNGEVVVNPLRINRERTAELEARLLLYYTGITRQSSEVIEDQVRNVVDGDEETLE----ALHEMKALAYEMKDAL 237 (333) T ss_pred EEEECCCCCEEEEECCCCHHHHHHHHHCEEEEEECCCCCHHHHHHHHHHHHHCCCHHHHH----HHHHHHHHHHHHHHHH T ss_conf 799748983898003344668999874069988164022558899999876315188999----9999999999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 0486445689999989999861--67877999999999971994599726899754899984665326899999999997 Q gi|254781017|r 229 RNKNLKVLAQAMNRQQGLLETL--GVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHA 306 (324) Q Consensus 229 ~~~d~~~l~~lm~~~~~~L~~l--gvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~ 306 (324) -++|+..|+++|+.+|+..+++ .+|+|.+|++++.+. .+|++|+|++|||+||+++.+|+|...... .+.+.. T Consensus 238 ~~nd~~~f~~~l~~gW~~KK~ls~~ISN~~IDriy~~A~-~~GA~~gKl~GaG~gGFllf~~~p~k~~~l----~r~l~~ 312 (333) T COG2605 238 VRNDIPEFGQILDRGWEAKKKLSSRISNDAIDRIYELAL-KNGAYGGKLSGAGGGGFLLFFCDPSKRNEL----ARALEK 312 (333) T ss_pred HHCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH-HCCCHHCEEECCCCCCEEEEEECCCCHHHH----HHHHHH T ss_conf 863357799999767776455335767477899999998-667322223026886279999685424899----999998 Q ss_pred CCCEEEEEEC Q ss_conf 7978999414 Q gi|254781017|r 307 KGIDIVPITP 316 (324) Q Consensus 307 ~g~~~~~~~~ 316 (324) ++....+.+. T Consensus 313 ~~~~~~~~~F 322 (333) T COG2605 313 EQGFVVDTSF 322 (333) T ss_pred HCCCEEEEEE T ss_conf 4497378776 No 15 >PRK13412 fkp bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional Probab=100.00 E-value=7.2e-33 Score=227.17 Aligned_cols=302 Identities=14% Similarity=0.159 Sum_probs=217.4 Q ss_pred CCEEEE-ECCEEEEEECCCEE------ECCCEEEEEEECC----EEEEEEEECCCCEEEEEECCCCCCCC-------CCH Q ss_conf 878999-71704999743533------0894289877677----28999998689849999536444465-------661 Q gi|254781017|r 5 LHKICV-SAPGSLVLMGEHGV------LHGHAALVFAINK----RVILYLTLRKDRLINIDSSLGQYCGS-------LDL 66 (324) Q Consensus 5 ~~ki~~-~aPGKv~L~GEh~v------~~G~~ai~~aI~~----~~~~~~~~~~~~~i~i~~~~~~~~~~-------~~~ 66 (324) .++|+. ++|.||.|.|.-+| .+|+.|+.+||.. -+.+.+++..+..+.+.+........ .+. T Consensus 606 ~dqiv~~r~PvRIDlaGGWTDTPPy~~E~GG~VlNvAI~l~G~~Pi~~~v~~~~e~~i~l~s~d~g~~~~~~s~~~l~~y 685 (974) T PRK13412 606 SDQIVWGRSPVRIDLAGGWTDTPPYCLYSGGNVVNIAIELNGQPPLQVYVKPCSEFHIVLRSIDMGAMEVVRTNEELQDY 685 (974) T ss_pred HHHEEEECCCEEEECCCCCCCCCCCHHCCCCEEEEEEEEECCCCCCEEEEEECCCCEEEEEECCCCCEEEECCHHHHHCC T ss_conf 00025630754884057736899431105967999999867977736898506886589995278754895566786154 Q ss_pred HHCCCHHHHHHHHH---------------------HHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20150899999999---------------------971877873899982687535488740245665567776404566 Q gi|254781017|r 67 AMFHPSFSFIIMAI---------------------NHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKE 125 (324) Q Consensus 67 ~~~~~~~~~i~~~l---------------------~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~ 125 (324) ...+.++....+++ +.+ ..|++|+..+++|.|||||+||.+.+|++.|+..+. +.. T Consensus 686 ~~p~~pfaL~KAal~~~G~~p~~~~~~~~sL~e~L~~f--g~GlEi~t~s~lP~GSGLGTSSILA~avl~Al~~~~-g~~ 762 (974) T PRK13412 686 KKVGSPFSIPKAALCLAGFAPAFSAESYASLEEQLKAF--GSGIEITLLAAIPAGSGLGTSSILASTVLGAISDFC-GLA 762 (974) T ss_pred CCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHC--CCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH-CCC T ss_conf 78999514788999981785321235632399999860--896588862378887886388999999999999983-899 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCC-HHHHHHHCCCEEEEEECCC-----EEEEEE--------CCCCCCEEECCCCCCHHH Q ss_conf 66014557888877652146887-5447766298589985275-----156730--------477752676584179899 Q gi|254781017|r 126 PSPDEILTTAHAIVLKVQGISSG-IDLAASIHGGLICYQMPKY-----SIEKID--------FIFPIHLIYSGYKTPTAQ 191 (324) Q Consensus 126 ~~~~~i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg~i~~~~~~~-----~~~~~~--------~~~~l~l~~tg~~~~T~~ 191 (324) .+.+++++....+|+.. +.++| +||+.-+|+|+-+.+..+. .+..++ +..+++++|||..+..+. T Consensus 763 ~d~~~l~~~vL~lEQlL-tTGGGWQDQ~GG~~~GiK~~~s~pg~~~~~~V~~L~~~~~~~~el~~~llLvyTG~tRlAk~ 841 (974) T PRK13412 763 WDKNEICNRTLVLEQLL-TTGGGWQDQYGGVLPGVKLLQTEAGFQQSPLVRWLPDSLFTQPEYEDCHLLYYTGITRTAKG 841 (974) T ss_pred CCHHHHHHHHHHHHHHH-CCCCCHHHHCCCCCCCEEEEECCCCCCCCCEEEEECCCHHHHHHHHHCEEEEECCCCHHHHH T ss_conf 88899999999999886-15882554016742660466337998666246760561766898861479999183065768 Q ss_pred HHHHH-HHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHCC Q ss_conf 99888-998860823489999999840332078898730486445689999989999861--678779999999999719 Q gi|254781017|r 192 VLKKI-SYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLLETL--GVSDSKLSEIVWKLREQP 268 (324) Q Consensus 192 ~v~~v-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l--gvs~~~l~~li~~~~~~~ 268 (324) +.+.| +.+..+.+...+..+ .+.+.+.+..+|+.++|++.||++|+++|..-+.| |+++|.++++++.+ .+ T Consensus 842 iL~~VVr~~~~r~~~~l~~l~----~l~~~A~~m~~Al~~g~~~~~g~~l~~~W~q~k~Ldpg~~~~~V~~l~~~i--~p 915 (974) T PRK13412 842 ILAEIVSSMFLNSTAHLQLLH----EMKAHALDMAEAIQRGEFEEFGRLVGKTWEQNKALDSGTNPAAVEAIIETI--KD 915 (974) T ss_pred HHHHHHHHHHHCCHHHHHHHH----HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH--HH T ss_conf 999999999845788999999----999999999999982899999999999999875008899958999999975--22 Q ss_pred CCEEEECCCCCCCCEEEEEECCCCCHHH-HHHHHHHHHHCCCEEEEEEC Q ss_conf 9459972689975489998466532689-99999999977978999414 Q gi|254781017|r 269 HIMASKISGSGLGDCVIALGKGDLNSLP-YQSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 269 g~~~aklsGAG~GG~~i~l~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~ 316 (324) .++|.|++||||||+++.++|....+.. .+.+...-...+-+..+.+- T Consensus 916 ~~~G~kL~GAGGGGfl~~~aKdp~~a~~ir~~L~~~~p~~~arfve~s~ 964 (974) T PRK13412 916 YTLGYKLPGAGGGGYLYMVAKDPGAAERIRKILTEQAPNPRARFVEMSL 964 (974) T ss_pred HHHCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCEEEEEEE T ss_conf 3310414556777679999748778999999997379899828999886 No 16 >KOG0631 consensus Probab=100.00 E-value=4.5e-32 Score=222.12 Aligned_cols=303 Identities=18% Similarity=0.174 Sum_probs=189.1 Q ss_pred EEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCC-----CCCC---------HHHCCCH Q ss_conf 8999717049997435330894289877677289999986898499995364444-----6566---------1201508 Q gi|254781017|r 7 KICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYC-----GSLD---------LAMFHPS 72 (324) Q Consensus 7 ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~-----~~~~---------~~~~~~~ 72 (324) .+.+++|||+||+|||+||+|+.|++||||..+.+.+.++++....+....++.. ..++ ...|..+ T Consensus 39 ~~~a~~PgRVnLiGEHiDy~~~sVlpmaid~~~l~~~~~~~d~~~sl~~tN~~~~f~~~~~~~p~~~~~I~~~~~~w~ny 118 (489) T KOG0631 39 VFVARAPGRVNLIGEHIDYCGYSVLPMAIDVDTLIAVAPSDDGIVSLRLTNFNPDFIYFKYPLPSIVWQIDPDVSKWENY 118 (489) T ss_pred EEEEECCCCEECCCCEEEECCCEEEEEEEEEEEEEEEEECCCCCEEEEEECCCCCCCEEECCCCCHHCCCCCCCCCHHHH T ss_conf 08985687400132113405733545776410079997757884468973279863014634770110047996302210 Q ss_pred H----HHHHHHHHH--HCCC--CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC--CCCHHHHHHHHHHHHHH Q ss_conf 9----999999997--1877--873899982687535488740245665567776404566--66014557888877652 Q gi|254781017|r 73 F----SFIIMAINH--IKPS--CGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKE--PSPDEILTTAHAIVLKV 142 (324) Q Consensus 73 ~----~~i~~~l~~--~~~~--~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~--~~~~~i~~lA~~~E~~~ 142 (324) + +.+...++. -... -|+.+..++++|+|+||.||||+.++...|..++..+.+ ..+.++..+....|+.+ T Consensus 119 ~~C~~~g~h~~~~~~~~~~~~~vGl~~l~~g~vPtgsgLsSsaa~~c~a~lA~~~~~~gpn~~~~kkd~~~i~~~ae~~~ 198 (489) T KOG0631 119 FYCGMKGFHEYIKRKPVRFEPPVGLSILNDGSVPTGSGLSSSAAWLCAAALATLKLNLGPNFIISKKDLATITVVAESYI 198 (489) T ss_pred HCCCHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHCCEEEEECCC T ss_conf 02551889999721553337876538986478789988443589999999999997608873421555405217863126 Q ss_pred CCCCCCHHHHHHHCCC---EEEEE--ECCCEEEEEECCCC--CCEEECCCCCC--------------------------- Q ss_conf 1468875447766298---58998--52751567304777--52676584179--------------------------- Q gi|254781017|r 143 QGISSGIDLAASIHGG---LICYQ--MPKYSIEKIDFIFP--IHLIYSGYKTP--------------------------- 188 (324) Q Consensus 143 ~g~~sG~D~~a~~~Gg---~i~~~--~~~~~~~~~~~~~~--l~l~~tg~~~~--------------------------- 188 (324) .-..+||||+++++|- .++.+ +.+.....++++-. +++.++-...+ T Consensus 199 G~~~gGmdq~asvl~~~~~Al~v~~~~~Pf~~~~lk~~~~~vfvI~~~L~~~nk~~~a~tnynlRv~E~~ia~~~la~k~ 278 (489) T KOG0631 199 GLNSGGMDQAASVLAEKGHALLVDPYFTPFRRSMLKLPDGGVFVIANSLVESNKAETAETNYNLRVVEGTIAAGELAAKI 278 (489) T ss_pred CCCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEEEECHHHHHCCHHHHHHHHHCEEEEEEHHHHHHHHHH T ss_conf 75877478899998733745884166785445565577786688834366532301256652010576540267787776 Q ss_pred ---HH------------------------HHHHHHHHHHHHCCC-HHHH-----------HH-------------HHHHH Q ss_conf ---89------------------------999888998860823-4899-----------99-------------99984 Q gi|254781017|r 189 ---TA------------------------QVLKKISYIEIEYPE-INEI-----------NQ-------------KIYAL 216 (324) Q Consensus 189 ---T~------------------------~~v~~v~~~~~~~~~-~~~~-----------~~-------------~~~~~ 216 (324) +. ++...|.+..+..+. ..+. .+ ++..+ T Consensus 279 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~v~~~~~~e~f~~ee~~~~l~~~~~~f~~~~~T~~~v~~~~~k~~~r 358 (489) T KOG0631 279 LVELPAYILRYQLQRAWRGDIGEGYERAEEMLGLVEESLKPEGFNIEEVARALGLDTEEFLQSLLTLAAVDLQVKKLYQR 358 (489) T ss_pred HCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 03428888765443013565306677777787777865176788888987785304588998760242134588888988 Q ss_pred HCCC------CHHHHHHHC----C--CCHHHHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCE Q ss_conf 0332------078898730----4--8644568999998999986-1678779999999999719945997268997548 Q gi|254781017|r 217 MGKL------SQISCQALR----N--KNLKVLAQAMNRQQGLLET-LGVSDSKLSEIVWKLREQPHIMASKISGSGLGDC 283 (324) Q Consensus 217 ~~~i------~~~~~~al~----~--~d~~~l~~lm~~~~~~L~~-lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~ 283 (324) ..++ +.++.++.. + +.+..||+|||++|.+.+. ++.|||++++|++.+... |++++|+|||||||| T Consensus 359 akHv~sea~rv~q~~~~~~~a~~~~d~~~~~~g~LmneS~~Sc~~~yEcscpel~qL~kiala~-g~~gaRlTGaGwGGc 437 (489) T KOG0631 359 AKHVYSEALRVLQEEKLCARAPGRADGFLADFGRLMNESHRSCDVLYECSCPELDQLCKIALAN-GGVGARLTGAGWGGC 437 (489) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC-CCCCCEEECCCCCCC T ss_conf 9999988888887899985496320000888888861556788887754887299999999966-886553632554640 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHCCCE Q ss_conf 999846653268999999999977978 Q gi|254781017|r 284 VIALGKGDLNSLPYQSVNCHMHAKGID 310 (324) Q Consensus 284 ~i~l~~~~~~~~~~~~~~~~~~~~g~~ 310 (324) +++|++.+..+...+.+.+...++-+. T Consensus 438 ~v~lvp~d~~~~~~~~~~~~~Y~ka~~ 464 (489) T KOG0631 438 TVALVPADLVDFAVAALKEIYYEKAYP 464 (489) T ss_pred EEEECCCCCHHHHHHHHHHHHHCCCCC T ss_conf 012045320588987633555302560 No 17 >COG3890 ERG8 Phosphomevalonate kinase [Lipid metabolism] Probab=99.97 E-value=1.2e-28 Score=200.55 Aligned_cols=302 Identities=22% Similarity=0.273 Sum_probs=200.4 Q ss_pred CCEEEEECCEEEEEECCCEEEC-CCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHH-CCCHHHHHHHHHHH Q ss_conf 8789997170499974353308-942898776772899999868984999953644446566120-15089999999997 Q gi|254781017|r 5 LHKICVSAPGSLVLMGEHGVLH-GHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAM-FHPSFSFIIMAINH 82 (324) Q Consensus 5 ~~ki~~~aPGKv~L~GEh~v~~-G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~-~~~~~~~i~~~l~~ 82 (324) |.++.+|||||+.|+|+|+++. |..+.+.|++++.......-.+...++..... ...+... .+....+|...+.. T Consensus 1 M~~~~fSaPGk~LlaGdYs~lv~glsahaia~nkr~~cs~vrvKskdswlfhisi---~g~~knsfiEl~~~~Vf~yF~~ 77 (337) T COG3890 1 MDLISFSAPGKLLLAGDYSILVEGLSAHAIAINKRAFCSFVRVKSKDSWLFHISI---KGIDKNSFIELRSDFVFKYFAY 77 (337) T ss_pred CCEEEECCCCCEEEECCCEEEEECCEEEEEEECCCCCCCEEEECCCCEEEEEEEE---CCCCCCHHHHHHHHHHHHHHHH T ss_conf 9524405887369840311355233124787305552013674267458987330---1368871899999999999875 Q ss_pred HCCCCCEEEEEECC---------C---CCCCCCCCCCHHHHHHH-HHHHHHHCCCCCCC-HHHHHHHHHHHHHHCC-CCC Q ss_conf 18778738999826---------8---75354887402456655-67776404566660-1455788887765214-688 Q gi|254781017|r 83 IKPSCGFDLKVISQ---------L---DSQLGLGSSAAITVAIT-AALLTLQYHKEPSP-DEILTTAHAIVLKVQG-ISS 147 (324) Q Consensus 83 ~~~~~g~~i~i~s~---------i---P~g~GLGSSaa~~va~~-~al~~~~~~~~~~~-~~i~~lA~~~E~~~~g-~~s 147 (324) ..+.|..++.+.++ + -.++||||||++++.++ .+++.+..+.+++. .+|.++|+.+++..|| .+| T Consensus 78 ~~~~c~qnlev~~ddayy~q~~f~~d~g~KtGlGSSAa~~tsLt~~lfls~~~~~nvd~k~eIhklaqiAhc~aQggIGS 157 (337) T COG3890 78 LSQNCFQNLEVFADDAYYDQSNFFFDDGTKTGLGSSAAVATSLTCGLFLSHANATNVDEKGEIHKLAQIAHCYAQGGIGS 157 (337) T ss_pred HHHHCCCCCEEECCHHHCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 41422333223232321056322214776577762468899999999998860565016789999999988997179776 Q ss_pred CHHHHHHHCCCEEEEEECCCEEEEE------EC-CCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC Q ss_conf 7544776629858998527515673------04-7775267658417989999888998860823489999999840332 Q gi|254781017|r 148 GIDLAASIHGGLICYQMPKYSIEKI------DF-IFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKL 220 (324) Q Consensus 148 G~D~~a~~~Gg~i~~~~~~~~~~~~------~~-~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i 220 (324) |+|+++++||++++.++++.-++++ ++ .+.+++.+++....|..+|++|.++++.+. ++..+ ..+ ++ T Consensus 158 GfDiaaA~fGsiiyrRF~p~li~~l~qig~~nfg~y~LmmGd~a~gSeTvglV~kv~~w~ns~~--peelk-c~d---e~ 231 (337) T COG3890 158 GFDIAAAIFGSIIYRRFEPGLIPKLRQIGAVNFGDYYLMMGDQAIGSETVGLVCKVNKWRNSIL--PEELK-CND---EM 231 (337) T ss_pred CCHHHHHHHCCEEEEECCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCC--HHHHH-HHH---HH T ss_conf 4035676623157863483100345763750306765662232368553489999987763378--58877-678---88 Q ss_pred CHHHHHHHCCCCHHHHHHHHHHHHHHHHHH-----C-CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCH Q ss_conf 078898730486445689999989999861-----6-7877999999999971994599726899754899984665326 Q gi|254781017|r 221 SQISCQALRNKNLKVLAQAMNRQQGLLETL-----G-VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNS 294 (324) Q Consensus 221 ~~~~~~al~~~d~~~l~~lm~~~~~~L~~l-----g-vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~ 294 (324) ...+.+.++. .+++.-.....-+..|+.+ + ...+...++.....+..|++++-+.|||||+.+++|.++.. T Consensus 232 nsrvl~l~~~-sLdE~~s~~~~irrslrritee~~adiep~kqt~~Ld~i~~l~gvl~~lipgaGggdaif~l~~~~~-- 308 (337) T COG3890 232 NSRVLNLIKL-SLDESNSKSKAIRRSLRRITEEGLADIEPIKQTAALDSIFDLLGVLCDLIPGAGGGDAIFLLYRPNI-- 308 (337) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCH-- T ss_conf 8999988862-0888999999999999999997523333411355400677602754750246788753799845628-- Q ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCC Q ss_conf 8999999999977978999414545644 Q gi|254781017|r 295 LPYQSVNCHMHAKGIDIVPITPSHSTSL 322 (324) Q Consensus 295 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 322 (324) ++.+.|...|+.+++++.++|.+. T Consensus 309 ----e~re~w~~~gi~i~dv~adeGlr~ 332 (337) T COG3890 309 ----EAREAWLSKGISIVDVNADEGLRF 332 (337) T ss_pred ----HHHHHHHHCCEEEEECCCCCCCCC T ss_conf ----988878757807997144436112 No 18 >PRK01212 homoserine kinase; Provisional Probab=99.97 E-value=2.5e-28 Score=198.39 Aligned_cols=284 Identities=18% Similarity=0.178 Sum_probs=201.1 Q ss_pred CCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHC Q ss_conf 87899971704999743533089428987767728999998689849999536444465661201508999999999718 Q gi|254781017|r 5 LHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIK 84 (324) Q Consensus 5 ~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~ 84 (324) |++|.+++|+-.--+|- |+.++.+||++|.++++++.+ ..+.+.... ...++.+..+-..+.....+++.. T Consensus 1 ~~~v~v~vPAtsANlG~-----GFD~lg~Al~l~d~v~v~~~~-~~~~~~g~~---~~~lP~d~~Nlv~~a~~~~~~~~g 71 (293) T PRK01212 1 MMMVKVYVPASSANLGP-----GFDSLGLALSLYDEVTVEEAD-FSLEVEGEF---ADKLPLDEENLVYQAALKFLEKLG 71 (293) T ss_pred CEEEEEEECCCHHHCCC-----CHHHHHCCCCCCCEEEEEECC-CEEEEEECC---CCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 90799993243663573-----746665456777599999868-789998168---666999998179999999999839 Q ss_pred CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHH-HHHCCCEEEEE Q ss_conf 77873899982687535488740245665567776404566660145578888776521468875447-76629858998 Q gi|254781017|r 85 PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLA-ASIHGGLICYQ 163 (324) Q Consensus 85 ~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~-a~~~Gg~i~~~ 163 (324) ...|++|+++++||.++|||||||.+||.+.+++.+ ++.+++.+++.++|.++| |+| |+. .|.+||..... T Consensus 72 ~~~g~~I~i~k~IP~~~GLGsssa~aaa~l~a~n~l-~~~~ls~~eL~~lA~~~e----~H~---DNv~P~~~GG~~~~~ 143 (293) T PRK01212 72 KPPGLRLELEKNIPLGRGLGSSAASIVAALVAANEL-LGLPLSKEELLQLAGEGE----GHP---DNVAPALLGGLVLAL 143 (293) T ss_pred CCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHC----CCC---CCCCCEEECCEEEEE T ss_conf 998879999718986566575188999999999998-179989899999999842----676---655774526778996 Q ss_pred -ECCCEEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHH Q ss_conf -527515673047--77526765841798999988899886082348999999984033207889873048644568999 Q gi|254781017|r 164 -MPKYSIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAM 240 (324) Q Consensus 164 -~~~~~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm 240 (324) ..+....+++.+ +.++++..+...+|+++.+.+ .++++. .+... ..+ .......++.++|++.+++.| T Consensus 144 ~~~~~~~~~~~~~~~~~~vl~~P~~~vsT~~aR~~L---p~~~~~-~dav~----~~~-~~~~l~~al~~~d~~~l~~~~ 214 (293) T PRK01212 144 EENGVISVKIPVFDDWKWVVAIPGIELSTAEARAVL---PKQYSR-KDAVF----NSS-RLALLVAALYTGDYELAGAAM 214 (293) T ss_pred CCCCEEEEECCCCCCEEEEEEECCCCCCHHHHHHHC---CCCCCH-HHHHH----HHH-HHHHHHHHHHCCCHHHHHHHH T ss_conf 279648997478876489999489674489999856---032887-89886----553-437889999839999999874 Q ss_pred HHH-HHHHH-HHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCH-HHHHHHHHHHHHCCCEEEEEECC Q ss_conf 998-99998-6167877999999999971994599726899754899984665326-89999999999779789994145 Q gi|254781017|r 241 NRQ-QGLLE-TLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNS-LPYQSVNCHMHAKGIDIVPITPS 317 (324) Q Consensus 241 ~~~-~~~L~-~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~ 317 (324) +.. |+--+ .+ .|+++++.+.+. ..|++++.|||||+ ++++|++++..+ .+.+.+++.+...++ ++.+..| T Consensus 215 ~D~lhep~r~~l---iP~~~~i~~~~~-~~GA~~~~lSGSGp--tv~al~~~~~~A~~i~~~l~~~~~~~~~-v~~l~~d 287 (293) T PRK01212 215 KDVLHEPYRAKL---IPGFAEVRQAAL-EAGALGAGISGAGP--TVFALCDKEDAAEAVADALEKAFLQEGF-VHVLRLD 287 (293) T ss_pred CCHHHHHHHHHH---CCCHHHHHHHHH-HCCCCEEEEECHHH--HEEEEECCHHHHHHHHHHHHHHHHHCCE-EEEECCC T ss_conf 002235678763---856999999999-78997899987411--1248977889999999999997643888-9990469 Q ss_pred -CCCC Q ss_conf -4564 Q gi|254781017|r 318 -HSTS 321 (324) Q Consensus 318 -~~~~ 321 (324) +|.+ T Consensus 288 ~~G~~ 292 (293) T PRK01212 288 TAGAR 292 (293) T ss_pred CCCCC T ss_conf 99888 No 19 >TIGR01219 Pmev_kin_ERG8 phosphomevalonate kinase; InterPro: IPR016005 Phosphomevalonate kinase (2.7.4.2 from EC) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. In an example of nonorthologous gene displacement, two different types of phosphomevalonate kinase are found - the higher eukaryotic form and the ERG8 type . This model represents plant and fungal forms of the ERG8 type of phosphomevalonate kinase.. Probab=99.97 E-value=6e-29 Score=202.37 Aligned_cols=308 Identities=19% Similarity=0.287 Sum_probs=212.8 Q ss_pred EEECCEEEEEECCCEEEC-CCEEEEEEECCEEEEEEEECCCC-----------EEEEEEC---CCCCCCCCCHH------ Q ss_conf 997170499974353308-94289877677289999986898-----------4999953---64444656612------ Q gi|254781017|r 9 CVSAPGSLVLMGEHGVLH-GHAALVFAINKRVILYLTLRKDR-----------LINIDSS---LGQYCGSLDLA------ 67 (324) Q Consensus 9 ~~~aPGKv~L~GEh~v~~-G~~ai~~aI~~~~~~~~~~~~~~-----------~i~i~~~---~~~~~~~~~~~------ 67 (324) ++|||||+.|.|.|.|+. -+.++++.++.+.|+.+.+-++. .+++.|. ..++.+.+++. T Consensus 1 ~~SAPGKvL~aGGYlVLd~~y~glV~gl~Ar~yA~~~p~~~~~gPesW~~K~t~VrvkSpQf~~~EwlY~is~~~~~~~~ 80 (499) T TIGR01219 1 VASAPGKVLLAGGYLVLDKKYEGLVVGLSARFYAVVKPLDEKVGPESWADKFTEVRVKSPQFKDAEWLYKISLKVSHLRL 80 (499) T ss_pred CCCCCCCHHHHCCEEEECCCCCEEEEEEECCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCEEE T ss_conf 97687547563854466477660688640414688610122458651002123567636643465200221564461589 Q ss_pred -----------HCCCHHHHHHH-HHH--HHCCC-------CCEEEEEECC--------------CC-------------- Q ss_conf -----------01508999999-999--71877-------8738999826--------------87-------------- Q gi|254781017|r 68 -----------MFHPSFSFIIM-AIN--HIKPS-------CGFDLKVISQ--------------LD-------------- 98 (324) Q Consensus 68 -----------~~~~~~~~i~~-~l~--~~~~~-------~g~~i~i~s~--------------iP-------------- 98 (324) ..+|.+..++. .+. ++.+. .++.|+|.++ +| T Consensus 81 ~~~~~~~s~~~~~npfiq~aia~v~~atyf~~~~~~~l~~q~l~iti~~dnayysqrnqiesaGlPltp~s~~~~~~~~~ 160 (499) T TIGR01219 81 QSVYASVSLEKSKNPFIQLAIAAVFSATYFKKESRDKLLLQDLDITILSDNAYYSQRNQIESAGLPLTPESLTELASYAK 160 (499) T ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 88864123688866689999999987542165320026331622689633522444331210578567212455577752 Q ss_pred -----------------CCCCCCCCCHHHHHHHHHHHHHHCCCC------------CCCHHHHHHHHHHHHHHCCC-CCC Q ss_conf -----------------535488740245665567776404566------------66014557888877652146-887 Q gi|254781017|r 99 -----------------SQLGLGSSAAITVAITAALLTLQYHKE------------PSPDEILTTAHAIVLKVQGI-SSG 148 (324) Q Consensus 99 -----------------~g~GLGSSaa~~va~~~al~~~~~~~~------------~~~~~i~~lA~~~E~~~~g~-~sG 148 (324) .++|||||||++.+++.||..+..... -+++-|..+|+-+++..||+ +|| T Consensus 161 ~~F~~~~snGans~~eV~KTGLGSSAam~T~l~~aL~~~l~~~~~~~~dK~~K~~~S~~~viH~lAQ~aHC~AQGKvGSG 240 (499) T TIGR01219 161 LSFLAAESNGANSLEEVAKTGLGSSAAMVTALVAALLLYLRSVDEDDVDKELKIDDSDKEVIHNLAQVAHCSAQGKVGSG 240 (499) T ss_pred EEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 02202222565565553656763148999999999999865420457873433267504555544788886327853463 Q ss_pred HHHHHHHCCCEEEEEECCCEEEEE-------------------------------EC----CCCCCEEECC-CCCCHHHH Q ss_conf 544776629858998527515673-------------------------------04----7775267658-41798999 Q gi|254781017|r 149 IDLAASIHGGLICYQMPKYSIEKI-------------------------------DF----IFPIHLIYSG-YKTPTAQV 192 (324) Q Consensus 149 ~D~~a~~~Gg~i~~~~~~~~~~~~-------------------------------~~----~~~l~l~~tg-~~~~T~~~ 192 (324) +|+.|++||+.++.+|++..+..+ ++ ...|+|.+.+ ...+|..| T Consensus 241 FDV~aA~yGS~rYrRF~P~lis~l~vPyd~~i~~~~~~~~l~~iv~~kw~Ni~~~~f~LP~~~~L~mGd~~sgGSsTp~m 320 (499) T TIGR01219 241 FDVAAAVYGSIRYRRFDPELISQLLVPYDVDIKSLTYSEKLAKIVKEKWDNIKVKEFSLPALLTLLMGDVKSGGSSTPKM 320 (499) T ss_pred HHHHHHHHCCEEEEECCHHHHHHCCCCCHHHCCCCCHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCCCCCCCCCHHH T ss_conf 23777643351144468799964168601311457537899987412356530100457624475504437888673678 Q ss_pred HHHHHHHHHHCCCH-HH----HHHH---HHHHHCCCCH----------HHHH---HHCCCCH----------------HH Q ss_conf 98889988608234-89----9999---9984033207----------8898---7304864----------------45 Q gi|254781017|r 193 LKKISYIEIEYPEI-NE----INQK---IYALMGKLSQ----------ISCQ---ALRNKNL----------------KV 235 (324) Q Consensus 193 v~~v~~~~~~~~~~-~~----~~~~---~~~~~~~i~~----------~~~~---al~~~d~----------------~~ 235 (324) |++|+++...+|+- ++ ...+ +.+.+..+.+ +..+ .|..+|+ .+ T Consensus 321 V~kVk~Wq~s~pees~~~~~~L~~aN~~l~~~L~~L~~L~~~H~D~~~~~~~sCsvL~~~~c~l~~~~Pineaiike~~E 400 (499) T TIGR01219 321 VKKVKKWQKSDPEESKEIYDDLDDANLRLEDKLSDLDKLAKDHDDVYLRVIKSCSVLESEDCVLSAEEPINEAIIKELLE 400 (499) T ss_pred HHHHHHHHCCCCHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHHH T ss_conf 89997130158368898640366656788731347777652001367888765201212350331577457899999999 Q ss_pred HHHHHHHHHHHHHHHC------CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCC Q ss_conf 6899999899998616------7877999999999971994599726899754899984665326899999999997797 Q gi|254781017|r 236 LAQAMNRQQGLLETLG------VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGI 309 (324) Q Consensus 236 l~~lm~~~~~~L~~lg------vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~ 309 (324) +-+.|..-|..|++++ +.++.-.+|.+.+...+|++++-++||||-|.+++|+..+.+ ..+.+.+.|...+. T Consensus 401 ~r~a~~~IRr~~r~it~ea~vdIEP~~QT~lLD~~~~l~GVl~~gvPGAGGfDAif~it~~~~d--~~~~~~~~w~~~~V 478 (499) T TIGR01219 401 VREAVLSIRRLLRKITKEAKVDIEPEKQTSLLDDIESLEGVLLVGVPGAGGFDAIFVITLNDVD--LREKVIKAWKSDKV 478 (499) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHCCHH--HHHHHHHHHHHCCC T ss_conf 9999999999999844550788887344334302002586368852898547899988740212--68899875642683 Q ss_pred EEEEEECCC Q ss_conf 899941454 Q gi|254781017|r 310 DIVPITPSH 318 (324) Q Consensus 310 ~~~~~~~~~ 318 (324) ..++|++++ T Consensus 479 ~~ldV~~a~ 487 (499) T TIGR01219 479 LALDVSEAD 487 (499) T ss_pred EEEEECCCC T ss_conf 477604587 No 20 >PRK03188 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.95 E-value=9e-26 Score=182.26 Aligned_cols=268 Identities=17% Similarity=0.168 Sum_probs=193.6 Q ss_pred EEEECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHH Q ss_conf 999717049997---------43533089428987767728999998689849999536444465661201508999999 Q gi|254781017|r 8 ICVSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIM 78 (324) Q Consensus 8 i~~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~ 78 (324) |+++||+||||+ |.|.. ..+-.+|+.+..+++++.++..+.+..... ..++.+..+-.++.+.. T Consensus 1 it~~apAKINL~L~V~~kr~dGyH~l----~s~~~~i~l~D~i~i~~~~~~~~~~~~~~~---~~~P~d~~Nlv~kA~~~ 73 (299) T PRK03188 1 VTVRAPGKVNLHLGVGPLREDGYHEL----ATVFQAVSLYDEVTVTPADDLSLTVSGEGA---DQVPTDESNLAWRAAEL 73 (299) T ss_pred CEEEEEEEEECCCCCCCCCCCCCCCE----EEEEEEECCCEEEEEEECCCCEEEEECCCC---CCCCCCCCCHHHHHHHH T ss_conf 97975012820403388799998513----589999368719999999998299957873---44889986499999999 Q ss_pred HHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCC Q ss_conf 99971877873899982687535488740245665567776404566660145578888776521468875447766298 Q gi|254781017|r 79 AINHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGG 158 (324) Q Consensus 79 ~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg 158 (324) ..+.+....|++|+++++||+++|||++||.++|++++++++ ++.+++.+++.++|.++ |.|+..+.+|| T Consensus 74 ~~~~~~~~~~~~I~l~K~IP~gaGLGGGSsdAAa~L~~ln~l-~~l~ls~~eL~~lA~~l---------GsDVPffL~gg 143 (299) T PRK03188 74 LAEHVGRAPDVHIHIDKGIPVAGGMAGGSADAAAALVACDAL-WGLGLSRDELLALAAEL---------GSDVPFALLGG 143 (299) T ss_pred HHHHHCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCCCEEEECC T ss_conf 999729999769999957977677774078999999999997-17997989999999861---------78876376278 Q ss_pred EEEEEECCCEEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHH Q ss_conf 58998527515673047--7752676584179899998889988608234899999998403320788987304864456 Q gi|254781017|r 159 LICYQMPKYSIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVL 236 (324) Q Consensus 159 ~i~~~~~~~~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l 236 (324) +.+.+.....+.+++.. +.++|++.+...+|+++.+.+...+...+... .. ........++..+|++.+ T Consensus 144 ~a~~~G~Ge~l~pl~~~~~~~~vlv~P~~~vST~~ay~~ld~~~~~~~~~~--------~~-~~~~~l~~al~~gd~~~l 214 (299) T PRK03188 144 TALGTGRGEELAPVLARGTFHWVLAFADGGLSTPAVFRELDRLREAGPDPP--------RL-GDPEPLLAALAAGDPAQL 214 (299) T ss_pred CEEEEECCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC--------CC-CCHHHHHHHHHCCCHHHH T ss_conf 689997486768887678637999868999472999997776423667765--------66-787999999983999999 Q ss_pred HHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCE Q ss_conf 8999998999986167-8779999999999719945997268997548999846653268999999999977978 Q gi|254781017|r 237 AQAMNRQQGLLETLGV-SDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGID 310 (324) Q Consensus 237 ~~lm~~~~~~L~~lgv-s~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~ 310 (324) .+.|. +.|+..-. -.|++.++.+.+ ...|++++.|||+|. ++++|++.+.++ +++.+.+...|+- T Consensus 215 ~~~l~---NdLe~~a~~l~P~i~~i~~~l-~~~Ga~~a~mSGSGs--TvFalf~~~~~A---~~~a~~l~~~g~~ 280 (299) T PRK03188 215 APLLG---NDLQAAALSLRPELRRTLRAG-EEAGALAGIVSGSGP--TCAFLCASADSA---VDVAAALSGAGVC 280 (299) T ss_pred HHHHC---CCHHHHHHHCCCHHHHHHHHH-HHCCCCEEEEECCCC--CEEEEECCHHHH---HHHHHHHHHCCCE T ss_conf 88716---860677877190999999999-967998789978440--102797899999---9999999757930 No 21 >PRK04344 consensus Probab=99.95 E-value=7.7e-26 Score=182.70 Aligned_cols=273 Identities=17% Similarity=0.154 Sum_probs=186.3 Q ss_pred CEEEEECCEEEEEE----CCCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHH Q ss_conf 78999717049997----43533-08942898776772899999868984999953644446566120150899999999 Q gi|254781017|r 6 HKICVSAPGSLVLM----GEHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAI 80 (324) Q Consensus 6 ~ki~~~aPGKv~L~----GEh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l 80 (324) .||.++||+||||+ |-..| ||--..+-.+||.+..++++..+.+.+.+......... +..+-..+.+.... T Consensus 1 M~i~~kapAKINL~L~V~~kr~dGyH~l~s~~~~i~l~D~i~i~~~~~~~i~i~~~~~~~p~----~~~Nlv~ka~~~l~ 76 (284) T PRK04344 1 MKIYEKAPAKINLSLDTLFKRDDGYHEVEMIMTTIDLADRLTLEELKEDKIVVDSENRFVPS----DQRNLAYQAAKLLK 76 (284) T ss_pred CCEEEEECEEEECCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEECCCCCCC----CCCHHHHHHHHHHH T ss_conf 90516712128533322775899982268999995688299999988998899978876888----76219999999999 Q ss_pred HHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEE Q ss_conf 97187787389998268753548874024566556777640456666014557888877652146887544776629858 Q gi|254781017|r 81 NHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLI 160 (324) Q Consensus 81 ~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i 160 (324) +.+....+++|+++.+||+|+|||++||.++|++++|+++ ++.+++.+++.++|..+ |.|+..+.+||.. T Consensus 77 ~~~~~~~~~~I~l~KnIPvgaGLGGGSSdAAa~L~~Ln~l-~~l~ls~~~L~~la~~l---------GaDVPffl~g~~a 146 (284) T PRK04344 77 ERYNIKKGVTISIDKHIPVAAGLAGGSSDAAATLRGLNRL-WNLGLSLEELAELGSEI---------GSDVPFCVYGGTA 146 (284) T ss_pred HHHCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCEEEEEECCCE T ss_conf 9759899769999978988678898615799999999998-56899999999999862---------7986489827618 Q ss_pred EEEECCCEEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHH Q ss_conf 998527515673047--775267658417989999888998860823489999999840332078898730486445689 Q gi|254781017|r 161 CYQMPKYSIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQ 238 (324) Q Consensus 161 ~~~~~~~~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~ 238 (324) +.+.....+.+++.. +.++|++.+...+|+++-+.+......++.. .....++.++|+..+.. T Consensus 147 ~~~G~Ge~l~~l~~~~~~~~vlv~P~~~vST~~vy~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~ 211 (284) T PRK04344 147 LATGRGEKIEPLPTPPSCWVVLAKPNIGVSTPDIFKNLNLDRIEHPDT---------------DKMVEAIENGDYQQICQ 211 (284) T ss_pred EEECCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCCCCH---------------HHHHHHHHCCCHHHHHH T ss_conf 998589650799877875899988999967399997458000468777---------------89999886279999997 Q ss_pred HHHHHHHHHHHHC-CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 9999899998616-78779999999999719945997268997548999846653268999999999977978999 Q gi|254781017|r 239 AMNRQQGLLETLG-VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVP 313 (324) Q Consensus 239 lm~~~~~~L~~lg-vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~ 313 (324) .+ ++.|+..- -..|++.++.+.+.+ .|+.++.|||+|+ |++++++++.+++.+..--+......|.+-+ T Consensus 212 ~~---~Ndle~~~~~~~P~i~~i~~~l~~-~Ga~~~~mSGSGs--tvF~lf~~~~~A~~~~~~lk~~~~~~~~~~~ 281 (284) T PRK04344 212 SL---GNVLEPVTLAKHPEIAKIKNRMLK-FGADAALMSGSGP--TVFGLCRKESRAQRIYNGLKGFCEEVYLVRL 281 (284) T ss_pred HC---CCCHHHHHHHHCHHHHHHHHHHHH-CCCCEEEEECCCC--CEEEEECCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 73---485689999859399999999996-7998689977251--2489989999999999987875792899997 No 22 >PRK02534 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.95 E-value=5.8e-25 Score=177.14 Aligned_cols=288 Identities=18% Similarity=0.161 Sum_probs=193.5 Q ss_pred CCEEEEECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHH Q ss_conf 878999717049997---------43533089428987767728999998689849999536444465661201508999 Q gi|254781017|r 5 LHKICVSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSF 75 (324) Q Consensus 5 ~~ki~~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~ 75 (324) |+++.++||+||||+ |-|- -..+-.+||++..+++++.++..+.+.+...+.. .+..+-..+ T Consensus 1 M~~~~l~apAKINL~L~V~~kr~dGyH~----l~s~~~~i~l~D~i~i~~~~~~~i~~~~~~~~~~----~~~~Nlv~k- 71 (308) T PRK02534 1 MHSYTLIAPAKINLHLEILGDRPDGFHE----LAMVMQSIDLADRLEFRNNGDGTIKLRCDHPDLS----TDDDNLIYR- 71 (308) T ss_pred CCEEEEEECEEEECCEECCCCCCCCCCE----EEEEEEEECCCCEEEEEECCCCCEEEEECCCCCC----CCCCCHHHH- T ss_conf 9537898043681043018879999850----4789998168839999998999889986887688----898649999- Q ss_pred HHHHH-HHHCCC-CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 99999-971877-8738999826875354887402456655677764045666601455788887765214688754477 Q gi|254781017|r 76 IIMAI-NHIKPS-CGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAA 153 (324) Q Consensus 76 i~~~l-~~~~~~-~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a 153 (324) ....+ ++.... .|++|++..+||+++|||++||-++|++++++++ ++.+++.+++.++|..+ |.|+.. T Consensus 72 a~~~l~~~~~~~~~gv~I~l~KnIPv~aGLGGGSsdAAa~L~~Ln~l-~~~~ls~~~L~~ia~~l---------GsDVPf 141 (308) T PRK02534 72 AAQLLRKRFPFAELGVDITLEKNIPIGAGLAGGSSDAAAVLVGLNLL-WGLGLTQPELESLAAEL---------GSDVPF 141 (308) T ss_pred HHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCCCC T ss_conf 99999996399988658999836886678884068999999999998-47999989999987511---------688550 Q ss_pred HHCCCEEEEEECCCEEEEEECCC--CCCEE-ECCCCCCHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHCCCCHHHHHHHC Q ss_conf 66298589985275156730477--75267-658417989999888998860-823489999999840332078898730 Q gi|254781017|r 154 SIHGGLICYQMPKYSIEKIDFIF--PIHLI-YSGYKTPTAQVLKKISYIEIE-YPEINEINQKIYALMGKLSQISCQALR 229 (324) Q Consensus 154 ~~~Gg~i~~~~~~~~~~~~~~~~--~l~l~-~tg~~~~T~~~v~~v~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~al~ 229 (324) +.+||..+.+.....+.+++... .++|+ +.+...+|+++-+.+++.... +.............+. .....+++. T Consensus 142 fl~g~~a~~~G~Ge~l~pl~~~~~~~~llv~~P~~~vST~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~ 219 (308) T PRK02534 142 CIAGGTQLCFGRGEILEPLPDLDSLGVVLAKYPSLSVSTPWAYKTYRQQFGDTYLSDEEDFEQRRQALR--SGSLLQAIS 219 (308) T ss_pred EECCCEEEEECCCCEEEECCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCC--CCHHHHHHH T ss_conf 220851898358867678887765649999589998274999999987503433576034555554213--106889887 Q ss_pred CCCHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCH-HHHHHHHHHHHHC Q ss_conf 4864456899999899998616-7877999999999971994599726899754899984665326-8999999999977 Q gi|254781017|r 230 NKNLKVLAQAMNRQQGLLETLG-VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNS-LPYQSVNCHMHAK 307 (324) Q Consensus 230 ~~d~~~l~~lm~~~~~~L~~lg-vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~-~~~~~~~~~~~~~ 307 (324) ..+...+ ++.|+..- -..|++.++.+.++...|++.++|||+|. ++++++++..++ ...+.+++.+... T Consensus 220 ~~~~~~l-------~NdLe~~a~~~~P~i~~~~~~L~~~~Ga~~~~mSGSGs--tvFglf~~~~~A~~a~~~l~~~~p~~ 290 (308) T PRK02534 220 AKDPPPL-------RNDLQKVVLPEYPQVAKLLELLSSLPGCLGTMMSGSGP--TCFALFEDQEQAEQALEQVREAFADP 290 (308) T ss_pred HCCHHHH-------HHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEECHHH--HHEEEECCHHHHHHHHHHHHHHCCCC T ss_conf 3478988-------64369998873949999999997447988889972410--00379789999999999999867899 Q ss_pred CCEEEEEE-CCCCCCC Q ss_conf 97899941-4545644 Q gi|254781017|r 308 GIDIVPIT-PSHSTSL 322 (324) Q Consensus 308 g~~~~~~~-~~~~~~~ 322 (324) |++.+.-. +.+|.++ T Consensus 291 ~~~~~v~~~~~~g~~~ 306 (308) T PRK02534 291 GLDAWVCQFISHGIQL 306 (308) T ss_pred CCCEEEECCCCCCEEE T ss_conf 9868975141787374 No 23 >PRK00128 ipk 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.95 E-value=4e-25 Score=178.13 Aligned_cols=270 Identities=16% Similarity=0.156 Sum_probs=188.7 Q ss_pred EEEECCEEEEEEC----CCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHH Q ss_conf 9997170499974----3533-0894289877677289999986898499995364444656612015089999999997 Q gi|254781017|r 8 ICVSAPGSLVLMG----EHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINH 82 (324) Q Consensus 8 i~~~aPGKv~L~G----Eh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~ 82 (324) |..+||+||||+= -..| ||--..+-..||.+..+++++.++..+.+.+....... +..+-..+.+....+. T Consensus 1 ~~~kapAKINL~L~V~~kr~dGyH~l~sl~~~i~l~D~i~i~~~~~~~i~~~~~~~~~p~----~~~Nlv~ka~~~l~~~ 76 (281) T PRK00128 1 ILEKAPAKINLSLDVLGKREDGYHEVEMIMTTIDLADRLEIEKLKEDKIVVESNSGYVPN----DERNLAYKAAKLLKER 76 (281) T ss_pred CEEECCCCEECCEECCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEECCCCCCC----CCCCHHHHHHHHHHHH T ss_conf 906214558344302883899988356899991687299999978998899978887888----9856999999999997 Q ss_pred HCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEE Q ss_conf 18778738999826875354887402456655677764045666601455788887765214688754477662985899 Q gi|254781017|r 83 IKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICY 162 (324) Q Consensus 83 ~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~ 162 (324) .....|++|+++.+||+|+|||++||.++|++++++++ ++.+++.+++.++|..+ |.|+..+.+||+.+. T Consensus 77 ~~~~~~~~I~l~K~IP~gaGLGGGSSdAAa~L~~ln~l-~~~~ls~~eL~~la~~i---------GaDVpffl~g~~a~~ 146 (281) T PRK00128 77 YNIKKGVRITIDKNIPVAAGLAGGSSDAAATLRGLNKL-WNLGLSLEELAEIGLKI---------GSDVPFCIYGGTALA 146 (281) T ss_pred HCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCCCCHHCCCCEEE T ss_conf 49998748999857986677785128899999999998-47999999999999872---------788441104775698 Q ss_pred EECCCEEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHH Q ss_conf 8527515673047--77526765841798999988899886082348999999984033207889873048644568999 Q gi|254781017|r 163 QMPKYSIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAM 240 (324) Q Consensus 163 ~~~~~~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm 240 (324) +.....+.+++.. ..++|++.+...+|+++-+.+......++. ......++.++|+..+.+.+ T Consensus 147 ~G~Ge~i~~l~~~~~~~~vLv~P~~~vsT~~vy~~~~~~~~~~~~---------------~~~~~~~l~~~d~~~~~~~l 211 (281) T PRK00128 147 TGRGEKITPLKPPPSCWVVLAKPDIGVSTKDVYKNLDLDKIYHPD---------------TEKLIEAIENGDYQGICASM 211 (281) T ss_pred ECCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCCCC---------------HHHHHHHHHCCCHHHHHHHH T ss_conf 558967678887888589998899997869999852800267877---------------68899988628999998863 Q ss_pred HHHHHHHHHHC-CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 99899998616-7877999999999971994599726899754899984665326899999999997797899941 Q gi|254781017|r 241 NRQQGLLETLG-VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPIT 315 (324) Q Consensus 241 ~~~~~~L~~lg-vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 315 (324) . +.|+..- --.|++.++.+.+.+ .|++++.|||+|+ |++++++++.+++ ++.+.+.+.--+++-++ T Consensus 212 ~---Ndle~~a~~~~P~i~~i~~~l~~-~Ga~~~~mSGSGs--t~Falf~~~~~A~---~~~~~Lk~~~~~~~~~~ 278 (281) T PRK00128 212 G---NVLEPVTLKKHPEIAKIKERMLK-FGADGALMSGSGP--TVFGLFDDESRAQ---RIYNGLKGFCKEVYLVR 278 (281) T ss_pred H---HHHHHHHHHHCHHHHHHHHHHHH-CCCCEEEEECCCC--EEEEEECCHHHHH---HHHHHHHHHCCCEEEEE T ss_conf 4---03799999859599999999985-7998589877212--1899989999999---99998671489399997 No 24 >COG0083 ThrB Homoserine kinase [Amino acid transport and metabolism] Probab=99.95 E-value=1.6e-24 Score=174.34 Aligned_cols=276 Identities=19% Similarity=0.197 Sum_probs=196.3 Q ss_pred CCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHC Q ss_conf 87899971704999743533089428987767728999998689849999536444465661201508999999999718 Q gi|254781017|r 5 LHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIK 84 (324) Q Consensus 5 ~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~ 84 (324) |..+.+++|+-.--+|- |++++.+|+++|..+.+....+ .+.+.... +....++.+..+-.....+..++.+. T Consensus 1 ~~~~~v~aPASSANlGp-----GFD~lGlAl~~~~~~~v~~~~~-~~~i~~~g-~~~~~iP~~~~n~~~~~~~~~~~~~~ 73 (299) T COG0083 1 MLMMKVRVPASSANLGP-----GFDVLGLALDLYNDVVVVEVVD-KFEIEVEG-EGADKIPLDPENLVYQAALKFLEALG 73 (299) T ss_pred CCEEEEEEEECCCCCCC-----CCCCEEEECCCCCCEEEEEECC-CEEEEEEC-CCCCCCCCCCCEEHHHHHHHHHHHHC T ss_conf 94089997621024677-----7000012014557279997047-37999834-65667999830009999999999829 Q ss_pred CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHH-HHCCCEEEEE Q ss_conf 778738999826875354887402456655677764045666601455788887765214688754477-6629858998 Q gi|254781017|r 85 PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAA-SIHGGLICYQ 163 (324) Q Consensus 85 ~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a-~~~Gg~i~~~ 163 (324) ...+++|+++++||.++|||||||.+||-+.+++++. +.+++++++.++|...|. +| |+.+ |.+||+.... T Consensus 74 ~~~~~~i~i~k~IP~~rGLGSSaAsiVAal~aan~l~-~~~L~~~~ll~~a~~~Eg----Hp---DNVapa~lGG~~l~~ 145 (299) T COG0083 74 IEAGVKIRIEKGIPLGRGLGSSAASIVAALAAANELA-GLPLSKEELLQLALEIEG----HP---DNVAPAVLGGLVLVE 145 (299) T ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHCC----CC---CHHHHHHHCCEEEEE T ss_conf 9865799997699877787589999999999999984-599989999999997558----97---107777607879986 Q ss_pred E-CCCEEEEEE--CCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHH Q ss_conf 5-275156730--4777526765841798999988899886082348999999984033207889873048644568999 Q gi|254781017|r 164 M-PKYSIEKID--FIFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAM 240 (324) Q Consensus 164 ~-~~~~~~~~~--~~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm 240 (324) . .+....+++ ..+.+++++.+.+.+|+++.+-+ .++++ ..+...+. ........|+.++|.+.+...| T Consensus 146 ~~~~~~~~~v~~~~~~~~v~~iP~~e~sT~~aR~vL---P~~~~-~~daV~n~-----s~~a~lv~al~~~~~~l~~~~~ 216 (299) T COG0083 146 EESGIISVKVPFPSDLKLVVVIPNFEVSTAEARKVL---PKSYS-RKDAVFNL-----SRAALLVAALLEGDPELLRAMM 216 (299) T ss_pred ECCCCEEEECCCCCCEEEEEEECCCCCCHHHHHHHC---CCCCC-HHHHHHHH-----HHHHHHHHHHHCCCHHHHHHHH T ss_conf 158734797158764689999089656579999746---56688-89999999-----9999999999769999999870 Q ss_pred HH-HHHHHH-HHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 99-899998-616787799999999997199459972689975489998466532689999999999779789 Q gi|254781017|r 241 NR-QQGLLE-TLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDI 311 (324) Q Consensus 241 ~~-~~~~L~-~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~ 311 (324) +. -|+--| .+ -|.++++. ....+.|++|+-+||||+ ++++|+++. .++....+.+.+...|+.. T Consensus 217 ~D~ihepyR~~L---~P~~~~v~-~~a~~~gA~g~~lSGAGP--Ti~al~~~~-~~e~~~~~~~~~~~~~~~~ 282 (299) T COG0083 217 KDVIHEPYRAKL---VPGYAEVR-EAALEAGALGATLSGAGP--TVFALADES-DAEKAAALLEELYEQGIKG 282 (299) T ss_pred CCCCCHHHHHHH---CCCHHHHH-HHHHHCCCEEEEEECCCC--EEEEEECCC-HHHHHHHHHHHHHHHCCCC T ss_conf 360104666532---70189999-998647965999964787--079984542-0667999999999728850 No 25 >PTZ00299 homoserine kinase; Provisional Probab=99.94 E-value=4.9e-25 Score=177.59 Aligned_cols=293 Identities=16% Similarity=0.126 Sum_probs=195.9 Q ss_pred CCCCCCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHH Q ss_conf 99888789997170499974353308942898776772899999868984999953644446566120150899999999 Q gi|254781017|r 1 MGQCLHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAI 80 (324) Q Consensus 1 ~~~~~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l 80 (324) |-|.++||++++|+-.--+|- |++++.+|++.|..++++..++..+.++.... ..+..+..+-....+...+ T Consensus 1 ~~~~~~kV~V~VPATsANLGp-----GFD~LGlAL~ly~ev~v~~~~~~~i~i~g~g~---~~ip~d~~Nlv~~a~~~~~ 72 (336) T PTZ00299 1 MSPLPKKVVLRVPATTANIGP-----AYDTLGMALSIFMELTVEHADAFSMTVEGEGS---EHISTDEDNMVVQACRLAF 72 (336) T ss_pred CCCCCCEEEEEECCCHHHCCC-----CHHHHHHHCCCCCEEEEEECCCCEEEEEECCC---CCCCCCCCCHHHHHHHHHH T ss_conf 998897599996744464452-----36752530367748999988971799970677---7799996114999999999 Q ss_pred HHHC--CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC---CCHHHHHHHHHHHHHHCCCCCCHHHHH-H Q ss_conf 9718--778738999826875354887402456655677764045666---601455788887765214688754477-6 Q gi|254781017|r 81 NHIK--PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEP---SPDEILTTAHAIVLKVQGISSGIDLAA-S 154 (324) Q Consensus 81 ~~~~--~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~---~~~~i~~lA~~~E~~~~g~~sG~D~~a-~ 154 (324) .... ...++++.++++||.++|||||||.+||-+.+.+.+. +..+ +.+++++++.++| |+| |+.+ | T Consensus 73 ~~~~~~~~~~~~i~i~~~IP~~rGLGSSAaAiVaG~~aan~L~-g~~l~~~~~~~ll~~a~~~E----gHP---DNVAPA 144 (336) T PTZ00299 73 EEYAHKSMPPLKFIMHSNIPYGCGCGSSSAAAVAGFVAGMKLC-GLTMETENEEALLQAIAKFE----GHP---DNAAPA 144 (336) T ss_pred HHHCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCCCCCCHHHHHHHHHHHC----CCC---CCCCCC T ss_conf 9856978896389983688876675138899999999999862-68877558999999876216----897---455432 Q ss_pred HCCCEEEE-EECCCEEE--EEECCCC--CCEEECC--CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 62985899-85275156--7304777--5267658--4179899998889988608234899999998403320788987 Q gi|254781017|r 155 IHGGLICY-QMPKYSIE--KIDFIFP--IHLIYSG--YKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQA 227 (324) Q Consensus 155 ~~Gg~i~~-~~~~~~~~--~~~~~~~--l~l~~tg--~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~a 227 (324) .|||++.. ...+..+. +++++.. ++++... .+.+|+++.+-+ .++.+.. +...+ ++ .+.....+ T Consensus 145 llGG~~l~~~~~~~~~~~~~ip~p~~l~~Vv~iP~~~~~lsT~~AR~vL---P~~v~~~-daV~n----~~-r~a~lv~a 215 (336) T PTZ00299 145 IYGGIQLVYKKDNGRFLTYRVPTPPNLSVVLFVPHNKMKANTHVTRNLI---PTSVSLE-DAVFN----IS-RTSILVLA 215 (336) T ss_pred CCCCEEEEEECCCCCEEEEECCCCCCCEEEEEECCCCCEECHHHHHHHC---CCCCCHH-HHHHH----HH-HHHHHHHH T ss_conf 4588699997289827999779998848999978976330689999658---6768899-99999----99-99999999 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECC------------CCCHH Q ss_conf 304864456899999899998616787799999999997199459972689975489998466------------53268 Q gi|254781017|r 228 LRNKNLKVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKG------------DLNSL 295 (324) Q Consensus 228 l~~~d~~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~------------~~~~~ 295 (324) |..+|++.+..+-+.=|+-.|. ..--|.++++.+. ..+.|++|+-+||||+ ++++|++. +..+. T Consensus 216 l~~~d~~ll~~~~D~ihepyR~-~~LiP~~~~v~~~-a~~aGA~g~~iSGAGP--Ti~Al~~g~~~~~~~~~~~~~~a~~ 291 (336) T PTZ00299 216 LSTGDLRMLKSCSDKLHEQQRS-DALFPHFRPCVKA-AREAGAHYAFLSGAGP--SVCALVGGRHGDPLTQPREERKAES 291 (336) T ss_pred HHCCCHHHHHHCCCEEECCCHH-HHCCCCHHHHHHH-HHHCCCEEEEEECHHH--HHHEEECCCCCCCCCCCCHHHHHHH T ss_conf 9549898997487348652023-3226279999999-9878994999976422--2442621556653224410345999 Q ss_pred HHHHHHHHHHHCCCE--EEEEECC-CCCCC Q ss_conf 999999999977978--9994145-45644 Q gi|254781017|r 296 PYQSVNCHMHAKGID--IVPITPS-HSTSL 322 (324) Q Consensus 296 ~~~~~~~~~~~~g~~--~~~~~~~-~~~~~ 322 (324) +-+.+.+.|.+.|++ .+-+.++ +|.++ T Consensus 292 V~~aM~~a~~~~gi~~~~~i~~~~~~G~~v 321 (336) T PTZ00299 292 VAEAMIKAAEAVGVAGRVIITQPSDQGVHL 321 (336) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCCCEE T ss_conf 999999999986984589993467788789 No 26 >KOG4519 consensus Probab=99.94 E-value=4.4e-24 Score=171.59 Aligned_cols=310 Identities=20% Similarity=0.246 Sum_probs=202.5 Q ss_pred EEEEECCEEEEEECCCEEEC-CCEEEEEEECCEEEEEEEECCCCE------EEEEEC---CCCCCCCCCH---------- Q ss_conf 89997170499974353308-942898776772899999868984------999953---6444465661---------- Q gi|254781017|r 7 KICVSAPGSLVLMGEHGVLH-GHAALVFAINKRVILYLTLRKDRL------INIDSS---LGQYCGSLDL---------- 66 (324) Q Consensus 7 ki~~~aPGKv~L~GEh~v~~-G~~ai~~aI~~~~~~~~~~~~~~~------i~i~~~---~~~~~~~~~~---------- 66 (324) ++++|||||+.+.|.|.|+. -+..++..++.+.|+.+.+-++.. +++.|. +.++...+++ T Consensus 2 ~v~~SAPGKvL~aGGYlVLd~~y~glV~gl~Ar~yAi~~p~~~~~g~~~~~VrvkSpQf~d~EwlY~is~~~~~i~v~~~ 81 (459) T KOG4519 2 AVVASAPGKVLMAGGYLVLDKPYAGLVLGLNARFYAIVKPINEEVGPEWTDVRVKSPQFSDREWLYKISLNHLTIQVVSA 81 (459) T ss_pred CCEECCCCCEEEECCEEEECCCCCEEEEEEECEEEEEEECCCCCCCCCCCEEEECCCCCCCCHHEEECCCCCCEEEEEEC T ss_conf 60305887458765469965776516996303037875225310277640267237554562122550655440577753 Q ss_pred -HHCCCHHHHHHH-HHHHHCCC-------CCEEEE-EECC-------------------------CC--CCCCCCCCCHH Q ss_conf -201508999999-99971877-------873899-9826-------------------------87--53548874024 Q gi|254781017|r 67 -AMFHPSFSFIIM-AINHIKPS-------CGFDLK-VISQ-------------------------LD--SQLGLGSSAAI 109 (324) Q Consensus 67 -~~~~~~~~~i~~-~l~~~~~~-------~g~~i~-i~s~-------------------------iP--~g~GLGSSaa~ 109 (324) +..+|.+..++. .+.+|.+. .|+.++ +.|| +| .++|||||||+ T Consensus 82 ~~s~NPFve~ai~yvi~yf~~~~~~~lc~q~L~Vt~l~sddaY~~qp~s~~~~~~~~~~~F~~~~~~eV~KTGLGSSAam 161 (459) T KOG4519 82 SDSRNPFVEHAIQYVIAYFHLATLHKLCLQGLDVTILGSDDAYSYQPESLGTLAPFASITFNAAEKPEVAKTGLGSSAAM 161 (459) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCHHHH T ss_conf 78998079999999999860213136776586689950365110374432345777713441244554444576623789 Q ss_pred HHHHHHHHHHHHCCC-----------CC-CCHHHHHHHHHHHHHHCCC-CCCHHHHHHHCCCEEEEEECCCEEEEE---- Q ss_conf 566556777640456-----------66-6014557888877652146-887544776629858998527515673---- Q gi|254781017|r 110 TVAITAALLTLQYHK-----------EP-SPDEILTTAHAIVLKVQGI-SSGIDLAASIHGGLICYQMPKYSIEKI---- 172 (324) Q Consensus 110 ~va~~~al~~~~~~~-----------~~-~~~~i~~lA~~~E~~~~g~-~sG~D~~a~~~Gg~i~~~~~~~~~~~~---- 172 (324) +.+++.+++...... +. +.+-|.++|+.+++..||+ +||+|+.|++||...+.++.+..+..+ T Consensus 162 ~T~lv~~ll~sl~~~~~d~~~k~~k~d~s~~~viHnlAQ~aHC~AQGKvGSGFDV~aA~yGS~rYrRF~P~lis~lp~~i 241 (459) T KOG4519 162 TTALVAALLHSLGVVDLDDPCKEGKFDCSDLDVIHNLAQTAHCLAQGKVGSGFDVSAAVYGSQRYRRFSPELISFLPQVI 241 (459) T ss_pred HHHHHHHHHHHHCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHCCCEEEEECCHHHHHHHHHHH T ss_conf 99999999986131325787645666842899999999988778617756774300111032346637999998879987 Q ss_pred ---------------------------ECCCCCCEEECCCCCCHHHHHHHHHHHHHHCC-CHHH----HHHHHHHHHCCC Q ss_conf ---------------------------04777526765841798999988899886082-3489----999999840332 Q gi|254781017|r 173 ---------------------------DFIFPIHLIYSGYKTPTAQVLKKISYIEIEYP-EINE----INQKIYALMGKL 220 (324) Q Consensus 173 ---------------------------~~~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~-~~~~----~~~~~~~~~~~i 220 (324) +....+++.+.....+|..+|.+|+++...+| +..+ .+....+.++.+ T Consensus 242 ~~~~~~~el~~~V~k~~W~~~~~~f~LP~~~~L~mGd~~gGSsTp~mV~kVk~Wq~s~~pe~~k~~~~lddansr~~~~l 321 (459) T KOG4519 242 VTGLPLNELIGTVLKGKWDNKRTEFSLPPLMNLFMGDGSGGSSTPSMVGKVKKWQMSDPPEARKNWQNLDDANSRLETKL 321 (459) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHCCCHHHHHHHCCHHHHHHHHHHHH T ss_conf 61671677887752300134454356885101564157788776789999987765288667654200006789987530 Q ss_pred C-------------HHHHHHHCCCC--------HHHHHHHHHHHHHHHHHHC------CCCHHHHHHHHHHHHCCCCEEE Q ss_conf 0-------------78898730486--------4456899999899998616------7877999999999971994599 Q gi|254781017|r 221 S-------------QISCQALRNKN--------LKVLAQAMNRQQGLLETLG------VSDSKLSEIVWKLREQPHIMAS 273 (324) Q Consensus 221 ~-------------~~~~~al~~~d--------~~~l~~lm~~~~~~L~~lg------vs~~~l~~li~~~~~~~g~~~a 273 (324) . .+..+.+...+ +.++.+.|..-+.++++++ +.++.-.++.......+|++++ T Consensus 322 ~kl~~l~~~h~d~~~~v~eSl~~~~~~~~~~~e~~E~r~a~~~IRr~~r~it~ea~vdIEP~~QT~lLD~~~sl~GVl~~ 401 (459) T KOG4519 322 NKLSKLAKDHWDVYLRVIESLSVLTINEAIIKELLEAREAMLRIRRLMRQITEEASVDIEPESQTQLLDSTMSLEGVLLA 401 (459) T ss_pred HHHHHHHHHCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEE T ss_conf 56667665000678888764012332155546789999999999999999778608987863554576666405663784 Q ss_pred ECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 726899754899984665326899999999997797899941454 Q gi|254781017|r 274 KISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSH 318 (324) Q Consensus 274 klsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 318 (324) -+.||||-|.+++++..+.+ ..+.+.+.|...+...++++++. T Consensus 402 gvPGAGGfDAif~it~~dvd--~~~~~~~~w~~~~V~~ldV~~a~ 444 (459) T KOG4519 402 GVPGAGGFDAIFAITLGDVD--SGTKLTQAWSSHNVLALDVREAP 444 (459) T ss_pred CCCCCCCCCEEEEEEECCHH--HHHHHHHHHCCCCEEEEEEEECC T ss_conf 46678874448888606446--87888765200574477620077 No 27 >TIGR00191 thrB homoserine kinase; InterPro: IPR000870 Saccharomyces cerevisiae strains containing the erg8-1 mutation are temperature sensitive for growth due to a defect in phosphomevalonate kinase, an enzyme of isoprene and ergosterol biosynthesis. Subcloning and DNA sequencing have defined the functional ERG8 regulon as an 850bp upstream region and an adjacent 1,272bp open reading frame. The deduced ERG8 protein contains 424 residues and shows no similarity to known proteins, except within a putative ATP-binding domain present in many kinases . Enzymes that share the N-terminal Gly/Ser-rich putative ATP-binding region include galactokinase, homoserine kinase, mevalonate kinase and phosphomevalonate kinase. Homoserine kinase is a homodimeric enzyme involved in threonine biosynthesis. Sequence comparison of the yeast enzyme with the corresponding proteins from bacterial sources reveals the presence of several highly conserved regions, the pattern of occurrence of which suggests that the ancestral sequences might have been composed from separate (functional) domains. A block of similar residues, found towards the C-terminus, is also present in many other proteins involved in threonine (or serine) metabolism; this motif may therefore represent the binding site for the hydroxy-amino acids. Limited similarity was detected between a motif conserved among the homoserine kinases and consensus sequences found in other mono- or dinucleotide-binding proteins .; GO: 0004413 homoserine kinase activity, 0005524 ATP binding, 0006566 threonine metabolic process. Probab=99.94 E-value=5.4e-25 Score=177.31 Aligned_cols=290 Identities=21% Similarity=0.185 Sum_probs=196.6 Q ss_pred EEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEE----CCCCE-----EEEEECCCCCCCCCCHHHCCCHH-HHHH Q ss_conf 99971704999743533089428987767728999998----68984-----99995364444656612015089-9999 Q gi|254781017|r 8 ICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTL----RKDRL-----INIDSSLGQYCGSLDLAMFHPSF-SFII 77 (324) Q Consensus 8 i~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~----~~~~~-----i~i~~~~~~~~~~~~~~~~~~~~-~~i~ 77 (324) |.|++|+=-==+|- |++++.+|++++..+.+.. ..+.. +.+.... +....+......+.+ .... T Consensus 1 ~~V~vPAsSANlGp-----GFD~LG~aL~~~~~~~~~~~~~~e~~~~~nnCk~~~~~~g-~~~~~iP~~p~~N~~yq~~~ 74 (359) T TIGR00191 1 FKVKVPASSANLGP-----GFDVLGLALSLYNGLTVTDVVAVESDDTPNNCKFEIEAEG-EDVEKIPTEPDDNLIYQVAK 74 (359) T ss_pred CEEEEECCCCCCCC-----CCCEEECEECCCCCEEEECCCCHHHCCCCCCCEEEEEEEC-CCCCCCCCCCHHHHHHHHHH T ss_conf 96862110057774-----4001201003334405504664111058874126888622-45125889612105688999 Q ss_pred HHHHHHCCC---CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCH---HH Q ss_conf 999971877---873899982687535488740245665567776404566660145578888776521468875---44 Q gi|254781017|r 78 MAINHIKPS---CGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGI---DL 151 (324) Q Consensus 78 ~~l~~~~~~---~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~---D~ 151 (324) ..++.+... .+++++++++||.|+|||||||.+||.+.|++++. +.++++.+++++|...|..+ ||. |+ T Consensus 75 ~~~~~~g~~~~~~p~k~~~~~~iP~grGLGSSAaaiVaa~~a~ne~~-g~~l~~~~~L~~a~~~E~~~----SGs~HpDN 149 (359) T TIGR00191 75 RFLDQLGKRNFIKPVKLTLEKNIPLGRGLGSSAAAIVAALLAANELA-GLPLSKEELLDLALELEGRI----SGSIHPDN 149 (359) T ss_pred HHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHC----CCCCCHHH T ss_conf 99998187778887689985385101366638999999999999983-88989899999999986430----57757134 Q ss_pred HH-HHCCCEEEE-EECCC--EE----------EEEE--------------------------CCCCCCEEECCCCCCHHH Q ss_conf 77-662985899-85275--15----------6730--------------------------477752676584179899 Q gi|254781017|r 152 AA-SIHGGLICY-QMPKY--SI----------EKID--------------------------FIFPIHLIYSGYKTPTAQ 191 (324) Q Consensus 152 ~a-~~~Gg~i~~-~~~~~--~~----------~~~~--------------------------~~~~l~l~~tg~~~~T~~ 191 (324) .| |.+||+... +.++. .+ .+.| ..+.+++++.+...+|++ T Consensus 150 vaPa~~GG~~~~~~~~~~Pqeverrei~~~~~~~~P~~GedtGlvPPlPPtdiGrhvkyqWn~~~~~v~aIP~~~~sT~~ 229 (359) T TIGR00191 150 VAPALLGGFQLAVKEDDKPQEVERREIVLLEVLKIPIFGEDTGLVPPLPPTDIGRHVKYQWNSKLKVVLAIPNIEVSTAE 229 (359) T ss_pred HHHHHHCCEEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEECCCCCCCHHH T ss_conf 67885166279884279876113566656664148898766656797785223442002016765179981784205688 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCC-HHHHHHHHHH--HHHHHH-HHCCCCHHHHHHHH---HH Q ss_conf 99888998860823489999999840332078898730486-4456899999--899998-61678779999999---99 Q gi|254781017|r 192 VLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKN-LKVLAQAMNR--QQGLLE-TLGVSDSKLSEIVW---KL 264 (324) Q Consensus 192 ~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d-~~~l~~lm~~--~~~~L~-~lgvs~~~l~~li~---~~ 264 (324) +.+- +-++|+...-++. .. -..-+..|+..+| ...|++.|-. -|+--| +| -|.+.++ + .. T Consensus 230 AR~v---LPk~Y~~~d~v~n-----~~-~l~~l~~A~~~~dP~~~l~~~~~kD~~hqPyR~~L---~P~l~~i-~sCv~~ 296 (359) T TIGR00191 230 ARAV---LPKAYPRQDLVFN-----LS-HLAGLVHALYQKDPKPDLLAIMMKDRIHQPYRESL---IPNLFKI-KSCVQA 296 (359) T ss_pred HHHH---CCCCCCHHHHHHH-----HH-HHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCC---CCHHHHH-HHHHHH T ss_conf 9620---5554678899998-----87-78777756107996289999973385011634223---6148998-999999 Q ss_pred HHCCCCEEEECCCCCCCCEEEEEECCCC-CHHHHHHHHHHHHHCCCEE---EEEECCCCCCCC Q ss_conf 9719945997268997548999846653-2689999999999779789---994145456445 Q gi|254781017|r 265 REQPHIMASKISGSGLGDCVIALGKGDL-NSLPYQSVNCHMHAKGIDI---VPITPSHSTSLY 323 (324) Q Consensus 265 ~~~~g~~~aklsGAG~GG~~i~l~~~~~-~~~~~~~~~~~~~~~g~~~---~~~~~~~~~~~~ 323 (324) ....|+|+.-|||||+ ++++|++++. ++++...+.+.+...+... ..-...+|+.++ T Consensus 297 ~~~~Ga~~~~iSGsGP--Tila~~~e~~G~~~~~~~~~~~~~~~~~e~~v~~~~~d~~G~~~~ 357 (359) T TIGR00191 297 ALEKGALGITISGSGP--TILALADEEFGTEQKAQDLLENLHKQGIEGTVHVLEIDTDGARVE 357 (359) T ss_pred HHHCCCEEEEECCCCH--HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCEEE T ss_conf 9735864887112010--555168888775777699987776416532444202335761465 No 28 >PRK01123 shikimate kinase; Provisional Probab=99.93 E-value=7.2e-23 Score=163.88 Aligned_cols=251 Identities=17% Similarity=0.147 Sum_probs=166.0 Q ss_pred EEEEECCEEEEEEEECCCCE-EEEEECCCCCCCCCCHHHCCCHH-HHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCH Q ss_conf 98776772899999868984-99995364444656612015089-99999999718778738999826875354887402 Q gi|254781017|r 31 LVFAINKRVILYLTLRKDRL-INIDSSLGQYCGSLDLAMFHPSF-SFIIMAINHIKPSCGFDLKVISQLDSQLGLGSSAA 108 (324) Q Consensus 31 i~~aI~~~~~~~~~~~~~~~-i~i~~~~~~~~~~~~~~~~~~~~-~~i~~~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa 108 (324) =+++||+.++++++..++.. +..+. +...+.....+ +.+...+.++....|+++++.|+||+++||||||| T Consensus 23 ~a~~idL~~~a~v~~~~~~~~~~~~~-------~~~~~~~~~l~~~~~~~~~~~~~i~~g~~i~i~s~IP~~~GL~SSSA 95 (283) T PRK01123 23 AAFGIDLKTTATVELSDSGSKIKGEV-------SGYPDADTRLVERCVELVLERFGIDYGGTVRTESEIPVASGLKSSSA 95 (283) T ss_pred CEEEECCEEEEEEEECCCCCCCCCCC-------CCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCHHHH T ss_conf 16763645999999877776444200-------57887563899999999998629987759999558986678537899 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-H-HHHHHHCCCEEEEEECCCEEEE-EECCCCCCEEECCC Q ss_conf 4566556777640456666014557888877652146887-5-4477662985899852751567-30477752676584 Q gi|254781017|r 109 ITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-I-DLAASIHGGLICYQMPKYSIEK-IDFIFPIHLIYSGY 185 (324) Q Consensus 109 ~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~-D~~a~~~Gg~i~~~~~~~~~~~-~~~~~~l~l~~tg~ 185 (324) .++|.+.|++.+ ++.+++++|+.+++.+++.......+| + |.++|.+||++..++.+..+.. .+++.+++++.++. T Consensus 96 ~a~A~i~a~~~~-~~~~Ls~~ei~~la~ea~~~~g~siTGa~DDa~As~~GG~~ltdn~~~~ii~~~~~~~~~vV~ip~~ 174 (283) T PRK01123 96 AANALVLATLDA-LGEKLDDLDILRLGVKASKEAGVTVTGAFDDACASYFGGVCVTDNREMKILKRDEVEGPVVVLIPPE 174 (283) T ss_pred HHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCEEEEECCCCEEEECCCCCCCEEEEECCC T ss_conf 999999999998-5899899999999999862078152367511667744977998188743665148998589998997 Q ss_pred CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHCCCCHHHHHHHHHH Q ss_conf 17989999888998860823489999999840332078898730486445689999989999-86167877999999999 Q gi|254781017|r 186 KTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLL-ETLGVSDSKLSEIVWKL 264 (324) Q Consensus 186 ~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L-~~lgvs~~~l~~li~~~ 264 (324) ..+|++. .+..++ . +....+.+.+...++++. +.|..|--.. ..++... +++..+ T Consensus 175 ~~~t~~~--~~~~~~-----------~----i~~~~~~a~~~~~~g~~~---~Am~~Ngl~~~~~l~~~~----~~~~~a 230 (283) T PRK01123 175 KAFSANV--DVERMK-----------L----IAPYVEMAFELALAGDYF---KAMTLNGLLYSSALGFPT----EPILDA 230 (283) T ss_pred CCCCCCC--CHHHHH-----------H----HHHHHHHHHHHHHCCCHH---HHHHHHHHHHHHHCCCCH----HHHHHH T ss_conf 7737627--678887-----------7----438999999998779898---998750264245417987----999999 Q ss_pred HHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC-CCCCC Q ss_conf 9719945997268997548999846653268999999999977978999414-54564 Q gi|254781017|r 265 REQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP-SHSTS 321 (324) Q Consensus 265 ~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~ 321 (324) . ..|++++.|||+|+ ++++|++++.. +.+.+.|.+.|-- +..++ .++.+ T Consensus 231 l-~~GAlga~iSGsGP--si~al~~~e~~----~~v~ea~~~~G~v-i~t~i~n~g~~ 280 (283) T PRK01123 231 L-EAGAVGAGLSGTGP--SYVALFDEENP----EEVKEAWSKYGKV-IVTKINNRGAR 280 (283) T ss_pred H-HCCCEEEEEECCCC--EEEEEECCCCH----HHHHHHHHHCCCE-EEEEECCCCCE T ss_conf 9-75984999826898--79999807767----9999999867988-99956598735 No 29 >PRK03009 consensus Probab=99.89 E-value=1.1e-20 Score=150.06 Aligned_cols=271 Identities=15% Similarity=0.125 Sum_probs=175.9 Q ss_pred EEECCEEEEEEC----CCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHH Q ss_conf 997170499974----3533-08942898776772899999868984999953644446566120150899999999971 Q gi|254781017|r 9 CVSAPGSLVLMG----EHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHI 83 (324) Q Consensus 9 ~~~aPGKv~L~G----Eh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ 83 (324) .+.||+||||+= --.| ||.-..+-..||.+.+++++.+++..+.+......... ..+-.++.+....+.. T Consensus 4 ~~~APAKINL~L~V~~kr~DGyH~l~Sl~~~i~l~D~i~i~~~~~~~~~~~~~~~~i~~-----~~Nlv~kA~~~l~~~~ 78 (287) T PRK03009 4 DCPAPAKLNLFLHVTGRRPDGYHLLQTVFQLLDWGDTLHFTLRDDGKVARVTDVPGVPE-----ESDLVVRAARLLQAHT 78 (287) T ss_pred CCCCCCEEECCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCEEECCCCCCCCC-----CCCHHHHHHHHHHHHH T ss_conf 68962127025024886899987067899980678699999888994796257778988-----6659999999999853 Q ss_pred CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEE Q ss_conf 87787389998268753548874024566556777640456666014557888877652146887544776629858998 Q gi|254781017|r 84 KPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQ 163 (324) Q Consensus 84 ~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~ 163 (324) ....|++|+++.+||+++|||+.||-++|++++|+++ ++.+++.+++.++|..+ |.|+..+.+|+..+.+ T Consensus 79 ~~~~~~~I~l~KnIPv~AGLGGGSSdAAa~L~~Ln~l-~~l~ls~~~l~~ia~~l---------GADVPffl~~~~a~~~ 148 (287) T PRK03009 79 GTRLGVDIEIDKRLPMGGGLGGGSSDAATTLLALNRL-WGLDLPRAELQSLALKL---------GADVPFFVFGKNAFAE 148 (287) T ss_pred CCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCCCCEECCCCEEEE T ss_conf 9988549999806876567787753799999999997-07999999999998763---------9982303227767998 Q ss_pred ECCCEEEEEECC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHH Q ss_conf 527515673047-7752676584179899998889988608234899999998403320788987304864456899999 Q gi|254781017|r 164 MPKYSIEKIDFI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNR 242 (324) Q Consensus 164 ~~~~~~~~~~~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~ 242 (324) .....+.+++++ ..++|++.+...+|+++-+.....+...+... ..+.... .........+.++|++...- T Consensus 149 G~Ge~l~~l~~~~~~~llv~P~~~iST~~vy~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~NdLe~~a~---- 220 (287) T PRK03009 149 GIGEALTAVELPPRWFLVVTPRVHVPTAEIFSDESLTRDTKPITI---ADFLAQQ-TSDAGWPDSFGRNDLQPVVT---- 220 (287) T ss_pred CCCCCCEECCCCCCEEEEECCCCCCCHHHHHHCCHHCCCCCCCCH---HHHHHHH-CCCHHHHHHHCCCCCHHHHH---- T ss_conf 588464074789747999859998467999745020256776668---7887523-32455678754787589998---- Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 899998616787799999999997199459972689975489998466532689999999999779789994145456 Q gi|254781017|r 243 QQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHST 320 (324) Q Consensus 243 ~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 320 (324) + -.|++.++.+.+... + +++|||+|. ||++|.+++.+++ +..+.+. .++..+-+++=+.. T Consensus 221 -----~----~~PeI~~i~~~l~~~-~--~a~MSGSGS--t~Fglf~~~~~A~---~a~~~l~-~~~~~~v~~~l~~~ 280 (287) T PRK03009 221 -----S----KYAEVARAVEWLYNL-T--PARMTGSGA--CVFAAFKSKAEAE---AAQAKLP-AGWNGAVAESLSEH 280 (287) T ss_pred -----H----HCHHHHHHHHHHHCC-C--CEEEECCCC--EEEEEECCHHHHH---HHHHHCC-CCCEEEEECCCCCC T ss_conf -----7----488999999998678-9--718777150--2889989999999---9998688-76659991798768 No 30 >PRK01726 consensus Probab=99.89 E-value=1e-20 Score=150.21 Aligned_cols=256 Identities=15% Similarity=0.143 Sum_probs=172.2 Q ss_pred CEEEEECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHH Q ss_conf 78999717049997---------435330894289877677289999986898499995364444656612015089999 Q gi|254781017|r 6 HKICVSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFI 76 (324) Q Consensus 6 ~ki~~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i 76 (324) +++.++||+||||+ |.|-. ..+-..||.+..+++++++++.+.+........ +..+-..+.+ T Consensus 20 k~~~~~aPAKINL~L~V~gkR~DGYH~L----~Sl~~~idl~D~i~i~~~~~~~i~~~~~~~~i~-----~~~Nli~kA~ 90 (312) T PRK01726 20 KPLRFPSPAKLNLFLYINGKRPDGYHEL----QTLFQFLDFGDWLDISIREDNQIVLTPEIPNLK-----TEDNLIYRAA 90 (312) T ss_pred CEEEEECCEEECCCCCCCCCCCCCCCEE----EEEEEEECCCCEEEEEECCCCCEEEECCCCCCC-----CCCCHHHHHH T ss_conf 7788746304804502078289998526----899999468969999996899789936789998-----8675999999 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHC Q ss_conf 99999718778738999826875354887402456655677764045666601455788887765214688754477662 Q gi|254781017|r 77 IMAINHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIH 156 (324) Q Consensus 77 ~~~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~ 156 (324) ....+......|++|.+..+||+++|||+-||-+++++++|+.+ ++.+++.+++.++|..+ |.|+..+.+ T Consensus 91 ~~l~~~~~~~~~~~I~l~KnIPvgAGLGGGSSdAAavL~~Ln~l-~~l~ls~~~L~~ia~~i---------GADVPffl~ 160 (312) T PRK01726 91 KLLQQKTNCQLGANIHLDKILPMGGGVGGGSSNAATALVALNYL-WQTNLSIDELAKLGLTL---------GADVPIFVH 160 (312) T ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCEEEEEE T ss_conf 99999769987638999737866567775434699999999998-44699989999999861---------898166861 Q ss_pred CCEEEEEECCCEEEEEECCC-CCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHH Q ss_conf 98589985275156730477-75267658417989999888998860823489999999840332078898730486445 Q gi|254781017|r 157 GGLICYQMPKYSIEKIDFIF-PIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKV 235 (324) Q Consensus 157 Gg~i~~~~~~~~~~~~~~~~-~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~ 235 (324) |+.-+.+.....+.+++.+. .++|++.+...+|+++-+.... ....+. +.+...+ ..-..+|++. T Consensus 161 ~~~a~~~GiGE~l~~i~~~~~~~lLv~P~i~vST~~vf~~~~l-~~~~~~-----~~~~~~l--------~~~~~NDLE~ 226 (312) T PRK01726 161 GHAAFAEGVGEKITYCEPPEKWFVVLKPDDSISTAVIFQDPNL-PRNTPK-----KSLEQLL--------SEPYKNDCEK 226 (312) T ss_pred CCCEEEECCCCEEEECCCCCCEEEEECCCCCCCCHHHHHCCCC-CCCCCH-----HHHHHHH--------HHHHCCCHHH T ss_conf 7778998898682066777666999899998687799709675-547631-----1499998--------6061164489 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 68999998999986167877999999999971994599726899754899984665326899999999997797899941 Q gi|254781017|r 236 LAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPIT 315 (324) Q Consensus 236 l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 315 (324) ..- + -.|++.++.+.+.+.+ + ++|||+|. ||+++...+.+++. +.+.+. .+++.+.++ T Consensus 227 ~a~----------~---~~PeI~~i~~~L~~~~-~--arMSGSGS--T~FglF~sk~eA~k---i~~~~p-~~~~~~~~k 284 (312) T PRK01726 227 VVI----------N---HYSEVEEALNWLLQYA-P--ARLTGTGA--CVFAEFDHEASAQA---VFRQKP-EAFFGFVAK 284 (312) T ss_pred HHH----------H---HCHHHHHHHHHHHHCC-C--CEEECCCC--EEEEEECCHHHHHH---HHHHCC-CCCEEEECC T ss_conf 999----------8---6899999999986138-9--83675380--49999799999999---998689-764699836 Q ss_pred C Q ss_conf 4 Q gi|254781017|r 316 P 316 (324) Q Consensus 316 ~ 316 (324) + T Consensus 285 ~ 285 (312) T PRK01726 285 G 285 (312) T ss_pred C T ss_conf 8 No 31 >PRK00343 ipk 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.89 E-value=1.1e-20 Score=149.96 Aligned_cols=257 Identities=16% Similarity=0.159 Sum_probs=172.4 Q ss_pred EEEEECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHH Q ss_conf 8999717049997---------4353308942898776772899999868984999953644446566120150899999 Q gi|254781017|r 7 KICVSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFII 77 (324) Q Consensus 7 ki~~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~ 77 (324) .++++||+||||+ |-|-. ..+-..||.+..++++++++..+.+......... ..+-..+.+. T Consensus 2 ~~~~~aPAKINL~L~V~~kr~DGyH~i----~Sl~~~idl~D~i~i~~~~~~~i~~~~~~~~i~~-----~~Nlv~ka~~ 72 (279) T PRK00343 2 LLDWPAPAKLNLFLHITGRRADGYHEL----QTLFQFLDYGDTLHFEPRDDGQIRLLTEIPGVPE-----EDNLIVRAAR 72 (279) T ss_pred CCCCCCCEEEEECCCCCCCCCCCCCEE----EEEEEECCCCCEEEEEECCCCCEEEECCCCCCCC-----CCCHHHHHHH T ss_conf 766365002730322387089998625----7899982788399999988996899468777986-----4759999999 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 99997187787389998268753548874024566556777640456666014557888877652146887544776629 Q gi|254781017|r 78 MAINHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHG 157 (324) Q Consensus 78 ~~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~G 157 (324) ..-++.....|++|++..+||+++|||+.||-+++++++++.+ ++.+++.+++.++|..+ |.|+..+.+| T Consensus 73 ~l~~~~~~~~~~~I~l~KnIPv~AGLGGGSSnAAa~L~~Ln~l-~~l~ls~~~l~~ia~~l---------GaDVPffl~~ 142 (279) T PRK00343 73 LLQKATGTPLGADISLDKRLPMGGGLGGGSSDAATTLVALNRL-WQLGLSRDELAALGLKL---------GADVPVFVRG 142 (279) T ss_pred HHHHHHCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCEEEEECC T ss_conf 9999869999749999965876356788871289999999997-46678999999998751---------8977778608 Q ss_pred CEEEEEECCCEEEEEECC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHH Q ss_conf 858998527515673047-7752676584179899998889988608234899999998403320788987304864456 Q gi|254781017|r 158 GLICYQMPKYSIEKIDFI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVL 236 (324) Q Consensus 158 g~i~~~~~~~~~~~~~~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l 236 (324) +..+.+.....+.+++++ ..++|++.+...+|+++-+..... +..+. ..+ +........+|++.. T Consensus 143 ~~a~~~G~Ge~l~~~~~~~~~~llv~P~~~vST~~vf~~~~l~-~~~~~-----~~~--------~~~~~~~~~NDLe~~ 208 (279) T PRK00343 143 HAAFAEGVGEKLTPVELPEKWYLVVKPGVHVSTAEIFSDPELT-RDTPK-----RTI--------ADFLAGPGRNDCEPV 208 (279) T ss_pred CCEEEEECCCEEEECCCCCCEEEEECCCCCCCHHHHHHCCCCC-CCCCC-----CCH--------HHHHHCHHCCCHHHH T ss_conf 7689980686877756787479998799995619998382455-47730-----069--------999843121858999 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 89999989999861678779999999999719945997268997548999846653268999999999977978999414 Q gi|254781017|r 237 AQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 237 ~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 316 (324) .. + -.|++.++.+.+.+. + .++|||+|. |++++.+++.+++ ...+.+ .+++..+.+++ T Consensus 209 a~----------~---~~PeI~~~~~~L~~~-~--~~~MSGSGS--t~Fglf~~~~~A~---~a~~~l-~~~~~~f~~k~ 266 (279) T PRK00343 209 VR----------K---RYPEVAQALSWLLEY-A--PARMTGTGA--CVFAEFDTEAEAE---AVLAQL-PEGWNGFVAKG 266 (279) T ss_pred HH----------H---HCHHHHHHHHHHHHC-C--CCEEECCCC--EEEEEECCHHHHH---HHHHHC-CCCCCEEEECC T ss_conf 98----------8---698999999998602-8--874654222--5889979999999---999864-74487899478 Q ss_pred CC Q ss_conf 54 Q gi|254781017|r 317 SH 318 (324) Q Consensus 317 ~~ 318 (324) =+ T Consensus 267 l~ 268 (279) T PRK00343 267 LN 268 (279) T ss_pred CC T ss_conf 87 No 32 >COG1947 IspE 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase [Lipid metabolism] Probab=99.88 E-value=7e-20 Score=144.98 Aligned_cols=259 Identities=17% Similarity=0.137 Sum_probs=168.7 Q ss_pred CCEEEEECCEEEEEE----CCCEEEC-CCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHH Q ss_conf 878999717049997----4353308-94289877677289999986898499995364444656612015089999999 Q gi|254781017|r 5 LHKICVSAPGSLVLM----GEHGVLH-GHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMA 79 (324) Q Consensus 5 ~~ki~~~aPGKv~L~----GEh~v~~-G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~ 79 (324) |.++.+.||+|+||+ |-..|=| --..+-..|+.+..++++.++++.+.+.... ...+..+. +-..+..... T Consensus 1 ~~~~~~~apAKiNL~L~V~gkr~DGYHel~sl~~~id~~D~l~i~~~~~~~~~~~~~~---~~~lp~~~-NLv~rAa~ll 76 (289) T COG1947 1 MMSTKFPAPAKINLFLHVTGKRADGYHELETLFQFIDLGDELTIRPRDDDGFIVLGTF---ADGLPTDE-NLVYRAAELL 76 (289) T ss_pred CCCEEEECCCEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEEEECCCCCCEEECCC---CCCCCCCC-HHHHHHHHHH T ss_conf 9636872154488888853558998434589999941487899998778872696677---77887743-1999999999 Q ss_pred HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCE Q ss_conf 99718778738999826875354887402456655677764045666601455788887765214688754477662985 Q gi|254781017|r 80 INHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGL 159 (324) Q Consensus 80 l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~ 159 (324) .++.....|++|.++.+||+++|||.-||-+.+++++|+.+ |+..++.+++..++..+ |.|+..+.+|+. T Consensus 77 ~~~~~~~~~v~I~l~K~IPv~aGLGGGSSdAAa~L~~Ln~l-w~~~ls~~eL~~Lg~~L---------GaDVPffl~g~t 146 (289) T COG1947 77 RKRTGIAGGVSIHLDKNIPVGAGLGGGSSDAAAVLVALNEL-WGLGLSLEELAELGLRL---------GADVPFFLSGGT 146 (289) T ss_pred HHHHCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCCCEEEECCC T ss_conf 99868999827999836865576766457899999999998-67999999999999986---------898671431885 Q ss_pred EEEEECCCEEEEEE-CC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHH Q ss_conf 89985275156730-47-77526765841798999988899886082348999999984033207889873048644568 Q gi|254781017|r 160 ICYQMPKYSIEKID-FI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLA 237 (324) Q Consensus 160 i~~~~~~~~~~~~~-~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~ 237 (324) .+-......+.+++ .+ ..++++..+...+|+++-+ ........|....... ......-+.......+|++... T Consensus 147 A~a~G~GE~l~~~~~~~~~~~vl~~P~v~vsT~~vy~-~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~NdLe~~~ 221 (289) T COG1947 147 AFAEGRGEKLEPLEDPPEKWYVLAKPGVGVSTKEVYK-DPELTRNTPKSEPLIA----ALSLENLKQIAPFLINDLEKVA 221 (289) T ss_pred EEEEECCCEEEECCCCCCCEEEEEECCCCCCHHHHHC-CCCCCCCCCCCHHHHH----HHHHHHHHHHHHHCCCCHHHHH T ss_conf 6988736351477778875299995898998799970-8675646677224558----7865527764102036457889 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHH Q ss_conf 9999989999861678779999999999719945997268997548999846653268999 Q gi|254781017|r 238 QAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQ 298 (324) Q Consensus 238 ~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~ 298 (324) . + ..|++.+....+.+ .|+..++|||+|. |++++++.+.+++... T Consensus 222 ~----------~---~~p~v~~~~~~l~~-~ga~~~~mSGSGs--tvF~l~~~~~~a~~~~ 266 (289) T COG1947 222 L----------R---LYPEVKEALSELLE-YGALPARMSGSGS--TVFALFDTEKEAQRVA 266 (289) T ss_pred H----------H---HCHHHHHHHHHHHH-CCCCCCEEECCCC--CEEEEECCHHHHHHHH T ss_conf 8----------8---58689999999764-1354436824787--2899938768899999 No 33 >PRK04943 consensus Probab=99.88 E-value=1.8e-20 Score=148.74 Aligned_cols=262 Identities=13% Similarity=0.106 Sum_probs=173.9 Q ss_pred CCCEEEEECCEEEEEEC----CCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHH Q ss_conf 88789997170499974----3533-089428987767728999998689849999536444465661201508999999 Q gi|254781017|r 4 CLHKICVSAPGSLVLMG----EHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIM 78 (324) Q Consensus 4 ~~~ki~~~aPGKv~L~G----Eh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~ 78 (324) .|.-+..+||+||||+= -..| ||.-..+-..||++.++++++++...+.+.....+... + .+-..+.+.. T Consensus 2 ~~~~~~~~aPAKINL~L~V~~kr~DGYH~i~S~~~~idl~D~l~i~~~~~~~i~~~~~~~~i~~----~-~Nli~kA~~~ 76 (288) T PRK04943 2 ITGTTRWPSPAKLNLFLYINGRTENGYHELQTLFQFLDHGDELTITANNSGNITLSPAIEGVPL----E-DNLIWKAATA 76 (288) T ss_pred CCCCEEEECCEEEECCCCCCCCCCCCCCEEEEEEEEECCCEEEEEEECCCCCEEEECCCCCCCC----C-CCHHHHHHHH T ss_conf 7787487131168234023887999987157899981675099999989998898178678987----6-7499999999 Q ss_pred HHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCC Q ss_conf 99971877873899982687535488740245665567776404566660145578888776521468875447766298 Q gi|254781017|r 79 AINHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGG 158 (324) Q Consensus 79 ~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg 158 (324) .-+.+....|++|+++.+||+++|||+.||.+++++++++.+ ++.+++.+++.++|..+ |.|+..+.+|+ T Consensus 77 l~~~~~~~~~~~I~l~KnIPv~AGLGGGSSnAAa~L~~Ln~l-~~l~ls~~~l~~ia~~l---------GADVPffl~~~ 146 (288) T PRK04943 77 LQNAAQCSLGAHIELHKILPMGGGIGGGSSNAATTLVALNYL-WQTGLSDDELAEIGLAL---------GADVPVFVRGF 146 (288) T ss_pred HHHHHCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCEEEEECCC T ss_conf 999769998808999971787578887620699999999998-47999989999988743---------99857686177 Q ss_pred EEEEEECCCEEEEEECCCC-CCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHH Q ss_conf 5899852751567304777-526765841798999988899886082348999999984033207889873048644568 Q gi|254781017|r 159 LICYQMPKYSIEKIDFIFP-IHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLA 237 (324) Q Consensus 159 ~i~~~~~~~~~~~~~~~~~-l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~ 237 (324) ..+.+.....+.++..... ++|++.+...+|+++-+....... .+. .+. .........+|++... T Consensus 147 ~a~~~G~GE~l~~~~~~~~~~lLv~P~~~iST~~vf~~~~~~~~-~~~-~~~------------~~~~~~~~~NDLE~~~ 212 (288) T PRK04943 147 AAFAEGVGEKLSPATPEEKWYLVVRPNVSIATVDIFTHPDLTRN-TPK-RDL------------ETLLNTPYVNDCEKIV 212 (288) T ss_pred CEEEECCCCEEEECCCCCCEEEEECCCCCCCCHHHHCCCHHCCC-CCC-CCH------------HHHHHCHHCCCCHHHH T ss_conf 68998688574266777767999879999727788059300137-652-179------------9987360047857999 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 9999989999861678779999999999719945997268997548999846653268999999999977978999414 Q gi|254781017|r 238 QAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 238 ~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 316 (324) ..+ .|++.++.+.+.... +++|||+|. |++++.+.+.+++ +..+.+. ..++.+.+++ T Consensus 213 ~~~-------------~PeI~~~~~~l~~~~---~~~MSGSGs--tvFglf~s~~~A~---~~~~~l~-~~~~~f~~kg 269 (288) T PRK04943 213 RML-------------YPEVDKQLSWLLQYA---PSRLTGTGS--CVFAEFSSRSEAE---AILAQLS-DNVSAFVAQG 269 (288) T ss_pred HHH-------------CHHHHHHHHHHHCCC---CCEEECHHH--EEEEEECCHHHHH---HHHHHCC-CCCCEEEECC T ss_conf 986-------------889999999985659---871326405--5889989999999---9998689-7672899368 No 34 >PRK03288 consensus Probab=99.87 E-value=3.3e-20 Score=147.07 Aligned_cols=236 Identities=17% Similarity=0.186 Sum_probs=160.1 Q ss_pred EECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHH Q ss_conf 9717049997---------4353308942898776772899999868984999953644446566120150899999999 Q gi|254781017|r 10 VSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAI 80 (324) Q Consensus 10 ~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l 80 (324) -.||+||||+ |-|- -..+-..||.+..++++++++..+.+......... ..+-..+.+.... T Consensus 5 ~~APAKINL~L~V~gkR~DGYH~----i~S~~~~idl~D~i~i~~~~~~~i~~~~~~~~~~~-----~~Nlv~ka~~~l~ 75 (283) T PRK03288 5 WPSPAKLNLFLYITGQRADGYHT----LQTLFQFLDYGDTLTIEPRDDGEIRLLTPVEGVEN-----EDNLIVRAARLLM 75 (283) T ss_pred CCCCEEEECCCCCCCCCCCCCCE----EEEEEEECCCCCEEEEEECCCCCEEEECCCCCCCC-----CCCHHHHHHHHHH T ss_conf 78601581140338869999872----57899981788299999989997899558767877-----5418999999999 Q ss_pred HHH------CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 971------87787389998268753548874024566556777640456666014557888877652146887544776 Q gi|254781017|r 81 NHI------KPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAAS 154 (324) Q Consensus 81 ~~~------~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~ 154 (324) ++. ....|++|+++.+||+++|||+.||-++|++++++++ ++.+++.+++.++|..+ |.|+..+ T Consensus 76 ~~~~~~~~~~~~~g~~I~l~KnIPv~AGLGGGSSdAAa~L~~L~~l-~~l~ls~~~l~~ia~~i---------GaDVPff 145 (283) T PRK03288 76 KTAADSGRLPAGSGADISIDKRLPMGGGLGGGSSNAATVLVALNHL-WQCGLSDDELAELGLTL---------GADVPVF 145 (283) T ss_pred HHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCCCCE T ss_conf 9764402456666607999436764456787761599999999998-58999999999999985---------8983105 Q ss_pred HCCCEEEEEECCCEEEEEECC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCH Q ss_conf 629858998527515673047-7752676584179899998889988608234899999998403320788987304864 Q gi|254781017|r 155 IHGGLICYQMPKYSIEKIDFI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNL 233 (324) Q Consensus 155 ~~Gg~i~~~~~~~~~~~~~~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~ 233 (324) .+|+.-+.+.....+.+++.+ ..+++++.+...+|+++-+.... ....+. +. + +...+....+|+ T Consensus 146 l~~~~a~~~G~Ge~l~~~~~~~~~~llv~P~~~vST~~vy~~~~~-~~~~~~-----~~-------~-~~~l~~~~~Ndl 211 (283) T PRK03288 146 VRGHAAFAEGVGEILTPVDPPEKWYLVAHPGVSIPTPVIFKDPEL-PRNTPK-----RS-------I-ETLLKCEFSNDC 211 (283) T ss_pred ECCCCEEEECCCCEEEECCCCCCEEEEECCCCCCCHHHHHCCCCC-CCCCCC-----CC-------H-HHHHHCHHCCCH T ss_conf 427758998288681476778756999899999782898449444-678753-----48-------9-999737110646 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHH Q ss_conf 456899999899998616787799999999997199459972689975489998466532689 Q gi|254781017|r 234 KVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLP 296 (324) Q Consensus 234 ~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~ 296 (324) +.... . . .|++.++++.+.+.. +++|||+|. ||+++.+.+.++.. T Consensus 212 e~~a~--~-------~----~peI~~~~~~l~~~~---~~~MSGSGS--t~Fglf~~~~~A~k 256 (283) T PRK03288 212 EVIAR--K-------R----FREVDAALSWLLEYA---PSRLTGTGA--CVFAEFDTESEARQ 256 (283) T ss_pred HHHHH--H-------H----HHHHHHHHHHHHCCC---CCCEECHHH--EEEEEECCHHHHHH T ss_conf 99998--7-------5----599999999985539---871226404--17999899999999 No 35 >PRK04648 consensus Probab=99.85 E-value=4.6e-19 Score=139.82 Aligned_cols=257 Identities=13% Similarity=0.133 Sum_probs=163.9 Q ss_pred ECCEEEEEEC----CCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHCC Q ss_conf 7170499974----3533-0894289877677289999986898499995364444656612015089999999997187 Q gi|254781017|r 11 SAPGSLVLMG----EHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIKP 85 (324) Q Consensus 11 ~aPGKv~L~G----Eh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~ 85 (324) .||+||||+= --.+ ||.-..+-..||.+.+++++++++..+......... +. +..+-..+.+...-+.... T Consensus 18 ~APAKINL~L~V~gkr~DGYH~l~Sl~~~idl~D~l~i~~~~~~~~~~~~~~~~~---i~-~~~Nlv~ka~~~l~~~~~~ 93 (295) T PRK04648 18 PAPAKLNLFLQITGRRADGYHLLQTVFRLLDWGDTIHLRVRSDGQIHRIGESLPG---VA-EDDDLVVRAARLLQSAAGT 93 (295) T ss_pred CCCCEEECCCCCCCCCCCCCCEEEEEEEECCCCEEEEEEECCCCCEEEECCCCCC---CC-CCCHHHHHHHHHHHHHHCC T ss_conf 5621370251328879999872568999837880999998899947993688888---99-9774999999999996599 Q ss_pred CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEEEC Q ss_conf 78738999826875354887402456655677764045666601455788887765214688754477662985899852 Q gi|254781017|r 86 SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQMP 165 (324) Q Consensus 86 ~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~~~ 165 (324) ..+++|.++.+||+++|||+-||-++|++++|+++ ++.+++.+++.++|..+ |.|+..+.+|+..+.+.. T Consensus 94 ~~~~~I~l~K~IPvgAGLGGGSSnAAa~L~~Ln~~-~~l~ls~~~L~~ia~~l---------GaDVPffl~~~~a~~~G~ 163 (295) T PRK04648 94 ALGAEIRVDKRIPAGGGFGGGSSDAATVLVALNAL-WGLGLDVDTLAELGLRL---------GADVPVFVRGHNAWAEGV 163 (295) T ss_pred CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH---------CCCCEEEEECCCEEEEEC T ss_conf 98749999806866756776546899999999998-58999989999999964---------899315874776899967 Q ss_pred CCEEEEEECC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 7515673047-775267658417989999888998860823489999999840332078898730486445689999989 Q gi|254781017|r 166 KYSIEKIDFI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQ 244 (324) Q Consensus 166 ~~~~~~~~~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~ 244 (324) ...+.+++++ ..++|++.+...+|+++-+.....+...+. .+.+.....+..+|++.... T Consensus 164 GE~l~~~~~~~~~~lLv~P~~~vST~~vy~~~~~~~~~~~~-------------~~~~~~~~~~~~NdlE~~~~------ 224 (295) T PRK04648 164 GEQLTPISLPEAAYLLVDPGVHVPTPVLFRSQELTRDAAPA-------------KIADFASGSLLDNAFEPVLR------ 224 (295) T ss_pred CCEEEECCCCCCEEEEECCCCCCCHHHHHHCCCCCCCCCCC-------------CHHHHHHCCHHHCHHHHHHH------ T ss_conf 83635867788579998899996659997383445666645-------------56777628431181689998------ Q ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 999861678779999999999719945997268997548999846653268999999999977978999414 Q gi|254781017|r 245 GLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 245 ~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 316 (324) + ..|++.++.+.+.+. | .++|||+|. ||+++..++.++. +..+++.+ .+..+-+++ T Consensus 225 ----~---~~P~I~~i~~~L~~~-g--~a~MSGSGS--t~FglF~~~~~A~---~a~~~l~~-~~~~w~v~~ 280 (295) T PRK04648 225 ----R---REPAVEAVFQALSRV-G--TPRLTGSGS--GCFVEFATRAAAE---QALAQLPG-GLRAWVVEG 280 (295) T ss_pred ----H---CCHHHHHHHHHHHCC-C--CCCEEEECC--CEEEEECCHHHHH---HHHHHCCC-CCCEEEECC T ss_conf ----6---288999999998605-9--865795352--1389989999999---99996345-288699745 No 36 >PRK00650 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.84 E-value=3.8e-18 Score=133.99 Aligned_cols=237 Identities=15% Similarity=0.113 Sum_probs=161.6 Q ss_pred EEECCEEEEEE----CCCEE-ECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHH Q ss_conf 99717049997----43533-08942898776772899999868984999953644446566120150899999999971 Q gi|254781017|r 9 CVSAPGSLVLM----GEHGV-LHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHI 83 (324) Q Consensus 9 ~~~aPGKv~L~----GEh~v-~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ 83 (324) ...||+||||+ |-..| ||--..+-..||.+..+++++.++..+.+... .. ...+ .+-.++.+....++. T Consensus 2 ~~~aPAKINL~L~V~~kr~DGYH~l~Sl~~~idl~D~i~i~~~~~~~~~~~~~-~~--~~~~---~Nli~kA~~~l~~~~ 75 (288) T PRK00650 2 HFFSPAKLNLFLQLLGKREDGFHEIVTRYQAIDFGDQLSLSISSRDSLQVINL-CE--LETP---QNSIWKSVALFRDYT 75 (288) T ss_pred CEECCEEEECCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEECC-CC--CCCC---HHHHHHHHHHHHHHH T ss_conf 56077107325045887999988157999996789899999889996898389-87--8994---569999999999974 Q ss_pred CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEEE Q ss_conf 87787389998268753548874024566556777640456666014557888877652146887544776629858998 Q gi|254781017|r 84 KPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICYQ 163 (324) Q Consensus 84 ~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~~ 163 (324) ....+++|+++.+||+++|||+.||-+++++++|+.+ ++..++.+++.++|..+ |.|+..+.+|+..+.. T Consensus 76 ~~~~~~~I~l~KnIPvgAGLGGGSSnAAavL~~Ln~l-~~l~l~~~~L~~ia~~l---------GADVPffl~~~~a~~~ 145 (288) T PRK00650 76 GITTPVSWRVVKQIPIGAGLAGGSSNAATALFALNQH-FQTGLSDEELRSLAEKI---------GMDTPFFFSSGSALGV 145 (288) T ss_pred CCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCCCEEECCCCEEEE T ss_conf 9998737999957976556761037899999999998-55898999999999872---------9981407538758999 Q ss_pred ECCCEEEEEEC--CCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 52751567304--7775267658417989999888998860823489999999840332078898730486445689999 Q gi|254781017|r 164 MPKYSIEKIDF--IFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMN 241 (324) Q Consensus 164 ~~~~~~~~~~~--~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~ 241 (324) .....+.+++. ...+++++.+...+|+++-+.++ |......+. .+ ..++|++... +. T Consensus 146 G~GE~i~~l~~~~~~~~vLv~p~~~vsT~~vf~~~~------~~~~~~~~~------------l~-~~~NDLE~~a--~~ 204 (288) T PRK00650 146 GRGEKILALEESVSDRYVLYFSDQGVLTSDAFAYVQ------PSDCSSRKN------------LE-YTQNDLEKPV--FR 204 (288) T ss_pred ECCCEEEECCCCCCCEEEEECCCCCCCCHHHHHHCC------CCCCCHHHH------------HH-HCCCCCHHHH--HH T ss_conf 379677788767765199991998977669875049------100663666------------65-4589788999--98 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHH--CCCCEEEECCCCCCCCEEEEEECCCCCHH Q ss_conf 9899998616787799999999997--19945997268997548999846653268 Q gi|254781017|r 242 RQQGLLETLGVSDSKLSEIVWKLRE--QPHIMASKISGSGLGDCVIALGKGDLNSL 295 (324) Q Consensus 242 ~~~~~L~~lgvs~~~l~~li~~~~~--~~g~~~aklsGAG~GG~~i~l~~~~~~~~ 295 (324) -.|++.++.+.+.+ .+++..++|||+|. |++++-++..+.. T Consensus 205 -----------~~P~i~~ik~~L~~l~a~~~~~~rMSGSGs--tvFa~F~~~~e~d 247 (288) T PRK00650 205 -----------LRLDLKEKKHWLESLWSPFPVHVGLTGSGA--TLFVRYPRILEED 247 (288) T ss_pred -----------HCHHHHHHHHHHHHCCCCCCCEEEEECCCC--EEEEEECCCCCCC T ss_conf -----------597999999999964698776588873130--4999908811018 No 37 >KOG4644 consensus Probab=99.82 E-value=3.2e-17 Score=128.14 Aligned_cols=294 Identities=18% Similarity=0.237 Sum_probs=176.4 Q ss_pred CCCCCEEEEECCEEEEEECCCEEE-------CCCEEEEEEECCE----EEE------------EEEECCCCE-EEEEEC- Q ss_conf 988878999717049997435330-------8942898776772----899------------999868984-999953- Q gi|254781017|r 2 GQCLHKICVSAPGSLVLMGEHGVL-------HGHAALVFAINKR----VIL------------YLTLRKDRL-INIDSS- 56 (324) Q Consensus 2 ~~~~~ki~~~aPGKv~L~GEh~v~-------~G~~ai~~aI~~~----~~~------------~~~~~~~~~-i~i~~~- 56 (324) |+-|.-|.+.||.||.+||...|. ..+.++.+||... +.. .+.++.|.. +++.+. T Consensus 566 g~g~s~Viae~PaRiDF~GGW~DTPPiafel~n~AVlglAiklDGk~PIga~a~kI~ePelwlai~~rQDel~V~I~crc 645 (948) T KOG4644 566 GAGPSTVIAEAPARIDFFGGWLDTPPIAFELDNAAVLGLAIKLDGKNPIGAFAEKIDEPELWLAIEIRQDELFVHIKCRC 645 (948) T ss_pred CCCCCEEEEECCEEEEECCCCCCCCCEEEECCCCCEEEEEEEECCCCCCCHHHHCCCCCHHEEEEEEECCCEEEEEEEEH T ss_conf 99986389956556652266567997057616531134688734889621466518971020113564264189998500 Q ss_pred ---CCCCCCCCCHHHC-CC-H----------HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf ---6444465661201-50-8----------9999999997187787389998268753548874024566556777640 Q gi|254781017|r 57 ---LGQYCGSLDLAMF-HP-S----------FSFIIMAINHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQ 121 (324) Q Consensus 57 ---~~~~~~~~~~~~~-~~-~----------~~~i~~~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~ 121 (324) ..++...-..... .. . +-.+...+.+. .-+||+|...|++|.|+|||+||-+....++|++.+. T Consensus 646 laDlrD~cqpHa~gal~~aafiCA~IVhl~sel~i~d~~~k~-f~~GfeihT~SdLPHGSGLGTSSIlA~TaLaAi~~aa 724 (948) T KOG4644 646 LADLRDLCQPHAKGALEAAAFICACIVHLGSELNILDIFEKL-FCCGFEIHTSSDLPHGSGLGTSSILACTALAAICAAA 724 (948) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHEEEEECCCHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 101577625578760555424411333203143399999997-3475473032458887776568889899999999860 Q ss_pred CCCCCCCHH----HHHHHHHHHHHHCCCCCC-HHHHHHHCCCEEE---EEECCCEEEEEE----------CCCCCCEEEC Q ss_conf 456666014----557888877652146887-5447766298589---985275156730----------4777526765 Q gi|254781017|r 122 YHKEPSPDE----ILTTAHAIVLKVQGISSG-IDLAASIHGGLIC---YQMPKYSIEKID----------FIFPIHLIYS 183 (324) Q Consensus 122 ~~~~~~~~~----i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg~i~---~~~~~~~~~~~~----------~~~~l~l~~t 183 (324) ......+. +.......|+.. ..++| +||.-..|.|+-. +--.+..++..+ +.-+++++|| T Consensus 725 -gr~~gTeaLiHailHtvlrlEQil-TTGGGWQDQ~G~im~GIK~gr~rael~~~ie~eeiTipe~f~ekL~dhLLLVYT 802 (948) T KOG4644 725 -GRADGTEALIHAILHTVLRLEQIL-TTGGGWQDQCGAIMEGIKKGRCRAELNHGIEHEEITIPEEFREKLEDHLLLVYT 802 (948) T ss_pred -CCCCCHHHHHHHHHHHHHHHHHHH-HCCCCHHHHCCCHHHHHHHCCCHHHCCCCCEEEEECCCHHHHHHHHHCEEEEEE T ss_conf -544563676899999999999886-237853653051221012032021256774352101778999997620799872 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH--CCCCHHHHHHH Q ss_conf 8417989999888998860823489999999840332078898730486445689999989999861--67877999999 Q gi|254781017|r 184 GYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLLETL--GVSDSKLSEIV 261 (324) Q Consensus 184 g~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l--gvs~~~l~~li 261 (324) |..+..+.+...|-+ +.+.+.++..++.-. +.+-+++..+.++++.++.+++++..+++..+-| |..++.+.++. T Consensus 803 GKTRLAkNLLQdViR--n~far~~a~~Q~ah~-l~~~tdecAegf~kGsl~LlgecL~~YweqKk~MapgCEPl~Vr~ll 879 (948) T KOG4644 803 GKTRLAKNLLQDVIR--NFFARCKATKQKAHK-LAEATDECAEGFEKGSLELLGECLEHYWEQKKFMAPGCEPLNVRELL 879 (948) T ss_pred CCHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 721889999999999--999766899999999-99999999998862758789999999987651047899997489999 Q ss_pred HHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHH Q ss_conf 9999719945997268997548999846653268999999 Q gi|254781017|r 262 WKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVN 301 (324) Q Consensus 262 ~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~ 301 (324) +.++.+...-..-..||||||++..|.+.-...+.+|.+- T Consensus 880 dmLaph~hgesgw~AGAGGGGFiYLl~kEpqqkeaiEa~L 919 (948) T KOG4644 880 DMLAPHKHGESGWAAGAGGGGFIYLLIKEPQQKEAIEAFL 919 (948) T ss_pred HHHCCCCCCCCCHHCCCCCCCEEEEEECCCCCHHHHHHHH T ss_conf 8746001132120015788737999965889978999860 No 38 >TIGR00154 ispE 4-diphosphocytidyl-2C-methyl-D-erythritol kinase; InterPro: IPR004424 4-diphosphocytidyl-2C-methyl-D-erythritol kinase is a member of the family of GHMP kinases that were previously designated as conserved hypothetical protein YchB or as isopentenyl monophosphate kinase. In Lycopersicon esculentum (tomato) and Escherichia coli the protein has been indentified as 4-diphosphocytidyl-2C-methyl-D-erythritol kinase, an enzyme of the deoxyxylulose phosphate pathway of terpenoid biosynthesis.; GO: 0050515 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity, 0016114 terpenoid biosynthetic process. Probab=99.79 E-value=1.2e-18 Score=137.23 Aligned_cols=265 Identities=19% Similarity=0.243 Sum_probs=173.5 Q ss_pred CCEEEEECCEEEEEE---------CCCEEECCCEEEEEEECCEEEEEEEECC-CCEEEEEECCCCCCCCCCHHHCCCHHH Q ss_conf 878999717049997---------4353308942898776772899999868-984999953644446566120150899 Q gi|254781017|r 5 LHKICVSAPGSLVLM---------GEHGVLHGHAALVFAINKRVILYLTLRK-DRLINIDSSLGQYCGSLDLAMFHPSFS 74 (324) Q Consensus 5 ~~ki~~~aPGKv~L~---------GEh~v~~G~~ai~~aI~~~~~~~~~~~~-~~~i~i~~~~~~~~~~~~~~~~~~~~~ 74 (324) |.++...||.||||| |-|-+. .+-..|+....+.++.++ ++.+++.+...+...... +....+.+ T Consensus 1 ~~~~~~~~PAK~NLFL~I~~krpdGYH~L~----~~~~~~~~gD~i~i~~~~q~~~i~l~~~~~~~~~e~~-NL~yrAa~ 75 (322) T TIGR00154 1 MMRIVFPAPAKINLFLYILGKRPDGYHELQ----MLMQFIDLGDKIEISVRSQDDDIRLSKGDSDVPDEER-NLIYRAAK 75 (322) T ss_pred CCCCEECCCCCCCCHHHCCCCCCCCCEEHH----HHHHHHHCCCEEEEEEECCCCCEEECCCCCCCCCCHH-HHHHHHHH T ss_conf 986100488732111310167998523133----3334562167379998535784787377898886201-06999999 Q ss_pred HHHHHHHH-------HCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 99999997-------18778738999826875354887402456655677764045666601455788887765214688 Q gi|254781017|r 75 FIIMAINH-------IKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISS 147 (324) Q Consensus 75 ~i~~~l~~-------~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~s 147 (324) .+...++. +....|++|.|+..||+++|||+-||-+.+++.+|+.+ |+..++.+|+++++..+ T Consensus 76 Ll~~~~~~~nfsH~~~~~~~G~~I~~~K~iP~~aGLgGGSsdAA~~L~~LN~L-W~~~LS~~EL~~lG~~l--------- 145 (322) T TIGR00154 76 LLKNKANSKNFSHSVIKVKKGVNIEITKVIPMAAGLGGGSSDAAAVLVGLNQL-WNLNLSLEELAELGATL--------- 145 (322) T ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHH-HHCCCCHHHHHHHHHHH--------- T ss_conf 99985100233343012567614898641466577654045799999987677-62168878999998874--------- Q ss_pred CHHHHHHHCC-CEEEEEECCCEEEEEE--CCCCCCE-EEC-CCCCCHHHHHHHHH-HHHHHC-CCH--HHHHHHHHHHHC Q ss_conf 7544776629-8589985275156730--4777526-765-84179899998889-988608-234--899999998403 Q gi|254781017|r 148 GIDLAASIHG-GLICYQMPKYSIEKID--FIFPIHL-IYS-GYKTPTAQVLKKIS-YIEIEY-PEI--NEINQKIYALMG 218 (324) Q Consensus 148 G~D~~a~~~G-g~i~~~~~~~~~~~~~--~~~~l~l-~~t-g~~~~T~~~v~~v~-~~~~~~-~~~--~~~~~~~~~~~~ 218 (324) |.|+-.|++| +.-+=......+.+++ .+..+++ .-. .+..||+.+-+..+ +.-++| |+. ....++.+.. + T Consensus 146 GaDvpffv~GY~~A~AtG~GE~i~~~~e~P~e~W~~~~kP~~~~~ST~~vyq~~~qkfGknyLp~~~~~~~~~ktiha-G 224 (322) T TIGR00154 146 GADVPFFVSGYGTAFATGRGEIITPLEEEPPEKWVVIAKPRSVSVSTAVVYQAYKQKFGKNYLPRNTPKRAKEKTIHA-G 224 (322) T ss_pred CCCCEEEECCCCEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHC-C T ss_conf 698307863633121123442505788898740388972798532738987124553152247877765677632002-5 Q ss_pred CCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHCCCCEEEE-CCCCCCCCEEEEEECCCCCHH Q ss_conf 3207889873048644568999998999986167-8779999999999719945997-268997548999846653268 Q gi|254781017|r 219 KLSQISCQALRNKNLKVLAQAMNRQQGLLETLGV-SDSKLSEIVWKLREQPHIMASK-ISGSGLGDCVIALGKGDLNSL 295 (324) Q Consensus 219 ~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~lgv-s~~~l~~li~~~~~~~g~~~ak-lsGAG~GG~~i~l~~~~~~~~ 295 (324) -+ +++++. +.+..+-...++.|++.-+ ..++++.+.+.+.+. |+.+.+ |||+|. |+++|++.+.+.+ T Consensus 225 ~~----l~~i~~---~~~~~~~~~~~NDlEkv~l~~~~~V~~~l~~l~~~-g~~~~~f~sGsG~--~vF~l~~~e~ea~ 293 (322) T TIGR00154 225 GL----LKKIEE---KLLQLLDSNLKNDLEKVALDAHTEVAQALNWLLEY-GLAPERFLSGSGP--CVFALFDEEAEAE 293 (322) T ss_pred CH----HHHHHH---HHHHHHHHHCCCCCCHHHHCCCCCHHHHHHHHHHH-CCCCEEEEECCCH--HHHHCCHHHHHHH T ss_conf 10----245667---65776654047720200001576189999999860-5884257644854--6642151146789 No 39 >PRK05905 hypothetical protein; Provisional Probab=99.79 E-value=3.3e-17 Score=128.09 Aligned_cols=243 Identities=12% Similarity=0.053 Sum_probs=149.8 Q ss_pred EEECCEEEEEECC----CEEECCCE---EEEEEE-CCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHH Q ss_conf 9971704999743----53308942---898776-772899999868984999953644446566120150899999999 Q gi|254781017|r 9 CVSAPGSLVLMGE----HGVLHGHA---ALVFAI-NKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAI 80 (324) Q Consensus 9 ~~~aPGKv~L~GE----h~v~~G~~---ai~~aI-~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l 80 (324) ..+||+||||+=. -.+ +|+. .+-+.| |++..+++++.+.....+......... . ...+..+......+ T Consensus 2 k~ks~AKINL~L~I~~kr~d-~GyH~l~S~f~~i~~l~D~i~i~~~~~~~~~i~~~~~~~~~-~--~~~~ni~~ka~~~l 77 (258) T PRK05905 2 KYKSYAKINLGLSIYKKCKK-VTKHKLESIFILVENVYDDIEIEKIEKNIDDIHYFDETNEI-L--VYSRLILVKTLEWL 77 (258) T ss_pred CCCCCCCCCCCEEECCCCCC-CCCEEEEEEEEEECCCCEEEEEEECCCCCCEEEECCCCCCC-C--CCHHHHHHHHHHHH T ss_conf 36535507145224877889-99668999999937887299999877998759970887667-7--82469999999999 Q ss_pred -HHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCC-C Q ss_conf -97187787389998268753548874024566556777640456666014557888877652146887544776629-8 Q gi|254781017|r 81 -NHIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHG-G 158 (324) Q Consensus 81 -~~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~G-g 158 (324) +......+++|+++.+||+|+|||+.||-+++++++|+++. .++..++.++|..+ |.|+..+.+| + T Consensus 78 ~~~~~~~~~~~I~l~K~IP~gAGLGGGSSnAAa~L~~L~~~~---~l~~~~l~~ia~~l---------GsDVPffl~~~~ 145 (258) T PRK05905 78 RDKYNIKNHFKIKIKKRIPIGSGLGSGSSNAAVLMKWILEFE---GINEINYKDVVNKL---------GSDIPFFLSGYK 145 (258) T ss_pred HHHHCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC---CCCHHHHHHHHHHH---------CCCCCEEECCCC T ss_conf 886099986699999458765677875277999999999865---99989999999861---------899887981687 Q ss_pred EEEEEECCCEEEEEEC--CCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHH Q ss_conf 5899852751567304--77752676584179899998889988608234899999998403320788987304864456 Q gi|254781017|r 159 LICYQMPKYSIEKIDF--IFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVL 236 (324) Q Consensus 159 ~i~~~~~~~~~~~~~~--~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l 236 (324) ..+.+.....+.+++. ..++++++.+...+|+++-+.++...... .+........ . .........+|++.. T Consensus 146 ~a~~~G~GE~l~~l~~~~~~~~llv~P~~~~ST~~vy~~~~~~~~~~---~~~~~~~~~~---~-~~~~~~~~~Ndle~~ 218 (258) T PRK05905 146 TAYISDYGSQVEDLIGQFKLTYKVIFMNVNVSTKKVFEKFDDNQHVI---KNNFKTIIKN---L-KENIVVNIHNDLQEP 218 (258) T ss_pred CEEEECCCCEEEECCCCCCCEEEEEECCCCCCHHHHHHHHCCCCCCC---CCHHHHHHHH---H-HHHHHHCCCCCHHHH T ss_conf 38998788778888778775299992799957599997226244456---3107888876---6-777763370173999 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCC Q ss_conf 8999998999986167877999999999971994599726899754899984665 Q gi|254781017|r 237 AQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGD 291 (324) Q Consensus 237 ~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~ 291 (324) . +. -.|++.++.+.+.. .| ..+.|||+|. |++++.+.+ T Consensus 219 ~--~~-----------~~P~i~~~~~~l~~-~~-~~~~MSGSGS--t~F~i~~~~ 256 (258) T PRK05905 219 C--FE-----------LYPNLLYKYNELLN-DG-FYTILSGAGS--SFIVIKKIN 256 (258) T ss_pred H--HH-----------HCHHHHHHHHHHHC-CC-CCEEEECCCC--CEEEEEECC T ss_conf 9--98-----------79799999999867-99-8389976371--489999636 No 40 >COG1685 Archaeal shikimate kinase [Amino acid transport and metabolism / Coenzyme metabolism] Probab=99.77 E-value=1.5e-15 Score=117.50 Aligned_cols=266 Identities=19% Similarity=0.156 Sum_probs=167.7 Q ss_pred EECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHH-HHHHHHHHHCCCCC Q ss_conf 97170499974353308942898776772899999868984999953644446566120150899-99999997187787 Q gi|254781017|r 10 VSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFS-FIIMAINHIKPSCG 88 (324) Q Consensus 10 ~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~-~i~~~l~~~~~~~g 88 (324) ++|||-..++---+.-.| =+++||+.+.++++..++..+.- .. ......++ .+.....++....+ T Consensus 5 a~A~g~~TIiNAiatG~G---~AfgidL~v~a~v~~~~~~~~~~-------~~----~~d~~li~~~~~~v~e~~g~~~~ 70 (278) T COG1685 5 ARAYGGGTIINAIATGKG---SAFGIDLKVEAEVRLSDEGKVRG-------EP----EGDTRLIERCVERVREKYGIPLG 70 (278) T ss_pred EEECCCEEEEEEHHCCCC---CEEEECCEEEEEEEECCCCCCCC-------CC----CCCHHHHHHHHHHHHHHCCCCCC T ss_conf 884276067530124766---25652445899999857653136-------77----78747999999999987298865 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-H-HHHHHHCCCEEEEEECC Q ss_conf 389998268753548874024566556777640456666014557888877652146887-5-44776629858998527 Q gi|254781017|r 89 FDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-I-DLAASIHGGLICYQMPK 166 (324) Q Consensus 89 ~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~-D~~a~~~Gg~i~~~~~~ 166 (324) +++.++|+||.++||.||||+..|++.|+..+. +.++++.++++++.++-+...-.-.| + |.+++.|||+..-++.+ T Consensus 71 ~~v~v~SeiP~~~GLkSSSA~~nAlv~A~~~~~-g~~~~~~~i~~l~a~~S~~aGvSvTGA~DDa~AS~~GG~~iTDN~~ 149 (278) T COG1685 71 VEVEVESEIPVGSGLKSSSAASNALVKAVLKAL-GEEIDDFEILRLGARASKEAGVSVTGAFDDACASYLGGIVITDNRK 149 (278) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCCCHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHCCEEEECCHH T ss_conf 599982578766674245899999999999971-8988814888877798874594474140688999758868861611 Q ss_pred CEEEEE-ECC-CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 515673-047-775267658417989999888998860823489999999840332078898730486445689999989 Q gi|254781017|r 167 YSIEKI-DFI-FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQ 244 (324) Q Consensus 167 ~~~~~~-~~~-~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~ 244 (324) ..+... +.+ ...++.-.+.++.+++. +++.++.. ....+.+.+.-.++++.. .|..|- T Consensus 150 m~Ilrr~~~~~~~vlI~~p~~k~~~~~v--dv~~~r~~---------------a~~~e~A~~lA~~G~~~~---Am~lNG 209 (278) T COG1685 150 MRILRRLDLPELTVLILAPGEKRLSANV--DVNRLRLI---------------APVVEEAFRLALKGEYFK---AMVLNG 209 (278) T ss_pred HEEHHCCCCCCCEEEEEECCCCCCCCCC--CHHHHHHH---------------HHHHHHHHHHHHCCCHHH---HHHHHH T ss_conf 1000003467740899816876525547--87898774---------------389999999985352888---888767 Q ss_pred HHH-HHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC Q ss_conf 999-8616787799999999997199459972689975489998466532689999999999779789994145456445 Q gi|254781017|r 245 GLL-ETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHSTSLY 323 (324) Q Consensus 245 ~~L-~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 323 (324) -.- ..||+..... .++. ..|+.++-+||-|+ ..++|+.++ +.+.+.|.+.|--+..-.+.+..+.| T Consensus 210 ~~y~~aLG~~~e~~----~~al-e~GA~~aglSGtGP--a~~Al~~~~------~~v~ea~~~~G~V~~t~~~~~~~~~~ 276 (278) T COG1685 210 ILYCSALGYDLEPA----LKAL-EAGAAAAGLSGTGP--AYFALTEDP------EEVAEAWSKIGDVIETRNVGERARDY 276 (278) T ss_pred HHHHHHHCCCHHHH----HHHH-HCCCCEECCCCCCC--CEEEEECCC------HHHHHHHHHCCEEEEEECCCCCCEEC T ss_conf 99898708982899----9998-63540643578897--169995493------78999998678699980489874014 No 41 >TIGR01920 Shik_kin_archae shikimate kinase; InterPro: IPR010189 Shikimate kinase (2.7.1.71 from EC) catalyses the fifth step in the shikimate pathway of aromatic amino acids biosynthesis. It converts shikimate to shikimate 3-phosphate (3-phosphoshikimate). This part of the pathway leads to the biosynthesis of chorismate, the precursor of aromatic amino acids, folates, ubiquinones, and other aromatic compounds. The shikimate pathway links metabolism of carbohydrates to biosynthesis of the aromatic amino acids phenylalanine, tyrosine, tryptophan and their derivatives in microorganisms and plants . In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The shikimate pathway is present in bacteria, archaea, fungi and plants. The absence of the shikimate pathway in animals makes it an attractive target for non-toxic herbicides, antimicrobial and antifungal agents. Shikimate kinase enzymes present an example of convergent evolution via enzyme recruitment from an unrelated group. This group represents shikimate kinases of the archaeal type. Members have no sequence similarity with the typical form of shikimate kinase (PIRSF000702 from PIRSF, IPR000623 from INTERPRO) found in bacteria and eukaryotes, but are instead distantly related to homoserine kinases, which belong to the GHMP kinase domain superfamily (GHMP = galactose, homoserine, mevalonate, and phosphomevalonate) . All known non-archaeal shikimate kinases (the typical form) belong to the non-homologous, structurally unrelated nucleoside monophosphate (NMP) kinase domain superfamily . Nomenclature note: the name AroK is used for non-homologous shikimate kinases of both the archaeal and bacterial types. ; GO: 0004765 shikimate kinase activity, 0009073 aromatic amino acid family biosynthetic process, 0005737 cytoplasm. Probab=99.76 E-value=9.9e-16 Score=118.71 Aligned_cols=242 Identities=18% Similarity=0.146 Sum_probs=160.2 Q ss_pred EEEEECCEEEEEEEECCCCE--EEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCH Q ss_conf 98776772899999868984--9999536444465661201508999999999718778738999826875354887402 Q gi|254781017|r 31 LVFAINKRVILYLTLRKDRL--INIDSSLGQYCGSLDLAMFHPSFSFIIMAINHIKPSCGFDLKVISQLDSQLGLGSSAA 108 (324) Q Consensus 31 i~~aI~~~~~~~~~~~~~~~--i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~g~~i~i~s~iP~g~GLGSSaa 108 (324) =+++||+.+.+.++..+|.. +++.... .. .. ........+.....++....++++.|+|+||.|+||.|||| T Consensus 18 ~AfgidL~v~a~V~~~~d~~~~v~~~~~~--~~-~~---~~~~~~~~~~~~~~~~g~~~~~~v~~~seiP~g~GLKSSSA 91 (273) T TIGR01920 18 GAFGIDLKVEAKVRLSDDGEAKVSLKVRG--NP-EL---DPRLIERILTALIEKFGIVEGLEVEVESEIPLGSGLKSSSA 91 (273) T ss_pred EEECCCCEEEEEEEEECCCCCCEEEEECC--CC-CC---CHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCHHHH T ss_conf 12126860799999706885431477667--65-47---97899999999898538886248999617678866437899 Q ss_pred HHHHHHHHHHHHHCC--CCCCCHHHHHHHHHHHHHHCCCCCC-H-HHHHHHCCCEEEEEECCCEEEEE-ECCCC-CCEEE Q ss_conf 456655677764045--6666014557888877652146887-5-44776629858998527515673-04777-52676 Q gi|254781017|r 109 ITVAITAALLTLQYH--KEPSPDEILTTAHAIVLKVQGISSG-I-DLAASIHGGLICYQMPKYSIEKI-DFIFP-IHLIY 182 (324) Q Consensus 109 ~~va~~~al~~~~~~--~~~~~~~i~~lA~~~E~~~~g~~sG-~-D~~a~~~Gg~i~~~~~~~~~~~~-~~~~~-l~l~~ 182 (324) +..|++.|...+. + .+.+..+++++..++=+...=.-.| + |.+||.|||+..-+|....+.+. +++.+ ..++. T Consensus 92 ~~nAlv~A~~~~~-gva~~~~~~~~~~l~a~~S~~AG~s~TGAfDDa~AS~~GG~~~TDN~~~~ilk~~~~~~~l~~~vl 170 (273) T TIGR01920 92 LVNALVLAVLKAK-GVAEEIDDIDILRLGAELSKEAGVSVTGAFDDAAASYLGGIVITDNRKMKILKRDKLEGNLTAVVL 170 (273) T ss_pred HHHHHHHHHHHHH-HHHHCCCHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHCCCCEEEECCEEEEEEECCCCCCCEEEEE T ss_conf 9999999999861-234217733689999998886685153058899998608200521430246552167898658999 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHCCCCHHHHHHH Q ss_conf 58417989999888998860823489999999840332078898730486445689999989999-86167877999999 Q gi|254781017|r 183 SGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLL-ETLGVSDSKLSEIV 261 (324) Q Consensus 183 tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L-~~lgvs~~~l~~li 261 (324) ......+++. ++++++ .+..+.+.+.+.=.+++|-+ .|..|=-.- ..||+..+.. T Consensus 171 ~P~~~~~~~~--~~~~~~---------------~~~~~~e~a~~~A~~G~y~~---Am~lNG~~y~~~Lgyp~e~~---- 226 (273) T TIGR01920 171 VPKEERRENV--DLNRLR---------------KISPVVEEAFKLALKGEYLK---AMVLNGVAYATALGYPLEPA---- 226 (273) T ss_pred ECCCCCCCCC--CHHHHH---------------HHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHCCCHHHH---- T ss_conf 7687667676--888888---------------66689999999964688078---99999999998818986489---- Q ss_pred HHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCE Q ss_conf 9999719945997268997548999846653268999999999977978 Q gi|254781017|r 262 WKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGID 310 (324) Q Consensus 262 ~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~ 310 (324) .++. +.||..+-+||-|+ .++++++++.+ +.+.+.|.+.||- T Consensus 227 ~~al-~~GA~~aG~SG~GP--sy~A~~~~pe~----~~~~~~~~~~G~G 268 (273) T TIGR01920 227 SKAL-EKGAAVAGLSGKGP--SYFALTEEPEE----EEVAEALEEFGFG 268 (273) T ss_pred HHHH-HCCCEEEEECCCCC--EEEEEECCCCH----HHHHHHHHHCCCE T ss_conf 9997-38943886305755--68997306864----8999999746970 No 42 >PRK04181 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.76 E-value=8.8e-17 Score=125.37 Aligned_cols=236 Identities=13% Similarity=0.113 Sum_probs=146.3 Q ss_pred EECCEEEEEE----CCCEEECCCEEEEEEE-CCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHHH- Q ss_conf 9717049997----4353308942898776-772899999868984999953644446566120150899999999971- Q gi|254781017|r 10 VSAPGSLVLM----GEHGVLHGHAALVFAI-NKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINHI- 83 (324) Q Consensus 10 ~~aPGKv~L~----GEh~v~~G~~ai~~aI-~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~- 83 (324) .|||+||||+ |--.+||.-..+-++| |.+..+++.+.+...+.+.... ....+ .+.+.-....++.. T Consensus 1 lKs~AKINL~L~V~~kRd~yH~l~s~~~~i~dl~D~i~i~~~~~~~~~~~~~~---~~~~~----~Nlv~ka~~~l~~~~ 73 (254) T PRK04181 1 MKSYAKVNIFLKITGKRGNYHELISRFVLVKDLFDEIEFVDDSAKSFEIIGNF---DCPLE----ENIIFKAYQELKSKG 73 (254) T ss_pred CCCCCEECCCEEECCCCCCCCEEEEEEEEHHHCCEEEEEEECCCCCEEEECCC---CCCCC----CCHHHHHHHHHHHHC T ss_conf 99765376743577534998732547778200857999998899958997888---88854----159999999998753 Q ss_pred ------CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf ------87787389998268753548874024566556777640456666014557888877652146887544776629 Q gi|254781017|r 84 ------KPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHG 157 (324) Q Consensus 84 ------~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~G 157 (324) ....+++|.++.+||+++|||+.||-+++++++++++ ++.+++.+++.++|..+ |.|+..+.+| T Consensus 74 ~~~~~~~~~~~~~I~l~KnIP~gAGLGGGSSdAAa~L~~l~~~-~~l~ls~~~L~~ia~~l---------GaDVPffl~g 143 (254) T PRK04181 74 FSNELIEFFSKLAIEVTKNIPTGAGLGGGSSNAATFLLMANEI-LNLKLSLEELINIGSKI---------GADVAFFISG 143 (254) T ss_pred CCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHC---------CCCCCEEECC T ss_conf 3234311366417999525766567887466899999999998-28999999999987531---------8982449757 Q ss_pred -CEEEEEECCCEEEEEEC-CCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHH---HCCCC Q ss_conf -85899852751567304-77752676584179899998889988608234899999998403320788987---30486 Q gi|254781017|r 158 -GLICYQMPKYSIEKIDF-IFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQA---LRNKN 232 (324) Q Consensus 158 -g~i~~~~~~~~~~~~~~-~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~a---l~~~d 232 (324) +..+.+.....+.+++. +..+++++.+...+|+++-+.++......... ...+.+.. . -..+.... ...+| T Consensus 144 ~~~a~~~G~Ge~l~~~~~~~~~~~lv~P~~~vST~~vy~~~~~~~~~~~~~-~~~~~~~~-~--~~~~~l~~~~~~~~Nd 219 (254) T PRK04181 144 YKSANVSGIGEIIEEFEEEILNLEIYTPNIFCSTKAVYQAFRPEFFDFINF-NQAKEWLK-L--SSKELLKNFKNYELND 219 (254) T ss_pred CCEEEEECCCCEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCH-HHHHHHHH-C--CCHHHHHHHHCCCCCC T ss_conf 863788757738358777786599994989986799999875540465552-57899885-7--7389999751444567 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEEC Q ss_conf 445689999989999861678779999999999719945997268997548999846 Q gi|254781017|r 233 LKVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGK 289 (324) Q Consensus 233 ~~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~ 289 (324) ++... + + -.|++.++.. . ++.|||+|. |+++|.+ T Consensus 220 Le~~a---------~-~---~~P~i~~il~----~----~~~mSGSGs--t~Fal~r 253 (254) T PRK04181 220 LLKPA---------L-L---LYPALKDYLG----K----DWFFSGSGS--SFFRVKR 253 (254) T ss_pred CHHHH---------H-H---HHHHHHHHHC----C----CCEEECCCH--HHEEEEE T ss_conf 39999---------9-9---6799999872----1----909973072--1458973 No 43 >COG3407 MVD1 Mevalonate pyrophosphate decarboxylase [Lipid metabolism] Probab=99.55 E-value=9.1e-12 Score=93.64 Aligned_cols=268 Identities=14% Similarity=0.090 Sum_probs=156.6 Q ss_pred EEEEECCEEEEEE--CCCEEEC----CCEEEEEEE-CCEEEEEEEEC---C-CCEEEEEECCCCCCCCCCHHHCCCHHHH Q ss_conf 8999717049997--4353308----942898776-77289999986---8-9849999536444465661201508999 Q gi|254781017|r 7 KICVSAPGSLVLM--GEHGVLH----GHAALVFAI-NKRVILYLTLR---K-DRLINIDSSLGQYCGSLDLAMFHPSFSF 75 (324) Q Consensus 7 ki~~~aPGKv~L~--GEh~v~~----G~~ai~~aI-~~~~~~~~~~~---~-~~~i~i~~~~~~~~~~~~~~~~~~~~~~ 75 (324) +++++||--+-|+ +.+.|.. -.+.|+++. |.++..++... . .+.+.++. .... ..+.-.+. T Consensus 5 ~~t~~ay~nialIKywGk~D~~l~iP~~~SiS~t~~df~t~t~~~~~~~~~~~D~~~lNG-------~~~~-~~~~k~~~ 76 (329) T COG3407 5 IITASAYPNIALIKYWGKRDEALNIPLNSSISVTLEDFSTTTTAEELTENDEEDTFILNG-------ELSE-DENEKARR 76 (329) T ss_pred CEEEEECCHHHHHHHCCCCCHHHCCCCCCEEEEEECCCCCEEEEEEECCCCCCCEEEECC-------CCCC-HHHHHHHH T ss_conf 513673523378776142316424778880478841444336778753788762799988-------2271-57899999 Q ss_pred HHHHHH-HHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 999999-7187787389998268753548874024566556777640456666014557888877652146887544776 Q gi|254781017|r 76 IIMAIN-HIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAAS 154 (324) Q Consensus 76 i~~~l~-~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~ 154 (324) ++..++ +......++|...++.|.+.||+||||..+|+.+|+..+ .+..++..++.++|... .+|+ .-+ T Consensus 77 ~ld~~R~~~~~~~~~~i~s~n~~ptaaGLaSSaag~AAl~~Al~~~-~~~~~d~~~lS~~AR~g------SGSa---~RS 146 (329) T COG3407 77 VLDRFRKEYGISFKVKIVSYNNFPTAAGLASSAAGAAALAAALNRL-YDLDLDDEFLSRIARLG------SGSA---SRS 146 (329) T ss_pred HHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHH------CCCH---HHH T ss_conf 9999987635466079997457876345210188999999999765-24478879999998775------2411---454 Q ss_pred HCCCEEEEEECCC------EEEEEECCCC---CCEE-ECCCCC-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHH Q ss_conf 6298589985275------1567304777---5267-658417-989999888998860823489999999840332078 Q gi|254781017|r 155 IHGGLICYQMPKY------SIEKIDFIFP---IHLI-YSGYKT-PTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQI 223 (324) Q Consensus 155 ~~Gg~i~~~~~~~------~~~~~~~~~~---l~l~-~tg~~~-~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~ 223 (324) +|||++.+...+. ++.+.++..+ ++++ ....+. +|.+..+. ...+.+...++.+.... -.++ T Consensus 147 ~~Gg~~~W~~~~g~~~~~~~~~~~~~~~e~~~i~~~~~~~~k~vsS~~gm~~---~~~tS~~y~~w~~~~~~----~~~~ 219 (329) T COG3407 147 IFGGFVLWEKGEGEDSAAEQLFRLDLWKELAMIVLVISPKKKKVSSREGMQL---TAETSPFYDAWLEHSEE----DLEE 219 (329) T ss_pred HCCCEEEECCCCCCCCCEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHH---HHHCCHHHHHHHHHHHH----HHHH T ss_conf 2288667306888752101202346762314279997455677873687777---77707278999998887----5999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHH--CCC------CHHHH---HHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCC Q ss_conf 898730486445689999989999861--678------77999---9999999719945997268997548999846653 Q gi|254781017|r 224 SCQALRNKNLKVLAQAMNRQQGLLETL--GVS------DSKLS---EIVWKLREQPHIMASKISGSGLGDCVIALGKGDL 292 (324) Q Consensus 224 ~~~al~~~d~~~l~~lm~~~~~~L~~l--gvs------~~~l~---~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~ 292 (324) .+++++++|++.++++..++-..+.+. +.. +++.. +.|+.+++.++ ...-..-||+ .+..++..+. T Consensus 220 m~~~~~~~Df~~i~~~~e~dsl~mHA~l~~s~p~~~y~~~~s~~ii~~v~~~r~~g~-~~~fT~DaGP--nV~v~~~~~~ 296 (329) T COG3407 220 MKEAIREKDFEKIGELAENDSLEMHATLMSSGPPFFYLTDESLRIIEFVHELRKEGN-AVYFTMDAGP--NVKVITLEEN 296 (329) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHCCC-CEEEEECCCC--CEEEEEECCC T ss_conf 999986147999999998659999999845699639988318999999999873388-4699972799--6599972154 Q ss_pred CHHHHHHHHH Q ss_conf 2689999999 Q gi|254781017|r 293 NSLPYQSVNC 302 (324) Q Consensus 293 ~~~~~~~~~~ 302 (324) .....+.+.. T Consensus 297 l~~~~~~~~~ 306 (329) T COG3407 297 LIDLLEILKT 306 (329) T ss_pred HHHHHHHHHH T ss_conf 8999998731 No 44 >COG1907 Predicted archaeal sugar kinases [General function prediction only] Probab=99.49 E-value=8.7e-11 Score=87.42 Aligned_cols=271 Identities=20% Similarity=0.222 Sum_probs=175.1 Q ss_pred EEEECCEEEEEE-----CCCEEECCCEEEEEEECC-EEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHH Q ss_conf 999717049997-----435330894289877677-28999998689849999536444465661201508999999999 Q gi|254781017|r 8 ICVSAPGSLVLM-----GEHGVLHGHAALVFAINK-RVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAIN 81 (324) Q Consensus 8 i~~~aPGKv~L~-----GEh~v~~G~~ai~~aI~~-~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~ 81 (324) |.+++|-|+.+. |+.-=++|+ +.+|++- +..+++++++ .+.++- .+.. ..+...++ T Consensus 2 v~v~tpSrlH~gliDl~G~~GRv~Gg--vG~aLe~P~l~i~~~~s~--~~~~~g----------e~~~----~~~~~~a~ 63 (312) T COG1907 2 VRVRTPSRLHAGLIDLNGSIGRVDGG--VGLALEEPRLEIEAKPSD--DIEVDG----------EDRR----ERVEKAAR 63 (312) T ss_pred EEEECCCEEEECCCCCCCCCCCEECC--EEEEEECCCEEEEEECCC--CCCCCC----------HHHH----HHHHHHHH T ss_conf 58704750110044578753213044--048850781599972055--411255----------0668----99999999 Q ss_pred -HHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEE Q ss_conf -7187787389998268753548874024566556777640456666014557888877652146887544776629858 Q gi|254781017|r 82 -HIKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLI 160 (324) Q Consensus 82 -~~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i 160 (324) ......|++|+|.+++|.-.||||-..++.|+..+++++ +++.++.++++...-. |..||+-.++.-+||+| T Consensus 64 ~~le~~~gv~I~I~~~~P~HvGLGS~TQlaLa~a~ai~~i-~gl~~~~~elA~~vgR------G~tSgiGv~afe~GGFI 136 (312) T COG1907 64 LVLEVGEGVKIEIRSDIPAHVGLGSTTQLALAVASAILEI-YGLELSIRELAFAVGR------GGTSGIGVYAFEYGGFI 136 (312) T ss_pred HHHCCCCCEEEEEEECCCHHCCCCHHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHCC------CCCCCEEEEEEEECCEE T ss_conf 7514167529999752732207773779999999999998-6589998999999736------77564137998878899 Q ss_pred EE-----EEC--CC--EEEEEECC--CCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHC Q ss_conf 99-----852--75--15673047--775267658417989999888998860823489999999840332078898730 Q gi|254781017|r 161 CY-----QMP--KY--SIEKIDFI--FPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALR 229 (324) Q Consensus 161 ~~-----~~~--~~--~~~~~~~~--~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~ 229 (324) .= .+. .. .+...+++ |.++++....++.-.+ .+.+.-+++-.|--.+..+.+-.. +.....-|+. T Consensus 137 VDGGh~~~f~ps~~sP~I~R~dfPedW~~VlaIP~~~rg~~~-~~E~~if~~~~p~p~~~~~~ls~~---vLm~mmPavv 212 (312) T COG1907 137 VDGGHSFGFLPSSASPLIFRLDFPEDWRFVLAIPEVERGVSG-RREVDIFKKYCPVPLEEVGELSHR---VLMKMMPAVV 212 (312) T ss_pred EECCCCCCCCCCCCCCEEEEECCCCCEEEEEEECCCCCCCCC-HHHHHHHHHCCCCCHHHHHHHHHH---HHHHHHHHHH T ss_conf 878744676668999605532288865999994478843210-278889975689997999899999---9998758988 Q ss_pred CCCHHHHHHHHHHHHHHH----HH--HCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHH Q ss_conf 486445689999989999----86--167877999999999971994599726899754899984665326899999999 Q gi|254781017|r 230 NKNLKVLAQAMNRQQGLL----ET--LGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCH 303 (324) Q Consensus 230 ~~d~~~l~~lm~~~~~~L----~~--lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~ 303 (324) ++|++.||+.+++-|.++ ++ -++-++.+.++++.+.+. ++++-+|-=|+ ++.++++......+..++... T Consensus 213 E~Die~fg~~l~~iQ~l~g~~f~~~e~~~~~~~V~~iv~~m~~~--a~~agqSSwGP--tvY~i~d~~~~~~~~~~~~~~ 288 (312) T COG1907 213 ERDIESFGEALNEIQELGGKWFKKVEGGLQREDVKEIVDEMVEA--AYGAGQSSWGP--TVYGIVDSREAGSVVRKLIDI 288 (312) T ss_pred HHCHHHHHHHHHHHHHHHHHHHHHHHCEECCHHHHHHHHHHHHH--CCCCCCCCCCC--EEEEECCCCCCCHHHHHHHHH T ss_conf 63799999999999988766465641310228999999999872--30443256688--789861653212479999999 Q ss_pred HHHCCCEE Q ss_conf 99779789 Q gi|254781017|r 304 MHAKGIDI 311 (324) Q Consensus 304 ~~~~g~~~ 311 (324) +..+|... T Consensus 289 ~~~~g~~g 296 (312) T COG1907 289 LLEEGIGG 296 (312) T ss_pred HHHCCCCE T ss_conf 87438861 No 45 >COG1829 Predicted archaeal kinase (sugar kinase superfamily) [General function prediction only] Probab=99.38 E-value=2.1e-09 Score=78.74 Aligned_cols=239 Identities=18% Similarity=0.220 Sum_probs=147.6 Q ss_pred EEEECCEEEEEE-----CCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHHHHH Q ss_conf 999717049997-----435330894289877677289999986898499995364444656612015089999999997 Q gi|254781017|r 8 ICVSAPGSLVLM-----GEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMAINH 82 (324) Q Consensus 8 i~~~aPGKv~L~-----GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~l~~ 82 (324) +++.+|+-+.=| +.--...|..-..++++.++.+.++..+...++++. . .. .- ......+.. T Consensus 3 ~~~fvP~hITgfF~pv~~~~p~~SGSiGaGv~l~~gv~v~v~~~~~~~v~~Ng--~----~~----d~---~~~~~v~e~ 69 (283) T COG1829 3 VRLFVPGHITGFFVPVIGKDPLKSGSIGAGVALERGVTVEVRFGEGTGVRLNG--K----KI----DL---PITRKVIEK 69 (283) T ss_pred CEEECCCEEEEEEEECCCCCCCCCCCCCEEEEECCCEEEEEEECCCCEEEECC--E----EC----CC---HHHHHHHHH T ss_conf 14852642689888524899766788642379638626999965873588899--6----31----30---269999998 Q ss_pred HCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCC-HHHHHHHCCCEEE Q ss_conf 187787389998268753548874024566556777640456666014557888877652146887-5447766298589 Q gi|254781017|r 83 IKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSG-IDLAASIHGGLIC 161 (324) Q Consensus 83 ~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG-~D~~a~~~Gg~i~ 161 (324) ..+. ++.+.+.+++|+|+|+|-|+|...+.+.+++.. ..++.++.++.||.+|... ..| .|+.|..+||++. T Consensus 70 L~~~-~~~v~~~~~~P~G~G~G~Sga~AL~~Ala~a~~---~~~~~~~a~~~AH~aEV~~---gtGLGDVvAq~~GGlVi 142 (283) T COG1829 70 LGPD-GVGVRIESPVPLGCGYGVSGAGALGTALALAEE---LGLGEESAARIAHVAEVEN---GTGLGDVVAQYTGGLVI 142 (283) T ss_pred HCCC-CCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHH---CCCCHHHHHHHHHHHHHHC---CCCCHHHHHHHCCCEEE T ss_conf 4766-862699851798865414479999999998866---4998789999888999871---78704788875085799 Q ss_pred EEECCC----EEEEEECCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHH-HHHHCCCCHHHH Q ss_conf 985275----156730477752676584179899998889988608234899999998403320788-987304864456 Q gi|254781017|r 162 YQMPKY----SIEKIDFIFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQIS-CQALRNKNLKVL 236 (324) Q Consensus 162 ~~~~~~----~~~~~~~~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~al~~~d~~~l 236 (324) ...+.. .+..++.+.--++..+--+-+|++++..-. .+++. ....++ .+.+++.+++. T Consensus 143 R~~pG~Pg~~~vd~Ip~~~~~V~~~~~g~l~T~~vi~~~~------------~~~i~----~~g~~~l~ellk~Ptle~- 205 (283) T COG1829 143 RVKPGGPGEGEVDRIPVPGLRVITISLGELSTKSVITDEV------------VRKIN----EAGKRALAELLKNPTLEN- 205 (283) T ss_pred EECCCCCCEEEEEEEECCCCEEEEEECCCCCHHHHHHHHH------------HHHHH----HHHHHHHHHHHHCCCHHH- T ss_conf 8318999857898860488269999736666777630678------------99999----999999999883999999- Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEE Q ss_conf 8999998999986167877999999999971994599726899754899984 Q gi|254781017|r 237 AQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALG 288 (324) Q Consensus 237 ~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~ 288 (324) .|+++++.-+..|+++.++.+.++.+.. .+.+++-+ +=.|-.++|+. T Consensus 206 --f~~~a~~FA~~~Gl~~~~v~~~~~~~~~-~g~l~as~--~mlg~tVFa~~ 252 (283) T COG1829 206 --FMEEAREFARRTGLMSEEVKEAAEALHS-VGGLGASM--AMLGVTVFALS 252 (283) T ss_pred --HHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCHHHHH--HHHCCEEEEEC T ss_conf --9999999998728678899999987543-47445767--86386899962 No 46 >KOG2833 consensus Probab=99.35 E-value=3.2e-09 Score=77.51 Aligned_cols=278 Identities=16% Similarity=0.236 Sum_probs=147.5 Q ss_pred CCCCCCEEEEECCEEEEEECC---C---EEECCCEEEEEEECC---E--EEEEEEECC-CCEEEEEECCCCCCCCCCHHH Q ss_conf 998887899971704999743---5---330894289877677---2--899999868-984999953644446566120 Q gi|254781017|r 1 MGQCLHKICVSAPGSLVLMGE---H---GVLHGHAALVFAINK---R--VILYLTLRK-DRLINIDSSLGQYCGSLDLAM 68 (324) Q Consensus 1 ~~~~~~ki~~~aPGKv~L~GE---h---~v~~G~~ai~~aI~~---~--~~~~~~~~~-~~~i~i~~~~~~~~~~~~~~~ 68 (324) |--++-.+++++|-.|-++-. - ..+--...|+++++. . +++.+.+.. .+.+++..... .+.... T Consensus 1 ~~k~~~~~t~taPvNIAvIKYWGKRD~~l~LP~N~SISvTL~~D~L~t~Tt~~~s~~fe~dr~wLNGk~~----~i~~~R 76 (395) T KOG2833 1 MDKPVAEVTATAPVNIAVIKYWGKRDEELNLPTNDSISVTLSQDDLCTLTTAVVSPSFEEDRMWLNGKEV----PISNKR 76 (395) T ss_pred CCCCEEEEEECCCCEEEEEEECCCCCHHHCCCCCCEEEEEECCCCCEEEEEEEECCCCCCCEEEECCCCC----CCCCHH T ss_conf 9974036763166236665412444132137777706888535531445788842664454178878221----366378 Q ss_pred CCCHHHHHHHHHHHHC------C---CCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 1508999999999718------7---787389998268753548874024566556777640456666014557888877 Q gi|254781017|r 69 FHPSFSFIIMAINHIK------P---SCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIV 139 (324) Q Consensus 69 ~~~~~~~i~~~l~~~~------~---~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E 139 (324) ...-++.++..-.... + ...++|.-.+|+|..+||.||||-.+|++.||..+ ++++.+++++.++|.+ T Consensus 77 ~q~cl~e~r~~~~d~~~~~~~~~~~~~~~lHI~S~nNFPtAAGLASSAAG~Aalv~alarl-y~l~~~~~els~iAR~-- 153 (395) T KOG2833 77 YQRCLREIRRLARDREESEASLPSNGPLKLHIASVNNFPTAAGLASSAAGFAALVLALARL-YGLDDSPEELSRIARQ-- 153 (395) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHC-- T ss_conf 9999999999865323311256767871178874378830320011024089999999998-5899998999999861-- Q ss_pred HHHCCCCCCHHHHHHHCCCEEEEEECCC-------EEEEEEC-CCC----CCEEECCCC---CCHHHHHHHHHHHHHHCC Q ss_conf 6521468875447766298589985275-------1567304-777----526765841---798999988899886082 Q gi|254781017|r 140 LKVQGISSGIDLAASIHGGLICYQMPKY-------SIEKIDF-IFP----IHLIYSGYK---TPTAQVLKKISYIEIEYP 204 (324) Q Consensus 140 ~~~~g~~sG~D~~a~~~Gg~i~~~~~~~-------~~~~~~~-~~~----l~l~~tg~~---~~T~~~v~~v~~~~~~~~ 204 (324) |.+| ..-+.|||++.|+.++. .+...+. .++ +.++.++.. .+|.-|.+.|..-.- T Consensus 154 ----GSGS---ACRSl~GG~V~W~mG~~~DGsDsvAvq~~p~~~W~el~ililVvs~~~K~t~ST~GM~~sveTS~L--- 223 (395) T KOG2833 154 ----GSGS---ACRSLYGGFVAWEMGELDDGSDSVAVQIAPSSHWPELRILILVVSDAKKKTGSTEGMRRSVETSQL--- 223 (395) T ss_pred ----CCCH---HHHHHCCCEEEEECCCCCCCCCEEEEEECCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHH--- T ss_conf ----4860---455430453677315645787503787035347873189999960553344661888988887699--- Q ss_pred CHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHH-HHHH---HH-HHHHH------CCCCHHHHHHHHHHHHCCCC-EE Q ss_conf 34899999998403320788987304864456899-9998---99-99861------67877999999999971994-59 Q gi|254781017|r 205 EINEINQKIYALMGKLSQISCQALRNKNLKVLAQA-MNRQ---QG-LLETL------GVSDSKLSEIVWKLREQPHI-MA 272 (324) Q Consensus 205 ~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~l-m~~~---~~-~L~~l------gvs~~~l~~li~~~~~~~g~-~~ 272 (324) +....+. .+-+...+..+|+.+.|++.|.++ |..+ |. +|+.. .=.+-.+-.+|++.....|. .. T Consensus 224 -~qhRi~~---vVP~Ri~~m~eaI~~rDF~~FA~lTm~DSNqFHAvclDT~PPI~YmNd~S~~iI~~vh~~N~~~G~t~v 299 (395) T KOG2833 224 -LQHRIES---VVPQRIQQMREAIRERDFESFAKLTMKDSNQFHAVCLDTFPPIFYLNDTSWRIISLVHEFNASAGGTRV 299 (395) T ss_pred -HHHHHHH---HHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHCCCCEEE T ss_conf -9999986---647999999999986577887777761312456442315897478536407999999998751697168 Q ss_pred EECCCCCCCCEEEEEECCCCCHHHHHHHH Q ss_conf 97268997548999846653268999999 Q gi|254781017|r 273 SKISGSGLGDCVIALGKGDLNSLPYQSVN 301 (324) Q Consensus 273 aklsGAG~GG~~i~l~~~~~~~~~~~~~~ 301 (324) |--=-||+.-|+++|-.+ .++.++.+- T Consensus 300 AYTFDAGPNAvl~~l~e~--~~~~l~~~~ 326 (395) T KOG2833 300 AYTFDAGPNAVLIVLEEN--VSQLLAAVL 326 (395) T ss_pred EEEECCCCCEEEEEHHHH--HHHHHHHHH T ss_conf 988338986289983640--999999999 No 47 >pfam00288 GHMP_kinases_N GHMP kinases N terminal domain. This family includes homoserine kinases, galactokinases and mevalonate kinases. Probab=99.34 E-value=1.6e-12 Score=98.45 Aligned_cols=67 Identities=39% Similarity=0.433 Sum_probs=61.1 Q ss_pred EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCC Q ss_conf 899982687535488740245665567776404566660145578888776521468875447766298 Q gi|254781017|r 90 DLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGG 158 (324) Q Consensus 90 ~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg 158 (324) +|.++++||.++|||||||+++|++++++++. +.++++++++++|+.+|+ .+|.++|.|++++.||| T Consensus 1 ~i~i~s~iP~g~GLgSSaa~~~a~~~al~~~~-~~~l~~~~l~~~a~~~E~-~~g~~~G~d~~~~~~gg 67 (67) T pfam00288 1 DIEIESNIPLGAGLGSSAALAVALLLALNELF-GLPLSKEELARLALEAEG-AIGVNSGDDVAASVYGG 67 (67) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHH-HHCCCCHHHHHHHHCCC T ss_conf 92575689986786556999999999999995-899899999999999999-81898811677642277 No 48 >COG4542 PduX Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.27 E-value=1.5e-09 Score=79.61 Aligned_cols=246 Identities=21% Similarity=0.213 Sum_probs=147.5 Q ss_pred CCCCEEEEECCEEEEEECCC--EEECCC-EEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHHH Q ss_conf 88878999717049997435--330894-289877677289999986898499995364444656612015089999999 Q gi|254781017|r 3 QCLHKICVSAPGSLVLMGEH--GVLHGH-AALVFAINKRVILYLTLRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIMA 79 (324) Q Consensus 3 ~~~~ki~~~aPGKv~L~GEh--~v~~G~-~ai~~aI~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~~ 79 (324) +.-+++.+.+|+-+ ||- -...|. .-+.++|+.|..+++..+++.. .-...+..++.... T Consensus 9 ~~~~~~~A~cpAs~---GELlQG~~~g~~~Lvs~PIa~~s~v~~~~~s~~~---------------s~~~t~a~~~~e~~ 70 (293) T COG4542 9 PRVMVISAKCPASC---GELLQGSFNGSEKLVSFPIALFSEVTLFDRSGKD---------------SVNVTPAHKFKENI 70 (293) T ss_pred CCCCCEEEECCCCH---HHHHEEEECCCCEEEECCHHHHHHEEEECCCCCC---------------CCCCCHHHHHHHHH T ss_conf 88751114535024---2252336638752898003541004550147876---------------55556668999999 Q ss_pred HHHHC----CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 99718----77873899982687535488740245665567776404566660145578888776521468875447766 Q gi|254781017|r 80 INHIK----PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASI 155 (324) Q Consensus 80 l~~~~----~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~ 155 (324) +.++. ...|+++.+.|+||+|.||+||.|-.||+++|...+ ++..++..+|++++..+| |+ |. .+ T Consensus 71 la~~~~~~~~~~~i~l~lqSsIPvgKG~ASSTADl~At~~A~A~~-l~~~l~es~iakLcv~iE------Pt--Ds--ii 139 (293) T COG4542 71 LARWGVTKLINTGIDLLLQSSIPVGKGMASSTADLVATARATARF-LGRELRESEIAKLCVSIE------PT--DS--II 139 (293) T ss_pred HHHHCCCCEECCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHCC------CC--CC--EE T ss_conf 997285101058736998636655556563478899999999999-678778889999986258------76--54--00 Q ss_pred CCCEEEEEECCCEE-EEEECCCCCCE--EECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCC Q ss_conf 29858998527515-67304777526--7658417989999888998860823489999999840332078898730486 Q gi|254781017|r 156 HGGLICYQMPKYSI-EKIDFIFPIHL--IYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMGKLSQISCQALRNKN 232 (324) Q Consensus 156 ~Gg~i~~~~~~~~~-~~~~~~~~l~l--~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d 232 (324) |-+.+.|+..+..+ +.....+++.+ +..+....|..- +.|...+..-+-...++++.+.+.+|+..+| T Consensus 140 F~~~tlFd~r~g~~~~~~g~~PpL~ilv~e~~~~v~T~~y---------~q~~r~e~~~~~~~e~~~l~~~v~~A~~~~~ 210 (293) T COG4542 140 FDKATLFDQREGRVIEFLGEMPPLHILVFEGKGTVETVDY---------NQPPRGEKLLAPLAELGNLINLVEKALKVGD 210 (293) T ss_pred CCCCEEEHHCCCHHHHHCCCCCCEEEEEECCCCCEEEEEC---------CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 0561000220430888637899627999748982333314---------6785465655449999999999999872389 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCC Q ss_conf 44568999998999986167877999999999971994599726899754899984665 Q gi|254781017|r 233 LKVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGD 291 (324) Q Consensus 233 ~~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~ 291 (324) ...+|+.-..+ ..+..--+.-|.++++.+ +.+..+++|.-..-+| .++-|.=.. T Consensus 211 ~~~lG~AAT~S-Av~~Q~~LPK~~~~~lL~-l~e~~~~~Gv~VAHSG---tmlGli~D~ 264 (293) T COG4542 211 PKLLGEAATLS-AVKNQDRLPKPGLNELLR-LVEETCAIGVIVAHSG---TMLGLIYDR 264 (293) T ss_pred HHHHHHHHHHH-HHHHCCCCCCHHHHHHHH-HHHHHCCCCEEEECCC---CEEEEEECC T ss_conf 88987888899-886320168323899999-9987365445884167---467765403 No 49 >TIGR01240 mevDPdecarb diphosphomevalonate decarboxylase; InterPro: IPR005935 Synonyms: mevalonate diphosphate decarboxylase; pyrophosphomevalonate decarboxylase. This group of enzymes belongs to the GHMP kinase domain superfamily. GHMP kinases are a unique class of ATP-dependent enzymes (the abbreviation of which refers to the original members: galactokinase, homoserine kinase, mevalonate kinase, and phosphomevalonate kinase) . Enzymes belonging to this superfamily contain three well-conserved motifs, the second of which has the typical sequence Pro-X-X-X-Gly-Leu-X-Ser-Ser-Ala and is involved in ATP binding . The phosphate binding loop in GHMP kinases is distinct from the classical P-loops found in many ATP/GTP binding proteins. The bound ADP molecule adopts a rare syn conformation and is in the opposite orientation from those bound to the P-loop-containing proteins . GHMP kinases display a distinctly bilobal appearance with their N-terminal subdomains dominated by a mixed beta-sheet flanked on one side by alpha-helices and their C-terminal subdomains containing a four stranded anti-parallel beta-sheet , , , . Diphosphomevalonate decarboxylase (mevalonate pyrophosphate decarboxylase, (4.1.1.33 from EC) catalyzes the decarboxylation of mevalonate pyrophosphate to isopentyl pyrophosphate (IPP) , the last step in the synthesis of IPP in the mevalonate pathway. Mevalonate is a key intermediate in the biosynthesis of sterols and non-sterol isoprenes in the mevalonate pathway. In mammals, the majority of mevalonate is converted into cholesterol.ATP + (R)-5-diphosphomevalonate = ADP + phosphate + isopentenyl diphosphate + CO2 ; GO: 0004163 diphosphomevalonate decarboxylase activity, 0008299 isoprenoid biosynthetic process. Probab=99.24 E-value=2.9e-09 Score=77.83 Aligned_cols=269 Identities=16% Similarity=0.139 Sum_probs=149.6 Q ss_pred EEECCEEEEEE---C---CCEEECCCEEEEEEECC---EEEEEEE-ECCCCEEEEEECCCCCCCCCCHHHCCCHHHHHHH Q ss_conf 99717049997---4---35330894289877677---2899999-8689849999536444465661201508999999 Q gi|254781017|r 9 CVSAPGSLVLM---G---EHGVLHGHAALVFAINK---RVILYLT-LRKDRLINIDSSLGQYCGSLDLAMFHPSFSFIIM 78 (324) Q Consensus 9 ~~~aPGKv~L~---G---Eh~v~~G~~ai~~aI~~---~~~~~~~-~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~i~~ 78 (324) .|+||..|-++ | +--.+=-.-.|+++++. ||...+. ..++-...- +...+.+..+..+.-+.-++. T Consensus 2 ~vtay~NIA~iKYWGK~n~~l~lP~nsSiSlTL~~dd~~T~Tsv~f~~d~~~~D~----fylnG~lq~~~~~~k~s~~ld 77 (325) T TIGR01240 2 KVTAYVNIALIKYWGKKNEKLILPTNSSISLTLEQDDLYTLTSVAFLADEFERDE----FYLNGVLQDEKDNEKVSNILD 77 (325) T ss_pred EEECCHHHEEHCCCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEEECCCCCCCCE----EEECCCCCCHHHHHHHHHHHH T ss_conf 2214102000002576242114874564257634764302123356288631012----477573787034558999999 Q ss_pred HHHHHC-------------CCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 999718-------------7787389998268753548874024566556777640456666014557888877652146 Q gi|254781017|r 79 AINHIK-------------PSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGI 145 (324) Q Consensus 79 ~l~~~~-------------~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~ 145 (324) .++.-. ..+-++|....++|..+||.||||=..|++.|+..+ +.++++..+|.+.|... . T Consensus 78 ~fRq~~~~~~skdaslPtlsqWk~~~~s~Nn~PTAAGLaSSasGlaALv~A~~~l-~~l~ld~~~lsr~Ar~g------S 150 (325) T TIGR01240 78 DFRQEAKESESKDASLPTLSQWKVKVVSQNNVPTAAGLASSASGLAALVSACDKL-YKLQLDDKELSRIARKG------S 150 (325) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHH-HHCCCCHHHHHHHHHHC------C T ss_conf 8622211222221256410000467885078621444666899999999999999-70798878999886421------6 Q ss_pred CCCHHHHHHHCCCEEEEEECCC------EEEEEECC---CC---CCEEECCCC--CCHHH-HHHHHHHHHHHCCCHHHHH Q ss_conf 8875447766298589985275------15673047---77---526765841--79899-9988899886082348999 Q gi|254781017|r 146 SSGIDLAASIHGGLICYQMPKY------SIEKIDFI---FP---IHLIYSGYK--TPTAQ-VLKKISYIEIEYPEINEIN 210 (324) Q Consensus 146 ~sG~D~~a~~~Gg~i~~~~~~~------~~~~~~~~---~~---l~l~~tg~~--~~T~~-~v~~v~~~~~~~~~~~~~~ 210 (324) +| ..-+.|||++-++.+.. .+.+++-. .. ++|+....+ .++++ |...|. .- +++ T Consensus 151 GS---acRS~FGg~vaWekG~~edd~ss~a~~v~~dS~~~~ma~~~lv~nd~KK~vSSr~GM~ltv~-TS-------~~f 219 (325) T TIGR01240 151 GS---ACRSLFGGYVAWEKGKDEDDLSSAAVKVEDDSDEDKMAMVVLVVNDIKKDVSSREGMKLTVE-TS-------ELF 219 (325) T ss_pred CC---CHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHH-HH-------HHH T ss_conf 73---00320588000027767678874036567778500110777763136887453035688898-77-------764 Q ss_pred HHHHHHH-CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHH--HC------CCCHH---HHHHHHHHHH-CCCCEEEECCC Q ss_conf 9999840-33207889873048644568999998999986--16------78779---9999999997-19945997268 Q gi|254781017|r 211 QKIYALM-GKLSQISCQALRNKNLKVLAQAMNRQQGLLET--LG------VSDSK---LSEIVWKLRE-QPHIMASKISG 277 (324) Q Consensus 211 ~~~~~~~-~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~--lg------vs~~~---l~~li~~~~~-~~g~~~aklsG 277 (324) +..++.+ ...-+..+++|+.+|++.||+.|..+-..+.. +. .-+++ +=+.|+.+++ ..|...+--.= T Consensus 220 ~ew~e~v~~dr~~~m~~aik~kdF~~~g~~t~~n~l~mHaTtl~a~PPf~Y~~~~S~~a~~av~~lrqfy~g~~~yfTmD 299 (325) T TIGR01240 220 KEWVESVVIDRFEVMKKAIKTKDFEKLGKETEANSLSMHATTLDAQPPFSYLSDESKKAMEAVKELRQFYEGIICYFTMD 299 (325) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECC T ss_conf 22343100058999999973212899999998745652000014774800038446899999999988738813687415 Q ss_pred CCCCCEEEEEECCCCCHHHHHHHH Q ss_conf 997548999846653268999999 Q gi|254781017|r 278 SGLGDCVIALGKGDLNSLPYQSVN 301 (324) Q Consensus 278 AG~GG~~i~l~~~~~~~~~~~~~~ 301 (324) ||+ -+.+|+.++.+++..+.+. T Consensus 300 AGP--Nvkvl~l~~~ls~l~e~~~ 321 (325) T TIGR01240 300 AGP--NVKVLVLEEDLSKLAETLT 321 (325) T ss_pred CCC--CEEEEEECCCHHHHHHHHH T ss_conf 888--5388762323689999984 No 50 >pfam10509 GalKase_gal_bdg Galactokinase galactose-binding signature. This is the highly conserved galactokinase signature sequence which appears to be present in all galactokinases irrespective of how many other ATP binding sites, etc that they carry. The function of this domain appears to be to bind galactose, and the domain is normally at the N-terminus of the enzymes, EC:2.7.1.6. This domain is associated with the families GHMP_kinases_C, pfam08544 and GHMP_kinases_N, pfam00288. Probab=98.82 E-value=8.3e-09 Score=74.92 Aligned_cols=44 Identities=34% Similarity=0.375 Sum_probs=38.7 Q ss_pred CCCCCEEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEEC Q ss_conf 988878999717049997435330894289877677289999986 Q gi|254781017|r 2 GQCLHKICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLR 46 (324) Q Consensus 2 ~~~~~ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~ 46 (324) |..|. +.++||||++|+|||+||+|+.++++||++++++.+.++ T Consensus 9 g~~p~-~~~~APGRvnlIGeHtDY~~G~Vlp~Ai~~~~~va~~~r 52 (52) T pfam10509 9 GVEPE-GVASAPGRVNLIGEHTDYNGGFVLPAAINLDTYVAVSKR 52 (52) T ss_pred CCCCC-EEEECCCEEEEEECCEECCCCEEEEEEECCCEEEEEECC T ss_conf 89986-699889758870114603897788898316369999749 No 51 >pfam08544 GHMP_kinases_C GHMP kinases C terminal. This family includes homoserine kinases, galactokinases and mevalonate kinases. Probab=98.81 E-value=1.7e-08 Score=72.99 Aligned_cols=66 Identities=33% Similarity=0.458 Sum_probs=55.5 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCC Q ss_conf 98730486445689999989999861678779999999999719945997268997548999846653 Q gi|254781017|r 225 CQALRNKNLKVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDL 292 (324) Q Consensus 225 ~~al~~~d~~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~ 292 (324) .+++.++|++.++++|+.++.. ...++.+|+++++++.+ ...|++++++||||+|+|+++|++++. T Consensus 2 ~~~l~~~d~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~-~~~G~~~a~~sGsg~G~~v~~l~~~~~ 67 (82) T pfam08544 2 LEALREGDLELLGELLNENALL-LYLGILSPELDELLEEL-RELGALGAKLSGSGGGPTVFALFKDEE 67 (82) T ss_pred HHHHHCCCHHHHHHHHHHHHHC-CHHHCCCHHHHHHHHHH-HHCCCCEEECCCCCCCCEEEEEECCHH T ss_conf 3788818999999999999864-05505778899999999-985998554125898885888858989 No 52 >KOG1537 consensus Probab=98.49 E-value=1.8e-06 Score=60.07 Aligned_cols=215 Identities=17% Similarity=0.191 Sum_probs=119.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHC----------------------CC Q ss_conf 73899982687535488740245665567776404566660145578888776521----------------------46 Q gi|254781017|r 88 GFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQ----------------------GI 145 (324) Q Consensus 88 g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~----------------------g~ 145 (324) +-++.+..-||.|+|+|||++..++-++..+.- .++.+++..+......+|+.-. -+ T Consensus 94 ~Tk~hvtNPiplgrGigssgta~~aGv~l~ne~-a~LGlsk~~mldy~lmierhpdn~~a~mmGgf~GSflr~l~e~E~~ 172 (355) T KOG1537 94 TTKKHVTNPIPLGRGIGSSGTAKMAGVRLVNES-ADLGLSKGSMLDYSLMIERHPDNAVAEMMGGFLGSFLRALLESEAK 172 (355) T ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHEECCHH-HHCCCCCCCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 002541377656555230345665442130447-6528761023017888764717789999866789999875876664 Q ss_pred CCC--HH-HHHHHCCCEEEEEEC-CCEE---EEEECCC--CCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 887--54-477662985899852-7515---6730477--7526765841798999988899886082348999999984 Q gi|254781017|r 146 SSG--ID-LAASIHGGLICYQMP-KYSI---EKIDFIF--PIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYAL 216 (324) Q Consensus 146 ~sG--~D-~~a~~~Gg~i~~~~~-~~~~---~~~~~~~--~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~ 216 (324) .+| .| ..-...||++....- +..+ .+++|.. .+.++......+|+++... +...+|...-... T Consensus 173 ~~~~~ad~ilp~~~gg~~li~~lpP~dlg~~~r~pw~~~lk~i~viP~Fel~T~k~R~v---LPt~yp~~d~V~N----- 244 (355) T KOG1537 173 VSGYHADNILPAIMGGFVLIRNLPPLDLGKPLRFPWDKDLKFILVIPDFELPTKKMRAV---LPTEYPMVDHVWN----- 244 (355) T ss_pred HCCCCHHHHCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHH---CCCCCCCEEEEEC----- T ss_conf 23788777250125870550588952236633377776613899944444660455420---6755663002200----- Q ss_pred HCCCCHHHHHHHCCC-CHHHHHH-HHHH-HHHHHHHHCCCCHHHHHHHHHHHH--CCCCEEEECCCCCCCCEEEEEECCC Q ss_conf 033207889873048-6445689-9999-899998616787799999999997--1994599726899754899984665 Q gi|254781017|r 217 MGKLSQISCQALRNK-NLKVLAQ-AMNR-QQGLLETLGVSDSKLSEIVWKLRE--QPHIMASKISGSGLGDCVIALGKGD 291 (324) Q Consensus 217 ~~~i~~~~~~al~~~-d~~~l~~-lm~~-~~~~L~~lgvs~~~l~~li~~~~~--~~g~~~aklsGAG~GG~~i~l~~~~ 291 (324) +++++ ....|+..+ |-..|+. +|.. -|+--++ .-.|.++++...+.. ++|.+|.-+||||. .++||.. + T Consensus 245 lqrlA-~LttAl~~~p~n~~L~y~~m~DkvhqPyRa--~LIPGl~~il~~~~p~t~pGl~GiclSGAGP--T~lAlat-e 318 (355) T KOG1537 245 LQRLA-ALTTALLEGPDNVMLGYALMSDKVHQPYRA--PLIPGLEAILKAALPATYPGLFGICLSGAGP--TALALAT-E 318 (355) T ss_pred HHHHH-HHHHHHHCCCCCHHHHHHHHHCCCCCCCCC--CCCCCHHHHHHHHCCCCCCCEEEEEECCCCC--EEEEEEC-C T ss_conf 88899-999998149971355556653212475314--5574189999861855678624588438897--3688743-7 Q ss_pred CCHHHHHHHHHHHHHCCCE--EEEEECC Q ss_conf 3268999999999977978--9994145 Q gi|254781017|r 292 LNSLPYQSVNCHMHAKGID--IVPITPS 317 (324) Q Consensus 292 ~~~~~~~~~~~~~~~~g~~--~~~~~~~ 317 (324) +.+++-+++-+.+.+.|.. ...++|. T Consensus 319 nf~eI~~~mv~~F~K~G~kcs~~~l~pa 346 (355) T KOG1537 319 NFQEIGEKMVEAFWKVGHKCSVASLKPA 346 (355) T ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 4789999999999861844346762002 No 53 >TIGR00144 beta_RFAP_syn beta-ribofuranosylaminobenzene 5'-phosphate synthase family; InterPro: IPR004422 This protein family contains several archaeal examples of beta-ribofuranosylaminobenzene 5-prime-phosphate synthase (beta-RFAP synthase), an enzyme involved in methanopterin biosynthesis . In some species, two members of this family are found. It is unclear whether both act as beta-RFAP synthase. This family is related to the GHMP kinases (Galactokinase, Homoserine kinase, Mevalonate kinase, Phosphomevalonate kinase). Members are found so far only in the Archaea and in Methylobacterium extorquens.They have an amino-terminal domain probably related to ATP binding.. Probab=98.26 E-value=0.00042 Score=45.17 Aligned_cols=290 Identities=20% Similarity=0.230 Sum_probs=171.1 Q ss_pred EECCEEEEE-----ECCCEEECCCEEEEEEECC-EEEEEEEECCCCEEEEEECCCCC-CCCCCHHHCCCHHHHHHHHHHH Q ss_conf 971704999-----7435330894289877677-28999998689849999536444-4656612015089999999997 Q gi|254781017|r 10 VSAPGSLVL-----MGEHGVLHGHAALVFAINK-RVILYLTLRKDRLINIDSSLGQY-CGSLDLAMFHPSFSFIIMAINH 82 (324) Q Consensus 10 ~~aPGKv~L-----~GEh~v~~G~~ai~~aI~~-~~~~~~~~~~~~~i~i~~~~~~~-~~~~~~~~~~~~~~~i~~~l~~ 82 (324) ++.|.|+.+ .|+-.-+.|+. .++++. .+.+....+++-.+.++...... ...................+.+ T Consensus 3 ~~~p~~~~~~l~dl~g~~g~~dgg~--g~~l~~p~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (337) T TIGR00144 3 LNTPSRLHLTLIDLNGSIGRVDGGV--GLALEEPELEIGGKESDDLGVEFDKKLVGKLGEPYNKSVRDRVYEAAAKTLDY 80 (337) T ss_pred CCCCCCEEEEEEECCCCCCCCCCCC--CEEECCCCEEEECCCCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH T ss_conf 3575410345552166422224552--10112653144023333333101135565411045677888889989999987 Q ss_pred HCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCEEEE Q ss_conf 18778738999826875354887402456655677764045666601455788887765214688754477662985899 Q gi|254781017|r 83 IKPSCGFDLKVISQLDSQLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQGISSGIDLAASIHGGLICY 162 (324) Q Consensus 83 ~~~~~g~~i~i~s~iP~g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~g~~sG~D~~a~~~Gg~i~~ 162 (324) .....++++++.+..|.-+||||...+..++.+.+..+ ++..++..+++++.-. |-.+|+-..+--+||++.- T Consensus 81 ~g~d~~~~~~~~~~~~~~~glg~g~~~~l~~~~~~~~~-~~~~~~~~~~~~~~g~------gg~~g~g~~~~~~gg~~~d 153 (337) T TIGR00144 81 IGGDEGFDLEVLSLFPAHSGLGSGTQLSLAVGRLVSEL-YGLELTARELAEIVGR------GGTSGIGVAAFEDGGFLVD 153 (337) T ss_pred HCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHH-HCCCCHHHHHHHHHCC------CCCCCCCEEEECCCCEEEE T ss_conf 26765511687530320014566512467788888887-3243104556665327------7754330122215752664 Q ss_pred EE----------------------CCCEEEEEEC--CCCCCEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 85----------------------2751567304--77752676584179899998889988608234899999998403 Q gi|254781017|r 163 QM----------------------PKYSIEKIDF--IFPIHLIYSGYKTPTAQVLKKISYIEIEYPEINEINQKIYALMG 218 (324) Q Consensus 163 ~~----------------------~~~~~~~~~~--~~~l~l~~tg~~~~T~~~v~~v~~~~~~~~~~~~~~~~~~~~~~ 218 (324) .. +...+...++ .|..++......+... -.+.+.-.++.-|...+..+.+-.. T Consensus 154 ~g~~~g~g~~~~~~~p~~~~~g~~p~p~~~~~d~p~~w~~~~~~p~~~~~~~-g~~~~~~~~~~cp~pl~~~~~~~~~-- 230 (337) T TIGR00144 154 GGHSTGKGKEKEDFLPSSASKGAKPAPVIARLDFPEDWNVILAIPELDRGVF-GKREVNLFQKYCPVPLEDVEKLCHL-- 230 (337) T ss_pred CCCCCCCCCCCHHCCCCCCCCCCCCCCEEEECCCCCCCCEEEEEECCCCCCC-CCHHHHHHHHHCCCCHHHHHHHHHH-- T ss_conf 5654466643000255321256776531221257745415887313211124-4113344554258766788999999-- Q ss_pred CCCHHHHHHHCCCCHHHHHHHHHHHHHH-HHH--HCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHH Q ss_conf 3207889873048644568999998999-986--1678779999999999719945997268997548999846653268 Q gi|254781017|r 219 KLSQISCQALRNKNLKVLAQAMNRQQGL-LET--LGVSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSL 295 (324) Q Consensus 219 ~i~~~~~~al~~~d~~~l~~lm~~~~~~-L~~--lgvs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~ 295 (324) +......++.+.|++.|++.+++-+.. .++ +....+-++.+++.+. ..|+.++-++.-|+ .+.++... .... T Consensus 231 -~l~~~~p~~~~~~~~~~g~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~-~~g~~~~g~~~~gp--~~~~~~d~-~~~~ 305 (337) T TIGR00144 231 -ILMKMLPAVVEGDLDDFGEALNKLQGLGFKKVELELQGPLIKGLIDLLD-KDGAPGAGLSSFGP--TVYAVTDE-KPGG 305 (337) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCC--EEEEECCC-CCHH T ss_conf -9987655666301567778887776313023445553357888887654-14753134222475--24652254-4014 Q ss_pred HHHHHHHHHHHCCCEEEEEEC Q ss_conf 999999999977978999414 Q gi|254781017|r 296 PYQSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 296 ~~~~~~~~~~~~g~~~~~~~~ 316 (324) ....+...+...|+....+.. T Consensus 306 ~~~~~~~~~~~~g~~~~~~~~ 326 (337) T TIGR00144 306 IAGALNDIFGPYGYYGRIIVT 326 (337) T ss_pred HHHHHHHHHHCCCCCEEEEEE T ss_conf 677788752003641137874 No 54 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=65.03 E-value=9.7 Score=17.53 Aligned_cols=84 Identities=21% Similarity=0.205 Sum_probs=38.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHC-CCCC--CHHHH--HHHCC---CEEEEEECCCEEEEEECC---- Q ss_conf 245665567776404566660145578888776521-4688--75447--76629---858998527515673047---- Q gi|254781017|r 108 AITVAITAALLTLQYHKEPSPDEILTTAHAIVLKVQ-GISS--GIDLA--ASIHG---GLICYQMPKYSIEKIDFI---- 175 (324) Q Consensus 108 a~~va~~~al~~~~~~~~~~~~~i~~lA~~~E~~~~-g~~s--G~D~~--a~~~G---g~i~~~~~~~~~~~~~~~---- 175 (324) +-.+|.++...+-.-....+..+|=++|.+.=..-+ =+|. |+-+. -.-|= |.+|....+..+.-+|-. T Consensus 16 g~l~a~vl~~~~~~v~PGvST~eLD~i~~~~i~~~~hA~pAf~GY~PPfsdvgfPPft~~~C~S~Ne~V~HGiP~~~~vL 95 (265) T TIGR00500 16 GRLVAEVLEILEREVKPGVSTKELDRIAKDFIEKRGHAKPAFLGYYPPFSDVGFPPFTGSVCISVNEVVIHGIPDKKKVL 95 (265) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEEECCCCCCCEE T ss_conf 99999999999863689887899999999999847972353567877866687885556156321678885138847265 Q ss_pred -------CCCCEEECCCCCCHHH Q ss_conf -------7752676584179899 Q gi|254781017|r 176 -------FPIHLIYSGYKTPTAQ 191 (324) Q Consensus 176 -------~~l~l~~tg~~~~T~~ 191 (324) ..+-+.+-|+-..|+. T Consensus 96 KdGDivniD~g~~~~Gy~gD~a~ 118 (265) T TIGR00500 96 KDGDIVNIDVGVIYDGYYGDTAI 118 (265) T ss_pred ECCCEEEEEEEEEECCEEECCCC T ss_conf 13677875577886030211323 No 55 >PRK00068 hypothetical protein; Validated Probab=63.47 E-value=10 Score=17.35 Aligned_cols=30 Identities=17% Similarity=0.222 Sum_probs=14.9 Q ss_pred CCCCHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 332078898730486445689999989999 Q gi|254781017|r 218 GKLSQISCQALRNKNLKVLAQAMNRQQGLL 247 (324) Q Consensus 218 ~~i~~~~~~al~~~d~~~l~~lm~~~~~~L 247 (324) .+.-+++.+|++++|+..+|+.+++=.+.| T Consensus 942 ~~a~~~a~~Alk~GDw~~yG~a~~~L~~aL 971 (978) T PRK00068 942 QDAYNKAIEAQKNGDFAEYGEALQRLDDAL 971 (978) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 999999999887599999999999999999 No 56 >PRK05182 DNA-directed RNA polymerase subunit alpha; Provisional Probab=55.14 E-value=14 Score=16.45 Aligned_cols=90 Identities=21% Similarity=0.220 Sum_probs=53.5 Q ss_pred CCEEECCCCCCHHHHHHHHHHHHHHC------------------CCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHH Q ss_conf 52676584179899998889988608------------------234899999998403320788987304864456899 Q gi|254781017|r 178 IHLIYSGYKTPTAQVLKKISYIEIEY------------------PEINEINQKIYALMGKLSQISCQALRNKNLKVLAQA 239 (324) Q Consensus 178 l~l~~tg~~~~T~~~v~~v~~~~~~~------------------~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~l 239 (324) ++-+||+-.-.-.++++........+ +...+....-++.+ +++-++...|++.++..+++| T Consensus 193 ~leI~TnGsI~P~eAl~~Aa~il~~~~~~f~~~~~~~~~~~~~~~~~~~~l~~~IeeL-eLSvRs~NcLk~a~I~tI~dL 271 (306) T PRK05182 193 VLEIETNGSITPEEALALAAKILVEQLSVFVDLEEEEAEVEEEEPEFDPILLRPIEEL-ELSVRSYNCLKRAGINTIGDL 271 (306) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCCHHHH-CCCHHHHHHHHHCCCCCHHHH T ss_conf 9999959979999999999999999999974866455454556655264644969881-386888755878599369998 Q ss_pred HHHHHHHHHHH-CCCCHHHHHHHHHHHHCC Q ss_conf 99989999861-678779999999999719 Q gi|254781017|r 240 MNRQQGLLETL-GVSDSKLSEIVWKLREQP 268 (324) Q Consensus 240 m~~~~~~L~~l-gvs~~~l~~li~~~~~~~ 268 (324) ++.+.+.|.++ ++.-..++++.+.+.+.+ T Consensus 272 v~~se~dLl~ikNfGkKSl~EI~~kL~~~g 301 (306) T PRK05182 272 VQRTEEELLKTPNLGKKSLEEIKEKLAELG 301 (306) T ss_pred HHCCHHHHHHCCCCCCCCHHHHHHHHHHCC T ss_conf 747999996189976203999999999729 No 57 >PRK05456 ATP-dependent protease peptidase subunit; Provisional Probab=45.14 E-value=21 Score=15.46 Aligned_cols=125 Identities=18% Similarity=0.167 Sum_probs=59.6 Q ss_pred EEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEE--C--------------CCCCCCCCCH---- Q ss_conf 8999717049997435330894289877677289999986898499995--3--------------6444465661---- Q gi|254781017|r 7 KICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDS--S--------------LGQYCGSLDL---- 66 (324) Q Consensus 7 ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~--~--------------~~~~~~~~~~---- 66 (324) -++||-+|++.+.|.-.|-.|..++ +..--.++.-.++.+-.-. . ..++..++.. T Consensus 4 Il~Vr~~g~v~iaGDGQVT~G~~V~-----K~~a~KVRrl~~~~Vi~GFAGstADAftLferfE~KLe~~~GnL~RaaVE 78 (172) T PRK05456 4 ILAVRRNGKVVIAGDGQVTLGNTVM-----KGNARKVRRLYNGKVLAGFAGSTADAFTLFERFEAKLEQHQGNLLRAAVE 78 (172) T ss_pred EEEEEECCEEEEEECCEEEECCEEE-----CCCCEEEEECCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 9999989999998277653176888-----06632679916996999835407789999999999999773067899999 Q ss_pred ---HH-CCCHHHHHHHHHHHHCCCCCEEEEEECCC--CC--CCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf ---20-15089999999997187787389998268--75--354887402456655677764045666601455788887 Q gi|254781017|r 67 ---AM-FHPSFSFIIMAINHIKPSCGFDLKVISQL--DS--QLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAI 138 (324) Q Consensus 67 ---~~-~~~~~~~i~~~l~~~~~~~g~~i~i~s~i--P~--g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~ 138 (324) ++ ....++.+.+.+-......-+-|+=.+|+ |- =.+.||-+.+..|.++||.. +.+++.++|++.+..+ T Consensus 79 LAK~WRtDk~LRrLEAmlivaD~~~~llisG~GDViEPddgi~aIGSGG~yA~aAArAL~~---~~~l~a~~Ia~~al~I 155 (172) T PRK05456 79 LAKDWRTDRYLRRLEAMLIVADKEASLIISGNGDVIEPEDGIIAIGSGGNYALAAARALLE---NTDLSAEEIAEKALKI 155 (172) T ss_pred HHHHHHHHHHHHHHHHHEEEECCCCEEEEECCCCEECCCCCEEEECCCHHHHHHHHHHHHH---CCCCCHHHHHHHHHHH T ss_conf 9999854078887651347764575899925755668998768872867999999999987---2799999999999998 Q ss_pred H Q ss_conf 7 Q gi|254781017|r 139 V 139 (324) Q Consensus 139 E 139 (324) - T Consensus 156 A 156 (172) T PRK05456 156 A 156 (172) T ss_pred H T ss_conf 8 No 58 >cd01913 protease_HslV Protease HslV and the ATPase/chaperone HslU are part of an ATP-dependent proteolytic system that is the prokaryotic homolog of the proteasome. HslV is a dimer of hexamers (a dodecamer) that forms a central proteolytic chamber with active sites on the interior walls of the cavity. HslV shares significant sequence and structural similarity with the proteasomal beta-subunit and both are members of the Ntn-family of hydrolases. HslV has a nucleophilic threonine residue at its N-terminus that is exposed after processing of the propeptide and is directly involved in active site catalysis. Probab=41.34 E-value=24 Score=15.09 Aligned_cols=125 Identities=19% Similarity=0.186 Sum_probs=60.6 Q ss_pred EEEEECCEEEEEECCCEEECCCEEEEEEECCEEEEEEEECCCCEEEEEE----------------CCCCCCCCCCH---- Q ss_conf 8999717049997435330894289877677289999986898499995----------------36444465661---- Q gi|254781017|r 7 KICVSAPGSLVLMGEHGVLHGHAALVFAINKRVILYLTLRKDRLINIDS----------------SLGQYCGSLDL---- 66 (324) Q Consensus 7 ki~~~aPGKv~L~GEh~v~~G~~ai~~aI~~~~~~~~~~~~~~~i~i~~----------------~~~~~~~~~~~---- 66 (324) -++||-+|++.+.|.-.|-.|..++ +..--.++.-.++.+-.-. ...++...+.. T Consensus 3 Il~Vr~~g~vaiagDGQvT~G~~v~-----K~~A~KVRrl~~g~vlaGFAGstADAftL~erfE~KLe~~~g~L~raaVE 77 (171) T cd01913 3 ILAVRKNGKVVIAGDGQVTLGNTVM-----KGNARKVRRLYNGKVIAGFAGSTADAFTLFERFEAKLEQYPGNLLRAAVE 77 (171) T ss_pred EEEEEECCEEEEEECCEEEECCEEE-----CCCCEEEEECCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 9999989999999577332567878-----37704999947997999735617889999999999999775820799999 Q ss_pred ---H-HCCCHHHHHHHHHHHHCCCCCEEEEEECCC--CCC--CCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf ---2-015089999999997187787389998268--753--54887402456655677764045666601455788887 Q gi|254781017|r 67 ---A-MFHPSFSFIIMAINHIKPSCGFDLKVISQL--DSQ--LGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAI 138 (324) Q Consensus 67 ---~-~~~~~~~~i~~~l~~~~~~~g~~i~i~s~i--P~g--~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~ 138 (324) + .....++.+.+.+-......-+-|.=.+|+ |-. .+.||-+.+..|-++||.. +.+++.++|++-+..+ T Consensus 78 LAK~WRtDk~LRrLEAmlivad~~~~l~iSG~GdViEP~~~i~aIGSGg~yA~aAArAL~~---~~~l~A~~Ia~~sl~I 154 (171) T cd01913 78 LAKDWRTDRYLRRLEAMLIVADKEHTLLISGNGDVIEPDDGIAAIGSGGNYALAAARALLD---HTDLSAEEIARKALKI 154 (171) T ss_pred HHHHHHHHHHHHHHHHHEEEECCCCEEEEECCCCEECCCCCEEEECCCHHHHHHHHHHHHH---CCCCCHHHHHHHHHHH T ss_conf 9988612288887773505775674899926865677888768972876999999999876---4899999999999998 Q ss_pred H Q ss_conf 7 Q gi|254781017|r 139 V 139 (324) Q Consensus 139 E 139 (324) - T Consensus 155 A 155 (171) T cd01913 155 A 155 (171) T ss_pred H T ss_conf 8 No 59 >PRK11302 DNA-binding transcriptional regulator HexR; Provisional Probab=39.41 E-value=25 Score=14.91 Aligned_cols=42 Identities=17% Similarity=0.240 Sum_probs=30.1 Q ss_pred CCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 89975489998466532689999999999779789994145456 Q gi|254781017|r 277 GSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHST 320 (324) Q Consensus 277 GAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 320 (324) ..+.||+++++.-.....+.++ ..+...+.|..++.++ +.+. T Consensus 172 ~~~~~Dvvi~iS~sG~t~e~i~-~~~~Ak~~ga~vIaIT-~~~S 213 (284) T PRK11302 172 NSSDGDVVVLISHTGRTKSLVE-LAQLARENGATVIGIT-AAGS 213 (284) T ss_pred HCCCCCEEEEECCCCCCHHHHH-HHHHHHHCCCCEEEEC-CCCC T ss_conf 1999888999737999879999-9999998799589977-9998 No 60 >TIGR02957 SigX4 RNA polymerase sigma-70 factor, TIGR02957 family; InterPro: IPR014303 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of sigma factors appear by homology, tree building and bidirectional best hits, to represent a conserved family. This entry is found in a limited number of bacterial lineages and ncludes apparent paralogous expansion in Streptomyces coelicolor A3(2), and multiple copies in Mycobacterium smegmatis str. MC2 155, Streptomyces avermitilis MA-4680 and Nocardia farcinica IFM 10152.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=37.85 E-value=24 Score=15.07 Aligned_cols=24 Identities=21% Similarity=0.264 Sum_probs=18.2 Q ss_pred CCCHHHHHHHCCCCHHHHHHHHHH Q ss_conf 320788987304864456899999 Q gi|254781017|r 219 KLSQISCQALRNKNLKVLAQAMNR 242 (324) Q Consensus 219 ~i~~~~~~al~~~d~~~l~~lm~~ 242 (324) ++++.+.+|++.||++.|-+++.+ T Consensus 170 ~l~~~F~~A~~~GD~~~L~~~L~~ 193 (287) T TIGR02957 170 ELLERFVEALQTGDLEGLLELLAE 193 (287) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 999999999831897899986253 No 61 >TIGR01418 PEP_synth phosphoenolpyruvate synthase; InterPro: IPR006319 This family represents phosphoenolpyruvate synthase; also called pyruvate,water dikinase and PEP synthase. The member from Methanococcus jannaschii contains a large intein. This enzyme generates phosphoenolpyruvate (PEP) from pyruvate, hydrolyzing ATP to AMP and releasing inorganic phosphate in the process. The enzyme shows extensive homology to other enzymes that use PEP as substrate or product. This enzyme may provide PEP for gluconeogenesis, for PTS-type carbohydrate transport systems, or for other processes. ; GO: 0008986 pyruvate water dikinase activity, 0006094 gluconeogenesis. Probab=35.11 E-value=29 Score=14.48 Aligned_cols=157 Identities=16% Similarity=0.237 Sum_probs=82.5 Q ss_pred HHHHHHCCCEEEEEECCCEEEEE----------ECCCCCCEEECCCCCCHH-----------HHHHHHH-HHHHHC---- Q ss_conf 44776629858998527515673----------047775267658417989-----------9998889-988608---- Q gi|254781017|r 150 DLAASIHGGLICYQMPKYSIEKI----------DFIFPIHLIYSGYKTPTA-----------QVLKKIS-YIEIEY---- 203 (324) Q Consensus 150 D~~a~~~Gg~i~~~~~~~~~~~~----------~~~~~l~l~~tg~~~~T~-----------~~v~~v~-~~~~~~---- 203 (324) -++++.|=.++..++.+.+..-. |-.-+=.|.|=|..|.-+ +++++|+ ++=-.+ T Consensus 647 ~vA~AFYPkPV~VR~SDFKsNEY~~L~GG~~yEP~EeNPMlG~RGasRY~s~~y~eaF~lE~~A~k~VR~eMGLtNV~vM 726 (877) T TIGR01418 647 KVAAAFYPKPVIVRTSDFKSNEYRNLIGGEKYEPDEENPMLGWRGASRYYSESYEEAFRLECKAIKRVREEMGLTNVEVM 726 (877) T ss_pred HHHHHHCCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEC T ss_conf 99875388971898136983699871688867877677783324404026766047999999999988765085133113 Q ss_pred -CC--HHHHHHHHHHHHCC-----------------------CCHHHHHH-HC-----CCCHHHHHHHHHHHHHHHHHHC Q ss_conf -23--48999999984033-----------------------20788987-30-----4864456899999899998616 Q gi|254781017|r 204 -PE--INEINQKIYALMGK-----------------------LSQISCQA-LR-----NKNLKVLAQAMNRQQGLLETLG 251 (324) Q Consensus 204 -~~--~~~~~~~~~~~~~~-----------------------i~~~~~~a-l~-----~~d~~~l~~lm~~~~~~L~~lg 251 (324) |. ..+-.++.++.|.+ +++++.+. +. .||+.+|-=-++++-+++..|| T Consensus 727 iPFvRT~~E~~rv~~im~~~GL~~g~nGl~~~vM~E~PsnvlLAd~f~~~GfDGFSiGSNDLTQLTLgvDRDSelVA~lG 806 (877) T TIGR01418 727 IPFVRTPEEGKRVLEIMAEEGLKRGKNGLEVYVMCEVPSNVLLADEFAKLGFDGFSIGSNDLTQLTLGVDRDSELVAHLG 806 (877) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHCCCHHHHHCC T ss_conf 57548889999999999972888477855799988356899999989850877156575168776642010431433037 Q ss_pred CC---CHHH----HHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCC Q ss_conf 78---7799----99999999719945997268997548999846653268999999999977978999414545644 Q gi|254781017|r 252 VS---DSKL----SEIVWKLREQPHIMASKISGSGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHSTSL 322 (324) Q Consensus 252 vs---~~~l----~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 322 (324) .- +|.+ ..+|+.++++ |. .+-|||=|+ .| ..++.++|=+.|++-+.++||-..++ T Consensus 807 YFDErn~AVL~li~~~I~aAk~~-G~-~vgICGQAP---------SD-----yPe~vefLVe~GIDSiSlNPDaV~~t 868 (877) T TIGR01418 807 YFDERNPAVLRLIEMAIKAAKEH-GK-KVGICGQAP---------SD-----YPEVVEFLVEEGIDSISLNPDAVLKT 868 (877) T ss_pred CCCCCCHHHHHHHHHHHHHHHHC-CC-EEEEECCCC---------CC-----HHHHHHHHHHCCCCEEEECCCCCHHH T ss_conf 77576879999999999999873-99-698706888---------87-----38999999854987698554010889 No 62 >PRK07201 short chain dehydrogenase; Provisional Probab=33.03 E-value=32 Score=14.27 Aligned_cols=73 Identities=14% Similarity=0.129 Sum_probs=43.7 Q ss_pred HHHHHHHHCCCCHHHHHHH----HHHH-----------HCCCCEE---EECCCCCCC------------CEEEEEECCCC Q ss_conf 8999986167877999999----9999-----------7199459---972689975------------48999846653 Q gi|254781017|r 243 QQGLLETLGVSDSKLSEIV----WKLR-----------EQPHIMA---SKISGSGLG------------DCVIALGKGDL 292 (324) Q Consensus 243 ~~~~L~~lgvs~~~l~~li----~~~~-----------~~~g~~~---aklsGAG~G------------G~~i~l~~~~~ 292 (324) .+..|+..|+.+|.++.-- +--. ...|-+. +-+|||..| |+-++++.. T Consensus 331 t~~~l~~~~i~~p~l~~ya~~lw~yw~~~ldp~~~~~~~~~g~L~GKvalITGASSGIG~A~A~~LA~~GA~Vvl~AR-- 408 (663) T PRK07201 331 TQEALKGTGIEVPRLASYAPRLWDYWERHLDPDRARDRDLRGPLEGKHVIITGASSGIGRATAIKVAEAGATVFALAR-- 408 (663) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCEEEEEEC-- T ss_conf 788760058888774551278888887526920013777688879947999388759999999999987998999989-- Q ss_pred CHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 2689999999999779789994145 Q gi|254781017|r 293 NSLPYQSVNCHMHAKGIDIVPITPS 317 (324) Q Consensus 293 ~~~~~~~~~~~~~~~g~~~~~~~~~ 317 (324) ..+.++++.+.+.+.|-+......| T Consensus 409 ~~e~Le~v~~ei~~~Gg~a~~~~~D 433 (663) T PRK07201 409 DGEKLDELVAEIRARGGQAHAYTCD 433 (663) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 9999999999999559918999962 No 63 >TIGR02667 moaB_proteo molybdenum cofactor biosynthesis protein B; InterPro: IPR013484 MoaB is thought to be involved in molybdopterin biosynthesis, though its exact role is not known. Structural studies of this polypeptide suggest that it may play a role in substrate-shuttling during biosynthesis . MoaB was capable of binding GTP, and it was suggested that the putative active site could also bind precursor Z and/or molybdenum. Potential protein interaction domains were also found, implying that MoaB may play a transport and/or storage role in molybdopterin biosynthesis. . Probab=30.58 E-value=23 Score=15.14 Aligned_cols=32 Identities=19% Similarity=0.231 Sum_probs=26.8 Q ss_pred CHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 0788987304864456899999899998616787 Q gi|254781017|r 221 SQISCQALRNKNLKVLAQAMNRQQGLLETLGVSD 254 (324) Q Consensus 221 ~~~~~~al~~~d~~~l~~lm~~~~~~L~~lgvs~ 254 (324) +-+|++-|.++..+-||++|+ +-+.+++|.|+ T Consensus 81 TPEAl~PLfDK~~~GFGElFR--~~S~~eIGTST 112 (163) T TIGR02667 81 TPEALEPLFDKTVDGFGELFR--QISYEEIGTST 112 (163) T ss_pred CHHHHHHCCCCCCCCCHHHHH--HHHHHHHHHHH T ss_conf 255310013676797135665--52233220044 No 64 >PRK07774 short chain dehydrogenase; Provisional Probab=29.79 E-value=36 Score=13.93 Aligned_cols=20 Identities=10% Similarity=0.059 Sum_probs=10.7 Q ss_pred HHHHHHHHHHCCCCCEEEEEECC Q ss_conf 99999999718778738999826 Q gi|254781017|r 74 SFIIMAINHIKPSCGFDLKVISQ 96 (324) Q Consensus 74 ~~i~~~l~~~~~~~g~~i~i~s~ 96 (324) ..+....+++.. +++-+..- T Consensus 72 ~~~~~~~~~fG~---iDilVNnA 91 (250) T PRK07774 72 AMADRTVSAFGG---IDYLVNNA 91 (250) T ss_pred HHHHHHHHHHCC---CCEEEECC T ss_conf 999999998399---98999888 No 65 >PRK06194 hypothetical protein; Provisional Probab=28.49 E-value=38 Score=13.79 Aligned_cols=14 Identities=14% Similarity=0.349 Sum_probs=6.5 Q ss_pred CCHHHHHHHCCCCH Q ss_conf 20788987304864 Q gi|254781017|r 220 LSQISCQALRNKNL 233 (324) Q Consensus 220 i~~~~~~al~~~d~ 233 (324) +.+...++++++.+ T Consensus 242 va~~v~~~i~~~~f 255 (301) T PRK06194 242 VAQLVFDAIAADRF 255 (301) T ss_pred HHHHHHHHHHHCCE T ss_conf 99999999970977 No 66 >cd05563 PTS_IIB_ascorbate PTS_IIB_ascorbate: subunit IIB of enzyme II (EII) of the L-ascorbate-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is an L-ascorbate-specific permease with two cytoplasmic subunits (IIA and IIB) and a transmembrane channel IIC subunit. Subunits IIA, IIB, and IIC are encoded by the sgaA, sgaB, and sgaT genes of the E. coli sgaTBA operon. In some bacteria, the IIB (SgaB) domain is fused C-terminal to the IIA (SgaT) domain. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include ascorbate, chitobiose/lichenan, lactose, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. Probab=28.19 E-value=28 Score=14.59 Aligned_cols=21 Identities=33% Similarity=0.466 Sum_probs=13.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHH Q ss_conf 354887402456655677764 Q gi|254781017|r 100 QLGLGSSAAITVAITAALLTL 120 (324) Q Consensus 100 g~GLGSSaa~~va~~~al~~~ 120 (324) |.|||||--+..-+=.++..+ T Consensus 7 g~GlGSS~mlkm~i~~vl~~l 27 (86) T cd05563 7 GSGLGSSLMLKMNVEKVLKEL 27 (86) T ss_pred CCCCCHHHHHHHHHHHHHHHC T ss_conf 999438999999999999986 No 67 >PRK05855 short chain dehydrogenase; Validated Probab=27.79 E-value=39 Score=13.71 Aligned_cols=19 Identities=21% Similarity=0.555 Sum_probs=9.4 Q ss_pred HHHHHHHHHCCCEEEEEEC Q ss_conf 9999999977978999414 Q gi|254781017|r 298 QSVNCHMHAKGIDIVPITP 316 (324) Q Consensus 298 ~~~~~~~~~~g~~~~~~~~ 316 (324) +.+..++...|+.+-.+-| T Consensus 477 esLr~ELa~~GI~V~aVcP 495 (582) T PRK05855 477 ECLRAELAEAGIGVTAICP 495 (582) T ss_pred HHHHHHHCCCCCEEEEEEC T ss_conf 9999985302977999931 No 68 >pfam11921 DUF3439 Domain of unknown function (DUF3439). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 46 to 94 amino acids in length. This domain is found associated with pfam01462, pfam00560. Probab=27.09 E-value=40 Score=13.63 Aligned_cols=11 Identities=18% Similarity=0.250 Sum_probs=6.6 Q ss_pred HHHHHHHHHHH Q ss_conf 44568999998 Q gi|254781017|r 233 LKVLAQAMNRQ 243 (324) Q Consensus 233 ~~~l~~lm~~~ 243 (324) .+.||+.||.. T Consensus 81 pe~fgrvmnac 91 (122) T pfam11921 81 PEAFGRVMNAC 91 (122) T ss_pred HHHHHHHHHHH T ss_conf 79988888887 No 69 >pfam03118 RNA_pol_A_CTD Bacterial RNA polymerase, alpha chain C terminal domain. The alpha subunit of RNA polymerase consists of two independently folded domains, referred to as amino-terminal and carboxyl terminal domains. The amino terminal domain is involved in the interaction with the other subunits of the RNA polymerase. The carboxyl-terminal domain interacts with the DNA and activators. The amino acid sequence of the alpha subunit is conserved in prokaryotic and chloroplast RNA polymerases. There are three regions of particularly strong conservation, two in the amino-terminal and one in the carboxyl- terminal. Probab=26.79 E-value=41 Score=13.60 Aligned_cols=46 Identities=22% Similarity=0.300 Sum_probs=33.4 Q ss_pred CCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHH Q ss_conf 32078898730486445689999989999861-67877999999999 Q gi|254781017|r 219 KLSQISCQALRNKNLKVLAQAMNRQQGLLETL-GVSDSKLSEIVWKL 264 (324) Q Consensus 219 ~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l-gvs~~~l~~li~~~ 264 (324) .+..++..+|++.++..+++++..+...|.++ ++....++++.+.+ T Consensus 14 ~LS~R~~N~Lk~~~I~tv~dL~~~s~~dLl~i~N~G~kSl~EI~~~L 60 (62) T pfam03118 14 ELSVRSYNCLKRAGINTVGDLLSKSEEDLLKIKNFGKKSLEEIKEKL 60 (62) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 68689999998949967999985899999748898685799999998 No 70 >PRK08507 prephenate dehydrogenase; Validated Probab=26.55 E-value=41 Score=13.57 Aligned_cols=25 Identities=16% Similarity=0.219 Sum_probs=12.7 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 7889873048644568999998999 Q gi|254781017|r 222 QISCQALRNKNLKVLAQAMNRQQGL 246 (324) Q Consensus 222 ~~~~~al~~~d~~~l~~lm~~~~~~ 246 (324) .+..++|+++|.+.|-+++.+++.. T Consensus 247 ~~l~~~l~~~d~~~l~~~l~~a~~~ 271 (275) T PRK08507 247 EQAKELIENEDWEELFEWMAQANTL 271 (275) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 9999999816999999999999998 No 71 >TIGR00555 panK_eukar pantothenate kinase; InterPro: IPR004567 Pantothenate kinase (2.7.1.33 from EC) catalyses the conversion of CAATP and pantothenateto to ADP and D-4'-phosphopantothenate, in the key regulatory step in the biosynthesis of coenzyme A (CoA) . This group describes a eukaryotic form of pantothenate kinase, characterised from the fungus Emericella nidulans and with similar forms known in several other eukaryotes. It also includes forms from several Gram-positive bacteria suggested to have originated from the eukaryotic form by lateral transfer. It differs in a number of biochemical properties (such as inhibition by acetyl-CoA) from eubacterial PanK and shows little sequence similarity , .; GO: 0004594 pantothenate kinase activity, 0005524 ATP binding, 0015937 coenzyme A biosynthetic process. Probab=26.22 E-value=35 Score=14.01 Aligned_cols=33 Identities=30% Similarity=0.168 Sum_probs=16.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 3548874024566556777640456666014557888877 Q gi|254781017|r 100 QLGLGSSAAITVAITAALLTLQYHKEPSPDEILTTAHAIV 139 (324) Q Consensus 100 g~GLGSSaa~~va~~~al~~~~~~~~~~~~~i~~lA~~~E 139 (324) |+|||. +++..|..|. -.--+=+|+.++|++.. T Consensus 207 Gts~GG------Gt~~GLg~LL-T~~~~f~ell~mA~~GD 239 (387) T TIGR00555 207 GTSLGG------GTFLGLGKLL-TKIQSFDELLEMAQHGD 239 (387) T ss_pred EEECHH------HHHHHHHHHC-CCCCCHHHHHHHHHCCC T ss_conf 642325------8999999753-68878899999986488 No 72 >COG0024 Map Methionine aminopeptidase [Translation, ribosomal structure and biogenesis] Probab=26.09 E-value=42 Score=13.52 Aligned_cols=19 Identities=11% Similarity=0.048 Sum_probs=11.2 Q ss_pred CCCCCCHHHHHHHHHHHHH Q ss_conf 5666601455788887765 Q gi|254781017|r 123 HKEPSPDEILTTAHAIVLK 141 (324) Q Consensus 123 ~~~~~~~~i~~lA~~~E~~ 141 (324) ....+..||.+++.+.-+. T Consensus 33 ~pGvtt~Eld~~~~~~i~~ 51 (255) T COG0024 33 KPGVTTLELDEIAEEFIRE 51 (255) T ss_pred CCCCCHHHHHHHHHHHHHH T ss_conf 7999799999999999997 No 73 >PRK05876 short chain dehydrogenase; Provisional Probab=25.09 E-value=44 Score=13.40 Aligned_cols=14 Identities=21% Similarity=0.479 Sum_probs=5.8 Q ss_pred CCHHHHHHHCCCCH Q ss_conf 20788987304864 Q gi|254781017|r 220 LSQISCQALRNKNL 233 (324) Q Consensus 220 i~~~~~~al~~~d~ 233 (324) +.+...++++.+.+ T Consensus 229 vA~~v~~~i~~~~f 242 (275) T PRK05876 229 IAQLTADAILANRL 242 (275) T ss_pred HHHHHHHHHHCCCE T ss_conf 99999999960985 No 74 >PRK07417 arogenate dehydrogenase; Reviewed Probab=24.62 E-value=45 Score=13.34 Aligned_cols=27 Identities=22% Similarity=0.211 Sum_probs=15.3 Q ss_pred CHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 078898730486445689999989999 Q gi|254781017|r 221 SQISCQALRNKNLKVLAQAMNRQQGLL 247 (324) Q Consensus 221 ~~~~~~al~~~d~~~l~~lm~~~~~~L 247 (324) .+...++++++|.+.|-++|.+++..- T Consensus 249 L~~l~~~l~~~D~~~l~~~l~~a~~~R 275 (280) T PRK07417 249 LDQLEEAILQENWSALEQKLEQTQELR 275 (280) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 999999998079999999999999999 No 75 >PRK08655 prephenate dehydrogenase; Provisional Probab=24.59 E-value=45 Score=13.34 Aligned_cols=59 Identities=22% Similarity=0.347 Sum_probs=37.8 Q ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH----HHCCCCHHHHH Q ss_conf 9988608234899999998403320788987304864456899999899998----61678779999 Q gi|254781017|r 197 SYIEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLLE----TLGVSDSKLSE 259 (324) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~----~lgvs~~~l~~ 259 (324) ++++..+|...++.+.+++...++ .+.++++|.+.|.+.|.++-.-+. .++-|...+.. T Consensus 225 AdIQ~~Np~~~ev~~~fi~~~~~l----~~~v~~~d~e~F~~~m~~aa~~~gd~~~~~~rSdkai~~ 287 (441) T PRK08655 225 ASIQMHNPQIPEIHETFIKECEEL----SELVRNGDEESFVEKMKEAAKHFGDTERGLGRSDKAIYA 287 (441) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHH----HHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHCHHHHHH T ss_conf 999870886378999999999999----999865689999999999998634156665205888999 No 76 >CHL00013 rpoA RNA polymerase alpha subunit Probab=24.34 E-value=45 Score=13.31 Aligned_cols=48 Identities=19% Similarity=0.273 Sum_probs=36.0 Q ss_pred CCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHH Q ss_conf 32078898730486445689999989999861-6787799999999997 Q gi|254781017|r 219 KLSQISCQALRNKNLKVLAQAMNRQQGLLETL-GVSDSKLSEIVWKLRE 266 (324) Q Consensus 219 ~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~l-gvs~~~l~~li~~~~~ 266 (324) +++-++...|++.++..+++|+..+.+.|-++ +..-..++++.+.+.+ T Consensus 278 eLSvRs~NCLKra~I~tvgdLv~~se~dLl~ikNfGkKSl~EI~ekL~e 326 (333) T CHL00013 278 ELSVRAYNCLKRANIHTLLDLLNYSQEDLLKIKNFGQKSVKQVLDALEK 326 (333) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 1878877436373893799986389999941899863029999999988 No 77 >pfam00392 GntR Bacterial regulatory proteins, gntR family. This family of regulatory proteins consists of the N-terminal HTH region of GntR-like bacterial transcription factors. At the C-terminus there is usually an effector-binding/oligomerisation domain. The GntR-like proteins include the following sub-families: MocR, YtrR, FadR, AraR, HutC and PlmA, DevA, DasR. Many of these proteins have been shown experimentally to be autoregulatory, enabling the prediction of operator sites and the discovery of cis/trans relationships. The DasR regulator has been shown to be a global regulator of primary metabolism and development in Streptomyces coelicolor. Probab=23.79 E-value=46 Score=13.24 Aligned_cols=58 Identities=16% Similarity=0.291 Sum_probs=34.7 Q ss_pred CCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 2078898730486445689999989999861678779999999999719945997268997 Q gi|254781017|r 220 LSQISCQALRNKNLKVLAQAMNRQQGLLETLGVSDSKLSEIVWKLREQPHIMASKISGSGL 280 (324) Q Consensus 220 i~~~~~~al~~~d~~~l~~lm~~~~~~L~~lgvs~~~l~~li~~~~~~~g~~~aklsGAG~ 280 (324) +.+...+.+.++.+. -|+.+-.-+.+-+.++||-..+.+-++.+... |. -.+..|.|. T Consensus 5 i~~~i~~~I~~g~~~-~G~~LPs~~~La~~~~vSr~tvr~Al~~L~~~-G~-i~~~~g~G~ 62 (64) T pfam00392 5 VYARLREDILSGRLR-PGDKLPSERELAAEFGVSRTTVREALRRLEAE-GL-VERRPGRGT 62 (64) T ss_pred HHHHHHHHHHCCCCC-CCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHC-CC-EEEECCCEE T ss_conf 999999999849999-92998479999999796999999999999988-93-899779536 No 78 >TIGR02068 cya_phycin_syn cyanophycin synthetase; InterPro: IPR011810 Cyanophycin is an insoluble storage polymer for carbon, nitrogen, and energy, found in most Cyanobacteria. The polymer has a backbone of L-aspartic acid, with most Asp side chain carboxyl groups attached to L-arginine. The polymer is made by this enzyme, cyanophycin synthetase, and degraded by cyanophycinase. Heterologously expressed cyanophycin synthetase in Escherichia coli produces a closely related, water-soluble polymer with some Arg replaced by Lys. It is unclear whether enzymes that produce soluble cyanophycin-like polymers in vivo in non-Cyanobacterial species should be designated as cyanophycin synthetase itself or as a related enzyme. Cyanophycin synthesis is analogous to polyhydroxyalkanoic acid (PHA) biosynthesis, except that PHA polymers lack nitrogen and may be made under nitrogen-limiting conditions .; GO: 0005524 ATP binding, 0016874 ligase activity, 0009059 macromolecule biosynthetic process. Probab=23.53 E-value=47 Score=13.21 Aligned_cols=29 Identities=14% Similarity=0.179 Sum_probs=13.8 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCHHH Q ss_conf 88740245665567776404566660145 Q gi|254781017|r 103 LGSSAAITVAITAALLTLQYHKEPSPDEI 131 (324) Q Consensus 103 LGSSaa~~va~~~al~~~~~~~~~~~~~i 131 (324) +|.-.++.++=+.|.....|..+.+-+.| T Consensus 687 ~GG~~~F~ieN~LAA~aAA~~~~v~ie~I 715 (876) T TIGR02068 687 MGGRVAFQIENALAAVAAAWALGVDIELI 715 (876) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 57715788889999999985348989999 No 79 >COG3424 BcsA Predicted naringenin-chalcone synthase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=23.14 E-value=48 Score=13.16 Aligned_cols=49 Identities=18% Similarity=0.251 Sum_probs=33.0 Q ss_pred HHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEECCCC Q ss_conf 899999899998616-78779999999999719945997268997548999846653 Q gi|254781017|r 237 AQAMNRQQGLLETLG-VSDSKLSEIVWKLREQPHIMASKISGSGLGDCVIALGKGDL 292 (324) Q Consensus 237 ~~lm~~~~~~L~~lg-vs~~~l~~li~~~~~~~g~~~aklsGAG~GG~~i~l~~~~~ 292 (324) -+.+..+|+.|+++| .|++.+-.+++...++. . =+|.-||++|+.+.=. T Consensus 295 ~e~l~~s~~~L~~~GNMSSatvLfVL~d~l~~~----~---~~g~~Gl~~A~GPGf~ 344 (356) T COG3424 295 PEALELSWDVLREYGNMSSATVLFVLEDTLQKA----P---KSGSRGLMFAMGPGFC 344 (356) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC----C---CCCCCEEEEECCCCCE T ss_conf 889999999999827865210299999999736----8---8776303453278632 No 80 >cd05567 PTS_IIB_mannitol PTS_IIB_mannitol: subunit IIB of enzyme II (EII) of the mannitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a mannitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIA, IIB, and IIC domains are expressed from the mtlA gene as a single protein, also known as the mannitol PTS permease, the mtl transporter, or MtlA. MtlA is only functional as a dimer with the dimer contacts occuring between the IIC domains. MtlA takes up exogenous mannitol releasing the phosphate ester into the cytoplasm in preparation for oxidation to fructose-6-phosphate by the NAD-dependent mannitol-P dehydrogenase (MtlD). The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include mannitol, chitobiose/lichenan, ascorbate, lactose, galactitol, fructose, and a s Probab=22.99 E-value=38 Score=13.78 Aligned_cols=10 Identities=50% Similarity=0.577 Sum_probs=6.2 Q ss_pred CCCCCCCCHH Q ss_conf 3548874024 Q gi|254781017|r 100 QLGLGSSAAI 109 (324) Q Consensus 100 g~GLGSSaa~ 109 (324) .+|+|||+-. T Consensus 8 DAGmGSSaMg 17 (87) T cd05567 8 DAGMGSSAMG 17 (87) T ss_pred CCCCCHHHHH T ss_conf 8998589999 No 81 >PRK06720 hypothetical protein; Provisional Probab=22.82 E-value=48 Score=13.12 Aligned_cols=15 Identities=20% Similarity=0.350 Sum_probs=6.1 Q ss_pred HHCCCCEEEECCCCC Q ss_conf 971994599726899 Q gi|254781017|r 265 REQPHIMASKISGSG 279 (324) Q Consensus 265 ~~~~g~~~aklsGAG 279 (324) +++.|++=---|-+| T Consensus 139 kq~~G~IIN~aSi~G 153 (169) T PRK06720 139 KQQEEVVLSDLPIFG 153 (169) T ss_pred HCCCCEEEEECCCCC T ss_conf 759978999887151 No 82 >PRK11199 tyrA bifunctional chorismate mutase/prephenate dehydrogenase; Provisional Probab=21.78 E-value=51 Score=12.99 Aligned_cols=63 Identities=6% Similarity=0.051 Sum_probs=38.4 Q ss_pred HHHHCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHH Q ss_conf 88608234899999998403320788987304864456899999899998616-78779999999999 Q gi|254781017|r 199 IEIEYPEINEINQKIYALMGKLSQISCQALRNKNLKVLAQAMNRQQGLLETLG-VSDSKLSEIVWKLR 265 (324) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~lm~~~~~~L~~lg-vs~~~l~~li~~~~ 265 (324) +...+|...+..+.+.+. ..+..+.++++|.+.|.+.|.+.+..+.++. -.-.+-+.|++.+. T Consensus 306 I~~~n~~~~~vi~~~~~~----~~~l~~lv~~~Dk~~Fi~~F~~v~~~Fgd~~~~~l~eS~~ll~~~~ 369 (374) T PRK11199 306 IIMSSEENLALIKRYYQR----FGEALELLEQGDKQAFIDSFEKVEHWFGDYAEQFLKESRSLLQQAN 369 (374) T ss_pred HHHCCHHHHHHHHHHHHH----HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 998194479999999999----9999999985899999999999999999878999999999999865 No 83 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=21.30 E-value=52 Score=12.93 Aligned_cols=42 Identities=24% Similarity=0.196 Sum_probs=31.8 Q ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 9975489998466532689999999999779789994145456 Q gi|254781017|r 278 SGLGDCVIALGKGDLNSLPYQSVNCHMHAKGIDIVPITPSHST 320 (324) Q Consensus 278 AG~GG~~i~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 320 (324) .+.||+++++.-.....+.++.+ +...+.|..++.++..... T Consensus 186 l~~~Dv~i~iS~sG~t~e~i~~~-~~Ak~~Ga~vI~IT~~~~S 227 (293) T PRK11337 186 LQEGDVVLVVSHSGRTSDVKAAV-ELAKQNGAKIICITHSYHS 227 (293) T ss_pred CCCCCEEEEEECCCCCHHHHHHH-HHHHHCCCEEEEECCCCCC T ss_conf 89998899981899988999999-9999879949997699998 No 84 >COG0041 PurE Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism] Probab=21.02 E-value=52 Score=12.89 Aligned_cols=16 Identities=13% Similarity=0.306 Sum_probs=10.5 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 6601455788887765 Q gi|254781017|r 126 PSPDEILTTAHAIVLK 141 (324) Q Consensus 126 ~~~~~i~~lA~~~E~~ 141 (324) -+++.+.+.+..+|.. T Consensus 41 RTPe~m~~ya~~a~~~ 56 (162) T COG0041 41 RTPEKMFEYAEEAEER 56 (162) T ss_pred CCHHHHHHHHHHHHHC T ss_conf 7989999999999878 Done!