Query gi|254781018|ref|YP_003065431.1| GHMP kinase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 337 No_of_seqs 143 out of 3344 Neff 9.3 Searched_HMMs 39220 Date Mon May 30 03:44:36 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781018.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK03926 mevalonate kinase; Pr 100.0 0 0 368.1 34.3 288 11-336 3-299 (302) 2 TIGR00549 mevalon_kin mevalona 100.0 0 0 368.6 26.4 285 15-306 1-344 (345) 3 PTZ00298 mevalonate kinase; Pr 100.0 0 0 357.3 30.9 288 1-313 1-298 (328) 4 COG1577 ERG12 Mevalonate kinas 100.0 0 0 354.3 26.1 296 12-335 2-306 (307) 5 PRK05322 galactokinase; Provis 100.0 0 0 340.4 29.1 318 2-336 11-386 (387) 6 PRK05101 galactokinase; Provis 100.0 0 0 330.8 28.4 300 2-318 12-354 (382) 7 KOG1511 consensus 100.0 0 0 328.7 29.8 324 11-337 5-380 (397) 8 PRK03817 galactokinase; Provis 100.0 0 0 318.8 33.0 300 11-336 1-349 (351) 9 PRK00555 galactokinase; Provis 100.0 0 0 318.6 30.8 288 11-318 3-334 (363) 10 COG0153 GalK Galactokinase [Ca 100.0 0 0 312.2 30.3 300 3-319 15-362 (390) 11 TIGR00131 gal_kin galactokinas 100.0 1.4E-45 0 306.0 20.7 315 3-326 10-475 (500) 12 PTZ00290 galactokinase; Provis 100.0 5.5E-44 0 295.9 27.9 316 2-336 23-455 (468) 13 TIGR01220 Pmev_kin_Gr_pos phos 100.0 3.1E-43 0 291.2 23.3 293 12-312 2-340 (367) 14 COG2605 Predicted kinase relat 100.0 3.6E-37 9.3E-42 252.9 26.9 296 11-336 2-329 (333) 15 PRK13412 fkp bifunctional fuco 100.0 1.4E-31 3.5E-36 217.8 25.2 322 10-337 609-972 (974) 16 KOG0631 consensus 100.0 1.2E-29 3.1E-34 205.5 24.0 303 3-316 32-450 (489) 17 PRK01212 homoserine kinase; Pr 99.9 1.9E-23 4.7E-28 166.5 21.4 277 11-336 4-293 (293) 18 COG0083 ThrB Homoserine kinase 99.9 1.6E-20 4E-25 148.1 26.4 264 30-335 18-295 (299) 19 PRK03188 4-diphosphocytidyl-2- 99.9 3.8E-21 9.7E-26 152.0 22.6 280 11-336 1-295 (299) 20 TIGR00191 thrB homoserine kina 99.9 4.2E-21 1.1E-25 151.7 20.9 215 112-336 85-356 (359) 21 PRK00128 ipk 4-diphosphocytidy 99.9 2.3E-20 5.9E-25 147.1 23.1 239 12-298 2-249 (281) 22 PRK01123 shikimate kinase; Pro 99.9 6E-20 1.5E-24 144.4 24.9 256 30-336 19-281 (283) 23 PRK04344 consensus 99.9 2.6E-20 6.5E-25 146.8 21.6 241 10-298 2-251 (284) 24 PRK02534 4-diphosphocytidyl-2- 99.9 3.3E-19 8.3E-24 139.8 26.6 278 10-336 3-306 (308) 25 PTZ00299 homoserine kinase; Pr 99.9 2.4E-20 6E-25 147.0 18.6 281 11-336 8-321 (336) 26 COG3890 ERG8 Phosphomevalonate 99.8 1.1E-17 2.9E-22 130.1 26.6 295 11-336 4-332 (337) 27 PRK03009 consensus 99.8 4.6E-17 1.2E-21 126.2 23.7 233 13-298 5-247 (287) 28 PRK01726 consensus 99.8 4.1E-17 1E-21 126.6 22.6 233 6-297 17-255 (312) 29 TIGR01219 Pmev_kin_ERG8 phosph 99.8 5.5E-18 1.4E-22 132.1 16.6 321 12-336 1-492 (499) 30 PRK00343 ipk 4-diphosphocytidy 99.8 5.1E-17 1.3E-21 126.0 21.2 254 11-328 3-265 (279) 31 PRK04943 consensus 99.8 3.8E-17 9.8E-22 126.8 20.5 229 10-297 5-239 (288) 32 PRK03288 consensus 99.8 7.7E-17 2E-21 124.8 21.1 231 14-297 6-242 (283) 33 KOG4519 consensus 99.8 8.5E-16 2.2E-20 118.3 25.2 319 11-336 3-449 (459) 34 COG1947 IspE 4-diphosphocytidy 99.8 5.7E-16 1.5E-20 119.4 21.2 237 11-297 4-250 (289) 35 KOG4644 consensus 99.8 5E-16 1.3E-20 119.7 20.8 325 6-336 566-942 (948) 36 PRK04648 consensus 99.8 2.8E-16 7.2E-21 121.3 19.2 232 10-298 13-251 (295) 37 PRK00650 4-diphosphocytidyl-2- 99.8 2.9E-16 7.3E-21 121.3 18.7 222 12-297 2-234 (288) 38 TIGR00154 ispE 4-diphosphocyti 99.7 5E-16 1.3E-20 119.7 12.7 264 11-311 4-289 (322) 39 PRK05905 hypothetical protein; 99.7 1.8E-14 4.6E-19 109.9 19.7 235 13-298 3-248 (258) 40 TIGR01920 Shik_kin_archae shik 99.7 1.6E-13 4E-18 104.0 22.6 234 32-318 16-260 (273) 41 PRK04181 4-diphosphocytidyl-2- 99.7 9.3E-14 2.4E-18 105.4 19.9 180 14-227 2-188 (254) 42 COG1685 Archaeal shikimate kin 99.6 1.3E-12 3.4E-17 98.2 25.6 266 11-336 3-275 (278) 43 COG3407 MVD1 Mevalonate pyroph 99.4 1.2E-09 2.9E-14 79.6 24.4 200 112-326 87-311 (329) 44 COG4542 PduX Protein involved 99.3 9.1E-11 2.3E-15 86.6 14.5 246 9-309 12-264 (293) 45 pfam10509 GalKase_gal_bdg Gala 99.3 2E-12 5.1E-17 97.1 5.7 48 2-49 5-52 (52) 46 COG1829 Predicted archaeal kin 99.3 1.4E-08 3.6E-13 72.8 22.6 262 11-333 3-277 (283) 47 pfam00288 GHMP_kinases_N GHMP 99.2 3.4E-11 8.6E-16 89.3 6.3 67 117-185 1-67 (67) 48 KOG2833 consensus 99.1 2.5E-07 6.4E-12 64.9 24.0 272 10-315 7-320 (395) 49 COG1907 Predicted archaeal sug 99.0 1.1E-06 2.9E-11 60.8 21.6 272 11-337 2-311 (312) 50 TIGR01240 mevDPdecarb diphosph 98.9 1.7E-06 4.3E-11 59.7 20.1 190 115-319 100-319 (325) 51 KOG1537 consensus 98.6 4.6E-07 1.2E-11 63.3 10.5 215 110-336 89-354 (355) 52 pfam08544 GHMP_kinases_C GHMP 98.2 6.1E-06 1.5E-10 56.2 6.4 61 244-308 2-63 (82) 53 TIGR00144 beta_RFAP_syn beta-r 95.4 0.19 5E-06 27.8 15.5 178 113-297 84-293 (337) 54 PRK05182 DNA-directed RNA poly 70.7 7.1 0.00018 18.0 4.3 51 238-288 251-302 (306) 55 TIGR00738 rrf2_super rrf2 fami 68.6 7.3 0.00019 17.9 3.7 50 264-319 30-79 (133) 56 TIGR02027 rpoA DNA-directed RN 63.2 9.9 0.00025 17.1 5.2 51 238-288 267-319 (324) 57 TIGR01821 5aminolev_synth 5-am 52.2 14 0.00035 16.2 2.7 15 274-288 195-209 (427) 58 PRK00068 hypothetical protein; 51.3 16 0.0004 15.8 5.0 21 20-40 427-448 (978) 59 PRK08507 prephenate dehydrogen 51.0 16 0.0004 15.8 4.9 37 233-269 239-275 (275) 60 CHL00013 rpoA RNA polymerase a 49.3 17 0.00043 15.6 4.0 50 238-287 278-329 (333) 61 PRK07417 arogenate dehydrogena 43.2 21 0.00053 15.0 6.2 38 232-269 241-278 (280) 62 PRK07502 cyclohexadienyl dehyd 41.9 22 0.00056 14.9 5.0 37 233-269 252-288 (307) 63 COG1959 Predicted transcriptio 35.6 27 0.0007 14.3 3.5 17 272-288 38-54 (150) 64 COG0041 PurE Phosphoribosylcar 34.5 29 0.00073 14.2 5.4 12 155-166 43-54 (162) 65 pfam03118 RNA_pol_A_CTD Bacter 33.0 30 0.00077 14.0 3.9 47 238-284 14-61 (62) 66 pfam00392 GntR Bacterial regul 30.2 34 0.00086 13.7 4.9 59 237-296 3-61 (64) 67 TIGR01825 gly_Cac_T_rel pyrido 27.1 38 0.00097 13.4 1.8 15 129-143 64-78 (392) 68 TIGR01162 purE phosphoribosyla 26.8 38 0.00098 13.4 5.4 41 258-298 13-65 (159) 69 COG1117 PstB ABC-type phosphat 23.3 23 0.00059 14.8 0.0 36 271-323 183-218 (253) 70 PRK07012 consensus 23.1 45 0.0011 12.9 9.2 135 171-311 363-525 (1173) 71 TIGR01959 nuoF_fam NADH-quinon 23.1 27 0.0007 14.3 0.4 24 167-190 148-173 (420) 72 pfam02082 Rrf2 Transcriptional 23.0 45 0.0012 12.9 2.5 43 264-312 29-71 (82) 73 smart00345 HTH_GNTR helix_turn 22.1 47 0.0012 12.8 4.2 56 240-296 2-57 (60) 74 TIGR03108 eps_aminotran_1 exos 21.6 48 0.0012 12.7 7.1 58 2-59 139-213 (628) 75 TIGR02313 HpaI-NOT-DapA 2,4-di 21.6 48 0.0012 12.7 5.0 49 240-288 215-291 (294) 76 PRK09074 consensus 21.5 49 0.0012 12.7 7.0 135 171-311 370-532 (1149) 77 cd05567 PTS_IIB_mannitol PTS_I 20.9 29 0.00073 14.2 0.1 16 127-143 8-23 (87) 78 PRK06326 consensus 20.6 51 0.0013 12.6 5.9 135 171-311 411-570 (1240) 79 TIGR01405 polC_Gram_pos DNA po 20.4 50 0.0013 12.6 1.2 13 155-167 716-728 (1264) 80 pfam07862 Nif11 Nitrogen fixat 20.2 52 0.0013 12.5 2.3 37 251-288 5-41 (49) No 1 >PRK03926 mevalonate kinase; Provisional Probab=100.00 E-value=0 Score=368.08 Aligned_cols=288 Identities=23% Similarity=0.279 Sum_probs=231.1 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) +.++|||||+|+|||+||||++||++|||+++++.++++++.. .+.++ .. . ...+. T Consensus 3 i~asAPGrv~LiGEH~v~~G~~aia~AI~~~~~v~~~~~~d~i-~~~~~-~~------------~-~~~~~--------- 58 (302) T PRK03926 3 VLCSAPGKIYLFGEHAVVYGKPAIACAIDLRTRVRAELNDDSI-YIQSD-YG------------K-TGEVH--------- 58 (302) T ss_pred EEEEECEEEEEEECCEEECCCEEEEEEECCCEEEEEEECCCEE-EEECC-CC------------C-CCCHH--------- T ss_conf 9998078999993876986960899993781999999989906-99754-77------------6-54200--------- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 34777888999998751156678876999980554103898530378999999999818775402589999998641157 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQH 170 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~ 170 (337) .+....+.... . .....|+++.++|++|+++|||||||++||++.+++.+++.+++ .++++++++.+|+.++ T Consensus 59 ----~~~~~~l~~~~-~--~~~~~G~~i~I~S~iP~gaGLgSSAAl~va~~~al~~~~g~~ls-~~~la~la~~~E~~~~ 130 (302) T PRK03926 59 ----PYVRAAIEKVR-E--EADKDGVDLSITSQIPVGSGLGSSAAVTVATIGALNKLLGLGLS-LEEIAKMGHEVELLVQ 130 (302) T ss_pred ----HHHHHHHHHHH-H--HCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHHHC T ss_conf ----79999999999-7--56899837999656887778763899999999999985489989-9999999999998746 Q ss_pred CCCCCHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCC-----HHHHHHHHHHHHHHH Q ss_conf 787734578765326302402785200147884399974971453799999999764120-----689998765489999 Q gi|254781018|r 171 GKTGIIDSTTIVMGGIIYINPPQIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQS-----SIWSEFNAVTNEMME 245 (337) Q Consensus 171 ~~~sg~D~~~~~~gg~~~~~~~~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~ 245 (337) |.+||+||++|.+||++.|+..+.. +.. ...+++..++...+|.+.+..+++...+. +.++++.++++++.+ T Consensus 131 G~~sGiD~~~s~~Gg~~~~~~~~~l--~~~-~~~l~l~~tg~~~~t~~~v~~v~~~~~~~~~~~~~~~~~~~~i~~~a~~ 207 (302) T PRK03926 131 GAASPTDTYVSTIGGFVTIPDRKKL--PFM-DCGIVVGYTGSSGSTKELVANVRKLYEEYPELIEPILESIGKLSEKAEE 207 (302) T ss_pred CCCCHHHHHHHHCCCEEEECCCCCC--CCC-CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8997889999975998996678977--778-9788999839746599999999999876699999999999999999998 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHHHHH---HHCC Q ss_conf 9983872168999999599999718997799999999985798-08950788678850899998579979999---7459 Q gi|254781018|r 246 NIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLS---SLYG 321 (337) Q Consensus 246 al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~---~~~~ 321 (337) +|+++|++.++++|++||.++++++||+|+||+|++.+++.|+ |+|||||| ||+|+++|++.+..+++. +..+ T Consensus 208 ~l~~~d~~~lG~Lm~~~h~ll~~lgVS~peld~lv~~a~~~GalGAKltGaG---~GGc~iaL~~~d~~~~v~~al~~ag 284 (302) T PRK03926 208 LVLSGDYVSIGELMNINQGLLDALGVSTKELSELVYAARTAGAFGAKITGAG---GGGCMVALTAPDKCREVATAIGIAG 284 (302) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCEEEECCCC---CCCEEEEEECCCCHHHHHHHHHHCC T ss_conf 7765649999999999688787316898899999999996798288805898---7628999946131999999999769 Q ss_pred CEEEEEEECCCCEEE Q ss_conf 869999987997092 Q gi|254781018|r 322 YTCYKIKEDKNGTML 336 (337) Q Consensus 322 ~~~~~~~~~~~G~~i 336 (337) ++.+.+++..+|++| T Consensus 285 ~~~~~~~~~~~g~~~ 299 (302) T PRK03926 285 GKPMITKPTEEGLRI 299 (302) T ss_pred CEEEEEEECCCCEEE T ss_conf 968995677982375 No 2 >TIGR00549 mevalon_kin mevalonate kinase; InterPro: IPR006205 Mevalonate kinase (2.7.1.36 from EC) is well-characterised among the Eukaryotes, where it plays a role in the synthesis of isopentanyl pyrophosphate, a common intermediate for a number of pathways including cholesterol biosynthesis. It is also involved in mevalonate catabolism. Close homologs are found in the Archaea. A bacterial example is found in the Lyme disease spirochete Borrelia burgdorferi, but is not necessarily the same biosynthetic enzyme.; GO: 0004496 mevalonate kinase activity, 0005524 ATP binding, 0008299 isoprenoid biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=368.56 Aligned_cols=285 Identities=25% Similarity=0.306 Sum_probs=217.3 Q ss_pred CCCEEEEECCCEEECCCEEEEEEECCCEEEEEEE-------------------------CCCCE-EEE---EECCCCCCC Q ss_conf 5817999744544079548998766727999998-------------------------48982-999---955766675 Q gi|254781018|r 15 APAKVILSGEYSSLYGASALAVAITFYLRALLTT-------------------------IEPSL-IRI---INSQTIEYS 65 (337) Q Consensus 15 APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~-------------------------~~~~~-~~~---~s~~~~~~~ 65 (337) |||||+|+|||+||||.|||++||++++++++.. ..+.. ... ......... T Consensus 1 ApgK~IL~GEHAVVYG~PAiA~~i~~~~~~~~~~~~S~~~~~Le~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (345) T TIGR00549 1 APGKIILFGEHAVVYGEPAIAAAIDLRTTVTVIELDSAVAFRLEALAKDIILTLEDLARNSDSILYSLSSKIESELGRGS 80 (345) T ss_pred CCCEEEEECCCEEECCCCCEEEEECCEEEEEEEECCCHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCC T ss_conf 97405762360011278504666336068998636732456665405644443387620254034442012110015676 Q ss_pred CCCCCCCCCCCCCC--CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC---CEEEEEECCCCCCCCCCCCHHHHHHH Q ss_conf 32100123333433--21111122103347778889999987511566788---76999980554103898530378999 Q gi|254781018|r 66 FEACRLLGHQIDQR--HRDFIAKKIPITSVLTHPNDLILYIFNRHLPHISC---GISLDLHSTIPIGSGFGSSSAIISAL 140 (337) Q Consensus 66 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---g~~i~i~s~iP~~~GLGSSaA~~va~ 140 (337) ..+.....+..-.. ..+...........+.|...+ .+.+....... |+++.|+|+||+++|||||||++||+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~---~~~~~~~~~~~~~~p~~~~i~S~iP~g~GLGSSAAv~vA~ 157 (345) T TIGR00549 81 LDDAPQELDGLVSELVEPADKQLTPSLSESLSYIAAA---ALSKLSELNEPPKTPLEIEIESEIPPGRGLGSSAAVAVAL 157 (345) T ss_pred CCCHHHHHCCHHHHCCCCCCCCCHHHHHHHHHHHHHH---HHHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 4401376512244316655653038999999999999---9999750267554872799850536655563479999999 Q ss_pred HHHHHHHHC---------CCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEECC---CCCCCCCCCCCEEEEE Q ss_conf 999999818---------7754025899999986411577877345787653263024027---8520014788439997 Q gi|254781018|r 141 SLALSSITH---------QPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYINPP---QIIKNETLVGEWWAIN 208 (337) Q Consensus 141 ~~al~~~~~---------~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~---~~~~~~~~~~~~~~i~ 208 (337) +.+++.+++ .+++ .+++.++|..+|...||++||+|.+++.||++++|++. +.......+...++|. T Consensus 158 ~~al~~~~g~~~n~~s~L~~~~-~~~~~~~a~~~E~~~HG~pSG~D~~~~~~g~~~~f~k~~~~~~~~~~~~~~~~~~i~ 236 (345) T TIGR00549 158 IRALADYFGIRSNDLSKLSELS-KEELAKLANEAEKIVHGKPSGIDTATSTYGGPVLFEKGEGVEETKLISLDALYLVIA 236 (345) T ss_pred HHHHHHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHCCCCCCCCCEEEECCCEEEEECCCCCCEEEECCCCCCEEEEE T ss_conf 9999987044668621024689-889999998877651799853122123448668987688873144027899608998 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCH-----HHHHHHHHHHHHHHHHHCCC-------CHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 49714537999999997641206-----89998765489999998387-------2168999999599999718997799 Q gi|254781018|r 209 TGMPESSTGECVSFVEQHFSQSS-----IWSEFNAVTNEMMENIQKKD-------AEKTYKSVRVNQSLLESIGVVPETV 276 (337) Q Consensus 209 ~~~~~~st~~~~~~~~~~~~~~~-----~~~~~~~i~~~~~~al~~~d-------~~~l~~~~~~~~~ll~~l~vs~~~l 276 (337) .++...+|++++..|++..++.+ .++++++++.++.++|+++| .+.++++|++||.+|+.||||+|+| T Consensus 237 dtg~~~~T~~~V~~V~~l~~~~p~~~~~~~~~~g~l~~~a~~a~~~~d~~~~~ek~~~lg~lm~~nq~lL~~lGVS~~~l 316 (345) T TIGR00549 237 DTGVSGSTKEAVAKVRQLLERFPELIDAIMDAIGELTKEAKAALEDGDEVEDSEKVESLGELMNINQGLLKALGVSTPKL 316 (345) T ss_pred ECCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 34877748999999999875171689999999999999999999718831035589999999999998888707985799 Q ss_pred HHHHHHHHHCCC-EEEECCCCCCCCCEEEEE Q ss_conf 999999985798-089507886788508999 Q gi|254781018|r 277 GKFIRCIENNGG-SAKIAGAGSIRGEKAGLV 306 (337) Q Consensus 277 ~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~ 306 (337) |+|+..+++.|+ |+|+|||| ||+|+++ T Consensus 317 d~l~~~~~~~GAlGaKlTGaG---~GGc~ia 344 (345) T TIGR00549 317 DQLVELARKAGALGAKLTGAG---GGGCMIA 344 (345) T ss_pred HHHHHHHHHCCCCEEEECCCC---CCCEEEE T ss_conf 999999986797463101577---7653653 No 3 >PTZ00298 mevalonate kinase; Provisional Probab=100.00 E-value=0 Score=357.34 Aligned_cols=288 Identities=20% Similarity=0.227 Sum_probs=225.9 Q ss_pred CCCCCCCCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 95435799608998581799974454407954899876672799999848982999955766675321001233334332 Q gi|254781018|r 1 MKQQTSQTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRH 80 (337) Q Consensus 1 ~~~~~~~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~ 80 (337) ||+.--++-...-.|.|||+|||||+||||+|||++|||+|+++.++..++............ .. T Consensus 1 ~~~~~~~~~~~~~~a~GKvILfGEHaVVyG~pAIA~aI~~~~~~~i~~~~~~~~i~~~~~~~~-------~~-------- 65 (328) T PTZ00298 1 MRKAVKEKTTGKHIGYGKVILFGEHFVVYGAEAIVAGIDEYTECRLELTKGVPGLQVVDQRPA-------VP-------- 65 (328) T ss_pred CCCCHHHCCCCCCEEEEEEEEEECCEECCCCCEEEEECCCCEEEEEEECCCCCCCEEECCCCC-------CC-------- T ss_conf 973222003246247089999836831019735998834866999997579997166237855-------64-------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 11111221033477788899999875115667887699998055410389853037899999999981877540258999 Q gi|254781018|r 81 RDFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIA 160 (337) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~ 160 (337) .+.... ..........+... ........|+++.+.|+||+++|||||||++||++++++.+++.+++ .+++.+ T Consensus 66 -~~~~~~---~~~~~~~~~~v~~~--l~~~~~~~g~~i~i~S~iP~gaGLGSSAAv~VA~~~Al~~~~~~~ls-~~~i~~ 138 (328) T PTZ00298 66 -GYIVEK---REEQRKAHQLVLRH--LNIDTSVDGLKMHLGGPLVPSSGIGASASDVVSLSRALSELYQLNLT-EEEVNL 138 (328) T ss_pred -CHHHHC---CHHHHHHHHHHHHH--HCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHHHHH T ss_conf -123321---26677789999998--54666677538997058886567350688999999999998479999-999999 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHCCCEEEECCCCC----CCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCC-----H Q ss_conf 99986411577877345787653263024027852----00147884399974971453799999999764120-----6 Q gi|254781018|r 161 ETGYIERLQHGKTGIIDSTTIVMGGIIYINPPQII----KNETLVGEWWAINTGMPESSTGECVSFVEQHFSQS-----S 231 (337) Q Consensus 161 ~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~-----~ 231 (337) +++.+|+.++|++||+|++++.|||++.|++.+.. .+....+.++++..++...+|.+++..+++..++. . T Consensus 139 ~A~~~E~~~hG~pSGiD~~~s~~Gg~i~f~~~~~~~~~~~i~~~~~~~lvi~dtg~~~~T~~~V~~V~~~~~~~p~~~~~ 218 (328) T PTZ00298 139 SAFVGEGGYHGTPSGADNTAATYGGLISYRRVNGKSVFKRIAFQQPLYLVVCSTGITASTTKVVGDVRKLKENQPTWFNR 218 (328) T ss_pred HHHHHHHHHCCCCCHHHHHHHHCCCEEEEEECCCCCCCEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCHHHHHH T ss_conf 99999998558997276999945987999854886442023678970699983698653699999999998658699999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEEC Q ss_conf 899987654899999983872168999999599999718997799999999985798-0895078867885089999857 Q gi|254781018|r 232 IWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGG 310 (337) Q Consensus 232 ~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~ 310 (337) .++++++++.++.++|+++|++.++++|++||.+|++|+||+|+||.|++.+++.|+ |+|||||| ||+|++.|+++ T Consensus 219 i~~~~~~iv~~a~~aL~~~d~~~lG~Lm~~nh~lL~~lgVS~p~Ld~lv~~a~~~GAlGAKLTGAG---~GGcvIaL~~~ 295 (328) T PTZ00298 219 LLENYNACVSEAKEALQKGNLFRVGELMNANHDLCQKLTVSCRELDSIVQTCRTYGALGAKMSGTG---RGGLVVALAAS 295 (328) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCEEEECCCC---CCCEEEEECCC T ss_conf 999999999999999864899999999999999897468897899999999997699655751468---87589998386 Q ss_pred CCH Q ss_conf 997 Q gi|254781018|r 311 YNP 313 (337) Q Consensus 311 ~~~ 313 (337) .+. T Consensus 296 ~~~ 298 (328) T PTZ00298 296 EDQ 298 (328) T ss_pred HHH T ss_conf 666 No 4 >COG1577 ERG12 Mevalonate kinase [Lipid metabolism] Probab=100.00 E-value=0 Score=354.29 Aligned_cols=296 Identities=25% Similarity=0.280 Sum_probs=234.3 Q ss_pred EEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 99858179997445440795489987667279999984898299995576667532100123333433211111221033 Q gi|254781018|r 12 HSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPIT 91 (337) Q Consensus 12 ~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (337) .++||||++|||||+|+||.|||++||+++.+++++.++++.+.+.+...... .. ..+.. . T Consensus 2 ~~~aPgKliL~GEHAVVyG~pAI~~aI~~~~~v~~~~s~~~~~~i~~~~~~~~-----~~---~~~~~---~-------- 62 (307) T COG1577 2 SVSAPGKLILFGEHAVVYGYPAIAAAIDLRVTVTISESDSNKIVIESSDLKSS-----TL---ERDED---E-------- 62 (307) T ss_pred CCCCCCCEEEEECCEEEECCCHHHEEEEEEEEEEEEECCCCCEEEECCCCCCC-----CC---CCCCC---C-------- T ss_conf 32235308998113455187315135010089999966888579841677765-----54---56663---2-------- Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 47778889999987511566788769999805541038985303789999999998187754025899999986411577 Q gi|254781018|r 92 SVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHG 171 (337) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~ 171 (337) .+....+..+..........|++++|.|++|.++|||||||++|+++.+++.+++..++ +++++++++..|..+|| T Consensus 63 ---~~~~~~v~~~~e~~~~~~~~~~~l~I~S~iP~g~GLGSSAAVsva~i~al~~~~g~~ls-~~~l~~la~~~e~~vqG 138 (307) T COG1577 63 ---GYIQAAVRLASELLNQSSLKPFSLEIDSEIPIGAGLGSSAAVSVAVIKALSAYFGVELS-PEELAKLANKVELIVQG 138 (307) T ss_pred ---HHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHHHCC T ss_conf ---48999999999873325789847999627998878627999999999999986289889-89999999999998758 Q ss_pred CCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCC-----CHHHHHHHHHHHHHHH Q ss_conf 877345787653263024027-8520014788439997497145379999999976412-----0689998765489999 Q gi|254781018|r 172 KTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQ-----SSIWSEFNAVTNEMME 245 (337) Q Consensus 172 ~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~-----~~~~~~~~~i~~~~~~ 245 (337) ++||+|.++|.|||++.|+.. +..+++......++|...+...+|.+++..+++..++ .+++++++++++++.. T Consensus 139 ~~Sg~D~a~~~~gg~v~~~~~~~~~~l~~~~~~~~~I~~tg~~~sT~e~V~~V~~l~~~~~~~~~~~~~~ig~~~~~a~~ 218 (307) T COG1577 139 KASGIDIATITYGGLVAFKKGFDFEKLEIELLGTLVIGDTGVPGSTKELVAGVAKLLEEEPEVIDPILDAIGELVQEAEA 218 (307) T ss_pred CCCCCCCEEEEECCEEEEECCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98761403886188899845887520135667718999748866089999999999986467789999999999999999 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHH-HH-HHHCCC Q ss_conf 9983872168999999599999718997799999999985798-08950788678850899998579979-99-974598 Q gi|254781018|r 246 NIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPE-KL-SSLYGY 322 (337) Q Consensus 246 al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~-~~-~~~~~~ 322 (337) +|+++|.+.++++|+.||.+|+.|+||+|+|++|++.+++.|+ |+|+|||| ||+|++.|+...+.. .+ .++.++ T Consensus 219 al~~~d~e~lgelm~~nq~LL~~LgVs~~~L~~lv~~a~~~Ga~gaKlTGAG---gGGc~IaL~~~~~~~~~l~~~~~~~ 295 (307) T COG1577 219 ALQTGDFEELGELMNINQGLLKALGVSTPELDELVEAARSLGALGAKLTGAG---GGGCIIALAKNEEIAETLSNRLEKA 295 (307) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCC---CCCEEEEEECCCHHHHHHHHHHHHC T ss_conf 9861649999999999998898658486899999999986695100214688---8866999835515899999888865 Q ss_pred EEEEEEECCCCEE Q ss_conf 6999998799709 Q gi|254781018|r 323 TCYKIKEDKNGTM 335 (337) Q Consensus 323 ~~~~~~~~~~G~~ 335 (337) .+..+.+ +|++ T Consensus 296 ~i~~~~~--~~v~ 306 (307) T COG1577 296 GIALLWI--EGVR 306 (307) T ss_pred CCCHHCC--CCCC T ss_conf 8630000--4356 No 5 >PRK05322 galactokinase; Provisional Probab=100.00 E-value=0 Score=340.44 Aligned_cols=318 Identities=16% Similarity=0.167 Sum_probs=225.4 Q ss_pred CCCCCCCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 54357996089985817999744544079548998766727999998489829999557666753210012333343321 Q gi|254781018|r 2 KQQTSQTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHR 81 (337) Q Consensus 2 ~~~~~~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 81 (337) ++.|+.+|+..++|||||+|+|||+||.||+|++|||++++++.+.+++++.+++.+..+......... .+.. T Consensus 11 ~~~fg~~p~~~~~APGRVnLiGeHtDY~gg~Vlp~AI~~~~~ia~~~~~d~~i~i~s~~~~~~~~~~~~-----~~~~-- 83 (387) T PRK05322 11 AEVFGEEEDRVFFSPGRINLIGEHTDYNGGHVFPAAITLGTYGAARKRDDKKVRLYSANFEDLGIIEFD-----LDDL-- 83 (387) T ss_pred HHHHCCCCCEEEECCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEECCCCEEEEEECCCCCCCEEEEE-----CCCC-- T ss_conf 998689989899764763656144305998179588003749999988998799998988877549987-----3325-- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 11112210334777888999998751156678876999980554103898530378999999999818775402589999 Q gi|254781018|r 82 DFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAE 161 (337) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~ 161 (337) .......|.+|+...+..+. ......+.||++.|+|+||+++|||||||++|++..+++.+++.+++ +.+++++ T Consensus 84 ----~~~~~~~W~~Yv~Gvi~~l~-~~g~~~~~G~d~~I~s~IP~gaGLSSSAAl~va~~~al~~~~~~~l~-~~ela~l 157 (387) T PRK05322 84 ----SYDKEDDWANYPKGVLKYLQ-EAGYKIDHGFDLYIYGNIPNGAGLSSSASIELLTGVILKDLFNLDLD-RLDLVKL 157 (387) T ss_pred ----CCCCCCCHHHHHHHHHHHHH-HCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHH T ss_conf ----87887666779999999999-73998778738999578888777530889999999999997389999-9999999 Q ss_pred HHHHHHCCCCCCCC-HHHHHHHHCC---CEEEECCCC--CCCCCC--CCCEEEEECCCCCCCH--------HHHHHHHH- Q ss_conf 99864115778773-4578765326---302402785--200147--8843999749714537--------99999999- Q gi|254781018|r 162 TGYIERLQHGKTGI-IDSTTIVMGG---IIYINPPQI--IKNETL--VGEWWAINTGMPESST--------GECVSFVE- 224 (337) Q Consensus 162 a~~~e~~~~~~~sg-~D~~~~~~gg---~~~~~~~~~--~~~~~~--~~~~~~i~~~~~~~st--------~~~~~~~~- 224 (337) ++.+|+.+.|.+|| |||++|++|+ .++++.... ...+.. ...+++++++.++... .++...++ T Consensus 158 a~~aE~~~~G~~~GiMDQ~as~~g~~~~al~iD~~~l~~~~~p~~~~~~~~vi~~s~~~~~l~~s~Yn~R~~ec~~a~~~ 237 (387) T PRK05322 158 GQKVENEFIGVNSGIMDQFAIGMGKKDHAILLDTNTLEYEYVPLDLGDYVIVIMNTNKRRELADSKYNERRAECEKALKE 237 (387) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHCCCCEEEEECCCCCCEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99998887169874234677774257748862278884366125788827999857976434632125689999999999 Q ss_pred --HHH--------------------CCCH------HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHHCCCCCHH Q ss_conf --764--------------------1206------8999876548999999838721689999995999-9971899779 Q gi|254781018|r 225 --QHF--------------------SQSS------IWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSL-LESIGVVPET 275 (337) Q Consensus 225 --~~~--------------------~~~~------~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~l-l~~l~vs~~~ 275 (337) +.. .+.. ++-..++++.++.++|+++|++.+|++|+++|.. +++++||+|+ T Consensus 238 l~~~~~i~~l~~~~~~~~~~~~~~l~~~~~~~Ra~Hvv~En~Rv~~a~~aL~~~d~~~~G~Lm~~sH~sLrd~yevS~~e 317 (387) T PRK05322 238 LQKKLDIKSLGELTEEEFDEYSYLIKDETLLKRARHAVTENQRTLKAVKALKAGDLEKFGRLMNASHVSLRDDYEVTGLE 317 (387) T ss_pred HHHHCCCHHHHHCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 87624942355299999998875425688899999999877999999999866999999999999899999866279888 Q ss_pred HHHHHHHHHHC-CC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHH--------CC--CEEEEEEECCCCEEE Q ss_conf 99999999857-98-0895078867885089999857997999974--------59--869999987997092 Q gi|254781018|r 276 VGKFIRCIENN-GG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL--------YG--YTCYKIKEDKNGTML 336 (337) Q Consensus 276 l~~l~~~~~~~-g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~--------~~--~~~~~~~~~~~G~~i 336 (337) ||.|++.+.+. |+ |+||||+| +|+|.+.|.+....++..+. .+ -..|.++. .+|+++ T Consensus 318 lD~Lv~~a~~~~Ga~GARmtG~G---fGGc~vaLv~~~~~~~~~~~v~~~y~~~~g~~~~~~~~~~-~~Ga~~ 386 (387) T PRK05322 318 LDTLVEAAWKQPGVLGARMTGAG---FGGCAIAIVKKDEVEAFKENVGKAYEEKIGYAASFYVAEI-GDGARE 386 (387) T ss_pred HHHHHHHHHHCCCCEECEEECCC---CCEEEEEEECHHHHHHHHHHHHHHHHHHHCCCCCEEEEEC-CCCCCC T ss_conf 99999998763897377362488---8226999962778999999999999985299987899861-898200 No 6 >PRK05101 galactokinase; Provisional Probab=100.00 E-value=0 Score=330.78 Aligned_cols=300 Identities=15% Similarity=0.148 Sum_probs=215.4 Q ss_pred CCCCCCCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 54357996089985817999744544079548998766727999998489829999557666753210012333343321 Q gi|254781018|r 2 KQQTSQTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHR 81 (337) Q Consensus 2 ~~~~~~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 81 (337) +++|+.+|+++++|||||+|+|||+||+||+|++|||++++++.+++++++.+++++..+.+.... +. .+. T Consensus 12 ~~~fG~~P~~~~~APGRVnliGEHtDY~gG~Vlp~AI~~~~~ia~~~~~d~~~~v~s~~~~~~~~~-~~-----l~~--- 82 (382) T PRK05101 12 AEQFGYPPTHTIQAPGRVNLIGEHTDYNDGFVLPCAINYQTVISCAKRDDRIVRVIAADYDNQQDE-FS-----LDA--- 82 (382) T ss_pred HHHHCCCCCEEEECCCCCCCCCHHHHCCCCCEEEEECCCCEEEEEEECCCCEEEEEECCCCCCEEE-EE-----CCC--- T ss_conf 998689985899776740332032342898578688112579999988998599998888985258-85-----687--- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 11112210334777888999998751156678876999980554103898530378999999999818775402589999 Q gi|254781018|r 82 DFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAE 161 (337) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~ 161 (337) .........|.+|+...+..+... +....|+++.|.|+||+|+||+||||+.|+++.+++.+++.+++ +.++.++ T Consensus 83 --~~~~~~~~~W~~Yv~Gv~~~l~~~--g~~~~G~d~~I~s~IP~GaGLSSSAAl~va~~~al~~l~~~~~~-~~ela~l 157 (382) T PRK05101 83 --PIVPHENYQWANYVRGVVKHLQER--NPDFGGADLVISGNVPQGAGLSSSASLEVAVGQTFQQLYHLPLS-GAEIALN 157 (382) T ss_pred --CCCCCCCCCHHHHHHHHHHHHHHH--CCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC-HHHHHHH T ss_conf --667787766799999999999973--89988738999678888777541789999999999997389989-9999999 Q ss_pred HHHHHHCCCCCCCC-HHHHHHHHCC---CEEEECCC--CCCCCCCCCC-EEEEECCCCCCCHH--------HHHHHHHH- Q ss_conf 99864115778773-4578765326---30240278--5200147884-39997497145379--------99999997- Q gi|254781018|r 162 TGYIERLQHGKTGI-IDSTTIVMGG---IIYINPPQ--IIKNETLVGE-WWAINTGMPESSTG--------ECVSFVEQ- 225 (337) Q Consensus 162 a~~~e~~~~~~~sg-~D~~~~~~gg---~~~~~~~~--~~~~~~~~~~-~~~i~~~~~~~st~--------~~~~~~~~- 225 (337) ++.+|+.+.|.+|| |||+++++|+ .++++..+ ...+..+... +++++++.++.... ++....+. T Consensus 158 ~~~aE~~~~G~~~GiMDQ~as~~g~~~~al~id~~~l~~~~v~lp~~~~~vi~dS~v~~~l~~~~Yn~R~~ec~~aa~~l 237 (382) T PRK05101 158 GQEAENQFVGCNCGIMDQLISALGKKDHALLIDCRSLETKAVSMPEGVAVVIINSNVKRGLVDSEYNTRREQCETAARFF 237 (382) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHCCCCEEEECCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999840178887167889985267828850367884366546999679995388654345450578999999999996 Q ss_pred ----HH-------------------CCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH-HHHHCCCCCHHHHHHHH Q ss_conf ----64-------------------1206899987654899999983872168999999599-99971899779999999 Q gi|254781018|r 226 ----HF-------------------SQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQS-LLESIGVVPETVGKFIR 281 (337) Q Consensus 226 ----~~-------------------~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~-ll~~l~vs~~~l~~l~~ 281 (337) .. .+-.++-..++++.++.++|+++|++.+|++|+++|. ++++++||+|+||.|++ T Consensus 238 ~~~~Lrd~~~~~l~~~~~~l~~~~~~Ra~Hvi~E~~Rv~~a~~aL~~~d~~~lG~Lm~~SH~Slr~~yevS~~elD~lv~ 317 (382) T PRK05101 238 GVKALRDVTLEQFNAVAAELDPVVAKRARHVITENARTLEAASALAQGDLKRMGELMAESHASMRDDFEITVPQIDTLVE 317 (382) T ss_pred CHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 92413328999999877108999999999999999999999999862899999999986279899877079889999999 Q ss_pred HHHHC--CC-EEEECCCCCCCCCEEEEEEEECCCHHHHHH Q ss_conf 99857--98-089507886788508999985799799997 Q gi|254781018|r 282 CIENN--GG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSS 318 (337) Q Consensus 282 ~~~~~--g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~ 318 (337) ++++. +. |+||||+| +|+|.+.|.+..+.+++.+ T Consensus 318 ~a~~~~g~~gGARmtG~G---fGGc~vaLv~~~~~~~~~~ 354 (382) T PRK05101 318 IVKAVIGDQGGVRMTGGG---FGGCIVALVPEELVEAVRQ 354 (382) T ss_pred HHHHHCCCCCEEEEECCC---CCEEEEEEECHHHHHHHHH T ss_conf 999761877546896367---6002899974778999999 No 7 >KOG1511 consensus Probab=100.00 E-value=0 Score=328.66 Aligned_cols=324 Identities=24% Similarity=0.287 Sum_probs=247.7 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEE-EECCCCCCCCCCCCCC----CCCCCCCCCCCCC Q ss_conf 8998581799974454407954899876672799999848982999-9557666753210012----3333433211111 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRI-INSQTIEYSFEACRLL----GHQIDQRHRDFIA 85 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~-~s~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 85 (337) +.++|||||+|+|||+|+||.+|++++|++|||+.+.++.++.+.+ .++...++.|...+.. ++.......+... T Consensus 5 l~vsaPGKvILfGEHAVVyg~~AlAaai~LrTyl~l~~san~~i~l~l~di~~~~~w~l~~~~~~l~~~~~~~~~~q~p~ 84 (397) T KOG1511 5 LLVSAPGKVILFGEHAVVYGRTALAAAIDLRTYLRLQTSANDRILLQLPDISIEKAWSLADFNGALPEQRSTYESVQTPA 84 (397) T ss_pred EEECCCCCEEEECCCEEEECCCEEEEEEECCEEEEEEECCCCEEEEECCCCCCEEEEEHHHHHHHHHHHHHHHHCCCCCC T ss_conf 24058962799523105777702588960412699984589738996366773478872343323146655432137841 Q ss_pred CCC-------------CHHHHHHHHHHH-HHHHHHHCCCCCCCC----EEEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 221-------------033477788899-999875115667887----69999805541038985303789999999998 Q gi|254781018|r 86 KKI-------------PITSVLTHPNDL-ILYIFNRHLPHISCG----ISLDLHSTIPIGSGFGSSSAIISALSLALSSI 147 (337) Q Consensus 86 ~~~-------------~~~~~~~~~~~~-~~~~~~~~~~~~~~g----~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~ 147 (337) ... ...+...+...+ +.+.+ ........| +.+.+.|++|.|+|||||||++|++..++..+ T Consensus 85 ~~~~~e~~k~l~~l~~~~~~~~~~~a~~~~lYlf-~~l~~~~~g~lp~~~v~v~SelP~GaGLGSSAa~sv~lAtall~~ 163 (397) T KOG1511 85 SEVRVELLKQLGGLLENQEKVKEHLAGLSFLYLF-LGLCLRAPGTLPALTVVVDSELPLGAGLGSSAAISVALATALLRL 163 (397) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 0033899997543332011356788999999999-876404668886569998446787677650689999999999997 Q ss_pred HCCCC----------CCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCH Q ss_conf 18775----------4025899999986411577877345787653263024027-852001478843999749714537 Q gi|254781018|r 148 THQPF----------CNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESST 216 (337) Q Consensus 148 ~~~~~----------~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st 216 (337) .+.-. .+.+.+.++|...|+.+||++||.|+++|.|||.++|+.. +...++..+...+++.++...++| T Consensus 164 ~g~i~~p~~~~~~~e~~l~Li~~WAf~gE~~iHGtpSGiDnaV~t~Gg~i~f~kg~~~~~Lk~~~~L~illtnTrv~RnT 243 (397) T KOG1511 164 AGLIPPPGSNLSLAENDLALINKWAFEGEKCIHGTPSGIDNAVCTYGGLISFKKGVEIESLKHLPPLRILLTNTRVPRNT 243 (397) T ss_pred CCCCCCCCCHHCCCCCHHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCEEEEECCCCCEECCCCCCCEEEEECCCCCCCH T ss_conf 35689985300013226999998875056011488765421221147647861476502036688733898716567519 Q ss_pred HHHHHHHHHHHCCCH-----HHHHHHHHHHHHHHHHHCC----CCH---HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 999999997641206-----8999876548999999838----721---6899999959999971899779999999998 Q gi|254781018|r 217 GECVSFVEQHFSQSS-----IWSEFNAVTNEMMENIQKK----DAE---KTYKSVRVNQSLLESIGVVPETVGKFIRCIE 284 (337) Q Consensus 217 ~~~~~~~~~~~~~~~-----~~~~~~~i~~~~~~al~~~----d~~---~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~ 284 (337) +.++..++.+.++.+ .+++++.++.++..++.+. +.. .+.++++.||.+|..+|||+|++|.++.... T Consensus 244 k~lVa~Vr~~~~kfPevi~~i~~aid~is~ea~~il~~e~~~~~~~~Eq~L~eLi~iNq~LL~alGVsH~~le~v~~~t~ 323 (397) T KOG1511 244 KALVAGVRELLEKFPEVIKAIFDAIDEISLEAVWILQRENDEFSSPKEQKLEELIRINQDLLDALGVSHPSLELVCTTTR 323 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99999999987753588999999999999999999742445678827889999999717889875779488999999999 Q ss_pred HCCCEEEECCCCCCCCCEEEEEEEECCCHHHHHHH------CCCEEEEEEECCCCEEEC Q ss_conf 57980895078867885089999857997999974------598699999879970929 Q gi|254781018|r 285 NNGGSAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL------YGYTCYKIKEDKNGTMLA 337 (337) Q Consensus 285 ~~g~~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~------~~~~~~~~~~~~~G~~i~ 337 (337) +.|..+|+||||| |||...++.+....+.+.++ .+|.++...+..+|++|. T Consensus 324 k~gi~sKLTGAGg--GGc~itlL~~~~~qe~i~~~ke~L~s~gf~v~~t~lGG~G~~v~ 380 (397) T KOG1511 324 KLGIHSKLTGAGG--GGCVITLLKPGTEQEQIDKWKEELESHGFEVFETELGGPGVSVH 380 (397) T ss_pred HHCCCEECCCCCC--CCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCEEEE T ss_conf 8286000123787--86599997899864788889999985686255025888825887 No 8 >PRK03817 galactokinase; Provisional Probab=100.00 E-value=0 Score=318.82 Aligned_cols=300 Identities=17% Similarity=0.156 Sum_probs=211.0 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) +.++|||||+|+|||+||+|++|++|||++++++.+++.+ .+++.+..+.. .+. .+. .... T Consensus 1 i~v~APGRV~LiGEH~Dy~gg~vl~~AI~~~~~i~~~~~~--~~~~~s~~~~~-~~~-----~~~-----------~~~~ 61 (351) T PRK03817 1 MKVKSPGRVNLIGEHTDYNDGYVLPFAIDRYTVLEIEKSD--KFIFYSENFNE-TFE-----LDK-----------LEKT 61 (351) T ss_pred CEEECCCCEEEECCCEEECCCCEEEEEECCEEEEEEEECC--CEEEEECCCCC-CEE-----ECC-----------CCCC T ss_conf 9696871379863887788986795882124999999789--18999888887-234-----045-----------5667 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 34777888999998751156678876999980554103898530378999999999818775402589999998641157 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQH 170 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~ 170 (337) ..|.+|....+..+... .....|+++.++|+||+++|||||||++||++.+++.+++.+++ ++++.++++.+|+.++ T Consensus 62 ~~w~~yv~gvi~~l~~~--g~~~~G~~i~i~s~iP~gsGLsSSAAl~va~~~al~~~~~~~l~-~~~la~~~~~aE~~~~ 138 (351) T PRK03817 62 NSWADYIKGVIWVLEKE--GYEVGPVKGKVSSNLPLGAGLSSSAALEVAVAYALNEYYNLNLS-KLELALIARKAENEFV 138 (351) T ss_pred CCHHHHHHHHHHHHHHC--CCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHC T ss_conf 97688999999999970--99988749999778889888764899999999999998489999-9999999999999861 Q ss_pred CCCCC-HHHHHHHHCC---CEEEECCCCC--CCCCCCC--CEEEEECCCCCCCHHH--------HHHHHHHH-------- Q ss_conf 78773-4578765326---3024027852--0014788--4399974971453799--------99999976-------- Q gi|254781018|r 171 GKTGI-IDSTTIVMGG---IIYINPPQII--KNETLVG--EWWAINTGMPESSTGE--------CVSFVEQH-------- 226 (337) Q Consensus 171 ~~~sg-~D~~~~~~gg---~~~~~~~~~~--~~~~~~~--~~~~i~~~~~~~st~~--------~~~~~~~~-------- 226 (337) |.+|| |||+++++|+ .++++..+.. ..+..+. .+++++++.++..... +...++.. T Consensus 139 G~~~G~mDq~as~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~vv~~s~~~~~l~~~~yn~r~~~~~~a~~~l~~~~~~~~ 218 (351) T PRK03817 139 GVPCGIMDQFAVAFGKKDHAIFLDTMTLEYEYIPFPLKDYEILVFDSGVKHELASSEYNERRKECEEALKILGKKSFREV 218 (351) T ss_pred CCCCCHHHHHHHHHCCCCEEEEECCCCCCEEEECCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHC T ss_conf 58850888999985457738982168884476336887758999977975434510678999999999999660004431 Q ss_pred ------------HCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHH-CCC-EEE Q ss_conf ------------41206899987654899999983872168999999599999-718997799999999985-798-089 Q gi|254781018|r 227 ------------FSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLE-SIGVVPETVGKFIRCIEN-NGG-SAK 291 (337) Q Consensus 227 ------------~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~-~l~vs~~~l~~l~~~~~~-~g~-~ak 291 (337) ..+...+...+.++.++.++|+++|++.+|++|+++|..++ .++||+|+||.|++.+++ .|+ |+| T Consensus 219 ~~~~l~~l~~~~~~r~~hv~~E~~rv~~~~~al~~~d~~~lG~Lm~~sh~sLr~~~~vS~~eld~lv~~a~~~~Ga~Gak 298 (351) T PRK03817 219 TEEDLEKLPPLLRKRAGHVLRENERVLKVRDALKEGDVETLGKLLYESHESLADLYEVSCEELDFIVEFLLGKLGIYGAR 298 (351) T ss_pred CHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECE T ss_conf 69999857688999999999888999999999873899999999999889998866489778999999998608986753 Q ss_pred ECCCCCCCCCEEEEEEEECCCHHHHHHH----------CCCEEEEEEECCCCEEE Q ss_conf 5078867885089999857997999974----------59869999987997092 Q gi|254781018|r 292 IAGAGSIRGEKAGLVLVGGYNPEKLSSL----------YGYTCYKIKEDKNGTML 336 (337) Q Consensus 292 ~sGAG~~~Gg~~~~~l~~~~~~~~~~~~----------~~~~~~~~~~~~~G~~i 336 (337) ||||| +|+|+++|.+....+++.+. ..-.+|.++. .+|+++ T Consensus 299 ltGaG---~GGc~vaLv~~~~~e~~~~~v~~~Y~~~~~~~~~~~~~~~-~~Ga~~ 349 (351) T PRK03817 299 LVGGG---FGGSVIVLVKKGAFESIGEELLEEYTKKFGITPEYFKVES-SDGVQK 349 (351) T ss_pred EECCC---CCEEEEEEECHHHHHHHHHHHHHHHHHHHCCCCEEEEEEE-CCCCEE T ss_conf 31488---7558999975467999999999999996399981999984-889561 No 9 >PRK00555 galactokinase; Provisional Probab=100.00 E-value=0 Score=318.64 Aligned_cols=288 Identities=15% Similarity=0.120 Sum_probs=210.3 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) ..-+|||||+|+|||+||+||+|++|||++++++.+++++++.+++.+....... ... .+ ...... T Consensus 3 ~~~sAPGRVnliGEH~DY~gG~VLp~AI~~~~~v~~~~~~~~~i~~~s~~~~~~~----~~~---~~-------~~~~~~ 68 (363) T PRK00555 3 VSYAAPGRINLIGEHTDYNLGFALPIALPQRTVVTFTPEHTGAITARSDRADGSV----RIP---LD-------TTPGQV 68 (363) T ss_pred EEEECCCCEEEEECCEEECCCEEEEEEHHHCEEEEEEECCCCEEEEEECCCCCCE----EEE---CC-------CCCCCC T ss_conf 6997666589762875789974694782012899999889986999976889867----986---77-------687766 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 34777888999998751156678876999980554103898530378999999999818775402589999998641157 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQH 170 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~ 170 (337) ..|.+|+...+..+.. .+....|+++.|.|+||+++||+||||+.||++.+++.+++.+++ +.++.++++.+|+.+. T Consensus 69 ~~W~~y~~gv~~~l~~--~g~~~~g~~~~I~s~IP~gaGLSSSAAl~va~~~al~~~~~~~~~-~~ela~la~~aE~~~~ 145 (363) T PRK00555 69 TGWAAYAAGVIWALRG--AGHPVPGGAMSITSDVEIGSGLSSSAALECAVLGAVGAATGTRID-RLEQARLAQRAENDYV 145 (363) T ss_pred CCHHHHHHHHHHHHHH--HCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHCC T ss_conf 2278999999999997--189988852899778887667441899999999999997089999-9999999999997517 Q ss_pred CCCCC-HHHHHHHHCCC---EEEECCC--CCCCCCCCC----CEEEEECCCCCCCHHHHHHHHHH--------------- Q ss_conf 78773-45787653263---0240278--520014788----43999749714537999999997--------------- Q gi|254781018|r 171 GKTGI-IDSTTIVMGGI---IYINPPQ--IIKNETLVG----EWWAINTGMPESSTGECVSFVEQ--------------- 225 (337) Q Consensus 171 ~~~sg-~D~~~~~~gg~---~~~~~~~--~~~~~~~~~----~~~~i~~~~~~~st~~~~~~~~~--------------- 225 (337) |.+|| |||+++.+|.. ++++..+ ...+...+. .+++++++.++......+...+. T Consensus 146 G~~~GiMDQ~as~~g~~g~al~id~~~l~~~~v~~~~~~~~~~~vi~dS~~~~~~~~~~y~~R~~ec~~aa~~l~~~~lr 225 (363) T PRK00555 146 GAPTGLLDQLAALFGAPKTALLIDFRDLTVRPVAFDPDACGVVLLLMDSRARHRHAGGEYAARRASCERAAADLGVSSLR 225 (363) T ss_pred CCCCCCHHHHHHHCCCCCEEEEECCCCCCEECCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHH T ss_conf 88875054788862788839873267885321444777677699997667644345323589999999999985912354 Q ss_pred ------------H-----HCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHHCCCCCHHHHHHHHHHHHCC Q ss_conf ------------6-----412068999876548999999838721689999995999-9971899779999999998579 Q gi|254781018|r 226 ------------H-----FSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSL-LESIGVVPETVGKFIRCIENNG 287 (337) Q Consensus 226 ------------~-----~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~l-l~~l~vs~~~l~~l~~~~~~~g 287 (337) . ..+..++-..++++.++.++|+++|++.+|++|+++|.. ++++++|+|+||.|++.+.+.| T Consensus 226 ~~~~~~l~~l~~~~~~~~~~Ra~hv~~E~~RV~~~~~al~~~d~~~~G~Lm~~SH~Slrd~yevS~~elD~lv~~a~~~G 305 (363) T PRK00555 226 AVQDRGLAALGAIADPIDARRARHVLTENQRVLDFVAALADSDFTAAGQILTASHASMRDDFAITTERIDLIADSAVRAG 305 (363) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC T ss_conf 42076899987505888888888888888899999999873999999999999999999865179687999999999769 Q ss_pred C-EEEECCCCCCCCCEEEEEEEECCCHHHHHH Q ss_conf 8-089507886788508999985799799997 Q gi|254781018|r 288 G-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSS 318 (337) Q Consensus 288 ~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~ 318 (337) + |+||||+| +|+|.+.|++.+..++... T Consensus 306 a~GARltGaG---fGGc~vaLv~~~~~~~~~~ 334 (363) T PRK00555 306 ALGARMTGGG---FGGCVIALVPADRARDVAD 334 (363) T ss_pred CCEEEEECCC---CCEEEEEEECHHHHHHHHH T ss_conf 9278897378---8748999966878999999 No 10 >COG0153 GalK Galactokinase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=312.23 Aligned_cols=300 Identities=17% Similarity=0.176 Sum_probs=217.6 Q ss_pred CCCC-CCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4357-996089985817999744544079548998766727999998489829999557666753210012333343321 Q gi|254781018|r 3 QQTS-QTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHR 81 (337) Q Consensus 3 ~~~~-~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 81 (337) +.|+ .+|...++|||||+|+|||+||+||+|+|+|||.+|++.++++++..+++++..+.+...... . T Consensus 15 ~~f~~~~~~~~~~aPGRvNLIGEHtDYn~G~VlP~Ain~~t~v~v~~r~d~~v~l~s~n~~~~~~~~~-----------~ 83 (390) T COG0153 15 EHFGYVEPTVTAFAPGRVNLIGEHTDYNGGFVLPCAINYGTYVAVAKRDDGKVRLYSANFGNAGDIFF-----------L 83 (390) T ss_pred HHHCCCCCCEEECCCCEEEEECCCEECCCCEEEEEEEECCEEEEEEECCCCEEEEEECCCCCCCCEEE-----------C T ss_conf 98621376457617855886012221268667888960542899997368559999678763342430-----------4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 11112210334777888999998751156678876999980554103898530378999999999818775402589999 Q gi|254781018|r 82 DFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAE 161 (337) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~ 161 (337) +..........|.+|+.+.+....... ....|+++.+.|+||.|+|||||||+.|++..++.++++++++ +.++.++ T Consensus 84 ~~d~~~~~~~~W~nYvkgvi~~l~~~g--~~~~G~~i~i~gnIP~GaGLSSSAAleva~~~al~~l~~~~~~-k~~la~i 160 (390) T COG0153 84 LLDIAKEKIDDWANYVKGVIKALQKRG--YAFTGLDIVISGNIPIGAGLSSSAALEVAVALALQRLFNLPLD-KAELAKI 160 (390) T ss_pred CHHHCCCCCCHHHHHHHHHHHHHHHCC--CCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHHHHHH T ss_conf 433144531103324789999998658--7767705999667888888672689999999999997389998-8899999 Q ss_pred HHHHH-HCCCCCCCCHHHHHHHHCC---CEEEECCCC--CCCCCCCC--CEEEEECCCCCCCHHHHHHHHH--------- Q ss_conf 99864-1157787734578765326---302402785--20014788--4399974971453799999999--------- Q gi|254781018|r 162 TGYIE-RLQHGKTGIIDSTTIVMGG---IIYINPPQI--IKNETLVG--EWWAINTGMPESSTGECVSFVE--------- 224 (337) Q Consensus 162 a~~~e-~~~~~~~sg~D~~~~~~gg---~~~~~~~~~--~~~~~~~~--~~~~i~~~~~~~st~~~~~~~~--------- 224 (337) ++..| +.++.+++.|||.++.+|. .++++.... ...+.+.. .++++++..++......+...+ T Consensus 161 ~q~AEn~fvGvn~G~mDQ~~s~~G~~~~al~ld~~~l~~~~~~~p~~~~~ivI~ns~vkr~la~seYn~Rr~ece~A~~~ 240 (390) T COG0153 161 AQVAENQFVGVNCGIMDQLASAFGKKDHALLLDCRTLEYEPVPFPVGGVSIVIVNSNVKRELADSEYNERRAECEEAAEF 240 (390) T ss_pred HHHHHHHCCCCCCCHHHHHHHHHCCCCCEEEEECCCCCEEEECCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99987513477675088999986877848999736662687304766459999637976654234789999999999999 Q ss_pred --H------------------HHC-------CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHHCCCCCHHH Q ss_conf --7------------------641-------2068999876548999999838721689999995999-99718997799 Q gi|254781018|r 225 --Q------------------HFS-------QSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSL-LESIGVVPETV 276 (337) Q Consensus 225 --~------------------~~~-------~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~l-l~~l~vs~~~l 276 (337) . ... +...+-..++++.++.++|+++|+..+|++|+++|.. +++++||+|+| T Consensus 241 l~~~~~~L~d~~~~~~~~~~~~i~~~~~~~rRa~hvv~En~Rvl~a~~Al~~~dl~~fG~Lm~~SH~slrddyevt~pEl 320 (390) T COG0153 241 LGVSIKSLRDVTDEEFAALQAEIEVDPKIARRARHVVTENQRVLEAAKALRSGDLTEFGELMNESHESLRDDYEVTCPEL 320 (390) T ss_pred HHHHHHHHHHCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 98755035433799999645241446688899999986777899999998718899999999998998874356656628 Q ss_pred HHHHHHHHHC-C-CEEEECCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 9999999857-9-80895078867885089999857997999974 Q gi|254781018|r 277 GKFIRCIENN-G-GSAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL 319 (337) Q Consensus 277 ~~l~~~~~~~-g-~~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~ 319 (337) |.|++++... | .|+|||||| +|+|.+.|++..+.++..+. T Consensus 321 D~lve~a~~~~G~~GaRmTGaG---fGGc~IaLv~~~~v~~~~e~ 362 (390) T COG0153 321 DTLVEIALAAGGAYGARMTGAG---FGGCVIALVPNDDVEAVAEA 362 (390) T ss_pred HHHHHHHHHCCCCCCCEECCCC---CCCEEEEEECHHHHHHHHHH T ss_conf 9999999872786501203788---78559999322027999999 No 11 >TIGR00131 gal_kin galactokinase; InterPro: IPR000705 Galactokinase catalyses the first reaction in the galactose metabolism pathway, the ATP-dependent phosphorylation of galactose, yielding galactose-1-phosphate , . Deficiency in this enzyme results in the disease galactosemia, which is responsible for the formation of cataracts in newborn babies, and is possibly responsible for presenile cataracts in adults . In yeast, the GAL3 gene product is required for the GAL4-mediated induction of other enzymes involved in galactose metabolism. The induction of GAL1 production then reinforces this process, increasing the expression of other galactose-inducible genes. GAL3 has been shown to be similar to the GAL1 protein .; GO: 0004335 galactokinase activity, 0005524 ATP binding, 0006012 galactose metabolic process, 0046835 carbohydrate phosphorylation. Probab=100.00 E-value=1.4e-45 Score=306.02 Aligned_cols=315 Identities=16% Similarity=0.183 Sum_probs=213.5 Q ss_pred CCCCCCCCEEEE-CCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCC--CCCCCCCCCCCCCCCC Q ss_conf 435799608998-5817999744544079548998766727999998489829999557666--7532100123333433 Q gi|254781018|r 3 QQTSQTPHAHSI-APAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIE--YSFEACRLLGHQIDQR 79 (337) Q Consensus 3 ~~~~~~p~~~~~-APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~--~~~~~~~~~~~~~~~~ 79 (337) ++|+.+|+.+++ |||||||+|||+||.++.|+|+|||..|.+.++++++..+++......+ ..-...+.+.... T Consensus 10 ~~~~~~~~~~~~faPGRVNLIGEHtDYnDg~VlPcAid~~t~~av~~rdd~~~~~~~~n~~na~~~e~slD~~~~~~--- 86 (500) T TIGR00131 10 SAFGEKPDFTARFAPGRVNLIGEHTDYNDGFVLPCAIDFGTLVAVAERDDKNVRVYAANADNAKFDEFSLDLPLDGS--- 86 (500) T ss_pred HHHCCCCCEEEEECCCCEEEEEEEEEECCCEECCCHHCCCCEEEEEEECCCCEEEEEECCCCCCCCCEECCCCCCCC--- T ss_conf 86088976699977884622131242268575550105471687543048860688625886443100037632355--- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHH--------HHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHH-HHHH-HHHC Q ss_conf 21111122103347778889999987--------511566788769999805541038985303789999-9999-9818 Q gi|254781018|r 80 HRDFIAKKIPITSVLTHPNDLILYIF--------NRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALS-LALS-SITH 149 (337) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~-~al~-~~~~ 149 (337) +.........|.+|+.+.+..+. .+.......|+++.+.+|||.|+||+||||+.||+. ..+- .++. T Consensus 87 ---e~~iDPSv~~WaNY~kgv~~~~qS~LKK~AP~R~~~~p~~G~~~v~~GnvP~G~GLSSSAa~~~avgL~v~qA~~g~ 163 (500) T TIGR00131 87 ---EVAIDPSVSDWANYVKGVVKVAQSYLKKLAPERFNSFPLVGADIVISGNVPTGAGLSSSAALEVAVGLAVLQAQLGH 163 (500) T ss_pred ---CEEECCCHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf ---30227770001354430478888888873444454457553478996577887413389999999999998861587 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCE---EEECCC---CCCCCCCC----CCEEEEECCCCCCC---H Q ss_conf 77540258999999864115778773457876532630---240278---52001478----84399974971453---7 Q gi|254781018|r 150 QPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGII---YINPPQ---IIKNETLV----GEWWAINTGMPESS---T 216 (337) Q Consensus 150 ~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~---~~~~~~---~~~~~~~~----~~~~~i~~~~~~~s---t 216 (337) .++...+.+......+|+.++.++|+|||+++.+|.-. +++..+ ......+. ..-++|.++...++ + T Consensus 164 ~~~~~~~~~~~~q~aenhfvGvNcG~MDQ~~s~~G~~dHaL~~~c~P~L~~~~~~~P~~KNH~~a~VI~Ntn~~~~KFEl 243 (500) T TIGR00131 164 LPLDSKQILLRGQKAENHFVGVNCGIMDQLISALGKEDHALLIDCRPSLKTTPFKLPQLKNHGVAVVIANTNVKRTKFEL 243 (500) T ss_pred CHHHHHHHHHHHHHHHHCEEECCCCHHHHHHHHCCCCCCEEEEECCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEE T ss_conf 40247899886555644525104773788996527525335552377778441007862256458999614764550101 Q ss_pred H----------------------------------------------------------------------HHHHHHHH- Q ss_conf 9----------------------------------------------------------------------99999997- Q gi|254781018|r 217 G----------------------------------------------------------------------ECVSFVEQ- 225 (337) Q Consensus 217 ~----------------------------------------------------------------------~~~~~~~~- 225 (337) . ++...+.+ T Consensus 244 a~~~YNtRr~ec~~Aa~~l~~~Y~v~L~S~K~~S~~~~gaLRD~~~~~f~r~~N~~~PWNGDi~~Gia~~~~l~~lv~~S 323 (500) T TIGR00131 244 APSEYNTRRQECETAANFLAKKYSVVLLSGKENSSLDKGALRDVMVEYFARYENIATPWNGDIESGIARLTKLLPLVAKS 323 (500) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 48874622888999999733772078822667653320002788766776541467897874557899998878999999 Q ss_pred ----------------------------------------HHCCCHHHHHHHHHHHHHHHHHH------CCC-CHHHHHH Q ss_conf ----------------------------------------64120689998765489999998------387-2168999 Q gi|254781018|r 226 ----------------------------------------HFSQSSIWSEFNAVTNEMMENIQ------KKD-AEKTYKS 258 (337) Q Consensus 226 ----------------------------------------~~~~~~~~~~~~~i~~~~~~al~------~~d-~~~l~~~ 258 (337) .+.+-.++-..+.++.++.++|+ ++| ++.+|+| T Consensus 324 ~~~KK~GF~v~~~~~~LNCSREEFtRDYLtt~PvRFqvLKLY~RAkHv~~En~Rvl~A~~~l~~A~F~~~~dF~~~~G~L 403 (500) T TIGR00131 324 LARKKSGFSVDEVATALNCSREEFTRDYLTTSPVRFQVLKLYQRAKHVVTENLRVLKAVKMLKSASFTADNDFLKELGKL 403 (500) T ss_pred HHHCCCCCCHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHEECCHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 85046885188887642031234311221047456666667534231100155789999987411332330489999898 Q ss_pred HHHHHHH-HHHCCCCCHHHHHHHHHHHHC-C--CEEEECCCCCCCCCEEEEEEEE---CCCHHHHHHHCCCEEEE Q ss_conf 9995999-997189977999999999857-9--8089507886788508999985---79979999745986999 Q gi|254781018|r 259 VRVNQSL-LESIGVVPETVGKFIRCIENN-G--GSAKIAGAGSIRGEKAGLVLVG---GYNPEKLSSLYGYTCYK 326 (337) Q Consensus 259 ~~~~~~l-l~~l~vs~~~l~~l~~~~~~~-g--~~ak~sGAG~~~Gg~~~~~l~~---~~~~~~~~~~~~~~~~~ 326 (337) |+++|.. .++++|++|++|.+|..++.. | .|+|||||| +|+|.+.|.+ ....+...+...-+.|+ T Consensus 404 M~eSh~S~~ddyE~tcPeiD~~v~~~~~~~G~~~GsRmTGaG---fGGC~v~Lvp~G~~~~V~~v~~~~a~~FY~ 475 (500) T TIGR00131 404 MNESHASMRDDYEITCPEIDELVESIALVIGDSGGSRMTGAG---FGGCTVALVPSGANENVDKVRKAVADRFYK 475 (500) T ss_pred HHHHHHHHHHCCEECCHHHHHHHHHHHHHCCCEECEEECCCC---CCCEEEEEECCCCCCCHHHHHHHHHHHHCC T ss_conf 999986452071433511423877877643854110313787---566078850488987538999998653302 No 12 >PTZ00290 galactokinase; Provisional Probab=100.00 E-value=5.5e-44 Score=295.88 Aligned_cols=316 Identities=15% Similarity=0.203 Sum_probs=191.6 Q ss_pred CCCCCCCCC------EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECC---CCEEEEEECCCCCCCCCCCCCC Q ss_conf 543579960------8998581799974454407954899876672799999848---9829999557666753210012 Q gi|254781018|r 2 KQQTSQTPH------AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIE---PSLIRIINSQTIEYSFEACRLL 72 (337) Q Consensus 2 ~~~~~~~p~------~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~---~~~~~~~s~~~~~~~~~~~~~~ 72 (337) +++|+++|+ ++++|||||||+|||+||+||.|+||||++++++.+.+.. ++.+++.+.....+.+.. .. T Consensus 23 ~~~~~~~~~~~~~~~~farAPGRVNLIGEHtDYngg~VLP~AI~~~t~i~va~~~~~~~~~~~~~~~~~~~~~~~~--~~ 100 (468) T PTZ00290 23 LETFKVENDADVEWLLFTFAPGRVNFIGEHVDYMGGYVCPAAVLEGCHILVGRVKHFCDHKLRFATETDEHFVLDH--LG 100 (468) T ss_pred HHHHCCCCCCCCCCEEEEECCCEEEEECCCCCCCCCEEECEEECCCEEEEEEECCCCCCCEEEEECCCCCCCCCCC--CC T ss_conf 9985799865421247997787265402310228985467470168099998425787746998506776521576--67 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC----CCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 33334332111112210334777888999998751156----67887699998055410389853037899999999981 Q gi|254781018|r 73 GHQIDQRHRDFIAKKIPITSVLTHPNDLILYIFNRHLP----HISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSIT 148 (337) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~ 148 (337) . ......|.+|+.+.+.....+... ....|+++.++|++|.|+||+||||+.||++.+++.+. T Consensus 101 ~-------------~~~~~~W~nY~~Gv~~~~l~~~g~~~~~~~~~G~~~~v~g~VP~GsGLSSSAAleva~~~al~~~~ 167 (468) T PTZ00290 101 G-------------AKHNKAWTTFVRGAATLRLNRLGVAIDAPSLQGVCMVVHGTLPMGAGMSASASFGVALLNAINTVV 167 (468) T ss_pred C-------------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 7-------------888987631789999999997198767788860599998988774157899999999999999987 Q ss_pred CCCCC----------------CHHHH---HHHHHHHHH-CCCCCCCCHHHHHHHHCCC---EEEECCCC--CCCCCCC-- Q ss_conf 87754----------------02589---999998641-1577877345787653263---02402785--2001478-- Q gi|254781018|r 149 HQPFC----------------NKEKL---IAETGYIER-LQHGKTGIIDSTTIVMGGI---IYINPPQI--IKNETLV-- 201 (337) Q Consensus 149 ~~~~~----------------~~~~~---~~~a~~~e~-~~~~~~sg~D~~~~~~gg~---~~~~~~~~--~~~~~~~-- 201 (337) ..... .++++ ..+++.+|+ .++.++|+|||+++++|.. ++++..+. ..++..| T Consensus 168 ~~~~~~~~~~~~~~~~i~~~~sk~e~~~la~~aqraEn~yvGv~~GiMDQ~aS~~g~~g~al~iDc~~l~~e~v~~~p~~ 247 (468) T PTZ00290 168 TRRYKGCPTSPGRRYSILPPMSKEELIELAKQARRIETEFCGVNVGIMDQFISAFAEEDKFMFLDCKSLTFESHDMTPLL 247 (468) T ss_pred HCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCEEEEECCCCCEEECCCCCCC T ss_conf 42234433343211123455567789999999999987105788640778899755579589997277753431578666 Q ss_pred ---CCEEEEECCCCCCC---HHHHHHHHH-----------HHH-CCC---------------------HH---------- Q ss_conf ---84399974971453---799999999-----------764-120---------------------68---------- Q gi|254781018|r 202 ---GEWWAINTGMPESS---TGECVSFVE-----------QHF-SQS---------------------SI---------- 232 (337) Q Consensus 202 ---~~~~~i~~~~~~~s---t~~~~~~~~-----------~~~-~~~---------------------~~---------- 232 (337) ..++++++..++.. ....++..+ ... ... .. T Consensus 248 ~~~~~~vi~nS~v~h~L~~sa~~~Yn~Rr~ece~a~~~l~~~~l~~~~~~~~~l~~~~~~~~~~~d~~~~l~~~~~~~~~ 327 (468) T PTZ00290 248 GDGACFLLIDSMIKHDLLGGTAGMYNTVRSDQEGAQKKIGKHRYRGKPFTFSDLVRNPKKYTFDGDVVAFMESCKPLMTP 327 (468) T ss_pred CCCCEEEEEECCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCHHHHCCHHHHHHHHHHHHHHCCH T ss_conf 78718999837730204666423678999999999999818752145310777751624302113589999875642486 Q ss_pred --HHHH------HHHHHHHHHH-------HHCCCCHHHHHHHHHHHHH-HHHCCCCCHHHHHHHHHH-HHCCC-EEEECC Q ss_conf --9998------7654899999-------9838721689999995999-997189977999999999-85798-089507 Q gi|254781018|r 233 --WSEF------NAVTNEMMEN-------IQKKDAEKTYKSVRVNQSL-LESIGVVPETVGKFIRCI-ENNGG-SAKIAG 294 (337) Q Consensus 233 --~~~~------~~i~~~~~~a-------l~~~d~~~l~~~~~~~~~l-l~~l~vs~~~l~~l~~~~-~~~g~-~ak~sG 294 (337) +++. +.++.++.++ .+.++++.+|++|+++|.. +++++||||+||.|++.+ +..|+ |+|||| T Consensus 328 ~~~~Ra~HVv~E~~RVl~~~~~~~~~~~l~~~~~~~~lG~LMneSH~Slrd~YEvScpELD~Lve~~~~~~Ga~GARmTG 407 (468) T PTZ00290 328 GEFERGTYNIMEQIRTLEFIKLNDPELPLSREERFRKAGEILNAGHQGMRDLMKITTPELDFIHELINEEKGVAGGRMMG 407 (468) T ss_pred HHHHHHHEEECHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 66300004632889999999866655544122169999999997278898575898188999999997616985700037 Q ss_pred CCCCCCCEEEEEEEECCCHHHHHHH--------C--CCEEEEEEECCCCEEE Q ss_conf 8867885089999857997999974--------5--9869999987997092 Q gi|254781018|r 295 AGSIRGEKAGLVLVGGYNPEKLSSL--------Y--GYTCYKIKEDKNGTML 336 (337) Q Consensus 295 AG~~~Gg~~~~~l~~~~~~~~~~~~--------~--~~~~~~~~~~~~G~~i 336 (337) || +|+|.+.|.+....+++.+. . .-+.|.++. .+|..| T Consensus 408 aG---FGGC~VaLV~~~~ve~f~~~v~~~Y~~~~g~~~~~~~~~~-~~GA~V 455 (468) T PTZ00290 408 GG---FGGCIILLLKKNAVDRVVAHVREKFKARFGVENDVYPVVA-GDGAFV 455 (468) T ss_pred CC---CCEEEEEEECHHHHHHHHHHHHHHHHHHHCCCCCEEEEEC-CCCCEE T ss_conf 76---4021799955878999999999999997599985899722-799389 No 13 >TIGR01220 Pmev_kin_Gr_pos phosphomevalonate kinase; InterPro: IPR005917 Protein kinases are a group of enzymes that possess a catalytic subunit which transfers the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. The enzymes fall into two broad classes, characterised with respect to substrate specificity: serine/threonine specific and tyrosine specific . Protein kinase function has been evolutionarily conserved from Escherichia coli to Homo sapiens. Protein kinases play a role in a mulititude of cellular processes, including division, proliferation, apoptosis, and differentiation . Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved , leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases . Phosphomevalonate kinase (2.7.4.2 from EC)catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This pathway starts with three molecules of acetyl-CoA, which, in a series of six different enzyme reactions, are converted to isopentanyl pyrophosphate (IPP), the basic C5 isoprene unit and a common intermediate for a number of pathways including isoprenoid and cholesterol biosynthesis. There are two unrelated types of PMK. The first type belongs to the GHMP kinase and includes a eukaryotic group which is typified by Saccharomyces cerevisiae ERG8 and a bacterial group typified by by Streptococcus pneumoniae PMK (MvaK2) . The second type includes animal PMKs, typified by human PMK . PMKs of the first type are present in eubacteria, fungi, and plants, while the second type is found only in animals, indicative of a nonorthologous gene displacement early in animal evolution . PMKs of the GHMP kinase type are closely related to mevalonate kinases. The two types of PMK have different consensus ATP-binding motifs: a protein kinase motif in the ERG8 orthologs versus a P-loop or Walker A motif in the animal orthologs. The fact that ERG8 orthologs are found in pathogenic eubacteria and fungi but not in humans makes them attractive targets for the development of antibacterial and/or antifungal drugs . This group of enzymes belongs to the GHMP kinase domain superfamily. GHMP kinases are a unique class of ATP-dependent enzymes (the abbreviation of which refers to the original members: galactokinase, homoserine kinase, mevalonate kinase, and phosphomevalonate kinase) . Enzymes belonging to this superfamily contain three well-conserved motifs, the second of which has the typical sequence Pro-X-X-X-Gly-Leu-X-Ser-Ser-Ala and is involved in ATP binding . The phosphate binding loop in GHMP kinases is distinct from the classical P-loops found in many ATP/GTP binding proteins. The bound ADP molecule adopts a rare syn conformation and is in the opposite orientation from those bound to the P-loop-containing proteins . GHMP kinases display a distinctly bilobal appearance with their N-terminal subdomains dominated by a mixed beta-sheet flanked on one side by alpha-helices and their C-terminal subdomains containing a four stranded anti-parallel beta-sheet , , , .. Probab=100.00 E-value=3.1e-43 Score=291.18 Aligned_cols=293 Identities=20% Similarity=0.204 Sum_probs=227.4 Q ss_pred EEECCCEEEEECCCEEEC-CCEEEEEEECCCEEEEEEECC---CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 998581799974454407-954899876672799999848---9829999557666753210012333343321111122 Q gi|254781018|r 12 HSIAPAKVILSGEYSSLY-GASALAVAITFYLRALLTTIE---PSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKK 87 (337) Q Consensus 12 ~~~APGKi~L~GEh~v~~-G~~ai~~aI~~~~~v~i~~~~---~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (337) .+.||||++++|||||+. |+|+|.+|+|++++++++.++ +....+.|..+....+ .++....+++-...+- - T Consensus 2 ~v~aPGKLfvAGEYAVvePG~pailvAVDrfvtvtve~~~adt~~~~~i~S~~l~~~pv-~w~r~~g~lv~~d~~d---G 77 (367) T TIGR01220 2 EVKAPGKLFVAGEYAVVEPGYPAILVAVDRFVTVTVEDADADTDAADVIQSSDLSSEPV-RWRRRDGKLVVDDADD---G 77 (367) T ss_pred EEECCCCEEEEEEEEEECCCCCEEEEEECCEEEEEEECCCCCCCCCCCEEECCCCCCCC-EEEEECCEEEEECCCC---C T ss_conf 02089630662013443777955999975658999863567543445145211477760-3464268578723774---3 Q ss_pred CCHHHHHHHHHHHH--HHHHHHCCCCCCCCEEEEEECCCCC----CCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 10334777888999--9987511566788769999805541----03898530378999999999818775402589999 Q gi|254781018|r 88 IPITSVLTHPNDLI--LYIFNRHLPHISCGISLDLHSTIPI----GSGFGSSSAIISALSLALSSITHQPFCNKEKLIAE 161 (337) Q Consensus 88 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~i~i~s~iP~----~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~ 161 (337) -...+.+.|+...+ .....+..+....-+++.|+|++-- +.|||||+|++|++++|++.||+++|+ .++++++ T Consensus 78 qqA~s~l~yVv~AIe~~e~ya~e~n~kl~~~hL~v~S~Ld~~~GrKyGLGSSgAVtV~tVkAln~Fydl~Ls-n~eifkL 156 (367) T TIGR01220 78 QQAESALKYVVAAIEVVEKYARERNKKLKAYHLEVSSELDEADGRKYGLGSSGAVTVATVKALNAFYDLELS-NEEIFKL 156 (367) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEECCCCCEECCCCHHHHHHHHHHHHHHHCCCCCC-HHHHHHH T ss_conf 556645787667899999997650971102676687641057884504662078999999999986287757-5678899 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHCCCEEEECCCC-------------------------CCCCCCCCCEEEEECCCCCCCH Q ss_conf 998641157787734578765326302402785-------------------------2001478843999749714537 Q gi|254781018|r 162 TGYIERLQHGKTGIIDSTTIVMGGIIYINPPQI-------------------------IKNETLVGEWWAINTGMPESST 216 (337) Q Consensus 162 a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~~~-------------------------~~~~~~~~~~~~i~~~~~~~st 216 (337) +...+..+++++|+.|+++|+|||++.|+..|. .++..+...-++|++++...|| T Consensus 157 a~la~~~lq~~gScGDiAaS~ygGWiAystfDhdwV~q~~~~~~v~~~L~~~WPgl~i~pL~aPk~l~lliGWTGsPASt 236 (367) T TIGR01220 157 AVLATLELQSKGSCGDIAASVYGGWIAYSTFDHDWVKQKVEEVSVDEVLKKDWPGLSIEPLKAPKNLELLIGWTGSPAST 236 (367) T ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCHHHHHHCCCCCCEEECCCCCCCCEEEEEECCCCCCC T ss_conf 89864220689975411110111033305766688875541303588861477885022067998677988631788777 Q ss_pred HHHHHHHHHHHCCC-----HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC------CCCCHHHHHHHHHHHH Q ss_conf 99999999764120-----689998765489999998387216899999959999971------8997799999999985 Q gi|254781018|r 217 GECVSFVEQHFSQS-----SIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI------GVVPETVGKFIRCIEN 285 (337) Q Consensus 217 ~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l------~vs~~~l~~l~~~~~~ 285 (337) ..++..+++...+. .|+++...|++.++.+|+++|...+.+.+++|..++.+| +|.+|.|..||++++. T Consensus 237 ~~LVs~vhr~k~~~~a~y~rFl~~s~~CVe~~i~af~tg~i~~~~k~Ir~nR~~la~ld~e~gv~IeTe~Lk~LCD~aE~ 316 (367) T TIGR01220 237 ADLVSEVHRRKEEKSAAYERFLEKSRDCVESLITAFETGDIKSVQKEIRKNRQLLARLDDEVGVDIETEKLKALCDLAEA 316 (367) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHH T ss_conf 67888887404676236888887678999999864315773257888899999998510114202344436655426431 Q ss_pred CCCEEEECCCCCCCCCEEEEEEEECCC Q ss_conf 798089507886788508999985799 Q gi|254781018|r 286 NGGSAKIAGAGSIRGEKAGLVLVGGYN 312 (337) Q Consensus 286 ~g~~ak~sGAG~~~Gg~~~~~l~~~~~ 312 (337) +|++||-|||| ||+||+++..... T Consensus 317 yggAAK~SGAG---GGDCGIai~d~~~ 340 (367) T TIGR01220 317 YGGAAKSSGAG---GGDCGIAILDAKA 340 (367) T ss_pred CCCCCCCCCCC---CCCHHHHEEHHHH T ss_conf 11223778888---7441121220023 No 14 >COG2605 Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only] Probab=100.00 E-value=3.6e-37 Score=252.91 Aligned_cols=296 Identities=21% Similarity=0.218 Sum_probs=222.9 Q ss_pred EEEECCCEEEEECCCEEE------CCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997445440------79548998766727999998489829999557666753210012333343321111 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSL------YGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~------~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) ++.+||-||-+.|.-+|+ |||.|++++||+|+|+++.+..+..+++.-+.+ +....... T Consensus 2 ii~raPLRItfgGGGTDvepy~~k~GGaVlnatIdky~y~~i~~~~d~~I~~~~~~~---------------~~v~~~~~ 66 (333) T COG2605 2 IISRAPLRITFGGGGTDVEPYCSKHGGAVLNATIDKYIYVTIEKGFDDEIRVRYDRT---------------EFVKSYLE 66 (333) T ss_pred CCCCCCEEEEECCCCCCCHHHHHHCCCEEEEEEEEEEEEEEECCCCCCEEEEECCHH---------------HHHHHHHH T ss_conf 302055389852787676077875387777755310799997168885389952357---------------76512676 Q ss_pred CCC-CCHHHHHHHHHHHHHHHHHHCCCCCCCC--EEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 122-1033477788899999875115667887--6999980554103898530378999999999818775402589999 Q gi|254781018|r 85 AKK-IPITSVLTHPNDLILYIFNRHLPHISCG--ISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAE 161 (337) Q Consensus 85 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~ 161 (337) ..+ ......+ ...+ .....| +++...+|+|.|+|||||||+.||++.+++.+.+..+. ++++++. T Consensus 67 ~~h~~~~~~~l-------~r~~----l~~~g~~~~el~~~~D~P~GSGLGSSSa~vvaLl~a~~~~kg~~~~-~~~LA~e 134 (333) T COG2605 67 NEHKPLVVESL-------KRDF----LEFNGGTPIELHTQSDAPPGSGLGSSSAFVVALLNALHAWKGESLG-PYELARE 134 (333) T ss_pred HCCCHHHHHHH-------HHHH----HHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHHHHHH T ss_conf 53761899999-------9998----7314997469998437899998773479999999999998578789-8999999 Q ss_pred HHHHHHCCCCCCCC-HHHHHHHHCCCEEEECCCCC--CCCC----------CCCCEEEEECCCCCCCHHHHHHHHHHHHC Q ss_conf 99864115778773-45787653263024027852--0014----------78843999749714537999999997641 Q gi|254781018|r 162 TGYIERLQHGKTGI-IDSTTIVMGGIIYINPPQII--KNET----------LVGEWWAINTGMPESSTGECVSFVEQHFS 228 (337) Q Consensus 162 a~~~e~~~~~~~sg-~D~~~~~~gg~~~~~~~~~~--~~~~----------~~~~~~~i~~~~~~~st~~~~~~~~~~~~ 228 (337) |..+|+...+..+| |||++++|||+.+.++.+.. ..+. +....++++++..+.|+.....+++...+ T Consensus 135 Ay~IER~~l~~~gG~QDqYaaA~GGFnfMEf~~~~~V~v~pL~i~~e~~~Ele~~~lL~yTGi~R~Ss~V~~dQ~~~~~~ 214 (333) T COG2605 135 AYEIEREDLKIVGGKQDQYAAAFGGFNFMEFRGNGEVVVNPLRINRERTAELEARLLLYYTGITRQSSEVIEDQVRNVVD 214 (333) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHCCCEEEEECCCCCEEEEECCCCHHHHHHHHHCEEEEEECCCCCHHHHHHHHHHHHHC T ss_conf 99988998545466623789984792179974898389800334466899987406998816402255889999987631 Q ss_pred C----CHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC--CCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCC Q ss_conf 2----0689998765489999998387216899999959999971--8997799999999985798-0895078867885 Q gi|254781018|r 229 Q----SSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI--GVVPETVGKFIRCIENNGG-SAKIAGAGSIRGE 301 (337) Q Consensus 229 ~----~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l--~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg 301 (337) . .+.+.+|.+.+.++.++|-++|+..|++.|+.+|+..+.+ ++|+|.+|++++.+++.|+ ++|++|||+ || T Consensus 215 ~~~~~~e~~~~mk~~A~~~~~al~~nd~~~f~~~l~~gW~~KK~ls~~ISN~~IDriy~~A~~~GA~~gKl~GaG~--gG 292 (333) T COG2605 215 GDEETLEALHEMKALAYEMKDALVRNDIPEFGQILDRGWEAKKKLSSRISNDAIDRIYELALKNGAYGGKLSGAGG--GG 292 (333) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHCEEECCCC--CC T ss_conf 5188999999999999999999986335779999976777645533576747789999999866732222302688--62 Q ss_pred EEEEEEEECCCHHH-HHHHC--CCEEEEEEECCCCEEE Q ss_conf 08999985799799-99745--9869999987997092 Q gi|254781018|r 302 KAGLVLVGGYNPEK-LSSLY--GYTCYKIKEDKNGTML 336 (337) Q Consensus 302 ~~~~~l~~~~~~~~-~~~~~--~~~~~~~~~~~~G~~i 336 (337) |- .++++...... +..+. ......++||.+|+++ T Consensus 293 Fl-lf~~~p~k~~~l~r~l~~~~~~~~~~~Fd~~Gsr~ 329 (333) T COG2605 293 FL-LFFCDPSKRNELARALEKEQGFVVDTSFDKEGSRI 329 (333) T ss_pred EE-EEEECCCCHHHHHHHHHHHCCCEEEEEECCCCEEE T ss_conf 79-99968542489999999844973787764787079 No 15 >PRK13412 fkp bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional Probab=100.00 E-value=1.4e-31 Score=217.80 Aligned_cols=322 Identities=19% Similarity=0.216 Sum_probs=227.2 Q ss_pred CEEEECCCEEEEECCCEEE------CCCEEEEEEECCC----EEEEEEECCCCEEEEEECCCCCCC-CCCCCCCCCCCCC Q ss_conf 0899858179997445440------7954899876672----799999848982999955766675-3210012333343 Q gi|254781018|r 10 HAHSIAPAKVILSGEYSSL------YGASALAVAITFY----LRALLTTIEPSLIRIINSQTIEYS-FEACRLLGHQIDQ 78 (337) Q Consensus 10 ~~~~~APGKi~L~GEh~v~------~G~~ai~~aI~~~----~~v~i~~~~~~~~~~~s~~~~~~~-~~~~~~~~~~~~~ 78 (337) .+.+++|-||.|.|.++|. +||.|+.+||.++ +.+.+++.++..+++.+....... ...++...+-.++ T Consensus 609 iv~~r~PvRIDlaGGWTDTPPy~~E~GG~VlNvAI~l~G~~Pi~~~v~~~~e~~i~l~s~d~g~~~~~~s~~~l~~y~~p 688 (974) T PRK13412 609 IVWGRSPVRIDLAGGWTDTPPYCLYSGGNVVNIAIELNGQPPLQVYVKPCSEFHIVLRSIDMGAMEVVRTNEELQDYKKV 688 (974) T ss_pred EEEECCCEEEECCCCCCCCCCCHHCCCCEEEEEEEEECCCCCCEEEEEECCCCEEEEEECCCCCEEEECCHHHHHCCCCC T ss_conf 25630754884057736899431105967999999867977736898506886589995278754895566786154789 Q ss_pred CCCCCCCCCCCHHHHHHHHHHH-HHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 3211111221033477788899-999875115667887699998055410389853037899999999981877540258 Q gi|254781018|r 79 RHRDFIAKKIPITSVLTHPNDL-ILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEK 157 (337) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~ 157 (337) ..+-.+.+.......+...... -...+......+..||+++..+++|.|+|||+||.++++++.|+.++++...+ .+. T Consensus 689 ~~pfaL~KAal~~~G~~p~~~~~~~~sL~e~L~~fg~GlEi~t~s~lP~GSGLGTSSILA~avl~Al~~~~g~~~d-~~~ 767 (974) T PRK13412 689 GSPFSIPKAALCLAGFAPAFSAESYASLEEQLKAFGSGIEITLLAAIPAGSGLGTSSILASTVLGAISDFCGLAWD-KNE 767 (974) T ss_pred CCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHH T ss_conf 9951478899998178532123563239999986089658886237888788638899999999999998389988-899 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEECCCCCCC----CC----------CCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 9999998641157787734578765326302402785200----14----------788439997497145379999999 Q gi|254781018|r 158 LIAETGYIERLQHGKTGIIDSTTIVMGGIIYINPPQIIKN----ET----------LVGEWWAINTGMPESSTGECVSFV 223 (337) Q Consensus 158 ~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~~~~~~----~~----------~~~~~~~i~~~~~~~st~~~~~~~ 223 (337) +.......|+.+...+|.|||+...|+|+.+.+..+.... .. +....++++|++.+.....+...+ T Consensus 768 l~~~vL~lEQlLtTGGGWQDQ~GG~~~GiK~~~s~pg~~~~~~V~~L~~~~~~~~el~~~llLvyTG~tRlAk~iL~~VV 847 (974) T PRK13412 768 ICNRTLVLEQLLTTGGGWQDQYGGVLPGVKLLQTEAGFQQSPLVRWLPDSLFTQPEYEDCHLLYYTGITRTAKGILAEIV 847 (974) T ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCCCCEEEEECCCCCCCCCEEEEECCCHHHHHHHHHCEEEEECCCCHHHHHHHHHHH T ss_conf 99999999988615882554016742660466337998666246760561766898861479999183065768999999 Q ss_pred HHHHCCCH----HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC--CCCCHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 97641206----89998765489999998387216899999959999971--8997799999999985798089507886 Q gi|254781018|r 224 EQHFSQSS----IWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI--GVVPETVGKFIRCIENNGGSAKIAGAGS 297 (337) Q Consensus 224 ~~~~~~~~----~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l--~vs~~~l~~l~~~~~~~g~~ak~sGAG~ 297 (337) +..+.+.+ .++++.+.+.+|.+|+.++|++.++++|+++|...+.+ ++++|.++.|++..+++-.|.|+.|||| T Consensus 848 r~~~~r~~~~l~~l~~l~~~A~~m~~Al~~g~~~~~g~~l~~~W~q~k~Ldpg~~~~~V~~l~~~i~p~~~G~kL~GAGG 927 (974) T PRK13412 848 SSMFLNSTAHLQLLHEMKAHALDMAEAIQRGEFEEFGRLVGKTWEQNKALDSGTNPAAVEAIIETIKDYTLGYKLPGAGG 927 (974) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99984578899999999999999999998289999999999999987500889995899999997522331041455677 Q ss_pred CCCCEEEEEEEECCCHHHHHHH----------CCCEEEEEEECCCCEEEC Q ss_conf 7885089999857997999974----------598699999879970929 Q gi|254781018|r 298 IRGEKAGLVLVGGYNPEKLSSL----------YGYTCYKIKEDKNGTMLA 337 (337) Q Consensus 298 ~~Gg~~~~~l~~~~~~~~~~~~----------~~~~~~~~~~~~~G~~i~ 337 (337) || ...++ .++++.+.++ -+-+.+-..++.+|.+|| T Consensus 928 --GG-fl~~~--aKdp~~a~~ir~~L~~~~p~~~arfve~s~s~~g~~vs 972 (974) T PRK13412 928 --GG-YLYMV--AKDPGAAERIRKILTEQAPNPRARFVEMSLSDKGLQVS 972 (974) T ss_pred --CC-EEEEE--ECCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEC T ss_conf --76-79999--74877899999999737989982899988637675331 No 16 >KOG0631 consensus Probab=99.98 E-value=1.2e-29 Score=205.52 Aligned_cols=303 Identities=17% Similarity=0.143 Sum_probs=181.7 Q ss_pred CCCCCCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCC-CCCCCCCCCCCCCCCCC Q ss_conf 43579960899858179997445440795489987667279999984898299995576667-53210012333343321 Q gi|254781018|r 3 QQTSQTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEY-SFEACRLLGHQIDQRHR 81 (337) Q Consensus 3 ~~~~~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~-~~~~~~~~~~~~~~~~~ 81 (337) +.|+.+|+.+|++||||+|+|||.||+|+.|++||||..+.+.+.++++....+....+... ....+..+. T Consensus 32 ~~~~~kp~~~a~~PgRVnLiGEHiDy~~~sVlpmaid~~~l~~~~~~~d~~~sl~~tN~~~~f~~~~~~~p~-------- 103 (489) T KOG0631 32 AAYGAKPVFVARAPGRVNLIGEHIDYCGYSVLPMAIDVDTLIAVAPSDDGIVSLRLTNFNPDFIYFKYPLPS-------- 103 (489) T ss_pred HHHCCCCEEEEECCCCEECCCCEEEECCCEEEEEEEEEEEEEEEEECCCCCEEEEEECCCCCCCEEECCCCC-------- T ss_conf 862898608985687400132113405733545776410079997757884468973279863014634770-------- Q ss_pred CCCCCCCCHHHHHHHH---HHHHHHHHHHCCCCC--CCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH-CCCC-CC Q ss_conf 1111221033477788---899999875115667--887699998055410389853037899999999981-8775-40 Q gi|254781018|r 82 DFIAKKIPITSVLTHP---NDLILYIFNRHLPHI--SCGISLDLHSTIPIGSGFGSSSAIISALSLALSSIT-HQPF-CN 154 (337) Q Consensus 82 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~--~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~-~~~~-~~ 154 (337) ..........+|..|. ...+...+.+..... +.|+.+..++++|.|+||+||||++++...|..++. +.+. .. T Consensus 104 ~~~~I~~~~~~w~ny~~C~~~g~h~~~~~~~~~~~~~vGl~~l~~g~vPtgsgLsSsaa~~c~a~lA~~~~~~gpn~~~~ 183 (489) T KOG0631 104 IVWQIDPDVSKWENYFYCGMKGFHEYIKRKPVRFEPPVGLSILNDGSVPTGSGLSSSAAWLCAAALATLKLNLGPNFIIS 183 (489) T ss_pred HHCCCCCCCCCHHHHHCCCHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 11004799630221002551889999721553337876538986478789988443589999999999997608873421 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCC---EEEECC--CC--CCCCCCCCCEEEEECCCCCCCH----------- Q ss_conf 25899999986411577877345787653263---024027--85--2001478843999749714537----------- Q gi|254781018|r 155 KEKLIAETGYIERLQHGKTGIIDSTTIVMGGI---IYINPP--QI--IKNETLVGEWWAINTGMPESST----------- 216 (337) Q Consensus 155 ~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~---~~~~~~--~~--~~~~~~~~~~~~i~~~~~~~st----------- 216 (337) +.+...+....|+..+-+++||||+++.++-- ++.++. +. ..++....-.++|......... T Consensus 184 kkd~~~i~~~ae~~~G~~~gGmdq~asvl~~~~~Al~v~~~~~Pf~~~~lk~~~~~vfvI~~~L~~~nk~~~a~tnynlR 263 (489) T KOG0631 184 KKDLATITVVAESYIGLNSGGMDQAASVLAEKGHALLVDPYFTPFRRSMLKLPDGGVFVIANSLVESNKAETAETNYNLR 263 (489) T ss_pred HHHHHCCEEEEECCCCCCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEEEECHHHHHCCHHHHHHHHHCE T ss_conf 55540521786312675877478899998733745884166785445565577786688834366532301256652010 Q ss_pred -------------------H------------------------HHHHHHHHHHCCC----------------HHH---- Q ss_conf -------------------9------------------------9999999764120----------------689---- Q gi|254781018|r 217 -------------------G------------------------ECVSFVEQHFSQS----------------SIW---- 233 (337) Q Consensus 217 -------------------~------------------------~~~~~~~~~~~~~----------------~~~---- 233 (337) . ++...++...... .++ T Consensus 264 v~E~~ia~~~la~k~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~v~~~~~~e~f~~ee~~~~l~~~~~~f~~~~~ 343 (489) T KOG0631 264 VVEGTIAAGELAAKILVELPAYILRYQLQRAWRGDIGEGYERAEEMLGLVEESLKPEGFNIEEVARALGLDTEEFLQSLL 343 (489) T ss_pred EEEEEHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHC T ss_conf 57654026778777603428888765443013565306677777787777865176788888987785304588998760 Q ss_pred -------------HHHHH------HHHHHHHHHHCC------CCHHHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHCC Q ss_conf -------------99876------548999999838------72168999999599999-71899779999999998579 Q gi|254781018|r 234 -------------SEFNA------VTNEMMENIQKK------DAEKTYKSVRVNQSLLE-SIGVVPETVGKFIRCIENNG 287 (337) Q Consensus 234 -------------~~~~~------i~~~~~~al~~~------d~~~l~~~~~~~~~ll~-~l~vs~~~l~~l~~~~~~~g 287 (337) ++-.+ ++.+...+..+. .+..+|++|+++|...+ .++.|+|++++|++++++.| T Consensus 344 T~~~v~~~~~k~~~rakHv~sea~rv~q~~~~~~~a~~~~d~~~~~~g~LmneS~~Sc~~~yEcscpel~qL~kiala~g 423 (489) T KOG0631 344 TLAAVDLQVKKLYQRAKHVYSEALRVLQEEKLCARAPGRADGFLADFGRLMNESHRSCDVLYECSCPELDQLCKIALANG 423 (489) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC T ss_conf 24213458888898899999888888878999854963200008888888615567888877548872999999999668 Q ss_pred C-EEEECCCCCCCCCEEEEEEEECCCHHHH Q ss_conf 8-0895078867885089999857997999 Q gi|254781018|r 288 G-SAKIAGAGSIRGEKAGLVLVGGYNPEKL 316 (337) Q Consensus 288 ~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~ 316 (337) + ++++|||| +|+|...+.+.++.+.+ T Consensus 424 ~~gaRlTGaG---wGGc~v~lvp~d~~~~~ 450 (489) T KOG0631 424 GVGARLTGAG---WGGCTVALVPADLVDFA 450 (489) T ss_pred CCCCEEECCC---CCCCEEEECCCCCHHHH T ss_conf 8655363255---46400120453205889 No 17 >PRK01212 homoserine kinase; Provisional Probab=99.93 E-value=1.9e-23 Score=166.55 Aligned_cols=277 Identities=19% Similarity=0.251 Sum_probs=182.2 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) +.+++|+-.==+|- |.-++.+|+++|-++++++.+. .+++......+ ++ .|.. +.. T Consensus 4 v~v~vPAtsANlG~-----GFD~lg~Al~l~d~v~v~~~~~-~~~~~g~~~~~-------lP---~d~~--------Nlv 59 (293) T PRK01212 4 VKVYVPASSANLGP-----GFDSLGLALSLYDEVTVEEADF-SLEVEGEFADK-------LP---LDEE--------NLV 59 (293) T ss_pred EEEEECCCHHHCCC-----CHHHHHCCCCCCCEEEEEECCC-EEEEEECCCCC-------CC---CCCC--------CHH T ss_conf 99993243663573-----7466654567775999998687-89998168666-------99---9998--------179 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 34777888999998751156678876999980554103898530378999999999818775402589999998641157 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQH 170 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~ 170 (337) .. .+...... .+ ...|++|+++++||+++|||||||.++|.+.+++++++++++ .+++.++|...|. T Consensus 60 ~~-------a~~~~~~~-~g-~~~g~~I~i~k~IP~~~GLGsssa~aaa~l~a~n~l~~~~ls-~~eL~~lA~~~e~--- 126 (293) T PRK01212 60 YQ-------AALKFLEK-LG-KPPGLRLELEKNIPLGRGLGSSAASIVAALVAANELLGLPLS-KEELLQLAGEGEG--- 126 (293) T ss_pred HH-------HHHHHHHH-HC-CCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHCC--- T ss_conf 99-------99999998-39-998879999718986566575188999999999998179989-8999999998426--- Q ss_pred CCCCCHHHHHHHHCCCEEEECCCC---CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 787734578765326302402785---20014788439997497145379999999976412068999876548999999 Q gi|254781018|r 171 GKTGIIDSTTIVMGGIIYINPPQI---IKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMENI 247 (337) Q Consensus 171 ~~~sg~D~~~~~~gg~~~~~~~~~---~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al 247 (337) ++.++++|++||......... ..++.++..++++..+....+|.+.+..+++.+...+.+....+ ...+..+| T Consensus 127 ---H~DNv~P~~~GG~~~~~~~~~~~~~~~~~~~~~~~vl~~P~~~vsT~~aR~~Lp~~~~~~dav~~~~~-~~~l~~al 202 (293) T PRK01212 127 ---HPDNVAPALLGGLVLALEENGVISVKIPVFDDWKWVVAIPGIELSTAEARAVLPKQYSRKDAVFNSSR-LALLVAAL 202 (293) T ss_pred ---CCCCCCCEEECCEEEEECCCCEEEEECCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHH-HHHHHHHH T ss_conf ---76655774526778996279648997478876489999489674489999856032887898865534-37889999 Q ss_pred HCCCCHHHHHHHHH--HHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCH-HHHH----HH Q ss_conf 83872168999999--599999718997799999999985798-0895078867885089999857997-9999----74 Q gi|254781018|r 248 QKKDAEKTYKSVRV--NQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNP-EKLS----SL 319 (337) Q Consensus 248 ~~~d~~~l~~~~~~--~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~-~~~~----~~ 319 (337) .++|++.+.+.|+. +|..+..+ .|.++++.+.+.+.|+ ++.|||||+ +.+.+..+.+. +++. +. T Consensus 203 ~~~d~~~l~~~~~D~lhep~r~~l---iP~~~~i~~~~~~~GA~~~~lSGSGp-----tv~al~~~~~~A~~i~~~l~~~ 274 (293) T PRK01212 203 YTGDYELAGAAMKDVLHEPYRAKL---IPGFAEVRQAALEAGALGAGISGAGP-----TVFALCDKEDAAEAVADALEKA 274 (293) T ss_pred HCCCHHHHHHHHCCHHHHHHHHHH---CCCHHHHHHHHHHCCCCEEEEECHHH-----HEEEEECCHHHHHHHHHHHHHH T ss_conf 839999999874002235678763---85699999999978997899987411-----1248977889999999999997 Q ss_pred C--CCEEEEEEECCCCEEE Q ss_conf 5--9869999987997092 Q gi|254781018|r 320 Y--GYTCYKIKEDKNGTML 336 (337) Q Consensus 320 ~--~~~~~~~~~~~~G~~i 336 (337) + .+-++.+++|.+|++| T Consensus 275 ~~~~~~v~~l~~d~~G~~v 293 (293) T PRK01212 275 FLQEGFVHVLRLDTAGARV 293 (293) T ss_pred HHHCCEEEEECCCCCCCCC T ss_conf 6438889990469998889 No 18 >COG0083 ThrB Homoserine kinase [Amino acid transport and metabolism] Probab=99.90 E-value=1.6e-20 Score=148.13 Aligned_cols=264 Identities=19% Similarity=0.244 Sum_probs=183.7 Q ss_pred CCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 95489987667279999984898299995576667532100123333433211111221033477788899999875115 Q gi|254781018|r 30 GASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPITSVLTHPNDLILYIFNRHL 109 (337) Q Consensus 30 G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (337) |.-++.+|++++.++.+....+ .+++..+.... ...+.++.+ +. ...+....... T Consensus 18 GFD~lGlAl~~~~~~~v~~~~~-~~~i~~~g~~~--------~~iP~~~~n-------~~--------~~~~~~~~~~~- 72 (299) T COG0083 18 GFDVLGLALDLYNDVVVVEVVD-KFEIEVEGEGA--------DKIPLDPEN-------LV--------YQAALKFLEAL- 72 (299) T ss_pred CCCCEEEECCCCCCEEEEEECC-CEEEEEECCCC--------CCCCCCCCE-------EH--------HHHHHHHHHHH- T ss_conf 7000012014557279997047-37999834656--------679998300-------09--------99999999982- Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEE Q ss_conf 66788769999805541038985303789999999998187754025899999986411577877345787653263024 Q gi|254781018|r 110 PHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYI 189 (337) Q Consensus 110 ~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~ 189 (337) ....++++.++++||.++|||||||.+||.+.++|.+++.+++ ++++.+++...|. ++.++++|.+||.... T Consensus 73 -~~~~~~~i~i~k~IP~~rGLGSSaAsiVAal~aan~l~~~~L~-~~~ll~~a~~~Eg------HpDNVapa~lGG~~l~ 144 (299) T COG0083 73 -GIEAGVKIRIEKGIPLGRGLGSSAASIVAALAAANELAGLPLS-KEELLQLALEIEG------HPDNVAPAVLGGLVLV 144 (299) T ss_pred -CCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHCC------CCCHHHHHHHCCEEEE T ss_conf -9986579999769987778758999999999999998459998-9999999997558------9710777760787998 Q ss_pred EC-CCCC--CCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH--HHHH Q ss_conf 02-7852--00147884399974971453799999999764120689998765489999998387216899999--9599 Q gi|254781018|r 190 NP-PQII--KNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVR--VNQS 264 (337) Q Consensus 190 ~~-~~~~--~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~--~~~~ 264 (337) .. .+.. .+..++...+++..+....+|.+.++.+.+.+...+.+....+. .-++.+|.++|.+.+...|+ .+|. T Consensus 145 ~~~~~~~~~~v~~~~~~~~v~~iP~~e~sT~~aR~vLP~~~~~~daV~n~s~~-a~lv~al~~~~~~l~~~~~~D~ihep 223 (299) T COG0083 145 EEESGIISVKVPFPSDLKLVVVIPNFEVSTAEARKVLPKSYSRKDAVFNLSRA-ALLVAALLEGDPELLRAMMKDVIHEP 223 (299) T ss_pred EECCCCEEEECCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCHHHHHHHHHHH-HHHHHHHHCCCHHHHHHHHCCCCCHH T ss_conf 61587347971587646899990896565799997465668889999999999-99999997699999998703601046 Q ss_pred HHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHH--------CCCEEEEEEECCCCEE Q ss_conf 999718997799999999985798-0895078867885089999857997999974--------5986999998799709 Q gi|254781018|r 265 LLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL--------YGYTCYKIKEDKNGTM 335 (337) Q Consensus 265 ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~--------~~~~~~~~~~~~~G~~ 335 (337) .|.+| -|.++++.+.+.+.|+ ++-+||||++ .+++....+.+....+ .....+-+.++.+|+. T Consensus 224 yR~~L---~P~~~~v~~~a~~~gA~g~~lSGAGPT-----i~al~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~G~~ 295 (299) T COG0083 224 YRAKL---VPGYAEVREAALEAGALGATLSGAGPT-----VFALADESDAEKAAALLEELYEQGIKGRVHILALDSDGAR 295 (299) T ss_pred HHHHH---CCCHHHHHHHHHHCCCEEEEEECCCCE-----EEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCE T ss_conf 66532---701899999986479659999647870-----7998454206679999999997288506999721588616 No 19 >PRK03188 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.90 E-value=3.8e-21 Score=151.99 Aligned_cols=280 Identities=13% Similarity=0.139 Sum_probs=178.7 Q ss_pred EEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997----44544-0795489987667279999984898299995576667532100123333433211111 Q gi|254781018|r 11 AHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 11 ~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) +...||+||||+ |-..| ||.-..+..+|+.+-++++++.++..+.+....... ++ .+. T Consensus 1 it~~apAKINL~L~V~~kr~dGyH~l~s~~~~i~l~D~i~i~~~~~~~~~~~~~~~~~-------~P---~d~------- 63 (299) T PRK03188 1 VTVRAPGKVNLHLGVGPLREDGYHELATVFQAVSLYDEVTVTPADDLSLTVSGEGADQ-------VP---TDE------- 63 (299) T ss_pred CEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEECCCEEEEEEECCCCEEEEECCCCCC-------CC---CCC------- T ss_conf 9797501282040338879999851358999936871999999999829995787344-------88---998------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) .+...+.. ..+.... ....|++|+++++||+++|||++||.+++++.+++++++.+++. +++.+++..+ T Consensus 64 -~Nlv~kA~--------~~~~~~~-~~~~~~~I~l~K~IP~gaGLGGGSsdAAa~L~~ln~l~~l~ls~-~eL~~lA~~l 132 (299) T PRK03188 64 -SNLAWRAA--------ELLAEHV-GRAPDVHIHIDKGIPVAGGMAGGSADAAAALVACDALWGLGLSR-DELLALAAEL 132 (299) T ss_pred -CCHHHHHH--------HHHHHHH-CCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHH T ss_conf -64999999--------9999972-99997699999579776777740789999999999971799798-9999999861 Q ss_pred HHCCCCCCCCHHHHHHHHCCCEEEECC-CC-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 411577877345787653263024027-85-2001478843999749714537999999997641206899987654899 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMGGIIYINPP-QI-IKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEM 243 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~gg~~~~~~~-~~-~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~ 243 (337) |.|+..|.+||....... +. ..+...+..++++..+....+|.++++............... .....+ T Consensus 133 ---------GsDVPffL~gg~a~~~G~Ge~l~pl~~~~~~~~vlv~P~~~vST~~ay~~ld~~~~~~~~~~~~-~~~~~l 202 (299) T PRK03188 133 ---------GSDVPFALLGGTALGTGRGEELAPVLARGTFHWVLAFADGGLSTPAVFRELDRLREAGPDPPRL-GDPEPL 202 (299) T ss_pred ---------CCCCCEEEECCCEEEEECCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCC-CCHHHH T ss_conf ---------7887637627868999748676888767863799986899947299999777642366776566-787999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCH-HHH-HHH Q ss_conf 9999838721689999995999997189-97799999999985798-0895078867885089999857997-999-974 Q gi|254781018|r 244 MENIQKKDAEKTYKSVRVNQSLLESIGV-VPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNP-EKL-SSL 319 (337) Q Consensus 244 ~~al~~~d~~~l~~~~~~~~~ll~~l~v-s~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~-~~~-~~~ 319 (337) ..+|.++|++.+.+.|... ++..-. -.|++.++.+.+++.|+ ++.|||||++ .|.|..+.+. +.+ ..+ T Consensus 203 ~~al~~gd~~~l~~~l~Nd---Le~~a~~l~P~i~~i~~~l~~~Ga~~a~mSGSGsT-----vFalf~~~~~A~~~a~~l 274 (299) T PRK03188 203 LAALAAGDPAQLAPLLGND---LQAAALSLRPELRRTLRAGEEAGALAGIVSGSGPT-----CAFLCASADSAVDVAAAL 274 (299) T ss_pred HHHHHCCCHHHHHHHHCCC---HHHHHHHCCCHHHHHHHHHHHCCCCEEEEECCCCC-----EEEEECCHHHHHHHHHHH T ss_conf 9999839999998871686---06778771909999999999679987899784401-----027978999999999999 Q ss_pred C--C--CEEEEEEECCCCEEE Q ss_conf 5--9--869999987997092 Q gi|254781018|r 320 Y--G--YTCYKIKEDKNGTML 336 (337) Q Consensus 320 ~--~--~~~~~~~~~~~G~~i 336 (337) . + .+.+-..-+-.|.++ T Consensus 275 ~~~g~~~~~~~~~~~~~~~~~ 295 (299) T PRK03188 275 SGAGVCRTVRVATGPVPGARV 295 (299) T ss_pred HHCCCEEEEEECCCCCCCCEE T ss_conf 757930489990588888562 No 20 >TIGR00191 thrB homoserine kinase; InterPro: IPR000870 Saccharomyces cerevisiae strains containing the erg8-1 mutation are temperature sensitive for growth due to a defect in phosphomevalonate kinase, an enzyme of isoprene and ergosterol biosynthesis. Subcloning and DNA sequencing have defined the functional ERG8 regulon as an 850bp upstream region and an adjacent 1,272bp open reading frame. The deduced ERG8 protein contains 424 residues and shows no similarity to known proteins, except within a putative ATP-binding domain present in many kinases . Enzymes that share the N-terminal Gly/Ser-rich putative ATP-binding region include galactokinase, homoserine kinase, mevalonate kinase and phosphomevalonate kinase. Homoserine kinase is a homodimeric enzyme involved in threonine biosynthesis. Sequence comparison of the yeast enzyme with the corresponding proteins from bacterial sources reveals the presence of several highly conserved regions, the pattern of occurrence of which suggests that the ancestral sequences might have been composed from separate (functional) domains. A block of similar residues, found towards the C-terminus, is also present in many other proteins involved in threonine (or serine) metabolism; this motif may therefore represent the binding site for the hydroxy-amino acids. Limited similarity was detected between a motif conserved among the homoserine kinases and consensus sequences found in other mono- or dinucleotide-binding proteins .; GO: 0004413 homoserine kinase activity, 0005524 ATP binding, 0006566 threonine metabolic process. Probab=99.90 E-value=4.2e-21 Score=151.74 Aligned_cols=215 Identities=20% Similarity=0.233 Sum_probs=169.1 Q ss_pred CCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEE-E Q ss_conf 788769999805541038985303789999999998187754025899999986411577877345787653263024-0 Q gi|254781018|r 112 ISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYI-N 190 (337) Q Consensus 112 ~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~-~ 190 (337) ...|++++++++||+++|||||||.+||.+.+++++++++++ +.++.++|...|....|.-+..++++|++||+... . T Consensus 85 ~~~p~k~~~~~~iP~grGLGSSAaaiVaa~~a~ne~~g~~l~-~~~~L~~a~~~E~~~SGs~HpDNvaPa~~GG~~~~~~ 163 (359) T TIGR00191 85 FIKPVKLTLEKNIPLGRGLGSSAAAIVAALLAANELAGLPLS-KEELLDLALELEGRISGSIHPDNVAPALLGGFQLAVK 163 (359) T ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCEEEEEE T ss_conf 888768998538510136663899999999999998388989-8999999999864305775713467885166279884 Q ss_pred CCCC------------CCCCCC--------------------------CCCEEEEECCCCCCCHHHHHHHHHHHHCCCHH Q ss_conf 2785------------200147--------------------------88439997497145379999999976412068 Q gi|254781018|r 191 PPQI------------IKNETL--------------------------VGEWWAINTGMPESSTGECVSFVEQHFSQSSI 232 (337) Q Consensus 191 ~~~~------------~~~~~~--------------------------~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~ 232 (337) ..+. ..+..+ ....+++.-+...-||++.+..+.+.+...+. T Consensus 164 ~~~~Pqeverrei~~~~~~~~P~~GedtGlvPPlPPtdiGrhvkyqWn~~~~~v~aIP~~~~sT~~AR~vLPk~Y~~~d~ 243 (359) T TIGR00191 164 EDDKPQEVERREIVLLEVLKIPIFGEDTGLVPPLPPTDIGRHVKYQWNSKLKVVLAIPNIEVSTAEARAVLPKAYPRQDL 243 (359) T ss_pred ECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEECCCCCCCHHHHHHHCCCCCCHHHH T ss_conf 27987611356665666414889876665679778522344200201676517998178420568896205554678899 Q ss_pred HHHHHHHHHHHHHHHHCCC-CHHHHHHHH---HHHHHHHHCCCCCHHHHHHH---HHHHHCCC-EEEECCCCCCCCCEEE Q ss_conf 9998765489999998387-216899999---95999997189977999999---99985798-0895078867885089 Q gi|254781018|r 233 WSEFNAVTNEMMENIQKKD-AEKTYKSVR---VNQSLLESIGVVPETVGKFI---RCIENNGG-SAKIAGAGSIRGEKAG 304 (337) Q Consensus 233 ~~~~~~i~~~~~~al~~~d-~~~l~~~~~---~~~~ll~~l~vs~~~l~~l~---~~~~~~g~-~ak~sGAG~~~Gg~~~ 304 (337) +....+..- ++.|+.++| ...+++.|. .+|-.|++| -|.+.++. +.+++.|+ +.-+||||++ . T Consensus 244 v~n~~~l~~-l~~A~~~~dP~~~l~~~~~kD~~hqPyR~~L---~P~l~~i~sCv~~~~~~Ga~~~~iSGsGPT-----i 314 (359) T TIGR00191 244 VFNLSHLAG-LVHALYQKDPKPDLLAIMMKDRIHQPYRESL---IPNLFKIKSCVQAALEKGALGITISGSGPT-----I 314 (359) T ss_pred HHHHHHHHH-HHHHHCCCCCHHHHHHHHHCCCCCCCCCCCC---CCHHHHHHHHHHHHHHCCCEEEEECCCCHH-----H T ss_conf 998877877-7756107996289999973385011634223---614899899999997358648871120105-----5 Q ss_pred EEEEECCC--HHHHHHH--------CCCEEEEEEECCCCEEE Q ss_conf 99985799--7999974--------59869999987997092 Q gi|254781018|r 305 LVLVGGYN--PEKLSSL--------YGYTCYKIKEDKNGTML 336 (337) Q Consensus 305 ~~l~~~~~--~~~~~~~--------~~~~~~~~~~~~~G~~i 336 (337) +.++.... +++..++ ....+....+|.+|+.| T Consensus 315 la~~~e~~G~~~~~~~~~~~~~~~~~e~~v~~~~~d~~G~~~ 356 (359) T TIGR00191 315 LALADEEFGTEQKAQDLLENLHKQGIEGTVHVLEIDTDGARV 356 (359) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCEE T ss_conf 516888877577769998777641653244420233576146 No 21 >PRK00128 ipk 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.89 E-value=2.3e-20 Score=147.06 Aligned_cols=239 Identities=15% Similarity=0.157 Sum_probs=166.0 Q ss_pred EEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99858179997----44544-07954899876672799999848982999955766675321001233334332111112 Q gi|254781018|r 12 HSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAK 86 (337) Q Consensus 12 ~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (337) +.+||+||||+ |-..| ||.-..+...|+.+-++++++.++..+.+..+... . +.+. T Consensus 2 ~~kapAKINL~L~V~~kr~dGyH~l~sl~~~i~l~D~i~i~~~~~~~i~~~~~~~~--------~---p~~~-------- 62 (281) T PRK00128 2 LEKAPAKINLSLDVLGKREDGYHEVEMIMTTIDLADRLEIEKLKEDKIVVESNSGY--------V---PNDE-------- 62 (281) T ss_pred EEECCCCEECCEECCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEECCCC--------C---CCCC-------- T ss_conf 06214558344302883899988356899991687299999978998899978887--------8---8898-------- Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 21033477788899999875115667887699998055410389853037899999999981877540258999999864 Q gi|254781018|r 87 KIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIE 166 (337) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e 166 (337) .+...+.. ..+ +.......|++++++.+||+++|||++||.+++++.+++++++.+++.. ++.+++..+ T Consensus 63 ~Nlv~ka~--------~~l-~~~~~~~~~~~I~l~K~IP~gaGLGGGSSdAAa~L~~ln~l~~~~ls~~-eL~~la~~i- 131 (281) T PRK00128 63 RNLAYKAA--------KLL-KERYNIKKGVRITIDKNIPVAAGLAGGSSDAAATLRGLNKLWNLGLSLE-ELAEIGLKI- 131 (281) T ss_pred CCHHHHHH--------HHH-HHHHCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHH-HHHHHHHHH- T ss_conf 56999999--------999-9974999874899985798667778512889999999999847999999-999999872- Q ss_pred HCCCCCCCCHHHHHHHHCCCEEEECCC-C-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 115778773457876532630240278-5-20014788439997497145379999999976412068999876548999 Q gi|254781018|r 167 RLQHGKTGIIDSTTIVMGGIIYINPPQ-I-IKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMM 244 (337) Q Consensus 167 ~~~~~~~sg~D~~~~~~gg~~~~~~~~-~-~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 244 (337) |.|+..|.+|+....+... . ..++.++..++++..+....||+++++.+.......+ -+..+. T Consensus 132 --------GaDVpffl~g~~a~~~G~Ge~i~~l~~~~~~~~vLv~P~~~vsT~~vy~~~~~~~~~~~-------~~~~~~ 196 (281) T PRK00128 132 --------GSDVPFCIYGGTALATGRGEKITPLKPPPSCWVVLAKPDIGVSTKDVYKNLDLDKIYHP-------DTEKLI 196 (281) T ss_pred --------CCCCCCHHCCCCEEEECCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCCC-------CHHHHH T ss_conf --------78844110477569855896767888788858999889999786999985280026787-------768899 Q ss_pred HHHHCCCCHHHHHHHHHH-HHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCC Q ss_conf 999838721689999995-99999718997799999999985798-0895078867 Q gi|254781018|r 245 ENIQKKDAEKTYKSVRVN-QSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSI 298 (337) Q Consensus 245 ~al~~~d~~~l~~~~~~~-~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~ 298 (337) .+++++|++.+...+.-. +..... ..|++.++.+.+++.|+ ++.|||+|++ T Consensus 197 ~~l~~~d~~~~~~~l~Ndle~~a~~---~~P~i~~i~~~l~~~Ga~~~~mSGSGst 249 (281) T PRK00128 197 EAIENGDYQGICASMGNVLEPVTLK---KHPEIAKIKERMLKFGADGALMSGSGPT 249 (281) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHH---HCHHHHHHHHHHHHCCCCEEEEECCCCE T ss_conf 9886289999988634037999998---5959999999998579985898772121 No 22 >PRK01123 shikimate kinase; Provisional Probab=99.89 E-value=6e-20 Score=144.42 Aligned_cols=256 Identities=16% Similarity=0.102 Sum_probs=161.4 Q ss_pred CCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 95489987667279999984898299995576667532100123333433211111221033477788899999875115 Q gi|254781018|r 30 GASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPITSVLTHPNDLILYIFNRHL 109 (337) Q Consensus 30 G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (337) -+..=+++||+.++++++..++........ .+.... ...... ..+.....+ T Consensus 19 tg~G~a~~idL~~~a~v~~~~~~~~~~~~~----------------~~~~~~----~~~l~~-------~~~~~~~~~-- 69 (283) T PRK01123 19 TGKGAAFGIDLKTTATVELSDSGSKIKGEV----------------SGYPDA----DTRLVE-------RCVELVLER-- 69 (283) T ss_pred CCCCCEEEECCEEEEEEEECCCCCCCCCCC----------------CCCCCC----CHHHHH-------HHHHHHHHH-- T ss_conf 286516763645999999877776444200----------------578875----638999-------999999986-- Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCC--CCCHHHHHHHHCCCE Q ss_conf 667887699998055410389853037899999999981877540258999999864115778--773457876532630 Q gi|254781018|r 110 PHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGK--TGIIDSTTIVMGGII 187 (337) Q Consensus 110 ~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~--~sg~D~~~~~~gg~~ 187 (337) ...+.|+++.+.|+||+++|||||||.++|++.+++++++.+++ ++++.+++..++...+.. ++..|.++|++||+. T Consensus 70 ~~i~~g~~i~i~s~IP~~~GL~SSSA~a~A~i~a~~~~~~~~Ls-~~ei~~la~ea~~~~g~siTGa~DDa~As~~GG~~ 148 (283) T PRK01123 70 FGIDYGGTVRTESEIPVASGLKSSSAAANALVLATLDALGEKLD-DLDILRLGVKASKEAGVTVTGAFDDACASYFGGVC 148 (283) T ss_pred CCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCEE T ss_conf 29987759999558986678537899999999999998589989-99999999998620781523675116677449779 Q ss_pred EEECCCCCCCCCC-CCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 2402785200147-884399974971453799999999764120689998765489999998387216899999959999 Q gi|254781018|r 188 YINPPQIIKNETL-VGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLL 266 (337) Q Consensus 188 ~~~~~~~~~~~~~-~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll 266 (337) ..++.+...+... .+..++|..+....+|.+. +...+..+....+.+.+...++++. +.|..|..+. T Consensus 149 ltdn~~~~ii~~~~~~~~~vV~ip~~~~~t~~~---------~~~~~~~i~~~~~~a~~~~~~g~~~---~Am~~Ngl~~ 216 (283) T PRK01123 149 VTDNREMKILKRDEVEGPVVVLIPPEKAFSANV---------DVERMKLIAPYVEMAFELALAGDYF---KAMTLNGLLY 216 (283) T ss_pred EEECCCCEEEECCCCCCCEEEEECCCCCCCCCC---------CHHHHHHHHHHHHHHHHHHHCCCHH---HHHHHHHHHH T ss_conf 981887436651489985899989977737627---------6788877438999999998779898---9987502642 Q ss_pred -HHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHH-CC-CEEEEEEECCCCEEE Q ss_conf -9718997799999999985798-0895078867885089999857997999974-59-869999987997092 Q gi|254781018|r 267 -ESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL-YG-YTCYKIKEDKNGTML 336 (337) Q Consensus 267 -~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~-~~-~~~~~~~~~~~G~~i 336 (337) ..++... +++..+.+.|+ ++.|||+|+ ..+.+..+.+.+.+.+. .+ .+++-.+++.+|.+. T Consensus 217 ~~~l~~~~----~~~~~al~~GAlga~iSGsGP-----si~al~~~e~~~~v~ea~~~~G~vi~t~i~n~g~~~ 281 (283) T PRK01123 217 SSALGFPT----EPILDALEAGAVGAGLSGTGP-----SYVALFDEENPEEVKEAWSKYGKVIVTKINNRGARY 281 (283) T ss_pred HHHCCCCH----HHHHHHHHCCCEEEEEECCCC-----EEEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEE T ss_conf 45417987----999999975984999826898-----799998077679999999867988999565987355 No 23 >PRK04344 consensus Probab=99.88 E-value=2.6e-20 Score=146.76 Aligned_cols=241 Identities=19% Similarity=0.205 Sum_probs=165.0 Q ss_pred CEEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0899858179997----44544-079548998766727999998489829999557666753210012333343321111 Q gi|254781018|r 10 HAHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 10 ~~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) ....+||+||||+ |-..| ||.-..+..+|+++-++++++.+...+.+...... . +.+. T Consensus 2 ~i~~kapAKINL~L~V~~kr~dGyH~l~s~~~~i~l~D~i~i~~~~~~~i~i~~~~~~--------~---p~~~------ 64 (284) T PRK04344 2 KIYEKAPAKINLSLDTLFKRDDGYHEVEMIMTTIDLADRLTLEELKEDKIVVDSENRF--------V---PSDQ------ 64 (284) T ss_pred CEEEEECEEEECCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEECCCC--------C---CCCC------ T ss_conf 0516712128533322775899982268999995688299999988998899978876--------8---8876------ Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 12210334777888999998751156678876999980554103898530378999999999818775402589999998 Q gi|254781018|r 85 AKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGY 164 (337) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~ 164 (337) .+..... ...+.. ......|++|+++.+||+++|||++||.+++++++++++++++++. +++.+++.. T Consensus 65 --~Nlv~ka--------~~~l~~-~~~~~~~~~I~l~KnIPvgaGLGGGSSdAAa~L~~Ln~l~~l~ls~-~~L~~la~~ 132 (284) T PRK04344 65 --RNLAYQA--------AKLLKE-RYNIKKGVTISIDKHIPVAAGLAGGSSDAAATLRGLNRLWNLGLSL-EELAELGSE 132 (284) T ss_pred --CHHHHHH--------HHHHHH-HHCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHHH T ss_conf --2199999--------999999-7598997699999789886788986157999999999985689999-999999986 Q ss_pred HHHCCCCCCCCHHHHHHHHCCCEEEECC-C-CCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 6411577877345787653263024027-8-5200147884399974971453799999999764120689998765489 Q gi|254781018|r 165 IERLQHGKTGIIDSTTIVMGGIIYINPP-Q-IIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNE 242 (337) Q Consensus 165 ~e~~~~~~~sg~D~~~~~~gg~~~~~~~-~-~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~ 242 (337) + |.|+..|.+|+....+.. + ...++.++..++++..+....||+++++........... ... T Consensus 133 l---------GaDVPffl~g~~a~~~G~Ge~l~~l~~~~~~~~vlv~P~~~vST~~vy~~~~~~~~~~~~-------~~~ 196 (284) T PRK04344 133 I---------GSDVPFCVYGGTALATGRGEKIEPLPTPPSCWVVLAKPNIGVSTPDIFKNLNLDRIEHPD-------TDK 196 (284) T ss_pred C---------CCCEEEEEECCCEEEECCCCEEEECCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCCCC-------HHH T ss_conf 2---------798648982761899858965079987787589998899996739999745800046877-------789 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHCCC-EEEECCCCCC Q ss_conf 9999983872168999999599999718-997799999999985798-0895078867 Q gi|254781018|r 243 MMENIQKKDAEKTYKSVRVNQSLLESIG-VVPETVGKFIRCIENNGG-SAKIAGAGSI 298 (337) Q Consensus 243 ~~~al~~~d~~~l~~~~~~~~~ll~~l~-vs~~~l~~l~~~~~~~g~-~ak~sGAG~~ 298 (337) ...++.++|+..+...+. | .++..- -..|++.++.+.+++.|+ ++.|||||++ T Consensus 197 ~~~~~~~~~~~~~~~~~~-N--dle~~~~~~~P~i~~i~~~l~~~Ga~~~~mSGSGst 251 (284) T PRK04344 197 MVEAIENGDYQQICQSLG-N--VLEPVTLAKHPEIAKIKNRMLKFGADAALMSGSGPT 251 (284) T ss_pred HHHHHHCCCHHHHHHHCC-C--CHHHHHHHHCHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 999886279999997734-8--568999985939999999999679986899772512 No 24 >PRK02534 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.88 E-value=3.3e-19 Score=139.80 Aligned_cols=278 Identities=17% Similarity=0.194 Sum_probs=171.6 Q ss_pred CEEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0899858179997----44544-079548998766727999998489829999557666753210012333343321111 Q gi|254781018|r 10 HAHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 10 ~~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) .+...||+||||+ |--.| ||.-..+..+|+++-+++++++++..+.+..... .. ..+ T Consensus 3 ~~~l~apAKINL~L~V~~kr~dGyH~l~s~~~~i~l~D~i~i~~~~~~~i~~~~~~~--------~~---~~~------- 64 (308) T PRK02534 3 SYTLIAPAKINLHLEILGDRPDGFHELAMVMQSIDLADRLEFRNNGDGTIKLRCDHP--------DL---STD------- 64 (308) T ss_pred EEEEEECEEEECCEECCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEECCC--------CC---CCC------- T ss_conf 378980436810430188799998504789998168839999998999889986887--------68---889------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 12210334777888999998751156678876999980554103898530378999999999818775402589999998 Q gi|254781018|r 85 AKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGY 164 (337) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~ 164 (337) ..+.... ....+.........|++|++..+||+++|||++||.+++++++++++++.++++. ++.+++.. T Consensus 65 -~~Nlv~k--------a~~~l~~~~~~~~~gv~I~l~KnIPv~aGLGGGSsdAAa~L~~Ln~l~~~~ls~~-~L~~ia~~ 134 (308) T PRK02534 65 -DDNLIYR--------AAQLLRKRFPFAELGVDITLEKNIPIGAGLAGGSSDAAAVLVGLNLLWGLGLTQP-ELESLAAE 134 (308) T ss_pred -CCCHHHH--------HHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHH-HHHHHHHH T ss_conf -8649999--------9999999639998865899983688667888406899999999999847999989-99998751 Q ss_pred HHHCCCCCCCCHHHHHHHHCCCEEEECCCCC--CCCCCCCCEEEEE-CCCCCCCHHHHHHHHHHHHCC-----CHHHHHH Q ss_conf 6411577877345787653263024027852--0014788439997-497145379999999976412-----0689998 Q gi|254781018|r 165 IERLQHGKTGIIDSTTIVMGGIIYINPPQII--KNETLVGEWWAIN-TGMPESSTGECVSFVEQHFSQ-----SSIWSEF 236 (337) Q Consensus 165 ~e~~~~~~~sg~D~~~~~~gg~~~~~~~~~~--~~~~~~~~~~~i~-~~~~~~st~~~~~~~~~~~~~-----~~~~~~~ 236 (337) + |.|+..|.+|+..+....... .++..+..++++. .+....||+++++..++.... ...+... T Consensus 135 l---------GsDVPffl~g~~a~~~G~Ge~l~pl~~~~~~~~llv~~P~~~vST~~~y~~~~~~~~~~~~~~~~~~~~~ 205 (308) T PRK02534 135 L---------GSDVPFCIAGGTQLCFGRGEILEPLPDLDSLGVVLAKYPSLSVSTPWAYKTYRQQFGDTYLSDEEDFEQR 205 (308) T ss_pred C---------CCCCCCEECCCEEEEECCCCEEEECCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 1---------6885502208518983588676788877656499995899982749999999875034335760345555 Q ss_pred HHH--HHHHHHHHHCCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHH-CCC-EEEECCCCCCCCCEEEEEEEECC Q ss_conf 765--489999998387216899999959999971-8997799999999985-798-08950788678850899998579 Q gi|254781018|r 237 NAV--TNEMMENIQKKDAEKTYKSVRVNQSLLESI-GVVPETVGKFIRCIEN-NGG-SAKIAGAGSIRGEKAGLVLVGGY 311 (337) Q Consensus 237 ~~i--~~~~~~al~~~d~~~l~~~~~~~~~ll~~l-~vs~~~l~~l~~~~~~-~g~-~ak~sGAG~~~Gg~~~~~l~~~~ 311 (337) ... .....+.+...+...+.. -+++. --..|++..+.+.++. .|+ .+.|||+|++ .|.+..+. T Consensus 206 ~~~l~~~~~~~~~~~~~~~~l~N-------dLe~~a~~~~P~i~~~~~~L~~~~Ga~~~~mSGSGst-----vFglf~~~ 273 (308) T PRK02534 206 RQALRSGSLLQAISAKDPPPLRN-------DLQKVVLPEYPQVAKLLELLSSLPGCLGTMMSGSGPT-----CFALFEDQ 273 (308) T ss_pred HHHCCCCHHHHHHHHCCHHHHHH-------HHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEECHHHH-----HEEEECCH T ss_conf 54213106889887347898864-------3699988739499999999974479888899724100-----03797899 Q ss_pred -CHHHH-HHH------CCCEEEEEEECCCCEEE Q ss_conf -97999-974------59869999987997092 Q gi|254781018|r 312 -NPEKL-SSL------YGYTCYKIKEDKNGTML 336 (337) Q Consensus 312 -~~~~~-~~~------~~~~~~~~~~~~~G~~i 336 (337) +.+++ ..+ .++.++-+.++.+|..+ T Consensus 274 ~~A~~a~~~l~~~~p~~~~~~~v~~~~~~g~~~ 306 (308) T PRK02534 274 EQAEQALEQVREAFADPGLDAWVCQFISHGIQL 306 (308) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEECCCCCCEEE T ss_conf 999999999998678999868975141787374 No 25 >PTZ00299 homoserine kinase; Provisional Probab=99.87 E-value=2.4e-20 Score=146.99 Aligned_cols=281 Identities=16% Similarity=0.160 Sum_probs=183.8 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) +.+++|+-.==+|- |.-++.+|++.|..++++..+...+.+..+.... ...+.. +.. T Consensus 8 V~V~VPATsANLGp-----GFD~LGlAL~ly~ev~v~~~~~~~i~i~g~g~~~----------ip~d~~--------Nlv 64 (336) T PTZ00299 8 VVLRVPATTANIGP-----AYDTLGMALSIFMELTVEHADAFSMTVEGEGSEH----------ISTDED--------NMV 64 (336) T ss_pred EEEEECCCHHHCCC-----CHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCC----------CCCCCC--------CHH T ss_conf 99996744464452-----3675253036774899998897179997067777----------999961--------149 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC--CHHHHHHHHHHHHHC Q ss_conf 347778889999987511566788769999805541038985303789999999998187754--025899999986411 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFC--NKEKLIAETGYIERL 168 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~--~~~~~~~~a~~~e~~ 168 (337) ...+...+.+.......++++.++++||.++|||||||.+||.+.+.+.+.+.++. +++++.+++...| T Consensus 65 -------~~a~~~~~~~~~~~~~~~~~i~i~~~IP~~rGLGSSAaAiVaG~~aan~L~g~~l~~~~~~~ll~~a~~~E-- 135 (336) T PTZ00299 65 -------VQACRLAFEEYAHKSMPPLKFIMHSNIPYGCGCGSSSAAAVAGFVAGMKLCGLTMETENEEALLQAIAKFE-- 135 (336) T ss_pred -------HHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHC-- T ss_conf -------99999999985697889638998368887667513889999999999986268877558999999876216-- Q ss_pred CCCCCCCHHHHHHHHCCCEEEECCC---CC--CCCCCCCCEEEEECCC--CCCCHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 5778773457876532630240278---52--0014788439997497--145379999999976412068999876548 Q gi|254781018|r 169 QHGKTGIIDSTTIVMGGIIYINPPQ---II--KNETLVGEWWAINTGM--PESSTGECVSFVEQHFSQSSIWSEFNAVTN 241 (337) Q Consensus 169 ~~~~~sg~D~~~~~~gg~~~~~~~~---~~--~~~~~~~~~~~i~~~~--~~~st~~~~~~~~~~~~~~~~~~~~~~i~~ 241 (337) ++..++++|+|||+......+ .. +++.++...+++..+. ...+|.+.++.+++.....+.+..+.+.+ T Consensus 136 ----gHPDNVAPAllGG~~l~~~~~~~~~~~~~ip~p~~l~~Vv~iP~~~~~lsT~~AR~vLP~~v~~~daV~n~~r~a- 210 (336) T PTZ00299 136 ----GHPDNAAPAIYGGIQLVYKKDNGRFLTYRVPTPPNLSVVLFVPHNKMKANTHVTRNLIPTSVSLEDAVFNISRTS- 210 (336) T ss_pred ----CCCCCCCCCCCCCEEEEEECCCCCEEEEECCCCCCCEEEEEECCCCCEECHHHHHHHCCCCCCHHHHHHHHHHHH- T ss_conf ----897455432458869999728982799977999884899997897633068999965867688999999999999- Q ss_pred HHHHHHHCCCCHHHHHHH--HHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEEC-------- Q ss_conf 999999838721689999--99599999718997799999999985798-0895078867885089999857-------- Q gi|254781018|r 242 EMMENIQKKDAEKTYKSV--RVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGG-------- 310 (337) Q Consensus 242 ~~~~al~~~d~~~l~~~~--~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~-------- 310 (337) -+..+|.++|++.+. .+ +.||..|.+ .--|.++++.+.+.+.|+ |.-+||||++ .+.|+.. T Consensus 211 ~lv~al~~~d~~ll~-~~~D~ihepyR~~--~LiP~~~~v~~~a~~aGA~g~~iSGAGPT-----i~Al~~g~~~~~~~~ 282 (336) T PTZ00299 211 ILVLALSTGDLRMLK-SCSDKLHEQQRSD--ALFPHFRPCVKAAREAGAHYAFLSGAGPS-----VCALVGGRHGDPLTQ 282 (336) T ss_pred HHHHHHHCCCHHHHH-HCCCEEECCCHHH--HCCCCHHHHHHHHHHCCCEEEEEECHHHH-----HHEEECCCCCCCCCC T ss_conf 999999549898997-4873486520233--22627999999998789949999764222-----442621556653224 Q ss_pred -CCH---HHHH----H---H--CCCEEEEEEECCCCEEE Q ss_conf -997---9999----7---4--59869999987997092 Q gi|254781018|r 311 -YNP---EKLS----S---L--YGYTCYKIKEDKNGTML 336 (337) Q Consensus 311 -~~~---~~~~----~---~--~~~~~~~~~~~~~G~~i 336 (337) ... ++.. + . .+++.+-++++.+|++| T Consensus 283 ~~~~~~a~~V~~aM~~a~~~~gi~~~~~i~~~~~~G~~v 321 (336) T PTZ00299 283 PREERKAESVAEAMIKAAEAVGVAGRVIITQPSDQGVHL 321 (336) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEE T ss_conf 410345999999999999986984589993467788789 No 26 >COG3890 ERG8 Phosphomevalonate kinase [Lipid metabolism] Probab=99.85 E-value=1.1e-17 Score=130.13 Aligned_cols=295 Identities=18% Similarity=0.156 Sum_probs=159.3 Q ss_pred EEEECCCEEEEECCCEEEC-CCEEEEEEECCCEEEEEEECCCC-EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8998581799974454407-95489987667279999984898-299995576667532100123333433211111221 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLY-GASALAVAITFYLRALLTTIEPS-LIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKI 88 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~-G~~ai~~aI~~~~~v~i~~~~~~-~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (337) ...+||||++|+|+|+++. |..+.+.|++++..+...+..+. .|+........ ..++ + .++ T Consensus 4 ~~fSaPGk~LlaGdYs~lv~glsahaia~nkr~~cs~vrvKskdswlfhisi~g~--------------~kns-f--iEl 66 (337) T COG3890 4 ISFSAPGKLLLAGDYSILVEGLSAHAIAINKRAFCSFVRVKSKDSWLFHISIKGI--------------DKNS-F--IEL 66 (337) T ss_pred EEECCCCCEEEECCCEEEEECCEEEEEEECCCCCCCEEEECCCCEEEEEEEECCC--------------CCCH-H--HHH T ss_conf 4405887369840311355233124787305552013674267458987330136--------------8871-8--999 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEE------ECC----CCCCCCCCCCHHHHHHHHHHHH-HH-HCCCCCCHH Q ss_conf 03347778889999987511566788769999------805----5410389853037899999999-98-187754025 Q gi|254781018|r 89 PITSVLTHPNDLILYIFNRHLPHISCGISLDL------HST----IPIGSGFGSSSAIISALSLALS-SI-THQPFCNKE 156 (337) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i------~s~----iP~~~GLGSSaA~~va~~~al~-~~-~~~~~~~~~ 156 (337) .. .++.. ++......-+.++++.. +++ -..+.||||||++++.++..+. .+ .+-+++.+. T Consensus 67 ~~----~~Vf~----yF~~~~~~c~qnlev~~ddayy~q~~f~~d~g~KtGlGSSAa~~tsLt~~lfls~~~~~nvd~k~ 138 (337) T COG3890 67 RS----DFVFK----YFAYLSQNCFQNLEVFADDAYYDQSNFFFDDGTKTGLGSSAAVATSLTCGLFLSHANATNVDEKG 138 (337) T ss_pred HH----HHHHH----HHHHHHHHCCCCCEEECCHHHCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99----99999----98754142233322323232105632221477657776246889999999999886056501678 Q ss_pred HHHHHHHHHHHCCC-CCCCCHHHHHHHHCCCEEEECCCC--CCC---CC--CCCCEEEEECCCCCCCHHHHHHHHHHHHC Q ss_conf 89999998641157-787734578765326302402785--200---14--78843999749714537999999997641 Q gi|254781018|r 157 KLIAETGYIERLQH-GKTGIIDSTTIVMGGIIYINPPQI--IKN---ET--LVGEWWAINTGMPESSTGECVSFVEQHFS 228 (337) Q Consensus 157 ~~~~~a~~~e~~~~-~~~sg~D~~~~~~gg~~~~~~~~~--~~~---~~--~~~~~~~i~~~~~~~st~~~~~~~~~~~~ 228 (337) ++.++|+.++...+ +.+||.|+++++||++++-++.+. .++ .. +-...++++.....++|..++..+.++.. T Consensus 139 eIhklaqiAhc~aQggIGSGfDiaaA~fGsiiyrRF~p~li~~l~qig~~nfg~y~LmmGd~a~gSeTvglV~kv~~w~n 218 (337) T COG3890 139 EIHKLAQIAHCYAQGGIGSGFDIAAAIFGSIIYRRFEPGLIPKLRQIGAVNFGDYYLMMGDQAIGSETVGLVCKVNKWRN 218 (337) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHCCEEEEECCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 99999999889971797764035676623157863483100345763750306765662232368553489999987763 Q ss_pred CC--HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC------CCCCHHHHHHH-HHHHHCCC-EEEECCCCCC Q ss_conf 20--689998765489999998387216899999959999971------89977999999-99985798-0895078867 Q gi|254781018|r 229 QS--SIWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI------GVVPETVGKFI-RCIENNGG-SAKIAGAGSI 298 (337) Q Consensus 229 ~~--~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l------~vs~~~l~~l~-~~~~~~g~-~ak~sGAG~~ 298 (337) .. +.+.-+++.-..+.+.++. .+++.-+..+.-.+.++++ .+.++....+. .+....|+ +.-+-||| T Consensus 219 s~~peelkc~de~nsrvl~l~~~-sLdE~~s~~~~irrslrritee~~adiep~kqt~~Ld~i~~l~gvl~~lipgaG-- 295 (337) T COG3890 219 SILPEELKCNDEMNSRVLNLIKL-SLDESNSKSKAIRRSLRRITEEGLADIEPIKQTAALDSIFDLLGVLCDLIPGAG-- 295 (337) T ss_pred CCCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEECCCCC-- T ss_conf 37858877678888999988862-088899999999999999999752333341135540067760275475024678-- Q ss_pred CCCEEEEEEEECCCHHHHHHH--CCCEEEEEEECCCCEEE Q ss_conf 885089999857997999974--59869999987997092 Q gi|254781018|r 299 RGEKAGLVLVGGYNPEKLSSL--YGYTCYKIKEDKNGTML 336 (337) Q Consensus 299 ~Gg~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~G~~i 336 (337) ||...|++.. ++.+....+ .+-..+..+ .-||+++ T Consensus 296 -ggdaif~l~~-~~~e~re~w~~~gi~i~dv~-adeGlr~ 332 (337) T COG3890 296 -GGDAIFLLYR-PNIEAREAWLSKGISIVDVN-ADEGLRF 332 (337) T ss_pred -CCCEEEEEEC-CCHHHHHHHHHCCEEEEECC-CCCCCCC T ss_conf -8753799845-62898887875780799714-4436112 No 27 >PRK03009 consensus Probab=99.82 E-value=4.6e-17 Score=126.24 Aligned_cols=233 Identities=14% Similarity=0.163 Sum_probs=148.2 Q ss_pred EECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9858179997----44544-079548998766727999998489829999557666753210012333343321111122 Q gi|254781018|r 13 SIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKK 87 (337) Q Consensus 13 ~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (337) .-||.||||+ |--.| ||.-..+...|+++-++.++++++..+....... ... . .. T Consensus 5 ~~APAKINL~L~V~~kr~DGyH~l~Sl~~~i~l~D~i~i~~~~~~~~~~~~~~~--------~i~------~------~~ 64 (287) T PRK03009 5 CPAPAKLNLFLHVTGRRPDGYHLLQTVFQLLDWGDTLHFTLRDDGKVARVTDVP--------GVP------E------ES 64 (287) T ss_pred CCCCCEEECCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCEEECCCCC--------CCC------C------CC T ss_conf 896212702502488689998706789998067869999988899479625777--------898------8------66 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 10334777888999998751156678876999980554103898530378999999999818775402589999998641 Q gi|254781018|r 88 IPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIER 167 (337) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~ 167 (337) +.... ....+. .......|++|+++.+||+++|||+.||.+.+++.+++++++++++. +++.+++..+ T Consensus 65 Nlv~k--------A~~~l~-~~~~~~~~~~I~l~KnIPv~AGLGGGSSdAAa~L~~Ln~l~~l~ls~-~~l~~ia~~l-- 132 (287) T PRK03009 65 DLVVR--------AARLLQ-AHTGTRLGVDIEIDKRLPMGGGLGGGSSDAATTLLALNRLWGLDLPR-AELQSLALKL-- 132 (287) T ss_pred CHHHH--------HHHHHH-HHHCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCH-HHHHHHHHHC-- T ss_conf 59999--------999999-85399885499998068765677877537999999999970799999-9999998763-- Q ss_pred CCCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCH-HHHHH-HH--HHHH Q ss_conf 1577877345787653263024027-852001478843999749714537999999997641206-89998-76--5489 Q gi|254781018|r 168 LQHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSS-IWSEF-NA--VTNE 242 (337) Q Consensus 168 ~~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~-~~~~~-~~--i~~~ 242 (337) |.|+..|.+|+....+.. +.......+..++++..+....||.++............ ....+ .. .... T Consensus 133 -------GADVPffl~~~~a~~~G~Ge~l~~l~~~~~~~llv~P~~~iST~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (287) T PRK03009 133 -------GADVPFFVFGKNAFAEGIGEALTAVELPPRWFLVVTPRVHVPTAEIFSDESLTRDTKPITIADFLAQQTSDAG 205 (287) T ss_pred -------CCCCCCEECCCCEEEECCCCCCEECCCCCCEEEEECCCCCCCHHHHHHCCHHCCCCCCCCHHHHHHHHCCCHH T ss_conf -------9982303227767998588464074789747999859998467999745020256776668788752332455 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 99999838721689999995999997189977999999999857980895078867 Q gi|254781018|r 243 MMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSI 298 (337) Q Consensus 243 ~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~ 298 (337) ....+..+|++... ++. .|++.++.+.+++.++ ++|||+|++ T Consensus 206 ~~~~~~~NdLe~~a---------~~~----~PeI~~i~~~l~~~~~-a~MSGSGSt 247 (287) T PRK03009 206 WPDSFGRNDLQPVV---------TSK----YAEVARAVEWLYNLTP-ARMTGSGAC 247 (287) T ss_pred HHHHHCCCCCHHHH---------HHH----CHHHHHHHHHHHCCCC-EEEECCCCE T ss_conf 67875478758999---------874----8899999999867897-187771502 No 28 >PRK01726 consensus Probab=99.81 E-value=4.1e-17 Score=126.58 Aligned_cols=233 Identities=16% Similarity=0.160 Sum_probs=153.0 Q ss_pred CCCCCEEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79960899858179997----44544-07954899876672799999848982999955766675321001233334332 Q gi|254781018|r 6 SQTPHAHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRH 80 (337) Q Consensus 6 ~~~p~~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~ 80 (337) .|---....||+||+|+ |.-.| ||.-..+...||.+-+++++++++..+.+..... . T Consensus 17 ~~~k~~~~~aPAKINL~L~V~gkR~DGYH~L~Sl~~~idl~D~i~i~~~~~~~i~~~~~~~--------~---------- 78 (312) T PRK01726 17 SQGKPLRFPSPAKLNLFLYINGKRPDGYHELQTLFQFLDFGDWLDISIREDNQIVLTPEIP--------N---------- 78 (312) T ss_pred CCCCEEEEECCEEECCCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEECCCC--------C---------- T ss_conf 3897788746304804502078289998526899999468969999996899789936789--------9---------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 11111221033477788899999875115667887699998055410389853037899999999981877540258999 Q gi|254781018|r 81 RDFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIA 160 (337) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~ 160 (337) .....+...... ..+ ........|++|.+..+||+++|||+.||.+++++++++++++.+++. +++.+ T Consensus 79 --i~~~~Nli~kA~--------~~l-~~~~~~~~~~~I~l~KnIPvgAGLGGGSSdAAavL~~Ln~l~~l~ls~-~~L~~ 146 (312) T PRK01726 79 --LKTEDNLIYRAA--------KLL-QQKTNCQLGANIHLDKILPMGGGVGGGSSNAATALVALNYLWQTNLSI-DELAK 146 (312) T ss_pred --CCCCCCHHHHHH--------HHH-HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHH T ss_conf --988675999999--------999-997699876389997378665677754346999999999984469998-99999 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99986411577877345787653263024027-85200147884399974971453799999999764120689998765 Q gi|254781018|r 161 ETGYIERLQHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAV 239 (337) Q Consensus 161 ~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i 239 (337) ++..+ |.|+..|.+|+..+.+.. +......++..|+++..+....||+++.+....... . ....+... T Consensus 147 ia~~i---------GADVPffl~~~~a~~~GiGE~l~~i~~~~~~~lLv~P~i~vST~~vf~~~~l~~~-~-~~~~~~~~ 215 (312) T PRK01726 147 LGLTL---------GADVPIFVHGHAAFAEGVGEKITYCEPPEKWFVVLKPDDSISTAVIFQDPNLPRN-T-PKKSLEQL 215 (312) T ss_pred HHHHH---------CCCEEEEEECCCEEEECCCCEEEECCCCCCEEEEECCCCCCCCHHHHHCCCCCCC-C-CHHHHHHH T ss_conf 99861---------8981668617778998898682066777666999899998687799709675547-6-31149999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 4899999983872168999999599999718997799999999985798089507886 Q gi|254781018|r 240 TNEMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGS 297 (337) Q Consensus 240 ~~~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~ 297 (337) ..+ --.+|++... ++. .|++.++.+.+.+.++ ++|||+|+ T Consensus 216 ---l~~-~~~NDLE~~a---------~~~----~PeI~~i~~~L~~~~~-arMSGSGS 255 (312) T PRK01726 216 ---LSE-PYKNDCEKVV---------INH----YSEVEEALNWLLQYAP-ARLTGTGA 255 (312) T ss_pred ---HHH-HHCCCHHHHH---------HHH----CHHHHHHHHHHHHCCC-CEEECCCC T ss_conf ---860-6116448999---------986----8999999999861389-83675380 No 29 >TIGR01219 Pmev_kin_ERG8 phosphomevalonate kinase; InterPro: IPR016005 Phosphomevalonate kinase (2.7.4.2 from EC) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. In an example of nonorthologous gene displacement, two different types of phosphomevalonate kinase are found - the higher eukaryotic form and the ERG8 type . This model represents plant and fungal forms of the ERG8 type of phosphomevalonate kinase.. Probab=99.80 E-value=5.5e-18 Score=132.08 Aligned_cols=321 Identities=19% Similarity=0.203 Sum_probs=184.9 Q ss_pred EEECCCEEEEECCCEEEC-CCEEEEEEECCCEEEEEEECCCC-----------EEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 998581799974454407-95489987667279999984898-----------299995576667532100123333433 Q gi|254781018|r 12 HSIAPAKVILSGEYSSLY-GASALAVAITFYLRALLTTIEPS-----------LIRIINSQTIEYSFEACRLLGHQIDQR 79 (337) Q Consensus 12 ~~~APGKi~L~GEh~v~~-G~~ai~~aI~~~~~v~i~~~~~~-----------~~~~~s~~~~~~~~~~~~~~~~~~~~~ 79 (337) .++||||++|.|.|.|+. -+.++++.++-|.|+..+|.++. .+++.|.++.+..+...-......-.. T Consensus 1 ~~SAPGKvL~aGGYlVLd~~y~glV~gl~Ar~yA~~~p~~~~~gPesW~~K~t~VrvkSpQf~~~EwlY~is~~~~~~~~ 80 (499) T TIGR01219 1 VASAPGKVLLAGGYLVLDKKYEGLVVGLSARFYAVVKPLDEKVGPESWADKFTEVRVKSPQFKDAEWLYKISLKVSHLRL 80 (499) T ss_pred CCCCCCCHHHHCCEEEECCCCCEEEEEEECCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCEEE T ss_conf 97687547563854466477660688640414688610122458651002123567636643465200221564461589 Q ss_pred CCCCCCCCC--CHHHHHHHHHHHHHH--HHHHC--CCCCCCCEEEEEE-------------------------------- Q ss_conf 211111221--033477788899999--87511--5667887699998-------------------------------- Q gi|254781018|r 80 HRDFIAKKI--PITSVLTHPNDLILY--IFNRH--LPHISCGISLDLH-------------------------------- 121 (337) Q Consensus 80 ~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~--~~~~~~g~~i~i~-------------------------------- 121 (337) ..-+..... ....+..+....+.. .++.. ......++.|+|. T Consensus 81 ~~~~~~~s~~~~~npfiq~aia~v~~atyf~~~~~~~l~~q~l~iti~~dnayysqrnqiesaGlPltp~s~~~~~~~~~ 160 (499) T TIGR01219 81 QSVYASVSLEKSKNPFIQLAIAAVFSATYFKKESRDKLLLQDLDITILSDNAYYSQRNQIESAGLPLTPESLTELASYAK 160 (499) T ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 88864123688866689999999987542165320026331622689633522444331210578567212455577752 Q ss_pred --------------CCCCCCCCCCCCHHHHHHHHHHHHHHHCCC------------CCCHHHHHHHHHHHHHCCCCC-CC Q ss_conf --------------055410389853037899999999981877------------540258999999864115778-77 Q gi|254781018|r 122 --------------STIPIGSGFGSSSAIISALSLALSSITHQP------------FCNKEKLIAETGYIERLQHGK-TG 174 (337) Q Consensus 122 --------------s~iP~~~GLGSSaA~~va~~~al~~~~~~~------------~~~~~~~~~~a~~~e~~~~~~-~s 174 (337) .++| +.|||||||++++++.+|..++... .++++.+..+|+..+..-+|+ || T Consensus 161 ~~F~~~~snGans~~eV~-KTGLGSSAam~T~l~~aL~~~l~~~~~~~~dK~~K~~~S~~~viH~lAQ~aHC~AQGKvGS 239 (499) T TIGR01219 161 LSFLAAESNGANSLEEVA-KTGLGSSAAMVTALVAALLLYLRSVDEDDVDKELKIDDSDKEVIHNLAQVAHCSAQGKVGS 239 (499) T ss_pred EEECCCCCCCCCCCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 022022225655655536-5676314899999999999986542045787343326750455554478888632785346 Q ss_pred CHHHHHHHHCCCEEEECCCCCC-----------------------------------CCCCCCCEEEEECCC-CCCCHHH Q ss_conf 3457876532630240278520-----------------------------------014788439997497-1453799 Q gi|254781018|r 175 IIDSTTIVMGGIIYINPPQIIK-----------------------------------NETLVGEWWAINTGM-PESSTGE 218 (337) Q Consensus 175 g~D~~~~~~gg~~~~~~~~~~~-----------------------------------~~~~~~~~~~i~~~~-~~~st~~ 218 (337) |.|++|++||...+.++.+... ..+++...++++... .-++|.. T Consensus 240 GFDV~aA~yGS~rYrRF~P~lis~l~vPyd~~i~~~~~~~~l~~iv~~kw~Ni~~~~f~LP~~~~L~mGd~~sgGSsTp~ 319 (499) T TIGR01219 240 GFDVAAAVYGSIRYRRFDPELISQLLVPYDVDIKSLTYSEKLAKIVKEKWDNIKVKEFSLPALLTLLMGDVKSGGSSTPK 319 (499) T ss_pred CHHHHHHHHCCEEEEECCHHHHHHCCCCCHHHCCCCCHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCCCCCCCCCHH T ss_conf 32377764335114446879996416860131145753789998741235653010045762447550443788867367 Q ss_pred HHHHHHHHHCCCH-----HHHHHHHH---------------------HHHHHH---HHHCCCC---------HHHHHHHH Q ss_conf 9999997641206-----89998765---------------------489999---9983872---------16899999 Q gi|254781018|r 219 CVSFVEQHFSQSS-----IWSEFNAV---------------------TNEMME---NIQKKDA---------EKTYKSVR 260 (337) Q Consensus 219 ~~~~~~~~~~~~~-----~~~~~~~i---------------------~~~~~~---al~~~d~---------~~l~~~~~ 260 (337) +++.|++.....+ .++.+.+. ..+.++ .|..+|+ +.+-+.+. T Consensus 320 mV~kVk~Wq~s~pees~~~~~~L~~aN~~l~~~L~~L~~L~~~H~D~~~~~~~sCsvL~~~~c~l~~~~Pineaiike~~ 399 (499) T TIGR01219 320 MVKKVKKWQKSDPEESKEIYDDLDDANLRLEDKLSDLDKLAKDHDDVYLRVIKSCSVLESEDCVLSAEEPINEAIIKELL 399 (499) T ss_pred HHHHHHHHHCCCCHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHH T ss_conf 88999713015836889864036665678873134777765200136788876520121235033157745789999999 Q ss_pred HHH-------HHHHHC------CCCCHHHHHHHHHHHHCCC--EEEECCCCCCCCCEEEEEEEECCCH--HHHHH-H--C Q ss_conf 959-------999971------8997799999999985798--0895078867885089999857997--99997-4--5 Q gi|254781018|r 261 VNQ-------SLLESI------GVVPETVGKFIRCIENNGG--SAKIAGAGSIRGEKAGLVLVGGYNP--EKLSS-L--Y 320 (337) Q Consensus 261 ~~~-------~ll~~l------~vs~~~l~~l~~~~~~~g~--~ak~sGAG~~~Gg~~~~~l~~~~~~--~~~~~-~--~ 320 (337) |-+ .+++++ .|.++.=-.|.+-++...+ .+-+-||| |-+..|+++..+.. ++..+ . . T Consensus 400 E~r~a~~~IRr~~r~it~ea~vdIEP~~QT~lLD~~~~l~GVl~~gvPGAG---GfDAif~it~~~~d~~~~~~~~w~~~ 476 (499) T TIGR01219 400 EVREAVLSIRRLLRKITKEAKVDIEPEKQTSLLDDIESLEGVLLVGVPGAG---GFDAIFVITLNDVDLREKVIKAWKSD 476 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCEEEEECCCCC---HHHHHHHHHHCCHHHHHHHHHHHHHC T ss_conf 999999999999998445507888873443343020025863688528985---47899988740212688998756426 Q ss_pred CCEEEEEEECCCCEEE Q ss_conf 9869999987997092 Q gi|254781018|r 321 GYTCYKIKEDKNGTML 336 (337) Q Consensus 321 ~~~~~~~~~~~~G~~i 336 (337) +=..+.++-+.+|+++ T Consensus 477 ~V~~ldV~~a~~Gv~~ 492 (499) T TIGR01219 477 KVLALDVSEADDGVAL 492 (499) T ss_pred CCEEEEECCCCCCCEE T ss_conf 8347760458774031 No 30 >PRK00343 ipk 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.80 E-value=5.1e-17 Score=126.00 Aligned_cols=254 Identities=16% Similarity=0.178 Sum_probs=155.2 Q ss_pred EEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997----44544-0795489987667279999984898299995576667532100123333433211111 Q gi|254781018|r 11 AHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 11 ~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) ++..||.||+|+ |--.| ||.-..+...||.+-+++++++++..+.+..... . +. . T Consensus 3 ~~~~aPAKINL~L~V~~kr~DGyH~i~Sl~~~idl~D~i~i~~~~~~~i~~~~~~~-~-------i~------------~ 62 (279) T PRK00343 3 LDWPAPAKLNLFLHITGRRADGYHELQTLFQFLDYGDTLHFEPRDDGQIRLLTEIP-G-------VP------------E 62 (279) T ss_pred CCCCCCEEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCEEEECCCC-C-------CC------------C T ss_conf 66365002730322387089998625789998278839999998899689946877-7-------98------------6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) ..+...+.. ..+.. ......|++|.+..+||+++|||+.||.+++++.+++++++++++. +++.+++..+ T Consensus 63 ~~Nlv~ka~--------~~l~~-~~~~~~~~~I~l~KnIPv~AGLGGGSSnAAa~L~~Ln~l~~l~ls~-~~l~~ia~~l 132 (279) T PRK00343 63 EDNLIVRAA--------RLLQK-ATGTPLGADISLDKRLPMGGGLGGGSSDAATTLVALNRLWQLGLSR-DELAALGLKL 132 (279) T ss_pred CCCHHHHHH--------HHHHH-HHCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCH-HHHHHHHHHC T ss_conf 475999999--------99999-8699997499999658763567888712899999999974667899-9999998751 Q ss_pred HHCCCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 411577877345787653263024027-8520014788439997497145379999999976412068999876548999 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMM 244 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 244 (337) |.|+..+.+|+....... +......++..++++..+....||+++......... .....+.. .. T Consensus 133 ---------GaDVPffl~~~~a~~~G~Ge~l~~~~~~~~~~llv~P~~~vST~~vf~~~~l~~~--~~~~~~~~----~~ 197 (279) T PRK00343 133 ---------GADVPVFVRGHAAFAEGVGEKLTPVELPEKWYLVVKPGVHVSTAEIFSDPELTRD--TPKRTIAD----FL 197 (279) T ss_pred ---------CCCEEEEECCCCEEEEECCCEEEECCCCCCEEEEECCCCCCCHHHHHHCCCCCCC--CCCCCHHH----HH T ss_conf ---------8977778608768998068687775678747999879999561999838245547--73006999----98 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEEEECC-CHHHHHHH--CC Q ss_conf 9998387216899999959999971899779999999998579808950788678850899998579-97999974--59 Q gi|254781018|r 245 ENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEKAGLVLVGGY-NPEKLSSL--YG 321 (337) Q Consensus 245 ~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~~~~l~~~~-~~~~~~~~--~~ 321 (337) ..-..+|++... ++ .+|++.++.+.+.+.+. ++|||+|++ .|.+..+. +.+++.+. .+ T Consensus 198 ~~~~~NDLe~~a---------~~----~~PeI~~~~~~L~~~~~-~~MSGSGSt-----~Fglf~~~~~A~~a~~~l~~~ 258 (279) T PRK00343 198 AGPGRNDCEPVV---------RK----RYPEVAQALSWLLEYAP-ARMTGTGAC-----VFAEFDTEAEAEAVLAQLPEG 258 (279) T ss_pred HCHHCCCHHHHH---------HH----HCHHHHHHHHHHHHCCC-CEEECCCCE-----EEEEECCHHHHHHHHHHCCCC T ss_conf 431218589999---------88----69899999999860288-746542225-----889979999999999864744 Q ss_pred CEEEEEE Q ss_conf 8699999 Q gi|254781018|r 322 YTCYKIK 328 (337) Q Consensus 322 ~~~~~~~ 328 (337) |+++-++ T Consensus 259 ~~~f~~k 265 (279) T PRK00343 259 WNGFVAK 265 (279) T ss_pred CCEEEEC T ss_conf 8789947 No 31 >PRK04943 consensus Probab=99.80 E-value=3.8e-17 Score=126.76 Aligned_cols=229 Identities=16% Similarity=0.177 Sum_probs=151.2 Q ss_pred CEEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0899858179997----44544-079548998766727999998489829999557666753210012333343321111 Q gi|254781018|r 10 HAHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 10 ~~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) -..-.||.||+|+ |--.| ||.-..+...||++-++++++++...+.+..... .+. T Consensus 5 ~~~~~aPAKINL~L~V~~kr~DGYH~i~S~~~~idl~D~l~i~~~~~~~i~~~~~~~--------~i~------------ 64 (288) T PRK04943 5 TTRWPSPAKLNLFLYINGRTENGYHELQTLFQFLDHGDELTITANNSGNITLSPAIE--------GVP------------ 64 (288) T ss_pred CEEEECCEEEECCCCCCCCCCCCCCEEEEEEEEECCCEEEEEEECCCCCEEEECCCC--------CCC------------ T ss_conf 748713116823402388799998715789998167509999998999889817867--------898------------ Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 12210334777888999998751156678876999980554103898530378999999999818775402589999998 Q gi|254781018|r 85 AKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGY 164 (337) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~ 164 (337) ...+....... . .+.......|++|+++.+||+++|||+.||.+.+++.+++++++.++++ +++.+++.. T Consensus 65 ~~~Nli~kA~~--------~-l~~~~~~~~~~~I~l~KnIPv~AGLGGGSSnAAa~L~~Ln~l~~l~ls~-~~l~~ia~~ 134 (288) T PRK04943 65 LEDNLIWKAAT--------A-LQNAAQCSLGAHIELHKILPMGGGIGGGSSNAATTLVALNYLWQTGLSD-DELAEIGLA 134 (288) T ss_pred CCCCHHHHHHH--------H-HHHHHCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHHH T ss_conf 76749999999--------9-9997699988089999717875788876206999999999984799998-999998874 Q ss_pred HHHCCCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 6411577877345787653263024027-852001478843999749714537999999997641206899987654899 Q gi|254781018|r 165 IERLQHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEM 243 (337) Q Consensus 165 ~e~~~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~ 243 (337) + |.|+..+.+|+..+.+.. +.......+..|+++..+....||.++........... ...+ +.. T Consensus 135 l---------GADVPffl~~~~a~~~G~GE~l~~~~~~~~~~lLv~P~~~iST~~vf~~~~~~~~~~--~~~~----~~~ 199 (288) T PRK04943 135 L---------GADVPVFVRGFAAFAEGVGEKLSPATPEEKWYLVVRPNVSIATVDIFTHPDLTRNTP--KRDL----ETL 199 (288) T ss_pred C---------CCCEEEEECCCCEEEECCCCEEEECCCCCCEEEEECCCCCCCCHHHHCCCHHCCCCC--CCCH----HHH T ss_conf 3---------998576861776899868857426677776799987999972778805930013765--2179----998 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 999983872168999999599999718997799999999985798089507886 Q gi|254781018|r 244 MENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGS 297 (337) Q Consensus 244 ~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~ 297 (337) .+....+|++.....+ .|++.++.+.+.+.++ ++|||+|+ T Consensus 200 ~~~~~~NDLE~~~~~~-------------~PeI~~~~~~l~~~~~-~~MSGSGs 239 (288) T PRK04943 200 LNTPYVNDCEKIVRML-------------YPEVDKQLSWLLQYAP-SRLTGTGS 239 (288) T ss_pred HHCHHCCCCHHHHHHH-------------CHHHHHHHHHHHCCCC-CEEECHHH T ss_conf 7360047857999986-------------8899999999856598-71326405 No 32 >PRK03288 consensus Probab=99.79 E-value=7.7e-17 Score=124.85 Aligned_cols=231 Identities=13% Similarity=0.156 Sum_probs=146.6 Q ss_pred ECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 858179997----44544-0795489987667279999984898299995576667532100123333433211111221 Q gi|254781018|r 14 IAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKI 88 (337) Q Consensus 14 ~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (337) -||.||+|+ |--.| ||.-..+...||++-+++++++++..+++..... . .. + ..+ T Consensus 6 ~APAKINL~L~V~gkR~DGYH~i~S~~~~idl~D~i~i~~~~~~~i~~~~~~~-~-------~~----~--------~~N 65 (283) T PRK03288 6 PSPAKLNLFLYITGQRADGYHTLQTLFQFLDYGDTLTIEPRDDGEIRLLTPVE-G-------VE----N--------EDN 65 (283) T ss_pred CCCEEEECCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCEEEECCCC-C-------CC----C--------CCC T ss_conf 86015811403388699998725789998178829999998999789955876-7-------87----7--------541 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC Q ss_conf 03347778889999987511566788769999805541038985303789999999998187754025899999986411 Q gi|254781018|r 89 PITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERL 168 (337) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~ 168 (337) ...+.... +..+...........|++|.++.+||+++|||+.||.+.++++++++++++++++ +++.+++..+ T Consensus 66 lv~ka~~~---l~~~~~~~~~~~~~~g~~I~l~KnIPv~AGLGGGSSdAAa~L~~L~~l~~l~ls~-~~l~~ia~~i--- 138 (283) T PRK03288 66 LIVRAARL---LMKTAADSGRLPAGSGADISIDKRLPMGGGLGGGSSNAATVLVALNHLWQCGLSD-DELAELGLTL--- 138 (283) T ss_pred HHHHHHHH---HHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHH--- T ss_conf 89999999---9997644024566666079994367644567877615999999999985899999-9999999985--- Q ss_pred CCCCCCCHHHHHHHHCCCEEEECC-CCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 577877345787653263024027-8520014788439997497145379999999976412068999876548999999 Q gi|254781018|r 169 QHGKTGIIDSTTIVMGGIIYINPP-QIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMENI 247 (337) Q Consensus 169 ~~~~~sg~D~~~~~~gg~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al 247 (337) |.|+..+.+|+....+.. +.......+..++++..+....||+++.+...... ......+ ......- T Consensus 139 ------GaDVPffl~~~~a~~~G~Ge~l~~~~~~~~~~llv~P~~~vST~~vy~~~~~~~--~~~~~~~----~~~l~~~ 206 (283) T PRK03288 139 ------GADVPVFVRGHAAFAEGVGEILTPVDPPEKWYLVAHPGVSIPTPVIFKDPELPR--NTPKRSI----ETLLKCE 206 (283) T ss_pred ------CCCCCCEECCCCEEEECCCCEEEECCCCCCEEEEECCCCCCCHHHHHCCCCCCC--CCCCCCH----HHHHHCH T ss_conf ------898310542775899828868147677875699989999978289844944467--8753489----9997371 Q ss_pred HCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 83872168999999599999718997799999999985798089507886 Q gi|254781018|r 248 QKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGS 297 (337) Q Consensus 248 ~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~ 297 (337) -.+|++... ++. .|++.++.+.+.++++ ++|||+|+ T Consensus 207 ~~Ndle~~a---------~~~----~peI~~~~~~l~~~~~-~~MSGSGS 242 (283) T PRK03288 207 FSNDCEVIA---------RKR----FREVDAALSWLLEYAP-SRLTGTGA 242 (283) T ss_pred HCCCHHHHH---------HHH----HHHHHHHHHHHHCCCC-CCEECHHH T ss_conf 106469999---------875----5999999999855398-71226404 No 33 >KOG4519 consensus Probab=99.79 E-value=8.5e-16 Score=118.28 Aligned_cols=319 Identities=18% Similarity=0.217 Sum_probs=178.8 Q ss_pred EEEECCCEEEEECCCEEEC-CCEEEEEEECCCEEEEEEECCCCE------EEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8998581799974454407-954899876672799999848982------999955766675321001233334332111 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLY-GASALAVAITFYLRALLTTIEPSL------IRIINSQTIEYSFEACRLLGHQIDQRHRDF 83 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~-G~~ai~~aI~~~~~v~i~~~~~~~------~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (337) ..++||||+++.|.|.|+. -+.+++..++-+.|+..+|.++.. +++.|.++.+..|.......... -.-... T Consensus 3 v~~SAPGKvL~aGGYlVLd~~y~glV~gl~Ar~yAi~~p~~~~~g~~~~~VrvkSpQf~d~EwlY~is~~~~~-i~v~~~ 81 (459) T KOG4519 3 VVASAPGKVLMAGGYLVLDKPYAGLVLGLNARFYAIVKPINEEVGPEWTDVRVKSPQFSDREWLYKISLNHLT-IQVVSA 81 (459) T ss_pred CEECCCCCEEEECCEEEECCCCCEEEEEEECEEEEEEECCCCCCCCCCCEEEECCCCCCCCHHEEECCCCCCE-EEEEEC T ss_conf 0305887458765469965776516996303037875225310277640267237554562122550655440-577753 Q ss_pred CCCCCCH-HHHHHHHHHHHHHHHHH-CCCCCCCCEEEE-------------------------EE-CCCC--CCCCCCCC Q ss_conf 1122103-34777888999998751-156678876999-------------------------98-0554--10389853 Q gi|254781018|r 84 IAKKIPI-TSVLTHPNDLILYIFNR-HLPHISCGISLD-------------------------LH-STIP--IGSGFGSS 133 (337) Q Consensus 84 ~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~g~~i~-------------------------i~-s~iP--~~~GLGSS 133 (337) ....+++ .....|+.. +.... ....-.+++.++ .. .++| .+.||||| T Consensus 82 ~~s~NPFve~ai~yvi~---yf~~~~~~~lc~q~L~Vt~l~sddaY~~qp~s~~~~~~~~~~~F~~~~~~eV~KTGLGSS 158 (459) T KOG4519 82 SDSRNPFVEHAIQYVIA---YFHLATLHKLCLQGLDVTILGSDDAYSYQPESLGTLAPFASITFNAAEKPEVAKTGLGSS 158 (459) T ss_pred CCCCCCHHHHHHHHHHH---HHHHCCCHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCH T ss_conf 78998079999999999---860213136776586689950365110374432345777713441244554444576623 Q ss_pred HHHHHHHHHHHHHHHCC------------CCCCHHHHHHHHHHHHHCCCCC-CCCHHHHHHHHCCCEEEECCCCC----- Q ss_conf 03789999999998187------------7540258999999864115778-77345787653263024027852----- Q gi|254781018|r 134 SAIISALSLALSSITHQ------------PFCNKEKLIAETGYIERLQHGK-TGIIDSTTIVMGGIIYINPPQII----- 195 (337) Q Consensus 134 aA~~va~~~al~~~~~~------------~~~~~~~~~~~a~~~e~~~~~~-~sg~D~~~~~~gg~~~~~~~~~~----- 195 (337) ||++.+++.++.+.++. ..++.+.+..+|+..+...+|+ +||.|+.++.||...+-++.+.. T Consensus 159 Aam~T~lv~~ll~sl~~~~~d~~~k~~k~d~s~~~viHnlAQ~aHC~AQGKvGSGFDV~aA~yGS~rYrRF~P~lis~lp 238 (459) T KOG4519 159 AAMTTALVAALLHSLGVVDLDDPCKEGKFDCSDLDVIHNLAQTAHCLAQGKVGSGFDVSAAVYGSQRYRRFSPELISFLP 238 (459) T ss_pred HHHHHHHHHHHHHHHCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHCCCEEEEECCHHHHHHHH T ss_conf 78999999999986131325787645666842899999999988778617756774300111032346637999998879 Q ss_pred -------------------------C-CCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCC-----HHHHHHHHHHH--- Q ss_conf -------------------------0-0147884399974971453799999999764120-----68999876548--- Q gi|254781018|r 196 -------------------------K-NETLVGEWWAINTGMPESSTGECVSFVEQHFSQS-----SIWSEFNAVTN--- 241 (337) Q Consensus 196 -------------------------~-~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~-----~~~~~~~~i~~--- 241 (337) + ..+++...++++.+...++|..++..+++..... +.++.+++... T Consensus 239 ~~i~~~~~~~el~~~V~k~~W~~~~~~f~LP~~~~L~mGd~~gGSsTp~mV~kVk~Wq~s~~pe~~k~~~~lddansr~~ 318 (459) T KOG4519 239 QVIVTGLPLNELIGTVLKGKWDNKRTEFSLPPLMNLFMGDGSGGSSTPSMVGKVKKWQMSDPPEARKNWQNLDDANSRLE 318 (459) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHCCCHHHHHHHCCHHHHHHHHH T ss_conf 98761671677887752300134454356885101564157788776789999987765288667654200006789987 Q ss_pred ------------------HHHHHHHCCCC-----H---HHHHHHHHHHHHHHHC------CCCCHHHHHHHHHHHHCCC- Q ss_conf ------------------99999983872-----1---6899999959999971------8997799999999985798- Q gi|254781018|r 242 ------------------EMMENIQKKDA-----E---KTYKSVRVNQSLLESI------GVVPETVGKFIRCIENNGG- 288 (337) Q Consensus 242 ------------------~~~~al~~~d~-----~---~l~~~~~~~~~ll~~l------~vs~~~l~~l~~~~~~~g~- 288 (337) +.++.|...+. + ++.+.|..=..+++++ .+.+|.-..|.+-++...+ T Consensus 319 ~~l~kl~~l~~~h~d~~~~v~eSl~~~~~~~~~~~e~~E~r~a~~~IRr~~r~it~ea~vdIEP~~QT~lLD~~~sl~GV 398 (459) T KOG4519 319 TKLNKLSKLAKDHWDVYLRVIESLSVLTINEAIIKELLEAREAMLRIRRLMRQITEEASVDIEPESQTQLLDSTMSLEGV 398 (459) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCE T ss_conf 53056667665000678888764012332155546789999999999999999778608987863554576666405663 Q ss_pred -EEEECCCCCCCCCEEEEEEEECCCH--HHHHH-H--CCCEEEEEEECCCCEEE Q ss_conf -0895078867885089999857997--99997-4--59869999987997092 Q gi|254781018|r 289 -SAKIAGAGSIRGEKAGLVLVGGYNP--EKLSS-L--YGYTCYKIKEDKNGTML 336 (337) Q Consensus 289 -~ak~sGAG~~~Gg~~~~~l~~~~~~--~~~~~-~--~~~~~~~~~~~~~G~~i 336 (337) .+.+-||| |-+..|+++..+.. ++..+ + ++-..+.++-+.+|+++ T Consensus 399 l~~gvPGAG---GfDAif~it~~dvd~~~~~~~~w~~~~V~~ldV~~a~~Gv~~ 449 (459) T KOG4519 399 LLAGVPGAG---GFDAIFAITLGDVDSGTKLTQAWSSHNVLALDVREAPHGVCL 449 (459) T ss_pred EEECCCCCC---CCCEEEEEEECCHHHHHHHHHHHCCCCEEEEEEEECCCCCEE T ss_conf 784466788---744488886064468788876520057447762007775031 No 34 >COG1947 IspE 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase [Lipid metabolism] Probab=99.76 E-value=5.7e-16 Score=119.38 Aligned_cols=237 Identities=16% Similarity=0.168 Sum_probs=149.4 Q ss_pred EEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997----44544-0795489987667279999984898299995576667532100123333433211111 Q gi|254781018|r 11 AHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 11 ~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) ....||.||||+ |-..| ||--..+...|+.+-++++++.++........... .+ +.. T Consensus 4 ~~~~apAKiNL~L~V~gkr~DGYHel~sl~~~id~~D~l~i~~~~~~~~~~~~~~~~-------~l------p~~----- 65 (289) T COG1947 4 TKFPAPAKINLFLHVTGKRADGYHELETLFQFIDLGDELTIRPRDDDGFIVLGTFAD-------GL------PTD----- 65 (289) T ss_pred EEEECCCEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEEEECCCCCCEEECCCCC-------CC------CCC----- T ss_conf 687215448888885355899843458999994148789999877887269667777-------88------774----- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) .+.... ....+.+... ...|++|.++++||+++|||..||.+.+++++|+.+++.+++.. ++..++... T Consensus 66 -~NLv~r--------Aa~ll~~~~~-~~~~v~I~l~K~IPv~aGLGGGSSdAAa~L~~Ln~lw~~~ls~~-eL~~Lg~~L 134 (289) T COG1947 66 -ENLVYR--------AAELLRKRTG-IAGGVSIHLDKNIPVGAGLGGGSSDAAAVLVALNELWGLGLSLE-ELAELGLRL 134 (289) T ss_pred -CHHHHH--------HHHHHHHHHC-CCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHH-HHHHHHHHH T ss_conf -319999--------9999999868-99982799983686557676645789999999999867999999-999999986 Q ss_pred HHCCCCCCCCHHHHHHHHCCCEEEECCCC--CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 41157787734578765326302402785--2001478843999749714537999999997641206899987654899 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMGGIIYINPPQI--IKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEM 243 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~gg~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~ 243 (337) |.|+..|.+|+.......-. ..++.++..|+++.......+|+.+++ .+....+.... ... T Consensus 135 ---------GaDVPffl~g~tA~a~G~GE~l~~~~~~~~~~~vl~~P~v~vsT~~vy~-~~~~~~~~~~~-------~~~ 197 (289) T COG1947 135 ---------GADVPFFLSGGTAFAEGRGEKLEPLEDPPEKWYVLAKPGVGVSTKEVYK-DPELTRNTPKS-------EPL 197 (289) T ss_pred ---------CCCCCEEEECCCEEEEECCCEEEECCCCCCCEEEEEECCCCCCHHHHHC-CCCCCCCCCCC-------HHH T ss_conf ---------8986714318856988736351477778875299995898998799970-86756466772-------245 Q ss_pred HHHHHCCCCHHHHHHHH-HHHH-HHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCC Q ss_conf 99998387216899999-9599-999718997799999999985798-089507886 Q gi|254781018|r 244 MENIQKKDAEKTYKSVR-VNQS-LLESIGVVPETVGKFIRCIENNGG-SAKIAGAGS 297 (337) Q Consensus 244 ~~al~~~d~~~l~~~~~-~~~~-ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~ 297 (337) ..++...+++....... .=+. .++. .|++........++|+ .+.|||+|+ T Consensus 198 ~~~~~~~~~~~~~~~~~NdLe~~~~~~----~p~v~~~~~~l~~~ga~~~~mSGSGs 250 (289) T COG1947 198 IAALSLENLKQIAPFLINDLEKVALRL----YPEVKEALSELLEYGALPARMSGSGS 250 (289) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHH----CHHHHHHHHHHHHCCCCCCEEECCCC T ss_conf 587865527764102036457889885----86899999997641354436824787 No 35 >KOG4644 consensus Probab=99.76 E-value=5e-16 Score=119.74 Aligned_cols=325 Identities=19% Similarity=0.210 Sum_probs=197.8 Q ss_pred CCCC-CEEEECCCEEEEECCCEEE-------CCCEEEEEEECCC----EEEEEEECCCC-EEEEEECCCCCCCC------ Q ss_conf 7996-0899858179997445440-------7954899876672----79999984898-29999557666753------ Q gi|254781018|r 6 SQTP-HAHSIAPAKVILSGEYSSL-------YGASALAVAITFY----LRALLTTIEPS-LIRIINSQTIEYSF------ 66 (337) Q Consensus 6 ~~~p-~~~~~APGKi~L~GEh~v~-------~G~~ai~~aI~~~----~~v~i~~~~~~-~~~~~s~~~~~~~~------ 66 (337) +..| -+++.||.||.++|.+.|. ..+-|+.+||... +.+.+++.+.. .|..+...+++..+ T Consensus 566 g~g~s~Viae~PaRiDF~GGW~DTPPiafel~n~AVlglAiklDGk~PIga~a~kI~ePelwlai~~rQDel~V~I~crc 645 (948) T KOG4644 566 GAGPSTVIAEAPARIDFFGGWLDTPPIAFELDNAAVLGLAIKLDGKNPIGAFAEKIDEPELWLAIEIRQDELFVHIKCRC 645 (948) T ss_pred CCCCCEEEEECCEEEEECCCCCCCCCEEEECCCCCEEEEEEEECCCCCCCHHHHCCCCCHHEEEEEEECCCEEEEEEEEH T ss_conf 99986389956556652266567997057616531134688734889621466518971020113564264189998500 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf -2100123333433211111221033477788899999875115667887699998055410389853037899999999 Q gi|254781018|r 67 -EACRLLGHQIDQRHRDFIAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALS 145 (337) Q Consensus 67 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~ 145 (337) .+.+...++.+.- ... ...+.+..--......++.+.....+-+||++...|++|.|+|||+||-++...+.+++ T Consensus 646 laDlrD~cqpHa~g---al~-~aafiCA~IVhl~sel~i~d~~~k~f~~GfeihT~SdLPHGSGLGTSSIlA~TaLaAi~ 721 (948) T KOG4644 646 LADLRDLCQPHAKG---ALE-AAAFICACIVHLGSELNILDIFEKLFCCGFEIHTSSDLPHGSGLGTSSILACTALAAIC 721 (948) T ss_pred HHHHHHHHCCCCCC---HHH-HHHHHHEEEEECCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 10157762557876---055-54244113332031433999999973475473032458887776568889899999999 Q ss_pred HHHCCCCCCHHHHHHHHH---HHHHCCCCCCCCHHHHHHHHCCCEE----EECCCCC----------CCCCCCCCEEEEE Q ss_conf 981877540258999999---8641157787734578765326302----4027852----------0014788439997 Q gi|254781018|r 146 SITHQPFCNKEKLIAETG---YIERLQHGKTGIIDSTTIVMGGIIY----INPPQII----------KNETLVGEWWAIN 208 (337) Q Consensus 146 ~~~~~~~~~~~~~~~~a~---~~e~~~~~~~sg~D~~~~~~gg~~~----~~~~~~~----------~~~~~~~~~~~i~ 208 (337) ...+.......++....+ ..|+.+-..++.+||....+.|+.. ++.+-.. ....+....++|| T Consensus 722 ~aagr~~gTeaLiHailHtvlrlEQilTTGGGWQDQ~G~im~GIK~gr~rael~~~ie~eeiTipe~f~ekL~dhLLLVY 801 (948) T KOG4644 722 AAAGRADGTEALIHAILHTVLRLEQILTTGGGWQDQCGAIMEGIKKGRCRAELNHGIEHEEITIPEEFREKLEDHLLLVY 801 (948) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCHHHHHHHCCCHHHCCCCCEEEEECCCHHHHHHHHHCEEEEE T ss_conf 86054456367689999999999988623785365305122101203202125677435210177899999762079987 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCH----HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC--CCCCHHHHHHHHH Q ss_conf 49714537999999997641206----89998765489999998387216899999959999971--8997799999999 Q gi|254781018|r 209 TGMPESSTGECVSFVEQHFSQSS----IWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI--GVVPETVGKFIRC 282 (337) Q Consensus 209 ~~~~~~st~~~~~~~~~~~~~~~----~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l--~vs~~~l~~l~~~ 282 (337) +++.+..........+..+.+-+ ....+.+.++++.+.++++.++.+++.+...|...+-. |..++...+|++. T Consensus 802 TGKTRLAkNLLQdViRn~far~~a~~Q~ah~l~~~tdecAegf~kGsl~LlgecL~~YweqKk~MapgCEPl~Vr~lldm 881 (948) T KOG4644 802 TGKTRLAKNLLQDVIRNFFARCKATKQKAHKLAEATDECAEGFEKGSLELLGECLEHYWEQKKFMAPGCEPLNVRELLDM 881 (948) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 27218899999999999997668999999999999999999886275878999999998765104789999748999987 Q ss_pred HHHCCC--EEEECCCCCCCCCEEEEEEEECCCHHHHHHH-------CCCEEEEEEECCCCEEE Q ss_conf 985798--0895078867885089999857997999974-------59869999987997092 Q gi|254781018|r 283 IENNGG--SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL-------YGYTCYKIKEDKNGTML 336 (337) Q Consensus 283 ~~~~g~--~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~-------~~~~~~~~~~~~~G~~i 336 (337) +..... .+-..|||| ||+-...+.-...++++..+ .+|.|.-+.+|.+|.-| T Consensus 882 Laph~hgesgw~AGAGG--GGFiYLl~kEpqqkeaiEa~Lak~eg~gN~s~Hlieid~egl~i 942 (948) T KOG4644 882 LAPHKHGESGWAAGAGG--GGFIYLLIKEPQQKEAIEAFLAKNEGFGNMSCHLIEIDLEGLFI 942 (948) T ss_pred HCCCCCCCCCHHCCCCC--CCEEEEEECCCCCHHHHHHHHCCCCCCCCEEEEEEEECCCCCHH T ss_conf 46001132120015788--73799996588997899986016877686168999841775268 No 36 >PRK04648 consensus Probab=99.76 E-value=2.8e-16 Score=121.32 Aligned_cols=232 Identities=14% Similarity=0.102 Sum_probs=149.3 Q ss_pred CEEE-ECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0899-858179997----44544-07954899876672799999848982999955766675321001233334332111 Q gi|254781018|r 10 HAHS-IAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDF 83 (337) Q Consensus 10 ~~~~-~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (337) +.++ .||.||||+ |--.| ||.-..+...||.+-+++++++.+..+......... + .+. T Consensus 13 ~~~~~~APAKINL~L~V~gkr~DGYH~l~Sl~~~idl~D~l~i~~~~~~~~~~~~~~~~~-------i----~~~----- 76 (295) T PRK04648 13 DWSAWPAPAKLNLFLQITGRRADGYHLLQTVFRLLDWGDTIHLRVRSDGQIHRIGESLPG-------V----AED----- 76 (295) T ss_pred CEEECCCCCEEECCCCCCCCCCCCCCEEEEEEEECCCCEEEEEEECCCCCEEEECCCCCC-------C----CCC----- T ss_conf 830445621370251328879999872568999837880999998899947993688888-------9----997----- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 11221033477788899999875115667887699998055410389853037899999999981877540258999999 Q gi|254781018|r 84 IAKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETG 163 (337) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~ 163 (337) .+...+. ...+ +.......|+++.++.+||+++|||+.||.+.+++++++++++++++. +++.+++. T Consensus 77 ---~Nlv~ka--------~~~l-~~~~~~~~~~~I~l~K~IPvgAGLGGGSSnAAa~L~~Ln~~~~l~ls~-~~L~~ia~ 143 (295) T PRK04648 77 ---DDLVVRA--------ARLL-QSAAGTALGAEIRVDKRIPAGGGFGGGSSDAATVLVALNALWGLGLDV-DTLAELGL 143 (295) T ss_pred ---CHHHHHH--------HHHH-HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHH T ss_conf ---7499999--------9999-996599987499998068667567765468999999999985899998-99999999 Q ss_pred HHHHCCCCCCCCHHHHHHHHCCCEEEEC-CCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 8641157787734578765326302402-785200147884399974971453799999999764120689998765489 Q gi|254781018|r 164 YIERLQHGKTGIIDSTTIVMGGIIYINP-PQIIKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNE 242 (337) Q Consensus 164 ~~e~~~~~~~sg~D~~~~~~gg~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~ 242 (337) .+ |.|+..|.+|+..+.+. .+......++..++++..+....||+++++........ ....+ ... T Consensus 144 ~l---------GaDVPffl~~~~a~~~G~GE~l~~~~~~~~~~lLv~P~~~vST~~vy~~~~~~~~~--~~~~~---~~~ 209 (295) T PRK04648 144 RL---------GADVPVFVRGHNAWAEGVGEQLTPISLPEAAYLLVDPGVHVPTPVLFRSQELTRDA--APAKI---ADF 209 (295) T ss_pred HH---------CCCCEEEEECCCEEEEECCCEEEECCCCCCEEEEECCCCCCCHHHHHHCCCCCCCC--CCCCH---HHH T ss_conf 64---------89931587477689996783635867788579998899996659997383445666--64556---777 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 99999838721689999995999997189977999999999857980895078867 Q gi|254781018|r 243 MMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSI 298 (337) Q Consensus 243 ~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~ 298 (337) ....+..+|++... + + ..|++.++.+.+.+.|. ++|||+|++ T Consensus 210 ~~~~~~~NdlE~~~---------~-~---~~P~I~~i~~~L~~~g~-a~MSGSGSt 251 (295) T PRK04648 210 ASGSLLDNAFEPVL---------R-R---REPAVEAVFQALSRVGT-PRLTGSGSG 251 (295) T ss_pred HHCCHHHCHHHHHH---------H-H---CCHHHHHHHHHHHCCCC-CCEEEECCC T ss_conf 62843118168999---------8-6---28899999999860598-657953521 No 37 >PRK00650 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.76 E-value=2.9e-16 Score=121.27 Aligned_cols=222 Identities=19% Similarity=0.218 Sum_probs=149.6 Q ss_pred EEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99858179997----44544-07954899876672799999848982999955766675321001233334332111112 Q gi|254781018|r 12 HSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAK 86 (337) Q Consensus 12 ~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (337) ..+||.||+|+ |-..| ||.-..+...|+++-++++++++.....+...... + .. T Consensus 2 ~~~aPAKINL~L~V~~kr~DGYH~l~Sl~~~idl~D~i~i~~~~~~~~~~~~~~~~--------------~-------~~ 60 (288) T PRK00650 2 HFFSPAKLNLFLQLLGKREDGFHEIVTRYQAIDFGDQLSLSISSRDSLQVINLCEL--------------E-------TP 60 (288) T ss_pred CEECCEEEECCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEECCCCC--------------C-------CC T ss_conf 56077107325045887999988157999996789899999889996898389878--------------9-------94 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 21033477788899999875115667887699998055410389853037899999999981877540258999999864 Q gi|254781018|r 87 KIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIE 166 (337) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e 166 (337) .+...+. ...+ +.......++++.++.+||+++|||+.||.+.+++.+++++++..+++ +++.+++..+ T Consensus 61 ~Nli~kA--------~~~l-~~~~~~~~~~~I~l~KnIPvgAGLGGGSSnAAavL~~Ln~l~~l~l~~-~~L~~ia~~l- 129 (288) T PRK00650 61 QNSIWKS--------VALF-RDYTGITTPVSWRVVKQIPIGAGLAGGSSNAATALFALNQHFQTGLSD-EELRSLAEKI- 129 (288) T ss_pred HHHHHHH--------HHHH-HHHHCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHC- T ss_conf 5699999--------9999-997499987379999579765567610378999999999985589899-9999999872- Q ss_pred HCCCCCCCCHHHHHHHHCCCEEEECCCC--CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 1157787734578765326302402785--20014788439997497145379999999976412068999876548999 Q gi|254781018|r 167 RLQHGKTGIIDSTTIVMGGIIYINPPQI--IKNETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMM 244 (337) Q Consensus 167 ~~~~~~~sg~D~~~~~~gg~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 244 (337) |.|+..+.+|+.......-. ..+...+..++++..+....+|+++...++.. +....+. . T Consensus 130 --------GADVPffl~~~~a~~~G~GE~i~~l~~~~~~~~vLv~p~~~vsT~~vf~~~~~~--~~~~~~~--------l 191 (288) T PRK00650 130 --------GMDTPFFFSSGSALGVGRGEKILALEESVSDRYVLYFSDQGVLTSDAFAYVQPS--DCSSRKN--------L 191 (288) T ss_pred --------CCCCCEEECCCCEEEEECCCEEEECCCCCCCEEEEECCCCCCCCHHHHHHCCCC--CCCHHHH--------H T ss_conf --------998140753875899937967778876776519999199897766987504910--0663666--------6 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC----EEEECCCCC Q ss_conf 99983872168999999599999718997799999999985798----089507886 Q gi|254781018|r 245 ENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG----SAKIAGAGS 297 (337) Q Consensus 245 ~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~----~ak~sGAG~ 297 (337) . ..++|++...-. ..|++.++...+++.+. .++|||+|+ T Consensus 192 ~-~~~NDLE~~a~~-------------~~P~i~~ik~~L~~l~a~~~~~~rMSGSGs 234 (288) T PRK00650 192 E-YTQNDLEKPVFR-------------LRLDLKEKKHWLESLWSPFPVHVGLTGSGA 234 (288) T ss_pred H-HCCCCCHHHHHH-------------HCHHHHHHHHHHHHCCCCCCCEEEEECCCC T ss_conf 5-458978899998-------------597999999999964698776588873130 No 38 >TIGR00154 ispE 4-diphosphocytidyl-2C-methyl-D-erythritol kinase; InterPro: IPR004424 4-diphosphocytidyl-2C-methyl-D-erythritol kinase is a member of the family of GHMP kinases that were previously designated as conserved hypothetical protein YchB or as isopentenyl monophosphate kinase. In Lycopersicon esculentum (tomato) and Escherichia coli the protein has been indentified as 4-diphosphocytidyl-2C-methyl-D-erythritol kinase, an enzyme of the deoxyxylulose phosphate pathway of terpenoid biosynthesis.; GO: 0050515 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity, 0016114 terpenoid biosynthetic process. Probab=99.70 E-value=5e-16 Score=119.72 Aligned_cols=264 Identities=18% Similarity=0.214 Sum_probs=168.6 Q ss_pred EEEECCCEEEEE----CCCEE-ECCCEEEEEEECCCEEEEEEECC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997----44544-07954899876672799999848-9829999557666753210012333343321111 Q gi|254781018|r 11 AHSIAPAKVILS----GEYSS-LYGASALAVAITFYLRALLTTIE-PSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 11 ~~~~APGKi~L~----GEh~v-~~G~~ai~~aI~~~~~v~i~~~~-~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) ..-.||.||||| |.-.| ||--..+...|+..-.+.|+.+. ++.+++...... -..|..|.-|- T Consensus 4 ~~~~~PAK~NLFL~I~~krpdGYH~L~~~~~~~~~gD~i~i~~~~q~~~i~l~~~~~~-----------~~~e~~NL~yr 72 (322) T TIGR00154 4 IVFPAPAKINLFLYILGKRPDGYHELQMLMQFIDLGDKIEISVRSQDDDIRLSKGDSD-----------VPDEERNLIYR 72 (322) T ss_pred CEECCCCCCCCHHHCCCCCCCCCEEHHHHHHHHHCCCEEEEEEECCCCCEEECCCCCC-----------CCCCHHHHHHH T ss_conf 1004887321113101679985231333334562167379998535784787377898-----------88620106999 Q ss_pred CCCCCHHHHHHHHHHHHHHH-HHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 12210334777888999998-75115667887699998055410389853037899999999981877540258999999 Q gi|254781018|r 85 AKKIPITSVLTHPNDLILYI-FNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETG 163 (337) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~ 163 (337) ...+.. ...... +-+.......|++|.+++.||+++|||+.||.+.+++.+||.+++..|+ .+++++++. T Consensus 73 Aa~Ll~--------~~~~~~nfsH~~~~~~~G~~I~~~K~iP~~aGLgGGSsdAA~~L~~LN~LW~~~LS-~~EL~~lG~ 143 (322) T TIGR00154 73 AAKLLK--------NKANSKNFSHSVIKVKKGVNIEITKVIPMAAGLGGGSSDAAAVLVGLNQLWNLNLS-LEELAELGA 143 (322) T ss_pred HHHHHH--------HHHHCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHH T ss_conf 999999--------85100233343012567614898641466577654045799999987677621688-789999988 Q ss_pred HHHHCCCCCCCCHHHHHHHHC-CCEEEEC-CCCCC-CC-CCCCCEEEEECC-CCCCCHHHHHHHHHHHHCCCHH----HH Q ss_conf 864115778773457876532-6302402-78520-01-478843999749-7145379999999976412068----99 Q gi|254781018|r 164 YIERLQHGKTGIIDSTTIVMG-GIIYINP-PQIIK-NE-TLVGEWWAINTG-MPESSTGECVSFVEQHFSQSSI----WS 234 (337) Q Consensus 164 ~~e~~~~~~~sg~D~~~~~~g-g~~~~~~-~~~~~-~~-~~~~~~~~i~~~-~~~~st~~~~~~~~~~~~~~~~----~~ 234 (337) .+ |.|+-.|.+| +.-.-.. .+... ++ .++..|++|+.+ ....||..+++..++++.++.. .. T Consensus 144 ~l---------GaDvpffv~GY~~A~AtG~GE~i~~~~e~P~e~W~~~~kP~~~~~ST~~vyq~~~qkfGknyLp~~~~~ 214 (322) T TIGR00154 144 TL---------GADVPFFVSGYGTAFATGRGEIITPLEEEPPEKWVVIAKPRSVSVSTAVVYQAYKQKFGKNYLPRNTPK 214 (322) T ss_pred HH---------CCCCEEEECCCCEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCH T ss_conf 74---------698307863633121123442505788898740388972798532738987124553152247877765 Q ss_pred HHHHHHH---HHHHHHHCCCCHHHHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHCCC-EEE-ECCCCCCCCCEEEEEEE Q ss_conf 9876548---999999838721689999995999997189-97799999999985798-089-50788678850899998 Q gi|254781018|r 235 EFNAVTN---EMMENIQKKDAEKTYKSVRVNQSLLESIGV-VPETVGKFIRCIENNGG-SAK-IAGAGSIRGEKAGLVLV 308 (337) Q Consensus 235 ~~~~i~~---~~~~al~~~d~~~l~~~~~~~~~ll~~l~v-s~~~l~~l~~~~~~~g~-~ak-~sGAG~~~Gg~~~~~l~ 308 (337) +.++-+. -+.+..+......+.+.. +.-+++.-+ ..++.+.+.+.++++|+ ... |||+|+ |.|.+. T Consensus 215 ~~~~ktihaG~~l~~i~~~~~~~~~~~~---~NDlEkv~l~~~~~V~~~l~~l~~~g~~~~~f~sGsG~-----~vF~l~ 286 (322) T TIGR00154 215 RAKEKTIHAGGLLKKIEEKLLQLLDSNL---KNDLEKVALDAHTEVAQALNWLLEYGLAPERFLSGSGP-----CVFALF 286 (322) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHC---CCCCCHHHHCCCCCHHHHHHHHHHHCCCCEEEEECCCH-----HHHHCC T ss_conf 6776320025102456676577665404---77202000015761899999998605884257644854-----664215 Q ss_pred ECC Q ss_conf 579 Q gi|254781018|r 309 GGY 311 (337) Q Consensus 309 ~~~ 311 (337) ... T Consensus 287 ~~e 289 (322) T TIGR00154 287 DEE 289 (322) T ss_pred HHH T ss_conf 114 No 39 >PRK05905 hypothetical protein; Provisional Probab=99.69 E-value=1.8e-14 Score=109.90 Aligned_cols=235 Identities=15% Similarity=0.100 Sum_probs=138.3 Q ss_pred EECCCEEEEE----CCCEE--ECCCEEEEEEE-CCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9858179997----44544--07954899876-67279999984898299995576667532100123333433211111 Q gi|254781018|r 13 SIAPAKVILS----GEYSS--LYGASALAVAI-TFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 13 ~~APGKi~L~----GEh~v--~~G~~ai~~aI-~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) ..||+||+|+ |--.| ||.-..+.++| |++-++++++.+.....+....... . .... T Consensus 3 ~ks~AKINL~L~I~~kr~d~GyH~l~S~f~~i~~l~D~i~i~~~~~~~~~i~~~~~~~---------------~--~~~~ 65 (258) T PRK05905 3 YKSYAKINLGLSIYKKCKKVTKHKLESIFILVENVYDDIEIEKIEKNIDDIHYFDETN---------------E--ILVY 65 (258) T ss_pred CCCCCCCCCCEEECCCCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCCEEEECCCCC---------------C--CCCC T ss_conf 6535507145224877889996689999999378872999998779987599708876---------------6--7782 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) ..+.... ... +.+.......+++|.+..+||+++|||+.||.++++++++++++++. .+++.+++..+ T Consensus 66 ~~ni~~k--------a~~-~l~~~~~~~~~~~I~l~K~IP~gAGLGGGSSnAAa~L~~L~~~~~l~---~~~l~~ia~~l 133 (258) T PRK05905 66 SRLILVK--------TLE-WLRDKYNIKNHFKIKIKKRIPIGSGLGSGSSNAAVLMKWILEFEGIN---EINYKDVVNKL 133 (258) T ss_pred HHHHHHH--------HHH-HHHHHHCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHH T ss_conf 4699999--------999-99886099986699999458765677875277999999999865999---89999999861 Q ss_pred HHCCCCCCCCHHHHHHHHC-CCEEEECC-CCC-CCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCC-HHHHHHHHHHH Q ss_conf 4115778773457876532-63024027-852-00147884399974971453799999999764120-68999876548 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMG-GIIYINPP-QII-KNETLVGEWWAINTGMPESSTGECVSFVEQHFSQS-SIWSEFNAVTN 241 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~g-g~~~~~~~-~~~-~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~-~~~~~~~~i~~ 241 (337) |.|+..+.+| +....... +.. .+...+..++++..+....||+++++......... ........... T Consensus 134 ---------GsDVPffl~~~~~a~~~G~GE~l~~l~~~~~~~~llv~P~~~~ST~~vy~~~~~~~~~~~~~~~~~~~~~~ 204 (258) T PRK05905 134 ---------GSDIPFFLSGYKTAYISDYGSQVEDLIGQFKLTYKVIFMNVNVSTKKVFEKFDDNQHVIKNNFKTIIKNLK 204 (258) T ss_pred ---------CCCCCEEECCCCCEEEECCCCEEEECCCCCCCEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHH T ss_conf ---------89988798168738998788778888778775299992799957599997226244456310788887667 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 999999838721689999995999997189977999999999857980895078867 Q gi|254781018|r 242 EMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSI 298 (337) Q Consensus 242 ~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~ 298 (337) +.......+|++... ++ -.|++..+.+.+++.|..+.|||+|++ T Consensus 205 ~~~~~~~~Ndle~~~---------~~----~~P~i~~~~~~l~~~~~~~~MSGSGSt 248 (258) T PRK05905 205 ENIVVNIHNDLQEPC---------FE----LYPNLLYKYNELLNDGFYTILSGAGSS 248 (258) T ss_pred HHHHHCCCCCHHHHH---------HH----HCHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 777633701739999---------98----797999999998679983899763714 No 40 >TIGR01920 Shik_kin_archae shikimate kinase; InterPro: IPR010189 Shikimate kinase (2.7.1.71 from EC) catalyses the fifth step in the shikimate pathway of aromatic amino acids biosynthesis. It converts shikimate to shikimate 3-phosphate (3-phosphoshikimate). This part of the pathway leads to the biosynthesis of chorismate, the precursor of aromatic amino acids, folates, ubiquinones, and other aromatic compounds. The shikimate pathway links metabolism of carbohydrates to biosynthesis of the aromatic amino acids phenylalanine, tyrosine, tryptophan and their derivatives in microorganisms and plants . In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The shikimate pathway is present in bacteria, archaea, fungi and plants. The absence of the shikimate pathway in animals makes it an attractive target for non-toxic herbicides, antimicrobial and antifungal agents. Shikimate kinase enzymes present an example of convergent evolution via enzyme recruitment from an unrelated group. This group represents shikimate kinases of the archaeal type. Members have no sequence similarity with the typical form of shikimate kinase (PIRSF000702 from PIRSF, IPR000623 from INTERPRO) found in bacteria and eukaryotes, but are instead distantly related to homoserine kinases, which belong to the GHMP kinase domain superfamily (GHMP = galactose, homoserine, mevalonate, and phosphomevalonate) . All known non-archaeal shikimate kinases (the typical form) belong to the non-homologous, structurally unrelated nucleoside monophosphate (NMP) kinase domain superfamily . Nomenclature note: the name AroK is used for non-homologous shikimate kinases of both the archaeal and bacterial types. ; GO: 0004765 shikimate kinase activity, 0009073 aromatic amino acid family biosynthetic process, 0005737 cytoplasm. Probab=99.67 E-value=1.6e-13 Score=103.99 Aligned_cols=234 Identities=16% Similarity=0.193 Sum_probs=150.9 Q ss_pred EEEEEEECCCEEEEEEECCCCE--EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 4899876672799999848982--99995576667532100123333433211111221033477788899999875115 Q gi|254781018|r 32 SALAVAITFYLRALLTTIEPSL--IRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPITSVLTHPNDLILYIFNRHL 109 (337) Q Consensus 32 ~ai~~aI~~~~~v~i~~~~~~~--~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (337) .-=+++||+.++++++..+|.. +.+..... .+.+. ....+++.....+. T Consensus 16 ~G~AfgidL~v~a~V~~~~d~~~~v~~~~~~~-----------------~~~~~-----------~~~~~~~~~~~~~~- 66 (273) T TIGR01920 16 LGGAFGIDLKVEAKVRLSDDGEAKVSLKVRGN-----------------PELDP-----------RLIERILTALIEKF- 66 (273) T ss_pred CCEEECCCCEEEEEEEEECCCCCCEEEEECCC-----------------CCCCH-----------HHHHHHHHHHHHHC- T ss_conf 21121268607999997068854314776676-----------------54797-----------89999999989853- Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHC--CCCCCHHHHHHHHHHHHHCCCC--CCCCHHHHHHHHCC Q ss_conf 6678876999980554103898530378999999999818--7754025899999986411577--87734578765326 Q gi|254781018|r 110 PHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITH--QPFCNKEKLIAETGYIERLQHG--KTGIIDSTTIVMGG 185 (337) Q Consensus 110 ~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~--~~~~~~~~~~~~a~~~e~~~~~--~~sg~D~~~~~~gg 185 (337) -...++++.++|+||.++||+||||+..|++.|..++.+ ...++. .+..+....-...+- ++-..|.++|++|| T Consensus 67 -g~~~~~~v~~~seiP~g~GLKSSSA~~nAlv~A~~~~~gva~~~~~~-~~~~l~a~~S~~AG~s~TGAfDDa~AS~~GG 144 (273) T TIGR01920 67 -GIVEGLEVEVESEIPLGSGLKSSSALVNALVLAVLKAKGVAEEIDDI-DILRLGAELSKEAGVSVTGAFDDAAASYLGG 144 (273) T ss_pred -CCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHCCCEEEEHHHHHHHHHCCC T ss_conf -88862489996176788664378999999999999861234217733-6899999988866851530588999986082 Q ss_pred CEEEECCCCCCCCCC--CCC-EEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 302402785200147--884-39997497145379999999976412068999876548999999838721689999995 Q gi|254781018|r 186 IIYINPPQIIKNETL--VGE-WWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSVRVN 262 (337) Q Consensus 186 ~~~~~~~~~~~~~~~--~~~-~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~ 262 (337) ...-++.+...+... ++. ..+|..+.. ...++ .+...++++.+.++++.+.-.+|+| .+.|..| T Consensus 145 ~~~TDN~~~~ilk~~~~~~~l~~~vl~P~~-~~~~~---------~~~~~~~~~~~~~e~a~~~A~~G~y---~~Am~lN 211 (273) T TIGR01920 145 IVITDNRKMKILKRDKLEGNLTAVVLVPKE-ERREN---------VDLNRLRKISPVVEEAFKLALKGEY---LKAMVLN 211 (273) T ss_pred CEEEECCEEEEEEECCCCCCCEEEEEECCC-CCCCC---------CCHHHHHHHHHHHHHHHHHHHCCCC---HHHHHHH T ss_conf 005214302465521678986589997687-66767---------6888888666899999999646880---7899999 Q ss_pred HHHH-HHCCCCCHHHHHHHHHHHHCC-CEEEECCCCCCCCCEEEEEEEECCCHHHHHH Q ss_conf 9999-971899779999999998579-8089507886788508999985799799997 Q gi|254781018|r 263 QSLL-ESIGVVPETVGKFIRCIENNG-GSAKIAGAGSIRGEKAGLVLVGGYNPEKLSS 318 (337) Q Consensus 263 ~~ll-~~l~vs~~~l~~l~~~~~~~g-~~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~ 318 (337) .-+. .-|++.++- +..+.+.| +.+-+||-|+ ..|.+..+++.+.+.+ T Consensus 212 G~~y~~~Lgyp~e~----~~~al~~GA~~aG~SG~GP-----sy~A~~~~pe~~~~~~ 260 (273) T TIGR01920 212 GVAYATALGYPLEP----ASKALEKGAAVAGLSGKGP-----SYFALTEEPEEEEVAE 260 (273) T ss_pred HHHHHHHHCCCHHH----HHHHHHCCCEEEEECCCCC-----EEEEEECCCCHHHHHH T ss_conf 99999881898648----9999738943886305755-----6899730686489999 No 41 >PRK04181 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional Probab=99.65 E-value=9.3e-14 Score=105.43 Aligned_cols=180 Identities=17% Similarity=0.110 Sum_probs=117.2 Q ss_pred ECCCEEEEE----CCCEEECCCEEEEEEE-CCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 858179997----4454407954899876-67279999984898299995576667532100123333433211111221 Q gi|254781018|r 14 IAPAKVILS----GEYSSLYGASALAVAI-TFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKI 88 (337) Q Consensus 14 ~APGKi~L~----GEh~v~~G~~ai~~aI-~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (337) .||+||+|+ |--.+||.-..+.++| |.+-++++.+.+...+.+..... . .. ..+ T Consensus 2 Ks~AKINL~L~V~~kRd~yH~l~s~~~~i~dl~D~i~i~~~~~~~~~~~~~~~----~---------~~--------~~N 60 (254) T PRK04181 2 KSYAKVNIFLKITGKRGNYHELISRFVLVKDLFDEIEFVDDSAKSFEIIGNFD----C---------PL--------EEN 60 (254) T ss_pred CCCCEECCCEEECCCCCCCCEEEEEEEEHHHCCEEEEEEECCCCCEEEECCCC----C---------CC--------CCC T ss_conf 97653767435775349987325477782008579999988999589978888----8---------85--------415 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC Q ss_conf 03347778889999987511566788769999805541038985303789999999998187754025899999986411 Q gi|254781018|r 89 PITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERL 168 (337) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~ 168 (337) .......... .............+++|.++.+||+++|||+.||.+++++++++++++++++. +++.+++..+ T Consensus 61 lv~ka~~~l~---~~~~~~~~~~~~~~~~I~l~KnIP~gAGLGGGSSdAAa~L~~l~~~~~l~ls~-~~L~~ia~~l--- 133 (254) T PRK04181 61 IIFKAYQELK---SKGFSNELIEFFSKLAIEVTKNIPTGAGLGGGSSNAATFLLMANEILNLKLSL-EELINIGSKI--- 133 (254) T ss_pred HHHHHHHHHH---HHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHC--- T ss_conf 9999999998---75332343113664179995257665678874668999999999982899999-9999987531--- Q ss_pred CCCCCCCHHHHHHHHC-CCEEEEC-CCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 5778773457876532-6302402-785200147884399974971453799999999764 Q gi|254781018|r 169 QHGKTGIIDSTTIVMG-GIIYINP-PQIIKNETLVGEWWAINTGMPESSTGECVSFVEQHF 227 (337) Q Consensus 169 ~~~~~sg~D~~~~~~g-g~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~st~~~~~~~~~~~ 227 (337) |.|+..|.+| +....+. .+....-......+++..+....||+++++..+... T Consensus 134 ------GaDVPffl~g~~~a~~~G~Ge~l~~~~~~~~~~~lv~P~~~vST~~vy~~~~~~~ 188 (254) T PRK04181 134 ------GADVAFFISGYKSANVSGIGEIIEEFEEEILNLEIYTPNIFCSTKAVYQAFRPEF 188 (254) T ss_pred ------CCCCCEEECCCCEEEEECCCCEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHC T ss_conf ------8982449757863788757738358777786599994989986799999875540 No 42 >COG1685 Archaeal shikimate kinase [Amino acid transport and metabolism / Coenzyme metabolism] Probab=99.65 E-value=1.3e-12 Score=98.15 Aligned_cols=266 Identities=15% Similarity=0.117 Sum_probs=159.4 Q ss_pred EEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89985817999744544079548998766727999998489829999557666753210012333343321111122103 Q gi|254781018|r 11 AHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIAKKIPI 90 (337) Q Consensus 11 ~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (337) ..++|||-+-+..--+. +..=+++|++.++++++.++...+.-.... .. T Consensus 3 ~~a~A~g~~TIiNAiat---G~G~AfgidL~v~a~v~~~~~~~~~~~~~~--------------------------d~-- 51 (278) T COG1685 3 GRARAYGGGTIINAIAT---GKGSAFGIDLKVEAEVRLSDEGKVRGEPEG--------------------------DT-- 51 (278) T ss_pred CEEEECCCEEEEEEHHC---CCCCEEEECCEEEEEEEECCCCCCCCCCCC--------------------------CH-- T ss_conf 43884276067530124---766256524458999998576531367778--------------------------74-- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 34777888999998751156678876999980554103898530378999999999818775402589999998641157 Q gi|254781018|r 91 TSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQH 170 (337) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~ 170 (337) .....++....++ .....++++.++|+||.++||.||||+..|++.|+.++.+..+.+. ++..+....-...+ T Consensus 52 ----~li~~~~~~v~e~--~g~~~~~~v~v~SeiP~~~GLkSSSA~~nAlv~A~~~~~g~~~~~~-~i~~l~a~~S~~aG 124 (278) T COG1685 52 ----RLIERCVERVREK--YGIPLGVEVEVESEIPVGSGLKSSSAASNALVKAVLKALGEEIDDF-EILRLGARASKEAG 124 (278) T ss_pred ----HHHHHHHHHHHHH--CCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHH-HHHHHHHHHHHHCC T ss_conf ----7999999999987--2988655999825787666742458999999999999718988814-88887779887459 Q ss_pred C--CCCCHHHHHHHHCCCEEEECCCCCCCCC--CCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 7--8773457876532630240278520014--78843999749714537999999997641206899987654899999 Q gi|254781018|r 171 G--KTGIIDSTTIVMGGIIYINPPQIIKNET--LVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNEMMEN 246 (337) Q Consensus 171 ~--~~sg~D~~~~~~gg~~~~~~~~~~~~~~--~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~~~~a 246 (337) - .+-..|..+++|||...-++.+...+.. .++...+|..+.....+... +-..++.+...++.+.+. T Consensus 125 vSvTGA~DDa~AS~~GG~~iTDN~~m~Ilrr~~~~~~~vlI~~p~~k~~~~~v---------dv~~~r~~a~~~e~A~~l 195 (278) T COG1685 125 VSVTGAFDDACASYLGGIVITDNRKMRILRRLDLPELTVLILAPGEKRLSANV---------DVNRLRLIAPVVEEAFRL 195 (278) T ss_pred CEEECCCHHHHHHHHCCEEEECCHHHEEHHCCCCCCCEEEEEECCCCCCCCCC---------CHHHHHHHHHHHHHHHHH T ss_conf 44741406889997588688616111000003467740899816876525547---------878987743899999999 Q ss_pred HHCCCCHHHHHHHHHHHHHH-HHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHHCC-CE Q ss_conf 98387216899999959999-9718997799999999985798-089507886788508999985799799997459-86 Q gi|254781018|r 247 IQKKDAEKTYKSVRVNQSLL-ESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSLYG-YT 323 (337) Q Consensus 247 l~~~d~~~l~~~~~~~~~ll-~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~~~-~~ 323 (337) -.++++ .+.|..|.-+. ..||...+- +..+.+.|+ ++-+||-|+ ..+.+..++ .+....+.+ ++ T Consensus 196 A~~G~~---~~Am~lNG~~y~~aLG~~~e~----~~~ale~GA~~aglSGtGP-----a~~Al~~~~-~~v~ea~~~~G~ 262 (278) T COG1685 196 ALKGEY---FKAMVLNGILYCSALGYDLEP----ALKALEAGAAAAGLSGTGP-----AYFALTEDP-EEVAEAWSKIGD 262 (278) T ss_pred HHCCCH---HHHHHHHHHHHHHHHCCCHHH----HHHHHHCCCCEECCCCCCC-----CEEEEECCC-HHHHHHHHHCCE T ss_conf 853528---888887679989870898289----9999863540643578897-----169995493-789999986786 Q ss_pred EEEEEECCCCEEE Q ss_conf 9999987997092 Q gi|254781018|r 324 CYKIKEDKNGTML 336 (337) Q Consensus 324 ~~~~~~~~~G~~i 336 (337) ++..++..++.+. T Consensus 263 V~~t~~~~~~~~~ 275 (278) T COG1685 263 VIETRNVGERARD 275 (278) T ss_pred EEEEECCCCCCEE T ss_conf 9998048987401 No 43 >COG3407 MVD1 Mevalonate pyrophosphate decarboxylase [Lipid metabolism] Probab=99.42 E-value=1.2e-09 Score=79.65 Aligned_cols=200 Identities=18% Similarity=0.150 Sum_probs=119.3 Q ss_pred CCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEEC Q ss_conf 78876999980554103898530378999999999818775402589999998641157787734578765326302402 Q gi|254781018|r 112 ISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYINP 191 (337) Q Consensus 112 ~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~ 191 (337) ...++++..+++.|.++||+||||...|+.+|++.++++++++ ..+..+|+.. .+|+ .-++|||++.+.. T Consensus 87 ~~~~~~i~s~n~~ptaaGLaSSaag~AAl~~Al~~~~~~~~d~-~~lS~~AR~g------SGSa---~RS~~Gg~~~W~~ 156 (329) T COG3407 87 ISFKVKIVSYNNFPTAAGLASSAAGAAALAAALNRLYDLDLDD-EFLSRIARLG------SGSA---SRSIFGGFVLWEK 156 (329) T ss_pred CCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHH------CCCH---HHHHCCCEEEECC T ss_conf 4660799974578763452101889999999997652447887-9999998775------2411---4542288667306 Q ss_pred CC---C--CC---CCCC--CCCEE-EEECCCCCCCHHHHHHH-HHHHHCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 78---5--20---0147--88439-99749714537999999-9976412068999876548999999838721689999 Q gi|254781018|r 192 PQ---I--IK---NETL--VGEWW-AINTGMPESSTGECVSF-VEQHFSQSSIWSEFNAVTNEMMENIQKKDAEKTYKSV 259 (337) Q Consensus 192 ~~---~--~~---~~~~--~~~~~-~i~~~~~~~st~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~al~~~d~~~l~~~~ 259 (337) .+ . .+ .... ....+ .+....+..++.+..+. +....-...-+.....-.+++.++++.+|++.++++. T Consensus 157 ~~g~~~~~~~~~~~~~~~e~~~i~~~~~~~~k~vsS~~gm~~~~~tS~~y~~w~~~~~~~~~~m~~~~~~~Df~~i~~~~ 236 (329) T COG3407 157 GEGEDSAAEQLFRLDLWKELAMIVLVISPKKKKVSSREGMQLTAETSPFYDAWLEHSEEDLEEMKEAIREKDFEKIGELA 236 (329) T ss_pred CCCCCCCEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 88875210120234676231427999745567787368777777707278999998887599999998614799999999 Q ss_pred HHHHHHHHHC--C------CCCHHHHHHHH---HHHHCCC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHH-CCCEEEE Q ss_conf 9959999971--8------99779999999---9985798-0895078867885089999857997999974-5986999 Q gi|254781018|r 260 RVNQSLLESI--G------VVPETVGKFIR---CIENNGG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL-YGYTCYK 326 (337) Q Consensus 260 ~~~~~ll~~l--~------vs~~~l~~l~~---~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~-~~~~~~~ 326 (337) ..+-..+... + +-+++..++++ .+++.|. .+-..=||+ -..++....+.....+. ....+.+ T Consensus 237 e~dsl~mHA~l~~s~p~~~y~~~~s~~ii~~v~~~r~~g~~~~fT~DaGP-----nV~v~~~~~~l~~~~~~~~~~~~~~ 311 (329) T COG3407 237 ENDSLEMHATLMSSGPPFFYLTDESLRIIEFVHELRKEGNAVYFTMDAGP-----NVKVITLEENLIDLLEILKTLECID 311 (329) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCC-----CEEEEEECCCHHHHHHHHHHCCCCC T ss_conf 86599999998456996399883189999999998733884699972799-----6599972154899999873135656 No 44 >COG4542 PduX Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.34 E-value=9.1e-11 Score=86.59 Aligned_cols=246 Identities=16% Similarity=0.126 Sum_probs=148.1 Q ss_pred CCEEEECCCEEEEECCCE--EEC-CCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 608998581799974454--407-95489987667279999984898299995576667532100123333433211111 Q gi|254781018|r 9 PHAHSIAPAKVILSGEYS--SLY-GASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 9 p~~~~~APGKi~L~GEh~--v~~-G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) +...|.+|+-+ ||-. ... +..-+.+||+.+..+++..+.+..-.. T Consensus 12 ~~~~A~cpAs~---GELlQG~~~g~~~Lvs~PIa~~s~v~~~~~s~~~s~~----------------------------- 59 (293) T COG4542 12 MVISAKCPASC---GELLQGSFNGSEKLVSFPIALFSEVTLFDRSGKDSVN----------------------------- 59 (293) T ss_pred CCEEEECCCCH---HHHHEEEECCCCEEEECCHHHHHHEEEECCCCCCCCC----------------------------- T ss_conf 51114535024---2252336638752898003541004550147876555----------------------------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) ..+.+....... ...........|+.+.+.|+||.|.||+||+|..||++.+...+++.++. ..++++++..+ T Consensus 60 -~t~a~~~~e~~l-----a~~~~~~~~~~~i~l~lqSsIPvgKG~ASSTADl~At~~A~A~~l~~~l~-es~iakLcv~i 132 (293) T COG4542 60 -VTPAHKFKENIL-----ARWGVTKLINTGIDLLLQSSIPVGKGMASSTADLVATARATARFLGRELR-ESEIAKLCVSI 132 (293) T ss_pred -CCHHHHHHHHHH-----HHHCCCCEECCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHC T ss_conf -566689999999-----97285101058736998636655556563478899999999999678778-88999998625 Q ss_pred HHCCCCCCCCHHHHHHHHCCCEEEECCCCCCCCC---CCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 4115778773457876532630240278520014---7884399974971453799999999764120689998765489 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMGGIIYINPPQIIKNET---LVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTNE 242 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~gg~~~~~~~~~~~~~~---~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~~ 242 (337) | ..| ..+|.+...|+..+...++. .|+.-+++.-+...-.|..- +.-++.........+++...+. T Consensus 133 E--------PtD--siiF~~~tlFd~r~g~~~~~~g~~PpL~ilv~e~~~~v~T~~y-~q~~r~e~~~~~~~e~~~l~~~ 201 (293) T COG4542 133 E--------PTD--SIIFDKATLFDQREGRVIEFLGEMPPLHILVFEGKGTVETVDY-NQPPRGEKLLAPLAELGNLINL 201 (293) T ss_pred C--------CCC--CEECCCCEEEHHCCCHHHHHCCCCCCEEEEEECCCCCEEEEEC-CCCCHHHHHHHHHHHHHHHHHH T ss_conf 8--------765--4000561000220430888637899627999748982333314-6785465655449999999999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEEE Q ss_conf 9999983872168999999599999718997799999999985798-089507886788508999985 Q gi|254781018|r 243 MMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLVG 309 (337) Q Consensus 243 ~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~~ 309 (337) ..+|+..+|...+|+.-..+--. ..-.+.-|.+++|.+.+++.++ |.-+.-+|. .-|.++-. T Consensus 202 v~~A~~~~~~~~lG~AAT~SAv~-~Q~~LPK~~~~~lL~l~e~~~~~Gv~VAHSGt----mlGli~D~ 264 (293) T COG4542 202 VEKALKVGDPKLLGEAATLSAVK-NQDRLPKPGLNELLRLVEETCAIGVIVAHSGT----MLGLIYDR 264 (293) T ss_pred HHHHHCCCCHHHHHHHHHHHHHH-HCCCCCCHHHHHHHHHHHHHCCCCEEEECCCC----EEEEEECC T ss_conf 99987238988987888899886-32016832389999999873654458841674----67765403 No 45 >pfam10509 GalKase_gal_bdg Galactokinase galactose-binding signature. This is the highly conserved galactokinase signature sequence which appears to be present in all galactokinases irrespective of how many other ATP binding sites, etc that they carry. The function of this domain appears to be to bind galactose, and the domain is normally at the N-terminus of the enzymes, EC:2.7.1.6. This domain is associated with the families GHMP_kinases_C, pfam08544 and GHMP_kinases_N, pfam00288. Probab=99.33 E-value=2e-12 Score=97.07 Aligned_cols=48 Identities=23% Similarity=0.272 Sum_probs=45.4 Q ss_pred CCCCCCCCCEEEECCCEEEEECCCEEECCCEEEEEEECCCEEEEEEEC Q ss_conf 543579960899858179997445440795489987667279999984 Q gi|254781018|r 2 KQQTSQTPHAHSIAPAKVILSGEYSSLYGASALAVAITFYLRALLTTI 49 (337) Q Consensus 2 ~~~~~~~p~~~~~APGKi~L~GEh~v~~G~~ai~~aI~~~~~v~i~~~ 49 (337) ++.|+.+|+..++|||||+|+|||+||+||.|+++||++++++.++++ T Consensus 5 ~~~fg~~p~~~~~APGRvnlIGeHtDY~~G~Vlp~Ai~~~~~va~~~r 52 (52) T pfam10509 5 EEFFGVEPEGVASAPGRVNLIGEHTDYNGGFVLPAAINLDTYVAVSKR 52 (52) T ss_pred HHHHCCCCCEEEECCCEEEEEECCEECCCCEEEEEEECCCEEEEEECC T ss_conf 787789986699889758870114603897788898316369999749 No 46 >COG1829 Predicted archaeal kinase (sugar kinase superfamily) [General function prediction only] Probab=99.27 E-value=1.4e-08 Score=72.79 Aligned_cols=262 Identities=18% Similarity=0.191 Sum_probs=156.4 Q ss_pred EEEECCCEEEEE-----CCCEEECCCEEEEEEECCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 899858179997-----445440795489987667279999984898299995576667532100123333433211111 Q gi|254781018|r 11 AHSIAPAKVILS-----GEYSSLYGASALAVAITFYLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFIA 85 (337) Q Consensus 11 ~~~~APGKi~L~-----GEh~v~~G~~ai~~aI~~~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (337) ..+++||-|-=| +.--...|....-.+++.++++.++..++-.+.. +..- .|.. T Consensus 3 ~~~fvP~hITgfF~pv~~~~p~~SGSiGaGv~l~~gv~v~v~~~~~~~v~~-Ng~~--------------~d~~------ 61 (283) T COG1829 3 VRLFVPGHITGFFVPVIGKDPLKSGSIGAGVALERGVTVEVRFGEGTGVRL-NGKK--------------IDLP------ 61 (283) T ss_pred CEEECCCEEEEEEEECCCCCCCCCCCCCEEEEECCCEEEEEEECCCCEEEE-CCEE--------------CCCH------ T ss_conf 148526426898885248997667886423796386269999658735888-9963--------------1302------ Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 22103347778889999987511566788769999805541038985303789999999998187754025899999986 Q gi|254781018|r 86 KKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYI 165 (337) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~ 165 (337) . .....+. ..+.++.+.+.+++|.|+|+|-|+|.+.+.+.+++..++.. .+...+.||.+ T Consensus 62 ---~-----------~~~v~e~---L~~~~~~v~~~~~~P~G~G~G~Sga~AL~~Ala~a~~~~~~---~~~a~~~AH~a 121 (283) T COG1829 62 ---I-----------TRKVIEK---LGPDGVGVRIESPVPLGCGYGVSGAGALGTALALAEELGLG---EESAARIAHVA 121 (283) T ss_pred ---H-----------HHHHHHH---HCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCC---HHHHHHHHHHH T ss_conf ---6-----------9999998---47668626998517988654144799999999988664998---78999988899 Q ss_pred HHCCCCCCCCHHHHHHHHCCCE-EEECC--CCCCCCCCCCC-EEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 4115778773457876532630-24027--85200147884-39997497145379999999976412068999876548 Q gi|254781018|r 166 ERLQHGKTGIIDSTTIVMGGII-YINPP--QIIKNETLVGE-WWAINTGMPESSTGECVSFVEQHFSQSSIWSEFNAVTN 241 (337) Q Consensus 166 e~~~~~~~sg~D~~~~~~gg~~-~~~~~--~~~~~~~~~~~-~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~~i~~ 241 (337) |... ..+-.|+.+..+||++ +.... ..-..+.++.+ ..++.-.....+|++++... ..+++++... T Consensus 122 EV~~--gtGLGDVvAq~~GGlViR~~pG~Pg~~~vd~Ip~~~~~V~~~~~g~l~T~~vi~~~--------~~~~i~~~g~ 191 (283) T COG1829 122 EVEN--GTGLGDVVAQYTGGLVIRVKPGGPGEGEVDRIPVPGLRVITISLGELSTKSVITDE--------VVRKINEAGK 191 (283) T ss_pred HHHC--CCCCHHHHHHHCCCEEEEECCCCCCEEEEEEEECCCCEEEEEECCCCCHHHHHHHH--------HHHHHHHHHH T ss_conf 9871--78704788875085799831899985789886048826999973666677763067--------8999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC---EEEECCCCCCCCCEEEEEEEECCCH-HHHH Q ss_conf 99999983872168999999599999718997799999999985798---0895078867885089999857997-9999 Q gi|254781018|r 242 EMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG---SAKIAGAGSIRGEKAGLVLVGGYNP-EKLS 317 (337) Q Consensus 242 ~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~---~ak~sGAG~~~Gg~~~~~l~~~~~~-~~~~ 317 (337) ++.+.|.++ ..+.+.|+++.+.-+..|++++++...++.....+. ...|.| -..|.+.+.... +... T Consensus 192 ~~l~ellk~--Ptle~f~~~a~~FA~~~Gl~~~~v~~~~~~~~~~g~l~as~~mlg-------~tVFa~~~~~e~~~~~~ 262 (283) T COG1829 192 RALAELLKN--PTLENFMEEAREFARRTGLMSEEVKEAAEALHSVGGLGASMAMLG-------VTVFALSPEAERLAEVG 262 (283) T ss_pred HHHHHHHHC--CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHC-------CEEEEECCCCCHHHHHH T ss_conf 999998839--999999999999998728678899999987543474457678638-------68999623431077887 Q ss_pred HHCCCEEEEEEECCCC Q ss_conf 7459869999987997 Q gi|254781018|r 318 SLYGYTCYKIKEDKNG 333 (337) Q Consensus 318 ~~~~~~~~~~~~~~~G 333 (337) + .-+.-+.+.+...| T Consensus 263 ~-~~~~~~~~~l~~~g 277 (283) T COG1829 263 K-DLGAPYDIRLIYGG 277 (283) T ss_pred H-HHCCCCCEEEEECC T ss_conf 7-51466332688447 No 47 >pfam00288 GHMP_kinases_N GHMP kinases N terminal domain. This family includes homoserine kinases, galactokinases and mevalonate kinases. Probab=99.20 E-value=3.4e-11 Score=89.31 Aligned_cols=67 Identities=36% Similarity=0.489 Sum_probs=59.6 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCC Q ss_conf 999980554103898530378999999999818775402589999998641157787734578765326 Q gi|254781018|r 117 SLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGG 185 (337) Q Consensus 117 ~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg 185 (337) ++.+.++||.++|||||||+++|++++++++++.+++ .+++.++++.+|+.. +.++|+|++++.||| T Consensus 1 ~i~i~s~iP~g~GLgSSaa~~~a~~~al~~~~~~~l~-~~~l~~~a~~~E~~~-g~~~G~d~~~~~~gg 67 (67) T pfam00288 1 DIEIESNIPLGAGLGSSAALAVALLLALNELFGLPLS-KEELARLALEAEGAI-GVNSGDDVAASVYGG 67 (67) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHH-CCCCHHHHHHHHCCC T ss_conf 9257568998678655699999999999999589989-999999999999981-898811677642277 No 48 >KOG2833 consensus Probab=99.15 E-value=2.5e-07 Score=64.94 Aligned_cols=272 Identities=16% Similarity=0.127 Sum_probs=141.4 Q ss_pred CEEEECCCEEEEECCCEEECCCE--EEEEEECCCEEEEEEECCCCEEEE----EECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 08998581799974454407954--899876672799999848982999----955766675321001233334332111 Q gi|254781018|r 10 HAHSIAPAKVILSGEYSSLYGAS--ALAVAITFYLRALLTTIEPSLIRI----INSQTIEYSFEACRLLGHQIDQRHRDF 83 (337) Q Consensus 10 ~~~~~APGKi~L~GEh~v~~G~~--ai~~aI~~~~~v~i~~~~~~~~~~----~s~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (337) .+.+.||-.|-++ -|+|.. .+.+|+|.-+-+++++ |..... .+..+ ..|.-|.+. T Consensus 7 ~~t~taPvNIAvI----KYWGKRD~~l~LP~N~SISvTL~~--D~L~t~Tt~~~s~~f-------------e~dr~wLNG 67 (395) T KOG2833 7 EVTATAPVNIAVI----KYWGKRDEELNLPTNDSISVTLSQ--DDLCTLTTAVVSPSF-------------EEDRMWLNG 67 (395) T ss_pred EEEECCCCEEEEE----EECCCCCHHHCCCCCCEEEEEECC--CCCEEEEEEEECCCC-------------CCCEEEECC T ss_conf 6763166236665----412444132137777706888535--531445788842664-------------454178878 Q ss_pred CC---CCCCHHHHHHHHHHHHHHHHHHC---CCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 11---22103347778889999987511---5667887699998055410389853037899999999981877540258 Q gi|254781018|r 84 IA---KKIPITSVLTHPNDLILYIFNRH---LPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEK 157 (337) Q Consensus 84 ~~---~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~ 157 (337) .. .....+.+++.......-..+.. ....+..++|...+++|..+||.||||--.|++.++.++++++.+ .++ T Consensus 68 k~~~i~~~R~q~cl~e~r~~~~d~~~~~~~~~~~~~~~lHI~S~nNFPtAAGLASSAAG~Aalv~alarly~l~~~-~~e 146 (395) T KOG2833 68 KEVPISNKRYQRCLREIRRLARDREESEASLPSNGPLKLHIASVNNFPTAAGLASSAAGFAALVLALARLYGLDDS-PEE 146 (395) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC-HHH T ss_conf 2213663789999999999865323311256767871178874378830320011024089999999998589999-899 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEEC------CCCCCCCCCCC------CEE-EEEC--CCCCCCHHHHHHH Q ss_conf 9999998641157787734578765326302402------78520014788------439-9974--9714537999999 Q gi|254781018|r 158 LIAETGYIERLQHGKTGIIDSTTIVMGGIIYINP------PQIIKNETLVG------EWW-AINT--GMPESSTGECVSF 222 (337) Q Consensus 158 ~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~------~~~~~~~~~~~------~~~-~i~~--~~~~~st~~~~~~ 222 (337) +..+|++. || -..-++|||.+.++. .+.......|. ..+ +|.+ .+...||.-+... T Consensus 147 ls~iAR~G--------SG-SACRSl~GG~V~W~mG~~~DGsDsvAvq~~p~~~W~el~ililVvs~~~K~t~ST~GM~~s 217 (395) T KOG2833 147 LSRIARQG--------SG-SACRSLYGGFVAWEMGELDDGSDSVAVQIAPSSHWPELRILILVVSDAKKKTGSTEGMRRS 217 (395) T ss_pred HHHHHHCC--------CC-HHHHHHCCCEEEEECCCCCCCCCEEEEEECCCCCCCCEEEEEEEECCCCCCCCCCHHHHHH T ss_conf 99998614--------86-0455430453677315645787503787035347873189999960553344661888988 Q ss_pred HHHHHCCCHHHHH-HHHHHHHHHHHHHCCCCHHHHHHH-HH-H--HHH-HHHC---CCCCHHHHHHHHHHHHCC---C-- Q ss_conf 9976412068999-876548999999838721689999-99-5--999-9971---899779999999998579---8-- Q gi|254781018|r 223 VEQHFSQSSIWSE-FNAVTNEMMENIQKKDAEKTYKSV-RV-N--QSL-LESI---GVVPETVGKFIRCIENNG---G-- 288 (337) Q Consensus 223 ~~~~~~~~~~~~~-~~~i~~~~~~al~~~d~~~l~~~~-~~-~--~~l-l~~l---~vs~~~l~~l~~~~~~~g---~-- 288 (337) ++...--...++. +.++..+|.+++.+.||+.|.++. +. | |.. ++.+ -.-++.-.+++..+.++. + T Consensus 218 veTS~L~qhRi~~vVP~Ri~~m~eaI~~rDF~~FA~lTm~DSNqFHAvclDT~PPI~YmNd~S~~iI~~vh~~N~~~G~t 297 (395) T KOG2833 218 VETSQLLQHRIESVVPQRIQQMREAIRERDFESFAKLTMKDSNQFHAVCLDTFPPIFYLNDTSWRIISLVHEFNASAGGT 297 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHCCCCE T ss_conf 88769999999866479999999999865778877777613124564423158974785364079999999987516971 Q ss_pred -EEEECCCCCCCCCEEEEEEEECCCHHH Q ss_conf -089507886788508999985799799 Q gi|254781018|r 289 -SAKIAGAGSIRGEKAGLVLVGGYNPEK 315 (337) Q Consensus 289 -~ak~sGAG~~~Gg~~~~~l~~~~~~~~ 315 (337) .|----||+ ..+++....+... T Consensus 298 ~vAYTFDAGP-----NAvl~~l~e~~~~ 320 (395) T KOG2833 298 RVAYTFDAGP-----NAVLIVLEENVSQ 320 (395) T ss_pred EEEEEECCCC-----CEEEEEHHHHHHH T ss_conf 6898833898-----6289983640999 No 49 >COG1907 Predicted archaeal sugar kinases [General function prediction only] Probab=98.98 E-value=1.1e-06 Score=60.78 Aligned_cols=272 Identities=17% Similarity=0.186 Sum_probs=159.1 Q ss_pred EEEECCCEEEE-----ECCCEEECCCEEEEEEECC-CEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89985817999-----7445440795489987667-27999998489829999557666753210012333343321111 Q gi|254781018|r 11 AHSIAPAKVIL-----SGEYSSLYGASALAVAITF-YLRALLTTIEPSLIRIINSQTIEYSFEACRLLGHQIDQRHRDFI 84 (337) Q Consensus 11 ~~~~APGKi~L-----~GEh~v~~G~~ai~~aI~~-~~~v~i~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (337) ..+++|.|+.+ -|+.-=++|+-.++ +.- +..++++++++-.. ... . . T Consensus 2 v~v~tpSrlH~gliDl~G~~GRv~GgvG~a--Le~P~l~i~~~~s~~~~~--~ge------------------~----~- 54 (312) T COG1907 2 VRVRTPSRLHAGLIDLNGSIGRVDGGVGLA--LEEPRLEIEAKPSDDIEV--DGE------------------D----R- 54 (312) T ss_pred EEEECCCEEEECCCCCCCCCCCEECCEEEE--EECCCEEEEEECCCCCCC--CCH------------------H----H- T ss_conf 587047501100445787532130440488--507815999720554112--550------------------6----6- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 12210334777888999998751156678876999980554103898530378999999999818775402589999998 Q gi|254781018|r 85 AKKIPITSVLTHPNDLILYIFNRHLPHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGY 164 (337) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~ 164 (337) ....... + +.......|++|.|.+++|.-.||||-..++.++..++++++++.++- .+ +|.. T Consensus 55 ------~~~~~~~---a-----~~~le~~~gv~I~I~~~~P~HvGLGS~TQlaLa~a~ai~~i~gl~~~~-~e---lA~~ 116 (312) T COG1907 55 ------RERVEKA---A-----RLVLEVGEGVKIEIRSDIPAHVGLGSTTQLALAVASAILEIYGLELSI-RE---LAFA 116 (312) T ss_pred ------HHHHHHH---H-----HHHHCCCCCEEEEEEECCCHHCCCCHHHHHHHHHHHHHHHHHCCCCCH-HH---HHHH T ss_conf ------8999999---9-----975141675299997527322077737799999999999986589998-99---9999 Q ss_pred HHHCCCCCCCCHHHHHHHHCCCEE-----EE--CCCC---CCCCCCCCCEEEEE-CCC--CCCCHHHHHHHHHH-HHCCC Q ss_conf 641157787734578765326302-----40--2785---20014788439997-497--14537999999997-64120 Q gi|254781018|r 165 IERLQHGKTGIIDSTTIVMGGIIY-----IN--PPQI---IKNETLVGEWWAIN-TGM--PESSTGECVSFVEQ-HFSQS 230 (337) Q Consensus 165 ~e~~~~~~~sg~D~~~~~~gg~~~-----~~--~~~~---~~~~~~~~~~~~i~-~~~--~~~st~~~~~~~~~-~~~~~ 230 (337) .-+ +..||.-.++--+||++. +. +... .....+|++|.++. -+. +..+..+.+...++ +.-.. T Consensus 117 vgR---G~tSgiGv~afe~GGFIVDGGh~~~f~ps~~sP~I~R~dfPedW~~VlaIP~~~rg~~~~~E~~if~~~~p~p~ 193 (312) T COG1907 117 VGR---GGTSGIGVYAFEYGGFIVDGGHSFGFLPSSASPLIFRLDFPEDWRFVLAIPEVERGVSGRREVDIFKKYCPVPL 193 (312) T ss_pred HCC---CCCCCEEEEEEEECCEEEECCCCCCCCCCCCCCEEEEECCCCCEEEEEEECCCCCCCCCHHHHHHHHHCCCCCH T ss_conf 736---77564137998878899878744676668999605532288865999994478843210278889975689997 Q ss_pred HHHHHHHHHH-HHHHHHHHCCCCHHHHHHHHHHHHHH----HHC--CCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEE Q ss_conf 6899987654-89999998387216899999959999----971--8997799999999985798089507886788508 Q gi|254781018|r 231 SIWSEFNAVT-NEMMENIQKKDAEKTYKSVRVNQSLL----ESI--GVVPETVGKFIRCIENNGGSAKIAGAGSIRGEKA 303 (337) Q Consensus 231 ~~~~~~~~i~-~~~~~al~~~d~~~l~~~~~~~~~ll----~~l--~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~ 303 (337) +....+.+++ ..+.-++..+|++.+++.+++=|.+. +.. .+.++.++++++.+.+...++.+|-=|+ T Consensus 194 ~~~~~ls~~vLm~mmPavvE~Die~fg~~l~~iQ~l~g~~f~~~e~~~~~~~V~~iv~~m~~~a~~agqSSwGP------ 267 (312) T COG1907 194 EEVGELSHRVLMKMMPAVVERDIESFGEALNEIQELGGKWFKKVEGGLQREDVKEIVDEMVEAAYGAGQSSWGP------ 267 (312) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCEECCHHHHHHHHHHHHHCCCCCCCCCCC------ T ss_conf 99989999999987589886379999999999998876646564131022899999999987230443256688------ Q ss_pred EEEEEECCCHH-------HHHHH----CCCEEEEEEECCCCEEEC Q ss_conf 99998579979-------99974----598699999879970929 Q gi|254781018|r 304 GLVLVGGYNPE-------KLSSL----YGYTCYKIKEDKNGTMLA 337 (337) Q Consensus 304 ~~~l~~~~~~~-------~~~~~----~~~~~~~~~~~~~G~~i~ 337 (337) ++....++.+ .+..+ .+.+++-.+.+..|..|. T Consensus 268 -tvY~i~d~~~~~~~~~~~~~~~~~~g~~gev~vT~~rN~Ga~i~ 311 (312) T COG1907 268 -TVYGIVDSREAGSVVRKLIDILLEEGIGGEVFVTKARNRGAEIL 311 (312) T ss_pred -EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCEEC T ss_conf -78986165321247999999987438861899952677776521 No 50 >TIGR01240 mevDPdecarb diphosphomevalonate decarboxylase; InterPro: IPR005935 Synonyms: mevalonate diphosphate decarboxylase; pyrophosphomevalonate decarboxylase. This group of enzymes belongs to the GHMP kinase domain superfamily. GHMP kinases are a unique class of ATP-dependent enzymes (the abbreviation of which refers to the original members: galactokinase, homoserine kinase, mevalonate kinase, and phosphomevalonate kinase) . Enzymes belonging to this superfamily contain three well-conserved motifs, the second of which has the typical sequence Pro-X-X-X-Gly-Leu-X-Ser-Ser-Ala and is involved in ATP binding . The phosphate binding loop in GHMP kinases is distinct from the classical P-loops found in many ATP/GTP binding proteins. The bound ADP molecule adopts a rare syn conformation and is in the opposite orientation from those bound to the P-loop-containing proteins . GHMP kinases display a distinctly bilobal appearance with their N-terminal subdomains dominated by a mixed beta-sheet flanked on one side by alpha-helices and their C-terminal subdomains containing a four stranded anti-parallel beta-sheet , , , . Diphosphomevalonate decarboxylase (mevalonate pyrophosphate decarboxylase, (4.1.1.33 from EC) catalyzes the decarboxylation of mevalonate pyrophosphate to isopentyl pyrophosphate (IPP) , the last step in the synthesis of IPP in the mevalonate pathway. Mevalonate is a key intermediate in the biosynthesis of sterols and non-sterol isoprenes in the mevalonate pathway. In mammals, the majority of mevalonate is converted into cholesterol.ATP + (R)-5-diphosphomevalonate = ADP + phosphate + isopentenyl diphosphate + CO2 ; GO: 0004163 diphosphomevalonate decarboxylase activity, 0008299 isoprenoid biosynthetic process. Probab=98.90 E-value=1.7e-06 Score=59.69 Aligned_cols=190 Identities=16% Similarity=0.085 Sum_probs=115.5 Q ss_pred CEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEECCCC Q ss_conf 76999980554103898530378999999999818775402589999998641157787734578765326302402785 Q gi|254781018|r 115 GISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYINPPQI 194 (337) Q Consensus 115 g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~~~ 194 (337) -+++....++|.++||.||||=..|++.|+++++.+++++. .+...|+.. ||- -.=|+|||.+..+.+.- T Consensus 100 k~~~~s~Nn~PTAAGLaSSasGlaALv~A~~~l~~l~ld~~-~lsr~Ar~g--------SGS-acRS~FGg~vaWekG~~ 169 (325) T TIGR01240 100 KVKVVSQNNVPTAAGLASSASGLAALVSACDKLYKLQLDDK-ELSRIARKG--------SGS-ACRSLFGGYVAWEKGKD 169 (325) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHC--------CCC-CHHHHCCCCCCCCCCCC T ss_conf 46788507862144466689999999999999970798878-999886421--------673-00320588000027767 Q ss_pred -CCC-------------CCCCCCEEEEECCCCCCCHH-HHHHHHHHHHCCCHHHHH-HHHHHHHHHHHHHCCCCHHHHHH Q ss_conf -200-------------14788439997497145379-999999976412068999-87654899999983872168999 Q gi|254781018|r 195 -IKN-------------ETLVGEWWAINTGMPESSTG-ECVSFVEQHFSQSSIWSE-FNAVTNEMMENIQKKDAEKTYKS 258 (337) Q Consensus 195 -~~~-------------~~~~~~~~~i~~~~~~~st~-~~~~~~~~~~~~~~~~~~-~~~i~~~~~~al~~~d~~~l~~~ 258 (337) ..+ +.....+++|...+..-|+. -+...+..-.-...-++. ..++-+.+.++|+.+||+.+++. T Consensus 170 edd~ss~a~~v~~dS~~~~ma~~~lv~nd~KK~vSSr~GM~ltv~TS~~f~ew~e~v~~dr~~~m~~aik~kdF~~~g~~ 249 (325) T TIGR01240 170 EDDLSSAAVKVEDDSDEDKMAMVVLVVNDIKKDVSSREGMKLTVETSELFKEWVESVVIDRFEVMKKAIKTKDFEKLGKE 249 (325) T ss_pred CCCCCCEEEECCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 67887403656777850011077776313688745303568889877764223431000589999999732128999999 Q ss_pred HHHHHHHHHH--------CCCCC---HHHHHHHHHHHH-C-CC-EEEECCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 9995999997--------18997---799999999985-7-98-0895078867885089999857997999974 Q gi|254781018|r 259 VRVNQSLLES--------IGVVP---ETVGKFIRCIEN-N-GG-SAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL 319 (337) Q Consensus 259 ~~~~~~ll~~--------l~vs~---~~l~~l~~~~~~-~-g~-~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~ 319 (337) +..|-.-++. +---+ -++-.-|+.+++ + |. .+---=||+ ..-++....+..++.+. T Consensus 250 t~~n~l~mHaTtl~a~PPf~Y~~~~S~~a~~av~~lrqfy~g~~~yfTmDAGP-----Nvkvl~l~~~ls~l~e~ 319 (325) T TIGR01240 250 TEANSLSMHATTLDAQPPFSYLSDESKKAMEAVKELRQFYEGIICYFTMDAGP-----NVKVLVLEEDLSKLAET 319 (325) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCC-----CEEEEEECCCHHHHHHH T ss_conf 98745652000014774800038446899999999988738813687415888-----53887623236899999 No 51 >KOG1537 consensus Probab=98.65 E-value=4.6e-07 Score=63.28 Aligned_cols=215 Identities=15% Similarity=0.140 Sum_probs=130.7 Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH---------------------- Q ss_conf 6678876999980554103898530378999999999818775402589999998641---------------------- Q gi|254781018|r 110 PHISCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIER---------------------- 167 (337) Q Consensus 110 ~~~~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~---------------------- 167 (337) ..++.+-+..+...||.++|+|||++..++.++..+...++.++... ....+..+|+ T Consensus 89 s~fp~~Tk~hvtNPiplgrGigssgta~~aGv~l~ne~a~LGlsk~~-mldy~lmierhpdn~~a~mmGgf~GSflr~l~ 167 (355) T KOG1537 89 SIFPITTKKHVTNPIPLGRGIGSSGTAKMAGVRLVNESADLGLSKGS-MLDYSLMIERHPDNAVAEMMGGFLGSFLRALL 167 (355) T ss_pred EECCCCEEEEECCCCCCCCCCCCHHHHHHHHHEECCHHHHCCCCCCC-CHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHC T ss_conf 41353002541377656555230345665442130447652876102-30178887647177899998667899998758 Q ss_pred ----CCCCCCCCHHHHHHHHCCCEEEECCCCCCC------CCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHH Q ss_conf ----157787734578765326302402785200------1478843999749714537999999997641206899987 Q gi|254781018|r 168 ----LQHGKTGIIDSTTIVMGGIIYINPPQIIKN------ETLVGEWWAINTGMPESSTGECVSFVEQHFSQSSIWSEFN 237 (337) Q Consensus 168 ----~~~~~~sg~D~~~~~~gg~~~~~~~~~~~~------~~~~~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~~~~ 237 (337) ...+ .+-.|+.....||...+.+.+...+ ...+..++++....-+..|++++..+...+-....+-.++ T Consensus 168 e~E~~~~~-~~ad~ilp~~~gg~~li~~lpP~dlg~~~r~pw~~~lk~i~viP~Fel~T~k~R~vLPt~yp~~d~V~Nlq 246 (355) T KOG1537 168 ESEAKVSG-YHADNILPAIMGGFVLIRNLPPLDLGKPLRFPWDKDLKFILVIPDFELPTKKMRAVLPTEYPMVDHVWNLQ 246 (355) T ss_pred HHHHHHCC-CCHHHHCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCEEEEECHH T ss_conf 76664237-88777250125870550588952236633377776613899944444660455420675566300220088 Q ss_pred HHHHHHHHHHHCC-CCHHHH-HHH--HHHHHHHHHCCCCCHHHHHHHHHHHHC---CC-EEEECCCCCCCCCEEEEEEEE Q ss_conf 6548999999838-721689-999--995999997189977999999999857---98-089507886788508999985 Q gi|254781018|r 238 AVTNEMMENIQKK-DAEKTY-KSV--RVNQSLLESIGVVPETVGKFIRCIENN---GG-SAKIAGAGSIRGEKAGLVLVG 309 (337) Q Consensus 238 ~i~~~~~~al~~~-d~~~l~-~~~--~~~~~ll~~l~vs~~~l~~l~~~~~~~---g~-~ak~sGAG~~~Gg~~~~~l~~ 309 (337) +.+. +..|+..+ |-..++ .+| +.+|..+..| .|.++.+...+.+. |+ |..+||||++ ...++. T Consensus 247 rlA~-LttAl~~~p~n~~L~y~~m~DkvhqPyRa~L---IPGl~~il~~~~p~t~pGl~GiclSGAGPT-----~lAlat 317 (355) T KOG1537 247 RLAA-LTTALLEGPDNVMLGYALMSDKVHQPYRAPL---IPGLEAILKAALPATYPGLFGICLSGAGPT-----ALALAT 317 (355) T ss_pred HHHH-HHHHHHCCCCCHHHHHHHHHCCCCCCCCCCC---CCCHHHHHHHHCCCCCCCEEEEEECCCCCE-----EEEEEC T ss_conf 8999-9999814997135555665321247531455---741899998618556786245884388973-----688743 Q ss_pred CCCHHHHHH-----H--C----CCEEEEEEECCCCEEE Q ss_conf 799799997-----4--5----9869999987997092 Q gi|254781018|r 310 GYNPEKLSS-----L--Y----GYTCYKIKEDKNGTML 336 (337) Q Consensus 310 ~~~~~~~~~-----~--~----~~~~~~~~~~~~G~~i 336 (337) .+-+.+.+ + . .|..++.++|.-|++. T Consensus 318 -enf~eI~~~mv~~F~K~G~kcs~~~l~pa~Dga~v~~ 354 (355) T KOG1537 318 -ENFQEIGEKMVEAFWKVGHKCSVASLKPALDGAGVRL 354 (355) T ss_pred -CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCEEE T ss_conf -7478999999999986184434676200237764233 No 52 >pfam08544 GHMP_kinases_C GHMP kinases C terminal. This family includes homoserine kinases, galactokinases and mevalonate kinases. Probab=98.16 E-value=6.1e-06 Score=56.22 Aligned_cols=61 Identities=21% Similarity=0.388 Sum_probs=50.7 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC-EEEECCCCCCCCCEEEEEEE Q ss_conf 999983872168999999599999718997799999999985798-08950788678850899998 Q gi|254781018|r 244 MENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG-SAKIAGAGSIRGEKAGLVLV 308 (337) Q Consensus 244 ~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~~~Gg~~~~~l~ 308 (337) .+++.++|++.+++.|+.++.+ ....+.+|.++++++.+++.|+ +++||||| +|.+.+++. T Consensus 2 ~~~l~~~d~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~G~~~a~~sGsg---~G~~v~~l~ 63 (82) T pfam08544 2 LEALREGDLELLGELLNENALL-LYLGILSPELDELLEELRELGALGAKLSGSG---GGPTVFALF 63 (82) T ss_pred HHHHHCCCHHHHHHHHHHHHHC-CHHHCCCHHHHHHHHHHHHCCCCEEECCCCC---CCCEEEEEE T ss_conf 3788818999999999999864-0550577889999999998599855412589---888588885 No 53 >TIGR00144 beta_RFAP_syn beta-ribofuranosylaminobenzene 5'-phosphate synthase family; InterPro: IPR004422 This protein family contains several archaeal examples of beta-ribofuranosylaminobenzene 5-prime-phosphate synthase (beta-RFAP synthase), an enzyme involved in methanopterin biosynthesis . In some species, two members of this family are found. It is unclear whether both act as beta-RFAP synthase. This family is related to the GHMP kinases (Galactokinase, Homoserine kinase, Mevalonate kinase, Phosphomevalonate kinase). Members are found so far only in the Archaea and in Methylobacterium extorquens.They have an amino-terminal domain probably related to ATP binding.. Probab=95.40 E-value=0.19 Score=27.83 Aligned_cols=178 Identities=15% Similarity=0.212 Sum_probs=103.0 Q ss_pred CCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCEEEECC Q ss_conf 88769999805541038985303789999999998187754025899999986411577877345787653263024027 Q gi|254781018|r 113 SCGISLDLHSTIPIGSGFGSSSAIISALSLALSSITHQPFCNKEKLIAETGYIERLQHGKTGIIDSTTIVMGGIIYINPP 192 (337) Q Consensus 113 ~~g~~i~i~s~iP~~~GLGSSaA~~va~~~al~~~~~~~~~~~~~~~~~a~~~e~~~~~~~sg~D~~~~~~gg~~~~~~~ 192 (337) ..++++.+.+..|.-.||||...+..++.+.+..+++..+...+ ++... -.+..+|.....--+||.+.--.. T Consensus 84 d~~~~~~~~~~~~~~~glg~g~~~~l~~~~~~~~~~~~~~~~~~----~~~~~---g~gg~~g~g~~~~~~gg~~~d~g~ 156 (337) T TIGR00144 84 DEGFDLEVLSLFPAHSGLGSGTQLSLAVGRLVSELYGLELTARE----LAEIV---GRGGTSGIGVAAFEDGGFLVDGGH 156 (337) T ss_pred CCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHH----HHHHH---CCCCCCCCCEEEECCCCEEEECCC T ss_conf 65511687530320014566512467788888887324310455----66653---277754330122215752664565 Q ss_pred -----------------------CCCCCCCCCCCEEE-EECCCC--CCCHHHHHHHHHH-HHCCCHHHHHHHH-HHHHHH Q ss_conf -----------------------85200147884399-974971--4537999999997-6412068999876-548999 Q gi|254781018|r 193 -----------------------QIIKNETLVGEWWA-INTGMP--ESSTGECVSFVEQ-HFSQSSIWSEFNA-VTNEMM 244 (337) Q Consensus 193 -----------------------~~~~~~~~~~~~~~-i~~~~~--~~st~~~~~~~~~-~~~~~~~~~~~~~-i~~~~~ 244 (337) +......+|..|-. +..... .......+...++ ...--..++++.+ +..++. T Consensus 157 ~~g~g~~~~~~~p~~~~~g~~p~p~~~~~d~p~~w~~~~~~p~~~~~~~g~~~~~~~~~~cp~pl~~~~~~~~~~l~~~~ 236 (337) T TIGR00144 157 STGKGKEKEDFLPSSASKGAKPAPVIARLDFPEDWNVILAIPELDRGVFGKREVNLFQKYCPVPLEDVEKLCHLILMKML 236 (337) T ss_pred CCCCCCCCHHCCCCCCCCCCCCCCEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 44666430002553212567765312212577454158873132111244113344554258766788999999998765 Q ss_pred HHHHCCCCHHHHHHHHHHHHH-HHH--CCCCCHHHHHHHHHHHHCCC-EEEECCCCC Q ss_conf 999838721689999995999-997--18997799999999985798-089507886 Q gi|254781018|r 245 ENIQKKDAEKTYKSVRVNQSL-LES--IGVVPETVGKFIRCIENNGG-SAKIAGAGS 297 (337) Q Consensus 245 ~al~~~d~~~l~~~~~~~~~l-l~~--l~vs~~~l~~l~~~~~~~g~-~ak~sGAG~ 297 (337) -++..+|++.+++.+++-+.+ .++ +....|.++.+++.+.+.|+ ++.++..|+ T Consensus 237 p~~~~~~~~~~g~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~~~gp 293 (337) T TIGR00144 237 PAVVEGDLDDFGEALNKLQGLGFKKVELELQGPLIKGLIDLLDKDGAPGAGLSSFGP 293 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 566630156777888777631302344555335788888765414753134222475 No 54 >PRK05182 DNA-directed RNA polymerase subunit alpha; Provisional Probab=70.69 E-value=7.1 Score=18.00 Aligned_cols=51 Identities=6% Similarity=0.038 Sum_probs=42.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHCCC Q ss_conf 65489999998387216899999959999971-8997799999999985798 Q gi|254781018|r 238 AVTNEMMENIQKKDAEKTYKSVRVNQSLLESI-GVVPETVGKFIRCIENNGG 288 (337) Q Consensus 238 ~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l-~vs~~~l~~l~~~~~~~g~ 288 (337) +++-++.++|++.++..+++++..+++-+.++ ..--..++++.+.+.+.|+ T Consensus 251 eLSvRs~NcLk~a~I~tI~dLv~~se~dLl~ikNfGkKSl~EI~~kL~~~gl 302 (306) T PRK05182 251 ELSVRSYNCLKRAGINTIGDLVQRTEEELLKTPNLGKKSLEEIKEKLAELGL 302 (306) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCCCCHHHHHHHHHHCCC T ss_conf 3868887558785993699987479999961899762039999999997298 No 55 >TIGR00738 rrf2_super rrf2 family protein (putative transcriptional regulator); InterPro: IPR000944 The following uncharacterised bacterial proteins have been shown to be evolutionary related, Desulphovibrio vulgaris protein Rrf2; Escherichia coli hypothetical proteins yfhP and yjeB; Bacillus subtilis hypothetical proteins yhdE, yrzC and ywgB; Mycobacterium tuberculosis hypothetical protein Rv1287; and Synechocystis strain PCC 6803 hypothetical protein slr0846. These are small proteins of 12 to 18 kD which seem to contain a signal sequence, and may represent a family of probable transcriptional regulators.. Probab=68.61 E-value=7.3 Score=17.90 Aligned_cols=50 Identities=18% Similarity=0.318 Sum_probs=38.0 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 99997189977999999999857980895078867885089999857997999974 Q gi|254781018|r 264 SLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEKAGLVLVGGYNPEKLSSL 319 (337) Q Consensus 264 ~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~ 319 (337) ++-++.+||.+-|.+|+..+.+.|..-=+-|+| ||+.|+.+++.-.+.++ T Consensus 30 ~ia~~~~Is~~yL~K~l~~L~~~GlV~S~rG~~------GGY~L~~~~~~I~~~dv 79 (133) T TIGR00738 30 EIAERQGISRSYLEKVLRTLRKAGLVESVRGPG------GGYRLARPPEEITVGDV 79 (133) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCEECCCCCC------CCCCCCCCHHHCCHHHH T ss_conf 999884899879999999887707642115898------56310775343049888 No 56 >TIGR02027 rpoA DNA-directed RNA polymerase, alpha subunit; InterPro: IPR011773 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family consists of the bacterial (and chloroplast) DNA-directed RNA polymerase alpha subunit, encoded by the rpoA gene. The RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. The amino terminal domain is involved in dimerizing and assembling the other RNA polymerase subunits into a transcriptionally active enzyme. The carboxy-terminal domain contains determinants for interaction with DNA and with transcriptional activator proteins , .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription DNA-dependent. Probab=63.20 E-value=9.9 Score=17.07 Aligned_cols=51 Identities=8% Similarity=0.051 Sum_probs=42.1 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHH-CCC Q ss_conf 65489999998387216899999959999971-8997799999999985-798 Q gi|254781018|r 238 AVTNEMMENIQKKDAEKTYKSVRVNQSLLESI-GVVPETVGKFIRCIEN-NGG 288 (337) Q Consensus 238 ~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l-~vs~~~l~~l~~~~~~-~g~ 288 (337) +++-++.++|++...+.+|+|+..+..-|.++ .+--..++++.+.+.+ .|+ T Consensus 267 dLSVRs~NCLk~~gI~t~geL~~~~e~eLl~i~NfGkKSl~EI~ekL~~~lgL 319 (324) T TIGR02027 267 DLSVRSYNCLKRAGIHTLGELVSKSEEELLKIKNFGKKSLDEIKEKLAEKLGL 319 (324) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 45489886464603541788886038988417885733389999998876188 No 57 >TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process. Probab=52.19 E-value=14 Score=16.20 Aligned_cols=15 Identities=7% Similarity=0.220 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 799999999985798 Q gi|254781018|r 274 ETVGKFIRCIENNGG 288 (337) Q Consensus 274 ~~l~~l~~~~~~~g~ 288 (337) --|++||+++.+||| T Consensus 195 aPi~eiCDlA~kYGA 209 (427) T TIGR01821 195 APIKEICDLADKYGA 209 (427) T ss_pred CCHHHHHHHHHHHCC T ss_conf 667889668887188 No 58 >PRK00068 hypothetical protein; Validated Probab=51.29 E-value=16 Score=15.81 Aligned_cols=21 Identities=10% Similarity=-0.054 Sum_probs=11.0 Q ss_pred EEECCCEEE-CCCEEEEEEECC Q ss_conf 997445440-795489987667 Q gi|254781018|r 20 ILSGEYSSL-YGASALAVAITF 40 (337) Q Consensus 20 ~L~GEh~v~-~G~~ai~~aI~~ 40 (337) .+..+|.+| ||.-+++.+.|+ T Consensus 427 tWvN~hlvYTHGyG~v~apvn~ 448 (978) T PRK00068 427 DWINRHTVYTHGNGFIASPANS 448 (978) T ss_pred CCCCCCEEECCCCCEEECCCCC T ss_conf 7521113741357168535321 No 59 >PRK08507 prephenate dehydrogenase; Validated Probab=50.95 E-value=16 Score=15.78 Aligned_cols=37 Identities=19% Similarity=0.212 Sum_probs=22.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 9998765489999998387216899999959999971 Q gi|254781018|r 233 WSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI 269 (337) Q Consensus 233 ~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l 269 (337) ++++.+...++.++|+++|.+.+.+++.+....++.| T Consensus 239 l~~f~~~L~~l~~~l~~~d~~~l~~~l~~a~~~R~~L 275 (275) T PRK08507 239 IDFFEKELEQAKELIENEDWEELFEWMAQANTLREIL 275 (275) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999998169999999999999988529 No 60 >CHL00013 rpoA RNA polymerase alpha subunit Probab=49.35 E-value=17 Score=15.62 Aligned_cols=50 Identities=12% Similarity=0.193 Sum_probs=39.7 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHH-CC Q ss_conf 65489999998387216899999959999971-8997799999999985-79 Q gi|254781018|r 238 AVTNEMMENIQKKDAEKTYKSVRVNQSLLESI-GVVPETVGKFIRCIEN-NG 287 (337) Q Consensus 238 ~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l-~vs~~~l~~l~~~~~~-~g 287 (337) +++-++.++|++.++..+++++..+.+-|.++ ..--..++++.+.+.+ +| T Consensus 278 eLSvRs~NCLKra~I~tvgdLv~~se~dLl~ikNfGkKSl~EI~ekL~e~~G 329 (333) T CHL00013 278 ELSVRAYNCLKRANIHTLLDLLNYSQEDLLKIKNFGQKSVKQVLDALEKRFG 329 (333) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHHHHHC T ss_conf 1878877436373893799986389999941899863029999999988629 No 61 >PRK07417 arogenate dehydrogenase; Reviewed Probab=43.22 E-value=21 Score=15.03 Aligned_cols=38 Identities=11% Similarity=0.131 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 89998765489999998387216899999959999971 Q gi|254781018|r 232 IWSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI 269 (337) Q Consensus 232 ~~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l 269 (337) .++++.+..+++.++|+++|.+.+.+++.+....++++ T Consensus 241 ~l~~~~~~L~~l~~~l~~~D~~~l~~~l~~a~~~R~~~ 278 (280) T PRK07417 241 SLASYRWSLDQLEEAILQENWSALEQKLEQTQELRPNF 278 (280) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999998079999999999999999874 No 62 >PRK07502 cyclohexadienyl dehydrogenase; Validated Probab=41.92 E-value=22 Score=14.90 Aligned_cols=37 Identities=14% Similarity=0.208 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 9998765489999998387216899999959999971 Q gi|254781018|r 233 WSEFNAVTNEMMENIQKKDAEKTYKSVRVNQSLLESI 269 (337) Q Consensus 233 ~~~~~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l 269 (337) ++.+.+-.+++.++|+++|.+.+.+++.+.+..|+.| T Consensus 252 ld~~~~~L~~l~~~i~~~d~~~l~~~~~~a~~~R~~l 288 (307) T PRK07502 252 LGRFTEDLAALQRAIRWGDGEALFDLFTRTRAIRRGI 288 (307) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999998359999999999999999999 No 63 >COG1959 Predicted transcriptional regulator [Transcription] Probab=35.60 E-value=27 Score=14.28 Aligned_cols=17 Identities=18% Similarity=0.311 Sum_probs=5.6 Q ss_pred CCHHHHHHHHHHHHCCC Q ss_conf 97799999999985798 Q gi|254781018|r 272 VPETVGKFIRCIENNGG 288 (337) Q Consensus 272 s~~~l~~l~~~~~~~g~ 288 (337) +.+-+.+++..+++.|. T Consensus 38 s~~~L~kil~~L~kaGl 54 (150) T COG1959 38 SPSYLEKILSKLRKAGL 54 (150) T ss_pred CHHHHHHHHHHHHHCCC T ss_conf 99999999999987798 No 64 >COG0041 PurE Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism] Probab=34.49 E-value=29 Score=14.17 Aligned_cols=12 Identities=25% Similarity=0.202 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHH Q ss_conf 258999999864 Q gi|254781018|r 155 KEKLIAETGYIE 166 (337) Q Consensus 155 ~~~~~~~a~~~e 166 (337) ++.+.+++...+ T Consensus 43 Pe~m~~ya~~a~ 54 (162) T COG0041 43 PEKMFEYAEEAE 54 (162) T ss_pred HHHHHHHHHHHH T ss_conf 899999999998 No 65 >pfam03118 RNA_pol_A_CTD Bacterial RNA polymerase, alpha chain C terminal domain. The alpha subunit of RNA polymerase consists of two independently folded domains, referred to as amino-terminal and carboxyl terminal domains. The amino terminal domain is involved in the interaction with the other subunits of the RNA polymerase. The carboxyl-terminal domain interacts with the DNA and activators. The amino acid sequence of the alpha subunit is conserved in prokaryotic and chloroplast RNA polymerases. There are three regions of particularly strong conservation, two in the amino-terminal and one in the carboxyl- terminal. Probab=33.04 E-value=30 Score=14.02 Aligned_cols=47 Identities=6% Similarity=0.064 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHH Q ss_conf 65489999998387216899999959999971-899779999999998 Q gi|254781018|r 238 AVTNEMMENIQKKDAEKTYKSVRVNQSLLESI-GVVPETVGKFIRCIE 284 (337) Q Consensus 238 ~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l-~vs~~~l~~l~~~~~ 284 (337) .++.++.++|++.++..+++++..+..-+.++ ++-...++++.+.+. T Consensus 14 ~LS~R~~N~Lk~~~I~tv~dL~~~s~~dLl~i~N~G~kSl~EI~~~L~ 61 (62) T pfam03118 14 ELSVRSYNCLKRAGINTVGDLLSKSEEDLLKIKNFGKKSLEEIKEKLE 61 (62) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHHHC T ss_conf 686899999989499679999858999997488986857999999982 No 66 >pfam00392 GntR Bacterial regulatory proteins, gntR family. This family of regulatory proteins consists of the N-terminal HTH region of GntR-like bacterial transcription factors. At the C-terminus there is usually an effector-binding/oligomerisation domain. The GntR-like proteins include the following sub-families: MocR, YtrR, FadR, AraR, HutC and PlmA, DevA, DasR. Many of these proteins have been shown experimentally to be autoregulatory, enabling the prediction of operator sites and the discovery of cis/trans relationships. The DasR regulator has been shown to be a global regulator of primary metabolism and development in Streptomyces coelicolor. Probab=30.23 E-value=34 Score=13.73 Aligned_cols=59 Identities=24% Similarity=0.280 Sum_probs=43.3 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 765489999998387216899999959999971899779999999998579808950788 Q gi|254781018|r 237 NAVTNEMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAG 296 (337) Q Consensus 237 ~~i~~~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG 296 (337) .++.+...+.+.++.+..-.++-. -..+.+.++||...+.+-+..+.+.|.....-|.| T Consensus 3 ~qi~~~i~~~I~~g~~~~G~~LPs-~~~La~~~~vSr~tvr~Al~~L~~~G~i~~~~g~G 61 (64) T pfam00392 3 EQVYARLREDILSGRLRPGDKLPS-ERELAAEFGVSRTTVREALRRLEAEGLVERRPGRG 61 (64) T ss_pred HHHHHHHHHHHHCCCCCCCCCCHH-HHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCE T ss_conf 999999999998499999299847-99999997969999999999999889389977953 No 67 >TIGR01825 gly_Cac_T_rel pyridoxal phosphate-dependent acyltransferase, putative; InterPro: IPR010962 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . This entry contains a number of enzyme families: Serine palmitoyltransferase Glycine C-acetyltransferase (2-amino-3-ketobutyrate coenzyme A ligase) 5-aminolevulinic acid synthase 8-amino-7-oxononanoate synthase All transfer the R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from coenzyme A to an amino acid (Gly, Gly, Ala, respectively), with release of CO2 for the latter two reactions.; GO: 0016740 transferase activity, 0030170 pyridoxal phosphate binding. Probab=27.07 E-value=38 Score=13.38 Aligned_cols=15 Identities=27% Similarity=0.328 Sum_probs=5.5 Q ss_pred CCCCCHHHHHHHHHH Q ss_conf 898530378999999 Q gi|254781018|r 129 GFGSSSAIISALSLA 143 (337) Q Consensus 129 GLGSSaA~~va~~~a 143 (337) |.|..|-=++|...- T Consensus 64 GvGAGAVRTIaGTl~ 78 (392) T TIGR01825 64 GVGAGAVRTIAGTLE 78 (392) T ss_pred CCCCCCHHHHHHHHH T ss_conf 962241210433778 No 68 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=26.84 E-value=38 Score=13.36 Aligned_cols=41 Identities=29% Similarity=0.508 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHCCC-----------CCHHHHHHHHHHHHCCC-EEEECCCCCC Q ss_conf 99995999997189-----------97799999999985798-0895078867 Q gi|254781018|r 258 SVRVNQSLLESIGV-----------VPETVGKFIRCIENNGG-SAKIAGAGSI 298 (337) Q Consensus 258 ~~~~~~~ll~~l~v-----------s~~~l~~l~~~~~~~g~-~ak~sGAG~~ 298 (337) .|+...+.|++++| +++.+.+..+.+++-|. -.=+-|||+. T Consensus 13 ~m~~a~~~L~~fgi~~e~~V~SAHRTP~~~~~ya~~a~~~G~P~ViIAgAGga 65 (159) T TIGR01162 13 TMKKAAEILEEFGIPYELRVVSAHRTPELMFEYAKEAEERGIPKVIIAGAGGA 65 (159) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 99999999985599667898606778088999999998678997998403511 No 69 >COG1117 PstB ABC-type phosphate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=23.34 E-value=23 Score=14.75 Aligned_cols=36 Identities=11% Similarity=0.237 Sum_probs=20.6 Q ss_pred CCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEEEECCCHHHHHHHCCCE Q ss_conf 99779999999998579808950788678850899998579979999745986 Q gi|254781018|r 271 VVPETVGKFIRCIENNGGSAKIAGAGSIRGEKAGLVLVGGYNPEKLSSLYGYT 323 (337) Q Consensus 271 vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~~~~l~~~~~~~~~~~~~~~~ 323 (337) +|+-++++|+..+++- . +++-..++.+.+.++++|. T Consensus 183 IsT~kIEeLi~eLk~~-----y------------TIviVTHnmqQAaRvSD~t 218 (253) T COG1117 183 ISTLKIEELITELKKK-----Y------------TIVIVTHNMQQAARVSDYT 218 (253) T ss_pred HHHHHHHHHHHHHHHC-----C------------EEEEEECCHHHHHHHHHHH T ss_conf 3588999999998746-----4------------8999937999987877755 No 70 >PRK07012 consensus Probab=23.15 E-value=45 Score=12.93 Aligned_cols=135 Identities=11% Similarity=0.194 Sum_probs=60.5 Q ss_pred CCCCCHHHHHHHHCCCEEEECCCCCC-CCCCCC----CEEEEECCCCCCCHHHHHHHHHHHHCCCHHHH-----HH---- Q ss_conf 78773457876532630240278520-014788----43999749714537999999997641206899-----98---- Q gi|254781018|r 171 GKTGIIDSTTIVMGGIIYINPPQIIK-NETLVG----EWWAINTGMPESSTGECVSFVEQHFSQSSIWS-----EF---- 236 (337) Q Consensus 171 ~~~sg~D~~~~~~gg~~~~~~~~~~~-~~~~~~----~~~~i~~~~~~~st~~~~~~~~~~~~~~~~~~-----~~---- 236 (337) |.||+.--.+|..=|+.-+++.+..- ++.+.+ .++-|+..-....-.++++.+.+++.+...-+ .+ T Consensus 363 GRGSaAGSLVaY~LgIT~vDPi~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~d~VAqI~Tf~T~~ak~ 442 (1173) T PRK07012 363 GRGSGAGSLVAYALGITDLDPLRYNLLFERFLNPERVSMPDFDIDFCQHGRDRVIQYVKEKYGADAVSQIATFGTMAAKA 442 (1173) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 87703767999984474438422586378861667788999986577545299999999986302111466688899999 Q ss_pred --HHHHH------HHHHHHHC------CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCE Q ss_conf --76548------99999983------87216899999959999971899779999999998579808950788678850 Q gi|254781018|r 237 --NAVTN------EMMENIQK------KDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEK 302 (337) Q Consensus 237 --~~i~~------~~~~al~~------~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~ 302 (337) .++.. ...+.+.+ ++.-.+.+.+.++..+.+.+ -+.|.+..+++.++..-+-.|-+|- .. T Consensus 443 AiRDvgRvlg~~~~~~d~iaK~ip~~~~~~~tl~~a~~~~~~l~~~~-~~~~~~~~l~~~A~~LeGl~R~~g~-----HA 516 (1173) T PRK07012 443 AVRDIGRVLDLGYMFTDGVAKLIPFKPGKHVTIADAMKEEPLLQERY-DNEDEVHQLLDLAQRVEGLTRNVGM-----HA 516 (1173) T ss_pred HHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHCHHHHHHH-CCCHHHHHHHHHHHHHHCCCCCCCC-----CC T ss_conf 99999998589988887786218877888677899977474456553-1689999999999998278531003-----66 Q ss_pred EEEEEEECC Q ss_conf 899998579 Q gi|254781018|r 303 AGLVLVGGY 311 (337) Q Consensus 303 ~~~~l~~~~ 311 (337) ||+++++.+ T Consensus 517 aGvVIs~~~ 525 (1173) T PRK07012 517 GGVLIAPGK 525 (1173) T ss_pred CCEEECCCC T ss_conf 725863443 No 71 >TIGR01959 nuoF_fam NADH-quinone oxidoreductase, F subunit; InterPro: IPR011537 This entry describes the F chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. ; GO: 0010181 FMN binding, 0016651 oxidoreductase activity acting on NADH or NADPH, 0051287 NAD binding, 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport. Probab=23.12 E-value=27 Score=14.29 Aligned_cols=24 Identities=17% Similarity=-0.085 Sum_probs=14.2 Q ss_pred HCCCCC--CCCHHHHHHHHCCCEEEE Q ss_conf 115778--773457876532630240 Q gi|254781018|r 167 RLQHGK--TGIIDSTTIVMGGIIYIN 190 (337) Q Consensus 167 ~~~~~~--~sg~D~~~~~~gg~~~~~ 190 (337) ..++.| +||.|.-...+-|.=.|- T Consensus 148 G~LGkNIlGsGFDf~l~~H~GAGAYI 173 (420) T TIGR01959 148 GLLGKNILGSGFDFELFVHRGAGAYI 173 (420) T ss_pred CCCCCCCCCCCCCEEEEEECCCCCEE T ss_conf 98200337886205763433888505 No 72 >pfam02082 Rrf2 Transcriptional regulator. This family is related to pfam001022 and other transcription regulation families (personal obs: Yeats C). Probab=22.95 E-value=45 Score=12.90 Aligned_cols=43 Identities=19% Similarity=0.304 Sum_probs=30.0 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEEEECCC Q ss_conf 9999718997799999999985798089507886788508999985799 Q gi|254781018|r 264 SLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEKAGLVLVGGYN 312 (337) Q Consensus 264 ~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~~~~l~~~~~ 312 (337) ++-+.++++.+.+.+++..+.+.|...-.-|++ ||+.|+.+++ T Consensus 29 ~iA~~~~i~~~~l~kil~~L~~~gli~s~rG~~------GGy~L~k~~~ 71 (82) T pfam02082 29 EIAERQNISPVYLRKILAKLRKAGLVKSVRGPG------GGYRLARPPE 71 (82) T ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCEEECCCCC------CCCCCCCCHH T ss_conf 999878909999999999810279825658999------8811348989 No 73 >smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor. Probab=22.08 E-value=47 Score=12.79 Aligned_cols=56 Identities=21% Similarity=0.301 Sum_probs=39.4 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 489999998387216899999959999971899779999999998579808950788 Q gi|254781018|r 240 TNEMMENIQKKDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAG 296 (337) Q Consensus 240 ~~~~~~al~~~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG 296 (337) ++...+.+.++.+..-..+-. ...+.+.++||...+.+-++.+.+-|.....-|.| T Consensus 2 a~~i~~~I~~g~~~~G~~LPs-~~~la~~~~vSr~tvr~A~~~L~~~G~i~~~~g~G 57 (60) T smart00345 2 AERLREDIVSGELRPGDKLPS-ERELAAQLGVSRTTVREALSRLEAEGLVQRRPGSG 57 (60) T ss_pred HHHHHHHHHCCCCCCCCCCCC-HHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCE T ss_conf 899999998399998398810-99999998949999999999999889789966854 No 74 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=21.56 E-value=48 Score=12.73 Aligned_cols=58 Identities=12% Similarity=0.152 Sum_probs=33.3 Q ss_pred CCCCCCCCCEEEECCC-EEEEECCCEEECCCEEEEEEECC----------------CEEEEEEECCCCEEEEEEC Q ss_conf 5435799608998581-79997445440795489987667----------------2799999848982999955 Q gi|254781018|r 2 KQQTSQTPHAHSIAPA-KVILSGEYSSLYGASALAVAITF----------------YLRALLTTIEPSLIRIINS 59 (337) Q Consensus 2 ~~~~~~~p~~~~~APG-Ki~L~GEh~v~~G~~ai~~aI~~----------------~~~v~i~~~~~~~~~~~s~ 59 (337) |-+||.+|-+++.-+. ++.+.-|--.+...+.+...+|. -.+-.+.+.+..+...+.. T Consensus 139 RDr~GiKPLYY~~~~~g~~~FaSEikall~~~~~~~~id~~~l~~yl~~~~~~~~~T~f~~I~~l~PG~~l~~~~ 213 (628) T TIGR03108 139 RDRLGIKPLYYALLADGWFIFGSELKALTAHPSLPRELDPLAVEDYFAYGYVPDPRTIFKGVKKLEPGHTLTLRR 213 (628) T ss_pred CCCCCCCCEEEEECCCCEEEEEHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEECCCCCEEEEEC T ss_conf 686667530689818962998100468873788763115999999997378899864015741318970799988 No 75 >TIGR02313 HpaI-NOT-DapA 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase; InterPro: IPR012691 This entry represents a subset of the DapA (dihydrodipicolinate synthase) family which has apparently evolved a separate function. The product of DapA, dihydrodipicolinate, results from the non-enzymatic cyclization and dehydration of 6-amino-2,4-dihydroxyhept-2-ene-1,7-dioic acid, which is different from the substrate of this reaction only in the presence of the amino group. In the absence of this amino group, and running the reaction in the opposite direction, the reaction corresponds to the HpaI aldolase component of the 4-hydroxyphenylacetic acid catabolism pathway (see IPR012689 from INTERPRO). At present, this variant of DapA is found only in Oceanobacillus iheyensis HTE831 and Thermus thermophilus HB27. In both of these cases, one or more other DapA genes can be found and the members of this family are found in the operon for 4-hydroxyphenylacetic acid catabolism.; GO: 0008840 dihydrodipicolinate synthase activity, 0019877 diaminopimelate biosynthetic process, 0005737 cytoplasm. Probab=21.55 E-value=48 Score=12.72 Aligned_cols=49 Identities=12% Similarity=0.297 Sum_probs=34.7 Q ss_pred HHHHHHHHHCCCCHHHHH----HHHHHHHHHHH-----------------------CCCCCHHHH-HHHHHHHHCCC Q ss_conf 489999998387216899----99995999997-----------------------189977999-99999985798 Q gi|254781018|r 240 TNEMMENIQKKDAEKTYK----SVRVNQSLLES-----------------------IGVVPETVG-KFIRCIENNGG 288 (337) Q Consensus 240 ~~~~~~al~~~d~~~l~~----~~~~~~~ll~~-----------------------l~vs~~~l~-~l~~~~~~~g~ 288 (337) +-++.+++..||++...+ +|+.|..+++| +|..+|+++ +|.+.+.+||. T Consensus 215 VA~l~~~~~~GD~~~A~~LHF~Ll~~N~~~F~DTNP~P~K~~L~~MGl~E~~~R~PlG~~~~~~~E~~R~~~~~YG~ 291 (294) T TIGR02313 215 VALLHEKALEGDIKRARDLHFELLELNDVLFKDTNPIPLKAVLSMMGLLEKVLRRPLGLTSDELEERLRDTLVKYGK 291 (294) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 99999887516534688999999875466540689930899875520010453588877876788999999986168 No 76 >PRK09074 consensus Probab=21.53 E-value=49 Score=12.72 Aligned_cols=135 Identities=12% Similarity=0.162 Sum_probs=58.3 Q ss_pred CCCCCHHHHHHHHCCCEEEECCCCCC-CCCCCC----CEEEEECCCCCCCHHHHHHHHHHHHCCCHH-----HHHH---- Q ss_conf 78773457876532630240278520-014788----439997497145379999999976412068-----9998---- Q gi|254781018|r 171 GKTGIIDSTTIVMGGIIYINPPQIIK-NETLVG----EWWAINTGMPESSTGECVSFVEQHFSQSSI-----WSEF---- 236 (337) Q Consensus 171 ~~~sg~D~~~~~~gg~~~~~~~~~~~-~~~~~~----~~~~i~~~~~~~st~~~~~~~~~~~~~~~~-----~~~~---- 236 (337) |.||+.--.+|..=|+.-+++.+..- ++.+.+ .++-|+..-....-.++++.+.+++..... +..+ T Consensus 370 GRGSaAGSLVaY~LgIT~vDPi~y~LlFERFLNpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~~~VAqI~Tfgt~~ak~ 449 (1149) T PRK09074 370 GRGSGAGSLVAYALTITDLDPLRFSLLFERFLNPERVSMPDFDIDFCQDRREEVIRYVQEKYGRDQVAQIITFGTLQARA 449 (1149) T ss_pred CCCCHHHHHHHHHHCCCCCCCHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHH T ss_conf 87727889999982787658020474188761444567899984436130899999999985846545420187799999 Q ss_pred --HHHHH------HHHHHHHC------CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCE Q ss_conf --76548------99999983------87216899999959999971899779999999998579808950788678850 Q gi|254781018|r 237 --NAVTN------EMMENIQK------KDAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEK 302 (337) Q Consensus 237 --~~i~~------~~~~al~~------~d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~ 302 (337) .++.. ...+.+.+ +....+.+.+..+..+. ++--+.|.+.++++.++..-+-.|=+|- .. T Consensus 450 aiRDvgRvlg~~~~~vd~iak~ip~~~~~~~tl~~a~~~~~~l~-~~~~~~~~~~~l~~~A~~leGl~R~~s~-----HA 523 (1149) T PRK09074 450 VLRDVGRVLQMPYGQVDRLCKLVPNNPANPVSLAQAIEDEPRLQ-EERDKDPVVARLLDIALKLEGLYRHAST-----HA 523 (1149) T ss_pred HHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCHHHHHHCCHHHH-HHHHCCHHHHHHHHHHHHHHCCCCCCCC-----CC T ss_conf 99999998589989999998527757888765898873486788-7764488999999999986346232244-----67 Q ss_pred EEEEEEECC Q ss_conf 899998579 Q gi|254781018|r 303 AGLVLVGGY 311 (337) Q Consensus 303 ~~~~l~~~~ 311 (337) ||+++...+ T Consensus 524 aGvVIs~~p 532 (1149) T PRK09074 524 AGIVIGDRP 532 (1149) T ss_pred CEEEECCCC T ss_conf 616733766 No 77 >cd05567 PTS_IIB_mannitol PTS_IIB_mannitol: subunit IIB of enzyme II (EII) of the mannitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a mannitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIA, IIB, and IIC domains are expressed from the mtlA gene as a single protein, also known as the mannitol PTS permease, the mtl transporter, or MtlA. MtlA is only functional as a dimer with the dimer contacts occuring between the IIC domains. MtlA takes up exogenous mannitol releasing the phosphate ester into the cytoplasm in preparation for oxidation to fructose-6-phosphate by the NAD-dependent mannitol-P dehydrogenase (MtlD). The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include mannitol, chitobiose/lichenan, ascorbate, lactose, galactitol, fructose, and a s Probab=20.95 E-value=29 Score=14.15 Aligned_cols=16 Identities=25% Similarity=0.326 Sum_probs=8.8 Q ss_pred CCCCCCCHHHHHHHHHH Q ss_conf 03898530378999999 Q gi|254781018|r 127 GSGFGSSSAIISALSLA 143 (337) Q Consensus 127 ~~GLGSSaA~~va~~~a 143 (337) .+|||||+=- .++++- T Consensus 8 DAGmGSSaMg-as~lrk 23 (87) T cd05567 8 DAGMGSSAMG-ASVLRK 23 (87) T ss_pred CCCCCHHHHH-HHHHHH T ss_conf 8998589999-999999 No 78 >PRK06326 consensus Probab=20.56 E-value=51 Score=12.60 Aligned_cols=135 Identities=16% Similarity=0.266 Sum_probs=60.9 Q ss_pred CCCCCHHHHHHHHCCCEEEECCCCCC-CCCCC----CCEEEEECCCCCCCHHHHHHHHHHHHCCCHH-----HH------ Q ss_conf 78773457876532630240278520-01478----8439997497145379999999976412068-----99------ Q gi|254781018|r 171 GKTGIIDSTTIVMGGIIYINPPQIIK-NETLV----GEWWAINTGMPESSTGECVSFVEQHFSQSSI-----WS------ 234 (337) Q Consensus 171 ~~~sg~D~~~~~~gg~~~~~~~~~~~-~~~~~----~~~~~i~~~~~~~st~~~~~~~~~~~~~~~~-----~~------ 234 (337) |.||+..-.+|..=|+.-+++.+..- ++.+. ..++-|+..-......++++.+.+++.+... +. T Consensus 411 GRGSAAGSLVaY~LgIT~VDPi~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVI~Yv~~kYG~d~VAqI~Tf~T~~ak~ 490 (1240) T PRK06326 411 GRGSGAGSVMLFLLGITEIEPIRFDLFFERFINPERLSYPDIDIDICMAGRERVINYAIERHGKDNVAQIITFGTMKAKM 490 (1240) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECHHHHHHHH T ss_conf 88731878999995474418431587588744766789998507388324799999999985877579860123137889 Q ss_pred HHHHHHH------HHHHHHHCC--CC-HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEE Q ss_conf 9876548------999999838--72-16899999959999971899779999999998579808950788678850899 Q gi|254781018|r 235 EFNAVTN------EMMENIQKK--DA-EKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGGSAKIAGAGSIRGEKAGL 305 (337) Q Consensus 235 ~~~~i~~------~~~~al~~~--d~-~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~~ak~sGAG~~~Gg~~~~ 305 (337) .+.++.. ...+.+.+. ++ ..+.+.+..+.. ++++..+.|....+++.+...-+-.+=+|- ..||+ T Consensus 491 AiRDvgRvlg~~~~~~d~iaK~ip~~~~tl~~a~~~~~~-lr~~~~~d~~~~~l~~~A~~LeGlpR~~s~-----HAaGv 564 (1240) T PRK06326 491 AVKDVGRTLDVPLSKVNHIAKHIPDLNTTLSKALETDPD-LRQLYINDAEAAQVIDMAKKLEGSIRNTGV-----HAAGV 564 (1240) T ss_pred HHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHCCHH-HHHHHCCCHHHHHHHHHHHHHCCCCCCCCC-----CCCEE T ss_conf 999999984999889999983177688627777642988-897750688899999999985398666566-----66526 Q ss_pred EEEECC Q ss_conf 998579 Q gi|254781018|r 306 VLVGGY 311 (337) Q Consensus 306 ~l~~~~ 311 (337) ++...+ T Consensus 565 VIs~~p 570 (1240) T PRK06326 565 IICGDP 570 (1240) T ss_pred EEECCC T ss_conf 861675 No 79 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=20.36 E-value=50 Score=12.65 Aligned_cols=13 Identities=23% Similarity=0.171 Sum_probs=4.6 Q ss_pred HHHHHHHHHHHHH Q ss_conf 2589999998641 Q gi|254781018|r 155 KEKLIAETGYIER 167 (337) Q Consensus 155 ~~~~~~~a~~~e~ 167 (337) ++-+..-|+..|- T Consensus 716 PHY~Cp~Cky~Ef 728 (1264) T TIGR01405 716 PHYLCPNCKYSEF 728 (1264) T ss_pred CCCCCCCCCEEEE T ss_conf 8750878735530 No 80 >pfam07862 Nif11 Nitrogen fixation protein of unknown function. This domain is found in the Cyanobacteria, and may be involved in nitrogen fixation, but no role has been assigned. Probab=20.15 E-value=52 Score=12.54 Aligned_cols=37 Identities=14% Similarity=0.159 Sum_probs=25.7 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC Q ss_conf 72168999999599999718997799999999985798 Q gi|254781018|r 251 DAEKTYKSVRVNQSLLESIGVVPETVGKFIRCIENNGG 288 (337) Q Consensus 251 d~~~l~~~~~~~~~ll~~l~vs~~~l~~l~~~~~~~g~ 288 (337) +++.|.+.+..+..+++++. .+...+.++.++++.|. T Consensus 5 ~l~~Fl~~~~~d~~l~~~l~-~~~~~~~vi~lA~~~Gf 41 (49) T pfam07862 5 QLKAFLEKVKSDPSLREQLK-ACSDAEEVVALAKEAGF 41 (49) T ss_pred HHHHHHHHHHCCHHHHHHHH-HCCCHHHHHHHHHHCCC T ss_conf 99999999853999999998-75999999999999599 Done!