Query gi|254781100|ref|YP_003065513.1| phospho-N-acetylmuramoyl-pentapeptide-transferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 366 No_of_seqs 212 out of 2824 Neff 7.0 Searched_HMMs 39220 Date Mon May 30 05:56:31 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781100.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00445 mraY phospho-N-acety 100.0 0 0 739.9 20.9 324 36-366 1-336 (336) 2 PRK00108 mraY phospho-N-acetyl 100.0 0 0 671.2 35.6 336 22-366 1-336 (336) 3 cd06852 GT_MraY Phospho-N-acet 100.0 0 0 525.9 29.4 279 62-361 1-279 (280) 4 COG0472 Rfe UDP-N-acetylmuramy 100.0 0 0 497.8 26.4 307 19-366 1-316 (319) 5 cd06856 GT_GPT_archaea UDP-Glc 100.0 0 0 442.1 23.0 262 60-359 3-280 (280) 6 TIGR02380 ECA_wecA undecapreny 100.0 0 0 412.6 25.8 291 33-363 4-311 (349) 7 cd06854 GT_WbpL_WbcO_like The 100.0 0 0 384.3 25.2 243 57-344 2-253 (253) 8 cd06853 GT_WecA_like This subf 100.0 0 0 352.8 24.1 237 63-339 1-249 (249) 9 cd06855 GT_GPT_euk UDP-GlcNAc: 100.0 0 0 346.6 17.5 259 41-334 2-269 (283) 10 cd06851 GT_GPT_like This famil 100.0 0 0 327.2 18.3 210 61-307 4-221 (223) 11 cd06499 GT_MraY-like Glycosylt 100.0 0 0 304.6 17.5 185 71-287 1-185 (185) 12 cd06912 GT_MraY_like This subf 100.0 0 0 302.1 17.1 193 60-287 1-193 (193) 13 pfam00953 Glycos_transf_4 Glyc 100.0 1.7E-38 4.4E-43 231.0 14.7 156 100-290 2-157 (158) 14 KOG2788 consensus 99.8 9.3E-20 2.4E-24 127.6 4.2 191 101-314 103-302 (418) 15 pfam10555 MraY_sig1 Phospho-N- 93.0 0.015 3.7E-07 32.7 -1.1 15 69-83 1-15 (26) 16 COG4956 Integral membrane prot 86.6 2 5.1E-05 20.8 9.4 87 15-114 34-127 (356) 17 COG4589 Predicted CDP-diglycer 48.7 18 0.00045 15.7 7.4 13 263-275 205-217 (303) 18 TIGR02838 spore_V_AC stage V s 42.0 23 0.00057 15.1 4.1 46 75-120 26-78 (141) 19 KOG2844 consensus 41.4 13 0.00033 16.4 1.2 16 323-338 450-465 (856) 20 PRK09699 D-allose transporter 38.5 26 0.00065 14.7 11.0 20 337-356 215-234 (325) 21 pfam06830 Root_cap Root cap. T 35.5 16 0.00041 15.9 0.9 29 52-80 13-41 (57) 22 pfam11821 DUF3341 Protein of u 34.0 30 0.00078 14.3 4.3 61 201-265 59-120 (173) 23 KOG4112 consensus 31.5 34 0.00086 14.1 3.2 30 30-59 57-86 (101) 24 COG3715 ManY Phosphotransferas 30.8 35 0.00088 14.0 2.4 20 202-221 177-196 (265) 25 PRK03839 putative kinase; Prov 23.0 42 0.0011 13.6 1.3 16 330-345 93-108 (180) 26 TIGR01920 Shik_kin_archae shik 22.3 50 0.0013 13.1 2.3 52 67-118 75-137 (273) 27 PRK11492 hyfE hydrogenase 4 me 21.9 51 0.0013 13.1 6.8 54 29-88 58-111 (213) No 1 >TIGR00445 mraY phospho-N-acetylmuramoyl-pentapeptide-transferase; InterPro: IPR003524 Phospho-N-acetylmuramoyl-pentapeptide-transferase (2.7.8.13 from EC) (mraY) is a bacterial enzyme responsible for the formation of the first lipid intermediate of the cell wall peptidoglycan synthesis. It catalyzes the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. It is an integral membrane protein with probably ten transmembrane domains. It belongs to family 4 of glycosyl transferases. Homologs of mraY have been found in the archaebacteria Methanobacterium thermoautotrophicum and in Arabidopsis thaliana (Mouse-ear cress).; GO: 0008963 phospho-N-acetylmuramoyl-pentapeptide-transferase activity, 0009252 peptidoglycan biosynthetic process, 0016020 membrane. Probab=100.00 E-value=0 Score=739.94 Aligned_cols=324 Identities=51% Similarity=0.885 Sum_probs=314.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHCCCC---CHHHHHHHHHHHHHHH Q ss_conf 99998899999998622069853887311024657-8851257787779999998753567---8989998877899999 Q gi|254781100|r 36 VFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKI-DTPTMGGIIILMGLIGSSLLWADFS---SLHVSIILLLTLSFGL 111 (366) Q Consensus 36 ~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~-~tP~~GGiai~~~~l~~~ll~~~~~---~~~~~~~l~~~l~~~l 111 (366) |.+++|.+|+.+|++| .+|++|+||||+|++|+ |||||||+.|+.++.++.+++.+.. |++.+..++.++.++. T Consensus 1 sl~~gP~~I~~L~~Lk--a~Q~ir~dGP~SH~~K~aGtPTMGGi~i~~~~~v~~vl~~~~~seiNpyv~~~l~V~lGy~~ 78 (336) T TIGR00445 1 SLLLGPKVILKLKKLK--AGQVIREDGPKSHLKKKAGTPTMGGIMIVFAIIVSTVLWAQLLSEINPYVLVVLFVLLGYGF 78 (336) T ss_pred CEEECHHHHHHHHHCC--CCCEECCCCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHEEHHHHHHHHHH T ss_conf 9145257999998415--79512045670112126756977424577999999999875300047121003343361245 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCC-CCCCCCHHHHH Q ss_conf 99874467875313566036889999999999999986402776655666653377512110000000-12222023578 Q gi|254781100|r 112 VGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMG-LFFIPFAAFVI 190 (366) Q Consensus 112 iG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg-~l~i~~~~~~i 190 (366) +||+||+.|+.+|+|+||+||+|+..|+++|.+++.+++++..++ ..+.+..||++|..+++| ++|+++..|++ T Consensus 79 ~GF~DDy~Ki~~K~n~GL~ak~K~~~~~i~al~~~~~L~~~G~~t-----~~~~i~~Pf~k~~~~~LGtl~y~~l~~F~l 153 (336) T TIGR00445 79 IGFVDDYRKIKRKSNKGLTAKQKLFGQIIIALIFVVWLYYYGLDT-----DATDIYIPFIKDFLFDLGTLLYILLALFVL 153 (336) T ss_pred HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC-----CEEEEEECCCCCCCCCHHHHHHHHHHHHHH T ss_conf 545545565531157998525789999999999999998238987-----601567354002554325799999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 89985987521111665798999999999976414753466664101478834678998886778889888556741001 Q gi|254781100|r 191 VATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIF 270 (366) Q Consensus 191 vg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIF 270 (366) ++.+||+|+|||+||||++++++++.++++++|.+||.++++|+++|.+.+..|++++|.|++||++||||||.+||||| T Consensus 154 v~~SNAvNLTDGLDGLA~~~~~~~~gal~i~a~~~gn~~F~~YLhiP~~~~~~El~if~~A~vG~~lGFLwfN~~PA~VF 233 (336) T TIGR00445 154 VGTSNAVNLTDGLDGLAIGPSAIAFGALAILAWATGNANFAKYLHIPKIKDSGELVIFCTALVGASLGFLWFNTYPAKVF 233 (336) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCEEEEHHHHHHHHHCCHHCCCCCCEEE T ss_conf 86045310000013889999999999999999995151068524787221563246621778723422110328856484 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC------CCCCCCCHHHHHHHHCCCCCCHH Q ss_conf 646108879999999999962458999999999999999999999998608------87666781228999679885158 Q gi|254781100|r 271 MGDTGSLALGAFIGGIAVSTKHEILLIIIGGLFVVETLSVIMQVGYFKMTK------KRVFLMAPIHHHFEKKGWTENQI 344 (366) Q Consensus 271 mGD~GS~~LG~~la~~ai~~~~~~~l~ii~~i~i~~~~sviiq~~~~k~~~------kr~f~~~plHH~l~~~G~s~~~i 344 (366) |||+||++||+.+|.+||++++|+++.+++++|++||+|||+||.|||.|| ||+|||||+|||||.+||||+|+ T Consensus 234 MGDtGSLALGG~l~~~Ai~~~~EiLLvi~gGvFviEtLSVIlQVgsYK~tkGPdGvGkR~FkMAPiHHHfEl~GW~E~~V 313 (336) T TIGR00445 234 MGDTGSLALGGALGAVAILTKNEILLVIMGGVFVIETLSVILQVGSYKTTKGPDGVGKRIFKMAPIHHHFELKGWSEPRV 313 (336) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCHHHHCCCCCCCCCH T ss_conf 17705588999999999998789999999999999999999987664105799740002444086210011578844501 Q ss_pred HHHHHHHHHHHHHHH-HHHHHCC Q ss_conf 987789999999999-9862049 Q gi|254781100|r 345 VIRFWIISFIFAVIG-LLTLKMR 366 (366) Q Consensus 345 v~~~~~i~~~~~~~~-i~~~k~~ 366 (366) |.|||+++++++++| ++++|+| T Consensus 314 vvrFWiI~llla~~ail~t~K~R 336 (336) T TIGR00445 314 VVRFWIISLLLALVAILATLKVR 336 (336) T ss_pred HHHHHHHHHHHHHHHHHHHEECC T ss_conf 00477899999999988520059 No 2 >PRK00108 mraY phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional Probab=100.00 E-value=0 Score=671.22 Aligned_cols=336 Identities=54% Similarity=0.990 Sum_probs=320.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 99999999999999999988999999986220698538873110246578851257787779999998753567898999 Q gi|254781100|r 22 TFRSSAAFFSAMLIVFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPTMGGIIILMGLIGSSLLWADFSSLHVSI 101 (366) Q Consensus 22 ~fr~~~a~l~sf~i~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~~~~~~~~ 101 (366) +||+++|+++||++++++||.+||++||++ .||++|||+||+|++|+|||||||++++.+++++.+++.+.++.+.+. T Consensus 1 ~~~~~~a~~isf~is~~~tp~~I~~~~k~~--~g~~ir~~g~~~H~~K~~tP~mGGi~i~~~~~i~~~~~~~~~~~~~~~ 78 (336) T PRK00108 1 TFRAILALLTALLISLLLGPWLIRFLRKLK--FGQPIREDGPQSHLKKAGTPTMGGILILLAIVISTLLWADLSNPYVWL 78 (336) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 999999999999999999999999999839--999876767142177999988059999999999999997258779999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 88778999999987446787531356603688999999999999998640277665566665337751211000000012 Q gi|254781100|r 102 ILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLF 181 (366) Q Consensus 102 ~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l 181 (366) .+.+.+.++.+|++||+.|+++|+++|++||.|++.|+++|.+...+.....+ ..+++.+||+++..++++++ T Consensus 79 ~l~~~l~~~~iG~~DD~~ki~~~~~~GL~~~~Kl~~Q~~~a~i~~~~~~~~~~-------~~~~l~ipf~~~~~i~lg~l 151 (336) T PRK00108 79 VLFVTLGFGLIGFLDDYLKVVKKNNKGLSARQKLLLQSLIALIFAVLLYFFGP-------FSTSLIIPFFKDVMLPLGFL 151 (336) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC-------CCCCEEECCCCCCCCCHHHH T ss_conf 99999999999999989998704778977579999999999999999998277-------67506603543433026799 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 22202357889985987521111665798999999999976414753466664101478834678998886778889888 Q gi|254781100|r 182 FIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLW 261 (366) Q Consensus 182 ~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~ 261 (366) +++++++|++|++||+|++||+||||+|.+++++.++++++|..++...+.+...++.++..|++++|.+++|||+|||| T Consensus 152 ~i~~~i~~ivg~~NAvNliDGLDGLA~G~~~i~~~~~~iia~~~~~~~~~~~~~~~~~~~~~~la~~~~~l~Ga~lGFL~ 231 (336) T PRK00108 152 YIPFAYFVIVGTSNAVNLTDGLDGLAIGPSVIVAAALGIIAYLQGNAVFANYLLIPYIPGAGELAIFCAALVGAGLGFLW 231 (336) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999875503202662267789999999999999984766655554066578934899999999987778876 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCC Q ss_conf 55674100164610887999999999996245899999999999999999999999860887666781228999679885 Q gi|254781100|r 262 FNISPAVIFMGDTGSLALGAFIGGIAVSTKHEILLIIIGGLFVVETLSVIMQVGYFKMTKKRVFLMAPIHHHFEKKGWTE 341 (366) Q Consensus 262 fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~~~~l~ii~~i~i~~~~sviiq~~~~k~~~kr~f~~~plHH~l~~~G~s~ 341 (366) ||+|||||||||+||+++|+.+|++++++++|+++++++++|++|++|||+||.+||.+|||+|||+|+|||||++|||| T Consensus 232 ~N~~PAkIFMGDtGSl~LG~~la~~ai~~~~~~~l~i~~~i~v~e~~sviiqv~~~k~~gkrif~m~PlHHh~e~~G~se 311 (336) T PRK00108 232 FNAYPAKVFMGDTGSLALGGALGAIAVLLRQEILLLIIGGVFVIETLSVILQVGSFKLTGKRIFRMAPIHHHFELKGWSE 311 (336) T ss_pred CCCCCCEEECCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCC T ss_conf 27982737136727799999999999997467999999999999999999998889960884677787299999779994 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 1589877899999999999862049 Q gi|254781100|r 342 NQIVIRFWIISFIFAVIGLLTLKMR 366 (366) Q Consensus 342 ~~iv~~~~~i~~~~~~~~i~~~k~~ 366 (366) +|+|.++|+++.+++++|++.++.| T Consensus 312 ~~iv~~f~~i~~~~~~~~~~~~~~~ 336 (336) T PRK00108 312 TKVVVRFWIITLILVLIGLATLKLR 336 (336) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 3168999999999999999995079 No 3 >cd06852 GT_MraY Phospho-N-acetylmuramoyl-pentapeptide-transferase (mraY) is an enzyme responsible for the formation of the first lipid intermediate in the synthesis of bacterial cell wall peptidoglycan. It catalyzes the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. It is an integral membrane protein with possibly ten transmembrane domains. Probab=100.00 E-value=0 Score=525.94 Aligned_cols=279 Identities=55% Similarity=0.990 Sum_probs=255.0 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 31102465788512577877799999987535678989998877899999998744678753135660368899999999 Q gi|254781100|r 62 NLPIHARKIDTPTMGGIIILMGLIGSSLLWADFSSLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFII 141 (366) Q Consensus 62 ~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~~~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~ 141 (366) +|++|++|+||||+||++++.+++++.+++.+.++.+.+..+.+++.++++|++||+.|+++|+++|+||+.|+..|+++ T Consensus 1 ~~~~h~~K~~TP~~GGlai~~~~l~~~l~~~~~~~~~~~~~l~~~~~~~~iGl~DD~~~i~~~~~~gl~~~~Kl~~q~~~ 80 (280) T cd06852 1 GPKSHLKKAGTPTMGGILFILAILISTLLWADLDSPEVLLLLLLTLGFGLIGFLDDYLKVVKKRNLGLSARQKLLLQFLI 80 (280) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 98311359999870799999999999999998457699999999999999999875888872156788776899999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999986402776655666653377512110000000122220235788998598752111166579899999999997 Q gi|254781100|r 142 AILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFI 221 (366) Q Consensus 142 a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii 221 (366) |.+.........+.... ...|++.....+.++++++++++|+++++||+|++||+||||+|++.+++.+++++ T Consensus 81 a~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~ti~~iv~~~Na~N~iDGlDGLa~g~~~i~~~~l~~i 153 (280) T cd06852 81 AIVFALLLYYFNGSGTL-------ITLPFFKNGLIDLGILYIPFAIFVIVGSSNAVNLTDGLDGLAAGVSIIVALALAII 153 (280) T ss_pred HHHHHHHHHHCCCCCCC-------CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 99999999974664444-------00465433210379999999999999998888976244206689999999999999 Q ss_pred HHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 64147534666641014788346789988867788898885567410016461088799999999999624589999999 Q gi|254781100|r 222 AYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKHEILLIIIGG 301 (366) Q Consensus 222 ~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~~~~l~ii~~ 301 (366) ++..++.. +++.++.+++|||+|||+||+|||||||||+||+++|+.+|++++.+++|...+++++ T Consensus 154 ~~~~~~~~--------------~~~~~~~~l~ga~lgFL~~N~~pAkIFmGD~GS~~LG~~la~~ai~~~~~~~~~~~~~ 219 (280) T cd06852 154 AYLAGNAV--------------FLAVFCAALVGACLGFLWFNAYPAKVFMGDTGSLALGGALAALAILTKQELLLLIIGG 219 (280) T ss_pred HHHHCCHH--------------HHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 99816826--------------9999999999999999984188163765785278999999999999766699999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999986088766678122899967988515898778999999999998 Q gi|254781100|r 302 LFVVETLSVIMQVGYFKMTKKRVFLMAPIHHHFEKKGWTENQIVIRFWIISFIFAVIGLL 361 (366) Q Consensus 302 i~i~~~~sviiq~~~~k~~~kr~f~~~plHH~l~~~G~s~~~iv~~~~~i~~~~~~~~i~ 361 (366) +|++|+++|++||.++|++|||+|+|+|+|||+|++||||+|+|+++|.++.+++.+|+. T Consensus 220 i~~~~~l~~~i~~~~~r~~~~~~f~~~plHH~l~~~G~s~~~~v~~~~~is~~~~~~~i~ 279 (280) T cd06852 220 VFVIEALSVILQVGSFKLTGKRIFKMAPLHHHFELKGWSETKVVVRFWIISLILALIGLL 279 (280) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999288878889739999876987589999999999999999996 No 4 >COG0472 Rfe UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N- acetylglucosamine-1-phosphate transferase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=497.81 Aligned_cols=307 Identities=48% Similarity=0.720 Sum_probs=255.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC-CCH Q ss_conf 9999999999999999999998899999998622069853887311024657885125778777999999875356-789 Q gi|254781100|r 19 KYVTFRSSAAFFSAMLIVFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPTMGGIIILMGLIGSSLLWADF-SSL 97 (366) Q Consensus 19 ~~~~fr~~~a~l~sf~i~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~-~~~ 97 (366) +|.++|. +++++++++++++|.+++++||.+.. | .|+|+||.|++ +||||||++|+.+++++.+++.+. ++. T Consensus 1 ~~~~~~~--~~i~a~~~s~~~~p~~~~~~~~~~~~--~-~~~~~~k~H~~--~tP~mGGl~I~~~~~~~~~l~~~~~~~~ 73 (319) T COG0472 1 MYLTFRA--ALIISFVISLILTPILIKFLRKLGLG--D-IPEDGPKSHKK--GTPTMGGLAILLSILLASLLAANLLTNP 73 (319) T ss_pred CHHHHHH--HHHHHHHHHHHHHHHHHHHHHHCCCC--C-CCCCCCCCCCC--CCCCCCHHHHHHHHHHHHHHHHHHCCCH T ss_conf 9389999--99999999999999999999982324--6-87767545799--9997644999999999999998721222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCC Q ss_conf 8999887789999999874467875313566036889999999999-999986402776655666653377512110000 Q gi|254781100|r 98 HVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFIIAI-LAVCALLFYSNSTLLGIETKTSIVFPFFRDIIL 176 (366) Q Consensus 98 ~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~-i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l 176 (366) +.+..+.....++.+|++||+.|+ +++.|.+.|...+. +............ ......|++.. T Consensus 74 ~~~~~ll~~~~~~~iG~lDD~~~l--------~~~~r~l~~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~--- 136 (319) T COG0472 74 YVWLVLLGLLGFGLIGFLDDRLKL--------SPKIRGLIQKLKALLLIIALGELPIKFL------DIPLGIPFFKL--- 136 (319) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHC------CCCCCCCHHHH--- T ss_conf 899999999999999999727650--------3888679999999999999998878744------56433307887--- Q ss_pred CCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 000-1222202357889985987521111665798999999999976414753466664101478834678998886778 Q gi|254781100|r 177 DMG-LFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGA 255 (366) Q Consensus 177 ~lg-~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga 255 (366) .+ ++++++++++++|.+||+|++||+||||+|++.+++.+++.+++..++. |.+++|++++|| T Consensus 137 -~~~~~~~~~~~~~ivg~sNAvNl~DGlDGLa~g~~~i~~~~~~~i~~~~~~~---------------~~~~~~~al~ga 200 (319) T COG0472 137 -PGPLLFILFAVFAIVGASNAVNLTDGLDGLAAGLSAIALLALALIALLQGLG---------------ELALICAALAGA 200 (319) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCH---------------HHHHHHHHHHHH T ss_conf -8999999999999999978999862650278899999999999999983216---------------799999999999 Q ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHCCCCCC-CCCHHHH Q ss_conf 8898885567410016461088799999999999624589999999--999999999999999986088766-6781228 Q gi|254781100|r 256 GIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKHEILLIIIGG--LFVVETLSVIMQVGYFKMTKKRVF-LMAPIHH 332 (366) Q Consensus 256 ~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~~~~l~ii~~--i~i~~~~sviiq~~~~k~~~kr~f-~~~plHH 332 (366) |+||||||+|||||||||+||+++|+.+|+++++.++|..+.+.+. +|++|++++++||.++|. ++|.- +|+|+|| T Consensus 201 ~LGFL~~N~~PAkIFmGDtGSl~LG~~la~iail~~~~~~l~~~~~~~v~v~et~~vil~r~~~~~-~~~~~~~~~~~HH 279 (319) T COG0472 201 CLGFLWFNFYPAKIFMGDTGSLALGAALAAIALLLKQEILLAIIGPLGVPVIETLSVILRRLSFKL-KPIRLPDMAHLHH 279 (319) T ss_pred HHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCC-CCCCCCCCCCCCH T ss_conf 887987507850504773656999999999999996058899996305687651313146555277-7777767788628 Q ss_pred HHHHCC---CCCCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999679---8851589877899999999999862049 Q gi|254781100|r 333 HFEKKG---WTENQIVIRFWIISFIFAVIGLLTLKMR 366 (366) Q Consensus 333 ~l~~~G---~s~~~iv~~~~~i~~~~~~~~i~~~k~~ 366 (366) |+|+.| |+|+|++.++|+++++++.+++...|.| T Consensus 280 ~~~~~g~~~~~e~~vv~~~w~i~~i~~~i~l~~~~~~ 316 (319) T COG0472 280 HLELKGLSEWSERQVVLRFWLISLILALIALLLLKSR 316 (319) T ss_pred HHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9997476556665211678999999999999996243 No 5 >cd06856 GT_GPT_archaea UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT)-like proteins in archaea. Eukaryotic GPT catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-linked glycosylation. Evidence for the existence of the N-glycosylation pathway in archaea has emerged and genes responsible for the pathway have been identified. A glycosyl transferase gene Mv1751 in M. voltae encodes for the enzyme that carries out the first step in the pathway, the attachment of GlcNAc to a dolichol lipid carrier in the membrane. A lethal mutation in the alg7 (GPT) gene in Saccharomyces cerevisiae was successfully complemented with Mv1751, the archaea gene, indicating that eukaryotic and archaeal enzymes may use the same substrates and are evolutionarily closer than the bacterial enzyme, which uses a different substrate. Probab=100.00 E-value=0 Score=442.08 Aligned_cols=262 Identities=25% Similarity=0.389 Sum_probs=221.3 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 8731102465788512577877799999987535678-989998877899999998744678753135660368899999 Q gi|254781100|r 60 VPNLPIHARKIDTPTMGGIIILMGLIGSSLLWADFSS-LHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIE 138 (366) Q Consensus 60 ~~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~~~-~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q 138 (366) +|-+|.| |++||||||++++.+++++.+++...++ .+.+..+.+.+.++++|++||+. +++++.|+..| T Consensus 3 ~D~~K~~--~~~iP~~GGi~i~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~iG~~DD~~--------~l~~~~Kl~~q 72 (280) T cd06856 3 RDVHKPG--KPEVPEMGGIAVLLGFSLGLLFLSALTHSVEALALLITSLLAGLIGLLDDIL--------GLSQSEKVLLT 72 (280) T ss_pred CCCCCCC--CCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH--------CCCHHHHHHHH T ss_conf 9888889--8899873299999999999999999535299999999999999999998750--------68699999999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999986402776655666653377512110000000122220235788998598752111166579899999999 Q gi|254781100|r 139 FIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAF 218 (366) Q Consensus 139 ~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l 218 (366) .+++............ ...|+. ...+.++++++++.+++++++||+|++||+||||+|++++++.++ T Consensus 73 ~~~a~~~~~~~~~~~~-----------~~~~~~--~~~~~~~~~~~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l 139 (280) T cd06856 73 ALPAIPLLVLKAGNPL-----------TSLPIG--GRVLGILYYLLIVPLGITGASNAFNMLAGFNGLEAGMGIIILLAL 139 (280) T ss_pred HHHHHHHHHHHHHCCC-----------CCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 9999999999872543-----------347520--355578999999999999997788885155027789999999999 Q ss_pred HHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99764147534666641014788346789988867788898885567410016461088799999999999624589999 Q gi|254781100|r 219 SFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKHEILLII 298 (366) Q Consensus 219 ~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~~~~l~i 298 (366) +++++..+|. |.+.+|.+++|||+||||||+|||||||||+||+++|+.+|++++++++|..+++ T Consensus 140 ~~i~~~~~~~---------------~~~~~~~~l~ga~lgFL~~N~~PAkIFmGDtGSl~lG~~la~~aIl~~~e~~l~i 204 (280) T cd06856 140 AIILLINGDY---------------DALIIALILVAALLAFLLYNKYPAKVFPGDVGTLPIGALIGTIAVLGGLEIILLI 204 (280) T ss_pred HHHHHHCCCH---------------HHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHCHHHHHHH T ss_conf 9999974869---------------9999999999998898872077151870658789999999999999751999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCC-------------CCCCHHHHHHHHCCC--CCCHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999998608876-------------667812289996798--85158987789999999999 Q gi|254781100|r 299 IGGLFVVETLSVIMQVGYFKMTKKRV-------------FLMAPIHHHFEKKGW--TENQIVIRFWIISFIFAVIG 359 (366) Q Consensus 299 i~~i~i~~~~sviiq~~~~k~~~kr~-------------f~~~plHH~l~~~G~--s~~~iv~~~~~i~~~~~~~~ 359 (366) +++++++|+++++.|+.++|.|+||. ++++|+|||++..+| +|+|+|.++|+++++|++++ T Consensus 205 ~~~~~vie~l~~i~q~~~~~~~~~~~~~~~~~g~l~~~~~~~~~~hhh~~~~~~~~~E~~vv~~~w~i~~~~~ila 280 (280) T cd06856 205 LLLPYVIDFLLKLRSKGGGKEHREKPTKVLEDGTLYPPPDKSSLLTLRLLLRKGPMTEKEVVLVLWALEALLGILA 280 (280) T ss_pred HHHHHHHHHHHHHHEEEEEEEECEECCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 9999999999988723688872201254466675367765667157899971699984617899999999999959 No 6 >TIGR02380 ECA_wecA undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase; InterPro: IPR012750 Members of this family are the WecA enzymes of enterobacterial common antigen (ECA) biosynthesis, undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase. This family represents one narrow clade, and closely related sequences outside this clade may represent enzymes that catalyse the same specific reaction, but in the context of different pathways. A His-rich motif in a cytosolic loop of this integral membrane protein, shown to be critical to enzymatic activity for WecA is variously present or absent in the clade that includes Bacillus subtilis TagO teichoic acid biosynthesis enzyme, which may catalyse the same reaction as WecA. ; GO: 0000287 magnesium ion binding, 0016780 phosphotransferase activity for other substituted phosphate groups, 0030145 manganese ion binding, 0009103 lipopolysaccharide biosynthetic process, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=100.00 E-value=0 Score=412.63 Aligned_cols=291 Identities=23% Similarity=0.365 Sum_probs=249.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH-HHHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 999999988999999986220698538873110246578851257787779-9999987535678989998877899999 Q gi|254781100|r 33 MLIVFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPTMGGIIILMG-LIGSSLLWADFSSLHVSIILLLTLSFGL 111 (366) Q Consensus 33 f~i~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~~GGiai~~~-~l~~~ll~~~~~~~~~~~~l~~~l~~~l 111 (366) ++.+|+.+-..+=.+||.+.+.|.+|||+.||+|+. -+|..||+.++.+ +......+.+..+ ..++.++++..+.. T Consensus 4 li~iFL~sF~~LF~~RkvA~~vGLVDKPn~RK~H~G--~iPLVGGiSif~~nLc~~l~~~~~~i~-~~~LyL~ca~~Lv~ 80 (349) T TIGR02380 4 LIVIFLSSFLTLFLMRKVAKIVGLVDKPNARKRHQG--VIPLVGGISIFLTNLCITLLLHDELIP-HLSLYLACATILVV 80 (349) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--CCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH T ss_conf 878999999999998675312057268888765545--413330199999999999977555651-03789999999999 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCC-CCCCCCHHHHH Q ss_conf 99874467875313566036889999999999999986402776655666653377512110000000-12222023578 Q gi|254781100|r 112 VGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMG-LFFIPFAAFVI 190 (366) Q Consensus 112 iG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg-~l~i~~~~~~i 190 (366) +|.+||+.| +|.+.|+..|..++++.......+-+ ++. ..++..++.+| +++.++|+|++ T Consensus 81 iG~lDDrfD--------isv~iR~~IQa~vs~vMi~~G~~~L~----sLG-------~i~g~~el~LgP~~g~iITiFAv 141 (349) T TIGR02380 81 IGVLDDRFD--------ISVKIRLAIQAAVSIVMIQFGNIYLH----SLG-------NIFGSKELTLGPLFGYIITIFAV 141 (349) T ss_pred HHHHHCCCC--------CCCCHHHHHHHHHHHHHHHHCCCCHH----HCC-------CCCCHHHHHHHHHHHHHHHHHHH T ss_conf 852320226--------78207799999999999985472523----155-------30204688887678899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCC-----C Q ss_conf 89985987521111665798999999999976414753466664101478834678998886778889888556-----7 Q gi|254781100|r 191 VATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNI-----S 265 (366) Q Consensus 191 vg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~-----~ 265 (366) +|.+||+||+||+|||.||.+++++++++.+.|..||. |++..|.+++-|.+.|+.-|. + T Consensus 142 ~gAINAFNMvDGIDGLLGGLs~V~F~ALg~Lf~~dg~~---------------~La~~c~~~i~A~lPYl~LNLGiplGr 206 (349) T TIGR02380 142 IGAINAFNMVDGIDGLLGGLSCVTFAALGYLFYLDGQV---------------ELAYWCFALIVAILPYLLLNLGIPLGR 206 (349) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHH---------------HHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99986124534310344589999999999998616626---------------899999999999988999846663468 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCHHHHHHH Q ss_conf 410016461088799999999999624--------5899999999999999999999999860887666--781228999 Q gi|254781100|r 266 PAVIFMGDTGSLALGAFIGGIAVSTKH--------EILLIIIGGLFVVETLSVIMQVGYFKMTKKRVFL--MAPIHHHFE 335 (366) Q Consensus 266 PAkIFmGD~GS~~LG~~la~~ai~~~~--------~~~l~ii~~i~i~~~~sviiq~~~~k~~~kr~f~--~~plHH~l~ 335 (366) +=||||||+||+++||++-|+.+.+.| |++.+|+-++|.+|+.+.|+||. ++|++||| |.|+||.++ T Consensus 207 k~KVFMGDAGS~~IGFTViWLLl~~tQg~~~p~~~PVTALWlIA~PLmDM~aIm~RR~---rkG~SPFkPDR~HlHH~~~ 283 (349) T TIGR02380 207 KFKVFMGDAGSTLIGFTVIWLLLEATQGESSPSMRPVTALWLIALPLMDMAAIMLRRI---RKGDSPFKPDREHLHHILQ 283 (349) T ss_pred CCCEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCCCCHHHHHHHHH T ss_conf 8603115600157889999998720013665788740579998614777999999874---2777769777356879998 Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6798851589877899999999999862 Q gi|254781100|r 336 KKGWTENQIVIRFWIISFIFAVIGLLTL 363 (366) Q Consensus 336 ~~G~s~~~iv~~~~~i~~~~~~~~i~~~ 363 (366) +.|+|.||+.+.+...|++++.+|+.+. T Consensus 284 r~GLtSRq~L~vIc~~a~l~A~iGI~se 311 (349) T TIGR02380 284 RLGLTSRQSLVVICLAAALLAAIGILSE 311 (349) T ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7077277999999999999998768987 No 7 >cd06854 GT_WbpL_WbcO_like The members of this subfamily catalyze the formation of a phosphodiester bond between a membrane-associated undecaprenyl-phosphate (Und-P) molecule and N-acetylhexosamine 1-phosphate, which is usually donated by a soluble UDP-N-acetylhexosamine precursor. The WbcO/WbpL substrate specificity has not yet been determined, but the structure of their biosynthetic end products implies that UDP-N-acetyl-D-fucosamine (UDP-FucNAc) and/or UDPN-acetyl-D-quinosamine (UDP-QuiNAc) are used. The subgroup of bacterial UDP-HexNAc:polyprenol-P HexNAc-1-P transferases includes the WbcO protein from Yersinia enterocolitica and the WbpL protein from Pseudomonas aeruginosa. These transferases initiate LPS O-antigen biosynthesis. Similar to other GlcNAc/MurNAc-1-P transferase family members, WbpL is a highly hydrophobic protein possessing 11 predicted transmembrane segments. Probab=100.00 E-value=0 Score=384.30 Aligned_cols=243 Identities=27% Similarity=0.393 Sum_probs=202.9 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC---CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 538873110246578851257787779999998753---56789899988778999999987446787531356603688 Q gi|254781100|r 57 PVRVPNLPIHARKIDTPTMGGIIILMGLIGSSLLWA---DFSSLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKV 133 (366) Q Consensus 57 ~~r~~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~---~~~~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~ 133 (366) .|+|++||.| |++|||+||++++.+++++.+++. ..++.+.+..+.+...++++|++||+. ++||+. T Consensus 2 vD~P~~Rk~H--~~~tP~~GGi~i~~~~~i~~~~~~~~~~~~~~~~~~ll~~~~~~~~iGl~DD~~--------~l~~~~ 71 (253) T cd06854 2 LDIPNERSSH--TKPTPRGGGIAFVLAFLLALLLAAAAGPLNDLSYLLLLIGLLLLAAVGFIDDLR--------SLSPKI 71 (253) T ss_pred CCCCCCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHC--------CCCHHH T ss_conf 6799998889--999887519999999999999999835521289999999999999999999760--------687779 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999864027766556666533775121100000001222202357889985987521111665798999 Q gi|254781100|r 134 RILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMI 213 (366) Q Consensus 134 Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i 213 (366) |+..|++++.+.+........ .+.....+++.++++++|+++.+||+|++||+||||+|.+++ T Consensus 72 kl~~q~~~~~~~i~~~~~~~~-----------------~~~~~~~~~~~~~~ti~~iv~~iNa~N~iDGiDGLa~g~~~i 134 (253) T cd06854 72 RLLVQLLAAALALYALGPLTS-----------------LLLNFLPPWLIALLLLLAIVWIINLYNFMDGIDGLAGGEALV 134 (253) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----------------CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 999999999999999875111-----------------000121489999999999999999999971751178899999 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH- Q ss_conf 9999999764147534666641014788346789988867788898885567410016461088799999999999624- Q gi|254781100|r 214 ASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKH- 292 (366) Q Consensus 214 ~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~- 292 (366) ++.++..+++..++. +...++.+++|+++|||+||+||||+||||+||+++|+.+|++++...+ T Consensus 135 ~~~~l~~i~~~~~~~---------------~~~~~~~~l~~~ll~Fl~~N~~pakiFmGDsGS~~lG~~l~~~~i~~~~~ 199 (253) T cd06854 135 VFLALALLGYLAGEP---------------ALALLALALAGALLGFLPFNWPPAKIFMGDVGSTFLGFLLAALLLLLALS 199 (253) T ss_pred HHHHHHHHHHHCCCH---------------HHHHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHHHHHHCC T ss_conf 999999999845666---------------89999999999988899870775767625607899999999999998435 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCHHHHHHHHCCCCCCHH Q ss_conf ---5899999999999999999999999860887666--781228999679885158 Q gi|254781100|r 293 ---EILLIIIGGLFVVETLSVIMQVGYFKMTKKRVFL--MAPIHHHFEKKGWTENQI 344 (366) Q Consensus 293 ---~~~l~ii~~i~i~~~~sviiq~~~~k~~~kr~f~--~~plHH~l~~~G~s~~~i 344 (366) +...++...+|+.|++.++++|. .+||++|+ +.|+||+++++|+||||+ T Consensus 200 ~~~~~~~l~~~~~~~~D~~~v~~~R~---~~~~~p~~~D~~Hlhh~L~~~G~s~~~v 253 (253) T cd06854 200 GQSPWAWLLLLSPFLVDATVTLLRRL---LRGENIFQAHRKHLYQRLARAGKSHRKV 253 (253) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCCCCCCCHHHHHHHCCCCCCCC T ss_conf 52599999999999999999999999---8699999999867899998768997889 No 8 >cd06853 GT_WecA_like This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins. WecA is an UDP-N-acetylglucosamine (GlcNAc):undecaprenyl-phosphate (Und-P) GlcNAc-1-phosphate transferase that catalyzes the formation of a phosphodiester bond between a membrane-associated undecaprenyl-phosphate molecule and N-acetylglucosamine 1-phosphate, which is usually donated by a soluble UDP-N-acetylglucosamine precursor. WecA participates in the biosynthesis of O antigen LPS in many enteric bacteria and is also involved in the biosynthesis of enterobacterial common antigen. A conserved short sequence motif and a conserved arginine at a cytosolic loop of this integral membrane protein were shown to be critical in recognition of substrate UDP-N-acetylglucosamine. Probab=100.00 E-value=0 Score=352.83 Aligned_cols=237 Identities=32% Similarity=0.482 Sum_probs=192.0 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 11024657885125778777999999875356---789899988778999999987446787531356603688999999 Q gi|254781100|r 63 LPIHARKIDTPTMGGIIILMGLIGSSLLWADF---SSLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEF 139 (366) Q Consensus 63 ~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~---~~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~ 139 (366) ||.| |++|||+||++++.+++++..++... .+.+....+.+...++++|++||+. ++||+.|+..|+ T Consensus 1 Rk~H--~~~tP~~GGi~i~~~~~i~~~~~~~~~~~~~~~~~~~l~~~~~~~~iG~~DD~~--------~l~~~~Rl~~q~ 70 (249) T cd06853 1 RKVH--KGPIPRLGGLAIFLGFLLALLLALLFPFFLLPELLGLLAGATIIVLLGLLDDLF--------DLSPKVKLLGQI 70 (249) T ss_pred CCCC--CCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHC--------CCCHHHHHHHHH T ss_conf 9999--999898119999999999999999863010489999999999999999999830--------598999999999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999864027766556666533775121100000001222202357889985987521111665798999999999 Q gi|254781100|r 140 IIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFS 219 (366) Q Consensus 140 i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ 219 (366) +++.+.+........ ...+++ +.....++++++++++++++.+||+|++||+|||+++++.+++.++. T Consensus 71 ~~~~l~i~~~~~~~~-----------~~~~~~-~~~~~~~~~~~~~t~~~i~~~iNa~N~iDGidGLa~~~~~i~~~~~~ 138 (249) T cd06853 71 LAALIVVFGGGVILS-----------LLGPFG-GGIILLGWLSIPLTVLWIVGIINAINLIDGLDGLAGGVALIASLALA 138 (249) T ss_pred HHHHHHHHHHHHHHH-----------HCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 999999999668876-----------235401-46653789999999999999999999862753024699999999999 Q ss_pred HHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH------- Q ss_conf 9764147534666641014788346789988867788898885567410016461088799999999999624------- Q gi|254781100|r 220 FIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKH------- 292 (366) Q Consensus 220 ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~------- 292 (366) ++++..++.. ...++.+++|+++|||+||+||||+||||+||+++|+.+|+.++...+ T Consensus 139 ~~~~~~~~~~---------------~~~~~~~l~~~ll~Fl~fN~~~akiFlGD~GS~~lG~~i~~~~i~~~~~~~~~~~ 203 (249) T cd06853 139 ILALLNGQVL---------------VALLALALAGALLGFLPYNFHPARIFMGDAGSLFLGFLLAVLSILGTQKSSTAIS 203 (249) T ss_pred HHHHHCCCHH---------------HHHHHHHHHHHHHHHHHHHCCCCHHCCCCCHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 9999845588---------------9999999999999999870543202157732789999999999999723662688 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCHHHHHHHHCCC Q ss_conf 5899999999999999999999999860887666--7812289996798 Q gi|254781100|r 293 EILLIIIGGLFVVETLSVIMQVGYFKMTKKRVFL--MAPIHHHFEKKGW 339 (366) Q Consensus 293 ~~~l~ii~~i~i~~~~sviiq~~~~k~~~kr~f~--~~plHH~l~~~G~ 339 (366) +...+++..+|+.|++.++++|. .+||++|+ +.|+||+++++|. T Consensus 204 ~~~~~~~l~~p~~D~~~~~~~R~---~~~~~~~~~d~~H~hh~l~~~G~ 249 (249) T cd06853 204 PVVPLLILAVPLFDTLFVIIRRL---LRGRSPFQADRDHLHHRLLRLGL 249 (249) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCCCCCCCHHHHHHHCCC T ss_conf 99999999999999999999999---86999888998668999988769 No 9 >cd06855 GT_GPT_euk UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT) catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-glycosylation. GPT activity has been identified in all eukaryotic cells examined to date. A series of six conserved motifs designated A through F, ranging in length from 5 to 13 amino acid residues, has been identified in this family. They have been determined to be important for stable expression, substrate binding, or catalytic activities. Probab=100.00 E-value=0 Score=346.59 Aligned_cols=259 Identities=15% Similarity=0.249 Sum_probs=190.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH------HHCCCCCH---HHHHHHHHHHHHHH Q ss_conf 8999999986220698538873110246578851257787779999998------75356789---89998877899999 Q gi|254781100|r 41 PKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPTMGGIIILMGLIGSSL------LWADFSSL---HVSIILLLTLSFGL 111 (366) Q Consensus 41 ~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~~GGiai~~~~l~~~l------l~~~~~~~---~~~~~l~~~l~~~l 111 (366) |.+++.+.|.+ ..||..|+++++. +|++||+....+++++.. .+...++. +.+..+.+...+++ T Consensus 2 P~~~~~~~k~~-l~G~Di~K~~~~~------vpe~gGi~~~~~fli~~~~~i~~~~~~~~~~~~~~~~~~~Ll~~~~~~l 74 (283) T cd06855 2 PKFGPLFIKAG-LYGIDLNKNGEEK------IPESAGLVPGIVFLIVLFLFIPFPFLKDFPHDKLVEYLSALLSICCMTF 74 (283) T ss_pred CCHHHHHHHCC-CEEECCCCCCCCC------CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 60389999868-9531688899986------8754488999999999999999988513523779999999999999976 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHH Q ss_conf 99874467875313566036889999999999999986402776655666653377512110000000122220235788 Q gi|254781100|r 112 VGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIV 191 (366) Q Consensus 112 iG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~iv 191 (366) +|++||..| ++||.|+.+|.++++........ ..... ......+.+.+..+++|++++++++++++ T Consensus 75 lGf~DD~~~--------L~~r~Kl~l~~~aalpl~~~y~~--~~~~~----~~~~~~~~~~~~~i~lg~ly~~~~~~~~v 140 (283) T cd06855 75 LGFADDVLD--------LRWRHKLILPTFASLPLLMVYYG--NTGIT----LPIVPLRPLLGTLIDLGILYYVYMILLAV 140 (283) T ss_pred HEEEECCCC--------CCHHHHHHHHHHHHHHHHHHHCC--CCCEE----EECCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 414502012--------56898989999999998885103--65036----50477765556620037999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99859875211116657989999999999764147534666641014788346789988867788898885567410016 Q gi|254781100|r 192 ATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFM 271 (366) Q Consensus 192 g~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFm 271 (366) +.+||+|++||+||||+|++++++.++.+.++...+.... .....+..+.+.++.+++|+++||||||+||||||| T Consensus 141 ~~tNaiNl~dGldGLa~G~s~i~~~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~g~~lgFL~~N~yPAkIFm 216 (283) T cd06855 141 FCTNSINIYAGINGLEVGQSLVIALSILLYNLLELNGSSG----SMTLDAHLFSLYLLLPFIAVSLALLYYNWYPSKVFV 216 (283) T ss_pred HHCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEC T ss_conf 8242102340663077789999999999999998601221----233331499999999999999999872378463744 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH Q ss_conf 461088799999999999624589999999999999999999999986088766678122899 Q gi|254781100|r 272 GDTGSLALGAFIGGIAVSTKHEILLIIIGGLFVVETLSVIMQVGYFKMTKKRVFLMAPIHHHF 334 (366) Q Consensus 272 GD~GS~~LG~~la~~ai~~~~~~~l~ii~~i~i~~~~sviiq~~~~k~~~kr~f~~~plHH~l 334 (366) ||+||+++|+++|+++|+.+.|...++ +++.|+..|+.++||.||+.|.|||= T Consensus 217 GDtgs~~lG~~la~v~IlG~~~~~~~~----------~~ipqiinF~~s~~~lF~~~p~~~Hr 269 (283) T cd06855 217 GDTFTYFAGMVFAVVGILGHFSKTLLL----------FFIPQIINFLYSLPQLFGIVPCPRHR 269 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHEECCCEEEEEECCCCCC T ss_conf 552778889999999998318999999----------99999865012256178640166667 No 10 >cd06851 GT_GPT_like This family includes eukaryotic UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT) and archaeal GPT-like glycosyltransferases. Eukaryotic GPT catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-linked glycosylation. GPT activity has been identified in all eukaryotic cells examined to date. Evidence for the existence of the N-glycosylation pathway in archaea has emerged and genes responsible for the pathway have been identified. A glycosyl transferase gene Mv1751 in M. voltae encodes for the enzyme that carries out the first step in the pathway, the attachment of GlcNAc to a dolichol lipid carrier in the membrane. A lethal mutation in the alg7 (GPT) gene in Saccharomyces cerevisiae was successfully complemented with Mv1751, the archaeal gene, indicating eukaryotic and archaeal enzymes may use the Probab=100.00 E-value=0 Score=327.23 Aligned_cols=210 Identities=25% Similarity=0.315 Sum_probs=170.9 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC--------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 73110246578851257787779999998753567--------8989998877899999998744678753135660368 Q gi|254781100|r 61 PNLPIHARKIDTPTMGGIIILMGLIGSSLLWADFS--------SLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGK 132 (366) Q Consensus 61 ~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~~--------~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~ 132 (366) |.+|.| |+++||+||++++.+++++.+.+...+ ..+.+..+.+...++.+|++||++ ++++| T Consensus 4 D~~K~~--~~~IP~~GGiai~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~~~~~~iG~~DD~~--------~l~~~ 73 (223) T cd06851 4 DMNKKG--NVMIPEPGGISILIGFVASEITLIFFPFLSFPHFPISEILAALITSVLGFSVGIIDDRL--------TMGGW 73 (223) T ss_pred CCCCCC--CCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHEEEEECCCC--------CCCHH T ss_conf 878899--99986816999999999999999982101122002999999999999998057631267--------88799 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999999986402776655666653377512110000000122220235788998598752111166579899 Q gi|254781100|r 133 VRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVM 212 (366) Q Consensus 133 ~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~ 212 (366) .|+..|..++........... ....|+++ ...+.++++++++.+|+++++||+|++||+||||+|+++ T Consensus 74 ~kl~~~~~~a~~~~~~~~~~~-----------~~~~~~~~-~~~~~~~~~~~~~~~~ivg~~Na~NliDGlDGLa~g~~~ 141 (223) T cd06851 74 FKPVALAFAAAPILLLGAYDS-----------NLDFPLFG-GSVKIPSLYLVLVVFMIVITGNAFNSIAGLNGVEAGFTT 141 (223) T ss_pred HHHHHHHHHHHHHHHHHHHCC-----------CCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 999999999999999998536-----------61167654-311003899999999999997788841365017789999 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999999764147534666641014788346789988867788898885567410016461088799999999999624 Q gi|254781100|r 213 IASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKH 292 (366) Q Consensus 213 i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~ 292 (366) +++.+++++++..++. +.+.+|.+++|||+|||+||+|||||||||+||+++|+.+|+++++++. T Consensus 142 i~~~~l~~~~~~~~~~---------------~~~~~~~~l~ga~lgFL~~N~~PAkIFmGD~GS~~lG~~la~lailg~~ 206 (223) T cd06851 142 IISFALAISLLVQQNY---------------EIGIACLCLAFASLAFLYYNKYPSRIFPGDTGAYMFGATYAVVAILGEV 206 (223) T ss_pred HHHHHHHHHHHHCCCH---------------HHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999995779---------------9999999999999999985268053770657779999999999999848 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 589999999999999 Q gi|254781100|r 293 EILLIIIGGLFVVET 307 (366) Q Consensus 293 ~~~l~ii~~i~i~~~ 307 (366) |...++...+.++|- T Consensus 207 ~~~~~~l~iP~ii~f 221 (223) T cd06851 207 EKIAAVAFIPAIINF 221 (223) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999986 No 11 >cd06499 GT_MraY-like Glycosyltransferase 4 (GT4) includes both eukaryotic and prokaryotic UDP-D-N-acetylhexosamine:polyprenol phosphate D-N-acetylhexosamine-1-phosphate transferases. They catalyze the transfer of a D-N-acetylhexosamine 1-phosphate to a membrane-bound polyprenol phosphate, which is the initiation step of protein N-glycosylation in eukaryotes and peptidoglycan biosynthesis in bacteria. One member, D-N-acetylhexosamine 1-phosphate transferase (GPT) is a eukaryotic enzyme, which is specific for UDP-GlcNAc as donor substrate and dolichol-phosphate as the membrane bound acceptor. The bacterial members MraY, WecA, and WbpL/WbcO utilize undecaprenol phosphate as the acceptor substrate, but use different UDP-sugar donor substrates. MraY-type transferases are highly specific for UDP-N-acetylmuramate-pentapeptide, whereas WecA proteins are selective for UDP-N-acetylglucosamine (UDP-GlcNAc). The WbcO/WbpL substrate specificity has not yet been determined, but the structure of thei Probab=100.00 E-value=0 Score=304.64 Aligned_cols=185 Identities=41% Similarity=0.662 Sum_probs=159.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88512577877799999987535678989998877899999998744678753135660368899999999999999864 Q gi|254781100|r 71 DTPTMGGIIILMGLIGSSLLWADFSSLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALL 150 (366) Q Consensus 71 ~tP~~GGiai~~~~l~~~ll~~~~~~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~ 150 (366) ||||+||++++.+++++.+++...++.+.+..+.+.+.++++|++||+.|++ .++|+|.|+..|++++........ T Consensus 1 PtP~~GGiai~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~l~G~iDD~~~l~----~~~~~~~kl~~q~~~~~~~~~~~~ 76 (185) T cd06499 1 PTPTMGGLAILLGFLLGVLLYIPHSNTLILLALLSGLVAGIVGFIDDLLGLK----VELSEREKLLLQILAALFLLLIGG 76 (185) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCC----CCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9698569999999999999998745569999999999999999997773256----577789999999999999999999 Q ss_pred HCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 02776655666653377512110000000122220235788998598752111166579899999999997641475346 Q gi|254781100|r 151 FYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILL 230 (366) Q Consensus 151 ~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~ 230 (366) ..... ..|+ +...++++++++++++++++.+||+|++||+||||+|++++++.+++.+++..++.. T Consensus 77 ~~~~~-----------~~~~--~~~~~~~~~~~~~tv~~iv~~~Na~N~iDGiDGLa~g~~~i~~~~l~~~~~~~~~~~- 142 (185) T cd06499 77 GHTTV-----------TTPL--GFVLDLGIFYIPFAIIAIVGATNAVNLIDGMDGLAAGISVIASIACALFALLSGQTT- 142 (185) T ss_pred HHHCC-----------CCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCHH- T ss_conf 83243-----------5663--311236899999999999999999897506651768999999999999999907688- Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 666410147883467899888677888988855674100164610887999999999 Q gi|254781100|r 231 SHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIA 287 (366) Q Consensus 231 ~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~a 287 (366) ...+|.+++|+++|||+||+||||+||||+||+++|+.+|++| T Consensus 143 --------------~~~~~~~l~g~~lgFL~~N~~pakIFmGD~GS~~lG~~la~~A 185 (185) T cd06499 143 --------------SALLFIILAGACLGFLYFNFYPAKIFMGDTGSYFLGAAYAAVA 185 (185) T ss_pred --------------HHHHHHHHHHHHHHHHHHHCCCCCEECCCHHHHHHHHHHHHHC T ss_conf --------------9999999999999999986684216316575799999999939 No 12 >cd06912 GT_MraY_like This subfamily is composed of uncharacterized bacterial glycosyltransferases in the MraY-like family. This family contains both eukaryotic and prokaryotic UDP-D-N-acetylhexosamine:polyprenol phosphate D-N-acetylhexosamine-1-phosphate transferases, which catalyze the transfer of a D-N-acetylhexosamine 1-phosphate to a membrane-bound polyprenol phosphate. This is the initiation step of protein N-glycosylation in eukaryotes and peptidoglycan biosynthesis in bacteria. The three bacterial members MraY, WecA, and WbpL/WbcO, utilize undecaprenol phosphate as the acceptor substrate, but use different UDP-sugar donor substrates. MraY-type transferases are highly specific for UDP-N-acetylmuramate-pentapeptide, whereas WecA proteins are selective for UDP-N-acetylglucosamine (UDP-GlcNAc). The WbcO/WbpL substrate specificity has not yet been determined, but the structure of their biosynthetic end products implies that UDP-N-acetyl-D-fucosamine (UDP-FucNAc) and/or UDPN-acetyl-D- Probab=100.00 E-value=0 Score=302.11 Aligned_cols=193 Identities=33% Similarity=0.529 Sum_probs=158.0 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 87311024657885125778777999999875356789899988778999999987446787531356603688999999 Q gi|254781100|r 60 VPNLPIHARKIDTPTMGGIIILMGLIGSSLLWADFSSLHVSIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEF 139 (366) Q Consensus 60 ~~~~~~H~~K~~tP~~GGiai~~~~l~~~ll~~~~~~~~~~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~ 139 (366) |++||.| |+||||+||++++.+++++.+.....++...+..+...+.++++|++||++| .++++.|+..|+ T Consensus 1 dg~Rk~H--~~~tP~~GGi~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~iG~~DD~~~-------~~~~~~rl~~q~ 71 (193) T cd06912 1 DGIQKFH--TRPTPRIGGVAIFLGLLAGLLLLSLLSGSLLLLLLLAALPAFLAGLLEDITK-------RVSPRIRLLATF 71 (193) T ss_pred CCCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-------CCHHHHHHHHHH T ss_conf 9998775--9699873499999999999999998411799999999999999998997235-------430789999999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999864027766556666533775121100000001222202357889985987521111665798999999999 Q gi|254781100|r 140 IIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFS 219 (366) Q Consensus 140 i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ 219 (366) .++.+.......... ....++. +...+.++++++++++|+++.+||+|++||+|||++|++++++.++. T Consensus 72 ~~~~~~~~~~~~~~~----------~~~~~~~-~~~~~~~~~~~~~ti~~i~~~iNa~N~~DGiDGLa~g~~~i~~~~l~ 140 (193) T cd06912 72 LSALLAVWLLGASIT----------RLDLPGL-DLLLSFPPFAIIFTIFAVAGVANAFNIIDGFNGLASGVAIISLLSLA 140 (193) T ss_pred HHHHHHHHHHHHHHH----------HCCCCCC-CHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 999999999977765----------2556552-06864118999998999999987878751775336899999999999 Q ss_pred HHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 97641475346666410147883467899888677888988855674100164610887999999999 Q gi|254781100|r 220 FIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIA 287 (366) Q Consensus 220 ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~a 287 (366) .+++..++. +...++.+++|+++|||+||+||||+||||+||+++|+.+|++| T Consensus 141 ~~~~~~~~~---------------~~~~~~~~l~~~~lgFL~~N~~pakiFmGDsGS~~lG~~la~la 193 (193) T cd06912 141 LVAFQVGDT---------------DLAFLALLLAGALLGFLIFNFPFGKIFLGDGGAYLLGFLLAWLA 193 (193) T ss_pred HHHHHCCCH---------------HHHHHHHHHHHHHHHHHHHHCCCCCEECCCHHHHHHHHHHHHHC T ss_conf 999972758---------------99999999999999999987793306256287899999999979 No 13 >pfam00953 Glycos_transf_4 Glycosyl transferase family 4. Probab=100.00 E-value=1.7e-38 Score=230.96 Aligned_cols=156 Identities=42% Similarity=0.571 Sum_probs=128.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCC Q ss_conf 99887789999999874467875313566036889999999999999986402776655666653377512110000000 Q gi|254781100|r 100 SIILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFPFFRDIILDMG 179 (366) Q Consensus 100 ~~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iPf~~~~~l~lg 179 (366) +..+.++..++++|++||++ ++||+.|+..|++++...........+ ...|+.+ .....+ T Consensus 2 ~~il~~~~~~~~iG~~DD~~--------~l~~~~rl~~~~i~~~~~i~~~~~~~~-----------~~~~~~~-~~~~~~ 61 (158) T pfam00953 2 LGLLLGALLIGLIGLLDDLK--------GLSPKIRLLLQALAALLLILAGGGLSS-----------LGLLFGG-GLELGP 61 (158) T ss_pred HHHHHHHHHHHHHHHHHHCC--------CCCHHHHHHHHHHHHHHHHHHHHHHHH-----------CCCCCHH-HHHHHH T ss_conf 99999999999999998313--------698899999999999999999989850-----------6642013-777678 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 12222023578899859875211116657989999999999764147534666641014788346789988867788898 Q gi|254781100|r 180 LFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGF 259 (366) Q Consensus 180 ~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgF 259 (366) +++++++++++++.+||+|++||+|||+++++.+++.++..+++..++.. ...++.+++++++|| T Consensus 62 ~~~~~~~i~~i~~~~Na~N~iDGidGL~~~~~i~~~~~l~~~~~~~~~~~---------------~~~~~~~~~~~~l~F 126 (158) T pfam00953 62 VLAILLTVFAIVGIINAVNFMDGLDGLAGGVALIAALALAILAYVNGDIE---------------LAILSLALLGALLGF 126 (158) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHH---------------HHHHHHHHHHHHHHH T ss_conf 99999999999999999898606774578999999999999999668678---------------999999999999999 Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 8855674100164610887999999999996 Q gi|254781100|r 260 LWFNISPAVIFMGDTGSLALGAFIGGIAVST 290 (366) Q Consensus 260 L~fN~~PAkIFmGD~GS~~LG~~la~~ai~~ 290 (366) ++||+||||+||||+||+++|+.+|+.++.. T Consensus 127 l~~N~~~akiFlGD~GS~~lG~~ia~~~I~~ 157 (158) T pfam00953 127 LLFNFPPAKIFMGDSGSLFLGFVLAVLAILL 157 (158) T ss_pred HHHHCCCCCEEECCHHHHHHHHHHHHHHHHH T ss_conf 9986573407525666799999999999994 No 14 >KOG2788 consensus Probab=99.78 E-value=9.3e-20 Score=127.57 Aligned_cols=191 Identities=19% Similarity=0.307 Sum_probs=126.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEE-----CCCCCC Q ss_conf 988778999999987446787531356603688999999999999998640277665566665337751-----211000 Q gi|254781100|r 101 IILLLTLSFGLVGFYDDYIKITTGDAGGLSGKVRILIEFIIAILAVCALLFYSNSTLLGIETKTSIVFP-----FFRDII 175 (366) Q Consensus 101 ~~l~~~l~~~liG~iDD~~ki~~~~~~GLs~~~Kl~~Q~i~a~i~~~~l~~~~~~~~~~~~~~~~~~iP-----f~~~~~ 175 (366) ..+.+...+.++|+.||..+ ++||.|+....++++-.. +.++.+.+ .+...+| +.+... T Consensus 103 sallsi~~~ilLGfaDD~ld--------lrWRhKl~lPt~AtlPLL--mVYyv~~g------~TtV~iP~~~r~~l~~~~ 166 (418) T KOG2788 103 SALLSICSMILLGFADDVLD--------LRWRHKLLLPTIATLPLL--MVYYVSFG------VTTVVIPKPLRNILGLTS 166 (418) T ss_pred HHHHHHHHHHHHCCCCHHCC--------CHHHHHEECCCHHHCCEE--EEEEECCC------CEEEEECHHHHHHHCCCE T ss_conf 99987768887504510022--------256542005313217458--99998378------628993678886511005 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHH---H-HHHHHH Q ss_conf 000012222023578899859875211116657989999999999764147534666641014788346---7-899888 Q gi|254781100|r 176 LDMGLFFIPFAAFVIVATANAVNLTDGLDGLAIVPVMIASVAFSFIAYVAGNILLSHYLQVNFVPGVGE---L-VVVISA 251 (366) Q Consensus 176 l~lg~l~i~~~~~~ivg~~NavNl~DGlDGLa~g~~~i~~~~l~ii~~~~g~~~~~~~~~~~~~~~~~~---l-~i~~~~ 251 (366) +|+|.+|++......+..+|++|...|+|||..|++.+.+.....-....-. ....-...++ . ..+... T Consensus 167 ~nlg~lYy~yM~~lavFCtNsINIlAGiNGlEvGQslvia~~i~l~~l~~~~-------~~~~~~~~~~~h~fSlyl~~p 239 (418) T KOG2788 167 LNLGFLYYVYMGLLAVFCTNSINILAGINGLEVGQSLVIAASILLNNLVYLW-------RSVDPEATRFAHAFSLYLTIP 239 (418) T ss_pred ECCCCHHHHHHCEEEEECCCCHHHHCCCCHHEECCCHHHHHHHHHHHHHHHH-------CCCCCCCCCHHHHHHHHHHHH T ss_conf 4330548864110256605730233055400203003344999853388873-------126830000256677887768 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 677888988855674100164610887999999999996245899999999999999999999 Q gi|254781100|r 252 LIGAGIGFLWFNISPAVIFMGDTGSLALGAFIGGIAVSTKHEILLIIIGGLFVVETLSVIMQV 314 (366) Q Consensus 252 l~Ga~lgFL~fN~~PAkIFmGD~GS~~LG~~la~~ai~~~~~~~l~ii~~i~i~~~~sviiq~ 314 (366) +.+..+|-+.|||+|+|+|.||+=-|+-|.++|+++|+++-.-.+++-...-++.-+-.+=|- T Consensus 240 ~~~~sl~ll~~n~yPsrVFVGDTfcyfaGMt~AVVgILGHFSKTllLFFiPQI~NFlyS~PQL 302 (418) T KOG2788 240 LLATSLALLKYNWYPSRVFVGDTFCYFAGMTFAVVGILGHFSKTLLLFFIPQILNFLYSLPQL 302 (418) T ss_pred HHHHHHHHHHHCCCCCEEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 888778887745687338864602230563664567754427999999999999999741887 No 15 >pfam10555 MraY_sig1 Phospho-N-acetylmuramoyl-pentapeptide-transferase signature 1. Phospho-N-acetylmuramoyl-pentapeptide-transferase (EC 2.7.8.13) (mraY) is a bacterial enzyme responsible for the formation of the first lipid intermediate of the cell wall peptidoglycan synthesis. It catalyses the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. It is an integral membrane protein with probably ten transmembrane domains. This domain is located at the end of the first cytoplasmic loop and the beginning of the second transmembrane domain. Probab=93.00 E-value=0.015 Score=32.66 Aligned_cols=15 Identities=53% Similarity=0.977 Sum_probs=12.2 Q ss_pred CCCCCCHHHHHHHHH Q ss_conf 578851257787779 Q gi|254781100|r 69 KIDTPTMGGIIILMG 83 (366) Q Consensus 69 K~~tP~~GGiai~~~ 83 (366) |++||||||+.++.. T Consensus 1 K~gTPTMGGv~iI~~ 15 (26) T pfam10555 1 KKGTPTMGGVLIIXX 15 (26) T ss_pred CCCCCCCCEEEEEEE T ss_conf 988864263999984 No 16 >COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only] Probab=86.59 E-value=2 Score=20.85 Aligned_cols=87 Identities=29% Similarity=0.402 Sum_probs=54.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC--HHHHHHHHHHHHHHHHHC Q ss_conf 766999999999999999999999988999999986220698538873110246578851--257787779999998753 Q gi|254781100|r 15 THVLKYVTFRSSAAFFSAMLIVFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPT--MGGIIILMGLIGSSLLWA 92 (366) Q Consensus 15 ~~~f~~~~fr~~~a~l~sf~i~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~--~GGiai~~~~l~~~ll~~ 92 (366) .+..+--..++.+..+..|++++++....-+++|++... ..|.+.++ .|.+..+++.+++.+... T Consensus 34 ~~~~~n~~v~~ligai~~~li~~~~~~~~~~~~~~le~~-------------i~k~~~~~ilf~tiGLiiGLlia~l~~~ 100 (356) T COG4956 34 TAFLNNEYVDALIGAIIFFLISFWFGKYVLNWLKRLEEQ-------------IRKLPVTTILFGTIGLIIGLLIAVLLSS 100 (356) T ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 112131879999999999999999999999999999999-------------9836899999987889999999999853 Q ss_pred CCC---CHHH--HHHHHHHHHHHHHHH Q ss_conf 567---8989--998877899999998 Q gi|254781100|r 93 DFS---SLHV--SIILLLTLSFGLVGF 114 (366) Q Consensus 93 ~~~---~~~~--~~~l~~~l~~~liG~ 114 (366) ++. .+.. ....+.++.++..|+ T Consensus 101 pL~~~~ip~~~~ii~vi~t~il~y~G~ 127 (356) T COG4956 101 PLFLLPIPFISTIIPVILTIILAYFGF 127 (356) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 776279607776899999999999736 No 17 >COG4589 Predicted CDP-diglyceride synthetase/phosphatidate cytidylyltransferase [General function prediction only] Probab=48.74 E-value=18 Score=15.66 Aligned_cols=13 Identities=31% Similarity=0.457 Sum_probs=6.1 Q ss_pred CCCCCCCCCCCCC Q ss_conf 5674100164610 Q gi|254781100|r 263 NISPAVIFMGDTG 275 (366) Q Consensus 263 N~~PAkIFmGD~G 275 (366) |.+|.|.--|=.| T Consensus 205 ~vSPnKTveGl~G 217 (303) T COG4589 205 KVSPNKTVEGLIG 217 (303) T ss_pred CCCCCCHHHHHHH T ss_conf 7698623788862 No 18 >TIGR02838 spore_V_AC stage V sporulation protein AC; InterPro: IPR014203 This entry describes stage V sporulation protein AC; a paralog of stage V sporulation protein AE. Both are proteins found to be present in a species, if and only if, that species is one of the Firmicutes capable of endospore formation, as of the time of the publication of the genome of Carboxydothermus hydrogenoformans. Mutants in spoVAC have a stage V sporulation defect.. Probab=41.99 E-value=23 Score=15.06 Aligned_cols=46 Identities=30% Similarity=0.567 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHH--HCCCC-----CHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2577877799999987--53567-----8989998877899999998744678 Q gi|254781100|r 75 MGGIIILMGLIGSSLL--WADFS-----SLHVSIILLLTLSFGLVGFYDDYIK 120 (366) Q Consensus 75 ~GGiai~~~~l~~~ll--~~~~~-----~~~~~~~l~~~l~~~liG~iDD~~k 120 (366) +||+.-.++=++..+. +.+++ ++....+++.+.++.-+|..||+-+ T Consensus 26 ~GGLIC~iGq~~~~fy~~~f~~se~~a~~~t~~tlIfl~alLTG~G~YDkia~ 78 (141) T TIGR02838 26 VGGLICLIGQLISDFYLRYFQFSEKTAGSPTVATLIFLSALLTGLGVYDKIAQ 78 (141) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 77599989999999998553534765037289999999999715654766775 No 19 >KOG2844 consensus Probab=41.44 E-value=13 Score=16.40 Aligned_cols=16 Identities=31% Similarity=0.600 Sum_probs=13.7 Q ss_pred CCCCCCHHHHHHHHCC Q ss_conf 7666781228999679 Q gi|254781100|r 323 RVFLMAPIHHHFEKKG 338 (366) Q Consensus 323 r~f~~~plHH~l~~~G 338 (366) |.++|+|+||+++.+| T Consensus 450 RnlR~sply~~L~~aG 465 (856) T KOG2844 450 RNLRMSPLYDRLESAG 465 (856) T ss_pred CCCCCCCCHHHHHHCC T ss_conf 0123470289898526 No 20 >PRK09699 D-allose transporter subunit; Provisional Probab=38.53 E-value=26 Score=14.75 Aligned_cols=20 Identities=25% Similarity=0.273 Sum_probs=7.2 Q ss_pred CCCCCCHHHHHHHHHHHHHH Q ss_conf 79885158987789999999 Q gi|254781100|r 337 KGWTENQIVIRFWIISFIFA 356 (366) Q Consensus 337 ~G~s~~~iv~~~~~i~~~~~ 356 (366) .|.+.+++.+.-|.++..++ T Consensus 215 sGI~v~r~~i~~y~lsG~~a 234 (325) T PRK09699 215 SGIDVKFHILVVFIISGVCA 234 (325) T ss_pred HCCCCHHEEEEHHHHHHHHH T ss_conf 48871340443389878999 No 21 >pfam06830 Root_cap Root cap. The cells at the periphery of the root cap are continuously sloughed off from the root into the mucilage, and are thought to be programmed to die.This family represents a conserved region approximately 60 residues in length within plant root cap proteins, which may be involved in the process. Probab=35.45 E-value=16 Score=15.89 Aligned_cols=29 Identities=24% Similarity=0.205 Sum_probs=18.5 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 20698538873110246578851257787 Q gi|254781100|r 52 EFRGQPVRVPNLPIHARKIDTPTMGGIII 80 (366) Q Consensus 52 ~~~g~~~r~~~~~~H~~K~~tP~~GGiai 80 (366) .-.||.-|||-...-+....-|.|||--= T Consensus 13 GVLGqTYr~dyvn~~k~g~~MPvmGG~~~ 41 (57) T pfam06830 13 GVLGQTYRPDYVNPVKVGVAMPVMGGEDK 41 (57) T ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 22144106674254336762330148764 No 22 >pfam11821 DUF3341 Protein of unknown function (DUF3341). This family of proteins are functionally uncharacterized. This family is found in bacteria. Proteins in this family are about 170 amino acids in length. Probab=34.02 E-value=30 Score=14.33 Aligned_cols=61 Identities=31% Similarity=0.339 Sum_probs=30.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 11116657989999999999764-147534666641014788346789988867788898885567 Q gi|254781100|r 201 DGLDGLAIVPVMIASVAFSFIAY-VAGNILLSHYLQVNFVPGVGELVVVISALIGAGIGFLWFNIS 265 (366) Q Consensus 201 DGlDGLa~g~~~i~~~~l~ii~~-~~g~~~~~~~~~~~~~~~~~~l~i~~~~l~Ga~lgFL~fN~~ 265 (366) -|+=|.++|..+.......-.-. +.|....+ ..+++|..-|+.++++++ +.++|+++.|.= T Consensus 59 gGl~G~~~g~~l~~~~~~~dyPl~iGGKP~~s---~pafipI~FEltiL~aa~-~~v~g~l~~~~L 120 (173) T pfam11821 59 GGLTGAATAFLLQYYTSAIDWPLNIGGKPLFS---WPAFIPITFELTVLFAAL-GMVIGFLALSGL 120 (173) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEECCCCCCCC---CCCCHHHHHHHHHHHHHH-HHHHHHHHHCCC T ss_conf 88999999999999999852650148977776---766227999999999999-999999998499 No 23 >KOG4112 consensus Probab=31.50 E-value=34 Score=14.09 Aligned_cols=30 Identities=10% Similarity=-0.048 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999988999999986220698538 Q gi|254781100|r 30 FSAMLIVFAFGPKVIDCLRSLQEFRGQPVR 59 (366) Q Consensus 30 l~sf~i~~~~~~~~I~~l~~~~~~~g~~~r 59 (366) ..+|.++.+++-.--|+.||...+..++.. T Consensus 57 g~~~v~t~li~LPpwP~y~rn~LkW~~Pa~ 86 (101) T KOG4112 57 GAGFVFTLLITLPPWPWYRRNPLKWAQPAI 86 (101) T ss_pred HHHHHHHHHHCCCCCHHHHCCCCCCCCCCC T ss_conf 888998887507895656238643468765 No 24 >COG3715 ManY Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [Carbohydrate transport and metabolism] Probab=30.77 E-value=35 Score=14.02 Aligned_cols=20 Identities=30% Similarity=0.546 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 11166579899999999997 Q gi|254781100|r 202 GLDGLAIVPVMIASVAFSFI 221 (366) Q Consensus 202 GlDGLa~g~~~i~~~~l~ii 221 (366) -.|||..+-.++...+++.+ T Consensus 177 l~~GL~vaGgmlpavGfAml 196 (265) T COG3715 177 LTNGLAVAGGMLPAVGFAML 196 (265) T ss_pred HHHHHHHHCCHHHHHHHHHH T ss_conf 86256674051689899999 No 25 >PRK03839 putative kinase; Provisional Probab=23.01 E-value=42 Score=13.58 Aligned_cols=16 Identities=19% Similarity=0.578 Sum_probs=7.4 Q ss_pred HHHHHHHCCCCCCHHH Q ss_conf 2289996798851589 Q gi|254781100|r 330 IHHHFEKKGWTENQIV 345 (366) Q Consensus 330 lHH~l~~~G~s~~~iv 345 (366) +-.||+.+|||++|+. T Consensus 93 L~~RL~~RgYs~~KI~ 108 (180) T PRK03839 93 IAERLKERGYSKKKIG 108 (180) T ss_pred HHHHHHHCCCCHHHHH T ss_conf 9999997699989999 No 26 >TIGR01920 Shik_kin_archae shikimate kinase; InterPro: IPR010189 Shikimate kinase (2.7.1.71 from EC) catalyses the fifth step in the shikimate pathway of aromatic amino acids biosynthesis. It converts shikimate to shikimate 3-phosphate (3-phosphoshikimate). This part of the pathway leads to the biosynthesis of chorismate, the precursor of aromatic amino acids, folates, ubiquinones, and other aromatic compounds. The shikimate pathway links metabolism of carbohydrates to biosynthesis of the aromatic amino acids phenylalanine, tyrosine, tryptophan and their derivatives in microorganisms and plants . In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The shikimate pathway is present in bacteria, archaea, fungi and plants. The absence of the shikimate pathway in animals makes it an attractive target for non-toxic herbicides, antimicrobial and antifungal agents. Shikimate kinase enzymes present an example of convergent evolution via enzyme recruitment from an unrelated group. This group represents shikimate kinases of the archaeal type. Members have no sequence similarity with the typical form of shikimate kinase (PIRSF000702 from PIRSF, IPR000623 from INTERPRO) found in bacteria and eukaryotes, but are instead distantly related to homoserine kinases, which belong to the GHMP kinase domain superfamily (GHMP = galactose, homoserine, mevalonate, and phosphomevalonate) . All known non-archaeal shikimate kinases (the typical form) belong to the non-homologous, structurally unrelated nucleoside monophosphate (NMP) kinase domain superfamily . Nomenclature note: the name AroK is used for non-homologous shikimate kinases of both the archaeal and bacterial types. ; GO: 0004765 shikimate kinase activity, 0009073 aromatic amino acid family biosynthetic process, 0005737 cytoplasm. Probab=22.33 E-value=50 Score=13.14 Aligned_cols=52 Identities=19% Similarity=0.245 Sum_probs=31.6 Q ss_pred CCCCCCCCHHHH----HHHHHHHHHHHHHCC--CCCHHHHHH-HHHHH----HHHHHHHHHHH Q ss_conf 465788512577----877799999987535--678989998-87789----99999987446 Q gi|254781100|r 67 ARKIDTPTMGGI----IILMGLIGSSLLWAD--FSSLHVSII-LLLTL----SFGLVGFYDDY 118 (366) Q Consensus 67 ~~K~~tP~~GGi----ai~~~~l~~~ll~~~--~~~~~~~~~-l~~~l----~~~liG~iDD~ 118 (366) .-|+..|...|+ |...+++.+++-... ...+..... +..-+ .+...|.+||- T Consensus 75 ~~~seiP~g~GLKSSSA~~nAlv~A~~~~~gva~~~~~~~~~~l~a~~S~~AG~s~TGAfDDa 137 (273) T TIGR01920 75 EVESEIPLGSGLKSSSALVNALVLAVLKAKGVAEEIDDIDILRLGAELSKEAGVSVTGAFDDA 137 (273) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEEEEHHHHH T ss_conf 996176788664378999999999999861234217733689999998886685153058899 No 27 >PRK11492 hyfE hydrogenase 4 membrane subunit; Provisional Probab=21.94 E-value=51 Score=13.09 Aligned_cols=54 Identities=9% Similarity=0.029 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999999998899999998622069853887311024657885125778777999999 Q gi|254781100|r 29 FFSAMLIVFAFGPKVIDCLRSLQEFRGQPVRVPNLPIHARKIDTPTMGGIIILMGLIGSS 88 (366) Q Consensus 29 ~l~sf~i~~~~~~~~I~~l~~~~~~~g~~~r~~~~~~H~~K~~tP~~GGiai~~~~l~~~ 88 (366) ..+++++-.++.|.+..+.-|+ -...||.+|-... ..++-.|+.....++.++. T Consensus 58 a~~al~~K~iliP~lL~r~~~k----l~~~re~ep~i~~--~~s~l~~~~lv~ls~~~~~ 111 (213) T PRK11492 58 SISAFITKVLLVPLIMTYAARK----LPVNIEEKAVFGP--AVSALLAALIVLLCAFVVQ 111 (213) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----CCCCCCCCCCCCH--HHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999999998----3985112473681--7999999999999999985 Done!