Query gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 394 No_of_seqs 155 out of 190 Neff 5.5 Searched_HMMs 39220 Date Mon May 23 13:03:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780201.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam05045 RgpF Rhamnan synthes 100.0 0 0 519.5 24.9 228 147-378 263-498 (498) 2 COG3754 RgpF Lipopolysaccharid 100.0 0 0 377.9 15.5 238 148-388 292-534 (595) 3 COG3754 RgpF Lipopolysaccharid 99.3 3.4E-13 8.6E-18 100.7 0.0 289 11-363 301-595 (595) 4 cd06421 CESA_CelA_like CESA_Ce 97.1 0.022 5.5E-07 34.6 15.0 194 149-384 1-204 (234) 5 cd06435 CESA_NdvC_like NdvC_li 96.1 0.11 2.8E-06 30.2 13.5 188 154-383 3-201 (236) 6 cd06434 GT2_HAS Hyaluronan syn 95.8 0.16 4.1E-06 29.2 15.0 195 151-383 2-205 (235) 7 TIGR03030 CelA cellulose synth 94.2 0.45 1.1E-05 26.5 12.7 171 145-342 127-323 (713) 8 cd04191 Glucan_BSP_ModH Glucan 93.9 0.52 1.3E-05 26.1 16.6 198 152-385 2-225 (254) 9 cd06423 CESA_like CESA_like is 93.1 0.7 1.8E-05 25.3 13.4 154 159-342 8-170 (180) 10 cd02520 Glucosylceramide_synth 91.5 1.1 2.8E-05 24.1 17.0 160 149-384 1-170 (196) 11 cd02525 Succinoglycan_BP_ExoA 91.2 1.2 3E-05 23.9 13.3 187 151-389 2-206 (249) 12 cd04186 GT_2_like_c Subfamily 90.4 1.4 3.6E-05 23.4 12.6 150 160-386 9-160 (166) 13 cd06439 CESA_like_1 CESA_like_ 90.2 1.5 3.7E-05 23.3 15.0 194 137-383 17-221 (251) 14 pfam11913 DUF3431 Protein of u 89.6 1.2 3.2E-05 23.8 5.9 191 167-388 14-215 (222) 15 PRK11204 N-glycosyltransferase 88.3 2 5.2E-05 22.5 13.7 192 145-383 50-252 (421) 16 TIGR02699 archaeo_AfpA archaeo 88.0 0.64 1.6E-05 25.6 3.6 65 161-234 11-87 (176) 17 PRK05454 glucosyltransferase M 87.4 2.3 5.8E-05 22.2 9.8 120 141-286 115-256 (663) 18 PRK11498 bcsA cellulose syntha 86.5 2.6 6.5E-05 21.9 10.3 148 176-342 274-447 (858) 19 cd06437 CESA_CaSu_A2 Cellulose 85.5 2.9 7.4E-05 21.6 14.0 190 150-384 2-206 (232) 20 TIGR03472 HpnI hopanoid biosyn 84.4 3.2 8.3E-05 21.2 13.4 195 147-387 39-249 (373) 21 PTZ00260 glycosyl transferase 84.0 3.4 8.6E-05 21.2 11.8 77 160-240 95-182 (336) 22 PRK09450 cyaA adenylate cyclas 80.9 2.8 7.2E-05 21.6 4.2 69 84-173 589-658 (833) 23 TIGR02918 TIGR02918 conserved 80.6 2.1 5.3E-05 22.4 3.5 290 17-369 137-475 (511) 24 COG1215 Glycosyltransferases, 79.4 4.9 0.00013 20.1 14.5 197 148-388 53-262 (439) 25 cd03817 GT1_UGDG_like This fam 75.6 3.4 8.6E-05 21.1 3.3 40 175-218 147-186 (374) 26 cd06433 GT_2_WfgS_like WfgS an 73.5 7.2 0.00018 19.1 10.2 183 158-390 8-194 (202) 27 KOG2571 consensus 73.0 2.8 7.2E-05 21.6 2.4 160 202-389 394-579 (862) 28 pfam01295 Adenylate_cycl Adeny 71.9 6.1 0.00016 19.6 3.9 113 80-215 590-704 (830) 29 cd02526 GT2_RfbF_like RfbF is 71.3 8.1 0.00021 18.8 13.5 185 160-384 10-197 (237) 30 TIGR03584 PseF pseudaminic aci 70.1 8.6 0.00022 18.7 11.2 152 162-341 26-186 (222) 31 cd04184 GT2_RfbC_Mx_like Myxoc 69.5 8.8 0.00023 18.6 14.6 192 149-391 1-201 (202) 32 cd02513 CMP-NeuAc_Synthase CMP 63.9 11 0.00029 17.9 8.7 154 162-341 28-189 (223) 33 cd04192 GT_2_like_e Subfamily 63.8 11 0.00029 17.9 12.2 180 160-383 9-201 (229) 34 cd04190 Chitin_synth_C C-termi 60.3 13 0.00034 17.5 5.1 113 263-385 87-214 (244) 35 KOG1170 consensus 60.2 8.4 0.00022 18.7 2.7 122 230-381 627-748 (1099) 36 cd06427 CESA_like_2 CESA_like_ 58.8 14 0.00036 17.3 14.9 188 150-386 2-207 (241) 37 cd06438 EpsO_like EpsO protein 55.2 16 0.00041 17.0 11.3 150 160-341 9-170 (183) 38 cd06436 GlcNAc-1-P_transferase 53.9 17 0.00043 16.8 12.9 155 160-345 9-182 (191) 39 COG1428 Deoxynucleoside kinase 53.6 17 0.00044 16.8 4.4 72 157-229 107-178 (216) 40 KOG3969 consensus 50.5 19 0.00047 16.6 3.2 12 230-241 193-204 (310) 41 pfam02514 CobN-Mg_chel CobN/Ma 46.8 19 0.00048 16.6 2.7 29 145-173 239-278 (1064) 42 pfam00535 Glycos_transf_2 Glyc 45.6 23 0.00058 16.0 11.1 152 160-342 10-168 (168) 43 KOG2791 consensus 44.7 24 0.0006 16.0 4.7 101 13-118 160-278 (455) 44 COG1036 Archaeal flavoproteins 43.3 25 0.00063 15.8 3.1 75 149-233 9-94 (187) 45 KOG1402 consensus 41.4 27 0.00068 15.6 3.1 116 87-219 38-171 (427) 46 COG1154 Dxs Deoxyxylulose-5-ph 38.9 27 0.00069 15.6 2.5 81 206-288 248-344 (627) 47 KOG1115 consensus 38.8 29 0.00075 15.4 5.5 96 158-269 142-246 (516) 48 pfam03808 Glyco_tran_WecB Glyc 38.4 30 0.00076 15.4 5.1 53 155-207 29-81 (172) 49 TIGR03111 glyc2_xrt_Gpos1 puta 38.0 30 0.00077 15.3 14.0 171 145-342 45-231 (439) 50 cd04185 GT_2_like_b Subfamily 37.9 30 0.00077 15.3 9.4 111 152-287 1-116 (202) 51 cd03814 GT1_like_2 This family 36.9 31 0.0008 15.2 3.4 13 358-370 319-331 (364) 52 TIGR02470 sucr_synth sucrose s 35.7 11 0.00029 17.9 0.2 229 47-335 129-439 (790) 53 COG4098 comFA Superfamily II D 34.5 34 0.00087 15.0 7.9 16 49-64 229-244 (441) 54 cd06533 Glyco_transf_WecG_TagA 34.1 35 0.00089 14.9 5.1 53 154-206 26-78 (171) 55 cd04195 GT2_AmsE_like GT2_AmsE 33.3 36 0.00091 14.9 12.6 188 153-389 2-199 (201) 56 COG2943 MdoH Membrane glycosyl 32.9 36 0.00093 14.8 10.0 145 119-286 114-277 (736) 57 cd03813 GT1_like_3 This family 32.8 25 0.00065 15.8 1.6 59 158-219 225-284 (475) 58 cd03801 GT1_YqgM_like This fam 32.7 37 0.00093 14.8 5.1 38 175-216 143-182 (374) 59 TIGR02708 L_lactate_ox L-lacta 32.7 37 0.00093 14.8 4.6 60 155-214 115-180 (368) 60 pfam09746 Membralin Tumour-ass 32.2 37 0.00095 14.7 3.9 102 245-356 216-327 (375) 61 TIGR01357 aroB 3-dehydroquinat 32.2 17 0.00045 16.8 0.7 195 149-372 26-253 (361) 62 pfam07845 DUF1636 Protein of u 31.8 34 0.00086 15.0 2.1 13 328-340 95-108 (116) 63 COG1803 MgsA Methylglyoxal syn 29.6 41 0.0011 14.5 4.6 44 146-189 2-45 (142) 64 KOG4425 consensus 29.3 42 0.0011 14.4 2.5 75 266-365 561-655 (900) 65 COG5469 Predicted metal-bindin 28.7 43 0.0011 14.4 3.3 13 328-340 115-128 (143) 66 COG1216 Predicted glycosyltran 27.8 44 0.0011 14.3 14.2 170 150-342 4-183 (305) 67 COG1922 WecG Teichoic acid bio 27.7 45 0.0011 14.3 6.0 130 52-206 11-140 (253) 68 COG3660 Predicted nucleoside-d 27.5 45 0.0011 14.3 6.0 88 146-240 159-255 (329) 69 COG0337 AroB 3-dehydroquinate 27.3 24 0.0006 16.0 0.7 36 148-184 33-69 (360) 70 cd02509 GDP-M1P_Guanylyltransf 26.6 47 0.0012 14.2 5.4 77 160-237 33-112 (274) 71 cd06321 PBP1_ABC_sugar_binding 25.4 49 0.0013 14.0 6.3 71 150-221 1-79 (271) 72 cd05840 SPBC215_ISWI_like The 25.0 23 0.00059 16.0 0.3 43 325-368 48-90 (93) 73 TIGR02088 LEU3_arch isopropylm 24.5 51 0.0013 13.9 2.4 52 162-216 216-270 (350) 74 TIGR00287 cas1 CRISPR-associat 24.3 52 0.0013 13.9 2.3 39 223-261 235-274 (353) 75 cd03802 GT1_AviGT4_like This f 23.9 53 0.0013 13.8 4.3 56 150-216 109-164 (335) 76 pfam08589 DUF1770 Fungal prote 22.5 38 0.00096 14.7 1.0 21 106-126 70-90 (96) 77 cd07969 OBF_DNA_ligase_I The O 21.4 32 0.00082 15.1 0.4 33 207-239 16-48 (144) 78 TIGR01169 rplA_bact ribosomal 20.9 60 0.0015 13.5 2.3 73 164-239 96-176 (227) 79 TIGR02814 pfaD_fam PfaD family 20.7 41 0.001 14.5 0.8 253 60-369 52-338 (449) 80 PRK07482 hypothetical protein; 20.6 61 0.0016 13.4 3.6 220 101-340 40-282 (461) 81 pfam00862 Sucrose_synth Sucros 20.4 42 0.0011 14.4 0.9 72 250-336 371-450 (550) No 1 >pfam05045 RgpF Rhamnan synthesis protein F. This family consists of a group of proteins which are related to the Streptococcus rhamnose-glucose polysaccharide assembly protein (RgpF). Rhamnan backbones are found in several O polysaccharides of phytopathogenic bacteria and are regarded as pathogenic factors. Probab=100.00 E-value=0 Score=519.53 Aligned_cols=228 Identities=30% Similarity=0.443 Sum_probs=199.9 Q ss_pred CCCCEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCC-HHHHHHHHHHC---CCCCEEEEECCCCHHHHHHHHHHCC Q ss_conf 0147389999416778999999997089981499971765-28999999711---7873899945666047999986221 Q gi|254780201|r 147 IKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKY---FPSAQLYVMENKGRDVRPFLYLLEL 222 (394) Q Consensus 147 ~k~kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~-~~~~~~~~~~~---~~~a~V~vv~NRGRDI~PfL~~~~~ 222 (394) -.+|||||+|+||+||++|+++|++++|+++||||||+++ ++++.++..+. ..+++|++||||||||+|||+++++ T Consensus 263 ~~~kIav~~HvyY~DLl~e~l~y~~~ip~~~Dl~ITT~~~ek~~~i~~~l~~~~~~~~~~V~vv~NRGRDv~pfLv~~~~ 342 (498) T pfam05045 263 DDQKIAVHAHVYYVDMLDEILTAFKNIPFSYDLIITTDTDDKKAEIEEILAKRKGAKNAIVRVVQNRGRDMSPLLISLKD 342 (498) T ss_pred CCCEEEEEEEEECHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHH T ss_conf 57859999995347269999999982898741899859877899999999856677732899905777417889999887 Q ss_pred CCC-CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCC Q ss_conf 002-7867899973687767777644407899999987479989999999997609976898168765443323455432 Q gi|254780201|r 223 GVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRS 301 (394) Q Consensus 223 ~~l-~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~ 301 (394) .++ ++||||||+|||||+|.+. ..|++||++|+++||||++.|++|+++|++||++|||.|+.++...+ .++..|. T Consensus 343 ~l~~~~YD~v~hiHtKKS~~~~~--~~G~~wr~~l~~nLlgs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~-~~~~~W~ 419 (498) T pfam05045 343 ELVDDRYDLVCRFHTKKSPQADF--WAGESFKEELIDNLLKPRGYADNILAMFEQNPSIGLVIPDIVHIRYP-TIGNAWN 419 (498) T ss_pred HHCCCCCCEEEEEECCCCCCCCC--CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCHHHCCCC-CCCCCCC T ss_conf 74536774899965574766686--40579999999972198899999999871098636873872312575-4356764 Q ss_pred CCHHHHHHHHHHCCCCCC--CCCCCCCCEEEEEEEHHHHHHHHCCCC-CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 213899999997489988--554773770123320988875765882-23585446888873899999999999998698 Q gi|254780201|r 302 EVYRRVIDLAKRAGFPTK--RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 (394) Q Consensus 302 ~~~~~~~~L~~rlgi~~~--~~~~~FpaGSMFW~R~~AL~pL~~l~l-~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf 378 (394) +|.+.+.+|++++|+..+ ...+.||+||||||||+||+||++++| ++|||+||+| ||||||||||+++|+|+.+|| T Consensus 420 ~N~~~~~~L~~~lgl~~~~d~~~~~~p~GTMFW~Rp~AL~~L~~l~l~~~DFpeEp~~-DGTLaHAIERli~yia~~~GY 498 (498) T pfam05045 420 ENAPEMNKLARRMGIKKKFDDNTPVASYGTMFWFRPEALKKLFELEWKWEDFPNEPLP-DNSLLHAIERLLAYAAWDAGY 498 (498) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHHHCCC T ss_conf 5679999999984999877756666676549997189989998379980006646897-428999999999999987539 No 2 >COG3754 RgpF Lipopolysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=377.86 Aligned_cols=238 Identities=29% Similarity=0.423 Sum_probs=212.4 Q ss_pred CCCEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCH--HHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCC Q ss_conf 1473899994167789999999970899814999717652--89999997117873899945666047999986221002 Q gi|254780201|r 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN--KDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVF 225 (394) Q Consensus 148 k~kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~--~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l 225 (394) ..||||++|+||+||++|++.++.|.+.+|||++||.+.. ..+++.+++...++.|.++||||||++||++++++... T Consensus 292 ~~kIav~lHv~ytdll~e~l~laen~~~~ydl~~tt~~p~k~~~i~~~~arkp~~A~v~v~~nrGrd~~pll~~~~~~~~ 371 (595) T COG3754 292 IQKIAVHLHVYYTDLLDEFLALAENTPFPYDLILTTDTPDKVAEIQRALARKPKNAFVRVVENRGRDIGPLLHLLRDLKG 371 (595) T ss_pred CEEEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHHCCCCCCCEEECCCCCCCCCHHHHHHHHHCCC T ss_conf 10268884146403589999998647988651774488156669999854488763687047876300317999875478 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHH Q ss_conf 78678999736877677776444078999999874799899999999976099768981687654433234554322138 Q gi|254780201|r 226 DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR 305 (394) Q Consensus 226 ~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~ 305 (394) +.||+||++|||||+|.. .+.|+.||.+++++|+++...|.+++.+|+++|.+|+..|+.+++. ......+|.+|.. T Consensus 372 d~ydlI~~~htkksgq~~--~~~Ge~~r~~~~~~L~~~~~~v~n~~~~~~~~~~~~~~ip~~f~i~-~~~~~~aw~~~~a 448 (595) T COG3754 372 DRYDLVCRLHTKKSGQAD--FWAGELWRHHTIDNLLASRGYVTNALEIFADNPSVGADIPPFFRLN-YPTEGIAWFENRA 448 (595) T ss_pred CCEEEEEEEEECCCCCCC--CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCEEEEE-CCCCCCHHHHHHH T ss_conf 753576455421675410--1334268999998762477203236564300554220148705880-3566417988678 Q ss_pred HHHHHHHHCCCCCCC--CCCCCCCEEEEEEEHHHHHHHHCCCC-CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 999999974899885--54773770123320988875765882-235854468888738999999999999986980335 Q gi|254780201|r 306 RVIDLAKRAGFPTKR--LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382 (394) Q Consensus 306 ~~~~L~~rlgi~~~~--~~~~FpaGSMFW~R~~AL~pL~~l~l-~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~ 382 (394) .+.++++.|++.... ..+.+++|||||+||+||+||+++.| +++|+.|++++||+++|||||+++|++...||.... T Consensus 449 ~~~dl~~~m~~~~~~d~~T~~~~ygtm~W~r~~aLr~lfe~~l~~~~f~~e~~~~d~~~ahaiEr~l~yva~~~g~~~~~ 528 (595) T COG3754 449 AVADLLRLMGMNVKTDRNTFVAAYGTMFWFRPRALRPLFELELGWNDFPAEPLPQDGHLAHAIERLLAYVAWDAGYAFRH 528 (595) T ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHC T ss_conf 99989987077665332321454355543278888888875413645432468877388889999999998310023321 Q ss_pred EHHHHH Q ss_conf 247565 Q gi|254780201|r 383 VDCVAE 388 (394) Q Consensus 383 v~~~~~ 388 (394) ++|-.. T Consensus 529 ~m~a~~ 534 (595) T COG3754 529 VMTAFA 534 (595) T ss_pred CCCCCC T ss_conf 567100 No 3 >COG3754 RgpF Lipopolysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane] Probab=99.29 E-value=3.4e-13 Score=100.68 Aligned_cols=289 Identities=11% Similarity=-0.051 Sum_probs=191.6 Q ss_pred HCCHHHHHHHHHHHHHCCCCCCCCC-CCCCC-CCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 3324789998878763012211155-32688-740578888789886157568978999999999999999751001585 Q gi|254780201|r 11 LGKIENLLLRLDVEEKGNMQAIYIP-AHVSG-YYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFP 88 (394) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~y~p-~~~pg-~~~~WDntpR~~~~~~~~~~~tP~~f~~~~~wl~~~l~~~~~~~~~~~ 88 (394) ++.++.+-...++.+....+...++ ...|+ ..+.|++-+|++ +..++..+|-..+ ....+-... .... T Consensus 301 v~ytdll~e~l~laen~~~~ydl~~tt~~p~k~~~i~~~~arkp--~~A~v~v~~nrGr-----d~~pll~~~--~~~~- 370 (595) T COG3754 301 VYYTDLLDEFLALAENTPFPYDLILTTDTPDKVAEIQRALARKP--KNAFVRVVENRGR-----DIGPLLHLL--RDLK- 370 (595) T ss_pred CCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHHCCCC--CCCEEECCCCCCC-----CCHHHHHHH--HHCC- T ss_conf 46403589999998647988651774488156669999854488--7636870478763-----003179998--7547- Q ss_pred CCEEEEEEEECC--CCCCCCHHHHHHHHHHHHHHHHHHHH-HHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHHHHHH Q ss_conf 643999973137--74100732687187559999997302-664202467822101100000147389999416778999 Q gi|254780201|r 89 SCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV-KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIE 165 (394) Q Consensus 89 ~~~ivfinaWNE--EG~yLePD~~~G~~yL~a~~~al~~~-~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DLl~E 165 (394) .++..+|+.|++ +|++ |...|.+|...+...|++. +.+.+. .+..+...+..+..--++++- |++....+ T Consensus 371 ~d~ydlI~~~htkksgq~---~~~~Ge~~r~~~~~~L~~~~~~v~n~---~~~~~~~~~~~~~ip~~f~i~-~~~~~~aw 443 (595) T COG3754 371 GDRYDLVCRLHTKKSGQA---DFWAGELWRHHTIDNLLASRGYVTNA---LEIFADNPSVGADIPPFFRLN-YPTEGIAW 443 (595) T ss_pred CCCEEEEEEEEECCCCCC---CCCCCHHHHHHHHHHHHCCCCCHHHH---HHHHHHCCCCCCCCCCEEEEE-CCCCCCHH T ss_conf 875357645542167541---01334268999998762477203236---564300554220148705880-35664179 Q ss_pred HHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCC Q ss_conf 99999708998149997176528999999711787389994566604799998622100278678999736877677776 Q gi|254780201|r 166 ISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH 245 (394) Q Consensus 166 ~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~ 245 (394) + .....+-.-.++.++.++. -...+..+--....+|+|||-||+.. .-++..+|+.|++.+++ T Consensus 444 ~-~~~a~~~dl~~~m~~~~~~-------d~~T~~~~ygtm~W~r~~aLr~lfe~-----~l~~~~f~~e~~~~d~~---- 506 (595) T COG3754 444 F-ENRAAVADLLRLMGMNVKT-------DRNTFVAAYGTMFWFRPRALRPLFEL-----ELGWNDFPAEPLPQDGH---- 506 (595) T ss_pred H-HHHHHHHHHHHHHCCCCCC-------CCCCHHHHHHHHHHCCHHHHHHHHHH-----HCCHHHHHHCCCCCCHH---- T ss_conf 8-8678999899870776653-------32321454355543278888888875-----41364543246887738---- Q ss_pred CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 44407899999987479989999999997609976898168765443323455432213899999997489988554773 Q gi|254780201|r 246 PIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF 325 (394) Q Consensus 246 ~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~~~~~~~~F 325 (394) ..-+.+|+--++...+| .....+..+++.+|+++|+... +. +|-...++. +| T Consensus 507 ~ahaiEr~l~yva~~~g--~~~~~~m~a~~~~~~~~~l~~~-----------------------~~-~~~~~~~~~--~~ 558 (595) T COG3754 507 LAHAIERLLAYVAWDAG--YAFRHVMTAFAAEPNYDMLEAK-----------------------LQ-RLRSGDPSA--DF 558 (595) T ss_pred HHHHHHHHHHHHHHCCH--HHHHCCCCCCCCCCCCCHHHHH-----------------------HH-HHHCCCCCC--CC T ss_conf 88899999999983100--2332156710047883068887-----------------------77-751589963--31 Q ss_pred CCEEEEEEEHHHHHHHHCCCCC-CCCCCCCCCCCCHHHH Q ss_conf 7701233209888757658822-3585446888873899 Q gi|254780201|r 326 FNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEH 363 (394) Q Consensus 326 paGSMFW~R~~AL~pL~~l~l~-~DF~~E~g~~DGTlaH 363 (394) +-- ||.++++-++|.++... -.+-+|-++++++.+| T Consensus 559 ~~~--~~~~~~vk~~l~~l~~~~~~~~~~~~~~l~~~~~ 595 (595) T COG3754 559 NLR--FWGRTSVKRALRDLIDAIKKSGKERGPILFRKLR 595 (595) T ss_pred HHH--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 233--0440436777778779998751555066665229 No 4 >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end. Probab=97.15 E-value=0.022 Score=34.56 Aligned_cols=194 Identities=10% Similarity=0.145 Sum_probs=102.1 Q ss_pred CCEEEEEEECCH--HHHHHHHHHHHHCCCC---EEEEEECCCCHHH---HHHHHHHCCCCCEEEEECCCCHHHHHHHHHH Q ss_conf 473899994167--7899999999708998---1499971765289---9999971178738999456660479999862 Q gi|254780201|r 149 SKIAIVVHCYYQ--DTWIEISHILLRLNFD---FDLFVTVVEANKD---FEQDVLKYFPSAQLYVMENKGRDVRPFLYLL 220 (394) Q Consensus 149 ~kIAVvlHlyY~--DLl~E~~~~L~~ip~~---~DL~ITt~~~~~~---~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~ 220 (394) ++|.|++=.|-. ++++++++.+.+...+ ++++|--+..... +.++.........+..-.|+|.=-+.+=.++ T Consensus 1 P~VsViIPayNE~~~~i~~~l~sl~~q~YP~~~~eIiVvdD~std~t~~~~~~~~~~~~~~~~~~~~~~g~Ka~alN~g~ 80 (234) T cd06421 1 PTVDVFIPTYNEPLEIVRKTLRAALAIDYPHDKLRVYVLDDGRRPELRALAAELGVEYGYRYLTRPDNRHAKAGNLNNAL 80 (234) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHH T ss_conf 93999998389978999999999996799998289999989898788988888456643688874889997699999999 Q ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCC Q ss_conf 21002786789997368776777764440789999998747998999999999760997689816876544332345543 Q gi|254780201|r 221 ELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300 (394) Q Consensus 221 ~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~ 300 (394) + ..++|+|+.+-+== +-.++.+.+++..|.++|++|+|..+..........+... T Consensus 81 ~---~a~gd~v~~~DaD~----------------------~~~~~~L~~~~~~~~~~~~v~~v~~~~~~~~~~~~~~~~~ 135 (234) T cd06421 81 A---HTTGDFVAILDADH----------------------VPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLAD 135 (234) T ss_pred H---HCCCCEEEEECCCC----------------------CCCHHHHHHHHHHHHHCCCEEEEECCEEEECCCCCCHHHH T ss_conf 9---77899999988987----------------------7087699999999986997589953567734883328999 Q ss_pred -CCCHHH-HHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf -221389-999999748998855477377012332098887576588223585446888873899999999999998698 Q gi|254780201|r 301 -SEVYRR-VIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 (394) Q Consensus 301 -~~~~~~-~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf 378 (394) ...... ....... +. ......|..|+.+-+|.++|+.+-. |++. +++ =..-+++-...+|| T Consensus 136 ~~~~~~~~~~~~~~~-~~--~~~~~~~~~G~~~~~rr~~l~~vGg------f~~~------~~~--ED~dl~~rl~~~G~ 198 (234) T cd06421 136 GAPNEQELFYGVIQP-GR--DRWGAAFCCGSGAVVRREALDEIGG------FPTD------SVT--EDLATSLRLHAKGW 198 (234) T ss_pred HHHHHHHHHHHHHHH-HH--HCCCCEEECCCHHHHHHHHHHHCCC------CCCC------CCC--HHHHHHHHHHHCCC T ss_conf 999999999999998-87--3388425535506788999998189------9999------976--59999999997699 Q ss_pred EEEEEH Q ss_conf 033524 Q gi|254780201|r 379 SIESVD 384 (394) Q Consensus 379 ~i~~v~ 384 (394) ++.-+. T Consensus 199 k~~y~p 204 (234) T cd06421 199 RSVYVP 204 (234) T ss_pred EEEEEC T ss_conf 799936 No 5 >cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response. Probab=96.14 E-value=0.11 Score=30.20 Aligned_cols=188 Identities=15% Similarity=0.145 Sum_probs=98.1 Q ss_pred EEEECC--HHHHHHHHHHHHHCCCC-EEEEEECCC--CH--HHHHHHHHHC-CCCCEEE-EECCCCHHHHHHHHHHCCCC Q ss_conf 999416--77899999999708998-149997176--52--8999999711-7873899-94566604799998622100 Q gi|254780201|r 154 VVHCYY--QDTWIEISHILLRLNFD-FDLFVTVVE--AN--KDFEQDVLKY-FPSAQLY-VMENKGRDVRPFLYLLELGV 224 (394) Q Consensus 154 vlHlyY--~DLl~E~~~~L~~ip~~-~DL~ITt~~--~~--~~~~~~~~~~-~~~a~V~-vv~NRGRDI~PfL~~~~~~~ 224 (394) ++=.|= ++.+.++++.|.....+ ++++|--+. +. .+...+..+. .+...+. +-+|.|.--+.+=.+++. . T Consensus 3 iIPayNE~~~vi~~~l~sl~~~~Yp~~eIiVvdd~stD~t~~~~v~~~~~~~~~~~~~~~~~~~~g~K~~alN~~l~~-~ 81 (236) T cd06435 3 HVPCYEEPPEMVKETLDSLAALDYPNFEVIVIDNNTKDEALWKPVEAHCAQLGERFRFFHVEPLPGAKAGALNYALER-T 81 (236) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHH-C T ss_conf 983589928999999999995799998799998929981589999999987299679999488899709999999996-4 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCH Q ss_conf 27867899973687767777644407899999987479989999999997609976898168765443323455432213 Q gi|254780201|r 225 FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY 304 (394) Q Consensus 225 l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~ 304 (394) ..+||+|+.+.+==- + .++.+..++..|+ ||++|+|..+....... .+..... T Consensus 82 ~~~~e~i~~~DaD~~---------------------~-~pd~L~~~v~~f~-~p~vg~v~~~~~~~~~~----~~~~~~~ 134 (236) T cd06435 82 APDAEIIAVIDADYQ---------------------V-EPDWLKRLVPIFD-DPRVGFVQAPQDYRDGE----ESLFKRM 134 (236) T ss_pred CCCCEEEEEECCCCC---------------------C-CHHHHHHHHHHHC-CCCEEEEECCEECCCCC----CCHHHHH T ss_conf 798549999768877---------------------7-8689999998604-98768995561314877----6799999 Q ss_pred HHHH-HHHHHCCCCC-CCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 8999-9999748998-8554773770123320988875765882235854468888738999999999999986980335 Q gi|254780201|r 305 RRVI-DLAKRAGFPT-KRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382 (394) Q Consensus 305 ~~~~-~L~~rlgi~~-~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~ 382 (394) .... ...-.++... ......|..|+++-+|.++|+.+-- |++...--|- -+++-+..+||++.- T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~Rr~~l~~iGg------f~~~~~~ED~--------dl~~rl~~~G~~~~y 200 (236) T cd06435 135 CYAEYKGFFDIGMVSRNERNAIIQHGTMCLIRRSALDDVGG------WDEWCITEDS--------ELGLRMHEAGYIGVY 200 (236) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEECCEEEEEEHHHHHHHCC------CCCCCCCCHH--------HHHHHHHHCCCEEEE T ss_conf 99999999998898885389747666468878899998389------8877757779--------999999985997997 Q ss_pred E Q ss_conf 2 Q gi|254780201|r 383 V 383 (394) Q Consensus 383 v 383 (394) . T Consensus 201 ~ 201 (236) T cd06435 201 V 201 (236) T ss_pred C T ss_conf 2 No 6 >cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate. Probab=95.77 E-value=0.16 Score=29.20 Aligned_cols=195 Identities=14% Similarity=0.202 Sum_probs=105.1 Q ss_pred EEEEEEECCH--HHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHH--HHCCCCCEEEEECCCCHHHHHHHHHHCCCCCC Q ss_conf 3899994167--7899999999708998149997176528999999--71178738999456660479999862210027 Q gi|254780201|r 151 IAIVVHCYYQ--DTWIEISHILLRLNFDFDLFVTVVEANKDFEQDV--LKYFPSAQLYVMENKGRDVRPFLYLLELGVFD 226 (394) Q Consensus 151 IAVvlHlyY~--DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~--~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~ 226 (394) |+|++=+|-. |.+.++++.+.... +++++|-.+.......+.+ ....+.+.|...+|.|.- +++=.+++ .. T Consensus 2 VsviIPayNE~~~il~~~l~s~~~~~-~~eiivV~D~s~d~~~~~~~~~~~~~~~~v~~~~~~GK~-~Aln~~l~---~a 76 (235) T cd06434 2 VTVIIPVYDEDPDVFRECLRSILRQK-PLEIIVVTDGDDEPYLSILSQTVKYGGIFVITVPHPGKR-RALAEGIR---HV 76 (235) T ss_pred EEEEEEECCCCHHHHHHHHHHHHCCC-CCEEEEEECCCCCCHHHHHHHHCCCCCEEEEECCCCCHH-HHHHHHHH---HC T ss_conf 79999522697689999999998089-998999988999728999998634897899988989989-99999999---75 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHH- Q ss_conf 8678999736877677776444078999999874799899999999976099768981687654433234554322138- Q gi|254780201|r 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR- 305 (394) Q Consensus 227 ~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~- 305 (394) ++|+|+-+-.=- .+ .++.+.+++..|+ ||++|.|++..... +.. .+++.... T Consensus 77 ~~d~v~~~DaD~---------------------~~-~~~~l~~l~~~f~-d~~vg~V~~~~~~~-~~~---~~~~~~~~~ 129 (235) T cd06434 77 TTDIVVLLDSDT---------------------VW-PPNALPEMLKPFE-DPKVGGVGTNQRIL-RPR---DSKWSFLAA 129 (235) T ss_pred CCCEEEEECCCC---------------------CC-CHHHHHHHHHHHC-CCCEEEEEEEEEEE-CCC---CCHHHHHHH T ss_conf 778899985885---------------------35-8889999999648-99879997038864-699---989999999 Q ss_pred ---HHHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH-HHHHHHHHHCCCEEE Q ss_conf ---9999999748998855477377012332098887576588223585446888873899999-999999998698033 Q gi|254780201|r 306 ---RVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVE-RFFACSVRYTEFSIE 381 (394) Q Consensus 306 ---~~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIE-Rl~~~iv~~~gf~i~ 381 (394) .......+-.... .....+..|+..-+|.++|+..... ..|..|... |...+.-| ..+..-+.+.||++. T Consensus 130 ~~~~~~~~~~~~~~~~-~g~~~~~~G~~~~~Rr~~l~~~~~~---~~~~~~~~~--g~~~~~~ED~~lt~r~~~~G~r~~ 203 (235) T cd06434 130 EYLERRNEEIRAAMSY-DGGVPCLSGRTAAYRTEILKDFLFL---EEFTNETFM--GRRLNAGDDRFLTRYVLSHGYKTV 203 (235) T ss_pred HHHHHHHHHHHHHHHH-CCCEEEECCHHHHHHHHHHHHHCCH---HCCCCCCCC--CCCCCCCHHHHHHHHHHHCCCEEE T ss_conf 9999999999999985-5986896370799999999853301---012300027--777764269999999998699899 Q ss_pred EE Q ss_conf 52 Q gi|254780201|r 382 SV 383 (394) Q Consensus 382 ~v 383 (394) -. T Consensus 204 y~ 205 (235) T cd06434 204 YQ 205 (235) T ss_pred EE T ss_conf 92 No 7 >TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils. Probab=94.23 E-value=0.45 Score=26.51 Aligned_cols=171 Identities=9% Similarity=0.059 Sum_probs=90.9 Q ss_pred HHCCCCEEEEEEECCHH--HHHHHHHHHHHCCCCEE---EEEECCCC-----------------HHHHHHHHHHCCCCCE Q ss_conf 00014738999941677--89999999970899814---99971765-----------------2899999971178738 Q gi|254780201|r 145 LTIKSKIAIVVHCYYQD--TWIEISHILLRLNFDFD---LFVTVVEA-----------------NKDFEQDVLKYFPSAQ 202 (394) Q Consensus 145 ~~~k~kIAVvlHlyY~D--Ll~E~~~~L~~ip~~~D---L~ITt~~~-----------------~~~~~~~~~~~~~~a~ 202 (394) ...-++|.|++-.|-++ ++...+....++..|-| +||-=+-. .+...+++.+....-- T Consensus 127 ~~~~PtVDV~IPtYNEp~~vv~~Tl~aa~~ldYP~~kl~V~vLDDG~td~~~~~~~~~~~~~~~rR~e~~~la~~lGv~Y 206 (713) T TIGR03030 127 PEEWPTVDVFIPTYNEDLEIVATTVLAAKNMDYPADKFRVWILDDGGTDQKRNDPDPEQAEAAQRREELKEFCRKLGVNY 206 (713) T ss_pred HHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCEE T ss_conf 10189534896489999899999999998689998546699997989710023430123333441399999999809865 Q ss_pred EEEECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 99945666047999986221002786789997368776777764440789999998747998999999999760997689 Q gi|254780201|r 203 LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGM 282 (394) Q Consensus 203 V~vv~NRGRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGl 282 (394) |..-.|++---|-+=.+++ .. +-|+|+-+---- ...++...+.+..|.+||++|+ T Consensus 207 itR~~n~hAKAGNLN~AL~--~t-~GelIaVfDAD~----------------------vP~~~FL~~tvg~F~~Dpklal 261 (713) T TIGR03030 207 ITRPRNVHAKAGNINNALK--HT-DGELILIFDADH----------------------VPTRDFLQRTVGWFVEDPKLFL 261 (713) T ss_pred EECCCCCCCCCHHHHHHHH--HC-CCCEEEEECCCC----------------------CCCHHHHHHHHHHHHCCCCEEE T ss_conf 5379878876167899987--36-987899967887----------------------8668999999898702997699 Q ss_pred EECCCCCC-CCCCCCCC--CCCCCHHHHHHHHHHCCCCCCC-CCCCCCCEEEEEEEHHHHHHHH Q ss_conf 81687654-43323455--4322138999999974899885-5477377012332098887576 Q gi|254780201|r 283 IGSRRYRR-YKRWSFFA--KRSEVYRRVIDLAKRAGFPTKR-LHLDFFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 283 v~p~~~~~-~~~~~~~~--~~~~~~~~~~~L~~rlgi~~~~-~~~~FpaGSMFW~R~~AL~pL~ 342 (394) |-.+++.. .++...+- ...-+++ .++--+.--+..+ -..-|..||++-.|.+||..+= T Consensus 262 VQTPq~F~N~Dp~~~NL~~~~~~p~e--~~~Fy~~iq~g~d~wnaaf~cGS~aV~RR~AL~eIG 323 (713) T TIGR03030 262 VQTPHFFVSPDPIERNLGTFRRMPNE--NELFYGLIQDGNDFWNAAFFCGSAAVLRREALDEIG 323 (713) T ss_pred EECCEEEECCCHHHHCCHHHCCCCCH--HHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHHCC T ss_conf 94897568988787411120367504--778899985564668807982617987499999807 No 8 >cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane. Probab=93.91 E-value=0.52 Score=26.12 Aligned_cols=198 Identities=19% Similarity=0.220 Sum_probs=108.6 Q ss_pred EEEEEECCHHHHHHHH------HHHHHC--CCCEEEEEECCCCHHHH----H---HHHHHCCC-CCE---EEEECCCCHH Q ss_conf 8999941677899999------999708--99814999717652899----9---99971178-738---9994566604 Q gi|254780201|r 152 AIVVHCYYQDTWIEIS------HILLRL--NFDFDLFVTVVEANKDF----E---QDVLKYFP-SAQ---LYVMENKGRD 212 (394) Q Consensus 152 AVvlHlyY~DLl~E~~------~~L~~i--p~~~DL~ITt~~~~~~~----~---~~~~~~~~-~a~---V~vv~NRGRD 212 (394) ||++=+|-.|...-+. +.|... ...||+||=-++.+.++ + ..+.+... ... ++...|.|+- T Consensus 2 AvlvP~yNEd~~~v~~~l~a~~~sl~~~~~~~~f~v~vLsDs~~p~~~~~E~~a~~~l~~~~~~~~~v~Yr~R~~n~g~K 81 (254) T cd04191 2 AIVMPVYNEDPARVFAGLRAMYESLAKTGLADHFDFFILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRRRRENTGRK 81 (254) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCC T ss_conf 79995688998999999999999998638856537999808997578999999999999984788745988568667987 Q ss_pred HHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCC Q ss_conf 79999862210027867899973687767777644407899999987479989999999997609976898168765443 Q gi|254780201|r 213 VRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYK 292 (394) Q Consensus 213 I~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~ 292 (394) -|-.=..++. .-.+|||++=+-.-- +-+++.+...+..|+.||++|+|-.++... + T Consensus 82 AGni~~~l~~-~g~~yd~~~vlDAD~----------------------~~~~d~l~~lv~~~e~dp~~glVQt~~~~~-n 137 (254) T cd04191 82 AGNIADFCRR-WGSRYDYMVVLDADS----------------------LMSGDTIVRLVRRMEANPRAGIIQTAPKLI-G 137 (254) T ss_pred HHHHHHHHHH-CCCCCCEEEEECCCC----------------------CCCHHHHHHHHHHHHHCCCEEEEECCCCCC-C T ss_conf 3779999995-399846799975888----------------------998699999999997698868972897464-8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCC-------CCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 32345543221389999999748-------99885547737701233209888757658822358544688887389999 Q gi|254780201|r 293 RWSFFAKRSEVYRRVIDLAKRAG-------FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 (394) Q Consensus 293 ~~~~~~~~~~~~~~~~~L~~rlg-------i~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAI 365 (394) ... ....+.....++. +..-.....+.-||..-.|.+|+..--.+.-. +.-+|-..-|+.|=+ T Consensus 138 ~~s-------~~~r~~qf~~~~~~~~~~~G~~~w~~~~~~f~GhnAIiR~~af~~~~glp~L---~~~g~~~g~~lsED~ 207 (254) T cd04191 138 AET-------LFARLQQFANRLYGPVFGRGLAAWQGGEGNYWGHNAIIRVAAFMEHCALPVL---PGRPPFGGHILSHDF 207 (254) T ss_pred CCC-------HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEHHHHHHHCCCCCC---CCCCCCCCCCCCHHH T ss_conf 878-------8999888999860199875687752553344673146660877764176446---788997886130579 Q ss_pred HHHHHHHHHHCCCEEEEEHH Q ss_conf 99999999986980335247 Q gi|254780201|r 366 ERFFACSVRYTEFSIESVDC 385 (394) Q Consensus 366 ERl~~~iv~~~gf~i~~v~~ 385 (394) ..++..+.+||+|.-..- T Consensus 208 --~~a~~L~r~Gw~v~~~~~ 225 (254) T cd04191 208 --VEAALMRRAGWEVRLAPD 225 (254) T ss_pred --HHHHHHHHCCCEEEEECC T ss_conf --999999987998998125 No 9 >cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the Probab=93.13 E-value=0.7 Score=25.33 Aligned_cols=154 Identities=14% Similarity=0.096 Sum_probs=81.0 Q ss_pred CHHHHHHHHHHHHHCC-CCEEEEEECCCC---HHHHHHHHHHCCCCCEEEE--ECCCCHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 6778999999997089-981499971765---2899999971178738999--456660479999862210027867899 Q gi|254780201|r 159 YQDTWIEISHILLRLN-FDFDLFVTVVEA---NKDFEQDVLKYFPSAQLYV--MENKGRDVRPFLYLLELGVFDRYDYLC 232 (394) Q Consensus 159 Y~DLl~E~~~~L~~ip-~~~DL~ITt~~~---~~~~~~~~~~~~~~a~V~v--v~NRGRDI~PfL~~~~~~~l~~YD~v~ 232 (394) ..|.+.++++.+.+-. .+++++|--+.. ..+..++.....+...+++ -+|.|+ -+.+=.++ ...++||++ T Consensus 8 ~~~~l~~~l~sl~~Q~~~~~eIivvdd~S~d~t~~~~~~~~~~~~~~~~~~~~~~~~g~-~~a~N~g~---~~a~~d~i~ 83 (180) T cd06423 8 EEAVIERTIESLLALDYPKLEVIVVDDGSTDDTLEILEELAALYIRRVLVVRDKENGGK-AGALNAGL---RHAKGDIVV 83 (180) T ss_pred CHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCH-HHHHHHCC---CCCCCCEEE T ss_conf 78999999999981899981899998999757899999986217971898447776779-99975251---026755798 Q ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHH-HHH Q ss_conf 9736877677776444078999999874799899999999976099768981687654433234554322138999-999 Q gi|254780201|r 233 KIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI-DLA 311 (394) Q Consensus 233 klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~-~L~ 311 (394) .++.==.+ .++.+..++..+.++|++|+|++......... +++....... ... T Consensus 84 ~lD~D~~~----------------------~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~ 137 (180) T cd06423 84 VLDADTIL----------------------EPDALKRLVVPFFADPKVGAVQGRVRVRNGSE----NLLTRLQAIEYLSI 137 (180) T ss_pred EECCCCCC----------------------CHHHHHHHHHHHHHCCCEEEEEEEEEEECCCC----CHHHHHHHHHHHHH T ss_conf 74387200----------------------85699999999987959899985798546998----78999998998877 Q ss_pred HHCCCCC--CCCCCCCCCEEEEEEEHHHHHHHH Q ss_conf 9748998--855477377012332098887576 Q gi|254780201|r 312 KRAGFPT--KRLHLDFFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 312 ~rlgi~~--~~~~~~FpaGSMFW~R~~AL~pL~ 342 (394) ....... ......++.|+.+-+|.++|+.+= T Consensus 138 ~~~~~~~~~~~~~~~~~~G~~~~~rr~~~e~vG 170 (180) T cd06423 138 FRLGRRAQSALGGVLVLSGAFGAFRREALREVG 170 (180) T ss_pred HHHHHHHHHHCCCCEEECCEEEEEEHHHHHHCC T ss_conf 566666787659950678559998899999839 No 10 >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment. Probab=91.52 E-value=1.1 Score=24.10 Aligned_cols=160 Identities=13% Similarity=0.236 Sum_probs=93.1 Q ss_pred CCEEEEEEECCHHH-HHHHHHHHHHCCCC-EEEEEECC---CCHHHHHHHHHHCCCCCEEEE-ECCCCHH----HHHHHH Q ss_conf 47389999416778-99999999708998-14999717---652899999971178738999-4566604----799998 Q gi|254780201|r 149 SKIAIVVHCYYQDT-WIEISHILLRLNFD-FDLFVTVV---EANKDFEQDVLKYFPSAQLYV-MENKGRD----VRPFLY 218 (394) Q Consensus 149 ~kIAVvlHlyY~DL-l~E~~~~L~~ip~~-~DL~ITt~---~~~~~~~~~~~~~~~~a~V~v-v~NRGRD----I~PfL~ 218 (394) ++|.|++=+|=.+- +.+++..|.+...+ +.++|..+ +...++.+++.+.+|+..+++ +.++.+- +..+.. T Consensus 1 P~VSIivP~~nee~~i~~~l~sll~qdYp~~Eiivv~d~s~D~t~~i~~~~~~~~p~~~~~~~~~~~~~g~~~K~~~l~~ 80 (196) T cd02520 1 PGVSILKPLCGVDPNLYENLESFFQQDYPKYEILFCVQDEDDPAIPVVRKLIAKYPNVDARLLIGGEKVGINPKVNNLIK 80 (196) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHH T ss_conf 96799996899927799999999966899808999978999978999999874287766599976888784489999999 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCC Q ss_conf 62210027867899973687767777644407899999987479989999999997609976898168765443323455 Q gi|254780201|r 219 LLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA 298 (394) Q Consensus 219 ~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~ 298 (394) +++ ..+.|+++.+.+-=.+ .++.+.+++..|. +|++|+|... T Consensus 81 g~~---~a~gdii~~~DaD~~~----------------------~~~~L~~lv~~f~-~p~vg~V~~~------------ 122 (196) T cd02520 81 GYE---EARYDILVISDSDISV----------------------PPDYLRRMVAPLM-DPGVGLVTCL------------ 122 (196) T ss_pred HHH---HCCCCEEEEECCCCCC----------------------CHHHHHHHHHHHC-CCCCCEEECC------------ T ss_conf 999---6458999998899772----------------------9559999999856-9994679647------------ Q ss_pred CCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 43221389999999748998855477377012332098887576588223585446888873899999999999998698 Q gi|254780201|r 299 KRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 (394) Q Consensus 299 ~~~~~~~~~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf 378 (394) |..|+..-+|.++|..+-......++ ++ =--.++.-++.+|| T Consensus 123 --------------------------~~~G~~~~~Rr~~l~~~Gg~~~~~~~----------l~--ED~~l~~~l~~~G~ 164 (196) T cd02520 123 --------------------------CAFGKSMALRREVLDAIGGFEAFADY----------LA--EDYFLGKLIWRLGY 164 (196) T ss_pred --------------------------CCCCCEEEEEHHHHHHCCCHHHHCCC----------CH--HHHHHHHHHHHCCC T ss_conf --------------------------76785567889999874994453454----------17--99999999998599 Q ss_pred EEEEEH Q ss_conf 033524 Q gi|254780201|r 379 SIESVD 384 (394) Q Consensus 379 ~i~~v~ 384 (394) +|.-.. T Consensus 165 ri~~~p 170 (196) T cd02520 165 RVVLSP 170 (196) T ss_pred EEEEEC T ss_conf 799835 No 11 >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus. Probab=91.24 E-value=1.2 Score=23.92 Aligned_cols=187 Identities=10% Similarity=0.096 Sum_probs=90.3 Q ss_pred EEEEEEECC-HHHHHHHHHHHHHCC---CCEEEEEECC--C-CHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHH-HHCC Q ss_conf 389999416-778999999997089---9814999717--6-528999999711787389994566604799998-6221 Q gi|254780201|r 151 IAIVVHCYY-QDTWIEISHILLRLN---FDFDLFVTVV--E-ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY-LLEL 222 (394) Q Consensus 151 IAVvlHlyY-~DLl~E~~~~L~~ip---~~~DL~ITt~--~-~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~-~~~~ 222 (394) |.|++=.|= .+.+.++++.|.+.. ..+.++|--+ + +..++.++.....+. |+++.|.|+-.+.=+. ++ T Consensus 2 VSViIp~yN~~~~l~~~l~Sl~~q~yp~~~~EVIvVDd~S~D~t~~~~~~~~~~~~~--i~~~~~~~~~~~~a~N~gi-- 77 (249) T cd02525 2 VSIIIPVRNEEKYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQEYAAKDPR--IRLIDNPKRIQSAGLNIGI-- 77 (249) T ss_pred EEEEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHCCCC--EEECCCCCCCHHHHHHHHH-- T ss_conf 899993569789999999999846899998899999895961569999999740771--3211355565899998989-- Q ss_pred CCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 00278678999736877677776444078999999874799899999999976099768981687654433234554322 Q gi|254780201|r 223 GVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE 302 (394) Q Consensus 223 ~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~ 302 (394) ...+.|+|+.+.+-=- + +++.+++++..|++++..++.++....... .. T Consensus 78 -~~a~gd~i~~lD~D~~---------------------~-~~~~l~~~~~~~~~~~~~~v~g~~~~~~~~-------~~- 126 (249) T cd02525 78 -RNSRGDIIIRVDAHAV---------------------Y-PKDYILELVEALKRTGADNVGGPMETIGES-------KF- 126 (249) T ss_pred -HHCCCCEEEEECCCCC---------------------C-CHHHHHHHHHHHHCCCCEEEEEEEEECCCC-------HH- T ss_conf -9668776884147765---------------------6-956999999998778971999888506897-------27- Q ss_pred CHHHHHHHH-HHCCC-----CCCCCCCC-CCC--EEEEEEEHHHHHHHHCCCCC-CCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 138999999-97489-----98855477-377--01233209888757658822-3585446888873899999999999 Q gi|254780201|r 303 VYRRVIDLA-KRAGF-----PTKRLHLD-FFN--GTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACS 372 (394) Q Consensus 303 ~~~~~~~L~-~rlgi-----~~~~~~~~-Fpa--GSMFW~R~~AL~pL~~l~l~-~DF~~E~g~~DGTlaHAIERl~~~i 372 (394) ........ ...+. ........ .+. +.|| |.++++.+ +.+ ++|. ++=+--+++= T Consensus 127 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--rr~~~~~i---Ggfde~~~-----------~~ED~dl~~R 189 (249) T cd02525 127 -QKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHHGAY--RREVFEKV---GGFDESLV-----------RNEDAELNYR 189 (249) T ss_pred -HHHHHHHHHCCCCCCCCCCCCCCCCCCEECCEEEEEE--EHHHHHHH---CCCCCCCC-----------CCHHHHHHHH T ss_conf -7777887626432352000132245845421357988--88899973---89987778-----------2389999999 Q ss_pred HHHCCCEEEEEHHHHHH Q ss_conf 99869803352475657 Q gi|254780201|r 373 VRYTEFSIESVDCVAEY 389 (394) Q Consensus 373 v~~~gf~i~~v~~~~~~ 389 (394) +..+||++.-+..+.-| T Consensus 190 ~~~~G~ki~~~p~a~v~ 206 (249) T cd02525 190 LRKAGYKIWLSPDIRVY 206 (249) T ss_pred HHHCCCEEEEECCEEEE T ss_conf 99769979991887999 No 12 >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=90.36 E-value=1.4 Score=23.43 Aligned_cols=150 Identities=16% Similarity=0.212 Sum_probs=86.0 Q ss_pred HHHHHHHHHHHHHCCC-CEEEEEECCCCHHHHHHHHHHCCCCCEEEEE-CCCCHHHHHHHHHHCCCCCCCCCEEEEECCC Q ss_conf 7789999999970899-8149997176528999999711787389994-5666047999986221002786789997368 Q gi|254780201|r 160 QDTWIEISHILLRLNF-DFDLFVTVVEANKDFEQDVLKYFPSAQLYVM-ENKGRDVRPFLYLLELGVFDRYDYLCKIHGK 237 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~-~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv-~NRGRDI~PfL~~~~~~~l~~YD~v~klHtK 237 (394) ++.+.++++.+.+-.. ++.++|--........+.+.+..++..+... +|+|. -.+.=.+++ ..+.|+++.+++- T Consensus 9 ~~~l~~~l~Sl~~q~~~~~eiivVD~~S~d~t~~~~~~~~~~i~~i~~~~n~G~-~~a~N~g~~---~a~g~~i~~lD~D 84 (166) T cd04186 9 LEYLKACLDSLLAQTYPDFEVIVVDNASTDGSVELLRELFPEVRLIRNGENLGF-GAGNNQGIR---EAKGDYVLLLNPD 84 (166) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEEECCCCCCH-HHHHHHHHH---HHCCCCEEEECCC T ss_conf 899999999998377998099999689983156777742898899989898776-787616898---7165726998799 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 77677776444078999999874799899999999976099768981687654433234554322138999999974899 Q gi|254780201|r 238 KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP 317 (394) Q Consensus 238 KS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~ 317 (394) =-+ +++.+.+++..|+++|++|+++|. T Consensus 85 ~~~----------------------~~~~l~~~~~~~~~~~~~~~~~~~------------------------------- 111 (166) T cd04186 85 TVV----------------------EPGALLELLDAAEQDPDVGIVGPK------------------------------- 111 (166) T ss_pred EEE----------------------CCCHHHHHHHHHHHCCCEEEEEEC------------------------------- T ss_conf 388----------------------839999999999869798999950------------------------------- Q ss_pred CCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEHHH Q ss_conf 885547737701233209888757658822358544688887389999999999999869803352475 Q gi|254780201|r 318 TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCV 386 (394) Q Consensus 318 ~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~v~~~ 386 (394) ..|+.+-+|.++++.+=-. .++|-. + .=+-=+.+=+.++||++.-+..+ T Consensus 112 --------~~g~~~~~rr~~~~~vGgf--de~~~~--~--------~ED~dl~~R~~~~G~~i~~~p~~ 160 (166) T cd04186 112 --------VSGAFLLVRREVFEEVGGF--DEDFFL--Y--------YEDVDLCLRARLAGYRVLYVPQA 160 (166) T ss_pred --------CCEEEEEEEHHHHHHHCCC--CHHHCC--C--------CCHHHHHHHHHHCCCCEEEECCE T ss_conf --------5489999999999985899--768657--4--------65999999999829949994998 No 13 >cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Probab=90.20 E-value=1.5 Score=23.35 Aligned_cols=194 Identities=13% Similarity=0.224 Sum_probs=100.5 Q ss_pred CCCCHHHHHHCCCCEEEEEEECCH-HHHHHHHHHHHHCCCC---EEEEEECCCCHHHHHHHHHHCC--CCCEEEE-ECCC Q ss_conf 221011000001473899994167-7899999999708998---1499971765289999997117--8738999-4566 Q gi|254780201|r 137 PSSPKKSGLTIKSKIAIVVHCYYQ-DTWIEISHILLRLNFD---FDLFVTVVEANKDFEQDVLKYF--PSAQLYV-MENK 209 (394) Q Consensus 137 p~~~~~~~~~~k~kIAVvlHlyY~-DLl~E~~~~L~~ip~~---~DL~ITt~~~~~~~~~~~~~~~--~~a~V~v-v~NR 209 (394) |-.|......-.++|.|++=+|=. +.+.++++.+.+...+ ++++|--+-.... ..++.+.+ +..++.. .+|. T Consensus 17 ~~~~~~~~~~~~P~VsViIP~yNE~~~i~~~l~sl~~q~Yp~~~~eVIvvdD~StD~-T~ei~~~~~~~~~~v~~~~~n~ 95 (251) T cd06439 17 PKPPSLPDPAYLPTVTIIIPAYNEEAVIEAKLENLLALDYPRDRLEIIVVSDGSTDG-TAEIAREYADKGVKLLRFPERR 95 (251) T ss_pred CCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHH-HHHHHHHHHCCCCEEECCCCCC T ss_conf 899999898899935999973898899999999999669998878999997999814-9999999745585043167776 Q ss_pred CHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCC Q ss_conf 60479999862210027867899973687767777644407899999987479989999999997609976898168765 Q gi|254780201|r 210 GRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYR 289 (394) Q Consensus 210 GRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~ 289 (394) |. -+++=.+++ ....|+|+.+.+=- .+ .++.+.+++..| .||++|.|+..... T Consensus 96 Gk-~~AlN~gi~---~a~gd~i~~lDaD~---------------------~~-~~~~l~~l~~~f-~d~~vg~V~g~~~~ 148 (251) T cd06439 96 GK-AAALNRALA---LATGEIVVFTDANA---------------------LL-DPDALRLLVRHF-ADPSVGAVSGELVI 148 (251) T ss_pred CH-HHHHHHHHH---HCCCCEEEECCCCC---------------------CC-CHHHHHHHHHHH-HCCCEEEEEEEEEC T ss_conf 79-999999998---76789898337876---------------------51-876999999986-09983899714870 Q ss_pred CCCCCCCCCCCCCCHHH----HHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 44332345543221389----99999974899885547737701233209888757658822358544688887389999 Q gi|254780201|r 290 RYKRWSFFAKRSEVYRR----VIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 (394) Q Consensus 290 ~~~~~~~~~~~~~~~~~----~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAI 365 (394) ....... . ....+.. +.....+++ ......|+++-+|.++++.+ +.... . = T Consensus 149 ~~~~~~~-~-~~~~y~~~~~~~~~~~s~~~------~~~~~~G~~~~~Rr~~~~~~---------~~~~~------~--E 203 (251) T cd06439 149 VDGGGSG-S-GEGLYWKYENWLKRAESRLG------STVGANGAIYAIRRELFRPL---------PADTI------N--D 203 (251) T ss_pred CCCCCHH-H-HHHHHHHHHHHHHHHHHHCC------CEEEECCHHHHHHHHHHCCC---------CCCCC------C--C T ss_conf 4898037-8-99999999999999998549------92488668999589998097---------87776------5--3 Q ss_pred HHHHHHHHHHCCCEEEEE Q ss_conf 999999999869803352 Q gi|254780201|r 366 ERFFACSVRYTEFSIESV 383 (394) Q Consensus 366 ERl~~~iv~~~gf~i~~v 383 (394) ..-+++-+..+||+|.-. T Consensus 204 D~dl~~rl~~~G~ki~y~ 221 (251) T cd06439 204 DFVLPLRIARQGYRVVYE 221 (251) T ss_pred HHHHHHHHHHCCCEEEEC T ss_conf 899999999869979974 No 14 >pfam11913 DUF3431 Protein of unknown function (DUF3431). This family of proteins are functionally uncharacterized. This protein is found in eukaryotes. Proteins in this family are typically between 291 to 390 amino acids in length. This protein has a conserved NLRC sequence motif. Probab=89.65 E-value=1.2 Score=23.80 Aligned_cols=191 Identities=19% Similarity=0.124 Sum_probs=91.0 Q ss_pred HHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCC----CEEEEECCCCCCCC Q ss_conf 99997089981499971765289999997117873899945666047999986221002786----78999736877677 Q gi|254780201|r 167 SHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY----DYLCKIHGKKSQRE 242 (394) Q Consensus 167 ~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~~Y----D~v~klHtKKS~~~ 242 (394) .+.+.+........|-|.+ -+++...+-.|+||--...|.= +.++| |++.++|+.+-.+ T Consensus 14 t~W~~~~~p~~~~~IY~vd------------~~~~~l~vp~NkGrEam~YLty----IIdnYd~Lpd~~vFlH~~~~~W- 76 (222) T pfam11913 14 TSWLSELLPDWQLAIYTVD------------DPNAPLTVPKNKGREAMAYLTY----IIDHYDHLPDTTVFIHAHRFAW- 76 (222) T ss_pred CCHHHHHCCCCCEEEEECC------------CCCCCCCCCCCCCCHHHHHHHH----HHHHHHCCCCCEEEECCCCHHH- T ss_conf 2057775877742689738------------9876655888777213567999----9877642886469962775121- Q ss_pred CCCCCCCHHHHHHHHHHHCCCHHHHHHH-HHHHHHCCCEEEEECCCCCCCCCCCCCCC---CCCCHHHHHHHH-HHCCCC Q ss_conf 7764440789999998747998999999-99976099768981687654433234554---322138999999-974899 Q gi|254780201|r 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRI-INTFEQNPCLGMIGSRRYRRYKRWSFFAK---RSEVYRRVIDLA-KRAGFP 317 (394) Q Consensus 243 ~~~~~~G~~Wr~~Ll~~LLGs~~~v~~I-l~~F~~~p~lGlv~p~~~~~~~~~~~~~~---~~~~~~~~~~L~-~rlgi~ 317 (394) |....+.=...++.+|- -+.|.+- .-.+.-+...| .|.. .+++..... .........+.. .-++-. T Consensus 77 --Hnd~~~~d~~~~l~~L~--~~~v~~~Gy~NLRC~~~pg--Cp~~---i~p~~~~~~~~~~~~~~~~~~~~~~~lF~~~ 147 (222) T pfam11913 77 --HNDDPDYDNVALLRRLR--LDFVQRNGYVNLRCNWSPG--CPDE---IHPFGPTDPDIERKPEEAVLPQAWEELFPGT 147 (222) T ss_pred --CCCCCCCCHHHHHHHCC--HHHHHHCCEEEEECCCCCC--CCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf --04786523699998667--8776535915721547899--9755---3768877876446717778999999976999 Q ss_pred CCCCCCCCCCEEEEEEEHHHHHH--HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHH Q ss_conf 88554773770123320988875--765882235854468888738999999999999986980335247565 Q gi|254780201|r 318 TKRLHLDFFNGTMFWVKPKCLEP--LRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCVAE 388 (394) Q Consensus 318 ~~~~~~~FpaGSMFW~R~~AL~p--L~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~v~~~~~ 388 (394) ..|+...++.+++|-+..+-++. +-++-.+-+.--+....|..-.+.+|++.-+|= .-..|.|-.+ T Consensus 148 ~vP~~v~~~cCaQFaVsre~Ir~rP~~~Y~~~r~Wll~t~l~D~~sGrv~EylWhiIF-----~~~~v~CP~~ 215 (222) T pfam11913 148 PVPEVVGAPCCAQFAVSRERIRKRPREDYVRYRQWLLETDLDDDLSGRVMEYLWHIIF-----GGEAVYCPDE 215 (222) T ss_pred CCCCCCCCCCEEEEEEEHHHHHHCCHHHHHHHHHHHHCCCCCCCCEEEEEHHHHHHHC-----CCCCCCCCCH T ss_conf 9994106763342326399998689999999999996488998726865011047760-----8986669856 No 15 >PRK11204 N-glycosyltransferase PgaC; Provisional Probab=88.28 E-value=2 Score=22.49 Aligned_cols=192 Identities=17% Similarity=0.221 Sum_probs=105.3 Q ss_pred HHCCCCEEEEEEECCHH-HHHHHHHHHHHCCC-CEEEEEECC--C-CHHHHHHHHHHCCCCCEEEE-ECCCCHHHHHHHH Q ss_conf 00014738999941677-89999999970899-814999717--6-52899999971178738999-4566604799998 Q gi|254780201|r 145 LTIKSKIAIVVHCYYQD-TWIEISHILLRLNF-DFDLFVTVV--E-ANKDFEQDVLKYFPSAQLYV-MENKGRDVRPFLY 218 (394) Q Consensus 145 ~~~k~kIAVvlHlyY~D-Ll~E~~~~L~~ip~-~~DL~ITt~--~-~~~~~~~~~~~~~~~a~V~v-v~NRGRDI~PfL~ 218 (394) ..--+.|.|++=+|-++ .+.+.++.+.+... .++++|-.+ + +..++..+..+..++..|.. -+|+| --+.+=. T Consensus 50 ~~~~P~VsVlIPayNEe~~I~~tI~sll~~~YP~~eIiVVdDgStD~T~~i~~~~~~~~p~~~vi~~~~n~G-Ka~ALN~ 128 (421) T PRK11204 50 LKEYPLVSILVPCYNEGENVEETISAALALRYPNYEVIAINDGSSDNTGEILDRLAAEDPRLRVIHLAENQG-KAIALNT 128 (421) T ss_pred CCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCCCCC-HHHHHHH T ss_conf 889994699984799789999999999958999866999989997408999999970689689997699988-9999999 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCC Q ss_conf 62210027867899973687767777644407899999987479989999999997609976898168765443323455 Q gi|254780201|r 219 LLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA 298 (394) Q Consensus 219 ~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~ 298 (394) +++ ..++|+|+-+-.-- +-+++.+..++..|.+||++|.|.... ...+.. T Consensus 129 gl~---~a~ge~Vv~~DAD~----------------------~~~~d~L~~~v~~f~~dp~VgaV~g~~-~v~n~~---- 178 (421) T PRK11204 129 GAA---AARSEYLVCIDGDA----------------------LLDPDAAAYMVEHFLHNPRVGAVTGNP-RIRNRS---- 178 (421) T ss_pred HHH---HCCCCEEEEECCCC----------------------CCCHHHHHHHHHHHHHCCCCEEEECCC-EEECCC---- T ss_conf 998---43789899989987----------------------148679999999997198643883785-430433---- Q ss_pred CCCCCHHHHHHHHHHCCCCC---C-CCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHH-HHHHH Q ss_conf 43221389999999748998---8-5547737701233209888757658822358544688887389999999-99999 Q gi|254780201|r 299 KRSEVYRRVIDLAKRAGFPT---K-RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF-FACSV 373 (394) Q Consensus 299 ~~~~~~~~~~~L~~rlgi~~---~-~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl-~~~iv 373 (394) +..+.- ...+.....++-. . -.......|+.+-+|.+||+.+-- | ..+|+. |=+ ++.-. T Consensus 179 tlL~r~-q~~Ey~~~~~~~kr~q~~~g~~~~vsG~~~~fRrsaL~~vGG------w------~~~tlt---ED~dls~rl 242 (421) T PRK11204 179 TLLGRI-QVGEFSSIIGLIKRAQRVYGRVFTVSGVITAFRKSALADVGY------W------STDMIT---EDIDISWKL 242 (421) T ss_pred CHHHHH-HHHHHHHHHHHHHHHHHHHCCCEEECCEEEEEHHHHHHHHCC------C------CCCCHH---HHHHHHHHH T ss_conf 388875-799999999999999998388145277212204999998289------7------998522---479999999 Q ss_pred HHCCCEEEEE Q ss_conf 9869803352 Q gi|254780201|r 374 RYTEFSIESV 383 (394) Q Consensus 374 ~~~gf~i~~v 383 (394) +..|+++.-+ T Consensus 243 ~~~G~ri~y~ 252 (421) T PRK11204 243 QLRGWDIRYE 252 (421) T ss_pred HHCCCEEEEC T ss_conf 9879969953 No 16 >TIGR02699 archaeo_AfpA archaeoflavoprotein AfpA; InterPro: IPR014072 The prototypical member of this archaeal protein family is AF1518 from Archaeoglobus fulgidus. This homodimer with two non-covalently bound FMN cofactors can receive electrons from ferredoxin, but not from a number of other electron donors such as NADH or rubredoxin. It can then donate electrons to various reductases.. Probab=88.02 E-value=0.64 Score=25.55 Aligned_cols=65 Identities=22% Similarity=0.329 Sum_probs=46.6 Q ss_pred HHHHHHHHHHHHC----CC--CEEEEEECCCCHHH------HHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCC Q ss_conf 7899999999708----99--81499971765289------999997117873899945666047999986221002786 Q gi|254780201|r 161 DTWIEISHILLRL----NF--DFDLFVTVVEANKD------FEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY 228 (394) Q Consensus 161 DLl~E~~~~L~~i----p~--~~DL~ITt~~~~~~------~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~~Y 228 (394) |-|+|..+-+..+ +. ..|+|+|= +-++ .=..+...|++..|.+=+| +|||.+ ..++..| T Consensus 11 d~L~Et~~im~~vK~~y~~rV~~~vfLSK--~G~~VvKyY~L~~~L~~~F~~~~Ve~~AN-----sPFLaG--~lQ~G~Y 81 (176) T TIGR02699 11 DKLPETVEIMKDVKKKYGDRVEIDVFLSK--AGEQVVKYYKLWDKLEEDFDDLRVEINAN-----SPFLAG--QLQLGKY 81 (176) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEEEEC--CCCEEEEHHHHHHHHHHHCCCEEEEECCC-----CCCCCC--CCCCCEE T ss_conf 84167888889975205987899998605--86745316676788863106405870786-----781006--4134601 Q ss_pred CEEEEE Q ss_conf 789997 Q gi|254780201|r 229 DYLCKI 234 (394) Q Consensus 229 D~v~kl 234 (394) |+++=- T Consensus 82 dflLvA 87 (176) T TIGR02699 82 DFLLVA 87 (176) T ss_pred EEEEEC T ss_conf 102215 No 17 >PRK05454 glucosyltransferase MdoH; Provisional Probab=87.39 E-value=2.3 Score=22.16 Aligned_cols=120 Identities=21% Similarity=0.328 Sum_probs=76.0 Q ss_pred HHHHHHCCCCEEEEEEECCHHHHHHHH------HHHHHC--CCCEEEEEECCCCHHH----HHH---HHHHCCCC-CEEE Q ss_conf 110000014738999941677899999------999708--9981499971765289----999---99711787-3899 Q gi|254780201|r 141 KKSGLTIKSKIAIVVHCYYQDTWIEIS------HILLRL--NFDFDLFVTVVEANKD----FEQ---DVLKYFPS-AQLY 204 (394) Q Consensus 141 ~~~~~~~k~kIAVvlHlyY~DLl~E~~------~~L~~i--p~~~DL~ITt~~~~~~----~~~---~~~~~~~~-a~V~ 204 (394) .........|.||++-+|-+|-..-+. +-|+.. ...||+||=.++.+.+ .++ .+.+..+. .+|. T Consensus 115 ~~~~~~~~~rtAilmPiyNEd~~~vfa~l~Am~~sl~~tg~~~~Fd~fILSDT~dp~i~~~Ee~a~~~L~~~~~~~~ri~ 194 (663) T PRK05454 115 GDEPPNPEARTAILMPIYNEDPARVFAGLRAMYESLAATGHLAHFDFFILSDTTDPDIAAAEEAAWLELRAELGGEGRIF 194 (663) T ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCEE T ss_conf 77788877776999517789999999999999999997188666038997489976689999999999998658878605 Q ss_pred ---EECCCCHH---HHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf ---94566604---799998622100278678999736877677776444078999999874799899999999976099 Q gi|254780201|r 205 ---VMENKGRD---VRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNP 278 (394) Q Consensus 205 ---vv~NRGRD---I~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p 278 (394) ...|.||- |+-|..-++ .+|||..-+.- |+++ +.+.+.+.+..++.|| T Consensus 195 YRRR~~N~~rKaGNIaDf~rrwG----~~Y~~MiVLDA---------------------DSlM-sG~ti~~lv~~Me~~P 248 (663) T PRK05454 195 YRRRRRNVKRKAGNIADFCRRWG----GRYDYMVVLDA---------------------DSLM-SGDTLVRLVRLMEANP 248 (663) T ss_pred EECCCCCCCCCCCCHHHHHHHCC----CCCCEEEEECC---------------------CCCC-CHHHHHHHHHHHHHCC T ss_conf 53155355777756999998608----87752666234---------------------1346-7499999999986499 Q ss_pred CEEEEECC Q ss_conf 76898168 Q gi|254780201|r 279 CLGMIGSR 286 (394) Q Consensus 279 ~lGlv~p~ 286 (394) ++||+-.. T Consensus 249 ~~GliQT~ 256 (663) T PRK05454 249 RAGLIQTL 256 (663) T ss_pred CCCEECCC T ss_conf 86451156 No 18 >PRK11498 bcsA cellulose synthase catalytic subunit; Provisional Probab=86.52 E-value=2.6 Score=21.87 Aligned_cols=148 Identities=14% Similarity=0.180 Sum_probs=76.6 Q ss_pred CEEEEEECCCCHHH-HHH----HHHHCCCC--CEEEEECCCCHHHHHHHHHHCCCCCCCCCEEEE---ECCCCCCCCCCC Q ss_conf 81499971765289-999----99711787--389994566604799998622100278678999---736877677776 Q gi|254780201|r 176 DFDLFVTVVEANKD-FEQ----DVLKYFPS--AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCK---IHGKKSQREGYH 245 (394) Q Consensus 176 ~~DL~ITt~~~~~~-~~~----~~~~~~~~--a~V~vv~NRGRDI~PfL~~~~~~~l~~YD~v~k---lHtKKS~~~~~~ 245 (394) ..|+||.|-.|..+ ++. ...-++|. .+|.+..--+|+-. ..+.++ + +-.|+-. -|-|- T Consensus 274 tVDIfIPTYNEp~~vvr~Ti~aa~~iDYP~dKl~VyvLDDG~R~e~---r~lAee-l-Gv~YitR~~N~HAKA------- 341 (858) T PRK11498 274 TVDIFVPTYNEDLNVVKNTIYASLGIDWPKDKLNIWILDDGGREEF---RQFAQE-V-GVGYIARPTHEHAKA------- 341 (858) T ss_pred EEEEEEECCCCCHHHHHHHHHHHHHCCCCHHHCEEEEECCCCCHHH---HHHHHH-H-CCCEEECCCCCCCCC------- T ss_conf 1005764589988999999999982899801176999329999899---999998-5-997673688866642------- Q ss_pred CCCCHHHHHHHHH------------HHCCCHHHHHHHHHHHHHCCCEEEEE-CCCCCCCCCCCCCCCCCC--CHHHHHHH Q ss_conf 4440789999998------------74799899999999976099768981-687654433234554322--13899999 Q gi|254780201|r 246 PIEGIIWRRWLFF------------DLLGFSDIAIRIINTFEQNPCLGMIG-SRRYRRYKRWSFFAKRSE--VYRRVIDL 310 (394) Q Consensus 246 ~~~G~~Wr~~Ll~------------~LLGs~~~v~~Il~~F~~~p~lGlv~-p~~~~~~~~~~~~~~~~~--~~~~~~~L 310 (394) |+ -++-+. .-+..++...+.+..|.+||++++|- |+.++-++|+.-+-+... .++ .++ T Consensus 342 ---GN--LNhAL~~T~GelVaIfDADhVP~r~FL~rTvG~Fl~DpklalVQTPq~FyNpDPfqrNLg~~~~~P~E--~el 414 (858) T PRK11498 342 ---GN--INNALKYAKGEFVAIFDCDHVPTRSFLQMTMGWFLKDKKLAMMQTPHHFFSPDPFERNLGRFRKTPNE--GTL 414 (858) T ss_pred ---CC--HHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHCCCCCCCCCC--HHH T ss_conf ---26--78888645887899963888874789998668772288648986883337998254412245788982--667 Q ss_pred HHHCCCCCCC-CCCCCCCEEEEEEEHHHHHHHH Q ss_conf 9974899885-5477377012332098887576 Q gi|254780201|r 311 AKRAGFPTKR-LHLDFFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 311 ~~rlgi~~~~-~~~~FpaGSMFW~R~~AL~pL~ 342 (394) --+.=-+-.| -+.-|..||.+-.|.+||..+= T Consensus 415 FY~~IQ~g~D~wnAaffCGS~AVlRR~AL~eIG 447 (858) T PRK11498 415 FYGLVQDGNDMWDATFFCGSCAVIRRKPLDEIG 447 (858) T ss_pred HHHHHCCCCCCCCEEEEECCHHHHHHHHHHHHC T ss_conf 889863864345706996307999799999828 No 19 >cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we Probab=85.50 E-value=2.9 Score=21.55 Aligned_cols=190 Identities=10% Similarity=0.112 Sum_probs=95.3 Q ss_pred CEEEEEEECCH-HHHHHHHHHHHHCCCCEE---EEEECC-CC-HHHHHHHHHHCCC----CCEEEE-ECCCCHHHHHHHH Q ss_conf 73899994167-789999999970899814---999717-65-2899999971178----738999-4566604799998 Q gi|254780201|r 150 KIAIVVHCYYQ-DTWIEISHILLRLNFDFD---LFVTVV-EA-NKDFEQDVLKYFP----SAQLYV-MENKGRDVRPFLY 218 (394) Q Consensus 150 kIAVvlHlyY~-DLl~E~~~~L~~ip~~~D---L~ITt~-~~-~~~~~~~~~~~~~----~a~V~v-v~NRGRDI~PfL~ 218 (394) +|.|++=+|=. +.+.++++.+.+...+-| ++|--+ ++ ..++..+....+. ++.... -+|+|.=-+.+=. T Consensus 2 ~VSViIPa~NE~~~I~~~l~sl~~q~YP~~~~~I~VvDdstD~t~~~~~~~~~~~~~~~~~~~~~~~~~~~g~Ka~aln~ 81 (232) T cd06437 2 MVTVQLPVFNEKYVVERLIEAACALDYPKDRLEIQVLDDSTDETVRLAREIVEEYAAQGVNIKHVRRADRTGYKAGALAE 81 (232) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH T ss_conf 39999955898899999999999679999808999997999669999999997666419968998369888866999999 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCC Q ss_conf 62210027867899973687767777644407899999987479989999999997609976898168765443323455 Q gi|254780201|r 219 LLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA 298 (394) Q Consensus 219 ~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~ 298 (394) ++. ..++|+|+.+-.==. + .++.+..++..| ++|++|+|....... +.. . T Consensus 82 gl~---~a~gd~i~~~DaD~~---------------------~-~~d~L~~~~~~f-~~~~~g~v~~~~~~~-n~~---~ 131 (232) T cd06437 82 GMK---VAKGEYVAIFDADFV---------------------P-PPDFLQKTPPYF-ADPKLGFVQTRWGHI-NAN---Y 131 (232) T ss_pred HHH---HCCCCEEEEECCCCC---------------------C-CHHHHHHHHHHH-CCCCEEEEECCCCCC-CCC---C T ss_conf 999---778988999776447---------------------3-847999999983-199859996664325-888---6 Q ss_pred CCCCCHHHHH----HHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 4322138999----999974899885547737701233209888757658822358544688887389999999999999 Q gi|254780201|r 299 KRSEVYRRVI----DLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVR 374 (394) Q Consensus 299 ~~~~~~~~~~----~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~ 374 (394) ++......+. ....+.+.... .......|++.-+|.++|+.+-..+ ++ ++.. ..-+++-.. T Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~G~~~~~Rr~~l~~~Gg~~--~~----------~~~E--D~~l~~r~~ 196 (232) T cd06437 132 SLLTRVQAMSLDYHFTIEQVARSST-GLFFNFNGTAGVWRKECIEDAGGWN--HD----------TLTE--DLDLSYRAQ 196 (232) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHC-CCCEECCCEEEEEEHHHHHHHCCCC--CC----------CCCH--HHHHHHHHH T ss_conf 7999999999999999999998825-9934126515877599999857998--86----------6101--899999999 Q ss_pred HCCCEEEEEH Q ss_conf 8698033524 Q gi|254780201|r 375 YTEFSIESVD 384 (394) Q Consensus 375 ~~gf~i~~v~ 384 (394) ..||++.-+. T Consensus 197 ~~G~ri~~~~ 206 (232) T cd06437 197 LKGWKFVYLD 206 (232) T ss_pred HCCCEEEEEC T ss_conf 8799899806 No 20 >TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano Probab=84.39 E-value=3.2 Score=21.24 Aligned_cols=195 Identities=16% Similarity=0.200 Sum_probs=104.3 Q ss_pred CCCCEEEEEEECCHH-HHHHHHHHHHHCCCC-EEEEEECCC--C-HHHHHHHHHHCCCCCEEEEEC---CCC--HHHHHH Q ss_conf 014738999941677-899999999708998-149997176--5-289999997117873899945---666--047999 Q gi|254780201|r 147 IKSKIAIVVHCYYQD-TWIEISHILLRLNFD-FDLFVTVVE--A-NKDFEQDVLKYFPSAQLYVME---NKG--RDVRPF 216 (394) Q Consensus 147 ~k~kIAVvlHlyY~D-Ll~E~~~~L~~ip~~-~DL~ITt~~--~-~~~~~~~~~~~~~~a~V~vv~---NRG--RDI~Pf 216 (394) .-+.|.|++=++=.| -+++.++.+-....+ +.+++.+++ + ..++.+++.+.+|.+.++++. +.| .-+..+ T Consensus 39 ~~P~VSil~P~~~e~~~l~~~L~sl~~qdYP~~evi~~vd~~~D~a~~i~~~l~~~~p~~~~~~v~~~~~~g~n~K~~~L 118 (373) T TIGR03472 39 AWPPVSVLKPLHGDEPELYENLASFCRQDYPGFQMLFGVQDPDDPALAVVRRLRADFPDADIDLVIDARRHGPNRKVSNL 118 (373) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHH T ss_conf 99980799807998304999999998678998289999789999789999999986899987998588767887899999 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCC Q ss_conf 98622100278678999736877677776444078999999874799899999999976099768981687654433234 Q gi|254780201|r 217 LYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSF 296 (394) Q Consensus 217 L~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~ 296 (394) ..+.. ..+||+++..-+-=. + +++.+.+++..| ++|++|+|.......... . T Consensus 119 ~~~~~---~a~~dil~~~DaD~~---------------------~-~p~~L~~lVa~l-~~~~vglVt~~~~~~~~~--~ 170 (373) T TIGR03472 119 INMLP---HARHDILVIADSDIS---------------------V-GPDYLRQVVAPL-ADPDVGLVTCLYRGRPVP--G 170 (373) T ss_pred HHHHH---HCCCCEEEEECCCCC---------------------C-CHHHHHHHHHHH-HCCCCCEEECCCCCCCCC--C T ss_conf 99998---627888999879975---------------------2-848999999997-489977583256467888--7 Q ss_pred CCCCCCCHHHHHH------HHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 5543221389999------9997489988554773770123320988875765882235854468888738999999999 Q gi|254780201|r 297 FAKRSEVYRRVID------LAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFA 370 (394) Q Consensus 297 ~~~~~~~~~~~~~------L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~ 370 (394) +......-..... .+..++ ...|..|...-+|.++|+.+=...-..+...| |-. ++ T Consensus 171 ~~~~l~~~~~~~~~~~~~~~~~~~~------~~~~~~G~~~~~RR~~l~~iGG~~~~~~~laE----D~~--------L~ 232 (373) T TIGR03472 171 FWSRLGAMGINHNFLPSVMVARALG------RARFCFGATMALRRATLEAIGGLAALAHHLAD----DYW--------LG 232 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC------CCCCCCCCCCEEEHHHHHHCCCHHHHHCCCHH----HHH--------HH T ss_conf 8999999999998888899997614------65335674011359999974797777215256----899--------99 Q ss_pred HHHHHCCCEEEEEHHHH Q ss_conf 99998698033524756 Q gi|254780201|r 371 CSVRYTEFSIESVDCVA 387 (394) Q Consensus 371 ~iv~~~gf~i~~v~~~~ 387 (394) -.++..|+++.-.+... T Consensus 233 ~~i~~~G~rv~l~~~~v 249 (373) T TIGR03472 233 ELVRALGLRVVLAPVVV 249 (373) T ss_pred HHHHHCCCCEEECCHHH T ss_conf 99996699889656266 No 21 >PTZ00260 glycosyl transferase group 2; Provisional Probab=84.04 E-value=3.4 Score=21.15 Aligned_cols=77 Identities=8% Similarity=0.107 Sum_probs=45.6 Q ss_pred HHHHHHHHHHHHHC-----CCCEEEEEECC---CCHHHHHHHHHHCCCCCEEEEE---CCCCHHHHHHHHHHCCCCCCCC Q ss_conf 77899999999708-----99814999717---6528999999711787389994---5666047999986221002786 Q gi|254780201|r 160 QDTWIEISHILLRL-----NFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVM---ENKGRDVRPFLYLLELGVFDRY 228 (394) Q Consensus 160 ~DLl~E~~~~L~~i-----p~~~DL~ITt~---~~~~~~~~~~~~~~~~a~V~vv---~NRGRDI~PfL~~~~~~~l~~Y 228 (394) +-+++|+++||+.- ...|.++|--+ +...++..+..+.++...|+++ .|||--= +.-.++ ....- T Consensus 95 ~~mL~e~~~yL~~~~~~~~~~~yEIIVVDDGStD~T~eVa~~~~~~~~~~~IRvl~l~kNrGKG~-AVr~Gm---l~ArG 170 (336) T PTZ00260 95 GRTLEATFKHFESRNREDPKFIYEIIIINDGSKDKTLKVAKKYWKSINNKNFRLLTYNRNRGKGG-AVKLGM---LASAG 170 (336) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCHH-HHHHHH---HHHCC T ss_conf 88999999999861045889867999981699988699999999976998099997167778469-999999---98179 Q ss_pred CEEEEECCCCCC Q ss_conf 789997368776 Q gi|254780201|r 229 DYLCKIHGKKSQ 240 (394) Q Consensus 229 D~v~klHtKKS~ 240 (394) ++++..----|. T Consensus 171 ~~ILfaDADgAt 182 (336) T PTZ00260 171 KYQLMMDADGAT 182 (336) T ss_pred CEEEEEECCCCC T ss_conf 889999478898 No 22 >PRK09450 cyaA adenylate cyclase; Provisional Probab=80.93 E-value=2.8 Score=21.59 Aligned_cols=69 Identities=14% Similarity=0.125 Sum_probs=34.0 Q ss_pred CCCCCCCEEEEEEEECC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHHH Q ss_conf 01585643999973137-74100732687187559999997302664202467822101100000147389999416778 Q gi|254780201|r 84 KLSFPSCRIFFYGSRKE-QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDT 162 (394) Q Consensus 84 ~~~~~~~~ivfinaWNE-EG~yLePD~~~G~~yL~a~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DL 162 (394) ++.-..-.+|..||||| -..+-|- +.+-|+|++..+-.. .++. ..|. .|-+.+|=.-+ T Consensus 589 enLVgSiD~i~rNSWnEv~t~hFeG----~~AlL~aLk~il~~~---~~~~-~~p~-------------~v~V~cys~~~ 647 (833) T PRK09450 589 ENLVGSVDLIYRNSWNEVRTLHFEG----EQALLDALKTILGKM---HQDA-APPP-------------SVEVFCYSQHL 647 (833) T ss_pred HCCEEEEHEEEECCCCHHEEEECCC----HHHHHHHHHHHHHHC---CCCC-CCCC-------------CEEEEECCCCH T ss_conf 1011121100213401200332068----179999999998633---7788-9998-------------53799766025 Q ss_pred HHHHHHHHHHC Q ss_conf 99999999708 Q gi|254780201|r 163 WIEISHILLRL 173 (394) Q Consensus 163 l~E~~~~L~~i 173 (394) -.+|..+...+ T Consensus 648 r~~i~~~V~~L 658 (833) T PRK09450 648 RGLIRTRVQQL 658 (833) T ss_pred HHHHHHHHHHH T ss_conf 79999999999 No 23 >TIGR02918 TIGR02918 conserved hypothetical protein TIGR02918; InterPro: IPR014267 Members of this entry are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus.. Probab=80.61 E-value=2.1 Score=22.40 Aligned_cols=290 Identities=16% Similarity=0.145 Sum_probs=157.0 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC------------------C-CCCCEEECCC-HHHHHHHHHHHHHH Q ss_conf 99988787630122111553268874057888878------------------9-8861575689-78999999999999 Q gi|254780201|r 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQ------------------R-ITSKDVHFQE-LSIFESFIFWLRSF 76 (394) Q Consensus 17 ~~~~~~~~~~~~~~~~y~p~~~pg~~~~WDntpR~------------------~-~~~~~~~~~t-P~~f~~~~~wl~~~ 76 (394) -++|+|...--|+.+.|+-+ -||.|+= . ++..||+... --+|.|. .|... T Consensus 137 ~LiRKDfYSYTr~fsEYYAP--------~Dn~A~LY~R~FYNEDGSIAY~e~~dqG~~s~F~F~dG~iLYsK~--eliay 206 (511) T TIGR02918 137 KLIRKDFYSYTRVFSEYYAP--------ADNKAKLYQRTFYNEDGSIAYEEYVDQGDESVFVFKDGKILYSKQ--ELIAY 206 (511) T ss_pred EEEEEECCCCEEEEEEECCC--------CCCCCEEEECEEECCCCCHHHHHCCCCCCCEEEECCCCCEECCHH--HHHHH T ss_conf 22423113413235641177--------679854710112425883434420379884367837974541869--99999 Q ss_pred HHHHHHHCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEE Q ss_conf 99975100158564399997313774100732687187559999997302664202467822101100000147389999 Q gi|254780201|r 77 LAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVH 156 (394) Q Consensus 77 l~~~~~~~~~~~~~~ivfinaWNEEG~yLePD~~~G~~yL~a~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlH 156 (394) . .+.... ..++ ||.+ |+--|...| .+.|+.+ +.....|+|||+| T Consensus 207 F--l~~L~L-T~~D-i~Il------------DR~~~dknl-iiGQ~Vl-------------------~Nkg~AklgVVvH 250 (511) T TIGR02918 207 F--LKQLNL-TKKD-IIIL------------DRSTGDKNL-IIGQAVL-------------------ENKGPAKLGVVVH 250 (511) T ss_pred H--HHHCCC-CCCC-EEEE------------ECCCCCCEE-EEEEEEE-------------------EECCCEEEEEEEE T ss_conf 9--996069-8566-8999------------736899305-8713888-------------------7508702899997 Q ss_pred ECCHH---------HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCC----CCCEE--------EEECC-CCHHHH Q ss_conf 41677---------8999999997089981499971765289999997117----87389--------99456-660479 Q gi|254780201|r 157 CYYQD---------TWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYF----PSAQL--------YVMEN-KGRDVR 214 (394) Q Consensus 157 lyY~D---------Ll~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~----~~a~V--------~vv~N-RGRDI~ 214 (394) .=|-+ ||.=+.+|.=+.....|-|||-|+.++++..+-++.. |.+.- ..-|. .||-=- T Consensus 251 AEHf~~~~t~e~~ILWNNyYEYqF~nA~~iDFFItATd~Q~~~l~~QF~kY~~~~P~i~TIPVGSl~~L~~p~W~~Rkp~ 330 (511) T TIGR02918 251 AEHFSESATNETYILWNNYYEYQFSNADYIDFFITATDIQNQILLEQFKKYKNIEPKIYTIPVGSLDELQYPEWQERKPF 330 (511) T ss_pred EEECCCCCCCCCCCCCCCCHHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHCCCCCCCCCCCH T ss_conf 46368886168602214652553156866511340057889999998887479987788850575433578863464521 Q ss_pred HHHHHHCCCCCCC-CCEEEE--ECCCCC-CCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCC- Q ss_conf 9998622100278-678999--736877-67777644407899999987479989999999997609976898168765- Q gi|254780201|r 215 PFLYLLELGVFDR-YDYLCK--IHGKKS-QREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYR- 289 (394) Q Consensus 215 PfL~~~~~~~l~~-YD~v~k--lHtKKS-~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~- 289 (394) +++.+=| +.-.+ =|.+.+ |--||| |+... .+-|++==+.+|.+|+. +..++.=|.. .-|.+|--|+-.... T Consensus 331 SiiTASR-LA~EKHiDWLV~AVv~Ak~~~P~l~F-DIYG~GgE~~~L~~iI~-~n~A~DYI~L-kGH~~L~~vY~~Yely 406 (511) T TIGR02918 331 SIITASR-LAKEKHIDWLVKAVVKAKKSVPELTF-DIYGEGGEKSKLKKIIN-ENQAEDYIRL-KGHKNLSEVYKDYELY 406 (511) T ss_pred HHHHHHH-HCCCCCHHHHHHHHHHHCCCCCCEEE-EECCCCHHHHHHHHHHH-HHCCCCHHHH-CCCCCHHHHHCCCCEE T ss_conf 5677734-13767126888999995133885110-00356378899999876-3120012431-1543356662323223 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCC-CCCCC-CCCCCCHHHHHHHH Q ss_conf 44332345543221389999999748998855477377012332098887576588223-58544-68888738999999 Q gi|254780201|r 290 RYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEE-RNLKDGALEHAVER 367 (394) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~-DF~~E-~g~~DGTlaHAIER 367 (394) +.... ...+|. .|++-.|=-..---++.|.|+==. .+-=.|-=|.. ++.+. .-++--.+||+|=+ T Consensus 407 LsaSt---SEGFGL-----TLmEAvGSGLgmIGFDV~YGN~TF-----I~d~~NGYLIP~~~~~~~~~~I~~~lA~~Iv~ 473 (511) T TIGR02918 407 LSAST---SEGFGL-----TLMEAVGSGLGMIGFDVNYGNPTF-----IKDNKNGYLIPIDEEEDDEDEIITALAEKIVE 473 (511) T ss_pred EEECC---CCCCHH-----HHHHHHHCCCCCCCCCCCCCCCCE-----EECCCCCEECCCCCCCCCHHHHHHHHHHHHHH T ss_conf 45212---144115-----799997504332366187438870-----24088840433457878878999999999898 Q ss_pred HH Q ss_conf 99 Q gi|254780201|r 368 FF 369 (394) Q Consensus 368 l~ 369 (394) +| T Consensus 474 ~F 475 (511) T TIGR02918 474 YF 475 (511) T ss_pred HH T ss_conf 61 No 24 >COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Probab=79.39 E-value=4.9 Score=20.12 Aligned_cols=197 Identities=13% Similarity=0.192 Sum_probs=113.8 Q ss_pred CCCEEEEEEECCHHH--HHHHHHHHHHCCCC-EEEEEECCCC---HHHHHHHHHHCC-CCCEEEEE-CCCCHHHHHHHHH Q ss_conf 147389999416778--99999999708998-1499971765---289999997117-87389994-5666047999986 Q gi|254780201|r 148 KSKIAIVVHCYYQDT--WIEISHILLRLNFD-FDLFVTVVEA---NKDFEQDVLKYF-PSAQLYVM-ENKGRDVRPFLYL 219 (394) Q Consensus 148 k~kIAVvlHlyY~DL--l~E~~~~L~~ip~~-~DL~ITt~~~---~~~~~~~~~~~~-~~a~V~vv-~NRGRDI~PfL~~ 219 (394) .++|.|++=.|-+|. +++++..+.+...+ +++++..+.. ..++..+..... ++..+... .+++-..+.+-.+ T Consensus 53 ~p~vsviiP~ynE~~~~~~~~l~s~~~~dyp~~evivv~d~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~gK~~al~~~ 132 (439) T COG1215 53 LPKVSVIIPAYNEEPEVLEETLESLLSQDYPRYEVIVVDDGSTDETYEILEELGAEYGPNFRVIYPEKKNGGKAGALNNG 132 (439) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHH T ss_conf 87269998369998678999999998278987159999689983099999999744378579996244555248999988 Q ss_pred HCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 22100278678999736877677776444078999999874799899999999976099768981687654433234554 Q gi|254780201|r 220 LELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAK 299 (394) Q Consensus 220 ~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~ 299 (394) +.. .++|+|+.+-+=- ...++.+...+..|++++..+.++.+..... .+. .+ T Consensus 133 l~~---~~~d~V~~~DaD~----------------------~~~~d~l~~~~~~f~~~~~~~v~~~~~~~~~-~~~--~~ 184 (439) T COG1215 133 LKR---AKGDVVVILDADT----------------------VPEPDALRELVSPFEDPPVGAVVGTPRIRNR-PDP--SN 184 (439) T ss_pred HHH---CCCCEEEEECCCC----------------------CCCCHHHHHHHHHHCCCCEEEEEECCEECCC-CCC--HH T ss_conf 750---5788899983887----------------------7871199999997325880499955632145-563--01 Q ss_pred CCCCHHHHHH-----HHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 3221389999-----99974899885547737701233209888757658822358544688887389999999999999 Q gi|254780201|r 300 RSEVYRRVID-----LAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVR 374 (394) Q Consensus 300 ~~~~~~~~~~-----L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~ 374 (394) ....-+.... ...+.+.. .....+..|+..-+|.+||+..-.. .++.-.| | +-++..++ T Consensus 185 ~l~~~~~~~~~~~~~~~~~~~~~--~g~~~~~~G~~~~~rr~aL~~~g~~--~~~~i~E----D--------~~lt~~l~ 248 (439) T COG1215 185 LLGRIQAIEYLSAFYFRLRAASK--GGLISFLSGSSSAFRRSALEEVGGW--LEDTITE----D--------ADLTLRLH 248 (439) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--CCCEEECCCCHHEEEHHHHHHHCCC--CCCEEEC----C--------HHHHHHHH T ss_conf 77889999999999888999985--8974652672130339999982785--7754541----5--------99999999 Q ss_pred HCCCEEEEEHHHHH Q ss_conf 86980335247565 Q gi|254780201|r 375 YTEFSIESVDCVAE 388 (394) Q Consensus 375 ~~gf~i~~v~~~~~ 388 (394) ..|+++.-+..... T Consensus 249 ~~G~~~~~~~~~~~ 262 (439) T COG1215 249 LRGYRVVYVPEAIV 262 (439) T ss_pred HCCCEEEEECCCEE T ss_conf 88988999457457 No 25 >cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol. Probab=75.59 E-value=3.4 Score=21.13 Aligned_cols=40 Identities=20% Similarity=0.156 Sum_probs=22.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHH Q ss_conf 98149997176528999999711787389994566604799998 Q gi|254780201|r 175 FDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218 (394) Q Consensus 175 ~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~ 218 (394) ..+|.+|++++..++ .+.+......+.+++| |-|...|-. T Consensus 147 ~~~d~ii~~S~~~~~---~l~~~~~~~~i~vI~n-gvd~~~f~~ 186 (374) T cd03817 147 NRCDAVIAPSEKIAD---LLREYGVKRPIEVIPT-GIDLDRFEP 186 (374) T ss_pred HHCCEEEECHHHHHH---HHHHCCCCCCEEEECC-CCCHHHCCC T ss_conf 859999978099999---9997089998899869-606664398 No 26 >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=73.54 E-value=7.2 Score=19.14 Aligned_cols=183 Identities=12% Similarity=0.120 Sum_probs=87.2 Q ss_pred CCHHHHHHHHHHHHHCCC-CEEEEEECCCCHHHHHHHHHHCCCCCEEEE--ECCCCHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 167789999999970899-814999717652899999971178738999--45666047999986221002786789997 Q gi|254780201|r 158 YYQDTWIEISHILLRLNF-DFDLFVTVVEANKDFEQDVLKYFPSAQLYV--MENKGRDVRPFLYLLELGVFDRYDYLCKI 234 (394) Q Consensus 158 yY~DLl~E~~~~L~~ip~-~~DL~ITt~~~~~~~~~~~~~~~~~a~V~v--v~NRGRDI~PfL~~~~~~~l~~YD~v~kl 234 (394) =.++.++++++.+.+-.. ++.++|- +....+-..++.+.+....+++ -+|+|.= .++=.++. ..+.||++.+ T Consensus 8 N~~~~l~~~l~Si~~Q~~~~~EiIvV-Dd~S~D~t~~~~~~~~~~~~~~~~~~~~G~~-~a~N~g~~---~a~g~yi~~l 82 (202) T cd06433 8 NQAETLEETIDSVLSQTYPNIEYIVI-DGGSTDGTVDIIKKYEDKITYWISEPDKGIY-DAMNKGIA---LATGDIIGFL 82 (202) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEE-ECCCCCCHHHHHHHCCCCEEEEEECCCCCHH-HHHHHHHH---HCCCCEEECC T ss_conf 98899999999998378999799999-7998842344533113412599988888889-99873277---5485553224 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 36877677776444078999999874799899999999976099768981687654433234554322138999999974 Q gi|254780201|r 235 HGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRA 314 (394) Q Consensus 235 HtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rl 314 (394) +.== ++ .++.+...+..+.++|..++|++........................+.. T Consensus 83 d~DD---------------------~~-~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 138 (202) T cd06433 83 NSDD---------------------TL-LPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLY-- 138 (202) T ss_pred CCCC---------------------EE-CCHHHHHHHHHHHHCCCCCEEECCEEEECCCCCEEEEECCCCCCHHHHHC-- T ss_conf 8886---------------------26-84499999999987899409993469986999687542266536788854-- Q ss_pred CCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEH-HHHHHH Q ss_conf 8998855477377012332098887576588223585446888873899999999999998698033524-756577 Q gi|254780201|r 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD-CVAEYE 390 (394) Q Consensus 315 gi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~v~-~~~~~~ 390 (394) ...++..+|| +|.++++.+ +. |+++ +.++-.--+.+=+.++|+++.-+. ++..|| T Consensus 139 -------~~~~~~~~~~-~r~~~~~~i---g~---fd~~-------~~~~eD~d~~~R~~~~g~~i~~ip~~l~~yR 194 (202) T cd06433 139 -------GMPICHQATF-FRRSLFEKY---GG---FDES-------YRIAADYDLLLRLLLAGKIFKYLPEVLAAFR 194 (202) T ss_pred -------CCCCCCCCEE-EEEEEHHHH---CC---CCCC-------CCCCHHHHHHHHHHHCCCCEEECCCCEEEEE T ss_conf -------6886665135-640304550---89---8867-------6802299999999983993998487789994 No 27 >KOG2571 consensus Probab=73.04 E-value=2.8 Score=21.62 Aligned_cols=160 Identities=18% Similarity=0.237 Sum_probs=89.1 Q ss_pred EEEEECCC-CHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCC---------------HH Q ss_conf 89994566-604799998622100278678999736877677776444078999999874799---------------89 Q gi|254780201|r 202 QLYVMENK-GRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF---------------SD 265 (394) Q Consensus 202 ~V~vv~NR-GRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs---------------~~ 265 (394) .+..+.|+ |||.-|.+..+.. ... |.|| +.+-.|--+.+--.|+. +. T Consensus 394 ~~~~~~~~~~r~~v~i~m~~~~-Ki~--------~~kr--------w~~~r~~~y~~~~~L~~~v~~il~vD~dT~~~P~ 456 (862) T KOG2571 394 EGSLAENRPGRDSVPILLKFCL-KIR--------HKKR--------WNQHRWVMYTAFKALMPSVDYILVVDADTRLDPD 456 (862) T ss_pred CCCCCCCCCCCCCEEEHHHHHH-HHH--------HHHH--------HHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCH T ss_conf 5501446899870420245678-899--------8766--------7789999999999846762389996389852847 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHH-HHHHCCCCCCCCCCCC---CCEEEEEEEHHHHHHH Q ss_conf 99999999760997689816876544332345543221389999-9997489988554773---7701233209888757 Q gi|254780201|r 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID-LAKRAGFPTKRLHLDF---FNGTMFWVKPKCLEPL 341 (394) Q Consensus 266 ~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~-L~~rlgi~~~~~~~~F---paGSMFW~R~~AL~pL 341 (394) .+-..++.|+.||++|=++-+....++.|..+...++= .+.. |.+-.. ..+.+ ..|-+--+|-+||..= T Consensus 457 ai~~lv~~f~~dp~VggaCG~I~~~~~~w~v~~Q~FEY--~Ish~l~Ka~E-----SvFG~VsclPGcfs~yR~~aL~~~ 529 (862) T KOG2571 457 ALYHLVKVFDEDPQVGGACGRILNKGGSWVVAYQNFEY--AISHNLQKATE-----SVFGCVSCLPGCFSLYRASALMDQ 529 (862) T ss_pred HHHHHHHHHCCCCCCCEECCCCCCCCCCEEEEHHHHHH--HHHHHHHHHHH-----HHCEEEEECCCHHHHHHHHHHHCC T ss_conf 89999998520863020156113677844875788899--99988777656-----421047866751689999998514 Q ss_pred HCCCCC-CCCCCC--CCCCCCHHHHHHHHHHHHHHHHCCCEEE---EEHHHHHH Q ss_conf 658822-358544--6888873899999999999998698033---52475657 Q gi|254780201|r 342 RNLHLI-GEFEEE--RNLKDGALEHAVERFFACSVRYTEFSIE---SVDCVAEY 389 (394) Q Consensus 342 ~~l~l~-~DF~~E--~g~~DGTlaHAIERl~~~iv~~~gf~i~---~v~~~~~~ 389 (394) .--.++ +-+..+ ..|.| -|=+|.+..-+-..||++. ..||.-++ T Consensus 530 ~~~~~y~~~~~~~~~~~~~~----~geDR~L~~~llskgy~l~Y~a~s~a~t~~ 579 (862) T KOG2571 530 FVEYFYGEKFSGPRHGIQYS----LGEDRWLCTLLLSKGYRLKYVAASDAETEA 579 (862) T ss_pred HHHHHHCHHHCCCCCCCCCC----CCHHHHHHHHHHHCCCEEEEECCCCCCCCC T ss_conf 48765122312766444323----103579999998526145541254433018 No 28 >pfam01295 Adenylate_cycl Adenylate cyclase, class-I. Probab=71.90 E-value=6.1 Score=19.56 Aligned_cols=113 Identities=15% Similarity=0.154 Sum_probs=54.6 Q ss_pred HHHHCCCCCCCEEEEEEEECC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEEC Q ss_conf 751001585643999973137-7410073268718755999999730266420246782210110000014738999941 Q gi|254780201|r 80 SKYSKLSFPSCRIFFYGSRKE-QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCY 158 (394) Q Consensus 80 ~~~~~~~~~~~~ivfinaWNE-EG~yLePD~~~G~~yL~a~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHly 158 (394) ....++.-..-.+|..||||| -.-+-|- ..+-|+|++..+-+. .+++ ..|. .|=+.+| T Consensus 590 g~~~enLVgSiDlvyrNSWnEv~t~hfeG----~~AiL~aLk~il~km---~~~a-~~p~-------------sv~VfCy 648 (830) T pfam01295 590 GREQECLVGSIDLIYRNSWNEIRTLHFEG----ENAILDALKTISNKI---HRGA-APPE-------------SIEVFCY 648 (830) T ss_pred CCCCCCCEEEEEEEEECCCCCEEEEEECC----HHHHHHHHHHHHCCC---CCCC-CCCC-------------CEEEEEC T ss_conf 76545544455033321423045642057----179999999985646---8888-9998-------------5479977 Q ss_pred CHHHHHHHHHHHHHC-CCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHH Q ss_conf 677899999999708-998149997176528999999711787389994566604799 Q gi|254780201|r 159 YQDTWIEISHILLRL-NFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRP 215 (394) Q Consensus 159 Y~DLl~E~~~~L~~i-p~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~P 215 (394) =.-+-..|..+...+ ....+|..++.+.... -...+.....-....++||-.+-. T Consensus 649 S~~~r~~i~~~V~~L~~eci~l~~~~~~~~~~--~~~l~v~g~~~glfFe~~gvs~q~ 704 (830) T pfam01295 649 SQHLRSQLRNTVQQLVNECISLRLGTRSSPQR--VKTLRVAGKNWGLFFERRGVSLQK 704 (830) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCE--EEEEEECCEEEEEEECCCCCEEEE T ss_conf 47668999999999999999985155666420--126898570799998888751055 No 29 >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Probab=71.26 E-value=8.1 Score=18.81 Aligned_cols=185 Identities=15% Similarity=0.060 Sum_probs=84.2 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCC-CHHHHHHHHHHCCCCCEEEEE-CCCCHHHHHHHHHHCCCCCCCCCEEEEECCC Q ss_conf 77899999999708998149997176-528999999711787389994-5666047999986221002786789997368 Q gi|254780201|r 160 QDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPSAQLYVM-ENKGRDVRPFLYLLELGVFDRYDYLCKIHGK 237 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~~~DL~ITt~~-~~~~~~~~~~~~~~~a~V~vv-~NRGRDI~PfL~~~~~~~l~~YD~v~klHtK 237 (394) .+++.+++..|.+-. ..++| ++. ........+....+++++... +|.|=--+ +=.+++...-.++|+++.+-+= T Consensus 10 ~~~l~~~L~sl~~q~--~eIiV-VDN~S~d~~~~~~~~~~~~v~~i~~~~N~G~a~g-~N~Gi~~a~~~~~d~i~~ln~D 85 (237) T cd02526 10 LSKLKELLAALAEQV--DKVVV-VDNSSGNDIELRLRLNSEKIELIHLGENLGIAKA-LNIGIKAALENGADYVLLFDQD 85 (237) T ss_pred HHHHHHHHHHHHCCC--CEEEE-EECCCCHHHHHHHHHCCCCEEEEECCCCCCHHHH-CCCCCHHHHCCCCCEEEEECCC T ss_conf 999999999766379--98999-9698890289999861998699989997785887-2743033411797789993575 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 77677776444078999999874799899999999976099768981687654433234554322138999999974899 Q gi|254780201|r 238 KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP 317 (394) Q Consensus 238 KS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~ 317 (394) =-+ -..|-..|+ ..+....+++.+|+++|............ ..... .........-. T Consensus 86 ~~~--------~~~~l~~l~-----------~~~~~~~~~~~vg~~~p~~~d~~~~~~~~-~~~~~---~~~~~~~~~~~ 142 (237) T cd02526 86 SVP--------PPDMVEKLL-----------AYKILSDKNSNIGAVGPRIIDRRTGENSP-GVRKS---GYKLRIQKEGE 142 (237) T ss_pred CCC--------CHHHHHHHH-----------HHHHHCCCCCCEEEEEEEEECCCCCCCCC-CCCCC---CCCCCCCCCCC T ss_conf 655--------905999999-----------99985222798899971798689980045-42112---53322354424 Q ss_pred CCCCCCCCCCEEEEEEEHHHHHHHHCCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEH Q ss_conf 885547737701233209888757658822-3585446888873899999999999998698033524 Q gi|254780201|r 318 TKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384 (394) Q Consensus 318 ~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~-~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~v~ 384 (394) ..+...+|..||...+|.++++.+ +.+ ++|-- . .=|-=+++=++++||++.-+- T Consensus 143 ~~~~~~~~~~~sg~lir~~~~~~v---G~fde~fF~-----y-----~ED~Dl~~R~~~~G~~i~~~p 197 (237) T cd02526 143 EGLKEVDFLITSGSLISLEALEKV---GGFDEDLFI-----D-----YVDTEWCLRARSKGYKIYVVP 197 (237) T ss_pred CCCEECCEEEECCEEEEHHHHHHH---CCCCHHHCC-----C-----CCHHHHHHHHHHCCCCEEEEC T ss_conf 784460430301437789999985---799689779-----3-----719999999998599799989 No 30 >TIGR03584 PseF pseudaminic acid CMP-transferase. The sequences in this family include the pfam02348 (cytidyltransferase) domain and are homologous to the NeuA protein responsible for the transfer of CMP to neuraminic acid. According to, this gene is responsible for the transfer of CMP to the structurally related sugar, pseudaminic acid which is observed as a component of sugar modifications of flagellin in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Probab=70.14 E-value=8.6 Score=18.66 Aligned_cols=152 Identities=11% Similarity=0.026 Sum_probs=71.6 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCC-CCEEEEECCCCHHHHHHHHHHCC----CC-CCCCCEEEEEC Q ss_conf 89999999970899814999717652899999971178-73899945666047999986221----00-27867899973 Q gi|254780201|r 162 TWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP-SAQLYVMENKGRDVRPFLYLLEL----GV-FDRYDYLCKIH 235 (394) Q Consensus 162 Ll~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~-~a~V~vv~NRGRDI~PfL~~~~~----~~-l~~YD~v~klH 235 (394) |++-.++.+.+...--+++|||+.++.. .+.+.+. .+....-++---|-++...++.. .. ..+||++|-+| T Consensus 26 Li~~~i~~~~~s~~~d~i~vSTD~~~i~---~ia~~~~~~~~~~Rp~~ls~d~~~~~~vi~~~l~~~~~~~~~d~i~~l~ 102 (222) T TIGR03584 26 MIAYSIEAALNSGLFDEVVVSTDDEEIA---EVAKSYGASVPFMRPAELSDDFTGTAPVVAHAIERLQLQKQPDHACCIY 102 (222) T ss_pred HHHHHHHHHHHCCCCCEEEECCCHHHHH---HHHHCCCCCCEEECCHHHCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 5999999998579965388727889987---5552169864251865555886214599999999998658999899954 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCC---CHHHHHHHHH Q ss_conf 6877677776444078999999874799899999999976099768981687654433234554322---1389999999 Q gi|254780201|r 236 GKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE---VYRRVIDLAK 312 (394) Q Consensus 236 tKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~---~~~~~~~L~~ 312 (394) . -|| +-+.+.+...++.|.++..=.++.-..+.....+....+..+ ....-....+ T Consensus 103 p-TsP--------------------~r~~~~I~~ai~~~~~~~~ds~~sv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 161 (222) T TIGR03584 103 A-TAP--------------------LLQAKILKEAFELLKQPNKHFVFTVTEFAFPIQRAFSLKENGGVTMFFPEHYNTR 161 (222) T ss_pred C-CCC--------------------CCCHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHEEECCCCCEEECCHHHHHHH T ss_conf 8-976--------------------1589999999999972699855065337768589379867994633474552024 Q ss_pred HCCCCCCCCCCCCCCEEEEEEEHHHHHHH Q ss_conf 74899885547737701233209888757 Q gi|254780201|r 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPL 341 (394) Q Consensus 313 rlgi~~~~~~~~FpaGSMFW~R~~AL~pL 341 (394) +-.+ ...-+..|++||++.+.+..- T Consensus 162 ~Qdl----~~~y~~~G~~y~~~~~~~~~~ 186 (222) T TIGR03584 162 SQDL----EEAYHDAGQFYWGKSQAWLES 186 (222) T ss_pred HHHC----CCCEEECCEEEEEEHHHHHHC T ss_conf 4424----433754478999889999844 No 31 >cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=69.48 E-value=8.8 Score=18.58 Aligned_cols=192 Identities=16% Similarity=0.156 Sum_probs=98.5 Q ss_pred CCEEEEEEECC--HHHHHHHHHHHHHCC-CCEEEEEECC--CC--HHHHHHHHHHCCCCCEEEE-ECCCCHHHHHHHHHH Q ss_conf 47389999416--778999999997089-9814999717--65--2899999971178738999-456660479999862 Q gi|254780201|r 149 SKIAIVVHCYY--QDTWIEISHILLRLN-FDFDLFVTVV--EA--NKDFEQDVLKYFPSAQLYV-MENKGRDVRPFLYLL 220 (394) Q Consensus 149 ~kIAVvlHlyY--~DLl~E~~~~L~~ip-~~~DL~ITt~--~~--~~~~~~~~~~~~~~a~V~v-v~NRGRDI~PfL~~~ 220 (394) ++|.|++=.|= .+.+.++++.+.+-. ..+.++|--+ ++ ..++.+...+..+.+.+.. -+|.|.- ..+=.++ T Consensus 1 P~vSViip~yN~~~~~l~~~l~Si~~Qt~~~~EiIvvDd~StD~~~~~~~~~~~~~~~~i~~~~~~~n~G~~-~a~N~gi 79 (202) T cd04184 1 PLISIVMPVYNTPEKYLREAIESVRAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPRIKVVFREENGGIS-AATNSAL 79 (202) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHH-HHHHHHH T ss_conf 939999974889669999999999837899879999989989478999999988518852001267887899-9988645 Q ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCC Q ss_conf 21002786789997368776777764440789999998747998999999999760997689816876544332345543 Q gi|254780201|r 221 ELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300 (394) Q Consensus 221 ~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~ 300 (394) + ..+-|||+.+++-=- + .++.+..++..+.++|.+++++.....+... +.. T Consensus 80 ~---~a~geyi~flD~DD~---------------------~-~p~~l~~~~~~~~~~~~~~~v~~~~~~~~~~----~~~ 130 (202) T cd04184 80 E---LATGEFVALLDHDDE---------------------L-APHALYEVVKALNEHPDADLIYSDEDKIDEG----GKR 130 (202) T ss_pred H---CCCCCEEEECCCCCE---------------------E-CHHHHHHHHHHHHHCCCCCEEECCHHEECCC----CCE T ss_conf 1---368766773277855---------------------2-8439999999998689935997371168679----978 Q ss_pred CCCHHHHHHHHHHCCCCCC-CCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCE Q ss_conf 2213899999997489988-554773770123320988875765882235854468888738999999999999986980 Q gi|254780201|r 301 SEVYRRVIDLAKRAGFPTK-RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFS 379 (394) Q Consensus 301 ~~~~~~~~~L~~rlgi~~~-~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~ 379 (394) ... .. +.+.... .....+ .|.+.-+|.++++.+-.. .++|. .+=.--|.+=...+|++ T Consensus 131 ~~~------~~-~~~~~~~~~~~~~~-~~~~~~~rr~~~~~iGgf--de~~~-----------~~eD~dl~lRl~~~g~k 189 (202) T cd04184 131 SEP------FF-KPDWSPDLLLSQNY-IGHLLVYRRSLVRQVGGF--REGFE-----------GAQDYDLVLRVSEHTDR 189 (202) T ss_pred EEC------CC-CCCCCHHHHHHCCC-CCHHHHHHHHHHHHHCCC--CCCCC-----------CCHHHHHHHHHHHHCCE T ss_conf 721------24-78779889853476-320366689999995898--98888-----------31699999999981997 Q ss_pred EEEEHHHHHHHH Q ss_conf 335247565776 Q gi|254780201|r 380 IESVDCVAEYER 391 (394) Q Consensus 380 i~~v~~~~~~~~ 391 (394) |..+.=++-+-| T Consensus 190 i~~ip~~Ly~yR 201 (202) T cd04184 190 IAHIPRVLYHWR 201 (202) T ss_pred EEEECHHHEEEC T ss_conf 999280884968 No 32 >cd02513 CMP-NeuAc_Synthase CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety. CMP-N-acetylneuraminic acid synthetase (CMP-NeuAc synthetase) or acylneuraminate cytidylyltransferase catalyzes the transfer the CMP moiety of CTP to the anomeric hydroxyl group of NeuAc in the presence of Mg++. It is the second to last step in the sialylation of the oligosaccharide component of glycoconjugates by providing the activated sugar-nucleotide cytidine 5'-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac), the substrate for sialyltransferases. Eukaryotic CMP-NeuAc synthetases are predominantly located in the nucleus. The activated CMP-Neu5Ac diffuses from the nucleus into the cytoplasm. Probab=63.88 E-value=11 Score=17.90 Aligned_cols=154 Identities=12% Similarity=0.086 Sum_probs=69.5 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEE-EECCCCHHHHH----HHHHHCC--CCCCCCCEEEEE Q ss_conf 8999999997089981499971765289999997117873899-94566604799----9986221--002786789997 Q gi|254780201|r 162 TWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLY-VMENKGRDVRP----FLYLLEL--GVFDRYDYLCKI 234 (394) Q Consensus 162 Ll~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~-vv~NRGRDI~P----fL~~~~~--~~l~~YD~v~kl 234 (394) |++-.++.+.....--+++|||++++. ..+.+......+. .-++-.-|-++ +...+.. .....||++|-+ T Consensus 28 Li~~~i~~a~ks~~~d~IivSTDs~~i---~~~~~k~~~~~~~~Rpk~ls~d~~~~~dvi~~~l~~~~~~~~~~d~iv~l 104 (223) T cd02513 28 LIAWTIEAALESKLFDRVVVSTDDEEI---AEVARKYGAEVPFLRPAELATDTASSIDVILHALDQLEELGRDFDIVVLL 104 (223) T ss_pred HHHHHHHHHHHCCCCCEEEEECCHHHH---HHHHCCCCCEEEECCCHHHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 699999999966995507895587897---54212456417973846876886780677778998899868998999996 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHH-HHHHHH Q ss_conf 3687767777644407899999987479989999999997609976898168765443323455432213899-999997 Q gi|254780201|r 235 HGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKR 313 (394) Q Consensus 235 HtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~-~~L~~r 313 (394) +-- || +-+.+.+.+.+..|..+..=.+++...... .+|............. ...... T Consensus 105 ~pT-sP--------------------~r~~~~I~~~i~~~~~~k~dsl~sv~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 162 (223) T cd02513 105 QPT-SP--------------------LRSAEDIDEAIELLLSEGADSVFSVTEFHR-FPWRALGLDDNGLEPVNYPEDKR 162 (223) T ss_pred CCC-CC--------------------CCCHHHHHHHHHHHHHCCCCEEEEEEECCC-CHHHHEEECCCCCCCCCCCHHHC T ss_conf 279-76--------------------278999999999998579999999985476-81885676268702136861106 Q ss_pred CCCCCCCCCCCCCCEEEEEEEHHHHHHH Q ss_conf 4899885547737701233209888757 Q gi|254780201|r 314 AGFPTKRLHLDFFNGTMFWVKPKCLEPL 341 (394) Q Consensus 314 lgi~~~~~~~~FpaGSMFW~R~~AL~pL 341 (394) .. ...-...-++.|+++|++++.+..- T Consensus 163 ~~-rQ~~~~~y~~~g~~yi~~~~~~~k~ 189 (223) T cd02513 163 TR-RQDLPPAYHENGAIYIAKREALLES 189 (223) T ss_pred CC-CCCCCEEEEECCEEEEEEHHHHHHC T ss_conf 76-4467604786048999999999854 No 33 >cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=63.80 E-value=11 Score=17.89 Aligned_cols=180 Identities=11% Similarity=0.174 Sum_probs=80.2 Q ss_pred HHHHHHHHHHHHHCCCC---EEEEEECC--CC-HHHHHHHHHHCCCCCEEEEECCCCHHH----HHHHHHHCCCCCCCCC Q ss_conf 77899999999708998---14999717--65-289999997117873899945666047----9999862210027867 Q gi|254780201|r 160 QDTWIEISHILLRLNFD---FDLFVTVV--EA-NKDFEQDVLKYFPSAQLYVMENKGRDV----RPFLYLLELGVFDRYD 229 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~~---~DL~ITt~--~~-~~~~~~~~~~~~~~a~V~vv~NRGRDI----~PfL~~~~~~~l~~YD 229 (394) .+.+.++++.|.+...+ ++++|--+ ++ ..++.+.... .....+.++.+..+.. ..+=.+++ ..+.| T Consensus 9 ~~~i~~~l~sl~~q~yp~~~~EVivvdd~StD~T~~i~~~~~~-~~~~~~~~~~~~~~~~~gk~~aln~g~~---~a~ge 84 (229) T cd04192 9 AENLPRLLQSLSALDYPKEKFEVILVDDHSTDGTVQILEFAAA-KPNFQLKILNNSRVSISGKKNALTTAIK---AAKGD 84 (229) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHH-CCCCCEEECCCCCCCCCHHHHHHHHHHH---HCCCC T ss_conf 8999999999995789988689999989797167999999970-5697455302467777259999999998---64677 Q ss_pred EEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 89997368776777764440789999998747998999999999760997689816876544332345543221389999 Q gi|254780201|r 230 YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID 309 (394) Q Consensus 230 ~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~ 309 (394) +|+.+.+==- + +++.+..++..|.++ ..++|+....... . .++......+ + T Consensus 85 ~i~~lDaD~~---------------------~-~~~~l~~~~~~~~~~-~~~~v~g~~~~~~-~----~~~~~~~~~~-~ 135 (229) T cd04192 85 WIVTTDADCV---------------------V-PSNWLLTFVAFIQKE-QIGLVAGPVIYFK-G----KSLLAKFQRL-D 135 (229) T ss_pred EEEEECCCCC---------------------C-CHHHHHHHHHHHHCC-CCEEEECCEEEEC-C----CCHHHHHHHH-H T ss_conf 6998567656---------------------5-876999999997489-9479988741007-9----9999999999-9 Q ss_pred HHHHCCCCC--CCCCCC-CCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 999748998--855477-37701233209888757658822358544688887389999999999999869803352 Q gi|254780201|r 310 LAKRAGFPT--KRLHLD-FFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 (394) Q Consensus 310 L~~rlgi~~--~~~~~~-FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~v 383 (394) .....+... ...... +..|+.+-+|.++|..+-. |++.....-|- ..++.......|++|.-+ T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~G~~~~~rr~~~~~vGG------fd~~~~~~~~D-----d~~~~~~l~~~g~ki~y~ 201 (229) T cd04192 136 WLSLLGLIAGSFGLGKPFMCNGANMAYRKEAFFEVGG------FEGNDHIASGD-----DELLLAKVASKYPKVAYL 201 (229) T ss_pred HHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHCCC------CCCCCCCCHHH-----HHHHHHHHHHCCCEEEEE T ss_conf 9999999999850699558506215323999998099------88999863699-----999999998089979998 No 34 >cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified. Probab=60.26 E-value=13 Score=17.50 Aligned_cols=113 Identities=11% Similarity=0.073 Sum_probs=59.4 Q ss_pred CHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHH----HHHH-----HHCCCCCCCCCCCCCCEEEEEE Q ss_conf 989999999997609976898168765443323455432213899----9999-----9748998855477377012332 Q gi|254780201|r 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV----IDLA-----KRAGFPTKRLHLDFFNGTMFWV 333 (394) Q Consensus 263 s~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~----~~L~-----~rlgi~~~~~~~~FpaGSMFW~ 333 (394) .++.+.+++..|+.||++|-|+......... .++....+.. .... ..+|. ..--.|.+--+ T Consensus 87 ~~~al~~lv~~f~~dp~vgaV~G~i~v~n~~----~~~l~~~q~~EY~~~~~~~r~~~s~~G~------v~~~pGa~s~y 156 (244) T cd04190 87 DPDSIVQLYKAMDKDPEIGGVCGEIHPMGKK----QGPLVMYQVFEYAISHWLDKAFESVFGF------VTCLPGCFSMY 156 (244) T ss_pred CHHHHHHHHHHHHHCCCEEEEEEEEEECCCC----CCHHHHHHHHHHHHHHHHHHHHHHHCCC------EEECCCCHHHH T ss_conf 8579999999996399837887668965788----8889999999999999988999986799------89717812552 Q ss_pred EHHHHHHHHCCCCC-CCCCCCCCCCCCHH-----HHHHHHHHHHHHHHCCCEEEEEHH Q ss_conf 09888757658822-35854468888738-----999999999999986980335247 Q gi|254780201|r 334 KPKCLEPLRNLHLI-GEFEEERNLKDGAL-----EHAVERFFACSVRYTEFSIESVDC 385 (394) Q Consensus 334 R~~AL~pL~~l~l~-~DF~~E~g~~DGTl-----aHAIERl~~~iv~~~gf~i~~v~~ 385 (394) |.+||+........ .++..-....+-.+ .=|=.|.+..-+...|+++..+.| T Consensus 157 R~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~laEDr~Lt~~l~~~g~~~~~~y~ 214 (244) T cd04190 157 RIEALKGDNGGKGPLLDYAYLTNTVDSLHKKNNLDLGEDRILCTLLLKAGPKRKYLYV 214 (244) T ss_pred HHHHHHHCCCCCCCCHHCCCCCCCHHHHHHHCCCCHHCHHHHHHHHHHHCCCCEEEEC T ss_conf 0998873478744411002233310133321111021109999999982887348874 No 35 >KOG1170 consensus Probab=60.18 E-value=8.4 Score=18.70 Aligned_cols=122 Identities=18% Similarity=0.166 Sum_probs=60.5 Q ss_pred EEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 89997368776777764440789999998747998999999999760997689816876544332345543221389999 Q gi|254780201|r 230 YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID 309 (394) Q Consensus 230 ~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~ 309 (394) +-+-+|+|+--|-.. --..=+.++|-++||+++...+--..+++--.|-.=+-+.+. | . T Consensus 627 ISLDFhnKReEhPeK---crSR~kn~MWYGvLGtKeLLhrTyrnLEQRV~LECDG~~i~l---P---------------~ 685 (1099) T KOG1170 627 ISLDFHNKREEHPEK---CRSRSKNFMWYGVLGTKELLHRTYRNLEQRVKLECDGVPIDL---P---------------S 685 (1099) T ss_pred EEEECCCCCCCCHHH---HHHHHHHCCHHHHCCHHHHHHHHHHHHHHHEEEECCCCCCCC---C---------------C T ss_conf 753036643117577---767765112221026699999999708977054038832478---3---------------3 Q ss_pred HHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 999748998855477377012332098887576588223585446888873899999999999998698033 Q gi|254780201|r 310 LAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 (394) Q Consensus 310 L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~ 381 (394) |. ||-+. .-..|..||=||.+...= +.|...-++|=.-|--.+=|++--|.-|+ ++.+..||+ T Consensus 686 LQ---GIviL-NIpSyaGGtNFWGsnk~d-d~f~apSfDDriLEVVAvFGsvqMA~SRv----I~LqhHRIA 748 (1099) T KOG1170 686 LQ---GIVIL-NIPSYAGGTNFWGSNKDD-DEFTAPSFDDRILEVVAVFGSVQMATSRV----IRLQHHRIA 748 (1099) T ss_pred CC---EEEEE-ECCCCCCCCCCCCCCCCC-CCCCCCCCCCCEEEEEEEEHHHHHHHHHH----HHHHHHHHH T ss_conf 03---16998-145456765545778888-73048776665168865400578999888----874101232 No 36 >cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot Probab=58.79 E-value=14 Score=17.34 Aligned_cols=188 Identities=14% Similarity=0.179 Sum_probs=95.5 Q ss_pred CEEEEEEECCH-HHHHHHHHHHHHCCCC---EEEEEECCCCHHHH--HHHHHHCCCCCEEEEECCCC-H-HHHHHHHHHC Q ss_conf 73899994167-7899999999708998---14999717652899--99997117873899945666-0-4799998622 Q gi|254780201|r 150 KIAIVVHCYYQ-DTWIEISHILLRLNFD---FDLFVTVVEANKDF--EQDVLKYFPSAQLYVMENKG-R-DVRPFLYLLE 221 (394) Q Consensus 150 kIAVvlHlyY~-DLl~E~~~~L~~ip~~---~DL~ITt~~~~~~~--~~~~~~~~~~a~V~vv~NRG-R-DI~PfL~~~~ 221 (394) +|.|++=.|=. +.+..+++.+.++..| ++++|-.+.+.... ..+.........+..+++.| | =-+.+=.++. T Consensus 2 ~vsViIPa~NE~~vI~~ti~sl~~~~YP~~~~evivv~d~s~~~t~~~~~~~~~~~~~~~~~~~~~~~~gK~~alN~al~ 81 (241) T cd06427 2 VYTILVPLYKEAEVLPQLIASLSALDYPRSKLDVKLLLEEDDEETIAAARALRLPSIFRVVVVPPSQPRTKPKACNYALA 81 (241) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHH T ss_conf 49999966898899999999999679997617999998899968999999857876708999638988746999999999 Q ss_pred CCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCEEEEECCCCCCCCCCCCCCCC Q ss_conf 1002786789997368776777764440789999998747998999999999760-997689816876544332345543 Q gi|254780201|r 222 LGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ-NPCLGMIGSRRYRRYKRWSFFAKR 300 (394) Q Consensus 222 ~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~-~p~lGlv~p~~~~~~~~~~~~~~~ 300 (394) ....|+|+.+-.=- +-+++.+..++..|.. +|++|.|...... .++. .++ T Consensus 82 ---~a~gd~v~~~DAD~----------------------~~~p~~L~~~v~~f~~~~~~v~~v~~~~~~-~n~~---~~~ 132 (241) T cd06427 82 ---FARGEYVVIYDAED----------------------APDPDQLKKAVAAFARLDDKLACVQAPLNY-YNAR---ENW 132 (241) T ss_pred ---HCCCCEEEEECCCC----------------------CCCHHHHHHHHHHHHHCCCCEEEEECEEEE-CCCC---CCH T ss_conf ---61588599978655----------------------649779999999998579855777331674-5899---988 Q ss_pred CCCHHHH---------HHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 2213899---------9999974899885547737701233209888757658822358544688887389999999999 Q gi|254780201|r 301 SEVYRRV---------IDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC 371 (394) Q Consensus 301 ~~~~~~~---------~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~ 371 (394) ......+ ..-..++|.. ....||.+-+|.++|+.+-..+ + +|+.-= --+++ T Consensus 133 ~~~~~~~e~~~~~~~~~~~~~~~~~~------~~~~G~~~~~Rr~~l~~vGg~~------~------~~~tED--~dl~~ 192 (241) T cd06427 133 LTRMFALEYAAWFDYLLPGLARLGLP------IPLGGTSNHFRTDVLRELGGWD------P------FNVTED--ADLGL 192 (241) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCE------EECCCCHHHHHHHHHHHHCCCC------C------CCCCCH--HHHHH T ss_conf 99988999999999999999966993------4338726877999999809998------8------764647--99999 Q ss_pred HHHHCCCEEEEEHHH Q ss_conf 999869803352475 Q gi|254780201|r 372 SVRYTEFSIESVDCV 386 (394) Q Consensus 372 iv~~~gf~i~~v~~~ 386 (394) -....||+|.-++++ T Consensus 193 rl~~~G~ri~~~~~~ 207 (241) T cd06427 193 RLARAGYRTGVLNST 207 (241) T ss_pred HHHHCCCEEEECCCC T ss_conf 999869949853754 No 37 >cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier. Probab=55.21 E-value=16 Score=16.98 Aligned_cols=150 Identities=9% Similarity=-0.003 Sum_probs=69.6 Q ss_pred HHHHHHHHHHHHHCCCC---EEEEEECCCCHHHHHHHHHHCCCC-CEEEEEC-CCCHHHHHHHHHHCC--CCCCCCCEEE Q ss_conf 77899999999708998---149997176528999999711787-3899945-666047999986221--0027867899 Q gi|254780201|r 160 QDTWIEISHILLRLNFD---FDLFVTVVEANKDFEQDVLKYFPS-AQLYVME-NKGRDVRPFLYLLEL--GVFDRYDYLC 232 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~~---~DL~ITt~~~~~~~~~~~~~~~~~-a~V~vv~-NRGRDI~PfL~~~~~--~~l~~YD~v~ 232 (394) .+.+.++++.+.++..+ ++++|=.+..... ..++.+.... +..+..+ ++|. -+.+=.+++. .....+|+|+ T Consensus 9 e~vI~~ti~~l~~~~YP~~~~eIivvdD~stD~-T~~~a~~~~~~v~~~~~~~~~gK-~~aln~~~~~~~~~~~~~d~v~ 86 (183) T cd06438 9 EAVIGNTVRSLKAQDYPRELYRIFVVADNCTDD-TAQVARAAGATVLERHDPERRGK-GYALDFGFRHLLNLADDPDAVV 86 (183) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH-HHHHHHHCCCEEEEEECCCCCCC-HHHHHHHHHHHHCCCCCCCEEE T ss_conf 999999999998368997856999984898620-89999983996999737777762-8899988999842478866899 Q ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHH---- Q ss_conf 9736877677776444078999999874799899999999976099768981687654433234554322138999---- Q gi|254780201|r 233 KIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI---- 308 (394) Q Consensus 233 klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~---- 308 (394) -+..--.+ .++.+..+...|..++.+ |..... ..++. .++......+. T Consensus 87 i~DAD~~~----------------------~~d~l~~~~~~f~~~~~~--Vq~~~~-~~N~~---~n~lt~~~~~e~~~~ 138 (183) T cd06438 87 VFDADNLV----------------------DPNALEELNARFAAGARV--VQAYYN-SKNPD---DSWITRLYAFAFLVF 138 (183) T ss_pred EECCCCCC----------------------CHHHHHHHHHHHHCCCCE--EEEEEE-ECCCC---CCHHHHHHHHHHHHH T ss_conf 95566677----------------------978999999998389967--984202-11798---799999999999999 Q ss_pred -HHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHH Q ss_conf -999974899885547737701233209888757 Q gi|254780201|r 309 -DLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPL 341 (394) Q Consensus 309 -~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL 341 (394) .+. +.+... -....+..||.+.+|.++|+.. T Consensus 139 ~~v~-~~~~~~-~g~~~~~~G~~~~~Rr~~L~~~ 170 (183) T cd06438 139 NRLR-PLGRSN-LGLSCQLGGTGMCFPWAVLRQA 170 (183) T ss_pred HHHH-HHHHHH-CCCCEEECCCHHHHHHHHHHHC T ss_conf 9999-999997-6993588690465219999973 No 38 >cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase. Probab=53.93 E-value=17 Score=16.85 Aligned_cols=155 Identities=12% Similarity=0.022 Sum_probs=75.1 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCC--CHHHHHHHHHHCCCCCEEEEE--C--CCCHHHHHHHHHH---CCCC-CC--- Q ss_conf 77899999999708998149997176--528999999711787389994--5--6660479999862---2100-27--- Q gi|254780201|r 160 QDTWIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYFPSAQLYVM--E--NKGRDVRPFLYLL---ELGV-FD--- 226 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~~~DL~ITt~~--~~~~~~~~~~~~~~~a~V~vv--~--NRGRDI~PfL~~~---~~~~-l~--- 226 (394) .+.+..++..|...+..+.++|--+. +......+.....+.+++... + |.|. -+.+=.++ +... .. T Consensus 9 e~~I~~ti~sl~~~~~~~eIivvdDgS~D~T~~~~~~~~~~~~~~vi~~~~~~~~~GK-~~ALN~al~~~~~~~~~~~~~ 87 (191) T cd06436 9 EAVIQRTLASLLRNKPNFLVLVIDDASDDDTAGIVRLAITDSRVHLLRRHLPNARTGK-GDALNAAYDQIRQILIEEGAD 87 (191) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCC-HHHHHHHHHHHHHHHHHCCCC T ss_conf 8999999999973899968999979999209999999830898799995477667851-578888999866432000135 Q ss_pred -CCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHH Q ss_conf -8678999736877677776444078999999874799899999999976099768981687654433234554322138 Q gi|254780201|r 227 -RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR 305 (394) Q Consensus 227 -~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~ 305 (394) +.++|+-+-.--. + +++.+..++..|+ +|++|.|...... .+. ..++...-. T Consensus 88 ~~~eii~v~DAD~~---------------------~-~~d~L~~~~~~f~-dp~v~~Vqg~~~~-~N~---~~n~ltr~q 140 (191) T cd06436 88 PERVIIAVIDADGR---------------------L-DPNALEAVAPYFS-DPRVAGTQSRVRM-YNR---HKNLLTILQ 140 (191) T ss_pred CCCEEEEEECCCCC---------------------C-CHHHHHHHHHHHC-CCCEEEEECCCEE-CCC---CCCHHHHHH T ss_conf 66418999537873---------------------2-8899999999725-9881599434032-688---878999999 Q ss_pred HHH-----HHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCC Q ss_conf 999-----9999748998855477377012332098887576588 Q gi|254780201|r 306 RVI-----DLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLH 345 (394) Q Consensus 306 ~~~-----~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~ 345 (394) .+. .+.++.. . .....+..||+-.+|.+||+.+-..+ T Consensus 141 ~~E~~~~~~~~~~~r-~--~~g~~~~~G~~~~~Rr~al~~vGG~~ 182 (191) T cd06436 141 DLEFFIIIAATQSLR-A--LTGTVGLGGNGQFMRLSALDGLIGEE 182 (191) T ss_pred HHHHHHHHHHHHHHH-H--HCCEEEECCEEEEEHHHHHHHCCCCC T ss_conf 999999999999899-8--67928967338888699999767998 No 39 >COG1428 Deoxynucleoside kinases [Nucleotide transport and metabolism] Probab=53.65 E-value=17 Score=16.82 Aligned_cols=72 Identities=18% Similarity=0.118 Sum_probs=47.9 Q ss_pred ECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 4167789999999970899814999717652899999971178738999456660479999862210027867 Q gi|254780201|r 157 CYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYD 229 (394) Q Consensus 157 lyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~~YD 229 (394) -+|-||++-+++-+..+|...||+|-+.-+-...++.+.++....++.. .+++-|-..=|...=+....+|+ T Consensus 107 ~~Y~~L~~~~~~~l~~~p~~PdllIyLd~~~e~~l~RI~~RgR~~E~~~-~~~~~~Y~~~l~~~Y~~~~~~~~ 178 (216) T COG1428 107 KYYDDLYDNMLEELPYLPGRPDLLIYLDASLETLLRRIAKRGRPFEIDN-FDENKDYLKDLHRRYDDWFENYD 178 (216) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHHCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999879987325789998899982789999999998199740246-65168999999999999998455 No 40 >KOG3969 consensus Probab=50.46 E-value=19 Score=16.60 Aligned_cols=12 Identities=25% Similarity=0.158 Sum_probs=7.4 Q ss_pred EEEEECCCCCCC Q ss_conf 899973687767 Q gi|254780201|r 230 YLCKIHGKKSQR 241 (394) Q Consensus 230 ~v~klHtKKS~~ 241 (394) .|||=++=||.. T Consensus 193 aIvhr~dikSiR 204 (310) T KOG3969 193 AIVHRRDIKSIR 204 (310) T ss_pred EEEECCCCCHHH T ss_conf 999427863464 No 41 >pfam02514 CobN-Mg_chel CobN/Magnesium Chelatase. This family contains a domain common to the cobN protein and to magnesium protoporphyrin chelatase. CobN is implicated in the conversion of hydrogenobyrinic acid a,c-diamide to cobyrinic acid. Magnesium protoporphyrin chelatase is involved in chlorophyll biosynthesis. Probab=46.82 E-value=19 Score=16.57 Aligned_cols=29 Identities=24% Similarity=0.282 Sum_probs=14.5 Q ss_pred HHCCCCEEEEEEEC-----------CHHHHHHHHHHHHHC Q ss_conf 00014738999941-----------677899999999708 Q gi|254780201|r 145 LTIKSKIAIVVHCY-----------YQDTWIEISHILLRL 173 (394) Q Consensus 145 ~~~k~kIAVvlHly-----------Y~DLl~E~~~~L~~i 173 (394) .+.++|||+++|=| |.|....+...|..+ T Consensus 239 ~n~eKkVAIi~~nyPpg~~~iG~A~~LDv~~Sl~~iL~~L 278 (1064) T pfam02514 239 PNAEKRVAIVLYNYPPGKGNIGTAAGLDVPASLVNLLRAL 278 (1064) T ss_pred CCCCCEEEEEEECCCCCCCCEEEECCCCHHHHHHHHHHHH T ss_conf 7302669999857998776610224537399999999999 No 42 >pfam00535 Glycos_transf_2 Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids. Probab=45.63 E-value=23 Score=16.05 Aligned_cols=152 Identities=18% Similarity=0.161 Sum_probs=79.5 Q ss_pred HHHHHHHHHHHHHC-CCCEEEEEECCCC---HHHHHHHHHHCCCCCEEEEEC-CCCHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 77899999999708-9981499971765---289999997117873899945-666047999986221002786789997 Q gi|254780201|r 160 QDTWIEISHILLRL-NFDFDLFVTVVEA---NKDFEQDVLKYFPSAQLYVME-NKGRDVRPFLYLLELGVFDRYDYLCKI 234 (394) Q Consensus 160 ~DLl~E~~~~L~~i-p~~~DL~ITt~~~---~~~~~~~~~~~~~~a~V~vv~-NRGRDI~PfL~~~~~~~l~~YD~v~kl 234 (394) .+.+.++++.+.+- ...++++|.-+.. ..++........+.+.+...+ |.|.- +.+-.+++ ..+.|+|+.+ T Consensus 10 ~~~l~~~l~sl~~q~~~~~eiiivDd~S~d~t~~~~~~~~~~~~~v~~~~~~~~~g~~-~a~n~g~~---~a~~~~v~~l 85 (168) T pfam00535 10 EKYLEECLESLLNQTYPNFEIIVVDDGSTDGTVEIAEEYAKKDPRIRVIRLEENLGKA-AARNAGLK---LATGDYILFL 85 (168) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHCEEEEECCCCCCCEEEEEECCCCCH-HHHHHHHH---HCCCCEEEEE T ss_conf 8999999999973879985999998999810100121001346760024420245746-88889999---6799859998 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCC--CHHHHHHHHH Q ss_conf 36877677776444078999999874799899999999976099768981687654433234554322--1389999999 Q gi|254780201|r 235 HGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE--VYRRVIDLAK 312 (394) Q Consensus 235 HtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~--~~~~~~~L~~ 312 (394) .+-=.. +++.+..+++.|++++.-+.+++.......... ...... .......... T Consensus 86 D~D~~~----------------------~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 142 (168) T pfam00535 86 DADDEV----------------------APDWLEKLVELLEKNGADIVIGSRVVIFGETRL-DGRALRFELLLLLGKLGD 142 (168) T ss_pred CCCCCC----------------------CHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCC-CCHHHHHHHHHHHHHHHH T ss_conf 589986----------------------876999999999839976999986867896254-512688899999999999 Q ss_pred HCCCCCCCCCCCCCCEEEEEEEHHHHHHHH Q ss_conf 748998855477377012332098887576 Q gi|254780201|r 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 313 rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~ 342 (394) .. ......+..|.+..+|.++++.++ T Consensus 143 ~~----~~~~~~~~~~~~~~~~r~~~~~~~ 168 (168) T pfam00535 143 RS----LGLKVLFLIGSNALYRREALEELL 168 (168) T ss_pred HH----HCCCCEEECCCEEEEEHHHHHHHC T ss_conf 98----589963716652899899999659 No 43 >KOG2791 consensus Probab=44.71 E-value=24 Score=15.96 Aligned_cols=101 Identities=11% Similarity=0.161 Sum_probs=55.7 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC--CCCCCCC-----CCEEECCCHHHHHHH-----------HHHHH Q ss_conf 24789998878763012211155326887405788--8878988-----615756897899999-----------99999 Q gi|254780201|r 13 KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSF--SPKQRIT-----SKDVHFQELSIFESF-----------IFWLR 74 (394) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~y~p~~~pg~~~~WDn--tpR~~~~-----~~~~~~~tP~~f~~~-----------~~wl~ 74 (394) -|-.++...+..+...+=++|-|..+|+.+||=|- -|++-++ +..-.+++|+.|.-| --|.- T Consensus 160 eiN~~I~~i~Fc~V~QiF~Pys~qlypt~FPG~~p~DCp~kmkk~~a~k~~C~n~ssPD~yGnyR~ak~~q~KHHWWWkm 239 (455) T KOG2791 160 EINRIIESIKFCQVKQIFSPYSPQLYPTSFPGVTPNDCPNKMKKGDAAKGHCENNSSPDQYGNYRSAKIVQLKHHWWWKM 239 (455) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCHHCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 99999863441026787537771005676899880015331245556634776888952015732013212235565767 Q ss_pred HHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 99999751001585643999973137741007326871875599 Q gi|254780201|r 75 SFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPF 118 (394) Q Consensus 75 ~~l~~~~~~~~~~~~~~ivfinaWNEEG~yLePD~~~G~~yL~a 118 (394) +++-. ...+.+..+.-|+|+ ||-.||-||...--.-|.- T Consensus 240 nfVwd-~ve~~~~~~g~iLfl----EEDH~LaPdayhv~~~l~~ 278 (455) T KOG2791 240 NFVWD-GVEETKGHEGHILFL----EEDHFLAPDAYHVIQTLTR 278 (455) T ss_pred HHHHH-HHHHHCCCCCEEEEE----ECCCCCCHHHHHHHHHHHH T ss_conf 87998-899733677658997----1343337018999999984 No 44 >COG1036 Archaeal flavoproteins [Energy production and conversion] Probab=43.31 E-value=25 Score=15.83 Aligned_cols=75 Identities=21% Similarity=0.298 Sum_probs=44.2 Q ss_pred CCEEEEEEECCHHHHHHHHHHHHHCCC-----CEEEEEECCCCHHHHH------HHHHHCCCCCEEEEECCCCHHHHHHH Q ss_conf 473899994167789999999970899-----8149997176528999------99971178738999456660479999 Q gi|254780201|r 149 SKIAIVVHCYYQDTWIEISHILLRLNF-----DFDLFVTVVEANKDFE------QDVLKYFPSAQLYVMENKGRDVRPFL 217 (394) Q Consensus 149 ~kIAVvlHlyY~DLl~E~~~~L~~ip~-----~~DL~ITt~~~~~~~~------~~~~~~~~~a~V~vv~NRGRDI~PfL 217 (394) .|||=.+-.= -|+++|..+.+..+.. ..|.|+|- ...++. ..+.+.++..++..=.| +||+ T Consensus 9 ~rIaWgITGa-G~~L~Et~~imk~lk~~~~~~~v~v~lSk--ageeVvk~YgL~~~l~~~~~~~~~e~~an-----sPfi 80 (187) T COG1036 9 KRIAWGITGA-GHLLPETYQIMKELKKEYGDVEVDVFLSK--AGEEVVKMYGLWDKLEKIFGGLEVEIGAN-----SPFI 80 (187) T ss_pred CEEEEEEECC-CCCCHHHHHHHHHHHHHCCCCEEEEEEHH--HHHHHHHHHHHHHHHHHHCCCEEEECCCC-----CCCE T ss_conf 1589997055-30109999999999864588058876223--08999999989999998759867643899-----9745 Q ss_pred HHHCCCCCCCCCEEEE Q ss_conf 8622100278678999 Q gi|254780201|r 218 YLLELGVFDRYDYLCK 233 (394) Q Consensus 218 ~~~~~~~l~~YD~v~k 233 (394) .+ ...+.+||+++- T Consensus 81 ~G--rlqlGkYD~llv 94 (187) T COG1036 81 AG--RLQLGKYDFLLV 94 (187) T ss_pred EC--CEECCCCCEEEE T ss_conf 31--100253017998 No 45 >KOG1402 consensus Probab=41.39 E-value=27 Score=15.65 Aligned_cols=116 Identities=16% Similarity=0.261 Sum_probs=73.3 Q ss_pred CCCCEEEEEEE-----ECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHH Q ss_conf 85643999973-----1377410073268718755999999730266420246782210110000014738999941677 Q gi|254780201|r 87 FPSCRIFFYGS-----RKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQD 161 (394) Q Consensus 87 ~~~~~ivfina-----WNEEG~yLePD~~~G~~yL~a~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~D 161 (394) +...+++|-++ |.+|| +.||+.+..-= ..+-+.+||-.-++.+.... |..+---+||-| T Consensus 38 yhpLpvvf~ka~g~~vwD~eG----------k~ylDflsays-----aVnqGhchpki~~aLqeq~~-kLtlssrafYnd 101 (427) T KOG1402 38 YHPLPVVFSKAKGSRVWDPEG----------KEYLDFLSAYS-----AVNQGHCHPKIIKALQEQAD-KLTLSSRAFYND 101 (427) T ss_pred CCCCCEEEEECCCCEEECCCC----------CCHHHHHHHHH-----HCCCCCCCHHHHHHHHHHHH-HHHHHHHHHHHH T ss_conf 776755888167857987775----------50344665542-----10257787799999999876-751336777655 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHH--HHHHH-------CCC--CCEEEEECC--CCHHHHHHHHH Q ss_conf 899999999708998149997176528999--99971-------178--738999456--66047999986 Q gi|254780201|r 162 TWIEISHILLRLNFDFDLFVTVVEANKDFE--QDVLK-------YFP--SAQLYVMEN--KGRDVRPFLYL 219 (394) Q Consensus 162 Ll~E~~~~L~~ip~~~DL~ITt~~~~~~~~--~~~~~-------~~~--~a~V~vv~N--RGRDI~PfL~~ 219 (394) ++.+|.+|+..+ ++||-.+-+.+.....+ -++.+ ..| .+.|.-..| .||-++..--. T Consensus 102 ~~~~f~~~vt~l-f~~~kvlpmnTGaEa~Eta~KLaR~wgy~~K~ip~nka~il~~~~nFhGrT~~ais~s 171 (427) T KOG1402 102 VLGEFAEYVTKL-FGYDKVLPMNTGAEAVETACKLARKWGYRKKNIPKNKAKILSAENNFHGRTLGAISLS 171 (427) T ss_pred HHHHHHHHHHHH-CCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCEEEEEEEC T ss_conf 678999999986-6854553235540677879999999777631688660048985055567334567715 No 46 >COG1154 Dxs Deoxyxylulose-5-phosphate synthase [Coenzyme metabolism / Lipid metabolism] Probab=38.91 E-value=27 Score=15.60 Aligned_cols=81 Identities=14% Similarity=0.019 Sum_probs=44.5 Q ss_pred ECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHH---HHH-HCC------------CHHHHHH Q ss_conf 456660479999862210027867899973687767777644407899999---987-479------------9899999 Q gi|254780201|r 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL---FFD-LLG------------FSDIAIR 269 (394) Q Consensus 206 v~NRGRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~L---l~~-LLG------------s~~~v~~ 269 (394) -|=-|-||-.++..++.....+...++|++|||..- +-..+.+.=..|- ++. .-| |+-.-+. T Consensus 248 GPiDGHni~~Li~~Lk~~kd~~gPvllHv~T~KGKG--Y~pAE~d~~~~H~v~~f~~~~tg~~~~~~~~~~sys~vf~~~ 325 (627) T COG1154 248 GPIDGHNLEELIPTLKNAKDLKGPVLLHVVTKKGKG--YKPAEEDPIKYHGVGPFDPIETGQSKKSKPSAPSYTKVFGDT 325 (627) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCC--CCHHHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 885877899999999998537998899999658888--882354965351777877433576677788888878999999 Q ss_pred HHHHHHHCCCEEEEECCCC Q ss_conf 9999760997689816876 Q gi|254780201|r 270 IINTFEQNPCLGMIGSRRY 288 (394) Q Consensus 270 Il~~F~~~p~lGlv~p~~~ 288 (394) +++.-++|+++=-+.|..+ T Consensus 326 L~~~a~~d~~ivaITaAM~ 344 (627) T COG1154 326 LCELAAKDEKIVAITAAMP 344 (627) T ss_pred HHHHHHHCCCEEEEECCCC T ss_conf 9999841897599835777 No 47 >KOG1115 consensus Probab=38.76 E-value=29 Score=15.39 Aligned_cols=96 Identities=16% Similarity=0.103 Sum_probs=54.8 Q ss_pred CCHHHHHHHHHHHHHCCCCEEEEEECCCC--H-HHHHHHHHHCC---C-CCEEEEEC--CCCHHHHHHHHHHCCCCCCCC Q ss_conf 16778999999997089981499971765--2-89999997117---8-73899945--666047999986221002786 Q gi|254780201|r 158 YYQDTWIEISHILLRLNFDFDLFVTVVEA--N-KDFEQDVLKYF---P-SAQLYVME--NKGRDVRPFLYLLELGVFDRY 228 (394) Q Consensus 158 yY~DLl~E~~~~L~~ip~~~DL~ITt~~~--~-~~~~~~~~~~~---~-~a~V~vv~--NRGRDI~PfL~~~~~~~l~~Y 228 (394) |..|++++.+..+.+=|...=+||----. + +.+-+.+.+-| + +..|.++| |.-||+.. ..+...++.| T Consensus 142 ~W~~~l~~~L~k~~~RPknllvFinPfgGkG~g~ki~e~V~~~F~la~v~tkvivTErAnhA~d~~~---ei~~~~~~~y 218 (516) T KOG1115 142 SWMDQLNYSLIKEVERPKNLLVFINPFGGKGNGSKIWETVSKIFILAKVNTKVIVTERANHAFDVMA---EIQNKELHTY 218 (516) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHEEEEECCEEEEEECCCCCHHHHHH---HCCHHHHHHC T ss_conf 9999999999987438762799975788887633556641232896211214999702452446554---2787562011 Q ss_pred CEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHH Q ss_conf 78999736877677776444078999999874799899999 Q gi|254780201|r 229 DYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIR 269 (394) Q Consensus 229 D~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~ 269 (394) |-+.-+ -|+.+..+++.++|+....+.+ T Consensus 219 DGiv~V-------------GGDG~FnEiL~G~llrtQ~~ag 246 (516) T KOG1115 219 DGIVAV-------------GGDGFFNEILNGYLLRTQEVAG 246 (516) T ss_pred CCEEEE-------------CCCHHHHHHHHHHHHHHHHHCC T ss_conf 667995-------------4732699987666651265318 No 48 >pfam03808 Glyco_tran_WecB Glycosyl transferase WecB/TagA/CpsF family. Probab=38.39 E-value=30 Score=15.36 Aligned_cols=53 Identities=6% Similarity=-0.118 Sum_probs=39.5 Q ss_pred EEECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEEC Q ss_conf 99416778999999997089981499971765289999997117873899945 Q gi|254780201|r 155 VHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVME 207 (394) Q Consensus 155 lHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~ 207 (394) -.+..+|++++++++...-....=++=++++......+.+.+.+|+.+|.... T Consensus 29 ~rv~G~dl~~~ll~~~~~~~~~v~llG~~~~~~~~~~~~l~~~yP~l~i~g~~ 81 (172) T pfam03808 29 ERVAGTDLIPALLERAAERGKRVFLLGGKPGVLEKAAARLRARYPGLRIVGTH 81 (172) T ss_pred CCCCHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 51685999999999998649838998088899999999999887995599987 No 49 >TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602). Probab=38.05 E-value=30 Score=15.33 Aligned_cols=171 Identities=15% Similarity=0.167 Sum_probs=97.9 Q ss_pred HHCCCCEEEEEEECCH-HHHHHHHHHHHHCCCCE---EEEEECC--CC-HHHHHHHHHHCCCCCEEEEE-CCCCHHHHHH Q ss_conf 0001473899994167-78999999997089981---4999717--65-28999999711787389994-5666047999 Q gi|254780201|r 145 LTIKSKIAIVVHCYYQ-DTWIEISHILLRLNFDF---DLFVTVV--EA-NKDFEQDVLKYFPSAQLYVM-ENKGRDVRPF 216 (394) Q Consensus 145 ~~~k~kIAVvlHlyY~-DLl~E~~~~L~~ip~~~---DL~ITt~--~~-~~~~~~~~~~~~~~a~V~vv-~NRGRDI~Pf 216 (394) ..--++|.|++=+|=. +.+..+++.+.+...|- +++|=-+ ++ ..++..+..+.+|...|+.. +|.|. -.++ T Consensus 45 ~~~lP~VSViIPayNEe~~I~~~i~Sl~~~dYP~~~lEIIvVdDgStD~T~ei~~~~~~~~p~~~v~~~~~n~GK-a~AL 123 (439) T TIGR03111 45 IGKLPDITIIIPVYNSEDTLFNCIESIYNQTYPIELIDIILANNQSTDDSFQVFCRAQNEFPGLSLRYMNSDQGK-AKAL 123 (439) T ss_pred CCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCCCCCH-HHHH T ss_conf 667997699993587388999999999967999985599999789951799999998865898399976898985-9999 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCC Q ss_conf 98622100278678999736877677776444078999999874799899999999976099768981687654433234 Q gi|254780201|r 217 LYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSF 296 (394) Q Consensus 217 L~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~ 296 (394) =.+++ ..+.|+++.+-.--. + .++.+.+++..|++||++|.++-..-........ T Consensus 124 N~gl~---~A~Ge~iv~~DADt~---------------------l-e~daL~~lv~~F~~dp~V~Av~G~V~~~~~~i~~ 178 (439) T TIGR03111 124 NAAIY---NSIGKYIIHIDSDGK---------------------L-HKDAIKNMVTRFENNPDIHAMTGVILTDKELIEK 178 (439) T ss_pred HHHHH---HCCCCEEEEEECCCC---------------------C-CHHHHHHHHHHHHHCCCEEEEECEEEECCCHHCC T ss_conf 99999---778898999808988---------------------6-8569999999872099668996637405421013 Q ss_pred CCCCCCCH-H--HHHH-----HHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHH Q ss_conf 55432213-8--9999-----999748998855477377012332098887576 Q gi|254780201|r 297 FAKRSEVY-R--RVID-----LAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 297 ~~~~~~~~-~--~~~~-----L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~ 342 (394) ..+++-+- . +..+ +..|.+.... ....-..|++.-+|.+||...- T Consensus 179 ~~n~~~~ll~~~q~~EY~~~f~~~R~~~s~~-~~i~~isGa~~afRr~aL~~vG 231 (439) T TIGR03111 179 TKGRFLKLIRRCEYFEYAQAFLAGRNFESQV-NSLFTLSGAFSAFRRETILKTQ 231 (439) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCEEEECCCHHHEEHHHHHHCC T ss_conf 2227878999999999999999989999973-9978986831210299999809 No 50 >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=37.93 E-value=30 Score=15.31 Aligned_cols=111 Identities=15% Similarity=0.080 Sum_probs=59.5 Q ss_pred EEEEEECCHHHHHHHHHHHHHCCCCEE-EEEECCCC-HHHHHHHHHHCCCCCEEE---EECCCCHHHHHHHHHHCCCCCC Q ss_conf 899994167789999999970899814-99971765-289999997117873899---9456660479999862210027 Q gi|254780201|r 152 AIVVHCYYQDTWIEISHILLRLNFDFD-LFVTVVEA-NKDFEQDVLKYFPSAQLY---VMENKGRDVRPFLYLLELGVFD 226 (394) Q Consensus 152 AVvlHlyY~DLl~E~~~~L~~ip~~~D-L~ITt~~~-~~~~~~~~~~~~~~a~V~---vv~NRGRDI~PfL~~~~~~~l~ 226 (394) ||++--==.+++.++++.|.+-..+.+ ++| ++.. .....+.+.+..+...|. .-+|.|-- +.+=.+++...-. T Consensus 1 aVI~t~N~~~~l~~~L~Sl~~q~~~~~eIiV-VDn~StD~t~~~l~~~~~~~~v~~i~~~~N~G~a-~~~N~Gi~~a~~~ 78 (202) T cd04185 1 AVVVTYNRLDLLKECLDALLAQTRPPDHIIV-IDNASTDGTAEWLTSLGDLDNIVYLRLPENLGGA-GGFYEGVRRAYEL 78 (202) T ss_pred CEEEECCCHHHHHHHHHHHHHCCCCCCEEEE-EECCCCCCHHHHHHHHCCCCCEEEEECCCCCCHH-HHHHHHHHHHHHC T ss_conf 9990028889999999999827799988999-9794987616653000356668998568887327-8999999988755 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 8678999736877677776444078999999874799899999999976099768981687 Q gi|254780201|r 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRR 287 (394) Q Consensus 227 ~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~ 287 (394) ++|+|+.+.+==-+ .++.++.+++.++. +..++++|.. T Consensus 79 ~~d~v~~ld~D~~~----------------------~~~~l~~l~~~~~~-~~~~~~~~~~ 116 (202) T cd04185 79 GYDWIWLMDDDAIP----------------------DPDALEKLLAYADK-DNPQFLAPLV 116 (202) T ss_pred CCEEEEEECCCCCC----------------------CCCHHHHHHHHHHC-CCCEEEEEEE T ss_conf 98189998998754----------------------92299999998657-9977999789 No 51 >cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=36.86 E-value=31 Score=15.21 Aligned_cols=13 Identities=23% Similarity=0.322 Sum_probs=9.9 Q ss_pred CCHHHHHHHHHHH Q ss_conf 8738999999999 Q gi|254780201|r 358 DGALEHAVERFFA 370 (394) Q Consensus 358 DGTlaHAIERl~~ 370 (394) ...|+.+|++|+. T Consensus 319 ~~~la~~i~~l~~ 331 (364) T cd03814 319 AEAFAAALAALLA 331 (364) T ss_pred HHHHHHHHHHHHC T ss_conf 9999999999976 No 52 >TIGR02470 sucr_synth sucrose synthase; InterPro: IPR012820 This entry represents sucrose synthase an enzyme that despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyses the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria.; GO: 0016157 sucrose synthase activity. Probab=35.70 E-value=11 Score=17.88 Aligned_cols=229 Identities=13% Similarity=0.182 Sum_probs=112.2 Q ss_pred CCCCCCCCCCEEECC--CHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHH------ Q ss_conf 888789886157568--978999999999999999751001585643999973137741007326871875599------ Q gi|254780201|r 47 FSPKQRITSKDVHFQ--ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPF------ 118 (394) Q Consensus 47 ntpR~~~~~~~~~~~--tP~~f~~~~~wl~~~l~~~~~~~~~~~~~~ivfinaWNEEG~yLePD~~~G~~yL~a------ 118 (394) ..+|+-++|..|.|- |=..|.+-=.|...++.+.+..+ +.+..| .||-.=.....|+--++---.||.+ T Consensus 129 ~~s~sIG~Gv~FLNR~LSS~lf~~~~~w~~~Ll~FLr~H~--y~G~~L-mlnd~i~~~~~L~~~L~~A~~~ls~lp~~tP 205 (790) T TIGR02470 129 SDSKSIGNGVQFLNRHLSSKLFQDKESWMEPLLNFLRVHN--YNGEQL-MLNDRIQSVSALQSQLRKAEEFLSALPPDTP 205 (790) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH--HCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 7863224317898899989853683146799999985442--168401-1344566578999999999999845887674 Q ss_pred -----------------------HHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf -----------------------999973026642024678221011000001473899994167789999999970899 Q gi|254780201|r 119 -----------------------DSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNF 175 (394) Q Consensus 119 -----------------------~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DLl~E~~~~L~~ip~ 175 (394) .++.+.=.-++.++ .+ |..|++| |+.||- T Consensus 206 Y~eF~~~lQ~lGfE~GWGdTA~Rv~Et~~lL~~lL~a--Pd-----------------------p~~LE~F---l~~iPm 257 (790) T TIGR02470 206 YSEFEFELQELGFEPGWGDTAERVRETLHLLLDLLEA--PD-----------------------PSVLEAF---LGRIPM 257 (790) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCC--CC-----------------------HHHHHHH---HCCCCC T ss_conf 4789889874488888530689999999999998528--89-----------------------5589998---512341 Q ss_pred CEEEEEECCCCH-----------------------HHHHHHHHHCCC----C---C--EE----EEEC-CCCHHHHHHHH Q ss_conf 814999717652-----------------------899999971178----7---3--89----9945-66604799998 Q gi|254780201|r 176 DFDLFVTVVEAN-----------------------KDFEQDVLKYFP----S---A--QL----YVME-NKGRDVRPFLY 218 (394) Q Consensus 176 ~~DL~ITt~~~~-----------------------~~~~~~~~~~~~----~---a--~V----~vv~-NRGRDI~PfL~ 218 (394) -|.++|=-|..- +..+.++.++.. . + +| |.+| +.| + T Consensus 258 vFnvvilSpHGyFgQ~~VLG~PDTGGQVVYiLDQVRaLE~em~~ri~~~GL~vL~~~PkIlIvTRLiPdA~G-------T 330 (790) T TIGR02470 258 VFNVVILSPHGYFGQENVLGLPDTGGQVVYILDQVRALENEMLERIKLQGLEVLEITPKILIVTRLIPDAEG-------T 330 (790) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCEECCCCEEEEEECCCCCCCC-------C T ss_conf 000545546640014776588888865775104469999999999987384011565528998501768988-------6 Q ss_pred HHCCCCCC---C--CCEEEEECCCCCCCCCCCCCCCHHHH--HHHHHHHCCC-HHHHHHHHHHHHHCCCEEEEECCCCCC Q ss_conf 62210027---8--67899973687767777644407899--9999874799-899999999976099768981687654 Q gi|254780201|r 219 LLELGVFD---R--YDYLCKIHGKKSQREGYHPIEGIIWR--RWLFFDLLGF-SDIAIRIINTFEQNPCLGMIGSRRYRR 290 (394) Q Consensus 219 ~~~~~~l~---~--YD~v~klHtKKS~~~~~~~~~G~~Wr--~~Ll~~LLGs-~~~v~~Il~~F~~~p~lGlv~p~~~~~ 290 (394) .|+ ..|. | +-.|+.+==++- +...-..|= -..|=.|=-- .+....|+..|...|+|-+ T Consensus 331 ~Cn-qRLEKv~Gt~~~~ILRVPFr~~-----nG~~~~~WISRFeiWPYLE~FA~D~~~e~~~el~g~PDLII-------- 396 (790) T TIGR02470 331 TCN-QRLEKVSGTEHAHILRVPFRTE-----NGIILRKWISRFEIWPYLETFAEDAEKEILAELQGKPDLII-------- 396 (790) T ss_pred CCC-CEEECCCCCCCCEEEECCCCCC-----CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEE-------- T ss_conf 057-2452012798531642166434-----77300366330124720456689999999998468997067-------- Q ss_pred CCCCCCCCCCCCCHHHHH-HHHHHCCCCCC-----CCCCCCCCEEEEEEEH Q ss_conf 433234554322138999-99997489988-----5547737701233209 Q gi|254780201|r 291 YKRWSFFAKRSEVYRRVI-DLAKRAGFPTK-----RLHLDFFNGTMFWVKP 335 (394) Q Consensus 291 ~~~~~~~~~~~~~~~~~~-~L~~rlgi~~~-----~~~~~FpaGSMFW~R~ 335 (394) |+ ...|| .++ =|+.|||+..| -++.-++.-...|-.. T Consensus 397 GN------YSDGN--LVA~LLA~kLgVTQC~IAHALEKtKY~~Sd~yW~~~ 439 (790) T TIGR02470 397 GN------YSDGN--LVASLLARKLGVTQCTIAHALEKTKYPDSDIYWQEL 439 (790) T ss_pred EC------CCCHH--HHHHHHHHHCCCCCCCHHHHCCCCCCCCCCCHHHHC T ss_conf 32------56358--999999853362222033313343676645404202 No 53 >COG4098 comFA Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein) [DNA replication, recombination, and repair] Probab=34.48 E-value=34 Score=14.98 Aligned_cols=16 Identities=0% Similarity=-0.112 Sum_probs=7.1 Q ss_pred CCCCCCCCEEECCCHH Q ss_conf 8789886157568978 Q gi|254780201|r 49 PKQRITSKDVHFQELS 64 (394) Q Consensus 49 pR~~~~~~~~~~~tP~ 64 (394) +|+...+.||.-+||. T Consensus 229 ark~~g~~IylTATp~ 244 (441) T COG4098 229 ARKKEGATIYLTATPT 244 (441) T ss_pred HHCCCCCEEEEECCCH T ss_conf 5123673699964880 No 54 >cd06533 Glyco_transf_WecG_TagA The glycosyltransferase WecG/TagA superfamily contains Escherichia coli WecG, Bacillus subtilis TagA and related proteins. E. coli WecG is believed to be a UDP-N-acetyl-D-mannosaminuronic acid transferase, and is involved in enterobacterial common antigen (eca) synthesis. B. subtilis TagA plays a key role in the Wall Teichoic Acid (WTA) biosynthetic pathway, catalyzing the transfer of N-acetylmannosamine to the C4 hydroxyl of a membrane-anchored N-acetylglucosaminyl diphospholipid to make ManNAc-beta-(1,4)-GlcNAc-pp-undecaprenyl. This is the first committed step in this pathway. Also included in this group is Xanthomonas campestris pv. campestris GumM, a glycosyltransferase participating in the biosynthesis of the exopolysaccharide xanthan. Probab=34.05 E-value=35 Score=14.93 Aligned_cols=53 Identities=8% Similarity=0.002 Sum_probs=38.8 Q ss_pred EEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEE Q ss_conf 99941677899999999708998149997176528999999711787389994 Q gi|254780201|r 154 VVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVM 206 (394) Q Consensus 154 vlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv 206 (394) .-.+-.+|+++++++....-....=++=++++..+.....+.+.+|+.+|..+ T Consensus 26 ~~rv~G~dl~~~ll~~~~~~~~~v~llG~~~~~~~~~~~~l~~~yp~l~i~g~ 78 (171) T cd06533 26 PERVTGSDLMPALLELAAQKGLRVFLLGAKPEVLEKAAERLRARYPGLKIVGY 78 (171) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 86268199999999999864974999808989999999999978899379998 No 55 >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=33.29 E-value=36 Score=14.86 Aligned_cols=188 Identities=11% Similarity=0.154 Sum_probs=94.1 Q ss_pred EEEEECC---HHHHHHHHHHHHHCCCC-EEEEEECC---CC-HHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCC Q ss_conf 9999416---77899999999708998-14999717---65-28999999711787389994566604799998622100 Q gi|254780201|r 153 IVVHCYY---QDTWIEISHILLRLNFD-FDLFVTVV---EA-NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGV 224 (394) Q Consensus 153 VvlHlyY---~DLl~E~~~~L~~ip~~-~DL~ITt~---~~-~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~ 224 (394) |++=+|- ++.++++++-+.+=..+ +.++|--+ ++ ..++.+...+..+-..+..-+|.|.- ...=.++. T Consensus 2 Viip~YN~e~~~~l~~~l~Svl~Qt~~~~EiIiVdDgSs~d~~~~i~~~~~~~~~i~~i~~~~N~G~~-~a~N~gi~--- 77 (201) T cd04195 2 VLMSVYIKEKPEFLREALESILKQTLPPDEVVLVKDGPVTQSLNEVLEEFKRKLPLKVVPLEKNRGLG-KALNEGLK--- 77 (201) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCHH-HHHHHHHH--- T ss_conf 89988848978999999999995799981899998999965439999986147998999878878989-99776364--- Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCC-- Q ss_conf 278678999736877677776444078999999874799899999999976099768981687654433234554322-- Q gi|254780201|r 225 FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-- 302 (394) Q Consensus 225 l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~-- 302 (394) ..+.|||+.+.+== .| .++.++..+..|+++|.+++++.................. T Consensus 78 ~a~g~yI~~lD~DD------------~~----------~p~~l~~~~~~l~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 135 (201) T cd04195 78 HCTYDWVARMDTDD------------IS----------LPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIGKRRLPT 135 (201) T ss_pred HCCCCEEEECCCCC------------CC----------CHHHHHHHHHHHHHCCCEEEEECCEEEECCCCCEEEEEECCC T ss_conf 26766999818898------------46----------716999999999878996999803799857897766797787 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 13899999997489988554773770123320988875765882235854468888738999999999999986980335 Q gi|254780201|r 303 VYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382 (394) Q Consensus 303 ~~~~~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl~~~iv~~~gf~i~~ 382 (394) ....+..... ....|..+|++ +|.+++..+- .++++. ...| .| +-+=.-.+|+++.. T Consensus 136 ~~~~~~~~~~--------~~~~~~~~~~~-~rr~~~~~~G---g~~~~~---~~eD------~d--l~lRl~~~G~~~~~ 192 (201) T cd04195 136 SHDDILKFAR--------RRSPFNHPTVM-FRKSKVLAVG---GYQDLP---LVED------YA--LWARMLANGARFAN 192 (201) T ss_pred CHHHHHHHHH--------HCCCCCCCHHH-HHHHHHHHHC---CCCCCC---CCHH------HH--HHHHHHHCCCEEEE T ss_conf 5057899887--------43987575289-9999999848---999889---6834------99--99999976993999 Q ss_pred EHHHHHH Q ss_conf 2475657 Q gi|254780201|r 383 VDCVAEY 389 (394) Q Consensus 383 v~~~~~~ 389 (394) ++=++-| T Consensus 193 ipe~L~~ 199 (201) T cd04195 193 LPEILVK 199 (201) T ss_pred CCCHHHE T ss_conf 0816851 No 56 >COG2943 MdoH Membrane glycosyltransferase [Cell envelope biogenesis, outer membrane] Probab=32.88 E-value=36 Score=14.82 Aligned_cols=145 Identities=19% Similarity=0.233 Sum_probs=82.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHHHHHHH------HHHHHHC--CCCEEEEEECCCCHHHH Q ss_conf 999973026642024678221011000001473899994167789999------9999708--99814999717652899 Q gi|254780201|r 119 DSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEI------SHILLRL--NFDFDLFVTVVEANKDF 190 (394) Q Consensus 119 ~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DLl~E~------~~~L~~i--p~~~DL~ITt~~~~~~~ 190 (394) .-.||..+=.+.-+....-+....+.+....+-|+++-+|..|..--| .+.++.. ...||.||-.++.+.++ T Consensus 114 F~tAl~GF~~L~~~~~r~~~~~p~~p~p~~hrTAilmPiynEd~~rVfAgLrA~~eSla~Tg~~~~FD~FVLSDs~dpdi 193 (736) T COG2943 114 FWTALMGFLVLLFGRDRYLSIAPNEPLPDLHRTAILMPIYNEDVNRVFAGLRATYESLAATGHAEHFDFFVLSDSRDPDI 193 (736) T ss_pred HHHHHHHHHHEEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCHH T ss_conf 99999877611016887678899998985322157730346678999998999999998617612002899748998225 Q ss_pred ---HH----HHHHCCC-CCEEEE---ECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf ---99----9971178-738999---456660479999862210027867899973687767777644407899999987 Q gi|254780201|r 191 ---EQ----DVLKYFP-SAQLYV---MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 (394) Q Consensus 191 ---~~----~~~~~~~-~a~V~v---v~NRGRDI~PfL~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~ 259 (394) ++ .+.+... ..+|.. -.|.||--+-.=--|+. --+.|++.+-+.-- | T Consensus 194 alAEq~a~~~l~~e~~g~~~ifYRrRr~n~~RKaGNIaDfcrR-wG~~Y~~MlVLDAD-S-------------------- 251 (736) T COG2943 194 ALAEQKAWAELCRELGGEGNIFYRRRRRNVKRKAGNIADFCRR-WGSAYSYMLVLDAD-S-------------------- 251 (736) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEHHHHHHHCCCCCCHHHHHHH-HCCCCCEEEEEECC-C-------------------- T ss_conf 6668999999999836777266406755522334678899987-37653258996054-3-------------------- Q ss_pred HCCCHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 479989999999997609976898168 Q gi|254780201|r 260 LLGFSDIAIRIINTFEQNPCLGMIGSR 286 (394) Q Consensus 260 LLGs~~~v~~Il~~F~~~p~lGlv~p~ 286 (394) +. +.+...+..+.++.||+.|++-.. T Consensus 252 vM-tgd~lvrLv~~ME~~P~aGlIQt~ 277 (736) T COG2943 252 VM-TGDCLVRLVRLMEANPDAGLIQTS 277 (736) T ss_pred CC-CHHHHHHHHHHHHHCCCCCEEECC T ss_conf 35-727899999998529997603215 No 57 >cd03813 GT1_like_3 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=32.78 E-value=25 Score=15.78 Aligned_cols=59 Identities=25% Similarity=0.413 Sum_probs=37.9 Q ss_pred CCHHHHHHHHHHHHHCCCC-EEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHH Q ss_conf 1677899999999708998-1499971765289999997117873899945666047999986 Q gi|254780201|r 158 YYQDTWIEISHILLRLNFD-FDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYL 219 (394) Q Consensus 158 yY~DLl~E~~~~L~~ip~~-~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~PfL~~ 219 (394) |+-+||-.|+..|+.+-.. .|.++|+.+.+...+.+ -.-+..+++++|| |-|+..|=.. T Consensus 225 ~~r~~wi~ff~~l~~~~Y~~Ad~I~~l~~~nr~~qi~--~Ga~~~k~~vIpN-GId~~~F~~~ 284 (475) T cd03813 225 YFRRLWIRFFESLGRLAYQAADRITTLYEGNRERQIE--DGADPEKIRVIPN-GIDPERFAPA 284 (475) T ss_pred HHHHHHHHHHHHHHHHHHHHHCEEECCCHHHHHHHHH--HCCCHHHCEECCC-CCCHHHCCCC T ss_conf 9999999999999999998517000134244899998--1989999441179-8329884887 No 58 >cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. Probab=32.74 E-value=37 Score=14.80 Aligned_cols=38 Identities=26% Similarity=0.377 Sum_probs=21.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHCC--CCCEEEEECCCCHHHHHH Q ss_conf 981499971765289999997117--873899945666047999 Q gi|254780201|r 175 FDFDLFVTVVEANKDFEQDVLKYF--PSAQLYVMENKGRDVRPF 216 (394) Q Consensus 175 ~~~DL~ITt~~~~~~~~~~~~~~~--~~a~V~vv~NRGRDI~Pf 216 (394) ...|.+|++++..++. +.+.+ +...+.+++| |-|+..| T Consensus 143 ~~~d~ii~~S~~~~~~---l~~~~~~~~~ki~vI~n-gid~~~~ 182 (374) T cd03801 143 RRADRIIAVSEATREE---LRELGGVPPEKITVIPN-GVDTERF 182 (374) T ss_pred HHCCEEEECCHHHHHH---HHHHCCCCHHHEEEECC-CCCHHHC T ss_conf 8389999989999999---99861998568999889-7675541 No 59 >TIGR02708 L_lactate_ox L-lactate oxidase; InterPro: IPR014080 Members of this entry oxidize L-lactate to pyruvate, reducing molecular oxygen to hydrogen peroxide. The enzyme is known in Aerococcus viridans, Streptococcus iniae, and some strains of Streptococcus pyogenes where it appears to contribute to virulence.. Probab=32.72 E-value=37 Score=14.80 Aligned_cols=60 Identities=23% Similarity=0.196 Sum_probs=46.6 Q ss_pred EEECCHHHHHHHHHHHHHCCCCEEEEEECCCC-HHHHHHHHHHCCCCCEE-----EEECCCCHHHH Q ss_conf 99416778999999997089981499971765-28999999711787389-----99456660479 Q gi|254780201|r 155 VHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYFPSAQL-----YVMENKGRDVR 214 (394) Q Consensus 155 lHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~-~~~~~~~~~~~~~~a~V-----~vv~NRGRDI~ 214 (394) .=-|=.--++||.+.|.--|.=|.+|.+-++. ++++..++....-.|-| .|-.||-||.- T Consensus 115 ~S~YS~~~l~EIS~aL~G~P~WFQ~Y~~KDD~~NR~I~D~~Ka~G~~AIvLTADaTV~GNR~~D~~ 180 (368) T TIGR02708 115 TSSYSTADLPEISEALNGTPHWFQFYMSKDDGINRDILDRVKADGAKAIVLTADATVGGNREVDKR 180 (368) T ss_pred EECCCCCCHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCC T ss_conf 101246767999976279971588887403433346788875278528997214633577441355 No 60 >pfam09746 Membralin Tumour-associated protein. Membralin is evolutionarily highly conserved; though it seems to represent a unique protein family. The protein appears to contain several transmembrane regions. In humans it is expressed in certain cancers, particularly ovarian cancers. Probab=32.15 E-value=37 Score=14.74 Aligned_cols=102 Identities=20% Similarity=0.301 Sum_probs=55.2 Q ss_pred CCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCEEEEE----CCCCCCCCCCCCCCCCCCCHHHHHHHHHH-----C Q ss_conf 64440789999998747998999999999760-99768981----68765443323455432213899999997-----4 Q gi|254780201|r 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ-NPCLGMIG----SRRYRRYKRWSFFAKRSEVYRRVIDLAKR-----A 314 (394) Q Consensus 245 ~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~-~p~lGlv~----p~~~~~~~~~~~~~~~~~~~~~~~~L~~r-----l 314 (394) +.+.|+.|-+.+++..+|-.+.+-+-+..+++ +++-|.+. -+.+....-|. -..+ ..++.- . T Consensus 216 ~~cfg~~~~r~ll~~flGYddilm~s~k~lAe~~~n~Gyl~n~~t~e~yrf~~~~~---~~~~-----sy~~a~~~m~~f 287 (375) T pfam09746 216 DKCFGDSFSRFLLKEFLGYDDILMASVKTLAEQEENKGYLRNVITGEHYRFVSMWW---MARS-----SYPAAFCIMLLF 287 (375) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHEEHHH---CCCH-----HHHHHHHHHHHH T ss_conf 63100479999999862635888989998753378876155421211310100110---1202-----788999999875 Q ss_pred CCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 899885547737701233209888757658822358544688 Q gi|254780201|r 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNL 356 (394) Q Consensus 315 gi~~~~~~~~FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~ 356 (394) .+.+ .-.+.+.---||-+=.+.|+.+ +.+..-.||.-|.. T Consensus 288 t~sv-s~~lr~s~~~~~~f~v~ll~~~-~~~~~~~~p~ap~l 327 (375) T pfam09746 288 TFSV-SMLLRYSHHQIFVFIVDLLQML-EYNVSARFPAAPLL 327 (375) T ss_pred HHHH-HHHHHHHCCHHHHHHHHHHHHH-HHCCCCCCCCHHHH T ss_conf 3431-0110320220212189888888-61673023522689 No 61 >TIGR01357 aroB 3-dehydroquinate synthase; InterPro: IPR016037 The 3-dehydroquinate synthase (4.2.3.4 from EC) domain is present in isolation in various bacterial 3-dehydroquinate synthases and also present as a domain in the pentafunctional AROM polypeptide (P07547 from SWISSPROT) . 3-dehydroquinate (DHQ) synthase catalyses the formation of dehydroquinate (DHQ) and orthophosphate from 3-deoxy-D-arabino heptulosonic 7 phosphate . This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids.; GO: 0003856 3-dehydroquinate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=32.15 E-value=17 Score=16.76 Aligned_cols=195 Identities=17% Similarity=0.168 Sum_probs=85.7 Q ss_pred CCEEEEEEECCHHHH-HHHHHHHHHCCCCEEEEEECC--CCHHH------HHHH-HHHCCCC-CEEEEEC-CCCHHHHHH Q ss_conf 473899994167789-999999970899814999717--65289------9999-9711787-3899945-666047999 Q gi|254780201|r 149 SKIAIVVHCYYQDTW-IEISHILLRLNFDFDLFVTVV--EANKD------FEQD-VLKYFPS-AQLYVME-NKGRDVRPF 216 (394) Q Consensus 149 ~kIAVvlHlyY~DLl-~E~~~~L~~ip~~~DL~ITt~--~~~~~------~~~~-~~~~~~~-a~V~vv~-NRGRDI~Pf 216 (394) +|+.++.+=-=.++. +.+.+.+++...+. +.+|+| |+.|. +... +.+.++. ..|..+. =-==|++-| T Consensus 26 ~~~~~~~d~~v~~~~~~~~~~~~~~~g~~v-~~~~~p~GE~~K~l~~~~~l~~~L~~~~~~R~~~i~~~GGGvvgDlaGF 104 (361) T TIGR01357 26 SKLVIVTDETVADLYAEKLLEALQSAGFNV-LVLTVPDGEESKSLETVQRLYDQLLEAGLDRSSTIIALGGGVVGDLAGF 104 (361) T ss_pred CEEEEEECHHHHHHHHHHHHHHHHHCCCCE-EEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHH T ss_conf 507998674568888889999998538752-6887078987454888999999999658895647888545377878999 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHH-----HHHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9862210027867899973687767777644407899999987479989-----99999999760997689816876544 Q gi|254780201|r 217 LYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD-----IAIRIINTFEQNPCLGMIGSRRYRRY 291 (394) Q Consensus 217 L~~~~~~~l~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~-----~v~~Il~~F~~~p~lGlv~p~~~~~~ 291 (394) +-.- ..++=++|- .+ - =.--++|+=+|.+- .+.|.|.+|-+ |+.=++-|.... . T Consensus 105 vAat---y~RGi~~iq---------vP-T------TLLAmVDsSvGGKtGiN~~~gKNlIGaFy~-P~~V~~D~~~L~-T 163 (361) T TIGR01357 105 VAAT---YMRGIRFIQ---------VP-T------TLLAMVDSSVGGKTGINFPLGKNLIGAFYQ-PKAVLIDPDFLK-T 163 (361) T ss_pred HHHH---HHHCCCEEE---------CC-C------HHHHHHHHCCCCCCCCCCCCCCCCEEEEEC-CCCEEEEHHHHC-C T ss_conf 9999---961558264---------14-0------477776530376411137888630421137-740265014415-8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC---EEEEE--EEHHHHHHHHCCC------CCCCCCCCCCC---- Q ss_conf 332345543221389999999748998855477377---01233--2098887576588------22358544688---- Q gi|254780201|r 292 KRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN---GTMFW--VKPKCLEPLRNLH------LIGEFEEERNL---- 356 (394) Q Consensus 292 ~~~~~~~~~~~~~~~~~~L~~rlgi~~~~~~~~Fpa---GSMFW--~R~~AL~pL~~l~------l~~DF~~E~g~---- 356 (394) .|..- ...-+.+..| +|+-..++-+++.- ..+.= +.-++|..+-... -=...+-|.|. T Consensus 164 LP~~E------~~~G~AEviK-~g~i~D~~~~~~l~~~~~~~~~~~~~~~~l~~~i~~S~~~Ka~iV~~De~E~GlR~~L 236 (361) T TIGR01357 164 LPERE------LRSGMAEVIK-HGLIADAELFDELEKNEKLLLNLESELEALEELIKRSIEVKASIVAEDEKESGLRAIL 236 (361) T ss_pred CCHHH------HHCCCHHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 84688------8644357788-8751897799887200676433116668999999988688899831281200247665 Q ss_pred -CCCHHHHHHHHHHHHH Q ss_conf -8873899999999999 Q gi|254780201|r 357 -KDGALEHAVERFFACS 372 (394) Q Consensus 357 -~DGTlaHAIERl~~~i 372 (394) .=-|++||||-+.+|= T Consensus 237 NfGHT~GHAiE~~~~y~ 253 (361) T TIGR01357 237 NFGHTIGHAIEALSGYG 253 (361) T ss_pred HCCCHHHHHHHHHHCCC T ss_conf 22545999999740689 No 62 >pfam07845 DUF1636 Protein of unknown function (DUF1636). The sequences featured in this family are derived from a number of hypothetical prokaryotic proteins. The region in question is approximately 130 amino acids long. Probab=31.81 E-value=34 Score=15.03 Aligned_cols=13 Identities=31% Similarity=0.785 Sum_probs=5.5 Q ss_pred EEEEEE-EHHHHHH Q ss_conf 012332-0988875 Q gi|254780201|r 328 GTMFWV-KPKCLEP 340 (394) Q Consensus 328 GSMFW~-R~~AL~p 340 (394) |-|=|. ||+.|+. T Consensus 95 G~vp~~~rP~~lr~ 108 (116) T pfam07845 95 GLVPWRERPEVLRK 108 (116) T ss_pred CCCCCCCCHHHHHH T ss_conf 86442337598973 No 63 >COG1803 MgsA Methylglyoxal synthase [Carbohydrate transport and metabolism] Probab=29.55 E-value=41 Score=14.47 Aligned_cols=44 Identities=18% Similarity=0.197 Sum_probs=37.0 Q ss_pred HCCCCEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHH Q ss_conf 00147389999416778999999997089981499971765289 Q gi|254780201|r 146 TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD 189 (394) Q Consensus 146 ~~k~kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~ 189 (394) ..+++||+++|=.--|.+-.|...-+.+-..+|||-|-++.... T Consensus 2 ~~~k~IALIAHD~kK~~lv~f~~~~~~~L~~h~L~ATGTTG~~i 45 (142) T COG1803 2 PARKHIALIAHDHKKDDLVNFVQAHKELLSRHDLYATGTTGGLI 45 (142) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHHHHCEEEEECCCHHHH T ss_conf 85515888843110889999999989975315689844712788 No 64 >KOG4425 consensus Probab=29.25 E-value=42 Score=14.44 Aligned_cols=75 Identities=28% Similarity=0.455 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEHHHH-HH---- Q ss_conf 9999999976099768981687654433234554322138999999974899885547737701233209888-75---- Q gi|254780201|r 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCL-EP---- 340 (394) Q Consensus 266 ~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~~~~~~~~FpaGSMFW~R~~AL-~p---- 340 (394) +....|++++..|-|||+.|- .. ..+|. |+.. ++.- -|-+||+||.-- -| T Consensus 561 nfke~ldal~aapflg~ilpw--------------gd----fselq---gi~e-d~sd---dgpifwvrpgeqmiptadg 615 (900) T KOG4425 561 NFKEFLDALEAAPFLGMILPW--------------GD----FSELQ---GICE-DQSD---DGPIFWVRPGEQMIPTADG 615 (900) T ss_pred CHHHHHHHHHCCCCCCCCCCC--------------CC----HHHHH---HHHC-CCCC---CCCEEEECCCCCCCCCCCC T ss_conf 579999987407300145664--------------44----68763---1110-4578---8865785463323646778 Q ss_pred -------------HHCCCCC--CCCCCCCCCCCCHHHHHH Q ss_conf -------------7658822--358544688887389999 Q gi|254780201|r 341 -------------LRNLHLI--GEFEEERNLKDGALEHAV 365 (394) Q Consensus 341 -------------L~~l~l~--~DF~~E~g~~DGTlaHAI 365 (394) +.++.+. ..-+.|..-.|-|-|||= T Consensus 616 pkspfkrnrs~ne~knhpla~rgnd~retaf~drtnahad 655 (900) T KOG4425 616 PKSPFKRNRSMNEIKNHPLAPRGNDPRETAFEDRTNAHAD 655 (900) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCHHHHH T ss_conf 8771000343242013666678897034333102112355 No 65 >COG5469 Predicted metal-binding protein [Function unknown] Probab=28.71 E-value=43 Score=14.38 Aligned_cols=13 Identities=31% Similarity=0.718 Sum_probs=8.6 Q ss_pred EEEEEE-EHHHHHH Q ss_conf 012332-0988875 Q gi|254780201|r 328 GTMFWV-KPKCLEP 340 (394) Q Consensus 328 GSMFW~-R~~AL~p 340 (394) |-|=|- ||+.|+. T Consensus 115 G~~pw~~rPe~Lk~ 128 (143) T COG5469 115 GLVPWRKRPEPLKR 128 (143) T ss_pred CCCCCCCCCHHHHC T ss_conf 75542429866750 No 66 >COG1216 Predicted glycosyltransferases [General function prediction only] Probab=27.79 E-value=44 Score=14.28 Aligned_cols=170 Identities=16% Similarity=0.169 Sum_probs=91.0 Q ss_pred CEEEEEEEC-CHHHHHHHHHHHHHCCCCEEEEEECCCCHH-HHHHHHHHC-CCCCEEEEE-CCCCHHHHHHHHHHCCCCC Q ss_conf 738999941-677899999999708998149997176528-999999711-787389994-5666047999986221002 Q gi|254780201|r 150 KIAIVVHCY-YQDTWIEISHILLRLNFDFDLFVTVVEANK-DFEQDVLKY-FPSAQLYVM-ENKGRDVRPFLYLLELGVF 225 (394) Q Consensus 150 kIAVvlHly-Y~DLl~E~~~~L~~ip~~~DL~ITt~~~~~-~~~~~~~~~-~~~a~V~vv-~NRGRDI~PfL~~~~~~~l 225 (394) +|++++=.| ..+.+.+++..|.+...+.+.+|-++.... .....+... ++++.+... +|-|=- +.+=++.....- T Consensus 4 ~i~~iiv~yn~~~~l~~~l~~l~~~~~~~~~iv~vDn~s~d~~~~~~~~~~~~~v~~i~~~~NlG~a-gg~n~g~~~a~~ 82 (305) T COG1216 4 KISIIIVTYNRGEDLVECLASLAAQTYPDDVIVVVDNGSTDGSLEALKARFFPNVRLIENGENLGFA-GGFNRGIKYALA 82 (305) T ss_pred CEEEEEEEECCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECCCCCEEH-HHHHHHHHHHHC T ss_conf 3599999968888999999999717687635998079997322888776058868999889886708-888899999862 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHH Q ss_conf 78678999736877677776444078999999874799899999999976099768981687654433234554322138 Q gi|254780201|r 226 DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR 305 (394) Q Consensus 226 ~~YD~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~ 305 (394) .++++++.+-.= . .-+++.+++.++.++.++..|+++|.................... T Consensus 83 ~~~~~~l~LN~D-~---------------------~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 140 (305) T COG1216 83 KGDDYVLLLNPD-T---------------------VVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDG 140 (305) T ss_pred CCCCEEEEEECC-C---------------------CCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 898489998388-6---------------------879069999999998689877963465488753234434564443 Q ss_pred -HHHHHHH-HCCCCC--CCCCCC--CCCEEEEEEEHHHHHHHH Q ss_conf -9999999-748998--855477--377012332098887576 Q gi|254780201|r 306 -RVIDLAK-RAGFPT--KRLHLD--FFNGTMFWVKPKCLEPLR 342 (394) Q Consensus 306 -~~~~L~~-rlgi~~--~~~~~~--FpaGSMFW~R~~AL~pL~ 342 (394) ....... ...... ...... +..|.++-.|.++++.+- T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~li~~~~~~~vG 183 (305) T COG1216 141 LTGGWRASPLLEIAPDLSSYLEVVASLSGACLLIRREAFEKVG 183 (305) T ss_pred CCCCHHCCCCCCCCCCCCCCCHHHHHCCCCEEEEEHHHHHHHC T ss_conf 3220002433344443334201333225617997499999847 No 67 >COG1922 WecG Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane] Probab=27.67 E-value=45 Score=14.27 Aligned_cols=130 Identities=7% Similarity=-0.101 Sum_probs=68.2 Q ss_pred CCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 98861575689789999999999999997510015856439999731377410073268718755999999730266420 Q gi|254780201|r 52 RITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFE 131 (394) Q Consensus 52 ~~~~~~~~~~tP~~f~~~~~wl~~~l~~~~~~~~~~~~~~ivfinaWNEEG~yLePD~~~G~~yL~a~~~al~~~~~~~~ 131 (394) ..-|.-+.+.|++.+-.. .......... ..++|.|+++--- .+-| ..|-+.+.+|-+-.|. T Consensus 11 ~ilg~~v~~~~~~~~~~~---------i~~~~~~~~~-~~vv~~N~e~~~~--a~~d----~e~~~~i~~A~li~pD--- 71 (253) T COG1922 11 NILGLPVDNVTWDEAVAL---------ILGRIEQGKP-TTVVTLNAEKVLL--ARKD----PEFREILNQADLILPD--- 71 (253) T ss_pred EEECCEEECCCHHHHHHH---------HHHHHHCCCC-CEEEECCHHHHHH--HCCC----HHHHHHHHHCCEECCC--- T ss_conf 871624531777999999---------9988743775-2899847999998--6238----9999998615788268--- Q ss_pred CCCCCCCCCHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEE Q ss_conf 246782210110000014738999941677899999999708998149997176528999999711787389994 Q gi|254780201|r 132 GWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVM 206 (394) Q Consensus 132 ~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv 206 (394) +++-....-....-++.--+.+.|++++++++...-..+.=|+=..++-..+..+.+.+.+|...|... T Consensus 72 ------G~gvV~~ar~~~g~~~~~rv~G~Dl~~~Ll~~a~~~~~~vfllGgkp~V~~~a~~~l~~~~p~l~ivg~ 140 (253) T COG1922 72 ------GIGVVRAARRLLGQPLPERVAGTDLVEALLKRAAEEGKRVFLLGGKPGVAEQAAAKLRAKYPGLKIVGS 140 (253) T ss_pred ------CHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf ------653999999981766765478489999999971756864999648778999999999987899569995 No 68 >COG3660 Predicted nucleoside-diphosphate-sugar epimerase [Cell envelope biogenesis, outer membrane] Probab=27.50 E-value=45 Score=14.25 Aligned_cols=88 Identities=16% Similarity=0.303 Sum_probs=60.5 Q ss_pred HCCCCEEEEEE------ECCHHHHHHHHHHHHHC--CCCEEEEEECCCCH-HHHHHHHHHCCCCCEEEEECCCCHHHHHH Q ss_conf 00147389999------41677899999999708--99814999717652-89999997117873899945666047999 Q gi|254780201|r 146 TIKSKIAIVVH------CYYQDTWIEISHILLRL--NFDFDLFVTVVEAN-KDFEQDVLKYFPSAQLYVMENKGRDVRPF 216 (394) Q Consensus 146 ~~k~kIAVvlH------lyY~DLl~E~~~~L~~i--p~~~DL~ITt~~~~-~~~~~~~~~~~~~a~V~vv~NRGRDI~Pf 216 (394) ..+++|||++- .|.+|-+.++.+.|..+ .+...++||.+... ...++.+.+..+...+.+-+|+--+--|. T Consensus 159 ~~rq~vAVlVGg~nk~f~~~~d~a~q~~~~l~k~l~~~g~~~lisfSRRTp~~~~s~l~~~l~s~~~i~w~~~d~g~NPY 238 (329) T COG3660 159 LPRQRVAVLVGGNNKAFVFQEDKAHQFASLLVKILENQGGSFLISFSRRTPDTVKSILKNNLNSSPGIVWNNEDTGYNPY 238 (329) T ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHCCCCCCEEEECCCCCCCCCH T ss_conf 87744999966788777667789999999999998747851899960689178999997135668445737987898816 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCC Q ss_conf 986221002786789997368776 Q gi|254780201|r 217 LYLLELGVFDRYDYLCKIHGKKSQ 240 (394) Q Consensus 217 L~~~~~~~l~~YD~v~klHtKKS~ 240 (394) +--+. .=||| |-|+-|. T Consensus 239 ~~~La-----~Adyi--i~TaDSi 255 (329) T COG3660 239 IDMLA-----AADYI--ISTADSI 255 (329) T ss_pred HHHHH-----HCCEE--EEECCHH T ss_conf 88885-----21337--8704301 No 69 >COG0337 AroB 3-dehydroquinate synthetase [Amino acid transport and metabolism] Probab=27.29 E-value=24 Score=15.96 Aligned_cols=36 Identities=14% Similarity=0.031 Sum_probs=19.0 Q ss_pred CCCEEEEEEECCHHH-HHHHHHHHHHCCCCEEEEEECC Q ss_conf 147389999416778-9999999970899814999717 Q gi|254780201|r 148 KSKIAIVVHCYYQDT-WIEISHILLRLNFDFDLFVTVV 184 (394) Q Consensus 148 k~kIAVvlHlyY~DL-l~E~~~~L~~ip~~~DL~ITt~ 184 (394) ..|++++----=.++ ++.+...|.+..... ..+++| T Consensus 33 ~~k~~ivtd~~v~~~y~~~~~~~l~~~g~~v-~~~~lp 69 (360) T COG0337 33 GRKVAIVTDETVAPLYLEKLLATLEAAGVEV-DSIVLP 69 (360) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCCEE-EEEEEC T ss_conf 9749999771157877999999998558605-799957 No 70 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=26.61 E-value=47 Score=14.15 Aligned_cols=77 Identities=17% Similarity=0.175 Sum_probs=49.2 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCHHHH-HHHHHHCCCCCEEEEECCCCHHHHHHHHHHCCCCC--CCCCEEEEECC Q ss_conf 7789999999970899814999717652899-99997117873899945666047999986221002--78678999736 Q gi|254780201|r 160 QDTWIEISHILLRLNFDFDLFVTVVEANKDF-EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVF--DRYDYLCKIHG 236 (394) Q Consensus 160 ~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~-~~~~~~~~~~a~V~vv~NRGRDI~PfL~~~~~~~l--~~YD~v~klHt 236 (394) --|++..++++.++...-+++|.|.++.... +.++....+..+|. +|=.|||-+|=+...-.... ..-+.++-+.+ T Consensus 33 ~sLlq~T~~R~~~~~~~~~i~IvTn~~~~~~v~~ql~~~~~~~~ii-~EP~~rnTApAI~laa~~~~~~~~d~~l~v~Ps 111 (274) T cd02509 33 KSLLQQTLDRLKGLVPPDRILVVTNEEYRFLVREQLPEGLPEENII-LEPEGRNTAPAIALAALYLAKRDPDAVLLVLPS 111 (274) T ss_pred CCHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHCCCCCCEE-ECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 9899999998746898678099957217999999988439866489-676888979999999999997599977999847 Q ss_pred C Q ss_conf 8 Q gi|254780201|r 237 K 237 (394) Q Consensus 237 K 237 (394) - T Consensus 112 D 112 (274) T cd02509 112 D 112 (274) T ss_pred C T ss_conf 4 No 71 >cd06321 PBP1_ABC_sugar_binding_like_11 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=25.39 E-value=49 Score=14.02 Aligned_cols=71 Identities=10% Similarity=0.104 Sum_probs=36.9 Q ss_pred CEEEEE----EECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHH----HHHHCCCCCEEEEECCCCHHHHHHHHHHC Q ss_conf 738999----9416778999999997089981499971765289999----99711787389994566604799998622 Q gi|254780201|r 150 KIAIVV----HCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLE 221 (394) Q Consensus 150 kIAVvl----HlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~----~~~~~~~~a~V~vv~NRGRDI~PfL~~~~ 221 (394) ||+|++ +-||.++..-+-++++.....+.++++-.+.+.+.+. .+...-.. -|.+.+.-...+.|.+..+. T Consensus 1 KIGvivp~i~npff~~~~~gie~~a~~~g~~~~v~~~~~~~d~~~q~~~i~~li~~~vD-gIii~~~~~~~~~~~i~~a~ 79 (271) T cd06321 1 KIGVSVGDLGNPFFVALAKGAEAAAKKLNPGVKVTVVSADYDLNKQVSQIDNFIAAKVD-LILLNAVDSKGIAPAVKRAQ 79 (271) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCC-EEEEECCCCCHHHHHHHHHH T ss_conf 98999489989899999999999999719976999807988999999999999974998-89993478403599999999 No 72 >cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. Probab=24.96 E-value=23 Score=16.03 Aligned_cols=43 Identities=33% Similarity=0.521 Sum_probs=33.6 Q ss_pred CCCEEEEEEEHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 37701233209888757658822358544688887389999999 Q gi|254780201|r 325 FFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF 368 (394) Q Consensus 325 FpaGSMFW~R~~AL~pL~~l~l~~DF~~E~g~~DGTlaHAIERl 368 (394) |+.++..|+.+.-|+||.. +..++|..++-..|..|.+|.|.. T Consensus 48 F~~~~y~Wi~~~dL~pL~~-e~i~~~L~~~~~K~k~L~~AY~~A 90 (93) T cd05840 48 FPDGDYYWVPNKDLKPLTE-EKIAKFLKKPKRKDKELIKAYKAA 90 (93) T ss_pred ECCCCEEEECHHHCCCCCH-HHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 1699668757654774999-999998643355888999999985 No 73 >TIGR02088 LEU3_arch isopropylmalate/isohomocitrate dehydrogenases; InterPro: IPR011828 This entry represents a group of archaeal decarboxylating dehydrogenases which include the leucine biosynthesis enzyme 3-isopropylmalate dehydrogenase (LeuB, LEU3) and the methanogenic cofactor CoB biosynthesis enzyme isohomocitrate dehydrogenase (AksF). Both of these have been characterised in Methanococcus janaschii . Sequences from the non-methanogenic archaea are also represented in this entry, presumably LeuB, although phylogenetic trees cannot establish which gene is which in the methanogens.; GO: 0003862 3-isopropylmalate dehydrogenase activity, 0051287 NAD binding, 0009098 leucine biosynthetic process. Probab=24.46 E-value=51 Score=13.91 Aligned_cols=52 Identities=19% Similarity=0.125 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCC-C--HHHHHHHHHHCCCCCEEEEECCCCHHHHHH Q ss_conf 899999999708998149997176-5--289999997117873899945666047999 Q gi|254780201|r 162 TWIEISHILLRLNFDFDLFVTVVE-A--NKDFEQDVLKYFPSAQLYVMENKGRDVRPF 216 (394) Q Consensus 162 Ll~E~~~~L~~ip~~~DL~ITt~~-~--~~~~~~~~~~~~~~a~V~vv~NRGRDI~Pf 216 (394) +.|.++-.|-.-|..||.+|||.- . -++..+.+.-.+ =+....|.|=+.+=| T Consensus 216 ~vDs~a~~Lv~~P~~fDVIVTtNmFGDIlSD~A~~l~GsL---GLaPSANiGd~~AlF 270 (350) T TIGR02088 216 LVDSAAMNLVKDPEKFDVIVTTNMFGDILSDLAAALVGSL---GLAPSANIGDRKALF 270 (350) T ss_pred HHHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHHHH---CCCCCCCCCCCCCCC T ss_conf 8999999852386306767862345241889999985022---136765210452335 No 74 >TIGR00287 cas1 CRISPR-associated protein Cas1; InterPro: IPR002729 This family of proteins are found in archaea and bacteria and are, as yet, functionally uncharacterised. It is one of four protein families in prokaryotic genomes that contain multiple CRISPR elements. CRISPR is an acronym for Clustered Regularly Interspaced Short Palindromic Repeats. The cas genes are found near the repeats . This protein is otherwise uncharacterised.. Probab=24.27 E-value=52 Score=13.89 Aligned_cols=39 Identities=18% Similarity=0.099 Sum_probs=25.7 Q ss_pred CCCCCCC-EEEEECCCCCCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 0027867-89997368776777764440789999998747 Q gi|254780201|r 223 GVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 (394) Q Consensus 223 ~~l~~YD-~v~klHtKKS~~~~~~~~~G~~Wr~~Ll~~LL 261 (394) ...-++| +++++|+-+|-..+.-.+--+.||.-+.|.++ T Consensus 235 ~~~~GLdPy~GflH~~~~~r~sLvlDlmE~Fr~~~vD~~~ 274 (353) T TIGR00287 235 IIIVGLDPYIGFLHTDRSGRPSLVLDLMEEFRPVIVDRLV 274 (353) T ss_pred HHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9982578403474211589865455578764377889999 No 75 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=23.87 E-value=53 Score=13.84 Aligned_cols=56 Identities=14% Similarity=0.194 Sum_probs=24.6 Q ss_pred CEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCHHHHHH Q ss_conf 7389999416778999999997089981499971765289999997117873899945666047999 Q gi|254780201|r 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPF 216 (394) Q Consensus 150 kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~~~~~~~~~~~~~~a~V~vv~NRGRDI~Pf 216 (394) ++.+.+|-....-....... ....+.++++++..+ ....+...+.+++| |=|..-| T Consensus 109 p~v~t~H~~~~~~~~~~~~~----~~~~~~~i~vS~~~~------~~~~~~~~~~vI~n-gvd~~~f 164 (335) T cd03802 109 PVVTTLHGPPDPELLKLYYA----ARPDVPFVSISDAQR------RPWPPLPWVATVHN-GIDLDDY 164 (335) T ss_pred CEEEEECCCCCHHHHHHHHH----HCCCCEEEECCHHHH------HHCCCCCCEEEECC-CCCHHHC T ss_conf 78999899970677999975----235868999459999------54577677799879-9888767 No 76 >pfam08589 DUF1770 Fungal protein of unknown function (DUF1770). The function of this family is unknown. These proteins are rather dissimilar except for a single strongly conserved motif (PDLRFEQ). Probab=22.49 E-value=38 Score=14.73 Aligned_cols=21 Identities=5% Similarity=-0.243 Sum_probs=18.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 732687187559999997302 Q gi|254780201|r 106 RLNRFMSNSRMPFDSEKFLYV 126 (394) Q Consensus 106 ePD~~~G~~yL~a~~~al~~~ 126 (394) .||+||--+||..++.|-.+. T Consensus 70 lPDLRFEQSYL~SI~~A~tW~ 90 (96) T pfam08589 70 LPDLRFEQSYLASIRGADSWG 90 (96) T ss_pred CCCHHHHHHHHHHHHCCCCHH T ss_conf 965137899999875242121 No 77 >cd07969 OBF_DNA_ligase_I The Oligonucleotide/oligosaccharide binding (OB)-fold domain of ATP-dependent DNA ligase I is a DNA-binding module that is part of the catalytic core unit. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. ATP-dependent ligases are present in many organisms such as viruses, bacteriohages, eukarya, archaea and bacteria. There are three classes of ATP-dependent DNA ligases in eukaryotic cells (I, III and IV). This group is composed of eukaryotic DNA ligase I, Sulfolobus solfataricus DNA ligase and similar proteins. DNA ligase I is required for the ligation of Okazaki fragments during lagging-strand DNA synthesis and for base excision repair (BER). ATP dependent DNA ligases have a highly modular architecture consist Probab=21.40 E-value=32 Score=15.13 Aligned_cols=33 Identities=18% Similarity=0.151 Sum_probs=25.1 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCC Q ss_conf 566604799998622100278678999736877 Q gi|254780201|r 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKS 239 (394) Q Consensus 207 ~NRGRDI~PfL~~~~~~~l~~YD~v~klHtKKS 239 (394) ..|+.=+++||.++.+..-..|-.|||++|-=| T Consensus 16 GkR~g~~ssfLlgv~d~~~~~~~~v~kVGtGfs 48 (144) T cd07969 16 GKRTGVYGAFLLACYDPETEEFQTVCKIGTGFS 48 (144) T ss_pred CCCCCCCCCEEEEEECCCCCEEEEEEEECCCCC T ss_conf 776774023799999389993999999647899 No 78 >TIGR01169 rplA_bact ribosomal protein L1; InterPro: IPR005878 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L1 is the largest protein from the large ribosomal subunit. In Escherichia coli, L1 is known to bind to the 23S rRNA. This model describe s bacterial and chloroplast ribosomal protein L1. Most mitochondrial L1 sequences are sufficiently divergent to be the contained in a different entry (IPR005879 from INTERPRO).; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=20.90 E-value=60 Score=13.47 Aligned_cols=73 Identities=15% Similarity=0.132 Sum_probs=43.8 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCHHHH--HHHHHH---CCCCCEEEEECCCCHHHHHHHHHHCCCCCC-CCC--EEEEEC Q ss_conf 999999970899814999717652899--999971---178738999456660479999862210027-867--899973 Q gi|254780201|r 164 IEISHILLRLNFDFDLFVTVVEANKDF--EQDVLK---YFPSAQLYVMENKGRDVRPFLYLLELGVFD-RYD--YLCKIH 235 (394) Q Consensus 164 ~E~~~~L~~ip~~~DL~ITt~~~~~~~--~~~~~~---~~~~a~V~vv~NRGRDI~PfL~~~~~~~l~-~YD--~v~klH 235 (394) +|+.+.+++--..||..|+||+-=..+ .-.++- ..||-.+=.| .-||+-=+..++.+... +=| =+||+- T Consensus 96 ~DLie~Ik~G~~dFDV~IATPDmM~~VGkLG~iLGPRGLMPNPK~GTV---T~dva~Av~~~K~G~vefR~DK~G~iH~~ 172 (227) T TIGR01169 96 DDLIEKIKKGWLDFDVVIATPDMMRKVGKLGRILGPRGLMPNPKTGTV---TADVAKAVKEAKKGRVEFRADKAGNIHAP 172 (227) T ss_pred HHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCC---CHHHHHHHHHHHCCCEEEEECCCEEEEEE T ss_conf 879999955898502588275776887633100478884753596652---24379999987279504651585158777 Q ss_pred CCCC Q ss_conf 6877 Q gi|254780201|r 236 GKKS 239 (394) Q Consensus 236 tKKS 239 (394) ==|. T Consensus 173 ~GK~ 176 (227) T TIGR01169 173 IGKV 176 (227) T ss_pred ECCC T ss_conf 5266 No 79 >TIGR02814 pfaD_fam PfaD family protein; InterPro: IPR014179 The protein PfaD is part of a four-gene locus, similar to polyketide biosynthesis systems, which is responsible for omega-3 polyunsaturated fatty acid biosynthesis in several high pressure and/or cold-adapted bacteria. Several other members of the entry are found in loci presumed to act in polyketide biosyntheses per se.. Probab=20.67 E-value=41 Score=14.52 Aligned_cols=253 Identities=13% Similarity=0.116 Sum_probs=130.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECC---CCCCCCHHHHHHHHHHHH----------HHHHHHHH Q ss_conf 68978999999999999999751001585643999973137---741007326871875599----------99997302 Q gi|254780201|r 60 FQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE---QKAFLRLNRFMSNSRMPF----------DSEKFLYV 126 (394) Q Consensus 60 ~~tP~~f~~~~~wl~~~l~~~~~~~~~~~~~~ivfinaWNE---EG~yLePD~~~G~~yL~a----------~~~al~~~ 126 (394) |=+++.+|+-+..+...| +.-+ .=-|=+|-|=.| |-.-.|--+|+|=.+.|| ++=++... T Consensus 52 GL~l~~VE~Ai~~Iq~~L------~~g~-~YgvNL~hs~~~p~~E~~~vdL~Lr~gV~~vEAsAFm~lTPalV~YRa~Gl 124 (449) T TIGR02814 52 GLSLEEVEQAIHRIQQAL------QNGP-AYGVNLIHSPSDPELENGLVDLLLRHGVRIVEASAFMQLTPALVRYRAKGL 124 (449) T ss_pred CCCHHHHHHHHHHHHHHC------CCCC-CEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCC T ss_conf 888689999999999862------7998-606778647778356899999997369967998720203688888856157 Q ss_pred HHHHCCCCCCCCCCHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHCCCCEEEEEECCCCH-HHHHHHHHHCCCCCE-EE Q ss_conf 6642024678221011000001473899994167789999999970899814999717652-899999971178738-99 Q gi|254780201|r 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN-KDFEQDVLKYFPSAQ-LY 204 (394) Q Consensus 127 ~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHlyY~DLl~E~~~~L~~ip~~~DL~ITt~~~~-~~~~~~~~~~~~~a~-V~ 204 (394) .+ .....+..+-=|+.-+==||+...|++=.-.-= .+..|+ +.. ...++++.+..|-++ |- T Consensus 125 ~R-------------~~~G~v~~~~R~~AKVSRpEVA~~Fm~PAP~~~--L~~L~a--~G~it~eeAeLA~~vPvAdDi~ 187 (449) T TIGR02814 125 HR-------------DADGRVVIRNRLIAKVSRPEVAEAFMSPAPAAI--LQKLLA--EGRITEEEAELAAEVPVADDIC 187 (449) T ss_pred CC-------------CCCCCEEECCEEEEEECCHHHHHHHCCCCHHHH--HHHHHH--CCCCCHHHHHHHHHCCCCCCEE T ss_conf 21-------------689847774556785166799986215234899--999853--3876878999984177303603 Q ss_pred EECCCC--HHHHHHHHHHCCCCCCCCCEEEEECCCCCCC-CCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEE Q ss_conf 945666--0479999862210027867899973687767-7776444078999999874799899999999976099768 Q gi|254780201|r 205 VMENKG--RDVRPFLYLLELGVFDRYDYLCKIHGKKSQR-EGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG 281 (394) Q Consensus 205 vv~NRG--RDI~PfL~~~~~~~l~~YD~v~klHtKKS~~-~~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lG 281 (394) |---=| =|-+|..++++. ++.-=|-+|.=|.=+++= ..=+..-| ||+|+.| +.+|--..+= T Consensus 188 vEADSGGHTD~r~~~vLLP~-i~rLRD~~~~~~gY~~~irvRvGaAGG-----------iGtPeAv---aAAF~LGA~F- 251 (449) T TIGR02814 188 VEADSGGHTDNRPLVVLLPA-IIRLRDTLMRKYGYRKPIRVRVGAAGG-----------IGTPEAV---AAAFMLGADF- 251 (449) T ss_pred EECCCCCCCCCHHHHHHHHH-HHHHHHHHHHHHCCCCCCEEEEECCCC-----------CCCHHHH---HHHHHCCCCE- T ss_conf 75268339762288999899-999999999982789874378735777-----------7727999---9997337425- Q ss_pred EEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC---------------EEEEEEEHHHHHHHHCCCC Q ss_conf 9816876544332345543221389999999748998855477377---------------0123320988875765882 Q gi|254780201|r 282 MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN---------------GTMFWVKPKCLEPLRNLHL 346 (394) Q Consensus 282 lv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~~~~~~~~Fpa---------------GSMFW~R~~AL~pL~~l~l 346 (394) +|.-+ .+-..-+-+-.+.++.|++.+++. |+. =.|| ||||=.|-.-|=.|+- T Consensus 252 iVTGS-------vNQc~~EAGtSd~Vk~lLa~~~v~--Dta-yAPAgDMFE~GvklQVLKrGtlFP~RANkLY~LYr--- 318 (449) T TIGR02814 252 IVTGS-------VNQCTVEAGTSDEVKKLLAKADVQ--DTA-YAPAGDMFELGVKLQVLKRGTLFPARANKLYELYR--- 318 (449) T ss_pred EECCC-------HHHCCCCCCCCHHHHHHHHCCCCC--CCC-CCCCHHHHHCCCEEEEEECCCCCHHHCCHHHHHHC--- T ss_conf 74186-------120033458886799998158976--501-36520054427768884025232101111579863--- Q ss_pred CCCCCC-CCCCCCCHHHHHHHHHH Q ss_conf 235854-46888873899999999 Q gi|254780201|r 347 IGEFEE-ERNLKDGALEHAVERFF 369 (394) Q Consensus 347 ~~DF~~-E~g~~DGTlaHAIERl~ 369 (394) +++. | ..|-+..+.||+=| T Consensus 319 --~YdSle--~l~~~~r~~lE~~~ 338 (449) T TIGR02814 319 --RYDSLE--ELDAKTRAQLEKKY 338 (449) T ss_pred --CCCCHH--HCCHHHHHHHHHHH T ss_conf --898842--14879999999986 No 80 >PRK07482 hypothetical protein; Provisional Probab=20.59 E-value=61 Score=13.43 Aligned_cols=220 Identities=9% Similarity=0.047 Sum_probs=94.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCEEEEEEEC---CHHHHHHHHHHHHHC-CCC Q ss_conf 7410073268718755999999730266420246782210110000014738999941---677899999999708-998 Q gi|254780201|r 101 QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCY---YQDTWIEISHILLRL-NFD 176 (394) Q Consensus 101 EG~yLePD~~~G~~yL~a~~~al~~~~~~~~~~~~~p~~~~~~~~~~k~kIAVvlHly---Y~DLl~E~~~~L~~i-p~~ 176 (394) ||.||. | .-|+.||++..-.-. --+.+ .||...++...-+.+- .+.|.| ..+...++.+.|..+ |.. T Consensus 40 ~G~~l~-D-~dG~~ylD~~~g~~~--~~lGH---~~p~v~~Ai~~Q~~~l--~~~~~~~~~~~~~~~~lAe~L~~~~p~~ 110 (461) T PRK07482 40 EGIRIT-D-HQGRRYIDAFAGLYC--VNVGY---GRTEVAEAIAEQAKEL--AYYHTYVGHGTEASIELSKRIIDRAPAG 110 (461) T ss_pred EEEEEE-E-CCCCEEEECCCCHHH--HHHCC---CCHHHHHHHHHHHHHC--CCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 246899-7-999999996635886--42576---9499999999999658--7611467756889999999999757888 Q ss_pred E-EEEEECC-CCHHHHHHHHHHCC-------CCCEEEEEC--CCCHHHHHHHHHHCCCCCCCCCE--EEEECCCCCCCC- Q ss_conf 1-4999717-65289999997117-------873899945--66604799998622100278678--999736877677- Q gi|254780201|r 177 F-DLFVTVV-EANKDFEQDVLKYF-------PSAQLYVME--NKGRDVRPFLYLLELGVFDRYDY--LCKIHGKKSQRE- 242 (394) Q Consensus 177 ~-DL~ITt~-~~~~~~~~~~~~~~-------~~a~V~vv~--NRGRDI~PfL~~~~~~~l~~YD~--v~klHtKKS~~~- 242 (394) . .+|.+.+ ++..+.-.++.+.+ .+..|.... =.||-++..=..-....-..|+. ..-.|+. .|+. T Consensus 111 l~~v~f~~SGSEAvE~AiKlAr~y~~~~g~~~r~~Iis~~~syHG~T~~a~s~tg~~~~~~~~~~~~~~~~~~~-~P~~~ 189 (461) T PRK07482 111 MSKVYYGLSGSDANETQIKLVWYYNNVLGRPEKKKIISRWRGYHGSGIVTGSLTGLSLFHQHFDLPIDRVLHTE-APHYY 189 (461) T ss_pred CCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECC-CCCCC T ss_conf 68799618669999999999999988618876545999617855675313002466333466679987832246-88545 Q ss_pred --CCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCC-C Q ss_conf --7764440789999998747998999999999760997689816876544332345543221389999999748998-8 Q gi|254780201|r 243 --GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-K 319 (394) Q Consensus 243 --~~~~~~G~~Wr~~Ll~~LLGs~~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~~-~ 319 (394) .......+.+..++.+.| +.++..- ...+++-|.-++.. +.- ....-..+-.+.+.+|+++.|+-. . T Consensus 190 ~~~~~~~~~~~~~~~~~~~l-------e~~i~~~-~~~~iAAvI~EPi~-g~g-G~~~p~~~yl~~lr~lc~~~gilLI~ 259 (461) T PRK07482 190 RRADAGMSEEQFSAYCADEL-------EELILAE-GPDTIAAFIGEPVL-GTG-GIVPPPAGYWEAIQAVLKKYDILLIA 259 (461) T ss_pred CCCCCCCCHHHHHHHHHHHH-------HHHHHHH-CCCCEEEEEECCCC-CCC-CCCCCCHHHHHHHHHHHHCCCCEEEE T ss_conf 67886633777789999999-------9999850-98848999805635-768-85518878979999996245963771 Q ss_pred CCC-CCC-CCEEEEEEEHHHHHH Q ss_conf 554-773-770123320988875 Q gi|254780201|r 320 RLH-LDF-FNGTMFWVKPKCLEP 340 (394) Q Consensus 320 ~~~-~~F-paGSMFW~R~~AL~p 340 (394) |+- ..| =.|+||-+-..-++| T Consensus 260 DEV~tGfGRtG~~fa~e~~gv~P 282 (461) T PRK07482 260 DEVVTGFGRLGSMFGSQHYGIEP 282 (461) T ss_pred CCCEECCCCCCCCCHHCCCCCCC T ss_conf 54432475035841111689999 No 81 >pfam00862 Sucrose_synth Sucrose synthase. Sucrose synthases catalyse the synthesis of sucrose from UDP-glucose and fructose. This family includes the bulk of the sucrose synthase protein. However the carboxyl terminal region of the sucrose synthases belongs to the glycosyl transferase family pfam00534. Probab=20.37 E-value=42 Score=14.41 Aligned_cols=72 Identities=19% Similarity=0.244 Sum_probs=36.2 Q ss_pred HHH--HHHHHHHHCCCH-HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCC-----CC Q ss_conf 789--999998747998-9999999997609976898168765443323455432213899999997489988-----55 Q gi|254780201|r 250 IIW--RRWLFFDLLGFS-DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK-----RL 321 (394) Q Consensus 250 ~~W--r~~Ll~~LLGs~-~~v~~Il~~F~~~p~lGlv~p~~~~~~~~~~~~~~~~~~~~~~~~L~~rlgi~~~-----~~ 321 (394) ..| |-..|-.|=.-+ +....|+..|...|+|-+ ++ ...||- -+.-|+.+||+..+ -+ T Consensus 371 ~~WISRFeiwPYLE~fa~Da~~ei~~el~g~PDLII--------GN------YSDGNL-VAsLla~klgVTQc~IAHALE 435 (550) T pfam00862 371 RKWISRFEVWPYLETFAEDVASEIAAELQAKPDLII--------GN------YSDGNL-VASLLAHKLGVTQCTIAHALE 435 (550) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCEEE--------EC------CCCCHH-HHHHHHHHCCCCEEHHHHHHH T ss_conf 435455111187788889999999998489988798--------41------777229-999998653871012555553 Q ss_pred CCCCCCEEEEEEEHH Q ss_conf 477377012332098 Q gi|254780201|r 322 HLDFFNGTMFWVKPK 336 (394) Q Consensus 322 ~~~FpaGSMFW~R~~ 336 (394) +..++--..+|-.-+ T Consensus 436 KtKY~~SdlyWk~~e 450 (550) T pfam00862 436 KTKYPDSDIYWKKFE 450 (550) T ss_pred CCCCCCCCCCHHHCC T ss_conf 046877543432335 Done!