Query gi|254781114|ref|YP_003065527.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 263 No_of_seqs 181 out of 1482 Neff 8.8 Searched_HMMs 39220 Date Mon May 30 06:21:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781114.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam01755 Glyco_transf_25 Glyc 100.0 0 0 297.1 15.7 178 3-180 2-200 (200) 2 COG3306 Glycosyltransferase in 100.0 1.8E-43 0 269.8 15.9 211 1-214 1-222 (255) 3 cd06532 Glyco_transf_25 Glycos 100.0 7.9E-40 2E-44 248.8 13.5 127 4-180 1-128 (128) 4 KOG4179 consensus 100.0 5.3E-38 1.4E-42 238.3 11.0 193 4-201 293-509 (568) 5 pfam03213 Pox_P35 Poxvirus P35 98.1 2.1E-05 5.4E-10 50.2 9.0 103 68-173 97-207 (325) 6 pfam02434 Fringe Fringe-like. 97.6 0.0018 4.6E-08 39.1 10.8 178 13-202 17-205 (248) 7 cd02514 GT13_GLCNAC-TI GT13_GL 93.5 0.52 1.3E-05 24.9 13.5 188 3-199 2-205 (334) 8 cd02526 GT2_RfbF_like RfbF is 93.1 0.59 1.5E-05 24.6 13.5 136 65-201 56-204 (237) 9 pfam01762 Galactosyl_T Galacto 93.0 0.62 1.6E-05 24.5 8.7 118 68-187 62-187 (196) 10 pfam03071 GNT-I GNT-I family. 92.5 0.72 1.8E-05 24.1 7.2 185 3-198 95-299 (434) 11 KOG2287 consensus 90.2 1.3 3.2E-05 22.7 12.4 172 5-187 99-293 (349) 12 cd06433 GT_2_WfgS_like WfgS an 86.0 2.4 6.1E-05 21.1 8.5 178 12-199 8-192 (202) 13 KOG2791 consensus 85.9 2.4 6.2E-05 21.1 6.2 91 76-173 242-338 (455) 14 cd04185 GT_2_like_b Subfamily 84.5 2.8 7.2E-05 20.7 14.1 160 12-202 7-173 (202) 15 pfam05060 MGAT2 N-acetylglucos 76.5 5.4 0.00014 19.1 8.6 92 76-174 156-255 (357) 16 pfam04666 Glyco_transf_54 N-Ac 73.1 6.6 0.00017 18.6 4.8 86 77-188 167-254 (301) 17 pfam04908 SH3BGR SH3-binding, 62.0 11 0.00029 17.2 4.3 37 1-37 1-40 (92) 18 PTZ00215 ribose 5-phosphate is 58.1 13 0.00034 16.8 4.8 35 1-35 1-35 (156) 19 cd04196 GT_2_like_d Subfamily 57.3 14 0.00035 16.8 12.4 177 18-201 15-203 (214) 20 TIGR01990 bPGM beta-phosphoglu 56.1 5.8 0.00015 18.9 1.3 12 72-83 18-29 (190) 21 TIGR01533 lipo_e_P4 5'-nucleot 55.7 12 0.00029 17.2 2.8 28 3-31 160-188 (295) 22 pfam00483 NTP_transferase Nucl 53.5 16 0.0004 16.4 11.9 123 63-192 86-219 (247) 23 PRK03600 nrdI ribonucleotide r 50.3 14 0.00037 16.6 2.6 29 1-35 1-29 (136) 24 KOG1413 consensus 49.8 18 0.00046 16.1 11.3 183 3-198 69-275 (411) 25 COG3426 Butyrate kinase [Energ 48.7 19 0.00048 16.0 5.9 26 16-41 51-76 (358) 26 COG1519 KdtA 3-deoxy-D-manno-o 42.2 24 0.00061 15.4 5.1 20 2-21 146-168 (419) 27 COG1245 Predicted ATPase, RNas 36.1 30 0.00076 14.8 2.7 28 71-99 250-277 (591) 28 cd04184 GT2_RfbC_Mx_like Myxoc 35.6 30 0.00078 14.7 10.4 98 14-118 14-117 (202) 29 TIGR01469 cobA_cysG_Cterm urop 34.1 29 0.00074 14.9 2.1 86 22-116 100-192 (242) 30 cd02508 ADP_Glucose_PP ADP-glu 31.7 35 0.0009 14.4 9.1 76 67-171 93-168 (200) 31 pfam03445 DUF294 Putative nucl 29.2 31 0.00078 14.7 1.5 19 78-96 62-80 (168) 32 PRK10063 predicted glycosyl tr 29.0 39 0.001 14.1 12.2 185 3-201 1-195 (248) 33 PRK10710 DNA-binding transcrip 28.7 40 0.001 14.1 6.2 21 150-170 102-122 (240) 34 COG2047 Uncharacterized protei 28.3 28 0.0007 15.0 1.1 27 152-178 191-218 (258) 35 cd04186 GT_2_like_c Subfamily 27.9 41 0.0011 14.0 12.8 154 12-199 7-163 (166) 36 TIGR01465 cobM_cbiF precorrin- 27.3 42 0.0011 13.9 2.8 20 20-39 91-110 (252) 37 cd04195 GT2_AmsE_like GT2_AmsE 26.4 44 0.0011 13.8 6.0 87 19-111 17-108 (201) 38 PRK09275 aspartate aminotransf 26.1 44 0.0011 13.8 2.7 23 147-169 252-274 (531) 39 pfam01745 IPT Isopentenyl tran 25.7 45 0.0012 13.7 6.9 93 3-96 1-100 (232) 40 pfam07459 CTX_RstB CTX phage R 25.6 34 0.00087 14.5 1.2 11 3-13 1-11 (117) 41 PRK00648 Maf-like protein; Rev 25.3 46 0.0012 13.7 9.0 72 1-95 1-74 (191) 42 cd04187 DPM1_like_bac Bacteria 25.3 46 0.0012 13.7 6.1 101 12-119 7-113 (181) 43 TIGR03030 CelA cellulose synth 21.7 55 0.0014 13.3 8.6 40 60-108 213-253 (713) 44 cd06420 GT2_Chondriotin_Pol_N 21.3 56 0.0014 13.2 8.6 165 11-198 6-177 (182) 45 pfam06258 DUF1022 Protein of u 20.8 57 0.0014 13.2 3.6 16 19-34 6-21 (308) 46 COG1497 Predicted transcriptio 20.4 47 0.0012 13.6 1.1 33 65-97 85-121 (260) No 1 >pfam01755 Glyco_transf_25 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of various sugars onto the growing LPS chain during its biosynthesis. Probab=100.00 E-value=0 Score=297.13 Aligned_cols=178 Identities=32% Similarity=0.411 Sum_probs=146.3 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHH--HHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 86899808998899999999998779978999663735289677--4202000024678878878999999999999999 Q gi|254781114|r 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICN--RIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRI 80 (263) Q Consensus 3 ~kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~Avdg~~~~~~~--~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~ 80 (263) ||||||||++++|||++|.+|++++|++||||+||||++++..+ ..++.......++++|++|||||+|||+++|++| T Consensus 2 i~iyvInL~r~~~Rr~~~~~~l~~~~i~~~~~~Avdg~~l~~~~~~~~~~~~~~~~~~~r~lt~gEiGC~lSH~~~w~~~ 81 (200) T pfam01755 2 IKIYVISLKRATERREHIQKEFGKLNIPFQFFDAIDGKELPNHLLFKLYNFLLRSKYAGRPLTLGEIGCFLSHYSLWQKC 81 (200) T ss_pred CEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 76999979995899999999998769976999846310102267887325555334678988844514899999999999 Q ss_pred HHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCC-------------------CCCCCCCCCCCCCE Q ss_conf 82799859997302213300899999998415537754451136654-------------------22211234677705 Q gi|254781114|r 81 AYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKP-------------------KKDSYLCTLPGNFD 141 (263) Q Consensus 81 ~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~-------------------~~~~~~~~~~~~~~ 141 (263) +++|.+++|||||||.|.++|.+.++.+.....+++.+.++...... .............. T Consensus 82 v~~~~~~alIlEDD~~~~~~~~~~l~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (200) T pfam01755 82 VELNLEYIIILEDDVILSENFKAILQFLLKEINELEFLRLGEWLKTRFDPNDKFIVRLETFLMDSKKFKRTPIADYGNRA 161 (200) T ss_pred HHCCCCEEEEECCCEECCHHHHHHHHHHHHCCCCCHHEEECCCCCCCCCCCCCCCCCCEEECCCCCHHCCCCCCCCCCCE T ss_conf 97599849998167204430899999998504420110004432345455310002210111543000135555557703 Q ss_pred EEEECCCCCCCEEEEECHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 899405798512356669999999996254578046887 Q gi|254781114|r 142 IHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDM 180 (263) Q Consensus 142 l~~~~~~~~gt~aY~Is~~~AkkLl~~~~~~~~pvD~~l 180 (263) +......+.||+||+|||+||++||+..+++.+|||++| T Consensus 162 ~~~~~~~~~gt~aYiIs~~gAkkLL~~~~~~~~PvD~~m 200 (200) T pfam01755 162 FYLLKSGHAGTAGYLITPKAAKKLLKYSKNWPLPVDDLM 200 (200) T ss_pred EEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHC T ss_conf 654058888067688419999999976776797623019 No 2 >COG3306 Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=1.8e-43 Score=269.82 Aligned_cols=211 Identities=24% Similarity=0.264 Sum_probs=164.8 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHCC-CCEEEEECCCCCCCCHHHHH-HCCHHCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9886899808998899999999998779-97899966373528967742-020000246788788789999999999999 Q gi|254781114|r 1 MPIPVYVISLPFSHARREKFCHRAARIH-LQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRLLSLPEIGCYISHIHLWK 78 (263) Q Consensus 1 m~~kiyvInL~~~~~Rr~~~~~~~~~~g-i~~e~~~Avdg~~~~~~~~~-~~~~~~~~~~~~~l~~geigC~lSH~~~w~ 78 (263) |.|++|||||+++.+|++++.+.+...+ ++|++|+||+|+..+..... .....+...+++.+++||+|||+||+.+|| T Consensus 1 ~~~~~~vIsL~~s~~R~~~~~~~f~~~~~~~f~~~~av~~~~~~~~~~~~~~~~~~~~~~~~~ls~gEiGC~lSH~~lw~ 80 (255) T COG3306 1 SLMPIHVISLKSSQERLEHVAETFEALGGLPFQRFDAVNGKSEDEKDLIAELDAGHLLYEGRRLSPGEIGCYLSHLKLWK 80 (255) T ss_pred CCCCEEHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 99632103154428879999998764258882575346853339888722301023455223567116989999999999 Q ss_pred HHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCC-----CCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCE Q ss_conf 9982799859997302213300899999998415-----53775445113665422211234677705899405798512 Q gi|254781114|r 79 RIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCD-----INNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTT 153 (263) Q Consensus 79 ~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~-----~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~ 153 (263) +|++.|.+|++|||||+.|+++|.+.++...+.+ .+|+++++............ ....+..+......+.||+ T Consensus 81 ~~~~~~~~yi~I~EDDV~l~e~f~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~gt~ 158 (255) T COG3306 81 KALEENLPYILILEDDVVLGEDFEEFLEDDLKLPVRFLGDDIDIHRLETFLSPNPLAFN--AVFIGRNFPLLNSYHLGTA 158 (255) T ss_pred HHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCEEECC--CCCCCCCCHHHHHCCCCCC T ss_conf 99847998489963661035148999999985244323037888998875056522125--3100102111011115744 Q ss_pred EEEECHHHHHHHHHHHCC--CCCHHHHHHHHHHH--CCCCEEEEECCCEECCCCCCCCHHHHHHH Q ss_conf 356669999999996254--57804688799997--59817999603131267656511332221 Q gi|254781114|r 154 GYFIGKEAAIHLLNVRKN--IYRPIDMDMKHWWE--HNIPSLVTEPGAVYEAIDTNDSTIEESRL 214 (263) Q Consensus 154 aY~Is~~~AkkLl~~~~~--~~~pvD~~l~~~~~--~~l~~~~~~P~~v~~~~~~~~sti~~~~~ 214 (263) ||+||++||+++|+.++. +.+|+|.++...|. .+..+|++.|..+.|.+. ..|..+.++. T Consensus 159 gYiis~~aAk~fl~~~~~~~~~~pvD~~~~~~~~~~~~~~vyq~~p~~~~~~~~-l~s~~~~~r~ 222 (255) T COG3306 159 GYIISRKAAKKFLELTESFKVVLPVDWFMFLEFLHGVGNKVYQPFPAICAQGSA-LSSILENERD 222 (255) T ss_pred CEEECHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCCCCC-CCCCCHHHCC T ss_conf 323348999999998612365567168876222045676688744651232234-6653143236 No 3 >cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis. Probab=100.00 E-value=7.9e-40 Score=248.80 Aligned_cols=127 Identities=34% Similarity=0.534 Sum_probs=116.2 Q ss_pred CEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHH-HCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 689980899889999999999877997899966373528967742-0200002467887887899999999999999982 Q gi|254781114|r 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNRI-FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY 82 (263) Q Consensus 4 kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~Avdg~~~~~~~~~-~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~ 82 (263) |||||||++++||+++|.+++.+.|++|+||+||||++++..... .........+++++++||+||++||+++||++++ T Consensus 1 ki~vInL~~~~dR~~~~~~~~~~~~l~~~~~~Avdg~~~~~~~~~~~~~~~~~~~~~~~lt~gEigC~lSH~~~w~~~v~ 80 (128) T cd06532 1 KIFVINLDRSTDRRERMEAQLAALGLDFEFFDAVDGKDLSEEELAALYDALFLPRYGRPLTPGEIGCFLSHYKLWQKIVE 80 (128) T ss_pred CEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEEECHHHCCHHHHHHHCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999898968999999999977698735886034102799999874304344304898874357899999999999997 Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHH Q ss_conf 79985999730221330089999999841553775445113665422211234677705899405798512356669999 Q gi|254781114|r 83 SPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAA 162 (263) Q Consensus 83 ~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~A 162 (263) +|.+++||||||++|.++ ||+||+|||.|| T Consensus 81 ~~~~~~lIlEDDv~~~~~--------------------------------------------------~t~aY~is~~~A 110 (128) T cd06532 81 SNLEYALILEDDAILDPD--------------------------------------------------GTAGYLVSRKGA 110 (128) T ss_pred CCCCEEEEEECCCEECCC--------------------------------------------------CEEEEEECHHHH T ss_conf 599839998178356799--------------------------------------------------736899789999 Q ss_pred HHHHHHHCCCCCHHHHHH Q ss_conf 999996254578046887 Q gi|254781114|r 163 IHLLNVRKNIYRPIDMDM 180 (263) Q Consensus 163 kkLl~~~~~~~~pvD~~l 180 (263) ++||+.+.++.+|+|.++ T Consensus 111 ~klL~~~~~~~~pvD~~~ 128 (128) T cd06532 111 KKLLAALEPIDLPVDVFL 128 (128) T ss_pred HHHHHHCCCCCCCCCCCC T ss_conf 999982787667650419 No 4 >KOG4179 consensus Probab=100.00 E-value=5.3e-38 Score=238.25 Aligned_cols=193 Identities=22% Similarity=0.339 Sum_probs=150.5 Q ss_pred CEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH---HHC-CHHCCC-CCCCCCCHHHHHHHHHHHHHHH Q ss_conf 68998089988999999999987799789996637352896774---202-000024-6788788789999999999999 Q gi|254781114|r 4 PVYVISLPFSHARREKFCHRAARIHLQFSFFDAIYGENNPICNR---IFS-HQKRQC-QFKRLLSLPEIGCYISHIHLWK 78 (263) Q Consensus 4 kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~Avdg~~~~~~~~---~~~-~~~~~~-~~~~~l~~geigC~lSH~~~w~ 78 (263) +||+|||+|++|||++|.+.|..+||+++.++||||+.++.... .++ .+.+.. ..+|+|+.||||||||||++|+ T Consensus 293 eIfmInLkRR~eRreRM~r~fde~gIe~~~veAvDgk~lnt~~~~~lgv~~LpGY~DPys~Rplt~GEiGCFLSHY~iW~ 372 (568) T KOG4179 293 EIFMINLKRRPERRERMLRIFDELGIEYSLVEAVDGKKLNTSQLEALGVQMLPGYRDPYSGRPLTKGEIGCFLSHYNIWK 372 (568) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHCCEEEEEECCCCCCCCHHHHHHCCCEECCCCCCCCCCCCCCCCCEEEEHHHHHHHH T ss_conf 58998624476789999999887524278874245532364478753943057866865678663763013322778999 Q ss_pred HHHHCCCCEEEEEEECCCCCCCHHHHH----HHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEE Q ss_conf 998279985999730221330089999----9998415537754451136654222112346777058994057985123 Q gi|254781114|r 79 RIAYSPAIGAIILEDDADFSDEFSQLL----PHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG 154 (263) Q Consensus 79 ~~~~~~~~~~lIlEDD~~~~~~f~~~l----~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~a 154 (263) .+++.+.+-.||||||+.|..+|...+ +.+.....+||+++++..+..... ........+ ++.. ....-|.| T Consensus 373 evV~r~l~kvlvfEDD~RFe~~f~~rl~~lm~d~~~~~~~wdLIY~GRKr~~~e~-E~av~~~~n--Lv~a-~YSYWTLg 448 (568) T KOG4179 373 EVVDRGLEKVLVFEDDLRFEHDFRTRLMRLMQDVDAEHLDWDLIYLGRKRMQEEE-EKAVENVRN--LVEA-GYSYWTLG 448 (568) T ss_pred HHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHCCCHH-HHCCCCCCC--CEEC-CCCCEEHH T ss_conf 9997162058996145210377899999999998861488317985031247212-101136677--1351-40010255 Q ss_pred EEECHHHHHHHHHHHCCC--CCHHHHHHHHHH-------------HCCCCEEEEECCCEECC Q ss_conf 566699999999962545--780468879999-------------75981799960313126 Q gi|254781114|r 155 YFIGKEAAIHLLNVRKNI--YRPIDMDMKHWW-------------EHNIPSLVTEPGAVYEA 201 (263) Q Consensus 155 Y~Is~~~AkkLl~~~~~~--~~pvD~~l~~~~-------------~~~l~~~~~~P~~v~~~ 201 (263) |+||.+||+|||.. .|. -.|||.|+--++ ..++..|+..|.++++. T Consensus 449 YvisL~GArKLlaa-~Pl~kmlPVDEfLPiMfdkHPn~~~k~~f~~R~l~afsa~pll~ypt 509 (568) T KOG4179 449 YVISLQGARKLLAA-NPLKKMLPVDEFLPIMFDKHPNKEYKEHFCARNLVAFSAEPLLIYPT 509 (568) T ss_pred HHHHHHHHHHHHCC-CCHHCCCCHHHHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCEEEECE T ss_conf 57511337776255-83101374655203442468527877532511001003563053010 No 5 >pfam03213 Pox_P35 Poxvirus P35 protein. Probab=98.15 E-value=2.1e-05 Score=50.24 Aligned_cols=103 Identities=17% Similarity=0.150 Sum_probs=65.5 Q ss_pred HHHHHHHHHHHHH-----HHCCCCEEEEEEECCCCCCCHHHHHHHH--HHCCCCCCEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999999-----8279985999730221330089999999--84155377544511366542221123467770 Q gi|254781114|r 68 GCYISHIHLWKRI-----AYSPAIGAIILEDDADFSDEFSQLLPHL--SKCDINNILIKFDALRKKPKKDSYLCTLPGNF 140 (263) Q Consensus 68 gC~lSH~~~w~~~-----~~~~~~~~lIlEDD~~~~~~f~~~l~~~--~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~ 140 (263) --.+-|+++|+.. .+.+.++++|+|||-.+.+ .+.+..+ .....+||+++|-...... .........++. T Consensus 97 ~sia~H~SLWe~~~~~~~~~~~~~fivVvEdDnTl~~--it~i~~~I~~M~~knidilQLrE~~~~~-~~rt~~~~~~~p 173 (325) T pfam03213 97 RSIANHLSLWESYAKADIKDPESKFIVVVEDDNTLRD--ITTIHPLIKSMKEKNIDILQLRETLHNN-NVRTLLNQSGNP 173 (325) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCC--CHHHHHHHHHHHHCCCEEEEEHHHHHCC-CHHHHCCCCCCC T ss_conf 9999888999987334545877656999937972135--3769999999986595487512355033-244423566896 Q ss_pred EEEEEC-CCCCCCEEEEECHHHHHHHHHHHCCCC Q ss_conf 589940-579851235666999999999625457 Q gi|254781114|r 141 DIHQPR-ILSPRTTGYFIGKEAAIHLLNVRKNIY 173 (263) Q Consensus 141 ~l~~~~-~~~~gt~aY~Is~~~AkkLl~~~~~~~ 173 (263) .+.-+. .......||+|..++|.+|++...... T Consensus 174 ~l~~YtGgyDvSLsAYIIr~~ta~kl~~~i~~~~ 207 (325) T pfam03213 174 SLYSYTGGYDVSLSAYIIRVSTAKKLYAEIIKNG 207 (325) T ss_pred CEEEECCCCCEEEEEEEEEHHHHHHHHHHHHHCC T ss_conf 4266337532267899997278899999998658 No 6 >pfam02434 Fringe Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localized to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs through glycosylation by FNG. The xenopus homologue, lunatic fringe, has been implicated in a variety of functions. Probab=97.57 E-value=0.0018 Score=39.13 Aligned_cols=178 Identities=11% Similarity=0.043 Sum_probs=95.7 Q ss_pred CHHHHHHHHHHHHHCCCCEEE-EECCCCCCCCHHHHHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 889999999999877997899-9663735289677420200002467887887899999999999999982799859997 Q gi|254781114|r 13 SHARREKFCHRAARIHLQFSF-FDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIIL 91 (263) Q Consensus 13 ~~~Rr~~~~~~~~~~gi~~e~-~~Avdg~~~~~~~~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~~lIl 91 (263) ..+|...+.+...+..-.-++ |...+-..++.... ...........-.+...+|-++- .+..+++++.++.++. T Consensus 17 H~sRl~~v~~TW~~~a~~~~~~fSD~~D~~lp~~~~---~~~~~~n~~~~h~~k~ls~~m~~--~~d~~~~~~~kWf~~~ 91 (248) T pfam02434 17 HKTRLPLLLKTWISRAKHQTYIFTDGEDEGLPTRTG---GHLINTNCSAGHCRKALSCKMAV--EYDRFLESGKKWFCHV 91 (248) T ss_pred HHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHH--HHHHHHHCCCCEEEEE T ss_conf 798999999999843677489963777556675568---86102687776454555888899--9887764288669993 Q ss_pred EECCCCCC-CHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHHHHHHH--- Q ss_conf 30221330-08999999984155377544511366542221123467770589940579851235666999999999--- Q gi|254781114|r 92 EDDADFSD-EFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN--- 167 (263) Q Consensus 92 EDD~~~~~-~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkLl~--- 167 (263) .||..+.. .+..+|. ......+ ++++-.... ...........+. ..-+.+..|.+|++|||..|.||.- T Consensus 92 DDDtYvnv~rL~~lL~---~yn~s~d-iylG~~s~~-~~~~~~~~~~~~~--~~~f~faTGGaG~ciSR~la~km~P~~s 164 (248) T pfam02434 92 DDDNYVNVPRLVRLLS---CYNHTQD-VYLGKPSLY-RPIEATERVKNNR--KVGFWFATGGAGFCISRGLALKMSPWAS 164 (248) T ss_pred CCCCEECHHHHHHHHH---CCCCCCC-EEECCCCCC-CCCCCCCCCCCCC--CCCEEEECCCCEEEEEHHHHHHHHHHCC T ss_conf 5873205788999973---1687788-672353567-7542201234567--5643554389529976999986404215 Q ss_pred ------HHCCCCCHHHHHHHHHHHCCCCEEEEECCCEECCC Q ss_conf ------62545780468879999759817999603131267 Q gi|254781114|r 168 ------VRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEAI 202 (263) Q Consensus 168 ------~~~~~~~pvD~~l~~~~~~~l~~~~~~P~~v~~~~ 202 (263) ....+.+|-|..+++-.++.|.+-.+.-++-.|.- T Consensus 165 ~~~~~s~~~~i~lPDD~tlG~ii~~~Lgv~lt~s~lFHs~~ 205 (248) T pfam02434 165 GGRFMSTSEKIRLPDDCTLGYIIENLLGVPLTHSPLFHSHL 205 (248) T ss_pred CCCCCCCCCCCCCCCCEEEHHHHHHHCCCCCEECHHHHHCC T ss_conf 89875453455799746620231423588756555655238 No 7 >cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13. Probab=93.49 E-value=0.52 Score=24.94 Aligned_cols=188 Identities=14% Similarity=0.050 Sum_probs=93.1 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHC--CC-CEEEEECCCCCCCCHHHH--HHCCHHCCC-CC-CC--CC-CHHH-HH--H Q ss_conf 8689980899889999999999877--99-789996637352896774--202000024-67-88--78-8789-99--9 Q gi|254781114|r 3 IPVYVISLPFSHARREKFCHRAARI--HL-QFSFFDAIYGENNPICNR--IFSHQKRQC-QF-KR--LL-SLPE-IG--C 69 (263) Q Consensus 3 ~kiyvInL~~~~~Rr~~~~~~~~~~--gi-~~e~~~Avdg~~~~~~~~--~~~~~~~~~-~~-~~--~l-~~ge-ig--C 69 (263) +||-||.-. +++=..++.+++-++ +. .+-++--.||......+. .|....... +. .. .. ..++ .| - T Consensus 2 ipVlV~a~N-Rp~~l~r~L~sLl~~~p~~~~~pIiVsqDg~~~~~~~v~~~yg~~i~~i~~~~~~~~~~~~~~k~~~Y~~ 80 (334) T cd02514 2 IPVLVIACN-RPDYLRRMLDSLLSYRPSAEKFPIIVSQDGGYEEVADVAKSFGDGVTHIQHPPISIKNVNPPHKFQGYYR 80 (334) T ss_pred CCEEEEECC-CHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHCCEEEEEECCCCCCCCCCCHHHHHHHHH T ss_conf 118999558-7799999999998518667874489994799879999999725102789678866556675255566899 Q ss_pred HHHHHHHH-HHH-HHCCCCEEEEEEECCCCCCCHHHHHHHHHHC-CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 99999999-999-8279985999730221330089999999841-55377544511366542221123467770589940 Q gi|254781114|r 70 YISHIHLW-KRI-AYSPAIGAIILEDDADFSDEFSQLLPHLSKC-DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPR 146 (263) Q Consensus 70 ~lSH~~~w-~~~-~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~-~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~ 146 (263) ...||+-- .++ -..+.+.++|+|||..+.++|.++.+....+ ..|--+......+-. .+.. ........+++.. T Consensus 81 Ia~HYk~aL~~~F~~~~~~~vIIlEDDL~vspDFf~YF~~t~~LL~~D~sl~cISAWNDn-G~~~--~v~~~~~~lyRsd 157 (334) T cd02514 81 IARHYKWALTQTFNLFGYSFVIILEDDLDIAPDFFSYFQATLPLLEEDPSLWCISAWNDN-GKEH--FVDDTPSLLYRTD 157 (334) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEEECCC-CCCC--CCCCCCCCEEEEC T ss_conf 999999999998630588769999456333575999999756876159966899733167-8655--5787611024304 Q ss_pred CCCCCCEEEEECHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEECCCEE Q ss_conf 57985123566699999999962545780468879999759817999603131 Q gi|254781114|r 147 ILSPRTTGYFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVY 199 (263) Q Consensus 147 ~~~~gt~aY~Is~~~AkkLl~~~~~~~~pvD~~l~~~~~~~l~~~~~~P~~v~ 199 (263) +...-|-+++++-=+.| +---| ....|.||...-.++-+.+ +.|-+.. T Consensus 158 --ffpGLGWml~r~lw~El-~pkWP-~~~WDdWmR~p~~RkgR~c-IrPEisR 205 (334) T cd02514 158 --FFPGLGWMLTRKLWKEL-EPKWP-KAFWDDWMRLPEQRKGREC-IRPEISR 205 (334) T ss_pred --CCCCHHHHHHHHHHHHH-CCCCC-CCCHHHHHCCHHHHCCCCC-CCCCCCC T ss_conf --67735665108789762-55799-8741766505565179741-2566653 No 8 >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Probab=93.13 E-value=0.59 Score=24.61 Aligned_cols=136 Identities=11% Similarity=-0.008 Sum_probs=65.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHH-HHHCCCCCCEEEEECC--C--CCCCCCC------CC Q ss_conf 8999999999999999827998599973022133008999999-9841553775445113--6--6542221------12 Q gi|254781114|r 65 PEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPH-LSKCDINNILIKFDAL--R--KKPKKDS------YL 133 (263) Q Consensus 65 geigC~lSH~~~w~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~-~~~~~~~~d~~~l~~~--~--~~~~~~~------~~ 133 (263) .-.|.+..+-...+.+...+.++++++-||+++.+++.+.+.. ......+-.+..++.. . ....... .. T Consensus 56 ~N~G~a~g~N~Gi~~a~~~~~d~i~~ln~D~~~~~~~l~~l~~~~~~~~~~~~vg~~~p~~~d~~~~~~~~~~~~~~~~~ 135 (237) T cd02526 56 ENLGIAKALNIGIKAALENGADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKL 135 (237) T ss_pred CCCCHHHHCCCCCHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 97785887274303341179778999357565590599999999985222798899971798689980045421125332 Q ss_pred CCCCCCCEEEEECCCCCCCEEEEECHHHHHHHHHHHCCCCC-HHHHHH-HHHHHCCCCEEEEECCCEECC Q ss_conf 34677705899405798512356669999999996254578-046887-999975981799960313126 Q gi|254781114|r 134 CTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIYR-PIDMDM-KHWWEHNIPSLVTEPGAVYEA 201 (263) Q Consensus 134 ~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkLl~~~~~~~~-pvD~~l-~~~~~~~l~~~~~~P~~v~~~ 201 (263) ..............+. .+.|-++.++..+++=-.-+.+.+ --|..+ -+....|.+++.+--..|... T Consensus 136 ~~~~~~~~~~~~~~~~-~~sg~lir~~~~~~vG~fde~fF~y~ED~Dl~~R~~~~G~~i~~~p~a~v~H~ 204 (237) T cd02526 136 RIQKEGEEGLKEVDFL-ITSGSLISLEALEKVGGFDEDLFIDYVDTEWCLRARSKGYKIYVVPDAVLKHE 204 (237) T ss_pred CCCCCCCCCCEECCEE-EECCEEEEHHHHHHHCCCCHHHCCCCCHHHHHHHHHHCCCCEEEECCEEEEEC T ss_conf 2354424784460430-30143778999998579968977937199999999985997999898899978 No 9 >pfam01762 Galactosyl_T Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2. Probab=93.01 E-value=0.62 Score=24.50 Aligned_cols=118 Identities=14% Similarity=0.010 Sum_probs=55.3 Q ss_pred HHHHHHHHHHHHHHHC--CCCEEEEEEECCCCC-CCHHHHHHHHHHCCCCCCEEEEECCCCC-CCCCCCCCCCCCCCEEE Q ss_conf 9999999999999827--998599973022133-0089999999841553775445113665-42221123467770589 Q gi|254781114|r 68 GCYISHIHLWKRIAYS--PAIGAIILEDDADFS-DEFSQLLPHLSKCDINNILIKFDALRKK-PKKDSYLCTLPGNFDIH 143 (263) Q Consensus 68 gC~lSH~~~w~~~~~~--~~~~~lIlEDD~~~~-~~f~~~l~~~~~~~~~~d~~~l~~~~~~-~~~~~~~~~~~~~~~l~ 143 (263) .-.+..+...+.+.+. +..+.+-..||+.+. +.+.+.+......+ .... ..+..... .........-......+ T Consensus 62 nLt~K~l~~~~W~~~~c~~~~fv~K~DDDv~Vn~~~L~~~L~~~~~~~-~~~~-~~g~~~~~~~p~R~~~~kwyv~~~~y 139 (196) T pfam01762 62 NLTFKTLTGLLYAVSKAPSAKYIGKIDDDVYFFPDKLLSLLDRGNINP-SESS-FYGYVMKEGPVIRNKKSKWYVPPSDY 139 (196) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEECCCEEECHHHHHHHHHHCCCCC-CCCE-EEEEEECCCCCEECCCCCCCCCHHHC T ss_conf 068999999999998699765999937856882999999998515686-6776-89876327864346898776488887 Q ss_pred EEC-CCC-CCCEEEEECHHHHHHHHHHHCCC--CCHHHHHHHHHHHCC Q ss_conf 940-579-85123566699999999962545--780468879999759 Q gi|254781114|r 144 QPR-ILS-PRTTGYFIGKEAAIHLLNVRKNI--YRPIDMDMKHWWEHN 187 (263) Q Consensus 144 ~~~-~~~-~gt~aY~Is~~~AkkLl~~~~~~--~~pvD~~l~~~~~~~ 187 (263) ... ..+ ++..||++|.+.|++|+...... ..--|+++...+..+ T Consensus 140 ~~~~YP~Y~~G~~YvlS~d~v~~l~~~~~~~~~~~~eDv~vtGila~~ 187 (196) T pfam01762 140 PCSRYPPYASGPFYLLSRDAAELLLKASKHRRFLQIEDVYVTGILADD 187 (196) T ss_pred CCCCCCCCCCCCEEEECHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 887798789888888669999999998504898880133569999998 No 10 >pfam03071 GNT-I GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus, and is probably distributed in all tissues. The catalytic domain is located at the C-terminus. Probab=92.55 E-value=0.72 Score=24.13 Aligned_cols=185 Identities=11% Similarity=0.037 Sum_probs=86.0 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCC---CEEEEECCCCCCCCHHHH--HHCCHHC-CCCC---CCCCCHHH---HHH- Q ss_conf 868998089988999999999987799---789996637352896774--2020000-2467---88788789---999- Q gi|254781114|r 3 IPVYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNR--IFSHQKR-QCQF---KRLLSLPE---IGC- 69 (263) Q Consensus 3 ~kiyvInL~~~~~Rr~~~~~~~~~~gi---~~e~~~Avdg~~~~~~~~--~~~~~~~-~~~~---~~~l~~ge---igC- 69 (263) ++|-||...|.+-=+.++.+-+. +-- .|-+|--=|+.+...... .|..... .... .-...++| .|- T Consensus 95 ipVlV~ACnR~~~v~r~ld~llk-yRPS~e~fPIiVSQDc~~~~t~~vi~syg~~v~~i~q~d~~~i~v~p~~kk~~gYy 173 (434) T pfam03071 95 IPVLVFACNRADYVRRHVEKLLH-YRPSAEKFPIIVSQDCDHEEVKKEVLSYGNQVTYIKHLDFEPIAIPPGHRKFTAYY 173 (434) T ss_pred EEEEEEECCCHHHHHHHHHHHHH-CCCCCCCCCEEEECCCCCHHHHHHHHHHHHCEEEEECCCCCCCCCCCCCCCHHHHH T ss_conf 58999974874899999999971-38851129979950699889999999653002787158878887897523014489 Q ss_pred -HHHHHHH-H-HHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHH-CCCCCCEEEEECCCCCCCCCCCCCCCC-CCCEEEE Q ss_conf -9999999-9-999827998599973022133008999999984-155377544511366542221123467-7705899 Q gi|254781114|r 70 -YISHIHL-W-KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSK-CDINNILIKFDALRKKPKKDSYLCTLP-GNFDIHQ 144 (263) Q Consensus 70 -~lSH~~~-w-~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~-~~~~~d~~~l~~~~~~~~~~~~~~~~~-~~~~l~~ 144 (263) ..-||+- . +-+...+.+.++|+|||..+.++|.++...... +..|-.+......+-. +....... ....+++ T Consensus 174 kIarHYkwAL~~vF~~~~~~~vIIvEdDL~vapDFf~yF~~t~~lL~~D~sl~cvSAWNDn---G~~~~v~~~~~~~LyR 250 (434) T pfam03071 174 KIARHYKWALNQVFYKHNFSSVIITEDDLEIAPDFFSYFSATRYLLERDPSLWCVSAWNDN---GKEQNVDDSRPYTLYR 250 (434) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCC---CCCCCCCCCCCCCEEE T ss_conf 9998888899878766388759998246443635799999737987159973067520457---8644567788661462 Q ss_pred ECCCCCCCEEEEECHHHHHHHHHHHCCCC--CHHHHHHHHHHHCCCCEEEEECCCE Q ss_conf 40579851235666999999999625457--8046887999975981799960313 Q gi|254781114|r 145 PRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDMDMKHWWEHNIPSLVTEPGAV 198 (263) Q Consensus 145 ~~~~~~gt~aY~Is~~~AkkLl~~~~~~~--~pvD~~l~~~~~~~l~~~~~~P~~v 198 (263) . . +.-..|.+++++-=. ...|-+ .-.|.||...-.++-+.+ +.|-+. T Consensus 251 s-d-ffPGLGWml~r~lW~----EL~pkWP~~~WDdWmR~p~~rkgR~c-IrPEvs 299 (434) T pfam03071 251 S-D-FFPGLGWMLSRKTWD----ELEPKWPKAFWDDWMRLPEQRKGRQC-IRPEIS 299 (434) T ss_pred C-C-CCCCHHHHHHHHHHH----HHCCCCCCCCHHHHHCCHHHCCCCCC-CCCCCC T ss_conf 2-6-777357761388897----63557987652755302343078742-456653 No 11 >KOG2287 consensus Probab=90.22 E-value=1.3 Score=22.71 Aligned_cols=172 Identities=15% Similarity=0.119 Sum_probs=83.6 Q ss_pred EEEEECCCCHHHHHHHHHHHHHCC------CCEEEEECCCCCCC------CHHHHHHCCH---HCCCCCCCCCCHHHHHH Q ss_conf 899808998899999999998779------97899966373528------9677420200---00246788788789999 Q gi|254781114|r 5 VYVISLPFSHARREKFCHRAARIH------LQFSFFDAIYGENN------PICNRIFSHQ---KRQCQFKRLLSLPEIGC 69 (263) Q Consensus 5 iyvInL~~~~~Rr~~~~~~~~~~g------i~~e~~~Avdg~~~------~~~~~~~~~~---~~~~~~~~~l~~geigC 69 (263) +-|-+-.++-+||+.+.+..-..+ +..-|+-|..+... -.+...|.+- .+...+ ..+ T Consensus 99 l~V~S~~~~farR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~~df~Dty-~nl------- 170 (349) T KOG2287 99 LLVKSAPDNFARRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQVDFEDTY-FNL------- 170 (349) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCC-CCC------- T ss_conf 999746876899999998755745458872689999468886478999999888864977998502622-064------- Q ss_pred HHHHHHHHHHHH--HCCCCEEEEEEECCCCCC-CHHHHHHHHHHCCCCCCEEEEECCCCC-CCCCCCCCCCCCCCEEEEE Q ss_conf 999999999998--279985999730221330-089999999841553775445113665-4222112346777058994 Q gi|254781114|r 70 YISHIHLWKRIA--YSPAIGAIILEDDADFSD-EFSQLLPHLSKCDINNILIKFDALRKK-PKKDSYLCTLPGNFDIHQP 145 (263) Q Consensus 70 ~lSH~~~w~~~~--~~~~~~~lIlEDD~~~~~-~f~~~l~~~~~~~~~~d~~~l~~~~~~-~~~~~~~~~~~~~~~l~~~ 145 (263) .+.++...+... ....++++=..||+.+.. ++.+.+. .....-....+|..... ...+.+...-.-....+.. T Consensus 171 tlKtl~~l~w~~~~cp~akfi~K~DDDvfv~~~~L~~~L~---~~~~~~~~~~~G~v~~~~~p~R~~~~KwyVp~~~y~~ 247 (349) T KOG2287 171 TLKTLAILLWGVSKCPDAKFILKIDDDVFVNPDNLLEYLD---KLNDPSSDLYYGRVIQNAPPIRDKTSKWYVPESEYPC 247 (349) T ss_pred CHHHHHHHHHHHCCCCCCCEEEEECCCEEECHHHHHHHHH---CCCCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHCCC T ss_conf 2999999998762599520677605765872899999986---0489876237866323677656788787668866788 Q ss_pred --CCCCCCCEEEEECHHHHHHHHHHHCCC-CCH-HHHHHHHHHHCC Q ss_conf --057985123566699999999962545-780-468879999759 Q gi|254781114|r 146 --RILSPRTTGYFIGKEAAIHLLNVRKNI-YRP-IDMDMKHWWEHN 187 (263) Q Consensus 146 --~~~~~gt~aY~Is~~~AkkLl~~~~~~-~~p-vD~~l~~~~~~~ 187 (263) +-...+..||++|+++|+.+++.+... ..+ -|+++..+.... T Consensus 248 ~~YP~Y~sG~gYvis~~~a~~l~~~s~~~~~~~iEDV~~g~~l~~~ 293 (349) T KOG2287 248 SVYPPYASGPGYVISGDAARRLLKASKHLKFFPIEDVFVGGCLAED 293 (349) T ss_pred CCCCCCCCCCEEEECHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHC T ss_conf 7788835787589659999999997267877668887849999850 No 12 >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=86.03 E-value=2.4 Score=21.13 Aligned_cols=178 Identities=8% Similarity=-0.055 Sum_probs=69.2 Q ss_pred CCHHHHHHHHHHHHHCCC-CEEEEECCCCCCCCHH-HHHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHH-CCCCEE Q ss_conf 988999999999987799-7899966373528967-7420200002467887887899999999999999982-799859 Q gi|254781114|r 12 FSHARREKFCHRAARIHL-QFSFFDAIYGENNPIC-NRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAY-SPAIGA 88 (263) Q Consensus 12 ~~~~Rr~~~~~~~~~~gi-~~e~~~Avdg~~~~~~-~~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~-~~~~~~ 88 (263) +..+-.+...+.+.++.. ++|.+ -||+...+.. +..-......... .+....|.+. ++...++ ..-+++ T Consensus 8 N~~~~l~~~l~Si~~Q~~~~~EiI-vVDd~S~D~t~~~~~~~~~~~~~~---~~~~~~G~~~----a~N~g~~~a~g~yi 79 (202) T cd06433 8 NQAETLEETIDSVLSQTYPNIEYI-VIDGGSTDGTVDIIKKYEDKITYW---ISEPDKGIYD----AMNKGIALATGDII 79 (202) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEE-EEECCCCCCHHHHHHHCCCCEEEE---EECCCCCHHH----HHHHHHHHCCCCEE T ss_conf 988999999999983789997999-997998842344533113412599---9888888899----98732775485553 Q ss_pred EEEEECCCCCCC-HHHHHHHHHHCCCCCCEEEEE--CCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHHHHH Q ss_conf 997302213300-899999998415537754451--13665422211234677705899405798512356669999999 Q gi|254781114|r 89 IILEDDADFSDE-FSQLLPHLSKCDINNILIKFD--ALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHL 165 (263) Q Consensus 89 lIlEDD~~~~~~-f~~~l~~~~~~~~~~d~~~l~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkL 165 (263) +++.+|..+.++ +.+.+..+... .+.+++.-. ................... .........+..+.++.++..+++ T Consensus 80 ~~ld~DD~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~i 157 (202) T cd06433 80 GFLNSDDTLLPGALLAVVAAFAEH-PEVDVVYGDVLLVDENGRVIGRRRPPPFLD-KFLLYGMPICHQATFFRRSLFEKY 157 (202) T ss_pred ECCCCCCEECCHHHHHHHHHHHHC-CCCCEEECCEEEECCCCCEEEEECCCCCCH-HHHHCCCCCCCCCEEEEEEEHHHH T ss_conf 224888626844999999999878-994099934699869996875422665367-888546886665135640304550 Q ss_pred HHHHCCCCCHHHHHH-HHHHHCCCCEEEEECCCEE Q ss_conf 996254578046887-9999759817999603131 Q gi|254781114|r 166 LNVRKNIYRPIDMDM-KHWWEHNIPSLVTEPGAVY 199 (263) Q Consensus 166 l~~~~~~~~pvD~~l-~~~~~~~l~~~~~~P~~v~ 199 (263) =-.-..+...-|..+ -+....+.++..+.-+++. T Consensus 158 g~fd~~~~~~eD~d~~~R~~~~g~~i~~ip~~l~~ 192 (202) T cd06433 158 GGFDESYRIAADYDLLLRLLLAGKIFKYLPEVLAA 192 (202) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCEEECCCCEEE T ss_conf 89886768022999999999839939984877899 No 13 >KOG2791 consensus Probab=85.90 E-value=2.4 Score=21.09 Aligned_cols=91 Identities=10% Similarity=0.199 Sum_probs=59.3 Q ss_pred HHHHH--HHCCCCEEEEEEECCCCCCCHHHHHHHHHHC----CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCC Q ss_conf 99999--8279985999730221330089999999841----55377544511366542221123467770589940579 Q gi|254781114|r 76 LWKRI--AYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS 149 (263) Q Consensus 76 ~w~~~--~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~----~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 149 (263) +|..+ ...-+..+|.||.|-.+.|++-.+++.+..+ -.+.+++.|+.......++ .. .....++ ... T Consensus 242 Vwd~ve~~~~~~g~iLflEEDH~LaPdayhv~~~l~~lkp~~Cp~C~~~sLg~y~s~sryG---qD-~~~v~~w---~s~ 314 (455) T KOG2791 242 VWDGVEETKGHEGHILFLEEDHFLAPDAYHVIQTLTRLKPAKCPDCFAASLGPYDSKSRYG---QD-EGLVSLW---ASR 314 (455) T ss_pred HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCC---CC-CCCEEEH---HHH T ss_conf 9988997336776589971343337018999999984485559865055023401223245---44-4552211---331 Q ss_pred CCCEEEEECHHHHHHHHHHHCCCC Q ss_conf 851235666999999999625457 Q gi|254781114|r 150 PRTTGYFIGKEAAIHLLNVRKNIY 173 (263) Q Consensus 150 ~gt~aY~Is~~~AkkLl~~~~~~~ 173 (263) .+..||.++|.+-+|+...+..+. T Consensus 315 ~hNmG~al~rn~wqki~~c~~~FC 338 (455) T KOG2791 315 MHNMGYALNRNVWQKIHQCAREFC 338 (455) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 466304441779999998677631 No 14 >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=84.46 E-value=2.8 Score=20.69 Aligned_cols=160 Identities=12% Similarity=-0.017 Sum_probs=79.3 Q ss_pred CCHHHHHHHHHHHHHCCC-CEEEEECCC-CCCCCHHHHHHC--CHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 988999999999987799-789996637-352896774202--0000246788788789999999999999998279985 Q gi|254781114|r 12 FSHARREKFCHRAARIHL-QFSFFDAIY-GENNPICNRIFS--HQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIG 87 (263) Q Consensus 12 ~~~~Rr~~~~~~~~~~gi-~~e~~~Avd-g~~~~~~~~~~~--~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~ 87 (263) ++.+....+.+.+.++.. ++|++= || |......+.... ........ ..+.-.|.+...-..++.+.+++.++ T Consensus 7 N~~~~l~~~L~Sl~~q~~~~~eIiV-VDn~StD~t~~~l~~~~~~~~v~~i---~~~~N~G~a~~~N~Gi~~a~~~~~d~ 82 (202) T cd04185 7 NRLDLLKECLDALLAQTRPPDHIIV-IDNASTDGTAEWLTSLGDLDNIVYL---RLPENLGGAGGFYEGVRRAYELGYDW 82 (202) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEE-EECCCCCCHHHHHHHHCCCCCEEEE---ECCCCCCHHHHHHHHHHHHHHCCCEE T ss_conf 8889999999999827799988999-9794987616653000356668998---56888732789999999887559818 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHHHHHHH Q ss_conf 99973022133008999999984155377544511366542221123467770589940579851235666999999999 Q gi|254781114|r 88 AIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLN 167 (263) Q Consensus 88 ~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkLl~ 167 (263) ++++.||+.+.++..+.+-.....+ +..+ +.... ....+ +-.|.++.++..+++= T Consensus 83 v~~ld~D~~~~~~~l~~l~~~~~~~-~~~~--~~~~~---------~~~~~------------~F~g~l~~r~v~~~vG- 137 (202) T cd04185 83 IWLMDDDAIPDPDALEKLLAYADKD-NPQF--LAPLV---------LDPDG------------SFVGVLISRRVVEKIG- 137 (202) T ss_pred EEEECCCCCCCCCHHHHHHHHHHCC-CCEE--EEEEE---------ECCCC------------CEEEEEEEHHHHHHHC- T ss_conf 9998998754922999999986579-9779--99789---------84799------------9899988899999758- Q ss_pred HHCC-CC-CHHHHHH-HHHHHCCCCEEEEECCCEECCC Q ss_conf 6254-57-8046887-9999759817999603131267 Q gi|254781114|r 168 VRKN-IY-RPIDMDM-KHWWEHNIPSLVTEPGAVYEAI 202 (263) Q Consensus 168 ~~~~-~~-~pvD~~l-~~~~~~~l~~~~~~P~~v~~~~ 202 (263) ...+ +. .--|..+ .+....+.++|. |..+..+. T Consensus 138 ~fde~~f~~~eD~D~~~R~~~~G~~i~~--p~a~v~H~ 173 (202) T cd04185 138 LPDKEFFIWGDDTEYTLRASKAGPGIYV--PDAVVVHK 173 (202) T ss_pred CCCHHHCCCCCHHHHHHHHHHCCCCEEE--CCEEEEEC T ss_conf 9976866738299999999974996997--78899987 No 15 >pfam05060 MGAT2 N-acetylglucosaminyltransferase II (MGAT2). UDP-N-acetyl-D-glucosamine:alpha-6-D-mannoside beta-1,2-N- acetylglucosaminyltransferase II (EC 2.4.1.143) (GnT II/MGAT2) is a Golgi resident enzyme that catalyses an essential step in the biosynthetic pathway leading from high mannose to complex N-linked oligosaccharides. Mutations in the MGAT2 gene lead to congenital disorder of glycosylation (CDG IIa). CDG IIa patients have an increased bleeding tendency, unrelated to coagulation factors. Probab=76.48 E-value=5.4 Score=19.09 Aligned_cols=92 Identities=8% Similarity=0.097 Sum_probs=57.9 Q ss_pred HHHHH--HHCCCCEEEEEEECCCCCCCHHHHHHHHHHC----CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCC Q ss_conf 99999--8279985999730221330089999999841----55377544511366542221123467770589940579 Q gi|254781114|r 76 LWKRI--AYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILS 149 (263) Q Consensus 76 ~w~~~--~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~----~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 149 (263) +|..+ ...-.-.+|.+|+|-.+.|+|...+..+... -.+.+++.|+.......+. ..........+. T Consensus 156 V~d~l~~t~~~~G~vlflEEDHyvapDf~~~l~lm~~~k~~~Cp~C~ilsLG~y~~~~~~~-------~~~~~~ev~~W~ 228 (357) T pfam05060 156 VWDGLEETKGHDGHVLFLEEDHYLAPDAYHVLQLMQNLKPEKCPDCYALSLGPYDKSRGEG-------WDSLVAEVMGWV 228 (357) T ss_pred HHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCC-------CCCCEEEEECCC T ss_conf 8866210367763089982565426739999999987360349977777423444567766-------654336773354 Q ss_pred C--CCEEEEECHHHHHHHHHHHCCCCC Q ss_conf 8--512356669999999996254578 Q gi|254781114|r 150 P--RTTGYFIGKEAAIHLLNVRKNIYR 174 (263) Q Consensus 150 ~--gt~aY~Is~~~AkkLl~~~~~~~~ 174 (263) . -..||.++|..-+++...+..|.. T Consensus 229 SskHNmGma~nR~~w~kI~~Ca~~FC~ 255 (357) T pfam05060 229 SPKHNMGMAFNRTVWRKIHGCAEEFCT 255 (357) T ss_pred CCCCCCEEEECHHHHHHHHHHHHHHCC T ss_conf 666552368749999999999997488 No 16 >pfam04666 Glyco_transf_54 N-Acetylglucosaminyltransferase-IV (GnT-IV) conserved region. The complex-type of oligosaccharides are synthesized through elongation by glycosyltransferases after trimming of the precursor oligosaccharides transferred to proteins in the endoplasmic reticulum. N-Acetylglucosaminyltransferases (GnTs) take part in the formation of branches in the biosynthesis of complex-type sugar chains. In vertebrates, six GnTs, designated as GnT-I to -VI, which catalyse the transfer of GlcNAc to the core mannose residues of Asn-linked sugar chains, have been identified. GnT-IV (EC:2.4.1.145) catalyses the transfer of GlcNAc from UDP-GlcNAc to the GlcNAc1-2Man1-3 arm of core oligosaccharide [Gn2(22)core oligosaccharide] and forms GlcNAc1-4(GlcNAc1-2)Man1-3 structure on the core oligosaccharide (Gn3(2,4,2)core oligosaccharide). In some members the conserved region occupies all but the very for N-terminal, where there is a signal sequence on all members. For other members the co Probab=73.08 E-value=6.6 Score=18.58 Aligned_cols=86 Identities=16% Similarity=0.140 Sum_probs=49.0 Q ss_pred HHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHC--CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEE Q ss_conf 99998279985999730221330089999999841--5537754451136654222112346777058994057985123 Q gi|254781114|r 77 WKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC--DINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTG 154 (263) Q Consensus 77 w~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~--~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~a 154 (263) ...|. ....|-|-||||++-.++|.+.+..+... ..+|.++.+...+--.+ +++. T Consensus 167 m~Ya~-~~~~YYlqLEDDV~~~~~f~t~Ik~~~~~~~~~~W~~LeFs~lGfIGK-------------LF~s--------- 223 (301) T pfam04666 167 MNYAQ-NLGTYYLQLEDDVVAKKGFLTTIKNFALSQTSSEWVVLEFSQLGFIGK-------------LFRS--------- 223 (301) T ss_pred HHHHC-CCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCHHHH-------------HHCC--------- T ss_conf 99860-448046875255245641899999999964689739998215770244-------------2240--------- Q ss_pred EEECHHHHHHHHHHHCCCCCHHHHHHHHHHHCCC Q ss_conf 5666999999999625457804688799997598 Q gi|254781114|r 155 YFIGKEAAIHLLNVRKNIYRPIDMDMKHWWEHNI 188 (263) Q Consensus 155 Y~Is~~~AkkLl~~~~~~~~pvD~~l~~~~~~~l 188 (263) .-+ +.-|+-++ +.-...|+|+.+..+....+ T Consensus 224 ~dL-~~l~~F~~--mFy~d~P~DwLL~~f~~~~~ 254 (301) T pfam04666 224 KDL-PRLVEFLL--MFYKDKPIDWLLDHFRLLLV 254 (301) T ss_pred CCH-HHHHHHHH--HHHHCCCHHHHHHHHHHHHC T ss_conf 228-99999999--98427864999999998731 No 17 >pfam04908 SH3BGR SH3-binding, glutamic acid-rich protein. Probab=61.97 E-value=11 Score=17.25 Aligned_cols=37 Identities=22% Similarity=0.215 Sum_probs=29.2 Q ss_pred CCCCEEEEECCCCHH---HHHHHHHHHHHCCCCEEEEECC Q ss_conf 988689980899889---9999999998779978999663 Q gi|254781114|r 1 MPIPVYVISLPFSHA---RREKFCHRAARIHLQFSFFDAI 37 (263) Q Consensus 1 m~~kiyvInL~~~~~---Rr~~~~~~~~~~gi~~e~~~Av 37 (263) |-+|||+=++..+++ |.+.+..-++..+|+||-+|=. T Consensus 1 mvikVy~ss~sg~~~vKk~Qq~v~~iL~s~kI~ye~vDIt 40 (92) T pfam04908 1 MVLKVYVASSSGSPEIKKKQQRVLMILDANKIPFDEVDIT 40 (92) T ss_pred CEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEECC T ss_conf 9799999505187889999999999999859973898555 No 18 >PTZ00215 ribose 5-phosphate isomerase; Provisional Probab=58.08 E-value=13 Score=16.85 Aligned_cols=35 Identities=14% Similarity=0.010 Sum_probs=30.3 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 98868998089988999999999987799789996 Q gi|254781114|r 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFD 35 (263) Q Consensus 1 m~~kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~ 35 (263) |.|||++=+=-.-.+-++.+.+.|...|.+++.+| T Consensus 1 M~mkI~igsDHaG~~lK~~l~~~L~~~g~~~~V~D 35 (156) T PTZ00215 1 MTRRVAIGCDHAAFAIHEEIMDYVSAAGDEFKVMY 35 (156) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99639998284489999999999997699569896 No 19 >cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=57.27 E-value=14 Score=16.77 Aligned_cols=177 Identities=8% Similarity=-0.071 Sum_probs=74.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHCC--H-HCCCCCCCCCCHHHHHHHHHHHHHHHHHH-HCCCCEEEEEEE Q ss_conf 9999999877997899966373528967742020--0-00246788788789999999999999998-279985999730 Q gi|254781114|r 18 EKFCHRAARIHLQFSFFDAIYGENNPICNRIFSH--Q-KRQCQFKRLLSLPEIGCYISHIHLWKRIA-YSPAIGAIILED 93 (263) Q Consensus 18 ~~~~~~~~~~gi~~e~~~Avdg~~~~~~~~~~~~--~-~~~~~~~~~l~~geigC~lSH~~~w~~~~-~~~~~~~lIlED 93 (263) +.+..-+++.--++|++=-=||......+..-.. . ......-+ ..+-+|++.+ |...+ ..+.+++.++.+ T Consensus 15 ~~i~Sil~Qt~~~~EiivvDDgStD~t~~ii~~~~~~~~~~~~~~~--~~~n~G~~~n----~n~gi~~a~geyi~~lD~ 88 (214) T cd04196 15 EQLDSILAQTYKNDELIISDDGSTDGTVEIIKEYIDKDPFIIILIR--NGKNLGVARN----FESLLQAADGDYVFFCDQ 88 (214) T ss_pred HHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEE--CCCCCHHHHH----HHHHHHHCCCCEEECCCC T ss_conf 9999998379988699999899972379999999975886513666--5888059999----999998647875741377 Q ss_pred CCCCCCCHHH-HHHHHHHCCCCCCEEEEEC--CCCCCCC-CCC---CCCCCCCCEE-EEECCCCCCCEEEEECHHHHHHH Q ss_conf 2213300899-9999984155377544511--3665422-211---2346777058-99405798512356669999999 Q gi|254781114|r 94 DADFSDEFSQ-LLPHLSKCDINNILIKFDA--LRKKPKK-DSY---LCTLPGNFDI-HQPRILSPRTTGYFIGKEAAIHL 165 (263) Q Consensus 94 D~~~~~~f~~-~l~~~~~~~~~~d~~~l~~--~~~~~~~-~~~---~~~~~~~~~l-~~~~~~~~gt~aY~Is~~~AkkL 165 (263) |-++.|++.+ .++.+... .+.+++.-.. ....... ... .......... ......+....+.++.++..++. T Consensus 89 DD~~~p~~l~~~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~Rk~~~~~~ 167 (214) T cd04196 89 DDIWLPDKLERLLKAFLKD-DKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVTGCTMAFNRELLELA 167 (214) T ss_pred CCCCCHHHHHHHHHHHHHC-CCEEEEECCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHH T ss_conf 6562865999999999858-990899927399989998923443333567702339999971888761899889999985 Q ss_pred HHHHCCCCCHHHHHHHHHHHCCCCEEEEECCCEECC Q ss_conf 996254578046887999975981799960313126 Q gi|254781114|r 166 LNVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEA 201 (263) Q Consensus 166 l~~~~~~~~pvD~~l~~~~~~~l~~~~~~P~~v~~~ 201 (263) ...-......-|+++-.......+++.+.-.++.-+ T Consensus 168 g~~~~~~~~~~D~~l~l~~~~~~~i~~i~~~l~~YR 203 (214) T cd04196 168 LPFPDADVIMHDWWLALLASAFGKVVFLDEPLILYR 203 (214) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCCEEEECCCEEEEE T ss_conf 799944474549999999996898999889879999 No 20 >TIGR01990 bPGM beta-phosphoglucomutase; InterPro: IPR010972 This model represents the beta-phosphoglucomutase enzyme which catalyses the interconversion of beta-D-glucose-1-phosphate and beta-D-glucose-6-phosphate. The 6-phosphate is capable of non-enzymatic anomerization (alpha to beta or vice versa) while the 1-phosphate is not. A separate enzyme is responsible for the isomerisation of the alpha anomers. Beta-D-glucose-1-phosphate results from the phosphorolysis of maltose (2.4.1.8 from EC), trehalose (2.4.1.64 from EC) or trehalose-6-phosphate (2.4.1.216 from EC). Alternatively, these reactions can be run in the synthetic direction to create the disaccharides. All sequenced genomes which contain a member of this family also appear to contain at least one putative maltose or trehalose phosphorylase. Three species, Lactococcus, Enterococcus and Neisseria appear to contain a pair of paralogous beta-PGM's. Beta-phosphoglucomutase is a member of the haloacid dehalogenase superfamily of hydrolase enzymes. These enzymes are characterised by a series of three catalytic motifs positioned within an alpha-beta (Rossman) fold . Beta-PGM contains an inserted alpha helical domain in between the first and second conserved motifs and thus is a member of subfamily IA of the superfamily , . The third catalytic motif comes in three variants, the third of which, containing a conserved DD or ED, is the only one found here as well as in several other related enzymes. The enzyme from Lactococcus lactis has been extensively characterised including a remarkable crystal structure which traps the pentacoordinate transition state . . Probab=56.13 E-value=5.8 Score=18.90 Aligned_cols=12 Identities=33% Similarity=0.581 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999999827 Q gi|254781114|r 72 SHIHLWKRIAYS 83 (263) Q Consensus 72 SH~~~w~~~~~~ 83 (263) =||++||+++++ T Consensus 18 yHy~AWk~~Ade 29 (190) T TIGR01990 18 YHYLAWKHLADE 29 (190) T ss_pred HHHHHHHHHHHH T ss_conf 889999999988 No 21 >TIGR01533 lipo_e_P4 5'-nucleotidase, lipoprotein e(P4) family; InterPro: IPR006423 This group of sequences represents a set of bacterial lipoproteins belonging to a larger acid phosphatase family, which in turn belongs to the haloacid dehalogenase (HAD) superfamily of aspartate-dependent hydrolases. Members are found on the outer membrane of Gram-negative bacteria and the cytoplasmic membrane of Gram-positive bacteria. Most members have classic lipoprotein signal sequences. A critical role of this 5'-nucleotidase in Haemophilus influenzae is the degradation of external riboside in order to allow transport into the cell. An earlier suggested role in hemin transport is no longer current. This enzyme may also have other physiologically significant roles.. Probab=55.69 E-value=12 Score=17.18 Aligned_cols=28 Identities=11% Similarity=0.158 Sum_probs=12.4 Q ss_pred CCEEEEECCCC-HHHHHHHHHHHHHCCCCE Q ss_conf 86899808998-899999999998779978 Q gi|254781114|r 3 IPVYVISLPFS-HARREKFCHRAARIHLQF 31 (263) Q Consensus 3 ~kiyvInL~~~-~~Rr~~~~~~~~~~gi~~ 31 (263) ++||+|+ +|+ ..-+....+.|.+.|+|+ T Consensus 160 v~iFYvs-nR~~~~~~~aTl~nLk~~g~P~ 188 (295) T TIGR01533 160 VKIFYVS-NRSDEKEKEATLENLKKKGFPQ 188 (295) T ss_pred CEEEEEE-CCCCCHHHHHHHHHHHHCCCCC T ss_conf 2589965-6873012442079998638795 No 22 >pfam00483 NTP_transferase Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars. Probab=53.52 E-value=16 Score=16.40 Aligned_cols=123 Identities=11% Similarity=0.068 Sum_probs=60.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEE Q ss_conf 87899999999999999982799859997302213300899999998415537754451136654222112346777058 Q gi|254781114|r 63 SLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDI 142 (263) Q Consensus 63 ~~geigC~lSH~~~w~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l 142 (263) |.|.+.+++.++ -+.+.+..+|+-+|.++..+|.+.++.......+..+.... ................+..+ T Consensus 86 ta~ai~~a~~~l------~~~~~~~~lv~~~D~~~~~~~~~~l~~h~~~~~~~~~~~~~-~~~~~~~~yGvv~~d~~~~V 158 (247) T pfam00483 86 TAPAVALAADFL------GDDDPELVLVLGGDHIYRMDFEEAVQKARAKAADGTVTFGI-VPVEDPTGYGVIEFDENGRV 158 (247) T ss_pred CHHHHHHHHHHH------CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEE-ECCCCCCCCCEEEECCCCCC T ss_conf 056899999864------34787627997785334440999999999847998688855-65557634514898886650 Q ss_pred ---E-EEC----CCCCCCEEEEECHHHHHHHHHHHCCCCCH---HHHHHHHHHHCCCCEEE Q ss_conf ---9-940----57985123566699999999962545780---46887999975981799 Q gi|254781114|r 143 ---H-QPR----ILSPRTTGYFIGKEAAIHLLNVRKNIYRP---IDMDMKHWWEHNIPSLV 192 (263) Q Consensus 143 ---~-~~~----~~~~gt~aY~Is~~~AkkLl~~~~~~~~p---vD~~l~~~~~~~l~~~~ 192 (263) . .+. .....+..|+++++.-..+++...+...- +-..+......+...+. T Consensus 159 ~~~~EKP~~~~~~~~~~~G~Y~~~~~i~~~~~~~~~~~~~~e~~~~d~i~~~i~~g~~~~~ 219 (247) T pfam00483 159 IRFVEKPDLPKASNYASMGIYFFNSGVFLFLAKYLKELARGEDEITDAIPAALEDGDLDLA 219 (247) T ss_pred CCEEECCCCHHHHHHEEEEEEEECHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 3046779962443425611499889999999965777767994499999999987996399 No 23 >PRK03600 nrdI ribonucleotide reductase stimulatory protein; Reviewed Probab=50.35 E-value=14 Score=16.63 Aligned_cols=29 Identities=21% Similarity=0.120 Sum_probs=21.8 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 98868998089988999999999987799789996 Q gi|254781114|r 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFD 35 (263) Q Consensus 1 m~~kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~ 35 (263) |+|.||.+|+...++|-- ++++++..+++ T Consensus 1 m~m~V~y~S~TGNt~rFv------~kL~~~~~~i~ 29 (136) T PRK03600 1 MSMLVYFSSKTGNTHRFV------QKLGLPAVRIP 29 (136) T ss_pred CCEEEEEECCCCCHHHHH------HHHCCCCCCCC T ss_conf 977999985875799999------98288201156 No 24 >KOG1413 consensus Probab=49.81 E-value=18 Score=16.06 Aligned_cols=183 Identities=12% Similarity=0.054 Sum_probs=85.2 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCC---CEEEEECCCCCCCCHHHH--HHCCHHCCCCCC----CCC-------CHHH Q ss_conf 868998089988999999999987799---789996637352896774--202000024678----878-------8789 Q gi|254781114|r 3 IPVYVISLPFSHARREKFCHRAARIHL---QFSFFDAIYGENNPICNR--IFSHQKRQCQFK----RLL-------SLPE 66 (263) Q Consensus 3 ~kiyvInL~~~~~Rr~~~~~~~~~~gi---~~e~~~Avdg~~~~~~~~--~~~~~~~~~~~~----~~l-------~~ge 66 (263) +||-|..-.+. +-..++.+++-.+-- .|-++-+=|+.+.+..+. .|.......... ... ...+ T Consensus 69 ~pvvVf~csR~-~~lr~~v~kll~yrPsaekfpiiVSQD~~~e~vk~~~~~~g~~v~~i~~~~h~~~ei~v~~~~~k~~~ 147 (411) T KOG1413 69 IPVVVFACSRA-DALRRHVKKLLEYRPSAEKFPIIVSQDCEKEAVKKKLLSYGSDVSHIQHPMHLKDEISVPPRHKKFNA 147 (411) T ss_pred EEEEEEECCCH-HHHHHHHHHHHHHCCCHHHCCEEEECCCCCHHHHHHHHHHCCCHHHHCCCCCCCCCCCCCCCCCCCCH T ss_conf 35888754759-99999999998728314536778832687377999999841505544075334454145775000231 Q ss_pred HHHHHHHHHHH--HHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHC----CCCCCEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 99999999999--9998279985999730221330089999999841----55377544511366542221123467770 Q gi|254781114|r 67 IGCYISHIHLW--KRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKC----DINNILIKFDALRKKPKKDSYLCTLPGNF 140 (263) Q Consensus 67 igC~lSH~~~w--~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~----~~~~d~~~l~~~~~~~~~~~~~~~~~~~~ 140 (263) .=-..-||+-- +-++..+++.++|.|||....++|.+.......+ +.-|-+-.+-..+++. .+...... T Consensus 148 Yy~IarHYkwAL~q~F~~~~~s~vii~eDDl~iapDFF~YF~~t~~llk~D~siwcvsaWNDNGk~~-----~Id~~~~~ 222 (411) T KOG1413 148 YYKIARHYKWALNQLFIVFRESRVIITEDDLNIAPDFFSYFRNTIILLKGDPSIWCVSAWNDNGKKQ-----TIDSTRPS 222 (411) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCC-----CCCCCCCC T ss_conf 6889999999986477652775368832314434689999987799872698537753204689865-----32356610 Q ss_pred EEEEECCCCCCCEEEEECHHHHHHHHHHHCCCC--CHHHHHHHHHHHCCCCEEEEECCCE Q ss_conf 589940579851235666999999999625457--8046887999975981799960313 Q gi|254781114|r 141 DIHQPRILSPRTTGYFIGKEAAIHLLNVRKNIY--RPIDMDMKHWWEHNIPSLVTEPGAV 198 (263) Q Consensus 141 ~l~~~~~~~~gt~aY~Is~~~AkkLl~~~~~~~--~pvD~~l~~~~~~~l~~~~~~P~~v 198 (263) .+++. .++. ..|.+++ +++.+...|.+ .-.|.||...-.++.+.+ +.|-+. T Consensus 223 ~lYRt-DFFp-GLGWml~----~~~W~ELsp~wP~~fWDDWmr~pe~rK~R~c-IRPEiS 275 (411) T KOG1413 223 LLYRT-DFFP-GLGWMLT----KKLWEELSPKWPVAFWDDWMRIPENRKGRQC-IRPEIS 275 (411) T ss_pred HHHHC-CCCC-CCHHHHH----HHHHHHHCCCCCCCCHHHHHHCHHHHCCCCC-CCHHHH T ss_conf 20113-5555-4047878----9889861877764321565426565235521-373764 No 25 >COG3426 Butyrate kinase [Energy production and conversion] Probab=48.70 E-value=19 Score=15.95 Aligned_cols=26 Identities=23% Similarity=0.341 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 99999999987799789996637352 Q gi|254781114|r 16 RREKFCHRAARIHLQFSFFDAIYGEN 41 (263) Q Consensus 16 Rr~~~~~~~~~~gi~~e~~~Avdg~~ 41 (263) |++.+.+.+...|++++-|+||=|+. T Consensus 51 R~~~i~~~i~e~g~~i~~~dAvvgRG 76 (358) T COG3426 51 RKDAILEFIDEQGYNISKFDAVVGRG 76 (358) T ss_pred HHHHHHHHHHHHCCCCCCCCCEEECC T ss_conf 99999999997187757740013057 No 26 >COG1519 KdtA 3-deoxy-D-manno-octulosonic-acid transferase [Cell envelope biogenesis, outer membrane] Probab=42.17 E-value=24 Score=15.36 Aligned_cols=20 Identities=30% Similarity=0.355 Sum_probs=11.4 Q ss_pred CCCEEEEEC---CCCHHHHHHHH Q ss_conf 886899808---99889999999 Q gi|254781114|r 2 PIPVYVISL---PFSHARREKFC 21 (263) Q Consensus 2 ~~kiyvInL---~~~~~Rr~~~~ 21 (263) .+|++++|= ++|-.|...+. T Consensus 146 ~~p~~LvNaRLS~rS~~~y~k~~ 168 (419) T COG1519 146 GIPLVLVNARLSDRSFARYAKLK 168 (419) T ss_pred CCCEEEEEEEECHHHHHHHHHHH T ss_conf 99899994230232577798778 No 27 >COG1245 Predicted ATPase, RNase L inhibitor (RLI) homolog [General function prediction only] Probab=36.08 E-value=30 Score=14.79 Aligned_cols=28 Identities=18% Similarity=0.265 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHCCCCEEEEEEECCCCCC Q ss_conf 99999999998279985999730221330 Q gi|254781114|r 71 ISHIHLWKRIAYSPAIGAIILEDDADFSD 99 (263) Q Consensus 71 lSH~~~w~~~~~~~~~~~lIlEDD~~~~~ 99 (263) ++--++.+..++. ..+++|.|+|....+ T Consensus 250 l~~ar~Irel~~~-~k~ViVVEHDLavLD 277 (591) T COG1245 250 LNAARVIRELAED-GKYVIVVEHDLAVLD 277 (591) T ss_pred HHHHHHHHHHHCC-CCEEEEEECHHHHHH T ss_conf 8799999998626-986999933478998 No 28 >cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=35.58 E-value=30 Score=14.74 Aligned_cols=98 Identities=11% Similarity=0.062 Sum_probs=44.4 Q ss_pred HHHHHHHHHHHHHCC-CCEEEEECCCCCCCCHHHHH---HCCH-HCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 899999999998779-97899966373528967742---0200-002467887887899999999999999982799859 Q gi|254781114|r 14 HARREKFCHRAARIH-LQFSFFDAIYGENNPICNRI---FSHQ-KRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGA 88 (263) Q Consensus 14 ~~Rr~~~~~~~~~~g-i~~e~~~Avdg~~~~~~~~~---~~~~-~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~~ 88 (263) .+-.+...+.+.++. -++|.+=--||...+..... +... ....... .+...|.+.+--.+.+. ...+++ T Consensus 14 ~~~l~~~l~Si~~Qt~~~~EiIvvDd~StD~~~~~~~~~~~~~~~~i~~~~---~~~n~G~~~a~N~gi~~---a~geyi 87 (202) T cd04184 14 EKYLREAIESVRAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPRIKVVF---REENGGISAATNSALEL---ATGEFV 87 (202) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCC---CCCCCCHHHHHHHHHHC---CCCCEE T ss_conf 699999999998378998799999899894789999999885188520012---67887899998864513---687667 Q ss_pred EEEEECCCCCCCHH-HHHHHHHHCCCCCCEE Q ss_conf 99730221330089-9999998415537754 Q gi|254781114|r 89 IILEDDADFSDEFS-QLLPHLSKCDINNILI 118 (263) Q Consensus 89 lIlEDD~~~~~~f~-~~l~~~~~~~~~~d~~ 118 (263) +++.+|..+.++.. ..++.+... .+.+++ T Consensus 88 ~flD~DD~~~p~~l~~~~~~~~~~-~~~~~v 117 (202) T cd04184 88 ALLDHDDELAPHALYEVVKALNEH-PDADLI 117 (202) T ss_pred EECCCCCEECHHHHHHHHHHHHHC-CCCCEE T ss_conf 732778552843999999999868-993599 No 29 >TIGR01469 cobA_cysG_Cterm uroporphyrin-III C-methyltransferase; InterPro: IPR006366 This domain is found in a family of sequences that represent enzymes, or enzyme domains, with uroporphyrin-III C-methyltransferase activity. This enzyme catalyzes the first step committed to the biosynthesis of either siroheme or cobalamin (vitamin B12) rather than protoheme (heme). Cobalamin contains cobalt while siroheme contains iron. Siroheme is a cofactor for nitrite and sulphite reductases and therefore plays a role in cysteine biosynthesis; many members of this family are CysG, siroheme synthase, with an additional N-terminal domain and with additional oxidation and iron insertion activities.; GO: 0008168 methyltransferase activity, 0006779 porphyrin biosynthetic process. Probab=34.07 E-value=29 Score=14.85 Aligned_cols=86 Identities=12% Similarity=0.077 Sum_probs=37.4 Q ss_pred HHHHHCCCCEEEEECCCCC-------CCCHHHHHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 9998779978999663735-------289677420200002467887887899999999999999982799859997302 Q gi|254781114|r 22 HRAARIHLQFSFFDAIYGE-------NNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD 94 (263) Q Consensus 22 ~~~~~~gi~~e~~~Avdg~-------~~~~~~~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~~lIlEDD 94 (263) ..|.+.||+||.+++|+-. .+|.+...+..-.. ...|+.-.+++ | .+----|+..++.+ ++++|+-=. T Consensus 100 ~~L~~~GI~~EvvPGvTsA~Aa~a~aGiPlThR~~~~s~~-f~TGh~~~~~~--~-~~~~~~W~~lA~~~-~Tlv~yMG~ 174 (242) T TIGR01469 100 EALAEAGIPFEVVPGVTSAIAAAAYAGIPLTHRGLARSVT-FVTGHEAKEGE--E-ALNEVDWEALAKGG-GTLVIYMGV 174 (242) T ss_pred HHHHHCCCCEEEECCCCHHHHHHHHCCCCCCCCCCCCCEE-EEEEEEECCCC--C-CCCCCCHHHCCCCC-CCEEEEHHH T ss_conf 9999668979873881167789985589954130217316-88535406888--7-76757866715767-726886217 Q ss_pred CCCCCCHHHHHHHHHHCCCCCC Q ss_conf 2133008999999984155377 Q gi|254781114|r 95 ADFSDEFSQLLPHLSKCDINNI 116 (263) Q Consensus 95 ~~~~~~f~~~l~~~~~~~~~~d 116 (263) . ++.+....+...-.+-| T Consensus 175 ~----~l~~I~~~Li~~G~~~~ 192 (242) T TIGR01469 175 A----NLAEIAKELIEAGRDPD 192 (242) T ss_pred H----HHHHHHHHHHHCCCCCC T ss_conf 8----89999999996388967 No 30 >cd02508 ADP_Glucose_PP ADP-glucose pyrophosphorylase is involved in the biosynthesis of glycogen or starch. ADP-glucose pyrophosphorylase (glucose-1-phosphate adenylyltransferase) catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. ADP-glucose pyrophosphorylase is a tetrameric allosterically regulated enzyme. While a homotetramer in bacteria, in plant chloroplasts and amyloplasts, it is a heterotetramer of two different, yet evolutionary related, subunits. There are a number of conserved regions in the sequence of bacterial and plant ADP-glucose pyrophosphorylase subunits. It is a subfamily of a very diverse glycosy transferase family 2. Probab=31.68 E-value=35 Score=14.37 Aligned_cols=76 Identities=16% Similarity=0.143 Sum_probs=50.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 99999999999999827998599973022133008999999984155377544511366542221123467770589940 Q gi|254781114|r 67 IGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPR 146 (263) Q Consensus 67 igC~lSH~~~w~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~ 146 (263) .|.+-+-.++...+-+++.++.+|+-=|.+..-++.++++.-.+...+.-++.. T Consensus 93 ~Gt~~~i~~~~~~l~~~~~~~~lv~ngD~l~~~D~~~~l~~H~~~~a~~Ti~~~-------------------------- 146 (200) T cd02508 93 RGTADAIYQNLDYIERSDPEYVLILSGDHIYNMDYREMLDFHIESGADITVVYK-------------------------- 146 (200) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEECCCEEEECCHHHHHHHHHHCCCCEEEEEE-------------------------- T ss_conf 347899999999872178872999769978706999999999876999899998-------------------------- Q ss_pred CCCCCCEEEEECHHHHHHHHHHHCC Q ss_conf 5798512356669999999996254 Q gi|254781114|r 147 ILSPRTTGYFIGKEAAIHLLNVRKN 171 (263) Q Consensus 147 ~~~~gt~aY~Is~~~AkkLl~~~~~ 171 (263) ..+..|+++++.-..+++...+ T Consensus 147 ---in~GiYi~~~~~l~~~l~~~~~ 168 (200) T cd02508 147 ---ASMGIYIFSKDLLIELLEEDAA 168 (200) T ss_pred ---EECEEEEEEHHHHHHHHHHHCC T ss_conf ---5048999808999999986324 No 31 >pfam03445 DUF294 Putative nucleotidyltransferase DUF294. This domain is found associated with pfam00571. This region is uncharacterized, however it seems to be similar to pfam01909, conserving the DXD motif. This strongly suggests that members of this family are also nucleotidyltransferases (Bateman A pers. obs.). Probab=29.23 E-value=31 Score=14.74 Aligned_cols=19 Identities=16% Similarity=0.279 Sum_probs=14.7 Q ss_pred HHHHHCCCCEEEEEEECCC Q ss_conf 9998279985999730221 Q gi|254781114|r 78 KRIAYSPAIGAIILEDDAD 96 (263) Q Consensus 78 ~~~~~~~~~~~lIlEDD~~ 96 (263) ++.+.++.+.+||+||+.. T Consensus 62 Eq~l~tDQDnali~~d~~~ 80 (168) T pfam03445 62 EQTLKTDQDNALVLADEEG 80 (168) T ss_pred CCCCCCCCCEEEEECCCCC T ss_conf 5667755464688559999 No 32 >PRK10063 predicted glycosyl transferase; Provisional Probab=29.03 E-value=39 Score=14.10 Aligned_cols=185 Identities=11% Similarity=0.032 Sum_probs=78.9 Q ss_pred CCEEEEEC-CCCHHHHHHHHHHH----HHCCCCEEEEECCCCCCCCHHHHHHCC-H-HCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 86899808-99889999999999----877997899966373528967742020-0-00246788788789999999999 Q gi|254781114|r 3 IPVYVISL-PFSHARREKFCHRA----ARIHLQFSFFDAIYGENNPICNRIFSH-Q-KRQCQFKRLLSLPEIGCYISHIH 75 (263) Q Consensus 3 ~kiyvInL-~~~~~Rr~~~~~~~----~~~gi~~e~~~Avdg~~~~~~~~~~~~-~-~~~~~~~~~l~~geigC~lSH~~ 75 (263) |++=||.. -++.+-.+...+.+ .+.+.++|.+ -|||...+.+...... . .....+ .+...-|-|-.-=+ T Consensus 1 M~iSIITv~~N~~~~l~~Tl~Sv~~~~~~~~~~~E~I-IIDGgStDgT~~ii~~~~~~~~~~~---iSEpD~GIYdAmNK 76 (248) T PRK10063 1 MLLSVVTVAFRNLEGIVKTHASLRHLAQDPGISFEWI-VVDGGSNDGTREYLENLNGIFNLRF---VSEPDNGIYDAMNK 76 (248) T ss_pred CEEEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEE-EEECCCCCCHHHHHHHCCCCCCEEE---EEECCCCCHHHHHH T ss_conf 9699999947498999999999999861878986999-9979396117999997066666799---99389880889885 Q ss_pred HHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCC--CCCCCCCCCCCCCCCEEEEECCCCCCCE Q ss_conf 99999827998599973022133008999999984155377544511366--5422211234677705899405798512 Q gi|254781114|r 76 LWKRIAYSPAIGAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRK--KPKKDSYLCTLPGNFDIHQPRILSPRTT 153 (263) Q Consensus 76 ~w~~~~~~~~~~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~gt~ 153 (263) -+.. .+| +++.+|--|-.+.++..+.+..+.....+ + +..+.... +........ .... .......+.+-+ T Consensus 77 GI~~--A~G-d~I~FLNsdD~~~~~~~~~~~~~~~~~~~-~-~~~g~~~~~~~~~~~~~r~-~~~~--~~~~~~~p~~Hq 148 (248) T PRK10063 77 GIAM--AQG-KFALFLNSGDIFHQDAALFVRQLKRQKDN-A-MITGDALLDFGDGHKIKRK-AKPG--WYIYHSLPASHQ 148 (248) T ss_pred HHHH--CCC-CEEEEECCCCCCCCCHHHHHHHHHHCCCC-E-EEEEEEEEECCCCCEEEEC-CCCH--HHHCCCCCCCCC T ss_conf 1987--788-88999838866484769999998717998-6-9994379981898689860-3781--563389999988 Q ss_pred EEEECHHHHHHHH-HHHCCCCCHHHHHHHHHHHCCCCEEEEECCCEECC Q ss_conf 3566699999999-96254578046887999975981799960313126 Q gi|254781114|r 154 GYFIGKEAAIHLL-NVRKNIYRPIDMDMKHWWEHNIPSLVTEPGAVYEA 201 (263) Q Consensus 154 aY~Is~~~AkkLl-~~~~~~~~pvD~~l~~~~~~~l~~~~~~P~~v~~~ 201 (263) |.++.++..++.. +...++.+=+|.++ +.+.++.+...+ |.++... T Consensus 149 a~f~rr~~~~~~~fD~~y~iaaDydl~l-Rl~~~g~~~~yl-~~vl~~~ 195 (248) T PRK10063 149 AIFFPVSGLKKWRYDLEYKVSSDYALAA-RLYKAGYAFKKL-NGLVSEF 195 (248) T ss_pred CEEEEHHHHHCCCCCCCCCCHHHHHHHH-HHHHCCCCEEEC-CCEEEEE T ss_conf 8788878962788999887426799999-999769978974-8135980 No 33 >PRK10710 DNA-binding transcriptional regulator BaeR; Provisional Probab=28.66 E-value=40 Score=14.06 Aligned_cols=21 Identities=10% Similarity=0.039 Sum_probs=12.6 Q ss_pred CCCEEEEECHHHHHHHHHHHC Q ss_conf 851235666999999999625 Q gi|254781114|r 150 PRTTGYFIGKEAAIHLLNVRK 170 (263) Q Consensus 150 ~gt~aY~Is~~~AkkLl~~~~ 170 (263) .|+..|+..|--.+.|+.... T Consensus 102 ~GAddYl~KPf~~~eLlari~ 122 (240) T PRK10710 102 IGADDYICKPYSPREVVARVK 122 (240) T ss_pred CCCCEEECCCCCHHHHHHHHH T ss_conf 699876007765389999999 No 34 >COG2047 Uncharacterized protein (ATP-grasp superfamily) [General function prediction only] Probab=28.26 E-value=28 Score=14.99 Aligned_cols=27 Identities=37% Similarity=0.488 Sum_probs=20.5 Q ss_pred CEEEEECHHHHHHHHHHHCC-CCCHHHH Q ss_conf 12356669999999996254-5780468 Q gi|254781114|r 152 TTGYFIGKEAAIHLLNVRKN-IYRPIDM 178 (263) Q Consensus 152 t~aY~Is~~~AkkLl~~~~~-~~~pvD~ 178 (263) |.||++.|++|+..|+.... ...-+|. T Consensus 191 T~GY~vDpkaAkaVL~vl~k~lgiEid~ 218 (258) T COG2047 191 TPGYLVDPKAAKAVLEVLCKMLGIEIDM 218 (258) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCEECH T ss_conf 8764228588999999999985855248 No 35 >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=27.88 E-value=41 Score=13.98 Aligned_cols=154 Identities=8% Similarity=-0.012 Sum_probs=71.7 Q ss_pred CCHHHHHHHHHHHHHCCC-CEEEEECCCCCCCCHHHHHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 988999999999987799-7899966373528967742020000246788788789999999999999998279985999 Q gi|254781114|r 12 FSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAII 90 (263) Q Consensus 12 ~~~~Rr~~~~~~~~~~gi-~~e~~~Avdg~~~~~~~~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~~lI 90 (263) ++.+-.....+.+.+... ++|.+=-=||......+............. .+...|...+.-..++.+ ..+++++ T Consensus 7 N~~~~l~~~l~Sl~~q~~~~~eiivVD~~S~d~t~~~~~~~~~~i~~i~---~~~n~G~~~a~N~g~~~a---~g~~i~~ 80 (166) T cd04186 7 NSLEYLKACLDSLLAQTYPDFEVIVVDNASTDGSVELLRELFPEVRLIR---NGENLGFGAGNNQGIREA---KGDYVLL 80 (166) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEEE---CCCCCCHHHHHHHHHHHH---CCCCEEE T ss_conf 9889999999999837799809999968998315677774289889998---989877678761689871---6572699 Q ss_pred EEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHHHHHHHHHC Q ss_conf 73022133008999999984155377544511366542221123467770589940579851235666999999999625 Q gi|254781114|r 91 LEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLLNVRK 170 (263) Q Consensus 91 lEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkLl~~~~ 170 (263) +.||+.+.+++.+.+-.......+... .+ + .....+.++.++.-+++=-.-. T Consensus 81 lD~D~~~~~~~l~~~~~~~~~~~~~~~--~~-----------------------~---~~~g~~~~~rr~~~~~vGgfde 132 (166) T cd04186 81 LNPDTVVEPGALLELLDAAEQDPDVGI--VG-----------------------P---KVSGAFLLVRREVFEEVGGFDE 132 (166) T ss_pred ECCCEEECCCHHHHHHHHHHHCCCEEE--EE-----------------------E---CCCEEEEEEEHHHHHHHCCCCH T ss_conf 879938883999999999986979899--99-----------------------5---0548999999999998589976 Q ss_pred CCC-CHHHHHH-HHHHHCCCCEEEEECCCEE Q ss_conf 457-8046887-9999759817999603131 Q gi|254781114|r 171 NIY-RPIDMDM-KHWWEHNIPSLVTEPGAVY 199 (263) Q Consensus 171 ~~~-~pvD~~l-~~~~~~~l~~~~~~P~~v~ 199 (263) .+. ..-|..+ .+....+.+++.+--..|. T Consensus 133 ~~~~~~ED~dl~~R~~~~G~~i~~~p~~~v~ 163 (166) T cd04186 133 DFFLYYEDVDLCLRARLAGYRVLYVPQAVIY 163 (166) T ss_pred HHCCCCCHHHHHHHHHHCCCCEEEECCEEEE T ss_conf 8657465999999999829949994998999 No 36 >TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase; InterPro: IPR006362 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobM and CibF precorrin-4 C11-methyltransferase (2.1.1.133 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, CobM catalyses the methylation of precorrin-4 at C-11 to yield precorrin-5. The extruded acyl group is then removed in the subsequent step catalysed by CobF. In the anaerobic pathway, CibF catalyses the methylation of cobalt-precorrin-4 to cobalt-precoriin-5 . Nomenclature note: occasionally precorrin-4 C11-methyltransferase is one of two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.131 from EC). ; GO: 0046026 precorrin-4 C11-methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=27.28 E-value=42 Score=13.92 Aligned_cols=20 Identities=0% Similarity=0.079 Sum_probs=16.0 Q ss_pred HHHHHHHCCCCEEEEECCCC Q ss_conf 99999877997899966373 Q gi|254781114|r 20 FCHRAARIHLQFSFFDAIYG 39 (263) Q Consensus 20 ~~~~~~~~gi~~e~~~Avdg 39 (263) ....|+++||+||.+++|.- T Consensus 91 Q~~~L~~~gI~~e~vPGvSs 110 (252) T TIGR01465 91 QMQLLEALGIPYEVVPGVSS 110 (252) T ss_pred HHHHHHHCCCCEEEECCCHH T ss_conf 99999867897798688738 No 37 >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=26.37 E-value=44 Score=13.82 Aligned_cols=87 Identities=9% Similarity=-0.125 Sum_probs=40.5 Q ss_pred HHHHHHHHCC-CCEEEEECCCCCCCCHHHHH---HCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 9999998779-97899966373528967742---0200002467887887899999999999999982799859997302 Q gi|254781114|r 19 KFCHRAARIH-LQFSFFDAIYGENNPICNRI---FSHQKRQCQFKRLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDD 94 (263) Q Consensus 19 ~~~~~~~~~g-i~~e~~~Avdg~~~~~~~~~---~~~~~~~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~~~lIlEDD 94 (263) ...+..-.+. -++|.+=--||...+..... +......+....+-. .|...+--.+++. ...+++.++.+| T Consensus 17 ~~l~Svl~Qt~~~~EiIiVdDgSs~d~~~~i~~~~~~~~~i~~i~~~~N---~G~~~a~N~gi~~---a~g~yI~~lD~D 90 (201) T cd04195 17 EALESILKQTLPPDEVVLVKDGPVTQSLNEVLEEFKRKLPLKVVPLEKN---RGLGKALNEGLKH---CTYDWVARMDTD 90 (201) T ss_pred HHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCC---CCHHHHHHHHHHH---CCCCEEEECCCC T ss_conf 9999999579998189999899996543999998614799899987887---8989997763642---676699981889 Q ss_pred CCCCCCHHH-HHHHHHHC Q ss_conf 213300899-99999841 Q gi|254781114|r 95 ADFSDEFSQ-LLPHLSKC 111 (263) Q Consensus 95 ~~~~~~f~~-~l~~~~~~ 111 (263) -++.|+..+ .++.+... T Consensus 91 D~~~p~~l~~~~~~l~~~ 108 (201) T cd04195 91 DISLPDRFEKQLDFIEKN 108 (201) T ss_pred CCCCHHHHHHHHHHHHHC T ss_conf 846716999999999878 No 38 >PRK09275 aspartate aminotransferase; Provisional Probab=26.14 E-value=44 Score=13.80 Aligned_cols=23 Identities=4% Similarity=0.120 Sum_probs=14.4 Q ss_pred CCCCCCEEEEECHHHHHHHHHHH Q ss_conf 57985123566699999999962 Q gi|254781114|r 147 ILSPRTTGYFIGKEAAIHLLNVR 169 (263) Q Consensus 147 ~~~~gt~aY~Is~~~AkkLl~~~ 169 (263) -+|..-.++.++.+..+++.+-. T Consensus 252 VNPsNP~S~~ls~e~l~~i~~IV 274 (531) T PRK09275 252 VNPSNPPSVAMSDESLEKIADIV 274 (531) T ss_pred ECCCCCCCCCCCHHHHHHHHHHH T ss_conf 89999866656999999999999 No 39 >pfam01745 IPT Isopentenyl transferase. Isopentenyl transferase / dimethylallyl transferase synthesizes isopentenyladensosine 5'-monophosphate, a cytokinin that induces shoot formation on host plants infected with the Ti plasmid. Probab=25.69 E-value=45 Score=13.75 Aligned_cols=93 Identities=18% Similarity=0.180 Sum_probs=60.8 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECC-------CCCCCCHHHHHHCCHHCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 86899808998899999999998779978999663-------73528967742020000246788788789999999999 Q gi|254781114|r 3 IPVYVISLPFSHARREKFCHRAARIHLQFSFFDAI-------YGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIH 75 (263) Q Consensus 3 ~kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~Av-------dg~~~~~~~~~~~~~~~~~~~~~~l~~geigC~lSH~~ 75 (263) |.+|+|-=+..+-+-+.-..-+.+.|.+.--.|.+ -|..-|..+.. ......--..|+++.|-|..--+|.. T Consensus 1 m~l~~I~GpT~~GKT~~ai~lA~~~g~~iis~D~~Q~y~el~igs~rp~~~El-~gT~RiYL~~R~l~~Gii~a~eA~~~ 79 (232) T pfam01745 1 MGLYLIWGATCTGKTAEAIALAKETGWPVIVLDRVQCCSQLATGSGRPLPAEL-QGTRRIYLDNRPLSEGIIDAEEAHDR 79 (232) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHCCCEEEECHHHHHHCCCCCCCCCCHHHH-CCCCEEEECCCCCCCCCCCHHHHHHH T ss_conf 92689978877771699999999959977962034430011367789997996-57526986167343466488999999 Q ss_pred HHHHHHHCCCCEEEEEEECCC Q ss_conf 999998279985999730221 Q gi|254781114|r 76 LWKRIAYSPAIGAIILEDDAD 96 (263) Q Consensus 76 ~w~~~~~~~~~~~lIlEDD~~ 96 (263) +..++.....+..||||--.+ T Consensus 80 Li~~V~~~~~~~glILEGGSI 100 (232) T pfam01745 80 LIAEVTSHKDEGGVILEGGSI 100 (232) T ss_pred HHHHHHCCCCCCCEEEECCHH T ss_conf 999996046678668727509 No 40 >pfam07459 CTX_RstB CTX phage RstB protein. This family contains a number of RstB proteins approximately 120 residues long, including RstB1 and RstB2, from the Vibrio cholerae phage CTX. Functional analyses indicate that rstB2 is required for integration of the CTXphi phage into the V. cholerae chromosome. Probab=25.63 E-value=34 Score=14.47 Aligned_cols=11 Identities=18% Similarity=0.504 Sum_probs=9.1 Q ss_pred CCEEEEECCCC Q ss_conf 86899808998 Q gi|254781114|r 3 IPVYVISLPFS 13 (263) Q Consensus 3 ~kiyvInL~~~ 13 (263) ||++|||++.+ T Consensus 1 mk~wvinmksr 11 (117) T pfam07459 1 MKIWVINMKSR 11 (117) T ss_pred CEEEEEECCCE T ss_conf 90799952224 No 41 >PRK00648 Maf-like protein; Reviewed Probab=25.30 E-value=46 Score=13.70 Aligned_cols=72 Identities=17% Similarity=0.163 Sum_probs=40.1 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEC-CCCCCCCHHHHHHCCHHCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 988689980899889999999999877997899966-3735289677420200002467887887899999999999999 Q gi|254781114|r 1 MPIPVYVISLPFSHARREKFCHRAARIHLQFSFFDA-IYGENNPICNRIFSHQKRQCQFKRLLSLPEIGCYISHIHLWKR 79 (263) Q Consensus 1 m~~kiyvInL~~~~~Rr~~~~~~~~~~gi~~e~~~A-vdg~~~~~~~~~~~~~~~~~~~~~~l~~geigC~lSH~~~w~~ 79 (263) |+|++..=| +..||..+.+ ++|++|+.+++ +|-... ...+++.++...++..++- . T Consensus 1 m~~~iILAS---~S~~R~~lL~---~~gi~f~v~~~~~dEe~~----------------~~~~~p~~~~~~lA~~Ka~-~ 57 (191) T PRK00648 1 MKYKIVLAS---SSPRRKEILE---GFRIPFEVIPSPFVEESY----------------PYSLDPEEYTLELARLKAE-A 57 (191) T ss_pred CCCCEEEEC---CCHHHHHHHH---HCCCCCEEECCCCCCCCC----------------CCCCCHHHHHHHHHHHHHH-H T ss_conf 997499968---9999999999---779995997899987767----------------7789999999999999999-9 Q ss_pred HHHC-CCCEEEEEEECC Q ss_conf 9827-998599973022 Q gi|254781114|r 80 IAYS-PAIGAIILEDDA 95 (263) Q Consensus 80 ~~~~-~~~~~lIlEDD~ 95 (263) +.+. +.+..+|-=|=+ T Consensus 58 v~~~~~~~~~vI~aDtv 74 (191) T PRK00648 58 VRSDHFPDELIITADTI 74 (191) T ss_pred HHHHHCCCCEEEEECEE T ss_conf 99721899889996849 No 42 >cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily. Probab=25.25 E-value=46 Score=13.70 Aligned_cols=101 Identities=7% Similarity=0.051 Sum_probs=47.4 Q ss_pred CCHHHHHH----HHHHHHHCCCCEEEEECCCCCCCCHHHHHHCCHHCCCCCCCCC-CHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 98899999----9999987799789996637352896774202000024678878-878999999999999999827998 Q gi|254781114|r 12 FSHARREK----FCHRAARIHLQFSFFDAIYGENNPICNRIFSHQKRQCQFKRLL-SLPEIGCYISHIHLWKRIAYSPAI 86 (263) Q Consensus 12 ~~~~Rr~~----~~~~~~~~gi~~e~~~Avdg~~~~~~~~~~~~~~~~~~~~~~l-~~geigC~lSH~~~w~~~~~~~~~ 86 (263) +..++... +...+.+.+.++|++=.=||......+..-+........ +-+ .+.-.|...+....++.+ ..+ T Consensus 7 NEe~~I~~~i~~i~~~~~~~~~~~eIivVDD~StD~T~~i~~~l~~~~~~v-~~i~~~~N~G~g~Ai~~g~~~a---~~d 82 (181) T cd04187 7 NEEENLPELYERLKAVLESLGYDYEIIFVDDGSTDRTLEILRELAARDPRV-KVIRLSRNFGQQAALLAGLDHA---RGD 82 (181) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHCCCCCCCCE-EEEECCCCCCHHHHHHHHHHHC---CCC T ss_conf 788879999999999986559987999998999756245420121233320-4531455764467889899855---697 Q ss_pred EEEEEEECCCCCCC-HHHHHHHHHHCCCCCCEEE Q ss_conf 59997302213300-8999999984155377544 Q gi|254781114|r 87 GAIILEDDADFSDE-FSQLLPHLSKCDINNILIK 119 (263) Q Consensus 87 ~~lIlEDD~~~~~~-f~~~l~~~~~~~~~~d~~~ 119 (263) +++++..|-...++ +.+.++.+ ..+.|++. T Consensus 83 ~i~~~D~D~q~~p~~i~~l~~~~---~~~~D~V~ 113 (181) T cd04187 83 AVITMDADLQDPPELIPEMLAKW---EEGYDVVY 113 (181) T ss_pred EEEECCCCCCCCHHHHHHHHHHH---HCCCEEEE T ss_conf 47751899982999999999999---75991999 No 43 >TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils. Probab=21.66 E-value=55 Score=13.28 Aligned_cols=40 Identities=15% Similarity=0.083 Sum_probs=27.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHH-HHH Q ss_conf 8788789999999999999998279985999730221330089999-999 Q gi|254781114|r 60 RLLSLPEIGCYISHIHLWKRIAYSPAIGAIILEDDADFSDEFSQLL-PHL 108 (263) Q Consensus 60 ~~l~~geigC~lSH~~~w~~~~~~~~~~~lIlEDD~~~~~~f~~~l-~~~ 108 (263) ++-..|-+--+|+|- +.|+++||.=|-+..++|.+.. -.+ T Consensus 213 ~hAKAGNLN~AL~~t---------~GelIaVfDAD~vP~~~FL~~tvg~F 253 (713) T TIGR03030 213 VHAKAGNINNALKHT---------DGELILIFDADHVPTRDFLQRTVGWF 253 (713) T ss_pred CCCCCHHHHHHHHHC---------CCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 887616789998736---------98789996788786689999998987 No 44 >cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm Probab=21.28 E-value=56 Score=13.24 Aligned_cols=165 Identities=11% Similarity=-0.023 Sum_probs=74.1 Q ss_pred CCCHHHHHHHHHHHHHCCC-CEEEEECCCCCCCCHHHHH--HCCHHC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9988999999999987799-7899966373528967742--020000-24678878878999999999999999827998 Q gi|254781114|r 11 PFSHARREKFCHRAARIHL-QFSFFDAIYGENNPICNRI--FSHQKR-QCQFKRLLSLPEIGCYISHIHLWKRIAYSPAI 86 (263) Q Consensus 11 ~~~~~Rr~~~~~~~~~~gi-~~e~~~Avdg~~~~~~~~~--~~~~~~-~~~~~~~l~~geigC~lSH~~~w~~~~~~~~~ 86 (263) -++.+......+.+.++.. ++|.+=.=||......+.. +..... ...+-+.-..| .+.+.+--.+.+. ...+ T Consensus 6 yN~~~~L~~~L~Sl~~Qt~~~~EIIvvDd~StD~t~~~~~~~~~~~~~~i~~~~~~~~g-~~~~~arN~gi~~---a~g~ 81 (182) T cd06420 6 YNRPEALELVLKSVLNQSILPFEVIIADDGSTEETKELIEEFKSQFPIPIKHVWQEDEG-FRKAKIRNKAIAA---AKGD 81 (182) T ss_pred CCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEECCCC-HHHHHHHHHHHHH---CCCC T ss_conf 79889999999999837999958999979996148999999985188741000100567-2689999999985---5887 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEECHHHHHHHH Q ss_conf 59997302213300899999998415537754451136654222112346777058994057985123566699999999 Q gi|254781114|r 87 GAIILEDDADFSDEFSQLLPHLSKCDINNILIKFDALRKKPKKDSYLCTLPGNFDIHQPRILSPRTTGYFIGKEAAIHLL 166 (263) Q Consensus 87 ~~lIlEDD~~~~~~f~~~l~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~gt~aY~Is~~~AkkLl 166 (263) +++++.||+++.+++.+...... ..+. + +.+. ... ..... .... . .++...+.|+...+.- T Consensus 82 ~i~flD~D~~~~~~~i~~~~~~~--~~~~-~--~~g~--~~~-------~~~~~---~~~~-~-~~~~~~~~r~~~~~vG 142 (182) T cd06420 82 YLIFIDGDCIPHPDFIADHIELA--EPGV-F--LSGS--RVL-------LNEKL---TERG-I-RGCNMSFWKKDLLAVN 142 (182) T ss_pred EEEEECCCCCCCCHHHHHHHHHH--CCCE-E--EECC--EEE-------ECCCC---CCCC-C-CEEEEEEEHHHHHHHC T ss_conf 28887898327910999999970--8990-8--9861--687-------17534---5264-0-2448999999999858 Q ss_pred HHHCCCCC--HHHHH-HHHHHHCCCCEEEEECCCE Q ss_conf 96254578--04688-7999975981799960313 Q gi|254781114|r 167 NVRKNIYR--PIDMD-MKHWWEHNIPSLVTEPGAV 198 (263) Q Consensus 167 ~~~~~~~~--pvD~~-l~~~~~~~l~~~~~~P~~v 198 (263) -.-..+.. .=|.. ..+.+..|.++..+.+..+ T Consensus 143 gfde~~~~~g~ED~Dl~~Rl~~~G~k~~~~~~~~~ 177 (182) T cd06420 143 GFDEEFTGWGGEDSELVARLLNSGIKFRKLKFAAI 177 (182) T ss_pred CCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCEE T ss_conf 99700278997799999999982999996168769 No 45 >pfam06258 DUF1022 Protein of unknown function (DUF1022). This family consists of several hypothetical eukaryotic and prokaryotic proteins. The function of this family is unknown. Probab=20.82 E-value=57 Score=13.18 Aligned_cols=16 Identities=13% Similarity=0.090 Sum_probs=8.6 Q ss_pred HHHHHHHHCCCCEEEE Q ss_conf 9999998779978999 Q gi|254781114|r 19 KFCHRAARIHLQFSFF 34 (263) Q Consensus 19 ~~~~~~~~~gi~~e~~ 34 (263) ....-.+.+|++|+.. T Consensus 6 Q~~gLAeaL~~~~~~~ 21 (308) T pfam06258 6 QALGLAEALGLPYEIR 21 (308) T ss_pred HHHHHHHHCCCCCEEE T ss_conf 9999999709987179 No 46 >COG1497 Predicted transcriptional regulator [Transcription] Probab=20.36 E-value=47 Score=13.63 Aligned_cols=33 Identities=15% Similarity=0.262 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHHHHHHHC----CCCEEEEEEECCCC Q ss_conf 8999999999999999827----99859997302213 Q gi|254781114|r 65 PEIGCYISHIHLWKRIAYS----PAIGAIILEDDADF 97 (263) Q Consensus 65 geigC~lSH~~~w~~~~~~----~~~~~lIlEDD~~~ 97 (263) -|+.-.+.|..+|..+... ++..-|-++|-... T Consensus 85 ~ev~~~l~~~~vw~AIA~edI~~Gd~V~L~M~dG~Ly 121 (260) T COG1497 85 EEVELVLDYVMVWTAIAKEDIKEGDTVYLRMKDGYLY 121 (260) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEE T ss_conf 9999887667788886575634698899995383799 Done!