Query         020721
Match_columns 322
No_of_seqs    246 out of 1355
Neff          7.3 
Searched_HMMs 46136
Date          Fri Mar 29 04:37:13 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/020721.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/020721hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PTZ00210 UDP-GlcNAc-dependent  100.0 1.1E-51 2.3E-56  388.1  21.9  253   58-319    58-368 (382)
  2 KOG2287 Galactosyltransferases 100.0 4.6E-47 9.9E-52  363.6  27.3  230   73-317    94-347 (349)
  3 PLN03133 beta-1,3-galactosyltr 100.0   2E-44 4.3E-49  361.8  26.0  226   66-310   376-627 (636)
  4 PLN03193 beta-1,3-galactosyltr 100.0 3.6E-43 7.7E-48  335.1  21.5  192   70-273   135-356 (408)
  5 PF01762 Galactosyl_T:  Galacto 100.0 3.5E-40 7.6E-45  292.0  16.1  167   88-262     1-192 (195)
  6 KOG2288 Galactosyltransferases 100.0 1.8E-30 3.9E-35  231.6  12.8  189   71-271     8-224 (274)
  7 KOG2246 Galactosyltransferases  99.8 1.3E-17 2.8E-22  160.2  17.7  169   66-261    83-265 (364)
  8 PF02434 Fringe:  Fringe-like;   99.8 1.4E-17 3.1E-22  153.2  14.7  197   74-295     6-232 (252)
  9 PLN03153 hypothetical protein;  99.4 7.1E-12 1.5E-16  123.3  19.2  204   73-296   121-344 (537)
 10 KOG3708 Uncharacterized conser  97.8 6.5E-05 1.4E-09   74.0   8.7  204   71-309    23-245 (681)
 11 TIGR03472 HpnI hopanoid biosyn  95.8     1.1 2.4E-05   43.4  18.5  171   74-259    41-240 (373)
 12 TIGR03469 HonB hopene-associat  95.5     1.6 3.5E-05   42.5  18.4  174   73-259    39-249 (384)
 13 PF13641 Glyco_tranf_2_3:  Glyc  95.1    0.18 3.9E-06   44.5   9.6  169   74-259     1-197 (228)
 14 cd02520 Glucosylceramide_synth  95.1     1.3 2.8E-05   38.3  14.9  130  106-259    30-164 (196)
 15 PF13506 Glyco_transf_21:  Glyc  95.0    0.14   3E-06   44.5   8.3  106  145-260    15-142 (175)
 16 cd04192 GT_2_like_e Subfamily   94.5     2.8 6.1E-05   36.4  15.7  141  107-259    29-195 (229)
 17 cd04186 GT_2_like_c Subfamily   93.7     2.6 5.6E-05   34.4  13.1   76  169-259    74-152 (166)
 18 PF04646 DUF604:  Protein of un  92.0    0.41 8.9E-06   44.0   6.2   74  218-295    12-92  (255)
 19 PRK11204 N-glycosyltransferase  90.1      12 0.00026   36.5  15.1  169   72-259    52-247 (420)
 20 cd06420 GT2_Chondriotin_Pol_N   90.1      10 0.00022   31.7  13.3   90  169-260    79-168 (182)
 21 cd06532 Glyco_transf_25 Glycos  90.0     3.8 8.2E-05   33.4   9.7  110   78-236     2-117 (128)
 22 cd06439 CESA_like_1 CESA_like_  89.1      16 0.00034   32.5  17.0  165   73-259    28-216 (251)
 23 cd04185 GT_2_like_b Subfamily   89.0     5.7 0.00012   34.0  10.7   82  169-260    79-162 (202)
 24 cd02510 pp-GalNAc-T pp-GalNAc-  88.9      19 0.00042   33.3  15.2   43  215-258   169-211 (299)
 25 PF01755 Glyco_transf_25:  Glyc  88.1     6.8 0.00015   34.0  10.6   34   78-116     4-37  (200)
 26 cd06434 GT2_HAS Hyaluronan syn  87.9      18 0.00038   31.6  14.3  139  107-259    29-200 (235)
 27 COG1216 Predicted glycosyltran  87.5      23 0.00051   33.0  14.5  169   74-259     3-207 (305)
 28 cd06427 CESA_like_2 CESA_like_  87.4      16 0.00035   32.5  12.9   87  169-259    84-199 (241)
 29 cd04195 GT2_AmsE_like GT2_AmsE  86.5      17 0.00037   30.8  12.2   87  169-259    80-188 (201)
 30 cd04196 GT_2_like_d Subfamily   86.2      19  0.0004   30.7  12.3   87  169-258    79-190 (214)
 31 TIGR01556 rhamnosyltran L-rham  85.6     8.3 0.00018   35.3  10.3  118  131-259    43-188 (281)
 32 cd02525 Succinoglycan_BP_ExoA   85.6      24 0.00052   30.8  15.9   88  169-259    81-195 (249)
 33 cd06421 CESA_CelA_like CESA_Ce  85.5      24 0.00051   30.7  15.1   87  169-259    84-198 (234)
 34 PRK14583 hmsR N-glycosyltransf  85.0      35 0.00076   33.8  15.0  168   73-259    74-268 (444)
 35 PF00535 Glycos_transf_2:  Glyc  83.1     3.8 8.2E-05   33.1   6.2  114  106-232    27-168 (169)
 36 cd04179 DPM_DPG-synthase_like   80.7      32 0.00068   28.6  11.6  116  106-233    28-167 (185)
 37 COG1215 Glycosyltransferases,   80.4      58  0.0013   31.5  17.4  170   73-259    53-252 (439)
 38 PF13632 Glyco_trans_2_3:  Glyc  79.4     5.3 0.00011   34.2   6.0   85  172-259     1-111 (193)
 39 PF05679 CHGN:  Chondroitin N-a  79.0     3.4 7.3E-05   42.0   5.3   67  240-308     8-81  (499)
 40 cd04187 DPM1_like_bac Bacteria  76.4      44 0.00095   27.9  12.9  119  106-236    29-166 (181)
 41 cd04191 Glucan_BSP_ModH Glucan  76.3      62  0.0014   29.6  14.8  173   77-259     2-218 (254)
 42 cd02526 GT2_RfbF_like RfbF is   75.6      54  0.0012   28.5  13.9   89  169-259    75-191 (237)
 43 TIGR03111 glyc2_xrt_Gpos1 puta  75.5      89  0.0019   30.9  16.8  166   74-256    49-250 (439)
 44 cd06423 CESA_like CESA_like is  72.7      46   0.001   26.4  11.3   66  169-234    78-171 (180)
 45 PRK14716 bacteriophage N4 adso  66.0 1.6E+02  0.0035   30.0  18.6  185   72-260    64-277 (504)
 46 cd06438 EpsO_like EpsO protein  65.0      83  0.0018   26.4  10.1   63  169-231    81-169 (183)
 47 cd06442 DPM1_like DPM1_like re  63.9      94   0.002   26.6  12.5   65  169-233    78-167 (224)
 48 PRK11234 nfrB bacteriophage N4  61.7 2.3E+02   0.005   30.4  18.1  184   70-260    59-274 (727)
 49 cd04184 GT2_RfbC_Mx_like Myxoc  61.3      99  0.0021   26.0  14.5   88  169-259    83-188 (202)
 50 TIGR03030 CelA cellulose synth  58.2 2.6E+02  0.0056   29.8  16.0   87  169-259   228-345 (713)
 51 PF05412 Peptidase_C33:  Equine  54.2      10 0.00022   30.1   2.2   18  164-181    63-80  (108)
 52 PRK10714 undecaprenyl phosphat  53.5   2E+02  0.0044   27.2  14.2  146   75-236     7-176 (325)
 53 PRK05454 glucosyltransferase M  53.1 1.6E+02  0.0035   31.4  11.5  181   70-259   120-343 (691)
 54 cd04190 Chitin_synth_C C-termi  50.8      35 0.00076   30.6   5.5   91  169-259    73-207 (244)
 55 cd06435 CESA_NdvC_like NdvC_li  49.1 1.8E+02  0.0039   25.2  16.2   87  169-259    84-196 (236)
 56 PLN02726 dolichyl-phosphate be  48.4 1.9E+02  0.0042   25.5  16.3  168   73-259     8-205 (243)
 57 PTZ00260 dolichyl-phosphate be  46.0 2.7E+02  0.0059   26.5  14.7  174   70-259    66-279 (333)
 58 cd06433 GT_2_WfgS_like WfgS an  42.7 1.9E+02  0.0041   23.8  14.0   88  169-259    75-181 (202)
 59 cd02522 GT_2_like_a GT_2_like_  42.2 2.2E+02  0.0047   24.2  13.6   87  169-259    72-174 (221)
 60 PHA01631 hypothetical protein   38.7 1.1E+02  0.0025   26.4   6.3   59  169-235    71-133 (176)
 61 PF09680 Tiny_TM_bacill:  Prote  35.4      39 0.00084   19.5   2.0   17   11-27      4-20  (24)
 62 TIGR01732 tiny_TM_bacill conse  34.2      44 0.00096   19.7   2.1   18   11-28      6-23  (26)
 63 KOG3588 Chondroitin synthase 1  33.8      12 0.00027   36.4  -0.3   67  224-303     1-73  (494)
 64 cd04188 DPG_synthase DPG_synth  32.3 3.1E+02  0.0066   23.4   8.4  139  106-259    30-195 (211)
 65 PF11395 DUF2873:  Protein of u  29.8      44 0.00096   21.4   1.8   19   24-44     23-41  (43)
 66 PF13704 Glyco_tranf_2_4:  Glyc  29.4 2.4E+02  0.0052   20.9   6.8   47  133-186    41-88  (97)
 67 cd00761 Glyco_tranf_GTA_type G  29.2 2.6E+02  0.0056   21.2  12.6   73  169-259    77-152 (156)
 68 PRK11498 bcsA cellulose syntha  25.4 9.3E+02    0.02   26.5  15.0   87  169-259   339-456 (852)
 69 KOG2547 Ceramide glucosyltrans  23.7 2.5E+02  0.0054   27.7   6.5   26  169-194   170-195 (431)
 70 cd04459 Rho_CSD Rho_CSD: Rho p  23.2 1.5E+02  0.0033   21.6   3.9   39  174-214    23-61  (68)
 71 PF08525 OapA_N:  Opacity-assoc  22.0      88  0.0019   18.9   2.1   17    8-24      7-23  (30)
 72 COG4425 Predicted membrane pro  20.6 7.5E+02   0.016   25.2   9.2   78   66-146   281-361 (588)

No 1  
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=100.00  E-value=1.1e-51  Score=388.15  Aligned_cols=253  Identities=21%  Similarity=0.270  Sum_probs=220.5

Q ss_pred             cCCccce------eecCCCCCCeeEEEEEECCCCC--HHHHHHHHHHhccCC--------CCCCceEEEEEecCCCch--
Q 020721           58 FSNASHV------VATNSSSEDIRILIGILTLPDQ--YHRRHFLRMIYGTQS--------PTGAQVDVKFVFCNLTKE--  119 (322)
Q Consensus        58 ~~~~~~l------~~~~~~~~~~~lli~V~S~~~~--~~rR~aIR~TW~~~~--------~~~~~v~~~Fvlg~~~~~--  119 (322)
                      +.+|++|      ++.+|+.++..+++||+|..++  +.||+++|+||.+-.        ++|+.+ ++|++|.+++.  
T Consensus        58 ~~~~~~l~~~~~~~~~~w~~~~~lv~~Gi~S~d~~~r~~rR~lqr~t~w~y~~va~~~n~ftg~~l-v~y~l~~H~~~~~  136 (382)
T PTZ00210         58 IDEDEYLMFVPSNVAAVWKAQRFLAVLGIPSVDNSERSRRRDLQRQTCWKYSGVATRSNNFSGSLL-PLYLLAPHQSNSY  136 (382)
T ss_pred             cchhhhhhccCHHHHHHhccCCceEEEeccCCCchHHHHHHHHHHhhhhcchhhhhhccCCchhhh-hhhhhccCCccch
Confidence            5566665      6899999999999999999999  799999999996432        566665 99999999988  


Q ss_pred             hHHHHHHHHHhhCCCeEEeccCCcC------------------Cc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCc
Q 020721          120 DQKVLVALEIMRYDDIIILNCKENM------------------NK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDD  179 (322)
Q Consensus       120 ~~~~~l~~E~~~~~DIv~ld~~D~y------------------~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD  179 (322)
                      +.+.+|.+|+++|||||++||.|.|                  .+  +||+++|+|+.+.       ||+++||+|+|||
T Consensus       137 ~~~~~L~eEA~~~~DIVilpf~d~~~tTnKkiG~~g~WG~e~e~~mT~KT~l~~~wA~~~-------cP~a~YImKgDDD  209 (382)
T PTZ00210        137 LISHSLKEEAARTHDIITLPTNDVSPSTRKKIGENGNWGIEAEVAMSRKTYLWLRFALHM-------FPNVSYIVKGDDD  209 (382)
T ss_pred             hhhHHHHHHHHHhCCEEEEecccCccccccccccCCcccchhhcchhHHHHHHHHHHHHh-------CCCCCeEEEcCCC
Confidence            7788999999999999999996666                  23  9999999999886       8899999999999


Q ss_pred             eEeeHHHHHHHhcCCCCCCeEEEEeecCC-C-CCCCCcccccceeeeCHHHHHHHHHcCCC-CC---------------C
Q 020721          180 TYIRLENLVKSLVPLPREDLYYGYVIPCR-S-MDPFVDYMSGMGYLVSWDIAEWIRDSDIP-KN---------------H  241 (322)
Q Consensus       180 ~fv~~~~L~~~L~~~~~~~~y~G~~~p~r-~-~~~yp~y~~G~gYvlS~d~~~~i~~~~~~-~~---------------~  241 (322)
                      +|||+++++++|+..|++++|+|.+.... + .+.+|+||+|+||+||+|+|+.|++..++ +.               .
T Consensus       210 vFVrVp~lL~~Lr~~prr~LY~G~v~~~~~p~Rd~~PpY~~G~gYvLSrDVA~~Lvs~~pl~rL~~~pys~~~~~~y~~~  289 (382)
T PTZ00210        210 IFIRVPKYLADLRVMPRHGLYMGRYNYYNRIWRRNQLTYVNGYCITLSRDTAQAIISYKPLERLVNMPFSMWDYFDFLDL  289 (382)
T ss_pred             eEeeHHHHHHHHhhCCCCceEEEeeCCCCccccCCCCCccccceeeccHHHHHHHHhhChHhHhhcCCCchHHHHHHHHh
Confidence            99999999999999999999999986431 1 24579999999999999999999997543 22               4


Q ss_pred             CCCchhHHHHHHHHhcCCCCccccCccceeecCCC-CCcccccccCCEEEEEccCCHHHHHHHHHHccccCC-CCCCCCC
Q 020721          242 LEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPVP-RTACTHELWADTIAVHLLKNQEKWIETLTYFNVTAN-LKPSKLY  319 (322)
Q Consensus       242 ~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~~-~~~c~~~~~~~~ia~H~~~~~~~~~~~~~~fn~~~~-~~p~~~~  319 (322)
                      .+..||+++|.+|+..++.++.++.+..+|+|+.. ...|.+++.+++|++|+++++| |+.++++|+.+.+ .||+|.+
T Consensus       290 ~~~~EDiMvG~vLr~~~k~~~l~~V~~~~c~Fhd~~~~~~~~~v~~~sVvvHhike~d-Ya~Lm~~F~n~~~~~~P~~~~  368 (382)
T PTZ00210        290 GMFYEDVMVGMILREKVVYRNLISVEMGRCHFHNAGKFGVRKSVRNMSVVIHHIQEAD-YEMLMDYFPEGVIPPAPSKVM  368 (382)
T ss_pred             hcCchHHHHHHHHHHhcCcCceeeeccccccceecCCCCCccccccceEEEEecCHHH-HHHHHHHhcCCCCCCCCcceE
Confidence            57899999999999988888888888999999875 5778888999999999999997 9999999999988 6898865


No 2  
>KOG2287 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=100.00  E-value=4.6e-47  Score=363.60  Aligned_cols=230  Identities=28%  Similarity=0.419  Sum_probs=197.1

Q ss_pred             CeeEEEEEECCCCCHHHHHHHHHHhccCC-CCCCceEEEEEecCCCch-hHHHHHHHHHhhCCCeEEeccCCcCCc--hH
Q 020721           73 DIRILIGILTLPDQYHRRHFLRMIYGTQS-PTGAQVDVKFVFCNLTKE-DQKVLVALEIMRYDDIIILNCKENMNK--GK  148 (322)
Q Consensus        73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~~~-~~~~~l~~E~~~~~DIv~ld~~D~y~n--~K  148 (322)
                      .++++++|+|++++++||++||+|||++. ..+.+++++|++|.++++ ..+.+|.+|++.||||||.||.|+|.|  .|
T Consensus        94 ~~~lLl~V~S~~~~farR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~~df~Dty~nltlK  173 (349)
T KOG2287|consen   94 PPELLLLVKSAPDNFARRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQVDFEDTYFNLTLK  173 (349)
T ss_pred             CceEEEEEecCCCCHHHHHHHHHHhcCccccCCCcEEEEEEecCCCcHHHHHHHHHHHHHHhCCEEEEecccchhchHHH
Confidence            48999999999999999999999999986 567899999999988865 457799999999999999999999998  99


Q ss_pred             HHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEee----cCC-------------C
Q 020721          149 TYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVI----PCR-------------S  209 (322)
Q Consensus       149 t~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~----p~r-------------~  209 (322)
                      |++++.|...       +||+++|++|+|||+||++++|+++|...  +.+.+|+|++.    |.|             +
T Consensus       174 tl~~l~w~~~-------~cp~akfi~K~DDDvfv~~~~L~~~L~~~~~~~~~~~~G~v~~~~~p~R~~~~KwyVp~~~y~  246 (349)
T KOG2287|consen  174 TLAILLWGVS-------KCPDAKFILKIDDDVFVNPDNLLEYLDKLNDPSSDLYYGRVIQNAPPIRDKTSKWYVPESEYP  246 (349)
T ss_pred             HHHHHHHHHh-------cCCcceEEEeccCceEEcHHHHHHHHhccCCCCcceEEEeecccCCCCCCCCCCCccCHHHCC
Confidence            9999999986       49999999999999999999999999887  67899999875    222             3


Q ss_pred             CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhc-CCCCccccCccceeecCCCCCcccccccCCE
Q 020721          210 MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREG-RRAKNRYNAKWSMYNFPVPRTACTHELWADT  288 (322)
Q Consensus       210 ~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~-~~~~~~~~~~~~f~~~~~~~~~c~~~~~~~~  288 (322)
                      ...||+||+|+|||+|+|+|++|++.+ .+.+.+.+|||++|+|+++. |+.+..   +..++.++...+.|.   +++.
T Consensus       247 ~~~YP~Y~sG~gYvis~~~a~~l~~~s-~~~~~~~iEDV~~g~~l~~~~gi~~~~---~~~~~~~~~~~~~~~---~~~~  319 (349)
T KOG2287|consen  247 CSVYPPYASGPGYVISGDAARRLLKAS-KHLKFFPIEDVFVGGCLAEDLGIKPVN---HPGFFEIPLSFDPCC---YRDL  319 (349)
T ss_pred             CCCCCCcCCCceeEecHHHHHHHHHHh-cCCCccchHHHHHHHHHHHhcCCCccc---CcccccccccCCCCc---ccce
Confidence            557999999999999999999999976 57899999999999999997 776542   233555554445554   4889


Q ss_pred             EEEEccCCHHHHHHHHHHccccCCCCCCC
Q 020721          289 IAVHLLKNQEKWIETLTYFNVTANLKPSK  317 (322)
Q Consensus       289 ia~H~~~~~~~~~~~~~~fn~~~~~~p~~  317 (322)
                      +++|..++++ +..+|+....+.+++-++
T Consensus       320 ~~~H~~~p~e-~~~~w~~~~~~~~~~c~~  347 (349)
T KOG2287|consen  320 LAVHRLSPNE-MIYLWKKLKDLANLKCKN  347 (349)
T ss_pred             EEEecCCHHH-HHHHHHHhhccccccccc
Confidence            9999999777 777888888766666443


No 3  
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=100.00  E-value=2e-44  Score=361.79  Aligned_cols=226  Identities=22%  Similarity=0.347  Sum_probs=183.8

Q ss_pred             ecCCC-CCCeeEEEEEECCCCCHHHHHHHHHHhccCC-CCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCc
Q 020721           66 ATNSS-SEDIRILIGILTLPDQYHRRHFLRMIYGTQS-PTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKEN  143 (322)
Q Consensus        66 ~~~~~-~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~  143 (322)
                      ++..+ ..+++|+|+|+|+|+|++||+|||+|||+.. ..+..++++|++|.+.++..+..|.+|+++||||||+||.|+
T Consensus       376 AppL~~~~~~~LlI~V~Sap~nf~rR~AIR~TWg~~~~~~~~~v~~rFvVG~s~n~~l~~~L~~Ea~~ygDIIq~dF~Ds  455 (636)
T PLN03133        376 SPPLSPKKPLDLFIGVFSTANNFKRRMAVRRTWMQYDAVRSGAVAVRFFVGLHKNQMVNEELWNEARTYGDIQLMPFVDY  455 (636)
T ss_pred             CCCCCCCCceEEEEEEeCCcccHHHHHHHHHhhccccccCCCceEEEEEEecCCcHHHHHHHHHHHHHcCCeEEEeeech
Confidence            33444 4568999999999999999999999999864 344568899999999888888899999999999999999999


Q ss_pred             CCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEee----cCC--------
Q 020721          144 MNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVI----PCR--------  208 (322)
Q Consensus       144 y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~----p~r--------  208 (322)
                      |+|  .||++++.|+..        |++++|+||+|||+|||+++|+++|+.. +.+++|+|++.    |.|        
T Consensus       456 Y~NLTlKtl~~~~wa~~--------c~~akFilK~DDDvFVnv~~Ll~~L~~~~~~~~Ly~G~v~~~~~PiRd~~sKWYV  527 (636)
T PLN03133        456 YSLITWKTLAICIFGTE--------VVSAKYVMKTDDDAFVRVDEVLASLKRTNVSHGLLYGLINSDSQPHRNPDSKWYI  527 (636)
T ss_pred             hhhhHHHHHHHHHHHHh--------CCCceEEEEcCCceEEcHHHHHHHHHhcCCCCceEEEEeccCCCcccCCCCCCCC
Confidence            998  999999998753        8899999999999999999999999764 45679999985    333        


Q ss_pred             -----CCCCCCcccccceeeeCHHHHHHHHHcC-CCCCCCCCchhHHHHHHHHhcCC---CCccccCccceeecCCCCCc
Q 020721          209 -----SMDPFVDYMSGMGYLVSWDIAEWIRDSD-IPKNHLEGPEDKVFGAWIREGRR---AKNRYNAKWSMYNFPVPRTA  279 (322)
Q Consensus       209 -----~~~~yp~y~~G~gYvlS~d~~~~i~~~~-~~~~~~~~~EDv~vG~~l~~~~~---~~~~~~~~~~f~~~~~~~~~  279 (322)
                           +.+.|||||+|+|||||+|+|+.|++.. ..+.+.+++|||++|.|++++++   ... +..+.+|+     .+.
T Consensus       528 s~~eyp~~~YPpYasG~gYVlS~Dla~~L~~~s~s~~l~~f~lEDVyvGi~l~~l~k~gl~v~-~~~~~r~~-----~~~  601 (636)
T PLN03133        528 SPEEWPEETYPPWAHGPGYVVSRDIAKEVYKRHKEGRLKMFKLEDVAMGIWIAEMKKEGLEVK-YENDGRIY-----NEG  601 (636)
T ss_pred             CHHHCCCCCCCCCCCcCEEEEcHHHHHHHHHhhhhcccCcCChhhHhHHHHHHHhcccCCCce-eeCCCccc-----CCc
Confidence                 3567999999999999999999999854 23678999999999999986543   221 11112222     356


Q ss_pred             ccccccCCEEEEEccCCHHHHHHHHHHcccc
Q 020721          280 CTHELWADTIAVHLLKNQEKWIETLTYFNVT  310 (322)
Q Consensus       280 c~~~~~~~~ia~H~~~~~~~~~~~~~~fn~~  310 (322)
                      |.    .+.|++|+.+|.+ +..+|+....+
T Consensus       602 C~----~~~i~~H~~sP~e-M~~lW~~l~~~  627 (636)
T PLN03133        602 CK----DGYVVAHYQSPRE-MLCLWQKLQEG  627 (636)
T ss_pred             CC----CCeEEEecCCHHH-HHHHHHHHhcc
Confidence            64    4579999999998 55677776544


No 4  
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=100.00  E-value=3.6e-43  Score=335.12  Aligned_cols=192  Identities=20%  Similarity=0.256  Sum_probs=162.3

Q ss_pred             CCCCeeEEEEEECCCCCHHHHHHHHHHhccCC------CCCCceEEEEEecCCC--chhHHHHHHHHHhhCCCeEEeccC
Q 020721           70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQS------PTGAQVDVKFVFCNLT--KEDQKVLVALEIMRYDDIIILNCK  141 (322)
Q Consensus        70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~------~~~~~v~~~Fvlg~~~--~~~~~~~l~~E~~~~~DIv~ld~~  141 (322)
                      .+.+++++|+|.|++++++||++||+|||++.      .....++++||+|++.  +...+.+|++|+++|||||++||.
T Consensus       135 ~~~~~~LvIgI~Sap~~~~RR~AIR~TWg~~~~~~~kle~~~gv~vrFVIG~s~~~~~~ldr~Le~Ea~~ygDIL~lDfv  214 (408)
T PLN03193        135 GKRRYLMVVGINTAFSSRKRRDSVRATWMPQGEKRKKLEEEKGIIIRFVIGHSATSGGILDRAIEAEDRKHGDFLRLDHV  214 (408)
T ss_pred             CcceEEEEEEEeCCCCCHHHHHHHHHHHcCCcccccccccCCcEEEEEEeecCCCcchHHHHHHHHHHHHhCCEEEEecc
Confidence            45568999999999999999999999999763      1235689999999986  456777899999999999999999


Q ss_pred             CcCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEee--cCC--------
Q 020721          142 ENMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVI--PCR--------  208 (322)
Q Consensus       142 D~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~--p~r--------  208 (322)
                      |+|.|  .||+++|+|+.++        .+++|+||+|||+|||+++|+.+|+... .+++|+|++.  |.+        
T Consensus       215 DsY~NLT~KTl~~f~wA~~~--------~dAkF~mK~DDDvfVnv~~L~~~L~~~~~~~rlYiG~m~~gPvr~~~~~ky~  286 (408)
T PLN03193        215 EGYLELSAKTKTYFATAVAM--------WDADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYH  286 (408)
T ss_pred             cccccchHHHHHHHHHHHHc--------CCCeEEEEcCCCceEcHHHHHHHHHhcCCCCCEEEEecccCccccCCCCcCc
Confidence            99987  9999999999874        3889999999999999999999997653 4469999985  332        


Q ss_pred             ---------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCccceeec
Q 020721          209 ---------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNF  273 (322)
Q Consensus       209 ---------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~  273 (322)
                               +.+.|||||+|+|||||+|+|+.|+.+.. ..+.+.+|||++|.||..+++.  +++ +.+||+.
T Consensus       287 epe~w~~~~~~~~YPpyAsG~gYVlS~DLa~~I~~n~~-~L~~y~~EDV~vG~Wl~~L~V~--~vd-d~~fcc~  356 (408)
T PLN03193        287 EPEYWKFGENGNKYFRHATGQLYAISKDLASYISINQH-VLHKYANEDVSLGSWFIGLDVE--HID-DRRLCCG  356 (408)
T ss_pred             CcccccccCccccCCCCCCcceEEehHHHHHHHHhChh-hhcccCcchhhhhhHhccCCce--eee-cccccCC
Confidence                     12679999999999999999999997663 5678899999999999777664  333 4678775


No 5  
>PF01762 Galactosyl_T:  Galactosyltransferase;  InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=100.00  E-value=3.5e-40  Score=291.96  Aligned_cols=167  Identities=27%  Similarity=0.359  Sum_probs=146.8

Q ss_pred             HHHHHHHHHhccCC-CCCCceEEEEEecCCC--chhHHHHHHHHHhhCCCeEEeccCCcCCc--hHHHHHHHhhhhHhcC
Q 020721           88 HRRHFLRMIYGTQS-PTGAQVDVKFVFCNLT--KEDQKVLVALEIMRYDDIIILNCKENMNK--GKTYTYFSSLPEIFNT  162 (322)
Q Consensus        88 ~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~--~~~~~~~l~~E~~~~~DIv~ld~~D~y~n--~Kt~~~~~w~~~~~~~  162 (322)
                      +||++||+|||+.. ..+.+++++||+|.++  ++..+..|.+|+++|+||||+|+.|+|.|  .||+++|+|+.++   
T Consensus         1 ~rR~~IR~TW~~~~~~~~~~~~~~FvvG~~~~~~~~~~~~l~~E~~~y~Dil~~d~~D~y~nlt~K~~~~~~w~~~~---   77 (195)
T PF01762_consen    1 ERRQAIRETWGNQRNFKGVRVKVVFVVGESPNSDSDLQEALQEEAEKYGDILQGDFVDSYRNLTLKTLAGLKWASKH---   77 (195)
T ss_pred             ChHHHHHHHHhcccccCCCcEEEEEEEecCCCCcHHHHHHhhhhhhhcCceEeeecccccchhhHHHHHHHHHHHhh---
Confidence            59999999999886 5668999999999988  56677789999999999999999999988  9999999999874   


Q ss_pred             CCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeec----CC-------------CCCCCCccccccee
Q 020721          163 SDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIP----CR-------------SMDPFVDYMSGMGY  222 (322)
Q Consensus       163 ~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p----~r-------------~~~~yp~y~~G~gY  222 (322)
                          ||+++|++|+|||+|||+++|.+.|...   +.++.++|....    .+             +.+.||+||+|+||
T Consensus        78 ----c~~~~~v~k~DDD~~vn~~~l~~~L~~~~~~~~~~~~~g~~~~~~~~~r~~~~kw~v~~~~y~~~~yP~y~~G~~y  153 (195)
T PF01762_consen   78 ----CPNAKYVLKVDDDVFVNPDRLVSFLKSLKQDPSKNSIYGGCIKNGPPIRDPSSKWYVSEEEYPDDYYPPYCSGGGY  153 (195)
T ss_pred             ----CCchhheeecCcEEEEehHHhhhhhhhcccCccccccccccccCCccccccccCceeeeeecccccCCCcCCCCeE
Confidence                8899999999999999999999999887   445677776541    11             35579999999999


Q ss_pred             eeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCc
Q 020721          223 LVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKN  262 (322)
Q Consensus       223 vlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~  262 (322)
                      +||+++|+.|++.+ ...+.+++|||++|.|+.++|+.+.
T Consensus       154 vls~~~v~~i~~~~-~~~~~~~~eDv~iGi~~~~~~i~~~  192 (195)
T PF01762_consen  154 VLSSDVVKRIYKAS-SHTPFFPLEDVFIGILAEKLGIKPI  192 (195)
T ss_pred             EecHHHHHHHHHHh-hcCCCCCchHHHHHHHHHHCCCCcc
Confidence            99999999999987 4679999999999999999998653


No 6  
>KOG2288 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=99.97  E-value=1.8e-30  Score=231.55  Aligned_cols=189  Identities=23%  Similarity=0.318  Sum_probs=157.9

Q ss_pred             CCCeeEEEEEECCCCCHHHHHHHHHHhccCC------CCCCceEEEEEecCC-CchhHHHHHHHHHhhCCCeEEec-cCC
Q 020721           71 SEDIRILIGILTLPDQYHRRHFLRMIYGTQS------PTGAQVDVKFVFCNL-TKEDQKVLVALEIMRYDDIIILN-CKE  142 (322)
Q Consensus        71 ~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~------~~~~~v~~~Fvlg~~-~~~~~~~~l~~E~~~~~DIv~ld-~~D  142 (322)
                      .++++++|+|.|+++..+||+.+|+||..+.      .....|.++|++|.. .+.....+|++|.++|+|.+.+| .+|
T Consensus         8 ~~k~l~vigI~T~f~s~~RR~~vR~TWmp~~~~l~rle~e~gv~~RFvIG~~~~g~~~~r~ie~E~~~~~DfllLd~h~E   87 (274)
T KOG2288|consen    8 RRKVLLVIGINTAFSSRKRRDSVRQTWMPSGEGLKRLEEEKGVIIRFVIGTATLGASLDRALEEENAQHGDFLLLDRHEE   87 (274)
T ss_pred             ccceEEEEEeecccchhhhHHHHHHhhcCCccchhhhccccceEEEEEeccCCccHHHHHHHHHHHHhcCCeEeechhHH
Confidence            6789999999999999999999999998762      223467799999984 45566789999999999999999 999


Q ss_pred             cCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEeecC----C-------
Q 020721          143 NMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVIPC----R-------  208 (322)
Q Consensus       143 ~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~p~----r-------  208 (322)
                      .|.+  .||+.+|.++..++        +++|++|+|||+|||++.|...|.... +..+|||++...    +       
T Consensus        88 ~Y~~Ls~Kt~~~f~~A~~~~--------daeFyvKvDDDv~v~l~~L~~~la~~r~~pr~YiGcmksg~v~~~~~~kw~E  159 (274)
T KOG2288|consen   88 AYEELSAKTKAFFSAAVAHW--------DAEFYVKVDDDVYVRLARLGTLLARERSHPRLYIGCMKSGPVLTQPGGKWYE  159 (274)
T ss_pred             HHHHHHHHHHHHHHHHHHhc--------cceEEEEccccceecHHHHHHHHHhhccCCceEEEEecCCccccCCCCcccC
Confidence            9987  99999999999986        789999999999999999999997764 468999998421    0       


Q ss_pred             C----CC--CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCcccee
Q 020721          209 S----MD--PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMY  271 (322)
Q Consensus       209 ~----~~--~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~  271 (322)
                      |    .+  .|-+|+.|++|+||+|++..|+-+.. ....+..|||.+|-|+..+.+  ++++. +++|
T Consensus       160 peWkfg~~g~YfrhA~G~~YvlS~dLa~yi~in~~-lL~~y~nEDVSlGaW~~gldV--~h~dd-~rlC  224 (274)
T KOG2288|consen  160 PEWKFGDNGNYFRHATGGGYVLSKDLATYISINRQ-LLHKYANEDVSLGAWMIGLDV--EHVDD-PRLC  224 (274)
T ss_pred             hhhhcCcccccchhccCceEEeeHHHHHHHHHhHH-HHHhhccCCcccceeeeeeee--eEecC-Cccc
Confidence            1    11  28899999999999999999998764 367899999999999976664  34443 3444


No 7  
>KOG2246 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=99.77  E-value=1.3e-17  Score=160.18  Aligned_cols=169  Identities=22%  Similarity=0.287  Sum_probs=129.0

Q ss_pred             ecCCCCCCeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEe---cCCCchhHHHHHHHHHhhCCCeEEeccC
Q 020721           66 ATNSSSEDIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVF---CNLTKEDQKVLVALEIMRYDDIIILNCK  141 (322)
Q Consensus        66 ~~~~~~~~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvl---g~~~~~~~~~~l~~E~~~~~DIv~ld~~  141 (322)
                      .+.|...+.+|+|.|.|.+.+. .|-..+-+||+++|..     ..|+-   ++..            ..+. .|..+..
T Consensus        83 ~~~~l~r~~~v~cwv~t~~~~~~~~~~~v~~TW~~rc~~-----~~f~s~~~s~~~------------~~f~-~v~~~~~  144 (364)
T KOG2246|consen   83 IALWLSRSGRVLCWVLTSPMRHVTRADAVKETWLKRCDK-----GIFFSPTLSKDD------------SRFP-TVYYNLP  144 (364)
T ss_pred             chhccCCCceEEEEEEecCcCceeehhhhhcccccccCc-----ceecCccCCCCC------------CcCc-eeeccCC
Confidence            5667889999999999998877 5666999999998732     35554   3222            1122 3578899


Q ss_pred             CcCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCCCCCCccc-
Q 020721          142 ENMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSMDPFVDYM-  217 (322)
Q Consensus       142 D~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~~~yp~y~-  217 (322)
                      |+|.+  +||...++++.++..      .+++|++|+|||||+.++||...|... |.+.+|+|+........   .|+ 
T Consensus       145 ~g~~~~~~ktr~~~~yv~~~~~------~~~dWf~~aDDDTy~i~eNLr~~L~~yDp~~p~YiG~~~~~~~~~---~y~~  215 (364)
T KOG2246|consen  145 DGYRSLWRKTRIAFKYVYDHIL------KDYDWFLKADDDTYFIMENLRYVLSKYDPEKPVYLGYRSKSYFQN---GYSS  215 (364)
T ss_pred             cchHHHHHHHHHHHHHHHHhcc------CCCCeEEeccCCeEEeHHHHHHHHhhcCCCCcEEecccccccccc---cccc
Confidence            99988  999999999986543      489999999999999999999999997 56799999975311112   276 


Q ss_pred             ccceeeeCHHHHHHHHHcCC-----CCCCCC-CchhHHHHHHHHhcCCCC
Q 020721          218 SGMGYLVSWDIAEWIRDSDI-----PKNHLE-GPEDKVFGAWIREGRRAK  261 (322)
Q Consensus       218 ~G~gYvlS~d~~~~i~~~~~-----~~~~~~-~~EDv~vG~~l~~~~~~~  261 (322)
                      +|+||++|+++.+.+++...     +..... +.||+.+|.||+++|+..
T Consensus       216 g~ag~~ls~aa~~~la~~l~~~~~~C~~~~~~~~eD~~i~~Cl~~~GV~~  265 (364)
T KOG2246|consen  216 GGAGYVLSFAALRRLAERLLNNEDKCPQRYPSYGEDRRIGRCLAEVGVPA  265 (364)
T ss_pred             CCCCcceeHHHHHHHHHHHhcchhhcccccCCchhHHHHHHHHHHhCCCc
Confidence            56899999999999887421     111111 599999999999999863


No 8  
>PF02434 Fringe:  Fringe-like;  InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates.  Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng.  This entry consists of Fringe proteins and related glycosyltransferase enzymes including:   Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains [].  Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development [].  ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=99.75  E-value=1.4e-17  Score=153.20  Aligned_cols=197  Identities=17%  Similarity=0.200  Sum_probs=104.1

Q ss_pred             eeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCc--hHHH
Q 020721           74 IRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNK--GKTY  150 (322)
Q Consensus        74 ~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n--~Kt~  150 (322)
                      -.|+|+|+|++++. .|-.+|++||++++.   .  ..|+.....++.+.    .+  .-.+++.-++..++..  .+..
T Consensus         6 ~dI~i~V~T~~k~h~tR~~~I~~TW~~~~~---~--~~~ifsd~~d~~l~----~~--~~~~l~~~~~~~~~~~~~~~~~   74 (252)
T PF02434_consen    6 DDIFIAVKTTKKFHKTRAPAIKQTWAKRCN---K--QTFIFSDAEDPSLP----TV--TGVHLVNPNCDAGHCRKTLSCK   74 (252)
T ss_dssp             GGEEEEEE--GGGTTTTHHHHHHTGGGGSG---G--GEEEEESS--HHHH----HH--HGGGEEE-------------HH
T ss_pred             ccEEEEEEeCHHHHHHHHHHHHHHHHhhcC---C--ceEEecCccccccc----cc--cccccccCCCcchhhHHHHHHH
Confidence            36899999999876 777899999999872   2  35654544443332    22  2234555555555442  2333


Q ss_pred             HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCC------------CCCCccc
Q 020721          151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSM------------DPFVDYM  217 (322)
Q Consensus       151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~------------~~yp~y~  217 (322)
                      +.+.+-..+.       ++++|++++|||+||++++|++.|... +.+++|+|.....++.            ..--.|+
T Consensus        75 ~~~~y~~~~~-------~~~~Wf~~~DDDtyv~~~~L~~~L~~~~~~~~~yiG~~~~~~~~~~~~~~~~~~~~~~~~~f~  147 (252)
T PF02434_consen   75 MAYEYDHFLN-------SDKDWFCFADDDTYVNVENLRRLLSKYDPSEPIYIGRPSGDRPIEIIHRFNPNKSKDSGFWFA  147 (252)
T ss_dssp             HHHHHHHHHH-------HT-SEEEEEETTEEE-HHHHHHHHTTS-TTS--EEE-EE----------------------EE
T ss_pred             HHHHHHhhhc-------CCceEEEEEeCCceecHHHHHHHHhhCCCccCEEeeeeccCccceeeccccccccCcCceEee
Confidence            3222222211       367999999999999999999999997 5579999987522110            0111466


Q ss_pred             -ccceeeeCHHHHHHHHHc--CCC----CCCCCCchhHHHHHHHHh-cCCCCccccCccceeecCCC------CCccccc
Q 020721          218 -SGMGYLVSWDIAEWIRDS--DIP----KNHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPVP------RTACTHE  283 (322)
Q Consensus       218 -~G~gYvlS~d~~~~i~~~--~~~----~~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~~------~~~c~~~  283 (322)
                       +|+||+||+.++++|...  ...    ....-..||+.+|.|+.. +|+...+.   +.|+.....      ...+.  
T Consensus       148 ~GGaG~vlSr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~~lgv~lt~s---~~fhs~~~~l~~~~~~~l~~--  222 (252)
T PF02434_consen  148 TGGAGYVLSRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIENLLGVPLTHS---PLFHSHLENLQDYNPETLHR--  222 (252)
T ss_dssp             -GGG-EEEEHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHHTT---EEE----TT---SSS-GGG--TTTGGG--
T ss_pred             CCCeeHHHhHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHhcCCcceeec---hhhcccCcccccCCHHHhcc--
Confidence             678999999999999542  111    112345899999999999 88865432   345443322      11121  


Q ss_pred             ccCCEEEEEccC
Q 020721          284 LWADTIAVHLLK  295 (322)
Q Consensus       284 ~~~~~ia~H~~~  295 (322)
                        .-+|++|...
T Consensus       223 --q~~~s~~~~~  232 (252)
T PF02434_consen  223 --QVPISYHKFE  232 (252)
T ss_dssp             ---SEEE-EEET
T ss_pred             --CCCeecCCCc
Confidence              3479999985


No 9  
>PLN03153 hypothetical protein; Provisional
Probab=99.44  E-value=7.1e-12  Score=123.28  Aligned_cols=204  Identities=19%  Similarity=0.238  Sum_probs=122.3

Q ss_pred             CeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEecc-CCc----CCc
Q 020721           73 DIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNC-KEN----MNK  146 (322)
Q Consensus        73 ~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~-~D~----y~n  146 (322)
                      --.|+++|.+..+.. +|+..|+.+|....-.|    .+|+.....+...          -.++-.... .|+    |+|
T Consensus       121 ~~hIvF~I~~s~~~w~~R~~yik~wW~p~~~rg----~v~ld~~~~~~~~----------~~~~P~i~is~d~s~f~y~~  186 (537)
T PLN03153        121 LNHIMFGIAGSSQLWKRRKELVRLWWRPNQMRG----HVWLEEQVSPEEG----------DDSLPPIMVSEDTSRFRYTN  186 (537)
T ss_pred             cccEEEEEEEchhhhhhhhhhhhhhcCccccee----EEEecccCCCCCC----------cCCCCCEEeCCCcccccccC
Confidence            356889999988776 77789999998642111    4555443322100          001111111 122    333


Q ss_pred             ----hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCCCC-CC-ccc-c
Q 020721          147 ----GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSMDP-FV-DYM-S  218 (322)
Q Consensus       147 ----~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~~~-yp-~y~-~  218 (322)
                          .-+..+.+-+.+.+..   .-|+++|++++|||||+.++||++.|... +.+..|+|........+. +. .|+ +
T Consensus       187 ~~Gh~sa~rI~rmv~et~~~---~~pd~kWfVf~DDDTyf~~~NLv~~Ls~YDptkp~YIGs~Se~~~qn~~f~~~fA~G  263 (537)
T PLN03153        187 PTGHPSGLRISRIVLESFRL---GLPDVRWFVLGDDDTIFNADNLVAVLSKYDPSEMVYVGGPSESHSANSYFSHNMAFG  263 (537)
T ss_pred             CCCcHHHHHHHHHHHHHHHh---hCCCCCEEEEecCCccccHHHHHHHHhhcCCCCCEEecccccccccccccccccccC
Confidence                1222222222222211   13689999999999999999999999987 567899997643222222 11 344 7


Q ss_pred             cceeeeCHHHHHHHHHcC--CC-CCCCCCchhHHHHHHHHhcCCCCccccCccceeecCC---CCCcccccccCCEEEEE
Q 020721          219 GMGYLVSWDIAEWIRDSD--IP-KNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPV---PRTACTHELWADTIAVH  292 (322)
Q Consensus       219 G~gYvlS~d~~~~i~~~~--~~-~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~---~~~~c~~~~~~~~ia~H  292 (322)
                      |+||+||+.+++.|.+.-  +. +-+....+|..+|.|+..+|+..+.   .++|++..-   +.+-....-..-.+++|
T Consensus       264 GAG~~LSrPLae~L~~~~d~C~~rY~~~~~gD~rL~~CL~elGV~LT~---~~gfhQ~D~~Gd~~G~les~p~~P~vSlH  340 (537)
T PLN03153        264 GGGIAISYPLAEALSRILDDCLDRYPKLYGSDDRLHACITELGVPLSR---EPGFHQWDIRGNAHGLLSSHPIAPFVSIH  340 (537)
T ss_pred             CceEEEcHHHHHHHHHHhhhhhhhcccCCCcHHHHHHHHHHcCCCcee---cCCccccccCCCcchHhhcCCCCCceeee
Confidence            899999999999988742  11 1123456899999999999986653   356665432   22222211012368999


Q ss_pred             ccCC
Q 020721          293 LLKN  296 (322)
Q Consensus       293 ~~~~  296 (322)
                      +.+-
T Consensus       341 H~~~  344 (537)
T PLN03153        341 HVEA  344 (537)
T ss_pred             eccc
Confidence            8875


No 10 
>KOG3708 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.84  E-value=6.5e-05  Score=73.99  Aligned_cols=204  Identities=16%  Similarity=0.206  Sum_probs=127.1

Q ss_pred             CCCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEec---cCCcCCch
Q 020721           71 SEDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILN---CKENMNKG  147 (322)
Q Consensus        71 ~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld---~~D~y~n~  147 (322)
                      ...=+|+++|+|.   ..---+|-.|-|..-+     ++.|+.+...-..             |.-+++   +.|... .
T Consensus        23 G~RErl~~aVmte---~tlA~a~NrT~ahhvp-----rv~~F~~~~~i~~-------------~~a~~~~vs~~d~r~-~   80 (681)
T KOG3708|consen   23 GTRERLMAAVMTE---STLALAINRTLAHHVP-----RVHLFADSSRIDN-------------DLAQLTNVSPYDLRG-Q   80 (681)
T ss_pred             hhHHHHHHHHHHH---HHHHHHHHHHHHhhcc-----eeEEeeccccccc-------------cHhhccccCccccCc-c
Confidence            3345677888882   2566688888886631     3677777543211             222222   333333 3


Q ss_pred             HHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC-CCeEEEEeecCCCCCCCCcccc-cceeee
Q 020721          148 KTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR-EDLYYGYVIPCRSMDPFVDYMS-GMGYLV  224 (322)
Q Consensus       148 Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~~~y~G~~~p~r~~~~yp~y~~-G~gYvl  224 (322)
                      ||. +-++++.+++.+      +|||++-+-||+||+...|++.+..+.- .++|+|.-.-..+ +    =|+ |.||+|
T Consensus        81 ~~~s~vl~~l~~~~~~------~YDwFll~~D~tYv~a~~L~~l~~hmsin~dlymGEe~~~gs-~----rC~l~~G~LL  149 (681)
T KOG3708|consen   81 KTHSMVLGLLFNMVHN------NYDWFLLAKDSTYVNAFVLLRLIDHMSINEDLYMGEEAEDGS-G----RCRLDTGMLL  149 (681)
T ss_pred             ccHHHHHHHHHHhhcc------ccceEEEecCcceecHHHHHHHHhhcccccccccchhhhCcc-C----ccccccceee
Confidence            554 456777777643      7899999999999999999999988864 6899996432111 2    274 689999


Q ss_pred             CHHHHHHHHHcC--CCCCCCCCchhHHHHHHHHh-cCCCCccccCccceeecCC----CCC------cccccccCCEEEE
Q 020721          225 SWDIAEWIRDSD--IPKNHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPV----PRT------ACTHELWADTIAV  291 (322)
Q Consensus       225 S~d~~~~i~~~~--~~~~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~----~~~------~c~~~~~~~~ia~  291 (322)
                      |+.++..+-.+-  +.+.-.-.=.|+++|+|+.- .|++-.-.  +.++.+|..    +..      .-..+-..+..++
T Consensus       150 S~s~l~~lrnnle~C~~~~lsad~d~~lgrCi~~At~v~C~~~--hQGvrq~s~~~dspgr~~~~~e~~~s~aFr~A~tv  227 (681)
T KOG3708|consen  150 SQSLLHALRNNLEGCRNDILSADPDEWLGRCIQDATGVGCKPL--HQGVRQYSEREDSPGRHDSIPEWEGSPAFRSALTV  227 (681)
T ss_pred             cHHHHHHHHhhHHHhhcccccCCcHHHHHHHHHHhhcCCccch--hhhHHhhhHhhcCCCccccchhhcCChHHhhhhcc
Confidence            999999998752  22233345568999999975 45542111  122212110    000      0000112568999


Q ss_pred             EccCCHHHHHHHHHHccc
Q 020721          292 HLLKNQEKWIETLTYFNV  309 (322)
Q Consensus       292 H~~~~~~~~~~~~~~fn~  309 (322)
                      |.+..+..+.++.+||-.
T Consensus       228 ~pv~~p~d~yrLH~yfsr  245 (681)
T KOG3708|consen  228 HPVLSPADMYRLHKYFSR  245 (681)
T ss_pred             CccCCHHHHHHHHHHHHH
Confidence            999888777778888853


No 11 
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=95.77  E-value=1.1  Score=43.38  Aligned_cols=171  Identities=13%  Similarity=0.053  Sum_probs=97.8

Q ss_pred             eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC--eEEeccCCcC-CchHHH
Q 020721           74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD--IIILNCKENM-NKGKTY  150 (322)
Q Consensus        74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D--Iv~ld~~D~y-~n~Kt~  150 (322)
                      +.+-|+|++.-....-.+.++.-- .+..  ..+.++++...+++.... .+++=.++|.+  |..+.-.++- .|.|.-
T Consensus        41 p~VSViiP~~nee~~l~~~L~Sl~-~q~Y--p~~EIivvdd~s~D~t~~-iv~~~~~~~p~~~i~~v~~~~~~G~~~K~~  116 (373)
T TIGR03472        41 PPVSVLKPLHGDEPELYENLASFC-RQDY--PGFQMLFGVQDPDDPALA-VVRRLRADFPDADIDLVIDARRHGPNRKVS  116 (373)
T ss_pred             CCeEEEEECCCCChhHHHHHHHHH-hcCC--CCeEEEEEeCCCCCcHHH-HHHHHHHhCCCCceEEEECCCCCCCChHHH
Confidence            557778887755554455665433 3331  136677776666554332 33333355665  3233222222 236765


Q ss_pred             HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEee--cCC-----------CCC----
Q 020721          151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVI--PCR-----------SMD----  211 (322)
Q Consensus       151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~--p~r-----------~~~----  211 (322)
                      ...+-+..         .+.+|++..|+|+.+..+.|.+.+....  .-++..|...  +..           ...    
T Consensus       117 ~l~~~~~~---------a~ge~i~~~DaD~~~~p~~L~~lv~~~~~~~v~~V~~~~~~~~~~~~~~~l~~~~~~~~~~~~  187 (373)
T TIGR03472       117 NLINMLPH---------ARHDILVIADSDISVGPDYLRQVVAPLADPDVGLVTCLYRGRPVPGFWSRLGAMGINHNFLPS  187 (373)
T ss_pred             HHHHHHHh---------ccCCEEEEECCCCCcChhHHHHHHHHhcCCCcceEeccccCCCCCCHHHHHHHHHhhhhhhHH
Confidence            54332222         2679999999999999888877765542  2233333211  110           000    


Q ss_pred             -------CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          212 -------PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       212 -------~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                             .-+.++.|.++++.+++.+.+--...  ....-.||..+|.-+++.|.
T Consensus       188 ~~~~~~~~~~~~~~G~~~a~RR~~l~~iGGf~~--~~~~~~ED~~l~~~i~~~G~  240 (373)
T TIGR03472       188 VMVARALGRARFCFGATMALRRATLEAIGGLAA--LAHHLADDYWLGELVRALGL  240 (373)
T ss_pred             HHHHHhccCCccccChhhheeHHHHHHcCChHH--hcccchHHHHHHHHHHHcCC
Confidence                   01245789999999999999865331  12234699999999988775


No 12 
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=95.46  E-value=1.6  Score=42.47  Aligned_cols=174  Identities=14%  Similarity=0.076  Sum_probs=93.9

Q ss_pred             CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHH-HhhCC---CeEEeccCCcC-C-c
Q 020721           73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALE-IMRYD---DIIILNCKENM-N-K  146 (322)
Q Consensus        73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E-~~~~~---DIv~ld~~D~y-~-n  146 (322)
                      .+.+-|+|++.-+...-.+.++.--.+ ... ...++++|-..++++..  ++.++ .+++.   .+..+...+.- . +
T Consensus        39 ~p~VSVIIpa~Ne~~~L~~~L~sL~~q-~yp-~~~eIIVVDd~StD~T~--~i~~~~~~~~~~~~~i~vi~~~~~~~g~~  114 (384)
T TIGR03469        39 WPAVVAVVPARNEADVIGECVTSLLEQ-DYP-GKLHVILVDDHSTDGTA--DIARAAARAYGRGDRLTVVSGQPLPPGWS  114 (384)
T ss_pred             CCCEEEEEecCCcHhHHHHHHHHHHhC-CCC-CceEEEEEeCCCCCcHH--HHHHHHHHhcCCCCcEEEecCCCCCCCCc
Confidence            356777788775443333444433322 211 13567777665655432  22223 22333   45545433221 1 2


Q ss_pred             hHHHH---HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeec-CC------------
Q 020721          147 GKTYT---YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIP-CR------------  208 (322)
Q Consensus       147 ~Kt~~---~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p-~r------------  208 (322)
                      +|..+   +++.+.+       .+++.+|++.+|+|+.+..+.|.+.+....  ..++..|.... ..            
T Consensus       115 Gk~~A~n~g~~~A~~-------~~~~gd~llflDaD~~~~p~~l~~lv~~~~~~~~~~vs~~~~~~~~~~~~~~~~~~~~  187 (384)
T TIGR03469       115 GKLWAVSQGIAAART-------LAPPADYLLLTDADIAHGPDNLARLVARARAEGLDLVSLMVRLRCESFWEKLLIPAFV  187 (384)
T ss_pred             chHHHHHHHHHHHhc-------cCCCCCEEEEECCCCCCChhHHHHHHHHHHhCCCCEEEecccccCCCHHHHHHHHHHH
Confidence            67554   3333322       233479999999999999887777765432  22232222110 00            


Q ss_pred             -------------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          209 -------------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       209 -------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                                   .......++.|++.++++++.+.+--...  ......||+.++.-+++.|.
T Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~G~~~lirr~~~~~vGGf~~--~~~~~~ED~~L~~r~~~~G~  249 (384)
T TIGR03469       188 FFFQKLYPFRWVNDPRRRTAAAAGGCILIRREALERIGGIAA--IRGALIDDCTLAAAVKRSGG  249 (384)
T ss_pred             HHHHHhcchhhhcCCCccceeecceEEEEEHHHHHHcCCHHH--HhhCcccHHHHHHHHHHcCC
Confidence                         00011234679999999999998854321  12235799999999988763


No 13 
>PF13641 Glyco_tranf_2_3:  Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=95.10  E-value=0.18  Score=44.47  Aligned_cols=169  Identities=21%  Similarity=0.206  Sum_probs=79.4

Q ss_pred             eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCe--EEeccCCcCC-chHHH
Q 020721           74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDI--IILNCKENMN-KGKTY  150 (322)
Q Consensus        74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DI--v~ld~~D~y~-n~Kt~  150 (322)
                      ++|.|+|++.-....-++.++.-....-   .+++++.+...++++ ....+++-.+.+...  ..+.-..+.. +.|.-
T Consensus         1 P~v~Vvip~~~~~~~l~~~l~sl~~~~~---~~~~v~vvd~~~~~~-~~~~~~~~~~~~~~~~v~vi~~~~~~g~~~k~~   76 (228)
T PF13641_consen    1 PRVSVVIPAYNEDDVLRRCLESLLAQDY---PRLEVVVVDDGSDDE-TAEILRALAARYPRVRVRVIRRPRNPGPGGKAR   76 (228)
T ss_dssp             --EEEE--BSS-HHHHHHHHHHHTTSHH---HTEEEEEEEE-SSS--GCTTHHHHHHTTGG-GEEEEE----HHHHHHHH
T ss_pred             CEEEEEEEecCCHHHHHHHHHHHHcCCC---CCeEEEEEECCCChH-HHHHHHHHHHHcCCCceEEeecCCCCCcchHHH
Confidence            3577788877554445555555553211   245555555333332 222344444555542  3333222221 12433


Q ss_pred             HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC---------------------
Q 020721          151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC---------------------  207 (322)
Q Consensus       151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~---------------------  207 (322)
                      + +.+.....        +.+|++..|||+.+..+.|.+.+...  +.-....|.....                     
T Consensus        77 a-~n~~~~~~--------~~d~i~~lD~D~~~~p~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~  147 (228)
T PF13641_consen   77 A-LNEALAAA--------RGDYILFLDDDTVLDPDWLERLLAAFADPGVGAVGGPVFPDNDRNWLTRLQDLFFARWHLRF  147 (228)
T ss_dssp             H-HHHHHHH-----------SEEEEE-SSEEE-CHHHHHHHHHHHBSS--EEEEEEEETTCCCEEEE-TT--S-EETTTS
T ss_pred             H-HHHHHHhc--------CCCEEEEECCCcEECHHHHHHHHHHHHhCCCCeEeeeEeecCCCCHHHHHHHHHHhhhhhhh
Confidence            2 23333321        46999999999999988777766554  2222222322110                     


Q ss_pred             CC-CCCC-CcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          208 RS-MDPF-VDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       208 r~-~~~y-p~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .. ...+ -.++.|++.++.+++++.+.....    ....||..++.-+.+.|.
T Consensus       148 ~~~~~~~~~~~~~G~~~~~rr~~~~~~g~fd~----~~~~eD~~l~~r~~~~G~  197 (228)
T PF13641_consen  148 RSGRRALGVAFLSGSGMLFRRSALEEVGGFDP----FILGEDFDLCLRLRAAGW  197 (228)
T ss_dssp             -TT-B----S-B--TEEEEEHHHHHHH-S--S----SSSSHHHHHHHHHHHTT-
T ss_pred             hhhhcccceeeccCcEEEEEHHHHHHhCCCCC----CCcccHHHHHHHHHHCCC
Confidence            00 0011 135689999999999999965331    445599999999988765


No 14 
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans,  glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=95.08  E-value=1.3  Score=38.33  Aligned_cols=130  Identities=17%  Similarity=0.117  Sum_probs=77.0

Q ss_pred             ceEEEEEecCCCchhHHHHHHHHHhhCC--CeEEeccCCcC-CchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEe
Q 020721          106 QVDVKFVFCNLTKEDQKVLVALEIMRYD--DIIILNCKENM-NKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYI  182 (322)
Q Consensus       106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~--DIv~ld~~D~y-~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv  182 (322)
                      .+.+++|-..++++.. ..+++-.+.+.  ++.......+. .+.|.-.. ....+.        .+.+|++..|+|+.+
T Consensus        30 ~~eiivVdd~s~d~t~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-n~g~~~--------a~~d~i~~~D~D~~~   99 (196)
T cd02520          30 KYEILFCVQDEDDPAI-PVVRKLIAKYPNVDARLLIGGEKVGINPKVNNL-IKGYEE--------ARYDILVISDSDISV   99 (196)
T ss_pred             CeEEEEEeCCCcchHH-HHHHHHHHHCCCCcEEEEecCCcCCCCHhHHHH-HHHHHh--------CCCCEEEEECCCceE
Confidence            3667777776665433 33444445554  33223222221 12343321 111221        256999999999999


Q ss_pred             eHHHHHHHhcCCC--CCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          183 RLENLVKSLVPLP--REDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       183 ~~~~L~~~L~~~~--~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ..+.|.+.+....  .-++..|.            ++.|++.++.+++.+.+....  .......||..++.-+.+.|.
T Consensus       100 ~~~~l~~l~~~~~~~~~~~v~~~------------~~~g~~~~~r~~~~~~~ggf~--~~~~~~~eD~~l~~rl~~~G~  164 (196)
T cd02520         100 PPDYLRRMVAPLMDPGVGLVTCL------------CAFGKSMALRREVLDAIGGFE--AFADYLAEDYFLGKLIWRLGY  164 (196)
T ss_pred             ChhHHHHHHHHhhCCCCCeEEee------------cccCceeeeEHHHHHhccChH--HHhHHHHHHHHHHHHHHHcCC
Confidence            8888777776532  22222222            678999999999999875432  122334799999988887764


No 15 
>PF13506 Glyco_transf_21:  Glycosyl transferase family 21
Probab=94.96  E-value=0.14  Score=44.47  Aligned_cols=106  Identities=18%  Similarity=0.138  Sum_probs=72.0

Q ss_pred             CchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEee--cCC------------
Q 020721          145 NKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVI--PCR------------  208 (322)
Q Consensus       145 ~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~--p~r------------  208 (322)
                      -|.|.......+.. .       .++++++-.|+|+.++.+.|.+.+..+.  .-++..|...  +.+            
T Consensus        15 ~N~Kv~nL~~~~~~-~-------a~~d~~~~~DsDi~v~p~~L~~lv~~l~~p~vglVt~~~~~~~~~~~~~~l~~~~~~   86 (175)
T PF13506_consen   15 CNPKVNNLAQGLEA-G-------AKYDYLVISDSDIRVPPDYLRELVAPLADPGVGLVTGLPRGVPARGFWSRLEAAFFN   86 (175)
T ss_pred             CChHHHHHHHHHHh-h-------CCCCEEEEECCCeeECHHHHHHHHHHHhCCCCcEEEecccccCCcCHHHHHHHHHHh
Confidence            45788776666554 1       2679999999999999998888876653  3333333221  110            


Q ss_pred             ------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCC
Q 020721          209 ------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRA  260 (322)
Q Consensus       209 ------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~  260 (322)
                            ..-...+++.|+++++.+++++.+.--.  .....-.||..+|..+++.|..
T Consensus        87 ~~~~~~~a~~~~~~~~G~~m~~rr~~L~~~GG~~--~l~~~ladD~~l~~~~~~~G~~  142 (175)
T PF13506_consen   87 FLPGVLQALGGAPFAWGGSMAFRREALEEIGGFE--ALADYLADDYALGRRLRARGYR  142 (175)
T ss_pred             HHHHHHHHhcCCCceecceeeeEHHHHHHcccHH--HHhhhhhHHHHHHHHHHHCCCe
Confidence                  0011357899999999999999874322  1233569999999999998763


No 16 
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=94.48  E-value=2.8  Score=36.39  Aligned_cols=141  Identities=13%  Similarity=-0.011  Sum_probs=77.4

Q ss_pred             eEEEEEecCCCchhHHHHHH-HHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721          107 VDVKFVFCNLTKEDQKVLVA-LEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE  185 (322)
Q Consensus       107 v~~~Fvlg~~~~~~~~~~l~-~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~  185 (322)
                      +.++.|-..+++.. ...++ .....+..+..++..+..+.+|.... .+....        ...+|++.+|+|..+..+
T Consensus        29 ~eiivvdd~s~d~t-~~~~~~~~~~~~~~v~~~~~~~~~~~g~~~a~-n~g~~~--------~~~d~i~~~D~D~~~~~~   98 (229)
T cd04192          29 FEVILVDDHSTDGT-VQILEFAAAKPNFQLKILNNSRVSISGKKNAL-TTAIKA--------AKGDWIVTTDADCVVPSN   98 (229)
T ss_pred             eEEEEEcCCCCcCh-HHHHHHHHhCCCcceEEeeccCcccchhHHHH-HHHHHH--------hcCCEEEEECCCcccCHH
Confidence            55666655544332 22333 22222334555655543333555332 222221        256999999999999887


Q ss_pred             HHHHHhcCC--CCCCeEEEEeecC--C---------------------CCCCCCcccccceeeeCHHHHHHHHHcCCCCC
Q 020721          186 NLVKSLVPL--PREDLYYGYVIPC--R---------------------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKN  240 (322)
Q Consensus       186 ~L~~~L~~~--~~~~~y~G~~~p~--r---------------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~  240 (322)
                      .|.+.+...  +....+.|.....  .                     .....+..+.|++..+++++.+.+---.  ..
T Consensus        99 ~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~rr~~~~~~ggf~--~~  176 (229)
T cd04192          99 WLLTFVAFIQKEQIGLVAGPVIYFKGKSLLAKFQRLDWLSLLGLIAGSFGLGKPFMCNGANMAYRKEAFFEVGGFE--GN  176 (229)
T ss_pred             HHHHHHHHhhcCCCcEEeeeeeecCCccHHHHHHHHHHHHHHHHHhhHHHhcCccccccceEEEEHHHHHHhcCCc--cc
Confidence            777777643  2234555543211  0                     0112344567899999999999986533  12


Q ss_pred             CCCCchhHHHHHHHHhcCC
Q 020721          241 HLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       241 ~~~~~EDv~vG~~l~~~~~  259 (322)
                      .....||..++.-+...|.
T Consensus       177 ~~~~~eD~~~~~~~~~~g~  195 (229)
T cd04192         177 DHIASGDDELLLAKVASKY  195 (229)
T ss_pred             cccccCCHHHHHHHHHhCC
Confidence            3456677777654444433


No 17 
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=93.70  E-value=2.6  Score=34.39  Aligned_cols=76  Identities=21%  Similarity=0.107  Sum_probs=53.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGP  245 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~  245 (322)
                      +.+|++..|||..+..+.+...++..   +.-.++.|.             +.|++.++++++++.+.....  ......
T Consensus        74 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~--~~~~~~  138 (166)
T cd04186          74 KGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPK-------------VSGAFLLVRREVFEEVGGFDE--DFFLYY  138 (166)
T ss_pred             CCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEcc-------------CceeeEeeeHHHHHHcCCCCh--hhhccc
Confidence            56999999999999988887777642   221222211             788999999999998754321  112277


Q ss_pred             hhHHHHHHHHhcCC
Q 020721          246 EDKVFGAWIREGRR  259 (322)
Q Consensus       246 EDv~vG~~l~~~~~  259 (322)
                      ||..+..-+...|.
T Consensus       139 eD~~~~~~~~~~g~  152 (166)
T cd04186         139 EDVDLCLRARLAGY  152 (166)
T ss_pred             cHHHHHHHHHHcCC
Confidence            99998887776654


No 18 
>PF04646 DUF604:  Protein of unknown function, DUF604;  InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=92.04  E-value=0.41  Score=43.95  Aligned_cols=74  Identities=16%  Similarity=0.168  Sum_probs=49.0

Q ss_pred             ccceeeeCHHHHHHHHHcC--C-CCCCCCCchhHHHHHHHHhcCCCCccccCccceeecCCC---CCcc-cccccCCEEE
Q 020721          218 SGMGYLVSWDIAEWIRDSD--I-PKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPVP---RTAC-THELWADTIA  290 (322)
Q Consensus       218 ~G~gYvlS~d~~~~i~~~~--~-~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~~---~~~c-~~~~~~~~ia  290 (322)
                      +|+|+.||..+|+.|.+.-  + .+.+.+.--|-.+..|+.++|+..++   +++|++....   .+-. +++. .-.++
T Consensus        12 GGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~lgv~LT~---e~g~hQ~Di~Gd~~G~~~a~~~-~pl~S   87 (255)
T PF04646_consen   12 GGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAELGVPLTK---EPGFHQMDIRGDPSGFLEAHPL-APLVS   87 (255)
T ss_pred             cCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHHhCCCcee---cCCceeEeeccCcceeeecCCC-Cceee
Confidence            7999999999999999852  1 12344555788999999999886553   3566665422   1211 1111 23689


Q ss_pred             EEccC
Q 020721          291 VHLLK  295 (322)
Q Consensus       291 ~H~~~  295 (322)
                      +|+++
T Consensus        88 lHH~~   92 (255)
T PF04646_consen   88 LHHWD   92 (255)
T ss_pred             eeehh
Confidence            99875


No 19 
>PRK11204 N-glycosyltransferase; Provisional
Probab=90.13  E-value=12  Score=36.50  Aligned_cols=169  Identities=11%  Similarity=-0.009  Sum_probs=91.9

Q ss_pred             CCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721           72 EDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT  151 (322)
Q Consensus        72 ~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~  151 (322)
                      +.+.+-|+|++.-+...-++.++.--. +.  ....+++. +...++++..+.+++..+++..+...+..++.  +|.-+
T Consensus        52 ~~p~vsViIp~yne~~~i~~~l~sl~~-q~--yp~~eiiV-vdD~s~d~t~~~l~~~~~~~~~v~~i~~~~n~--Gka~a  125 (420)
T PRK11204         52 EYPGVSILVPCYNEGENVEETISHLLA-LR--YPNYEVIA-INDGSSDNTGEILDRLAAQIPRLRVIHLAENQ--GKANA  125 (420)
T ss_pred             CCCCEEEEEecCCCHHHHHHHHHHHHh-CC--CCCeEEEE-EECCCCccHHHHHHHHHHhCCcEEEEEcCCCC--CHHHH
Confidence            345688888887554333333333222 21  11344443 34333333334455555666666666544433  45433


Q ss_pred             HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCC-------------------
Q 020721          152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRS-------------------  209 (322)
Q Consensus       152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~-------------------  209 (322)
                       +....+.        .+.+|++..|+|+.+..+.|.+.++..   |.-....|.......                   
T Consensus       126 -ln~g~~~--------a~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (420)
T PRK11204        126 -LNTGAAA--------ARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTGNPRIRNRSTLLGRIQVGEFSSIIGLI  196 (420)
T ss_pred             -HHHHHHH--------cCCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEECCceeccchhHHHHHHHHHHHHhhhHH
Confidence             2222221        267999999999999998777766543   322233332210000                   


Q ss_pred             ---CCC--CCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          210 ---MDP--FVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       210 ---~~~--yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                         ...  .+...+|.+.++++++++.+.--.    ...-.||+.++.-+.+.|.
T Consensus       197 ~~~~~~~~~~~~~~G~~~~~rr~~l~~vgg~~----~~~~~ED~~l~~rl~~~G~  247 (420)
T PRK11204        197 KRAQRVYGRVFTVSGVITAFRKSALHEVGYWS----TDMITEDIDISWKLQLRGW  247 (420)
T ss_pred             HHHHHHhCCceEecceeeeeeHHHHHHhCCCC----CCcccchHHHHHHHHHcCC
Confidence               000  112357888999999998874322    1234699999988877664


No 20 
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm 
Probab=90.07  E-value=10  Score=31.67  Aligned_cols=90  Identities=11%  Similarity=-0.065  Sum_probs=58.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhH
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDK  248 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv  248 (322)
                      ..+|++..|+|..+..+.|.+.++.........|...-. ....-.....|+++.+.++....+.... ........||+
T Consensus        79 ~g~~i~~lD~D~~~~~~~l~~~~~~~~~~~~v~g~~~~~-~~~~~~~~~~~~~~~~~r~~~~~~ggf~-~~~~~~~~eD~  156 (182)
T cd06420          79 KGDYLIFIDGDCIPHPDFIADHIELAEPGVFLSGSRVLL-NEKLTERGIRGCNMSFWKKDLLAVNGFD-EEFTGWGGEDS  156 (182)
T ss_pred             cCCEEEEEcCCcccCHHHHHHHHHHhCCCcEEecceeec-ccccceeEeccceEEEEHHHHHHhCCCC-cccccCCcchH
Confidence            569999999999998887777776664434444543211 1111113456888888888887554432 12223458999


Q ss_pred             HHHHHHHhcCCC
Q 020721          249 VFGAWIREGRRA  260 (322)
Q Consensus       249 ~vG~~l~~~~~~  260 (322)
                      .++.=+.+.|..
T Consensus       157 ~l~~r~~~~g~~  168 (182)
T cd06420         157 ELVARLLNSGIK  168 (182)
T ss_pred             HHHHHHHHcCCc
Confidence            999888888743


No 21 
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4)  to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=89.99  E-value=3.8  Score=33.40  Aligned_cols=110  Identities=17%  Similarity=0.166  Sum_probs=61.9

Q ss_pred             EEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCc------hHHHH
Q 020721           78 IGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNK------GKTYT  151 (322)
Q Consensus        78 i~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n------~Kt~~  151 (322)
                      +.|.|-+...+||+.+++......     +...|+-|-.........+......+.   ..........      ..-+.
T Consensus         2 i~vInL~~~~~Rr~~~~~~~~~~~-----~~~~~~~Avd~~~~~~~~~~~~~~~~~---~~~~~~~l~~gEiGC~lSH~~   73 (128)
T cd06532           2 IFVINLDRSTDRRERMEAQLAALG-----LDFEFFDAVDGKDLSEEELAALYDALF---LPRYGRPLTPGEIGCFLSHYK   73 (128)
T ss_pred             EEEEECCCCHHHHHHHHHHHHHcC-----CCeEEEeccccccCCHHHHHHHhHHHh---hhhcCCCCChhhHHHHHHHHH
Confidence            457788888999999998665432     335666664332211111111111000   0001111111      11122


Q ss_pred             HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHH
Q 020721          152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEW  231 (322)
Q Consensus       152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~  231 (322)
                      .++-+.+         .+.++.+-..||+.+..+                                +..||++|++.|+.
T Consensus        74 ~w~~~~~---------~~~~~alIlEDDv~~~~~--------------------------------~~~~Y~vs~~~A~~  112 (128)
T cd06532          74 LWQKIVE---------SNLEYALILEDDAILDPD--------------------------------GTAGYLVSRKGAKK  112 (128)
T ss_pred             HHHHHHH---------cCCCeEEEEccCcEECCC--------------------------------CceEEEeCHHHHHH
Confidence            3333333         155899999999997665                                55799999999999


Q ss_pred             HHHcC
Q 020721          232 IRDSD  236 (322)
Q Consensus       232 i~~~~  236 (322)
                      +.+..
T Consensus       113 ll~~~  117 (128)
T cd06532         113 LLAAL  117 (128)
T ss_pred             HHHhC
Confidence            99965


No 22 
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily.  CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains.  The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=89.14  E-value=16  Score=32.49  Aligned_cols=165  Identities=12%  Similarity=-0.001  Sum_probs=84.9

Q ss_pred             CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHH
Q 020721           73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTY  152 (322)
Q Consensus        73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~  152 (322)
                      ...+-|+|++.-....-...++.-..+.. ....+.++++-..++++. ...+.+..+.  .+......++.  +|....
T Consensus        28 ~~~isVvip~~n~~~~l~~~l~si~~q~~-~~~~~eiivvdd~s~d~t-~~~~~~~~~~--~v~~i~~~~~~--g~~~a~  101 (251)
T cd06439          28 LPTVTIIIPAYNEEAVIEAKLENLLALDY-PRDRLEIIVVSDGSTDGT-AEIAREYADK--GVKLLRFPERR--GKAAAL  101 (251)
T ss_pred             CCEEEEEEecCCcHHHHHHHHHHHHhCcC-CCCcEEEEEEECCCCccH-HHHHHHHhhC--cEEEEEcCCCC--ChHHHH
Confidence            35677788887654444556666555432 112355555554444332 2223222222  34443322322  344332


Q ss_pred             HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeecCCCC--------------------
Q 020721          153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIPCRSM--------------------  210 (322)
Q Consensus       153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p~r~~--------------------  210 (322)
                      -..+..         ...+|++-.|+|+.+..+.|.+.+....  .-.+..|......+.                    
T Consensus       102 n~gi~~---------a~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  172 (251)
T cd06439         102 NRALAL---------ATGEIVVFTDANALLDPDALRLLVRHFADPSVGAVSGELVIVDGGGSGSGEGLYWKYENWLKRAE  172 (251)
T ss_pred             HHHHHH---------cCCCEEEEEccccCcCHHHHHHHHHHhcCCCccEEEeEEEecCCcccchhHHHHHHHHHHHHHHH
Confidence            222222         1459999999999999776766666543  224444543211100                    


Q ss_pred             --CCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          211 --DPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       211 --~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                        ......+.|+++.+.+++.+     . . ......||..++.-+...|.
T Consensus       173 ~~~~~~~~~~g~~~~~rr~~~~-----~-~-~~~~~~eD~~l~~~~~~~G~  216 (251)
T cd06439         173 SRLGSTVGANGAIYAIRRELFR-----P-L-PADTINDDFVLPLRIARQGY  216 (251)
T ss_pred             HhcCCeeeecchHHHhHHHHhc-----C-C-CcccchhHHHHHHHHHHcCC
Confidence              01122355666667776665     1 1 12334799999888877664


No 23 
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=88.96  E-value=5.7  Score=34.03  Aligned_cols=82  Identities=22%  Similarity=0.196  Sum_probs=55.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCch
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPE  246 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~E  246 (322)
                      ..+|++..|||..+..+.+.+.+...  +.-.+..|.....   +.     .++|.++++++.+.+--..  ..-....|
T Consensus        79 ~~d~v~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~~~~g~~~--~~~~~~~e  148 (202)
T cd04185          79 GYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDP---DG-----SFVGVLISRRVVEKIGLPD--KEFFIWGD  148 (202)
T ss_pred             CCCEEEEeCCCCCcChHHHHHHHHHHhcCCceEecceeEcC---CC-----ceEEEEEeHHHHHHhCCCC--hhhhccch
Confidence            66999999999999987776666554  3334454544321   11     4467899999998874321  11234569


Q ss_pred             hHHHHHHHHhcCCC
Q 020721          247 DKVFGAWIREGRRA  260 (322)
Q Consensus       247 Dv~vG~~l~~~~~~  260 (322)
                      |+.++.-+.+.|..
T Consensus       149 D~~~~~r~~~~G~~  162 (202)
T cd04185         149 DTEYTLRASKAGPG  162 (202)
T ss_pred             HHHHHHHHHHcCCc
Confidence            99999999888764


No 24 
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=88.92  E-value=19  Score=33.28  Aligned_cols=43  Identities=12%  Similarity=0.004  Sum_probs=31.6

Q ss_pred             cccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcC
Q 020721          215 DYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGR  258 (322)
Q Consensus       215 ~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~  258 (322)
                      ++++|++.++++++.+.+-.-.+ .....+.||+-+..=+...|
T Consensus       169 ~~~~g~~~~irr~~~~~vGgfDe-~~~~~~~ED~Dl~~R~~~~G  211 (299)
T cd02510         169 PTMAGGLFAIDREWFLELGGYDE-GMDIWGGENLELSFKVWQCG  211 (299)
T ss_pred             ccccceeeEEEHHHHHHhCCCCC-cccccCchhHHHHHHHHHcC
Confidence            46789999999999999865432 33444679998877666655


No 25 
>PF01755 Glyco_transf_25:  Glycosyltransferase family 25 (LPS biosynthesis protein);  InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=88.09  E-value=6.8  Score=34.03  Aligned_cols=34  Identities=15%  Similarity=0.174  Sum_probs=24.7

Q ss_pred             EEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCC
Q 020721           78 IGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNL  116 (322)
Q Consensus        78 i~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~  116 (322)
                      |.|.|-+...+||+.+.+.....     .+...|+-|-.
T Consensus         4 i~vInL~~~~~Rr~~~~~~~~~~-----~~~~e~~~Avd   37 (200)
T PF01755_consen    4 IYVINLDRSTERRERIQQQLAKL-----GINFEFFDAVD   37 (200)
T ss_pred             EEEEECCCCHHHHHHHHHHHHHc-----CCceEEEEeec
Confidence            56778888999999999888754     22356666543


No 26 
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=87.89  E-value=18  Score=31.64  Aligned_cols=139  Identities=14%  Similarity=0.088  Sum_probs=75.2

Q ss_pred             eEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHH
Q 020721          107 VDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLEN  186 (322)
Q Consensus       107 v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~  186 (322)
                      .++++|...++++. ...+ ++...+..+.+.. .++  .+|..+.-.-+..         .+.+|++..|+|+.+..+.
T Consensus        29 ~eiivvdd~s~d~~-~~~l-~~~~~~~~~~v~~-~~~--~g~~~a~n~g~~~---------a~~d~v~~lD~D~~~~~~~   94 (235)
T cd06434          29 LEIIVVTDGDDEPY-LSIL-SQTVKYGGIFVIT-VPH--PGKRRALAEGIRH---------VTTDIVVLLDSDTVWPPNA   94 (235)
T ss_pred             CEEEEEeCCCChHH-HHHH-HhhccCCcEEEEe-cCC--CChHHHHHHHHHH---------hCCCEEEEECCCceeChhH
Confidence            44666655554432 2223 3455566665553 222  2455432222221         1669999999999999998


Q ss_pred             HHHHhcCCCCC--CeEEEEeecCC-----------------------CCCCC--CcccccceeeeCHHHHHHHHHcCC--
Q 020721          187 LVKSLVPLPRE--DLYYGYVIPCR-----------------------SMDPF--VDYMSGMGYLVSWDIAEWIRDSDI--  237 (322)
Q Consensus       187 L~~~L~~~~~~--~~y~G~~~p~r-----------------------~~~~y--p~y~~G~gYvlS~d~~~~i~~~~~--  237 (322)
                      |.+.+......  ....|......                       ....+  -..++|.+.++.+++++.+.-...  
T Consensus        95 l~~l~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~rr~~l~~~~~~~~~~  174 (235)
T cd06434          95 LPEMLKPFEDPKVGGVGTNQRILRPRDSKWSFLAAEYLERRNEEIRAAMSYDGGVPCLSGRTAAYRTEILKDFLFLEEFT  174 (235)
T ss_pred             HHHHHHhccCCCEeEEcCceEeecCcccHHHHHHHHHHHHHHHHHHHHHhhCCCEEEccCcHHHHHHHHHhhhhhHHHhh
Confidence            87777665322  22222211100                       00001  123567778888888877543211  


Q ss_pred             ----CCCCCCCchhHHHHHHHHhcCC
Q 020721          238 ----PKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       238 ----~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                          ...+....||..++.-+.+.|.
T Consensus       175 ~~~~~~~~~~~~eD~~l~~~~~~~g~  200 (235)
T cd06434         175 NETFMGRRLNAGDDRFLTRYVLSHGY  200 (235)
T ss_pred             hhhhcCCCCCcCchHHHHHHHHHCCC
Confidence                1124557799999887777654


No 27 
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=87.46  E-value=23  Score=33.05  Aligned_cols=169  Identities=17%  Similarity=0.085  Sum_probs=95.8

Q ss_pred             eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchH-HHHH
Q 020721           74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGK-TYTY  152 (322)
Q Consensus        74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~K-t~~~  152 (322)
                      ..+-++|.|--......+.++..-....    ....+.++-+.+.++..+.+....  +.++..+...+|..-.. --.+
T Consensus         3 ~~i~~iiv~yn~~~~l~~~l~~l~~~~~----~~~~iv~vDn~s~d~~~~~~~~~~--~~~v~~i~~~~NlG~agg~n~g   76 (305)
T COG1216           3 PKISIIIVTYNRGEDLVECLASLAAQTY----PDDVIVVVDNGSTDGSLEALKARF--FPNVRLIENGENLGFAGGFNRG   76 (305)
T ss_pred             cceEEEEEecCCHHHHHHHHHHHhcCCC----CCcEEEEccCCCCCCCHHHHHhhc--CCcEEEEEcCCCccchhhhhHH
Confidence            3455556665444455555555554332    111222444444333333333322  77888888888864201 1123


Q ss_pred             HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC--CCeEEEEee-------------------------
Q 020721          153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR--EDLYYGYVI-------------------------  205 (322)
Q Consensus       153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~--~~~y~G~~~-------------------------  205 (322)
                      ++.+...         ..+|++-.++|+.+..+.|.+.++....  .....|...                         
T Consensus        77 ~~~a~~~---------~~~~~l~LN~D~~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~  147 (305)
T COG1216          77 IKYALAK---------GDDYVLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRA  147 (305)
T ss_pred             HHHHhcC---------CCcEEEEEcCCeeeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchheecccccccccccee
Confidence            4444331         1239999999999998888887765321  111111111                         


Q ss_pred             -cCC---C----CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          206 -PCR---S----MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       206 -p~r---~----~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                       +..   +    ....-.+++|++.++++++.+.+---.  ..-+...||+-.+.=+++.|.
T Consensus       148 ~~~~~~~~~~~~~~~~~~~~~G~~~li~~~~~~~vG~~d--e~~F~y~eD~D~~~R~~~~G~  207 (305)
T COG1216         148 SPLLEIAPDLSSYLEVVASLSGACLLIRREAFEKVGGFD--ERFFIYYEDVDLCLRARKAGY  207 (305)
T ss_pred             cccccccccccchhhhhhhcceeeeEEcHHHHHHhCCCC--cccceeehHHHHHHHHHHcCC
Confidence             000   0    011112578999999999999997632  245779999999999998885


No 28 
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains.  The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in  plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose.  Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=87.38  E-value=16  Score=32.47  Aligned_cols=87  Identities=15%  Similarity=0.085  Sum_probs=55.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCC--CCe-EE-EEee-cCC------------------------CCCCCCccccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPR--EDL-YY-GYVI-PCR------------------------SMDPFVDYMSG  219 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~--~~~-y~-G~~~-p~r------------------------~~~~yp~y~~G  219 (322)
                      +.+|++..|+|+.+..+.|.+.++....  .++ +. |.+. ...                        .....+..++|
T Consensus        84 ~gd~i~~~DaD~~~~~~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g  163 (241)
T cd06427          84 RGEYVVIYDAEDAPDPDQLKKAVAAFARLDDKLACVQAPLNYYNARENWLTRMFALEYAAWFDYLLPGLARLGLPIPLGG  163 (241)
T ss_pred             CCCEEEEEcCCCCCChHHHHHHHHHHHhcCCCEEEEeCceEeeCCCccHHHHHHHHHHHHHHHHHHHHHHhcCCeeecCC
Confidence            5599999999999999988887765422  222 22 2210 000                        00011224578


Q ss_pred             ceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          220 MGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       220 ~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ++.++++++.+.+.....    ....||..++.=+...|.
T Consensus       164 ~~~~~rr~~~~~vgg~~~----~~~~eD~~l~~rl~~~G~  199 (241)
T cd06427         164 TSNHFRTDVLRELGGWDP----FNVTEDADLGLRLARAGY  199 (241)
T ss_pred             chHHhhHHHHHHcCCCCc----ccchhhHHHHHHHHHCCc
Confidence            888999999998855331    234799999887766553


No 29 
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=86.46  E-value=17  Score=30.84  Aligned_cols=87  Identities=16%  Similarity=0.096  Sum_probs=52.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC--C----CCCCC-------------cccccceeeeCH
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR--S----MDPFV-------------DYMSGMGYLVSW  226 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r--~----~~~yp-------------~y~~G~gYvlS~  226 (322)
                      +.+|++..|+|..+..+.|...+...   +.-.++.|......  .    ....|             ....|++.++.+
T Consensus        80 ~gd~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rr  159 (201)
T cd04195          80 TYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIGKRRLPTSHDDILKFARRRSPFNHPTVMFRK  159 (201)
T ss_pred             CCCEEEEeCCccccCcHHHHHHHHHHHhCCCeEEEcccEEEECCCCCeeccccCCCCHHHHHHHhccCCCCCChHHhhhH
Confidence            56999999999999988777766553   33345555432100  0    00000             112355667777


Q ss_pred             HHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          227 DIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       227 d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ++.+.+....    .....||..+..-+...|.
T Consensus       160 ~~~~~~g~~~----~~~~~eD~~~~~r~~~~g~  188 (201)
T cd04195         160 SKVLAVGGYQ----DLPLVEDYALWARMLANGA  188 (201)
T ss_pred             HHHHHcCCcC----CCCCchHHHHHHHHHHcCC
Confidence            7766653322    2267999999888877654


No 30 
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=86.20  E-value=19  Score=30.74  Aligned_cols=87  Identities=14%  Similarity=0.027  Sum_probs=55.6

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee--cCC---------CCCC-----------CCcccccceee
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI--PCR---------SMDP-----------FVDYMSGMGYL  223 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~--p~r---------~~~~-----------yp~y~~G~gYv  223 (322)
                      +.+|++..|+|..+..+.|.+.++..   +...++.|...  ...         ....           ...+..|++.+
T Consensus        79 ~g~~v~~ld~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  158 (214)
T cd04196          79 DGDYVFFCDQDDIWLPDKLERLLKAFLKDDKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVTGCTMA  158 (214)
T ss_pred             CCCEEEEECCCcccChhHHHHHHHHHhcCCCceEEecCcEEECCCCCCcccccccccccCCccCHHHHHHhCccCCceee
Confidence            67999999999988877777776642   22234444321  000         0000           12456789999


Q ss_pred             eCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcC
Q 020721          224 VSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGR  258 (322)
Q Consensus       224 lS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~  258 (322)
                      +.+++++.+.....   .....||.++...+...|
T Consensus       159 ~r~~~~~~~~~~~~---~~~~~~D~~~~~~~~~~~  190 (214)
T cd04196         159 FNRELLELALPFPD---ADVIMHDWWLALLASAFG  190 (214)
T ss_pred             EEHHHHHhhccccc---cccccchHHHHHHHHHcC
Confidence            99999999865432   115789998887776654


No 31 
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=85.62  E-value=8.3  Score=35.28  Aligned_cols=118  Identities=12%  Similarity=0.076  Sum_probs=65.5

Q ss_pred             hCCCeEEeccCCcCCchHHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC---CCeEEEEee-
Q 020721          131 RYDDIIILNCKENMNKGKTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR---EDLYYGYVI-  205 (322)
Q Consensus       131 ~~~DIv~ld~~D~y~n~Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~---~~~y~G~~~-  205 (322)
                      .+..|..+...++..-.+.+ .+++++.+.         .++|++-.|||+.+..+.|.+.+.....   .-...|... 
T Consensus        43 ~~~~i~~i~~~~N~G~a~a~N~Gi~~a~~~---------~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~  113 (281)
T TIGR01556        43 RGQKIALIHLGDNQGIAGAQNQGLDASFRR---------GVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQACALGPRFF  113 (281)
T ss_pred             cCCCeEEEECCCCcchHHHHHHHHHHHHHC---------CCCEEEEECCCCCCCHHHHHHHHHHHHhcCCceEEECCeEE
Confidence            34566666666655422322 245555431         5699999999999987766665543211   111222110 


Q ss_pred             --------c-------------CCCCC--CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          206 --------P-------------CRSMD--PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       206 --------p-------------~r~~~--~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                              +             .....  .-..++.++|.++++++.+.+.--.  ..-++..||+.+..=+++.|-
T Consensus       114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sg~li~~~~~~~iG~fd--e~~fi~~~D~e~~~R~~~~G~  188 (281)
T TIGR01556       114 DRGTSRRLPAIHLDGLLLRQISLDGLTTPQKTSFLISSGCLITREVYQRLGMMD--EELFIDHVDTEWSLRAQNYGI  188 (281)
T ss_pred             cCCCcccCCceeecccceeeecccccCCceeccEEEcCcceeeHHHHHHhCCcc--HhhcccchHHHHHHHHHHCCC
Confidence                    0             00000  0012455667789999999885422  122456799988766666653


No 32 
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=85.56  E-value=24  Score=30.85  Aligned_cols=88  Identities=13%  Similarity=-0.026  Sum_probs=53.3

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeec--C---------------CC---CCCC-----Ccccccce
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIP--C---------------RS---MDPF-----VDYMSGMG  221 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p--~---------------r~---~~~y-----p~y~~G~g  221 (322)
                      +.+|++..|||..+....|.+.++...  ......|....  .               ..   ....     -.++.|++
T Consensus        81 ~~d~v~~lD~D~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  160 (249)
T cd02525          81 RGDIIIRVDAHAVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHH  160 (249)
T ss_pred             CCCEEEEECCCccCCHHHHHHHHHHHhcCCCCEEecceecCCCChHHHHHHHHhhchhccCCcccccccccccccccccc
Confidence            569999999999998776766664432  22333333210  0               00   0000     11457778


Q ss_pred             eeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          222 YLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       222 YvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .++++++.+.+.-..   ......||..++.-+.+.|.
T Consensus       161 ~~~~~~~~~~~g~~~---~~~~~~eD~~l~~r~~~~G~  195 (249)
T cd02525         161 GAYRREVFEKVGGFD---ESLVRNEDAELNYRLRKAGY  195 (249)
T ss_pred             ceEEHHHHHHhCCCC---cccCccchhHHHHHHHHcCc
Confidence            889999988875322   12345799999877766654


No 33 
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to  Agrobacterium tumefaciens CelA and  Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=85.50  E-value=24  Score=30.69  Aligned_cols=87  Identities=14%  Similarity=-0.031  Sum_probs=57.2

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCC-C--CeEEEEee--cC------C----------------C-CCCCCcccccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPR-E--DLYYGYVI--PC------R----------------S-MDPFVDYMSGM  220 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~--~~y~G~~~--p~------r----------------~-~~~yp~y~~G~  220 (322)
                      +.+|++..|+|.++..+.|.+.+..... .  .+..|...  ..      .                . ......++.|+
T Consensus        84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~  163 (234)
T cd06421          84 TGDFVAILDADHVPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLADGAPNEQELFYGVIQPGRDRWGAAFCCGS  163 (234)
T ss_pred             CCCEEEEEccccCcCccHHHHHHHHHhcCCCeEEEecceEEecCCcchhHHHHHHHHHHHHHHHHHHHHhhcCCceecCc
Confidence            5699999999999998887777765432 2  22223211  00      0                0 00113567899


Q ss_pred             eeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          221 GYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       221 gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      +.++++++.+.+....    .....||..++.-+.+.|.
T Consensus       164 ~~~~r~~~~~~ig~~~----~~~~~eD~~l~~r~~~~g~  198 (234)
T cd06421         164 GAVVRREALDEIGGFP----TDSVTEDLATSLRLHAKGW  198 (234)
T ss_pred             eeeEeHHHHHHhCCCC----ccceeccHHHHHHHHHcCc
Confidence            9999999999875432    1234799999988877764


No 34 
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=84.99  E-value=35  Score=33.83  Aligned_cols=168  Identities=13%  Similarity=0.057  Sum_probs=93.1

Q ss_pred             CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHH
Q 020721           73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTY  152 (322)
Q Consensus        73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~  152 (322)
                      .+.+-|+|++.-+...-++.|+.- .++..  .+++++.+-..++++ ..+.+++..+++..+......++-  +|..+ 
T Consensus        74 ~p~vsViIP~yNE~~~i~~~l~sl-l~q~y--p~~eIivVdDgs~D~-t~~~~~~~~~~~~~v~vv~~~~n~--Gka~A-  146 (444)
T PRK14583         74 HPLVSILVPCFNEGLNARETIHAA-LAQTY--TNIEVIAINDGSSDD-TAQVLDALLAEDPRLRVIHLAHNQ--GKAIA-  146 (444)
T ss_pred             CCcEEEEEEeCCCHHHHHHHHHHH-HcCCC--CCeEEEEEECCCCcc-HHHHHHHHHHhCCCEEEEEeCCCC--CHHHH-
Confidence            467888888886554444444432 22221  245555554444433 333445555666665554433322  45432 


Q ss_pred             HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeec-CCC-------------------
Q 020721          153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIP-CRS-------------------  209 (322)
Q Consensus       153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p-~r~-------------------  209 (322)
                      +......        .+.+|++-.|+|+.+..+.+.+.++..   |+-...-|.... .+.                   
T Consensus       147 lN~gl~~--------a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g~~~~~~~~~~~~~~~~~e~~~~~~~~~  218 (444)
T PRK14583        147 LRMGAAA--------ARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTGNPRIRTRSTLIGRVQVGEFSSIIGLIK  218 (444)
T ss_pred             HHHHHHh--------CCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEccceecCCCcchhhHHHHHHHHHHHHHH
Confidence            2222221        267999999999999998877766532   332333332211 000                   


Q ss_pred             --CCCC-Cc-ccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          210 --MDPF-VD-YMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       210 --~~~y-p~-y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                        ...+ +. .++|.+..+.+++++.+.-..    +..-.||..+|.-+...|-
T Consensus       219 ~~~~~~g~~~~~sG~~~~~rr~al~~vGg~~----~~~i~ED~dl~~rl~~~G~  268 (444)
T PRK14583        219 RTQRVYGQVFTVSGVVAAFRRRALADVGYWS----PDMITEDIDISWKLQLKHW  268 (444)
T ss_pred             HHHHHhCCceEecCceeEEEHHHHHHcCCCC----CCcccccHHHHHHHHHcCC
Confidence              0111 12 357888999999988875332    1224699999999988764


No 35 
>PF00535 Glycos_transf_2:  Glycosyl transferase family 2;  InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=83.09  E-value=3.8  Score=33.08  Aligned_cols=114  Identities=19%  Similarity=0.115  Sum_probs=59.0

Q ss_pred             ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeH
Q 020721          106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRL  184 (322)
Q Consensus       106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~  184 (322)
                      .++++++-..++++ ....+++-.+....+..+...++..-.+.. .+++.+            ..+|++..|||.++..
T Consensus        27 ~~eiivvdd~s~d~-~~~~~~~~~~~~~~i~~i~~~~n~g~~~~~n~~~~~a------------~~~~i~~ld~D~~~~~   93 (169)
T PF00535_consen   27 DFEIIVVDDGSTDE-TEEILEEYAESDPNIRYIRNPENLGFSAARNRGIKHA------------KGEYILFLDDDDIISP   93 (169)
T ss_dssp             EEEEEEEECS-SSS-HHHHHHHHHCCSTTEEEEEHCCCSHHHHHHHHHHHH--------------SSEEEEEETTEEE-T
T ss_pred             CEEEEEeccccccc-ccccccccccccccccccccccccccccccccccccc------------ceeEEEEeCCCceEcH
Confidence            45555555444333 333333333335556666666665322221 122222            3479999999999998


Q ss_pred             HHHHHHhcCC--CCCCeEEEEeec-----------C------C--------CCCCCCcccccceeeeCHHHHHHH
Q 020721          185 ENLVKSLVPL--PREDLYYGYVIP-----------C------R--------SMDPFVDYMSGMGYLVSWDIAEWI  232 (322)
Q Consensus       185 ~~L~~~L~~~--~~~~~y~G~~~p-----------~------r--------~~~~yp~y~~G~gYvlS~d~~~~i  232 (322)
                      +.|...++..  ++....+|....           .      .        ....-..++.|++-++++++.+.+
T Consensus        94 ~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rr~~~~~~  168 (169)
T PF00535_consen   94 DWLEELVEALEKNPPDVVIGSVIYIDDDNRYPDRRLRFSFWNRFERKIFNNIRFWKISFFIGSCALFRRSVFEEI  168 (169)
T ss_dssp             THHHHHHHHHHHCTTEEEEEEEEEEECTTETEECCCTSEEEECCHCHHHHTTHSTTSSEESSSCEEEEEHHHHHC
T ss_pred             HHHHHHHHHHHhCCCcEEEEEEEEecCCccccccccchhhhhhhhhHHHHhhhcCCcccccccEEEEEHHHHHhh
Confidence            7666555443  233444444320           0      0        001123567888889998888764


No 36 
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=80.69  E-value=32  Score=28.62  Aligned_cols=116  Identities=13%  Similarity=0.004  Sum_probs=64.0

Q ss_pred             ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721          106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE  185 (322)
Q Consensus       106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~  185 (322)
                      ...++.+-..++++ ....+....+++..+..+...++..  |..+. ....+.        ...+|++-.|+|..+..+
T Consensus        28 ~~eiivvd~~s~d~-~~~~~~~~~~~~~~~~~~~~~~n~G--~~~a~-n~g~~~--------a~gd~i~~lD~D~~~~~~   95 (185)
T cd04179          28 DYEIIVVDDGSTDG-TAEIARELAARVPRVRVIRLSRNFG--KGAAV-RAGFKA--------ARGDIVVTMDADLQHPPE   95 (185)
T ss_pred             CEEEEEEcCCCCCC-hHHHHHHHHHhCCCeEEEEccCCCC--ccHHH-HHHHHH--------hcCCEEEEEeCCCCCCHH
Confidence            34445554444333 3334444455666665566555543  22111 111111        134999999999999988


Q ss_pred             HHHHHhcC--CCCCCeEEEEeecCCC----------------------CCCCCcccccceeeeCHHHHHHHH
Q 020721          186 NLVKSLVP--LPREDLYYGYVIPCRS----------------------MDPFVDYMSGMGYLVSWDIAEWIR  233 (322)
Q Consensus       186 ~L~~~L~~--~~~~~~y~G~~~p~r~----------------------~~~yp~y~~G~gYvlS~d~~~~i~  233 (322)
                      .|.+.+..  .+..++..|.......                      ...-...+.|+.+++++++++.+-
T Consensus        96 ~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~i~  167 (185)
T cd04179          96 DIPKLLEKLLEGGADVVIGSRFVRGGGAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFRLFRREVLEALL  167 (185)
T ss_pred             HHHHHHHHHhccCCcEEEEEeecCCCcccchHHHHHHHHHHHHHHHHHcCCCCcCCCCceeeeHHHHHHHHH
Confidence            88877764  2345666676432110                      011122355667788888888885


No 37 
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=80.40  E-value=58  Score=31.51  Aligned_cols=170  Identities=14%  Similarity=0.026  Sum_probs=102.9

Q ss_pred             CeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721           73 DIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT  151 (322)
Q Consensus        73 ~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~  151 (322)
                      .+.+-++|++--++. .-.+.++..-...-   ...++..+...++++. .+.+++-.+++++.+.....+.-+.+|--+
T Consensus        53 ~p~vsviiP~ynE~~~~~~~~l~s~~~~dy---p~~evivv~d~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~gK~~a  128 (439)
T COG1215          53 LPKVSVIIPAYNEEPEVLEETLESLLSQDY---PRYEVIVVDDGSTDET-YEILEELGAEYGPNFRVIYPEKKNGGKAGA  128 (439)
T ss_pred             CCceEEEEecCCCchhhHHHHHHHHHhCCC---CCceEEEECCCCChhH-HHHHHHHHhhcCcceEEEeccccCccchHH
Confidence            477888899887666 45556665554432   1255666665454443 335555566665334433221222244332


Q ss_pred             HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCe--EEEEee---------c---C----------
Q 020721          152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDL--YYGYVI---------P---C----------  207 (322)
Q Consensus       152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~--y~G~~~---------p---~----------  207 (322)
                       +.+....        .+.++|+..|.|+.+..+.|.+.+......+.  ..|...         .   .          
T Consensus       129 -l~~~l~~--------~~~d~V~~~DaD~~~~~d~l~~~~~~f~~~~~~~v~~~~~~~~~~~~~~~l~~~~~~~~~~~~~  199 (439)
T COG1215         129 -LNNGLKR--------AKGDVVVILDADTVPEPDALRELVSPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFY  199 (439)
T ss_pred             -HHHHHhh--------cCCCEEEEEcCCCCCChhHHHHHHhhhcCCCeeEEeCCceeeecCChhhhcchhcchhhhhhHH
Confidence             2333322        14799999999999999999998877533211  222210         0   0          


Q ss_pred             -----CCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          208 -----RSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       208 -----r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                           ........+++|++.++.+++++.+...    ....-.||..+|..+...|-
T Consensus       200 ~~~~~~~~~g~~~~~~G~~~~~rr~aL~~~g~~----~~~~i~ED~~lt~~l~~~G~  252 (439)
T COG1215         200 FRLRAASKGGLISFLSGSSSAFRRSALEEVGGW----LEDTITEDADLTLRLHLRGY  252 (439)
T ss_pred             HhhhhhhhcCCeEEEcceeeeEEHHHHHHhCCC----CCCceeccHHHHHHHHHCCC
Confidence                 0112246789999999999999998732    23446799999999986654


No 38 
>PF13632 Glyco_trans_2_3:  Glycosyl transferase family group 2
Probab=79.40  E-value=5.3  Score=34.16  Aligned_cols=85  Identities=25%  Similarity=0.235  Sum_probs=55.4

Q ss_pred             EEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC--C----------------------CCCCCCcccccceeeeC
Q 020721          172 YVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC--R----------------------SMDPFVDYMSGMGYLVS  225 (322)
Q Consensus       172 yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~--r----------------------~~~~yp~y~~G~gYvlS  225 (322)
                      ||+-.|+|+-+..+-|.+.+...  |.-.+.-|...+.  .                      .....+.++.|++.+++
T Consensus         1 ~v~~~DaDt~~~~d~l~~~~~~~~~~~~~~vq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~r   80 (193)
T PF13632_consen    1 YVLFLDADTRLPPDFLERLVAALEDPKVDAVQGPIIFRNRGSLLTRLQDFEYAISHGLSRLSQSSLGRPLFLSGSGMLFR   80 (193)
T ss_pred             CEEEEcCCCCCChHHHHHHHHHHhCCCceEEEccEEecCCCChhheeehhhhhhhhhhhHHHHHhcCCCccccCcceeee
Confidence            67889999999988777665443  3222222222110  0                      01123567899999999


Q ss_pred             HHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          226 WDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       226 ~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      +++++.+.-..   ......||..+|.=+.+.|.
T Consensus        81 ~~~l~~vg~~~---~~~~~~ED~~l~~~l~~~G~  111 (193)
T PF13632_consen   81 REALREVGGFD---DPFSIGEDMDLGFRLRRAGY  111 (193)
T ss_pred             HHHHHHhCccc---ccccccchHHHHHHHHHCCC
Confidence            99999885322   13456699999999888773


No 39 
>PF05679 CHGN:  Chondroitin N-acetylgalactosaminyltransferase;  InterPro: IPR008428 This family represents Chondroitin N-acetylgalactosaminyltransferase. Proteins have a type II transmembrane topology. The enzyme is involved in the biosynthetic initiation and elongation of chondroitin sulphate and is the key enzyme responsible for the selective chain assembly of chondroitin/dermatan sulphate on the linkage region tetrasaccharide common to various proteoglycans containing chondroitin/dermatan sulphate or heparin/heparan sulphate chains. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0032580 Golgi cisterna membrane
Probab=79.03  E-value=3.4  Score=42.00  Aligned_cols=67  Identities=18%  Similarity=0.167  Sum_probs=41.0

Q ss_pred             CCCCCchhHHHHHHHHh-cCCCCccccCccceeecCC-C-----CCcccccccCCEEEEEccCCHHHHHHHHHHcc
Q 020721          240 NHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPV-P-----RTACTHELWADTIAVHLLKNQEKWIETLTYFN  308 (322)
Q Consensus       240 ~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~-~-----~~~c~~~~~~~~ia~H~~~~~~~~~~~~~~fn  308 (322)
                      .-....||+.+|+|+.+ .|+.-..-. ...+|+... .     ...-..+ ..++|++|.++.++.+.++.+||-
T Consensus         8 ~~~s~~~Dv~lGRCI~~~~gi~Ct~~~-q~l~y~~~~~~~~~~~~~~~~~~-~~~AiTlHPvk~p~~mYrLH~y~~   81 (499)
T PF05679_consen    8 NIYSNHEDVELGRCIKKFTGISCTWSY-QGLFYHNYELNKNDFIGDLKNKE-FHNAITLHPVKSPAYMYRLHRYFL   81 (499)
T ss_pred             hcCCCCchhHHHHHHHHhcCCCeeecc-cceEEEeeccCCCcccccccchh-hhcceeeccCCCHHHHHHHHHHHH
Confidence            34567899999999987 577532211 234443221 1     0111112 367899999999987777766663


No 40 
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of  bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the  bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=76.44  E-value=44  Score=27.89  Aligned_cols=119  Identities=13%  Similarity=0.034  Sum_probs=66.5

Q ss_pred             ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721          106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE  185 (322)
Q Consensus       106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~  185 (322)
                      .+.++.+-..++++... .++...+++..+..+....+.  ++.-+. ......        -..+|++..|+|.....+
T Consensus        29 ~~eiivvdd~s~d~t~~-~~~~~~~~~~~i~~i~~~~n~--G~~~a~-n~g~~~--------a~~d~i~~~D~D~~~~~~   96 (181)
T cd04187          29 DYEIIFVDDGSTDRTLE-ILRELAARDPRVKVIRLSRNF--GQQAAL-LAGLDH--------ARGDAVITMDADLQDPPE   96 (181)
T ss_pred             CeEEEEEeCCCCccHHH-HHHHHHhhCCCEEEEEecCCC--CcHHHH-HHHHHh--------cCCCEEEEEeCCCCCCHH
Confidence            45566666555544322 333334455556665554443  333221 111111        145999999999999877


Q ss_pred             HHHHHhcCC-CCCCeEEEEeecCC------------------CCCCCCcccccceeeeCHHHHHHHHHcC
Q 020721          186 NLVKSLVPL-PREDLYYGYVIPCR------------------SMDPFVDYMSGMGYLVSWDIAEWIRDSD  236 (322)
Q Consensus       186 ~L~~~L~~~-~~~~~y~G~~~p~r------------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~  236 (322)
                      .|.+.++.. +..++..|......                  .....-+...|++.++++++++.+....
T Consensus        97 ~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~i~~~d  166 (181)
T cd04187          97 LIPEMLAKWEEGYDVVYGVRKNRKESWLKRLTSKLFYRLINKLSGVDIPDNGGDFRLMDRKVVDALLLLP  166 (181)
T ss_pred             HHHHHHHHHhCCCcEEEEEecCCcchHHHHHHHHHHHHHHHHHcCCCCCCCCCCEEEEcHHHHHHHHhcC
Confidence            666666543 23456666543110                  0111124567788899999999988654


No 41 
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=76.34  E-value=62  Score=29.61  Aligned_cols=173  Identities=14%  Similarity=0.077  Sum_probs=93.6

Q ss_pred             EEEEECCCCCHH-HHHHHHHHhcc---CCCCCCceEEEEEecCCCchhHHHH-------HHHHHhhCCCeEEeccCCcCC
Q 020721           77 LIGILTLPDQYH-RRHFLRMIYGT---QSPTGAQVDVKFVFCNLTKEDQKVL-------VALEIMRYDDIIILNCKENMN  145 (322)
Q Consensus        77 li~V~S~~~~~~-rR~aIR~TW~~---~~~~~~~v~~~Fvlg~~~~~~~~~~-------l~~E~~~~~DIv~ld~~D~y~  145 (322)
                      -|+|++.-...+ -.+.++.++..   +.. ...+.+ |++...++++....       +.+|....-.+....-.++ .
T Consensus         2 SIliP~~ne~~~~l~~~l~~~~~~~~~~~~-~~~~eI-~vldD~~d~~~~~~~~~~~~~l~~~~~~~~~v~~~~r~~~-~   78 (254)
T cd04191           2 AIVMPVYNEDPARVFAGLRAMYESLAKTGL-ADHFDF-FILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRRRREN-T   78 (254)
T ss_pred             EEEEeCCCCCHHHHHHHHHHHHHHHHhcCC-cCceEE-EEECCCCChHHHHHHHHHHHHHHHHhCCCCcEEEEEcCCC-C
Confidence            367888876664 66778877742   211 124556 88877666543221       2222222223333333333 2


Q ss_pred             chHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee--cCC------------
Q 020721          146 KGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI--PCR------------  208 (322)
Q Consensus       146 n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~--p~r------------  208 (322)
                      +.|+-..-..+.. ..      .+++|++-.|.|+.+..+.|.+.+...   |.=.+.-+...  ..+            
T Consensus        79 g~Kag~l~~~~~~-~~------~~~~~i~~~DaD~~~~p~~l~~~v~~~~~~~~vg~vq~~~~~~n~~~~~~~~~~~~~~  151 (254)
T cd04191          79 GRKAGNIADFCRR-WG------SRYDYMVVLDADSLMSGDTIVRLVRRMEANPRAGIIQTAPKLIGAETLFARLQQFANR  151 (254)
T ss_pred             CccHHHHHHHHHH-hC------CCCCEEEEEeCCCCCCHHHHHHHHHHHHhCCCEEEEeCCceeECCCCHHHHHHHHHHH
Confidence            3455332222211 00      256999999999999999888877653   22111111100  000            


Q ss_pred             ----------C-CCCCCcccccceeeeCHHHHHHHHHcCCC-----CCCCCCchhHHHHHHHHhcCC
Q 020721          209 ----------S-MDPFVDYMSGMGYLVSWDIAEWIRDSDIP-----KNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       209 ----------~-~~~yp~y~~G~gYvlS~d~~~~i~~~~~~-----~~~~~~~EDv~vG~~l~~~~~  259 (322)
                                . +...-.++.|...++.++++..+...+..     -....-.||..+|..+...|-
T Consensus       152 ~~~~~~~~~~~~~~~~~~~~~G~~~~~Rr~al~~~~~~~~i~g~g~~~~~~l~eD~~l~~~~~~~G~  218 (254)
T cd04191         152 LYGPVFGRGLAAWQGGEGNYWGHNAIIRVAAFMEHCALPVLPGRPPFGGHILSHDFVEAALMRRAGW  218 (254)
T ss_pred             HHHHHHHHHHHHhcCCccCccceEEEEEHHHHHHhcCCccccCCCCCCCCeecHHHHHHHHHHHcCC
Confidence                      0 00112356799999999998876432100     012246899999999988764


No 42 
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl  transferases of Shigella flexneri  add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=75.62  E-value=54  Score=28.53  Aligned_cols=89  Identities=21%  Similarity=0.104  Sum_probs=54.0

Q ss_pred             CceEEEEecCceEeeHHHHHHHh--cC-C--CCCCeEEEEeecCC-----------C----------CCCC--Ccccccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSL--VP-L--PREDLYYGYVIPCR-----------S----------MDPF--VDYMSGM  220 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L--~~-~--~~~~~y~G~~~p~r-----------~----------~~~y--p~y~~G~  220 (322)
                      +++|++-.|||+.+..+.|...+  .. .  ...-..+|......           .          ....  ..++.|+
T Consensus        75 ~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  154 (237)
T cd02526          75 GADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLITS  154 (237)
T ss_pred             CCCEEEEECCCCCcCHhHHHHHHHHHHhhccCCCeEEEeeeEEcCCCCeeccceeccCccceecccccCCceEeeeeecc
Confidence            56999999999999987777774  22 1  11111222211000           0          0001  1245577


Q ss_pred             eeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          221 GYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       221 gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      |.++++++.+.+.-...  ......||+.++.-+.+.|.
T Consensus       155 ~~~~rr~~~~~~ggfd~--~~~~~~eD~d~~~r~~~~G~  191 (237)
T cd02526         155 GSLISLEALEKVGGFDE--DLFIDYVDTEWCLRARSKGY  191 (237)
T ss_pred             ceEEcHHHHHHhCCCCH--HHcCccchHHHHHHHHHcCC
Confidence            88999999998754321  22356799999988887775


No 43 
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=75.46  E-value=89  Score=30.94  Aligned_cols=166  Identities=13%  Similarity=0.063  Sum_probs=85.8

Q ss_pred             eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHH
Q 020721           74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYF  153 (322)
Q Consensus        74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~  153 (322)
                      +.+-|+|++--+...-++.|+.--.+. .....+.++.+-..++++. .+.+++-.+.+..+...-..++  .+|..+ +
T Consensus        49 P~vsVIIP~yNe~~~l~~~l~sl~~q~-yp~~~~eIiVVDd~StD~T-~~il~~~~~~~~~v~v~~~~~~--~Gka~A-l  123 (439)
T TIGR03111        49 PDITIIIPVYNSEDTLFNCIESIYNQT-YPIELIDIILANNQSTDDS-FQVFCRAQNEFPGLSLRYMNSD--QGKAKA-L  123 (439)
T ss_pred             CCEEEEEEeCCChHHHHHHHHHHHhcC-CCCCCeEEEEEECCCChhH-HHHHHHHHHhCCCeEEEEeCCC--CCHHHH-H
Confidence            456777777654444455555543322 1223454544444444332 2222322344444432222222  256553 2


Q ss_pred             HhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC---------------C------
Q 020721          154 SSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR---------------S------  209 (322)
Q Consensus       154 ~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r---------------~------  209 (322)
                      .+..+.        .+.+|++..|+|..+..+.|.+.++..   |.-....|.+.+..               .      
T Consensus       124 N~gl~~--------s~g~~v~~~DaD~~~~~d~L~~l~~~f~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~y  195 (439)
T TIGR03111       124 NAAIYN--------SIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAMTGVILTDKELIEKTKGRFLKLIRRCEYFEY  195 (439)
T ss_pred             HHHHHH--------ccCCEEEEECCCCCcChHHHHHHHHHHHhCCCeEEEEeEEecCchhhhhhcchhhhHhHHhHHHHH
Confidence            222222        256899999999999988887776553   22223344443210               0      


Q ss_pred             ------------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHh
Q 020721          210 ------------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIRE  256 (322)
Q Consensus       210 ------------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~  256 (322)
                                  ...-+..++|++..+.+++++.+.-..    ...-.||..++.=++.
T Consensus       196 ~~~~l~~r~~~s~~~~~~~~sGa~~~~Rr~~l~~vggf~----~~~i~ED~~l~~rl~~  250 (439)
T TIGR03111       196 AQAFLAGRNFESQVNSLFTLSGAFSAFRRETILKTQLYN----SETVGEDTDMTFQIRE  250 (439)
T ss_pred             HHHHHhhhHHHHhcCCeEEEccHHHhhhHHHHHHhCCCC----CCCcCccHHHHHHHHH
Confidence                        000112357888889999887764322    1223899999876654


No 44 
>cd06423 CESA_like CESA_like is  the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in  plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=72.75  E-value=46  Score=26.44  Aligned_cols=66  Identities=20%  Similarity=0.088  Sum_probs=39.3

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC----------------------C---CCCCCcccccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR----------------------S---MDPFVDYMSGM  220 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r----------------------~---~~~yp~y~~G~  220 (322)
                      +.+|++..|+|..+....|.+.+...   +.-....|......                      .   .......+.|.
T Consensus        78 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~  157 (180)
T cd06423          78 KGDIVVVLDADTILEPDALKRLVVPFFADPKVGAVQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQSALGGVLVLSGA  157 (180)
T ss_pred             CCCEEEEECCCCCcChHHHHHHHHHhccCCCeeeEeeeEEEecCcCcceeccchheecceeeeeeehhheecceeecCch
Confidence            67999999999999877666663332   22223333321100                      0   01123456788


Q ss_pred             eeeeCHHHHHHHHH
Q 020721          221 GYLVSWDIAEWIRD  234 (322)
Q Consensus       221 gYvlS~d~~~~i~~  234 (322)
                      ++++++++++.+..
T Consensus       158 ~~~~~~~~~~~~gg  171 (180)
T cd06423         158 FGAFRREALREVGG  171 (180)
T ss_pred             HHHHHHHHHHHhCC
Confidence            88888888877653


No 45 
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=65.96  E-value=1.6e+02  Score=30.04  Aligned_cols=185  Identities=14%  Similarity=0.021  Sum_probs=94.8

Q ss_pred             CCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721           72 EDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT  151 (322)
Q Consensus        72 ~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~  151 (322)
                      +.+.+-|+|+..-+..--.+.|+..-.+...  .+.+++++.. ..+++....+++=+.+|..+...-...+=.+.|..+
T Consensus        64 ~~p~vaIlIPA~NE~~vI~~~l~s~L~~ldY--~~~eIiVv~d-~ndd~T~~~v~~l~~~~p~v~~vv~~~~gp~~Ka~a  140 (504)
T PRK14716         64 PEKRIAIFVPAWREADVIGRMLEHNLATLDY--ENYRIFVGTY-PNDPATLREVDRLAARYPRVHLVIVPHDGPTSKADC  140 (504)
T ss_pred             CCCceEEEEeccCchhHHHHHHHHHHHcCCC--CCeEEEEEEC-CCChhHHHHHHHHHHHCCCeEEEEeCCCCCCCHHHH
Confidence            3566778888775544344455544222221  2444444432 322222222322245566653322222222356653


Q ss_pred             HHHhhhhHh-cCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecC-C---------------------
Q 020721          152 YFSSLPEIF-NTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPC-R---------------------  208 (322)
Q Consensus       152 ~~~w~~~~~-~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~-r---------------------  208 (322)
                       +.++.+.. ..+...-.++++++-.|-|..+..+.|...-...+..++.-..+.+. +                     
T Consensus       141 -LN~~l~~~~~~e~~~G~~~d~vvi~DAD~~v~Pd~Lr~~~~~~~~~~~VQ~pv~~~~~~~~~~~ag~y~~ef~~~~~~~  219 (504)
T PRK14716        141 -LNWIYQAIFAFERERGIRFAIIVLHDAEDVIHPLELRLYNYLLPRHDFVQLPVFSLPRDWGEWVAGTYMDEFAESHLKD  219 (504)
T ss_pred             -HHHHHHHHHHhhhhcCCCcCEEEEEcCCCCcCccHHHHHHhhcCCCCEEecceeccCCchhHHHHHHHHHHHHHHHHHH
Confidence             33332211 00000112579999999999999998876533333322111111000 0                     


Q ss_pred             ----CCCCCCcccccceeeeCHHHHHHHHHcCC--CCCCCCCchhHHHHHHHHhcCCC
Q 020721          209 ----SMDPFVDYMSGMGYLVSWDIAEWIRDSDI--PKNHLEGPEDKVFGAWIREGRRA  260 (322)
Q Consensus       209 ----~~~~yp~y~~G~gYvlS~d~~~~i~~~~~--~~~~~~~~EDv~vG~~l~~~~~~  260 (322)
                          ..-..+-.+.|.|+.+++++++.+.....  .-....-.||..+|.-+...|..
T Consensus       220 l~~r~~LG~~~~~~Gtg~afRR~aLe~l~~~~GG~~fd~~sLTED~dLglRL~~~G~r  277 (504)
T PRK14716        220 LPVREALGGLIPSAGVGTAFSRRALERLAAERGGQPFDSDSLTEDYDIGLRLKRAGFR  277 (504)
T ss_pred             HHHHHhcCCccccCCeeEEeEHHHHHHHHhhcCCCCCCCCCcchHHHHHHHHHHCCCE
Confidence                00001224679999999999999864221  01234578999999999888763


No 46 
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose.  A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=65.00  E-value=83  Score=26.40  Aligned_cols=63  Identities=14%  Similarity=0.111  Sum_probs=40.1

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEeec---CCC----------------------CCCCCccccccee
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVIP---CRS----------------------MDPFVDYMSGMGY  222 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~p---~r~----------------------~~~yp~y~~G~gY  222 (322)
                      +.+|++-.|.|+.+..+.|...+.... ..++..|+...   ..+                      .-..+.++.|.++
T Consensus        81 ~~d~v~~~DaD~~~~p~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~  160 (183)
T cd06438          81 DPDAVVVFDADNLVDPNALEELNARFAAGARVVQAYYNSKNPDDSWITRLYAFAFLVFNRLRPLGRSNLGLSCQLGGTGM  160 (183)
T ss_pred             CCCEEEEEcCCCCCChhHHHHHHHHHhhCCCeeEEEEeeeCCccCHHHHHHHHHHHHHHHHHHHHHHHcCCCeeecCchh
Confidence            579999999999999877777665542 23455565421   100                      0011235677778


Q ss_pred             eeCHHHHHH
Q 020721          223 LVSWDIAEW  231 (322)
Q Consensus       223 vlS~d~~~~  231 (322)
                      ++++++++.
T Consensus       161 ~~rr~~l~~  169 (183)
T cd06438         161 CFPWAVLRQ  169 (183)
T ss_pred             hhHHHHHHh
Confidence            888888777


No 47 
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, 
Probab=63.90  E-value=94  Score=26.63  Aligned_cols=65  Identities=14%  Similarity=-0.052  Sum_probs=43.0

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcC--CCCCCeEEEEee-cCCC----------------------CCCCCcccccceee
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVP--LPREDLYYGYVI-PCRS----------------------MDPFVDYMSGMGYL  223 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~--~~~~~~y~G~~~-p~r~----------------------~~~yp~y~~G~gYv  223 (322)
                      ..+|++..|+|..+..+.|...++.  .+..++..|... ....                      ...-...+.|++.+
T Consensus        78 ~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  157 (224)
T cd06442          78 RGDVIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKRKLISRGANLLARLLLGRKVSDPTSGFRA  157 (224)
T ss_pred             CCCEEEEEECCCCCCHHHHHHHHHHHhcCCCCEEEEeeeecCCccCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCCccch
Confidence            3499999999999998888777765  234455555432 1000                      01112346777888


Q ss_pred             eCHHHHHHHH
Q 020721          224 VSWDIAEWIR  233 (322)
Q Consensus       224 lS~d~~~~i~  233 (322)
                      +++++++.+.
T Consensus       158 ~~r~~~~~ig  167 (224)
T cd06442         158 YRREVLEKLI  167 (224)
T ss_pred             hhHHHHHHHh
Confidence            9999999987


No 48 
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=61.73  E-value=2.3e+02  Score=30.38  Aligned_cols=184  Identities=13%  Similarity=0.043  Sum_probs=99.9

Q ss_pred             CCCCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHH
Q 020721           70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKT  149 (322)
Q Consensus        70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt  149 (322)
                      ..++..+-|+|+-.-+...-.+.|...-....-.  +++++++.. ++++.....+++-.++|.++..+.....-+++|.
T Consensus        59 ~~~~~~vsIlVPa~nE~~vi~~~i~~ll~~ldYP--~~eI~vi~~-~nD~~T~~~~~~l~~~~p~~~~v~~~~~g~~gKa  135 (727)
T PRK11234         59 KPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTY-PNDPATQADVDAVCARFPNVHKVVCARPGPTSKA  135 (727)
T ss_pred             cCCCCCEEEEEecCcchhhHHHHHHHHHHhCCCC--CeEEEEEec-CCChhHHHHHHHHHHHCCCcEEEEeCCCCCCCHH
Confidence            3455778888988755554444555432222211  354555543 3333333345555567777643333332245776


Q ss_pred             HHHHHhhhhHh-cCCCCCCC--CceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecC-C----------------
Q 020721          150 YTYFSSLPEIF-NTSDRPYP--PYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPC-R----------------  208 (322)
Q Consensus       150 ~~~~~w~~~~~-~~~~~~c~--~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~-r----------------  208 (322)
                      .+ +.++.... ..+  .++  .++.++-.|-|+-+..+.|. .+..+ ++.+..-+...|. +                
T Consensus       136 ~a-LN~~l~~~~~~e--~~~~~~~~vvvi~DAD~~v~pd~L~-~~~~l~~~~~~VQ~p~~p~~~~~~~~~~~~~~~EFa~  211 (727)
T PRK11234        136 DC-LNNVLDAITQFE--RSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTSGTYIDEFAE  211 (727)
T ss_pred             HH-HHHHHHHHHhhh--cccCCcccEEEEEcCCCCCChhHHH-HHHhhcCCCCeEeecccCCCccHHHHHHHHHHHHHHH
Confidence            54 33333321 100  112  34567779999999999997 22222 1112211111110 0                


Q ss_pred             ---------CCCCCCcccccceeeeCHHHHHHHHHcCC--CCCCCCCchhHHHHHHHHhcCCC
Q 020721          209 ---------SMDPFVDYMSGMGYLVSWDIAEWIRDSDI--PKNHLEGPEDKVFGAWIREGRRA  260 (322)
Q Consensus       209 ---------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~--~~~~~~~~EDv~vG~~l~~~~~~  260 (322)
                               ..-..+-.++|.|..+||.+++.+.+...  .-....--||.-+|.-+...|..
T Consensus       212 ~~~~~~~~~~~lgg~~~l~G~~~af~Rr~l~al~~~ggg~~~~~~~lTED~dlg~rL~~~G~~  274 (727)
T PRK11234        212 LHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLEDGDGIAFDVQSLTEDYDIGFRLKEKGMR  274 (727)
T ss_pred             HhhhhhHHHHHcCCCcccCCceEEEecccHHHHHHhcCCCCcCCCcchHHHHHHHHHHHCCCE
Confidence                     00012345889999999988776666430  12455678999999999988864


No 49 
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=61.33  E-value=99  Score=26.00  Aligned_cols=88  Identities=18%  Similarity=0.181  Sum_probs=52.2

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee----------cCC-C-C--CCCCcc-cccceeeeCHHHHH
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI----------PCR-S-M--DPFVDY-MSGMGYLVSWDIAE  230 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~----------p~r-~-~--~~yp~y-~~G~gYvlS~d~~~  230 (322)
                      ..+|++..|+|..+..+.|...++..   +.-++.+|...          +.. + .  ..+-.. +.|.+-++++++++
T Consensus        83 ~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~  162 (202)
T cd04184          83 TGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSDEDKIDEGGKRSEPFFKPDWSPDLLLSQNYIGHLLVYRRSLVR  162 (202)
T ss_pred             cCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEEEccHHhccCCCCEeccccCCCCCHHHhhhcCCccceEeEEHHHHH
Confidence            56999999999999887777766553   33344443221          000 0 0  000111 23455578888888


Q ss_pred             HHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          231 WIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       231 ~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .+.--.   ......||..++.-+.+.|.
T Consensus       163 ~iggf~---~~~~~~eD~~l~~rl~~~g~  188 (202)
T cd04184         163 QVGGFR---EGFEGAQDYDLVLRVSEHTD  188 (202)
T ss_pred             HhCCCC---cCcccchhHHHHHHHHhccc
Confidence            775432   12446799988887766654


No 50 
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=58.19  E-value=2.6e+02  Score=29.79  Aligned_cols=87  Identities=14%  Similarity=0.018  Sum_probs=54.2

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee---cC---C---------------------CCCC-CCccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI---PC---R---------------------SMDP-FVDYM  217 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~---p~---r---------------------~~~~-yp~y~  217 (322)
                      +.+|++-.|.|+.+..+.|.+.+...   |.=.+..+...   |.   +                     ..+. -..++
T Consensus       228 ~gd~Il~lDAD~v~~pd~L~~~v~~f~~dp~v~~Vqtp~~f~~p~~~~~nl~~~~~~~~e~~~f~~~i~~g~~~~~~~~~  307 (713)
T TIGR03030       228 DGELILIFDADHVPTRDFLQRTVGWFVEDPKLFLVQTPHFFVSPDPIERNLGTFRRMPNENELFYGLIQDGNDFWNAAFF  307 (713)
T ss_pred             CCCEEEEECCCCCcChhHHHHHHHHHHhCCCEEEEeCCeeccCCCHHhhhhHHHHHhhhHHHHHHHHHHHHHhhhCCeee
Confidence            56999999999999988777765442   22011111100   00   0                     0000 12467


Q ss_pred             ccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          218 SGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       218 ~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .|++.++.+++.+.+---..    ..-.||..+|.-+.+.|-
T Consensus       308 ~Gs~~~iRR~al~~iGGf~~----~~vtED~~l~~rL~~~G~  345 (713)
T TIGR03030       308 CGSAAVLRREALDEIGGIAG----ETVTEDAETALKLHRRGW  345 (713)
T ss_pred             cCceeEEEHHHHHHcCCCCC----CCcCcHHHHHHHHHHcCC
Confidence            79999999999998753221    123799999999987764


No 51 
>PF05412 Peptidase_C33:  Equine arterivirus Nsp2-type cysteine proteinase;  InterPro: IPR008743 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases corresponds to MEROPS peptidase family C33 (clan CA). The type example is equine arteritis virus Nsp2-type cysteine proteinase, which is involved in viral polyprotein processing [].; GO: 0016032 viral reproduction, 0019082 viral protein processing
Probab=54.17  E-value=10  Score=30.14  Aligned_cols=18  Identities=28%  Similarity=0.523  Sum_probs=15.8

Q ss_pred             CCCCCCceEEEEecCceE
Q 020721          164 DRPYPPYHYVMKGDDDTY  181 (322)
Q Consensus       164 ~~~c~~~~yvlK~DDD~f  181 (322)
                      +..||+++|++|.|++=+
T Consensus        63 ~~~Cp~ArYv~~l~~qHW   80 (108)
T PF05412_consen   63 NGACPHARYVLKLDGQHW   80 (108)
T ss_pred             CCCCCCCEEEEEecCceE
Confidence            467999999999999866


No 52 
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=53.54  E-value=2e+02  Score=27.19  Aligned_cols=146  Identities=10%  Similarity=0.077  Sum_probs=75.3

Q ss_pred             eEEEEEECCCCCHHHHHHHHHHhc--cCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC-eEEeccCCcCCchHHHH
Q 020721           75 RILIGILTLPDQYHRRHFLRMIYG--TQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD-IIILNCKENMNKGKTYT  151 (322)
Q Consensus        75 ~lli~V~S~~~~~~rR~aIR~TW~--~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D-Iv~ld~~D~y~n~Kt~~  151 (322)
                      .+-|+|++--+...-.+.+++.=.  .+  ....+.+++|-..+++.... .+++-++.+++ ++......++  +|.-+
T Consensus         7 ~vSVVIP~yNE~~~i~~~l~~l~~~~~~--~~~~~EIIvVDDgS~D~T~~-il~~~~~~~~~~v~~i~~~~n~--G~~~A   81 (325)
T PRK10714          7 KVSVVIPVYNEQESLPELIRRTTAACES--LGKEYEILLIDDGSSDNSAE-MLVEAAQAPDSHIVAILLNRNY--GQHSA   81 (325)
T ss_pred             eEEEEEcccCchhhHHHHHHHHHHHHHh--CCCCEEEEEEeCCCCCcHHH-HHHHHHhhcCCcEEEEEeCCCC--CHHHH
Confidence            466667765433322333333211  11  12346677776656554332 22222333444 4444444444  34332


Q ss_pred             HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC-CCeEEEEeec-CC------------------CCC
Q 020721          152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR-EDLYYGYVIP-CR------------------SMD  211 (322)
Q Consensus       152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~~~y~G~~~p-~r------------------~~~  211 (322)
                      ...-+ +.        .+.+|++-.|+|.-.+++.+.+.++.... -++..|.... ..                  ...
T Consensus        82 ~~~G~-~~--------A~gd~vv~~DaD~q~~p~~i~~l~~~~~~~~DvV~~~r~~~~~~~~r~~~s~~~~~l~~~~~g~  152 (325)
T PRK10714         82 IMAGF-SH--------VTGDLIITLDADLQNPPEEIPRLVAKADEGYDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGK  152 (325)
T ss_pred             HHHHH-Hh--------CCCCEEEEECCCCCCCHHHHHHHHHHHHhhCCEEEEEEcCCCCcHHHHHHHHHHHHHHHHHcCC
Confidence            21111 11        15699999999999999888888776532 2455454321 00                  122


Q ss_pred             CCCccccccee-eeCHHHHHHHHHcC
Q 020721          212 PFVDYMSGMGY-LVSWDIAEWIRDSD  236 (322)
Q Consensus       212 ~yp~y~~G~gY-vlS~d~~~~i~~~~  236 (322)
                      .++...  +|| ++++++++.+....
T Consensus       153 ~~~d~~--~gfr~~~r~~~~~l~~~~  176 (325)
T PRK10714        153 AMGDYG--CMLRAYRRHIVDAMLHCH  176 (325)
T ss_pred             CCCCCC--cCeEEEcHHHHHHHHHCC
Confidence            233333  344 89999999997543


No 53 
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=53.11  E-value=1.6e+02  Score=31.35  Aligned_cols=181  Identities=15%  Similarity=0.105  Sum_probs=97.1

Q ss_pred             CCCCeeEEEEEECCCCCHH-HHHHHHHHhccCC--CCCCceEEEEEecCCCchhHHH----HHHHHHhhCC---CeEEec
Q 020721           70 SSEDIRILIGILTLPDQYH-RRHFLRMIYGTQS--PTGAQVDVKFVFCNLTKEDQKV----LVALEIMRYD---DIIILN  139 (322)
Q Consensus        70 ~~~~~~lli~V~S~~~~~~-rR~aIR~TW~~~~--~~~~~v~~~Fvlg~~~~~~~~~----~l~~E~~~~~---DIv~ld  139 (322)
                      .....++.|+|++.-+..+ -+..++.++.+-.  ....++++ |++....+++...    .+.+=.++++   .|....
T Consensus       120 ~~~~~~VaVliP~yNEd~~~v~~~L~a~~~Sl~~~~~~~~~e~-~vLdD~~d~~~~~~e~~~~~~L~~~~~~~~~i~yr~  198 (691)
T PRK05454        120 PPPEARTAILMPIYNEDPARVFAGLRAMYESLAATGHGAHFDF-FILSDTRDPDIAAAEEAAWLELRAELGGEGRIFYRR  198 (691)
T ss_pred             CCCCCceEEEEeCCCCChHHHHHHHHHHHHHHHhcCCCCCEEE-EEEECCCChhHHHHHHHHHHHHHHhcCCCCcEEEEE
Confidence            3455788899999976653 3457777775321  11235655 8888776654321    1222223332   344433


Q ss_pred             cCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC--C-----
Q 020721          140 CKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR--S-----  209 (322)
Q Consensus       140 ~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r--~-----  209 (322)
                      -.+|- ++|.-..-.++... +      .+++|++-.|-|+.+..+.|.+.+..+   |+-.+.-+......  +     
T Consensus       199 R~~n~-~~KaGNl~~~~~~~-~------~~~eyivvLDADs~m~~d~L~~lv~~m~~dP~vGlVQt~~~~~n~~slfaR~  270 (691)
T PRK05454        199 RRRNV-GRKAGNIADFCRRW-G------GAYDYMVVLDADSLMSGDTLVRLVRLMEANPRAGLIQTLPVAVGADTLFARL  270 (691)
T ss_pred             CCcCC-CccHHHHHHHHHhc-C------CCcCEEEEEcCCCCCCHHHHHHHHHHHhhCcCEEEEeCCccCcCCCCHHHHH
Confidence            33332 24554433333321 1      267999999999999999888887653   22111111111000  0     


Q ss_pred             ----CCCC--------------CcccccceeeeCHHHHHHHHHcCCC--CC---CCCCchhHHHHHHHHhcCC
Q 020721          210 ----MDPF--------------VDYMSGMGYLVSWDIAEWIRDSDIP--KN---HLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       210 ----~~~y--------------p~y~~G~gYvlS~d~~~~i~~~~~~--~~---~~~~~EDv~vG~~l~~~~~  259 (322)
                          .+.|              -....|..-++.+++......-...  ..   ...-.||...|..++..|-
T Consensus       271 qqf~~~~y~~~~~~G~~~w~~~~g~f~G~naIiR~~af~~~~glp~L~g~~p~~~~~LseD~~~a~~l~~~Gy  343 (691)
T PRK05454        271 QQFATRVYGPLFAAGLAWWQGGEGNYWGHNAIIRVKAFAEHCGLPPLPGRGPFGGHILSHDFVEAALMRRAGW  343 (691)
T ss_pred             HHHHHHHHHHHHHhhhhhhccCccccccceEEEEHHHHHHhcCCccccccCCCCCCcccHHHHHHHHHHHCCC
Confidence                0000              0112356678888877765421100  11   2346789999999998765


No 54 
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by  membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=50.80  E-value=35  Score=30.59  Aligned_cols=91  Identities=18%  Similarity=0.216  Sum_probs=55.3

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC-------------------------CCCCCCcccccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR-------------------------SMDPFVDYMSGM  220 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r-------------------------~~~~yp~y~~G~  220 (322)
                      +.+|++.+|.|+.+..+.|.+.+..+   |.-....|...+..                         +.-.+...+.|+
T Consensus        73 ~~e~i~~~DaD~~~~~~~l~~l~~~~~~~p~vg~v~g~~~~~~~~~~~~~~~q~~ey~~~~~~~~~~~s~~g~~~~~~G~  152 (244)
T cd04190          73 DPEFILLVDADTKFDPDSIVQLYKAMDKDPEIGGVCGEIHPMGKKQGPLVMYQVFEYAISHWLDKAFESVFGFVTCLPGC  152 (244)
T ss_pred             CCCEEEEECCCCcCCHhHHHHHHHHHHhCCCEEEEEeeeEEcCCcchhHHHhHheehhhhhhhcccHHHcCCceEECCCc
Confidence            67999999999999988877666443   33234455543210                         001134456788


Q ss_pred             eeeeCHHHHHHHHHcCCC-------C---------CCCCCchhHHHHHHHHhcCC
Q 020721          221 GYLVSWDIAEWIRDSDIP-------K---------NHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       221 gYvlS~d~~~~i~~~~~~-------~---------~~~~~~EDv~vG~~l~~~~~  259 (322)
                      +.++.+++++.+......       +         ....-.||..++..+...|.
T Consensus       153 ~~~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ED~~l~~~l~~~G~  207 (244)
T cd04190         153 FSMYRIEALKGDNGGKGPLLDYAYLTNTVDSLHKKNNLDLGEDRILCTLLLKAGP  207 (244)
T ss_pred             eEEEEehhhcCCccccccchhhccccCcccchHHHHHHhHhcccceeHHHhccCC
Confidence            888988887765321100       0         01224688888888876654


No 55 
>cd06435 CESA_NdvC_like NdvC_like  proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like  proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=49.09  E-value=1.8e+02  Score=25.24  Aligned_cols=87  Identities=11%  Similarity=-0.022  Sum_probs=54.9

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCCC--CeEEEEe-ecCCC---------------------C-CC-CCccccccee
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPRE--DLYYGYV-IPCRS---------------------M-DP-FVDYMSGMGY  222 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~--~~y~G~~-~p~r~---------------------~-~~-yp~y~~G~gY  222 (322)
                      +++|++..|+|+.+..+.|.+.+......  ++..|.. .....                     . .. --.++.|++.
T Consensus        84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~  163 (236)
T cd06435          84 DAEIIAVIDADYQVEPDWLKRLVPIFDDPRVGFVQAPQDYRDGEESLFKRMCYAEYKGFFDIGMVSRNERNAIIQHGTMC  163 (236)
T ss_pred             CCCEEEEEcCCCCcCHHHHHHHHHHhcCCCeeEEecCccccCCCccHHHHHHhHHHHHHHHHHhccccccCceEEecceE
Confidence            36999999999999998888877654322  2222211 00000                     0 00 0124678888


Q ss_pred             eeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          223 LVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       223 vlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ++++++.+.+.-...    ....||.-++.=+.+.|.
T Consensus       164 ~~rr~~~~~iGgf~~----~~~~eD~dl~~r~~~~G~  196 (236)
T cd06435         164 LIRRSALDDVGGWDE----WCITEDSELGLRMHEAGY  196 (236)
T ss_pred             EEEHHHHHHhCCCCC----ccccchHHHHHHHHHCCc
Confidence            999999998854321    235899999987777664


No 56 
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=48.40  E-value=1.9e+02  Score=25.49  Aligned_cols=168  Identities=12%  Similarity=-0.012  Sum_probs=84.1

Q ss_pred             CeeEEEEEECCCCCHHHHH---HHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC--eEEeccCCcCCch
Q 020721           73 DIRILIGILTLPDQYHRRH---FLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD--IIILNCKENMNKG  147 (322)
Q Consensus        73 ~~~lli~V~S~~~~~~rR~---aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D--Iv~ld~~D~y~n~  147 (322)
                      .+.+-|+|++--....-..   .+++.-...    ..+.+++|-..+++... +.+++-.++|++  +....-..+..  
T Consensus         8 ~~~vsVvIp~yne~~~l~~~l~~l~~~~~~~----~~~eiivvDdgS~D~t~-~i~~~~~~~~~~~~v~~~~~~~n~G--   80 (243)
T PLN02726          8 AMKYSIIVPTYNERLNIALIVYLIFKALQDV----KDFEIIVVDDGSPDGTQ-DVVKQLQKVYGEDRILLRPRPGKLG--   80 (243)
T ss_pred             CceEEEEEccCCchhhHHHHHHHHHHHhccC----CCeEEEEEeCCCCCCHH-HHHHHHHHhcCCCcEEEEecCCCCC--
Confidence            4567777887643222222   233322111    25566666655554432 233333344543  33332223222  


Q ss_pred             HHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC----------CC------
Q 020721          148 KTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC----------RS------  209 (322)
Q Consensus       148 Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~----------r~------  209 (322)
                      |..+ +......        .+.+|++-.|+|..+.++.|.+.+...  +.-++..|.....          +.      
T Consensus        81 ~~~a-~n~g~~~--------a~g~~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~r~~~~~~~  151 (243)
T PLN02726         81 LGTA-YIHGLKH--------ASGDFVVIMDADLSHHPKYLPSFIKKQRETGADIVTGTRYVKGGGVHGWDLRRKLTSRGA  151 (243)
T ss_pred             HHHH-HHHHHHH--------cCCCEEEEEcCCCCCCHHHHHHHHHHHHhcCCcEEEEccccCCCCcCCccHHHHHHHHHH
Confidence            2221 1222211        145999999999999988887776543  2335566643210          00      


Q ss_pred             -------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          210 -------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       210 -------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                             ....-+...|++.++++++++.+.....  . ....+|..+..-+...|.
T Consensus       152 ~~~~~~~~~~~~~d~~g~~~~~rr~~~~~i~~~~~--~-~~~~~~~el~~~~~~~g~  205 (243)
T PLN02726        152 NVLAQTLLWPGVSDLTGSFRLYKRSALEDLVSSVV--S-KGYVFQMEIIVRASRKGY  205 (243)
T ss_pred             HHHHHHHhCCCCCcCCCcccceeHHHHHHHHhhcc--C-CCcEEehHHHHHHHHcCC
Confidence                   0001123567778999999999975321  1 122345656555555553


No 57 
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=45.95  E-value=2.7e+02  Score=26.45  Aligned_cols=174  Identities=13%  Similarity=-0.014  Sum_probs=87.0

Q ss_pred             CCCCeeEEEEEECCCCCHHHHHHHHHHhccCC-----CCCCceEEEEEecCCCchhHHHHHHHHHhhC----CCeEEecc
Q 020721           70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQS-----PTGAQVDVKFVFCNLTKEDQKVLVALEIMRY----DDIIILNC  140 (322)
Q Consensus        70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~-----~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~----~DIv~ld~  140 (322)
                      ...++.+-|+|+.--+...-.+.++++...-.     .......+++|-..+++.... .+++-.+.+    .++-.+..
T Consensus        66 ~~~~~~isVVIP~yNe~~~i~~~L~~l~~~~~~~~~~~~~~~~EIIVVDDgStD~T~~-i~~~~~~~~~~~~~~i~vi~~  144 (333)
T PTZ00260         66 KDSDVDLSIVIPAYNEEDRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLK-VAKDFWRQNINPNIDIRLLSL  144 (333)
T ss_pred             CCCCeEEEEEEeeCCCHHHHHHHHHHHHHHHHhhhccCCCCCEEEEEEeCCCCCchHH-HHHHHHHhcCCCCCcEEEEEc
Confidence            45567788888877554444456666664210     112245666665545543322 222222232    13555544


Q ss_pred             CCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-----CCCCeEEEEeec---C-----
Q 020721          141 KENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-----PREDLYYGYVIP---C-----  207 (322)
Q Consensus       141 ~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-----~~~~~y~G~~~p---~-----  207 (322)
                      ..+..  |..+.-.-+ .        +...+|++-+|.|....++.+.+.++..     +.-++..|....   .     
T Consensus       145 ~~N~G--~~~A~~~Gi-~--------~a~gd~I~~~DaD~~~~~~~l~~l~~~l~~~~~~~~dvV~GsR~~~~~~~~~~~  213 (333)
T PTZ00260        145 LRNKG--KGGAVRIGM-L--------ASRGKYILMVDADGATDIDDFDKLEDIMLKIEQNGLGIVFGSRNHLVDSDVVAK  213 (333)
T ss_pred             CCCCC--hHHHHHHHH-H--------HccCCEEEEEeCCCCCCHHHHHHHHHHHHHhhccCCceEEeeccccccCccccc
Confidence            44433  332211111 1        1245999999999998877755444332     233567775421   0     


Q ss_pred             CC------------------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          208 RS------------------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       208 r~------------------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ++                  ....... ..+.-++++++++.+....  ....+ .-|+.+-..+.+.|.
T Consensus       214 ~~~~r~~~~~~~~~l~~~~~~~~i~D~-~~Gfk~~~r~~~~~i~~~~--~~~~~-~fd~Ell~~a~~~g~  279 (333)
T PTZ00260        214 RKWYRNILMYGFHFIVNTICGTNLKDT-QCGFKLFTRETARIIFPSL--HLERW-AFDIEIVMIAQKLNL  279 (333)
T ss_pred             CcHHHHHHHHHHHHHHHHHcCCCcccC-CCCeEEEeHHHHHHHhhhc--cccCc-cchHHHHHHHHHcCC
Confidence            00                  1111111 1223488999999986532  11112 235666555655554


No 58 
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=42.68  E-value=1.9e+02  Score=23.78  Aligned_cols=88  Identities=11%  Similarity=-0.009  Sum_probs=57.2

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcC---CCCCCeEEEEeec---CC--C----------CC-CCCcccccceeeeCHHHH
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVP---LPREDLYYGYVIP---CR--S----------MD-PFVDYMSGMGYLVSWDIA  229 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~---~~~~~~y~G~~~p---~r--~----------~~-~yp~y~~G~gYvlS~d~~  229 (322)
                      +.+|++-.|+|..+..+.+...+..   .+.-.+..|....   ..  .          .. ....++.|++.++++++.
T Consensus        75 ~~~~v~~ld~D~~~~~~~~~~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  154 (202)
T cd06433          75 TGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLYGMPICHQATFFRRSLF  154 (202)
T ss_pred             CCCEEEEeCCCcccCchHHHHHHHHHHhCCCccEEEeeeEEEcCCCCcccCCCCcchhhhHHhhcCcccCcceEEEHHHH
Confidence            5699999999999998888877632   3444566665421   00  0          00 123467788899999999


Q ss_pred             HHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          230 EWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       230 ~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      +.+..-.   ......||..+..-+.+.|.
T Consensus       155 ~~~~~f~---~~~~~~~D~~~~~r~~~~g~  181 (202)
T cd06433         155 EKYGGFD---ESYRIAADYDLLLRLLLAGK  181 (202)
T ss_pred             HHhCCCc---hhhCchhhHHHHHHHHHcCC
Confidence            8885422   12234689887777766654


No 59 
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=42.19  E-value=2.2e+02  Score=24.24  Aligned_cols=87  Identities=13%  Similarity=0.034  Sum_probs=53.2

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEee--cCC-----------C--CCCCCcc-cccceeeeCHHHHHHH
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVI--PCR-----------S--MDPFVDY-MSGMGYLVSWDIAEWI  232 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~--p~r-----------~--~~~yp~y-~~G~gYvlS~d~~~~i  232 (322)
                      ..+|++..|+|..+..+.+.+.+......+...|...  ...           .  ...+... .++.|..+++++.+.+
T Consensus        72 ~~~~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~  151 (221)
T cd02522          72 RGDWLLFLHADTRLPPDWDAAIIETLRADGAVAGAFRLRFDDPGPRLRLLELGANLRSRLFGLPYGDQGLFIRRELFEEL  151 (221)
T ss_pred             cCCEEEEEcCCCCCChhHHHHHHHHhhcCCcEEEEEEeeecCCccchhhhhhcccceecccCCCcCCceEEEEHHHHHHh
Confidence            4599999999999988777776555433333333321  000           0  0000011 2356889999998887


Q ss_pred             HHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          233 RDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       233 ~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .....    ....||..++.=+.+.|.
T Consensus       152 G~fd~----~~~~ED~d~~~r~~~~G~  174 (221)
T cd02522         152 GGFPE----LPLMEDVELVRRLRRRGR  174 (221)
T ss_pred             CCCCc----cccccHHHHHHHHHhCCC
Confidence            54332    127899999877777764


No 60 
>PHA01631 hypothetical protein
Probab=38.74  E-value=1.1e+02  Score=26.43  Aligned_cols=59  Identities=17%  Similarity=0.265  Sum_probs=36.7

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCCCCCCe----EEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDL----YYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDS  235 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~----y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~  235 (322)
                      +-|.++-+|.|++++.-.  +.   .++..+    |.-...   +.+..-+||.|.-+++.+..+..+...
T Consensus        71 ~DDi~~iIDSDV~ipn~~--~~---~~~~~v~t~CiPA~~k---p~~~v~~FC~sTNf~~pr~~l~~l~~v  133 (176)
T PHA01631         71 EDDIIAIIDSDLIIPNLR--EI---IPNERVFTPCYWLYYD---WANEIRPFCSGTNYIFRKSLLPYLEYT  133 (176)
T ss_pred             CccEEEEeccceEecCcc--cc---ccCCCccceeeeeeec---CCCcEEEEEccccEEeeHHHhHHHHHH
Confidence            347888899999976432  21   222221    222211   223345899999999999988887653


No 61 
>PF09680 Tiny_TM_bacill:  Protein of unknown function (Tiny_TM_bacill);  InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=35.36  E-value=39  Score=19.53  Aligned_cols=17  Identities=29%  Similarity=0.465  Sum_probs=11.9

Q ss_pred             chhHHHHHHHHHHHHHH
Q 020721           11 RQFIVWSFFFILFLCVI   27 (322)
Q Consensus        11 ~~~~~~~~~~i~~~~~~   27 (322)
                      ..|++..++||+++.+-
T Consensus         4 ~~FalivVLFILLiIvG   20 (24)
T PF09680_consen    4 SGFALIVVLFILLIIVG   20 (24)
T ss_pred             ccchhHHHHHHHHHHhc
Confidence            35778888888776653


No 62 
>TIGR01732 tiny_TM_bacill conserved hypothetical tiny transmembrane protein. This model represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=34.17  E-value=44  Score=19.69  Aligned_cols=18  Identities=33%  Similarity=0.451  Sum_probs=13.5

Q ss_pred             chhHHHHHHHHHHHHHHH
Q 020721           11 RQFIVWSFFFILFLCVIA   28 (322)
Q Consensus        11 ~~~~~~~~~~i~~~~~~~   28 (322)
                      ..|+++.++||+++.+-.
T Consensus         6 ~gf~livVLFILLIIiga   23 (26)
T TIGR01732         6 GGFALIVVLFILLVIVGA   23 (26)
T ss_pred             cchHHHHHHHHHHHHhhe
Confidence            468888888888877654


No 63 
>KOG3588 consensus Chondroitin synthase 1 [Carbohydrate transport and metabolism]
Probab=33.77  E-value=12  Score=36.42  Aligned_cols=67  Identities=19%  Similarity=0.196  Sum_probs=42.2

Q ss_pred             eCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCccceeecCC------CCCcccccccCCEEEEEccCCH
Q 020721          224 VSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPV------PRTACTHELWADTIAVHLLKNQ  297 (322)
Q Consensus       224 lS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~------~~~~c~~~~~~~~ia~H~~~~~  297 (322)
                      ||++++..|.+-  +...+-.+|||.+|+|.++.-.        ..|+.--.      ..+   +....+.|+.|.+|++
T Consensus         1 Msr~~l~hl~~c--L~~m~t~hedve~grc~r~~mq--------~Lf~~n~~ank~sy~~~---~~~~~~aitlhp~K~p   67 (494)
T KOG3588|consen    1 MSRDTLLHLESC--LQHMLTSHEDVELGRCIRKHMQ--------KLFHNNQSANKESYAKN---MKELKDAITLHPIKDP   67 (494)
T ss_pred             CcHHHHHHHHHH--HHHHHhhccCcchhHHHHHHHH--------HHhhhhhhhcChhhhcc---chhhhhhhccccccCH
Confidence            688888877653  3456678999999999988511        11211100      011   1123678999999999


Q ss_pred             HHHHHH
Q 020721          298 EKWIET  303 (322)
Q Consensus       298 ~~~~~~  303 (322)
                      -.+.++
T Consensus        68 ~~~~r~   73 (494)
T KOG3588|consen   68 AVMRRV   73 (494)
T ss_pred             HHHHHH
Confidence            844443


No 64 
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=32.26  E-value=3.1e+02  Score=23.36  Aligned_cols=139  Identities=15%  Similarity=-0.003  Sum_probs=73.2

Q ss_pred             ceEEEEEecCCCchhHHHHHHHHHhhCCCe-EEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeH
Q 020721          106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDI-IILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRL  184 (322)
Q Consensus       106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DI-v~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~  184 (322)
                      .+.++.+-+.+++.. ...+++..++++.. ..+...++.  ++.-+.-.-+..         ...+|++-.|+|.....
T Consensus        30 ~~eiivvdd~S~D~t-~~~~~~~~~~~~~~i~~i~~~~n~--G~~~a~~~g~~~---------a~gd~i~~ld~D~~~~~   97 (211)
T cd04188          30 SYEIIVVDDGSKDGT-AEVARKLARKNPALIRVLTLPKNR--GKGGAVRAGMLA---------ARGDYILFADADLATPF   97 (211)
T ss_pred             CEEEEEEeCCCCCch-HHHHHHHHHhCCCcEEEEEcccCC--CcHHHHHHHHHH---------hcCCEEEEEeCCCCCCH
Confidence            455666655555433 23444445556654 334444433  233222222221         14599999999999998


Q ss_pred             HHHHHHhcCC--CCCCeEEEEeecC-------CC---------------CCCCCcc--cccceeeeCHHHHHHHHHcCCC
Q 020721          185 ENLVKSLVPL--PREDLYYGYVIPC-------RS---------------MDPFVDY--MSGMGYLVSWDIAEWIRDSDIP  238 (322)
Q Consensus       185 ~~L~~~L~~~--~~~~~y~G~~~p~-------r~---------------~~~yp~y--~~G~gYvlS~d~~~~i~~~~~~  238 (322)
                      +.+.+.++..  +..++..|.....       ++               .-.-+++  ...+..++++++++.+....  
T Consensus        98 ~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~g~~~~~r~~~~~~~~~~--  175 (211)
T cd04188          98 EELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFKLFTRDAARRLFPRL--  175 (211)
T ss_pred             HHHHHHHHHHhccCCcEEEEEeeccCCcccccccHHHHHHHHHHHHHHHHHcCCCCcccccCceeEcHHHHHHHHhhh--
Confidence            8888877752  3446667754310       00               0000122  12245688999998886432  


Q ss_pred             CCCCCCchhHHHHHHHHhcCC
Q 020721          239 KNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       239 ~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      ... ...+|..+..-+.+.|.
T Consensus       176 ~~~-~~~~d~el~~r~~~~g~  195 (211)
T cd04188         176 HLE-RWAFDVELLVLARRLGY  195 (211)
T ss_pred             hcc-ceEeeHHHHHHHHHcCC
Confidence            112 23357766555555553


No 65 
>PF11395 DUF2873:  Protein of unknown function (DUF2873);  InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=29.76  E-value=44  Score=21.43  Aligned_cols=19  Identities=16%  Similarity=0.289  Sum_probs=7.9

Q ss_pred             HHHHHhccccccccccccCcc
Q 020721           24 LCVIASINEVRFDGLLKFGRC   44 (322)
Q Consensus        24 ~~~~~~~~~~~~~~~~~~~~~   44 (322)
                      +.+++|+..+.+|..-+  ||
T Consensus        23 imliif~f~le~qdl~e--pc   41 (43)
T PF11395_consen   23 IMLIIFWFSLEIQDLNE--PC   41 (43)
T ss_pred             HHHHHHHHHHhhhhhcc--cc
Confidence            33334334444444333  55


No 66 
>PF13704 Glyco_tranf_2_4:  Glycosyl transferase family 2
Probab=29.38  E-value=2.4e+02  Score=20.94  Aligned_cols=47  Identities=13%  Similarity=0.116  Sum_probs=27.9

Q ss_pred             CCeEEeccCCcCCc-hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHH
Q 020721          133 DDIIILNCKENMNK-GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLEN  186 (322)
Q Consensus       133 ~DIv~ld~~D~y~n-~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~  186 (322)
                      .++-...+...+.. .....+...+.+..       ..++|++-+|-|=|+.++.
T Consensus        41 ~~v~i~~~~~~~~~~~~~~~~~~~~~~~~-------~~~dWvl~~D~DEfl~~~~   88 (97)
T PF13704_consen   41 PGVGIIRWVDPYRDERRQRAWRNALIERA-------FDADWVLFLDADEFLVPPP   88 (97)
T ss_pred             CCcEEEEeCCCccchHHHHHHHHHHHHhC-------CCCCEEEEEeeeEEEecCC
Confidence            44555555556643 22233333333321       2679999999999988764


No 67 
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein.  Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold.  This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=29.23  E-value=2.6e+02  Score=21.24  Aligned_cols=73  Identities=25%  Similarity=0.249  Sum_probs=48.3

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGP  245 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~  245 (322)
                      +.+|++-+|+|..+..+.+...+...   +...+..|.                +++++++++.+.+.....  ......
T Consensus        77 ~~d~v~~~d~D~~~~~~~~~~~~~~~~~~~~~~~v~~~----------------~~~~~~~~~~~~~~~~~~--~~~~~~  138 (156)
T cd00761          77 RGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGP----------------GNLLFRRELLEEIGGFDE--ALLSGE  138 (156)
T ss_pred             cCCEEEEECCCCccCccHHHHHHHHHhcCCCceEEecc----------------chheeeHHHHHHhCCcch--HhcCCc
Confidence            56999999999999888887763322   111221111                889999999998865432  122237


Q ss_pred             hhHHHHHHHHhcCC
Q 020721          246 EDKVFGAWIREGRR  259 (322)
Q Consensus       246 EDv~vG~~l~~~~~  259 (322)
                      ||..++..+...|.
T Consensus       139 ed~~~~~~~~~~g~  152 (156)
T cd00761         139 EDDDFLLRLLRGGK  152 (156)
T ss_pred             chHHHHHHHHhhcc
Confidence            88888877766553


No 68 
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=25.43  E-value=9.3e+02  Score=26.45  Aligned_cols=87  Identities=11%  Similarity=-0.030  Sum_probs=54.5

Q ss_pred             CceEEEEecCceEeeHHHHHHHhc---CCCCCCeEEEEe-----------------ecCCC----------CCC-CCccc
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLV---PLPREDLYYGYV-----------------IPCRS----------MDP-FVDYM  217 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~---~~~~~~~y~G~~-----------------~p~r~----------~~~-yp~y~  217 (322)
                      +.+|++-.|.|..+..+-|.+.+.   ..|+-.+.-|..                 .+...          .+. --.++
T Consensus       339 ~GEyIavlDAD~ip~pdfL~~~V~~f~~dP~VglVQtp~~f~n~dp~~rnl~~~~~~~~e~~~fy~~iq~g~~~~~a~~~  418 (852)
T PRK11498        339 KGEFVAIFDCDHVPTRSFLQMTMGWFLKDKKLAMMQTPHHFFSPDPFERNLGRFRKTPNEGTLFYGLVQDGNDMWDATFF  418 (852)
T ss_pred             CCCEEEEECCCCCCChHHHHHHHHHHHhCCCeEEEEcceeccCCchHHHhhHHHhhcccchhHHHHHHHhHHHhhccccc
Confidence            569999999999998877776543   223211111110                 00000          000 02457


Q ss_pred             ccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721          218 SGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR  259 (322)
Q Consensus       218 ~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~  259 (322)
                      .|++.++.+++++.+.--..    ....||..++.-+...|-
T Consensus       419 ~Gs~aviRReaLeeVGGfd~----~titED~dlslRL~~~Gy  456 (852)
T PRK11498        419 CGSCAVIRRKPLDEIGGIAV----ETVTEDAHTSLRLHRRGY  456 (852)
T ss_pred             ccceeeeEHHHHHHhcCCCC----CccCccHHHHHHHHHcCC
Confidence            89999999999999865331    124799999999987764


No 69 
>KOG2547 consensus Ceramide glucosyltransferase [Lipid transport and metabolism; Cell wall/membrane/envelope biogenesis]
Probab=23.69  E-value=2.5e+02  Score=27.71  Aligned_cols=26  Identities=12%  Similarity=0.429  Sum_probs=22.8

Q ss_pred             CceEEEEecCceEeeHHHHHHHhcCC
Q 020721          169 PYHYVMKGDDDTYIRLENLVKSLVPL  194 (322)
Q Consensus       169 ~~~yvlK~DDD~fv~~~~L~~~L~~~  194 (322)
                      +++||+-.|||+++..+.+.+.-..+
T Consensus       170 ~ydlvlisDsgI~m~pdtildm~t~M  195 (431)
T KOG2547|consen  170 KYDLVLISDSGIFMKPDTILDMATTM  195 (431)
T ss_pred             cCCEEEEecCCeeecCchHHHHHHhh
Confidence            56899999999999999999877654


No 70 
>cd04459 Rho_CSD Rho_CSD: Rho protein cold-shock domain (CSD). Rho protein is a transcription termination factor in most bacteria. In bacteria, there are two distinct mechanisms for mRNA transcription termination. In intrinsic termination, RNA polymerase and nascent mRNA are released from DNA template by an mRNA stem loop structure, which resembles the transcription termination mechanism used by eukaryotic pol III. The second mechanism is mediated by Rho factor. Rho factor terminates transcription by using energy from ATP hydrolysis to forcibly dissociate the transcripts from RNA polymerase. Rho protein contains an N-terminal S1-like domain, which binds single-stranded RNA. Rho has a C-terminal ATPase domain which hydrolyzes ATP to provide energy to strip RNA polymerase and mRNA from the DNA template. Rho functions as a homohexamer.
Probab=23.20  E-value=1.5e+02  Score=21.58  Aligned_cols=39  Identities=21%  Similarity=0.375  Sum_probs=27.5

Q ss_pred             EEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCC
Q 020721          174 MKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFV  214 (322)
Q Consensus       174 lK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp  214 (322)
                      +...||+||....+.++=  +-+-+...|.+.+.+..+.|+
T Consensus        23 ~~~~~DvyVs~~~Irr~~--LR~GD~V~G~vr~p~~~ek~~   61 (68)
T cd04459          23 LPGPDDIYVSPSQIRRFN--LRTGDTVVGQIRPPKEGERYF   61 (68)
T ss_pred             CCCCCCEEECHHHHHHhC--CCCCCEEEEEEeCCCCCCCcc
Confidence            678999999998777652  223478889887655555553


No 71 
>PF08525 OapA_N:  Opacity-associated protein A N-terminal motif;  InterPro: IPR013731 This domain is found in the Haemophilus influenzae opacity-associated protein (OapA). It is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [, ]. This motif occurs at the N terminus of these proteins. It contains a conserved histidine followed by a run of hydrophobic residues.  Many of the proteins in this entry are unassigned peptidases belonging to MEROPS peptidase family M23B. 
Probab=21.97  E-value=88  Score=18.93  Aligned_cols=17  Identities=6%  Similarity=0.225  Sum_probs=7.1

Q ss_pred             CccchhHHHHHHHHHHH
Q 020721            8 PHQRQFIVWSFFFILFL   24 (322)
Q Consensus         8 ~~~~~~~~~~~~~i~~~   24 (322)
                      |+..|.++..+..+.++
T Consensus         7 P~~Hr~~l~~l~~v~l~   23 (30)
T PF08525_consen    7 PKLHRRALIALSAVVLV   23 (30)
T ss_pred             CHHHHHHHHHHHHHHHH
Confidence            33344444444444444


No 72 
>COG4425 Predicted membrane protein [Function unknown]
Probab=20.58  E-value=7.5e+02  Score=25.20  Aligned_cols=78  Identities=17%  Similarity=0.143  Sum_probs=45.5

Q ss_pred             ecCCCCCCeeEEEEEECCCCCHHHHH-HHHHHhccCCCCCCceEEEEEecCCCchhH--HHHHHHHHhhCCCeEEeccCC
Q 020721           66 ATNSSSEDIRILIGILTLPDQYHRRH-FLRMIYGTQSPTGAQVDVKFVFCNLTKEDQ--KVLVALEIMRYDDIIILNCKE  142 (322)
Q Consensus        66 ~~~~~~~~~~lli~V~S~~~~~~rR~-aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~--~~~l~~E~~~~~DIv~ld~~D  142 (322)
                      ..+...+++++++|..|+..-.+|-+ ++||-= +.  .|.+=+++-|+-.+....+  .++=..|-...||+..+-+.=
T Consensus       281 t~t~a~ePiRvy~Glnsa~s~~~ra~Lal~El~-Rt--Gg~rR~vlvVv~~TGTGWIdp~a~~t~EyL~~Gd~asVsmQY  357 (588)
T COG4425         281 TGTPAVEPIRVYAGLNSADSPEERAELALRELK-RT--GGLRRKVLVVVTSTGTGWIDPAAADTLEYLYNGDVASVSMQY  357 (588)
T ss_pred             cCCCCCCceeEEecccccCCHHHHHHHHHHHHH-Hh--cCcceeEEEEEcCCCCCCCCHHHHhHHHHHhCCceEEEEEeh
Confidence            34445778999999999988766655 444432 11  1112123333322222111  123356889999999988887


Q ss_pred             cCCc
Q 020721          143 NMNK  146 (322)
Q Consensus       143 ~y~n  146 (322)
                      +|-.
T Consensus       358 SyL~  361 (588)
T COG4425         358 SYLP  361 (588)
T ss_pred             hhHH
Confidence            7754


Done!