Query 020721
Match_columns 322
No_of_seqs 246 out of 1355
Neff 7.3
Searched_HMMs 46136
Date Fri Mar 29 04:37:13 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/020721.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/020721hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00210 UDP-GlcNAc-dependent 100.0 1.1E-51 2.3E-56 388.1 21.9 253 58-319 58-368 (382)
2 KOG2287 Galactosyltransferases 100.0 4.6E-47 9.9E-52 363.6 27.3 230 73-317 94-347 (349)
3 PLN03133 beta-1,3-galactosyltr 100.0 2E-44 4.3E-49 361.8 26.0 226 66-310 376-627 (636)
4 PLN03193 beta-1,3-galactosyltr 100.0 3.6E-43 7.7E-48 335.1 21.5 192 70-273 135-356 (408)
5 PF01762 Galactosyl_T: Galacto 100.0 3.5E-40 7.6E-45 292.0 16.1 167 88-262 1-192 (195)
6 KOG2288 Galactosyltransferases 100.0 1.8E-30 3.9E-35 231.6 12.8 189 71-271 8-224 (274)
7 KOG2246 Galactosyltransferases 99.8 1.3E-17 2.8E-22 160.2 17.7 169 66-261 83-265 (364)
8 PF02434 Fringe: Fringe-like; 99.8 1.4E-17 3.1E-22 153.2 14.7 197 74-295 6-232 (252)
9 PLN03153 hypothetical protein; 99.4 7.1E-12 1.5E-16 123.3 19.2 204 73-296 121-344 (537)
10 KOG3708 Uncharacterized conser 97.8 6.5E-05 1.4E-09 74.0 8.7 204 71-309 23-245 (681)
11 TIGR03472 HpnI hopanoid biosyn 95.8 1.1 2.4E-05 43.4 18.5 171 74-259 41-240 (373)
12 TIGR03469 HonB hopene-associat 95.5 1.6 3.5E-05 42.5 18.4 174 73-259 39-249 (384)
13 PF13641 Glyco_tranf_2_3: Glyc 95.1 0.18 3.9E-06 44.5 9.6 169 74-259 1-197 (228)
14 cd02520 Glucosylceramide_synth 95.1 1.3 2.8E-05 38.3 14.9 130 106-259 30-164 (196)
15 PF13506 Glyco_transf_21: Glyc 95.0 0.14 3E-06 44.5 8.3 106 145-260 15-142 (175)
16 cd04192 GT_2_like_e Subfamily 94.5 2.8 6.1E-05 36.4 15.7 141 107-259 29-195 (229)
17 cd04186 GT_2_like_c Subfamily 93.7 2.6 5.6E-05 34.4 13.1 76 169-259 74-152 (166)
18 PF04646 DUF604: Protein of un 92.0 0.41 8.9E-06 44.0 6.2 74 218-295 12-92 (255)
19 PRK11204 N-glycosyltransferase 90.1 12 0.00026 36.5 15.1 169 72-259 52-247 (420)
20 cd06420 GT2_Chondriotin_Pol_N 90.1 10 0.00022 31.7 13.3 90 169-260 79-168 (182)
21 cd06532 Glyco_transf_25 Glycos 90.0 3.8 8.2E-05 33.4 9.7 110 78-236 2-117 (128)
22 cd06439 CESA_like_1 CESA_like_ 89.1 16 0.00034 32.5 17.0 165 73-259 28-216 (251)
23 cd04185 GT_2_like_b Subfamily 89.0 5.7 0.00012 34.0 10.7 82 169-260 79-162 (202)
24 cd02510 pp-GalNAc-T pp-GalNAc- 88.9 19 0.00042 33.3 15.2 43 215-258 169-211 (299)
25 PF01755 Glyco_transf_25: Glyc 88.1 6.8 0.00015 34.0 10.6 34 78-116 4-37 (200)
26 cd06434 GT2_HAS Hyaluronan syn 87.9 18 0.00038 31.6 14.3 139 107-259 29-200 (235)
27 COG1216 Predicted glycosyltran 87.5 23 0.00051 33.0 14.5 169 74-259 3-207 (305)
28 cd06427 CESA_like_2 CESA_like_ 87.4 16 0.00035 32.5 12.9 87 169-259 84-199 (241)
29 cd04195 GT2_AmsE_like GT2_AmsE 86.5 17 0.00037 30.8 12.2 87 169-259 80-188 (201)
30 cd04196 GT_2_like_d Subfamily 86.2 19 0.0004 30.7 12.3 87 169-258 79-190 (214)
31 TIGR01556 rhamnosyltran L-rham 85.6 8.3 0.00018 35.3 10.3 118 131-259 43-188 (281)
32 cd02525 Succinoglycan_BP_ExoA 85.6 24 0.00052 30.8 15.9 88 169-259 81-195 (249)
33 cd06421 CESA_CelA_like CESA_Ce 85.5 24 0.00051 30.7 15.1 87 169-259 84-198 (234)
34 PRK14583 hmsR N-glycosyltransf 85.0 35 0.00076 33.8 15.0 168 73-259 74-268 (444)
35 PF00535 Glycos_transf_2: Glyc 83.1 3.8 8.2E-05 33.1 6.2 114 106-232 27-168 (169)
36 cd04179 DPM_DPG-synthase_like 80.7 32 0.00068 28.6 11.6 116 106-233 28-167 (185)
37 COG1215 Glycosyltransferases, 80.4 58 0.0013 31.5 17.4 170 73-259 53-252 (439)
38 PF13632 Glyco_trans_2_3: Glyc 79.4 5.3 0.00011 34.2 6.0 85 172-259 1-111 (193)
39 PF05679 CHGN: Chondroitin N-a 79.0 3.4 7.3E-05 42.0 5.3 67 240-308 8-81 (499)
40 cd04187 DPM1_like_bac Bacteria 76.4 44 0.00095 27.9 12.9 119 106-236 29-166 (181)
41 cd04191 Glucan_BSP_ModH Glucan 76.3 62 0.0014 29.6 14.8 173 77-259 2-218 (254)
42 cd02526 GT2_RfbF_like RfbF is 75.6 54 0.0012 28.5 13.9 89 169-259 75-191 (237)
43 TIGR03111 glyc2_xrt_Gpos1 puta 75.5 89 0.0019 30.9 16.8 166 74-256 49-250 (439)
44 cd06423 CESA_like CESA_like is 72.7 46 0.001 26.4 11.3 66 169-234 78-171 (180)
45 PRK14716 bacteriophage N4 adso 66.0 1.6E+02 0.0035 30.0 18.6 185 72-260 64-277 (504)
46 cd06438 EpsO_like EpsO protein 65.0 83 0.0018 26.4 10.1 63 169-231 81-169 (183)
47 cd06442 DPM1_like DPM1_like re 63.9 94 0.002 26.6 12.5 65 169-233 78-167 (224)
48 PRK11234 nfrB bacteriophage N4 61.7 2.3E+02 0.005 30.4 18.1 184 70-260 59-274 (727)
49 cd04184 GT2_RfbC_Mx_like Myxoc 61.3 99 0.0021 26.0 14.5 88 169-259 83-188 (202)
50 TIGR03030 CelA cellulose synth 58.2 2.6E+02 0.0056 29.8 16.0 87 169-259 228-345 (713)
51 PF05412 Peptidase_C33: Equine 54.2 10 0.00022 30.1 2.2 18 164-181 63-80 (108)
52 PRK10714 undecaprenyl phosphat 53.5 2E+02 0.0044 27.2 14.2 146 75-236 7-176 (325)
53 PRK05454 glucosyltransferase M 53.1 1.6E+02 0.0035 31.4 11.5 181 70-259 120-343 (691)
54 cd04190 Chitin_synth_C C-termi 50.8 35 0.00076 30.6 5.5 91 169-259 73-207 (244)
55 cd06435 CESA_NdvC_like NdvC_li 49.1 1.8E+02 0.0039 25.2 16.2 87 169-259 84-196 (236)
56 PLN02726 dolichyl-phosphate be 48.4 1.9E+02 0.0042 25.5 16.3 168 73-259 8-205 (243)
57 PTZ00260 dolichyl-phosphate be 46.0 2.7E+02 0.0059 26.5 14.7 174 70-259 66-279 (333)
58 cd06433 GT_2_WfgS_like WfgS an 42.7 1.9E+02 0.0041 23.8 14.0 88 169-259 75-181 (202)
59 cd02522 GT_2_like_a GT_2_like_ 42.2 2.2E+02 0.0047 24.2 13.6 87 169-259 72-174 (221)
60 PHA01631 hypothetical protein 38.7 1.1E+02 0.0025 26.4 6.3 59 169-235 71-133 (176)
61 PF09680 Tiny_TM_bacill: Prote 35.4 39 0.00084 19.5 2.0 17 11-27 4-20 (24)
62 TIGR01732 tiny_TM_bacill conse 34.2 44 0.00096 19.7 2.1 18 11-28 6-23 (26)
63 KOG3588 Chondroitin synthase 1 33.8 12 0.00027 36.4 -0.3 67 224-303 1-73 (494)
64 cd04188 DPG_synthase DPG_synth 32.3 3.1E+02 0.0066 23.4 8.4 139 106-259 30-195 (211)
65 PF11395 DUF2873: Protein of u 29.8 44 0.00096 21.4 1.8 19 24-44 23-41 (43)
66 PF13704 Glyco_tranf_2_4: Glyc 29.4 2.4E+02 0.0052 20.9 6.8 47 133-186 41-88 (97)
67 cd00761 Glyco_tranf_GTA_type G 29.2 2.6E+02 0.0056 21.2 12.6 73 169-259 77-152 (156)
68 PRK11498 bcsA cellulose syntha 25.4 9.3E+02 0.02 26.5 15.0 87 169-259 339-456 (852)
69 KOG2547 Ceramide glucosyltrans 23.7 2.5E+02 0.0054 27.7 6.5 26 169-194 170-195 (431)
70 cd04459 Rho_CSD Rho_CSD: Rho p 23.2 1.5E+02 0.0033 21.6 3.9 39 174-214 23-61 (68)
71 PF08525 OapA_N: Opacity-assoc 22.0 88 0.0019 18.9 2.1 17 8-24 7-23 (30)
72 COG4425 Predicted membrane pro 20.6 7.5E+02 0.016 25.2 9.2 78 66-146 281-361 (588)
No 1
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=100.00 E-value=1.1e-51 Score=388.15 Aligned_cols=253 Identities=21% Similarity=0.270 Sum_probs=220.5
Q ss_pred cCCccce------eecCCCCCCeeEEEEEECCCCC--HHHHHHHHHHhccCC--------CCCCceEEEEEecCCCch--
Q 020721 58 FSNASHV------VATNSSSEDIRILIGILTLPDQ--YHRRHFLRMIYGTQS--------PTGAQVDVKFVFCNLTKE-- 119 (322)
Q Consensus 58 ~~~~~~l------~~~~~~~~~~~lli~V~S~~~~--~~rR~aIR~TW~~~~--------~~~~~v~~~Fvlg~~~~~-- 119 (322)
+.+|++| ++.+|+.++..+++||+|..++ +.||+++|+||.+-. ++|+.+ ++|++|.+++.
T Consensus 58 ~~~~~~l~~~~~~~~~~w~~~~~lv~~Gi~S~d~~~r~~rR~lqr~t~w~y~~va~~~n~ftg~~l-v~y~l~~H~~~~~ 136 (382)
T PTZ00210 58 IDEDEYLMFVPSNVAAVWKAQRFLAVLGIPSVDNSERSRRRDLQRQTCWKYSGVATRSNNFSGSLL-PLYLLAPHQSNSY 136 (382)
T ss_pred cchhhhhhccCHHHHHHhccCCceEEEeccCCCchHHHHHHHHHHhhhhcchhhhhhccCCchhhh-hhhhhccCCccch
Confidence 5566665 6899999999999999999999 799999999996432 566665 99999999988
Q ss_pred hHHHHHHHHHhhCCCeEEeccCCcC------------------Cc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCc
Q 020721 120 DQKVLVALEIMRYDDIIILNCKENM------------------NK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDD 179 (322)
Q Consensus 120 ~~~~~l~~E~~~~~DIv~ld~~D~y------------------~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD 179 (322)
+.+.+|.+|+++|||||++||.|.| .+ +||+++|+|+.+. ||+++||+|+|||
T Consensus 137 ~~~~~L~eEA~~~~DIVilpf~d~~~tTnKkiG~~g~WG~e~e~~mT~KT~l~~~wA~~~-------cP~a~YImKgDDD 209 (382)
T PTZ00210 137 LISHSLKEEAARTHDIITLPTNDVSPSTRKKIGENGNWGIEAEVAMSRKTYLWLRFALHM-------FPNVSYIVKGDDD 209 (382)
T ss_pred hhhHHHHHHHHHhCCEEEEecccCccccccccccCCcccchhhcchhHHHHHHHHHHHHh-------CCCCCeEEEcCCC
Confidence 7788999999999999999996666 23 9999999999886 8899999999999
Q ss_pred eEeeHHHHHHHhcCCCCCCeEEEEeecCC-C-CCCCCcccccceeeeCHHHHHHHHHcCCC-CC---------------C
Q 020721 180 TYIRLENLVKSLVPLPREDLYYGYVIPCR-S-MDPFVDYMSGMGYLVSWDIAEWIRDSDIP-KN---------------H 241 (322)
Q Consensus 180 ~fv~~~~L~~~L~~~~~~~~y~G~~~p~r-~-~~~yp~y~~G~gYvlS~d~~~~i~~~~~~-~~---------------~ 241 (322)
+|||+++++++|+..|++++|+|.+.... + .+.+|+||+|+||+||+|+|+.|++..++ +. .
T Consensus 210 vFVrVp~lL~~Lr~~prr~LY~G~v~~~~~p~Rd~~PpY~~G~gYvLSrDVA~~Lvs~~pl~rL~~~pys~~~~~~y~~~ 289 (382)
T PTZ00210 210 IFIRVPKYLADLRVMPRHGLYMGRYNYYNRIWRRNQLTYVNGYCITLSRDTAQAIISYKPLERLVNMPFSMWDYFDFLDL 289 (382)
T ss_pred eEeeHHHHHHHHhhCCCCceEEEeeCCCCccccCCCCCccccceeeccHHHHHHHHhhChHhHhhcCCCchHHHHHHHHh
Confidence 99999999999999999999999986431 1 24579999999999999999999997543 22 4
Q ss_pred CCCchhHHHHHHHHhcCCCCccccCccceeecCCC-CCcccccccCCEEEEEccCCHHHHHHHHHHccccCC-CCCCCCC
Q 020721 242 LEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPVP-RTACTHELWADTIAVHLLKNQEKWIETLTYFNVTAN-LKPSKLY 319 (322)
Q Consensus 242 ~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~~-~~~c~~~~~~~~ia~H~~~~~~~~~~~~~~fn~~~~-~~p~~~~ 319 (322)
.+..||+++|.+|+..++.++.++.+..+|+|+.. ...|.+++.+++|++|+++++| |+.++++|+.+.+ .||+|.+
T Consensus 290 ~~~~EDiMvG~vLr~~~k~~~l~~V~~~~c~Fhd~~~~~~~~~v~~~sVvvHhike~d-Ya~Lm~~F~n~~~~~~P~~~~ 368 (382)
T PTZ00210 290 GMFYEDVMVGMILREKVVYRNLISVEMGRCHFHNAGKFGVRKSVRNMSVVIHHIQEAD-YEMLMDYFPEGVIPPAPSKVM 368 (382)
T ss_pred hcCchHHHHHHHHHHhcCcCceeeeccccccceecCCCCCccccccceEEEEecCHHH-HHHHHHHhcCCCCCCCCcceE
Confidence 57899999999999988888888888999999875 5778888999999999999997 9999999999988 6898865
No 2
>KOG2287 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=100.00 E-value=4.6e-47 Score=363.60 Aligned_cols=230 Identities=28% Similarity=0.419 Sum_probs=197.1
Q ss_pred CeeEEEEEECCCCCHHHHHHHHHHhccCC-CCCCceEEEEEecCCCch-hHHHHHHHHHhhCCCeEEeccCCcCCc--hH
Q 020721 73 DIRILIGILTLPDQYHRRHFLRMIYGTQS-PTGAQVDVKFVFCNLTKE-DQKVLVALEIMRYDDIIILNCKENMNK--GK 148 (322)
Q Consensus 73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~~~-~~~~~l~~E~~~~~DIv~ld~~D~y~n--~K 148 (322)
.++++++|+|++++++||++||+|||++. ..+.+++++|++|.++++ ..+.+|.+|++.||||||.||.|+|.| .|
T Consensus 94 ~~~lLl~V~S~~~~farR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~~df~Dty~nltlK 173 (349)
T KOG2287|consen 94 PPELLLLVKSAPDNFARRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQVDFEDTYFNLTLK 173 (349)
T ss_pred CceEEEEEecCCCCHHHHHHHHHHhcCccccCCCcEEEEEEecCCCcHHHHHHHHHHHHHHhCCEEEEecccchhchHHH
Confidence 48999999999999999999999999986 567899999999988865 457799999999999999999999998 99
Q ss_pred HHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEee----cCC-------------C
Q 020721 149 TYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVI----PCR-------------S 209 (322)
Q Consensus 149 t~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~----p~r-------------~ 209 (322)
|++++.|... +||+++|++|+|||+||++++|+++|... +.+.+|+|++. |.| +
T Consensus 174 tl~~l~w~~~-------~cp~akfi~K~DDDvfv~~~~L~~~L~~~~~~~~~~~~G~v~~~~~p~R~~~~KwyVp~~~y~ 246 (349)
T KOG2287|consen 174 TLAILLWGVS-------KCPDAKFILKIDDDVFVNPDNLLEYLDKLNDPSSDLYYGRVIQNAPPIRDKTSKWYVPESEYP 246 (349)
T ss_pred HHHHHHHHHh-------cCCcceEEEeccCceEEcHHHHHHHHhccCCCCcceEEEeecccCCCCCCCCCCCccCHHHCC
Confidence 9999999986 49999999999999999999999999887 67899999875 222 3
Q ss_pred CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhc-CCCCccccCccceeecCCCCCcccccccCCE
Q 020721 210 MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREG-RRAKNRYNAKWSMYNFPVPRTACTHELWADT 288 (322)
Q Consensus 210 ~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~-~~~~~~~~~~~~f~~~~~~~~~c~~~~~~~~ 288 (322)
...||+||+|+|||+|+|+|++|++.+ .+.+.+.+|||++|+|+++. |+.+.. +..++.++...+.|. +++.
T Consensus 247 ~~~YP~Y~sG~gYvis~~~a~~l~~~s-~~~~~~~iEDV~~g~~l~~~~gi~~~~---~~~~~~~~~~~~~~~---~~~~ 319 (349)
T KOG2287|consen 247 CSVYPPYASGPGYVISGDAARRLLKAS-KHLKFFPIEDVFVGGCLAEDLGIKPVN---HPGFFEIPLSFDPCC---YRDL 319 (349)
T ss_pred CCCCCCcCCCceeEecHHHHHHHHHHh-cCCCccchHHHHHHHHHHHhcCCCccc---CcccccccccCCCCc---ccce
Confidence 557999999999999999999999976 57899999999999999997 776542 233555554445554 4889
Q ss_pred EEEEccCCHHHHHHHHHHccccCCCCCCC
Q 020721 289 IAVHLLKNQEKWIETLTYFNVTANLKPSK 317 (322)
Q Consensus 289 ia~H~~~~~~~~~~~~~~fn~~~~~~p~~ 317 (322)
+++|..++++ +..+|+....+.+++-++
T Consensus 320 ~~~H~~~p~e-~~~~w~~~~~~~~~~c~~ 347 (349)
T KOG2287|consen 320 LAVHRLSPNE-MIYLWKKLKDLANLKCKN 347 (349)
T ss_pred EEEecCCHHH-HHHHHHHhhccccccccc
Confidence 9999999777 777888888766666443
No 3
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=100.00 E-value=2e-44 Score=361.79 Aligned_cols=226 Identities=22% Similarity=0.347 Sum_probs=183.8
Q ss_pred ecCCC-CCCeeEEEEEECCCCCHHHHHHHHHHhccCC-CCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCc
Q 020721 66 ATNSS-SEDIRILIGILTLPDQYHRRHFLRMIYGTQS-PTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKEN 143 (322)
Q Consensus 66 ~~~~~-~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~ 143 (322)
++..+ ..+++|+|+|+|+|+|++||+|||+|||+.. ..+..++++|++|.+.++..+..|.+|+++||||||+||.|+
T Consensus 376 AppL~~~~~~~LlI~V~Sap~nf~rR~AIR~TWg~~~~~~~~~v~~rFvVG~s~n~~l~~~L~~Ea~~ygDIIq~dF~Ds 455 (636)
T PLN03133 376 SPPLSPKKPLDLFIGVFSTANNFKRRMAVRRTWMQYDAVRSGAVAVRFFVGLHKNQMVNEELWNEARTYGDIQLMPFVDY 455 (636)
T ss_pred CCCCCCCCceEEEEEEeCCcccHHHHHHHHHhhccccccCCCceEEEEEEecCCcHHHHHHHHHHHHHcCCeEEEeeech
Confidence 33444 4568999999999999999999999999864 344568899999999888888899999999999999999999
Q ss_pred CCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEee----cCC--------
Q 020721 144 MNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVI----PCR-------- 208 (322)
Q Consensus 144 y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~----p~r-------- 208 (322)
|+| .||++++.|+.. |++++|+||+|||+|||+++|+++|+.. +.+++|+|++. |.|
T Consensus 456 Y~NLTlKtl~~~~wa~~--------c~~akFilK~DDDvFVnv~~Ll~~L~~~~~~~~Ly~G~v~~~~~PiRd~~sKWYV 527 (636)
T PLN03133 456 YSLITWKTLAICIFGTE--------VVSAKYVMKTDDDAFVRVDEVLASLKRTNVSHGLLYGLINSDSQPHRNPDSKWYI 527 (636)
T ss_pred hhhhHHHHHHHHHHHHh--------CCCceEEEEcCCceEEcHHHHHHHHHhcCCCCceEEEEeccCCCcccCCCCCCCC
Confidence 998 999999998753 8899999999999999999999999764 45679999985 333
Q ss_pred -----CCCCCCcccccceeeeCHHHHHHHHHcC-CCCCCCCCchhHHHHHHHHhcCC---CCccccCccceeecCCCCCc
Q 020721 209 -----SMDPFVDYMSGMGYLVSWDIAEWIRDSD-IPKNHLEGPEDKVFGAWIREGRR---AKNRYNAKWSMYNFPVPRTA 279 (322)
Q Consensus 209 -----~~~~yp~y~~G~gYvlS~d~~~~i~~~~-~~~~~~~~~EDv~vG~~l~~~~~---~~~~~~~~~~f~~~~~~~~~ 279 (322)
+.+.|||||+|+|||||+|+|+.|++.. ..+.+.+++|||++|.|++++++ ... +..+.+|+ .+.
T Consensus 528 s~~eyp~~~YPpYasG~gYVlS~Dla~~L~~~s~s~~l~~f~lEDVyvGi~l~~l~k~gl~v~-~~~~~r~~-----~~~ 601 (636)
T PLN03133 528 SPEEWPEETYPPWAHGPGYVVSRDIAKEVYKRHKEGRLKMFKLEDVAMGIWIAEMKKEGLEVK-YENDGRIY-----NEG 601 (636)
T ss_pred CHHHCCCCCCCCCCCcCEEEEcHHHHHHHHHhhhhcccCcCChhhHhHHHHHHHhcccCCCce-eeCCCccc-----CCc
Confidence 3567999999999999999999999854 23678999999999999986543 221 11112222 356
Q ss_pred ccccccCCEEEEEccCCHHHHHHHHHHcccc
Q 020721 280 CTHELWADTIAVHLLKNQEKWIETLTYFNVT 310 (322)
Q Consensus 280 c~~~~~~~~ia~H~~~~~~~~~~~~~~fn~~ 310 (322)
|. .+.|++|+.+|.+ +..+|+....+
T Consensus 602 C~----~~~i~~H~~sP~e-M~~lW~~l~~~ 627 (636)
T PLN03133 602 CK----DGYVVAHYQSPRE-MLCLWQKLQEG 627 (636)
T ss_pred CC----CCeEEEecCCHHH-HHHHHHHHhcc
Confidence 64 4579999999998 55677776544
No 4
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=100.00 E-value=3.6e-43 Score=335.12 Aligned_cols=192 Identities=20% Similarity=0.256 Sum_probs=162.3
Q ss_pred CCCCeeEEEEEECCCCCHHHHHHHHHHhccCC------CCCCceEEEEEecCCC--chhHHHHHHHHHhhCCCeEEeccC
Q 020721 70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQS------PTGAQVDVKFVFCNLT--KEDQKVLVALEIMRYDDIIILNCK 141 (322)
Q Consensus 70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~------~~~~~v~~~Fvlg~~~--~~~~~~~l~~E~~~~~DIv~ld~~ 141 (322)
.+.+++++|+|.|++++++||++||+|||++. .....++++||+|++. +...+.+|++|+++|||||++||.
T Consensus 135 ~~~~~~LvIgI~Sap~~~~RR~AIR~TWg~~~~~~~kle~~~gv~vrFVIG~s~~~~~~ldr~Le~Ea~~ygDIL~lDfv 214 (408)
T PLN03193 135 GKRRYLMVVGINTAFSSRKRRDSVRATWMPQGEKRKKLEEEKGIIIRFVIGHSATSGGILDRAIEAEDRKHGDFLRLDHV 214 (408)
T ss_pred CcceEEEEEEEeCCCCCHHHHHHHHHHHcCCcccccccccCCcEEEEEEeecCCCcchHHHHHHHHHHHHhCCEEEEecc
Confidence 45568999999999999999999999999763 1235689999999986 456777899999999999999999
Q ss_pred CcCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEee--cCC--------
Q 020721 142 ENMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVI--PCR-------- 208 (322)
Q Consensus 142 D~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~--p~r-------- 208 (322)
|+|.| .||+++|+|+.++ .+++|+||+|||+|||+++|+.+|+... .+++|+|++. |.+
T Consensus 215 DsY~NLT~KTl~~f~wA~~~--------~dAkF~mK~DDDvfVnv~~L~~~L~~~~~~~rlYiG~m~~gPvr~~~~~ky~ 286 (408)
T PLN03193 215 EGYLELSAKTKTYFATAVAM--------WDADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYH 286 (408)
T ss_pred cccccchHHHHHHHHHHHHc--------CCCeEEEEcCCCceEcHHHHHHHHHhcCCCCCEEEEecccCccccCCCCcCc
Confidence 99987 9999999999874 3889999999999999999999997653 4469999985 332
Q ss_pred ---------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCccceeec
Q 020721 209 ---------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNF 273 (322)
Q Consensus 209 ---------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~ 273 (322)
+.+.|||||+|+|||||+|+|+.|+.+.. ..+.+.+|||++|.||..+++. +++ +.+||+.
T Consensus 287 epe~w~~~~~~~~YPpyAsG~gYVlS~DLa~~I~~n~~-~L~~y~~EDV~vG~Wl~~L~V~--~vd-d~~fcc~ 356 (408)
T PLN03193 287 EPEYWKFGENGNKYFRHATGQLYAISKDLASYISINQH-VLHKYANEDVSLGSWFIGLDVE--HID-DRRLCCG 356 (408)
T ss_pred CcccccccCccccCCCCCCcceEEehHHHHHHHHhChh-hhcccCcchhhhhhHhccCCce--eee-cccccCC
Confidence 12679999999999999999999997663 5678899999999999777664 333 4678775
No 5
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=100.00 E-value=3.5e-40 Score=291.96 Aligned_cols=167 Identities=27% Similarity=0.359 Sum_probs=146.8
Q ss_pred HHHHHHHHHhccCC-CCCCceEEEEEecCCC--chhHHHHHHHHHhhCCCeEEeccCCcCCc--hHHHHHHHhhhhHhcC
Q 020721 88 HRRHFLRMIYGTQS-PTGAQVDVKFVFCNLT--KEDQKVLVALEIMRYDDIIILNCKENMNK--GKTYTYFSSLPEIFNT 162 (322)
Q Consensus 88 ~rR~aIR~TW~~~~-~~~~~v~~~Fvlg~~~--~~~~~~~l~~E~~~~~DIv~ld~~D~y~n--~Kt~~~~~w~~~~~~~ 162 (322)
+||++||+|||+.. ..+.+++++||+|.++ ++..+..|.+|+++|+||||+|+.|+|.| .||+++|+|+.++
T Consensus 1 ~rR~~IR~TW~~~~~~~~~~~~~~FvvG~~~~~~~~~~~~l~~E~~~y~Dil~~d~~D~y~nlt~K~~~~~~w~~~~--- 77 (195)
T PF01762_consen 1 ERRQAIRETWGNQRNFKGVRVKVVFVVGESPNSDSDLQEALQEEAEKYGDILQGDFVDSYRNLTLKTLAGLKWASKH--- 77 (195)
T ss_pred ChHHHHHHHHhcccccCCCcEEEEEEEecCCCCcHHHHHHhhhhhhhcCceEeeecccccchhhHHHHHHHHHHHhh---
Confidence 59999999999886 5668999999999988 56677789999999999999999999988 9999999999874
Q ss_pred CCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeec----CC-------------CCCCCCccccccee
Q 020721 163 SDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIP----CR-------------SMDPFVDYMSGMGY 222 (322)
Q Consensus 163 ~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p----~r-------------~~~~yp~y~~G~gY 222 (322)
||+++|++|+|||+|||+++|.+.|... +.++.++|.... .+ +.+.||+||+|+||
T Consensus 78 ----c~~~~~v~k~DDD~~vn~~~l~~~L~~~~~~~~~~~~~g~~~~~~~~~r~~~~kw~v~~~~y~~~~yP~y~~G~~y 153 (195)
T PF01762_consen 78 ----CPNAKYVLKVDDDVFVNPDRLVSFLKSLKQDPSKNSIYGGCIKNGPPIRDPSSKWYVSEEEYPDDYYPPYCSGGGY 153 (195)
T ss_pred ----CCchhheeecCcEEEEehHHhhhhhhhcccCccccccccccccCCccccccccCceeeeeecccccCCCcCCCCeE
Confidence 8899999999999999999999999887 445677776541 11 35579999999999
Q ss_pred eeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCc
Q 020721 223 LVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKN 262 (322)
Q Consensus 223 vlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~ 262 (322)
+||+++|+.|++.+ ...+.+++|||++|.|+.++|+.+.
T Consensus 154 vls~~~v~~i~~~~-~~~~~~~~eDv~iGi~~~~~~i~~~ 192 (195)
T PF01762_consen 154 VLSSDVVKRIYKAS-SHTPFFPLEDVFIGILAEKLGIKPI 192 (195)
T ss_pred EecHHHHHHHHHHh-hcCCCCCchHHHHHHHHHHCCCCcc
Confidence 99999999999987 4679999999999999999998653
No 6
>KOG2288 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=99.97 E-value=1.8e-30 Score=231.55 Aligned_cols=189 Identities=23% Similarity=0.318 Sum_probs=157.9
Q ss_pred CCCeeEEEEEECCCCCHHHHHHHHHHhccCC------CCCCceEEEEEecCC-CchhHHHHHHHHHhhCCCeEEec-cCC
Q 020721 71 SEDIRILIGILTLPDQYHRRHFLRMIYGTQS------PTGAQVDVKFVFCNL-TKEDQKVLVALEIMRYDDIIILN-CKE 142 (322)
Q Consensus 71 ~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~------~~~~~v~~~Fvlg~~-~~~~~~~~l~~E~~~~~DIv~ld-~~D 142 (322)
.++++++|+|.|+++..+||+.+|+||..+. .....|.++|++|.. .+.....+|++|.++|+|.+.+| .+|
T Consensus 8 ~~k~l~vigI~T~f~s~~RR~~vR~TWmp~~~~l~rle~e~gv~~RFvIG~~~~g~~~~r~ie~E~~~~~DfllLd~h~E 87 (274)
T KOG2288|consen 8 RRKVLLVIGINTAFSSRKRRDSVRQTWMPSGEGLKRLEEEKGVIIRFVIGTATLGASLDRALEEENAQHGDFLLLDRHEE 87 (274)
T ss_pred ccceEEEEEeecccchhhhHHHHHHhhcCCccchhhhccccceEEEEEeccCCccHHHHHHHHHHHHhcCCeEeechhHH
Confidence 6789999999999999999999999998762 223467799999984 45566789999999999999999 999
Q ss_pred cCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEeecC----C-------
Q 020721 143 NMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVIPC----R------- 208 (322)
Q Consensus 143 ~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~p~----r------- 208 (322)
.|.+ .||+.+|.++..++ +++|++|+|||+|||++.|...|.... +..+|||++... +
T Consensus 88 ~Y~~Ls~Kt~~~f~~A~~~~--------daeFyvKvDDDv~v~l~~L~~~la~~r~~pr~YiGcmksg~v~~~~~~kw~E 159 (274)
T KOG2288|consen 88 AYEELSAKTKAFFSAAVAHW--------DAEFYVKVDDDVYVRLARLGTLLARERSHPRLYIGCMKSGPVLTQPGGKWYE 159 (274)
T ss_pred HHHHHHHHHHHHHHHHHHhc--------cceEEEEccccceecHHHHHHHHHhhccCCceEEEEecCCccccCCCCcccC
Confidence 9987 99999999999986 789999999999999999999997764 468999998421 0
Q ss_pred C----CC--CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCcccee
Q 020721 209 S----MD--PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMY 271 (322)
Q Consensus 209 ~----~~--~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~ 271 (322)
| .+ .|-+|+.|++|+||+|++..|+-+.. ....+..|||.+|-|+..+.+ ++++. +++|
T Consensus 160 peWkfg~~g~YfrhA~G~~YvlS~dLa~yi~in~~-lL~~y~nEDVSlGaW~~gldV--~h~dd-~rlC 224 (274)
T KOG2288|consen 160 PEWKFGDNGNYFRHATGGGYVLSKDLATYISINRQ-LLHKYANEDVSLGAWMIGLDV--EHVDD-PRLC 224 (274)
T ss_pred hhhhcCcccccchhccCceEEeeHHHHHHHHHhHH-HHHhhccCCcccceeeeeeee--eEecC-Cccc
Confidence 1 11 28899999999999999999998764 367899999999999976664 34443 3444
No 7
>KOG2246 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=99.77 E-value=1.3e-17 Score=160.18 Aligned_cols=169 Identities=22% Similarity=0.287 Sum_probs=129.0
Q ss_pred ecCCCCCCeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEe---cCCCchhHHHHHHHHHhhCCCeEEeccC
Q 020721 66 ATNSSSEDIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVF---CNLTKEDQKVLVALEIMRYDDIIILNCK 141 (322)
Q Consensus 66 ~~~~~~~~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvl---g~~~~~~~~~~l~~E~~~~~DIv~ld~~ 141 (322)
.+.|...+.+|+|.|.|.+.+. .|-..+-+||+++|.. ..|+- ++.. ..+. .|..+..
T Consensus 83 ~~~~l~r~~~v~cwv~t~~~~~~~~~~~v~~TW~~rc~~-----~~f~s~~~s~~~------------~~f~-~v~~~~~ 144 (364)
T KOG2246|consen 83 IALWLSRSGRVLCWVLTSPMRHVTRADAVKETWLKRCDK-----GIFFSPTLSKDD------------SRFP-TVYYNLP 144 (364)
T ss_pred chhccCCCceEEEEEEecCcCceeehhhhhcccccccCc-----ceecCccCCCCC------------CcCc-eeeccCC
Confidence 5667889999999999998877 5666999999998732 35554 3222 1122 3578899
Q ss_pred CcCCc--hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCCCCCCccc-
Q 020721 142 ENMNK--GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSMDPFVDYM- 217 (322)
Q Consensus 142 D~y~n--~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~~~yp~y~- 217 (322)
|+|.+ +||...++++.++.. .+++|++|+|||||+.++||...|... |.+.+|+|+........ .|+
T Consensus 145 ~g~~~~~~ktr~~~~yv~~~~~------~~~dWf~~aDDDTy~i~eNLr~~L~~yDp~~p~YiG~~~~~~~~~---~y~~ 215 (364)
T KOG2246|consen 145 DGYRSLWRKTRIAFKYVYDHIL------KDYDWFLKADDDTYFIMENLRYVLSKYDPEKPVYLGYRSKSYFQN---GYSS 215 (364)
T ss_pred cchHHHHHHHHHHHHHHHHhcc------CCCCeEEeccCCeEEeHHHHHHHHhhcCCCCcEEecccccccccc---cccc
Confidence 99988 999999999986543 489999999999999999999999997 56799999975311112 276
Q ss_pred ccceeeeCHHHHHHHHHcCC-----CCCCCC-CchhHHHHHHHHhcCCCC
Q 020721 218 SGMGYLVSWDIAEWIRDSDI-----PKNHLE-GPEDKVFGAWIREGRRAK 261 (322)
Q Consensus 218 ~G~gYvlS~d~~~~i~~~~~-----~~~~~~-~~EDv~vG~~l~~~~~~~ 261 (322)
+|+||++|+++.+.+++... +..... +.||+.+|.||+++|+..
T Consensus 216 g~ag~~ls~aa~~~la~~l~~~~~~C~~~~~~~~eD~~i~~Cl~~~GV~~ 265 (364)
T KOG2246|consen 216 GGAGYVLSFAALRRLAERLLNNEDKCPQRYPSYGEDRRIGRCLAEVGVPA 265 (364)
T ss_pred CCCCcceeHHHHHHHHHHHhcchhhcccccCCchhHHHHHHHHHHhCCCc
Confidence 56899999999999887421 111111 599999999999999863
No 8
>PF02434 Fringe: Fringe-like; InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates. Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng. This entry consists of Fringe proteins and related glycosyltransferase enzymes including: Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains []. Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development []. ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=99.75 E-value=1.4e-17 Score=153.20 Aligned_cols=197 Identities=17% Similarity=0.200 Sum_probs=104.1
Q ss_pred eeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCc--hHHH
Q 020721 74 IRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNK--GKTY 150 (322)
Q Consensus 74 ~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n--~Kt~ 150 (322)
-.|+|+|+|++++. .|-.+|++||++++. . ..|+.....++.+. .+ .-.+++.-++..++.. .+..
T Consensus 6 ~dI~i~V~T~~k~h~tR~~~I~~TW~~~~~---~--~~~ifsd~~d~~l~----~~--~~~~l~~~~~~~~~~~~~~~~~ 74 (252)
T PF02434_consen 6 DDIFIAVKTTKKFHKTRAPAIKQTWAKRCN---K--QTFIFSDAEDPSLP----TV--TGVHLVNPNCDAGHCRKTLSCK 74 (252)
T ss_dssp GGEEEEEE--GGGTTTTHHHHHHTGGGGSG---G--GEEEEESS--HHHH----HH--HGGGEEE-------------HH
T ss_pred ccEEEEEEeCHHHHHHHHHHHHHHHHhhcC---C--ceEEecCccccccc----cc--cccccccCCCcchhhHHHHHHH
Confidence 36899999999876 777899999999872 2 35654544443332 22 2234555555555442 2333
Q ss_pred HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCC------------CCCCccc
Q 020721 151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSM------------DPFVDYM 217 (322)
Q Consensus 151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~------------~~yp~y~ 217 (322)
+.+.+-..+. ++++|++++|||+||++++|++.|... +.+++|+|.....++. ..--.|+
T Consensus 75 ~~~~y~~~~~-------~~~~Wf~~~DDDtyv~~~~L~~~L~~~~~~~~~yiG~~~~~~~~~~~~~~~~~~~~~~~~~f~ 147 (252)
T PF02434_consen 75 MAYEYDHFLN-------SDKDWFCFADDDTYVNVENLRRLLSKYDPSEPIYIGRPSGDRPIEIIHRFNPNKSKDSGFWFA 147 (252)
T ss_dssp HHHHHHHHHH-------HT-SEEEEEETTEEE-HHHHHHHHTTS-TTS--EEE-EE----------------------EE
T ss_pred HHHHHHhhhc-------CCceEEEEEeCCceecHHHHHHHHhhCCCccCEEeeeeccCccceeeccccccccCcCceEee
Confidence 3222222211 367999999999999999999999997 5579999987522110 0111466
Q ss_pred -ccceeeeCHHHHHHHHHc--CCC----CCCCCCchhHHHHHHHHh-cCCCCccccCccceeecCCC------CCccccc
Q 020721 218 -SGMGYLVSWDIAEWIRDS--DIP----KNHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPVP------RTACTHE 283 (322)
Q Consensus 218 -~G~gYvlS~d~~~~i~~~--~~~----~~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~~------~~~c~~~ 283 (322)
+|+||+||+.++++|... ... ....-..||+.+|.|+.. +|+...+. +.|+..... ...+.
T Consensus 148 ~GGaG~vlSr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~~lgv~lt~s---~~fhs~~~~l~~~~~~~l~~-- 222 (252)
T PF02434_consen 148 TGGAGYVLSRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIENLLGVPLTHS---PLFHSHLENLQDYNPETLHR-- 222 (252)
T ss_dssp -GGG-EEEEHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHHTT---EEE----TT---SSS-GGG--TTTGGG--
T ss_pred CCCeeHHHhHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHhcCCcceeec---hhhcccCcccccCCHHHhcc--
Confidence 678999999999999542 111 112345899999999999 88865432 345443322 11121
Q ss_pred ccCCEEEEEccC
Q 020721 284 LWADTIAVHLLK 295 (322)
Q Consensus 284 ~~~~~ia~H~~~ 295 (322)
.-+|++|...
T Consensus 223 --q~~~s~~~~~ 232 (252)
T PF02434_consen 223 --QVPISYHKFE 232 (252)
T ss_dssp ---SEEE-EEET
T ss_pred --CCCeecCCCc
Confidence 3479999985
No 9
>PLN03153 hypothetical protein; Provisional
Probab=99.44 E-value=7.1e-12 Score=123.28 Aligned_cols=204 Identities=19% Similarity=0.238 Sum_probs=122.3
Q ss_pred CeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEecc-CCc----CCc
Q 020721 73 DIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNC-KEN----MNK 146 (322)
Q Consensus 73 ~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~-~D~----y~n 146 (322)
--.|+++|.+..+.. +|+..|+.+|....-.| .+|+.....+... -.++-.... .|+ |+|
T Consensus 121 ~~hIvF~I~~s~~~w~~R~~yik~wW~p~~~rg----~v~ld~~~~~~~~----------~~~~P~i~is~d~s~f~y~~ 186 (537)
T PLN03153 121 LNHIMFGIAGSSQLWKRRKELVRLWWRPNQMRG----HVWLEEQVSPEEG----------DDSLPPIMVSEDTSRFRYTN 186 (537)
T ss_pred cccEEEEEEEchhhhhhhhhhhhhhcCccccee----EEEecccCCCCCC----------cCCCCCEEeCCCcccccccC
Confidence 356889999988776 77789999998642111 4555443322100 001111111 122 333
Q ss_pred ----hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecCCCCCC-CC-ccc-c
Q 020721 147 ----GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPCRSMDP-FV-DYM-S 218 (322)
Q Consensus 147 ----~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~r~~~~-yp-~y~-~ 218 (322)
.-+..+.+-+.+.+.. .-|+++|++++|||||+.++||++.|... +.+..|+|........+. +. .|+ +
T Consensus 187 ~~Gh~sa~rI~rmv~et~~~---~~pd~kWfVf~DDDTyf~~~NLv~~Ls~YDptkp~YIGs~Se~~~qn~~f~~~fA~G 263 (537)
T PLN03153 187 PTGHPSGLRISRIVLESFRL---GLPDVRWFVLGDDDTIFNADNLVAVLSKYDPSEMVYVGGPSESHSANSYFSHNMAFG 263 (537)
T ss_pred CCCcHHHHHHHHHHHHHHHh---hCCCCCEEEEecCCccccHHHHHHHHhhcCCCCCEEecccccccccccccccccccC
Confidence 1222222222222211 13689999999999999999999999987 567899997643222222 11 344 7
Q ss_pred cceeeeCHHHHHHHHHcC--CC-CCCCCCchhHHHHHHHHhcCCCCccccCccceeecCC---CCCcccccccCCEEEEE
Q 020721 219 GMGYLVSWDIAEWIRDSD--IP-KNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPV---PRTACTHELWADTIAVH 292 (322)
Q Consensus 219 G~gYvlS~d~~~~i~~~~--~~-~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~---~~~~c~~~~~~~~ia~H 292 (322)
|+||+||+.+++.|.+.- +. +-+....+|..+|.|+..+|+..+. .++|++..- +.+-....-..-.+++|
T Consensus 264 GAG~~LSrPLae~L~~~~d~C~~rY~~~~~gD~rL~~CL~elGV~LT~---~~gfhQ~D~~Gd~~G~les~p~~P~vSlH 340 (537)
T PLN03153 264 GGGIAISYPLAEALSRILDDCLDRYPKLYGSDDRLHACITELGVPLSR---EPGFHQWDIRGNAHGLLSSHPIAPFVSIH 340 (537)
T ss_pred CceEEEcHHHHHHHHHHhhhhhhhcccCCCcHHHHHHHHHHcCCCcee---cCCccccccCCCcchHhhcCCCCCceeee
Confidence 899999999999988742 11 1123456899999999999986653 356665432 22222211012368999
Q ss_pred ccCC
Q 020721 293 LLKN 296 (322)
Q Consensus 293 ~~~~ 296 (322)
+.+-
T Consensus 341 H~~~ 344 (537)
T PLN03153 341 HVEA 344 (537)
T ss_pred eccc
Confidence 8875
No 10
>KOG3708 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.84 E-value=6.5e-05 Score=73.99 Aligned_cols=204 Identities=16% Similarity=0.206 Sum_probs=127.1
Q ss_pred CCCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEec---cCCcCCch
Q 020721 71 SEDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILN---CKENMNKG 147 (322)
Q Consensus 71 ~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld---~~D~y~n~ 147 (322)
...=+|+++|+|. ..---+|-.|-|..-+ ++.|+.+...-.. |.-+++ +.|... .
T Consensus 23 G~RErl~~aVmte---~tlA~a~NrT~ahhvp-----rv~~F~~~~~i~~-------------~~a~~~~vs~~d~r~-~ 80 (681)
T KOG3708|consen 23 GTRERLMAAVMTE---STLALAINRTLAHHVP-----RVHLFADSSRIDN-------------DLAQLTNVSPYDLRG-Q 80 (681)
T ss_pred hhHHHHHHHHHHH---HHHHHHHHHHHHhhcc-----eeEEeeccccccc-------------cHhhccccCccccCc-c
Confidence 3345677888882 2566688888886631 3677777543211 222222 333333 3
Q ss_pred HHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC-CCeEEEEeecCCCCCCCCcccc-cceeee
Q 020721 148 KTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR-EDLYYGYVIPCRSMDPFVDYMS-GMGYLV 224 (322)
Q Consensus 148 Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~~~y~G~~~p~r~~~~yp~y~~-G~gYvl 224 (322)
||. +-++++.+++.+ +|||++-+-||+||+...|++.+..+.- .++|+|.-.-..+ + =|+ |.||+|
T Consensus 81 ~~~s~vl~~l~~~~~~------~YDwFll~~D~tYv~a~~L~~l~~hmsin~dlymGEe~~~gs-~----rC~l~~G~LL 149 (681)
T KOG3708|consen 81 KTHSMVLGLLFNMVHN------NYDWFLLAKDSTYVNAFVLLRLIDHMSINEDLYMGEEAEDGS-G----RCRLDTGMLL 149 (681)
T ss_pred ccHHHHHHHHHHhhcc------ccceEEEecCcceecHHHHHHHHhhcccccccccchhhhCcc-C----ccccccceee
Confidence 554 456777777643 7899999999999999999999988864 6899996432111 2 274 689999
Q ss_pred CHHHHHHHHHcC--CCCCCCCCchhHHHHHHHHh-cCCCCccccCccceeecCC----CCC------cccccccCCEEEE
Q 020721 225 SWDIAEWIRDSD--IPKNHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPV----PRT------ACTHELWADTIAV 291 (322)
Q Consensus 225 S~d~~~~i~~~~--~~~~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~----~~~------~c~~~~~~~~ia~ 291 (322)
|+.++..+-.+- +.+.-.-.=.|+++|+|+.- .|++-.-. +.++.+|.. +.. .-..+-..+..++
T Consensus 150 S~s~l~~lrnnle~C~~~~lsad~d~~lgrCi~~At~v~C~~~--hQGvrq~s~~~dspgr~~~~~e~~~s~aFr~A~tv 227 (681)
T KOG3708|consen 150 SQSLLHALRNNLEGCRNDILSADPDEWLGRCIQDATGVGCKPL--HQGVRQYSEREDSPGRHDSIPEWEGSPAFRSALTV 227 (681)
T ss_pred cHHHHHHHHhhHHHhhcccccCCcHHHHHHHHHHhhcCCccch--hhhHHhhhHhhcCCCccccchhhcCChHHhhhhcc
Confidence 999999998752 22233345568999999975 45542111 122212110 000 0000112568999
Q ss_pred EccCCHHHHHHHHHHccc
Q 020721 292 HLLKNQEKWIETLTYFNV 309 (322)
Q Consensus 292 H~~~~~~~~~~~~~~fn~ 309 (322)
|.+..+..+.++.+||-.
T Consensus 228 ~pv~~p~d~yrLH~yfsr 245 (681)
T KOG3708|consen 228 HPVLSPADMYRLHKYFSR 245 (681)
T ss_pred CccCCHHHHHHHHHHHHH
Confidence 999888777778888853
No 11
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=95.77 E-value=1.1 Score=43.38 Aligned_cols=171 Identities=13% Similarity=0.053 Sum_probs=97.8
Q ss_pred eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC--eEEeccCCcC-CchHHH
Q 020721 74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD--IIILNCKENM-NKGKTY 150 (322)
Q Consensus 74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D--Iv~ld~~D~y-~n~Kt~ 150 (322)
+.+-|+|++.-....-.+.++.-- .+.. ..+.++++...+++.... .+++=.++|.+ |..+.-.++- .|.|.-
T Consensus 41 p~VSViiP~~nee~~l~~~L~Sl~-~q~Y--p~~EIivvdd~s~D~t~~-iv~~~~~~~p~~~i~~v~~~~~~G~~~K~~ 116 (373)
T TIGR03472 41 PPVSVLKPLHGDEPELYENLASFC-RQDY--PGFQMLFGVQDPDDPALA-VVRRLRADFPDADIDLVIDARRHGPNRKVS 116 (373)
T ss_pred CCeEEEEECCCCChhHHHHHHHHH-hcCC--CCeEEEEEeCCCCCcHHH-HHHHHHHhCCCCceEEEECCCCCCCChHHH
Confidence 557778887755554455665433 3331 136677776666554332 33333355665 3233222222 236765
Q ss_pred HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEee--cCC-----------CCC----
Q 020721 151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVI--PCR-----------SMD---- 211 (322)
Q Consensus 151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~--p~r-----------~~~---- 211 (322)
...+-+.. .+.+|++..|+|+.+..+.|.+.+.... .-++..|... +.. ...
T Consensus 117 ~l~~~~~~---------a~ge~i~~~DaD~~~~p~~L~~lv~~~~~~~v~~V~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 187 (373)
T TIGR03472 117 NLINMLPH---------ARHDILVIADSDISVGPDYLRQVVAPLADPDVGLVTCLYRGRPVPGFWSRLGAMGINHNFLPS 187 (373)
T ss_pred HHHHHHHh---------ccCCEEEEECCCCCcChhHHHHHHHHhcCCCcceEeccccCCCCCCHHHHHHHHHhhhhhhHH
Confidence 54332222 2679999999999999888877765542 2233333211 110 000
Q ss_pred -------CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 212 -------PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 212 -------~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.-+.++.|.++++.+++.+.+--... ....-.||..+|.-+++.|.
T Consensus 188 ~~~~~~~~~~~~~~G~~~a~RR~~l~~iGGf~~--~~~~~~ED~~l~~~i~~~G~ 240 (373)
T TIGR03472 188 VMVARALGRARFCFGATMALRRATLEAIGGLAA--LAHHLADDYWLGELVRALGL 240 (373)
T ss_pred HHHHHhccCCccccChhhheeHHHHHHcCChHH--hcccchHHHHHHHHHHHcCC
Confidence 01245789999999999999865331 12234699999999988775
No 12
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=95.46 E-value=1.6 Score=42.47 Aligned_cols=174 Identities=14% Similarity=0.076 Sum_probs=93.9
Q ss_pred CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHH-HhhCC---CeEEeccCCcC-C-c
Q 020721 73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALE-IMRYD---DIIILNCKENM-N-K 146 (322)
Q Consensus 73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E-~~~~~---DIv~ld~~D~y-~-n 146 (322)
.+.+-|+|++.-+...-.+.++.--.+ ... ...++++|-..++++.. ++.++ .+++. .+..+...+.- . +
T Consensus 39 ~p~VSVIIpa~Ne~~~L~~~L~sL~~q-~yp-~~~eIIVVDd~StD~T~--~i~~~~~~~~~~~~~i~vi~~~~~~~g~~ 114 (384)
T TIGR03469 39 WPAVVAVVPARNEADVIGECVTSLLEQ-DYP-GKLHVILVDDHSTDGTA--DIARAAARAYGRGDRLTVVSGQPLPPGWS 114 (384)
T ss_pred CCCEEEEEecCCcHhHHHHHHHHHHhC-CCC-CceEEEEEeCCCCCcHH--HHHHHHHHhcCCCCcEEEecCCCCCCCCc
Confidence 356777788775443333444433322 211 13567777665655432 22223 22333 45545433221 1 2
Q ss_pred hHHHH---HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeec-CC------------
Q 020721 147 GKTYT---YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIP-CR------------ 208 (322)
Q Consensus 147 ~Kt~~---~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p-~r------------ 208 (322)
+|..+ +++.+.+ .+++.+|++.+|+|+.+..+.|.+.+.... ..++..|.... ..
T Consensus 115 Gk~~A~n~g~~~A~~-------~~~~gd~llflDaD~~~~p~~l~~lv~~~~~~~~~~vs~~~~~~~~~~~~~~~~~~~~ 187 (384)
T TIGR03469 115 GKLWAVSQGIAAART-------LAPPADYLLLTDADIAHGPDNLARLVARARAEGLDLVSLMVRLRCESFWEKLLIPAFV 187 (384)
T ss_pred chHHHHHHHHHHHhc-------cCCCCCEEEEECCCCCCChhHHHHHHHHHHhCCCCEEEecccccCCCHHHHHHHHHHH
Confidence 67554 3333322 233479999999999999887777765432 22232222110 00
Q ss_pred -------------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 209 -------------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 209 -------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.......++.|++.++++++.+.+--... ......||+.++.-+++.|.
T Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~G~~~lirr~~~~~vGGf~~--~~~~~~ED~~L~~r~~~~G~ 249 (384)
T TIGR03469 188 FFFQKLYPFRWVNDPRRRTAAAAGGCILIRREALERIGGIAA--IRGALIDDCTLAAAVKRSGG 249 (384)
T ss_pred HHHHHhcchhhhcCCCccceeecceEEEEEHHHHHHcCCHHH--HhhCcccHHHHHHHHHHcCC
Confidence 00011234679999999999998854321 12235799999999988763
No 13
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=95.10 E-value=0.18 Score=44.47 Aligned_cols=169 Identities=21% Similarity=0.206 Sum_probs=79.4
Q ss_pred eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCe--EEeccCCcCC-chHHH
Q 020721 74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDI--IILNCKENMN-KGKTY 150 (322)
Q Consensus 74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DI--v~ld~~D~y~-n~Kt~ 150 (322)
++|.|+|++.-....-++.++.-....- .+++++.+...++++ ....+++-.+.+... ..+.-..+.. +.|.-
T Consensus 1 P~v~Vvip~~~~~~~l~~~l~sl~~~~~---~~~~v~vvd~~~~~~-~~~~~~~~~~~~~~~~v~vi~~~~~~g~~~k~~ 76 (228)
T PF13641_consen 1 PRVSVVIPAYNEDDVLRRCLESLLAQDY---PRLEVVVVDDGSDDE-TAEILRALAARYPRVRVRVIRRPRNPGPGGKAR 76 (228)
T ss_dssp --EEEE--BSS-HHHHHHHHHHHTTSHH---HTEEEEEEEE-SSS--GCTTHHHHHHTTGG-GEEEEE----HHHHHHHH
T ss_pred CEEEEEEEecCCHHHHHHHHHHHHcCCC---CCeEEEEEECCCChH-HHHHHHHHHHHcCCCceEEeecCCCCCcchHHH
Confidence 3577788877554445555555553211 245555555333332 222344444555542 3333222221 12433
Q ss_pred HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC---------------------
Q 020721 151 TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC--------------------- 207 (322)
Q Consensus 151 ~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~--------------------- 207 (322)
+ +.+..... +.+|++..|||+.+..+.|.+.+... +.-....|.....
T Consensus 77 a-~n~~~~~~--------~~d~i~~lD~D~~~~p~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (228)
T PF13641_consen 77 A-LNEALAAA--------RGDYILFLDDDTVLDPDWLERLLAAFADPGVGAVGGPVFPDNDRNWLTRLQDLFFARWHLRF 147 (228)
T ss_dssp H-HHHHHHH-----------SEEEEE-SSEEE-CHHHHHHHHHHHBSS--EEEEEEEETTCCCEEEE-TT--S-EETTTS
T ss_pred H-HHHHHHhc--------CCCEEEEECCCcEECHHHHHHHHHHHHhCCCCeEeeeEeecCCCCHHHHHHHHHHhhhhhhh
Confidence 2 23333321 46999999999999988777766554 2222222322110
Q ss_pred CC-CCCC-CcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 208 RS-MDPF-VDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 208 r~-~~~y-p~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.. ...+ -.++.|++.++.+++++.+..... ....||..++.-+.+.|.
T Consensus 148 ~~~~~~~~~~~~~G~~~~~rr~~~~~~g~fd~----~~~~eD~~l~~r~~~~G~ 197 (228)
T PF13641_consen 148 RSGRRALGVAFLSGSGMLFRRSALEEVGGFDP----FILGEDFDLCLRLRAAGW 197 (228)
T ss_dssp -TT-B----S-B--TEEEEEHHHHHHH-S--S----SSSSHHHHHHHHHHHTT-
T ss_pred hhhhcccceeeccCcEEEEEHHHHHHhCCCCC----CCcccHHHHHHHHHHCCC
Confidence 00 0011 135689999999999999965331 445599999999988765
No 14
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=95.08 E-value=1.3 Score=38.33 Aligned_cols=130 Identities=17% Similarity=0.117 Sum_probs=77.0
Q ss_pred ceEEEEEecCCCchhHHHHHHHHHhhCC--CeEEeccCCcC-CchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEe
Q 020721 106 QVDVKFVFCNLTKEDQKVLVALEIMRYD--DIIILNCKENM-NKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYI 182 (322)
Q Consensus 106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~--DIv~ld~~D~y-~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv 182 (322)
.+.+++|-..++++.. ..+++-.+.+. ++.......+. .+.|.-.. ....+. .+.+|++..|+|+.+
T Consensus 30 ~~eiivVdd~s~d~t~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-n~g~~~--------a~~d~i~~~D~D~~~ 99 (196)
T cd02520 30 KYEILFCVQDEDDPAI-PVVRKLIAKYPNVDARLLIGGEKVGINPKVNNL-IKGYEE--------ARYDILVISDSDISV 99 (196)
T ss_pred CeEEEEEeCCCcchHH-HHHHHHHHHCCCCcEEEEecCCcCCCCHhHHHH-HHHHHh--------CCCCEEEEECCCceE
Confidence 3667777776665433 33444445554 33223222221 12343321 111221 256999999999999
Q ss_pred eHHHHHHHhcCCC--CCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 183 RLENLVKSLVPLP--REDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 183 ~~~~L~~~L~~~~--~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
..+.|.+.+.... .-++..|. ++.|++.++.+++.+.+.... .......||..++.-+.+.|.
T Consensus 100 ~~~~l~~l~~~~~~~~~~~v~~~------------~~~g~~~~~r~~~~~~~ggf~--~~~~~~~eD~~l~~rl~~~G~ 164 (196)
T cd02520 100 PPDYLRRMVAPLMDPGVGLVTCL------------CAFGKSMALRREVLDAIGGFE--AFADYLAEDYFLGKLIWRLGY 164 (196)
T ss_pred ChhHHHHHHHHhhCCCCCeEEee------------cccCceeeeEHHHHHhccChH--HHhHHHHHHHHHHHHHHHcCC
Confidence 8888777776532 22222222 678999999999999875432 122334799999988887764
No 15
>PF13506 Glyco_transf_21: Glycosyl transferase family 21
Probab=94.96 E-value=0.14 Score=44.47 Aligned_cols=106 Identities=18% Similarity=0.138 Sum_probs=72.0
Q ss_pred CchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEee--cCC------------
Q 020721 145 NKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVI--PCR------------ 208 (322)
Q Consensus 145 ~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~--p~r------------ 208 (322)
-|.|.......+.. . .++++++-.|+|+.++.+.|.+.+..+. .-++..|... +.+
T Consensus 15 ~N~Kv~nL~~~~~~-~-------a~~d~~~~~DsDi~v~p~~L~~lv~~l~~p~vglVt~~~~~~~~~~~~~~l~~~~~~ 86 (175)
T PF13506_consen 15 CNPKVNNLAQGLEA-G-------AKYDYLVISDSDIRVPPDYLRELVAPLADPGVGLVTGLPRGVPARGFWSRLEAAFFN 86 (175)
T ss_pred CChHHHHHHHHHHh-h-------CCCCEEEEECCCeeECHHHHHHHHHHHhCCCCcEEEecccccCCcCHHHHHHHHHHh
Confidence 45788776666554 1 2679999999999999998888876653 3333333221 110
Q ss_pred ------CCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCC
Q 020721 209 ------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRA 260 (322)
Q Consensus 209 ------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~ 260 (322)
..-...+++.|+++++.+++++.+.--. .....-.||..+|..+++.|..
T Consensus 87 ~~~~~~~a~~~~~~~~G~~m~~rr~~L~~~GG~~--~l~~~ladD~~l~~~~~~~G~~ 142 (175)
T PF13506_consen 87 FLPGVLQALGGAPFAWGGSMAFRREALEEIGGFE--ALADYLADDYALGRRLRARGYR 142 (175)
T ss_pred HHHHHHHHhcCCCceecceeeeEHHHHHHcccHH--HHhhhhhHHHHHHHHHHHCCCe
Confidence 0011357899999999999999874322 1233569999999999998763
No 16
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=94.48 E-value=2.8 Score=36.39 Aligned_cols=141 Identities=13% Similarity=-0.011 Sum_probs=77.4
Q ss_pred eEEEEEecCCCchhHHHHHH-HHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721 107 VDVKFVFCNLTKEDQKVLVA-LEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE 185 (322)
Q Consensus 107 v~~~Fvlg~~~~~~~~~~l~-~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~ 185 (322)
+.++.|-..+++.. ...++ .....+..+..++..+..+.+|.... .+.... ...+|++.+|+|..+..+
T Consensus 29 ~eiivvdd~s~d~t-~~~~~~~~~~~~~~v~~~~~~~~~~~g~~~a~-n~g~~~--------~~~d~i~~~D~D~~~~~~ 98 (229)
T cd04192 29 FEVILVDDHSTDGT-VQILEFAAAKPNFQLKILNNSRVSISGKKNAL-TTAIKA--------AKGDWIVTTDADCVVPSN 98 (229)
T ss_pred eEEEEEcCCCCcCh-HHHHHHHHhCCCcceEEeeccCcccchhHHHH-HHHHHH--------hcCCEEEEECCCcccCHH
Confidence 55666655544332 22333 22222334555655543333555332 222221 256999999999999887
Q ss_pred HHHHHhcCC--CCCCeEEEEeecC--C---------------------CCCCCCcccccceeeeCHHHHHHHHHcCCCCC
Q 020721 186 NLVKSLVPL--PREDLYYGYVIPC--R---------------------SMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKN 240 (322)
Q Consensus 186 ~L~~~L~~~--~~~~~y~G~~~p~--r---------------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~ 240 (322)
.|.+.+... +....+.|..... . .....+..+.|++..+++++.+.+---. ..
T Consensus 99 ~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~rr~~~~~~ggf~--~~ 176 (229)
T cd04192 99 WLLTFVAFIQKEQIGLVAGPVIYFKGKSLLAKFQRLDWLSLLGLIAGSFGLGKPFMCNGANMAYRKEAFFEVGGFE--GN 176 (229)
T ss_pred HHHHHHHHhhcCCCcEEeeeeeecCCccHHHHHHHHHHHHHHHHHhhHHHhcCccccccceEEEEHHHHHHhcCCc--cc
Confidence 777777643 2234555543211 0 0112344567899999999999986533 12
Q ss_pred CCCCchhHHHHHHHHhcCC
Q 020721 241 HLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 241 ~~~~~EDv~vG~~l~~~~~ 259 (322)
.....||..++.-+...|.
T Consensus 177 ~~~~~eD~~~~~~~~~~g~ 195 (229)
T cd04192 177 DHIASGDDELLLAKVASKY 195 (229)
T ss_pred cccccCCHHHHHHHHHhCC
Confidence 3456677777654444433
No 17
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=93.70 E-value=2.6 Score=34.39 Aligned_cols=76 Identities=21% Similarity=0.107 Sum_probs=53.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGP 245 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~ 245 (322)
+.+|++..|||..+..+.+...++.. +.-.++.|. +.|++.++++++++.+..... ......
T Consensus 74 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 138 (166)
T cd04186 74 KGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPK-------------VSGAFLLVRREVFEEVGGFDE--DFFLYY 138 (166)
T ss_pred CCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEcc-------------CceeeEeeeHHHHHHcCCCCh--hhhccc
Confidence 56999999999999988887777642 221222211 788999999999998754321 112277
Q ss_pred hhHHHHHHHHhcCC
Q 020721 246 EDKVFGAWIREGRR 259 (322)
Q Consensus 246 EDv~vG~~l~~~~~ 259 (322)
||..+..-+...|.
T Consensus 139 eD~~~~~~~~~~g~ 152 (166)
T cd04186 139 EDVDLCLRARLAGY 152 (166)
T ss_pred cHHHHHHHHHHcCC
Confidence 99998887776654
No 18
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=92.04 E-value=0.41 Score=43.95 Aligned_cols=74 Identities=16% Similarity=0.168 Sum_probs=49.0
Q ss_pred ccceeeeCHHHHHHHHHcC--C-CCCCCCCchhHHHHHHHHhcCCCCccccCccceeecCCC---CCcc-cccccCCEEE
Q 020721 218 SGMGYLVSWDIAEWIRDSD--I-PKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPVP---RTAC-THELWADTIA 290 (322)
Q Consensus 218 ~G~gYvlS~d~~~~i~~~~--~-~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~~---~~~c-~~~~~~~~ia 290 (322)
+|+|+.||..+|+.|.+.- + .+.+.+.--|-.+..|+.++|+..++ +++|++.... .+-. +++. .-.++
T Consensus 12 GGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~lgv~LT~---e~g~hQ~Di~Gd~~G~~~a~~~-~pl~S 87 (255)
T PF04646_consen 12 GGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAELGVPLTK---EPGFHQMDIRGDPSGFLEAHPL-APLVS 87 (255)
T ss_pred cCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHHhCCCcee---cCCceeEeeccCcceeeecCCC-Cceee
Confidence 7999999999999999852 1 12344555788999999999886553 3566665422 1211 1111 23689
Q ss_pred EEccC
Q 020721 291 VHLLK 295 (322)
Q Consensus 291 ~H~~~ 295 (322)
+|+++
T Consensus 88 lHH~~ 92 (255)
T PF04646_consen 88 LHHWD 92 (255)
T ss_pred eeehh
Confidence 99875
No 19
>PRK11204 N-glycosyltransferase; Provisional
Probab=90.13 E-value=12 Score=36.50 Aligned_cols=169 Identities=11% Similarity=-0.009 Sum_probs=91.9
Q ss_pred CCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721 72 EDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT 151 (322)
Q Consensus 72 ~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~ 151 (322)
+.+.+-|+|++.-+...-++.++.--. +. ....+++. +...++++..+.+++..+++..+...+..++. +|.-+
T Consensus 52 ~~p~vsViIp~yne~~~i~~~l~sl~~-q~--yp~~eiiV-vdD~s~d~t~~~l~~~~~~~~~v~~i~~~~n~--Gka~a 125 (420)
T PRK11204 52 EYPGVSILVPCYNEGENVEETISHLLA-LR--YPNYEVIA-INDGSSDNTGEILDRLAAQIPRLRVIHLAENQ--GKANA 125 (420)
T ss_pred CCCCEEEEEecCCCHHHHHHHHHHHHh-CC--CCCeEEEE-EECCCCccHHHHHHHHHHhCCcEEEEEcCCCC--CHHHH
Confidence 345688888887554333333333222 21 11344443 34333333334455555666666666544433 45433
Q ss_pred HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCC-------------------
Q 020721 152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRS------------------- 209 (322)
Q Consensus 152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~------------------- 209 (322)
+....+. .+.+|++..|+|+.+..+.|.+.++.. |.-....|.......
T Consensus 126 -ln~g~~~--------a~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (420)
T PRK11204 126 -LNTGAAA--------ARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTGNPRIRNRSTLLGRIQVGEFSSIIGLI 196 (420)
T ss_pred -HHHHHHH--------cCCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEECCceeccchhHHHHHHHHHHHHhhhHH
Confidence 2222221 267999999999999998777766543 322233332210000
Q ss_pred ---CCC--CCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 210 ---MDP--FVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 210 ---~~~--yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
... .+...+|.+.++++++++.+.--. ...-.||+.++.-+.+.|.
T Consensus 197 ~~~~~~~~~~~~~~G~~~~~rr~~l~~vgg~~----~~~~~ED~~l~~rl~~~G~ 247 (420)
T PRK11204 197 KRAQRVYGRVFTVSGVITAFRKSALHEVGYWS----TDMITEDIDISWKLQLRGW 247 (420)
T ss_pred HHHHHHhCCceEecceeeeeeHHHHHHhCCCC----CCcccchHHHHHHHHHcCC
Confidence 000 112357888999999998874322 1234699999988877664
No 20
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=90.07 E-value=10 Score=31.67 Aligned_cols=90 Identities=11% Similarity=-0.065 Sum_probs=58.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhH
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDK 248 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv 248 (322)
..+|++..|+|..+..+.|.+.++.........|...-. ....-.....|+++.+.++....+.... ........||+
T Consensus 79 ~g~~i~~lD~D~~~~~~~l~~~~~~~~~~~~v~g~~~~~-~~~~~~~~~~~~~~~~~r~~~~~~ggf~-~~~~~~~~eD~ 156 (182)
T cd06420 79 KGDYLIFIDGDCIPHPDFIADHIELAEPGVFLSGSRVLL-NEKLTERGIRGCNMSFWKKDLLAVNGFD-EEFTGWGGEDS 156 (182)
T ss_pred cCCEEEEEcCCcccCHHHHHHHHHHhCCCcEEecceeec-ccccceeEeccceEEEEHHHHHHhCCCC-cccccCCcchH
Confidence 569999999999998887777776664434444543211 1111113456888888888887554432 12223458999
Q ss_pred HHHHHHHhcCCC
Q 020721 249 VFGAWIREGRRA 260 (322)
Q Consensus 249 ~vG~~l~~~~~~ 260 (322)
.++.=+.+.|..
T Consensus 157 ~l~~r~~~~g~~ 168 (182)
T cd06420 157 ELVARLLNSGIK 168 (182)
T ss_pred HHHHHHHHcCCc
Confidence 999888888743
No 21
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=89.99 E-value=3.8 Score=33.40 Aligned_cols=110 Identities=17% Similarity=0.166 Sum_probs=61.9
Q ss_pred EEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCc------hHHHH
Q 020721 78 IGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNK------GKTYT 151 (322)
Q Consensus 78 i~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n------~Kt~~ 151 (322)
+.|.|-+...+||+.+++...... +...|+-|-.........+......+. .......... ..-+.
T Consensus 2 i~vInL~~~~~Rr~~~~~~~~~~~-----~~~~~~~Avd~~~~~~~~~~~~~~~~~---~~~~~~~l~~gEiGC~lSH~~ 73 (128)
T cd06532 2 IFVINLDRSTDRRERMEAQLAALG-----LDFEFFDAVDGKDLSEEELAALYDALF---LPRYGRPLTPGEIGCFLSHYK 73 (128)
T ss_pred EEEEECCCCHHHHHHHHHHHHHcC-----CCeEEEeccccccCCHHHHHHHhHHHh---hhhcCCCCChhhHHHHHHHHH
Confidence 457788888999999998665432 335666664332211111111111000 0001111111 11122
Q ss_pred HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHH
Q 020721 152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEW 231 (322)
Q Consensus 152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~ 231 (322)
.++-+.+ .+.++.+-..||+.+..+ +..||++|++.|+.
T Consensus 74 ~w~~~~~---------~~~~~alIlEDDv~~~~~--------------------------------~~~~Y~vs~~~A~~ 112 (128)
T cd06532 74 LWQKIVE---------SNLEYALILEDDAILDPD--------------------------------GTAGYLVSRKGAKK 112 (128)
T ss_pred HHHHHHH---------cCCCeEEEEccCcEECCC--------------------------------CceEEEeCHHHHHH
Confidence 3333333 155899999999997665 55799999999999
Q ss_pred HHHcC
Q 020721 232 IRDSD 236 (322)
Q Consensus 232 i~~~~ 236 (322)
+.+..
T Consensus 113 ll~~~ 117 (128)
T cd06532 113 LLAAL 117 (128)
T ss_pred HHHhC
Confidence 99965
No 22
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=89.14 E-value=16 Score=32.49 Aligned_cols=165 Identities=12% Similarity=-0.001 Sum_probs=84.9
Q ss_pred CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHH
Q 020721 73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTY 152 (322)
Q Consensus 73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~ 152 (322)
...+-|+|++.-....-...++.-..+.. ....+.++++-..++++. ...+.+..+. .+......++. +|....
T Consensus 28 ~~~isVvip~~n~~~~l~~~l~si~~q~~-~~~~~eiivvdd~s~d~t-~~~~~~~~~~--~v~~i~~~~~~--g~~~a~ 101 (251)
T cd06439 28 LPTVTIIIPAYNEEAVIEAKLENLLALDY-PRDRLEIIVVSDGSTDGT-AEIAREYADK--GVKLLRFPERR--GKAAAL 101 (251)
T ss_pred CCEEEEEEecCCcHHHHHHHHHHHHhCcC-CCCcEEEEEEECCCCccH-HHHHHHHhhC--cEEEEEcCCCC--ChHHHH
Confidence 35677788887654444556666555432 112355555554444332 2223222222 34443322322 344332
Q ss_pred HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeecCCCC--------------------
Q 020721 153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIPCRSM-------------------- 210 (322)
Q Consensus 153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p~r~~-------------------- 210 (322)
-..+.. ...+|++-.|+|+.+..+.|.+.+.... .-.+..|......+.
T Consensus 102 n~gi~~---------a~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (251)
T cd06439 102 NRALAL---------ATGEIVVFTDANALLDPDALRLLVRHFADPSVGAVSGELVIVDGGGSGSGEGLYWKYENWLKRAE 172 (251)
T ss_pred HHHHHH---------cCCCEEEEEccccCcCHHHHHHHHHHhcCCCccEEEeEEEecCCcccchhHHHHHHHHHHHHHHH
Confidence 222222 1459999999999999776766666543 224444543211100
Q ss_pred --CCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 211 --DPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 211 --~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
......+.|+++.+.+++.+ . . ......||..++.-+...|.
T Consensus 173 ~~~~~~~~~~g~~~~~rr~~~~-----~-~-~~~~~~eD~~l~~~~~~~G~ 216 (251)
T cd06439 173 SRLGSTVGANGAIYAIRRELFR-----P-L-PADTINDDFVLPLRIARQGY 216 (251)
T ss_pred HhcCCeeeecchHHHhHHHHhc-----C-C-CcccchhHHHHHHHHHHcCC
Confidence 01122355666667776665 1 1 12334799999888877664
No 23
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=88.96 E-value=5.7 Score=34.03 Aligned_cols=82 Identities=22% Similarity=0.196 Sum_probs=55.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCch
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPE 246 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~E 246 (322)
..+|++..|||..+..+.+.+.+... +.-.+..|..... +. .++|.++++++.+.+--.. ..-....|
T Consensus 79 ~~d~v~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~~~~g~~~--~~~~~~~e 148 (202)
T cd04185 79 GYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDP---DG-----SFVGVLISRRVVEKIGLPD--KEFFIWGD 148 (202)
T ss_pred CCCEEEEeCCCCCcChHHHHHHHHHHhcCCceEecceeEcC---CC-----ceEEEEEeHHHHHHhCCCC--hhhhccch
Confidence 66999999999999987776666554 3334454544321 11 4467899999998874321 11234569
Q ss_pred hHHHHHHHHhcCCC
Q 020721 247 DKVFGAWIREGRRA 260 (322)
Q Consensus 247 Dv~vG~~l~~~~~~ 260 (322)
|+.++.-+.+.|..
T Consensus 149 D~~~~~r~~~~G~~ 162 (202)
T cd04185 149 DTEYTLRASKAGPG 162 (202)
T ss_pred HHHHHHHHHHcCCc
Confidence 99999999888764
No 24
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=88.92 E-value=19 Score=33.28 Aligned_cols=43 Identities=12% Similarity=0.004 Sum_probs=31.6
Q ss_pred cccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcC
Q 020721 215 DYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGR 258 (322)
Q Consensus 215 ~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~ 258 (322)
++++|++.++++++.+.+-.-.+ .....+.||+-+..=+...|
T Consensus 169 ~~~~g~~~~irr~~~~~vGgfDe-~~~~~~~ED~Dl~~R~~~~G 211 (299)
T cd02510 169 PTMAGGLFAIDREWFLELGGYDE-GMDIWGGENLELSFKVWQCG 211 (299)
T ss_pred ccccceeeEEEHHHHHHhCCCCC-cccccCchhHHHHHHHHHcC
Confidence 46789999999999999865432 33444679998877666655
No 25
>PF01755 Glyco_transf_25: Glycosyltransferase family 25 (LPS biosynthesis protein); InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=88.09 E-value=6.8 Score=34.03 Aligned_cols=34 Identities=15% Similarity=0.174 Sum_probs=24.7
Q ss_pred EEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCC
Q 020721 78 IGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNL 116 (322)
Q Consensus 78 i~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~ 116 (322)
|.|.|-+...+||+.+.+..... .+...|+-|-.
T Consensus 4 i~vInL~~~~~Rr~~~~~~~~~~-----~~~~e~~~Avd 37 (200)
T PF01755_consen 4 IYVINLDRSTERRERIQQQLAKL-----GINFEFFDAVD 37 (200)
T ss_pred EEEEECCCCHHHHHHHHHHHHHc-----CCceEEEEeec
Confidence 56778888999999999888754 22356666543
No 26
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=87.89 E-value=18 Score=31.64 Aligned_cols=139 Identities=14% Similarity=0.088 Sum_probs=75.2
Q ss_pred eEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHH
Q 020721 107 VDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLEN 186 (322)
Q Consensus 107 v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~ 186 (322)
.++++|...++++. ...+ ++...+..+.+.. .++ .+|..+.-.-+.. .+.+|++..|+|+.+..+.
T Consensus 29 ~eiivvdd~s~d~~-~~~l-~~~~~~~~~~v~~-~~~--~g~~~a~n~g~~~---------a~~d~v~~lD~D~~~~~~~ 94 (235)
T cd06434 29 LEIIVVTDGDDEPY-LSIL-SQTVKYGGIFVIT-VPH--PGKRRALAEGIRH---------VTTDIVVLLDSDTVWPPNA 94 (235)
T ss_pred CEEEEEeCCCChHH-HHHH-HhhccCCcEEEEe-cCC--CChHHHHHHHHHH---------hCCCEEEEECCCceeChhH
Confidence 44666655554432 2223 3455566665553 222 2455432222221 1669999999999999998
Q ss_pred HHHHhcCCCCC--CeEEEEeecCC-----------------------CCCCC--CcccccceeeeCHHHHHHHHHcCC--
Q 020721 187 LVKSLVPLPRE--DLYYGYVIPCR-----------------------SMDPF--VDYMSGMGYLVSWDIAEWIRDSDI-- 237 (322)
Q Consensus 187 L~~~L~~~~~~--~~y~G~~~p~r-----------------------~~~~y--p~y~~G~gYvlS~d~~~~i~~~~~-- 237 (322)
|.+.+...... ....|...... ....+ -..++|.+.++.+++++.+.-...
T Consensus 95 l~~l~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~rr~~l~~~~~~~~~~ 174 (235)
T cd06434 95 LPEMLKPFEDPKVGGVGTNQRILRPRDSKWSFLAAEYLERRNEEIRAAMSYDGGVPCLSGRTAAYRTEILKDFLFLEEFT 174 (235)
T ss_pred HHHHHHhccCCCEeEEcCceEeecCcccHHHHHHHHHHHHHHHHHHHHHhhCCCEEEccCcHHHHHHHHHhhhhhHHHhh
Confidence 87777665322 22222211100 00001 123567778888888877543211
Q ss_pred ----CCCCCCCchhHHHHHHHHhcCC
Q 020721 238 ----PKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 238 ----~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
...+....||..++.-+.+.|.
T Consensus 175 ~~~~~~~~~~~~eD~~l~~~~~~~g~ 200 (235)
T cd06434 175 NETFMGRRLNAGDDRFLTRYVLSHGY 200 (235)
T ss_pred hhhhcCCCCCcCchHHHHHHHHHCCC
Confidence 1124557799999887777654
No 27
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=87.46 E-value=23 Score=33.05 Aligned_cols=169 Identities=17% Similarity=0.085 Sum_probs=95.8
Q ss_pred eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchH-HHHH
Q 020721 74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGK-TYTY 152 (322)
Q Consensus 74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~K-t~~~ 152 (322)
..+-++|.|--......+.++..-.... ....+.++-+.+.++..+.+.... +.++..+...+|..-.. --.+
T Consensus 3 ~~i~~iiv~yn~~~~l~~~l~~l~~~~~----~~~~iv~vDn~s~d~~~~~~~~~~--~~~v~~i~~~~NlG~agg~n~g 76 (305)
T COG1216 3 PKISIIIVTYNRGEDLVECLASLAAQTY----PDDVIVVVDNGSTDGSLEALKARF--FPNVRLIENGENLGFAGGFNRG 76 (305)
T ss_pred cceEEEEEecCCHHHHHHHHHHHhcCCC----CCcEEEEccCCCCCCCHHHHHhhc--CCcEEEEEcCCCccchhhhhHH
Confidence 3455556665444455555555554332 111222444444333333333322 77888888888864201 1123
Q ss_pred HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC--CCeEEEEee-------------------------
Q 020721 153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR--EDLYYGYVI------------------------- 205 (322)
Q Consensus 153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~--~~~y~G~~~------------------------- 205 (322)
++.+... ..+|++-.++|+.+..+.|.+.++.... .....|...
T Consensus 77 ~~~a~~~---------~~~~~l~LN~D~~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (305)
T COG1216 77 IKYALAK---------GDDYVLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRA 147 (305)
T ss_pred HHHHhcC---------CCcEEEEEcCCeeeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchheecccccccccccee
Confidence 4444331 1239999999999998888887765321 111111111
Q ss_pred -cCC---C----CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 206 -PCR---S----MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 206 -p~r---~----~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
+.. + ....-.+++|++.++++++.+.+---. ..-+...||+-.+.=+++.|.
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~G~~~li~~~~~~~vG~~d--e~~F~y~eD~D~~~R~~~~G~ 207 (305)
T COG1216 148 SPLLEIAPDLSSYLEVVASLSGACLLIRREAFEKVGGFD--ERFFIYYEDVDLCLRARKAGY 207 (305)
T ss_pred cccccccccccchhhhhhhcceeeeEEcHHHHHHhCCCC--cccceeehHHHHHHHHHHcCC
Confidence 000 0 011112578999999999999997632 245779999999999998885
No 28
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=87.38 E-value=16 Score=32.47 Aligned_cols=87 Identities=15% Similarity=0.085 Sum_probs=55.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCC--CCe-EE-EEee-cCC------------------------CCCCCCccccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPR--EDL-YY-GYVI-PCR------------------------SMDPFVDYMSG 219 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~--~~~-y~-G~~~-p~r------------------------~~~~yp~y~~G 219 (322)
+.+|++..|+|+.+..+.|.+.++.... .++ +. |.+. ... .....+..++|
T Consensus 84 ~gd~i~~~DaD~~~~~~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 163 (241)
T cd06427 84 RGEYVVIYDAEDAPDPDQLKKAVAAFARLDDKLACVQAPLNYYNARENWLTRMFALEYAAWFDYLLPGLARLGLPIPLGG 163 (241)
T ss_pred CCCEEEEEcCCCCCChHHHHHHHHHHHhcCCCEEEEeCceEeeCCCccHHHHHHHHHHHHHHHHHHHHHHhcCCeeecCC
Confidence 5599999999999999988887765422 222 22 2210 000 00011224578
Q ss_pred ceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 220 MGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 220 ~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
++.++++++.+.+..... ....||..++.=+...|.
T Consensus 164 ~~~~~rr~~~~~vgg~~~----~~~~eD~~l~~rl~~~G~ 199 (241)
T cd06427 164 TSNHFRTDVLRELGGWDP----FNVTEDADLGLRLARAGY 199 (241)
T ss_pred chHHhhHHHHHHcCCCCc----ccchhhHHHHHHHHHCCc
Confidence 888999999998855331 234799999887766553
No 29
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=86.46 E-value=17 Score=30.84 Aligned_cols=87 Identities=16% Similarity=0.096 Sum_probs=52.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC--C----CCCCC-------------cccccceeeeCH
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR--S----MDPFV-------------DYMSGMGYLVSW 226 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r--~----~~~yp-------------~y~~G~gYvlS~ 226 (322)
+.+|++..|+|..+..+.|...+... +.-.++.|...... . ....| ....|++.++.+
T Consensus 80 ~gd~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rr 159 (201)
T cd04195 80 TYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIGKRRLPTSHDDILKFARRRSPFNHPTVMFRK 159 (201)
T ss_pred CCCEEEEeCCccccCcHHHHHHHHHHHhCCCeEEEcccEEEECCCCCeeccccCCCCHHHHHHHhccCCCCCChHHhhhH
Confidence 56999999999999988777766553 33345555432100 0 00000 112355667777
Q ss_pred HHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 227 DIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 227 d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
++.+.+.... .....||..+..-+...|.
T Consensus 160 ~~~~~~g~~~----~~~~~eD~~~~~r~~~~g~ 188 (201)
T cd04195 160 SKVLAVGGYQ----DLPLVEDYALWARMLANGA 188 (201)
T ss_pred HHHHHcCCcC----CCCCchHHHHHHHHHHcCC
Confidence 7766653322 2267999999888877654
No 30
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=86.20 E-value=19 Score=30.74 Aligned_cols=87 Identities=14% Similarity=0.027 Sum_probs=55.6
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee--cCC---------CCCC-----------CCcccccceee
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI--PCR---------SMDP-----------FVDYMSGMGYL 223 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~--p~r---------~~~~-----------yp~y~~G~gYv 223 (322)
+.+|++..|+|..+..+.|.+.++.. +...++.|... ... .... ...+..|++.+
T Consensus 79 ~g~~v~~ld~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (214)
T cd04196 79 DGDYVFFCDQDDIWLPDKLERLLKAFLKDDKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVTGCTMA 158 (214)
T ss_pred CCCEEEEECCCcccChhHHHHHHHHHhcCCCceEEecCcEEECCCCCCcccccccccccCCccCHHHHHHhCccCCceee
Confidence 67999999999988877777776642 22234444321 000 0000 12456789999
Q ss_pred eCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcC
Q 020721 224 VSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGR 258 (322)
Q Consensus 224 lS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~ 258 (322)
+.+++++.+..... .....||.++...+...|
T Consensus 159 ~r~~~~~~~~~~~~---~~~~~~D~~~~~~~~~~~ 190 (214)
T cd04196 159 FNRELLELALPFPD---ADVIMHDWWLALLASAFG 190 (214)
T ss_pred EEHHHHHhhccccc---cccccchHHHHHHHHHcC
Confidence 99999999865432 115789998887776654
No 31
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=85.62 E-value=8.3 Score=35.28 Aligned_cols=118 Identities=12% Similarity=0.076 Sum_probs=65.5
Q ss_pred hCCCeEEeccCCcCCchHHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC---CCeEEEEee-
Q 020721 131 RYDDIIILNCKENMNKGKTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR---EDLYYGYVI- 205 (322)
Q Consensus 131 ~~~DIv~ld~~D~y~n~Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~---~~~y~G~~~- 205 (322)
.+..|..+...++..-.+.+ .+++++.+. .++|++-.|||+.+..+.|.+.+..... .-...|...
T Consensus 43 ~~~~i~~i~~~~N~G~a~a~N~Gi~~a~~~---------~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 113 (281)
T TIGR01556 43 RGQKIALIHLGDNQGIAGAQNQGLDASFRR---------GVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQACALGPRFF 113 (281)
T ss_pred cCCCeEEEECCCCcchHHHHHHHHHHHHHC---------CCCEEEEECCCCCCCHHHHHHHHHHHHhcCCceEEECCeEE
Confidence 34566666666655422322 245555431 5699999999999987766665543211 111222110
Q ss_pred --------c-------------CCCCC--CCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 206 --------P-------------CRSMD--PFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 206 --------p-------------~r~~~--~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
+ ..... .-..++.++|.++++++.+.+.--. ..-++..||+.+..=+++.|-
T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sg~li~~~~~~~iG~fd--e~~fi~~~D~e~~~R~~~~G~ 188 (281)
T TIGR01556 114 DRGTSRRLPAIHLDGLLLRQISLDGLTTPQKTSFLISSGCLITREVYQRLGMMD--EELFIDHVDTEWSLRAQNYGI 188 (281)
T ss_pred cCCCcccCCceeecccceeeecccccCCceeccEEEcCcceeeHHHHHHhCCcc--HhhcccchHHHHHHHHHHCCC
Confidence 0 00000 0012455667789999999885422 122456799988766666653
No 32
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=85.56 E-value=24 Score=30.85 Aligned_cols=88 Identities=13% Similarity=-0.026 Sum_probs=53.3
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCC--CCCeEEEEeec--C---------------CC---CCCC-----Ccccccce
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLP--REDLYYGYVIP--C---------------RS---MDPF-----VDYMSGMG 221 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~--~~~~y~G~~~p--~---------------r~---~~~y-----p~y~~G~g 221 (322)
+.+|++..|||..+....|.+.++... ......|.... . .. .... -.++.|++
T Consensus 81 ~~d~v~~lD~D~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (249)
T cd02525 81 RGDIIIRVDAHAVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHH 160 (249)
T ss_pred CCCEEEEECCCccCCHHHHHHHHHHHhcCCCCEEecceecCCCChHHHHHHHHhhchhccCCcccccccccccccccccc
Confidence 569999999999998776766664432 22333333210 0 00 0000 11457778
Q ss_pred eeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 222 YLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 222 YvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.++++++.+.+.-.. ......||..++.-+.+.|.
T Consensus 161 ~~~~~~~~~~~g~~~---~~~~~~eD~~l~~r~~~~G~ 195 (249)
T cd02525 161 GAYRREVFEKVGGFD---ESLVRNEDAELNYRLRKAGY 195 (249)
T ss_pred ceEEHHHHHHhCCCC---cccCccchhHHHHHHHHcCc
Confidence 889999988875322 12345799999877766654
No 33
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=85.50 E-value=24 Score=30.69 Aligned_cols=87 Identities=14% Similarity=-0.031 Sum_probs=57.2
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCC-C--CeEEEEee--cC------C----------------C-CCCCCcccccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPR-E--DLYYGYVI--PC------R----------------S-MDPFVDYMSGM 220 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~--~~y~G~~~--p~------r----------------~-~~~yp~y~~G~ 220 (322)
+.+|++..|+|.++..+.|.+.+..... . .+..|... .. . . ......++.|+
T Consensus 84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 163 (234)
T cd06421 84 TGDFVAILDADHVPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLADGAPNEQELFYGVIQPGRDRWGAAFCCGS 163 (234)
T ss_pred CCCEEEEEccccCcCccHHHHHHHHHhcCCCeEEEecceEEecCCcchhHHHHHHHHHHHHHHHHHHHHhhcCCceecCc
Confidence 5699999999999998887777765432 2 22223211 00 0 0 00113567899
Q ss_pred eeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 221 GYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 221 gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
+.++++++.+.+.... .....||..++.-+.+.|.
T Consensus 164 ~~~~r~~~~~~ig~~~----~~~~~eD~~l~~r~~~~g~ 198 (234)
T cd06421 164 GAVVRREALDEIGGFP----TDSVTEDLATSLRLHAKGW 198 (234)
T ss_pred eeeEeHHHHHHhCCCC----ccceeccHHHHHHHHHcCc
Confidence 9999999999875432 1234799999988877764
No 34
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=84.99 E-value=35 Score=33.83 Aligned_cols=168 Identities=13% Similarity=0.057 Sum_probs=93.1
Q ss_pred CeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHH
Q 020721 73 DIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTY 152 (322)
Q Consensus 73 ~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~ 152 (322)
.+.+-|+|++.-+...-++.|+.- .++.. .+++++.+-..++++ ..+.+++..+++..+......++- +|..+
T Consensus 74 ~p~vsViIP~yNE~~~i~~~l~sl-l~q~y--p~~eIivVdDgs~D~-t~~~~~~~~~~~~~v~vv~~~~n~--Gka~A- 146 (444)
T PRK14583 74 HPLVSILVPCFNEGLNARETIHAA-LAQTY--TNIEVIAINDGSSDD-TAQVLDALLAEDPRLRVIHLAHNQ--GKAIA- 146 (444)
T ss_pred CCcEEEEEEeCCCHHHHHHHHHHH-HcCCC--CCeEEEEEECCCCcc-HHHHHHHHHHhCCCEEEEEeCCCC--CHHHH-
Confidence 467888888886554444444432 22221 245555554444433 333445555666665554433322 45432
Q ss_pred HHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeec-CCC-------------------
Q 020721 153 FSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIP-CRS------------------- 209 (322)
Q Consensus 153 ~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p-~r~------------------- 209 (322)
+...... .+.+|++-.|+|+.+..+.+.+.++.. |+-...-|.... .+.
T Consensus 147 lN~gl~~--------a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g~~~~~~~~~~~~~~~~~e~~~~~~~~~ 218 (444)
T PRK14583 147 LRMGAAA--------ARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTGNPRIRTRSTLIGRVQVGEFSSIIGLIK 218 (444)
T ss_pred HHHHHHh--------CCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEccceecCCCcchhhHHHHHHHHHHHHHH
Confidence 2222221 267999999999999998877766532 332333332211 000
Q ss_pred --CCCC-Cc-ccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 210 --MDPF-VD-YMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 210 --~~~y-p~-y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
...+ +. .++|.+..+.+++++.+.-.. +..-.||..+|.-+...|-
T Consensus 219 ~~~~~~g~~~~~sG~~~~~rr~al~~vGg~~----~~~i~ED~dl~~rl~~~G~ 268 (444)
T PRK14583 219 RTQRVYGQVFTVSGVVAAFRRRALADVGYWS----PDMITEDIDISWKLQLKHW 268 (444)
T ss_pred HHHHHhCCceEecCceeEEEHHHHHHcCCCC----CCcccccHHHHHHHHHcCC
Confidence 0111 12 357888999999988875332 1224699999999988764
No 35
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=83.09 E-value=3.8 Score=33.08 Aligned_cols=114 Identities=19% Similarity=0.115 Sum_probs=59.0
Q ss_pred ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHH-HHHHhhhhHhcCCCCCCCCceEEEEecCceEeeH
Q 020721 106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTY-TYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRL 184 (322)
Q Consensus 106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~-~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~ 184 (322)
.++++++-..++++ ....+++-.+....+..+...++..-.+.. .+++.+ ..+|++..|||.++..
T Consensus 27 ~~eiivvdd~s~d~-~~~~~~~~~~~~~~i~~i~~~~n~g~~~~~n~~~~~a------------~~~~i~~ld~D~~~~~ 93 (169)
T PF00535_consen 27 DFEIIVVDDGSTDE-TEEILEEYAESDPNIRYIRNPENLGFSAARNRGIKHA------------KGEYILFLDDDDIISP 93 (169)
T ss_dssp EEEEEEEECS-SSS-HHHHHHHHHCCSTTEEEEEHCCCSHHHHHHHHHHHH--------------SSEEEEEETTEEE-T
T ss_pred CEEEEEeccccccc-ccccccccccccccccccccccccccccccccccccc------------ceeEEEEeCCCceEcH
Confidence 45555555444333 333333333335556666666665322221 122222 3479999999999998
Q ss_pred HHHHHHhcCC--CCCCeEEEEeec-----------C------C--------CCCCCCcccccceeeeCHHHHHHH
Q 020721 185 ENLVKSLVPL--PREDLYYGYVIP-----------C------R--------SMDPFVDYMSGMGYLVSWDIAEWI 232 (322)
Q Consensus 185 ~~L~~~L~~~--~~~~~y~G~~~p-----------~------r--------~~~~yp~y~~G~gYvlS~d~~~~i 232 (322)
+.|...++.. ++....+|.... . . ....-..++.|++-++++++.+.+
T Consensus 94 ~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rr~~~~~~ 168 (169)
T PF00535_consen 94 DWLEELVEALEKNPPDVVIGSVIYIDDDNRYPDRRLRFSFWNRFERKIFNNIRFWKISFFIGSCALFRRSVFEEI 168 (169)
T ss_dssp THHHHHHHHHHHCTTEEEEEEEEEEECTTETEECCCTSEEEECCHCHHHHTTHSTTSSEESSSCEEEEEHHHHHC
T ss_pred HHHHHHHHHHHhCCCcEEEEEEEEecCCccccccccchhhhhhhhhHHHHhhhcCCcccccccEEEEEHHHHHhh
Confidence 7666555443 233444444320 0 0 001123567888889998888764
No 36
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=80.69 E-value=32 Score=28.62 Aligned_cols=116 Identities=13% Similarity=0.004 Sum_probs=64.0
Q ss_pred ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721 106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE 185 (322)
Q Consensus 106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~ 185 (322)
...++.+-..++++ ....+....+++..+..+...++.. |..+. ....+. ...+|++-.|+|..+..+
T Consensus 28 ~~eiivvd~~s~d~-~~~~~~~~~~~~~~~~~~~~~~n~G--~~~a~-n~g~~~--------a~gd~i~~lD~D~~~~~~ 95 (185)
T cd04179 28 DYEIIVVDDGSTDG-TAEIARELAARVPRVRVIRLSRNFG--KGAAV-RAGFKA--------ARGDIVVTMDADLQHPPE 95 (185)
T ss_pred CEEEEEEcCCCCCC-hHHHHHHHHHhCCCeEEEEccCCCC--ccHHH-HHHHHH--------hcCCEEEEEeCCCCCCHH
Confidence 34445554444333 3334444455666665566555543 22111 111111 134999999999999988
Q ss_pred HHHHHhcC--CCCCCeEEEEeecCCC----------------------CCCCCcccccceeeeCHHHHHHHH
Q 020721 186 NLVKSLVP--LPREDLYYGYVIPCRS----------------------MDPFVDYMSGMGYLVSWDIAEWIR 233 (322)
Q Consensus 186 ~L~~~L~~--~~~~~~y~G~~~p~r~----------------------~~~yp~y~~G~gYvlS~d~~~~i~ 233 (322)
.|.+.+.. .+..++..|....... ...-...+.|+.+++++++++.+-
T Consensus 96 ~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~i~ 167 (185)
T cd04179 96 DIPKLLEKLLEGGADVVIGSRFVRGGGAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFRLFRREVLEALL 167 (185)
T ss_pred HHHHHHHHHhccCCcEEEEEeecCCCcccchHHHHHHHHHHHHHHHHHcCCCCcCCCCceeeeHHHHHHHHH
Confidence 88877764 2345666676432110 011122355667788888888885
No 37
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=80.40 E-value=58 Score=31.51 Aligned_cols=170 Identities=14% Similarity=0.026 Sum_probs=102.9
Q ss_pred CeeEEEEEECCCCCH-HHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721 73 DIRILIGILTLPDQY-HRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT 151 (322)
Q Consensus 73 ~~~lli~V~S~~~~~-~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~ 151 (322)
.+.+-++|++--++. .-.+.++..-...- ...++..+...++++. .+.+++-.+++++.+.....+.-+.+|--+
T Consensus 53 ~p~vsviiP~ynE~~~~~~~~l~s~~~~dy---p~~evivv~d~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~gK~~a 128 (439)
T COG1215 53 LPKVSVIIPAYNEEPEVLEETLESLLSQDY---PRYEVIVVDDGSTDET-YEILEELGAEYGPNFRVIYPEKKNGGKAGA 128 (439)
T ss_pred CCceEEEEecCCCchhhHHHHHHHHHhCCC---CCceEEEECCCCChhH-HHHHHHHHhhcCcceEEEeccccCccchHH
Confidence 477888899887666 45556665554432 1255666665454443 335555566665334433221222244332
Q ss_pred HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCe--EEEEee---------c---C----------
Q 020721 152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDL--YYGYVI---------P---C---------- 207 (322)
Q Consensus 152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~--y~G~~~---------p---~---------- 207 (322)
+.+.... .+.++|+..|.|+.+..+.|.+.+......+. ..|... . .
T Consensus 129 -l~~~l~~--------~~~d~V~~~DaD~~~~~d~l~~~~~~f~~~~~~~v~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 199 (439)
T COG1215 129 -LNNGLKR--------AKGDVVVILDADTVPEPDALRELVSPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFY 199 (439)
T ss_pred -HHHHHhh--------cCCCEEEEEcCCCCCChhHHHHHHhhhcCCCeeEEeCCceeeecCChhhhcchhcchhhhhhHH
Confidence 2333322 14799999999999999999998877533211 222210 0 0
Q ss_pred -----CCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 208 -----RSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 208 -----r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
........+++|++.++.+++++.+... ....-.||..+|..+...|-
T Consensus 200 ~~~~~~~~~g~~~~~~G~~~~~rr~aL~~~g~~----~~~~i~ED~~lt~~l~~~G~ 252 (439)
T COG1215 200 FRLRAASKGGLISFLSGSSSAFRRSALEEVGGW----LEDTITEDADLTLRLHLRGY 252 (439)
T ss_pred HhhhhhhhcCCeEEEcceeeeEEHHHHHHhCCC----CCCceeccHHHHHHHHHCCC
Confidence 0112246789999999999999998732 23446799999999986654
No 38
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=79.40 E-value=5.3 Score=34.16 Aligned_cols=85 Identities=25% Similarity=0.235 Sum_probs=55.4
Q ss_pred EEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC--C----------------------CCCCCCcccccceeeeC
Q 020721 172 YVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC--R----------------------SMDPFVDYMSGMGYLVS 225 (322)
Q Consensus 172 yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~--r----------------------~~~~yp~y~~G~gYvlS 225 (322)
||+-.|+|+-+..+-|.+.+... |.-.+.-|...+. . .....+.++.|++.+++
T Consensus 1 ~v~~~DaDt~~~~d~l~~~~~~~~~~~~~~vq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~r 80 (193)
T PF13632_consen 1 YVLFLDADTRLPPDFLERLVAALEDPKVDAVQGPIIFRNRGSLLTRLQDFEYAISHGLSRLSQSSLGRPLFLSGSGMLFR 80 (193)
T ss_pred CEEEEcCCCCCChHHHHHHHHHHhCCCceEEEccEEecCCCChhheeehhhhhhhhhhhHHHHHhcCCCccccCcceeee
Confidence 67889999999988777665443 3222222222110 0 01123567899999999
Q ss_pred HHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 226 WDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 226 ~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
+++++.+.-.. ......||..+|.=+.+.|.
T Consensus 81 ~~~l~~vg~~~---~~~~~~ED~~l~~~l~~~G~ 111 (193)
T PF13632_consen 81 REALREVGGFD---DPFSIGEDMDLGFRLRRAGY 111 (193)
T ss_pred HHHHHHhCccc---ccccccchHHHHHHHHHCCC
Confidence 99999885322 13456699999999888773
No 39
>PF05679 CHGN: Chondroitin N-acetylgalactosaminyltransferase; InterPro: IPR008428 This family represents Chondroitin N-acetylgalactosaminyltransferase. Proteins have a type II transmembrane topology. The enzyme is involved in the biosynthetic initiation and elongation of chondroitin sulphate and is the key enzyme responsible for the selective chain assembly of chondroitin/dermatan sulphate on the linkage region tetrasaccharide common to various proteoglycans containing chondroitin/dermatan sulphate or heparin/heparan sulphate chains. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0032580 Golgi cisterna membrane
Probab=79.03 E-value=3.4 Score=42.00 Aligned_cols=67 Identities=18% Similarity=0.167 Sum_probs=41.0
Q ss_pred CCCCCchhHHHHHHHHh-cCCCCccccCccceeecCC-C-----CCcccccccCCEEEEEccCCHHHHHHHHHHcc
Q 020721 240 NHLEGPEDKVFGAWIRE-GRRAKNRYNAKWSMYNFPV-P-----RTACTHELWADTIAVHLLKNQEKWIETLTYFN 308 (322)
Q Consensus 240 ~~~~~~EDv~vG~~l~~-~~~~~~~~~~~~~f~~~~~-~-----~~~c~~~~~~~~ia~H~~~~~~~~~~~~~~fn 308 (322)
.-....||+.+|+|+.+ .|+.-..-. ...+|+... . ...-..+ ..++|++|.++.++.+.++.+||-
T Consensus 8 ~~~s~~~Dv~lGRCI~~~~gi~Ct~~~-q~l~y~~~~~~~~~~~~~~~~~~-~~~AiTlHPvk~p~~mYrLH~y~~ 81 (499)
T PF05679_consen 8 NIYSNHEDVELGRCIKKFTGISCTWSY-QGLFYHNYELNKNDFIGDLKNKE-FHNAITLHPVKSPAYMYRLHRYFL 81 (499)
T ss_pred hcCCCCchhHHHHHHHHhcCCCeeecc-cceEEEeeccCCCcccccccchh-hhcceeeccCCCHHHHHHHHHHHH
Confidence 34567899999999987 577532211 234443221 1 0111112 367899999999987777766663
No 40
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=76.44 E-value=44 Score=27.89 Aligned_cols=119 Identities=13% Similarity=0.034 Sum_probs=66.5
Q ss_pred ceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHH
Q 020721 106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLE 185 (322)
Q Consensus 106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~ 185 (322)
.+.++.+-..++++... .++...+++..+..+....+. ++.-+. ...... -..+|++..|+|.....+
T Consensus 29 ~~eiivvdd~s~d~t~~-~~~~~~~~~~~i~~i~~~~n~--G~~~a~-n~g~~~--------a~~d~i~~~D~D~~~~~~ 96 (181)
T cd04187 29 DYEIIFVDDGSTDRTLE-ILRELAARDPRVKVIRLSRNF--GQQAAL-LAGLDH--------ARGDAVITMDADLQDPPE 96 (181)
T ss_pred CeEEEEEeCCCCccHHH-HHHHHHhhCCCEEEEEecCCC--CcHHHH-HHHHHh--------cCCCEEEEEeCCCCCCHH
Confidence 45566666555544322 333334455556665554443 333221 111111 145999999999999877
Q ss_pred HHHHHhcCC-CCCCeEEEEeecCC------------------CCCCCCcccccceeeeCHHHHHHHHHcC
Q 020721 186 NLVKSLVPL-PREDLYYGYVIPCR------------------SMDPFVDYMSGMGYLVSWDIAEWIRDSD 236 (322)
Q Consensus 186 ~L~~~L~~~-~~~~~y~G~~~p~r------------------~~~~yp~y~~G~gYvlS~d~~~~i~~~~ 236 (322)
.|.+.++.. +..++..|...... .....-+...|++.++++++++.+....
T Consensus 97 ~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~i~~~d 166 (181)
T cd04187 97 LIPEMLAKWEEGYDVVYGVRKNRKESWLKRLTSKLFYRLINKLSGVDIPDNGGDFRLMDRKVVDALLLLP 166 (181)
T ss_pred HHHHHHHHHhCCCcEEEEEecCCcchHHHHHHHHHHHHHHHHHcCCCCCCCCCCEEEEcHHHHHHHHhcC
Confidence 666666543 23456666543110 0111124567788899999999988654
No 41
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=76.34 E-value=62 Score=29.61 Aligned_cols=173 Identities=14% Similarity=0.077 Sum_probs=93.6
Q ss_pred EEEEECCCCCHH-HHHHHHHHhcc---CCCCCCceEEEEEecCCCchhHHHH-------HHHHHhhCCCeEEeccCCcCC
Q 020721 77 LIGILTLPDQYH-RRHFLRMIYGT---QSPTGAQVDVKFVFCNLTKEDQKVL-------VALEIMRYDDIIILNCKENMN 145 (322)
Q Consensus 77 li~V~S~~~~~~-rR~aIR~TW~~---~~~~~~~v~~~Fvlg~~~~~~~~~~-------l~~E~~~~~DIv~ld~~D~y~ 145 (322)
-|+|++.-...+ -.+.++.++.. +.. ...+.+ |++...++++.... +.+|....-.+....-.++ .
T Consensus 2 SIliP~~ne~~~~l~~~l~~~~~~~~~~~~-~~~~eI-~vldD~~d~~~~~~~~~~~~~l~~~~~~~~~v~~~~r~~~-~ 78 (254)
T cd04191 2 AIVMPVYNEDPARVFAGLRAMYESLAKTGL-ADHFDF-FILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRRRREN-T 78 (254)
T ss_pred EEEEeCCCCCHHHHHHHHHHHHHHHHhcCC-cCceEE-EEECCCCChHHHHHHHHHHHHHHHHhCCCCcEEEEEcCCC-C
Confidence 367888876664 66778877742 211 124556 88877666543221 2222222223333333333 2
Q ss_pred chHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee--cCC------------
Q 020721 146 KGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI--PCR------------ 208 (322)
Q Consensus 146 n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~--p~r------------ 208 (322)
+.|+-..-..+.. .. .+++|++-.|.|+.+..+.|.+.+... |.=.+.-+... ..+
T Consensus 79 g~Kag~l~~~~~~-~~------~~~~~i~~~DaD~~~~p~~l~~~v~~~~~~~~vg~vq~~~~~~n~~~~~~~~~~~~~~ 151 (254)
T cd04191 79 GRKAGNIADFCRR-WG------SRYDYMVVLDADSLMSGDTIVRLVRRMEANPRAGIIQTAPKLIGAETLFARLQQFANR 151 (254)
T ss_pred CccHHHHHHHHHH-hC------CCCCEEEEEeCCCCCCHHHHHHHHHHHHhCCCEEEEeCCceeECCCCHHHHHHHHHHH
Confidence 3455332222211 00 256999999999999999888877653 22111111100 000
Q ss_pred ----------C-CCCCCcccccceeeeCHHHHHHHHHcCCC-----CCCCCCchhHHHHHHHHhcCC
Q 020721 209 ----------S-MDPFVDYMSGMGYLVSWDIAEWIRDSDIP-----KNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 209 ----------~-~~~yp~y~~G~gYvlS~d~~~~i~~~~~~-----~~~~~~~EDv~vG~~l~~~~~ 259 (322)
. +...-.++.|...++.++++..+...+.. -....-.||..+|..+...|-
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~G~~~~~Rr~al~~~~~~~~i~g~g~~~~~~l~eD~~l~~~~~~~G~ 218 (254)
T cd04191 152 LYGPVFGRGLAAWQGGEGNYWGHNAIIRVAAFMEHCALPVLPGRPPFGGHILSHDFVEAALMRRAGW 218 (254)
T ss_pred HHHHHHHHHHHHhcCCccCccceEEEEEHHHHHHhcCCccccCCCCCCCCeecHHHHHHHHHHHcCC
Confidence 0 00112356799999999998876432100 012246899999999988764
No 42
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=75.62 E-value=54 Score=28.53 Aligned_cols=89 Identities=21% Similarity=0.104 Sum_probs=54.0
Q ss_pred CceEEEEecCceEeeHHHHHHHh--cC-C--CCCCeEEEEeecCC-----------C----------CCCC--Ccccccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSL--VP-L--PREDLYYGYVIPCR-----------S----------MDPF--VDYMSGM 220 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L--~~-~--~~~~~y~G~~~p~r-----------~----------~~~y--p~y~~G~ 220 (322)
+++|++-.|||+.+..+.|...+ .. . ...-..+|...... . .... ..++.|+
T Consensus 75 ~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (237)
T cd02526 75 GADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLITS 154 (237)
T ss_pred CCCEEEEECCCCCcCHhHHHHHHHHHHhhccCCCeEEEeeeEEcCCCCeeccceeccCccceecccccCCceEeeeeecc
Confidence 56999999999999987777774 22 1 11111222211000 0 0001 1245577
Q ss_pred eeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 221 GYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 221 gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
|.++++++.+.+.-... ......||+.++.-+.+.|.
T Consensus 155 ~~~~rr~~~~~~ggfd~--~~~~~~eD~d~~~r~~~~G~ 191 (237)
T cd02526 155 GSLISLEALEKVGGFDE--DLFIDYVDTEWCLRARSKGY 191 (237)
T ss_pred ceEEcHHHHHHhCCCCH--HHcCccchHHHHHHHHHcCC
Confidence 88999999998754321 22356799999988887775
No 43
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=75.46 E-value=89 Score=30.94 Aligned_cols=166 Identities=13% Similarity=0.063 Sum_probs=85.8
Q ss_pred eeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHHHH
Q 020721 74 IRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYTYF 153 (322)
Q Consensus 74 ~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~~~ 153 (322)
+.+-|+|++--+...-++.|+.--.+. .....+.++.+-..++++. .+.+++-.+.+..+...-..++ .+|..+ +
T Consensus 49 P~vsVIIP~yNe~~~l~~~l~sl~~q~-yp~~~~eIiVVDd~StD~T-~~il~~~~~~~~~v~v~~~~~~--~Gka~A-l 123 (439)
T TIGR03111 49 PDITIIIPVYNSEDTLFNCIESIYNQT-YPIELIDIILANNQSTDDS-FQVFCRAQNEFPGLSLRYMNSD--QGKAKA-L 123 (439)
T ss_pred CCEEEEEEeCCChHHHHHHHHHHHhcC-CCCCCeEEEEEECCCChhH-HHHHHHHHHhCCCeEEEEeCCC--CCHHHH-H
Confidence 456777777654444455555543322 1223454544444444332 2222322344444432222222 256553 2
Q ss_pred HhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC---------------C------
Q 020721 154 SSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR---------------S------ 209 (322)
Q Consensus 154 ~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r---------------~------ 209 (322)
.+..+. .+.+|++..|+|..+..+.|.+.++.. |.-....|.+.+.. .
T Consensus 124 N~gl~~--------s~g~~v~~~DaD~~~~~d~L~~l~~~f~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~y 195 (439)
T TIGR03111 124 NAAIYN--------SIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAMTGVILTDKELIEKTKGRFLKLIRRCEYFEY 195 (439)
T ss_pred HHHHHH--------ccCCEEEEECCCCCcChHHHHHHHHHHHhCCCeEEEEeEEecCchhhhhhcchhhhHhHHhHHHHH
Confidence 222222 256899999999999988887776553 22223344443210 0
Q ss_pred ------------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHh
Q 020721 210 ------------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIRE 256 (322)
Q Consensus 210 ------------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~ 256 (322)
...-+..++|++..+.+++++.+.-.. ...-.||..++.=++.
T Consensus 196 ~~~~l~~r~~~s~~~~~~~~sGa~~~~Rr~~l~~vggf~----~~~i~ED~~l~~rl~~ 250 (439)
T TIGR03111 196 AQAFLAGRNFESQVNSLFTLSGAFSAFRRETILKTQLYN----SETVGEDTDMTFQIRE 250 (439)
T ss_pred HHHHHhhhHHHHhcCCeEEEccHHHhhhHHHHHHhCCCC----CCCcCccHHHHHHHHH
Confidence 000112357888889999887764322 1223899999876654
No 44
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=72.75 E-value=46 Score=26.44 Aligned_cols=66 Identities=20% Similarity=0.088 Sum_probs=39.3
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC----------------------C---CCCCCcccccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR----------------------S---MDPFVDYMSGM 220 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r----------------------~---~~~yp~y~~G~ 220 (322)
+.+|++..|+|..+....|.+.+... +.-....|...... . .......+.|.
T Consensus 78 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 157 (180)
T cd06423 78 KGDIVVVLDADTILEPDALKRLVVPFFADPKVGAVQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQSALGGVLVLSGA 157 (180)
T ss_pred CCCEEEEECCCCCcChHHHHHHHHHhccCCCeeeEeeeEEEecCcCcceeccchheecceeeeeeehhheecceeecCch
Confidence 67999999999999877666663332 22223333321100 0 01123456788
Q ss_pred eeeeCHHHHHHHHH
Q 020721 221 GYLVSWDIAEWIRD 234 (322)
Q Consensus 221 gYvlS~d~~~~i~~ 234 (322)
++++++++++.+..
T Consensus 158 ~~~~~~~~~~~~gg 171 (180)
T cd06423 158 FGAFRREALREVGG 171 (180)
T ss_pred HHHHHHHHHHHhCC
Confidence 88888888877653
No 45
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=65.96 E-value=1.6e+02 Score=30.04 Aligned_cols=185 Identities=14% Similarity=0.021 Sum_probs=94.8
Q ss_pred CCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHHHH
Q 020721 72 EDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKTYT 151 (322)
Q Consensus 72 ~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt~~ 151 (322)
+.+.+-|+|+..-+..--.+.|+..-.+... .+.+++++.. ..+++....+++=+.+|..+...-...+=.+.|..+
T Consensus 64 ~~p~vaIlIPA~NE~~vI~~~l~s~L~~ldY--~~~eIiVv~d-~ndd~T~~~v~~l~~~~p~v~~vv~~~~gp~~Ka~a 140 (504)
T PRK14716 64 PEKRIAIFVPAWREADVIGRMLEHNLATLDY--ENYRIFVGTY-PNDPATLREVDRLAARYPRVHLVIVPHDGPTSKADC 140 (504)
T ss_pred CCCceEEEEeccCchhHHHHHHHHHHHcCCC--CCeEEEEEEC-CCChhHHHHHHHHHHHCCCeEEEEeCCCCCCCHHHH
Confidence 3566778888775544344455544222221 2444444432 322222222322245566653322222222356653
Q ss_pred HHHhhhhHh-cCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEeecC-C---------------------
Q 020721 152 YFSSLPEIF-NTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPC-R--------------------- 208 (322)
Q Consensus 152 ~~~w~~~~~-~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~-r--------------------- 208 (322)
+.++.+.. ..+...-.++++++-.|-|..+..+.|...-...+..++.-..+.+. +
T Consensus 141 -LN~~l~~~~~~e~~~G~~~d~vvi~DAD~~v~Pd~Lr~~~~~~~~~~~VQ~pv~~~~~~~~~~~ag~y~~ef~~~~~~~ 219 (504)
T PRK14716 141 -LNWIYQAIFAFERERGIRFAIIVLHDAEDVIHPLELRLYNYLLPRHDFVQLPVFSLPRDWGEWVAGTYMDEFAESHLKD 219 (504)
T ss_pred -HHHHHHHHHHhhhhcCCCcCEEEEEcCCCCcCccHHHHHHhhcCCCCEEecceeccCCchhHHHHHHHHHHHHHHHHHH
Confidence 33332211 00000112579999999999999998876533333322111111000 0
Q ss_pred ----CCCCCCcccccceeeeCHHHHHHHHHcCC--CCCCCCCchhHHHHHHHHhcCCC
Q 020721 209 ----SMDPFVDYMSGMGYLVSWDIAEWIRDSDI--PKNHLEGPEDKVFGAWIREGRRA 260 (322)
Q Consensus 209 ----~~~~yp~y~~G~gYvlS~d~~~~i~~~~~--~~~~~~~~EDv~vG~~l~~~~~~ 260 (322)
..-..+-.+.|.|+.+++++++.+..... .-....-.||..+|.-+...|..
T Consensus 220 l~~r~~LG~~~~~~Gtg~afRR~aLe~l~~~~GG~~fd~~sLTED~dLglRL~~~G~r 277 (504)
T PRK14716 220 LPVREALGGLIPSAGVGTAFSRRALERLAAERGGQPFDSDSLTEDYDIGLRLKRAGFR 277 (504)
T ss_pred HHHHHhcCCccccCCeeEEeEHHHHHHHHhhcCCCCCCCCCcchHHHHHHHHHHCCCE
Confidence 00001224679999999999999864221 01234578999999999888763
No 46
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=65.00 E-value=83 Score=26.40 Aligned_cols=63 Identities=14% Similarity=0.111 Sum_probs=40.1
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCC-CCCeEEEEeec---CCC----------------------CCCCCccccccee
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLP-REDLYYGYVIP---CRS----------------------MDPFVDYMSGMGY 222 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~-~~~~y~G~~~p---~r~----------------------~~~yp~y~~G~gY 222 (322)
+.+|++-.|.|+.+..+.|...+.... ..++..|+... ..+ .-..+.++.|.++
T Consensus 81 ~~d~v~~~DaD~~~~p~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~ 160 (183)
T cd06438 81 DPDAVVVFDADNLVDPNALEELNARFAAGARVVQAYYNSKNPDDSWITRLYAFAFLVFNRLRPLGRSNLGLSCQLGGTGM 160 (183)
T ss_pred CCCEEEEEcCCCCCChhHHHHHHHHHhhCCCeeEEEEeeeCCccCHHHHHHHHHHHHHHHHHHHHHHHcCCCeeecCchh
Confidence 579999999999999877777665542 23455565421 100 0011235677778
Q ss_pred eeCHHHHHH
Q 020721 223 LVSWDIAEW 231 (322)
Q Consensus 223 vlS~d~~~~ 231 (322)
++++++++.
T Consensus 161 ~~rr~~l~~ 169 (183)
T cd06438 161 CFPWAVLRQ 169 (183)
T ss_pred hhHHHHHHh
Confidence 888888777
No 47
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=63.90 E-value=94 Score=26.63 Aligned_cols=65 Identities=14% Similarity=-0.052 Sum_probs=43.0
Q ss_pred CceEEEEecCceEeeHHHHHHHhcC--CCCCCeEEEEee-cCCC----------------------CCCCCcccccceee
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVP--LPREDLYYGYVI-PCRS----------------------MDPFVDYMSGMGYL 223 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~--~~~~~~y~G~~~-p~r~----------------------~~~yp~y~~G~gYv 223 (322)
..+|++..|+|..+..+.|...++. .+..++..|... .... ...-...+.|++.+
T Consensus 78 ~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (224)
T cd06442 78 RGDVIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKRKLISRGANLLARLLLGRKVSDPTSGFRA 157 (224)
T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHhcCCCCEEEEeeeecCCccCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCCccch
Confidence 3499999999999998888777765 234455555432 1000 01112346777888
Q ss_pred eCHHHHHHHH
Q 020721 224 VSWDIAEWIR 233 (322)
Q Consensus 224 lS~d~~~~i~ 233 (322)
+++++++.+.
T Consensus 158 ~~r~~~~~ig 167 (224)
T cd06442 158 YRREVLEKLI 167 (224)
T ss_pred hhHHHHHHHh
Confidence 9999999987
No 48
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=61.73 E-value=2.3e+02 Score=30.38 Aligned_cols=184 Identities=13% Similarity=0.043 Sum_probs=99.9
Q ss_pred CCCCeeEEEEEECCCCCHHHHHHHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCCeEEeccCCcCCchHH
Q 020721 70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDDIIILNCKENMNKGKT 149 (322)
Q Consensus 70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DIv~ld~~D~y~n~Kt 149 (322)
..++..+-|+|+-.-+...-.+.|...-....-. +++++++.. ++++.....+++-.++|.++..+.....-+++|.
T Consensus 59 ~~~~~~vsIlVPa~nE~~vi~~~i~~ll~~ldYP--~~eI~vi~~-~nD~~T~~~~~~l~~~~p~~~~v~~~~~g~~gKa 135 (727)
T PRK11234 59 KPDEKPLAIMVPAWNETGVIGNMAELAATTLDYE--NYHIFVGTY-PNDPATQADVDAVCARFPNVHKVVCARPGPTSKA 135 (727)
T ss_pred cCCCCCEEEEEecCcchhhHHHHHHHHHHhCCCC--CeEEEEEec-CCChhHHHHHHHHHHHCCCcEEEEeCCCCCCCHH
Confidence 3455778888988755554444555432222211 354555543 3333333345555567777643333332245776
Q ss_pred HHHHHhhhhHh-cCCCCCCC--CceEEEEecCceEeeHHHHHHHhcCC-CCCCeEEEEeecC-C----------------
Q 020721 150 YTYFSSLPEIF-NTSDRPYP--PYHYVMKGDDDTYIRLENLVKSLVPL-PREDLYYGYVIPC-R---------------- 208 (322)
Q Consensus 150 ~~~~~w~~~~~-~~~~~~c~--~~~yvlK~DDD~fv~~~~L~~~L~~~-~~~~~y~G~~~p~-r---------------- 208 (322)
.+ +.++.... ..+ .++ .++.++-.|-|+-+..+.|. .+..+ ++.+..-+...|. +
T Consensus 136 ~a-LN~~l~~~~~~e--~~~~~~~~vvvi~DAD~~v~pd~L~-~~~~l~~~~~~VQ~p~~p~~~~~~~~~~~~~~~EFa~ 211 (727)
T PRK11234 136 DC-LNNVLDAITQFE--RSANFAFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTSGTYIDEFAE 211 (727)
T ss_pred HH-HHHHHHHHHhhh--cccCCcccEEEEEcCCCCCChhHHH-HHHhhcCCCCeEeecccCCCccHHHHHHHHHHHHHHH
Confidence 54 33333321 100 112 34567779999999999997 22222 1112211111110 0
Q ss_pred ---------CCCCCCcccccceeeeCHHHHHHHHHcCC--CCCCCCCchhHHHHHHHHhcCCC
Q 020721 209 ---------SMDPFVDYMSGMGYLVSWDIAEWIRDSDI--PKNHLEGPEDKVFGAWIREGRRA 260 (322)
Q Consensus 209 ---------~~~~yp~y~~G~gYvlS~d~~~~i~~~~~--~~~~~~~~EDv~vG~~l~~~~~~ 260 (322)
..-..+-.++|.|..+||.+++.+.+... .-....--||.-+|.-+...|..
T Consensus 212 ~~~~~~~~~~~lgg~~~l~G~~~af~Rr~l~al~~~ggg~~~~~~~lTED~dlg~rL~~~G~~ 274 (727)
T PRK11234 212 LHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLEDGDGIAFDVQSLTEDYDIGFRLKEKGMR 274 (727)
T ss_pred HhhhhhHHHHHcCCCcccCCceEEEecccHHHHHHhcCCCCcCCCcchHHHHHHHHHHHCCCE
Confidence 00012345889999999988776666430 12455678999999999988864
No 49
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=61.33 E-value=99 Score=26.00 Aligned_cols=88 Identities=18% Similarity=0.181 Sum_probs=52.2
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee----------cCC-C-C--CCCCcc-cccceeeeCHHHHH
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI----------PCR-S-M--DPFVDY-MSGMGYLVSWDIAE 230 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~----------p~r-~-~--~~yp~y-~~G~gYvlS~d~~~ 230 (322)
..+|++..|+|..+..+.|...++.. +.-++.+|... +.. + . ..+-.. +.|.+-++++++++
T Consensus 83 ~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 162 (202)
T cd04184 83 TGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSDEDKIDEGGKRSEPFFKPDWSPDLLLSQNYIGHLLVYRRSLVR 162 (202)
T ss_pred cCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEEEccHHhccCCCCEeccccCCCCCHHHhhhcCCccceEeEEHHHHH
Confidence 56999999999999887777766553 33344443221 000 0 0 000111 23455578888888
Q ss_pred HHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 231 WIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 231 ~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.+.--. ......||..++.-+.+.|.
T Consensus 163 ~iggf~---~~~~~~eD~~l~~rl~~~g~ 188 (202)
T cd04184 163 QVGGFR---EGFEGAQDYDLVLRVSEHTD 188 (202)
T ss_pred HhCCCC---cCcccchhHHHHHHHHhccc
Confidence 775432 12446799988887766654
No 50
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=58.19 E-value=2.6e+02 Score=29.79 Aligned_cols=87 Identities=14% Similarity=0.018 Sum_probs=54.2
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEee---cC---C---------------------CCCC-CCccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVI---PC---R---------------------SMDP-FVDYM 217 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~---p~---r---------------------~~~~-yp~y~ 217 (322)
+.+|++-.|.|+.+..+.|.+.+... |.=.+..+... |. + ..+. -..++
T Consensus 228 ~gd~Il~lDAD~v~~pd~L~~~v~~f~~dp~v~~Vqtp~~f~~p~~~~~nl~~~~~~~~e~~~f~~~i~~g~~~~~~~~~ 307 (713)
T TIGR03030 228 DGELILIFDADHVPTRDFLQRTVGWFVEDPKLFLVQTPHFFVSPDPIERNLGTFRRMPNENELFYGLIQDGNDFWNAAFF 307 (713)
T ss_pred CCCEEEEECCCCCcChhHHHHHHHHHHhCCCEEEEeCCeeccCCCHHhhhhHHHHHhhhHHHHHHHHHHHHHhhhCCeee
Confidence 56999999999999988777765442 22011111100 00 0 0000 12467
Q ss_pred ccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 218 SGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 218 ~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.|++.++.+++.+.+---.. ..-.||..+|.-+.+.|-
T Consensus 308 ~Gs~~~iRR~al~~iGGf~~----~~vtED~~l~~rL~~~G~ 345 (713)
T TIGR03030 308 CGSAAVLRREALDEIGGIAG----ETVTEDAETALKLHRRGW 345 (713)
T ss_pred cCceeEEEHHHHHHcCCCCC----CCcCcHHHHHHHHHHcCC
Confidence 79999999999998753221 123799999999987764
No 51
>PF05412 Peptidase_C33: Equine arterivirus Nsp2-type cysteine proteinase; InterPro: IPR008743 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases corresponds to MEROPS peptidase family C33 (clan CA). The type example is equine arteritis virus Nsp2-type cysteine proteinase, which is involved in viral polyprotein processing [].; GO: 0016032 viral reproduction, 0019082 viral protein processing
Probab=54.17 E-value=10 Score=30.14 Aligned_cols=18 Identities=28% Similarity=0.523 Sum_probs=15.8
Q ss_pred CCCCCCceEEEEecCceE
Q 020721 164 DRPYPPYHYVMKGDDDTY 181 (322)
Q Consensus 164 ~~~c~~~~yvlK~DDD~f 181 (322)
+..||+++|++|.|++=+
T Consensus 63 ~~~Cp~ArYv~~l~~qHW 80 (108)
T PF05412_consen 63 NGACPHARYVLKLDGQHW 80 (108)
T ss_pred CCCCCCCEEEEEecCceE
Confidence 467999999999999866
No 52
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=53.54 E-value=2e+02 Score=27.19 Aligned_cols=146 Identities=10% Similarity=0.077 Sum_probs=75.3
Q ss_pred eEEEEEECCCCCHHHHHHHHHHhc--cCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC-eEEeccCCcCCchHHHH
Q 020721 75 RILIGILTLPDQYHRRHFLRMIYG--TQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD-IIILNCKENMNKGKTYT 151 (322)
Q Consensus 75 ~lli~V~S~~~~~~rR~aIR~TW~--~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D-Iv~ld~~D~y~n~Kt~~ 151 (322)
.+-|+|++--+...-.+.+++.=. .+ ....+.+++|-..+++.... .+++-++.+++ ++......++ +|.-+
T Consensus 7 ~vSVVIP~yNE~~~i~~~l~~l~~~~~~--~~~~~EIIvVDDgS~D~T~~-il~~~~~~~~~~v~~i~~~~n~--G~~~A 81 (325)
T PRK10714 7 KVSVVIPVYNEQESLPELIRRTTAACES--LGKEYEILLIDDGSSDNSAE-MLVEAAQAPDSHIVAILLNRNY--GQHSA 81 (325)
T ss_pred eEEEEEcccCchhhHHHHHHHHHHHHHh--CCCCEEEEEEeCCCCCcHHH-HHHHHHhhcCCcEEEEEeCCCC--CHHHH
Confidence 466667765433322333333211 11 12346677776656554332 22222333444 4444444444 34332
Q ss_pred HHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCCCC-CCeEEEEeec-CC------------------CCC
Q 020721 152 YFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPLPR-EDLYYGYVIP-CR------------------SMD 211 (322)
Q Consensus 152 ~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~~~-~~~y~G~~~p-~r------------------~~~ 211 (322)
...-+ +. .+.+|++-.|+|.-.+++.+.+.++.... -++..|.... .. ...
T Consensus 82 ~~~G~-~~--------A~gd~vv~~DaD~q~~p~~i~~l~~~~~~~~DvV~~~r~~~~~~~~r~~~s~~~~~l~~~~~g~ 152 (325)
T PRK10714 82 IMAGF-SH--------VTGDLIITLDADLQNPPEEIPRLVAKADEGYDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGK 152 (325)
T ss_pred HHHHH-Hh--------CCCCEEEEECCCCCCCHHHHHHHHHHHHhhCCEEEEEEcCCCCcHHHHHHHHHHHHHHHHHcCC
Confidence 21111 11 15699999999999999888888776532 2455454321 00 122
Q ss_pred CCCccccccee-eeCHHHHHHHHHcC
Q 020721 212 PFVDYMSGMGY-LVSWDIAEWIRDSD 236 (322)
Q Consensus 212 ~yp~y~~G~gY-vlS~d~~~~i~~~~ 236 (322)
.++... +|| ++++++++.+....
T Consensus 153 ~~~d~~--~gfr~~~r~~~~~l~~~~ 176 (325)
T PRK10714 153 AMGDYG--CMLRAYRRHIVDAMLHCH 176 (325)
T ss_pred CCCCCC--cCeEEEcHHHHHHHHHCC
Confidence 233333 344 89999999997543
No 53
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=53.11 E-value=1.6e+02 Score=31.35 Aligned_cols=181 Identities=15% Similarity=0.105 Sum_probs=97.1
Q ss_pred CCCCeeEEEEEECCCCCHH-HHHHHHHHhccCC--CCCCceEEEEEecCCCchhHHH----HHHHHHhhCC---CeEEec
Q 020721 70 SSEDIRILIGILTLPDQYH-RRHFLRMIYGTQS--PTGAQVDVKFVFCNLTKEDQKV----LVALEIMRYD---DIIILN 139 (322)
Q Consensus 70 ~~~~~~lli~V~S~~~~~~-rR~aIR~TW~~~~--~~~~~v~~~Fvlg~~~~~~~~~----~l~~E~~~~~---DIv~ld 139 (322)
.....++.|+|++.-+..+ -+..++.++.+-. ....++++ |++....+++... .+.+=.++++ .|....
T Consensus 120 ~~~~~~VaVliP~yNEd~~~v~~~L~a~~~Sl~~~~~~~~~e~-~vLdD~~d~~~~~~e~~~~~~L~~~~~~~~~i~yr~ 198 (691)
T PRK05454 120 PPPEARTAILMPIYNEDPARVFAGLRAMYESLAATGHGAHFDF-FILSDTRDPDIAAAEEAAWLELRAELGGEGRIFYRR 198 (691)
T ss_pred CCCCCceEEEEeCCCCChHHHHHHHHHHHHHHHhcCCCCCEEE-EEEECCCChhHHHHHHHHHHHHHHhcCCCCcEEEEE
Confidence 3455788899999976653 3457777775321 11235655 8888776654321 1222223332 344433
Q ss_pred cCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC--C-----
Q 020721 140 CKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR--S----- 209 (322)
Q Consensus 140 ~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r--~----- 209 (322)
-.+|- ++|.-..-.++... + .+++|++-.|-|+.+..+.|.+.+..+ |+-.+.-+...... +
T Consensus 199 R~~n~-~~KaGNl~~~~~~~-~------~~~eyivvLDADs~m~~d~L~~lv~~m~~dP~vGlVQt~~~~~n~~slfaR~ 270 (691)
T PRK05454 199 RRRNV-GRKAGNIADFCRRW-G------GAYDYMVVLDADSLMSGDTLVRLVRLMEANPRAGLIQTLPVAVGADTLFARL 270 (691)
T ss_pred CCcCC-CccHHHHHHHHHhc-C------CCcCEEEEEcCCCCCCHHHHHHHHHHHhhCcCEEEEeCCccCcCCCCHHHHH
Confidence 33332 24554433333321 1 267999999999999999888887653 22111111111000 0
Q ss_pred ----CCCC--------------CcccccceeeeCHHHHHHHHHcCCC--CC---CCCCchhHHHHHHHHhcCC
Q 020721 210 ----MDPF--------------VDYMSGMGYLVSWDIAEWIRDSDIP--KN---HLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 210 ----~~~y--------------p~y~~G~gYvlS~d~~~~i~~~~~~--~~---~~~~~EDv~vG~~l~~~~~ 259 (322)
.+.| -....|..-++.+++......-... .. ...-.||...|..++..|-
T Consensus 271 qqf~~~~y~~~~~~G~~~w~~~~g~f~G~naIiR~~af~~~~glp~L~g~~p~~~~~LseD~~~a~~l~~~Gy 343 (691)
T PRK05454 271 QQFATRVYGPLFAAGLAWWQGGEGNYWGHNAIIRVKAFAEHCGLPPLPGRGPFGGHILSHDFVEAALMRRAGW 343 (691)
T ss_pred HHHHHHHHHHHHHhhhhhhccCccccccceEEEEHHHHHHhcCCccccccCCCCCCcccHHHHHHHHHHHCCC
Confidence 0000 0112356678888877765421100 11 2346789999999998765
No 54
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=50.80 E-value=35 Score=30.59 Aligned_cols=91 Identities=18% Similarity=0.216 Sum_probs=55.3
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCC-------------------------CCCCCCcccccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCR-------------------------SMDPFVDYMSGM 220 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r-------------------------~~~~yp~y~~G~ 220 (322)
+.+|++.+|.|+.+..+.|.+.+..+ |.-....|...+.. +.-.+...+.|+
T Consensus 73 ~~e~i~~~DaD~~~~~~~l~~l~~~~~~~p~vg~v~g~~~~~~~~~~~~~~~q~~ey~~~~~~~~~~~s~~g~~~~~~G~ 152 (244)
T cd04190 73 DPEFILLVDADTKFDPDSIVQLYKAMDKDPEIGGVCGEIHPMGKKQGPLVMYQVFEYAISHWLDKAFESVFGFVTCLPGC 152 (244)
T ss_pred CCCEEEEECCCCcCCHhHHHHHHHHHHhCCCEEEEEeeeEEcCCcchhHHHhHheehhhhhhhcccHHHcCCceEECCCc
Confidence 67999999999999988877666443 33234455543210 001134456788
Q ss_pred eeeeCHHHHHHHHHcCCC-------C---------CCCCCchhHHHHHHHHhcCC
Q 020721 221 GYLVSWDIAEWIRDSDIP-------K---------NHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 221 gYvlS~d~~~~i~~~~~~-------~---------~~~~~~EDv~vG~~l~~~~~ 259 (322)
+.++.+++++.+...... + ....-.||..++..+...|.
T Consensus 153 ~~~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ED~~l~~~l~~~G~ 207 (244)
T cd04190 153 FSMYRIEALKGDNGGKGPLLDYAYLTNTVDSLHKKNNLDLGEDRILCTLLLKAGP 207 (244)
T ss_pred eEEEEehhhcCCccccccchhhccccCcccchHHHHHHhHhcccceeHHHhccCC
Confidence 888988887765321100 0 01224688888888876654
No 55
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=49.09 E-value=1.8e+02 Score=25.24 Aligned_cols=87 Identities=11% Similarity=-0.022 Sum_probs=54.9
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCCC--CeEEEEe-ecCCC---------------------C-CC-CCccccccee
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPRE--DLYYGYV-IPCRS---------------------M-DP-FVDYMSGMGY 222 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~--~~y~G~~-~p~r~---------------------~-~~-yp~y~~G~gY 222 (322)
+++|++..|+|+.+..+.|.+.+...... ++..|.. ..... . .. --.++.|++.
T Consensus 84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 163 (236)
T cd06435 84 DAEIIAVIDADYQVEPDWLKRLVPIFDDPRVGFVQAPQDYRDGEESLFKRMCYAEYKGFFDIGMVSRNERNAIIQHGTMC 163 (236)
T ss_pred CCCEEEEEcCCCCcCHHHHHHHHHHhcCCCeeEEecCccccCCCccHHHHHHhHHHHHHHHHHhccccccCceEEecceE
Confidence 36999999999999998888877654322 2222211 00000 0 00 0124678888
Q ss_pred eeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 223 LVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 223 vlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
++++++.+.+.-... ....||.-++.=+.+.|.
T Consensus 164 ~~rr~~~~~iGgf~~----~~~~eD~dl~~r~~~~G~ 196 (236)
T cd06435 164 LIRRSALDDVGGWDE----WCITEDSELGLRMHEAGY 196 (236)
T ss_pred EEEHHHHHHhCCCCC----ccccchHHHHHHHHHCCc
Confidence 999999998854321 235899999987777664
No 56
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=48.40 E-value=1.9e+02 Score=25.49 Aligned_cols=168 Identities=12% Similarity=-0.012 Sum_probs=84.1
Q ss_pred CeeEEEEEECCCCCHHHHH---HHHHHhccCCCCCCceEEEEEecCCCchhHHHHHHHHHhhCCC--eEEeccCCcCCch
Q 020721 73 DIRILIGILTLPDQYHRRH---FLRMIYGTQSPTGAQVDVKFVFCNLTKEDQKVLVALEIMRYDD--IIILNCKENMNKG 147 (322)
Q Consensus 73 ~~~lli~V~S~~~~~~rR~---aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~~D--Iv~ld~~D~y~n~ 147 (322)
.+.+-|+|++--....-.. .+++.-... ..+.+++|-..+++... +.+++-.++|++ +....-..+..
T Consensus 8 ~~~vsVvIp~yne~~~l~~~l~~l~~~~~~~----~~~eiivvDdgS~D~t~-~i~~~~~~~~~~~~v~~~~~~~n~G-- 80 (243)
T PLN02726 8 AMKYSIIVPTYNERLNIALIVYLIFKALQDV----KDFEIIVVDDGSPDGTQ-DVVKQLQKVYGEDRILLRPRPGKLG-- 80 (243)
T ss_pred CceEEEEEccCCchhhHHHHHHHHHHHhccC----CCeEEEEEeCCCCCCHH-HHHHHHHHhcCCCcEEEEecCCCCC--
Confidence 4567777887643222222 233322111 25566666655554432 233333344543 33332223222
Q ss_pred HHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC--CCCCeEEEEeecC----------CC------
Q 020721 148 KTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL--PREDLYYGYVIPC----------RS------ 209 (322)
Q Consensus 148 Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~--~~~~~y~G~~~p~----------r~------ 209 (322)
|..+ +...... .+.+|++-.|+|..+.++.|.+.+... +.-++..|..... +.
T Consensus 81 ~~~a-~n~g~~~--------a~g~~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~r~~~~~~~ 151 (243)
T PLN02726 81 LGTA-YIHGLKH--------ASGDFVVIMDADLSHHPKYLPSFIKKQRETGADIVTGTRYVKGGGVHGWDLRRKLTSRGA 151 (243)
T ss_pred HHHH-HHHHHHH--------cCCCEEEEEcCCCCCCHHHHHHHHHHHHhcCCcEEEEccccCCCCcCCccHHHHHHHHHH
Confidence 2221 1222211 145999999999999988887776543 2335566643210 00
Q ss_pred -------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 210 -------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 210 -------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
....-+...|++.++++++++.+..... . ....+|..+..-+...|.
T Consensus 152 ~~~~~~~~~~~~~d~~g~~~~~rr~~~~~i~~~~~--~-~~~~~~~el~~~~~~~g~ 205 (243)
T PLN02726 152 NVLAQTLLWPGVSDLTGSFRLYKRSALEDLVSSVV--S-KGYVFQMEIIVRASRKGY 205 (243)
T ss_pred HHHHHHHhCCCCCcCCCcccceeHHHHHHHHhhcc--C-CCcEEehHHHHHHHHcCC
Confidence 0001123567778999999999975321 1 122345656555555553
No 57
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=45.95 E-value=2.7e+02 Score=26.45 Aligned_cols=174 Identities=13% Similarity=-0.014 Sum_probs=87.0
Q ss_pred CCCCeeEEEEEECCCCCHHHHHHHHHHhccCC-----CCCCceEEEEEecCCCchhHHHHHHHHHhhC----CCeEEecc
Q 020721 70 SSEDIRILIGILTLPDQYHRRHFLRMIYGTQS-----PTGAQVDVKFVFCNLTKEDQKVLVALEIMRY----DDIIILNC 140 (322)
Q Consensus 70 ~~~~~~lli~V~S~~~~~~rR~aIR~TW~~~~-----~~~~~v~~~Fvlg~~~~~~~~~~l~~E~~~~----~DIv~ld~ 140 (322)
...++.+-|+|+.--+...-.+.++++...-. .......+++|-..+++.... .+++-.+.+ .++-.+..
T Consensus 66 ~~~~~~isVVIP~yNe~~~i~~~L~~l~~~~~~~~~~~~~~~~EIIVVDDgStD~T~~-i~~~~~~~~~~~~~~i~vi~~ 144 (333)
T PTZ00260 66 KDSDVDLSIVIPAYNEEDRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLK-VAKDFWRQNINPNIDIRLLSL 144 (333)
T ss_pred CCCCeEEEEEEeeCCCHHHHHHHHHHHHHHHHhhhccCCCCCEEEEEEeCCCCCchHH-HHHHHHHhcCCCCCcEEEEEc
Confidence 45567788888877554444456666664210 112245666665545543322 222222232 13555544
Q ss_pred CCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHHHHHHhcCC-----CCCCeEEEEeec---C-----
Q 020721 141 KENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLENLVKSLVPL-----PREDLYYGYVIP---C----- 207 (322)
Q Consensus 141 ~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~L~~~L~~~-----~~~~~y~G~~~p---~----- 207 (322)
..+.. |..+.-.-+ . +...+|++-+|.|....++.+.+.++.. +.-++..|.... .
T Consensus 145 ~~N~G--~~~A~~~Gi-~--------~a~gd~I~~~DaD~~~~~~~l~~l~~~l~~~~~~~~dvV~GsR~~~~~~~~~~~ 213 (333)
T PTZ00260 145 LRNKG--KGGAVRIGM-L--------ASRGKYILMVDADGATDIDDFDKLEDIMLKIEQNGLGIVFGSRNHLVDSDVVAK 213 (333)
T ss_pred CCCCC--hHHHHHHHH-H--------HccCCEEEEEeCCCCCCHHHHHHHHHHHHHhhccCCceEEeeccccccCccccc
Confidence 44433 332211111 1 1245999999999998877755444332 233567775421 0
Q ss_pred CC------------------CCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 208 RS------------------MDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 208 r~------------------~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
++ ....... ..+.-++++++++.+.... ....+ .-|+.+-..+.+.|.
T Consensus 214 ~~~~r~~~~~~~~~l~~~~~~~~i~D~-~~Gfk~~~r~~~~~i~~~~--~~~~~-~fd~Ell~~a~~~g~ 279 (333)
T PTZ00260 214 RKWYRNILMYGFHFIVNTICGTNLKDT-QCGFKLFTRETARIIFPSL--HLERW-AFDIEIVMIAQKLNL 279 (333)
T ss_pred CcHHHHHHHHHHHHHHHHHcCCCcccC-CCCeEEEeHHHHHHHhhhc--cccCc-cchHHHHHHHHHcCC
Confidence 00 1111111 1223488999999986532 11112 235666555655554
No 58
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=42.68 E-value=1.9e+02 Score=23.78 Aligned_cols=88 Identities=11% Similarity=-0.009 Sum_probs=57.2
Q ss_pred CceEEEEecCceEeeHHHHHHHhcC---CCCCCeEEEEeec---CC--C----------CC-CCCcccccceeeeCHHHH
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVP---LPREDLYYGYVIP---CR--S----------MD-PFVDYMSGMGYLVSWDIA 229 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~---~~~~~~y~G~~~p---~r--~----------~~-~yp~y~~G~gYvlS~d~~ 229 (322)
+.+|++-.|+|..+..+.+...+.. .+.-.+..|.... .. . .. ....++.|++.++++++.
T Consensus 75 ~~~~v~~ld~D~~~~~~~~~~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (202)
T cd06433 75 TGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLYGMPICHQATFFRRSLF 154 (202)
T ss_pred CCCEEEEeCCCcccCchHHHHHHHHHHhCCCccEEEeeeEEEcCCCCcccCCCCcchhhhHHhhcCcccCcceEEEHHHH
Confidence 5699999999999998888877632 3444566665421 00 0 00 123467788899999999
Q ss_pred HHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 230 EWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 230 ~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
+.+..-. ......||..+..-+.+.|.
T Consensus 155 ~~~~~f~---~~~~~~~D~~~~~r~~~~g~ 181 (202)
T cd06433 155 EKYGGFD---ESYRIAADYDLLLRLLLAGK 181 (202)
T ss_pred HHhCCCc---hhhCchhhHHHHHHHHHcCC
Confidence 8885422 12234689887777766654
No 59
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=42.19 E-value=2.2e+02 Score=24.24 Aligned_cols=87 Identities=13% Similarity=0.034 Sum_probs=53.2
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCCCCeEEEEee--cCC-----------C--CCCCCcc-cccceeeeCHHHHHHH
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDLYYGYVI--PCR-----------S--MDPFVDY-MSGMGYLVSWDIAEWI 232 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~--p~r-----------~--~~~yp~y-~~G~gYvlS~d~~~~i 232 (322)
..+|++..|+|..+..+.+.+.+......+...|... ... . ...+... .++.|..+++++.+.+
T Consensus 72 ~~~~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~ 151 (221)
T cd02522 72 RGDWLLFLHADTRLPPDWDAAIIETLRADGAVAGAFRLRFDDPGPRLRLLELGANLRSRLFGLPYGDQGLFIRRELFEEL 151 (221)
T ss_pred cCCEEEEEcCCCCCChhHHHHHHHHhhcCCcEEEEEEeeecCCccchhhhhhcccceecccCCCcCCceEEEEHHHHHHh
Confidence 4599999999999988777776555433333333321 000 0 0000011 2356889999998887
Q ss_pred HHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 233 RDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 233 ~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
..... ....||..++.=+.+.|.
T Consensus 152 G~fd~----~~~~ED~d~~~r~~~~G~ 174 (221)
T cd02522 152 GGFPE----LPLMEDVELVRRLRRRGR 174 (221)
T ss_pred CCCCc----cccccHHHHHHHHHhCCC
Confidence 54332 127899999877777764
No 60
>PHA01631 hypothetical protein
Probab=38.74 E-value=1.1e+02 Score=26.43 Aligned_cols=59 Identities=17% Similarity=0.265 Sum_probs=36.7
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCCCCCCe----EEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPLPREDL----YYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDS 235 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~~~~~~----y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~ 235 (322)
+-|.++-+|.|++++.-. +. .++..+ |.-... +.+..-+||.|.-+++.+..+..+...
T Consensus 71 ~DDi~~iIDSDV~ipn~~--~~---~~~~~v~t~CiPA~~k---p~~~v~~FC~sTNf~~pr~~l~~l~~v 133 (176)
T PHA01631 71 EDDIIAIIDSDLIIPNLR--EI---IPNERVFTPCYWLYYD---WANEIRPFCSGTNYIFRKSLLPYLEYT 133 (176)
T ss_pred CccEEEEeccceEecCcc--cc---ccCCCccceeeeeeec---CCCcEEEEEccccEEeeHHHhHHHHHH
Confidence 347888899999976432 21 222221 222211 223345899999999999988887653
No 61
>PF09680 Tiny_TM_bacill: Protein of unknown function (Tiny_TM_bacill); InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=35.36 E-value=39 Score=19.53 Aligned_cols=17 Identities=29% Similarity=0.465 Sum_probs=11.9
Q ss_pred chhHHHHHHHHHHHHHH
Q 020721 11 RQFIVWSFFFILFLCVI 27 (322)
Q Consensus 11 ~~~~~~~~~~i~~~~~~ 27 (322)
..|++..++||+++.+-
T Consensus 4 ~~FalivVLFILLiIvG 20 (24)
T PF09680_consen 4 SGFALIVVLFILLIIVG 20 (24)
T ss_pred ccchhHHHHHHHHHHhc
Confidence 35778888888776653
No 62
>TIGR01732 tiny_TM_bacill conserved hypothetical tiny transmembrane protein. This model represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=34.17 E-value=44 Score=19.69 Aligned_cols=18 Identities=33% Similarity=0.451 Sum_probs=13.5
Q ss_pred chhHHHHHHHHHHHHHHH
Q 020721 11 RQFIVWSFFFILFLCVIA 28 (322)
Q Consensus 11 ~~~~~~~~~~i~~~~~~~ 28 (322)
..|+++.++||+++.+-.
T Consensus 6 ~gf~livVLFILLIIiga 23 (26)
T TIGR01732 6 GGFALIVVLFILLVIVGA 23 (26)
T ss_pred cchHHHHHHHHHHHHhhe
Confidence 468888888888877654
No 63
>KOG3588 consensus Chondroitin synthase 1 [Carbohydrate transport and metabolism]
Probab=33.77 E-value=12 Score=36.42 Aligned_cols=67 Identities=19% Similarity=0.196 Sum_probs=42.2
Q ss_pred eCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCCCCccccCccceeecCC------CCCcccccccCCEEEEEccCCH
Q 020721 224 VSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRRAKNRYNAKWSMYNFPV------PRTACTHELWADTIAVHLLKNQ 297 (322)
Q Consensus 224 lS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~~~~~~~~~~~f~~~~~------~~~~c~~~~~~~~ia~H~~~~~ 297 (322)
||++++..|.+- +...+-.+|||.+|+|.++.-. ..|+.--. ..+ +....+.|+.|.+|++
T Consensus 1 Msr~~l~hl~~c--L~~m~t~hedve~grc~r~~mq--------~Lf~~n~~ank~sy~~~---~~~~~~aitlhp~K~p 67 (494)
T KOG3588|consen 1 MSRDTLLHLESC--LQHMLTSHEDVELGRCIRKHMQ--------KLFHNNQSANKESYAKN---MKELKDAITLHPIKDP 67 (494)
T ss_pred CcHHHHHHHHHH--HHHHHhhccCcchhHHHHHHHH--------HHhhhhhhhcChhhhcc---chhhhhhhccccccCH
Confidence 688888877653 3456678999999999988511 11211100 011 1123678999999999
Q ss_pred HHHHHH
Q 020721 298 EKWIET 303 (322)
Q Consensus 298 ~~~~~~ 303 (322)
-.+.++
T Consensus 68 ~~~~r~ 73 (494)
T KOG3588|consen 68 AVMRRV 73 (494)
T ss_pred HHHHHH
Confidence 844443
No 64
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=32.26 E-value=3.1e+02 Score=23.36 Aligned_cols=139 Identities=15% Similarity=-0.003 Sum_probs=73.2
Q ss_pred ceEEEEEecCCCchhHHHHHHHHHhhCCCe-EEeccCCcCCchHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeH
Q 020721 106 QVDVKFVFCNLTKEDQKVLVALEIMRYDDI-IILNCKENMNKGKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRL 184 (322)
Q Consensus 106 ~v~~~Fvlg~~~~~~~~~~l~~E~~~~~DI-v~ld~~D~y~n~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~ 184 (322)
.+.++.+-+.+++.. ...+++..++++.. ..+...++. ++.-+.-.-+.. ...+|++-.|+|.....
T Consensus 30 ~~eiivvdd~S~D~t-~~~~~~~~~~~~~~i~~i~~~~n~--G~~~a~~~g~~~---------a~gd~i~~ld~D~~~~~ 97 (211)
T cd04188 30 SYEIIVVDDGSKDGT-AEVARKLARKNPALIRVLTLPKNR--GKGGAVRAGMLA---------ARGDYILFADADLATPF 97 (211)
T ss_pred CEEEEEEeCCCCCch-HHHHHHHHHhCCCcEEEEEcccCC--CcHHHHHHHHHH---------hcCCEEEEEeCCCCCCH
Confidence 455666655555433 23444445556654 334444433 233222222221 14599999999999998
Q ss_pred HHHHHHhcCC--CCCCeEEEEeecC-------CC---------------CCCCCcc--cccceeeeCHHHHHHHHHcCCC
Q 020721 185 ENLVKSLVPL--PREDLYYGYVIPC-------RS---------------MDPFVDY--MSGMGYLVSWDIAEWIRDSDIP 238 (322)
Q Consensus 185 ~~L~~~L~~~--~~~~~y~G~~~p~-------r~---------------~~~yp~y--~~G~gYvlS~d~~~~i~~~~~~ 238 (322)
+.+.+.++.. +..++..|..... ++ .-.-+++ ...+..++++++++.+....
T Consensus 98 ~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~g~~~~~r~~~~~~~~~~-- 175 (211)
T cd04188 98 EELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFKLFTRDAARRLFPRL-- 175 (211)
T ss_pred HHHHHHHHHHhccCCcEEEEEeeccCCcccccccHHHHHHHHHHHHHHHHHcCCCCcccccCceeEcHHHHHHHHhhh--
Confidence 8888877752 3446667754310 00 0000122 12245688999998886432
Q ss_pred CCCCCCchhHHHHHHHHhcCC
Q 020721 239 KNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 239 ~~~~~~~EDv~vG~~l~~~~~ 259 (322)
... ...+|..+..-+.+.|.
T Consensus 176 ~~~-~~~~d~el~~r~~~~g~ 195 (211)
T cd04188 176 HLE-RWAFDVELLVLARRLGY 195 (211)
T ss_pred hcc-ceEeeHHHHHHHHHcCC
Confidence 112 23357766555555553
No 65
>PF11395 DUF2873: Protein of unknown function (DUF2873); InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=29.76 E-value=44 Score=21.43 Aligned_cols=19 Identities=16% Similarity=0.289 Sum_probs=7.9
Q ss_pred HHHHHhccccccccccccCcc
Q 020721 24 LCVIASINEVRFDGLLKFGRC 44 (322)
Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~ 44 (322)
+.+++|+..+.+|..-+ ||
T Consensus 23 imliif~f~le~qdl~e--pc 41 (43)
T PF11395_consen 23 IMLIIFWFSLEIQDLNE--PC 41 (43)
T ss_pred HHHHHHHHHHhhhhhcc--cc
Confidence 33334334444444333 55
No 66
>PF13704 Glyco_tranf_2_4: Glycosyl transferase family 2
Probab=29.38 E-value=2.4e+02 Score=20.94 Aligned_cols=47 Identities=13% Similarity=0.116 Sum_probs=27.9
Q ss_pred CCeEEeccCCcCCc-hHHHHHHHhhhhHhcCCCCCCCCceEEEEecCceEeeHHH
Q 020721 133 DDIIILNCKENMNK-GKTYTYFSSLPEIFNTSDRPYPPYHYVMKGDDDTYIRLEN 186 (322)
Q Consensus 133 ~DIv~ld~~D~y~n-~Kt~~~~~w~~~~~~~~~~~c~~~~yvlK~DDD~fv~~~~ 186 (322)
.++-...+...+.. .....+...+.+.. ..++|++-+|-|=|+.++.
T Consensus 41 ~~v~i~~~~~~~~~~~~~~~~~~~~~~~~-------~~~dWvl~~D~DEfl~~~~ 88 (97)
T PF13704_consen 41 PGVGIIRWVDPYRDERRQRAWRNALIERA-------FDADWVLFLDADEFLVPPP 88 (97)
T ss_pred CCcEEEEeCCCccchHHHHHHHHHHHHhC-------CCCCEEEEEeeeEEEecCC
Confidence 44555555556643 22233333333321 2679999999999988764
No 67
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=29.23 E-value=2.6e+02 Score=21.24 Aligned_cols=73 Identities=25% Similarity=0.249 Sum_probs=48.3
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC---CCCCeEEEEeecCCCCCCCCcccccceeeeCHHHHHHHHHcCCCCCCCCCc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL---PREDLYYGYVIPCRSMDPFVDYMSGMGYLVSWDIAEWIRDSDIPKNHLEGP 245 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~---~~~~~y~G~~~p~r~~~~yp~y~~G~gYvlS~d~~~~i~~~~~~~~~~~~~ 245 (322)
+.+|++-+|+|..+..+.+...+... +...+..|. +++++++++.+.+..... ......
T Consensus 77 ~~d~v~~~d~D~~~~~~~~~~~~~~~~~~~~~~~v~~~----------------~~~~~~~~~~~~~~~~~~--~~~~~~ 138 (156)
T cd00761 77 RGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGP----------------GNLLFRRELLEEIGGFDE--ALLSGE 138 (156)
T ss_pred cCCEEEEECCCCccCccHHHHHHHHHhcCCCceEEecc----------------chheeeHHHHHHhCCcch--HhcCCc
Confidence 56999999999999888887763322 111221111 889999999998865432 122237
Q ss_pred hhHHHHHHHHhcCC
Q 020721 246 EDKVFGAWIREGRR 259 (322)
Q Consensus 246 EDv~vG~~l~~~~~ 259 (322)
||..++..+...|.
T Consensus 139 ed~~~~~~~~~~g~ 152 (156)
T cd00761 139 EDDDFLLRLLRGGK 152 (156)
T ss_pred chHHHHHHHHhhcc
Confidence 88888877766553
No 68
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=25.43 E-value=9.3e+02 Score=26.45 Aligned_cols=87 Identities=11% Similarity=-0.030 Sum_probs=54.5
Q ss_pred CceEEEEecCceEeeHHHHHHHhc---CCCCCCeEEEEe-----------------ecCCC----------CCC-CCccc
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLV---PLPREDLYYGYV-----------------IPCRS----------MDP-FVDYM 217 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~---~~~~~~~y~G~~-----------------~p~r~----------~~~-yp~y~ 217 (322)
+.+|++-.|.|..+..+-|.+.+. ..|+-.+.-|.. .+... .+. --.++
T Consensus 339 ~GEyIavlDAD~ip~pdfL~~~V~~f~~dP~VglVQtp~~f~n~dp~~rnl~~~~~~~~e~~~fy~~iq~g~~~~~a~~~ 418 (852)
T PRK11498 339 KGEFVAIFDCDHVPTRSFLQMTMGWFLKDKKLAMMQTPHHFFSPDPFERNLGRFRKTPNEGTLFYGLVQDGNDMWDATFF 418 (852)
T ss_pred CCCEEEEECCCCCCChHHHHHHHHHHHhCCCeEEEEcceeccCCchHHHhhHHHhhcccchhHHHHHHHhHHHhhccccc
Confidence 569999999999998877776543 223211111110 00000 000 02457
Q ss_pred ccceeeeCHHHHHHHHHcCCCCCCCCCchhHHHHHHHHhcCC
Q 020721 218 SGMGYLVSWDIAEWIRDSDIPKNHLEGPEDKVFGAWIREGRR 259 (322)
Q Consensus 218 ~G~gYvlS~d~~~~i~~~~~~~~~~~~~EDv~vG~~l~~~~~ 259 (322)
.|++.++.+++++.+.--.. ....||..++.-+...|-
T Consensus 419 ~Gs~aviRReaLeeVGGfd~----~titED~dlslRL~~~Gy 456 (852)
T PRK11498 419 CGSCAVIRRKPLDEIGGIAV----ETVTEDAHTSLRLHRRGY 456 (852)
T ss_pred ccceeeeEHHHHHHhcCCCC----CccCccHHHHHHHHHcCC
Confidence 89999999999999865331 124799999999987764
No 69
>KOG2547 consensus Ceramide glucosyltransferase [Lipid transport and metabolism; Cell wall/membrane/envelope biogenesis]
Probab=23.69 E-value=2.5e+02 Score=27.71 Aligned_cols=26 Identities=12% Similarity=0.429 Sum_probs=22.8
Q ss_pred CceEEEEecCceEeeHHHHHHHhcCC
Q 020721 169 PYHYVMKGDDDTYIRLENLVKSLVPL 194 (322)
Q Consensus 169 ~~~yvlK~DDD~fv~~~~L~~~L~~~ 194 (322)
+++||+-.|||+++..+.+.+.-..+
T Consensus 170 ~ydlvlisDsgI~m~pdtildm~t~M 195 (431)
T KOG2547|consen 170 KYDLVLISDSGIFMKPDTILDMATTM 195 (431)
T ss_pred cCCEEEEecCCeeecCchHHHHHHhh
Confidence 56899999999999999999877654
No 70
>cd04459 Rho_CSD Rho_CSD: Rho protein cold-shock domain (CSD). Rho protein is a transcription termination factor in most bacteria. In bacteria, there are two distinct mechanisms for mRNA transcription termination. In intrinsic termination, RNA polymerase and nascent mRNA are released from DNA template by an mRNA stem loop structure, which resembles the transcription termination mechanism used by eukaryotic pol III. The second mechanism is mediated by Rho factor. Rho factor terminates transcription by using energy from ATP hydrolysis to forcibly dissociate the transcripts from RNA polymerase. Rho protein contains an N-terminal S1-like domain, which binds single-stranded RNA. Rho has a C-terminal ATPase domain which hydrolyzes ATP to provide energy to strip RNA polymerase and mRNA from the DNA template. Rho functions as a homohexamer.
Probab=23.20 E-value=1.5e+02 Score=21.58 Aligned_cols=39 Identities=21% Similarity=0.375 Sum_probs=27.5
Q ss_pred EEecCceEeeHHHHHHHhcCCCCCCeEEEEeecCCCCCCCC
Q 020721 174 MKGDDDTYIRLENLVKSLVPLPREDLYYGYVIPCRSMDPFV 214 (322)
Q Consensus 174 lK~DDD~fv~~~~L~~~L~~~~~~~~y~G~~~p~r~~~~yp 214 (322)
+...||+||....+.++= +-+-+...|.+.+.+..+.|+
T Consensus 23 ~~~~~DvyVs~~~Irr~~--LR~GD~V~G~vr~p~~~ek~~ 61 (68)
T cd04459 23 LPGPDDIYVSPSQIRRFN--LRTGDTVVGQIRPPKEGERYF 61 (68)
T ss_pred CCCCCCEEECHHHHHHhC--CCCCCEEEEEEeCCCCCCCcc
Confidence 678999999998777652 223478889887655555553
No 71
>PF08525 OapA_N: Opacity-associated protein A N-terminal motif; InterPro: IPR013731 This domain is found in the Haemophilus influenzae opacity-associated protein (OapA). It is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [, ]. This motif occurs at the N terminus of these proteins. It contains a conserved histidine followed by a run of hydrophobic residues. Many of the proteins in this entry are unassigned peptidases belonging to MEROPS peptidase family M23B.
Probab=21.97 E-value=88 Score=18.93 Aligned_cols=17 Identities=6% Similarity=0.225 Sum_probs=7.1
Q ss_pred CccchhHHHHHHHHHHH
Q 020721 8 PHQRQFIVWSFFFILFL 24 (322)
Q Consensus 8 ~~~~~~~~~~~~~i~~~ 24 (322)
|+..|.++..+..+.++
T Consensus 7 P~~Hr~~l~~l~~v~l~ 23 (30)
T PF08525_consen 7 PKLHRRALIALSAVVLV 23 (30)
T ss_pred CHHHHHHHHHHHHHHHH
Confidence 33344444444444444
No 72
>COG4425 Predicted membrane protein [Function unknown]
Probab=20.58 E-value=7.5e+02 Score=25.20 Aligned_cols=78 Identities=17% Similarity=0.143 Sum_probs=45.5
Q ss_pred ecCCCCCCeeEEEEEECCCCCHHHHH-HHHHHhccCCCCCCceEEEEEecCCCchhH--HHHHHHHHhhCCCeEEeccCC
Q 020721 66 ATNSSSEDIRILIGILTLPDQYHRRH-FLRMIYGTQSPTGAQVDVKFVFCNLTKEDQ--KVLVALEIMRYDDIIILNCKE 142 (322)
Q Consensus 66 ~~~~~~~~~~lli~V~S~~~~~~rR~-aIR~TW~~~~~~~~~v~~~Fvlg~~~~~~~--~~~l~~E~~~~~DIv~ld~~D 142 (322)
..+...+++++++|..|+..-.+|-+ ++||-= +. .|.+=+++-|+-.+....+ .++=..|-...||+..+-+.=
T Consensus 281 t~t~a~ePiRvy~Glnsa~s~~~ra~Lal~El~-Rt--Gg~rR~vlvVv~~TGTGWIdp~a~~t~EyL~~Gd~asVsmQY 357 (588)
T COG4425 281 TGTPAVEPIRVYAGLNSADSPEERAELALRELK-RT--GGLRRKVLVVVTSTGTGWIDPAAADTLEYLYNGDVASVSMQY 357 (588)
T ss_pred cCCCCCCceeEEecccccCCHHHHHHHHHHHHH-Hh--cCcceeEEEEEcCCCCCCCCHHHHhHHHHHhCCceEEEEEeh
Confidence 34445778999999999988766655 444432 11 1112123333322222111 123356889999999988887
Q ss_pred cCCc
Q 020721 143 NMNK 146 (322)
Q Consensus 143 ~y~n 146 (322)
+|-.
T Consensus 358 SyL~ 361 (588)
T COG4425 358 SYLP 361 (588)
T ss_pred hhHH
Confidence 7754
Done!