Query 015921
Match_columns 398
No_of_seqs 28 out of 30
Neff 2.9
Searched_HMMs 46136
Date Fri Mar 29 02:09:18 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/015921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/015921hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00505 Glyco_transf_8 Members 99.3 4.2E-11 9.1E-16 109.0 12.8 193 142-365 17-242 (246)
2 cd02537 GT8_Glycogenin Glycoge 99.2 5.7E-11 1.2E-15 109.0 11.6 188 140-365 15-210 (240)
3 cd04194 GT8_A4GalT_like A4GalT 99.1 4.9E-10 1.1E-14 101.3 9.2 194 141-365 15-244 (248)
4 PF01501 Glyco_transf_8: Glyco 99.0 4.5E-11 9.7E-16 103.8 0.4 138 197-365 85-244 (250)
5 PF03407 Nucleotid_trans: Nucl 98.8 2.6E-08 5.6E-13 88.2 9.7 99 215-337 67-172 (212)
6 cd06431 GT8_LARGE_C LARGE cata 98.8 1.6E-07 3.4E-12 89.9 15.1 210 142-377 17-269 (280)
7 cd06430 GT8_like_2 GT8_like_2 98.7 2.6E-07 5.7E-12 90.6 14.1 176 154-365 29-257 (304)
8 cd06429 GT8_like_1 GT8_like_1 98.6 7.1E-07 1.5E-11 84.7 13.9 188 139-366 12-239 (257)
9 cd06914 GT8_GNT1 GNT1 is a fun 98.6 7.1E-07 1.5E-11 86.4 13.8 187 143-363 18-232 (278)
10 PLN00176 galactinol synthase 98.5 1.8E-06 3.9E-11 85.6 13.7 196 142-379 40-295 (333)
11 PRK15171 lipopolysaccharide 1, 98.4 8.8E-07 1.9E-11 86.4 9.6 206 130-366 29-269 (334)
12 cd06432 GT8_HUGT1_C_like The C 98.1 1.8E-05 3.9E-10 74.4 9.9 181 142-360 17-237 (248)
13 PLN02769 Probable galacturonos 98.1 1.7E-05 3.6E-10 84.7 9.7 159 193-387 437-627 (629)
14 COG1442 RfaJ Lipopolysaccharid 98.0 4.4E-05 9.6E-10 75.7 9.9 189 143-375 19-252 (325)
15 PLN02659 Probable galacturonos 97.8 8.3E-05 1.8E-09 78.2 8.7 149 214-389 344-532 (534)
16 PLN02867 Probable galacturonos 97.7 0.00018 3.9E-09 75.8 10.1 167 193-389 327-534 (535)
17 PLN02870 Probable galacturonos 97.6 0.00018 3.8E-09 75.8 8.8 166 193-389 328-531 (533)
18 PLN02910 polygalacturonate 4-a 97.5 0.00037 7.9E-09 74.8 8.8 162 192-388 459-655 (657)
19 PF05637 Glyco_transf_34: gala 97.4 0.00012 2.6E-09 68.8 3.6 142 197-365 64-234 (239)
20 PLN02829 Probable galacturonos 97.4 0.00046 1E-08 74.0 8.2 48 214-261 457-528 (639)
21 PLN02718 Probable galacturonos 97.2 0.0013 2.9E-08 70.3 9.5 153 196-388 403-601 (603)
22 PLN02742 Probable galacturonos 97.2 0.00093 2E-08 70.6 7.6 141 214-386 353-531 (534)
23 PLN02523 galacturonosyltransfe 97.2 0.0027 5.8E-08 67.5 10.7 148 214-387 377-556 (559)
24 PF03414 Glyco_transf_6: Glyco 94.2 0.36 7.8E-06 49.1 10.2 100 124-232 99-207 (337)
25 cd02515 Glyco_transf_6 Glycosy 92.0 1.7 3.7E-05 43.3 10.8 200 130-365 39-265 (271)
26 PLN03181 glycosyltransferase; 86.3 2.5 5.4E-05 44.7 7.6 100 197-306 186-308 (453)
27 PF05704 Caps_synth: Capsular 83.8 3.4 7.4E-05 40.2 6.8 119 135-282 57-186 (276)
28 PF03314 DUF273: Protein of un 82.0 2.1 4.6E-05 41.6 4.6 100 186-303 25-125 (222)
29 cd01833 XynB_like SGNH_hydrola 73.8 24 0.00053 29.4 8.1 74 133-228 58-131 (157)
30 PF13641 Glyco_tranf_2_3: Glyc 62.1 25 0.00054 30.6 6.1 78 139-229 15-102 (228)
31 PF04488 Gly_transf_sug: Glyco 57.1 6.1 0.00013 32.2 1.4 23 143-165 6-28 (103)
32 COG5603 TRS20 Subunit of TRAPP 56.1 19 0.00042 32.7 4.5 35 252-287 75-109 (136)
33 PLN03182 xyloglucan 6-xylosylt 54.8 39 0.00086 35.9 7.1 98 197-304 185-305 (429)
34 cd01841 NnaC_like NnaC (CMP-Ne 54.6 42 0.0009 28.5 6.1 75 133-229 69-143 (174)
35 PF04765 DUF616: Protein of un 52.5 30 0.00066 35.1 5.6 129 153-303 90-279 (305)
36 cd02510 pp-GalNAc-T pp-GalNAc- 52.1 60 0.0013 30.4 7.2 75 139-229 12-99 (299)
37 PRK15383 type III secretion sy 49.8 21 0.00045 36.2 4.0 64 213-285 218-282 (335)
38 PRK15382 non-LEE encoded effec 49.6 22 0.00047 36.1 4.0 65 212-285 209-274 (326)
39 cd06439 CESA_like_1 CESA_like_ 47.9 2E+02 0.0043 25.6 9.5 78 143-236 46-133 (251)
40 cd02520 Glucosylceramide_synth 47.0 1.1E+02 0.0025 26.5 7.7 24 145-168 20-44 (196)
41 PF00535 Glycos_transf_2: Glyc 47.0 1.4E+02 0.0031 23.5 7.9 79 142-236 14-102 (169)
42 cd01834 SGNH_hydrolase_like_2 46.7 1E+02 0.0023 25.9 7.2 76 134-228 84-159 (191)
43 PRK15384 type III secretion sy 46.6 24 0.00052 35.8 3.9 64 213-285 215-279 (336)
44 cd06182 CYPOR_like NADPH cytoc 45.6 1.7E+02 0.0036 28.0 9.2 27 221-247 160-186 (267)
45 cd04186 GT_2_like_c Subfamily 44.2 1.6E+02 0.0034 23.6 7.6 79 140-234 11-96 (166)
46 cd04179 DPM_DPG-synthase_like 44.0 1.4E+02 0.003 24.8 7.5 21 215-235 81-102 (185)
47 cd04192 GT_2_like_e Subfamily 41.0 1.2E+02 0.0026 26.0 6.8 16 214-229 83-98 (229)
48 cd02526 GT2_RfbF_like RfbF is 40.8 1.2E+02 0.0027 26.4 7.0 77 141-233 13-96 (237)
49 cd01836 FeeA_FeeB_like SGNH_hy 39.5 1.5E+02 0.0033 25.5 7.3 28 134-161 86-113 (191)
50 PRK13889 conjugal transfer rel 36.7 1E+02 0.0022 35.9 7.2 87 174-283 87-173 (988)
51 cd00761 Glyco_tranf_GTA_type G 36.7 1.3E+02 0.0028 22.9 5.8 73 143-231 14-95 (156)
52 PRK11204 N-glycosyltransferase 35.9 1.2E+02 0.0025 29.9 6.7 65 149-229 78-150 (420)
53 PRK13826 Dtr system oriT relax 34.6 1.2E+02 0.0025 35.9 7.4 54 174-247 97-150 (1102)
54 PF10465 Inhibitor_I24: PinA p 34.2 31 0.00067 31.6 2.3 18 186-204 122-139 (140)
55 PF11051 Mannosyl_trans3: Mann 34.0 36 0.00079 32.7 2.9 55 212-280 89-143 (271)
56 cd01832 SGNH_hydrolase_like_1 33.9 2.3E+02 0.005 24.1 7.4 65 134-222 87-151 (185)
57 TIGR00282 metallophosphoestera 33.8 55 0.0012 32.2 4.1 18 118-135 57-74 (266)
58 cd04190 Chitin_synth_C C-termi 33.8 1.2E+02 0.0025 27.9 6.0 15 215-229 75-89 (244)
59 cd04187 DPM1_like_bac Bacteria 33.8 2.6E+02 0.0057 23.5 7.7 79 140-235 15-103 (181)
60 TIGR03472 HpnI hopanoid biosyn 33.6 1.3E+02 0.0027 29.8 6.6 16 214-229 127-142 (373)
61 KOG1928 Alpha-1,4-N-acetylgluc 33.3 1.3E+02 0.0029 32.0 6.9 119 145-286 150-298 (409)
62 TIGR01618 phage_P_loop phage n 33.3 1E+02 0.0023 29.5 5.7 64 153-236 33-96 (220)
63 PRK10964 ADP-heptose:LPS hepto 33.1 1.2E+02 0.0026 28.9 6.2 43 138-180 13-58 (322)
64 PF13774 Longin: Regulated-SNA 33.1 41 0.00088 26.4 2.6 32 255-286 31-64 (83)
65 cd04506 SGNH_hydrolase_YpmR_li 33.0 1.8E+02 0.0039 25.4 6.8 26 136-161 103-128 (204)
66 cd01844 SGNH_hydrolase_like_6 32.1 2.6E+02 0.0056 24.1 7.6 29 137-165 76-104 (177)
67 PLN03180 reversibly glycosylat 29.7 25 0.00054 36.4 1.1 108 136-265 21-152 (346)
68 PRK14583 hmsR N-glycosyltransf 29.5 1.4E+02 0.0031 30.2 6.4 68 146-229 96-171 (444)
69 TIGR02193 heptsyl_trn_I lipopo 28.5 1.7E+02 0.0036 27.7 6.2 43 139-181 14-58 (319)
70 cd01820 PAF_acetylesterase_lik 27.2 3E+02 0.0065 24.6 7.4 28 134-161 108-135 (214)
71 cd06438 EpsO_like EpsO protein 27.2 2.5E+02 0.0054 24.0 6.6 17 214-230 82-98 (183)
72 KOG3487 TRAPP 20 K subunit [In 27.2 75 0.0016 29.3 3.5 40 255-295 79-118 (139)
73 cd06435 CESA_NdvC_like NdvC_li 27.1 3.3E+02 0.0071 23.9 7.5 29 141-169 14-43 (236)
74 COG0369 CysJ Sulfite reductase 26.8 7E+02 0.015 27.6 11.2 143 123-280 390-559 (587)
75 cd01828 sialate_O-acetylestera 26.5 2.2E+02 0.0049 24.0 6.1 69 134-227 67-135 (169)
76 cd04184 GT2_RfbC_Mx_like Myxoc 26.5 2E+02 0.0043 24.3 5.8 17 324-340 185-201 (202)
77 PF00072 Response_reg: Respons 26.2 85 0.0018 24.0 3.2 24 143-166 60-83 (112)
78 PF00871 Acetate_kinase: Aceto 24.9 37 0.00081 34.9 1.3 91 115-210 91-194 (388)
79 TIGR03111 glyc2_xrt_Gpos1 puta 24.9 2.4E+02 0.0053 28.7 7.1 22 325-346 252-273 (439)
80 PF13358 DDE_3: DDE superfamil 24.5 92 0.002 24.9 3.2 59 144-206 68-136 (146)
81 TIGR01556 rhamnosyltran L-rham 24.2 3.3E+02 0.0071 25.1 7.2 73 141-229 10-89 (281)
82 PRK10422 lipopolysaccharide co 23.1 3.5E+02 0.0077 26.3 7.5 40 141-180 22-63 (352)
83 TIGR02768 TraA_Ti Ti-type conj 22.8 1.9E+02 0.0041 32.2 6.2 84 175-283 88-181 (744)
84 PTZ00138 small nuclear ribonuc 22.2 73 0.0016 27.0 2.3 67 185-265 8-81 (89)
85 PF08331 DUF1730: Domain of un 21.7 2.8E+02 0.0062 22.1 5.5 65 149-222 5-78 (78)
86 cd01838 Isoamyl_acetate_hydrol 21.2 3.3E+02 0.0071 23.1 6.1 76 135-223 88-163 (199)
87 cd06423 CESA_like CESA_like is 21.1 2.8E+02 0.0061 21.6 5.3 16 214-229 79-94 (180)
88 COG5597 Alpha-N-acetylglucosam 20.2 37 0.00081 35.3 0.2 23 213-235 167-189 (368)
No 1
>cd00505 Glyco_transf_8 Members of glycosyltransferase family 8 (GT-8) are involved in lipopolysaccharide biosynthesis and glycogen synthesis. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. GT-8 comprises enzymes with a number of known activities: lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase, and N-acetylglucosaminyltransferase. GT-8 enzymes contains a conserved DXD motif which is essential in the coordination of a catalytic divalent cation, most commonly Mn2+.
Probab=99.28 E-value=4.2e-11 Score=109.03 Aligned_cols=193 Identities=23% Similarity=0.297 Sum_probs=119.0
Q ss_pred HHHHHHHHhhCC-CccEEEEeCCCCCC---C------CCCCceEEEecCCccCchhH---HHHHHHHHHHHHHHHHHhhc
Q 015921 142 NVFINFIQVTMP-KSDVFILTDPASDL---S------MPRKGVTIYPIHGEYSRDKL---MLQRIRSYITFLERRIREHS 208 (398)
Q Consensus 142 n~Fi~sIrvsmP-~s~VVILTD~~T~~---s------l~~~~~~v~pi~g~YSR~~L---mlQRi~sYi~fLkT~i~e~~ 208 (398)
-+-+.||...++ .-++.||||.-++- . ..+.++++++++.......- ..-...+|.|+. |.++-
T Consensus 17 ~v~i~Sl~~~~~~~~~~~il~~~is~~~~~~L~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~y~RL~---i~~ll 93 (246)
T cd00505 17 IVLMKSVLRHRTKPLRFHVLTNPLSDTFKAALDNLRKLYNFNYELIPVDILDSVDSEHLKRPIKIVTLTKLH---LPNLV 93 (246)
T ss_pred HHHHHHHHHhCCCCeEEEEEEccccHHHHHHHHHHHhccCceEEEEeccccCcchhhhhcCccccceeHHHH---HHHHh
Confidence 455778877776 46899999875531 1 11235778888532221100 011223344443 33331
Q ss_pred cCCCccccEEEeecCeeeecCchhhhccC-CCceEEEe-----------ecc-----CCCCcCcceEEEEecCCCchhHH
Q 015921 209 QGQGHINHYVFTDSDIAVVDDLGHIFHDY-QNFHLALT-----------FRN-----NKDQPLNSGFIAVRGTPDGISRA 271 (398)
Q Consensus 209 ~~~~~~~dfIFiDsDi~VvDdLg~IFk~~-PdFHvALT-----------fRn-----nkdqPiNSGfIfVRgt~d~~~kA 271 (398)
. ..+.+|++|+|++|.+||.++|+.- .+..+|.. ++. ..+..+||||+.+.=.. . +-
T Consensus 94 p---~~~kvlYLD~D~iv~~di~~L~~~~l~~~~~aav~d~~~~~~~~~~~~~~~~~~~~~yfNsGVmlinl~~--~-r~ 167 (246)
T cd00505 94 P---DYDKILYVDADILVLTDIDELWDTPLGGQELAAAPDPGDRREGKYYRQKRSHLAGPDYFNSGVFVVNLSK--E-RR 167 (246)
T ss_pred h---ccCeEEEEcCCeeeccCHHHHhhccCCCCeEEEccCchhhhccchhhcccCCCCCCCceeeeeEEEechH--H-HH
Confidence 1 3678999999999999999999972 34555552 000 12457999999994333 2 23
Q ss_pred HHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCC-CCc--c
Q 015921 272 KIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEG-AGQ--F 348 (398)
Q Consensus 272 ~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpeg-aGQ--f 348 (398)
.++++++++.|... . .+-..+||-+|+.+...+| ..+..|||. ||+.+... ... +
T Consensus 168 ~~~~~~~~~~~~~~-~-~~~~~~DQd~LN~~~~~~~-------------------~~i~~L~~~-wN~~~~~~~~~~~~~ 225 (246)
T cd00505 168 NQLLKVALEKWLQS-L-SSLSGGDQDLLNTFFKQVP-------------------FIVKSLPCI-WNVRLTGCYRSLNCF 225 (246)
T ss_pred HHHHHHHHHHHHhh-c-ccCccCCcHHHHHHHhcCC-------------------CeEEECCCe-eeEEecCccccccch
Confidence 56677777766543 1 2457899999999975332 478999986 99985311 110 0
Q ss_pred ccCCcceeEEeeecCch
Q 015921 349 HGMPLDVKVVHFKGSRK 365 (398)
Q Consensus 349 hgmP~dvKVlHFKGsRK 365 (398)
.-...+.|||||-|+.|
T Consensus 226 ~~~~~~~~iiHy~g~~K 242 (246)
T cd00505 226 KAFVKNAKVIHFNGPTK 242 (246)
T ss_pred hhhcCCCEEEEeCCCCC
Confidence 11344899999999866
No 2
>cd02537 GT8_Glycogenin Glycogenin belongs the GT 8 family and initiates the biosynthesis of glycogen. Glycogenin initiates the biosynthesis of glycogen by incorporating glucose residues through a self-glucosylation reaction at a Tyr residue, and then acts as substrate for chain elongation by glycogen synthase and branching enzyme. It contains a conserved DxD motif and an N-terminal beta-alpha-beta Rossmann-like fold that are common to the nucleotide-binding domains of most glycosyltransferases. The DxD motif is essential for coordination of the catalytic divalent cation, most commonly Mn2+. Glycogenin can be classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. It is placed in glycosyltransferase family 8 which includes lipopolysaccharide glucose and galactose transferases and galactinol synthases.
Probab=99.25 E-value=5.7e-11 Score=108.97 Aligned_cols=188 Identities=20% Similarity=0.175 Sum_probs=113.3
Q ss_pred HHHHHHHHHHhhCCCccEEEEeCCC-CCC---CCCCCceEEEecCCccCchhHHH--HHHHHHHHHHHHHHHhhccCCCc
Q 015921 140 ILNVFINFIQVTMPKSDVFILTDPA-SDL---SMPRKGVTIYPIHGEYSRDKLML--QRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 140 vln~Fi~sIrvsmP~s~VVILTD~~-T~~---sl~~~~~~v~pi~g~YSR~~Lml--QRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
-+.+.+.||+..+|+.+++||.++. ++. .+....+++.+++. ........ ++-.+-..|.|-.+-++ ..
T Consensus 15 ~a~vl~~SL~~~~~~~~~~vl~~~~is~~~~~~L~~~~~~~~~v~~-i~~~~~~~~~~~~~~~~~~~kl~~~~l----~~ 89 (240)
T cd02537 15 GALVLGYSLRKVGSSYDLVVLVTPGVSEESREALEEVGWIVREVEP-IDPPDSANLLKRPRFKDTYTKLRLWNL----TE 89 (240)
T ss_pred HHHHHHHHHHhcCCCCCEEEEECCCCCHHHHHHHHHcCCEEEecCc-cCCcchhhhccchHHHHHhHHHHhccc----cc
Confidence 4556788999999999998887753 321 11222344555442 11111110 12223345555444443 46
Q ss_pred cccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhh
Q 015921 214 INHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRML 293 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~ml 293 (398)
.+.+|++|+|++|..||.++|+. ++. ++-.........+||||+..+-. .+.+++..+...+.. +-..
T Consensus 90 ~drvlylD~D~~v~~~i~~Lf~~-~~~-~~a~~d~~~~~~fNsGv~l~~~~-------~~~~~~~~~~~~~~~---~~~~ 157 (240)
T cd02537 90 YDKVVFLDADTLVLRNIDELFDL-PGE-FAAAPDCGWPDLFNSGVFVLKPS-------EETFNDLLDALQDTP---SFDG 157 (240)
T ss_pred cceEEEEeCCeeEccCHHHHhCC-CCc-eeeecccCccccccceEEEEcCC-------HHHHHHHHHHHhccC---CCCC
Confidence 78999999999999999999998 333 33333212245799999999432 234444444333221 1246
Q ss_pred cchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCCCCcc--ccCCcceeEEeeecCch
Q 015921 294 GDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQF--HGMPLDVKVVHFKGSRK 365 (398)
Q Consensus 294 GDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpegaGQf--hgmP~dvKVlHFKGsRK 365 (398)
+||-.|+.+...++ .+..|||. ||+..+-....- .-...+.||+||.|..|
T Consensus 158 ~DQdiLN~~~~~~~--------------------~~~~l~~~-yN~~~~~~~~~~~~~~~~~~~~iiHf~g~~K 210 (240)
T cd02537 158 GDQGLLNSYFSDRG--------------------IWKRLPFT-YNALKPLRYLHPEALWFGDEIKVVHFIGGDK 210 (240)
T ss_pred CCHHHHHHHHcCCC--------------------CEeECCcc-eeeehhhhccCchhhcccCCcEEEEEeCCCC
Confidence 89999999975332 57889998 998743221000 11234789999999755
No 3
>cd04194 GT8_A4GalT_like A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface. The members of this family of glycosyltransferases catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface. The enzymes exhibit broad substrate specificities. The known functions found in this family include: Alpha-1,4-galactosyltransferase, LOS-alpha-1,3-D-galactosyltransferase, UDP-glucose:(galactosyl) LPS alpha1,2-glucosyltransferase, UDP-galactose: (glucosyl) LPS alpha1,2-galactosyltransferase, and UDP-glucose:(glucosyl) LPS alpha1,2-glucosyltransferase. Alpha-1,4-galactosyltransferase from N. meningitidis adds an alpha-galactose from UDP-Gal (the donor) to a terminal lactose (the acceptor) of the LOS structure of outer membrane. LOSs are virulence factors that enable the organism to evade the immune sys
Probab=99.08 E-value=4.9e-10 Score=101.30 Aligned_cols=194 Identities=22% Similarity=0.244 Sum_probs=116.6
Q ss_pred HHHHHHHHHhhCC--CccEEEEeCCCCCCCC---------CCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhcc
Q 015921 141 LNVFINFIQVTMP--KSDVFILTDPASDLSM---------PRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQ 209 (398)
Q Consensus 141 ln~Fi~sIrvsmP--~s~VVILTD~~T~~sl---------~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~ 209 (398)
+-+-+.||...+| ..++.||||.-++... .+..+++++++...-......++.-+...|.+--+.++-
T Consensus 15 ~~~~l~Sl~~~~~~~~~~~~il~~~is~~~~~~L~~~~~~~~~~i~~~~i~~~~~~~~~~~~~~~~~~~y~rl~l~~ll- 93 (248)
T cd04194 15 LAVTIKSILANNSKRDYDFYILNDDISEENKKKLKELLKKYNSSIEFIKIDNDDFKFFPATTDHISYATYYRLLIPDLL- 93 (248)
T ss_pred HHHHHHHHHhcCCCCceEEEEEeCCCCHHHHHHHHHHHHhcCCeEEEEEcCHHHHhcCCcccccccHHHHHHHHHHHHh-
Confidence 4466889988888 5699999986432111 123566777753211100001111122333333333331
Q ss_pred CCCccccEEEeecCeeeecCchhhhcc-CCCceEEEe--ec-------------cCCCCcCcceEEEEecCCCchhHHHH
Q 015921 210 GQGHINHYVFTDSDIAVVDDLGHIFHD-YQNFHLALT--FR-------------NNKDQPLNSGFIAVRGTPDGISRAKI 273 (398)
Q Consensus 210 ~~~~~~dfIFiDsDi~VvDdLg~IFk~-~PdFHvALT--fR-------------nnkdqPiNSGfIfVRgt~d~~~kA~~ 273 (398)
...+.+|++|+|++|.+||.++|+. .++.-+|.. .. ......+||||+.+ ..+. -+...
T Consensus 94 --~~~~rvlylD~D~lv~~di~~L~~~~~~~~~~aa~~d~~~~~~~~~~~~~~~~~~~~yfNsGv~l~--nl~~-~r~~~ 168 (248)
T cd04194 94 --PDYDKVLYLDADIIVLGDLSELFDIDLGDNLLAAVRDPFIEQEKKRKRRLGGYDDGSYFNSGVLLI--NLKK-WREEN 168 (248)
T ss_pred --cccCEEEEEeCCEEecCCHHHHhcCCcCCCEEEEEecccHHHHHHHHhhcCCCcccceeeecchhe--eHHH-HHHhh
Confidence 1367899999999999999999997 233344442 10 12244699999999 3322 23345
Q ss_pred HHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCCCCccc----
Q 015921 274 FLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFH---- 349 (398)
Q Consensus 274 FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpegaGQfh---- 349 (398)
+.+++++.+...- ..-.++||-+|+.+... .+..|| ..|||.+-...-...
T Consensus 169 ~~~~~~~~~~~~~--~~~~~~DQd~LN~~~~~----------------------~~~~L~-~~~N~~~~~~~~~~~~~~~ 223 (248)
T cd04194 169 ITEKLLELIKEYG--GRLIYPDQDILNAVLKD----------------------KILYLP-PRYNFQTGFYYLLKKKSKE 223 (248)
T ss_pred hHHHHHHHHHhCC--CceeeCChHHHHHHHhC----------------------CeEEcC-cccccchhHhHHhhccchh
Confidence 6666666554431 23468999999998631 278999 559998554332211
Q ss_pred -----cCCcceeEEeeecCch
Q 015921 350 -----GMPLDVKVVHFKGSRK 365 (398)
Q Consensus 350 -----gmP~dvKVlHFKGsRK 365 (398)
-...+.|||||-|+.|
T Consensus 224 ~~~~~~~~~~~~iiHf~g~~K 244 (248)
T cd04194 224 EQELEEARKNPVIIHYTGSDK 244 (248)
T ss_pred HHHHHHHhcCCEEEEeCCCCC
Confidence 1345899999999866
No 4
>PF01501 Glyco_transf_8: Glycosyl transferase family 8; InterPro: IPR002495 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 8 GT8 from CAZY comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase (2.4.1.44 from EC), lipopolysaccharide glucosyltransferase 1 (2.4.1.58 from EC), glycogenin glucosyltransferase (2.4.1.186 from EC), inositol 1-alpha-galactosyltransferase (2.4.1.123 from EC). These enzymes have a distant similarity to family GT_24. ; GO: 0016757 transferase activity, transferring glycosyl groups; PDB: 1LL0_D 1ZCV_A 3USR_A 3V90_A 1ZCU_A 1ZCT_A 3V91_A 1ZCY_A 1ZDG_A 1ZDF_A ....
Probab=99.03 E-value=4.5e-11 Score=103.77 Aligned_cols=138 Identities=24% Similarity=0.317 Sum_probs=78.8
Q ss_pred HHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCC-ceEEEe-e-----------------ccCCCCcCcce
Q 015921 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN-FHLALT-F-----------------RNNKDQPLNSG 257 (398)
Q Consensus 197 i~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~Pd-FHvALT-f-----------------RnnkdqPiNSG 257 (398)
..|.|-.+.++- ...+.+||+|+||+|.+||.++|..-++ --+|.. . .......+|+|
T Consensus 85 ~~~~rl~i~~ll---~~~drilyLD~D~lv~~dl~~lf~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~fNsG 161 (250)
T PF01501_consen 85 ATFARLFIPDLL---PDYDRILYLDADTLVLGDLDELFDLDLQGKYLAAVEDESFDNFPNKRFPFSERKQPGNKPYFNSG 161 (250)
T ss_dssp GGGGGGGHHHHS---TTSSEEEEE-TTEEESS-SHHHHC---TTSSEEEEE----HHHHTSTTSSEEECESTTTTSEEEE
T ss_pred HHHHHhhhHHHH---hhcCeEEEEcCCeeeecChhhhhcccchhhhccccccchhhhhhhcccchhhcccCcccccccCc
Confidence 444444443331 3457899999999999999999985222 112221 1 11335669999
Q ss_pred EEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccc
Q 015921 258 FIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATY 337 (398)
Q Consensus 258 fIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~Y 337 (398)
|+.+ .++.. +...+.+++.+.....- .+-.++||-.|+.+.. ..+..||| .|
T Consensus 162 v~l~--~~~~~-~~~~~~~~~~~~~~~~~--~~~~~~DQ~~ln~~~~----------------------~~~~~L~~-~~ 213 (250)
T PF01501_consen 162 VMLF--NPSKW-RKENILQKLIEWLEQNG--MKLGFPDQDILNIVFY----------------------GNIKPLPC-RY 213 (250)
T ss_dssp EEEE--EHHHH-HHHHHHHHHHHHHHHTT--TT-SSCHHHHHHHHHT----------------------TGEEEEEG-GG
T ss_pred EEEE--eechh-hhhhhhhhhhhhhhhcc--cccCcCchHHHhhhcc----------------------ceeEEECc-hh
Confidence 9999 33212 22344444444422221 1346999999999973 56789998 69
Q ss_pred cCCCCCC---CCccccCCcceeEEeeecCch
Q 015921 338 NWTPPEG---AGQFHGMPLDVKVVHFKGSRK 365 (398)
Q Consensus 338 NwtPpeg---aGQfhgmP~dvKVlHFKGsRK 365 (398)
||.+.-. .-.+...-.++||+||-|..|
T Consensus 214 N~~~~~~~~~~~~~~~~~~~~~iiHy~g~~K 244 (250)
T PF01501_consen 214 NCQPSWYNQSDDYFNPILEDAKIIHYSGPPK 244 (250)
T ss_dssp SEEHHHHHHTHHHHHHHGCC-SEEE--SSS-
T ss_pred ccccccccccchhhHhhcCCeEEEEeCCCCc
Confidence 9984322 111233445999999999877
No 5
>PF03407 Nucleotid_trans: Nucleotide-diphospho-sugar transferase; InterPro: IPR005069 Proteins in this family have been been predicted to be nucleotide-diphospho-sugar transferases [].
Probab=98.81 E-value=2.6e-08 Score=88.23 Aligned_cols=99 Identities=24% Similarity=0.339 Sum_probs=77.9
Q ss_pred ccEEEeecCeeeecCchhhhccCCCceEEEeecc-------CCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHH
Q 015921 215 NHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN-------NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM 287 (398)
Q Consensus 215 ~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRn-------nkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~ 287 (398)
-+++|+|.|++...|.-+.| +.++.|+.+.-.. ..+..+|+|++++|.|+ ++++|+++|.+...+.
T Consensus 67 ~~vl~~D~Dvv~~~dp~~~~-~~~~~Di~~~~d~~~~~~~~~~~~~~n~G~~~~r~t~----~~~~~~~~w~~~~~~~-- 139 (212)
T PF03407_consen 67 YDVLFSDADVVWLRDPLPYF-ENPDADILFSSDGWDGTNSDRNGNLVNTGFYYFRPTP----RTIAFLEDWLERMAES-- 139 (212)
T ss_pred CceEEecCCEEEecCcHHhh-ccCCCceEEecCCCcccchhhcCCccccceEEEecCH----HHHHHHHHHHHHHHhC--
Confidence 38999999999999999999 5589999886311 12334799999999999 9999999999977544
Q ss_pred hHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccc
Q 015921 288 NASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATY 337 (398)
Q Consensus 288 ~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~Y 337 (398)
.-..||.+|+++....-... .+.++.+||...+
T Consensus 140 ---~~~~DQ~~~n~~l~~~~~~~--------------~~~~~~~L~~~~f 172 (212)
T PF03407_consen 140 ---PGCWDQQAFNELLREQAARY--------------GGLRVRFLPPSLF 172 (212)
T ss_pred ---CCcchHHHHHHHHHhcccCC--------------cCcEEEEeCHHHe
Confidence 11229999999985432211 8999999999887
No 6
>cd06431 GT8_LARGE_C LARGE catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis. The catalytic domain of LARGE is a putative glycosyltransferase. Mutations of LARGE in mouse and human cause dystroglycanopathies, a disease associated with hypoglycosylation of the membrane protein alpha-dystroglycan (alpha-DG) and consequent loss of extracellular ligand binding. LARGE needs to both physically interact with alpha-dystroglycan and function as a glycosyltransferase in order to stimulate alpha-dystroglycan hyperglycosylation. LARGE localizes to the Golgi apparatus and contains three conserved DxD motifs. While two of the motifs are indispensible for glycosylation function, one is important for localization of th eenzyme. LARGE was originally named because it covers approximately large trunck of genomic DNA, more than 600bp long. The predicted protein structure contains an N-terminal cytoplasmic domain, a transmembrane region, a coiled-coil
Probab=98.79 E-value=1.6e-07 Score=89.86 Aligned_cols=210 Identities=19% Similarity=0.263 Sum_probs=116.4
Q ss_pred HHHHHHHHhhCCC-ccEEEEeCCCCCCCC---------CCCceEEEecCCccCchh-HHHHHHHHHHHHHHHHHHhhccC
Q 015921 142 NVFINFIQVTMPK-SDVFILTDPASDLSM---------PRKGVTIYPIHGEYSRDK-LMLQRIRSYITFLERRIREHSQG 210 (398)
Q Consensus 142 n~Fi~sIrvsmP~-s~VVILTD~~T~~sl---------~~~~~~v~pi~g~YSR~~-LmlQRi~sYi~fLkT~i~e~~~~ 210 (398)
.+.|.||-..+.. -++=|++|..++-.+ .+-.+.++.+...-++-. +.-.++.+|.+|..-.|.++-
T Consensus 17 ~~~i~Sil~n~~~~~~fhii~d~~s~~~~~~l~~~~~~~~~~i~f~~i~~~~~~~~~~~~~~~s~~y~y~RL~ip~ll-- 94 (280)
T cd06431 17 VTLVKSVLFYRRNPLHFHLITDEIARRILATLFQTWMVPAVEVSFYNAEELKSRVSWIPNKHYSGIYGLMKLVLTEAL-- 94 (280)
T ss_pred HHHHHHHHHcCCCCEEEEEEECCcCHHHHHHHHHhccccCcEEEEEEhHHhhhhhccCcccchhhHHHHHHHHHHHhc--
Confidence 4456666333322 488899997764331 122445666531111100 001123334445555555542
Q ss_pred CCccccEEEeecCeeeecCchhhhccC---CCce-EEE-e-----e-----c-----cCCCCcCcceEEEEecCCCchhH
Q 015921 211 QGHINHYVFTDSDIAVVDDLGHIFHDY---QNFH-LAL-T-----F-----R-----NNKDQPLNSGFIAVRGTPDGISR 270 (398)
Q Consensus 211 ~~~~~dfIFiDsDi~VvDdLg~IFk~~---PdFH-vAL-T-----f-----R-----nnkdqPiNSGfIfVRgt~d~~~k 270 (398)
++..+-.|++|||++|.+||.++|+.. -+-. +|. + | + .+.+..+||||+.. .-+. -+
T Consensus 95 p~~~dkvLYLD~Diiv~~di~eL~~~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~yFNsGVmli--nL~~-wR 171 (280)
T cd06431 95 PSDLEKVIVLDTDITFATDIAELWKIFHKFTGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL--DLDK-LR 171 (280)
T ss_pred hhhcCEEEEEcCCEEEcCCHHHHHHHhhhcCCCcEEEEeccchhhhhhhhhhccCCCcccccceeeeeeee--eHHH-HH
Confidence 123678999999999999999999862 1111 222 0 1 1 11134799999999 3321 22
Q ss_pred HHHHHHHHHHHHHHhHHh--HhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCCCC--
Q 015921 271 AKIFLEEVLRVYSSKYMN--ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAG-- 346 (398)
Q Consensus 271 A~~FLeeV~eiY~sky~~--a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpegaG-- 346 (398)
...+-|+|.+ +.+++.+ .+-.++||=+||-+.+.+|. .++.|||. |||.+....-
T Consensus 172 ~~~~~~~~~~-~~~~~~~~~~~~~~~DQDiLN~v~~~~~~-------------------~~~~L~~~-wN~~~~~~~~~~ 230 (280)
T cd06431 172 KMKWESMWRL-TAERELMSMLSTSLADQDIFNAVIKQNPF-------------------LVYQLPCA-WNVQLSDHTRSE 230 (280)
T ss_pred hhCHHHHHHH-HHHHHHhhcCCCCcCcHHHHHHHHcCCcc-------------------eeEECCCc-cccccCccchHh
Confidence 3345555432 2223321 22368999999999865544 36789995 8987421110
Q ss_pred ccccCCcceeEEeeecCch--------hhhHHhhhhhcc
Q 015921 347 QFHGMPLDVKVVHFKGSRK--------RLMLESWNFFSS 377 (398)
Q Consensus 347 QfhgmP~dvKVlHFKGsRK--------rlMle~Wnf~~s 377 (398)
.-...+.+.+||||-|+-| ..--+.|..|..
T Consensus 231 ~~~~~~~~p~IIHf~g~~KPW~~~~~~~~~~~~~~~~~~ 269 (280)
T cd06431 231 QCYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLE 269 (280)
T ss_pred HhhcCcCCCEEEEeCCCCCCCCcCCCChHHHHHHHHHHh
Confidence 1112356889999999877 344455666653
No 7
>cd06430 GT8_like_2 GT8_like_2 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
Probab=98.71 E-value=2.6e-07 Score=90.62 Aligned_cols=176 Identities=18% Similarity=0.344 Sum_probs=107.2
Q ss_pred CccEEEEeCCCCCCCCCC----------Cce--EEEecCCccCchh------HHHHHHHHHHHHHHHH-HHhhccCCCcc
Q 015921 154 KSDVFILTDPASDLSMPR----------KGV--TIYPIHGEYSRDK------LMLQRIRSYITFLERR-IREHSQGQGHI 214 (398)
Q Consensus 154 ~s~VVILTD~~T~~sl~~----------~~~--~v~pi~g~YSR~~------LmlQRi~sYi~fLkT~-i~e~~~~~~~~ 214 (398)
.-+.-|+||.....+... ..+ .+++|. +..++ |+. + ..|.|++-.. |++ .
T Consensus 29 ~l~Fhi~~d~~~~~~~~~~l~~~~~~~~~~i~~~i~~I~--~P~~~~~~ws~l~~-~-~~y~RL~ip~lLp~-------~ 97 (304)
T cd06430 29 PLRFHIFAEDQLKQSFKEKLDDWPELIDRKFNYTLHPIT--FPSGNAAEWKKLFK-P-CAAQRLFLPSLLPD-------V 97 (304)
T ss_pred CEEEEEEECCccCHHHHHHHHHHHHhccceeeeEEEEEe--cCccchhhhhhccc-H-HHHHHHHHHHHhhh-------h
Confidence 448889999833333221 223 566663 32222 444 2 4577764433 442 4
Q ss_pred ccEEEeecCeeeecCchhhhccCCCc---eEE-Ee----------eccCC--C----CcCcceEEEEecCCCchhHH---
Q 015921 215 NHYVFTDSDIAVVDDLGHIFHDYQNF---HLA-LT----------FRNNK--D----QPLNSGFIAVRGTPDGISRA--- 271 (398)
Q Consensus 215 ~dfIFiDsDi~VvDdLg~IFk~~PdF---HvA-LT----------fRnnk--d----qPiNSGfIfVRgt~d~~~kA--- 271 (398)
+..|.+|+||+|..||.++|+.+-+| .+| .. +.+.. + ..+||||++. .-+..-+.
T Consensus 98 dkvLYLD~Dii~~~dI~eL~~~~~df~~~~~aA~v~e~~~~~~~~~~~~~~~~~~~~~gFNSGVmLm--NL~~wR~~~~~ 175 (304)
T cd06430 98 DSLLYVDTDILFLRPVEEIWSFLKKFNSTQLAAMAPEHEEPNIGWYNRFARHPYYGKTGVNSGVMLM--NLTRMRRKYFK 175 (304)
T ss_pred ceEEEeccceeecCCHHHHHHHHhhcCCCeEEEEEecccccchhhhhhhcccCcccccccccceeee--eHHHHHhhhcc
Confidence 68999999999999999999863333 233 21 11001 1 1299999999 43333221
Q ss_pred -------HHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCC
Q 015921 272 -------KIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEG 344 (398)
Q Consensus 272 -------~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpeg 344 (398)
..+-|+|.+.+. +|- .+--+|||=+||-+...||. .+++|||. |||.| .+
T Consensus 176 ~~~~~~~~~~~~~~~~~~~-~~~-~~l~~~DQDiLN~v~~~~p~-------------------~~~~Lp~~-wN~~~-d~ 232 (304)
T cd06430 176 NDMTPVGLRWEEILMPLYK-KYK-LKITWGDQDLINIIFHHNPE-------------------MLYVFPCH-WNYRP-DH 232 (304)
T ss_pred cccchhhhhHHHHHHHHHH-hcc-cCCCCCCHHHHHHHHcCCCC-------------------eEEEcCcc-ccCCc-cc
Confidence 234455555433 221 22358999999999987776 68999997 99997 44
Q ss_pred C--CccccC--CcceeEEeeecCch
Q 015921 345 A--GQFHGM--PLDVKVVHFKGSRK 365 (398)
Q Consensus 345 a--GQfhgm--P~dvKVlHFKGsRK 365 (398)
. |.-|-. +.++||||.-+.-|
T Consensus 233 ~~y~~~~~~~~~~~~~~~H~n~~~~ 257 (304)
T cd06430 233 CMYGSNCKAAEEEGVFILHGNRGVY 257 (304)
T ss_pred eeecccccccccccceEEEcCCCCC
Confidence 3 444443 35999999875544
No 8
>cd06429 GT8_like_1 GT8_like_1 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
Probab=98.62 E-value=7.1e-07 Score=84.75 Aligned_cols=188 Identities=18% Similarity=0.122 Sum_probs=101.2
Q ss_pred HHHHHHHHHHHhh-CCCc-cEEEEeCCCCCCCC---------CCCceEEEecCCccCchhHH------------------
Q 015921 139 AILNVFINFIQVT-MPKS-DVFILTDPASDLSM---------PRKGVTIYPIHGEYSRDKLM------------------ 189 (398)
Q Consensus 139 avln~Fi~sIrvs-mP~s-~VVILTD~~T~~sl---------~~~~~~v~pi~g~YSR~~Lm------------------ 189 (398)
|..-+.++++... .|.. ..-|+||..+.-.+ .+-.++++.+....-++...
T Consensus 12 ~~~v~i~S~l~nn~~~~~~~fhvvtd~~s~~~~~~~~~~~~~~~~~i~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (257)
T cd06429 12 AAAVVINSSISNNKDPSNLVFHIVTDNQNYGAMRSWFDLNPLKIATVKVLNFDDFKLLGKVKVDSLMQLESEADTSNLKQ 91 (257)
T ss_pred HHHHHHHHHHHhCCCCCceEEEEecCccCHHHHHHHHHhcCCCCceEEEEEeCcHHhhcccccchhhhhhcccccccccc
Confidence 3444555666443 3322 56688998874221 13356688875332111100
Q ss_pred --HHH--HHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcCcceEEEEecCC
Q 015921 190 --LQR--IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTP 265 (398)
Q Consensus 190 --lQR--i~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPiNSGfIfVRgt~ 265 (398)
.++ ..+|.|++ |.++-. ..+-.|++|||++|..||+++|+.-.+-+++---.+ .+||||+.+ .-
T Consensus 92 ~~~~~~s~~~y~Rl~---ip~llp---~~~kvlYLD~Dviv~~dl~eL~~~dl~~~~~aav~d----yfNsGV~li--nl 159 (257)
T cd06429 92 RKPEYISLLNFARFY---LPELFP---KLEKVIYLDDDVVVQKDLTELWNTDLGGGVAGAVET----SWNPGVNVV--NL 159 (257)
T ss_pred CCccccCHHHHHHHH---HHHHhh---hhCeEEEEeCCEEEeCCHHHHhhCCCCCCEEEEEhh----hcccceEEE--eH
Confidence 011 24455554 444321 146899999999999999999998222232222222 799999999 33
Q ss_pred CchhHHHHHHHHHHHHHHHhHHh---HhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCC
Q 015921 266 DGISRAKIFLEEVLRVYSSKYMN---ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342 (398)
Q Consensus 266 d~~~kA~~FLeeV~eiY~sky~~---a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPp 342 (398)
+. -+..++-|++++ |.++.-+ .....|||.+|+-+.. -.+.+|||. ||..
T Consensus 160 ~~-wr~~~i~~~~~~-~~~~~~~~~~~~~~~~dqd~ln~~~~----------------------~~~~~L~~~-wN~~-- 212 (257)
T cd06429 160 TE-WRRQNVTETYEK-WMELNQEEEVTLWKLITLPPGLIVFY----------------------GLTSPLDPS-WHVR-- 212 (257)
T ss_pred HH-HHhccHHHHHHH-HHHHhhhcccchhhcCCccHHHHHcc----------------------CeeEECChH-Hccc--
Confidence 21 122233333332 2222111 1234678999987752 246677774 6653
Q ss_pred CCCCcc----ccCCcceeEEeeecCchh
Q 015921 343 EGAGQF----HGMPLDVKVVHFKGSRKR 366 (398)
Q Consensus 343 egaGQf----hgmP~dvKVlHFKGsRKr 366 (398)
+.|.- ...-.+.|||||-|+.|=
T Consensus 213 -~l~~~~~~~~~~~~~~~IIHy~G~~KP 239 (257)
T cd06429 213 -GLGYNYGIRPQDIKAAAVLHFNGNMKP 239 (257)
T ss_pred -CCcccccccccccCCcEEEEECCCCCC
Confidence 11110 111238899999998774
No 9
>cd06914 GT8_GNT1 GNT1 is a fungal enzyme that belongs to the GT 8 family. N-acetylglucosaminyltransferase is a fungal enzyme that catalyzes the addition of N-acetyl-D-glucosamine to mannotetraose side chains by an alpha 1-2 linkage during the synthesis of mannan. The N-acetyl-D-glucosamine moiety in mannan plays a role in the attachment of mannan to asparagine residues in proteins. The mannotetraose and its N-acetyl-D-glucosamine derivative side chains of mannan are the principle immunochemical determinants on the cell surface. N-acetylglucosaminyltransferase is a member of glycosyltransferase family 8, which are, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed, retaining glycosyltransferases.
Probab=98.62 E-value=7.1e-07 Score=86.41 Aligned_cols=187 Identities=17% Similarity=0.177 Sum_probs=119.4
Q ss_pred HHHHHHHhhCCCccEEEEeCCCCCCC-----------CCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCC
Q 015921 143 VFINFIQVTMPKSDVFILTDPASDLS-----------MPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ 211 (398)
Q Consensus 143 ~Fi~sIrvsmP~s~VVILTD~~T~~s-----------l~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~ 211 (398)
++..|||......+.|||.++.-... +.+.++.+.++|-...++ ...+. + ..|.|=.+-++
T Consensus 18 vL~~sLr~~gs~~dlVvLvt~~~~~~~~~~~~~~~~~l~~~~~~v~~v~~~~~~~--~~~~~-~-~~~tKl~~~~l---- 89 (278)
T cd06914 18 ILFEQLRRLGSKAKLVLLVPETLLDRNLDDFVRRDLLLARDKVIVKLIPVIIASG--GDAYW-A-KSLTKLRAFNQ---- 89 (278)
T ss_pred HHHHHHHHhCCCCCEEEEECCCCChhhhhhHHHHHHHhhccCcEEEEcCcccCCC--CCccH-H-HHHHHHHhccc----
Confidence 56789999999998888887765432 123456677776444332 11221 1 12444444333
Q ss_pred CccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhh
Q 015921 212 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASR 291 (398)
Q Consensus 212 ~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~ 291 (398)
..-+..||+|+|++|..+|.++|..--..++|--- ....+|||++.+ .++ ...-.++.+++.+.... ..
T Consensus 90 ~~y~kvlyLDaD~l~~~~ideLf~~~~~~~~Aap~---~~~~FNSGvmvi--~ps-~~~~~~l~~~~~~~~~~-----~~ 158 (278)
T cd06914 90 TEYDRIIYFDSDSIIRHPMDELFFLPNYIKFAAPR---AYWKFASHLMVI--KPS-KEAFKELMTEILPAYLN-----KK 158 (278)
T ss_pred cceeeEEEecCChhhhcChHHHhcCCcccceeeec---CcceecceeEEE--eCC-HHHHHHHHHHHHHhccc-----CC
Confidence 56789999999999999999999872245555421 123799999999 551 23334555555544332 22
Q ss_pred hhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCCCCcccc-------------CC----cc
Q 015921 292 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHG-------------MP----LD 354 (398)
Q Consensus 292 mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpegaGQfhg-------------mP----~d 354 (398)
-.+||=.||.....+.. + ---++++||...||-...|=....|- +| .+
T Consensus 159 ~~~DQdiLN~~~~~~~~----------~-----~~~~~~~Lp~~~y~llt~~~r~~~~~~~l~~~~~~~~~w~~~~~~~~ 223 (278)
T cd06914 159 NEYDMDLINEEFYNSKQ----------L-----FKPSVLVLPHRQYGLLTGEFREKLHKSFLSNAQHLYEKWDPDDVFKE 223 (278)
T ss_pred CCCChHHHHHHHhCCcc----------c-----cCcceEEcCccccccCChhhcccCHHHhhccccccccccCHHHHHhh
Confidence 56899999999754332 1 33468999999999776554333222 12 37
Q ss_pred eeEEeeecC
Q 015921 355 VKVVHFKGS 363 (398)
Q Consensus 355 vKVlHFKGs 363 (398)
.|+|||-++
T Consensus 224 ~k~vHFSd~ 232 (278)
T cd06914 224 SKVIHFSDS 232 (278)
T ss_pred CeEEEecCC
Confidence 899999988
No 10
>PLN00176 galactinol synthase
Probab=98.51 E-value=1.8e-06 Score=85.61 Aligned_cols=196 Identities=17% Similarity=0.267 Sum_probs=109.4
Q ss_pred HHHHHHHHhhCCCccEEEEeCCC-CC---CCCCCCceEEEecCCccCchh---HHHHHHHHHHHHHHHHHHhhccCCCcc
Q 015921 142 NVFINFIQVTMPKSDVFILTDPA-SD---LSMPRKGVTIYPIHGEYSRDK---LMLQRIRSYITFLERRIREHSQGQGHI 214 (398)
Q Consensus 142 n~Fi~sIrvsmP~s~VVILTD~~-T~---~sl~~~~~~v~pi~g~YSR~~---LmlQRi~sYi~fLkT~i~e~~~~~~~~ 214 (398)
-+.+.|||.......+|||+.+. ++ ..|.....+|.+|+.-+..++ +-..++ -+.|=|=+|-++ ...
T Consensus 40 ~vL~~SLr~~~s~~~lVvlVt~dVp~e~r~~L~~~g~~V~~V~~i~~~~~~~~~~~~~~--~i~~tKl~iw~l----~~y 113 (333)
T PLN00176 40 VGLAKGLRKVKSAYPLVVAVLPDVPEEHRRILVSQGCIVREIEPVYPPENQTQFAMAYY--VINYSKLRIWEF----VEY 113 (333)
T ss_pred HHHHHHHHHhCCCCCEEEEECCCCCHHHHHHHHHcCCEEEEecccCCcccccccccchh--hhhhhhhhhccc----ccc
Confidence 35678899998888888876543 21 112222334444432221111 101110 133334444444 456
Q ss_pred ccEEEeecCeeeecCchhhhccCCCceEEE-e--ecc-----------------------------CCCCcCcceEEEEe
Q 015921 215 NHYVFTDSDIAVVDDLGHIFHDYQNFHLAL-T--FRN-----------------------------NKDQPLNSGFIAVR 262 (398)
Q Consensus 215 ~dfIFiDsDi~VvDdLg~IFk~~PdFHvAL-T--fRn-----------------------------nkdqPiNSGfIfVR 262 (398)
+-.||+|+||+|.++|.++|.. |+-.++- . |-+ .....+|||++.+
T Consensus 114 dkvlyLDaD~lv~~nid~Lf~~-~~~~~aAV~dc~~~~~~~~~p~~~~~~c~~~~~~~~wp~~~g~~~~~yFNSGVlvi- 191 (333)
T PLN00176 114 SKMIYLDGDIQVFENIDHLFDL-PDGYFYAVMDCFCEKTWSHTPQYKIGYCQQCPDKVTWPAELGPPPPLYFNAGMFVF- 191 (333)
T ss_pred ceEEEecCCEEeecChHHHhcC-CCcceEEEecccccccccccccccccccccchhhccchhhccCCCCCeEEeEEEEE-
Confidence 7899999999999999999987 4432211 0 110 0123599999999
Q ss_pred cCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCC
Q 015921 263 GTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342 (398)
Q Consensus 263 gt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPp 342 (398)
.++ ..-++++++..... ..-.++||-.||...+. .+..||. +||+.-.
T Consensus 192 -nps-----~~~~~~ll~~l~~~---~~~~f~DQD~LN~~F~~----------------------~~~~Lp~-~YN~~~~ 239 (333)
T PLN00176 192 -EPS-----LSTYEDLLETLKIT---PPTPFAEQDFLNMFFRD----------------------IYKPIPP-VYNLVLA 239 (333)
T ss_pred -EcC-----HHHHHHHHHHHHhc---CCCCCCCHHHHHHHHcC----------------------cEEECCc-hhcCchh
Confidence 452 22234454432211 12257999999999741 2466775 4776521
Q ss_pred ---CCCCccccCCcceeEEeeecC-c-----------------hhhhHHhhhhhccCC
Q 015921 343 ---EGAGQFHGMPLDVKVVHFKGS-R-----------------KRLMLESWNFFSSSS 379 (398)
Q Consensus 343 ---egaGQfhgmP~dvKVlHFKGs-R-----------------KrlMle~Wnf~~s~~ 379 (398)
++..-+. ..++|||||-|. . |.+...-|..|+..+
T Consensus 240 ~~~~~~~~~~--~~~vkIIHY~~~~~KPW~~~~~~~~~~~~~~~~~~~~Ww~~~~~~~ 295 (333)
T PLN00176 240 MLWRHPENVE--LDKVKVVHYCAAGSKPWRYTGKEENMDREDIKMLVKKWWDIYNDES 295 (333)
T ss_pred hhhhChhhcc--cCCcEEEEeeCCCCCCCCCCCcccCCChHHHHHHHHHHHHHhcccc
Confidence 0111111 348999999963 1 445556888888655
No 11
>PRK15171 lipopolysaccharide 1,3-galactosyltransferase; Provisional
Probab=98.44 E-value=8.8e-07 Score=86.45 Aligned_cols=206 Identities=17% Similarity=0.206 Sum_probs=114.6
Q ss_pred eeeccchhHHHHHHHHHHHHhhCCCc--cEEEEeCCCCCCCCC---------CCceEEEecCCc-cCc-hhHHHHHHHHH
Q 015921 130 SYSKTERSMAILNVFINFIQVTMPKS--DVFILTDPASDLSMP---------RKGVTIYPIHGE-YSR-DKLMLQRIRSY 196 (398)
Q Consensus 130 ~~~k~~rs~avln~Fi~sIrvsmP~s--~VVILTD~~T~~sl~---------~~~~~v~pi~g~-YSR-~~LmlQRi~sY 196 (398)
.|.--+.-..-+-+-|.||-..+|+. ++-||+|.-+.-... +..+.++.+..+ +.. ...-.-.+..|
T Consensus 29 v~~~D~ny~~~~~vsi~Sil~nn~~~~~~f~Il~~~is~e~~~~l~~l~~~~~~~i~~~~id~~~~~~~~~~~~~s~atY 108 (334)
T PRK15171 29 AYGIDKNFLFGCGVSIASVLLNNPDKSLVFHVFTDYISDADKQRFSALAKQYNTRINIYLINCERLKSLPSTKNWTYATY 108 (334)
T ss_pred EEECcHhhHHHHHHHHHHHHHhCCCCCEEEEEEeCCCCHHHHHHHHHHHHhcCCeEEEEEeCHHHHhCCcccCcCCHHHH
Confidence 33333444455667788887778884 789999866533221 224556665321 110 00001234556
Q ss_pred HHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhcc-CCCceEEEe--------ecc--------C-CCCcCcceE
Q 015921 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD-YQNFHLALT--------FRN--------N-KDQPLNSGF 258 (398)
Q Consensus 197 i~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~-~PdFHvALT--------fRn--------n-kdqPiNSGf 258 (398)
.|++- .++- ++..+-.|++|||++|.+||.++|.. ..+..+|-- +.+ . .+.++||||
T Consensus 109 ~Rl~i---p~ll--p~~~dkvLYLD~Diiv~~dl~~L~~~dl~~~~~aav~~d~~~~~~~~~~~~l~~~~~~~~YFNsGV 183 (334)
T PRK15171 109 FRFII---ADYF--IDKTDKVLYLDADIACKGSIKELIDLDFAENEIAAVVAEGDAEWWSKRAQSLQTPGLASGYFNSGF 183 (334)
T ss_pred HHHHH---HHhh--hhhcCEEEEeeCCEEecCCHHHHHhccCCCCeEEEEEeccchhHHHHHHHhcCCccccccceecce
Confidence 66543 3321 12367899999999999999999988 222333321 110 0 134799999
Q ss_pred EEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeeccccc
Q 015921 259 IAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYN 338 (398)
Q Consensus 259 IfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YN 338 (398)
+.+ .-+.. +...+-+++.+....+-+...-..+||=+||-+.. -++++||| .||
T Consensus 184 lli--Nl~~w-Re~~i~~k~~~~l~~~~~~~~~~~~DQDiLN~~~~----------------------~~~~~L~~-~wN 237 (334)
T PRK15171 184 LLI--NIPAW-AQENISAKAIEMLADPEIVSRITHLDQDVLNILLA----------------------GKVKFIDA-KYN 237 (334)
T ss_pred EEE--cHHHH-HHhhHHHHHHHHHhccccccceeecChhHHHHHHc----------------------CCeEECCH-hhC
Confidence 999 43222 22344455554443221111225679999998863 15788888 788
Q ss_pred CCCCCCC--CccccCC--cceeEEeeecCchh
Q 015921 339 WTPPEGA--GQFHGMP--LDVKVVHFKGSRKR 366 (398)
Q Consensus 339 wtPpega--GQfhgmP--~dvKVlHFKGsRKr 366 (398)
+.---.- ..-+-.| -+.+||||-|+.|=
T Consensus 238 ~~~~~~~~~~~~~~~~~~~~p~IIHy~G~~KP 269 (334)
T PRK15171 238 TQFSLNYELKDSVINPVNDETVFIHYIGPTKP 269 (334)
T ss_pred CccchhHHHHhcccccccCCCEEEEECCCCCC
Confidence 6521000 0000112 36789999998774
No 12
>cd06432 GT8_HUGT1_C_like The C-terminal domain of HUGT1-like is highly homologous to the GT 8 family. C-terminal domain of glycoprotein glucosyltransferase (UGT). UGT is a large glycoprotein whose C-terminus contains the catalytic activity. This catalytic C-terminal domain is highly homologous to Glycosyltransferase Family 8 (GT 8) and contains the DXD motif that coordinates donor sugar binding, characteristic for Family 8 glycosyltransferases. GT 8 proteins are retaining enzymes based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. The non-catalytic N-terminal portion of the human UTG1 (HUGT1) has been shown to monitor the protein folding status and activate its glucosyltransferase activity.
Probab=98.11 E-value=1.8e-05 Score=74.42 Aligned_cols=181 Identities=13% Similarity=0.107 Sum_probs=100.7
Q ss_pred HHHHHHHHhhCC-CccEEEEeCCCCCCC---CC------CCceEEEecC-CccCchhHHHHH-HHHHHHHHHH-HHHhhc
Q 015921 142 NVFINFIQVTMP-KSDVFILTDPASDLS---MP------RKGVTIYPIH-GEYSRDKLMLQR-IRSYITFLER-RIREHS 208 (398)
Q Consensus 142 n~Fi~sIrvsmP-~s~VVILTD~~T~~s---l~------~~~~~v~pi~-g~YSR~~LmlQR-i~sYi~fLkT-~i~e~~ 208 (398)
-+-+.||...+. .-++-||+|.-|+-. +. +..++++.+. .++-+...-.+| +.+|.+++-. -|++
T Consensus 17 ~v~l~Sll~nn~~~~~fyil~~~is~e~~~~l~~~~~~~~~~i~~i~i~~~~~~~~~~~~~~~~~~y~rL~~~~lLP~-- 94 (248)
T cd06432 17 RIMMLSVMKNTKSPVKFWFIKNFLSPQFKEFLPEMAKEYGFEYELVTYKWPRWLHKQTEKQRIIWGYKILFLDVLFPL-- 94 (248)
T ss_pred HHHHHHHHHcCCCCEEEEEEeCCCCHHHHHHHHHHHHHhCCceEEEEecChhhhhcccccchhHHHHHHHHHHHhhhh--
Confidence 445677765554 348889998766211 11 2234555553 111111111122 3456654433 4442
Q ss_pred cCCCccccEEEeecCeeeecCchhhhcc---------CCCceEEEeec-----------c--CCCCcCcceEEEEecCCC
Q 015921 209 QGQGHINHYVFTDSDIAVVDDLGHIFHD---------YQNFHLALTFR-----------N--NKDQPLNSGFIAVRGTPD 266 (398)
Q Consensus 209 ~~~~~~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvALTfR-----------n--nkdqPiNSGfIfVRgt~d 266 (398)
.++..|++|+||+|.+||.++++. ++|++..-+.+ + ...-++||||+.+ .-
T Consensus 95 ----~vdkvLYLD~Dilv~~dL~eL~~~dl~~~~~Aav~d~~~~~~~~~~~~~~~~~~~~~l~~~~YfNSGVmli--NL- 167 (248)
T cd06432 95 ----NVDKVIFVDADQIVRTDLKELMDMDLKGAPYGYTPFCDSRKEMDGFRFWKQGYWKSHLRGRPYHISALYVV--DL- 167 (248)
T ss_pred ----ccCEEEEEcCCceecccHHHHHhcCcCCCeEEEeeccccchhcccchhhhhhhhhhhcCCCCccceeeEEE--eH-
Confidence 357899999999999999999987 34333211110 0 1123799999999 33
Q ss_pred chhHHHHHHHHHHHHHHHhHHhHhh-hhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCC----C
Q 015921 267 GISRAKIFLEEVLRVYSSKYMNASR-MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWT----P 341 (398)
Q Consensus 267 ~~~kA~~FLeeV~eiY~sky~~a~~-mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwt----P 341 (398)
..-+...+-++|.++|...--...+ .+.||=+||-+. .+.++++|||. |||+ +
T Consensus 168 ~~wR~~~i~~~~~~~~~~l~~~~~~l~~~DQDiLN~v~---------------------~~~~i~~Lp~~-w~~~~~~~~ 225 (248)
T cd06432 168 KRFRRIAAGDRLRGQYQQLSQDPNSLANLDQDLPNNMQ---------------------HQVPIFSLPQE-WLWCETWCS 225 (248)
T ss_pred HHHHHHhHHHHHHHHHHHHhcCCCccccCCchhhHHHh---------------------ccCCeEECChH-HHHHHHHhc
Confidence 2234455666666666421000122 457999999885 23448899987 6654 4
Q ss_pred CCCCCccccCCcceeEEee
Q 015921 342 PEGAGQFHGMPLDVKVVHF 360 (398)
Q Consensus 342 pegaGQfhgmP~dvKVlHF 360 (398)
.|- ..+.|++||
T Consensus 226 ~~~-------~~~~~~~~~ 237 (248)
T cd06432 226 DES-------KKKAKTIDL 237 (248)
T ss_pred ccc-------cCccceeec
Confidence 433 235566666
No 13
>PLN02769 Probable galacturonosyltransferase
Probab=98.06 E-value=1.7e-05 Score=84.68 Aligned_cols=159 Identities=21% Similarity=0.208 Sum_probs=87.4
Q ss_pred HHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhcc---------CCCceEEE---------eeccCCCCcC
Q 015921 193 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD---------YQNFHLAL---------TFRNNKDQPL 254 (398)
Q Consensus 193 i~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvAL---------TfRnnkdqPi 254 (398)
+-+|.||+ |.++-. ..+-+|++|+||+|.+||.++|+. +.+|...+ .+.+.+.+.+
T Consensus 437 ~~nh~Rfy---IPELLP---~LdKVLYLD~DVVVqgDLseLw~iDL~gkviAAVedc~~rl~~~~~yl~~~~F~~~~CyF 510 (629)
T PLN02769 437 VFSHSHFL---LPEIFK---KLKKVVVLDDDVVVQRDLSFLWNLDMGGKVNGAVQFCGVRLGQLKNYLGDTNFDTNSCAW 510 (629)
T ss_pred HHHHHHHH---HHHHhh---hcCeEEEEeCCEEecCcHHHHhcCCCCCCeEEEehhhhhhhhhhhhhhcccCCCcccccc
Confidence 34455554 555421 256899999999999999999986 22222111 0112345668
Q ss_pred cceEEEEecCCCchhHHHHHHHHHHHHHHHhHHh---HhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEE
Q 015921 255 NSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMN---ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLF 331 (398)
Q Consensus 255 NSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~---a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlf 331 (398)
|+||+.+ .-+..-+ .++-+.+++ |.+++.+ ..-..++|.+++-+.. -.+..
T Consensus 511 NSGVLLI--NL~~WRk-~nITe~~~~-~~~~~~~~~~~~~~~~~Lp~lnlvF~----------------------g~v~~ 564 (629)
T PLN02769 511 MSGLNVI--DLDKWRE-LDVTETYLK-LLQKFSKDGEESLRAAALPASLLTFQ----------------------DLIYP 564 (629)
T ss_pred ccCeeEe--eHHHHHH-hCHHHHHHH-HHHHhhhcccccccccCcCHHHHHhc----------------------CeEEE
Confidence 9999999 2211111 111111111 2222211 1112345555554432 24677
Q ss_pred eecccccCCCCCCCCcccc----CCcceeEEeeecCchhh-------hHHhhhhhccCCCcccceee
Q 015921 332 LPCATYNWTPPEGAGQFHG----MPLDVKVVHFKGSRKRL-------MLESWNFFSSSSDISDMLCL 387 (398)
Q Consensus 332 LPC~~YNwtPpegaGQfhg----mP~dvKVlHFKGsRKrl-------Mle~Wnf~~s~~~~~dm~cl 387 (398)
||+. ||+. |.|...+ .-.+.+||||-|..|=- .-++|..|...++-.=-.|-
T Consensus 565 LD~r-WNv~---gLG~~~~i~~~~i~~paIIHYnG~~KPW~e~~i~~yr~~W~kYl~~~~~fl~~Cn 627 (629)
T PLN02769 565 LDDR-WVLS---GLGHDYGIDEQAIKKAAVLHYNGNMKPWLELGIPKYKKYWKRFLNRDDRFMDECN 627 (629)
T ss_pred CCHH-Hccc---cccccccccccccCCcEEEEECCCCCCccCCCCChHHHHHHHHhccCChHHhhCC
Confidence 7773 5755 4443332 23478999999987643 57889988876654444453
No 14
>COG1442 RfaJ Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases [Cell envelope biogenesis, outer membrane]
Probab=97.96 E-value=4.4e-05 Score=75.68 Aligned_cols=189 Identities=20% Similarity=0.302 Sum_probs=117.7
Q ss_pred HHHHHHHhhCC--CccEEEEeCCCCCCCCC---------CCceEEEec-----------CCccCchhHHHHHHHHHHHHH
Q 015921 143 VFINFIQVTMP--KSDVFILTDPASDLSMP---------RKGVTIYPI-----------HGEYSRDKLMLQRIRSYITFL 200 (398)
Q Consensus 143 ~Fi~sIrvsmP--~s~VVILTD~~T~~sl~---------~~~~~v~pi-----------~g~YSR~~LmlQRi~sYi~fL 200 (398)
+=|+||=.+|+ ..+.=||+|.-++-... +..+.++.+ ..+||+ ..|.||+
T Consensus 19 vsI~SiL~~n~~~~~~fhil~~~i~~e~~~~l~~~~~~f~~~i~~~~id~~~~~~~~~~~~~~s~--------~v~~R~f 90 (325)
T COG1442 19 VSIYSLLEHNRKIFYKFHILVDGLNEEDKKKLNETAEPFKSFIVLEVIDIEPFLDYPPFTKRFSK--------MVLVRYF 90 (325)
T ss_pred HHHHHHHHhCccccEEEEEEecCCCHHHHHHHHHHHHhhccceeeEEEechhhhcccccccchHH--------HHHHHHH
Confidence 44778888888 56777888876542211 112222222 223332 3445555
Q ss_pred HHHH-HhhccCCCccccEEEeecCeeeecCchhhhcc-CCCceEEEe--e--------------ccCCCCcCcceEEEEe
Q 015921 201 ERRI-REHSQGQGHINHYVFTDSDIAVVDDLGHIFHD-YQNFHLALT--F--------------RNNKDQPLNSGFIAVR 262 (398)
Q Consensus 201 kT~i-~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~-~PdFHvALT--f--------------RnnkdqPiNSGfIfVR 262 (398)
-..+ ++. +-.|.+|+|++|+.||+++|.- .++..+|-. + .+-.+.++|+||+.+
T Consensus 91 iadlf~~~-------dK~lylD~Dvi~~g~l~~lf~~~~~~~~~aaV~D~~~~~~~~~~~~~~~~~~~~~yFNaG~lli- 162 (325)
T COG1442 91 LADLFPQY-------DKMLYLDVDVIFCGDLSELFFIDLEEYYLAAVRDVFSHYMKEGALRLEKGDLEGSYFNAGVLLI- 162 (325)
T ss_pred HHHhcccc-------CeEEEEecCEEEcCcHHHHHhcCCCcceEEEEeehhhhhhhhhhhHhhhcccccccCccceeee-
Confidence 5443 322 6789999999999999999987 445555552 1 123467799999999
Q ss_pred cCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCC
Q 015921 263 GTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342 (398)
Q Consensus 263 gt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPp 342 (398)
.. ..=+.....+..++..... +..--.+||-.|+.+.+ ..+++||| .||.. |
T Consensus 163 -nl-~~W~~~~i~~k~i~~~~~~--~~~~~~~DQdiLN~i~~----------------------~~~~~L~~-~YN~~-~ 214 (325)
T COG1442 163 -NL-KLWREENIFEKLIELLKDK--ENDLLYPDQDILNMIFE----------------------DRVLELPI-RYNAI-P 214 (325)
T ss_pred -hH-HHHHHhhhHHHHHHHHhcc--ccccCCccccHHHHHHH----------------------hhhhccCc-cccee-e
Confidence 32 1112233333333332111 01225778999988864 35778887 58988 5
Q ss_pred CCCCcccc---CCc--ceeEEeeecCchhhhHHhhhhh
Q 015921 343 EGAGQFHG---MPL--DVKVVHFKGSRKRLMLESWNFF 375 (398)
Q Consensus 343 egaGQfhg---mP~--dvKVlHFKGsRKrlMle~Wnf~ 375 (398)
....|..- +|. +.-++||-|.+|=-+..+|+.+
T Consensus 215 ~~~~~~~~~~~~~~~~~~~iiHy~g~~KPW~~~~~~~~ 252 (325)
T COG1442 215 YIDSQLKDKYIYPFGDDPVILHYAGPTKPWHSDSSNYP 252 (325)
T ss_pred hhhhccchhhhccCCCCceEEEecCCCCCCcCcccccc
Confidence 66666655 555 7789999999999888887664
No 15
>PLN02659 Probable galacturonosyltransferase
Probab=97.77 E-value=8.3e-05 Score=78.24 Aligned_cols=149 Identities=19% Similarity=0.176 Sum_probs=83.1
Q ss_pred cccEEEeecCeeeecCchhhhcc---------CCCceE--------------EE------eeccCCCCcCcceEEEEecC
Q 015921 214 INHYVFTDSDIAVVDDLGHIFHD---------YQNFHL--------------AL------TFRNNKDQPLNSGFIAVRGT 264 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHv--------------AL------TfRnnkdqPiNSGfIfVRgt 264 (398)
.+-+|++|+|++|.+||.++|+. ++||.. .+ .+.|.+.+.+|+||+.+ .
T Consensus 344 LdKVLYLD~DVVVqgDLseLw~iDL~gkv~AAVeDc~~~d~~~~~~~~~~yL~~s~p~i~~yFn~~~cYfNsGVlLI--N 421 (534)
T PLN02659 344 LNKVVFLDDDIVVQTDLSPLWDIDMNGKVNGAVETCRGEDKFVMSKKLKSYLNFSHPLIAKNFDPNECAWAYGMNIF--D 421 (534)
T ss_pred cCeEEEeeCCEEEcCchHHHHhCCCCCcEEEEeeccccccchhhhHHHHHhhcccchhhhhccCccccceecceeEe--e
Confidence 56899999999999999999987 444321 11 11244567899999999 2
Q ss_pred CCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCC
Q 015921 265 PDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEG 344 (398)
Q Consensus 265 ~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpeg 344 (398)
-+ .=+-.++-|+.++. .++..+....+.||-+|+-.+. .| .-.+.+||. .||.. |
T Consensus 422 Lk-~WRe~nITek~l~~-l~~n~~~~l~l~DQdaLp~~Li--------------vF-----~g~v~~LD~-rWN~~---g 476 (534)
T PLN02659 422 LE-AWRKTNISSTYHHW-LEENLKSDLSLWQLGTLPPGLI--------------AF-----HGHVHVIDP-FWHML---G 476 (534)
T ss_pred HH-HHHhcChHHHHHHH-HHhcccccccccccccchHHHH--------------Hh-----cCCEEECCh-hheec---C
Confidence 10 00111222222221 1222212234567777753321 01 112445554 34432 1
Q ss_pred CCcc----ccCCcceeEEeeecCchhh-------hHHhhhhhccCCCcccceeeee
Q 015921 345 AGQF----HGMPLDVKVVHFKGSRKRL-------MLESWNFFSSSSDISDMLCLIL 389 (398)
Q Consensus 345 aGQf----hgmP~dvKVlHFKGsRKrl-------Mle~Wnf~~s~~~~~dm~clil 389 (398)
-|.- .....+.+||||-|..|=- --++|.-|...++-.=-.|-|+
T Consensus 477 Lg~~~~~~~~~i~~paIIHYnG~~KPW~~~~~~~yr~~W~kYl~~s~~fl~~Cni~ 532 (534)
T PLN02659 477 LGYQENTSLADAESAGVVHFNGRAKPWLDIAFPQLRPLWAKYIDSSDKFIKSCHIR 532 (534)
T ss_pred CcccccccccccCCcEEEEECCCCCccccccCCcchhHHHHHhccCCHHHHhcCCC
Confidence 1110 1134588999999986633 2378998888877665667665
No 16
>PLN02867 Probable galacturonosyltransferase
Probab=97.71 E-value=0.00018 Score=75.83 Aligned_cols=167 Identities=22% Similarity=0.222 Sum_probs=91.9
Q ss_pred HHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCce-E-EE----------------eecc------
Q 015921 193 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFH-L-AL----------------TFRN------ 248 (398)
Q Consensus 193 i~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFH-v-AL----------------TfRn------ 248 (398)
+-|...|+.=-|.|+-. ..+-.|++|+|++|..||.++|+.=-+-+ + |+ +|-|
T Consensus 327 ylS~lnYlRflIPeLLP---~LdKVLYLD~DVVVqgDLseLwdiDL~gkviaAV~D~~c~~~~~~~~~~~~YlNfsnp~i 403 (535)
T PLN02867 327 CLSLLNHLRIYIPELFP---DLNKIVFLDDDVVVQHDLSSLWELDLNGKVVGAVVDSWCGDNCCPGRKYKDYLNFSHPLI 403 (535)
T ss_pred hhhHHHHHHHHHHHHhh---ccCeEEEecCCEEEcCchHHHHhCcCCCCeEEEEeccccccccccchhhhhhccccchhh
Confidence 33434444445565532 35789999999999999999998811111 0 11 0111
Q ss_pred -----CCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCcccccc
Q 015921 249 -----NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVED 323 (398)
Q Consensus 249 -----nkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~ 323 (398)
.+.+.+|+||+.+ .-+.. +-.+.-++.++ |.+...+....+.||-+|+.+.. .|.
T Consensus 404 ~~~~~p~~cYFNSGVmLI--NL~~W-Re~nITek~~~-~Le~n~~~~~~l~dqd~LN~~Ll--------------vF~-- 463 (535)
T PLN02867 404 SSNLDQERCAWLYGMNVF--DLKAW-RRTNITEAYHK-WLKLSLNSGLQLWQPGALPPALL--------------AFK-- 463 (535)
T ss_pred hccCCCCCcceecceeee--eHHHH-HHhcHHHHHHH-HHHhchhcccccccccccchHHH--------------Hhc--
Confidence 2345689999999 21000 11112111111 11111111124567777765321 111
Q ss_pred ccceEEEEeecccccCCCCCCCCc-----cccCCcceeEEeeecCchhh-------hHHhhhhhccCCCcccceeeee
Q 015921 324 IIGASVLFLPCATYNWTPPEGAGQ-----FHGMPLDVKVVHFKGSRKRL-------MLESWNFFSSSSDISDMLCLIL 389 (398)
Q Consensus 324 v~g~sVlfLPC~~YNwtPpegaGQ-----fhgmP~dvKVlHFKGsRKrl-------Mle~Wnf~~s~~~~~dm~clil 389 (398)
-.+..||. .|||. |.|. ......+.+||||-|+.|=- .-++|.-|..-++..--.|-|+
T Consensus 464 ---g~v~~LD~-rWNv~---gLgy~~~~~~~~~i~~paIIHYnG~~KPW~e~~~~~yR~~W~kyl~~~~~fl~~cni~ 534 (535)
T PLN02867 464 ---GHVHPIDP-SWHVA---GLGSRPPEVPREILESAAVLHFSGPAKPWLEIGFPEVRSLWYRHVNFSDKFIRKCRIM 534 (535)
T ss_pred ---CcEEECCh-hhccc---CCCcccccchhhhcCCcEEEEECCCCCcccccCCCchhHHHHHhcCccchHHHhcccC
Confidence 13556664 46653 3221 11234578999999987632 4788999998888765568775
No 17
>PLN02870 Probable galacturonosyltransferase
Probab=97.64 E-value=0.00018 Score=75.83 Aligned_cols=166 Identities=13% Similarity=0.144 Sum_probs=88.1
Q ss_pred HHHHHHHH-HHHHHhhccCCCccccEEEeecCeeeecCchhhhcc---------CCCceEE--------E-eec------
Q 015921 193 IRSYITFL-ERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD---------YQNFHLA--------L-TFR------ 247 (398)
Q Consensus 193 i~sYi~fL-kT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvA--------L-TfR------ 247 (398)
+-+|.||+ -.-+++ .+-+|++|+|++|.+||.++|+. ++||..+ + .|-
T Consensus 328 ~lny~Rl~LPelLP~-------LdKVLYLD~DVVVqgDLseLw~iDL~gkviaAVeDc~~~~~~~~~~~~~~YfNfs~p~ 400 (533)
T PLN02870 328 LLNHLRIYLPELFPN-------LDKVVFLDDDVVIQRDLSPLWDIDLGGKVNGAVETCRGEDEWVMSKRFRNYFNFSHPL 400 (533)
T ss_pred HHHHHHHHHHHHhhh-------cCeEEEEeCCEEecCcHHHHhhCCCCCceEEEEccccccchhhhhhhhhhhcccccch
Confidence 45666653 333342 56899999999999999999997 3333211 0 011
Q ss_pred -----cCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccc
Q 015921 248 -----NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVE 322 (398)
Q Consensus 248 -----nnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~ 322 (398)
+.+++.+|+||+.+ .-+.. +-.++-|+.++. .++..+...-+.||-+|.-.+.. |.
T Consensus 401 i~~~fd~~~cyfNSGVlLI--NL~~W-Re~nITek~~~~-l~~n~~~~l~l~DQdaLp~~liv--------------f~- 461 (533)
T PLN02870 401 IAKNLDPEECAWAYGMNIF--DLRAW-RKTNIRETYHSW-LKENLKSNLTMWKLGTLPPALIA--------------FK- 461 (533)
T ss_pred hhcccCcccceeeccchhc--cHHHH-HHcChHHHHHHH-HHhhhhcCceecccccccHhHHH--------------hc-
Confidence 13456678999998 21000 011122222221 11111111256788777532210 00
Q ss_pred cccceEEEEeecccccCCCCC-CCCccccCCcceeEEeeecCchhhh-------HHhhhhhccCCCcccceeeee
Q 015921 323 DIIGASVLFLPCATYNWTPPE-GAGQFHGMPLDVKVVHFKGSRKRLM-------LESWNFFSSSSDISDMLCLIL 389 (398)
Q Consensus 323 ~v~g~sVlfLPC~~YNwtPpe-gaGQfhgmP~dvKVlHFKGsRKrlM-------le~Wnf~~s~~~~~dm~clil 389 (398)
-.+.+||- .||...-. ..+.......+.+||||-|..|=-. -.+|.-|...++-.=-.|-|.
T Consensus 462 ----g~v~~LD~-rWN~~gLgy~~~~~~~~i~~aaIIHY~G~~KPW~~~~~~~yr~~W~kYl~~s~~fl~~Cni~ 531 (533)
T PLN02870 462 ----GHVHPIDP-SWHMLGLGYQSKTNIESVKKAAVIHYNGQSKPWLEIGFEHLRPFWTKYVNYSNDFIRNCHIL 531 (533)
T ss_pred ----CceEECCh-HHhcCCCCCcccccccccCCcEEEEECCCCCCccccCccchhHHHHHHHccCchHhhhcCCC
Confidence 12334433 23322000 0122334556889999999877554 458888887776665567665
No 18
>PLN02910 polygalacturonate 4-alpha-galacturonosyltransferase
Probab=97.49 E-value=0.00037 Score=74.85 Aligned_cols=162 Identities=16% Similarity=0.177 Sum_probs=89.1
Q ss_pred HHHHHHHHH-HHHHHhhccCCCccccEEEeecCeeeecCchhhhcc---------CCCceEEE---------------ee
Q 015921 192 RIRSYITFL-ERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD---------YQNFHLAL---------------TF 246 (398)
Q Consensus 192 Ri~sYi~fL-kT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvAL---------------Tf 246 (398)
.+.+|.||+ -..+++ .+.+|++|+|++|.+||.++|+. +++|+..+ .+
T Consensus 459 S~lnY~Rf~LPelLp~-------l~KVLYLD~DVVV~gDLseLw~iDL~g~v~AAVedc~~~f~r~~~ylnfs~P~i~~y 531 (657)
T PLN02910 459 SMLNHLRFYLPEVYPK-------LEKILFLDDDIVVQKDLTPLWSIDMQGMVNGAVETCKESFHRFDKYLNFSNPKISEN 531 (657)
T ss_pred hHHHHHHHHHHHHhhh-------cCeEEEEeCCEEecCchHHHHhCCcCCceEEEecccchhhhhhhhhhccCChhhhhc
Confidence 455667753 333342 46899999999999999999987 44443321 12
Q ss_pred ccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHH---HHhcCCchhhhhcccCcccccc
Q 015921 247 RNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAW---VVKSHPSFDARRFTKAQPFVED 323 (398)
Q Consensus 247 RnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~---avk~~~~~~~~~f~~~~af~~~ 323 (398)
.|.+.+.+|+||+.+ .-+.- +-.+ +.+.++-|.+. ...+-+.||-+|.= +...+-...-.+++..--
T Consensus 532 FNs~aCyfNsGVmVI--DL~~W-Re~n-ITe~ye~w~el--n~~~~L~dqgsLPpgLLvF~g~i~pLD~rWNv~GL---- 601 (657)
T PLN02910 532 FDPNACGWAFGMNMF--DLKEW-RKRN-ITGIYHYWQDL--NEDRTLWKLGSLPPGLITFYNLTYPLDRSWHVLGL---- 601 (657)
T ss_pred cCCCCceeecccEEE--eHHHH-HHhh-HHHHHHHHHHh--cccccccccCCCChHHHHHhCceeecCchheecCC----
Confidence 345677788899998 21000 0011 11233333332 24457788876662 221111111111111110
Q ss_pred ccceEEEEeecccccCCCCCCCCccccCCcceeEEeeecCchhh-------hHHhhhhhccCCCcccceeee
Q 015921 324 IIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL-------MLESWNFFSSSSDISDMLCLI 388 (398)
Q Consensus 324 v~g~sVlfLPC~~YNwtPpegaGQfhgmP~dvKVlHFKGsRKrl-------Mle~Wnf~~s~~~~~dm~cli 388 (398)
| | - .+.+.+...+.+||||-|..|=- -..+|.-|...++-.=-.|-|
T Consensus 602 --G----------y--d----~~v~~~~i~~AAVLHynG~~KPWl~l~i~~Yr~~W~kYl~~d~~fl~~Cni 655 (657)
T PLN02910 602 --G----------Y--D----PALNQTEIENAAVVHYNGNYKPWLDLAIAKYKPYWSRYVQYDNPYLQLCNI 655 (657)
T ss_pred --C----------C--C----cccccccccCcEEEEeCCCCCcccccCcccchHHHHHHccCCChHHHhCCC
Confidence 0 1 1 22233455679999999997743 346899888776655445654
No 19
>PF05637 Glyco_transf_34: galactosyl transferase GMA12/MNN10 family; InterPro: IPR008630 This family contains a number of glycosyltransferase enzymes that contain a DXD motif. This family includes a number of Caenorhabditis elegans homologues where the DXD is replaced by DXH. Some members of this family are included in glycosyltransferase family 34.; GO: 0016758 transferase activity, transferring hexosyl groups, 0016021 integral to membrane; PDB: 2P72_B 2P73_A 2P6W_A.
Probab=97.40 E-value=0.00012 Score=68.75 Aligned_cols=142 Identities=20% Similarity=0.264 Sum_probs=15.9
Q ss_pred HHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhcc-----------------CC-Cce-----------EEEeec
Q 015921 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD-----------------YQ-NFH-----------LALTFR 247 (398)
Q Consensus 197 i~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~-----------------~P-dFH-----------vALTfR 247 (398)
+-.|...|+++ ...+=++++|+|++|.+-=-.|.+. .| +-. +.+-.
T Consensus 64 ~~~lr~~m~~~----P~~~wv~~lD~Dali~n~~~~L~~~il~p~~L~~~~~r~~~~~p~~~~~~~~~~~~~~~~~li~- 138 (239)
T PF05637_consen 64 IPALRAAMKKY----PEAEWVWWLDSDALIMNPDFSLEEHILSPSRLDSLLLRDVPIVPPDSIIKTYSVIDGNDIHLII- 138 (239)
T ss_dssp HHHHHHHHHH-----TT-SEEEEE-TTEEE--------------------------------------------------
T ss_pred HHHHHHHHHhC----CCCCEEEEEcCCeEEEeccccccccccccccccccccccccccccccccccccccccccccccc-
Confidence 44455666665 2345578999999998743332211 01 100 11111
Q ss_pred cCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccce
Q 015921 248 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGA 327 (398)
Q Consensus 248 nnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~ 327 (398)
.....-+|.|++++|-++ -++.||+.|.+-....+-......+||-||+..++.||... .
T Consensus 139 t~d~~gLNtGsFliRns~----ws~~fLd~w~~~~~~~~~~~~~~~~EQsAl~~ll~~~~~~~----------------~ 198 (239)
T PF05637_consen 139 TQDWNGLNTGSFLIRNSP----WSRDFLDAWADPLYRNYDWDQLEFDEQSALEHLLQWHPEIL----------------S 198 (239)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccccccccccccc----ccccccccccccccccccccccccccccccccccccccccc----------------c
Confidence 012334999999999999 89999999986554443222234799999999998887521 2
Q ss_pred EEEEeecccccCCCCCCCCccccCCcceeEEeeecCch
Q 015921 328 SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRK 365 (398)
Q Consensus 328 sVlfLPC~~YNwtPpegaGQfhgmP~dvKVlHFKGsRK 365 (398)
++.++|...-|=.|....+ +.-.+-| =|+||.|=..
T Consensus 199 ~~~~vpq~~~nsy~~~~~~-~~~~~GD-fvvhfaGC~~ 234 (239)
T PF05637_consen 199 KVALVPQRWFNSYPEDECN-YQYKEGD-FVVHFAGCKV 234 (239)
T ss_dssp --------------------------------------
T ss_pred ccccccccccccccccccc-ccccccc-cccccccccc
Confidence 2345555443432222111 3344445 4889999654
No 20
>PLN02829 Probable galacturonosyltransferase
Probab=97.40 E-value=0.00046 Score=73.98 Aligned_cols=48 Identities=19% Similarity=0.223 Sum_probs=36.3
Q ss_pred cccEEEeecCeeeecCchhhhcc---------CCCceEEE---------------eeccCCCCcCcceEEEE
Q 015921 214 INHYVFTDSDIAVVDDLGHIFHD---------YQNFHLAL---------------TFRNNKDQPLNSGFIAV 261 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvAL---------------TfRnnkdqPiNSGfIfV 261 (398)
.+.+|++|+|++|++||.++|+. +++|+..+ .+.|...+.+|+||+.+
T Consensus 457 LdKVLYLD~DVVVqgDLseLw~iDL~gkviAAVedc~~~f~r~~~~l~fs~p~i~~~Fn~~~CyFNSGVmVI 528 (639)
T PLN02829 457 LNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWAYGMNVF 528 (639)
T ss_pred cCeEEEEeCCEEeCCChHHHHhCCCCCceEEEeccchhhhhhhhhhhhccchHhhhccCCcccceecceEEE
Confidence 56899999999999999999987 44443221 11134677899999999
No 21
>PLN02718 Probable galacturonosyltransferase
Probab=97.25 E-value=0.0013 Score=70.30 Aligned_cols=153 Identities=18% Similarity=0.142 Sum_probs=85.1
Q ss_pred HHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhcc---------CCCceEEEe----------e--------cc
Q 015921 196 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD---------YQNFHLALT----------F--------RN 248 (398)
Q Consensus 196 Yi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvALT----------f--------Rn 248 (398)
+..|+.=-|.++-. ..+-+|++|+|++|.+||.++|+. +++|+..+. | -|
T Consensus 403 ~~~y~Rl~ipellp---~l~KvLYLD~DvVV~~DL~eL~~iDl~~~v~aaVedC~~~~~~~~~~~~~lnfs~p~i~~~fn 479 (603)
T PLN02718 403 ALNHARFYLPDIFP---GLNKIVLFDHDVVVQRDLSRLWSLDMKGKVVGAVETCLEGEPSFRSMDTFINFSDPWVAKKFD 479 (603)
T ss_pred HHHHHHHHHHHHhc---ccCEEEEEECCEEecCCHHHHhcCCCCCcEEEEeccccccccchhhhhhhhhccchhhhcccC
Confidence 34444444444422 257899999999999999999886 444433211 1 12
Q ss_pred CCCCcCcceEEEEecCCCchhHHHHHHHHHHH--------HHHHhHHhHhhhhcchhhHH---HHHhcCCchhhhhcccC
Q 015921 249 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLR--------VYSSKYMNASRMLGDQLALA---WVVKSHPSFDARRFTKA 317 (398)
Q Consensus 249 nkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~e--------iY~sky~~a~~mlGDQlALa---~avk~~~~~~~~~f~~~ 317 (398)
.+.+.+|+||+.+ . |++|-+ -|.++. ..+.+.||-+|+ -+...
T Consensus 480 ~~~CyfNsGVlLI--D----------Lk~WReenITe~~~~~l~~n--~~~~l~dqdaLpp~LlvF~g------------ 533 (603)
T PLN02718 480 PKACTWAFGMNLF--D----------LEEWRRQKLTSVYHKYLQLG--VKRPLWKAGSLPIGWLTFYN------------ 533 (603)
T ss_pred CCccccccceEEE--e----------HHHHHhcChHHHHHHHHHhc--cCccccCcccccHHHHHhcC------------
Confidence 3567789999998 1 222222 222221 233566777765 33322
Q ss_pred ccccccccceEEEEeecccccCCCCCC-CCccccCCcceeEEeeecCchh-------hhHHhhhhhccCCCcccceeee
Q 015921 318 QPFVEDIIGASVLFLPCATYNWTPPEG-AGQFHGMPLDVKVVHFKGSRKR-------LMLESWNFFSSSSDISDMLCLI 388 (398)
Q Consensus 318 ~af~~~v~g~sVlfLPC~~YNwtPpeg-aGQfhgmP~dvKVlHFKGsRKr-------lMle~Wnf~~s~~~~~dm~cli 388 (398)
.+..||. .||...-.. .+--.....+.+||||-|..|= -.-++|.-|...++-.=-.|-|
T Consensus 534 ----------ri~~LD~-rWNv~gLG~~~~i~~~~i~~aaIIHYnG~~KPWle~~i~~yr~~W~k~v~~~~~~l~~Cn~ 601 (603)
T PLN02718 534 ----------QTVALDK-RWHVLGLGHESGVGASDIEQAAVIHYDGVMKPWLDIGIGKYKRYWNIHVPYHHPYLQQCNI 601 (603)
T ss_pred ----------ceeecCh-HHhccCccccccccccccCCCEEEEECCCCCccccCChhhHHHHHHhhcCCCChHHHhcCC
Confidence 2334443 233321000 0011123568899999998652 2357899998777655445544
No 22
>PLN02742 Probable galacturonosyltransferase
Probab=97.19 E-value=0.00093 Score=70.59 Aligned_cols=141 Identities=18% Similarity=0.191 Sum_probs=78.7
Q ss_pred cccEEEeecCeeeecCchhhhcc---------CCCceEEEe-ec--------------cCCCCcCcceEEEEecCCCchh
Q 015921 214 INHYVFTDSDIAVVDDLGHIFHD---------YQNFHLALT-FR--------------NNKDQPLNSGFIAVRGTPDGIS 269 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHvALT-fR--------------nnkdqPiNSGfIfVRgt~d~~~ 269 (398)
.+.+|++|+|++|.+||.++|.. +++|+..+. |. +.+.+.+|+||..+ .-+.
T Consensus 353 l~KvlYLD~DvVV~~DL~eL~~~DL~~~viaAVedC~~~f~ry~~yLnfS~p~i~~~f~~~aC~fNsGV~Vi--DL~~-- 428 (534)
T PLN02742 353 LEKVVFLDDDVVVQKDLTPLFSIDLHGNVNGAVETCLETFHRYHKYLNFSHPLISSHFDPDACGWAFGMNVF--DLVA-- 428 (534)
T ss_pred cCeEEEEeCCEEecCChHHHhcCCCCCCEEEEeCchhhhhhhhhhhhcccchhhhccCCCCccccccCcEEE--eHHH--
Confidence 57899999999999999999987 445532221 11 24567799999988 2100
Q ss_pred HHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHH---HhcCCchhhhhcccCccccccccceEEEEeecccccCCCCCCCC
Q 015921 270 RAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV---VKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAG 346 (398)
Q Consensus 270 kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~a---vk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpegaG 346 (398)
=-++-+.+.++.|.+. +..+-+.||-+|.-. ...+-. -|+. .||.. |.|
T Consensus 429 WRe~nITe~~~~w~e~--n~~~~l~d~gaLpp~LLaF~g~~~----------------------~LD~-rWNv~---gLG 480 (534)
T PLN02742 429 WRKANVTAIYHYWQEQ--NVDRTLWKLGTLPPGLLTFYGLTE----------------------PLDR-RWHVL---GLG 480 (534)
T ss_pred HHhhcHHHHHHHHHHh--ccccccccccccchHHHHHcCcce----------------------ecCh-hheec---ccc
Confidence 0111122233334332 244466777666643 211111 1221 22222 111
Q ss_pred c----cccCCcceeEEeeecCchh-------hhHHhhhhhccCCCccccee
Q 015921 347 Q----FHGMPLDVKVVHFKGSRKR-------LMLESWNFFSSSSDISDMLC 386 (398)
Q Consensus 347 Q----fhgmP~dvKVlHFKGsRKr-------lMle~Wnf~~s~~~~~dm~c 386 (398)
- .+....+.+||||-|..|= -.-++|.-|...++-.=-.|
T Consensus 481 ~~~~v~~~~i~~aaILHynG~~KPWl~~~i~~yr~~W~kYl~~s~~fl~~C 531 (534)
T PLN02742 481 YDTNIDPRLIESAAVLHFNGNMKPWLKLAIERYKPLWERYVNYSHPYLQQC 531 (534)
T ss_pred cccccchhhccCCeEEEECCCCCcccccCCcccchHHHHHHccCCHHHHhC
Confidence 1 1124458899999998662 24568888877665443344
No 23
>PLN02523 galacturonosyltransferase
Probab=97.17 E-value=0.0027 Score=67.50 Aligned_cols=148 Identities=17% Similarity=0.161 Sum_probs=77.4
Q ss_pred cccEEEeecCeeeecCchhhhcc---------CCCceE---------EE------eeccCCCCcCcceEEEEecCCCchh
Q 015921 214 INHYVFTDSDIAVVDDLGHIFHD---------YQNFHL---------AL------TFRNNKDQPLNSGFIAVRGTPDGIS 269 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg~IFk~---------~PdFHv---------AL------TfRnnkdqPiNSGfIfVRgt~d~~~ 269 (398)
.+.+|++|+|++|.+||.++|+. +.||.. .+ .|.|.+.+..|+||+.+ .-+. =
T Consensus 377 ldKVLYLD~DVVVq~DLseLw~iDL~gkv~aAVeDc~~~~~r~~~~ln~s~p~i~~yFNs~aC~wnsGVmlI--NL~~-W 453 (559)
T PLN02523 377 LHRILFLDDDVVVQKDLTGLWKIDMDGKVNGAVETCFGSFHRYAQYLNFSHPLIKEKFNPKACAWAYGMNIF--DLDA-W 453 (559)
T ss_pred cCeEEEEeCCEEecCCHHHHHhCcCCCceEEEehhhhhHHHHHHHhhcccchhhhhCcCCCcccccCCcEEE--eHHH-H
Confidence 67899999999999999999986 333311 11 13356666777799998 2200 0
Q ss_pred HHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCCC-CCCcc
Q 015921 270 RAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPE-GAGQF 348 (398)
Q Consensus 270 kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPpe-gaGQf 348 (398)
+-.++-|+. ..|. + ++..+-+.||-+|+-.+. .|. -.+..|+- .||+...- ..+-.
T Consensus 454 Re~nITek~-~~w~-~-ln~~~~l~DqdaLpp~Li--------------vF~-----gri~~LD~-rWNvlglGy~~~i~ 510 (559)
T PLN02523 454 RREKCTEQY-HYWQ-N-LNENRTLWKLGTLPPGLI--------------TFY-----STTKPLDK-SWHVLGLGYNPSIS 510 (559)
T ss_pred HHhchHHHH-HHHH-H-hccccccccccccchHHH--------------Hhc-----CceEecCc-hhhccCCccCCCcc
Confidence 111222222 1222 1 223345667766652110 000 11333332 33322000 00111
Q ss_pred ccCCcceeEEeeecCchhh-------hHHhhhhhccCCCcccceee
Q 015921 349 HGMPLDVKVVHFKGSRKRL-------MLESWNFFSSSSDISDMLCL 387 (398)
Q Consensus 349 hgmP~dvKVlHFKGsRKrl-------Mle~Wnf~~s~~~~~dm~cl 387 (398)
.....+.+||||-|..|=- -.++|.-|...++-.=-.|-
T Consensus 511 ~~~i~~paIIHYnG~~KPWle~~i~~yr~~W~kYl~~~~~fl~~Cn 556 (559)
T PLN02523 511 MDEIRNAAVIHFNGNMKPWLDIAMNQFKPLWTKYVDYDMEFVQACN 556 (559)
T ss_pred ccccCCCEEEEECCCCCccccCCCCcchHHHHHHHccCCHHHHhCC
Confidence 2223578999999987744 35678888766654433453
No 24
>PF03414 Glyco_transf_6: Glycosyltransferase family 6; InterPro: IPR005076 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 6 GT6 from CAZY comprises enzymes with three known activities; alpha-1,3-galactosyltransferase (2.4.1.151 from EC); alpha-1,3 N-acetylgalactosaminyltransferase (2.4.1.40 from EC); alpha-galactosyltransferase (2.4.1.37 from EC).; GO: 0016758 transferase activity, transferring hexosyl groups, 0005975 carbohydrate metabolic process, 0016020 membrane; PDB: 2Y7A_B 2O1G_A 1R82_A 2RJ1_A 3IOJ_B 2RJ4_A 3I0C_A 3SX8_A 1ZJ1_A 3I0E_A ....
Probab=94.23 E-value=0.36 Score=49.12 Aligned_cols=100 Identities=21% Similarity=0.400 Sum_probs=64.6
Q ss_pred eEEeceeeeccchhHHHHHHHHHHH-HhhCCCccEE--EEeCCCCCCC---CC-CCceEEEecCCccCchhHHHHHHHHH
Q 015921 124 VTVGNASYSKTERSMAILNVFINFI-QVTMPKSDVF--ILTDPASDLS---MP-RKGVTIYPIHGEYSRDKLMLQRIRSY 196 (398)
Q Consensus 124 v~vg~~~~~k~~rs~avln~Fi~sI-rvsmP~s~VV--ILTD~~T~~s---l~-~~~~~v~pi~g~YSR~~LmlQRi~sY 196 (398)
+|||= .+--+.|=..-|.-|+.|. +..||+.+|+ |+||.-..+. +. +.+..|.+|+++-.=...-+.|.+..
T Consensus 99 ~tIGL-~vfA~GkY~~fl~~Fl~SAek~Fm~g~~V~YYVFTD~p~~vP~i~l~~~r~~~V~~v~~~~~Wqd~sm~Rm~~i 177 (337)
T PF03414_consen 99 ITIGL-TVFATGKYIVFLKDFLESAEKHFMVGHRVIYYVFTDQPSKVPRIELGPGRRLKVFEVQEEKRWQDISMMRMEMI 177 (337)
T ss_dssp -EEEE-EEEE-CCHHHHHHHHHHHHHHHBSTTSEEEEEEEES-GGGS------TTEEEEEEE-SGGSSHHHHHHHHHHHH
T ss_pred ceEEE-EEEecccHHHHHHHHHHhHHHhccCCcEEEEEEEeCchhhCCccccCCCceeEEEEecccCCCccchhHHHHHH
Confidence 35543 3334567777999999999 7789999875 8999987654 22 34677999986443345556787777
Q ss_pred HHHHHHHH-HhhccCCCccccEEE-eecCeeeecCchh
Q 015921 197 ITFLERRI-REHSQGQGHINHYVF-TDSDIAVVDDLGH 232 (398)
Q Consensus 197 i~fLkT~i-~e~~~~~~~~~dfIF-iDsDi~VvDdLg~ 232 (398)
..+++..+ +|. |||| .|.|++..++.|.
T Consensus 178 ~~~i~~~~~~Ev--------DYLFc~dvd~~F~~~vGv 207 (337)
T PF03414_consen 178 SEHIEQHIQHEV--------DYLFCMDVDMVFQDHVGV 207 (337)
T ss_dssp HHHHHHCHHHH---------SEEEEEESSEEE-S-B-G
T ss_pred HHHHHHHHhhcC--------CEEEEEecceEEecccCH
Confidence 77776654 555 8776 5999999987773
No 25
>cd02515 Glyco_transf_6 Glycosyltransferase family 6 comprises enzymes responsible for the production of the human ABO blood group antigens. Glycosyltransferase family 6, GT_6, comprises enzymes with three known activities: alpha-1,3-galactosyltransferase, alpha-1,3 N-acetylgalactosaminyltransferase, and alpha-galactosyltransferase. UDP-galactose:beta-galactosyl alpha-1,3-galactosyltransferase (alpha3GT) catalyzes the transfer of galactose from UDP-alpha-d-galactose into an alpha-1,3 linkage with beta-galactosyl groups in glycoconjugates. The enzyme exists in most mammalian species but is absent from humans, apes, and old world monkeys as a result of the mutational inactivation of the gene. The alpha-1,3 N-acetylgalactosaminyltransferase and alpha-galactosyltransferase are responsible for the production of the human ABO blood group antigens. A N-acetylgalactosaminyltransferases use a UDP-GalNAc donor to convert the H-antigen acceptor to the A antigen, whereas a galactosyltransferase use
Probab=92.02 E-value=1.7 Score=43.25 Aligned_cols=200 Identities=16% Similarity=0.214 Sum_probs=112.8
Q ss_pred eeeccchhHHHHHHHHHHH-HhhCCCccEE--EEeCCCCCCCC---C-CCceEEEecCCccCchhHHHHHHHHHHHHHHH
Q 015921 130 SYSKTERSMAILNVFINFI-QVTMPKSDVF--ILTDPASDLSM---P-RKGVTIYPIHGEYSRDKLMLQRIRSYITFLER 202 (398)
Q Consensus 130 ~~~k~~rs~avln~Fi~sI-rvsmP~s~VV--ILTD~~T~~sl---~-~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT 202 (398)
.|=-+.|=..-+--|+.|. +..||+.+|+ |.||......- . +.+..|.+++.+-.=...-+.|.+.....+
T Consensus 39 ~vfatGkY~~f~~~F~~SAEk~Fm~g~~v~YyVFTD~~~~~p~v~lg~~r~~~V~~v~~~~~W~~~sl~Rm~~~~~~~-- 116 (271)
T cd02515 39 TVFAVGKYTEFLERFLESAEKHFMVGYRVIYYIFTDKPAAVPEVELGPGRRLTVLKIAEESRWQDISMRRMKTLADHI-- 116 (271)
T ss_pred EEEEeccHHHHHHHHHHHHHHhccCCCeeEEEEEeCCcccCcccccCCCceeEEEEeccccCCcHHHHHHHHHHHHHH--
Confidence 4445566666899999998 9999999775 89997775432 2 345889999755333334455633333333
Q ss_pred HHHhhccCCCccccEEE-eecCeeeecCch-hhhcc-----CCCceE----EEee-ccC---CCCcCcceEEEEecCCCc
Q 015921 203 RIREHSQGQGHINHYVF-TDSDIAVVDDLG-HIFHD-----YQNFHL----ALTF-RNN---KDQPLNSGFIAVRGTPDG 267 (398)
Q Consensus 203 ~i~e~~~~~~~~~dfIF-iDsDi~VvDdLg-~IFk~-----~PdFHv----ALTf-Rnn---kdqPiNSGfIfVRgt~d~ 267 (398)
+++.+ ..-|||| .|.|+.+.+..| |+..+ +|.+-- .+|| ||. .=.|.+-|=..+.|.--|
T Consensus 117 --~~~~~---~e~DYlF~~dvd~~F~~~ig~E~Lg~lva~lHp~~y~~~~~~fpYERrp~S~AyIp~~eGdfYy~Ga~~G 191 (271)
T cd02515 117 --ADRIG---HEVDYLFCMDVDMVFQGPFGVETLGDSVAQLHPWWYGKPRKQFPYERRPSSAAYIPEGEGDFYYHGAVFG 191 (271)
T ss_pred --HHhhc---ccCCEEEEeeCCceEeecCCHHHhhhhheecChhhhcCCCCCCCCcCCCCccccccCCCCCeEEeeeecC
Confidence 32311 2348886 599999998887 33332 232111 1132 111 134555555555443321
Q ss_pred hhHHHHHHHHHHHHHHHhHHhHhh-----hhcchhhHHHHHhcCCchhhhhcccCccccccccceEEEEeecccccCCCC
Q 015921 268 ISRAKIFLEEVLRVYSSKYMNASR-----MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342 (398)
Q Consensus 268 ~~kA~~FLeeV~eiY~sky~~a~~-----mlGDQlALa~avk~~~~~~~~~f~~~~af~~~v~g~sVlfLPC~~YNwtPp 342 (398)
.+..+||+-...|-..- +..+. .-+|-.-||.-...|++ ++ +| .-.|+|-
T Consensus 192 -G~~~~vl~l~~~c~~~i-~~D~~n~I~A~wHDESHLNkYf~~~Kp------------------~K--iL-SPeY~w~-- 246 (271)
T cd02515 192 -GSVEEVYRLTRACHEGI-LADKANGIEARWHDESHLNKYFLLHKP------------------TK--VL-SPEYLWD-- 246 (271)
T ss_pred -ccHHHHHHHHHHHHHHH-HHHHhCCceEEeecHhHhHHHHhhCCC------------------Ce--ec-ChhhcCC--
Confidence 14567777666665422 22211 34555555555544433 22 23 3578886
Q ss_pred CCCCccccCCcceeEEeeecCch
Q 015921 343 EGAGQFHGMPLDVKVVHFKGSRK 365 (398)
Q Consensus 343 egaGQfhgmP~dvKVlHFKGsRK 365 (398)
| .-+.|.++|++-|..--|
T Consensus 247 e----~~~~p~~~k~~r~~~~~K 265 (271)
T cd02515 247 D----RIGQAAEIRLPRLSWLPK 265 (271)
T ss_pred c----cCCCCcccceeEEEEecc
Confidence 3 245699999988765433
No 26
>PLN03181 glycosyltransferase; Provisional
Probab=86.31 E-value=2.5 Score=44.73 Aligned_cols=100 Identities=20% Similarity=0.343 Sum_probs=63.3
Q ss_pred HHHHHHHHHhhccCCCccccEEEeecCeeeecCchhh-hccCCCceEEE------eeccCCCCc-CcceEEEEecCCCch
Q 015921 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHI-FHDYQNFHLAL------TFRNNKDQP-LNSGFIAVRGTPDGI 268 (398)
Q Consensus 197 i~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~I-Fk~~PdFHvAL------TfRnnkdqP-iNSGfIfVRgt~d~~ 268 (398)
+--|.+.|.++ -..+-|-++|+|++|.+---.| .+.+.+.++-+ .| ..++.- +|.|+-++|.|.
T Consensus 186 ipalRaAM~a~----PeAEWfWWLDsDALIMNp~~sLPl~ry~~~NLvvhg~p~~vy-~~qdw~GlN~GsFLIRNcq--- 257 (453)
T PLN03181 186 LPVVRAAMLAH----PEAEWIWWVDSDAVFTDMDFKLPLHRYRDHNLVVHGWPKLIY-EKRSWTALNAGVFLIRNCQ--- 257 (453)
T ss_pred HHHHHHHHHHC----CCceEEEEecCCceeecCCCCCCHhhcCCccccccCCccccc-ccccccccceeeeEEecCH---
Confidence 44445556656 3567888999999999862222 11244444433 23 234444 999999999999
Q ss_pred hHHHHHHHHHHH------HHH-------HhHHhHhh--hhcchhhHHHHHhcC
Q 015921 269 SRAKIFLEEVLR------VYS-------SKYMNASR--MLGDQLALAWVVKSH 306 (398)
Q Consensus 269 ~kA~~FLeeV~e------iY~-------sky~~a~~--mlGDQlALa~avk~~ 306 (398)
=+..||+.|-. .|. ++ ++.+- ---||.||.+...++
T Consensus 258 -WSl~LLDaWa~Mgp~~p~~~~~G~~l~~~-l~~r~~~eaDDQsaLvyll~~~ 308 (453)
T PLN03181 258 -WSLDFMDAWASMGPASPEYAKWGKILRST-FKDKLFPESDDQSALVYLLYKH 308 (453)
T ss_pred -HHHHHHHHHHhcCCCCchHHHHHHHHHHH-hCCCCCCCccchHHHHHHHHhc
Confidence 99999998875 221 22 22222 235999999887433
No 27
>PF05704 Caps_synth: Capsular polysaccharide synthesis protein; InterPro: IPR008441 This entry consists of several capsular polysaccharide proteins. Capsular polysaccharide (CPS) is a major virulence factor in Streptococcus pneumoniae. This family is often transcribed with putative glycosyl transferases to give rise to bifunctional proteins [].
Probab=83.80 E-value=3.4 Score=40.24 Aligned_cols=119 Identities=20% Similarity=0.281 Sum_probs=71.6
Q ss_pred chhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecC----CccCchhHHHHHHHHHHHHHHHHH-Hhhcc
Q 015921 135 ERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIH----GEYSRDKLMLQRIRSYITFLERRI-REHSQ 209 (398)
Q Consensus 135 ~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~----g~YSR~~LmlQRi~sYi~fLkT~i-~e~~~ 209 (398)
|..=.+...-+.|+|..+|+-+|++||+.|-.- .+.+| ..+..+++ ..+-++.+|--.| .++ +
T Consensus 57 e~aP~~Vk~ci~s~~k~~~~~~Vi~lt~~Ni~~--------Yv~~P~~i~~k~~~g~i---~~a~~SDilR~~LL~~y-G 124 (276)
T PF05704_consen 57 ENAPEIVKKCINSWRKNAPDYEVILLTEDNIKD--------YVDIPDFILEKYEKGKI---SPAHFSDILRLALLYKY-G 124 (276)
T ss_pred cccCHHHHHHHHHHHHHCCCCeEEEEChHHHHH--------HcCCchhHHHHHHcCCC---chhHHHHHHHHHHHHHc-C
Confidence 444567888999999999999999999865331 22333 22233222 2234577776664 555 2
Q ss_pred CCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccC-CCCc-----CcceEEEEecCCCchhHHHHHHHHHHHHH
Q 015921 210 GQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN-KDQP-----LNSGFIAVRGTPDGISRAKIFLEEVLRVY 282 (398)
Q Consensus 210 ~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnn-kdqP-----iNSGfIfVRgt~d~~~kA~~FLeeV~eiY 282 (398)
+ |.+|+++.++.+|.+...+-+=|-+ .+.. ...+ ..++||+.+... .|++.|++++
T Consensus 125 G-------vWiDatv~~t~~l~~~~~~~~ff~~---~~~~~~~~~~~~~~w~~~fi~a~~~n-------~~~~~~~~~~ 186 (276)
T PF05704_consen 125 G-------VWIDATVYLTKPLDDEIFDSDFFSF---SRPDKDYNPISISSWTNFFIAAKKGN-------PFIKFWRDLL 186 (276)
T ss_pred c-------EEeCCceEECCchhHHHhcCCeeEE---eccCcCcccchHHHhHhhheeECCCC-------HHHHHHHHHH
Confidence 2 6899999999999877664222222 1111 2222 244577775555 3555555554
No 28
>PF03314 DUF273: Protein of unknown function, DUF273; InterPro: IPR004988 This is a family of proteins of unknown function.
Probab=82.04 E-value=2.1 Score=41.62 Aligned_cols=100 Identities=22% Similarity=0.207 Sum_probs=62.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCC-CcCcceEEEEecC
Q 015921 186 DKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD-QPLNSGFIAVRGT 264 (398)
Q Consensus 186 ~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkd-qPiNSGfIfVRgt 264 (398)
...|..| =--+...|++. +-.+|+|.||.||.-=-.|.+. -|=.+++||-|-.- .-+..|-=+||-|
T Consensus 25 kd~fFrR----HCvva~~L~~~-------~~vlflDaDigVvNp~~~iEef-id~~~Di~fydR~~n~Ei~agsYlvkNT 92 (222)
T PF03314_consen 25 KDKFFRR----HCVVAKILPEY-------DWVLFLDADIGVVNPNRRIEEF-IDEGYDIIFYDRFFNWEIAAGSYLVKNT 92 (222)
T ss_pred hhHHHHH----HHHHHHHhccC-------CEEEEEcCCceeecCcccHHHh-cCCCCcEEEEecccchhhhhccceeeCC
Confidence 4567777 33333344433 4789999999999766555544 34455556433232 3488999999999
Q ss_pred CCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHH
Q 015921 265 PDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVV 303 (398)
Q Consensus 265 ~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~av 303 (398)
+ .|++||++|.+ |..+ +-.+--..|--||--..
T Consensus 93 ~----~~~~fl~~~a~-~E~~-lP~sfhGtDNGAlH~~L 125 (222)
T PF03314_consen 93 E----YSRDFLKEWAD-YEFK-LPNSFHGTDNGALHIFL 125 (222)
T ss_pred H----HHHHHHHHHhh-hCcc-CCCccccCccHHHHHHH
Confidence 9 99999999976 3333 21121245666665544
No 29
>cd01833 XynB_like SGNH_hydrolase subfamily, similar to Ruminococcus flavefaciens XynB. Most likely a secreted hydrolase with xylanase activity. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=73.81 E-value=24 Score=29.37 Aligned_cols=74 Identities=12% Similarity=0.136 Sum_probs=50.8
Q ss_pred ccchhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCC
Q 015921 133 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQG 212 (398)
Q Consensus 133 k~~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~ 212 (398)
..++-..=+..++..+|...|++.+++.+=+.+... + ..+++..|-+.+++..+++.+.
T Consensus 58 ~~~~~~~~~~~~i~~i~~~~p~~~ii~~~~~p~~~~----------------~---~~~~~~~~n~~l~~~~~~~~~~-- 116 (157)
T cd01833 58 DPDTAPDRLRALIDQMRAANPDVKIIVATLIPTTDA----------------S---GNARIAEYNAAIPGVVADLRTA-- 116 (157)
T ss_pred CHHHHHHHHHHHHHHHHHhCCCeEEEEEeCCCCCCc----------------c---hhHHHHHHHHHHHHHHHHHhcC--
Confidence 355666778899999999999999988764332211 1 0456677788888887776321
Q ss_pred ccccEEEeecCeeeec
Q 015921 213 HINHYVFTDSDIAVVD 228 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvD 228 (398)
..++.|+|......+
T Consensus 117 -~~~v~~vd~~~~~~~ 131 (157)
T cd01833 117 -GSPVVLVDMSTGYTT 131 (157)
T ss_pred -CCCEEEEecCCCCCC
Confidence 357889998776654
No 30
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=62.05 E-value=25 Score=30.62 Aligned_cols=78 Identities=13% Similarity=0.188 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHhhCCCccEEEEeCCCCCCCCC---------CC-ceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhc
Q 015921 139 AILNVFINFIQVTMPKSDVFILTDPASDLSMP---------RK-GVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHS 208 (398)
Q Consensus 139 avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~---------~~-~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~ 208 (398)
.+..+.-+.++...|+..|+|+.|..++-+.. +. ++++++.+.+...... .+=+...++.-
T Consensus 15 ~l~~~l~sl~~~~~~~~~v~vvd~~~~~~~~~~~~~~~~~~~~~~v~vi~~~~~~g~~~k--------~~a~n~~~~~~- 85 (228)
T PF13641_consen 15 VLRRCLESLLAQDYPRLEVVVVDDGSDDETAEILRALAARYPRVRVRVIRRPRNPGPGGK--------ARALNEALAAA- 85 (228)
T ss_dssp HHHHHHHHHTTSHHHTEEEEEEEE-SSS-GCTTHHHHHHTTGG-GEEEEE----HHHHHH--------HHHHHHHHHH--
T ss_pred HHHHHHHHHHcCCCCCeEEEEEECCCChHHHHHHHHHHHHcCCCceEEeecCCCCCcchH--------HHHHHHHHHhc-
Confidence 44444444445445999999998776654332 11 4566666543321001 11111122221
Q ss_pred cCCCccccEEEeecCeeeecC
Q 015921 209 QGQGHINHYVFTDSDIAVVDD 229 (398)
Q Consensus 209 ~~~~~~~dfIFiDsDi~VvDd 229 (398)
..+.++|+|.|+++-.+
T Consensus 86 ----~~d~i~~lD~D~~~~p~ 102 (228)
T PF13641_consen 86 ----RGDYILFLDDDTVLDPD 102 (228)
T ss_dssp ------SEEEEE-SSEEE-CH
T ss_pred ----CCCEEEEECCCcEECHH
Confidence 25589999999998443
No 31
>PF04488 Gly_transf_sug: Glycosyltransferase sugar-binding region containing DXD motif ; InterPro: IPR007577 This entry represents those sugar-binding regions of glycosyltransferases that contain a DXD motif. The DXD motif is a short conserved motif found in many families of glycosyltransferases, which add a range of different sugars to other sugars, phosphates and proteins. DXD-containing glycosyltransferases all use nucleoside diphosphate sugars as donors and require divalent cations, usually manganese. The DXD motif is expected to play a carbohydrate binding role in sugar-nucleoside diphosphate and manganese dependent glycosyltransferases [].
Probab=57.08 E-value=6.1 Score=32.23 Aligned_cols=23 Identities=17% Similarity=0.176 Sum_probs=20.5
Q ss_pred HHHHHHHhhCCCccEEEEeCCCC
Q 015921 143 VFINFIQVTMPKSDVFILTDPAS 165 (398)
Q Consensus 143 ~Fi~sIrvsmP~s~VVILTD~~T 165 (398)
-.+.+.+..||+.+++++||...
T Consensus 6 ~~i~s~~~~nP~~~~~~~~d~~~ 28 (103)
T PF04488_consen 6 CSIESWARHNPDYEYILWTDESD 28 (103)
T ss_pred HHHHHHHHHCCCCEEEEEECCCc
Confidence 35778899999999999999988
No 32
>COG5603 TRS20 Subunit of TRAPP, an ER-Golgi tethering complex [Cell motility and secretion]
Probab=56.15 E-value=19 Score=32.74 Aligned_cols=35 Identities=26% Similarity=0.458 Sum_probs=27.7
Q ss_pred CcCcceEEEEecCCCchhHHHHHHHHHHHHHHHhHH
Q 015921 252 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM 287 (398)
Q Consensus 252 qPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~ 287 (398)
+|=|--|+|+++.. .-..|+.|||||||+|-++-|
T Consensus 75 ~pSgmkf~~iH~n~-s~~N~rsF~qevHely~ktLm 109 (136)
T COG5603 75 MPSGMKFLFIHQNQ-SRKNARSFLQEVHELYAKTLM 109 (136)
T ss_pred ccCCceEEEEeccc-hhhhHHHHHHHHHHHHHHHhh
Confidence 66677789998665 234789999999999987765
No 33
>PLN03182 xyloglucan 6-xylosyltransferase; Provisional
Probab=54.82 E-value=39 Score=35.93 Aligned_cols=98 Identities=19% Similarity=0.330 Sum_probs=64.2
Q ss_pred HHHHHHHHHhhccCCCccccEEEeecCeeeecCchhh-hccCCCceEEEe------eccCCC-CcCcceEEEEecCCCch
Q 015921 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHI-FHDYQNFHLALT------FRNNKD-QPLNSGFIAVRGTPDGI 268 (398)
Q Consensus 197 i~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~I-Fk~~PdFHvALT------fRnnkd-qPiNSGfIfVRgt~d~~ 268 (398)
+-+|.+.|.++ ...+-|-.+|+|++|+|---++ -+.+.+.|+-++ | ..++ .=+|.|+-++|-|.
T Consensus 185 lpaLR~aM~~~----PeaEWiWWLDsDALImNmsfelPlery~~~NlVihg~~~~l~-~~kdW~GLNtGsFLIRNcq--- 256 (429)
T PLN03182 185 LPLLRKLMLAH----PEVEWIWWMDSDALFTDMTFEIPLEKYEGYNLVIHGWDELVY-DQKSWIGLNTGSFLIRNCQ--- 256 (429)
T ss_pred HHHHHHHHHHC----CCceEEEEecCCceeecCCCCCCHhHcCCcCeeeccchhhhe-eccccCccceeeEEEEcCH---
Confidence 56677777766 4667788899999999832221 112445554442 3 2333 45999999999999
Q ss_pred hHHHHHHHHHHHHHH-------------HhHHhHhh--hhcchhhHHHHHh
Q 015921 269 SRAKIFLEEVLRVYS-------------SKYMNASR--MLGDQLALAWVVK 304 (398)
Q Consensus 269 ~kA~~FLeeV~eiY~-------------sky~~a~~--mlGDQlALa~avk 304 (398)
=+..||..|..-=+ ++ ++.+- -..||.||.+...
T Consensus 257 -WSldlLDaWa~mgp~~~~~~~~g~~l~~~-l~~rp~~eaDDQSAlvyLl~ 305 (429)
T PLN03182 257 -WSLDLLDAWAPMGPKGPIRDEAGKILTAE-LKGRPAFEADDQSALVYLLL 305 (429)
T ss_pred -HHHHHHHHHHhcCCCCchhhhHHHHHHHh-hcCCCCCCcccHHHHHHHHH
Confidence 99999988754221 22 11111 2579999999984
No 34
>cd01841 NnaC_like NnaC (CMP-NeuNAc synthetase) _like subfamily of SGNH_hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles two of the three components of typical Ser-His-Asp(Glu) triad from other serine hydrolases. E. coli NnaC appears to be involved in polysaccharide synthesis.
Probab=54.59 E-value=42 Score=28.49 Aligned_cols=75 Identities=15% Similarity=0.174 Sum_probs=50.5
Q ss_pred ccchhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCC
Q 015921 133 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQG 212 (398)
Q Consensus 133 k~~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~ 212 (398)
..++-..=+..++..++...|+++|++++=+-.... . . . ...+.++++.|-+.+++..+++
T Consensus 69 ~~~~~~~~~~~l~~~~~~~~p~~~vi~~~~~p~~~~---~--~----~-----~~~~~~~~~~~n~~l~~~a~~~----- 129 (174)
T cd01841 69 SSNQFIKWYRDIIEQIREEFPNTKIYLLSVLPVLEE---D--E----I-----KTRSNTRIQRLNDAIKELAPEL----- 129 (174)
T ss_pred CHHHHHHHHHHHHHHHHHHCCCCEEEEEeeCCcCcc---c--c----c-----ccCCHHHHHHHHHHHHHHHHHC-----
Confidence 456677778889999999999999999874311100 0 0 0 1123578888888888876666
Q ss_pred ccccEEEeecCeeeecC
Q 015921 213 HINHYVFTDSDIAVVDD 229 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvDd 229 (398)
.+.|+|......+.
T Consensus 130 ---~~~~id~~~~~~~~ 143 (174)
T cd01841 130 ---GVTFIDLNDVLVDE 143 (174)
T ss_pred ---CCEEEEcHHHHcCC
Confidence 47788877665444
No 35
>PF04765 DUF616: Protein of unknown function (DUF616); InterPro: IPR006852 The entry represents a protein of unknown function. The function of is unknown although a number of the members are thought to be glycosyltransferases.
Probab=52.50 E-value=30 Score=35.05 Aligned_cols=129 Identities=18% Similarity=0.292 Sum_probs=76.8
Q ss_pred CCccEEEEeCCCCCCCCCC-----------CceEEEecCCc-cCchhHHHHHHHHHHHHHHHHH-HhhccCCCccccEEE
Q 015921 153 PKSDVFILTDPASDLSMPR-----------KGVTIYPIHGE-YSRDKLMLQRIRSYITFLERRI-REHSQGQGHINHYVF 219 (398)
Q Consensus 153 P~s~VVILTD~~T~~sl~~-----------~~~~v~pi~g~-YSR~~LmlQRi~sYi~fLkT~i-~e~~~~~~~~~dfIF 219 (398)
.++--++.||..|..++.. +.=+++.+... |+ +.-|.+| |.-+|-+++ +++ .--|.
T Consensus 90 ~~vcf~mF~D~~t~~~l~~~~~~~~~~~~ig~WrIv~v~~lp~~-d~rr~~r---~~K~lpHrlfp~y-------~ySIW 158 (305)
T PF04765_consen 90 KNVCFFMFVDEETLKSLESEGHIPDENKKIGIWRIVVVKNLPYD-DPRRNGR---IPKLLPHRLFPNY-------DYSIW 158 (305)
T ss_pred cCccEEEEEehhhHHHHHhcCCccccccccCceEEEEecCCCCc-chhhcCc---ccceeccccCCCC-------ceEEE
Confidence 4677899999999754432 11225555322 33 3334444 233333332 333 23488
Q ss_pred eecCeeeecCchhhhccC---CCceEEE--------------------------------eeccCCCCc--------Cc-
Q 015921 220 TDSDIAVVDDLGHIFHDY---QNFHLAL--------------------------------TFRNNKDQP--------LN- 255 (398)
Q Consensus 220 iDsDi~VvDdLg~IFk~~---PdFHvAL--------------------------------TfRnnkdqP--------iN- 255 (398)
||+=+-++.|...+.+.+ .+..+|+ +|+ +.+.| +.
T Consensus 159 ID~ki~L~~Dp~~lie~~l~~~~~~~Ai~~H~~R~cvyeEa~a~~~~~k~~~~~I~~Qm~~Y~-~eGlp~~s~~k~~l~s 237 (305)
T PF04765_consen 159 IDGKIQLIVDPLLLIERFLWRKNADIAISKHPERNCVYEEAEACKRLGKYDPERIDEQMEFYK-QEGLPPWSPAKLPLPS 237 (305)
T ss_pred EeeeEEEecCHHHHHHHHHhcCCCcEEEeCCCCcccHHHHHHHHHHhcCCChHHHHHHHHHHH-HcCCCccccccccccc
Confidence 999999999988888874 5677888 344 22333 32
Q ss_pred ----ceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcchhhHHHHH
Q 015921 256 ----SGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVV 303 (398)
Q Consensus 256 ----SGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGDQlALa~av 303 (398)
+++|+=++++ .+..|-..|-+-+.. | ---|||+|+++.
T Consensus 238 ~v~E~~iIiR~H~~----~~nlf~clWfnEv~r-f-----s~RDQLSF~Yv~ 279 (305)
T PF04765_consen 238 DVPEGNIIIRKHNP----MSNLFMCLWFNEVER-F-----SPRDQLSFPYVL 279 (305)
T ss_pred CCccceEEEecCCc----hhHHHHHHHHHHHhc-C-----CCcccchHHHHH
Confidence 4677766666 666676665444321 2 125999999997
No 36
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=52.13 E-value=60 Score=30.36 Aligned_cols=75 Identities=20% Similarity=0.101 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHhhCC-Cc-cEEEEeCCCCCCCCC-----------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHH
Q 015921 139 AILNVFINFIQVTMP-KS-DVFILTDPASDLSMP-----------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIR 205 (398)
Q Consensus 139 avln~Fi~sIrvsmP-~s-~VVILTD~~T~~sl~-----------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~ 205 (398)
..|-..+.||....+ .. .=||+.|++|..... ..+++++..+.+.... ..| + .-++
T Consensus 12 ~~l~~~l~Sl~~~~~~~~~~EIIvVDd~S~d~t~~~~~~~~~~~~~~~v~vi~~~~n~G~~---~a~--N------~g~~ 80 (299)
T cd02510 12 STLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKREGLI---RAR--I------AGAR 80 (299)
T ss_pred HHHHHHHHHHHhcCchhcCCEEEEEECCCCchHHHHHHHHHHhhcCCcEEEEEcCCCCCHH---HHH--H------HHHH
Confidence 445555777755444 32 256667776654321 1257777666443321 111 1 1111
Q ss_pred hhccCCCccccEEEeecCeeeecC
Q 015921 206 EHSQGQGHINHYVFTDSDIAVVDD 229 (398)
Q Consensus 206 e~~~~~~~~~dfIFiDsDi~VvDd 229 (398)
+| .-+-++|+|+|+.+..+
T Consensus 81 -~A----~gd~i~fLD~D~~~~~~ 99 (299)
T cd02510 81 -AA----TGDVLVFLDSHCEVNVG 99 (299)
T ss_pred -Hc----cCCEEEEEeCCcccCcc
Confidence 11 23578899999998544
No 37
>PRK15383 type III secretion system protein; Provisional
Probab=49.79 E-value=21 Score=36.21 Aligned_cols=64 Identities=25% Similarity=0.389 Sum_probs=45.6
Q ss_pred ccccEEEeecCeeeecCchhhhccCCCceEEEe-eccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHh
Q 015921 213 HINHYVFTDSDIAVVDDLGHIFHDYQNFHLALT-FRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSK 285 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALT-fRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sk 285 (398)
.-+--|++|+||+++|.||-++- ||. +|+. -|++....+--|.|+| .| .-+--|.+=+++-.+|
T Consensus 218 ~~~GCIYLD~DMilT~KLG~ly~--PDG-IavhV~r~~~~~slENg~I~V--nR----snHPALl~GL~iMhsK 282 (335)
T PRK15383 218 PGGGCIYLDADMLLTDKLGTLYL--PDG-IAIHVSRKDNHVSLENGIIAV--NR----SEHPALIKGLEIMHSK 282 (335)
T ss_pred CCCceEEeecceeeecccccEEc--CCc-eEEEEEecCCceecccceEEE--cc----CCCHHHHhhhHHhhcC
Confidence 34567999999999999999874 454 5555 5678888899999999 55 3344444445554444
No 38
>PRK15382 non-LEE encoded effector protein NleB; Provisional
Probab=49.56 E-value=22 Score=36.09 Aligned_cols=65 Identities=23% Similarity=0.352 Sum_probs=46.1
Q ss_pred CccccEEEeecCeeeecCchhhhccCCCceEEEe-eccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHh
Q 015921 212 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALT-FRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSK 285 (398)
Q Consensus 212 ~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALT-fRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sk 285 (398)
+.-+--|++|+||+++|.||-++- ||. +|+. -|++....+--|.|+| .| .-+--|-+=+++-.+|
T Consensus 209 s~~~GCIYLD~DMilT~KLG~ly~--PDG-IavhV~r~~~~~slENg~I~V--nR----snHPALl~GL~iMhsK 274 (326)
T PRK15382 209 SPCEGCIYLDADMIITDKLGVLYA--PDG-IAVHVDCNDDSKSLENGAIVV--NR----SNHPALLAGLDIMKSK 274 (326)
T ss_pred CCCCceEEeecceeeecccccEEc--CCc-eEEEEEecCCccccccceEEE--cc----CCCHHHHhhhHHhhcC
Confidence 344567999999999999999874 454 5555 5778888899999999 55 3344444445554444
No 39
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=47.89 E-value=2e+02 Score=25.55 Aligned_cols=78 Identities=17% Similarity=0.280 Sum_probs=43.0
Q ss_pred HHHHHH-HhhCCC--ccEEEEeCCCCCCCCC------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 143 VFINFI-QVTMPK--SDVFILTDPASDLSMP------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 143 ~Fi~sI-rvsmP~--s~VVILTD~~T~~sl~------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
..+.|| +..+|. .++||+-|..++.+.. .+++.++..+.+..+..-+. .-++.. .
T Consensus 46 ~~l~si~~q~~~~~~~eiivvdd~s~d~t~~~~~~~~~~~v~~i~~~~~~g~~~a~n-----------~gi~~a-----~ 109 (251)
T cd06439 46 AKLENLLALDYPRDRLEIIVVSDGSTDGTAEIAREYADKGVKLLRFPERRGKAAALN-----------RALALA-----T 109 (251)
T ss_pred HHHHHHHhCcCCCCcEEEEEEECCCCccHHHHHHHHhhCcEEEEEcCCCCChHHHHH-----------HHHHHc-----C
Confidence 334444 446666 5777776665543322 12467777776655522111 112211 1
Q ss_pred cccEEEeecCeeeecC-chhhhcc
Q 015921 214 INHYVFTDSDIAVVDD-LGHIFHD 236 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd-Lg~IFk~ 236 (398)
.+.++|+|+|+....+ |..+.+.
T Consensus 110 ~d~i~~lD~D~~~~~~~l~~l~~~ 133 (251)
T cd06439 110 GEIVVFTDANALLDPDALRLLVRH 133 (251)
T ss_pred CCEEEEEccccCcCHHHHHHHHHH
Confidence 2678999999999765 4444444
No 40
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=47.03 E-value=1.1e+02 Score=26.45 Aligned_cols=24 Identities=17% Similarity=0.325 Sum_probs=16.3
Q ss_pred HHHH-HhhCCCccEEEEeCCCCCCC
Q 015921 145 INFI-QVTMPKSDVFILTDPASDLS 168 (398)
Q Consensus 145 i~sI-rvsmP~s~VVILTD~~T~~s 168 (398)
+.|| ...+|+..+||..|..|+-+
T Consensus 20 L~sl~~q~~~~~eiivVdd~s~d~t 44 (196)
T cd02520 20 LESFFQQDYPKYEILFCVQDEDDPA 44 (196)
T ss_pred HHHHHhccCCCeEEEEEeCCCcchH
Confidence 4444 34578888888888777654
No 41
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=46.99 E-value=1.4e+02 Score=23.48 Aligned_cols=79 Identities=13% Similarity=0.159 Sum_probs=40.6
Q ss_pred HHHHHHHHh-hCCCccEEEEeCCCCCCCC--------CCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCC
Q 015921 142 NVFINFIQV-TMPKSDVFILTDPASDLSM--------PRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQG 212 (398)
Q Consensus 142 n~Fi~sIrv-smP~s~VVILTD~~T~~sl--------~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~ 212 (398)
.-.+.|++. ..+..+|||+-|..++-+. ...+++++..+.+.. +...+ -.-++..
T Consensus 14 ~~~l~sl~~q~~~~~eiivvdd~s~d~~~~~~~~~~~~~~~i~~i~~~~n~g---~~~~~----n~~~~~a--------- 77 (169)
T PF00535_consen 14 ERTLESLLKQTDPDFEIIVVDDGSTDETEEILEEYAESDPNIRYIRNPENLG---FSAAR----NRGIKHA--------- 77 (169)
T ss_dssp HHHHHHHHHHSGCEEEEEEEECS-SSSHHHHHHHHHCCSTTEEEEEHCCCSH---HHHHH----HHHHHH----------
T ss_pred HHHHHHHhhccCCCEEEEEecccccccccccccccccccccccccccccccc---ccccc----ccccccc---------
Confidence 344455544 4666666555444433322 134678888886552 22222 2222222
Q ss_pred ccccEEEeecCeeeecC-chhhhcc
Q 015921 213 HINHYVFTDSDIAVVDD-LGHIFHD 236 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvDd-Lg~IFk~ 236 (398)
..+=++|+|.|..+..+ |..+.+.
T Consensus 78 ~~~~i~~ld~D~~~~~~~l~~l~~~ 102 (169)
T PF00535_consen 78 KGEYILFLDDDDIISPDWLEELVEA 102 (169)
T ss_dssp -SSEEEEEETTEEE-TTHHHHHHHH
T ss_pred ceeEEEEeCCCceEcHHHHHHHHHH
Confidence 22267899999999988 4444443
No 42
>cd01834 SGNH_hydrolase_like_2 SGNH_hydrolase subfamily. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=46.73 E-value=1e+02 Score=25.89 Aligned_cols=76 Identities=12% Similarity=0.142 Sum_probs=46.3
Q ss_pred cchhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 134 TERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 134 ~~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
.+....=+..++..+|...|++.||++|=+-...... +-| -.....++++.|.++|++-.++.
T Consensus 84 ~~~~~~~l~~~v~~~~~~~~~~~ii~~~p~~~~~~~~-------~~~----~~~~~~~~~~~~n~~l~~~a~~~------ 146 (191)
T cd01834 84 LEKFKTNLRRLIDRLKNKESAPRIVLVSPIAYEANED-------PLP----DGAEYNANLAAYADAVRELAAEN------ 146 (191)
T ss_pred HHHHHHHHHHHHHHHHcccCCCcEEEECCcccCCCCC-------CCC----ChHHHHHHHHHHHHHHHHHHHHc------
Confidence 4555566777888888789999988886332211100 001 12345667777888887765554
Q ss_pred cccEEEeecCeeeec
Q 015921 214 INHYVFTDSDIAVVD 228 (398)
Q Consensus 214 ~~dfIFiDsDi~VvD 228 (398)
+..|+|...+..+
T Consensus 147 --~~~~iD~~~~~~~ 159 (191)
T cd01834 147 --GVAFVDLFTPMKE 159 (191)
T ss_pred --CCeEEecHHHHHH
Confidence 5778876655443
No 43
>PRK15384 type III secretion system protein; Provisional
Probab=46.57 E-value=24 Score=35.79 Aligned_cols=64 Identities=22% Similarity=0.347 Sum_probs=45.5
Q ss_pred ccccEEEeecCeeeecCchhhhccCCCceEEEe-eccCCCCcCcceEEEEecCCCchhHHHHHHHHHHHHHHHh
Q 015921 213 HINHYVFTDSDIAVVDDLGHIFHDYQNFHLALT-FRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSK 285 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALT-fRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~eiY~sk 285 (398)
.-+--|++|+||+++|.||-++- ||. +|+. -|++....+--|.|+| .| .-+--|.+=+++-.+|
T Consensus 215 ~~~GCIYLDaDMilT~KLG~ly~--PDG-IavhV~r~~~~~slENg~I~V--nR----snHPALl~GL~iMhsK 279 (336)
T PRK15384 215 TNSGCIYLDADMIITEKLGGIYI--PDG-IAVHVERIDGRASMENGIIAV--DR----NNHPALLAGLEIMHTK 279 (336)
T ss_pred CCCceEEeeccceeecccccEEc--CCc-eEEEEEecCCceecccceEEE--cc----CCCHHHHhhHHHhhcC
Confidence 44567999999999999999874 454 5555 5678888899999999 55 2344444445554444
No 44
>cd06182 CYPOR_like NADPH cytochrome p450 reductase (CYPOR) serves as an electron donor in several oxygenase systems and is a component of nitric oxide synthases and methionine synthase reductases. CYPOR transfers two electrons from NADPH to the heme of cytochrome p450 via FAD and FMN. CYPOR has a C-terminal ferredoxin reducatase (FNR)- like FAD and NAD binding module, an FMN-binding domain, and an additional conecting domain (inserted within the FAD binding region) that orients the FNR and FMN binding domains. Ferredoxin-NADP+ (oxido)reductase is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of these flavoproteins (i.e. having a non-covalently bound FAD as a prosthetic group) are present in chloroplasts, mitochondria, and bacteria and participate in a wide variety of redox metabolic pathways. The C-terminal domain contains most of the NADP(H) binding residues and the N-t
Probab=45.61 E-value=1.7e+02 Score=27.96 Aligned_cols=27 Identities=22% Similarity=0.261 Sum_probs=22.1
Q ss_pred ecCeeeecCchhhhccCCCceEEEeec
Q 015921 221 DSDIAVVDDLGHIFHDYQNFHLALTFR 247 (398)
Q Consensus 221 DsDi~VvDdLg~IFk~~PdFHvALTfR 247 (398)
|.|++..|.|.++-+..++|++-+++.
T Consensus 160 ~~d~~~~del~~~~~~~~~~~~~~~~S 186 (267)
T cd06182 160 ASDYLYREELQEALKDGALTRLDVAFS 186 (267)
T ss_pred cccccHHHHHHHHHhCCCcceEEEEEc
Confidence 589999999999988888887777653
No 45
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=44.21 E-value=1.6e+02 Score=23.59 Aligned_cols=79 Identities=16% Similarity=0.181 Sum_probs=42.5
Q ss_pred HHHHHHHHHHh-hCCCccEEEEeCCCCCCCCC-----CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 140 ILNVFINFIQV-TMPKSDVFILTDPASDLSMP-----RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 140 vln~Fi~sIrv-smP~s~VVILTD~~T~~sl~-----~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
.|.-.+.||.. ..|..+++|.-|..++-+.. ..+++++..+.+..... =+...++.. .
T Consensus 11 ~l~~~l~sl~~~~~~~~~iiivdd~s~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----------a~n~~~~~~-----~ 74 (166)
T cd04186 11 YLKACLDSLLAQTYPDFEVIVVDNASTDGSVELLRELFPEVRLIRNGENLGFGA-----------GNNQGIREA-----K 74 (166)
T ss_pred HHHHHHHHHHhccCCCeEEEEEECCCCchHHHHHHHhCCCeEEEecCCCcChHH-----------HhhHHHhhC-----C
Confidence 34445556643 34677777776665543321 12355665554444311 112223322 4
Q ss_pred cccEEEeecCeeeecC-chhhh
Q 015921 214 INHYVFTDSDIAVVDD-LGHIF 234 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd-Lg~IF 234 (398)
.+.++|+|.|.....+ |..+-
T Consensus 75 ~~~i~~~D~D~~~~~~~l~~~~ 96 (166)
T cd04186 75 GDYVLLLNPDTVVEPGALLELL 96 (166)
T ss_pred CCEEEEECCCcEECccHHHHHH
Confidence 5689999999998776 44443
No 46
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=43.97 E-value=1.4e+02 Score=24.81 Aligned_cols=21 Identities=14% Similarity=0.284 Sum_probs=15.5
Q ss_pred ccEEEeecCeeeecC-chhhhc
Q 015921 215 NHYVFTDSDIAVVDD-LGHIFH 235 (398)
Q Consensus 215 ~dfIFiDsDi~VvDd-Lg~IFk 235 (398)
+.++|+|+|..+..+ |..+..
T Consensus 81 d~i~~lD~D~~~~~~~l~~l~~ 102 (185)
T cd04179 81 DIVVTMDADLQHPPEDIPKLLE 102 (185)
T ss_pred CEEEEEeCCCCCCHHHHHHHHH
Confidence 689999999887666 444444
No 47
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=41.03 E-value=1.2e+02 Score=25.98 Aligned_cols=16 Identities=31% Similarity=0.287 Sum_probs=12.7
Q ss_pred cccEEEeecCeeeecC
Q 015921 214 INHYVFTDSDIAVVDD 229 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd 229 (398)
.+-++|+|.|..+..|
T Consensus 83 ~d~i~~~D~D~~~~~~ 98 (229)
T cd04192 83 GDWIVTTDADCVVPSN 98 (229)
T ss_pred CCEEEEECCCcccCHH
Confidence 4578899999988654
No 48
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=40.83 E-value=1.2e+02 Score=26.37 Aligned_cols=77 Identities=13% Similarity=0.181 Sum_probs=40.7
Q ss_pred HHHHHHHHHhhCCCccEEEEeCCCCCCCCC------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCcc
Q 015921 141 LNVFINFIQVTMPKSDVFILTDPASDLSMP------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHI 214 (398)
Q Consensus 141 ln~Fi~sIrvsmP~s~VVILTD~~T~~sl~------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~ 214 (398)
|--.+.|+... ...||+.|.+|+.+.. ..++.++..+.+.....- +..-++.. .+. ..
T Consensus 13 l~~~l~sl~~q---~~~iivvDn~s~~~~~~~~~~~~~~i~~i~~~~n~G~~~a-----------~N~g~~~a-~~~-~~ 76 (237)
T cd02526 13 LKELLAALAEQ---VDKVVVVDNSSGNDIELRLRLNSEKIELIHLGENLGIAKA-----------LNIGIKAA-LEN-GA 76 (237)
T ss_pred HHHHHHHHhcc---CCEEEEEeCCCCccHHHHhhccCCcEEEEECCCceehHHh-----------hhHHHHHH-HhC-CC
Confidence 34445555433 3467777887764421 235667777655443111 11122221 111 35
Q ss_pred ccEEEeecCeeeecC-chhh
Q 015921 215 NHYVFTDSDIAVVDD-LGHI 233 (398)
Q Consensus 215 ~dfIFiDsDi~VvDd-Lg~I 233 (398)
+.++|+|.|+.+-.+ |..+
T Consensus 77 d~v~~lD~D~~~~~~~l~~l 96 (237)
T cd02526 77 DYVLLFDQDSVPPPDMVEKL 96 (237)
T ss_pred CEEEEECCCCCcCHhHHHHH
Confidence 689999999987644 4444
No 49
>cd01836 FeeA_FeeB_like SGNH_hydrolase subfamily, FeeA, FeeB and similar esterases/lipases. FeeA and FeeB are part of a biosynthetic gene cluster and may participate in the biosynthesis of long-chain N-acyltyrosines by providing saturated and unsaturated fatty acids, which it turn are loaded onto the acyl carrier protein FeeL. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=39.54 E-value=1.5e+02 Score=25.49 Aligned_cols=28 Identities=14% Similarity=0.274 Sum_probs=22.5
Q ss_pred cchhHHHHHHHHHHHHhhCCCccEEEEe
Q 015921 134 TERSMAILNVFINFIQVTMPKSDVFILT 161 (398)
Q Consensus 134 ~~rs~avln~Fi~sIrvsmP~s~VVILT 161 (398)
.+.-..=+..++..||..+|++.|++++
T Consensus 86 ~~~~~~~l~~li~~i~~~~~~~~iiv~~ 113 (191)
T cd01836 86 IARWRKQLAELVDALRAKFPGARVVVTA 113 (191)
T ss_pred HHHHHHHHHHHHHHHHhhCCCCEEEEEC
Confidence 3445566788899999889999999886
No 50
>PRK13889 conjugal transfer relaxase TraA; Provisional
Probab=36.71 E-value=1e+02 Score=35.88 Aligned_cols=87 Identities=17% Similarity=0.198 Sum_probs=50.6
Q ss_pred eEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCc
Q 015921 174 VTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQP 253 (398)
Q Consensus 174 ~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqP 253 (398)
.-.+..|.+.+. -|| ++.+..-++++ .+..-+- +|.+|-.+.++=-..-||+||.+|.|-..+.
T Consensus 87 e~~iALP~EL~~----eq~----~~L~~~f~~~~-----~~~~G~~--ad~aiH~~~~~dg~~NpHaHim~T~R~~~~~- 150 (988)
T PRK13889 87 EVEFAIPREMTQ----AQG----IELARDFVQAE-----FVDRGMI--ADLNVHWDIGEDGMAKPHAHVMLTMRAVDEN- 150 (988)
T ss_pred eeeEeCchhcCH----HHH----HHHHHHHHHHH-----HhcCCce--EEEEeecccccCCCCCCeEEEEeccCccCCC-
Confidence 337889999999 888 44444444333 1111111 4666665443222246999999998854332
Q ss_pred CcceEEEEecCCCchhHHHHHHHHHHHHHH
Q 015921 254 LNSGFIAVRGTPDGISRAKIFLEEVLRVYS 283 (398)
Q Consensus 254 iNSGfIfVRgt~d~~~kA~~FLeeV~eiY~ 283 (398)
|| =..++ --..++++++|-+.|-
T Consensus 151 ---G~--g~K~r--~w~~~~~l~~~R~~Wa 173 (988)
T PRK13889 151 ---GF--GAKVR--DWNRTELVERWRERWA 173 (988)
T ss_pred ---CC--CCccc--ccCCHHHHHHHHHHHH
Confidence 31 11222 2234689999999994
No 51
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=36.65 E-value=1.3e+02 Score=22.87 Aligned_cols=73 Identities=12% Similarity=0.106 Sum_probs=38.2
Q ss_pred HHHHHHHhh-CCCccEEEEeCCCCCCCCCC--------CceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 143 VFINFIQVT-MPKSDVFILTDPASDLSMPR--------KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 143 ~Fi~sIrvs-mP~s~VVILTD~~T~~sl~~--------~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
.-+.++... .+..+++|++|..++..... ..+..++-+++.....- ++ ..++.. .
T Consensus 14 ~~l~s~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~----------~~~~~~-----~ 77 (156)
T cd00761 14 RCLESLLAQTYPNFEVIVVDDGSTDGTLEILEEYAKKDPRVIRVINEENQGLAAA-RN----------AGLKAA-----R 77 (156)
T ss_pred HHHHHHHhCCccceEEEEEeCCCCccHHHHHHHHHhcCCCeEEEEecCCCChHHH-HH----------HHHHHh-----c
Confidence 334444333 36778888888766543321 12334444444444111 11 111111 3
Q ss_pred cccEEEeecCeeeecCch
Q 015921 214 INHYVFTDSDIAVVDDLG 231 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdLg 231 (398)
.+.++++|+|..+..+.-
T Consensus 78 ~d~v~~~d~D~~~~~~~~ 95 (156)
T cd00761 78 GEYILFLDADDLLLPDWL 95 (156)
T ss_pred CCEEEEECCCCccCccHH
Confidence 457899999999876643
No 52
>PRK11204 N-glycosyltransferase; Provisional
Probab=35.93 E-value=1.2e+02 Score=29.94 Aligned_cols=65 Identities=15% Similarity=0.198 Sum_probs=37.6
Q ss_pred HhhCCCccEEEEeCCCCCCCCC--------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEe
Q 015921 149 QVTMPKSDVFILTDPASDLSMP--------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFT 220 (398)
Q Consensus 149 rvsmP~s~VVILTD~~T~~sl~--------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFi 220 (398)
+...|+.+|||+.|..++-+.. ..++++++.+++-.+.+ . +..-++. + ..+-++|+
T Consensus 78 ~q~yp~~eiiVvdD~s~d~t~~~l~~~~~~~~~v~~i~~~~n~Gka~--a---------ln~g~~~-a----~~d~i~~l 141 (420)
T PRK11204 78 ALRYPNYEVIAINDGSSDNTGEILDRLAAQIPRLRVIHLAENQGKAN--A---------LNTGAAA-A----RSEYLVCI 141 (420)
T ss_pred hCCCCCeEEEEEECCCCccHHHHHHHHHHhCCcEEEEEcCCCCCHHH--H---------HHHHHHH-c----CCCEEEEE
Confidence 5667888888887766654332 12466776554443311 1 1112221 1 34578999
Q ss_pred ecCeeeecC
Q 015921 221 DSDIAVVDD 229 (398)
Q Consensus 221 DsDi~VvDd 229 (398)
|+|.++-.|
T Consensus 142 DaD~~~~~d 150 (420)
T PRK11204 142 DGDALLDPD 150 (420)
T ss_pred CCCCCCChh
Confidence 999988665
No 53
>PRK13826 Dtr system oriT relaxase; Provisional
Probab=34.62 E-value=1.2e+02 Score=35.87 Aligned_cols=54 Identities=20% Similarity=0.255 Sum_probs=34.1
Q ss_pred eEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeec
Q 015921 174 VTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 247 (398)
Q Consensus 174 ~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfR 247 (398)
.-.+..|.+.++ -|| +..+..-++++....+..- |.+|-|+ ..-||.|+.+|.|
T Consensus 97 E~~iALP~EL~~----eq~----i~Lvr~fv~~~~~~~Gm~a-------D~aiHd~-----~~NpHaHim~T~R 150 (1102)
T PRK13826 97 DVTIALPLELTA----EQN----IALVRDFVEKHILAKGMVA-------DWVYHDA-----PGNPHVHLMTTLR 150 (1102)
T ss_pred eeeEeCccccCH----HHH----HHHHHHHHHHHHHhCCCeE-------EEEeecC-----CCCCeeeeeeccc
Confidence 347888999999 788 5555444444422112222 4445443 4589999999987
No 54
>PF10465 Inhibitor_I24: PinA peptidase inhibitor ; InterPro: IPR019506 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. PinA inhibits the endopeptidase La. It binds to the La homotetramer but does not interfere with the ATP binding site or the active site of La.
Probab=34.16 E-value=31 Score=31.60 Aligned_cols=18 Identities=33% Similarity=0.615 Sum_probs=14.8
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q 015921 186 DKLMLQRIRSYITFLERRI 204 (398)
Q Consensus 186 ~~LmlQRi~sYi~fLkT~i 204 (398)
+|||.- -+.||+||+++|
T Consensus 122 gnLMQA-AAeYIewLE~ql 139 (140)
T PF10465_consen 122 GNLMQA-AAEYIEWLETQL 139 (140)
T ss_pred hhHHHH-HHHHHHHHHhhc
Confidence 578854 599999999986
No 55
>PF11051 Mannosyl_trans3: Mannosyltransferase putative; InterPro: IPR022751 Alpha-mannosyltransferase is responsible for the addition of residues to the outer chain of core N-linked polysaccharides and to O-linked mannotriose. It is implicated in late Golgi modifications [][][]. The proteins matching this entry are conserved in fungi and also found in some phototrophic organisms.; GO: 0006486 protein glycosylation
Probab=33.99 E-value=36 Score=32.72 Aligned_cols=55 Identities=15% Similarity=0.167 Sum_probs=37.4
Q ss_pred CccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcCcceEEEEecCCCchhHHHHHHHHHHH
Q 015921 212 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLR 280 (398)
Q Consensus 212 ~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPiNSGfIfVRgt~d~~~kA~~FLeeV~e 280 (398)
..-+.+||+|+|.+..-|.+.+|+. +..-..|.+|-||..=.-.++...+-++++
T Consensus 89 ssFeevllLDaD~vpl~~p~~lF~~--------------~~yk~tG~lfw~dpd~~~~~~~~~~~~~~~ 143 (271)
T PF11051_consen 89 SSFEEVLLLDADNVPLVDPEKLFES--------------EEYKKTGALFWRDPDFWKRSTSPFCYEIFG 143 (271)
T ss_pred CCcceEEEEcCCcccccCHHHHhcC--------------ccccccCEEEECCCccccccCChhHhHHhh
Confidence 4567899999999999999999984 233467999995554222333344444444
No 56
>cd01832 SGNH_hydrolase_like_1 Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid. Myxobacterial members of this subfamily have been reported to be involved in adventurous gliding motility.
Probab=33.86 E-value=2.3e+02 Score=24.11 Aligned_cols=65 Identities=15% Similarity=0.169 Sum_probs=40.6
Q ss_pred cchhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 134 TERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 134 ~~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
.++-..=+..+|..|| .|+++||++|=+... + +..+.+ .+.+++..|.+.|++..+++
T Consensus 87 ~~~~~~~~~~~i~~i~--~~~~~vil~~~~~~~-~-----------~~~~~~--~~~~~~~~~n~~l~~~a~~~------ 144 (185)
T cd01832 87 PDTYRADLEEAVRRLR--AAGARVVVFTIPDPA-V-----------LEPFRR--RVRARLAAYNAVIRAVAARY------ 144 (185)
T ss_pred HHHHHHHHHHHHHHHH--hCCCEEEEecCCCcc-c-----------cchhHH--HHHHHHHHHHHHHHHHHHHc------
Confidence 4556667788899998 799999998865440 0 001111 23456666777666665544
Q ss_pred cccEEEeec
Q 015921 214 INHYVFTDS 222 (398)
Q Consensus 214 ~~dfIFiDs 222 (398)
.+.|+|.
T Consensus 145 --~v~~vd~ 151 (185)
T cd01832 145 --GAVHVDL 151 (185)
T ss_pred --CCEEEec
Confidence 5777775
No 57
>TIGR00282 metallophosphoesterase, MG_246/BB_0505 family. A member of this family from Mycoplasma Pneumoniae has been crystallized and described as a novel phosphatase.
Probab=33.83 E-value=55 Score=32.22 Aligned_cols=18 Identities=22% Similarity=0.399 Sum_probs=14.9
Q ss_pred CCCccceEEeceeeeccc
Q 015921 118 SRASNMVTVGNASYSKTE 135 (398)
Q Consensus 118 ~~~~~~v~vg~~~~~k~~ 135 (398)
+.--+++|.|||+++|.|
T Consensus 57 ~~GvDviT~GNH~~Dkge 74 (266)
T TIGR00282 57 QSGVNYITMGNHTWFQKL 74 (266)
T ss_pred hcCCCEEEccchhccCcH
Confidence 344578999999999997
No 58
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=33.80 E-value=1.2e+02 Score=27.85 Aligned_cols=15 Identities=20% Similarity=0.326 Sum_probs=12.5
Q ss_pred ccEEEeecCeeeecC
Q 015921 215 NHYVFTDSDIAVVDD 229 (398)
Q Consensus 215 ~dfIFiDsDi~VvDd 229 (398)
+=++|+|+|+++-.|
T Consensus 75 e~i~~~DaD~~~~~~ 89 (244)
T cd04190 75 EFILLVDADTKFDPD 89 (244)
T ss_pred CEEEEECCCCcCCHh
Confidence 457899999999777
No 59
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=33.75 E-value=2.6e+02 Score=23.50 Aligned_cols=79 Identities=6% Similarity=0.127 Sum_probs=41.3
Q ss_pred HHHHHHHHHHhhCCCccEEEEeCCCCCCCCC---------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccC
Q 015921 140 ILNVFINFIQVTMPKSDVFILTDPASDLSMP---------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQG 210 (398)
Q Consensus 140 vln~Fi~sIrvsmP~s~VVILTD~~T~~sl~---------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~ 210 (398)
+|.++...+....+...|||+ |+++..... ..++.++....++..++-+ ..-+++-
T Consensus 15 ~l~sl~~~~~~~~~~~eiivv-dd~s~d~t~~~~~~~~~~~~~i~~i~~~~n~G~~~a~-----------n~g~~~a--- 79 (181)
T cd04187 15 LYERLKAVLESLGYDYEIIFV-DDGSTDRTLEILRELAARDPRVKVIRLSRNFGQQAAL-----------LAGLDHA--- 79 (181)
T ss_pred HHHHHHHHHHhcCCCeEEEEE-eCCCCccHHHHHHHHHhhCCCEEEEEecCCCCcHHHH-----------HHHHHhc---
Confidence 445554444445666776665 555543311 1235666665555542222 1222222
Q ss_pred CCccccEEEeecCeeeecC-chhhhc
Q 015921 211 QGHINHYVFTDSDIAVVDD-LGHIFH 235 (398)
Q Consensus 211 ~~~~~dfIFiDsDi~VvDd-Lg~IFk 235 (398)
..+-++|+|.|.....+ |..+.+
T Consensus 80 --~~d~i~~~D~D~~~~~~~l~~l~~ 103 (181)
T cd04187 80 --RGDAVITMDADLQDPPELIPEMLA 103 (181)
T ss_pred --CCCEEEEEeCCCCCCHHHHHHHHH
Confidence 22578899999987544 555554
No 60
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=33.65 E-value=1.3e+02 Score=29.81 Aligned_cols=16 Identities=44% Similarity=0.493 Sum_probs=12.7
Q ss_pred cccEEEeecCeeeecC
Q 015921 214 INHYVFTDSDIAVVDD 229 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd 229 (398)
-+-++|+|+|+.+-.|
T Consensus 127 ge~i~~~DaD~~~~p~ 142 (373)
T TIGR03472 127 HDILVIADSDISVGPD 142 (373)
T ss_pred CCEEEEECCCCCcChh
Confidence 4578999999988654
No 61
>KOG1928 consensus Alpha-1,4-N-acetylglucosaminyltransferase [Carbohydrate transport and metabolism]
Probab=33.28 E-value=1.3e+02 Score=31.98 Aligned_cols=119 Identities=19% Similarity=0.190 Sum_probs=65.6
Q ss_pred HHHHHhhCCCccEEEEeCCCCCCCCCCCceE---------EEec------CCccCchhHHHHHHHHHH-----HHHHHH-
Q 015921 145 INFIQVTMPKSDVFILTDPASDLSMPRKGVT---------IYPI------HGEYSRDKLMLQRIRSYI-----TFLERR- 203 (398)
Q Consensus 145 i~sIrvsmP~s~VVILTD~~T~~sl~~~~~~---------v~pi------~g~YSR~~LmlQRi~sYi-----~fLkT~- 203 (398)
|+|+-..||+.-|+||.- |-++.++.... +.++ --.+.-+..+++++++-. .+|.+-
T Consensus 150 IESa~k~hP~~cv~vls~--t~ds~~~~s~~kp~~~~~lsv~~v~~~lp~llk~t~~e~~l~~~k~g~~~~~~~~l~~~l 227 (409)
T KOG1928|consen 150 IESAFKTHPEGCVVVLSK--TMDSPNGYSILKPFLDSGLSVIAVTPDLPFLLKDTPGETWLERWKDGRLDPGKIPLLQNL 227 (409)
T ss_pred hHHHHhhCCCceEEEEEc--cccCCCCccccccHhHhhhhhcccccCchhhHhhCccccHHHHHHhcccCCCcccchhhH
Confidence 678889999999999987 65555554332 2222 122334444556554432 111111
Q ss_pred --------HHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcCcceE-EEEecCCCchhHHHHH
Q 015921 204 --------IREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGF-IAVRGTPDGISRAKIF 274 (398)
Q Consensus 204 --------i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPiNSGf-IfVRgt~d~~~kA~~F 274 (398)
|=++ -=|+.|+|+||.-+|..+=+-.+ ++= + +...--+|.++ +|=++ |.|
T Consensus 228 Sdl~RLA~LyKY--------GGvYLDTDvIvLksl~~l~N~ig-~~~--~--~~~~~~lnnavl~F~k~--------Hpf 286 (409)
T KOG1928|consen 228 SDLSRLALLYKY--------GGVYLDTDVIVLKSLSNLRNVIG-VDP--A--TQAWTRLNNAVLIFDKN--------HPF 286 (409)
T ss_pred HHHHHHHHHHHh--------CCEEeeccEEEeccccccccccc-ccc--h--hhHHHhhcCceeecCCC--------CHH
Confidence 1122 12789999999999988776544 110 0 01122255554 34444 446
Q ss_pred HHHHHHHHHHhH
Q 015921 275 LEEVLRVYSSKY 286 (398)
Q Consensus 275 LeeV~eiY~sky 286 (398)
+-+-++-|..+|
T Consensus 287 l~~cl~eF~~tf 298 (409)
T KOG1928|consen 287 LLECLREFALTY 298 (409)
T ss_pred HHHHHHHHHHhc
Confidence 666666666665
No 62
>TIGR01618 phage_P_loop phage nucleotide-binding protein. This model represents an uncharacterized family of proteins from a number of phage of Gram-positive bacteria. This protein contains a P-loop motif, G/A-X-X-G-X-G-K-T near its amino end. The function of this protein is unknown.
Probab=33.28 E-value=1e+02 Score=29.46 Aligned_cols=64 Identities=16% Similarity=0.180 Sum_probs=39.9
Q ss_pred CCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchh
Q 015921 153 PKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGH 232 (398)
Q Consensus 153 P~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~ 232 (398)
|+-.+++-+|..+..-....++.+.+++...+ ..++..+|+.. ++ -..++|++|+|++++
T Consensus 33 ~~~~~~~~~d~~~~~l~g~~~~~v~~~d~~~~--------~~~~~d~l~~~-~~-----------~~~~ydtVVIDsI~~ 92 (220)
T TIGR01618 33 PGKTLVLSFDMSSKVLIGDENVDIADHDDMPP--------IQAMVEFYVMQ-NI-----------QAVKYDNIVIDNISA 92 (220)
T ss_pred CCCCEEEeccccchhccCCCCCceeecCCCCC--------HHHHHHHHHHH-Hh-----------ccccCCEEEEecHHH
Confidence 77788888888766544334566766643222 33344444311 11 136899999999999
Q ss_pred hhcc
Q 015921 233 IFHD 236 (398)
Q Consensus 233 IFk~ 236 (398)
+.+.
T Consensus 93 l~~~ 96 (220)
T TIGR01618 93 LQNL 96 (220)
T ss_pred HHHH
Confidence 8775
No 63
>PRK10964 ADP-heptose:LPS heptosyl transferase I; Provisional
Probab=33.12 E-value=1.2e+02 Score=28.94 Aligned_cols=43 Identities=7% Similarity=-0.035 Sum_probs=29.5
Q ss_pred HHHHH-HHHHHHHhhCCCccEEEEeCCCCCCCC--CCCceEEEecC
Q 015921 138 MAILN-VFINFIQVTMPKSDVFILTDPASDLSM--PRKGVTIYPIH 180 (398)
Q Consensus 138 ~avln-~Fi~sIrvsmP~s~VVILTD~~T~~sl--~~~~~~v~pi~ 180 (398)
..|+. -.+..||..+|++++.+||.+.+.--+ .+.--++++++
T Consensus 13 D~v~~~p~~~~lk~~~P~a~I~~l~~~~~~~l~~~~p~vd~vi~~~ 58 (322)
T PRK10964 13 DVLHTLPALTDAQQAIPGIQFDWVVEEGFAQIPSWHPAVDRVIPVA 58 (322)
T ss_pred HHHhHHHHHHHHHHhCCCCEEEEEECHHHHHHHhcCCCccEEEeec
Confidence 34444 577899999999999999999876333 33322356554
No 64
>PF13774 Longin: Regulated-SNARE-like domain; PDB: 1IOU_A 3BW6_A 1H8M_A 3EGX_C 2NUP_C 3EGD_C 2NUT_C 3KYQ_A 1IFQ_B 2VX8_D ....
Probab=33.06 E-value=41 Score=26.39 Aligned_cols=32 Identities=25% Similarity=0.363 Sum_probs=25.4
Q ss_pred cceEEEEecCCCchhH--HHHHHHHHHHHHHHhH
Q 015921 255 NSGFIAVRGTPDGISR--AKIFLEEVLRVYSSKY 286 (398)
Q Consensus 255 NSGfIfVRgt~d~~~k--A~~FLeeV~eiY~sky 286 (398)
+.|++++.=|.+.+++ |-.||+++.+.|.++|
T Consensus 31 ~~~i~~~citd~~~~~r~aF~fL~~i~~~F~~~~ 64 (83)
T PF13774_consen 31 EDGIAYLCITDKSYPKRVAFAFLEEIKQEFIQTY 64 (83)
T ss_dssp ETTEEEEEEEETTS-HHHHHHHHHHHHHHHHHHC
T ss_pred cCCeEEEEEEcCCCCcchHHHHHHHHHHHHHHHc
Confidence 5677777767776666 7788999999999997
No 65
>cd04506 SGNH_hydrolase_YpmR_like Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid. This subfamily contains sequences similar to Bacillus YpmR.
Probab=33.00 E-value=1.8e+02 Score=25.38 Aligned_cols=26 Identities=15% Similarity=0.235 Sum_probs=21.2
Q ss_pred hhHHHHHHHHHHHHhhCCCccEEEEe
Q 015921 136 RSMAILNVFINFIQVTMPKSDVFILT 161 (398)
Q Consensus 136 rs~avln~Fi~sIrvsmP~s~VVILT 161 (398)
+--.=|..+|..||..+|++.|++++
T Consensus 103 ~~~~~l~~~i~~ir~~~p~~~Ivv~~ 128 (204)
T cd04506 103 TYQNNLKKIFKEIRKLNPDAPIFLVG 128 (204)
T ss_pred HHHHHHHHHHHHHHHHCCCCeEEEEe
Confidence 34456788999999999999988875
No 66
>cd01844 SGNH_hydrolase_like_6 SGNH_hydrolase subfamily. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=32.13 E-value=2.6e+02 Score=24.12 Aligned_cols=29 Identities=10% Similarity=0.211 Sum_probs=22.1
Q ss_pred hHHHHHHHHHHHHhhCCCccEEEEeCCCC
Q 015921 137 SMAILNVFINFIQVTMPKSDVFILTDPAS 165 (398)
Q Consensus 137 s~avln~Fi~sIrvsmP~s~VVILTD~~T 165 (398)
-..=+..++.-||..+|++.|++++=+-+
T Consensus 76 ~~~~~~~~i~~i~~~~p~~~iil~~~~~~ 104 (177)
T cd01844 76 VRERLGPLVKGLRETHPDTPILLVSPRYC 104 (177)
T ss_pred HHHHHHHHHHHHHHHCcCCCEEEEecCCC
Confidence 34556778889999999999888775433
No 67
>PLN03180 reversibly glycosylated polypeptide; Provisional
Probab=29.69 E-value=25 Score=36.43 Aligned_cols=108 Identities=25% Similarity=0.442 Sum_probs=61.1
Q ss_pred hhHHHHHHHHHHHHhhCCCccEEEEe--CCCCCCCCCCC-ceEEEecCCccCchhH--HHHHHHH----------HHHHH
Q 015921 136 RSMAILNVFINFIQVTMPKSDVFILT--DPASDLSMPRK-GVTIYPIHGEYSRDKL--MLQRIRS----------YITFL 200 (398)
Q Consensus 136 rs~avln~Fi~sIrvsmP~s~VVILT--D~~T~~sl~~~-~~~v~pi~g~YSR~~L--mlQRi~s----------Yi~fL 200 (398)
|....|+.+...++.++ +||.+ |+.+.+-.+++ ..++ |+|+.. .+.++.+ |+-||
T Consensus 21 ~~~~fL~~~r~~l~~~h----~iiV~d~D~~~~~~~~~G~d~~v------y~r~d~~~~Lg~~~~~Ip~~~~a~R~fGyL 90 (346)
T PLN03180 21 RNLDFLEMWRPFFQPYH----LIIVQDGDPSKEIKVPEGFDYEL------YNRNDINRILGPKASCISFKDSACRCFGYL 90 (346)
T ss_pred CchhHHHHHHHhcCccc----EEEEecCCcccceeccCCCceee------cCHHHHHhhhcccccccccCcccchhhhhe
Confidence 33567778888877776 55555 45555444433 2223 233222 0111111 23344
Q ss_pred HHHHHhhccCCCccccEEEeecCeeeecCch----hhhccCCCceEEE---e--eccCCCCcCcceEEEEecCC
Q 015921 201 ERRIREHSQGQGHINHYVFTDSDIAVVDDLG----HIFHDYQNFHLAL---T--FRNNKDQPLNSGFIAVRGTP 265 (398)
Q Consensus 201 kT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg----~IFk~~PdFHvAL---T--fRnnkdqPiNSGfIfVRgt~ 265 (398)
..+ ++ =.++||-|.+..+|-. +.+.+ |.+.-. | |.|.-=+|+..|-.||||=|
T Consensus 91 ~s~-~~---------yivsiDDD~~Pa~d~~g~~i~~~~q--H~~NL~~pstp~~fNtLYdp~r~g~~fvRGYP 152 (346)
T PLN03180 91 VSK-KK---------YIFTIDDDCFVAKDPSGKLINALEQ--HIKNLLSPSTPFFFNTLYDPYREGADFVRGYP 152 (346)
T ss_pred eec-ce---------EEEEECCCCCCCCCCccccccHHHH--HHHhcCCCCCCceeecccccCccCCcccCCCC
Confidence 443 22 3578999999999954 34444 222222 2 55666788999999999988
No 68
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=29.49 E-value=1.4e+02 Score=30.18 Aligned_cols=68 Identities=13% Similarity=0.135 Sum_probs=39.2
Q ss_pred HHHHhhCCCccEEEEeCCCCCCCCC--------CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccE
Q 015921 146 NFIQVTMPKSDVFILTDPASDLSMP--------RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHY 217 (398)
Q Consensus 146 ~sIrvsmP~s~VVILTD~~T~~sl~--------~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~df 217 (398)
+.++...|+-+|||+.|..++.+.. ..++++++.+++-.+.+- ++.-++. ...+.+
T Consensus 96 sll~q~yp~~eIivVdDgs~D~t~~~~~~~~~~~~~v~vv~~~~n~Gka~A-----------lN~gl~~-----a~~d~i 159 (444)
T PRK14583 96 AALAQTYTNIEVIAINDGSSDDTAQVLDALLAEDPRLRVIHLAHNQGKAIA-----------LRMGAAA-----ARSEYL 159 (444)
T ss_pred HHHcCCCCCeEEEEEECCCCccHHHHHHHHHHhCCCEEEEEeCCCCCHHHH-----------HHHHHHh-----CCCCEE
Confidence 3346678998988887766654432 124667776544333111 1111211 134678
Q ss_pred EEeecCeeeecC
Q 015921 218 VFTDSDIAVVDD 229 (398)
Q Consensus 218 IFiDsDi~VvDd 229 (398)
+++|.|.+.-.|
T Consensus 160 v~lDAD~~~~~d 171 (444)
T PRK14583 160 VCIDGDALLDKN 171 (444)
T ss_pred EEECCCCCcCHH
Confidence 999999997655
No 69
>TIGR02193 heptsyl_trn_I lipopolysaccharide heptosyltransferase I. This family consists of examples of ADP-heptose:LPS heptosyltransferase I, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria.
Probab=28.47 E-value=1.7e+02 Score=27.73 Aligned_cols=43 Identities=9% Similarity=0.059 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCC-ce-EEEecCC
Q 015921 139 AILNVFINFIQVTMPKSDVFILTDPASDLSMPRK-GV-TIYPIHG 181 (398)
Q Consensus 139 avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~-~~-~v~pi~g 181 (398)
-+..-++..||..+|++++.+||.+...--+... .+ ++++++.
T Consensus 14 vl~~p~l~~Lr~~~P~a~I~~l~~~~~~~~~~~~p~vd~v~~~~~ 58 (319)
T TIGR02193 14 IHTLPALTDIKRALPDVEIDWVVEEGFADIVRLHPAVDEVIPVAL 58 (319)
T ss_pred HHHHHHHHHHHHhCCCCEEEEEEChhHhhhhhcCCCccEEEEech
Confidence 4556789999999999999999998885444321 23 4777663
No 70
>cd01820 PAF_acetylesterase_like PAF_acetylhydrolase (PAF-AH)_like subfamily of SGNH-hydrolases. Platelet-activating factor (PAF) and PAF-AH are key players in inflammation and in atherosclerosis. PAF-AH is a calcium independent phospholipase A2 which exhibits strong substrate specificity towards PAF, hydrolyzing an acetyl ester at the sn-2 position. PAF-AH also degrades a family of oxidized PAF-like phospholipids with short sn-2 residues. In addition, PAF and PAF-AH are associated with neural migration and mammalian reproduction.
Probab=27.22 E-value=3e+02 Score=24.63 Aligned_cols=28 Identities=14% Similarity=0.310 Sum_probs=23.2
Q ss_pred cchhHHHHHHHHHHHHhhCCCccEEEEe
Q 015921 134 TERSMAILNVFINFIQVTMPKSDVFILT 161 (398)
Q Consensus 134 ~~rs~avln~Fi~sIrvsmP~s~VVILT 161 (398)
.+.-.+=+..++.-+|..+|+++|++++
T Consensus 108 ~~~~~~~l~~ii~~l~~~~P~~~Iil~~ 135 (214)
T cd01820 108 AEEIAEGILAIVEEIREKLPNAKILLLG 135 (214)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCeEEEEe
Confidence 5555666778899999999999999986
No 71
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=27.19 E-value=2.5e+02 Score=23.97 Aligned_cols=17 Identities=24% Similarity=0.126 Sum_probs=13.1
Q ss_pred cccEEEeecCeeeecCc
Q 015921 214 INHYVFTDSDIAVVDDL 230 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDdL 230 (398)
-+.++|+|+|..+..+.
T Consensus 82 ~d~v~~~DaD~~~~p~~ 98 (183)
T cd06438 82 PDAVVVFDADNLVDPNA 98 (183)
T ss_pred CCEEEEEcCCCCCChhH
Confidence 45678999999987543
No 72
>KOG3487 consensus TRAPP 20 K subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=27.15 E-value=75 Score=29.34 Aligned_cols=40 Identities=25% Similarity=0.351 Sum_probs=28.1
Q ss_pred cceEEEEecCCCchhHHHHHHHHHHHHHHHhHHhHhhhhcc
Q 015921 255 NSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGD 295 (398)
Q Consensus 255 NSGfIfVRgt~d~~~kA~~FLeeV~eiY~sky~~a~~mlGD 295 (398)
+.=||.+++++- =..-++|||||+|.|.+.-|+-.=+..|
T Consensus 79 ~i~f~mlh~~~~-~~~ik~Ffqev~elyik~lmnpFy~~~d 118 (139)
T KOG3487|consen 79 HIRFIMLHINRA-DDNIKLFFQEVHELYIKTLMNPFYEIND 118 (139)
T ss_pred cEEEEEEeeccc-cccHHHHHHHHHHHHHHHHhCcccccCC
Confidence 345777877763 2356899999999999887655445444
No 73
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=27.14 E-value=3.3e+02 Score=23.90 Aligned_cols=29 Identities=10% Similarity=0.361 Sum_probs=17.0
Q ss_pred HHHHHHHH-HhhCCCccEEEEeCCCCCCCC
Q 015921 141 LNVFINFI-QVTMPKSDVFILTDPASDLSM 169 (398)
Q Consensus 141 ln~Fi~sI-rvsmP~s~VVILTD~~T~~sl 169 (398)
|...+.|+ +...|+.++||.-|..|+-+.
T Consensus 14 l~~~l~sl~~q~~~~~eiiVvdd~s~D~t~ 43 (236)
T cd06435 14 VKETLDSLAALDYPNFEVIVIDNNTKDEAL 43 (236)
T ss_pred HHHHHHHHHhCCCCCcEEEEEeCCCCchhH
Confidence 33344445 344588888777666666543
No 74
>COG0369 CysJ Sulfite reductase, alpha subunit (flavoprotein) [Inorganic ion transport and metabolism]
Probab=26.82 E-value=7e+02 Score=27.57 Aligned_cols=143 Identities=18% Similarity=0.343 Sum_probs=86.7
Q ss_pred ceEEeceeeeccch-hHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceE-EEec-CCccCchhHHHHHHHHHHHH
Q 015921 123 MVTVGNASYSKTER-SMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVT-IYPI-HGEYSRDKLMLQRIRSYITF 199 (398)
Q Consensus 123 ~v~vg~~~~~k~~r-s~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~-v~pi-~g~YSR~~LmlQRi~sYi~f 199 (398)
-+|||-+.|...+| -..|...|+.-+.. ++..+-|-..++.+.-++.+..+ ++=| || -++-+. =.|
T Consensus 390 hltV~vV~y~~~~~~r~GvcS~~L~~~~~--~g~~i~v~v~~n~nf~lp~~~~~PiIMIG~G------TGIAPF---Raf 458 (587)
T COG0369 390 HLTVGVVRYQAEGRERYGVCSGYLADLLE--EGDTIPVFVQPNKNFRLPEDPETPIIMIGPG------TGIAPF---RAF 458 (587)
T ss_pred EEEEEEEEeccCCCcccccchHHHHhhhc--CCCeEEEEeccCCccccCCCCCCceEEEcCC------CCchhH---HHH
Confidence 46888899988887 67788888887766 56677777777777766665321 1111 10 012222 245
Q ss_pred HHHHHHhhccCCCccccEEEe-----ecCeeeecCchhhhccCCCceEEEeeccCCCCc------------------Ccc
Q 015921 200 LERRIREHSQGQGHINHYVFT-----DSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQP------------------LNS 256 (398)
Q Consensus 200 LkT~i~e~~~~~~~~~dfIFi-----DsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqP------------------iNS 256 (398)
|+.++..-+.+ ...||. +.|.+.-|++.+-.+++-.+++.+-|+..++.. -|+
T Consensus 459 vq~r~~~~~~g----k~wLfFG~R~~~~DfLY~~Ewe~~~~~G~~~~l~~AfSRdq~~KiYVQd~lre~~del~~~l~~g 534 (587)
T COG0369 459 VQERAANGAEG----KNWLFFGCRHFTEDFLYQEEWEEYLKDGVLTRLDLAFSRDQEEKIYVQDRLREQADELWEWLEEG 534 (587)
T ss_pred HHHHHhccccC----ceEEEecCCCCccchhhHHHHHHHHhcCCceeEEEEEeecCCCCccHHHHHHHhHHHHHHHHHCC
Confidence 55554333222 355554 889999999999777766788877654333222 267
Q ss_pred eEEEEecCCCchhH-HHHHHHHHHH
Q 015921 257 GFIAVRGTPDGISR-AKIFLEEVLR 280 (398)
Q Consensus 257 GfIfVRgt~d~~~k-A~~FLeeV~e 280 (398)
|.|+|+|...+..+ -++-|+++++
T Consensus 535 a~~YVCGd~~~Ma~dV~~AL~~il~ 559 (587)
T COG0369 535 AHIYVCGDAKGMAKDVEEALLDILA 559 (587)
T ss_pred CEEEEeCCCccchHHHHHHHHHHHH
Confidence 89999996644432 2333444444
No 75
>cd01828 sialate_O-acetylesterase_like2 sialate_O-acetylesterase_like subfamily of the SGNH-hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=26.50 E-value=2.2e+02 Score=24.02 Aligned_cols=69 Identities=10% Similarity=0.084 Sum_probs=41.9
Q ss_pred cchhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 134 TERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 134 ~~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
.++-..=+..+|..+|..+|+++|+++|=+.+. +. . .-..++++.|-+-+++..++.
T Consensus 67 ~~~~~~~l~~li~~~~~~~~~~~vi~~~~~p~~----~~----------~---~~~~~~~~~~n~~l~~~a~~~------ 123 (169)
T cd01828 67 DEDIVANYRTILEKLRKHFPNIKIVVQSILPVG----EL----------K---SIPNEQIEELNRQLAQLAQQE------ 123 (169)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCeEEEEecCCcC----cc----------C---cCCHHHHHHHHHHHHHHHHHC------
Confidence 355566678889999999999999998733322 00 0 111345555555555443333
Q ss_pred cccEEEeecCeeee
Q 015921 214 INHYVFTDSDIAVV 227 (398)
Q Consensus 214 ~~dfIFiDsDi~Vv 227 (398)
...|+|....+.
T Consensus 124 --~~~~id~~~~~~ 135 (169)
T cd01828 124 --GVTFLDLWAVFT 135 (169)
T ss_pred --CCEEEechhhhc
Confidence 667888765543
No 76
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=26.46 E-value=2e+02 Score=24.33 Aligned_cols=17 Identities=18% Similarity=0.497 Sum_probs=13.9
Q ss_pred ccceEEEEeecccccCC
Q 015921 324 IIGASVLFLPCATYNWT 340 (398)
Q Consensus 324 v~g~sVlfLPC~~YNwt 340 (398)
-.|.++.++|=.+|+|.
T Consensus 185 ~~g~~~~~~~~~~~~~r 201 (202)
T cd04184 185 EHTDRIAHIPRVLYHWR 201 (202)
T ss_pred hccceEEEccHhhhhhc
Confidence 34889999999999885
No 77
>PF00072 Response_reg: Response regulator receiver domain; InterPro: IPR001789 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Bipartite response regulator proteins are involved in a two-component signal transduction system in bacteria, and certain eukaryotes like protozoa, that functions to detect and respond to environmental changes []. These systems have been detected during host invasion, drug resistance, motility, phosphate uptake, osmoregulation, and nitrogen fixation, amongst others []. The two-component system consists of a histidine protein kinase environmental sensor that phosphorylates the receiver domain of a response regulator protein; phosphorylation induces a conformational change in the response regulator, which activates the effector domain, triggering the cellular response []. The domains of the two-component proteins are highly modular, but the core structures and activities are maintained. The response regulators act as phosphorylation-activated switches to affect a cellular response, usually by transcriptional regulation. Most of these proteins consist of two domains, an N-terminal response regulator receiver domain, and a variable C-terminal effector domain with DNA-binding activity. This entry represents the response regulator receiver domain, which belongs to the CheY family, and receives the signal from the sensor partner in the two-component system.; GO: 0000156 two-component response regulator activity, 0000160 two-component signal transduction system (phosphorelay), 0006355 regulation of transcription, DNA-dependent; PDB: 2QR3_A 2QXY_A 1I3C_A 1JLK_A 2PKX_A 2PL1_A 3H1F_A 3H1E_A 3GWG_A 3H1G_A ....
Probab=26.17 E-value=85 Score=23.99 Aligned_cols=24 Identities=17% Similarity=0.335 Sum_probs=20.0
Q ss_pred HHHHHHHhhCCCccEEEEeCCCCC
Q 015921 143 VFINFIQVTMPKSDVFILTDPASD 166 (398)
Q Consensus 143 ~Fi~sIrvsmP~s~VVILTD~~T~ 166 (398)
.++..||..+|+..+|++|+..+.
T Consensus 60 ~~~~~i~~~~~~~~ii~~t~~~~~ 83 (112)
T PF00072_consen 60 ELLEQIRQINPSIPIIVVTDEDDS 83 (112)
T ss_dssp HHHHHHHHHTTTSEEEEEESSTSH
T ss_pred ccccccccccccccEEEecCCCCH
Confidence 456778889999999999988773
No 78
>PF00871 Acetate_kinase: Acetokinase family; InterPro: IPR000890 Acetate kinase, which is predominantly found in micro-organisms, facilitates the production of acetyl-CoA by phosphorylating acetate in the presence of ATP and a divalent cation [, ]. The enzyme is important in the process of glycolysis, enzyme levels being increased in the presence of excess glucose. The growth of a bacterial mutant lacking acetate kinase has been shown to be inhibited by glucose, suggesting that the enzyme is involved in excretion of excess carbohydrate []. A related enzyme, butyrate kinase, facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate [].; GO: 0016301 kinase activity, 0016774 phosphotransferase activity, carboxyl group as acceptor, 0008152 metabolic process, 0016310 phosphorylation, 0005622 intracellular; PDB: 3P4I_B 3R9P_B 2IIR_J 1SAZ_A 1X9J_D 4DQ8_B 1TUU_A 1TUY_B 1G99_A 1X3N_A ....
Probab=24.93 E-value=37 Score=34.93 Aligned_cols=91 Identities=16% Similarity=0.178 Sum_probs=61.8
Q ss_pred ccCCCCccceEEeceeeeccchhHHH------HHH-HHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchh
Q 015921 115 HVDSRASNMVTVGNASYSKTERSMAI------LNV-FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 187 (398)
Q Consensus 115 ~~~~~~~~~v~vg~~~~~k~~rs~av------ln~-Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~ 187 (398)
|-+...+..+.|.+..+..+++--.. -|. -|...+...|++..|...|+..-.++++. +..|++|+.+-+
T Consensus 91 hGG~~~~~~~~I~~~v~~~l~~~~~laplhnp~~l~~i~~~~~~~p~~~~vavfDt~fh~~lp~~-ar~y~lP~~~~~-- 167 (388)
T PF00871_consen 91 HGGPIFGGPVRIDDEVLDDLEELSPLAPLHNPGALIAIRAARELFPPAPQVAVFDTAFHDTLPPV-ARLYGLPYIWYE-- 167 (388)
T ss_dssp TTTTT-SSEEE-SHHHHHHHHHHGGGSTTTHHHHHHHHHHHHHHSTTSEEEEEETTGGGGGS-HH-HHS-SSTHHHHH--
T ss_pred cCCCCCCCcEEECHHHHHHHHhccCCCCCcChHHHHHHHHHHHhCCCccEEEEcCCcccccchHH-HHHcCCCHHHHH--
Confidence 45666777777777666666543211 122 36777889999999999999999999887 889999988866
Q ss_pred HHHHHHHHH------HHHHHHHHHhhccC
Q 015921 188 LMLQRIRSY------ITFLERRIREHSQG 210 (398)
Q Consensus 188 LmlQRi~sY------i~fLkT~i~e~~~~ 210 (398)
.+.||.| .+|+-.++.+..++
T Consensus 168 --~~giRrygfHgLS~~~va~~~a~~lgk 194 (388)
T PF00871_consen 168 --KYGIRRYGFHGLSYRYVARRAAELLGK 194 (388)
T ss_dssp --HHT-S--BS-HHHHHHHHHHHHHHTTS
T ss_pred --hcCceEecchHHhHHHHHHHHHHHcCC
Confidence 5777778 78888888776443
No 79
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=24.92 E-value=2.4e+02 Score=28.67 Aligned_cols=22 Identities=18% Similarity=0.327 Sum_probs=15.5
Q ss_pred cceEEEEeecccccCCCCCCCC
Q 015921 325 IGASVLFLPCATYNWTPPEGAG 346 (398)
Q Consensus 325 ~g~sVlfLPC~~YNwtPpegaG 346 (398)
.|.+|.+.|=++.-..|||.-.
T Consensus 252 ~g~kv~~~~~a~~~~~~p~t~~ 273 (439)
T TIGR03111 252 LDGKVYLCENAIFYVDPIDGLN 273 (439)
T ss_pred cCCeEEECCCCEEEEECCcCHH
Confidence 5778888887777666776543
No 80
>PF13358 DDE_3: DDE superfamily endonuclease
Probab=24.45 E-value=92 Score=24.93 Aligned_cols=59 Identities=25% Similarity=0.453 Sum_probs=43.0
Q ss_pred HHHHHHhhCCCc-cEEEEeCCCCCC-------CCCC--CceEEEecCCccCchhHHHHHHHHHHHHHHHHHHh
Q 015921 144 FINFIQVTMPKS-DVFILTDPASDL-------SMPR--KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIRE 206 (398)
Q Consensus 144 Fi~sIrvsmP~s-~VVILTD~~T~~-------sl~~--~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e 206 (398)
|+..+....++. +++++.|+++-= -+.. .++++...| .||. .+++|..+..+||..+..
T Consensus 68 ~l~~~~~~~~~~~~~~li~DNa~~H~~~~~~~~l~~~~~~~~~~~~P-~~sP---dLNpiE~~w~~lk~~~~~ 136 (146)
T PF13358_consen 68 FLEQLLRPYPRKGRIVLIMDNASIHKSKKVQEWLEEHERGIELLFLP-PYSP---DLNPIENVWGYLKRRIRR 136 (146)
T ss_pred cccccccccccceEEEEecccccccccccccceeecccccccccccc-CcCC---ccCHHHHHHHHHHHHHHh
Confidence 555555555665 999999987632 1223 567788887 6777 399999999999999876
No 81
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=24.22 E-value=3.3e+02 Score=25.13 Aligned_cols=73 Identities=12% Similarity=0.141 Sum_probs=37.2
Q ss_pred HHHHHHHHHhhCCCccEEEEeCCCCCCC--C----C-CCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCc
Q 015921 141 LNVFINFIQVTMPKSDVFILTDPASDLS--M----P-RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH 213 (398)
Q Consensus 141 ln~Fi~sIrvsmP~s~VVILTD~~T~~s--l----~-~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~ 213 (398)
|--.+.||... ..-||+.|+++..+ + . ..++++++-+.+..- -.=.++ =++...+ ..
T Consensus 10 l~~~l~sl~~q---~~~iiVVDN~S~~~~~~~~~~~~~~~i~~i~~~~N~G~-----a~a~N~--Gi~~a~~------~~ 73 (281)
T TIGR01556 10 LGELITSLPKQ---VDRIIAVDNSPHSDQPLKNARLRGQKIALIHLGDNQGI-----AGAQNQ--GLDASFR------RG 73 (281)
T ss_pred HHHHHHHHHhc---CCEEEEEECcCCCcHhHHHHhccCCCeEEEECCCCcch-----HHHHHH--HHHHHHH------CC
Confidence 33344555432 34688888886432 1 1 135667776644332 210111 0111111 13
Q ss_pred cccEEEeecCeeeecC
Q 015921 214 INHYVFTDSDIAVVDD 229 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd 229 (398)
.+.++|+|.|..+-++
T Consensus 74 ~d~i~~lD~D~~~~~~ 89 (281)
T TIGR01556 74 VQGVLLLDQDSRPGNA 89 (281)
T ss_pred CCEEEEECCCCCCCHH
Confidence 4678899999999643
No 82
>PRK10422 lipopolysaccharide core biosynthesis protein; Provisional
Probab=23.14 E-value=3.5e+02 Score=26.30 Aligned_cols=40 Identities=13% Similarity=0.151 Sum_probs=28.7
Q ss_pred HHHHHHHHHhhCCCccEEEEeCCCCCCCCC--CCceEEEecC
Q 015921 141 LNVFINFIQVTMPKSDVFILTDPASDLSMP--RKGVTIYPIH 180 (398)
Q Consensus 141 ln~Fi~sIrvsmP~s~VVILTD~~T~~sl~--~~~~~v~pi~ 180 (398)
..-++..||...|++++.+||.+.+.--+. +.--+++.++
T Consensus 22 ~~P~l~~Lk~~~P~a~I~~l~~~~~~~l~~~~P~id~vi~~~ 63 (352)
T PRK10422 22 TTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGIK 63 (352)
T ss_pred HHHHHHHHHHHCCCCeEEEEeccChHHHhccCCCceEEEEec
Confidence 356889999999999999999998764333 2222366665
No 83
>TIGR02768 TraA_Ti Ti-type conjugative transfer relaxase TraA. This protein contains domains distinctive of a single strand exonuclease (N-terminus, MobA/MobL, pfam03389) as well as a helicase domain (central region, homologous to the corresponding region of the F-type relaxase TraI, TIGR02760). This protein likely fills the same role as TraI(F), nicking (at the oriT site) and unwinding the coiled plasmid prior to conjugative transfer.
Probab=22.79 E-value=1.9e+02 Score=32.21 Aligned_cols=84 Identities=21% Similarity=0.165 Sum_probs=48.2
Q ss_pred EEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEeecCeeeecCchhhhccCCCceEEEeeccCCCCcC
Q 015921 175 TIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPL 254 (398)
Q Consensus 175 ~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiDsDi~VvDdLg~IFk~~PdFHvALTfRnnkdqPi 254 (398)
-.+..|.+.|+ -|| ++.++.-+++.....+..- |.+|-++ ...-||+|+.+|.|.-.+
T Consensus 88 ~~iaLP~El~~----~~~----~~L~~~f~~~~~~~~g~~~-------d~aiH~~----~~~NpHaHim~t~R~~~~--- 145 (744)
T TIGR02768 88 FEIALPRELNL----EQN----IELARRFVRDHFVEKGMVA-------DWAIHDD----GDGNPHAHLLTTTRPLEE--- 145 (744)
T ss_pred EeEeCchhcCH----HHH----HHHHHHHHHHHHHhCCCeE-------EEEEecC----CCCCCEEEEEeeceeecc---
Confidence 37888999999 888 4444444443322222333 3344431 245899999999874332
Q ss_pred cceEEEEecCC----------CchhHHHHHHHHHHHHHH
Q 015921 255 NSGFIAVRGTP----------DGISRAKIFLEEVLRVYS 283 (398)
Q Consensus 255 NSGfIfVRgt~----------d~~~kA~~FLeeV~eiY~ 283 (398)
.| |=+..+ ++...-++++++|-+.|-
T Consensus 146 -~g--f~~k~~~~~~~~~~~~~~~~~~~~~~~~~R~~wa 181 (744)
T TIGR02768 146 -NG--FGAKKVRVLGEDGQPYEFWAGSKEDLDYWREQWA 181 (744)
T ss_pred -cc--CCCcccccccccccccccccCCHHHHHHHHHHHH
Confidence 12 111111 122235788999999995
No 84
>PTZ00138 small nuclear ribonucleoprotein; Provisional
Probab=22.16 E-value=73 Score=27.00 Aligned_cols=67 Identities=16% Similarity=0.291 Sum_probs=40.5
Q ss_pred chhHHHHHHHHHHHHHHHHHHhhccCCCccccEEEee-cCeeeecCchhhhccCCCceEEEeecc------CCCCcCcce
Q 015921 185 RDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTD-SDIAVVDDLGHIFHDYQNFHLALTFRN------NKDQPLNSG 257 (398)
Q Consensus 185 R~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIFiD-sDi~VvDdLg~IFk~~PdFHvALTfRn------nkdqPiNSG 257 (398)
..++|.|+|.-+.+|++++.+= -+...| .+.-+...|.-.++ |..+...| ..+..-+-|
T Consensus 8 ~~~~~~~P~~~~~~~~~~~~~V---------~i~l~~~~~r~~~G~L~gfD~-----~mNlVL~d~~E~~~~~~~~~~lG 73 (89)
T PTZ00138 8 LQKIMTQPINQIFRFFTEKTRV---------QIWLYDHPNLRIEGKILGFDE-----YMNMVLDDAEEVYTKKNTRKDLG 73 (89)
T ss_pred cceeecCCHHHHHHHhcCCcEE---------EEEEEeCCCcEEEEEEEEEcc-----cceEEEccEEEEecCCceeeEcC
Confidence 3579999999999998887321 122334 35555555544332 33443322 234556789
Q ss_pred EEEEecCC
Q 015921 258 FIAVRGTP 265 (398)
Q Consensus 258 fIfVRgt~ 265 (398)
.+++||+.
T Consensus 74 ~ilIRGnn 81 (89)
T PTZ00138 74 RILLKGDN 81 (89)
T ss_pred eEEEcCCE
Confidence 99999975
No 85
>PF08331 DUF1730: Domain of unknown function (DUF1730); InterPro: IPR013542 This domain of unknown function occurs in iron-sulphur cluster-binding proteins together with the 4Fe-4S binding domain (IPR001450 from INTERPRO).
Probab=21.68 E-value=2.8e+02 Score=22.08 Aligned_cols=65 Identities=11% Similarity=0.234 Sum_probs=36.4
Q ss_pred HhhCCCccEEEEe--CCCC-------CCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCccccEEE
Q 015921 149 QVTMPKSDVFILT--DPAS-------DLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVF 219 (398)
Q Consensus 149 rvsmP~s~VVILT--D~~T-------~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~~dfIF 219 (398)
+..+|+++=||.. .=.+ ...-..+++.-|---.+|-+ .|.+|++.-.+||+.+..+. +--+|
T Consensus 5 ~~~~p~arSvIv~a~~Y~~~~~~~~~~~~~~~g~iarYA~G~DYH~--vlk~~L~~l~~~i~~~~~~~-------~~r~~ 75 (78)
T PF08331_consen 5 RKLLPGARSVIVLAFPYYPEPPPPPPPPGPGRGRIARYAWGRDYHK--VLKKKLEQLAEWIRELGPDF-------EYRIF 75 (78)
T ss_pred HHhCCCCcEEEEEEccCCCccccccccCCCCCeeEeehhccCChHH--HHHHHHHHHHHHHHHHCCCC-------CeEEe
Confidence 5678888544433 2222 11222334555555556666 88888777777777775332 24566
Q ss_pred eec
Q 015921 220 TDS 222 (398)
Q Consensus 220 iDs 222 (398)
.|+
T Consensus 76 VDT 78 (78)
T PF08331_consen 76 VDT 78 (78)
T ss_pred ecC
Confidence 664
No 86
>cd01838 Isoamyl_acetate_hydrolase_like Isoamyl-acetate hydrolyzing esterase-like proteins. SGNH_hydrolase subfamily similar to the Saccharomyces cerevisiae IAH1. IAH1 may be the major esterase that hydrolyses isoamyl acetate in sake mash. The SGNH-family of hydrolases is a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases
Probab=21.18 E-value=3.3e+02 Score=23.07 Aligned_cols=76 Identities=16% Similarity=0.118 Sum_probs=42.7
Q ss_pred chhHHHHHHHHHHHHhhCCCccEEEEeCCCCCCCCCCCceEEEecCCccCchhHHHHHHHHHHHHHHHHHHhhccCCCcc
Q 015921 135 ERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHI 214 (398)
Q Consensus 135 ~rs~avln~Fi~sIrvsmP~s~VVILTD~~T~~sl~~~~~~v~pi~g~YSR~~LmlQRi~sYi~fLkT~i~e~~~~~~~~ 214 (398)
++--.=+...+.-+|..+|+++||++|=+-.......... -.+. ..+.| ..++++.|.+.+++..+++
T Consensus 88 ~~~~~~~~~~i~~~~~~~~~~~ii~~t~~~~~~~~~~~~~-~~~~-~~~~~---~~~~~~~~~~~~~~~a~~~------- 155 (199)
T cd01838 88 DEYKENLRKIVSHLKSLSPKTKVILITPPPVDEEAWEKSL-EDGG-SQPGR---TNELLKQYAEACVEVAEEL------- 155 (199)
T ss_pred HHHHHHHHHHHHHHHhhCCCCeEEEeCCCCCCHHHHhhhh-cccc-CCccc---cHHHHHHHHHHHHHHHHHh-------
Confidence 4444557778888888899999999973322111000000 0011 11222 2466777888877766666
Q ss_pred ccEEEeecC
Q 015921 215 NHYVFTDSD 223 (398)
Q Consensus 215 ~dfIFiDsD 223 (398)
+.-|+|..
T Consensus 156 -~~~~iD~~ 163 (199)
T cd01838 156 -GVPVIDLW 163 (199)
T ss_pred -CCcEEEHH
Confidence 46677654
No 87
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=21.05 E-value=2.8e+02 Score=21.60 Aligned_cols=16 Identities=25% Similarity=0.202 Sum_probs=13.1
Q ss_pred cccEEEeecCeeeecC
Q 015921 214 INHYVFTDSDIAVVDD 229 (398)
Q Consensus 214 ~~dfIFiDsDi~VvDd 229 (398)
.+.++|+|+|..+..+
T Consensus 79 ~~~i~~~D~D~~~~~~ 94 (180)
T cd06423 79 GDIVVVLDADTILEPD 94 (180)
T ss_pred CCEEEEECCCCCcChH
Confidence 4468999999998776
No 88
>COG5597 Alpha-N-acetylglucosamine transferase [Cell envelope biogenesis, outer membrane]
Probab=20.18 E-value=37 Score=35.32 Aligned_cols=23 Identities=26% Similarity=0.606 Sum_probs=21.1
Q ss_pred ccccEEEeecCeeeecCchhhhc
Q 015921 213 HINHYVFTDSDIAVVDDLGHIFH 235 (398)
Q Consensus 213 ~~~dfIFiDsDi~VvDdLg~IFk 235 (398)
--+..||+|+|+||.-++.+||.
T Consensus 167 EyDRvifLDsDaivlknmDklFd 189 (368)
T COG5597 167 EYDRVIFLDSDAIVLKNMDKLFD 189 (368)
T ss_pred hhceEEEeccchHHhhhhHHHhc
Confidence 55689999999999999999997
Done!