Query psy3380
Match_columns 290
No_of_seqs 222 out of 1411
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 23:33:45 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3380.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3380hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3736|consensus 100.0 3.1E-71 6.7E-76 540.0 17.2 256 2-288 235-534 (578)
2 KOG3738|consensus 100.0 1.3E-67 2.9E-72 485.0 12.5 253 2-286 213-513 (559)
3 KOG3737|consensus 100.0 1.8E-59 3.8E-64 430.7 12.1 260 2-289 249-559 (603)
4 KOG3736|consensus 100.0 4.2E-46 9E-51 363.8 2.2 270 3-289 151-499 (578)
5 KOG3738|consensus 100.0 6E-46 1.3E-50 342.0 2.3 268 3-286 133-471 (559)
6 KOG3737|consensus 100.0 1.7E-34 3.7E-39 266.0 0.6 268 4-288 162-522 (603)
7 cd02510 pp-GalNAc-T pp-GalNAc- 99.9 4.6E-24 1E-28 196.3 11.4 181 2-213 92-298 (299)
8 cd00161 RICIN Ricin-type beta- 98.8 1.3E-08 2.8E-13 79.8 7.7 70 217-288 9-84 (124)
9 PF00652 Ricin_B_lectin: Ricin 98.8 9.1E-09 2E-13 81.4 5.8 77 209-288 4-87 (124)
10 smart00458 RICIN Ricin-type be 98.8 2E-08 4.3E-13 78.8 7.0 69 216-288 5-77 (117)
11 cd00161 RICIN Ricin-type beta- 98.5 5.1E-07 1.1E-11 70.5 6.8 67 217-286 51-124 (124)
12 PF00652 Ricin_B_lectin: Ricin 98.4 2.7E-07 5.9E-12 72.8 5.0 76 206-284 38-124 (124)
13 PF14200 RicinB_lectin_2: Rici 98.4 9.8E-07 2.1E-11 68.9 6.2 72 214-287 21-105 (105)
14 smart00458 RICIN Ricin-type be 98.3 1.3E-06 2.8E-11 68.3 6.7 66 217-286 45-116 (117)
15 cd02510 pp-GalNAc-T pp-GalNAc- 98.3 7.6E-07 1.6E-11 81.9 5.3 159 10-182 13-203 (299)
16 PF02709 Glyco_transf_7C: N-te 98.1 7.2E-07 1.6E-11 66.4 0.4 41 86-127 16-56 (78)
17 PF10111 Glyco_tranf_2_2: Glyc 97.5 8.7E-05 1.9E-09 68.0 3.5 41 86-127 165-205 (281)
18 cd00899 b4GalT Beta-4-Galactos 97.2 0.00016 3.4E-09 64.1 1.8 40 87-127 109-148 (219)
19 COG1216 Predicted glycosyltran 96.6 0.0029 6.3E-08 58.5 5.1 117 5-159 96-213 (305)
20 cd06420 GT2_Chondriotin_Pol_N 96.4 0.0035 7.5E-08 52.3 4.0 39 88-127 125-163 (182)
21 cd04184 GT2_RfbC_Mx_like Myxoc 95.5 0.012 2.6E-07 49.9 3.1 45 91-160 151-195 (202)
22 cd02526 GT2_RfbF_like RfbF is 95.1 0.031 6.8E-07 48.6 4.8 46 91-160 153-198 (237)
23 cd04195 GT2_AmsE_like GT2_AmsE 94.8 0.035 7.6E-07 47.0 4.0 95 2-127 89-184 (201)
24 PF14200 RicinB_lectin_2: Rici 94.7 0.023 5.1E-07 43.9 2.5 36 214-250 68-104 (105)
25 cd02525 Succinoglycan_BP_ExoA 94.5 0.021 4.7E-07 49.8 2.0 110 2-159 90-201 (249)
26 cd02522 GT_2_like_a GT_2_like_ 94.1 0.063 1.4E-06 46.1 4.2 31 93-127 140-170 (221)
27 TIGR01556 rhamnosyltran L-rham 94.0 0.067 1.5E-06 48.4 4.2 46 89-159 149-194 (281)
28 PF13641 Glyco_tranf_2_3: Glyc 93.8 0.083 1.8E-06 45.7 4.4 50 86-161 156-205 (228)
29 PF02709 Glyco_transf_7C: N-te 92.5 0.043 9.3E-07 40.7 0.4 35 148-182 17-51 (78)
30 cd06913 beta3GnTL1_like Beta 1 90.9 0.2 4.3E-06 43.3 2.9 31 94-125 161-191 (219)
31 cd04186 GT_2_like_c Subfamily 90.2 0.23 5E-06 40.0 2.6 76 50-127 73-148 (166)
32 cd02520 Glucosylceramide_synth 90.1 0.18 4E-06 42.9 1.9 49 88-160 123-171 (196)
33 PF13632 Glyco_trans_2_3: Glyc 90.0 0.56 1.2E-05 39.6 4.9 50 87-161 70-119 (193)
34 cd06435 CESA_NdvC_like NdvC_li 89.4 0.12 2.6E-06 45.1 0.3 36 88-127 157-192 (236)
35 cd06442 DPM1_like DPM1_like re 87.5 0.39 8.3E-06 41.2 2.2 50 88-163 150-200 (224)
36 COG4092 Predicted glycosyltran 87.3 0.28 6.1E-06 44.7 1.3 35 91-125 179-213 (346)
37 TIGR03469 HonB hopene-associat 87.1 0.05 1.1E-06 52.1 -3.9 38 88-127 208-245 (384)
38 cd06421 CESA_CelA_like CESA_Ce 86.7 0.31 6.8E-06 42.0 1.2 38 86-127 157-194 (234)
39 cd06437 CESA_CaSu_A2 Cellulose 86.5 0.31 6.7E-06 42.5 1.1 43 91-159 164-206 (232)
40 cd04185 GT_2_like_b Subfamily 86.3 0.69 1.5E-05 39.1 3.1 33 93-127 125-157 (202)
41 cd04192 GT_2_like_e Subfamily 84.0 0.84 1.8E-05 39.0 2.6 36 89-125 155-190 (229)
42 cd04179 DPM_DPG-synthase_like 80.5 1.3 2.8E-05 36.6 2.4 35 86-124 148-184 (185)
43 PF00535 Glycos_transf_2: Glyc 78.5 8.5 0.00018 30.3 6.6 83 2-105 87-169 (169)
44 cd06433 GT_2_WfgS_like WfgS an 77.7 4.4 9.6E-05 33.4 4.9 34 91-127 144-177 (202)
45 PF10555 MraY_sig1: Phospho-N- 76.6 1.2 2.5E-05 21.6 0.6 10 86-96 4-13 (13)
46 PLN02899 alpha-galactosidase 73.6 2.4 5.1E-05 43.1 2.4 49 217-266 486-539 (633)
47 cd06427 CESA_like_2 CESA_like_ 72.4 1.7 3.8E-05 38.1 1.1 45 90-160 162-206 (241)
48 cd00899 b4GalT Beta-4-Galactos 69.8 2.2 4.8E-05 37.9 1.1 34 149-182 110-143 (219)
49 TIGR03472 HpnI hopanoid biosyn 69.2 1.6 3.4E-05 41.5 0.1 48 90-161 201-248 (373)
50 KOG3916|consensus 68.6 2.3 5E-05 40.1 1.0 37 89-125 260-296 (372)
51 PF10111 Glyco_tranf_2_2: Glyc 68.1 2.3 5E-05 38.7 0.9 36 148-183 166-201 (281)
52 cd04187 DPM1_like_bac Bacteria 65.6 2.4 5.3E-05 35.1 0.5 27 86-112 144-170 (181)
53 KOG3588|consensus 56.7 3.6 7.9E-05 39.3 0.0 33 92-124 386-418 (494)
54 PF11397 GlcNAc: Glycosyltrans 49.1 21 0.00045 34.0 3.8 114 2-127 125-245 (343)
55 PRK14583 hmsR N-glycosyltransf 47.8 12 0.00025 36.5 2.0 35 89-127 230-264 (444)
56 PRK11204 N-glycosyltransferase 45.7 11 0.00025 36.0 1.5 35 89-127 209-243 (420)
57 cd04190 Chitin_synth_C C-termi 45.5 25 0.00054 31.0 3.6 34 2-35 82-116 (244)
58 PF05679 CHGN: Chondroitin N-a 43.3 12 0.00026 37.3 1.3 34 92-125 412-447 (499)
59 cd06423 CESA_like CESA_like is 42.1 22 0.00047 27.9 2.5 26 86-111 151-176 (180)
60 PF03498 CDtoxinA: Cytolethal 40.1 28 0.0006 29.1 2.8 69 214-288 58-143 (150)
61 PF03002 Somatostatin: Somatos 36.1 9.1 0.0002 20.1 -0.4 9 200-208 4-12 (18)
62 TIGR03111 glyc2_xrt_Gpos1 puta 29.5 21 0.00045 34.8 0.4 35 88-125 214-248 (439)
63 PRK11498 bcsA cellulose syntha 28.5 40 0.00087 36.0 2.4 36 87-125 416-451 (852)
64 PLN02726 dolichyl-phosphate be 26.2 34 0.00073 29.9 1.2 35 89-125 166-200 (243)
No 1
>KOG3736|consensus
Probab=100.00 E-value=3.1e-71 Score=539.97 Aligned_cols=256 Identities=47% Similarity=0.880 Sum_probs=231.8
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAA 81 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~ 81 (290)
|||||+.|||||||++|++++++||||+||+||++||+|... +...+|+|||.|.|+|+.+|++++.++..++
T Consensus 235 sHcE~n~gWLePLL~~I~~~r~tvv~PvID~Id~~tf~y~~~-------~~~~rGgFdW~l~f~w~~lP~~~~~~~~~~t 307 (578)
T KOG3736|consen 235 SHCEVNVGWLEPLLARIAEDRKTVVCPVIDVIDDNTFEYEKQ-------SELMRGGFDWELTFKWERLPLPEEKRRELPT 307 (578)
T ss_pred cceeEecCcchHHHHHhhhcCceeecceEEeecCcCceeccc-------CccceeeeecceeEEeccCCccHhhcccCCC
Confidence 899999999999999999999999999999999999999863 4567899999999999999999998889999
Q ss_pred CcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe--
Q psy3380 82 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK-- 159 (290)
Q Consensus 82 ~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~-- 159 (290)
+|++||+||||||||||++|+++|+||+||++|||||+|||||+ | ||||.++|.|
T Consensus 308 ~PirsPtMaGglFAI~r~yF~eiG~yD~gMdiwGGENlElSfrv-W----------------------qCGG~lei~PCS 364 (578)
T KOG3736|consen 308 DPIRSPTMAGGLFAIDRKYFGELGSYDEGMDIWGGENLELSFRV-W----------------------QCGGRLEIVPCS 364 (578)
T ss_pred CCcCCcccCCceEEeeHHHHhhccCccccccccChhhceeeEEE-e----------------------ccCCeEEecCcc
Confidence 99999999999999999999999999999999999999999999 9 9999999999
Q ss_pred ---eeecccCCCCC--CCcccccccccc------------------ccCCCCCChhhHHHHHHHhccccchhheecc---
Q psy3380 160 ---AFFEKLGTYDS--GFDIWGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYLEVS--- 213 (290)
Q Consensus 160 ---~~F~~~g~y~~--g~~~~~~e~~e~------------------~~~~~~Gdv~~r~~lR~~l~ck~FkWyl~nv--- 213 (290)
|+||+.-||.. +.++..+|++|+ ..+.++|||++|++||++|+||+|+|||+||
T Consensus 365 rVGHifRk~~pY~~p~~~~~~~~N~~RlAeVWmDeyK~~~y~~~P~~~~~d~GDvseR~~LR~~L~CKsFkWyL~nVyPe 444 (578)
T KOG3736|consen 365 RVGHIFRKRKPYTFPDGTDTATRNLKRLAEVWMDEYKEQFYKRMPGLRNIDEGDLTERKALRERLNCKSFKWYLENVYPE 444 (578)
T ss_pred ceeeeeecCCCccCCCcchhhhhchhhhhhhhhHHHHHHHHhhCccccccCCCCchhHHHHHHhcCCccccchHhhcCcc
Confidence 99999999943 346667777776 3357999999999999999999999999998
Q ss_pred ---------------cCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEccccccccCCceeee-eCCeEEEEeCCC
Q psy3380 214 ---------------NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY-AGGDVILYPCHG 277 (290)
Q Consensus 214 ---------------~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~~~~CLD~-~g~~V~l~~C~g 277 (290)
+..++.|+|........+.++++++||+.+.||.|.+|+.|+||..+.|||+ ..+.|+|..||.
T Consensus 445 l~~~~~~~~~~G~i~~~~~~~cld~~~~~~~~~~~~~~~~Ch~~~~~Q~~~yT~~~eir~~~~cl~~~~~~~v~l~~C~~ 524 (578)
T KOG3736|consen 445 LYLPTPHVYASGEIRNGNPNLCLDTERAPAGQGMAVGLYPCHGPGGNQYFPYTKQGEIRIGDLCLDVDDAGKVTLYDCHK 524 (578)
T ss_pred ccCCCCcccccceeccCCcchhhhhhchhccCCCcceEecCCCccccccccccCCcceEECCEEeccccCCceEEEeccc
Confidence 2446789998775444467899999999999999999999999999999998 333499999999
Q ss_pred CCCceeEEEcc
Q psy3380 278 SKGNQYFEYDY 288 (290)
Q Consensus 278 ~~~nQ~W~~~~ 288 (290)
.+ ||.|.|+.
T Consensus 525 ~~-~q~w~~~~ 534 (578)
T KOG3736|consen 525 MG-NQLWHYDK 534 (578)
T ss_pred cc-ccceEEcC
Confidence 87 99999875
No 2
>KOG3738|consensus
Probab=100.00 E-value=1.3e-67 Score=485.00 Aligned_cols=253 Identities=40% Similarity=0.762 Sum_probs=222.0
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAA 81 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~ 81 (290)
||||||++||||||+||++|+++||+||||+||.+||.|+.. +...+|||||+|.|+|+.+..+.++.+.+|+
T Consensus 213 SHcEvN~~WLePLL~Rvaed~trvVsPiiDvIn~dnf~Y~~a-------sadLrGGFDWsLhF~We~~~~eqr~sr~~Pt 285 (559)
T KOG3738|consen 213 SHCEVNEGWLEPLLERVAEDTTRVVSPIIDVINLDNFSYVGA-------SADLRGGFDWSLHFKWEQMQLEQRESRADPT 285 (559)
T ss_pred cceeecchhhHHHHHHHhhcccceeecccccccccccccccc-------hhhhcCCcceEEEEEehhcCHHHHhhccCCC
Confidence 899999999999999999999999999999999999999854 4566899999999999988888888889999
Q ss_pred CcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe--
Q psy3380 82 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK-- 159 (290)
Q Consensus 82 ~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~-- 159 (290)
.|++||+||||||+||+.||.++|.||..|++|||||+|||||+ | ||||.++|+|
T Consensus 286 ~PirtP~iAGGlfvidk~wF~~LGkyd~~mdiWGGEn~Elsfrv-W----------------------~CGGslEIvPCS 342 (559)
T KOG3738|consen 286 APIRTPAIAGGLFVIDKEWFNELGKYDMDMDIWGGENLELSFRV-W----------------------QCGGSLEIVPCS 342 (559)
T ss_pred CcccCccccceeEEecHHHHHHhcccCccccccCCcceEEEEEE-E----------------------eeCCeeEEEecc
Confidence 99999999999999999999999999999999999999999999 9 9999999999
Q ss_pred ---eeecccCCCC--CC-Ccccccccccc------------------ccCCCCCChhhHHHHHHHhccccchhheecc--
Q psy3380 160 ---AFFEKLGTYD--SG-FDIWGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYLEVS-- 213 (290)
Q Consensus 160 ---~~F~~~g~y~--~g-~~~~~~e~~e~------------------~~~~~~Gdv~~r~~lR~~l~ck~FkWyl~nv-- 213 (290)
|+||+.+||+ +| ..++-+|..|. +....+|+|..|.++|+++.||+|+|||+||
T Consensus 343 RVGHVFRkrHpy~FP~gs~~ty~~NTkr~AEvWmDEYK~~yyaarPsAr~vpfg~i~~rL~~Rk~l~CksFkWYLenVyP 422 (559)
T KOG3738|consen 343 RVGHVFRKRHPYTFPGGSGNTYIKNTKRAAEVWMDEYKNYYYAARPSARRVPFGNIYDRLELRKKLRCKSFKWYLENVYP 422 (559)
T ss_pred chhhhhhccCCCcCCCCCCcchhhcchHHHHHHHHHHHHHHHhcCchhhcCCCccHHHHHHHHhhcCCcchhhhhhhcCc
Confidence 9999999994 22 23344444433 3457899999999999999999999999998
Q ss_pred --------------cCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEE--ccccccccCCceeeee----CCeEEEE
Q psy3380 214 --------------NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMM--SKHGEIRRDEACLDYA----GGDVILY 273 (290)
Q Consensus 214 --------------~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~l--t~~G~Ir~~~~CLD~~----g~~V~l~ 273 (290)
-.+...|||+.++.. ...++++.||+.++||.|.+ ++...+.+.++||++. |++|.|.
T Consensus 423 eL~ip~~~~~~~g~lrqg~~Cl~s~~~~~--~~~~gl~~C~~s~~nqqwa~~~t~~~~~~~~elCL~v~~~~pg~~v~l~ 500 (559)
T KOG3738|consen 423 ELRIPFKELIATGTLRQGDNCLDSQGQNS--QEALGLASCHGSGGNQQWAFLRTSTQLITHRELCLAVGSNTPGSPVALV 500 (559)
T ss_pred ceeccccccccccchhccchhhhhhhccc--ccCcceeecccCCCCcchhhhhhhhhHHHHHhhhheeecCCCCCeEEEE
Confidence 123567999987653 23688999999889999987 6666667789999985 6789999
Q ss_pred eCCCCCCceeEEE
Q psy3380 274 PCHGSKGNQYFEY 286 (290)
Q Consensus 274 ~C~g~~~nQ~W~~ 286 (290)
.|....+.|+|.-
T Consensus 501 ~C~~~e~~q~~v~ 513 (559)
T KOG3738|consen 501 PCGNNETKQRWVE 513 (559)
T ss_pred ecCCCCCceEEEe
Confidence 9988777898864
No 3
>KOG3737|consensus
Probab=100.00 E-value=1.8e-59 Score=430.72 Aligned_cols=260 Identities=34% Similarity=0.664 Sum_probs=230.8
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAA 81 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~ 81 (290)
|||||+.+||.|||++|.+|++....|+||.||.+||||+..-+ ......+|.|+|.|-++-.+++++++..|+..+
T Consensus 249 AHCEVntNWlpPLlAPI~rdRtvmTVP~IDgId~n~~EyrpvyG---~dn~h~rGifeWgmLyKe~~~t~rE~r~Rkhns 325 (603)
T KOG3737|consen 249 AHCEVNTNWLPPLLAPISRDRTVMTVPLIDGIDGNTYEYRPVYG---GDNDHARGIFEWGMLYKEVPLTPREKRLRKHNS 325 (603)
T ss_pred cceeeecccccccccccccCceEEEEeeeeeecCCceEEeeccC---CcchhhcchhhhhheeccCCCCHHHHHhhhccC
Confidence 89999999999999999999999999999999999999985411 112245899999999888899999999999999
Q ss_pred CcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe--
Q psy3380 82 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK-- 159 (290)
Q Consensus 82 ~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~-- 159 (290)
+|++||++|||||||+|++|.++|.||+|+.+|||||+|||||+ | +|||.+..+|
T Consensus 326 ePyRSPthAGGLfAInRe~F~ELG~YDpgLqiWGGEnfElSfKI-W----------------------QCGG~i~fVPCS 382 (603)
T KOG3737|consen 326 EPYRSPTHAGGLFAINREFFFELGLYDPGLQIWGGENFELSFKI-W----------------------QCGGKILFVPCS 382 (603)
T ss_pred CCCCCcccccceeeehHHHHHHhccCCCcceeecCcceeEEEEE-E----------------------eeCCEEEEEEcc
Confidence 99999999999999999999999999999999999999999998 9 9999999999
Q ss_pred ---eeecccCCCCCC-------Ccccccccccc------------------ccCCCCCChhhHHHHHHHhccccchhhee
Q psy3380 160 ---AFFEKLGTYDSG-------FDIWGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYLE 211 (290)
Q Consensus 160 ---~~F~~~g~y~~g-------~~~~~~e~~e~------------------~~~~~~Gdv~~r~~lR~~l~ck~FkWyl~ 211 (290)
|+||.+-||.-| --+...|.+|+ +...++||||++++||++|+||||||||+
T Consensus 383 rVGHvYR~~mpy~fgk~~~k~~~p~i~iNy~RVvetW~Ddyk~YfytreP~a~~l~~GDISeqlalr~~lnCkSFkWfMe 462 (603)
T KOG3737|consen 383 RVGHVYRSLMPYQFGKPPIKVGSPPILINYVRVVETWWDDYKDYFYTREPEAQALPYGDISEQLALREDLNCKSFKWFME 462 (603)
T ss_pred ccchhhhccccccCCCCccccCCCceEeehhhHHHHHHHhhhhheeecChhhccCCcccHHHHHHhHhhcCCchhHHHHH
Confidence 999999999432 12334556665 34568999999999999999999999999
Q ss_pred cc---------------------cCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEccccccccCCceeeeeCCeE
Q psy3380 212 VS---------------------NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV 270 (290)
Q Consensus 212 nv---------------------~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~~~~CLD~~g~~V 270 (290)
+| +.+++.|||.+++. .++.+++-+||++++||.+.++..|++...+.||.++|..+
T Consensus 463 ~iAYDv~~~yP~lP~N~~WGE~R~~at~~ClDsMG~~--p~g~mglt~CHg~GgNQL~RlN~agQl~qge~CltAdg~~i 540 (603)
T KOG3737|consen 463 EIAYDVTSHYPLLPKNVDWGEIRGFATAYCLDSMGKT--PGGFMGLTPCHGMGGNQLFRLNEAGQLMQGEQCLTADGSKI 540 (603)
T ss_pred HHHHHHHhcCCCCCCCCcchhccCcccchhHHhcCCC--CCCccccccccCCCCceEEEeccccchhccceeeecCCceE
Confidence 87 45688999999874 56789999999999999999999999999999999999999
Q ss_pred EEEeCCCCCCceeEEEccc
Q psy3380 271 ILYPCHGSKGNQYFEYDYK 289 (290)
Q Consensus 271 ~l~~C~g~~~nQ~W~~~~~ 289 (290)
.+..|.-...+-.|+|+++
T Consensus 541 ~~~hC~lgtv~g~WqY~~~ 559 (603)
T KOG3737|consen 541 MITHCNLGTVKGEWQYFKN 559 (603)
T ss_pred EEEEeecccccCceehhhc
Confidence 9999987666778988754
No 4
>KOG3736|consensus
Probab=100.00 E-value=4.2e-46 Score=363.77 Aligned_cols=270 Identities=33% Similarity=0.469 Sum_probs=209.5
Q ss_pred eecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCccccc--------C---------C-------Ccc---CCCCCceE
Q psy3380 3 CCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRF--------P---------P-------GRL---TSSYKFFI 55 (290)
Q Consensus 3 h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~--------~---------~-------~~~---~~~~~~~~ 55 (290)
|-|...-||+.++++|+++|..++++||+|+|.++.++.. . + +|+ +.|+|.++
T Consensus 151 ~nE~~s~llRtv~Svi~rtp~~lLkEIiLVdD~S~~~~l~~~Ld~y~k~~~~v~i~r~~~R~GLIrARl~GA~~A~geVL 230 (578)
T KOG3736|consen 151 HNEAWSTLLRTVHSVINRTPPYLLKEIILVDDFSDRDHLKDKLEEYVKRFSKVRILRTKKREGLIRARLLGASMATGEVL 230 (578)
T ss_pred ecCCCcchhheEEeehccCChhHeEEEEEeecCcchhhhhhhhHHHHhhhcceeEEeecchhhhHHHHhhhhhhhhchhe
Confidence 6677777799999999999999999999999999865421 1 1 222 24788899
Q ss_pred EEEeeeeeeE--E-eeCCHHHHhcccCCCCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCH
Q psy3380 56 GGFDWNLQFN--W-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPE 132 (290)
Q Consensus 56 g~fdw~l~~~--w-~~~~~~~~~~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~ 132 (290)
+++|.|+|++ | +||+++++++++.+++|++ ++||.++|+|... +...+|||||+|.|+ |..+|.
T Consensus 231 ~FLDsHcE~n~gWLePLL~~I~~~r~tvv~PvI--------D~Id~~tf~y~~~---~~~~rGgFdW~l~f~--w~~lP~ 297 (578)
T KOG3736|consen 231 TFLDSHCEVNVGWLEPLLARIAEDRKTVVCPVI--------DVIDDNTFEYEKQ---SELMRGGFDWELTFK--WERLPL 297 (578)
T ss_pred eeeecceeEecCcchHHHHHhhhcCceeecceE--------EeecCcCceeccc---CccceeeeecceeEE--eccCCc
Confidence 9999999998 8 6999999999999887666 7999999987644 335799999999999 999999
Q ss_pred HHHhccCCCCCccccCCCCCeEEEEEeeeecccCCCCCCCccccccccccccCC--------------------------
Q psy3380 133 RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG-------------------------- 186 (290)
Q Consensus 133 ~~~~~~~~~~~p~~~p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~~~~~-------------------------- 186 (290)
+++.++++++.|||||||+||+|+|+++||.++|.||+||+||||||+||||++
T Consensus 298 ~~~~~~~~~t~PirsPtMaGglFAI~r~yF~eiG~yD~gMdiwGGENlElSfrvWqCGG~lei~PCSrVGHifRk~~pY~ 377 (578)
T KOG3736|consen 298 PEEKRRELPTDPIRSPTMAGGLFAIDRKYFGELGSYDEGMDIWGGENLELSFRVWQCGGRLEIVPCSRVGHIFRKRKPYT 377 (578)
T ss_pred cHhhcccCCCCCcCCcccCCceEEeeHHHHhhccCccccccccChhhceeeEEEeccCCeEEecCccceeeeeecCCCcc
Confidence 999999999999999999999999999999999999999999999999997641
Q ss_pred --CCCChhhHHHHHHH-hccccchhheeccc--CCCCceecc----------CCCCCCCCCCeEEEcCCCCCCcceeEEc
Q psy3380 187 --DFGDVTSRKELRRN-LGCKSFKWYLEVSN--DWSGMCIDS----------ACKPTDMHKPVGLYPCHKQGGNQFWMMS 251 (290)
Q Consensus 187 --~~Gdv~~r~~lR~~-l~ck~FkWyl~nv~--~~~~~CLD~----------~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt 251 (290)
+.+++..||.+|.| +|||+||-+++... .....-.|+ .++++++. .-++||.......- ..
T Consensus 378 ~p~~~~~~~~N~~RlAeVWmDeyK~~~y~~~P~~~~~d~GDvseR~~LR~~L~CKsFkWy-L~nVyPel~~~~~~---~~ 453 (578)
T KOG3736|consen 378 FPDGTDTATRNLKRLAEVWMDEYKEQFYKRMPGLRNIDEGDLTERKALRERLNCKSFKWY-LENVYPELYLPTPH---VY 453 (578)
T ss_pred CCCcchhhhhchhhhhhhhhHHHHHHHHhhCccccccCCCCchhHHHHHHhcCCccccch-HhhcCccccCCCCc---cc
Confidence 22378899999998 89999997555431 111011111 12222221 11234433211111 13
Q ss_pred ccccccc--CCceeeeeC------CeEEEEeCCCCCCceeEEEccc
Q psy3380 252 KHGEIRR--DEACLDYAG------GDVILYPCHGSKGNQYFEYDYK 289 (290)
Q Consensus 252 ~~G~Ir~--~~~CLD~~g------~~V~l~~C~g~~~nQ~W~~~~~ 289 (290)
..|+|++ .+.|+|..+ ..+++++||+++++|.|.|+.+
T Consensus 454 ~~G~i~~~~~~~cld~~~~~~~~~~~~~~~~Ch~~~~~Q~~~yT~~ 499 (578)
T KOG3736|consen 454 ASGEIRNGNPNLCLDTERAPAGQGMAVGLYPCHGPGGNQYFPYTKQ 499 (578)
T ss_pred ccceeccCCcchhhhhhchhccCCCcceEecCCCccccccccccCC
Confidence 4588887 467999863 6899999999999999999753
No 5
>KOG3738|consensus
Probab=100.00 E-value=6e-46 Score=342.05 Aligned_cols=268 Identities=28% Similarity=0.494 Sum_probs=208.2
Q ss_pred eecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCc-c------------cccCCCcc----------CCCCCceEEEEe
Q psy3380 3 CCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTF-E------------LRFPPGRL----------TSSYKFFIGGFD 59 (290)
Q Consensus 3 h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~-~------------y~~~~~~~----------~~~~~~~~g~fd 59 (290)
|.|+.+..|+++.+.++++|.+++.+||+|+|.+.= | ..++.+|. ..|.+.++.++|
T Consensus 133 HNEARS~LLRTv~SvlnrsP~~li~EiILVDD~S~Dped~~~L~ri~kvr~LRN~~ReGLirSRvrGAdvA~a~vltFLD 212 (559)
T KOG3738|consen 133 HNEARSTLLRTVVSVLNRSPEHLIHEIILVDDFSQDPEDGKLLKRIPKVRVLRNNEREGLIRSRVRGADVAQATVLTFLD 212 (559)
T ss_pred ccHHHHHHHHHHHHHHcCChHHhhheeEEecCCCCChHHHHHHhhhheeeeecccchhhhhhhhccccccccceEEEEEe
Confidence 899999999999999999999999999999998751 1 11111221 136677788999
Q ss_pred eeeeeE--E-eeCCHHHHhcccCCCCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHh
Q psy3380 60 WNLQFN--W-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERK 136 (290)
Q Consensus 60 w~l~~~--w-~~~~~~~~~~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~ 136 (290)
.|+|+| | +||++|.+++++.+. ||++ ++|+.++|.|+|+ +.++.|||||.|.|| |+.+++..+.
T Consensus 213 SHcEvN~~WLePLL~Rvaed~trvV----sPii----DvIn~dnf~Y~~a---sadLrGGFDWsLhF~--We~~~~eqr~ 279 (559)
T KOG3738|consen 213 SHCEVNEGWLEPLLERVAEDTTRVV----SPII----DVINLDNFSYVGA---SADLRGGFDWSLHFK--WEQMQLEQRE 279 (559)
T ss_pred cceeecchhhHHHHHHHhhccccee----eccc----ccccccccccccc---hhhhcCCcceEEEEE--ehhcCHHHHh
Confidence 999998 8 699999998887765 6666 7999999999985 558899999999999 9999999999
Q ss_pred ccCCCCCccccCCCCCeEEEEEeeeecccCCCCCCCcccccccccccc-----------------------------CCC
Q psy3380 137 RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-----------------------------KGD 187 (290)
Q Consensus 137 ~~~~~~~p~~~p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~~~-----------------------------~~~ 187 (290)
.+.+|++|||||++|||+|+|++.+|.++|.||..|+||||||+||+| ++.
T Consensus 280 sr~~Pt~PirtP~iAGGlfvidk~wF~~LGkyd~~mdiWGGEn~ElsfrvW~CGGslEIvPCSRVGHVFRkrHpy~FP~g 359 (559)
T KOG3738|consen 280 SRADPTAPIRTPAIAGGLFVIDKEWFNELGKYDMDMDIWGGENLELSFRVWQCGGSLEIVPCSRVGHVFRKRHPYTFPGG 359 (559)
T ss_pred hccCCCCcccCccccceeEEecHHHHHHhcccCccccccCCcceEEEEEEEeeCCeeEEEeccchhhhhhccCCCcCCCC
Confidence 889999999999999999999999999999999999999999999955 345
Q ss_pred CCChhhHHHHHHH-hccccchhh-eecccCCCCceeccCCCCCCCCCCeEEEcCCCC---CCcceeEE-------ccccc
Q psy3380 188 FGDVTSRKELRRN-LGCKSFKWY-LEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ---GGNQFWMM-------SKHGE 255 (290)
Q Consensus 188 ~Gdv~~r~~lR~~-l~ck~FkWy-l~nv~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~---~~nQ~w~l-------t~~G~ 255 (290)
.|.+.-+|..|.+ +|||+||-| +.++..+...-++........++++ .|+.. ..|.+-++ .+.|.
T Consensus 360 s~~ty~~NTkr~AEvWmDEYK~~yyaarPsAr~vpfg~i~~rL~~Rk~l---~CksFkWYLenVyPeL~ip~~~~~~~g~ 436 (559)
T KOG3738|consen 360 SGNTYIKNTKRAAEVWMDEYKNYYYAARPSARRVPFGNIYDRLELRKKL---RCKSFKWYLENVYPELRIPFKELIATGT 436 (559)
T ss_pred CCcchhhcchHHHHHHHHHHHHHHHhcCchhhcCCCccHHHHHHHHhhc---CCcchhhhhhhcCcceeccccccccccc
Confidence 5666667778877 899999974 5555333222222111111122332 34432 23333322 45688
Q ss_pred cccCCceeeeeC----CeEEEEeCCCCCCceeEEE
Q psy3380 256 IRRDEACLDYAG----GDVILYPCHGSKGNQYFEY 286 (290)
Q Consensus 256 Ir~~~~CLD~~g----~~V~l~~C~g~~~nQ~W~~ 286 (290)
||+.+.|||+.+ ..++|+.||+.++||+|.+
T Consensus 437 lrqg~~Cl~s~~~~~~~~~gl~~C~~s~~nqqwa~ 471 (559)
T KOG3738|consen 437 LRQGDNCLDSQGQNSQEALGLASCHGSGGNQQWAF 471 (559)
T ss_pred hhccchhhhhhhcccccCcceeecccCCCCcchhh
Confidence 999999999964 2478999999999999987
No 6
>KOG3737|consensus
Probab=100.00 E-value=1.7e-34 Score=266.00 Aligned_cols=268 Identities=29% Similarity=0.419 Sum_probs=200.4
Q ss_pred ecccCCc---hHHHHHHHhcCCCeEEEeeEeeecCCCcc--------ccc----------CC-------Ccc---CCCCC
Q psy3380 4 CEVQKRW---LQPLLDVLARNSSHVVSPLIANICDDTFE--------LRF----------PP-------GRL---TSSYK 52 (290)
Q Consensus 4 ~e~~~~W---l~pll~~i~~~~~~~v~p~I~~id~~t~~--------y~~----------~~-------~~~---~~~~~ 52 (290)
++-|+|| +++++++|.++|+..+.+||.++|.++.| |.. .. +|. ..++|
T Consensus 162 VFHNEGws~LmRTVHSVi~RsP~~~l~eivlvDDfSdKehLkekLDeYv~~fnGlVkV~Rne~REGLI~aRSiGA~~atG 241 (603)
T KOG3737|consen 162 VFHNEGWSTLMRTVHSVIKRSPRKYLAEIVLVDDFSDKEHLKEKLDEYVKLFNGLVKVFRNERREGLIQARSIGAQKATG 241 (603)
T ss_pred EEecCccHHHHHHHHHHHhcCcHHhhheEEEeccCCccHHHHHHHHHHHHHhcCEEEEEecchhhhhhhhhccchhhccc
Confidence 4568888 79999999999999999999999999875 321 11 111 23788
Q ss_pred ceEEEEeeeeeeE--Ee-eCCHHHHhcccCCCCcccCccccCceeeechhhHhhhcccCCCC-cccCcccccccccccce
Q psy3380 53 FFIGGFDWNLQFN--WH-AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGGENLELSFKFNWH 128 (290)
Q Consensus 53 ~~~g~fdw~l~~~--w~-~~~~~~~~~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~-~~~Gg~~~elsfk~~W~ 128 (290)
.++-++|.|++++ |+ ||+..+..+|+.-+- |++ +.||.++|+|.-.|...- ...|.|+|.|-+| =.
T Consensus 242 eV~ifLDAHCEVntNWlpPLlAPI~rdRtvmTV----P~I----DgId~n~~EyrpvyG~dn~h~rGifeWgmLyK--e~ 311 (603)
T KOG3737|consen 242 EVLIFLDAHCEVNTNWLPPLLAPISRDRTVMTV----PLI----DGIDGNTYEYRPVYGGDNDHARGIFEWGMLYK--EV 311 (603)
T ss_pred cEEEEEecceeeecccccccccccccCceEEEE----eee----eeecCCceEEeeccCCcchhhcchhhhhheec--cC
Confidence 8888999999886 95 788899888887774 554 689999999985554332 4689999999998 58
Q ss_pred ecCHHHHhccCCCCCccccCCCCCeEEEEEeeeecccCCCCCCCccccccccccccCC-------------CCC------
Q psy3380 129 AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG-------------DFG------ 189 (290)
Q Consensus 129 ~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~~~~~-------------~~G------ 189 (290)
+++++|.+.|+..++|+||||+|||+|+|.|.+|.++|.||+|+.||||||+||||++ .+|
T Consensus 312 ~~t~rE~r~RkhnsePyRSPthAGGLfAInRe~F~ELG~YDpgLqiWGGEnfElSfKIWQCGG~i~fVPCSrVGHvYR~~ 391 (603)
T KOG3737|consen 312 PLTPREKRLRKHNSEPYRSPTHAGGLFAINREFFFELGLYDPGLQIWGGENFELSFKIWQCGGKILFVPCSRVGHVYRSL 391 (603)
T ss_pred CCCHHHHHhhhccCCCCCCcccccceeeehHHHHHHhccCCCcceeecCcceeEEEEEEeeCCEEEEEEccccchhhhcc
Confidence 8999999999999999999999999999999999999999999999999999997752 122
Q ss_pred --------------ChhhHHHHHHH-hccccchhheecc---------cCCCCc---eeccCCCCCCCCCC------eEE
Q psy3380 190 --------------DVTSRKELRRN-LGCKSFKWYLEVS---------NDWSGM---CIDSACKPTDMHKP------VGL 236 (290)
Q Consensus 190 --------------dv~~r~~lR~~-l~ck~FkWyl~nv---------~~~~~~---CLD~~~~~~~~g~~------v~l 236 (290)
.+...|-.|.. .|+|++|-|+..- ++.+.. --|..++++++--. ..-
T Consensus 392 mpy~fgk~~~k~~~p~i~iNy~RVvetW~Ddyk~YfytreP~a~~l~~GDISeqlalr~~lnCkSFkWfMe~iAYDv~~~ 471 (603)
T KOG3737|consen 392 MPYQFGKPPIKVGSPPILINYVRVVETWWDDYKDYFYTREPEAQALPYGDISEQLALREDLNCKSFKWFMEEIAYDVTSH 471 (603)
T ss_pred ccccCCCCccccCCCceEeehhhHHHHHHHhhhhheeecChhhccCCcccHHHHHHhHhhcCCchhHHHHHHHHHHHHhc
Confidence 33344667764 7999998776543 111110 01122222211100 001
Q ss_pred EcCCCCCCcceeEEcccccccc--CCceeeeeC----CeEEEEeCCCCCCceeEEEcc
Q psy3380 237 YPCHKQGGNQFWMMSKHGEIRR--DEACLDYAG----GDVILYPCHGSKGNQYFEYDY 288 (290)
Q Consensus 237 ~~Ch~~~~nQ~w~lt~~G~Ir~--~~~CLD~~g----~~V~l~~C~g~~~nQ~W~~~~ 288 (290)
|| ....|..| |+||+ ++.|||..| +.+++..||++++||.++++.
T Consensus 472 yP--~lP~N~~W-----GE~R~~at~~ClDsMG~~p~g~mglt~CHg~GgNQL~RlN~ 522 (603)
T KOG3737|consen 472 YP--LLPKNVDW-----GEIRGFATAYCLDSMGKTPGGFMGLTPCHGMGGNQLFRLNE 522 (603)
T ss_pred CC--CCCCCCcc-----hhccCcccchhHHhcCCCCCCccccccccCCCCceEEEecc
Confidence 11 12356777 99998 578999975 468999999999999999874
No 7
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=99.91 E-value=4.6e-24 Score=196.30 Aligned_cols=181 Identities=48% Similarity=0.849 Sum_probs=150.4
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAA 81 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~ 81 (290)
+|+++.++||++|++.+.+++..+++|.|+.++.+++.|... .....++|+|.+.+.|.++..... .+....
T Consensus 92 ~D~~~~~~wL~~ll~~l~~~~~~~v~p~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 163 (299)
T cd02510 92 SHCEVNVGWLEPLLARIAENRKTVVCPIIDVIDADTFEYRGS-------SGDARGGFDWSLHFKWLPLPEEER-RRESPT 163 (299)
T ss_pred CCcccCccHHHHHHHHHHhCCCeEEEeeeccccCCCeeEecC-------CCceeEEecccceeccccCCHHHh-hhcCCC
Confidence 689999999999999999999999999999999988887532 122578899999999987655443 334455
Q ss_pred CcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe--
Q psy3380 82 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK-- 159 (290)
Q Consensus 82 ~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~-- 159 (290)
.|..+|++.|++++|.++.|..+|+||+++..||+||.||++|+ | +||+.+...|
T Consensus 164 ~~~~~~~~~g~~~~irr~~~~~vGgfDe~~~~~~~ED~Dl~~R~-~----------------------~~G~~i~~~p~a 220 (299)
T cd02510 164 APIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLELSFKV-W----------------------QCGGSIEIVPCS 220 (299)
T ss_pred CCccCccccceeeEEEHHHHHHhCCCCCcccccCchhHHHHHHH-H----------------------HcCCeEEEeecc
Confidence 67889999999999999999999999999999999999999999 9 9999999888
Q ss_pred ---eeec-ccCCCCCCCc--ccccccccc------------------ccCCCCCChhhHHHHHHHhccccchhheecc
Q psy3380 160 ---AFFE-KLGTYDSGFD--IWGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYLEVS 213 (290)
Q Consensus 160 ---~~F~-~~g~y~~g~~--~~~~e~~e~------------------~~~~~~Gdv~~r~~lR~~l~ck~FkWyl~nv 213 (290)
|+++ ...+|+.+.. ....|..|+ ......||+.+|+++++++.||+|+||++.+
T Consensus 221 ~v~H~~~~~~~~~~~~~~~~~~~~n~~r~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~ 298 (299)
T cd02510 221 RVGHIFRRKRKPYTFPGGSGTVLRNYKRVAEVWMDEYKEYFYKARPELRNIDYGDLSERKALRERLKCKSFKWYLENV 298 (299)
T ss_pred EEEEeccccCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCCCCHHHHHHHHHHcCCCCchHHHhhc
Confidence 8888 7888865433 244555444 1125679999999999999999999998764
No 8
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=98.83 E-value=1.3e-08 Score=79.76 Aligned_cols=70 Identities=40% Similarity=0.859 Sum_probs=58.9
Q ss_pred CCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEccccccccC--CceeeeeC----CeEEEEeCCCCCCceeEEEcc
Q psy3380 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD--EACLDYAG----GDVILYPCHGSKGNQYFEYDY 288 (290)
Q Consensus 217 ~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~~--~~CLD~~g----~~V~l~~C~g~~~nQ~W~~~~ 288 (290)
+++|||+.... .+..+.+++|++...+|.|.++..|.|+.. ++|||+.+ ..+.+++|++...+|+|.+..
T Consensus 9 ~~~cL~~~~~~--~~~~v~~~~c~~~~~~Q~W~~~~~g~~~~~~~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~~ 84 (124)
T cd00161 9 TGLCLDVNGGS--DGGPVQLYPCHGNGNNQKWTLTSDGTIRIKSSNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNK 84 (124)
T ss_pred CCeEEECCCCC--CCCEEEEEECCCCCccCCEEEeCCCeEEEcCCCeEEcccCCCCCCEEEEEECCCCCcCCEEEECC
Confidence 78999987653 467899999998658999999989998875 89999864 689999999855799999875
No 9
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=98.79 E-value=9.1e-09 Score=81.37 Aligned_cols=77 Identities=38% Similarity=0.859 Sum_probs=61.5
Q ss_pred heecccCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEccccccccC---CceeeeeCC----eEEEEeCCCCCCc
Q psy3380 209 YLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGG----DVILYPCHGSKGN 281 (290)
Q Consensus 209 yl~nv~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~~---~~CLD~~g~----~V~l~~C~g~~~n 281 (290)
++.+++. +++|||+. .....+.++.+++|++ ..+|.|.++..+.|+.. ..||++.+. .|.+.+|+....+
T Consensus 4 ~i~~~~~-~~~cl~~~-~~~~~~~~v~l~~c~~-~~~Q~w~~~~~~~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~~~ 80 (124)
T PF00652_consen 4 YIRNVNK-SGLCLDVQ-GSTKNGSPVVLYPCDG-SDNQLWRFDPDGQIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNSSN 80 (124)
T ss_dssp EEEEEEG-GGEEEEEG-GSSSTTTBEEEEE--S-SGGGEEEEETTSBEEETTETTEEEEESSSSTTEBEEEEETTTTGGG
T ss_pred EEEEeeC-CCCeEEEc-CCCCCCCEEEEEECCC-CCceeEEEcCCCceeeccCcceEEEeeccCCCceEEEeeccCCccC
Confidence 4556556 89999998 5556678999999999 67899999999999874 349999754 4999999987667
Q ss_pred eeEEEcc
Q psy3380 282 QYFEYDY 288 (290)
Q Consensus 282 Q~W~~~~ 288 (290)
|.|.++.
T Consensus 81 Q~W~~~~ 87 (124)
T PF00652_consen 81 QRWKFDP 87 (124)
T ss_dssp GBEEEET
T ss_pred CeEEEcC
Confidence 9999976
No 10
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=98.77 E-value=2e-08 Score=78.82 Aligned_cols=69 Identities=41% Similarity=0.905 Sum_probs=58.0
Q ss_pred CCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEcccccccc-CCceeeeeC---CeEEEEeCCCCCCceeEEEcc
Q psy3380 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR-DEACLDYAG---GDVILYPCHGSKGNQYFEYDY 288 (290)
Q Consensus 216 ~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~-~~~CLD~~g---~~V~l~~C~g~~~nQ~W~~~~ 288 (290)
.+++|+|+.+.. ..+.+++|++.+.+|.|.++..|.|+. .++|||+.+ ..|.++.|++...+|+|.+..
T Consensus 5 ~~~~Cl~~~~~~----~~v~l~~c~~~~~~Q~w~~~~~g~~~~~~~~Cl~~~~~~~~~v~l~~c~~~~~~Q~W~~~~ 77 (117)
T smart00458 5 NTGKCLDVNGNS----NPVGLFDCHGTGGNQLWKLTSDGAIRIATDLCLTANGNTGSTVTLYSCDGDADNQYWTVNK 77 (117)
T ss_pred cCCccEecCCCC----ceEEEEeCCCCCccceEEEeCCCeEEecCCccCccCCCCCCEEEEEECCCCCcCCEEEECC
Confidence 568899986542 478999999856789999999999988 799999975 479999999866799999875
No 11
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=98.45 E-value=5.1e-07 Score=70.55 Aligned_cols=67 Identities=36% Similarity=0.757 Sum_probs=55.3
Q ss_pred CCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEcccccccc--CCceeeeeC-----CeEEEEeCCCCCCceeEEE
Q psy3380 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR--DEACLDYAG-----GDVILYPCHGSKGNQYFEY 286 (290)
Q Consensus 217 ~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~--~~~CLD~~g-----~~V~l~~C~g~~~nQ~W~~ 286 (290)
+++|||+.... .+..+.+++|+....+|.|.+...+.|++ .++|||+.+ ..|.+++|++. .+|+|.+
T Consensus 51 ~~~Cl~~~~~~--~~~~~~~~~c~~~~~~Q~W~~~~~~~i~~~~~~~cl~~~~~~~~~~~v~~~~c~~~-~~Q~W~~ 124 (124)
T cd00161 51 SNLCLDVGGDA--PGSKVRLYTCSGGSDNQRWTFNKDGTIRNLKSGKCLDVKGGNTNGTNLILWTCDGG-PNQKWKF 124 (124)
T ss_pred CCeEEcccCCC--CCCEEEEEECCCCCcCCEEEECCCcEEEECCCCeEEeCCCCCCCCCEEEEEeCCCC-ccceEeC
Confidence 68999986543 46789999999866899999988788887 479999863 57999999987 6999975
No 12
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=98.44 E-value=2.7e-07 Score=72.84 Aligned_cols=76 Identities=32% Similarity=0.721 Sum_probs=57.3
Q ss_pred chhheecc-----cCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEcccccccc--CCceeeee----CCeEEEEe
Q psy3380 206 FKWYLEVS-----NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR--DEACLDYA----GGDVILYP 274 (290)
Q Consensus 206 FkWyl~nv-----~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~--~~~CLD~~----g~~V~l~~ 274 (290)
..|+++.. ......||++.... .+..+.+++|+....+|.|.+...+.|++ .++|||+. +..|.+.+
T Consensus 38 Q~w~~~~~~~i~~~~~~~~CL~~~~~~--~~~~i~l~~C~~~~~~Q~W~~~~~~~i~n~~s~~cL~~~~~~~~~~l~~~~ 115 (124)
T PF00652_consen 38 QLWRFDPDGQIRSNNNPNLCLDVDGSS--PGTKIVLWPCDSNSSNQRWKFDPDGRIRNKNSGLCLDVKGGSDGNPLVLWP 115 (124)
T ss_dssp GEEEEETTSBEEETTETTEEEEESSSS--TTEBEEEEETTTTGGGGBEEEETTSBEEETTTTEEEEEGGGSTTEBEEEEE
T ss_pred eeEEEcCCCceeeccCcceEEEeeccC--CCceEEEeeccCCccCCeEEEcCCeeEEeCCCCEEEEecCCCCCCEEEEEE
Confidence 45765543 22334599987654 56789999999866779999988889987 47999995 56899999
Q ss_pred CCCCCCceeE
Q psy3380 275 CHGSKGNQYF 284 (290)
Q Consensus 275 C~g~~~nQ~W 284 (290)
|++. .+|+|
T Consensus 116 c~~~-~~Q~W 124 (124)
T PF00652_consen 116 CNGS-PNQQW 124 (124)
T ss_dssp -TSS-GGGBE
T ss_pred CCCC-ccccC
Confidence 9655 69998
No 13
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=98.35 E-value=9.8e-07 Score=68.89 Aligned_cols=72 Identities=32% Similarity=0.466 Sum_probs=57.1
Q ss_pred cCCCCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEcccc----cccc--CCceeeee------CCeEEEEeC-CCCCC
Q psy3380 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG----EIRR--DEACLDYA------GGDVILYPC-HGSKG 280 (290)
Q Consensus 214 ~~~~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G----~Ir~--~~~CLD~~------g~~V~l~~C-~g~~~ 280 (290)
+..+++|||+.......|..+.+++|++ ..+|.|.+...+ .|++ .++|||+. |+.|.+++| ++. .
T Consensus 21 n~~sg~~L~v~~~~~~~g~~v~~~~~~~-~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~~~~~~g~~v~~~~~~~~~-~ 98 (105)
T PF14200_consen 21 NVNSGKYLDVAGGSTANGTNVQQWTCNG-NDNQQWKIEPVGDGYYRIRNKNSGKVLDVAGGSTANGTNVQQWEYDNGS-D 98 (105)
T ss_dssp ETTTTEEEEEGCTTCSTTEBEEEEESSS-SGGGEEEEEESTTSEEEEEETSTTEEEEEGGGSSSTTEBEEEEE-STSS-G
T ss_pred ECCCCCEEEeCCCCcCCCcEEEEecCCC-CcCcEEEEEEecCCeEEEEECCCCcEEEECCCCCCCCCEEEEEeCCCCC-c
Confidence 4578999999887777888999999998 689999987543 3544 58999995 357999999 654 6
Q ss_pred ceeEEEc
Q psy3380 281 NQYFEYD 287 (290)
Q Consensus 281 nQ~W~~~ 287 (290)
+|+|.+.
T Consensus 99 ~Q~W~l~ 105 (105)
T PF14200_consen 99 NQQWKLE 105 (105)
T ss_dssp GGEEEEE
T ss_pred cCEEEeC
Confidence 9999974
No 14
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=98.34 E-value=1.3e-06 Score=68.35 Aligned_cols=66 Identities=35% Similarity=0.808 Sum_probs=53.9
Q ss_pred CCceeccCCCCCCCCCCeEEEcCCCCCCcceeEEcccccccc--CCceeeeeCC----eEEEEeCCCCCCceeEEE
Q psy3380 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR--DEACLDYAGG----DVILYPCHGSKGNQYFEY 286 (290)
Q Consensus 217 ~~~CLD~~~~~~~~g~~v~l~~Ch~~~~nQ~w~lt~~G~Ir~--~~~CLD~~g~----~V~l~~C~g~~~nQ~W~~ 286 (290)
+++|||+..... ..+.++.|++...+|.|.+...|.|++ .++|||..+. .+.++.|++. .+|+|.+
T Consensus 45 ~~~Cl~~~~~~~---~~v~l~~c~~~~~~Q~W~~~~~~~i~~~~~~~cl~~~~~~~~~~~~~~~c~~~-~~Q~W~~ 116 (117)
T smart00458 45 TDLCLTANGNTG---STVTLYSCDGDADNQYWTVNKDGTIRNPDSGLCLDVKDGNTGTKVILWTCNGN-PNQKWIF 116 (117)
T ss_pred CCccCccCCCCC---CEEEEEECCCCCcCCEEEECCCeeEEeCCCCEEEecCCCCCCceEEEEeCCCC-ccccEEe
Confidence 678999865321 578999999866899999988888886 6899998642 6999999987 5999986
No 15
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=98.31 E-value=7.6e-07 Score=81.87 Aligned_cols=159 Identities=35% Similarity=0.559 Sum_probs=104.8
Q ss_pred chHHHH-HHHhcCCCeEEEeeEeeecCCCc---ccc---------------cCCCcc----------CCCCCceEEEEee
Q psy3380 10 WLQPLL-DVLARNSSHVVSPLIANICDDTF---ELR---------------FPPGRL----------TSSYKFFIGGFDW 60 (290)
Q Consensus 10 Wl~pll-~~i~~~~~~~v~p~I~~id~~t~---~y~---------------~~~~~~----------~~~~~~~~g~fdw 60 (290)
.|+..| +.+++++.....+||.|+|.+|- +.. ..+... ..+.|..+.++|.
T Consensus 13 ~l~~~l~Sl~~~~~~~~~~EIIvVDd~S~d~t~~~~~~~~~~~~~~~v~vi~~~~n~G~~~a~N~g~~~A~gd~i~fLD~ 92 (299)
T cd02510 13 TLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKREGLIRARIAGARAATGDVLVFLDS 92 (299)
T ss_pred HHHHHHHHHHhcCchhcCCEEEEEECCCCchHHHHHHHHHHhhcCCcEEEEEcCCCCCHHHHHHHHHHHccCCEEEEEeC
Confidence 344444 44455655555699999998752 111 111111 2366778888998
Q ss_pred eeeeE--Ee-eCCHHHHhcccCCCCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhc
Q psy3380 61 NLQFN--WH-AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKR 137 (290)
Q Consensus 61 ~l~~~--w~-~~~~~~~~~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~ 137 (290)
++.+. |. .+.+....+ +. -+++|.+ ..++.+.|.+.+.-. ..-|+++|.+.+. |...++... .
T Consensus 93 D~~~~~~wL~~ll~~l~~~---~~-~~v~p~~----~~~~~~~~~~~~~~~---~~~~~~~~~~~~~--~~~~~~~~~-~ 158 (299)
T cd02510 93 HCEVNVGWLEPLLARIAEN---RK-TVVCPII----DVIDADTFEYRGSSG---DARGGFDWSLHFK--WLPLPEEER-R 158 (299)
T ss_pred CcccCccHHHHHHHHHHhC---CC-eEEEeee----ccccCCCeeEecCCC---ceeEEecccceec--cccCCHHHh-h
Confidence 87664 63 333333221 11 1344443 567777776644311 1468889988886 877766554 5
Q ss_pred cCCCCCccccCCCCCeEEEEEeeeecccCCCCCCCcccccccccc
Q psy3380 138 HKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182 (290)
Q Consensus 138 ~~~~~~p~~~p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~ 182 (290)
+..+..|.++|++.|+.++|.+..|.++|.||+++..|+.|.+++
T Consensus 159 ~~~~~~~~~~~~~~g~~~~irr~~~~~vGgfDe~~~~~~~ED~Dl 203 (299)
T cd02510 159 RESPTAPIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLEL 203 (299)
T ss_pred hcCCCCCccCccccceeeEEEHHHHHHhCCCCCcccccCchhHHH
Confidence 566778899999999999999999999999999999999898876
No 16
>PF02709 Glyco_transf_7C: N-terminal domain of galactosyltransferase; InterPro: IPR003859 This is a family of galactosyltransferases from a wide range of metazoa with three related galactosyltransferase activities; all three of which are possessed by one sequence in some cases. The three functions are N-acetyllactosamine synthase (2.4.1.90 from EC); beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase (2.4.1.38 from EC); and lactose synthase (2.4.1.22 from EC). Note that N-acetyllactosamine synthase is a component of lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin N-acetyllactosamine synthase is used.; GO: 0016757 transferase activity, transferring glycosyl groups, 0005975 carbohydrate metabolic process; PDB: 2AGD_B 3EE5_A 2AE7_B 2AEC_A 2FYA_A 2AES_B 2AH9_A 2FYB_A 2FY7_A 3LW6_A ....
Probab=98.10 E-value=7.2e-07 Score=66.42 Aligned_cols=41 Identities=39% Similarity=0.831 Sum_probs=34.5
Q ss_pred CccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
.+.+.||.++|+++.|..+|+||+.+..|||||.||+.|+ +
T Consensus 16 ~~~~~Gg~~~~~~~~f~~vnGfde~f~gWG~ED~Dl~~Rl-~ 56 (78)
T PF02709_consen 16 YPNFFGGVFAISREDFEKVNGFDERFWGWGGEDDDLYNRL-W 56 (78)
T ss_dssp STT---SEEEEEHHHHHHTTSS-SS-TSCSSHHHHHHHHH-H
T ss_pred CCCeeEEEEEEeHHHHHHcCCCCccccccCccHHHHHHHH-H
Confidence 5788899999999999999999999999999999999998 7
No 17
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ].
Probab=97.48 E-value=8.7e-05 Score=68.00 Aligned_cols=41 Identities=24% Similarity=0.571 Sum_probs=37.4
Q ss_pred CccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
...++||+++|+++.|..+||||+.+..||+||.||.+|+ -
T Consensus 165 ~~~~~s~~~~i~r~~f~~iGGfDE~f~G~G~ED~D~~~RL-~ 205 (281)
T PF10111_consen 165 FIAFASSCFLINREDFLEIGGFDERFRGWGYEDIDFGYRL-K 205 (281)
T ss_pred cccccceEEEEEHHHHHHhCCCCccccCCCcchHHHHHHH-H
Confidence 3456789999999999999999999999999999999997 5
No 18
>cd00899 b4GalT Beta-4-Galactosyltransferase is involved in the formation of the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. Beta-4-Galactosyltransferase transfers galactose from uridine diphosphogalactose to the terminal beta-N-acetylglucosamine residues, hereby forming the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. At least seven homologous beta-4-galactosyltransferase isoforms have been identified that use different types of glycoproteins and glycolipids as substrates. Of the seven identified members of the beta-1,4-galactosyltransferase subfamily (beta1,4-Gal-T1 to -T7), b1,4-Gal-T1 is most characterized (biochemically). It is a Golgi-resident type II membrane enzyme with a cytoplasmic domain, membrane spanning region, and a stem region and catalytic domain facing the lumen.
Probab=97.20 E-value=0.00016 Score=64.10 Aligned_cols=40 Identities=25% Similarity=0.547 Sum_probs=37.8
Q ss_pred ccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 87 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 87 P~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
+.+.||++++.++.|..++|||+++..|||||.||+.|+ |
T Consensus 109 ~~~~Gg~~~~~k~~f~~VNGf~n~f~GWGgEDdd~~~Rl-~ 148 (219)
T cd00899 109 KTYFGGVLALTREQFRKVNGFSNAYWGWGGEDDDLYNRI-K 148 (219)
T ss_pred ccccccceeeEHHHHHHhCCcCCcCccCCcchHHHHHHH-H
Confidence 457899999999999999999999999999999999998 8
No 19
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=96.62 E-value=0.0029 Score=58.50 Aligned_cols=117 Identities=25% Similarity=0.294 Sum_probs=73.4
Q ss_pred cccCCchHHHHHHHhcCCCeEEE-eeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCCCc
Q psy3380 5 EVQKRWLQPLLDVLARNSSHVVS-PLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEP 83 (290)
Q Consensus 5 e~~~~Wl~pll~~i~~~~~~~v~-p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~~p 83 (290)
.++++||+-|++.+.+++...+. |.|...|.+..-- .+++....+...|......+....... .+
T Consensus 96 ~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 161 (305)
T COG1216 96 VVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYID-------------RRGGESDGLTGGWRASPLLEIAPDLSS-YL 161 (305)
T ss_pred eeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchh-------------eeccccccccccceecccccccccccc-hh
Confidence 56789999999999999876666 5555544432111 011112222222321111110000011 11
Q ss_pred ccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe
Q psy3380 84 VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159 (290)
Q Consensus 84 ~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~ 159 (290)
...+.+.|+++.|+++.|+.+|++|..+.++ .||.|+++|+ . .+|..+...|
T Consensus 162 ~~~~~~~G~~~li~~~~~~~vG~~de~~F~y-~eD~D~~~R~-~----------------------~~G~~i~~~p 213 (305)
T COG1216 162 EVVASLSGACLLIRREAFEKVGGFDERFFIY-YEDVDLCLRA-R----------------------KAGYKIYYVP 213 (305)
T ss_pred hhhhhcceeeeEEcHHHHHHhCCCCccccee-ehHHHHHHHH-H----------------------HcCCeEEEee
Confidence 1222678999999999999999999988776 5999999998 7 8898888887
No 20
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=96.43 E-value=0.0035 Score=52.34 Aligned_cols=39 Identities=28% Similarity=0.440 Sum_probs=34.8
Q ss_pred cccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
...|+.+++.++.|...|+|++.+..||+||+||++|+ +
T Consensus 125 ~~~~~~~~~~r~~~~~~ggf~~~~~~~~~eD~~l~~r~-~ 163 (182)
T cd06420 125 GIRGCNMSFWKKDLLAVNGFDEEFTGWGGEDSELVARL-L 163 (182)
T ss_pred EeccceEEEEHHHHHHhCCCCcccccCCcchHHHHHHH-H
Confidence 34567788999999999999999989999999999998 7
No 21
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=95.47 E-value=0.012 Score=49.87 Aligned_cols=45 Identities=16% Similarity=0.392 Sum_probs=37.6
Q ss_pred CceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEee
Q psy3380 91 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160 (290)
Q Consensus 91 ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~ 160 (290)
++..+++++.|..+|+||+++. ++||+||++|+ + .+|+.+...|.
T Consensus 151 ~~~~~~~r~~~~~iggf~~~~~--~~eD~~l~~rl-~----------------------~~g~~~~~~~~ 195 (202)
T cd04184 151 GHLLVYRRSLVRQVGGFREGFE--GAQDYDLVLRV-S----------------------EHTDRIAHIPR 195 (202)
T ss_pred cceEeEEHHHHHHhCCCCcCcc--cchhHHHHHHH-H----------------------hccceEEEccH
Confidence 4445799999999999998753 89999999998 7 88888877773
No 22
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=95.15 E-value=0.031 Score=48.64 Aligned_cols=46 Identities=15% Similarity=0.060 Sum_probs=38.7
Q ss_pred CceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEee
Q psy3380 91 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160 (290)
Q Consensus 91 ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~ 160 (290)
|...+|+++.|+.+|+||+.+. .+++|.+|++|+ . ++|..+...|.
T Consensus 153 ~~~~~~rr~~~~~~ggfd~~~~-~~~eD~d~~~r~-~----------------------~~G~~~~~~~~ 198 (237)
T cd02526 153 TSGSLISLEALEKVGGFDEDLF-IDYVDTEWCLRA-R----------------------SKGYKIYVVPD 198 (237)
T ss_pred ccceEEcHHHHHHhCCCCHHHc-CccchHHHHHHH-H----------------------HcCCcEEEEcC
Confidence 3335799999999999998865 578999999998 7 89988888883
No 23
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=94.78 E-value=0.035 Score=47.05 Aligned_cols=95 Identities=14% Similarity=0.082 Sum_probs=57.6
Q ss_pred ceecccCCchHHHHHHHhcCC-CeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNS-SHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNA 80 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~-~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~ 80 (290)
+.+...++||+.+++.+.+++ ..++...+..++.+.-.... ...+............
T Consensus 89 ~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~ 146 (201)
T cd04195 89 TDDISLPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIGK----------------------RRLPTSHDDILKFARR 146 (201)
T ss_pred CccccCcHHHHHHHHHHHhCCCeEEEcccEEEECCCCCeecc----------------------ccCCCCHHHHHHHhcc
Confidence 456778899999999997765 45556656555544211100 0011111111111122
Q ss_pred CCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 81 AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 81 ~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
.+|+..|. +++.++.|..+|+|++. .++||++|..++ .
T Consensus 147 ~~~~~~~~-----~~~rr~~~~~~g~~~~~---~~~eD~~~~~r~-~ 184 (201)
T cd04195 147 RSPFNHPT-----VMFRKSKVLAVGGYQDL---PLVEDYALWARM-L 184 (201)
T ss_pred CCCCCChH-----HhhhHHHHHHcCCcCCC---CCchHHHHHHHH-H
Confidence 34444444 48999999999999865 589999999986 5
No 24
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=94.71 E-value=0.023 Score=43.92 Aligned_cols=36 Identities=28% Similarity=0.336 Sum_probs=29.7
Q ss_pred cCCCCceeccCCCCCCCCCCeEEEcC-CCCCCcceeEE
Q psy3380 214 NDWSGMCIDSACKPTDMHKPVGLYPC-HKQGGNQFWMM 250 (290)
Q Consensus 214 ~~~~~~CLD~~~~~~~~g~~v~l~~C-h~~~~nQ~w~l 250 (290)
+..+++|||+.+.+...|..+.+|+| ++ ..+|.|.+
T Consensus 68 n~~s~~~Ldv~~~~~~~g~~v~~~~~~~~-~~~Q~W~l 104 (105)
T PF14200_consen 68 NKNSGKVLDVAGGSTANGTNVQQWEYDNG-SDNQQWKL 104 (105)
T ss_dssp ETSTTEEEEEGGGSSSTTEBEEEEE-STS-SGGGEEEE
T ss_pred ECCCCcEEEECCCCCCCCCEEEEEeCCCC-CccCEEEe
Confidence 45788999998877778899999999 55 68999987
No 25
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=94.49 E-value=0.021 Score=49.75 Aligned_cols=110 Identities=17% Similarity=0.168 Sum_probs=68.7
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeE--EeeCCHHHHhcccC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFN--WHAIPERERKRHKN 79 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~--w~~~~~~~~~~~~~ 79 (290)
+.+...++||+.+++.+.+.+..++...+..++...+..... . .+...+... +..+.
T Consensus 90 ~D~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~--~----------~~~~~~~~~~~~~~~~--------- 148 (249)
T cd02525 90 AHAVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIA--V----------AQSSPLGSGGSAYRGG--------- 148 (249)
T ss_pred CCccCCHHHHHHHHHHHhcCCCCEEecceecCCCChHHHHHH--H----------HhhchhccCCcccccc---------
Confidence 456788999999999888877777766665554443211000 0 000000000 00000
Q ss_pred CCCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe
Q psy3380 80 AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159 (290)
Q Consensus 80 ~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~ 159 (290)
.... .....|+..++.++.|..+|+++..+. .+||+||.+|+ + .+|+.+...|
T Consensus 149 ~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~--~~eD~~l~~r~-~----------------------~~G~~~~~~~ 201 (249)
T cd02525 149 AVKI--GYVDTVHHGAYRREVFEKVGGFDESLV--RNEDAELNYRL-R----------------------KAGYKIWLSP 201 (249)
T ss_pred cccc--ccccccccceEEHHHHHHhCCCCcccC--ccchhHHHHHH-H----------------------HcCcEEEEcC
Confidence 0000 223456777999999999999998754 58999999998 8 8899988777
No 26
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=94.15 E-value=0.063 Score=46.08 Aligned_cols=31 Identities=26% Similarity=0.293 Sum_probs=28.1
Q ss_pred eeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 93 LFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 93 l~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
.+++.++.|+.+|+||+.+ .+||++|.+|+ +
T Consensus 140 ~~~~r~~~~~~~G~fd~~~---~~ED~d~~~r~-~ 170 (221)
T cd02522 140 GLFIRRELFEELGGFPELP---LMEDVELVRRL-R 170 (221)
T ss_pred eEEEEHHHHHHhCCCCccc---cccHHHHHHHH-H
Confidence 4689999999999999876 88999999998 8
No 27
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=93.95 E-value=0.067 Score=48.39 Aligned_cols=46 Identities=24% Similarity=0.288 Sum_probs=38.9
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~ 159 (290)
+.+|+ .|+++.|+.+|++|+.+.++ ++|.|+++|+ . ..|..+.+.|
T Consensus 149 ~~sg~-li~~~~~~~iG~fde~~fi~-~~D~e~~~R~-~----------------------~~G~~i~~~~ 194 (281)
T TIGR01556 149 ISSGC-LITREVYQRLGMMDEELFID-HVDTEWSLRA-Q----------------------NYGIPLYIDP 194 (281)
T ss_pred EcCcc-eeeHHHHHHhCCccHhhccc-chHHHHHHHH-H----------------------HCCCEEEEeC
Confidence 34554 69999999999999998875 5999999998 7 7888888888
No 28
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=93.84 E-value=0.083 Score=45.74 Aligned_cols=50 Identities=18% Similarity=0.291 Sum_probs=34.3
Q ss_pred CccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEeee
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF 161 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~~ 161 (290)
.+.+.|+.++|.++.|+.+|++|+. ..+||++|++|+ . ..|..+...|..
T Consensus 156 ~~~~~G~~~~~rr~~~~~~g~fd~~---~~~eD~~l~~r~-~----------------------~~G~~~~~~~~~ 205 (228)
T PF13641_consen 156 VAFLSGSGMLFRRSALEEVGGFDPF---ILGEDFDLCLRL-R----------------------AAGWRIVYAPDA 205 (228)
T ss_dssp -S-B--TEEEEEHHHHHHH-S--SS---SSSHHHHHHHHH-H----------------------HTT--EEEEEEE
T ss_pred eeeccCcEEEEEHHHHHHhCCCCCC---CcccHHHHHHHH-H----------------------HCCCcEEEECCc
Confidence 3556799999999999999999982 256999999998 7 888888888743
No 29
>PF02709 Glyco_transf_7C: N-terminal domain of galactosyltransferase; InterPro: IPR003859 This is a family of galactosyltransferases from a wide range of metazoa with three related galactosyltransferase activities; all three of which are possessed by one sequence in some cases. The three functions are N-acetyllactosamine synthase (2.4.1.90 from EC); beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase (2.4.1.38 from EC); and lactose synthase (2.4.1.22 from EC). Note that N-acetyllactosamine synthase is a component of lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin N-acetyllactosamine synthase is used.; GO: 0016757 transferase activity, transferring glycosyl groups, 0005975 carbohydrate metabolic process; PDB: 2AGD_B 3EE5_A 2AE7_B 2AEC_A 2FYA_A 2AES_B 2AH9_A 2FYB_A 2FY7_A 3LW6_A ....
Probab=92.54 E-value=0.043 Score=40.70 Aligned_cols=35 Identities=43% Similarity=0.860 Sum_probs=27.8
Q ss_pred CCCCCeEEEEEeeeecccCCCCCCCcccccccccc
Q psy3380 148 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182 (290)
Q Consensus 148 p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~ 182 (290)
+..+||.+++.+.-|.+++.||+.+.-||+|..++
T Consensus 17 ~~~~Gg~~~~~~~~f~~vnGfde~f~gWG~ED~Dl 51 (78)
T PF02709_consen 17 PNFFGGVFAISREDFEKVNGFDERFWGWGGEDDDL 51 (78)
T ss_dssp TT---SEEEEEHHHHHHTTSS-SS-TSCSSHHHHH
T ss_pred CCeeEEEEEEeHHHHHHcCCCCccccccCccHHHH
Confidence 56999999999999999999999999999999765
No 30
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=90.86 E-value=0.2 Score=43.30 Aligned_cols=31 Identities=19% Similarity=0.412 Sum_probs=25.7
Q ss_pred eeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 94 FSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 94 ~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
++|+++.|+.+|+|++... -.+||++|.+|+
T Consensus 161 ~~~rr~~~~~~g~f~~~~~-~~~eD~~l~~r~ 191 (219)
T cd06913 161 WFCSREWFSHVGPFDEGGK-GVPEDLLFFYEH 191 (219)
T ss_pred ceeehhHHhhcCCccchhc-cchhHHHHHHHH
Confidence 5799999999999997643 246999999996
No 31
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=90.23 E-value=0.23 Score=40.03 Aligned_cols=76 Identities=14% Similarity=0.234 Sum_probs=48.5
Q ss_pred CCCceEEEEeeeeeeEEeeCCHHHHhcccCCCCcccCccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 50 SYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 50 ~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
+.+....++|-...+.-..+..-...-...+..-+.++...|+.++++++.|..+|+|+..+.. +++|++|..|+ .
T Consensus 73 ~~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~eD~~~~~~~-~ 148 (166)
T cd04186 73 AKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPKVSGAFLLVRREVFEEVGGFDEDFFL-YYEDVDLCLRA-R 148 (166)
T ss_pred CCCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEccCceeeEeeeHHHHHHcCCCChhhhc-cccHHHHHHHH-H
Confidence 3455566777665543211111111111222223445558899999999999999999987655 88999999997 6
No 32
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=90.07 E-value=0.18 Score=42.92 Aligned_cols=49 Identities=14% Similarity=0.312 Sum_probs=38.9
Q ss_pred cccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEee
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~ 160 (290)
...|+..+|+++.|+.+|+|+. +....+||++|+.|+ . ..|..+.+.|.
T Consensus 123 ~~~g~~~~~r~~~~~~~ggf~~-~~~~~~eD~~l~~rl-~----------------------~~G~~i~~~~~ 171 (196)
T cd02520 123 CAFGKSMALRREVLDAIGGFEA-FADYLAEDYFLGKLI-W----------------------RLGYRVVLSPY 171 (196)
T ss_pred cccCceeeeEHHHHHhccChHH-HhHHHHHHHHHHHHH-H----------------------HcCCeEEEcch
Confidence 4567888999999999999964 333468999999997 6 77888877764
No 33
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=89.98 E-value=0.56 Score=39.58 Aligned_cols=50 Identities=18% Similarity=0.256 Sum_probs=40.2
Q ss_pred ccccCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEeee
Q psy3380 87 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF 161 (290)
Q Consensus 87 P~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~~ 161 (290)
....|...+++++.|+++|++| ..... +||.++++|+ + .+|..+...|..
T Consensus 70 ~~~~G~~~~~r~~~l~~vg~~~-~~~~~-~ED~~l~~~l-~----------------------~~G~~~~~~~~~ 119 (193)
T PF13632_consen 70 LFLSGSGMLFRREALREVGGFD-DPFSI-GEDMDLGFRL-R----------------------RAGYRIVYVPDA 119 (193)
T ss_pred ccccCcceeeeHHHHHHhCccc-ccccc-cchHHHHHHH-H----------------------HCCCEEEEeccc
Confidence 3456888999999999999999 33333 4999999998 8 889888777743
No 34
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=89.40 E-value=0.12 Score=45.07 Aligned_cols=36 Identities=19% Similarity=0.339 Sum_probs=30.7
Q ss_pred cccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
...|...+++++.|..+|+||+.. .+||++|++|+ .
T Consensus 157 ~~~g~~~~~rr~~~~~iGgf~~~~---~~eD~dl~~r~-~ 192 (236)
T cd06435 157 IQHGTMCLIRRSALDDVGGWDEWC---ITEDSELGLRM-H 192 (236)
T ss_pred EEecceEEEEHHHHHHhCCCCCcc---ccchHHHHHHH-H
Confidence 445777899999999999999764 58999999998 6
No 35
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=87.49 E-value=0.39 Score=41.22 Aligned_cols=50 Identities=18% Similarity=0.022 Sum_probs=36.1
Q ss_pred cccCceeeechhhHhhhcccCCCCccc-CcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEeeeec
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIW-GGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE 163 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~-Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~~F~ 163 (290)
-..||.++++++.|..+|. .+... ..+++||.+|. | .+|..+...|..|.
T Consensus 150 ~~~~~~~~~~r~~~~~ig~---~~~~~~~~~~~~l~~~~-~----------------------~~g~~i~~~p~~~~ 200 (224)
T cd06442 150 DPTSGFRAYRREVLEKLID---SLVSKGYKFQLELLVRA-R----------------------RLGYRIVEVPITFV 200 (224)
T ss_pred CCCCccchhhHHHHHHHhh---hccCCCcEEeHHHHHHH-H----------------------HcCCeEEEeCeEEe
Confidence 3457888999999999971 12111 23567888887 8 89989888886664
No 36
>COG4092 Predicted glycosyltransferase involved in capsule biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=87.33 E-value=0.28 Score=44.70 Aligned_cols=35 Identities=29% Similarity=0.573 Sum_probs=32.2
Q ss_pred CceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 91 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 91 ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
.|++.|+++.|.-+||||+.+...|-||+|+--|+
T Consensus 179 T~~~liN~~~F~~tgGydE~F~GhG~EDfe~~~R~ 213 (346)
T COG4092 179 TNIFLINRRMFSLTGGYDERFRGHGSEDFEFLTRL 213 (346)
T ss_pred cceEEEehhHHHHhcCCccccccCCchhHHHHHHH
Confidence 36899999999999999999999999999998775
No 37
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=87.08 E-value=0.05 Score=52.07 Aligned_cols=38 Identities=18% Similarity=0.166 Sum_probs=30.9
Q ss_pred cccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
...|++.+|+++.|+.+|+||+-... -.||.+|..|+ .
T Consensus 208 ~~~G~~~lirr~~~~~vGGf~~~~~~-~~ED~~L~~r~-~ 245 (384)
T TIGR03469 208 AAAGGCILIRREALERIGGIAAIRGA-LIDDCTLAAAV-K 245 (384)
T ss_pred eecceEEEEEHHHHHHcCCHHHHhhC-cccHHHHHHHH-H
Confidence 35699999999999999999864333 36999999987 5
No 38
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=86.66 E-value=0.31 Score=42.04 Aligned_cols=38 Identities=13% Similarity=0.268 Sum_probs=31.4
Q ss_pred CccccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
.+...|+..+++++.|+.+|+|+... .+||++|..|+ +
T Consensus 157 ~~~~~g~~~~~r~~~~~~ig~~~~~~---~~eD~~l~~r~-~ 194 (234)
T cd06421 157 AAFCCGSGAVVRREALDEIGGFPTDS---VTEDLATSLRL-H 194 (234)
T ss_pred CceecCceeeEeHHHHHHhCCCCccc---eeccHHHHHHH-H
Confidence 44556888899999999999998543 58999999997 6
No 39
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=86.50 E-value=0.31 Score=42.48 Aligned_cols=43 Identities=23% Similarity=0.254 Sum_probs=35.5
Q ss_pred CceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEe
Q psy3380 91 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159 (290)
Q Consensus 91 ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~ 159 (290)
|+..++.++.|+.+|+|+... .+||++|+.|+ + +.|..+...|
T Consensus 164 g~~~~~rr~~~~~vgg~~~~~---~~ED~~l~~rl-~----------------------~~G~~~~~~~ 206 (232)
T cd06437 164 GTAGVWRKECIEDAGGWNHDT---LTEDLDLSYRA-Q----------------------LKGWKFVYLD 206 (232)
T ss_pred cchhhhhHHHHHHhCCCCCCc---chhhHHHHHHH-H----------------------HCCCeEEEec
Confidence 444578999999999999753 57999999998 7 8888887777
No 40
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=86.30 E-value=0.69 Score=39.14 Aligned_cols=33 Identities=30% Similarity=0.519 Sum_probs=27.8
Q ss_pred eeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 93 LFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 93 l~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
.++|.++.|..+|.+++.+.++ |+|.+++.++ .
T Consensus 125 ~~~~~~~~~~~~g~~~~~~~~~-~eD~~~~~r~-~ 157 (202)
T cd04185 125 GVLISRRVVEKIGLPDKEFFIW-GDDTEYTLRA-S 157 (202)
T ss_pred EEEEeHHHHHHhCCCChhhhcc-chHHHHHHHH-H
Confidence 3589999999999988876665 5999999997 6
No 41
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=83.99 E-value=0.84 Score=39.03 Aligned_cols=36 Identities=17% Similarity=0.205 Sum_probs=29.0
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
..|..+++.++.|+.+|+||.... ..+++.++.+|.
T Consensus 155 ~~g~~~~~rr~~~~~~ggf~~~~~-~~~eD~~~~~~~ 190 (229)
T cd04192 155 CNGANMAYRKEAFFEVGGFEGNDH-IASGDDELLLAK 190 (229)
T ss_pred cccceEEEEHHHHHHhcCCccccc-cccCCHHHHHHH
Confidence 347778999999999999986643 467888999886
No 42
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=80.49 E-value=1.3 Score=36.58 Aligned_cols=35 Identities=23% Similarity=0.251 Sum_probs=24.0
Q ss_pred CccccCceeeechhhHhhh--cccCCCCcccCccccccccc
Q psy3380 86 TPTMAGGLFSIDKAFFEKL--GTYDSGFDIWGGENLELSFK 124 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~--g~yd~~~~~~Gg~~~elsfk 124 (290)
.+.+.||.++++++.|+.+ |..+.++ +++.||.+|
T Consensus 148 ~~~~~~~~~~~~r~~~~~i~~~~~~~~~----~~~~~~~~~ 184 (185)
T cd04179 148 ISDTQSGFRLFRREVLEALLSLLESNGF----EFGLELLVG 184 (185)
T ss_pred CcCCCCceeeeHHHHHHHHHhhccccCc----ceeeEeeec
Confidence 4556788999999999999 5555443 344455544
No 43
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=78.46 E-value=8.5 Score=30.33 Aligned_cols=83 Identities=19% Similarity=0.248 Sum_probs=50.4
Q ss_pred ceecccCCchHHHHHHHhcCCCeEEEeeEeeecCCCcccccCCCccCCCCCceEEEEeeeeeeEEeeCCHHHHhcccCCC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAA 81 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~~v~p~I~~id~~t~~y~~~~~~~~~~~~~~~g~fdw~l~~~w~~~~~~~~~~~~~~~ 81 (290)
++++..++||+.|++.+.+++..++.......+.+....... ... ..+...........
T Consensus 87 ~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~----------~~~~~~~~~~~~~~---- 145 (169)
T PF00535_consen 87 DDDIISPDWLEELVEALEKNPPDVVIGSVIYIDDDNRYPDRR-------LRF----------SFWNRFERKIFNNI---- 145 (169)
T ss_dssp TTEEE-TTHHHHHHHHHHHCTTEEEEEEEEEEECTTETEECC-------CTS----------EEEECCHCHHHHTT----
T ss_pred CCceEcHHHHHHHHHHHHhCCCcEEEEEEEEecCCccccccc-------cch----------hhhhhhhhHHHHhh----
Confidence 457788999999999999988877776666666654333211 000 11111111111111
Q ss_pred CcccCccccCceeeechhhHhhhc
Q psy3380 82 EPVWTPTMAGGLFSIDKAFFEKLG 105 (290)
Q Consensus 82 ~p~~sP~~~ggl~~I~~~~F~~~g 105 (290)
.....+...|+..+++++.|+++|
T Consensus 146 ~~~~~~~~~~~~~~~rr~~~~~~~ 169 (169)
T PF00535_consen 146 RFWKISFFIGSCALFRRSVFEEIG 169 (169)
T ss_dssp HSTTSSEESSSCEEEEEHHHHHCH
T ss_pred hcCCcccccccEEEEEHHHHHhhC
Confidence 222366777888999999999875
No 44
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=77.67 E-value=4.4 Score=33.40 Aligned_cols=34 Identities=18% Similarity=0.405 Sum_probs=27.6
Q ss_pred CceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 91 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 91 ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
+...++.++.|..+|+|+..+. .++|++|.+|+ +
T Consensus 144 ~~~~~~~~~~~~~~~~f~~~~~--~~~D~~~~~r~-~ 177 (202)
T cd06433 144 HQATFFRRSLFEKYGGFDESYR--IAADYDLLLRL-L 177 (202)
T ss_pred CcceEEEHHHHHHhCCCchhhC--chhhHHHHHHH-H
Confidence 3335799999999999988764 45799999998 7
No 45
>PF10555 MraY_sig1: Phospho-N-acetylmuramoyl-pentapeptide-transferase signature 1 ; InterPro: IPR018480 Phospho-N-acetylmuramoyl-pentapeptide-transferase (2.7.8.13 from EC) (MraY) is a bacterial enzyme responsible for the formation of the first lipid intermediate of the cell wall peptidoglycan synthesis []. It catalyses the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. MraY is an integral membrane protein with probably ten transmembrane domains. It belongs to family 4 of glycosyl transferases. Homologues of MraY have been found in archaebacteria Methanobacterium thermoautotrophicum and in Arabidopsis thaliana (Mouse-ear cress). This entry represents two conserved sites found in these proteins. The first site is located at the end of the first cytoplasmic loop and the beginning of the second transmembrane domain. The second site is located in the third cytoplasmic loop.
Probab=76.64 E-value=1.2 Score=21.63 Aligned_cols=10 Identities=70% Similarity=1.139 Sum_probs=6.8
Q ss_pred CccccCceeee
Q psy3380 86 TPTMAGGLFSI 96 (290)
Q Consensus 86 sP~~~ggl~~I 96 (290)
+|+| ||++.|
T Consensus 4 TPTM-GGi~~i 13 (13)
T PF10555_consen 4 TPTM-GGIVFI 13 (13)
T ss_pred Cccc-eeEEEC
Confidence 6888 676554
No 46
>PLN02899 alpha-galactosidase
Probab=73.59 E-value=2.4 Score=43.11 Aligned_cols=49 Identities=22% Similarity=0.539 Sum_probs=34.7
Q ss_pred CCceeccCCCCCCCCCC---eEEEcCCCCCCcceeEEcccccccc--CCceeeee
Q psy3380 217 SGMCIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRR--DEACLDYA 266 (290)
Q Consensus 217 ~~~CLD~~~~~~~~g~~---v~l~~Ch~~~~nQ~w~lt~~G~Ir~--~~~CLD~~ 266 (290)
...|||+..+....-+. -...+|.. ..+|.|+++.+|+|.+ +++|..+.
T Consensus 486 ~~~c~~~~~~~~~~~~~~~~~~fs~c~~-~~~q~w~l~~~g~l~~~~sglca~v~ 539 (633)
T PLN02899 486 GELCLDASPKQKRTSKDFRSGSFSPCRW-DANQMWELNNNGTLISSYSGLCATVN 539 (633)
T ss_pred cChhhccCCCCCcCHhHhhccccCCCCC-ChhhceeeCCCCCEecCccccceEee
Confidence 45688876543222111 23578966 6899999999999987 58999885
No 47
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=72.41 E-value=1.7 Score=38.09 Aligned_cols=45 Identities=18% Similarity=0.194 Sum_probs=36.1
Q ss_pred cCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEee
Q psy3380 90 AGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160 (290)
Q Consensus 90 ~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~ 160 (290)
.|...++.++.|+.+|+|++.. -+||++|++|+ + ..|..+...+.
T Consensus 162 ~g~~~~~rr~~~~~vgg~~~~~---~~eD~~l~~rl-~----------------------~~G~r~~~~~~ 206 (241)
T cd06427 162 GGTSNHFRTDVLRELGGWDPFN---VTEDADLGLRL-A----------------------RAGYRTGVLNS 206 (241)
T ss_pred CCchHHhhHHHHHHcCCCCccc---chhhHHHHHHH-H----------------------HCCceEEEecc
Confidence 3555789999999999998742 37999999998 7 77878776664
No 48
>cd00899 b4GalT Beta-4-Galactosyltransferase is involved in the formation of the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. Beta-4-Galactosyltransferase transfers galactose from uridine diphosphogalactose to the terminal beta-N-acetylglucosamine residues, hereby forming the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. At least seven homologous beta-4-galactosyltransferase isoforms have been identified that use different types of glycoproteins and glycolipids as substrates. Of the seven identified members of the beta-1,4-galactosyltransferase subfamily (beta1,4-Gal-T1 to -T7), b1,4-Gal-T1 is most characterized (biochemically). It is a Golgi-resident type II membrane enzyme with a cytoplasmic domain, membrane spanning region, and a stem region and catalytic domain facing the lumen.
Probab=69.80 E-value=2.2 Score=37.88 Aligned_cols=34 Identities=29% Similarity=0.696 Sum_probs=31.3
Q ss_pred CCCCeEEEEEeeeecccCCCCCCCcccccccccc
Q psy3380 149 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182 (290)
Q Consensus 149 ~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~ 182 (290)
+..||.+++.++-|.+.++||.++.-||+|.-++
T Consensus 110 ~~~Gg~~~~~k~~f~~VNGf~n~f~GWGgEDdd~ 143 (219)
T cd00899 110 TYFGGVLALTREQFRKVNGFSNAYWGWGGEDDDL 143 (219)
T ss_pred cccccceeeEHHHHHHhCCcCCcCccCCcchHHH
Confidence 3899999999999999999999998999999765
No 49
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=69.19 E-value=1.6 Score=41.51 Aligned_cols=48 Identities=13% Similarity=0.074 Sum_probs=37.3
Q ss_pred cCceeeechhhHhhhcccCCCCcccCcccccccccccceecCHHHHhccCCCCCccccCCCCCeEEEEEeee
Q psy3380 90 AGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF 161 (290)
Q Consensus 90 ~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W~~~p~~~~~~~~~~~~p~~~p~~~GG~~~i~~~~ 161 (290)
.|..+++.|+.|+.+|+|+.-... =+||++|..++ . .+|..+.+.+..
T Consensus 201 ~G~~~a~RR~~l~~iGGf~~~~~~-~~ED~~l~~~i-~----------------------~~G~~v~~~~~~ 248 (373)
T TIGR03472 201 FGATMALRRATLEAIGGLAALAHH-LADDYWLGELV-R----------------------ALGLRVVLAPVV 248 (373)
T ss_pred cChhhheeHHHHHHcCChHHhccc-chHHHHHHHHH-H----------------------HcCCeEEecchh
Confidence 366779999999999999864333 36999999997 6 778887766643
No 50
>KOG3916|consensus
Probab=68.58 E-value=2.3 Score=40.07 Aligned_cols=37 Identities=30% Similarity=0.573 Sum_probs=34.5
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
+-||+-|++++-|..+-||.+.+-.|||||=+|.-|+
T Consensus 260 ~FGGVsalt~~qf~kINGFsN~fWGWGGEDDDl~nRv 296 (372)
T KOG3916|consen 260 YFGGVSALTKEQFRKINGFSNAFWGWGGEDDDLWNRV 296 (372)
T ss_pred hhCchhhccHHHHHHhcCCCchhcccCCcchHHHHHH
Confidence 5599999999999999999999999999999998886
No 51
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ].
Probab=68.08 E-value=2.3 Score=38.72 Aligned_cols=36 Identities=28% Similarity=0.612 Sum_probs=32.4
Q ss_pred CCCCCeEEEEEeeeecccCCCCCCCccccccccccc
Q psy3380 148 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183 (290)
Q Consensus 148 p~~~GG~~~i~~~~F~~~g~y~~g~~~~~~e~~e~~ 183 (290)
.+++||.+++.++.|.++|+||+.+.-||.|-.++.
T Consensus 166 ~~~~s~~~~i~r~~f~~iGGfDE~f~G~G~ED~D~~ 201 (281)
T PF10111_consen 166 IAFASSCFLINREDFLEIGGFDERFRGWGYEDIDFG 201 (281)
T ss_pred ccccceEEEEEHHHHHHhCCCCccccCCCcchHHHH
Confidence 447889999999999999999999999999999863
No 52
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=65.63 E-value=2.4 Score=35.09 Aligned_cols=27 Identities=15% Similarity=0.213 Sum_probs=23.3
Q ss_pred CccccCceeeechhhHhhhcccCCCCc
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGFD 112 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~~ 112 (290)
.|.+.||+++|+++.|+.+|+||..+.
T Consensus 144 ~~~~~~~~~~~~r~~~~~i~~~d~~~~ 170 (181)
T cd04187 144 IPDNGGDFRLMDRKVVDALLLLPERHR 170 (181)
T ss_pred CCCCCCCEEEEcHHHHHHHHhcCCCCc
Confidence 466778999999999999999998764
No 53
>KOG3588|consensus
Probab=56.70 E-value=3.6 Score=39.33 Aligned_cols=33 Identities=30% Similarity=0.585 Sum_probs=26.8
Q ss_pred ceeeechhhHhhhcccCCCCcccCccccccccc
Q psy3380 92 GLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124 (290)
Q Consensus 92 gl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk 124 (290)
|+-.+-+..|-.+||||.....|||||..|-=|
T Consensus 386 Gmtc~yrsd~~~vgGFD~~I~GWG~EDV~Ly~K 418 (494)
T KOG3588|consen 386 GMTCQYRSDFLTVGGFDMEIKGWGGEDVDLYRK 418 (494)
T ss_pred ceeEEeeccceeecCcceeeeccCcchHHHHHH
Confidence 445667778888999998888899999988665
No 54
>PF11397 GlcNAc: Glycosyltransferase (GlcNAc); InterPro: IPR021067 GlcNAc is an enzyme that carries out the first glycosylation step of hydroxylated Skp1; it is found in the cytoplasm and results in a pentasaccharide-linked 'HyPro-143[, ].
Probab=49.10 E-value=21 Score=33.97 Aligned_cols=114 Identities=19% Similarity=0.288 Sum_probs=63.8
Q ss_pred ceecccCCchHHHHHHHhc--CCCeEEEeeEeeecC--CCcccccCCCccCCCCCceEEEEeee--eeeEEeeCCHHHHh
Q psy3380 2 MCCEVQKRWLQPLLDVLAR--NSSHVVSPLIANICD--DTFELRFPPGRLTSSYKFFIGGFDWN--LQFNWHAIPERERK 75 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~--~~~~~v~p~I~~id~--~t~~y~~~~~~~~~~~~~~~g~fdw~--l~~~w~~~~~~~~~ 75 (290)
||+...++|=+-|++.+.+ +++.|++=-...++. ...+.... ....+....|+.. +.++ .+...
T Consensus 125 SH~rF~~~WD~~li~~~~~~~~~~aVLS~YP~~~~~~~~~~~~~~~-----~~~~lc~~~~~~~g~~~~~-----~~~~~ 194 (343)
T PF11397_consen 125 SHMRFVPGWDEILIEMLKSLRNPKAVLSTYPPGYEPDGGQPEPEKT-----TVPRLCAARFGPDGMVRLG-----ARWIK 194 (343)
T ss_pred ccceeeccHHHHHHHHHHhcCCCCeEEecCCCCcccccCCccccCC-----cccEEEEeEECCCCcEeec-----ceecc
Confidence 8999999997777776664 666666632222222 11111100 0011112233221 1111 11111
Q ss_pred cccCCCCcccCccccCceeeechhhHhhhcccCCCCc-ccCcccccccccccc
Q psy3380 76 RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD-IWGGENLELSFKFNW 127 (290)
Q Consensus 76 ~~~~~~~p~~sP~~~ggl~~I~~~~F~~~g~yd~~~~-~~Gg~~~elsfk~~W 127 (290)
.......|+.+|..++|...-+-+ |..--.||+.+. ++-||...++.|. |
T Consensus 195 ~~~~~~~P~~~~f~aaGF~Fa~~~-~~~eVP~DP~lp~lF~GEE~~~aaRl-w 245 (343)
T PF11397_consen 195 PAPKLEEPVPQPFWAAGFSFAPGH-FVREVPYDPHLPFLFDGEEISMAARL-W 245 (343)
T ss_pred cccccCCCeeeceecccEEEcchh-heecCCCCCCcccccccHHHHHHHHH-H
Confidence 122345789999999997666656 544446998863 6789999999998 8
No 55
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=47.76 E-value=12 Score=36.50 Aligned_cols=35 Identities=14% Similarity=0.378 Sum_probs=30.0
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
..|+..++.++.++++|++++.+ -+||.||++|+ .
T Consensus 230 ~sG~~~~~rr~al~~vGg~~~~~---i~ED~dl~~rl-~ 264 (444)
T PRK14583 230 VSGVVAAFRRRALADVGYWSPDM---ITEDIDISWKL-Q 264 (444)
T ss_pred ecCceeEEEHHHHHHcCCCCCCc---ccccHHHHHHH-H
Confidence 35777899999999999999875 37999999997 6
No 56
>PRK11204 N-glycosyltransferase; Provisional
Probab=45.71 E-value=11 Score=35.99 Aligned_cols=35 Identities=17% Similarity=0.431 Sum_probs=30.0
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccccc
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW 127 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~~W 127 (290)
+.|...+++++.|..+|+||+.+ -+||.|+++|+ .
T Consensus 209 ~~G~~~~~rr~~l~~vgg~~~~~---~~ED~~l~~rl-~ 243 (420)
T PRK11204 209 VSGVITAFRKSALHEVGYWSTDM---ITEDIDISWKL-Q 243 (420)
T ss_pred ecceeeeeeHHHHHHhCCCCCCc---ccchHHHHHHH-H
Confidence 35777899999999999999875 36999999997 6
No 57
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=45.45 E-value=25 Score=31.00 Aligned_cols=34 Identities=9% Similarity=0.044 Sum_probs=23.0
Q ss_pred ceecccCCchHHHHHHHhcCCCe-EEEeeEeeecC
Q psy3380 2 MCCEVQKRWLQPLLDVLARNSSH-VVSPLIANICD 35 (290)
Q Consensus 2 ~h~e~~~~Wl~pll~~i~~~~~~-~v~p~I~~id~ 35 (290)
|++...++||+.|++.+.++|.. +|+..+..++.
T Consensus 82 aD~~~~~~~l~~l~~~~~~~p~vg~v~g~~~~~~~ 116 (244)
T cd04190 82 ADTKFDPDSIVQLYKAMDKDPEIGGVCGEIHPMGK 116 (244)
T ss_pred CCCcCCHhHHHHHHHHHHhCCCEEEEEeeeEEcCC
Confidence 56677788888888888777754 45555555443
No 58
>PF05679 CHGN: Chondroitin N-acetylgalactosaminyltransferase; InterPro: IPR008428 This family represents Chondroitin N-acetylgalactosaminyltransferase. Proteins have a type II transmembrane topology. The enzyme is involved in the biosynthetic initiation and elongation of chondroitin sulphate and is the key enzyme responsible for the selective chain assembly of chondroitin/dermatan sulphate on the linkage region tetrasaccharide common to various proteoglycans containing chondroitin/dermatan sulphate or heparin/heparan sulphate chains. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0032580 Golgi cisterna membrane
Probab=43.32 E-value=12 Score=37.34 Aligned_cols=34 Identities=32% Similarity=0.701 Sum_probs=29.0
Q ss_pred ceeeechhhHhhh--cccCCCCcccCcccccccccc
Q psy3380 92 GLFSIDKAFFEKL--GTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 92 gl~~I~~~~F~~~--g~yd~~~~~~Gg~~~elsfk~ 125 (290)
|+++|-+..|-.+ |+++.....||+|||+|-=|+
T Consensus 412 g~~~~YksDy~~~~~~~~~~~~~gwg~ED~~l~~~~ 447 (499)
T PF05679_consen 412 GMVCFYKSDYMRIRGGGFDLSIRGWGGEDVDLYDKF 447 (499)
T ss_pred ceEEEEhhhhhhhcccccccccccccccHHHHHHHH
Confidence 7788889999888 888888888999999886554
No 59
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=42.06 E-value=22 Score=27.87 Aligned_cols=26 Identities=12% Similarity=0.328 Sum_probs=21.9
Q ss_pred CccccCceeeechhhHhhhcccCCCC
Q psy3380 86 TPTMAGGLFSIDKAFFEKLGTYDSGF 111 (290)
Q Consensus 86 sP~~~ggl~~I~~~~F~~~g~yd~~~ 111 (290)
.+.+.|..++++++.|..+|+|++.+
T Consensus 151 ~~~~~g~~~~~~~~~~~~~ggf~~~~ 176 (180)
T cd06423 151 VLVLSGAFGAFRREALREVGGWDEDT 176 (180)
T ss_pred eeecCchHHHHHHHHHHHhCCccccC
Confidence 45567888999999999999998764
No 60
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=40.05 E-value=28 Score=29.10 Aligned_cols=69 Identities=23% Similarity=0.399 Sum_probs=38.0
Q ss_pred cCCCCceeccCCCCCCCCCCeEEEc-CCCCCCcceeEE--cccc--cccc--CCceeeeeC-------CeEEEEeCCCC-
Q psy3380 214 NDWSGMCIDSACKPTDMHKPVGLYP-CHKQGGNQFWMM--SKHG--EIRR--DEACLDYAG-------GDVILYPCHGS- 278 (290)
Q Consensus 214 ~~~~~~CLD~~~~~~~~g~~v~l~~-Ch~~~~nQ~w~l--t~~G--~Ir~--~~~CLD~~g-------~~V~l~~C~g~- 278 (290)
+..++.||..... -+...+ |......|.|.+ +..| +||+ .+.||.+.. ..+.+.+|...
T Consensus 58 n~~~~~CL~~~~~------G~~~~~~C~~~~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~~~~~~~~~i~l~~Cd~~~ 131 (150)
T PF03498_consen 58 NPKTGTCLAAYGN------GVFHYKSCDQDNLEQVFSIIPTTTGAVQIKSLSTGECLQTFNNSRTPIYYSIGLTPCDKSK 131 (150)
T ss_dssp ETTTSEEEEEETT------CEEEE--TTTCHGHH-EEEEEBTTS-EEEEETTT--EEEE-STTSS-SSEEEEEE---S-E
T ss_pred cCCCCcceeecCC------CeEeecccCCCChhceEEEEEcCCCcEEEEecCCCceEEecCCCceeEEeeEEeeeCCCcC
Confidence 4566669987321 233444 988656799976 4445 4665 578998852 26899999322
Q ss_pred --CCceeEEEcc
Q psy3380 279 --KGNQYFEYDY 288 (290)
Q Consensus 279 --~~nQ~W~~~~ 288 (290)
.-.|+|.+++
T Consensus 132 ~~~l~~lw~itP 143 (150)
T PF03498_consen 132 EINLDQLWFITP 143 (150)
T ss_dssp EETGGGEEEEE-
T ss_pred CCCHHHcEEEcC
Confidence 2368898764
No 61
>PF03002 Somatostatin: Somatostatin/Cortistatin family; InterPro: IPR018142 Somatostatin inhibits the release of the pituitary growth hormone, somatotropin and inhibits the release of glucagon and insulin from the pancreas of fasted animals. Cortistatin is a cortical neuropeptide with neuronal depressant and sleep-modulating properties [].; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=36.15 E-value=9.1 Score=20.15 Aligned_cols=9 Identities=56% Similarity=1.468 Sum_probs=7.4
Q ss_pred Hhccccchh
Q psy3380 200 NLGCKSFKW 208 (290)
Q Consensus 200 ~l~ck~FkW 208 (290)
+-.||.|.|
T Consensus 4 k~~CknffW 12 (18)
T PF03002_consen 4 KAGCKNFFW 12 (18)
T ss_pred cccccceee
Confidence 468999998
No 62
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=29.48 E-value=21 Score=34.79 Aligned_cols=35 Identities=23% Similarity=0.442 Sum_probs=29.4
Q ss_pred cccCceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 88 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 88 ~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
.+.|+..++.++.+..+|+|++.. -+||.||.+|+
T Consensus 214 ~~sGa~~~~Rr~~l~~vggf~~~~---i~ED~~l~~rl 248 (439)
T TIGR03111 214 TLSGAFSAFRRETILKTQLYNSET---VGEDTDMTFQI 248 (439)
T ss_pred EEccHHHhhhHHHHHHhCCCCCCC---cCccHHHHHHH
Confidence 356777799999999999999764 38999999986
No 63
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=28.49 E-value=40 Score=36.03 Aligned_cols=36 Identities=14% Similarity=0.204 Sum_probs=30.6
Q ss_pred ccccCceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 87 PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 87 P~~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
....|...+|.++-|+++|+||.+. -.||+++++|+
T Consensus 416 ~~~~Gs~aviRReaLeeVGGfd~~t---itED~dlslRL 451 (852)
T PRK11498 416 TFFCGSCAVIRRKPLDEIGGIAVET---VTEDAHTSLRL 451 (852)
T ss_pred cccccceeeeEHHHHHHhcCCCCCc---cCccHHHHHHH
Confidence 3456888999999999999999764 37999999986
No 64
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=26.20 E-value=34 Score=29.94 Aligned_cols=35 Identities=11% Similarity=0.088 Sum_probs=23.7
Q ss_pred ccCceeeechhhHhhhcccCCCCcccCcccccccccc
Q psy3380 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125 (290)
Q Consensus 89 ~~ggl~~I~~~~F~~~g~yd~~~~~~Gg~~~elsfk~ 125 (290)
..||+.+++++.|+.++.+.... .+ .+++||.++.
T Consensus 166 ~~g~~~~~rr~~~~~i~~~~~~~-~~-~~~~el~~~~ 200 (243)
T PLN02726 166 LTGSFRLYKRSALEDLVSSVVSK-GY-VFQMEIIVRA 200 (243)
T ss_pred CCCcccceeHHHHHHHHhhccCC-Cc-EEehHHHHHH
Confidence 45888899999999987543221 11 2467887775
Done!