Query psy5439
Match_columns 381
No_of_seqs 234 out of 808
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 17:32:30 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5439.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5439hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00852 Glyco_transf_10: Glyc 100.0 3.5E-71 7.7E-76 551.3 9.8 268 91-368 45-349 (349)
2 KOG2619|consensus 100.0 2.1E-67 4.6E-72 524.4 21.8 270 91-368 66-371 (372)
3 PF03016 Exostosin: Exostosin 96.8 0.0069 1.5E-07 58.1 9.2 133 158-307 146-284 (302)
4 KOG1021|consensus 93.0 0.49 1.1E-05 49.8 9.2 124 156-287 250-376 (464)
5 PF13524 Glyco_trans_1_2: Glyc 92.5 0.23 5.1E-06 39.4 4.7 60 264-329 9-69 (92)
6 PF00159 Hormone_3: Pancreatic 72.1 3.8 8.3E-05 28.6 2.5 17 304-320 9-25 (36)
7 cd00126 PAH Pancreatic Hormone 69.8 4.6 9.9E-05 28.2 2.4 17 304-320 9-25 (36)
8 cd03820 GT1_amsD_like This fam 66.9 11 0.00024 34.6 5.3 78 247-329 249-326 (348)
9 smart00309 PAH Pancreatic horm 61.7 7.7 0.00017 27.1 2.3 17 304-320 9-25 (36)
10 cd03794 GT1_wbuB_like This fam 55.2 27 0.00058 32.7 5.6 80 248-330 292-373 (394)
11 cd03801 GT1_YqgM_like This fam 53.5 27 0.00058 32.1 5.3 77 247-329 272-348 (374)
12 PF00534 Glycos_transf_1: Glyc 53.3 17 0.00036 31.4 3.6 78 244-328 86-164 (172)
13 cd05844 GT1_like_7 Glycosyltra 52.9 29 0.00062 33.4 5.6 58 268-329 285-343 (367)
14 cd03808 GT1_cap1E_like This fa 47.5 40 0.00088 31.1 5.5 61 267-330 277-337 (359)
15 cd04946 GT1_AmsK_like This fam 45.0 30 0.00065 35.1 4.5 56 268-329 325-384 (407)
16 cd03800 GT1_Sucrose_synthase T 43.8 44 0.00095 32.4 5.3 57 269-329 318-375 (398)
17 TIGR03088 stp2 sugar transfera 43.8 43 0.00094 32.7 5.3 59 268-329 287-345 (374)
18 cd03822 GT1_ecORF704_like This 43.4 52 0.0011 31.0 5.6 58 268-329 284-341 (366)
19 cd03795 GT1_like_4 This family 39.0 54 0.0012 31.1 5.0 58 269-329 281-339 (357)
20 cd03812 GT1_CapH_like This fam 38.0 48 0.001 31.6 4.5 59 268-331 281-340 (358)
21 PLN02871 UDP-sulfoquinovose:DA 38.0 57 0.0012 33.7 5.3 56 269-329 347-407 (465)
22 cd03809 GT1_mtfB_like This fam 37.2 71 0.0015 30.0 5.5 57 269-330 288-344 (365)
23 cd03823 GT1_ExpE7_like This fa 36.7 58 0.0013 30.4 4.7 58 269-330 279-337 (359)
24 cd03814 GT1_like_2 This family 36.0 83 0.0018 29.5 5.7 59 269-330 282-340 (364)
25 cd03799 GT1_amsK_like This is 32.4 95 0.0021 29.3 5.5 78 247-329 252-334 (355)
26 cd03825 GT1_wcfI_like This fam 30.7 1.2E+02 0.0025 28.9 5.8 59 268-330 279-338 (365)
27 PRK13893 conjugal transfer pro 30.4 31 0.00068 32.5 1.7 58 261-329 132-189 (193)
28 PF13692 Glyco_trans_1_4: Glyc 29.7 32 0.00069 28.3 1.6 69 246-322 66-135 (135)
29 cd03818 GT1_ExpC_like This fam 29.3 1.2E+02 0.0026 30.2 5.9 58 269-329 316-373 (396)
30 cd03798 GT1_wlbH_like This fam 28.9 97 0.0021 28.6 4.8 72 248-325 276-347 (377)
31 COG0438 RfaG Glycosyltransfera 26.5 1.7E+02 0.0037 26.1 5.9 57 270-331 293-351 (381)
32 PRK15427 colanic acid biosynth 26.0 1.3E+02 0.0028 30.7 5.5 59 267-329 318-378 (406)
33 KOG1890|consensus 24.9 54 0.0012 36.3 2.6 37 250-287 263-308 (949)
34 TIGR03087 stp1 sugar transfera 24.4 1.6E+02 0.0035 29.3 5.8 56 267-328 312-368 (397)
35 cd03816 GT1_ALG1_like This fam 23.8 1.2E+02 0.0027 30.8 4.9 57 266-329 330-391 (415)
36 cd03807 GT1_WbnK_like This fam 22.7 1.5E+02 0.0032 27.5 4.8 57 267-328 282-338 (365)
37 PRK15484 lipopolysaccharide 1, 22.4 1.5E+02 0.0032 29.7 5.1 54 269-325 293-347 (380)
38 PF14281 PDDEXK_4: PD-(D/E)XK 21.4 78 0.0017 28.2 2.6 120 197-322 17-163 (179)
39 cd04962 GT1_like_5 This family 21.3 2.3E+02 0.005 27.1 6.0 58 269-329 286-343 (371)
40 cd03821 GT1_Bme6_like This fam 21.3 1.6E+02 0.0034 27.4 4.7 57 269-330 297-353 (375)
41 cd04951 GT1_WbdM_like This fam 20.7 1.8E+02 0.004 27.4 5.1 54 268-327 277-331 (360)
No 1
>PF00852 Glyco_transf_10: Glycosyltransferase family 10 (fucosyltransferase); InterPro: IPR001503 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 10 GT10 from CAZY comprises enzymes with two known activities; galactoside 3(4)-L-fucosyltransferase (2.4.1.65 from EC) and galactoside 3-fucosyltransferase (2.4.1.152 from EC). The galactoside 3-fucosyltransferases display similarities with the alpha-2 and alpha-6-fucosyltranferases []. The biosynthesis of the carbohydrate antigen sialyl Lewis X (sLe(x)) is dependent on the activity of an galactoside 3-fucosyltransferase. This enzyme catalyses the transfer of fucose from GDP-beta-fucose to the 3-OH of N-acetylglucosamine present in lactosamine acceptors []. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Galactoside 3(4)-L-fucosyltransferase (2.4.1.65 from EC) belongs to the Lewis blood group system and is associated with Le(a/b) antigen. ; GO: 0008417 fucosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane; PDB: 2NZX_B 2NZW_C 2NZY_C.
Probab=100.00 E-value=3.5e-71 Score=551.26 Aligned_cols=268 Identities=44% Similarity=0.774 Sum_probs=100.6
Q ss_pred CCCeEEEEeccCCCCccccCCCCCCCCCC----CCcEEEecCCCCCc---------------------------eEEEEe
Q psy5439 91 KNVKKILLFNALTSWNVKLGSSMFSNCPV----NACALTTNKAESPF---------------------------VWALYF 139 (381)
Q Consensus 91 ~~~k~ILlw~~~~~w~~~~g~~~f~~C~v----~~C~iT~dr~~~~~---------------------------~wv~~s 139 (381)
.+.+.||+|++...+... +.+|+. ++|.+|+||+++.+ +||+|+
T Consensus 45 ~~~~~IL~W~~~f~~~~~-----~~~C~~~~~~~~C~lT~dr~~~~~aDAVVFh~~~~~~~~~~lP~~~R~~~Q~wV~~s 119 (349)
T PF00852_consen 45 KKKKLILLWTWPFGVPFD-----LSKCPDNFGIGNCELTNDRSLLSEADAVVFHGRDFSESLSDLPKKRRPPHQRWVFWS 119 (349)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccccccccccccc-----ccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 556789999865333222 468865 47999999997543 899999
Q ss_pred cCCCcCCC-CCCccccccceeeeccCCCccCCcccceeccccccCCChhhhccccccCcceEEEEEeccCCcchHHHHHH
Q psy5439 140 LECPYHTQ-HVAHNDVFNWTATYRSDSDIVTPYEKWVYYDDRHKQMPLDKMKNYAANKTHKVAWFVSNCGAKNNRLQYAH 218 (381)
Q Consensus 140 ~EsP~~~~-~~~~~~~FNwTMTYR~DSDI~~pYg~~~~~~~~~~~~~~~~~~~~~~~K~k~vaw~vSnC~~~~~R~~yv~ 218 (381)
+|||.++. ...+.++||||||||+||||++|||.+......... ....+...+|+++|+||+|||++.++|++|++
T Consensus 120 ~ESP~~~~~~~~~~~~FN~TMTYr~dSDi~~py~~~~~~~~~~~~---~~~~~~~~~K~~~~~w~~Snc~~~~~R~~~~~ 196 (349)
T PF00852_consen 120 MESPSNTPRLKDFDNFFNWTMTYRRDSDIPLPYGYFSPRESPSEK---DDLPNILKKKTKLAAWIVSNCNPHSGREEYVR 196 (349)
T ss_dssp --------------------------------------------------------TSSEEEEE--S-S--H-HHHHHHH
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccc---ccccccccCCCceEEEEeeCcCCcccHHHHHH
Confidence 99999963 346788999999999999999999976532221111 12244567899999999999999989999999
Q ss_pred HHhcCCeEeEcCCCCCCCCCCchhHHHhhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCC
Q psy5439 219 ELGKYIEVDIYGSCGSKKCPRISKCFRLLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMN 298 (381)
Q Consensus 219 eL~k~i~VD~yG~C~~~~c~~~~~c~~~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~ 298 (381)
+|+++++||+||+|+++.+... +|...++++||||||||||+|+|||||||| +||.+|+||||+|++..+|++++|||
T Consensus 197 ~L~~~~~vd~yG~c~~~~~~~~-~~~~~~~~~ykF~lafENs~c~dYiTEK~~-~al~~g~VPI~~G~~~~~~~~~~P~~ 274 (349)
T PF00852_consen 197 ELSKYIPVDSYGKCGNNNPCPR-DCKLELLSKYKFYLAFENSNCPDYITEKFW-NALLAGTVPIYWGPPRPNYEEFAPPN 274 (349)
T ss_dssp HHHTTS-EEE-SSTT--SSS---S-HHHHHHTEEEEEEE-SS--TT---HHHH-HHHHTTSEEEEES---TTHHHHS-GG
T ss_pred HHHhhcCeEccCCCCCCCCccc-ccccccccCcEEEEEecCCCCCCCCCHHHH-HHHHCCeEEEEECCEecccccCCCCC
Confidence 9999999999999954433222 344556799999999999999999999999 89999999999997778999999999
Q ss_pred CccccCCCCCHHHHHHHHHHHccCHHHHHHHhhhhccCcccc----cccchHHhHHhhhC-CCCceecchhhhcC
Q psy5439 299 SYIHVDDFPSAKHLAEYLNKLDANDTLYNSYFRWKGTGEFVN----TFFWCRLCAMLHDS-FPIKSYSDINEWWR 368 (381)
Q Consensus 299 SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf~Wr~~~~~~~----~~~~C~LC~~L~~~-~~~k~y~dl~~Ww~ 368 (381)
||||++||+|+++||+||+.|++|+++|++||+||+++++.. ..++|+||++|++. .+.++|+||.+||.
T Consensus 275 SfI~~~df~s~~~La~yl~~l~~n~~~Y~~yf~Wr~~~~~~~~~~~~~~~C~lC~~l~~~~~~~~~~~~l~~W~~ 349 (349)
T PF00852_consen 275 SFIHVDDFKSPKELADYLKYLDKNDELYNKYFEWRKRYEVKRFDSWSEGFCRLCKKLHENRRKPKVYPDLNDWWW 349 (349)
T ss_dssp GSEEGGGSSSHHHHHHHHHHHHT-HHHHH----------------------------------------------
T ss_pred CccchhcCCCHHHHHHHHHHHhcCHHHHhhhccccccccccccccccccccccccccccccccccccccccccCC
Confidence 999999999999999999999999999999999999998753 36899999999998 67799999999994
No 2
>KOG2619|consensus
Probab=100.00 E-value=2.1e-67 Score=524.41 Aligned_cols=270 Identities=37% Similarity=0.643 Sum_probs=222.7
Q ss_pred CCCeEEEEeccCCCCccccCCCCC--CCCCCCCcEEEecCCCCCc-------------------------eEEEEecCCC
Q psy5439 91 KNVKKILLFNALTSWNVKLGSSMF--SNCPVNACALTTNKAESPF-------------------------VWALYFLECP 143 (381)
Q Consensus 91 ~~~k~ILlw~~~~~w~~~~g~~~f--~~C~v~~C~iT~dr~~~~~-------------------------~wv~~s~EsP 143 (381)
...|.|++|.....|.. ..... ..|...+|.||++|+.++. .||+++||||
T Consensus 66 ~~~~~i~~~~~~f~~~~--~~~~~~~~~~~~~~C~iT~~rs~~~~adAvlfh~~d~~~~~~~l~~R~p~q~~i~~s~ESP 143 (372)
T KOG2619|consen 66 LLMKVILLWTWPFGLIP--ELSLDNQCPFGIDGCRITTNRSLFDSADAVLFHHRDIPKSPLPLPGRKPGQPWIWMSMESP 143 (372)
T ss_pred ccceEEEEEecCCCCCC--CcccccccccCcCceEEeCCccccCCCCEEEEeeccCCCCcccccCCCCCCceEEEeccCC
Confidence 44578999975432222 11112 4566789999999998765 8999999999
Q ss_pred cCCCCCC-ccc-cccceeeeccCCCccCCcccceeccccccCCChhhhccccccCcceEEEEEeccCCcchHHHHHHHHh
Q psy5439 144 YHTQHVA-HND-VFNWTATYRSDSDIVTPYEKWVYYDDRHKQMPLDKMKNYAANKTHKVAWFVSNCGAKNNRLQYAHELG 221 (381)
Q Consensus 144 ~~~~~~~-~~~-~FNwTMTYR~DSDI~~pYg~~~~~~~~~~~~~~~~~~~~~~~K~k~vaw~vSnC~~~~~R~~yv~eL~ 221 (381)
.|+..+. ..+ +||||||||+|||+++|||.+..... ..... ++.+....|++.+||++|||.+.+.|.+|+++|+
T Consensus 144 ~~~~~~~~~~~~~fN~T~Tyr~dSd~~~pygy~~~~~~--~~~~~-p~~~~~~~k~~~~aw~vSnc~~~~~R~~~~~~L~ 220 (372)
T KOG2619|consen 144 SNSPGLSLIDGRLFNWTMTYRRDSDLFVPYGYLEKPEA--NPVLV-PVNSILSAKTKLAAWLVSNCIPRSARLDYYKELM 220 (372)
T ss_pred CCCccccccccccccceEEEeccCCCCCccceEeeccc--Cceec-ccccccccccceeeeeccccCcchHHHHHHHHHH
Confidence 9966543 344 89999999999999999997754221 11111 1234457899999999999999999999999999
Q ss_pred cCCeEeEcCCCCCCCCCCc-hhHHHhhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCc
Q psy5439 222 KYIEVDIYGSCGSKKCPRI-SKCFRLLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSY 300 (381)
Q Consensus 222 k~i~VD~yG~C~~~~c~~~-~~c~~~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~Sf 300 (381)
++++||+||+|+.+.+.+. ..|.-.++++||||||||||+|+|||||||| +||.+|+||||+|++ +|+.++|++||
T Consensus 221 k~l~iD~YG~c~~~~~~~~~~~~~~~~~s~YKFyLAfENS~c~DYVTEKfw-~al~~gsVPVvlg~~--n~e~fvP~~Sf 297 (372)
T KOG2619|consen 221 KHLEIDSYGECLRKNANRDPSDCLLETLSHYKFYLAFENSNCEDYVTEKFW-NALDAGSVPVVLGPP--NYENFVPPDSF 297 (372)
T ss_pred hhCceeeccccccccccCCCCCcceeecccceEEEEecccCCcccccHHHH-hhhhcCcccEEECCc--cccccCCCcce
Confidence 9999999999997543222 2566667799999999999999999999999 899999999999994 79999999999
Q ss_pred cccCCCCCHHHHHHHHHHHccCHHHHHHHhhhhccCcccc----cccchHHhHHhhhCC--CCceecchhhhcC
Q psy5439 301 IHVDDFPSAKHLAEYLNKLDANDTLYNSYFRWKGTGEFVN----TFFWCRLCAMLHDSF--PIKSYSDINEWWR 368 (381)
Q Consensus 301 I~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf~Wr~~~~~~~----~~~~C~LC~~L~~~~--~~k~y~dl~~Ww~ 368 (381)
||++||+||+|||+||++|++|+++|++||+||+.+.+.. ..++|+||.+|.+.. ..+++.++..||.
T Consensus 298 I~vdDF~s~~ela~ylk~L~~n~~~Y~~Yf~WR~~~~~~~~~~~~~~~CrlC~~l~~~~r~~~~~~~~~~~W~~ 371 (372)
T KOG2619|consen 298 IHVDDFQSPQELAAYLKKLDKNPAAYLSYFEWRKDYKVRLTSAWDSGFCRLCEKLQREPRNRYKSSRNLANWFT 371 (372)
T ss_pred EehhhcCCHHHHHHHHHHhhcCHHHHHHHHHHHHhhcccccccccchhhHHHHHHHhccccccccccchHhhhc
Confidence 9999999999999999999999999999999999987653 368999999999874 5678889999985
No 3
>PF03016 Exostosin: Exostosin family; InterPro: IPR004263 Hereditary multiple exostoses (EXT) is an autosomal dominant disorder that is characterised by the appearance of multiple outgrowths of the long bones (exostoses) at their epiphyses []. Mutations in two homologous genes, EXT1 and EXT2, are responsible for the EXT syndrome. The human and mouse EXT genes have at least two homologs in the invertebrate Caenorhabditis elegans, indicating that they do not function exclusively as regulators of bone growth. EXT1 and EXT2 have both been shown to encode glycosyltransferases involved in the chain elongation step of heparan sulphate biosynthesis [].; GO: 0016020 membrane
Probab=96.78 E-value=0.0069 Score=58.08 Aligned_cols=133 Identities=18% Similarity=0.229 Sum_probs=75.9
Q ss_pred eeeeccCCCccCCcccceeccccccCCChhhhccccccCcceEEEEEeccCC------cchHHHHHHHHhcCCeEeEcCC
Q psy5439 158 TATYRSDSDIVTPYEKWVYYDDRHKQMPLDKMKNYAANKTHKVAWFVSNCGA------KNNRLQYAHELGKYIEVDIYGS 231 (381)
Q Consensus 158 TMTYR~DSDI~~pYg~~~~~~~~~~~~~~~~~~~~~~~K~k~vaw~vSnC~~------~~~R~~yv~eL~k~i~VD~yG~ 231 (381)
.-+||..-||++|+..... . . ...... ......+.+..+.|+.+... ...|..++..+.+.-...+..
T Consensus 146 ~~~~~~~~Di~~P~~~~~~--~-~-~~~~~~-~~~~~~~R~~l~~f~g~~~~~~~~~~~~~r~~l~~~~~~~~~~~~~~- 219 (302)
T PF03016_consen 146 SSCFRPGFDIVIPPFVPPS--S-L-PDWRPW-PQRPPARRPYLLFFAGTIRPSSNDYSGGVRQRLLDECKSDPDFRCSD- 219 (302)
T ss_pred cCcccCCCCeecccccccc--c-c-CCcccc-ccCCccCCceEEEEeeeccccccccchhhhhHHHHhcccCCcceeee-
Confidence 4567888889999863211 0 0 000000 00112233445555555432 235888888876643323221
Q ss_pred CCCCCCCCchhHHHhhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCC
Q psy5439 232 CGSKKCPRISKCFRLLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFP 307 (381)
Q Consensus 232 C~~~~c~~~~~c~~~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~ 307 (381)
+...+....+ +..++.+-+|.|+. .-....+--|| +||.+|||||+++. . -.+|=...|+.++|.
T Consensus 220 -~~~~~~~~~~-~~~~l~~S~FCL~p---~G~~~~s~Rl~-eal~~GcIPVii~d---~--~~lPf~~~ldw~~fs 284 (302)
T PF03016_consen 220 -GSETCPSPSE-YMELLRNSKFCLCP---RGDGPWSRRLY-EALAAGCIPVIISD---D--YVLPFEDVLDWSRFS 284 (302)
T ss_pred -cccccccchH-HHHhcccCeEEEEC---CCCCcccchHH-HHhhhceeeEEecC---c--ccCCcccccCHHHEE
Confidence 1122222222 33456899999993 33344888999 89999999999965 2 237888888877775
No 4
>KOG1021|consensus
Probab=92.97 E-value=0.49 Score=49.78 Aligned_cols=124 Identities=13% Similarity=0.162 Sum_probs=73.3
Q ss_pred cceeeeccC-CCccCCcccceeccccccCCChhhhccccccCcceEEEEEeccCCcchHHHHHHHHhcCCeEeEcCCCCC
Q psy5439 156 NWTATYRSD-SDIVTPYEKWVYYDDRHKQMPLDKMKNYAANKTHKVAWFVSNCGAKNNRLQYAHELGKYIEVDIYGSCGS 234 (381)
Q Consensus 156 NwTMTYR~D-SDI~~pYg~~~~~~~~~~~~~~~~~~~~~~~K~k~vaw~vSnC~~~~~R~~yv~eL~k~i~VD~yG~C~~ 234 (381)
+++.+.... .|+.+||-... .+ ...............+.+..+.|+.+=....-|....+.+.+.-....++.|-+
T Consensus 250 ~ls~~~~~~~~dv~iP~~~~~--~~-~~~~~~~~~~~~~~~~R~~L~~F~G~~~~~~iR~~L~~~~~~~~~~~~~~~~~~ 326 (464)
T KOG1021|consen 250 LLSLEFFPWNKDVAIPYPTIP--HP-LSPPENSWQGGVPFSNRPILAFFAGAPAGGQIRSILLDLWKKDPDTEVFVNCPR 326 (464)
T ss_pred eeecccccCCCcccCCCccCc--Cc-cCccccccccCCCCCCCceEEEEeccccCCcHHHHHHHHhhcCcCccccccCCC
Confidence 455556656 89988885321 11 111100000111223556677777763334569999999988333455666655
Q ss_pred CC--CCCchhHHHhhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCC
Q psy5439 235 KK--CPRISKCFRLLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGAR 287 (381)
Q Consensus 235 ~~--c~~~~~c~~~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~ 287 (381)
.. |.+. .-+.+++.+.+|.|+.-= .-.-++-+| +|+.+|||||++-..
T Consensus 327 g~~~~~~~-~~y~~~m~~S~FCL~p~G---d~~ts~R~f-dai~~gCvPViisd~ 376 (464)
T KOG1021|consen 327 GKVSCDRP-LNYMEGMQDSKFCLCPPG---DTPTSPRLF-DAIVSGCVPVIISDG 376 (464)
T ss_pred CccccCCc-chHHHHhhcCeEEECCCC---CCcccHhHH-HHHHhCCccEEEcCC
Confidence 43 2222 334456699999998421 234568888 899999999999753
No 5
>PF13524 Glyco_trans_1_2: Glycosyl transferases group 1
Probab=92.48 E-value=0.23 Score=39.45 Aligned_cols=60 Identities=13% Similarity=0.111 Sum_probs=47.1
Q ss_pred ccchHHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 264 DYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 264 DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
+.++-|+| +++..|++.|.-.. ..+..++.++ .+|-++ ++++|++-|.+|.+|+++.++.
T Consensus 9 ~~~~~r~~-E~~a~G~~vi~~~~--~~~~~~~~~~~~~~~~~---~~~el~~~i~~ll~~~~~~~~i 69 (92)
T PF13524_consen 9 DGPNMRIF-EAMACGTPVISDDS--PGLREIFEDGEHIITYN---DPEELAEKIEYLLENPEERRRI 69 (92)
T ss_pred CCCchHHH-HHHHCCCeEEECCh--HHHHHHcCCCCeEEEEC---CHHHHHHHHHHHHCCHHHHHHH
Confidence 67888999 79999977666544 3566778777 555444 8999999999999999887665
No 6
>PF00159 Hormone_3: Pancreatic hormone peptide; InterPro: IPR001955 Pancreatic hormone (PP) [] is a peptide synthesized in pancreatic islets of Langherhans, which acts as a regulator of pancreatic and gastrointestinal functions. The hormone is produced as a larger propeptide, which is enzymatically cleaved to yield the mature active peptide: this is 36 amino acids in length [] and has an amidated C terminus []. The hormone has a globular structure, residues 2-8 forming a left-handed poly-proline-II-like helix, residues 9-13 a beta turn, and 14-32 an alpha-helix,held close to the first helix by hydrophobic interactions []. Unlike glucagon, another peptide hormone, the structure of pancreatic peptide is preserved in aqueous solution []. Both N and C termini are required for activity: receptor binding and activation functions may reside in the N and C termini respectively []. Pancreatic hormone is part of a wider family of active peptides that includes: Neuropeptide Y (NPY) [], one of the most abundant peptides in the mammalian nervous system. NPY is implicated in the control of feeding and the secretion of the gonadotrophin-releasing hormone. Peptide YY (PYY) []. PPY is a gut peptide that inhibits exocrine pancreatic secretion, has a vasoconstrictory action and inhibits jejunal and colonic mobility. Various NPY and PYY-like polypeptides from fish and amphibians [, ]. Neuropeptide F (NPF) from invertebrates such as worms and snail. Skin peptide Tyr-Tyr (SPYY) from the frog Phyllomedusa bicolor. SPYY shows a large spectra of antibacterial and antifungal activity. All these peptides are 36 to 39 amino acids long. Like most active peptides, their C-terminal is amidated and they are synthesized as larger protein precursors.; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1LJV_A 1BBA_A 1V1D_A 1PPT_A 2H3T_A 2H4B_A 2BF9_A 2H3S_B 1K8V_A 2DF0_A ....
Probab=72.12 E-value=3.8 Score=28.60 Aligned_cols=17 Identities=41% Similarity=0.688 Sum_probs=15.1
Q ss_pred CCCCCHHHHHHHHHHHc
Q psy5439 304 DDFPSAKHLAEYLNKLD 320 (381)
Q Consensus 304 ~dF~S~~eLA~yL~~L~ 320 (381)
.+|+|+++|+.|+..|.
T Consensus 9 ~~~aspeel~~Y~~~L~ 25 (36)
T PF00159_consen 9 GDFASPEELAQYYAALR 25 (36)
T ss_dssp STTSSHHHHHHHHHHHH
T ss_pred CCCCCHHHHHHHHHHHH
Confidence 47899999999999993
No 7
>cd00126 PAH Pancreatic Hormone domain, a regulator of pancreatic and gastrointestinal functions; neuropeptide Y (NPY)b, peptide YY (PYY), and pancreatic polypetide (PP) are closely related; propeptide is enzymatically cleaved to yield the mature active peptide with amidated C-terminal ends; receptor binding and activation functions may reside in the N- and C-termini respectively; occurs in neurons, intestinal endocrine cells, and pancreas; exist as monomers and dimers
Probab=69.82 E-value=4.6 Score=28.25 Aligned_cols=17 Identities=35% Similarity=0.483 Sum_probs=14.8
Q ss_pred CCCCCHHHHHHHHHHHc
Q psy5439 304 DDFPSAKHLAEYLNKLD 320 (381)
Q Consensus 304 ~dF~S~~eLA~yL~~L~ 320 (381)
.||+|+++|++|+..|.
T Consensus 9 g~~a~~eel~~Y~~~L~ 25 (36)
T cd00126 9 GDDASPEELRQYLAALR 25 (36)
T ss_pred CCCCCHHHHHHHHHHHH
Confidence 46789999999999983
No 8
>cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran.
Probab=66.92 E-value=11 Score=34.63 Aligned_cols=78 Identities=13% Similarity=0.051 Sum_probs=44.8
Q ss_pred hcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHH
Q psy5439 247 LDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLY 326 (381)
Q Consensus 247 l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y 326 (381)
+++...+++.- |..+. ..=++. +|+..|+.+|+.-.. ...+.++..+.--.+-+..++++||+.|..|..|++.+
T Consensus 249 ~~~~ad~~i~p--s~~e~-~~~~~~-Ea~a~G~Pvi~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~i~~ll~~~~~~ 323 (348)
T cd03820 249 YYAKASIFVLT--SRFEG-FPMVLL-EAMAFGLPVISFDCP-TGPSEIIEDGVNGLLVPNGDVEALAEALLRLMEDEELR 323 (348)
T ss_pred HHHhCCEEEeC--ccccc-cCHHHH-HHHHcCCCEEEecCC-CchHhhhccCcceEEeCCCCHHHHHHHHHHHHcCHHHH
Confidence 33455555542 22222 234787 699999766654211 12223344331222224457999999999999998876
Q ss_pred HHH
Q psy5439 327 NSY 329 (381)
Q Consensus 327 ~~Y 329 (381)
.+.
T Consensus 324 ~~~ 326 (348)
T cd03820 324 KRM 326 (348)
T ss_pred HHH
Confidence 654
No 9
>smart00309 PAH Pancreatic hormones / neuropeptide F / peptide YY family. Pancreatic hormone is a regulator of pancreatic and gastrointestinal functions.
Probab=61.68 E-value=7.7 Score=27.12 Aligned_cols=17 Identities=35% Similarity=0.483 Sum_probs=14.5
Q ss_pred CCCCCHHHHHHHHHHHc
Q psy5439 304 DDFPSAKHLAEYLNKLD 320 (381)
Q Consensus 304 ~dF~S~~eLA~yL~~L~ 320 (381)
.|+.|+++||+|+..|.
T Consensus 9 g~~a~~e~l~~Y~~~L~ 25 (36)
T smart00309 9 GDDASPEDLRQYLAALR 25 (36)
T ss_pred CCCCCHHHHHHHHHHHH
Confidence 36789999999999983
No 10
>cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase.
Probab=55.17 E-value=27 Score=32.67 Aligned_cols=80 Identities=10% Similarity=-0.053 Sum_probs=45.7
Q ss_pred cccceEEEeeccCCCC-cc-chHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHH
Q psy5439 248 DTKYKFYLAFENSNCV-DY-ITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTL 325 (381)
Q Consensus 248 ~~~YKFyLAfENS~c~-DY-VTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~ 325 (381)
++...+.++.-..... +. +.=|+. +|+..|+ ||+..... ....++..+..-.+-+..++++||+.|..+..|++.
T Consensus 292 ~~~~di~i~~~~~~~~~~~~~p~~~~-Ea~~~G~-pvi~~~~~-~~~~~~~~~~~g~~~~~~~~~~l~~~i~~~~~~~~~ 368 (394)
T cd03794 292 LAAADVGLVPLKPGPAFEGVSPSKLF-EYMAAGK-PVLASVDG-ESAELVEEAGAGLVVPPGDPEALAAAILELLDDPEE 368 (394)
T ss_pred HHhhCeeEEeccCcccccccCchHHH-HHHHCCC-cEEEecCC-CchhhhccCCcceEeCCCCHHHHHHHHHHHHhChHH
Confidence 3555655554332221 22 244777 6999995 55544321 223334332111122334799999999999999988
Q ss_pred HHHHh
Q psy5439 326 YNSYF 330 (381)
Q Consensus 326 Y~~Yf 330 (381)
+.++-
T Consensus 369 ~~~~~ 373 (394)
T cd03794 369 RAEMG 373 (394)
T ss_pred HHHHH
Confidence 87653
No 11
>cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold.
Probab=53.49 E-value=27 Score=32.08 Aligned_cols=77 Identities=9% Similarity=-0.017 Sum_probs=46.6
Q ss_pred hcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHH
Q psy5439 247 LDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLY 326 (381)
Q Consensus 247 l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y 326 (381)
+++...+.|..-.. +-..-|+. +|+..|+..|+...+ ....++..+....+-+..++++|++.|..+..|++.+
T Consensus 272 ~~~~~di~i~~~~~---~~~~~~~~-Ea~~~g~pvI~~~~~--~~~~~~~~~~~g~~~~~~~~~~l~~~i~~~~~~~~~~ 345 (374)
T cd03801 272 LYAAADVFVLPSLY---EGFGLVLL-EAMAAGLPVVASDVG--GIPEVVEDGETGLLVPPGDPEALAEAILRLLDDPELR 345 (374)
T ss_pred HHHhcCEEEecchh---ccccchHH-HHHHcCCcEEEeCCC--ChhHHhcCCcceEEeCCCCHHHHHHHHHHHHcChHHH
Confidence 34556666643322 33455888 699999655544322 3344454233222334456999999999998888876
Q ss_pred HHH
Q psy5439 327 NSY 329 (381)
Q Consensus 327 ~~Y 329 (381)
.+.
T Consensus 346 ~~~ 348 (374)
T cd03801 346 RRL 348 (374)
T ss_pred HHH
Confidence 654
No 12
>PF00534 Glycos_transf_1: Glycosyl transferases group 1; InterPro: IPR001296 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins containign this domain transfer UDP, ADP, GDP or CMP linked sugars to a variety of substrates, including glycogen, fructose-6-phosphate and lipopolysaccharides. The bacterial enzymes are involved in various biosynthetic processes that include exopolysaccharide biosynthesis, lipopolysaccharide core biosynthesis and the biosynthesis of the slime polysaccaride colanic acid. Mutations in this domain of the human N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein are the cause of paroxysmal nocturnal hemoglobinuria (PNH), an acquired hemolytic blood disorder characterised by venous thrombosis, erythrocyte hemolysis, infections and defective hematopoiesis.; GO: 0009058 biosynthetic process; PDB: 2L7C_A 2IV3_B 2IUY_B 2XA9_A 2XA1_B 2X6R_A 2XMP_B 2XA2_B 2X6Q_A 3QHP_B ....
Probab=53.27 E-value=17 Score=31.37 Aligned_cols=78 Identities=12% Similarity=0.081 Sum_probs=49.7
Q ss_pred HHhhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCC-ccccCCCCCHHHHHHHHHHHccC
Q psy5439 244 FRLLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNS-YIHVDDFPSAKHLAEYLNKLDAN 322 (381)
Q Consensus 244 ~~~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~S-fI~v~dF~S~~eLA~yL~~L~~n 322 (381)
+..+++...+++..=.. +.+=+ .+. +|+..|+.+|+...+ ....++..+. -+-+++. ++++||+-|..+..|
T Consensus 86 l~~~~~~~di~v~~s~~--e~~~~-~~~-Ea~~~g~pvI~~~~~--~~~e~~~~~~~g~~~~~~-~~~~l~~~i~~~l~~ 158 (172)
T PF00534_consen 86 LDELYKSSDIFVSPSRN--EGFGL-SLL-EAMACGCPVIASDIG--GNNEIINDGVNGFLFDPN-DIEELADAIEKLLND 158 (172)
T ss_dssp HHHHHHHTSEEEE-BSS--BSS-H-HHH-HHHHTT-EEEEESST--HHHHHSGTTTSEEEESTT-SHHHHHHHHHHHHHH
T ss_pred cccccccceeccccccc--ccccc-ccc-cccccccceeecccc--CCceeeccccceEEeCCC-CHHHHHHHHHHHHCC
Confidence 34455677777776444 33333 676 699999988887654 3345555444 2334444 899999999999998
Q ss_pred HHHHHH
Q psy5439 323 DTLYNS 328 (381)
Q Consensus 323 ~~~Y~~ 328 (381)
++.+.+
T Consensus 159 ~~~~~~ 164 (172)
T PF00534_consen 159 PELRQK 164 (172)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 855544
No 13
>cd05844 GT1_like_7 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center
Probab=52.91 E-value=29 Score=33.40 Aligned_cols=58 Identities=19% Similarity=0.146 Sum_probs=37.0
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
=++. +|+..| +||+.+... ...+++..+ .-+-+ +-.++++||+-|..+.+|++.+.++
T Consensus 285 ~~~~-EA~a~G-~PvI~s~~~-~~~e~i~~~~~g~~~-~~~d~~~l~~~i~~l~~~~~~~~~~ 343 (367)
T cd05844 285 VVLL-EAQASG-VPVVATRHG-GIPEAVEDGETGLLV-PEGDVAALAAALGRLLADPDLRARM 343 (367)
T ss_pred hHHH-HHHHcC-CCEEEeCCC-CchhheecCCeeEEE-CCCCHHHHHHHHHHHHcCHHHHHHH
Confidence 3677 699999 577765432 223333322 22222 2357899999999999998866554
No 14
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Probab=47.48 E-value=40 Score=31.07 Aligned_cols=61 Identities=13% Similarity=0.087 Sum_probs=38.3
Q ss_pred hHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 267 TEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 267 TEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
.=++. +|+..|+ ||+.... .....++..+-.-.+-+..+++++|+-|..+..|++.+.+.-
T Consensus 277 ~~~~~-Ea~~~G~-Pvi~s~~-~~~~~~i~~~~~g~~~~~~~~~~~~~~i~~l~~~~~~~~~~~ 337 (359)
T cd03808 277 PRVLL-EAMAMGR-PVIATDV-PGCREAVIDGVNGFLVPPGDAEALADAIERLIEDPELRARMG 337 (359)
T ss_pred chHHH-HHHHcCC-CEEEecC-CCchhhhhcCcceEEECCCCHHHHHHHHHHHHhCHHHHHHHH
Confidence 45787 6999995 5555332 223444432211112244579999999999999988777653
No 15
>cd04946 GT1_AmsK_like This family is most closely related to the GT1 family of glycosyltransferases. AmsK is involved in the biosynthesis of amylovoran, which functions as a virulence factor. It functions as a glycosyl transferase which transfers galactose from UDP-galactose to a lipid-linked amylovoran-subunit precursor. The members of this family are found mainly in bacteria and Archaea.
Probab=45.03 E-value=30 Score=35.13 Aligned_cols=56 Identities=16% Similarity=0.350 Sum_probs=38.1
Q ss_pred HHHHHhhhcCCcceEE---EcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 268 EKLYINGLSHNVLPIV---MGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 268 EKl~~~al~~g~VPIv---~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
=++. +|+..| +||+ .|+- .+++..+ .=+-+.+-.++++||+-|..|.+|++++.++
T Consensus 325 ~~ll-EAma~G-~PVIas~vgg~----~e~i~~~~~G~l~~~~~~~~~la~~I~~ll~~~~~~~~m 384 (407)
T cd04946 325 VSIM-EAMSFG-IPVIATNVGGT----PEIVDNGGNGLLLSKDPTPNELVSSLSKFIDNEEEYQTM 384 (407)
T ss_pred HHHH-HHHHcC-CCEEeCCCCCc----HHHhcCCCcEEEeCCCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 3566 699999 6888 4442 2334433 2223444457999999999999999888754
No 16
>cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light.
Probab=43.77 E-value=44 Score=32.39 Aligned_cols=57 Identities=19% Similarity=0.198 Sum_probs=36.8
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
++. +|+..| +||+.-.. ....+++..+ .-+.+ +..++++||+.|..|.+|++.+.++
T Consensus 318 ~l~-Ea~a~G-~Pvi~s~~-~~~~e~i~~~~~g~~~-~~~~~~~l~~~i~~l~~~~~~~~~~ 375 (398)
T cd03800 318 TAL-EAMACG-LPVVATAV-GGPRDIVVDGVTGLLV-DPRDPEALAAALRRLLTDPALRRRL 375 (398)
T ss_pred HHH-HHHhcC-CCEEECCC-CCHHHHccCCCCeEEe-CCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 577 699999 68775432 2233444433 22223 3347999999999999988776544
No 17
>TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate.
Probab=43.77 E-value=43 Score=32.68 Aligned_cols=59 Identities=12% Similarity=0.071 Sum_probs=37.7
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
=++. +||..| +|||.... ....+++..+-.--+-+-.++++||+-|..|.+|++.+.+.
T Consensus 287 ~~~l-EAma~G-~Pvv~s~~-~g~~e~i~~~~~g~~~~~~d~~~la~~i~~l~~~~~~~~~~ 345 (374)
T TIGR03088 287 NTIL-EAMASG-LPVIATAV-GGNPELVQHGVTGALVPPGDAVALARALQPYVSDPAARRAH 345 (374)
T ss_pred hHHH-HHHHcC-CCEEEcCC-CCcHHHhcCCCceEEeCCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 3666 699999 58876543 23344444332111223367999999999998888776544
No 18
>cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides.
Probab=43.37 E-value=52 Score=30.95 Aligned_cols=58 Identities=16% Similarity=0.118 Sum_probs=37.1
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
-.+. .|+..|. ||+...... .+.+...+.-+-+ +..+++++|+.|..+.+|++++.++
T Consensus 284 ~~~~-Ea~a~G~-PvI~~~~~~-~~~i~~~~~g~~~-~~~d~~~~~~~l~~l~~~~~~~~~~ 341 (366)
T cd03822 284 GVLA-YAIGFGK-PVISTPVGH-AEEVLDGGTGLLV-PPGDPAALAEAIRRLLADPELAQAL 341 (366)
T ss_pred hHHH-HHHHcCC-CEEecCCCC-hheeeeCCCcEEE-cCCCHHHHHHHHHHHHcChHHHHHH
Confidence 3566 6899997 777654322 2333333322222 3346999999999999998877654
No 19
>cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=38.96 E-value=54 Score=31.06 Aligned_cols=58 Identities=14% Similarity=0.088 Sum_probs=34.4
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCC-CCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPM-NSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp-~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
++. +|+..| +||+....... ..++.. +-.-.+-+..+++++|+.|..|.+|++.++++
T Consensus 281 ~~~-Ea~~~g-~Pvi~~~~~~~-~~~i~~~~~~g~~~~~~d~~~~~~~i~~l~~~~~~~~~~ 339 (357)
T cd03795 281 VLL-EAMAFG-KPVISTEIGTG-GSYVNLHGVTGLVVPPGDPAALAEAIRRLLEDPELRERL 339 (357)
T ss_pred HHH-HHHHcC-CCEEecCCCCc-hhHHhhCCCceEEeCCCCHHHHHHHHHHHHHCHHHHHHH
Confidence 466 688888 55554322111 122221 21112234568999999999999999876654
No 20
>cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases. capH in Staphylococcus aureus has been shown to be required for the biosynthesis of the type 1 capsular polysaccharide (CP1).
Probab=38.04 E-value=48 Score=31.58 Aligned_cols=59 Identities=15% Similarity=0.021 Sum_probs=36.9
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHHhh
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSYFR 331 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf~ 331 (381)
=++. +||..|+.+|.--.+ ....++..+ .++.. -.++++||+.|..|-+|++..++.-.
T Consensus 281 ~~~l-EAma~G~PvI~s~~~--~~~~~i~~~~~~~~~--~~~~~~~a~~i~~l~~~~~~~~~~~~ 340 (358)
T cd03812 281 LVLI-EAQASGLPCILSDTI--TKEVDLTDLVKFLSL--DESPEIWAEEILKLKSEDRRERSSES 340 (358)
T ss_pred HHHH-HHHHhCCCEEEEcCC--chhhhhccCccEEeC--CCCHHHHHHHHHHHHhCcchhhhhhh
Confidence 3566 699999655554222 122333333 33332 23689999999999999998876643
No 21
>PLN02871 UDP-sulfoquinovose:DAG sulfoquinovosyltransferase
Probab=37.96 E-value=57 Score=33.66 Aligned_cols=56 Identities=14% Similarity=0.219 Sum_probs=36.5
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCC-----CCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPM-----NSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp-----~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
+++ +|+..| +|||.-.. ....+++.. +.++- +-.++++||+.|..|.+|++...++
T Consensus 347 ~vl-EAmA~G-~PVI~s~~-gg~~eiv~~~~~~~~G~lv--~~~d~~~la~~i~~ll~~~~~~~~~ 407 (465)
T PLN02871 347 VVL-EAMASG-VPVVAARA-GGIPDIIPPDQEGKTGFLY--TPGDVDDCVEKLETLLADPELRERM 407 (465)
T ss_pred HHH-HHHHcC-CCEEEcCC-CCcHhhhhcCCCCCceEEe--CCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 577 699999 88885332 223344443 22221 2347999999999999898876543
No 22
>cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide.
Probab=37.16 E-value=71 Score=30.03 Aligned_cols=57 Identities=12% Similarity=0.065 Sum_probs=37.0
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
+++ +|+..|+.+|.-..+ ...+++..+.++ -+..++++|++.|..+..|++.+.+.-
T Consensus 288 ~~~-Ea~a~G~pvI~~~~~--~~~e~~~~~~~~--~~~~~~~~~~~~i~~l~~~~~~~~~~~ 344 (365)
T cd03809 288 PVL-EAMACGTPVIASNIS--SLPEVAGDAALY--FDPLDPEALAAAIERLLEDPALREELR 344 (365)
T ss_pred CHH-HHhcCCCcEEecCCC--CccceecCceee--eCCCCHHHHHHHHHHHhcCHHHHHHHH
Confidence 577 699999644443221 233444434332 233479999999999989998877654
No 23
>cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II).
Probab=36.68 E-value=58 Score=30.37 Aligned_cols=58 Identities=14% Similarity=0.131 Sum_probs=37.2
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCC-ccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNS-YIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~S-fI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
++. +|+..| +||+.-.. ....+++..+. -+.+ +..++++||+-|..+-+|++.+++.-
T Consensus 279 ~~~-Ea~a~G-~Pvi~~~~-~~~~e~i~~~~~g~~~-~~~d~~~l~~~i~~l~~~~~~~~~~~ 337 (359)
T cd03823 279 VIR-EALAAG-VPVIASDI-GGMAELVRDGVNGLLF-PPGDAEDLAAALERLIDDPDLLERLR 337 (359)
T ss_pred HHH-HHHHCC-CCEEECCC-CCHHHHhcCCCcEEEE-CCCCHHHHHHHHHHHHhChHHHHHHH
Confidence 677 699999 56655432 22344444421 1222 23358999999999999998877653
No 24
>cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=36.03 E-value=83 Score=29.47 Aligned_cols=59 Identities=14% Similarity=0.072 Sum_probs=38.1
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
++. +|+..|+ ||+.... ....+++..+..-.+-+-.++++||+-|..+..|++.+++.-
T Consensus 282 ~~l-Ea~a~g~-PvI~~~~-~~~~~~i~~~~~g~~~~~~~~~~l~~~i~~l~~~~~~~~~~~ 340 (364)
T cd03814 282 VVL-EAMASGL-PVVAPDA-GGPADIVTDGENGLLVEPGDAEAFAAALAALLADPELRRRMA 340 (364)
T ss_pred HHH-HHHHcCC-CEEEcCC-CCchhhhcCCcceEEcCCCCHHHHHHHHHHHHcCHHHHHHHH
Confidence 577 6999996 5554433 233445544222223344578999999999999998876654
No 25
>cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor.
Probab=32.41 E-value=95 Score=29.32 Aligned_cols=78 Identities=10% Similarity=0.058 Sum_probs=45.2
Q ss_pred hcccceEEEeecc----CCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHcc
Q psy5439 247 LDTKYKFYLAFEN----SNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDA 321 (381)
Q Consensus 247 l~~~YKFyLAfEN----S~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~ 321 (381)
+++...++|+.=. ...+.+ .=|+. +|+..|+..|+-.. .....++..+ .=+.+ +-.++++||+.|..+.+
T Consensus 252 ~~~~adi~l~~s~~~~~~~~e~~-~~~~~-Ea~a~G~Pvi~~~~--~~~~~~i~~~~~g~~~-~~~~~~~l~~~i~~~~~ 326 (355)
T cd03799 252 LLRAADLFVLPSVTAADGDREGL-PVVLM-EAMAMGLPVISTDV--SGIPELVEDGETGLLV-PPGDPEALADAIERLLD 326 (355)
T ss_pred HHHhCCEEEecceecCCCCccCc-cHHHH-HHHHcCCCEEecCC--CCcchhhhCCCceEEe-CCCCHHHHHHHHHHHHh
Confidence 3456666665311 111332 56788 69999966655322 2233444433 22222 23379999999999999
Q ss_pred CHHHHHHH
Q psy5439 322 NDTLYNSY 329 (381)
Q Consensus 322 n~~~Y~~Y 329 (381)
|++.+.+.
T Consensus 327 ~~~~~~~~ 334 (355)
T cd03799 327 DPELRREM 334 (355)
T ss_pred CHHHHHHH
Confidence 98876554
No 26
>cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis.
Probab=30.67 E-value=1.2e+02 Score=28.87 Aligned_cols=59 Identities=10% Similarity=-0.023 Sum_probs=37.6
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
=++. +|+..|+..|+...+ .+..++..+ .-+-+ +..++++||+-|..+.+|++.+.+.-
T Consensus 279 ~~~~-Eam~~g~PvI~~~~~--~~~e~~~~~~~g~~~-~~~~~~~~~~~l~~l~~~~~~~~~~~ 338 (365)
T cd03825 279 NTAI-EALACGTPVVAFDVG--GIPDIVDHGVTGYLA-KPGDPEDLAEGIEWLLADPDEREELG 338 (365)
T ss_pred HHHH-HHHhcCCCEEEecCC--CChhheeCCCceEEe-CCCCHHHHHHHHHHHHhCHHHHHHHH
Confidence 4666 699999776665432 233444433 22222 33479999999999999988666543
No 27
>PRK13893 conjugal transfer protein TrbM; Provisional
Probab=30.42 E-value=31 Score=32.50 Aligned_cols=58 Identities=19% Similarity=0.395 Sum_probs=43.1
Q ss_pred CCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 261 NCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 261 ~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
.|.-|.+-++- + +..+.+|.|.|.+.. -|=.++..||+ .+||+|-+.+...+....++
T Consensus 132 ~C~ay~~h~yT-d-~~~~~~prYVG~~~~-------gG~WV~~~~y~--~a~a~y~~r~~~~~~~~~~~ 189 (193)
T PRK13893 132 YCAAYTGHAYT-D-FASGTLPRYVGTPER-------GGYWVEARDYD--RALAEYNERIRREDEERRRQ 189 (193)
T ss_pred HHHHHHhCCCc-e-eccCCCceEeccCCC-------CCeecChhHHH--HHHHHHHHhhchHhhhhccc
Confidence 57777766653 3 666789999998742 26678888885 89999999998777765554
No 28
>PF13692 Glyco_trans_1_4: Glycosyl transferases group 1; PDB: 3OY2_A 3OY7_B 2Q6V_A 2HY7_A 3CV3_A 3CUY_A.
Probab=29.70 E-value=32 Score=28.33 Aligned_cols=69 Identities=14% Similarity=0.051 Sum_probs=36.9
Q ss_pred hhcccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCC-CCCccccCCCCCHHHHHHHHHHHccC
Q psy5439 246 LLDTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAP-MNSYIHVDDFPSAKHLAEYLNKLDAN 322 (381)
Q Consensus 246 ~l~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~P-p~SfI~v~dF~S~~eLA~yL~~L~~n 322 (381)
.+++.+.+.|+.-... .=..=|++ +++..|..+|+.+.+ +..+.. .+.=+-+ ..++++|++-|..|.+|
T Consensus 66 ~~l~~~dv~l~p~~~~--~~~~~k~~-e~~~~G~pvi~~~~~---~~~~~~~~~~~~~~--~~~~~~l~~~i~~l~~d 135 (135)
T PF13692_consen 66 EILAAADVGLIPSRFN--EGFPNKLL-EAMAAGKPVIASDNG---AEGIVEEDGCGVLV--ANDPEELAEAIERLLND 135 (135)
T ss_dssp HHHHC-SEEEE-BSS---SCC-HHHH-HHHCTT--EEEEHHH---CHCHS---SEEEE---TT-HHHHHHHHHHHHH-
T ss_pred HHHHhCCEEEEEeeCC--CcCcHHHH-HHHHhCCCEEECCcc---hhhheeecCCeEEE--CCCHHHHHHHHHHHhcC
Confidence 3457788887643211 12346888 799999877776652 223222 1221222 56899999999998654
No 29
>cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II).
Probab=29.26 E-value=1.2e+02 Score=30.23 Aligned_cols=58 Identities=12% Similarity=0.056 Sum_probs=35.9
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
++. +||..|+ |||.... ....+++..+.---+-+..++++||+-|..|.+|++...++
T Consensus 316 ~ll-EAmA~G~-PVIas~~-~g~~e~i~~~~~G~lv~~~d~~~la~~i~~ll~~~~~~~~l 373 (396)
T cd03818 316 SLL-EAMACGC-LVVGSDT-APVREVITDGENGLLVDFFDPDALAAAVIELLDDPARRARL 373 (396)
T ss_pred HHH-HHHHCCC-CEEEcCC-CCchhhcccCCceEEcCCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 677 6999996 5544322 23344444321111234557999999999999998765543
No 30
>cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases. wlbH in Bordetella parapertussis has been shown to be required for the biosynthesis of a trisaccharide that, when attached to the B. pertussis lipopolysaccharide (LPS) core (band B), generates band A LPS.
Probab=28.92 E-value=97 Score=28.57 Aligned_cols=72 Identities=10% Similarity=-0.026 Sum_probs=42.0
Q ss_pred cccceEEEeeccCCCCccchHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHH
Q psy5439 248 DTKYKFYLAFENSNCVDYITEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTL 325 (381)
Q Consensus 248 ~~~YKFyLAfENS~c~DYVTEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~ 325 (381)
+....+.|..--. +=+.=|+. +|+..|+..|+-.. .....++..+..-.+-+..++++|++-|..+..++..
T Consensus 276 ~~~ad~~i~~~~~---~~~~~~~~-Ea~~~G~pvI~~~~--~~~~~~~~~~~~g~~~~~~~~~~l~~~i~~~~~~~~~ 347 (377)
T cd03798 276 YAAADVFVLPSLR---EGFGLVLL-EAMACGLPVVATDV--GGIPEIITDGENGLLVPPGDPEALAEAILRLLADPWL 347 (377)
T ss_pred HHhcCeeecchhh---ccCChHHH-HHHhcCCCEEEecC--CChHHHhcCCcceeEECCCCHHHHHHHHHHHhcCcHH
Confidence 3455555542211 23445787 69999975554322 2233444444431233445799999999999888763
No 31
>COG0438 RfaG Glycosyltransferase [Cell envelope biogenesis, outer membrane]
Probab=26.54 E-value=1.7e+02 Score=26.11 Aligned_cols=57 Identities=12% Similarity=0.136 Sum_probs=39.3
Q ss_pred HHHhhhcCCcceEEEcCCccccccCCCCC--CccccCCCCCHHHHHHHHHHHccCHHHHHHHhh
Q psy5439 270 LYINGLSHNVLPIVMGARPEDYKKAAPMN--SYIHVDDFPSAKHLAEYLNKLDANDTLYNSYFR 331 (381)
Q Consensus 270 l~~~al~~g~VPIv~G~~~~dy~~~~Pp~--SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf~ 331 (381)
+. .|+..| +||+..... ....++..+ .+ +....+++++++.+..+.++.+.++....
T Consensus 293 ~~-Ea~a~g-~pvi~~~~~-~~~e~~~~~~~g~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 351 (381)
T COG0438 293 LL-EAMAAG-TPVIASDVG-GIPEVVEDGETGL--LVPPGDVEELADALEQLLEDPELREELGE 351 (381)
T ss_pred HH-HHHhcC-CcEEECCCC-ChHHHhcCCCceE--ecCCCCHHHHHHHHHHHhcCHHHHHHHHH
Confidence 55 688899 999887653 233444443 33 22333789999999999999877777765
No 32
>PRK15427 colanic acid biosynthesis glycosyltransferase WcaL; Provisional
Probab=25.99 E-value=1.3e+02 Score=30.72 Aligned_cols=59 Identities=15% Similarity=0.146 Sum_probs=37.1
Q ss_pred hHHHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHcc-CHHHHHHH
Q psy5439 267 TEKLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDA-NDTLYNSY 329 (381)
Q Consensus 267 TEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~-n~~~Y~~Y 329 (381)
-=++. +|+..| +|||--.. ....+++..+ +=+-+ +-.++++||+-|..|.+ |++...++
T Consensus 318 p~~ll-EAma~G-~PVI~t~~-~g~~E~v~~~~~G~lv-~~~d~~~la~ai~~l~~~d~~~~~~~ 378 (406)
T PRK15427 318 PVALM-EAMAVG-IPVVSTLH-SGIPELVEADKSGWLV-PENDAQALAQRLAAFSQLDTDELAPV 378 (406)
T ss_pred cHHHH-HHHhCC-CCEEEeCC-CCchhhhcCCCceEEe-CCCCHHHHHHHHHHHHhCCHHHHHHH
Confidence 45777 699999 57765322 1233444332 22222 33579999999999988 88766544
No 33
>KOG1890|consensus
Probab=24.94 E-value=54 Score=36.31 Aligned_cols=37 Identities=27% Similarity=0.442 Sum_probs=26.3
Q ss_pred cceEEEeeccCCCCccc-hHHHHHhhh--------cCCcceEEEcCC
Q psy5439 250 KYKFYLAFENSNCVDYI-TEKLYINGL--------SHNVLPIVMGAR 287 (381)
Q Consensus 250 ~YKFyLAfENS~c~DYV-TEKl~~~al--------~~g~VPIv~G~~ 287 (381)
+||=-=+=||++|-.|| ||.+. +-+ -+|+|||+|.-+
T Consensus 263 RYkRRGvDe~gNvANyVETEqil-~~~~H~LSFvQ~RGSvPvfWSQ~ 308 (949)
T KOG1890|consen 263 RYKRRGVDEAGNVANYVETEQIL-NIFEHELSFVQCRGSVPVFWSQR 308 (949)
T ss_pred hhhhcCcccccchhhhhhHHHHH-HhhhceeeEEEeecccceeeccc
Confidence 34333444889999998 67775 433 369999999875
No 34
>TIGR03087 stp1 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate.
Probab=24.39 E-value=1.6e+02 Score=29.35 Aligned_cols=56 Identities=14% Similarity=0.138 Sum_probs=34.9
Q ss_pred hHHHHHhhhcCCcceEEEcCCccccccCC-CCCCccccCCCCCHHHHHHHHHHHccCHHHHHH
Q psy5439 267 TEKLYINGLSHNVLPIVMGARPEDYKKAA-PMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNS 328 (381)
Q Consensus 267 TEKl~~~al~~g~VPIv~G~~~~dy~~~~-Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~ 328 (381)
.-|+. +||..|+ |||--... .+.+. -.+.-+-+. .++++||+-|..|.+|+++..+
T Consensus 312 ~~~~l-EAma~G~-PVV~t~~~--~~~i~~~~~~g~lv~--~~~~~la~ai~~ll~~~~~~~~ 368 (397)
T TIGR03087 312 QNKVL-EAMAMAK-PVVASPEA--AEGIDALPGAELLVA--ADPADFAAAILALLANPAEREE 368 (397)
T ss_pred ccHHH-HHHHcCC-CEEecCcc--cccccccCCcceEeC--CCHHHHHHHHHHHHcCHHHHHH
Confidence 34888 6999995 77764311 11111 122222333 5899999999999988876544
No 35
>cd03816 GT1_ALG1_like This family is most closely related to the GT1 family of glycosyltransferases. The yeast gene ALG1 has been shown to function as a mannosyltransferase that catalyzes the formation of dolichol pyrophosphate (Dol-PP)-GlcNAc2Man from GDP-Man and Dol-PP-Glc-NAc2, and participates in the formation of the lipid-linked precursor oligosaccharide for N-glycosylation. In humans ALG1 has been associated with the congenital disorders of glycosylation (CDG) designated as subtype CDG-Ik.
Probab=23.82 E-value=1.2e+02 Score=30.78 Aligned_cols=57 Identities=19% Similarity=0.170 Sum_probs=38.1
Q ss_pred chHHHHHhhhcCCcceEEEcCCccccccCCC--CCCccccCCCCCHHHHHHHHHHHccC---HHHHHHH
Q psy5439 266 ITEKLYINGLSHNVLPIVMGARPEDYKKAAP--MNSYIHVDDFPSAKHLAEYLNKLDAN---DTLYNSY 329 (381)
Q Consensus 266 VTEKl~~~al~~g~VPIv~G~~~~dy~~~~P--p~SfI~v~dF~S~~eLA~yL~~L~~n---~~~Y~~Y 329 (381)
+-=|++ +|+..|. ||+..... ...+++- .+.++ + .++++||+-|..|.+| ++.++++
T Consensus 330 ~p~~~~-Eama~G~-PVI~s~~~-~~~eiv~~~~~G~l-v---~d~~~la~~i~~ll~~~~~~~~~~~m 391 (415)
T cd03816 330 LPMKVV-DMFGCGL-PVCALDFK-CIDELVKHGENGLV-F---GDSEELAEQLIDLLSNFPNRGKLNSL 391 (415)
T ss_pred CcHHHH-HHHHcCC-CEEEeCCC-CHHHHhcCCCCEEE-E---CCHHHHHHHHHHHHhcCCCHHHHHHH
Confidence 455888 6999995 88764322 2223332 23333 3 3799999999999888 7877766
No 36
>cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases. WbnK in Shigella dysenteriae has been shown to be involved in the type 7 O-antigen biosynthesis.
Probab=22.73 E-value=1.5e+02 Score=27.49 Aligned_cols=57 Identities=18% Similarity=0.222 Sum_probs=36.1
Q ss_pred hHHHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHH
Q psy5439 267 TEKLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNS 328 (381)
Q Consensus 267 TEKl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~ 328 (381)
.=++. +|+..|+ ||+.-.. .....++..+.++ -+..++++|++.|..+.+|++.+.+
T Consensus 282 ~~~~~-Ea~a~g~-PvI~~~~-~~~~e~~~~~g~~--~~~~~~~~l~~~i~~l~~~~~~~~~ 338 (365)
T cd03807 282 PNVLL-EAMACGL-PVVATDV-GDNAELVGDTGFL--VPPGDPEALAEAIEALLADPALRQA 338 (365)
T ss_pred CcHHH-HHHhcCC-CEEEcCC-CChHHHhhcCCEE--eCCCCHHHHHHHHHHHHhChHHHHH
Confidence 34787 6999995 5555322 2233444433332 2345799999999999988776654
No 37
>PRK15484 lipopolysaccharide 1,2-N-acetylglucosaminetransferase; Provisional
Probab=22.36 E-value=1.5e+02 Score=29.72 Aligned_cols=54 Identities=9% Similarity=0.139 Sum_probs=34.4
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCC-CccccCCCCCHHHHHHHHHHHccCHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMN-SYIHVDDFPSAKHLAEYLNKLDANDTL 325 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~-SfI~v~dF~S~~eLA~yL~~L~~n~~~ 325 (381)
.+. +|+..| +||+-.... ...+++-.+ .=+.+.+-.++++||+.|..|.+|+++
T Consensus 293 ~~l-EAma~G-~PVI~s~~g-g~~Eiv~~~~~G~~l~~~~d~~~la~~I~~ll~d~~~ 347 (380)
T PRK15484 293 VAV-EAMAAG-KPVLASTKG-GITEFVLEGITGYHLAEPMTSDSIISDINRTLADPEL 347 (380)
T ss_pred HHH-HHHHcC-CCEEEeCCC-CcHhhcccCCceEEEeCCCCHHHHHHHHHHHHcCHHH
Confidence 466 699999 777764321 223344332 212233445799999999999988864
No 38
>PF14281 PDDEXK_4: PD-(D/E)XK nuclease superfamily
Probab=21.40 E-value=78 Score=28.20 Aligned_cols=120 Identities=16% Similarity=0.191 Sum_probs=67.0
Q ss_pred cceEEEEEeccCCcchHHHHHHHHhcCCeEeEcCCCCC--------CCCCC-----chhHHHhhc-ccceEEEeeccCCC
Q psy5439 197 THKVAWFVSNCGAKNNRLQYAHELGKYIEVDIYGSCGS--------KKCPR-----ISKCFRLLD-TKYKFYLAFENSNC 262 (381)
Q Consensus 197 ~k~vaw~vSnC~~~~~R~~yv~eL~k~i~VD~yG~C~~--------~~c~~-----~~~c~~~l~-~~YKFyLAfENS~c 262 (381)
++++||+......+.....|++.+.+.+.....+.+.. -.+.+ ...-.+.++ ..-+++|++||-.-
T Consensus 17 S~~la~lLdp~~~hg~~~~fl~~fl~~~~~~~~~~~~~~~~~~~~~~~v~~E~~~~~~~riDi~i~~~~~~~iiIEnKi~ 96 (179)
T PF14281_consen 17 SNFLAWLLDPKESHGLGDLFLKSFLKALGEKDNSEKEIYDPYDFESIEVEREVSTESGGRIDILIDENDKFVIIIENKIY 96 (179)
T ss_pred HHHHHHHhCCccccccHHHHHHHHHHHhhhhcccccccccccccCceEEEEeeccCCCCCccEEEEeCCCEEEEEEEcCC
Confidence 46789999888777777888888877654443332211 01111 123345566 78899999999832
Q ss_pred -Ccc--chHHHHHhh--h---cCCcceEEEc---CCccccccCCC--CCCccccCCCCCHHHHHHHHHHHccC
Q psy5439 263 -VDY--ITEKLYING--L---SHNVLPIVMG---ARPEDYKKAAP--MNSYIHVDDFPSAKHLAEYLNKLDAN 322 (381)
Q Consensus 263 -~DY--VTEKl~~~a--l---~~g~VPIv~G---~~~~dy~~~~P--p~SfI~v~dF~S~~eLA~yL~~L~~n 322 (381)
.+. -.++....+ . ....+-||+- ..+.. .-.+ .+.|+.++ -++|+..|..+...
T Consensus 97 a~~~~~QL~rY~~~~~~~~~~~~~~~~i~Lt~~g~~p~~--~~~~~~~~~~~~is----y~~i~~~l~~~~~~ 163 (179)
T PF14281_consen 97 ASDQENQLERYYKYLKKRYDKEKNIYLIYLTPDGSNPSE--KSLSEKKNHFIVIS----YEDILSWLEECLNE 163 (179)
T ss_pred CCcCHHHHHHHHHHHHHhccCcCCEEEEEEecCCCCcch--hhccccccceEEEc----HHHHHHHHHHHHHh
Confidence 111 123333111 1 2446888882 11111 1122 24566553 48898888887655
No 39
>cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=21.29 E-value=2.3e+02 Score=27.15 Aligned_cols=58 Identities=16% Similarity=0.264 Sum_probs=34.8
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHH
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSY 329 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Y 329 (381)
.+. +|+..| +||+.-.. ....+++..+.--.+-+..++++||+-|..|.+|++.+.++
T Consensus 286 ~~~-EAma~g-~PvI~s~~-~~~~e~i~~~~~G~~~~~~~~~~l~~~i~~l~~~~~~~~~~ 343 (371)
T cd04962 286 AAL-EAMACG-VPVVASNA-GGIPEVVKHGETGFLVDVGDVEAMAEYALSLLEDDELWQEF 343 (371)
T ss_pred HHH-HHHHcC-CCEEEeCC-CCchhhhcCCCceEEcCCCCHHHHHHHHHHHHhCHHHHHHH
Confidence 466 699999 55554322 12334443321111223357999999999998888876553
No 40
>cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide.
Probab=21.27 E-value=1.6e+02 Score=27.41 Aligned_cols=57 Identities=16% Similarity=0.132 Sum_probs=36.3
Q ss_pred HHHHhhhcCCcceEEEcCCccccccCCCCCCccccCCCCCHHHHHHHHHHHccCHHHHHHHh
Q psy5439 269 KLYINGLSHNVLPIVMGARPEDYKKAAPMNSYIHVDDFPSAKHLAEYLNKLDANDTLYNSYF 330 (381)
Q Consensus 269 Kl~~~al~~g~VPIv~G~~~~dy~~~~Pp~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~~Yf 330 (381)
++. +|+..|+ ||+.... .....++..++-+-+.+ ++++||+-|..|.+|++.+.++-
T Consensus 297 ~~~-Eama~G~-PvI~~~~-~~~~~~~~~~~~~~~~~--~~~~~~~~i~~l~~~~~~~~~~~ 353 (375)
T cd03821 297 VVA-EALACGT-PVVTTDK-VPWQELIEYGCGWVVDD--DVDALAAALRRALELPQRLKAMG 353 (375)
T ss_pred HHH-HHHhcCC-CEEEcCC-CCHHHHhhcCceEEeCC--ChHHHHHHHHHHHhCHHHHHHHH
Confidence 577 6999995 5555322 22344444433333332 45999999999999987776653
No 41
>cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have
Probab=20.74 E-value=1.8e+02 Score=27.44 Aligned_cols=54 Identities=6% Similarity=0.043 Sum_probs=32.6
Q ss_pred HHHHHhhhcCCcceEEEcCCccccccCCCC-CCccccCCCCCHHHHHHHHHHHccCHHHHH
Q psy5439 268 EKLYINGLSHNVLPIVMGARPEDYKKAAPM-NSYIHVDDFPSAKHLAEYLNKLDANDTLYN 327 (381)
Q Consensus 268 EKl~~~al~~g~VPIv~G~~~~dy~~~~Pp-~SfI~v~dF~S~~eLA~yL~~L~~n~~~Y~ 327 (381)
=++. +|+..|+.+|+.-. ....+++.+ +.++.. .++++||+.|..+.+++..+.
T Consensus 277 ~~~~-Ea~a~G~PvI~~~~--~~~~e~i~~~g~~~~~---~~~~~~~~~i~~ll~~~~~~~ 331 (360)
T cd04951 277 LVVA-EAMACELPVVATDA--GGVREVVGDSGLIVPI---SDPEALANKIDEILKMSGEER 331 (360)
T ss_pred hHHH-HHHHcCCCEEEecC--CChhhEecCCceEeCC---CCHHHHHHHHHHHHhCCHHHH
Confidence 3677 69999985554322 123344443 333332 579999999999975444443
Done!