Query psy6303
Match_columns 270
No_of_seqs 125 out of 773
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 22:58:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6303.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6303hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03183 acetylglucosaminyltra 100.0 1.1E-42 2.5E-47 329.9 19.9 208 3-270 78-293 (421)
2 PF02485 Branch: Core-2/I-Bran 100.0 4.1E-40 8.8E-45 292.8 14.1 180 17-270 11-192 (244)
3 KOG0799|consensus 100.0 7E-36 1.5E-40 286.5 17.2 193 13-270 110-303 (439)
4 PF00535 Glycos_transf_2: Glyc 89.0 4.1 8.9E-05 31.9 9.1 100 15-127 7-107 (169)
5 cd02525 Succinoglycan_BP_ExoA 87.4 15 0.00032 31.3 12.1 96 14-124 8-104 (249)
6 TIGR01556 rhamnosyltran L-rham 85.7 11 0.00024 33.5 10.8 93 18-125 7-100 (281)
7 cd04187 DPM1_like_bac Bacteria 82.7 18 0.00039 29.5 10.1 99 14-124 5-103 (181)
8 cd04179 DPM_DPG-synthase_like 82.0 24 0.00052 28.5 10.6 109 14-135 5-115 (185)
9 PRK11204 N-glycosyltransferase 79.4 49 0.0011 31.3 13.2 107 14-133 62-169 (420)
10 PRK14716 bacteriophage N4 adso 78.2 37 0.00081 33.8 12.2 97 15-116 75-175 (504)
11 PRK14583 hmsR N-glycosyltransf 72.1 73 0.0016 30.7 12.3 108 14-134 83-191 (444)
12 TIGR03472 HpnI hopanoid biosyn 71.2 59 0.0013 30.5 11.2 107 15-134 50-161 (373)
13 cd06439 CESA_like_1 CESA_like_ 69.1 73 0.0016 27.3 12.2 95 14-125 37-133 (251)
14 TIGR03469 HonB hopene-associat 68.1 1.1E+02 0.0024 28.9 13.4 114 14-133 48-167 (384)
15 cd02520 Glucosylceramide_synth 67.7 70 0.0015 26.5 10.4 98 15-124 10-109 (196)
16 cd04184 GT2_RfbC_Mx_like Myxoc 67.5 68 0.0015 26.3 11.3 105 15-132 10-117 (202)
17 cd06442 DPM1_like DPM1_like re 65.6 79 0.0017 26.4 10.4 97 14-124 5-101 (224)
18 cd02526 GT2_RfbF_like RfbF is 65.1 43 0.00093 28.3 8.3 91 15-123 6-97 (237)
19 cd04192 GT_2_like_e Subfamily 61.4 94 0.002 25.8 11.5 98 14-124 5-105 (229)
20 cd04195 GT2_AmsE_like GT2_AmsE 58.8 1E+02 0.0022 25.3 12.0 111 5-132 2-114 (201)
21 cd04186 GT_2_like_c Subfamily 58.0 87 0.0019 24.3 11.8 104 14-133 5-109 (166)
22 COG4746 Uncharacterized protei 55.6 9.1 0.0002 28.2 1.9 24 106-129 16-39 (80)
23 cd04185 GT_2_like_b Subfamily 55.5 1.2E+02 0.0025 25.0 9.9 97 14-123 5-101 (202)
24 TIGR00762 DegV EDD domain prot 53.5 1.4E+02 0.003 26.9 9.9 93 17-135 63-157 (275)
25 cd02510 pp-GalNAc-T pp-GalNAc- 53.3 1.5E+02 0.0032 26.5 10.0 105 5-126 2-111 (299)
26 PF07747 MTH865: MTH865-like f 51.2 8.7 0.00019 28.3 1.2 24 106-129 11-34 (75)
27 PF07521 RMMBL: RNA-metabolisi 48.1 20 0.00043 23.1 2.5 28 12-46 14-41 (43)
28 PF13641 Glyco_tranf_2_3: Glyc 47.2 84 0.0018 26.4 7.1 109 14-134 9-121 (228)
29 cd02511 Beta4Glucosyltransfera 45.4 1.9E+02 0.0042 24.6 9.7 92 14-128 8-101 (229)
30 cd04196 GT_2_like_d Subfamily 45.3 1.7E+02 0.0037 23.9 10.3 96 14-124 6-102 (214)
31 cd06423 CESA_like CESA_like is 45.1 1.4E+02 0.003 22.8 10.4 94 14-122 5-99 (180)
32 cd06434 GT2_HAS Hyaluronan syn 42.2 2.1E+02 0.0045 24.0 11.4 106 5-132 4-110 (235)
33 COG4746 Uncharacterized protei 41.8 20 0.00043 26.4 1.9 38 72-124 38-75 (80)
34 cd04188 DPG_synthase DPG_synth 40.5 2.1E+02 0.0046 23.7 11.0 98 14-124 5-105 (211)
35 COG1216 Predicted glycosyltran 40.1 2.8E+02 0.0061 25.0 11.1 97 15-126 12-112 (305)
36 PF11051 Mannosyl_trans3: Mann 36.3 2E+02 0.0044 25.9 8.1 105 2-121 1-112 (271)
37 cd06435 CESA_NdvC_like NdvC_li 35.8 2.7E+02 0.0058 23.4 11.3 104 5-124 2-107 (236)
38 cd06421 CESA_CelA_like CESA_Ce 32.7 2.9E+02 0.0063 22.9 10.7 92 18-124 14-107 (234)
39 PLN02726 dolichyl-phosphate be 32.7 3.2E+02 0.0069 23.4 11.9 98 14-124 17-116 (243)
40 COG1307 DegV Uncharacterized p 31.4 4.1E+02 0.0089 24.3 9.6 95 15-134 63-159 (282)
41 cd06427 CESA_like_2 CESA_like_ 30.5 3.4E+02 0.0074 23.1 11.3 99 14-125 9-111 (241)
42 PRK15489 nfrB bacteriophage N4 29.2 5.1E+02 0.011 27.1 10.4 113 14-133 79-195 (703)
43 TIGR03568 NeuC_NnaA UDP-N-acet 27.9 2.2E+02 0.0047 26.8 7.0 63 3-70 202-268 (365)
44 COG1215 Glycosyltransferases, 27.6 4.6E+02 0.0099 24.5 9.3 100 15-126 63-165 (439)
45 PF01361 Tautomerase: Tautomer 27.1 2E+02 0.0043 19.2 5.1 42 3-47 1-44 (60)
46 TIGR03111 glyc2_xrt_Gpos1 puta 26.5 5.8E+02 0.013 24.5 13.1 102 14-128 57-161 (439)
47 cd00491 4Oxalocrotonate_Tautom 26.5 1.9E+02 0.0042 18.9 5.4 43 3-48 1-45 (58)
48 PF07894 DUF1669: Protein of u 25.3 2.8E+02 0.0061 25.6 7.0 37 14-56 156-192 (284)
49 PRK10714 undecaprenyl phosphat 25.2 5.4E+02 0.012 23.6 13.1 100 14-125 14-114 (325)
50 PF06956 RtcR: Regulator of RN 25.0 1.2E+02 0.0026 26.1 4.2 17 16-32 79-95 (183)
51 PF07745 Glyco_hydro_53: Glyco 24.3 2.9E+02 0.0064 26.0 7.1 94 15-114 146-242 (332)
52 PRK11234 nfrB bacteriophage N4 24.3 8.4E+02 0.018 25.6 11.2 114 15-135 72-189 (727)
53 TIGR00031 UDP-GALP_mutase UDP- 24.1 1.2E+02 0.0027 28.9 4.7 54 39-94 320-373 (377)
54 PF02645 DegV: Uncharacterised 24.1 4.6E+02 0.0099 23.5 8.2 94 16-134 63-158 (280)
55 PF04084 ORC2: Origin recognit 23.7 1.5E+02 0.0032 27.8 5.0 48 23-70 121-173 (326)
56 cd06437 CESA_CaSu_A2 Cellulose 23.3 4.5E+02 0.0097 22.1 11.3 94 14-121 9-107 (232)
57 cd02514 GT13_GLCNAC-TI GT13_GL 21.7 6.8E+02 0.015 23.6 9.7 95 14-115 8-113 (334)
58 COG3618 Predicted metal-depend 21.3 4.4E+02 0.0096 24.3 7.4 26 14-44 142-167 (279)
No 1
>PLN03183 acetylglucosaminyltransferase family protein; Provisional
Probab=100.00 E-value=1.1e-42 Score=329.88 Aligned_cols=208 Identities=28% Similarity=0.405 Sum_probs=155.7
Q ss_pred CeEEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHH-HHH------hhccCCCeEEeccccee
Q psy6303 3 PHIVDTVHRRELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRE-LLS------LELRLPNVRLSRRRQAT 75 (270)
Q Consensus 3 ~~~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~-~~~------~~~~~~nV~~~~~r~~v 75 (270)
|.+.+++.. -.++.+||+||+++| |||+|+||||+|+||+..+.. +.. +..+++||++++++..|
T Consensus 78 ~r~AYLI~~--h~~d~~~l~RLL~aL------YhprN~y~IHlDkKS~~~er~~l~~~v~~~~~~~~~~NV~vl~k~~~V 149 (421)
T PLN03183 78 PRFAYLVSG--SKGDLEKLWRTLRAL------YHPRNQYVVHLDLESPAEERLELASRVENDPMFSKVGNVYMITKANLV 149 (421)
T ss_pred CeEEEEEEe--cCCcHHHHHHHHHHh------cCCCceEEEEecCCCChHHHHHHHHHhhccchhhccCcEEEEecceee
Confidence 445555443 225669999999999 999999999999999855432 221 33578999999989999
Q ss_pred eeCchhHHHHHHHHHHHHHhcCCCccEEEeccCCcccccChHH-HHHHHhcCCCCceeecccCCccceeeeccCcchhhh
Q psy6303 76 IWGGASLVTILLDSMQQLLESGWRWDFIVNLSESDYPVKTNRA-LVDFLSANRDKNFVKSHGRQVHRFISKQGLDKSFVE 154 (270)
Q Consensus 76 ~WGg~S~v~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks~~~-i~~fl~~~~g~~Fi~~~~~~~~~~~~~~~~~r~~~~ 154 (270)
+|||+|||+|||+||+.+++...+|||||||||+||||+|+++ |+.|++.|.|+|||++... ..|...
T Consensus 150 ~WGG~S~V~AtL~~m~~LL~~~~~WDyfinLSGsDyPLkTqdelI~~F~~~nr~~NFI~~~s~--~~wk~~--------- 218 (421)
T PLN03183 150 TYRGPTMVANTLHACAILLKRSKDWDWFINLSASDYPLVTQDDLIHTFSTLDRNLNFIEHTSQ--LGWKEE--------- 218 (421)
T ss_pred ccCChHHHHHHHHHHHHHHhhCCCCCEEEEccCCcccccCHHHHHHHHHhCCCCceeeecccc--cccchh---------
Confidence 9999999999999999999988999999999999999999999 5789999999999998652 233211
Q ss_pred hccceeeeccccCCcceeecCCCCccccccccccceecCcchhHHhhhcccccccCCCChhHHHHHhhhcCceeeecccc
Q psy6303 155 CEARMWRVGDRTLPQGIVMDGGSGTGWLLSPIFVSYVASPEKDELVRGLLNPIQSTPLLPAELLMEQLANAGIVMDGGSD 234 (270)
Q Consensus 155 ~~~~~~r~~~r~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~qk~~~~~~~~~~~r~~~~~i~~~~GSq 234 (270)
+|..+ +..++|. |... +.. +....+ .|..|.++++|+|||
T Consensus 219 ---------~r~~~--~i~~pgl------------~~~~--ks~----~~~~~~-----------~R~~P~~~~lf~GS~ 258 (421)
T PLN03183 219 ---------KRAMP--LIIDPGL------------YSTN--KSD----IYWVTP-----------RRSLPTAFKLFTGSA 258 (421)
T ss_pred ---------hhcce--EEecCce------------eecc--cch----hhhhhh-----------hccCCccccccCCCc
Confidence 11111 1112221 1100 000 011111 223478899999999
Q ss_pred ceeecHhHHHHhcCCCcchhHHHHHHHhcccCCCCC
Q psy6303 235 WLVLSRSFVSYVASPEKDELVRGLLTLFKYTLLPAE 270 (270)
Q Consensus 235 W~sLtr~~v~yil~~~~~~~~~~~~~~f~~t~~PDE 270 (270)
||+|||+||+||++. +++.++.+++||.+|++|||
T Consensus 259 W~sLSR~fvey~l~~-~dnlpr~ll~y~~~t~~pdE 293 (421)
T PLN03183 259 WMVLSRSFVEYCIWG-WDNLPRTLLMYYTNFVSSPE 293 (421)
T ss_pred eEEecHHHHHHHHhc-ccchHHHHHHHHhcCCCCch
Confidence 999999999999985 56666778899999999999
No 2
>PF02485 Branch: Core-2/I-Branching enzyme; InterPro: IPR003406 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This is the glycosyltransferase family 14 GT14 from CAZY, a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme (2.4.1.150 from EC) and core-2 branching enzyme (2.4.1.102 from EC). I-branching enzyme, an integral membrane protein, converts linear into branched poly-N-acetyllactosaminoglycans in the glycosylation pathway, and is responsible for the production of the blood group I-antigen during embryonic development []. Core-2 branching enzyme, also an integral membrane protein, forms crucial side-chain branches in O-glycans in the glycosylation pathway [].; GO: 0008375 acetylglucosaminyltransferase activity, 0016020 membrane; PDB: 3OTK_D 2GAM_A 2GAK_B.
Probab=100.00 E-value=4.1e-40 Score=292.82 Aligned_cols=180 Identities=29% Similarity=0.445 Sum_probs=113.2
Q ss_pred CHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHH-HHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHHh
Q psy6303 17 KQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLF-RELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLLE 95 (270)
Q Consensus 17 ~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~-~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al~ 95 (270)
+++|+++|++++ ++|+|.||||||+|++..+ .++.++..+++||+++++|++|.|||+|||+|||.||++|++
T Consensus 11 ~~~~~~~l~~~l------~~~~~~f~iHiD~k~~~~~~~~~~~~~~~~~nv~~v~~r~~v~WG~~S~v~A~l~ll~~al~ 84 (244)
T PF02485_consen 11 DPEQLERLLRLL------YHPDNDFYIHIDKKSPDYFYEEIKKLISCFPNVHFVPKRVDVRWGGFSLVEATLNLLREALK 84 (244)
T ss_dssp -HHHHHHHHHHH--------TTSEEEEEE-TTS-HHHHHHHHHHHCT-TTEEE-SS-----TTSHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHh------cCCCCEEEEEEcCCCChHHHHHHHHhcccCCceeecccccccccCCccHHHHHHHHHHHHHh
Confidence 679999999999 9999999999999988444 445667789999999999999999999999999999999998
Q ss_pred cCCCccEEEeccCCcccccChHHHHHHHhcC-CCCceeecccCCccceeeeccCcchhhhhccceeeeccccCCcceeec
Q psy6303 96 SGWRWDFIVNLSESDYPVKTNRALVDFLSAN-RDKNFVKSHGRQVHRFISKQGLDKSFVECEARMWRVGDRTLPQGIVMD 174 (270)
Q Consensus 96 ~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~-~g~~Fi~~~~~~~~~~~~~~~~~r~~~~~~~~~~r~~~r~~~~~~~~~ 174 (270)
.+++|||||||||+||||+|+++|.+||+.+ .+.+|+++...+..... .|+. ....... .
T Consensus 85 ~~~~~~y~~llSg~D~Pl~s~~~i~~~l~~~~~~~~f~~~~~~~~~~~~-----~r~~---------~~~~~~~---~-- 145 (244)
T PF02485_consen 85 RDGDWDYFILLSGQDYPLKSNEEIHEFLESNNGDNNFIESFSDEDPRES-----GRYN---------PRIYDPF---R-- 145 (244)
T ss_dssp H-S---EEEEEETTEEESS-HHHHHHHHHHTTT--B---BEE--GGGG------HHHH---------EEEETTE---E--
T ss_pred cCCCCcEEEEcccccccccchHHHHHHHHhcCCCCcceecccccccchh-----hcce---------eeeeeec---c--
Confidence 6679999999999999999999999999998 45788888765431100 1110 0000000 0
Q ss_pred CCCCccccccccccceecCcchhHHhhhcccccccCCCChhHHHHHhhhcCceeeeccccceeecHhHHHHhcCCCcchh
Q psy6303 175 GGSGTGWLLSPIFVSYVASPEKDELVRGLLNPIQSTPLLPAELLMEQLANAGIVMDGGSDWLVLSRSFVSYVASPEKDEL 254 (270)
Q Consensus 175 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~qk~~~~~~~~~~~r~~~~~i~~~~GSqW~sLtr~~v~yil~~~~~~~ 254 (270)
+ . .+. -++|+|||||+|||++|+||++. ...
T Consensus 146 ---------------~-------------------~------------~~~-~~~~~GSqW~~Ltr~~v~~il~~--~~~ 176 (244)
T PF02485_consen 146 ---------------P-------------------F------------FRK-RTLYKGSQWFSLTRDFVEYILDD--PNY 176 (244)
T ss_dssp ---------------E-------------------E------------EEE-E--EEE-S--EEEHHHHHHHHH---HHH
T ss_pred ---------------c-------------------c------------ccc-ccccccceeeEeeHHHHHHhhhh--HHH
Confidence 0 0 000 06899999999999999999962 333
Q ss_pred HHHHHHHhcccCCCCC
Q psy6303 255 VRGLLTLFKYTLLPAE 270 (270)
Q Consensus 255 ~~~~~~~f~~t~~PDE 270 (270)
.+.+..++++|+||||
T Consensus 177 ~~~~~~~~~~~~~pDE 192 (244)
T PF02485_consen 177 RPKLKKYFRFSLCPDE 192 (244)
T ss_dssp HHHHHHHT-TSSSGGG
T ss_pred HHHHHHhhcCccCcch
Confidence 2344555569999999
No 3
>KOG0799|consensus
Probab=100.00 E-value=7e-36 Score=286.54 Aligned_cols=193 Identities=30% Similarity=0.501 Sum_probs=157.4
Q ss_pred hhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHH-HHHhhccCCCeEEecccceeeeCchhHHHHHHHHHH
Q psy6303 13 ELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRE-LLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 13 ~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~-~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
.+++|.++++|+++++ |||+|.||||||++|+.+|+. +.+++.|++||++++++..|.|||+|+++|+++||+
T Consensus 110 ~v~kd~~~verll~ai------YhPqN~ycihvD~~s~~~fk~~~~~L~~cf~NV~v~~k~~~v~~~G~s~l~a~l~c~~ 183 (439)
T KOG0799|consen 110 VVYKDYEQVERLLQAI------YHPQNVYCIHVDAKSPPEFRVAMQQLASCFPNVIVLPKRESVTYGGHSILAAHLNCLA 183 (439)
T ss_pred eecccHHHHHHHHHHH------hCCcCcceEEECCCCCHHHHHHHHHHHhcCCceEEeccccceecCCchhhHHHHHHHH
Confidence 3677889999999999 999999999999999988885 566899999999999999999999999999999999
Q ss_pred HHHhcCCCccEEEeccCCcccccChHHHHHHHhcCCCCceeecccCCccceeeeccCcchhhhhccceeeeccccCCcce
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVKSHGRQVHRFISKQGLDKSFVECEARMWRVGDRTLPQGI 171 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~~~~~~~~~~~~~~~~~r~~~~~~~~~~r~~~r~~~~~~ 171 (270)
.+++.+++|+|+++|||+||||||++||.++|+..+|.|||+...... |+..++..+.
T Consensus 184 ~Ll~~~~~W~yfinLs~~D~PlkT~~elv~i~~~L~g~N~i~~~~~~~--------------------~~~~~~~k~~-- 241 (439)
T KOG0799|consen 184 DLLKLSGDWDYFINLSNSDYPLKTNDELVRIFKILRGANFVEHTSEIG--------------------WKLNRKAKWD-- 241 (439)
T ss_pred HHHhcCCCCceeeeccCCCcccCCHHHHHHHHHHcCCcccccCccccc--------------------HHHhcccCCc--
Confidence 999988899999999999999999999999999999999999865432 1111111110
Q ss_pred eecCCCCccccccccccceecCcchhHHhhhcccccccCCCChhHHHHHhhhcCceeeeccccceeecHhHHHHhcCCCc
Q psy6303 172 VMDGGSGTGWLLSPIFVSYVASPEKDELVRGLLNPIQSTPLLPAELLMEQLANAGIVMDGGSDWLVLSRSFVSYVASPEK 251 (270)
Q Consensus 172 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~qk~~~~~~~~~~~r~~~~~i~~~~GSqW~sLtr~~v~yil~~~~ 251 (270)
..+.| |... + ...+ ....|.++++++||.|++|||++|+|++.
T Consensus 242 ~~~~~-------------~~~~--------~-----s~~~--------~~~lp~~~ki~~Gs~~~~LsR~fv~y~i~--- 284 (439)
T KOG0799|consen 242 IIDLK-------------YFRN--------K-----SPLP--------WVILPTALKLFKGSAWVSLSRAFVEYLIS--- 284 (439)
T ss_pred ccccc-------------hhee--------c-----CCCc--------cccCCCceEEEecceeEEEeHHHHHHHhc---
Confidence 00000 0000 0 0000 00147899999999999999999999999
Q ss_pred chhHHHHHHHhcccCCCCC
Q psy6303 252 DELVRGLLTLFKYTLLPAE 270 (270)
Q Consensus 252 ~~~~~~~~~~f~~t~~PDE 270 (270)
++.++.+++++++|++|||
T Consensus 285 ~~~~~~ll~~~~~t~~~dE 303 (439)
T KOG0799|consen 285 GNLPRTLLMYYNNTYSPDE 303 (439)
T ss_pred CccHHHHHHHHhCccCcch
Confidence 4667999999999999998
No 4
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=89.00 E-value=4.1 Score=31.88 Aligned_cols=100 Identities=18% Similarity=0.136 Sum_probs=64.2
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLL 94 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al 94 (270)
+++++.|.+++.+|... ..+...++|--|...+.....+.+.....+++.++....+. +.-.|--.+++.|.
T Consensus 7 ~n~~~~l~~~l~sl~~q---~~~~~eiivvdd~s~d~~~~~~~~~~~~~~~i~~i~~~~n~-----g~~~~~n~~~~~a~ 78 (169)
T PF00535_consen 7 YNEAEYLERTLESLLKQ---TDPDFEIIVVDDGSTDETEEILEEYAESDPNIRYIRNPENL-----GFSAARNRGIKHAK 78 (169)
T ss_dssp SS-TTTHHHHHHHHHHH---SGCEEEEEEEECS-SSSHHHHHHHHHCCSTTEEEEEHCCCS-----HHHHHHHHHHHH--
T ss_pred eCCHHHHHHHHHHHhhc---cCCCEEEEEeccccccccccccccccccccccccccccccc-----cccccccccccccc
Confidence 44467888888888444 24677777766666444333334444457899998644332 45666677777764
Q ss_pred hcCCCccEEEeccCCcccccC-hHHHHHHHhcCC
Q psy6303 95 ESGWRWDFIVNLSESDYPVKT-NRALVDFLSANR 127 (270)
Q Consensus 95 ~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~ 127 (270)
-+|+..+.+.|++... .+++.+++++++
T Consensus 79 -----~~~i~~ld~D~~~~~~~l~~l~~~~~~~~ 107 (169)
T PF00535_consen 79 -----GEYILFLDDDDIISPDWLEELVEALEKNP 107 (169)
T ss_dssp -----SSEEEEEETTEEE-TTHHHHHHHHHHHCT
T ss_pred -----eeEEEEeCCCceEcHHHHHHHHHHHHhCC
Confidence 2499999999999887 778888888753
No 5
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=87.40 E-value=15 Score=31.32 Aligned_cols=96 Identities=10% Similarity=0.056 Sum_probs=55.9
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcC-CCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANR-DKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~-p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
++++++.+.+++.++.+.. +. .+..++|--|...+.....+.++....+.|+++...- -| .-.|--.+++.
T Consensus 8 ~~n~~~~l~~~l~sl~~q~--~~~~~~evivvd~~s~d~~~~~~~~~~~~~~~v~~i~~~~---~~---~~~a~N~g~~~ 79 (249)
T cd02525 8 VRNEEKYIEELLESLLNQS--YPKDLIEIIVVDGGSTDGTREIVQEYAAKDPRIRLIDNPK---RI---QSAGLNIGIRN 79 (249)
T ss_pred cCCchhhHHHHHHHHHhcc--CCCCccEEEEEeCCCCccHHHHHHHHHhcCCeEEEEeCCC---CC---chHHHHHHHHH
Confidence 4567788899888883322 22 2334555545544433333344555678899986431 12 22344444444
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
| ..||++.+.+.|.| +...+.+.++
T Consensus 80 a-----~~d~v~~lD~D~~~--~~~~l~~~~~ 104 (249)
T cd02525 80 S-----RGDIIIRVDAHAVY--PKDYILELVE 104 (249)
T ss_pred h-----CCCEEEEECCCccC--CHHHHHHHHH
Confidence 3 57999999999987 5555555553
No 6
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=85.67 E-value=11 Score=33.54 Aligned_cols=93 Identities=14% Similarity=0.028 Sum_probs=58.8
Q ss_pred HHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHHhcC
Q psy6303 18 QEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLLESG 97 (270)
Q Consensus 18 ~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al~~~ 97 (270)
.+.|++++.+| ..++.-.|=||-.|+.. ..+.+.....++|+++....+. +.-.|-=.+++.|++
T Consensus 7 ~~~l~~~l~sl-------~~q~~~iiVVDN~S~~~-~~~~~~~~~~~~i~~i~~~~N~-----G~a~a~N~Gi~~a~~-- 71 (281)
T TIGR01556 7 LEHLGELITSL-------PKQVDRIIAVDNSPHSD-QPLKNARLRGQKIALIHLGDNQ-----GIAGAQNQGLDASFR-- 71 (281)
T ss_pred HHHHHHHHHHH-------HhcCCEEEEEECcCCCc-HhHHHHhccCCCeEEEECCCCc-----chHHHHHHHHHHHHH--
Confidence 58999999999 23456677788775422 2222334467899998644333 233455566777774
Q ss_pred CCccEEEeccCCcccccC-hHHHHHHHhc
Q psy6303 98 WRWDFIVNLSESDYPVKT-NRALVDFLSA 125 (270)
Q Consensus 98 ~~~dy~~lLSg~D~Plks-~~~i~~fl~~ 125 (270)
.++||+.+|-.++.|-.. .+++.+.++.
T Consensus 72 ~~~d~i~~lD~D~~~~~~~l~~l~~~~~~ 100 (281)
T TIGR01556 72 RGVQGVLLLDQDSRPGNAFLAAQWKLLSA 100 (281)
T ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHh
Confidence 357999999999999632 2334444443
No 7
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=82.66 E-value=18 Score=29.48 Aligned_cols=99 Identities=16% Similarity=0.162 Sum_probs=55.2
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|+.++.+.+++.+|.+.+....+.-.++|--|...+.....+.+.....++|.++... -+.| ...|.-.+++.|
T Consensus 5 ~~n~~~~l~~~l~sl~~~~~~~~~~~eiivvdd~s~d~t~~~~~~~~~~~~~i~~i~~~--~n~G---~~~a~n~g~~~a 79 (181)
T cd04187 5 VYNEEENLPELYERLKAVLESLGYDYEIIFVDDGSTDRTLEILRELAARDPRVKVIRLS--RNFG---QQAALLAGLDHA 79 (181)
T ss_pred ecCchhhHHHHHHHHHHHHHhcCCCeEEEEEeCCCCccHHHHHHHHHhhCCCEEEEEec--CCCC---cHHHHHHHHHhc
Confidence 45567888888888866554223455666654544443222234444567888887532 2233 334444455554
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
. -+|++.+.+.+. + +.+.+.+.++
T Consensus 80 ~-----~d~i~~~D~D~~-~-~~~~l~~l~~ 103 (181)
T cd04187 80 R-----GDAVITMDADLQ-D-PPELIPEMLA 103 (181)
T ss_pred C-----CCEEEEEeCCCC-C-CHHHHHHHHH
Confidence 2 278888877555 4 4445555544
No 8
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=82.05 E-value=24 Score=28.52 Aligned_cols=109 Identities=13% Similarity=0.097 Sum_probs=62.8
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|++++.+.+++.++.+.... .+.-.++|--|...+.....+.+.....+.++++....+. +...|.-.+++.|
T Consensus 5 ~~n~~~~l~~~l~sl~~~~~~-~~~~eiivvd~~s~d~~~~~~~~~~~~~~~~~~~~~~~n~-----G~~~a~n~g~~~a 78 (185)
T cd04179 5 AYNEEENIPELVERLLAVLEE-GYDYEIIVVDDGSTDGTAEIARELAARVPRVRVIRLSRNF-----GKGAAVRAGFKAA 78 (185)
T ss_pred ccChHhhHHHHHHHHHHHhcc-CCCEEEEEEcCCCCCChHHHHHHHHHhCCCeEEEEccCCC-----CccHHHHHHHHHh
Confidence 355668888888888544421 2344555555555443333333444456666665433222 2446666666666
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHHh--cCCCCceeecc
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFLS--ANRDKNFVKSH 135 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl~--~~~g~~Fi~~~ 135 (270)
- + +|++.|.+.|.+ +.+.+.+.++ .+.+...+-..
T Consensus 79 ~---g--d~i~~lD~D~~~--~~~~l~~l~~~~~~~~~~~v~g~ 115 (185)
T cd04179 79 R---G--DIVVTMDADLQH--PPEDIPKLLEKLLEGGADVVIGS 115 (185)
T ss_pred c---C--CEEEEEeCCCCC--CHHHHHHHHHHHhccCCcEEEEE
Confidence 3 2 899999999876 5666776665 34455555443
No 9
>PRK11204 N-glycosyltransferase; Provisional
Probab=79.37 E-value=49 Score=31.29 Aligned_cols=107 Identities=14% Similarity=0.113 Sum_probs=62.7
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|++.+.+.+.+.++. +...|+..++|=-|.+.+.....+.+....++++.++....+ +| ...| +..+
T Consensus 62 ~yne~~~i~~~l~sl~---~q~yp~~eiiVvdD~s~d~t~~~l~~~~~~~~~v~~i~~~~n---~G--ka~a----ln~g 129 (420)
T PRK11204 62 CYNEGENVEETISHLL---ALRYPNYEVIAINDGSSDNTGEILDRLAAQIPRLRVIHLAEN---QG--KANA----LNTG 129 (420)
T ss_pred cCCCHHHHHHHHHHHH---hCCCCCeEEEEEECCCCccHHHHHHHHHHhCCcEEEEEcCCC---CC--HHHH----HHHH
Confidence 3556678888888772 224676677775565555433333445567888988863222 23 2333 3333
Q ss_pred HhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCceee
Q psy6303 94 LESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFVK 133 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi~ 133 (270)
++ ..+.||++.+-.++.|-.. .+++.+.++++++...+.
T Consensus 130 ~~-~a~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~ 169 (420)
T PRK11204 130 AA-AARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVT 169 (420)
T ss_pred HH-HcCCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEE
Confidence 32 2357999999888887432 345666665555544443
No 10
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=78.17 E-value=37 Score=33.80 Aligned_cols=97 Identities=19% Similarity=0.125 Sum_probs=58.0
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLL 94 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al 94 (270)
|++.+-+.+++.++...+ +.|+-.++|=.|.+.+.....+.++...+|+++++..+. =|+.+-..|--.+++.+.
T Consensus 75 ~NE~~vI~~~l~s~L~~l--dY~~~eIiVv~d~ndd~T~~~v~~l~~~~p~v~~vv~~~---~gp~~Ka~aLN~~l~~~~ 149 (504)
T PRK14716 75 WREADVIGRMLEHNLATL--DYENYRIFVGTYPNDPATLREVDRLAARYPRVHLVIVPH---DGPTSKADCLNWIYQAIF 149 (504)
T ss_pred cCchhHHHHHHHHHHHcC--CCCCeEEEEEECCCChhHHHHHHHHHHHCCCeEEEEeCC---CCCCCHHHHHHHHHHHHH
Confidence 444456667666542222 457777888777665544444555667899988653221 244566666555555442
Q ss_pred h----cCCCccEEEeccCCcccccCh
Q psy6303 95 E----SGWRWDFIVNLSESDYPVKTN 116 (270)
Q Consensus 95 ~----~~~~~dy~~lLSg~D~Plks~ 116 (270)
+ .+.++|+++.+-.+|.|=...
T Consensus 150 ~~e~~~G~~~d~vvi~DAD~~v~Pd~ 175 (504)
T PRK14716 150 AFERERGIRFAIIVLHDAEDVIHPLE 175 (504)
T ss_pred HhhhhcCCCcCEEEEEcCCCCcCccH
Confidence 2 234789999999888865433
No 11
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=72.14 E-value=73 Score=30.73 Aligned_cols=108 Identities=18% Similarity=0.066 Sum_probs=62.9
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|++...+.+.+.++ ++.+.|+-.++|--|.+.+.....+.+....++++.++... ..+|. - .++..+
T Consensus 83 ~yNE~~~i~~~l~sl---l~q~yp~~eIivVdDgs~D~t~~~~~~~~~~~~~v~vv~~~---~n~Gk--a----~AlN~g 150 (444)
T PRK14583 83 CFNEGLNARETIHAA---LAQTYTNIEVIAINDGSSDDTAQVLDALLAEDPRLRVIHLA---HNQGK--A----IALRMG 150 (444)
T ss_pred eCCCHHHHHHHHHHH---HcCCCCCeEEEEEECCCCccHHHHHHHHHHhCCCEEEEEeC---CCCCH--H----HHHHHH
Confidence 355556777777777 23246777777766666554433344555567888887521 23332 2 233444
Q ss_pred HhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCceeec
Q psy6303 94 LESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFVKS 134 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi~~ 134 (270)
++ ..+.||++.+-+++.|-.. ..++.+.+.++++..-+..
T Consensus 151 l~-~a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g 191 (444)
T PRK14583 151 AA-AARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTG 191 (444)
T ss_pred HH-hCCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEc
Confidence 43 2468999999999987533 3445555555555544443
No 12
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=71.20 E-value=59 Score=30.52 Aligned_cols=107 Identities=14% Similarity=0.102 Sum_probs=56.8
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCC-HHHHHHHHhhccCCC--eEEecccceeeeCchhHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQD-YLFRELLSLELRLPN--VRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~-~~~~~~~~~~~~~~n--V~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
+++.+.+++.+.++. +.+.|+-.+.| +|-.++ ....-+.++...+|+ |.++....+..|++ -+.+..++++
T Consensus 50 ~nee~~l~~~L~Sl~---~q~Yp~~EIiv-vdd~s~D~t~~iv~~~~~~~p~~~i~~v~~~~~~G~~~--K~~~l~~~~~ 123 (373)
T TIGR03472 50 HGDEPELYENLASFC---RQDYPGFQMLF-GVQDPDDPALAVVRRLRADFPDADIDLVIDARRHGPNR--KVSNLINMLP 123 (373)
T ss_pred CCCChhHHHHHHHHH---hcCCCCeEEEE-EeCCCCCcHHHHHHHHHHhCCCCceEEEECCCCCCCCh--HHHHHHHHHH
Confidence 444567777777772 22467767766 555444 333323345556776 55554333444543 2333333333
Q ss_pred HHHhcCCCccEEEeccCCcccccChHHHHHHHhc--CCCCceeec
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKTNRALVDFLSA--NRDKNFVKS 134 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~--~~g~~Fi~~ 134 (270)
. .+.||++.+-.++.| +.+-+.+..+. +++...+..
T Consensus 124 ~-----a~ge~i~~~DaD~~~--~p~~L~~lv~~~~~~~v~~V~~ 161 (373)
T TIGR03472 124 H-----ARHDILVIADSDISV--GPDYLRQVVAPLADPDVGLVTC 161 (373)
T ss_pred h-----ccCCEEEEECCCCCc--ChhHHHHHHHHhcCCCcceEec
Confidence 3 346888877777766 55555444322 345555554
No 13
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=69.10 E-value=73 Score=27.27 Aligned_cols=95 Identities=9% Similarity=-0.031 Sum_probs=56.7
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCC--CeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDK--NFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~--~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
++++.+.|.+.+.++.... .|+ ..++|..|...+.....+.+.... +|.++.... .+ +...|--.+++
T Consensus 37 ~~n~~~~l~~~l~si~~q~---~~~~~~eiivvdd~s~d~t~~~~~~~~~~--~v~~i~~~~---~~--g~~~a~n~gi~ 106 (251)
T cd06439 37 AYNEEAVIEAKLENLLALD---YPRDRLEIIVVSDGSTDGTAEIAREYADK--GVKLLRFPE---RR--GKAAALNRALA 106 (251)
T ss_pred cCCcHHHHHHHHHHHHhCc---CCCCcEEEEEEECCCCccHHHHHHHHhhC--cEEEEEcCC---CC--ChHHHHHHHHH
Confidence 4556678888888874322 233 578888888766333322332222 677764321 22 34556555666
Q ss_pred HHHhcCCCccEEEeccCCcccccChHHHHHHHhc
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKTNRALVDFLSA 125 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~ 125 (270)
.+ ..||++++.+.+.|- .+.+.+.++.
T Consensus 107 ~a-----~~d~i~~lD~D~~~~--~~~l~~l~~~ 133 (251)
T cd06439 107 LA-----TGEIVVFTDANALLD--PDALRLLVRH 133 (251)
T ss_pred Hc-----CCCEEEEEccccCcC--HHHHHHHHHH
Confidence 54 239999999999995 5656555544
No 14
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=68.12 E-value=1.1e+02 Score=28.88 Aligned_cols=114 Identities=8% Similarity=-0.014 Sum_probs=62.7
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCC-CHHHHHHHHhhccCC---CeEEecc-cceeeeCchhHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQ-DYLFRELLSLELRLP---NVRLSRR-RQATIWGGASLVTILLD 88 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks-~~~~~~~~~~~~~~~---nV~~~~~-r~~v~WGg~S~v~A~l~ 88 (270)
++++.+.+.+++.+|.. .+.|++.=.|-||-.| +.....+.++...+| +++++.. ..+-.|+|- ..|.-.
T Consensus 48 a~Ne~~~L~~~L~sL~~---q~yp~~~eIIVVDd~StD~T~~i~~~~~~~~~~~~~i~vi~~~~~~~g~~Gk--~~A~n~ 122 (384)
T TIGR03469 48 ARNEADVIGECVTSLLE---QDYPGKLHVILVDDHSTDGTADIARAAARAYGRGDRLTVVSGQPLPPGWSGK--LWAVSQ 122 (384)
T ss_pred cCCcHhHHHHHHHHHHh---CCCCCceEEEEEeCCCCCcHHHHHHHHHHhcCCCCcEEEecCCCCCCCCcch--HHHHHH
Confidence 34556888899888832 1244333344555544 433222233333444 7888863 234567653 345555
Q ss_pred HHHHHHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCceee
Q psy6303 89 SMQQLLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFVK 133 (270)
Q Consensus 89 ll~~al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi~ 133 (270)
+++.|-+...+.+|+..+-+++.|-.. .+++.+.++++ +...+.
T Consensus 123 g~~~A~~~~~~gd~llflDaD~~~~p~~l~~lv~~~~~~-~~~~vs 167 (384)
T TIGR03469 123 GIAAARTLAPPADYLLLTDADIAHGPDNLARLVARARAE-GLDLVS 167 (384)
T ss_pred HHHHHhccCCCCCEEEEECCCCCCChhHHHHHHHHHHhC-CCCEEE
Confidence 666665433347899888888876332 24555555542 344443
No 15
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=67.68 E-value=70 Score=26.52 Aligned_cols=98 Identities=15% Similarity=0.082 Sum_probs=50.6
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCC--eEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPN--VRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~n--V~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
|++.+.+.++|.+|... ..|+..++|=.|.+++.....+.++...+++ +.++....+. |......+ +..
T Consensus 10 ~n~~~~l~~~L~sl~~q---~~~~~eiivVdd~s~d~t~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~~----~n~ 80 (196)
T cd02520 10 CGVDPNLYENLESFFQQ---DYPKYEILFCVQDEDDPAIPVVRKLIAKYPNVDARLLIGGEKV--GINPKVNN----LIK 80 (196)
T ss_pred CCCCccHHHHHHHHHhc---cCCCeEEEEEeCCCcchHHHHHHHHHHHCCCCcEEEEecCCcC--CCCHhHHH----HHH
Confidence 44445677888777321 2456666665566655443333344445563 4455333233 33222222 223
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
+++ ....+|++++-+++.+ +.+.|.+.++
T Consensus 81 g~~-~a~~d~i~~~D~D~~~--~~~~l~~l~~ 109 (196)
T cd02520 81 GYE-EARYDILVISDSDISV--PPDYLRRMVA 109 (196)
T ss_pred HHH-hCCCCEEEEECCCceE--ChhHHHHHHH
Confidence 332 2346899888777764 5555555554
No 16
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=67.54 E-value=68 Score=26.27 Aligned_cols=105 Identities=10% Similarity=0.115 Sum_probs=57.2
Q ss_pred cCCH-HHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHH-HhhccCCCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 15 VKKQ-EDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELL-SLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 15 ~~~~-~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~-~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
+++. +.+++.+.+|.... .+...++|--|...+.....+. +.....+++.++....+ .+...|--.+++.
T Consensus 10 ~n~~~~~l~~~l~sl~~q~---~~~~eiivvd~gs~d~~~~~~~~~~~~~~~~~~~~~~~~~-----~g~~~a~n~g~~~ 81 (202)
T cd04184 10 YNTPEKYLREAIESVRAQT---YPNWELCIADDASTDPEVKRVLKKYAAQDPRIKVVFREEN-----GGISAATNSALEL 81 (202)
T ss_pred ccCcHHHHHHHHHHHHhCc---CCCeEEEEEeCCCCChHHHHHHHHHHhcCCCEEEEEcccC-----CCHHHHHHHHHHh
Confidence 3444 88888888883221 3444565555555553333322 23345677877643322 2334454455554
Q ss_pred HHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCcee
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFV 132 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi 132 (270)
| ..+|+..+...|.|-.. .+.+.+.++.+++...+
T Consensus 82 a-----~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v 117 (202)
T cd04184 82 A-----TGEFVALLDHDDELAPHALYEVVKALNEHPDADLI 117 (202)
T ss_pred h-----cCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEE
Confidence 4 34899999888877432 34555555445554444
No 17
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=65.64 E-value=79 Score=26.35 Aligned_cols=97 Identities=15% Similarity=0.107 Sum_probs=54.5
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|+.++.|.+++.++.... +.++..++|==|...+.....+.+..+..+++.++.... -+|.+ .|--.+++.|
T Consensus 5 ~yn~~~~l~~~l~sl~~q~--~~~~~eiiiVDd~S~d~t~~~~~~~~~~~~~i~~~~~~~---n~G~~--~a~n~g~~~a 77 (224)
T cd06442 5 TYNERENIPELIERLDAAL--KGIDYEIIVVDDNSPDGTAEIVRELAKEYPRVRLIVRPG---KRGLG--SAYIEGFKAA 77 (224)
T ss_pred ccchhhhHHHHHHHHHHhh--cCCCeEEEEEeCCCCCChHHHHHHHHHhCCceEEEecCC---CCChH--HHHHHHHHHc
Confidence 3556678889888884333 213444444444444433332334455677777764222 23433 3445566655
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
. -+|++.|.++|.+ +.+.+...++
T Consensus 78 ~-----gd~i~~lD~D~~~--~~~~l~~l~~ 101 (224)
T cd06442 78 R-----GDVIVVMDADLSH--PPEYIPELLE 101 (224)
T ss_pred C-----CCEEEEEECCCCC--CHHHHHHHHH
Confidence 3 2889999888876 4555655554
No 18
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=65.14 E-value=43 Score=28.35 Aligned_cols=91 Identities=15% Similarity=0.112 Sum_probs=57.0
Q ss_pred cCCH-HHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 15 VKKQ-EDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 15 ~~~~-~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+++. +.+.+.+.++ ... ..-.|=||-.++..-.+..+. ..+++.++.... +-| ...|--.+++.|
T Consensus 6 yn~~~~~l~~~l~sl------~~q-~~~iivvDn~s~~~~~~~~~~--~~~~i~~i~~~~--n~G---~~~a~N~g~~~a 71 (237)
T cd02526 6 YNPDLSKLKELLAAL------AEQ-VDKVVVVDNSSGNDIELRLRL--NSEKIELIHLGE--NLG---IAKALNIGIKAA 71 (237)
T ss_pred ecCCHHHHHHHHHHH------hcc-CCEEEEEeCCCCccHHHHhhc--cCCcEEEEECCC--cee---hHHhhhHHHHHH
Confidence 3444 8999999999 333 455666887765332222211 457888875332 223 444555566766
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHH
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFL 123 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl 123 (270)
.. .+++|++++.+++.+ +.+.+...+
T Consensus 72 ~~--~~~d~v~~lD~D~~~--~~~~l~~l~ 97 (237)
T cd02526 72 LE--NGADYVLLFDQDSVP--PPDMVEKLL 97 (237)
T ss_pred Hh--CCCCEEEEECCCCCc--CHhHHHHHH
Confidence 53 368999999999997 466666653
No 19
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=61.44 E-value=94 Score=25.80 Aligned_cols=98 Identities=15% Similarity=0.111 Sum_probs=54.0
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCC--CeEEEEecCCCCHHHHHHH-HhhccCCCeEEecccceeeeCchhHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDK--NFVKSHGRQRQDYLFRELL-SLELRLPNVRLSRRRQATIWGGASLVTILLDSM 90 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~--~~~~IHvD~ks~~~~~~~~-~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll 90 (270)
++++++.|++.|.+|.+. ..|+ ..++|--|.+++.....+. ......++|.++.... ..++| ...|--.++
T Consensus 5 ~~n~~~~l~~~l~sl~~q---~~~~~~~eiivvdd~s~d~t~~~~~~~~~~~~~~v~~~~~~~-~~~~g--~~~a~n~g~ 78 (229)
T cd04192 5 ARNEAENLPRLLQSLSAL---DYPKEKFEVILVDDHSTDGTVQILEFAAAKPNFQLKILNNSR-VSISG--KKNALTTAI 78 (229)
T ss_pred ecCcHHHHHHHHHHHHhC---CCCCCceEEEEEcCCCCcChHHHHHHHHhCCCcceEEeeccC-cccch--hHHHHHHHH
Confidence 355678888988888322 2344 5677776665543322222 2223456777765322 12222 223323333
Q ss_pred HHHHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 91 QQLLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 91 ~~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
+. ...+|++++.++|.| ..+.|.+.++
T Consensus 79 ~~-----~~~d~i~~~D~D~~~--~~~~l~~l~~ 105 (229)
T cd04192 79 KA-----AKGDWIVTTDADCVV--PSNWLLTFVA 105 (229)
T ss_pred HH-----hcCCEEEEECCCccc--CHHHHHHHHH
Confidence 33 246999999999977 3556665554
No 20
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=58.79 E-value=1e+02 Score=25.26 Aligned_cols=111 Identities=16% Similarity=0.262 Sum_probs=59.6
Q ss_pred EEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHH-HhhccCCCeEEecccceeeeCchhHH
Q psy6303 5 IVDTVHRRELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELL-SLELRLPNVRLSRRRQATIWGGASLV 83 (270)
Q Consensus 5 ~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~-~~~~~~~nV~~~~~r~~v~WGg~S~v 83 (270)
|++.+++++ +++.+.+.+.++..+- .++..+.|=.|.+++....++. +.....+ +.++....+ - +..
T Consensus 2 viip~~n~~---~~~~l~~~l~Sl~~q~---~~~~eiiivdd~ss~d~t~~~~~~~~~~~~-i~~i~~~~n--~---G~~ 69 (201)
T cd04195 2 VLMSVYIKE---KPEFLREALESILKQT---LPPDEVVLVKDGPVTQSLNEVLEEFKRKLP-LKVVPLEKN--R---GLG 69 (201)
T ss_pred EEEEccccc---hHHHHHHHHHHHHhcC---CCCcEEEEEECCCCchhHHHHHHHHHhcCC-eEEEEcCcc--c---cHH
Confidence 344555432 2578999999984322 3455666655666554334433 3333444 776643222 2 223
Q ss_pred HHHHHHHHHHHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCcee
Q psy6303 84 TILLDSMQQLLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFV 132 (270)
Q Consensus 84 ~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi 132 (270)
.|--.+++. .+.+|+..+.+.|.+... .+.+.++++++++...+
T Consensus 70 ~a~N~g~~~-----a~gd~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~~ 114 (201)
T cd04195 70 KALNEGLKH-----CTYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIV 114 (201)
T ss_pred HHHHHHHHh-----cCCCEEEEeCCccccCcHHHHHHHHHHHhCCCeEEE
Confidence 343333433 246899999999987643 23344566555544433
No 21
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=58.04 E-value=87 Score=24.31 Aligned_cols=104 Identities=17% Similarity=0.072 Sum_probs=56.9
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
++++++.+.+++.+|.+.. .+...++|--|...+.....+.+ ..+++.++... ... +...|--.+++.+
T Consensus 5 ~~~~~~~l~~~l~sl~~~~---~~~~~iiivdd~s~~~~~~~~~~---~~~~~~~~~~~--~~~---g~~~a~n~~~~~~ 73 (166)
T cd04186 5 NYNSLEYLKACLDSLLAQT---YPDFEVIVVDNASTDGSVELLRE---LFPEVRLIRNG--ENL---GFGAGNNQGIREA 73 (166)
T ss_pred ecCCHHHHHHHHHHHHhcc---CCCeEEEEEECCCCchHHHHHHH---hCCCeEEEecC--CCc---ChHHHhhHHHhhC
Confidence 3556788999999994322 25567777666555433332222 22356665322 222 3344444555544
Q ss_pred HhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCceee
Q psy6303 94 LESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFVK 133 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi~ 133 (270)
+.+|+..+...|.|-.. .+.+.+.+..+++...+.
T Consensus 74 -----~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~ 109 (166)
T cd04186 74 -----KGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVG 109 (166)
T ss_pred -----CCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEE
Confidence 57899999988887543 233333344444444443
No 22
>COG4746 Uncharacterized protein conserved in archaea [Function unknown]
Probab=55.59 E-value=9.1 Score=28.20 Aligned_cols=24 Identities=29% Similarity=0.560 Sum_probs=18.7
Q ss_pred ccCCcccccChHHHHHHHhcCCCC
Q psy6303 106 LSESDYPVKTNRALVDFLSANRDK 129 (270)
Q Consensus 106 LSg~D~Plks~~~i~~fl~~~~g~ 129 (270)
+-|.||||+|++++...|-.-.++
T Consensus 16 ~k~a~fPInn~~eL~~ALP~G~dt 39 (80)
T COG4746 16 LKGADFPINNPEELVAALPSGPDT 39 (80)
T ss_pred HccCCCCCCCHHHHHHhccCCCCC
Confidence 357899999999999988554333
No 23
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=55.51 E-value=1.2e+02 Score=25.00 Aligned_cols=97 Identities=21% Similarity=0.183 Sum_probs=51.1
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
++++++.|.+++.+|..+- .|...++|--|.+.+.....+.+.... .++.++... -+-|... |--.+++.|
T Consensus 5 ~~n~~~~l~~~l~sl~~q~---~~~~eiiivD~~s~d~t~~~~~~~~~~-~~i~~~~~~--~n~g~~~---~~n~~~~~a 75 (202)
T cd04185 5 TYNRLDLLKECLDALLAQT---RPPDHIIVIDNASTDGTAEWLTSLGDL-DNIVYLRLP--ENLGGAG---GFYEGVRRA 75 (202)
T ss_pred eeCCHHHHHHHHHHHHhcc---CCCceEEEEECCCCcchHHHHHHhcCC-CceEEEECc--cccchhh---HHHHHHHHH
Confidence 3566788999999993221 344455555444444332222333222 335665432 2334332 223344555
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHH
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFL 123 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl 123 (270)
. ..+.+|+..+...|.|- .+.+.+..
T Consensus 76 ~--~~~~d~v~~ld~D~~~~--~~~l~~l~ 101 (202)
T cd04185 76 Y--ELGYDWIWLMDDDAIPD--PDALEKLL 101 (202)
T ss_pred h--ccCCCEEEEeCCCCCcC--hHHHHHHH
Confidence 4 24679999998888775 33444443
No 24
>TIGR00762 DegV EDD domain protein, DegV family. This family of proteins is related to DegV of Bacillus subtilis and includes paralogous sets in several species (B. subtilis, Deinococcus radiodurans, Mycoplasma pneumoniae) that are closer in percent identity to each than to most homologs from other species. This suggests both recent paralogy and diversity of function. DegV itself is encoded immediately downstream of DegU, a transcriptional regulator of degradation, but is itself uncharacterized. Crystallography suggested a lipid-binding site, while comparison of the crystal structure to dihydroxyacetone kinase and to a mannose transporter EIIA domain suggests a conserved domain, EDD, with phosphotransferase activity.
Probab=53.46 E-value=1.4e+02 Score=26.88 Aligned_cols=93 Identities=11% Similarity=0.093 Sum_probs=54.3
Q ss_pred CHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCC--CeEEecccceeeeCchhHHHHHHHHHHHHH
Q psy6303 17 KQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLP--NVRLSRRRQATIWGGASLVTILLDSMQQLL 94 (270)
Q Consensus 17 ~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~--nV~~~~~r~~v~WGg~S~v~A~l~ll~~al 94 (270)
+++.+...+..+ +++ . +..++||+..+-+..+..+..+++.++ +|++++.+ ..+..+-...+.+.+.+-
T Consensus 63 s~~~~~~~~~~l---~~~-~-~~vi~i~iSs~lSgty~~a~~aa~~~~~~~i~ViDS~----~~s~~~g~~v~~a~~~~~ 133 (275)
T TIGR00762 63 SPGEFLELYEKL---LEE-G-DEVLSIHLSSGLSGTYQSARQAAEMVDEAKVTVIDSK----SASMGLGLLVLEAAKLAE 133 (275)
T ss_pred CHHHHHHHHHHH---HhC-C-CeEEEEEcCCchhHHHHHHHHHHhhCCCCCEEEECCh----HHHHHHHHHHHHHHHHHH
Confidence 344555544444 222 3 679999999998877776655555555 79888643 333333333333333332
Q ss_pred hcCCCccEEEeccCCcccccChHHHHHHHhcCCCCceeecc
Q psy6303 95 ESGWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVKSH 135 (270)
Q Consensus 95 ~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~~~ 135 (270)
.+ .+.++|.+.++....+..+-+.
T Consensus 134 -~G----------------~s~~eI~~~l~~~~~~~~~~f~ 157 (275)
T TIGR00762 134 -EG----------------KSLEEILAKLEELRERTKLYFV 157 (275)
T ss_pred -cC----------------CCHHHHHHHHHHHHhhcEEEEE
Confidence 22 2589999999885555444443
No 25
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=53.31 E-value=1.5e+02 Score=26.54 Aligned_cols=105 Identities=17% Similarity=0.135 Sum_probs=57.4
Q ss_pred EEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCC-CeEEEEecCCCC-HHHHHHHH--hhccCCCeEEecccceeeeCch
Q psy6303 5 IVDTVHRRELVKKQEDLKRTNRALVDFLSANRDK-NFVKSHGRQRQD-YLFRELLS--LELRLPNVRLSRRRQATIWGGA 80 (270)
Q Consensus 5 ~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~-~~~~IHvD~ks~-~~~~~~~~--~~~~~~nV~~~~~r~~v~WGg~ 80 (270)
|++.++++.. +.+.+.|.+|.+.. .+. ..=.|-||-.|+ .....+.+ .....++|+++....+ .
T Consensus 2 IIIp~~N~~~----~~l~~~l~Sl~~~~---~~~~~~EIIvVDd~S~d~t~~~~~~~~~~~~~~~v~vi~~~~n-----~ 69 (299)
T cd02510 2 VIIIFHNEAL----STLLRTVHSVINRT---PPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKR-----E 69 (299)
T ss_pred EEEEEecCcH----HHHHHHHHHHHhcC---chhcCCEEEEEECCCCchHHHHHHHHHHhhcCCcEEEEEcCCC-----C
Confidence 5556665332 88888888883211 122 235677777665 33222222 2235678999854322 2
Q ss_pred hHHHHHHHHHHHHHhcCCCccEEEeccCCccccc-ChHHHHHHHhcC
Q psy6303 81 SLVTILLDSMQQLLESGWRWDFIVNLSESDYPVK-TNRALVDFLSAN 126 (270)
Q Consensus 81 S~v~A~l~ll~~al~~~~~~dy~~lLSg~D~Plk-s~~~i~~fl~~~ 126 (270)
+...|--.+++.| .-+|++.|.+++.|-. -.+++.+.+..+
T Consensus 70 G~~~a~N~g~~~A-----~gd~i~fLD~D~~~~~~wL~~ll~~l~~~ 111 (299)
T cd02510 70 GLIRARIAGARAA-----TGDVLVFLDSHCEVNVGWLEPLLARIAEN 111 (299)
T ss_pred CHHHHHHHHHHHc-----cCCEEEEEeCCcccCccHHHHHHHHHHhC
Confidence 2334544555554 2489999999998731 123444445443
No 26
>PF07747 MTH865: MTH865-like family; InterPro: IPR024093 This entry represents a group of uncharacterised hypothetical proteins from archaea, including the 8.4 kDa protein MTH865 from Methanobacterium thermoautotrophicum. The NMR structure of MTH865 reveals an EF-Hand-like fold consisting of four helices in two hairpins [].; PDB: 1IIO_A.
Probab=51.19 E-value=8.7 Score=28.32 Aligned_cols=24 Identities=21% Similarity=0.444 Sum_probs=17.2
Q ss_pred ccCCcccccChHHHHHHHhcCCCC
Q psy6303 106 LSESDYPVKTNRALVDFLSANRDK 129 (270)
Q Consensus 106 LSg~D~Plks~~~i~~fl~~~~g~ 129 (270)
+.|.||||+|..|+..-|-.-.++
T Consensus 11 ~~~a~FPI~s~~eL~~alP~G~~t 34 (75)
T PF07747_consen 11 FKGADFPIKSPMELLPALPKGPST 34 (75)
T ss_dssp HTTSSSTTBHHHHHHHH-TTSS--
T ss_pred HhcCCCCCCCHHHHHHhCCCCCCC
Confidence 457899999999999988553333
No 27
>PF07521 RMMBL: RNA-metabolising metallo-beta-lactamase; InterPro: IPR011108 The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in IPR001279 from INTERPRO and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism [].; PDB: 3ZQ4_D 2I7T_A 2I7V_A 2YCB_B 3BK1_A 3T3N_A 3BK2_A 3T3O_A 3AF5_A 3AF6_A ....
Probab=48.09 E-value=20 Score=23.12 Aligned_cols=28 Identities=18% Similarity=0.195 Sum_probs=23.4
Q ss_pred hhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEec
Q psy6303 12 RELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGR 46 (270)
Q Consensus 12 ~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD 46 (270)
-.+|-+.++|..+++.+ .|.+.|.||=|
T Consensus 14 fSgHad~~~L~~~i~~~-------~p~~vilVHGe 41 (43)
T PF07521_consen 14 FSGHADREELLEFIEQL-------NPRKVILVHGE 41 (43)
T ss_dssp CSSS-BHHHHHHHHHHH-------CSSEEEEESSE
T ss_pred ecCCCCHHHHHHHHHhc-------CCCEEEEecCC
Confidence 34788999999999999 78999999955
No 28
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=47.19 E-value=84 Score=26.39 Aligned_cols=109 Identities=20% Similarity=0.229 Sum_probs=53.0
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCC--eEEecccceeeeCchhHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPN--VRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~n--V~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
++++++.+.+.+.++. +...|+-.++|=.|...+.....+.++...+++ |+++....+. |..+...|.-.+++
T Consensus 9 ~~~~~~~l~~~l~sl~---~~~~~~~~v~vvd~~~~~~~~~~~~~~~~~~~~~~v~vi~~~~~~--g~~~k~~a~n~~~~ 83 (228)
T PF13641_consen 9 AYNEDDVLRRCLESLL---AQDYPRLEVVVVDDGSDDETAEILRALAARYPRVRVRVIRRPRNP--GPGGKARALNEALA 83 (228)
T ss_dssp BSS-HHHHHHHHHHHT---TSHHHTEEEEEEEE-SSS-GCTTHHHHHHTTGG-GEEEEE----H--HHHHHHHHHHHHHH
T ss_pred ecCCHHHHHHHHHHHH---cCCCCCeEEEEEECCCChHHHHHHHHHHHHcCCCceEEeecCCCC--CcchHHHHHHHHHH
Confidence 5677788888888882 211266666664444443322223334445655 4666432111 22234444444444
Q ss_pred HHHhcCCCccEEEeccCCcccccC--hHHHHHHHhcCCCCceeec
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKT--NRALVDFLSANRDKNFVKS 134 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks--~~~i~~fl~~~~g~~Fi~~ 134 (270)
.+ +.+|++++.+++.| -+ .+++.+.| +.++...+..
T Consensus 84 ~~-----~~d~i~~lD~D~~~-~p~~l~~~~~~~-~~~~~~~v~~ 121 (228)
T PF13641_consen 84 AA-----RGDYILFLDDDTVL-DPDWLERLLAAF-ADPGVGAVGG 121 (228)
T ss_dssp H--------SEEEEE-SSEEE--CHHHHHHHHHH-HBSS--EEEE
T ss_pred hc-----CCCEEEEECCCcEE-CHHHHHHHHHHH-HhCCCCeEee
Confidence 43 37888888888888 33 33444455 4566666663
No 29
>cd02511 Beta4Glucosyltransferase UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide. UDP-glucose: lipooligosaccharide (LOS) beta-1-4-glucosyltransferase catalyzes the addition of the first residue, glucose, of the lacto-N-neotetrase structure to HepI of the LOS inner core. LOS is the major constituent of the outer leaflet of the outer membrane of gram-positive bacteria. It consists of a short oligosaccharide chain of variable composition (alpha chain) attached to a branched inner core which is lined in turn to lipid A. Beta 1,4 glucosyltransferase is required to attach the alpha chain to the inner core.
Probab=45.45 E-value=1.9e+02 Score=24.65 Aligned_cols=92 Identities=9% Similarity=0.118 Sum_probs=52.7
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCC-CeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDK-NFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~-~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
++++.+.|.+.|.+| . +. ..++| ||--|.+.-.++.+ ..++.++.. .|+|++-- --.+++.
T Consensus 8 ~~Ne~~~l~~~l~sl------~-~~~~eiiv-vD~gStD~t~~i~~----~~~~~v~~~----~~~g~~~~--~n~~~~~ 69 (229)
T cd02511 8 TKNEERNIERCLESV------K-WAVDEIIV-VDSGSTDRTVEIAK----EYGAKVYQR----WWDGFGAQ--RNFALEL 69 (229)
T ss_pred eCCcHHHHHHHHHHH------h-cccCEEEE-EeCCCCccHHHHHH----HcCCEEEEC----CCCChHHH--HHHHHHh
Confidence 455667899999998 2 23 34444 66655432223221 346666542 67776532 2223333
Q ss_pred HHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCC
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRD 128 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g 128 (270)
| ..+|+..|-+++.+-.. .+++.+.++.+++
T Consensus 70 a-----~~d~vl~lDaD~~~~~~~~~~l~~~~~~~~~ 101 (229)
T cd02511 70 A-----TNDWVLSLDADERLTPELADEILALLATDDY 101 (229)
T ss_pred C-----CCCEEEEEeCCcCcCHHHHHHHHHHHhCCCC
Confidence 2 34689999888886543 4567777766543
No 30
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=45.29 E-value=1.7e+02 Score=23.90 Aligned_cols=96 Identities=10% Similarity=0.035 Sum_probs=51.3
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCC-CeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLP-NVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~-nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
.|+.++.|.+.+.++.+.- .|+..++|=-|.+.+.....+.+....++ .+.++... ++.+...+.-.+++.
T Consensus 6 ~yn~~~~l~~~l~sl~~q~---~~~~eiiVvddgS~d~t~~~~~~~~~~~~~~~~~~~~~-----~~~G~~~~~n~g~~~ 77 (214)
T cd04196 6 TYNGEKYLREQLDSILAQT---YKNDELIISDDGSTDGTVEIIKEYIDKDPFIIILIRNG-----KNLGVARNFESLLQA 77 (214)
T ss_pred ecCcHHHHHHHHHHHHhCc---CCCeEEEEEeCCCCCCcHHHHHHHHhcCCceEEEEeCC-----CCccHHHHHHHHHHh
Confidence 4566788888888883221 23445555555555433332334444443 44444322 334444444444332
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
...+|+.+|...|.+.. +.|...++
T Consensus 78 -----~~g~~v~~ld~Dd~~~~--~~l~~~~~ 102 (214)
T cd04196 78 -----ADGDYVFFCDQDDIWLP--DKLERLLK 102 (214)
T ss_pred -----CCCCEEEEECCCcccCh--hHHHHHHH
Confidence 35789999988887753 44444433
No 31
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=45.09 E-value=1.4e+02 Score=22.82 Aligned_cols=94 Identities=19% Similarity=0.130 Sum_probs=52.4
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccC-CCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRL-PNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~-~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
++++++.|.+++.++.+.. .+...++|--|..++.....+.+..... +.+.++.. .-. +| ...|--.+++.
T Consensus 5 ~~n~~~~l~~~l~sl~~q~---~~~~~iivvdd~s~d~t~~~~~~~~~~~~~~~~~~~~--~~~-~g--~~~~~n~~~~~ 76 (180)
T cd06423 5 AYNEEAVIERTIESLLALD---YPKLEVIVVDDGSTDDTLEILEELAALYIRRVLVVRD--KEN-GG--KAGALNAGLRH 76 (180)
T ss_pred ccChHHHHHHHHHHHHhCC---CCceEEEEEeCCCccchHHHHHHHhccccceEEEEEe--ccc-CC--chHHHHHHHHh
Confidence 4667789999999984332 2466777766666543333333322222 33444321 112 22 33444444444
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHH
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDF 122 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~f 122 (270)
+ +.+|++++-+.+.+- ...|...
T Consensus 77 ~-----~~~~i~~~D~D~~~~--~~~l~~~ 99 (180)
T cd06423 77 A-----KGDIVVVLDADTILE--PDALKRL 99 (180)
T ss_pred c-----CCCEEEEECCCCCcC--hHHHHHH
Confidence 3 578999998888774 4555555
No 32
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=42.16 E-value=2.1e+02 Score=24.00 Aligned_cols=106 Identities=9% Similarity=0.078 Sum_probs=58.7
Q ss_pred EEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHH
Q psy6303 5 IVDTVHRRELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVT 84 (270)
Q Consensus 5 ~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~ 84 (270)
||+..|.+.. +.+.+.+.++.. .. ...++|=.|..++.....+ +.....+.+.++.. .++|. ..
T Consensus 4 VvIp~~ne~~----~~l~~~l~sl~~----q~-~~eiivvdd~s~d~~~~~l-~~~~~~~~~~v~~~----~~~g~--~~ 67 (235)
T cd06434 4 VIIPVYDEDP----DVFRECLRSILR----QK-PLEIIVVTDGDDEPYLSIL-SQTVKYGGIFVITV----PHPGK--RR 67 (235)
T ss_pred EEEeecCCCh----HHHHHHHHHHHh----CC-CCEEEEEeCCCChHHHHHH-HhhccCCcEEEEec----CCCCh--HH
Confidence 3445554331 889999998831 12 3455555555554332222 22335566666542 34553 34
Q ss_pred HHHHHHHHHHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCCCcee
Q psy6303 85 ILLDSMQQLLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRDKNFV 132 (270)
Q Consensus 85 A~l~ll~~al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g~~Fi 132 (270)
|--.+++.| +.||++.+-+.+.|-.. .+.+.+.+. +++..-+
T Consensus 68 a~n~g~~~a-----~~d~v~~lD~D~~~~~~~l~~l~~~~~-~~~v~~v 110 (235)
T cd06434 68 ALAEGIRHV-----TTDIVVLLDSDTVWPPNALPEMLKPFE-DPKVGGV 110 (235)
T ss_pred HHHHHHHHh-----CCCEEEEECCCceeChhHHHHHHHhcc-CCCEeEE
Confidence 444455543 57999999999998855 445555554 3443333
No 33
>COG4746 Uncharacterized protein conserved in archaea [Function unknown]
Probab=41.81 E-value=20 Score=26.45 Aligned_cols=38 Identities=18% Similarity=0.334 Sum_probs=27.7
Q ss_pred cceeeeCchhHHHHHHHHHHHHHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 72 RQATIWGGASLVTILLDSMQQLLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 72 r~~v~WGg~S~v~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
...+.-||.+.-.|++. -+||..|||.|+.+++.+-+-
T Consensus 38 dttc~~G~~e~tA~E~~---------------kLlT~~DFPfk~a~~vad~iv 75 (80)
T COG4746 38 DTTCESGGVEVTAAEAG---------------KLLTDADFPFKSAEQVADTIV 75 (80)
T ss_pred CCCccCCCeeeeHHHHH---------------hhccccCCCCCCHHHHHHHHH
Confidence 34566778776666543 256788999999999988654
No 34
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=40.53 E-value=2.1e+02 Score=23.70 Aligned_cols=98 Identities=13% Similarity=0.033 Sum_probs=51.8
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcC-CCCeEEEEecCCC-CHHHHHHHHhhccCCC-eEEecccceeeeCchhHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANR-DKNFVKSHGRQRQ-DYLFRELLSLELRLPN-VRLSRRRQATIWGGASLVTILLDSM 90 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~-p~~~~~IHvD~ks-~~~~~~~~~~~~~~~n-V~~~~~r~~v~WGg~S~v~A~l~ll 90 (270)
+|+..+.+.+.+.++.+.+.... ++..++ -||-.| +.....+.+.....+. |+++....+ .|. -.|-..++
T Consensus 5 ~yN~~~~l~~~l~~l~~~~~~~~~~~~eii-vvdd~S~D~t~~~~~~~~~~~~~~i~~i~~~~n---~G~--~~a~~~g~ 78 (211)
T cd04188 5 AYNEEKRLPPTLEEAVEYLEERPSFSYEII-VVDDGSKDGTAEVARKLARKNPALIRVLTLPKN---RGK--GGAVRAGM 78 (211)
T ss_pred ccChHHHHHHHHHHHHHHHhccCCCCEEEE-EEeCCCCCchHHHHHHHHHhCCCcEEEEEcccC---CCc--HHHHHHHH
Confidence 34556777777777755542101 344444 455554 4332223334344554 356543222 233 34555666
Q ss_pred HHHHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 91 QQLLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 91 ~~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
+.|. -+|++++.+.+.+- .+.|.+.++
T Consensus 79 ~~a~-----gd~i~~ld~D~~~~--~~~l~~l~~ 105 (211)
T cd04188 79 LAAR-----GDYILFADADLATP--FEELEKLEE 105 (211)
T ss_pred HHhc-----CCEEEEEeCCCCCC--HHHHHHHHH
Confidence 6663 28999999888643 444555444
No 35
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=40.14 E-value=2.8e+02 Score=25.01 Aligned_cols=97 Identities=23% Similarity=0.180 Sum_probs=54.6
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCC-HHHHHHHHhhcc-CCCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQD-YLFRELLSLELR-LPNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~-~~~~~~~~~~~~-~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
|+..+.+..-+.+|. .........|=+|-.+. .....+. .. +++|.+++...+..|||-.. .+++.
T Consensus 12 yn~~~~l~~~l~~l~----~~~~~~~~iv~vDn~s~d~~~~~~~---~~~~~~v~~i~~~~NlG~agg~n-----~g~~~ 79 (305)
T COG1216 12 YNRGEDLVECLASLA----AQTYPDDVIVVVDNGSTDGSLEALK---ARFFPNVRLIENGENLGFAGGFN-----RGIKY 79 (305)
T ss_pred cCCHHHHHHHHHHHh----cCCCCCcEEEEccCCCCCCCHHHHH---hhcCCcEEEEEcCCCccchhhhh-----HHHHH
Confidence 455666667666662 11222223335676554 2222222 13 79999999999999887555 67888
Q ss_pred HHhcCCCccEEEeccCCcccc--cChHHHHHHHhcC
Q psy6303 93 LLESGWRWDFIVNLSESDYPV--KTNRALVDFLSAN 126 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Pl--ks~~~i~~fl~~~ 126 (270)
|++.+ .+ ++++-..|-++ -..+++.+.++.+
T Consensus 80 a~~~~--~~-~~l~LN~D~~~~~~~l~~ll~~~~~~ 112 (305)
T COG1216 80 ALAKG--DD-YVLLLNPDTVVEPDLLEELLKAAEED 112 (305)
T ss_pred HhcCC--Cc-EEEEEcCCeeeChhHHHHHHHHHHhC
Confidence 88532 22 55555666333 2334555555543
No 36
>PF11051 Mannosyl_trans3: Mannosyltransferase putative; InterPro: IPR022751 Alpha-mannosyltransferase is responsible for the addition of residues to the outer chain of core N-linked polysaccharides and to O-linked mannotriose. It is implicated in late Golgi modifications [][][]. The proteins matching this entry are conserved in fungi and also found in some phototrophic organisms.; GO: 0006486 protein glycosylation
Probab=36.26 E-value=2e+02 Score=25.88 Aligned_cols=105 Identities=18% Similarity=0.093 Sum_probs=53.8
Q ss_pred CCeEEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCC--CHHHHH-HHHhhccCCCeEEecc--cceee
Q psy6303 2 GPHIVDTVHRRELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQ--DYLFRE-LLSLELRLPNVRLSRR--RQATI 76 (270)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks--~~~~~~-~~~~~~~~~nV~~~~~--r~~v~ 76 (270)
|=+||+.+....+ .+..++|+.| +...-.=-|-|.+.... +....+ +. ...+|.++.- .+.-.
T Consensus 1 grGIVi~~g~~~~----~~a~~lI~~L----R~~g~~LPIEI~~~~~~dl~~~~~~~l~----~~q~v~~vd~~~~~~~~ 68 (271)
T PF11051_consen 1 GRGIVITAGDKYL----WLALRLIRVL----RRLGNTLPIEIIYPGDDDLSKEFCEKLL----PDQDVWFVDASCVIDPD 68 (271)
T ss_pred CCEEEEEecCccH----HHHHHHHHHH----HHhCCCCCEEEEeCCccccCHHHHHHHh----hhhhhheecceEEeecc
Confidence 5589999988554 6666777777 21122334555554433 223322 22 1334443321 00001
Q ss_pred eCchhHH--HHHHHHHHHHHhcCCCccEEEeccCCcccccChHHHHH
Q psy6303 77 WGGASLV--TILLDSMQQLLESGWRWDFIVNLSESDYPVKTNRALVD 121 (270)
Q Consensus 77 WGg~S~v--~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks~~~i~~ 121 (270)
+-+.+.. .-.+..+..+. ..++-+++|-.+.+|+++++.+-+
T Consensus 69 ~~~~~~~~~~~~~K~lA~l~---ssFeevllLDaD~vpl~~p~~lF~ 112 (271)
T PF11051_consen 69 YLGKSFSKKGFQNKWLALLF---SSFEEVLLLDADNVPLVDPEKLFE 112 (271)
T ss_pred ccccccccCCchhhhhhhhh---CCcceEEEEcCCcccccCHHHHhc
Confidence 1111011 11223333333 468999999999999999887644
No 37
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=35.76 E-value=2.7e+02 Score=23.43 Aligned_cols=104 Identities=12% Similarity=0.002 Sum_probs=54.3
Q ss_pred EEEEEechhhcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHH-HHHHHhhcc-CCCeEEecccceeeeCchhH
Q psy6303 5 IVDTVHRRELVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLF-RELLSLELR-LPNVRLSRRRQATIWGGASL 82 (270)
Q Consensus 5 ~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~-~~~~~~~~~-~~nV~~~~~r~~v~WGg~S~ 82 (270)
|++-+|++. ++.|.+++.+|. +...|+..++|=-|.+.+... ..+.++... .+++.++... -..|+ .
T Consensus 2 iiip~~ne~----~~~l~~~l~sl~---~q~~~~~eiiVvdd~s~D~t~~~~i~~~~~~~~~~i~~i~~~--~~~G~--~ 70 (236)
T cd06435 2 IHVPCYEEP----PEMVKETLDSLA---ALDYPNFEVIVIDNNTKDEALWKPVEAHCAQLGERFRFFHVE--PLPGA--K 70 (236)
T ss_pred eeEeeCCCc----HHHHHHHHHHHH---hCCCCCcEEEEEeCCCCchhHHHHHHHHHHHhCCcEEEEEcC--CCCCC--c
Confidence 455566644 377888888882 212455566555455555433 222233223 2467666422 23454 1
Q ss_pred HHHHHHHHHHHHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 83 VTILLDSMQQLLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 83 v~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
..|.-.+++.+. .++||++.+-..+.+ +.+.|.+.++
T Consensus 71 ~~a~n~g~~~a~---~~~d~i~~lD~D~~~--~~~~l~~l~~ 107 (236)
T cd06435 71 AGALNYALERTA---PDAEIIAVIDADYQV--EPDWLKRLVP 107 (236)
T ss_pred hHHHHHHHHhcC---CCCCEEEEEcCCCCc--CHHHHHHHHH
Confidence 223334444432 347888888777754 5566665543
No 38
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=32.75 E-value=2.9e+02 Score=22.95 Aligned_cols=92 Identities=16% Similarity=0.018 Sum_probs=46.3
Q ss_pred HHHHHHHHHHHHHhhhhcCCC--CeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHHh
Q psy6303 18 QEDLKRTNRALVDFLSANRDK--NFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLLE 95 (270)
Q Consensus 18 ~~ql~rLi~~l~~~l~~~~p~--~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al~ 95 (270)
++.+++.+.++.. ...|+ -.++| ||-.+++...++.+......++.++. ....+|+-.- + +..+++
T Consensus 14 ~~~l~~~l~sl~~---q~~~~~~~eiiv-vdd~s~d~t~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~----~n~~~~ 81 (234)
T cd06421 14 LEIVRKTLRAALA---IDYPHDKLRVYV-LDDGRRPELRALAAELGVEYGYRYLT--RPDNRHAKAG--N----LNNALA 81 (234)
T ss_pred HHHHHHHHHHHHh---cCCCcccEEEEE-EcCCCchhHHHHHHHhhcccCceEEE--eCCCCCCcHH--H----HHHHHH
Confidence 3678888888832 12344 35555 55545443333332121222445543 2345554221 1 223332
Q ss_pred cCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 96 SGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 96 ~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
. ...+|++.+...|.|- .+.+...++
T Consensus 82 ~-a~~d~i~~lD~D~~~~--~~~l~~l~~ 107 (234)
T cd06421 82 H-TTGDFVAILDADHVPT--PDFLRRTLG 107 (234)
T ss_pred h-CCCCEEEEEccccCcC--ccHHHHHHH
Confidence 1 2579999999999883 345554443
No 39
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=32.70 E-value=3.2e+02 Score=23.43 Aligned_cols=98 Identities=13% Similarity=0.089 Sum_probs=51.1
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCC--CeEEecccceeeeCchhHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLP--NVRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~--nV~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
.|++.+.+..++..+.+.+.. .++..++|=-|.+.+.....+.++...++ +|.++... +..+.-.|-..+++
T Consensus 17 ~yne~~~l~~~l~~l~~~~~~-~~~~eiivvDdgS~D~t~~i~~~~~~~~~~~~v~~~~~~-----~n~G~~~a~n~g~~ 90 (243)
T PLN02726 17 TYNERLNIALIVYLIFKALQD-VKDFEIIVVDDGSPDGTQDVVKQLQKVYGEDRILLRPRP-----GKLGLGTAYIHGLK 90 (243)
T ss_pred cCCchhhHHHHHHHHHHHhcc-CCCeEEEEEeCCCCCCHHHHHHHHHHhcCCCcEEEEecC-----CCCCHHHHHHHHHH
Confidence 466777888877777554431 23445555445544433222233333343 56555321 11222234444444
Q ss_pred HHHhcCCCccEEEeccCCcccccChHHHHHHHh
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKTNRALVDFLS 124 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~ 124 (270)
.| .-+|++.+.++|.+ +.+.|.+.++
T Consensus 91 ~a-----~g~~i~~lD~D~~~--~~~~l~~l~~ 116 (243)
T PLN02726 91 HA-----SGDFVVIMDADLSH--HPKYLPSFIK 116 (243)
T ss_pred Hc-----CCCEEEEEcCCCCC--CHHHHHHHHH
Confidence 43 34799999888874 5555655554
No 40
>COG1307 DegV Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.43 E-value=4.1e+02 Score=24.28 Aligned_cols=95 Identities=11% Similarity=0.058 Sum_probs=54.3
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccC--CCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRL--PNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~--~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
.-.++.+.+++..+ .....++.++||++++-+..|+......+-. .+|++++.+ -.+.++-...+.+.+.
T Consensus 63 qPs~~~~~~~~~~l----~~~g~~~vi~i~iSs~LSgty~~a~~a~~~~~~~~v~viDS~----~~s~~~g~~v~~a~~l 134 (282)
T COG1307 63 QPSPGEFEELFEKL----LQKGYDEVISIHISSGLSGTYQSAQLAAELVEGAKVHVIDSK----SVSMGLGFLVLEAAEL 134 (282)
T ss_pred CcCHHHHHHHHHHH----HhCCCcEEEEEEcCCCccHHHHHHHHHHHhccCceEEEEcCc----chhhHHHHHHHHHHHH
Confidence 34566677777666 2112348999999999876665422112222 479998643 2233333334444444
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHHHhcCCCCceeec
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVKS 134 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~~ 134 (270)
+- .+ ++.++|.+.+++...+...-+
T Consensus 135 ~~-~G----------------~s~~ei~~~l~~~~~~t~~~~ 159 (282)
T COG1307 135 AK-AG----------------KSFEEILKKLEEIREKTKAYF 159 (282)
T ss_pred HH-cC----------------CCHHHHHHHHHHHHhhcEEEE
Confidence 43 33 257889999988555544433
No 41
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=30.46 E-value=3.4e+02 Score=23.10 Aligned_cols=99 Identities=12% Similarity=-0.024 Sum_probs=48.0
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCC-CeEEEEecCCCCHHHHHHHHhhcc--CCCeEEecccceeeeCchhHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDK-NFVKSHGRQRQDYLFRELLSLELR--LPNVRLSRRRQATIWGGASLVTILLDSM 90 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~-~~~~IHvD~ks~~~~~~~~~~~~~--~~nV~~~~~r~~v~WGg~S~v~A~l~ll 90 (270)
++++.+.+.+++.++.. ...|. ..=.|-||-.+++.-.++.+.... ..+|..+++ ....|.+ .|--.++
T Consensus 9 ~~Ne~~~l~~~l~sl~~---~~y~~~~~eiivVdd~s~d~t~~i~~~~~~~~~~~i~~~~~---~~~~G~~--~a~n~g~ 80 (241)
T cd06427 9 LYKEAEVLPQLIASLSA---LDYPRSKLDVKLLLEEDDEETIAAARALRLPSIFRVVVVPP---SQPRTKP--KACNYAL 80 (241)
T ss_pred cCCcHHHHHHHHHHHHh---CcCCcccEEEEEEECCCCchHHHHHHHhccCCCeeEEEecC---CCCCchH--HHHHHHH
Confidence 35667888888888832 12332 233444444444332332221111 123444332 1223332 3333344
Q ss_pred HHHHhcCCCccEEEeccCCcccccC-hHHHHHHHhc
Q psy6303 91 QQLLESGWRWDFIVNLSESDYPVKT-NRALVDFLSA 125 (270)
Q Consensus 91 ~~al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~ 125 (270)
+. ...+|++.+.++|.+-.. .+++.++|.+
T Consensus 81 ~~-----a~gd~i~~~DaD~~~~~~~l~~~~~~~~~ 111 (241)
T cd06427 81 AF-----ARGEYVVIYDAEDAPDPDQLKKAVAAFAR 111 (241)
T ss_pred Hh-----cCCCEEEEEcCCCCCChHHHHHHHHHHHh
Confidence 43 345899999888875533 3355556654
No 42
>PRK15489 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=29.18 E-value=5.1e+02 Score=27.11 Aligned_cols=113 Identities=11% Similarity=0.094 Sum_probs=62.0
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
+|++.+.+.+++.++...| ++|+-.+++=.+......-..+.++...+|+++++....+ |.-+--.|-=..++.+
T Consensus 79 a~nE~~VI~~~v~~ll~~l--dYp~~~I~v~~~~nD~~T~~~~~~~~~~~p~~~~v~~~~~---gp~gKa~ALN~~l~~~ 153 (703)
T PRK15489 79 AWKEYDVIAKMIENMLATL--DYRRYVIFVGTYPNDAETITEVERMRRRYKRLVRVEVPHD---GPTCKADCLNWIIQAI 153 (703)
T ss_pred CCCcHHHHHHHHHHHHhcC--CCCCeEEEEEecCCCccHHHHHHHHhccCCcEEEEEcCCC---CCCCHHHHHHHHHHHH
Confidence 4666678888888753333 6787666654333332222333445556788887642211 3234444433334433
Q ss_pred Hh----cCCCccEEEeccCCcccccChHHHHHHHhcCCCCceee
Q psy6303 94 LE----SGWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVK 133 (270)
Q Consensus 94 l~----~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~ 133 (270)
.+ .+..++.+++.-+.|.|=...-...+++. .+..++.
T Consensus 154 ~~~e~~~~~~fa~vvi~DAEd~~~P~~L~~~~~~~--~~~~~iQ 195 (703)
T PRK15489 154 FRYEAGHGIEFAGVILHDSEDVLHPLELKYFNYLL--PRKDLVQ 195 (703)
T ss_pred HhhhhhccCccceEEEEcCCCCCChhHHHHHHhhc--CCcceee
Confidence 22 24567889999999998655444334433 2345666
No 43
>TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing. This family of enzymes catalyzes the combined epimerization and UDP-hydrolysis of UDP-N-acetylglucosamine to N-acetylmannosamine. This is in contrast to the related enzyme WecB (TIGR00236) which retains the UDP moiety. NeuC acts in concert with NeuA and NeuB to synthesize CMP-N5-acetyl-neuraminate.
Probab=27.88 E-value=2.2e+02 Score=26.78 Aligned_cols=63 Identities=21% Similarity=0.264 Sum_probs=35.7
Q ss_pred CeEEEEEechh--hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEE--ecCCCCHHHHHHHHhhccCCCeEEec
Q psy6303 3 PHIVDTVHRRE--LVKKQEDLKRTNRALVDFLSANRDKNFVKSH--GRQRQDYLFRELLSLELRLPNVRLSR 70 (270)
Q Consensus 3 ~~~~~~~~~~~--~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IH--vD~ks~~~~~~~~~~~~~~~nV~~~~ 70 (270)
|-++.++|... -....+++..++++|.+. ..+.+++| .|.......+.+.+.....+||++++
T Consensus 202 ~~vlvt~Hp~~~~~~~~~~~l~~li~~L~~~-----~~~~~vi~P~~~p~~~~i~~~i~~~~~~~~~v~l~~ 268 (365)
T TIGR03568 202 PYALVTFHPVTLEKESAEEQIKELLKALDEL-----NKNYIFTYPNADAGSRIINEAIEEYVNEHPNFRLFK 268 (365)
T ss_pred CEEEEEeCCCcccccCchHHHHHHHHHHHHh-----ccCCEEEEeCCCCCchHHHHHHHHHhcCCCCEEEEC
Confidence 56678889865 345568888888888321 12346676 33322222222333322267898886
No 44
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=27.64 E-value=4.6e+02 Score=24.46 Aligned_cols=100 Identities=20% Similarity=0.142 Sum_probs=59.8
Q ss_pred cCCHH-HHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccC-CCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 15 VKKQE-DLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRL-PNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 15 ~~~~~-ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~-~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
|++++ -+++.+.++ ++.+.|+.++++=.|..++...+.+.+...+. +++.++.. -....+-.+.+..
T Consensus 63 ynE~~~~~~~~l~s~---~~~dyp~~evivv~d~~~d~~~~~~~~~~~~~~~~~~~~~~--------~~~~~gK~~al~~ 131 (439)
T COG1215 63 YNEEPEVLEETLESL---LSQDYPRYEVIVVDDGSTDETYEILEELGAEYGPNFRVIYP--------EKKNGGKAGALNN 131 (439)
T ss_pred CCCchhhHHHHHHHH---HhCCCCCceEEEECCCCChhHHHHHHHHHhhcCcceEEEec--------cccCccchHHHHH
Confidence 44444 888888888 23367888899988877776555555555555 57777510 1122333444555
Q ss_pred HHhcCCCccEEEeccCCcccccC-hHHHHHHHhcC
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKT-NRALVDFLSAN 126 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~ 126 (270)
++. ..+.|++..+-++..|=+. ..++...|+..
T Consensus 132 ~l~-~~~~d~V~~~DaD~~~~~d~l~~~~~~f~~~ 165 (439)
T COG1215 132 GLK-RAKGDVVVILDADTVPEPDALRELVSPFEDP 165 (439)
T ss_pred HHh-hcCCCEEEEEcCCCCCChhHHHHHHhhhcCC
Confidence 554 2347888777777766543 34555666553
No 45
>PF01361 Tautomerase: Tautomerase enzyme; InterPro: IPR004370 4-Oxalocrotonate tautomerase (4-OT) catalyzes the isomerisation of beta,gamma-unsaturated enones to their alpha,beta-isomers. The enzyme is part of a plasmid-encoded pathway, which enables bacteria harbouring the plasmid to use various aromatic hydrocarbons as their sole sources of carbon and energy. The enzyme is a barrel-shaped hexamer, which can be viewed as a trimer of dimers. The hexamer contains a hydrophobic core formed by three beta-sheets and surrounded by three pairs of alpha-helices. Each 4-OT monomer of 62 amino acids has a relatively simple beta-alpha-beta fold as described by the structure of the enzyme from Pseudomonas putida []. The monomer begins with a conserved proline at the start of a beta-strand, followed by an alpha-helix and a 310 helix preceding a second parallel beta-strand, and ends with a beta-hairpin near the C terminus. The dimer results from antiparallel interactions between the beta-sheets and alpha-helices of the two monomers, forming a four-stranded beta-sheet with antiparallel alpha-helices on one side, creating two active sites, one at each end of the beta-sheet. Three dimers further associate to form a hexamer by the interactions of the strands of the C-terminal beta-hairpin loops with the edges of the four-stranded beta-sheets of neighbouring dimers, creating a series of cross-links that stabilise the hexamer Pro-1 of the mature protein functions as the general base while Arg-39 and an ordered water molecule each provide a hydrogen bond to the C-2 oxygen of substrate. Arg-39 plays an additional role in the binding of the C-1 carboxylate group. Arg-11 participates both in substrate binding and in catalysis. It interacts with the C-6 carboxylate group, thereby holding the substrate in place and drawing electron density to the C-5 position. The hydrophobic nature of the active site, which lowers the pKa of Pro-1 and provides a favourable environment for catalysis, is largely maintained by Phe-50. Because several Arg residues located near the active site are not conserved among all members of this family and because of the presence of fairly distantly related paralogs in Campylobacter jejuni, the family is regarded as not necessarily uniform in function.; GO: 0016853 isomerase activity, 0006725 cellular aromatic compound metabolic process; PDB: 4OTA_H 4OTC_G 4OTB_J 2FM7_A 1BJP_B 1S0Y_K 3EJ9_E 3EJ7_K 3EJ3_I 3MB2_A ....
Probab=27.09 E-value=2e+02 Score=19.21 Aligned_cols=42 Identities=12% Similarity=0.071 Sum_probs=26.3
Q ss_pred CeEEEEEechhhcCCHHHHHHHHHHHHHhhhh--cCCCCeEEEEecC
Q psy6303 3 PHIVDTVHRRELVKKQEDLKRTNRALVDFLSA--NRDKNFVKSHGRQ 47 (270)
Q Consensus 3 ~~~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~--~~p~~~~~IHvD~ 47 (270)
|+|-+-+.. .- +.+|-.+|+..+++.+.+ .-|...++|.++-
T Consensus 1 P~I~i~~~~--g~-~~e~K~~l~~~it~~~~~~lg~~~~~i~V~i~E 44 (60)
T PF01361_consen 1 PFITIKIPE--GR-TAEQKRELAEAITDAVVEVLGIPPERISVVIEE 44 (60)
T ss_dssp -EEEEEEES--TS--HHHHHHHHHHHHHHHHHHHTS-GGGEEEEEEE
T ss_pred CEEEEEECC--CC-CHHHHHHHHHHHHHHHHHHhCcCCCeEEEEEEE
Confidence 555555554 33 789999999999887642 3466666666653
No 46
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=26.48 E-value=5.8e+02 Score=24.47 Aligned_cols=102 Identities=11% Similarity=0.131 Sum_probs=54.4
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeE-EEEecCCCC-HHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFV-KSHGRQRQD-YLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQ 91 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~-~IHvD~ks~-~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~ 91 (270)
+|++.+.+.++++++.+. +.|...+ .|=||-.|. .....+.+..+.++++.+..... . +|. -.|--.+++
T Consensus 57 ~yNe~~~l~~~l~sl~~q---~yp~~~~eIiVVDd~StD~T~~il~~~~~~~~~v~v~~~~~--~-~Gk--a~AlN~gl~ 128 (439)
T TIGR03111 57 VYNSEDTLFNCIESIYNQ---TYPIELIDIILANNQSTDDSFQVFCRAQNEFPGLSLRYMNS--D-QGK--AKALNAAIY 128 (439)
T ss_pred eCCChHHHHHHHHHHHhc---CCCCCCeEEEEEECCCChhHHHHHHHHHHhCCCeEEEEeCC--C-CCH--HHHHHHHHH
Confidence 466678888888888321 3444322 344555554 33332334455678877743211 1 343 222223333
Q ss_pred HHHhcCCCccEEEeccCCcccccC-hHHHHHHHhcCCC
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVKT-NRALVDFLSANRD 128 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plks-~~~i~~fl~~~~g 128 (270)
. .+-+|++.+-+++.|-.. .+++.+.|+++++
T Consensus 129 ~-----s~g~~v~~~DaD~~~~~d~L~~l~~~f~~~~~ 161 (439)
T TIGR03111 129 N-----SIGKYIIHIDSDGKLHKDAIKNMVTRFENNPD 161 (439)
T ss_pred H-----ccCCEEEEECCCCCcChHHHHHHHHHHHhCCC
Confidence 3 235789999888888432 2455566665443
No 47
>cd00491 4Oxalocrotonate_Tautomerase 4-Oxalocrotonate Tautomerase: Catalyzes the isomerization of unsaturated ketones. The structure is a homohexamer that is arranged as a trimer of dimers. The hexamer contains six active sites, each formed by residues from three monomers, two from one dimer and the third from a neighboring monomer. Each monomer is a beta-alpha-beta fold with two small beta strands at the C-terminus that fold back on themselves. A pair of monomers form a dimer with two-fold symmetry, consisting of a 4-stranded beta sheet with two helices on one side and two additional small beta strands at each end. The dimers are assembled around a 3-fold axis of rotation to form a hexamer, with the short beta strands from each dimer contacting the neighboring dimers.
Probab=26.47 E-value=1.9e+02 Score=18.89 Aligned_cols=43 Identities=7% Similarity=0.025 Sum_probs=29.8
Q ss_pred CeEEEEEechhhcCCHHHHHHHHHHHHHhhhh--cCCCCeEEEEecCC
Q psy6303 3 PHIVDTVHRRELVKKQEDLKRTNRALVDFLSA--NRDKNFVKSHGRQR 48 (270)
Q Consensus 3 ~~~~~~~~~~~~~~~~~ql~rLi~~l~~~l~~--~~p~~~~~IHvD~k 48 (270)
|.|-+-+.. . .+++|-.+|.++|++-+.. .-|.+.++|.|+--
T Consensus 1 P~i~i~~~~--g-rt~eqk~~l~~~i~~~l~~~~g~~~~~v~V~i~e~ 45 (58)
T cd00491 1 PFVQIYILE--G-RTDEQKRELIERVTEAVSEILGAPEATIVVIIDEM 45 (58)
T ss_pred CEEEEEEcC--C-CCHHHHHHHHHHHHHHHHHHhCcCcccEEEEEEEe
Confidence 555555554 3 2589999999999887753 34777777777653
No 48
>PF07894 DUF1669: Protein of unknown function (DUF1669); InterPro: IPR012461 This family is composed of sequences derived from hypothetical eukaryotic proteins of unknown function. Some members of this family are annotated as being potential phospholipases but no literature was found to support this.
Probab=25.35 E-value=2.8e+02 Score=25.61 Aligned_cols=37 Identities=14% Similarity=-0.050 Sum_probs=22.8
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFREL 56 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~ 56 (270)
+++|.+-|..|+.|- ..-+.-+||-+|...-..|-+|
T Consensus 156 ~FTD~dIf~DLleAa------~kR~VpVYiLLD~~~~~~Fl~M 192 (284)
T PF07894_consen 156 VFTDVDIFCDLLEAA------NKRGVPVYILLDEQNLPHFLEM 192 (284)
T ss_pred ccccHHHHHHHHHHH------HhcCCcEEEEechhcChHHHHH
Confidence 666666666666666 4556666666666655555554
No 49
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=25.15 E-value=5.4e+02 Score=23.65 Aligned_cols=100 Identities=9% Similarity=0.032 Sum_probs=55.3
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhcc-CCCeEEecccceeeeCchhHHHHHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELR-LPNVRLSRRRQATIWGGASLVTILLDSMQQ 92 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~-~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~ 92 (270)
+|++.+.+..++.++.+.++.-.++..++|=-|.+.+.....+.+.... .++|..+... .+++.-.|-..+++.
T Consensus 14 ~yNE~~~i~~~l~~l~~~~~~~~~~~EIIvVDDgS~D~T~~il~~~~~~~~~~v~~i~~~-----~n~G~~~A~~~G~~~ 88 (325)
T PRK10714 14 VYNEQESLPELIRRTTAACESLGKEYEILLIDDGSSDNSAEMLVEAAQAPDSHIVAILLN-----RNYGQHSAIMAGFSH 88 (325)
T ss_pred ccCchhhHHHHHHHHHHHHHhCCCCEEEEEEeCCCCCcHHHHHHHHHhhcCCcEEEEEeC-----CCCCHHHHHHHHHHh
Confidence 5667777888888775555421234455555555555333333333332 3456544211 133444455555555
Q ss_pred HHhcCCCccEEEeccCCcccccChHHHHHHHhc
Q psy6303 93 LLESGWRWDFIVNLSESDYPVKTNRALVDFLSA 125 (270)
Q Consensus 93 al~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~ 125 (270)
| +.||++.+-+++- -+.++|.++++.
T Consensus 89 A-----~gd~vv~~DaD~q--~~p~~i~~l~~~ 114 (325)
T PRK10714 89 V-----TGDLIITLDADLQ--NPPEEIPRLVAK 114 (325)
T ss_pred C-----CCCEEEEECCCCC--CCHHHHHHHHHH
Confidence 4 3588888877766 367777777655
No 50
>PF06956 RtcR: Regulator of RNA terminal phosphate cyclase; InterPro: IPR009715 RtcR is a sigma54-dependent enhancer binding protein [] that activates transcription of the rtcBA operon. The product of the rtcA gene is an RNA 3 -terminal phosphate cyclase []. This domain is found at the N terminus of the RtcR sequence. RtcR, and other sigma54-dependent activators, contain IPR002078 from INTERPRO in the central region of the protein sequence.
Probab=25.00 E-value=1.2e+02 Score=26.14 Aligned_cols=17 Identities=29% Similarity=0.331 Sum_probs=6.9
Q ss_pred CCHHHHHHHHHHHHHhh
Q psy6303 16 KKQEDLKRTNRALVDFL 32 (270)
Q Consensus 16 ~~~~ql~rLi~~l~~~l 32 (270)
+||..++..-.+|.||.
T Consensus 79 ~dPWDfeeVY~~l~dfa 95 (183)
T PF06956_consen 79 ADPWDFEEVYAALHDFA 95 (183)
T ss_pred CCCccHHHHHHHHHHHH
Confidence 33444444444444443
No 51
>PF07745 Glyco_hydro_53: Glycosyl hydrolase family 53; InterPro: IPR011683 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This domain is found in family 53 of the glycosyl hydrolase classification []. These enzymes are endo-1,4- beta-galactanases (3.2.1.89 from EC). The structure of this domain is known [] and has a TIM barrel fold.; GO: 0015926 glucosidase activity; PDB: 1HJQ_A 1HJS_A 1HJU_B 1FHL_A 1FOB_A 2GFT_A 1UR4_B 1UR0_A 1R8L_B 2CCR_A ....
Probab=24.34 E-value=2.9e+02 Score=26.00 Aligned_cols=94 Identities=16% Similarity=0.134 Sum_probs=50.3
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCH-HHHHHHH-hhccCCCeEEecccceeeeCc-hhHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDY-LFRELLS-LELRLPNVRLSRRRQATIWGG-ASLVTILLDSMQ 91 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~-~~~~~~~-~~~~~~nV~~~~~r~~v~WGg-~S~v~A~l~ll~ 91 (270)
-.+++++.+|+++-++=.++-.|+..+.||+|.-.+. .+.-..+ +....-+.-++--.+.-.|.+ .+.+.+.++.|.
T Consensus 146 ~~~~~~~a~ll~ag~~AVr~~~p~~kV~lH~~~~~~~~~~~~~f~~l~~~g~d~DviGlSyYP~w~~~l~~l~~~l~~l~ 225 (332)
T PF07745_consen 146 PSNWDNLAKLLNAGIKAVREVDPNIKVMLHLANGGDNDLYRWFFDNLKAAGVDFDVIGLSYYPFWHGTLEDLKNNLNDLA 225 (332)
T ss_dssp TT-HHHHHHHHHHHHHHHHTHSSTSEEEEEES-TTSHHHHHHHHHHHHHTTGG-SEEEEEE-STTST-HHHHHHHHHHHH
T ss_pred ccCHHHHHHHHHHHHHHHHhcCCCCcEEEEECCCCchHHHHHHHHHHHhcCCCcceEEEecCCCCcchHHHHHHHHHHHH
Confidence 3678999999988877666558999999999987653 3222211 111222222333345557876 445555555554
Q ss_pred HHHhcCCCccEEEeccCCccccc
Q psy6303 92 QLLESGWRWDFIVNLSESDYPVK 114 (270)
Q Consensus 92 ~al~~~~~~dy~~lLSg~D~Plk 114 (270)
.-+. -=+.+....||-.
T Consensus 226 ~ry~------K~V~V~Et~yp~t 242 (332)
T PF07745_consen 226 SRYG------KPVMVVETGYPWT 242 (332)
T ss_dssp HHHT-------EEEEEEE---SB
T ss_pred HHhC------CeeEEEecccccc
Confidence 4432 2255666666554
No 52
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=24.27 E-value=8.4e+02 Score=25.57 Aligned_cols=114 Identities=11% Similarity=0.104 Sum_probs=63.9
Q ss_pred cCCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHH
Q psy6303 15 VKKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLL 94 (270)
Q Consensus 15 ~~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al 94 (270)
|++...+.+++..+...+ +.|+-.+++=+|...+.....+.++.+.+|+++++..... |.-+-..|--.+++.+.
T Consensus 72 ~nE~~vi~~~i~~ll~~l--dYP~~eI~vi~~~nD~~T~~~~~~l~~~~p~~~~v~~~~~---g~~gKa~aLN~~l~~~~ 146 (727)
T PRK11234 72 WNETGVIGNMAELAATTL--DYENYHIFVGTYPNDPATQADVDAVCARFPNVHKVVCARP---GPTSKADCLNNVLDAIT 146 (727)
T ss_pred CcchhhHHHHHHHHHHhC--CCCCeEEEEEecCCChhHHHHHHHHHHHCCCcEEEEeCCC---CCCCHHHHHHHHHHHHH
Confidence 444455566665441111 7888888887764433223334456667899876532221 22345555555555553
Q ss_pred hc----CCCccEEEeccCCcccccChHHHHHHHhcCCCCceeecc
Q psy6303 95 ES----GWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVKSH 135 (270)
Q Consensus 95 ~~----~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~~~ 135 (270)
+. +.+++.++++-++|.|=...-.+.+++.. ++.++..-
T Consensus 147 ~~e~~~~~~~~vvvi~DAD~~v~pd~L~~~~~l~~--~~~~VQ~p 189 (727)
T PRK11234 147 QFERSANFAFAGFILHDAEDVISPMELRLFNYLVE--RKDLIQIP 189 (727)
T ss_pred hhhcccCCcccEEEEEcCCCCCChhHHHHHHhhcC--CCCeEeec
Confidence 31 34678888888888775554444455543 34677663
No 53
>TIGR00031 UDP-GALP_mutase UDP-galactopyranose mutase. The gene is known as glf, ceoA, and rfbD. It is known experimentally in E. coli, Mycobacterium tuberculosis, and Klebsiella pneumoniae.
Probab=24.09 E-value=1.2e+02 Score=28.90 Aligned_cols=54 Identities=17% Similarity=0.134 Sum_probs=43.3
Q ss_pred CeEEEEecCCCCHHHHHHHHhhccCCCeEEecccceeeeCchhHHHHHHHHHHHHH
Q psy6303 39 NFVKSHGRQRQDYLFRELLSLELRLPNVRLSRRRQATIWGGASLVTILLDSMQQLL 94 (270)
Q Consensus 39 ~~~~IHvD~ks~~~~~~~~~~~~~~~nV~~~~~r~~v~WGg~S~v~A~l~ll~~al 94 (270)
..+|...|.+....+++..+++...+||+++. |. ..|.=++|=+|..++|+.+-
T Consensus 320 ~pyYpi~~~~~~~~~~~y~~la~~~~~v~~~G-Rl-g~y~Y~nMD~~i~~al~~~~ 373 (377)
T TIGR00031 320 EPYYPVNDNKNMALFKKYLELASREDNLILLG-RL-AEYQYYDMDQAILAALYKAE 373 (377)
T ss_pred eeeeeccCHHHHHHHHHHHHHHhcCCCEEEee-ee-eEeEeecHHHHHHHHHHHHH
Confidence 47899999888777777667677788999997 53 77888999999888888764
No 54
>PF02645 DegV: Uncharacterised protein, DegV family COG1307; InterPro: IPR003797 This family of proteins is related to DegV of Bacillus subtilis and includes paralogous sets in several species (B. subtilis, Deinococcus radiodurans, Mycoplasma pneumoniae) that are closer in percent identity to each other than to most homologs from other species. This suggests both recent paralogy and diversity of function.; PDB: 2DT8_A 3LUP_A 3NYI_B 3PL5_A 1PZX_B 1MGP_A 1VPV_B 3FYS_A 3EGL_C 3JR7_A ....
Probab=24.05 E-value=4.6e+02 Score=23.53 Aligned_cols=94 Identities=11% Similarity=0.059 Sum_probs=50.3
Q ss_pred CCHHHHHHHHHHHHHhhhhcCCCCeEEEEecCCCCHHHHHHHHhhccC--CCeEEecccceeeeCchhHHHHHHHHHHHH
Q psy6303 16 KKQEDLKRTNRALVDFLSANRDKNFVKSHGRQRQDYLFRELLSLELRL--PNVRLSRRRQATIWGGASLVTILLDSMQQL 93 (270)
Q Consensus 16 ~~~~ql~rLi~~l~~~l~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~--~nV~~~~~r~~v~WGg~S~v~A~l~ll~~a 93 (270)
-+|.++..++..+ +. ..-+..++||+..+-+..|+....++..+ .+|++++. .-++..+-...+.+.+.+
T Consensus 63 ps~~~~~~~f~~~---~~-~gyd~ii~i~iSs~LSgty~~a~~aa~~~~~~~i~ViDS----~~~s~g~g~lv~~a~~l~ 134 (280)
T PF02645_consen 63 PSPGEFEEAFEKL---LE-EGYDEIIVITISSGLSGTYNSARLAAKMLPDIKIHVIDS----KSVSAGQGLLVLEAAKLI 134 (280)
T ss_dssp --HHHHHHHHHHH---HH-TTTSEEEEEES-TTT-THHHHHHHHHHHHTTTEEEEEE-----SS-HHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHH---HH-CCCCeEEEEeCCcchhhHHHHHHHHHhhcCcCEEEEEeC----CCcchhhhHHHHHHHHHH
Confidence 4566666666663 23 24567999999999876666554333333 57888753 333444444444444444
Q ss_pred HhcCCCccEEEeccCCcccccChHHHHHHHhcCCCCceeec
Q psy6303 94 LESGWRWDFIVNLSESDYPVKTNRALVDFLSANRDKNFVKS 134 (270)
Q Consensus 94 l~~~~~~dy~~lLSg~D~Plks~~~i~~fl~~~~g~~Fi~~ 134 (270)
- .+ +|.++|.+.++....+..+-+
T Consensus 135 ~-~G----------------~s~~ei~~~l~~~~~~~~~~f 158 (280)
T PF02645_consen 135 E-QG----------------KSFEEIVEKLEELRERTRTYF 158 (280)
T ss_dssp H-TT------------------HHHHHHHHHHHHHTEEEEE
T ss_pred H-cC----------------CCHHHHHHHHHHHHhhceEEE
Confidence 3 22 368999999887444444333
No 55
>PF04084 ORC2: Origin recognition complex subunit 2 ; InterPro: IPR007220 The Origin Recognition Complex (ORC) is a six-subunit ATP-dependent DNA-binding complex encoded in yeast by ORC1-6 []. ORC is a central component for eukaryotic DNA replication, and binds chromatin at replication origins throughout the cell cycle []. ORC directs DNA replication throughout the genome and is required for its initiation [, , ]. ORC bound at replication origins serves as the foundation for assembly of the pre-replicative complex (pre-RC), which includes Cdc6, Tah11 (aka Cdt1), and the Mcm2-7 complex [, , ]. Pre-RC assembly during G1 is required for replication licensing of chromosomes prior to DNA synthesis during S phase [, , ]. Cell cycle-regulated phosphorylation of Orc2, Orc6, Cdc6, and MCM by the cyclin-dependent protein kinase Cdc28 regulates initiation of DNA replication, including blocking reinitiation in G2/M phase [, , , ]. In yeast, ORC also plays a role in the establishment of silencing at the mating-type loci Hidden MAT Left (HML) and Hidden MAT Right (HMR) [, , ]. ORC participates in the assembly of transcriptionally silent chromatin at HML and HMR by recruiting the Sir1 silencing protein to the HML and HMR silencers [, , ]. Both Orc1 and Orc5 bind ATP, though only Orc1 has ATPase activity []. The binding of ATP by Orc1 is required for ORC binding to DNA and is essential for cell viability []. The ATPase activity of Orc1 is involved in formation of the pre-RC [, , ]. ATP binding by Orc5 is crucial for the stability of ORC as a whole. Only the Orc1-5 subunits are required for origin binding; Orc6 is essential for maintenance of pre-RCs once formed []. Interactions within ORC suggest that Orc2-3-6 may form a core complex []. ORC homologues have been found in various eukaryotes, including fission yeast, insects, amphibians, and humans []. This entry represents subunit 2, which binds the origin of replication. It plays a role in chromosome replication and mating type transcriptional silencing.; GO: 0006260 DNA replication, 0000808 origin recognition complex, 0005634 nucleus
Probab=23.67 E-value=1.5e+02 Score=27.84 Aligned_cols=48 Identities=15% Similarity=0.205 Sum_probs=28.4
Q ss_pred HHHHHHHHhhhhc--CCCCeEEEE-ecCCC--CHHHHHHHHhhccCCCeEEec
Q psy6303 23 RTNRALVDFLSAN--RDKNFVKSH-GRQRQ--DYLFRELLSLELRLPNVRLSR 70 (270)
Q Consensus 23 rLi~~l~~~l~~~--~p~~~~~IH-vD~ks--~~~~~~~~~~~~~~~nV~~~~ 70 (270)
..+..+.+.++.. .+.-++.|| ||.-+ +...+.+.+.+...|||+++.
T Consensus 121 ~~~~~i~~~l~~~~~~~~l~lvIHnIDg~~LR~~~~Q~~La~LA~~p~I~lIA 173 (326)
T PF04084_consen 121 EQLDFIISYLESRPSPPPLYLVIHNIDGPSLRNEKAQSLLAQLASIPNIHLIA 173 (326)
T ss_pred HHHHHHHHHHhccCCCCceEEEEECCCChhhcChHHHHHHHHHHcCCCeEEEE
Confidence 3344444444532 345566666 67655 344555556566789999985
No 56
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=23.29 E-value=4.5e+02 Score=22.06 Aligned_cols=94 Identities=11% Similarity=0.047 Sum_probs=47.6
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCC-CCeEEEEecCCCCHHHHHHHHhh----ccCCCeEEecccceeeeCchhHHHHHHH
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRD-KNFVKSHGRQRQDYLFRELLSLE----LRLPNVRLSRRRQATIWGGASLVTILLD 88 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p-~~~~~IHvD~ks~~~~~~~~~~~----~~~~nV~~~~~r~~v~WGg~S~v~A~l~ 88 (270)
+|++.+.+.+.|.+|.+ ...| +....|=+|-.++.....+.+.. ...+++..+....+..+.+ .|
T Consensus 9 ~yNe~~~l~~~L~sl~~---q~~~~~~~eIiVvD~s~D~t~~~~~~~~~~~~~~~~~i~~~~~~~~~G~k~----~a--- 78 (232)
T cd06437 9 VFNEKYVVERLIEAACA---LDYPKDRLEIQVLDDSTDETVRLAREIVEEYAAQGVNIKHVRRADRTGYKA----GA--- 78 (232)
T ss_pred cCCcHHHHHHHHHHHHh---cCCCccceEEEEEECCCCcHHHHHHHHHHHHhhcCCceEEEECCCCCCCch----HH---
Confidence 45667888888888832 1233 33334457864444333332222 2345666554322222211 12
Q ss_pred HHHHHHhcCCCccEEEeccCCcccccChHHHHH
Q psy6303 89 SMQQLLESGWRWDFIVNLSESDYPVKTNRALVD 121 (270)
Q Consensus 89 ll~~al~~~~~~dy~~lLSg~D~Plks~~~i~~ 121 (270)
+..+++ ..+.+|+.++-+.+.|- .+.|..
T Consensus 79 -~n~g~~-~a~~~~i~~~DaD~~~~--~~~l~~ 107 (232)
T cd06437 79 -LAEGMK-VAKGEYVAIFDADFVPP--PDFLQK 107 (232)
T ss_pred -HHHHHH-hCCCCEEEEEcCCCCCC--hHHHHH
Confidence 222222 13579999999988874 344444
No 57
>cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13.
Probab=21.74 E-value=6.8e+02 Score=23.56 Aligned_cols=95 Identities=9% Similarity=0.051 Sum_probs=55.7
Q ss_pred hcCCHHHHHHHHHHHHHhh-hhcCCCCeEEEEecCCCCHHHHHHHHhhccC-CCeEEeccc--ceeeeC-------chhH
Q psy6303 14 LVKKQEDLKRTNRALVDFL-SANRDKNFVKSHGRQRQDYLFRELLSLELRL-PNVRLSRRR--QATIWG-------GASL 82 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l-~~~~p~~~~~IHvD~ks~~~~~~~~~~~~~~-~nV~~~~~r--~~v~WG-------g~S~ 82 (270)
+++.|+.+++.|.+|...- + .....++|--|...+..-..+. ++ .+|.++... -....| ..++
T Consensus 8 ayNRp~~l~r~LesLl~~~p~--~~~~~liIs~DG~~~~~~~~v~----~~~~~i~~i~~~~~~~~~~~~~~~~~~y~~i 81 (334)
T cd02514 8 ACNRPDYLRRMLDSLLSYRPS--AEKFPIIVSQDGGYEEVADVAK----SFGDGVTHIQHPPISIKNVNPPHKFQGYYRI 81 (334)
T ss_pred ecCCHHHHHHHHHHHHhcccc--CCCceEEEEeCCCchHHHHHHH----hhccccEEEEcccccccccCcccccchhhHH
Confidence 4788999999999994321 1 1256688888886543212111 22 245444321 112222 2334
Q ss_pred HHHHHHHHHHHHhcCCCccEEEeccCCcccccC
Q psy6303 83 VTILLDSMQQLLESGWRWDFIVNLSESDYPVKT 115 (270)
Q Consensus 83 v~A~l~ll~~al~~~~~~dy~~lLSg~D~Plks 115 (270)
...-..++..++.. .+++++|.|=++|.|-..
T Consensus 82 a~hyk~aln~vF~~-~~~~~vIILEDDl~~sPd 113 (334)
T cd02514 82 ARHYKWALTQTFNL-FGYSFVIILEDDLDIAPD 113 (334)
T ss_pred HHHHHHHHHHHHHh-cCCCEEEEECCCCccCHh
Confidence 44444467777753 479999999999988765
No 58
>COG3618 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=21.32 E-value=4.4e+02 Score=24.28 Aligned_cols=26 Identities=8% Similarity=-0.158 Sum_probs=14.9
Q ss_pred hcCCHHHHHHHHHHHHHhhhhcCCCCeEEEE
Q psy6303 14 LVKKQEDLKRTNRALVDFLSANRDKNFVKSH 44 (270)
Q Consensus 14 ~~~~~~ql~rLi~~l~~~l~~~~p~~~~~IH 44 (270)
+.-++.|+..++..+ + ..|+..|+|=
T Consensus 142 l~~~~~ql~~~i~l~----~-~~Pd~~~Vld 167 (279)
T COG3618 142 LQVDPHQLPDLIPLA----L-KAPDVNFVLD 167 (279)
T ss_pred EEeChhhhHHHHHHH----h-hCCCCCEEec
Confidence 444566666666555 3 3676666553
Done!