Query psy11718
Match_columns 65
No_of_seqs 105 out of 1007
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 17:09:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11718.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11718hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG1215 Glycosyltransferases, 99.7 1.4E-16 3E-21 110.1 6.1 61 2-65 223-284 (439)
2 PF13632 Glyco_trans_2_3: Glyc 99.7 1.8E-16 3.9E-21 99.7 6.0 60 2-64 80-142 (193)
3 PRK14583 hmsR N-glycosyltransf 99.7 2.3E-16 5E-21 111.8 6.3 60 3-65 240-300 (444)
4 PRK11204 N-glycosyltransferase 99.6 1.1E-15 2.5E-20 106.4 6.3 59 3-64 219-278 (420)
5 PRK14716 bacteriophage N4 adso 99.6 1E-15 2.2E-20 111.7 6.2 53 10-65 255-323 (504)
6 cd06437 CESA_CaSu_A2 Cellulose 99.6 3.3E-15 7.2E-20 96.2 6.4 59 3-64 172-231 (232)
7 PF13641 Glyco_tranf_2_3: Glyc 99.6 7.4E-15 1.6E-19 93.7 5.6 59 3-64 169-228 (228)
8 TIGR03030 CelA cellulose synth 99.6 7.4E-15 1.6E-19 109.9 6.1 61 2-65 316-377 (713)
9 cd06427 CESA_like_2 CESA_like_ 99.5 7.3E-15 1.6E-19 95.6 5.2 60 3-65 171-230 (241)
10 PRK11234 nfrB bacteriophage N4 99.5 1.5E-14 3.3E-19 109.0 6.2 58 5-65 244-328 (727)
11 TIGR03111 glyc2_xrt_Gpos1 puta 99.5 2.2E-14 4.7E-19 101.9 6.1 60 3-65 225-286 (439)
12 PRK11498 bcsA cellulose syntha 99.5 1.9E-14 4.2E-19 110.0 5.8 60 2-64 427-487 (852)
13 cd04191 Glucan_BSP_ModH Glucan 99.5 4.9E-14 1.1E-18 94.8 5.7 53 9-64 196-250 (254)
14 cd04190 Chitin_synth_C C-termi 99.5 1.2E-14 2.6E-19 95.6 2.2 49 17-65 190-241 (244)
15 cd06435 CESA_NdvC_like NdvC_li 99.5 7.2E-14 1.6E-18 89.7 5.6 60 3-65 168-228 (236)
16 PLN02893 Cellulose synthase-li 99.4 3.7E-13 8E-18 101.8 4.3 54 8-64 437-493 (734)
17 PRK15489 nfrB bacteriophage N4 99.4 3.9E-13 8.4E-18 101.4 4.4 53 10-65 260-336 (703)
18 cd06421 CESA_CelA_like CESA_Ce 99.4 1.6E-12 3.5E-17 82.7 5.6 60 3-65 170-230 (234)
19 PLN02248 cellulose synthase-li 99.4 6E-13 1.3E-17 103.7 4.3 55 7-64 826-883 (1135)
20 PLN02400 cellulose synthase 99.4 6.7E-13 1.5E-17 103.2 4.3 55 7-64 772-829 (1085)
21 PLN02189 cellulose synthase 99.4 7.2E-13 1.6E-17 102.7 4.4 55 7-64 728-785 (1040)
22 PLN02195 cellulose synthase A 99.3 7.7E-13 1.7E-17 102.1 4.5 55 7-64 663-720 (977)
23 PLN02638 cellulose synthase A 99.3 7.2E-13 1.6E-17 103.0 4.2 55 7-64 767-824 (1079)
24 PLN02915 cellulose synthase A 99.3 1E-12 2.2E-17 101.9 4.7 55 7-64 731-788 (1044)
25 PRK05454 glucosyltransferase M 99.3 1.9E-12 4.1E-17 97.4 5.5 54 8-64 320-375 (691)
26 PLN02436 cellulose synthase A 99.3 1.7E-12 3.6E-17 101.0 4.3 55 7-64 783-840 (1094)
27 PLN02190 cellulose synthase-li 99.3 2.1E-12 4.4E-17 97.9 3.8 55 7-64 449-506 (756)
28 cd04192 GT_2_like_e Subfamily 99.3 6.6E-12 1.4E-16 79.3 5.3 61 3-64 165-229 (229)
29 cd06434 GT2_HAS Hyaluronan syn 99.3 5.6E-12 1.2E-16 80.5 4.3 48 17-64 183-231 (235)
30 cd02520 Glucosylceramide_synth 99.3 1E-11 2.2E-16 78.7 4.9 61 3-64 134-195 (196)
31 TIGR03472 HpnI hopanoid biosyn 99.1 5.9E-11 1.3E-15 82.7 4.7 62 2-64 209-271 (373)
32 PF03552 Cellulose_synt: Cellu 99.1 6.8E-12 1.5E-16 94.8 -2.3 55 7-64 413-470 (720)
33 PF03142 Chitin_synth_2: Chiti 98.8 3.2E-09 6.9E-14 78.4 4.1 47 18-64 324-373 (527)
34 cd06439 CESA_like_1 CESA_like_ 98.8 1.2E-08 2.6E-13 65.9 6.1 53 10-65 195-248 (251)
35 cd02525 Succinoglycan_BP_ExoA 98.8 1.5E-08 3.2E-13 64.6 6.0 60 4-65 167-227 (249)
36 PF13506 Glyco_transf_21: Glyc 98.6 4.3E-08 9.4E-13 62.8 4.3 60 3-63 111-175 (175)
37 KOG2571|consensus 98.4 2.5E-07 5.4E-12 71.5 4.0 48 17-64 548-596 (862)
38 TIGR03469 HonB hopene-associat 97.9 1.2E-05 2.7E-10 56.3 4.1 61 2-63 218-280 (384)
39 cd02510 pp-GalNAc-T pp-GalNAc- 96.8 0.0042 9.1E-08 41.6 5.7 41 3-43 181-222 (299)
40 cd02526 GT2_RfbF_like RfbF is 96.3 0.0056 1.2E-07 38.9 3.4 45 3-50 161-208 (237)
41 cd04184 GT2_RfbC_Mx_like Myxoc 96.2 0.0055 1.2E-07 38.0 3.0 38 3-42 159-196 (202)
42 cd02522 GT_2_like_a GT_2_like_ 96.2 0.0078 1.7E-07 37.7 3.8 32 4-38 147-178 (221)
43 cd04195 GT2_AmsE_like GT2_AmsE 96.1 0.0083 1.8E-07 37.2 3.3 35 3-40 160-194 (201)
44 cd06420 GT2_Chondriotin_Pol_N 95.9 0.016 3.6E-07 35.2 4.1 38 2-39 135-172 (182)
45 cd06913 beta3GnTL1_like Beta 1 95.8 0.021 4.6E-07 36.2 4.4 37 4-41 167-203 (219)
46 COG1216 Predicted glycosyltran 95.8 0.016 3.5E-07 39.3 4.0 37 3-40 177-213 (305)
47 TIGR01556 rhamnosyltran L-rham 95.6 0.043 9.3E-07 36.3 5.5 37 4-41 159-195 (281)
48 cd04186 GT_2_like_c Subfamily 95.6 0.021 4.6E-07 33.7 3.6 38 4-42 123-160 (166)
49 PF10111 Glyco_tranf_2_2: Glyc 95.0 0.042 9.1E-07 37.0 4.0 40 3-42 178-217 (281)
50 COG2943 MdoH Membrane glycosyl 95.0 0.055 1.2E-06 41.4 4.8 48 17-64 346-395 (736)
51 cd06433 GT_2_WfgS_like WfgS an 94.4 0.063 1.4E-06 32.5 3.3 36 4-41 153-188 (202)
52 cd04185 GT_2_like_b Subfamily 93.8 0.14 3E-06 31.8 4.1 42 5-48 133-175 (202)
53 cd00899 b4GalT Beta-4-Galactos 93.0 0.14 3.1E-06 34.5 3.4 60 3-62 121-193 (219)
54 PF02709 Glyco_transf_7C: N-te 92.6 0.19 4.1E-06 28.5 3.1 37 3-39 29-65 (78)
55 cd06438 EpsO_like EpsO protein 91.7 0.052 1.1E-06 33.6 0.1 20 3-25 164-183 (183)
56 PRK10063 putative glycosyl tra 90.2 0.91 2E-05 30.2 4.9 37 4-42 155-191 (248)
57 KOG2547|consensus 86.1 0.47 1E-05 34.9 1.6 59 4-63 255-314 (431)
58 cd06442 DPM1_like DPM1_like re 83.1 2.8 6.2E-05 26.0 4.1 25 17-41 173-197 (224)
59 cd04188 DPG_synthase DPG_synth 73.6 4.7 0.0001 25.2 3.0 24 17-40 178-201 (211)
60 TIGR00620 sporelyase spore pho 70.9 8.5 0.00018 25.7 3.8 27 17-43 68-95 (199)
61 PLN02726 dolichyl-phosphate be 66.9 18 0.00039 23.3 4.7 25 17-41 188-212 (243)
62 PRK10018 putative glycosyl tra 65.7 12 0.00026 25.5 3.8 44 6-51 167-212 (279)
63 PRK10073 putative glycosyl tra 64.1 10 0.00022 26.2 3.3 29 10-41 182-210 (328)
64 PF13712 Glyco_tranf_2_5: Glyc 59.9 19 0.00041 23.6 3.9 44 10-53 160-204 (217)
65 cd04196 GT_2_like_d Subfamily 58.9 14 0.00031 22.5 3.0 34 4-41 163-197 (214)
66 COG3162 Predicted membrane pro 55.8 8.2 0.00018 23.4 1.4 19 44-62 8-26 (102)
67 smart00265 BH4 BH4 Bcl-2 homol 55.0 12 0.00025 17.5 1.6 18 17-35 6-23 (27)
68 cd00761 Glyco_tranf_GTA_type G 53.4 11 0.00025 20.8 1.8 31 6-37 125-155 (156)
69 COG2028 Uncharacterized conser 51.8 6.2 0.00014 24.9 0.5 19 46-64 80-98 (145)
70 PRK10969 DNA polymerase III su 43.4 9.8 0.00021 21.9 0.4 20 43-62 40-59 (75)
71 PF10126 Nit_Regul_Hom: Unchar 41.1 27 0.00059 21.4 2.1 46 11-59 44-90 (110)
72 PF06440 DNA_pol3_theta: DNA p 38.7 18 0.00038 20.8 1.0 21 41-61 38-58 (75)
73 PF08235 LNS2: LNS2 (Lipin/Ned 37.6 29 0.00062 22.3 1.9 24 23-46 33-56 (157)
74 PF02180 BH4: Bcl-2 homology r 36.4 48 0.001 15.4 2.1 18 17-35 6-23 (27)
75 smart00775 LNS2 LNS2 domain. T 35.6 43 0.00094 20.8 2.5 29 20-48 30-58 (157)
76 PF11397 GlcNAc: Glycosyltrans 34.4 47 0.001 23.7 2.8 38 2-39 216-254 (343)
77 PF00070 Pyr_redox: Pyridine n 34.2 79 0.0017 16.9 3.6 37 23-59 12-50 (80)
78 PF02639 DUF188: Uncharacteriz 33.8 38 0.00083 20.9 2.0 40 17-58 54-93 (130)
79 PF04033 DUF365: Domain of unk 32.1 15 0.00033 22.0 -0.0 19 46-64 42-60 (97)
80 COG4590 ABC-type uncharacteriz 31.6 5.1 0.00011 30.8 -2.5 28 33-60 697-727 (733)
81 PF13500 AAA_26: AAA domain; P 30.2 1.4E+02 0.0031 18.6 4.7 33 23-55 19-51 (199)
82 PF09140 MipZ: ATPase MipZ; I 29.2 70 0.0015 22.4 2.8 39 20-61 16-54 (261)
83 COG4464 CapC Capsular polysacc 27.4 1.3E+02 0.0028 21.0 3.8 46 10-56 11-60 (254)
84 PRK00124 hypothetical protein; 27.3 58 0.0013 20.8 2.1 40 17-58 70-109 (151)
85 KOG2116|consensus 25.8 74 0.0016 25.3 2.7 30 18-47 559-588 (738)
86 PF00289 CPSase_L_chain: Carba 24.9 1.1E+02 0.0023 18.2 2.8 22 14-36 81-102 (110)
87 PRK11835 hypothetical protein; 24.2 21 0.00045 22.0 -0.4 9 57-65 14-22 (114)
88 KOG1359|consensus 23.5 37 0.0008 24.9 0.7 19 43-61 271-289 (417)
89 PRK11750 gltB glutamate syntha 23.2 88 0.0019 27.0 2.8 39 20-58 468-512 (1485)
90 PF04328 DUF466: Protein of un 21.8 1E+02 0.0022 16.8 2.2 17 44-60 37-53 (65)
91 PF13362 Toprim_3: Toprim doma 20.3 1.7E+02 0.0038 16.1 4.0 30 26-58 64-93 (96)
92 KOG3833|consensus 20.2 96 0.0021 23.0 2.3 28 27-54 450-478 (505)
93 PF09724 DUF2036: Uncharacteri 20.0 26 0.00056 24.2 -0.6 30 34-63 287-317 (325)
94 PF05280 FlhC: Flagellar trans 20.0 31 0.00068 22.4 -0.2 37 20-57 10-46 (175)
No 1
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=99.66 E-value=1.4e-16 Score=110.11 Aligned_cols=61 Identities=23% Similarity=0.475 Sum_probs=55.9
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
+.++++.+||++ .++|||.+++++++.+||++.|+ ++.+++++|+|++++++||.||++|.
T Consensus 223 r~aL~~~g~~~~---~~i~ED~~lt~~l~~~G~~~~~~~~~~~~~~~p~t~~~~~~Qr~RW~~g~ 284 (439)
T COG1215 223 RSALEEVGGWLE---DTITEDADLTLRLHLRGYRVVYVPEAIVWTEAPETLKELWRQRLRWARGG 284 (439)
T ss_pred HHHHHHhCCCCC---CceeccHHHHHHHHHCCCeEEEeecceEeeeCcccHHHHHHHHHHHHccc
Confidence 346677788999 99999999999999999999999 56799999999999999999999984
No 2
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=99.66 E-value=1.8e-16 Score=99.74 Aligned_cols=60 Identities=25% Similarity=0.536 Sum_probs=53.2
Q ss_pred hhhhhhh-ccCC-CCCCCChhhHHHHHHHHHHCCCeEEEec-ceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 2 KVSAEKQ-VSFD-NGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 2 ~~~~e~~-~gw~-~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.++.++ +||+ . .+++||+++++++.++||++.|++ +.+++++|.|++++++||+||..|
T Consensus 80 r~~~l~~vg~~~~~---~~~~ED~~l~~~l~~~G~~~~~~~~~~~~~~~p~t~~~~~~Qr~RW~~g 142 (193)
T PF13632_consen 80 RREALREVGGFDDP---FSIGEDMDLGFRLRRAGYRIVYVPDAIVYTEAPPTFRAFIRQRRRWARG 142 (193)
T ss_pred eHHHHHHhCccccc---ccccchHHHHHHHHHCCCEEEEecccceeeeCCCCHHHHHHHHHHHHhh
Confidence 4445554 5999 6 899999999999999999999995 569999999999999999999998
No 3
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=99.65 E-value=2.3e-16 Score=111.83 Aligned_cols=60 Identities=23% Similarity=0.290 Sum_probs=55.3
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
.++++.+||+. +.++||++++++|+++||++.|. ++.+++++|+|++++++||.||++|.
T Consensus 240 ~al~~vGg~~~---~~i~ED~dl~~rl~~~G~~i~~~p~a~~~~~~p~t~~~~~~Qr~RW~~G~ 300 (444)
T PRK14583 240 RALADVGYWSP---DMITEDIDISWKLQLKHWSVFFEPRGLCWILMPETLRGLWKQRLRWAQGG 300 (444)
T ss_pred HHHHHcCCCCC---CcccccHHHHHHHHHcCCeEEEeeccEEeeeCCCCHHHHHHHHHHHhCcH
Confidence 45677889999 89999999999999999999998 67799999999999999999999983
No 4
>PRK11204 N-glycosyltransferase; Provisional
Probab=99.61 E-value=1.1e-15 Score=106.41 Aligned_cols=59 Identities=20% Similarity=0.341 Sum_probs=54.9
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
..+++.+||+. ..++||++++++++++||++.|. ++.|++++|+|++++++||+||.+|
T Consensus 219 ~~l~~vgg~~~---~~~~ED~~l~~rl~~~G~~i~~~p~~~~~~~~p~t~~~~~~Qr~RW~~G 278 (420)
T PRK11204 219 SALHEVGYWST---DMITEDIDISWKLQLRGWDIRYEPRALCWILMPETLKGLWKQRLRWAQG 278 (420)
T ss_pred HHHHHhCCCCC---CcccchHHHHHHHHHcCCeEEeccccEEEeECcccHHHHHHHHHHHhcC
Confidence 45677889999 89999999999999999999998 6779999999999999999999998
No 5
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=99.61 E-value=1e-15 Score=111.66 Aligned_cols=53 Identities=30% Similarity=0.463 Sum_probs=47.9
Q ss_pred cCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ce---------------eeecCCCCHHHHHHHHHhhhcCC
Q psy11718 10 SFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GE---------------MWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 10 gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~---------------~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
+|+. +++|||+++++++.++||++.|++ +. +++++|+|++++++||.||.+|+
T Consensus 255 ~fd~---~sLTED~dLglRL~~~G~rv~y~p~ai~~~~~~~~~~~~~v~t~e~~P~t~~a~~rQR~RW~~Gi 323 (504)
T PRK14716 255 PFDS---DSLTEDYDIGLRLKRAGFRQIFVRVRADDTTDRPDRRGEPIATREFFPDTFKAAVRQKARWIYGI 323 (504)
T ss_pred CCCC---CCcchHHHHHHHHHHCCCEEEEecccccccccccccccccccccccCccCHHHHHHHHHHHHhch
Confidence 3998 999999999999999999999983 42 45889999999999999999995
No 6
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=99.59 E-value=3.3e-15 Score=96.22 Aligned_cols=59 Identities=20% Similarity=0.392 Sum_probs=54.7
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
..+++.+||+. ..+.||++|++++..+||++.|+ ++.++...|.|++++++||+||++|
T Consensus 172 ~~~~~vgg~~~---~~~~ED~~l~~rl~~~G~~~~~~~~~~v~~~~~~~~~~~~~q~~rW~~g 231 (232)
T cd06437 172 ECIEDAGGWNH---DTLTEDLDLSYRAQLKGWKFVYLDDVVVPAELPASMSAYRSQQHRWSKG 231 (232)
T ss_pred HHHHHhCCCCC---CcchhhHHHHHHHHHCCCeEEEeccceeeeeCCcCHHHHHHHHHHhccC
Confidence 45677889998 88999999999999999999998 6779999999999999999999998
No 7
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=99.56 E-value=7.4e-15 Score=93.69 Aligned_cols=59 Identities=25% Similarity=0.432 Sum_probs=45.6
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
..+++.+|||. ..+.||.++++++.++||++.|+ ++.++.++|.|++++++||.||.+|
T Consensus 169 ~~~~~~g~fd~---~~~~eD~~l~~r~~~~G~~~~~~~~~~v~~~~~~~~~~~~~q~~RW~~g 228 (228)
T PF13641_consen 169 SALEEVGGFDP---FILGEDFDLCLRLRAAGWRIVYAPDALVYHEEPSSLKAFFKQRFRWSRG 228 (228)
T ss_dssp HHHHHH-S--S---SSSSHHHHHHHHHHHTT--EEEEEEEEEEE--SSSTHHHHHHHHHHH--
T ss_pred HHHHHhCCCCC---CCcccHHHHHHHHHHCCCcEEEECCcEEEEeCCCCHHHHHHHHhccCcC
Confidence 35667779999 88899999999999999999998 5669999999999999999999998
No 8
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=99.55 E-value=7.4e-15 Score=109.89 Aligned_cols=61 Identities=21% Similarity=0.337 Sum_probs=56.1
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
|.++++.+||+. ++++||++++++|+++||++.|+ ++.+++++|+|++++++||.||++|.
T Consensus 316 R~al~~iGGf~~---~~vtED~~l~~rL~~~G~~~~y~~~~~~~g~~p~sl~~~~~Qr~RWa~G~ 377 (713)
T TIGR03030 316 REALDEIGGIAG---ETVTEDAETALKLHRRGWNSAYLDRPLIAGLAPETLSGHIGQRIRWAQGM 377 (713)
T ss_pred HHHHHHcCCCCC---CCcCcHHHHHHHHHHcCCeEEEeccccccccCCCCHHHHHHHHHHHhcCh
Confidence 456788899998 89999999999999999999998 57799999999999999999999983
No 9
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=99.55 E-value=7.3e-15 Score=95.58 Aligned_cols=60 Identities=22% Similarity=0.431 Sum_probs=55.2
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
..+++.+||+. ..++||++|+++++++||++.+++..+++++|+|++.+++||.||.+|+
T Consensus 171 ~~~~~vgg~~~---~~~~eD~~l~~rl~~~G~r~~~~~~~~~~~~~~~~~~~~~q~~Rw~~g~ 230 (241)
T cd06427 171 DVLRELGGWDP---FNVTEDADLGLRLARAGYRTGVLNSTTLEEANNALGNWIRQRSRWIKGY 230 (241)
T ss_pred HHHHHcCCCCc---ccchhhHHHHHHHHHCCceEEEecccccccCcHhHHHHHHHHHHHhccH
Confidence 45677889998 7899999999999999999999987889999999999999999999984
No 10
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=99.53 E-value=1.5e-14 Score=109.04 Aligned_cols=58 Identities=26% Similarity=0.314 Sum_probs=51.6
Q ss_pred hhhhc---cCCCCCCCChhhHHHHHHHHHHCCCeEEEec------------------------ceeeecCCCCHHHHHHH
Q psy11718 5 AEKQV---SFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE------------------------GEMWEKSPFSLWDFVQQ 57 (65)
Q Consensus 5 ~e~~~---gw~~g~~~sltED~~l~~~l~~~g~~~~~~~------------------------~~~~~~~P~~~~~~~~Q 57 (65)
+.+.+ +|+. +++|||++++++|+.+||++.|++ ..+++++|+|++++++|
T Consensus 244 l~~~ggg~~~~~---~~lTED~dlg~rL~~~G~~v~f~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~t~~~~~rQ 320 (727)
T PRK11234 244 LLEDGDGIAFDV---QSLTEDYDIGFRLKEKGMREIFVRFPVVDEAKEREQRKFLQHARTSNMICVREYFPDTFSAAVRQ 320 (727)
T ss_pred HHHhcCCCCcCC---CcchHHHHHHHHHHHCCCEEEEcccccccccccccccccccccccccceEEEEeCchhHHHHHHH
Confidence 44555 5999 999999999999999999999986 22889999999999999
Q ss_pred HHhhhcCC
Q psy11718 58 RKRWLQGY 65 (65)
Q Consensus 58 R~RW~~G~ 65 (65)
|.||.+|+
T Consensus 321 R~RW~~G~ 328 (727)
T PRK11234 321 KSRWIIGI 328 (727)
T ss_pred HHHHHccc
Confidence 99999995
No 11
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=99.52 E-value=2.2e-14 Score=101.91 Aligned_cols=60 Identities=17% Similarity=0.215 Sum_probs=54.0
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHH-CCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYK-QGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~-~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
..+++.+||+. .+++||+++++++.. .|+++.|+ ++.++.++|+|++++++||.||.+|.
T Consensus 225 ~~l~~vggf~~---~~i~ED~~l~~rl~~~~g~kv~~~~~a~~~~~~p~t~~~~~~QR~RW~rG~ 286 (439)
T TIGR03111 225 ETILKTQLYNS---ETVGEDTDMTFQIRELLDGKVYLCENAIFYVDPIDGLNKLYTQRQRWQRGE 286 (439)
T ss_pred HHHHHhCCCCC---CCcCccHHHHHHHHHhcCCeEEECCCCEEEEECCcCHHHHHHHHHHHhccH
Confidence 45777899999 899999999999975 59999998 67899999999999999999999984
No 12
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=99.52 E-value=1.9e-14 Score=110.04 Aligned_cols=60 Identities=18% Similarity=0.276 Sum_probs=55.7
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
|.++++.+||+. ++++||++++++++++||++.|+ ++.+.+++|+|++++++||.||++|
T Consensus 427 ReaLeeVGGfd~---~titED~dlslRL~~~Gyrv~yl~~~~a~glaPesl~~~~~QR~RWarG 487 (852)
T PRK11498 427 RKPLDEIGGIAV---ETVTEDAHTSLRLHRRGYTSAYMRIPQAAGLATESLSAHIGQRIRWARG 487 (852)
T ss_pred HHHHHHhcCCCC---CccCccHHHHHHHHHcCCEEEEEeccceeEECCCCHHHHHHHHHHHHHH
Confidence 356788899999 89999999999999999999999 5779999999999999999999998
No 13
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=99.49 E-value=4.9e-14 Score=94.77 Aligned_cols=53 Identities=28% Similarity=0.523 Sum_probs=48.3
Q ss_pred ccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ce-eeecCCCCHHHHHHHHHhhhcC
Q psy11718 9 VSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GE-MWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 9 ~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~-~~~~~P~~~~~~~~QR~RW~~G 64 (65)
++|+. .+++||++++++++.+||++.|.+ .. +++++|++++++++||.||++|
T Consensus 196 g~~~~---~~l~eD~~l~~~~~~~G~ri~~~~~~~~~~~~~p~~~~~~~~qr~RW~~G 250 (254)
T cd04191 196 PPFGG---HILSHDFVEAALMRRAGWEVRLAPDLEGSYEECPPTLIDFLKRDRRWCQG 250 (254)
T ss_pred CCCCC---CeecHHHHHHHHHHHcCCEEEEccCCcceEeECCCCHHHHHHHHHHHHhh
Confidence 46766 899999999999999999999985 45 5899999999999999999998
No 14
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=99.48 E-value=1.2e-14 Score=95.60 Aligned_cols=49 Identities=22% Similarity=0.375 Sum_probs=45.2
Q ss_pred CChhhHHHHHHHHHHCCCeEEE--e-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNF--V-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~--~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
.+++||++|+++|.++||++.| + ++.+++++|+|++++++||.||.+|.
T Consensus 190 ~~~~ED~~l~~~l~~~G~~~~~~~~~~a~~~~~~p~s~~~~~~QR~RW~~g~ 241 (244)
T cd04190 190 LDLGEDRILCTLLLKAGPKRKYLYVPGAVAETDVPETFVELLSQRRRWINST 241 (244)
T ss_pred HhHhcccceeHHHhccCCccEEEEecccEEEEECCCCHHHHHHHhHhhhccc
Confidence 3589999999999999999988 6 57799999999999999999999983
No 15
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=99.48 E-value=7.2e-14 Score=89.69 Aligned_cols=60 Identities=20% Similarity=0.270 Sum_probs=54.2
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
..+++.+||+. ..+.||++++.++.++||++.|+ ++.++++.|.++.++++||.||.+|.
T Consensus 168 ~~~~~iGgf~~---~~~~eD~dl~~r~~~~G~~~~~~~~~~~~~~~~~~~~~~~~q~~rw~~g~ 228 (236)
T cd06435 168 SALDDVGGWDE---WCITEDSELGLRMHEAGYIGVYVAQSYGHGLIPDTFEAFKKQRFRWAYGA 228 (236)
T ss_pred HHHHHhCCCCC---ccccchHHHHHHHHHCCcEEEEcchhhccCcCcccHHHHHHHHHHHhcch
Confidence 45677888998 77899999999999999999998 46799999999999999999999983
No 16
>PLN02893 Cellulose synthase-like protein
Probab=99.38 E-value=3.7e-13 Score=101.80 Aligned_cols=54 Identities=31% Similarity=0.557 Sum_probs=49.2
Q ss_pred hccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc---eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 8 QVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG---EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 8 ~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~---~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
++||.. +|+|||+.++++|+.+||++.|+++ ...+.+|+|+.+++.||+||+.|
T Consensus 437 ~~G~~y---gsvtED~~Tg~~lh~~GWrSvY~~p~~~af~G~aP~~l~~~l~Q~~RWa~G 493 (734)
T PLN02893 437 KMGFRY---GSLVEDYYTGYRLQCEGWKSIFCNPKRPAFLGDSPINLHDVLNQQKRWSVG 493 (734)
T ss_pred ccceEe---ccccccHHHHHHHHhcCCcEEecCCCchhhccCCCCCHHHHHHHHHHHHhh
Confidence 467777 9999999999999999999999953 26899999999999999999998
No 17
>PRK15489 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=99.38 E-value=3.9e-13 Score=101.36 Aligned_cols=53 Identities=26% Similarity=0.379 Sum_probs=47.0
Q ss_pred cCCCCCCCChhhHHHHHHHHHHCCCeEEEe--c---------------------ce-eeecCCCCHHHHHHHHHhhhcCC
Q psy11718 10 SFDNGLDGSVAEDCYFAMLAYKQGFSFNFV--E---------------------GE-MWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 10 gw~~g~~~sltED~~l~~~l~~~g~~~~~~--~---------------------~~-~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
+||. +|||||+|+++||+++||++.|+ + .. +++.+|.||++.++||.||..||
T Consensus 260 ~~n~---~sLTED~Dlg~RL~~~G~r~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~tre~fP~~~~a~~rQk~RW~~Gi 336 (703)
T PRK15489 260 PFNT---SSLTEDYDFSFRLAELGMQEIFVRFPVQFRVRRTSWFGPRRERTREMLLCVREYFPDTFRTAYRQKARWVLGI 336 (703)
T ss_pred CCCC---CCchHhHHHHHHHHHCCCceEEEEEeccccccccccccccccccccCceeehhhCcHHHHHHHHHHHHHHhHH
Confidence 5777 89999999999999999999882 1 12 68999999999999999999996
No 18
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=99.36 E-value=1.6e-12 Score=82.67 Aligned_cols=60 Identities=25% Similarity=0.360 Sum_probs=54.3
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
..+++.+|++. ..+.||++++.+++++||++.|+ ++.++...|.+++.+++||.||.+|.
T Consensus 170 ~~~~~ig~~~~---~~~~eD~~l~~r~~~~g~~i~~~~~~~~~~~~~~~~~~~~~q~~rw~~~~ 230 (234)
T cd06421 170 EALDEIGGFPT---DSVTEDLATSLRLHAKGWRSVYVPEPLAAGLAPETLAAYIKQRLRWARGM 230 (234)
T ss_pred HHHHHhCCCCc---cceeccHHHHHHHHHcCceEEEecCccccccCCccHHHHHHHHHHHhcCC
Confidence 45677888997 78999999999999999999998 56799999999999999999999983
No 19
>PLN02248 cellulose synthase-like protein
Probab=99.36 E-value=6e-13 Score=103.70 Aligned_cols=55 Identities=25% Similarity=0.409 Sum_probs=50.2
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec--ce-eeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE--GE-MWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~--~~-~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||.. +|+|||+.++++||.+||+..|+. .. ..+.+|+|+.+++.||+||+.|
T Consensus 826 ~evG~~Y---GSvTEDv~TGlrLH~rGWrSvY~~p~r~AF~GlAP~~L~d~L~Qr~RWA~G 883 (1135)
T PLN02248 826 DRVGWIY---GSVTEDVVTGYRMHNRGWRSVYCVTKRDAFRGTAPINLTDRLHQVLRWATG 883 (1135)
T ss_pred hhcCeee---cceechHHHHHHHHhcCCceEeCCCChHhhcCCCCCCHHHHHHHHHHHhhc
Confidence 4689999 999999999999999999999983 22 6799999999999999999998
No 20
>PLN02400 cellulose synthase
Probab=99.35 E-value=6.7e-13 Score=103.22 Aligned_cols=55 Identities=25% Similarity=0.372 Sum_probs=50.7
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-c--eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-G--EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~--~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||-- +|+|||+.++++||.+||+..|++ . .+.+.+|+|+.+++.||+||+.|
T Consensus 772 ~evGwiY---GSvTED~~TG~~LH~rGWrSvY~~p~r~af~GlAP~~l~d~L~Qr~RWA~G 829 (1085)
T PLN02400 772 KEIGWIY---GSVTEDILTGFKMHARGWISIYCMPPRPAFKGSAPINLSDRLNQVLRWALG 829 (1085)
T ss_pred hhhCeec---cceechHHHHHHHHccCCceEecCCCcHhhcCcCCCCHHHHHHHHHHHhhc
Confidence 4689999 999999999999999999999994 3 37899999999999999999998
No 21
>PLN02189 cellulose synthase
Probab=99.35 E-value=7.2e-13 Score=102.74 Aligned_cols=55 Identities=22% Similarity=0.325 Sum_probs=50.5
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc---eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG---EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~---~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||.. +|+|||+.++++||.+||+..|+++ .+.+.+|+|+.+++.||+||+.|
T Consensus 728 ~evGw~Y---GSvTED~~TG~rlH~rGWrSvY~~p~r~AF~GlAP~~L~~~L~Qr~RWA~G 785 (1040)
T PLN02189 728 LELGWIY---GSITEDILTGFKMHCRGWRSIYCMPKRAAFKGSAPINLSDRLNQVLRWALG 785 (1040)
T ss_pred hccCeec---cccccHHHHHHHHHccCCceEecCCCcHHhcCcCCCCHHHHHHHHHHHhhh
Confidence 4679999 9999999999999999999999943 36899999999999999999998
No 22
>PLN02195 cellulose synthase A
Probab=99.35 E-value=7.7e-13 Score=102.13 Aligned_cols=55 Identities=25% Similarity=0.370 Sum_probs=50.8
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc---eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG---EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~---~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||.. +|+|||+.++++||.+||++.|+++ .+.+.+|+|+.+++.||+||+.|
T Consensus 663 ~evGw~Y---GSvTEDv~TG~rlH~rGWrSvY~~p~r~af~G~AP~~L~~~L~Qr~RWA~G 720 (977)
T PLN02195 663 KEIGWIY---GSVTEDILTGFKMHCRGWRSIYCMPVRPAFKGSAPINLSDRLHQVLRWALG 720 (977)
T ss_pred hhcCeec---cceecHHHHHHHHHccCCcEEecCCccHHhcccCCCCHHHHHHHHHHHHhc
Confidence 4679999 8999999999999999999999854 36899999999999999999998
No 23
>PLN02638 cellulose synthase A (UDP-forming), catalytic subunit
Probab=99.35 E-value=7.2e-13 Score=103.02 Aligned_cols=55 Identities=24% Similarity=0.365 Sum_probs=50.6
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc-e--eeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG-E--MWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~-~--~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||.. +|+|||+.++++||.+||++.|+++ . +.+.+|+|+.+++.||+||+.|
T Consensus 767 ~evGw~Y---GSvTEDv~TG~rLH~rGWrSvY~~P~r~AF~GlAP~~l~d~L~Qr~RWA~G 824 (1079)
T PLN02638 767 SEIGWIY---GSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWALG 824 (1079)
T ss_pred hhcCeee---cceecHHHHHHHHHcCCCcEEecCCCchHhcCcCCCCHHHHHHHHHHHhhc
Confidence 4689999 8999999999999999999999943 3 6799999999999999999998
No 24
>PLN02915 cellulose synthase A [UDP-forming], catalytic subunit
Probab=99.34 E-value=1e-12 Score=101.95 Aligned_cols=55 Identities=25% Similarity=0.375 Sum_probs=50.4
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc---eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG---EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~---~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||-. +|+|||+.++++||.+||+..|+++ ...+.+|+++.+++.||+||+.|
T Consensus 731 ~evGw~Y---GSvTEDv~TG~rLH~rGWrSvY~~p~r~AF~GlAP~~L~d~L~Qr~RWA~G 788 (1044)
T PLN02915 731 KEIGWIY---GSVTEDILTGFKMHCRGWKSVYCMPKRPAFKGSAPINLSDRLHQVLRWALG 788 (1044)
T ss_pred HhhCccc---cccccHHHHHHHHHccCCcEEeeCCCcHHhcCcCCCCHHHHHHHHHHHhhh
Confidence 4689999 8999999999999999999999943 25799999999999999999998
No 25
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=99.33 E-value=1.9e-12 Score=97.45 Aligned_cols=54 Identities=24% Similarity=0.414 Sum_probs=49.6
Q ss_pred hccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc--eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 8 QVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG--EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 8 ~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~--~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
.+||+. ++++||++++.+|+++||++.|++. .+++++|+|++++++||.||++|
T Consensus 320 ~~p~~~---~~LseD~~~a~~l~~~GyrV~~~pd~~~~~ee~P~tl~~~~~qr~RW~~G 375 (691)
T PRK05454 320 RGPFGG---HILSHDFVEAALMRRAGWGVWLAPDLPGSYEELPPNLLDELKRDRRWCQG 375 (691)
T ss_pred cCCCCC---CcccHHHHHHHHHHHCCCEEEEcCccccccccCCCCHHHHHHHHHHHHhc
Confidence 357777 8999999999999999999999944 48999999999999999999998
No 26
>PLN02436 cellulose synthase A
Probab=99.31 E-value=1.7e-12 Score=101.03 Aligned_cols=55 Identities=25% Similarity=0.370 Sum_probs=50.4
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc-e--eeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG-E--MWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~-~--~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+.+||-- +|+|||+.++++||.+||++.|+.+ . ..+.+|+++.+++.||+||+.|
T Consensus 783 ~evGwiY---GSvTEDv~TG~rLH~rGWrSvY~~P~r~AF~GlAP~~L~d~L~Qr~RWA~G 840 (1094)
T PLN02436 783 KEIGWIY---GSVTEDILTGFKMHCHGWRSVYCIPKRPAFKGSAPINLSDRLHQVLRWALG 840 (1094)
T ss_pred HhhCeec---cceecHHHHHHHHHcCCCceEeCCCCchhhcCcCCCCHHHHHHHHHHHhhc
Confidence 4689999 8999999999999999999999843 3 5799999999999999999998
No 27
>PLN02190 cellulose synthase-like protein
Probab=99.29 E-value=2.1e-12 Score=97.94 Aligned_cols=55 Identities=20% Similarity=0.342 Sum_probs=50.3
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce---eeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE---MWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~---~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+++||.. +|+|||+.++++||.+||++.|+++. ..+.+|+++.+.+.||+||+.|
T Consensus 449 ~evG~~y---gSitED~~TGl~mh~rGWrSvY~~p~~~AFlG~aP~~l~~~L~Q~~RWa~G 506 (756)
T PLN02190 449 NTIGWLY---DSVAEDLNTSIGIHSRGWTSSYISPDPPAFLGSMPPGGPEAMVQQRRWATG 506 (756)
T ss_pred hccCccc---ceeechHHHHHHHHccCCceEecCCCchhhcCcCCCChHHHhhhhhhHhhh
Confidence 4588998 99999999999999999999998432 6899999999999999999998
No 28
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=99.28 E-value=6.6e-12 Score=79.26 Aligned_cols=61 Identities=13% Similarity=0.171 Sum_probs=51.9
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCC-eEEEe-c--ceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGF-SFNFV-E--GEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~-~~~~~-~--~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
..+++.+||+.-. ...+||.++++++.++|+ ++.++ . ..+++.+|.+++++++||+||++|
T Consensus 165 ~~~~~~ggf~~~~-~~~~eD~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~Rw~~g 229 (229)
T cd04192 165 EAFFEVGGFEGND-HIASGDDELLLAKVASKYPKVAYLKNPEALVTTQPVTSWKELLNQRKRWASK 229 (229)
T ss_pred HHHHHhcCCcccc-ccccCCHHHHHHHHHhCCCCEEEeeCcchheecCCchhHHHHHHHHHHhhcC
Confidence 4567889998521 577899999999999999 99886 2 348999999999999999999988
No 29
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=99.27 E-value=5.6e-12 Score=80.53 Aligned_cols=48 Identities=17% Similarity=0.441 Sum_probs=44.9
Q ss_pred CChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
..+.||.+|+.++.++||++.|. .+.++.++|.++++|++||.||.+|
T Consensus 183 ~~~~eD~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~q~~Rw~~~ 231 (235)
T cd06434 183 LNAGDDRFLTRYVLSHGYKTVYQYTSEAYTETPENYKKFLKQQLRWSRS 231 (235)
T ss_pred CCcCchHHHHHHHHHCCCeEEEecCCeEEEEcchhHHHHHHHhhhhhhc
Confidence 46899999999999999999998 5679999999999999999999987
No 30
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=99.25 E-value=1e-11 Score=78.67 Aligned_cols=61 Identities=20% Similarity=0.310 Sum_probs=51.4
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
-.+++.+||+. +...++||++|++++.++|+++.|+ +..++.+.|.+++++++||.||.+.
T Consensus 134 ~~~~~~ggf~~-~~~~~~eD~~l~~rl~~~G~~i~~~~~~~~~~~~~~~~~~~~~q~~rw~~~ 195 (196)
T cd02520 134 EVLDAIGGFEA-FADYLAEDYFLGKLIWRLGYRVVLSPYVVMQPLGSTSLASFWRRQLRWSRT 195 (196)
T ss_pred HHHHhccChHH-HhHHHHHHHHHHHHHHHcCCeEEEcchheeccCCcccHHHHHHHHHHHhcc
Confidence 34667788864 2135789999999999999999998 5568999999999999999999874
No 31
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=99.14 E-value=5.9e-11 Score=82.66 Aligned_cols=62 Identities=13% Similarity=0.147 Sum_probs=52.7
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
|..+|+.+||+.-. .+++||++++.++.++|+++.+.+ .......|+|++++++||.||.+.
T Consensus 209 R~~l~~iGGf~~~~-~~~~ED~~l~~~i~~~G~~v~~~~~~v~~~~~~~s~~~~~~q~~RW~r~ 271 (373)
T TIGR03472 209 RATLEAIGGLAALA-HHLADDYWLGELVRALGLRVVLAPVVVDTDVHETSFATLLAHELRWSRT 271 (373)
T ss_pred HHHHHHcCChHHhc-ccchHHHHHHHHHHHcCCeEEecchhhhcCCCccCHHHHHHHHHHHHhh
Confidence 45678889998421 678999999999999999999984 457788889999999999999875
No 32
>PF03552 Cellulose_synt: Cellulose synthase; InterPro: IPR005150 Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that synthesize cellulose in higher plants differ greatly from the well-characterised genes found in Acetobacter and Agrobacterium spp. More correctly designated as "cellulose synthase catalytic subunits", plant cellulose synthase (CesA) proteins are integral membrane proteins, approximately 1,000 amino acids in length. There are a number of highly conserved residues, including several motifs shown to be necessary for processive glycosyltransferase activity [].; GO: 0016760 cellulose synthase (UDP-forming) activity, 0030244 cellulose biosynthetic process, 0016020 membrane
Probab=99.08 E-value=6.8e-12 Score=94.81 Aligned_cols=55 Identities=29% Similarity=0.457 Sum_probs=50.4
Q ss_pred hhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc---eeeecCCCCHHHHHHHHHhhhcC
Q psy11718 7 KQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG---EMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 7 ~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~---~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
+++||-. +|+|||+-++++||.+|||+.|+.+ ...+.+|.++.+.+.|++||+.|
T Consensus 413 kevGwiY---GSvtEDv~TG~rmH~rGWrSvYc~p~r~AF~G~AP~nL~d~L~Q~~RWA~G 470 (720)
T PF03552_consen 413 KEVGWIY---GSVTEDVLTGFRMHCRGWRSVYCNPKRPAFLGSAPINLSDRLHQVKRWATG 470 (720)
T ss_pred cccceEE---EecccccccceeEeeCceeeEEeccccchhcccCCCChhhhceeeeeEeee
Confidence 4688888 8999999999999999999999943 37899999999999999999988
No 33
>PF03142 Chitin_synth_2: Chitin synthase; InterPro: IPR004835 Chitin synthase (2.4.1.16 from EC), also known as chitin-UDP acetyl-glucosaminyl transferase, is a plasma membrane-bound protein which catalyses the conversion of UDP-N-acettyl-D-glucosamine and {(1,4)-(N-acetyl- beta-D-glucosaminyl)}(N) to UDP and {(1,4)-(N-acetyl-beta-D- glucosaminyl)}(N+1). It plays a major role in cell wall biogenesis. ; GO: 0016758 transferase activity, transferring hexosyl groups
Probab=98.84 E-value=3.2e-09 Score=78.40 Aligned_cols=47 Identities=28% Similarity=0.634 Sum_probs=43.1
Q ss_pred ChhhHHHHHHHHHHC--CCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 18 SVAEDCYFAMLAYKQ--GFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 18 sltED~~l~~~l~~~--g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
.+.||-.|+.-|.++ +|+..|+ .+.|+|.+|++|+.|++||+||..|
T Consensus 324 ~lGEDR~LttLlLk~~~~~k~~y~~~A~a~T~aP~t~~vflsQRRRWinS 373 (527)
T PF03142_consen 324 DLGEDRWLTTLLLKQFPGYKTEYVPSAVAYTDAPETFSVFLSQRRRWINS 373 (527)
T ss_pred hcchhHHHHHHHHhhCCCceEEEcccccccccCCccHHHHHHHhhhccch
Confidence 589999999888887 7999998 7889999999999999999999876
No 34
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=98.83 E-value=1.2e-08 Score=65.89 Aligned_cols=53 Identities=13% Similarity=0.244 Sum_probs=48.4
Q ss_pred cCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 10 SFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 10 gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
+|+. ..+.||++++.++..+|+++.++ ++.++...|.++.++++|+.||.+|.
T Consensus 195 ~~~~---~~~~eD~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~g~ 248 (251)
T cd06439 195 PLPA---DTINDDFVLPLRIARQGYRVVYEPDAVAYEEVAEDGSEEFRRRVRIAAGN 248 (251)
T ss_pred CCCc---ccchhHHHHHHHHHHcCCeEEeccccEEEEeCcccHHHHHHHHHHHHhcc
Confidence 6766 78899999999999999999998 46699999999999999999999984
No 35
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=98.81 E-value=1.5e-08 Score=64.64 Aligned_cols=60 Identities=18% Similarity=0.096 Sum_probs=51.0
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ceeeecCCCCHHHHHHHHHhhhcCC
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEMWEKSPFSLWDFVQQRKRWLQGY 65 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~~~~~P~~~~~~~~QR~RW~~G~ 65 (65)
.+.+.++|+.. ....||.+|+.++.++|+++.+++ ..+....|.+++.+++|+.||.+|.
T Consensus 167 ~~~~~g~~~~~--~~~~eD~~l~~r~~~~G~~~~~~~~~~~~~~~~~s~~~~~~~~~r~~~~~ 227 (249)
T cd02525 167 VFEKVGGFDES--LVRNEDAELNYRLRKAGYKIWLSPDIRVYYYPRSTLKKLARQYFRYGKWR 227 (249)
T ss_pred HHHHhCCCCcc--cCccchhHHHHHHHHcCcEEEEcCCeEEEEcCCCCHHHHHHHHHHHhhhh
Confidence 34566778873 245799999999999999999984 6688899999999999999999884
No 36
>PF13506 Glyco_transf_21: Glycosyl transferase family 21
Probab=98.64 E-value=4.3e-08 Score=62.79 Aligned_cols=60 Identities=20% Similarity=0.325 Sum_probs=49.7
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ceeeecCC----CCHHHHHHHHHhhhc
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEMWEKSP----FSLWDFVQQRKRWLQ 63 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~~~~~P----~~~~~~~~QR~RW~~ 63 (65)
.++++.+|++. ..+.++||+.|+.+++++|+++.+.+ +.+.+..| .+|++++++..||++
T Consensus 111 ~~L~~~GG~~~-l~~~ladD~~l~~~~~~~G~~v~~~~~~v~~~~~~~~~~~s~~~~~~r~~RW~r 175 (175)
T PF13506_consen 111 EALEEIGGFEA-LADYLADDYALGRRLRARGYRVVLSPYPVVQTSVPRTLEDSFRDFFRRQLRWAR 175 (175)
T ss_pred HHHHHcccHHH-HhhhhhHHHHHHHHHHHCCCeEEEcchheeecccCccccccHHHHHHHHHhhcC
Confidence 46677778764 33699999999999999999999986 55666666 589999999999985
No 37
>KOG2571|consensus
Probab=98.42 E-value=2.5e-07 Score=71.53 Aligned_cols=48 Identities=29% Similarity=0.650 Sum_probs=45.8
Q ss_pred CChhhHHHHHHHHHHCCCeEEEe-cceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFV-EGEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~-~~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
.++.||--|+.++..+||++.|+ .+.+.+++|+++..++.||+||..|
T Consensus 548 ~~~geDR~L~~~llskgy~l~Y~a~s~a~t~~Pe~~~efl~QrrRW~~s 596 (862)
T KOG2571|consen 548 YSLGEDRWLCTLLLSKGYRLKYVAASDAETEAPESFLEFLNQRRRWLNS 596 (862)
T ss_pred cccchhHHHHHHHHhccceeeeeccccccccCcHhHHHHHHHhhhhccc
Confidence 57999999999999999999999 6789999999999999999999988
No 38
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=97.94 E-value=1.2e-05 Score=56.32 Aligned_cols=61 Identities=13% Similarity=0.046 Sum_probs=48.9
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe-cc-eeeecCCCCHHHHHHHHHhhhc
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV-EG-EMWEKSPFSLWDFVQQRKRWLQ 63 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~-~~-~~~~~~P~~~~~~~~QR~RW~~ 63 (65)
|...++.+||+.-. ..+.||++|+.++.++|+++.+. .. .+...+.++++++++|+.||..
T Consensus 218 r~~~~~vGGf~~~~-~~~~ED~~L~~r~~~~G~~v~~~~~~~~~s~r~~~~~~~~~~~~~r~~~ 280 (384)
T TIGR03469 218 REALERIGGIAAIR-GALIDDCTLAAAVKRSGGRIWLGLAARTRSLRPYDGLGEIWRMIARTAY 280 (384)
T ss_pred HHHHHHcCCHHHHh-hCcccHHHHHHHHHHcCCcEEEEecCceEEEEecCCHHHHHHHHHHhHH
Confidence 34577889998521 46899999999999999999887 33 3566777899999999999863
No 39
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=96.84 E-value=0.0042 Score=41.65 Aligned_cols=41 Identities=17% Similarity=0.164 Sum_probs=30.9
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-cee
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEM 43 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~ 43 (65)
...++.+|||.++...-.||.||++|+..+|+++.+++ +.+
T Consensus 181 ~~~~~vGgfDe~~~~~~~ED~Dl~~R~~~~G~~i~~~p~a~v 222 (299)
T cd02510 181 EWFLELGGYDEGMDIWGGENLELSFKVWQCGGSIEIVPCSRV 222 (299)
T ss_pred HHHHHhCCCCCcccccCchhHHHHHHHHHcCCeEEEeeccEE
Confidence 34567789999642222599999999999999999884 443
No 40
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=96.28 E-value=0.0056 Score=38.89 Aligned_cols=45 Identities=18% Similarity=0.159 Sum_probs=33.5
Q ss_pred hhhhhhccCCCCCCCCh--hhHHHHHHHHHHCCCeEEEec-ceeeecCCCC
Q psy11718 3 VSAEKQVSFDNGLDGSV--AEDCYFAMLAYKQGFSFNFVE-GEMWEKSPFS 50 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sl--tED~~l~~~l~~~g~~~~~~~-~~~~~~~P~~ 50 (65)
..+++.+|||. ... .||+++++|+.++|+++.+++ +.++-..+.+
T Consensus 161 ~~~~~~ggfd~---~~~~~~eD~d~~~r~~~~G~~~~~~~~~~v~h~~~~~ 208 (237)
T cd02526 161 EALEKVGGFDE---DLFIDYVDTEWCLRARSKGYKIYVVPDAVLKHELGDK 208 (237)
T ss_pred HHHHHhCCCCH---HHcCccchHHHHHHHHHcCCcEEEEcCeEEEecccCc
Confidence 45677889997 332 589999999999999999985 4455444444
No 41
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=96.23 E-value=0.0055 Score=37.96 Aligned_cols=38 Identities=11% Similarity=0.098 Sum_probs=30.3
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE 42 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~ 42 (65)
..+++.+||+.+. ...||++|.++|..+|+++.|++..
T Consensus 159 ~~~~~iggf~~~~--~~~eD~~l~~rl~~~g~~~~~~~~~ 196 (202)
T cd04184 159 SLVRQVGGFREGF--EGAQDYDLVLRVSEHTDRIAHIPRV 196 (202)
T ss_pred HHHHHhCCCCcCc--ccchhHHHHHHHHhccceEEEccHh
Confidence 3456778998843 4679999999999999999998543
No 42
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=96.22 E-value=0.0078 Score=37.74 Aligned_cols=32 Identities=16% Similarity=0.062 Sum_probs=27.8
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEE
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNF 38 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~ 38 (65)
..++.+||+. ....||++|++++.+.|+++.+
T Consensus 147 ~~~~~G~fd~---~~~~ED~d~~~r~~~~G~~~~~ 178 (221)
T cd02522 147 LFEELGGFPE---LPLMEDVELVRRLRRRGRPALL 178 (221)
T ss_pred HHHHhCCCCc---cccccHHHHHHHHHhCCCEEEc
Confidence 4567789999 5589999999999999999876
No 43
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=96.06 E-value=0.0083 Score=37.18 Aligned_cols=35 Identities=11% Similarity=0.073 Sum_probs=30.5
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE 40 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~ 40 (65)
..+++.+||+. ....||+++..++..+|+++.+++
T Consensus 160 ~~~~~~g~~~~---~~~~eD~~~~~r~~~~g~~~~~~~ 194 (201)
T cd04195 160 SKVLAVGGYQD---LPLVEDYALWARMLANGARFANLP 194 (201)
T ss_pred HHHHHcCCcCC---CCCchHHHHHHHHHHcCCceeccc
Confidence 34677789988 799999999999999999999974
No 44
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=95.93 E-value=0.016 Score=35.23 Aligned_cols=38 Identities=21% Similarity=0.237 Sum_probs=29.1
Q ss_pred hhhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe
Q psy11718 2 KVSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV 39 (65)
Q Consensus 2 ~~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~ 39 (65)
|..+++.+||+..+.....||++++.++.++|++...+
T Consensus 135 r~~~~~~ggf~~~~~~~~~eD~~l~~r~~~~g~~~~~~ 172 (182)
T cd06420 135 KKDLLAVNGFDEEFTGWGGEDSELVARLLNSGIKFRKL 172 (182)
T ss_pred HHHHHHhCCCCcccccCCcchHHHHHHHHHcCCcEEEe
Confidence 34567788999843233479999999999999877665
No 45
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=95.80 E-value=0.021 Score=36.17 Aligned_cols=37 Identities=24% Similarity=0.208 Sum_probs=29.4
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
..++.+||+..+ ....||++|..++..+|+++.+++.
T Consensus 167 ~~~~~g~f~~~~-~~~~eD~~l~~r~~~~g~~i~~~~~ 203 (219)
T cd06913 167 WFSHVGPFDEGG-KGVPEDLLFFYEHLRKGGGVYRVDR 203 (219)
T ss_pred HHhhcCCccchh-ccchhHHHHHHHHHHcCCceEEEcc
Confidence 456778898732 2456999999999999999999854
No 46
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=95.80 E-value=0.016 Score=39.33 Aligned_cols=37 Identities=27% Similarity=0.310 Sum_probs=31.9
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE 40 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~ 40 (65)
-++++.++||... .--.||.|++.|+.+.||++.+++
T Consensus 177 ~~~~~vG~~de~~-F~y~eD~D~~~R~~~~G~~i~~~p 213 (305)
T COG1216 177 EAFEKVGGFDERF-FIYYEDVDLCLRARKAGYKIYYVP 213 (305)
T ss_pred HHHHHhCCCCccc-ceeehHHHHHHHHHHcCCeEEEee
Confidence 3567788899865 788999999999999999999984
No 47
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=95.64 E-value=0.043 Score=36.33 Aligned_cols=37 Identities=14% Similarity=0.053 Sum_probs=27.7
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
.+++.+++|... .-=.||.|+++|+.++|+++.+++.
T Consensus 159 ~~~~iG~fde~~-fi~~~D~e~~~R~~~~G~~i~~~~~ 195 (281)
T TIGR01556 159 VYQRLGMMDEEL-FIDHVDTEWSLRAQNYGIPLYIDPD 195 (281)
T ss_pred HHHHhCCccHhh-cccchHHHHHHHHHHCCCEEEEeCC
Confidence 355667788732 1225899999999999999998853
No 48
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=95.59 E-value=0.021 Score=33.71 Aligned_cols=38 Identities=21% Similarity=0.210 Sum_probs=29.1
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE 42 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~ 42 (65)
.+++.+||+..+ ....||.++++++.++|+++.+++..
T Consensus 123 ~~~~~~~~~~~~-~~~~eD~~~~~~~~~~g~~i~~~~~~ 160 (166)
T cd04186 123 VFEEVGGFDEDF-FLYYEDVDLCLRARLAGYRVLYVPQA 160 (166)
T ss_pred HHHHcCCCChhh-hccccHHHHHHHHHHcCCeEEEccce
Confidence 455667888733 23669999999999999999988543
No 49
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ].
Probab=95.05 E-value=0.042 Score=37.02 Aligned_cols=40 Identities=23% Similarity=0.189 Sum_probs=32.7
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE 42 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~ 42 (65)
...++.+|||..+.+-=.||.||+.||.+.|+.+.+.+..
T Consensus 178 ~~f~~iGGfDE~f~G~G~ED~D~~~RL~~~~~~~~~~~~~ 217 (281)
T PF10111_consen 178 EDFLEIGGFDERFRGWGYEDIDFGYRLKKAGYKFKRSPDY 217 (281)
T ss_pred HHHHHhCCCCccccCCCcchHHHHHHHHHcCCcEecChHH
Confidence 3456789999976666689999999999999999887443
No 50
>COG2943 MdoH Membrane glycosyltransferase [Cell envelope biogenesis, outer membrane]
Probab=94.96 E-value=0.055 Score=41.44 Aligned_cols=48 Identities=23% Similarity=0.391 Sum_probs=40.7
Q ss_pred CChhhHHHHHHHHHHCCCeEEEec--ceeeecCCCCHHHHHHHHHhhhcC
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVE--GEMWEKSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~--~~~~~~~P~~~~~~~~QR~RW~~G 64 (65)
.-+.-|+-=+--|.+.||.+.... .-+|+|.|.|+-|+.+-=+||++|
T Consensus 346 ~ilSHDfvEAALmRRaGW~v~ia~dL~GSyEE~PpnLlD~l~RDRRWC~G 395 (736)
T COG2943 346 HILSHDFVEAALMRRAGWGVWIAYDLDGSYEELPPNLLDELKRDRRWCHG 395 (736)
T ss_pred cccchHHHHHHHHhhcCceEEEeccCCCchhhCCchHHHHHhhhhHhhhc
Confidence 667788777777778899997763 349999999999999999999998
No 51
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=94.39 E-value=0.063 Score=32.54 Aligned_cols=36 Identities=22% Similarity=0.282 Sum_probs=27.7
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
.+++.++|+.+. ...||.++.+++.+.|+++.+++.
T Consensus 153 ~~~~~~~f~~~~--~~~~D~~~~~r~~~~g~~~~~~~~ 188 (202)
T cd06433 153 LFEKYGGFDESY--RIAADYDLLLRLLLAGKIFKYLPE 188 (202)
T ss_pred HHHHhCCCchhh--CchhhHHHHHHHHHcCCceEecch
Confidence 344555687632 578999999999999999988743
No 52
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=93.75 E-value=0.14 Score=31.80 Aligned_cols=42 Identities=21% Similarity=0.181 Sum_probs=28.4
Q ss_pred hhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ceeeecCC
Q psy11718 5 AEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GEMWEKSP 48 (65)
Q Consensus 5 ~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~~~~~~P 48 (65)
+++.++++... ....||.++++++..+|+++ |++ +.++-..+
T Consensus 133 ~~~~g~~~~~~-~~~~eD~~~~~r~~~~G~~i-~~~~~~~~h~~~ 175 (202)
T cd04185 133 VEKIGLPDKEF-FIWGDDTEYTLRASKAGPGI-YVPDAVVVHKTA 175 (202)
T ss_pred HHHhCCCChhh-hccchHHHHHHHHHHcCCcE-EecceEEEEccc
Confidence 44455566522 24579999999999999999 874 44444333
No 53
>cd00899 b4GalT Beta-4-Galactosyltransferase is involved in the formation of the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. Beta-4-Galactosyltransferase transfers galactose from uridine diphosphogalactose to the terminal beta-N-acetylglucosamine residues, hereby forming the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. At least seven homologous beta-4-galactosyltransferase isoforms have been identified that use different types of glycoproteins and glycolipids as substrates. Of the seven identified members of the beta-1,4-galactosyltransferase subfamily (beta1,4-Gal-T1 to -T7), b1,4-Gal-T1 is most characterized (biochemically). It is a Golgi-resident type II membrane enzyme with a cytoplasmic domain, membrane spanning region, and a stem region and catalytic domain facing the lumen.
Probab=93.00 E-value=0.14 Score=34.52 Aligned_cols=60 Identities=18% Similarity=0.061 Sum_probs=41.8
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce-e------------eecCCCCHHHHHHHHHhhh
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE-M------------WEKSPFSLWDFVQQRKRWL 62 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~-~------------~~~~P~~~~~~~~QR~RW~ 62 (65)
...++..|||+++-+-=.||-||..||..+|.++...+.. . ....|.-+..+..+++||+
T Consensus 121 ~~f~~VNGf~n~f~GWGgEDdd~~~Rl~~~g~~~~r~~~~~~~~~hL~H~~~~r~~~N~~r~~~l~~~~~~~~ 193 (219)
T cd00899 121 EQFRKVNGFSNAYWGWGGEDDDLYNRIKAAGLKITRPSGDTGRYKMIRHIHDKRNRDNPNRFALLQNSRERDH 193 (219)
T ss_pred HHHHHhCCcCCcCccCCcchHHHHHHHHHCCCeEEeccCcccceeeeecCCCcccccCHHHHHHHHhhCeEec
Confidence 3456778888866555579999999999999987655321 1 2344555666777787775
No 54
>PF02709 Glyco_transf_7C: N-terminal domain of galactosyltransferase; InterPro: IPR003859 This is a family of galactosyltransferases from a wide range of metazoa with three related galactosyltransferase activities; all three of which are possessed by one sequence in some cases. The three functions are N-acetyllactosamine synthase (2.4.1.90 from EC); beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase (2.4.1.38 from EC); and lactose synthase (2.4.1.22 from EC). Note that N-acetyllactosamine synthase is a component of lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin N-acetyllactosamine synthase is used.; GO: 0016757 transferase activity, transferring glycosyl groups, 0005975 carbohydrate metabolic process; PDB: 2AGD_B 3EE5_A 2AE7_B 2AEC_A 2FYA_A 2AES_B 2AH9_A 2FYB_A 2FY7_A 3LW6_A ....
Probab=92.64 E-value=0.19 Score=28.51 Aligned_cols=37 Identities=27% Similarity=0.182 Sum_probs=24.3
Q ss_pred hhhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEe
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFV 39 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~ 39 (65)
...++..|||+.+.+-=-||.||..||...|.++..+
T Consensus 29 ~~f~~vnGfde~f~gWG~ED~Dl~~Rl~~~g~~~~~~ 65 (78)
T PF02709_consen 29 EDFEKVNGFDERFWGWGGEDDDLYNRLWKAGLKIVRV 65 (78)
T ss_dssp HHHHHTTSS-SS-TSCSSHHHHHHHHHHHTT---B-S
T ss_pred HHHHHcCCCCccccccCccHHHHHHHHHHcCCeEEec
Confidence 4567788999854343359999999999999998665
No 55
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=91.69 E-value=0.052 Score=33.60 Aligned_cols=20 Identities=25% Similarity=0.318 Sum_probs=15.2
Q ss_pred hhhhhhccCCCCCCCChhhHHHH
Q psy11718 3 VSAEKQVSFDNGLDGSVAEDCYF 25 (65)
Q Consensus 3 ~~~e~~~gw~~g~~~sltED~~l 25 (65)
.++-+.+||+. .+++||++|
T Consensus 164 r~~l~~~g~~~---~~l~ED~~~ 183 (183)
T cd06438 164 WAVLRQAPWAA---HSLTEDLEF 183 (183)
T ss_pred HHHHHhCCCCC---CCcccccCC
Confidence 34444499998 999999975
No 56
>PRK10063 putative glycosyl transferase; Provisional
Probab=90.20 E-value=0.91 Score=30.16 Aligned_cols=37 Identities=30% Similarity=0.409 Sum_probs=28.1
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE 42 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~ 42 (65)
++.+.++||.. ....||+++.+++..+|+++.+++..
T Consensus 155 ~~~~~~~fd~~--~~~~~Dydl~lrl~~~g~~~~~v~~~ 191 (248)
T PRK10063 155 SGLKKWRYDLQ--YKVSSDYALAARLYKAGYAFKKLNGL 191 (248)
T ss_pred HHHhcCCCCcc--cchHHhHHHHHHHHHcCCcEEEcCce
Confidence 34445678762 35679999999999999999888543
No 57
>KOG2547|consensus
Probab=86.07 E-value=0.47 Score=34.89 Aligned_cols=59 Identities=15% Similarity=0.334 Sum_probs=41.8
Q ss_pred hhhhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEecce-eeecCCCCHHHHHHHHHhhhc
Q psy11718 4 SAEKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE-MWEKSPFSLWDFVQQRKRWLQ 63 (65)
Q Consensus 4 ~~e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~-~~~~~P~~~~~~~~QR~RW~~ 63 (65)
++...+|... ....++||+.++..+..+||+..+.... -.+-+-.+...+.+|=.||..
T Consensus 255 ~ld~~ggi~~-f~~yLaedyFaaksllSRG~ksaist~palQnSas~~mssf~~Ri~rwvk 314 (431)
T KOG2547|consen 255 ALDECGGISA-FGGYLAEDYFAAKSLLSRGWKSAISTHPALQNSASVTMSSFLDRIIRWVK 314 (431)
T ss_pred HHHHhccHHH-HHHHHHHHHHHHHHHHhhhhhhhhcccchhhhhhhhHHHHHHHHHHHhhh
Confidence 3444555432 2268999999999999999999888544 444555677777777778853
No 58
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=83.10 E-value=2.8 Score=26.04 Aligned_cols=25 Identities=12% Similarity=-0.033 Sum_probs=21.1
Q ss_pred CChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
....+|+++.+++...|+++.+++.
T Consensus 173 ~~~~~~~~l~~~~~~~g~~i~~~p~ 197 (224)
T cd06442 173 KGYKFQLELLVRARRLGYRIVEVPI 197 (224)
T ss_pred CCcEEeHHHHHHHHHcCCeEEEeCe
Confidence 5667789999999999999988753
No 59
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=73.58 E-value=4.7 Score=25.17 Aligned_cols=24 Identities=25% Similarity=0.252 Sum_probs=20.4
Q ss_pred CChhhHHHHHHHHHHCCCeEEEec
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVE 40 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~ 40 (65)
.-..+|+|+.+++.++|+++.+++
T Consensus 178 ~~~~~d~el~~r~~~~g~~~~~vp 201 (211)
T cd04188 178 ERWAFDVELLVLARRLGYPIEEVP 201 (211)
T ss_pred cceEeeHHHHHHHHHcCCeEEEcC
Confidence 345689999999999999998865
No 60
>TIGR00620 sporelyase spore photoproduct lyase. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=70.89 E-value=8.5 Score=25.69 Aligned_cols=27 Identities=15% Similarity=0.249 Sum_probs=24.4
Q ss_pred CChhhHHHHHHHHHHCCCeEEEe-ccee
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFV-EGEM 43 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~-~~~~ 43 (65)
.++.|=++-+-+++.+||++.++ +|++
T Consensus 68 ~sl~~Rl~Aa~k~a~aGy~Vg~~~~PIi 95 (199)
T TIGR00620 68 SPLDKRIEAAVKVAKAGYPLGFIIAPIY 95 (199)
T ss_pred CCHHHHHHHHHHHHHcCCeEEEEeeceE
Confidence 78999999999999999999997 7774
No 61
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=66.94 E-value=18 Score=23.29 Aligned_cols=25 Identities=12% Similarity=0.054 Sum_probs=21.3
Q ss_pred CChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
....+|+|+.+++...|+++.+++.
T Consensus 188 ~~~~~~~el~~~~~~~g~~i~~vp~ 212 (243)
T PLN02726 188 KGYVFQMEIIVRASRKGYRIEEVPI 212 (243)
T ss_pred CCcEEehHHHHHHHHcCCcEEEeCc
Confidence 4567799999999999999998754
No 62
>PRK10018 putative glycosyl transferase; Provisional
Probab=65.66 E-value=12 Score=25.52 Aligned_cols=44 Identities=16% Similarity=-0.039 Sum_probs=29.7
Q ss_pred hhhccCCCCCCCChhhHHHHHHHHHHCCCeEEEec-ce-eeecCCCCH
Q psy11718 6 EKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFNFVE-GE-MWEKSPFSL 51 (65)
Q Consensus 6 e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~~~~-~~-~~~~~P~~~ 51 (65)
.+.++||.. -...||++|-+++..++....+++ +. .+-.-+.+.
T Consensus 167 ~~~~~fd~~--~~~~eDydlwlrl~~~~~~~~~~~~~l~~y~~~~~s~ 212 (279)
T PRK10018 167 FKECLFDTE--LKAAQDYDIFLRMVVEYGEPWKVEEATQILHINHGEM 212 (279)
T ss_pred hhhcccCCC--CCccccHHHHHHHHHhcCceEeeccceEEEEcCCCCc
Confidence 345678763 356899999999999888776663 32 344444444
No 63
>PRK10073 putative glycosyl transferase; Provisional
Probab=64.13 E-value=10 Score=26.20 Aligned_cols=29 Identities=14% Similarity=0.246 Sum_probs=23.5
Q ss_pred cCCCCCCCChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 10 SFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 10 gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
.|.. +...||+.+..++...+.++.|++.
T Consensus 182 ~f~~---~~~~eD~~~~~~~~~~~~~v~~~~~ 210 (328)
T PRK10073 182 KFEP---GLHHQDIPWTTEVMFNALRVRYTEQ 210 (328)
T ss_pred ccCC---CCEeccHHHHHHHHHHCCEEEEECC
Confidence 3555 5557999999999999999998854
No 64
>PF13712 Glyco_tranf_2_5: Glycosyltransferase like family; PDB: 2QGI_A 2NXV_B.
Probab=59.91 E-value=19 Score=23.65 Aligned_cols=44 Identities=18% Similarity=0.055 Sum_probs=20.9
Q ss_pred cCCCC-CCCChhhHHHHHHHHHHCCCeEEEecceeeecCCCCHHH
Q psy11718 10 SFDNG-LDGSVAEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWD 53 (65)
Q Consensus 10 gw~~g-~~~sltED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~ 53 (65)
+||.. ..+-.--|+++++++.++|+++...+..|.-.+..++.+
T Consensus 160 ~fde~~~~gfH~Ydvd~cl~~~~~G~~v~~~~~~~~H~s~g~~~~ 204 (217)
T PF13712_consen 160 RFDEDLFTGFHFYDVDQCLEARRAGYRVVVPPPWCIHFSGGSFDE 204 (217)
T ss_dssp ----SS--SSSSHHHHHHHHHHHTT-EEEE-----EE-S----SH
T ss_pred CCCccccCCcchHHHHHHHHHHHhCCEEEecCceEEEcCCCCccH
Confidence 37764 234556899999999999999955455566655555443
No 65
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=58.91 E-value=14 Score=22.46 Aligned_cols=34 Identities=21% Similarity=0.280 Sum_probs=24.9
Q ss_pred hhhhhccCCCCCCC-ChhhHHHHHHHHHHCCCeEEEecc
Q psy11718 4 SAEKQVSFDNGLDG-SVAEDCYFAMLAYKQGFSFNFVEG 41 (65)
Q Consensus 4 ~~e~~~gw~~g~~~-sltED~~l~~~l~~~g~~~~~~~~ 41 (65)
.+++.++++. . ...||+++...+.. +.++.+++.
T Consensus 163 ~~~~~~~~~~---~~~~~~D~~~~~~~~~-~~~~~~~~~ 197 (214)
T cd04196 163 LLELALPFPD---ADVIMHDWWLALLASA-FGKVVFLDE 197 (214)
T ss_pred HHHhhccccc---cccccchHHHHHHHHH-cCceEEcch
Confidence 3555667776 5 68999999998766 668877743
No 66
>COG3162 Predicted membrane protein [Function unknown]
Probab=55.82 E-value=8.2 Score=23.39 Aligned_cols=19 Identities=16% Similarity=0.309 Sum_probs=16.0
Q ss_pred eecCCCCHHHHHHHHHhhh
Q psy11718 44 WEKSPFSLWDFVQQRKRWL 62 (65)
Q Consensus 44 ~~~~P~~~~~~~~QR~RW~ 62 (65)
..+++..|+++.+||+|-+
T Consensus 8 ~i~a~p~f~eLv~kr~~Fa 26 (102)
T COG3162 8 RIAANPRFRELVRKRRRFA 26 (102)
T ss_pred ccccCHhHHHHHHHHHHHH
Confidence 4578899999999999854
No 67
>smart00265 BH4 BH4 Bcl-2 homology region 4.
Probab=55.01 E-value=12 Score=17.49 Aligned_cols=18 Identities=11% Similarity=0.239 Sum_probs=14.5
Q ss_pred CChhhHHHHHHHHHHCCCe
Q psy11718 17 GSVAEDCYFAMLAYKQGFS 35 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~ 35 (65)
..|++|+ +..||..+||.
T Consensus 6 RelV~~y-v~yKLsQrgy~ 23 (27)
T smart00265 6 RELVVDY-VTYKLSQNGYE 23 (27)
T ss_pred HHHHHHH-HHHHHhhcCCC
Confidence 6788886 78889888875
No 68
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=53.45 E-value=11 Score=20.80 Aligned_cols=31 Identities=29% Similarity=0.192 Sum_probs=20.1
Q ss_pred hhhccCCCCCCCChhhHHHHHHHHHHCCCeEE
Q psy11718 6 EKQVSFDNGLDGSVAEDCYFAMLAYKQGFSFN 37 (65)
Q Consensus 6 e~~~gw~~g~~~sltED~~l~~~l~~~g~~~~ 37 (65)
++.++++... ....||.++..++...|+.+.
T Consensus 125 ~~~~~~~~~~-~~~~ed~~~~~~~~~~g~~~~ 155 (156)
T cd00761 125 EEIGGFDEAL-LSGEEDDDFLLRLLRGGKVAF 155 (156)
T ss_pred HHhCCcchHh-cCCcchHHHHHHHHhhccccc
Confidence 4455665511 112599999999988877653
No 69
>COG2028 Uncharacterized conserved protein [Function unknown]
Probab=51.81 E-value=6.2 Score=24.94 Aligned_cols=19 Identities=32% Similarity=0.602 Sum_probs=16.8
Q ss_pred cCCCCHHHHHHHHHhhhcC
Q psy11718 46 KSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 46 ~~P~~~~~~~~QR~RW~~G 64 (65)
..|+.++.+.+-|+||..+
T Consensus 80 LT~eElkkY~ks~~rWg~r 98 (145)
T COG2028 80 LTPEELKKYMKSRKRWGSR 98 (145)
T ss_pred cCHHHHHHHHHHHHHhccc
Confidence 5789999999999999874
No 70
>PRK10969 DNA polymerase III subunit theta; Reviewed
Probab=43.36 E-value=9.8 Score=21.87 Aligned_cols=20 Identities=15% Similarity=0.177 Sum_probs=16.6
Q ss_pred eeecCCCCHHHHHHHHHhhh
Q psy11718 43 MWEKSPFSLWDFVQQRKRWL 62 (65)
Q Consensus 43 ~~~~~P~~~~~~~~QR~RW~ 62 (65)
+-.+.|+.+++||++|..-.
T Consensus 40 ve~eqPe~lR~yFreRL~~y 59 (75)
T PRK10969 40 VEREQPEHLRSYFRERLIAY 59 (75)
T ss_pred HHHhCCHHHHHHHHHHHHHH
Confidence 66679999999999997543
No 71
>PF10126 Nit_Regul_Hom: Uncharacterized protein, homolog of nitrogen regulatory protein PII; InterPro: IPR019296 This family consists of various hypothetical archaeal proteins. It includes a putative nitrogen regulatory protein PII homolog.
Probab=41.14 E-value=27 Score=21.42 Aligned_cols=46 Identities=11% Similarity=0.052 Sum_probs=35.1
Q ss_pred CCCCCCCChhhHHHHHHHHHHC-CCeEEEecceeeecCCCCHHHHHHHHH
Q psy11718 11 FDNGLDGSVAEDCYFAMLAYKQ-GFSFNFVEGEMWEKSPFSLWDFVQQRK 59 (65)
Q Consensus 11 w~~g~~~sltED~~l~~~l~~~-g~~~~~~~~~~~~~~P~~~~~~~~QR~ 59 (65)
|.. .++.||.+.++++.+. .-....+..++-.+..+++...+++|.
T Consensus 44 wkg---f~l~EDpe~ai~~I~d~s~~aV~I~TVV~~~~~~~i~~~i~ekL 90 (110)
T PF10126_consen 44 WKG---FLLDEDPEMAIKAINDLSENAVLIGTVVDEEKVEKIEKLIKEKL 90 (110)
T ss_pred hcC---cccccCHHHHHHHHHHhccCcEEEEEEECHHHHHHHHHHHHHHh
Confidence 555 6789999999999875 445555677777777788888888774
No 72
>PF06440 DNA_pol3_theta: DNA polymerase III, theta subunit; InterPro: IPR009052 This entry represents the theta subunit of DNA polymerase III from bacteria, whose core structure consists of an irregular array of three helices []. DNA polymerase III (Pol III) is the primary enzyme responsible for replication of Escherichia coli chromosomal DNA. The holoenzyme consists of 17 proteins and contains two core polymerases. The Pol III catalytic core has three tightly associated subunits: alpha, epsilon and theta. The alpha subunit is responsible for the DNA polymerase activity, while the epsilon subunit is the 3'-5' proofreading exonuclease. The epsilon subunit binds to both the alpha and theta subunits in the linear order alpha-epsilon-theta. The theta subunit is the smallest, and may act to enhance the proofreading activity of epsilon, especially under extreme conditions []. This entry also includes a homologue of polymerase III theta called HOT (homologue of theta) from Bacteriophage P1. HOT contains three alpha-helices, as reported for theta, but the folding topology of the two is different, which could account for the suggested greater heat stability of HOT as compared to theta [].; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication; PDB: 1SE7_A 2IDO_D 2AE9_A 1DU2_A 2XY8_B 2AXD_S.
Probab=38.67 E-value=18 Score=20.82 Aligned_cols=21 Identities=14% Similarity=0.382 Sum_probs=16.2
Q ss_pred ceeeecCCCCHHHHHHHHHhh
Q psy11718 41 GEMWEKSPFSLWDFVQQRKRW 61 (65)
Q Consensus 41 ~~~~~~~P~~~~~~~~QR~RW 61 (65)
..+-.+.|+.+++||++|..-
T Consensus 38 e~VereqPe~LR~YFreRL~~ 58 (75)
T PF06440_consen 38 EQVEREQPEHLREYFRERLNY 58 (75)
T ss_dssp HHHHHCS-GGGHHHHHHHHHH
T ss_pred HHHHHHCcHHHHHHHHHHHHH
Confidence 346778899999999999753
No 73
>PF08235 LNS2: LNS2 (Lipin/Ned1/Smp2); InterPro: IPR013209 This domain is found in Saccharomyces cerevisiae (Baker's yeast) protein SMP2, proteins with an N-terminal lipin domain (IPR007651 from INTERPRO) and phosphatidylinositol transfer proteins []. SMP2 is involved in plasmid maintenance and respiration []. Lipin proteins are involved in adipose tissue development and insulin resistance [].
Probab=37.62 E-value=29 Score=22.30 Aligned_cols=24 Identities=4% Similarity=0.086 Sum_probs=18.3
Q ss_pred HHHHHHHHHCCCeEEEecceeeec
Q psy11718 23 CYFAMLAYKQGFSFNFVEGEMWEK 46 (65)
Q Consensus 23 ~~l~~~l~~~g~~~~~~~~~~~~~ 46 (65)
.++-.+++.+||++.|+.+....+
T Consensus 33 ~~l~~~i~~~GY~ilYlTaRp~~q 56 (157)
T PF08235_consen 33 AELYRKIADNGYKILYLTARPIGQ 56 (157)
T ss_pred HHHHHHHHHCCeEEEEECcCcHHH
Confidence 567788889999999996654333
No 74
>PF02180 BH4: Bcl-2 homology region 4; InterPro: IPR003093 Apoptosis, or programmed cell death (PCD), is a common and evolutionarily conserved property of all metazoans []. In many biological processes, apoptosis is required to eliminate supernumerary or dangerous (such as pre-cancerous) cells and to promote normal development. Dysregulation of apoptosis can, therefore, contribute to the development of many major diseases including cancer, autoimmunity and neurodegenerative disorders. In most cases, proteins of the caspase family execute the genetic programme that leads to cell death. Bcl-2 proteins are central regulators of caspase activation, and play a key role in cell death by regulating the integrity of the mitochondrial and endoplasmic reticulum (ER) membranes []. At least 20 Bcl-2 proteins have been reported in mammals, and several others have been identified in viruses. Bcl-2 family proteins fall roughly into three subtypes, which either promote cell survival (anti-apoptotic) or trigger cell death (pro-apoptotic). All members contain at least one of four conserved motifs, termed Bcl-2 Homology (BH) domains. Bcl-2 subfamily proteins, which contain at least BH1 and BH2, promote cell survival by inhibiting the adapters needed for the activation of caspases. Pro-apoptotic members potentially exert their effects by displacing the adapters from the pro-survival proteins; these proteins belong either to the Bax subfamily, which contain BH1-BH3, or to the BH3 subfamily, which mostly only feature BH3 []. Thus, the balance between antagonistic family members is believed to play a role in determining cell fate. Members of the wider Bcl-2 family, which also includes Bcl-x, Bcl-w and Mcl-1, are described by their similarity to Bcl-2 protein, a member of the pro-survival Bcl-2 subfamily []. Full-length Bcl-2 proteins feature all four BH domains, seven alpha-helices, and a C-terminal hydrophobic motif that targets the protein to the outer mitochondrial membrane, ER and nuclear envelope. Active cell suicide (apoptosis) is induced by events such as growth factor withdrawal and toxins. It is controlled by regulators, which have either an inhibitory effect on programmed cell death (anti-apoptotic) or block the protective effect of inhibitors (pro-apoptotic) [, ]. Many viruses have found a way of countering defensive apoptosis by encoding their own anti-apoptosis genes preventing their target-cells from dying too soon. All proteins belonging to the Bcl-2 family [] contain either a BH1, BH2, BH3, or BH4 domain. All anti-apoptotic proteins contain BH1 and BH2 domains, some of them contain an additional N-terminal BH4 domain (Bcl-2, Bcl-x(L), Bcl-w), which is never seen in pro-apoptotic proteins, except for Bcl-x(S). On the other hand, all pro-apoptotic proteins contain a BH3 domain (except for Bad) necessary for dimerisation with other proteins of Bcl-2 family and crucial for their killing activity, some of them also contain BH1 and BH2 domains (Bax, Bak). The BH3 domain is also present in some anti-apoptotic protein, such as Bcl-2 or Bcl-x(L). Proteins that are known to contain these domains include vertebrate Bcl-2 (alpha and beta isoforms) and Bcl-x (isoforms (Bcl-x(L) and Bcl-x(S)); mammalian proteins Bax and Bak; mouse protein Bid; Xenopus laevis proteins Xr1 and Xr11; human induced myeloid leukemia cell differentiation protein MCL1 and Caenorhabditis elegans protein ced-9.; GO: 0042981 regulation of apoptosis; PDB: 1AF3_A 2PON_B 1YSN_A 3PL7_B 3R85_A 2O2N_A 2P1L_C 1R2G_A 2O1Y_A 1BXL_A ....
Probab=36.36 E-value=48 Score=15.42 Aligned_cols=18 Identities=17% Similarity=0.359 Sum_probs=13.6
Q ss_pred CChhhHHHHHHHHHHCCCe
Q psy11718 17 GSVAEDCYFAMLAYKQGFS 35 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~ 35 (65)
..|+||+ +..||.++||.
T Consensus 6 R~lV~~y-i~yKLsQrgy~ 23 (27)
T PF02180_consen 6 RELVEDY-ISYKLSQRGYV 23 (27)
T ss_dssp HHHHHHH-HHHHHHHTTST
T ss_pred HHHHHHH-HHHHhhhcCCC
Confidence 5678886 67888888873
No 75
>smart00775 LNS2 LNS2 domain. This domain is found in Saccharomyces cerevisiae protein SMP2, proteins with an N-terminal lipin domain and phosphatidylinositol transfer proteins. SMP2 is involved in plasmid maintenance and respiration. Lipin proteins are involved in adipose tissue development and insulin resistance.
Probab=35.57 E-value=43 Score=20.83 Aligned_cols=29 Identities=3% Similarity=0.031 Sum_probs=21.6
Q ss_pred hhHHHHHHHHHHCCCeEEEecceeeecCC
Q psy11718 20 AEDCYFAMLAYKQGFSFNFVEGEMWEKSP 48 (65)
Q Consensus 20 tED~~l~~~l~~~g~~~~~~~~~~~~~~P 48 (65)
..-.++..+++++|+++.|+.+..+.+++
T Consensus 30 ~~~~~a~~~l~~~G~~ivy~TGRp~~~~~ 58 (157)
T smart00775 30 PGVAKLYRDIQNNGYKILYLTARPIGQAD 58 (157)
T ss_pred HHHHHHHHHHHHcCCeEEEEcCCcHHHHH
Confidence 33456778888899999999776665555
No 76
>PF11397 GlcNAc: Glycosyltransferase (GlcNAc); InterPro: IPR021067 GlcNAc is an enzyme that carries out the first glycosylation step of hydroxylated Skp1; it is found in the cytoplasm and results in a pentasaccharide-linked 'HyPro-143[, ].
Probab=34.36 E-value=47 Score=23.70 Aligned_cols=38 Identities=16% Similarity=0.114 Sum_probs=28.5
Q ss_pred hhhhhhhccCCCC-CCCChhhHHHHHHHHHHCCCeEEEe
Q psy11718 2 KVSAEKQVSFDNG-LDGSVAEDCYFAMLAYKQGFSFNFV 39 (65)
Q Consensus 2 ~~~~e~~~gw~~g-~~~sltED~~l~~~l~~~g~~~~~~ 39 (65)
++.+++.+++|+- |---..|.+.++.||.-+||.+.-.
T Consensus 216 ~~~~~~eVP~DP~lp~lF~GEE~~~aaRlwT~GYD~Y~P 254 (343)
T PF11397_consen 216 PGHFVREVPYDPHLPFLFDGEEISMAARLWTHGYDFYSP 254 (343)
T ss_pred chhheecCCCCCCcccccccHHHHHHHHHHHcCCccccC
Confidence 4567888999982 0122689999999999999988554
No 77
>PF00070 Pyr_redox: Pyridine nucleotide-disulphide oxidoreductase; InterPro: IPR001327 FAD flavoproteins belonging to the family of pyridine nucleotide-disulphide oxidoreductases (glutathione reductase, trypanothione reductase, lipoamide dehydrogenase, mercuric reductase, thioredoxin reductase, alkyl hydroperoxide reductase) share sequence similarity with a number of other flavoprotein oxidoreductases, in particular with ferredoxin-NAD+ reductases involved in oxidative metabolism of a variety of hydrocarbons (rubredoxin reductase, putidaredoxin reductase, terpredoxin reductase, ferredoxin-NAD+ reductase components of benzene 1,2-dioxygenase, toluene 1,2-dioxygenase, chlorobenzene dioxygenase, biphenyl dioxygenase), NADH oxidase and NADH peroxidase [, , ]. Comparison of the crystal structures of human glutathione reductase and Escherichia coli thioredoxin reductase reveals different locations of their active sites, suggesting that the enzymes diverged from an ancestral FAD/NAD(P)H reductase and acquired their disulphide reductase activities independently []. Despite functional similarities, oxidoreductases of this family show no sequence similarity with adrenodoxin reductases [] and flavoprotein pyridine nucleotide cytochrome reductases (FPNCR) []. Assuming that disulphide reductase activity emerged later, during divergent evolution, the family can be referred to as FAD-dependent pyridine nucleotide reductases, FADPNR. To date, 3D structures of glutathione reductase [], thioredoxin reductase [], mercuric reductase [], lipoamide dehydrogenase [], trypanothione reductase [] and NADH peroxidase [] have been solved. The enzymes share similar tertiary structures based on a doubly-wound alpha/beta fold, but the relative orientations of their FAD- and NAD(P)H-binding domains may vary significantly. By contrast with the FPNCR family, the folds of the FAD- and NAD(P)H-binding domains are similar, suggesting that the domains evolved by gene duplication []. This entry describes a small NADH binding domain within a larger FAD binding domain described by IPR023753 from INTERPRO. It is found in both class I and class II oxidoreductases. ; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZKQ_A 3DGZ_A 1ZDL_A 2R9Z_B 2RAB_A 2A87_B 1M6I_A 2YVG_A 2GR1_A 2GQW_A ....
Probab=34.17 E-value=79 Score=16.87 Aligned_cols=37 Identities=5% Similarity=-0.041 Sum_probs=24.7
Q ss_pred HHHHHHHHHCCCeEEEe--cceeeecCCCCHHHHHHHHH
Q psy11718 23 CYFAMLAYKQGFSFNFV--EGEMWEKSPFSLWDFVQQRK 59 (65)
Q Consensus 23 ~~l~~~l~~~g~~~~~~--~~~~~~~~P~~~~~~~~QR~ 59 (65)
+|++..|...|.++..+ .+......+++....+.++.
T Consensus 12 ~E~A~~l~~~g~~vtli~~~~~~~~~~~~~~~~~~~~~l 50 (80)
T PF00070_consen 12 IELAEALAELGKEVTLIERSDRLLPGFDPDAAKILEEYL 50 (80)
T ss_dssp HHHHHHHHHTTSEEEEEESSSSSSTTSSHHHHHHHHHHH
T ss_pred HHHHHHHHHhCcEEEEEeccchhhhhcCHHHHHHHHHHH
Confidence 57777788888888766 23344666667666665554
No 78
>PF02639 DUF188: Uncharacterized BCR, YaiI/YqxD family COG1671; InterPro: IPR003791 This entry describes proteins of unknown function.
Probab=33.83 E-value=38 Score=20.92 Aligned_cols=40 Identities=15% Similarity=0.129 Sum_probs=28.6
Q ss_pred CChhhHHHHHHHHHHCCCeEEEecceeeecCCCCHHHHHHHH
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQR 58 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~QR 58 (65)
-.||-|+-|+-.+..+|-.+.=.....|+ ++++..++.+|
T Consensus 54 iVITqDigLA~~~l~Kga~vl~~rG~~yt--~~nI~~~L~~R 93 (130)
T PF02639_consen 54 IVITQDIGLASLLLAKGAYVLNPRGKEYT--KENIDELLAMR 93 (130)
T ss_pred EEEECCHHHHHHHHHCCCEEECCCCCCCC--HHHHHHHHHHH
Confidence 47899999999999988777554444443 45666666655
No 79
>PF04033 DUF365: Domain of unknown function (DUF365); InterPro: IPR007176 This is an archaeal family of unknown function.; PDB: 2KKU_A.
Probab=32.06 E-value=15 Score=21.96 Aligned_cols=19 Identities=32% Similarity=0.564 Sum_probs=15.4
Q ss_pred cCCCCHHHHHHHHHhhhcC
Q psy11718 46 KSPFSLWDFVQQRKRWLQG 64 (65)
Q Consensus 46 ~~P~~~~~~~~QR~RW~~G 64 (65)
..|+-++.+-+-|+||..|
T Consensus 42 LT~eE~r~Y~~~~~Rwg~~ 60 (97)
T PF04033_consen 42 LTKEELRKYERSRKRWGSG 60 (97)
T ss_dssp S-HHHHHHHHHHHHHTSS-
T ss_pred cCHHHHHHHHhhHHhhccC
Confidence 5688899999999999874
No 80
>COG4590 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=31.63 E-value=5.1 Score=30.80 Aligned_cols=28 Identities=21% Similarity=0.311 Sum_probs=24.5
Q ss_pred CCeEEEecce---eeecCCCCHHHHHHHHHh
Q psy11718 33 GFSFNFVEGE---MWEKSPFSLWDFVQQRKR 60 (65)
Q Consensus 33 g~~~~~~~~~---~~~~~P~~~~~~~~QR~R 60 (65)
.||+.|+.+. |++-+.+|+.++++||.|
T Consensus 697 HyRvLFL~ALvLltFTF~vNtLAE~vRqRLR 727 (733)
T COG4590 697 HYRVLFLAALVLLTFTFAVNTLAEWVRQRLR 727 (733)
T ss_pred eeehHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5888888664 899999999999999988
No 81
>PF13500 AAA_26: AAA domain; PDB: 3OF5_A 2IOJ_A 4A0G_B 4A0R_A 4A0H_B 4A0F_B 3FMI_C 3FPA_D 3FMF_C 3FGN_A ....
Probab=30.21 E-value=1.4e+02 Score=18.61 Aligned_cols=33 Identities=6% Similarity=-0.028 Sum_probs=21.1
Q ss_pred HHHHHHHHHCCCeEEEecceeeecCCCCHHHHH
Q psy11718 23 CYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFV 55 (65)
Q Consensus 23 ~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~ 55 (65)
+-|+..|.++|.++.|++++.....-++-.+++
T Consensus 19 lgL~~~l~~~g~~v~~~KPi~~~~~~d~d~~~~ 51 (199)
T PF13500_consen 19 LGLARALRRRGIKVGYFKPIQTGPEDDEDAELI 51 (199)
T ss_dssp HHHHHHHHHTTSEEEEEEEEEESCCCSSHHHHH
T ss_pred HHHHHHHHhCCCceEEEeeeEecCCCCchHHHH
Confidence 345566777899999988875554434444444
No 82
>PF09140 MipZ: ATPase MipZ; InterPro: IPR015223 Cell division in bacteria is facilitated by a polymeric ring structure, the Z ring, composed of tubulin-like FtsZ protofilaments. Correct positioning of the division plane is a prerequisite for the generation of daughter cells with a normal chromosome complement. In Caulobacter crescentus MipZ, an essential protein, coordinates and regulates the assembly of the FtsZ cytokinetic ring during cell division. MipZ, forms a complex with the partitioning protein ParB near the origin of replication and localizes with the duplicated origin regions to the cell poles. MipZ also directly interferes with FtsZ polymerisation, thereby restricting FtsZ ring formation to mid-cell, the region of lowest MipZ concentration. In eukaryotes members of this entry belong to the Mrp/NBP35 ATP-binding protein family, and specifically the NUBP2/CFD1 subfamily. This includes the cytosolic Fe-S cluster assembly factor Cfd1, which is a component of the cytosolic iron-sulphur (Fe/S) protein assembly machinery. This protein is required for maturation of extra-mitochondrial Fe/S proteins. It may bind and transfer a labile 4Fe-4S cluster to target apoproteins. Cfd1 is also required for biogenesis and export of both ribosomal subunits, suggesting a role in assembly of the Fe/S clusters in RLI1, a protein which performs rRNA processing and ribosome export. ; PDB: 2XIT_B 2XJ4_A 2XJ9_A.
Probab=29.20 E-value=70 Score=22.37 Aligned_cols=39 Identities=15% Similarity=0.334 Sum_probs=24.6
Q ss_pred hhHHHHHHHHHHCCCeEEEecceeeecCCCCHHHHHHHHHhh
Q psy11718 20 AEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQRKRW 61 (65)
Q Consensus 20 tED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~QR~RW 61 (65)
|=-+.++..|+++|++|..++..+++. |+..++.-|.-|
T Consensus 16 Tva~~lA~aLa~~G~kVg~lD~Di~q~---S~~r~l~nr~~~ 54 (261)
T PF09140_consen 16 TVAVNLAVALARMGKKVGLLDLDIRQP---SLPRYLENRAAW 54 (261)
T ss_dssp HHHHHHHHHHHCTT--EEEEE--TTT----HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCeEEEEecCCCCC---CHHHHHhccchh
Confidence 445778888999999999998877653 455566666555
No 83
>COG4464 CapC Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]
Probab=27.35 E-value=1.3e+02 Score=20.95 Aligned_cols=46 Identities=17% Similarity=0.154 Sum_probs=32.5
Q ss_pred cCCCCCCCChhhHHHHHHHHHHCCCeEEEecce----eeecCCCCHHHHHH
Q psy11718 10 SFDNGLDGSVAEDCYFAMLAYKQGFSFNFVEGE----MWEKSPFSLWDFVQ 56 (65)
Q Consensus 10 gw~~g~~~sltED~~l~~~l~~~g~~~~~~~~~----~~~~~P~~~~~~~~ 56 (65)
+-|.|| .|+.|.+++.-...++|.+.....+. .|+.+++-.+.-..
T Consensus 11 ~iDDGp-~s~eesl~ml~~A~~qGvt~iVaTsHh~~g~y~n~~~~v~~~~~ 60 (254)
T COG4464 11 DIDDGP-KSLEESLAMLREAVRQGVTKIVATSHHLHGRYENPIEKVKEKAN 60 (254)
T ss_pred CCCCCC-CcHHHHHHHHHHHHHcCceEEeecccccCCccCChHHHHHHHHH
Confidence 567888 99999999999999999877555321 55555554444333
No 84
>PRK00124 hypothetical protein; Validated
Probab=27.34 E-value=58 Score=20.81 Aligned_cols=40 Identities=18% Similarity=0.166 Sum_probs=28.3
Q ss_pred CChhhHHHHHHHHHHCCCeEEEecceeeecCCCCHHHHHHHH
Q psy11718 17 GSVAEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQR 58 (65)
Q Consensus 17 ~sltED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~QR 58 (65)
-.||.|+-|+-++..+|-.+.-.....|+ ++|+..++..|
T Consensus 70 iVIT~Di~LAa~~l~Kga~vl~prG~~yt--~~nI~~~L~~R 109 (151)
T PRK00124 70 IVITQDYGLAALALEKGAIVLNPRGYIYT--NDNIDQLLAMR 109 (151)
T ss_pred EEEeCCHHHHHHHHHCCCEEECCCCcCCC--HHHHHHHHHHH
Confidence 47899999999999998776655555554 45555555544
No 85
>KOG2116|consensus
Probab=25.80 E-value=74 Score=25.33 Aligned_cols=30 Identities=3% Similarity=0.091 Sum_probs=23.0
Q ss_pred ChhhHHHHHHHHHHCCCeEEEecceeeecC
Q psy11718 18 SVAEDCYFAMLAYKQGFSFNFVEGEMWEKS 47 (65)
Q Consensus 18 sltED~~l~~~l~~~g~~~~~~~~~~~~~~ 47 (65)
+.+-=..|--++.++||++.|+.+.+.+||
T Consensus 559 Th~GVAkLyt~Ik~NGYk~lyLSARaIgQA 588 (738)
T KOG2116|consen 559 THTGVAKLYTKIKENGYKILYLSARAIGQA 588 (738)
T ss_pred hhhhHHHHHHHHHhCCeeEEEEehhhhhhh
Confidence 445556677788889999999988766665
No 86
>PF00289 CPSase_L_chain: Carbamoyl-phosphate synthase L chain, N-terminal domain; InterPro: IPR005481 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains []. This entry represents the N-terminal domain of the large subunit of carbamoyl phosphate synthase. This domain can also be found in certain other related proteins. ; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3VA7_A 3OUU_A 3OUZ_B 1W96_B 1W93_A 1ULZ_A 3HB9_C 3HO8_A 3BG5_C 3HBL_A ....
Probab=24.94 E-value=1.1e+02 Score=18.17 Aligned_cols=22 Identities=32% Similarity=0.516 Sum_probs=17.0
Q ss_pred CCCCChhhHHHHHHHHHHCCCeE
Q psy11718 14 GLDGSVAEDCYFAMLAYKQGFSF 36 (65)
Q Consensus 14 g~~~sltED~~l~~~l~~~g~~~ 36 (65)
|. +.+.|+.+|.-++.+.|.++
T Consensus 81 Gy-g~lse~~~fa~~~~~~gi~f 102 (110)
T PF00289_consen 81 GY-GFLSENAEFAEACEDAGIIF 102 (110)
T ss_dssp TS-STTTTHHHHHHHHHHTT-EE
T ss_pred cc-chhHHHHHHHHHHHHCCCEE
Confidence 44 78999999999998777654
No 87
>PRK11835 hypothetical protein; Provisional
Probab=24.15 E-value=21 Score=22.02 Aligned_cols=9 Identities=44% Similarity=1.062 Sum_probs=7.2
Q ss_pred HHHhhhcCC
Q psy11718 57 QRKRWLQGY 65 (65)
Q Consensus 57 QR~RW~~G~ 65 (65)
|.-||..||
T Consensus 14 QsYRW~sG~ 22 (114)
T PRK11835 14 QSYRWSAGF 22 (114)
T ss_pred cceeeccCc
Confidence 677998886
No 88
>KOG1359|consensus
Probab=23.52 E-value=37 Score=24.87 Aligned_cols=19 Identities=26% Similarity=0.368 Sum_probs=15.9
Q ss_pred eeecCCCCHHHHHHHHHhh
Q psy11718 43 MWEKSPFSLWDFVQQRKRW 61 (65)
Q Consensus 43 ~~~~~P~~~~~~~~QR~RW 61 (65)
-|+..|.-+.+|++||.|=
T Consensus 271 Gyttgp~~li~llrqr~Rp 289 (417)
T KOG1359|consen 271 GYTTGPKPLISLLRQRSRP 289 (417)
T ss_pred CCccCChhHHHHHHhcCCc
Confidence 3667899999999999883
No 89
>PRK11750 gltB glutamate synthase subunit alpha; Provisional
Probab=23.22 E-value=88 Score=26.96 Aligned_cols=39 Identities=15% Similarity=0.203 Sum_probs=28.8
Q ss_pred hhHHHHH-HHHHHCCCeEEE-e--cce--eeecCCCCHHHHHHHH
Q psy11718 20 AEDCYFA-MLAYKQGFSFNF-V--EGE--MWEKSPFSLWDFVQQR 58 (65)
Q Consensus 20 tED~~l~-~~l~~~g~~~~~-~--~~~--~~~~~P~~~~~~~~QR 58 (65)
.||+++. .-|+..|..... + ++. |..+.|..+-+||+|+
T Consensus 468 ~E~~~~~l~pMa~~g~EpigSMG~D~plAvLS~~~~~l~~yFkQ~ 512 (1485)
T PRK11750 468 FEELDQVIRVLAENGQEAVGSMGDDTPMAVLSSQPRSIYDYFRQQ 512 (1485)
T ss_pred HHHHHHHHHHHHhcCCcCcccCCCCcchhhhcCCCccHHHHhhhc
Confidence 5888844 557777765544 3 332 8999999999999996
No 90
>PF04328 DUF466: Protein of unknown function (DUF466); InterPro: IPR007423 This is a small bacterial protein of unknown function.
Probab=21.83 E-value=1e+02 Score=16.84 Aligned_cols=17 Identities=24% Similarity=0.495 Sum_probs=14.2
Q ss_pred eecCCCCHHHHHHHHHh
Q psy11718 44 WEKSPFSLWDFVQQRKR 60 (65)
Q Consensus 44 ~~~~P~~~~~~~~QR~R 60 (65)
-+++|-|-++||++|.-
T Consensus 37 P~~p~ms~~eF~r~r~~ 53 (65)
T PF04328_consen 37 PDEPPMSEREFFRERQD 53 (65)
T ss_pred cCCCCCCHHHHHHHHHH
Confidence 35799999999999864
No 91
>PF13362 Toprim_3: Toprim domain
Probab=20.30 E-value=1.7e+02 Score=16.14 Aligned_cols=30 Identities=23% Similarity=0.175 Sum_probs=20.2
Q ss_pred HHHHHHCCCeEEEecceeeecCCCCHHHHHHHH
Q psy11718 26 AMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQR 58 (65)
Q Consensus 26 ~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~QR 58 (65)
.-++...|..+..+.+ .....+|.|+++++
T Consensus 64 ~~~~~~~g~~~~~~~p---~~~g~D~ND~l~~~ 93 (96)
T PF13362_consen 64 AERLEAAGIAVSIVEP---GPEGKDWNDLLQAR 93 (96)
T ss_pred HHHHHhCCCeEEEECC---CCCCchHHHHHHhh
Confidence 4555567888877755 33345899988875
No 92
>KOG3833|consensus
Probab=20.21 E-value=96 Score=23.04 Aligned_cols=28 Identities=18% Similarity=0.157 Sum_probs=20.5
Q ss_pred HHHHHCCCeEEEe-cceeeecCCCCHHHH
Q psy11718 27 MLAYKQGFSFNFV-EGEMWEKSPFSLWDF 54 (65)
Q Consensus 27 ~~l~~~g~~~~~~-~~~~~~~~P~~~~~~ 54 (65)
-+|..+|..+... +-.+.+++|+++++-
T Consensus 450 D~L~~~~I~iR~aSpklvmEEAPesYKdV 478 (505)
T KOG3833|consen 450 DKLRSRGIAIRVASPKLVMEEAPESYKDV 478 (505)
T ss_pred HHHHhCCeEEEeCCccchhhhCchhhhhH
Confidence 3455667777766 444999999999875
No 93
>PF09724 DUF2036: Uncharacterized conserved protein (DUF2036); InterPro: IPR019128 Sister chromatid cohesion protein DCC1 is a component of the RFC-like complex CTF18-RFC. This complex is required for the efficient establishment of chromosome cohesion during S-phase and may load or unload POL30/PCNA. During a clamp loading circle, the RFC:clamp complex binds to DNA and the recognition of the double-stranded/single-stranded junction stimulates ATP hydrolysis by RFC. The complex presumably provides bipartite ATP sites in which one subunit supplies a catalytic site for hydrolysis of ATP bound to the neighbouring subunit. Dissociation of RFC from the clamp leaves the clamp encircling DNA [, ].
Probab=20.05 E-value=26 Score=24.24 Aligned_cols=30 Identities=7% Similarity=0.255 Sum_probs=22.7
Q ss_pred CeEEEecc-eeeecCCCCHHHHHHHHHhhhc
Q psy11718 34 FSFNFVEG-EMWEKSPFSLWDFVQQRKRWLQ 63 (65)
Q Consensus 34 ~~~~~~~~-~~~~~~P~~~~~~~~QR~RW~~ 63 (65)
-.|.|++. ..-+.+..-|+.+|+.|.+|..
T Consensus 287 ~~I~y~~~~~Lp~dp~~Rf~~LF~~~~kW~~ 317 (325)
T PF09724_consen 287 TTIQYFPESDLPTDPKERFKELFKLRPKWTL 317 (325)
T ss_pred CEEEEecHHHCCCCHHHHHHHHHhcCCCCCH
Confidence 35666643 3667788899999999999963
No 94
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=20.02 E-value=31 Score=22.41 Aligned_cols=37 Identities=24% Similarity=0.219 Sum_probs=22.0
Q ss_pred hhHHHHHHHHHHCCCeEEEecceeeecCCCCHHHHHHH
Q psy11718 20 AEDCYFAMLAYKQGFSFNFVEGEMWEKSPFSLWDFVQQ 57 (65)
Q Consensus 20 tED~~l~~~l~~~g~~~~~~~~~~~~~~P~~~~~~~~Q 57 (65)
.+++.+++.|.+.|.|...+...+ +.+++.+..++++
T Consensus 10 ~~~~~lA~~Li~lGaR~q~l~~et-~Ls~~rl~~Lyke 46 (175)
T PF05280_consen 10 AEQIQLAIELIKLGARLQVLESET-GLSRERLRRLYKE 46 (175)
T ss_dssp HHHHHHHHHHHHTT--SCCHHHHS-SS-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCcHHHHHHHHH-CCCHHHHHHHHHH
Confidence 356788999999999987765432 4444555555443
Done!