Query psy6918
Match_columns 123
No_of_seqs 114 out of 1417
Neff 10.6
Searched_HMMs 46136
Date Fri Aug 16 21:45:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6918.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6918hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3396|consensus 99.9 8.9E-26 1.9E-30 129.7 11.2 119 5-123 31-150 (150)
2 PLN02706 glucosamine 6-phospha 99.9 2E-21 4.4E-26 117.2 15.0 118 6-123 32-150 (150)
3 PHA00673 acetyltransferase dom 99.8 8.7E-20 1.9E-24 109.3 12.1 102 12-115 41-144 (154)
4 PTZ00330 acetyltransferase; Pr 99.8 2.2E-19 4.8E-24 107.8 13.9 113 8-122 33-146 (147)
5 PRK10146 aminoalkylphosphonic 99.8 7.4E-20 1.6E-24 109.5 10.9 111 9-122 30-143 (144)
6 KOG3139|consensus 99.8 3E-18 6.4E-23 101.7 11.2 103 16-122 46-151 (165)
7 PF00583 Acetyltransf_1: Acety 99.8 3.9E-18 8.3E-23 93.1 10.3 77 36-113 4-83 (83)
8 PF13508 Acetyltransf_7: Acety 99.8 1E-17 2.3E-22 90.9 9.2 77 26-114 3-79 (79)
9 PF13673 Acetyltransf_10: Acet 99.8 2.9E-17 6.3E-22 95.0 10.6 83 17-112 35-117 (117)
10 PRK10314 putative acyltransfer 99.7 5.1E-17 1.1E-21 98.5 9.2 84 28-117 50-134 (153)
11 TIGR01575 rimI ribosomal-prote 99.7 2.8E-16 6.2E-21 92.2 10.9 99 6-117 15-116 (131)
12 PRK10975 TDP-fucosamine acetyl 99.7 3.8E-16 8.3E-21 98.0 11.7 81 36-122 110-193 (194)
13 PRK03624 putative acetyltransf 99.7 2.6E-16 5.6E-21 93.3 10.3 84 24-116 43-129 (140)
14 TIGR02382 wecD_rffC TDP-D-fuco 99.7 3.9E-16 8.6E-21 97.7 11.6 79 36-120 107-188 (191)
15 KOG3216|consensus 99.7 6.5E-16 1.4E-20 91.0 11.2 97 20-117 46-146 (163)
16 cd02169 Citrate_lyase_ligase C 99.7 4.5E-16 9.8E-21 102.9 10.0 79 25-116 5-83 (297)
17 PF13527 Acetyltransf_9: Acety 99.7 8.6E-15 1.9E-19 85.9 14.0 84 27-114 42-126 (127)
18 PRK10140 putative acetyltransf 99.7 1.4E-15 3E-20 92.6 10.9 89 25-118 50-142 (162)
19 TIGR03103 trio_acet_GNAT GNAT- 99.7 1.6E-15 3.6E-20 107.7 12.0 97 24-120 121-220 (547)
20 PF13420 Acetyltransf_4: Acety 99.7 5E-15 1.1E-19 89.6 12.0 106 8-120 32-142 (155)
21 COG1247 Sortase and related ac 99.7 1E-14 2.3E-19 88.7 13.0 109 6-118 33-144 (169)
22 PF13523 Acetyltransf_8: Acety 99.7 8.8E-15 1.9E-19 88.4 12.8 95 22-118 44-142 (152)
23 TIGR02406 ectoine_EctA L-2,4-d 99.7 2.1E-15 4.6E-20 91.8 9.9 86 28-118 41-129 (157)
24 TIGR03827 GNAT_ablB putative b 99.6 3.8E-15 8.2E-20 97.7 11.3 87 25-118 157-246 (266)
25 PRK07922 N-acetylglutamate syn 99.6 3E-15 6.6E-20 92.1 10.1 79 28-116 47-126 (169)
26 PRK09491 rimI ribosomal-protei 99.6 1E-14 2.2E-19 87.6 12.0 82 26-116 40-124 (146)
27 PRK07757 acetyltransferase; Pr 99.6 5.2E-15 1.1E-19 89.4 10.3 79 28-116 43-121 (152)
28 PHA01807 hypothetical protein 99.6 1.5E-14 3.1E-19 87.4 11.5 82 25-110 52-136 (153)
29 PRK10809 ribosomal-protein-S5- 99.6 2.8E-14 6.2E-19 89.4 12.8 92 22-117 71-166 (194)
30 COG1246 ArgA N-acetylglutamate 99.6 1.7E-14 3.7E-19 85.7 10.1 79 29-115 43-121 (153)
31 COG0456 RimI Acetyltransferase 99.6 2.5E-14 5.5E-19 88.2 10.9 78 38-116 72-153 (177)
32 TIGR01211 ELP3 histone acetylt 99.6 2.5E-14 5.5E-19 100.7 11.5 100 24-123 410-522 (522)
33 TIGR03585 PseH pseudaminic aci 99.6 8.5E-14 1.8E-18 84.2 11.8 105 6-118 31-139 (156)
34 PLN02825 amino-acid N-acetyltr 99.6 2.4E-14 5.2E-19 100.5 10.4 80 28-115 409-488 (515)
35 PRK15130 spermidine N1-acetylt 99.6 7.6E-14 1.7E-18 86.9 11.6 96 16-118 47-146 (186)
36 PRK10151 ribosomal-protein-L7/ 99.6 1.3E-13 2.9E-18 85.4 12.6 104 8-118 46-156 (179)
37 PRK09831 putative acyltransfer 99.6 1.6E-14 3.4E-19 87.0 8.1 72 28-117 55-126 (147)
38 PRK10514 putative acetyltransf 99.6 5.5E-14 1.2E-18 84.2 10.3 94 8-117 33-126 (145)
39 TIGR03448 mycothiol_MshD mycot 99.6 8E-14 1.7E-18 92.4 11.7 79 36-118 208-289 (292)
40 PRK12308 bifunctional arginino 99.6 3.5E-14 7.6E-19 102.3 10.6 80 28-117 505-584 (614)
41 COG2153 ElaA Predicted acyltra 99.6 2.9E-14 6.4E-19 83.4 8.1 90 24-119 48-139 (155)
42 PRK05279 N-acetylglutamate syn 99.6 3.9E-14 8.5E-19 98.7 10.2 82 28-117 336-417 (441)
43 TIGR00124 cit_ly_ligase [citra 99.6 8.5E-14 1.8E-18 93.4 11.4 78 26-116 31-108 (332)
44 TIGR01890 N-Ac-Glu-synth amino 99.6 6.9E-14 1.5E-18 97.2 10.8 82 28-117 324-405 (429)
45 PF13302 Acetyltransf_3: Acety 99.5 6.6E-13 1.4E-17 79.0 12.2 101 8-113 35-142 (142)
46 TIGR03448 mycothiol_MshD mycot 99.5 1.7E-13 3.7E-18 90.8 10.5 85 28-122 48-133 (292)
47 PRK01346 hypothetical protein; 99.5 9.6E-13 2.1E-17 91.1 13.4 90 28-121 49-140 (411)
48 PRK13688 hypothetical protein; 99.5 3.3E-13 7.2E-18 81.9 9.8 86 24-116 43-132 (156)
49 COG3153 Predicted acetyltransf 99.5 1.8E-12 3.8E-17 79.1 12.3 95 15-115 35-129 (171)
50 TIGR01686 FkbH FkbH-like domai 99.5 5.5E-13 1.2E-17 89.6 11.0 99 6-115 215-319 (320)
51 COG3393 Predicted acetyltransf 99.5 1.4E-12 3.1E-17 83.5 11.4 81 36-122 185-267 (268)
52 PF08445 FR47: FR47-like prote 99.4 2.3E-12 4.9E-17 70.9 7.9 58 58-115 21-80 (86)
53 PRK10562 putative acetyltransf 99.4 1.2E-11 2.5E-16 74.2 10.4 72 28-116 50-124 (145)
54 KOG2488|consensus 99.4 9.3E-12 2E-16 76.1 9.1 87 25-116 92-181 (202)
55 PF13718 GNAT_acetyltr_2: GNAT 99.4 1.5E-11 3.2E-16 76.7 10.1 96 18-115 19-174 (196)
56 KOG3397|consensus 99.4 1.3E-11 2.9E-16 74.5 9.1 76 36-116 65-140 (225)
57 KOG3235|consensus 99.3 3.5E-11 7.6E-16 71.7 9.2 92 22-115 37-133 (193)
58 COG1670 RimL Acetyltransferase 99.3 8.6E-11 1.9E-15 72.5 10.5 79 36-117 76-158 (187)
59 cd04301 NAT_SF N-Acyltransfera 99.2 2E-10 4.3E-15 58.6 8.1 58 36-97 7-64 (65)
60 KOG3234|consensus 99.2 2.2E-10 4.8E-15 68.2 8.5 89 28-120 43-134 (173)
61 PF14542 Acetyltransf_CG: GCN5 99.1 3.2E-09 6.9E-14 57.2 8.7 56 36-99 7-62 (78)
62 COG3981 Predicted acetyltransf 99.1 1.7E-09 3.8E-14 65.4 8.3 83 27-115 70-157 (174)
63 PF12746 GNAT_acetyltran: GNAT 99.1 3.9E-09 8.5E-14 68.9 10.4 79 36-121 173-251 (265)
64 PF12568 DUF3749: Acetyltransf 99.1 1.5E-08 3.3E-13 58.7 11.4 85 24-119 36-127 (128)
65 KOG4144|consensus 99.0 1.1E-09 2.4E-14 65.1 4.3 82 36-117 70-161 (190)
66 KOG3138|consensus 98.9 3.9E-09 8.5E-14 65.3 5.3 59 58-116 89-151 (187)
67 COG1444 Predicted P-loop ATPas 98.9 1.4E-08 3E-13 74.2 8.4 93 21-115 465-589 (758)
68 COG2388 Predicted acetyltransf 98.8 5.7E-08 1.2E-12 54.3 6.7 67 24-98 13-79 (99)
69 COG1243 ELP3 Histone acetyltra 98.8 1.6E-08 3.5E-13 69.7 5.2 98 24-123 403-515 (515)
70 COG3053 CitC Citrate lyase syn 98.8 1.2E-07 2.5E-12 62.2 8.6 78 25-115 36-113 (352)
71 KOG4135|consensus 98.7 6.2E-07 1.3E-11 53.3 10.4 110 6-115 43-168 (185)
72 PF08444 Gly_acyl_tr_C: Aralky 98.7 5.8E-08 1.3E-12 53.0 5.2 69 36-115 7-78 (89)
73 COG4552 Eis Predicted acetyltr 98.6 2.2E-07 4.7E-12 62.3 7.8 82 36-119 47-129 (389)
74 PF00765 Autoind_synth: Autoin 98.6 9.4E-07 2E-11 55.1 8.2 96 17-115 37-153 (182)
75 PRK13834 putative autoinducer 98.5 1.3E-06 2.9E-11 55.5 9.0 97 17-115 45-163 (207)
76 PF06852 DUF1248: Protein of u 98.5 5.2E-06 1.1E-10 51.4 10.7 81 36-117 55-137 (181)
77 PF13480 Acetyltransf_6: Acety 98.4 3.4E-05 7.3E-10 45.6 11.7 85 6-99 51-135 (142)
78 COG3818 Predicted acetyltransf 98.4 4.3E-06 9.3E-11 48.7 7.3 85 30-116 49-147 (167)
79 TIGR03694 exosort_acyl putativ 98.3 4E-06 8.7E-11 54.5 7.7 80 36-115 65-196 (241)
80 COG5628 Predicted acetyltransf 98.3 8.4E-06 1.8E-10 46.8 7.7 74 36-113 45-119 (143)
81 COG0454 WecD Histone acetyltra 98.3 8.2E-07 1.8E-11 50.0 3.5 44 64-112 87-130 (156)
82 COG3916 LasI N-acyl-L-homoseri 98.3 1.8E-05 3.9E-10 49.7 9.1 97 17-115 44-161 (209)
83 PF05301 Mec-17: Touch recepto 97.9 0.00054 1.2E-08 39.5 9.2 51 36-86 17-74 (120)
84 KOG2535|consensus 97.8 2.3E-05 5E-10 53.0 3.5 102 21-122 436-552 (554)
85 PF13880 Acetyltransf_13: ESCO 97.8 2.8E-05 6.1E-10 40.7 3.0 29 58-86 5-33 (70)
86 PF01233 NMT: Myristoyl-CoA:pr 97.8 0.0021 4.7E-08 39.0 11.1 64 36-99 87-151 (162)
87 PF04958 AstA: Arginine N-succ 97.7 0.0022 4.7E-08 43.8 12.0 105 7-111 30-182 (342)
88 COG3375 Uncharacterized conser 97.6 0.0024 5.2E-08 40.9 9.6 86 11-101 32-117 (266)
89 TIGR03019 pepcterm_femAB FemAB 97.5 0.0096 2.1E-07 40.5 12.5 106 6-122 177-286 (330)
90 TIGR03244 arg_catab_AstA argin 97.5 0.0014 3.1E-08 44.4 8.1 105 7-111 28-178 (336)
91 PRK10456 arginine succinyltran 97.4 0.0015 3.2E-08 44.5 8.0 105 7-111 30-180 (344)
92 TIGR03245 arg_AOST_alph argini 97.4 0.0016 3.6E-08 44.1 8.2 105 7-111 28-179 (336)
93 TIGR03243 arg_catab_AOST argin 97.4 0.0018 3.8E-08 44.0 8.2 105 7-111 28-178 (335)
94 PF04377 ATE_C: Arginine-tRNA- 97.3 0.012 2.7E-07 34.7 9.7 54 36-98 47-102 (128)
95 KOG2696|consensus 96.9 0.0061 1.3E-07 41.7 6.9 66 23-90 179-249 (403)
96 PRK01305 arginyl-tRNA-protein 96.9 0.057 1.2E-06 35.3 10.9 78 8-98 128-207 (240)
97 COG3882 FkbH Predicted enzyme 96.9 0.0043 9.3E-08 44.1 5.9 87 22-115 454-548 (574)
98 KOG4601|consensus 96.8 0.013 2.9E-07 37.7 7.1 66 20-85 61-135 (264)
99 cd04264 DUF619-NAGS DUF619 dom 96.7 0.022 4.8E-07 32.0 7.1 55 37-100 17-71 (99)
100 PF01853 MOZ_SAS: MOZ/SAS fami 96.7 0.015 3.3E-07 36.4 6.9 49 38-92 66-114 (188)
101 PF13444 Acetyltransf_5: Acety 96.4 0.017 3.7E-07 32.5 5.5 43 38-80 41-100 (101)
102 cd04265 DUF619-NAGS-U DUF619 d 96.3 0.049 1.1E-06 30.7 6.8 39 57-100 33-71 (99)
103 COG2401 ABC-type ATPase fused 96.3 0.0038 8.2E-08 43.9 2.7 58 58-115 241-306 (593)
104 PLN03238 probable histone acet 96.2 0.033 7.1E-07 37.1 6.4 50 36-91 139-188 (290)
105 PF11124 Pho86: Inorganic phos 96.2 0.05 1.1E-06 36.5 7.2 88 24-116 169-270 (304)
106 KOG2036|consensus 95.8 0.0078 1.7E-07 44.8 2.4 31 58-88 614-644 (1011)
107 COG3138 AstA Arginine/ornithin 95.6 0.066 1.4E-06 35.7 6.0 73 7-79 30-140 (336)
108 KOG2779|consensus 95.2 0.24 5.3E-06 34.2 7.8 59 36-94 144-203 (421)
109 PLN03239 histone acetyltransfe 95.2 0.086 1.9E-06 36.2 5.7 51 36-92 197-247 (351)
110 PF02799 NMT_C: Myristoyl-CoA: 95.1 0.47 1E-05 30.0 11.9 109 7-122 58-170 (190)
111 PRK14852 hypothetical protein; 95.1 0.18 3.9E-06 39.4 7.6 92 24-117 73-181 (989)
112 PTZ00064 histone acetyltransfe 95.0 0.086 1.9E-06 37.9 5.5 50 36-91 368-417 (552)
113 KOG2779|consensus 94.9 0.57 1.2E-05 32.4 8.8 110 6-122 289-402 (421)
114 TIGR03827 GNAT_ablB putative b 94.5 0.085 1.8E-06 34.9 4.2 42 74-115 21-62 (266)
115 PLN00104 MYST -like histone ac 94.2 0.09 1.9E-06 37.4 4.1 50 36-91 290-339 (450)
116 cd04266 DUF619-NAGS-FABP DUF61 93.8 0.71 1.5E-05 26.5 7.5 59 37-100 18-78 (108)
117 KOG2747|consensus 93.4 0.22 4.9E-06 34.8 4.7 32 60-91 262-293 (396)
118 PF04768 DUF619: Protein of un 91.8 2 4.4E-05 26.7 8.7 72 37-115 69-144 (170)
119 COG2898 Uncharacterized conser 91.6 2.5 5.5E-05 31.1 8.3 66 26-99 393-458 (538)
120 PF09390 DUF1999: Protein of u 91.3 2.1 4.5E-05 25.9 8.5 84 28-117 57-141 (161)
121 PRK02983 lysS lysyl-tRNA synth 90.9 3.7 8.1E-05 33.0 9.2 63 29-99 423-485 (1094)
122 PF12261 T_hemolysin: Thermost 90.7 0.57 1.2E-05 29.3 3.9 76 36-115 43-140 (179)
123 COG5027 SAS2 Histone acetyltra 89.4 0.28 6.1E-06 33.7 1.9 45 38-88 248-292 (395)
124 PF02474 NodA: Nodulation prot 88.4 1.1 2.4E-05 27.9 3.8 53 58-111 85-137 (196)
125 PF09924 DUF2156: Uncharacteri 86.6 7.7 0.00017 26.1 9.7 67 24-98 179-245 (299)
126 KOG3698|consensus 86.5 2.4 5.2E-05 31.6 5.2 50 66-115 824-876 (891)
127 COG5630 ARG2 Acetylglutamate s 85.7 8.3 0.00018 27.3 7.2 47 37-86 382-429 (495)
128 COG5092 NMT1 N-myristoyl trans 85.3 9.5 0.00021 26.4 7.2 60 36-95 142-202 (451)
129 COG5653 Protein involved in ce 84.4 12 0.00027 26.6 9.4 85 6-99 252-336 (406)
130 PHA02769 hypothetical protein; 84.3 1.9 4E-05 25.0 3.1 42 74-115 92-137 (154)
131 COG2935 Putative arginyl-tRNA: 83.2 10 0.00022 25.2 6.4 54 36-98 159-214 (253)
132 KOG1472|consensus 81.4 0.54 1.2E-05 35.6 0.3 72 39-115 431-503 (720)
133 KOG3014|consensus 77.1 3.9 8.4E-05 27.0 3.1 31 57-87 182-212 (257)
134 PF02388 FemAB: FemAB family; 74.7 28 0.00061 24.8 7.8 82 28-115 37-138 (406)
135 PF00571 CBS: CBS domain CBS d 73.9 8.4 0.00018 18.4 3.4 33 8-45 16-48 (57)
136 cd07042 STAS_SulP_like_sulfate 73.2 11 0.00023 20.7 4.1 40 76-115 58-97 (107)
137 cd07043 STAS_anti-anti-sigma_f 73.0 9.6 0.00021 20.5 3.8 41 75-115 54-94 (99)
138 PF13530 SCP2_2: Sterol carrie 72.2 23 0.0005 22.7 7.2 64 22-98 21-87 (218)
139 PF08901 DUF1847: Protein of u 71.5 5.9 0.00013 24.3 2.8 38 80-117 43-87 (157)
140 cd06844 STAS Sulphate Transpor 69.7 13 0.00029 20.4 3.9 40 76-115 56-95 (100)
141 cd04590 CBS_pair_CorC_HlyC_ass 69.3 15 0.00033 20.0 4.2 30 14-45 76-105 (111)
142 TIGR02886 spore_II_AA anti-sig 68.9 15 0.00032 20.4 4.0 40 76-115 56-95 (106)
143 PF01740 STAS: STAS domain; I 67.2 9.2 0.0002 21.6 3.0 40 76-115 65-104 (117)
144 TIGR00377 ant_ant_sig anti-ant 66.9 13 0.00027 20.6 3.5 40 76-115 60-99 (108)
145 PHA00432 internal virion prote 65.2 28 0.0006 21.0 6.4 22 96-117 100-121 (137)
146 PF03588 Leu_Phe_trans: Leucyl 65.2 32 0.00068 21.6 9.5 94 5-114 77-170 (173)
147 PF04339 DUF482: Protein of un 64.8 48 0.001 23.5 8.0 52 64-115 106-158 (370)
148 PF09924 DUF2156: Uncharacteri 62.3 15 0.00032 24.8 3.6 44 74-117 55-98 (299)
149 TIGR02990 ectoine_eutA ectoine 61.4 12 0.00025 24.7 2.9 39 77-115 105-149 (239)
150 PF12953 DUF3842: Domain of un 59.4 13 0.00028 22.1 2.5 42 70-112 7-48 (131)
151 PRK04531 acetylglutamate kinas 59.3 64 0.0014 23.2 6.9 38 58-100 310-347 (398)
152 PF10887 DUF2686: Protein of u 59.0 47 0.001 21.8 5.1 79 36-115 160-242 (276)
153 cd04619 CBS_pair_6 The CBS dom 58.7 20 0.00044 19.8 3.4 32 13-46 78-109 (114)
154 COG3473 Maleate cis-trans isom 57.5 17 0.00038 23.6 3.1 30 86-115 112-147 (238)
155 PRK00301 aat leucyl/phenylalan 57.4 54 0.0012 21.7 9.8 78 5-97 107-186 (233)
156 PRK00756 acyltransferase NodA; 56.5 29 0.00062 21.8 3.8 42 58-100 85-126 (196)
157 cd04641 CBS_pair_28 The CBS do 56.4 33 0.00072 19.0 4.1 30 15-46 86-115 (120)
158 TIGR00667 aat leucyl/phenylala 55.8 51 0.0011 21.0 9.7 76 7-97 79-156 (185)
159 PF13466 STAS_2: STAS domain 55.5 29 0.00062 18.0 3.7 39 74-112 41-79 (80)
160 smart00116 CBS Domain in cysta 55.1 18 0.0004 15.7 2.8 20 24-45 22-41 (49)
161 TIGR03032 conserved hypothetic 54.4 59 0.0013 22.7 5.3 37 26-72 282-318 (335)
162 cd04583 CBS_pair_ABC_OpuCA_ass 53.9 31 0.00068 18.5 3.6 28 17-46 77-104 (109)
163 PF06574 FAD_syn: FAD syntheta 53.6 50 0.0011 20.2 5.3 35 65-99 9-44 (157)
164 cd07041 STAS_RsbR_RsbS_like Su 53.3 34 0.00074 19.0 3.7 39 76-114 58-96 (109)
165 PF04816 DUF633: Family of unk 52.9 33 0.00072 22.0 3.9 45 74-118 74-123 (205)
166 cd04640 CBS_pair_27 The CBS do 52.7 39 0.00086 19.0 4.0 32 13-46 89-121 (126)
167 COG1658 Small primase-like pro 51.4 13 0.00029 22.0 1.8 23 63-85 59-81 (127)
168 PRK03681 hypA hydrogenase nick 50.7 27 0.00059 20.1 3.0 23 73-95 4-26 (114)
169 cd04610 CBS_pair_ParBc_assoc T 49.9 40 0.00087 18.1 3.8 30 15-46 73-102 (107)
170 PF06849 DUF1246: Protein of u 49.7 21 0.00045 21.1 2.4 31 83-114 12-42 (124)
171 COG4353 Uncharacterized conser 49.5 33 0.00072 21.3 3.3 51 36-91 37-87 (192)
172 PRK02983 lysS lysyl-tRNA synth 49.4 1.1E+02 0.0025 25.2 6.9 42 74-115 295-336 (1094)
173 PF02268 TFIIA_gamma_N: Transc 49.4 32 0.00069 16.7 2.7 23 70-92 6-28 (49)
174 PF02794 HlyC: RTX toxin acylt 48.9 57 0.0012 19.5 4.9 73 6-87 17-104 (133)
175 cd04801 CBS_pair_M50_like This 48.7 45 0.00097 18.2 3.8 30 15-46 80-109 (114)
176 PF04555 XhoI: Restriction end 48.0 66 0.0014 20.6 4.5 36 63-98 144-179 (196)
177 COG1064 AdhP Zn-dependent alco 48.0 37 0.00079 23.8 3.7 41 74-115 174-214 (339)
178 PF09409 PUB: PUB domain; Int 47.6 29 0.00063 18.7 2.7 42 79-120 16-68 (87)
179 PF04339 DUF482: Protein of un 47.6 1E+02 0.0022 22.0 10.6 83 23-118 248-330 (370)
180 cd04596 CBS_pair_DRTGG_assoc T 47.3 41 0.0009 18.2 3.4 30 14-45 73-102 (108)
181 KOG4518|consensus 47.2 33 0.00072 22.2 3.1 25 75-99 83-107 (264)
182 cd04627 CBS_pair_14 The CBS do 46.7 52 0.0011 18.4 3.9 30 15-46 89-118 (123)
183 COG0375 HybF Zn finger protein 46.5 42 0.0009 19.6 3.2 22 74-95 5-26 (115)
184 COG2898 Uncharacterized conser 46.3 44 0.00095 25.0 4.0 40 76-115 271-310 (538)
185 cd04611 CBS_pair_PAS_GGDEF_DUF 46.2 48 0.001 17.9 3.8 28 17-46 79-106 (111)
186 cd04587 CBS_pair_CAP-ED_DUF294 46.2 40 0.00086 18.3 3.2 29 15-45 79-107 (113)
187 COG3620 Predicted transcriptio 45.8 47 0.001 20.8 3.5 37 4-44 139-175 (187)
188 TIGR00100 hypA hydrogenase nic 45.4 36 0.00079 19.6 3.0 23 73-95 4-26 (115)
189 cd04601 CBS_pair_IMPDH This cd 45.3 50 0.0011 17.7 3.9 29 16-46 77-105 (110)
190 cd04603 CBS_pair_KefB_assoc Th 45.0 53 0.0011 18.0 3.7 30 15-46 77-106 (111)
191 PF11090 DUF2833: Protein of u 44.8 54 0.0012 18.1 6.1 43 75-117 41-84 (86)
192 COG3543 Uncharacterized conser 44.8 35 0.00077 20.3 2.8 36 63-98 9-45 (135)
193 cd04642 CBS_pair_29 The CBS do 44.7 49 0.0011 18.6 3.5 22 22-45 99-120 (126)
194 cd04585 CBS_pair_ACT_assoc2 Th 44.5 51 0.0011 18.0 3.6 21 23-45 96-116 (122)
195 cd04615 CBS_pair_2 The CBS dom 44.2 53 0.0012 17.8 3.9 27 17-45 81-107 (113)
196 PRK00762 hypA hydrogenase nick 43.8 41 0.00089 19.7 3.1 24 73-96 4-27 (124)
197 PF01751 Toprim: Toprim domain 43.8 17 0.00037 20.1 1.4 34 63-97 64-97 (100)
198 cd04631 CBS_pair_18 The CBS do 43.6 58 0.0013 18.1 4.2 27 17-45 93-119 (125)
199 cd04602 CBS_pair_IMPDH_2 This 42.9 58 0.0012 17.8 3.7 30 14-45 79-108 (114)
200 cd03173 DUF619-like DUF619 dom 42.7 63 0.0014 18.2 7.8 39 57-100 32-70 (98)
201 cd04604 CBS_pair_KpsF_GutQ_ass 42.3 57 0.0012 17.6 4.0 27 17-45 82-108 (114)
202 COG3640 CooC CO dehydrogenase 42.2 67 0.0014 21.5 4.0 42 73-115 9-52 (255)
203 PF10116 Host_attach: Protein 42.2 49 0.0011 19.6 3.3 37 72-108 71-107 (138)
204 PF12017 Tnp_P_element: Transp 42.1 74 0.0016 21.1 4.3 41 74-114 192-234 (236)
205 PRK12380 hydrogenase nickel in 42.0 45 0.00098 19.2 3.0 23 73-95 4-26 (113)
206 cd04597 CBS_pair_DRTGG_assoc2 41.9 63 0.0014 18.0 4.1 28 17-46 81-108 (113)
207 cd04803 CBS_pair_15 The CBS do 41.7 62 0.0013 17.8 3.7 21 23-45 96-116 (122)
208 cd04605 CBS_pair_MET2_assoc Th 41.7 58 0.0013 17.5 4.1 35 7-46 10-44 (110)
209 cd04614 CBS_pair_1 The CBS dom 41.3 53 0.0012 17.6 3.2 27 17-45 64-90 (96)
210 cd04620 CBS_pair_7 The CBS dom 41.1 62 0.0013 17.6 3.9 30 14-45 80-109 (115)
211 PF01155 HypA: Hydrogenase exp 41.0 40 0.00088 19.3 2.7 23 74-96 5-27 (113)
212 cd04635 CBS_pair_22 The CBS do 40.7 64 0.0014 17.8 3.9 28 17-46 90-117 (122)
213 PRK05031 tRNA (uracil-5-)-meth 40.3 51 0.0011 23.1 3.6 47 61-115 289-339 (362)
214 cd04607 CBS_pair_NTP_transfera 40.3 63 0.0014 17.6 4.0 24 21-46 85-108 (113)
215 PRK15094 magnesium/cobalt effl 40.1 85 0.0018 21.4 4.5 33 12-46 150-182 (292)
216 cd04600 CBS_pair_HPP_assoc Thi 40.0 67 0.0014 17.8 3.9 28 17-46 92-119 (124)
217 cd04197 eIF-2B_epsilon_N The N 40.0 32 0.0007 21.9 2.4 28 76-103 30-57 (217)
218 TIGR00055 uppS undecaprenyl di 39.3 41 0.0009 22.1 2.8 32 68-99 20-51 (226)
219 PRK14837 undecaprenyl pyrophos 38.4 43 0.00094 22.1 2.8 32 68-99 27-58 (230)
220 cd04639 CBS_pair_26 The CBS do 38.2 68 0.0015 17.3 4.1 30 15-46 77-106 (111)
221 TIGR01177 conserved hypothetic 37.5 1.3E+02 0.0029 20.7 5.2 58 60-118 248-314 (329)
222 COG3146 Uncharacterized protei 37.1 69 0.0015 22.6 3.6 52 64-115 121-173 (387)
223 cd04612 CBS_pair_SpoIVFB_EriC_ 37.0 71 0.0015 17.1 4.1 23 22-46 84-106 (111)
224 cd04606 CBS_pair_Mg_transporte 36.6 73 0.0016 17.2 3.9 22 23-46 82-103 (109)
225 PRK00564 hypA hydrogenase nick 36.0 63 0.0014 18.7 3.0 24 73-96 4-27 (117)
226 cd04618 CBS_pair_5 The CBS dom 36.0 75 0.0016 17.1 3.5 30 15-46 63-93 (98)
227 cd06422 NTP_transferase_like_1 36.0 42 0.00091 21.3 2.5 32 76-107 29-60 (221)
228 cd04637 CBS_pair_24 The CBS do 35.9 79 0.0017 17.4 3.7 20 24-45 97-116 (122)
229 PF04015 DUF362: Domain of unk 35.6 85 0.0018 19.8 3.8 44 72-115 17-65 (206)
230 cd04582 CBS_pair_ABC_OpuCA_ass 35.4 74 0.0016 16.9 4.0 29 15-45 72-100 (106)
231 PF03465 eRF1_3: eRF1 domain 3 35.2 69 0.0015 18.4 3.1 53 59-111 36-103 (113)
232 COG3250 LacZ Beta-galactosidas 35.2 2.4E+02 0.0051 22.6 8.0 61 58-118 299-364 (808)
233 cd04632 CBS_pair_19 The CBS do 35.1 85 0.0018 17.5 4.1 32 9-45 11-42 (128)
234 cd00475 CIS_IPPS Cis (Z)-Isopr 35.1 51 0.0011 21.5 2.7 32 68-99 21-52 (221)
235 KOG3285|consensus 34.6 86 0.0019 19.9 3.5 44 72-115 60-104 (203)
236 cd01027 TOPRIM_RNase_M5_like T 34.6 24 0.00053 18.9 1.1 25 61-85 47-71 (81)
237 cd04588 CBS_pair_CAP-ED_DUF294 34.4 79 0.0017 17.0 4.0 22 23-46 84-105 (110)
238 cd04624 CBS_pair_11 The CBS do 34.3 81 0.0017 17.0 3.9 21 24-46 23-43 (112)
239 PF03376 Adeno_E3B: Adenovirus 34.2 20 0.00044 18.5 0.6 13 67-79 53-65 (67)
240 KOG1198|consensus 33.8 81 0.0018 22.1 3.7 39 76-114 167-205 (347)
241 cd04593 CBS_pair_EriC_assoc_ba 33.5 86 0.0019 17.1 4.3 32 15-46 79-110 (115)
242 PF00925 GTP_cyclohydro2: GTP 33.4 38 0.00082 21.0 1.9 41 68-115 123-164 (169)
243 COG2384 Predicted SAM-dependen 33.3 1.4E+02 0.003 19.7 4.4 42 74-115 93-139 (226)
244 PF06559 DCD: 2'-deoxycytidine 33.2 20 0.00044 25.0 0.7 34 36-75 323-356 (364)
245 COG3623 SgaU Putative L-xylulo 33.2 78 0.0017 21.2 3.3 24 76-99 94-117 (287)
246 PRK14831 undecaprenyl pyrophos 33.2 59 0.0013 21.7 2.8 32 68-99 41-72 (249)
247 PF11633 SUD-M: Single-strande 33.1 41 0.0009 20.2 1.9 45 71-115 16-60 (142)
248 KOG1764|consensus 33.0 81 0.0018 22.5 3.7 34 11-46 319-352 (381)
249 cd04446 DEP_DEPDC4 DEP (Dishev 32.8 30 0.00065 19.4 1.2 23 69-91 23-45 (95)
250 PF14730 DUF4468: Domain of un 32.4 90 0.0019 17.0 4.8 42 5-46 7-51 (91)
251 PRK14841 undecaprenyl pyrophos 31.8 65 0.0014 21.3 2.8 32 68-99 24-55 (233)
252 TIGR01099 galU UTP-glucose-1-p 31.6 56 0.0012 21.3 2.6 32 76-107 30-61 (260)
253 PRK00393 ribA GTP cyclohydrola 31.6 98 0.0021 19.7 3.6 42 67-115 124-166 (197)
254 cd02205 CBS_pair The CBS domai 31.3 87 0.0019 16.5 4.1 30 14-45 78-107 (113)
255 PRK14833 undecaprenyl pyrophos 31.3 66 0.0014 21.2 2.8 32 68-99 25-56 (233)
256 PF01255 Prenyltransf: Putativ 31.1 41 0.00088 21.9 1.8 31 69-99 16-46 (223)
257 PTZ00349 dehydrodolichyl dipho 30.5 60 0.0013 22.6 2.6 32 68-99 40-71 (322)
258 TIGR00505 ribA GTP cyclohydrol 30.4 1E+02 0.0022 19.6 3.5 42 67-115 121-163 (191)
259 COG3113 Predicted NTP binding 30.3 1.1E+02 0.0024 17.4 3.4 36 75-111 55-91 (99)
260 PRK07758 hypothetical protein; 30.3 67 0.0015 18.0 2.3 22 71-92 71-92 (95)
261 COG1366 SpoIIAA Anti-anti-sigm 30.2 1.1E+02 0.0024 17.3 3.7 40 75-114 60-99 (117)
262 cd04181 NTP_transferase NTP_tr 30.1 73 0.0016 19.8 2.9 29 76-104 28-56 (217)
263 PRK14842 undecaprenyl pyrophos 29.9 73 0.0016 21.2 2.8 32 68-99 29-60 (241)
264 PRK04017 hypothetical protein; 29.6 50 0.0011 19.8 1.9 25 61-85 67-91 (132)
265 cd04622 CBS_pair_9 The CBS dom 29.5 1E+02 0.0022 16.6 4.0 20 24-45 88-107 (113)
266 PRK14832 undecaprenyl pyrophos 29.5 59 0.0013 21.8 2.4 32 68-99 39-70 (253)
267 COG2185 Sbm Methylmalonyl-CoA 28.8 1.4E+02 0.0031 18.2 3.9 42 74-115 74-119 (143)
268 PRK10240 undecaprenyl pyrophos 28.7 61 0.0013 21.3 2.3 32 68-99 14-45 (229)
269 PRK14019 bifunctional 3,4-dihy 28.7 83 0.0018 22.4 3.1 29 87-115 331-359 (367)
270 cd04592 CBS_pair_EriC_assoc_eu 28.6 1.3E+02 0.0027 17.5 4.2 34 8-46 10-43 (133)
271 PRK14829 undecaprenyl pyrophos 28.6 69 0.0015 21.3 2.6 32 68-99 35-66 (243)
272 PRK14840 undecaprenyl pyrophos 28.4 68 0.0015 21.5 2.5 32 68-99 43-74 (250)
273 cd02541 UGPase_prokaryotic Pro 28.4 63 0.0014 21.2 2.4 29 76-104 30-58 (267)
274 cd04189 G1P_TT_long G1P_TT_lon 28.2 1.1E+02 0.0023 19.6 3.4 30 76-105 30-59 (236)
275 PF02743 Cache_1: Cache domain 28.2 97 0.0021 16.0 3.0 33 8-42 36-68 (81)
276 PRK10122 GalU regulator GalF; 28.0 64 0.0014 21.9 2.4 31 76-106 33-63 (297)
277 cd04621 CBS_pair_8 The CBS dom 28.0 1.3E+02 0.0027 17.4 3.5 33 9-46 11-43 (135)
278 TIGR00151 ispF 2C-methyl-D-ery 27.9 1.5E+02 0.0033 18.3 3.7 32 66-98 63-94 (155)
279 PRK14834 undecaprenyl pyrophos 27.8 91 0.002 20.8 3.0 32 68-99 35-66 (249)
280 PF02388 FemAB: FemAB family; 27.7 2.3E+02 0.0051 20.3 7.8 58 58-115 316-381 (406)
281 TIGR00083 ribF riboflavin kina 27.6 1.6E+02 0.0035 20.1 4.2 31 69-99 6-37 (288)
282 PRK14830 undecaprenyl pyrophos 27.4 87 0.0019 21.0 2.9 29 71-99 46-74 (251)
283 PF14696 Glyoxalase_5: Hydroxy 27.4 59 0.0013 19.6 1.9 28 90-117 8-37 (139)
284 COG2994 HlyC ACP:hemolysin acy 27.2 99 0.0021 18.8 2.8 13 36-48 61-73 (148)
285 PRK03824 hypA hydrogenase nick 27.0 64 0.0014 19.2 2.1 24 73-96 4-27 (135)
286 cd02508 ADP_Glucose_PP ADP-glu 26.9 66 0.0014 20.1 2.3 32 78-109 31-62 (200)
287 PRK14839 undecaprenyl pyrophos 26.9 76 0.0016 21.1 2.5 32 68-99 30-61 (239)
288 cd04617 CBS_pair_4 The CBS dom 26.8 1.2E+02 0.0026 16.7 4.4 29 16-46 82-113 (118)
289 cd04608 CBS_pair_PALP_assoc Th 26.7 1.3E+02 0.0028 17.0 3.5 21 24-46 24-44 (124)
290 TIGR00815 sulP high affinity s 26.2 1.2E+02 0.0025 22.8 3.7 40 76-115 511-550 (563)
291 cd04623 CBS_pair_10 The CBS do 26.2 1.2E+02 0.0025 16.3 4.0 34 8-46 10-43 (113)
292 KOG0122|consensus 26.1 2.1E+02 0.0046 19.3 4.9 40 7-48 200-239 (270)
293 COG0807 RibA GTP cyclohydrolas 25.9 84 0.0018 20.2 2.5 41 66-113 122-163 (193)
294 COG1041 Predicted DNA modifica 25.9 2.5E+02 0.0054 20.0 6.4 61 60-120 264-331 (347)
295 COG5270 PUA domain (predicted 25.8 86 0.0019 20.1 2.5 18 28-47 165-182 (202)
296 PRK10310 PTS system galactitol 25.8 1.2E+02 0.0027 16.6 3.0 23 71-93 9-33 (94)
297 PF11039 DUF2824: Protein of u 25.7 1.6E+02 0.0035 17.8 9.7 46 36-90 46-91 (151)
298 PF00107 ADH_zinc_N: Zinc-bind 25.6 1E+02 0.0022 17.4 2.8 32 83-114 6-37 (130)
299 PF14871 GHL6: Hypothetical gl 25.6 1.6E+02 0.0034 17.5 3.5 26 71-96 37-62 (132)
300 CHL00130 rbcS ribulose-1,5-bis 25.1 1.7E+02 0.0036 17.7 4.0 30 4-33 10-39 (138)
301 COG2428 Uncharacterized conser 25.1 1.2E+02 0.0027 19.1 3.0 40 77-116 14-53 (196)
302 PF02334 RTP: Replication term 24.8 31 0.00068 20.0 0.4 22 71-92 29-50 (122)
303 KOG1602|consensus 24.7 1.5E+02 0.0032 20.1 3.5 30 70-99 59-88 (271)
304 COG2265 TrmA SAM-dependent met 24.5 2E+02 0.0043 21.0 4.4 49 62-115 364-416 (432)
305 PRK14838 undecaprenyl pyrophos 24.3 89 0.0019 20.8 2.5 32 68-99 31-62 (242)
306 COG0529 CysC Adenylylsulfate k 24.1 1E+02 0.0022 19.8 2.5 42 74-115 33-75 (197)
307 cd04609 CBS_pair_PALP_assoc2 T 24.1 1.3E+02 0.0028 16.0 3.1 16 28-45 89-104 (110)
308 PRK14835 undecaprenyl pyrophos 23.9 85 0.0018 21.3 2.4 31 69-99 63-93 (275)
309 cd03527 RuBisCO_small Ribulose 23.8 1.5E+02 0.0033 16.8 4.7 30 4-33 8-37 (99)
310 PF02631 RecX: RecX family; I 23.8 80 0.0017 18.1 2.0 22 71-92 2-23 (121)
311 cd04625 CBS_pair_12 The CBS do 23.7 1.3E+02 0.0029 16.1 3.9 10 36-45 97-106 (112)
312 PRK14828 undecaprenyl pyrophos 23.7 1E+02 0.0023 20.7 2.7 32 68-99 47-79 (256)
313 COG2231 Uncharacterized protei 23.7 57 0.0012 21.2 1.5 40 72-115 120-159 (215)
314 KOG0023|consensus 23.6 1.2E+02 0.0027 21.4 3.1 39 74-113 189-228 (360)
315 cd04589 CBS_pair_CAP-ED_DUF294 22.6 1.4E+02 0.0031 16.0 3.9 29 15-46 78-106 (111)
316 cd04626 CBS_pair_13 The CBS do 22.5 1.4E+02 0.0031 16.0 3.8 21 24-46 23-43 (111)
317 cd06915 NTP_transferase_WcbM_l 22.5 1.5E+02 0.0032 18.5 3.3 26 76-101 28-53 (223)
318 PF01380 SIS: SIS domain SIS d 22.4 1.6E+02 0.0035 16.6 4.0 27 74-101 64-90 (131)
319 COG3448 CBS-domain-containing 22.3 1.7E+02 0.0036 20.6 3.4 25 18-44 339-363 (382)
320 PHA01733 hypothetical protein 22.3 2E+02 0.0044 17.7 7.0 72 36-115 55-130 (153)
321 KOG4717|consensus 22.2 50 0.0011 25.1 1.1 30 84-115 131-160 (864)
322 cd02523 PC_cytidylyltransferas 22.1 87 0.0019 19.9 2.2 29 77-105 29-57 (229)
323 COG5092 NMT1 N-myristoyl trans 22.1 3E+02 0.0064 19.5 7.9 92 28-122 312-419 (451)
324 PF05063 MT-A70: MT-A70 ; Int 21.9 2.1E+02 0.0045 17.7 4.2 14 104-117 62-75 (176)
325 PRK14836 undecaprenyl pyrophos 21.9 1.1E+02 0.0023 20.6 2.5 30 70-99 37-66 (253)
326 COG0196 RibF FAD synthase [Coe 21.7 2.9E+02 0.0062 19.2 4.7 35 66-100 20-55 (304)
327 TIGR02143 trmA_only tRNA (urac 21.6 1.4E+02 0.0031 20.9 3.2 46 62-115 281-330 (353)
328 PRK14827 undecaprenyl pyrophos 21.6 1E+02 0.0022 21.3 2.4 32 68-99 88-119 (296)
329 cd04586 CBS_pair_BON_assoc Thi 21.6 1.7E+02 0.0036 16.6 3.2 18 27-46 27-44 (135)
330 PF13344 Hydrolase_6: Haloacid 21.6 1.5E+02 0.0033 16.4 2.8 24 90-113 78-101 (101)
331 COG2813 RsmC 16S RNA G1207 met 21.5 2.9E+02 0.0063 19.2 4.9 39 60-100 224-267 (300)
332 cd04633 CBS_pair_20 The CBS do 21.5 1.6E+02 0.0034 16.1 3.3 28 16-46 89-116 (121)
333 PRK11660 putative transporter; 21.1 1.8E+02 0.0038 22.0 3.8 39 76-115 508-546 (568)
334 PRK12485 bifunctional 3,4-dihy 21.1 1.6E+02 0.0035 21.0 3.4 28 88-115 335-362 (369)
335 KOG2540|consensus 21.0 2.2E+02 0.0048 18.8 3.7 41 31-71 171-228 (269)
336 KOG0538|consensus 20.8 2.8E+02 0.006 19.6 4.3 35 60-101 123-157 (363)
337 PF02679 ComA: (2R)-phospho-3- 20.8 2.4E+02 0.0051 18.9 3.9 37 79-115 85-130 (244)
338 cd06426 NTP_transferase_like_2 20.8 1.6E+02 0.0035 18.5 3.2 28 76-103 28-55 (220)
339 cd04263 DUF619-NAGK-FABP DUF61 20.8 1.8E+02 0.0038 16.4 6.1 39 57-100 32-70 (98)
340 COG0296 GlgB 1,4-alpha-glucan 20.5 1.2E+02 0.0027 23.3 2.8 40 77-117 164-203 (628)
341 PRK13397 3-deoxy-7-phosphohept 20.4 2E+02 0.0044 19.3 3.6 36 72-107 60-95 (250)
342 COG3414 SgaB Phosphotransferas 20.4 1.5E+02 0.0033 16.5 2.6 23 71-93 8-32 (93)
No 1
>KOG3396|consensus
Probab=99.94 E-value=8.9e-26 Score=129.68 Aligned_cols=119 Identities=48% Similarity=0.854 Sum_probs=112.5
Q ss_pred CCCCCHHHHHHHHHhhhcCCC-eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHH
Q psy6918 5 LPTSNVFHFAERFHRMKASQD-YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVL 83 (123)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~ 83 (123)
++..+++++.++|........ ++++|++|.+.+++||.+++.++..+++..+..++|.++.|++++||+++|+-|+..+
T Consensus 31 vG~vt~e~F~krf~~mk~~~~~Y~i~Vied~~s~~vigtatL~IE~KfIh~~g~rGhiEDVVV~~~~rgk~LGkllv~~L 110 (150)
T KOG3396|consen 31 VGVVTREQFEKRFEAMKKSGDWYYIVVIEDKESEKVIGTATLFIERKFIHGCGSRGHIEDVVVDSEYRGKQLGKLLVETL 110 (150)
T ss_pred ccccCHHHHHHHHHHHHhcCCcEEEEEEEeCCcCeEEEEEEEEEehhhhhcccccCceeEEEeChhhhhhHHhHHHHHHH
Confidence 577899999999999887755 8888999988999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhcCCeEEEEccCCCchhHHHhcCcEEccceeEeeC
Q psy6918 84 VKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIYF 123 (123)
Q Consensus 84 ~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~~ 123 (123)
+..++..|+.++.+.|.+.+.+||+|+||...+.+|.+||
T Consensus 111 v~l~k~lgcYKi~LdC~~~nv~FYeKcG~s~~~~~M~~r~ 150 (150)
T KOG3396|consen 111 VDLAKSLGCYKIILDCDPKNVKFYEKCGYSNAGNEMTKRF 150 (150)
T ss_pred HHHHHhcCcEEEEEecchhhhhHHHHcCccccchhheecC
Confidence 9999999999999999999999999999999999999886
No 2
>PLN02706 glucosamine 6-phosphate N-acetyltransferase
Probab=99.89 E-value=2e-21 Score=117.20 Aligned_cols=118 Identities=41% Similarity=0.711 Sum_probs=93.3
Q ss_pred CCCCHHHHHHHHHhhhcC-CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKAS-QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
++++.+.+.+.+...... ....++++++.+++++||++.+.+...........+.|..++|+|+|||+|+|++|++.++
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~i~V~~~~rg~GiG~~ll~~~~ 111 (150)
T PLN02706 32 GDVTEEEFEARFQELASLGDDHLICVIEDAASGRIIATGSVFVERKFIRNCGKVGHIEDVVVDSAARGKGLGKKIIEALT 111 (150)
T ss_pred CCCCHHHHHHHHHHHHhCCCcEEEEEEEeCCCCcEEEEEEEEEEeecccCCCcEEEEEEEEECHHHcCCCHHHHHHHHHH
Confidence 457778888888765542 3444555554335899999988644332233345678888999999999999999999999
Q ss_pred HHHHhcCCeEEEEccCCCchhHHHhcCcEEccceeEeeC
Q psy6918 85 KLAKHFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIYF 123 (123)
Q Consensus 85 ~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~~ 123 (123)
+++++.|+.++.+.+.+.|.+||+|+||+..+..|.+++
T Consensus 112 ~~a~~~g~~~i~l~~~~~N~~~y~k~GF~~~g~~~~~~~ 150 (150)
T PLN02706 112 EHARSAGCYKVILDCSEENKAFYEKCGYVRKEIQMVKYF 150 (150)
T ss_pred HHHHHcCCCEEEEEeccccHHHHHHCcCEEehhheEecC
Confidence 999999999999999888999999999999988888765
No 3
>PHA00673 acetyltransferase domain containing protein
Probab=99.84 E-value=8.7e-20 Score=109.26 Aligned_cols=102 Identities=15% Similarity=0.190 Sum_probs=88.6
Q ss_pred HHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 12 HFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
.+...+..+..++...++++++ +|++||++.+...+.....+...+.|..++|+|++||+|+|++|++++++++++.|
T Consensus 41 ~y~~af~ai~~dp~~~llVa~~--~g~vVG~~~l~~~p~l~~~~~~~~~Ie~l~V~~~~RGqGIG~~Ll~~A~~~Ar~~G 118 (154)
T PHA00673 41 PDHHAYAGMEAAGVAHFLGVFR--GEELVGFACLLVTPVPHFKGQLIGTTESIFVAAAHRPGGAGMALLRATEALARDLG 118 (154)
T ss_pred hhHHHHHHHHhCCCcEEEEEEE--CCEEEEEEEEEEecCCccCCccEEEEEEEEEChhccCCCHHHHHHHHHHHHHHHCC
Confidence 3445578888888888888887 79999999998877655666678899999999999999999999999999999999
Q ss_pred CeEEEEccC--CCchhHHHhcCcEEc
Q psy6918 92 CYKLTLDCA--DHMIPFYETFGFQKK 115 (123)
Q Consensus 92 ~~~i~~~~~--~~~~~~y~~~Gf~~~ 115 (123)
+..+.+++. .+...||.++|+++.
T Consensus 119 c~~lyis~~p~~~tv~fy~~~g~~~~ 144 (154)
T PHA00673 119 ATGLYVSGPTEGRLVQLLPAAGYRET 144 (154)
T ss_pred CCEEEEecCCCccchHHHHhCCchhh
Confidence 999999874 455789999999997
No 4
>PTZ00330 acetyltransferase; Provisional
Probab=99.84 E-value=2.2e-19 Score=107.81 Aligned_cols=113 Identities=35% Similarity=0.586 Sum_probs=85.1
Q ss_pred CCHHHHHHHHHhhhcC-CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHH
Q psy6918 8 SNVFHFAERFHRMKAS-QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKL 86 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~ 86 (123)
.+.+.+...++..... ....++++.+ +|++||++.+...+.....+...+.|..++|+|+|||+|+|++|++.++++
T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vG~~~~~~~~~~~~~~~~~~~i~~~~V~~~~rg~Gig~~l~~~~~~~ 110 (147)
T PTZ00330 33 LSQEELEQIAARRRLAGVVTRVFVHSP--TQRIVGTASLFVEPKFTRGGKCVGHIEDVVVDPSYRGQGLGRALISDLCEI 110 (147)
T ss_pred cchhHHHHHHHHHhcCCCceEEEEEeC--CCEEEEEEEEEeccccccCCCceEEEEEEEECHHHcCCCHHHHHHHHHHHH
Confidence 4555555444332222 2234555555 789999999875432222333457899999999999999999999999999
Q ss_pred HHhcCCeEEEEccCCCchhHHHhcCcEEccceeEee
Q psy6918 87 AKHFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 87 ~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
+++.++..+.+.+++.+.+||+++||+.....|.+-
T Consensus 111 a~~~~~~~l~l~~n~~a~~~y~k~GF~~~~~~~~~~ 146 (147)
T PTZ00330 111 ARSSGCYKVILDCTEDMVAFYKKLGFRACERQMRLD 146 (147)
T ss_pred HHHCCCCEEEEecChHHHHHHHHCCCEEeceEEEEe
Confidence 999999999998899999999999999998777653
No 5
>PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional
Probab=99.84 E-value=7.4e-20 Score=109.53 Aligned_cols=111 Identities=20% Similarity=0.208 Sum_probs=82.2
Q ss_pred CHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHH
Q psy6918 9 NVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAK 88 (123)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~ 88 (123)
+.+...+.+......+...++++.+ ++++||++.+..... .......+.|..++|+|++||+|+|++|++.++++|+
T Consensus 30 ~~~~~~~~~~~~l~~~~~~~~v~~~--~~~ivG~~~~~~~~~-~~~~~~~~~i~~l~v~p~~rg~GiG~~Ll~~~~~~a~ 106 (144)
T PRK10146 30 DHQAFRVGFNANLRDPNMRYHLALL--DGEVVGMIGLHLQFH-LHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEAR 106 (144)
T ss_pred CHHHHHHHHHHHhcCCCceEEEEEE--CCEEEEEEEEEeccc-ccccchhheeheeEECHHHcCCCHHHHHHHHHHHHHH
Confidence 3344444454444444445556666 799999999875321 1112234578899999999999999999999999999
Q ss_pred hcCCeEEEEccC---CCchhHHHhcCcEEccceeEee
Q psy6918 89 HFQCYKLTLDCA---DHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 89 ~~~~~~i~~~~~---~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
+.++..+.+.++ +.+++||+++||+.....+.++
T Consensus 107 ~~~~~~i~l~~~~~n~~a~~fY~~~Gf~~~~~~~~~~ 143 (144)
T PRK10146 107 QAGAEMTELSTNVKRHDAHRFYLREGYEQSHFRFTKA 143 (144)
T ss_pred HcCCcEEEEecCCCchHHHHHHHHcCCchhhhhheeC
Confidence 999999998764 5678899999999987666554
No 6
>KOG3139|consensus
Probab=99.80 E-value=3e-18 Score=101.72 Aligned_cols=103 Identities=19% Similarity=0.394 Sum_probs=82.0
Q ss_pred HHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEE
Q psy6918 16 RFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
....+...++.+++++.|+ ++..||.+....... .....++|..++|+++|||+|||++|++.+++.++..|+..|
T Consensus 46 tyrYf~~~wp~~~~~a~d~-~~~~VGai~ck~~~~---r~~~rgyi~mLaV~~e~Rg~GIg~aLvr~aId~m~~~g~~eV 121 (165)
T KOG3139|consen 46 TYRYFVPNWPCFCFLALDE-KGDTVGAIVCKLDTH---RNTLRGYIAMLAVDSEYRGQGIGKALVRKAIDAMRSRGYSEV 121 (165)
T ss_pred HHHhcccCCceEEEEEEcC-CCceEEEEEEecccc---CCcceEEEEEEEechhhccccHHHHHHHHHHHHHHHCCCcEE
Confidence 3344555567788888873 233799888875332 124568999999999999999999999999999999999999
Q ss_pred EEcc---CCCchhHHHhcCcEEccceeEee
Q psy6918 96 TLDC---ADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 96 ~~~~---~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
.+.+ +.++.++|+++||+..+..+.+|
T Consensus 122 vLeTe~~n~~A~~LY~sLGF~r~~r~~~YY 151 (165)
T KOG3139|consen 122 VLETEVTNLSALRLYESLGFKRDKRLFRYY 151 (165)
T ss_pred EEeccccchHHHHHHHhcCceEecceeEEE
Confidence 9976 66778899999999987766665
No 7
>PF00583 Acetyltransf_1: Acetyltransferase (GNAT) family; InterPro: IPR000182 The N-acetyltransferases (NAT) (EC 2.3.1.-) are enzymes that use acetyl coenzyme A (CoA) to transfer an acetyl group to a substrate, a reaction implicated in various functions from bacterial antibiotic resistance to mammalian circadian rhythm and chromatin remodeling. The Gcn5-related N-acetyltransferases (GNAT) catalyze the transfer of the acetyl from the CoA donor to a primary amine of the acceptor. The GNAT proteins share a domain composed of four conserved sequence motifs A-D [, ]. This GNAT domain is named after yeast GCN5 (from General Control Nonrepressed) and related histone acetyltransferases (HATs) like Hat1 and PCAF. HATs acetylate lysine residues of amino terminal histone tails, resulting in transcription activation. Another category of GNAT, the aminoglycoside N-acetyltransferases, confer antibiotic resistance by catalyzing the acetylation of amino groups in aminoglycoside antibiotics []. GNAT proteins can also have anabolic and catabolic functions in both prokaryotes and eukaryotes [, , , , ]. The acetyltransferase/GNAT domain forms a structurally conserved fold of 6 to 7 beta strands (B) and 4 helices (H) in the topology B1-H1-H2-B2-B3-B4-H3-B5-H4-B6, followed by a C-terminal strand which may be from the same monomer or contributed by another [, ]. Motifs D (B2-B3), A (B4-H3) and B (B5-H4) are collectively called the HAT core [, , ], while the N-terminal motif C (B1-H1) is less conserved. Some proteins known to contain a GNAT domain: Yeast GCN5 and Hat1, which are histone acetyltransferases (EC 2.3.1.48). Human PCAF, a histone acetyltransferase. Mammalian serotonin N-acetyltransferase (SNAT) or arylalkylamine NAT (AANAT), which acetylates serotonin into a circadian neurohormone that may participate in light-dark rhythms, and human mood and behavior. Mammalian glucosamine 6-phosphate N-acetyltransferase (GNA1) (EC 2.3.1.4). Escherichia coli rimI and rimJ, which acetylate the N-terminal alanine of ribosomal proteins S18 and S5, respectively (EC 2.3.1.128). Mycobacterium tuberculosis aminoglycoside 2'-N-acetyltransferase (aac), which acetylates the 2' hydroxyl or amino group of a broad spectrum of aminoglycoside antibiotics. Bacillus subtilis bltD and paiA, which acetylate spermine and spermidine. This entry represents the entire GNAT domain.; GO: 0008080 N-acetyltransferase activity, 0008152 metabolic process; PDB: 3T9Y_A 2R7H_B 2OZH_A 1Y9W_B 1VKC_B 2OH1_C 3R9E_B 3R9G_B 3R9F_A 3R96_A ....
Probab=99.79 E-value=3.9e-18 Score=93.13 Aligned_cols=77 Identities=26% Similarity=0.420 Sum_probs=67.2
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCc---hhHHHhcCc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHM---IPFYETFGF 112 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~---~~~y~~~Gf 112 (123)
+|+|||++.+....... +......|..++|+|+|||+|+|+.|++.+++.+++.++..+.+.+.+.+ .+||+|+||
T Consensus 4 ~~~ivg~~~~~~~~~~~-~~~~~~~i~~~~v~~~~r~~Gig~~L~~~~~~~~~~~g~~~i~~~~~~~n~~~~~~~~k~Gf 82 (83)
T PF00583_consen 4 DGQIVGFASLRPPPEPF-DHGNHAYIHRLAVDPEYRGQGIGSKLLQAAEEWARKRGIKRIYLDVSPDNPAARRFYEKLGF 82 (83)
T ss_dssp TTEEEEEEEEEEEETTT-TTTTEEEEEEEEECGGGTTSSHHHHHHHHHHHHHHHTTESEEEEEEETTGHHHHHHHHHTTE
T ss_pred CCEEEEEEEEEECCCcc-ccCCEEEEEEEEEcHHHhhCCCchhhhhhhhhhHHhcCccEEEEEEeCCCHHHHHHHHHcCC
Confidence 79999999999765422 22568999999999999999999999999999999999999999886655 589999999
Q ss_pred E
Q psy6918 113 Q 113 (123)
Q Consensus 113 ~ 113 (123)
+
T Consensus 83 ~ 83 (83)
T PF00583_consen 83 E 83 (83)
T ss_dssp E
T ss_pred C
Confidence 6
No 8
>PF13508 Acetyltransf_7: Acetyltransferase (GNAT) domain; PDB: 3EY5_A 3FRM_B 3D8P_B 3GY9_A 3GYA_A 3S6F_A 2Q7B_A 1CM0_B 1XEB_B 1Y7R_A ....
Probab=99.76 E-value=1e-17 Score=90.87 Aligned_cols=77 Identities=32% Similarity=0.421 Sum_probs=63.2
Q ss_pred eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 26 YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 26 ~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
..++++.+ ++++||++.+.... ....|..++|+|++||+|+|++|++.+.+.+.. ..+.+.+++.+.+
T Consensus 3 ~~~~~~~~--~~~ivG~~~~~~~~-------~~~~i~~~~v~~~~rg~Gig~~ll~~~~~~~~~---~~i~l~~~~~~~~ 70 (79)
T PF13508_consen 3 ERFFVAED--DGEIVGFIRLWPNE-------DFAYIGYLAVDPEYRGKGIGSKLLNYLLEKAKS---KKIFLFTNPAAIK 70 (79)
T ss_dssp EEEEEEEE--TTEEEEEEEEEETT-------TEEEEEEEEE-GGGTTSSHHHHHHHHHHHHHTC---SEEEEEEEHHHHH
T ss_pred cEEEEEEE--CCEEEEEEEEEEcC-------CEEEEEEEEECHHHcCCCHHHHHHHHHHHHcCC---CcEEEEEcHHHHH
Confidence 34566777 79999999995332 367999999999999999999999999988854 4466777888999
Q ss_pred HHHhcCcEE
Q psy6918 106 FYETFGFQK 114 (123)
Q Consensus 106 ~y~~~Gf~~ 114 (123)
||+++||++
T Consensus 71 fY~~~GF~~ 79 (79)
T PF13508_consen 71 FYEKLGFEE 79 (79)
T ss_dssp HHHHTTEEE
T ss_pred HHHHCcCCC
Confidence 999999985
No 9
>PF13673 Acetyltransf_10: Acetyltransferase (GNAT) domain; PDB: 2FIW_A 1BOB_A 3FNC_B 3EXN_A.
Probab=99.75 E-value=2.9e-17 Score=95.01 Aligned_cols=83 Identities=25% Similarity=0.439 Sum_probs=67.4
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
+......+...++++++ ++++||++.+. + ...|..++|+|+|||+|+|++|++.+++.++. ++..+.
T Consensus 35 ~~~~~~~~~~~~~v~~~--~~~ivG~~~~~--~--------~~~i~~l~v~p~~r~~Gig~~Ll~~~~~~~~~-~~~~l~ 101 (117)
T PF13673_consen 35 LEEYLEEGSHTIFVAEE--GGEIVGFAWLE--P--------DGEISHLYVLPEYRGRGIGRALLDAAEKEAKD-GIRRLT 101 (117)
T ss_dssp HHHHHCTCCCEEEEEEE--TTEEEEEEEEE--T--------CEEEEEEEE-GGGTTSSHHHHHHHHHHHHHTT-TCEEEE
T ss_pred HHHHHHhcCCEEEEEEE--CCEEEEEEEEc--C--------CCeEEEEEEChhhcCCcHHHHHHHHHHHHHHc-CCcEEE
Confidence 34444455566777877 79999999985 1 23499999999999999999999999999977 999999
Q ss_pred EccCCCchhHHHhcCc
Q psy6918 97 LDCADHMIPFYETFGF 112 (123)
Q Consensus 97 ~~~~~~~~~~y~~~Gf 112 (123)
+..+..+.+||+++||
T Consensus 102 ~~~~~~a~~~y~~~GF 117 (117)
T PF13673_consen 102 VEANERARRFYRKLGF 117 (117)
T ss_dssp EEC-HHHHHHHHHTT-
T ss_pred EEeCHHHHHHHHhCCC
Confidence 9988999999999998
No 10
>PRK10314 putative acyltransferase; Provisional
Probab=99.73 E-value=5.1e-17 Score=98.46 Aligned_cols=84 Identities=17% Similarity=0.309 Sum_probs=68.6
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc-CCeEEEEccCCCchhH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF-QCYKLTLDCADHMIPF 106 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~-~~~~i~~~~~~~~~~~ 106 (123)
.+++.+ ++++||++++.... .......|..++|+|+|||+|+|++||+.+++++++. +...+.+.++..+.+|
T Consensus 50 h~~~~~--~~~~vg~~r~~~~~----~~~~~~~i~rv~V~~~~rG~GiG~~Lm~~~~~~~~~~~~~~~i~L~a~~~a~~f 123 (153)
T PRK10314 50 HILGWK--NDELVAYARILKSD----DDLEPVVIGRVIVSEALRGEKVGQQLMSKTLESCTRHWPDKPVYLGAQAHLQNF 123 (153)
T ss_pred EEEEEE--CCEEEEEEEEecCC----CCCCCEEEEEEEECHHHhCCCHHHHHHHHHHHHHHHHCCCCcEEEehHHHHHHH
Confidence 334444 68999999997532 1123478999999999999999999999999999875 6778888888888899
Q ss_pred HHhcCcEEccc
Q psy6918 107 YETFGFQKKNN 117 (123)
Q Consensus 107 y~~~Gf~~~~~ 117 (123)
|+|+||+..++
T Consensus 124 Y~k~GF~~~g~ 134 (153)
T PRK10314 124 YQSFGFIPVTE 134 (153)
T ss_pred HHHCCCEECCC
Confidence 99999999854
No 11
>TIGR01575 rimI ribosomal-protein-alanine acetyltransferase. Members of this model belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This model covers prokarotes and the archaea. The seed contains a characterized accession for Gram negative E. coli. An untraceable characterized accession (PIR|S66013) for Gram positive B. subtilis scores well (205.0) in the full alignment. Characterized members are lacking in the archaea. Noise cutoff (72.4) was set to exclude M. loti paralog of rimI. Trusted cutoff (80.0) was set at next highest scoring member in the mini-database.
Probab=99.71 E-value=2.8e-16 Score=92.24 Aligned_cols=99 Identities=18% Similarity=0.256 Sum_probs=75.1
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
.+++.+.+...+. .....++++.+ ++++||++.+.... ....+..++|+|++||+|+|++|++++++
T Consensus 15 ~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~vg~~~~~~~~-------~~~~i~~~~v~~~~rg~G~g~~ll~~~~~ 81 (131)
T TIGR01575 15 FPWTEAQFAEELA----NYHLCYLLARI--GGKVVGYAGVQIVL-------DEAHILNIAVKPEYQGQGIGRALLRELID 81 (131)
T ss_pred CCCCHHHHHHHhc----CCCceEEEEec--CCeEEEEEEEEecC-------CCeEEEEEEECHHHcCCCHHHHHHHHHHH
Confidence 3555665555543 33334455554 68999999976432 23568889999999999999999999999
Q ss_pred HHHhcCCeEEEEcc---CCCchhHHHhcCcEEccc
Q psy6918 86 LAKHFQCYKLTLDC---ADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 86 ~~~~~~~~~i~~~~---~~~~~~~y~~~Gf~~~~~ 117 (123)
++.+.++..+.+.+ +..+.+||+++||+..+.
T Consensus 82 ~~~~~~~~~i~~~~~~~n~~~~~~y~~~Gf~~~~~ 116 (131)
T TIGR01575 82 EAKGRGVNEIFLEVRVSNIAAQALYKKLGFNEIAI 116 (131)
T ss_pred HHHHcCCCeEEEEEecccHHHHHHHHHcCCCcccc
Confidence 99999998888865 455678999999998744
No 12
>PRK10975 TDP-fucosamine acetyltransferase; Provisional
Probab=99.71 E-value=3.8e-16 Score=97.98 Aligned_cols=81 Identities=15% Similarity=0.166 Sum_probs=67.3
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCchhHHHhcCc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMIPFYETFGF 112 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~~~y~~~Gf 112 (123)
++++||++.+..... ....|..++|+|++||+|+|++|++.+++++++.|+..+.+.+ |+.+.+||+|+||
T Consensus 110 ~g~~vG~~~l~~~~~------~~~~i~~~~V~p~~rg~Gig~~Ll~~~~~~a~~~g~~~i~l~v~~~N~~a~~~yek~Gf 183 (194)
T PRK10975 110 SGQIQGFVTLRELND------TDARIGLLAVFPGAQGRGIGARLMQAALNWCQARGLTRLRVATQMGNLAALRLYIRSGA 183 (194)
T ss_pred CCCEEEEEEEEecCC------CceEEEEEEEChhhcCCCHHHHHHHHHHHHHHHcCCCEEEEEeCCCcHHHHHHHHHCCC
Confidence 689999998864211 2367888999999999999999999999999999999998876 4455789999999
Q ss_pred EEccceeEee
Q psy6918 113 QKKNNFMQIY 122 (123)
Q Consensus 113 ~~~~~~~~~~ 122 (123)
+..+..+-.|
T Consensus 184 ~~~~~~~~~~ 193 (194)
T PRK10975 184 NIESTAYWLY 193 (194)
T ss_pred eEeEEEeeec
Confidence 9997766443
No 13
>PRK03624 putative acetyltransferase; Provisional
Probab=99.71 E-value=2.6e-16 Score=93.33 Aligned_cols=84 Identities=27% Similarity=0.427 Sum_probs=68.0
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---C
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---A 100 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~ 100 (123)
+...++++.+ ++++||++.+.... ....+..++|+|+|||+|+|++|++.+++++++.+++.+.+.+ |
T Consensus 43 ~~~~~~v~~~--~~~~vG~~~~~~~~-------~~~~i~~i~v~p~~rg~Gig~~ll~~~~~~~~~~~~~~~~~~~~~~N 113 (140)
T PRK03624 43 DPSLFLVAEV--GGEVVGTVMGGYDG-------HRGWAYYLAVHPDFRGRGIGRALVARLEKKLIARGCPKINLQVREDN 113 (140)
T ss_pred CCceEEEEEc--CCcEEEEEEeeccC-------CCceEEEEEECHHHhCCCHHHHHHHHHHHHHHHCCCCEEEEEEecCc
Confidence 3345566666 68999999875321 2356788999999999999999999999999999999888765 4
Q ss_pred CCchhHHHhcCcEEcc
Q psy6918 101 DHMIPFYETFGFQKKN 116 (123)
Q Consensus 101 ~~~~~~y~~~Gf~~~~ 116 (123)
+.+.+||+|+||+..+
T Consensus 114 ~~~~~~y~k~GF~~~~ 129 (140)
T PRK03624 114 DAVLGFYEALGYEEQD 129 (140)
T ss_pred HHHHHHHHHcCCcccc
Confidence 5567899999999874
No 14
>TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase. This model represents the WecD protein (Formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetytransferases (pfam00583).
Probab=99.71 E-value=3.9e-16 Score=97.74 Aligned_cols=79 Identities=11% Similarity=0.069 Sum_probs=66.2
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCchhHHHhcCc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMIPFYETFGF 112 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~~~y~~~Gf 112 (123)
++++||++.+..... ..+.|..++|+|++||+|+|++|++.+++++++.|+..|.+.+ |+.+.+||+|+||
T Consensus 107 ~g~iiG~i~l~~~~~------~~~~i~~l~V~p~~rGkG~G~~ll~~~~~~a~~~g~~~I~l~v~~~N~~A~~~Y~klGF 180 (191)
T TIGR02382 107 SGDPRGYVTLRELND------TDARIGLLAVFPGAQSRGIGAELMQTALNWCYARGLTRLRVATQMGNTAALRLYIRSGA 180 (191)
T ss_pred CCeEEEEEEEEecCC------CceEEEEEEECHHHcCCCHHHHHHHHHHHHHHHcCCCEEEEEeCCCCHHHHHHHHHcCC
Confidence 689999999864321 2467888999999999999999999999999999999999886 4455789999999
Q ss_pred EEccceeE
Q psy6918 113 QKKNNFMQ 120 (123)
Q Consensus 113 ~~~~~~~~ 120 (123)
+..+..+-
T Consensus 181 ~~~~~~~~ 188 (191)
T TIGR02382 181 NIESTAYW 188 (191)
T ss_pred ccccceee
Confidence 99865443
No 15
>KOG3216|consensus
Probab=99.70 E-value=6.5e-16 Score=90.99 Aligned_cols=97 Identities=19% Similarity=0.173 Sum_probs=77.1
Q ss_pred hhcCCCeEEEE-EEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 20 MKASQDYLVTV-IEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 20 ~~~~~~~~~~~-~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
+.+++-..+.+ ..+.+++.++|++.+...- .++.+....+|.+++|.|+|||+|+|++|++.+-+.|.+.|+.++.+.
T Consensus 46 F~d~~~~~~~v~~ie~~~~~~aGf~~yf~~y-stW~~k~~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A~~~G~~rv~w~ 124 (163)
T KOG3216|consen 46 FIDPPFKHWLVAAIETSGEVVAGFALYFNNY-STWLGKQGIYLEDLYVREQYRGKGIGSKLLKFVAEEADKLGTPRVEWV 124 (163)
T ss_pred ccCCCccEEEEEEEecCCCceeEEeeeeccc-ccccccceEEEEeeEecchhcccChHHHHHHHHHHHHHHcCCCcEEEE
Confidence 34444444443 3334478999999987532 345555778999999999999999999999999999999999888776
Q ss_pred c---CCCchhHHHhcCcEEccc
Q psy6918 99 C---ADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 99 ~---~~~~~~~y~~~Gf~~~~~ 117 (123)
+ |.+++.||++.|++....
T Consensus 125 vldwN~rAi~lY~k~gaq~l~~ 146 (163)
T KOG3216|consen 125 VLDWNHRAILLYEKVGAQDLKE 146 (163)
T ss_pred EeccchhHHHHHHHhCccccce
Confidence 5 778889999999999866
No 16
>cd02169 Citrate_lyase_ligase Citrate lyase ligase. Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction.
Probab=99.68 E-value=4.5e-16 Score=102.89 Aligned_cols=79 Identities=25% Similarity=0.407 Sum_probs=68.9
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCch
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMI 104 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~ 104 (123)
+..+++.++ ++++||++++.. ..|..++|+|+|||+|+|++||+.+++++++.|+..+.+.++..+.
T Consensus 5 ~~~~~v~~~--~~~iVG~~~l~~-----------~~I~~vaV~p~~Rg~GiG~~Ll~~l~~~a~~~g~~~i~L~t~~~~~ 71 (297)
T cd02169 5 DYTVGIFDD--AGELIATGSIAG-----------NVLKCVAVCPKYQGEGLALKIVSELINKAYEEGIFHLFLFTKPKNA 71 (297)
T ss_pred cEEEEEEEE--CCEEEEEEEecc-----------CEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEEEcccHH
Confidence 344555665 699999998851 2588999999999999999999999999999999999999988889
Q ss_pred hHHHhcCcEEcc
Q psy6918 105 PFYETFGFQKKN 116 (123)
Q Consensus 105 ~~y~~~Gf~~~~ 116 (123)
+||+|+||+..+
T Consensus 72 ~fYek~GF~~~~ 83 (297)
T cd02169 72 KFFRGLGFKELA 83 (297)
T ss_pred HHHHHCCCEEec
Confidence 999999999876
No 17
>PF13527 Acetyltransf_9: Acetyltransferase (GNAT) domain; PDB: 3SXN_C 2I00_D 1M4D_B 1M44_A 1M4G_B 1M4I_A 2OZG_A 2HV2_F 3N7Z_A 3RYO_B ....
Probab=99.68 E-value=8.6e-15 Score=85.86 Aligned_cols=84 Identities=30% Similarity=0.413 Sum_probs=66.2
Q ss_pred EEEEEEcCCCCeEEEEEEEEEeeecccc-cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 27 LVTVIEDTRTKQVIGTGSLILEQKFIHE-CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 27 ~~~~~~~~~~~~ivG~~~~~~~~~~~~~-~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
.++++.+ +++|||++.+.+..-...+ .-..+.+..++|+|+|||+|+|++|++++++.+++.|+..+.+.. ...+
T Consensus 42 ~~~~~~~--~~~ivg~~~~~~~~~~~~g~~~~~~~i~~v~v~p~~R~~Gl~~~L~~~~~~~~~~~g~~~~~l~~--~~~~ 117 (127)
T PF13527_consen 42 RCVVAED--DGKIVGHVGLIPRRLSVGGKKFKAAYIGDVAVDPEYRGRGLGRQLMRALLERARERGVPFIFLFP--SSPP 117 (127)
T ss_dssp EEEEEEE--TTEEEEEEEEEEEEEEETTEEEEEEEEEEEEE-GGGTTSSHHHHHHHHHHHHHHHTT-SEEEEE---SSHH
T ss_pred cEEEEEE--CCEEEEEEEEEEEEEEECCEEEEEEEEEEEEECHHHcCCCHHHHHHHHHHHHHHhCCCCEEEEec--CChh
Confidence 4566666 7999999999765432222 224679999999999999999999999999999999998777754 5589
Q ss_pred HHHhcCcEE
Q psy6918 106 FYETFGFQK 114 (123)
Q Consensus 106 ~y~~~Gf~~ 114 (123)
||+++||+.
T Consensus 118 ~Y~~~G~~~ 126 (127)
T PF13527_consen 118 FYRRFGFEY 126 (127)
T ss_dssp HHHHTTEEE
T ss_pred hhhcCCCEE
Confidence 999999985
No 18
>PRK10140 putative acetyltransferase YhhY; Provisional
Probab=99.68 E-value=1.4e-15 Score=92.56 Aligned_cols=89 Identities=13% Similarity=0.272 Sum_probs=66.6
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHh-cCCeEEEEcc---C
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKH-FQCYKLTLDC---A 100 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~-~~~~~i~~~~---~ 100 (123)
....+++.+ ++++||++++..... ........ ..++|+|++||+|+|++|++.+++++++ .++..+.+.+ |
T Consensus 50 ~~~~~v~~~--~~~~vG~~~~~~~~~--~~~~~~~~-~~~~v~p~~rg~Gig~~ll~~l~~~~~~~~~~~~i~l~v~~~N 124 (162)
T PRK10140 50 GIKQLVACI--DGDVVGHLTIDVQQR--PRRSHVAD-FGICVDSRWKNRGVASALMREMIEMCDNWLRVDRIELTVFVDN 124 (162)
T ss_pred CcEEEEEEE--CCEEEEEEEEecccc--cccceEEE-EEEEECHHHcCCCHHHHHHHHHHHHHHhhCCccEEEEEEEcCC
Confidence 334556665 699999999974321 11112222 3589999999999999999999999988 5888887765 5
Q ss_pred CCchhHHHhcCcEEccce
Q psy6918 101 DHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 101 ~~~~~~y~~~Gf~~~~~~ 118 (123)
..+.+||+|+||+..+..
T Consensus 125 ~~a~~~y~k~GF~~~g~~ 142 (162)
T PRK10140 125 APAIKVYKKYGFEIEGTG 142 (162)
T ss_pred HHHHHHHHHCCCEEEeec
Confidence 666889999999987553
No 19
>TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein.
Probab=99.67 E-value=1.6e-15 Score=107.69 Aligned_cols=97 Identities=21% Similarity=0.285 Sum_probs=73.9
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---C
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---A 100 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~ 100 (123)
....++++.+.++|++||++.+...............|..++|+|+|||+|+|++|++.+++++++.|+..+.+.+ |
T Consensus 121 ~~~~~~vA~~~~~g~IVG~~~~~~~~~~~~d~~~~~~i~~l~V~P~~Rg~GIG~~Ll~~l~e~a~~~G~~~i~L~V~~~N 200 (547)
T TIGR03103 121 RAITYLVAEDEASGAIIGTVMGVDHRKAFNDPEHGSSLWCLAVDPQAAHPGVGEALVRALAEHFQSRGCAYMDLSVMHDN 200 (547)
T ss_pred CCceEEEEEECCCCeEEEEEEEEeccccccCCCCCeEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEEEcCCC
Confidence 3445566664336999999986533222222334467899999999999999999999999999999999998876 4
Q ss_pred CCchhHHHhcCcEEccceeE
Q psy6918 101 DHMIPFYETFGFQKKNNFMQ 120 (123)
Q Consensus 101 ~~~~~~y~~~Gf~~~~~~~~ 120 (123)
..+++||+|+||+....+..
T Consensus 201 ~~Ai~fY~klGf~~~~~y~~ 220 (547)
T TIGR03103 201 EQAIALYEKLGFRRIPVFAL 220 (547)
T ss_pred HHHHHHHHHCCCEEeeEEEE
Confidence 55688999999998866543
No 20
>PF13420 Acetyltransf_4: Acetyltransferase (GNAT) domain; PDB: 3DR8_A 3DR6_A 2AE6_B 2JLM_C 2J8R_A 1YVO_B 2J8M_A 2J8N_A 2BL1_A 3IWG_A ....
Probab=99.66 E-value=5e-15 Score=89.60 Aligned_cols=106 Identities=25% Similarity=0.426 Sum_probs=77.7
Q ss_pred CCHHHHHHHHHhhh-cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHH
Q psy6918 8 SNVFHFAERFHRMK-ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKL 86 (123)
Q Consensus 8 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~ 86 (123)
.+.+.++.++.... .+....+.+..+ +|++||++.+.... +....+.+ .++|.|++|++|+|+.|++.++++
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~v~~~--~g~iiG~~~~~~~~----~~~~~~~~-~~~v~~~~~~~gig~~l~~~l~~~ 104 (155)
T PF13420_consen 32 DSEESFERWIESIIDSSKQRLFLVAEE--DGKIIGYVSLRDID----PYNHTAEL-SIYVSPDYRGKGIGRKLLDELIEY 104 (155)
T ss_dssp S-HHHHHHHHHHHHHHHTTEEEEEEEC--TTEEEEEEEEEESS----SGTTEEEE-EEEEEGGGTTSSHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHhcccCCCcEEEEEEc--CCcEEEEEEEEeee----ccCCEEEE-eeEEChhHCCCcHHHHHHHHHHHH
Confidence 45667777777653 334555555554 79999999998432 22333444 478889999999999999999999
Q ss_pred H-HhcCCeEEEEcc---CCCchhHHHhcCcEEccceeE
Q psy6918 87 A-KHFQCYKLTLDC---ADHMIPFYETFGFQKKNNFMQ 120 (123)
Q Consensus 87 ~-~~~~~~~i~~~~---~~~~~~~y~~~Gf~~~~~~~~ 120 (123)
+ ++.|++++.+.+ |+.+++||+++||+..+..-.
T Consensus 105 af~~~~~~~i~~~v~~~N~~~i~~~~~~GF~~~g~~~~ 142 (155)
T PF13420_consen 105 AFKELGIHKIYLEVFSSNEKAINFYKKLGFEEEGELKD 142 (155)
T ss_dssp H-HHTT-CEEEEEEETT-HHHHHHHHHTTEEEEEEEEE
T ss_pred hhhccCeEEEEEEEecCCHHHHHHHHhCCCEEEEEEec
Confidence 9 889999998765 667788999999999865443
No 21
>COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]
Probab=99.66 E-value=1e-14 Score=88.67 Aligned_cols=109 Identities=23% Similarity=0.261 Sum_probs=83.0
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
.+++.+...+++......+ +.++++++. +|+++|++.+..... .+.-......+++|+|+.||+|+|++|++++++
T Consensus 33 ~~~~~~~~~~~~~~~~~~g-~p~~V~~~~-~g~v~G~a~~~~fr~--r~ay~~tve~SiYv~~~~~g~GiG~~Ll~~Li~ 108 (169)
T COG1247 33 DPVSLEERAAWFSGRTRDG-YPVVVAEEE-DGKVLGYASAGPFRE--RPAYRHTVELSIYLDPAARGKGLGKKLLQALIT 108 (169)
T ss_pred cCCCHHHHHHHHHhcccCC-ceEEEEEcC-CCeEEEEEEeeeccC--ccccceEEEEEEEECcccccccHHHHHHHHHHH
Confidence 4667788877776554444 556666541 599999999975432 222234456679999999999999999999999
Q ss_pred HHHhcCCeEEEEcc---CCCchhHHHhcCcEEccce
Q psy6918 86 LAKHFQCYKLTLDC---ADHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 86 ~~~~~~~~~i~~~~---~~~~~~~y~~~Gf~~~~~~ 118 (123)
.+.+.|++.+.... |..+.++++++||++.+.+
T Consensus 109 ~~~~~g~~~lva~I~~~n~aSi~lh~~~GF~~~G~~ 144 (169)
T COG1247 109 EARALGVRELVAGIESDNLASIALHEKLGFEEVGTF 144 (169)
T ss_pred HHHhCCeEEEEEEEcCCCcHhHHHHHHCCCEEeccc
Confidence 99999998777654 5677889999999998544
No 22
>PF13523 Acetyltransf_8: Acetyltransferase (GNAT) domain; PDB: 2VQY_A 2BUE_A 1V0C_A 1YK3_D 2PR8_A 2QIR_A 2PRB_A 2QML_A 2PC1_A.
Probab=99.66 E-value=8.8e-15 Score=88.40 Aligned_cols=95 Identities=26% Similarity=0.357 Sum_probs=71.3
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc-CCeEEEEccC
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF-QCYKLTLDCA 100 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~-~~~~i~~~~~ 100 (123)
..+....+++.. +|+++|++.+...............++.++++|++||+|+|+.+++.+++.+.+. ++..|.+.++
T Consensus 44 ~~~~~~~~v~~~--dg~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rg~G~g~~~~~~~~~~~~~~~~~~~i~~~~~ 121 (152)
T PF13523_consen 44 ADPGHHPYVAED--DGEPIGYFEIYWPDEDYDADDGDRGIHRLIVDPEYRGQGLGKAMLRALIEFLFEDPGVDRIVLDPH 121 (152)
T ss_dssp HTTTEEEEEEEE--TTEEEEEEEEEEGGGSS---TTEEEEEEEESTGGGTTSSHHHHHHHHHHHHHHTSTT--EEEEEEB
T ss_pred ccCCceEEEEEE--CCEEEEEEEEecccccccCCCCEEEEeeeeechhhcCCCHHHHHHHHHHHHHHhCCCCCEEEEecC
Confidence 356666666776 7999999988653332222345667888999999999999999999999999987 8999999875
Q ss_pred CC---chhHHHhcCcEEccce
Q psy6918 101 DH---MIPFYETFGFQKKNNF 118 (123)
Q Consensus 101 ~~---~~~~y~~~Gf~~~~~~ 118 (123)
.. ++++|+|+||+..+.+
T Consensus 122 ~~N~~~~~~~~k~GF~~~g~~ 142 (152)
T PF13523_consen 122 EDNTRAIRLYEKAGFRKVGEF 142 (152)
T ss_dssp TT-HHHHHHHHHTT-EEEEEE
T ss_pred cCCHHHHHHHHHcCCEEeeEE
Confidence 54 4779999999998554
No 23
>TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase. This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analagous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.
Probab=99.66 E-value=2.1e-15 Score=91.78 Aligned_cols=86 Identities=17% Similarity=0.176 Sum_probs=66.8
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCch
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMI 104 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~ 104 (123)
++++.+. ++++||++.+.... .......+..++|+|++||+|+|++|++.+++++++.++..+.+.+ |+.++
T Consensus 41 ~~v~~~~-~~~ivG~~~~~~~~----~~~~~~~i~~l~V~p~~rg~GiG~~L~~~l~~~a~~~~~~~i~~~v~~~N~~a~ 115 (157)
T TIGR02406 41 SIVAESE-GGEIVGFVSGYLRP----DRPDVLFVWQVAVDPRARGKGLARRLLEALLERVACERVRHLETTITPDNQASR 115 (157)
T ss_pred EEEEEcC-CCeEEEEEEEEecC----CCCCeEEEEEEEEChHhccCcHHHHHHHHHHHHHHhCCCCEEEEEEcCCCHHHH
Confidence 4455521 68999998765322 1234578889999999999999999999999999999988888776 44557
Q ss_pred hHHHhcCcEEccce
Q psy6918 105 PFYETFGFQKKNNF 118 (123)
Q Consensus 105 ~~y~~~Gf~~~~~~ 118 (123)
+||+|+||+....+
T Consensus 116 ~ly~k~G~~~~~~~ 129 (157)
T TIGR02406 116 ALFKALARRRGVHL 129 (157)
T ss_pred HHHHHhCcccCCCe
Confidence 79999999876443
No 24
>TIGR03827 GNAT_ablB putative beta-lysine N-acetyltransferase. Members of this protein family are GNAT family acetyltransferases, based on a seed alignment in which every member is associated with a lysine 2,3-aminomutase family protein, usually as the adjacent gene. This family includes AblB, the enzyme beta-lysine acetyltransferase that completes the two-step synthesis of the osmolyte (compatible solute) N-epsilon-acetyl-beta-lysine; all members of the family may have this function. Note that N-epsilon-acetyl-beta-lysine has been observed only in methanogenic archaea (e.g. Methanosarcina) but that this model, paired with TIGR03820, suggests a much broader distribution.
Probab=99.65 E-value=3.8e-15 Score=97.67 Aligned_cols=87 Identities=18% Similarity=0.213 Sum_probs=69.8
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CC
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---AD 101 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~ 101 (123)
...++++.+ ++++||++++.... ....+.|..++|+|+|||+|+|++|++.+++++++.|+..+.+.+ +.
T Consensus 157 ~~~~~v~~~--~g~iVG~~~~~~~~-----~~~~~eI~~i~V~P~yRG~GiG~~Ll~~l~~~a~~~g~~~l~~~~~~~n~ 229 (266)
T TIGR03827 157 NVVYFGVED--GGKIIALASAEMDP-----ENGNAEMTDFATLPEYRGKGLAKILLAAMEKEMKEKGIRTAYTIARASSY 229 (266)
T ss_pred CcEEEEEEE--CCEEEEEEEEecCC-----CCCcEEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCcEEEeehhhcch
Confidence 344555665 69999999874321 234578999999999999999999999999999999999888766 45
Q ss_pred CchhHHHhcCcEEccce
Q psy6918 102 HMIPFYETFGFQKKNNF 118 (123)
Q Consensus 102 ~~~~~y~~~Gf~~~~~~ 118 (123)
.+..+|+|+||+..+..
T Consensus 230 ~a~~ly~k~GF~~~G~l 246 (266)
T TIGR03827 230 GMNITFARLGYAYGGTL 246 (266)
T ss_pred hHHHHHHHcCCccccEE
Confidence 55789999999987554
No 25
>PRK07922 N-acetylglutamate synthase; Validated
Probab=99.65 E-value=3e-15 Score=92.09 Aligned_cols=79 Identities=23% Similarity=0.368 Sum_probs=65.5
Q ss_pred EEEEE-cCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhH
Q psy6918 28 VTVIE-DTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPF 106 (123)
Q Consensus 28 ~~~~~-~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~ 106 (123)
++++. + +++++|++.+.... ...+.|..++|+|++||+|+|++|++++++++++.|+..+.+.+. +.+|
T Consensus 47 ~~va~~~--~~~iiG~~~~~~~~------~~~~~i~~l~V~p~~rgkGiG~~Ll~~~~~~a~~~g~~~l~~~~~--~~~f 116 (169)
T PRK07922 47 FWVAEHL--DGEVVGCGALHVMW------EDLAEIRTVAVDPAARGRGVGHAIVERLLDVARELGLSRVFVLTF--EVEF 116 (169)
T ss_pred EEEEEec--CCcEEEEEEEeecC------CCceEEEEEEECHHHhCCCHHHHHHHHHHHHHHHcCCCEEEEEec--cHHH
Confidence 44555 5 78999999876422 134678899999999999999999999999999999999887653 4789
Q ss_pred HHhcCcEEcc
Q psy6918 107 YETFGFQKKN 116 (123)
Q Consensus 107 y~~~Gf~~~~ 116 (123)
|+|+||+..+
T Consensus 117 Y~k~GF~~~~ 126 (169)
T PRK07922 117 FARHGFVEID 126 (169)
T ss_pred HHHCCCEECc
Confidence 9999999874
No 26
>PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional
Probab=99.65 E-value=1e-14 Score=87.61 Aligned_cols=82 Identities=21% Similarity=0.307 Sum_probs=65.4
Q ss_pred eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCC
Q psy6918 26 YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADH 102 (123)
Q Consensus 26 ~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~ 102 (123)
...+.+.+ ++++||++.+.... ....+..++|+|++||+|+|+++++.+++.+++.++..+.+.+ |..
T Consensus 40 ~~~~~~~~--~~~~vG~~~~~~~~-------~~~~~~~i~v~~~~rg~G~g~~ll~~~~~~~~~~~~~~~~~~~~~~N~~ 110 (146)
T PRK09491 40 YLNLKLTV--NGQMAAFAITQVVL-------DEATLFNIAVDPDYQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAA 110 (146)
T ss_pred ceEEEEEE--CCeEEEEEEEEeec-------CceEEEEEEECHHHccCCHHHHHHHHHHHHHHHCCCcEEEEEEccCCHH
Confidence 33344444 68999999886322 1356788999999999999999999999999989998888765 455
Q ss_pred chhHHHhcCcEEcc
Q psy6918 103 MIPFYETFGFQKKN 116 (123)
Q Consensus 103 ~~~~y~~~Gf~~~~ 116 (123)
+.+||+|+||+..+
T Consensus 111 a~~~y~k~Gf~~~~ 124 (146)
T PRK09491 111 AIALYESLGFNEVT 124 (146)
T ss_pred HHHHHHHcCCEEee
Confidence 67899999999764
No 27
>PRK07757 acetyltransferase; Provisional
Probab=99.64 E-value=5.2e-15 Score=89.36 Aligned_cols=79 Identities=20% Similarity=0.414 Sum_probs=64.7
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
++++.+ ++++||++.+..... ....|..++|+|++||+|+|++|++.+++.+++.++..+.+.+ .+.+||
T Consensus 43 ~~i~~~--~~~lvG~~~l~~~~~------~~~~i~~v~V~p~~rg~Glg~~Ll~~l~~~a~~~g~~~i~~~~--~~~~~Y 112 (152)
T PRK07757 43 FYVAEE--EGEIVGCCALHILWE------DLAEIRSLAVSEDYRGQGIGRMLVEACLEEARELGVKRVFALT--YQPEFF 112 (152)
T ss_pred EEEEEE--CCEEEEEEEEEeccC------CceEEEEEEECHHHcCCCHHHHHHHHHHHHHHhCCCCeEEEEe--CcHHHH
Confidence 344555 699999999874321 3467889999999999999999999999999999998886654 357899
Q ss_pred HhcCcEEcc
Q psy6918 108 ETFGFQKKN 116 (123)
Q Consensus 108 ~~~Gf~~~~ 116 (123)
+|+||++.+
T Consensus 113 ~k~GF~~~~ 121 (152)
T PRK07757 113 EKLGFREVD 121 (152)
T ss_pred HHCCCEEcc
Confidence 999999973
No 28
>PHA01807 hypothetical protein
Probab=99.63 E-value=1.5e-14 Score=87.44 Aligned_cols=82 Identities=13% Similarity=0.131 Sum_probs=61.8
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCc-
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHM- 103 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~- 103 (123)
....+++.+ ++++||++.+...... .......+..++|+|+|||+|+|++||+.+++.+++.|+..+.++++..+
T Consensus 52 ~~~~lva~~--dg~lvG~~~l~~~~~~--~~~~i~~l~~lYV~pe~RG~GiG~~Ll~~~~~~Ar~~G~~~l~l~v~~~n~ 127 (153)
T PHA01807 52 DRTELLVFR--DGKLAGIAVLVFEDDP--HVGPCLGVQWQYVLPEYRNAGVAREFLRELIRLAGEGNLPLIAFSHREGEG 127 (153)
T ss_pred CceEEEEEE--CCEEEEEEEEEcCCCc--ceeeeccceeEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEEecCCcH
Confidence 344455665 6999999998754321 11122234558999999999999999999999999999999999886655
Q ss_pred --hhHHHhc
Q psy6918 104 --IPFYETF 110 (123)
Q Consensus 104 --~~~y~~~ 110 (123)
..||++.
T Consensus 128 ~a~~~y~~~ 136 (153)
T PHA01807 128 RYTIHYRRV 136 (153)
T ss_pred HHHHHHHhc
Confidence 5699874
No 29
>PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional
Probab=99.62 E-value=2.8e-14 Score=89.43 Aligned_cols=92 Identities=10% Similarity=0.164 Sum_probs=67.3
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHh-cCCeEEEEccC
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKH-FQCYKLTLDCA 100 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~-~~~~~i~~~~~ 100 (123)
..+....+++.+.+++++||.+.+..... ........++.|.|++||+|+|+++++.+++++.+ .++++|.+.+.
T Consensus 71 ~~~~~~~~~i~~~~~~~~iG~i~l~~~~~----~~~~~~eig~~i~~~~~G~G~~~ea~~~ll~~~~~~l~l~~i~~~v~ 146 (194)
T PRK10809 71 KQGSAFYFALLDPDEKEIIGVANFSNVVR----GSFHACYLGYSLGQKWQGQGLMFEALQAAIRYMQRQQHMHRIMANYM 146 (194)
T ss_pred hcCcEEEEEEEECCCCeEEEEEEEEeecC----CCeeeEEEEEEECHHHcCCCHHHHHHHHHHHHHHhcCCceEEEEEee
Confidence 34444444443323689999999974321 11222344678999999999999999999999988 59999998874
Q ss_pred ---CCchhHHHhcCcEEccc
Q psy6918 101 ---DHMIPFYETFGFQKKNN 117 (123)
Q Consensus 101 ---~~~~~~y~~~Gf~~~~~ 117 (123)
..+.++|+|+||+..+.
T Consensus 147 ~~N~~S~~l~ek~Gf~~~g~ 166 (194)
T PRK10809 147 PHNKRSGDLLARLGFEKEGY 166 (194)
T ss_pred CCCHHHHHHHHHCCCcEEee
Confidence 45578999999998643
No 30
>COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism]
Probab=99.61 E-value=1.7e-14 Score=85.71 Aligned_cols=79 Identities=25% Similarity=0.352 Sum_probs=68.2
Q ss_pred EEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHH
Q psy6918 29 TVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYE 108 (123)
Q Consensus 29 ~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~ 108 (123)
.++++ +|.++|++.+.+.. ....+.+..++|+|++|++|+|..|++.++..|++.|++.+.+.++ ....||+
T Consensus 43 ~i~E~--~g~viGC~aL~~~~-----~~~~gE~~~laV~pd~r~~G~G~~Ll~~~~~~Ar~~gi~~lf~LTt-~~~~~F~ 114 (153)
T COG1246 43 TIIER--DGKVIGCAALHPVL-----EEDLGELRSLAVHPDYRGSGRGERLLERLLADARELGIKELFVLTT-RSPEFFA 114 (153)
T ss_pred eeeee--CCcEEEEEeecccC-----ccCeeeEEEEEECHHhcCCCcHHHHHHHHHHHHHHcCCceeeeeec-ccHHHHH
Confidence 34555 79999999998411 1357899999999999999999999999999999999999988664 7889999
Q ss_pred hcCcEEc
Q psy6918 109 TFGFQKK 115 (123)
Q Consensus 109 ~~Gf~~~ 115 (123)
++||+..
T Consensus 115 ~~GF~~v 121 (153)
T COG1246 115 ERGFTRV 121 (153)
T ss_pred HcCCeEC
Confidence 9999986
No 31
>COG0456 RimI Acetyltransferases [General function prediction only]
Probab=99.60 E-value=2.5e-14 Score=88.15 Aligned_cols=78 Identities=22% Similarity=0.373 Sum_probs=62.5
Q ss_pred eEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCC-eEEEEcc---CCCchhHHHhcCcE
Q psy6918 38 QVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQC-YKLTLDC---ADHMIPFYETFGFQ 113 (123)
Q Consensus 38 ~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~-~~i~~~~---~~~~~~~y~~~Gf~ 113 (123)
+++|++........... ...+.|..++|+|+|||+|+|++|++.+++.+++.+. ..+.+.+ |..++.||+++||+
T Consensus 72 ~~~G~~~~~~~~~~~~~-~~~~~i~~iaV~p~~r~~Gig~~Ll~~~~~~~~~~~~~~~~~L~V~~~N~~Ai~lY~~~GF~ 150 (177)
T COG0456 72 KVVGFLLVRVVDGRPSA-DHEGHIYNLAVDPEYRGRGIGRALLDEALERLRERGLADKIVLEVRESNEAAIGLYRKLGFE 150 (177)
T ss_pred ceeEEEEEEEecCCccc-cCccEEEEEEEChHhhcCCHHHHHHHHHHHHHHhcCCCceEEEEEecCChHHHHHHHHcCCE
Confidence 59999998632221111 3368999999999999999999999999999999886 7777765 55668899999999
Q ss_pred Ecc
Q psy6918 114 KKN 116 (123)
Q Consensus 114 ~~~ 116 (123)
...
T Consensus 151 ~~~ 153 (177)
T COG0456 151 VVK 153 (177)
T ss_pred EEe
Confidence 863
No 32
>TIGR01211 ELP3 histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases.
Probab=99.59 E-value=2.5e-14 Score=100.71 Aligned_cols=100 Identities=17% Similarity=0.256 Sum_probs=79.5
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeeccc--ccceeEEEEEEEec-----------ccccCCchHHHHHHHHHHHHHhc
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIH--ECALKGKIEEVVVD-----------DTYRGKELGKLLIAVLVKLAKHF 90 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~--~~~~~~~i~~~~v~-----------p~~rg~G~g~~Ll~~~~~~~~~~ 90 (123)
+.-.+..+++..++.++|++.+........ .....+.|..+.|. |+|||+|+|++||+.+++.+++.
T Consensus 410 G~e~F~~y~~~~~~~l~G~lrlr~~~~~~~~~~~~~~a~IrelhV~G~~~~~~~~~~~~~rg~GiG~~Ll~~ae~~Ar~~ 489 (522)
T TIGR01211 410 GTEFFLSYEDPKNDILIGFLRLRFPSEPAHRKEVDATALVRELHVYGSEVPIGERGDDEWQHRGYGRRLLEEAERIAAEE 489 (522)
T ss_pred CCeEEEEEEcCCCCeEEEEEEEecCcccccccccCCCceEEEEEEeeeeccccccCChhHhCcCHHHHHHHHHHHHHHHC
Confidence 444445566544679999999986543111 11235677777744 89999999999999999999999
Q ss_pred CCeEEEEccCCCchhHHHhcCcEEccceeEeeC
Q psy6918 91 QCYKLTLDCADHMIPFYETFGFQKKNNFMQIYF 123 (123)
Q Consensus 91 ~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~~ 123 (123)
|+..+.+.++..+.+||+|+||+..+.+|.+.+
T Consensus 490 G~~~i~v~s~~~A~~FY~klGf~~~g~ym~K~l 522 (522)
T TIGR01211 490 GSEKILVISGIGVREYYRKLGYELDGPYMSKRL 522 (522)
T ss_pred CCCEEEEeeCchHHHHHHHCCCEEEcceeEEeC
Confidence 999999988999999999999999999999864
No 33
>TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar.
Probab=99.58 E-value=8.5e-14 Score=84.24 Aligned_cols=105 Identities=15% Similarity=0.200 Sum_probs=77.6
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
+..+.+....++......+....+++.. +|++||++.+..... ......+. +++.|.+| +|+|++++..+++
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~g~~vG~~~~~~~~~----~~~~~~~g-~~~~~~~~-~G~g~~~~~~~~~ 102 (156)
T TIGR03585 31 HLIDWEEHLHFIEALKQDPNRRYWIVCQ--ESRPIGVISFTDINL----VHKSAFWG-IYANPFCK-PGVGSVLEEAALE 102 (156)
T ss_pred CCCCHHHHHHHHHHhhcCCCceEEEEEE--CCEEEEEEEEEecCh----hhCeEEEE-EEeChhhh-cCchHHHHHHHHH
Confidence 3466677777777766555444555665 799999999974321 12234443 55999999 9999999999999
Q ss_pred HHHh-cCCeEEEEcc---CCCchhHHHhcCcEEccce
Q psy6918 86 LAKH-FQCYKLTLDC---ADHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 86 ~~~~-~~~~~i~~~~---~~~~~~~y~~~Gf~~~~~~ 118 (123)
++.+ .++..+.+.+ |..+++||+|+||+..+..
T Consensus 103 ~a~~~~~~~~i~~~v~~~N~~s~~~y~k~Gf~~~g~~ 139 (156)
T TIGR03585 103 YAFEHLGLHKLSLEVLEFNNKALKLYEKFGFEREGVF 139 (156)
T ss_pred HHHhhCCeeEEEEEEeccCHHHHHHHHHcCCeEeeee
Confidence 9986 5999998765 5666889999999997543
No 34
>PLN02825 amino-acid N-acetyltransferase
Probab=99.58 E-value=2.4e-14 Score=100.53 Aligned_cols=80 Identities=21% Similarity=0.280 Sum_probs=67.7
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
+++++. +++|||++.+.+... ...+.|..++|+|+|||+|+|++||+.+++.++++|++.+.+.+ ....+||
T Consensus 409 f~V~e~--Dg~IVG~aal~~~~~-----~~~aEI~~laV~P~yRGkGiG~~LL~~le~~Ar~~G~~~L~Llt-t~a~~fY 480 (515)
T PLN02825 409 FVVVER--EGSIIACAALFPFFE-----EKCGEVAAIAVSPECRGQGQGDKLLDYIEKKAASLGLEKLFLLT-TRTADWF 480 (515)
T ss_pred EEEEEE--CCEEEEEEEEEeecC-----CCcEEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEEe-CcHHHHH
Confidence 455665 799999998864321 24578999999999999999999999999999999999998866 4568999
Q ss_pred HhcCcEEc
Q psy6918 108 ETFGFQKK 115 (123)
Q Consensus 108 ~~~Gf~~~ 115 (123)
+++||+..
T Consensus 481 ~k~GF~~~ 488 (515)
T PLN02825 481 VRRGFSEC 488 (515)
T ss_pred HHCCCEEe
Confidence 99999986
No 35
>PRK15130 spermidine N1-acetyltransferase; Provisional
Probab=99.58 E-value=7.6e-14 Score=86.93 Aligned_cols=96 Identities=18% Similarity=0.302 Sum_probs=69.8
Q ss_pred HHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHh-cCCeE
Q psy6918 16 RFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKH-FQCYK 94 (123)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~-~~~~~ 94 (123)
.+......+....+++.. +|++||++.+..... ....+.+ .++|+|+|||+|+|+++++.+++++.+ .++.+
T Consensus 47 ~~~~~~~~~~~~~~~i~~--~g~~iG~~~~~~~~~----~~~~~~~-~~~v~~~~~g~G~g~~l~~~l~~~~~~~~~~~r 119 (186)
T PRK15130 47 LYDKHIHDQSERRFVVEC--DGEKAGLVELVEINH----VHRRAEF-QIIISPEYQGKGLATRAAKLAMDYGFTVLNLYK 119 (186)
T ss_pred HHHHhhhcccCcEEEEEE--CCEEEEEEEEEeecC----CCCeEEE-EEEECHHHcCCCHHHHHHHHHHHHHhhcCCceE
Confidence 333333333334555555 799999999864321 1223344 589999999999999999999999986 59998
Q ss_pred EEEcc---CCCchhHHHhcCcEEccce
Q psy6918 95 LTLDC---ADHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 95 i~~~~---~~~~~~~y~~~Gf~~~~~~ 118 (123)
|.+.+ |..+.+||+|+||+..+..
T Consensus 120 v~~~v~~~N~~s~~~yek~GF~~~~~~ 146 (186)
T PRK15130 120 LYLIVDKENEKAIHIYRKLGFEVEGEL 146 (186)
T ss_pred EEEEEccCCHHHHHHHHHCCCEEEEEE
Confidence 98876 4566889999999987443
No 36
>PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional
Probab=99.58 E-value=1.3e-13 Score=85.37 Aligned_cols=104 Identities=11% Similarity=0.064 Sum_probs=74.9
Q ss_pred CCHHHHHHHHHhhh---cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 8 SNVFHFAERFHRMK---ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 8 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
.+.+..++.+.... .......+++.+ ++++||++.+..... ....+.| .+.++|+|||+|+|+++++.++
T Consensus 46 ~~~~~~~~~i~~~~~~~~~~~~~~~~i~~--~~~~iG~~~l~~~~~----~~~~~~i-g~~i~~~~~g~G~~tea~~~l~ 118 (179)
T PRK10151 46 QSEEDTRKTVQGNVMLHQRGYAKMFMIFK--EDELIGVLSFNRIEP----LNKTAYI-GYWLDESHQGQGIISQALQALI 118 (179)
T ss_pred CCHHHHHHHHHHHHHHHhcCCcEEEEEEE--CCEEEEEEEEEeecc----CCCceEE-EEEEChhhcCCcHHHHHHHHHH
Confidence 46677777776532 223323344445 689999999864321 1233455 4679999999999999999999
Q ss_pred HHHHhc-CCeEEEEcc---CCCchhHHHhcCcEEccce
Q psy6918 85 KLAKHF-QCYKLTLDC---ADHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 85 ~~~~~~-~~~~i~~~~---~~~~~~~y~~~Gf~~~~~~ 118 (123)
+++.+. ++.++.+.+ |..+.++++|+||+..+..
T Consensus 119 ~~~~~~~~~~ri~~~v~~~N~~S~~v~ek~Gf~~~g~~ 156 (179)
T PRK10151 119 HHYAQSGELRRFVIKCRVDNPASNQVALRNGFTLEGCL 156 (179)
T ss_pred HHHHhhCCccEEEEEEcCCCHHHHHHHHHCCCEEEeEe
Confidence 999874 799988865 5556789999999987543
No 37
>PRK09831 putative acyltransferase; Provisional
Probab=99.58 E-value=1.6e-14 Score=86.97 Aligned_cols=72 Identities=29% Similarity=0.421 Sum_probs=58.9
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
++++.+ +++++|++.+.. ..+..++|+|++||+|+|++|++.+++.+.. +.+.++..+.+||
T Consensus 55 ~~v~~~--~~~iiG~~~~~~-----------~~i~~~~v~p~~~g~GiG~~Ll~~~~~~~~~-----l~v~~~~~a~~~Y 116 (147)
T PRK09831 55 VRVAVI--NAQPVGFITCIE-----------HYIDMLFVDPEYTRRGVASALLKPLIKSESE-----LTVDASITAKPFF 116 (147)
T ss_pred eEEEEE--CCEEEEEEEehh-----------ceeeeEEECHHHcCCCHHHHHHHHHHHHhhh-----eEeecchhhHHHH
Confidence 444555 699999988741 3677899999999999999999999998775 4445567789999
Q ss_pred HhcCcEEccc
Q psy6918 108 ETFGFQKKNN 117 (123)
Q Consensus 108 ~~~Gf~~~~~ 117 (123)
+|+||+..+.
T Consensus 117 ~k~Gf~~~g~ 126 (147)
T PRK09831 117 ERYGFQTVKQ 126 (147)
T ss_pred HHCCCEEeec
Confidence 9999999844
No 38
>PRK10514 putative acetyltransferase; Provisional
Probab=99.58 E-value=5.5e-14 Score=84.18 Aligned_cols=94 Identities=19% Similarity=0.324 Sum_probs=63.6
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHH
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLA 87 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~ 87 (123)
.+.+.+...+..+..... .+++..+ ++++||++.+.. ..+..++|+|+|||+|+|++|++.+.+.+
T Consensus 33 ~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~iG~~~~~~-----------~~~~~~~v~p~~rgkGig~~Ll~~~~~~~ 98 (145)
T PRK10514 33 EDRAEIEELVRSFLPEAP-LWVAVDE--RDQPVGFMLLSG-----------GHMEALFVDPDVRGCGVGRMLVEHALSLH 98 (145)
T ss_pred hhHHHHHHHHHHHhccCc-eEEEEec--CCcEEEEEEEec-----------CcEeEEEECHHhccCCHHHHHHHHHHHhc
Confidence 345555555554433333 2333334 689999998741 23567999999999999999999999864
Q ss_pred HhcCCeEEEEccCCCchhHHHhcCcEEccc
Q psy6918 88 KHFQCYKLTLDCADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 88 ~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~ 117 (123)
. .+.......|..+.+||+|+||+..+.
T Consensus 99 ~--~i~~~v~~~N~~a~~~yek~Gf~~~~~ 126 (145)
T PRK10514 99 P--ELTTDVNEQNEQAVGFYKKMGFKVTGR 126 (145)
T ss_pred c--ccEEEeecCCHHHHHHHHHCCCEEecc
Confidence 3 222222234677789999999999743
No 39
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=99.57 E-value=8e-14 Score=92.36 Aligned_cols=79 Identities=18% Similarity=0.181 Sum_probs=62.3
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCchhHHHhcCc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMIPFYETFGF 112 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~~~y~~~Gf 112 (123)
++++||++.+..... ......|..+.|+|+|||+|+|++|+..+++++++.|+..+.+.+ |..+.+||+|+||
T Consensus 208 ~~~~vG~~~~~~~~~----~~~~~~i~~~~V~p~~rg~GiG~~ll~~~~~~~~~~g~~~v~l~v~~~N~~a~~~y~k~GF 283 (292)
T TIGR03448 208 PGELLGFHWTKVHPD----EPALGEVYVVGVDPAAQGRGLGDALTLIGLHHLAARGLPAVMLYVEADNEAAVRTYEKLGF 283 (292)
T ss_pred CCcEEEEEEEEecCC----CCceeEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEEEeCCCHHHHHHHHHcCC
Confidence 589999976543221 122456777899999999999999999999999999998888766 4456789999999
Q ss_pred EEccce
Q psy6918 113 QKKNNF 118 (123)
Q Consensus 113 ~~~~~~ 118 (123)
+.....
T Consensus 284 ~~~~~~ 289 (292)
T TIGR03448 284 TVAEVD 289 (292)
T ss_pred EEcccc
Confidence 987543
No 40
>PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional
Probab=99.57 E-value=3.5e-14 Score=102.31 Aligned_cols=80 Identities=21% Similarity=0.306 Sum_probs=66.7
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
++++++ ++++||++.+...+ .....|..++|+|+|||+|+|++|++.+++++++.|++.+.+.+ .+..||
T Consensus 505 ~~Va~~--~g~IVG~~~l~~~~------~~~~~I~~i~V~P~~rGkGIGk~Ll~~l~~~ak~~g~~~i~l~~--~a~~FY 574 (614)
T PRK12308 505 FAVAEH--HGEVTGCASLYIYD------SGLAEIRSLGVEAGWQVQGQGSALVQYLVEKARQMAIKKVFVLT--RVPEFF 574 (614)
T ss_pred EEEEEE--CCEEEEEEEEEEcC------CCeEEEEEEEECHHHcCCCHHHHHHHHHHHHHHHCCCCEEEEee--CcHHHH
Confidence 455666 69999999886432 12467999999999999999999999999999999999888764 357999
Q ss_pred HhcCcEEccc
Q psy6918 108 ETFGFQKKNN 117 (123)
Q Consensus 108 ~~~Gf~~~~~ 117 (123)
+|+||+..+.
T Consensus 575 ek~GF~~~~~ 584 (614)
T PRK12308 575 MKQGFSPTSK 584 (614)
T ss_pred HHCCCEECCc
Confidence 9999998743
No 41
>COG2153 ElaA Predicted acyltransferase [General function prediction only]
Probab=99.57 E-value=2.9e-14 Score=83.44 Aligned_cols=90 Identities=19% Similarity=0.276 Sum_probs=73.6
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC-CeEEEEccCCC
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ-CYKLTLDCADH 102 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~-~~~i~~~~~~~ 102 (123)
....+++... +|++++++++.+... ......|+.+.|+|++||+|+|.+||+++++.+.+.- -+.+.+.++..
T Consensus 48 ~~~Hl~~~~~--~g~LvAyaRLl~~~~----~~~~~~iGRV~v~~~~RG~glG~~Lm~~AL~~~~~~~p~~~v~l~AQah 121 (155)
T COG2153 48 DTRHLLGWTP--DGELVAYARLLPPGA----EYEEVSIGRVIVSPAARGQGLGQQLMEKALETAGREWPDKPVYLGAQAH 121 (155)
T ss_pred ccceEEEEcC--CCeEEEEEecCCCCC----CcCceeeeeEEECHhhhccchhHHHHHHHHHHHHhhCCCCCeEEehHHH
Confidence 3445666664 799999999985432 2233679999999999999999999999999998874 46788999999
Q ss_pred chhHHHhcCcEEc-ccee
Q psy6918 103 MIPFYETFGFQKK-NNFM 119 (123)
Q Consensus 103 ~~~~y~~~Gf~~~-~~~~ 119 (123)
...||.++||++. .+|.
T Consensus 122 Lq~fYa~~GFv~~~e~yl 139 (155)
T COG2153 122 LQDFYASFGFVRVGEEYL 139 (155)
T ss_pred HHHHHHHhCcEEcCchhh
Confidence 9999999999998 4443
No 42
>PRK05279 N-acetylglutamate synthase; Validated
Probab=99.56 E-value=3.9e-14 Score=98.72 Aligned_cols=82 Identities=22% Similarity=0.305 Sum_probs=67.4
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
++++++ +++++|++.+..... ...+.|..++|+|+|||+|+|++|++++++++++.|+..+.+.+ ..+.+||
T Consensus 336 ~~va~~--dg~iVG~~~~~~~~~-----~~~~~I~~l~V~p~~Rg~GiG~~Ll~~l~~~a~~~g~~~l~l~~-~~a~~fY 407 (441)
T PRK05279 336 FTVIER--DGLIIGCAALYPFPE-----EKMGEMACLAVHPDYRGSGRGERLLKRIEQRARQLGLKRLFVLT-TRTAHWF 407 (441)
T ss_pred EEEEEE--CCEEEEEEEEEEcCC-----CCeEEEEEEEECHHHcCCCHHHHHHHHHHHHHHHcCCCEEEEec-chHHHHH
Confidence 455666 699999988764321 23578999999999999999999999999999999998887654 5678899
Q ss_pred HhcCcEEccc
Q psy6918 108 ETFGFQKKNN 117 (123)
Q Consensus 108 ~~~Gf~~~~~ 117 (123)
+++||++.+.
T Consensus 408 ~k~GF~~~g~ 417 (441)
T PRK05279 408 LERGFVPVDV 417 (441)
T ss_pred HHCcCEECCh
Confidence 9999998743
No 43
>TIGR00124 cit_ly_ligase [citrate (pro-3S)-lyase] ligase. ATP is cleaved to AMP and pyrophosphate during the reaction. The carboxyl end is homologous to a number of cytidyltransferases that also release pyrophosphate.
Probab=99.56 E-value=8.5e-14 Score=93.44 Aligned_cols=78 Identities=22% Similarity=0.395 Sum_probs=68.1
Q ss_pred eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 26 YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 26 ~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
..++++.+ +++|||++++. . ..|..++|+|++||+|+|++|+..+++.+++.|+..+.+.+.+.+.+
T Consensus 31 d~~vv~~~--~~~lVg~g~l~--g---------~~ik~vaV~~~~rG~Glg~~L~~~L~~~a~~~G~~~l~l~Tk~~~~~ 97 (332)
T TIGR00124 31 EIFIAVYE--DEEIIGCGGIA--G---------NVIKCVAIDESLRGEGLALQLMTELENLAYELGRFHLFIFTKPEYAA 97 (332)
T ss_pred CEEEEEEE--CCEEEEEEEEe--c---------CEEEEEEEcHHHcCCCHHHHHHHHHHHHHHHcCCCEEEEEECchHHH
Confidence 34555555 69999999984 1 24889999999999999999999999999999999999999888899
Q ss_pred HHHhcCcEEcc
Q psy6918 106 FYETFGFQKKN 116 (123)
Q Consensus 106 ~y~~~Gf~~~~ 116 (123)
||+++||.+.+
T Consensus 98 fy~klGF~~i~ 108 (332)
T TIGR00124 98 LFEYCGFKTLA 108 (332)
T ss_pred HHHHcCCEEee
Confidence 99999999973
No 44
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase. This model represents a clade of amino-acid N-acetyltransferases acting mainly on glutamate in the first step of the "acetylated" ornithine biosynthesis pathway. For this reason it is also called N-acetylglutamate synthase. The enzyme may also act on aspartate.
Probab=99.56 E-value=6.9e-14 Score=97.18 Aligned_cols=82 Identities=20% Similarity=0.285 Sum_probs=66.8
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
++++++ +++++|++.+..... ...+.|..++|+|+|||+|+|++||+.++++++++|+..+.+.+ ....+||
T Consensus 324 ~~V~~~--dg~iVG~~~~~~~~~-----~~~~~I~~l~V~p~~Rg~GiG~~Ll~~l~~~A~~~G~~~l~v~~-~~a~~fY 395 (429)
T TIGR01890 324 FSIIEH--DGNIIGCAALYPYAE-----EDCGEMACLAVSPEYQDGGRGERLLAHIEDRARQMGISRLFVLT-TRTGHWF 395 (429)
T ss_pred EEEEEE--CCEEEEEEEEEecCC-----CCeEEEEEEEECHHHcCCCHHHHHHHHHHHHHHHcCCCEEEEee-cchHHHH
Confidence 445555 699999999875321 23578889999999999999999999999999999998876543 3457999
Q ss_pred HhcCcEEccc
Q psy6918 108 ETFGFQKKNN 117 (123)
Q Consensus 108 ~~~Gf~~~~~ 117 (123)
+++||+..+.
T Consensus 396 ~k~GF~~~g~ 405 (429)
T TIGR01890 396 RERGFQTASV 405 (429)
T ss_pred HHCCCEECCh
Confidence 9999999743
No 45
>PF13302 Acetyltransf_3: Acetyltransferase (GNAT) domain; PDB: 3TTH_C 3JUW_A 2ZXV_A 2Z0Z_A 2VI7_B 3EG7_F 1YRE_C 3IGR_B 3FBU_A 2FCK_A ....
Probab=99.53 E-value=6.6e-13 Score=79.01 Aligned_cols=101 Identities=20% Similarity=0.259 Sum_probs=73.0
Q ss_pred CCHHHHHHHHHh-hh--cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 8 SNVFHFAERFHR-MK--ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 8 ~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
.+.+...+++.. .. ......++++.+.+++++||++.+..... ....+.++ +.|.|++||+|+|++++..++
T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~iG~i~~~~~~~----~~~~~eig-~~i~~~~~g~G~~~~~~~~~~ 109 (142)
T PF13302_consen 35 PTLEEAEEWIQSRQDSWENHGYYYFAIEDKDDGEIIGFIGLYNIDK----NNNWAEIG-YWIGPDYRGKGYGTEALKLLL 109 (142)
T ss_dssp SSHHHHHHHHHHHHHCHHEETEEEEEEEETTTTEEEEEEEEEEEET----TTTEEEEE-EEEEGGGTTSSHHHHHHHHHH
T ss_pred CCHHHHHHHHHHhhhhhhcccceEEEEEeccCCceEEEeeeeeccc----CCCccccc-cchhHHHHhhhHHHHHHHHHH
Confidence 377777777763 11 11225556666544568999999953321 23456665 889999999999999999999
Q ss_pred HHHH-hcCCeEEEEccC---CCchhHHHhcCcE
Q psy6918 85 KLAK-HFQCYKLTLDCA---DHMIPFYETFGFQ 113 (123)
Q Consensus 85 ~~~~-~~~~~~i~~~~~---~~~~~~y~~~Gf~ 113 (123)
+++. +.++..+.+.+. ..+.++++|+||+
T Consensus 110 ~~~~~~~~~~~i~a~~~~~N~~s~~~~~k~GF~ 142 (142)
T PF13302_consen 110 DWAFEELGLHRIIATVMADNEASRRLLEKLGFE 142 (142)
T ss_dssp HHHHHTSTSSEEEEEEETT-HHHHHHHHHTT-E
T ss_pred HHHHhcCCcEEEEEEECcCCHHHHHHHHHcCCC
Confidence 9994 679999988774 4557799999996
No 46
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=99.53 E-value=1.7e-13 Score=90.79 Aligned_cols=85 Identities=16% Similarity=0.252 Sum_probs=63.2
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc-cCCCchhH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD-CADHMIPF 106 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~-~~~~~~~~ 106 (123)
.+++.+ ++++||++.+..... ....+..++|+|++||+|+|++|++.+++.+.. .-.+.+. .|..+++|
T Consensus 48 ~~~~~~--~~~~vG~~~~~~~~~------~~~~~~~l~V~p~~rg~GiG~~Ll~~~~~~~~~--~~~~~~~~~n~~a~~f 117 (292)
T TIGR03448 48 HLVAVD--SDPIVGYANLVPARG------TDPAMAELVVHPAHRRRGIGRALIRALLAKGGG--RLRVWAHGDLPAARAL 117 (292)
T ss_pred EEEEEE--CCEEEEEEEEEcCCC------CcceEEEEEECHhhcCCCHHHHHHHHHHHhccC--ceEEEEcCCCHHHHHH
Confidence 444555 689999999874321 124688899999999999999999999998652 2223332 24567899
Q ss_pred HHhcCcEEccceeEee
Q psy6918 107 YETFGFQKKNNFMQIY 122 (123)
Q Consensus 107 y~~~Gf~~~~~~~~~~ 122 (123)
|+++||+....++.++
T Consensus 118 y~~~Gf~~~~~~~~~~ 133 (292)
T TIGR03448 118 ASRLGLVPTRELLQMR 133 (292)
T ss_pred HHHCCCEEccEEEEEE
Confidence 9999999987776654
No 47
>PRK01346 hypothetical protein; Provisional
Probab=99.51 E-value=9.6e-13 Score=91.06 Aligned_cols=90 Identities=16% Similarity=0.122 Sum_probs=69.4
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccc--cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHE--CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~--~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
.+++.+ ++++||++.+......... ..+...|..++|+|+|||+|+|++||+++++.+++.|+..+.+.+.. .+
T Consensus 49 ~~va~~--~~~lvg~~~~~~~~~~~~~~~~~~~~~i~~v~V~P~~RgrGig~~Ll~~~l~~a~~~g~~~~~L~~~~--~~ 124 (411)
T PRK01346 49 TLGAFD--GDEVVGTAGAFDLRLTVPGGAVLPAAGVTAVTVAPTHRRRGLLTALMREQLRRIRERGEPVAALTASE--GG 124 (411)
T ss_pred eEEEEE--CCEEEEEEEEeccccccCCCCccceeEEEEEEEChhhcCCCHHHHHHHHHHHHHHHCCCcEEEEECCc--hh
Confidence 344555 6899999997643321111 12568999999999999999999999999999999999877776543 57
Q ss_pred HHHhcCcEEccceeEe
Q psy6918 106 FYETFGFQKKNNFMQI 121 (123)
Q Consensus 106 ~y~~~Gf~~~~~~~~~ 121 (123)
||+|+||........+
T Consensus 125 ~Y~r~Gf~~~~~~~~~ 140 (411)
T PRK01346 125 IYGRFGYGPATYSQSL 140 (411)
T ss_pred hHhhCCCeeccceEEE
Confidence 9999999988655543
No 48
>PRK13688 hypothetical protein; Provisional
Probab=99.51 E-value=3.3e-13 Score=81.90 Aligned_cols=86 Identities=14% Similarity=0.171 Sum_probs=60.3
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeee----cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQK----FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~----~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
+....+++.+ ++++||++.+...+. +.........|..++|+|+|||+|+|++|++.+. +.++. +.+.+
T Consensus 43 ~~~~~~~~~~--~~~~VG~~~l~~~dg~~~~~~~~~~~~~~L~~l~V~p~~rgkGiG~~Ll~~a~----~~~~~-~~~~~ 115 (156)
T PRK13688 43 SESPFYGIYY--GDSLVARMSLYKKGGVEEPYFEDTQDYLELWKLEVLPKYQNRGYGEMLVDFAK----SFQLP-IKTIA 115 (156)
T ss_pred CCCCEEEEEE--CCEEEEEEEEEecCCcccccccCCCCeEEEEEEEECHHHcCCCHHHHHHHHHH----HhCCe-EEEEe
Confidence 3344444555 689999988754321 1122345678999999999999999999998654 44554 44444
Q ss_pred CCCchhHHHhcCcEEcc
Q psy6918 100 ADHMIPFYETFGFQKKN 116 (123)
Q Consensus 100 ~~~~~~~y~~~Gf~~~~ 116 (123)
...+.+||+|+||+..+
T Consensus 116 ~~~a~~FY~k~GF~~~~ 132 (156)
T PRK13688 116 RNKSKDFWLKLGFTPVE 132 (156)
T ss_pred ccchHHHHHhCCCEEeE
Confidence 55678999999999874
No 49
>COG3153 Predicted acetyltransferase [General function prediction only]
Probab=99.50 E-value=1.8e-12 Score=79.15 Aligned_cols=95 Identities=18% Similarity=0.185 Sum_probs=74.4
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYK 94 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~ 94 (123)
+.+.........+.+++.+ +|++||.+.+....- .........+.-++|+|++||+|+|++|+...++.++..|+..
T Consensus 35 ~~lR~~~~~~~~LslVA~d--~g~vvG~Il~s~v~~-~g~~~~~~~LaPLaV~p~~qg~GIG~~Lvr~~le~a~~~G~~~ 111 (171)
T COG3153 35 DKLREGGRPDLTLSLVAED--DGEVVGHILFSPVTV-GGEELGWLGLAPLAVDPEYQGQGIGSALVREGLEALRLAGASA 111 (171)
T ss_pred HHHHhcCCcccceeEEEee--CCEEEEEEEEeEEEe-cCcccceEEEEeEEEchhhcCCcHHHHHHHHHHHHHHHCCCCE
Confidence 3334333345677888888 799999999985432 1223345677889999999999999999999999999999998
Q ss_pred EEEccCCCchhHHHhcCcEEc
Q psy6918 95 LTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 95 i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+.+. ....+|.++||+..
T Consensus 112 v~vl---Gdp~YY~rfGF~~~ 129 (171)
T COG3153 112 VVVL---GDPTYYSRFGFEPA 129 (171)
T ss_pred EEEe---cCcccccccCcEEc
Confidence 8884 44578999999987
No 50
>TIGR01686 FkbH FkbH-like domain. The C-terminal portion of this domain is unique to this family (by BLAST).
Probab=99.50 E-value=5.5e-13 Score=89.59 Aligned_cols=99 Identities=15% Similarity=0.215 Sum_probs=72.7
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcC-CCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDT-RTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
..++.+.+...+. .+..+++...+. .++.+||++.+.... ....|..++|+|++||+|+|++||++++
T Consensus 215 ~~~s~~~i~~~l~----~~~~~~~~~~d~~gd~givG~~~~~~~~-------~~~~I~~l~vs~r~~grGig~~Ll~~l~ 283 (320)
T TIGR01686 215 TRLNQEDVAQHMQ----KEEIVTVSMSDRFGDSGIIGIFVFEKKE-------GNLFIDDLCMSCRALGRGVETRMLRWLF 283 (320)
T ss_pred ccCCHHHHHHHhc----CCCEEEEEEEecCCCCceEEEEEEEecC-------CcEEEEEEEEcHhHhcCcHHHHHHHHHH
Confidence 3455555555543 233333333221 157899999876422 3568999999999999999999999999
Q ss_pred HHHHhcCCeEEEEcc-----CCCchhHHHhcCcEEc
Q psy6918 85 KLAKHFQCYKLTLDC-----ADHMIPFYETFGFQKK 115 (123)
Q Consensus 85 ~~~~~~~~~~i~~~~-----~~~~~~~y~~~Gf~~~ 115 (123)
+.+++.|+..+.+.. |.+++.||+++||+.+
T Consensus 284 ~~a~~~G~~~i~l~v~~~~~N~~A~~fY~~~GF~~~ 319 (320)
T TIGR01686 284 EQALDLGNHNARLYYRRTERNMPFLSFYEQIGFEDE 319 (320)
T ss_pred HHHHHcCCCeEEEEEeeCCCchHHHHHHHHcCCccC
Confidence 999999999888754 5566789999999864
No 51
>COG3393 Predicted acetyltransferase [General function prediction only]
Probab=99.48 E-value=1.4e-12 Score=83.52 Aligned_cols=81 Identities=17% Similarity=0.213 Sum_probs=68.3
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc--cCCCchhHHHhcCcE
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD--CADHMIPFYETFGFQ 113 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~--~~~~~~~~y~~~Gf~ 113 (123)
+|+||+.+...... ...+.|..+++.|+|||+|+|+.|+..+-+...+.|..-+... .|+.+.+.|+|.||+
T Consensus 185 d~~iVa~A~t~a~~------~~~~~I~gV~T~peyR~kGyAt~lva~L~~~lL~eGk~~~L~~~~~N~~A~~iY~riGF~ 258 (268)
T COG3393 185 DGKIVAKAETAAEN------PAYAQINGVYTHPEYRGKGYATALVATLAAKLLAEGKIPCLFVNSDNPVARRIYQRIGFR 258 (268)
T ss_pred CCcEEEeeeccccC------CcceEEEEEEcCHHHccccHHHHHHHHHHHHHHhCCCeeEEEEecCCHHHHHHHHHhCCe
Confidence 47999999987443 3578999999999999999999999999999999887655443 478889999999999
Q ss_pred EccceeEee
Q psy6918 114 KKNNFMQIY 122 (123)
Q Consensus 114 ~~~~~~~~~ 122 (123)
..+++...-
T Consensus 259 ~~g~~~~~~ 267 (268)
T COG3393 259 EIGEFREYI 267 (268)
T ss_pred ecceEEEEe
Confidence 998776543
No 52
>PF08445 FR47: FR47-like protein; InterPro: IPR013653 Proteins in this entry have a conserved region similar to the C-terminal region of the Drosophila melanogaster (Fruit fly) hypothetical protein FR47 (Q9VR51 from SWISSPROT). This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. ; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups; PDB: 1SQH_A 3EC4_B.
Probab=99.42 E-value=2.3e-12 Score=70.88 Aligned_cols=58 Identities=24% Similarity=0.288 Sum_probs=46.7
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc--cCCCchhHHHhcCcEEc
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD--CADHMIPFYETFGFQKK 115 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~--~~~~~~~~y~~~Gf~~~ 115 (123)
.+.|..++|+|++||+|+|+.++..+.+.+.+.|....... .|..++++|+|+||+..
T Consensus 21 ~g~i~~v~t~p~~RrrGlg~~lv~~l~~~~~~~g~~~~l~v~~~N~~s~~ly~klGf~~~ 80 (86)
T PF08445_consen 21 DGEIGGVYTLPEHRRRGLGSALVAALARELLERGKTPFLYVDADNEASIRLYEKLGFREI 80 (86)
T ss_dssp TCCEEEEEE-GGGTTSSHHHHHHHHHHHHHHHTTSEEEEEEETT-HHHHHHHHHCT-EEE
T ss_pred CcEEEEEEECHHHcCCCHHHHHHHHHHHHHHhCCCcEEEEEECCCHHHHHHHHHcCCEEE
Confidence 37899999999999999999999999999999887543322 35677899999999987
No 53
>PRK10562 putative acetyltransferase; Provisional
Probab=99.39 E-value=1.2e-11 Score=74.24 Aligned_cols=72 Identities=19% Similarity=0.290 Sum_probs=55.0
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc---cCCCch
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD---CADHMI 104 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~---~~~~~~ 104 (123)
.+++.+ ++++||++.+... ..+..++|+|+|||+|+|++|++.+++.+ ..+.+. .|+.+.
T Consensus 50 ~~v~~~--~~~~iG~~~~~~~----------~~i~~~~v~~~~rg~G~g~~ll~~~~~~~-----~~~~~~v~~~N~~s~ 112 (145)
T PRK10562 50 TWVWEE--DGKLLGFVSVLEG----------RFVGALFVAPKAVRRGIGKALMQHVQQRY-----PHLSLEVYQKNQRAV 112 (145)
T ss_pred EEEEEE--CCEEEEEEEEeec----------cEEEEEEECHHHcCCCHHHHHHHHHHhhC-----CeEEEEEEcCChHHH
Confidence 344455 5899999988521 25778999999999999999999998753 233333 366778
Q ss_pred hHHHhcCcEEcc
Q psy6918 105 PFYETFGFQKKN 116 (123)
Q Consensus 105 ~~y~~~Gf~~~~ 116 (123)
+||+|+||+..+
T Consensus 113 ~~y~k~Gf~~~~ 124 (145)
T PRK10562 113 NFYHAQGFRIVD 124 (145)
T ss_pred HHHHHCCCEEcc
Confidence 999999999974
No 54
>KOG2488|consensus
Probab=99.37 E-value=9.3e-12 Score=76.10 Aligned_cols=87 Identities=24% Similarity=0.324 Sum_probs=70.4
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CC
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---AD 101 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~ 101 (123)
..++++.++ .+.+||+..+.+..+ .+.+..++..+-|.+.|||+|||+.||+.++..+.....+.|.+++ |.
T Consensus 92 ~~Yi~a~~~--~~~~vgf~~Frf~vd---~g~~vlYcyEvqv~~~yR~kGiGk~LL~~l~~~a~~~~~~kVmLTVf~~N~ 166 (202)
T KOG2488|consen 92 LRYICAWNN--KSKLVGFTMFRFTVD---TGDPVLYCYEVQVASAYRGKGIGKFLLDTLEKLADSRHMRKVMLTVFSENI 166 (202)
T ss_pred ceEEEEEcC--CCceeeEEEEEEEcc---cCCeEEEEEEEeehhhhhccChHHHHHHHHHHHHHHHHhhhheeeeecccc
Confidence 333433433 459999999987543 3456789999999999999999999999999999888888787765 77
Q ss_pred CchhHHHhcCcEEcc
Q psy6918 102 HMIPFYETFGFQKKN 116 (123)
Q Consensus 102 ~~~~~y~~~Gf~~~~ 116 (123)
++..||.++||.++.
T Consensus 167 ~al~Fy~~~gf~~~~ 181 (202)
T KOG2488|consen 167 RALGFYHRLGFVVDE 181 (202)
T ss_pred hhHHHHHHcCcccCC
Confidence 888999999999873
No 55
>PF13718 GNAT_acetyltr_2: GNAT acetyltransferase 2; PDB: 2ZPA_B.
Probab=99.37 E-value=1.5e-11 Score=76.69 Aligned_cols=96 Identities=16% Similarity=0.224 Sum_probs=64.5
Q ss_pred HhhhcCCCeEEEEEEcCCCC--eEEEEEEEEEeeecccc-------------------------------cceeEEEEEE
Q psy6918 18 HRMKASQDYLVTVIEDTRTK--QVIGTGSLILEQKFIHE-------------------------------CALKGKIEEV 64 (123)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~--~ivG~~~~~~~~~~~~~-------------------------------~~~~~~i~~~ 64 (123)
..+.+.+.+.+++... ++ +|+|.+.+..+...... .-....|..|
T Consensus 19 ~~LlDaP~h~l~~l~~--~~~p~il~~~~v~~EG~l~~~l~~~i~~g~rRp~G~LiP~~L~~~~~~~~f~~l~g~RIvRI 96 (196)
T PF13718_consen 19 QLLLDAPNHRLFVLLQ--PGDPDILGVAQVALEGGLSKELIEAILSGGRRPKGHLIPQTLAQHFGDPEFAQLSGARIVRI 96 (196)
T ss_dssp HHHHH-TTEEEEEEE---SS--SEEEEEEEEEEE---HHHHHHHHTTS---SS-HHHHHHHHHSS-TTGGGSEEEEEEEE
T ss_pred HHHhcCCcceeehhcc--CCCceEEEEEEEEecCCCCHHHHHHHHhCCCCCCCCCHHHHHHHHhCCHHHHhhcceeEEEE
Confidence 3344567777887777 56 99999999876542210 1234589999
Q ss_pred EecccccCCchHHHHHHHHHHHH-------------------------HhcCCeEEEE--ccCCCchhHHHhcCcEEc
Q psy6918 65 VVDDTYRGKELGKLLIAVLVKLA-------------------------KHFQCYKLTL--DCADHMIPFYETFGFQKK 115 (123)
Q Consensus 65 ~v~p~~rg~G~g~~Ll~~~~~~~-------------------------~~~~~~~i~~--~~~~~~~~~y~~~Gf~~~ 115 (123)
+|+|++|++|+|++|++.+++++ +..++..+-. ..+++..+||.|+||++.
T Consensus 97 AvhP~~q~~G~Gs~lL~~l~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~vDylGtSFG~t~~Ll~FW~k~gf~pv 174 (196)
T PF13718_consen 97 AVHPDLQRMGYGSRLLQQLEQYAEGKIPSLSEQDKEKLPPLLSKLSDRRPPGVDYLGTSFGATPELLKFWQKNGFVPV 174 (196)
T ss_dssp EE-CCC-SSSHHHHHHHHHHHT-----------------------------S-SEEEEEEE--HHHHHHHHCTT-EEE
T ss_pred EEChhhhcCCHHHHHHHHHHHHHhhhcccccccccccccccccccccccccCCCEEEeccCCCHHHHHHHHHCCcEEE
Confidence 99999999999999999999999 3556655544 357888999999999986
No 56
>KOG3397|consensus
Probab=99.36 E-value=1.3e-11 Score=74.53 Aligned_cols=76 Identities=26% Similarity=0.381 Sum_probs=66.2
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
..++||.+.+.. ........++..+.|+.+.||+|+|+.||+.++.||+..|++.+.+++ .+..+||+++||+.-
T Consensus 65 ~~~VigH~rLS~----i~n~~~al~VEsVVV~k~~RG~GFGk~lMk~~E~~~R~~gf~~~yLsT-~DQ~~FYe~lGYe~c 139 (225)
T KOG3397|consen 65 NDEVLGHSRLSH----LPNRDHALWVESVVVKKDQRGLGFGKFLMKSTEKWMREKGFNEAYLST-DDQCRFYESLGYEKC 139 (225)
T ss_pred ccceeeeecccc----CCCCCceeEEEEEEEehhhccccHHHHHHHHHHHHHHHhhhhheeeec-ccchhhhhhhccccc
Confidence 789999999973 334456789999999999999999999999999999999999999965 566789999999986
Q ss_pred c
Q psy6918 116 N 116 (123)
Q Consensus 116 ~ 116 (123)
+
T Consensus 140 ~ 140 (225)
T KOG3397|consen 140 D 140 (225)
T ss_pred C
Confidence 3
No 57
>KOG3235|consensus
Probab=99.32 E-value=3.5e-11 Score=71.68 Aligned_cols=92 Identities=21% Similarity=0.294 Sum_probs=72.4
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHh-cCCeEEEEcc-
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKH-FQCYKLTLDC- 99 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~-~~~~~i~~~~- 99 (123)
-......+|++|. +|+|||++......++ ....+.+.|.+++|...||+.|+|++||.....-..+ .+.+.|.+.+
T Consensus 37 lswp~lSyVA~D~-~gkiVGYvlAkmee~p-~~~~~hGhItSlaV~rs~RrlGla~kLm~qa~rAm~E~~~A~yvsLHVR 114 (193)
T KOG3235|consen 37 LSWPQLSYVAEDE-NGKIVGYVLAKMEEDP-DDEPPHGHITSLAVKRSYRRLGLAQKLMNQASRAMVEVYEAKYVSLHVR 114 (193)
T ss_pred cccccceEEEEcC-CCcEEEEeeeehhhcc-cCCCCCCeeEEeeehhhHHHhhHHHHHHHHHHHHHHHhhcceEEEEeee
Confidence 3456667888864 8999999998876532 3334578999999999999999999999998776655 3667788776
Q ss_pred --CCCchhHHH-hcCcEEc
Q psy6918 100 --ADHMIPFYE-TFGFQKK 115 (123)
Q Consensus 100 --~~~~~~~y~-~~Gf~~~ 115 (123)
|..++.+|+ .+||+..
T Consensus 115 ~SNraAl~LY~~tl~F~v~ 133 (193)
T KOG3235|consen 115 KSNRAALHLYKNTLGFVVC 133 (193)
T ss_pred cccHHHHHhhhhccceEEe
Confidence 556678998 9999987
No 58
>COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]
Probab=99.29 E-value=8.6e-11 Score=72.54 Aligned_cols=79 Identities=18% Similarity=0.209 Sum_probs=61.6
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHh-cCCeEEEEcc---CCCchhHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKH-FQCYKLTLDC---ADHMIPFYETFG 111 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~-~~~~~i~~~~---~~~~~~~y~~~G 111 (123)
++++||.+.+..... ..........+.+.|+++|+|+|++++..+++++.. .++.++.+.+ |..++++++|+|
T Consensus 76 ~~~~iG~~~~~~~~~---~~~~~~~~ig~~l~~~~~g~G~~tea~~~~l~~~f~~~~l~ri~~~~~~~N~~S~rv~ek~G 152 (187)
T COG1670 76 DGELIGVIGLSDIDR---AANGDLAEIGYWLDPEYWGKGYATEALRALLDYAFEELGLHRIEATVDPENEASIRVYEKLG 152 (187)
T ss_pred CCeEEEEEEEEEecc---ccccceEEEEEEEChHHhcCchHHHHHHHHHHHhhhhcCceEEEEEecCCCHHHHHHHHHcC
Confidence 359999999985331 111223444566799999999999999999999988 6999999887 455578999999
Q ss_pred cEEccc
Q psy6918 112 FQKKNN 117 (123)
Q Consensus 112 f~~~~~ 117 (123)
|+..+.
T Consensus 153 f~~eg~ 158 (187)
T COG1670 153 FRLEGE 158 (187)
T ss_pred Chhhhh
Confidence 999854
No 59
>cd04301 NAT_SF N-Acyltransferase superfamily: Various enzymes that characteristically catalyze the transfer of an acyl group to a substrate. NAT (N-Acyltransferase) is a large superfamily of enzymes that mostly catalyze the transfer of an acyl group to a substrate and are implicated in a variety of functions, ranging from bacterial antibiotic resistance to circadian rhythms in mammals. Members include GCN5-related N-Acetyltransferases (GNAT) such as Aminoglycoside N-acetyltransferases, Histone N-acetyltransferase (HAT) enzymes, and Serotonin N-acetyltransferase, which catalyze the transfer of an acetyl group to a substrate. The kinetic mechanism of most GNATs involves the ordered formation of a ternary complex: the reaction begins with Acetyl Coenzyme A (AcCoA) binding, followed by binding of substrate, then direct transfer of the acetyl group from AcCoA to the substrate, followed by product and subsequent CoA release. Other family members include Arginine/ornithine N-succinyltransfera
Probab=99.22 E-value=2e-10 Score=58.61 Aligned_cols=58 Identities=24% Similarity=0.330 Sum_probs=50.1
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEE
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTL 97 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~ 97 (123)
+++++|++.+..... ......+..++|+|++||+|+|++++..+++++.+.++..+.+
T Consensus 7 ~~~~ig~~~~~~~~~----~~~~~~l~~~~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~v~~ 64 (65)
T cd04301 7 DGEIVGFASLSPDGS----GGDTAYIGDLAVLPEYRGKGIGSALLEAAEEEARERGAKRLRL 64 (65)
T ss_pred CCEEEEEEEEEecCC----CCccEEEEEEEECHHHcCcCHHHHHHHHHHHHHHHcCCcEEEe
Confidence 689999999986432 2356889999999999999999999999999999988888765
No 60
>KOG3234|consensus
Probab=99.21 E-value=2.2e-10 Score=68.17 Aligned_cols=89 Identities=19% Similarity=0.294 Sum_probs=70.1
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCch
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMI 104 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~ 104 (123)
+.++++. +++|-|++....++ .....++++..+.|.|+||+.|+|+.||+.+++.....+...+.+.+ |.-++
T Consensus 43 ~~~a~~p-~~~imgyimgk~Eg---~~~~wh~HvTAltVap~~Rrl~la~~lm~~led~~d~~~a~fvDLfVr~sN~iAI 118 (173)
T KOG3234|consen 43 FIVAEAP-TGEIMGYIMGKVEG---KDTEWHGHVTALTVAPDYRRLGLAAKLMDTLEDVSDVDNAYFVDLFVRVSNQIAI 118 (173)
T ss_pred hEeccCC-CCceEEEEeeeccc---cCcceeeEEEEEEechhHHHHHHHHHHHHHHHHHHHhhhhheeeeeeeccchhHH
Confidence 4445543 78999999886554 34456789999999999999999999999999999887666666544 67779
Q ss_pred hHHHhcCcEEccceeE
Q psy6918 105 PFYETFGFQKKNNFMQ 120 (123)
Q Consensus 105 ~~y~~~Gf~~~~~~~~ 120 (123)
.||+++||++-..-..
T Consensus 119 ~mYkkLGY~~YR~Vi~ 134 (173)
T KOG3234|consen 119 DMYKKLGYSVYRTVIE 134 (173)
T ss_pred HHHHhcCceEEEeeee
Confidence 9999999998644333
No 61
>PF14542 Acetyltransf_CG: GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=99.08 E-value=3.2e-09 Score=57.24 Aligned_cols=56 Identities=16% Similarity=0.133 Sum_probs=46.1
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
+|+.+|.+.+... +....|....|.|++||+|+|++|++.++++++++|.+-+ ..|
T Consensus 7 ~g~~~a~l~Y~~~-------~~~~~i~hT~V~~~~rGqGia~~L~~~~l~~a~~~~~kv~-p~C 62 (78)
T PF14542_consen 7 DGEEIAELTYRED-------GGVIVITHTEVPPELRGQGIAKKLVEAALDYARENGLKVV-PTC 62 (78)
T ss_dssp STTEEEEEEEEES-------SSEEEEEEEEE-CSSSTTTHHHHHHHHHHHHHHHTT-EEE-ETS
T ss_pred CCEEEEEEEEEeC-------CCEEEEEEEEECccccCCcHHHHHHHHHHHHHHHCCCEEE-EEC
Confidence 5889999999752 3578899999999999999999999999999999998644 434
No 62
>COG3981 Predicted acetyltransferase [General function prediction only]
Probab=99.07 E-value=1.7e-09 Score=65.38 Aligned_cols=83 Identities=18% Similarity=0.233 Sum_probs=61.4
Q ss_pred EEEEEEcCCCCeEEEEEEEEEeee--cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCc-
Q psy6918 27 LVTVIEDTRTKQVIGTGSLILEQK--FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHM- 103 (123)
Q Consensus 27 ~~~~~~~~~~~~ivG~~~~~~~~~--~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~- 103 (123)
.++...+ ++++||++.++..-. .... .++| ...|.|+.||+|+|+++|+.+++.|++.|++.|.++|++.|
T Consensus 70 ~y~~v~~--d~~ivG~i~lRh~Ln~~ll~~---gGHI-GY~VrPseR~KGYA~emLkl~L~~ar~lgi~~Vlvtcd~dN~ 143 (174)
T COG3981 70 TYWAVDE--DGQIVGFINLRHQLNDFLLEE---GGHI-GYSVRPSERRKGYAKEMLKLALEKARELGIKKVLVTCDKDNI 143 (174)
T ss_pred eEEEEec--CCcEEEEEEeeeecchHHHhc---CCcc-cceeChhhhccCHHHHHHHHHHHHHHHcCCCeEEEEeCCCCc
Confidence 3444444 799999999985422 1111 2333 35899999999999999999999999999999999996555
Q ss_pred --hhHHHhcCcEEc
Q psy6918 104 --IPFYETFGFQKK 115 (123)
Q Consensus 104 --~~~y~~~Gf~~~ 115 (123)
.+.-+++|=+..
T Consensus 144 ASrkvI~~NGGile 157 (174)
T COG3981 144 ASRKVIEANGGILE 157 (174)
T ss_pred hhhHHHHhcCCEEe
Confidence 556667664443
No 63
>PF12746 GNAT_acetyltran: GNAT acetyltransferase; PDB: 3G3S_B.
Probab=99.07 E-value=3.9e-09 Score=68.94 Aligned_cols=79 Identities=16% Similarity=0.163 Sum_probs=58.2
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+++||+.|....... ...--++.++|+|||+|+|+.+..+++..|.++|+.-.+-+.|..+.++-+|+||+..
T Consensus 173 ~~~iVs~~~s~~~~~-------~~~EI~I~T~~~yR~kGLA~~~aa~~I~~Cl~~~l~P~WDc~N~~S~~lA~kLGf~~~ 245 (265)
T PF12746_consen 173 DGEIVSGCSSYFVYE-------NGIEIDIETHPEYRGKGLATAVAAAFILECLENGLYPSWDCHNLASIALAEKLGFHFD 245 (265)
T ss_dssp TTEEEEEEEEEEEET-------TEEEEEEEE-CCCTTSSHHHHHHHHHHHHHHHTT-EEE-EESSHHHHHHHHHCT--EE
T ss_pred CCEEEEEEEEEEEEC-------CEEEEEEEECHHhhcCCHHHHHHHHHHHHHHHCCCCcCeeCCCHHHHHHHHHcCCccc
Confidence 488887766553322 1233468999999999999999999999999999876655567888999999999998
Q ss_pred cceeEe
Q psy6918 116 NNFMQI 121 (123)
Q Consensus 116 ~~~~~~ 121 (123)
.+|-.+
T Consensus 246 ~~Y~~Y 251 (265)
T PF12746_consen 246 FEYTAY 251 (265)
T ss_dssp EEEEEE
T ss_pred ceeeee
Confidence 777654
No 64
>PF12568 DUF3749: Acetyltransferase (GNAT) domain; InterPro: IPR024612 This domain is found in uncharacterised proteins from Gammaproteobacteria, and is approximately 40 amino acids in length. It contains two completely conserved residues (D and I) that may be functionally important. Proteins having this domain are frequently annotated as acetyltransferases of the GNAT family; however there is little accompanying annotation to confirm this.; PDB: 2K5T_A.
Probab=99.06 E-value=1.5e-08 Score=58.70 Aligned_cols=85 Identities=12% Similarity=0.218 Sum_probs=59.4
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC--C
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA--D 101 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~--~ 101 (123)
....++++.= |++++|.+.+.... ..+.|.+++|.|--|++|+|..|++.+.+.+ .+++.+.+... +
T Consensus 36 ~~~~l~aArF--NdRlLgAv~v~~~~-------~~~~L~~l~VRevTRrRGVG~yLlee~~rq~--p~i~~w~l~~~~~~ 104 (128)
T PF12568_consen 36 EGHRLFAARF--NDRLLGAVKVTISG-------QQAELSDLCVREVTRRRGVGLYLLEEVLRQL--PDIKHWWLADEGVE 104 (128)
T ss_dssp SSEEEEEEEE--TTEEEEEEEEEEET-------TEEEEEEEEE-TT-SSSSHHHHHHHHHHHHS---S--EEEE--TT-S
T ss_pred cCCeEEEEEe--chheeeeEEEEEcC-------cceEEeeEEEeeccccccHHHHHHHHHHHHC--CCCcEEEEecCCCc
Confidence 4455666665 79999999998653 3789999999999999999999999999876 46666666542 1
Q ss_pred -----CchhHHHhcCcEEcccee
Q psy6918 102 -----HMIPFYETFGFQKKNNFM 119 (123)
Q Consensus 102 -----~~~~~y~~~Gf~~~~~~~ 119 (123)
....|+..+||......+
T Consensus 105 ~~~~~~~~~Fm~a~GF~~~~~~W 127 (128)
T PF12568_consen 105 PQDRAVMAAFMQACGFSAQSDGW 127 (128)
T ss_dssp --THHHHHHHHHHHT-EE-SSSE
T ss_pred ccchHHHHHHHHHcCccccCCCC
Confidence 114699999998876554
No 65
>KOG4144|consensus
Probab=98.96 E-value=1.1e-09 Score=65.08 Aligned_cols=82 Identities=21% Similarity=0.296 Sum_probs=63.7
Q ss_pred CCeEEEEEEEEEeee---------cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc-CCeEEEEccCCCchh
Q psy6918 36 TKQVIGTGSLILEQK---------FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF-QCYKLTLDCADHMIP 105 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~---------~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~-~~~~i~~~~~~~~~~ 105 (123)
++.+||.+.....+. ....++....|..++|+|+||.+|+|..|+..-++...++ -..++.+-++++..+
T Consensus 70 ~~tLIghIigs~~~~E~lt~ESm~kh~s~g~ni~iHsl~Ihpa~rk~g~a~~Ll~~ylq~l~~q~i~~r~~Li~h~pLvP 149 (190)
T KOG4144|consen 70 EGTLIGHIIGSLWDKERLTQESMTKHRSGGHNIHIHSLAIHPAFRKQGRAPILLWRYLQHLGSQPIVRRAALICHDPLVP 149 (190)
T ss_pred cccceehhhcccCcchhhhHHHHhhhhcCCcceeEEEEEecHHHHhcCcchhHHHHHHHHhhcCccccceeeeecCCccc
Confidence 478888877653322 1123455589999999999999999999999966666555 446788889999999
Q ss_pred HHHhcCcEEccc
Q psy6918 106 FYETFGFQKKNN 117 (123)
Q Consensus 106 ~y~~~Gf~~~~~ 117 (123)
||+++||+..++
T Consensus 150 FYEr~gFk~vgp 161 (190)
T KOG4144|consen 150 FYERFGFKAVGP 161 (190)
T ss_pred hhHhcCceeecc
Confidence 999999998754
No 66
>KOG3138|consensus
Probab=98.90 E-value=3.9e-09 Score=65.34 Aligned_cols=59 Identities=27% Similarity=0.410 Sum_probs=52.4
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcC-CeEEEEc---cCCCchhHHHhcCcEEcc
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ-CYKLTLD---CADHMIPFYETFGFQKKN 116 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~-~~~i~~~---~~~~~~~~y~~~Gf~~~~ 116 (123)
..+|..+.|.|+||.+|+|+.|++.+.+++.... +..+.+. ++..++.||++.||+.+.
T Consensus 89 ~~yi~~Lgvl~~yR~~gIGs~Ll~~~~~~~~~~~~~~~v~lHv~~~n~~ai~~Y~~~gF~~~~ 151 (187)
T KOG3138|consen 89 VIYILSLGVLPRYRNKGIGSKLLEFVKKYCSEAHQCRRVYLHVQAVNESAIEFYEKRGFEIVE 151 (187)
T ss_pred eeEEEeecccHHHHhcchHHHHHHHHHHHHhcccccceEEEEEEeCCCcHHHHHHhcCceEee
Confidence 4789999999999999999999999999998876 7777765 477889999999999983
No 67
>COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only]
Probab=98.88 E-value=1.4e-08 Score=74.17 Aligned_cols=93 Identities=14% Similarity=0.174 Sum_probs=67.6
Q ss_pred hcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccc------------------------------cceeEEEEEEEecccc
Q psy6918 21 KASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHE------------------------------CALKGKIEEVVVDDTY 70 (123)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~------------------------------~~~~~~i~~~~v~p~~ 70 (123)
.+.+++.++++.-. ++.+|+.+.+..+...... .-....|..|+|+|++
T Consensus 465 ~DaP~h~~~al~~~-~~~~va~~qva~EG~l~~~~i~~~~~g~r~~GnlIp~~l~~~~~~~~fa~l~G~RIvRIAvhPe~ 543 (758)
T COG1444 465 LDAPHHHIFALRAP-EGKPVAVWQVAEEGGLSDELIDIWLGGRRPRGNLIPDLLAKHHRDPEFAKLVGWRIVRIAVHPEL 543 (758)
T ss_pred hcCCCCeeEEEEcC-CCceEEEEEeeccCCCcHHHHHHHhcCCCCCCcccHHHHHHhhcchhhcccceeeEEEEEeCHHH
Confidence 34555666655542 4588888887755432100 1123489999999999
Q ss_pred cCCchHHHHHHHHHHHHHhcCCeEEE--EccCCCchhHHHhcCcEEc
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQCYKLT--LDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~~~i~--~~~~~~~~~~y~~~Gf~~~ 115 (123)
|++|+|++|++.+.++++ .++..+. ...++...+||.|+||.++
T Consensus 544 q~~GiGsrlL~~l~~~a~-~~~DwlgvsFG~t~~L~rFW~rnGF~pV 589 (758)
T COG1444 544 QRMGIGSRLLALLIEEAR-KGLDWLGVSFGYTEELLRFWLRNGFVPV 589 (758)
T ss_pred HhcCHHHHHHHHHHHHHh-cCCCEEeeccCCCHHHHHHHHHcCeEEE
Confidence 999999999999999996 3444444 4468899999999999986
No 68
>COG2388 Predicted acetyltransferase [General function prediction only]
Probab=98.77 E-value=5.7e-08 Score=54.33 Aligned_cols=67 Identities=19% Similarity=0.175 Sum_probs=53.4
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
+....++..+ +|..+|.+...... .....|..-.|.+++||||+|++|++.+++.+++.|.+.+-++
T Consensus 13 ~~~~~y~~~~--~G~~~~e~~y~~~~------~~~i~i~HT~V~d~lrGqGia~~L~~~al~~ar~~g~kiiP~C 79 (99)
T COG2388 13 GENGRYVLTD--EGEVIGEATYYDRG------ENLIIIDHTYVPDELRGQGIAQKLVEKALEEAREAGLKIIPLC 79 (99)
T ss_pred cCceEEEEec--CCcEEEEEEEecCC------CCEEEEecCcCCHHHcCCcHHHHHHHHHHHHHHHcCCeEcccc
Confidence 3444555555 68899998887543 2567888999999999999999999999999999998655554
No 69
>COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics]
Probab=98.76 E-value=1.6e-08 Score=69.73 Aligned_cols=98 Identities=15% Similarity=0.228 Sum_probs=73.4
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeeccccc--c-eeE------------EEEEEEecccccCCchHHHHHHHHHHHHH
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHEC--A-LKG------------KIEEVVVDDTYRGKELGKLLIAVLVKLAK 88 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~--~-~~~------------~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~ 88 (123)
+..++...++.+++.++|++.+.....-.+.. . ..+ .|+. . ...||.+|+|++||+.+++.|+
T Consensus 403 g~e~Fls~ed~~~d~lig~lrlR~p~e~~~r~e~~~~~aivrelhvyg~~vpig~-~-~~~~QH~G~G~~L~~~AE~ia~ 480 (515)
T COG1243 403 GTEIFLSYEDPKNDILIGFLRLREPSEGAHREEIDDKTAIVRELHVYGSEVPIGK-R-EDEWQHRGYGRELLEEAERIAR 480 (515)
T ss_pred CEEEEeecccchhhhhhheeeecccccchhhhhcccchhhhhhhhcccccccccc-C-cchhhcccHHHHHHHHHHHHHH
Confidence 44444555555567899999998643311110 0 111 2333 1 4679999999999999999999
Q ss_pred hcCCeEEEEccCCCchhHHHhcCcEEccceeEeeC
Q psy6918 89 HFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIYF 123 (123)
Q Consensus 89 ~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~~ 123 (123)
+.+..+|.+.+.-....+|+|+||+.+++||.+++
T Consensus 481 ee~~~ki~viSgiG~ReYy~k~GY~~~gpYm~K~l 515 (515)
T COG1243 481 EEGAKKILVISGIGVREYYRKLGYELDGPYMSKRL 515 (515)
T ss_pred hhccccEEEEecccHHHHHHHhCccccCCcccccC
Confidence 99999999988889999999999999999998864
No 70
>COG3053 CitC Citrate lyase synthetase [Energy production and conversion]
Probab=98.75 E-value=1.2e-07 Score=62.17 Aligned_cols=78 Identities=23% Similarity=0.413 Sum_probs=69.0
Q ss_pred CeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCch
Q psy6918 25 DYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMI 104 (123)
Q Consensus 25 ~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~ 104 (123)
...+.+..+ +++||++.++. ...|.-++|+|.+||-|++-+|+..+++.+-+.|...+.+-+.+...
T Consensus 36 e~~v~~~~~--~~~iiacGsia-----------GnvikcvAvs~s~qGeGl~lkl~TeLin~ay~~g~~hLFiyTKp~~~ 102 (352)
T COG3053 36 EYFVAIYRD--NEEIIACGSIA-----------GNVIKCVAVSESLQGEGLALKLVTELINLAYERGRTHLFIYTKPEYA 102 (352)
T ss_pred eEEEEEEcC--CCcEEEecccc-----------cceeEEEEechhcccccHHHHHHHHHHHHHHHcCCceEEEEechhHH
Confidence 345555555 79999999986 23677899999999999999999999999999999999999999999
Q ss_pred hHHHhcCcEEc
Q psy6918 105 PFYETFGFQKK 115 (123)
Q Consensus 105 ~~y~~~Gf~~~ 115 (123)
.||+.+||.+.
T Consensus 103 ~lFk~~GF~~i 113 (352)
T COG3053 103 ALFKQCGFSEI 113 (352)
T ss_pred HHHHhCCceEe
Confidence 99999999986
No 71
>KOG4135|consensus
Probab=98.73 E-value=6.2e-07 Score=53.26 Aligned_cols=110 Identities=15% Similarity=0.250 Sum_probs=74.2
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCC---C-----CeEEEEEEEEEeeeccc----ccceeEEEEEEEecccccCC
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTR---T-----KQVIGTGSLILEQKFIH----ECALKGKIEEVVVDDTYRGK 73 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-----~~ivG~~~~~~~~~~~~----~~~~~~~i~~~~v~p~~rg~ 73 (123)
.+.+.++-.+.-..+.++.+.+-+++.+.+ . ...||-+.+.....+.+ +....+.+.-+.-.|..||+
T Consensus 43 E~LtLdeEyeMQ~sW~~DeDKlTFIVLdaE~~ea~~~ev~~MvGDvNlFlt~~~~~~n~s~~~~~gE~EvMIAEP~~RgK 122 (185)
T KOG4135|consen 43 EPLTLDEEYEMQKSWREDEDKLTFIVLDAEMNEAGEDEVDHMVGDVNLFLTTSPDTENPSDDVITGEVEVMIAEPRGRGK 122 (185)
T ss_pred CCcchhHHHHhhhhhccCCcceEEEEEechhcccCchhHhhhccceeeEEecCCCcCCcccceeeeeEEEEEecccccCC
Confidence 455666666666666666555555444321 1 24688888776543221 12234566667778999999
Q ss_pred chHHHHHHHHHHHHHhc-CCeEEEEcc---CCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHF-QCYKLTLDC---ADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~-~~~~i~~~~---~~~~~~~y~~~Gf~~~ 115 (123)
|+|++++.+++.++... ++.+-.+.. +++.++||.|++|...
T Consensus 123 G~G~eav~~ml~y~~s~l~l~Ky~vkig~~nk~sl~lFkk~~f~q~ 168 (185)
T KOG4135|consen 123 GIGTEAVRAMLAYAYSVLKLDKYEVKIGMDNKPSLRLFKKFLFTQV 168 (185)
T ss_pred CccHHHHHHHHHHHHHHhhhheEEEEecCCCchHHHHHHHhhheee
Confidence 99999999999999775 554444443 7888999999999886
No 72
>PF08444 Gly_acyl_tr_C: Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region; InterPro: IPR013652 This entry represents mammalian-specific glycine N-acyltransferase (also called aralkyl acyl-CoA:amino acid N-acyltransferase; 2.3.1.13 from EC). Mitochondrial acyltransferases catalyse the transfer of an acyl group from acyl-CoA to the N terminus of glycine to produce N-acylglycine. These enzymes can conjugate a multitude of substrates to form a variety of N-acylglycines. The CoA derivatives of a number of aliphatic and aromatic acids, but not phenylacetyl-CoA or (indol-3-yl)acetyl-CoA, can act as donor [, ].
Probab=98.70 E-value=5.8e-08 Score=53.03 Aligned_cols=69 Identities=12% Similarity=0.098 Sum_probs=59.4
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc---CCCchhHHHhcCc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC---ADHMIPFYETFGF 112 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~---~~~~~~~y~~~Gf 112 (123)
+|.+|.++.+. ..+.+..-++.|+|||+|+.+.++....+.+.+.|+. +...+ |+..+++..++||
T Consensus 7 eG~PVSW~lmd----------qtge~rmgyTlPeyR~~G~~~~v~~~~~~~L~~~g~P-~Y~hv~~~N~~~~r~~~~lg~ 75 (89)
T PF08444_consen 7 EGNPVSWSLMD----------QTGEMRMGYTLPEYRGQGLMSQVMYHLAQYLHKLGFP-FYGHVDEDNEASQRLSKSLGF 75 (89)
T ss_pred CCCEeEEEEec----------ccccccccccCHhHhcCCHHHHHHHHHHHHHHHCCCC-eEeehHhccHHHHHHHHHCCC
Confidence 69999998885 3578888999999999999999999999999999986 44443 5666889999999
Q ss_pred EEc
Q psy6918 113 QKK 115 (123)
Q Consensus 113 ~~~ 115 (123)
+..
T Consensus 76 ~~~ 78 (89)
T PF08444_consen 76 IFM 78 (89)
T ss_pred eec
Confidence 986
No 73
>COG4552 Eis Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]
Probab=98.65 E-value=2.2e-07 Score=62.32 Aligned_cols=82 Identities=16% Similarity=0.134 Sum_probs=64.9
Q ss_pred CCeEEEEEEEEEeee-cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 36 TKQVIGTGSLILEQK-FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~-~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
+.++++.+...+..- +-...-+.+.|..+++.|+|||+|..++||...++..++.|+....+ +|-+.+||+|+||..
T Consensus 47 nqkl~s~L~i~~f~~~f~~q~l~t~GIa~Vas~P~~R~~G~~~~Ll~~sLre~~~kG~p~s~L--~P~s~~iYrKfGye~ 124 (389)
T COG4552 47 NQKLASRLHIPPFIFWFGNQVLPTAGIAGVASAPTYRRRGALRALLAHSLREIARKGYPVSAL--HPFSGGIYRKFGYEY 124 (389)
T ss_pred hhhhhhcccccchheeeCCeeeeccceEEEEechhhccCcHHHHHHHHHHHHHHHcCCeeEEe--ccCchhhHhhccccc
Confidence 478888777652211 12233456789999999999999999999999999999999977766 588999999999998
Q ss_pred cccee
Q psy6918 115 KNNFM 119 (123)
Q Consensus 115 ~~~~~ 119 (123)
-..+.
T Consensus 125 asn~~ 129 (389)
T COG4552 125 ASNYH 129 (389)
T ss_pred cceEE
Confidence 75544
No 74
>PF00765 Autoind_synth: Autoinducer synthetase; InterPro: IPR001690 Bacterial species have many methods of controlling gene expression and cell growth. Regulation of gene expression in response to changes in cell density is termed quorum sensing [, ]. Quorum-sensing bacteria produce, release and respond to hormone-like molecules (autoinducers) that accumulate in the external environment as the cell population grows. Once a threshold of these molecules is reached, a signal transduction cascade is triggered that ultimately leads to behavioural changes in the bacterium []. Autoinducers are thus clearly important mediators of molecular communication. Conjugal transfer of Agrobacterium octopine-type Ti plasmids is activated by octopine, a metabolite released from plant tumours []. Octopine causes conjugal donors to secrete a pheromone, Agrobacterium autoinducer (AAI), and exogenous AAI further stimulates conjugation. The putative AAI synthase and an AAI-responsive transcriptional regulator have been found to be encoded by the Ti plasmid traI and traR genes, respectively. TraR and TraI are similar to the LuxR and LuxI regulatory proteins of Vibrio fischeri, and AAI is similar in structure to the diffusable V. fischeri autoinducer, the inducing ligand of LuxR. TraR activates target genes in the presence of AAI and also activates traR and traI themselves, creating two positive-feedback loops. TraR-AAI-mediated activation in wild-type Agrobacterium strains is enhanced by culturing on solid media, suggesting a possible role in cell density sensing []. Production of light by the marine bacterium V. fischeri and by recombinant hosts containing cloned lux genes is controlled by the density of the culture []. Density-dependent regulation of lux gene expression has been shown to require a locus consisting of the luxR and luxI genes. In these and other Gram-negative bacteria, N-(3-oxohexanoyl)-L-homoserine lactone (OHHL) acts as the autoinducer by binding to transcriptional regulatory proteins and activating them []. OHHL and related molecules, such as N-butanoyl- (BHL), N-hexanoyl- (HHL) and N-oxododecanoyl- (PAI) homoserine lactones, are produced by a family of proteins that share a high level of sequence similarity. Proteins which currently members of this family include: luxI from V. fischeri. ahyI and asaI from Aeromonas species, which synthesize BHL and whose targets are ahyR and asaR respectively. carI from Erwinia carotovora. The target of OHHL is carR which activates genes involved in the biosynthesis of carbapenem antibiotics. eagI from Enterobacter agglomerans. The target of OHHL is not yet known. esaI from Erwinia stewartii. expI from Erwinia carotovora. lasI from Pseudomonas aeruginosa, which synthesizes PAI and whose target is lasR which activates the transcription of the elastase gene. rhlI (or vsmI) from P. aeruginosa, which synthesizes BHL and HHL and whose target is rhlR. swrI from Serratia liquefaciens, which synthesizes BHL. yenI from Yersinia enterocolitica. ; GO: 0007165 signal transduction; PDB: 3P2H_A 3P2F_A 1KZF_A 1K4J_A 1RO5_A.
Probab=98.55 E-value=9.4e-07 Score=55.05 Aligned_cols=96 Identities=13% Similarity=0.098 Sum_probs=67.6
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeec---------------ccccceeEEEEEEEecccccC------Cch
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKF---------------IHECALKGKIEEVVVDDTYRG------KEL 75 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~---------------~~~~~~~~~i~~~~v~p~~rg------~G~ 75 (123)
++.+.+....+++ +.+ +|+++|++++.+...+ .+.......++.++|+|+.++ .-+
T Consensus 37 ~DqyD~~~~~ylv-~~~--~g~v~g~~RLlptt~p~ML~~~F~~ll~~~~~p~~~~vwE~SRf~v~~~~~~~~~~~~~~~ 113 (182)
T PF00765_consen 37 IDQYDDPDAVYLV-ALD--DGRVVGCARLLPTTGPYMLSDVFPHLLPDGPAPRSPDVWELSRFCVDPDRRRSRAGSRSPV 113 (182)
T ss_dssp --TTGCTT-EEEE-EEE--TTEEEEEEEEEETTS--HHHHCTGGGHTTS---SSTTEEEEEEEEE-HCCCHHCHSCC-TH
T ss_pred eeecCCCCCeEEE-EEE--CCEEEEEeeeccCCCcchhhhHHHHHhCCCCCCCCCcceeeeEEEEcccccccccccccHH
Confidence 4445555555444 444 5999999999864321 111245678999999998432 246
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
...|+..+.++|.++|+..+...+.....+++++.||...
T Consensus 114 ~~~L~~~~~e~a~~~gi~~~v~V~~~~~~r~l~r~G~~~~ 153 (182)
T PF00765_consen 114 TMELLLGMVEFALSNGIRHIVGVVDPAMERILRRAGWPVR 153 (182)
T ss_dssp HHHHHHHHHHHHHCTT-SEEEEEEEHHHHHHHHHCT-EEE
T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEChHHHHHHHHcCCceE
Confidence 7899999999999999999999888889999999999886
No 75
>PRK13834 putative autoinducer synthesis protein; Provisional
Probab=98.55 E-value=1.3e-06 Score=55.45 Aligned_cols=97 Identities=10% Similarity=0.093 Sum_probs=71.8
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeec---------------ccccceeEEEEEEEeccccc---CCc----
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKF---------------IHECALKGKIEEVVVDDTYR---GKE---- 74 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~---------------~~~~~~~~~i~~~~v~p~~r---g~G---- 74 (123)
.+.+.+....+++...+ +|+++|++.+.+...+ .......+.++.++|+|+++ +.+
T Consensus 45 ~D~yD~~~~~yll~~~~--~g~vvG~~RLlptt~p~ml~~~fp~l~~~~~~~~~~~v~E~SRf~V~~~~~~~~~~~~~~~ 122 (207)
T PRK13834 45 RDQFDDLKPTYILAISD--SGRVAGCARLLPAIGPTMLAQVFPQLLPAGRLNAHPAMIESSRFCVDTALAEGRGGGQLHE 122 (207)
T ss_pred ccCCCCCCCEEEEEEeC--CCeEEEEEecccCCCcchhhhhcHHhcCCCCCCCCCCEEEEeeeEEcccccccccccccCH
Confidence 44455555555555545 7899999998754221 11234577899999999863 212
Q ss_pred hHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 75 LGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+...|+..+.+++..+|+..+...+.+...+++.++|+...
T Consensus 123 ~~~~L~~~~~~~a~~~Gi~~~~~v~~~~~~r~l~r~G~~~~ 163 (207)
T PRK13834 123 ATLTMFAGIIEWSMANGYTEIVTATDLRFERILARAGWPMQ 163 (207)
T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHcCCCeE
Confidence 56789999999999999999998888888899999998765
No 76
>PF06852 DUF1248: Protein of unknown function (DUF1248); InterPro: IPR009658 This entry represents a conserved region within a number of proteins of unknown function that seem to be specific to Caenorhabditis elegans. Note that some proteins in the entry contain more than one copy of this region.
Probab=98.51 E-value=5.2e-06 Score=51.43 Aligned_cols=81 Identities=23% Similarity=0.218 Sum_probs=54.6
Q ss_pred CCeEEEEEEEEEeeecccc-cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH-HhcCcE
Q psy6918 36 TKQVIGTGSLILEQKFIHE-CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY-ETFGFQ 113 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~-~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y-~~~Gf~ 113 (123)
+.++|+.+.+....+.... ..+...++.+++.|+|||+|+++-+-+.+.+..+..+ .......+..+.++| +-+||.
T Consensus 55 T~~via~~~~~~~~~l~~~~d~pl~~~G~~w~~p~yRg~~~~kl~~~~~~~~~~~~~-~N~~~~~~~~~~~~w~k~~G~~ 133 (181)
T PF06852_consen 55 TDRVIATVHLIRFDPLNPSPDKPLQFIGFFWIDPEYRGKGIMKLQDDICMDELDSVD-DNSVAQGNVKMSNFWHKMFGFD 133 (181)
T ss_pred CCcEEEEEEEEEeccCCCCCCCCeEEEeeeeeCCcccCcchHHHHHHHHHHHhccCC-CceeeecCHHHHHHHHHHhCCC
Confidence 7899999988754432222 3567899999999999999999644444444444433 334445667777888 467877
Q ss_pred Eccc
Q psy6918 114 KKNN 117 (123)
Q Consensus 114 ~~~~ 117 (123)
..+.
T Consensus 134 ~~~h 137 (181)
T PF06852_consen 134 DYGH 137 (181)
T ss_pred CCcc
Confidence 6644
No 77
>PF13480 Acetyltransf_6: Acetyltransferase (GNAT) domain
Probab=98.37 E-value=3.4e-05 Score=45.56 Aligned_cols=85 Identities=14% Similarity=0.036 Sum_probs=66.7
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
++.+.+.+...+......+...++++.. +|++||+........ ..+.....++|+++..+.|..|+..+++
T Consensus 51 ~~~~~~~~~~l~~~~~~~~~~~l~~~~~--~g~~va~~~~~~~~~-------~~~~~~~g~~~~~~~~~~~~~l~~~~i~ 121 (142)
T PF13480_consen 51 PPFSRDFFRDLLRSLAESGRLRLFVLYD--GGEPVAFALGFRHGG-------TLYYWYGGYDPEYRKYSPGRLLLWEAIR 121 (142)
T ss_pred CcchHHHHHHHHHhhccCCCEEEEEEEE--CCEEEEEEEEEEECC-------EEEEEEEEECHhhHhCCHHHHHHHHHHH
Confidence 3455666666666665566777777777 799999987765432 4566667789999999999999999999
Q ss_pred HHHhcCCeEEEEcc
Q psy6918 86 LAKHFQCYKLTLDC 99 (123)
Q Consensus 86 ~~~~~~~~~i~~~~ 99 (123)
++.+.|+..+.+..
T Consensus 122 ~a~~~g~~~~d~g~ 135 (142)
T PF13480_consen 122 WAIERGLRYFDFGG 135 (142)
T ss_pred HHHHCCCCEEEECC
Confidence 99999998888865
No 78
>COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only]
Probab=98.37 E-value=4.3e-06 Score=48.74 Aligned_cols=85 Identities=18% Similarity=0.114 Sum_probs=62.6
Q ss_pred EEEcCCCCeEEEEEEEEEeeecccc---------cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc-
Q psy6918 30 VIEDTRTKQVIGTGSLILEQKFIHE---------CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC- 99 (123)
Q Consensus 30 ~~~~~~~~~ivG~~~~~~~~~~~~~---------~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~- 99 (123)
+++. +|.+.|++.......-+++ -.+..++..+.|....||.|+|++|.+.+-++++..|+..+.+.+
T Consensus 49 ~ArR--~G~l~afl~tFd~~a~ydSpNFlWFrErYe~F~YvDRvVVA~~aRGrG~aRalY~Dlf~~Ae~agy~~~tCEVn 126 (167)
T COG3818 49 VARR--DGNLAAFLVTFDSSARYDSPNFLWFRERYENFFYVDRVVVASRARGRGVARALYADLFSYAELAGYPYLTCEVN 126 (167)
T ss_pred HHhh--ccchhhheeeccccccCCCCceeehhhhCCceEEEEEEEEEecccccchHHHHHHHHHHHHHhcCCceEEEEec
Confidence 4555 5777776654422211111 234568999999999999999999999999999999998887765
Q ss_pred -C---CCchhHHHhcCcEEcc
Q psy6918 100 -A---DHMIPFYETFGFQKKN 116 (123)
Q Consensus 100 -~---~~~~~~y~~~Gf~~~~ 116 (123)
+ +.+..|...+||.+++
T Consensus 127 ~DppnpasdaFHaalGF~eVG 147 (167)
T COG3818 127 LDPPNPASDAFHAALGFHEVG 147 (167)
T ss_pred CCCCChHHHHHhhhcCceEcc
Confidence 2 3445689999999984
No 79
>TIGR03694 exosort_acyl putative PEP-CTERM/exosortase system-associated acyltransferase. Members of this protein family are restricted to bacterial species with the PEP-CTERM/exosortase system predicted to act in exopolysaccharide-associated protein targeting. PSI-BLAST and CDD reveal relationships to the acyltransferase family that includes N-acyl-L-homoserine lactone synthetase. Several members of this family may be found in a single genome. These proteins likely contribute to chemical modifications in exopolysaccharide and biofilm structural material production.
Probab=98.35 E-value=4e-06 Score=54.52 Aligned_cols=80 Identities=16% Similarity=0.206 Sum_probs=64.0
Q ss_pred CCeEEEEEEEEEee---e---c----c----------c----ccceeEEEEEEEecccccCC--------c---------
Q psy6918 36 TKQVIGTGSLILEQ---K---F----I----------H----ECALKGKIEEVVVDDTYRGK--------E--------- 74 (123)
Q Consensus 36 ~~~ivG~~~~~~~~---~---~----~----------~----~~~~~~~i~~~~v~p~~rg~--------G--------- 74 (123)
+|++||++++.+.. + + . . .....+.++.++|+|++|+. |
T Consensus 65 ~g~vvG~~RLl~t~~~~p~~~~p~e~~~~~~~~~~~~~~~~~~~~~i~E~SRf~V~~~~r~r~~~~~~~~~~~~~~~~~~ 144 (241)
T TIGR03694 65 TGTFVGCVRLVLPNSSDPDQPFPFEKHCSHSLDGLFLDPRRLPRSRIAEVSRLAVSKDFRRRKGEKLKPSGVGVIETEAP 144 (241)
T ss_pred CCCEEEEEEEeccccccccccccHHHHhccccchhhcCccccCCCceEEeehheECHhHhCCcccccccccccccccccc
Confidence 68999999998641 0 0 0 0 12456788999999999873 1
Q ss_pred -----------hHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 75 -----------LGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 75 -----------~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+...|+..+.++|.++|+..+...+.+...+++.++|+...
T Consensus 145 ~~~~~~~~~~~~~~~L~~~~~~~a~~~Gi~~~~~v~~~~l~r~l~r~G~~~~ 196 (241)
T TIGR03694 145 FSESERRRFPHIPLGLYLGLIALSSANGITHWYAIMEPRLARLLSRFGIQFR 196 (241)
T ss_pred cchhhcccCchHHHHHHHHHHHHHHHCCCcEEEEEeCHHHHHHHHHhCCceE
Confidence 56789999999999999999999898999999999998665
No 80
>COG5628 Predicted acetyltransferase [General function prediction only]
Probab=98.32 E-value=8.4e-06 Score=46.78 Aligned_cols=74 Identities=8% Similarity=0.127 Sum_probs=53.9
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEE-EEccCCCchhHHHhcCcE
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKL-TLDCADHMIPFYETFGFQ 113 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i-~~~~~~~~~~~y~~~Gf~ 113 (123)
++.|||++.+.- ....+..-...+.+++|...|||+|+|++..+++-.. +.|...| .+.-|.++..||++.-+.
T Consensus 45 ~~~~igf~l~L~--~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~--~~g~w~Va~i~EN~PA~~fwK~~~~t 119 (143)
T COG5628 45 GGLPVGFALVLD--LAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGS--AWGVWQVATVRENTPARAFWKRVAET 119 (143)
T ss_pred CCceeeeeeeec--ccCCCCcccccchheEeeehhhccchhHHHHHHHHHH--hhceEEEEEeccCChhHHHHHhhhcc
Confidence 589999998863 2233333445788899999999999999999988644 4455444 344588999998876544
No 81
>COG0454 WecD Histone acetyltransferase HPA2 and related acetyltransferases [Transcription / General function prediction only]
Probab=98.31 E-value=8.2e-07 Score=49.99 Aligned_cols=44 Identities=27% Similarity=0.420 Sum_probs=35.5
Q ss_pred EEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCc
Q psy6918 64 VVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGF 112 (123)
Q Consensus 64 ~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf 112 (123)
++|+|++||+|+|++|++.++++++..++. .+.....++.+.||
T Consensus 87 l~v~~~~rg~Gig~~Ll~~~~~~~~~~g~~-----~~~~~~~~~~~~~~ 130 (156)
T COG0454 87 LYVLPEYRGKGIGSALLEAALEWARKRGIS-----LNRLALEVYEKNGF 130 (156)
T ss_pred EEecchhhccchHHHHHHHHHHHHHHcCce-----ehHHHHHHHHhcCC
Confidence 999999999999999999999999998875 33444555555554
No 82
>COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]
Probab=98.28 E-value=1.8e-05 Score=49.70 Aligned_cols=97 Identities=10% Similarity=0.045 Sum_probs=71.7
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeec---------------ccccceeEEEEEEEecc--cc---cCCc-h
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKF---------------IHECALKGKIEEVVVDD--TY---RGKE-L 75 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~---------------~~~~~~~~~i~~~~v~p--~~---rg~G-~ 75 (123)
++.+.+..+.++++... +|+|+|++++.+...+ .+........+.++|++ .- .+.. .
T Consensus 44 ~DqyD~~~t~Yll~~~~--~g~I~G~~RlLptt~P~mL~~vF~~Ll~~~~~P~~p~vwEsSRF~vd~~~a~~~~g~~~~a 121 (209)
T COG3916 44 IDQYDNLDTVYLLALTS--DGRIVGCVRLLPTTGPYMLTDVFPALLEGGPPPSSPGVWESSRFAVDKPSARRAAGGVSPA 121 (209)
T ss_pred ccccCCCCceEEEEEcC--CCcEEEEEEeccCCCcchhhhhhHHHhcCCCCCCCCCeEEEeeeeeccccchhhcCCccHH
Confidence 45556666666665555 8999999999854221 11123456777898886 22 2333 3
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+.+|+.-+++++.+.|++.|...+.....+.+.+.||..+
T Consensus 122 ~~el~~g~ie~a~~~G~~~IvtVt~~~meril~r~Gw~~~ 161 (209)
T COG3916 122 AYELFAGMIEYALARGITGIVTVTDTGMERILRRAGWPLT 161 (209)
T ss_pred HHHHHHHHHHHHHHcCCceEEEEEchHHHHHHHHcCCCeE
Confidence 7899999999999999999999999999999999999876
No 83
>PF05301 Mec-17: Touch receptor neuron protein Mec-17; InterPro: IPR007965 Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor []. This family is conserved to higher eukaryotes.; GO: 0019799 tubulin N-acetyltransferase activity
Probab=97.87 E-value=0.00054 Score=39.50 Aligned_cols=51 Identities=16% Similarity=0.161 Sum_probs=36.5
Q ss_pred CCeEEEEEEEEEeeeccccc-------ceeEEEEEEEecccccCCchHHHHHHHHHHH
Q psy6918 36 TKQVIGTGSLILEQKFIHEC-------ALKGKIEEVVVDDTYRGKELGKLLIAVLVKL 86 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~-------~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~ 86 (123)
.+.++|++-+....-+..+. .+..-|-+++|+++.|++|+|++|.+.+++.
T Consensus 17 ~g~viG~LKVG~K~Lfl~d~~g~~~e~~~~~cvLDFyVhes~QR~G~Gk~LF~~ML~~ 74 (120)
T PF05301_consen 17 KGAVIGFLKVGYKKLFLLDERGQHREIEPLLCVLDFYVHESRQRRGYGKRLFDHMLQE 74 (120)
T ss_pred CceEEEEEEEeeeeEEEEcCCCCEEEecccceeeeEEEEeceeccCchHHHHHHHHHH
Confidence 36799998776543322221 1222567899999999999999999999854
No 84
>KOG2535|consensus
Probab=97.82 E-value=2.3e-05 Score=53.02 Aligned_cols=102 Identities=17% Similarity=0.196 Sum_probs=70.8
Q ss_pred hcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccc---cceeE-----EEEE--EEe---cc-cccCCchHHHHHHHHHHH
Q psy6918 21 KASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHE---CALKG-----KIEE--VVV---DD-TYRGKELGKLLIAVLVKL 86 (123)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~---~~~~~-----~i~~--~~v---~p-~~rg~G~g~~Ll~~~~~~ 86 (123)
.+.++-.++.++|...+-+||.+.+......... .+... .+.. +.| +| .||.||+|+-||+.++..
T Consensus 436 ANgGWETFlSYEDpkqDILiGLLRLRkcs~~~~~~el~g~~SivRELHVYGs~vpv~~rDp~KfQHQG~GtLLmeEAERI 515 (554)
T KOG2535|consen 436 ANGGWETFLSYEDPKQDILIGLLRLRKCSKKTTRPELFGSQSIVRELHVYGSVVPVHSRDPTKFQHQGFGTLLMEEAERI 515 (554)
T ss_pred ccCChheeecccCcchhHHHHHHHHhhcccccccchhcCccchheeeeecceeeecccCCchhhhhcchhhHHHHHHHHH
Confidence 4456666667777667788998887643221110 01111 1111 222 33 489999999999999999
Q ss_pred HHhc-CCeEEEEccCCCchhHHHhcCcEEccceeEee
Q psy6918 87 AKHF-QCYKLTLDCADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 87 ~~~~-~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
|++. |-.+|.+-..-...++|.|+||+.+++||.+.
T Consensus 516 Ar~EHgS~KiavISGVGtR~YY~klGY~LdGPYM~K~ 552 (554)
T KOG2535|consen 516 AREEHGSGKIAVISGVGTRNYYRKLGYELDGPYMVKM 552 (554)
T ss_pred HHHhcCCCceEEEeccchHHHHHhhCeeecChhHhhh
Confidence 9874 77788777777888999999999999998764
No 85
>PF13880 Acetyltransf_13: ESCO1/2 acetyl-transferase
Probab=97.80 E-value=2.8e-05 Score=40.69 Aligned_cols=29 Identities=17% Similarity=0.137 Sum_probs=26.1
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHH
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKL 86 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~ 86 (123)
..-|..|+|+|.+|++||+++||+.+.+.
T Consensus 5 ~~GI~RIWV~~~~RR~GIAt~Lld~ar~~ 33 (70)
T PF13880_consen 5 VCGISRIWVSPSHRRKGIATRLLDAAREN 33 (70)
T ss_pred EEEeEEEEeChhhhhhhHHHHHHHHHHHh
Confidence 45688899999999999999999999875
No 86
>PF01233 NMT: Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain; InterPro: IPR022676 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the N-terminal region. ; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 4A33_A 3H5Z_A 4A2Z_A 2WSA_A ....
Probab=97.78 E-value=0.0021 Score=39.02 Aligned_cols=64 Identities=17% Similarity=0.193 Sum_probs=49.1
Q ss_pred CCeEEEEEEEEEeeeccc-ccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 36 TKQVIGTGSLILEQKFIH-ECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~-~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
++++||+++..+..--.. .......|..++|+++.|.++++--|++.+.+++-..|+..-.-++
T Consensus 87 ~~kLvgfIsaip~~irv~~~~~~~~eINFLCVhKklRskrlAPvLIkEItRRvn~~gI~qAvyTa 151 (162)
T PF01233_consen 87 SKKLVGFISAIPATIRVRDKVIKMVEINFLCVHKKLRSKRLAPVLIKEITRRVNLQGIWQAVYTA 151 (162)
T ss_dssp TTEEEEEEEEEEEEEEETTEEEEEEEEEEEEE-GGGTTSSHHHHHHHHHHHHHHTTT--EEEEEE
T ss_pred CCEEEEEEccceEEEEEeeeEeeeeeEEEEeecHhHhhcCCcHHHHHHHHHHhhhcCceeeeeec
Confidence 799999999886543222 2334578999999999999999999999999999999986665543
No 87
>PF04958 AstA: Arginine N-succinyltransferase beta subunit; InterPro: IPR007041 Arginine N-succinyltransferase catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism via the arginine succinyltransferase pathway. Six major L-arginine-degrading pathways have been described for prokaryotes []. Many bacteria arginine succinyltransferase 2.3.1.109 from EC, which is the AstA protein of the succinyltransferase (ast) pathway operon consists of five genes. In a few species, such as Pseudomonas aeruginosa, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST). This entry represents the family of proteins that make up the beta subunit of the heterodimer of Ast and AOST.; GO: 0008791 arginine N-succinyltransferase activity, 0006527 arginine catabolic process; PDB: 1YLE_A.
Probab=97.74 E-value=0.0022 Score=43.76 Aligned_cols=105 Identities=22% Similarity=0.368 Sum_probs=59.2
Q ss_pred CCCHHHHHHHHHh----h---h--c-CCCeEEEEEEcCCCCeEEEEEEEEEe----ee----------------------
Q psy6918 7 TSNVFHFAERFHR----M---K--A-SQDYLVTVIEDTRTKQVIGTGSLILE----QK---------------------- 50 (123)
Q Consensus 7 ~~~~~~~~~~~~~----~---~--~-~~~~~~~~~~~~~~~~ivG~~~~~~~----~~---------------------- 50 (123)
|.+.+.+.+.+.. + . . .+..+++|.+|.++|+|+|++.+.-. .+
T Consensus 30 P~d~~~L~~rI~~S~~sFa~~~~~~~~~~~YlfVLED~~tg~vvGts~I~a~vG~~~PfY~yr~~~~vh~S~~L~v~~~~ 109 (342)
T PF04958_consen 30 PPDREALAERIERSERSFAGRDVDFPGDEGYLFVLEDTETGEVVGTSAIEAAVGLDEPFYSYRVSTLVHASRELGVRNRH 109 (342)
T ss_dssp -S-HHHHHHHHHHHHHHHH-TT----S--EEEEEEEETTT--EEEEEEEESSTTSSS---EEEEEEEEEEETTTTEEEEE
T ss_pred CCCHHHHHHHHHHHHHHhhccccCCCCccceEEEEEecCCCcEEEEEeEEeccCCCCCcEEEEcCceeEcCcccCCccce
Confidence 4566666666533 2 1 1 14468889998779999999997621 00
Q ss_pred ----cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc--CC-eEEEEcc----C-CCchhHHHhcC
Q psy6918 51 ----FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF--QC-YKLTLDC----A-DHMIPFYETFG 111 (123)
Q Consensus 51 ----~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~-~~i~~~~----~-~~~~~~y~~~G 111 (123)
..++......|..++++|+||+.|.|+-|-..-.=++.+. .+ .+|.... + ....+||+.+|
T Consensus 110 ~~L~L~~d~tG~sEl~tLfL~p~~R~~~~G~lLSr~RfLFiA~~~~rF~~~viAElrG~~De~G~SPFWdalG 182 (342)
T PF04958_consen 110 ETLTLSNDYTGCSELCTLFLDPDYRGGGNGRLLSRSRFLFIAQHRERFADRVIAELRGVSDEDGRSPFWDALG 182 (342)
T ss_dssp EEEEEE-TTTTSEEEEEEEE-GGGTTSHHHHHHHHHHHHHHHH-GGGS-SEEEEE--B---TT---HHHHHTG
T ss_pred eeEeeecCCCCCeeeEEEEECHHHcCCchHHHHHHHHHHHHHhChhhcchheeeeccCCcCCCCCCchHHHhh
Confidence 1122233457899999999999999988877666665543 22 4444432 2 34466888776
No 88
>COG3375 Uncharacterized conserved protein [Function unknown]
Probab=97.57 E-value=0.0024 Score=40.88 Aligned_cols=86 Identities=10% Similarity=0.005 Sum_probs=60.0
Q ss_pred HHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc
Q psy6918 11 FHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF 90 (123)
Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~ 90 (123)
......+......+...+-...+ ++++||.....+. ...+.-..+=+...|.|++++.|+|-+|=.+=-+++.++
T Consensus 32 ~~~~d~i~al~~~GGlvlgAf~~--dg~lVGls~G~pg---~r~g~~y~ySH~~gV~e~~k~sglg~aLK~~Qre~a~~~ 106 (266)
T COG3375 32 GAPADTIRALRYHGGLVLGAFSA--DGRLVGLSYGYPG---GRGGSLYLYSHMLGVREEVKGSGLGVALKMKQRERALSM 106 (266)
T ss_pred cchHHHHHHHHhcCCeEEEEEcC--CCcEEEEEeccCC---cCCCceeeeeeehhccccccccchhhhhHHHHHHHHHhc
Confidence 34444555444556655555555 6799998877642 112222344455899999999999999998888999999
Q ss_pred CCeEEEEccCC
Q psy6918 91 QCYKLTLDCAD 101 (123)
Q Consensus 91 ~~~~i~~~~~~ 101 (123)
|+..+.++-+|
T Consensus 107 G~tli~WTfDP 117 (266)
T COG3375 107 GYTLIAWTFDP 117 (266)
T ss_pred CeeeEEEeccc
Confidence 99999886543
No 89
>TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members
Probab=97.48 E-value=0.0096 Score=40.55 Aligned_cols=106 Identities=11% Similarity=-0.013 Sum_probs=72.9
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEE-cCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIE-DTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
+..+.+.++..++.+. +...++++. . +|++||.+.+..... .........+++++..+-+.-|+-.++
T Consensus 177 p~~~~~~f~~l~~~~~--~~~~l~~a~~~--~g~~va~~l~~~~~~-------~~~~~~~g~~~~~~~~~~~~lL~w~~i 245 (330)
T TIGR03019 177 PVFSRRYFRLLKDVFG--EDCEVLTVRLG--DGVVASAVLSFYFRD-------EVLPYYAGGLREARDVAANDLMYWELM 245 (330)
T ss_pred CCCCHHHHHHHHHhcc--cCEEEEEEEeC--CCCEEEEEEEEEeCC-------EEEEEeccChHHHHhhChHHHHHHHHH
Confidence 4566777776666542 334455555 4 688888866654322 222223456789999999999999999
Q ss_pred HHHHhcCCeEEEEccCC---CchhHHHhcCcEEccceeEee
Q psy6918 85 KLAKHFQCYKLTLDCAD---HMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 85 ~~~~~~~~~~i~~~~~~---~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
+++.++|++...+.... ...+|=+++||+++..++.++
T Consensus 246 ~~a~~~G~~~fDfG~s~~~~G~~~FK~~~G~~~~~l~~~~~ 286 (330)
T TIGR03019 246 RRACERGLRVFDFGRSKRGTGPFKFKKNWGFEPQPLHYEYL 286 (330)
T ss_pred HHHHHCCCcEEEcCCCCCCCccHHHHhcCCCeeccceEEEE
Confidence 99999999988886533 334566788999986666443
No 90
>TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST).
Probab=97.46 E-value=0.0014 Score=44.44 Aligned_cols=105 Identities=21% Similarity=0.338 Sum_probs=63.8
Q ss_pred CCCHHHHHHHHHhhh-------c-CCCeEEEEEEcCCCCeEEEEEEEEEe----eec-----------------------
Q psy6918 7 TSNVFHFAERFHRMK-------A-SQDYLVTVIEDTRTKQVIGTGSLILE----QKF----------------------- 51 (123)
Q Consensus 7 ~~~~~~~~~~~~~~~-------~-~~~~~~~~~~~~~~~~ivG~~~~~~~----~~~----------------------- 51 (123)
|.+.+.+.+.++... . .+..+++|.+|.++|+|+|++.+.-. .++
T Consensus 28 P~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~vG~~~PfY~yr~~~~vhaS~~L~v~~~~~~ 107 (336)
T TIGR03244 28 PANEDLLSARIERAEKTFSGELTRAEQGYLFVLEDTETGTVAGVSAIEAAVGLEEPFYNYRVGTVVHASKELGIYKALET 107 (336)
T ss_pred CCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCeEEEEEeEEecccCCCCCEEEEcCceeecCcccCCceeeee
Confidence 556666666654321 1 24677888898778999999987621 111
Q ss_pred ---ccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc--CC-eEEEEcc-----CCCchhHHHhcC
Q psy6918 52 ---IHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF--QC-YKLTLDC-----ADHMIPFYETFG 111 (123)
Q Consensus 52 ---~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~-~~i~~~~-----~~~~~~~y~~~G 111 (123)
.++......|..++++|+||+.|.|+-|-+.-.=++.+. -+ ++|.... .....+||+.+|
T Consensus 108 L~l~nd~tG~SElctLfL~p~~R~~~~G~LLSr~RfLFiA~~~erF~~~viAEmrG~~De~G~SPFWd~lg 178 (336)
T TIGR03244 108 LFLSNDLTGYSELCTLFLDPDYRKGGNGRLLSKSRFLFIAQFRERFSKKIIAEMRGVSDEQGRSPFWNALG 178 (336)
T ss_pred EEeeccCCCCeeeEEEEECHHHcCCcchhhHHHHHHHHHHhhHhhhhhhhhhhhcCccCCCCCCchHHHhh
Confidence 111233447889999999999998887766544444332 11 2333321 233456877766
No 91
>PRK10456 arginine succinyltransferase; Provisional
Probab=97.45 E-value=0.0015 Score=44.51 Aligned_cols=105 Identities=21% Similarity=0.324 Sum_probs=64.0
Q ss_pred CCCHHHHHHHHHhh-------hc-CCCeEEEEEEcCCCCeEEEEEEEEEe----eec-----------------------
Q psy6918 7 TSNVFHFAERFHRM-------KA-SQDYLVTVIEDTRTKQVIGTGSLILE----QKF----------------------- 51 (123)
Q Consensus 7 ~~~~~~~~~~~~~~-------~~-~~~~~~~~~~~~~~~~ivG~~~~~~~----~~~----------------------- 51 (123)
|.+.+.+.+.++.. .. .+..+++|.+|.++|+++|++.+.-. .++
T Consensus 30 P~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG~~~PfY~yr~~~~vhaS~~L~v~~~~~~ 109 (344)
T PRK10456 30 PANEATLAARIERALKTWQGELPKSEQGYVFVLEDSETGTVAGICAIEVAVGLNDPWYNYRVGTLVHASKELNVYNALPT 109 (344)
T ss_pred CCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCcEEEEEeEEecccCCCCCEEEEcCceeecCcccCCceeeee
Confidence 45666666665432 11 34567888898778999999987631 111
Q ss_pred ---ccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc--CC-eEEEEcc-----CCCchhHHHhcC
Q psy6918 52 ---IHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF--QC-YKLTLDC-----ADHMIPFYETFG 111 (123)
Q Consensus 52 ---~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~-~~i~~~~-----~~~~~~~y~~~G 111 (123)
.++......|..++++|+||+.|.|+-|-+.-.=++.+. .+ .+|.... .....+||+.+|
T Consensus 110 L~l~nd~tG~sElctLfl~p~~R~~~~G~LLSr~RfLFiA~~~erF~~~viAEmRG~~De~G~SPFWd~lg 180 (344)
T PRK10456 110 LFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNDKVVAEMRGVIDEHGYSPFWQSLG 180 (344)
T ss_pred EEeeccCCCCceeEEEEECHHHcCCCchhHHHHHHHHHHHhhHhhhhhhhheeccCccCCCCCCccHHHhh
Confidence 111223347888999999999998887766544444332 11 3333321 234456887766
No 92
>TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit.
Probab=97.44 E-value=0.0016 Score=44.14 Aligned_cols=105 Identities=20% Similarity=0.357 Sum_probs=64.0
Q ss_pred CCCHHHHHHHHHh----h-----hcCCCeEEEEEEcCCCCeEEEEEEEEEe----eec----------------------
Q psy6918 7 TSNVFHFAERFHR----M-----KASQDYLVTVIEDTRTKQVIGTGSLILE----QKF---------------------- 51 (123)
Q Consensus 7 ~~~~~~~~~~~~~----~-----~~~~~~~~~~~~~~~~~~ivG~~~~~~~----~~~---------------------- 51 (123)
|.+.+.+.+.++. + ...+..+++|.+|.++|+++|++.+.-. .++
T Consensus 28 P~d~~~L~~rI~~S~~sF~~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~vG~~~PfY~yr~~~~vhaS~~L~v~~~~~ 107 (336)
T TIGR03245 28 PADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIVASAGYGEPFYSYRNDTLIHASRELKVNNKIH 107 (336)
T ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCCCccEEEEEEeCCCCcEEEEEeEEecccCCCCCEEEEcCceeecCcccCCcccee
Confidence 4556666665533 2 1235677888998779999999987631 111
Q ss_pred ----ccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc--CC-eEEEEcc-----CCCchhHHHhcC
Q psy6918 52 ----IHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF--QC-YKLTLDC-----ADHMIPFYETFG 111 (123)
Q Consensus 52 ----~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~-~~i~~~~-----~~~~~~~y~~~G 111 (123)
.++......|..++++|+||+.|.|+-|-+.-.=++.+. -+ ++|.... .....+||+.+|
T Consensus 108 ~L~l~nd~tG~sElctLfL~p~~R~~~~G~lLSr~RfLFiA~~~erF~~~viAEmrG~~De~G~SPFWd~lg 179 (336)
T TIGR03245 108 VLYMCHELTGSSLLCSFYVDPRLRKTEAAELLSRARLLFMAAHRERFQSRIIVEIQGVQDDNGDSPFWDAIG 179 (336)
T ss_pred eEEeeccCCCCeeeEEEEECHHHcCCCchhHHHHHHHHHHHhhHhhhhhhheeeccCccCCCCCCccHHHhh
Confidence 111233447889999999999998887766554444332 12 3343322 234456887766
No 93
>TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST).
Probab=97.43 E-value=0.0018 Score=43.99 Aligned_cols=105 Identities=21% Similarity=0.348 Sum_probs=64.3
Q ss_pred CCCHHHHHHHHHh----h----hcCCCeEEEEEEcCCCCeEEEEEEEEEe----eec-----------------------
Q psy6918 7 TSNVFHFAERFHR----M----KASQDYLVTVIEDTRTKQVIGTGSLILE----QKF----------------------- 51 (123)
Q Consensus 7 ~~~~~~~~~~~~~----~----~~~~~~~~~~~~~~~~~~ivG~~~~~~~----~~~----------------------- 51 (123)
|.+.+.+.+.++. + ...+..+++|.+|.++|+|+|++.+... .++
T Consensus 28 P~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG~~~PfY~yrv~~~vhaS~~L~v~~~~~~ 107 (335)
T TIGR03243 28 PADRAALGSRIARSEKSFAGESTRGEEGYLFVLEDTETGTVAGVSAIEAAVGLDEPFYNYRVGTLVHASRELGVYNKIPT 107 (335)
T ss_pred CCCHHHHHHHHHHHHHHHhcccCCCCccEEEEEEeCCCCeEEEEEeEEecccCCCCCEEEEcCceeecCcccCCccceee
Confidence 4556666555432 2 1236677888998779999999987631 111
Q ss_pred ---ccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc--CC-eEEEEcc-----CCCchhHHHhcC
Q psy6918 52 ---IHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF--QC-YKLTLDC-----ADHMIPFYETFG 111 (123)
Q Consensus 52 ---~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~-~~i~~~~-----~~~~~~~y~~~G 111 (123)
.++......|..++++|+||+.|.|+-|-+.-.=++.+. .+ .+|.... .....+||+.+|
T Consensus 108 L~l~nd~tG~sElctLfL~p~~R~~~~G~LLSr~RfLFiA~~~erF~~~viAEmrG~~De~G~SPFWd~lg 178 (335)
T TIGR03243 108 LTLSNDLTGSSELCTLFLDPDYRKGGNGRLLSRSRFLFIAAFRERFGDKIIAEMRGVSDEQGRSPFWEALG 178 (335)
T ss_pred EEeeccCCCCeeeEEEEECHHHcCCCchhhHHHHHHHHHHhhHhhhhhhheeeccCccCCCCCCccHHHhh
Confidence 111233447889999999999998887766554444332 12 3333322 234456887766
No 94
>PF04377 ATE_C: Arginine-tRNA-protein transferase, C terminus; InterPro: IPR007472 Arginine-tRNA-protein transferase catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N terminus rule pathway of protein degradation by conjugating a destabilising amino acid to the N-terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N-terminal cysteine is sometimes modified []. In Saccharomyces cerevisiae, Cys20, 23, 94 and/or 95 are thought to be important for activity []. Of these, only Cys 94 appears to be completely conserved in this family. This entry represents the C-terminal region of the enzyme arginine-tRNA-protein transferase, found in both eukaryotic and prokaryotic enzymes.; GO: 0004057 arginyltransferase activity, 0016598 protein arginylation
Probab=97.28 E-value=0.012 Score=34.66 Aligned_cols=54 Identities=17% Similarity=0.222 Sum_probs=44.0
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEE--EEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEE--VVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~--~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
+|++||++.+...++ .++. .+=+|++....+|+-.+-.-+++|++.|++.+.+.
T Consensus 47 ~~kLiav~v~D~l~~---------glSaVY~fyDPd~~~~SlG~~~iL~eI~~a~~~~l~y~YLG 102 (128)
T PF04377_consen 47 DGKLIAVAVVDILPD---------GLSAVYTFYDPDYSKRSLGTYSILREIELARELGLPYYYLG 102 (128)
T ss_pred CCeEEEEEEeecccc---------hhhheeeeeCCCccccCcHHHHHHHHHHHHHHcCCCEEeeC
Confidence 799999988875432 2333 23499999999999999999999999999988875
No 95
>KOG2696|consensus
Probab=96.94 E-value=0.0061 Score=41.73 Aligned_cols=66 Identities=15% Similarity=0.229 Sum_probs=42.6
Q ss_pred CCCeEEEEEEcC-CCC----eEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc
Q psy6918 23 SQDYLVTVIEDT-RTK----QVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF 90 (123)
Q Consensus 23 ~~~~~~~~~~~~-~~~----~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~ 90 (123)
+.+.++++.+.. +++ .++|+..+.-....++. ....|.-+.+.|.||++|+|+.|++.+.......
T Consensus 179 e~w~~~lv~EK~~~d~~~ly~~~gy~tiyk~y~yid~--~R~RiSQmlilpPfq~~Glgs~l~E~i~r~~~~~ 249 (403)
T KOG2696|consen 179 ECWLIYLVYEKKEEDGDTLYAYVGYYTIYKFYEYIDR--IRPRISQMLILPPFQGKGLGSQLYEAIARDYLEE 249 (403)
T ss_pred CceEEEEeeeecccCCceeEeeeeeEEEeehhhhhhh--hhhhhheeEEeccccCCchHHHHHHHHHHhhccC
Confidence 344555555543 122 45666555432222222 4568888999999999999999999999655443
No 96
>PRK01305 arginyl-tRNA-protein transferase; Provisional
Probab=96.88 E-value=0.057 Score=35.32 Aligned_cols=78 Identities=12% Similarity=0.098 Sum_probs=53.9
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEE--EEecccccCCchHHHHHHHHHH
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEE--VVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~--~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
.+.+.+...+.... .....+-..+ +|++||++.+...++ -|+. .+=+|++-..++|+-.+-.-++
T Consensus 128 ~~~~~y~~Fl~~~~--~~t~~~ey~~--~g~LiaVav~D~l~d---------~lSAVY~FyDPd~~~~SLG~~~iL~qI~ 194 (240)
T PRK01305 128 PSRDQYAQFLEDSW--VNTRFIEFRG--DGKLVAVAVTDVLDD---------GLSAVYTFYDPDEEHRSLGTFAILWQIE 194 (240)
T ss_pred CCHHHHHHHHhcCC--CCcEEEEEEe--CCeEEEEEEEeccCC---------ceeeEEEeeCCCccccCCHHHHHHHHHH
Confidence 45555554444221 2222333344 799999998875432 2333 3459999999999999999999
Q ss_pred HHHhcCCeEEEEc
Q psy6918 86 LAKHFQCYKLTLD 98 (123)
Q Consensus 86 ~~~~~~~~~i~~~ 98 (123)
+|++.|.+.+.+.
T Consensus 195 ~ak~~gl~y~YLG 207 (240)
T PRK01305 195 LAKRLGLPYVYLG 207 (240)
T ss_pred HHHHcCCCeEeee
Confidence 9999999988774
No 97
>COG3882 FkbH Predicted enzyme involved in methoxymalonyl-ACP biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=96.86 E-value=0.0043 Score=44.10 Aligned_cols=87 Identities=16% Similarity=0.204 Sum_probs=65.5
Q ss_pred cCCCeEEEEEEcC---CCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 22 ASQDYLVTVIEDT---RTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 22 ~~~~~~~~~~~~~---~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
.++...++.+.-. -++-+||++.+.... ....|..+.++-..-|+++-++||..+++.|...|+..+...
T Consensus 454 ~~~~~li~sv~l~DKfgDnGiigvviv~kk~-------~~w~IDt~lmSCRVlgRkvE~~l~~~~~e~A~~~gi~tir~~ 526 (574)
T COG3882 454 EDPNFLIFSVSLKDKFGDNGIIGVVIVEKKE-------SEWFIDTFLMSCRVLGRKVEQRLMNSLEEQALSEGINTIRGY 526 (574)
T ss_pred hCCCeEEEEEEeccccccCceEEEEEEEecC-------CeEEhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceeeeE
Confidence 4455555544321 156799998887543 367787888888888999999999999999999999888664
Q ss_pred c-----CCCchhHHHhcCcEEc
Q psy6918 99 C-----ADHMIPFYETFGFQKK 115 (123)
Q Consensus 99 ~-----~~~~~~~y~~~Gf~~~ 115 (123)
- |.+...||+++||...
T Consensus 527 Y~pt~kN~pv~~FyE~mgf~l~ 548 (574)
T COG3882 527 YIPTEKNAPVSDFYERMGFKLK 548 (574)
T ss_pred ecccccCCcHHHHHHHhccccc
Confidence 3 4455779999999954
No 98
>KOG4601|consensus
Probab=96.79 E-value=0.013 Score=37.67 Aligned_cols=66 Identities=17% Similarity=0.149 Sum_probs=43.2
Q ss_pred hhcCCCeEEEEEEcC-CC--CeEEEEEEEEEeeecccc------cceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 20 MKASQDYLVTVIEDT-RT--KQVIGTGSLILEQKFIHE------CALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 20 ~~~~~~~~~~~~~~~-~~--~~ivG~~~~~~~~~~~~~------~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
+.+.+...++++.++ ++ +.|.|++-+...+-+..+ .+....|-+++|+++.|++|.|.+|++++.+
T Consensus 61 l~~sd~~vvy~~kd~~a~~ks~l~GllKVG~KkLfl~D~~~~~ye~e~lcILDFyVheS~QR~G~G~~lfdyMl~ 135 (264)
T KOG4601|consen 61 LVDSDEEVVYYIKDEKANGKSILKGLLKVGYKKLFLTDNEQNQYEEEALCILDFYVHESEQRSGNGFKLFDYMLK 135 (264)
T ss_pred hhcCcceEEEEEeecccCcchheeeeehccceeEEEeccHhhhhccCCceEEEEEeehhhhhcCchHHHHHHHHH
Confidence 333444444444432 22 478888776644333222 2234578899999999999999999999984
No 99
>cd04264 DUF619-NAGS DUF619 domain of various N-acetylglutamate Synthases of the fungal arginine-biosynthetic pathway and urea cycle found in humans and fish. DUF619-NAGS: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=96.72 E-value=0.022 Score=32.04 Aligned_cols=55 Identities=13% Similarity=0.025 Sum_probs=41.0
Q ss_pred CeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 37 KQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 37 ~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
+...|++.+..+++ .....++..++|.|+.||+|+|..+++.+.+. .+.+.+.+.
T Consensus 17 e~y~~~aIvt~~~~----~~~~~yLdKfaV~~~~~g~gvad~vf~~i~~d-----~~~L~Wrsr 71 (99)
T cd04264 17 EGYNAAAIVTYEGV----NNGVPYLDKFAVSSSAQGEGTSDALWRRLRRD-----FPKLFWRSR 71 (99)
T ss_pred CCceEEEEEeccCC----CCCceEEEEEEEchhhhhcChHHHHHHHHHhh-----CCceEEEeC
Confidence 56777777764321 23467999999999999999999999988854 355666553
No 100
>PF01853 MOZ_SAS: MOZ/SAS family; InterPro: IPR002717 Moz is a monocytic leukemia Zn_finger protein and the SAS protein from Saccharomyces cerevisiae (Baker's yeast) is involved in silencing the Hmr locus. These proteins were reported to be homologous to acetyltransferases [] but this similarity is not supported by standard sequence analysis.; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3TO6_A 1MJA_A 1MJ9_A 3TO7_A 3TO9_A 1MJB_A 1FY7_A 2OZU_A 2RC4_A 2OU2_A ....
Probab=96.71 E-value=0.015 Score=36.38 Aligned_cols=49 Identities=18% Similarity=0.206 Sum_probs=34.8
Q ss_pred eEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCC
Q psy6918 38 QVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 38 ~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
.++|+.+=... .. ....+.=|.|.|.||++|+|+-|++..=+.++..+.
T Consensus 66 h~vGyFSKEk~---s~---~~~NLsCIl~lP~yQrkGyG~~LI~fSY~LSr~e~~ 114 (188)
T PF01853_consen 66 HIVGYFSKEKE---SW---DNNNLSCILTLPPYQRKGYGRFLIDFSYELSRREGK 114 (188)
T ss_dssp EEEEEEEEESS----T---T-EEESEEEE-GGGTTSSHHHHHHHHHHHHHHHTTS
T ss_pred eeEEEEEEEec---cc---CCeeEeehhhcchhhhcchhhhhhhhHHHHhhccCc
Confidence 57787764321 11 123566689999999999999999998888887763
No 101
>PF13444 Acetyltransf_5: Acetyltransferase (GNAT) domain
Probab=96.43 E-value=0.017 Score=32.50 Aligned_cols=43 Identities=28% Similarity=0.270 Sum_probs=31.7
Q ss_pred eEEEEEEEEEeeecc-----------------cccceeEEEEEEEecccccCCchHHHHH
Q psy6918 38 QVIGTGSLILEQKFI-----------------HECALKGKIEEVVVDDTYRGKELGKLLI 80 (123)
Q Consensus 38 ~ivG~~~~~~~~~~~-----------------~~~~~~~~i~~~~v~p~~rg~G~g~~Ll 80 (123)
++||++++....... ......+.++.++|+|+||+......|.
T Consensus 41 ~~VGt~Rl~~~~~~~~~~~~~~~~~f~l~~~~~~~~~~~EisRl~V~~~~R~~~~~~~L~ 100 (101)
T PF13444_consen 41 EVVGTVRLILPSPAGPLEGFYSESEFDLDPLLPLPRRVAEISRLCVHPEYRRRKVLLLLW 100 (101)
T ss_pred CEEEEEEeeccccccccccCCchhhcCcchhhccCCcEEEeehheECHhHCCChHHHHHh
Confidence 699999998654311 1123667899999999999988776664
No 102
>cd04265 DUF619-NAGS-U DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea (U) cycle of humans and fish. This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=96.32 E-value=0.049 Score=30.67 Aligned_cols=39 Identities=18% Similarity=0.126 Sum_probs=31.6
Q ss_pred eeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 57 LKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 57 ~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
...++..++|.|+.||+|+|..+++++.+. .+.+.+.+.
T Consensus 33 ~~~yLdKfaV~~~~~g~gv~d~vf~~i~~d-----~~~L~Wrsr 71 (99)
T cd04265 33 GVPYLDKFAVSSSAQGEGTGEALWRRLRRD-----FPKLFWRSR 71 (99)
T ss_pred CceEEEEEEEchhhhhcChHHHHHHHHHhh-----CCceEEEeC
Confidence 467999999999999999999999988855 334666553
No 103
>COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only]
Probab=96.30 E-value=0.0038 Score=43.93 Aligned_cols=58 Identities=24% Similarity=0.407 Sum_probs=44.6
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc-----c---CCCchhHHHhcCcEEc
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD-----C---ADHMIPFYETFGFQKK 115 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~-----~---~~~~~~~y~~~Gf~~~ 115 (123)
.+.|..+.|+|+||+-|+|...+..+.+|..+..+...... + -..-.+||++.||+-.
T Consensus 241 aariarvvvhpdyr~dglg~~sv~~a~ewI~eRriPEmr~rkHlvetiaqmarynpffe~~gfkyl 306 (593)
T COG2401 241 AARIARVVVHPDYRADGLGQLSVIAALEWIIERRIPEMRPRKHLVETIAQMARYNPFFEKVGFKYL 306 (593)
T ss_pred hhheeEEEeccccccCccchhHHHHHHHHHHHhhChhhhhhhhHHHHHHHHHhcCchhhhhceeee
Confidence 34799999999999999999999999999988766543321 0 1222569999999864
No 104
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=96.18 E-value=0.033 Score=37.15 Aligned_cols=50 Identities=20% Similarity=0.188 Sum_probs=37.3
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
+-.+||+.+=.... . ....+.=|.|.|.||++|+|+-|++..=+.++.+|
T Consensus 139 g~h~vGYFSKEK~s---~---~~nNLaCIltLPpyQrkGyG~~LI~fSYeLSr~Eg 188 (290)
T PLN03238 139 GSHIVGYFSKEKVS---A---EDYNLACILTLPPYQRKGYGKFLISFAYELSKREG 188 (290)
T ss_pred CcEEEEEeceeccc---c---CCCcEEEEEecChhhhccHhHhHHHHHhHHhhccC
Confidence 45788887654221 1 12346668999999999999999999988888776
No 105
>PF11124 Pho86: Inorganic phosphate transporter Pho86; InterPro: IPR024297 Pho86p is an ER protein which is produced in response to phosphate starvation. It is essential for growth when phosphate levels are limiting []. Pho86p is also involved in the regulation of Pho84p, a high-affinity phosphate transporter, which is localised to the endoplasmic reticulum (ER) in low phosphate medium. When the level of phosphate increases Pho84p is transported to the vacuole. Pho86p is required for packaging of Pho84p in to COPII vesicles [].
Probab=96.16 E-value=0.05 Score=36.55 Aligned_cols=88 Identities=19% Similarity=0.241 Sum_probs=67.5
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc---------CCeE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF---------QCYK 94 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~---------~~~~ 94 (123)
.+..++++. +.||+.+++.+....+....-...|..+.|..-|..-|+=..|++|++-++++. +. .
T Consensus 169 ~NT~IIvYR----etPIAiisl~~~~~~St~~~~vv~ItgigvRkVy~Ksgi~e~LidWA~~Rtr~l~~ey~k~k~~~-s 243 (304)
T PF11124_consen 169 KNTHIIVYR----ETPIAIISLVPNKDQSTKENFVVKITGIGVRKVYVKSGIDEDLIDWAMLRTRQLYKEYLKGKKGC-S 243 (304)
T ss_pred CcceEEEEc----CCceEEEEeccccccCCCceEEEEEeeeEEEEEEeecChHHHHHHHHHHHHHHHHHHhccccccc-e
Confidence 444455554 599999999987777777777889999999999999999999999997766551 22 3
Q ss_pred EEE--cc---CCCchhHHHhcCcEEcc
Q psy6918 95 LTL--DC---ADHMIPFYETFGFQKKN 116 (123)
Q Consensus 95 i~~--~~---~~~~~~~y~~~Gf~~~~ 116 (123)
+.+ ++ ++...+.+++.||....
T Consensus 244 i~ll~d~YSFD~~~~k~L~~~gF~~i~ 270 (304)
T PF11124_consen 244 IKLLVDVYSFDKDMKKTLKKKGFKKIS 270 (304)
T ss_pred EEEEEEeeeccHHHHHHHHHCCCeeee
Confidence 333 22 56678899999999864
No 106
>KOG2036|consensus
Probab=95.77 E-value=0.0078 Score=44.77 Aligned_cols=31 Identities=16% Similarity=0.305 Sum_probs=27.9
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHH
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAK 88 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~ 88 (123)
-+.|..++|+|+|++.|||++.++.+.++-.
T Consensus 614 GaRIVRIAvhP~y~~MGYGsrAvqLL~~y~e 644 (1011)
T KOG2036|consen 614 GARIVRIAVHPEYQKMGYGSRAVQLLTDYFE 644 (1011)
T ss_pred CceEEEEEeccchhccCccHHHHHHHHHHHh
Confidence 3489999999999999999999999998854
No 107
>COG3138 AstA Arginine/ornithine N-succinyltransferase beta subunit [Amino acid transport and metabolism]
Probab=95.61 E-value=0.066 Score=35.75 Aligned_cols=73 Identities=19% Similarity=0.333 Sum_probs=45.2
Q ss_pred CCCHHHHHHHHHhh--------hcCCCeEEEEEEcCCCCeEEEEEEEEEe----eecc----------------------
Q psy6918 7 TSNVFHFAERFHRM--------KASQDYLVTVIEDTRTKQVIGTGSLILE----QKFI---------------------- 52 (123)
Q Consensus 7 ~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~ivG~~~~~~~----~~~~---------------------- 52 (123)
|.+.+.+..++... ...+..++++.+|.++|+++|++.+.-. .+++
T Consensus 30 P~de~~L~~Ri~~se~sf~~~~~~ge~~Y~fVLEDsetG~VvG~saI~a~vGl~~PfYsyRv~tlvhaS~~L~v~~~i~~ 109 (336)
T COG3138 30 PADEATLRARIERSEKSFQGELPPGEAGYLFVLEDSETGTVVGISAIEAAVGLNDPFYSYRVGTLVHASPELNVYNEIPT 109 (336)
T ss_pred CCCHHHHHHHHHHHHHHHhcccCCCCccEEEEEEecCCceEEeEEEEEEeeccCCccceeeeeeeeecCcccccccccee
Confidence 44555555554321 2235677888888889999999876521 1111
Q ss_pred ----cccceeEEEEEEEecccccCCchHHHH
Q psy6918 53 ----HECALKGKIEEVVVDDTYRGKELGKLL 79 (123)
Q Consensus 53 ----~~~~~~~~i~~~~v~p~~rg~G~g~~L 79 (123)
++......+..++++|++|.-+.|+-|
T Consensus 110 L~L~Nd~TG~SEl~sLFl~pd~Rkg~nG~Ll 140 (336)
T COG3138 110 LFLSNDLTGNSELCTLFLDPDWRKGGNGRLL 140 (336)
T ss_pred EEEeccCcCchhhhheeecHHHhcccchhhh
Confidence 112233467779999999876666544
No 108
>KOG2779|consensus
Probab=95.22 E-value=0.24 Score=34.16 Aligned_cols=59 Identities=17% Similarity=0.167 Sum_probs=46.3
Q ss_pred CCeEEEEEEEEEeee-cccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeE
Q psy6918 36 TKQVIGTGSLILEQK-FIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYK 94 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~-~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~ 94 (123)
+.++|||++..+..- ..+.......|..++|+.+.|.++++=-|++.+.+++.-.|+..
T Consensus 144 s~kLVaFIsaiP~~irvrdk~vk~veINFLCVHKkLRSKRlaPvLIrEITRRvnl~gIfq 203 (421)
T KOG2779|consen 144 SKKLVAFISAIPATIRVRDKVVKMVEINFLCVHKKLRSKRLAPVLIREITRRVNLEGIFQ 203 (421)
T ss_pred CCceEEEEeccccEEEEccceeeeeeEEEEEEehhhhccccccHHHHHHHHHhhhhhhhh
Confidence 689999999775321 11223345789999999999999999999999999988777643
No 109
>PLN03239 histone acetyltransferase; Provisional
Probab=95.21 E-value=0.086 Score=36.20 Aligned_cols=51 Identities=18% Similarity=0.094 Sum_probs=36.8
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
+-.+||+.+=.... . ....+.=|.|.|.||++|+|+-|++..=+.++..|.
T Consensus 197 g~h~vGYFSKEK~s---~---~~~NLaCIltLPpyQrkGyG~lLI~fSYeLSr~Eg~ 247 (351)
T PLN03239 197 GFHPVGYYSKEKYS---D---VGYNLACILTFPAHQRKGYGRFLIAFSYELSKKEEK 247 (351)
T ss_pred ceEEEEEeeecccC---C---CCCceEEEEecChhhhcchhhhhHhhhhHhhhhcCC
Confidence 34678876653211 1 122566689999999999999999998888877763
No 110
>PF02799 NMT_C: Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain; InterPro: IPR022677 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the C-terminal region.; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 2WUU_A 1IYL_B 1NMT_B 1IYK_A ....
Probab=95.12 E-value=0.47 Score=29.99 Aligned_cols=109 Identities=15% Similarity=0.009 Sum_probs=66.6
Q ss_pred CCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccce----eEEEEEEEecccccCCchHHHHHHH
Q psy6918 7 TSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECAL----KGKIEEVVVDDTYRGKELGKLLIAV 82 (123)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~----~~~i~~~~v~p~~rg~G~g~~Ll~~ 82 (123)
..+.++++-++.. .++-.+.+|.++. +|+|-.++++..-+...-.... .+.-....+... -=-.+|+..
T Consensus 58 ~fs~eev~Hw~lp--~~~Vv~syVve~~-~~~ITDf~SFY~Lpstvi~~~k~~~l~aAY~fY~~~~~----~~l~~Lm~D 130 (190)
T PF02799_consen 58 VFSEEEVKHWFLP--RKNVVYSYVVEDP-DGKITDFFSFYSLPSTVIGNPKHKTLKAAYSFYYVATS----TRLKELMND 130 (190)
T ss_dssp E--HHHHHHHHS---BTTTEEEEEEEET-TSEEEEEEEEEEEEEEESSSSSSSEEEEEEEEEEEESS----SHHHHHHHH
T ss_pred ccCHHHHHhhccc--CCCeEEEEEEecC-CCceeeEEEEeecceeecCCCCccceeeeeeeeeeecC----CCHHHHHHH
Confidence 3455666555532 3445566677764 4799999998765432111111 122222333333 224678888
Q ss_pred HHHHHHhcCCeEEEEccCCCchhHHHhcCcEEccceeEee
Q psy6918 83 LVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 83 ~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
++-.|++.|+.....-.--+|..|.+.+.|-+-.-.+++|
T Consensus 131 aLi~Ak~~gfDVFNaLd~mdN~~fL~~lKFg~GdG~L~YY 170 (190)
T PF02799_consen 131 ALILAKNEGFDVFNALDLMDNSSFLEDLKFGPGDGNLNYY 170 (190)
T ss_dssp HHHHHHHTTESEEEEESTTTGGGTTTTTT-EEEEEEEEEE
T ss_pred HHHHHHHcCCCEEehhhhccchhhHhhCCccCCCCCeEEE
Confidence 8999999999877776677888999999999875555555
No 111
>PRK14852 hypothetical protein; Provisional
Probab=95.08 E-value=0.18 Score=39.36 Aligned_cols=92 Identities=12% Similarity=0.118 Sum_probs=69.8
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeec----------------ccccceeEEEEEEEecccccCCchHHHHHHHHHHHH
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKF----------------IHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLA 87 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~----------------~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~ 87 (123)
+...++++.. .++++|..++.+.... -..+...+.++.++++|+.|..-+--.+++.+..++
T Consensus 73 p~t~~~i~k~--~~~~l~T~t~~~ds~~~Gl~~D~lf~~eLd~lr~~Gr~v~EvtrLa~d~~~~~~~l~~~l~~~~~~y~ 150 (989)
T PRK14852 73 PATSVFIFKS--YHDVLCTLTHIPDSGLFGLPMDTLYKPEVDALRAQGRNVVEVGALATQYSRRWTNLMVFLAKAMFQYS 150 (989)
T ss_pred CcceEEEecc--CCcEEEEEEEecCCcccCcCHHHHHHHHHHHHHHcCCeEEeeehheechhhcccchhHHHHHHHHHHH
Confidence 4454566655 4778888877754220 022456788999999998888777778888888888
Q ss_pred HhcCCeEEEEccCCCchhHHH-hcCcEEccc
Q psy6918 88 KHFQCYKLTLDCADHMIPFYE-TFGFQKKNN 117 (123)
Q Consensus 88 ~~~~~~~i~~~~~~~~~~~y~-~~Gf~~~~~ 117 (123)
...++..+.+.+||.-..||+ =+||+..++
T Consensus 151 ~~~~~dd~~i~VnPkH~~FY~r~l~f~~ig~ 181 (989)
T PRK14852 151 MMSEVDDILVTVNPKHVKFYTDIFLFKPFGE 181 (989)
T ss_pred HHcCCCeEEEEECcchHHHHHHHhCCccccc
Confidence 888999999999999999997 678888643
No 112
>PTZ00064 histone acetyltransferase; Provisional
Probab=95.02 E-value=0.086 Score=37.86 Aligned_cols=50 Identities=22% Similarity=0.185 Sum_probs=36.9
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
.-.+||+.+=.... . ....+.=|.|.|.||++|+|+-|++..=+.++..|
T Consensus 368 G~HiVGYFSKEK~S---~---~~nNLACILtLPpyQRKGYGklLIdfSYeLSrrEg 417 (552)
T PTZ00064 368 GCHIVGYFSKEKVS---L---LHYNLACILTLPCYQRKGYGKLLVDLSYKLSLKEG 417 (552)
T ss_pred CcEEEEEecccccC---c---ccCceEEEEecchhhhcchhhhhhhhhhhhhhhcC
Confidence 34788887654211 1 12246668999999999999999999888887776
No 113
>KOG2779|consensus
Probab=94.89 E-value=0.57 Score=32.44 Aligned_cols=110 Identities=13% Similarity=0.026 Sum_probs=64.7
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEE----EEecccccCCchHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEE----VVVDDTYRGKELGKLLIA 81 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~----~~v~p~~rg~G~g~~Ll~ 81 (123)
+..+.++++.+|..- ++-.+.+|++.. +|+|-+++++..-+..+-....+-.|.. ..+..+ ---.+|+.
T Consensus 289 ~~f~~eev~Hwf~p~--e~VV~syVvesp-~g~ITDF~SFy~lpsTv~~~~~~ktl~aaYlyY~v~~~----t~~~~lvn 361 (421)
T KOG2779|consen 289 PVFDEEEVEHWFLPR--ENVVYSYVVESP-NGKITDFCSFYSLPSTVMGNPKYKTLQAAYLYYNVATS----TPLLQLVN 361 (421)
T ss_pred cccCHHHhHhhcccc--cceEEEEEEECC-CCcccceeeEEeccccccCCCCcceeeeeeEEEeccCC----ccHHHHHH
Confidence 345555665555422 233445566654 7889999998754431111111111211 222222 22456777
Q ss_pred HHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEccceeEee
Q psy6918 82 VLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 82 ~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
.++-.+++.|+.......--+|..|+++++|-+-.-.+++|
T Consensus 362 Dalilak~~gfDVFNAld~meN~~fl~~LkFg~GdG~l~YY 402 (421)
T KOG2779|consen 362 DALILAKQKGFDVFNALDLMENESFLKDLKFGPGDGNLQYY 402 (421)
T ss_pred HHHHHHHhcCCceeehhhhhhhhhHHHhcCcCcCCCceeEE
Confidence 77778888898766555567888899999999976556655
No 114
>TIGR03827 GNAT_ablB putative beta-lysine N-acetyltransferase. Members of this protein family are GNAT family acetyltransferases, based on a seed alignment in which every member is associated with a lysine 2,3-aminomutase family protein, usually as the adjacent gene. This family includes AblB, the enzyme beta-lysine acetyltransferase that completes the two-step synthesis of the osmolyte (compatible solute) N-epsilon-acetyl-beta-lysine; all members of the family may have this function. Note that N-epsilon-acetyl-beta-lysine has been observed only in methanogenic archaea (e.g. Methanosarcina) but that this model, paired with TIGR03820, suggests a much broader distribution.
Probab=94.46 E-value=0.085 Score=34.91 Aligned_cols=42 Identities=24% Similarity=0.239 Sum_probs=38.9
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+-...|++.+.+.|+++|+.+|.+.+...+...|.+.||...
T Consensus 21 ~~~~~~~~~~~~~a~~~~~~ki~~~~~~~~~~~~~~~g~~~e 62 (266)
T TIGR03827 21 NDVEALIPDLDALAKKEGYTKIIAKVPGSDKPLFEERGYLEE 62 (266)
T ss_pred ccHHHHHHHHHHHHHHcCCcEEEEEccHHHHHHHHHCCCeEE
Confidence 347899999999999999999999999999999999999997
No 115
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=94.24 E-value=0.09 Score=37.36 Aligned_cols=50 Identities=22% Similarity=0.185 Sum_probs=36.8
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
.-.+||+.+=.... . ....+.=|.|.|.||++|+|+-|++..=+.++..|
T Consensus 290 g~h~vGyFSKEk~s---~---~~~NLaCIltlP~yQrkGyG~~LI~~SYeLSr~eg 339 (450)
T PLN00104 290 GCHMVGYFSKEKHS---E---EDYNLACILTLPPYQRKGYGKFLIAFSYELSKREG 339 (450)
T ss_pred CcEEEEEecccccC---c---CCCceEEEEecchhhhcchhheehhheehhhhccC
Confidence 44888887654221 1 11256668999999999999999998888877766
No 116
>cd04266 DUF619-NAGS-FABP DUF619 domain of N-acetylglutamate Synthase of the fungal arginine-biosynthetic pathway. DUF619-NAGS-FABP: This family includes the DUF619 domain of N-acetylglutamate synthase (NAGS) of the fungal arginine-biosynthetic pathway (FABP). This NAGS (also known as arginine-requiring protein 2 or ARG2) consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. NAGS catalyzes the formation of NAG from acetylcoenzyme A and L-glutamate. The DUF619 domain, yet to be characterized, is predicted to function in NAGS association in fungi.
Probab=93.78 E-value=0.71 Score=26.46 Aligned_cols=59 Identities=12% Similarity=0.081 Sum_probs=39.2
Q ss_pred CeEEEEEEEEEeeecccccceeEEEEEEEecccccC-CchHHHHHHHHHHHHHhcCCeE-EEEccC
Q psy6918 37 KQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRG-KELGKLLIAVLVKLAKHFQCYK-LTLDCA 100 (123)
Q Consensus 37 ~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg-~G~g~~Ll~~~~~~~~~~~~~~-i~~~~~ 100 (123)
+..-|.+.+..+.+....+....++.-++|.+..|| .|++.-+..++.+ .... +.+.+.
T Consensus 18 ~~y~~~AIvt~e~~~~~~~~~v~yLdKFav~~~~~gl~gv~D~vf~~m~~-----~fp~~L~Wrsr 78 (108)
T cd04266 18 GDYEGAAILTWEGPDGSTPEKIAYLDKFAVLPKAQGSDGIADILFNAMLD-----GFPNELIWRSR 78 (108)
T ss_pred CCCcEEEEEecCCCCccCCCCceEEEEEEEccccccccchHHHHHHHHHH-----cCCCceEEEeC
Confidence 444555555533221112345689999999999997 8999999998886 3443 666553
No 117
>KOG2747|consensus
Probab=93.38 E-value=0.22 Score=34.79 Aligned_cols=32 Identities=25% Similarity=0.363 Sum_probs=26.7
Q ss_pred EEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 60 KIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 60 ~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
.+.=|.|.|.||++|+|+-|++.-=+..+..|
T Consensus 262 NlaCILtLPpyQRkGYGklLIdFSYeLSr~E~ 293 (396)
T KOG2747|consen 262 NLACILTLPPYQRKGYGKLLIDFSYELSRREG 293 (396)
T ss_pred ceeeeeecChhhhcccchhhhhhhhhhhcccC
Confidence 45567899999999999999998877776655
No 118
>PF04768 DUF619: Protein of unknown function (DUF619); InterPro: IPR006855 This region of unknown function is found at the C terminus of Neurospora crassa acetylglutamate synthase (2.7.2.8 from EC). It is also found C-terminal to the amino acid kinase region in some fungal acetylglutamate kinase enzymes (IPR001048 from INTERPRO). These enzymes play a role in arginine biosynthesis.; PDB: 3S6K_A 4AB7_F 3ZZF_B 3ZZI_D 3ZZH_A 3ZZG_A 3S6G_Y 3S6H_A 3S7Y_A.
Probab=91.84 E-value=2 Score=26.70 Aligned_cols=72 Identities=14% Similarity=0.134 Sum_probs=44.8
Q ss_pred CeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc--CCCchhHH--HhcCc
Q psy6918 37 KQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC--ADHMIPFY--ETFGF 112 (123)
Q Consensus 37 ~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~--~~~~~~~y--~~~Gf 112 (123)
+..-|.+.+.++.+ .......++.-++|.|..||.|++..+..++.+. .+.+.+.+ +.++.++| +.-|+
T Consensus 69 ~~y~~~AIVt~e~~--~~~~~v~yLdKFav~~~~~g~gv~D~vf~~i~~d-----~p~L~Wrsr~~n~~~~Wyf~rs~G~ 141 (170)
T PF04768_consen 69 EDYEGAAIVTPEGP--DSNGPVPYLDKFAVSKSAQGSGVADNVFNAIRKD-----FPKLFWRSREDNPNNKWYFERSDGS 141 (170)
T ss_dssp TTSSEEEEEEEE-S--CTCTSEEEEEEEEE-HHHHHTTHHHHHHHHHHHH------SSEEEEEETT-TTHHHHHHH-SEE
T ss_pred CCceEEEEEEecCC--CCCCCCeEEEEEEecchhhhcCHHHHHHHHHHHh-----ccceEEEecCCCCcccEEEEeeEEE
Confidence 45556666654332 2334678999999999999999999999999643 22355544 45556655 34555
Q ss_pred EEc
Q psy6918 113 QKK 115 (123)
Q Consensus 113 ~~~ 115 (123)
...
T Consensus 142 ~~~ 144 (170)
T PF04768_consen 142 FKR 144 (170)
T ss_dssp EEE
T ss_pred EEC
Confidence 553
No 119
>COG2898 Uncharacterized conserved protein [Function unknown]
Probab=91.57 E-value=2.5 Score=31.11 Aligned_cols=66 Identities=12% Similarity=0.138 Sum_probs=49.8
Q ss_pred eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 26 YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 26 ~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
..+.+.+. +|+|+|++.+.+... .....++-+--+|+.- +|+-..|+..++.+++++|+.++.+..
T Consensus 393 ~~va~~~~--~g~VvaFa~l~~~~~-----~~~~SlDlMR~sp~ap-~g~mdfLf~~li~~aKe~G~~~fsLgm 458 (538)
T COG2898 393 FPVAAVDN--EGEVVAFANLMPTGG-----KEGYSLDLMRRSPDAP-NGTMDFLFSELILWAKEEGYQRFSLGM 458 (538)
T ss_pred ceeeEEcC--CCCeEEEEeecccCC-----cceeEEEeeecCCCCC-chHHHHHHHHHHHHHHHcCCeEEecCC
Confidence 33444444 788999999985322 3345677777777754 599999999999999999999998853
No 120
>PF09390 DUF1999: Protein of unknown function (DUF1999); InterPro: IPR018987 This family contains a putative Fe-S binding reductase (Q72J89 from SWISSPROT) whose structure adopts an alpha and beta fold. ; PDB: 2D4O_A 2D4P_A.
Probab=91.32 E-value=2.1 Score=25.85 Aligned_cols=84 Identities=15% Similarity=0.125 Sum_probs=51.1
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeeccccc-ceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhH
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHEC-ALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPF 106 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~-~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~ 106 (123)
.+++.+. ++++.|++.... .|.+ .+...+..+.+.|. +......-||.+++.-+-+.++..+.+...+....-
T Consensus 57 SFvA~~e-~~~~~GfvLAQa----VWQGdrptVlV~ri~~~~~-~~~~~~~GLLrAvvKSAYDa~VYEv~l~l~p~l~~A 130 (161)
T PF09390_consen 57 SFVAEDE-GGELQGFVLAQA----VWQGDRPTVLVRRILLAPG-EPEEVYEGLLRAVVKSAYDAGVYEVHLHLDPELEAA 130 (161)
T ss_dssp -EEEE-E-TTEEEEEEEEEE----EE-SSSEEEEEEEE---EE-SSHHHHHHHHHHHHHHHHHTT-SEEEE---THHHHH
T ss_pred cEEEEcc-CCceeeeeehhH----HhcCCCceEEEEEeecCCC-CcHHHHHHHHHHHHHhhhccceEEEEeeCCHHHHHH
Confidence 4456521 689999988763 2333 34566666666555 667888999999999999999999999887766666
Q ss_pred HHhcCcEEccc
Q psy6918 107 YETFGFQKKNN 117 (123)
Q Consensus 107 y~~~Gf~~~~~ 117 (123)
...-||...+.
T Consensus 131 ~~a~~~~~~~~ 141 (161)
T PF09390_consen 131 ARAEGFRLGGQ 141 (161)
T ss_dssp HHHTT----S-
T ss_pred HhhcccccCCe
Confidence 66677765543
No 121
>PRK02983 lysS lysyl-tRNA synthetase; Provisional
Probab=90.94 E-value=3.7 Score=33.03 Aligned_cols=63 Identities=14% Similarity=0.085 Sum_probs=48.0
Q ss_pred EEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 29 TVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 29 ~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
+.+.+. +|+|+|++++.+.. .....++-+--+|+. =+|+-..|+..++.+++++|+..+.+..
T Consensus 423 ~~a~d~-~G~i~af~s~~p~~------~~g~slDLMRr~pda-pnGvmE~L~~~l~~~~k~~G~~~~sLg~ 485 (1094)
T PRK02983 423 VEAHDA-DGQVVALLSFVPWG------RRGLSLDLMRRSPDA-PNGVIELMVAELALEAESLGITRISLNF 485 (1094)
T ss_pred EEEECC-CCeEEEEEEEeeeC------CCCEEEEecccCCCC-CCCHHHHHHHHHHHHHHHcCCCEEEech
Confidence 344442 69999999998532 123566666667774 6799999999999999999999998854
No 122
>PF12261 T_hemolysin: Thermostable hemolysin; InterPro: IPR022050 This family of proteins is found in bacteria. Proteins in this family are typically between 200 and 228 amino acids in length. T_hemolysin is a pore-forming toxin of bacteria, able to lyse erythrocytes from a number of mammalian species.
Probab=90.70 E-value=0.57 Score=29.35 Aligned_cols=76 Identities=11% Similarity=0.194 Sum_probs=57.7
Q ss_pred CCeEEEEEEEEEeee--cccc--------------------cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCe
Q psy6918 36 TKQVIGTGSLILEQK--FIHE--------------------CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCY 93 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~--~~~~--------------------~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~ 93 (123)
+|++++.+++..-.. ++-. -.....|+.++.. +.|.+..|+..+.......|++
T Consensus 43 ~g~l~aa~G~r~A~~~~LFlEqYLd~piE~~l~~~~g~~v~R~~IvEvGnLAs~----~~g~~~~l~~~l~~~L~~~g~~ 118 (179)
T PF12261_consen 43 DGELVAAAGLRFASQEPLFLEQYLDQPIEQLLSRRFGRPVSRSQIVEVGNLASF----SPGAARLLFAALAQLLAQQGFE 118 (179)
T ss_pred CCCEEEEEeecccCCCCcchhhhcCCcHHHHHHhhcCCCcchhheeEeechhhc----CcccHHHHHHHHHHHHHHCCCC
Confidence 799999999885431 1100 1122345555443 5799999999999999999999
Q ss_pred EEEEccCCCchhHHHhcCcEEc
Q psy6918 94 KLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 94 ~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
.+..+.++...+++.|+|....
T Consensus 119 w~vfTaT~~lr~~~~rlgl~~~ 140 (179)
T PF12261_consen 119 WVVFTATRQLRNLFRRLGLPPT 140 (179)
T ss_pred EEEEeCCHHHHHHHHHcCCCce
Confidence 9999999999999999998765
No 123
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=89.41 E-value=0.28 Score=33.72 Aligned_cols=45 Identities=22% Similarity=0.259 Sum_probs=29.4
Q ss_pred eEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHH
Q psy6918 38 QVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAK 88 (123)
Q Consensus 38 ~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~ 88 (123)
.+||+.+=..... ....+.=+.+.|.||++|+|.-|++.-=...+
T Consensus 248 h~vGyFSKEK~S~------~~yNLaCILtLP~yQRrGYG~lLIdFSY~Ls~ 292 (395)
T COG5027 248 HLVGYFSKEKESE------QDYNLACILTLPPYQRRGYGKLLIDFSYLLSQ 292 (395)
T ss_pred eeeeeechhhccc------ccCceEEEEecChhHhcccceEeeeeeeeccc
Confidence 4778766432211 12356667899999999999988875443333
No 124
>PF02474 NodA: Nodulation protein A (NodA); InterPro: IPR003484 Rhizobial nodulation (Nod) factors are signalling molecules secreted by root-nodulating rhizobia in response to flavanoids excreted by the host plant. They induce various symbiotic responses on the roots of the leguminous host plant at low concentrations, and are required for successful infection. Rhizobial Nod factors are lipo-chitooligosaccharides carrying various substituents which are important determinants of host specificity []. NodA is an N-acyl transferase which specifies the transfer of an acyl chain to the oligosaccharide backbone of Nod factor. Allelic variation of the nodA gene can contribute to the determination of host range [].; GO: 0016746 transferase activity, transferring acyl groups
Probab=88.37 E-value=1.1 Score=27.94 Aligned_cols=53 Identities=8% Similarity=-0.019 Sum_probs=41.3
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcC
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFG 111 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~G 111 (123)
.+.+.-+.|.|+..|.|++..+ ..+--..++.++....-++.....+.+++++
T Consensus 85 VaElGLygVRpDLEGlGi~hs~-r~m~PvLq~LgVPF~FGtVR~al~~Hv~R~~ 137 (196)
T PF02474_consen 85 VAELGLYGVRPDLEGLGISHSM-RVMYPVLQELGVPFGFGTVRHALRNHVERLC 137 (196)
T ss_pred EEEEEEEEeeccccccccchhh-hhhhhHHHhcCCCeecccchHHHHHHHHHHh
Confidence 4678889999999999999876 5677777888888777777666666666654
No 125
>PF09924 DUF2156: Uncharacterized conserved protein (DUF2156); InterPro: IPR024320 This domain of unknown function is found in uncharacterised proteins and in Lysylphosphatidylglycerol synthetase, which catalyses the transfer of a lysyl group from L-lysyl-tRNA(Lys) to membrane-bound phosphatidylglycerol [].; PDB: 2HQY_A.
Probab=86.59 E-value=7.7 Score=26.12 Aligned_cols=67 Identities=10% Similarity=-0.139 Sum_probs=39.0
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
+....++... +|+|+|++.+.+... .....++-.--+|+ -=+|+-..|+..+.+.+++.|+..+.+.
T Consensus 179 ~~~~~~~~~~--dgki~af~~~~~~~~-----~~~~~~~~~k~~~~-a~~G~~e~l~~~~~~~~~~~g~~~lnLg 245 (299)
T PF09924_consen 179 GLRGFVARVA--DGKIVAFAIGSPLGG-----RDGWSIDFEKADPD-APKGIYEFLNVEFAEHLKAEGVEYLNLG 245 (299)
T ss_dssp T-EEEEEEE---TTEEEEEEEEEEEE------TTEEEEEEEEE-TT--STTHHHHHHHHHHHHS--TT--EEE--
T ss_pred CceEEEEEEC--CCcEEEEEEEEEccC-----CccEEEEEEecCCC-CCCcHHHHHHHHHHHhhhhCCceEEEcc
Confidence 3343444442 599999999985431 12233333333555 5679999999999999999999988853
No 126
>KOG3698|consensus
Probab=86.53 E-value=2.4 Score=31.57 Aligned_cols=50 Identities=18% Similarity=0.089 Sum_probs=39.1
Q ss_pred ecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCC---CchhHHHhcCcEEc
Q psy6918 66 VDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCAD---HMIPFYETFGFQKK 115 (123)
Q Consensus 66 v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~---~~~~~y~~~Gf~~~ 115 (123)
+..+.-.-|+.+++++-+..-.+.+|...-.+.+.. ..++||.++||...
T Consensus 824 ~~~~a~D~~~~k~m~~vll~tL~aNGsrGaf~~V~~dD~~~~~fys~lG~~d~ 876 (891)
T KOG3698|consen 824 FGMDASDAHPMKKMIQVLLVTLAANGSRGAFLTVAIDDIERQKFYSELGLTDL 876 (891)
T ss_pred cccccccchHHHHHHHHHHHHHHhcCCcceeEEechhHHHHHHHHHHhchHHH
Confidence 344445778999999999999999988777776643 34679999999876
No 127
>COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism]
Probab=85.75 E-value=8.3 Score=27.32 Aligned_cols=47 Identities=13% Similarity=0.145 Sum_probs=35.6
Q ss_pred CeEEEEEEEEEeeecccccceeEEEEEEEecccccC-CchHHHHHHHHHHH
Q psy6918 37 KQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRG-KELGKLLIAVLVKL 86 (123)
Q Consensus 37 ~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg-~G~g~~Ll~~~~~~ 86 (123)
|.-.|.+.+.... ...+...+++.++|.++.|| .|++..+..-+-+.
T Consensus 382 gdY~g~aIlTyeg---s~~~~vpYLDKfAVl~~aQGs~gisd~vfniM~e~ 429 (495)
T COG5630 382 GDYRGAAILTYEG---SGENNVPYLDKFAVLDDAQGSEGISDAVFNIMREE 429 (495)
T ss_pred ccceeeEEEEeec---cCCCCCcceeeeeccccccccchHHHHHHHHHHHh
Confidence 6777887776542 22235679999999999999 99999888776654
No 128
>COG5092 NMT1 N-myristoyl transferase [Lipid metabolism]
Probab=85.33 E-value=9.5 Score=26.37 Aligned_cols=60 Identities=13% Similarity=0.100 Sum_probs=45.6
Q ss_pred CCeEEEEEEEEEeeecccc-cceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEE
Q psy6918 36 TKQVIGTGSLILEQKFIHE-CALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~-~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
+.++||+++..+..-...+ ......+..++|+.+.|++.+.--|++.+.+.|...++.+-
T Consensus 142 t~klVaFIsa~p~~v~vRgK~~~~~evNFLCihk~lRsKRltPvLIkEiTRR~n~~~iw~a 202 (451)
T COG5092 142 TQKLVAFISAKPHLVSVRGKRSSVLEVNFLCIHKELRSKRLTPVLIKEITRRANVDGIWRA 202 (451)
T ss_pred cceeEEEEecceeEEEEcccccccceEEEEEEehhhhhCccchHHHHHHHHhhhhhhhHHH
Confidence 6799999987643211111 22356888999999999999999999999999887776543
No 129
>COG5653 Protein involved in cellulose biosynthesis (CelD) [Cell envelope biogenesis, outer membrane]
Probab=84.45 E-value=12 Score=26.60 Aligned_cols=85 Identities=9% Similarity=0.020 Sum_probs=63.4
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHH
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
..++.+-+...++.-..+....++...- .+.+++........ .+..-....++|++-+---|--|+-.+++
T Consensus 252 ~~~t~~fl~dL~~~~~~d~~~rl~gL~~--G~~lvAV~~~lr~~-------~t~h~~l~a~dpe~~~~SPG~~lf~d~i~ 322 (406)
T COG5653 252 AGWTRDFLRDLFTQRAEDGSGRLFGLHA--GGRLVAVHGLLRQG-------GTYHAWLGAIDPEFARASPGMLLFLDLIE 322 (406)
T ss_pred cchHHHHHHHHHhccCcCCceEEEEEee--CCEEEEEEeeeccC-------CEEEEEeeccCHHHhhcCchHHHHHHHHH
Confidence 3455666666666655566666666665 67899887776432 24445557789999999999999999999
Q ss_pred HHHhcCCeEEEEcc
Q psy6918 86 LAKHFQCYKLTLDC 99 (123)
Q Consensus 86 ~~~~~~~~~i~~~~ 99 (123)
++-.+|+....+.+
T Consensus 323 ~~~~~g~~~~Dfgv 336 (406)
T COG5653 323 WACGQGLARFDFGV 336 (406)
T ss_pred HHhcCCCeEEeecC
Confidence 99999998888866
No 130
>PHA02769 hypothetical protein; Provisional
Probab=84.33 E-value=1.9 Score=24.99 Aligned_cols=42 Identities=29% Similarity=0.256 Sum_probs=28.8
Q ss_pred chHHHHHHHHHHHH---HhcCCeEEEEcc-CCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLA---KHFQCYKLTLDC-ADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~---~~~~~~~i~~~~-~~~~~~~y~~~Gf~~~ 115 (123)
--|..|+..+...+ ++.|+..++.-. ...+..+|.|.||+..
T Consensus 92 apgd~lvnfl~~l~~k~~~dg~evlwtlgfpdhsnaly~kagfk~v 137 (154)
T PHA02769 92 APGDHLVNFLNDLAEKLKKDGFEVLWTLGFPDHSNALYKKAGFKLV 137 (154)
T ss_pred CChHHHHHHHHHHHHHHhcCCeEEEEEecCCCcchhHHhhhhhhHh
Confidence 34566666655544 456887666544 5667789999999886
No 131
>COG2935 Putative arginyl-tRNA:protein arginylyltransferase [Posttranslational modification, protein turnover, chaperones]
Probab=83.17 E-value=10 Score=25.16 Aligned_cols=54 Identities=13% Similarity=0.157 Sum_probs=43.0
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEE--EEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEE--VVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~--~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
.|++|+++....-++ -|+. .+=+|++...++|+-.+-.=+.+|++.|...+.+.
T Consensus 159 ~G~LvAVavtDvL~d---------GlSsVY~FydPd~s~~SLGt~~iL~~I~~aq~~~l~yvYLG 214 (253)
T COG2935 159 EGKLVAVAVTDVLPD---------GLSSVYTFYDPDMSKRSLGTLSILDQIAIAQRLGLPYVYLG 214 (253)
T ss_pred CCcEEEEEeeecccC---------cceeEEEEeCCChhhhcchHHHHHHHHHHHHHhCCCeEEEE
Confidence 588998887764332 2333 34599999999999999999999999999999874
No 132
>KOG1472|consensus
Probab=81.42 E-value=0.54 Score=35.55 Aligned_cols=72 Identities=17% Similarity=0.099 Sum_probs=55.4
Q ss_pred EEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC-CeEEEEccCCCchhHHHhcCcEEc
Q psy6918 39 VIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ-CYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 39 ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~-~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+||-++....+. .....|...+|.-+.|-+|+|+-++..+.++.+... +......+...++..+.+.||...
T Consensus 431 ~vggi~~r~f~~-----k~f~eivf~av~~~eqv~g~g~hlmnhlkd~~~~~~~i~~~ltyad~~aigyfkkqgfs~e 503 (720)
T KOG1472|consen 431 VVGGICFRPFPE-----KGFTEIVFCAVTTDEQVKGSGTHLMNHLKDYVRSSSTIDYALTYADEGAIGYFKKQGFSKE 503 (720)
T ss_pred cccccccCcCcc-----cCCcceeeccccCcccccccCcCchhhHHHHhhccchHHHHHHhhhhcccccccCccchhh
Confidence 788777764332 345678889999999999999999999999998866 544444566777778888888765
No 133
>KOG3014|consensus
Probab=77.07 E-value=3.9 Score=27.05 Aligned_cols=31 Identities=23% Similarity=0.132 Sum_probs=27.0
Q ss_pred eeEEEEEEEecccccCCchHHHHHHHHHHHH
Q psy6918 57 LKGKIEEVVVDDTYRGKELGKLLIAVLVKLA 87 (123)
Q Consensus 57 ~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~ 87 (123)
..+-|..+.|.+.-|++|+++.|++.+...-
T Consensus 182 ~~~GIsRIWV~s~~Rr~gIAs~lldva~~~~ 212 (257)
T KOG3014|consen 182 AICGISRIWVSSLRRRKGIASLLLDVARCNF 212 (257)
T ss_pred cEeeeEEEEeehhhhhhhhHHHHHHHHHHhh
Confidence 3457889999999999999999999988654
No 134
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=74.69 E-value=28 Score=24.83 Aligned_cols=82 Identities=21% Similarity=0.265 Sum_probs=44.6
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecc--cccCCchHHHHHHHHHHHHHhcCCeEEEEcc------
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDD--TYRGKELGKLLIAVLVKLAKHFQCYKLTLDC------ 99 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p--~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~------ 99 (123)
.+.+.+ ++++++|.+.+..... ..+...++| .=.| +|...-+-..+++.+.++++++++-.|.+.-
T Consensus 37 ~vgv~~-d~~~v~aa~ll~~~~~--~~g~~~~yi---prGPv~d~~d~ell~~f~~~Lk~~akk~~a~~lridP~~~~~~ 110 (406)
T PF02388_consen 37 RVGVKD-DGGEVAAAALLLRKKP--FKGFKYAYI---PRGPVMDYSDEELLEFFLEELKKYAKKKRALFLRIDPNVIYQE 110 (406)
T ss_dssp EEEEE--TTS-EEEEEEEEEEEC--TTTCEEEEE---TT--EC-TT-HHHHHHHHHHHHHHHCTTTEEEEEE--S-EEEC
T ss_pred EEEEEe-CCCeEEEEEEEEEecc--CCceeEEEE---CCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEEeCchhhhh
Confidence 344444 1467777665553322 111122222 2245 7778888999999999999998876665532
Q ss_pred ------------CCCchhHHHhcCcEEc
Q psy6918 100 ------------ADHMIPFYETFGFQKK 115 (123)
Q Consensus 100 ------------~~~~~~~y~~~Gf~~~ 115 (123)
+......++++||+..
T Consensus 111 ~~~~g~~~~~~~~~~~~~~l~~~G~~~~ 138 (406)
T PF02388_consen 111 RDEDGEPIEGEENDELIENLKALGFRHQ 138 (406)
T ss_dssp E-TTS-EEEE-S-THHHHHHHHTT-CCT
T ss_pred cccccccccCcchHHHHHHHHhcCceec
Confidence 1122467889999876
No 135
>PF00571 CBS: CBS domain CBS domain web page. Mutations in the CBS domain of Swiss:P35520 lead to homocystinuria.; InterPro: IPR000644 CBS (cystathionine-beta-synthase) domains are small intracellular modules, mostly found in two or four copies within a protein, that occur in a variety of proteins in bacteria, archaea, and eukaryotes [, ]. Tandem pairs of CBS domains can act as binding domains for adenosine derivatives and may regulate the activity of attached enzymatic or other domains []. In some cases, CBS domains may act as sensors of cellular energy status by being activated by AMP and inhibited by ATP []. In chloride ion channels, the CBS domains have been implicated in intracellular targeting and trafficking, as well as in protein-protein interactions, but results vary with different channels: in the CLC-5 channel, the CBS domain was shown to be required for trafficking [], while in the CLC-1 channel, the CBS domain was shown to be critical for channel function, but not necessary for trafficking []. Recent experiments revealing that CBS domains can bind adenosine-containing ligands such ATP, AMP, or S-adenosylmethionine have led to the hypothesis that CBS domains function as sensors of intracellular metabolites [, ]. Crystallographic studies of CBS domains have shown that pairs of CBS sequences form a globular domain where each CBS unit adopts a beta-alpha-beta-beta-alpha pattern []. Crystal structure of the CBS domains of the AMP-activated protein kinase in complexes with AMP and ATP shows that the phosphate groups of AMP/ATP lie in a surface pocket at the interface of two CBS domains, which is lined with basic residues, many of which are associated with disease-causing mutations []. In humans, mutations in conserved residues within CBS domains cause a variety of human hereditary diseases, including (with the gene mutated in parentheses): homocystinuria (cystathionine beta-synthase); Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase); retinitis pigmentosa (IMP dehydrogenase-1); congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members).; GO: 0005515 protein binding; PDB: 3JTF_A 3TE5_C 3TDH_C 3T4N_C 2QLV_C 3OI8_A 3LV9_A 2QH1_B 1PVM_B 3LQN_A ....
Probab=73.89 E-value=8.4 Score=18.44 Aligned_cols=33 Identities=12% Similarity=0.121 Sum_probs=20.1
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+..+..+.+. +.+...+.+..+ +++++|.++.
T Consensus 16 ~~l~~~~~~~~---~~~~~~~~V~d~--~~~~~G~is~ 48 (57)
T PF00571_consen 16 DSLEEALEIMR---KNGISRLPVVDE--DGKLVGIISR 48 (57)
T ss_dssp SBHHHHHHHHH---HHTSSEEEEEST--TSBEEEEEEH
T ss_pred CcHHHHHHHHH---HcCCcEEEEEec--CCEEEEEEEH
Confidence 33444444444 335555666655 7999999875
No 136
>cd07042 STAS_SulP_like_sulfate_transporter Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Probab=73.16 E-value=11 Score=20.69 Aligned_cols=40 Identities=13% Similarity=0.032 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-.++..+...+++.|........++...+.+.+.|+...
T Consensus 58 ~~~~L~~~~~~~~~~~~~~~l~~~~~~~~~~l~~~g~~~~ 97 (107)
T cd07042 58 AAEALEELVKDLRKRGVELYLAGLNPQVRELLERAGLLDE 97 (107)
T ss_pred HHHHHHHHHHHHHHCCCEEEEecCCHHHHHHHHHcCcHHH
Confidence 5667777777778888877777778888889999998765
No 137
>cd07043 STAS_anti-anti-sigma_factors Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation. Anti-anti-sigma factors play an important role in the regulation of several sigma factors and their corresponding anti-sigma factors. Upon dephosphorylation they bind the anti-sigma factor and induce the release of the sigma factor from the anti-sigma factor. In a feedback mechanism the anti-anti-sigma factor can be inactivated via phosphorylation by the anti-sigma factor. Well studied examples from Bacillus subtilis are SpoIIAA (regulating sigmaF and sigmaC which play an important role in sporulation) and RsbV (regulating sigmaB involved in the general stress response). The STAS domain is also found in the C- terminal region of sulphate transporters and stressosomes.
Probab=73.02 E-value=9.6 Score=20.51 Aligned_cols=41 Identities=12% Similarity=0.040 Sum_probs=31.2
Q ss_pred hHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 75 LGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
-|-.++..+.+.+++.|........++...++++..|+...
T Consensus 54 ~g~~~L~~l~~~~~~~g~~v~i~~~~~~~~~~l~~~gl~~~ 94 (99)
T cd07043 54 SGLGVLLGAYKRARAAGGRLVLVNVSPAVRRVLELTGLDRL 94 (99)
T ss_pred hhHHHHHHHHHHHHHcCCeEEEEcCCHHHHHHHHHhCccee
Confidence 45667777888888888775666678888889999998753
No 138
>PF13530 SCP2_2: Sterol carrier protein domain; PDB: 3SXN_C 3N7Z_A 3RYO_B 3R1K_A 3UY5_A 2HV2_F 2I00_D 2OZG_A.
Probab=72.17 E-value=23 Score=22.73 Aligned_cols=64 Identities=11% Similarity=0.091 Sum_probs=38.1
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEE-EEecccccCCchHHHHHHHHHHHHHhc--CCeEEEEc
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEE-VVVDDTYRGKELGKLLIAVLVKLAKHF--QCYKLTLD 98 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~-~~v~p~~rg~G~g~~Ll~~~~~~~~~~--~~~~i~~~ 98 (123)
..+...+++..+ +|++.|++.+.+.... .......|.. ++.+|+. ...+..+...+ .+..+.+.
T Consensus 21 ~~~~~~~~~~~~--~g~~~GY~~y~~~~~~--~~~~~l~V~El~~~~~~A---------~~aLl~fl~~h~~~~~~v~~~ 87 (218)
T PF13530_consen 21 EKDRGYAVYYDE--DGEPDGYVIYRFKDDW--EPGGTLEVRELVALDPEA---------YRALLAFLASHRDQVDEVEWN 87 (218)
T ss_dssp CCGSEEEEEEEC--TSEEEEEEEEEEET-S--SSTTEEEEEEEEESSHHH---------HHHHHHHHHTCCTTESEEEEE
T ss_pred cCCceEEEEECC--CCCeeEEEEEEEcccC--CCCceEEEEEEEeCCHHH---------HHHHHHHHHhhhCcceEEEEE
Confidence 345566666666 7999999999976532 3334567766 4555552 33444444343 45666664
No 139
>PF08901 DUF1847: Protein of unknown function (DUF1847); InterPro: IPR014997 This group of proteins are functionally uncharacterised. They contain 4 N-terminal cysteines that may form a zinc-binding domain.
Probab=71.47 E-value=5.9 Score=24.30 Aligned_cols=38 Identities=16% Similarity=0.280 Sum_probs=28.2
Q ss_pred HHHHHHHHHhcCCeEEEEcc----C---CCchhHHHhcCcEEccc
Q psy6918 80 IAVLVKLAKHFQCYKLTLDC----A---DHMIPFYETFGFQKKNN 117 (123)
Q Consensus 80 l~~~~~~~~~~~~~~i~~~~----~---~~~~~~y~~~Gf~~~~~ 117 (123)
++.++++|+..|+++|-+-. . ....++++..||++...
T Consensus 43 veEiieFak~mgykkiGiAfCiGL~~EA~~~~~iL~~~gFev~sV 87 (157)
T PF08901_consen 43 VEEIIEFAKRMGYKKIGIAFCIGLRKEARILAKILEANGFEVYSV 87 (157)
T ss_pred HHHHHHHHHHcCCCeeeehhhHhHHHHHHHHHHHHHHCCCEEEEE
Confidence 67788899999999997742 1 22356889999998643
No 140
>cd06844 STAS Sulphate Transporter and Anti-Sigma factor antagonist domain found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain is found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors, like anti-anti-sigma factors and "stressosome" components. The sigma factor regulators are involved in protein-protein interaction which is regulated by phosphorylation.
Probab=69.74 E-value=13 Score=20.44 Aligned_cols=40 Identities=13% Similarity=0.027 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-.++..+.+.+++.|.....+..++...+.+++.|+...
T Consensus 56 gl~~L~~l~~~~~~~g~~l~l~~~~~~v~~~l~~~gl~~~ 95 (100)
T cd06844 56 GTGVLLERSRLAEAVGGQFVLTGISPAVRITLTESGLDKG 95 (100)
T ss_pred HHHHHHHHHHHHHHcCCEEEEECCCHHHHHHHHHhCchhh
Confidence 5566777777778889877777778888999999998764
No 141
>cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role,
Probab=69.30 E-value=15 Score=19.98 Aligned_cols=30 Identities=13% Similarity=0.296 Sum_probs=18.3
Q ss_pred HHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 14 AERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+.+......+...+.+..+ +++++|+++.
T Consensus 76 ~~~~~~~~~~~~~~~~Vv~~--~~~~~Gvit~ 105 (111)
T cd04590 76 DDLLEEMRKERSHMAIVVDE--YGGTAGLVTL 105 (111)
T ss_pred HHHHHHHHhcCCcEEEEEEC--CCCEEEEeEH
Confidence 33444444445555666665 6899999875
No 142
>TIGR02886 spore_II_AA anti-sigma F factor antagonist. The anti-sigma F factor antagonist, also called stage II sporulation protein AA, is a protein universal among endospore-forming bacteria, all of which belong to the Firmcutes
Probab=68.92 E-value=15 Score=20.38 Aligned_cols=40 Identities=18% Similarity=0.060 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-.++..+.+.+++.|........++...+.+++.|+...
T Consensus 56 gi~~L~~~~~~~~~~g~~l~l~~~~~~v~~~l~~~gl~~~ 95 (106)
T TIGR02886 56 GLGVILGRYKKIKNEGGEVIVCNVSPAVKRLFELSGLFKI 95 (106)
T ss_pred HHHHHHHHHHHHHHcCCEEEEEeCCHHHHHHHHHhCCceE
Confidence 4455666777778888877777778888999999998764
No 143
>PF01740 STAS: STAS domain; InterPro: IPR002645 The STAS (Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function. The establishment of differential gene expression in sporulating Bacillus subtilis involves four protein components one of which is SpoIIAA (P10727 from SWISSPROT). The four components regulate the sporulation sigma factor F. Early in sporulation, SpoIIAA is in the phosphorylated state (SpoIIAA-P), as a result of the activity of the ATP-dependent protein kinase SpoIIAB (P10728 from SWISSPROT). The site at which this protein is a conserved serine. SpoIIAB is an anti-sigma factor that in its free form inhibits F by binding to it. Competition by SpoIIAA (the anti-anti-sigma factor) for binding to SpoIIAB releases Sigma F activity []. The STAS domain is found in the anti-sigma factor antagonist SpoIIAA.; PDB: 3T6O_B 3LKL_B 1H4Z_A 1H4Y_B 1H4X_B 3NY7_A 3OIZ_A 1T6R_A 1VC1_B 1SBO_A ....
Probab=67.17 E-value=9.2 Score=21.61 Aligned_cols=40 Identities=10% Similarity=0.054 Sum_probs=32.2
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-.++..+.+.++..|........++...+.+++.||...
T Consensus 65 gi~~L~~~~~~~~~~g~~~~l~~~~~~v~~~l~~~~~~~~ 104 (117)
T PF01740_consen 65 GIQALVDIIKELRRRGVQLVLVGLNPDVRRILERSGLIDF 104 (117)
T ss_dssp HHHHHHHHHHHHHHTTCEEEEESHHHHHHHHHHHTTGHHH
T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHcCCChh
Confidence 4567778888888899987888788888889999998753
No 144
>TIGR00377 ant_ant_sig anti-anti-sigma factor. This superfamily includes small (105-125 residue) proteins related to SpoIIAA of Bacillus subtilis, an anti-anti-sigma factor. SpoIIAA can bind to and inhibit the anti-sigma F factor SpoIIAB. Also, it can be phosphorylated by SpoIIAB on a Ser residue at position 59 of the seed alignment. A similar arrangement is inferred for RsbV, an anti-anti-sigma factor for sigma B. This Ser is fairly well conserved within a motif resembling MXS[STA]G[VIL]X[VIL][VILF] among homologous known or predicted anti-anti-sigma factors. Regions similar to SpoIIAA and apparently homologous, but differing considerably near the phosphorlated Ser of SpoIIAA, appear in a single copy in several longer proteins.
Probab=66.92 E-value=13 Score=20.60 Aligned_cols=40 Identities=8% Similarity=0.072 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-.++..+.+.+++.|........++...++++..|+...
T Consensus 60 gl~~L~~~~~~~~~~~~~~~l~~~~~~~~~~l~~~~l~~~ 99 (108)
T TIGR00377 60 GLGVLLGRYKQVRRVGGQLVLVSVSPRVARLLDITGLLRI 99 (108)
T ss_pred cHHHHHHHHHHHHhcCCEEEEEeCCHHHHHHHHHhChhhe
Confidence 4556666777778888765566668888999999998764
No 145
>PHA00432 internal virion protein A
Probab=65.24 E-value=28 Score=20.97 Aligned_cols=22 Identities=18% Similarity=0.179 Sum_probs=16.9
Q ss_pred EEccCCCchhHHHhcCcEEccc
Q psy6918 96 TLDCADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 96 ~~~~~~~~~~~y~~~Gf~~~~~ 117 (123)
...-|...++|.+.+||+-..+
T Consensus 100 V~~~N~~hir~Lk~lGf~f~~e 121 (137)
T PHA00432 100 VWVGNKSHIRFLKSIGAVFHNE 121 (137)
T ss_pred eecCCHHHHHHHHHcCeeeecc
Confidence 3344777789999999998744
No 146
>PF03588 Leu_Phe_trans: Leucyl/phenylalanyl-tRNA protein transferase; InterPro: IPR004616 Leucyl/phenylalanyl-tRNA--protein transferase 2.3.2.6 from EC transfers a Leu or Phe to the amino end of certain proteins to enable degradation. The N-terminal residue controls the biological half-life of many proteins via the N-end rule pathway.; GO: 0008914 leucyltransferase activity, 0030163 protein catabolic process; PDB: 2Z3L_A 2Z3O_A 2Z3P_A 2DPT_B 2Z3M_B 2Z3N_B 2DPS_B 2Z3K_B 2CXA_A.
Probab=65.19 E-value=32 Score=21.61 Aligned_cols=94 Identities=10% Similarity=-0.002 Sum_probs=56.3
Q ss_pred CCCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHH
Q psy6918 5 LPTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLV 84 (123)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~ 84 (123)
.++|=.+++.+.+..+...+....+-+-+ ++++||-.-...-.. -..-.+.+. +..+-++-.+..+.
T Consensus 77 ~~TWI~~~~~~aY~~Lh~~G~aHSvEvw~--~~~LvGGlyGv~iG~------~F~GESMFs-----~~~~ASKval~~L~ 143 (173)
T PF03588_consen 77 DGTWITPEMIEAYTELHELGYAHSVEVWQ--GGELVGGLYGVAIGG------VFFGESMFS-----RVSNASKVALVALV 143 (173)
T ss_dssp TGTTS-HHHHHHHHHHHHTTSEEEEEEEE--TTEEEEEEEEEEETT------EEEEEEEEE-----SSTTHHHHHHHHHH
T ss_pred CCCCcCHHHHHHHHHHHHcCeeEEEeeec--CCeeEEeeeCEEECC------EEEeccccc-----cCCChHHHHHHHHH
Confidence 34555677778888887778777775555 578888755442211 111222233 33467899999999
Q ss_pred HHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 85 KLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 85 ~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
+++++.|+..+.+-. . .+..+++|-+.
T Consensus 144 ~~L~~~g~~liD~Q~--~-~~hl~slGa~~ 170 (173)
T PF03588_consen 144 EHLRQCGFQLIDCQM--P-TPHLASLGAKE 170 (173)
T ss_dssp HHHHHTT--EEEEES-----HHHHHTTEEE
T ss_pred HHHHHCCCcEEEecc--C-CHHHHhcCCEe
Confidence 999999997666532 2 23345666554
No 147
>PF04339 DUF482: Protein of unknown function, DUF482; InterPro: IPR007434 This family contains several proteins of uncharacterised function.
Probab=64.84 E-value=48 Score=23.55 Aligned_cols=52 Identities=15% Similarity=0.135 Sum_probs=41.7
Q ss_pred EEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc-cCCCchhHHHhcCcEEc
Q psy6918 64 VVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD-CADHMIPFYETFGFQKK 115 (123)
Q Consensus 64 ~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~-~~~~~~~~y~~~Gf~~~ 115 (123)
+.++|......+...|++.+.+.+++.|+..+.+. +++......+..||...
T Consensus 106 ~l~~~~~~~~~~~~~L~~~~~~~a~~~~~Ss~h~lF~~~~~~~~l~~~G~~~r 158 (370)
T PF04339_consen 106 LLIAPGADRAALRAALLQALEQLAEENGLSSWHILFPDEEDAAALEEAGFLSR 158 (370)
T ss_pred eeECCCCCHHHHHHHHHHHHHHHHHHcCCCcceeecCCHHHHHHHHhCCCcee
Confidence 67778888888999999999999999998777653 45555667788888774
No 148
>PF09924 DUF2156: Uncharacterized conserved protein (DUF2156); InterPro: IPR024320 This domain of unknown function is found in uncharacterised proteins and in Lysylphosphatidylglycerol synthetase, which catalyses the transfer of a lysyl group from L-lysyl-tRNA(Lys) to membrane-bound phosphatidylglycerol [].; PDB: 2HQY_A.
Probab=62.32 E-value=15 Score=24.81 Aligned_cols=44 Identities=16% Similarity=0.297 Sum_probs=34.1
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEccc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~ 117 (123)
.--.+++..+.++|.+.+...+...+.+.....|+..||.....
T Consensus 55 ~~~~~~i~~f~~~~~~~~~~~~~~~v~e~~~~~~~~~g~~~~~~ 98 (299)
T PF09924_consen 55 EDRPELIEEFLEFADRNGWKPIFYGVSEEFLELLEELGFESNRD 98 (299)
T ss_dssp S-HHHHHHHHHHHHHHCTS--EEEEE-HHHHHHHHHHSEEE-GG
T ss_pred HHHHHHHHHHHHHHHHCCCceEEEECCHHHHHHHHHcCCeeecC
Confidence 34568999999999999999899889999999999999887643
No 149
>TIGR02990 ectoine_eutA ectoine utilization protein EutA. Members of this protein family are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include Sinorhizobium meliloti 1021 (where it is known to be induced by ectoine), Mesorhizobium loti and Silicibacter pomeroyi. It is missing from two other species with the other ectoine transport and utilization genes: Pseudomonas putida and Agrobacterium tumefaciens.
Probab=61.37 E-value=12 Score=24.73 Aligned_cols=39 Identities=13% Similarity=0.149 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHhcCCeEEEEcc------CCCchhHHHhcCcEEc
Q psy6918 77 KLLIAVLVKLAKHFQCYKLTLDC------ADHMIPFYETFGFQKK 115 (123)
Q Consensus 77 ~~Ll~~~~~~~~~~~~~~i~~~~------~~~~~~~y~~~Gf~~~ 115 (123)
..-..++++-+++.|+++|.+.+ +.....||++.||+..
T Consensus 105 tt~~~A~~~AL~alg~~RIalvTPY~~~v~~~~~~~l~~~G~eV~ 149 (239)
T TIGR02990 105 VTPSSAAVDGLAALGVRRISLLTPYTPETSRPMAQYFAVRGFEIV 149 (239)
T ss_pred eCHHHHHHHHHHHcCCCEEEEECCCcHHHHHHHHHHHHhCCcEEe
Confidence 44455666667788999998865 3455679999999987
No 150
>PF12953 DUF3842: Domain of unknown function (DUF3842); InterPro: IPR024208 This family of proteins has no known function.
Probab=59.37 E-value=13 Score=22.13 Aligned_cols=42 Identities=17% Similarity=0.124 Sum_probs=27.8
Q ss_pred ccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCc
Q psy6918 70 YRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGF 112 (123)
Q Consensus 70 ~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf 112 (123)
-||-|+|+.+++.+.+..-+ .+.-+-+.+|.-+..-+-|.|-
T Consensus 7 GQGGGiG~~iv~~lr~~~~~-~~eI~AlGTNa~AT~~MlKaGA 48 (131)
T PF12953_consen 7 GQGGGIGKQIVEKLRKELPE-EVEIIALGTNAIATSAMLKAGA 48 (131)
T ss_pred CCCChhHHHHHHHHHHhCCC-CcEEEEEehhHHHHHHHHHcCC
Confidence 38999999999999877554 3455555566554444444443
No 151
>PRK04531 acetylglutamate kinase; Provisional
Probab=59.31 E-value=64 Score=23.16 Aligned_cols=38 Identities=21% Similarity=0.260 Sum_probs=31.1
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
..++.-|+|.++.||.|++.-+...+.+.. +.+.|.+.
T Consensus 310 ~~~Ldkf~v~~~~~~~~v~d~vf~~~~~~~-----~~L~Wrsr 347 (398)
T PRK04531 310 GPYLDKFAVLDDARGEGLGRAVWNVMREET-----PQLFWRSR 347 (398)
T ss_pred ceEeEEEEEccchhhcChHHHHHHHHHhhC-----CceEEEcC
Confidence 579999999999999999999999888553 35666653
No 152
>PF10887 DUF2686: Protein of unknown function (DUF2686); InterPro: IPR021220 Some members in this family of proteins are annotated as yjfZ however currently no function is known.
Probab=58.99 E-value=47 Score=21.82 Aligned_cols=79 Identities=10% Similarity=0.067 Sum_probs=42.8
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHH---HHHHHHHHHHHhcCC-eEEEEccCCCchhHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGK---LLIAVLVKLAKHFQC-YKLTLDCADHMIPFYETFG 111 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~---~Ll~~~~~~~~~~~~-~~i~~~~~~~~~~~y~~~G 111 (123)
.|+-||++.-.+...........-.| +++|+.=.--.|++. ..+.+++.---..|. ..|-+.+-+.....+.++|
T Consensus 160 aGtTIGYi~TLP~~~~~~Ws~e~P~I-DiYIDQi~TVtGVsNssGF~~AALLNAn~~~g~dp~IGl~ayp~tA~ihs~~G 238 (276)
T PF10887_consen 160 AGTTIGYIVTLPVEMANCWSNELPRI-DIYIDQIMTVTGVSNSSGFALAALLNANIGMGNDPIIGLEAYPGTAEIHSKMG 238 (276)
T ss_pred CcceeEEEEecchhhcccccccCCce-eeEeeeeEEeeccccchhHHHHHHHhccccCCCCCeeeeeeCCCcHhhhhccC
Confidence 57888988766533321111111112 234443333333332 234444443333433 4567778889999999999
Q ss_pred cEEc
Q psy6918 112 FQKK 115 (123)
Q Consensus 112 f~~~ 115 (123)
|++.
T Consensus 239 y~vi 242 (276)
T PF10887_consen 239 YEVI 242 (276)
T ss_pred ceeC
Confidence 9986
No 153
>cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=58.66 E-value=20 Score=19.80 Aligned_cols=32 Identities=13% Similarity=0.199 Sum_probs=18.9
Q ss_pred HHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 13 FAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+.++.....+...+.+..+ +++++|+++..
T Consensus 78 l~~a~~~m~~~~~~~lpVvd~--~~~~~Gvi~~~ 109 (114)
T cd04619 78 LHDVWQVMKQRGLKNIPVVDE--NARPLGVLNAR 109 (114)
T ss_pred HHHHHHHHHHcCCCeEEEECC--CCcEEEEEEhH
Confidence 344444444445545555554 58999998753
No 154
>COG3473 Maleate cis-trans isomerase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=57.49 E-value=17 Score=23.62 Aligned_cols=30 Identities=17% Similarity=0.268 Sum_probs=21.6
Q ss_pred HHHhcCCeEEEEcc------CCCchhHHHhcCcEEc
Q psy6918 86 LAKHFQCYKLTLDC------ADHMIPFYETFGFQKK 115 (123)
Q Consensus 86 ~~~~~~~~~i~~~~------~~~~~~~y~~~Gf~~~ 115 (123)
-.+..+..++.+-+ |.....|++++||+..
T Consensus 112 aL~al~a~ri~vlTPY~~evn~~e~ef~~~~Gfeiv 147 (238)
T COG3473 112 ALNALGAQRISVLTPYIDEVNQREIEFLEANGFEIV 147 (238)
T ss_pred HHHhhCcceEEEeccchhhhhhHHHHHHHhCCeEEE
Confidence 34556777777654 4555779999999986
No 155
>PRK00301 aat leucyl/phenylalanyl-tRNA--protein transferase; Reviewed
Probab=57.40 E-value=54 Score=21.71 Aligned_cols=78 Identities=10% Similarity=0.071 Sum_probs=55.1
Q ss_pred CCCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEeccc--ccCCchHHHHHHH
Q psy6918 5 LPTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDT--YRGKELGKLLIAV 82 (123)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~--~rg~G~g~~Ll~~ 82 (123)
.++|=.+++.+.+..+...+....+-+-+ ++++||-.-.. .|+.++..++ .+...-++..+-.
T Consensus 107 ~~TWI~~e~~~aY~~LH~~G~AHSVE~W~--~~~LvGGlYGv-------------~iG~~F~GESMFs~~~nASKvAl~~ 171 (233)
T PRK00301 107 EGTWITPEIIEAYLELHELGHAHSVEVWQ--GGELVGGLYGV-------------ALGRAFFGESMFSRATDASKVALAA 171 (233)
T ss_pred CCCCCCHHHHHHHHHHHHcCceEEEEEEE--CCEEEeeeecc-------------ccCCEEeecccccCCCChHHHHHHH
Confidence 34555667788888887777777776655 68999876543 3444444444 2556788889999
Q ss_pred HHHHHHhcCCeEEEE
Q psy6918 83 LVKLAKHFQCYKLTL 97 (123)
Q Consensus 83 ~~~~~~~~~~~~i~~ 97 (123)
+.++++..|+..+.+
T Consensus 172 L~~~L~~~g~~liD~ 186 (233)
T PRK00301 172 LVEHLRRHGFKLIDC 186 (233)
T ss_pred HHHHHHHCCceEEEE
Confidence 999999999876655
No 156
>PRK00756 acyltransferase NodA; Provisional
Probab=56.53 E-value=29 Score=21.78 Aligned_cols=42 Identities=5% Similarity=-0.035 Sum_probs=30.1
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
.+.+.-+.|.|+..|.|++..+ ..+---.++.++..-.-++.
T Consensus 85 VaElGLygVRpDLEGlGi~~S~-r~m~PvLq~LgVPF~FGtVR 126 (196)
T PRK00756 85 VAELGLYGVRPDLEGLGIAHSI-RAMYPVLQELGVPFAFGTVR 126 (196)
T ss_pred EEEeeeeeeccccccccchhhH-HHHHHHHHhcCCCeecccch
Confidence 4577779999999999998866 55655666677654444333
No 157
>cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=56.43 E-value=33 Score=19.05 Aligned_cols=30 Identities=23% Similarity=0.267 Sum_probs=17.9
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.++.........+.+.++ +++++|+++..
T Consensus 86 ~~~~~m~~~~~~~l~Vvd~--~~~~~Givt~~ 115 (120)
T cd04641 86 TIFDLIVKARVHRLVVVDE--NKRVEGIISLS 115 (120)
T ss_pred HHHHHHHhcCccEEEEECC--CCCEEEEEEHH
Confidence 3344444445555556655 58899998753
No 158
>TIGR00667 aat leucyl/phenylalanyl-tRNA--protein transferase. The N-terminal residue controls the biological half-life of many proteins via the N-end rule pathway. This enzyme transfers a Leu or Phe to the amino end of certain proteins to enable degradation.
Probab=55.85 E-value=51 Score=20.97 Aligned_cols=76 Identities=12% Similarity=0.040 Sum_probs=52.7
Q ss_pred CCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEeccc--ccCCchHHHHHHHHH
Q psy6918 7 TSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDT--YRGKELGKLLIAVLV 84 (123)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~--~rg~G~g~~Ll~~~~ 84 (123)
+|=.+++.+.+..+...+....+-+-+ ++++||-+-.. .|+.++..++ .+...-++..+-.+.
T Consensus 79 TWI~~e~~~aY~~LH~~G~AHSvEvw~--~~~LvGGlYGv-------------~iG~~F~GESMFs~~~nASKvAl~~L~ 143 (185)
T TIGR00667 79 TWISDELVEAYHRLHELGHAHSFEVWQ--GDELVGGMYGI-------------AQGGLFCGESMFSRMTNASKTALLVFC 143 (185)
T ss_pred CCCCHHHHHHHHHHHHhCceEEEEEEE--CCEEEEeeeee-------------eeCCeEEeccccccCCChhHHHHHHHH
Confidence 455567777787777777777665555 68998876543 2333344333 355677888999999
Q ss_pred HHHHhcCCeEEEE
Q psy6918 85 KLAKHFQCYKLTL 97 (123)
Q Consensus 85 ~~~~~~~~~~i~~ 97 (123)
+.+++.|+..+.+
T Consensus 144 ~~L~~~g~~liDc 156 (185)
T TIGR00667 144 EHFIRHGGQLIDC 156 (185)
T ss_pred HHHHHCCCcEEEE
Confidence 9999999876654
No 159
>PF13466 STAS_2: STAS domain
Probab=55.51 E-value=29 Score=18.02 Aligned_cols=39 Identities=13% Similarity=0.065 Sum_probs=27.9
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGF 112 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf 112 (123)
.-|-+++..+.+.+++.|........++...++++..|+
T Consensus 41 sagl~lL~~~~~~~~~~g~~~~l~~~~~~~~~ll~~~gl 79 (80)
T PF13466_consen 41 SAGLQLLLAAARRARARGRQLRLTGPSPALRRLLELLGL 79 (80)
T ss_pred HHHHHHHHHHHHHHHHCCCeEEEEcCCHHHHHHHHHhCc
Confidence 345677777777888888765555567777788887775
No 160
>smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Probab=55.13 E-value=18 Score=15.66 Aligned_cols=20 Identities=10% Similarity=0.205 Sum_probs=12.4
Q ss_pred CCeEEEEEEcCCCCeEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+...+.+..+ +++++|++..
T Consensus 22 ~~~~~~v~~~--~~~~~g~i~~ 41 (49)
T smart00116 22 GIRRLPVVDE--EGRLVGIVTR 41 (49)
T ss_pred CCCcccEECC--CCeEEEEEEH
Confidence 3344445554 5889998874
No 161
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=54.38 E-value=59 Score=22.71 Aligned_cols=37 Identities=22% Similarity=0.105 Sum_probs=27.0
Q ss_pred eEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccC
Q psy6918 26 YLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRG 72 (123)
Q Consensus 26 ~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg 72 (123)
.-+.++.- .+|.+|+++.+. +..-.|.++.|.|..|.
T Consensus 282 CGv~vidl-~tG~vv~~l~fe---------g~v~EifdV~vLPg~r~ 318 (335)
T TIGR03032 282 CGVAVIDL-NSGDVVHWLRFE---------GVIEEIYDVAVLPGVRR 318 (335)
T ss_pred ccEEEEEC-CCCCEEEEEEeC---------CceeEEEEEEEecCCCC
Confidence 33444443 389999999986 23568999999999875
No 162
>cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz
Probab=53.93 E-value=31 Score=18.55 Aligned_cols=28 Identities=11% Similarity=0.151 Sum_probs=16.0
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+......+...+.+..+ +|+++|+++..
T Consensus 77 ~~~~~~~~~~~~~vv~~--~g~~~Gvit~~ 104 (109)
T cd04583 77 LGLVLKRGPKYVPVVDE--DGKLVGLITRS 104 (109)
T ss_pred HHHHHHcCCceeeEECC--CCeEEEEEehH
Confidence 33333334444555554 68999988753
No 163
>PF06574 FAD_syn: FAD synthetase; InterPro: IPR015864 Riboflavin is converted into catalytically active cofactors (FAD and FMN) by the actions of riboflavin kinase (2.7.1.26 from EC), which converts it into FMN, and FAD synthetase (2.7.7.2 from EC), which adenylates FMN to FAD. Eukaryotes usually have two separate enzymes, while most prokaryotes have a single bifunctional protein that can carry out both catalyses, although exceptions occur in both cases. While eukaryotic monofunctional riboflavin kinase is orthologous to the bifunctional prokaryotic enzyme [], the monofunctional FAD synthetase differs from its prokaryotic counterpart, and is instead related to the PAPS-reductase family []. The bacterial FAD synthetase that is part of the bifunctional enzyme has remote similarity to nucleotidyl transferases and, hence, it may be involved in the adenylylation reaction of FAD synthetases []. This entry represents prokaryotic-type FAD synthetase, which occurs primarily as part of a bifunctional enzyme.; GO: 0003919 FMN adenylyltransferase activity, 0009231 riboflavin biosynthetic process; PDB: 2X0K_B 3OP1_B 1T6Z_A 2I1L_A 1T6Y_B 1T6X_B 1S4M_A 1MRZ_A.
Probab=53.64 E-value=50 Score=20.19 Aligned_cols=35 Identities=29% Similarity=0.341 Sum_probs=24.2
Q ss_pred EecccccCCchH-HHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 65 VVDDTYRGKELG-KLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 65 ~v~p~~rg~G~g-~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
.+.-.|=|..+| .+|++.+.+.+++.++..+.++-
T Consensus 9 v~iG~FDGvH~GHq~Li~~~~~~a~~~~~~~~v~tF 44 (157)
T PF06574_consen 9 VAIGNFDGVHLGHQKLIKKAVEIAKEKGLKSVVLTF 44 (157)
T ss_dssp EEES--TT--HHHHHHHHHHHHHHHHCT-EEEEEEE
T ss_pred EEEeCCCCccHHHHHHHHHHhhhhhhcccceEEEEc
Confidence 344556788888 78999999999999988877654
No 164
>cd07041 STAS_RsbR_RsbS_like Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain of proteins related to RsbS and RsbR which are part of the "stressosome" complex that plays an important role in the regulation of the bacterial stress activated alternative sigma factor sigma-B. During stress conditions RsbS and RsbR are phosphorylated which leads to the release of RsbT, an activator of of the RsbU phosphatase, which in turn activates RsbV which leads to the release and activation of sigma factor B. RsbS is a single domain protein (STAS domain), while RsbR-like proteins have a well-conserved C-terminal STATS domain and a variable N-terminal domain. The STAS domain is also found in the C- terminal region of sulphate transporters and anti-anti-sigma factors.
Probab=53.33 E-value=34 Score=18.98 Aligned_cols=39 Identities=5% Similarity=0.002 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
|-.++..+.+.++..|........++...+.++..|+..
T Consensus 58 g~~~l~~~~~~~~~~g~~l~l~g~~~~v~~~l~~~gl~~ 96 (109)
T cd07041 58 VARHLLRLARALRLLGARTILTGIRPEVAQTLVELGIDL 96 (109)
T ss_pred HHHHHHHHHHHHHHcCCeEEEEeCCHHHHHHHHHhCCCh
Confidence 455677777777888887677777888888999888765
No 165
>PF04816 DUF633: Family of unknown function (DUF633) ; InterPro: IPR006901 This is a family of uncharacterised bacterial proteins.; GO: 0016429 tRNA (adenine-N1-)-methyltransferase activity; PDB: 3LEC_A 3KU1_G 3KR9_A 3GNL_B.
Probab=52.91 E-value=33 Score=22.03 Aligned_cols=45 Identities=18% Similarity=0.203 Sum_probs=30.0
Q ss_pred chHHHHHHHHHHHHHhc--CCeEEEEccCCCc---hhHHHhcCcEEccce
Q psy6918 74 ELGKLLIAVLVKLAKHF--QCYKLTLDCADHM---IPFYETFGFQKKNNF 118 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~--~~~~i~~~~~~~~---~~~y~~~Gf~~~~~~ 118 (123)
|+|..++..+++..... ....+.+..+... .++...+||....+.
T Consensus 74 GMGG~lI~~ILe~~~~~~~~~~~lILqP~~~~~~LR~~L~~~gf~I~~E~ 123 (205)
T PF04816_consen 74 GMGGELIIEILEAGPEKLSSAKRLILQPNTHAYELRRWLYENGFEIIDED 123 (205)
T ss_dssp EE-HHHHHHHHHHTGGGGTT--EEEEEESS-HHHHHHHHHHTTEEEEEEE
T ss_pred cCCHHHHHHHHHhhHHHhccCCeEEEeCCCChHHHHHHHHHCCCEEEEeE
Confidence 89999999999887553 4566777544333 558889999987443
No 166
>cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=52.69 E-value=39 Score=19.01 Aligned_cols=32 Identities=22% Similarity=0.205 Sum_probs=19.4
Q ss_pred HHHHHHhhhcCCCeEEEEEEcCCC-CeEEEEEEEE
Q psy6918 13 FAERFHRMKASQDYLVTVIEDTRT-KQVIGTGSLI 46 (123)
Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~-~~ivG~~~~~ 46 (123)
+.+.+..........+.+..+ + ++++|+++..
T Consensus 89 l~~~l~~m~~~~~~~lpVvd~--~~~~~~G~it~~ 121 (126)
T cd04640 89 VGDVVETLKASGRQHALVVDR--EHHQIRGIISTS 121 (126)
T ss_pred HHHHHHHHHHCCCceEEEEEC--CCCEEEEEEeHH
Confidence 344455444455555666665 4 6899998753
No 167
>COG1658 Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair]
Probab=51.36 E-value=13 Score=21.97 Aligned_cols=23 Identities=22% Similarity=0.293 Sum_probs=20.5
Q ss_pred EEEecccccCCchHHHHHHHHHH
Q psy6918 63 EVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 63 ~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
-+.+||+++|.-+.++|.+++-.
T Consensus 59 IILTD~D~~Ge~Irk~l~~~l~~ 81 (127)
T COG1658 59 IILTDPDRKGERIRKKLKEYLPG 81 (127)
T ss_pred EEEeCCCcchHHHHHHHHHHhcc
Confidence 36789999999999999999886
No 168
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=50.75 E-value=27 Score=20.12 Aligned_cols=23 Identities=9% Similarity=0.114 Sum_probs=19.2
Q ss_pred CchHHHHHHHHHHHHHhcCCeEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
.++...+++.+.+.++++|..+|
T Consensus 4 ~si~~~iv~~v~~~a~~~~~~~V 26 (114)
T PRK03681 4 ITLCQRALELIEQQAAKHGAKRV 26 (114)
T ss_pred HHHHHHHHHHHHHHHHHcCCCeE
Confidence 46889999999999999877655
No 169
>cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=49.95 E-value=40 Score=18.05 Aligned_cols=30 Identities=10% Similarity=0.131 Sum_probs=16.9
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+..........+.+..+ +|+++|++...
T Consensus 73 ~~~~~~~~~~~~~~~Vv~~--~g~~~Gvi~~~ 102 (107)
T cd04610 73 DAARVMFRTGISKLPVVDE--NNNLVGIITNT 102 (107)
T ss_pred HHHHHHHHhCCCeEeEECC--CCeEEEEEEHH
Confidence 3333333334444555555 68999998753
No 170
>PF06849 DUF1246: Protein of unknown function (DUF1246); InterPro: IPR010672 The last two steps of de novo purine biosynthesis are: i) conversion of 5-aminoimidazole-4-carboxamide-1-beta-D-ribofuranosyl 5'-monophosphate (AICAR) to 5-formaminoimidazole-4-carboxamide-1-beta-D-ribofuranosyl 5'-monophosphate (FAICAR) ii) conversion of FAICAR to inosine5'-monophopsphate (IMP) In bacteria and eukaryotes, these steps are catalysed by the well-characterised bifunctional enzyme PurH []. Archaea do not appear to posses PurH, however, and perform these reactions by a different mecahnism []. In archaea, step i) is catalysed by the well-conserved PurP protein, while step ii) is catalysed by the PurO enzyme in some (though not all) species [, ]. This entry represents the N-terminal domain of PurP. Its function is not known, though it is almost always found in association with IPR009720 from INTERPRO.; GO: 0000287 magnesium ion binding, 0005524 ATP binding, 0016879 ligase activity, forming carbon-nitrogen bonds, 0006188 IMP biosynthetic process; PDB: 2PBZ_C 2R85_B 2R87_E 2R84_A 2R86_A 2R7L_A 2R7N_A 2R7K_A 2R7M_A.
Probab=49.73 E-value=21 Score=21.07 Aligned_cols=31 Identities=23% Similarity=0.398 Sum_probs=21.1
Q ss_pred HHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 83 LVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 83 ~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
+..=|++.|++.+.+ |.+...++|+++++.-
T Consensus 12 Il~GAk~EGFrT~~i-c~~~r~~~Y~~f~~iD 42 (124)
T PF06849_consen 12 ILDGAKDEGFRTIAI-CQKGREKFYRRFPFID 42 (124)
T ss_dssp HHHHHHHTT--EEEE-EETTCHHHHHTTTT-S
T ss_pred HhhhHHHcCCcEEEE-ECCCCcchhhhcCcCc
Confidence 444578889988887 4577789999998543
No 171
>COG4353 Uncharacterized conserved protein [Function unknown]
Probab=49.46 E-value=33 Score=21.31 Aligned_cols=51 Identities=20% Similarity=0.192 Sum_probs=38.5
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
+|+.+|-+.+....+ -...++.-+-|.|-++|.-+-.+|.+.+-+...--+
T Consensus 37 ~ge~l~~l~vF~GR~-----yytPW~Eifnv~Pv~~gs~~E~~l~~~l~~~lspg~ 87 (192)
T COG4353 37 NGEQLGKLKVFKGRD-----YYTPWLEIFNVNPVFRGSELEVKLYKVLYNFLSPGG 87 (192)
T ss_pred CCceeeEEEEEcCCc-----cccchhhccccCCccCCCHHHHHHHHHHHHhcCCCC
Confidence 588888888765443 234567677799999999999999999887765444
No 172
>PRK02983 lysS lysyl-tRNA synthetase; Provisional
Probab=49.44 E-value=1.1e+02 Score=25.18 Aligned_cols=42 Identities=14% Similarity=0.148 Sum_probs=36.7
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
.--..++....++|+++|..-+...+++.....|.++||...
T Consensus 295 e~~~~~i~~F~~~a~~~g~~p~fy~vse~~~~~~~~~G~~~l 336 (1094)
T PRK02983 295 EAWPQAIDAWLALARTYGWAPAVMGASEAGARAYREAGLSAL 336 (1094)
T ss_pred HHHHHHHHHHHHHHHHcCCEEEEEEECHHHHHHHHHcCCcEE
Confidence 344579999999999999988888999999999999999875
No 173
>PF02268 TFIIA_gamma_N: Transcription initiation factor IIA, gamma subunit, helical domain; InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=49.38 E-value=32 Score=16.73 Aligned_cols=23 Identities=30% Similarity=0.356 Sum_probs=19.7
Q ss_pred ccCCchHHHHHHHHHHHHHhcCC
Q psy6918 70 YRGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 70 ~rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
||+--+|..|.+.+.+...+..+
T Consensus 6 YR~stlG~aL~dtLDeli~~~~I 28 (49)
T PF02268_consen 6 YRRSTLGIALTDTLDELIQEGKI 28 (49)
T ss_dssp GGCSHHHHHHHHHHHHHHHTTSS
T ss_pred HHcchHHHHHHHHHHHHHHcCCC
Confidence 79999999999999998877654
No 174
>PF02794 HlyC: RTX toxin acyltransferase family; InterPro: IPR003996 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior []. Four principal exotoxin secretion systems have been described. In the type II and IV secretion systems, toxins are first exported to the periplasm by way of a cleaved N-terminal signal sequence; a second set of proteins is used for extracellular transport (type II), or the C terminus of the exotoxin itself is used (type IV). Type III secretion involves at least 20 molecules that assemble into a needle; effector proteins are then translocated through this without need of a signal sequence. In the Type I system, a complete channel is formed through both membranes, and the secretion signal is carried on the C terminus of the exotoxin. The RTX (repeats in toxin) family of cytolytic toxins belong to the Type I secretion system, and are important virulence factors in Gram-negative bacteria. As well as the C-terminal signal sequence, several glycine-rich repeats are also found. These are essential for binding calcium, and are critical for the biological activity of the secreted toxins []. All RTX toxin operons exist in the order rtxCABD, RtxA protein being the structural component of the exotoxin, both RtxB and D being required for its export from the bacterial cell; RtxC is an acyl-carrier-protein-dependent acyl- modification enzyme, required to convert RtxA to its active form []. Escherichia coli haemolysin (HlyA) is often quoted as the model for RTX toxins. Recent work on its relative rtxC gene product HlyC [] has revealed that it provides the acylation aspect for post-translational modification of two internal lysine residues in the HlyA protein. Other residues, including His23 and two conserved tyrosine residues, also appear to be important []. ; GO: 0016746 transferase activity, transferring acyl groups, 0009404 toxin metabolic process, 0005737 cytoplasm
Probab=48.88 E-value=57 Score=19.47 Aligned_cols=73 Identities=5% Similarity=-0.073 Sum_probs=40.8
Q ss_pred CCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEeee---------------cccccceeEEEEEEEecccc
Q psy6918 6 PTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILEQK---------------FIHECALKGKIEEVVVDDTY 70 (123)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~---------------~~~~~~~~~~i~~~~v~p~~ 70 (123)
...+...+...+-.....+.+.++. + +|.++|+++-..-.+ .-+..+...+|-++ |.|-
T Consensus 17 r~~~l~~l~~~~lpai~~~Q~~l~~--~--~g~Pvaf~~WA~ls~e~e~~~l~~~~~l~~~dW~sG~rlWiiD~-iAPf- 90 (133)
T PF02794_consen 17 RDWPLSDLEQLLLPAIKLGQYRLYS--E--DGRPVAFCSWAFLSEEAEARYLEDPRSLSPEDWNSGDRLWIIDW-IAPF- 90 (133)
T ss_pred ccCcHHHHHHHHHHHHhhCcEEEEE--e--CCeEEEEEEhhcCCHHHHHHHHcCCCCCCchhcCCCCeEEEEEE-ECCC-
Confidence 3455566666665544555554433 3 699999998542211 01223444555553 4552
Q ss_pred cCCchHHHHHHHHHHHH
Q psy6918 71 RGKELGKLLIAVLVKLA 87 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~ 87 (123)
|-+..+++.+.+..
T Consensus 91 ---G~~~~~~~~lr~~~ 104 (133)
T PF02794_consen 91 ---GHARAMVRDLRRNL 104 (133)
T ss_pred ---CcHHHHHHHHHhcc
Confidence 55777777777654
No 175
>cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=48.65 E-value=45 Score=18.23 Aligned_cols=30 Identities=27% Similarity=0.246 Sum_probs=17.8
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.++.....+...+.+..+ +++++|.++..
T Consensus 80 ~a~~~~~~~~~~~l~Vv~~--~~~~~Gvl~~~ 109 (114)
T cd04801 80 EVLKLLEEQGLDELAVVED--SGQVIGLITEA 109 (114)
T ss_pred HHHHHHHHCCCCeeEEEcC--CCcEEEEEecc
Confidence 3344344444555555554 58999998765
No 176
>PF04555 XhoI: Restriction endonuclease XhoI; InterPro: IPR007636 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry represents type II restriction enzymes such as XhoI (3.1.21.4 from EC), which recognises the double-stranded sequence CTCGAG and cleave after C-1 [].; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=47.99 E-value=66 Score=20.57 Aligned_cols=36 Identities=17% Similarity=0.107 Sum_probs=27.4
Q ss_pred EEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 63 EVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 63 ~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
.+-|+|+|+|..|.++---.+.....+.-+....+.
T Consensus 144 hFpv~p~F~g~SY~~Ry~ilc~rLv~e~lY~aa~l~ 179 (196)
T PF04555_consen 144 HFPVDPEFKGASYLKRYEILCERLVQERLYTAACLI 179 (196)
T ss_pred CCCccHHhcCCcHHHHHHHHHHHHHHhcccceeEEE
Confidence 478999999999998877777777777666544443
No 177
>COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only]
Probab=47.97 E-value=37 Score=23.81 Aligned_cols=41 Identities=15% Similarity=0.181 Sum_probs=31.8
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|+| -|-..++++++..|.+.+.++.+++..++-+++|-...
T Consensus 174 G~G-GlGh~avQ~Aka~ga~Via~~~~~~K~e~a~~lGAd~~ 214 (339)
T COG1064 174 GAG-GLGHMAVQYAKAMGAEVIAITRSEEKLELAKKLGADHV 214 (339)
T ss_pred CCc-HHHHHHHHHHHHcCCeEEEEeCChHHHHHHHHhCCcEE
Confidence 555 66677788889999877777778888888888886553
No 178
>PF09409 PUB: PUB domain; InterPro: IPR018997 The PUB (also known as PUG) domain is found in peptide N-glycanase where it functions as a AAA ATPase binding domain []. This domain is also found on other proteins linked to the ubiquitin-proteasome system. ; PDB: 2CM0_A 2CCQ_A 2D5U_A 2HPL_A 2HPJ_A.
Probab=47.64 E-value=29 Score=18.72 Aligned_cols=42 Identities=12% Similarity=0.305 Sum_probs=24.9
Q ss_pred HHHHHHHHHHhcCCeEEEEcc---------CCCchhHHHhcCcEEc--cceeE
Q psy6918 79 LIAVLVKLAKHFQCYKLTLDC---------ADHMIPFYETFGFQKK--NNFMQ 120 (123)
Q Consensus 79 Ll~~~~~~~~~~~~~~i~~~~---------~~~~~~~y~~~Gf~~~--~~~~~ 120 (123)
++..++....+..+++|.++. .+....++..+||+.. +.++.
T Consensus 16 il~NI~~~P~~~kyR~Ir~~N~~f~~~i~~~~g~~~~L~~~GF~~~~~~~~~v 68 (87)
T PF09409_consen 16 ILSNILSNPNEEKYRRIRLSNKTFQEKILPVPGARELLEALGFREVTDEEFLV 68 (87)
T ss_dssp HHHHHHHSTT-CGGGEEETTSHHHHHHTTTSTTHHHHHHHHT-EE---SSEEE
T ss_pred HHHHHccCCCcccceEeecCcchHHHHhcCChhHHHHHHHCCCEEecCCCEEE
Confidence 344444444455777887642 3567889999999998 44543
No 179
>PF04339 DUF482: Protein of unknown function, DUF482; InterPro: IPR007434 This family contains several proteins of uncharacterised function.
Probab=47.60 E-value=1e+02 Score=21.98 Aligned_cols=83 Identities=8% Similarity=-0.017 Sum_probs=49.2
Q ss_pred CCCeEEEEEEcCCCCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCC
Q psy6918 23 SQDYLVTVIEDTRTKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADH 102 (123)
Q Consensus 23 ~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~ 102 (123)
.+...++++.. +|++||++.....++ ..+-.......++.+... ..+.=..+++|-++|++.+...+.-
T Consensus 248 ~~~~~l~~A~~--~g~~Va~aL~l~~~~-------~LyGRYwG~~~~~~~LHF-e~cYYq~Ie~aI~~Gl~~f~~GaqG- 316 (370)
T PF04339_consen 248 PEQVVLVVARR--DGQPVAFALCLRGDD-------TLYGRYWGCDEEIPFLHF-ELCYYQGIEYAIEHGLRRFEPGAQG- 316 (370)
T ss_pred cCCEEEEEEEE--CCeEEEEEEEEEeCC-------EEEEeeecccccccCcch-HHHHHHHHHHHHHcCCCEEECCcch-
Confidence 35566677776 799999988875533 111111223344444442 3334568899999999987776432
Q ss_pred chhHHHhcCcEEccce
Q psy6918 103 MIPFYETFGFQKKNNF 118 (123)
Q Consensus 103 ~~~~y~~~Gf~~~~~~ 118 (123)
-++ -..||.++..+
T Consensus 317 EHK--~~RGf~P~~t~ 330 (370)
T PF04339_consen 317 EHK--IARGFEPVPTY 330 (370)
T ss_pred hHH--HHcCCccccce
Confidence 122 24577776443
No 180
>cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=47.26 E-value=41 Score=18.19 Aligned_cols=30 Identities=27% Similarity=0.266 Sum_probs=16.9
Q ss_pred HHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 14 AERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+.++.+...+...+.+..+ +|+++|+++.
T Consensus 73 ~~~~~~~~~~~~~~~~Vv~~--~~~~~G~it~ 102 (108)
T cd04596 73 ASVAHMMIWEGIEMLPVVDD--NKKLLGIISR 102 (108)
T ss_pred HHHHHHHHHcCCCeeeEEcC--CCCEEEEEEH
Confidence 33343333334444555554 6899999875
No 181
>KOG4518|consensus
Probab=47.22 E-value=33 Score=22.25 Aligned_cols=25 Identities=16% Similarity=0.263 Sum_probs=21.6
Q ss_pred hHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 75 LGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
.-++-++.+++++++.||.+|.+.+
T Consensus 83 ~FR~~Ld~ai~yAkalgC~rIHlmA 107 (264)
T KOG4518|consen 83 EFRKSLDTAIEYAKALGCCRIHLMA 107 (264)
T ss_pred HHHHHHHHHHHHHHHhCCceEEEec
Confidence 4467789999999999999999875
No 182
>cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=46.66 E-value=52 Score=18.35 Aligned_cols=30 Identities=23% Similarity=0.343 Sum_probs=17.7
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+..........+.+..+ +++++|.++..
T Consensus 89 ~a~~~m~~~~~~~lpVvd~--~~~~vGiit~~ 118 (123)
T cd04627 89 DALHLMHNEGISSVAVVDN--QGNLIGNISVT 118 (123)
T ss_pred HHHHHHHHcCCceEEEECC--CCcEEEEEeHH
Confidence 3344444444444555554 58999998764
No 183
>COG0375 HybF Zn finger protein HypA/HybF (possibly regulating hydrogenase expression) [General function prediction only]
Probab=46.46 E-value=42 Score=19.58 Aligned_cols=22 Identities=14% Similarity=0.306 Sum_probs=18.9
Q ss_pred chHHHHHHHHHHHHHhcCCeEE
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
.++..++..++++|++++..+|
T Consensus 5 Sla~aii~~i~~~A~~~~a~~V 26 (115)
T COG0375 5 SLAQAIIELIEEQAEKHGAKRV 26 (115)
T ss_pred HHHHHHHHHHHHHHHHcCCceE
Confidence 5788999999999999988554
No 184
>COG2898 Uncharacterized conserved protein [Function unknown]
Probab=46.29 E-value=44 Score=25.04 Aligned_cols=40 Identities=18% Similarity=0.230 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
-.+|+....+.|+.+|+.-+...+.+...++|..+|+...
T Consensus 271 ~~eli~~F~e~A~~~G~r~~fy~vs~~~~p~y~d~Gl~~~ 310 (538)
T COG2898 271 WPELIWAFLELADRHGWRPVFYGVSEEGAPLYADAGLRAL 310 (538)
T ss_pred hHHHHHHHHHHHHhcCCeeEEEEeCccccHHHHhcCccee
Confidence 6899999999999999999999999999999999999875
No 185
>cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB
Probab=46.25 E-value=48 Score=17.85 Aligned_cols=28 Identities=18% Similarity=0.191 Sum_probs=15.8
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
++.........+.+..+ +|+++|++...
T Consensus 79 l~~~~~~~~~~~~Vv~~--~~~~~Gvi~~~ 106 (111)
T cd04611 79 RQLMREHGIRHLVVVDD--DGELLGLLSQT 106 (111)
T ss_pred HHHHHHcCCeEEEEECC--CCcEEEEEEhH
Confidence 33333334444555554 58999988753
No 186
>cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai
Probab=46.20 E-value=40 Score=18.30 Aligned_cols=29 Identities=24% Similarity=0.338 Sum_probs=16.5
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+.+......+...+.+..+ +++++|+++.
T Consensus 79 ~~~~~~~~~~~~~l~Vv~~--~~~~~Gvvs~ 107 (113)
T cd04587 79 EALHLMVQGKFRHLPVVDK--SGQVVGLLDV 107 (113)
T ss_pred HHHHHHHHcCCCcccEECC--CCCEEEEEEH
Confidence 3344333344434455554 5899999875
No 187
>COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription]
Probab=45.79 E-value=47 Score=20.81 Aligned_cols=37 Identities=19% Similarity=0.222 Sum_probs=20.7
Q ss_pred CCCCCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEE
Q psy6918 4 KLPTSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGS 44 (123)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~ 44 (123)
..|..+++.-..-+..++..... +++.+ +|+++|.++
T Consensus 139 ~fP~Vs~~~~l~vI~~LL~~~~A-VlV~e---~G~~vGIIT 175 (187)
T COG3620 139 PFPTVSPDESLNVISQLLEEHPA-VLVVE---NGKVVGIIT 175 (187)
T ss_pred CCCcCCCCCCHHHHHHHHhhCCe-EEEEe---CCceEEEEe
Confidence 33444555444445554444444 33444 599999876
No 188
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=45.44 E-value=36 Score=19.63 Aligned_cols=23 Identities=17% Similarity=0.306 Sum_probs=19.1
Q ss_pred CchHHHHHHHHHHHHHhcCCeEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
.++...+++.+.+.+++++..+|
T Consensus 4 ~sia~~iv~~v~~~a~~~~~~~V 26 (115)
T TIGR00100 4 LSLAEAMLEIVEEQAEKHQAKKV 26 (115)
T ss_pred HHHHHHHHHHHHHHHHHhCCCeE
Confidence 46889999999999998887654
No 189
>cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa.
Probab=45.25 E-value=50 Score=17.75 Aligned_cols=29 Identities=10% Similarity=0.157 Sum_probs=16.8
Q ss_pred HHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 16 RFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+......+...+.+..+ +++++|.++..
T Consensus 77 ~~~~~~~~~~~~~~Vv~~--~~~~~Gvi~~~ 105 (110)
T cd04601 77 ALELLHEHKIEKLPVVDD--EGKLKGLITVK 105 (110)
T ss_pred HHHHHHHhCCCeeeEEcC--CCCEEEEEEhh
Confidence 333333334444555554 68999998764
No 190
>cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=44.96 E-value=53 Score=17.97 Aligned_cols=30 Identities=13% Similarity=0.253 Sum_probs=17.4
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+..........+.+..+ +++++|.++..
T Consensus 77 ~al~~m~~~~~~~lpVvd~--~~~~~Giit~~ 106 (111)
T cd04603 77 DLLRIFRETEPPVVAVVDK--EGKLVGTIYER 106 (111)
T ss_pred HHHHHHHHcCCCeEEEEcC--CCeEEEEEEhH
Confidence 3344434444444555555 58999988753
No 191
>PF11090 DUF2833: Protein of unknown function (DUF2833); InterPro: IPR020335 This entry contains proteins with no known function.
Probab=44.84 E-value=54 Score=18.05 Aligned_cols=43 Identities=21% Similarity=0.310 Sum_probs=25.3
Q ss_pred hHHHHHHHHHHHHHhc-CCeEEEEccCCCchhHHHhcCcEEccc
Q psy6918 75 LGKLLIAVLVKLAKHF-QCYKLTLDCADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~-~~~~i~~~~~~~~~~~y~~~Gf~~~~~ 117 (123)
+.+.+...+....++. .+..-.+..|+..++|.+.+|++-..+
T Consensus 41 F~k~i~~~~d~~l~~Y~~l~N~V~~~N~~HIRfLk~lGA~f~~e 84 (86)
T PF11090_consen 41 FRKLIKEYLDKMLKQYPVLWNFVWVGNKSHIRFLKSLGAVFHNE 84 (86)
T ss_pred HHHHHHHHHHHHHHHhhheeEEEEeCCHHHHHHHHhcCcEEccc
Confidence 3344443333333332 223334556888999999999986544
No 192
>COG3543 Uncharacterized conserved protein [Function unknown]
Probab=44.79 E-value=35 Score=20.32 Aligned_cols=36 Identities=14% Similarity=0.242 Sum_probs=26.1
Q ss_pred EEEecccccCCchHHHHHHHHHHHHHhc-CCeEEEEc
Q psy6918 63 EVVVDDTYRGKELGKLLIAVLVKLAKHF-QCYKLTLD 98 (123)
Q Consensus 63 ~~~v~p~~rg~G~g~~Ll~~~~~~~~~~-~~~~i~~~ 98 (123)
++.-...|+|+||..+.++..-..+... .-..|.+.
T Consensus 9 HLlCmq~y~GkGYS~~FveN~d~I~~rL~~ge~i~lV 45 (135)
T COG3543 9 HLLCMQGYQGKGYSPAFVENYDAIAERLKAGEDIKLV 45 (135)
T ss_pred hhheeeecccccCCHHHHHHHHHHHHHhhcCCCeEEE
Confidence 3556678999999999999988888664 22335443
No 193
>cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=44.66 E-value=49 Score=18.57 Aligned_cols=22 Identities=27% Similarity=0.296 Sum_probs=13.9
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+.....+.+..+ +++++|.++.
T Consensus 99 ~~~~~~l~Vvd~--~~~~~Giit~ 120 (126)
T cd04642 99 ANKVHRVWVVDE--EGKPIGVITL 120 (126)
T ss_pred HhCCcEEEEECC--CCCEEEEEEH
Confidence 334444555554 5899998874
No 194
>cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i
Probab=44.51 E-value=51 Score=18.03 Aligned_cols=21 Identities=14% Similarity=0.200 Sum_probs=13.2
Q ss_pred CCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 23 SQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 23 ~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+...+.+..+ +|+++|.++.
T Consensus 96 ~~~~~~~Vv~~--~~~~~Gvvt~ 116 (122)
T cd04585 96 RKISGLPVVDD--QGRLVGIITE 116 (122)
T ss_pred cCCCceeEECC--CCcEEEEEEH
Confidence 34444545544 5899998875
No 195
>cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=44.22 E-value=53 Score=17.81 Aligned_cols=27 Identities=19% Similarity=0.100 Sum_probs=15.2
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
++.+.......+.+..+ +|+++|.+..
T Consensus 81 ~~~~~~~~~~~~~Vvd~--~g~~~Gvvt~ 107 (113)
T cd04615 81 RWLMSNNNISRLPVLDD--KGKVGGIVTE 107 (113)
T ss_pred HHHHHHcCCCeeeEECC--CCeEEEEEEH
Confidence 33333334334455554 6799998875
No 196
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=43.79 E-value=41 Score=19.72 Aligned_cols=24 Identities=8% Similarity=0.166 Sum_probs=20.0
Q ss_pred CchHHHHHHHHHHHHHhcCCeEEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
.++...+++.+.+.+++++.++|.
T Consensus 4 ~si~~~il~~v~~~a~~~~~~rV~ 27 (124)
T PRK00762 4 LSMACEIVEAVIDTAEKNNATEVT 27 (124)
T ss_pred HHHHHHHHHHHHHHHHHcCCCeEE
Confidence 468899999999999998887664
No 197
>PF01751 Toprim: Toprim domain; InterPro: IPR006171 This is a conserved region from DNA primase. This corresponds to the Toprim (topoisomerase-primase) domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR/M DNA repair proteins []. Both DnaG motifs IV and V are present in the alignment, the DxD (V) motif may be involved in Mg2+ binding and mutations to the conserved glutamate (IV) completely abolish DnaG type primase activity. DNA primase 2.7.7.6 from EC is a nucleotidyltransferase it synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork; it can also prime the leading stand and has been implicated in cell division []. This family also includes the atypical archaeal A subunit from type II DNA topoisomerases []. Type II DNA topoisomerases catalyse the relaxation of DNA supercoiling by causing transient double strand breaks.; PDB: 2ZJT_A 3IG0_A 3M4I_A 3NUH_B 1GKU_B 1GL9_C 3PWT_A 1CY4_A 1ECL_A 1CY7_A ....
Probab=43.78 E-value=17 Score=20.10 Aligned_cols=34 Identities=15% Similarity=0.160 Sum_probs=21.4
Q ss_pred EEEecccccCCchHHHHHHHHHHHHHhcCCeEEEE
Q psy6918 63 EVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTL 97 (123)
Q Consensus 63 ~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~ 97 (123)
-+++||+--|.-++..+++.+....... ++++++
T Consensus 64 iiatD~D~EGe~Ia~~i~~~~~~~~~~~-~~R~~~ 97 (100)
T PF01751_consen 64 IIATDPDREGELIAWEIIELLGKNNPKL-IKRVWF 97 (100)
T ss_dssp EEEC-SSHHHHHHHHHHHHHHHHHSHHH-TTEEEE
T ss_pred eecCCCChHHHHHHHHHHHHHhHhCCCc-CCEEEE
Confidence 4677788777777777777766655443 455554
No 198
>cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=43.59 E-value=58 Score=18.06 Aligned_cols=27 Identities=15% Similarity=0.137 Sum_probs=15.8
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
++.+...+...+.+.++ +++++|.++.
T Consensus 93 ~~~~~~~~~~~~~V~~~--~~~~~Gvit~ 119 (125)
T cd04631 93 AELMLEKRVGGLPVVDD--DGKLVGIVTE 119 (125)
T ss_pred HHHHHHcCCceEEEEcC--CCcEEEEEEH
Confidence 33333334444555554 5899999875
No 199
>cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos
Probab=42.92 E-value=58 Score=17.84 Aligned_cols=30 Identities=10% Similarity=0.179 Sum_probs=17.5
Q ss_pred HHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 14 AERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+.+..........+.+..+ +++++|+++.
T Consensus 79 ~~~l~~~~~~~~~~~pVv~~--~~~~~Gvit~ 108 (114)
T cd04602 79 EEANEILRESKKGKLPIVND--DGELVALVTR 108 (114)
T ss_pred HHHHHHHHhcCCCceeEECC--CCeEEEEEEH
Confidence 34444444444444555554 6899999875
No 200
>cd03173 DUF619-like DUF619 domain of various N-acetylglutamate Kinases and N-acetylglutamate Synthases. DUF619-like: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. This subgroup also includes the DUF619 domain of the FABP N-acetylglutamate kinase (NAGK), the enzyme that catalyzes the second reaction of arginine
Probab=42.73 E-value=63 Score=18.21 Aligned_cols=39 Identities=10% Similarity=0.035 Sum_probs=31.2
Q ss_pred eeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 57 LKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 57 ~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
...++..++|.+..++.|++..+.+.+.+. ...+.+.+.
T Consensus 32 ~v~~LdkFav~~~~~~~gv~D~vf~~i~~d-----~~~L~Wrsr 70 (98)
T cd03173 32 SIPYLDKFAVSDHLWLNNVTDNIFNLIRKD-----FPSLLWRVR 70 (98)
T ss_pred CCEEEEEEEEcccccccCHHHHHHHHHHhh-----CCeeEEEeC
Confidence 467999999999999999999999988754 345666553
No 201
>cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct
Probab=42.27 E-value=57 Score=17.63 Aligned_cols=27 Identities=15% Similarity=0.298 Sum_probs=16.0
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
++.+...+...+.+..+ +++++|++..
T Consensus 82 ~~~~~~~~~~~~~Vv~~--~~~~iG~it~ 108 (114)
T cd04604 82 LELMEENKITALPVVDD--NGRPVGVLHI 108 (114)
T ss_pred HHHHHHcCCCEEEEECC--CCCEEEEEEH
Confidence 33333344444555654 5899999875
No 202
>COG3640 CooC CO dehydrogenase maturation factor [Cell division and chromosome partitioning]
Probab=42.23 E-value=67 Score=21.52 Aligned_cols=42 Identities=24% Similarity=0.219 Sum_probs=30.8
Q ss_pred CchHHHHHHHH-HHHHHhc-CCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 73 KELGKLLIAVL-VKLAKHF-QCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 73 ~G~g~~Ll~~~-~~~~~~~-~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
-|.|+..+.++ .....+. ++..+.+++++ +..+-..+|-...
T Consensus 9 GG~GKTtiaalll~~l~~~~~~~VLvVDaDp-d~nL~~~LGve~~ 52 (255)
T COG3640 9 GGVGKTTIAALLLKRLLSKGGYNVLVVDADP-DSNLPEALGVEEP 52 (255)
T ss_pred CCccHHHHHHHHHHHHHhcCCceEEEEeCCC-CCChHHhcCCCCC
Confidence 47898888888 4444444 58878888877 8888888887663
No 203
>PF10116 Host_attach: Protein required for attachment to host cells; InterPro: IPR019291 Members of this family of bacterial proteins are required for the attachment of the bacterium to host cells [, ].
Probab=42.18 E-value=49 Score=19.56 Aligned_cols=37 Identities=11% Similarity=0.125 Sum_probs=29.7
Q ss_pred CCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHH
Q psy6918 72 GKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYE 108 (123)
Q Consensus 72 g~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~ 108 (123)
..-+++++.+.+.+...+..+..+++.+.+...+..+
T Consensus 71 ~~~Fa~~vA~~L~~~~~~~~~~~LvlvA~p~~LG~LR 107 (138)
T PF10116_consen 71 EERFAREVADRLEKARRAGKFDRLVLVAPPRFLGLLR 107 (138)
T ss_pred HHHHHHHHHHHHHHHHHhCCCCeEEEEECHHHHHHHH
Confidence 4457788899999888889999999988877766543
No 204
>PF12017 Tnp_P_element: Transposase protein; InterPro: IPR021896 Protein in this family are transposases found in insects. This region is about 230 amino acids in length and is found associated with PF05485 from PFAM.
Probab=42.07 E-value=74 Score=21.07 Aligned_cols=41 Identities=5% Similarity=0.222 Sum_probs=32.9
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEcc--CCCchhHHHhcCcEE
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDC--ADHMIPFYETFGFQK 114 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~--~~~~~~~y~~~Gf~~ 114 (123)
.+...++..+++.+.+.|+..+.+.+ .+.|..+++.+|-.+
T Consensus 192 ~m~~~~l~~iI~~l~~~g~~VvAivsD~g~~N~~~w~~Lgi~~ 234 (236)
T PF12017_consen 192 SMDADILKNIIEKLHEIGYNVVAIVSDMGSNNISLWRELGISE 234 (236)
T ss_pred cCCHHHHHHHHHHHHHCCCEEEEEECCCCcchHHHHHHcCCCC
Confidence 45578888899999999998887766 467899999998654
No 205
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=41.96 E-value=45 Score=19.19 Aligned_cols=23 Identities=9% Similarity=0.218 Sum_probs=18.8
Q ss_pred CchHHHHHHHHHHHHHhcCCeEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKL 95 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i 95 (123)
.++...+++.+.+.+++++..+|
T Consensus 4 lsi~~~iv~~v~~~a~~~~~~rV 26 (113)
T PRK12380 4 LSLCQSAVEIIQRQAEQHDVKRV 26 (113)
T ss_pred HHHHHHHHHHHHHHHHHhCCCeE
Confidence 46788999999999998876555
No 206
>cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=41.86 E-value=63 Score=18.00 Aligned_cols=28 Identities=18% Similarity=0.256 Sum_probs=15.9
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
++.+...+...+.+..+ +++++|.+...
T Consensus 81 ~~~~~~~~~~~lpVvd~--~~~l~Givt~~ 108 (113)
T cd04597 81 LNLMHEHNIRTLPVVDD--DGTPAGIITLL 108 (113)
T ss_pred HHHHHHcCCCEEEEECC--CCeEEEEEEHH
Confidence 33333334344555554 68999998753
No 207
>cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=41.71 E-value=62 Score=17.84 Aligned_cols=21 Identities=14% Similarity=0.218 Sum_probs=13.0
Q ss_pred CCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 23 SQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 23 ~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+...+.+..+ +++++|+++.
T Consensus 96 ~~~~~~~Vv~~--~~~~~Gvit~ 116 (122)
T cd04803 96 NKIGCLPVVDD--KGTLVGIITR 116 (122)
T ss_pred cCCCeEEEEcC--CCCEEEEEEH
Confidence 34444555544 5799998875
No 208
>cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=41.71 E-value=58 Score=17.54 Aligned_cols=35 Identities=9% Similarity=0.135 Sum_probs=19.7
Q ss_pred CCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 7 TSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+..+..+.+. ..+...+++..+ +|+++|.+...
T Consensus 10 ~~~~~~~~~~~~---~~~~~~~~V~d~--~~~~~G~v~~~ 44 (110)
T cd04605 10 DASIKEAAKLMI---EENINHLPVVDE--DGRLVGIVTSW 44 (110)
T ss_pred CCCHHHHHHHHH---hCCCceEEEECC--CCcEEEEEeHH
Confidence 334444444443 233334545554 68999999854
No 209
>cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=41.26 E-value=53 Score=17.56 Aligned_cols=27 Identities=19% Similarity=0.238 Sum_probs=15.7
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+....+.+...+.+..+ +++++|.++.
T Consensus 64 ~~~m~~~~~~~lpVv~~--~~~~~Giit~ 90 (96)
T cd04614 64 AQKMKRNRIEQIPIING--NDKLIGLLRD 90 (96)
T ss_pred HHHHHHhCCCeeeEECC--CCcEEEEEEH
Confidence 33333344444555554 5899998875
No 210
>cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=41.10 E-value=62 Score=17.65 Aligned_cols=30 Identities=20% Similarity=0.234 Sum_probs=17.0
Q ss_pred HHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 14 AERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+.+..........+.+..+ +|+++|.++.
T Consensus 80 ~~a~~~~~~~~~~~~pVvd~--~~~~~Gvit~ 109 (115)
T cd04620 80 FTALSLFRQHQIRHLPVLDD--QGQLIGLVTA 109 (115)
T ss_pred HHHHHHHHHhCCceEEEEcC--CCCEEEEEEh
Confidence 33344444444444555554 6899998874
No 211
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=41.01 E-value=40 Score=19.34 Aligned_cols=23 Identities=13% Similarity=0.234 Sum_probs=18.4
Q ss_pred chHHHHHHHHHHHHHhcCCeEEE
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
++...+++.+.+.+++++.++|.
T Consensus 5 si~~~iv~~v~~~a~~~~~~kV~ 27 (113)
T PF01155_consen 5 SIAQSIVEIVEEEAEENGAKKVT 27 (113)
T ss_dssp HHHHHHHHHHHHHHHCTT-SEEE
T ss_pred HHHHHHHHHHHHHHHHcCCCEEE
Confidence 67889999999999988776553
No 212
>cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=40.74 E-value=64 Score=17.77 Aligned_cols=28 Identities=14% Similarity=0.124 Sum_probs=16.1
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
++.....+...+.+..+ +|+++|+++..
T Consensus 90 ~~~~~~~~~~~~~Vvd~--~g~~~Gvit~~ 117 (122)
T cd04635 90 VELMLEHDIGRLPVVNE--KDQLVGIVDRH 117 (122)
T ss_pred HHHHHHcCCCeeeEEcC--CCcEEEEEEhH
Confidence 33333334444455554 58999998753
No 213
>PRK05031 tRNA (uracil-5-)-methyltransferase; Validated
Probab=40.34 E-value=51 Score=23.13 Aligned_cols=47 Identities=13% Similarity=0.200 Sum_probs=32.5
Q ss_pred EEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCc----hhHHHhcCcEEc
Q psy6918 61 IEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHM----IPFYETFGFQKK 115 (123)
Q Consensus 61 i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~----~~~y~~~Gf~~~ 115 (123)
..-+++||. | .|+..++++.+.+. . ..+.++|++.. ...+.+ ||+..
T Consensus 289 ~D~v~lDPP-R-~G~~~~~l~~l~~~---~--~ivyvSC~p~tlarDl~~L~~-gY~l~ 339 (362)
T PRK05031 289 FSTIFVDPP-R-AGLDDETLKLVQAY---E--RILYISCNPETLCENLETLSQ-THKVE 339 (362)
T ss_pred CCEEEECCC-C-CCCcHHHHHHHHcc---C--CEEEEEeCHHHHHHHHHHHcC-CcEEE
Confidence 356889999 3 68999999888751 2 34778888754 334444 88875
No 214
>cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=40.31 E-value=63 Score=17.56 Aligned_cols=24 Identities=13% Similarity=0.317 Sum_probs=14.5
Q ss_pred hcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 21 KASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.......+.+..+ +++++|+++..
T Consensus 85 ~~~~~~~~~Vv~~--~~~~~Gvit~~ 108 (113)
T cd04607 85 RERSIRHLPILDE--EGRVVGLATLD 108 (113)
T ss_pred HHCCCCEEEEECC--CCCEEEEEEhH
Confidence 3334444545554 68999998753
No 215
>PRK15094 magnesium/cobalt efflux protein CorC; Provisional
Probab=40.10 E-value=85 Score=21.37 Aligned_cols=33 Identities=12% Similarity=0.206 Sum_probs=22.0
Q ss_pred HHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 12 HFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+.+.++.+.......+.|..+ .|.++|.+++.
T Consensus 150 ~l~~~L~~m~~~~~~~a~VvDe--~G~viGiVTle 182 (292)
T PRK15094 150 RVDRMLKEFRSQRYHMAIVIDE--FGGVSGLVTIE 182 (292)
T ss_pred cHHHHHHHHHhcCCEEEEEEeC--CCCEEEEeEHH
Confidence 3444555555556676767665 68899999864
No 216
>cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=40.03 E-value=67 Score=17.76 Aligned_cols=28 Identities=11% Similarity=0.253 Sum_probs=16.4
Q ss_pred HHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 17 FHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
++.+.......+.+.++ +|+++|.++..
T Consensus 92 ~~~~~~~~~~~~~Vv~~--~g~~~Gvit~~ 119 (124)
T cd04600 92 VPLLADGGHHHVPVVDE--DRRLVGIVTQT 119 (124)
T ss_pred HHHHHhcCCCceeEEcC--CCCEEEEEEhH
Confidence 33333334444555554 68999998753
No 217
>cd04197 eIF-2B_epsilon_N The N-terminal domain of epsilon subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of epsilon subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex.
Probab=40.00 E-value=32 Score=21.85 Aligned_cols=28 Identities=14% Similarity=0.140 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHM 103 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~ 103 (123)
|+-|++.+++.+.+.|+..|.+.++...
T Consensus 30 g~piI~~~l~~l~~~Gi~~I~iv~~~~~ 57 (217)
T cd04197 30 NVPLIDYTLEFLALNGVEEVFVFCCSHS 57 (217)
T ss_pred CEehHHHHHHHHHHCCCCeEEEEeCCCH
Confidence 3568888888888888888877665433
No 218
>TIGR00055 uppS undecaprenyl diphosphate synthase. Alternate name: undecaprenyl pyrophosphate synthetase. Activity has been demonstrated experimentally for members of this family from Micrococcus luteus, E. coli, Haemophilus influenzae, and Streptococcus pneumoniae.
Probab=39.26 E-value=41 Score=22.07 Aligned_cols=32 Identities=16% Similarity=0.156 Sum_probs=27.8
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 20 ~~~~GH~~G~~~~~~v~~~c~~~GI~~lT~ya 51 (226)
T TIGR00055 20 PRAYGHKAGVKSLRRILRWCANLGVECLTLYA 51 (226)
T ss_pred ChhHhHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 55678889999999999999999999888754
No 219
>PRK14837 undecaprenyl pyrophosphate synthase; Provisional
Probab=38.38 E-value=43 Score=22.06 Aligned_cols=32 Identities=19% Similarity=0.169 Sum_probs=27.6
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 27 ~~~~GH~~G~~~~~~i~~~c~~~GI~~lT~Ya 58 (230)
T PRK14837 27 SFFEGHKEGLKRAKEIVKHSLKLGIKYLSLYV 58 (230)
T ss_pred chhhhHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 55578888999999999999999999888754
No 220
>cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=38.24 E-value=68 Score=17.30 Aligned_cols=30 Identities=17% Similarity=0.259 Sum_probs=16.5
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.++.+...+...+.+..+ +++++|.++..
T Consensus 77 ~~~~~~~~~~~~~~~Vv~~--~~~~~G~it~~ 106 (111)
T cd04639 77 AVLRLMQQGGAPAVPVVDG--SGRLVGLVTLE 106 (111)
T ss_pred HHHHHHHhcCCceeeEEcC--CCCEEEEEEHH
Confidence 3333333334444445544 58999988753
No 221
>TIGR01177 conserved hypothetical protein TIGR01177. This family is found exclusively in the Archaea.
Probab=37.54 E-value=1.3e+02 Score=20.69 Aligned_cols=58 Identities=9% Similarity=0.021 Sum_probs=32.0
Q ss_pred EEEEEEecccccCC-ch--------HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEccce
Q psy6918 60 KIEEVVVDDTYRGK-EL--------GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNNF 118 (123)
Q Consensus 60 ~i~~~~v~p~~rg~-G~--------g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~~ 118 (123)
....+.++|.|-.. +. -..+++.+.+..+..|.-.+.+........+.++.|| ....+
T Consensus 248 ~~D~Iv~dPPyg~~~~~~~~~~~~l~~~~l~~~~r~Lk~gG~lv~~~~~~~~~~~~~~~~g~-i~~~~ 314 (329)
T TIGR01177 248 SVDAIATDPPYGRSTTAAGDGLESLYERSLEEFHEVLKSEGWIVYAVPTRIDLESLAEDAFR-VVKRF 314 (329)
T ss_pred CCCEEEECCCCcCcccccCCchHHHHHHHHHHHHHHccCCcEEEEEEcCCCCHHHHHhhcCc-chhee
Confidence 34567778877321 11 2344444444444445433444445566778999999 65443
No 222
>COG3146 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.11 E-value=69 Score=22.60 Aligned_cols=52 Identities=19% Similarity=0.247 Sum_probs=38.8
Q ss_pred EEecccccCCchHHHHHHHHHHHHHhcCCeEEEEc-cCCCchhHHHhcCcEEc
Q psy6918 64 VVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD-CADHMIPFYETFGFQKK 115 (123)
Q Consensus 64 ~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~-~~~~~~~~y~~~Gf~~~ 115 (123)
+.+.+..-..-+...|+..+.+.|.+.|+..+.+. ..++-...+++.||-..
T Consensus 121 lL~~~~~d~~~~~~~L~~~l~~~~~~~glSS~Hv~F~~~~~~~~l~~~Gfl~r 173 (387)
T COG3146 121 LLARDGEDEEEVRQALLAGLDELCEQSGLSSAHVTFVDEDEQPALEKAGFLHR 173 (387)
T ss_pred eecCccccHHHHHHHHHHHHHHHHHhcCCCceeEecCCHHHHHHHHhccchhh
Confidence 55566555556778899999999999999877665 35666778888888653
No 223
>cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an
Probab=36.99 E-value=71 Score=17.14 Aligned_cols=23 Identities=17% Similarity=0.208 Sum_probs=14.7
Q ss_pred cCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 22 ASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 22 ~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
..+...+.+..+ +++++|+++..
T Consensus 84 ~~~~~~~~V~~~--~~~~~G~it~~ 106 (111)
T cd04612 84 ERDIGRLPVVDD--SGRLVGIVSRS 106 (111)
T ss_pred hCCCCeeeEEcC--CCCEEEEEEHH
Confidence 334444555554 58999998753
No 224
>cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=36.63 E-value=73 Score=17.20 Aligned_cols=22 Identities=9% Similarity=0.211 Sum_probs=13.9
Q ss_pred CCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 23 SQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 23 ~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+...+.+..+ +++++|.++..
T Consensus 82 ~~~~~~~Vv~~--~~~~~Gvit~~ 103 (109)
T cd04606 82 YDLLALPVVDE--EGRLVGIITVD 103 (109)
T ss_pred cCCceeeeECC--CCcEEEEEEhH
Confidence 34344555554 68999998764
No 225
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=36.03 E-value=63 Score=18.73 Aligned_cols=24 Identities=29% Similarity=0.178 Sum_probs=19.1
Q ss_pred CchHHHHHHHHHHHHHhcCCeEEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
.++...+++.+.+.+++++.++|.
T Consensus 4 ~si~~~il~~v~~~a~~~~~~~V~ 27 (117)
T PRK00564 4 YSVVSSLIALCEEHAKKNQAHKIE 27 (117)
T ss_pred HHHHHHHHHHHHHHHHHcCCCeEE
Confidence 367889999999999888776553
No 226
>cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=36.03 E-value=75 Score=17.14 Aligned_cols=30 Identities=7% Similarity=-0.115 Sum_probs=17.7
Q ss_pred HHHHhhhcCCCeEEEEEEcCCC-CeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRT-KQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~-~~ivG~~~~~ 46 (123)
+.++.........+.+.++ + |+++|.++..
T Consensus 63 ~a~~~m~~~~~~~lpVvd~--~~~~~~giit~~ 93 (98)
T cd04618 63 DAALLLLKNKIHRLPVIDP--STGTGLYILTSR 93 (98)
T ss_pred HHHHHHHHCCCCEeeEEEC--CCCCceEEeehh
Confidence 3333333444455555655 4 8999998764
No 227
>cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase.
Probab=35.97 E-value=42 Score=21.26 Aligned_cols=32 Identities=13% Similarity=0.042 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
|+.|++.+++.+.+.|+..+.+.++.....+.
T Consensus 29 g~~li~~~l~~l~~~gi~~i~iv~~~~~~~i~ 60 (221)
T cd06422 29 GKPLIDHALDRLAAAGIRRIVVNTHHLADQIE 60 (221)
T ss_pred CEEHHHHHHHHHHHCCCCEEEEEccCCHHHHH
Confidence 46888888888888888888887655444333
No 228
>cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=35.87 E-value=79 Score=17.41 Aligned_cols=20 Identities=20% Similarity=0.285 Sum_probs=12.5
Q ss_pred CCeEEEEEEcCCCCeEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+...+.+..+ +++++|++..
T Consensus 97 ~~~~~~vv~~--~~~~~Gvit~ 116 (122)
T cd04637 97 SISCLPVVDE--NGQLIGIITW 116 (122)
T ss_pred CCCeEeEECC--CCCEEEEEEH
Confidence 3334444544 5889998875
No 229
>PF04015 DUF362: Domain of unknown function (DUF362) ; InterPro: IPR007160 This domain is found in some iron-sulphur proteins.
Probab=35.58 E-value=85 Score=19.83 Aligned_cols=44 Identities=9% Similarity=0.208 Sum_probs=32.3
Q ss_pred CCchHHHHHHHHHHHHHhcCCeEEEEccCC-----CchhHHHhcCcEEc
Q psy6918 72 GKELGKLLIAVLVKLAKHFQCYKLTLDCAD-----HMIPFYETFGFQKK 115 (123)
Q Consensus 72 g~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~-----~~~~~y~~~Gf~~~ 115 (123)
+.-.--++++++++.+++.|...+.+...+ .....++..||...
T Consensus 17 ~~~T~P~vv~avv~~l~~~g~~~i~i~e~~~~~~~~~~~~~~~~G~~~~ 65 (206)
T PF04015_consen 17 GATTHPEVVRAVVEMLKEAGAKEIIIAESPGSGAADTREVFKRSGYEEI 65 (206)
T ss_pred CccCCHHHHHHHHHHHHHcCCCceEEEeCCCcchHhHHHHHHHcchhhH
Confidence 334456799999999999999867665432 34568899998876
No 230
>cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi
Probab=35.45 E-value=74 Score=16.95 Aligned_cols=29 Identities=14% Similarity=0.188 Sum_probs=16.5
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+.++.....+...+.+..+ +++++|.++.
T Consensus 72 ~~~~~~~~~~~~~~~Vv~~--~~~~~Gvi~~ 100 (106)
T cd04582 72 IVLSRMFAHDMSWLPCVDE--DGRYVGEVTQ 100 (106)
T ss_pred HHHHHHHHCCCCeeeEECC--CCcEEEEEEH
Confidence 3344434444444555554 5889998864
No 231
>PF03465 eRF1_3: eRF1 domain 3; InterPro: IPR005142 This domain is found in the release factor eRF1 which terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase centre. The crystal structure of human eRF1 is known []. The overall shape and dimensions of eRF1 resemble a tRNA molecule with domains 1, 2, and 3 of eRF1 corresponding to the anticodon loop, aminoacyl acceptor stem, and T stem of a tRNA molecule, respectively. The position of the essential GGQ motif at an exposed tip of domain 2 suggests that the Gln residue coordinates a water molecule to mediate the hydrolytic activity at the peptidyl transferase centre. A conserved groove on domain 1, 80 A from the GGQ motif, is proposed to form the codon recognition site []. This domain is also found in other proteins which may also be involved in translation termination but this awaits experimental verification.; PDB: 3OBY_A 3E1Y_D 1DT9_A 2KTU_A 2KTV_A 3IR9_A 3E20_H 3OBW_A 3AGJ_F 3MCA_B ....
Probab=35.24 E-value=69 Score=18.36 Aligned_cols=53 Identities=19% Similarity=0.261 Sum_probs=32.7
Q ss_pred EEEEEEEecccccCCchHHH--------------HHHHHHHHHHhcCCeEEEEccC-CCchhHHHhcC
Q psy6918 59 GKIEEVVVDDTYRGKELGKL--------------LIAVLVKLAKHFQCYKLTLDCA-DHMIPFYETFG 111 (123)
Q Consensus 59 ~~i~~~~v~p~~rg~G~g~~--------------Ll~~~~~~~~~~~~~~i~~~~~-~~~~~~y~~~G 111 (123)
+.+..+.|..+....+.-.+ ++..+.+.+.+.|.+...+... +....|++.+|
T Consensus 36 GaV~~LlI~d~l~~~~~~~r~~~~~~~~~~~~~~~i~~l~~~a~~~g~~v~iis~~~e~G~~L~~~~g 103 (113)
T PF03465_consen 36 GAVETLLISDDLFRSRDVERCKCPECGGELEVVELIEELIELAEQSGAKVEIISSEHEEGEQLLKGFG 103 (113)
T ss_dssp T-EEEEEEEHHHHTESCHHHHHSTTTHSEEEEEEHHHHHHHHHHHTTSEEEEE-TTSHHHHHHHHCTT
T ss_pred CCCcEEEEecccccccceeccccccccchhhhHHHHHHHHHHHHHcCCEEEEEcCCCccHHHHHhcCC
Confidence 45666666666555443211 6899999999999876666543 33455666665
No 232
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=35.17 E-value=2.4e+02 Score=22.65 Aligned_cols=61 Identities=15% Similarity=0.107 Sum_probs=47.2
Q ss_pred eEEEEEEEeccc--ccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH---HhcCcEEccce
Q psy6918 58 KGKIEEVVVDDT--YRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY---ETFGFQKKNNF 118 (123)
Q Consensus 58 ~~~i~~~~v~p~--~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y---~~~Gf~~~~~~ 118 (123)
...+.++--+|+ .+|++....++..-+..+++.|+..|-+.-.+....|| -++|+-+..+.
T Consensus 299 pvf~kGvnrHe~~~~~G~~~~~~~~~~dl~lmk~~n~N~vRtsHyP~~~~~ydLcDelGllV~~Ea 364 (808)
T COG3250 299 PVFIRGVNRHEDDPILGRVTDEDAMERDLKLMKEANMNSVRTSHYPNSEEFYDLCDELGLLVIDEA 364 (808)
T ss_pred EEEEeeeecccCCCccccccCHHHHHHHHHHHHHcCCCEEEecCCCCCHHHHHHHHHhCcEEEEec
Confidence 455666666666 47888889999999999999999999988667776666 58888776443
No 233
>cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=35.15 E-value=85 Score=17.54 Aligned_cols=32 Identities=13% Similarity=0.095 Sum_probs=18.2
Q ss_pred CHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 9 NVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
+..+....+. ..+...+.+..+ +++++|+++.
T Consensus 11 ~~~~~~~~~~---~~~~~~~~Vv~~--~~~~~G~it~ 42 (128)
T cd04632 11 SVGKAINVLR---EHGISRLPVVDD--NGKLTGIVTR 42 (128)
T ss_pred CHHHHHHHHH---HcCCCEEEEECC--CCcEEEEEEH
Confidence 3444444443 334444555555 6899999884
No 234
>cd00475 CIS_IPPS Cis (Z)-Isoprenyl Diphosphate Synthases (cis-IPPS); homodimers which catalyze the successive 1'-4 condensation of the isopentenyl diphosphate (IPP) molecule to trans,trans-farnesyl diphosphate (FPP) or to cis,trans-FPP to form long-chain polyprenyl diphosphates. A few can also catalyze the condensation of IPP to trans-geranyl diphosphate to form the short-chain cis,trans- FPP. In prokaryotes, the cis-IPPS, undecaprenyl diphosphate synthase (UPP synthase) catalyzes the formation of the carrier lipid UPP in bacterial cell wall peptidooglycan biosynthesis. Similarly, in eukaryotes, the cis-IPPS, dehydrodolichyl diphosphate (dedol-PP) synthase catalyzes the formation of the polyisoprenoid glycosyl carrier lipid dolichyl monophosphate. cis-IPPS are mechanistically and structurally distinct from trans-IPPS, lacking the DDXXD motifs, yet requiring Mg2+ for activity.
Probab=35.07 E-value=51 Score=21.55 Aligned_cols=32 Identities=16% Similarity=0.247 Sum_probs=27.5
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|.+-+..++++|.+.|++.+.+-+
T Consensus 21 ~~~~GH~~G~~~~~~i~~~~~~~gI~~lTvya 52 (221)
T cd00475 21 DRIEGHKAGAEKLRDILRWCLELGVKEVTLYA 52 (221)
T ss_pred ChhHhHHHHHHHHHHHHHHHHHcCCCEEEEEe
Confidence 55568888999999999999999999888754
No 235
>KOG3285|consensus
Probab=34.65 E-value=86 Score=19.91 Aligned_cols=44 Identities=16% Similarity=0.127 Sum_probs=36.0
Q ss_pred CCchHHHHHHHHHHHHHhcCCeEEEEcc-CCCchhHHHhcCcEEc
Q psy6918 72 GKELGKLLIAVLVKLAKHFQCYKLTLDC-ADHMIPFYETFGFQKK 115 (123)
Q Consensus 72 g~G~g~~Ll~~~~~~~~~~~~~~i~~~~-~~~~~~~y~~~Gf~~~ 115 (123)
=+-+-+.++..+.+|..+..+.++.+.. +...-...+++.|..+
T Consensus 60 lk~f~~~ilsQ~~~WL~~~kiqklvlvI~~~~tgEvlErWqFnie 104 (203)
T KOG3285|consen 60 LKTFIRNILSQVHEWLSKGKIQKLVLVITSKHTGEVLERWQFNIE 104 (203)
T ss_pred HHHHHHHHHHHHHHHHHhCCcceEEEEEEecccccchhheeeeee
Confidence 3567788999999999999998888864 6666778899998875
No 236
>cd01027 TOPRIM_RNase_M5_like TOPRIM_ RNase M5_like: The topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain found in Ribonuclease M5: (RNase M5) and other small primase-like proteins from bacteria and archaea. RNase M5 catalyzes the maturation of 5S rRNA in low G+C Gram-positive bacteria. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in nucleotide polymerization by primases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function.
Probab=34.64 E-value=24 Score=18.92 Aligned_cols=25 Identities=20% Similarity=0.210 Sum_probs=20.2
Q ss_pred EEEEEecccccCCchHHHHHHHHHH
Q psy6918 61 IEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 61 i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
-.-+.++|+..|+.+..++.+.+..
T Consensus 47 ~VIiltD~D~aG~~i~~~~~~~l~~ 71 (81)
T cd01027 47 GVIILTDPDRKGEKIRKKLSEYLSG 71 (81)
T ss_pred EEEEEECCCHHHHHHHHHHHHHhcc
Confidence 3347889999999999998888754
No 237
>cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site.
Probab=34.42 E-value=79 Score=16.97 Aligned_cols=22 Identities=14% Similarity=0.072 Sum_probs=13.4
Q ss_pred CCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 23 SQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 23 ~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+...+.+..+ +++++|.+...
T Consensus 84 ~~~~~~~V~~~--~~~~~G~i~~~ 105 (110)
T cd04588 84 HNVGRLIVTDD--EGRPVGIITRT 105 (110)
T ss_pred cCCCEEEEECC--CCCEEEEEEhH
Confidence 34344455544 58999988753
No 238
>cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=34.27 E-value=81 Score=17.01 Aligned_cols=21 Identities=19% Similarity=0.121 Sum_probs=13.4
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+...+.+..+ +++++|.+...
T Consensus 23 ~~~~~~v~d~--~~~~~G~v~~~ 43 (112)
T cd04624 23 NVGSVVVVDP--DERPIGIVTER 43 (112)
T ss_pred CCCEEEEECC--CCCEEEEeeHH
Confidence 3334445544 68999998853
No 239
>PF03376 Adeno_E3B: Adenovirus E3B protein; InterPro: IPR005041 Adenoviruses are medium-sized, non-enveloped viruses containing double-stranded DNA. They can cause a variety of diseases including pneumonia, cystitis, conjunctivitis and diarrhoea, all of which can be fatal to patients who are immunocompromised []. These viruses have many mechanisms to evade the host immune response, including several proteins which are expressed as part of the early transcription unit 3 (E3) []. One of the regions of E3, known as the E3B region, encodes three proteins known as 10.4K, 14.5K and 14.7K. Two of these proteins, 10.4K and 14.5K, form the RID complex (receptor internalisation and degradation) which protects the infected cell from host-induced lysis by clearing the the TNF and Fas receptors from the cell surface []. Other receptors, such as the epidermal growth factor receptor, are also known to be cleared by RID []. This entry represents the E3B region 10.4K protein, also known as the RID alpha subunit.; GO: 0016020 membrane
Probab=34.15 E-value=20 Score=18.48 Aligned_cols=13 Identities=31% Similarity=0.685 Sum_probs=10.0
Q ss_pred cccccCCchHHHH
Q psy6918 67 DDTYRGKELGKLL 79 (123)
Q Consensus 67 ~p~~rg~G~g~~L 79 (123)
+|+||++.+++-|
T Consensus 53 hPqYrn~~iA~LL 65 (67)
T PF03376_consen 53 HPQYRNQQIAALL 65 (67)
T ss_pred CchhcCHHHHHHh
Confidence 7888888887644
No 240
>KOG1198|consensus
Probab=33.81 E-value=81 Score=22.11 Aligned_cols=39 Identities=18% Similarity=0.316 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
+...-.++++.|+..++..+...+..++.++-+++|-..
T Consensus 167 sggVG~~aiQlAk~~~~~~v~t~~s~e~~~l~k~lGAd~ 205 (347)
T KOG1198|consen 167 SGGVGTAAIQLAKHAGAIKVVTACSKEKLELVKKLGADE 205 (347)
T ss_pred CcHHHHHHHHHHHhcCCcEEEEEcccchHHHHHHcCCcE
Confidence 345555666677777866777888999999999999544
No 241
>cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d
Probab=33.53 E-value=86 Score=17.08 Aligned_cols=32 Identities=19% Similarity=0.197 Sum_probs=17.4
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+..+.......+.+..+...++++|+++..
T Consensus 79 ~~l~~~~~~~~~~~~Vvd~~~~~~~~Gvit~~ 110 (115)
T cd04593 79 HALDRMASRGLRQLPVVDRGNPGQVLGLLTRE 110 (115)
T ss_pred HHHHHHHHcCCceeeEEeCCCCCeEEEEEEhH
Confidence 33444444455455555541017899998753
No 242
>PF00925 GTP_cyclohydro2: GTP cyclohydrolase II; InterPro: IPR000926 GTP cyclohydrolase II catalyses the first committed step in the biosynthesis of riboflavin. The enzyme converts GTP and water to formate, 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)- pyrimidine and pyrophosphate, and requires magnesium as a cofactor. It is sometimes found as a bifunctional enzyme with 3,4-dihydroxy-2-butanone 4-phosphate synthase (DHBP_synthase) IPR000422 from INTERPRO. ; GO: 0003935 GTP cyclohydrolase II activity, 0009231 riboflavin biosynthetic process; PDB: 2BZ0_B 2BZ1_A.
Probab=33.40 E-value=38 Score=20.97 Aligned_cols=41 Identities=17% Similarity=0.284 Sum_probs=19.7
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEccC-CCchhHHHhcCcEEc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA-DHMIPFYETFGFQKK 115 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~-~~~~~~y~~~Gf~~~ 115 (123)
+++|.-|+|.++| +..|++++.+-++ +....-.+.+|-+..
T Consensus 123 ~d~R~ygigaqIL-------~dLGV~~~rLLtnnp~k~~~L~g~gleV~ 164 (169)
T PF00925_consen 123 EDLRDYGIGAQIL-------RDLGVKKMRLLTNNPRKYVALEGFGLEVV 164 (169)
T ss_dssp S----THHHHHHH-------HHTT--SEEEE-S-HHHHHHHHHTT--EE
T ss_pred cccccHHHHHHHH-------HHcCCCEEEECCCChhHHHHHhcCCCEEE
Confidence 3344444444443 4568888887765 665666777776654
No 243
>COG2384 Predicted SAM-dependent methyltransferase [General function prediction only]
Probab=33.32 E-value=1.4e+02 Score=19.74 Aligned_cols=42 Identities=19% Similarity=0.222 Sum_probs=32.1
Q ss_pred chHHHHHHHHHHHHHhc--CCeEEEEccCCCc---hhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHF--QCYKLTLDCADHM---IPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~--~~~~i~~~~~~~~---~~~y~~~Gf~~~ 115 (123)
|+|..|+..+++..++. +..++.+.-+.+. ..+..+++|...
T Consensus 93 GMGG~lI~~ILee~~~~l~~~~rlILQPn~~~~~LR~~L~~~~~~I~ 139 (226)
T COG2384 93 GMGGTLIREILEEGKEKLKGVERLILQPNIHTYELREWLSANSYEIK 139 (226)
T ss_pred CCcHHHHHHHHHHhhhhhcCcceEEECCCCCHHHHHHHHHhCCceee
Confidence 89999999999988764 6677888543322 458889999886
No 244
>PF06559 DCD: 2'-deoxycytidine 5'-triphosphate deaminase (DCD); InterPro: IPR010550 This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (3.5.4.13 from EC).; GO: 0008829 dCTP deaminase activity; PDB: 2R9Q_C.
Probab=33.23 E-value=20 Score=25.03 Aligned_cols=34 Identities=15% Similarity=0.252 Sum_probs=10.6
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCch
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKEL 75 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~ 75 (123)
.|++||-+.+..-.. .+ ..+..-.+..+||+||+
T Consensus 323 hGQ~vgrLvyE~m~~--~P----~~lYG~~~gSnYq~QgL 356 (364)
T PF06559_consen 323 HGQIVGRLVYERMAE--RP----ERLYGAGIGSNYQGQGL 356 (364)
T ss_dssp TT-EEEEEEEEEBSS----------TTSS-----------
T ss_pred CCcEEEEEEehhhcc--Cc----cccccccccccchhhhh
Confidence 499999988864221 11 12222346778898886
No 245
>COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism]
Probab=33.23 E-value=78 Score=21.22 Aligned_cols=24 Identities=13% Similarity=0.191 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|-++|..+++.+++.|+..|.+..
T Consensus 94 aleiM~KaI~LA~dLGIRtIQLAG 117 (287)
T COG3623 94 ALEIMEKAIQLAQDLGIRTIQLAG 117 (287)
T ss_pred HHHHHHHHHHHHHHhCceeEeecc
Confidence 568999999999999999998853
No 246
>PRK14831 undecaprenyl pyrophosphate synthase; Provisional
Probab=33.16 E-value=59 Score=21.69 Aligned_cols=32 Identities=19% Similarity=0.132 Sum_probs=27.4
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
+...|.-.|..-+..++++|.+.|++.+.+-+
T Consensus 41 ~~~~GH~~G~~~l~~i~~~c~~~GI~~vT~ya 72 (249)
T PRK14831 41 PRIMGHRRGVDALKDLLRCCKDWGIGALTAYA 72 (249)
T ss_pred chhhhHHHHHHHHHHHHHHHHHcCCCEEEEee
Confidence 44468888999999999999999999998754
No 247
>PF11633 SUD-M: Single-stranded poly(A) binding domain; InterPro: IPR024375 This domain identifies non-structural protein 3 (Nsp3). It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins [].; PDB: 2KQV_A 2W2G_A 2WCT_D 2JZE_A 2JZF_A 2RNK_A 2JZD_A.
Probab=33.14 E-value=41 Score=20.21 Aligned_cols=45 Identities=9% Similarity=-0.128 Sum_probs=26.0
Q ss_pred cCCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
.+.++-..=+..+++.+++.|.-.......+...++.++-|+...
T Consensus 16 ~ll~~v~~~~r~ml~~ak~~g~~~pvc~D~~A~~k~lkr~gv~~~ 60 (142)
T PF11633_consen 16 ELLGVVSWNFRAMLQHAKETGLLCPVCIDYPAFCKTLKRKGVDPK 60 (142)
T ss_dssp -GGCEE---CHHHHHHHHHHT-EEEEETT-HHHHHHHHHTTS---
T ss_pred ceeeeeehhHHHHHHHHHhcCcEEEEEeccHHHHHHHhccCcccc
Confidence 344455555677888889989765555556777888888777665
No 248
>KOG1764|consensus
Probab=33.05 E-value=81 Score=22.47 Aligned_cols=34 Identities=21% Similarity=0.232 Sum_probs=27.3
Q ss_pred HHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 11 FHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+.+.++.+....-+.++++++ +|.++|.+++.
T Consensus 319 ssL~~vi~~lv~~~vHRl~VVd~--~~~l~GvvSLs 352 (381)
T KOG1764|consen 319 STLAEVIDKLVAHRVHRLWVVDE--DGVLVGVISLS 352 (381)
T ss_pred chHHHHHHHHHhcCceEEEEEcC--CCcEEEEeeHH
Confidence 45666777777778888888887 79999999976
No 249
>cd04446 DEP_DEPDC4 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in DEPDC4-like proteins. DEPDC4 is a DEP domain containing protein of unknown function.
Probab=32.78 E-value=30 Score=19.39 Aligned_cols=23 Identities=17% Similarity=0.212 Sum_probs=17.5
Q ss_pred cccCCchHHHHHHHHHHHHHhcC
Q psy6918 69 TYRGKELGKLLIAVLVKLAKHFQ 91 (123)
Q Consensus 69 ~~rg~G~g~~Ll~~~~~~~~~~~ 91 (123)
.|...=+|+++++++.++..+..
T Consensus 23 ~y~~CF~GsEaVDwL~~~l~~n~ 45 (95)
T cd04446 23 SYHDCFLGSEAVDVVLAHLMQNK 45 (95)
T ss_pred ecccccchHHHHHHHHHHHhhcc
Confidence 35566689999999999876653
No 250
>PF14730 DUF4468: Domain of unknown function (DUF4468) with TBP-like fold
Probab=32.44 E-value=90 Score=16.99 Aligned_cols=42 Identities=17% Similarity=0.173 Sum_probs=24.6
Q ss_pred CCCCCHHHHHHHHHhhhcC---CCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 5 LPTSNVFHFAERFHRMKAS---QDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+|..+.+++......+... +..-.+...|.++|.|+|.....
T Consensus 7 vpg~sk~~ly~~~~~W~~~~~~~~~s~I~~~dke~g~I~~~g~~~ 51 (91)
T PF14730_consen 7 VPGMSKDQLYDRAKKWLAKNFKSANSVIQYSDKEEGTIIAKGEEW 51 (91)
T ss_pred cCCCCHHHHHHHHHHHHHHhcccccceEEEEcCCCCEEEEEEEEE
Confidence 4566777666665444322 22224445566689999988554
No 251
>PRK14841 undecaprenyl pyrophosphate synthase; Provisional
Probab=31.76 E-value=65 Score=21.29 Aligned_cols=32 Identities=19% Similarity=0.142 Sum_probs=27.4
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 24 ~~~~GH~~G~~~l~~i~~~~~~lgIk~lTvYa 55 (233)
T PRK14841 24 PRIKGHQRGAEVLHNTVKWSLELGIKYLTAFS 55 (233)
T ss_pred chhhhHHHHHHHHHHHHHHHHHcCCCEEEEEe
Confidence 45568888999999999999999999988754
No 252
>TIGR01099 galU UTP-glucose-1-phosphate uridylyltransferase. Built to distinquish between the highly similar genes galU and galF
Probab=31.63 E-value=56 Score=21.32 Aligned_cols=32 Identities=16% Similarity=0.006 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
|+.|++.+++.+.+.|+..+.+.++.....+.
T Consensus 30 g~pli~~~l~~l~~~gi~~v~iv~~~~~~~i~ 61 (260)
T TIGR01099 30 DKPLIQYVVEEAVEAGIEDILIVTGRGKRAIE 61 (260)
T ss_pred CEEHHHHHHHHHHhCCCCEEEEEeCCcHHHHH
Confidence 46899999999999999999888765554443
No 253
>PRK00393 ribA GTP cyclohydrolase II; Reviewed
Probab=31.60 E-value=98 Score=19.75 Aligned_cols=42 Identities=12% Similarity=0.071 Sum_probs=29.9
Q ss_pred cccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCC-chhHHHhcCcEEc
Q psy6918 67 DDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADH-MIPFYETFGFQKK 115 (123)
Q Consensus 67 ~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~-~~~~y~~~Gf~~~ 115 (123)
.+++|.-|+|.++|. ..|+.++.+-++++ ...-...+|.+.+
T Consensus 124 ~~d~R~yGiGAQIL~-------dLGV~~mrLLtn~~~k~~~L~g~GleV~ 166 (197)
T PRK00393 124 AADERDYTLAADMLK-------ALGVKKVRLLTNNPKKVEALTEAGINIV 166 (197)
T ss_pred CccceehhHHHHHHH-------HcCCCEEEECCCCHHHHHHHHhCCCEEE
Confidence 556899999998875 45888888877644 3445566676664
No 254
>cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali
Probab=31.33 E-value=87 Score=16.47 Aligned_cols=30 Identities=17% Similarity=0.170 Sum_probs=17.6
Q ss_pred HHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEE
Q psy6918 14 AERFHRMKASQDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
.+.++.+...+...+.+..+ +++++|.++.
T Consensus 78 ~~~~~~~~~~~~~~~~V~~~--~~~~~G~i~~ 107 (113)
T cd02205 78 EEAAELMLEHGIRRLPVVDD--EGRLVGIVTR 107 (113)
T ss_pred HHHHHHHHHcCCCEEEEEcC--CCcEEEEEEH
Confidence 33444444444455555555 5899998875
No 255
>PRK14833 undecaprenyl pyrophosphate synthase; Provisional
Probab=31.28 E-value=66 Score=21.23 Aligned_cols=32 Identities=16% Similarity=0.093 Sum_probs=27.1
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 25 ~~~~GH~~G~~~l~~~~~~c~~~gI~~lTvya 56 (233)
T PRK14833 25 ARAAGHKKGVKTLREITIWCANHKLECLTLYA 56 (233)
T ss_pred ChhhhHHHHHHHHHHHHHHHHHcCCCEEEEee
Confidence 44567888999999999999999999888754
No 256
>PF01255 Prenyltransf: Putative undecaprenyl diphosphate synthase; InterPro: IPR001441 Synonym(s): Di-trans-poly-cis-undecaprenyl-diphosphate synthase, Undecaprenyl pyrophosphate synthetase, Undecaprenyl pyrophosphate synthase, UPP synthetase Di-trans-poly-cis-decaprenylcistransferase (2.5.1.31 from EC) (UPP synthetase) generates undecaprenyl pyrophosphate (UPP) from isopentenyl pyrophosphate (IPP) []. This bacterial enzyme is also found in archaebacteria and in a number of uncharacterised proteins including some from yeasts. This entry also matches related enzymes that transfer alkyl groups, such as dehydrodolichyl diphosphate synthase.; GO: 0016765 transferase activity, transferring alkyl or aryl (other than methyl) groups; PDB: 2D2R_B 2DTN_B 1F75_B 1X07_A 2E9D_A 1JP3_A 3QAS_A 1X09_A 1V7U_B 2E9A_A ....
Probab=31.08 E-value=41 Score=21.90 Aligned_cols=31 Identities=16% Similarity=0.149 Sum_probs=25.0
Q ss_pred cccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 69 TYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 69 ~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
...|...|.+-+..++++|.+.|++.+.+-+
T Consensus 16 ~~~Gh~~G~~~l~~i~~~~~~~gI~~lTvYa 46 (223)
T PF01255_consen 16 RSEGHRAGAEKLKEIVEWCLELGIKYLTVYA 46 (223)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHCT-SEEEEEE
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 3456778899999999999999999988754
No 257
>PTZ00349 dehydrodolichyl diphosphate synthetase; Provisional
Probab=30.47 E-value=60 Score=22.61 Aligned_cols=32 Identities=9% Similarity=0.182 Sum_probs=27.7
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|+-.|.+.+..++++|.+.|++.+.+-+
T Consensus 40 ~~~~GH~~G~~~l~~il~~c~~lGIk~lTlYA 71 (322)
T PTZ00349 40 HSAIGHFMGSKALIQIIEICIKLKIKILSVFS 71 (322)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 45568889999999999999999999988754
No 258
>TIGR00505 ribA GTP cyclohydrolase II. Several members of the family are bifunctional, involving both ribA and ribB function. In these cases, ribA tends to be on the C-terminal end of the protein and ribB tends to be on the N-terminal. The function of archaeal members of the family has not been demonstrated and is assigned tentatively.
Probab=30.43 E-value=1e+02 Score=19.59 Aligned_cols=42 Identities=14% Similarity=0.099 Sum_probs=29.4
Q ss_pred cccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCC-chhHHHhcCcEEc
Q psy6918 67 DDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADH-MIPFYETFGFQKK 115 (123)
Q Consensus 67 ~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~-~~~~y~~~Gf~~~ 115 (123)
.+++|.-|+|.++|. ..|++++.+-++++ ...-.+.+|-+..
T Consensus 121 ~~d~R~yGiGAQIL~-------dLGV~~~rLLtn~~~k~~~L~g~gleVv 163 (191)
T TIGR00505 121 PADERDFSLCADILE-------DLGVKKVRLLTNNPKKIEILKKAGINIV 163 (191)
T ss_pred cccceehhHHHHHHH-------HcCCCEEEECCCCHHHHHHHHhCCCEEE
Confidence 445899999998875 45788888877644 4555566666554
No 259
>COG3113 Predicted NTP binding protein (contains STAS domain) [General function prediction only]
Probab=30.34 E-value=1.1e+02 Score=17.35 Aligned_cols=36 Identities=25% Similarity=0.358 Sum_probs=23.8
Q ss_pred hHHHHHHHHHHHHHhcCCeEEEEcc-CCCchhHHHhcC
Q psy6918 75 LGKLLIAVLVKLAKHFQCYKLTLDC-ADHMIPFYETFG 111 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~~~~~i~~~~-~~~~~~~y~~~G 111 (123)
-|-+|+..+++.|+..|. .+.+.. ++....+.+-+|
T Consensus 55 aglALL~~~~~~~k~~g~-~~~L~~~p~~L~tLa~Ly~ 91 (99)
T COG3113 55 AGLALLLHLIRLAKKQGN-AVTLTGVPEQLRTLAELYN 91 (99)
T ss_pred HHHHHHHHHHHHHHHcCC-eeEEecCcHHHHHHHHHhC
Confidence 477899999999999987 455544 333344444333
No 260
>PRK07758 hypothetical protein; Provisional
Probab=30.30 E-value=67 Score=18.04 Aligned_cols=22 Identities=14% Similarity=0.064 Sum_probs=18.7
Q ss_pred cCCchHHHHHHHHHHHHHhcCC
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
.=+|+|.+.++.+.+...+.|+
T Consensus 71 ~iknlGkKSL~EIkekL~E~GL 92 (95)
T PRK07758 71 KLHGMGPASLPKLRKALEESGL 92 (95)
T ss_pred HccCCCHHHHHHHHHHHHHcCC
Confidence 4679999999999999888775
No 261
>COG1366 SpoIIAA Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms]
Probab=30.16 E-value=1.1e+02 Score=17.32 Aligned_cols=40 Identities=18% Similarity=0.136 Sum_probs=30.3
Q ss_pred hHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 75 LGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 75 ~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
.|-..+-.+.+.++..|...+....++.....+...|+..
T Consensus 60 ~gl~~L~~~~~~~~~~g~~~~l~~i~p~v~~~~~~~gl~~ 99 (117)
T COG1366 60 AGLGVLVALLKSARLRGVELVLVGIQPEVARTLELTGLDK 99 (117)
T ss_pred HHHHHHHHHHHHHHhcCCeEEEEeCCHHHHHHHHHhCchh
Confidence 4556677777788888887777777888888888888764
No 262
>cd04181 NTP_transferase NTP_transferases catalyze the transfer of nucleotides onto phosphosugars. Nucleotidyltransferases transfer nucleotides onto phosphosugars. The enzyme family includes Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. The products are activated sugars that are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides.
Probab=30.11 E-value=73 Score=19.85 Aligned_cols=29 Identities=10% Similarity=0.035 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCch
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMI 104 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~ 104 (123)
|+.|++.+++.+...++..+.+.++....
T Consensus 28 g~pli~~~l~~l~~~g~~~i~vv~~~~~~ 56 (217)
T cd04181 28 GKPILEYIIERLARAGIDEIILVVGYLGE 56 (217)
T ss_pred CeeHHHHHHHHHHHCCCCEEEEEeccCHH
Confidence 46788888888888888888887765433
No 263
>PRK14842 undecaprenyl pyrophosphate synthase; Provisional
Probab=29.94 E-value=73 Score=21.18 Aligned_cols=32 Identities=16% Similarity=0.199 Sum_probs=27.5
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|.+-+..++++|.+.|++.+.+-+
T Consensus 29 ~~~~GH~~G~~~l~~i~~~c~~lgI~~vTvYa 60 (241)
T PRK14842 29 KRSEGHREGANAIDRLMDASLEYGLKNISLYA 60 (241)
T ss_pred ChhHhHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 45568888999999999999999999988754
No 264
>PRK04017 hypothetical protein; Provisional
Probab=29.60 E-value=50 Score=19.76 Aligned_cols=25 Identities=28% Similarity=0.244 Sum_probs=19.5
Q ss_pred EEEEEecccccCCchHHHHHHHHHH
Q psy6918 61 IEEVVVDDTYRGKELGKLLIAVLVK 85 (123)
Q Consensus 61 i~~~~v~p~~rg~G~g~~Ll~~~~~ 85 (123)
-.-+.++|++.|.-+.+++.+.+..
T Consensus 67 ~VIILTD~D~~GekIr~~l~~~l~~ 91 (132)
T PRK04017 67 EVIILTDFDRKGEELAKKLSEYLQG 91 (132)
T ss_pred eEEEEECCCcchHHHHHHHHHHHHh
Confidence 3347899999998888888777664
No 265
>cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=29.52 E-value=1e+02 Score=16.62 Aligned_cols=20 Identities=30% Similarity=0.431 Sum_probs=12.3
Q ss_pred CCeEEEEEEcCCCCeEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~ 45 (123)
....+.+..+ +++++|.+..
T Consensus 88 ~~~~~~V~~~--~~~~~G~it~ 107 (113)
T cd04622 88 QVRRLPVVDD--DGRLVGIVSL 107 (113)
T ss_pred CCCeeeEECC--CCcEEEEEEH
Confidence 3333444544 5889998875
No 266
>PRK14832 undecaprenyl pyrophosphate synthase; Provisional
Probab=29.45 E-value=59 Score=21.78 Aligned_cols=32 Identities=19% Similarity=0.211 Sum_probs=27.1
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 39 ~~~~GH~~G~~~l~~i~~~c~~~gI~~lTvya 70 (253)
T PRK14832 39 PRIAGHRQGARTLKELLRCCKDWGIKALTAYA 70 (253)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 44567788999999999999999999888754
No 267
>COG2185 Sbm Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) [Lipid metabolism]
Probab=28.79 E-value=1.4e+02 Score=18.16 Aligned_cols=42 Identities=10% Similarity=-0.001 Sum_probs=30.8
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEcc----CCCchhHHHhcCcEEc
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDC----ADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~----~~~~~~~y~~~Gf~~~ 115 (123)
|-...++..+.+.+++.|...+.+.+ .+.....+++.|+-..
T Consensus 74 g~h~~l~~~lve~lre~G~~~i~v~~GGvip~~d~~~l~~~G~~~i 119 (143)
T COG2185 74 GGHLTLVPGLVEALREAGVEDILVVVGGVIPPGDYQELKEMGVDRI 119 (143)
T ss_pred chHHHHHHHHHHHHHHhCCcceEEeecCccCchhHHHHHHhCccee
Confidence 44578899999999999998888433 3444667777887653
No 268
>PRK10240 undecaprenyl pyrophosphate synthase; Provisional
Probab=28.74 E-value=61 Score=21.34 Aligned_cols=32 Identities=22% Similarity=0.083 Sum_probs=26.8
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.-.|..-+..++++|.+.|++.+.+-+
T Consensus 14 ~~~~GH~~G~~~l~~i~~~c~~~GI~~lT~ya 45 (229)
T PRK10240 14 IRAFGHKAGAKSVRRAVSFAANNGIEALTLYA 45 (229)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEe
Confidence 44567778999999999999999999888754
No 269
>PRK14019 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein; Provisional
Probab=28.73 E-value=83 Score=22.37 Aligned_cols=29 Identities=17% Similarity=0.287 Sum_probs=22.2
Q ss_pred HHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 87 AKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 87 ~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
.+..|+.+|.+-+|+....-.+.+|.+..
T Consensus 331 L~~Lgv~~irLlTnp~K~~~L~~~Gi~V~ 359 (367)
T PRK14019 331 LRDLGVGKMRLLSSPRKFPSMSGFGLEVT 359 (367)
T ss_pred HHHcCCCeEEECCCcHHHHhhhhCCcEEE
Confidence 35668999988788776667788888775
No 270
>cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually
Probab=28.56 E-value=1.3e+02 Score=17.51 Aligned_cols=34 Identities=12% Similarity=0.062 Sum_probs=19.1
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+..+..+.+. ..+...+.+..+ +++++|.++..
T Consensus 10 ~~l~ea~~~m~---~~~~~~~~VvD~--~g~l~Givt~~ 43 (133)
T cd04592 10 TTLKEALNLML---DEKQSCVLVVDS--DDFLEGILTLG 43 (133)
T ss_pred CCHHHHHHHHH---HcCCCEEEEECC--CCeEEEEEEHH
Confidence 34444444443 223334555554 68999999854
No 271
>PRK14829 undecaprenyl pyrophosphate synthase; Provisional
Probab=28.55 E-value=69 Score=21.29 Aligned_cols=32 Identities=19% Similarity=0.103 Sum_probs=27.3
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 35 ~~~~GH~~G~~~l~~iv~~c~~~gI~~vTvYa 66 (243)
T PRK14829 35 KRTEGHKAGEPVLFDVVAGAIEAGVPYLSLYT 66 (243)
T ss_pred ChhHHHHHHHHHHHHHHHHHHHcCCCEEEEee
Confidence 44567788999999999999999999988754
No 272
>PRK14840 undecaprenyl pyrophosphate synthase; Provisional
Probab=28.42 E-value=68 Score=21.46 Aligned_cols=32 Identities=25% Similarity=0.223 Sum_probs=27.0
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 43 ~~~~GH~~G~~~l~~v~~~c~~~GIk~lTvYa 74 (250)
T PRK14840 43 RAISGHYYGAKSLPQIVDTALHLGIEVLTLFA 74 (250)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 44567778999999999999999999988754
No 273
>cd02541 UGPase_prokaryotic Prokaryotic UGPase catalyses the synthesis of UDP-glucose. Prokaryotic UDP-Glucose Pyrophosphorylase (UGPase) catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans. UGPase is found in both prokaryotes and eukaryotes, although prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity.
Probab=28.41 E-value=63 Score=21.22 Aligned_cols=29 Identities=14% Similarity=0.003 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCch
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMI 104 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~ 104 (123)
|+.|++..++.+.+.|+..+.+.++....
T Consensus 30 gkpli~~~l~~l~~~gi~~i~iv~~~~~~ 58 (267)
T cd02541 30 DKPVIQYIVEEAVAAGIEDIIIVTGRGKR 58 (267)
T ss_pred CEEHHHHHHHHHHHCCCCEEEEEeCCchH
Confidence 46899999999999999998887765443
No 274
>cd04189 G1P_TT_long G1P_TT_long represents the long form of glucose-1-phosphate thymidylyltransferase. This family is the long form of Glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced.The long from enzymes also have a left-handed parallel helix domain at the c-terminus, whereas, th eshort form enzymes do not have this domain. The homotetrameric, feedback inhibited short form is found in
Probab=28.18 E-value=1.1e+02 Score=19.56 Aligned_cols=30 Identities=10% Similarity=0.022 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
|+.|++.+++.+.+.++..+.+.++.....
T Consensus 30 g~~li~~~l~~l~~~~~~~i~vv~~~~~~~ 59 (236)
T cd04189 30 GKPIIQYAIEDLREAGIEDIGIVVGPTGEE 59 (236)
T ss_pred CcchHHHHHHHHHHCCCCEEEEEcCCCHHH
Confidence 578888888888888888887766554433
No 275
>PF02743 Cache_1: Cache domain; InterPro: IPR004010 Cache is an extracellular domain that is predicted to have a role in small-molecule recognition in a wide range of proteins, including the animal dihydropyridine-sensitive voltage-gated Ca2+ channel; alpha-2delta subunit, and various bacterial chemotaxis receptors. The name Cache comes from CAlcium channels and CHEmotaxis receptors. This domain consists of an N-terminal part with three predicted strands and an alpha-helix, and a C-terminal part with a strand dyad followed by a relatively unstructured region. The N-terminal portion of the (unpermuted) Cache domain contains three predicted strands that could form a sheet analogous to that present in the core of the PAS domain structure. Cache domains are particularly widespread in bacteria, with Vibrio cholerae. The animal calcium channel alpha-2delta subunits might have acquired a part of their extracellular domains from a bacterial source []. The Cache domain appears to have arisen from the GAF-PAS fold despite their divergent functions [].; GO: 0016020 membrane; PDB: 3C8C_A 3LIB_D 3LIA_A 3LI8_A 3LI9_A.
Probab=28.16 E-value=97 Score=16.01 Aligned_cols=33 Identities=9% Similarity=0.201 Sum_probs=21.7
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEE
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGT 42 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~ 42 (123)
.+.+.+.+.+........-+++++.+ +|.++..
T Consensus 36 i~l~~l~~~i~~~~~~~~g~~~ivd~--~G~ii~h 68 (81)
T PF02743_consen 36 ISLDQLSEIISNIKFGNNGYAFIVDK--NGTIIAH 68 (81)
T ss_dssp EEHHHHHHHHTTSBBTTTBEEEEEET--TSBBCE-
T ss_pred eccceeeeEEEeeEECCCEEEEEEEC--CCCEEEe
Confidence 34566777777666666667777777 6877653
No 276
>PRK10122 GalU regulator GalF; Provisional
Probab=27.99 E-value=64 Score=21.92 Aligned_cols=31 Identities=13% Similarity=0.004 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhH
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPF 106 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~ 106 (123)
|+-+++.+++.+.+.|++.+.+.++.....+
T Consensus 33 gkpiI~~~l~~l~~~Gi~~i~iv~~~~~~~i 63 (297)
T PRK10122 33 DKPMIQYIVDEIVAAGIKEIVLVTHASKNAV 63 (297)
T ss_pred CEEHHHHHHHHHHHCCCCEEEEEcCCChHHH
Confidence 4789999999999999999988876544433
No 277
>cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=27.99 E-value=1.3e+02 Score=17.35 Aligned_cols=33 Identities=15% Similarity=0.269 Sum_probs=18.5
Q ss_pred CHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 9 NVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+..+....+. ..+...+.+..+ +++++|.++..
T Consensus 11 ~~~~a~~~~~---~~~~~~l~V~d~--~~~~~Giv~~~ 43 (135)
T cd04621 11 SLLHVVDEME---KNGVGRVIVVDD--NGKPVGVITYR 43 (135)
T ss_pred cHHHHHHHHH---HcCCCcceEECC--CCCEEEEEeHH
Confidence 3444444443 223333444544 69999999854
No 278
>TIGR00151 ispF 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase. Members of this protein family are 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase, the IspF protein of the deoxyxylulose (non-mevalonate) pathway of IPP biosynthesis. This protein occurs as an IspDF bifunctional fusion protein in about 20 percent of bacterial genomes.
Probab=27.88 E-value=1.5e+02 Score=18.32 Aligned_cols=32 Identities=22% Similarity=0.325 Sum_probs=24.8
Q ss_pred ecccccCCchHHHHHHHHHHHHHhcCCeEEEEc
Q psy6918 66 VDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLD 98 (123)
Q Consensus 66 v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~ 98 (123)
.+|+|.|.- +..+|+.+.+.+++.|+.-..+.
T Consensus 63 td~~~k~~~-S~~lL~~~~~~~~~~g~~i~niD 94 (155)
T TIGR00151 63 TDPRWKGAD-SRVLLRHAVALIKEKGYRIGNVD 94 (155)
T ss_pred CChhhCCCC-HHHHHHHHHHHHHHcCCEEEEEE
Confidence 366677655 78999999999999988665554
No 279
>PRK14834 undecaprenyl pyrophosphate synthase; Provisional
Probab=27.78 E-value=91 Score=20.85 Aligned_cols=32 Identities=22% Similarity=0.132 Sum_probs=26.5
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 35 ~~~~GH~~G~~~l~~i~~~c~~lgI~~lTvYa 66 (249)
T PRK14834 35 PRAAGHRAGVEALRRVVRAAGELGIGYLTLFA 66 (249)
T ss_pred chhhhHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 34457778999999999999999999887754
No 280
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=27.67 E-value=2.3e+02 Score=20.30 Aligned_cols=58 Identities=21% Similarity=0.100 Sum_probs=38.9
Q ss_pred eEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEcc----CCC---chhHHH-hcCcEEc
Q psy6918 58 KGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC----ADH---MIPFYE-TFGFQKK 115 (123)
Q Consensus 58 ~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~----~~~---~~~~y~-~~Gf~~~ 115 (123)
.+....-+.+++++.-+-...|.-.++++|.++|+..-.+.. ..+ ...+|. |.||-..
T Consensus 316 ~~~yly~gs~~~~~~~~~~~~l~~~~i~~a~~~G~~~ydf~Gi~~~~~~~~~~~Gl~~FK~~F~g~ 381 (406)
T PF02388_consen 316 EAYYLYGGSDEEYRKFYAPYLLQWEAIKYAKEKGIKRYDFGGISGDFDGSDPDYGLYKFKKGFGGQ 381 (406)
T ss_dssp EEEEEEEEE-CGCGGCTHHHHHHHHHHHHHHHTT-SEEEEEE-SSSSTTTHTTHHHHHHHHCCT-C
T ss_pred EEEEEECccchhhHhcCcchHHHHHHHHHHHHCCCCEEEeeCCCCCCCCCcccchHHHHhhcCCCc
Confidence 344445578999999998999999999999999998877632 122 235664 6666443
No 281
>TIGR00083 ribF riboflavin kinase/FMN adenylyltransferase. multifunctional enzyme: riboflavin kinase (EC 2.7.1.26) (flavokinase) / FMN adenylyltransferase (EC 2.7.7.2) (FAD pyrophosphorylase) (FAD synthetase).
Probab=27.61 E-value=1.6e+02 Score=20.12 Aligned_cols=31 Identities=23% Similarity=0.244 Sum_probs=24.4
Q ss_pred cccCCchH-HHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 69 TYRGKELG-KLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 69 ~~rg~G~g-~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
.|=|-.+| .+|++.+.+.|++.+...+.++-
T Consensus 6 ~FDGvH~GHq~Li~~~~~~a~~~~~~~~V~tF 37 (288)
T TIGR00083 6 YFDGLHLGHQALLQELKQIAEEKGLPPAVLLF 37 (288)
T ss_pred eCCccCHHHHHHHHHHHHHHHHhCCCEEEEEe
Confidence 35566777 78999999999999987777654
No 282
>PRK14830 undecaprenyl pyrophosphate synthase; Provisional
Probab=27.42 E-value=87 Score=20.95 Aligned_cols=29 Identities=24% Similarity=0.238 Sum_probs=24.8
Q ss_pred cCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
.|.--|..-+..++++|.+.|++.+.+-+
T Consensus 46 ~Gh~~G~~~l~~~l~~c~~~GI~~vTvYa 74 (251)
T PRK14830 46 AGHKAGMDTVKKITKAASELGVKVLTLYA 74 (251)
T ss_pred hhHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 36678889999999999999999887754
No 283
>PF14696 Glyoxalase_5: Hydroxyphenylpyruvate dioxygenase, HPPD, N-terminal ; PDB: 1CJX_A 2R5V_A.
Probab=27.41 E-value=59 Score=19.59 Aligned_cols=28 Identities=11% Similarity=0.202 Sum_probs=17.2
Q ss_pred cCCeEEEEccC--CCchhHHHhcCcEEccc
Q psy6918 90 FQCYKLTLDCA--DHMIPFYETFGFQKKNN 117 (123)
Q Consensus 90 ~~~~~i~~~~~--~~~~~~y~~~Gf~~~~~ 117 (123)
.|+..|...++ .....+++++||+..+.
T Consensus 8 ~G~dFvEFa~~~~~~l~~~~~~lGF~~~a~ 37 (139)
T PF14696_consen 8 DGFDFVEFAVPDAQALAQLFTALGFQPVAR 37 (139)
T ss_dssp EEEEEEEEE-SSTTSCHHHHCCCCEEEECC
T ss_pred CCeEEEEEecCCHHHHHHHHHHhCcceEEe
Confidence 35555655443 34456888999998754
No 284
>COG2994 HlyC ACP:hemolysin acyltransferase (hemolysin-activating protein) [Posttranslational modification, protein turnover, chaperones]
Probab=27.17 E-value=99 Score=18.83 Aligned_cols=13 Identities=8% Similarity=-0.031 Sum_probs=10.8
Q ss_pred CCeEEEEEEEEEe
Q psy6918 36 TKQVIGTGSLILE 48 (123)
Q Consensus 36 ~~~ivG~~~~~~~ 48 (123)
+|++||+++-...
T Consensus 61 ~g~Piaf~~WA~v 73 (148)
T COG2994 61 HGRPIAFCTWAFV 73 (148)
T ss_pred CCCeeEEEEEeec
Confidence 7999999997643
No 285
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.00 E-value=64 Score=19.23 Aligned_cols=24 Identities=13% Similarity=0.210 Sum_probs=19.8
Q ss_pred CchHHHHHHHHHHHHHhcCCeEEE
Q psy6918 73 KELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 73 ~G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
.+++..+++.+.+.++++|.++|.
T Consensus 4 ~si~~~i~~~v~~~A~~~g~~~V~ 27 (135)
T PRK03824 4 WALAEAIVRTVIDYAQKEGASKVK 27 (135)
T ss_pred HHHHHHHHHHHHHHHHHcCCchhH
Confidence 367889999999999998887664
No 286
>cd02508 ADP_Glucose_PP ADP-glucose pyrophosphorylase is involved in the biosynthesis of glycogen or starch. ADP-glucose pyrophosphorylase (glucose-1-phosphate adenylyltransferase) catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. ADP-glucose pyrophosphorylase is a tetrameric allosterically regulated enzyme. While a homotetramer in bacteria, in plant chloroplasts and amyloplasts, it is a heterotetramer of two different, yet evolutionary related, subunits. There are a number of conserved regions in the sequence of bacterial and plant ADP-glucose pyrophosphorylase subunits. It is a subfamily of a very diverse glycosy transferase family 2.
Probab=26.94 E-value=66 Score=20.09 Aligned_cols=32 Identities=6% Similarity=-0.098 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHhcCCeEEEEccCCCchhHHHh
Q psy6918 78 LLIAVLVKLAKHFQCYKLTLDCADHMIPFYET 109 (123)
Q Consensus 78 ~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~ 109 (123)
-+++.+++.+.+.|++.+.+.++.....+.+.
T Consensus 31 pli~~~l~~l~~~gi~~iivv~~~~~~~i~~~ 62 (200)
T cd02508 31 RLIDFPLSNMVNSGIRNVGVLTQYKSRSLNDH 62 (200)
T ss_pred eeHHHHHHHHHHCCCCEEEEEeCCChHHHHHH
Confidence 58888999988899999988887665444443
No 287
>PRK14839 undecaprenyl pyrophosphate synthase; Provisional
Probab=26.88 E-value=76 Score=21.10 Aligned_cols=32 Identities=25% Similarity=0.198 Sum_probs=27.1
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.-.|..-+..++++|.+.|++.+.+-+
T Consensus 30 ~~~~GH~~G~~~l~~i~~~c~~~GI~~lTvYa 61 (239)
T PRK14839 30 PRLAGHRAGVEAIRRVVEAAPDLGIGTLTLYA 61 (239)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 44567888999999999999999999888754
No 288
>cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=26.83 E-value=1.2e+02 Score=16.67 Aligned_cols=29 Identities=21% Similarity=0.186 Sum_probs=17.0
Q ss_pred HHHhhhcCCCeEEEEEEcCCC---CeEEEEEEEE
Q psy6918 16 RFHRMKASQDYLVTVIEDTRT---KQVIGTGSLI 46 (123)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~---~~ivG~~~~~ 46 (123)
.+..........+.+..+ + ++++|.++..
T Consensus 82 ~~~~~~~~~~~~lpVvd~--~~~~~~l~Gvit~~ 113 (118)
T cd04617 82 AAKKLIEHQVDSLPVVEK--VDEGLEVIGRITKT 113 (118)
T ss_pred HHHHHHHcCCCEeeEEeC--CCccceEEEEEEhh
Confidence 334334445444555655 3 6899998764
No 289
>cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten
Probab=26.66 E-value=1.3e+02 Score=16.96 Aligned_cols=21 Identities=14% Similarity=0.246 Sum_probs=13.6
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
....+.+..+ +++++|.++..
T Consensus 24 ~~~~~~Vvd~--~~~~~Gii~~~ 44 (124)
T cd04608 24 GFDQLPVVDE--SGKILGMVTLG 44 (124)
T ss_pred CCCEEEEEcC--CCCEEEEEEHH
Confidence 3334545544 68999998864
No 290
>TIGR00815 sulP high affinity sulphate transporter 1. (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
Probab=26.25 E-value=1.2e+02 Score=22.81 Aligned_cols=40 Identities=15% Similarity=-0.029 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-+.+..+.+.+++.|.+-.....++...+.+++.|+...
T Consensus 511 g~~~L~~l~~~l~~~g~~l~l~~~~~~v~~~l~~~gl~~~ 550 (563)
T TIGR00815 511 GIHALEELRKELKARGIQLLLANPNKAVRSTLKRGGLVEL 550 (563)
T ss_pred HHHHHHHHHHHHHHcCCEEEEecCChHHHHHHHHCCchhh
Confidence 4566677777778888877777778888999999998765
No 291
>cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=26.15 E-value=1.2e+02 Score=16.27 Aligned_cols=34 Identities=21% Similarity=0.140 Sum_probs=19.0
Q ss_pred CCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 8 SNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+..+....+. ..+...+.+..+ +++++|.+...
T Consensus 10 ~~~~~~~~~~~---~~~~~~~~V~~~--~~~~~Giv~~~ 43 (113)
T cd04623 10 ATVAEAAKLMA---EKNIGAVVVVDD--GGRLVGIFSER 43 (113)
T ss_pred CcHHHHHHHHH---HcCCCeEEEECC--CCCEEEEEehH
Confidence 34444444443 223334444544 58999999864
No 292
>KOG0122|consensus
Probab=26.05 E-value=2.1e+02 Score=19.29 Aligned_cols=40 Identities=18% Similarity=0.091 Sum_probs=27.7
Q ss_pred CCCHHHHHHHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEEEe
Q psy6918 7 TSNVFHFAERFHRMKASQDYLVTVIEDTRTKQVIGTGSLILE 48 (123)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~ 48 (123)
+.+.+.+++.+..+ .+-..++++.|.++|..-||+.+.+.
T Consensus 200 d~~E~dL~eLf~~f--g~i~rvylardK~TG~~kGFAFVtF~ 239 (270)
T KOG0122|consen 200 DMREDDLEELFRPF--GPITRVYLARDKETGLSKGFAFVTFE 239 (270)
T ss_pred ccChhHHHHHhhcc--CccceeEEEEccccCcccceEEEEEe
Confidence 44556666666544 34455777888789999999888754
No 293
>COG0807 RibA GTP cyclohydrolase II [Coenzyme metabolism]
Probab=25.91 E-value=84 Score=20.16 Aligned_cols=41 Identities=20% Similarity=0.214 Sum_probs=24.5
Q ss_pred ecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC-CCchhHHHhcCcE
Q psy6918 66 VDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA-DHMIPFYETFGFQ 113 (123)
Q Consensus 66 v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~-~~~~~~y~~~Gf~ 113 (123)
-.+++|.-|+|.+++. ..|++.+.+-+| |..+.-.+..|-.
T Consensus 122 ~~~D~R~ygigAqIL~-------dLGI~~irLLtnnp~K~~~l~~~Gi~ 163 (193)
T COG0807 122 FPADERDYGIGAQILK-------DLGIKKIRLLTNNPRKIYGLEGFGIN 163 (193)
T ss_pred CCchHHHHHHHHHHHH-------HcCCcEEEEecCChHHHHHHHhCCce
Confidence 3455666666666654 457888888774 3344444555533
No 294
>COG1041 Predicted DNA modification methylase [DNA replication, recombination, and repair]
Probab=25.85 E-value=2.5e+02 Score=19.97 Aligned_cols=61 Identities=16% Similarity=0.231 Sum_probs=39.3
Q ss_pred EEEEEEecccccC----CchH-HHHHHHHHHHHHhc--CCeEEEEccCCCchhHHHhcCcEEccceeE
Q psy6918 60 KIEEVVVDDTYRG----KELG-KLLIAVLVKLAKHF--QCYKLTLDCADHMIPFYETFGFQKKNNFMQ 120 (123)
Q Consensus 60 ~i~~~~v~p~~rg----~G~g-~~Ll~~~~~~~~~~--~~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~ 120 (123)
.+..++++|.|-+ +|.+ .+|+..+++.+.+. +-..+.+.+......-++..||...+.+..
T Consensus 264 ~vdaIatDPPYGrst~~~~~~l~~Ly~~~le~~~evLk~gG~~vf~~p~~~~~~~~~~~f~v~~~~~~ 331 (347)
T COG1041 264 SVDAIATDPPYGRSTKIKGEGLDELYEEALESASEVLKPGGRIVFAAPRDPRHELEELGFKVLGRFTM 331 (347)
T ss_pred ccceEEecCCCCcccccccccHHHHHHHHHHHHHHHhhcCcEEEEecCCcchhhHhhcCceEEEEEEE
Confidence 5888999999842 2222 45666666665442 222355555666777889999998855543
No 295
>COG5270 PUA domain (predicted RNA-binding domain) [Translation, ribosomal structure and biogenesis]
Probab=25.83 E-value=86 Score=20.07 Aligned_cols=18 Identities=28% Similarity=0.475 Sum_probs=12.0
Q ss_pred EEEEEcCCCCeEEEEEEEEE
Q psy6918 28 VTVIEDTRTKQVIGTGSLIL 47 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~ 47 (123)
++++.. +|+++|+..+..
T Consensus 165 viVv~~--ng~~vGVg~a~~ 182 (202)
T COG5270 165 VIVVSE--NGRVVGVGIAKK 182 (202)
T ss_pred EEEEec--CCEEEEEEEEec
Confidence 344444 689999887763
No 296
>PRK10310 PTS system galactitol-specific transporter subunit IIB; Provisional
Probab=25.77 E-value=1.2e+02 Score=16.61 Aligned_cols=23 Identities=9% Similarity=0.109 Sum_probs=17.3
Q ss_pred cCCchHHHHH--HHHHHHHHhcCCe
Q psy6918 71 RGKELGKLLI--AVLVKLAKHFQCY 93 (123)
Q Consensus 71 rg~G~g~~Ll--~~~~~~~~~~~~~ 93 (123)
-|.|+|+-.+ ..+.+.+.++|+.
T Consensus 9 CgsG~~TS~m~~~ki~~~l~~~gi~ 33 (94)
T PRK10310 9 CGGAVATSTMAAEEIKELCQSHNIP 33 (94)
T ss_pred CCCchhHHHHHHHHHHHHHHHCCCe
Confidence 4679987666 6777888888874
No 297
>PF11039 DUF2824: Protein of unknown function (DUF2824); InterPro: IPR022568 This family of proteins has no known function. Members of the family are found in P22-like viruses and bacteria. Some of the phage members have been annotated as head assembly proteins, but this has not been confirmed.
Probab=25.74 E-value=1.6e+02 Score=17.79 Aligned_cols=46 Identities=15% Similarity=-0.015 Sum_probs=30.5
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHHhc
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHF 90 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~ 90 (123)
.+.++|++.+....+ ...-..-.-+|++|| ++.+.-...-.|..+.
T Consensus 46 g~~l~Gi~~v~~i~~-------~~vecHa~y~P~fRG--~a~~~~~~F~kwlL~N 91 (151)
T PF11039_consen 46 GGQLGGIVYVEEIQP-------SVVECHAMYDPGFRG--YALEIGRLFCKWLLEN 91 (151)
T ss_pred ceEEEEEEEEEEEee-------eeEEEEeeeccccch--hHHHHHHHHHHHHhcC
Confidence 478888888874332 111222345999999 8877777777777665
No 298
>PF00107 ADH_zinc_N: Zinc-binding dehydrogenase; InterPro: IPR013149 Alcohol dehydrogenase (1.1.1.1 from EC) (ADH) catalyzes the reversible oxidation of alcohols to their corresponding acetaldehyde or ketone with the concomitant reduction of NAD: alcohol + NAD = aldehyde or ketone + NADH Currently three structurally and catalytically different types of alcohol dehydrogenases are known: Zinc-containing 'long-chain' alcohol dehydrogenases. Insect-type, or 'short-chain' alcohol dehydrogenases. Iron-containing alcohol dehydrogenases. Zinc-containing ADH's [, ] are dimeric or tetrameric enzymes that bind two atoms of zinc per subunit. One of the zinc atom is essential for catalytic activity while the other is not. Both zinc atoms are coordinated by either cysteine or histidine residues; the catalytic zinc is coordinated by two cysteines and one histidine. Zinc-containing ADH's are found in bacteria, mammals, plants, and in fungi. In many species there is more than one isozyme (for example, humans have at least six isozymes, yeast have three, etc.). A number of other zinc-dependent dehydrogenases are closely related to zinc ADH [] and are included in this family. Sorbitol dehydrogenase (1.1.1.14 from EC) L-threonine 3-dehydrogenase (1.1.1.103 from EC) Glutathione-dependent formaldehyde dehydrogenase (1.1.1.284 from EC) Mannitol dehydrogenase (1.1.1.255 from EC) In addition, this family includes NADP-dependent quinone oxidoreductase (1.6.5.5 from EC), an enzyme found in bacteria (gene qor), in yeast and in mammals where, in some species such as rodents, it has been recruited as an eye lens protein and is known as zeta-crystallin []. The sequence of quinone oxidoreductase is distantly related to that other zinc-containing alcohol dehydrogenases and it lacks the zinc-ligand residues. The torpedo fish and mammalian synaptic vesicle membrane protein vat-1 is related to qor. This entry represents the cofactor-binding domain of these enzymes, which is normally found towards the C terminus. Structural studies indicate that it forms a classical Rossman fold that reversibly binds NAD(H) [, , ].; GO: 0008270 zinc ion binding, 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3PI7_A 3COS_D 1VJ1_A 2ZB3_A 1PIW_B 1Q1N_A 1PS0_A 2EER_B 3KRT_A 1ZSY_A ....
Probab=25.59 E-value=1e+02 Score=17.39 Aligned_cols=32 Identities=16% Similarity=0.292 Sum_probs=21.1
Q ss_pred HHHHHHhcCCeEEEEccCCCchhHHHhcCcEE
Q psy6918 83 LVKLAKHFQCYKLTLDCADHMIPFYETFGFQK 114 (123)
Q Consensus 83 ~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~ 114 (123)
+.+.++..|.+.+.+..++....+.+++|...
T Consensus 6 a~q~ak~~G~~vi~~~~~~~k~~~~~~~Ga~~ 37 (130)
T PF00107_consen 6 AIQLAKAMGAKVIATDRSEEKLELAKELGADH 37 (130)
T ss_dssp HHHHHHHTTSEEEEEESSHHHHHHHHHTTESE
T ss_pred HHHHHHHcCCEEEEEECCHHHHHHHHhhcccc
Confidence 45566777855555555666777888888544
No 299
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=25.58 E-value=1.6e+02 Score=17.52 Aligned_cols=26 Identities=15% Similarity=0.024 Sum_probs=20.5
Q ss_pred cCCchHHHHHHHHHHHHHhcCCeEEE
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQCYKLT 96 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~~~i~ 96 (123)
+..++.+-|+..+++.|++.|++.+.
T Consensus 37 ~hp~L~~Dllge~v~a~h~~Girv~a 62 (132)
T PF14871_consen 37 RHPGLKRDLLGEQVEACHERGIRVPA 62 (132)
T ss_pred CCCCCCcCHHHHHHHHHHHCCCEEEE
Confidence 34455679999999999999997553
No 300
>CHL00130 rbcS ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit; Reviewed
Probab=25.06 E-value=1.7e+02 Score=17.71 Aligned_cols=30 Identities=13% Similarity=-0.006 Sum_probs=25.6
Q ss_pred CCCCCCHHHHHHHHHhhhcCCCeEEEEEEc
Q psy6918 4 KLPTSNVFHFAERFHRMKASQDYLVTVIED 33 (123)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 33 (123)
..|+++.+++.+.++...+.+...++-..+
T Consensus 10 yLPpLTdeqI~kQI~Y~i~~GW~p~iEft~ 39 (138)
T CHL00130 10 FLPDLTDQQIEKQIQYAISKGWALNVEWTD 39 (138)
T ss_pred cCCCCCHHHHHHHHHHHHhcCCeEEEEecC
Confidence 458999999999999999999888876655
No 301
>COG2428 Uncharacterized conserved protein [Function unknown]
Probab=25.05 E-value=1.2e+02 Score=19.14 Aligned_cols=40 Identities=8% Similarity=0.065 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEcc
Q psy6918 77 KLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKN 116 (123)
Q Consensus 77 ~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~ 116 (123)
..|+....+.++..|.+.+.+.+.+.-...+.+.|+.-..
T Consensus 14 eWlllEy~e~a~~~G~~~~vtna~pe~p~vlak~g~~~i~ 53 (196)
T COG2428 14 EWLLLEYKEVARWWGDEFIVTNAKPEEPEVLAKIGLSGIP 53 (196)
T ss_pred HHHHHHHHHHHHHhchheeeecCCcchhHHHHHhccccCc
Confidence 5677777778999999999988888777788888865443
No 302
>PF02334 RTP: Replication terminator protein; InterPro: IPR003432 The bacterial replication terminator protein (RTP) plays a role in the termination of DNA replication by impeding replication fork movement. Two RTP dimers bind to the two inverted repeat regions at the termination site.; GO: 0003677 DNA binding, 0006274 DNA replication termination; PDB: 2DPU_A 2DPD_A 1F4K_A 1J0R_B 2EFW_F 2DQR_B 1BM9_B.
Probab=24.80 E-value=31 Score=19.96 Aligned_cols=22 Identities=18% Similarity=0.149 Sum_probs=15.1
Q ss_pred cCCchHHHHHHHHHHHHHhcCC
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
+++|+|.+|++.+.+.-+..|+
T Consensus 29 ~~r~Yg~q~Ld~lr~EFk~~Gy 50 (122)
T PF02334_consen 29 QERGYGLQLLDELRSEFKPLGY 50 (122)
T ss_dssp TT-EBCTCHHHHHHHHHTTTT-
T ss_pred cccchHHHHHHHHHHHhhhcCC
Confidence 6778888888888777666664
No 303
>KOG1602|consensus
Probab=24.73 E-value=1.5e+02 Score=20.12 Aligned_cols=30 Identities=20% Similarity=0.256 Sum_probs=26.2
Q ss_pred ccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 70 YRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 70 ~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
-.|+--|-.-+..++++|.+.|++.|.+.+
T Consensus 59 ~~GH~aGf~~l~~ile~C~~lGI~~vT~fA 88 (271)
T KOG1602|consen 59 SEGHEAGFEALKEILELCKELGIKEVTVFA 88 (271)
T ss_pred ccchHHHHHHHHHHHHHHHHcCCcEEEEEE
Confidence 467788899999999999999999988765
No 304
>COG2265 TrmA SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]
Probab=24.54 E-value=2e+02 Score=21.04 Aligned_cols=49 Identities=18% Similarity=0.312 Sum_probs=35.7
Q ss_pred EEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCc----hhHHHhcCcEEc
Q psy6918 62 EEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHM----IPFYETFGFQKK 115 (123)
Q Consensus 62 ~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~----~~~y~~~Gf~~~ 115 (123)
+.+.|||. +.|.+..+++.+.+.. ..+-+.++||+.. ...+...||+..
T Consensus 364 d~VvvDPP--R~G~~~~~lk~l~~~~---p~~IvYVSCNP~TlaRDl~~L~~~gy~i~ 416 (432)
T COG2265 364 DVVVVDPP--RAGADREVLKQLAKLK---PKRIVYVSCNPATLARDLAILASTGYEIE 416 (432)
T ss_pred CEEEECCC--CCCCCHHHHHHHHhcC---CCcEEEEeCCHHHHHHHHHHHHhCCeEEE
Confidence 45788887 7789988888887542 2345788898755 558899998543
No 305
>PRK14838 undecaprenyl pyrophosphate synthase; Provisional
Probab=24.32 E-value=89 Score=20.80 Aligned_cols=32 Identities=19% Similarity=0.202 Sum_probs=27.0
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.--|..-+..++++|.+.|++.+.+-+
T Consensus 31 ~~~~GH~~G~~~l~~i~~~~~~~gI~~lT~Ya 62 (242)
T PRK14838 31 ERSFGHQAGAETVHIITEEAARLGVKFLTLYT 62 (242)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHcCCCEEEEEe
Confidence 44467788999999999999999999888754
No 306
>COG0529 CysC Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism]
Probab=24.09 E-value=1e+02 Score=19.82 Aligned_cols=42 Identities=14% Similarity=0.097 Sum_probs=27.8
Q ss_pred chH-HHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 74 ELG-KLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 74 G~g-~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-| +.+-.++.+...+.|+..-.++.+.--..+.+.+||...
T Consensus 33 GsGKSTiA~ale~~L~~~G~~~y~LDGDnvR~gL~~dLgFs~e 75 (197)
T COG0529 33 GSGKSTIANALEEKLFAKGYHVYLLDGDNVRHGLNRDLGFSRE 75 (197)
T ss_pred CCCHHHHHHHHHHHHHHcCCeEEEecChhHhhcccCCCCCChH
Confidence 444 345555666677889887777665555567777888764
No 307
>cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote
Probab=24.08 E-value=1.3e+02 Score=16.00 Aligned_cols=16 Identities=13% Similarity=0.320 Sum_probs=10.7
Q ss_pred EEEEEcCCCCeEEEEEEE
Q psy6918 28 VTVIEDTRTKQVIGTGSL 45 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~ 45 (123)
..++.+ +++++|.++.
T Consensus 89 ~~vv~~--~~~~~Gvvt~ 104 (110)
T cd04609 89 AVVVDE--GGKFVGIITR 104 (110)
T ss_pred eeEEec--CCeEEEEEeH
Confidence 344444 6899998875
No 308
>PRK14835 undecaprenyl pyrophosphate synthase; Provisional
Probab=23.92 E-value=85 Score=21.35 Aligned_cols=31 Identities=13% Similarity=0.069 Sum_probs=25.6
Q ss_pred cccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 69 TYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 69 ~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
...|.-.|..-+..++++|.+.|++.+.+-+
T Consensus 63 ~~~GH~~G~~~l~~i~~~c~~lGIk~lTvYa 93 (275)
T PRK14835 63 REMGHEFGVQKAYEVLEWCLELGIPTVTIWV 93 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 3356677899999999999999999988743
No 309
>cd03527 RuBisCO_small Ribulose bisphosphate carboxylase/oxygenase (Rubisco), small subunit. Rubisco is a bifunctional enzyme catalyzes the initial steps of two opposing metabolic pathways: photosynthetic carbon fixation and the competing process of photorespiration. Rubisco Form I, present in plants and green algae, is composed of eight large and eight small subunits. The nearly identical small subunits are encoded by a family of nuclear genes. After translation, the small subunits are translocated across the chloroplast membrane, where an N-terminal signal peptide is cleaved off. While the large subunits contain the catalytic activities, it has been shown that the small subunits are important for catalysis by enhancing the catalytic rate through inducing conformational changes in the large subunits.
Probab=23.83 E-value=1.5e+02 Score=16.77 Aligned_cols=30 Identities=10% Similarity=0.005 Sum_probs=24.2
Q ss_pred CCCCCCHHHHHHHHHhhhcCCCeEEEEEEc
Q psy6918 4 KLPTSNVFHFAERFHRMKASQDYLVTVIED 33 (123)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 33 (123)
..|+++.+++.+.++.+...+...++-..+
T Consensus 8 ylp~lt~~~i~~QI~yll~qG~~~~lE~ad 37 (99)
T cd03527 8 YLPPLTDEQIAKQIDYIISNGWAPCLEFTE 37 (99)
T ss_pred cCCCCCHHHHHHHHHHHHhCCCEEEEEccc
Confidence 458999999999999998888877755444
No 310
>PF02631 RecX: RecX family; InterPro: IPR003783 RecX is a putative bacterial regulatory protein []. The gene encoding RecX is found downstream of recA, and it is suggested that the RecX protein might be regulator of RecA activity by interaction with the RecA protein or filament [].; GO: 0006282 regulation of DNA repair; PDB: 3DFG_A 3D5L_B 3C1D_B 3E3V_A.
Probab=23.76 E-value=80 Score=18.08 Aligned_cols=22 Identities=14% Similarity=0.182 Sum_probs=16.5
Q ss_pred cCCchHHHHHHHHHHHHHhcCC
Q psy6918 71 RGKELGKLLIAVLVKLAKHFQC 92 (123)
Q Consensus 71 rg~G~g~~Ll~~~~~~~~~~~~ 92 (123)
+.+|+....++.+++.+.+.|+
T Consensus 2 ~~kg~~~e~I~~vi~~l~~~gy 23 (121)
T PF02631_consen 2 KRKGFSEEAIEEVIDRLKELGY 23 (121)
T ss_dssp HHTT--HHHHHHHHHHHHHTTS
T ss_pred cccCCCHHHHHHHHHHHHHcCC
Confidence 3578888899999999888886
No 311
>cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=23.70 E-value=1.3e+02 Score=16.10 Aligned_cols=10 Identities=20% Similarity=0.292 Sum_probs=8.0
Q ss_pred CCeEEEEEEE
Q psy6918 36 TKQVIGTGSL 45 (123)
Q Consensus 36 ~~~ivG~~~~ 45 (123)
+++++|.++.
T Consensus 97 ~~~~~Gvvt~ 106 (112)
T cd04625 97 GGTLLGVISF 106 (112)
T ss_pred CCEEEEEEEH
Confidence 4889998875
No 312
>PRK14828 undecaprenyl pyrophosphate synthase; Provisional
Probab=23.69 E-value=1e+02 Score=20.66 Aligned_cols=32 Identities=16% Similarity=0.161 Sum_probs=26.6
Q ss_pred ccc-cCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTY-RGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~-rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|.. .|.--|..-+..++++|.+.|++.+.+-+
T Consensus 47 ~~~~~GH~~G~~~l~~~~~~~~~~gIk~lTvYa 79 (256)
T PRK14828 47 TDVSQGHRAGAAKIGEFLGWCDETDVNVVTLYL 79 (256)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHcCCCEEEEEE
Confidence 444 67778999999999999999999988743
No 313
>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]
Probab=23.66 E-value=57 Score=21.21 Aligned_cols=40 Identities=13% Similarity=0.029 Sum_probs=29.4
Q ss_pred CCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 72 GKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 72 g~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
=+|+|.+-.+.++-++.+... ..++.-..+++.++|+...
T Consensus 120 iKGIG~ETaDsILlYa~~rp~----FVvD~Yt~R~l~rlg~i~~ 159 (215)
T COG2231 120 IKGIGKETADSILLYALDRPV----FVVDKYTRRLLSRLGGIEE 159 (215)
T ss_pred cCCcchhhHHHHHHHHhcCcc----cchhHHHHHHHHHhccccc
Confidence 479999999999988866542 3334555788999988775
No 314
>KOG0023|consensus
Probab=23.64 E-value=1.2e+02 Score=21.38 Aligned_cols=39 Identities=18% Similarity=0.170 Sum_probs=28.9
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCC-CchhHHHhcCcE
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCAD-HMIPFYETFGFQ 113 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~-~~~~~y~~~Gf~ 113 (123)
|+|. |=...++++++.|.+...++... ......+++|-.
T Consensus 189 GlGG-LGh~aVq~AKAMG~rV~vis~~~~kkeea~~~LGAd 228 (360)
T KOG0023|consen 189 GLGG-LGHMAVQYAKAMGMRVTVISTSSKKKEEAIKSLGAD 228 (360)
T ss_pred cCcc-cchHHHHHHHHhCcEEEEEeCCchhHHHHHHhcCcc
Confidence 5666 77788899999999888887654 555677777743
No 315
>cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or
Probab=22.57 E-value=1.4e+02 Score=15.99 Aligned_cols=29 Identities=21% Similarity=0.109 Sum_probs=16.0
Q ss_pred HHHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 15 ERFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+.+......+...+.+.. +++++|.++..
T Consensus 78 ~~~~~~~~~~~~~~~Vv~---~~~~~G~it~~ 106 (111)
T cd04589 78 NALLLMTRHRIHRVVVRE---GGEVVGVLEQT 106 (111)
T ss_pred HHHHHHHHhCccEEEEee---CCEEEEEEEhH
Confidence 333433344444444444 48899988753
No 316
>cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=22.47 E-value=1.4e+02 Score=15.98 Aligned_cols=21 Identities=10% Similarity=0.219 Sum_probs=13.3
Q ss_pred CCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 24 QDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 24 ~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
+...+.+..+ +|+++|++...
T Consensus 23 ~~~~~~v~d~--~~~~~G~v~~~ 43 (111)
T cd04626 23 NTNEIIVKDN--EEKLKGVVTFT 43 (111)
T ss_pred CCCeEEEEcC--CCCEEEEEehH
Confidence 3333444444 68999998864
No 317
>cd06915 NTP_transferase_WcbM_like WcbM_like is a subfamily of nucleotidyl transferases. WcbM protein of Burkholderia mallei is involved in the biosynthesis, export or translocation of capsule. It is a subfamily of nucleotidyl transferases that transfer nucleotides onto phosphosugars.
Probab=22.46 E-value=1.5e+02 Score=18.52 Aligned_cols=26 Identities=12% Similarity=0.230 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCC
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCAD 101 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~ 101 (123)
|+-|++.+++.+.+.++..+.+.++.
T Consensus 28 g~pli~~~l~~l~~~g~~~v~vv~~~ 53 (223)
T cd06915 28 GRPFLEYLLEYLARQGISRIVLSVGY 53 (223)
T ss_pred CcchHHHHHHHHHHCCCCEEEEEccc
Confidence 46788888888888888888777653
No 318
>PF01380 SIS: SIS domain SIS domain web page.; InterPro: IPR001347 The SIS (Sugar ISomerase) domain is a phosphosugar-binding domain [] found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars possibly by binding to the end-product of the pathway.; GO: 0005529 sugar binding, 0005975 carbohydrate metabolic process; PDB: 3TBF_C 2V4M_A 2ZJ4_A 2ZJ3_A 3FKJ_A 3ODP_A 3EUA_H 1VIV_A 1M3S_B 1TZB_A ....
Probab=22.40 E-value=1.6e+02 Score=16.56 Aligned_cols=27 Identities=19% Similarity=0.139 Sum_probs=20.5
Q ss_pred chHHHHHHHHHHHHHhcCCeEEEEccCC
Q psy6918 74 ELGKLLIAVLVKLAKHFQCYKLTLDCAD 101 (123)
Q Consensus 74 G~g~~Ll~~~~~~~~~~~~~~i~~~~~~ 101 (123)
|.-+++++.+. .+++.|.+.|.++.+.
T Consensus 64 g~~~~~~~~~~-~ak~~g~~vi~iT~~~ 90 (131)
T PF01380_consen 64 GETRELIELLR-FAKERGAPVILITSNS 90 (131)
T ss_dssp STTHHHHHHHH-HHHHTTSEEEEEESST
T ss_pred ccchhhhhhhH-HHHhcCCeEEEEeCCC
Confidence 55677777777 9999999887776543
No 319
>COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms]
Probab=22.34 E-value=1.7e+02 Score=20.57 Aligned_cols=25 Identities=16% Similarity=0.252 Sum_probs=16.7
Q ss_pred HhhhcCCCeEEEEEEcCCCCeEEEEEE
Q psy6918 18 HRMKASQDYLVTVIEDTRTKQVIGTGS 44 (123)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~ivG~~~ 44 (123)
-.+.+.+.+.+.+.++ .|+++|.++
T Consensus 339 p~lad~g~H~lpvld~--~g~lvGIvs 363 (382)
T COG3448 339 PRLADEGLHALPVLDA--AGKLVGIVS 363 (382)
T ss_pred HHhhcCCcceeeEEcC--CCcEEEEee
Confidence 3344556666666665 799999876
No 320
>PHA01733 hypothetical protein
Probab=22.34 E-value=2e+02 Score=17.74 Aligned_cols=72 Identities=13% Similarity=-0.051 Sum_probs=37.0
Q ss_pred CCeEEEEEEEEEeeecccccceeEEEEEEEecccccCCchHHHHHHHHHHHHH-hcCCeEEEEc---cCCCchhHHHhcC
Q psy6918 36 TKQVIGTGSLILEQKFIHECALKGKIEEVVVDDTYRGKELGKLLIAVLVKLAK-HFQCYKLTLD---CADHMIPFYETFG 111 (123)
Q Consensus 36 ~~~ivG~~~~~~~~~~~~~~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~-~~~~~~i~~~---~~~~~~~~y~~~G 111 (123)
+|+++|..+... + .. +..+....+.+..=.+ +-.+.+..+..+.. ...+..++=- -|...++|.+.+|
T Consensus 55 nG~l~aI~Gv~~-d-~~---~~vG~pWlV~T~~v~k---~~~~f~re~r~~l~e~~~Yp~LwNyV~~~N~~hir~Lk~lG 126 (153)
T PHA01733 55 DGSLAGVAGLVE-D-MG---NRVGEIWMVCTPAIEK---NPIALLRGAKWWLPKSRNYDLLWNIVDKRNLVHRKLLRKLG 126 (153)
T ss_pred CCcEEEEecccc-c-cc---CCCCceeEEecHHhHh---CCHHHHHHHHHHHHHhccccHHHHhHhcccHHHHHHHHHcC
Confidence 588988888864 1 11 1223333333322222 33344444444433 3344434322 2556688999999
Q ss_pred cEEc
Q psy6918 112 FQKK 115 (123)
Q Consensus 112 f~~~ 115 (123)
|+-.
T Consensus 127 F~f~ 130 (153)
T PHA01733 127 FKGL 130 (153)
T ss_pred ceee
Confidence 9986
No 321
>KOG4717|consensus
Probab=22.25 E-value=50 Score=25.08 Aligned_cols=30 Identities=17% Similarity=0.378 Sum_probs=20.2
Q ss_pred HHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 84 VKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 84 ~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+.+|.+.. .|.=.-.+.|.-||+|+|.++.
T Consensus 131 I~YCHqLH--VVHRDLKPENVVFFEKlGlVKL 160 (864)
T KOG4717|consen 131 ISYCHQLH--VVHRDLKPENVVFFEKLGLVKL 160 (864)
T ss_pred HHHHhhhh--hhcccCCcceeEEeeecCceEe
Confidence 34454433 2333446888999999999997
No 322
>cd02523 PC_cytidylyltransferase Phosphocholine cytidylyltransferases catalyze the synthesis of CDP-choline. This family contains proteins similar to prokaryotic phosphocholine (P-cho) cytidylyltransferases. Phosphocholine (PC) cytidylyltransferases catalyze the transfer of a cytidine monophosphate from CTP to phosphocholine to form CDP-choline. PC is the most abundant phospholipid in eukaryotic membranes and it is also important in prokaryotic membranes. For pathogenic prokaryotes, the cell surface PC facilitates the interaction with host surface and induces attachment and invasion. In addition cell wall PC serves as scaffold for a group of choline-binding proteins that are secreted from the cells. Phosphocholine (PC) cytidylyltransferase is a key enzyme in the prokaryotic choline metabolism pathway. It has been hypothesized to consist of a choline transport system, a choline kinase, CTP:phosphocholine cytidylyltransferase, and a choline phosphotransferase that transfers P-Cho from CDP
Probab=22.11 E-value=87 Score=19.93 Aligned_cols=29 Identities=10% Similarity=-0.009 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHhcCCeEEEEccCCCchh
Q psy6918 77 KLLIAVLVKLAKHFQCYKLTLDCADHMIP 105 (123)
Q Consensus 77 ~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~ 105 (123)
+.|++.+++.+.+.|+..+.+.+......
T Consensus 29 ~~li~~~l~~l~~~gi~~i~vv~~~~~~~ 57 (229)
T cd02523 29 KPLLERQIETLKEAGIDDIVIVTGYKKEQ 57 (229)
T ss_pred EEHHHHHHHHHHHCCCceEEEEeccCHHH
Confidence 57888888888888888888776554433
No 323
>COG5092 NMT1 N-myristoyl transferase [Lipid metabolism]
Probab=22.05 E-value=3e+02 Score=19.50 Aligned_cols=92 Identities=18% Similarity=0.128 Sum_probs=49.0
Q ss_pred EEEEEcCCCCeEEEEEEEEEeeec-c-cc---cceeEEEEEEEecccccCCchHHHHHHHHHH-----------HHHhcC
Q psy6918 28 VTVIEDTRTKQVIGTGSLILEQKF-I-HE---CALKGKIEEVVVDDTYRGKELGKLLIAVLVK-----------LAKHFQ 91 (123)
Q Consensus 28 ~~~~~~~~~~~ivG~~~~~~~~~~-~-~~---~~~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~-----------~~~~~~ 91 (123)
.+++++. +|+|-++.++..-+-. + +. .-..+++...+.+. +=+.+..+.-.++.+ .++..+
T Consensus 312 syvVe~p-~gkItdFfsFyslp~t~i~n~kykdiq~gYLYYya~d~--~~kd~~~~a~~a~~~r~~e~v~Da~ilak~~~ 388 (451)
T COG5092 312 SYVVEEP-NGKITDFFSFYSLPFTTIENKKYKDIQGGYLYYYAGDD--QFKDFDPKATKALKTRVAEMVGDAMILAKVEG 388 (451)
T ss_pred EEEEeCC-CCccccceEEEeccceeecCccccccceeEEEEEccCc--cccccChHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 3445543 7899999887643210 0 11 11223444444444 333344443333333 345556
Q ss_pred CeEEEEccCCCchhHHHhcCcEEccceeEee
Q psy6918 92 CYKLTLDCADHMIPFYETFGFQKKNNFMQIY 122 (123)
Q Consensus 92 ~~~i~~~~~~~~~~~y~~~Gf~~~~~~~~~~ 122 (123)
+...-..+.-+|.-|...++|..-.-++.+|
T Consensus 389 ~DVFNalt~~dN~lFL~dLkFg~GdGflnyY 419 (451)
T COG5092 389 CDVFNALTMMDNSLFLADLKFGCGDGFLNYY 419 (451)
T ss_pred CchhhhhhhccchhHHHhcCccCCCceeEEE
Confidence 5544444456777799999998875555544
No 324
>PF05063 MT-A70: MT-A70 ; InterPro: IPR007757 N6-methyladenosine (m6A) is present at internal sites in eukaryotic mRNA. It is present only within a defined sequence context that has been shown to be conserved across species from plants to man. Despite its ubiquity and conserved sequence specificity, the functional significance of this modification remains a mystery [], []. MT-A70 is the S-adenosylmethionine-binding subunit of human mRNA N6-adenosine-methyltransferase (MTase), an enzyme that sequence-specifically methylates adenines in pre-mRNAs. Proteins with sequence similarity to MT-A70 have been identified in eukaryotes and prokaryotes. The resulting family is defined by sequence similarity in the carboxyl-proximal regions of the respective proteins. The amino-proximal regions of the eukaryotic proteins are highly diverse, often Pro-rich, and are conserved only within individual subfamilies []. Corresponding regions are not present in prokaryotic members of the family. MT-A70-like proteins contain examples of some of the consensus methyltransferase motifs that have been derived from mutational and structural studies of bacterial DNA methyltransferases, including the universally conserved motif IV catalytic residues and a proposed motif I (AdoMet binding) element []. The MT-A70-like family comprises four subfamilies with varying degrees of interrelatedness. One subfamily is a small group of bacterial DNA: m6A MTases. The other three are paralogous eukaryotic lineages, two of which have not been associated with MTase activity but include proteins that regulate mRNA levels via unknown mechanisms apparently not involving methylation []. Some proteins known to belong to the MT-A70-like family are listed below: Human N6-adenosine-methyltransferase 70 kDa subunit (MT-A70) (2.1.1.62 from EC). Yeast N6-adenosine-methyltransferase IME4 (2.1.1.62 from EC), which is important for induction of sporulation. Yeast karyogamy protein KAR4, a phosphoprotein required for expression of karyogamy-specific genes during mating and that it also acts during mitosis and meiosis. It has been suggested that KAR4 is inactive for methyltransfer and may not even bind AdoMet. ; GO: 0008168 methyltransferase activity, 0006139 nucleobase-containing compound metabolic process
Probab=21.94 E-value=2.1e+02 Score=17.69 Aligned_cols=14 Identities=14% Similarity=0.619 Sum_probs=11.5
Q ss_pred hhHHHhcCcEEccc
Q psy6918 104 IPFYETFGFQKKNN 117 (123)
Q Consensus 104 ~~~y~~~Gf~~~~~ 117 (123)
.+++++.||+..+.
T Consensus 62 ~~l~~~WGf~~~~~ 75 (176)
T PF05063_consen 62 LELFPAWGFEYVTE 75 (176)
T ss_pred HHHHHhCCCEEEEE
Confidence 67999999998744
No 325
>PRK14836 undecaprenyl pyrophosphate synthase; Provisional
Probab=21.87 E-value=1.1e+02 Score=20.58 Aligned_cols=30 Identities=17% Similarity=0.097 Sum_probs=25.7
Q ss_pred ccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 70 YRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 70 ~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
..|.--|..-+..++++|.+.|++.+.+-+
T Consensus 37 ~~GH~~G~~~~~~iv~~c~~~gI~~lTvYa 66 (253)
T PRK14836 37 VEGHRAGVRAVRRTIEFCLEKGIEMLTLFA 66 (253)
T ss_pred hhhHHHHHHHHHHHHHHHHHcCCCEEehhH
Confidence 347778999999999999999999888754
No 326
>COG0196 RibF FAD synthase [Coenzyme metabolism]
Probab=21.70 E-value=2.9e+02 Score=19.21 Aligned_cols=35 Identities=17% Similarity=0.214 Sum_probs=27.4
Q ss_pred ecccccCCchHHH-HHHHHHHHHHhcCCeEEEEccC
Q psy6918 66 VDDTYRGKELGKL-LIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 66 v~p~~rg~G~g~~-Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
+.-.|=|..+|.+ |++.+.+.+++.+...+.++-+
T Consensus 20 ~IG~FDGvHlGHq~ll~~a~~~a~~~~~~~~VitF~ 55 (304)
T COG0196 20 TIGNFDGVHLGHQKLLAQALEAAEKRGLPVVVITFE 55 (304)
T ss_pred EEEcCCccchhHHHHHHHHHHHHHHhCCceEEEEec
Confidence 4455678888866 9999999999999877777543
No 327
>TIGR02143 trmA_only tRNA (uracil-5-)-methyltransferase. This family consists exclusively of proteins believed to act as tRNA (uracil-5-)-methyltransferase. All members of far are proteobacterial. The seed alignment was taken directly from pfam05958 in Pfam 12.0, but higher cutoffs are used to select only functionally equivalent proteins. Homologous proteins excluded by the higher cutoff scores of this model include other uracil methyltransferases, such as RumA, active on rRNA.
Probab=21.61 E-value=1.4e+02 Score=20.93 Aligned_cols=46 Identities=11% Similarity=0.098 Sum_probs=31.3
Q ss_pred EEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCCCch----hHHHhcCcEEc
Q psy6918 62 EEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCADHMI----PFYETFGFQKK 115 (123)
Q Consensus 62 ~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~----~~y~~~Gf~~~ 115 (123)
..+++||. +.|++..+++.+.+ - -..+.++|++... +.+.+ ||+..
T Consensus 281 d~v~lDPP--R~G~~~~~l~~l~~---~--~~ivYvsC~p~tlaRDl~~L~~-~Y~l~ 330 (353)
T TIGR02143 281 STIFVDPP--RAGLDPDTCKLVQA---Y--ERILYISCNPETLKANLEQLSE-THRVE 330 (353)
T ss_pred CEEEECCC--CCCCcHHHHHHHHc---C--CcEEEEEcCHHHHHHHHHHHhc-CcEEE
Confidence 56889999 57999999998875 1 2457788877552 23432 36664
No 328
>PRK14827 undecaprenyl pyrophosphate synthase; Provisional
Probab=21.60 E-value=1e+02 Score=21.28 Aligned_cols=32 Identities=16% Similarity=0.157 Sum_probs=25.9
Q ss_pred ccccCCchHHHHHHHHHHHHHhcCCeEEEEcc
Q psy6918 68 DTYRGKELGKLLIAVLVKLAKHFQCYKLTLDC 99 (123)
Q Consensus 68 p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~ 99 (123)
|...|.-.|..-+..++++|.+.|++.+.+-+
T Consensus 88 ~~~~GH~~G~~~l~~v~~~c~~lGI~~lTvYa 119 (296)
T PRK14827 88 ARTEGHKMGEAVVIDIACGAIELGIKWLSLYA 119 (296)
T ss_pred CHhHHHHHHHHHHHHHHHHHHHcCCCEEEEee
Confidence 33356666889999999999999999998754
No 329
>cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown.
Probab=21.58 E-value=1.7e+02 Score=16.57 Aligned_cols=18 Identities=22% Similarity=0.266 Sum_probs=12.1
Q ss_pred EEEEEEcCCCCeEEEEEEEE
Q psy6918 27 LVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 27 ~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+.+..+ +++++|++...
T Consensus 27 ~~~Vvd~--~~~~~Gvi~~~ 44 (135)
T cd04586 27 GLPVVDD--DGRLVGIVSEG 44 (135)
T ss_pred CceEECC--CCCEEEEeeHH
Confidence 3444544 68999998754
No 330
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=21.57 E-value=1.5e+02 Score=16.42 Aligned_cols=24 Identities=17% Similarity=0.061 Sum_probs=17.5
Q ss_pred cCCeEEEEccCCCchhHHHhcCcE
Q psy6918 90 FQCYKLTLDCADHMIPFYETFGFQ 113 (123)
Q Consensus 90 ~~~~~i~~~~~~~~~~~y~~~Gf~ 113 (123)
.+..++.+...+.....++..||+
T Consensus 78 ~~~~~v~vlG~~~l~~~l~~~G~e 101 (101)
T PF13344_consen 78 KGGKKVYVLGSDGLREELREAGFE 101 (101)
T ss_dssp TTSSEEEEES-HHHHHHHHHTTEE
T ss_pred CCCCEEEEEcCHHHHHHHHHcCCC
Confidence 456777777777778888888875
No 331
>COG2813 RsmC 16S RNA G1207 methylase RsmC [Translation, ribosomal structure and biogenesis]
Probab=21.54 E-value=2.9e+02 Score=19.20 Aligned_cols=39 Identities=18% Similarity=0.207 Sum_probs=25.1
Q ss_pred EEEEEEeccccc-CCchHH----HHHHHHHHHHHhcCCeEEEEccC
Q psy6918 60 KIEEVVVDDTYR-GKELGK----LLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 60 ~i~~~~v~p~~r-g~G~g~----~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
....|..+|.++ |+.+.. +|++.+.+..+.-|- +++.+|
T Consensus 224 kfd~IisNPPfh~G~~v~~~~~~~~i~~A~~~L~~gGe--L~iVan 267 (300)
T COG2813 224 KFDLIISNPPFHAGKAVVHSLAQEIIAAAARHLKPGGE--LWIVAN 267 (300)
T ss_pred cccEEEeCCCccCCcchhHHHHHHHHHHHHHhhccCCE--EEEEEc
Confidence 456678899976 777777 556666655555564 444443
No 332
>cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=21.49 E-value=1.6e+02 Score=16.09 Aligned_cols=28 Identities=14% Similarity=0.090 Sum_probs=15.3
Q ss_pred HHHhhhcCCCeEEEEEEcCCCCeEEEEEEEE
Q psy6918 16 RFHRMKASQDYLVTVIEDTRTKQVIGTGSLI 46 (123)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~ 46 (123)
.+..+.+.+...+.++.+ ++++|.++..
T Consensus 89 ~~~~~~~~~~~~~~Vv~~---~~~~Gvi~~~ 116 (121)
T cd04633 89 VASLMLENNIGGLPVVDD---GKLVGIVTRT 116 (121)
T ss_pred HHHHHHHcCCCcccEEEC---CEEEEEEEHH
Confidence 333333334444444443 7899988753
No 333
>PRK11660 putative transporter; Provisional
Probab=21.12 E-value=1.8e+02 Score=21.96 Aligned_cols=39 Identities=15% Similarity=0.037 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
|-+.+..+.+.+++ |.+-+....+++..+.+++.|+...
T Consensus 508 g~~~L~~l~~~l~~-g~~l~l~~l~~~v~~~l~~~gl~~~ 546 (568)
T PRK11660 508 GLDAFQRFVKRLPE-GCELRICNLQFQPLRTLARAGIQPI 546 (568)
T ss_pred HHHHHHHHHHHHHC-CCEEEEecCChHHHHHHHHCCChhh
Confidence 45667777777788 8876666778899999999998764
No 334
>PRK12485 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein; Provisional
Probab=21.11 E-value=1.6e+02 Score=21.00 Aligned_cols=28 Identities=7% Similarity=-0.081 Sum_probs=21.2
Q ss_pred HhcCCeEEEEccCCCchhHHHhcCcEEc
Q psy6918 88 KHFQCYKLTLDCADHMIPFYETFGFQKK 115 (123)
Q Consensus 88 ~~~~~~~i~~~~~~~~~~~y~~~Gf~~~ 115 (123)
+..|+.+|.+-.|+....-.+.+|.+..
T Consensus 335 r~LGV~kirLLnNP~K~~~L~~~GIeV~ 362 (369)
T PRK12485 335 QDLGVGKLRHLGPPLKYAGLTGYDLEVV 362 (369)
T ss_pred HHcCCCEEEECCCchhhhhhhhCCcEEE
Confidence 5668999988877666666778888776
No 335
>KOG2540|consensus
Probab=21.03 E-value=2.2e+02 Score=18.84 Aligned_cols=41 Identities=12% Similarity=0.147 Sum_probs=23.6
Q ss_pred EEcCCCCeEEEEEEEEEeee-----------------cccccceeEEEEEEEeccccc
Q psy6918 31 IEDTRTKQVIGTGSLILEQK-----------------FIHECALKGKIEEVVVDDTYR 71 (123)
Q Consensus 31 ~~~~~~~~ivG~~~~~~~~~-----------------~~~~~~~~~~i~~~~v~p~~r 71 (123)
+++..+..|||..+..+.+. -...+.....=..++|+|+|-
T Consensus 171 a~N~sdkpIiGvstYni~P~~Aa~YFnKiqCFCFEEQ~L~pgE~vDmPVFFyIDPefa 228 (269)
T KOG2540|consen 171 AENPSDKPIIGVSTYNITPGQAAVYFNKIQCFCFEEQKLNPGEQVDMPVFFYIDPEFA 228 (269)
T ss_pred ccCCCCCCceeeEeeccCccHhhhheeceeEEeehhhccCCCcccCcceEEEeCcccc
Confidence 34444567999988764332 112233334455688999874
No 336
>KOG0538|consensus
Probab=20.79 E-value=2.8e+02 Score=19.58 Aligned_cols=35 Identities=20% Similarity=0.132 Sum_probs=26.3
Q ss_pred EEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccCC
Q psy6918 60 KIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCAD 101 (123)
Q Consensus 60 ~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~ 101 (123)
...-++|.++ +.+.+.+++.+.+.|++-+.++.+.
T Consensus 123 rwfQLYvykd-------r~It~~Lv~raEk~GfkAlvlTvDt 157 (363)
T KOG0538|consen 123 RWFQLYVYKD-------RDITEQLVKRAEKAGFKALVLTVDT 157 (363)
T ss_pred EEEEEEecCc-------hHHHHHHHHHHHHcCceEEEEEecc
Confidence 3444788887 5677777788888999999887643
No 337
>PF02679 ComA: (2R)-phospho-3-sulfolactate synthase (ComA); InterPro: IPR003830 Methanogenic archaea produce methane via the anaerobic reduction of acetate or single carbon compounds []. Coenzyme M (CoM; 2-mercaptoethanesulphonic acid) serves as the terminal methyl carrier for this process. Previously thought to be unique to methanogenic archaea, CoM has also been found in methylotrophic bacteria. Biosynthesis of CoM begins with the Michael addition of sulphite to phosphoenolpyruvate, forming 2-phospho-3-sulpholactate (PSL). This reaction is catalyzed by members of this family, PSL synthase (ComA) []. Subsequently, PSL is dephosphorylated by phosphosulpholactate phosphatase (ComB) to form 3-sulpholactate [], which is then converted to 3-sulphopyruvate by L-sulpholactate dehydrogenase (ComC; 1.1.1.272 from EC) []. Sulphopyruvate decarboxylase (ComDE; 4.1.1.79 from EC) converts 3-sulphopyruvate to sulphoacetaldehyde []. Reductive thiolation of sulphoacetaldehyde is the final step.; GO: 0019295 coenzyme M biosynthetic process; PDB: 1U83_A 1QWG_A.
Probab=20.79 E-value=2.4e+02 Score=18.92 Aligned_cols=37 Identities=11% Similarity=0.099 Sum_probs=24.5
Q ss_pred HHHHHHHHHHhcCCeEEEEccCC---------CchhHHHhcCcEEc
Q psy6918 79 LIAVLVKLAKHFQCYKLTLDCAD---------HMIPFYETFGFQKK 115 (123)
Q Consensus 79 Ll~~~~~~~~~~~~~~i~~~~~~---------~~~~~y~~~Gf~~~ 115 (123)
-++..++.|++.|+..|.++... ..++..++.||++.
T Consensus 85 ~~~~yl~~~k~lGf~~IEiSdGti~l~~~~r~~~I~~~~~~Gf~v~ 130 (244)
T PF02679_consen 85 KFDEYLEECKELGFDAIEISDGTIDLPEEERLRLIRKAKEEGFKVL 130 (244)
T ss_dssp -HHHHHHHHHHCT-SEEEE--SSS---HHHHHHHHHHHCCTTSEEE
T ss_pred hHHHHHHHHHHcCCCEEEecCCceeCCHHHHHHHHHHHHHCCCEEe
Confidence 46777788899999999886421 22557778888875
No 338
>cd06426 NTP_transferase_like_2 NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase.
Probab=20.77 E-value=1.6e+02 Score=18.46 Aligned_cols=28 Identities=7% Similarity=0.055 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEccCCCc
Q psy6918 76 GKLLIAVLVKLAKHFQCYKLTLDCADHM 103 (123)
Q Consensus 76 g~~Ll~~~~~~~~~~~~~~i~~~~~~~~ 103 (123)
|+.|++.+++.+.+.++..+.+.++...
T Consensus 28 g~pli~~~l~~l~~~~~~~iivv~~~~~ 55 (220)
T cd06426 28 GKPILETIIDRFIAQGFRNFYISVNYLA 55 (220)
T ss_pred CcchHHHHHHHHHHCCCcEEEEECccCH
Confidence 3568888888888888888888765443
No 339
>cd04263 DUF619-NAGK-FABP DUF619 domain of N-acetylglutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway. DUF619-NAGK-FABP: DUF619 domain of N-acetylglutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (FABP). The nuclear-encoded, mitochondrial polyprotein precursor (ARG5,6) consists of an N-terminal NAGK (ArgB) domain, a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved into two distinct enzymes (NAGK-DUF619 and NAGPR) in the mitochondria. Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. Arg5,6 catalyzes the second reaction of arginine biosynthesis; the phosphorylation of the gamma-carboxyl group of NAG to produce N-acetylglutamylphosphate (NAGP) which is subsequently converted to ornithine in two more steps. It also binds and regulates the promoters of nuclear and mitochondrial genes, and may possibly regu
Probab=20.76 E-value=1.8e+02 Score=16.44 Aligned_cols=39 Identities=8% Similarity=0.018 Sum_probs=30.3
Q ss_pred eeEEEEEEEecccccCCchHHHHHHHHHHHHHhcCCeEEEEccC
Q psy6918 57 LKGKIEEVVVDDTYRGKELGKLLIAVLVKLAKHFQCYKLTLDCA 100 (123)
Q Consensus 57 ~~~~i~~~~v~p~~rg~G~g~~Ll~~~~~~~~~~~~~~i~~~~~ 100 (123)
...++..+.|..+-++.|++..+.+++.+. ...+.+.+.
T Consensus 32 ~vp~LdkF~vs~~~~l~~vaD~Vf~~i~~d-----~p~L~W~~r 70 (98)
T cd04263 32 EVATLATFTITKSGWLNNVADNIFTAIKKD-----HPKLVWTVR 70 (98)
T ss_pred CCEEEEEEEEccccccccHHHHHHHHHHhh-----CCeeEEEeC
Confidence 467899999999999999999999988754 335655543
No 340
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=20.47 E-value=1.2e+02 Score=23.33 Aligned_cols=40 Identities=18% Similarity=0.120 Sum_probs=30.8
Q ss_pred HHHHHHHHHHHHhcCCeEEEEccCCCchhHHHhcCcEEccc
Q psy6918 77 KLLIAVLVKLAKHFQCYKLTLDCADHMIPFYETFGFQKKNN 117 (123)
Q Consensus 77 ~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y~~~Gf~~~~~ 117 (123)
.++...++.++++.|+..|.+. .-...++....||+.+..
T Consensus 164 ~e~a~~llpYl~elG~T~IELM-Pv~e~p~~~sWGYq~~g~ 203 (628)
T COG0296 164 FELAIELLPYLKELGITHIELM-PVAEHPGDRSWGYQGTGY 203 (628)
T ss_pred HHHHHHHhHHHHHhCCCEEEEc-ccccCCCCCCCCCCccee
Confidence 6788888899999999999884 334456778888887643
No 341
>PRK13397 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=20.45 E-value=2e+02 Score=19.30 Aligned_cols=36 Identities=14% Similarity=0.073 Sum_probs=24.7
Q ss_pred CCchHHHHHHHHHHHHHhcCCeEEEEccCCCchhHH
Q psy6918 72 GKELGKLLIAVLVKLAKHFQCYKLTLDCADHMIPFY 107 (123)
Q Consensus 72 g~G~g~~Ll~~~~~~~~~~~~~~i~~~~~~~~~~~y 107 (123)
-+|+|.+=++.+.+.+++.|+..+.-..++....+.
T Consensus 60 f~G~G~~gl~~L~~~~~~~Gl~~~Tev~d~~~v~~~ 95 (250)
T PRK13397 60 FQGLGLQGIRYLHEVCQEFGLLSVSEIMSERQLEEA 95 (250)
T ss_pred cCCCCHHHHHHHHHHHHHcCCCEEEeeCCHHHHHHH
Confidence 344555799999999999999766654444444444
No 342
>COG3414 SgaB Phosphotransferase system, galactitol-specific IIB component [Carbohydrate transport and metabolism]
Probab=20.41 E-value=1.5e+02 Score=16.47 Aligned_cols=23 Identities=13% Similarity=0.176 Sum_probs=16.5
Q ss_pred cCCchHHHHH--HHHHHHHHhcCCe
Q psy6918 71 RGKELGKLLI--AVLVKLAKHFQCY 93 (123)
Q Consensus 71 rg~G~g~~Ll--~~~~~~~~~~~~~ 93 (123)
-|.|+|+-++ ..+++.++++|+.
T Consensus 8 CG~GvgSS~~ik~kve~~l~~~gi~ 32 (93)
T COG3414 8 CGNGVGSSTMIKMKVEEVLKELGID 32 (93)
T ss_pred CCCCccHHHHHHHHHHHHHHHcCCC
Confidence 4778886554 4567788888984
Done!