Query psy838
Match_columns 152
No_of_seqs 228 out of 1476
Neff 7.7
Searched_HMMs 46136
Date Fri Aug 16 17:09:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy838.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/838hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0402|consensus 99.9 3.2E-26 6.8E-31 148.7 2.3 45 1-45 1-45 (92)
2 PTZ00255 60S ribosomal protein 99.9 3.7E-25 8.1E-30 147.4 3.5 45 1-45 1-45 (90)
3 PRK03976 rpl37ae 50S ribosomal 99.9 9.5E-25 2E-29 145.4 3.5 45 1-45 1-45 (90)
4 TIGR00280 L37a ribosomal prote 99.9 4.3E-24 9.2E-29 142.4 3.4 44 1-45 1-44 (91)
5 KOG2488|consensus 99.9 1.9E-22 4E-27 150.8 10.4 98 54-152 53-150 (202)
6 PF01780 Ribosomal_L37ae: Ribo 99.9 7.8E-24 1.7E-28 141.1 2.4 44 2-45 1-44 (90)
7 COG1997 RPL43A Ribosomal prote 99.8 9.9E-22 2.2E-26 129.2 3.0 44 1-45 1-44 (89)
8 PRK10146 aminoalkylphosphonic 99.3 3.2E-12 7E-17 91.1 7.0 94 49-151 6-105 (144)
9 KOG3216|consensus 99.3 2E-11 4.3E-16 88.9 9.5 102 49-152 6-114 (163)
10 PHA00673 acetyltransferase dom 99.2 1.1E-10 2.4E-15 85.9 10.8 101 49-151 9-114 (154)
11 PF13673 Acetyltransf_10: Acet 99.2 1.8E-10 3.9E-15 79.1 10.7 87 56-151 1-93 (117)
12 TIGR03827 GNAT_ablB putative b 99.2 1.9E-10 4E-15 91.3 10.2 97 47-151 116-212 (266)
13 PRK10314 putative acyltransfer 99.1 6.7E-10 1.4E-14 81.4 10.5 97 46-151 6-103 (153)
14 TIGR01575 rimI ribosomal-prote 99.1 8E-10 1.7E-14 76.7 8.9 83 56-151 1-83 (131)
15 PHA01807 hypothetical protein 99.1 7.1E-10 1.5E-14 81.6 8.5 98 49-152 6-111 (153)
16 PTZ00330 acetyltransferase; Pr 99.1 3.2E-10 7E-15 80.9 6.4 95 49-151 9-111 (147)
17 PRK07757 acetyltransferase; Pr 99.0 2.1E-09 4.5E-14 77.5 9.3 91 48-151 3-94 (152)
18 TIGR03448 mycothiol_MshD mycot 99.0 3E-09 6.4E-14 84.6 10.9 95 48-151 2-99 (292)
19 PRK09831 putative acyltransfer 99.0 2.6E-09 5.6E-14 77.1 9.6 89 49-151 3-101 (147)
20 TIGR02406 ectoine_EctA L-2,4-d 99.0 1.9E-09 4.1E-14 79.1 8.9 93 50-151 2-95 (157)
21 PRK03624 putative acetyltransf 99.0 2.4E-09 5.1E-14 75.0 8.3 91 49-151 5-97 (140)
22 PF00583 Acetyltransf_1: Acety 99.0 2.3E-09 5E-14 69.3 7.6 49 103-151 4-54 (83)
23 PRK09491 rimI ribosomal-protei 99.0 5E-09 1.1E-13 75.1 9.6 90 48-151 3-92 (146)
24 TIGR01890 N-Ac-Glu-synth amino 99.0 3.6E-09 7.7E-14 89.3 9.9 93 46-151 282-376 (429)
25 PF13508 Acetyltransf_7: Acety 99.0 4.7E-09 1E-13 68.0 7.9 53 93-151 3-55 (79)
26 PLN02706 glucosamine 6-phospha 98.9 8.4E-09 1.8E-13 74.2 9.4 98 48-151 8-114 (150)
27 PRK07922 N-acetylglutamate syn 98.9 6.3E-09 1.4E-13 77.3 8.8 93 48-151 7-99 (169)
28 PF13527 Acetyltransf_9: Acety 98.9 1.9E-09 4.1E-14 75.3 5.6 92 50-151 3-101 (127)
29 PRK05279 N-acetylglutamate syn 98.9 6.4E-09 1.4E-13 88.0 9.8 94 46-151 294-388 (441)
30 PRK10140 putative acetyltransf 98.9 5.6E-09 1.2E-13 75.4 8.1 98 49-151 6-107 (162)
31 PLN02825 amino-acid N-acetyltr 98.9 7.1E-09 1.5E-13 89.3 9.9 94 46-151 367-461 (515)
32 TIGR03103 trio_acet_GNAT GNAT- 98.9 1E-08 2.2E-13 89.2 10.8 92 53-151 90-184 (547)
33 PRK10514 putative acetyltransf 98.9 1.2E-08 2.6E-13 72.8 9.3 94 48-151 3-98 (145)
34 COG0456 RimI Acetyltransferase 98.9 1.5E-08 3.2E-13 74.3 8.2 98 49-151 14-120 (177)
35 PRK12308 bifunctional arginino 98.8 3.6E-08 7.8E-13 86.7 9.4 92 48-151 465-556 (614)
36 TIGR01686 FkbH FkbH-like domai 98.8 7.8E-08 1.7E-12 78.3 10.6 97 48-151 188-286 (320)
37 COG1247 Sortase and related ac 98.7 9.5E-08 2.1E-12 71.3 9.2 101 49-152 4-111 (169)
38 TIGR02382 wecD_rffC TDP-D-fuco 98.7 1.2E-07 2.7E-12 71.4 10.0 99 46-151 43-152 (191)
39 PRK10975 TDP-fucosamine acetyl 98.7 2.1E-07 4.7E-12 70.1 10.5 97 48-151 48-155 (194)
40 COG3153 Predicted acetyltransf 98.7 1E-07 2.2E-12 71.3 8.2 93 49-151 6-104 (171)
41 PRK01346 hypothetical protein; 98.7 2E-07 4.3E-12 77.9 10.6 94 48-151 8-108 (411)
42 cd04301 NAT_SF N-Acyltransfera 98.7 9.9E-08 2.1E-12 56.9 6.3 48 103-151 7-54 (65)
43 PRK13688 hypothetical protein; 98.7 3.6E-07 7.7E-12 67.4 10.1 56 92-149 44-106 (156)
44 PF13523 Acetyltransf_8: Acety 98.6 1.4E-07 3E-12 68.0 7.1 97 51-151 3-107 (152)
45 PRK10562 putative acetyltransf 98.6 5E-07 1.1E-11 64.7 9.8 88 49-150 2-96 (145)
46 KOG3139|consensus 98.6 6.4E-07 1.4E-11 66.0 9.5 98 46-151 16-113 (165)
47 COG1246 ArgA N-acetylglutamate 98.5 1.5E-07 3.4E-12 68.9 5.5 93 49-152 3-95 (153)
48 PF13420 Acetyltransf_4: Acety 98.5 3.5E-07 7.6E-12 65.7 7.4 95 50-151 2-105 (155)
49 TIGR03448 mycothiol_MshD mycot 98.5 7.8E-07 1.7E-11 70.7 10.0 91 56-151 160-255 (292)
50 TIGR03585 PseH pseudaminic aci 98.5 2.6E-06 5.7E-11 61.1 10.3 99 48-151 2-104 (156)
51 PRK15130 spermidine N1-acetylt 98.4 2.7E-06 5.9E-11 63.3 9.8 100 47-151 7-111 (186)
52 PRK10809 ribosomal-protein-S5- 98.4 6.7E-06 1.4E-10 61.6 10.7 99 47-151 18-132 (194)
53 PF13302 Acetyltransf_3: Acety 98.2 2.4E-06 5.2E-11 60.1 5.6 101 48-151 3-112 (142)
54 PF14542 Acetyltransf_CG: GCN5 98.2 7.4E-06 1.6E-10 53.6 7.4 45 103-151 7-51 (78)
55 PRK10151 ribosomal-protein-L7/ 98.2 3E-05 6.6E-10 57.2 11.5 101 47-151 11-121 (179)
56 cd02169 Citrate_lyase_ligase C 98.1 4.5E-06 9.7E-11 67.6 5.9 41 103-151 14-54 (297)
57 KOG3396|consensus 98.0 4.5E-05 9.8E-10 54.9 8.5 73 79-151 34-114 (150)
58 COG2153 ElaA Predicted acyltra 98.0 1.1E-05 2.5E-10 58.6 5.2 98 46-151 7-105 (155)
59 PF13718 GNAT_acetyltr_2: GNAT 97.9 8.4E-05 1.8E-09 56.8 8.5 70 80-151 14-119 (196)
60 COG2388 Predicted acetyltransf 97.9 3.9E-05 8.4E-10 52.5 5.9 56 92-152 14-69 (99)
61 KOG3397|consensus 97.9 8E-05 1.7E-09 55.9 8.0 57 95-152 57-113 (225)
62 TIGR00124 cit_ly_ligase [citra 97.9 3.6E-05 7.9E-10 63.3 6.5 49 93-151 31-79 (332)
63 PF12568 DUF3749: Acetyltransf 97.8 0.00059 1.3E-08 48.6 10.5 86 47-151 5-90 (128)
64 KOG3235|consensus 97.8 0.00011 2.3E-09 54.4 6.8 51 97-148 45-97 (193)
65 TIGR01211 ELP3 histone acetylt 97.7 0.00012 2.6E-09 63.5 6.7 59 93-151 411-486 (522)
66 PF08445 FR47: FR47-like prote 97.6 0.00027 5.8E-09 46.9 6.4 42 105-150 8-49 (86)
67 KOG3234|consensus 97.5 0.00016 3.5E-09 53.3 5.0 51 101-151 48-98 (173)
68 COG1444 Predicted P-loop ATPas 97.4 0.001 2.3E-08 59.8 9.1 71 80-151 457-560 (758)
69 PF13480 Acetyltransf_6: Acety 97.3 0.011 2.5E-07 41.0 11.9 114 20-151 5-123 (142)
70 COG0454 WecD Histone acetyltra 97.2 0.00019 4.2E-09 45.8 2.0 24 128-151 87-110 (156)
71 PF05301 Mec-17: Touch recepto 97.1 0.0026 5.6E-08 44.9 6.7 47 104-150 18-74 (120)
72 PF01233 NMT: Myristoyl-CoA:pr 96.9 0.014 3.1E-07 43.2 9.3 102 47-151 27-139 (162)
73 PF01853 MOZ_SAS: MOZ/SAS fami 96.8 0.0034 7.3E-08 47.7 5.7 57 91-150 51-108 (188)
74 COG3981 Predicted acetyltransf 96.7 0.0038 8.3E-08 46.6 5.4 54 92-152 68-127 (174)
75 PF13880 Acetyltransf_13: ESCO 96.5 0.0021 4.6E-08 41.2 2.4 27 123-149 6-32 (70)
76 PLN03238 probable histone acet 96.4 0.0099 2.1E-07 47.9 6.2 55 93-150 128-183 (290)
77 KOG3138|consensus 96.4 0.0035 7.6E-08 47.6 3.4 95 47-151 17-118 (187)
78 PLN03239 histone acetyltransfe 96.0 0.015 3.2E-07 48.0 5.6 55 93-150 186-241 (351)
79 COG1670 RimL Acetyltransferase 96.0 0.047 1E-06 39.3 7.6 47 104-151 77-124 (187)
80 COG3393 Predicted acetyltransf 96.0 0.015 3.3E-07 46.3 5.2 52 92-148 176-227 (268)
81 PF12746 GNAT_acetyltran: GNAT 95.9 0.026 5.6E-07 45.2 6.2 53 92-151 165-217 (265)
82 PF06852 DUF1248: Protein of u 95.9 0.085 1.8E-06 39.9 8.6 63 85-148 37-104 (181)
83 PLN00104 MYST -like histone ac 95.8 0.013 2.8E-07 50.0 4.4 54 93-149 279-333 (450)
84 PTZ00064 histone acetyltransfe 95.8 0.017 3.8E-07 49.7 5.2 56 92-150 356-412 (552)
85 KOG4601|consensus 95.6 0.091 2E-06 41.2 8.0 45 104-149 81-135 (264)
86 TIGR03694 exosort_acyl putativ 95.3 0.34 7.3E-06 38.1 10.5 83 50-137 12-126 (241)
87 PRK13834 putative autoinducer 95.2 0.42 9.1E-06 36.8 10.5 96 50-151 11-135 (207)
88 COG5628 Predicted acetyltransf 95.0 0.076 1.7E-06 37.7 5.4 55 93-150 38-93 (143)
89 PF00765 Autoind_synth: Autoin 94.8 0.64 1.4E-05 35.1 10.3 93 51-151 4-125 (182)
90 KOG2747|consensus 94.8 0.037 8.1E-07 46.4 3.9 54 91-148 233-286 (396)
91 cd04264 DUF619-NAGS DUF619 dom 94.4 0.21 4.6E-06 34.1 6.3 44 105-149 18-61 (99)
92 COG5630 ARG2 Acetylglutamate s 94.4 0.33 7.2E-06 40.8 8.5 99 37-149 329-428 (495)
93 TIGR03243 arg_catab_AOST argin 93.4 0.27 5.9E-06 40.6 6.2 93 50-145 3-140 (335)
94 KOG2779|consensus 93.3 0.28 6E-06 40.9 6.2 100 52-151 89-196 (421)
95 TIGR03244 arg_catab_AstA argin 93.3 0.27 5.9E-06 40.6 6.2 93 50-145 3-140 (336)
96 PF13444 Acetyltransf_5: Acety 93.1 0.59 1.3E-05 31.5 6.7 60 84-144 21-100 (101)
97 PRK10456 arginine succinyltran 93.0 0.31 6.7E-06 40.4 6.1 93 50-145 5-142 (344)
98 PF08444 Gly_acyl_tr_C: Aralky 92.2 0.18 3.8E-06 33.8 3.1 41 103-150 7-47 (89)
99 PF04958 AstA: Arginine N-succ 91.6 0.89 1.9E-05 37.7 7.2 93 50-145 5-144 (342)
100 TIGR03245 arg_AOST_alph argini 91.5 0.77 1.7E-05 38.0 6.6 93 50-145 3-141 (336)
101 cd04265 DUF619-NAGS-U DUF619 d 91.4 0.86 1.9E-05 31.1 5.8 29 121-149 33-61 (99)
102 COG3053 CitC Citrate lyase syn 91.3 0.65 1.4E-05 37.9 5.8 51 93-151 35-85 (352)
103 COG3375 Uncharacterized conser 91.0 1.7 3.8E-05 34.2 7.8 88 52-145 9-97 (266)
104 COG5027 SAS2 Histone acetyltra 91.0 0.14 3E-06 42.5 1.8 51 93-146 235-286 (395)
105 KOG4144|consensus 91.0 0.2 4.2E-06 37.2 2.4 43 104-147 71-126 (190)
106 PF13465 zf-H2C2_2: Zinc-finge 90.9 0.19 4E-06 25.7 1.7 23 22-46 2-24 (26)
107 KOG2696|consensus 89.8 0.95 2.1E-05 37.9 5.8 24 125-148 220-243 (403)
108 PF04768 DUF619: Protein of un 89.7 3.5 7.5E-05 30.8 8.3 92 46-149 22-115 (170)
109 PF09924 DUF2156: Uncharacteri 89.1 2.8 6.2E-05 33.5 8.0 114 20-151 117-234 (299)
110 COG3818 Predicted acetyltransf 88.7 0.59 1.3E-05 33.8 3.3 32 120-151 82-113 (167)
111 KOG2036|consensus 88.1 0.39 8.5E-06 43.5 2.5 28 123-150 615-642 (1011)
112 COG4888 Uncharacterized Zn rib 86.4 0.2 4.3E-06 34.2 -0.1 23 23-45 9-31 (104)
113 COG5092 NMT1 N-myristoyl trans 84.5 4.2 9.2E-05 33.6 6.5 101 51-151 86-194 (451)
114 PF07754 DUF1610: Domain of un 84.2 0.5 1.1E-05 23.9 0.7 12 33-44 13-24 (24)
115 KOG4135|consensus 83.9 1.4 3.1E-05 32.5 3.3 32 120-151 105-136 (185)
116 PF00096 zf-C2H2: Zinc finger, 83.9 0.43 9.2E-06 23.1 0.4 14 37-50 1-14 (23)
117 PHA00616 hypothetical protein 83.1 0.14 3.1E-06 29.8 -1.7 15 36-50 1-15 (44)
118 COG3138 AstA Arginine/ornithin 82.7 3.7 8E-05 33.4 5.5 94 50-144 5-141 (336)
119 KOG3014|consensus 79.8 12 0.00025 29.9 7.2 27 122-148 183-209 (257)
120 PF13894 zf-C2H2_4: C2H2-type 79.8 0.82 1.8E-05 21.7 0.6 13 37-49 1-13 (24)
121 PF09237 GAGA: GAGA factor; I 78.2 0.22 4.7E-06 30.0 -2.1 20 30-49 18-37 (54)
122 PF09538 FYDLN_acid: Protein o 76.3 1.4 2.9E-05 30.6 1.1 14 35-48 8-21 (108)
123 KOG2270|consensus 76.2 7.8 0.00017 33.3 5.7 65 10-74 190-283 (520)
124 PF02945 Endonuclease_7: Recom 75.8 1.5 3.2E-05 28.8 1.1 30 13-45 2-31 (81)
125 COG1243 ELP3 Histone acetyltra 74.9 4.1 8.9E-05 35.3 3.8 50 103-152 415-480 (515)
126 COG4552 Eis Predicted acetyltr 73.8 3.2 7E-05 34.7 2.8 54 95-150 41-98 (389)
127 cd04266 DUF619-NAGS-FABP DUF61 73.6 27 0.00058 24.2 7.0 30 120-149 37-67 (108)
128 PF14354 Lar_restr_allev: Rest 72.4 1.7 3.6E-05 26.4 0.7 9 37-45 4-12 (61)
129 PF09151 DUF1936: Domain of un 72.3 1.7 3.6E-05 23.4 0.6 10 36-45 1-10 (36)
130 PF04438 zf-HIT: HIT zinc fing 72.2 1.5 3.3E-05 23.2 0.4 12 34-45 11-22 (30)
131 PHA02768 hypothetical protein; 71.4 0.92 2E-05 27.7 -0.6 18 35-52 4-21 (55)
132 PF04339 DUF482: Protein of un 71.4 21 0.00045 30.0 7.2 66 46-115 204-271 (370)
133 PF14447 Prok-RING_4: Prokaryo 71.2 2.4 5.1E-05 25.8 1.1 12 39-50 42-53 (55)
134 PF04606 Ogr_Delta: Ogr/Delta- 69.9 2 4.2E-05 25.1 0.5 15 38-52 1-17 (47)
135 TIGR02300 FYDLN_acid conserved 68.4 2.7 5.9E-05 30.0 1.1 11 35-45 8-18 (129)
136 PF14787 zf-CCHC_5: GAG-polypr 68.0 3.1 6.7E-05 23.1 1.1 15 37-51 3-17 (36)
137 KOG2593|consensus 67.3 2.7 5.9E-05 35.8 1.1 24 22-45 112-137 (436)
138 PRK09710 lar restriction allev 63.2 3.3 7.2E-05 26.0 0.6 8 38-45 8-15 (64)
139 PF06689 zf-C4_ClpX: ClpX C4-t 63.0 3.3 7.2E-05 23.4 0.6 9 37-45 2-10 (41)
140 PF13912 zf-C2H2_6: C2H2-type 62.9 3.2 6.9E-05 20.7 0.5 14 36-49 1-14 (27)
141 COG4068 Uncharacterized protei 62.4 9.5 0.00021 23.6 2.5 29 38-66 10-38 (64)
142 TIGR03655 anti_R_Lar restricti 62.2 3.4 7.3E-05 24.6 0.5 8 38-45 3-10 (53)
143 PF05605 zf-Di19: Drought indu 62.2 6.1 0.00013 23.4 1.7 25 20-44 15-39 (54)
144 COG2401 ABC-type ATPase fused 60.7 7.4 0.00016 33.8 2.5 28 123-150 242-269 (593)
145 cd00350 rubredoxin_like Rubred 60.3 3.8 8.3E-05 22.0 0.5 12 34-45 15-26 (33)
146 PF12171 zf-C2H2_jaz: Zinc-fin 59.6 6 0.00013 19.9 1.2 13 36-48 1-13 (27)
147 PF14353 CpXC: CpXC protein 59.5 6 0.00013 27.7 1.5 20 26-45 28-47 (128)
148 COG5189 SFP1 Putative transcri 59.1 4.1 8.9E-05 33.7 0.7 17 35-51 397-413 (423)
149 PF09986 DUF2225: Uncharacteri 58.6 9.4 0.0002 29.4 2.6 62 5-66 17-80 (214)
150 PF04423 Rad50_zn_hook: Rad50 58.6 27 0.00058 20.6 4.1 34 22-64 9-42 (54)
151 PF08271 TF_Zn_Ribbon: TFIIB z 58.5 4.9 0.00011 22.7 0.8 9 37-45 1-9 (43)
152 PRK14890 putative Zn-ribbon RN 58.1 4.8 0.0001 24.9 0.7 12 33-44 45-56 (59)
153 COG2888 Predicted Zn-ribbon RN 57.9 4.8 0.0001 24.9 0.7 12 33-44 47-58 (61)
154 PF09889 DUF2116: Uncharacteri 57.8 17 0.00036 22.4 3.1 24 38-61 5-28 (59)
155 PRK09678 DNA-binding transcrip 57.7 6.2 0.00013 25.4 1.2 19 37-55 2-22 (72)
156 PF12294 DUF3626: Protein of u 57.6 12 0.00027 30.4 3.1 22 125-146 191-212 (297)
157 PRK00398 rpoP DNA-directed RNA 57.5 3.4 7.5E-05 23.7 0.0 10 36-45 21-30 (46)
158 TIGR03019 pepcterm_femAB FemAB 57.1 1E+02 0.0022 24.8 10.4 112 16-151 134-248 (330)
159 smart00355 ZnF_C2H2 zinc finge 57.1 5.5 0.00012 18.8 0.7 12 37-48 1-12 (26)
160 PF03119 DNA_ligase_ZBD: NAD-d 57.0 5.6 0.00012 20.7 0.8 10 38-47 1-10 (28)
161 PRK14892 putative transcriptio 56.0 3.6 7.9E-05 28.1 -0.1 14 32-45 17-30 (99)
162 PF09862 DUF2089: Protein of u 55.7 14 0.0003 25.9 2.8 31 36-66 12-48 (113)
163 PRK15312 antimicrobial resista 55.4 35 0.00076 27.9 5.4 116 7-144 127-252 (298)
164 COG1675 TFA1 Transcription ini 55.0 9.7 0.00021 28.7 2.0 13 33-45 129-141 (176)
165 COG3916 LasI N-acyl-L-homoseri 54.6 1E+02 0.0022 24.0 10.3 96 50-151 10-133 (209)
166 PF13913 zf-C2HC_2: zinc-finge 54.0 5.9 0.00013 19.9 0.5 11 37-47 3-13 (25)
167 cd00729 rubredoxin_SM Rubredox 53.4 5.6 0.00012 21.5 0.4 12 34-45 16-27 (34)
168 smart00531 TFIIE Transcription 52.5 11 0.00023 27.3 1.9 15 33-47 96-110 (147)
169 COG2075 RPL24A Ribosomal prote 52.2 7 0.00015 24.7 0.8 14 36-49 3-16 (66)
170 COG2260 Predicted Zn-ribbon RN 52.2 7.8 0.00017 23.9 0.9 20 37-56 18-39 (59)
171 PF13240 zinc_ribbon_2: zinc-r 52.0 9.4 0.0002 18.8 1.1 6 39-44 16-21 (23)
172 PRK13130 H/ACA RNA-protein com 50.8 10 0.00022 23.1 1.3 21 36-56 17-39 (56)
173 COG3677 Transposase and inacti 50.8 22 0.00047 25.3 3.2 21 34-54 51-73 (129)
174 PF04377 ATE_C: Arginine-tRNA- 50.7 88 0.0019 22.2 7.8 45 103-151 47-91 (128)
175 PF05741 zf-nanos: Nanos RNA b 50.6 4.9 0.00011 24.5 -0.1 23 23-45 20-42 (55)
176 PF13824 zf-Mss51: Zinc-finger 50.2 6.7 0.00015 23.9 0.5 25 32-56 10-34 (55)
177 PF05129 Elf1: Transcription e 50.1 6.6 0.00014 25.7 0.4 14 31-44 17-30 (81)
178 COG1592 Rubrerythrin [Energy p 49.7 10 0.00023 28.3 1.5 25 20-46 120-144 (166)
179 PF10058 DUF2296: Predicted in 49.5 7.3 0.00016 23.5 0.5 15 30-44 38-52 (54)
180 PRK10144 formate-dependent nit 48.9 21 0.00046 25.4 2.9 54 35-97 39-92 (126)
181 PF07395 Mig-14: Mig-14; Inte 48.5 57 0.0012 26.2 5.5 115 7-144 97-222 (264)
182 PF09390 DUF1999: Protein of u 47.8 65 0.0014 23.7 5.3 56 92-149 54-109 (161)
183 PF03604 DNA_RNApol_7kD: DNA d 47.7 7.2 0.00016 21.0 0.3 17 29-45 10-26 (32)
184 PF09943 DUF2175: Uncharacteri 47.5 5.4 0.00012 27.3 -0.3 13 35-47 1-13 (101)
185 COG4049 Uncharacterized protei 47.2 5.9 0.00013 24.3 -0.1 14 36-49 17-30 (65)
186 PF10571 UPF0547: Uncharacteri 46.4 11 0.00024 19.2 0.9 8 37-44 15-22 (26)
187 smart00504 Ubox Modified RING 46.3 29 0.00063 20.5 3.0 25 15-46 20-45 (63)
188 TIGR03147 cyt_nit_nrfF cytochr 46.2 24 0.00051 25.2 2.8 54 35-97 39-92 (126)
189 PF03884 DUF329: Domain of unk 46.2 8 0.00017 23.7 0.3 13 36-48 2-14 (57)
190 PHA00626 hypothetical protein 45.7 9.9 0.00021 23.3 0.7 15 32-46 19-33 (59)
191 PF06559 DCD: 2'-deoxycytidine 45.7 18 0.00039 30.2 2.4 34 103-139 323-356 (364)
192 PF02891 zf-MIZ: MIZ/SP-RING z 45.1 9 0.0002 22.6 0.4 11 35-45 40-50 (50)
193 PRK00464 nrdR transcriptional 44.9 10 0.00023 27.9 0.8 7 38-44 2-8 (154)
194 PF13530 SCP2_2: Sterol carrie 44.6 1.2E+02 0.0026 23.0 6.8 52 92-149 23-76 (218)
195 smart00659 RPOLCX RNA polymera 43.9 7.9 0.00017 22.3 0.1 17 29-45 12-28 (44)
196 COG3024 Uncharacterized protei 43.7 14 0.00031 23.1 1.2 16 33-48 4-19 (65)
197 PF04959 ARS2: Arsenite-resist 42.3 4.7 0.0001 31.3 -1.4 15 33-47 74-88 (214)
198 PF10013 DUF2256: Uncharacteri 41.8 12 0.00025 21.5 0.5 15 36-50 8-22 (42)
199 PF09855 DUF2082: Nucleic-acid 41.5 12 0.00026 23.4 0.6 9 37-45 1-9 (64)
200 COG4391 Uncharacterized protei 41.2 12 0.00025 23.3 0.5 15 33-47 45-59 (62)
201 PRK00114 hslO Hsp33-like chape 41.1 19 0.00041 29.1 1.9 26 33-62 265-290 (293)
202 PF13248 zf-ribbon_3: zinc-rib 41.0 13 0.00027 18.7 0.6 10 37-46 3-12 (26)
203 PF10588 NADH-G_4Fe-4S_3: NADH 40.8 21 0.00046 20.0 1.6 22 23-45 1-22 (41)
204 PF14255 Cys_rich_CPXG: Cystei 40.5 6.7 0.00015 23.6 -0.6 9 37-45 1-9 (52)
205 KOG2907|consensus 40.5 8.9 0.00019 26.8 -0.1 11 35-45 73-83 (116)
206 PF13909 zf-H2C2_5: C2H2-type 40.4 13 0.00027 17.9 0.5 7 37-43 1-7 (24)
207 PRK06266 transcription initiat 40.3 21 0.00046 26.8 1.9 13 33-45 114-126 (178)
208 PF13945 NST1: Salt tolerance 40.2 26 0.00057 26.7 2.4 34 32-69 137-170 (190)
209 KOG2535|consensus 40.1 16 0.00035 30.8 1.3 21 132-152 497-517 (554)
210 PF08772 NOB1_Zn_bind: Nin one 39.8 13 0.00027 24.0 0.5 13 33-45 21-33 (73)
211 PF10122 Mu-like_Com: Mu-like 39.4 18 0.00039 21.7 1.1 22 24-45 12-33 (51)
212 KOG4317|consensus 38.8 14 0.0003 30.5 0.8 14 32-45 15-28 (383)
213 KOG3965|consensus 38.8 12 0.00026 27.6 0.3 26 8-33 117-142 (160)
214 PF11432 DUF3197: Protein of u 38.5 19 0.00041 25.0 1.3 49 10-68 46-97 (113)
215 PF05191 ADK_lid: Adenylate ki 38.4 12 0.00027 20.5 0.3 12 37-48 2-13 (36)
216 COG1998 RPS31 Ribosomal protei 37.9 16 0.00034 21.8 0.7 12 33-44 16-27 (51)
217 cd00730 rubredoxin Rubredoxin; 37.6 17 0.00037 21.5 0.9 13 33-45 31-43 (50)
218 smart00834 CxxC_CXXC_SSSS Puta 37.2 15 0.00033 20.0 0.6 10 36-45 5-14 (41)
219 cd00246 RabGEF Nucleotide exch 37.2 18 0.00039 24.9 1.0 13 36-48 2-14 (103)
220 PRK02983 lysS lysyl-tRNA synth 37.1 4E+02 0.0087 25.9 10.7 114 20-151 357-473 (1094)
221 PF10892 DUF2688: Protein of u 36.2 5.7 0.00012 24.3 -1.4 30 27-59 4-35 (60)
222 PF13878 zf-C2H2_3: zinc-finge 36.1 20 0.00044 20.2 1.0 16 36-51 13-28 (41)
223 TIGR00373 conserved hypothetic 35.8 16 0.00036 26.7 0.7 12 34-45 107-118 (158)
224 cd03173 DUF619-like DUF619 dom 35.8 1.1E+02 0.0023 20.8 4.6 30 120-149 31-60 (98)
225 TIGR02098 MJ0042_CXXC MJ0042 f 35.8 17 0.00038 19.6 0.6 10 36-45 2-11 (38)
226 TIGR01206 lysW lysine biosynth 35.6 18 0.00039 21.8 0.8 10 36-45 2-11 (54)
227 PF12874 zf-met: Zinc-finger o 35.5 16 0.00034 17.6 0.4 12 37-48 1-12 (25)
228 COG4311 SoxD Sarcosine oxidase 35.3 24 0.00052 23.9 1.3 9 37-45 4-12 (97)
229 PF12760 Zn_Tnp_IS1595: Transp 35.2 19 0.0004 20.6 0.7 9 37-45 19-27 (46)
230 PF08792 A2L_zn_ribbon: A2L zi 34.9 18 0.00038 19.5 0.6 14 32-45 17-30 (33)
231 PF02591 DUF164: Putative zinc 34.8 25 0.00055 20.9 1.3 12 34-45 44-55 (56)
232 PRK12380 hydrogenase nickel in 34.7 18 0.0004 25.0 0.8 11 35-45 85-95 (113)
233 PF14206 Cys_rich_CPCC: Cystei 34.7 18 0.00038 23.6 0.6 10 36-45 1-10 (78)
234 COG1656 Uncharacterized conser 34.1 18 0.00038 27.0 0.6 30 32-66 126-155 (165)
235 COG5595 Zn-ribbon-containing, 33.6 28 0.0006 27.0 1.6 29 17-45 184-227 (256)
236 COG1693 Repressor of nif and g 33.5 49 0.0011 27.1 3.1 60 5-67 56-130 (325)
237 PF10071 DUF2310: Zn-ribbon-co 33.4 31 0.00066 27.6 1.9 29 18-46 187-230 (258)
238 PRK01343 zinc-binding protein; 33.1 21 0.00045 21.9 0.7 12 35-46 8-19 (57)
239 KOG2462|consensus 33.0 22 0.00048 28.7 1.1 24 32-55 126-149 (279)
240 PF14690 zf-ISL3: zinc-finger 32.9 22 0.00047 20.0 0.8 10 36-45 2-11 (47)
241 TIGR00100 hypA hydrogenase nic 32.8 20 0.00043 24.9 0.7 10 36-45 86-95 (115)
242 PF00571 CBS: CBS domain CBS d 32.8 39 0.00085 19.2 1.9 32 80-112 17-48 (57)
243 PRK00418 DNA gyrase inhibitor; 32.7 23 0.0005 22.1 0.9 13 35-47 5-17 (62)
244 COG3877 Uncharacterized protei 32.6 39 0.00084 23.5 2.0 16 46-61 36-51 (122)
245 TIGR00244 transcriptional regu 32.5 21 0.00045 26.2 0.7 7 39-45 31-37 (147)
246 PF08209 Sgf11: Sgf11 (transcr 32.4 20 0.00044 19.4 0.5 12 36-47 4-15 (33)
247 COG4469 CoiA Competence protei 32.3 20 0.00044 29.7 0.8 10 36-45 25-34 (342)
248 PRK03681 hypA hydrogenase nick 32.2 19 0.00041 25.0 0.5 10 36-45 87-96 (114)
249 smart00440 ZnF_C2C2 C2C2 Zinc 32.1 23 0.00049 19.8 0.7 9 37-45 1-9 (40)
250 smart00451 ZnF_U1 U1-like zinc 31.8 22 0.00049 18.5 0.7 16 35-50 2-17 (35)
251 PF00301 Rubredoxin: Rubredoxi 31.7 13 0.00029 21.8 -0.3 16 36-51 1-16 (47)
252 PHA00732 hypothetical protein 31.7 23 0.00051 22.9 0.8 14 34-47 25-38 (79)
253 PF06221 zf-C2HC5: Putative zi 31.7 51 0.0011 20.1 2.3 11 36-46 35-45 (57)
254 COG4847 Uncharacterized protei 30.9 15 0.00032 25.0 -0.2 17 33-49 3-19 (103)
255 PF01155 HypA: Hydrogenase exp 30.8 19 0.00041 24.9 0.3 9 37-45 87-95 (113)
256 COG4338 Uncharacterized protei 30.5 15 0.00032 21.8 -0.2 16 36-51 12-27 (54)
257 TIGR01374 soxD sarcosine oxida 30.5 22 0.00049 23.5 0.6 8 38-45 3-10 (84)
258 KOG2879|consensus 30.5 34 0.00073 27.8 1.7 13 33-45 236-248 (298)
259 PF14952 zf-tcix: Putative tre 30.4 24 0.00052 20.4 0.6 11 35-45 10-20 (44)
260 PRK00807 50S ribosomal protein 29.9 29 0.00064 20.6 1.0 15 37-51 2-16 (52)
261 PF04810 zf-Sec23_Sec24: Sec23 29.8 21 0.00045 19.9 0.3 12 34-45 22-33 (40)
262 COG4416 Com Mu-like prophage p 29.4 38 0.00081 20.6 1.4 25 21-45 9-33 (60)
263 PF04135 Nop10p: Nucleolar RNA 29.0 38 0.00083 20.4 1.4 20 37-56 18-39 (53)
264 TIGR02666 moaA molybdenum cofa 28.9 46 0.001 26.9 2.3 46 17-64 4-55 (334)
265 PRK00564 hypA hydrogenase nick 28.9 22 0.00048 24.7 0.4 8 38-45 90-97 (117)
266 PF04475 DUF555: Protein of un 28.8 26 0.00056 23.9 0.7 10 36-45 47-56 (102)
267 PF10276 zf-CHCC: Zinc-finger 28.6 22 0.00048 20.1 0.3 11 36-46 29-39 (40)
268 COG2761 FrnE Predicted dithiol 28.6 12 0.00027 29.3 -1.0 17 26-42 3-19 (225)
269 PRK03922 hypothetical protein; 28.6 26 0.00056 24.4 0.6 10 36-45 49-58 (113)
270 PF01096 TFIIS_C: Transcriptio 28.1 27 0.00059 19.4 0.6 8 38-45 2-9 (39)
271 TIGR03032 conserved hypothetic 27.9 99 0.0021 25.7 4.0 37 93-136 282-318 (335)
272 PRK00420 hypothetical protein; 27.9 73 0.0016 22.2 2.8 9 37-45 24-32 (112)
273 smart00507 HNHc HNH nucleases. 27.6 34 0.00073 18.8 1.0 14 33-47 8-21 (52)
274 PF14205 Cys_rich_KTR: Cystein 27.6 30 0.00066 21.0 0.8 9 37-45 5-13 (55)
275 PF14446 Prok-RING_1: Prokaryo 27.6 27 0.00059 21.1 0.6 12 36-47 21-32 (54)
276 COG3478 Predicted nucleic-acid 27.6 29 0.00063 21.9 0.7 10 36-45 4-13 (68)
277 smart00116 CBS Domain in cysta 27.5 64 0.0014 16.5 2.1 27 85-112 15-41 (49)
278 KOG3993|consensus 27.4 26 0.00056 30.1 0.6 13 33-45 292-304 (500)
279 COG0419 SbcC ATPase involved i 27.3 54 0.0012 30.7 2.7 15 36-50 457-471 (908)
280 PF11039 DUF2824: Protein of u 27.3 2.4E+02 0.0052 20.5 5.4 41 90-136 35-75 (151)
281 PRK00762 hypA hydrogenase nick 27.3 28 0.00061 24.5 0.7 9 37-45 93-101 (124)
282 PF01246 Ribosomal_L24e: Ribos 27.3 31 0.00066 22.1 0.8 14 36-49 3-16 (71)
283 COG1327 Predicted transcriptio 27.2 29 0.00062 25.6 0.7 8 38-45 2-9 (156)
284 cd00085 HNHc HNH nucleases; HN 27.1 46 0.001 18.6 1.6 10 37-46 12-21 (57)
285 PF04267 SoxD: Sarcosine oxida 27.0 20 0.00043 23.7 -0.1 8 38-45 3-10 (84)
286 TIGR03826 YvyF flagellar opero 26.9 33 0.00073 24.7 1.0 11 37-47 4-14 (137)
287 PRK03824 hypA hydrogenase nick 26.9 29 0.00064 24.8 0.7 12 34-45 105-116 (135)
288 smart00734 ZnF_Rad18 Rad18-lik 26.8 30 0.00064 17.5 0.6 9 38-46 3-11 (26)
289 KOG2779|consensus 26.5 2.2E+02 0.0049 24.1 5.8 65 48-115 262-328 (421)
290 PF06827 zf-FPG_IleRS: Zinc fi 26.4 13 0.00029 19.1 -0.8 19 26-44 11-29 (30)
291 PF01927 Mut7-C: Mut7-C RNAse 26.2 29 0.00064 24.9 0.6 11 36-46 124-134 (147)
292 COG1996 RPC10 DNA-directed RNA 26.1 27 0.00058 20.7 0.3 12 34-45 22-33 (49)
293 PF12387 Peptidase_C74: Pestiv 26.0 29 0.00062 26.2 0.6 17 29-45 168-184 (200)
294 PF03811 Zn_Tnp_IS1: InsA N-te 25.7 34 0.00073 18.8 0.7 9 37-45 6-14 (36)
295 TIGR03830 CxxCG_CxxCG_HTH puta 25.6 1.3E+02 0.0028 20.4 3.8 14 35-48 30-43 (127)
296 smart00714 LITAF Possible memb 25.6 34 0.00073 21.1 0.8 15 31-45 47-61 (67)
297 COG1885 Uncharacterized protei 25.5 33 0.00071 23.7 0.7 10 36-45 49-58 (115)
298 PRK12496 hypothetical protein; 25.3 31 0.00068 25.5 0.7 11 37-47 144-154 (164)
299 COG1645 Uncharacterized Zn-fin 25.2 30 0.00066 24.8 0.5 10 36-45 28-37 (131)
300 PRK13361 molybdenum cofactor b 25.2 58 0.0013 26.4 2.3 45 17-63 8-56 (329)
301 PF07315 DUF1462: Protein of u 25.0 94 0.002 20.9 2.8 62 36-108 6-79 (93)
302 PF14803 Nudix_N_2: Nudix N-te 25.0 34 0.00073 18.6 0.6 8 38-45 2-9 (34)
303 COG2093 DNA-directed RNA polym 25.0 35 0.00075 21.3 0.7 9 37-45 19-27 (64)
304 PF10609 ParA: ParA/MinD ATPas 25.0 28 0.00062 22.8 0.3 13 33-45 62-74 (81)
305 PF10915 DUF2709: Protein of u 24.9 64 0.0014 24.9 2.3 32 31-62 82-115 (238)
306 COG1029 FwdB Formylmethanofura 24.8 31 0.00067 29.2 0.6 11 35-45 4-14 (429)
307 KOG1491|consensus 24.6 64 0.0014 27.2 2.4 24 125-148 88-113 (391)
308 PF06397 Desulfoferrod_N: Desu 24.6 30 0.00065 19.1 0.3 12 34-45 4-15 (36)
309 PF04564 U-box: U-box domain; 24.4 99 0.0022 19.3 2.8 27 35-61 38-64 (73)
310 PF09723 Zn-ribbon_8: Zinc rib 24.4 26 0.00056 19.7 0.0 10 36-45 5-14 (42)
311 PRK14852 hypothetical protein; 24.3 6.5E+02 0.014 24.3 9.5 88 52-147 35-146 (989)
312 PF12756 zf-C2H2_2: C2H2 type 24.1 38 0.00082 21.6 0.8 13 35-47 49-61 (100)
313 COG5319 Uncharacterized protei 24.1 34 0.00073 24.6 0.6 13 33-45 29-41 (142)
314 PF08348 PAS_6: YheO-like PAS 23.9 1.3E+02 0.0029 20.8 3.6 23 94-117 85-107 (118)
315 PF09706 Cas_CXXC_CXXC: CRISPR 23.9 38 0.00081 21.3 0.7 13 33-45 2-14 (69)
316 COG4694 Uncharacterized protei 23.9 31 0.00067 30.9 0.5 10 36-45 278-287 (758)
317 COG0375 HybF Zn finger protein 23.8 39 0.00083 23.7 0.8 10 36-45 86-95 (115)
318 COG5270 PUA domain (predicted 23.7 1E+02 0.0022 23.7 3.1 33 94-139 164-196 (202)
319 PF03982 DAGAT: Diacylglycerol 23.7 1.1E+02 0.0024 24.8 3.6 35 42-76 247-283 (297)
320 PRK14891 50S ribosomal protein 23.6 41 0.00089 24.1 0.9 14 36-49 4-17 (131)
321 PF13719 zinc_ribbon_5: zinc-r 23.6 38 0.00082 18.5 0.6 9 37-45 3-11 (37)
322 COG1579 Zn-ribbon protein, pos 23.5 31 0.00067 27.3 0.3 39 7-46 189-231 (239)
323 KOG4602|consensus 23.3 33 0.00071 27.6 0.5 13 33-45 265-277 (318)
324 PF01396 zf-C4_Topoisom: Topoi 23.3 39 0.00085 18.7 0.7 8 38-45 3-10 (39)
325 PF11023 DUF2614: Protein of u 23.1 49 0.0011 23.1 1.2 9 37-45 70-78 (114)
326 PF12677 DUF3797: Domain of un 23.1 39 0.00085 20.0 0.6 9 38-46 15-23 (49)
327 PF06170 DUF983: Protein of un 23.0 47 0.001 21.9 1.1 23 36-61 8-30 (86)
328 PLN02294 cytochrome c oxidase 23.0 38 0.00082 25.5 0.7 13 35-47 140-152 (174)
329 PF04376 ATE_N: Arginine-tRNA- 22.9 1.1E+02 0.0025 19.6 2.9 29 34-62 5-41 (80)
330 PF10815 ComZ: ComZ; InterPro 22.8 1.3E+02 0.0028 18.2 2.8 26 49-74 25-50 (56)
331 TIGR02668 moaA_archaeal probab 22.6 71 0.0015 25.4 2.3 46 17-64 4-52 (302)
332 PF14824 Sirohm_synth_M: Siroh 22.6 52 0.0011 17.4 1.0 19 10-28 10-28 (30)
333 PF06677 Auto_anti-p27: Sjogre 22.1 46 0.001 18.8 0.8 13 36-48 17-29 (41)
334 PF10601 zf-LITAF-like: LITAF- 22.0 43 0.00093 21.0 0.7 14 32-45 54-67 (73)
335 PF02799 NMT_C: Myristoyl-CoA: 21.9 1.2E+02 0.0027 23.1 3.3 66 47-115 29-96 (190)
336 smart00746 TRASH metallochaper 21.7 44 0.00096 16.3 0.7 7 39-45 1-7 (39)
337 PF03376 Adeno_E3B: Adenovirus 21.7 40 0.00087 21.2 0.5 14 130-143 52-65 (67)
338 PRK00758 GMP synthase subunit 21.7 1.1E+02 0.0024 22.4 3.0 27 125-151 155-181 (184)
339 TIGR03823 FliZ flagellar regul 21.7 60 0.0013 24.1 1.5 35 32-66 18-58 (168)
340 PRK05978 hypothetical protein; 21.6 41 0.0009 24.6 0.7 11 35-45 32-42 (148)
341 cd00472 Ribosomal_L24e_L24 Rib 21.5 46 0.001 20.1 0.8 14 36-49 3-16 (54)
342 TIGR03831 YgiT_finger YgiT-typ 21.3 50 0.0011 18.2 0.9 18 31-48 27-44 (46)
343 PF06054 CoiA: Competence prot 21.1 45 0.00099 27.9 0.9 12 34-45 28-39 (375)
344 smart00661 RPOL9 RNA polymeras 21.0 44 0.00095 19.1 0.6 11 35-45 19-29 (52)
345 PF10083 DUF2321: Uncharacteri 21.0 72 0.0016 23.6 1.8 29 33-65 65-93 (158)
346 PF05265 DUF723: Protein of un 20.7 46 0.001 20.6 0.6 9 35-43 52-60 (60)
347 TIGR02491 NrdG anaerobic ribon 20.7 1.3E+02 0.0029 21.5 3.3 33 34-66 24-60 (154)
348 PHA00733 hypothetical protein 20.6 58 0.0013 23.0 1.2 22 31-52 68-89 (128)
349 COG3058 FdhE Uncharacterized p 20.6 48 0.001 27.0 0.9 18 28-45 173-194 (308)
350 PRK00432 30S ribosomal protein 20.6 46 0.00099 19.6 0.6 9 36-44 20-28 (50)
351 PF08273 Prim_Zn_Ribbon: Zinc- 20.6 44 0.00095 18.9 0.5 9 37-45 4-12 (40)
352 PF07282 OrfB_Zn_ribbon: Putat 20.4 45 0.00097 20.4 0.6 15 31-45 41-55 (69)
353 cd04641 CBS_pair_28 The CBS do 20.3 1.6E+02 0.0035 19.1 3.4 27 85-112 88-114 (120)
354 KOG3029|consensus 20.2 21 0.00046 29.2 -1.1 16 30-45 90-105 (370)
355 PF02150 RNA_POL_M_15KD: RNA p 20.0 46 0.001 18.0 0.5 8 38-45 3-10 (35)
356 PHA01733 hypothetical protein 20.0 3.7E+02 0.008 19.8 5.9 59 50-114 6-66 (153)
357 TIGR02605 CxxC_CxxC_SSSS putat 20.0 46 0.001 19.2 0.6 10 36-45 5-14 (52)
No 1
>KOG0402|consensus
Probab=99.92 E-value=3.2e-26 Score=148.73 Aligned_cols=45 Identities=91% Similarity=1.418 Sum_probs=44.6
Q ss_pred CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|++|||||||+|+||+|||++|||++++||++||++|+|+||||.
T Consensus 1 m~krtKKVgI~gkyGtrYGaSLrk~vKkiei~Qhaky~CsfCGK~ 45 (92)
T KOG0402|consen 1 MAKRTKKVGIVGKYGTRYGASLRKMVKKIEIQQHAKYTCSFCGKK 45 (92)
T ss_pred CCcccceeeeeecccchhhHHHHHHHHHHHHHHhhhhhhhhcchh
Confidence 999999999999999999999999999999999999999999998
No 2
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=99.91 E-value=3.7e-25 Score=147.37 Aligned_cols=45 Identities=84% Similarity=1.353 Sum_probs=44.3
Q ss_pred CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|++|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus 1 MakrtkkvG~~GrfG~RYG~slRK~v~kie~~q~a~y~CpfCgk~ 45 (90)
T PTZ00255 1 MAKRTKKVGITGKYGTRYGASLRKQIKKIEISQHAKYFCPFCGKH 45 (90)
T ss_pred CCCcCceeeecCCCcCccCHHHHHHHHHHHHHHhCCccCCCCCCC
Confidence 899999999999999999999999999999999999999999987
No 3
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=99.90 E-value=9.5e-25 Score=145.44 Aligned_cols=45 Identities=47% Similarity=0.871 Sum_probs=44.1
Q ss_pred CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|++|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus 1 m~~rtkkvGi~Gr~G~RYG~slRK~v~kie~~q~a~y~CpfCgk~ 45 (90)
T PRK03976 1 MMSRTKKVGSAGRFGARYGRKIRKRVADIEEKMRAKHVCPVCGRP 45 (90)
T ss_pred CCCcCceEeecCCCcCccCHHHHHHHHHHHHHHhcCccCCCCCCC
Confidence 888999999999999999999999999999999999999999987
No 4
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=99.89 E-value=4.3e-24 Score=142.43 Aligned_cols=44 Identities=68% Similarity=1.222 Sum_probs=42.7
Q ss_pred CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|+ |||||||+||||||||++|||+|++||++||++|+|||||+.
T Consensus 1 ma-rtkkvG~~GrfG~RYG~slRK~v~kie~~q~a~y~CpfCgk~ 44 (91)
T TIGR00280 1 MK-RTKKVGITGRFGPRYGLKLRRQVKKIEIQQKAKYVCPFCGKK 44 (91)
T ss_pred CC-CCceeeecCCCcCccCHHHHHHHHHHHHHHhcCccCCCCCCC
Confidence 67 899999999999999999999999999999999999999987
No 5
>KOG2488|consensus
Probab=99.88 E-value=1.9e-22 Score=150.76 Aligned_cols=98 Identities=39% Similarity=0.732 Sum_probs=92.8
Q ss_pred HHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccc
Q psy838 54 AETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQ 133 (152)
Q Consensus 54 ~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~ 133 (152)
.++++|+|+|.+.||.++|.++.|||++..+..||.+...+|+++.++ .+++|||++|+++.|.+.|++|+++++|.++
T Consensus 53 ~~~ldw~f~L~k~nm~~~Y~qs~~Gw~~~~K~~El~~~~~~Yi~a~~~-~~~~vgf~~Frf~vd~g~~vlYcyEvqv~~~ 131 (202)
T KOG2488|consen 53 YEDLDWCFSLFKKNMGAMYRQSSWGWDDNSKAKELRNRKLRYICAWNN-KSKLVGFTMFRFTVDTGDPVLYCYEVQVASA 131 (202)
T ss_pred HHHHHHHHHHHHhhhHHHhhhcccccCchhHHHHHhhccceEEEEEcC-CCceeeEEEEEEEcccCCeEEEEEEEeehhh
Confidence 389999999999999999999999999999999999999999999977 3499999999999999999999999999999
Q ss_pred cccCCHHHHHHHHHHhhcC
Q psy838 134 VQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 134 ~qgkGiG~~Lm~~l~~~a~ 152 (152)
|||+|||++||+.++..|+
T Consensus 132 yR~kGiGk~LL~~l~~~a~ 150 (202)
T KOG2488|consen 132 YRGKGIGKFLLDTLEKLAD 150 (202)
T ss_pred hhccChHHHHHHHHHHHHH
Confidence 9999999999999998764
No 6
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=99.88 E-value=7.8e-24 Score=141.06 Aligned_cols=44 Identities=86% Similarity=1.365 Sum_probs=39.6
Q ss_pred CCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 2 AKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 2 ~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|+|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus 1 a~rtkkvgi~GkyG~RYG~slRK~vkkie~~q~~ky~Cp~Cgk~ 44 (90)
T PF01780_consen 1 AKRTKKVGIAGKYGTRYGASLRKRVKKIEISQHAKYTCPFCGKT 44 (90)
T ss_dssp -S-SSS-TTGGGGTTSSTHHHHHHHHHHHHHHHS-BEESSSSSS
T ss_pred CCCCceeeecCcCcCCcCHHHHHHHHHHHHHHhCCCcCCCCCCc
Confidence 57999999999999999999999999999999999999999998
No 7
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=99.84 E-value=9.9e-22 Score=129.15 Aligned_cols=44 Identities=59% Similarity=1.089 Sum_probs=42.9
Q ss_pred CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|++ |||||++||||||||+++|++|++||..|+++|.||+||+.
T Consensus 1 M~~-TkkvG~aGrfGpRYG~~~Rrrv~~ie~~~~~~~~Cp~C~~~ 44 (89)
T COG1997 1 MAK-TKKVGIAGRFGPRYGSKLRRRVKEIEAQQRAKHVCPFCGRT 44 (89)
T ss_pred CCc-cceeccCcccccccchHHHHHHHHHHHHHhcCCcCCCCCCc
Confidence 777 99999999999999999999999999999999999999998
No 8
>PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional
Probab=99.34 E-value=3.2e-12 Score=91.06 Aligned_cols=94 Identities=13% Similarity=0.029 Sum_probs=66.7
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh----hhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--CCcce
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDP----QGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--FGEPV 122 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~----~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--~~~~~ 122 (152)
..+++++|++.+.++..+.... .|+. +.+...+.++...++++..+ +++|||+.+..... .....
T Consensus 6 ir~a~~~D~~~l~~l~~~~~~~-------~~~~~~~~~~~~~~l~~~~~~~~v~~~~--~~ivG~~~~~~~~~~~~~~~~ 76 (144)
T PRK10146 6 LRPATQYDTDAVYALICELKQA-------EFDHQAFRVGFNANLRDPNMRYHLALLD--GEVVGMIGLHLQFHLHHVNWI 76 (144)
T ss_pred EeeCcHhhHHHHHHHHHHHhcc-------cCCHHHHHHHHHHHhcCCCceEEEEEEC--CEEEEEEEEEecccccccchh
Confidence 4567899999999987653221 1222 22334454556666666655 89999999875422 12223
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+|.+|+|+|++||+|+|+.||+.+++.|
T Consensus 77 ~~i~~l~v~p~~rg~GiG~~Ll~~~~~~a 105 (144)
T PRK10146 77 GEIQELVVMPQARGLNVGSKLLAWAEEEA 105 (144)
T ss_pred heeheeEECHHHcCCCHHHHHHHHHHHHH
Confidence 56899999999999999999999999876
No 9
>KOG3216|consensus
Probab=99.31 E-value=2e-11 Score=88.87 Aligned_cols=102 Identities=17% Similarity=0.219 Sum_probs=76.2
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCC--CCCChhhhhhhhc-C-CCceEEEEEe-CCCCCeEEEEEEEeecc--CCcc
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCD--IGWDPQGKHSEMF-D-DRACYLVAKN-GSSSTPVAFSHFRFDVD--FGEP 121 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~--~~w~~~~~~~el~-~-~~~~~~v~~~-~~~~~ivGf~~~~~~~~--~~~~ 121 (152)
...+++.|.+.+..|.++ | +-|+... .--+++.+..+++ + +..+..++.- ..++.++||+.|.+.++ .+..
T Consensus 6 IR~at~~D~~~i~rLike-l-a~Fek~~~~v~~te~~l~~~~F~d~~~~~~~v~~ie~~~~~~aGf~~yf~~ystW~~k~ 83 (163)
T KOG3216|consen 6 IRLATPKDCEDILRLIKE-L-AEFEKLEDQVEATEENLARDGFIDPPFKHWLVAAIETSGEVVAGFALYFNNYSTWLGKQ 83 (163)
T ss_pred EEecCcccHHHHHHHHHH-H-HHHHHhccchhhchhhhhhhhccCCCccEEEEEEEecCCCceeEEeeeecccccccccc
Confidence 456888999999999887 2 3455433 2356777777643 2 3344444433 22589999999988765 5667
Q ss_pred eEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
.+||.+|+|.|+|||+|+|+.|+..+-+.|+
T Consensus 84 ~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A~ 114 (163)
T KOG3216|consen 84 GIYLEDLYVREQYRGKGIGSKLLKFVAEEAD 114 (163)
T ss_pred eEEEEeeEecchhcccChHHHHHHHHHHHHH
Confidence 8999999999999999999999999988774
No 10
>PHA00673 acetyltransferase domain containing protein
Probab=99.25 E-value=1.1e-10 Score=85.90 Aligned_cols=101 Identities=11% Similarity=-0.058 Sum_probs=68.9
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCC-CCCCh-hhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc---CCcceE
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCD-IGWDP-QGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD---FGEPVL 123 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~-~~w~~-~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~---~~~~~~ 123 (152)
..-++.+|++.|.+|..+.......... .+... +.+..-..+++..++++..+ |++|||+.+.+... .+.+..
T Consensus 9 ~~~A~~~D~paI~~LLadd~l~~~r~d~~~~~~y~~af~ai~~dp~~~llVa~~~--g~vVG~~~l~~~p~l~~~~~~~~ 86 (154)
T PHA00673 9 VAFAELADAPTFASLCAEYAHESANADLAGRAPDHHAYAGMEAAGVAHFLGVFRG--EELVGFACLLVTPVPHFKGQLIG 86 (154)
T ss_pred HhhccHhhHHHHHHHHHhcccccccccccccchhHHHHHHHHhCCCcEEEEEEEC--CEEEEEEEEEEecCCccCCccEE
Confidence 4457899999999997662111100000 00000 11223345677777777665 89999999877643 245677
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|.+|+|+|++||+|||+.||+.++++|
T Consensus 87 ~Ie~l~V~~~~RGqGIG~~Ll~~A~~~A 114 (154)
T PHA00673 87 TTESIFVAAAHRPGGAGMALLRATEALA 114 (154)
T ss_pred EEEEEEEChhccCCCHHHHHHHHHHHHH
Confidence 8999999999999999999999999886
No 11
>PF13673 Acetyltransf_10: Acetyltransferase (GNAT) domain; PDB: 2FIW_A 1BOB_A 3FNC_B 3EXN_A.
Probab=99.23 E-value=1.8e-10 Score=79.09 Aligned_cols=87 Identities=16% Similarity=0.206 Sum_probs=59.9
Q ss_pred HHHHHHHHHHHhHHHHHHhCC------CCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeee
Q psy838 56 TKKWVWELEEKNMKHSYEVCD------IGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQ 129 (152)
Q Consensus 56 d~~~i~~l~~~~m~~~y~~~~------~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~ 129 (152)
|++.+.+|..+.....+.... +.++++.....+.++...++++..+ +++|||+.+. .++ .|..++
T Consensus 1 D~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~ivG~~~~~---~~~----~i~~l~ 71 (117)
T PF13673_consen 1 DIPAIAELYREAWQENYWDYGPEQIDAWRYSPEDLEEYLEEGSHTIFVAEEG--GEIVGFAWLE---PDG----EISHLY 71 (117)
T ss_dssp GHHHHHHHHHHHHHHHTTTTSHHHHHHHHSSHHHHHHHHCTCCCEEEEEEET--TEEEEEEEEE---TCE----EEEEEE
T ss_pred CHHHHHHHHHHHHHHhccCCCHHHHHHHhcCHHHHHHHHHhcCCEEEEEEEC--CEEEEEEEEc---CCC----eEEEEE
Confidence 667777777765544331110 0144555555555555667777766 9999999985 222 288899
Q ss_pred eccccccCCHHHHHHHHHHhhc
Q psy838 130 LEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 130 V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|+|+|||+|+|+.||+.+++.+
T Consensus 72 v~p~~r~~Gig~~Ll~~~~~~~ 93 (117)
T PF13673_consen 72 VLPEYRGRGIGRALLDAAEKEA 93 (117)
T ss_dssp E-GGGTTSSHHHHHHHHHHHHH
T ss_pred EChhhcCCcHHHHHHHHHHHHH
Confidence 9999999999999999999875
No 12
>TIGR03827 GNAT_ablB putative beta-lysine N-acetyltransferase. Members of this protein family are GNAT family acetyltransferases, based on a seed alignment in which every member is associated with a lysine 2,3-aminomutase family protein, usually as the adjacent gene. This family includes AblB, the enzyme beta-lysine acetyltransferase that completes the two-step synthesis of the osmolyte (compatible solute) N-epsilon-acetyl-beta-lysine; all members of the family may have this function. Note that N-epsilon-acetyl-beta-lysine has been observed only in methanogenic archaea (e.g. Methanosarcina) but that this model, paired with TIGR03820, suggests a much broader distribution.
Probab=99.18 E-value=1.9e-10 Score=91.28 Aligned_cols=97 Identities=12% Similarity=0.106 Sum_probs=69.8
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY 126 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~ 126 (152)
.....++++|++.+.+|..+. |....++|.+.....+...+...++++..+ +++||++.+.... ....++|.
T Consensus 116 ~~IR~a~~~D~~~l~~L~~~v----~~~~~~~~~~~~~l~~~~~~~~~~~v~~~~--g~iVG~~~~~~~~--~~~~~eI~ 187 (266)
T TIGR03827 116 FTLRIATEDDADAMAALYRKV----FPTYPFPIHDPAYLLETMKSNVVYFGVEDG--GKIIALASAEMDP--ENGNAEMT 187 (266)
T ss_pred eEEEECCHHHHHHHHHHHHHH----hccCCCCccCHHHHHHHhcCCcEEEEEEEC--CEEEEEEEEecCC--CCCcEEEE
Confidence 345678999999999998763 322234444433333334455566666655 8999999875432 23357899
Q ss_pred eeeeccccccCCHHHHHHHHHHhhc
Q psy838 127 ELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 127 ~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+++|+|+|||+|+|+.||+.+++.|
T Consensus 188 ~i~V~P~yRG~GiG~~Ll~~l~~~a 212 (266)
T TIGR03827 188 DFATLPEYRGKGLAKILLAAMEKEM 212 (266)
T ss_pred EEEECHHHcCCCHHHHHHHHHHHHH
Confidence 9999999999999999999999876
No 13
>PRK10314 putative acyltransferase; Provisional
Probab=99.14 E-value=6.7e-10 Score=81.43 Aligned_cols=97 Identities=7% Similarity=0.023 Sum_probs=66.6
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhh-cCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEM-FDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY 124 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el-~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~ 124 (152)
+.+..+++.+++..++.|-.+.+ +.....++. + .++. ..+.+..+++..+ +++|||+.+....+ .....+
T Consensus 6 ~~~~~~l~~~~~~~~~~lR~~VF---~~eq~~~~~-e--~D~~d~~~~~~h~~~~~~--~~~vg~~r~~~~~~-~~~~~~ 76 (153)
T PRK10314 6 DLHHSELSVSQLYALLQLRCAVF---VVEQNCPYQ-D--IDGDDLTGDNRHILGWKN--DELVAYARILKSDD-DLEPVV 76 (153)
T ss_pred ecchhhCCHHHHHHHHHHHHHHh---hhhcCCCcc-c--cCCCCCCCCcEEEEEEEC--CEEEEEEEEecCCC-CCCCEE
Confidence 34568999999999999987643 222233332 1 1111 1123444556655 89999999876422 223478
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|.+|+|+|+|||+|+|++||+.+++.+
T Consensus 77 i~rv~V~~~~rG~GiG~~Lm~~~~~~~ 103 (153)
T PRK10314 77 IGRVIVSEALRGEKVGQQLMSKTLESC 103 (153)
T ss_pred EEEEEECHHHhCCCHHHHHHHHHHHHH
Confidence 999999999999999999999988764
No 14
>TIGR01575 rimI ribosomal-protein-alanine acetyltransferase. Members of this model belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This model covers prokarotes and the archaea. The seed contains a characterized accession for Gram negative E. coli. An untraceable characterized accession (PIR|S66013) for Gram positive B. subtilis scores well (205.0) in the full alignment. Characterized members are lacking in the archaea. Noise cutoff (72.4) was set to exclude M. loti paralog of rimI. Trusted cutoff (80.0) was set at next highest scoring member in the mini-database.
Probab=99.09 E-value=8e-10 Score=76.70 Aligned_cols=83 Identities=19% Similarity=0.398 Sum_probs=61.6
Q ss_pred HHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccc
Q psy838 56 TKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQ 135 (152)
Q Consensus 56 d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~q 135 (152)
|++.++++..+ .| ..+|+.+.+...+......++++..+ +++|||+.+....+ ..+|..++|+|++|
T Consensus 1 d~~~i~~~~~~----~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vg~~~~~~~~~----~~~i~~~~v~~~~r 67 (131)
T TIGR01575 1 DLKAVLEIEAA----AF---AFPWTEAQFAEELANYHLCYLLARIG--GKVVGYAGVQIVLD----EAHILNIAVKPEYQ 67 (131)
T ss_pred CHHHHHHHHHh----hC---CCCCCHHHHHHHhcCCCceEEEEecC--CeEEEEEEEEecCC----CeEEEEEEECHHHc
Confidence 45667777544 34 34788888777776555555555544 89999999876432 34688999999999
Q ss_pred cCCHHHHHHHHHHhhc
Q psy838 136 RKGLGKFMMQAFYTYI 151 (152)
Q Consensus 136 gkGiG~~Lm~~l~~~a 151 (152)
|+|+|+.||+.+++.+
T Consensus 68 g~G~g~~ll~~~~~~~ 83 (131)
T TIGR01575 68 GQGIGRALLRELIDEA 83 (131)
T ss_pred CCCHHHHHHHHHHHHH
Confidence 9999999999998875
No 15
>PHA01807 hypothetical protein
Probab=99.08 E-value=7.1e-10 Score=81.61 Aligned_cols=98 Identities=9% Similarity=-0.008 Sum_probs=62.9
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCC--CCC--hhh---hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC-Cc
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDI--GWD--PQG---KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF-GE 120 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~--~w~--~~~---~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~-~~ 120 (152)
...++.+|++....|..+. +...+. +|. ++. ...++.++....+++..+ +++|||+.+.+.... ..
T Consensus 6 ~~~~~~~d~~~~~~l~l~~----l~e~p~~~~w~s~ee~~~~~~~~~~~~~~~~lva~~d--g~lvG~~~l~~~~~~~~~ 79 (153)
T PHA01807 6 RVHAKAGTPSELQGLCWLA----IQELEEFTLFRSKEEALERILDSTESNDRTELLVFRD--GKLAGIAVLVFEDDPHVG 79 (153)
T ss_pred hhhhhhCCHHHHHHHHHHH----HHhCccCCCCCChHHHHHHHHHHhhCCCceEEEEEEC--CEEEEEEEEEcCCCccee
Confidence 3446677888888886553 333222 144 221 222234455555666655 899999998764322 11
Q ss_pred ceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
..+.|..|+|+|+|||+|+|++||+.+++.|+
T Consensus 80 ~i~~l~~lYV~pe~RG~GiG~~Ll~~~~~~Ar 111 (153)
T PHA01807 80 PCLGVQWQYVLPEYRNAGVAREFLRELIRLAG 111 (153)
T ss_pred eeccceeEEECHHHcCCCHHHHHHHHHHHHHH
Confidence 23345668999999999999999999998763
No 16
>PTZ00330 acetyltransferase; Provisional
Probab=99.08 E-value=3.2e-10 Score=80.94 Aligned_cols=95 Identities=12% Similarity=0.087 Sum_probs=62.5
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC----CCc-eEEEEEeCCCCCeEEEEEEEeecc---CCc
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD----DRA-CYLVAKNGSSSTPVAFSHFRFDVD---FGE 120 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~----~~~-~~~v~~~~~~~~ivGf~~~~~~~~---~~~ 120 (152)
..+++++|++.+.++... ... ...++.+... .+.+ ++. ..+++..+ +|++|||+.+..... .+.
T Consensus 9 ir~~~~~D~~~i~~l~~~----~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~vG~~~~~~~~~~~~~~~ 80 (147)
T PTZ00330 9 LRDLEEGDLGSVLELLSH----LTS--APALSQEELE-QIAARRRLAGVVTRVFVHSP-TQRIVGTASLFVEPKFTRGGK 80 (147)
T ss_pred EEEcccccHHHHHHHHHH----hcC--CCccchhHHH-HHHHHHhcCCCceEEEEEeC-CCEEEEEEEEEeccccccCCC
Confidence 467889999999988654 111 1123443322 2211 121 22333333 489999999864322 233
Q ss_pred ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
...+|..++|+|+|||+|+|+.||+.+++.|
T Consensus 81 ~~~~i~~~~V~~~~rg~Gig~~l~~~~~~~a 111 (147)
T PTZ00330 81 CVGHIEDVVVDPSYRGQGLGRALISDLCEIA 111 (147)
T ss_pred ceEEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 4678999999999999999999999999876
No 17
>PRK07757 acetyltransferase; Provisional
Probab=99.04 E-value=2.1e-09 Score=77.54 Aligned_cols=91 Identities=10% Similarity=0.080 Sum_probs=63.0
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY 126 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~ 126 (152)
...+++++|++.+.++..+. .... ..+++.+.....+ ..++++..+ +++|||+.+....+ ...+|.
T Consensus 3 ~ir~~~~~D~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~----~~~~i~~~~--~~lvG~~~l~~~~~---~~~~i~ 69 (152)
T PRK07757 3 EIRKARLSDVKAIHALINVY----AKKGLMLPRSLDELYENI----RDFYVAEEE--GEIVGCCALHILWE---DLAEIR 69 (152)
T ss_pred eEeeCCcccHHHHHHHHHHH----HhcCCccCCCHHHHHhcc----CcEEEEEEC--CEEEEEEEEEeccC---CceEEE
Confidence 45678899999999987541 2110 1234443332222 245555554 89999999875322 245788
Q ss_pred eeeeccccccCCHHHHHHHHHHhhc
Q psy838 127 ELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 127 ~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.++|+|+|||+|+|+.||+.+++.|
T Consensus 70 ~v~V~p~~rg~Glg~~Ll~~l~~~a 94 (152)
T PRK07757 70 SLAVSEDYRGQGIGRMLVEACLEEA 94 (152)
T ss_pred EEEECHHHcCCCHHHHHHHHHHHHH
Confidence 9999999999999999999999876
No 18
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=99.03 E-value=3e-09 Score=84.62 Aligned_cols=95 Identities=17% Similarity=0.120 Sum_probs=68.7
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCC---CceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDD---RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY 124 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~---~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~ 124 (152)
+..+++++|++.+.+|....- .+.. ..+|+++. ..++.++ ....+++..+ +++|||+.+....+. ..+
T Consensus 2 ~~~~l~~~d~~~v~~L~~~~~--~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~vG~~~~~~~~~~---~~~ 72 (292)
T TIGR03448 2 WRAALDADLRRDVRELLAAAT--AVDG-VAPVSEQV-LRGLREPGAGHTRHLVAVDS--DPIVGYANLVPARGT---DPA 72 (292)
T ss_pred ccccCCHHHHHHHHHHHHHHH--hcCC-CCCCCHHH-HhhccccCCCCceEEEEEEC--CEEEEEEEEEcCCCC---cce
Confidence 457899999999999987632 2222 34688765 4555432 2345555554 899999998754222 235
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|.+|+|+|++||+|||+.||+.+++.+
T Consensus 73 ~~~l~V~p~~rg~GiG~~Ll~~~~~~~ 99 (292)
T TIGR03448 73 MAELVVHPAHRRRGIGRALIRALLAKG 99 (292)
T ss_pred EEEEEECHhhcCCCHHHHHHHHHHHhc
Confidence 889999999999999999999998764
No 19
>PRK09831 putative acyltransferase; Provisional
Probab=99.03 E-value=2.6e-09 Score=77.05 Aligned_cols=89 Identities=13% Similarity=0.151 Sum_probs=61.5
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhh----------hhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKH----------SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF 118 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~----------~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~ 118 (152)
..+++++|++.+.++..+.....+ ...++++... ..... ...++++..+ |+++||+.+..
T Consensus 3 ir~a~~~D~~~l~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~--~~iiG~~~~~~---- 72 (147)
T PRK09831 3 IRNYQPGDFQQLCAIFIRAVTMTA---SQHYSPQQIAAWAQIDESRWKEKLA-KSQVRVAVIN--AQPVGFITCIE---- 72 (147)
T ss_pred cccCChhhHHHHHHHHHHHHHHhh---hhcCCHHHHHhccCCCHHHHHHHHh-cCceEEEEEC--CEEEEEEEehh----
Confidence 467889999999998877654433 1223332211 11112 3346666655 99999988731
Q ss_pred CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+|..++|+|++||+|+|+.||+.+++.|
T Consensus 73 ----~~i~~~~v~p~~~g~GiG~~Ll~~~~~~~ 101 (147)
T PRK09831 73 ----HYIDMLFVDPEYTRRGVASALLKPLIKSE 101 (147)
T ss_pred ----ceeeeEEECHHHcCCCHHHHHHHHHHHHh
Confidence 35788999999999999999999998865
No 20
>TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase. This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analagous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.
Probab=99.03 E-value=1.9e-09 Score=79.09 Aligned_cols=93 Identities=16% Similarity=0.237 Sum_probs=60.7
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeee
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL 128 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i 128 (152)
.+++++|++.+.+|..+. + ...++.........+ .....+++..+ ++++|||+.+..... ....++++.|
T Consensus 2 R~~~~~D~~~i~~L~~~~----~---~~~~~~~~~~~~~~~~~~~~~~v~~~~-~~~ivG~~~~~~~~~-~~~~~~i~~l 72 (157)
T TIGR02406 2 RPPRIEDGAGIWELVKDC----P---PLDLNSSYAYLLLCTDFADTSIVAESE-GGEIVGFVSGYLRPD-RPDVLFVWQV 72 (157)
T ss_pred CCCccccHHHHHHHHHhC----C---CCCcccceehhhhhhhcCCcEEEEEcC-CCeEEEEEEEEecCC-CCCeEEEEEE
Confidence 467889999999998763 1 111111111111111 22334454433 479999997654332 2345789999
Q ss_pred eeccccccCCHHHHHHHHHHhhc
Q psy838 129 QLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 129 ~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|+|++||+|+|++||+.+++.|
T Consensus 73 ~V~p~~rg~GiG~~L~~~l~~~a 95 (157)
T TIGR02406 73 AVDPRARGKGLARRLLEALLERV 95 (157)
T ss_pred EEChHhccCcHHHHHHHHHHHHH
Confidence 99999999999999999999875
No 21
>PRK03624 putative acetyltransferase; Provisional
Probab=99.00 E-value=2.4e-09 Score=75.00 Aligned_cols=91 Identities=16% Similarity=0.211 Sum_probs=62.0
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhh--hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQG--KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY 126 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~--~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~ 126 (152)
..+++++|++.+.++.... ....+|.+.. ....+..+...++++..+ +++||++.+..... ..++.
T Consensus 5 ir~~~~~d~~~i~~l~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~vG~~~~~~~~~----~~~i~ 72 (140)
T PRK03624 5 IRVFRQADFEAVIALWERC------DLTRPWNDPEMDIERKLNHDPSLFLVAEVG--GEVVGTVMGGYDGH----RGWAY 72 (140)
T ss_pred EEEcccccHHHHHHHHHhc------CCCcchhhHHHHHHHHhcCCCceEEEEEcC--CcEEEEEEeeccCC----CceEE
Confidence 4567889999999887652 1123454332 222233444555666544 89999998764321 24578
Q ss_pred eeeeccccccCCHHHHHHHHHHhhc
Q psy838 127 ELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 127 ~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.++|+|+|||+|+|+.||+.++..|
T Consensus 73 ~i~v~p~~rg~Gig~~ll~~~~~~~ 97 (140)
T PRK03624 73 YLAVHPDFRGRGIGRALVARLEKKL 97 (140)
T ss_pred EEEECHHHhCCCHHHHHHHHHHHHH
Confidence 8999999999999999999998865
No 22
>PF00583 Acetyltransf_1: Acetyltransferase (GNAT) family; InterPro: IPR000182 The N-acetyltransferases (NAT) (EC 2.3.1.-) are enzymes that use acetyl coenzyme A (CoA) to transfer an acetyl group to a substrate, a reaction implicated in various functions from bacterial antibiotic resistance to mammalian circadian rhythm and chromatin remodeling. The Gcn5-related N-acetyltransferases (GNAT) catalyze the transfer of the acetyl from the CoA donor to a primary amine of the acceptor. The GNAT proteins share a domain composed of four conserved sequence motifs A-D [, ]. This GNAT domain is named after yeast GCN5 (from General Control Nonrepressed) and related histone acetyltransferases (HATs) like Hat1 and PCAF. HATs acetylate lysine residues of amino terminal histone tails, resulting in transcription activation. Another category of GNAT, the aminoglycoside N-acetyltransferases, confer antibiotic resistance by catalyzing the acetylation of amino groups in aminoglycoside antibiotics []. GNAT proteins can also have anabolic and catabolic functions in both prokaryotes and eukaryotes [, , , , ]. The acetyltransferase/GNAT domain forms a structurally conserved fold of 6 to 7 beta strands (B) and 4 helices (H) in the topology B1-H1-H2-B2-B3-B4-H3-B5-H4-B6, followed by a C-terminal strand which may be from the same monomer or contributed by another [, ]. Motifs D (B2-B3), A (B4-H3) and B (B5-H4) are collectively called the HAT core [, , ], while the N-terminal motif C (B1-H1) is less conserved. Some proteins known to contain a GNAT domain: Yeast GCN5 and Hat1, which are histone acetyltransferases (EC 2.3.1.48). Human PCAF, a histone acetyltransferase. Mammalian serotonin N-acetyltransferase (SNAT) or arylalkylamine NAT (AANAT), which acetylates serotonin into a circadian neurohormone that may participate in light-dark rhythms, and human mood and behavior. Mammalian glucosamine 6-phosphate N-acetyltransferase (GNA1) (EC 2.3.1.4). Escherichia coli rimI and rimJ, which acetylate the N-terminal alanine of ribosomal proteins S18 and S5, respectively (EC 2.3.1.128). Mycobacterium tuberculosis aminoglycoside 2'-N-acetyltransferase (aac), which acetylates the 2' hydroxyl or amino group of a broad spectrum of aminoglycoside antibiotics. Bacillus subtilis bltD and paiA, which acetylate spermine and spermidine. This entry represents the entire GNAT domain.; GO: 0008080 N-acetyltransferase activity, 0008152 metabolic process; PDB: 3T9Y_A 2R7H_B 2OZH_A 1Y9W_B 1VKC_B 2OH1_C 3R9E_B 3R9G_B 3R9F_A 3R96_A ....
Probab=99.00 E-value=2.3e-09 Score=69.28 Aligned_cols=49 Identities=18% Similarity=0.274 Sum_probs=43.1
Q ss_pred CCCeEEEEEEEeeccC--CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 103 SSTPVAFSHFRFDVDF--GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~--~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|++||++.+.+..+. ....++|..|+|+|+|||+|+|+.||+.+++.|
T Consensus 4 ~~~ivg~~~~~~~~~~~~~~~~~~i~~~~v~~~~r~~Gig~~L~~~~~~~~ 54 (83)
T PF00583_consen 4 DGQIVGFASLRPPPEPFDHGNHAYIHRLAVDPEYRGQGIGSKLLQAAEEWA 54 (83)
T ss_dssp TTEEEEEEEEEEEETTTTTTTEEEEEEEEECGGGTTSSHHHHHHHHHHHHH
T ss_pred CCEEEEEEEEEECCCccccCCEEEEEEEEEcHHHhhCCCchhhhhhhhhhH
Confidence 4999999999886543 246899999999999999999999999999876
No 23
>PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional
Probab=98.98 E-value=5e-09 Score=75.11 Aligned_cols=90 Identities=16% Similarity=0.278 Sum_probs=63.5
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE 127 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~ 127 (152)
...+++++|++.+.++... .+ ..+|+.+.+.... .+....+.+..+ +++|||+.+....+ ...+..
T Consensus 3 ~iR~~~~~D~~~l~~l~~~----~~---~~~~~~~~~~~~~-~~~~~~~~~~~~--~~~vG~~~~~~~~~----~~~~~~ 68 (146)
T PRK09491 3 TISSLTPADLPAAYHIEQR----AH---AFPWSEKTFASNQ-GERYLNLKLTVN--GQMAAFAITQVVLD----EATLFN 68 (146)
T ss_pred chhcCChhhhHHHHHHHHh----cC---CCCCCHHHHHHHH-hcCceEEEEEEC--CeEEEEEEEEeecC----ceEEEE
Confidence 4567899999999998543 11 4678776554332 222222233444 89999998865432 245788
Q ss_pred eeeccccccCCHHHHHHHHHHhhc
Q psy838 128 LQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 128 i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|+|+|++||+|+|+.||+.+++.+
T Consensus 69 i~v~~~~rg~G~g~~ll~~~~~~~ 92 (146)
T PRK09491 69 IAVDPDYQRQGLGRALLEHLIDEL 92 (146)
T ss_pred EEECHHHccCCHHHHHHHHHHHHH
Confidence 999999999999999999998865
No 24
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase. This model represents a clade of amino-acid N-acetyltransferases acting mainly on glutamate in the first step of the "acetylated" ornithine biosynthesis pathway. For this reason it is also called N-acetylglutamate synthase. The enzyme may also act on aspartate.
Probab=98.98 E-value=3.6e-09 Score=89.33 Aligned_cols=93 Identities=8% Similarity=-0.015 Sum_probs=66.8
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC--CCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDI--GWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVL 123 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~--~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~ 123 (152)
+....+++++|++.+++|... ..+. .+ +|+.+.+. ++...++++..+ ++++||+.+.+..++ ...
T Consensus 282 y~~IR~at~~Dl~~I~~L~~~----~~~~-~~~~~~~~~~l~----~~~~~~~V~~~d--g~iVG~~~~~~~~~~--~~~ 348 (429)
T TIGR01890 282 FESIRQATIDDIGGIAALIRP----LEEQ-GILVRRSREYLE----REISEFSIIEHD--GNIIGCAALYPYAEE--DCG 348 (429)
T ss_pred hhheEECCHHHHHHHHHHHHH----HHHc-CCchhhhHHHHH----hhcCcEEEEEEC--CEEEEEEEEEecCCC--CeE
Confidence 445677899999999999753 2222 22 34444433 333345566555 899999998764332 356
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|..|+|+|+|||+|+|++||+.++++|
T Consensus 349 ~I~~l~V~p~~Rg~GiG~~Ll~~l~~~A 376 (429)
T TIGR01890 349 EMACLAVSPEYQDGGRGERLLAHIEDRA 376 (429)
T ss_pred EEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 7999999999999999999999999886
No 25
>PF13508 Acetyltransf_7: Acetyltransferase (GNAT) domain; PDB: 3EY5_A 3FRM_B 3D8P_B 3GY9_A 3GYA_A 3S6F_A 2Q7B_A 1CM0_B 1XEB_B 1Y7R_A ....
Probab=98.96 E-value=4.7e-09 Score=68.05 Aligned_cols=53 Identities=19% Similarity=0.316 Sum_probs=42.2
Q ss_pred ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+.++++.++ +++||++.+....+ ..+|..|+|+|++||+|+|+.||+.+.+.+
T Consensus 3 ~~~~~~~~~--~~ivG~~~~~~~~~----~~~i~~~~v~~~~rg~Gig~~ll~~~~~~~ 55 (79)
T PF13508_consen 3 ERFFVAEDD--GEIVGFIRLWPNED----FAYIGYLAVDPEYRGKGIGSKLLNYLLEKA 55 (79)
T ss_dssp EEEEEEEET--TEEEEEEEEEETTT----EEEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred cEEEEEEEC--CEEEEEEEEEEcCC----EEEEEEEEECHHHcCCCHHHHHHHHHHHHc
Confidence 345566655 99999999965422 678999999999999999999999998764
No 26
>PLN02706 glucosamine 6-phosphate N-acetyltransferase
Probab=98.94 E-value=8.4e-09 Score=74.17 Aligned_cols=98 Identities=16% Similarity=0.102 Sum_probs=63.7
Q ss_pred cCCCCCHHHHH-HHHHHHHHhHHHHHHhCCCCCChhhhhhhhc---C-CCceEE-EEEeCCCCCeEEEEEEEeec---cC
Q psy838 48 KSTDLGAETKK-WVWELEEKNMKHSYEVCDIGWDPQGKHSEMF---D-DRACYL-VAKNGSSSTPVAFSHFRFDV---DF 118 (152)
Q Consensus 48 ~~~~l~~~d~~-~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~---~-~~~~~~-v~~~~~~~~ivGf~~~~~~~---~~ 118 (152)
+..+++++|++ .+..+..+ .. . ..+|+.+.+.+.+. + +...++ ++..+.++++||++.+.+.. ..
T Consensus 8 ~ir~~~~~D~~~~~~~~~~~----~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~ 81 (150)
T PLN02706 8 KVRRLEISDKSKGFLELLQQ----LT-V-VGDVTEEEFEARFQELASLGDDHLICVIEDAASGRIIATGSVFVERKFIRN 81 (150)
T ss_pred EEeEhhhcccchHHHHHHHh----cc-C-CCCCCHHHHHHHHHHHHhCCCcEEEEEEEeCCCCcEEEEEEEEEEeecccC
Confidence 35667788877 36665543 11 1 23677666655543 2 233333 33332248999999875321 12
Q ss_pred CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.....+|..|+|+|+|||+|+|+.||+.++++|
T Consensus 82 ~~~~~~i~~i~V~~~~rg~GiG~~ll~~~~~~a 114 (150)
T PLN02706 82 CGKVGHIEDVVVDSAARGKGLGKKIIEALTEHA 114 (150)
T ss_pred CCcEEEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 234567899999999999999999999999886
No 27
>PRK07922 N-acetylglutamate synthase; Validated
Probab=98.93 E-value=6.3e-09 Score=77.33 Aligned_cols=93 Identities=8% Similarity=0.089 Sum_probs=62.1
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE 127 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~ 127 (152)
...+++++|.+.+.+|..+.. ........ . ......+...++++..+ +++++||+.+....+ ....|..
T Consensus 7 ~iR~a~~~D~~~i~~L~~~~~-----~~~~~~~~-~-~~~~~~~~~~~~va~~~-~~~iiG~~~~~~~~~---~~~~i~~ 75 (169)
T PRK07922 7 TVRRARTSDVPAIKRLVDPYA-----QGRILLEK-N-LVTLYEAVQEFWVAEHL-DGEVVGCGALHVMWE---DLAEIRT 75 (169)
T ss_pred eeecCCHhhHHHHHHHHHHHh-----hcCccccc-h-HHHHHhhcCcEEEEEec-CCcEEEEEEEeecCC---CceEEEE
Confidence 456789999999999876421 11111111 1 11122233445566522 489999998865432 2466889
Q ss_pred eeeccccccCCHHHHHHHHHHhhc
Q psy838 128 LQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 128 i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
++|+|++||+|+|++||+.+++.|
T Consensus 76 l~V~p~~rgkGiG~~Ll~~~~~~a 99 (169)
T PRK07922 76 VAVDPAARGRGVGHAIVERLLDVA 99 (169)
T ss_pred EEECHHHhCCCHHHHHHHHHHHHH
Confidence 999999999999999999999876
No 28
>PF13527 Acetyltransf_9: Acetyltransferase (GNAT) domain; PDB: 3SXN_C 2I00_D 1M4D_B 1M44_A 1M4G_B 1M4I_A 2OZG_A 2HV2_F 3N7Z_A 3RYO_B ....
Probab=98.93 E-value=1.9e-09 Score=75.32 Aligned_cols=92 Identities=15% Similarity=0.196 Sum_probs=62.0
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh---hhhhhhcCCCceEEEEEeCCCCCeEEEEEEEee--ccCC--cce
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ---GKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFD--VDFG--EPV 122 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~---~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~--~~~~--~~~ 122 (152)
..++++|++.+.+|... .|... .+.. .......++ +..+++.++ +++||.+.+.+. ...+ -+.
T Consensus 3 R~~~~~d~~~i~~l~~~----~F~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~--~~ivg~~~~~~~~~~~~g~~~~~ 72 (127)
T PF13527_consen 3 RPLTESDFEQIIELFNE----AFGDS---ESPPEIWEYFRNLYGP-GRCVVAEDD--GKIVGHVGLIPRRLSVGGKKFKA 72 (127)
T ss_dssp EEE-GGGHHHHHHHHHH----HTTT----CHHHHHHHHHHHHHHT-TEEEEEEET--TEEEEEEEEEEEEEEETTEEEEE
T ss_pred eECCHHHHHHHHHHHHH----HCCCC---CCchhhhhhhhcccCc-CcEEEEEEC--CEEEEEEEEEEEEEEECCEEEEE
Confidence 45789999999999876 34321 1111 112223333 356666665 999999988653 1122 346
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+|..++|+|+|||+|+|++||+.+++.+
T Consensus 73 ~~i~~v~v~p~~R~~Gl~~~L~~~~~~~~ 101 (127)
T PF13527_consen 73 AYIGDVAVDPEYRGRGLGRQLMRALLERA 101 (127)
T ss_dssp EEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred EEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 88999999999999999999999998765
No 29
>PRK05279 N-acetylglutamate synthase; Validated
Probab=98.93 E-value=6.4e-09 Score=88.02 Aligned_cols=94 Identities=9% Similarity=-0.026 Sum_probs=66.8
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY 124 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~ 124 (152)
+....+++++|++.+.+|... ..... ...|+.+.+ .+....++++..+ ++++||+.+....+ ....+
T Consensus 294 y~~IR~at~~D~~~I~~L~~~----~~~~~~~~~~~~~~l----~~~~~~~~va~~d--g~iVG~~~~~~~~~--~~~~~ 361 (441)
T PRK05279 294 LEQLRRATIDDVGGILELIRP----LEEQGILVRRSREQL----EREIDKFTVIERD--GLIIGCAALYPFPE--EKMGE 361 (441)
T ss_pred hHHeEeCCHHHHHHHHHHHHH----HHHcCCccccCHHHH----hcccCcEEEEEEC--CEEEEEEEEEEcCC--CCeEE
Confidence 556678899999999999754 11110 123554433 3333345666655 89999998765322 23567
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|..|+|+|+|||+|+|++||+.++++|
T Consensus 362 I~~l~V~p~~Rg~GiG~~Ll~~l~~~a 388 (441)
T PRK05279 362 MACLAVHPDYRGSGRGERLLKRIEQRA 388 (441)
T ss_pred EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 999999999999999999999999876
No 30
>PRK10140 putative acetyltransferase YhhY; Provisional
Probab=98.93 E-value=5.6e-09 Score=75.41 Aligned_cols=98 Identities=9% Similarity=0.070 Sum_probs=60.0
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhC-CCC-CChhhhhhhhcC-CCceEEEEEeCCCCCeEEEEEEEeecc-CCcceEE
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVC-DIG-WDPQGKHSEMFD-DRACYLVAKNGSSSTPVAFSHFRFDVD-FGEPVLY 124 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~-w~~~~~~~el~~-~~~~~~v~~~~~~~~ivGf~~~~~~~~-~~~~~~~ 124 (152)
..+++++|++.+.++..+.. .+... ..+ .+++.....+.+ +....+++..+ +++||++++..... .......
T Consensus 6 lr~~~~~D~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~vG~~~~~~~~~~~~~~~~~ 81 (162)
T PRK10140 6 IRHAETRDYEAIRQIHAQPE--VYHNTLQVPHPSDHMWQERLADRPGIKQLVACID--GDVVGHLTIDVQQRPRRSHVAD 81 (162)
T ss_pred EEecchhhHHHHHHHHhCcc--cccccccCCCcCHHHHHHHhhcCCCcEEEEEEEC--CEEEEEEEEecccccccceEEE
Confidence 46678899999999875421 11110 111 233333333433 23334455444 89999999875321 1111222
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++|+|+|||+|+|+.||+.++++|
T Consensus 82 -~~~~v~p~~rg~Gig~~ll~~l~~~~ 107 (162)
T PRK10140 82 -FGICVDSRWKNRGVASALMREMIEMC 107 (162)
T ss_pred -EEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 35899999999999999999998875
No 31
>PLN02825 amino-acid N-acetyltransferase
Probab=98.92 E-value=7.1e-09 Score=89.29 Aligned_cols=94 Identities=9% Similarity=0.049 Sum_probs=69.5
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY 124 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~ 124 (152)
+-...+++++|++.|.+|.+.. .+.. ...|+.+.+..++ ..++++..+ +++||++.+.+..+ ....+
T Consensus 367 ~e~IR~At~eDi~~I~~Li~~l----ee~g~lv~rs~e~le~ei----~~f~V~e~D--g~IVG~aal~~~~~--~~~aE 434 (515)
T PLN02825 367 YEGTRMARVEDLAGIRQIIRPL----EESGILVRRTDEELLRAL----DSFVVVERE--GSIIACAALFPFFE--EKCGE 434 (515)
T ss_pred HhhheeCCHHHHHHHHHHHHHH----HHcCCCcCCCHHHHHhcC----CcEEEEEEC--CEEEEEEEEEeecC--CCcEE
Confidence 5556789999999999998752 2221 1245555554443 346666655 99999998765433 23567
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|..|+|+|+|||+|+|++||+.+++.|
T Consensus 435 I~~laV~P~yRGkGiG~~LL~~le~~A 461 (515)
T PLN02825 435 VAAIAVSPECRGQGQGDKLLDYIEKKA 461 (515)
T ss_pred EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 999999999999999999999999886
No 32
>TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein.
Probab=98.92 E-value=1e-08 Score=89.16 Aligned_cols=92 Identities=14% Similarity=0.054 Sum_probs=68.5
Q ss_pred CHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEee---ccCCcceEEEeeee
Q psy838 53 GAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFD---VDFGEPVLYCYELQ 129 (152)
Q Consensus 53 ~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~---~~~~~~~~~I~~i~ 129 (152)
+++|++++.+|..+. . ..+|+.+....++..+...++++..+.+|++|||+.+... ..+.....+|..|+
T Consensus 90 ~~~D~~~I~~L~~~~----~---~~p~~~~~~~~~~~~~~~~~~vA~~~~~g~IVG~~~~~~~~~~~~d~~~~~~i~~l~ 162 (547)
T TIGR03103 90 GPADVDAINRLYAAR----G---MVPVRVDFVLDHRHSRAITYLVAEDEASGAIIGTVMGVDHRKAFNDPEHGSSLWCLA 162 (547)
T ss_pred ChhHHHHHHHHHHhc----C---CCCCCHHHHHHHhcCCCceEEEEEECCCCeEEEEEEEEeccccccCCCCCeEEEEEE
Confidence 678999999987652 1 2467777776666666667777765335899999975321 22333456799999
Q ss_pred eccccccCCHHHHHHHHHHhhc
Q psy838 130 LEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 130 V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|+|+|||+|+|++||+.+++.+
T Consensus 163 V~P~~Rg~GIG~~Ll~~l~e~a 184 (547)
T TIGR03103 163 VDPQAAHPGVGEALVRALAEHF 184 (547)
T ss_pred ECHHHcCCCHHHHHHHHHHHHH
Confidence 9999999999999999999875
No 33
>PRK10514 putative acetyltransferase; Provisional
Probab=98.91 E-value=1.2e-08 Score=72.75 Aligned_cols=94 Identities=13% Similarity=0.069 Sum_probs=60.7
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhc--CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMF--DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC 125 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~--~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I 125 (152)
...+++++|.+.++++..++....... ..+++.+.....+. .+...++++..+ ++++|||+.+.. .++
T Consensus 3 ~ir~~~~~D~~~l~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~iG~~~~~~--------~~~ 72 (145)
T PRK10514 3 SIRRSRHEEGERLVAIWRRSVDATHDF-LSAEDRAEIEELVRSFLPEAPLWVAVDE-RDQPVGFMLLSG--------GHM 72 (145)
T ss_pred eeeecchhhHHHHHHHHHHHHHHhCcc-cCchhHHHHHHHHHHHhccCceEEEEec-CCcEEEEEEEec--------CcE
Confidence 346788999999999887654322110 11334433333222 123334444332 489999998742 125
Q ss_pred eeeeeccccccCCHHHHHHHHHHhhc
Q psy838 126 YELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++|+|+|||+|+|+.||+.+++.+
T Consensus 73 ~~~~v~p~~rgkGig~~Ll~~~~~~~ 98 (145)
T PRK10514 73 EALFVDPDVRGCGVGRMLVEHALSLH 98 (145)
T ss_pred eEEEECHHhccCCHHHHHHHHHHHhc
Confidence 67999999999999999999998754
No 34
>COG0456 RimI Acetyltransferases [General function prediction only]
Probab=98.85 E-value=1.5e-08 Score=74.29 Aligned_cols=98 Identities=14% Similarity=0.243 Sum_probs=68.5
Q ss_pred CCCCCHHHHH--HHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCC-CC----CeEEEEEEEeeccCC--
Q psy838 49 STDLGAETKK--WVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGS-SS----TPVAFSHFRFDVDFG-- 119 (152)
Q Consensus 49 ~~~l~~~d~~--~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~-~~----~ivGf~~~~~~~~~~-- 119 (152)
...+...|+. .++++... .|... .+|+...+...+.+....++++..+. ++ +++||+..+...+..
T Consensus 14 ir~~~~~d~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~l~~~~~~~~v~~~~~~~~~~~~~~~G~~~~~~~~~~~~~ 88 (177)
T COG0456 14 IREAINKDLLDVALAALEAR----TFDIR-LPWSREYFEKDLTQAPELLLVAETGGLDGLLDGKVVGFLLVRVVDGRPSA 88 (177)
T ss_pred hhhhhhcccchHHHHHHhhh----cCCCC-CcchHHHHHHHHhhCcceeEEEEecccCCCcccceeEEEEEEEecCCccc
Confidence 3455666666 66666554 23110 47999998888888777777766420 12 599999997443321
Q ss_pred cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
....+|..|+|+|+|||+|||+.||+.+++.+
T Consensus 89 ~~~~~i~~iaV~p~~r~~Gig~~Ll~~~~~~~ 120 (177)
T COG0456 89 DHEGHIYNLAVDPEYRGRGIGRALLDEALERL 120 (177)
T ss_pred cCccEEEEEEEChHhhcCCHHHHHHHHHHHHH
Confidence 12467999999999999999999999887654
No 35
>PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional
Probab=98.78 E-value=3.6e-08 Score=86.73 Aligned_cols=92 Identities=12% Similarity=0.020 Sum_probs=64.4
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE 127 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~ 127 (152)
...+++++|++.+.+|...- ... ....+|+.+. +..+...++++..+ +++|||+.+... +. ...+|..
T Consensus 465 ~IR~a~~~D~~~I~~L~~~~-~~~--~~~~~~~~~~----l~~~~~~~~Va~~~--g~IVG~~~l~~~-~~--~~~~I~~ 532 (614)
T PRK12308 465 KVRPARLTDIDAIEGMVAYW-AGL--GENLPRSRNE----LVRDIGSFAVAEHH--GEVTGCASLYIY-DS--GLAEIRS 532 (614)
T ss_pred EEEECCHHHHHHHHHHHHHH-Hhh--hcccccCHHH----HhcccCcEEEEEEC--CEEEEEEEEEEc-CC--CeEEEEE
Confidence 35678899999999987541 111 1124455443 33333455666555 899999988653 22 2467999
Q ss_pred eeeccccccCCHHHHHHHHHHhhc
Q psy838 128 LQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 128 i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|+|+|+|||+|||+.||+.+++.|
T Consensus 533 i~V~P~~rGkGIGk~Ll~~l~~~a 556 (614)
T PRK12308 533 LGVEAGWQVQGQGSALVQYLVEKA 556 (614)
T ss_pred EEECHHHcCCCHHHHHHHHHHHHH
Confidence 999999999999999999999876
No 36
>TIGR01686 FkbH FkbH-like domain. The C-terminal portion of this domain is unique to this family (by BLAST).
Probab=98.77 E-value=7.8e-08 Score=78.25 Aligned_cols=97 Identities=11% Similarity=0.131 Sum_probs=71.9
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeC--CCCCeEEEEEEEeeccCCcceEEE
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNG--SSSTPVAFSHFRFDVDFGEPVLYC 125 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~--~~~~ivGf~~~~~~~~~~~~~~~I 125 (152)
+...++++|++.+.+|.... ..|......|+++.+...+.++ ..+.+...+ .++.+|||+.+.... ..++|
T Consensus 188 ~Ir~a~~~Dl~ri~~L~~~t--nqfn~~~~~~s~~~i~~~l~~~-~~~~~~~~d~~gd~givG~~~~~~~~----~~~~I 260 (320)
T TIGR01686 188 NISKNDEQNVQRVEELLGRT--NQFNATYTRLNQEDVAQHMQKE-EIVTVSMSDRFGDSGIIGIFVFEKKE----GNLFI 260 (320)
T ss_pred EEEECChhhhHHHHHHHHhH--HhhhccCccCCHHHHHHHhcCC-CEEEEEEEecCCCCceEEEEEEEecC----CcEEE
Confidence 46788999999999998653 2343334578988887777665 333333321 146899999886532 25679
Q ss_pred eeeeeccccccCCHHHHHHHHHHhhc
Q psy838 126 YELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..|+|+|++||+|+|+.||+.+++.|
T Consensus 261 ~~l~vs~r~~grGig~~Ll~~l~~~a 286 (320)
T TIGR01686 261 DDLCMSCRALGRGVETRMLRWLFEQA 286 (320)
T ss_pred EEEEEcHhHhcCcHHHHHHHHHHHHH
Confidence 99999999999999999999999876
No 37
>COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]
Probab=98.74 E-value=9.5e-08 Score=71.35 Aligned_cols=101 Identities=14% Similarity=0.084 Sum_probs=62.5
Q ss_pred CCCCCHHHHHHHHHHHHHhHHH---HHHhCCCCCChhhhhhhh--cCCCc-eEEEEEeCCCCCeEEEEEEEeecc-CCcc
Q psy838 49 STDLGAETKKWVWELEEKNMKH---SYEVCDIGWDPQGKHSEM--FDDRA-CYLVAKNGSSSTPVAFSHFRFDVD-FGEP 121 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~---~y~~~~~~w~~~~~~~el--~~~~~-~~~v~~~~~~~~ivGf~~~~~~~~-~~~~ 121 (152)
..+++..|++.|.++....... .|+. ++=+.+....-+ ...+. .++++..+ +|+++||+.+..-.+ .+=.
T Consensus 4 ir~~~~~Dl~~I~~IY~~~v~~~~a~~e~--~~~~~~~~~~~~~~~~~~g~p~~V~~~~-~g~v~G~a~~~~fr~r~ay~ 80 (169)
T COG1247 4 IRPATAADLEAILEIYNGAVENTAATFEE--DPVSLEERAAWFSGRTRDGYPVVVAEEE-DGKVLGYASAGPFRERPAYR 80 (169)
T ss_pred EecChHHhHHHHHHHHHHhhhcceEEEec--cCCCHHHHHHHHHhcccCCceEEEEEcC-CCeEEEEEEeeeccCccccc
Confidence 3567889999999987763211 1111 111222222112 22333 44444433 589999999875433 2222
Q ss_pred eEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
...-+-|||+|+.||+|+|++||+.+++.|.
T Consensus 81 ~tve~SiYv~~~~~g~GiG~~Ll~~Li~~~~ 111 (169)
T COG1247 81 HTVELSIYLDPAARGKGLGKKLLQALITEAR 111 (169)
T ss_pred eEEEEEEEECcccccccHHHHHHHHHHHHHH
Confidence 3345679999999999999999999998763
No 38
>TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase. This model represents the WecD protein (Formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetytransferases (pfam00583).
Probab=98.73 E-value=1.2e-07 Score=71.38 Aligned_cols=99 Identities=13% Similarity=0.132 Sum_probs=64.1
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHH-HHHhCCCCCChhhh--------hhhhc-CC-CceEEEEEeCCCCCeEEEEEEEe
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKH-SYEVCDIGWDPQGK--------HSEMF-DD-RACYLVAKNGSSSTPVAFSHFRF 114 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~-~y~~~~~~w~~~~~--------~~el~-~~-~~~~~v~~~~~~~~ivGf~~~~~ 114 (152)
-....+++++|++.+.++..+.... .| ..+|.+... ..+.. +. ....+++..+ ++++|||+.+..
T Consensus 43 ~~~lR~~~~~D~~~l~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~g~iiG~i~l~~ 118 (191)
T TIGR02382 43 DPGARVATETDIPALRQLASAAFALSRF---RAPWYAPDDSGRFYAQWVENAVRGTFDHQCLILRDA-SGDPRGYVTLRE 118 (191)
T ss_pred CCcceeCChhhHHHHHHHHHHHhhcccc---CCCCcCHHHHHHHHHHHHHHHhcCCCCCeEEEEEcc-CCeEEEEEEEEe
Confidence 3467889999999999997764211 11 224432110 01121 22 2222333333 489999999865
Q ss_pred eccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 115 DVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 115 ~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..+ +..+|..++|+|++||+|+|+.||+.++++|
T Consensus 119 ~~~---~~~~i~~l~V~p~~rGkG~G~~ll~~~~~~a 152 (191)
T TIGR02382 119 LND---TDARIGLLAVFPGAQSRGIGAELMQTALNWC 152 (191)
T ss_pred cCC---CceEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 322 2356889999999999999999999999886
No 39
>PRK10975 TDP-fucosamine acetyltransferase; Provisional
Probab=98.70 E-value=2.1e-07 Score=70.07 Aligned_cols=97 Identities=14% Similarity=0.126 Sum_probs=62.3
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHH-HHHhCCCCCChhh----hhhh-----hcCC-CceEEEEEeCCCCCeEEEEEEEeec
Q psy838 48 KSTDLGAETKKWVWELEEKNMKH-SYEVCDIGWDPQG----KHSE-----MFDD-RACYLVAKNGSSSTPVAFSHFRFDV 116 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~-~y~~~~~~w~~~~----~~~e-----l~~~-~~~~~v~~~~~~~~ivGf~~~~~~~ 116 (152)
...+++++|++.+.++..+.... .| ..+|.... ...+ +.+. ...++++..+ ++++|||+.+....
T Consensus 48 ~iR~a~~~D~~~i~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~g~~vG~~~l~~~~ 123 (194)
T PRK10975 48 GARVATETDIPALRQLAAQAFAQSRF---RAPWYAPDDSGRFYAQWIENAVRGTFDHQCLLLRDA-SGQIQGFVTLRELN 123 (194)
T ss_pred CcccCCcccHHHHHHHHHHHhhhccc---cCccCChhHHHHHHHHHHHHhhccccCCcEEEEEcC-CCCEEEEEEEEecC
Confidence 34578889999999987653211 12 12342221 1111 1111 2244455433 58999999986532
Q ss_pred cCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 117 DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 117 ~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+ ...+|..++|+|+|||+|+|+.||+.++++|
T Consensus 124 ~---~~~~i~~~~V~p~~rg~Gig~~Ll~~~~~~a 155 (194)
T PRK10975 124 D---TDARIGLLAVFPGAQGRGIGARLMQAALNWC 155 (194)
T ss_pred C---CceEEEEEEEChhhcCCCHHHHHHHHHHHHH
Confidence 2 2467888999999999999999999999876
No 40
>COG3153 Predicted acetyltransferase [General function prediction only]
Probab=98.69 E-value=1e-07 Score=71.29 Aligned_cols=93 Identities=12% Similarity=0.030 Sum_probs=67.4
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCC----ceEEEEEeCCCCCeEEEEEEEeec-c-CCcce
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDR----ACYLVAKNGSSSTPVAFSHFRFDV-D-FGEPV 122 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~----~~~~v~~~~~~~~ivGf~~~~~~~-~-~~~~~ 122 (152)
....++.|++.+.++.++. |. ++.+......+.... ..-+++.++ |++||.+.|+... . .....
T Consensus 6 ir~e~~~d~~~i~~~~~~a----F~----~~~e~~~v~~lR~~~~~~~~LslVA~d~--g~vvG~Il~s~v~~~g~~~~~ 75 (171)
T COG3153 6 IRTETPADIPAIEALTREA----FG----PGREAKLVDKLREGGRPDLTLSLVAEDD--GEVVGHILFSPVTVGGEELGW 75 (171)
T ss_pred EEecChhhHHHHHHHHHHH----hh----cchHHHHHHHHHhcCCcccceeEEEeeC--CEEEEEEEEeEEEecCcccce
Confidence 4567889999999998874 42 455555555555432 334455544 9999999998643 2 23445
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+-|--|+|+|++||||||+.||...++.+
T Consensus 76 ~~LaPLaV~p~~qg~GIG~~Lvr~~le~a 104 (171)
T COG3153 76 LGLAPLAVDPEYQGQGIGSALVREGLEAL 104 (171)
T ss_pred EEEEeEEEchhhcCCcHHHHHHHHHHHHH
Confidence 66888999999999999999999988765
No 41
>PRK01346 hypothetical protein; Provisional
Probab=98.68 E-value=2e-07 Score=77.88 Aligned_cols=94 Identities=13% Similarity=-0.003 Sum_probs=63.9
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh--hcCCCceEEEEEeCCCCCeEEEEEEEeec---cCC--c
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE--MFDDRACYLVAKNGSSSTPVAFSHFRFDV---DFG--E 120 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e--l~~~~~~~~v~~~~~~~~ivGf~~~~~~~---~~~--~ 120 (152)
+..+++++|++.+.+|... .|.. .++++..... ...+ ...+++..+ +++||++.+.... ..+ .
T Consensus 8 ~iR~~~~~D~~~i~~L~~~----~f~~---~~~~~~~~~~~~~~~~-~~~~va~~~--~~lvg~~~~~~~~~~~~~~~~~ 77 (411)
T PRK01346 8 TIRTATEEDWPAWFRAAAT----GFGD---SPSDEELEAWRALVEP-DRTLGAFDG--DEVVGTAGAFDLRLTVPGGAVL 77 (411)
T ss_pred eeecCCHHHHHHHHHHHHH----HcCC---CCChHHHHHHHHhcCc-CCeEEEEEC--CEEEEEEEEeccccccCCCCcc
Confidence 3567789999999999754 3422 2233222111 1222 334556654 8999999876421 112 3
Q ss_pred ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+..+|..++|+|+|||+|+|+.||+.+++.+
T Consensus 78 ~~~~i~~v~V~P~~RgrGig~~Ll~~~l~~a 108 (411)
T PRK01346 78 PAAGVTAVTVAPTHRRRGLLTALMREQLRRI 108 (411)
T ss_pred ceeEEEEEEEChhhcCCCHHHHHHHHHHHHH
Confidence 6788999999999999999999999998865
No 42
>cd04301 NAT_SF N-Acyltransferase superfamily: Various enzymes that characteristically catalyze the transfer of an acyl group to a substrate. NAT (N-Acyltransferase) is a large superfamily of enzymes that mostly catalyze the transfer of an acyl group to a substrate and are implicated in a variety of functions, ranging from bacterial antibiotic resistance to circadian rhythms in mammals. Members include GCN5-related N-Acetyltransferases (GNAT) such as Aminoglycoside N-acetyltransferases, Histone N-acetyltransferase (HAT) enzymes, and Serotonin N-acetyltransferase, which catalyze the transfer of an acetyl group to a substrate. The kinetic mechanism of most GNATs involves the ordered formation of a ternary complex: the reaction begins with Acetyl Coenzyme A (AcCoA) binding, followed by binding of substrate, then direct transfer of the acetyl group from AcCoA to the substrate, followed by product and subsequent CoA release. Other family members include Arginine/ornithine N-succinyltransfera
Probab=98.67 E-value=9.9e-08 Score=56.93 Aligned_cols=48 Identities=21% Similarity=0.284 Sum_probs=40.9
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+++++|++.+..... ....++|..+.|+|++||+|+|+.+|+.+++++
T Consensus 7 ~~~~ig~~~~~~~~~-~~~~~~l~~~~v~~~~~~~g~~~~~~~~~~~~~ 54 (65)
T cd04301 7 DGEIVGFASLSPDGS-GGDTAYIGDLAVLPEYRGKGIGSALLEAAEEEA 54 (65)
T ss_pred CCEEEEEEEEEecCC-CCccEEEEEEEECHHHcCcCHHHHHHHHHHHHH
Confidence 389999999987543 235688999999999999999999999998865
No 43
>PRK13688 hypothetical protein; Provisional
Probab=98.65 E-value=3.6e-07 Score=67.43 Aligned_cols=56 Identities=14% Similarity=0.155 Sum_probs=41.3
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeec-------cCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDV-------DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~-------~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
...++++..+ +++||++.+.... ......++|.+|+|+|+|||+|+|++||+.+.+
T Consensus 44 ~~~~~~~~~~--~~~VG~~~l~~~dg~~~~~~~~~~~~~~L~~l~V~p~~rgkGiG~~Ll~~a~~ 106 (156)
T PRK13688 44 ESPFYGIYYG--DSLVARMSLYKKGGVEEPYFEDTQDYLELWKLEVLPKYQNRGYGEMLVDFAKS 106 (156)
T ss_pred CCCEEEEEEC--CEEEEEEEEEecCCcccccccCCCCeEEEEEEEECHHHcCCCHHHHHHHHHHH
Confidence 3444556655 8999998775321 112345789999999999999999999997764
No 44
>PF13523 Acetyltransf_8: Acetyltransferase (GNAT) domain; PDB: 2VQY_A 2BUE_A 1V0C_A 1YK3_D 2PR8_A 2QIR_A 2PRB_A 2QML_A 2PC1_A.
Probab=98.62 E-value=1.4e-07 Score=68.05 Aligned_cols=97 Identities=11% Similarity=0.151 Sum_probs=63.5
Q ss_pred CCC-HHHHHHHHHHHHHhHHHHHHhCCCCCCh---hhhhhhhc-CCCceEEEEEeCCCCCeEEEEEEEeecc---CCcce
Q psy838 51 DLG-AETKKWVWELEEKNMKHSYEVCDIGWDP---QGKHSEMF-DDRACYLVAKNGSSSTPVAFSHFRFDVD---FGEPV 122 (152)
Q Consensus 51 ~l~-~~d~~~i~~l~~~~m~~~y~~~~~~w~~---~~~~~el~-~~~~~~~v~~~~~~~~ivGf~~~~~~~~---~~~~~ 122 (152)
+++ .+|++.|.++..+.--..+-. ..|++ +.....+. ++....+|+..+ |+++||+.+....+ .....
T Consensus 3 ~a~~~~Dl~~i~~w~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~~~~~~v~~~d--g~~~g~~~~~~~~~~~~~~~~~ 78 (152)
T PF13523_consen 3 PATTPDDLPLILQWLNQPHVREFWD--QDPSQEWVEEYPEQLEADPGHHPYVAEDD--GEPIGYFEIYWPDEDYDADDGD 78 (152)
T ss_dssp E---GGGHHHHHHHHTSHHHHCCH---CCCTHHHHHHHHHHHCHTTTEEEEEEEET--TEEEEEEEEEEGGGSS---TTE
T ss_pred eCccHHHHHHHHHHHHhHHHHHHcc--CCCCHHHHHHHHhhhcccCCceEEEEEEC--CEEEEEEEEecccccccCCCCE
Confidence 466 899999999876642122211 12332 22333453 566677777776 99999998754211 12345
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++.+.++|++||+|+|+.+|..+++.+
T Consensus 79 ~~~~~~~~~~~~rg~G~g~~~~~~~~~~~ 107 (152)
T PF13523_consen 79 RGIHRLIVDPEYRGQGLGKAMLRALIEFL 107 (152)
T ss_dssp EEEEEEESTGGGTTSSHHHHHHHHHHHHH
T ss_pred EEEeeeeechhhcCCCHHHHHHHHHHHHH
Confidence 66888899999999999999999988764
No 45
>PRK10562 putative acetyltransferase; Provisional
Probab=98.61 E-value=5e-07 Score=64.73 Aligned_cols=88 Identities=13% Similarity=0.104 Sum_probs=56.2
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhh-------hhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcc
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHS-------EMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEP 121 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~-------el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~ 121 (152)
..+++.+|++.+++|..+.- .. ..++.+..... +...+....+++..+ +++|||+.+...
T Consensus 2 ir~~~~~D~~~i~~l~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~iG~~~~~~~------ 68 (145)
T PRK10562 2 IREYQPSDLPAILQLWLEST--IW---AHPFIKEQYWRESAPLVRDVYLPAAQTWVWEED--GKLLGFVSVLEG------ 68 (145)
T ss_pred cccccchhhHHHHHHHHHhc--cc---cCCCCCHHHHHHhHHHhhhhhcCcccEEEEEEC--CEEEEEEEEeec------
Confidence 45788999999999865421 11 11222211111 111233334455554 899999987432
Q ss_pred eEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 122 VLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
.+|..++|+|+|||+|+|+.||+.+++.
T Consensus 69 -~~i~~~~v~~~~rg~G~g~~ll~~~~~~ 96 (145)
T PRK10562 69 -RFVGALFVAPKAVRRGIGKALMQHVQQR 96 (145)
T ss_pred -cEEEEEEECHHHcCCCHHHHHHHHHHhh
Confidence 1367899999999999999999998764
No 46
>KOG3139|consensus
Probab=98.58 E-value=6.4e-07 Score=65.96 Aligned_cols=98 Identities=13% Similarity=0.162 Sum_probs=66.5
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC 125 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I 125 (152)
+....-.....++.+..|++.++.+-| +.-. ..-+..+...+.++.-+.++..||.+......+.+...+||
T Consensus 16 i~~~~~~~~~~l~~im~Li~k~lsepy-------S~~t-yrYf~~~wp~~~~~a~d~~~~~VGai~ck~~~~r~~~rgyi 87 (165)
T KOG3139|consen 16 IRPSLYPAEEYLADIMRLIDKDLSEPY-------SIYT-YRYFVPNWPCFCFLALDEKGDTVGAIVCKLDTHRNTLRGYI 87 (165)
T ss_pred eeeecchHHHHHHHHHHHHhhhcCchh-------HHHH-HHhcccCCceEEEEEEcCCCceEEEEEEeccccCCcceEEE
Confidence 444455566777778888887664444 2222 23344455555555443123369999887665544456899
Q ss_pred eeeeeccccccCCHHHHHHHHHHhhc
Q psy838 126 YELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..|+|+++|||+|||+.|...+++.+
T Consensus 88 ~mLaV~~e~Rg~GIg~aLvr~aId~m 113 (165)
T KOG3139|consen 88 AMLAVDSEYRGQGIGKALVRKAIDAM 113 (165)
T ss_pred EEEEechhhccccHHHHHHHHHHHHH
Confidence 99999999999999999999998765
No 47
>COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism]
Probab=98.54 E-value=1.5e-07 Score=68.86 Aligned_cols=93 Identities=8% Similarity=0.065 Sum_probs=64.4
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeee
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL 128 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i 128 (152)
...++.+|+.+|.+|... |+..+ --....+..+...-..|.++..+ |.++|-+.+.+..+ .....+..|
T Consensus 3 iR~A~~~Di~~I~~Li~~-----~~~~g--il~~rs~~~le~~i~dF~i~E~~--g~viGC~aL~~~~~--~~~gE~~~l 71 (153)
T COG1246 3 IRKARISDIPAILELIRP-----LELQG--ILLRRSREQLEEEIDDFTIIERD--GKVIGCAALHPVLE--EDLGELRSL 71 (153)
T ss_pred eeeccccchHHHHHHHHH-----Hhhcc--ccchhhHHHHHHHHhhheeeeeC--CcEEEEEeecccCc--cCeeeEEEE
Confidence 356788999999999864 32211 00111122333333345677665 99999999985322 235679999
Q ss_pred eeccccccCCHHHHHHHHHHhhcC
Q psy838 129 QLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 129 ~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
+|+|++||+|+|..|++.++..|+
T Consensus 72 aV~pd~r~~G~G~~Ll~~~~~~Ar 95 (153)
T COG1246 72 AVHPDYRGSGRGERLLERLLADAR 95 (153)
T ss_pred EECHHhcCCCcHHHHHHHHHHHHH
Confidence 999999999999999999998774
No 48
>PF13420 Acetyltransf_4: Acetyltransferase (GNAT) domain; PDB: 3DR8_A 3DR6_A 2AE6_B 2JLM_C 2J8R_A 1YVO_B 2J8M_A 2J8N_A 2BL1_A 3IWG_A ....
Probab=98.54 E-value=3.5e-07 Score=65.75 Aligned_cols=95 Identities=7% Similarity=0.071 Sum_probs=60.5
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh-----hhhhhhhc----CCCceEEEEEeCCCCCeEEEEEEEeeccCCc
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDP-----QGKHSEMF----DDRACYLVAKNGSSSTPVAFSHFRFDVDFGE 120 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~-----~~~~~el~----~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~ 120 (152)
.+++++|++.++.+..+. .....+.|.. +....-+. ++....+++... +|++||++.+..... ..
T Consensus 2 R~~~~~D~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~-~g~iiG~~~~~~~~~-~~ 75 (155)
T PF13420_consen 2 RPATEEDLEEILKLYNEP----RHEYFFTFEYPEDSEESFERWIESIIDSSKQRLFLVAEE-DGKIIGYVSLRDIDP-YN 75 (155)
T ss_dssp EE--GGGHHHHHHHHHHH----HHHTSSSSCSSHS-HHHHHHHHHHHHHHHTTEEEEEEEC-TTEEEEEEEEEESSS-GT
T ss_pred CCCcHHHHHHHHHHHhhh----hhcceeEecCCCCCHHHHHHHHHHhcccCCCcEEEEEEc-CCcEEEEEEEEeeec-cC
Confidence 357889999999988653 2111233331 11111121 234555555552 499999999986433 33
Q ss_pred ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..+.+. ++|.|++|++|+|+.|++.+++.|
T Consensus 76 ~~~~~~-~~v~~~~~~~gig~~l~~~l~~~a 105 (155)
T PF13420_consen 76 HTAELS-IYVSPDYRGKGIGRKLLDELIEYA 105 (155)
T ss_dssp TEEEEE-EEEEGGGTTSSHHHHHHHHHHHHH
T ss_pred CEEEEe-eEEChhHCCCcHHHHHHHHHHHHh
Confidence 344444 889999999999999999999887
No 49
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=98.54 E-value=7.8e-07 Score=70.69 Aligned_cols=91 Identities=13% Similarity=0.172 Sum_probs=56.8
Q ss_pred HHHHHHHHHHHhHHHHHHhC--CCCCChhhhhhhhcC---CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeee
Q psy838 56 TKKWVWELEEKNMKHSYEVC--DIGWDPQGKHSEMFD---DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQL 130 (152)
Q Consensus 56 d~~~i~~l~~~~m~~~y~~~--~~~w~~~~~~~el~~---~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V 130 (152)
|.+.+.++..+. |..+ ..+|+.+.+...... ....++++..+.++++|||+.+....++ ....+|+.+.|
T Consensus 160 d~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~vG~~~~~~~~~~-~~~~~i~~~~V 234 (292)
T TIGR03448 160 DDAEWLRVNNAA----FAWHPEQGGWTRADLAERRAEPWFDPAGLFLAFDDAPGELLGFHWTKVHPDE-PALGEVYVVGV 234 (292)
T ss_pred chHHHHHHHHHH----hhCCCccCCcCHHHHHHHhhCcCCCcCceEEEEECCCCcEEEEEEEEecCCC-CceeEEEEEEE
Confidence 555666664442 3221 146877665443221 1223455555224899999765543222 22456788899
Q ss_pred ccccccCCHHHHHHHHHHhhc
Q psy838 131 EKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 131 ~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|+|||+|+|+.||..+++.+
T Consensus 235 ~p~~rg~GiG~~ll~~~~~~~ 255 (292)
T TIGR03448 235 DPAAQGRGLGDALTLIGLHHL 255 (292)
T ss_pred CHHHcCCCHHHHHHHHHHHHH
Confidence 999999999999999988765
No 50
>TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar.
Probab=98.47 E-value=2.6e-06 Score=61.11 Aligned_cols=99 Identities=16% Similarity=0.106 Sum_probs=60.4
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhh---hhhhhc-CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceE
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQG---KHSEMF-DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVL 123 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~---~~~el~-~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~ 123 (152)
...+++++|++.++++..+.-...|..+...-+.+. ....+. ++...++++..+ |++||++.+..... .....
T Consensus 2 ~lr~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~vG~~~~~~~~~-~~~~~ 78 (156)
T TIGR03585 2 NFTPLNSEELELVLEWRNHPDVRANMYSDHLIDWEEHLHFIEALKQDPNRRYWIVCQE--SRPIGVISFTDINL-VHKSA 78 (156)
T ss_pred CcccCCHHHHHHHHHhhCCHHHHhhccCcCCCCHHHHHHHHHHhhcCCCceEEEEEEC--CEEEEEEEEEecCh-hhCeE
Confidence 467899999999999764422122311111112111 112222 344455566554 99999999875432 12233
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+ .+++.|++| +|+|+.+|..++++|
T Consensus 79 ~~-g~~~~~~~~-~G~g~~~~~~~~~~a 104 (156)
T TIGR03585 79 FW-GIYANPFCK-PGVGSVLEEAALEYA 104 (156)
T ss_pred EE-EEEeChhhh-cCchHHHHHHHHHHH
Confidence 33 466999999 999999999999875
No 51
>PRK15130 spermidine N1-acetyltransferase; Provisional
Probab=98.43 E-value=2.7e-06 Score=63.27 Aligned_cols=100 Identities=13% Similarity=0.107 Sum_probs=59.9
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-h----hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcc
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-G----KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEP 121 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-~----~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~ 121 (152)
....+++++|++.++++..+.....+-. ..+++.. . +...+.++....+++..+ +++||++.+....... .
T Consensus 7 l~lR~~~~~D~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~--g~~iG~~~~~~~~~~~-~ 82 (186)
T PRK15130 7 VKLRPLEREDLRFVHQLDNNASVMRYWF-EEPYEAFVELSDLYDKHIHDQSERRFVVECD--GEKAGLVELVEINHVH-R 82 (186)
T ss_pred eEEecCCHHHHHHHHHHhcChHHHhhcC-CcccccHHHHHHHHHHhhhcccCcEEEEEEC--CEEEEEEEEEeecCCC-C
Confidence 3466788999999999865421011100 1111110 0 111122334444555544 8999999986543222 1
Q ss_pred eEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
...+ .++|+|+|||+|+|+.+++.+++++
T Consensus 83 ~~~~-~~~v~~~~~g~G~g~~l~~~l~~~~ 111 (186)
T PRK15130 83 RAEF-QIIISPEYQGKGLATRAAKLAMDYG 111 (186)
T ss_pred eEEE-EEEECHHHcCCCHHHHHHHHHHHHH
Confidence 2333 5999999999999999999998764
No 52
>PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional
Probab=98.36 E-value=6.7e-06 Score=61.63 Aligned_cols=99 Identities=16% Similarity=0.134 Sum_probs=60.7
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh---------hh------hhhh-cCCCceEEEEEeCCCCCeEEEE
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ---------GK------HSEM-FDDRACYLVAKNGSSSTPVAFS 110 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~---------~~------~~el-~~~~~~~~v~~~~~~~~ivGf~ 110 (152)
....+++++|.+.++++..++- .+- .+|.+. .. .... .++....+++....++++||++
T Consensus 18 l~LR~~~~~Da~~~~~l~~~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~iG~i 92 (194)
T PRK10809 18 LVVRLVHERDAWRLADYYAENR--HFL---KPWEPVRDESHCYPSGWQARLGMINEFHKQGSAFYFALLDPDEKEIIGVA 92 (194)
T ss_pred EEEEeCCHHHHHHHHHHHHhCH--Hhc---cCCCCCCcccccCHHHHHHHHHHHHHHHhcCcEEEEEEEECCCCeEEEEE
Confidence 4467789999999999876542 121 122211 00 1111 2233333444333257999999
Q ss_pred EEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 111 HFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 111 ~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+........... .-.+.|.|+|||+|+|+.++..++++|
T Consensus 93 ~l~~~~~~~~~~~-eig~~i~~~~~G~G~~~ea~~~ll~~~ 132 (194)
T PRK10809 93 NFSNVVRGSFHAC-YLGYSLGQKWQGQGLMFEALQAAIRYM 132 (194)
T ss_pred EEEeecCCCeeeE-EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 9875432222222 234779999999999999999999876
No 53
>PF13302 Acetyltransf_3: Acetyltransferase (GNAT) domain; PDB: 3TTH_C 3JUW_A 2ZXV_A 2Z0Z_A 2VI7_B 3EG7_F 1YRE_C 3IGR_B 3FBU_A 2FCK_A ....
Probab=98.25 E-value=2.4e-06 Score=60.11 Aligned_cols=101 Identities=9% Similarity=0.072 Sum_probs=60.7
Q ss_pred cCCCCCHHHHHHHHHHHHH-hHHHHHHhCCCC-CChhhhhhhhcC-----C--CceEEEEEeCCCCCeEEEEEEEeeccC
Q psy838 48 KSTDLGAETKKWVWELEEK-NMKHSYEVCDIG-WDPQGKHSEMFD-----D--RACYLVAKNGSSSTPVAFSHFRFDVDF 118 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~-~m~~~y~~~~~~-w~~~~~~~el~~-----~--~~~~~v~~~~~~~~ivGf~~~~~~~~~ 118 (152)
...+++++|++.++++..+ .+. .|.....+ ++.+....-+.+ + ...++++...+++++||++.+.....
T Consensus 3 ~lr~~~~~D~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~iG~i~~~~~~~- 80 (142)
T PF13302_consen 3 TLRPLTPEDADAIYEWRSDPEIR-RYLPWGPPWPTLEEAEEWIQSRQDSWENHGYYYFAIEDKDDGEIIGFIGLYNIDK- 80 (142)
T ss_dssp EEEE-HGGGHHHHHHHHTTTTHC-TTSSTTTSSSSHHHHHHHHHHHHHCHHEETEEEEEEEETTTTEEEEEEEEEEEET-
T ss_pred EEEcCCHHHHHHHHHHhcCHHHH-HhcCCCCCCCCHHHHHHHHHHhhhhhhcccceEEEEEeccCCceEEEeeeeeccc-
Confidence 4567889999999988742 111 11100112 243433333321 1 13344555543468999999954322
Q ss_pred CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
......|- +.|.|++||+|+|+.++..+.+++
T Consensus 81 ~~~~~eig-~~i~~~~~g~G~~~~~~~~~~~~~ 112 (142)
T PF13302_consen 81 NNNWAEIG-YWIGPDYRGKGYGTEALKLLLDWA 112 (142)
T ss_dssp TTTEEEEE-EEEEGGGTTSSHHHHHHHHHHHHH
T ss_pred CCCccccc-cchhHHHHhhhHHHHHHHHHHHHH
Confidence 23345555 889999999999999999998875
No 54
>PF14542 Acetyltransf_CG: GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=98.24 E-value=7.4e-06 Score=53.63 Aligned_cols=45 Identities=18% Similarity=0.296 Sum_probs=38.2
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|+.+|++.|.. + ..++.|....|.|++||+|+|+.||+.+.++|
T Consensus 7 ~g~~~a~l~Y~~--~--~~~~~i~hT~V~~~~rGqGia~~L~~~~l~~a 51 (78)
T PF14542_consen 7 DGEEIAELTYRE--D--GGVIVITHTEVPPELRGQGIAKKLVEAALDYA 51 (78)
T ss_dssp STTEEEEEEEEE--S--SSEEEEEEEEE-CSSSTTTHHHHHHHHHHHHH
T ss_pred CCEEEEEEEEEe--C--CCEEEEEEEEECccccCCcHHHHHHHHHHHHH
Confidence 388999999965 2 34778999999999999999999999999886
No 55
>PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional
Probab=98.23 E-value=3e-05 Score=57.23 Aligned_cols=101 Identities=11% Similarity=0.034 Sum_probs=60.5
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCC---CChhhhhhhhc------CCC-ceEEEEEeCCCCCeEEEEEEEeec
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIG---WDPQGKHSEMF------DDR-ACYLVAKNGSSSTPVAFSHFRFDV 116 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~---w~~~~~~~el~------~~~-~~~~v~~~~~~~~ivGf~~~~~~~ 116 (152)
....+++++|++.++++..++-........++ -+.+.....+. ... ..++++..+ +++||++.+....
T Consensus 11 l~Lr~~~~~D~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~i~~~--~~~iG~~~l~~~~ 88 (179)
T PRK10151 11 LELHAVDESHVTPLHQLVCKNKTWLQQSLNWPQFVQSEEDTRKTVQGNVMLHQRGYAKMFMIFKE--DELIGVLSFNRIE 88 (179)
T ss_pred EEEEeCCHHHHHHHHHHHHHhHHHHHhcCCCcCccCCHHHHHHHHHHHHHHHhcCCcEEEEEEEC--CEEEEEEEEEeec
Confidence 34678999999999998743211111100111 12232222222 112 223444444 8999999987643
Q ss_pred cCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 117 DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 117 ~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+... ...| .+.|+|+|||+|+|+.++..++++|
T Consensus 89 ~~~~-~~~i-g~~i~~~~~g~G~~tea~~~l~~~~ 121 (179)
T PRK10151 89 PLNK-TAYI-GYWLDESHQGQGIISQALQALIHHY 121 (179)
T ss_pred cCCC-ceEE-EEEEChhhcCCcHHHHHHHHHHHHH
Confidence 3222 2333 4679999999999999999988765
No 56
>cd02169 Citrate_lyase_ligase Citrate lyase ligase. Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction.
Probab=98.15 E-value=4.5e-06 Score=67.62 Aligned_cols=41 Identities=10% Similarity=-0.005 Sum_probs=35.6
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
++++||++.+.. .+|..|+|+|+|||+|+|++||+.+++.|
T Consensus 14 ~~~iVG~~~l~~--------~~I~~vaV~p~~Rg~GiG~~Ll~~l~~~a 54 (297)
T cd02169 14 AGELIATGSIAG--------NVLKCVAVCPKYQGEGLALKIVSELINKA 54 (297)
T ss_pred CCEEEEEEEecc--------CEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence 489999988742 13889999999999999999999999876
No 57
>KOG3396|consensus
Probab=98.03 E-value=4.5e-05 Score=54.88 Aligned_cols=73 Identities=10% Similarity=0.118 Sum_probs=50.1
Q ss_pred CChhhhhhhh---cCCCc-eEE-EEEeCCCCCeEEEEEEEeec---cCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 79 WDPQGKHSEM---FDDRA-CYL-VAKNGSSSTPVAFSHFRFDV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 79 w~~~~~~~el---~~~~~-~~~-v~~~~~~~~ivGf~~~~~~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
-+++++...+ ....+ .++ ++++.+.++++|-+.+.... .......+|.++.|++++||++||+.|+..|.++
T Consensus 34 vt~e~F~krf~~mk~~~~~Y~i~Vied~~s~~vigtatL~IE~KfIh~~g~rGhiEDVVV~~~~rgk~LGkllv~~Lv~l 113 (150)
T KOG3396|consen 34 VTREQFEKRFEAMKKSGDWYYIVVIEDKESEKVIGTATLFIERKFIHGCGSRGHIEDVVVDSEYRGKQLGKLLVETLVDL 113 (150)
T ss_pred cCHHHHHHHHHHHHhcCCcEEEEEEEeCCcCeEEEEEEEEEehhhhhcccccCceeEEEeChhhhhhHHhHHHHHHHHHH
Confidence 4555555543 33333 333 34444467899999876532 1223345699999999999999999999999987
Q ss_pred c
Q psy838 151 I 151 (152)
Q Consensus 151 a 151 (152)
|
T Consensus 114 ~ 114 (150)
T KOG3396|consen 114 A 114 (150)
T ss_pred H
Confidence 6
No 58
>COG2153 ElaA Predicted acyltransferase [General function prediction only]
Probab=98.01 E-value=1.1e-05 Score=58.55 Aligned_cols=98 Identities=14% Similarity=0.116 Sum_probs=64.2
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHh-CCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEV-CDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY 124 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~-~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~ 124 (152)
+....+++...+-+++.|..+. |-. +.=|+.+-. -.++... .+.++...+ +|++++|+.+.+....... .-
T Consensus 7 ~~~~~~Lt~~ely~LlkLRv~V----FVVEQ~CPY~E~D-g~Dl~~~-~~Hl~~~~~-~g~LvAyaRLl~~~~~~~~-~~ 78 (155)
T COG2153 7 CKHFNDLTVRELYELLKLRVDV----FVVEQNCPYPELD-GKDLLGD-TRHLLGWTP-DGELVAYARLLPPGAEYEE-VS 78 (155)
T ss_pred hhhhhhcCHHHHHHHHHhheeE----EEEecCCCCcCcC-Ccccccc-cceEEEEcC-CCeEEEEEecCCCCCCcCc-ee
Confidence 4556788888888888886652 211 122333211 1122223 334555653 5999999998764333232 55
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
|-++.|.|+.||+|+|++||....+.|
T Consensus 79 iGRV~v~~~~RG~glG~~Lm~~AL~~~ 105 (155)
T COG2153 79 IGRVIVSPAARGQGLGQQLMEKALETA 105 (155)
T ss_pred eeeEEECHhhhccchhHHHHHHHHHHH
Confidence 999999999999999999999887764
No 59
>PF13718 GNAT_acetyltr_2: GNAT acetyltransferase 2; PDB: 2ZPA_B.
Probab=97.90 E-value=8.4e-05 Score=56.82 Aligned_cols=70 Identities=11% Similarity=0.121 Sum_probs=48.4
Q ss_pred ChhhhhhhhcCCCceEEEEEeCCCC--CeEEEEEEEeecc----------------C------------------CcceE
Q psy838 80 DPQGKHSEMFDDRACYLVAKNGSSS--TPVAFSHFRFDVD----------------F------------------GEPVL 123 (152)
Q Consensus 80 ~~~~~~~el~~~~~~~~v~~~~~~~--~ivGf~~~~~~~~----------------~------------------~~~~~ 123 (152)
+++.++.-+-+|+.+.+++..+ + +++|-+.+...-. . ....+
T Consensus 14 sPnDL~~LlDaP~h~l~~l~~~--~~p~il~~~~v~~EG~l~~~l~~~i~~g~rRp~G~LiP~~L~~~~~~~~f~~l~g~ 91 (196)
T PF13718_consen 14 SPNDLQLLLDAPNHRLFVLLQP--GDPDILGVAQVALEGGLSKELIEAILSGGRRPKGHLIPQTLAQHFGDPEFAQLSGA 91 (196)
T ss_dssp -HHHHHHHHH-TTEEEEEEE-S--S--SEEEEEEEEEEE---HHHHHHHHTTS---SS-HHHHHHHHHSS-TTGGGSEEE
T ss_pred CHHHHHHHhcCCcceeehhccC--CCceEEEEEEEEecCCCCHHHHHHHHhCCCCCCCCCHHHHHHHHhCCHHHHhhcce
Confidence 6666555555677777777766 6 9999988754311 0 13457
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
-|.+|+|+|++|++|+|+++++.+++++
T Consensus 92 RIvRIAvhP~~q~~G~Gs~lL~~l~~~~ 119 (196)
T PF13718_consen 92 RIVRIAVHPDLQRMGYGSRLLQQLEQYA 119 (196)
T ss_dssp EEEEEEE-CCC-SSSHHHHHHHHHHHT-
T ss_pred eEEEEEEChhhhcCCHHHHHHHHHHHHH
Confidence 7999999999999999999999999886
No 60
>COG2388 Predicted acetyltransferase [General function prediction only]
Probab=97.90 E-value=3.9e-05 Score=52.53 Aligned_cols=56 Identities=7% Similarity=0.116 Sum_probs=43.3
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
....+++..+ |+.+|++.+... +...+.|..-+|.+++||||+|+.|+....+.|+
T Consensus 14 ~~~~y~~~~~--G~~~~e~~y~~~---~~~~i~i~HT~V~d~lrGqGia~~L~~~al~~ar 69 (99)
T COG2388 14 ENGRYVLTDE--GEVIGEATYYDR---GENLIIIDHTYVPDELRGQGIAQKLVEKALEEAR 69 (99)
T ss_pred CceEEEEecC--CcEEEEEEEecC---CCCEEEEecCcCCHHHcCCcHHHHHHHHHHHHHH
Confidence 3333444444 889999998754 2346779999999999999999999999988764
No 61
>KOG3397|consensus
Probab=97.90 E-value=8e-05 Score=55.87 Aligned_cols=57 Identities=18% Similarity=0.302 Sum_probs=45.4
Q ss_pred EEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 95 ~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
.+++..+...++||-+.+...... ...+++..+.|+.+.||+|.|+.||..+|++|+
T Consensus 57 sL~Ll~E~~~~VigH~rLS~i~n~-~~al~VEsVVV~k~~RG~GFGk~lMk~~E~~~R 113 (225)
T KOG3397|consen 57 SLLLLNEENDEVLGHSRLSHLPNR-DHALWVESVVVKKDQRGLGFGKFLMKSTEKWMR 113 (225)
T ss_pred eeeeecccccceeeeeccccCCCC-CceeEEEEEEEehhhccccHHHHHHHHHHHHHH
Confidence 344444436799999998875443 346889999999999999999999999999874
No 62
>TIGR00124 cit_ly_ligase [citrate (pro-3S)-lyase] ligase. ATP is cleaved to AMP and pyrophosphate during the reaction. The carboxyl end is homologous to a number of cytidyltransferases that also release pyrophosphate.
Probab=97.87 E-value=3.6e-05 Score=63.27 Aligned_cols=49 Identities=6% Similarity=0.128 Sum_probs=39.9
Q ss_pred ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++++..+ +++||++++.- . .|..|+|+|+|||+|+|++||..+++.|
T Consensus 31 d~~vv~~~~--~~lVg~g~l~g-----~---~ik~vaV~~~~rG~Glg~~L~~~L~~~a 79 (332)
T TIGR00124 31 EIFIAVYED--EEIIGCGGIAG-----N---VIKCVAIDESLRGEGLALQLMTELENLA 79 (332)
T ss_pred CEEEEEEEC--CEEEEEEEEec-----C---EEEEEEEcHHHcCCCHHHHHHHHHHHHH
Confidence 455556555 89999998731 1 3789999999999999999999999876
No 63
>PF12568 DUF3749: Acetyltransferase (GNAT) domain; InterPro: IPR024612 This domain is found in uncharacterised proteins from Gammaproteobacteria, and is approximately 40 amino acids in length. It contains two completely conserved residues (D and I) that may be functionally important. Proteins having this domain are frequently annotated as acetyltransferases of the GNAT family; however there is little accompanying annotation to confirm this.; PDB: 2K5T_A.
Probab=97.78 E-value=0.00059 Score=48.65 Aligned_cols=86 Identities=12% Similarity=0.228 Sum_probs=51.6
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY 126 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~ 126 (152)
.+...++++|..++-.| |.. .+++.. ..-.+....++.+..+ +.++|-+.+..... ...|.
T Consensus 5 ~rl~~ls~Qd~iDL~KI--------wp~----~~~~~l-~~~l~~~~~l~aArFN--dRlLgAv~v~~~~~----~~~L~ 65 (128)
T PF12568_consen 5 ERLTTLSEQDRIDLAKI--------WPQ----QDPEQL-EQWLDEGHRLFAARFN--DRLLGAVKVTISGQ----QAELS 65 (128)
T ss_dssp EE-SS--HHHHHHHHHH---------TT----S-----------SSEEEEEEEET--TEEEEEEEEEEETT----EEEEE
T ss_pred EEcCCCCHHHHHHHHHh--------CCC----CCHHHH-HHHhccCCeEEEEEec--hheeeeEEEEEcCc----ceEEe
Confidence 34556777776665544 311 122332 2233556666677776 99999999876522 46699
Q ss_pred eeeeccccccCCHHHHHHHHHHhhc
Q psy838 127 ELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 127 ~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|.|+|.-||+|+|+.|++.+...+
T Consensus 66 ~l~VRevTRrRGVG~yLlee~~rq~ 90 (128)
T PF12568_consen 66 DLCVREVTRRRGVGLYLLEEVLRQL 90 (128)
T ss_dssp EEEE-TT-SSSSHHHHHHHHHHHHS
T ss_pred eEEEeeccccccHHHHHHHHHHHHC
Confidence 9999999999999999999887653
No 64
>KOG3235|consensus
Probab=97.77 E-value=0.00011 Score=54.43 Aligned_cols=51 Identities=20% Similarity=0.234 Sum_probs=40.4
Q ss_pred EEEeCCCCCeEEEEEEEeec--cCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838 97 VAKNGSSSTPVAFSHFRFDV--DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 97 v~~~~~~~~ivGf~~~~~~~--~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
|+. +.+|+||||+...... ++..+..+|.-|.|...||+.|||+.||++..
T Consensus 45 VA~-D~~gkiVGYvlAkmee~p~~~~~hGhItSlaV~rs~RrlGla~kLm~qa~ 97 (193)
T KOG3235|consen 45 VAE-DENGKIVGYVLAKMEEDPDDEPPHGHITSLAVKRSYRRLGLAQKLMNQAS 97 (193)
T ss_pred EEE-cCCCcEEEEeeeehhhcccCCCCCCeeEEeeehhhHHHhhHHHHHHHHHH
Confidence 444 3479999999876432 34556788999999999999999999998754
No 65
>TIGR01211 ELP3 histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases.
Probab=97.66 E-value=0.00012 Score=63.54 Aligned_cols=59 Identities=14% Similarity=0.267 Sum_probs=43.5
Q ss_pred ceEEEEEe-CCCCCeEEEEEEEeeccC-----CcceEEEeeeeec-----------cccccCCHHHHHHHHHHhhc
Q psy838 93 ACYLVAKN-GSSSTPVAFSHFRFDVDF-----GEPVLYCYELQLE-----------KQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 93 ~~~~v~~~-~~~~~ivGf~~~~~~~~~-----~~~~~~I~~i~V~-----------p~~qgkGiG~~Lm~~l~~~a 151 (152)
...++.+. ..++.++||+.++...+. .....+|.+|+|. |+|||+|+|++||+.++++|
T Consensus 411 ~e~F~~y~~~~~~~l~G~lrlr~~~~~~~~~~~~~~a~IrelhV~G~~~~~~~~~~~~~rg~GiG~~Ll~~ae~~A 486 (522)
T TIGR01211 411 TEFFLSYEDPKNDILIGFLRLRFPSEPAHRKEVDATALVRELHVYGSEVPIGERGDDEWQHRGYGRRLLEEAERIA 486 (522)
T ss_pred CeEEEEEEcCCCCeEEEEEEEecCcccccccccCCCceEEEEEEeeeeccccccCChhHhCcCHHHHHHHHHHHHH
Confidence 34455554 224689999999875431 1224678888865 99999999999999999876
No 66
>PF08445 FR47: FR47-like protein; InterPro: IPR013653 Proteins in this entry have a conserved region similar to the C-terminal region of the Drosophila melanogaster (Fruit fly) hypothetical protein FR47 (Q9VR51 from SWISSPROT). This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. ; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups; PDB: 1SQH_A 3EC4_B.
Probab=97.61 E-value=0.00027 Score=46.87 Aligned_cols=42 Identities=21% Similarity=0.202 Sum_probs=29.5
Q ss_pred CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 105 TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 105 ~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+.+..++.....+.+ .|..|+++|++||+|+|+.|+..+.+.
T Consensus 8 ~~~~l~~~~~~~~~g----~i~~v~t~p~~RrrGlg~~lv~~l~~~ 49 (86)
T PF08445_consen 8 ELVALVAWIIRSDDG----EIGGVYTLPEHRRRGLGSALVAALARE 49 (86)
T ss_dssp CCEEEEEEEEESCTC----CEEEEEE-GGGTTSSHHHHHHHHHHHH
T ss_pred CccceeeEeeeCCCc----EEEEEEECHHHcCCCHHHHHHHHHHHH
Confidence 344444444443444 388999999999999999999888654
No 67
>KOG3234|consensus
Probab=97.55 E-value=0.00016 Score=53.25 Aligned_cols=51 Identities=12% Similarity=0.040 Sum_probs=42.8
Q ss_pred CCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 101 GSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 101 ~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+.++++.||+.-...-..+.+..++..|.|.|+||+.|+|+.||+.+++..
T Consensus 48 ~p~~~imgyimgk~Eg~~~~wh~HvTAltVap~~Rrl~la~~lm~~led~~ 98 (173)
T KOG3234|consen 48 APTGEIMGYIMGKVEGKDTEWHGHVTALTVAPDYRRLGLAAKLMDTLEDVS 98 (173)
T ss_pred CCCCceEEEEeeeccccCcceeeEEEEEEechhHHHHHHHHHHHHHHHHHH
Confidence 336899999988654445667788999999999999999999999999864
No 68
>COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only]
Probab=97.40 E-value=0.001 Score=59.76 Aligned_cols=71 Identities=11% Similarity=0.122 Sum_probs=49.7
Q ss_pred ChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC---------------------------------CcceEEEe
Q psy838 80 DPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF---------------------------------GEPVLYCY 126 (152)
Q Consensus 80 ~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~---------------------------------~~~~~~I~ 126 (152)
+++....-+-.|+++++++..+ ++.+|+-+.+...-.. .-..+-|.
T Consensus 457 sP~DL~~L~DaP~h~~~al~~~-~~~~va~~qva~EG~l~~~~i~~~~~g~r~~GnlIp~~l~~~~~~~~fa~l~G~RIv 535 (758)
T COG1444 457 SPNDLRRLLDAPHHHIFALRAP-EGKPVAVWQVAEEGGLSDELIDIWLGGRRPRGNLIPDLLAKHHRDPEFAKLVGWRIV 535 (758)
T ss_pred CHHHHHHHhcCCCCeeEEEEcC-CCceEEEEEeeccCCCcHHHHHHHhcCCCCCCcccHHHHHHhhcchhhcccceeeEE
Confidence 5666555555677777777665 4577766655332110 01235699
Q ss_pred eeeeccccccCCHHHHHHHHHHhhc
Q psy838 127 ELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 127 ~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|+|+|++|++|||++|++.+.++|
T Consensus 536 RIAvhPe~q~~GiGsrlL~~l~~~a 560 (758)
T COG1444 536 RIAVHPELQRMGIGSRLLALLIEEA 560 (758)
T ss_pred EEEeCHHHHhcCHHHHHHHHHHHHH
Confidence 9999999999999999999999887
No 69
>PF13480 Acetyltransf_6: Acetyltransferase (GNAT) domain
Probab=97.29 E-value=0.011 Score=41.00 Aligned_cols=114 Identities=12% Similarity=0.151 Sum_probs=70.3
Q ss_pred hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcC---C-Cce
Q psy838 20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFD---D-RAC 94 (152)
Q Consensus 20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~---~-~~~ 94 (152)
..+|+.+++.+..-.-. +.. .-+++++++++++..+++... ... ..+++.+.+..-+.. + ...
T Consensus 5 ~~~rr~~r~~~~~g~~~---------~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~ 72 (142)
T PF13480_consen 5 KKIRRAIRRAEKLGGVR---------FEV--ATDPADLEAFYELYRESWARR-HGGFAPPFSRDFFRDLLRSLAESGRLR 72 (142)
T ss_pred HHHHHHHHHHHhcCCEE---------EEE--eCCHHHHHHHHHHHHHHHhhh-hCCCCCcchHHHHHHHHHhhccCCCEE
Confidence 45677777776542111 111 237888999999987765443 010 123444433322221 2 344
Q ss_pred EEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 95 ~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
++++..+ |++||+...-.. . ..++.+-...+|+++..++|..|+..+++.|
T Consensus 73 l~~~~~~--g~~va~~~~~~~--~--~~~~~~~~g~~~~~~~~~~~~~l~~~~i~~a 123 (142)
T PF13480_consen 73 LFVLYDG--GEPVAFALGFRH--G--GTLYYWYGGYDPEYRKYSPGRLLLWEAIRWA 123 (142)
T ss_pred EEEEEEC--CEEEEEEEEEEE--C--CEEEEEEEEECHhhHhCCHHHHHHHHHHHHH
Confidence 4455554 999999865432 1 2566778889999999999999999998875
No 70
>COG0454 WecD Histone acetyltransferase HPA2 and related acetyltransferases [Transcription / General function prediction only]
Probab=97.23 E-value=0.00019 Score=45.77 Aligned_cols=24 Identities=21% Similarity=0.409 Sum_probs=22.6
Q ss_pred eeeccccccCCHHHHHHHHHHhhc
Q psy838 128 LQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 128 i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
++|+|++||+|+|+.|++.+++.|
T Consensus 87 l~v~~~~rg~Gig~~Ll~~~~~~~ 110 (156)
T COG0454 87 LYVLPEYRGKGIGSALLEAALEWA 110 (156)
T ss_pred EEecchhhccchHHHHHHHHHHHH
Confidence 999999999999999999988876
No 71
>PF05301 Mec-17: Touch receptor neuron protein Mec-17; InterPro: IPR007965 Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor []. This family is conserved to higher eukaryotes.; GO: 0019799 tubulin N-acetyltransferase activity
Probab=97.10 E-value=0.0026 Score=44.86 Aligned_cols=47 Identities=23% Similarity=0.470 Sum_probs=35.0
Q ss_pred CCeEEEEEEEe----ecc-CC-----cceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 104 STPVAFSHFRF----DVD-FG-----EPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 104 ~~ivGf~~~~~----~~~-~~-----~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+.++||+.+-. ..+ .+ +|..-|.++||+++.|++|+|+.|++.+...
T Consensus 18 g~viG~LKVG~K~Lfl~d~~g~~~e~~~~~cvLDFyVhes~QR~G~Gk~LF~~ML~~ 74 (120)
T PF05301_consen 18 GAVIGFLKVGYKKLFLLDERGQHREIEPLLCVLDFYVHESRQRRGYGKRLFDHMLQE 74 (120)
T ss_pred ceEEEEEEEeeeeEEEEcCCCCEEEecccceeeeEEEEeceeccCchHHHHHHHHHH
Confidence 67999997642 112 11 3443578999999999999999999988653
No 72
>PF01233 NMT: Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain; InterPro: IPR022676 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the N-terminal region. ; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 4A33_A 3H5Z_A 4A2Z_A 2WSA_A ....
Probab=96.89 E-value=0.014 Score=43.22 Aligned_cols=102 Identities=15% Similarity=0.010 Sum_probs=68.0
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhC----CCCCChhhhhhhhcCCCc---eEEEEEeCCCCCeEEEEEEEee----
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVC----DIGWDPQGKHSEMFDDRA---CYLVAKNGSSSTPVAFSHFRFD---- 115 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~----~~~w~~~~~~~el~~~~~---~~~v~~~~~~~~ivGf~~~~~~---- 115 (152)
.+...-++++++++++|-.+| +.+.. .+.++.+.+.=.|..|+. +.+-+....++++|||++..+.
T Consensus 27 ~~~dl~d~~~l~ely~lL~~n---YVEDdd~~fRf~YS~efL~WaL~pPg~~~~whiGVR~~~~~kLvgfIsaip~~irv 103 (162)
T PF01233_consen 27 STLDLNDDEELKELYELLNEN---YVEDDDNMFRFDYSKEFLKWALKPPGWKKEWHIGVRVKSSKKLVGFISAIPATIRV 103 (162)
T ss_dssp EE--TTSHHHHHHHHHHHHHH---SSBTTTSSEEE---HHHHHHHHTSTT--GGGEEEEEETTTTEEEEEEEEEEEEEEE
T ss_pred EecCCCCHHHHHHHHHHHHhc---CccCCcceEEeeCCHHHHhheeeCcCCccceEEEEEECCCCEEEEEEccceEEEEE
Confidence 334555788899999987765 22221 246888888878887643 2334444336899999997653
Q ss_pred ccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 116 VDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 116 ~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+......+|.-|.|++..|.++++-.|+.++-..+
T Consensus 104 ~~~~~~~~eINFLCVhKklRskrlAPvLIkEItRRv 139 (162)
T PF01233_consen 104 RDKVIKMVEINFLCVHKKLRSKRLAPVLIKEITRRV 139 (162)
T ss_dssp TTEEEEEEEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred eeeEeeeeeEEEEeecHhHhhcCCcHHHHHHHHHHh
Confidence 223345678999999999999999999999887643
No 73
>PF01853 MOZ_SAS: MOZ/SAS family; InterPro: IPR002717 Moz is a monocytic leukemia Zn_finger protein and the SAS protein from Saccharomyces cerevisiae (Baker's yeast) is involved in silencing the Hmr locus. These proteins were reported to be homologous to acetyltransferases [] but this similarity is not supported by standard sequence analysis.; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3TO6_A 1MJA_A 1MJ9_A 3TO7_A 3TO9_A 1MJB_A 1FY7_A 2OZU_A 2RC4_A 2OU2_A ....
Probab=96.82 E-value=0.0034 Score=47.67 Aligned_cols=57 Identities=16% Similarity=0.137 Sum_probs=37.1
Q ss_pred CCceEEEEEeCCCC-CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 91 DRACYLVAKNGSSS-TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 91 ~~~~~~v~~~~~~~-~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+...|++...+++| .+|||.+=.-...+ ...|..|.|.|.||++|+|+.||+.--.+
T Consensus 51 ~F~FYVl~e~d~~g~h~vGyFSKEk~s~~---~~NLsCIl~lP~yQrkGyG~~LI~fSY~L 108 (188)
T PF01853_consen 51 PFLFYVLTEKDDDGFHIVGYFSKEKESWD---NNNLSCILTLPPYQRKGYGRFLIDFSYEL 108 (188)
T ss_dssp TEEEEEEEEEETTEEEEEEEEEEESS-TT----EEESEEEE-GGGTTSSHHHHHHHHHHHH
T ss_pred ceEEEEEEEecCccceeEEEEEEEecccC---CeeEeehhhcchhhhcchhhhhhhhHHHH
Confidence 34455555544222 58899875432222 34588899999999999999999875444
No 74
>COG3981 Predicted acetyltransferase [General function prediction only]
Probab=96.74 E-value=0.0038 Score=46.64 Aligned_cols=54 Identities=19% Similarity=0.273 Sum_probs=39.5
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeeccC------CcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVDF------GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~------~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
.+.|+.+. + ++++||++.++....+ |+. .=-|+|+.||+|+|++++....+.|+
T Consensus 68 ~~~y~~v~-~-d~~ivG~i~lRh~Ln~~ll~~gGHI-----GY~VrPseR~KGYA~emLkl~L~~ar 127 (174)
T COG3981 68 ASTYWAVD-E-DGQIVGFINLRHQLNDFLLEEGGHI-----GYSVRPSERRKGYAKEMLKLALEKAR 127 (174)
T ss_pred ceeEEEEe-c-CCcEEEEEEeeeecchHHHhcCCcc-----cceeChhhhccCHHHHHHHHHHHHHH
Confidence 34454444 4 5999999999976542 221 13499999999999999999887763
No 75
>PF13880 Acetyltransf_13: ESCO1/2 acetyl-transferase
Probab=96.51 E-value=0.0021 Score=41.23 Aligned_cols=27 Identities=15% Similarity=0.156 Sum_probs=23.9
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
+=|.+|.|+|.+|++||++.||+.+-.
T Consensus 6 ~GI~RIWV~~~~RR~GIAt~Lld~ar~ 32 (70)
T PF13880_consen 6 CGISRIWVSPSHRRKGIATRLLDAARE 32 (70)
T ss_pred EEeEEEEeChhhhhhhHHHHHHHHHHH
Confidence 448899999999999999999998754
No 76
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=96.42 E-value=0.0099 Score=47.86 Aligned_cols=55 Identities=20% Similarity=0.187 Sum_probs=37.4
Q ss_pred ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
..|++...++ +..+|||.+=.....++ ..|-.|-+.|.||++|+|+.||+.--.+
T Consensus 128 lFYVl~e~d~~g~h~vGYFSKEK~s~~~---nNLaCIltLPpyQrkGyG~~LI~fSYeL 183 (290)
T PLN03238 128 LFYVMTEVDDHGSHIVGYFSKEKVSAED---YNLACILTLPPYQRKGYGKFLISFAYEL 183 (290)
T ss_pred EEEEEEEecCCCcEEEEEeceeccccCC---CcEEEEEecChhhhccHhHhHHHHHhHH
Confidence 3455544432 34699997754332222 3478899999999999999999876544
No 77
>KOG3138|consensus
Probab=96.40 E-value=0.0035 Score=47.61 Aligned_cols=95 Identities=15% Similarity=0.143 Sum_probs=59.6
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc-CC------
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD-FG------ 119 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~-~~------ 119 (152)
.....+++.++...-.|.+......| ... +..+..+.....-++..+ +..+|-++...... .+
T Consensus 17 ~~l~~it~~nl~~~~~l~~~~fP~~y-------~~k-fy~~~~~~~~~~~~A~~~--~~~v~a~~~k~~~~~~~~~r~~~ 86 (187)
T KOG3138|consen 17 IELRLITPNNLKQLKQLNEDIFPISY-------VDK-FYPDVLSNGDLTQLAYYN--EIAVGAVACKLIKFVQNAKRLFG 86 (187)
T ss_pred eeeccCCcchHHHHHHHhccccCcch-------HHH-HHHHHHhcCCHHHhhhhc--cccccceeeeehhhhhhhhhhhc
Confidence 44567788888877777665433333 222 334443332222233333 56666666654321 11
Q ss_pred cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++||.-|-|.|.||.+|||+.||+.+.+.+
T Consensus 87 ~~~~yi~~Lgvl~~yR~~gIGs~Ll~~~~~~~ 118 (187)
T KOG3138|consen 87 NRVIYILSLGVLPRYRNKGIGSKLLEFVKKYC 118 (187)
T ss_pred cceeEEEeecccHHHHhcchHHHHHHHHHHHH
Confidence 11588999999999999999999999998875
No 78
>PLN03239 histone acetyltransferase; Provisional
Probab=96.05 E-value=0.015 Score=48.05 Aligned_cols=55 Identities=22% Similarity=0.297 Sum_probs=36.5
Q ss_pred ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
..|++...++ +-.+|||.+=.....++ ..|-.|-+.|.||++|+|+.||+.--.+
T Consensus 186 lFYVl~e~D~~g~h~vGYFSKEK~s~~~---~NLaCIltLPpyQrkGyG~lLI~fSYeL 241 (351)
T PLN03239 186 LFYVLCEVDERGFHPVGYYSKEKYSDVG---YNLACILTFPAHQRKGYGRFLIAFSYEL 241 (351)
T ss_pred EEEEEEEecCCceEEEEEeeecccCCCC---CceEEEEecChhhhcchhhhhHhhhhHh
Confidence 3444444332 23689997743322222 3488899999999999999999875444
No 79
>COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]
Probab=95.99 E-value=0.047 Score=39.28 Aligned_cols=47 Identities=11% Similarity=0.026 Sum_probs=34.3
Q ss_pred CCeEEEEEEEeecc-CCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 104 STPVAFSHFRFDVD-FGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 104 ~~ivGf~~~~~~~~-~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+++||.+.+....+ ...... .....+.|+++|+|+|+..+..+.++|
T Consensus 77 ~~~iG~~~~~~~~~~~~~~~~-~ig~~l~~~~~g~G~~tea~~~~l~~~ 124 (187)
T COG1670 77 GELIGVIGLSDIDRAANGDLA-EIGYWLDPEYWGKGYATEALRALLDYA 124 (187)
T ss_pred CeEEEEEEEEEeccccccceE-EEEEEEChHHhcCchHHHHHHHHHHHh
Confidence 58999999986542 111112 334557999999999999999998875
No 80
>COG3393 Predicted acetyltransferase [General function prediction only]
Probab=95.97 E-value=0.015 Score=46.25 Aligned_cols=52 Identities=10% Similarity=0.030 Sum_probs=37.1
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
.++.++..++ |++|.-+.-.-..+ ....|..++++|+|||+|+|..|+..|-
T Consensus 176 ~~~~~f~~~d--~~iVa~A~t~a~~~---~~~~I~gV~T~peyR~kGyAt~lva~L~ 227 (268)
T COG3393 176 RSRTYFLEGD--GKIVAKAETAAENP---AYAQINGVYTHPEYRGKGYATALVATLA 227 (268)
T ss_pred ceeEEEEccC--CcEEEeeeccccCC---cceEEEEEEcCHHHccccHHHHHHHHHH
Confidence 3444444443 69988877543322 2345899999999999999999998874
No 81
>PF12746 GNAT_acetyltran: GNAT acetyltransferase; PDB: 3G3S_B.
Probab=95.90 E-value=0.026 Score=45.16 Aligned_cols=53 Identities=15% Similarity=0.160 Sum_probs=34.8
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+--|.++ .+ +++|.-+.-.+.++.+ +.| +|..+|+|||+|+|+.+-..++..|
T Consensus 165 G~Gf~i~-~~--~~iVs~~~s~~~~~~~---~EI-~I~T~~~yR~kGLA~~~aa~~I~~C 217 (265)
T PF12746_consen 165 GFGFCIL-HD--GEIVSGCSSYFVYENG---IEI-DIETHPEYRGKGLATAVAAAFILEC 217 (265)
T ss_dssp --EEEEE-ET--TEEEEEEEEEEEETTE---EEE-EEEE-CCCTTSSHHHHHHHHHHHHH
T ss_pred CcEEEEE-EC--CEEEEEEEEEEEECCE---EEE-EEEECHHhhcCCHHHHHHHHHHHHH
Confidence 3344444 44 7877665544544433 223 5889999999999999999998765
No 82
>PF06852 DUF1248: Protein of unknown function (DUF1248); InterPro: IPR009658 This entry represents a conserved region within a number of proteins of unknown function that seem to be specific to Caenorhabditis elegans. Note that some proteins in the entry contain more than one copy of this region.
Probab=95.86 E-value=0.085 Score=39.93 Aligned_cols=63 Identities=22% Similarity=0.288 Sum_probs=39.4
Q ss_pred hhhhcCC-CceEEEEEeCCCCCeEEEEEEE-eec---cCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838 85 HSEMFDD-RACYLVAKNGSSSTPVAFSHFR-FDV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 85 ~~el~~~-~~~~~v~~~~~~~~ivGf~~~~-~~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
..+-+.. -..++++..+ .+++|+-+++- +.. ....|..++--.+++|+|||+|+++.+-+.+.
T Consensus 37 wk~sf~~~Y~l~~~~~Kg-T~~via~~~~~~~~~l~~~~d~pl~~~G~~w~~p~yRg~~~~kl~~~~~~ 104 (181)
T PF06852_consen 37 WKESFDDDYWLVLTCLKG-TDRVIATVHLIRFDPLNPSPDKPLQFIGFFWIDPEYRGKGIMKLQDDICM 104 (181)
T ss_pred HHHhhccCeEEEEEEEcC-CCcEEEEEEEEEeccCCCCCCCCeEEEeeeeeCCcccCcchHHHHHHHHH
Confidence 3333444 3344445544 56777776653 221 12367888999999999999999974444433
No 83
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=95.83 E-value=0.013 Score=49.96 Aligned_cols=54 Identities=19% Similarity=0.215 Sum_probs=36.7
Q ss_pred ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
..|++...++ +-.+|||.+=.....++ ..|-.|-+.|.||++|+|+.||+.--.
T Consensus 279 lFYvl~e~d~~g~h~vGyFSKEk~s~~~---~NLaCIltlP~yQrkGyG~~LI~~SYe 333 (450)
T PLN00104 279 LFYVLCECDDRGCHMVGYFSKEKHSEED---YNLACILTLPPYQRKGYGKFLIAFSYE 333 (450)
T ss_pred EEEEEEEecCCCcEEEEEecccccCcCC---CceEEEEecchhhhcchhheehhheeh
Confidence 4455544332 34799997744332222 348899999999999999999986443
No 84
>PTZ00064 histone acetyltransferase; Provisional
Probab=95.81 E-value=0.017 Score=49.69 Aligned_cols=56 Identities=18% Similarity=0.129 Sum_probs=37.6
Q ss_pred CceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 92 RACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 92 ~~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
...|++...++ +-.+|||.+=.....+ ...|-.|-+.|.||++|+|+.||+.--.+
T Consensus 356 FlFYVLtE~D~~G~HiVGYFSKEK~S~~---~nNLACILtLPpyQRKGYGklLIdfSYeL 412 (552)
T PTZ00064 356 FLFYIVTEVDEEGCHIVGYFSKEKVSLL---HYNLACILTLPCYQRKGYGKLLVDLSYKL 412 (552)
T ss_pred eEEEEEEEecCCCcEEEEEecccccCcc---cCceEEEEecchhhhcchhhhhhhhhhhh
Confidence 34455544432 3479999774332222 23488999999999999999999875444
No 85
>KOG4601|consensus
Probab=95.63 E-value=0.091 Score=41.18 Aligned_cols=45 Identities=18% Similarity=0.191 Sum_probs=33.7
Q ss_pred CCeEEEEEEEe----ecc------CCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 104 STPVAFSHFRF----DVD------FGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 104 ~~ivGf~~~~~----~~~------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
+.+.||+.+-+ .+| +++ .+-|.++||+++.|++|.|+.|++.+.+
T Consensus 81 s~l~GllKVG~KkLfl~D~~~~~ye~e-~lcILDFyVheS~QR~G~G~~lfdyMl~ 135 (264)
T KOG4601|consen 81 SILKGLLKVGYKKLFLTDNEQNQYEEE-ALCILDFYVHESEQRSGNGFKLFDYMLK 135 (264)
T ss_pred hheeeeehccceeEEEeccHhhhhccC-CceEEEEEeehhhhhcCchHHHHHHHHH
Confidence 46888886532 223 233 3448999999999999999999998765
No 86
>TIGR03694 exosort_acyl putative PEP-CTERM/exosortase system-associated acyltransferase. Members of this protein family are restricted to bacterial species with the PEP-CTERM/exosortase system predicted to act in exopolysaccharide-associated protein targeting. PSI-BLAST and CDD reveal relationships to the acyltransferase family that includes N-acyl-L-homoserine lactone synthetase. Several members of this family may be found in a single genome. These proteins likely contribute to chemical modifications in exopolysaccharide and biofilm structural material production.
Probab=95.29 E-value=0.34 Score=38.09 Aligned_cols=83 Identities=14% Similarity=0.255 Sum_probs=52.4
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-----hhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec--------
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-----GKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV-------- 116 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-----~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~-------- 116 (152)
...++++++.++.|-.+ .|-. ..+|... ....+-.+..+.++++...++|++||.+.+.+..
T Consensus 12 ~a~~~~~~~~~~~lR~~----VFv~-e~gw~~~~~~~~~~E~D~~D~~~~h~l~~~~~~g~vvG~~RLl~t~~~~p~~~~ 86 (241)
T TIGR03694 12 PAVTPELLEEAFRLRYQ----VYCE-ELGFEPPSDYPDGLETDEYDAHSVHSLLRHRRTGTFVGCVRLVLPNSSDPDQPF 86 (241)
T ss_pred EcCCHHHHHHHHHHHHH----HHHH-hcCCCCCCCCCCCCcCCCCCCCCcEEEEEECCCCCEEEEEEEeccccccccccc
Confidence 34566778999999876 4433 4567632 2222333444555555543258999999987531
Q ss_pred c-------------------CCcceEEEeeeeeccccccC
Q psy838 117 D-------------------FGEPVLYCYELQLEKQVQRK 137 (152)
Q Consensus 117 ~-------------------~~~~~~~I~~i~V~p~~qgk 137 (152)
. ....++++.++.|+|++|+.
T Consensus 87 p~e~~~~~~~~~~~~~~~~~~~~~i~E~SRf~V~~~~r~r 126 (241)
T TIGR03694 87 PFEKHCSHSLDGLFLDPRRLPRSRIAEVSRLAVSKDFRRR 126 (241)
T ss_pred cHHHHhccccchhhcCccccCCCceEEeehheECHhHhCC
Confidence 0 01347889999999999974
No 87
>PRK13834 putative autoinducer synthesis protein; Provisional
Probab=95.20 E-value=0.42 Score=36.75 Aligned_cols=96 Identities=13% Similarity=0.194 Sum_probs=61.6
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhh---h-hhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGK---H-SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD-------- 117 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~---~-~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~-------- 117 (152)
....+++++.++.|-.+ .|.. ..+|+.... . ++.-++..+|+++..+ +|+++|.+.+.+...
T Consensus 11 ~~~~~~~l~~~~rLR~~----VF~~-elgW~~~~~~g~E~D~yD~~~~~yll~~~~-~g~vvG~~RLlptt~p~ml~~~f 84 (207)
T PRK13834 11 YEREASLLKQMHRLRAR----VFGG-RLGWDVSITDGEERDQFDDLKPTYILAISD-SGRVAGCARLLPAIGPTMLAQVF 84 (207)
T ss_pred hhcCHHHHHHHHHHHHH----Hhcc-ccCCCCCCCCCcCccCCCCCCCEEEEEEeC-CCeEEEEEecccCCCcchhhhhc
Confidence 45667889999999866 4533 467865432 1 1122345677777654 589999998754311
Q ss_pred ----------CCcceEEEeeeeeccccccC---C----HHHHHHHHHHhhc
Q psy838 118 ----------FGEPVLYCYELQLEKQVQRK---G----LGKFMMQAFYTYI 151 (152)
Q Consensus 118 ----------~~~~~~~I~~i~V~p~~qgk---G----iG~~Lm~~l~~~a 151 (152)
....++++.++.|+|+++.. + +...|+..+.++|
T Consensus 85 p~l~~~~~~~~~~~v~E~SRf~V~~~~~~~~~~~~~~~~~~~L~~~~~~~a 135 (207)
T PRK13834 85 PQLLPAGRLNAHPAMIESSRFCVDTALAEGRGGGQLHEATLTMFAGIIEWS 135 (207)
T ss_pred HHhcCCCCCCCCCCEEEEeeeEEcccccccccccccCHHHHHHHHHHHHHH
Confidence 12457889999999986322 2 5566777776654
No 88
>COG5628 Predicted acetyltransferase [General function prediction only]
Probab=95.04 E-value=0.076 Score=37.70 Aligned_cols=55 Identities=18% Similarity=0.219 Sum_probs=39.1
Q ss_pred ceEEEEEeCCCCCeEEEEEEEee-ccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFD-VDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~-~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+.|++.. + |.+|||+.+--. .........+.++.|...|||+|+||+...++-..
T Consensus 38 ~~~~~~~-~--~~~igf~l~L~~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~ 93 (143)
T COG5628 38 EAWLFRI-G--GLPVGFALVLDLAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGS 93 (143)
T ss_pred ceeEEEE-C--CceeeeeeeecccCCCCcccccchheEeeehhhccchhHHHHHHHHHH
Confidence 4455544 3 899999976422 12223345688999999999999999999887543
No 89
>PF00765 Autoind_synth: Autoinducer synthetase; InterPro: IPR001690 Bacterial species have many methods of controlling gene expression and cell growth. Regulation of gene expression in response to changes in cell density is termed quorum sensing [, ]. Quorum-sensing bacteria produce, release and respond to hormone-like molecules (autoinducers) that accumulate in the external environment as the cell population grows. Once a threshold of these molecules is reached, a signal transduction cascade is triggered that ultimately leads to behavioural changes in the bacterium []. Autoinducers are thus clearly important mediators of molecular communication. Conjugal transfer of Agrobacterium octopine-type Ti plasmids is activated by octopine, a metabolite released from plant tumours []. Octopine causes conjugal donors to secrete a pheromone, Agrobacterium autoinducer (AAI), and exogenous AAI further stimulates conjugation. The putative AAI synthase and an AAI-responsive transcriptional regulator have been found to be encoded by the Ti plasmid traI and traR genes, respectively. TraR and TraI are similar to the LuxR and LuxI regulatory proteins of Vibrio fischeri, and AAI is similar in structure to the diffusable V. fischeri autoinducer, the inducing ligand of LuxR. TraR activates target genes in the presence of AAI and also activates traR and traI themselves, creating two positive-feedback loops. TraR-AAI-mediated activation in wild-type Agrobacterium strains is enhanced by culturing on solid media, suggesting a possible role in cell density sensing []. Production of light by the marine bacterium V. fischeri and by recombinant hosts containing cloned lux genes is controlled by the density of the culture []. Density-dependent regulation of lux gene expression has been shown to require a locus consisting of the luxR and luxI genes. In these and other Gram-negative bacteria, N-(3-oxohexanoyl)-L-homoserine lactone (OHHL) acts as the autoinducer by binding to transcriptional regulatory proteins and activating them []. OHHL and related molecules, such as N-butanoyl- (BHL), N-hexanoyl- (HHL) and N-oxododecanoyl- (PAI) homoserine lactones, are produced by a family of proteins that share a high level of sequence similarity. Proteins which currently members of this family include: luxI from V. fischeri. ahyI and asaI from Aeromonas species, which synthesize BHL and whose targets are ahyR and asaR respectively. carI from Erwinia carotovora. The target of OHHL is carR which activates genes involved in the biosynthesis of carbapenem antibiotics. eagI from Enterobacter agglomerans. The target of OHHL is not yet known. esaI from Erwinia stewartii. expI from Erwinia carotovora. lasI from Pseudomonas aeruginosa, which synthesizes PAI and whose target is lasR which activates the transcription of the elastase gene. rhlI (or vsmI) from P. aeruginosa, which synthesizes BHL and HHL and whose target is rhlR. swrI from Serratia liquefaciens, which synthesizes BHL. yenI from Yersinia enterocolitica. ; GO: 0007165 signal transduction; PDB: 3P2H_A 3P2F_A 1KZF_A 1K4J_A 1RO5_A.
Probab=94.79 E-value=0.64 Score=35.07 Aligned_cols=93 Identities=16% Similarity=0.236 Sum_probs=63.9
Q ss_pred CCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhh-----cCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838 51 DLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEM-----FDDRACYLVAKNGSSSTPVAFSHFRFDVD-------- 117 (152)
Q Consensus 51 ~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el-----~~~~~~~~v~~~~~~~~ivGf~~~~~~~~-------- 117 (152)
.++.++++.++.+-.+ .|.. ..+|+.... ..+ -+++..|+++..+ |+++|.+.+.++..
T Consensus 4 ~~~~~~l~~~~rlR~~----vFv~-rlgW~v~~~-dg~E~DqyD~~~~~ylv~~~~--g~v~g~~RLlptt~p~ML~~~F 75 (182)
T PF00765_consen 4 ELSRRLLEEMFRLRHR----VFVD-RLGWDVPCE-DGMEIDQYDDPDAVYLVALDD--GRVVGCARLLPTTGPYMLSDVF 75 (182)
T ss_dssp HHHHHHHHHHHHHHHH----HHTT-CSCCCHHCC-TSEE--TTGCTT-EEEEEEET--TEEEEEEEEEETTS--HHHHCT
T ss_pred ccCHHHHHHHHHHHHH----HHHH-hhCCCCcCC-CCcEeeecCCCCCeEEEEEEC--CEEEEEeeeccCCCcchhhhHH
Confidence 3456788899988765 5654 578987543 222 2356778888876 99999999986432
Q ss_pred ----------CCcceEEEeeeeecccccc------CCHHHHHHHHHHhhc
Q psy838 118 ----------FGEPVLYCYELQLEKQVQR------KGLGKFMMQAFYTYI 151 (152)
Q Consensus 118 ----------~~~~~~~I~~i~V~p~~qg------kGiG~~Lm~~l~~~a 151 (152)
.+..++.+.++.|+|+..+ .-+...|+..+.++|
T Consensus 76 ~~ll~~~~~p~~~~vwE~SRf~v~~~~~~~~~~~~~~~~~~L~~~~~e~a 125 (182)
T PF00765_consen 76 PHLLPDGPAPRSPDVWELSRFCVDPDRRRSRAGSRSPVTMELLLGMVEFA 125 (182)
T ss_dssp GGGHTTS---SSTTEEEEEEEEE-HCCCHHCHSCC-THHHHHHHHHHHHH
T ss_pred HHHhCCCCCCCCCcceeeeEEEEcccccccccccccHHHHHHHHHHHHHH
Confidence 1246799999999998532 246778888887765
No 90
>KOG2747|consensus
Probab=94.76 E-value=0.037 Score=46.43 Aligned_cols=54 Identities=19% Similarity=0.178 Sum_probs=33.8
Q ss_pred CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838 91 DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 91 ~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
+...|++...++.| .|||.+=.-. .....+|..|-|.|-||++|.|+.||+.--
T Consensus 233 pFlFYVlte~d~~G-~VGYFSKEK~---s~~~yNlaCILtLPpyQRkGYGklLIdFSY 286 (396)
T KOG2747|consen 233 PFLFYVLTECDSYG-CVGYFSKEKE---SSENYNLACILTLPPYQRKGYGKLLIDFSY 286 (396)
T ss_pred ceEEEEEEecCCcc-eeeeeccccc---cccccceeeeeecChhhhcccchhhhhhhh
Confidence 33445554444233 4666542211 112345888999999999999999998643
No 91
>cd04264 DUF619-NAGS DUF619 domain of various N-acetylglutamate Synthases of the fungal arginine-biosynthetic pathway and urea cycle found in humans and fish. DUF619-NAGS: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=94.39 E-value=0.21 Score=34.08 Aligned_cols=44 Identities=11% Similarity=0.126 Sum_probs=32.7
Q ss_pred CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 105 TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 105 ~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
...|.+.+.... .+..+.||-.++|.|+.||+|+|..|++.+.+
T Consensus 18 ~y~~~aIvt~~~-~~~~~~yLdKfaV~~~~~g~gvad~vf~~i~~ 61 (99)
T cd04264 18 GYNAAAIVTYEG-VNNGVPYLDKFAVSSSAQGEGTSDALWRRLRR 61 (99)
T ss_pred CceEEEEEeccC-CCCCceEEEEEEEchhhhhcChHHHHHHHHHh
Confidence 344554443321 12457899999999999999999999999875
No 92
>COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism]
Probab=94.38 E-value=0.33 Score=40.79 Aligned_cols=99 Identities=13% Similarity=0.114 Sum_probs=66.8
Q ss_pred cccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec
Q psy838 37 YTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV 116 (152)
Q Consensus 37 ~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~ 116 (152)
+.-|.||.+ .+.....=|++.+-.|+.+ .|.. ..+++.....+..+-...+|+. .--|.+.+.+.-
T Consensus 329 f~~p~~~~~--~sttw~~Ldl~r~q~LI~~----SFkR---TLd~h~y~~r~~~~La~~iVsg-----dY~g~aIlTyeg 394 (495)
T COG5630 329 FHVPEKGAK--PSTTWKDLDLPRLQHLIQS----SFKR---TLDPHYYETRINTPLARAIVSG-----DYRGAAILTYEG 394 (495)
T ss_pred Eeccccccc--CCCChhhcCcHHHHHHHHH----HHhh---ccCHHHHHHhccCcceeEEeec-----cceeeEEEEeec
Confidence 444666643 3444455566777777765 5543 4578888888888877666542 234555554433
Q ss_pred cCCcceEEEeeeeecccccc-CCHHHHHHHHHHh
Q psy838 117 DFGEPVLYCYELQLEKQVQR-KGLGKFMMQAFYT 149 (152)
Q Consensus 117 ~~~~~~~~I~~i~V~p~~qg-kGiG~~Lm~~l~~ 149 (152)
..++.+-||..++|.++.|| -|||..+.+-+-+
T Consensus 395 s~~~~vpYLDKfAVl~~aQGs~gisd~vfniM~e 428 (495)
T COG5630 395 SGENNVPYLDKFAVLDDAQGSEGISDAVFNIMRE 428 (495)
T ss_pred cCCCCCcceeeeeccccccccchHHHHHHHHHHH
Confidence 34456889999999999999 9999999887654
No 93
>TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST).
Probab=93.35 E-value=0.27 Score=40.59 Aligned_cols=93 Identities=12% Similarity=0.085 Sum_probs=54.5
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCC-----------CChhhhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIG-----------WDPQGKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFD-- 115 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~-----------w~~~~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~-- 115 (152)
.++..+|++.+++|..+.= .=+ .+.| ++...+..+....+..|++|.++ ++|++||...+.-.
T Consensus 3 Rpv~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG 79 (335)
T TIGR03243 3 RPVRTSDLDALMQLARESG-IGL--TSLPADRAALGSRIARSEKSFAGESTRGEEGYLFVLEDTETGTVAGVSAIEAAVG 79 (335)
T ss_pred ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHHhcccCCCCccEEEEEEeCCCCeEEEEEeEEeccc
Confidence 4677888999988876520 000 0122 34444433322235567776653 35899998874211
Q ss_pred ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838 116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ 145 (152)
Q Consensus 116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~ 145 (152)
.+ +-.-.-.|-.|.|+|+||+-|.|+.|-.
T Consensus 80 ~~~PfY~yrv~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfL~p~~R~~~~G~LLSr 140 (335)
T TIGR03243 80 LDEPFYNYRVGTLVHASRELGVYNKIPTLTLSNDLTGSSELCTLFLDPDYRKGGNGRLLSR 140 (335)
T ss_pred CCCCCEEEEcCceeecCcccCCccceeeEEeeccCCCCeeeEEEEECHHHcCCCchhhHHH
Confidence 00 0011234778999999999999998743
No 94
>KOG2779|consensus
Probab=93.34 E-value=0.28 Score=40.85 Aligned_cols=100 Identities=13% Similarity=0.063 Sum_probs=65.9
Q ss_pred CCHHHHHHHHHHHHHhHH-HHHHhCCCCCChhhhhhhhcCCC---ceEEEEEeCCCCCeEEEEEEEee----ccCCcceE
Q psy838 52 LGAETKKWVWELEEKNMK-HSYEVCDIGWDPQGKHSEMFDDR---ACYLVAKNGSSSTPVAFSHFRFD----VDFGEPVL 123 (152)
Q Consensus 52 l~~~d~~~i~~l~~~~m~-~~y~~~~~~w~~~~~~~el~~~~---~~~~v~~~~~~~~ivGf~~~~~~----~~~~~~~~ 123 (152)
-...|++++++|..+|.- +.+...-+.++.+.+.=.+..|+ .+.+.+....+.++|||++..+. .+.-...+
T Consensus 89 ~~~~~l~el~~lL~enyVEd~~~m~rf~Ys~eFl~Wal~~pg~~~~WHiGVRv~~s~kLVaFIsaiP~~irvrdk~vk~v 168 (421)
T KOG2779|consen 89 SDFKDLEELYNLLNENYVEDDDSMFRFDYSPEFLKWALQPPGWKKEWHIGVRVKSSKKLVAFISAIPATIRVRDKVVKMV 168 (421)
T ss_pred ccHhHHHHHHhhcccCCCCccccchhhhccHHHHHhhhcCCCCccceEEEEEEecCCceEEEEeccccEEEEccceeeee
Confidence 345667777776665411 11111124678887777776654 23334443335799999987642 23445578
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.|+-|.|++..|+++++--|+.++-..+
T Consensus 169 eINFLCVHKkLRSKRlaPvLIrEITRRv 196 (421)
T KOG2779|consen 169 EINFLCVHKKLRSKRLAPVLIREITRRV 196 (421)
T ss_pred eEEEEEEehhhhccccccHHHHHHHHHh
Confidence 8999999999999999999999886554
No 95
>TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST).
Probab=93.31 E-value=0.27 Score=40.62 Aligned_cols=93 Identities=12% Similarity=0.109 Sum_probs=53.7
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-----------hhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-----------GKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFD-- 115 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-----------~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~-- 115 (152)
.++...|++.+++|..+.= .=+ .+.|-+++ .+..+....+..|++|.++ ++|++||...+.-.
T Consensus 3 RPv~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~vG 79 (336)
T TIGR03244 3 RPVETSDLDALYQLAQSTG-IGL--TSLPANEDLLSARIERAEKTFSGELTRAEQGYLFVLEDTETGTVAGVSAIEAAVG 79 (336)
T ss_pred ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCeEEEEEeEEeccc
Confidence 4677888999998876520 000 01233333 3322222234567777654 35899998874211
Q ss_pred ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838 116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ 145 (152)
Q Consensus 116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~ 145 (152)
.+ +-.-.-.|-.|.++|+||+-|.|+.|-.
T Consensus 80 ~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~SElctLfL~p~~R~~~~G~LLSr 140 (336)
T TIGR03244 80 LEEPFYNYRVGTVVHASKELGIYKALETLFLSNDLTGYSELCTLFLDPDYRKGGNGRLLSK 140 (336)
T ss_pred CCCCCEEEEcCceeecCcccCCceeeeeEEeeccCCCCeeeEEEEECHHHcCCcchhhHHH
Confidence 00 0011234677899999999999988743
No 96
>PF13444 Acetyltransf_5: Acetyltransferase (GNAT) domain
Probab=93.07 E-value=0.59 Score=31.51 Aligned_cols=60 Identities=12% Similarity=0.200 Sum_probs=42.2
Q ss_pred hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------------------CCcceEEEeeeeeccccccCCHHHHH
Q psy838 84 KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--------------------FGEPVLYCYELQLEKQVQRKGLGKFM 143 (152)
Q Consensus 84 ~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--------------------~~~~~~~I~~i~V~p~~qgkGiG~~L 143 (152)
...+-.++.+..+++.++. .++||.+.+..... ....++.|-++.|+|+||+...-..|
T Consensus 21 ~e~D~fD~~~~h~lv~~~~-~~~VGt~Rl~~~~~~~~~~~~~~~~~f~l~~~~~~~~~~~EisRl~V~~~~R~~~~~~~L 99 (101)
T PF13444_consen 21 RERDEFDEHSVHLLVRDKN-TEVVGTVRLILPSPAGPLEGFYSESEFDLDPLLPLPRRVAEISRLCVHPEYRRRKVLLLL 99 (101)
T ss_pred ccccCCCCCccEEEEEECC-CCEEEEEEeeccccccccccCCchhhcCcchhhccCCcEEEeehheECHhHCCChHHHHH
Confidence 4455566777777776652 36999998754211 12367889999999999999887766
Q ss_pred H
Q psy838 144 M 144 (152)
Q Consensus 144 m 144 (152)
.
T Consensus 100 ~ 100 (101)
T PF13444_consen 100 W 100 (101)
T ss_pred h
Confidence 4
No 97
>PRK10456 arginine succinyltransferase; Provisional
Probab=93.00 E-value=0.31 Score=40.41 Aligned_cols=93 Identities=10% Similarity=0.071 Sum_probs=53.9
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-----------hcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-----------MFDDRACYLVAKNG-SSSTPVAFSHFRFD-- 115 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-----------l~~~~~~~~v~~~~-~~~~ivGf~~~~~~-- 115 (152)
.++..+|++.+++|..+.= .=+ .+.|-+++.+... ....+..|++|.++ ++|++||...+.-.
T Consensus 5 Rpv~~~Dl~aL~~LA~~sG-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG 81 (344)
T PRK10456 5 RPVERSDLAALMQLAGKTG-GGL--TSLPANEATLAARIERALKTWQGELPKSEQGYVFVLEDSETGTVAGICAIEVAVG 81 (344)
T ss_pred ecCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCcEEEEEeEEeccc
Confidence 4677889999998876520 000 0133333332222 22234566777654 35899998874211
Q ss_pred ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838 116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ 145 (152)
Q Consensus 116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~ 145 (152)
.+ +-.-.-.|-.|.|+|+||+-|.|+.|-.
T Consensus 82 ~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfl~p~~R~~~~G~LLSr 142 (344)
T PRK10456 82 LNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSK 142 (344)
T ss_pred CCCCCEEEEcCceeecCcccCCceeeeeEEeeccCCCCceeEEEEECHHHcCCCchhHHHH
Confidence 00 0011234677899999999999988743
No 98
>PF08444 Gly_acyl_tr_C: Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region; InterPro: IPR013652 This entry represents mammalian-specific glycine N-acyltransferase (also called aralkyl acyl-CoA:amino acid N-acyltransferase; 2.3.1.13 from EC). Mitochondrial acyltransferases catalyse the transfer of an acyl group from acyl-CoA to the N terminus of glycine to produce N-acylglycine. These enzymes can conjugate a multitude of substrates to form a variety of N-acylglycines. The CoA derivatives of a number of aliphatic and aromatic acids, but not phenylacetyl-CoA or (indol-3-yl)acetyl-CoA, can act as donor [, ].
Probab=92.21 E-value=0.18 Score=33.85 Aligned_cols=41 Identities=17% Similarity=0.095 Sum_probs=30.3
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+|.+|-.+..... ..+.--+..|+|||||+.+.++..+.+.
T Consensus 7 eG~PVSW~lmdqt-------ge~rmgyTlPeyR~~G~~~~v~~~~~~~ 47 (89)
T PF08444_consen 7 EGNPVSWSLMDQT-------GEMRMGYTLPEYRGQGLMSQVMYHLAQY 47 (89)
T ss_pred CCCEeEEEEeccc-------ccccccccCHhHhcCCHHHHHHHHHHHH
Confidence 5778766554221 2356678999999999999999887764
No 99
>PF04958 AstA: Arginine N-succinyltransferase beta subunit; InterPro: IPR007041 Arginine N-succinyltransferase catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism via the arginine succinyltransferase pathway. Six major L-arginine-degrading pathways have been described for prokaryotes []. Many bacteria arginine succinyltransferase 2.3.1.109 from EC, which is the AstA protein of the succinyltransferase (ast) pathway operon consists of five genes. In a few species, such as Pseudomonas aeruginosa, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST). This entry represents the family of proteins that make up the beta subunit of the heterodimer of Ast and AOST.; GO: 0008791 arginine N-succinyltransferase activity, 0006527 arginine catabolic process; PDB: 1YLE_A.
Probab=91.62 E-value=0.89 Score=37.73 Aligned_cols=93 Identities=12% Similarity=0.079 Sum_probs=48.1
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-----------hc-CC-CceEEEEEeC-CCCCeEEEEEEEee
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-----------MF-DD-RACYLVAKNG-SSSTPVAFSHFRFD 115 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-----------l~-~~-~~~~~v~~~~-~~~~ivGf~~~~~~ 115 (152)
.++..+|++.+++|....= .-+ .+.|-+++.+... .. .+ +..|++|.++ ++|+|||.+.+.-.
T Consensus 5 Rp~~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sFa~~~~~~~~~~~YlfVLED~~tg~vvGts~I~a~ 81 (342)
T PF04958_consen 5 RPARPSDLDALYALARESG-PGF--TSLPPDREALAERIERSERSFAGRDVDFPGDEGYLFVLEDTETGEVVGTSAIEAA 81 (342)
T ss_dssp EE--GGGHHHHHHHHHHS--TT---TTS-S-HHHHHHHHHHHHHHHH-TT----S--EEEEEEEETTT--EEEEEEEESS
T ss_pred ecCchhhHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhhccccCCCCccceEEEEEecCCCcEEEEEeEEec
Confidence 4677889999999876521 111 0234444433222 21 12 3457777653 36899998874210
Q ss_pred c-------------------c--------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838 116 V-------------------D--------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ 145 (152)
Q Consensus 116 ~-------------------~--------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~ 145 (152)
. + +-.-.-.|-.|+|+|+||+-|.|+.|-.
T Consensus 82 vG~~~PfY~yr~~~~vh~S~~L~v~~~~~~L~L~~d~tG~sEl~tLfL~p~~R~~~~G~lLSr 144 (342)
T PF04958_consen 82 VGLDEPFYSYRVSTLVHASRELGVRNRHETLTLSNDYTGCSELCTLFLDPDYRGGGNGRLLSR 144 (342)
T ss_dssp TTSSS---EEEEEEEEEEETTTTEEEEEEEEEEE-TTTTSEEEEEEEE-GGGTTSHHHHHHHH
T ss_pred cCCCCCcEEEEcCceeEcCcccCCccceeeEeeecCCCCCeeeEEEEECHHHcCCchHHHHHH
Confidence 0 0 0111245778999999999999998753
No 100
>TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit.
Probab=91.51 E-value=0.77 Score=38.00 Aligned_cols=93 Identities=6% Similarity=0.000 Sum_probs=53.9
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCC-----------CChhhhhhhhc-CCCceEEEEEeC-CCCCeEEEEEEEee-
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIG-----------WDPQGKHSEMF-DDRACYLVAKNG-SSSTPVAFSHFRFD- 115 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~-----------w~~~~~~~el~-~~~~~~~v~~~~-~~~~ivGf~~~~~~- 115 (152)
.++...|++.+++|..+.= .=+ .+.| ++...+..+.. ..+..|++|.++ ++|++||...+.-.
T Consensus 3 Rpv~~~Dl~aL~~LA~~sG-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~v 79 (336)
T TIGR03245 3 RPSRFADLPAIERLANESA-IGV--TSLPADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIVASA 79 (336)
T ss_pred ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHHHhhcCCCCCccEEEEEEeCCCCcEEEEEeEEecc
Confidence 4677888999998876520 000 0123 33334422222 234556776653 35899998874211
Q ss_pred ----------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838 116 ----------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ 145 (152)
Q Consensus 116 ----------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~ 145 (152)
.+ +-.-.-.|-.|+++|+||+-|.|+.|-.
T Consensus 80 G~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfL~p~~R~~~~G~lLSr 141 (336)
T TIGR03245 80 GYGEPFYSYRNDTLIHASRELKVNNKIHVLYMCHELTGSSLLCSFYVDPRLRKTEAAELLSR 141 (336)
T ss_pred cCCCCCEEEEcCceeecCcccCCccceeeEEeeccCCCCeeeEEEEECHHHcCCCchhHHHH
Confidence 00 0011234778999999999999988743
No 101
>cd04265 DUF619-NAGS-U DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea (U) cycle of humans and fish. This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=91.43 E-value=0.86 Score=31.09 Aligned_cols=29 Identities=17% Similarity=0.287 Sum_probs=26.7
Q ss_pred ceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 121 PVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
.+.||-.++|.|+.||+|+|..|++.+.+
T Consensus 33 ~~~yLdKfaV~~~~~g~gv~d~vf~~i~~ 61 (99)
T cd04265 33 GVPYLDKFAVSSSAQGEGTGEALWRRLRR 61 (99)
T ss_pred CceEEEEEEEchhhhhcChHHHHHHHHHh
Confidence 46789999999999999999999999875
No 102
>COG3053 CitC Citrate lyase synthetase [Energy production and conversion]
Probab=91.28 E-value=0.65 Score=37.91 Aligned_cols=51 Identities=14% Similarity=0.152 Sum_probs=38.5
Q ss_pred ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
-.+.++..++++++|+..++.-. -|-.|+|.|.+||-|+.-+|+..|+++|
T Consensus 35 ve~~v~~~~~~~~iiacGsiaGn--------vikcvAvs~s~qGeGl~lkl~TeLin~a 85 (352)
T COG3053 35 VEYFVAIYRDNEEIIACGSIAGN--------VIKCVAVSESLQGEGLALKLVTELINLA 85 (352)
T ss_pred ceEEEEEEcCCCcEEEecccccc--------eeEEEEechhcccccHHHHHHHHHHHHH
Confidence 34455544436899988775311 1678999999999999999999998875
No 103
>COG3375 Uncharacterized conserved protein [Function unknown]
Probab=91.04 E-value=1.7 Score=34.15 Aligned_cols=88 Identities=16% Similarity=0.057 Sum_probs=52.8
Q ss_pred CCHHHHHHHHHHHHHhHHHHHHhCC-CCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeee
Q psy838 52 LGAETKKWVWELEEKNMKHSYEVCD-IGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQL 130 (152)
Q Consensus 52 l~~~d~~~i~~l~~~~m~~~y~~~~-~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V 130 (152)
-++.++++..++... .+..+. ...-.+.++ .|.....-.+-++.+ ++++||...-.+..-.+....|=|-+.|
T Consensus 9 ~dp~el~~~~dV~~~----aWg~~d~~~~~~d~i~-al~~~GGlvlgAf~~-dg~lVGls~G~pg~r~g~~y~ySH~~gV 82 (266)
T COG3375 9 TDPAELDEAEDVQAS----AWGSEDRDGAPADTIR-ALRYHGGLVLGAFSA-DGRLVGLSYGYPGGRGGSLYLYSHMLGV 82 (266)
T ss_pred CCHHHHHHHHHHHHH----HhCccccccchHHHHH-HHHhcCCeEEEEEcC-CCcEEEEEeccCCcCCCceeeeeeehhc
Confidence 356677777766544 221111 011122222 344445555555655 4699998766554434555677888999
Q ss_pred ccccccCCHHHHHHH
Q psy838 131 EKQVQRKGLGKFMMQ 145 (152)
Q Consensus 131 ~p~~qgkGiG~~Lm~ 145 (152)
.|++++.|+|-.|=.
T Consensus 83 ~e~~k~sglg~aLK~ 97 (266)
T COG3375 83 REEVKGSGLGVALKM 97 (266)
T ss_pred cccccccchhhhhHH
Confidence 999999999987743
No 104
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=91.00 E-value=0.14 Score=42.46 Aligned_cols=51 Identities=16% Similarity=0.253 Sum_probs=33.2
Q ss_pred ceEEEEEeCCCC-CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHH
Q psy838 93 ACYLVAKNGSSS-TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQA 146 (152)
Q Consensus 93 ~~~~v~~~~~~~-~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~ 146 (152)
..|++...++.+ .+|||.+=....+ ....|..|-+.|-||++|.|+.||+.
T Consensus 235 lFYvl~~~~~~~~h~vGyFSKEK~S~---~~yNLaCILtLP~yQRrGYG~lLIdF 286 (395)
T COG5027 235 LFYVLTERGDTGCHLVGYFSKEKESE---QDYNLACILTLPPYQRRGYGKLLIDF 286 (395)
T ss_pred EEEEEEEcCCcceeeeeeechhhccc---ccCceEEEEecChhHhcccceEeeee
Confidence 344444444222 4888876432222 23457889999999999999999864
No 105
>KOG4144|consensus
Probab=90.98 E-value=0.2 Score=37.22 Aligned_cols=43 Identities=9% Similarity=0.087 Sum_probs=33.0
Q ss_pred CCeEEEEEEEeecc-------------CCcceEEEeeeeeccccccCCHHHHHHHHH
Q psy838 104 STPVAFSHFRFDVD-------------FGEPVLYCYELQLEKQVQRKGLGKFMMQAF 147 (152)
Q Consensus 104 ~~ivGf~~~~~~~~-------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l 147 (152)
+.+||++.-..... .++ ...|+-+.|+|+||.+|.|..|+...
T Consensus 71 ~tLIghIigs~~~~E~lt~ESm~kh~s~g~-ni~iHsl~Ihpa~rk~g~a~~Ll~~y 126 (190)
T KOG4144|consen 71 GTLIGHIIGSLWDKERLTQESMTKHRSGGH-NIHIHSLAIHPAFRKQGRAPILLWRY 126 (190)
T ss_pred ccceehhhcccCcchhhhHHHHhhhhcCCc-ceeEEEEEecHHHHhcCcchhHHHHH
Confidence 78889886543211 233 47899999999999999999999873
No 106
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=90.91 E-value=0.19 Score=25.65 Aligned_cols=23 Identities=35% Similarity=0.964 Sum_probs=14.9
Q ss_pred HhHhHHhhhhhcccccccCCCCccc
Q psy838 22 LRKMVKKMEITQHAKYTCSFCGKEY 46 (152)
Q Consensus 22 ~rk~~~~~~~~~~~~~~c~~c~~~~ 46 (152)
|++.++ +. ....+|.||.|++.+
T Consensus 2 l~~H~~-~H-~~~k~~~C~~C~k~F 24 (26)
T PF13465_consen 2 LRRHMR-TH-TGEKPYKCPYCGKSF 24 (26)
T ss_dssp HHHHHH-HH-SSSSSEEESSSSEEE
T ss_pred HHHHhh-hc-CCCCCCCCCCCcCee
Confidence 344444 33 344679999999864
No 107
>KOG2696|consensus
Probab=89.85 E-value=0.95 Score=37.88 Aligned_cols=24 Identities=25% Similarity=0.339 Sum_probs=22.1
Q ss_pred EeeeeeccccccCCHHHHHHHHHH
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
|..+-|.|.||++|+|.+||+.+.
T Consensus 220 iSQmlilpPfq~~Glgs~l~E~i~ 243 (403)
T KOG2696|consen 220 ISQMLILPPFQGKGLGSQLYEAIA 243 (403)
T ss_pred hheeEEeccccCCchHHHHHHHHH
Confidence 677889999999999999999987
No 108
>PF04768 DUF619: Protein of unknown function (DUF619); InterPro: IPR006855 This region of unknown function is found at the C terminus of Neurospora crassa acetylglutamate synthase (2.7.2.8 from EC). It is also found C-terminal to the amino acid kinase region in some fungal acetylglutamate kinase enzymes (IPR001048 from INTERPRO). These enzymes play a role in arginine biosynthesis.; PDB: 3S6K_A 4AB7_F 3ZZF_B 3ZZI_D 3ZZH_A 3ZZG_A 3S6G_Y 3S6H_A 3S7Y_A.
Probab=89.68 E-value=3.5 Score=30.82 Aligned_cols=92 Identities=16% Similarity=0.106 Sum_probs=56.6
Q ss_pred cccCCCCCHH-HHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec-cCCcceE
Q psy838 46 YKKSTDLGAE-TKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV-DFGEPVL 123 (152)
Q Consensus 46 ~~~~~~l~~~-d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~-~~~~~~~ 123 (152)
+.....++.- |++.+.+|..+ .| .-.-+.+.....+.+.....++ . +..-|.+.+.+.. +...++.
T Consensus 22 i~~~~s~~~~~d~~kL~~ll~~----sf---~~~~~v~~yl~~l~~~~~~iy~--d---~~y~~~AIVt~e~~~~~~~v~ 89 (170)
T PF04768_consen 22 ILKHSSLSEFVDLDKLRALLER----SF---GGKLDVDHYLDRLNNRLFKIYV--D---EDYEGAAIVTPEGPDSNGPVP 89 (170)
T ss_dssp EEEESSCCCSS-HHHHHHHHHH----HS---TSSSBHTTHHHHHHTS-SEEEE--E---TTSSEEEEEEEE-SCTCTSEE
T ss_pred eEEecCccccCCHHHHHHHHHh----cc---cccccHHHHHHHhhccceEEEE--e---CCceEEEEEEecCCCCCCCCe
Confidence 3334444444 78888887655 34 2234566666777655443333 2 3344555554422 3345689
Q ss_pred EEeeeeeccccccCCHHHHHHHHHHh
Q psy838 124 YCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
||..++|.|..||.|++-.+.+.+.+
T Consensus 90 yLdKFav~~~~~g~gv~D~vf~~i~~ 115 (170)
T PF04768_consen 90 YLDKFAVSKSAQGSGVADNVFNAIRK 115 (170)
T ss_dssp EEEEEEE-HHHHHTTHHHHHHHHHHH
T ss_pred EEEEEEecchhhhcCHHHHHHHHHHH
Confidence 99999999999999999999998865
No 109
>PF09924 DUF2156: Uncharacterized conserved protein (DUF2156); InterPro: IPR024320 This domain of unknown function is found in uncharacterised proteins and in Lysylphosphatidylglycerol synthetase, which catalyses the transfer of a lysyl group from L-lysyl-tRNA(Lys) to membrane-bound phosphatidylglycerol [].; PDB: 2HQY_A.
Probab=89.07 E-value=2.8 Score=33.47 Aligned_cols=114 Identities=14% Similarity=0.107 Sum_probs=57.8
Q ss_pred hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-hc---CCCceE
Q psy838 20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-MF---DDRACY 95 (152)
Q Consensus 20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-l~---~~~~~~ 95 (152)
.++|.++++.+..-+. +. +.+...+++++.+.|.++.++=.... . -.+..+... +. ..+...
T Consensus 117 ~~~Rn~in~~~k~G~~-~~-------~~~~~~~~~~~~~el~~i~~~W~~~~-----~-~~e~~~~~~~~~~~~~~~~~~ 182 (299)
T PF09924_consen 117 RKKRNHINRFEKEGYT-FE-------VVPIPELDPELRDELLEISDEWLKEK-----E-RPERGFIMGALEHFDELGLRG 182 (299)
T ss_dssp HHHHHHHHHHHHH--T--E-------EEE-----GGGHHHHHHHHHHHHHHC-----T-HHHHHHHHHHHHTHHHHT-EE
T ss_pred hhHHHHHHHHhcCceE-EE-------EEECCCCCHHHHHHHHHHHHHHHhcC-----c-hhHHHHHhccccchhhcCceE
Confidence 5677777777765222 11 44455558888899988876521111 0 001111111 11 125566
Q ss_pred EEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 96 LVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 96 ~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+++... +|+++||+.+.+..... .++.+|.-...--=+|+=..|+..+...+
T Consensus 183 ~~~~~~-dgki~af~~~~~~~~~~---~~~~~~~k~~~~a~~G~~e~l~~~~~~~~ 234 (299)
T PF09924_consen 183 FVARVA-DGKIVAFAIGSPLGGRD---GWSIDFEKADPDAPKGIYEFLNVEFAEHL 234 (299)
T ss_dssp EEEEE--TTEEEEEEEEEEEE-TT---EEEEEEEEE-TT-STTHHHHHHHHHHHHS
T ss_pred EEEEEC-CCcEEEEEEEEEccCCc---cEEEEEEecCCCCCCcHHHHHHHHHHHhh
Confidence 777772 39999999998775221 23333433322235788888888776653
No 110
>COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only]
Probab=88.68 E-value=0.59 Score=33.80 Aligned_cols=32 Identities=13% Similarity=0.461 Sum_probs=29.1
Q ss_pred cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+.-+||.++.|....||.|+|+.|.+-+.+.|
T Consensus 82 e~F~YvDRvVVA~~aRGrG~aRalY~Dlf~~A 113 (167)
T COG3818 82 ENFFYVDRVVVASRARGRGVARALYADLFSYA 113 (167)
T ss_pred CceEEEEEEEEEecccccchHHHHHHHHHHHH
Confidence 45689999999999999999999999988776
No 111
>KOG2036|consensus
Probab=88.07 E-value=0.39 Score=43.46 Aligned_cols=28 Identities=18% Similarity=0.292 Sum_probs=25.3
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
..|.+|+|+|+||+.|.|+.-++.+.++
T Consensus 615 aRIVRIAvhP~y~~MGYGsrAvqLL~~y 642 (1011)
T KOG2036|consen 615 ARIVRIAVHPEYQKMGYGSRAVQLLTDY 642 (1011)
T ss_pred ceEEEEEeccchhccCccHHHHHHHHHH
Confidence 4589999999999999999999988765
No 112
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=86.45 E-value=0.2 Score=34.22 Aligned_cols=23 Identities=35% Similarity=0.738 Sum_probs=16.6
Q ss_pred hHhHHhhhhhcccccccCCCCcc
Q psy838 23 RKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 23 rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|+.++..-......|+||+||..
T Consensus 9 r~~ik~~~~~L~k~FtCp~Cghe 31 (104)
T COG4888 9 RKIIKRRPQVLPKTFTCPRCGHE 31 (104)
T ss_pred cccCcccCccCCceEecCccCCe
Confidence 44555555556678999999987
No 113
>COG5092 NMT1 N-myristoyl transferase [Lipid metabolism]
Probab=84.45 E-value=4.2 Score=33.63 Aligned_cols=101 Identities=14% Similarity=0.076 Sum_probs=66.4
Q ss_pred CCCHHHHHHHHHHHHHhH-HHHHHhCCCCCChhhhhhhhcCCCc---eEEEEEeCCCCCeEEEEEEEeec----cCCcce
Q psy838 51 DLGAETKKWVWELEEKNM-KHSYEVCDIGWDPQGKHSEMFDDRA---CYLVAKNGSSSTPVAFSHFRFDV----DFGEPV 122 (152)
Q Consensus 51 ~l~~~d~~~i~~l~~~~m-~~~y~~~~~~w~~~~~~~el~~~~~---~~~v~~~~~~~~ivGf~~~~~~~----~~~~~~ 122 (152)
.....++++++.|..+|. .+.|..+.+.++.+.++=.+..++. +.+.+.-....++|||++..+.. +--.++
T Consensus 86 v~N~~ql~dv~~lL~eNYVED~~ag~rf~Y~~EFl~Wal~~pg~kK~whigvRvk~t~klVaFIsa~p~~v~vRgK~~~~ 165 (451)
T COG5092 86 VANKKQLEDVFVLLEENYVEDIYAGHRFRYSVEFLQWALDGPGGKKRWHIGVRVKGTQKLVAFISAKPHLVSVRGKRSSV 165 (451)
T ss_pred ccccchhHHHHHHHHhhhhhhhhhhhHHHHHHHHHHHhhcCCCCceeeEEEEEEcccceeEEEEecceeEEEEccccccc
Confidence 445667888888776663 2334333345666666666666542 22333333246999999875432 122356
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+.+.-|.|+.+.|++-+.--|+..+-..|
T Consensus 166 ~evNFLCihk~lRsKRltPvLIkEiTRR~ 194 (451)
T COG5092 166 LEVNFLCIHKELRSKRLTPVLIKEITRRA 194 (451)
T ss_pred ceEEEEEEehhhhhCccchHHHHHHHHhh
Confidence 78999999999999999999999887665
No 114
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=84.16 E-value=0.5 Score=23.90 Aligned_cols=12 Identities=33% Similarity=0.838 Sum_probs=9.8
Q ss_pred cccccccCCCCc
Q psy838 33 QHAKYTCSFCGK 44 (152)
Q Consensus 33 ~~~~~~c~~c~~ 44 (152)
+-..|.||.||+
T Consensus 13 ~~v~f~CPnCG~ 24 (24)
T PF07754_consen 13 QAVPFPCPNCGF 24 (24)
T ss_pred cCceEeCCCCCC
Confidence 456899999995
No 115
>KOG4135|consensus
Probab=83.90 E-value=1.4 Score=32.52 Aligned_cols=32 Identities=19% Similarity=0.154 Sum_probs=25.9
Q ss_pred cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
.+...+.-..-.|.-||+|+|+..+.++..+|
T Consensus 105 ~~~gE~EvMIAEP~~RgKG~G~eav~~ml~y~ 136 (185)
T KOG4135|consen 105 VITGEVEVMIAEPRGRGKGIGTEAVRAMLAYA 136 (185)
T ss_pred eeeeeEEEEEecccccCCCccHHHHHHHHHHH
Confidence 34555666667899999999999999998875
No 116
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=83.86 E-value=0.43 Score=23.07 Aligned_cols=14 Identities=36% Similarity=1.170 Sum_probs=10.8
Q ss_pred cccCCCCcccccCC
Q psy838 37 YTCSFCGKEYKKST 50 (152)
Q Consensus 37 ~~c~~c~~~~~~~~ 50 (152)
|.||.||+.+.+..
T Consensus 1 y~C~~C~~~f~~~~ 14 (23)
T PF00096_consen 1 YKCPICGKSFSSKS 14 (23)
T ss_dssp EEETTTTEEESSHH
T ss_pred CCCCCCCCccCCHH
Confidence 68999999866543
No 117
>PHA00616 hypothetical protein
Probab=83.14 E-value=0.14 Score=29.79 Aligned_cols=15 Identities=33% Similarity=0.846 Sum_probs=11.3
Q ss_pred ccccCCCCcccccCC
Q psy838 36 KYTCSFCGKEYKKST 50 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~ 50 (152)
.|.||.||+.++..+
T Consensus 1 pYqC~~CG~~F~~~s 15 (44)
T PHA00616 1 MYQCLRCGGIFRKKK 15 (44)
T ss_pred CCccchhhHHHhhHH
Confidence 489999998765544
No 118
>COG3138 AstA Arginine/ornithine N-succinyltransferase beta subunit [Amino acid transport and metabolism]
Probab=82.70 E-value=3.7 Score=33.44 Aligned_cols=94 Identities=18% Similarity=0.210 Sum_probs=54.4
Q ss_pred CCCCHHHHHHHHHHHHHh---HHH------HHHhCCCCCChhhhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEeec---
Q psy838 50 TDLGAETKKWVWELEEKN---MKH------SYEVCDIGWDPQGKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFDV--- 116 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~---m~~------~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~~--- 116 (152)
.+++..|++.+++|.... |.. .... ...|++..+..+....+..|+++.++ +.|+++|...+.-..
T Consensus 5 RP~~~aDl~al~~LA~~sg~G~TsLP~de~~L~~-Ri~~se~sf~~~~~~ge~~Y~fVLEDsetG~VvG~saI~a~vGl~ 83 (336)
T COG3138 5 RPVERADLEALMELAVKTGVGLTSLPADEATLRA-RIERSEKSFQGELPPGEAGYLFVLEDSETGTVVGISAIEAAVGLN 83 (336)
T ss_pred ccccccCHHHHHHHHHhcCCCcccCCCCHHHHHH-HHHHHHHHHhcccCCCCccEEEEEEecCCceEEeEEEEEEeeccC
Confidence 467788999999886542 000 0000 01355555566565566667776653 468999987642100
Q ss_pred --------c---C---------Ccce----------EEEeeeeeccccccCCHHHHHH
Q psy838 117 --------D---F---------GEPV----------LYCYELQLEKQVQRKGLGKFMM 144 (152)
Q Consensus 117 --------~---~---------~~~~----------~~I~~i~V~p~~qgkGiG~~Lm 144 (152)
. + ..++ -.|-.|.++|++|.-|-|+.|-
T Consensus 84 ~PfYsyRv~tlvhaS~~L~v~~~i~~L~L~Nd~TG~SEl~sLFl~pd~Rkg~nG~Lls 141 (336)
T COG3138 84 DPFYSYRVGTLVHASPELNVYNEIPTLFLSNDLTGNSELCTLFLDPDWRKGGNGRLLS 141 (336)
T ss_pred CccceeeeeeeeecCccccccccceeEEEeccCcCchhhhheeecHHHhcccchhhhh
Confidence 0 0 0111 2245688999999888887653
No 119
>KOG3014|consensus
Probab=79.84 E-value=12 Score=29.88 Aligned_cols=27 Identities=11% Similarity=0.218 Sum_probs=24.3
Q ss_pred eEEEeeeeeccccccCCHHHHHHHHHH
Q psy838 122 VLYCYELQLEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~ 148 (152)
.+-|.+|.|.+-.|++||++.|++.+.
T Consensus 183 ~~GIsRIWV~s~~Rr~gIAs~lldva~ 209 (257)
T KOG3014|consen 183 ICGISRIWVSSLRRRKGIASLLLDVAR 209 (257)
T ss_pred EeeeEEEEeehhhhhhhhHHHHHHHHH
Confidence 466999999999999999999998764
No 120
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=79.79 E-value=0.82 Score=21.71 Aligned_cols=13 Identities=31% Similarity=1.216 Sum_probs=8.2
Q ss_pred cccCCCCcccccC
Q psy838 37 YTCSFCGKEYKKS 49 (152)
Q Consensus 37 ~~c~~c~~~~~~~ 49 (152)
|.||.|++.+.+.
T Consensus 1 ~~C~~C~~~~~~~ 13 (24)
T PF13894_consen 1 FQCPICGKSFRSK 13 (24)
T ss_dssp EE-SSTS-EESSH
T ss_pred CCCcCCCCcCCcH
Confidence 6799999876554
No 121
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=78.17 E-value=0.22 Score=29.98 Aligned_cols=20 Identities=25% Similarity=0.566 Sum_probs=10.6
Q ss_pred hhhcccccccCCCCcccccC
Q psy838 30 EITQHAKYTCSFCGKEYKKS 49 (152)
Q Consensus 30 ~~~~~~~~~c~~c~~~~~~~ 49 (152)
......+.+||.|+..+.+.
T Consensus 18 ~~~S~~PatCP~C~a~~~~s 37 (54)
T PF09237_consen 18 KSQSEQPATCPICGAVIRQS 37 (54)
T ss_dssp CCTTS--EE-TTT--EESSH
T ss_pred hhccCCCCCCCcchhhccch
Confidence 44556689999999876654
No 122
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=76.28 E-value=1.4 Score=30.62 Aligned_cols=14 Identities=36% Similarity=0.857 Sum_probs=8.6
Q ss_pred cccccCCCCccccc
Q psy838 35 AKYTCSFCGKEYKK 48 (152)
Q Consensus 35 ~~~~c~~c~~~~~~ 48 (152)
.|++||.||+.|..
T Consensus 8 tKR~Cp~CG~kFYD 21 (108)
T PF09538_consen 8 TKRTCPSCGAKFYD 21 (108)
T ss_pred CcccCCCCcchhcc
Confidence 36677777776333
No 123
>KOG2270|consensus
Probab=76.15 E-value=7.8 Score=33.28 Aligned_cols=65 Identities=23% Similarity=0.432 Sum_probs=47.2
Q ss_pred eccccccccc---hhHhHhHHhhhhhcccc--------cccC-------------CCCcc-----cccCCCCCHHHHHHH
Q psy838 10 ITGKYGTRYG---ASLRKMVKKMEITQHAK--------YTCS-------------FCGKE-----YKKSTDLGAETKKWV 60 (152)
Q Consensus 10 ~~~~~g~ryg---~~~rk~~~~~~~~~~~~--------~~c~-------------~c~~~-----~~~~~~l~~~d~~~i 60 (152)
+.|-|.-|+| .--||+|+.+-+.--.+ -.|| |||++ .+...+++.+++..+
T Consensus 190 V~GEfRFRhgyck~NPRKMVk~WAEKE~RNLkRl~~aGIP~PePIlLk~hVLVM~FlGrdgw~aPkLKd~~ls~~ka~~~ 269 (520)
T KOG2270|consen 190 VSGEFRFRHGYCKHNPRKMVKTWAEKEMRNLKRLNNAGIPCPEPILLKNHVLVMEFLGRDGWAAPKLKDASLSTSKAREL 269 (520)
T ss_pred ccceeeeecccccCCcHHHHHHHHHHHHHHHHHHHhcCCCCCCceeeecceEeeeeccCCCCcCcccccccCChHHHHHH
Confidence 4677888888 66788888874332212 2355 67776 455567788899999
Q ss_pred HHHHHHhHHHHHHh
Q psy838 61 WELEEKNMKHSYEV 74 (152)
Q Consensus 61 ~~l~~~~m~~~y~~ 74 (152)
+......|+.+|..
T Consensus 270 Y~~~v~~MR~lY~~ 283 (520)
T KOG2270|consen 270 YQQCVRIMRRLYQK 283 (520)
T ss_pred HHHHHHHHHHHHHH
Confidence 99888899999966
No 124
>PF02945 Endonuclease_7: Recombination endonuclease VII; InterPro: IPR004211 This family of proteins which includes Bacteriophage T4 endonuclease VII, Mycobacteriophage D29 gene 59, and other as yet uncharacterised proteins. The T4 endonuclease VII (Endo VII) recognises a broad spectrum of DNA substrates ranging from branched DNAs to single base mismatches. The structure of this enzyme has been resolved and it was found that the monomers form an elongated, intertwined molecular dimer that exibits extreme domain swapping. Two pairs of antiparallel helices which form a novel 'four-helix cross' motif are the major dimerisation elements [].; PDB: 3GOX_A 3FC3_A 1EN7_B 1E7L_B 2QNF_A 2QNC_A 1E7D_A.
Probab=75.79 E-value=1.5 Score=28.85 Aligned_cols=30 Identities=27% Similarity=0.542 Sum_probs=15.5
Q ss_pred ccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838 13 KYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 13 ~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
+.+.|||.+....-+-.+.+ ...|+.||++
T Consensus 2 ~~~~~Ygit~~~~~~l~~~q---~~~C~iC~~~ 31 (81)
T PF02945_consen 2 RLKRRYGITPEEYEALLEEQ---GGRCAICGKP 31 (81)
T ss_dssp -----H-HHHHHHHCCHHHT---TTE-TTT-SE
T ss_pred CcccccCCCHHHHHHHHHHh---CCcCcCCCCC
Confidence 45789999888766444444 5599999984
No 125
>COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics]
Probab=74.94 E-value=4.1 Score=35.29 Aligned_cols=50 Identities=16% Similarity=0.250 Sum_probs=33.0
Q ss_pred CCCeEEEEEEEeeccCC-----c-ceEEEeeeee----------ccccccCCHHHHHHHHHHhhcC
Q psy838 103 SSTPVAFSHFRFDVDFG-----E-PVLYCYELQL----------EKQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~-----~-~~~~I~~i~V----------~p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
++.++||+.++...+.. . +..-+.+|+| ...+|.+|.|+.||+..+.+|.
T Consensus 415 ~d~lig~lrlR~p~e~~~r~e~~~~~aivrelhvyg~~vpig~~~~~~QH~G~G~~L~~~AE~ia~ 480 (515)
T COG1243 415 NDILIGFLRLREPSEGAHREEIDDKTAIVRELHVYGSEVPIGKREDEWQHRGYGRELLEEAERIAR 480 (515)
T ss_pred hhhhhheeeecccccchhhhhcccchhhhhhhhccccccccccCcchhhcccHHHHHHHHHHHHHH
Confidence 47799999988643310 0 0111222222 4679999999999999998873
No 126
>COG4552 Eis Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]
Probab=73.78 E-value=3.2 Score=34.69 Aligned_cols=54 Identities=9% Similarity=-0.071 Sum_probs=36.6
Q ss_pred EEEEEeCCCCCeEEEEEEEe-ec---cCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 95 YLVAKNGSSSTPVAFSHFRF-DV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 95 ~~v~~~~~~~~ivGf~~~~~-~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
+.+++.+ .++++-++..+ .. ...-+...|--++++|+|||+|.-++||......
T Consensus 41 ~~vi~~n--qkl~s~L~i~~f~~~f~~q~l~t~GIa~Vas~P~~R~~G~~~~Ll~~sLre 98 (389)
T COG4552 41 SYVIYMN--QKLASRLHIPPFIFWFGNQVLPTAGIAGVASAPTYRRRGALRALLAHSLRE 98 (389)
T ss_pred ceEEeeh--hhhhhcccccchheeeCCeeeeccceEEEEechhhccCcHHHHHHHHHHHH
Confidence 3455554 77777666431 11 1112345588999999999999999999887653
No 127
>cd04266 DUF619-NAGS-FABP DUF619 domain of N-acetylglutamate Synthase of the fungal arginine-biosynthetic pathway. DUF619-NAGS-FABP: This family includes the DUF619 domain of N-acetylglutamate synthase (NAGS) of the fungal arginine-biosynthetic pathway (FABP). This NAGS (also known as arginine-requiring protein 2 or ARG2) consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. NAGS catalyzes the formation of NAG from acetylcoenzyme A and L-glutamate. The DUF619 domain, yet to be characterized, is predicted to function in NAGS association in fungi.
Probab=73.60 E-value=27 Score=24.19 Aligned_cols=30 Identities=17% Similarity=0.380 Sum_probs=26.9
Q ss_pred cceEEEeeeeecccccc-CCHHHHHHHHHHh
Q psy838 120 EPVLYCYELQLEKQVQR-KGLGKFMMQAFYT 149 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qg-kGiG~~Lm~~l~~ 149 (152)
..+-||..+.|.+..|| .|++..+.+++.+
T Consensus 37 ~~v~yLdKFav~~~~~gl~gv~D~vf~~m~~ 67 (108)
T cd04266 37 EKIAYLDKFAVLPKAQGSDGIADILFNAMLD 67 (108)
T ss_pred CCceEEEEEEEccccccccchHHHHHHHHHH
Confidence 56789999999999997 8999999998874
No 128
>PF14354 Lar_restr_allev: Restriction alleviation protein Lar
Probab=72.43 E-value=1.7 Score=26.40 Aligned_cols=9 Identities=44% Similarity=1.298 Sum_probs=7.5
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
.-|||||..
T Consensus 4 kPCPFCG~~ 12 (61)
T PF14354_consen 4 KPCPFCGSA 12 (61)
T ss_pred cCCCCCCCc
Confidence 469999977
No 129
>PF09151 DUF1936: Domain of unknown function (DUF1936); InterPro: IPR015234 This domain is found in a set of hypothetical archaeal proteins. Its exact function has not, as yet, been defined. ; PDB: 2QH1_B 1PVM_B.
Probab=72.35 E-value=1.7 Score=23.36 Aligned_cols=10 Identities=30% Similarity=0.740 Sum_probs=5.6
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
+|.||.||.-
T Consensus 1 ~hlcpkcgvg 10 (36)
T PF09151_consen 1 QHLCPKCGVG 10 (36)
T ss_dssp --B-TTTSSS
T ss_pred CccCCccCce
Confidence 4789999865
No 130
>PF04438 zf-HIT: HIT zinc finger; InterPro: IPR007529 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the HIT-type zinc finger, which contains 7 conserved cysteines and one histidine that can potentially coordinate two zinc atoms. It has been named after the first protein that originally defined the domain: the yeast HIT1 protein (P46973 from SWISSPROT) []. The HIT-type zinc finger displays some sequence similarities to the MYND-type zinc finger. The function of this domain is unknown but it is mainly found in nuclear proteins involved in gene regulation and chromatin remodeling. This domain is also found in the thyroid receptor interacting protein 3 (TRIP-3) Q15649 from SWISSPROT, that specifically interacts with the ligand binding domain of the thyroid receptor. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2YQP_A 2YQQ_A 1X4S_A.
Probab=72.21 E-value=1.5 Score=23.23 Aligned_cols=12 Identities=50% Similarity=1.079 Sum_probs=8.0
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
.++|+||.|+..
T Consensus 11 ~~kY~Cp~C~~~ 22 (30)
T PF04438_consen 11 PAKYRCPRCGAR 22 (30)
T ss_dssp EESEE-TTT--E
T ss_pred CCEEECCCcCCc
Confidence 689999999976
No 131
>PHA02768 hypothetical protein; Provisional
Probab=71.41 E-value=0.92 Score=27.65 Aligned_cols=18 Identities=33% Similarity=0.870 Sum_probs=13.6
Q ss_pred cccccCCCCcccccCCCC
Q psy838 35 AKYTCSFCGKEYKKSTDL 52 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~~l 52 (152)
.-|.||.||+.+...+.+
T Consensus 4 ~~y~C~~CGK~Fs~~~~L 21 (55)
T PHA02768 4 LGYECPICGEIYIKRKSM 21 (55)
T ss_pred cccCcchhCCeeccHHHH
Confidence 358999999997766544
No 132
>PF04339 DUF482: Protein of unknown function, DUF482; InterPro: IPR007434 This family contains several proteins of uncharacterised function.
Probab=71.41 E-value=21 Score=30.01 Aligned_cols=66 Identities=12% Similarity=0.155 Sum_probs=40.4
Q ss_pred cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC--CCceEEEEEeCCCCCeEEEEEEEee
Q psy838 46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD--DRACYLVAKNGSSSTPVAFSHFRFD 115 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~--~~~~~~v~~~~~~~~ivGf~~~~~~ 115 (152)
..+...+++++++.++.+..++..... +.++-...+...+.+ ++...+++... +|++||++.+-..
T Consensus 204 ~l~G~~i~~~~~~~f~~~Y~~Ty~k~~---~~~yLt~~FF~~l~~~m~~~~~l~~A~~-~g~~Va~aL~l~~ 271 (370)
T PF04339_consen 204 TLTGDEITDEDWDRFYRLYQNTYAKRW---GRPYLTREFFEQLAETMPEQVVLVVARR-DGQPVAFALCLRG 271 (370)
T ss_pred EEeCCCCCHHHHHHHHHHHHHHHHhhC---CChhhcHHHHHHHHHhCcCCEEEEEEEE-CCeEEEEEEEEEe
Confidence 667889999999999999887543332 234433333444433 33333333332 4999999877543
No 133
>PF14447 Prok-RING_4: Prokaryotic RING finger family 4
Probab=71.22 E-value=2.4 Score=25.85 Aligned_cols=12 Identities=33% Similarity=1.112 Sum_probs=9.3
Q ss_pred cCCCCcccccCC
Q psy838 39 CSFCGKEYKKST 50 (152)
Q Consensus 39 c~~c~~~~~~~~ 50 (152)
||+||+++-.++
T Consensus 42 CPfC~~~~~~~~ 53 (55)
T PF14447_consen 42 CPFCGTPFEFDD 53 (55)
T ss_pred CCCCCCcccCCC
Confidence 999999865543
No 134
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=69.90 E-value=2 Score=25.07 Aligned_cols=15 Identities=33% Similarity=0.888 Sum_probs=10.0
Q ss_pred ccCCCCcc--cccCCCC
Q psy838 38 TCSFCGKE--YKKSTDL 52 (152)
Q Consensus 38 ~c~~c~~~--~~~~~~l 52 (152)
.||.||.. ++++..+
T Consensus 1 ~CP~Cg~~a~ir~S~~~ 17 (47)
T PF04606_consen 1 RCPHCGSKARIRTSRQL 17 (47)
T ss_pred CcCCCCCeeEEEEchhh
Confidence 59999988 4444443
No 135
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.40 E-value=2.7 Score=30.00 Aligned_cols=11 Identities=27% Similarity=0.534 Sum_probs=6.4
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
.|++||.||+.
T Consensus 8 tKr~Cp~cg~k 18 (129)
T TIGR02300 8 TKRICPNTGSK 18 (129)
T ss_pred ccccCCCcCcc
Confidence 35566666665
No 136
>PF14787 zf-CCHC_5: GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=68.01 E-value=3.1 Score=23.07 Aligned_cols=15 Identities=27% Similarity=0.698 Sum_probs=9.7
Q ss_pred cccCCCCcccccCCC
Q psy838 37 YTCSFCGKEYKKSTD 51 (152)
Q Consensus 37 ~~c~~c~~~~~~~~~ 51 (152)
-.||.|||-.+++++
T Consensus 3 ~~CprC~kg~Hwa~~ 17 (36)
T PF14787_consen 3 GLCPRCGKGFHWASE 17 (36)
T ss_dssp -C-TTTSSSCS-TTT
T ss_pred ccCcccCCCcchhhh
Confidence 369999999777654
No 137
>KOG2593|consensus
Probab=67.25 E-value=2.7 Score=35.79 Aligned_cols=24 Identities=33% Similarity=0.531 Sum_probs=16.4
Q ss_pred HhHhHHhhh--hhcccccccCCCCcc
Q psy838 22 LRKMVKKME--ITQHAKYTCSFCGKE 45 (152)
Q Consensus 22 ~rk~~~~~~--~~~~~~~~c~~c~~~ 45 (152)
+||+++.-+ ....+-|.||+|.+.
T Consensus 112 m~krled~~~d~t~~~~Y~Cp~C~kk 137 (436)
T KOG2593|consen 112 MRKRLEDRLRDDTNVAGYVCPNCQKK 137 (436)
T ss_pred HHHHHHHHhhhccccccccCCccccc
Confidence 455555442 346778999999988
No 138
>PRK09710 lar restriction alleviation and modification protein; Reviewed
Probab=63.15 E-value=3.3 Score=25.98 Aligned_cols=8 Identities=50% Similarity=1.406 Sum_probs=7.1
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
-|||||..
T Consensus 8 PCPFCG~~ 15 (64)
T PRK09710 8 PCPFCGCP 15 (64)
T ss_pred CCCCCCCc
Confidence 49999988
No 139
>PF06689 zf-C4_ClpX: ClpX C4-type zinc finger; InterPro: IPR010603 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The ClpX heat shock protein of Escherichia coli is a member of the universally conserved Hsp100 family of proteins, and possesses a putative zinc finger motif of the C4 type []. This presumed zinc binding domain (ZBD) is found at the N terminus of the ClpX protein. ClpX is an ATPase which functions both as a substrate specificity component of the ClpXP protease and as a molecular chaperone. ZBD is a member of the treble clef zinc finger family, a motif known to facilitate protein-ligand, protein-DNA, and protein-protein interactions and forms a constitutive dimer that is essential for the degradation of some, but not all, ClpX substrates []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0016887 ATPase activity, 0046983 protein dimerization activity, 0006200 ATP catabolic process, 0019538 protein metabolic process; PDB: 2DS8_B 2DS6_B 2DS5_A 1OVX_A 2DS7_A.
Probab=62.96 E-value=3.3 Score=23.44 Aligned_cols=9 Identities=56% Similarity=1.431 Sum_probs=4.5
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
..|.|||+.
T Consensus 2 ~~CSFCgr~ 10 (41)
T PF06689_consen 2 KRCSFCGRP 10 (41)
T ss_dssp -B-TTT--B
T ss_pred CCccCCCCC
Confidence 579999998
No 140
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=62.90 E-value=3.2 Score=20.67 Aligned_cols=14 Identities=29% Similarity=0.878 Sum_probs=10.6
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
.|.|+.|++.+.+.
T Consensus 1 ~~~C~~C~~~F~~~ 14 (27)
T PF13912_consen 1 PFECDECGKTFSSL 14 (27)
T ss_dssp SEEETTTTEEESSH
T ss_pred CCCCCccCCccCCh
Confidence 47899999886543
No 141
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=62.43 E-value=9.5 Score=23.57 Aligned_cols=29 Identities=24% Similarity=0.411 Sum_probs=23.1
Q ss_pred ccCCCCcccccCCCCCHHHHHHHHHHHHH
Q psy838 38 TCSFCGKEYKKSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 38 ~c~~c~~~~~~~~~l~~~d~~~i~~l~~~ 66 (152)
-|+.||+.+...+..-.+.-.+|++-++.
T Consensus 10 HC~VCg~aIp~de~~CSe~C~eil~ker~ 38 (64)
T COG4068 10 HCVVCGKAIPPDEQVCSEECGEILNKERK 38 (64)
T ss_pred cccccCCcCCCccchHHHHHHHHHHHHHH
Confidence 49999999988888888877777765543
No 142
>TIGR03655 anti_R_Lar restriction alleviation protein, Lar family. Restriction alleviation proteins provide a countermeasure to host cell restriction enzyme defense against foreign DNA such as phage or plasmids. This family consists of homologs to the phage antirestriction protein Lar, and most members belong to phage genomes or prophage regions of bacterial genomes.
Probab=62.25 E-value=3.4 Score=24.62 Aligned_cols=8 Identities=50% Similarity=1.497 Sum_probs=6.5
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
-|||||..
T Consensus 3 PCPfCGg~ 10 (53)
T TIGR03655 3 PCPFCGGA 10 (53)
T ss_pred CCCCCCCc
Confidence 59999976
No 143
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=62.18 E-value=6.1 Score=23.44 Aligned_cols=25 Identities=16% Similarity=0.402 Sum_probs=20.9
Q ss_pred hhHhHhHHhhhhhcccccccCCCCc
Q psy838 20 ASLRKMVKKMEITQHAKYTCSFCGK 44 (152)
Q Consensus 20 ~~~rk~~~~~~~~~~~~~~c~~c~~ 44 (152)
..|...+.+-.......+.||.|.+
T Consensus 15 ~~L~~H~~~~H~~~~~~v~CPiC~~ 39 (54)
T PF05605_consen 15 SSLVEHCEDEHRSESKNVVCPICSS 39 (54)
T ss_pred HHHHHHHHhHCcCCCCCccCCCchh
Confidence 4688888888888878999999964
No 144
>COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only]
Probab=60.73 E-value=7.4 Score=33.76 Aligned_cols=28 Identities=14% Similarity=0.243 Sum_probs=24.0
Q ss_pred EEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838 123 LYCYELQLEKQVQRKGLGKFMMQAFYTY 150 (152)
Q Consensus 123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~ 150 (152)
..|-++.|||+||+-|+|..-|....++
T Consensus 242 ariarvvvhpdyr~dglg~~sv~~a~ew 269 (593)
T COG2401 242 ARIARVVVHPDYRADGLGQLSVIAALEW 269 (593)
T ss_pred hheeEEEeccccccCccchhHHHHHHHH
Confidence 4488999999999999999988877665
No 145
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=60.25 E-value=3.8 Score=21.96 Aligned_cols=12 Identities=33% Similarity=0.985 Sum_probs=9.0
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
.....||.||..
T Consensus 15 ~~~~~CP~Cg~~ 26 (33)
T cd00350 15 EAPWVCPVCGAP 26 (33)
T ss_pred cCCCcCcCCCCc
Confidence 367889998864
No 146
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=59.59 E-value=6 Score=19.86 Aligned_cols=13 Identities=23% Similarity=0.815 Sum_probs=9.7
Q ss_pred ccccCCCCccccc
Q psy838 36 KYTCSFCGKEYKK 48 (152)
Q Consensus 36 ~~~c~~c~~~~~~ 48 (152)
.|.|+.|++.+.+
T Consensus 1 q~~C~~C~k~f~~ 13 (27)
T PF12171_consen 1 QFYCDACDKYFSS 13 (27)
T ss_dssp -CBBTTTTBBBSS
T ss_pred CCCcccCCCCcCC
Confidence 4789999988654
No 147
>PF14353 CpXC: CpXC protein
Probab=59.50 E-value=6 Score=27.69 Aligned_cols=20 Identities=25% Similarity=0.504 Sum_probs=15.7
Q ss_pred HHhhhhhcccccccCCCCcc
Q psy838 26 VKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 26 ~~~~~~~~~~~~~c~~c~~~ 45 (152)
..+|-..+...++||.||+.
T Consensus 28 ~e~il~g~l~~~~CP~Cg~~ 47 (128)
T PF14353_consen 28 KEKILDGSLFSFTCPSCGHK 47 (128)
T ss_pred HHHHHcCCcCEEECCCCCCc
Confidence 34555667779999999998
No 148
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=59.14 E-value=4.1 Score=33.67 Aligned_cols=17 Identities=35% Similarity=0.860 Sum_probs=13.7
Q ss_pred cccccCCCCcccccCCC
Q psy838 35 AKYTCSFCGKEYKKSTD 51 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~~ 51 (152)
.+|.|++|+|.|.-+..
T Consensus 397 KPYrCevC~KRYKNlNG 413 (423)
T COG5189 397 KPYRCEVCDKRYKNLNG 413 (423)
T ss_pred CceeccccchhhccCcc
Confidence 78999999998665543
No 149
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=58.61 E-value=9.4 Score=29.43 Aligned_cols=62 Identities=21% Similarity=0.143 Sum_probs=40.3
Q ss_pred cceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcccc--cCCCCCHHHHHHHHHHHHH
Q psy838 5 TKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYK--KSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 5 ~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~--~~~~l~~~d~~~i~~l~~~ 66 (152)
|++|.+.+-=..+.-.=++++++.+.---..-.+||.||--.. ....+++.+.+.+.+....
T Consensus 17 ~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~~F~~l~~~~~~~i~~~i~~ 80 (214)
T PF09986_consen 17 TKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEEDFEKLSPEQKEKIKENISS 80 (214)
T ss_pred eeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCcccccccccCCHHHHHHHHHHHHh
Confidence 3444443332335556666777777777778889999997732 2467788888877775543
No 150
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=58.57 E-value=27 Score=20.62 Aligned_cols=34 Identities=24% Similarity=0.527 Sum_probs=16.4
Q ss_pred HhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHH
Q psy838 22 LRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELE 64 (152)
Q Consensus 22 ~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~ 64 (152)
+.+.+.++... +-.||.||++ ++++....+..=.
T Consensus 9 ~~k~i~~l~~~---~~~CPlC~r~------l~~e~~~~li~~~ 42 (54)
T PF04423_consen 9 LKKYIEELKEA---KGCCPLCGRP------LDEEHRQELIKKY 42 (54)
T ss_dssp HHHHHHHHTT----SEE-TTT--E------E-HHHHHHHHHHH
T ss_pred HHHHHHHHhcC---CCcCCCCCCC------CCHHHHHHHHHHH
Confidence 44555555543 3399999754 5555555555433
No 151
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=58.45 E-value=4.9 Score=22.74 Aligned_cols=9 Identities=33% Similarity=1.040 Sum_probs=6.1
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
|.||.||..
T Consensus 1 m~Cp~Cg~~ 9 (43)
T PF08271_consen 1 MKCPNCGSK 9 (43)
T ss_dssp ESBTTTSSS
T ss_pred CCCcCCcCC
Confidence 467777765
No 152
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=58.15 E-value=4.8 Score=24.87 Aligned_cols=12 Identities=50% Similarity=0.980 Sum_probs=9.2
Q ss_pred cccccccCCCCc
Q psy838 33 QHAKYTCSFCGK 44 (152)
Q Consensus 33 ~~~~~~c~~c~~ 44 (152)
|-..|+||.||-
T Consensus 45 ~~~~Y~CP~CGF 56 (59)
T PRK14890 45 QSNPYTCPKCGF 56 (59)
T ss_pred cCCceECCCCCC
Confidence 446899999984
No 153
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=57.92 E-value=4.8 Score=24.94 Aligned_cols=12 Identities=33% Similarity=0.753 Sum_probs=9.2
Q ss_pred cccccccCCCCc
Q psy838 33 QHAKYTCSFCGK 44 (152)
Q Consensus 33 ~~~~~~c~~c~~ 44 (152)
+-..|+||.||-
T Consensus 47 ~g~~Y~Cp~CGF 58 (61)
T COG2888 47 LGNPYRCPKCGF 58 (61)
T ss_pred cCCceECCCcCc
Confidence 456889999984
No 154
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=57.79 E-value=17 Score=22.44 Aligned_cols=24 Identities=17% Similarity=0.418 Sum_probs=15.2
Q ss_pred ccCCCCcccccCCCCCHHHHHHHH
Q psy838 38 TCSFCGKEYKKSTDLGAETKKWVW 61 (152)
Q Consensus 38 ~c~~c~~~~~~~~~l~~~d~~~i~ 61 (152)
.||.||+++.+.+..-.++=...+
T Consensus 5 HC~~CG~~Ip~~~~fCS~~C~~~~ 28 (59)
T PF09889_consen 5 HCPVCGKPIPPDESFCSPKCREEY 28 (59)
T ss_pred cCCcCCCcCCcchhhhCHHHHHHH
Confidence 599999987765555444433333
No 155
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=57.71 E-value=6.2 Score=25.36 Aligned_cols=19 Identities=16% Similarity=0.543 Sum_probs=13.4
Q ss_pred cccCCCCcc--cccCCCCCHH
Q psy838 37 YTCSFCGKE--YKKSTDLGAE 55 (152)
Q Consensus 37 ~~c~~c~~~--~~~~~~l~~~ 55 (152)
..||+||.. ++++..+++.
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~ 22 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT 22 (72)
T ss_pred ccCCCCCCccEEEEChhcChh
Confidence 479999988 5666655543
No 156
>PF12294 DUF3626: Protein of unknown function (DUF3626); InterPro: IPR022074 This family of proteins is found in bacteria. Proteins in this family are typically between 294 and 374 amino acids in length.
Probab=57.56 E-value=12 Score=30.38 Aligned_cols=22 Identities=9% Similarity=-0.008 Sum_probs=18.1
Q ss_pred EeeeeeccccccCCHHHHHHHH
Q psy838 125 CYELQLEKQVQRKGLGKFMMQA 146 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~ 146 (152)
|..|.++|.|||.-+++.+-+.
T Consensus 191 VeaLVlDPsyrgT~ve~~~~~l 212 (297)
T PF12294_consen 191 VEALVLDPSYRGTEVEAAARAL 212 (297)
T ss_pred hHHHhcCccccCChHHHHHHHH
Confidence 5568899999999999987543
No 157
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=57.51 E-value=3.4 Score=23.74 Aligned_cols=10 Identities=30% Similarity=1.032 Sum_probs=8.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
..+||+||..
T Consensus 21 ~~~Cp~CG~~ 30 (46)
T PRK00398 21 GVRCPYCGYR 30 (46)
T ss_pred ceECCCCCCe
Confidence 6789999887
No 158
>TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members
Probab=57.12 E-value=1e+02 Score=24.84 Aligned_cols=112 Identities=11% Similarity=0.035 Sum_probs=64.2
Q ss_pred cccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC-CCChhhhhhhh--cCCC
Q psy838 16 TRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDI-GWDPQGKHSEM--FDDR 92 (152)
Q Consensus 16 ~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~-~w~~~~~~~el--~~~~ 92 (152)
.++.++.|+++++.+.. -.++ .. .+|++..+++..++|... +. ..+.+.+..-+ ..+.
T Consensus 134 ~~~~~k~R~~IRka~k~---Gv~v-------~~-----~~~l~~F~~l~~~t~~r~----g~p~~~~~~f~~l~~~~~~~ 194 (330)
T TIGR03019 134 LAIPRKQRAMVRKGIKA---GLTV-------TV-----DGDLDRFYDVYAENMRDL----GTPVFSRRYFRLLKDVFGED 194 (330)
T ss_pred HhcCHHHHHHHHHHHHC---CeEE-------EE-----CCcHHHHHHHHHHHHhcC----CCCCCCHHHHHHHHHhcccC
Confidence 34566778888876543 2221 11 134888888888877542 22 34454443322 1234
Q ss_pred ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
..++++... +|++|+.+.+... . . .++.+-..-.++++..+.+..|+..+++.|
T Consensus 195 ~~l~~a~~~-~g~~va~~l~~~~-~--~-~~~~~~~g~~~~~~~~~~~~lL~w~~i~~a 248 (330)
T TIGR03019 195 CEVLTVRLG-DGVVASAVLSFYF-R--D-EVLPYYAGGLREARDVAANDLMYWELMRRA 248 (330)
T ss_pred EEEEEEEeC-CCCEEEEEEEEEe-C--C-EEEEEeccChHHHHhhChHHHHHHHHHHHH
Confidence 445555522 3787776554221 1 1 122223456789999999999999888765
No 159
>smart00355 ZnF_C2H2 zinc finger.
Probab=57.12 E-value=5.5 Score=18.76 Aligned_cols=12 Identities=50% Similarity=1.204 Sum_probs=9.1
Q ss_pred cccCCCCccccc
Q psy838 37 YTCSFCGKEYKK 48 (152)
Q Consensus 37 ~~c~~c~~~~~~ 48 (152)
|.|+.|++.+..
T Consensus 1 ~~C~~C~~~f~~ 12 (26)
T smart00355 1 YRCPECGKVFKS 12 (26)
T ss_pred CCCCCCcchhCC
Confidence 579999877554
No 160
>PF03119 DNA_ligase_ZBD: NAD-dependent DNA ligase C4 zinc finger domain; InterPro: IPR004149 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the zinc finger domain found in NAD-dependent DNA ligases. DNA ligases catalyse the crucial step of joining the breaks in duplex DNA during DNA replication, repair and recombination, utilizing either ATP or NAD(+) as a cofactor []. This domain is a small zinc binding motif that is presumably DNA binding. It is found only in NAD-dependent DNA ligases. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair; PDB: 1DGS_A 1V9P_B 2OWO_A.
Probab=57.03 E-value=5.6 Score=20.65 Aligned_cols=10 Identities=40% Similarity=1.072 Sum_probs=5.3
Q ss_pred ccCCCCcccc
Q psy838 38 TCSFCGKEYK 47 (152)
Q Consensus 38 ~c~~c~~~~~ 47 (152)
+||.||....
T Consensus 1 ~CP~C~s~l~ 10 (28)
T PF03119_consen 1 TCPVCGSKLV 10 (28)
T ss_dssp B-TTT--BEE
T ss_pred CcCCCCCEeE
Confidence 5999998844
No 161
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=55.97 E-value=3.6 Score=28.07 Aligned_cols=14 Identities=29% Similarity=0.667 Sum_probs=9.1
Q ss_pred hcccccccCCCCcc
Q psy838 32 TQHAKYTCSFCGKE 45 (152)
Q Consensus 32 ~~~~~~~c~~c~~~ 45 (152)
.....++||+||..
T Consensus 17 klpt~f~CP~Cge~ 30 (99)
T PRK14892 17 KLPKIFECPRCGKV 30 (99)
T ss_pred CCCcEeECCCCCCe
Confidence 33456778888843
No 162
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=55.72 E-value=14 Score=25.85 Aligned_cols=31 Identities=19% Similarity=0.391 Sum_probs=25.2
Q ss_pred ccccCCCCcc------cccCCCCCHHHHHHHHHHHHH
Q psy838 36 KYTCSFCGKE------YKKSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 36 ~~~c~~c~~~------~~~~~~l~~~d~~~i~~l~~~ 66 (152)
.+.||.|+.. ..+...++++|++.+..+...
T Consensus 12 ~l~C~~C~t~i~G~F~l~~~~~L~~E~~~Fi~~Fi~~ 48 (113)
T PF09862_consen 12 RLKCPSCGTEIEGEFELPWFARLSPEQLEFIKLFIKN 48 (113)
T ss_pred EEEcCCCCCEEEeeeccchhhcCCHHHHHHHHHHHHh
Confidence 6778888887 677799999999988876554
No 163
>PRK15312 antimicrobial resistance protein Mig-14; Provisional
Probab=55.41 E-value=35 Score=27.86 Aligned_cols=116 Identities=10% Similarity=0.157 Sum_probs=63.5
Q ss_pred eeeeccc-cccccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh----
Q psy838 7 KVGITGK-YGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDP---- 81 (152)
Q Consensus 7 ~~~~~~~-~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~---- 81 (152)
..+.+.- |-..+-.+-|+-+++.+.. |-.+++.++.+++++.+|+.- +|+. .|+-..
T Consensus 127 ~~claK~~fSkKt~~~rrrEl~kF~~~----------GG~v~~is~fS~~Ela~iY~~-------Lf~~-Rwg~~~~~~~ 188 (298)
T PRK15312 127 QNCLVKETFSSKFEKTRRNEYQRFLRN----------GGSVKSVADCSSDELTHIFIE-------LFRS-RFGNTLSCYP 188 (298)
T ss_pred hhhhehhhhhhHhHHHHHHHHHHHHHc----------CCEEEEhHHCCHHHHHHHHHH-------HHHH-HhCCCCCccc
Confidence 3444444 5555555555555554432 455889999999999988842 3333 333222
Q ss_pred hhhhhhhcCC--C--ceEEEEEeCCCCCeEEEEEEEeeccCCcce-EEEeeeeeccccccCCHHHHHH
Q psy838 82 QGKHSEMFDD--R--ACYLVAKNGSSSTPVAFSHFRFDVDFGEPV-LYCYELQLEKQVQRKGLGKFMM 144 (152)
Q Consensus 82 ~~~~~el~~~--~--~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~-~~I~~i~V~p~~qgkGiG~~Lm 144 (152)
.....++.+. + .-.++ +-+ |++++|=.+... +....+ +..-+.-++|++..--+|+-||
T Consensus 189 ~~~l~e~f~~Lr~l~fG~VL-fl~--~~PcA~qlv~k~-eSp~wi~~D~iNgG~Dpe~~~~spGSIL~ 252 (298)
T PRK15312 189 ADNLANFFSQLRHLLFGHIL-YIE--GIPCAFDIVLKS-ESQMNVYFDVPNGAVKNECMPLSPGSILM 252 (298)
T ss_pred HHHHHHHHHHhHHhheeeEE-EEC--CcceEEEEEEEe-cCCCcEEEecccCccCcccccCCCccEEE
Confidence 2222333321 1 11222 233 899998655432 222221 1133456999999999998876
No 164
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=54.99 E-value=9.7 Score=28.71 Aligned_cols=13 Identities=31% Similarity=0.905 Sum_probs=8.4
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
....++||.||.+
T Consensus 129 ~~~~F~Cp~Cg~~ 141 (176)
T COG1675 129 MELGFTCPKCGED 141 (176)
T ss_pred HHhCCCCCCCCch
Confidence 3445777777766
No 165
>COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]
Probab=54.63 E-value=1e+02 Score=23.97 Aligned_cols=96 Identities=13% Similarity=0.151 Sum_probs=58.2
Q ss_pred CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhh----hhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838 50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKH----SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD-------- 117 (152)
Q Consensus 50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~----~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~-------- 117 (152)
....++-+++++.|-.+ .|.. ..+|+....- ++.-+.+..|+++... +|+++|.+.+-+...
T Consensus 10 ~~~~~~~l~em~rlR~~----vF~e-rL~W~v~~~~g~E~DqyD~~~t~Yll~~~~-~g~I~G~~RlLptt~P~mL~~vF 83 (209)
T COG3916 10 RELFPKALEEMHRLRYQ----VFKE-RLGWDVVCIDGFEIDQYDNLDTVYLLALTS-DGRIVGCVRLLPTTGPYMLTDVF 83 (209)
T ss_pred chhcHHHHHHHHHHHHH----HHHH-hcCCceeccCCccccccCCCCceEEEEEcC-CCcEEEEEEeccCCCcchhhhhh
Confidence 44566778888888776 4444 4567654321 2223456778887544 699999999865321
Q ss_pred ----------CCcceEEEeeeeecc--ccccC---C-HHHHHHHHHHhhc
Q psy838 118 ----------FGEPVLYCYELQLEK--QVQRK---G-LGKFMMQAFYTYI 151 (152)
Q Consensus 118 ----------~~~~~~~I~~i~V~p--~~qgk---G-iG~~Lm~~l~~~a 151 (152)
....+++..++.|++ .-+.. . ++..||..+.++|
T Consensus 84 ~~Ll~~~~~P~~p~vwEsSRF~vd~~~a~~~~g~~~~a~~el~~g~ie~a 133 (209)
T COG3916 84 PALLEGGPPPSSPGVWESSRFAVDKPSARRAAGGVSPAAYELFAGMIEYA 133 (209)
T ss_pred HHHhcCCCCCCCCCeEEEeeeeeccccchhhcCCccHHHHHHHHHHHHHH
Confidence 112467799998886 22222 2 3556666665543
No 166
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=53.96 E-value=5.9 Score=19.87 Aligned_cols=11 Identities=27% Similarity=1.204 Sum_probs=8.6
Q ss_pred cccCCCCcccc
Q psy838 37 YTCSFCGKEYK 47 (152)
Q Consensus 37 ~~c~~c~~~~~ 47 (152)
..||.||+.+.
T Consensus 3 ~~C~~CgR~F~ 13 (25)
T PF13913_consen 3 VPCPICGRKFN 13 (25)
T ss_pred CcCCCCCCEEC
Confidence 46999998864
No 167
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=53.40 E-value=5.6 Score=21.55 Aligned_cols=12 Identities=33% Similarity=0.825 Sum_probs=7.9
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
.++..||.||..
T Consensus 16 ~~p~~CP~Cg~~ 27 (34)
T cd00729 16 EAPEKCPICGAP 27 (34)
T ss_pred cCCCcCcCCCCc
Confidence 355678888754
No 168
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=52.55 E-value=11 Score=27.28 Aligned_cols=15 Identities=33% Similarity=0.857 Sum_probs=11.7
Q ss_pred cccccccCCCCcccc
Q psy838 33 QHAKYTCSFCGKEYK 47 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~ 47 (152)
...-|.||.||..+.
T Consensus 96 ~~~~Y~Cp~C~~~y~ 110 (147)
T smart00531 96 NNAYYKCPNCQSKYT 110 (147)
T ss_pred CCcEEECcCCCCEee
Confidence 567899999986654
No 169
>COG2075 RPL24A Ribosomal protein L24E [Translation, ribosomal structure and biogenesis]
Probab=52.23 E-value=7 Score=24.67 Aligned_cols=14 Identities=43% Similarity=0.952 Sum_probs=10.1
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
-.+|.|||+++-+.
T Consensus 3 ~~~CsFcG~~I~PG 16 (66)
T COG2075 3 VRVCSFCGKKIEPG 16 (66)
T ss_pred eeEecCcCCccCCC
Confidence 36899999984443
No 170
>COG2260 Predicted Zn-ribbon RNA-binding protein [Translation, ribosomal structure and biogenesis]
Probab=52.19 E-value=7.8 Score=23.88 Aligned_cols=20 Identities=20% Similarity=0.576 Sum_probs=13.3
Q ss_pred cccCCCCcc--cccCCCCCHHH
Q psy838 37 YTCSFCGKE--YKKSTDLGAET 56 (152)
Q Consensus 37 ~~c~~c~~~--~~~~~~l~~~d 56 (152)
-+||.||.+ ..--...+++|
T Consensus 18 e~Cp~CG~~t~~~~PprFSPeD 39 (59)
T COG2260 18 EKCPVCGGDTKVPHPPRFSPED 39 (59)
T ss_pred ccCCCCCCccccCCCCCCCccc
Confidence 579999988 33334556666
No 171
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=51.96 E-value=9.4 Score=18.82 Aligned_cols=6 Identities=50% Similarity=1.497 Sum_probs=3.2
Q ss_pred cCCCCc
Q psy838 39 CSFCGK 44 (152)
Q Consensus 39 c~~c~~ 44 (152)
||.||+
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555554
No 172
>PRK13130 H/ACA RNA-protein complex component Nop10p; Reviewed
Probab=50.83 E-value=10 Score=23.15 Aligned_cols=21 Identities=29% Similarity=0.608 Sum_probs=13.7
Q ss_pred ccccCCCCcccccCC--CCCHHH
Q psy838 36 KYTCSFCGKEYKKST--DLGAET 56 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~--~l~~~d 56 (152)
+-.||.||.....++ ..+++|
T Consensus 17 k~~CP~CG~~t~~~~P~rfSp~D 39 (56)
T PRK13130 17 KEICPVCGGKTKNPHPPRFSPED 39 (56)
T ss_pred cccCcCCCCCCCCCCCCCCCCCC
Confidence 567999998855443 345555
No 173
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=50.76 E-value=22 Score=25.31 Aligned_cols=21 Identities=29% Similarity=0.773 Sum_probs=16.7
Q ss_pred ccccccCCCCcc--cccCCCCCH
Q psy838 34 HAKYTCSFCGKE--YKKSTDLGA 54 (152)
Q Consensus 34 ~~~~~c~~c~~~--~~~~~~l~~ 54 (152)
++.|.|+.|++. .....++..
T Consensus 51 ~qRyrC~~C~~tf~~~~~~~~~~ 73 (129)
T COG3677 51 HQRYKCKSCGSTFTVETGSPLSK 73 (129)
T ss_pred ccccccCCcCcceeeeccCcccc
Confidence 889999999999 555566665
No 174
>PF04377 ATE_C: Arginine-tRNA-protein transferase, C terminus; InterPro: IPR007472 Arginine-tRNA-protein transferase catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N terminus rule pathway of protein degradation by conjugating a destabilising amino acid to the N-terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N-terminal cysteine is sometimes modified []. In Saccharomyces cerevisiae, Cys20, 23, 94 and/or 95 are thought to be important for activity []. Of these, only Cys 94 appears to be completely conserved in this family. This entry represents the C-terminal region of the enzyme arginine-tRNA-protein transferase, found in both eukaryotic and prokaryotic enzymes.; GO: 0004057 arginyltransferase activity, 0016598 protein arginylation
Probab=50.73 E-value=88 Score=22.19 Aligned_cols=45 Identities=11% Similarity=0.195 Sum_probs=31.4
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+|++||-+.+-.. +++-.. |+.+ -+|++...++|+..+-.-+++|
T Consensus 47 ~~kLiav~v~D~l-~~glSa--VY~f-yDPd~~~~SlG~~~iL~eI~~a 91 (128)
T PF04377_consen 47 DGKLIAVAVVDIL-PDGLSA--VYTF-YDPDYSKRSLGTYSILREIELA 91 (128)
T ss_pred CCeEEEEEEeecc-cchhhh--eeee-eCCCccccCcHHHHHHHHHHHH
Confidence 4999998776443 333222 3322 5899999999999887777665
No 175
>PF05741 zf-nanos: Nanos RNA binding domain; InterPro: IPR024161 Nanos is a highly conserved RNA-binding protein in higher eukaryotes and functions as a key regulatory protein in translational control using a 3' untranslated region during the development and maintenance of germ cells. Nanos comprises a non-conserved amino-terminus and highly conserved carboxy- terminal regions. The C-terminal region has two conserved Cys-Cys-His-Cys (CCHC)-type zinc-finger motifs that are indispensable for nanos function [, , ]. The structure of the nanos-type zinc finger is composed of two independent zinc-finger (ZF) lobes, the N-terminal ZF1 and the C-terminal ZF2, which are connected by a linker helix []. These lobes create a large cleft. Zinc ions in ZF1 and ZF2 are bound to the CCHC motif by tetrahedral coordination.; PDB: 3ALR_B.
Probab=50.56 E-value=4.9 Score=24.47 Aligned_cols=23 Identities=22% Similarity=0.544 Sum_probs=8.2
Q ss_pred hHhHHhhhhhcccccccCCCCcc
Q psy838 23 RKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 23 rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|+.-.++.=+...+|+||.||.-
T Consensus 20 k~~~G~v~CPvLr~y~Cp~CgAt 42 (55)
T PF05741_consen 20 KDPDGRVTCPVLRKYVCPICGAT 42 (55)
T ss_dssp B-TTS-B--TTGGG---TTT---
T ss_pred ECCCCCEeCHHHhcCcCCCCcCc
Confidence 33333444455678999999964
No 176
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=50.24 E-value=6.7 Score=23.87 Aligned_cols=25 Identities=20% Similarity=0.430 Sum_probs=16.8
Q ss_pred hcccccccCCCCcccccCCCCCHHH
Q psy838 32 TQHAKYTCSFCGKEYKKSTDLGAET 56 (152)
Q Consensus 32 ~~~~~~~c~~c~~~~~~~~~l~~~d 56 (152)
..+.+|+||.||-+..-+...=.+|
T Consensus 10 ~~~v~~~Cp~cGipthcS~ehw~~D 34 (55)
T PF13824_consen 10 PAHVNFECPDCGIPTHCSEEHWEDD 34 (55)
T ss_pred ccccCCcCCCCCCcCccCHHHHHHh
Confidence 4567899999999854444443444
No 177
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=50.13 E-value=6.6 Score=25.70 Aligned_cols=14 Identities=21% Similarity=0.695 Sum_probs=4.2
Q ss_pred hhcccccccCCCCc
Q psy838 31 ITQHAKYTCSFCGK 44 (152)
Q Consensus 31 ~~~~~~~~c~~c~~ 44 (152)
......+.||||+.
T Consensus 17 ~~l~~~F~CPfC~~ 30 (81)
T PF05129_consen 17 PKLPKVFDCPFCNH 30 (81)
T ss_dssp ---SS----TTT--
T ss_pred CCCCceEcCCcCCC
Confidence 34456899999993
No 178
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.66 E-value=10 Score=28.27 Aligned_cols=25 Identities=20% Similarity=0.599 Sum_probs=17.6
Q ss_pred hhHhHhHHhhhhhcccccccCCCCccc
Q psy838 20 ASLRKMVKKMEITQHAKYTCSFCGKEY 46 (152)
Q Consensus 20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~ 46 (152)
..+|+..+.++..+ .|.||.||-.+
T Consensus 120 ~~~~~~Le~~~~~~--~~vC~vCGy~~ 144 (166)
T COG1592 120 EMFRGLLERLEEGK--VWVCPVCGYTH 144 (166)
T ss_pred HHHHHHHHhhhcCC--EEEcCCCCCcc
Confidence 44566666666655 89999999653
No 179
>PF10058 DUF2296: Predicted integral membrane metal-binding protein (DUF2296); InterPro: IPR019273 This domain, found mainly in the eukaryotic lunapark proteins, has no known function [].
Probab=49.54 E-value=7.3 Score=23.51 Aligned_cols=15 Identities=33% Similarity=0.972 Sum_probs=11.2
Q ss_pred hhhcccccccCCCCc
Q psy838 30 EITQHAKYTCSFCGK 44 (152)
Q Consensus 30 ~~~~~~~~~c~~c~~ 44 (152)
+....-.|.||.||.
T Consensus 38 ~~~~~i~y~C~~Cg~ 52 (54)
T PF10058_consen 38 EEFEEIQYRCPYCGA 52 (54)
T ss_pred ccCCceEEEcCCCCC
Confidence 444555999999985
No 180
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=48.86 E-value=21 Score=25.44 Aligned_cols=54 Identities=11% Similarity=0.184 Sum_probs=30.0
Q ss_pred cccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEE
Q psy838 35 AKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLV 97 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v 97 (152)
....||.|... .+..++.+-.-++-.. .++.-..|.+++++.+-|.+....+++
T Consensus 39 ~~LRC~vCqnq-----siadSna~iA~dmR~~----Vr~~i~~G~sd~eI~~~~v~RYG~~Vl 92 (126)
T PRK10144 39 SQLRCPQCQNQ-----NLLESNAPVAVSMRHQ----VYSMVAEGKSEVEIIGWMTERYGDFVR 92 (126)
T ss_pred HcCCCCCCCCC-----ChhhcCCHHHHHHHHH----HHHHHHcCCCHHHHHHHHHHhcCCeEE
Confidence 35789999755 3333333333333222 222224578999988888765444433
No 181
>PF07395 Mig-14: Mig-14; InterPro: IPR009977 This family contains a number of bacterial mig-14 proteins (approximately 270 residues long). In Salmonella, mig-14 contributes to resistance to antimicrobial peptides, although the mechanism is not fully understood [].
Probab=48.50 E-value=57 Score=26.23 Aligned_cols=115 Identities=10% Similarity=0.168 Sum_probs=65.1
Q ss_pred eeeecc---ccccccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCC--Ch
Q psy838 7 KVGITG---KYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGW--DP 81 (152)
Q Consensus 7 ~~~~~~---~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w--~~ 81 (152)
..+++. .|-..+-.+-|+.+++.+.. |-.+++.++++++++.+|+.- +|+. -|+. ..
T Consensus 97 ~iclaK~~e~fSkKt~~~rrrElrkF~~~----------GG~v~~v~~~S~~Ela~iY~~-------Lf~~-Rwg~~~~~ 158 (264)
T PF07395_consen 97 QICLAKGPESFSKKTRKNRRRELRKFIEA----------GGSVRPVSEFSPEELADIYID-------LFQK-RWGFRCYG 158 (264)
T ss_pred ceeeEcCchhhchHHHHHHHHHHHHHHHc----------CCEEEEHHHCCHHHHHHHHHH-------HHHH-HhCCCCCc
Confidence 444444 56655556666666665543 445888999999999998853 3332 3331 11
Q ss_pred hhhhhhhcCCC----ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE--eeeeeccccccCCHHHHHH
Q psy838 82 QGKHSEMFDDR----ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC--YELQLEKQVQRKGLGKFMM 144 (152)
Q Consensus 82 ~~~~~el~~~~----~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I--~~i~V~p~~qgkGiG~~Lm 144 (152)
.....++.+.- .-.++. -+ |+++++=.+.-. +... .+++ -+.-++|++..--+|+.||
T Consensus 159 ~~~l~e~f~~Lr~~~fG~vL~-l~--~~P~Aiqlv~k~-es~~-wv~~D~iNgG~Dp~~~~~SpGSiL~ 222 (264)
T PF07395_consen 159 KEHLAEFFSELRHMIFGSVLF-LN--GQPCAIQLVYKV-ESPK-WVYFDYINGGYDPECRDFSPGSILM 222 (264)
T ss_pred HHHHHHHHHHhHHhheeeEEE-EC--CcceEEEEEEEe-cCCC-eEEEecccCccCcccccCCCccEEE
Confidence 12233333211 111222 33 899998655432 2222 2332 3445899999999998876
No 182
>PF09390 DUF1999: Protein of unknown function (DUF1999); InterPro: IPR018987 This family contains a putative Fe-S binding reductase (Q72J89 from SWISSPROT) whose structure adopts an alpha and beta fold. ; PDB: 2D4O_A 2D4P_A.
Probab=47.83 E-value=65 Score=23.70 Aligned_cols=56 Identities=11% Similarity=0.073 Sum_probs=31.4
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
..|-+++.++ ++++.||+.....-.-..|.+.+..|.+.|. +....-.-||.++.+
T Consensus 54 sgHSFvA~~e-~~~~~GfvLAQaVWQGdrptVlV~ri~~~~~-~~~~~~~GLLrAvvK 109 (161)
T PF09390_consen 54 SGHSFVAEDE-GGELQGFVLAQAVWQGDRPTVLVRRILLAPG-EPEEVYEGLLRAVVK 109 (161)
T ss_dssp CS--EEEE-E-TTEEEEEEEEEEEE-SSSEEEEEEEE---EE-SSHHHHHHHHHHHHH
T ss_pred cCCcEEEEcc-CCceeeeeehhHHhcCCCceEEEEEeecCCC-CcHHHHHHHHHHHHH
Confidence 3445666633 4899999987654333468888999987765 233455555555543
No 183
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=47.69 E-value=7.2 Score=20.98 Aligned_cols=17 Identities=24% Similarity=0.624 Sum_probs=10.2
Q ss_pred hhhhcccccccCCCCcc
Q psy838 29 MEITQHAKYTCSFCGKE 45 (152)
Q Consensus 29 ~~~~~~~~~~c~~c~~~ 45 (152)
+++.....-.||.||-.
T Consensus 10 ~~~~~~~~irC~~CG~R 26 (32)
T PF03604_consen 10 VELKPGDPIRCPECGHR 26 (32)
T ss_dssp E-BSTSSTSSBSSSS-S
T ss_pred eEcCCCCcEECCcCCCe
Confidence 34555666788888865
No 184
>PF09943 DUF2175: Uncharacterized protein conserved in archaea (DUF2175); InterPro: IPR018686 This family of various hypothetical archaeal proteins has no known function.
Probab=47.48 E-value=5.4 Score=27.33 Aligned_cols=13 Identities=38% Similarity=0.912 Sum_probs=10.3
Q ss_pred cccccCCCCcccc
Q psy838 35 AKYTCSFCGKEYK 47 (152)
Q Consensus 35 ~~~~c~~c~~~~~ 47 (152)
.+++|-+||+++.
T Consensus 1 ~kWkC~iCg~~I~ 13 (101)
T PF09943_consen 1 KKWKCYICGKPIY 13 (101)
T ss_pred CceEEEecCCeee
Confidence 3689999999933
No 185
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.22 E-value=5.9 Score=24.30 Aligned_cols=14 Identities=21% Similarity=0.721 Sum_probs=10.1
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
-..||.||+.++..
T Consensus 17 ~lrCPRC~~~FR~~ 30 (65)
T COG4049 17 FLRCPRCGMVFRRR 30 (65)
T ss_pred eeeCCchhHHHHHh
Confidence 46799999775543
No 186
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=46.36 E-value=11 Score=19.21 Aligned_cols=8 Identities=38% Similarity=0.978 Sum_probs=4.1
Q ss_pred cccCCCCc
Q psy838 37 YTCSFCGK 44 (152)
Q Consensus 37 ~~c~~c~~ 44 (152)
-.||.||-
T Consensus 15 ~~Cp~CG~ 22 (26)
T PF10571_consen 15 KFCPHCGY 22 (26)
T ss_pred CcCCCCCC
Confidence 34555553
No 187
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=46.33 E-value=29 Score=20.53 Aligned_cols=25 Identities=20% Similarity=0.303 Sum_probs=15.1
Q ss_pred ccccc-hhHhHhHHhhhhhcccccccCCCCccc
Q psy838 15 GTRYG-ASLRKMVKKMEITQHAKYTCSFCGKEY 46 (152)
Q Consensus 15 g~ryg-~~~rk~~~~~~~~~~~~~~c~~c~~~~ 46 (152)
|-.|. ..+.+.+++ .-+||.|++.+
T Consensus 20 G~v~~~~~i~~~~~~-------~~~cP~~~~~~ 45 (63)
T smart00504 20 GQTYERRAIEKWLLS-------HGTDPVTGQPL 45 (63)
T ss_pred CCEEeHHHHHHHHHH-------CCCCCCCcCCC
Confidence 45565 334444433 45899998765
No 188
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=46.25 E-value=24 Score=25.18 Aligned_cols=54 Identities=15% Similarity=0.280 Sum_probs=29.8
Q ss_pred cccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEE
Q psy838 35 AKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLV 97 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v 97 (152)
+...||.|... .+..++.+-.-++-.. .++.-..|.+++++.+-|.+....+++
T Consensus 39 ~~LRC~vCqnq-----siadS~a~iA~dmR~~----Vr~~i~~G~Sd~eI~~~~v~RYG~~Vl 92 (126)
T TIGR03147 39 KSLRCPQCQNQ-----NLVESNSPIAYDLRHE----VYSMVNEGKSNQQIIDFMTARFGDFVL 92 (126)
T ss_pred HhCCCCCCCCC-----ChhhcCCHHHHHHHHH----HHHHHHcCCCHHHHHHHHHHhcCCeEE
Confidence 35789999755 3333332222233221 222224578999998888776544443
No 189
>PF03884 DUF329: Domain of unknown function (DUF329); InterPro: IPR005584 The biological function of these short proteins is unknown, but they contain four conserved cysteines, suggesting that they all bind zinc. YacG (Q5X8H6 from SWISSPROT) from Escherichia coli has been shown to bind zinc and contains the structural motifs typical of zinc-binding proteins []. The conserved four cysteine motif in these proteins (-C-X(2)-C-X(15)-C-X(3)-C-) is not found in other zinc-binding proteins with known structures.; GO: 0008270 zinc ion binding; PDB: 1LV3_A.
Probab=46.18 E-value=8 Score=23.71 Aligned_cols=13 Identities=31% Similarity=0.754 Sum_probs=6.7
Q ss_pred ccccCCCCccccc
Q psy838 36 KYTCSFCGKEYKK 48 (152)
Q Consensus 36 ~~~c~~c~~~~~~ 48 (152)
+..||.|||...+
T Consensus 2 ~v~CP~C~k~~~~ 14 (57)
T PF03884_consen 2 TVKCPICGKPVEW 14 (57)
T ss_dssp EEE-TTT--EEE-
T ss_pred cccCCCCCCeecc
Confidence 4689999998544
No 190
>PHA00626 hypothetical protein
Probab=45.71 E-value=9.9 Score=23.29 Aligned_cols=15 Identities=27% Similarity=0.561 Sum_probs=11.6
Q ss_pred hcccccccCCCCccc
Q psy838 32 TQHAKYTCSFCGKEY 46 (152)
Q Consensus 32 ~~~~~~~c~~c~~~~ 46 (152)
.+...|.||.||-.+
T Consensus 19 ~~snrYkCkdCGY~f 33 (59)
T PHA00626 19 GWSDDYVCCDCGYND 33 (59)
T ss_pred ccCcceEcCCCCCee
Confidence 346789999999653
No 191
>PF06559 DCD: 2'-deoxycytidine 5'-triphosphate deaminase (DCD); InterPro: IPR010550 This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (3.5.4.13 from EC).; GO: 0008829 dCTP deaminase activity; PDB: 2R9Q_C.
Probab=45.67 E-value=18 Score=30.16 Aligned_cols=34 Identities=18% Similarity=0.121 Sum_probs=13.4
Q ss_pred CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCH
Q psy838 103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGL 139 (152)
Q Consensus 103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGi 139 (152)
+|++||-+.|....+..+ .+|.-.+...|||||+
T Consensus 323 hGQ~vgrLvyE~m~~~P~---~lYG~~~gSnYq~QgL 356 (364)
T PF06559_consen 323 HGQIVGRLVYERMAERPE---RLYGAGIGSNYQGQGL 356 (364)
T ss_dssp TT-EEEEEEEEEBSS-------TTSS-----------
T ss_pred CCcEEEEEEehhhccCcc---ccccccccccchhhhh
Confidence 399999999987644321 1455668999999997
No 192
>PF02891 zf-MIZ: MIZ/SP-RING zinc finger; InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=45.06 E-value=9 Score=22.57 Aligned_cols=11 Identities=36% Similarity=1.065 Sum_probs=3.9
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
..+.||.|+++
T Consensus 40 ~~W~CPiC~~~ 50 (50)
T PF02891_consen 40 PKWKCPICNKP 50 (50)
T ss_dssp ---B-TTT---
T ss_pred CCeECcCCcCc
Confidence 45899999874
No 193
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=44.88 E-value=10 Score=27.88 Aligned_cols=7 Identities=57% Similarity=1.777 Sum_probs=3.6
Q ss_pred ccCCCCc
Q psy838 38 TCSFCGK 44 (152)
Q Consensus 38 ~c~~c~~ 44 (152)
.||+||.
T Consensus 2 ~cp~c~~ 8 (154)
T PRK00464 2 RCPFCGH 8 (154)
T ss_pred cCCCCCC
Confidence 3555554
No 194
>PF13530 SCP2_2: Sterol carrier protein domain; PDB: 3SXN_C 3N7Z_A 3RYO_B 3R1K_A 3UY5_A 2HV2_F 2I00_D 2OZG_A.
Probab=44.62 E-value=1.2e+02 Score=22.96 Aligned_cols=52 Identities=19% Similarity=0.266 Sum_probs=31.5
Q ss_pred CceEEEEEeCCCCCeEEEEEEEeecc-CCcceEEEeee-eeccccccCCHHHHHHHHHHh
Q psy838 92 RACYLVAKNGSSSTPVAFSHFRFDVD-FGEPVLYCYEL-QLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 92 ~~~~~v~~~~~~~~ivGf~~~~~~~~-~~~~~~~I~~i-~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
+..++++.++ +|++.||+.|.+... .....+.|.++ +.+|+.+ +.|++.+-.
T Consensus 23 ~~~~~~~~~~-~g~~~GY~~y~~~~~~~~~~~l~V~El~~~~~~A~-----~aLl~fl~~ 76 (218)
T PF13530_consen 23 DRGYAVYYDE-DGEPDGYVIYRFKDDWEPGGTLEVRELVALDPEAY-----RALLAFLAS 76 (218)
T ss_dssp GSEEEEEEEC-TSEEEEEEEEEEET-SSSTTEEEEEEEEESSHHHH-----HHHHHHHHT
T ss_pred CceEEEEECC-CCCeeEEEEEEEcccCCCCceEEEEEEEeCCHHHH-----HHHHHHHHh
Confidence 3444555554 599999999998642 22346778885 5666643 345554443
No 195
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=43.87 E-value=7.9 Score=22.32 Aligned_cols=17 Identities=29% Similarity=0.526 Sum_probs=12.0
Q ss_pred hhhhcccccccCCCCcc
Q psy838 29 MEITQHAKYTCSFCGKE 45 (152)
Q Consensus 29 ~~~~~~~~~~c~~c~~~ 45 (152)
+++.....-.||.||..
T Consensus 12 ~~~~~~~~irC~~CG~r 28 (44)
T smart00659 12 NEIKSKDVVRCRECGYR 28 (44)
T ss_pred eecCCCCceECCCCCce
Confidence 44456677788888876
No 196
>COG3024 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.72 E-value=14 Score=23.14 Aligned_cols=16 Identities=25% Similarity=0.449 Sum_probs=11.6
Q ss_pred cccccccCCCCccccc
Q psy838 33 QHAKYTCSFCGKEYKK 48 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~~ 48 (152)
....-.||.|||+..+
T Consensus 4 ~~~~v~CP~Cgkpv~w 19 (65)
T COG3024 4 LRITVPCPTCGKPVVW 19 (65)
T ss_pred ccccccCCCCCCcccc
Confidence 3456689999998444
No 197
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=42.33 E-value=4.7 Score=31.35 Aligned_cols=15 Identities=40% Similarity=1.083 Sum_probs=9.8
Q ss_pred cccccccCCCCcccc
Q psy838 33 QHAKYTCSFCGKEYK 47 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~ 47 (152)
...||.||.|+|.++
T Consensus 74 ~~~K~~C~lc~KlFk 88 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFK 88 (214)
T ss_dssp SSEEEEE-SSS-EES
T ss_pred cCCEECCCCCCcccC
Confidence 345899999999843
No 198
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=41.80 E-value=12 Score=21.54 Aligned_cols=15 Identities=20% Similarity=0.696 Sum_probs=11.2
Q ss_pred ccccCCCCcccccCC
Q psy838 36 KYTCSFCGKEYKKST 50 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~ 50 (152)
.=.||.||+++.+-.
T Consensus 8 ~K~C~~C~rpf~WRK 22 (42)
T PF10013_consen 8 SKICPVCGRPFTWRK 22 (42)
T ss_pred CCcCcccCCcchHHH
Confidence 347999999876544
No 199
>PF09855 DUF2082: Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082); InterPro: IPR018652 This family of proteins contains various hypothetical prokaryotic proteins as well as some Zn-ribbon nucleic-acid-binding proteins.
Probab=41.54 E-value=12 Score=23.44 Aligned_cols=9 Identities=56% Similarity=1.442 Sum_probs=7.0
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
|.||.||..
T Consensus 1 y~C~KCg~~ 9 (64)
T PF09855_consen 1 YKCPKCGNE 9 (64)
T ss_pred CCCCCCCCc
Confidence 689999853
No 200
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=41.16 E-value=12 Score=23.33 Aligned_cols=15 Identities=20% Similarity=0.928 Sum_probs=11.0
Q ss_pred cccccccCCCCcccc
Q psy838 33 QHAKYTCSFCGKEYK 47 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~ 47 (152)
....-.||.||+.++
T Consensus 45 ~~gev~CPYC~t~y~ 59 (62)
T COG4391 45 DEGEVVCPYCSTRYR 59 (62)
T ss_pred CCCcEecCccccEEE
Confidence 344568999998754
No 201
>PRK00114 hslO Hsp33-like chaperonin; Reviewed
Probab=41.11 E-value=19 Score=29.10 Aligned_cols=26 Identities=27% Similarity=0.590 Sum_probs=17.7
Q ss_pred cccccccCCCCcccccCCCCCHHHHHHHHH
Q psy838 33 QHAKYTCSFCGKEYKKSTDLGAETKKWVWE 62 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~ 62 (152)
....-+|.||++.|+ .+++|++.+++
T Consensus 265 ~~iev~C~FC~~~Y~----f~~~dl~~l~~ 290 (293)
T PRK00114 265 GGAEMVCQFCGNKYL----FDEEDLEELIA 290 (293)
T ss_pred CCEEEEEeCCCCEEE----eCHHHHHHHHh
Confidence 345778999988753 56667766653
No 202
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=41.00 E-value=13 Score=18.74 Aligned_cols=10 Identities=40% Similarity=0.899 Sum_probs=7.4
Q ss_pred cccCCCCccc
Q psy838 37 YTCSFCGKEY 46 (152)
Q Consensus 37 ~~c~~c~~~~ 46 (152)
..||.||+..
T Consensus 3 ~~Cp~Cg~~~ 12 (26)
T PF13248_consen 3 MFCPNCGAEI 12 (26)
T ss_pred CCCcccCCcC
Confidence 4799998753
No 203
>PF10588 NADH-G_4Fe-4S_3: NADH-ubiquinone oxidoreductase-G iron-sulfur binding region; InterPro: IPR019574 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry describes the G subunit (one of 14 subunits, A to N) of the NADH-quinone oxidoreductase complex I which generally couples NADH and ubiquinone oxidation/reduction in bacteria and mammalian mitochondria while translocating protons, but may act on NADPH and/or plastoquinone in cyanobacteria and plant chloroplasts. This family does not contain related subunits from formate dehydrogenase complexes. This entry represents the iron-sulphur binding domain of the G subunit.; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3M9S_C 2FUG_L 3IAS_L 2YBB_3 3IAM_3 3I9V_3.
Probab=40.82 E-value=21 Score=20.04 Aligned_cols=22 Identities=27% Similarity=0.670 Sum_probs=11.3
Q ss_pred hHhHHhhhhhcccccccCCCCcc
Q psy838 23 RKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 23 rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
||.+-+.-.+.| +..|+.|.+.
T Consensus 1 Rr~~lelll~~H-~~dC~~C~~~ 22 (41)
T PF10588_consen 1 RRTVLELLLANH-PLDCPTCDKN 22 (41)
T ss_dssp -HHHHHHHHTT-----TTT-TTG
T ss_pred CHHHHHHHHhCC-CCcCcCCCCC
Confidence 556666666666 7889999865
No 204
>PF14255 Cys_rich_CPXG: Cysteine-rich CPXCG
Probab=40.51 E-value=6.7 Score=23.55 Aligned_cols=9 Identities=33% Similarity=1.298 Sum_probs=7.1
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
..||.||..
T Consensus 1 i~CPyCge~ 9 (52)
T PF14255_consen 1 IQCPYCGEP 9 (52)
T ss_pred CCCCCCCCe
Confidence 368999877
No 205
>KOG2907|consensus
Probab=40.49 E-value=8.9 Score=26.81 Aligned_cols=11 Identities=45% Similarity=0.945 Sum_probs=9.4
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
-++.||.||..
T Consensus 73 I~~kCpkCghe 83 (116)
T KOG2907|consen 73 IKHKCPKCGHE 83 (116)
T ss_pred hhccCcccCCc
Confidence 37999999977
No 206
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=40.44 E-value=13 Score=17.92 Aligned_cols=7 Identities=43% Similarity=1.502 Sum_probs=4.9
Q ss_pred cccCCCC
Q psy838 37 YTCSFCG 43 (152)
Q Consensus 37 ~~c~~c~ 43 (152)
|.|+.|.
T Consensus 1 y~C~~C~ 7 (24)
T PF13909_consen 1 YKCPHCS 7 (24)
T ss_dssp EE-SSSS
T ss_pred CCCCCCC
Confidence 6799997
No 207
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=40.33 E-value=21 Score=26.76 Aligned_cols=13 Identities=15% Similarity=0.462 Sum_probs=8.5
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
...-|.||.|++.
T Consensus 114 ~~~~Y~Cp~C~~r 126 (178)
T PRK06266 114 NNMFFFCPNCHIR 126 (178)
T ss_pred CCCEEECCCCCcE
Confidence 3456777777765
No 208
>PF13945 NST1: Salt tolerance down-regulator
Probab=40.25 E-value=26 Score=26.71 Aligned_cols=34 Identities=18% Similarity=0.419 Sum_probs=21.4
Q ss_pred hcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHH
Q psy838 32 TQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMK 69 (152)
Q Consensus 32 ~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~ 69 (152)
.|+-.-.|++||+. ..+=.++++.|++...+.+.
T Consensus 137 qq~h~C~C~vCgr~----~~~ie~ele~ly~~~y~~l~ 170 (190)
T PF13945_consen 137 QQKHSCSCSVCGRK----RTAIEEELERLYDAYYEELE 170 (190)
T ss_pred HhccCcccHHHhch----hhHHHHHHHHHHHHHHHHHH
Confidence 45445668999976 23445668888776555443
No 209
>KOG2535|consensus
Probab=40.13 E-value=16 Score=30.83 Aligned_cols=21 Identities=19% Similarity=0.407 Sum_probs=18.5
Q ss_pred cccccCCHHHHHHHHHHhhcC
Q psy838 132 KQVQRKGLGKFMMQAFYTYID 152 (152)
Q Consensus 132 p~~qgkGiG~~Lm~~l~~~a~ 152 (152)
..||.||.|..||+..+.+|+
T Consensus 497 ~KfQHQG~GtLLmeEAERIAr 517 (554)
T KOG2535|consen 497 TKFQHQGFGTLLMEEAERIAR 517 (554)
T ss_pred hhhhhcchhhHHHHHHHHHHH
Confidence 369999999999999998873
No 210
>PF08772 NOB1_Zn_bind: Nin one binding (NOB1) Zn-ribbon like; InterPro: IPR014881 This entry corresponds to a zinc ribbon and is found on the RNA binding protein NOB1. ; PDB: 2CON_A.
Probab=39.80 E-value=13 Score=23.98 Aligned_cols=13 Identities=31% Similarity=0.692 Sum_probs=5.9
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
.-.+..||.||-.
T Consensus 21 ~~~k~FCp~CGn~ 33 (73)
T PF08772_consen 21 DMTKQFCPKCGNA 33 (73)
T ss_dssp -SS--S-SSS--S
T ss_pred CCCceeCcccCCC
Confidence 3468899999988
No 211
>PF10122 Mu-like_Com: Mu-like prophage protein Com; InterPro: IPR019294 Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ].
Probab=39.40 E-value=18 Score=21.66 Aligned_cols=22 Identities=18% Similarity=0.317 Sum_probs=14.8
Q ss_pred HhHHhhhhhcccccccCCCCcc
Q psy838 24 KMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 24 k~~~~~~~~~~~~~~c~~c~~~ 45 (152)
|.+.+.....+-.-.||.||+.
T Consensus 12 klLa~~g~~~~leIKCpRC~ti 33 (51)
T PF10122_consen 12 KLLAKAGEVIELEIKCPRCKTI 33 (51)
T ss_pred HHHhhhcCccEEEEECCCCCcc
Confidence 4444544455566789999988
No 212
>KOG4317|consensus
Probab=38.80 E-value=14 Score=30.52 Aligned_cols=14 Identities=36% Similarity=0.712 Sum_probs=11.9
Q ss_pred hcccccccCCCCcc
Q psy838 32 TQHAKYTCSFCGKE 45 (152)
Q Consensus 32 ~~~~~~~c~~c~~~ 45 (152)
.|.+.|+||.|.-.
T Consensus 15 vq~~~YtCPRCn~~ 28 (383)
T KOG4317|consen 15 VQKREYTCPRCNLL 28 (383)
T ss_pred cccccccCCCCCcc
Confidence 46677999999988
No 213
>KOG3965|consensus
Probab=38.78 E-value=12 Score=27.58 Aligned_cols=26 Identities=27% Similarity=0.509 Sum_probs=22.5
Q ss_pred eeeccccccccchhHhHhHHhhhhhc
Q psy838 8 VGITGKYGTRYGASLRKMVKKMEITQ 33 (152)
Q Consensus 8 ~~~~~~~g~ryg~~~rk~~~~~~~~~ 33 (152)
-+|.+.|+.|||..+-.+.+.+|...
T Consensus 117 d~ISP~y~E~Fg~~l~q~L~~lek~~ 142 (160)
T KOG3965|consen 117 DDISPLYRERFGNKLEQRLKRLEKQR 142 (160)
T ss_pred hhcChHHHHHHHHHHHHHHHHHHHHh
Confidence 46899999999999999999988653
No 214
>PF11432 DUF3197: Protein of unknown function (DUF3197); InterPro: IPR024443 This domain, whose function is unknown, is found in bacterial proteins.; PDB: 1WN9_A 1WNA_A.
Probab=38.52 E-value=19 Score=24.97 Aligned_cols=49 Identities=12% Similarity=0.231 Sum_probs=27.5
Q ss_pred eccccccccchhHhHhHHhhh---hhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhH
Q psy838 10 ITGKYGTRYGASLRKMVKKME---ITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNM 68 (152)
Q Consensus 10 ~~~~~g~ryg~~~rk~~~~~~---~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m 68 (152)
+..-||||||..=....+++- ..+-.++. -..++++++..++++..+..
T Consensus 46 t~dAFGPafG~~G~~ALaELv~wl~~~G~~f~----------EaVl~p~e~~~ll~~p~~~~ 97 (113)
T PF11432_consen 46 TPDAFGPAFGPEGERALAELVRWLQERGARFY----------EAVLSPSEFAALLELPPEEE 97 (113)
T ss_dssp EEEEESTTS-TTHHHHHHHHHHHHHHTT-EEE----------EEEE-GGGHHHHHTS-HHHH
T ss_pred cccccCcccCccHHHHHHHHHHHHHHcCCchh----------heecCHHHHHHHHcCCcHHH
Confidence 456799999854444444332 22222222 45678999999998877653
No 215
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=38.38 E-value=12 Score=20.54 Aligned_cols=12 Identities=33% Similarity=0.849 Sum_probs=9.0
Q ss_pred cccCCCCccccc
Q psy838 37 YTCSFCGKEYKK 48 (152)
Q Consensus 37 ~~c~~c~~~~~~ 48 (152)
..||.||+.|+.
T Consensus 2 r~C~~Cg~~Yh~ 13 (36)
T PF05191_consen 2 RICPKCGRIYHI 13 (36)
T ss_dssp EEETTTTEEEET
T ss_pred cCcCCCCCcccc
Confidence 479999988553
No 216
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=37.90 E-value=16 Score=21.83 Aligned_cols=12 Identities=25% Similarity=0.686 Sum_probs=9.2
Q ss_pred cccccccCCCCc
Q psy838 33 QHAKYTCSFCGK 44 (152)
Q Consensus 33 ~~~~~~c~~c~~ 44 (152)
-+.+-.||.||-
T Consensus 16 ~rk~~~CPrCG~ 27 (51)
T COG1998 16 KRKNRFCPRCGP 27 (51)
T ss_pred EEccccCCCCCC
Confidence 456778999994
No 217
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=37.64 E-value=17 Score=21.53 Aligned_cols=13 Identities=23% Similarity=0.782 Sum_probs=9.8
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
.-..+.||.||..
T Consensus 31 Lp~~w~CP~C~a~ 43 (50)
T cd00730 31 LPDDWVCPVCGAG 43 (50)
T ss_pred CCCCCCCCCCCCc
Confidence 3457899999865
No 218
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=37.22 E-value=15 Score=19.96 Aligned_cols=10 Identities=40% Similarity=1.152 Sum_probs=8.1
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.|.|+.||+.
T Consensus 5 ~y~C~~Cg~~ 14 (41)
T smart00834 5 EYRCEDCGHT 14 (41)
T ss_pred EEEcCCCCCE
Confidence 4789999886
No 219
>cd00246 RabGEF Nucleotide exchange factor for Rab-like small GTPases (RabGEF), Mss4 type; RabGEF positely regulates the function of Rab GTPase by promoting exchange of GDP for GTP; members of the Rab subfamily of Ras GTPases are important in vesicular transport;
Probab=37.19 E-value=18 Score=24.90 Aligned_cols=13 Identities=23% Similarity=0.544 Sum_probs=10.4
Q ss_pred ccccCCCCccccc
Q psy838 36 KYTCSFCGKEYKK 48 (152)
Q Consensus 36 ~~~c~~c~~~~~~ 48 (152)
+..|++|+..+++
T Consensus 2 ~v~C~~C~S~VL~ 14 (103)
T cd00246 2 AVLCQRCGSRVLT 14 (103)
T ss_pred ceECCCCCCEEEe
Confidence 4679999999663
No 220
>PRK02983 lysS lysyl-tRNA synthetase; Provisional
Probab=37.15 E-value=4e+02 Score=25.89 Aligned_cols=114 Identities=18% Similarity=0.173 Sum_probs=60.5
Q ss_pred hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCC--hhhhhhhhcCCCceEEE
Q psy838 20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWD--PQGKHSEMFDDRACYLV 97 (152)
Q Consensus 20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~--~~~~~~el~~~~~~~~v 97 (152)
..+|..+++.+.. -|+.. +.+..+++++.++.+-++.++=....- .-+.+ .-.+ .+..+....++.
T Consensus 357 ~~lR~a~nra~r~---G~t~~-----i~~~~~~~~~~~~~L~~isd~Wl~~~~---EkGFSm~LGr~-~~~~~~~~~i~~ 424 (1094)
T PRK02983 357 RPVRQAVTRVRRA---GYTVR-----IRRHRDLPAEEMAQVIARADAWRDTET---ERGFSMALGRL-GDPADGDCLLVE 424 (1094)
T ss_pred HHHHHHHHHHHhC---CCEEE-----EeeCCCCCHHHHHHHHHHHHHHhcCCC---CCceeeecCcc-cchhcCceEEEE
Confidence 6788888877765 33311 334556777777777776654211100 01111 0000 111222333233
Q ss_pred EEeCCCCCeEEEEEEEeeccCCcceEEEeeee-eccccccCCHHHHHHHHHHhhc
Q psy838 98 AKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQ-LEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 98 ~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~-V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
+.++ +|+++||+.+.+...++ + -.|+. -+|+. =.|+=-.|+..++..+
T Consensus 425 a~d~-~G~i~af~s~~p~~~~g---~-slDLMRr~pda-pnGvmE~L~~~l~~~~ 473 (1094)
T PRK02983 425 AHDA-DGQVVALLSFVPWGRRG---L-SLDLMRRSPDA-PNGVIELMVAELALEA 473 (1094)
T ss_pred EECC-CCeEEEEEEEeeeCCCC---E-EEEecccCCCC-CCCHHHHHHHHHHHHH
Confidence 3333 58999999999854332 2 33444 34554 6888888888777654
No 221
>PF10892 DUF2688: Protein of unknown function (DUF2688); InterPro: IPR024392 Members of this protein family are annotated as KleB, and may play a role in the regulation of transcription in plasmids.
Probab=36.22 E-value=5.7 Score=24.27 Aligned_cols=30 Identities=30% Similarity=0.569 Sum_probs=18.2
Q ss_pred HhhhhhcccccccCCCCcccccC--CCCCHHHHHH
Q psy838 27 KKMEITQHAKYTCSFCGKEYKKS--TDLGAETKKW 59 (152)
Q Consensus 27 ~~~~~~~~~~~~c~~c~~~~~~~--~~l~~~d~~~ 59 (152)
++||+. .-.|..|||.+++. +.+..+|+..
T Consensus 4 ~kieIV---~t~CrRCGk~i~tl~~SL~Gad~lr~ 35 (60)
T PF10892_consen 4 GKIEIV---ETPCRRCGKSIRTLSRSLIGADDLRV 35 (60)
T ss_pred CceEee---eehhhhhCccHHHHHHHhhChHHHHH
Confidence 355555 56799999995544 3444555444
No 222
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=36.11 E-value=20 Score=20.16 Aligned_cols=16 Identities=31% Similarity=0.673 Sum_probs=11.9
Q ss_pred ccccCCCCcccccCCC
Q psy838 36 KYTCSFCGKEYKKSTD 51 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~~ 51 (152)
.-+||.||-.|....+
T Consensus 13 ~~~C~~CgM~Y~~~~~ 28 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSP 28 (41)
T ss_pred CcCCCCCCCEECCCCH
Confidence 5689999987766543
No 223
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=35.79 E-value=16 Score=26.74 Aligned_cols=12 Identities=8% Similarity=0.177 Sum_probs=6.2
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
..-|.||.|++.
T Consensus 107 ~~~Y~Cp~c~~r 118 (158)
T TIGR00373 107 NMFFICPNMCVR 118 (158)
T ss_pred CCeEECCCCCcE
Confidence 344556666543
No 224
>cd03173 DUF619-like DUF619 domain of various N-acetylglutamate Kinases and N-acetylglutamate Synthases. DUF619-like: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. This subgroup also includes the DUF619 domain of the FABP N-acetylglutamate kinase (NAGK), the enzyme that catalyzes the second reaction of arginine
Probab=35.76 E-value=1.1e+02 Score=20.77 Aligned_cols=30 Identities=3% Similarity=0.129 Sum_probs=26.5
Q ss_pred cceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838 120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYT 149 (152)
Q Consensus 120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~ 149 (152)
..+-||..+.|.+.-++.|++..+.+.+.+
T Consensus 31 ~~v~~LdkFav~~~~~~~gv~D~vf~~i~~ 60 (98)
T cd03173 31 NSIPYLDKFAVSDHLWLNNVTDNIFNLIRK 60 (98)
T ss_pred CCCEEEEEEEEcccccccCHHHHHHHHHHh
Confidence 346689999999999999999999998874
No 225
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=35.76 E-value=17 Score=19.64 Aligned_cols=10 Identities=20% Similarity=0.717 Sum_probs=7.5
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
+..||.||+.
T Consensus 2 ~~~CP~C~~~ 11 (38)
T TIGR02098 2 RIQCPNCKTS 11 (38)
T ss_pred EEECCCCCCE
Confidence 3578888876
No 226
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=35.61 E-value=18 Score=21.83 Aligned_cols=10 Identities=40% Similarity=1.172 Sum_probs=7.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.++||.||..
T Consensus 2 ~~~CP~CG~~ 11 (54)
T TIGR01206 2 QFECPDCGAE 11 (54)
T ss_pred ccCCCCCCCE
Confidence 4578999865
No 227
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=35.48 E-value=16 Score=17.58 Aligned_cols=12 Identities=25% Similarity=1.021 Sum_probs=8.7
Q ss_pred cccCCCCccccc
Q psy838 37 YTCSFCGKEYKK 48 (152)
Q Consensus 37 ~~c~~c~~~~~~ 48 (152)
|.|+.|++.+.+
T Consensus 1 ~~C~~C~~~f~s 12 (25)
T PF12874_consen 1 FYCDICNKSFSS 12 (25)
T ss_dssp EEETTTTEEESS
T ss_pred CCCCCCCCCcCC
Confidence 578999877544
No 228
>COG4311 SoxD Sarcosine oxidase delta subunit [Amino acid transport and metabolism]
Probab=35.29 E-value=24 Score=23.90 Aligned_cols=9 Identities=33% Similarity=1.335 Sum_probs=7.3
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-.||+||-.
T Consensus 4 I~CP~Cg~R 12 (97)
T COG4311 4 IPCPYCGER 12 (97)
T ss_pred ecCCCCCCC
Confidence 479999975
No 229
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=35.16 E-value=19 Score=20.64 Aligned_cols=9 Identities=33% Similarity=1.294 Sum_probs=7.7
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
+.||.||..
T Consensus 19 ~~CP~Cg~~ 27 (46)
T PF12760_consen 19 FVCPHCGST 27 (46)
T ss_pred CCCCCCCCe
Confidence 679999975
No 230
>PF08792 A2L_zn_ribbon: A2L zinc ribbon domain; InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors [].
Probab=34.91 E-value=18 Score=19.53 Aligned_cols=14 Identities=29% Similarity=0.767 Sum_probs=10.8
Q ss_pred hcccccccCCCCcc
Q psy838 32 TQHAKYTCSFCGKE 45 (152)
Q Consensus 32 ~~~~~~~c~~c~~~ 45 (152)
+...-+.|++||..
T Consensus 17 ~~~~~~~C~~Cg~~ 30 (33)
T PF08792_consen 17 KEDDYEVCIFCGSS 30 (33)
T ss_pred ecCCeEEcccCCcE
Confidence 55677899999864
No 231
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=34.77 E-value=25 Score=20.88 Aligned_cols=12 Identities=25% Similarity=0.609 Sum_probs=9.8
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
..-.+||.||+.
T Consensus 44 ~~i~~Cp~CgRi 55 (56)
T PF02591_consen 44 DEIVFCPNCGRI 55 (56)
T ss_pred CCeEECcCCCcc
Confidence 567899999974
No 232
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=34.73 E-value=18 Score=25.00 Aligned_cols=11 Identities=27% Similarity=0.664 Sum_probs=8.1
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
..+.||.||..
T Consensus 85 ~~~~CP~Cgs~ 95 (113)
T PRK12380 85 HDAQCPHCHGE 95 (113)
T ss_pred cCccCcCCCCC
Confidence 35569999975
No 233
>PF14206 Cys_rich_CPCC: Cysteine-rich CPCC
Probab=34.73 E-value=18 Score=23.62 Aligned_cols=10 Identities=50% Similarity=1.182 Sum_probs=8.0
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
||.||-||..
T Consensus 1 K~~CPCCg~~ 10 (78)
T PF14206_consen 1 KYPCPCCGYY 10 (78)
T ss_pred CccCCCCCcE
Confidence 6789999865
No 234
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=34.12 E-value=18 Score=27.04 Aligned_cols=30 Identities=27% Similarity=0.370 Sum_probs=18.7
Q ss_pred hcccccccCCCCcccccCCCCCHHHHHHHHHHHHH
Q psy838 32 TQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 32 ~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~ 66 (152)
..+.-+.||.||+.+=. -+...++.+...+
T Consensus 126 ~~~~f~~C~~CgkiYW~-----GsHw~~m~~~~~~ 155 (165)
T COG1656 126 NYEEFYRCPKCGKIYWK-----GSHWRRMVERIVE 155 (165)
T ss_pred cccceeECCCCcccccC-----chHHHHHHHHHHH
Confidence 34455679999987533 3456666664444
No 235
>COG5595 Zn-ribbon-containing, possibly nucleic-acid-binding protein [General function prediction only]
Probab=33.56 E-value=28 Score=26.96 Aligned_cols=29 Identities=31% Similarity=0.505 Sum_probs=19.0
Q ss_pred ccchhHhHhHHhhhh---------------hcccccccCCCCcc
Q psy838 17 RYGASLRKMVKKMEI---------------TQHAKYTCSFCGKE 45 (152)
Q Consensus 17 ryg~~~rk~~~~~~~---------------~~~~~~~c~~c~~~ 45 (152)
+-|+.++|++.+.-- .+..-..||.||++
T Consensus 184 k~Gr~l~~rIek~t~iptyYylyrVgGnSlAqek~r~CPsC~k~ 227 (256)
T COG5595 184 KHGRYLSKRIEKCTGIPTYYYLYRVGGNSLAQEKYRCCPSCGKD 227 (256)
T ss_pred HHHHHHHHHHHHHhCCCceEEEEEecCchhhhhccCCCCccccc
Confidence 446777777766532 23444569999987
No 236
>COG1693 Repressor of nif and glnA expression [Transcription]
Probab=33.46 E-value=49 Score=27.09 Aligned_cols=60 Identities=23% Similarity=0.245 Sum_probs=43.5
Q ss_pred cceeeeccccccccchhHhHh----------HHhhhhhcccccccCCC-----CcccccCCCCCHHHHHHHHHHHHHh
Q psy838 5 TKKVGITGKYGTRYGASLRKM----------VKKMEITQHAKYTCSFC-----GKEYKKSTDLGAETKKWVWELEEKN 67 (152)
Q Consensus 5 ~~~~~~~~~~g~ryg~~~rk~----------~~~~~~~~~~~~~c~~c-----~~~~~~~~~l~~~d~~~i~~l~~~~ 67 (152)
|+|||.+||==|--|.+--|+ .+.||.. .|.|.|- |+...-.+.++++|++.++++.+++
T Consensus 56 t~kvgyagr~iTe~Gl~el~~~~~~~R~g~~~SrIeem---~y~t~fdp~t~~G~Vv~N~s~i~k~dfedvl~iv~~~ 130 (325)
T COG1693 56 TRKVGYAGREITEKGLKELKRALISERLGFISSRIEEM---AYRTNFDPETLKGKVVVNISVIDKSDFEDVLDIVKEV 130 (325)
T ss_pred hhhccccceeehHhhHHHHhhhhhHHHHHHHHHHHHHH---HHhccCCchhcCceEEEEEEEecHHHHHHHHHHHHHH
Confidence 678888887666666443333 3455555 6777776 6667777889999999999998864
No 237
>PF10071 DUF2310: Zn-ribbon-containing, possibly nucleic-acid-binding protein (DUF2310); InterPro: IPR016908 This group represents uncharacterised conserved proteins.
Probab=33.43 E-value=31 Score=27.65 Aligned_cols=29 Identities=24% Similarity=0.607 Sum_probs=19.3
Q ss_pred cchhHhHhHHhhh---------------hhcccccccCCCCccc
Q psy838 18 YGASLRKMVKKME---------------ITQHAKYTCSFCGKEY 46 (152)
Q Consensus 18 yg~~~rk~~~~~~---------------~~~~~~~~c~~c~~~~ 46 (152)
=|..|++++.++- .....+..||.||++=
T Consensus 187 ~G~~l~~~Ie~~T~iPtYYyLyrv~g~~~~~e~~r~CP~Cg~~W 230 (258)
T PF10071_consen 187 RGRDLCKRIEKLTGIPTYYYLYRVGGESLASEQARKCPSCGGDW 230 (258)
T ss_pred HHHHHHHHHHHHhCCCeEEEEEEccCcchHHhhCCCCCCCCCcc
Confidence 4666666666542 2345678899999883
No 238
>PRK01343 zinc-binding protein; Provisional
Probab=33.10 E-value=21 Score=21.90 Aligned_cols=12 Identities=33% Similarity=0.711 Sum_probs=9.5
Q ss_pred cccccCCCCccc
Q psy838 35 AKYTCSFCGKEY 46 (152)
Q Consensus 35 ~~~~c~~c~~~~ 46 (152)
...+||.|||..
T Consensus 8 p~~~CP~C~k~~ 19 (57)
T PRK01343 8 PTRPCPECGKPS 19 (57)
T ss_pred CCCcCCCCCCcC
Confidence 356799999984
No 239
>KOG2462|consensus
Probab=32.98 E-value=22 Score=28.67 Aligned_cols=24 Identities=38% Similarity=0.924 Sum_probs=18.8
Q ss_pred hcccccccCCCCcccccCCCCCHH
Q psy838 32 TQHAKYTCSFCGKEYKKSTDLGAE 55 (152)
Q Consensus 32 ~~~~~~~c~~c~~~~~~~~~l~~~ 55 (152)
..+-.|.|+.|||.+.+...|+..
T Consensus 126 ~~~~r~~c~eCgk~ysT~snLsrH 149 (279)
T KOG2462|consen 126 AKHPRYKCPECGKSYSTSSNLSRH 149 (279)
T ss_pred ccCCceeccccccccccccccchh
Confidence 356789999999998888765544
No 240
>PF14690 zf-ISL3: zinc-finger of transposase IS204/IS1001/IS1096/IS1165
Probab=32.89 E-value=22 Score=20.02 Aligned_cols=10 Identities=30% Similarity=0.853 Sum_probs=6.9
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
+..||.||..
T Consensus 2 ~~~Cp~Cg~~ 11 (47)
T PF14690_consen 2 PPRCPHCGSP 11 (47)
T ss_pred CccCCCcCCC
Confidence 4578888844
No 241
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=32.85 E-value=20 Score=24.85 Aligned_cols=10 Identities=30% Similarity=0.680 Sum_probs=7.8
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.+.||.||..
T Consensus 86 ~~~CP~Cgs~ 95 (115)
T TIGR00100 86 LYRCPKCHGI 95 (115)
T ss_pred CccCcCCcCC
Confidence 4669999976
No 242
>PF00571 CBS: CBS domain CBS domain web page. Mutations in the CBS domain of Swiss:P35520 lead to homocystinuria.; InterPro: IPR000644 CBS (cystathionine-beta-synthase) domains are small intracellular modules, mostly found in two or four copies within a protein, that occur in a variety of proteins in bacteria, archaea, and eukaryotes [, ]. Tandem pairs of CBS domains can act as binding domains for adenosine derivatives and may regulate the activity of attached enzymatic or other domains []. In some cases, CBS domains may act as sensors of cellular energy status by being activated by AMP and inhibited by ATP []. In chloride ion channels, the CBS domains have been implicated in intracellular targeting and trafficking, as well as in protein-protein interactions, but results vary with different channels: in the CLC-5 channel, the CBS domain was shown to be required for trafficking [], while in the CLC-1 channel, the CBS domain was shown to be critical for channel function, but not necessary for trafficking []. Recent experiments revealing that CBS domains can bind adenosine-containing ligands such ATP, AMP, or S-adenosylmethionine have led to the hypothesis that CBS domains function as sensors of intracellular metabolites [, ]. Crystallographic studies of CBS domains have shown that pairs of CBS sequences form a globular domain where each CBS unit adopts a beta-alpha-beta-beta-alpha pattern []. Crystal structure of the CBS domains of the AMP-activated protein kinase in complexes with AMP and ATP shows that the phosphate groups of AMP/ATP lie in a surface pocket at the interface of two CBS domains, which is lined with basic residues, many of which are associated with disease-causing mutations []. In humans, mutations in conserved residues within CBS domains cause a variety of human hereditary diseases, including (with the gene mutated in parentheses): homocystinuria (cystathionine beta-synthase); Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase); retinitis pigmentosa (IMP dehydrogenase-1); congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members).; GO: 0005515 protein binding; PDB: 3JTF_A 3TE5_C 3TDH_C 3T4N_C 2QLV_C 3OI8_A 3LV9_A 2QH1_B 1PVM_B 3LQN_A ....
Probab=32.81 E-value=39 Score=19.21 Aligned_cols=32 Identities=9% Similarity=-0.049 Sum_probs=18.7
Q ss_pred ChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838 80 DPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHF 112 (152)
Q Consensus 80 ~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~ 112 (152)
+-......|.+.+...+.+.++ +++++|++..
T Consensus 17 ~l~~~~~~~~~~~~~~~~V~d~-~~~~~G~is~ 48 (57)
T PF00571_consen 17 SLEEALEIMRKNGISRLPVVDE-DGKLVGIISR 48 (57)
T ss_dssp BHHHHHHHHHHHTSSEEEEEST-TSBEEEEEEH
T ss_pred cHHHHHHHHHHcCCcEEEEEec-CCEEEEEEEH
Confidence 3333444444444455555544 5999999874
No 243
>PRK00418 DNA gyrase inhibitor; Reviewed
Probab=32.70 E-value=23 Score=22.08 Aligned_cols=13 Identities=31% Similarity=0.751 Sum_probs=10.1
Q ss_pred cccccCCCCcccc
Q psy838 35 AKYTCSFCGKEYK 47 (152)
Q Consensus 35 ~~~~c~~c~~~~~ 47 (152)
....||.|||...
T Consensus 5 ~~v~CP~C~k~~~ 17 (62)
T PRK00418 5 ITVNCPTCGKPVE 17 (62)
T ss_pred ccccCCCCCCccc
Confidence 3578999999844
No 244
>COG3877 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.58 E-value=39 Score=23.50 Aligned_cols=16 Identities=19% Similarity=0.125 Sum_probs=10.0
Q ss_pred cccCCCCCHHHHHHHH
Q psy838 46 YKKSTDLGAETKKWVW 61 (152)
Q Consensus 46 ~~~~~~l~~~d~~~i~ 61 (152)
.....-+++++++.+-
T Consensus 36 ~s~F~~Lt~d~LeFv~ 51 (122)
T COG3877 36 MSKFEYLTSDQLEFVE 51 (122)
T ss_pred cccccccCHhHhHHHH
Confidence 3445667777776654
No 245
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=32.47 E-value=21 Score=26.17 Aligned_cols=7 Identities=29% Similarity=0.819 Sum_probs=3.2
Q ss_pred cCCCCcc
Q psy838 39 CSFCGKE 45 (152)
Q Consensus 39 c~~c~~~ 45 (152)
|+.||+-
T Consensus 31 C~~C~~R 37 (147)
T TIGR00244 31 CLECHER 37 (147)
T ss_pred CCccCCc
Confidence 4444444
No 246
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=32.43 E-value=20 Score=19.39 Aligned_cols=12 Identities=25% Similarity=0.675 Sum_probs=8.5
Q ss_pred ccccCCCCcccc
Q psy838 36 KYTCSFCGKEYK 47 (152)
Q Consensus 36 ~~~c~~c~~~~~ 47 (152)
.+.||.|++.+.
T Consensus 4 ~~~C~nC~R~v~ 15 (33)
T PF08209_consen 4 YVECPNCGRPVA 15 (33)
T ss_dssp EEE-TTTSSEEE
T ss_pred eEECCCCcCCcc
Confidence 578999998743
No 247
>COG4469 CoiA Competence protein CoiA-like family, contains a predicted nuclease domain [General function prediction only]
Probab=32.27 E-value=20 Score=29.69 Aligned_cols=10 Identities=30% Similarity=1.168 Sum_probs=9.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
+|.||.||+.
T Consensus 25 ~ffCPaC~~~ 34 (342)
T COG4469 25 RFFCPACGSQ 34 (342)
T ss_pred ccccCCCCCe
Confidence 6999999998
No 248
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=32.15 E-value=19 Score=24.98 Aligned_cols=10 Identities=20% Similarity=0.577 Sum_probs=7.5
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.+.||.||..
T Consensus 87 ~~~CP~Cgs~ 96 (114)
T PRK03681 87 VRRCPQCHGD 96 (114)
T ss_pred CCcCcCcCCC
Confidence 3569999976
No 249
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=32.06 E-value=23 Score=19.82 Aligned_cols=9 Identities=33% Similarity=0.955 Sum_probs=6.4
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
+.||.||..
T Consensus 1 ~~Cp~C~~~ 9 (40)
T smart00440 1 APCPKCGNR 9 (40)
T ss_pred CcCCCCCCC
Confidence 368888766
No 250
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=31.83 E-value=22 Score=18.47 Aligned_cols=16 Identities=13% Similarity=0.659 Sum_probs=11.6
Q ss_pred cccccCCCCcccccCC
Q psy838 35 AKYTCSFCGKEYKKST 50 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~ 50 (152)
.+|.|+.|++.+....
T Consensus 2 ~~~~C~~C~~~~~~~~ 17 (35)
T smart00451 2 GGFYCKLCNVTFTDEI 17 (35)
T ss_pred cCeEccccCCccCCHH
Confidence 3688999998766433
No 251
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=31.73 E-value=13 Score=21.75 Aligned_cols=16 Identities=38% Similarity=0.941 Sum_probs=10.6
Q ss_pred ccccCCCCcccccCCC
Q psy838 36 KYTCSFCGKEYKKSTD 51 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~~ 51 (152)
+|.|+.||-.+-....
T Consensus 1 ky~C~~CgyvYd~~~G 16 (47)
T PF00301_consen 1 KYQCPVCGYVYDPEKG 16 (47)
T ss_dssp EEEETTTSBEEETTTB
T ss_pred CcCCCCCCEEEcCCcC
Confidence 5788888866555443
No 252
>PHA00732 hypothetical protein
Probab=31.71 E-value=23 Score=22.94 Aligned_cols=14 Identities=36% Similarity=0.963 Sum_probs=10.8
Q ss_pred ccccccCCCCcccc
Q psy838 34 HAKYTCSFCGKEYK 47 (152)
Q Consensus 34 ~~~~~c~~c~~~~~ 47 (152)
|..+.|+.||+.+.
T Consensus 25 H~~~~C~~CgKsF~ 38 (79)
T PHA00732 25 HTLTKCPVCNKSYR 38 (79)
T ss_pred cCCCccCCCCCEeC
Confidence 44678999998864
No 253
>PF06221 zf-C2HC5: Putative zinc finger motif, C2HC5-type; InterPro: IPR009349 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This zinc finger appears to be common in activating signal cointegrator 1/thyroid receptor interacting protein 4. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=31.66 E-value=51 Score=20.14 Aligned_cols=11 Identities=36% Similarity=0.926 Sum_probs=7.6
Q ss_pred ccccCCCCccc
Q psy838 36 KYTCSFCGKEY 46 (152)
Q Consensus 36 ~~~c~~c~~~~ 46 (152)
.-.|+|||.++
T Consensus 35 ~~pC~fCg~~l 45 (57)
T PF06221_consen 35 LGPCPFCGTPL 45 (57)
T ss_pred cCcCCCCCCcc
Confidence 46788888553
No 254
>COG4847 Uncharacterized protein conserved in archaea [Function unknown]
Probab=30.86 E-value=15 Score=24.98 Aligned_cols=17 Identities=18% Similarity=0.540 Sum_probs=12.7
Q ss_pred cccccccCCCCcccccC
Q psy838 33 QHAKYTCSFCGKEYKKS 49 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~~~ 49 (152)
+-+.|.|..||..+++.
T Consensus 3 ~lkewkC~VCg~~iieG 19 (103)
T COG4847 3 GLKEWKCYVCGGTIIEG 19 (103)
T ss_pred ccceeeEeeeCCEeeec
Confidence 45789999999984443
No 255
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=30.80 E-value=19 Score=24.86 Aligned_cols=9 Identities=33% Similarity=1.220 Sum_probs=6.3
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
+.||.||..
T Consensus 87 ~~CP~Cgs~ 95 (113)
T PF01155_consen 87 FSCPRCGSP 95 (113)
T ss_dssp HH-SSSSSS
T ss_pred CCCcCCcCC
Confidence 668888876
No 256
>COG4338 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.54 E-value=15 Score=21.82 Aligned_cols=16 Identities=13% Similarity=0.513 Sum_probs=11.6
Q ss_pred ccccCCCCcccccCCC
Q psy838 36 KYTCSFCGKEYKKSTD 51 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~~ 51 (152)
.-.||.|++++.+-..
T Consensus 12 ~KICpvCqRPFsWRkK 27 (54)
T COG4338 12 DKICPVCQRPFSWRKK 27 (54)
T ss_pred hhhhhhhcCchHHHHH
Confidence 3459999999776543
No 257
>TIGR01374 soxD sarcosine oxidase, delta subunit family, heterotetrameric form. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate. The enzyme is known in monomeric and heterotetrameric (alpha,beta,gamma,delta) form
Probab=30.47 E-value=22 Score=23.49 Aligned_cols=8 Identities=38% Similarity=1.456 Sum_probs=6.7
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||.-
T Consensus 3 ~CP~CG~R 10 (84)
T TIGR01374 3 PCPYCGPR 10 (84)
T ss_pred cCCCCCCc
Confidence 69999955
No 258
>KOG2879|consensus
Probab=30.47 E-value=34 Score=27.76 Aligned_cols=13 Identities=23% Similarity=0.746 Sum_probs=10.2
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
....-+||+||.+
T Consensus 236 ~t~~~~C~~Cg~~ 248 (298)
T KOG2879|consen 236 GTSDTECPVCGEP 248 (298)
T ss_pred ccCCceeeccCCC
Confidence 4456789999987
No 259
>PF14952 zf-tcix: Putative treble-clef, zinc-finger, Zn-binding
Probab=30.44 E-value=24 Score=20.43 Aligned_cols=11 Identities=27% Similarity=0.631 Sum_probs=8.6
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
---.||.||..
T Consensus 10 GirkCp~CGt~ 20 (44)
T PF14952_consen 10 GIRKCPKCGTY 20 (44)
T ss_pred ccccCCcCcCc
Confidence 34579999987
No 260
>PRK00807 50S ribosomal protein L24e; Validated
Probab=29.87 E-value=29 Score=20.64 Aligned_cols=15 Identities=60% Similarity=1.092 Sum_probs=10.6
Q ss_pred cccCCCCcccccCCC
Q psy838 37 YTCSFCGKEYKKSTD 51 (152)
Q Consensus 37 ~~c~~c~~~~~~~~~ 51 (152)
+.|.+||..+.+.+.
T Consensus 2 ~~C~fcG~~I~pg~G 16 (52)
T PRK00807 2 RTCSFCGKEIEPGTG 16 (52)
T ss_pred cccCCCCCeEcCCCC
Confidence 579999988665443
No 261
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=29.76 E-value=21 Score=19.91 Aligned_cols=12 Identities=33% Similarity=1.126 Sum_probs=7.9
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
...++|++|+..
T Consensus 22 ~~~w~C~~C~~~ 33 (40)
T PF04810_consen 22 GKTWICNFCGTK 33 (40)
T ss_dssp TTEEEETTT--E
T ss_pred CCEEECcCCCCc
Confidence 357899999864
No 262
>COG4416 Com Mu-like prophage protein Com [General function prediction only]
Probab=29.37 E-value=38 Score=20.61 Aligned_cols=25 Identities=16% Similarity=0.267 Sum_probs=20.0
Q ss_pred hHhHhHHhhhhhcccccccCCCCcc
Q psy838 21 SLRKMVKKMEITQHAKYTCSFCGKE 45 (152)
Q Consensus 21 ~~rk~~~~~~~~~~~~~~c~~c~~~ 45 (152)
..-|.+.+.+-+-+.+-.||.|...
T Consensus 9 ~CnKlLa~a~~~~yle~KCPrCK~v 33 (60)
T COG4416 9 KCNKLLAEAEGQAYLEKKCPRCKEV 33 (60)
T ss_pred HHhHHHHhcccceeeeecCCcccee
Confidence 4457788888888999999999654
No 263
>PF04135 Nop10p: Nucleolar RNA-binding protein, Nop10p family; InterPro: IPR007264 H/ACA ribonucleoprotein particles (RNPs) are a family of RNA pseudouridine synthases that specify modification sites through guide RNAs. More than 100 mammalian H/ACA RNAs form an equal number of ribonucleoproteins (RNPs) by associating with the same four core proteins: Cbf5, Gar1, Nhp2 and Nop10. The function of these H/ACA RNPs is essential for biogenesis of the ribosome, splicing of precursor mRNAs (pre-mRNAs), maintenance of telomeres and probably for additional cellular processes []. Recent crystal structures of archaeal H/ACA protein complexes show how the same four proteins accommodate >100 distinct but related H/ACA RNAs []. The complex contains a stable core composed of Cbf5 and Nop10, to which Gar1 and Nhp2 subsequently bind, the complex interacts with snoRNAs []. In eukaryotes Nop10 is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA snoRNAs (H/ACA snoRNPs). Nop10 is probably necessary for the stability of these RNPs [].; PDB: 2RFK_B 3LWR_B 2HVY_C 3HAX_C 3MQK_B 3LWO_B 3LWV_B 3HAY_C 3HJY_B 2EY4_E ....
Probab=29.03 E-value=38 Score=20.38 Aligned_cols=20 Identities=15% Similarity=0.454 Sum_probs=12.3
Q ss_pred cccCCCCcccccCC--CCCHHH
Q psy838 37 YTCSFCGKEYKKST--DLGAET 56 (152)
Q Consensus 37 ~~c~~c~~~~~~~~--~l~~~d 56 (152)
-+||.||..-.+++ ..+++|
T Consensus 18 ~~cp~cG~~T~~ahPaRFSPdD 39 (53)
T PF04135_consen 18 DKCPPCGGPTESAHPARFSPDD 39 (53)
T ss_dssp SBBTTTSSBSEESSSSSS-TTT
T ss_pred CccCCCCCCCcCCcCCCCCCCC
Confidence 47999999944443 344444
No 264
>TIGR02666 moaA molybdenum cofactor biosynthesis protein A, bacterial. The model for this family describes molybdenum cofactor biosynthesis protein A, or MoaA, as found in bacteria. It does not include the family of probable functional equivalent proteins from the archaea. MoaA works together with MoaC to synthesize precursor Z from guanine.
Probab=28.91 E-value=46 Score=26.93 Aligned_cols=46 Identities=20% Similarity=0.334 Sum_probs=32.8
Q ss_pred ccchhHhHhHHhhhhhcccccccCCCCccc------ccCCCCCHHHHHHHHHHH
Q psy838 17 RYGASLRKMVKKMEITQHAKYTCSFCGKEY------KKSTDLGAETKKWVWELE 64 (152)
Q Consensus 17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~------~~~~~l~~~d~~~i~~l~ 64 (152)
+||+.++ +-.++++..=++.|.+|-... .....++.+++..+++..
T Consensus 4 ~~gr~~~--~l~i~vT~~CNl~C~yC~~~~~~~~~~~~~~~ls~eei~~~i~~~ 55 (334)
T TIGR02666 4 RFGRRID--YLRISVTDRCNLRCVYCMPEGGGLDFLPKEELLTFEEIERLVRAF 55 (334)
T ss_pred CCCCccC--eEEEEecCccCcCCCCCCCCcCCCCcCCccCCCCHHHHHHHHHHH
Confidence 5777766 455888889999999997643 123567787777766653
No 265
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=28.90 E-value=22 Score=24.72 Aligned_cols=8 Identities=25% Similarity=0.825 Sum_probs=6.6
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||..
T Consensus 90 ~CP~Cgs~ 97 (117)
T PRK00564 90 VCEKCHSK 97 (117)
T ss_pred cCcCCCCC
Confidence 49999976
No 266
>PF04475 DUF555: Protein of unknown function (DUF555); InterPro: IPR007564 This is a family of uncharacterised, hypothetical archaeal proteins.
Probab=28.78 E-value=26 Score=23.95 Aligned_cols=10 Identities=40% Similarity=0.983 Sum_probs=8.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
...||.||..
T Consensus 47 ~~~cP~Cge~ 56 (102)
T PF04475_consen 47 DTICPKCGEE 56 (102)
T ss_pred cccCCCCCCc
Confidence 5679999987
No 267
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=28.63 E-value=22 Score=20.10 Aligned_cols=11 Identities=36% Similarity=1.262 Sum_probs=8.7
Q ss_pred ccccCCCCccc
Q psy838 36 KYTCSFCGKEY 46 (152)
Q Consensus 36 ~~~c~~c~~~~ 46 (152)
.-.||.||..+
T Consensus 29 ~~~CpYCg~~y 39 (40)
T PF10276_consen 29 PVVCPYCGTRY 39 (40)
T ss_dssp EEEETTTTEEE
T ss_pred eEECCCCCCEE
Confidence 47899999754
No 268
>COG2761 FrnE Predicted dithiol-disulfide isomerase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.62 E-value=12 Score=29.26 Aligned_cols=17 Identities=29% Similarity=0.706 Sum_probs=14.8
Q ss_pred HHhhhhhcccccccCCC
Q psy838 26 VKKMEITQHAKYTCSFC 42 (152)
Q Consensus 26 ~~~~~~~~~~~~~c~~c 42 (152)
..+|++...+.+.||+|
T Consensus 3 ~~~i~I~v~sD~vCPwC 19 (225)
T COG2761 3 PMKIEIDVFSDVVCPWC 19 (225)
T ss_pred CceEEEEEEeCCcCchh
Confidence 45788888999999999
No 269
>PRK03922 hypothetical protein; Provisional
Probab=28.58 E-value=26 Score=24.40 Aligned_cols=10 Identities=30% Similarity=0.760 Sum_probs=8.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
...||.||.+
T Consensus 49 ~~~cP~cge~ 58 (113)
T PRK03922 49 LTICPKCGEP 58 (113)
T ss_pred cccCCCCCCc
Confidence 4679999988
No 270
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=28.07 E-value=27 Score=19.36 Aligned_cols=8 Identities=38% Similarity=1.178 Sum_probs=4.0
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||..
T Consensus 2 ~Cp~Cg~~ 9 (39)
T PF01096_consen 2 KCPKCGHN 9 (39)
T ss_dssp --SSS-SS
T ss_pred CCcCCCCC
Confidence 58888877
No 271
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=27.90 E-value=99 Score=25.72 Aligned_cols=37 Identities=19% Similarity=0.156 Sum_probs=25.6
Q ss_pred ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeecccccc
Q psy838 93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQR 136 (152)
Q Consensus 93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qg 136 (152)
.-..++--. +|.+|+++.|.-. +-+|++++|.|..|.
T Consensus 282 CGv~vidl~-tG~vv~~l~feg~------v~EifdV~vLPg~r~ 318 (335)
T TIGR03032 282 CGVAVIDLN-SGDVVHWLRFEGV------IEEIYDVAVLPGVRR 318 (335)
T ss_pred ccEEEEECC-CCCEEEEEEeCCc------eeEEEEEEEecCCCC
Confidence 334444323 5999999987432 345999999999875
No 272
>PRK00420 hypothetical protein; Validated
Probab=27.88 E-value=73 Score=22.21 Aligned_cols=9 Identities=33% Similarity=0.929 Sum_probs=4.4
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-.||.||.+
T Consensus 24 ~~CP~Cg~p 32 (112)
T PRK00420 24 KHCPVCGLP 32 (112)
T ss_pred CCCCCCCCc
Confidence 345555544
No 273
>smart00507 HNHc HNH nucleases.
Probab=27.63 E-value=34 Score=18.81 Aligned_cols=14 Identities=29% Similarity=0.807 Sum_probs=10.3
Q ss_pred cccccccCCCCcccc
Q psy838 33 QHAKYTCSFCGKEYK 47 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~ 47 (152)
.+. +.|++||....
T Consensus 8 ~r~-~~C~~C~~~~~ 21 (52)
T smart00507 8 HRD-GVCAYCGKPAS 21 (52)
T ss_pred HHC-CCCcCCcCCCC
Confidence 344 89999998753
No 274
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=27.60 E-value=30 Score=20.99 Aligned_cols=9 Identities=33% Similarity=1.147 Sum_probs=7.0
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
..||.||..
T Consensus 5 i~CP~CgnK 13 (55)
T PF14205_consen 5 ILCPICGNK 13 (55)
T ss_pred EECCCCCCc
Confidence 369999965
No 275
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=27.56 E-value=27 Score=21.12 Aligned_cols=12 Identities=33% Similarity=0.844 Sum_probs=7.4
Q ss_pred ccccCCCCcccc
Q psy838 36 KYTCSFCGKEYK 47 (152)
Q Consensus 36 ~~~c~~c~~~~~ 47 (152)
--.||.||.+++
T Consensus 21 iVvCp~CgapyH 32 (54)
T PF14446_consen 21 IVVCPECGAPYH 32 (54)
T ss_pred EEECCCCCCccc
Confidence 345777777643
No 276
>COG3478 Predicted nucleic-acid-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=27.56 E-value=29 Score=21.85 Aligned_cols=10 Identities=30% Similarity=1.072 Sum_probs=7.0
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
...||.||-.
T Consensus 4 ~~kCpKCgn~ 13 (68)
T COG3478 4 AFKCPKCGNT 13 (68)
T ss_pred cccCCCcCCc
Confidence 4559999843
No 277
>smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Probab=27.45 E-value=64 Score=16.48 Aligned_cols=27 Identities=7% Similarity=-0.085 Sum_probs=13.8
Q ss_pred hhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838 85 HSEMFDDRACYLVAKNGSSSTPVAFSHF 112 (152)
Q Consensus 85 ~~el~~~~~~~~v~~~~~~~~ivGf~~~ 112 (152)
...+.+.....+++.++ +++++|++..
T Consensus 15 ~~~~~~~~~~~~~v~~~-~~~~~g~i~~ 41 (49)
T smart00116 15 LELLREHGIRRLPVVDE-EGRLVGIVTR 41 (49)
T ss_pred HHHHHHhCCCcccEECC-CCeEEEEEEH
Confidence 33444333333334333 3788888763
No 278
>KOG3993|consensus
Probab=27.44 E-value=26 Score=30.15 Aligned_cols=13 Identities=38% Similarity=0.879 Sum_probs=11.2
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
-|-.|.||.|+|.
T Consensus 292 V~vEYrCPEC~KV 304 (500)
T KOG3993|consen 292 VHVEYRCPECDKV 304 (500)
T ss_pred EEeeecCCccccc
Confidence 4568999999998
No 279
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=27.30 E-value=54 Score=30.66 Aligned_cols=15 Identities=27% Similarity=0.711 Sum_probs=10.6
Q ss_pred ccccCCCCcccccCC
Q psy838 36 KYTCSFCGKEYKKST 50 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~ 50 (152)
.-.||.||+++....
T Consensus 457 ~~~CPvCg~~l~~~~ 471 (908)
T COG0419 457 GEKCPVCGQELPEEH 471 (908)
T ss_pred CCCCCCCCCCCCcHH
Confidence 578999997644444
No 280
>PF11039 DUF2824: Protein of unknown function (DUF2824); InterPro: IPR022568 This family of proteins has no known function. Members of the family are found in P22-like viruses and bacteria. Some of the phage members have been annotated as head assembly proteins, but this has not been confirmed.
Probab=27.30 E-value=2.4e+02 Score=20.50 Aligned_cols=41 Identities=12% Similarity=0.088 Sum_probs=27.3
Q ss_pred CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeecccccc
Q psy838 90 DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQR 136 (152)
Q Consensus 90 ~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qg 136 (152)
.++..|+=+... +.++|+..++..-. .++.+|-+| +|++||
T Consensus 35 K~~~~Y~gVyeg--~~l~Gi~~v~~i~~---~~vecHa~y-~P~fRG 75 (151)
T PF11039_consen 35 KPDQLYLGVYEG--GQLGGIVYVEEIQP---SVVECHAMY-DPGFRG 75 (151)
T ss_pred cCccEEEEEEec--eEEEEEEEEEEEee---eeEEEEeee-ccccch
Confidence 356667767654 88999998865321 134456554 799998
No 281
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.29 E-value=28 Score=24.47 Aligned_cols=9 Identities=33% Similarity=1.158 Sum_probs=7.5
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
+.||.||..
T Consensus 93 ~~CP~Cgs~ 101 (124)
T PRK00762 93 IECPVCGNK 101 (124)
T ss_pred CcCcCCCCC
Confidence 579999965
No 282
>PF01246 Ribosomal_L24e: Ribosomal protein L24e; InterPro: IPR000988 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic and archaeabacterial ribosomal proteins can be grouped on the basis of sequence similarities. One of these families [] consists of mammalian ribosomal protein L24; yeast ribosomal protein L30A/B (Rp29) (YL21); Kluyveromyces lactis ribosomal protein L30; Arabidopsis thaliana ribosomal protein L24 homolog; Haloarcula marismortui ribosomal protein HL21/HL22; and Methanocaldococcus jannaschii (Methanococcus jannaschii) MJ1201. These proteins have 60 to 160 amino-acid residues. This entry represents proteins related to the L24e ribosomal proteins.; PDB: 2ZKR_u 1VQ9_U 1VQL_U 1KD1_V 1VQP_U 3CCM_U 3CD6_U 3CCL_U 3CCR_U 1Q86_V ....
Probab=27.27 E-value=31 Score=22.07 Aligned_cols=14 Identities=29% Similarity=0.501 Sum_probs=8.0
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
..+|.|||..+.+.
T Consensus 3 ~~~C~Fsg~~I~PG 16 (71)
T PF01246_consen 3 TEKCSFSGYKIYPG 16 (71)
T ss_dssp SEE-TTT-SEE-SS
T ss_pred eEEecccCCccCCC
Confidence 36799999985544
No 283
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=27.16 E-value=29 Score=25.60 Aligned_cols=8 Identities=63% Similarity=1.717 Sum_probs=6.3
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||||+..
T Consensus 2 ~CPfC~~~ 9 (156)
T COG1327 2 KCPFCGHE 9 (156)
T ss_pred CCCCCCCC
Confidence 48888876
No 284
>cd00085 HNHc HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins. The alignment includes members of the large group of homing endonucleases, yeast intron 1 protein, MutS, as well as bacterial colicins, pyocins, and anaredoxins.
Probab=27.11 E-value=46 Score=18.62 Aligned_cols=10 Identities=40% Similarity=0.949 Sum_probs=8.4
Q ss_pred cccCCCCccc
Q psy838 37 YTCSFCGKEY 46 (152)
Q Consensus 37 ~~c~~c~~~~ 46 (152)
+.|+.||..+
T Consensus 12 ~~C~~c~~~~ 21 (57)
T cd00085 12 GLCPYCGKPG 21 (57)
T ss_pred CcCccCCCcC
Confidence 8999999753
No 285
>PF04267 SoxD: Sarcosine oxidase, delta subunit family ; InterPro: IPR006279 These sequences represent the delta subunit of a family of known and putative heterotetrameric sarcosine oxidases. Five operons of such oxidases are found in Rhizobium loti (Mesorhizobium loti) and three in Agrobacterium tumefaciens, a high enough copy number to suggest that not all members share the same function. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate []. Bacterial sarcosine oxidases have been isolated from over a dozen different organisms and fall into two major classes (1) monomeric form that contains only covalent flavin and (2) heterotetrameric (alpha, beta, gamma, delta) form that contain a covalent and noncovalent flavin, this entry represents the heterotetrameric form.; GO: 0008115 sarcosine oxidase activity, 0046653 tetrahydrofolate metabolic process; PDB: 3AD7_D 1X31_D 1VRQ_D 3AD8_D 3ADA_D 3AD9_D 2GAG_D 2GAH_D.
Probab=27.02 E-value=20 Score=23.74 Aligned_cols=8 Identities=38% Similarity=1.240 Sum_probs=7.0
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||+||.-
T Consensus 3 ~CP~CG~R 10 (84)
T PF04267_consen 3 PCPHCGPR 10 (84)
T ss_dssp EETTTEEE
T ss_pred cCCCCCcc
Confidence 69999986
No 286
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=26.94 E-value=33 Score=24.73 Aligned_cols=11 Identities=27% Similarity=0.857 Sum_probs=7.9
Q ss_pred cccCCCCcccc
Q psy838 37 YTCSFCGKEYK 47 (152)
Q Consensus 37 ~~c~~c~~~~~ 47 (152)
-+||.||+.+.
T Consensus 4 ~nC~~CgklF~ 14 (137)
T TIGR03826 4 ANCPKCGRLFV 14 (137)
T ss_pred ccccccchhhh
Confidence 36888888843
No 287
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=26.86 E-value=29 Score=24.75 Aligned_cols=12 Identities=42% Similarity=0.899 Sum_probs=9.1
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
...+.||.||..
T Consensus 105 ~~~~~CP~Cgs~ 116 (135)
T PRK03824 105 HAFLKCPKCGSR 116 (135)
T ss_pred ccCcCCcCCCCC
Confidence 345779999975
No 288
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=26.79 E-value=30 Score=17.47 Aligned_cols=9 Identities=33% Similarity=0.870 Sum_probs=7.1
Q ss_pred ccCCCCccc
Q psy838 38 TCSFCGKEY 46 (152)
Q Consensus 38 ~c~~c~~~~ 46 (152)
.||.|++..
T Consensus 3 ~CPiC~~~v 11 (26)
T smart00734 3 QCPVCFREV 11 (26)
T ss_pred cCCCCcCcc
Confidence 599998765
No 289
>KOG2779|consensus
Probab=26.50 E-value=2.2e+02 Score=24.13 Aligned_cols=65 Identities=12% Similarity=0.173 Sum_probs=39.7
Q ss_pred cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CCce-EEEEEeCCCCCeEEEEEEEee
Q psy838 48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DRAC-YLVAKNGSSSTPVAFSHFRFD 115 (152)
Q Consensus 48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~~~-~~v~~~~~~~~ivGf~~~~~~ 115 (152)
-..++++.|++.+++|..+.+.+ |.- ..-.+++.+..-+.- ++-. .+++.. .+|.|-+|.+|...
T Consensus 262 G~R~me~kDvp~V~~Ll~~yl~q-f~l-a~~f~~eev~Hwf~p~e~VV~syVves-p~g~ITDF~SFy~l 328 (421)
T KOG2779|consen 262 GLREMEEKDVPAVFRLLRNYLKQ-FEL-APVFDEEEVEHWFLPRENVVYSYVVES-PNGKITDFCSFYSL 328 (421)
T ss_pred CcccccccchHHHHHHHHHHHHh-eec-ccccCHHHhHhhcccccceEEEEEEEC-CCCcccceeeEEec
Confidence 35788999999999999876644 321 123455544444432 2222 233333 36899999999754
No 290
>PF06827 zf-FPG_IleRS: Zinc finger found in FPG and IleRS; InterPro: IPR010663 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger domain found at the C-terminal in both DNA glycosylase/AP lyase enzymes and in isoleucyl tRNA synthetase. In these two types of enzymes, the C-terminal domain forms a zinc finger. Some related proteins may not bind zinc. DNA glycosylase/AP lyase enzymes are involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. These enzymes have both DNA glycosylase activity (3.2.2 from EC) and AP lyase activity (4.2.99.18 from EC) []. Examples include formamidopyrimidine-DNA glycosylases (Fpg; MutM) and endonuclease VIII (Nei). Formamidopyrimidine-DNA glycosylases (Fpg, MutM) is a trifunctional DNA base excision repair enzyme that removes a wide range of oxidation-damaged bases (N-glycosylase activity; 3.2.2.23 from EC) and cleaves both the 3'- and 5'-phosphodiester bonds of the resulting apurinic/apyrimidinic site (AP lyase activity; 4.2.99.18 from EC). Fpg has a preference for oxidised purines, excising oxidized purine bases such as 7,8-dihydro-8-oxoguanine (8-oxoG). ITs AP (apurinic/apyrimidinic) lyase activity introduces nicks in the DNA strand, cleaving the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Fpg is a monomer composed of 2 domains connected by a flexible hinge []. The two DNA-binding motifs (a zinc finger and the helix-two-turns-helix motifs) suggest that the oxidized base is flipped out from double-stranded DNA in the binding mode and excised by a catalytic mechanism similar to that of bifunctional base excision repair enzymes []. Fpg binds one ion of zinc at the C terminus, which contains four conserved and essential cysteines []. Endonuclease VIII (Nei) has the same enzyme activities as Fpg above, but with a preference for oxidized pyrimidines, such as thymine glycol, 5,6-dihydrouracil and 5,6-dihydrothymine [, ]. An Fpg-type zinc finger is also found at the C terminus of isoleucyl tRNA synthetase (6.1.1.5 from EC) [, ]. This enzyme catalyses the attachment of isoleucine to tRNA(Ile). As IleRS can inadvertently accommodate and process structurally similar amino acids such as valine, to avoid such errors it has two additional distinct tRNA(Ile)-dependent editing activities. One activity is designated as 'pre-transfer' editing and involves the hydrolysis of activated Val-AMP. The other activity is designated 'post-transfer' editing and involves deacylation of mischarged Val-tRNA(Ile) []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003824 catalytic activity; PDB: 1K82_C 1Q39_A 2OQ4_B 2OPF_A 1K3X_A 1K3W_A 1Q3B_A 2EA0_A 1Q3C_A 2XZF_A ....
Probab=26.38 E-value=13 Score=19.13 Aligned_cols=19 Identities=26% Similarity=0.714 Sum_probs=13.3
Q ss_pred HHhhhhhcccccccCCCCc
Q psy838 26 VKKMEITQHAKYTCSFCGK 44 (152)
Q Consensus 26 ~~~~~~~~~~~~~c~~c~~ 44 (152)
+.++...++..+.||.|-+
T Consensus 11 ~~~~~~~~r~~~~C~rCq~ 29 (30)
T PF06827_consen 11 IEDIGINGRSTYLCPRCQK 29 (30)
T ss_dssp BEEEEETTEEEEE-TTTCC
T ss_pred ceEeEecCCCCeECcCCcC
Confidence 4456667788999999954
No 291
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=26.18 E-value=29 Score=24.92 Aligned_cols=11 Identities=45% Similarity=1.020 Sum_probs=8.2
Q ss_pred ccccCCCCccc
Q psy838 36 KYTCSFCGKEY 46 (152)
Q Consensus 36 ~~~c~~c~~~~ 46 (152)
=+.||.|||.|
T Consensus 124 f~~C~~C~kiy 134 (147)
T PF01927_consen 124 FWRCPGCGKIY 134 (147)
T ss_pred EEECCCCCCEe
Confidence 46688888874
No 292
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=26.07 E-value=27 Score=20.72 Aligned_cols=12 Identities=25% Similarity=0.908 Sum_probs=7.5
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
.....||.||..
T Consensus 22 ~~~irCp~Cg~r 33 (49)
T COG1996 22 TRGIRCPYCGSR 33 (49)
T ss_pred cCceeCCCCCcE
Confidence 345567777765
No 293
>PF12387 Peptidase_C74: Pestivirus NS2 peptidase; InterPro: IPR022120 The pestivirus NS2 peptidase is responsible for single cleavage between NS2 and NS3 of the Bovine viral diarrhea virus 1 polyprotein, a cleavage that is correlated with cytopathogenicity []. The peptidase is activated by its interaction with 'J-domain protein interacting with viral protein' - Jiv. ; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0004252 serine-type endopeptidase activity, 0016817 hydrolase activity, acting on acid anhydrides, 0017111 nucleoside-triphosphatase activity, 0070008 serine-type exopeptidase activity
Probab=25.99 E-value=29 Score=26.22 Aligned_cols=17 Identities=24% Similarity=0.466 Sum_probs=13.2
Q ss_pred hhhhcccccccCCCCcc
Q psy838 29 MEITQHAKYTCSFCGKE 45 (152)
Q Consensus 29 ~~~~~~~~~~c~~c~~~ 45 (152)
+|..+...-+||.||+.
T Consensus 168 Ce~r~w~g~~CPKCGr~ 184 (200)
T PF12387_consen 168 CEGREWKGGNCPKCGRH 184 (200)
T ss_pred eecCccCCCCCCcccCC
Confidence 36666677789999986
No 294
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=25.67 E-value=34 Score=18.82 Aligned_cols=9 Identities=22% Similarity=0.659 Sum_probs=6.9
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-.||+|+..
T Consensus 6 v~CP~C~s~ 14 (36)
T PF03811_consen 6 VHCPRCQST 14 (36)
T ss_pred eeCCCCCCC
Confidence 469999865
No 295
>TIGR03830 CxxCG_CxxCG_HTH putative zinc finger/helix-turn-helix protein, YgiT family. This model describes a family of predicted regulatory proteins with a conserved zinc finger/HTH architecture. The amino-terminal region contains a novel domain, featuring two CXXC motifs and occuring in a number of small bacterial proteins as well as in the present family. The carboxyl-terminal region consists of a helix-turn-helix domain, modeled by pfam01381. The predicted function is DNA binding and transcriptional regulation.
Probab=25.60 E-value=1.3e+02 Score=20.39 Aligned_cols=14 Identities=29% Similarity=0.769 Sum_probs=9.7
Q ss_pred cccccCCCCccccc
Q psy838 35 AKYTCSFCGKEYKK 48 (152)
Q Consensus 35 ~~~~c~~c~~~~~~ 48 (152)
.-+.||.||-.+.+
T Consensus 30 ~~~~C~~CGe~~~~ 43 (127)
T TIGR03830 30 PGWYCPACGEELLD 43 (127)
T ss_pred eeeECCCCCCEEEc
Confidence 45689999965443
No 296
>smart00714 LITAF Possible membrane-associated motif in LPS-induced tumor necrosis factor alpha factor (LITAF), also known as PIG7, and other animal proteins.
Probab=25.58 E-value=34 Score=21.08 Aligned_cols=15 Identities=20% Similarity=0.386 Sum_probs=11.6
Q ss_pred hhcccccccCCCCcc
Q psy838 31 ITQHAKYTCSFCGKE 45 (152)
Q Consensus 31 ~~~~~~~~c~~c~~~ 45 (152)
.-+...|.||.||+.
T Consensus 47 ~~kd~~H~Cp~C~~~ 61 (67)
T smart00714 47 SFKDVNHYCPNCGAF 61 (67)
T ss_pred cccCccEECCCCCCE
Confidence 345668999999976
No 297
>COG1885 Uncharacterized protein conserved in archaea [Function unknown]
Probab=25.53 E-value=33 Score=23.72 Aligned_cols=10 Identities=30% Similarity=0.903 Sum_probs=8.2
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
-+.||.||.+
T Consensus 49 ~t~CP~Cg~~ 58 (115)
T COG1885 49 STSCPKCGEP 58 (115)
T ss_pred cccCCCCCCc
Confidence 4679999987
No 298
>PRK12496 hypothetical protein; Provisional
Probab=25.29 E-value=31 Score=25.46 Aligned_cols=11 Identities=36% Similarity=0.899 Sum_probs=8.1
Q ss_pred cccCCCCcccc
Q psy838 37 YTCSFCGKEYK 47 (152)
Q Consensus 37 ~~c~~c~~~~~ 47 (152)
-.||.||.+..
T Consensus 144 ~~C~~CG~~~~ 154 (164)
T PRK12496 144 DVCEICGSPVK 154 (164)
T ss_pred CcCCCCCChhh
Confidence 45999998743
No 299
>COG1645 Uncharacterized Zn-finger containing protein [General function prediction only]
Probab=25.19 E-value=30 Score=24.82 Aligned_cols=10 Identities=30% Similarity=0.783 Sum_probs=7.5
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.+.||.||.+
T Consensus 28 ~~hCp~Cg~P 37 (131)
T COG1645 28 AKHCPKCGTP 37 (131)
T ss_pred HhhCcccCCc
Confidence 4668888877
No 300
>PRK13361 molybdenum cofactor biosynthesis protein A; Provisional
Probab=25.15 E-value=58 Score=26.44 Aligned_cols=45 Identities=18% Similarity=0.525 Sum_probs=31.0
Q ss_pred ccchhHhHhHHhhhhhcccccccCCCCccc----ccCCCCCHHHHHHHHHH
Q psy838 17 RYGASLRKMVKKMEITQHAKYTCSFCGKEY----KKSTDLGAETKKWVWEL 63 (152)
Q Consensus 17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~----~~~~~l~~~d~~~i~~l 63 (152)
+||+.+. +-.++++..=++.|++|-..- .....++.+++..+.+.
T Consensus 8 ~~gr~i~--~l~i~iT~~CNl~C~yC~~~~~~~~~~~~~ls~eei~~li~~ 56 (329)
T PRK13361 8 SFGRTVT--YLRLSVTDRCDFRCVYCMSEDPCFLPRDQVLSLEELAWLAQA 56 (329)
T ss_pred CCCCccC--eEEEEecCCccccCCCCCCCCCCcCCccCCCCHHHHHHHHHH
Confidence 5666544 556888999999999996541 22346777777776654
No 301
>PF07315 DUF1462: Protein of unknown function (DUF1462); InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=25.01 E-value=94 Score=20.92 Aligned_cols=62 Identities=18% Similarity=0.249 Sum_probs=31.4
Q ss_pred ccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC---------CCCh---hhhhhhhcCCCceEEEEEeCCC
Q psy838 36 KYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDI---------GWDP---QGKHSEMFDDRACYLVAKNGSS 103 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~---------~w~~---~~~~~el~~~~~~~~v~~~~~~ 103 (152)
.-.|+.|= ..+-+.+..+|+-.+... -|....+ +-.+ ..+.+.+.+++..|=++.-+
T Consensus 6 e~~CASCV-----n~PsSkeTyeWL~aal~R----Kyp~~~f~~~YiDi~~p~~~~~~~~~a~~I~ede~fYPlV~i~-- 74 (93)
T PF07315_consen 6 EVICASCV-----NAPSSKETYEWLEAALKR----KYPDQPFEFTYIDIENPPENDHDQQFAERILEDELFYPLVVIN-- 74 (93)
T ss_dssp SS--GGGS-----SS--HHHHHHHHHHHHHH----H-TTS-EEEEEEETTT----HHHHHHHHHHHTTSS-SSEEEET--
T ss_pred cccchhhc-----CCCCchhHHHHHHHHHhC----cCCCCceEEEEEecCCCCccHHHHHHHHHHHhcccccceEEEC--
Confidence 44688883 455667779998876554 4654322 1111 13334456666666666655
Q ss_pred CCeEE
Q psy838 104 STPVA 108 (152)
Q Consensus 104 ~~ivG 108 (152)
|++||
T Consensus 75 ~eiV~ 79 (93)
T PF07315_consen 75 DEIVA 79 (93)
T ss_dssp TEEEE
T ss_pred CEEEe
Confidence 88886
No 302
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=24.98 E-value=34 Score=18.58 Aligned_cols=8 Identities=38% Similarity=0.983 Sum_probs=3.8
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||..
T Consensus 2 fC~~CG~~ 9 (34)
T PF14803_consen 2 FCPQCGGP 9 (34)
T ss_dssp B-TTT--B
T ss_pred ccccccCh
Confidence 48888877
No 303
>COG2093 DNA-directed RNA polymerase, subunit E'' [Transcription]
Probab=24.97 E-value=35 Score=21.35 Aligned_cols=9 Identities=33% Similarity=0.992 Sum_probs=7.0
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-.||.||..
T Consensus 19 e~CP~Cgs~ 27 (64)
T COG2093 19 EICPVCGST 27 (64)
T ss_pred ccCCCCCCc
Confidence 359999965
No 304
>PF10609 ParA: ParA/MinD ATPase like; InterPro: IPR019591 This entry represents ATPases involved in plasmid partitioning []. It also contains cytosolic Fe-S cluster assembling factors, NBP35 and CFD1 which are required for biogenesis and export of both ribosomal subunits probably through assembling the ISCs in RLI1, a protein which performs rRNA processing and ribosome export [, , ].; PDB: 2PH1_A 3KB1_B.
Probab=24.95 E-value=28 Score=22.78 Aligned_cols=13 Identities=23% Similarity=0.897 Sum_probs=7.1
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
.-+-|.||.||+.
T Consensus 62 NMs~~~Cp~Cg~~ 74 (81)
T PF10609_consen 62 NMSYFVCPHCGER 74 (81)
T ss_dssp CT-EEE-TTT--E
T ss_pred CCCccCCCCCCCe
Confidence 3467899999987
No 305
>PF10915 DUF2709: Protein of unknown function (DUF2709); InterPro: IPR024484 Members of this family appear restricted to Chlamydiales. Their function is unknown.
Probab=24.90 E-value=64 Score=24.89 Aligned_cols=32 Identities=25% Similarity=0.534 Sum_probs=23.8
Q ss_pred hhcccccccCCCCcccccCCCCCHHH--HHHHHH
Q psy838 31 ITQHAKYTCSFCGKEYKKSTDLGAET--KKWVWE 62 (152)
Q Consensus 31 ~~~~~~~~c~~c~~~~~~~~~l~~~d--~~~i~~ 62 (152)
+..+.-|-|||-||.+-...-..++| .+|+-.
T Consensus 82 E~TkkIYICPFTGKVF~DNt~~nPQDAIYDWvSk 115 (238)
T PF10915_consen 82 EQTKKIYICPFTGKVFGDNTHPNPQDAIYDWVSK 115 (238)
T ss_pred cccceEEEcCCcCccccCCCCCChHHHHHHHHhh
Confidence 44566799999999988887777777 344443
No 306
>COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion]
Probab=24.78 E-value=31 Score=29.21 Aligned_cols=11 Identities=36% Similarity=1.017 Sum_probs=9.0
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
.+..||+||..
T Consensus 4 k~~vCp~CG~l 14 (429)
T COG1029 4 KNVVCPFCGTL 14 (429)
T ss_pred eeEEcCccccc
Confidence 46789999987
No 307
>KOG1491|consensus
Probab=24.59 E-value=64 Score=27.23 Aligned_cols=24 Identities=17% Similarity=0.439 Sum_probs=19.1
Q ss_pred Eeeee--eccccccCCHHHHHHHHHH
Q psy838 125 CYELQ--LEKQVQRKGLGKFMMQAFY 148 (152)
Q Consensus 125 I~~i~--V~p~~qgkGiG~~Lm~~l~ 148 (152)
|++|+ |.-+.+|+|+|...++.+-
T Consensus 88 v~DIAGLvkGAs~G~GLGN~FLs~iR 113 (391)
T KOG1491|consen 88 VYDIAGLVKGASAGEGLGNKFLSHIR 113 (391)
T ss_pred EEeecccccCcccCcCchHHHHHhhh
Confidence 45555 7788999999999998764
No 308
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=24.58 E-value=30 Score=19.12 Aligned_cols=12 Identities=33% Similarity=0.861 Sum_probs=7.1
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
..-|.|..||..
T Consensus 4 ~~~YkC~~CGni 15 (36)
T PF06397_consen 4 GEFYKCEHCGNI 15 (36)
T ss_dssp TEEEE-TTT--E
T ss_pred ccEEEccCCCCE
Confidence 456899999976
No 309
>PF04564 U-box: U-box domain; InterPro: IPR003613 Quality control of intracellular proteins is essential for cellular homeostasis. Molecular chaperones recognise and contribute to the refolding of misfolded or unfolded proteins, whereas the ubiquitin-proteasome system mediates the degradation of such abnormal proteins. Ubiquitin-protein ligases (E3s) determine the substrate specificity for ubiquitylation and have been classified into HECT and RING-finger families. More recently, however, U-box proteins, which contain a domain (the U box) of about 70 amino acids that is conserved from yeast to humans, have been identified as a new type of E3 []. Members of the U-box family of proteins constitute a class of ubiquitin-protein ligases (E3s) distinct from the HECT-type and RING finger-containing E3 families []. Using yeast two-hybrid technology, all mammalian U-box proteins have been reported to interact with molecular chaperones or co-chaperones, including Hsp90, Hsp70, DnaJc7, EKN1, CRN, and VCP. This suggests that the function of U box-type E3s is to mediate the degradation of unfolded or misfolded proteins in conjunction with molecular chaperones as receptors that recognise such abnormal proteins [, ]. Unlike the RING finger domain, IPR001841 from INTERPRO, that is stabilised by Zn2+ ions coordinated by the cysteines and a histidine, the U-box scaffold is probably stabilised by a system of salt-bridges and hydrogen bonds. The charged and polar residues that participate in this network of bonds are more strongly conserved in the U-box proteins than in classic RING fingers, which supports their role in maintaining the stability of the U box. Thus, the U box appears to have evolved from a RING finger domain by appropriation of a new set of residues required to stabilise its structure, concomitant with the loss of the original, metal-chelating residues [].; GO: 0004842 ubiquitin-protein ligase activity, 0016567 protein ubiquitination, 0000151 ubiquitin ligase complex; PDB: 1T1H_A 2C2L_D 2C2V_V 1WGM_A 2KR4_A 3L1Z_B 3L1X_A 2KRE_A 3M63_A 2QIZ_A ....
Probab=24.45 E-value=99 Score=19.27 Aligned_cols=27 Identities=15% Similarity=-0.018 Sum_probs=15.3
Q ss_pred cccccCCCCcccccCCCCCHHHHHHHH
Q psy838 35 AKYTCSFCGKEYKKSTDLGAETKKWVW 61 (152)
Q Consensus 35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~ 61 (152)
...+||+|+..+.....+....+....
T Consensus 38 ~~~~~P~t~~~l~~~~l~pn~~Lk~~I 64 (73)
T PF04564_consen 38 NGGTDPFTRQPLSESDLIPNRALKSAI 64 (73)
T ss_dssp TSSB-TTT-SB-SGGGSEE-HHHHHHH
T ss_pred CCCCCCCCCCcCCcccceECHHHHHHH
Confidence 478999998877666555555555444
No 310
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=24.43 E-value=26 Score=19.72 Aligned_cols=10 Identities=50% Similarity=1.295 Sum_probs=7.1
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.|.|+.||..
T Consensus 5 ey~C~~Cg~~ 14 (42)
T PF09723_consen 5 EYRCEECGHE 14 (42)
T ss_pred EEEeCCCCCE
Confidence 4678888766
No 311
>PRK14852 hypothetical protein; Provisional
Probab=24.32 E-value=6.5e+02 Score=24.30 Aligned_cols=88 Identities=13% Similarity=0.088 Sum_probs=53.8
Q ss_pred CCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-----CCceEEEEEeCCCCCeEEEEEEEeecc---------
Q psy838 52 LGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-----DRACYLVAKNGSSSTPVAFSHFRFDVD--------- 117 (152)
Q Consensus 52 l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-----~~~~~~v~~~~~~~~ivGf~~~~~~~~--------- 117 (152)
-++++.+.++.|... .|-.+ |+..+.......+ |.+..+++... +.++|-+.+.++..
T Consensus 35 et~~e~~~~~~L~~~----~Y~~~--Gy~~~~ps~~~~~~~~~lp~t~~~i~k~~--~~~l~T~t~~~ds~~~Gl~~D~l 106 (989)
T PRK14852 35 ETPDEYTRAFRLVYE----EYIRS--GYLKPHPSRMYYNVWSILPATSVFIFKSY--HDVLCTLTHIPDSGLFGLPMDTL 106 (989)
T ss_pred CCHHHHHHHHHHHHH----HHHHc--CCCCcCcccccCCccccCCcceEEEeccC--CcEEEEEEEecCCcccCcCHHHH
Confidence 367889999999765 56543 3443333333322 34444554433 66777776655422
Q ss_pred ----------CCcceEEEeeeeeccccccCCHHHHHHHHH
Q psy838 118 ----------FGEPVLYCYELQLEKQVQRKGLGKFMMQAF 147 (152)
Q Consensus 118 ----------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l 147 (152)
.|..++.+..|+++|+.+.+-+=-.|++.+
T Consensus 107 f~~eLd~lr~~Gr~v~EvtrLa~d~~~~~~~l~~~l~~~~ 146 (989)
T PRK14852 107 YKPEVDALRAQGRNVVEVGALATQYSRRWTNLMVFLAKAM 146 (989)
T ss_pred HHHHHHHHHHcCCeEEeeehheechhhcccchhHHHHHHH
Confidence 256789999999998888766544555554
No 312
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=24.07 E-value=38 Score=21.63 Aligned_cols=13 Identities=23% Similarity=1.147 Sum_probs=9.8
Q ss_pred cccccCCCCcccc
Q psy838 35 AKYTCSFCGKEYK 47 (152)
Q Consensus 35 ~~~~c~~c~~~~~ 47 (152)
..+.|+.|++.+.
T Consensus 49 ~~~~C~~C~~~f~ 61 (100)
T PF12756_consen 49 ESFRCPYCNKTFR 61 (100)
T ss_dssp SSEEBSSSS-EES
T ss_pred CCCCCCccCCCCc
Confidence 3799999998855
No 313
>COG5319 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.07 E-value=34 Score=24.56 Aligned_cols=13 Identities=31% Similarity=0.825 Sum_probs=10.4
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
.+...+||.||..
T Consensus 29 ~rgLv~CPvCgs~ 41 (142)
T COG5319 29 ERGLVTCPVCGST 41 (142)
T ss_pred HcCceeCCCCCcH
Confidence 3456789999988
No 314
>PF08348 PAS_6: YheO-like PAS domain; InterPro: IPR013559 This domain is found in various hypothetical bacterial proteins that are similar to the Escherichia coli protein YheO (P64624 from SWISSPROT). Their function is unknown, but a few members are annotated as being HTH-containing proteins and putative DNA-binding proteins.
Probab=23.87 E-value=1.3e+02 Score=20.81 Aligned_cols=23 Identities=9% Similarity=0.251 Sum_probs=16.6
Q ss_pred eEEEEEeCCCCCeEEEEEEEeecc
Q psy838 94 CYLVAKNGSSSTPVAFSHFRFDVD 117 (152)
Q Consensus 94 ~~~v~~~~~~~~ivGf~~~~~~~~ 117 (152)
..+++.++ +|+++|.++..++.+
T Consensus 85 sT~~Ird~-~g~~iG~LCIN~D~s 107 (118)
T PF08348_consen 85 STFFIRDE-NGKLIGALCINFDIS 107 (118)
T ss_pred EEEEEECC-CCCEEEEEEEEeccH
Confidence 34555554 689999999998644
No 315
>PF09706 Cas_CXXC_CXXC: CRISPR-associated protein (Cas_CXXC_CXXC); InterPro: IPR019121 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a conserved domain of about 65 amino acids found in otherwise highly divergent proteins encoded in CRISPR-associated regions. This domain features two CXXC motifs.
Probab=23.86 E-value=38 Score=21.33 Aligned_cols=13 Identities=46% Similarity=1.237 Sum_probs=10.7
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
+..++.|-+||..
T Consensus 2 ~k~~~~C~~Cg~r 14 (69)
T PF09706_consen 2 SKKKYNCIFCGER 14 (69)
T ss_pred CCCCCcCcCCCCc
Confidence 5679999999954
No 316
>COG4694 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.85 E-value=31 Score=30.91 Aligned_cols=10 Identities=60% Similarity=1.408 Sum_probs=8.1
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.--|||||+.
T Consensus 278 ~q~CpFCg~e 287 (758)
T COG4694 278 NQICPFCGKE 287 (758)
T ss_pred CCCCCccchH
Confidence 5679999965
No 317
>COG0375 HybF Zn finger protein HypA/HybF (possibly regulating hydrogenase expression) [General function prediction only]
Probab=23.80 E-value=39 Score=23.72 Aligned_cols=10 Identities=40% Similarity=1.072 Sum_probs=7.8
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.|.||.||..
T Consensus 86 ~~~CP~C~s~ 95 (115)
T COG0375 86 DYRCPKCGSI 95 (115)
T ss_pred eeECCCCCCC
Confidence 4559999976
No 318
>COG5270 PUA domain (predicted RNA-binding domain) [Translation, ribosomal structure and biogenesis]
Probab=23.67 E-value=1e+02 Score=23.66 Aligned_cols=33 Identities=9% Similarity=0.136 Sum_probs=19.2
Q ss_pred eEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCH
Q psy838 94 CYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGL 139 (152)
Q Consensus 94 ~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGi 139 (152)
+++++.. ||+++|+....-+.+ ..+.|+ ||+|+
T Consensus 164 ~viVv~~--ng~~vGVg~a~~~~~----------~~in~~-rG~~v 196 (202)
T COG5270 164 EVIVVSE--NGRVVGVGIAKKSYE----------ELINPE-RGTGV 196 (202)
T ss_pred eEEEEec--CCEEEEEEEEecCHH----------HhcCcc-cCccc
Confidence 3444444 488888877654432 335666 66653
No 319
>PF03982 DAGAT: Diacylglycerol acyltransferase ; InterPro: IPR007130 The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) [, ].; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups
Probab=23.67 E-value=1.1e+02 Score=24.84 Aligned_cols=35 Identities=11% Similarity=0.096 Sum_probs=27.6
Q ss_pred CCcc--cccCCCCCHHHHHHHHHHHHHhHHHHHHhCC
Q psy838 42 CGKE--YKKSTDLGAETKKWVWELEEKNMKHSYEVCD 76 (152)
Q Consensus 42 c~~~--~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~ 76 (152)
.|++ +.....-+++|++.+.+...+.++++|+++.
T Consensus 247 VG~PI~v~~~~~Pt~e~Vd~~H~~Y~~~L~~LFd~~K 283 (297)
T PF03982_consen 247 VGKPIPVPKIENPTQEDVDKLHARYIEALRELFDKHK 283 (297)
T ss_pred eeceecccCCCCcCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444 4445566899999999999999999998864
No 320
>PRK14891 50S ribosomal protein L24e/unknown domain fusion protein; Provisional
Probab=23.63 E-value=41 Score=24.08 Aligned_cols=14 Identities=29% Similarity=0.693 Sum_probs=10.6
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
..+|.|||.++.+.
T Consensus 4 ~e~CsFcG~kIyPG 17 (131)
T PRK14891 4 TRTCDYTGEEIEPG 17 (131)
T ss_pred eeeecCcCCcccCC
Confidence 45799999985554
No 321
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=23.62 E-value=38 Score=18.48 Aligned_cols=9 Identities=33% Similarity=0.800 Sum_probs=6.9
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-+||.|++.
T Consensus 3 i~CP~C~~~ 11 (37)
T PF13719_consen 3 ITCPNCQTR 11 (37)
T ss_pred EECCCCCce
Confidence 368999876
No 322
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=23.48 E-value=31 Score=27.29 Aligned_cols=39 Identities=21% Similarity=0.416 Sum_probs=22.6
Q ss_pred eeeeccccccccc----hhHhHhHHhhhhhcccccccCCCCccc
Q psy838 7 KVGITGKYGTRYG----ASLRKMVKKMEITQHAKYTCSFCGKEY 46 (152)
Q Consensus 7 ~~~~~~~~g~ryg----~~~rk~~~~~~~~~~~~~~c~~c~~~~ 46 (152)
.+|++.==|-+=| .--+...++|-. .-.--+||.||+..
T Consensus 189 g~gvvpl~g~~C~GC~m~l~~~~~~~V~~-~d~iv~CP~CgRIL 231 (239)
T COG1579 189 GVGVVPLEGRVCGGCHMKLPSQTLSKVRK-KDEIVFCPYCGRIL 231 (239)
T ss_pred CceEEeecCCcccCCeeeecHHHHHHHhc-CCCCccCCccchHH
Confidence 4555544443333 223444555544 66677899999873
No 323
>KOG4602|consensus
Probab=23.30 E-value=33 Score=27.59 Aligned_cols=13 Identities=31% Similarity=0.958 Sum_probs=10.1
Q ss_pred cccccccCCCCcc
Q psy838 33 QHAKYTCSFCGKE 45 (152)
Q Consensus 33 ~~~~~~c~~c~~~ 45 (152)
-...|+||.||.-
T Consensus 265 kLR~YVCPiCGAT 277 (318)
T KOG4602|consen 265 KLRSYVCPICGAT 277 (318)
T ss_pred hHhhhcCcccccc
Confidence 3447999999965
No 324
>PF01396 zf-C4_Topoisom: Topoisomerase DNA binding C4 zinc finger; InterPro: IPR013498 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry represents the zinc-finger domain found in type IA topoisomerases, including bacterial and archaeal topoisomerase I and III enzymes, and in eukaryotic topoisomerase III enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0005694 chromosome
Probab=23.26 E-value=39 Score=18.71 Aligned_cols=8 Identities=38% Similarity=1.049 Sum_probs=6.4
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||..
T Consensus 3 ~CP~Cg~~ 10 (39)
T PF01396_consen 3 KCPKCGGP 10 (39)
T ss_pred CCCCCCce
Confidence 58999866
No 325
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=23.05 E-value=49 Score=23.14 Aligned_cols=9 Identities=44% Similarity=1.132 Sum_probs=5.7
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
..||.|||.
T Consensus 70 V~CP~C~K~ 78 (114)
T PF11023_consen 70 VECPNCGKQ 78 (114)
T ss_pred eECCCCCCh
Confidence 347777666
No 326
>PF12677 DUF3797: Domain of unknown function (DUF3797); InterPro: IPR024256 This presumed domain is functionally uncharacterised. This domain family is found in bacteria and viruses, and is approximately 50 amino acids in length. There is a conserved CGN sequence motif.
Probab=23.05 E-value=39 Score=20.00 Aligned_cols=9 Identities=33% Similarity=1.043 Sum_probs=7.4
Q ss_pred ccCCCCccc
Q psy838 38 TCSFCGKEY 46 (152)
Q Consensus 38 ~c~~c~~~~ 46 (152)
.||.||.+.
T Consensus 15 ~Cp~CGN~~ 23 (49)
T PF12677_consen 15 KCPKCGNDK 23 (49)
T ss_pred cCcccCCcE
Confidence 599999873
No 327
>PF06170 DUF983: Protein of unknown function (DUF983); InterPro: IPR009325 This family consists of several bacterial proteins of unknown function.
Probab=23.05 E-value=47 Score=21.92 Aligned_cols=23 Identities=22% Similarity=0.559 Sum_probs=14.4
Q ss_pred ccccCCCCcccccCCCCCHHHHHHHH
Q psy838 36 KYTCSFCGKEYKKSTDLGAETKKWVW 61 (152)
Q Consensus 36 ~~~c~~c~~~~~~~~~l~~~d~~~i~ 61 (152)
.-.|+.||-++.... .+|-+..+
T Consensus 8 ~~~C~~CG~d~~~~~---adDgPA~f 30 (86)
T PF06170_consen 8 APRCPHCGLDYSHAR---ADDGPAYF 30 (86)
T ss_pred CCcccccCCccccCC---cCccchhH
Confidence 457999998855443 34444444
No 328
>PLN02294 cytochrome c oxidase subunit Vb
Probab=23.00 E-value=38 Score=25.46 Aligned_cols=13 Identities=15% Similarity=0.825 Sum_probs=10.2
Q ss_pred cccccCCCCcccc
Q psy838 35 AKYTCSFCGKEYK 47 (152)
Q Consensus 35 ~~~~c~~c~~~~~ 47 (152)
..+.||.||..+.
T Consensus 140 kp~RCpeCG~~fk 152 (174)
T PLN02294 140 KSFECPVCTQYFE 152 (174)
T ss_pred CceeCCCCCCEEE
Confidence 4778999998843
No 329
>PF04376 ATE_N: Arginine-tRNA-protein transferase, N terminus; InterPro: IPR007471 Arginine-tRNA-protein transferase catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N terminus rule pathway of protein degradation by conjugating a destabilising amino acid to the N-terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N-terminal cysteine is sometimes modified []. In Saccharomyces cerevisiae, Cys20, 23, 94 and/or 95 are thought to be important for activity []. Of these, only Cys 94 appears to be completely conserved in this family. This entry represents the N-terminal region of the enzyme arginine-tRNA-protein transferase, found in both eukaryotic and prokaryotic enzymes.; GO: 0004057 arginyltransferase activity, 0016598 protein arginylation
Probab=22.94 E-value=1.1e+02 Score=19.58 Aligned_cols=29 Identities=17% Similarity=0.363 Sum_probs=20.1
Q ss_pred ccccccCCCCcc------cc--cCCCCCHHHHHHHHH
Q psy838 34 HAKYTCSFCGKE------YK--KSTDLGAETKKWVWE 62 (152)
Q Consensus 34 ~~~~~c~~c~~~------~~--~~~~l~~~d~~~i~~ 62 (152)
..++.|+.|... .. +...++.++.+.+++
T Consensus 5 ~~~~~C~Y~~~~~~~~~~~~~~~~~~~~~~~y~~Ll~ 41 (80)
T PF04376_consen 5 TQPSSCGYCPGRSARSSYTYGAPSESLSPEDYQQLLD 41 (80)
T ss_pred CCCCcCCCCCCCccCEEEEeccccccCCHHHHHHHHH
Confidence 467899999522 22 577888877777664
No 330
>PF10815 ComZ: ComZ; InterPro: IPR024558 ComZ, which contains a leucine zipper motif, negatively regulates transcription of the ComG operon [].
Probab=22.80 E-value=1.3e+02 Score=18.22 Aligned_cols=26 Identities=27% Similarity=0.360 Sum_probs=22.6
Q ss_pred CCCCCHHHHHHHHHHHHHhHHHHHHh
Q psy838 49 STDLGAETKKWVWELEEKNMKHSYEV 74 (152)
Q Consensus 49 ~~~l~~~d~~~i~~l~~~~m~~~y~~ 74 (152)
.-+++.+.+.-+++|....|.+.|+-
T Consensus 25 GIeLsme~~qP~m~L~~~VM~eAYEl 50 (56)
T PF10815_consen 25 GIELSMEMLQPLMQLLTKVMNEAYEL 50 (56)
T ss_pred CccCCHHHHHHHHHHHHHHHHHHHHh
Confidence 46788999999999999999999953
No 331
>TIGR02668 moaA_archaeal probable molybdenum cofactor biosynthesis protein A, archaeal. This model describes an archaeal family related, and predicted to be functionally equivalent, to molybdenum cofactor biosynthesis protein A (MoaA) of bacteria (see TIGR02666).
Probab=22.63 E-value=71 Score=25.38 Aligned_cols=46 Identities=17% Similarity=0.486 Sum_probs=31.7
Q ss_pred ccchhHhHhHHhhhhhcccccccCCCCccccc---CCCCCHHHHHHHHHHH
Q psy838 17 RYGASLRKMVKKMEITQHAKYTCSFCGKEYKK---STDLGAETKKWVWELE 64 (152)
Q Consensus 17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~---~~~l~~~d~~~i~~l~ 64 (152)
+||+.+.. -.|+.+..=++.|.+|-..... ...++.+++..+.+..
T Consensus 4 ~~gr~~~~--l~i~vT~~CNl~C~yC~~~~~~~~~~~~ls~eei~~~i~~~ 52 (302)
T TIGR02668 4 RFGRPVTS--LRISVTDRCNLSCFYCHMEGEDRSGGNELSPEEIERIVRVA 52 (302)
T ss_pred CCCCccCe--EEEEEcccccCCCCCCCccccCCCccCcCCHHHHHHHHHHH
Confidence 46666654 3678888889999999765322 3467888877766543
No 332
>PF14824 Sirohm_synth_M: Sirohaem biosynthesis protein central; PDB: 1KYQ_B.
Probab=22.56 E-value=52 Score=17.45 Aligned_cols=19 Identities=21% Similarity=0.293 Sum_probs=15.1
Q ss_pred eccccccccchhHhHhHHh
Q psy838 10 ITGKYGTRYGASLRKMVKK 28 (152)
Q Consensus 10 ~~~~~g~ryg~~~rk~~~~ 28 (152)
+++.=+||+.+.+|+.+..
T Consensus 10 STnG~sP~la~~iR~~ie~ 28 (30)
T PF14824_consen 10 STNGKSPRLARLIRKEIER 28 (30)
T ss_dssp EESSS-HHHHHHHHHHHHH
T ss_pred ECCCCChHHHHHHHHHHHH
Confidence 5778899999999998764
No 333
>PF06677 Auto_anti-p27: Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27); InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=22.08 E-value=46 Score=18.84 Aligned_cols=13 Identities=23% Similarity=0.613 Sum_probs=9.5
Q ss_pred ccccCCCCccccc
Q psy838 36 KYTCSFCGKEYKK 48 (152)
Q Consensus 36 ~~~c~~c~~~~~~ 48 (152)
.-.||.||.+..+
T Consensus 17 ~~~Cp~C~~PL~~ 29 (41)
T PF06677_consen 17 DEHCPDCGTPLMR 29 (41)
T ss_pred cCccCCCCCeeEE
Confidence 3469999988544
No 334
>PF10601 zf-LITAF-like: LITAF-like zinc ribbon domain; InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure [].
Probab=22.03 E-value=43 Score=20.97 Aligned_cols=14 Identities=21% Similarity=0.427 Sum_probs=11.1
Q ss_pred hcccccccCCCCcc
Q psy838 32 TQHAKYTCSFCGKE 45 (152)
Q Consensus 32 ~~~~~~~c~~c~~~ 45 (152)
-+...|.||.|++.
T Consensus 54 ~kd~~H~Cp~C~~~ 67 (73)
T PF10601_consen 54 CKDVYHYCPNCGAF 67 (73)
T ss_pred ccCceEECCCCCCE
Confidence 35668999999976
No 335
>PF02799 NMT_C: Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain; InterPro: IPR022677 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the C-terminal region.; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 2WUU_A 1IYL_B 1NMT_B 1IYK_A ....
Probab=21.87 E-value=1.2e+02 Score=23.07 Aligned_cols=66 Identities=11% Similarity=0.076 Sum_probs=38.5
Q ss_pred ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CC-ceEEEEEeCCCCCeEEEEEEEee
Q psy838 47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DR-ACYLVAKNGSSSTPVAFSHFRFD 115 (152)
Q Consensus 47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~-~~~~v~~~~~~~~ivGf~~~~~~ 115 (152)
....+++++|++.+..|..+.+.. |+- ..-.+++...--+.. ++ -..+|+.++ +++|-.|++|...
T Consensus 29 ~glR~m~~~Dv~~v~~Ll~~yl~~-f~l-~~~fs~eev~Hw~lp~~~Vv~syVve~~-~~~ITDf~SFY~L 96 (190)
T PF02799_consen 29 PGLRPMEEKDVPQVTKLLNKYLKK-FDL-APVFSEEEVKHWFLPRKNVVYSYVVEDP-DGKITDFFSFYSL 96 (190)
T ss_dssp TTEEE--GGGHHHHHHHHHHHHTT-SSE-EEE--HHHHHHHHS-BTTTEEEEEEEET-TSEEEEEEEEEEE
T ss_pred CccccCchhhHHHHHHHHHHHHHh-ccc-ccccCHHHHHhhcccCCCeEEEEEEecC-CCceeeEEEEeec
Confidence 336889999999999998875533 321 112455555444443 22 334455555 4799999999764
No 336
>smart00746 TRASH metallochaperone-like domain.
Probab=21.73 E-value=44 Score=16.33 Aligned_cols=7 Identities=86% Similarity=2.124 Sum_probs=3.5
Q ss_pred cCCCCcc
Q psy838 39 CSFCGKE 45 (152)
Q Consensus 39 c~~c~~~ 45 (152)
||.||..
T Consensus 1 c~~C~~~ 7 (39)
T smart00746 1 CSFCGKD 7 (39)
T ss_pred CCCCCCC
Confidence 4555544
No 337
>PF03376 Adeno_E3B: Adenovirus E3B protein; InterPro: IPR005041 Adenoviruses are medium-sized, non-enveloped viruses containing double-stranded DNA. They can cause a variety of diseases including pneumonia, cystitis, conjunctivitis and diarrhoea, all of which can be fatal to patients who are immunocompromised []. These viruses have many mechanisms to evade the host immune response, including several proteins which are expressed as part of the early transcription unit 3 (E3) []. One of the regions of E3, known as the E3B region, encodes three proteins known as 10.4K, 14.5K and 14.7K. Two of these proteins, 10.4K and 14.5K, form the RID complex (receptor internalisation and degradation) which protects the infected cell from host-induced lysis by clearing the the TNF and Fas receptors from the cell surface []. Other receptors, such as the epidermal growth factor receptor, are also known to be cleared by RID []. This entry represents the E3B region 10.4K protein, also known as the RID alpha subunit.; GO: 0016020 membrane
Probab=21.73 E-value=40 Score=21.20 Aligned_cols=14 Identities=7% Similarity=0.228 Sum_probs=10.6
Q ss_pred eccccccCCHHHHH
Q psy838 130 LEKQVQRKGLGKFM 143 (152)
Q Consensus 130 V~p~~qgkGiG~~L 143 (152)
=+|+||++-|++-|
T Consensus 52 HhPqYrn~~iA~LL 65 (67)
T PF03376_consen 52 HHPQYRNQQIAALL 65 (67)
T ss_pred cCchhcCHHHHHHh
Confidence 47888888887654
No 338
>PRK00758 GMP synthase subunit A; Validated
Probab=21.67 E-value=1.1e+02 Score=22.43 Aligned_cols=27 Identities=19% Similarity=0.388 Sum_probs=23.5
Q ss_pred EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838 125 CYELQLEKQVQRKGLGKFMMQAFYTYI 151 (152)
Q Consensus 125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a 151 (152)
++.+|-|||+...--|.+|++.+.+.|
T Consensus 155 ~~g~QfHPE~~~~~~g~~l~~~f~~~~ 181 (184)
T PRK00758 155 IYGVQFHPEVAHTEYGEEIFKNFLEIC 181 (184)
T ss_pred EEEEEcCCccCCCchHHHHHHHHHHHH
Confidence 688999999987779999999988765
No 339
>TIGR03823 FliZ flagellar regulatory protein FliZ. FliZ is involved in the regulation of flagellar assembly and possibly also the down-regulation of the motile phenotype. FliZ interacts with the flagellar translational activator FlhCD complex.
Probab=21.66 E-value=60 Score=24.09 Aligned_cols=35 Identities=20% Similarity=0.285 Sum_probs=28.8
Q ss_pred hcccccccCCCCcc------cccCCCCCHHHHHHHHHHHHH
Q psy838 32 TQHAKYTCSFCGKE------YKKSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 32 ~~~~~~~c~~c~~~------~~~~~~l~~~d~~~i~~l~~~ 66 (152)
-.|+..-|..|+|. +.....+..+.+..+=.++++
T Consensus 18 fKhsQthCshC~K~LDRisLV~~gqiinK~~Ia~md~liDd 58 (168)
T TIGR03823 18 FKHSQTHCSHCHKLLDRISLVFRGQIINKESISRMDTLIDD 58 (168)
T ss_pred hccccchhhhhcchhhheeeeecceeecHHHHHHHhhhccH
Confidence 34567789999999 778899999999888887765
No 340
>PRK05978 hypothetical protein; Provisional
Probab=21.59 E-value=41 Score=24.59 Aligned_cols=11 Identities=27% Similarity=0.736 Sum_probs=8.9
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
..-+||.||+-
T Consensus 32 l~grCP~CG~G 42 (148)
T PRK05978 32 FRGRCPACGEG 42 (148)
T ss_pred HcCcCCCCCCC
Confidence 46789999987
No 341
>cd00472 Ribosomal_L24e_L24 Ribosomal protein L24e/L24 is a ribosomal protein found in eukaryotes (L24) and in archaea (L24e, distinct from archaeal L24). L24e/L24 is located on the surface of the large subunit, adjacent to proteins L14 and L3, and near the translation factor binding site. L24e/L24 appears to play a role in the kinetics of peptide synthesis, and may be involved in interactions between the large and small subunits, either directly or through other factors. In mouse, a deletion mutation in L24 has been identified as the cause for the belly spot and tail (Bst) mutation that results in disrupted pigmentation, somitogenesis and retinal cell fate determination. L24 may be an important protein in eukaryotic reproduction: in shrimp, L24 expression is elevated in the ovary, suggesting a role in oogenesis, and in Arabidopsis, L24 has been proposed to have a specific function in gynoecium development. No protein with sequence or structural homology to L24e/L24 has been identifi
Probab=21.54 E-value=46 Score=20.05 Aligned_cols=14 Identities=36% Similarity=0.771 Sum_probs=9.5
Q ss_pred ccccCCCCcccccC
Q psy838 36 KYTCSFCGKEYKKS 49 (152)
Q Consensus 36 ~~~c~~c~~~~~~~ 49 (152)
...|.|||..+.+.
T Consensus 3 ~~~C~f~g~~I~PG 16 (54)
T cd00472 3 TEKCSFCGYKIYPG 16 (54)
T ss_pred EEEecCcCCeecCC
Confidence 35788888875444
No 342
>TIGR03831 YgiT_finger YgiT-type zinc finger domain. This domain model describes a small domain with two copies of a putative zinc-binding motif CXXC (usually CXXCG). Most member proteins consist largely of this domain or else carry an additional C-terminal helix-turn-helix domain, resembling that of the phage protein Cro and modeled by pfam01381.
Probab=21.35 E-value=50 Score=18.16 Aligned_cols=18 Identities=33% Similarity=0.757 Sum_probs=11.6
Q ss_pred hhcccccccCCCCccccc
Q psy838 31 ITQHAKYTCSFCGKEYKK 48 (152)
Q Consensus 31 ~~~~~~~~c~~c~~~~~~ 48 (152)
+..-.-+.|+.||..+.+
T Consensus 27 i~~vp~~~C~~CGE~~~~ 44 (46)
T TIGR03831 27 IENVPALVCPQCGEEYLD 44 (46)
T ss_pred EeCCCccccccCCCEeeC
Confidence 334445679999976544
No 343
>PF06054 CoiA: Competence protein CoiA-like family; InterPro: IPR010330 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible. DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. Many of the members of this family are described as transcription factors. CoiA falls within a competence-specific operon in Streptococcus. CoiA is an uncharacterised protein.
Probab=21.07 E-value=45 Score=27.93 Aligned_cols=12 Identities=42% Similarity=0.982 Sum_probs=11.0
Q ss_pred ccccccCCCCcc
Q psy838 34 HAKYTCSFCGKE 45 (152)
Q Consensus 34 ~~~~~c~~c~~~ 45 (152)
...|.||.||.+
T Consensus 28 ~~~~~CP~C~~~ 39 (375)
T PF06054_consen 28 KGKYFCPGCGEP 39 (375)
T ss_pred CCcEECCCCCCe
Confidence 779999999988
No 344
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=21.03 E-value=44 Score=19.12 Aligned_cols=11 Identities=36% Similarity=1.053 Sum_probs=9.2
Q ss_pred cccccCCCCcc
Q psy838 35 AKYTCSFCGKE 45 (152)
Q Consensus 35 ~~~~c~~c~~~ 45 (152)
..+.||.||..
T Consensus 19 ~~~vC~~Cg~~ 29 (52)
T smart00661 19 RRFVCRKCGYE 29 (52)
T ss_pred CEEECCcCCCe
Confidence 47899999976
No 345
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.99 E-value=72 Score=23.60 Aligned_cols=29 Identities=31% Similarity=0.350 Sum_probs=19.1
Q ss_pred cccccccCCCCcccccCCCCCHHHHHHHHHHHH
Q psy838 33 QHAKYTCSFCGKEYKKSTDLGAETKKWVWELEE 65 (152)
Q Consensus 33 ~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~ 65 (152)
.+++..|..||+++.| +..-++..-+|.+
T Consensus 65 ~~~PsYC~~CGkpyPW----t~~~L~aa~el~e 93 (158)
T PF10083_consen 65 YEAPSYCHNCGKPYPW----TENALEAANELIE 93 (158)
T ss_pred CCCChhHHhCCCCCch----HHHHHHHHHHHHH
Confidence 3478889999998755 4445555555544
No 346
>PF05265 DUF723: Protein of unknown function (DUF723); InterPro: IPR007929 This family contains several uncharacterised proteins from Neisseria meningitidis. These proteins may have a role in DNA binding.
Probab=20.74 E-value=46 Score=20.60 Aligned_cols=9 Identities=44% Similarity=1.003 Sum_probs=7.0
Q ss_pred cccccCCCC
Q psy838 35 AKYTCSFCG 43 (152)
Q Consensus 35 ~~~~c~~c~ 43 (152)
+++=||.||
T Consensus 52 sk~GCP~Cg 60 (60)
T PF05265_consen 52 SKHGCPECG 60 (60)
T ss_pred hccCCCCCC
Confidence 456799997
No 347
>TIGR02491 NrdG anaerobic ribonucleoside-triphosphate reductase activating protein. This enzyme is a member of the radical-SAM family (pfam04055) and utilizes S-adenosyl methionine, an iron-sulfur cluster and a reductant (dihydroflavodoxin ) to produce a glycine-centered radical in the class III (anaerobic) ribonucleotide triphosphate reductase (NrdD, TIGR02487). The two components form an alpha-2/beta-2 heterodimer.
Probab=20.69 E-value=1.3e+02 Score=21.49 Aligned_cols=33 Identities=12% Similarity=0.104 Sum_probs=22.6
Q ss_pred ccccccCCCCcccc----cCCCCCHHHHHHHHHHHHH
Q psy838 34 HAKYTCSFCGKEYK----KSTDLGAETKKWVWELEEK 66 (152)
Q Consensus 34 ~~~~~c~~c~~~~~----~~~~l~~~d~~~i~~l~~~ 66 (152)
-=++.|+.|-.+.. ....++.++.+.+++....
T Consensus 24 gCnl~C~~C~n~~~~~~~~g~~~~~~~~~~i~~~l~~ 60 (154)
T TIGR02491 24 GCKHHCEGCFNKETWNFNGGKEFTEALEKEIIRDLND 60 (154)
T ss_pred CCCCCCcCCCcccccCCCCCCcCCHHHHHHHHHHHHh
Confidence 34678999987632 3467887778887765543
No 348
>PHA00733 hypothetical protein
Probab=20.65 E-value=58 Score=22.99 Aligned_cols=22 Identities=23% Similarity=0.538 Sum_probs=15.6
Q ss_pred hhcccccccCCCCcccccCCCC
Q psy838 31 ITQHAKYTCSFCGKEYKKSTDL 52 (152)
Q Consensus 31 ~~~~~~~~c~~c~~~~~~~~~l 52 (152)
.....+|.|+.||+.+.+...+
T Consensus 68 ~~~~kPy~C~~Cgk~Fss~s~L 89 (128)
T PHA00733 68 SKAVSPYVCPLCLMPFSSSVSL 89 (128)
T ss_pred cCCCCCccCCCCCCcCCCHHHH
Confidence 3345689999999986655444
No 349
>COG3058 FdhE Uncharacterized protein involved in formate dehydrogenase formation [Posttranslational modification, protein turnover, chaperones]
Probab=20.65 E-value=48 Score=26.97 Aligned_cols=18 Identities=17% Similarity=0.429 Sum_probs=12.2
Q ss_pred hhhhhccccc----ccCCCCcc
Q psy838 28 KMEITQHAKY----TCSFCGKE 45 (152)
Q Consensus 28 ~~~~~~~~~~----~c~~c~~~ 45 (152)
.|+.+.+.++ .||.||..
T Consensus 173 ~i~~~~~~e~e~~~~CPvCGS~ 194 (308)
T COG3058 173 GIPGKARVENESRQYCPVCGSM 194 (308)
T ss_pred cCCccccccccccccCCCcCCC
Confidence 3444444444 99999988
No 350
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=20.60 E-value=46 Score=19.62 Aligned_cols=9 Identities=33% Similarity=0.866 Sum_probs=7.3
Q ss_pred ccccCCCCc
Q psy838 36 KYTCSFCGK 44 (152)
Q Consensus 36 ~~~c~~c~~ 44 (152)
.-.||.||.
T Consensus 20 ~~fCP~Cg~ 28 (50)
T PRK00432 20 NKFCPRCGS 28 (50)
T ss_pred cCcCcCCCc
Confidence 448999997
No 351
>PF08273 Prim_Zn_Ribbon: Zinc-binding domain of primase-helicase; InterPro: IPR013237 This entry is represented by bacteriophage T7 Gp4. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a zinc binding domain found in the N-terminal region of the bacteriophage T7 Gp4 and P4 alpha protein. P4 is a multifunctional protein with origin recognition, helicase and primase activities [, , ].; GO: 0003896 DNA primase activity, 0004386 helicase activity, 0008270 zinc ion binding; PDB: 1NUI_B.
Probab=20.55 E-value=44 Score=18.86 Aligned_cols=9 Identities=33% Similarity=1.006 Sum_probs=4.4
Q ss_pred cccCCCCcc
Q psy838 37 YTCSFCGKE 45 (152)
Q Consensus 37 ~~c~~c~~~ 45 (152)
-.||.||-.
T Consensus 4 ~pCP~CGG~ 12 (40)
T PF08273_consen 4 GPCPICGGK 12 (40)
T ss_dssp E--TTTT-T
T ss_pred CCCCCCcCc
Confidence 358888865
No 352
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=20.42 E-value=45 Score=20.41 Aligned_cols=15 Identities=33% Similarity=0.638 Sum_probs=11.2
Q ss_pred hhcccccccCCCCcc
Q psy838 31 ITQHAKYTCSFCGKE 45 (152)
Q Consensus 31 ~~~~~~~~c~~c~~~ 45 (152)
......+.||.||..
T Consensus 41 ~~~~r~~~C~~Cg~~ 55 (69)
T PF07282_consen 41 RRSGRVFTCPNCGFE 55 (69)
T ss_pred ccccceEEcCCCCCE
Confidence 345668999999854
No 353
>cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=20.33 E-value=1.6e+02 Score=19.13 Aligned_cols=27 Identities=11% Similarity=0.100 Sum_probs=16.9
Q ss_pred hhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838 85 HSEMFDDRACYLVAKNGSSSTPVAFSHF 112 (152)
Q Consensus 85 ~~el~~~~~~~~v~~~~~~~~ivGf~~~ 112 (152)
...|.+....++.+.++ +++++|++..
T Consensus 88 ~~~m~~~~~~~l~Vvd~-~~~~~Givt~ 114 (120)
T cd04641 88 FDLIVKARVHRLVVVDE-NKRVEGIISL 114 (120)
T ss_pred HHHHHhcCccEEEEECC-CCCEEEEEEH
Confidence 44455555555555544 4789999874
No 354
>KOG3029|consensus
Probab=20.21 E-value=21 Score=29.18 Aligned_cols=16 Identities=31% Similarity=0.603 Sum_probs=11.2
Q ss_pred hhhcccccccCCCCcc
Q psy838 30 EITQHAKYTCSFCGKE 45 (152)
Q Consensus 30 ~~~~~~~~~c~~c~~~ 45 (152)
++...+--+||||-|.
T Consensus 90 ~l~LyQyetCPFCcKV 105 (370)
T KOG3029|consen 90 DLVLYQYETCPFCCKV 105 (370)
T ss_pred eEEEEeeccCchHHHH
Confidence 4444555689999987
No 355
>PF02150 RNA_POL_M_15KD: RNA polymerases M/15 Kd subunit; InterPro: IPR001529 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. In archaebacteria, there is generally a single form of RNA polymerase which also consist of an oligomeric assemblage of 10 to 13 polypeptides. It has recently been shown [], [] that small subunits of about 15 kDa, found in polymerase types I and II, are highly conserved. These proteins contain a probable zinc finger in their N-terminal region and a C-terminal zinc ribbon domain (see IPR001222 from INTERPRO).; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3H0G_I 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I ....
Probab=20.04 E-value=46 Score=18.01 Aligned_cols=8 Identities=38% Similarity=0.912 Sum_probs=6.2
Q ss_pred ccCCCCcc
Q psy838 38 TCSFCGKE 45 (152)
Q Consensus 38 ~c~~c~~~ 45 (152)
.||.||..
T Consensus 3 FCp~C~nl 10 (35)
T PF02150_consen 3 FCPECGNL 10 (35)
T ss_dssp BETTTTSB
T ss_pred eCCCCCcc
Confidence 68888876
No 356
>PHA01733 hypothetical protein
Probab=20.03 E-value=3.7e+02 Score=19.82 Aligned_cols=59 Identities=12% Similarity=0.076 Sum_probs=33.4
Q ss_pred CCCCHHHHHHHHH-HHHHhHHHHHHhCCCCCChhhhhhhhcCCCce-EEEEEeCCCCCeEEEEEEEe
Q psy838 50 TDLGAETKKWVWE-LEEKNMKHSYEVCDIGWDPQGKHSEMFDDRAC-YLVAKNGSSSTPVAFSHFRF 114 (152)
Q Consensus 50 ~~l~~~d~~~i~~-l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~-~~v~~~~~~~~ivGf~~~~~ 114 (152)
.+.|.+|+..+++ +..+...++. ..+-++..+..-+...... +.++ -+ |++++......
T Consensus 6 rpaT~~d~~~l~~n~r~~Dr~E~e---alg~~p~~l~~~~~~s~~~v~~~~-~n--G~l~aI~Gv~~ 66 (153)
T PHA01733 6 RPATQADATEVAQNLRQEDREEIE---GLGHSPLALHLSLDVSENVVAFVA-PD--GSLAGVAGLVE 66 (153)
T ss_pred ccccHHHHHHHHccCCHHHHHHHH---HhCCCcccchhhhhccccceEEEe-cC--CcEEEEecccc
Confidence 4678888887777 6655554543 2222554333333333333 3333 33 89999888765
No 357
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=20.02 E-value=46 Score=19.17 Aligned_cols=10 Identities=40% Similarity=1.285 Sum_probs=7.4
Q ss_pred ccccCCCCcc
Q psy838 36 KYTCSFCGKE 45 (152)
Q Consensus 36 ~~~c~~c~~~ 45 (152)
.|.|+.||..
T Consensus 5 ey~C~~Cg~~ 14 (52)
T TIGR02605 5 EYRCTACGHR 14 (52)
T ss_pred EEEeCCCCCE
Confidence 4778888875
Done!