Query psy5202
Match_columns 343
No_of_seqs 512 out of 2379
Neff 10.7
Searched_HMMs 46136
Date Fri Aug 16 19:57:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5202.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5202hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 100.0 4.9E-34 1.1E-38 226.2 11.3 137 87-265 126-265 (279)
2 KOG2462|consensus 99.9 1.5E-26 3.1E-31 183.9 6.0 126 116-280 127-252 (279)
3 KOG1074|consensus 99.9 9.6E-23 2.1E-27 183.6 8.2 90 184-273 604-698 (958)
4 KOG1074|consensus 99.8 3.6E-22 7.7E-27 179.9 2.7 177 89-272 603-936 (958)
5 KOG3608|consensus 99.8 3.6E-21 7.8E-26 157.2 5.6 166 91-271 177-349 (467)
6 KOG3608|consensus 99.8 2.7E-20 5.9E-25 152.1 7.7 174 85-270 201-379 (467)
7 KOG3576|consensus 99.8 8.7E-20 1.9E-24 137.9 8.2 87 186-272 146-241 (267)
8 KOG3576|consensus 99.8 1E-20 2.2E-25 142.9 3.0 110 87-237 113-236 (267)
9 KOG3623|consensus 99.7 1.1E-17 2.4E-22 148.9 4.4 122 89-235 208-331 (1007)
10 KOG3623|consensus 99.7 1.2E-17 2.5E-22 148.8 2.2 107 120-264 211-330 (1007)
11 PLN03086 PRLI-interacting fact 99.5 3.2E-14 6.9E-19 127.8 7.2 129 92-268 408-565 (567)
12 KOG3993|consensus 99.5 7.8E-14 1.7E-18 117.4 5.8 174 91-271 267-486 (500)
13 PHA00733 hypothetical protein 99.3 2.6E-12 5.6E-17 94.4 4.6 82 183-268 38-124 (128)
14 PLN03086 PRLI-interacting fact 99.2 2.4E-11 5.2E-16 109.5 8.5 104 89-237 451-564 (567)
15 PHA00733 hypothetical protein 99.2 3.1E-11 6.8E-16 88.7 5.3 52 184-237 72-123 (128)
16 PHA02768 hypothetical protein; 99.0 1.2E-10 2.6E-15 70.0 2.2 40 186-227 6-45 (55)
17 PHA02768 hypothetical protein; 99.0 1.7E-10 3.6E-15 69.4 1.9 44 91-136 5-48 (55)
18 KOG3993|consensus 98.9 4.3E-10 9.4E-15 95.2 -0.1 89 184-272 266-385 (500)
19 PF13465 zf-H2C2_2: Zinc-finge 98.8 2.8E-09 6.2E-14 54.8 2.7 25 200-224 1-25 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.7 1.6E-08 3.5E-13 51.9 2.2 26 106-131 1-26 (26)
21 PF05605 zf-Di19: Drought indu 98.6 7.1E-08 1.5E-12 59.6 4.4 51 214-267 3-53 (54)
22 PHA00732 hypothetical protein 98.6 3.2E-08 6.9E-13 65.8 2.6 45 186-236 2-47 (79)
23 PHA00616 hypothetical protein 98.6 2.9E-08 6.3E-13 56.7 1.5 32 186-217 2-33 (44)
24 PHA00616 hypothetical protein 98.5 3.1E-08 6.7E-13 56.6 1.5 34 91-124 1-34 (44)
25 PHA00732 hypothetical protein 98.4 2E-07 4.4E-12 62.0 2.7 47 213-266 1-47 (79)
26 PF05605 zf-Di19: Drought indu 98.3 1.1E-06 2.4E-11 54.3 4.8 23 120-143 3-25 (54)
27 PF00096 zf-C2H2: Zinc finger, 98.2 8.7E-07 1.9E-11 44.2 1.3 22 214-235 1-22 (23)
28 PF00096 zf-C2H2: Zinc finger, 98.2 1.8E-06 4E-11 43.0 2.4 22 244-265 1-22 (23)
29 COG5189 SFP1 Putative transcri 98.0 1.2E-06 2.6E-11 71.9 0.5 55 211-265 347-420 (423)
30 PF13894 zf-C2H2_4: C2H2-type 98.0 2.6E-06 5.7E-11 42.9 1.6 23 244-266 1-23 (24)
31 KOG2231|consensus 98.0 1.5E-05 3.3E-10 73.5 7.1 71 185-265 182-261 (669)
32 PF12756 zf-C2H2_2: C2H2 type 98.0 3.9E-06 8.4E-11 59.7 1.9 74 187-267 1-74 (100)
33 PF13894 zf-C2H2_4: C2H2-type 97.9 9.7E-06 2.1E-10 40.8 2.6 24 214-237 1-24 (24)
34 PF12756 zf-C2H2_2: C2H2 type 97.9 8.2E-06 1.8E-10 58.0 2.9 24 121-144 1-24 (100)
35 PF13912 zf-C2H2_6: C2H2-type 97.8 8.2E-06 1.8E-10 42.4 1.1 24 244-267 2-25 (27)
36 COG5189 SFP1 Putative transcri 97.8 7.2E-06 1.6E-10 67.5 0.6 53 182-234 346-419 (423)
37 PF13912 zf-C2H2_6: C2H2-type 97.7 2.3E-05 4.9E-10 40.7 1.7 25 91-115 1-25 (27)
38 KOG2231|consensus 97.7 0.0001 2.2E-09 68.3 6.9 103 121-271 117-240 (669)
39 COG5236 Uncharacterized conser 97.5 0.00012 2.6E-09 61.2 3.9 135 91-271 151-309 (493)
40 KOG1146|consensus 97.4 0.00028 6.1E-09 69.3 5.5 160 93-266 438-641 (1406)
41 PF13909 zf-H2C2_5: C2H2-type 97.3 0.00014 3E-09 36.5 1.4 22 214-236 1-22 (24)
42 smart00355 ZnF_C2H2 zinc finge 97.3 0.00025 5.4E-09 36.1 2.3 23 244-266 1-23 (26)
43 PF09237 GAGA: GAGA factor; I 97.3 0.00036 7.7E-09 40.9 3.0 31 88-118 21-51 (54)
44 KOG2785|consensus 97.2 0.00091 2E-08 57.2 6.5 60 214-273 167-250 (390)
45 smart00355 ZnF_C2H2 zinc finge 97.2 0.00029 6.3E-09 35.9 2.1 24 214-237 1-24 (26)
46 COG5048 FOG: Zn-finger [Genera 97.1 0.00043 9.4E-09 63.4 4.0 145 90-237 288-442 (467)
47 COG5236 Uncharacterized conser 97.1 0.00034 7.4E-09 58.5 2.9 107 120-272 152-280 (493)
48 PF09237 GAGA: GAGA factor; I 97.1 0.00062 1.3E-08 39.9 3.1 26 212-237 23-48 (54)
49 KOG1146|consensus 97.1 7E-05 1.5E-09 73.3 -1.3 150 87-236 461-641 (1406)
50 PF13909 zf-H2C2_5: C2H2-type 97.1 0.00055 1.2E-08 34.2 2.2 23 120-143 1-23 (24)
51 PRK04860 hypothetical protein; 96.9 0.0005 1.1E-08 52.6 1.7 36 185-224 119-154 (160)
52 PRK04860 hypothetical protein; 96.9 0.00051 1.1E-08 52.6 1.7 40 212-257 118-157 (160)
53 KOG2785|consensus 96.9 0.0041 8.9E-08 53.3 7.1 146 91-236 3-243 (390)
54 PF12874 zf-met: Zinc-finger o 96.8 0.00089 1.9E-08 33.8 1.8 20 215-234 2-21 (25)
55 PF12874 zf-met: Zinc-finger o 96.8 0.00056 1.2E-08 34.6 1.1 23 244-266 1-23 (25)
56 KOG2482|consensus 96.6 0.0094 2E-07 50.2 7.4 132 118-265 143-356 (423)
57 PF12171 zf-C2H2_jaz: Zinc-fin 96.5 0.00084 1.8E-08 34.7 0.4 22 244-265 2-23 (27)
58 PF13913 zf-C2HC_2: zinc-finge 96.5 0.0023 5E-08 32.2 1.9 22 244-266 3-24 (25)
59 PF12171 zf-C2H2_jaz: Zinc-fin 96.0 0.0019 4E-08 33.3 0.2 22 214-235 2-23 (27)
60 KOG2482|consensus 95.8 0.016 3.5E-07 48.8 4.8 54 91-144 195-304 (423)
61 KOG4173|consensus 95.4 0.01 2.2E-07 46.2 2.2 82 184-268 78-171 (253)
62 PF13913 zf-C2HC_2: zinc-finge 95.4 0.014 3.1E-07 29.3 2.0 19 215-234 4-22 (25)
63 TIGR00622 ssl1 transcription f 94.9 0.072 1.6E-06 37.8 4.9 109 92-250 2-110 (112)
64 smart00451 ZnF_U1 U1-like zinc 94.7 0.027 5.9E-07 30.9 2.0 23 243-265 3-25 (35)
65 cd00350 rubredoxin_like Rubred 94.4 0.02 4.4E-07 31.0 1.0 12 186-197 2-13 (33)
66 KOG4173|consensus 94.2 0.021 4.5E-07 44.5 1.2 26 210-235 141-168 (253)
67 KOG2893|consensus 93.8 0.015 3.2E-07 46.4 -0.3 47 93-143 12-58 (341)
68 COG4049 Uncharacterized protei 93.8 0.027 5.9E-07 33.7 0.8 28 240-267 14-41 (65)
69 smart00451 ZnF_U1 U1-like zinc 93.7 0.053 1.2E-06 29.7 1.9 23 213-235 3-25 (35)
70 PF09538 FYDLN_acid: Protein o 93.4 0.048 1E-06 38.8 1.7 30 214-256 10-39 (108)
71 COG5048 FOG: Zn-finger [Genera 93.4 0.017 3.6E-07 52.9 -0.9 54 184-237 288-347 (467)
72 PF12013 DUF3505: Protein of u 93.3 0.18 4E-06 36.1 4.6 25 244-268 81-109 (109)
73 COG4049 Uncharacterized protei 93.2 0.04 8.7E-07 33.0 0.8 29 86-114 12-40 (65)
74 KOG2893|consensus 93.0 0.023 5E-07 45.3 -0.5 41 188-232 13-53 (341)
75 cd00729 rubredoxin_SM Rubredox 92.9 0.079 1.7E-06 28.9 1.7 11 214-224 3-13 (34)
76 PF12013 DUF3505: Protein of u 92.7 0.25 5.4E-06 35.5 4.6 54 90-144 10-109 (109)
77 TIGR00622 ssl1 transcription f 90.8 0.51 1.1E-05 33.6 4.3 77 184-266 14-104 (112)
78 PF09538 FYDLN_acid: Protein o 90.8 0.18 3.8E-06 35.9 2.0 32 184-226 8-39 (108)
79 TIGR02300 FYDLN_acid conserved 90.4 0.18 3.8E-06 36.4 1.7 33 214-259 10-42 (129)
80 COG1592 Rubrerythrin [Energy p 90.4 0.18 4E-06 38.7 1.9 23 213-250 134-156 (166)
81 TIGR02098 MJ0042_CXXC MJ0042 f 90.3 0.13 2.9E-06 28.8 0.8 14 214-227 3-16 (38)
82 PF09986 DUF2225: Uncharacteri 90.2 0.067 1.4E-06 43.6 -0.7 45 212-256 4-61 (214)
83 KOG2807|consensus 90.0 0.65 1.4E-05 39.3 4.9 31 212-250 344-374 (378)
84 PF02892 zf-BED: BED zinc fing 90.0 0.18 4E-06 29.4 1.3 25 241-265 14-42 (45)
85 PF09986 DUF2225: Uncharacteri 89.2 0.064 1.4E-06 43.7 -1.5 44 89-132 3-61 (214)
86 PF13717 zinc_ribbon_4: zinc-r 88.8 0.29 6.2E-06 27.1 1.4 14 215-228 4-17 (36)
87 PF13719 zinc_ribbon_5: zinc-r 88.7 0.28 6.2E-06 27.3 1.4 11 243-253 25-35 (37)
88 PHA00626 hypothetical protein 88.5 0.18 3.9E-06 30.4 0.5 14 242-255 22-35 (59)
89 PRK14890 putative Zn-ribbon RN 87.9 0.54 1.2E-05 29.0 2.3 10 242-251 47-56 (59)
90 KOG2186|consensus 87.5 0.39 8.5E-06 39.1 2.0 47 185-234 3-49 (276)
91 PRK00398 rpoP DNA-directed RNA 87.0 0.29 6.4E-06 28.8 0.8 11 214-224 4-14 (46)
92 COG2888 Predicted Zn-ribbon RN 87.0 0.73 1.6E-05 28.3 2.5 10 242-251 49-58 (61)
93 smart00734 ZnF_Rad18 Rad18-lik 86.8 0.74 1.6E-05 23.3 2.1 21 245-266 3-23 (26)
94 KOG2186|consensus 86.8 0.44 9.5E-06 38.8 1.9 51 92-145 4-54 (276)
95 COG5151 SSL1 RNA polymerase II 86.8 0.6 1.3E-05 39.2 2.7 55 186-250 363-417 (421)
96 PF03604 DNA_RNApol_7kD: DNA d 86.5 0.59 1.3E-05 25.0 1.7 11 214-224 1-11 (32)
97 COG1996 RPC10 DNA-directed RNA 86.3 0.4 8.8E-06 28.4 1.1 11 185-195 6-16 (49)
98 COG1198 PriA Primosomal protei 86.3 0.68 1.5E-05 44.8 3.2 40 186-252 445-484 (730)
99 TIGR00373 conserved hypothetic 86.2 0.95 2.1E-05 34.9 3.5 34 209-253 105-138 (158)
100 TIGR02300 FYDLN_acid conserved 85.8 0.55 1.2E-05 34.0 1.8 35 184-229 8-42 (129)
101 COG1997 RPL43A Ribosomal prote 85.4 0.4 8.7E-06 32.0 0.9 13 242-254 52-64 (89)
102 smart00531 TFIIE Transcription 85.2 1.1 2.4E-05 34.1 3.4 38 210-253 96-133 (147)
103 PRK00464 nrdR transcriptional 85.1 0.16 3.5E-06 38.7 -1.2 44 92-138 1-47 (154)
104 COG1592 Rubrerythrin [Energy p 84.8 0.54 1.2E-05 36.2 1.5 25 184-221 133-157 (166)
105 smart00659 RPOLCX RNA polymera 84.8 0.61 1.3E-05 27.1 1.4 27 91-128 2-28 (44)
106 PRK04023 DNA polymerase II lar 84.6 1.1 2.4E-05 44.2 3.7 13 243-255 663-675 (1121)
107 smart00614 ZnF_BED BED zinc fi 84.4 0.7 1.5E-05 27.7 1.6 21 244-264 19-44 (50)
108 PRK00464 nrdR transcriptional 84.1 0.18 3.9E-06 38.5 -1.3 15 244-258 29-43 (154)
109 PRK06266 transcription initiat 83.8 1.2 2.6E-05 35.1 3.1 35 210-255 114-148 (178)
110 TIGR02605 CxxC_CxxC_SSSS putat 83.0 0.27 5.9E-06 29.8 -0.6 29 92-127 6-34 (52)
111 PF15269 zf-C2H2_7: Zinc-finge 82.4 1 2.2E-05 25.7 1.5 22 214-235 21-42 (54)
112 smart00531 TFIIE Transcription 82.2 1.3 2.9E-05 33.7 2.7 38 88-129 96-133 (147)
113 KOG4167|consensus 81.6 3.8 8.3E-05 38.9 5.8 29 87-115 788-816 (907)
114 PRK06266 transcription initiat 81.4 0.75 1.6E-05 36.2 1.1 32 184-224 116-147 (178)
115 KOG4377|consensus 81.4 7.7 0.00017 34.3 7.2 27 119-145 271-299 (480)
116 COG5188 PRP9 Splicing factor 3 80.9 1.4 3.1E-05 37.7 2.6 23 184-206 237-259 (470)
117 smart00834 CxxC_CXXC_SSSS Puta 80.9 0.34 7.4E-06 27.5 -0.7 31 91-128 5-35 (41)
118 TIGR00373 conserved hypothetic 80.8 2.1 4.7E-05 33.0 3.5 20 117-136 107-126 (158)
119 COG4530 Uncharacterized protei 79.9 1.1 2.3E-05 31.4 1.3 27 215-254 11-37 (129)
120 PRK09678 DNA-binding transcrip 79.7 0.51 1.1E-05 30.7 -0.2 17 241-257 25-43 (72)
121 PRK09678 DNA-binding transcrip 79.3 0.41 8.9E-06 31.1 -0.8 41 186-228 2-44 (72)
122 PF09723 Zn-ribbon_8: Zinc rib 79.2 0.81 1.8E-05 26.3 0.5 11 214-224 6-16 (42)
123 TIGR00595 priA primosomal prot 79.2 1.8 3.8E-05 40.5 3.0 41 185-252 222-262 (505)
124 PF04959 ARS2: Arsenite-resist 79.1 1.3 2.7E-05 35.9 1.7 31 240-270 74-104 (214)
125 KOG2071|consensus 78.9 1.7 3.8E-05 40.2 2.8 26 184-209 417-442 (579)
126 PF05443 ROS_MUCR: ROS/MUCR tr 77.6 1.2 2.6E-05 33.0 1.1 27 89-118 70-96 (132)
127 PF06524 NOA36: NOA36 protein; 77.4 1.1 2.4E-05 36.6 1.0 32 240-271 206-237 (314)
128 PF06524 NOA36: NOA36 protein; 77.1 0.96 2.1E-05 37.0 0.6 49 184-237 181-233 (314)
129 PRK14714 DNA polymerase II lar 77.1 3 6.6E-05 42.4 4.0 42 186-255 680-721 (1337)
130 PRK14873 primosome assembly pr 77.0 2.1 4.5E-05 41.4 2.8 40 185-252 392-431 (665)
131 PF02176 zf-TRAF: TRAF-type zi 76.9 1.3 2.9E-05 27.6 1.1 21 226-248 23-43 (60)
132 PRK04023 DNA polymerase II lar 76.9 5.6 0.00012 39.6 5.6 25 88-128 623-647 (1121)
133 KOG2071|consensus 76.8 8.2 0.00018 36.0 6.4 30 86-115 413-442 (579)
134 PF10571 UPF0547: Uncharacteri 76.0 1.6 3.4E-05 22.1 1.0 10 245-254 16-25 (26)
135 PF05443 ROS_MUCR: ROS/MUCR tr 75.6 1.3 2.7E-05 32.8 0.8 26 242-270 71-96 (132)
136 PF15269 zf-C2H2_7: Zinc-finge 74.8 3.7 8.1E-05 23.5 2.3 25 89-113 18-42 (54)
137 PF13878 zf-C2H2_3: zinc-finge 73.8 3.1 6.7E-05 23.7 1.9 24 92-115 14-39 (41)
138 PF08274 PhnA_Zn_Ribbon: PhnA 72.9 1.8 3.9E-05 22.8 0.7 9 188-196 5-13 (30)
139 PF14353 CpXC: CpXC protein 72.7 3 6.4E-05 30.9 2.2 23 243-265 38-60 (128)
140 KOG4377|consensus 72.4 13 0.00029 32.9 6.2 52 88-142 268-335 (480)
141 TIGR01384 TFS_arch transcripti 71.5 0.21 4.5E-06 35.5 -4.1 27 93-131 2-28 (104)
142 KOG2593|consensus 70.7 4.6 0.0001 36.0 3.2 38 210-252 125-162 (436)
143 PF04959 ARS2: Arsenite-resist 70.0 3.5 7.6E-05 33.4 2.2 26 212-237 76-101 (214)
144 PRK05580 primosome assembly pr 69.7 4 8.6E-05 39.8 2.9 41 185-252 390-430 (679)
145 KOG1280|consensus 68.1 4.5 9.7E-05 34.8 2.4 23 213-235 79-101 (381)
146 COG3357 Predicted transcriptio 67.8 2.7 5.8E-05 28.4 0.9 14 212-225 57-70 (97)
147 PF12760 Zn_Tnp_IS1595: Transp 66.8 5.3 0.00011 23.4 2.0 10 242-251 36-45 (46)
148 PRK03824 hypA hydrogenase nick 66.7 2.4 5.2E-05 31.7 0.6 8 244-251 108-115 (135)
149 KOG2593|consensus 66.4 4.9 0.00011 35.8 2.5 22 116-137 125-146 (436)
150 TIGR01206 lysW lysine biosynth 66.3 2.3 5.1E-05 25.9 0.4 10 214-223 3-12 (54)
151 PF14446 Prok-RING_1: Prokaryo 65.6 4.6 0.0001 24.5 1.5 9 187-195 7-15 (54)
152 KOG3408|consensus 65.3 3.5 7.6E-05 29.6 1.1 27 240-266 54-80 (129)
153 PRK12380 hydrogenase nickel in 65.2 4 8.7E-05 29.4 1.5 11 214-224 71-81 (113)
154 PF05191 ADK_lid: Adenylate ki 64.9 1.5 3.3E-05 24.2 -0.6 10 187-196 3-12 (36)
155 PF07975 C1_4: TFIIH C1-like d 64.9 1.7 3.7E-05 26.1 -0.4 23 213-235 21-43 (51)
156 PF07754 DUF1610: Domain of un 64.8 3.3 7.3E-05 20.4 0.7 8 243-250 16-23 (24)
157 KOG2807|consensus 64.0 9.3 0.0002 32.7 3.5 100 90-220 275-374 (378)
158 KOG4167|consensus 63.2 1.2 2.6E-05 42.1 -1.9 29 242-270 791-819 (907)
159 PF08209 Sgf11: Sgf11 (transcr 62.7 4.6 0.0001 21.8 1.0 25 243-268 4-28 (33)
160 PF09845 DUF2072: Zn-ribbon co 62.0 4.5 9.8E-05 29.7 1.2 15 213-227 1-15 (131)
161 COG4957 Predicted transcriptio 61.8 4 8.8E-05 29.8 0.9 25 92-119 77-101 (148)
162 KOG4124|consensus 61.6 2.8 6.2E-05 36.0 0.2 55 87-141 345-420 (442)
163 COG1571 Predicted DNA-binding 61.1 5 0.00011 36.0 1.6 30 215-257 352-381 (421)
164 PF05290 Baculo_IE-1: Baculovi 61.0 8 0.00017 28.4 2.3 55 184-258 79-136 (140)
165 TIGR00100 hypA hydrogenase nic 59.9 5.8 0.00013 28.7 1.5 12 214-225 71-82 (115)
166 PTZ00255 60S ribosomal protein 57.9 4.2 9.1E-05 27.7 0.5 13 242-254 53-65 (90)
167 PF01155 HypA: Hydrogenase exp 57.7 3.4 7.4E-05 29.8 0.0 12 214-225 71-82 (113)
168 COG1655 Uncharacterized protei 57.5 1.8 3.9E-05 34.8 -1.5 39 184-222 18-71 (267)
169 PRK00564 hypA hydrogenase nick 57.4 6.6 0.00014 28.5 1.5 11 214-224 72-82 (117)
170 PF01780 Ribosomal_L37ae: Ribo 57.1 3.1 6.7E-05 28.3 -0.3 11 214-224 54-64 (90)
171 PRK14873 primosome assembly pr 56.9 3.6 7.7E-05 39.8 0.0 33 92-128 384-419 (665)
172 PF09963 DUF2197: Uncharacteri 56.2 6.9 0.00015 24.0 1.1 36 92-128 3-40 (56)
173 TIGR00280 L37a ribosomal prote 55.4 4.1 8.9E-05 27.8 0.1 13 242-254 52-64 (91)
174 PF13453 zf-TFIIB: Transcripti 55.4 6.5 0.00014 22.3 0.9 16 214-229 20-35 (41)
175 PF01363 FYVE: FYVE zinc finge 55.3 5.5 0.00012 25.6 0.7 9 187-195 11-19 (69)
176 COG3091 SprT Zn-dependent meta 55.1 5.8 0.00013 29.9 0.8 11 185-196 117-127 (156)
177 COG3091 SprT Zn-dependent meta 53.8 6.4 0.00014 29.7 0.9 36 88-128 114-149 (156)
178 PF13451 zf-trcl: Probable zin 53.8 8 0.00017 23.0 1.1 10 186-195 5-14 (49)
179 PF07282 OrfB_Zn_ribbon: Putat 53.7 9.5 0.00021 24.5 1.6 27 187-223 30-56 (69)
180 PRK00432 30S ribosomal protein 53.3 7.3 0.00016 23.4 0.9 12 212-223 36-47 (50)
181 COG5151 SSL1 RNA polymerase II 52.9 6.8 0.00015 33.2 1.0 48 215-266 364-411 (421)
182 PF15135 UPF0515: Uncharacteri 52.1 6.5 0.00014 32.2 0.7 14 212-225 154-167 (278)
183 PRK03681 hypA hydrogenase nick 52.1 9.1 0.0002 27.7 1.4 10 214-223 71-80 (114)
184 COG4957 Predicted transcriptio 51.6 7.8 0.00017 28.4 1.0 26 185-213 76-101 (148)
185 TIGR00595 priA primosomal prot 51.2 5.8 0.00013 37.1 0.4 35 93-128 215-249 (505)
186 COG4896 Uncharacterized protei 51.2 12 0.00025 23.2 1.5 40 92-131 3-43 (68)
187 KOG3408|consensus 50.9 9.9 0.00022 27.3 1.4 26 210-235 54-79 (129)
188 COG1675 TFA1 Transcription ini 50.5 24 0.00051 27.7 3.6 34 208-252 108-141 (176)
189 cd00065 FYVE FYVE domain; Zinc 50.0 12 0.00026 22.8 1.6 9 245-253 20-28 (57)
190 smart00440 ZnF_C2C2 C2C2 Zinc 49.9 1.7 3.7E-05 24.6 -2.0 10 244-253 29-38 (40)
191 KOG1842|consensus 49.4 12 0.00026 33.6 2.0 22 92-113 16-37 (505)
192 PF13821 DUF4187: Domain of un 48.9 20 0.00043 22.0 2.3 18 244-261 28-45 (55)
193 KOG1280|consensus 48.7 19 0.00042 31.1 3.0 28 118-145 78-105 (381)
194 PF06220 zf-U1: U1 zinc finger 48.4 16 0.00034 20.5 1.7 22 243-264 3-26 (38)
195 KOG2907|consensus 48.1 9.2 0.0002 27.1 0.9 11 214-224 103-113 (116)
196 COG4888 Uncharacterized Zn rib 48.0 5.6 0.00012 27.5 -0.1 13 242-254 45-57 (104)
197 KOG4124|consensus 47.9 2 4.4E-05 36.8 -2.7 53 87-140 174-233 (442)
198 COG1571 Predicted DNA-binding 47.7 10 0.00022 34.0 1.4 34 185-229 350-383 (421)
199 PRK03976 rpl37ae 50S ribosomal 47.6 6.1 0.00013 27.0 -0.0 13 242-254 53-65 (90)
200 smart00661 RPOL9 RNA polymeras 47.6 12 0.00026 22.3 1.3 10 243-252 20-29 (52)
201 PF08271 TF_Zn_Ribbon: TFIIB z 47.3 9.5 0.00021 21.9 0.8 8 244-251 20-27 (43)
202 PRK12496 hypothetical protein; 47.1 14 0.0003 28.8 1.8 11 214-224 128-138 (164)
203 COG1773 Rubredoxin [Energy pro 46.9 9.4 0.0002 23.3 0.7 40 212-251 2-44 (55)
204 COG1998 RPS31 Ribosomal protei 46.9 8.9 0.00019 22.7 0.6 10 213-222 37-46 (51)
205 KOG2272|consensus 46.1 23 0.00049 29.2 2.9 44 87-132 133-176 (332)
206 PF04780 DUF629: Protein of un 45.4 15 0.00032 33.7 2.0 28 243-270 57-84 (466)
207 PF13824 zf-Mss51: Zinc-finger 45.2 16 0.00035 22.3 1.5 12 242-253 13-24 (55)
208 smart00154 ZnF_AN1 AN1-like Zi 45.1 11 0.00024 21.2 0.8 14 243-256 12-25 (39)
209 PF14311 DUF4379: Domain of un 45.0 14 0.0003 22.5 1.3 14 185-198 28-41 (55)
210 COG5216 Uncharacterized conser 44.3 12 0.00026 22.9 0.9 31 185-221 22-52 (67)
211 KOG2636|consensus 44.3 17 0.00037 32.7 2.1 29 206-234 394-423 (497)
212 PF03833 PolC_DP2: DNA polymer 43.9 7.6 0.00016 38.0 0.0 13 245-257 694-706 (900)
213 PLN03238 probable histone acet 43.7 19 0.00041 30.6 2.2 27 241-267 46-72 (290)
214 COG1656 Uncharacterized conser 43.3 23 0.00049 27.3 2.4 14 244-257 131-144 (165)
215 PRK04351 hypothetical protein; 42.7 11 0.00024 28.7 0.7 12 242-253 131-142 (149)
216 COG1779 C4-type Zn-finger prot 42.3 3.1 6.7E-05 32.8 -2.3 8 186-193 15-22 (201)
217 KOG3507|consensus 42.3 13 0.00028 22.9 0.8 12 212-223 36-47 (62)
218 smart00064 FYVE Protein presen 41.9 18 0.00039 23.1 1.5 10 187-196 12-21 (68)
219 COG1594 RPB9 DNA-directed RNA 41.7 6.1 0.00013 28.5 -0.8 40 185-224 72-111 (113)
220 COG2331 Uncharacterized protei 40.7 5.7 0.00012 25.8 -0.9 10 214-223 13-22 (82)
221 TIGR01562 FdhE formate dehydro 40.7 19 0.00041 31.1 1.9 36 185-222 184-219 (305)
222 KOG2636|consensus 40.6 20 0.00043 32.3 2.0 29 234-264 394-423 (497)
223 PF10013 DUF2256: Uncharacteri 40.2 21 0.00045 20.4 1.3 16 245-260 10-25 (42)
224 COG1198 PriA Primosomal protei 40.1 7.1 0.00015 38.0 -0.8 23 106-128 462-484 (730)
225 PRK03564 formate dehydrogenase 40.1 19 0.0004 31.2 1.7 36 184-222 186-221 (309)
226 smart00504 Ubox Modified RING 39.8 57 0.0012 20.1 3.6 45 186-254 2-46 (63)
227 COG0068 HypF Hydrogenase matur 39.7 4.6 9.9E-05 38.6 -2.1 57 187-252 125-182 (750)
228 PF10263 SprT-like: SprT-like 39.4 8.7 0.00019 29.4 -0.3 11 185-195 123-133 (157)
229 PF14205 Cys_rich_KTR: Cystein 39.0 17 0.00037 22.0 1.0 11 241-251 26-36 (55)
230 TIGR00686 phnA alkylphosphonat 38.3 17 0.00038 25.6 1.1 29 215-256 4-32 (109)
231 PF12907 zf-met2: Zinc-binding 38.2 19 0.0004 20.5 1.0 21 214-234 2-25 (40)
232 PRK14892 putative transcriptio 38.1 8.5 0.00018 26.9 -0.5 12 244-255 43-54 (99)
233 COG1326 Uncharacterized archae 37.2 31 0.00067 27.3 2.4 14 241-254 28-41 (201)
234 KOG2923|consensus 36.8 25 0.00055 22.0 1.5 13 209-221 40-52 (67)
235 PRK10220 hypothetical protein; 36.8 24 0.00052 24.9 1.6 30 215-257 5-34 (111)
236 smart00731 SprT SprT homologue 36.5 12 0.00026 28.4 0.1 11 213-223 112-122 (146)
237 PRK14559 putative protein seri 36.3 38 0.00082 32.8 3.3 12 245-256 43-54 (645)
238 PF00301 Rubredoxin: Rubredoxi 35.8 18 0.00038 21.4 0.7 39 214-252 2-43 (47)
239 PRK00762 hypA hydrogenase nick 35.7 14 0.00031 27.1 0.4 10 185-195 70-79 (124)
240 KOG1994|consensus 35.4 18 0.00039 29.3 0.8 22 242-263 238-259 (268)
241 PF10276 zf-CHCC: Zinc-finger 35.3 18 0.0004 20.5 0.7 11 213-223 29-39 (40)
242 CHL00174 accD acetyl-CoA carbo 35.2 19 0.00041 30.9 1.1 33 185-226 38-70 (296)
243 COG5188 PRP9 Splicing factor 3 34.9 40 0.00087 29.3 2.9 29 113-141 368-397 (470)
244 PF04810 zf-Sec23_Sec24: Sec23 34.9 7.3 0.00016 22.0 -1.0 10 212-221 23-32 (40)
245 PF07295 DUF1451: Protein of u 34.7 14 0.00031 28.0 0.2 28 213-252 112-139 (146)
246 COG5112 UFD2 U1-like Zn-finger 34.1 14 0.0003 25.8 0.0 26 240-265 52-77 (126)
247 PF07649 C1_3: C1-like domain; 33.8 23 0.00051 18.3 0.9 11 242-252 14-24 (30)
248 PF04780 DUF629: Protein of un 33.7 34 0.00073 31.5 2.4 23 185-207 57-79 (466)
249 COG4306 Uncharacterized protei 33.5 22 0.00047 25.7 1.0 58 199-257 24-82 (160)
250 COG0068 HypF Hydrogenase matur 33.4 5.3 0.00011 38.2 -2.7 45 93-138 125-170 (750)
251 COG4640 Predicted membrane pro 33.4 29 0.00063 30.7 1.9 18 244-261 16-33 (465)
252 PTZ00064 histone acetyltransfe 33.3 26 0.00057 32.2 1.6 29 240-268 277-305 (552)
253 KOG0978|consensus 33.1 15 0.00032 35.4 0.1 20 243-262 678-697 (698)
254 cd00924 Cyt_c_Oxidase_Vb Cytoc 33.1 23 0.0005 24.7 1.0 18 207-225 74-91 (97)
255 TIGR00515 accD acetyl-CoA carb 33.0 22 0.00047 30.5 1.1 33 185-226 26-58 (285)
256 PF05495 zf-CHY: CHY zinc fing 32.9 6.9 0.00015 25.5 -1.5 59 185-253 10-71 (71)
257 cd00730 rubredoxin Rubredoxin; 32.8 19 0.0004 21.6 0.4 37 214-251 2-42 (50)
258 PF01428 zf-AN1: AN1-like Zinc 32.4 18 0.00039 20.8 0.3 14 212-225 12-25 (43)
259 COG1327 Predicted transcriptio 32.2 10 0.00022 28.6 -0.9 14 120-133 29-42 (156)
260 PLN00104 MYST -like histone ac 32.0 25 0.00055 32.0 1.4 28 240-267 195-222 (450)
261 KOG0717|consensus 31.8 26 0.00055 31.8 1.3 22 244-265 293-314 (508)
262 KOG0978|consensus 31.8 17 0.00037 35.0 0.3 49 90-138 642-697 (698)
263 PF04423 Rad50_zn_hook: Rad50 31.6 14 0.0003 22.5 -0.2 13 245-257 22-34 (54)
264 PF11672 DUF3268: Protein of u 31.2 18 0.00039 25.4 0.3 7 244-250 32-38 (102)
265 PRK05654 acetyl-CoA carboxylas 31.0 23 0.00049 30.5 0.9 34 184-226 26-59 (292)
266 PHA02998 RNA polymerase subuni 30.6 8.2 0.00018 29.9 -1.6 12 214-225 172-183 (195)
267 PLN02294 cytochrome c oxidase 30.5 26 0.00057 27.1 1.0 15 211-225 139-153 (174)
268 PF04216 FdhE: Protein involve 30.5 11 0.00025 32.3 -1.0 9 186-194 173-181 (290)
269 PRK00420 hypothetical protein; 30.4 25 0.00054 25.2 0.9 14 184-197 39-52 (112)
270 KOG2747|consensus 30.2 21 0.00045 31.8 0.5 29 241-269 156-184 (396)
271 TIGR00310 ZPR1_znf ZPR1 zinc f 29.9 6 0.00013 31.6 -2.6 11 244-254 31-41 (192)
272 KOG3362|consensus 29.9 20 0.00043 26.7 0.3 24 242-265 128-151 (156)
273 KOG0717|consensus 29.8 32 0.0007 31.2 1.6 22 214-235 293-314 (508)
274 PF10537 WAC_Acf1_DNA_bd: ATP- 29.8 80 0.0017 22.3 3.3 36 213-251 3-38 (102)
275 PF11931 DUF3449: Domain of un 29.7 18 0.00038 28.9 0.0 27 208-234 96-123 (196)
276 PF08790 zf-LYAR: LYAR-type C2 29.7 11 0.00024 19.4 -0.7 19 92-111 1-19 (28)
277 PF08882 Acetone_carb_G: Aceto 29.5 17 0.00037 25.8 -0.1 37 91-128 24-83 (112)
278 KOG1842|consensus 29.4 28 0.00061 31.4 1.2 29 243-271 15-43 (505)
279 KOG0320|consensus 29.1 34 0.00074 26.7 1.4 15 184-198 166-180 (187)
280 PF03145 Sina: Seven in absent 28.4 35 0.00077 27.4 1.6 46 219-269 24-73 (198)
281 PF11789 zf-Nse: Zinc-finger o 28.3 51 0.0011 20.3 1.9 32 89-124 22-53 (57)
282 KOG4727|consensus 27.9 30 0.00066 26.6 1.0 22 243-264 75-96 (193)
283 COG3677 Transposase and inacti 27.8 29 0.00064 25.7 0.9 47 201-256 20-66 (129)
284 PF06397 Desulfoferrod_N: Desu 27.2 23 0.00049 19.6 0.2 13 90-102 5-17 (36)
285 PF10122 Mu-like_Com: Mu-like 27.2 22 0.00047 21.3 0.1 9 244-252 25-33 (51)
286 PTZ00448 hypothetical protein; 27.2 45 0.00097 29.5 2.0 25 243-267 314-338 (373)
287 PF03811 Zn_Tnp_IS1: InsA N-te 26.9 14 0.0003 20.4 -0.7 13 207-219 23-35 (36)
288 COG3364 Zn-ribbon containing p 26.3 38 0.00082 23.6 1.1 27 91-127 2-28 (112)
289 KOG0782|consensus 26.1 11 0.00024 34.9 -1.9 53 198-257 238-290 (1004)
290 TIGR00627 tfb4 transcription f 25.7 52 0.0011 28.1 2.1 25 213-253 255-279 (279)
291 PF07800 DUF1644: Protein of u 25.5 1.7E+02 0.0038 22.5 4.5 55 90-145 79-134 (162)
292 PF14369 zf-RING_3: zinc-finge 25.2 51 0.0011 18.0 1.3 10 245-254 23-32 (35)
293 PF11023 DUF2614: Protein of u 25.1 43 0.00094 23.9 1.3 12 212-223 68-79 (114)
294 TIGR00416 sms DNA repair prote 24.7 42 0.00092 31.0 1.5 25 184-222 6-30 (454)
295 KOG3002|consensus 24.6 86 0.0019 27.1 3.2 79 184-269 79-165 (299)
296 PRK06260 threonine synthase; V 24.4 51 0.0011 29.9 2.0 10 186-195 4-13 (397)
297 PF09332 Mcm10: Mcm10 replicat 24.4 18 0.0004 31.7 -0.8 12 213-224 285-296 (344)
298 PRK11823 DNA repair protein Ra 24.3 43 0.00093 30.9 1.5 25 184-222 6-30 (446)
299 PF02748 PyrI_C: Aspartate car 24.0 32 0.0007 20.8 0.4 19 86-104 30-48 (52)
300 PF02591 DUF164: Putative zinc 24.0 62 0.0013 19.7 1.7 31 187-221 24-54 (56)
301 KOG4317|consensus 23.7 21 0.00045 30.5 -0.6 21 184-204 18-38 (383)
302 PF13639 zf-RING_2: Ring finge 23.7 88 0.0019 17.7 2.3 15 211-226 14-28 (44)
303 PF03145 Sina: Seven in absent 23.5 56 0.0012 26.2 1.9 52 91-145 14-73 (198)
304 KOG1994|consensus 23.5 44 0.00096 27.1 1.2 22 212-233 238-259 (268)
305 KOG4727|consensus 23.3 61 0.0013 25.1 1.8 25 88-112 72-96 (193)
306 PF01286 XPA_N: XPA protein N- 23.3 24 0.00052 19.2 -0.2 9 216-224 6-14 (34)
307 PRK05978 hypothetical protein; 23.3 32 0.00069 26.2 0.3 10 245-254 54-63 (148)
308 COG4338 Uncharacterized protei 22.9 35 0.00076 20.0 0.4 16 245-260 14-29 (54)
309 PF04641 Rtf2: Rtf2 RING-finge 22.8 74 0.0016 26.9 2.5 51 184-256 112-163 (260)
310 PF14447 Prok-RING_4: Prokaryo 22.5 48 0.0011 20.3 1.0 12 244-255 40-51 (55)
311 PF11781 RRN7: RNA polymerase 22.5 41 0.0009 18.5 0.6 7 188-194 11-17 (36)
312 COG4391 Uncharacterized protei 22.1 42 0.00091 21.0 0.6 15 88-102 45-59 (62)
313 PLN03238 probable histone acet 22.0 80 0.0017 26.9 2.5 26 117-142 46-71 (290)
314 PF14255 Cys_rich_CPXG: Cystei 21.9 40 0.00087 20.4 0.5 10 245-254 2-11 (52)
315 KOG3214|consensus 21.7 30 0.00066 23.9 -0.0 16 242-257 46-61 (109)
316 PTZ00448 hypothetical protein; 21.6 62 0.0014 28.6 1.8 23 213-235 314-336 (373)
317 KOG4602|consensus 21.5 52 0.0011 27.2 1.3 66 184-266 232-304 (318)
318 COG5027 SAS2 Histone acetyltra 21.3 35 0.00077 29.7 0.3 30 240-269 155-184 (395)
319 PLN03239 histone acetyltransfe 21.2 82 0.0018 27.7 2.4 21 213-233 106-126 (351)
320 cd01121 Sms Sms (bacterial rad 21.0 55 0.0012 29.4 1.4 8 185-192 14-21 (372)
321 PRK12860 transcriptional activ 20.7 67 0.0015 25.5 1.7 48 192-250 114-161 (189)
322 PF09082 DUF1922: Domain of un 20.6 23 0.00049 22.8 -0.7 13 240-253 17-29 (68)
323 PF14787 zf-CCHC_5: GAG-polypr 20.3 54 0.0012 18.0 0.8 14 245-258 4-17 (36)
324 KOG1729|consensus 20.2 39 0.00085 28.9 0.3 7 187-193 170-176 (288)
No 1
>KOG2462|consensus
Probab=100.00 E-value=4.9e-34 Score=226.21 Aligned_cols=137 Identities=34% Similarity=0.645 Sum_probs=129.3
Q ss_pred CCCCcccccccccccCChHHHHHHHHhccC---CCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202 87 QEEKNKECEFCGKKFKYENTLQAHRRNHTG---EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA 163 (343)
Q Consensus 87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~ 163 (343)
.....|+|..||+.|.+..+|.+|.++|.. .+.+.|++|++.|.+...|+.|+++|.
T Consensus 126 ~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-------------------- 185 (279)
T KOG2462|consen 126 AKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-------------------- 185 (279)
T ss_pred ccCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--------------------
Confidence 355679999999999999999999999965 577999999999999999999999996
Q ss_pred HHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcc
Q psy5202 164 AAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDV 243 (343)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~ 243 (343)
-++.|.+||+.|...+.|+.|+|+|+|||||.|+.|+++|..+++|+.||++|. +.++
T Consensus 186 --------------------l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS--~~K~ 243 (279)
T KOG2462|consen 186 --------------------LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHS--DVKK 243 (279)
T ss_pred --------------------CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhc--CCcc
Confidence 578999999999999999999999999999999999999999999999999997 7789
Q ss_pred cccCCCCCCCCChHHHHHHHhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
|+|..|+|+|..++.|.+|...
T Consensus 244 ~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 244 HQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred ccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999874
No 2
>KOG2462|consensus
Probab=99.93 E-value=1.5e-26 Score=183.86 Aligned_cols=126 Identities=28% Similarity=0.575 Sum_probs=114.9
Q ss_pred CCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccc
Q psy5202 116 GEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVF 195 (343)
Q Consensus 116 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f 195 (343)
....|+|+.|++.+.+..+|.+|.++|.... ..+.+.|.+|+++|
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~-----------------------------------s~ka~~C~~C~K~Y 171 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLD-----------------------------------SKKAFSCKYCGKVY 171 (279)
T ss_pred cCCceeccccccccccccccchhhccccccc-----------------------------------ccccccCCCCCcee
Confidence 3456999999999999999999999997532 12789999999999
Q ss_pred cCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcCccccccc
Q psy5202 196 KNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQNMKME 275 (343)
Q Consensus 196 ~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~ 275 (343)
.....|..|+++|+ -+++|.+||+.|.+.+.|+.|+|+|+ ||+||.|+.|+|.|..+++|+.||++|...+.+.|.
T Consensus 172 vSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHT--GEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~ 247 (279)
T KOG2462|consen 172 VSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHT--GEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCP 247 (279)
T ss_pred eehHHHhhHhhccC--CCcccccccccccchHHhhccccccc--CCCCccCCcccchhcchHHHHHHHHhhcCCccccCc
Confidence 99999999999997 68999999999999999999999997 999999999999999999999999999999998887
Q ss_pred chhHH
Q psy5202 276 SHNIM 280 (343)
Q Consensus 276 ~~~~~ 280 (343)
.+...
T Consensus 248 ~C~Ks 252 (279)
T KOG2462|consen 248 RCGKS 252 (279)
T ss_pred chhhH
Confidence 66544
No 3
>KOG1074|consensus
Probab=99.88 E-value=9.6e-23 Score=183.56 Aligned_cols=90 Identities=31% Similarity=0.626 Sum_probs=79.1
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcC--CCCcccccC---CCCCCCCChHH
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGR--LGKDVYRCR---FCEMPFSVPST 258 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~--~~~~~~~C~---~C~~~f~~~~~ 258 (343)
.+-.|-+|-++...++.|+-|.++|+||+||+|.+||+.|.++.+|+.||-+|.. .-.-.|.|+ +|-+.|...-.
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 4667999999999999999999999999999999999999999999999999851 011348899 99999999999
Q ss_pred HHHHHhhhcCccccc
Q psy5202 259 LEKHMRKCVVNQNMK 273 (343)
Q Consensus 259 l~~H~~~~~~~~~~~ 273 (343)
|..|+|.|-+.....
T Consensus 684 lpQhIriH~~~~~s~ 698 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISN 698 (958)
T ss_pred ccceEEeecCCCCCC
Confidence 999999988655433
No 4
>KOG1074|consensus
Probab=99.84 E-value=3.6e-22 Score=179.92 Aligned_cols=177 Identities=27% Similarity=0.510 Sum_probs=129.6
Q ss_pred CCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCC-CcchHHHHHHHH
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGP-SASSLAAAAAAA 167 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~-~~~~~~~~~~~~ 167 (343)
-.+-+|-+|-++...+..|+.|.++|+|++||+|.+||+.|.++.+|+.||-.|....+.......+ .+..........
T Consensus 603 TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V 682 (958)
T KOG1074|consen 603 TDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAV 682 (958)
T ss_pred CCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccc
Confidence 4568999999999999999999999999999999999999999999999999987654332111110 000000000000
Q ss_pred HhhhcCCCCcccc--------------cCCCCcccccCcccccCHHHHHHHHHHccCC----------------------
Q psy5202 168 ALNLCLPNSHMKK--------------ESRRNDTCEFCGKVFKNCSNLTVHRRSHTGE---------------------- 211 (343)
Q Consensus 168 ~~~~~~~~~~~~~--------------~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~---------------------- 211 (343)
....+++. .....-+|..|.+.|.....+..++..|.+.
T Consensus 683 -----~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~ 757 (958)
T KOG1074|consen 683 -----TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPP 757 (958)
T ss_pred -----cccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCC
Confidence 01111111 1112345777888887777777776655210
Q ss_pred --------------------------------------------------------------------------------
Q psy5202 212 -------------------------------------------------------------------------------- 211 (343)
Q Consensus 212 -------------------------------------------------------------------------------- 211 (343)
T Consensus 758 ~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l 837 (958)
T KOG1074|consen 758 PPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSML 837 (958)
T ss_pred ccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccc
Confidence
Q ss_pred ----------------------------------------CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC
Q psy5202 212 ----------------------------------------KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM 251 (343)
Q Consensus 212 ----------------------------------------~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~ 251 (343)
....|.+||+.|...+.|+.|+++|. ++++|.|.+|++
T Consensus 838 ~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHt--g~KPF~C~fC~~ 915 (958)
T KOG1074|consen 838 NEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHT--GPKPFFCHFCEE 915 (958)
T ss_pred ccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCC--CCCCccchhhhh
Confidence 12579999999999999999999997 899999999999
Q ss_pred CCCChHHHHHHHhhhcCcccc
Q psy5202 252 PFSVPSTLEKHMRKCVVNQNM 272 (343)
Q Consensus 252 ~f~~~~~l~~H~~~~~~~~~~ 272 (343)
.|..+.+|+.||.+|+..+..
T Consensus 916 aFttrgnLKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 916 AFTTRGNLKVHMGTHMWVQPP 936 (958)
T ss_pred hhhhhhhhhhhhccccccCCC
Confidence 999999999999999876654
No 5
>KOG3608|consensus
Probab=99.83 E-value=3.6e-21 Score=157.24 Aligned_cols=166 Identities=25% Similarity=0.499 Sum_probs=105.5
Q ss_pred cccccc--cccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCC--CCccccCCCCcchHHHHHHH
Q psy5202 91 NKECEF--CGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDG--DKEDREEGPSASSLAAAAAA 166 (343)
Q Consensus 91 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~--~~~~~~~~~~~~~~~~~~~~ 166 (343)
.+.|.. |-+.|.++..|++|++.|.+++...|+.||..|.++..|..|++...... .+.+..+...+.........
T Consensus 177 v~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~H 256 (467)
T KOG3608|consen 177 VTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSH 256 (467)
T ss_pred eeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHH
Confidence 356654 88999999999999999999999999999999999999999987665322 22232222222221111111
Q ss_pred HHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHH-ccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccc
Q psy5202 167 AALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRS-HTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYR 245 (343)
Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~ 245 (343)
.. +| -..|+|++|+.+....+.|..|++. |...+||+|..|.+.|.+.+.|.+|..+|. +..|+
T Consensus 257 v~-------rH-----vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS---~~~y~ 321 (467)
T KOG3608|consen 257 VV-------RH-----VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS---KTVYQ 321 (467)
T ss_pred HH-------Hh-----hhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc---cccee
Confidence 10 11 1456677777777777777777664 566667777777777777777777766663 45566
Q ss_pred cCC--CCCCCCChHHHHHHHhhhcCccc
Q psy5202 246 CRF--CEMPFSVPSTLEKHMRKCVVNQN 271 (343)
Q Consensus 246 C~~--C~~~f~~~~~l~~H~~~~~~~~~ 271 (343)
|.. |.+.|.+...|++|++.+|.+.+
T Consensus 322 C~h~~C~~s~r~~~q~~~H~~evhEg~n 349 (467)
T KOG3608|consen 322 CEHPDCHYSVRTYTQMRRHFLEVHEGNN 349 (467)
T ss_pred cCCCCCcHHHHHHHHHHHHHHHhccCCC
Confidence 655 66666666666666665554433
No 6
>KOG3608|consensus
Probab=99.82 E-value=2.7e-20 Score=152.12 Aligned_cols=174 Identities=24% Similarity=0.404 Sum_probs=131.9
Q ss_pred CCCCCCcccccccccccCChHHHHHHHHhcc--CCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHH
Q psy5202 85 TSQEEKNKECEFCGKKFKYENTLQAHRRNHT--GEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAA 162 (343)
Q Consensus 85 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~ 162 (343)
.+.++|...|+.||..|.++..|-.|++.-+ ...+|+|..|.+.|.++..|..|+..|-.. +.+..+.-.......
T Consensus 201 ~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~--ykCplCdmtc~~~ss 278 (467)
T KOG3608|consen 201 THSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNC--YKCPLCDMTCSSASS 278 (467)
T ss_pred hcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhc--ccccccccCCCChHH
Confidence 4567788888888888888888888876533 356788888888888888888888877532 333333322222211
Q ss_pred HHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCc--CcccccChHHHHHHHHHhc-CC
Q psy5202 163 AAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCEL--CSYACAQSSKLTRHMKTHG-RL 239 (343)
Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~-~~ 239 (343)
... .+.-+| ...++|+|+.|++.|.+.+.|.+|+.+|. +..|+|.. |.+.|.+...|++|++.++ +.
T Consensus 279 L~~------H~r~rH---s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~ 348 (467)
T KOG3608|consen 279 LTT------HIRYRH---SKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGN 348 (467)
T ss_pred HHH------HHHhhh---ccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCC
Confidence 111 111111 12489999999999999999999999988 77899988 9999999999999998766 55
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQ 270 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 270 (343)
...+|.|..|++.|.+-.+|..|+++.|+-+
T Consensus 349 np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 349 NPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred CCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 6678999999999999999999998877533
No 7
>KOG3576|consensus
Probab=99.81 E-value=8.7e-20 Score=137.89 Aligned_cols=87 Identities=32% Similarity=0.601 Sum_probs=57.2
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc---------CCCCcccccCCCCCCCCCh
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG---------RLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~---------~~~~~~~~C~~C~~~f~~~ 256 (343)
+.|..||+.|.+...|++|+++|+|.+||+|..|++.|..+-.|..|++..| +...+.|.|+.||++-...
T Consensus 146 ~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~ 225 (267)
T KOG3576|consen 146 HLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERP 225 (267)
T ss_pred HHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCCh
Confidence 3455555555555555555555555566666666666665555555554322 1245778999999999999
Q ss_pred HHHHHHHhhhcCcccc
Q psy5202 257 STLEKHMRKCVVNQNM 272 (343)
Q Consensus 257 ~~l~~H~~~~~~~~~~ 272 (343)
..+..|++.||.....
T Consensus 226 e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 226 EVYYLHLKLHHPFSPA 241 (267)
T ss_pred hHHHHHHHhcCCCCHH
Confidence 9999999998877644
No 8
>KOG3576|consensus
Probab=99.81 E-value=1e-20 Score=142.87 Aligned_cols=110 Identities=31% Similarity=0.645 Sum_probs=98.7
Q ss_pred CCCCcccccccccccCChHHHHHHHHhccCCCCee---cCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202 87 QEEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFK---CTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA 163 (343)
Q Consensus 87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~---C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~ 163 (343)
.+...|.|.+|++.|.-...|.+|++-|.. .+ |..||+.|..-..|++|+++|++.
T Consensus 113 sd~d~ftCrvCgK~F~lQRmlnrh~kch~~---vkr~lct~cgkgfndtfdlkrh~rthtgv------------------ 171 (267)
T KOG3576|consen 113 SDQDSFTCRVCGKKFGLQRMLNRHLKCHSD---VKRHLCTFCGKGFNDTFDLKRHTRTHTGV------------------ 171 (267)
T ss_pred CCCCeeeeehhhhhhhHHHHHHHHhhhccH---HHHHHHhhccCcccchhhhhhhhccccCc------------------
Confidence 345679999999999999999999999853 44 899999999999999999999987
Q ss_pred HHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccC-----------CCceecCcCcccccChHHHHHH
Q psy5202 164 AAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTG-----------EKPYKCELCSYACAQSSKLTRH 232 (343)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-----------~~~~~C~~C~~~f~~~~~L~~H 232 (343)
++|+|..|++.|..+..|..|++.-+| .+.|.|..||++-.....+..|
T Consensus 172 --------------------rpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h 231 (267)
T KOG3576|consen 172 --------------------RPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLH 231 (267)
T ss_pred --------------------cccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHH
Confidence 899999999999999999999876444 3569999999999999999999
Q ss_pred HHHhc
Q psy5202 233 MKTHG 237 (343)
Q Consensus 233 ~~~h~ 237 (343)
++.|+
T Consensus 232 ~~~~h 236 (267)
T KOG3576|consen 232 LKLHH 236 (267)
T ss_pred HHhcC
Confidence 99987
No 9
>KOG3623|consensus
Probab=99.69 E-value=1.1e-17 Score=148.89 Aligned_cols=122 Identities=29% Similarity=0.601 Sum_probs=99.0
Q ss_pred CCcccccccccccCChHHHHHHHHhccC--CCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHH
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNHTG--EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAA 166 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~--~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~ 166 (343)
.....|++|.+.+.....|+.|++.-+. +..|.|..|.++|..+..|.+||..|.........
T Consensus 208 sqlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~s--------------- 272 (1007)
T KOG3623|consen 208 SQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAIS--------------- 272 (1007)
T ss_pred hhhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCccccc---------------
Confidence 4457899999999999999999885332 45699999999999999999999999754322100
Q ss_pred HHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHH
Q psy5202 167 AALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
.......+.|+|..||+.|+.+-.|+.|+|+|.|+|||.|+.|+++|.....+-.||..
T Consensus 273 ----------ltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 273 ----------LTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred ----------ccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 00111237899999999999999999999999999999999999999999999888754
No 10
>KOG3623|consensus
Probab=99.67 E-value=1.2e-17 Score=148.77 Aligned_cols=107 Identities=27% Similarity=0.587 Sum_probs=98.3
Q ss_pred eecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHH
Q psy5202 120 FKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCS 199 (343)
Q Consensus 120 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~ 199 (343)
..|++|++.+.....|+.|++..|... +..|.|.+|..+|..+.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekn------------------------------------e~nfsC~lCsytFAyRt 254 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKN------------------------------------EPNFSCMLCSYTFAYRT 254 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhC------------------------------------CCCCcchhhhhhhhhHH
Confidence 789999999999999999999877542 26789999999999999
Q ss_pred HHHHHHHHccC-------------CCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHh
Q psy5202 200 NLTVHRRSHTG-------------EKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMR 264 (343)
Q Consensus 200 ~l~~H~~~h~~-------------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 264 (343)
.|.+|+.+|.. .+.|+|..||++|..+..|+.|+|+|. |++||.|+.|+|+|+....+-.||-
T Consensus 255 QLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHS--GEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 255 QLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHS--GEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred HHHHHHHhhcCCCcccccccchhhhccccccccchhhhhHHHHHhhheeec--CCCCcCCcccccccccCCccccccc
Confidence 99999999853 356999999999999999999999997 9999999999999999999999995
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.50 E-value=3.2e-14 Score=127.83 Aligned_cols=129 Identities=19% Similarity=0.418 Sum_probs=98.4
Q ss_pred ccccccccccCChHHHHHHHHhccC-------------------CCCeecCCCCccCCCHHHHHHHHHHhccCCCCcccc
Q psy5202 92 KECEFCGKKFKYENTLQAHRRNHTG-------------------EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDRE 152 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~h~~-------------------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~ 152 (343)
-.|+.|.+... ...|..|...... ...+.|+.|++.|. ...|..|++.+|
T Consensus 408 V~C~NC~~~i~-l~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~el~~H~~C~~Cgk~f~-~s~LekH~~~~H--------- 476 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHEAYCSRHNVVCPHDGCGIVLRVEEAKNHVHCEKCGQAFQ-QGEMEKHMKVFH--------- 476 (567)
T ss_pred EECCCCCCccc-hhHHHHHHhhCCCcceeCCcccccceeeccccccCccCCCCCCccc-hHHHHHHHHhcC---------
Confidence 46888888766 3456677654321 24467888888885 677888888875
Q ss_pred CCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc--------
Q psy5202 153 EGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA-------- 224 (343)
Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~-------- 224 (343)
.++.|+ ||+.+ .+..|..|+..|..++++.|.+|++.|.
T Consensus 477 -------------------------------kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~ 523 (567)
T PLN03086 477 -------------------------------EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDV 523 (567)
T ss_pred -------------------------------CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccch
Confidence 568898 99755 6688999999999999999999999885
Q ss_pred --ChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202 225 --QSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVV 268 (343)
Q Consensus 225 --~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 268 (343)
....|..|...+ |.+++.|..||+.|..+ .|..|+..+|.
T Consensus 524 ~d~~s~Lt~HE~~C---G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 524 RDRLRGMSEHESIC---GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred hhhhhhHHHHHHhc---CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence 245788998886 78889999999887554 67888887774
No 12
>KOG3993|consensus
Probab=99.46 E-value=7.8e-14 Score=117.36 Aligned_cols=174 Identities=19% Similarity=0.328 Sum_probs=116.7
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhh
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALN 170 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (343)
-|.|..|...|.+...|.+|...-.-...|+|+.|++.|....+|..|+++|............+......... ....
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra--e~~e 344 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA--EVQE 344 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh--hhhh
Confidence 49999999999999999999765444456999999999999999999999997654332111111000000000 0000
Q ss_pred hcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCC--------------------------------------
Q psy5202 171 LCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEK-------------------------------------- 212 (343)
Q Consensus 171 ~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-------------------------------------- 212 (343)
.. . .-......-|.|.+|++.|.....|+.|+.+|+...
T Consensus 345 a~--r-sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 345 AE--R-SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV 421 (500)
T ss_pred cc--c-cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce
Confidence 00 0 000112357999999999999999999988875310
Q ss_pred --------ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202 213 --------PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQN 271 (343)
Q Consensus 213 --------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~ 271 (343)
-..|++|+--+.++..--.|.+.-. .+..|.|.+|.-.|.+...|.+|+.++|....
T Consensus 422 l~~a~sael~~pp~~~~ppsss~~sgg~~rlg~--~~q~f~~ky~~atfyss~~ltrhin~~Hpse~ 486 (500)
T KOG3993|consen 422 LYVAGSAELELPPYDGSPPSSSGSSGGYGRLGI--AEQGFTCKYCPATFYSSPGLTRHINKCHPSEL 486 (500)
T ss_pred eeeeccccccCCCCCCCCcccCCCCCccccccc--hhhccccccchHhhhcCcchHhHhhhcChHHh
Confidence 1235556655555444444433322 35569999999999999999999999996553
No 13
>PHA00733 hypothetical protein
Probab=99.30 E-value=2.6e-12 Score=94.45 Aligned_cols=82 Identities=18% Similarity=0.273 Sum_probs=70.6
Q ss_pred CCCcccccCcccccCHHHHHHH--HH---HccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 183 RRNDTCEFCGKVFKNCSNLTVH--RR---SHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 183 ~~~~~C~~C~~~f~~~~~l~~H--~~---~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
++.+.|.+|.+.|.....|..| ++ .+++.++|.|+.|++.|.....|..|++.|. .+|.|..|++.|....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~----~~~~C~~CgK~F~~~~ 113 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE----HSKVCPVCGKEFRNTD 113 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC----cCccCCCCCCccCCHH
Confidence 3789999999999888777766 22 2456889999999999999999999999762 4699999999999999
Q ss_pred HHHHHHhhhcC
Q psy5202 258 TLEKHMRKCVV 268 (343)
Q Consensus 258 ~l~~H~~~~~~ 268 (343)
.|..|+++.|.
T Consensus 114 sL~~H~~~~h~ 124 (128)
T PHA00733 114 STLDHVCKKHN 124 (128)
T ss_pred HHHHHHHHhcC
Confidence 99999998874
No 14
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.24 E-value=2.4e-11 Score=109.54 Aligned_cols=104 Identities=20% Similarity=0.513 Sum_probs=86.9
Q ss_pred CCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHH
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAA 168 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (343)
++.+.|+.|++.|. ...|..|+..|+ +++.|+ |++.+ .+..|..|+..|.+.
T Consensus 451 ~~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~----------------------- 502 (567)
T PLN03086 451 KNHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPL----------------------- 502 (567)
T ss_pred ccCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCC-----------------------
Confidence 45689999999996 688999999986 789999 99755 678999999888766
Q ss_pred hhhcCCCCcccccCCCCcccccCccccc----------CHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202 169 LNLCLPNSHMKKESRRNDTCEFCGKVFK----------NCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 169 ~~~~~~~~~~~~~~~~~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
+++.|.+|++.|. ....|..|...+ |.+++.|..||+.|.. ..|..|+...|
T Consensus 503 ---------------Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrl-rdm~~H~~~~h 564 (567)
T PLN03086 503 ---------------RLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVML-KEMDIHQIAVH 564 (567)
T ss_pred ---------------CceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeee-hhHHHHHHHhh
Confidence 8899999999995 245899998885 8999999999988876 46778876543
No 15
>PHA00733 hypothetical protein
Probab=99.18 E-value=3.1e-11 Score=88.74 Aligned_cols=52 Identities=19% Similarity=0.358 Sum_probs=40.7
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
++|.|+.|++.|.....|..|++.| +.+|.|..|++.|.....|.+|+...|
T Consensus 72 kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 72 SPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred CCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence 6778888888888888888887765 346888888888888888888877655
No 16
>PHA02768 hypothetical protein; Provisional
Probab=99.03 E-value=1.2e-10 Score=69.99 Aligned_cols=40 Identities=20% Similarity=0.507 Sum_probs=19.4
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChH
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSS 227 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 227 (343)
|.|+.||+.|.....|..|+++|+ ++|+|..|++.|...+
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s 45 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTG 45 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccc
Confidence 445555555555555555555544 3444555555444433
No 17
>PHA02768 hypothetical protein; Provisional
Probab=99.00 E-value=1.7e-10 Score=69.41 Aligned_cols=44 Identities=20% Similarity=0.505 Sum_probs=39.9
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHH
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLK 136 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 136 (343)
-|+|+.||+.|.....|..|+++|. ++|+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 3899999999999999999999998 6899999999998777653
No 18
>KOG3993|consensus
Probab=98.85 E-value=4.3e-10 Score=95.18 Aligned_cols=89 Identities=19% Similarity=0.402 Sum_probs=74.8
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcC-------------------------
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGR------------------------- 238 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~------------------------- 238 (343)
..|.|.+|...|.+...|.+|....--..-|+|+.|++.|.-..+|..|.|.|.-
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 4689999999999999999997543333459999999999999999999999961
Q ss_pred ------CCCcccccCCCCCCCCChHHHHHHHhhhcCcccc
Q psy5202 239 ------LGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQNM 272 (343)
Q Consensus 239 ------~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~ 272 (343)
.....|.|.+|++.|.+...|++|+-+|+.....
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~ 385 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLA 385 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccc
Confidence 1223599999999999999999999999865543
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.84 E-value=2.8e-09 Score=54.77 Aligned_cols=25 Identities=56% Similarity=1.050 Sum_probs=18.0
Q ss_pred HHHHHHHHccCCCceecCcCccccc
Q psy5202 200 NLTVHRRSHTGEKPYKCELCSYACA 224 (343)
Q Consensus 200 ~l~~H~~~h~~~~~~~C~~C~~~f~ 224 (343)
+|.+|+++|++++||.|++|++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 3667777777777777777777775
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.66 E-value=1.6e-08 Score=51.94 Aligned_cols=26 Identities=50% Similarity=1.007 Sum_probs=24.0
Q ss_pred HHHHHHHhccCCCCeecCCCCccCCC
Q psy5202 106 TLQAHRRNHTGEKPFKCTVCDHACSQ 131 (343)
Q Consensus 106 ~L~~H~~~h~~~~~~~C~~C~~~f~~ 131 (343)
+|.+|+++|.++++|.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58999999999999999999999964
No 21
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.60 E-value=7.1e-08 Score=59.61 Aligned_cols=51 Identities=20% Similarity=0.417 Sum_probs=27.7
Q ss_pred eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
|.|++|++ ..+...|..|....|....+.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 55666666 34445566665554433344566666665433 25666665554
No 22
>PHA00732 hypothetical protein
Probab=98.58 E-value=3.2e-08 Score=65.81 Aligned_cols=45 Identities=29% Similarity=0.552 Sum_probs=24.4
Q ss_pred cccccCcccccCHHHHHHHHHH-ccCCCceecCcCcccccChHHHHHHHHHh
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRS-HTGEKPYKCELCSYACAQSSKLTRHMKTH 236 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~h 236 (343)
|.|..|++.|.....|..|++. |. ++.|+.|++.|. .|..|++.+
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 4566666666666666666553 33 235666666555 355555443
No 23
>PHA00616 hypothetical protein
Probab=98.55 E-value=2.9e-08 Score=56.74 Aligned_cols=32 Identities=22% Similarity=0.369 Sum_probs=16.2
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecC
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE 217 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 217 (343)
|+|+.||+.|.....|..|++.|+|++++.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 44555555555555555555555555554444
No 24
>PHA00616 hypothetical protein
Probab=98.55 E-value=3.1e-08 Score=56.64 Aligned_cols=34 Identities=15% Similarity=0.272 Sum_probs=29.8
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCC
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTV 124 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~ 124 (343)
+|+|..||+.|.....|.+|++.|++++++.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5889999999999999999999999988888765
No 25
>PHA00732 hypothetical protein
Probab=98.40 E-value=2e-07 Score=61.97 Aligned_cols=47 Identities=19% Similarity=0.589 Sum_probs=39.6
Q ss_pred ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
||.|..|++.|.....|+.|++.++ . ++.|+.||+.|. .|..|+++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H--~--~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNH--T--LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhccc--C--CCccCCCCCEeC---Chhhhhccc
Confidence 6899999999999999999998643 1 358999999998 588898654
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.34 E-value=1.1e-06 Score=54.30 Aligned_cols=23 Identities=22% Similarity=0.485 Sum_probs=10.0
Q ss_pred eecCCCCccCCCHHHHHHHHHHhc
Q psy5202 120 FKCTVCDHACSQSAKLKKHMKIHD 143 (343)
Q Consensus 120 ~~C~~C~~~f~~~~~L~~H~~~h~ 143 (343)
|.|++|++ ..+...|..|....|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H 25 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEH 25 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHC
Confidence 44444444 233344444444433
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.16 E-value=8.7e-07 Score=44.21 Aligned_cols=22 Identities=41% Similarity=0.922 Sum_probs=11.0
Q ss_pred eecCcCcccccChHHHHHHHHH
Q psy5202 214 YKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
|+|+.|++.|.+...|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555444
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.15 E-value=1.8e-06 Score=42.99 Aligned_cols=22 Identities=45% Similarity=1.025 Sum_probs=14.5
Q ss_pred cccCCCCCCCCChHHHHHHHhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666665
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=98.04 E-value=1.2e-06 Score=71.94 Aligned_cols=55 Identities=27% Similarity=0.577 Sum_probs=45.4
Q ss_pred CCceecCc--CcccccChHHHHHHHHHhc-----------------CCCCcccccCCCCCCCCChHHHHHHHhh
Q psy5202 211 EKPYKCEL--CSYACAQSSKLTRHMKTHG-----------------RLGKDVYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 211 ~~~~~C~~--C~~~f~~~~~L~~H~~~h~-----------------~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
++||+|++ |.+.+.....|+.|+..-| ....+||+|++|+|+|+....|+.|+..
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 58999976 9999999999999976422 1345899999999999999999999753
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=98.03 E-value=2.6e-06 Score=42.91 Aligned_cols=23 Identities=35% Similarity=0.795 Sum_probs=10.8
Q ss_pred cccCCCCCCCCChHHHHHHHhhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555544
No 31
>KOG2231|consensus
Probab=98.01 E-value=1.5e-05 Score=73.52 Aligned_cols=71 Identities=25% Similarity=0.538 Sum_probs=52.7
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcC------cccccChHHHHHHHHHhcCCCCcccccC--CCC-CCCCC
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELC------SYACAQSSKLTRHMKTHGRLGKDVYRCR--FCE-MPFSV 255 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~h~~~~~~~~~C~--~C~-~~f~~ 255 (343)
.-.|..|...|.....|.+|++.++ |.|..| +..|.....|..|.+.+| |.|. .|- +.|..
T Consensus 182 hp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H------flCE~~~C~~~~f~~ 251 (669)
T KOG2231|consen 182 HPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH------FLCEEEFCRTKKFYV 251 (669)
T ss_pred CccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC------ccccccccccceeee
Confidence 4579999999999999999998766 566666 345778899999998877 7787 565 34444
Q ss_pred hHHHHHHHhh
Q psy5202 256 PSTLEKHMRK 265 (343)
Q Consensus 256 ~~~l~~H~~~ 265 (343)
.-.+..|++.
T Consensus 252 ~~~~ei~lk~ 261 (669)
T KOG2231|consen 252 AFELEIELKA 261 (669)
T ss_pred hhHHHHHHHh
Confidence 4455555553
No 32
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.95 E-value=3.9e-06 Score=59.68 Aligned_cols=74 Identities=23% Similarity=0.502 Sum_probs=22.3
Q ss_pred ccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202 187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
.|.+|+..|.+...|..|+...++... + ....+.....+..+++.. -...+.|.+|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~---~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKK---VKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Ccccccccccccccccccccccccccc---c-cccccccccccccccccc---cCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 489999999999999999977555321 1 222233445555555433 123699999999999999999999975
Q ss_pred c
Q psy5202 267 V 267 (343)
Q Consensus 267 ~ 267 (343)
+
T Consensus 74 ~ 74 (100)
T PF12756_consen 74 H 74 (100)
T ss_dssp T
T ss_pred c
Confidence 3
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.93 E-value=9.7e-06 Score=40.76 Aligned_cols=24 Identities=33% Similarity=0.819 Sum_probs=15.0
Q ss_pred eecCcCcccccChHHHHHHHHHhc
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
|.|++|++.|.+...|+.|++.||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 567777777777777777776653
No 34
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.91 E-value=8.2e-06 Score=57.99 Aligned_cols=24 Identities=33% Similarity=0.630 Sum_probs=0.0
Q ss_pred ecCCCCccCCCHHHHHHHHHHhcc
Q psy5202 121 KCTVCDHACSQSAKLKKHMKIHDE 144 (343)
Q Consensus 121 ~C~~C~~~f~~~~~L~~H~~~h~~ 144 (343)
+|.+|+..|.+...|..|+...|+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~ 24 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHG 24 (100)
T ss_dssp ------------------------
T ss_pred Cccccccccccccccccccccccc
Confidence 488888888888888888877665
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.81 E-value=8.2e-06 Score=42.40 Aligned_cols=24 Identities=33% Similarity=0.620 Sum_probs=13.1
Q ss_pred cccCCCCCCCCChHHHHHHHhhhc
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
|.|..|++.|.+...|..|++.|+
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCccCCccCChhHHHHHhHHhc
Confidence 455555555555555555555443
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.76 E-value=7.2e-06 Score=67.48 Aligned_cols=53 Identities=32% Similarity=0.638 Sum_probs=45.9
Q ss_pred CCCCccccc--CcccccCHHHHHHHHHH-cc------------------CCCceecCcCcccccChHHHHHHHH
Q psy5202 182 SRRNDTCEF--CGKVFKNCSNLTVHRRS-HT------------------GEKPYKCELCSYACAQSSKLTRHMK 234 (343)
Q Consensus 182 ~~~~~~C~~--C~~~f~~~~~l~~H~~~-h~------------------~~~~~~C~~C~~~f~~~~~L~~H~~ 234 (343)
++++|+|++ |.++|.....|+.|+.- |. ..|||+|++|+++++....|+.|..
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 459999998 99999999999999874 31 2489999999999999999998864
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.70 E-value=2.3e-05 Score=40.71 Aligned_cols=25 Identities=40% Similarity=0.804 Sum_probs=16.6
Q ss_pred cccccccccccCChHHHHHHHHhcc
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHT 115 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~ 115 (343)
+|+|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777766654
No 38
>KOG2231|consensus
Probab=97.69 E-value=0.0001 Score=68.27 Aligned_cols=103 Identities=22% Similarity=0.445 Sum_probs=74.2
Q ss_pred ecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCccccc----------
Q psy5202 121 KCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEF---------- 190 (343)
Q Consensus 121 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~---------- 190 (343)
.|..| ..|.+...|+.|+...|. .+.|.+
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~----------------------------------------~~~c~lC~~~~kif~~ 155 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHK----------------------------------------LHLCSLCLQNLKIFIN 155 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhh----------------------------------------hhccccccccceeeee
Confidence 46666 666688999999976663 233333
Q ss_pred CcccccCHHHHHHHHHHc-cCCCc----eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC------CCCCChHHH
Q psy5202 191 CGKVFKNCSNLTVHRRSH-TGEKP----YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE------MPFSVPSTL 259 (343)
Q Consensus 191 C~~~f~~~~~l~~H~~~h-~~~~~----~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~------~~f~~~~~l 259 (343)
+.+.| ....|..|++.- .+++. -.|..|...|.....|.+|++.+| |.|.+|. ..|.....|
T Consensus 156 e~k~Y-t~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h------~~chfC~~~~~~neyy~~~~dL 228 (669)
T KOG2231|consen 156 ERKLY-TRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH------EFCHFCDYKTGQNEYYNDYDDL 228 (669)
T ss_pred eeehe-hHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce------eheeecCcccccchhcccchHH
Confidence 33444 566788887652 11221 369999999999999999999876 6677774 568888999
Q ss_pred HHHHhhhcCccc
Q psy5202 260 EKHMRKCVVNQN 271 (343)
Q Consensus 260 ~~H~~~~~~~~~ 271 (343)
..|.|..|..-.
T Consensus 229 e~HfR~~HflCE 240 (669)
T KOG2231|consen 229 EEHFRKGHFLCE 240 (669)
T ss_pred HHHhhhcCcccc
Confidence 999999886554
No 39
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.48 E-value=0.00012 Score=61.22 Aligned_cols=135 Identities=21% Similarity=0.438 Sum_probs=90.6
Q ss_pred cccccc--cccccCChHHHHHHHHhccCCCCeecCCCC---ccCCCH------HHHHHHHHHhccCCCCccccCCCCcch
Q psy5202 91 NKECEF--CGKKFKYENTLQAHRRNHTGEKPFKCTVCD---HACSQS------AKLKKHMKIHDEDGDKEDREEGPSASS 159 (343)
Q Consensus 91 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~~~------~~L~~H~~~h~~~~~~~~~~~~~~~~~ 159 (343)
.|.|+. |..+......|..|.+..++ .+.|.+|- +.|... ..|..|...-..+. +
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~-------G----- 216 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEE-------G----- 216 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCcccc-------C-----
Confidence 377876 77777778889999887544 37788874 234333 33444332111100 0
Q ss_pred HHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcc-------cccChHHHHHH
Q psy5202 160 LAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY-------ACAQSSKLTRH 232 (343)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~L~~H 232 (343)
-+..-.|..|...|-+-..|.+|++..+. +|.+|++ -|..-..|.+|
T Consensus 217 ----------------------FKGHP~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~H 270 (493)
T COG5236 217 ----------------------FKGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAH 270 (493)
T ss_pred ----------------------cCCCchhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHH
Confidence 01344699999999999999999997553 4666654 37888899999
Q ss_pred HHHhcCCCCcccccCC--CC----CCCCChHHHHHHHhhhcCccc
Q psy5202 233 MKTHGRLGKDVYRCRF--CE----MPFSVPSTLEKHMRKCVVNQN 271 (343)
Q Consensus 233 ~~~h~~~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~~~~~~ 271 (343)
.+.-| |.|.+ |- +.|.....|..|+...|+...
T Consensus 271 F~~~h------y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~~ 309 (493)
T COG5236 271 FRNAH------YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVNA 309 (493)
T ss_pred hhcCc------eEEEEEEEecCcEEEeccHHHHHHHHHHHhhccc
Confidence 86543 66654 53 579999999999988886553
No 40
>KOG1146|consensus
Probab=97.37 E-value=0.00028 Score=69.33 Aligned_cols=160 Identities=20% Similarity=0.297 Sum_probs=98.7
Q ss_pred cccccccccCChHHHHHHHH-hccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhh
Q psy5202 93 ECEFCGKKFKYENTLQAHRR-NHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNL 171 (343)
Q Consensus 93 ~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (343)
.|..|+..+.....+.-|+. .|...+.|+|+.|+..|.....|..||+..|.+........+.. ..
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~-------------~~ 504 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQN-------------HP 504 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccc-------------cc
Confidence 46777888887777777765 36667889999999999999999999998765532211000000 00
Q ss_pred cCCCCcccccCCCCcccccCcccccCHHHHHHHHHH--ccCC--------------------------------------
Q psy5202 172 CLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRS--HTGE-------------------------------------- 211 (343)
Q Consensus 172 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~--h~~~-------------------------------------- 211 (343)
.........-+.++|.|..|...+....+|.+|++. |..+
T Consensus 505 ~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~p 584 (1406)
T KOG1146|consen 505 RLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGP 584 (1406)
T ss_pred cccccccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCC
Confidence 000001112234789999999999999999999874 2211
Q ss_pred ---CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202 212 ---KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 212 ---~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
-.|.|.+|++...-...|+-||..-.. -..+.-|-.|+-.+.....+..|.+-+
T Consensus 585 ktkP~~~C~vc~yetniarnlrihmtss~~-s~~p~~~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 585 KTKPSWRCEVCSYETNIARNLRIHMTASPS-SSPPSLVLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred CCCCCcchhhhcchhhhhhccccccccCCC-CCChHHHhhhcchhhccccccCcCCCC
Confidence 125666666666666666666654221 223355555666655555555555444
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.29 E-value=0.00014 Score=36.48 Aligned_cols=22 Identities=55% Similarity=1.105 Sum_probs=10.3
Q ss_pred eecCcCcccccChHHHHHHHHHh
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTH 236 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h 236 (343)
|+|+.|++... ...|.+|++.|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 44555555554 55555555544
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.26 E-value=0.00025 Score=36.12 Aligned_cols=23 Identities=43% Similarity=0.800 Sum_probs=17.4
Q ss_pred cccCCCCCCCCChHHHHHHHhhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
|.|..|++.|.....|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 56778888888888888887754
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.26 E-value=0.00036 Score=40.92 Aligned_cols=31 Identities=23% Similarity=0.439 Sum_probs=14.2
Q ss_pred CCCcccccccccccCChHHHHHHHHhccCCC
Q psy5202 88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEK 118 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 118 (343)
.+.|-.|++|+..+....+|++|+..+++.+
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 3445556666666666666666655544433
No 44
>KOG2785|consensus
Probab=97.24 E-value=0.00091 Score=57.15 Aligned_cols=60 Identities=23% Similarity=0.375 Sum_probs=46.9
Q ss_pred eecCcCcccccChHHHHHHHHHhcCC------------------C---CcccccCCCC---CCCCChHHHHHHHhhhcCc
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHGRL------------------G---KDVYRCRFCE---MPFSVPSTLEKHMRKCVVN 269 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~~~------------------~---~~~~~C~~C~---~~f~~~~~l~~H~~~~~~~ 269 (343)
-.|-.|++.|.....-..||..+|+. | ..-|.|-.|. +.|.+....+.||+....+
T Consensus 167 t~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K~HC 246 (390)
T KOG2785|consen 167 TDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDKGHC 246 (390)
T ss_pred cceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhccCc
Confidence 46999999999999999999988731 0 1127788898 9999999999999865444
Q ss_pred cccc
Q psy5202 270 QNMK 273 (343)
Q Consensus 270 ~~~~ 273 (343)
+-..
T Consensus 247 kl~y 250 (390)
T KOG2785|consen 247 KLPY 250 (390)
T ss_pred ccCC
Confidence 4433
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.20 E-value=0.00029 Score=35.86 Aligned_cols=24 Identities=38% Similarity=0.760 Sum_probs=19.2
Q ss_pred eecCcCcccccChHHHHHHHHHhc
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
|+|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 578888888888888888887663
No 46
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=97.13 E-value=0.00043 Score=63.37 Aligned_cols=145 Identities=24% Similarity=0.351 Sum_probs=92.6
Q ss_pred CcccccccccccCChHHHHHHHH--hccCC--CCeecC--CCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202 90 KNKECEFCGKKFKYENTLQAHRR--NHTGE--KPFKCT--VCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA 163 (343)
Q Consensus 90 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~ 163 (343)
.++.|..|...|.....|.+|.+ .|.++ +++.|+ .|++.|.....+.+|...|.+.....+.............
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 47999999999999999999999 89999 999999 7999999999999999999876544332221111100000
Q ss_pred HHHHHhhhcCCCCcccccCCCCccccc--CcccccCHHHHHHHHHHccCCC--ceecCcCcccccChHHHHHHHHHhc
Q psy5202 164 AAAAALNLCLPNSHMKKESRRNDTCEF--CGKVFKNCSNLTVHRRSHTGEK--PYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
... ................+.|.. |-..+.....+..|...|.... .+.|..|...|.....+..|++.|.
T Consensus 368 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (467)
T COG5048 368 NNE---PPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT 442 (467)
T ss_pred CCC---CccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccc
Confidence 000 000000000011112333322 5556666666666666665555 3556778888888888888877774
No 47
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.12 E-value=0.00034 Score=58.54 Aligned_cols=107 Identities=27% Similarity=0.504 Sum_probs=76.5
Q ss_pred eecCC--CCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcc----
Q psy5202 120 FKCTV--CDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGK---- 193 (343)
Q Consensus 120 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~---- 193 (343)
|.|+. |.........|..|.+..|+. +.|.+|-.
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~----------------------------------------~~C~~C~~nKk~ 191 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF----------------------------------------VLCSECIGNKKD 191 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc----------------------------------------EEhHhhhcCccc
Confidence 88986 777777788999999987743 45555532
Q ss_pred -----cccCHHHHHHHHHHccCCCce----ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC-------CCCChH
Q psy5202 194 -----VFKNCSNLTVHRRSHTGEKPY----KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM-------PFSVPS 257 (343)
Q Consensus 194 -----~f~~~~~l~~H~~~h~~~~~~----~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~-------~f~~~~ 257 (343)
..-....|..|...-..+.-| .|..|...|..-..|.+|+|..| + +|-+|++ -|....
T Consensus 192 F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H---E---~ChICD~v~p~~~QYFK~Y~ 265 (493)
T COG5236 192 FWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH---E---ACHICDMVGPIRYQYFKSYE 265 (493)
T ss_pred CccceeeeecccccccccCCccccCcCCCchhhhccceecChHHHHHHHHhhh---h---hhhhhhccCccchhhhhCHH
Confidence 233455677776543323223 59999999999999999999866 3 4777764 489999
Q ss_pred HHHHHHhhhcCcccc
Q psy5202 258 TLEKHMRKCVVNQNM 272 (343)
Q Consensus 258 ~l~~H~~~~~~~~~~ 272 (343)
.|..|.+.-|..-.+
T Consensus 266 ~Le~HF~~~hy~ct~ 280 (493)
T COG5236 266 DLEAHFRNAHYCCTF 280 (493)
T ss_pred HHHHHhhcCceEEEE
Confidence 999999876654433
No 48
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.12 E-value=0.00062 Score=39.93 Aligned_cols=26 Identities=31% Similarity=0.661 Sum_probs=10.2
Q ss_pred CceecCcCcccccChHHHHHHHHHhc
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
.|-.|++|+..+....+|++|+.++|
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H 48 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRH 48 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHT
T ss_pred CCCCCCcchhhccchhhHHHHHHHHh
Confidence 34445555555555555555544433
No 49
>KOG1146|consensus
Probab=97.12 E-value=7e-05 Score=73.33 Aligned_cols=150 Identities=20% Similarity=0.309 Sum_probs=96.1
Q ss_pred CCCCcccccccccccCChHHHHHHHHhcc-------------------------CCCCeecCCCCccCCCHHHHHHHHHH
Q psy5202 87 QEEKNKECEFCGKKFKYENTLQAHRRNHT-------------------------GEKPFKCTVCDHACSQSAKLKKHMKI 141 (343)
Q Consensus 87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~-------------------------~~~~~~C~~C~~~f~~~~~L~~H~~~ 141 (343)
.-.+.|+|+.|++.|+....|..|||..+ +.++|.|..|...+..+.+|.+|++.
T Consensus 461 S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 461 SFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred cccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHHHHH
Confidence 34578999999999999999999999721 12679999999999999999999975
Q ss_pred hccCCCCccccC--CCCcchHHHHHHH--HHhhhcCCCCccc-ccCCCCcccccCcccccCHHHHHHHHHH-ccCCCcee
Q psy5202 142 HDEDGDKEDREE--GPSASSLAAAAAA--AALNLCLPNSHMK-KESRRNDTCEFCGKVFKNCSNLTVHRRS-HTGEKPYK 215 (343)
Q Consensus 142 h~~~~~~~~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~ 215 (343)
--.....+..+. +............ ............. ......|.|.+|+....-..+|+.|+.. ++-..|.-
T Consensus 541 ~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p~~ 620 (1406)
T KOG1146|consen 541 DLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPPSL 620 (1406)
T ss_pred HhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCChHH
Confidence 321111111000 0000000000000 0000001111111 2234679999999999999999999876 44445577
Q ss_pred cCcCcccccChHHHHHHHHHh
Q psy5202 216 CELCSYACAQSSKLTRHMKTH 236 (343)
Q Consensus 216 C~~C~~~f~~~~~L~~H~~~h 236 (343)
|..|+..+.....+..|.+.+
T Consensus 621 ~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 621 VLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred HhhhcchhhccccccCcCCCC
Confidence 888888888887777777766
No 50
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.06 E-value=0.00055 Score=34.23 Aligned_cols=23 Identities=39% Similarity=0.846 Sum_probs=12.8
Q ss_pred eecCCCCccCCCHHHHHHHHHHhc
Q psy5202 120 FKCTVCDHACSQSAKLKKHMKIHD 143 (343)
Q Consensus 120 ~~C~~C~~~f~~~~~L~~H~~~h~ 143 (343)
|+|+.|++... ...|.+|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666665 666666666554
No 51
>PRK04860 hypothetical protein; Provisional
Probab=96.87 E-value=0.0005 Score=52.64 Aligned_cols=36 Identities=25% Similarity=0.572 Sum_probs=17.5
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA 224 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 224 (343)
+|.|. |+. ....+.+|.++|.++++|+|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 34554 544 3334445555555555555555554443
No 52
>PRK04860 hypothetical protein; Provisional
Probab=96.87 E-value=0.00051 Score=52.59 Aligned_cols=40 Identities=33% Similarity=0.765 Sum_probs=34.7
Q ss_pred CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
-+|.|. |+. ....+++|.++|. ++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~--g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVR--GEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhc--CCccEECCCCCceeEEec
Confidence 479998 997 7788999999997 889999999999987543
No 53
>KOG2785|consensus
Probab=96.86 E-value=0.0041 Score=53.27 Aligned_cols=146 Identities=19% Similarity=0.345 Sum_probs=92.0
Q ss_pred cccccccccccCChHHHHHHHHh--cc-----------------------------------CCCCeecCCCCccCCCHH
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRN--HT-----------------------------------GEKPFKCTVCDHACSQSA 133 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~--h~-----------------------------------~~~~~~C~~C~~~f~~~~ 133 (343)
-|.|.-|...|.+...-+.|+++ |. ..-++.|..|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 38999999999999888889875 31 113589999999999999
Q ss_pred HHHHHHHHhccCCCCccc---cCCCCcch----------HHH----HHHHHHhhhc---------------CCCCccccc
Q psy5202 134 KLKKHMKIHDEDGDKEDR---EEGPSASS----------LAA----AAAAAALNLC---------------LPNSHMKKE 181 (343)
Q Consensus 134 ~L~~H~~~h~~~~~~~~~---~~~~~~~~----------~~~----~~~~~~~~~~---------------~~~~~~~~~ 181 (343)
....|++........... ..+..... +.. .......... +........
T Consensus 83 a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~ 162 (390)
T KOG2785|consen 83 AHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDED 162 (390)
T ss_pred hHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhc
Confidence 999998754321111000 00000000 000 0000000000 000000111
Q ss_pred CCCCcccccCcccccCHHHHHHHHHHccCC-----------------------CceecCcCc---ccccChHHHHHHHHH
Q psy5202 182 SRRNDTCEFCGKVFKNCSNLTVHRRSHTGE-----------------------KPYKCELCS---YACAQSSKLTRHMKT 235 (343)
Q Consensus 182 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~L~~H~~~ 235 (343)
...|--|-+|++.+.....-..||..+||- .-|.|-.|. +.|.+....+.||..
T Consensus 163 e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 163 ELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred ccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 123467999999999999999999998873 247899999 999999999999975
Q ss_pred h
Q psy5202 236 H 236 (343)
Q Consensus 236 h 236 (343)
-
T Consensus 243 K 243 (390)
T KOG2785|consen 243 K 243 (390)
T ss_pred c
Confidence 3
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.82 E-value=0.00089 Score=33.81 Aligned_cols=20 Identities=20% Similarity=0.747 Sum_probs=9.6
Q ss_pred ecCcCcccccChHHHHHHHH
Q psy5202 215 KCELCSYACAQSSKLTRHMK 234 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~ 234 (343)
.|.+|++.|.+...|+.|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 44444444444444444443
No 55
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.82 E-value=0.00056 Score=34.56 Aligned_cols=23 Identities=30% Similarity=0.778 Sum_probs=21.1
Q ss_pred cccCCCCCCCCChHHHHHHHhhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 68999999999999999999854
No 56
>KOG2482|consensus
Probab=96.62 E-value=0.0094 Score=50.18 Aligned_cols=132 Identities=19% Similarity=0.296 Sum_probs=85.2
Q ss_pred CCeecCCCCccC-CCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCccccc
Q psy5202 118 KPFKCTVCDHAC-SQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFK 196 (343)
Q Consensus 118 ~~~~C~~C~~~f-~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~ 196 (343)
...+|-+|...+ ..++....|+-..|+-....... ......-...... .-..+.|-.|.+.|.
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDn--iVyvnelLehLke--------------kL~r~~CLyCekifr 206 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDN--IVYVNELLEHLKE--------------KLERLRCLYCEKIFR 206 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcc--eeeHHHHHHHHHH--------------HHhhheeeeeccccC
Confidence 346899998766 45667777876655431110000 0000000000000 114678999999999
Q ss_pred CHHHHHHHHHH--ccCC----------------------------------------------------Cc--eecCcCc
Q psy5202 197 NCSNLTVHRRS--HTGE----------------------------------------------------KP--YKCELCS 220 (343)
Q Consensus 197 ~~~~l~~H~~~--h~~~----------------------------------------------------~~--~~C~~C~ 220 (343)
++..|+.||+. |..- .+ .+|-.|.
T Consensus 207 dkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~ 286 (423)
T KOG2482|consen 207 DKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCT 286 (423)
T ss_pred CcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeec
Confidence 99999999985 3210 11 4799999
Q ss_pred ccccChHHHHHHHHHhcC-------------------------CCCcccccCCCCCCCCChHHHHHHHhh
Q psy5202 221 YACAQSSKLTRHMKTHGR-------------------------LGKDVYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 221 ~~f~~~~~L~~H~~~h~~-------------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
...-+...|..||+.-|. .....-.|-.|...|.....|..||.-
T Consensus 287 ~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 287 NFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred cchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 988889999999987551 112234688999999999999999964
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.49 E-value=0.00084 Score=34.65 Aligned_cols=22 Identities=32% Similarity=0.762 Sum_probs=16.0
Q ss_pred cccCCCCCCCCChHHHHHHHhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777754
No 58
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=96.47 E-value=0.0023 Score=32.20 Aligned_cols=22 Identities=41% Similarity=0.872 Sum_probs=14.6
Q ss_pred cccCCCCCCCCChHHHHHHHhhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
..|+.||+.| ....|.+|++.|
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~C 24 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKIC 24 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHhc
Confidence 4577777777 666677776543
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.02 E-value=0.0019 Score=33.33 Aligned_cols=22 Identities=27% Similarity=0.718 Sum_probs=15.4
Q ss_pred eecCcCcccccChHHHHHHHHH
Q psy5202 214 YKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777653
No 60
>KOG2482|consensus
Probab=95.80 E-value=0.016 Score=48.81 Aligned_cols=54 Identities=33% Similarity=0.591 Sum_probs=43.1
Q ss_pred cccccccccccCChHHHHHHHHh--ccCC---------------------------------------------------
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRN--HTGE--------------------------------------------------- 117 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~--h~~~--------------------------------------------------- 117 (343)
.+.|-.|.+.|..+..|+.||+. |...
T Consensus 195 r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~ 274 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDD 274 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCC
Confidence 58899999999999999999985 4221
Q ss_pred -C--CeecCCCCccCCCHHHHHHHHHHhcc
Q psy5202 118 -K--PFKCTVCDHACSQSAKLKKHMKIHDE 144 (343)
Q Consensus 118 -~--~~~C~~C~~~f~~~~~L~~H~~~h~~ 144 (343)
. ...|-+|....-+...|..||+..|.
T Consensus 275 a~a~~v~CLfC~~~~en~~~l~eHmk~vHe 304 (423)
T KOG2482|consen 275 AEALSVVCLFCTNFYENPVFLFEHMKIVHE 304 (423)
T ss_pred CCccceEEEeeccchhhHHHHHHHHHHHHH
Confidence 1 15888888888888888889887764
No 61
>KOG4173|consensus
Probab=95.44 E-value=0.01 Score=46.21 Aligned_cols=82 Identities=23% Similarity=0.498 Sum_probs=63.2
Q ss_pred CCccccc--CcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc--------CCCCcccccCC--CCC
Q psy5202 184 RNDTCEF--CGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG--------RLGKDVYRCRF--CEM 251 (343)
Q Consensus 184 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--------~~~~~~~~C~~--C~~ 251 (343)
..|.|++ |...|.....+..|-..-++ -.|..|.+.|.+...|-.|+.--| ..|...|.|-+ |+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 5677877 77888888777777655443 369999999999999999975432 24677899954 999
Q ss_pred CCCChHHHHHHHhhhcC
Q psy5202 252 PFSVPSTLEKHMRKCVV 268 (343)
Q Consensus 252 ~f~~~~~l~~H~~~~~~ 268 (343)
.|.+...-..|+-..|.
T Consensus 155 KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhHHHHhcc
Confidence 99999999999876554
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=95.39 E-value=0.014 Score=29.29 Aligned_cols=19 Identities=26% Similarity=0.705 Sum_probs=10.1
Q ss_pred ecCcCcccccChHHHHHHHH
Q psy5202 215 KCELCSYACAQSSKLTRHMK 234 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~ 234 (343)
.|+.||+.| ....|.+|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4455555544
No 63
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=94.88 E-value=0.072 Score=37.78 Aligned_cols=109 Identities=22% Similarity=0.339 Sum_probs=62.7
Q ss_pred ccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhh
Q psy5202 92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNL 171 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (343)
|.|+.|+-.. =+-|..|+.|+-.......|.|-. ||--......+. +.
T Consensus 2 Y~CPrC~skv--------------C~LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev-~~--------------- 49 (112)
T TIGR00622 2 YFCPQCRAKV--------------CELPVECPICGLTLILSTHLARSY--HHLFPLKAFQEI-PL--------------- 49 (112)
T ss_pred ccCCCCCCCc--------------cCCCCcCCcCCCEEeccchHHHhh--hccCCCcccccc-cc---------------
Confidence 6777776544 235678999998888877777652 331100000000 00
Q ss_pred cCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202 172 CLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE 250 (343)
Q Consensus 172 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~ 250 (343)
......-.|--|...|........ ..-.....|+|+.|...|-..-++..|...|. |+.|.
T Consensus 50 --------~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~--------CPGC~ 110 (112)
T TIGR00622 50 --------EEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVFCVDCDVFVHESLHC--------CPGCI 110 (112)
T ss_pred --------cccCCCCcccCcCCCCCCcccccc--cccccccceeCCCCCCccccccchhhhhhccC--------CcCCC
Confidence 000022348888888876431110 00122346889999999888888888877773 77775
No 64
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=94.66 E-value=0.027 Score=30.89 Aligned_cols=23 Identities=22% Similarity=0.732 Sum_probs=19.9
Q ss_pred ccccCCCCCCCCChHHHHHHHhh
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
+|.|.+|++.|.+...+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57899999999999999999864
No 65
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=94.36 E-value=0.02 Score=31.00 Aligned_cols=12 Identities=25% Similarity=0.927 Sum_probs=6.7
Q ss_pred cccccCcccccC
Q psy5202 186 DTCEFCGKVFKN 197 (343)
Q Consensus 186 ~~C~~C~~~f~~ 197 (343)
|+|..||..+..
T Consensus 2 ~~C~~CGy~y~~ 13 (33)
T cd00350 2 YVCPVCGYIYDG 13 (33)
T ss_pred EECCCCCCEECC
Confidence 456666655543
No 66
>KOG4173|consensus
Probab=94.24 E-value=0.021 Score=44.50 Aligned_cols=26 Identities=27% Similarity=0.473 Sum_probs=18.7
Q ss_pred CCCceecC--cCcccccChHHHHHHHHH
Q psy5202 210 GEKPYKCE--LCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 210 ~~~~~~C~--~C~~~f~~~~~L~~H~~~ 235 (343)
|...|+|- .|+..|.+...-..|+-.
T Consensus 141 G~dMy~ClvEgCt~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 141 GQDMYQCLVEGCTEKFKTSRDRKDHMIR 168 (253)
T ss_pred CccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence 45567884 488888888888888643
No 67
>KOG2893|consensus
Probab=93.84 E-value=0.015 Score=46.42 Aligned_cols=47 Identities=28% Similarity=0.572 Sum_probs=38.4
Q ss_pred cccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhc
Q psy5202 93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHD 143 (343)
Q Consensus 93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 143 (343)
-|=+|++.|.+..-|..|++. +.|+|.+|.+...+--.|..|....|
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhh
Confidence 488999999999999999876 34999999998887777777754443
No 68
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.76 E-value=0.027 Score=33.66 Aligned_cols=28 Identities=29% Similarity=0.590 Sum_probs=16.7
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
|+..++|+.||..|....++.+|+.+-|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4555666666666666666666665544
No 69
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.72 E-value=0.053 Score=29.71 Aligned_cols=23 Identities=22% Similarity=0.598 Sum_probs=18.4
Q ss_pred ceecCcCcccccChHHHHHHHHH
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
+|.|.+|+..|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888888888888754
No 70
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=93.42 E-value=0.048 Score=38.77 Aligned_cols=30 Identities=23% Similarity=0.424 Sum_probs=24.0
Q ss_pred eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~ 256 (343)
..|+.||.+|... ++.|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-------------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-------------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-------------CCCCccCCCCCCccCcc
Confidence 4689999888762 56788899999988766
No 71
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.36 E-value=0.017 Score=52.90 Aligned_cols=54 Identities=35% Similarity=0.632 Sum_probs=49.6
Q ss_pred CCcccccCcccccCHHHHHHHHH--HccCC--CceecC--cCcccccChHHHHHHHHHhc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRR--SHTGE--KPYKCE--LCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~ 237 (343)
..+.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.+...+..|...|.
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 347 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccccc
Confidence 36889999999999999999999 89999 999999 79999999999999988886
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=93.28 E-value=0.18 Score=36.15 Aligned_cols=25 Identities=36% Similarity=0.541 Sum_probs=22.5
Q ss_pred ccc----CCCCCCCCChHHHHHHHhhhcC
Q psy5202 244 YRC----RFCEMPFSVPSTLEKHMRKCVV 268 (343)
Q Consensus 244 ~~C----~~C~~~f~~~~~l~~H~~~~~~ 268 (343)
|.| ..|++.+.+...+..|++.+|+
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 789 9999999999999999998875
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.20 E-value=0.04 Score=32.97 Aligned_cols=29 Identities=21% Similarity=0.387 Sum_probs=22.3
Q ss_pred CCCCCcccccccccccCChHHHHHHHHhc
Q psy5202 86 SQEEKNKECEFCGKKFKYENTLQAHRRNH 114 (343)
Q Consensus 86 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h 114 (343)
-.++.-+.|+.|+..|.....+.+|+..-
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKa 40 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKA 40 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHH
Confidence 34666788888888888888888887643
No 74
>KOG2893|consensus
Probab=92.98 E-value=0.023 Score=45.34 Aligned_cols=41 Identities=27% Similarity=0.661 Sum_probs=24.5
Q ss_pred cccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHH
Q psy5202 188 CEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRH 232 (343)
Q Consensus 188 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H 232 (343)
|-+|++.|.+..-|..|++. +-|+|.+|.+...+--.|..|
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence 66666666666666666554 236666666655555555555
No 75
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=92.89 E-value=0.079 Score=28.89 Aligned_cols=11 Identities=27% Similarity=0.993 Sum_probs=6.8
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|+|..||+.+.
T Consensus 3 ~~C~~CG~i~~ 13 (34)
T cd00729 3 WVCPVCGYIHE 13 (34)
T ss_pred EECCCCCCEeE
Confidence 66777775543
No 76
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=92.69 E-value=0.25 Score=35.45 Aligned_cols=54 Identities=17% Similarity=0.391 Sum_probs=43.5
Q ss_pred CcccccccccccCChHHHHHHHHh-ccCC-----------------------------------------CCeec----C
Q psy5202 90 KNKECEFCGKKFKYENTLQAHRRN-HTGE-----------------------------------------KPFKC----T 123 (343)
Q Consensus 90 ~~~~C~~C~~~f~~~~~L~~H~~~-h~~~-----------------------------------------~~~~C----~ 123 (343)
+-..|..|+....- ..+..|++. |+.. .-|.| .
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~~ 88 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDPP 88 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCCC
Confidence 45689999988775 888999884 4221 12899 9
Q ss_pred CCCccCCCHHHHHHHHHHhcc
Q psy5202 124 VCDHACSQSAKLKKHMKIHDE 144 (343)
Q Consensus 124 ~C~~~f~~~~~L~~H~~~h~~ 144 (343)
.|++.+.+...|.+|++.+|+
T Consensus 89 ~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 89 HCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCCcEeccHHHHHHHHHHhcC
Confidence 999999999999999998875
No 77
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=90.80 E-value=0.51 Score=33.55 Aligned_cols=77 Identities=18% Similarity=0.246 Sum_probs=52.1
Q ss_pred CCcccccCcccccCHHHHHHHHHHcc-------CCC-------ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCC
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHT-------GEK-------PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFC 249 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~-------~~~-------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C 249 (343)
-+..|+.||.+......|.+-- |+ .+. ...|--|...|........= . ......|+|+.|
T Consensus 14 LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~--~~~~~~y~C~~C 87 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--E--LKDSHRYVCAVC 87 (112)
T ss_pred CCCcCCcCCCEEeccchHHHhh--hccCCCcccccccccccCCCCcccCcCCCCCCccccccc--c--cccccceeCCCC
Confidence 5678999999998887776542 22 011 12399999999764311100 0 013457999999
Q ss_pred CCCCCChHHHHHHHhhh
Q psy5202 250 EMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 250 ~~~f~~~~~l~~H~~~~ 266 (343)
...|--.-++..|..-|
T Consensus 88 ~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 88 KNVFCVDCDVFVHESLH 104 (112)
T ss_pred CCccccccchhhhhhcc
Confidence 99999999999997654
No 78
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=90.78 E-value=0.18 Score=35.92 Aligned_cols=32 Identities=25% Similarity=0.608 Sum_probs=26.1
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS 226 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 226 (343)
....|+.||+.|-.. +..|-.|++||..|.-.
T Consensus 8 tKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 8 TKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred CcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 456899999999653 34788999999999766
No 79
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=90.37 E-value=0.18 Score=36.44 Aligned_cols=33 Identities=9% Similarity=0.053 Sum_probs=24.0
Q ss_pred eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHH
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTL 259 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l 259 (343)
..|+.||++|... ++.|..|++||..|.-...+
T Consensus 10 r~Cp~cg~kFYDL-------------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 10 RICPNTGSKFYDL-------------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred ccCCCcCcccccc-------------CCCCccCCCcCCccCcchhh
Confidence 5688888888652 56788899999887655333
No 80
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=90.36 E-value=0.18 Score=38.71 Aligned_cols=23 Identities=26% Similarity=0.807 Sum_probs=17.6
Q ss_pred ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE 250 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~ 250 (343)
.|.|++||+.+. |+.|-+||+||
T Consensus 134 ~~vC~vCGy~~~---------------ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTHE---------------GEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCccc---------------CCCCCcCCCCC
Confidence 588888886542 57788888888
No 81
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=90.29 E-value=0.13 Score=28.80 Aligned_cols=14 Identities=14% Similarity=0.387 Sum_probs=7.1
Q ss_pred eecCcCcccccChH
Q psy5202 214 YKCELCSYACAQSS 227 (343)
Q Consensus 214 ~~C~~C~~~f~~~~ 227 (343)
+.|+.|+..|.-..
T Consensus 3 ~~CP~C~~~~~v~~ 16 (38)
T TIGR02098 3 IQCPNCKTSFRVVD 16 (38)
T ss_pred EECCCCCCEEEeCH
Confidence 34555555554443
No 82
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=90.17 E-value=0.067 Score=43.58 Aligned_cols=45 Identities=16% Similarity=0.348 Sum_probs=24.4
Q ss_pred CceecCcCcccccChHHHHHHHHHhc-C-------CCCc-----ccccCCCCCCCCCh
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTHG-R-------LGKD-----VYRCRFCEMPFSVP 256 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h~-~-------~~~~-----~~~C~~C~~~f~~~ 256 (343)
+.+.||+|+..|....-+....+.-. . .+.. +..|+.||++|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 44566666666665544444433211 0 0122 25799999987654
No 83
>KOG2807|consensus
Probab=89.98 E-value=0.65 Score=39.32 Aligned_cols=31 Identities=29% Similarity=0.623 Sum_probs=21.6
Q ss_pred CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE 250 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~ 250 (343)
..|+|..|...|-.--+...|...|. |+.|.
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh~--------CpgCe 374 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLHN--------CPGCE 374 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhhc--------CCCcC
Confidence 45778777777777777777766663 66665
No 84
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=89.96 E-value=0.18 Score=29.45 Aligned_cols=25 Identities=28% Similarity=0.733 Sum_probs=12.0
Q ss_pred CcccccCCCCCCCCCh----HHHHHHHhh
Q psy5202 241 KDVYRCRFCEMPFSVP----STLEKHMRK 265 (343)
Q Consensus 241 ~~~~~C~~C~~~f~~~----~~l~~H~~~ 265 (343)
....+|.+|++.+... ..|.+|++.
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 3445666666655543 566666643
No 85
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=89.19 E-value=0.064 Score=43.69 Aligned_cols=44 Identities=27% Similarity=0.557 Sum_probs=30.0
Q ss_pred CCcccccccccccCChHHHHHHHHhc---c-------CCC-----CeecCCCCccCCCH
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNH---T-------GEK-----PFKCTVCDHACSQS 132 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h---~-------~~~-----~~~C~~C~~~f~~~ 132 (343)
++.+.|++|+..|.+..-.....+.- . +.. ...|+.||+.|...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 46789999999999876555444321 1 112 26899999987644
No 86
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=88.81 E-value=0.29 Score=27.07 Aligned_cols=14 Identities=14% Similarity=0.406 Sum_probs=6.8
Q ss_pred ecCcCcccccChHH
Q psy5202 215 KCELCSYACAQSSK 228 (343)
Q Consensus 215 ~C~~C~~~f~~~~~ 228 (343)
.|+.|+..|.-...
T Consensus 4 ~Cp~C~~~y~i~d~ 17 (36)
T PF13717_consen 4 TCPNCQAKYEIDDE 17 (36)
T ss_pred ECCCCCCEEeCCHH
Confidence 45555555544433
No 87
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=88.74 E-value=0.28 Score=27.28 Aligned_cols=11 Identities=36% Similarity=0.721 Sum_probs=5.4
Q ss_pred ccccCCCCCCC
Q psy5202 243 VYRCRFCEMPF 253 (343)
Q Consensus 243 ~~~C~~C~~~f 253 (343)
..+|+.|+..|
T Consensus 25 ~vrC~~C~~~f 35 (37)
T PF13719_consen 25 KVRCPKCGHVF 35 (37)
T ss_pred EEECCCCCcEe
Confidence 44555555444
No 88
>PHA00626 hypothetical protein
Probab=88.49 E-value=0.18 Score=30.36 Aligned_cols=14 Identities=36% Similarity=0.429 Sum_probs=8.8
Q ss_pred cccccCCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFSV 255 (343)
Q Consensus 242 ~~~~C~~C~~~f~~ 255 (343)
..|.|+.||+.|+.
T Consensus 22 nrYkCkdCGY~ft~ 35 (59)
T PHA00626 22 DDYVCCDCGYNDSK 35 (59)
T ss_pred cceEcCCCCCeech
Confidence 45677777766643
No 89
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=87.92 E-value=0.54 Score=28.96 Aligned_cols=10 Identities=30% Similarity=0.919 Sum_probs=6.2
Q ss_pred cccccCCCCC
Q psy5202 242 DVYRCRFCEM 251 (343)
Q Consensus 242 ~~~~C~~C~~ 251 (343)
.+|.|+.||.
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 4566666664
No 90
>KOG2186|consensus
Probab=87.47 E-value=0.39 Score=39.08 Aligned_cols=47 Identities=21% Similarity=0.398 Sum_probs=37.9
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHH
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMK 234 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 234 (343)
.|.|..||....-. .+.+|+-..++ ..|.|--|++.|.. ..+..|..
T Consensus 3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 48899999887654 46779888777 66999999999988 77888854
No 91
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=87.02 E-value=0.29 Score=28.78 Aligned_cols=11 Identities=36% Similarity=0.920 Sum_probs=5.9
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|+|+.||..|.
T Consensus 4 y~C~~CG~~~~ 14 (46)
T PRK00398 4 YKCARCGREVE 14 (46)
T ss_pred EECCCCCCEEE
Confidence 55555555544
No 92
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=86.95 E-value=0.73 Score=28.29 Aligned_cols=10 Identities=40% Similarity=1.082 Sum_probs=5.8
Q ss_pred cccccCCCCC
Q psy5202 242 DVYRCRFCEM 251 (343)
Q Consensus 242 ~~~~C~~C~~ 251 (343)
.+|+|+.||.
T Consensus 49 ~~Y~Cp~CGF 58 (61)
T COG2888 49 NPYRCPKCGF 58 (61)
T ss_pred CceECCCcCc
Confidence 4566666654
No 93
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=86.85 E-value=0.74 Score=23.29 Aligned_cols=21 Identities=19% Similarity=0.618 Sum_probs=14.6
Q ss_pred ccCCCCCCCCChHHHHHHHhhh
Q psy5202 245 RCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 245 ~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
.|++|++.+ ....+..|+..+
T Consensus 3 ~CPiC~~~v-~~~~in~HLD~C 23 (26)
T smart00734 3 QCPVCFREV-PENLINSHLDSC 23 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHHHh
Confidence 577888777 556777777644
No 94
>KOG2186|consensus
Probab=86.76 E-value=0.44 Score=38.80 Aligned_cols=51 Identities=24% Similarity=0.480 Sum_probs=42.1
Q ss_pred ccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccC
Q psy5202 92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDED 145 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 145 (343)
|.|..||..... ..+.+|+...++ .-|.|--|++.|.. ..+..|..-.+..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEa 54 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEA 54 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchH
Confidence 789999999874 557889988777 67999999999998 7888898765543
No 95
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=86.76 E-value=0.6 Score=39.16 Aligned_cols=55 Identities=25% Similarity=0.464 Sum_probs=35.0
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE 250 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~ 250 (343)
-.|-.|.-.|.....-..- .-+....|+|+.|...|-.--+...|...|. |..|.
T Consensus 363 ~~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~--------C~gCe 417 (421)
T COG5151 363 THCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCKSTFCSDCDVFIHETLHF--------CIGCE 417 (421)
T ss_pred ccceeccCCCCCCCCCccc--ccccccceechhhhhhhhhhhHHHHHHHHhh--------CCCCc
Confidence 3477777777654221000 1122346899999999988888888877773 66665
No 96
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=86.51 E-value=0.59 Score=25.00 Aligned_cols=11 Identities=27% Similarity=0.706 Sum_probs=5.8
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45666665554
No 97
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=86.29 E-value=0.4 Score=28.37 Aligned_cols=11 Identities=27% Similarity=0.773 Sum_probs=5.8
Q ss_pred CcccccCcccc
Q psy5202 185 NDTCEFCGKVF 195 (343)
Q Consensus 185 ~~~C~~C~~~f 195 (343)
.|.|..||+.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 35555555555
No 98
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=86.27 E-value=0.68 Score=44.75 Aligned_cols=40 Identities=20% Similarity=0.528 Sum_probs=28.7
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
+.|+.|+..+. .|......+|.+||+. ...|..|+.||-.
T Consensus 445 ~~Cp~Cd~~lt----------~H~~~~~L~CH~Cg~~-----------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 445 AECPNCDSPLT----------LHKATGQLRCHYCGYQ-----------------EPIPQSCPECGSE 484 (730)
T ss_pred ccCCCCCcceE----------EecCCCeeEeCCCCCC-----------------CCCCCCCCCCCCC
Confidence 56888876643 3555577899999932 3578899999965
No 99
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=86.15 E-value=0.95 Score=34.89 Aligned_cols=34 Identities=12% Similarity=0.231 Sum_probs=25.6
Q ss_pred cCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202 209 TGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF 253 (343)
Q Consensus 209 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f 253 (343)
.+..-|.|+.|+..|+....+. .-|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~-----------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME-----------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH-----------cCCcCCCCCCEe
Confidence 3456688999999888887774 138899999653
No 100
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=85.76 E-value=0.55 Score=33.98 Aligned_cols=35 Identities=14% Similarity=0.238 Sum_probs=27.0
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHH
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKL 229 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L 229 (343)
....|+.||++|... +..|..|++||..|.....+
T Consensus 8 tKr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 8 TKRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred ccccCCCcCcccccc-----------CCCCccCCCcCCccCcchhh
Confidence 457899999999653 45789999999998665333
No 101
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=85.39 E-value=0.4 Score=32.01 Aligned_cols=13 Identities=31% Similarity=1.035 Sum_probs=6.9
Q ss_pred cccccCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFS 254 (343)
Q Consensus 242 ~~~~C~~C~~~f~ 254 (343)
..+.|..||..|.
T Consensus 52 GIW~C~kCg~~fA 64 (89)
T COG1997 52 GIWKCRKCGAKFA 64 (89)
T ss_pred CeEEcCCCCCeec
Confidence 4455555555554
No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=85.17 E-value=1.1 Score=34.07 Aligned_cols=38 Identities=16% Similarity=0.300 Sum_probs=24.1
Q ss_pred CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202 210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF 253 (343)
Q Consensus 210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f 253 (343)
+...|.|+.|+..|.....+.. .. ....|.|+.||...
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~----~d--~~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL----LD--MDGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh----cC--CCCcEECCCCCCEE
Confidence 3456889999988886544432 01 12338899998754
No 103
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=85.12 E-value=0.16 Score=38.69 Aligned_cols=44 Identities=16% Similarity=0.355 Sum_probs=24.4
Q ss_pred ccccccccccCCh---HHHHHHHHhccCCCCeecCCCCccCCCHHHHHHH
Q psy5202 92 KECEFCGKKFKYE---NTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKH 138 (343)
Q Consensus 92 ~~C~~C~~~f~~~---~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H 138 (343)
++|++||..+... ..+.. ..+ -.+.++|+.||+.|.+...+..-
T Consensus 1 m~cp~c~~~~~~~~~s~~~~~--~~~-~~~~~~c~~c~~~f~~~e~~~~~ 47 (154)
T PRK00464 1 MRCPFCGHPDTRVIDSRPAED--GNA-IRRRRECLACGKRFTTFERVELV 47 (154)
T ss_pred CcCCCCCCCCCEeEeccccCC--CCc-eeeeeeccccCCcceEeEeccCc
Confidence 4799999665211 11111 001 11238899999999876654433
No 104
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=84.83 E-value=0.54 Score=36.20 Aligned_cols=25 Identities=32% Similarity=0.783 Sum_probs=20.6
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY 221 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 221 (343)
+.|+|++||..+ -|+.|-+||+||-
T Consensus 133 ~~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 133 KVWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred CEEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 379999999764 4578999999993
No 105
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=84.77 E-value=0.61 Score=27.10 Aligned_cols=27 Identities=19% Similarity=0.508 Sum_probs=20.8
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
.|.|..||..|... ...+..|+.||..
T Consensus 2 ~Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 2 IYICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EEECCCCCCEeecC-----------CCCceECCCCCce
Confidence 48999999988743 3456899999864
No 106
>PRK04023 DNA polymerase II large subunit; Validated
Probab=84.58 E-value=1.1 Score=44.21 Aligned_cols=13 Identities=15% Similarity=0.225 Sum_probs=8.6
Q ss_pred ccccCCCCCCCCC
Q psy5202 243 VYRCRFCEMPFSV 255 (343)
Q Consensus 243 ~~~C~~C~~~f~~ 255 (343)
++.|+.||..-..
T Consensus 663 ~y~CPKCG~El~~ 675 (1121)
T PRK04023 663 EDECEKCGREPTP 675 (1121)
T ss_pred CCcCCCCCCCCCc
Confidence 4678888865443
No 107
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=84.36 E-value=0.7 Score=27.73 Aligned_cols=21 Identities=33% Similarity=0.863 Sum_probs=11.8
Q ss_pred cccCCCCCCCCCh-----HHHHHHHh
Q psy5202 244 YRCRFCEMPFSVP-----STLEKHMR 264 (343)
Q Consensus 244 ~~C~~C~~~f~~~-----~~l~~H~~ 264 (343)
-.|.+|++.+... +.|.+|++
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 4466666555443 45666665
No 108
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=84.14 E-value=0.18 Score=38.45 Aligned_cols=15 Identities=20% Similarity=0.416 Sum_probs=8.0
Q ss_pred cccCCCCCCCCChHH
Q psy5202 244 YRCRFCEMPFSVPST 258 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~ 258 (343)
++|+.||++|.+...
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 555555555555443
No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=83.85 E-value=1.2 Score=35.09 Aligned_cols=35 Identities=14% Similarity=0.329 Sum_probs=26.3
Q ss_pred CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCC
Q psy5202 210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSV 255 (343)
Q Consensus 210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~ 255 (343)
...-|.|+.|+..|+....+. .-|.|+.||.....
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~-----------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME-----------YGFRCPQCGEMLEE 148 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh-----------cCCcCCCCCCCCee
Confidence 345699999999998887752 24899999976543
No 110
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=82.98 E-value=0.27 Score=29.79 Aligned_cols=29 Identities=21% Similarity=0.627 Sum_probs=20.3
Q ss_pred ccccccccccCChHHHHHHHHhccCCCCeecCCCCc
Q psy5202 92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDH 127 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~ 127 (343)
|+|..|+..|.....+. ......|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~-------~~~~~~CP~Cg~ 34 (52)
T TIGR02605 6 YRCTACGHRFEVLQKMS-------DDPLATCPECGG 34 (52)
T ss_pred EEeCCCCCEeEEEEecC-------CCCCCCCCCCCC
Confidence 89999999997543221 134567999986
No 111
>PF15269 zf-C2H2_7: Zinc-finger
Probab=82.40 E-value=1 Score=25.74 Aligned_cols=22 Identities=45% Similarity=0.970 Sum_probs=16.7
Q ss_pred eecCcCcccccChHHHHHHHHH
Q psy5202 214 YKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
|+|-.|..+..-++.|..||+.
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 6777777777777777777764
No 112
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=82.17 E-value=1.3 Score=33.67 Aligned_cols=38 Identities=18% Similarity=0.449 Sum_probs=25.5
Q ss_pred CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccC
Q psy5202 88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHAC 129 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 129 (343)
...-|.|+.|+..|.....+.. .. ....|.|+.||...
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEE
Confidence 3457999999999985443322 11 23449999999764
No 113
>KOG4167|consensus
Probab=81.62 E-value=3.8 Score=38.95 Aligned_cols=29 Identities=28% Similarity=0.410 Sum_probs=24.8
Q ss_pred CCCCcccccccccccCChHHHHHHHHhcc
Q psy5202 87 QEEKNKECEFCGKKFKYENTLQAHRRNHT 115 (343)
Q Consensus 87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 115 (343)
....-|.|..|+++|....++..||++|.
T Consensus 788 ~~~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 788 DPTGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CCCceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 34567999999999999999999999984
No 114
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=81.42 E-value=0.75 Score=36.23 Aligned_cols=32 Identities=16% Similarity=0.275 Sum_probs=26.2
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA 224 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 224 (343)
..|.|+.|+..|..-..+. .-|.|+.||....
T Consensus 116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 6799999999998877653 3699999997654
No 115
>KOG4377|consensus
Probab=81.36 E-value=7.7 Score=34.29 Aligned_cols=27 Identities=22% Similarity=0.533 Sum_probs=21.2
Q ss_pred Ceec--CCCCccCCCHHHHHHHHHHhccC
Q psy5202 119 PFKC--TVCDHACSQSAKLKKHMKIHDED 145 (343)
Q Consensus 119 ~~~C--~~C~~~f~~~~~L~~H~~~h~~~ 145 (343)
.|.| +.|++.+..+..+.+|.++|...
T Consensus 271 hyhcl~e~C~ykr~~k~DvirH~~~hkkr 299 (480)
T KOG4377|consen 271 HYHCLNEYCFYKRGQKNDVIRHVEIHKKR 299 (480)
T ss_pred hhcccCccccccccchhhhHHHHHHHhhc
Confidence 3566 45998888899999999988643
No 116
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=80.92 E-value=1.4 Score=37.72 Aligned_cols=23 Identities=22% Similarity=0.446 Sum_probs=20.5
Q ss_pred CCcccccCcccccCHHHHHHHHH
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRR 206 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~ 206 (343)
..+-|..|++.|....-+..|+.
T Consensus 237 ~~~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 237 PKVYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred cceeeHhhhhHhhhhHHHHHHHh
Confidence 56889999999999999999975
No 117
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=80.88 E-value=0.34 Score=27.55 Aligned_cols=31 Identities=23% Similarity=0.581 Sum_probs=21.3
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
.|+|..||..|....... ......|+.||..
T Consensus 5 ~y~C~~Cg~~fe~~~~~~-------~~~~~~CP~Cg~~ 35 (41)
T smart00834 5 EYRCEDCGHTFEVLQKIS-------DDPLATCPECGGD 35 (41)
T ss_pred EEEcCCCCCEEEEEEecC-------CCCCCCCCCCCCc
Confidence 389999999987443321 1355679999873
No 118
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=80.80 E-value=2.1 Score=32.96 Aligned_cols=20 Identities=10% Similarity=0.100 Sum_probs=14.1
Q ss_pred CCCeecCCCCccCCCHHHHH
Q psy5202 117 EKPFKCTVCDHACSQSAKLK 136 (343)
Q Consensus 117 ~~~~~C~~C~~~f~~~~~L~ 136 (343)
..-|.|+.|+..|.....+.
T Consensus 107 ~~~Y~Cp~c~~r~tf~eA~~ 126 (158)
T TIGR00373 107 NMFFICPNMCVRFTFNEAME 126 (158)
T ss_pred CCeEECCCCCcEeeHHHHHH
Confidence 45578888888777766653
No 119
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=79.87 E-value=1.1 Score=31.36 Aligned_cols=27 Identities=4% Similarity=-0.054 Sum_probs=20.7
Q ss_pred ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCC
Q psy5202 215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFS 254 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~ 254 (343)
.|+.|++.|... ...|..|++||++|.
T Consensus 11 idPetg~KFYDL-------------NrdPiVsPytG~s~P 37 (129)
T COG4530 11 IDPETGKKFYDL-------------NRDPIVSPYTGKSYP 37 (129)
T ss_pred cCccccchhhcc-------------CCCccccCcccccch
Confidence 578888888642 567888999999883
No 120
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=79.73 E-value=0.51 Score=30.72 Aligned_cols=17 Identities=18% Similarity=0.440 Sum_probs=9.6
Q ss_pred CcccccC--CCCCCCCChH
Q psy5202 241 KDVYRCR--FCEMPFSVPS 257 (343)
Q Consensus 241 ~~~~~C~--~C~~~f~~~~ 257 (343)
+.-++|. .||.+|....
T Consensus 25 ~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 25 ERYHQCQNVNCSATFITYE 43 (72)
T ss_pred eeeeecCCCCCCCEEEEEE
Confidence 4455665 5666665543
No 121
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=79.29 E-value=0.41 Score=31.13 Aligned_cols=41 Identities=17% Similarity=0.316 Sum_probs=26.8
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecC--cCcccccChHH
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE--LCSYACAQSSK 228 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~~ 228 (343)
+.|+.||..-.....-..+. ...+..++|. .||.+|.....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~--~~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITD--TTKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcCh--hhheeeeecCCCCCCCEEEEEEE
Confidence 67999997764333322222 2456778998 89999986544
No 122
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=79.22 E-value=0.81 Score=26.29 Aligned_cols=11 Identities=36% Similarity=0.960 Sum_probs=5.6
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 45555555444
No 123
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.18 E-value=1.8 Score=40.51 Aligned_cols=41 Identities=22% Similarity=0.454 Sum_probs=26.6
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
...|+.|+-.. ..|.......|.+||+.. ..+..|+.||-.
T Consensus 222 ~~~C~~C~~~l----------~~h~~~~~l~Ch~Cg~~~-----------------~~~~~Cp~C~s~ 262 (505)
T TIGR00595 222 ILCCPNCDVSL----------TYHKKEGKLRCHYCGYQE-----------------PIPKTCPQCGSE 262 (505)
T ss_pred ccCCCCCCCce----------EEecCCCeEEcCCCcCcC-----------------CCCCCCCCCCCC
Confidence 35677776443 234556678899999443 346789999864
No 124
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=79.09 E-value=1.3 Score=35.92 Aligned_cols=31 Identities=26% Similarity=0.532 Sum_probs=20.1
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQ 270 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 270 (343)
.+..|.|+.|+|.|.-..-+.+|+...|...
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 4556778888888888888888887766543
No 125
>KOG2071|consensus
Probab=78.88 E-value=1.7 Score=40.19 Aligned_cols=26 Identities=23% Similarity=0.535 Sum_probs=21.7
Q ss_pred CCcccccCcccccCHHHHHHHHHHcc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHT 209 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 209 (343)
.+..|..||.+|.+......||..|.
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CcchhcccccccccchhhhhHhhhhh
Confidence 56789999999999888888877664
No 126
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=77.62 E-value=1.2 Score=32.98 Aligned_cols=27 Identities=44% Similarity=0.678 Sum_probs=17.3
Q ss_pred CCcccccccccccCChHHHHHHHHhccCCC
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNHTGEK 118 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 118 (343)
+.-..|-+||+.|.. |.+|++.|+|..
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred cCeeEEccCCcccch---HHHHHHHccCCC
Confidence 344689999999975 699999998753
No 127
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=77.44 E-value=1.1 Score=36.64 Aligned_cols=32 Identities=13% Similarity=0.146 Sum_probs=24.5
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQN 271 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~ 271 (343)
..+++.||.||+.......|-.-.|+|...+.
T Consensus 206 k~k~~PCPKCg~et~eTkdLSmStR~hkyGRQ 237 (314)
T PF06524_consen 206 KGKPIPCPKCGYETQETKDLSMSTRSHKYGRQ 237 (314)
T ss_pred cCCCCCCCCCCCcccccccceeeeecchhccc
Confidence 45789999999998888888777777655443
No 128
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=77.14 E-value=0.96 Score=37.02 Aligned_cols=49 Identities=24% Similarity=0.518 Sum_probs=30.5
Q ss_pred CCcccccCcccccCHHHHHHHHHHc----cCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSH----TGEKPYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h----~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
..|.|-.|..-|-. .|++.. ...+++.|+.|++-......|..-.|.|.
T Consensus 181 Gq~sCLRCK~cfCd-----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 181 GQYSCLRCKICFCD-----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred cchhhhheeeeehh-----hhhhhcccccccCCCCCCCCCCCcccccccceeeeecch
Confidence 34555555555533 355432 23478999999998887777765555553
No 129
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=77.11 E-value=3 Score=42.44 Aligned_cols=42 Identities=24% Similarity=0.584 Sum_probs=25.8
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCC
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSV 255 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~ 255 (343)
..|+.||... +..|.|+.||....... .+ ...|+.||.....
T Consensus 680 ~fCP~CGs~t---------------e~vy~CPsCGaev~~de-----------s~--a~~CP~CGtplv~ 721 (1337)
T PRK14714 680 NRCPDCGTHT---------------EPVYVCPDCGAEVPPDE-----------SG--RVECPRCDVELTP 721 (1337)
T ss_pred ccCcccCCcC---------------CCceeCccCCCccCCCc-----------cc--cccCCCCCCcccc
Confidence 3677787664 23578999997543310 12 4469999965443
No 130
>PRK14873 primosome assembly protein PriA; Provisional
Probab=76.99 E-value=2.1 Score=41.39 Aligned_cols=40 Identities=20% Similarity=0.492 Sum_probs=26.6
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
...|+.|+-.... |.......|.+||+. ..++.|+.||..
T Consensus 392 ~~~C~~C~~~L~~----------h~~~~~l~Ch~CG~~------------------~~p~~Cp~Cgs~ 431 (665)
T PRK14873 392 PARCRHCTGPLGL----------PSAGGTPRCRWCGRA------------------APDWRCPRCGSD 431 (665)
T ss_pred eeECCCCCCceeE----------ecCCCeeECCCCcCC------------------CcCccCCCCcCC
Confidence 4567777755432 444567889999942 236789999865
No 131
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=76.95 E-value=1.3 Score=27.55 Aligned_cols=21 Identities=19% Similarity=0.428 Sum_probs=8.8
Q ss_pred hHHHHHHHHHhcCCCCcccccCC
Q psy5202 226 SSKLTRHMKTHGRLGKDVYRCRF 248 (343)
Q Consensus 226 ~~~L~~H~~~h~~~~~~~~~C~~ 248 (343)
+..|..|+...- ..++..|++
T Consensus 23 r~~l~~H~~~~C--~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENEC--PKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTS--TTSEEE-SS
T ss_pred HHHHHHHHHccC--CCCcEECCC
Confidence 345555554222 234445555
No 132
>PRK04023 DNA polymerase II large subunit; Validated
Probab=76.88 E-value=5.6 Score=39.60 Aligned_cols=25 Identities=28% Similarity=0.638 Sum_probs=18.2
Q ss_pred CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202 88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
......|+.||... ..+.|+.||..
T Consensus 623 EVg~RfCpsCG~~t----------------~~frCP~CG~~ 647 (1121)
T PRK04023 623 EIGRRKCPSCGKET----------------FYRRCPFCGTH 647 (1121)
T ss_pred cccCccCCCCCCcC----------------CcccCCCCCCC
Confidence 44567899999873 23788888865
No 133
>KOG2071|consensus
Probab=76.85 E-value=8.2 Score=36.00 Aligned_cols=30 Identities=20% Similarity=0.455 Sum_probs=24.3
Q ss_pred CCCCCcccccccccccCChHHHHHHHHhcc
Q psy5202 86 SQEEKNKECEFCGKKFKYENTLQAHRRNHT 115 (343)
Q Consensus 86 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 115 (343)
.....+.+|..||.+|........||..|.
T Consensus 413 Ly~~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 413 LYKDSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred hccCCcchhcccccccccchhhhhHhhhhh
Confidence 345677899999999999888888777763
No 134
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=76.03 E-value=1.6 Score=22.10 Aligned_cols=10 Identities=30% Similarity=0.810 Sum_probs=5.9
Q ss_pred ccCCCCCCCC
Q psy5202 245 RCRFCEMPFS 254 (343)
Q Consensus 245 ~C~~C~~~f~ 254 (343)
.|+.||+.|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4666666653
No 135
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=75.58 E-value=1.3 Score=32.85 Aligned_cols=26 Identities=23% Similarity=0.351 Sum_probs=15.4
Q ss_pred cccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202 242 DVYRCRFCEMPFSVPSTLEKHMRKCVVNQ 270 (343)
Q Consensus 242 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 270 (343)
.-..|-+||+.|... .+|+++||+-.
T Consensus 71 d~i~clecGk~~k~L---krHL~~~~glt 96 (132)
T PF05443_consen 71 DYIICLECGKKFKTL---KRHLRTHHGLT 96 (132)
T ss_dssp S-EE-TBT--EESBH---HHHHHHTT-S-
T ss_pred CeeEEccCCcccchH---HHHHHHccCCC
Confidence 346788899888765 88999887533
No 136
>PF15269 zf-C2H2_7: Zinc-finger
Probab=74.79 E-value=3.7 Score=23.48 Aligned_cols=25 Identities=16% Similarity=0.183 Sum_probs=19.3
Q ss_pred CCcccccccccccCChHHHHHHHHh
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRN 113 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~ 113 (343)
.-.|+|-+|......++.|-.||+.
T Consensus 18 p~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 18 PFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CccceeecCCcccchHHHHHHHHHH
Confidence 3457888888888888888888764
No 137
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=73.75 E-value=3.1 Score=23.73 Aligned_cols=24 Identities=21% Similarity=0.564 Sum_probs=18.9
Q ss_pred ccccccccccC--ChHHHHHHHHhcc
Q psy5202 92 KECEFCGKKFK--YENTLQAHRRNHT 115 (343)
Q Consensus 92 ~~C~~C~~~f~--~~~~L~~H~~~h~ 115 (343)
-.|+.||..|. ....-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 58999999987 5666778887774
No 138
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=72.87 E-value=1.8 Score=22.75 Aligned_cols=9 Identities=33% Similarity=1.276 Sum_probs=1.6
Q ss_pred cccCccccc
Q psy5202 188 CEFCGKVFK 196 (343)
Q Consensus 188 C~~C~~~f~ 196 (343)
|+.|+..+.
T Consensus 5 Cp~C~se~~ 13 (30)
T PF08274_consen 5 CPLCGSEYT 13 (30)
T ss_dssp -TTT-----
T ss_pred CCCCCCcce
Confidence 555554443
No 139
>PF14353 CpXC: CpXC protein
Probab=72.70 E-value=3 Score=30.86 Aligned_cols=23 Identities=22% Similarity=0.424 Sum_probs=13.2
Q ss_pred ccccCCCCCCCCChHHHHHHHhh
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
.|.|+.||..|.-...+..|-..
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~~ 60 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDPE 60 (128)
T ss_pred EEECCCCCCceecCCCEEEEcCC
Confidence 45666666666555555555443
No 140
>KOG4377|consensus
Probab=72.38 E-value=13 Score=32.89 Aligned_cols=52 Identities=21% Similarity=0.496 Sum_probs=34.1
Q ss_pred CCCccccc--ccccccCChHHHHHHHHhccCC------------CCeecC--CCCccCCCHHHHHHHHHHh
Q psy5202 88 EEKNKECE--FCGKKFKYENTLQAHRRNHTGE------------KPFKCT--VCDHACSQSAKLKKHMKIH 142 (343)
Q Consensus 88 ~~~~~~C~--~C~~~f~~~~~L~~H~~~h~~~------------~~~~C~--~C~~~f~~~~~L~~H~~~h 142 (343)
+..-|.|- .|+..+-.+..+.+|...|... ..|.|. .|.+ +-++...|-..|
T Consensus 268 ~rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFh 335 (480)
T KOG4377|consen 268 GREHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFH 335 (480)
T ss_pred cchhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccc
Confidence 45568884 5988877799999999988543 125564 3766 334455555555
No 141
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=71.46 E-value=0.21 Score=35.55 Aligned_cols=27 Identities=19% Similarity=0.495 Sum_probs=18.6
Q ss_pred cccccccccCChHHHHHHHHhccCCCCeecCCCCccCCC
Q psy5202 93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQ 131 (343)
Q Consensus 93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~ 131 (343)
-|+.||..+... ...+.|+.|++.+..
T Consensus 2 fC~~Cg~~l~~~------------~~~~~C~~C~~~~~~ 28 (104)
T TIGR01384 2 FCPKCGSLMTPK------------NGVYVCPSCGYEKEK 28 (104)
T ss_pred CCcccCcccccC------------CCeEECcCCCCcccc
Confidence 388888877421 135889999987654
No 142
>KOG2593|consensus
Probab=70.71 E-value=4.6 Score=35.97 Aligned_cols=38 Identities=16% Similarity=0.298 Sum_probs=27.5
Q ss_pred CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
...-|.|+.|.+.|.....++-= - .....|.|..|+-.
T Consensus 125 ~~~~Y~Cp~C~kkyt~Lea~~L~---~--~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 125 NVAGYVCPNCQKKYTSLEALQLL---D--NETGEFHCENCGGE 162 (436)
T ss_pred ccccccCCccccchhhhHHHHhh---c--ccCceEEEecCCCc
Confidence 34569999999999988777542 1 12456999999854
No 143
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=70.04 E-value=3.5 Score=33.42 Aligned_cols=26 Identities=15% Similarity=0.445 Sum_probs=15.0
Q ss_pred CceecCcCcccccChHHHHHHHHHhc
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTHG 237 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h~ 237 (343)
..|.|+.|+|.|.-..-+++|+..-|
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred CEECCCCCCcccCChHHHHHHHhhcC
Confidence 34667777777777777777765533
No 144
>PRK05580 primosome assembly protein PriA; Validated
Probab=69.69 E-value=4 Score=39.80 Aligned_cols=41 Identities=20% Similarity=0.480 Sum_probs=24.9
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
...|+.|+-.+. .|.......|.+||+.. ..+..|+.||..
T Consensus 390 ~~~C~~C~~~l~----------~h~~~~~l~Ch~Cg~~~-----------------~~~~~Cp~Cg~~ 430 (679)
T PRK05580 390 VAECPHCDASLT----------LHRFQRRLRCHHCGYQE-----------------PIPKACPECGST 430 (679)
T ss_pred ccCCCCCCCcee----------EECCCCeEECCCCcCCC-----------------CCCCCCCCCcCC
Confidence 356777765432 23445667788888433 345678888754
No 145
>KOG1280|consensus
Probab=68.06 E-value=4.5 Score=34.84 Aligned_cols=23 Identities=13% Similarity=0.465 Sum_probs=12.1
Q ss_pred ceecCcCcccccChHHHHHHHHH
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
-|.|++|++.-.+...|..|+..
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s 101 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLS 101 (381)
T ss_pred cccCCcccccccchhHHHHHhhh
Confidence 35555555555555555555443
No 146
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=67.79 E-value=2.7 Score=28.36 Aligned_cols=14 Identities=21% Similarity=0.902 Sum_probs=8.8
Q ss_pred CceecCcCcccccC
Q psy5202 212 KPYKCELCSYACAQ 225 (343)
Q Consensus 212 ~~~~C~~C~~~f~~ 225 (343)
+|-+|..||+.|..
T Consensus 57 ~Pa~CkkCGfef~~ 70 (97)
T COG3357 57 RPARCKKCGFEFRD 70 (97)
T ss_pred cChhhcccCccccc
Confidence 35567777766654
No 147
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=66.80 E-value=5.3 Score=23.40 Aligned_cols=10 Identities=40% Similarity=1.099 Sum_probs=6.1
Q ss_pred cccccCCCCC
Q psy5202 242 DVYRCRFCEM 251 (343)
Q Consensus 242 ~~~~C~~C~~ 251 (343)
..|+|..|++
T Consensus 36 ~~~~C~~C~~ 45 (46)
T PF12760_consen 36 GRYRCKACRK 45 (46)
T ss_pred CeEECCCCCC
Confidence 4566666654
No 148
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=66.73 E-value=2.4 Score=31.71 Aligned_cols=8 Identities=25% Similarity=0.804 Sum_probs=4.4
Q ss_pred cccCCCCC
Q psy5202 244 YRCRFCEM 251 (343)
Q Consensus 244 ~~C~~C~~ 251 (343)
+.||.||.
T Consensus 108 ~~CP~Cgs 115 (135)
T PRK03824 108 LKCPKCGS 115 (135)
T ss_pred cCCcCCCC
Confidence 45666653
No 149
>KOG2593|consensus
Probab=66.37 E-value=4.9 Score=35.82 Aligned_cols=22 Identities=14% Similarity=0.365 Sum_probs=16.9
Q ss_pred CCCCeecCCCCccCCCHHHHHH
Q psy5202 116 GEKPFKCTVCDHACSQSAKLKK 137 (343)
Q Consensus 116 ~~~~~~C~~C~~~f~~~~~L~~ 137 (343)
....|.|+.|.+.|.....++.
T Consensus 125 ~~~~Y~Cp~C~kkyt~Lea~~L 146 (436)
T KOG2593|consen 125 NVAGYVCPNCQKKYTSLEALQL 146 (436)
T ss_pred ccccccCCccccchhhhHHHHh
Confidence 3466999999999988776553
No 150
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=66.26 E-value=2.3 Score=25.92 Aligned_cols=10 Identities=20% Similarity=0.730 Sum_probs=4.6
Q ss_pred eecCcCcccc
Q psy5202 214 YKCELCSYAC 223 (343)
Q Consensus 214 ~~C~~C~~~f 223 (343)
|.|+.||..+
T Consensus 3 ~~CP~CG~~i 12 (54)
T TIGR01206 3 FECPDCGAEI 12 (54)
T ss_pred cCCCCCCCEE
Confidence 3455555443
No 151
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=65.59 E-value=4.6 Score=24.53 Aligned_cols=9 Identities=56% Similarity=1.398 Sum_probs=4.3
Q ss_pred ccccCcccc
Q psy5202 187 TCEFCGKVF 195 (343)
Q Consensus 187 ~C~~C~~~f 195 (343)
+|..||+.|
T Consensus 7 ~C~~Cg~~~ 15 (54)
T PF14446_consen 7 KCPVCGKKF 15 (54)
T ss_pred cChhhCCcc
Confidence 444555444
No 152
>KOG3408|consensus
Probab=65.30 E-value=3.5 Score=29.55 Aligned_cols=27 Identities=22% Similarity=0.439 Sum_probs=22.4
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
|...|.|-.|.+-|.+...|..|.++.
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~ktK 80 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKTK 80 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhcc
Confidence 455688999999999999999998853
No 153
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=65.22 E-value=4 Score=29.43 Aligned_cols=11 Identities=27% Similarity=0.293 Sum_probs=6.1
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
++|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (113)
T PRK12380 71 AWCWDCSQVVE 81 (113)
T ss_pred EEcccCCCEEe
Confidence 55666665554
No 154
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=64.91 E-value=1.5 Score=24.16 Aligned_cols=10 Identities=30% Similarity=1.222 Sum_probs=5.4
Q ss_pred ccccCccccc
Q psy5202 187 TCEFCGKVFK 196 (343)
Q Consensus 187 ~C~~C~~~f~ 196 (343)
.|+.||..|.
T Consensus 3 ~C~~Cg~~Yh 12 (36)
T PF05191_consen 3 ICPKCGRIYH 12 (36)
T ss_dssp EETTTTEEEE
T ss_pred CcCCCCCccc
Confidence 4555555553
No 155
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=64.91 E-value=1.7 Score=26.09 Aligned_cols=23 Identities=17% Similarity=0.390 Sum_probs=8.2
Q ss_pred ceecCcCcccccChHHHHHHHHH
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
.|+|+.|...|=..-++..|...
T Consensus 21 ~y~C~~C~~~FC~dCD~fiHE~L 43 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIHETL 43 (51)
T ss_dssp EE--TTTT--B-HHHHHTTTTTS
T ss_pred eEECCCCCCccccCcChhhhccc
Confidence 35555555555444444444333
No 156
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=64.76 E-value=3.3 Score=20.45 Aligned_cols=8 Identities=25% Similarity=0.858 Sum_probs=4.4
Q ss_pred ccccCCCC
Q psy5202 243 VYRCRFCE 250 (343)
Q Consensus 243 ~~~C~~C~ 250 (343)
.|.|+.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 45555555
No 157
>KOG2807|consensus
Probab=63.98 E-value=9.3 Score=32.70 Aligned_cols=100 Identities=22% Similarity=0.313 Sum_probs=58.8
Q ss_pred CcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHh
Q psy5202 90 KNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAAL 169 (343)
Q Consensus 90 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (343)
.-|.|+.|+...- .-|..|++|+-.......|.+-...--.-+++................
T Consensus 275 ~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~----- 335 (378)
T KOG2807|consen 275 GGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFA----- 335 (378)
T ss_pred CceeCCcccCeee--------------cCCccCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceee-----
Confidence 3499999986542 357899999999888877766443322222221111111000000000
Q ss_pred hhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCc
Q psy5202 170 NLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCS 220 (343)
Q Consensus 170 ~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 220 (343)
+ .-.......|+|..|...|-..-+...|...|. |+.|.
T Consensus 336 --C----~~~~~~~~~y~C~~Ck~~FCldCDv~iHesLh~------CpgCe 374 (378)
T KOG2807|consen 336 --C----QGELLSSGRYRCESCKNVFCLDCDVFIHESLHN------CPGCE 374 (378)
T ss_pred --e----ccccCCCCcEEchhccceeeccchHHHHhhhhc------CCCcC
Confidence 0 000112367999999999999999999987763 77775
No 158
>KOG4167|consensus
Probab=63.21 E-value=1.2 Score=42.11 Aligned_cols=29 Identities=21% Similarity=0.434 Sum_probs=25.6
Q ss_pred cccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202 242 DVYRCRFCEMPFSVPSTLEKHMRKCVVNQ 270 (343)
Q Consensus 242 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 270 (343)
..|.|..|+|.|-...++..||++|....
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~q~ 819 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQQE 819 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHHHH
Confidence 45999999999999999999999997544
No 159
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=62.65 E-value=4.6 Score=21.77 Aligned_cols=25 Identities=28% Similarity=0.646 Sum_probs=17.0
Q ss_pred ccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRKCVV 268 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~~~~ 268 (343)
.|.|+.|++.+ ....+..|+.+|-+
T Consensus 4 ~~~C~nC~R~v-~a~RfA~HLekCmg 28 (33)
T PF08209_consen 4 YVECPNCGRPV-AASRFAPHLEKCMG 28 (33)
T ss_dssp EEE-TTTSSEE-EGGGHHHHHHHHTC
T ss_pred eEECCCCcCCc-chhhhHHHHHHHHc
Confidence 46788888765 45567888887764
No 160
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=62.04 E-value=4.5 Score=29.66 Aligned_cols=15 Identities=27% Similarity=0.647 Sum_probs=11.1
Q ss_pred ceecCcCcccccChH
Q psy5202 213 PYKCELCSYACAQSS 227 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~ 227 (343)
|++|..||+.|..-+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 567888888887644
No 161
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=61.78 E-value=4 Score=29.84 Aligned_cols=25 Identities=44% Similarity=0.615 Sum_probs=21.0
Q ss_pred ccccccccccCChHHHHHHHHhccCCCC
Q psy5202 92 KECEFCGKKFKYENTLQAHRRNHTGEKP 119 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 119 (343)
..|-+||+.|. +|++|+.+|.+..|
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 57999999996 69999999988644
No 162
>KOG4124|consensus
Probab=61.62 E-value=2.8 Score=35.97 Aligned_cols=55 Identities=31% Similarity=0.628 Sum_probs=41.0
Q ss_pred CCCCcccccc--cccccCChHHHHHHHHh-cc------------------CCCCeecCCCCccCCCHHHHHHHHHH
Q psy5202 87 QEEKNKECEF--CGKKFKYENTLQAHRRN-HT------------------GEKPFKCTVCDHACSQSAKLKKHMKI 141 (343)
Q Consensus 87 ~~~~~~~C~~--C~~~f~~~~~L~~H~~~-h~------------------~~~~~~C~~C~~~f~~~~~L~~H~~~ 141 (343)
...++|+|++ |++.+.....|..|... |. ..|+|+|++|.+.++....|.-|+..
T Consensus 345 ~~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~ 420 (442)
T KOG4124|consen 345 VVDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTH 420 (442)
T ss_pred EecCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeeh
Confidence 3567899976 99999988888888543 32 13789999999988877666666543
No 163
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=61.13 E-value=5 Score=35.96 Aligned_cols=30 Identities=20% Similarity=0.513 Sum_probs=20.8
Q ss_pred ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
.|+.||.+..+. |..-|+|+.||+++....
T Consensus 352 ~Cp~Cg~~m~S~-------------G~~g~rC~kCg~~~~~~~ 381 (421)
T COG1571 352 VCPRCGGRMKSA-------------GRNGFRCKKCGTRARETL 381 (421)
T ss_pred CCCccCCchhhc-------------CCCCcccccccccCCccc
Confidence 588888655442 344788999998876653
No 164
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=61.03 E-value=8 Score=28.38 Aligned_cols=55 Identities=22% Similarity=0.500 Sum_probs=35.0
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceec---CcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHH
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKC---ELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPST 258 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C---~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~ 258 (343)
+.|.|.+|..+.....-| +|-.| .+|+.- -..|+.|-.+|. .||.|.-+|++...
T Consensus 79 ~lYeCnIC~etS~ee~FL----------KPneCCgY~iCn~C---ya~LWK~~~~yp-------vCPvCkTSFKss~~ 136 (140)
T PF05290_consen 79 KLYECNICKETSAEERFL----------KPNECCGYSICNAC---YANLWKFCNLYP-------VCPVCKTSFKSSSS 136 (140)
T ss_pred CceeccCcccccchhhcC----------CcccccchHHHHHH---HHHHHHHcccCC-------CCCccccccccccc
Confidence 678888888877654322 44444 123322 256777766663 69999999987643
No 165
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=59.94 E-value=5.8 Score=28.71 Aligned_cols=12 Identities=33% Similarity=0.659 Sum_probs=6.6
Q ss_pred eecCcCcccccC
Q psy5202 214 YKCELCSYACAQ 225 (343)
Q Consensus 214 ~~C~~C~~~f~~ 225 (343)
+.|..|+..|..
T Consensus 71 ~~C~~Cg~~~~~ 82 (115)
T TIGR00100 71 CECEDCSEEVSP 82 (115)
T ss_pred EEcccCCCEEec
Confidence 556666655543
No 166
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=57.93 E-value=4.2 Score=27.72 Aligned_cols=13 Identities=23% Similarity=0.861 Sum_probs=6.9
Q ss_pred cccccCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFS 254 (343)
Q Consensus 242 ~~~~C~~C~~~f~ 254 (343)
..+.|..|++.|.
T Consensus 53 GIW~C~~C~~~~A 65 (90)
T PTZ00255 53 GIWRCKGCKKTVA 65 (90)
T ss_pred EEEEcCCCCCEEe
Confidence 3455555555553
No 167
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=57.75 E-value=3.4 Score=29.82 Aligned_cols=12 Identities=17% Similarity=0.476 Sum_probs=7.3
Q ss_pred eecCcCcccccC
Q psy5202 214 YKCELCSYACAQ 225 (343)
Q Consensus 214 ~~C~~C~~~f~~ 225 (343)
++|..|++.|..
T Consensus 71 ~~C~~Cg~~~~~ 82 (113)
T PF01155_consen 71 ARCRDCGHEFEP 82 (113)
T ss_dssp EEETTTS-EEEC
T ss_pred EECCCCCCEEec
Confidence 567777766654
No 168
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=57.50 E-value=1.8 Score=34.84 Aligned_cols=39 Identities=28% Similarity=0.503 Sum_probs=25.8
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCC---------------ceecCcCccc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEK---------------PYKCELCSYA 222 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~---------------~~~C~~C~~~ 222 (343)
+.+.|++|+..|....-+..-.|+-.|+- ...||.|+++
T Consensus 18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA 71 (267)
T COG1655 18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA 71 (267)
T ss_pred ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence 67889999999987655444444433321 2579999864
No 169
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=57.44 E-value=6.6 Score=28.51 Aligned_cols=11 Identities=27% Similarity=0.615 Sum_probs=5.7
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
+.|..|+..|.
T Consensus 72 ~~C~~Cg~~~~ 82 (117)
T PRK00564 72 LECKDCSHVFK 82 (117)
T ss_pred EEhhhCCCccc
Confidence 45555554443
No 170
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=57.11 E-value=3.1 Score=28.34 Aligned_cols=11 Identities=36% Similarity=0.875 Sum_probs=5.7
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|.|..|++.|.
T Consensus 54 W~C~~C~~~~A 64 (90)
T PF01780_consen 54 WKCKKCGKKFA 64 (90)
T ss_dssp EEETTTTEEEE
T ss_pred eecCCCCCEEe
Confidence 55555555543
No 171
>PRK14873 primosome assembly protein PriA; Provisional
Probab=56.92 E-value=3.6 Score=39.83 Aligned_cols=33 Identities=18% Similarity=0.406 Sum_probs=19.2
Q ss_pred ccccccccccCCh---HHHHHHHHhccCCCCeecCCCCcc
Q psy5202 92 KECEFCGKKFKYE---NTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 92 ~~C~~C~~~f~~~---~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
..|..||..+... ..|.. |.......|.+||+.
T Consensus 384 l~C~~Cg~~~~C~~C~~~L~~----h~~~~~l~Ch~CG~~ 419 (665)
T PRK14873 384 LACARCRTPARCRHCTGPLGL----PSAGGTPRCRWCGRA 419 (665)
T ss_pred eEhhhCcCeeECCCCCCceeE----ecCCCeeECCCCcCC
Confidence 3677777766532 22333 334456788888863
No 172
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=56.19 E-value=6.9 Score=24.00 Aligned_cols=36 Identities=31% Similarity=0.566 Sum_probs=22.4
Q ss_pred ccccccccccC-ChHHHHH-HHHhccCCCCeecCCCCcc
Q psy5202 92 KECEFCGKKFK-YENTLQA-HRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 92 ~~C~~C~~~f~-~~~~L~~-H~~~h~~~~~~~C~~C~~~ 128 (343)
-+|-.|++.+. ....+.. -++. .....|.|+.|...
T Consensus 3 vkC~lCdk~~~Id~~~~~aKrLrn-rPi~tYmC~eC~~R 40 (56)
T PF09963_consen 3 VKCILCDKKEEIDEDTPEAKRLRN-RPIHTYMCDECKER 40 (56)
T ss_pred eEEEecCCEEEeccCCHHHHHhhc-CCCcceeChhHHHH
Confidence 47999998876 2333333 2333 35567999999643
No 173
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=55.45 E-value=4.1 Score=27.81 Aligned_cols=13 Identities=31% Similarity=0.961 Sum_probs=7.1
Q ss_pred cccccCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFS 254 (343)
Q Consensus 242 ~~~~C~~C~~~f~ 254 (343)
..+.|..|++.|.
T Consensus 52 GIW~C~~C~~~~A 64 (91)
T TIGR00280 52 GIWTCRKCGAKFA 64 (91)
T ss_pred EEEEcCCCCCEEe
Confidence 3455666655553
No 174
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=55.41 E-value=6.5 Score=22.32 Aligned_cols=16 Identities=19% Similarity=0.330 Sum_probs=7.2
Q ss_pred eecCcCcccccChHHH
Q psy5202 214 YKCELCSYACAQSSKL 229 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L 229 (343)
+.|+.|+-.+-....|
T Consensus 20 d~C~~C~G~W~d~~el 35 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGEL 35 (41)
T ss_pred EECCCCCeEEccHHHH
Confidence 4444444444444433
No 175
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=55.33 E-value=5.5 Score=25.60 Aligned_cols=9 Identities=56% Similarity=1.390 Sum_probs=2.6
Q ss_pred ccccCcccc
Q psy5202 187 TCEFCGKVF 195 (343)
Q Consensus 187 ~C~~C~~~f 195 (343)
.|..|++.|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 455555555
No 176
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=55.09 E-value=5.8 Score=29.88 Aligned_cols=11 Identities=18% Similarity=0.676 Sum_probs=5.2
Q ss_pred CcccccCccccc
Q psy5202 185 NDTCEFCGKVFK 196 (343)
Q Consensus 185 ~~~C~~C~~~f~ 196 (343)
.|.|. |+..|.
T Consensus 117 ~Y~C~-C~q~~l 127 (156)
T COG3091 117 PYRCQ-CQQHYL 127 (156)
T ss_pred eEEee-cCCccc
Confidence 44455 554443
No 177
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=53.82 E-value=6.4 Score=29.65 Aligned_cols=36 Identities=19% Similarity=0.600 Sum_probs=25.7
Q ss_pred CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202 88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
..-+|.|. |+..|.+ .++|-..-.|+ .|.|..|+..
T Consensus 114 ~~~~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 114 TTYPYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred cceeEEee-cCCccch---hhhcccccccc-eEEeccCCce
Confidence 34579999 9999765 34554444455 7999999864
No 178
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=53.76 E-value=8 Score=23.03 Aligned_cols=10 Identities=50% Similarity=0.982 Sum_probs=4.3
Q ss_pred cccccCcccc
Q psy5202 186 DTCEFCGKVF 195 (343)
Q Consensus 186 ~~C~~C~~~f 195 (343)
+.|..||..|
T Consensus 5 l~C~dCg~~F 14 (49)
T PF13451_consen 5 LTCKDCGAEF 14 (49)
T ss_pred EEcccCCCeE
Confidence 3444444443
No 179
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=53.72 E-value=9.5 Score=24.50 Aligned_cols=27 Identities=26% Similarity=0.717 Sum_probs=14.2
Q ss_pred ccccCcccccCHHHHHHHHHHccCCCceecCcCcccc
Q psy5202 187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYAC 223 (343)
Q Consensus 187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 223 (343)
.|+.||..... -...+.|.|+.||..+
T Consensus 30 ~C~~CG~~~~~----------~~~~r~~~C~~Cg~~~ 56 (69)
T PF07282_consen 30 TCPRCGHRNKK----------RRSGRVFTCPNCGFEM 56 (69)
T ss_pred CccCccccccc----------ccccceEEcCCCCCEE
Confidence 46666655443 1223446666666554
No 180
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=53.32 E-value=7.3 Score=23.36 Aligned_cols=12 Identities=25% Similarity=0.839 Sum_probs=8.1
Q ss_pred CceecCcCcccc
Q psy5202 212 KPYKCELCSYAC 223 (343)
Q Consensus 212 ~~~~C~~C~~~f 223 (343)
..|.|..||+.+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 457777777654
No 181
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=52.94 E-value=6.8 Score=33.16 Aligned_cols=48 Identities=15% Similarity=0.325 Sum_probs=31.4
Q ss_pred ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202 215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC 266 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 266 (343)
.|-.|.-.|..... |-.-.. +....|+|+.|...|-...+...|..-|
T Consensus 364 ~Cf~CQ~~fp~~~~---~~~~~~-~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 364 HCFVCQGPFPKPPV---SPFDES-TSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceeccCCCCCCCC---Cccccc-ccccceechhhhhhhhhhhHHHHHHHHh
Confidence 37777776654321 110000 1245699999999999999999997654
No 182
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=52.15 E-value=6.5 Score=32.20 Aligned_cols=14 Identities=14% Similarity=0.479 Sum_probs=8.2
Q ss_pred CceecCcCcccccC
Q psy5202 212 KPYKCELCSYACAQ 225 (343)
Q Consensus 212 ~~~~C~~C~~~f~~ 225 (343)
..|.|+.|+..|..
T Consensus 154 aef~C~~C~h~F~G 167 (278)
T PF15135_consen 154 AEFHCPKCRHNFRG 167 (278)
T ss_pred eeeecccccccchh
Confidence 34666666666653
No 183
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=52.07 E-value=9.1 Score=27.66 Aligned_cols=10 Identities=30% Similarity=0.587 Sum_probs=5.0
Q ss_pred eecCcCcccc
Q psy5202 214 YKCELCSYAC 223 (343)
Q Consensus 214 ~~C~~C~~~f 223 (343)
+.|..|+..|
T Consensus 71 ~~C~~Cg~~~ 80 (114)
T PRK03681 71 CWCETCQQYV 80 (114)
T ss_pred EEcccCCCee
Confidence 4555555433
No 184
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=51.62 E-value=7.8 Score=28.43 Aligned_cols=26 Identities=38% Similarity=0.485 Sum_probs=19.0
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCc
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKP 213 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 213 (343)
-..|-.+|+.|.+ |++|+.+|.|-.|
T Consensus 76 ~IicLEDGkkfKS---LKRHL~t~~gmTP 101 (148)
T COG4957 76 YIICLEDGKKFKS---LKRHLTTHYGLTP 101 (148)
T ss_pred eEEEeccCcchHH---HHHHHhcccCCCH
Confidence 4568888888864 7888888877543
No 185
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=51.21 E-value=5.8 Score=37.15 Aligned_cols=35 Identities=23% Similarity=0.464 Sum_probs=19.3
Q ss_pred cccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202 93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
.|..||....... -...+..|.......|.+||+.
T Consensus 215 ~C~~Cg~~~~C~~-C~~~l~~h~~~~~l~Ch~Cg~~ 249 (505)
T TIGR00595 215 LCRSCGYILCCPN-CDVSLTYHKKEGKLRCHYCGYQ 249 (505)
T ss_pred EhhhCcCccCCCC-CCCceEEecCCCeEEcCCCcCc
Confidence 5777776655321 1112334445556788888864
No 186
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=51.16 E-value=12 Score=23.24 Aligned_cols=40 Identities=18% Similarity=0.470 Sum_probs=25.6
Q ss_pred ccccccccc-cCChHHHHHHHHhccCCCCeecCCCCccCCC
Q psy5202 92 KECEFCGKK-FKYENTLQAHRRNHTGEKPFKCTVCDHACSQ 131 (343)
Q Consensus 92 ~~C~~C~~~-f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~ 131 (343)
.+|-+|++. |-....+..-...-...+.|.|+.|......
T Consensus 3 vkCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rva~ 43 (68)
T COG4896 3 VKCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRVAI 43 (68)
T ss_pred ceEEEecceeeecchhHHHHHhhCCCceeEechhhHhhhch
Confidence 478999866 4455555554444445567999999755433
No 187
>KOG3408|consensus
Probab=50.90 E-value=9.9 Score=27.33 Aligned_cols=26 Identities=27% Similarity=0.506 Sum_probs=21.0
Q ss_pred CCCceecCcCcccccChHHHHHHHHH
Q psy5202 210 GEKPYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 210 ~~~~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
|...|.|-.|.+-|.+...|..|.+.
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 44558898999999999999888765
No 188
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=50.49 E-value=24 Score=27.72 Aligned_cols=34 Identities=18% Similarity=0.379 Sum_probs=22.1
Q ss_pred ccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 208 HTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 208 h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
-....-|.|+.|...|+.-.++. .-|.|+.||-.
T Consensus 108 ~~~~~~y~C~~~~~r~sfdeA~~-----------~~F~Cp~Cg~~ 141 (176)
T COG1675 108 ETENNYYVCPNCHVKYSFDEAME-----------LGFTCPKCGED 141 (176)
T ss_pred hccCCceeCCCCCCcccHHHHHH-----------hCCCCCCCCch
Confidence 34556688888887776654432 12888888854
No 189
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=50.04 E-value=12 Score=22.82 Aligned_cols=9 Identities=44% Similarity=1.003 Sum_probs=4.0
Q ss_pred ccCCCCCCC
Q psy5202 245 RCRFCEMPF 253 (343)
Q Consensus 245 ~C~~C~~~f 253 (343)
.|..||+.|
T Consensus 20 ~Cr~Cg~~~ 28 (57)
T cd00065 20 HCRNCGRIF 28 (57)
T ss_pred ccCcCcCCc
Confidence 344444444
No 190
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=49.93 E-value=1.7 Score=24.62 Aligned_cols=10 Identities=30% Similarity=0.863 Sum_probs=4.8
Q ss_pred cccCCCCCCC
Q psy5202 244 YRCRFCEMPF 253 (343)
Q Consensus 244 ~~C~~C~~~f 253 (343)
|.|..|+..|
T Consensus 29 y~C~~C~~~w 38 (40)
T smart00440 29 YVCTKCGHRW 38 (40)
T ss_pred EEeCCCCCEe
Confidence 4455555443
No 191
>KOG1842|consensus
Probab=49.37 E-value=12 Score=33.57 Aligned_cols=22 Identities=18% Similarity=0.241 Sum_probs=12.1
Q ss_pred ccccccccccCChHHHHHHHHh
Q psy5202 92 KECEFCGKKFKYENTLQAHRRN 113 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~~~ 113 (343)
|.|++|...|.....|..|+..
T Consensus 16 flCPiC~~dl~~~~~L~~H~d~ 37 (505)
T KOG1842|consen 16 FLCPICLLDLPNLSALNDHLDV 37 (505)
T ss_pred ccCchHhhhhhhHHHHHHHHhh
Confidence 5555555555555555555543
No 192
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=48.88 E-value=20 Score=22.02 Aligned_cols=18 Identities=22% Similarity=0.573 Sum_probs=11.0
Q ss_pred cccCCCCCCCCChHHHHH
Q psy5202 244 YRCRFCEMPFSVPSTLEK 261 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~ 261 (343)
+.|-.||..|.+...|..
T Consensus 28 ~YC~~Cg~~Y~d~~dL~~ 45 (55)
T PF13821_consen 28 NYCFWCGTKYDDEEDLER 45 (55)
T ss_pred ceeeeeCCccCCHHHHHh
Confidence 456666666666666553
No 193
>KOG1280|consensus
Probab=48.65 E-value=19 Score=31.14 Aligned_cols=28 Identities=14% Similarity=0.394 Sum_probs=22.4
Q ss_pred CCeecCCCCccCCCHHHHHHHHHHhccC
Q psy5202 118 KPFKCTVCDHACSQSAKLKKHMKIHDED 145 (343)
Q Consensus 118 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 145 (343)
..|.|++|+..-.+...|..|....|.+
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpd 105 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPE 105 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcc
Confidence 3588888888888888888888877765
No 194
>PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=48.40 E-value=16 Score=20.46 Aligned_cols=22 Identities=27% Similarity=0.611 Sum_probs=8.9
Q ss_pred ccccCCCCCCCC-Ch-HHHHHHHh
Q psy5202 243 VYRCRFCEMPFS-VP-STLEKHMR 264 (343)
Q Consensus 243 ~~~C~~C~~~f~-~~-~~l~~H~~ 264 (343)
.|-|.+|...|. .. ..-+.|.+
T Consensus 3 ryyCdyC~~~~~~d~~~~Rk~H~~ 26 (38)
T PF06220_consen 3 RYYCDYCKKYLTHDSPSIRKQHER 26 (38)
T ss_dssp S-B-TTT--B-S--SHHHHHHHT-
T ss_pred CeecccccceecCCChHHHHHhhc
Confidence 477888888883 33 33356654
No 195
>KOG2907|consensus
Probab=48.13 E-value=9.2 Score=27.11 Aligned_cols=11 Identities=36% Similarity=1.077 Sum_probs=7.4
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|.|+.|++.|.
T Consensus 103 YTC~kC~~k~~ 113 (116)
T KOG2907|consen 103 YTCPKCKYKFT 113 (116)
T ss_pred EEcCccceeee
Confidence 66777776664
No 196
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=48.02 E-value=5.6 Score=27.49 Aligned_cols=13 Identities=23% Similarity=0.626 Sum_probs=8.0
Q ss_pred cccccCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFS 254 (343)
Q Consensus 242 ~~~~C~~C~~~f~ 254 (343)
....|..||.+|.
T Consensus 45 g~~~Cg~CGls~e 57 (104)
T COG4888 45 GTAVCGNCGLSFE 57 (104)
T ss_pred eEEEcccCcceEE
Confidence 3456777776664
No 197
>KOG4124|consensus
Probab=47.91 E-value=2 Score=36.79 Aligned_cols=53 Identities=17% Similarity=0.086 Sum_probs=35.0
Q ss_pred CCCCcccccc--cccccCChHHHHHH-----HHhccCCCCeecCCCCccCCCHHHHHHHHH
Q psy5202 87 QEEKNKECEF--CGKKFKYENTLQAH-----RRNHTGEKPFKCTVCDHACSQSAKLKKHMK 140 (343)
Q Consensus 87 ~~~~~~~C~~--C~~~f~~~~~L~~H-----~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 140 (343)
.+|+++.|.. |.+.+........| +..-...++|+|+ |++.+.++..|+.|-.
T Consensus 174 ~EE~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~-~~~~~~T~~~l~~HS~ 233 (442)
T KOG4124|consen 174 PEEYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP-ESLVMDTSSPLSDHSM 233 (442)
T ss_pred CccccccccCchhhhhhhccccccccccccccccccccCCccCc-ccccccccchhhhccc
Confidence 3778888876 76666544444333 2233345788885 8888888888888853
No 198
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=47.73 E-value=10 Score=34.03 Aligned_cols=34 Identities=24% Similarity=0.617 Sum_probs=25.0
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHH
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKL 229 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L 229 (343)
.-.|+.||....+. |..-|+|+.||+.+......
T Consensus 350 ~p~Cp~Cg~~m~S~-----------G~~g~rC~kCg~~~~~~~~~ 383 (421)
T COG1571 350 NPVCPRCGGRMKSA-----------GRNGFRCKKCGTRARETLIK 383 (421)
T ss_pred CCCCCccCCchhhc-----------CCCCcccccccccCCccccc
Confidence 34799999876543 34479999999998765443
No 199
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=47.60 E-value=6.1 Score=26.96 Aligned_cols=13 Identities=31% Similarity=0.976 Sum_probs=6.9
Q ss_pred cccccCCCCCCCC
Q psy5202 242 DVYRCRFCEMPFS 254 (343)
Q Consensus 242 ~~~~C~~C~~~f~ 254 (343)
..+.|..|++.|.
T Consensus 53 GIW~C~~C~~~~A 65 (90)
T PRK03976 53 GIWECRKCGAKFA 65 (90)
T ss_pred EEEEcCCCCCEEe
Confidence 3455555555553
No 200
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=47.59 E-value=12 Score=22.34 Aligned_cols=10 Identities=30% Similarity=0.800 Sum_probs=5.1
Q ss_pred ccccCCCCCC
Q psy5202 243 VYRCRFCEMP 252 (343)
Q Consensus 243 ~~~C~~C~~~ 252 (343)
.|.|+.||+.
T Consensus 20 ~~vC~~Cg~~ 29 (52)
T smart00661 20 RFVCRKCGYE 29 (52)
T ss_pred EEECCcCCCe
Confidence 4555555543
No 201
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=47.31 E-value=9.5 Score=21.89 Aligned_cols=8 Identities=25% Similarity=0.791 Sum_probs=3.3
Q ss_pred cccCCCCC
Q psy5202 244 YRCRFCEM 251 (343)
Q Consensus 244 ~~C~~C~~ 251 (343)
+.|..||.
T Consensus 20 ~vC~~CG~ 27 (43)
T PF08271_consen 20 LVCPNCGL 27 (43)
T ss_dssp EEETTT-B
T ss_pred EECCCCCC
Confidence 44555543
No 202
>PRK12496 hypothetical protein; Provisional
Probab=47.13 E-value=14 Score=28.76 Aligned_cols=11 Identities=18% Similarity=0.407 Sum_probs=6.9
Q ss_pred eecCcCccccc
Q psy5202 214 YKCELCSYACA 224 (343)
Q Consensus 214 ~~C~~C~~~f~ 224 (343)
|+|.-|++.|.
T Consensus 128 ~~C~gC~~~~~ 138 (164)
T PRK12496 128 KVCKGCKKKYP 138 (164)
T ss_pred EECCCCCcccc
Confidence 56666666663
No 203
>COG1773 Rubredoxin [Energy production and conversion]
Probab=46.91 E-value=9.4 Score=23.31 Aligned_cols=40 Identities=20% Similarity=0.395 Sum_probs=21.2
Q ss_pred CceecCcCcccccChHHHHHHHHHh-c--CCCCcccccCCCCC
Q psy5202 212 KPYKCELCSYACAQSSKLTRHMKTH-G--RLGKDVYRCRFCEM 251 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~~~h-~--~~~~~~~~C~~C~~ 251 (343)
+.|+|..||+.|.-...--++-..- + ..-..-+.|+.||.
T Consensus 2 ~~~~C~~CG~vYd~e~Gdp~~gi~pgT~fedlPd~w~CP~Cg~ 44 (55)
T COG1773 2 KRWRCSVCGYVYDPEKGDPRCGIAPGTPFEDLPDDWVCPECGV 44 (55)
T ss_pred CceEecCCceEeccccCCccCCCCCCCchhhCCCccCCCCCCC
Confidence 3589999999986543222210000 0 00011368999985
No 204
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=46.88 E-value=8.9 Score=22.67 Aligned_cols=10 Identities=30% Similarity=1.056 Sum_probs=6.8
Q ss_pred ceecCcCccc
Q psy5202 213 PYKCELCSYA 222 (343)
Q Consensus 213 ~~~C~~C~~~ 222 (343)
.|.|..||++
T Consensus 37 R~~CGkCgyT 46 (51)
T COG1998 37 RWACGKCGYT 46 (51)
T ss_pred eeEeccccce
Confidence 4677777764
No 205
>KOG2272|consensus
Probab=46.12 E-value=23 Score=29.19 Aligned_cols=44 Identities=18% Similarity=0.358 Sum_probs=24.4
Q ss_pred CCCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCH
Q psy5202 87 QEEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQS 132 (343)
Q Consensus 87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 132 (343)
.+...|.|..|...... ..|.---...+ .-.|+|..|++...+-
T Consensus 133 ~~~g~YvC~KCh~~iD~-~~l~fr~d~yH-~yHFkCt~C~keL~sd 176 (332)
T KOG2272|consen 133 KGRGRYVCQKCHAHIDE-QPLTFRGDPYH-PYHFKCTTCGKELTSD 176 (332)
T ss_pred cccceeehhhhhhhccc-ccccccCCCCC-ccceecccccccccch
Confidence 34457889988876654 22221111111 2347888888876554
No 206
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=45.39 E-value=15 Score=33.71 Aligned_cols=28 Identities=21% Similarity=0.324 Sum_probs=19.2
Q ss_pred ccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRKCVVNQ 270 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 270 (343)
-+.|+.|.+.|.....+..|+...|...
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 3567777777777777777777555443
No 207
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=45.23 E-value=16 Score=22.33 Aligned_cols=12 Identities=25% Similarity=0.697 Sum_probs=6.4
Q ss_pred cccccCCCCCCC
Q psy5202 242 DVYRCRFCEMPF 253 (343)
Q Consensus 242 ~~~~C~~C~~~f 253 (343)
..|.|+.||..+
T Consensus 13 v~~~Cp~cGipt 24 (55)
T PF13824_consen 13 VNFECPDCGIPT 24 (55)
T ss_pred cCCcCCCCCCcC
Confidence 345566666543
No 208
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=45.11 E-value=11 Score=21.18 Aligned_cols=14 Identities=29% Similarity=0.717 Sum_probs=10.9
Q ss_pred ccccCCCCCCCCCh
Q psy5202 243 VYRCRFCEMPFSVP 256 (343)
Q Consensus 243 ~~~C~~C~~~f~~~ 256 (343)
+|.|..|++.|-..
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 78899999888543
No 209
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=44.98 E-value=14 Score=22.52 Aligned_cols=14 Identities=29% Similarity=0.534 Sum_probs=8.1
Q ss_pred CcccccCcccccCH
Q psy5202 185 NDTCEFCGKVFKNC 198 (343)
Q Consensus 185 ~~~C~~C~~~f~~~ 198 (343)
-++|+.||..|...
T Consensus 28 ~W~C~~Cgh~w~~~ 41 (55)
T PF14311_consen 28 WWKCPKCGHEWKAS 41 (55)
T ss_pred EEECCCCCCeeEcc
Confidence 35666666655544
No 210
>COG5216 Uncharacterized conserved protein [Function unknown]
Probab=44.32 E-value=12 Score=22.89 Aligned_cols=31 Identities=29% Similarity=0.615 Sum_probs=15.8
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY 221 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 221 (343)
.|.|+ ||.+|.-.- . -...|+..-+|+-|..
T Consensus 22 tyPCP-CGDRFeIsL--e---Dl~~GE~VArCPSCSL 52 (67)
T COG5216 22 TYPCP-CGDRFEISL--E---DLRNGEVVARCPSCSL 52 (67)
T ss_pred EecCC-CCCEeEEEH--H---HhhCCceEEEcCCceE
Confidence 45565 666664321 1 1234566666666654
No 211
>KOG2636|consensus
Probab=44.30 E-value=17 Score=32.68 Aligned_cols=29 Identities=28% Similarity=0.692 Sum_probs=23.5
Q ss_pred HHccCCCceecCcCc-ccccChHHHHHHHH
Q psy5202 206 RSHTGEKPYKCELCS-YACAQSSKLTRHMK 234 (343)
Q Consensus 206 ~~h~~~~~~~C~~C~-~~f~~~~~L~~H~~ 234 (343)
+.|.-...|.|.+|| +++.-+..+.+|..
T Consensus 394 KLHGL~~ey~CEICGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 394 KLHGLDIEYNCEICGNYVYKGRKAFDRHFN 423 (497)
T ss_pred hhcCCCcccceeeccCccccCcHHHHHHhH
Confidence 456667789999999 88899999999953
No 212
>PF03833 PolC_DP2: DNA polymerase II large subunit DP2; InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=43.92 E-value=7.6 Score=37.96 Aligned_cols=13 Identities=23% Similarity=0.350 Sum_probs=0.0
Q ss_pred ccCCCCCCCCChH
Q psy5202 245 RCRFCEMPFSVPS 257 (343)
Q Consensus 245 ~C~~C~~~f~~~~ 257 (343)
.|+.|+.......
T Consensus 694 ~C~~C~~~~~~~~ 706 (900)
T PF03833_consen 694 ECPKCGRETTSYS 706 (900)
T ss_dssp -------------
T ss_pred ccccccccCcccc
Confidence 5777776554443
No 213
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=43.68 E-value=19 Score=30.58 Aligned_cols=27 Identities=26% Similarity=0.662 Sum_probs=20.2
Q ss_pred CcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202 241 KDVYRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 241 ~~~~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
...|.|+.|-+-|.+...|.+|+.+|.
T Consensus 46 ~~lyiCe~Clky~~~~~~l~~H~~~C~ 72 (290)
T PLN03238 46 TKLYICEYCLKYMRKKKSLLRHLAKCD 72 (290)
T ss_pred CeEEEcCCCcchhCCHHHHHHHHHhCC
Confidence 456778888888888888888877544
No 214
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=43.28 E-value=23 Score=27.31 Aligned_cols=14 Identities=36% Similarity=0.648 Sum_probs=7.4
Q ss_pred cccCCCCCCCCChH
Q psy5202 244 YRCRFCEMPFSVPS 257 (343)
Q Consensus 244 ~~C~~C~~~f~~~~ 257 (343)
|.|+.||+.|---+
T Consensus 131 ~~C~~CgkiYW~Gs 144 (165)
T COG1656 131 YRCPKCGKIYWKGS 144 (165)
T ss_pred eECCCCcccccCch
Confidence 45666665554443
No 215
>PRK04351 hypothetical protein; Provisional
Probab=42.68 E-value=11 Score=28.72 Aligned_cols=12 Identities=33% Similarity=0.777 Sum_probs=6.5
Q ss_pred cccccCCCCCCC
Q psy5202 242 DVYRCRFCEMPF 253 (343)
Q Consensus 242 ~~~~C~~C~~~f 253 (343)
..|.|..|+-.+
T Consensus 131 ~~yrCg~C~g~L 142 (149)
T PRK04351 131 KRYRCGKCRGKL 142 (149)
T ss_pred CcEEeCCCCcEe
Confidence 445666665444
No 216
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=42.27 E-value=3.1 Score=32.85 Aligned_cols=8 Identities=38% Similarity=1.012 Sum_probs=4.3
Q ss_pred cccccCcc
Q psy5202 186 DTCEFCGK 193 (343)
Q Consensus 186 ~~C~~C~~ 193 (343)
..|+.||.
T Consensus 15 ~~CPvCg~ 22 (201)
T COG1779 15 IDCPVCGG 22 (201)
T ss_pred ecCCcccc
Confidence 34555555
No 217
>KOG3507|consensus
Probab=42.26 E-value=13 Score=22.88 Aligned_cols=12 Identities=25% Similarity=0.772 Sum_probs=6.4
Q ss_pred CceecCcCcccc
Q psy5202 212 KPYKCELCSYAC 223 (343)
Q Consensus 212 ~~~~C~~C~~~f 223 (343)
..++|..||+..
T Consensus 36 D~irCReCG~RI 47 (62)
T KOG3507|consen 36 DVIRCRECGYRI 47 (62)
T ss_pred CcEehhhcchHH
Confidence 345666666543
No 218
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=41.65 E-value=6.1 Score=28.48 Aligned_cols=40 Identities=23% Similarity=0.502 Sum_probs=20.2
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA 224 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 224 (343)
.+.|+.||..=.....|+.-..--....-|.|..||+.|.
T Consensus 72 ~~~CpkCg~~ea~y~~~QtRsaDEp~T~Fy~C~~Cg~~wr 111 (113)
T COG1594 72 KEKCPKCGNKEAYYWQLQTRSADEPETRFYKCTRCGYRWR 111 (113)
T ss_pred cccCCCCCCceeEEEeeehhccCCCceEEEEecccCCEee
Confidence 3678888855433322221111111123388888887765
No 220
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.70 E-value=5.7 Score=25.82 Aligned_cols=10 Identities=30% Similarity=0.889 Sum_probs=5.0
Q ss_pred eecCcCcccc
Q psy5202 214 YKCELCSYAC 223 (343)
Q Consensus 214 ~~C~~C~~~f 223 (343)
|+|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 4555555443
No 221
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=40.69 E-value=19 Score=31.13 Aligned_cols=36 Identities=22% Similarity=0.483 Sum_probs=19.7
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA 222 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 222 (343)
.-.|++||..-.- .+.+......|.+...|..|+..
T Consensus 184 ~~~CPvCGs~P~~--s~~~~~~~~~G~RyL~CslC~te 219 (305)
T TIGR01562 184 RTLCPACGSPPVA--SMVRQGGKETGLRYLSCSLCATE 219 (305)
T ss_pred CCcCCCCCChhhh--hhhcccCCCCCceEEEcCCCCCc
Confidence 4479999965421 11111112456667778888743
No 222
>KOG2636|consensus
Probab=40.58 E-value=20 Score=32.26 Aligned_cols=29 Identities=24% Similarity=0.594 Sum_probs=24.2
Q ss_pred HHhcCCCCcccccCCCC-CCCCChHHHHHHHh
Q psy5202 234 KTHGRLGKDVYRCRFCE-MPFSVPSTLEKHMR 264 (343)
Q Consensus 234 ~~h~~~~~~~~~C~~C~-~~f~~~~~l~~H~~ 264 (343)
+.|. -..-|.|.+|| +++.-+..+.+|..
T Consensus 394 KLHG--L~~ey~CEICGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 394 KLHG--LDIEYNCEICGNYVYKGRKAFDRHFN 423 (497)
T ss_pred hhcC--CCcccceeeccCccccCcHHHHHHhH
Confidence 3554 45679999999 99999999999975
No 223
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=40.17 E-value=21 Score=20.41 Aligned_cols=16 Identities=25% Similarity=0.613 Sum_probs=11.3
Q ss_pred ccCCCCCCCCChHHHH
Q psy5202 245 RCRFCEMPFSVPSTLE 260 (343)
Q Consensus 245 ~C~~C~~~f~~~~~l~ 260 (343)
.|..||+.|.......
T Consensus 10 ~C~~C~rpf~WRKKW~ 25 (42)
T PF10013_consen 10 ICPVCGRPFTWRKKWA 25 (42)
T ss_pred cCcccCCcchHHHHHH
Confidence 5888888887765443
No 224
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=40.10 E-value=7.1 Score=38.04 Aligned_cols=23 Identities=22% Similarity=0.421 Sum_probs=14.2
Q ss_pred HHHHHHHhccCCCCeecCCCCcc
Q psy5202 106 TLQAHRRNHTGEKPFKCTVCDHA 128 (343)
Q Consensus 106 ~L~~H~~~h~~~~~~~C~~C~~~ 128 (343)
.|.-|...|....|..|+.||..
T Consensus 462 ~L~CH~Cg~~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 462 QLRCHYCGYQEPIPQSCPECGSE 484 (730)
T ss_pred eeEeCCCCCCCCCCCCCCCCCCC
Confidence 34444444555677888888754
No 225
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=40.05 E-value=19 Score=31.20 Aligned_cols=36 Identities=22% Similarity=0.556 Sum_probs=21.2
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA 222 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 222 (343)
..-.|++||..-.- + +. +.....|.+...|..|+..
T Consensus 186 ~~~~CPvCGs~P~~-s-~v-~~~~~~G~RyL~CslC~te 221 (309)
T PRK03564 186 QRQFCPVCGSMPVS-S-VV-QIGTTQGLRYLHCNLCESE 221 (309)
T ss_pred CCCCCCCCCCcchh-h-ee-eccCCCCceEEEcCCCCCc
Confidence 45679999976431 1 11 2233456677778888743
No 226
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=39.82 E-value=57 Score=20.11 Aligned_cols=45 Identities=11% Similarity=0.110 Sum_probs=24.9
Q ss_pred cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCC
Q psy5202 186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFS 254 (343)
Q Consensus 186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~ 254 (343)
|.|++|+..+... . ...||..|- +..+..+++.+ ..|+.|+..+.
T Consensus 2 ~~Cpi~~~~~~~P---------------v-~~~~G~v~~-~~~i~~~~~~~-------~~cP~~~~~~~ 46 (63)
T smart00504 2 FLCPISLEVMKDP---------------V-ILPSGQTYE-RRAIEKWLLSH-------GTDPVTGQPLT 46 (63)
T ss_pred cCCcCCCCcCCCC---------------E-ECCCCCEEe-HHHHHHHHHHC-------CCCCCCcCCCC
Confidence 4677777776442 1 234565553 34444454432 35888887764
No 227
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=39.67 E-value=4.6 Score=38.57 Aligned_cols=57 Identities=21% Similarity=0.435 Sum_probs=38.0
Q ss_pred ccccCcccccCHHHHHHHHHHccCCCce-ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 187 TCEFCGKVFKNCSNLTVHRRSHTGEKPY-KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
.|..||-+|+-...|-.-+. ++.-+.| .|+.|.+.|....+-+-| ..+..|+.||-.
T Consensus 125 ~CT~CGPRfTIi~alPYDR~-nTsM~~F~lC~~C~~EY~dP~nRRfH--------AQp~aCp~CGP~ 182 (750)
T COG0068 125 NCTNCGPRFTIIEALPYDRE-NTSMADFPLCPFCDKEYKDPLNRRFH--------AQPIACPKCGPH 182 (750)
T ss_pred ccCCCCcceeeeccCCCCcc-cCccccCcCCHHHHHHhcCccccccc--------cccccCcccCCC
Confidence 49999999987655543322 2222333 599999988887765444 356789999953
No 228
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=39.40 E-value=8.7 Score=29.43 Aligned_cols=11 Identities=27% Similarity=0.818 Sum_probs=5.9
Q ss_pred CcccccCcccc
Q psy5202 185 NDTCEFCGKVF 195 (343)
Q Consensus 185 ~~~C~~C~~~f 195 (343)
.|.|..|+..+
T Consensus 123 ~~~C~~C~~~~ 133 (157)
T PF10263_consen 123 VYRCPSCGREY 133 (157)
T ss_pred EEEcCCCCCEe
Confidence 45555555554
No 229
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=39.00 E-value=17 Score=22.04 Aligned_cols=11 Identities=18% Similarity=0.356 Sum_probs=6.0
Q ss_pred CcccccCCCCC
Q psy5202 241 KDVYRCRFCEM 251 (343)
Q Consensus 241 ~~~~~C~~C~~ 251 (343)
..|..|+.|.+
T Consensus 26 NfPlyCpKCK~ 36 (55)
T PF14205_consen 26 NFPLYCPKCKQ 36 (55)
T ss_pred cccccCCCCCc
Confidence 44555666654
No 230
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=38.30 E-value=17 Score=25.62 Aligned_cols=29 Identities=14% Similarity=0.341 Sum_probs=19.8
Q ss_pred ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202 215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~ 256 (343)
.|+.|+..|.+. ....|.|+.|+..+...
T Consensus 4 ~CP~C~seytY~-------------dg~~~iCpeC~~EW~~~ 32 (109)
T TIGR00686 4 PCPKCNSEYTYH-------------DGTQLICPSCLYEWNEN 32 (109)
T ss_pred cCCcCCCcceEe-------------cCCeeECcccccccccc
Confidence 488888776653 23458899998877654
No 231
>PF12907 zf-met2: Zinc-binding
Probab=38.17 E-value=19 Score=20.45 Aligned_cols=21 Identities=19% Similarity=0.580 Sum_probs=9.8
Q ss_pred eecCcCcccc---cChHHHHHHHH
Q psy5202 214 YKCELCSYAC---AQSSKLTRHMK 234 (343)
Q Consensus 214 ~~C~~C~~~f---~~~~~L~~H~~ 234 (343)
+.|.+|...| .....|..|..
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~e 25 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAE 25 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHH
Confidence 3455555333 23344555544
No 232
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=38.07 E-value=8.5 Score=26.93 Aligned_cols=12 Identities=17% Similarity=0.418 Sum_probs=6.8
Q ss_pred cccCCCCCCCCC
Q psy5202 244 YRCRFCEMPFSV 255 (343)
Q Consensus 244 ~~C~~C~~~f~~ 255 (343)
..|+.||..|..
T Consensus 43 ~~C~~CG~y~~~ 54 (99)
T PRK14892 43 ITCGNCGLYTEF 54 (99)
T ss_pred EECCCCCCccCE
Confidence 456666665543
No 233
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=37.18 E-value=31 Score=27.30 Aligned_cols=14 Identities=21% Similarity=0.536 Sum_probs=9.1
Q ss_pred CcccccCCCCCCCC
Q psy5202 241 KDVYRCRFCEMPFS 254 (343)
Q Consensus 241 ~~~~~C~~C~~~f~ 254 (343)
...+.|..||..+.
T Consensus 28 ~~lvrC~eCG~V~~ 41 (201)
T COG1326 28 EPLVRCEECGTVHP 41 (201)
T ss_pred ceEEEccCCCcEee
Confidence 34567777777663
No 234
>KOG2923|consensus
Probab=36.85 E-value=25 Score=22.04 Aligned_cols=13 Identities=38% Similarity=0.833 Sum_probs=7.0
Q ss_pred cCCCceecCcCcc
Q psy5202 209 TGEKPYKCELCSY 221 (343)
Q Consensus 209 ~~~~~~~C~~C~~ 221 (343)
.|+..-.|+-|..
T Consensus 40 ~ge~Va~CpsCSL 52 (67)
T KOG2923|consen 40 NGEDVARCPSCSL 52 (67)
T ss_pred CCCeeecCCCceE
Confidence 3455555666654
No 235
>PRK10220 hypothetical protein; Provisional
Probab=36.76 E-value=24 Score=24.95 Aligned_cols=30 Identities=17% Similarity=0.502 Sum_probs=20.2
Q ss_pred ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
.|+.|+..|.+. ....|.|+.|+..+....
T Consensus 5 ~CP~C~seytY~-------------d~~~~vCpeC~hEW~~~~ 34 (111)
T PRK10220 5 HCPKCNSEYTYE-------------DNGMYICPECAHEWNDAE 34 (111)
T ss_pred cCCCCCCcceEc-------------CCCeEECCcccCcCCccc
Confidence 488888766552 234588999988776553
No 236
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=36.52 E-value=12 Score=28.37 Aligned_cols=11 Identities=36% Similarity=1.035 Sum_probs=5.3
Q ss_pred ceecCcCcccc
Q psy5202 213 PYKCELCSYAC 223 (343)
Q Consensus 213 ~~~C~~C~~~f 223 (343)
.|+|..|+..+
T Consensus 112 ~y~C~~C~~~~ 122 (146)
T smart00731 112 PYRCTGCGQRY 122 (146)
T ss_pred EEECCCCCCCC
Confidence 35555555443
No 237
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=36.30 E-value=38 Score=32.83 Aligned_cols=12 Identities=17% Similarity=0.368 Sum_probs=5.6
Q ss_pred ccCCCCCCCCCh
Q psy5202 245 RCRFCEMPFSVP 256 (343)
Q Consensus 245 ~C~~C~~~f~~~ 256 (343)
.|+.||......
T Consensus 43 fC~~CG~~~~~~ 54 (645)
T PRK14559 43 HCPNCGAETGTI 54 (645)
T ss_pred cccccCCcccch
Confidence 355555544433
No 238
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=35.78 E-value=18 Score=21.38 Aligned_cols=39 Identities=21% Similarity=0.470 Sum_probs=18.8
Q ss_pred eecCcCcccccChHHHHHHH-HHhc--CCCCcccccCCCCCC
Q psy5202 214 YKCELCSYACAQSSKLTRHM-KTHG--RLGKDVYRCRFCEMP 252 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~-~~h~--~~~~~~~~C~~C~~~ 252 (343)
|+|..|++.+.-...-..+- .--+ ..-..-|.|+.|+..
T Consensus 2 y~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~w~CP~C~a~ 43 (47)
T PF00301_consen 2 YQCPVCGYVYDPEKGDPENGIPPGTPFEDLPDDWVCPVCGAP 43 (47)
T ss_dssp EEETTTSBEEETTTBBGGGTB-TT--GGGS-TT-B-TTTSSB
T ss_pred cCCCCCCEEEcCCcCCcccCcCCCCCHHHCCCCCcCcCCCCc
Confidence 89999998886543211110 0000 001234899999853
No 239
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=35.75 E-value=14 Score=27.11 Aligned_cols=10 Identities=30% Similarity=0.451 Sum_probs=4.6
Q ss_pred CcccccCcccc
Q psy5202 185 NDTCEFCGKVF 195 (343)
Q Consensus 185 ~~~C~~C~~~f 195 (343)
.+.| .||..|
T Consensus 70 ~~~C-~Cg~~~ 79 (124)
T PRK00762 70 EIEC-ECGYEG 79 (124)
T ss_pred eEEe-eCcCcc
Confidence 3445 455444
No 240
>KOG1994|consensus
Probab=35.35 E-value=18 Score=29.25 Aligned_cols=22 Identities=27% Similarity=0.670 Sum_probs=18.4
Q ss_pred cccccCCCCCCCCChHHHHHHH
Q psy5202 242 DVYRCRFCEMPFSVPSTLEKHM 263 (343)
Q Consensus 242 ~~~~C~~C~~~f~~~~~l~~H~ 263 (343)
..|.|-+||..|.+...|..|.
T Consensus 238 eh~YC~fCG~~y~~~edl~ehC 259 (268)
T KOG1994|consen 238 EHYYCFFCGIKYKDEEDLYEHC 259 (268)
T ss_pred cceEEEEeccccCCHHHHHHhC
Confidence 3488999999999999998884
No 241
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=35.34 E-value=18 Score=20.47 Aligned_cols=11 Identities=27% Similarity=0.748 Sum_probs=5.7
Q ss_pred ceecCcCcccc
Q psy5202 213 PYKCELCSYAC 223 (343)
Q Consensus 213 ~~~C~~C~~~f 223 (343)
+-.|++|+..|
T Consensus 29 ~~~CpYCg~~y 39 (40)
T PF10276_consen 29 PVVCPYCGTRY 39 (40)
T ss_dssp EEEETTTTEEE
T ss_pred eEECCCCCCEE
Confidence 34555555544
No 242
>CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed
Probab=35.24 E-value=19 Score=30.88 Aligned_cols=33 Identities=30% Similarity=0.375 Sum_probs=20.2
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS 226 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 226 (343)
..+|+.|+..+..+. |. +.-+.|+.|++.|.-.
T Consensus 38 w~kc~~C~~~~~~~~-l~--------~~~~vcp~c~~h~rlt 70 (296)
T CHL00174 38 WVQCENCYGLNYKKF-LK--------SKMNICEQCGYHLKMS 70 (296)
T ss_pred eeECCCccchhhHHH-HH--------HcCCCCCCCCCCcCCC
Confidence 457888887764332 11 2336788888877643
No 243
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=34.89 E-value=40 Score=29.33 Aligned_cols=29 Identities=17% Similarity=0.460 Sum_probs=17.0
Q ss_pred hccCCCCeecCCCC-ccCCCHHHHHHHHHH
Q psy5202 113 NHTGEKPFKCTVCD-HACSQSAKLKKHMKI 141 (343)
Q Consensus 113 ~h~~~~~~~C~~C~-~~f~~~~~L~~H~~~ 141 (343)
.|.-.+.|.|.+|| +.+..+..+.+|...
T Consensus 368 lhgLd~ef~CEICgNyvy~GR~~FdrHF~E 397 (470)
T COG5188 368 LHGLDIEFECEICGNYVYYGRDRFDRHFEE 397 (470)
T ss_pred hcCCCcceeeeecccccccchHHHHhhhhh
Confidence 34445566666666 566666666666543
No 244
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=34.88 E-value=7.3 Score=22.01 Aligned_cols=10 Identities=30% Similarity=0.989 Sum_probs=3.8
Q ss_pred CceecCcCcc
Q psy5202 212 KPYKCELCSY 221 (343)
Q Consensus 212 ~~~~C~~C~~ 221 (343)
+.|.|.+|+.
T Consensus 23 ~~w~C~~C~~ 32 (40)
T PF04810_consen 23 KTWICNFCGT 32 (40)
T ss_dssp TEEEETTT--
T ss_pred CEEECcCCCC
Confidence 3444544443
No 245
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=34.73 E-value=14 Score=27.96 Aligned_cols=28 Identities=25% Similarity=0.489 Sum_probs=17.4
Q ss_pred ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP 252 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 252 (343)
.|.|..||..... .| ....-.|+.||..
T Consensus 112 ~l~C~~Cg~~~~~---------~~---~~~l~~Cp~C~~~ 139 (146)
T PF07295_consen 112 TLVCENCGHEVEL---------TH---PERLPPCPKCGHT 139 (146)
T ss_pred eEecccCCCEEEe---------cC---CCcCCCCCCCCCC
Confidence 4889999865432 12 2344579999864
No 246
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=34.11 E-value=14 Score=25.82 Aligned_cols=26 Identities=23% Similarity=0.332 Sum_probs=20.6
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhh
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
|-..|.|-.|.+-|.+...|..|.+-
T Consensus 52 GlGqhYCieCaryf~t~~aL~~Hkkg 77 (126)
T COG5112 52 GLGQHYCIECARYFITEKALMEHKKG 77 (126)
T ss_pred CCceeeeehhHHHHHHHHHHHHHhcc
Confidence 33457888899999999999998873
No 247
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=33.75 E-value=23 Score=18.32 Aligned_cols=11 Identities=36% Similarity=0.906 Sum_probs=3.5
Q ss_pred cccccCCCCCC
Q psy5202 242 DVYRCRFCEMP 252 (343)
Q Consensus 242 ~~~~C~~C~~~ 252 (343)
..|.|..|+..
T Consensus 14 ~~Y~C~~Cdf~ 24 (30)
T PF07649_consen 14 WFYRCSECDFD 24 (30)
T ss_dssp -EEE-TTT---
T ss_pred ceEECccCCCc
Confidence 34566666543
No 248
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=33.67 E-value=34 Score=31.47 Aligned_cols=23 Identities=22% Similarity=0.354 Sum_probs=13.9
Q ss_pred CcccccCcccccCHHHHHHHHHH
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRS 207 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~ 207 (343)
-+.|+.|.+.|.....+..|+..
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~ 79 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQ 79 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHH
Confidence 45566666666666666666553
No 249
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.54 E-value=22 Score=25.70 Aligned_cols=58 Identities=19% Similarity=0.222 Sum_probs=33.8
Q ss_pred HHHHHHHHHccCCC-ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 199 SNLTVHRRSHTGEK-PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 199 ~~l~~H~~~h~~~~-~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
..|..-...+.|+. ..+|+.|.....-...+..=+-.-+ .-+.|-.|..||+.|.+..
T Consensus 24 pel~eafcskcgeati~qcp~csasirgd~~vegvlglg~-dye~psfchncgs~fpwte 82 (160)
T COG4306 24 PELMEAFCSKCGEATITQCPICSASIRGDYYVEGVLGLGG-DYEPPSFCHNCGSRFPWTE 82 (160)
T ss_pred HHHHHHHHhhhchHHHhcCCccCCcccccceeeeeeccCC-CCCCcchhhcCCCCCCcHH
Confidence 34444444555543 3579999876655433322222221 2355677999999998764
No 250
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=33.44 E-value=5.3 Score=38.16 Aligned_cols=45 Identities=27% Similarity=0.595 Sum_probs=22.6
Q ss_pred cccccccccCChHHHHHHHHhccCCCCe-ecCCCCccCCCHHHHHHH
Q psy5202 93 ECEFCGKKFKYENTLQAHRRNHTGEKPF-KCTVCDHACSQSAKLKKH 138 (343)
Q Consensus 93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~-~C~~C~~~f~~~~~L~~H 138 (343)
.|..||-+|+-...|-.- |.++....| .|+.|.+.|....+-+-|
T Consensus 125 ~CT~CGPRfTIi~alPYD-R~nTsM~~F~lC~~C~~EY~dP~nRRfH 170 (750)
T COG0068 125 NCTNCGPRFTIIEALPYD-RENTSMADFPLCPFCDKEYKDPLNRRFH 170 (750)
T ss_pred ccCCCCcceeeeccCCCC-cccCccccCcCCHHHHHHhcCccccccc
Confidence 377777777654443211 111222222 477777766666554444
No 251
>COG4640 Predicted membrane protein [Function unknown]
Probab=33.41 E-value=29 Score=30.73 Aligned_cols=18 Identities=17% Similarity=0.471 Sum_probs=9.7
Q ss_pred cccCCCCCCCCChHHHHH
Q psy5202 244 YRCRFCEMPFSVPSTLEK 261 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~ 261 (343)
++|+.||..|.....+..
T Consensus 16 ~qC~qCG~~~t~~~sqan 33 (465)
T COG4640 16 VQCTQCGHKFTSRQSQAN 33 (465)
T ss_pred ccccccCCcCCchhhhhh
Confidence 336666666655554443
No 252
>PTZ00064 histone acetyltransferase; Provisional
Probab=33.33 E-value=26 Score=32.21 Aligned_cols=29 Identities=21% Similarity=0.537 Sum_probs=24.6
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVV 268 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 268 (343)
-...|.|++|-+-|.....|.+|+.+|..
T Consensus 277 ~d~LYICEfCLkY~~s~~~l~rH~~~C~~ 305 (552)
T PTZ00064 277 VDTLHFCEYCLDFFCFEDELIRHLSRCQL 305 (552)
T ss_pred CCeEEEccchhhhhCCHHHHHHHHhcCCC
Confidence 35679999999999999999999986543
No 253
>KOG0978|consensus
Probab=33.09 E-value=15 Score=35.41 Aligned_cols=20 Identities=20% Similarity=0.471 Sum_probs=13.2
Q ss_pred ccccCCCCCCCCChHHHHHH
Q psy5202 243 VYRCRFCEMPFSVPSTLEKH 262 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H 262 (343)
.-+||.|+..|.-..-+..|
T Consensus 678 qRKCP~Cn~aFganDv~~I~ 697 (698)
T KOG0978|consen 678 QRKCPKCNAAFGANDVHRIH 697 (698)
T ss_pred cCCCCCCCCCCCcccccccC
Confidence 34688888888766554444
No 254
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=33.09 E-value=23 Score=24.72 Aligned_cols=18 Identities=28% Similarity=0.586 Sum_probs=13.8
Q ss_pred HccCCCceecCcCcccccC
Q psy5202 207 SHTGEKPYKCELCSYACAQ 225 (343)
Q Consensus 207 ~h~~~~~~~C~~C~~~f~~ 225 (343)
.+.+ ++++|..||..|.-
T Consensus 74 l~~g-~~~rC~eCG~~fkL 91 (97)
T cd00924 74 LEKG-KPKRCPECGHVFKL 91 (97)
T ss_pred EeCC-CceeCCCCCcEEEE
Confidence 3555 78999999988863
No 255
>TIGR00515 accD acetyl-CoA carboxylase, carboxyl transferase, beta subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the beta chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=32.99 E-value=22 Score=30.49 Aligned_cols=33 Identities=21% Similarity=0.398 Sum_probs=20.1
Q ss_pred CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202 185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS 226 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 226 (343)
..+|+.|+..+..+. |. ..-+.|+.|++.|.-.
T Consensus 26 ~~~c~~c~~~~~~~~-l~--------~~~~vc~~c~~h~rl~ 58 (285)
T TIGR00515 26 WTKCPKCGQVLYTKE-LE--------RNLEVCPKCDHHMRMD 58 (285)
T ss_pred eeECCCCcchhhHHH-HH--------hhCCCCCCCCCcCcCC
Confidence 567888887764332 21 1236788888877643
No 256
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=32.90 E-value=6.9 Score=25.49 Aligned_cols=59 Identities=22% Similarity=0.317 Sum_probs=28.8
Q ss_pred CcccccCcccccCHH---HHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202 185 NDTCEFCGKVFKNCS---NLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF 253 (343)
Q Consensus 185 ~~~C~~C~~~f~~~~---~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f 253 (343)
.++|+-|++.|.-.. .+..|...........|..|+..+..... .. + ..|.|+.|+..|
T Consensus 10 ~~~~~cC~~~y~C~~CHde~~~H~~~~~~~~~v~Cg~C~~~~~~~~~-------~c--~-~~~~C~~C~~~~ 71 (71)
T PF05495_consen 10 AIRFPCCGKYYPCRFCHDELEDHPFDRWPVKRVICGKCRTEQPIDEY-------SC--G-ADYFCPICGLYF 71 (71)
T ss_dssp EEEETTTTEEESSHHHHHHCSSS---TTT--EEEETTT--EEES-SB-------TT-----SEEETTTTEEE
T ss_pred EEECCcccCeecHHHHHHHhccCccccccccCeECCCCCCccChhhh-------hc--C-CCccCcCcCCCC
Confidence 356667777776541 11122222233346789999887765444 11 2 458898887543
No 257
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=32.79 E-value=19 Score=21.63 Aligned_cols=37 Identities=19% Similarity=0.414 Sum_probs=20.2
Q ss_pred eecCcCcccccChHHHHHHHHHhcC----CCCcccccCCCCC
Q psy5202 214 YKCELCSYACAQSSKLTRHMKTHGR----LGKDVYRCRFCEM 251 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~h~~----~~~~~~~C~~C~~ 251 (343)
|+|..||+.+.-...-..+- +-.+ .-...|.|+.|+.
T Consensus 2 y~C~~CgyiYd~~~Gd~~~~-i~pGt~f~~Lp~~w~CP~C~a 42 (50)
T cd00730 2 YECRICGYIYDPAEGDPDEG-IPPGTPFEDLPDDWVCPVCGA 42 (50)
T ss_pred cCCCCCCeEECCCCCCcccC-cCCCCCHhHCCCCCCCCCCCC
Confidence 89999999887532111100 0000 0123489999985
No 258
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=32.39 E-value=18 Score=20.75 Aligned_cols=14 Identities=29% Similarity=0.800 Sum_probs=8.0
Q ss_pred CceecCcCcccccC
Q psy5202 212 KPYKCELCSYACAQ 225 (343)
Q Consensus 212 ~~~~C~~C~~~f~~ 225 (343)
.+|.|..|+..|=.
T Consensus 12 ~~~~C~~C~~~FC~ 25 (43)
T PF01428_consen 12 LPFKCKHCGKSFCL 25 (43)
T ss_dssp SHEE-TTTS-EE-T
T ss_pred CCeECCCCCcccCc
Confidence 46788888887754
No 259
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=32.24 E-value=10 Score=28.63 Aligned_cols=14 Identities=14% Similarity=0.373 Sum_probs=7.4
Q ss_pred eecCCCCccCCCHH
Q psy5202 120 FKCTVCDHACSQSA 133 (343)
Q Consensus 120 ~~C~~C~~~f~~~~ 133 (343)
-.|..|+..|.+..
T Consensus 29 ReC~~C~~RFTTfE 42 (156)
T COG1327 29 RECLECGERFTTFE 42 (156)
T ss_pred hcccccccccchhh
Confidence 34555555555543
No 260
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=32.04 E-value=25 Score=32.00 Aligned_cols=28 Identities=32% Similarity=0.713 Sum_probs=22.7
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
....|.|++|-+-|.....|.+|+.+|.
T Consensus 195 ~~~lyiCe~Cl~y~~~~~~~~~H~~~C~ 222 (450)
T PLN00104 195 CSKLYFCEFCLKFMKRKEQLQRHMKKCD 222 (450)
T ss_pred CCeEEEchhhhhhhcCHHHHHHHHhcCC
Confidence 3467889999999999999999987554
No 261
>KOG0717|consensus
Probab=31.82 E-value=26 Score=31.85 Aligned_cols=22 Identities=23% Similarity=0.536 Sum_probs=20.0
Q ss_pred cccCCCCCCCCChHHHHHHHhh
Q psy5202 244 YRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 244 ~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
+.|.+|.+.|.+...|..|...
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 7899999999999999999863
No 262
>KOG0978|consensus
Probab=31.77 E-value=17 Score=35.04 Aligned_cols=49 Identities=18% Similarity=0.384 Sum_probs=29.8
Q ss_pred CcccccccccccCChHHH-HHHH------HhccCCCCeecCCCCccCCCHHHHHHH
Q psy5202 90 KNKECEFCGKKFKYENTL-QAHR------RNHTGEKPFKCTVCDHACSQSAKLKKH 138 (343)
Q Consensus 90 ~~~~C~~C~~~f~~~~~L-~~H~------~~h~~~~~~~C~~C~~~f~~~~~L~~H 138 (343)
...+|+.|+..+...--. --|. +.-...+.-+||.|+..|....-+..|
T Consensus 642 ~~LkCs~Cn~R~Kd~vI~kC~H~FC~~Cvq~r~etRqRKCP~Cn~aFganDv~~I~ 697 (698)
T KOG0978|consen 642 ELLKCSVCNTRWKDAVITKCGHVFCEECVQTRYETRQRKCPKCNAAFGANDVHRIH 697 (698)
T ss_pred hceeCCCccCchhhHHHHhcchHHHHHHHHHHHHHhcCCCCCCCCCCCcccccccC
Confidence 347899999666543221 1231 111233556899999999887655544
No 263
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=31.65 E-value=14 Score=22.46 Aligned_cols=13 Identities=23% Similarity=0.690 Sum_probs=7.0
Q ss_pred ccCCCCCCCCChH
Q psy5202 245 RCRFCEMPFSVPS 257 (343)
Q Consensus 245 ~C~~C~~~f~~~~ 257 (343)
.||+|++.|....
T Consensus 22 ~CPlC~r~l~~e~ 34 (54)
T PF04423_consen 22 CCPLCGRPLDEEH 34 (54)
T ss_dssp E-TTT--EE-HHH
T ss_pred cCCCCCCCCCHHH
Confidence 8999999987643
No 264
>PF11672 DUF3268: Protein of unknown function (DUF3268); InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=31.21 E-value=18 Score=25.44 Aligned_cols=7 Identities=43% Similarity=1.307 Sum_probs=4.2
Q ss_pred cccCCCC
Q psy5202 244 YRCRFCE 250 (343)
Q Consensus 244 ~~C~~C~ 250 (343)
|.|.-|+
T Consensus 32 y~C~~C~ 38 (102)
T PF11672_consen 32 YVCTPCD 38 (102)
T ss_pred EECCCCC
Confidence 5666665
No 265
>PRK05654 acetyl-CoA carboxylase subunit beta; Validated
Probab=30.97 E-value=23 Score=30.48 Aligned_cols=34 Identities=24% Similarity=0.384 Sum_probs=21.5
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS 226 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 226 (343)
-..+|+.|+..+..+. |. ..-+.|+.|++.|.-.
T Consensus 26 ~~~~c~~c~~~~~~~~-l~--------~~~~vc~~c~~h~rl~ 59 (292)
T PRK05654 26 LWTKCPSCGQVLYRKE-LE--------ANLNVCPKCGHHMRIS 59 (292)
T ss_pred CeeECCCccchhhHHH-HH--------hcCCCCCCCCCCeeCC
Confidence 3567999987764332 21 1236899999888643
No 266
>PHA02998 RNA polymerase subunit; Provisional
Probab=30.57 E-value=8.2 Score=29.85 Aligned_cols=12 Identities=17% Similarity=0.351 Sum_probs=7.8
Q ss_pred eecCcCcccccC
Q psy5202 214 YKCELCSYACAQ 225 (343)
Q Consensus 214 ~~C~~C~~~f~~ 225 (343)
|.|..||+.|.-
T Consensus 172 YkC~~CG~~wkp 183 (195)
T PHA02998 172 HACRDCKKHFKP 183 (195)
T ss_pred EEcCCCCCccCC
Confidence 677777766643
No 267
>PLN02294 cytochrome c oxidase subunit Vb
Probab=30.53 E-value=26 Score=27.06 Aligned_cols=15 Identities=20% Similarity=0.636 Sum_probs=12.1
Q ss_pred CCceecCcCcccccC
Q psy5202 211 EKPYKCELCSYACAQ 225 (343)
Q Consensus 211 ~~~~~C~~C~~~f~~ 225 (343)
.++++|+.||..|.-
T Consensus 139 Gkp~RCpeCG~~fkL 153 (174)
T PLN02294 139 GKSFECPVCTQYFEL 153 (174)
T ss_pred CCceeCCCCCCEEEE
Confidence 468899999988864
No 268
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=30.52 E-value=11 Score=32.32 Aligned_cols=9 Identities=33% Similarity=0.951 Sum_probs=3.2
Q ss_pred cccccCccc
Q psy5202 186 DTCEFCGKV 194 (343)
Q Consensus 186 ~~C~~C~~~ 194 (343)
-.|++||..
T Consensus 173 g~CPvCGs~ 181 (290)
T PF04216_consen 173 GYCPVCGSP 181 (290)
T ss_dssp SS-TTT---
T ss_pred CcCCCCCCc
Confidence 367777754
No 269
>PRK00420 hypothetical protein; Validated
Probab=30.38 E-value=25 Score=25.25 Aligned_cols=14 Identities=29% Similarity=0.496 Sum_probs=10.3
Q ss_pred CCcccccCcccccC
Q psy5202 184 RNDTCEFCGKVFKN 197 (343)
Q Consensus 184 ~~~~C~~C~~~f~~ 197 (343)
....|+.||..+.-
T Consensus 39 g~~~Cp~Cg~~~~v 52 (112)
T PRK00420 39 GEVVCPVHGKVYIV 52 (112)
T ss_pred CceECCCCCCeeee
Confidence 55789999986654
No 270
>KOG2747|consensus
Probab=30.22 E-value=21 Score=31.82 Aligned_cols=29 Identities=28% Similarity=0.794 Sum_probs=23.2
Q ss_pred CcccccCCCCCCCCChHHHHHHHhhhcCc
Q psy5202 241 KDVYRCRFCEMPFSVPSTLEKHMRKCVVN 269 (343)
Q Consensus 241 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 269 (343)
...|.|.+|-+-+.+...|.+|+++|...
T Consensus 156 ~~lYiCEfCLkY~~s~~~l~rH~~kC~~r 184 (396)
T KOG2747|consen 156 DKLYICEFCLKYMKSRTSLQRHLKKCKLR 184 (396)
T ss_pred CeEEEehHHHhHhchHHHHHHHHHhcCCC
Confidence 46688888888888888888898876544
No 271
>TIGR00310 ZPR1_znf ZPR1 zinc finger domain.
Probab=29.92 E-value=6 Score=31.57 Aligned_cols=11 Identities=18% Similarity=0.461 Sum_probs=5.2
Q ss_pred cccCCCCCCCC
Q psy5202 244 YRCRFCEMPFS 254 (343)
Q Consensus 244 ~~C~~C~~~f~ 254 (343)
|.|+.||++..
T Consensus 31 f~C~~CGyr~~ 41 (192)
T TIGR00310 31 TICEHCGYRSN 41 (192)
T ss_pred EECCCCCCccc
Confidence 44555554443
No 272
>KOG3362|consensus
Probab=29.92 E-value=20 Score=26.72 Aligned_cols=24 Identities=21% Similarity=0.468 Sum_probs=18.9
Q ss_pred cccccCCCCCCCCChHHHHHHHhh
Q psy5202 242 DVYRCRFCEMPFSVPSTLEKHMRK 265 (343)
Q Consensus 242 ~~~~C~~C~~~f~~~~~l~~H~~~ 265 (343)
.+|.|.-||-.+-+..-|..|..+
T Consensus 128 S~ysC~~CG~kyCsv~C~~~HneT 151 (156)
T KOG3362|consen 128 SKYSCVNCGTKYCSVRCLKTHNET 151 (156)
T ss_pred chhHHHhcCCceeechhhhhcccc
Confidence 357888888888888888888754
No 273
>KOG0717|consensus
Probab=29.83 E-value=32 Score=31.24 Aligned_cols=22 Identities=18% Similarity=0.489 Sum_probs=20.1
Q ss_pred eecCcCcccccChHHHHHHHHH
Q psy5202 214 YKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 214 ~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
+.|.+|.+.|.+...|..|...
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 7799999999999999999754
No 274
>PF10537 WAC_Acf1_DNA_bd: ATP-utilising chromatin assembly and remodelling N-terminal; InterPro: IPR013136 ACF (for ATP-utilising chromatin assembly and remodeling factor) is a chromatin-remodeling complex that catalyzes the ATP-dependent assembly of periodic nucleosome arrays. This reaction utilises the energy of ATP hydrolysis by ISWI, the smaller of the two subunits of ACF. Acf1, the large subunit of ACF, is essential for the full activity of the complex. The WAC (WSTF/Acf1/cbp146) domain is an ~110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. It is found in association with other domains such as the bromodomain, the PHD-type zinc finger, DDT or WAKS. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA. It seems probable that the WAC domain will be involved in DNA binding in other related factors [, ]. Some proteins known to contain a WAC domain are the Drosophila melanogaster (Fruit fly) ATP-dependent chromatin assembly factor large subunit Acf1, human WSTF (Williams syndrome transcription factor), mouse cbp146, yeast imitation switch two complex protein 1 (ITC1 or YGL133w), and yeast protein YPL216w.
Probab=29.81 E-value=80 Score=22.30 Aligned_cols=36 Identities=8% Similarity=0.157 Sum_probs=22.0
Q ss_pred ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM 251 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~ 251 (343)
.|.|+.=|..|.+-..+...+... ..+.|.|...|+
T Consensus 3 Vf~i~~T~EiF~dYe~Y~~R~~~y---~~~vwtC~~TGk 38 (102)
T PF10537_consen 3 VFYIPFTGEIFRDYEEYLKRMILY---NQRVWTCEITGK 38 (102)
T ss_pred eEEeCCCCcccCCHHHHHHHHHHH---hCCeeEEecCCC
Confidence 455666666666666666665555 355666666665
No 275
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=29.74 E-value=18 Score=28.93 Aligned_cols=27 Identities=26% Similarity=0.609 Sum_probs=0.0
Q ss_pred ccCCCceecCcCcc-cccChHHHHHHHH
Q psy5202 208 HTGEKPYKCELCSY-ACAQSSKLTRHMK 234 (343)
Q Consensus 208 h~~~~~~~C~~C~~-~f~~~~~L~~H~~ 234 (343)
|.-...|.|.+||- +|.-+..+.+|..
T Consensus 96 hGL~~ey~CEICGN~~Y~GrkaFekHF~ 123 (196)
T PF11931_consen 96 HGLGVEYKCEICGNQSYKGRKAFEKHFQ 123 (196)
T ss_dssp ----------------------------
T ss_pred hCCCCeeeeEeCCCcceecHHHHHHhcC
Confidence 33345577777774 4556667777765
No 276
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=29.68 E-value=11 Score=19.43 Aligned_cols=19 Identities=26% Similarity=0.567 Sum_probs=10.4
Q ss_pred ccccccccccCChHHHHHHH
Q psy5202 92 KECEFCGKKFKYENTLQAHR 111 (343)
Q Consensus 92 ~~C~~C~~~f~~~~~L~~H~ 111 (343)
|.|-.|++.| .....+.|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 4577777777 344455553
No 277
>PF08882 Acetone_carb_G: Acetone carboxylase gamma subunit; InterPro: IPR014979 Acetone carboxylase is the key enzyme of bacterial acetone metabolism, catalysing the condensation of acetone and CO2 to form acetoacetate [] according to the following reaction: CH3COCH3 + CO2 + ATP = CH3COCH2COO- + AMP + 2P(i) + H+ It has the subunit composition: (alpha(2)beta(2)gamma(2) multimers of 85kDa, 78kDa, and 20kDa subunits). It is expressed to high levels (17 to 25% of soluble protein) in cells grown with acetone as the carbon source but are not present at detectable levels in cells grown with other carbon sources []. Acetone carboxylase may enable Helicobacter pylori to survive off acetone in the stomach of humans and other mammals where it is the etiological agent of peptic ulcer disease []. This entry represents the family of gamma subunit-related acetone carboxylase proteins.
Probab=29.49 E-value=17 Score=25.80 Aligned_cols=37 Identities=16% Similarity=0.404 Sum_probs=23.4
Q ss_pred cccccccccccCC-hHHHHHHHHhccCC----------------------CCeecCCCCcc
Q psy5202 91 NKECEFCGKKFKY-ENTLQAHRRNHTGE----------------------KPFKCTVCDHA 128 (343)
Q Consensus 91 ~~~C~~C~~~f~~-~~~L~~H~~~h~~~----------------------~~~~C~~C~~~ 128 (343)
.-+| .||..|-. +.+.+.|...+-.+ +.|-|+.||..
T Consensus 24 ~vkc-~CGh~f~d~r~NwK~~alv~vRd~~E~~~~iYp~~~aPdp~w~~irEyyCP~Cgt~ 83 (112)
T PF08882_consen 24 VVKC-DCGHEFCDARENWKLGALVYVRDPEEIHPEIYPFTMAPDPEWQVIREYYCPGCGTQ 83 (112)
T ss_pred eeec-cCCCeecChhcChhhCcEEEecChHHhhhhhcccccCCCCCcEEEEEEECCCCcce
Confidence 3445 49999874 45566665544221 33889999864
No 278
>KOG1842|consensus
Probab=29.42 E-value=28 Score=31.35 Aligned_cols=29 Identities=17% Similarity=0.401 Sum_probs=21.3
Q ss_pred ccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRKCVVNQN 271 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~ 271 (343)
-|.||+|.+.|...+.|..|+..-|....
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~~ed 43 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHFEED 43 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhccccc
Confidence 47788888888888888888776666554
No 279
>KOG0320|consensus
Probab=29.09 E-value=34 Score=26.71 Aligned_cols=15 Identities=20% Similarity=0.521 Sum_probs=9.7
Q ss_pred CCcccccCcccccCH
Q psy5202 184 RNDTCEFCGKVFKNC 198 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~ 198 (343)
...+|+.|++....+
T Consensus 166 ~~~~CP~C~kkIt~k 180 (187)
T KOG0320|consen 166 NTNKCPTCRKKITHK 180 (187)
T ss_pred hCCCCCCcccccchh
Confidence 445788888755443
No 280
>PF03145 Sina: Seven in absentia protein family; InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=28.45 E-value=35 Score=27.36 Aligned_cols=46 Identities=20% Similarity=0.247 Sum_probs=17.3
Q ss_pred CcccccChHHHHHHHHHhcCCCCcccccCC----CCCCCCChHHHHHHHhhhcCc
Q psy5202 219 CSYACAQSSKLTRHMKTHGRLGKDVYRCRF----CEMPFSVPSTLEKHMRKCVVN 269 (343)
Q Consensus 219 C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~----C~~~f~~~~~l~~H~~~~~~~ 269 (343)
|...+.. .....|.+.. .-+|+.|+. |+..= ....|..|++..|..
T Consensus 24 C~~~~~~-~~~~~HE~~C---~~~p~~CP~~~~~C~~~G-~~~~l~~Hl~~~H~~ 73 (198)
T PF03145_consen 24 CTETFPY-SEKREHEEEC---PFRPCSCPFPGSGCDWQG-SYKELLDHLRDKHSW 73 (198)
T ss_dssp ---EE-G-GGHHHHHHT----TTSEEE-SSSSTT---EE-ECCCHHHHHHHHTTT
T ss_pred Ccccccc-cChhhHhccC---CCcCCcCCCCCCCccccC-CHHHHHHHHHHHCCC
Confidence 4444322 3444555554 245555555 43321 223455666555543
No 281
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=28.28 E-value=51 Score=20.34 Aligned_cols=32 Identities=19% Similarity=0.388 Sum_probs=17.9
Q ss_pred CCcccccccccccCChHHHHHHHHhccCCCCeecCC
Q psy5202 89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTV 124 (343)
Q Consensus 89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~ 124 (343)
+.|++...|+-.|.. ..+...+ ...+...|++
T Consensus 22 ~~PV~s~~C~H~fek-~aI~~~i---~~~~~~~CPv 53 (57)
T PF11789_consen 22 EDPVKSKKCGHTFEK-EAILQYI---QRNGSKRCPV 53 (57)
T ss_dssp SSEEEESSS--EEEH-HHHHHHC---TTTS-EE-SC
T ss_pred hCCcCcCCCCCeecH-HHHHHHH---HhcCCCCCCC
Confidence 457777888888874 4444444 2446688887
No 282
>KOG4727|consensus
Probab=27.87 E-value=30 Score=26.63 Aligned_cols=22 Identities=18% Similarity=0.461 Sum_probs=19.7
Q ss_pred ccccCCCCCCCCChHHHHHHHh
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMR 264 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~ 264 (343)
-|.|.+|+-.|....++.-|+.
T Consensus 75 GyyCdVCdcvvKDSinflDHiN 96 (193)
T KOG4727|consen 75 GYYCDVCDCVVKDSINFLDHIN 96 (193)
T ss_pred ceeeeecceeehhhHHHHHHhc
Confidence 4899999999999999999984
No 283
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=27.83 E-value=29 Score=25.68 Aligned_cols=47 Identities=19% Similarity=0.361 Sum_probs=25.7
Q ss_pred HHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202 201 LTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 201 l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~ 256 (343)
...++++..+. -.|+.|+.....+... .. .+...|+|..|++.|...
T Consensus 20 ~~~~~~~~~~~--~~cP~C~s~~~~k~g~--~~-----~~~qRyrC~~C~~tf~~~ 66 (129)
T COG3677 20 AAYAIRMQITK--VNCPRCKSSNVVKIGG--IR-----RGHQRYKCKSCGSTFTVE 66 (129)
T ss_pred HHHHHhhhccc--CcCCCCCccceeeECC--cc-----ccccccccCCcCcceeee
Confidence 34445554432 3588887443111110 00 135669999999998764
No 284
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=27.22 E-value=23 Score=19.56 Aligned_cols=13 Identities=31% Similarity=0.777 Sum_probs=7.2
Q ss_pred CcccccccccccC
Q psy5202 90 KNKECEFCGKKFK 102 (343)
Q Consensus 90 ~~~~C~~C~~~f~ 102 (343)
+-|+|..||....
T Consensus 5 ~~YkC~~CGniVe 17 (36)
T PF06397_consen 5 EFYKCEHCGNIVE 17 (36)
T ss_dssp EEEE-TTT--EEE
T ss_pred cEEEccCCCCEEE
Confidence 4689999987653
No 285
>PF10122 Mu-like_Com: Mu-like prophage protein Com; InterPro: IPR019294 Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ].
Probab=27.18 E-value=22 Score=21.30 Aligned_cols=9 Identities=22% Similarity=0.700 Sum_probs=4.8
Q ss_pred cccCCCCCC
Q psy5202 244 YRCRFCEMP 252 (343)
Q Consensus 244 ~~C~~C~~~ 252 (343)
.+|+.|+..
T Consensus 25 IKCpRC~ti 33 (51)
T PF10122_consen 25 IKCPRCKTI 33 (51)
T ss_pred EECCCCCcc
Confidence 456666543
No 286
>PTZ00448 hypothetical protein; Provisional
Probab=27.17 E-value=45 Score=29.46 Aligned_cols=25 Identities=20% Similarity=0.465 Sum_probs=21.9
Q ss_pred ccccCCCCCCCCChHHHHHHHhhhc
Q psy5202 243 VYRCRFCEMPFSVPSTLEKHMRKCV 267 (343)
Q Consensus 243 ~~~C~~C~~~f~~~~~l~~H~~~~~ 267 (343)
.|.|..|+..|.+....+.|+|+-+
T Consensus 314 ~~tC~~C~v~F~~~~~qR~H~KSDw 338 (373)
T PTZ00448 314 MLLCRKCNIQLMDHNAFKQHYRSEW 338 (373)
T ss_pred CccccccccccCCHHHHHHHhhhhH
Confidence 6889999999999999999998633
No 287
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=26.94 E-value=14 Score=20.42 Aligned_cols=13 Identities=38% Similarity=1.048 Sum_probs=6.1
Q ss_pred HccCCCceecCcC
Q psy5202 207 SHTGEKPYKCELC 219 (343)
Q Consensus 207 ~h~~~~~~~C~~C 219 (343)
...|...|+|..|
T Consensus 23 ~~~G~qryrC~~C 35 (36)
T PF03811_consen 23 SPSGHQRYRCKDC 35 (36)
T ss_pred CCCCCEeEecCcC
Confidence 3334444555554
No 288
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=26.33 E-value=38 Score=23.60 Aligned_cols=27 Identities=22% Similarity=0.523 Sum_probs=18.3
Q ss_pred cccccccccccCChHHHHHHHHhccCCCCeecCCCCc
Q psy5202 91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDH 127 (343)
Q Consensus 91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~ 127 (343)
+++|..||..|.+-..... --|+.||.
T Consensus 2 pH~CtrCG~vf~~g~~~il----------~GCp~CG~ 28 (112)
T COG3364 2 PHQCTRCGEVFDDGSEEIL----------SGCPKCGC 28 (112)
T ss_pred CceecccccccccccHHHH----------ccCccccc
Confidence 6789999999987543321 24777875
No 289
>KOG0782|consensus
Probab=26.12 E-value=11 Score=34.89 Aligned_cols=53 Identities=19% Similarity=0.365 Sum_probs=35.8
Q ss_pred HHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202 198 CSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS 257 (343)
Q Consensus 198 ~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 257 (343)
...|.+|--+|.....-+|..|++.|..+..+ |. ...-...|..|...|-.+-
T Consensus 238 E~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~F------hs-KEivAisCSWCKqayH~Kv 290 (1004)
T KOG0782|consen 238 ESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFF------HS-KEIVAISCSWCKQAYHLKV 290 (1004)
T ss_pred cccchHHhHhhHhhhccccchhhhhhhhheee------cc-ccEEEEEehHHHHHhhcch
Confidence 34677777666655556899999998765443 22 1223467999988887765
No 290
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=25.73 E-value=52 Score=28.10 Aligned_cols=25 Identities=24% Similarity=0.636 Sum_probs=18.0
Q ss_pred ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF 253 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f 253 (343)
-|.|+.|.-.|-. .+-.|+.||-.|
T Consensus 255 GyvCs~Clsi~C~----------------~p~~C~~Cgt~f 279 (279)
T TIGR00627 255 GFVCSVCLSVLCQ----------------YTPICKTCKTAF 279 (279)
T ss_pred eEECCCccCCcCC----------------CCCCCCCCCCCC
Confidence 3889998876642 234799998776
No 291
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=25.47 E-value=1.7e+02 Score=22.47 Aligned_cols=55 Identities=16% Similarity=0.284 Sum_probs=33.8
Q ss_pred CcccccccccccCChHHHHHHHHhccCCCCeecCCCCccC-CCHHHHHHHHHHhccC
Q psy5202 90 KNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHAC-SQSAKLKKHMKIHDED 145 (343)
Q Consensus 90 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f-~~~~~L~~H~~~h~~~ 145 (343)
....|+.|.....-..-. .-.+.+...|+-.|..=+-.| .+...|++|.+..|..
T Consensus 79 ~~L~CPLCRG~V~GWtvv-e~AR~~LN~K~RsC~~e~C~F~GtY~eLrKHar~~HP~ 134 (162)
T PF07800_consen 79 PELACPLCRGEVKGWTVV-EPARRFLNAKKRSCSQESCSFSGTYSELRKHARSEHPS 134 (162)
T ss_pred ccccCccccCceeceEEc-hHHHHHhccCCccCcccccccccCHHHHHHHHHhhCCC
Confidence 457899995443322222 224555566777776633333 4677899999987754
No 292
>PF14369 zf-RING_3: zinc-finger
Probab=25.23 E-value=51 Score=17.99 Aligned_cols=10 Identities=30% Similarity=0.653 Sum_probs=6.4
Q ss_pred ccCCCCCCCC
Q psy5202 245 RCRFCEMPFS 254 (343)
Q Consensus 245 ~C~~C~~~f~ 254 (343)
.|+.|+-.|.
T Consensus 23 ~CP~C~~gFv 32 (35)
T PF14369_consen 23 ACPRCHGGFV 32 (35)
T ss_pred CCcCCCCcEe
Confidence 4777776663
No 293
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=25.08 E-value=43 Score=23.88 Aligned_cols=12 Identities=17% Similarity=0.426 Sum_probs=6.7
Q ss_pred CceecCcCcccc
Q psy5202 212 KPYKCELCSYAC 223 (343)
Q Consensus 212 ~~~~C~~C~~~f 223 (343)
...+|+.|++..
T Consensus 68 v~V~CP~C~K~T 79 (114)
T PF11023_consen 68 VQVECPNCGKQT 79 (114)
T ss_pred eeeECCCCCChH
Confidence 345666666543
No 294
>TIGR00416 sms DNA repair protein RadA. The gene protuct codes for a probable ATP-dependent protease involved in both DNA repair and degradation of proteins, peptides, glycopeptides. Also known as sms. Residues 11-28 of the SEED alignment contain a putative Zn binding domain. Residues 110-117 of the seed contain a putative ATP binding site both documented in Haemophilus and in Listeria monocytogenes. for E.coli see ( J. BACTERIOL. 178:5045-5048(1996)).
Probab=24.71 E-value=42 Score=31.01 Aligned_cols=25 Identities=24% Similarity=0.463 Sum_probs=17.7
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA 222 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 222 (343)
..|.|..||..+..+ .++|+.|+.+
T Consensus 6 ~~y~C~~Cg~~~~~~--------------~g~Cp~C~~w 30 (454)
T TIGR00416 6 SKFVCQHCGADSPKW--------------QGKCPACHAW 30 (454)
T ss_pred CeEECCcCCCCCccc--------------cEECcCCCCc
Confidence 468899998877553 2578888754
No 295
>KOG3002|consensus
Probab=24.64 E-value=86 Score=27.14 Aligned_cols=79 Identities=15% Similarity=0.249 Sum_probs=53.9
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCc----CcccccChHHHHHHHHHhcCCCCcccccCC----CCCCCCC
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCEL----CSYACAQSSKLTRHMKTHGRLGKDVYRCRF----CEMPFSV 255 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~----C~~~f~~ 255 (343)
....|+.|...+.....+ +|..-.....+.|+. |.+.|..... ..|.+... -++|.|+. |.+. ..
T Consensus 79 ~~~~CP~Cr~~~g~~R~~--amEkV~e~~~vpC~~~~~GC~~~~~Y~~~-~~HE~~C~---f~~~~CP~p~~~C~~~-G~ 151 (299)
T KOG3002|consen 79 VSNKCPTCRLPIGNIRCR--AMEKVAEAVLVPCKNAKLGCTKSFPYGEK-SKHEKVCE---FRPCSCPVPGAECKYT-GS 151 (299)
T ss_pred hcccCCccccccccHHHH--HHHHHHHhceecccccccCCceeeccccc-cccccccc---cCCcCCCCCcccCCcc-Cc
Confidence 567899999999866444 444334445677864 9999988777 67777663 36777765 5544 45
Q ss_pred hHHHHHHHhhhcCc
Q psy5202 256 PSTLEKHMRKCVVN 269 (343)
Q Consensus 256 ~~~l~~H~~~~~~~ 269 (343)
...|..|.+.-|..
T Consensus 152 ~~~l~~H~~~~hk~ 165 (299)
T KOG3002|consen 152 YKDLYAHLNDTHKS 165 (299)
T ss_pred HHHHHHHHHhhChh
Confidence 66788898866544
No 296
>PRK06260 threonine synthase; Validated
Probab=24.38 E-value=51 Score=29.87 Aligned_cols=10 Identities=40% Similarity=0.849 Sum_probs=5.6
Q ss_pred cccccCcccc
Q psy5202 186 DTCEFCGKVF 195 (343)
Q Consensus 186 ~~C~~C~~~f 195 (343)
++|..||+.|
T Consensus 4 ~~C~~cg~~~ 13 (397)
T PRK06260 4 LKCIECGKEY 13 (397)
T ss_pred EEECCCCCCC
Confidence 4555555555
No 297
>PF09332 Mcm10: Mcm10 replication factor; InterPro: IPR015411 Mcm10 is a eukaryotic DNA replication factor that regulates the stability and chromatin association of DNA polymerase alpha []. ; PDB: 2KWQ_A.
Probab=24.36 E-value=18 Score=31.72 Aligned_cols=12 Identities=25% Similarity=0.606 Sum_probs=4.5
Q ss_pred ceecCcCccccc
Q psy5202 213 PYKCELCSYACA 224 (343)
Q Consensus 213 ~~~C~~C~~~f~ 224 (343)
.|+|..|++.+.
T Consensus 285 FFkC~~C~~Rt~ 296 (344)
T PF09332_consen 285 FFKCKDCGNRTI 296 (344)
T ss_dssp EEE-T-TS-EEE
T ss_pred eEECCCCCCeee
Confidence 355555555443
No 298
>PRK11823 DNA repair protein RadA; Provisional
Probab=24.26 E-value=43 Score=30.89 Aligned_cols=25 Identities=20% Similarity=0.386 Sum_probs=17.8
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA 222 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 222 (343)
..|.|..||..+..+ .++|+.|+.+
T Consensus 6 ~~y~C~~Cg~~~~~~--------------~g~Cp~C~~w 30 (446)
T PRK11823 6 TAYVCQECGAESPKW--------------LGRCPECGAW 30 (446)
T ss_pred CeEECCcCCCCCccc--------------CeeCcCCCCc
Confidence 568899998877554 2578888754
No 299
>PF02748 PyrI_C: Aspartate carbamoyltransferase regulatory chain, metal binding domain; InterPro: IPR020542 Aspartate carbamoyltransferase (aspartate transcarbamylase, ATCase) 2.1.3.2 from EC is an allosteric enzyme that plays a central role in the regulation of the pyrimidine pathway in bacteria. The holoenzyme is a dodecamer composed of six catalytic chains, each with an active site, and six regulatory chains lacking catalytic activity []. The catalytic subunits exist as a dimer of catalytic trimers, (c3)2, while the regulatory subunits exist as a trimer of regulatory dimers, (r2)3, therefore the complete holoenzyme can be represented as (c3)2(r2)3. The association of the catalytic subunits c3 with the regulatory subunits r2 is responsible for the establishment of positive co-operativity between catalytic sites for the binding of aspartate and it dictates the pattern of allosteric response toward nucleotide effectors. ATCase from Escherichia coli is the most extensively studied allosteric enzyme []. The crystal structure of the T-state, the T-state with CTP bound, the R-state with N-phosphonacetyl-L-aspartate (PALA) bound, and the R-state with phosphonoacetamide plus malonate bound have been used in interpreting kinetic and mutational studies. A high-resolution structure of E. coli ATCase in the presence of PALA (a bisubstrate analog) allows a detailed description of the binding at the active site of the enzyme and allows a detailed model of the tetrahedral intermediate to be constructed. The entire regulatory chain has been traced showing that the N-terminal regions of the regulatory chains R1 and R6 are located in close proximity to each other and to the regulatory site. This portion of the molecule may be involved in the observed asymmetry between the regulatory binding sites as well as in the heterotropic response of the enzyme []. The C-terminal domain of the regulatory chains have a rubredoxin-like zinc-bound fold. ATCase from Enterobacter agglomerans (Erwinia herbicola) (Pantoea agglomerans) differs from the other investigated enterobacterial ATCases by its absence of homotropic co-operativity toward the substrate aspartate and its lack of response to ATP which is an allosteric effector (activator) of this family of enzymes. Nevertheless, the E. herbicola ATCase has the same quaternary structure, two trimers of catalytic chains with three dimers of regulatory chains, (c3)2(r2)3, as other enterobacterial ATCases and shows extensive primary structure conservation []. This entry represents the C-terminal domain.; PDB: 2YWW_B 1SKU_D 1Q95_L 8ATC_B 3AT1_D 1RAI_D 4E2F_D 1NBE_B 6AT1_B 2FZC_D ....
Probab=23.97 E-value=32 Score=20.78 Aligned_cols=19 Identities=16% Similarity=0.553 Sum_probs=12.4
Q ss_pred CCCCCcccccccccccCCh
Q psy5202 86 SQEEKNKECEFCGKKFKYE 104 (343)
Q Consensus 86 ~~~~~~~~C~~C~~~f~~~ 104 (343)
......|+|.+|++.|...
T Consensus 30 ~~~~~~~rC~YCe~~~~~~ 48 (52)
T PF02748_consen 30 DKEPIKLRCHYCERIITED 48 (52)
T ss_dssp ETTTCEEEETTT--EEEHH
T ss_pred eCCCCEEEeeCCCCEeccc
Confidence 3455679999999988643
No 300
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=23.96 E-value=62 Score=19.72 Aligned_cols=31 Identities=19% Similarity=0.313 Sum_probs=15.3
Q ss_pred ccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202 187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY 221 (343)
Q Consensus 187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 221 (343)
.|.-|+......... .+..++....|+.||+
T Consensus 24 ~C~gC~~~l~~~~~~----~i~~~~~i~~Cp~CgR 54 (56)
T PF02591_consen 24 TCSGCHMELPPQELN----EIRKGDEIVFCPNCGR 54 (56)
T ss_pred ccCCCCEEcCHHHHH----HHHcCCCeEECcCCCc
Confidence 566666665443221 1122345556666664
No 301
>KOG4317|consensus
Probab=23.75 E-value=21 Score=30.47 Aligned_cols=21 Identities=24% Similarity=0.450 Sum_probs=12.5
Q ss_pred CCcccccCcccccCHHHHHHH
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVH 204 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H 204 (343)
+.|+|+.|+..|-+..-++.|
T Consensus 18 ~~YtCPRCn~~YCsl~CYr~h 38 (383)
T KOG4317|consen 18 REYTCPRCNLLYCSLKCYRNH 38 (383)
T ss_pred ccccCCCCCccceeeeeecCC
Confidence 447777777776554443333
No 302
>PF13639 zf-RING_2: Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=23.68 E-value=88 Score=17.69 Aligned_cols=15 Identities=33% Similarity=0.379 Sum_probs=7.8
Q ss_pred CCceecCcCcccccCh
Q psy5202 211 EKPYKCELCSYACAQS 226 (343)
Q Consensus 211 ~~~~~C~~C~~~f~~~ 226 (343)
+..+..+ |+..|-..
T Consensus 14 ~~~~~l~-C~H~fh~~ 28 (44)
T PF13639_consen 14 EKVVKLP-CGHVFHRS 28 (44)
T ss_dssp SCEEEET-TSEEEEHH
T ss_pred CeEEEcc-CCCeeCHH
Confidence 3444444 66666543
No 303
>PF03145 Sina: Seven in absentia protein family; InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=23.53 E-value=56 Score=26.19 Aligned_cols=52 Identities=19% Similarity=0.404 Sum_probs=30.8
Q ss_pred cccccc----cccccCChHHHHHHHHhccCCCCeecCC----CCccCCCHHHHHHHHHHhccC
Q psy5202 91 NKECEF----CGKKFKYENTLQAHRRNHTGEKPFKCTV----CDHACSQSAKLKKHMKIHDED 145 (343)
Q Consensus 91 ~~~C~~----C~~~f~~~~~L~~H~~~h~~~~~~~C~~----C~~~f~~~~~L~~H~~~h~~~ 145 (343)
.|.|.. |...|. ......|.... .-+|+.|+. |+..- ....|..|+...|..
T Consensus 14 ~~pC~~~~~GC~~~~~-~~~~~~HE~~C-~~~p~~CP~~~~~C~~~G-~~~~l~~Hl~~~H~~ 73 (198)
T PF03145_consen 14 KFPCKNAKYGCTETFP-YSEKREHEEEC-PFRPCSCPFPGSGCDWQG-SYKELLDHLRDKHSW 73 (198)
T ss_dssp -EE-CCGGGT---EE--GGGHHHHHHT--TTSEEE-SSSSTT---EE-ECCCHHHHHHHHTTT
T ss_pred eecCCCCCCCCccccc-ccChhhHhccC-CCcCCcCCCCCCCccccC-CHHHHHHHHHHHCCC
Confidence 477887 887765 45678888765 568899998 76544 345899999988865
No 304
>KOG1994|consensus
Probab=23.48 E-value=44 Score=27.11 Aligned_cols=22 Identities=23% Similarity=0.526 Sum_probs=18.2
Q ss_pred CceecCcCcccccChHHHHHHH
Q psy5202 212 KPYKCELCSYACAQSSKLTRHM 233 (343)
Q Consensus 212 ~~~~C~~C~~~f~~~~~L~~H~ 233 (343)
.-|-|-.||..|.+...|..|.
T Consensus 238 eh~YC~fCG~~y~~~edl~ehC 259 (268)
T KOG1994|consen 238 EHYYCFFCGIKYKDEEDLYEHC 259 (268)
T ss_pred cceEEEEeccccCCHHHHHHhC
Confidence 3488999999999999998873
No 305
>KOG4727|consensus
Probab=23.32 E-value=61 Score=25.07 Aligned_cols=25 Identities=16% Similarity=0.284 Sum_probs=20.3
Q ss_pred CCCcccccccccccCChHHHHHHHH
Q psy5202 88 EEKNKECEFCGKKFKYENTLQAHRR 112 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~~~~~L~~H~~ 112 (343)
...-|-|.+|+-+|....++..|+.
T Consensus 72 q~~GyyCdVCdcvvKDSinflDHiN 96 (193)
T KOG4727|consen 72 QKGGYYCDVCDCVVKDSINFLDHIN 96 (193)
T ss_pred ccCceeeeecceeehhhHHHHHHhc
Confidence 3455889999999998888888874
No 306
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=23.31 E-value=24 Score=19.18 Aligned_cols=9 Identities=22% Similarity=0.741 Sum_probs=2.7
Q ss_pred cCcCccccc
Q psy5202 216 CELCSYACA 224 (343)
Q Consensus 216 C~~C~~~f~ 224 (343)
|..|++.|.
T Consensus 6 C~eC~~~f~ 14 (34)
T PF01286_consen 6 CDECGKPFM 14 (34)
T ss_dssp -TTT--EES
T ss_pred HhHhCCHHH
Confidence 444444443
No 307
>PRK05978 hypothetical protein; Provisional
Probab=23.30 E-value=32 Score=26.16 Aligned_cols=10 Identities=30% Similarity=0.856 Sum_probs=5.5
Q ss_pred ccCCCCCCCC
Q psy5202 245 RCRFCEMPFS 254 (343)
Q Consensus 245 ~C~~C~~~f~ 254 (343)
.|+.||..|.
T Consensus 54 ~C~~CG~~~~ 63 (148)
T PRK05978 54 HCAACGEDFT 63 (148)
T ss_pred CccccCCccc
Confidence 3666665553
No 308
>COG4338 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.89 E-value=35 Score=20.01 Aligned_cols=16 Identities=31% Similarity=0.679 Sum_probs=11.9
Q ss_pred ccCCCCCCCCChHHHH
Q psy5202 245 RCRFCEMPFSVPSTLE 260 (343)
Q Consensus 245 ~C~~C~~~f~~~~~l~ 260 (343)
.|+.|++-|.......
T Consensus 14 ICpvCqRPFsWRkKW~ 29 (54)
T COG4338 14 ICPVCQRPFSWRKKWA 29 (54)
T ss_pred hhhhhcCchHHHHHHH
Confidence 5999999998765443
No 309
>PF04641 Rtf2: Rtf2 RING-finger
Probab=22.83 E-value=74 Score=26.86 Aligned_cols=51 Identities=20% Similarity=0.400 Sum_probs=31.8
Q ss_pred CCcccccCcccccCHHHHHHHHHHccCCCceecC-cCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE-LCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~-~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~ 256 (343)
..|.|++.++.|... ..|.+- -||-.|.....=.. . .-..|+.||..|...
T Consensus 112 ~~~~CPvt~~~~~~~-------------~~fv~l~~cG~V~s~~alke~--k-------~~~~Cp~c~~~f~~~ 163 (260)
T PF04641_consen 112 GRFICPVTGKEFNGK-------------HKFVYLRPCGCVFSEKALKEL--K-------KSKKCPVCGKPFTEE 163 (260)
T ss_pred ceeECCCCCcccCCc-------------eeEEEEcCCCCEeeHHHHHhh--c-------ccccccccCCccccC
Confidence 679999998888443 234443 48877765432211 1 123499999999754
No 310
>PF14447 Prok-RING_4: Prokaryotic RING finger family 4
Probab=22.55 E-value=48 Score=20.27 Aligned_cols=12 Identities=42% Similarity=1.115 Sum_probs=7.2
Q ss_pred cccCCCCCCCCC
Q psy5202 244 YRCRFCEMPFSV 255 (343)
Q Consensus 244 ~~C~~C~~~f~~ 255 (343)
--|++|++.|..
T Consensus 40 ngCPfC~~~~~~ 51 (55)
T PF14447_consen 40 NGCPFCGTPFEF 51 (55)
T ss_pred cCCCCCCCcccC
Confidence 347777766643
No 311
>PF11781 RRN7: RNA polymerase I-specific transcription initiation factor Rrn7; InterPro: IPR021752 Rrn7 is a transcription binding factor that associates strongly with both Rrn6 and Rrn11 to form a complex which itself binds the TATA-binding protein and is required for transcription by the core domain of the RNA PolI promoter [],[].
Probab=22.53 E-value=41 Score=18.49 Aligned_cols=7 Identities=43% Similarity=1.298 Sum_probs=3.4
Q ss_pred cccCccc
Q psy5202 188 CEFCGKV 194 (343)
Q Consensus 188 C~~C~~~ 194 (343)
|..|+..
T Consensus 11 C~~C~~~ 17 (36)
T PF11781_consen 11 CPVCGSR 17 (36)
T ss_pred CCCCCCe
Confidence 5555544
No 312
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.10 E-value=42 Score=20.98 Aligned_cols=15 Identities=20% Similarity=0.913 Sum_probs=11.6
Q ss_pred CCCcccccccccccC
Q psy5202 88 EEKNKECEFCGKKFK 102 (343)
Q Consensus 88 ~~~~~~C~~C~~~f~ 102 (343)
.++.-.|++|+..|.
T Consensus 45 ~~gev~CPYC~t~y~ 59 (62)
T COG4391 45 DEGEVVCPYCSTRYR 59 (62)
T ss_pred CCCcEecCccccEEE
Confidence 345568999999886
No 313
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=21.97 E-value=80 Score=26.94 Aligned_cols=26 Identities=19% Similarity=0.383 Sum_probs=20.7
Q ss_pred CCCeecCCCCccCCCHHHHHHHHHHh
Q psy5202 117 EKPFKCTVCDHACSQSAKLKKHMKIH 142 (343)
Q Consensus 117 ~~~~~C~~C~~~f~~~~~L~~H~~~h 142 (343)
...|.|+.|-+-|.+...|.+|+...
T Consensus 46 ~~lyiCe~Clky~~~~~~l~~H~~~C 71 (290)
T PLN03238 46 TKLYICEYCLKYMRKKKSLLRHLAKC 71 (290)
T ss_pred CeEEEcCCCcchhCCHHHHHHHHHhC
Confidence 45688888888888888888888743
No 314
>PF14255 Cys_rich_CPXG: Cysteine-rich CPXCG
Probab=21.88 E-value=40 Score=20.40 Aligned_cols=10 Identities=30% Similarity=1.155 Sum_probs=6.6
Q ss_pred ccCCCCCCCC
Q psy5202 245 RCRFCEMPFS 254 (343)
Q Consensus 245 ~C~~C~~~f~ 254 (343)
.|++||..+.
T Consensus 2 ~CPyCge~~~ 11 (52)
T PF14255_consen 2 QCPYCGEPIE 11 (52)
T ss_pred CCCCCCCeeE
Confidence 5777776653
No 315
>KOG3214|consensus
Probab=21.68 E-value=30 Score=23.93 Aligned_cols=16 Identities=31% Similarity=0.671 Sum_probs=10.9
Q ss_pred cccccCCCCCCCCChH
Q psy5202 242 DVYRCRFCEMPFSVPS 257 (343)
Q Consensus 242 ~~~~C~~C~~~f~~~~ 257 (343)
....|.+|+..|....
T Consensus 46 G~~sC~iC~esFqt~i 61 (109)
T KOG3214|consen 46 GKASCRICEESFQTTI 61 (109)
T ss_pred ceeeeeehhhhhccch
Confidence 3456888888776654
No 316
>PTZ00448 hypothetical protein; Provisional
Probab=21.62 E-value=62 Score=28.61 Aligned_cols=23 Identities=13% Similarity=0.427 Sum_probs=20.9
Q ss_pred ceecCcCcccccChHHHHHHHHH
Q psy5202 213 PYKCELCSYACAQSSKLTRHMKT 235 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~~~ 235 (343)
.|.|..|+..|.+....+.|++.
T Consensus 314 ~~tC~~C~v~F~~~~~qR~H~KS 336 (373)
T PTZ00448 314 MLLCRKCNIQLMDHNAFKQHYRS 336 (373)
T ss_pred CccccccccccCCHHHHHHHhhh
Confidence 58899999999999999999886
No 317
>KOG4602|consensus
Probab=21.53 E-value=52 Score=27.24 Aligned_cols=66 Identities=17% Similarity=0.440 Sum_probs=35.2
Q ss_pred CCcccccCcccccCHHHHHHHHHHccC-------CCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202 184 RNDTCEFCGKVFKNCSNLTVHRRSHTG-------EKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP 256 (343)
Q Consensus 184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~-------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~ 256 (343)
++..|.+|.-.-.....+..|...-.. -+.|.|++||.+- +..+.=.+|.+.|...
T Consensus 232 r~l~CvFC~nN~E~~A~y~tH~lkd~dgRVLCPkLR~YVCPiCGATg-----------------DnAHTiKyCPl~~~~~ 294 (318)
T KOG4602|consen 232 RPLCCVFCFNNAEEFARYHTHPLKDKDGRVLCPKLRSYVCPICGATG-----------------DNAHTIKYCPLAFGDD 294 (318)
T ss_pred CceeEEeecCCCchhhheecccccCCCCcEechhHhhhcCccccccC-----------------CcccceecccccCCCC
Confidence 455677776555555555555332111 2457788887432 2233334566666666
Q ss_pred HHHHHHHhhh
Q psy5202 257 STLEKHMRKC 266 (343)
Q Consensus 257 ~~l~~H~~~~ 266 (343)
.....|-..+
T Consensus 295 ~s~~~~~e~~ 304 (318)
T KOG4602|consen 295 TSVYSHTERN 304 (318)
T ss_pred ccccchhhhc
Confidence 6666665443
No 318
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=21.29 E-value=35 Score=29.73 Aligned_cols=30 Identities=30% Similarity=0.661 Sum_probs=24.1
Q ss_pred CCcccccCCCCCCCCChHHHHHHHhhhcCc
Q psy5202 240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVN 269 (343)
Q Consensus 240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 269 (343)
..-.|.|..|-+-|.+...|.+|.++|.-.
T Consensus 155 ~d~vyICefClkY~~s~~~~~rH~~kC~~~ 184 (395)
T COG5027 155 LDIVYICEFCLKYYGSQTSLVRHRKKCSLQ 184 (395)
T ss_pred cceEEEhhhhHHHhcchhHHHHHHhcCcCc
Confidence 345688999999999999999998877544
No 319
>PLN03239 histone acetyltransferase; Provisional
Probab=21.18 E-value=82 Score=27.74 Aligned_cols=21 Identities=33% Similarity=0.395 Sum_probs=9.9
Q ss_pred ceecCcCcccccChHHHHHHH
Q psy5202 213 PYKCELCSYACAQSSKLTRHM 233 (343)
Q Consensus 213 ~~~C~~C~~~f~~~~~L~~H~ 233 (343)
-|.|..|-+-|.+...|.+|+
T Consensus 106 lYiCE~Clky~~~~~~l~~H~ 126 (351)
T PLN03239 106 LYVCEFSFGFFARKSELLRFQ 126 (351)
T ss_pred EEEeccchhhhcCHHHHHHHH
Confidence 344444444444444444443
No 320
>cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules.
Probab=20.96 E-value=55 Score=29.38 Aligned_cols=8 Identities=38% Similarity=0.933 Sum_probs=3.9
Q ss_pred CcccccCc
Q psy5202 185 NDTCEFCG 192 (343)
Q Consensus 185 ~~~C~~C~ 192 (343)
..+|+.|+
T Consensus 14 ~g~cp~c~ 21 (372)
T cd01121 14 LGKCPECG 21 (372)
T ss_pred cEECcCCC
Confidence 34455554
No 321
>PRK12860 transcriptional activator FlhC; Provisional
Probab=20.74 E-value=67 Score=25.55 Aligned_cols=48 Identities=10% Similarity=0.135 Sum_probs=30.2
Q ss_pred cccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202 192 GKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE 250 (343)
Q Consensus 192 ~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~ 250 (343)
--.|.....|.+.+... .-..-.|..||-.|..... .....|.|+.|.
T Consensus 114 ~Ls~tRAw~LvRf~~s~-~L~l~~C~~Cgg~fv~~~~----------e~~~~f~CplC~ 161 (189)
T PRK12860 114 VLDLTRAWTLVRFFDAG-MLQLARCCRCGGKFVTHAH----------DLRHNFVCGLCQ 161 (189)
T ss_pred eecHHHHHHHHHHhcCC-CeeeccCCCCCCCeecccc----------ccCCCCcCCCCC
Confidence 34566677777776643 2344678888888764321 133468899886
No 322
>PF09082 DUF1922: Domain of unknown function (DUF1922); InterPro: IPR015166 Members of this family consist of a beta-sheet region followed by an alpha-helix and an unstructured C terminus. The beta-sheet region contains a CXCX...XCXC sequence with Cys residues located in two proximal loops and pointing towards each other. This precise function of this set of bacterial proteins is, as yet, unknown []. ; PDB: 1GH9_A.
Probab=20.64 E-value=23 Score=22.75 Aligned_cols=13 Identities=23% Similarity=0.777 Sum_probs=7.0
Q ss_pred CCcccccCCCCCCC
Q psy5202 240 GKDVYRCRFCEMPF 253 (343)
Q Consensus 240 ~~~~~~C~~C~~~f 253 (343)
+.+.-+| .||+..
T Consensus 17 ~~kTkkC-~CG~~l 29 (68)
T PF09082_consen 17 GAKTKKC-VCGKTL 29 (68)
T ss_dssp T-SEEEE-TTTEEE
T ss_pred CcceeEe-cCCCee
Confidence 3445567 777653
No 323
>PF14787 zf-CCHC_5: GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=20.28 E-value=54 Score=18.02 Aligned_cols=14 Identities=29% Similarity=0.465 Sum_probs=8.5
Q ss_pred ccCCCCCCCCChHH
Q psy5202 245 RCRFCEMPFSVPST 258 (343)
Q Consensus 245 ~C~~C~~~f~~~~~ 258 (343)
.|+.|++.|-....
T Consensus 4 ~CprC~kg~Hwa~~ 17 (36)
T PF14787_consen 4 LCPRCGKGFHWASE 17 (36)
T ss_dssp C-TTTSSSCS-TTT
T ss_pred cCcccCCCcchhhh
Confidence 58888888866544
No 324
>KOG1729|consensus
Probab=20.17 E-value=39 Score=28.93 Aligned_cols=7 Identities=43% Similarity=1.180 Sum_probs=4.1
Q ss_pred ccccCcc
Q psy5202 187 TCEFCGK 193 (343)
Q Consensus 187 ~C~~C~~ 193 (343)
.|.+|++
T Consensus 170 ~C~~C~~ 176 (288)
T KOG1729|consen 170 ECMVCGC 176 (288)
T ss_pred ecccCCC
Confidence 4666665
Done!