Query psy8050
Match_columns 311
No_of_seqs 388 out of 3471
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 17:08:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8050.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8050hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 100.0 1.3E-28 2.7E-33 204.0 7.1 138 110-249 125-265 (279)
2 KOG2462|consensus 99.9 4E-27 8.7E-32 195.1 4.6 134 141-283 128-264 (279)
3 KOG3608|consensus 99.9 1.7E-25 3.8E-30 190.0 2.9 265 33-310 66-369 (467)
4 KOG3608|consensus 99.9 7.3E-23 1.6E-27 174.1 5.2 235 35-287 133-377 (467)
5 KOG1074|consensus 99.9 1.9E-23 4.1E-28 194.9 1.8 79 173-251 607-692 (958)
6 KOG1074|consensus 99.8 1.2E-21 2.6E-26 182.9 4.1 106 200-308 606-724 (958)
7 KOG3623|consensus 99.8 3.2E-21 7E-26 177.3 0.4 117 35-164 209-330 (1007)
8 KOG3623|consensus 99.7 2.4E-19 5.2E-24 165.2 0.8 108 80-193 211-331 (1007)
9 KOG3576|consensus 99.7 2.3E-17 5E-22 130.6 3.0 114 141-254 115-239 (267)
10 KOG3576|consensus 99.6 2.6E-16 5.6E-21 124.7 5.5 118 110-227 112-240 (267)
11 PLN03086 PRLI-interacting fact 99.1 8.4E-11 1.8E-15 109.7 7.7 143 114-282 406-560 (567)
12 PLN03086 PRLI-interacting fact 99.1 9.9E-11 2.1E-15 109.3 6.6 118 115-249 433-562 (567)
13 PHA00733 hypothetical protein 99.0 2.4E-10 5.2E-15 87.5 4.2 55 196-252 70-124 (128)
14 PHA00733 hypothetical protein 98.9 7.5E-10 1.6E-14 84.8 3.6 83 140-224 37-124 (128)
15 PHA02768 hypothetical protein; 98.7 4E-09 8.6E-14 66.8 1.9 42 200-243 6-47 (55)
16 KOG3993|consensus 98.7 9.5E-10 2.1E-14 96.8 -1.5 80 144-223 268-380 (500)
17 KOG3993|consensus 98.7 1.4E-09 3E-14 95.7 -1.2 164 114-284 266-480 (500)
18 PHA02768 hypothetical protein; 98.6 2.4E-08 5.2E-13 63.2 2.6 42 144-187 6-47 (55)
19 PF13465 zf-H2C2_2: Zinc-finge 98.5 8.5E-08 1.8E-12 51.8 2.5 26 214-239 1-26 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.3 4.2E-07 9.2E-12 49.1 2.7 24 131-154 2-25 (26)
21 PHA00616 hypothetical protein 98.1 1E-06 2.3E-11 53.1 1.5 32 200-231 2-33 (44)
22 PHA00732 hypothetical protein 98.1 2.9E-06 6.4E-11 58.9 2.7 46 199-250 1-47 (79)
23 PHA00616 hypothetical protein 98.0 2.1E-06 4.6E-11 51.8 1.6 33 143-175 1-33 (44)
24 PHA00732 hypothetical protein 98.0 5.5E-06 1.2E-10 57.6 2.8 48 227-287 1-49 (79)
25 PF05605 zf-Di19: Drought indu 97.8 4.3E-05 9.2E-10 49.2 4.4 47 200-249 3-51 (54)
26 PF00096 zf-C2H2: Zinc finger, 97.7 2.7E-05 5.9E-10 40.6 2.4 23 228-250 1-23 (23)
27 PF00096 zf-C2H2: Zinc finger, 97.5 7.4E-05 1.6E-09 38.9 2.3 22 144-165 1-22 (23)
28 PF05605 zf-Di19: Drought indu 97.5 0.00019 4.1E-09 46.2 4.5 49 116-167 3-53 (54)
29 PF13894 zf-C2H2_4: C2H2-type 97.5 0.00013 2.7E-09 38.2 2.6 23 228-250 1-23 (24)
30 PF13912 zf-C2H2_6: C2H2-type 97.4 9.7E-05 2.1E-09 40.1 1.5 25 227-251 1-25 (27)
31 PF12756 zf-C2H2_2: C2H2 type 97.4 0.00015 3.3E-09 52.9 3.0 24 227-250 50-73 (100)
32 PF13894 zf-C2H2_4: C2H2-type 97.2 0.00031 6.8E-09 36.6 2.5 23 144-166 1-23 (24)
33 PF12756 zf-C2H2_2: C2H2 type 97.2 0.00018 4E-09 52.5 2.0 72 146-222 2-73 (100)
34 PF13912 zf-C2H2_6: C2H2-type 97.0 0.0004 8.6E-09 37.6 1.6 23 144-166 2-24 (27)
35 COG5189 SFP1 Putative transcri 96.8 0.00037 8.1E-09 59.9 0.6 20 264-283 400-419 (423)
36 smart00355 ZnF_C2H2 zinc finge 96.6 0.0021 4.5E-08 33.9 2.5 23 228-250 1-23 (26)
37 COG5189 SFP1 Putative transcri 96.6 0.00075 1.6E-08 58.1 0.8 51 197-247 347-418 (423)
38 KOG2231|consensus 96.4 0.0052 1.1E-07 59.0 5.4 100 146-252 118-237 (669)
39 KOG2231|consensus 96.4 0.0078 1.7E-07 57.9 6.4 122 118-250 118-262 (669)
40 smart00355 ZnF_C2H2 zinc finge 96.1 0.0055 1.2E-07 32.2 2.3 24 200-223 1-24 (26)
41 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0055 1.2E-07 32.0 1.9 23 228-251 1-23 (24)
42 PF12874 zf-met: Zinc-finger o 96.0 0.0041 8.8E-08 32.9 1.4 21 228-248 1-21 (25)
43 COG5236 Uncharacterized conser 95.9 0.011 2.4E-07 51.7 4.5 129 116-254 152-309 (493)
44 PF09237 GAGA: GAGA factor; I 95.9 0.0069 1.5E-07 37.5 2.3 27 226-252 23-49 (54)
45 PRK04860 hypothetical protein; 95.7 0.005 1.1E-07 49.0 1.4 39 198-240 118-156 (160)
46 PF09237 GAGA: GAGA factor; I 95.7 0.014 3.1E-07 36.1 3.1 29 142-170 23-51 (54)
47 COG5048 FOG: Zn-finger [Genera 95.5 0.0075 1.6E-07 55.9 2.1 144 143-291 289-447 (467)
48 PF12874 zf-met: Zinc-finger o 95.4 0.0086 1.9E-07 31.5 1.3 23 200-222 1-23 (25)
49 PF13909 zf-H2C2_5: C2H2-type 95.4 0.017 3.6E-07 30.1 2.3 22 144-166 1-22 (24)
50 KOG1146|consensus 95.0 0.013 2.7E-07 59.9 1.9 51 115-165 436-487 (1406)
51 PRK04860 hypothetical protein; 94.9 0.023 4.9E-07 45.3 2.8 34 172-209 120-153 (160)
52 KOG2482|consensus 94.9 0.061 1.3E-06 47.1 5.5 57 228-284 280-356 (423)
53 COG5048 FOG: Zn-finger [Genera 94.5 0.0093 2E-07 55.3 -0.4 142 114-255 288-446 (467)
54 PF12171 zf-C2H2_jaz: Zinc-fin 94.4 0.018 3.9E-07 31.0 0.8 20 200-219 2-21 (27)
55 KOG1146|consensus 94.2 0.028 6E-07 57.6 2.2 59 226-284 517-611 (1406)
56 PF12171 zf-C2H2_jaz: Zinc-fin 93.9 0.026 5.7E-07 30.4 0.9 22 172-193 2-23 (27)
57 KOG2785|consensus 93.1 0.4 8.6E-06 42.9 7.2 23 142-164 67-89 (390)
58 TIGR00622 ssl1 transcription f 93.0 0.17 3.8E-06 37.4 4.2 48 201-250 57-104 (112)
59 KOG4173|consensus 92.7 0.037 8.1E-07 44.9 0.3 46 144-192 80-127 (253)
60 PF13913 zf-C2HC_2: zinc-finge 92.2 0.14 2.9E-06 27.1 2.0 20 228-248 3-22 (25)
61 PF13913 zf-C2HC_2: zinc-finge 92.0 0.14 3E-06 27.1 1.9 19 145-164 4-22 (25)
62 COG5236 Uncharacterized conser 92.0 0.24 5.2E-06 43.6 4.4 133 143-284 151-303 (493)
63 smart00451 ZnF_U1 U1-like zinc 90.7 0.23 5E-06 28.3 2.1 22 227-248 3-24 (35)
64 KOG2482|consensus 89.9 0.37 8E-06 42.4 3.6 51 172-222 280-357 (423)
65 KOG2893|consensus 89.2 0.1 2.2E-06 43.4 -0.3 49 202-254 13-62 (341)
66 smart00451 ZnF_U1 U1-like zinc 88.8 0.37 8E-06 27.4 2.0 22 143-164 3-24 (35)
67 TIGR00622 ssl1 transcription f 88.7 1.1 2.4E-05 33.2 4.9 85 142-234 14-110 (112)
68 PF12013 DUF3505: Protein of u 88.2 1.2 2.7E-05 32.9 5.0 25 172-196 81-109 (109)
69 cd00350 rubredoxin_like Rubred 88.2 0.39 8.4E-06 27.2 1.8 8 173-180 3-10 (33)
70 KOG2893|consensus 86.7 0.27 5.9E-06 41.0 0.8 41 118-162 13-53 (341)
71 PF06524 NOA36: NOA36 protein; 86.2 0.55 1.2E-05 39.8 2.3 26 225-250 207-232 (314)
72 PF12013 DUF3505: Protein of u 85.0 1.1 2.5E-05 33.1 3.4 24 142-166 10-33 (109)
73 PF09986 DUF2225: Uncharacteri 83.3 0.2 4.4E-06 42.1 -1.5 48 226-275 4-61 (214)
74 PF09538 FYDLN_acid: Protein o 82.7 1 2.2E-05 33.3 2.1 34 112-156 6-39 (108)
75 KOG2186|consensus 82.4 0.83 1.8E-05 38.6 1.8 49 115-166 3-51 (276)
76 COG4049 Uncharacterized protei 80.2 0.97 2.1E-05 28.6 1.1 29 139-167 13-41 (65)
77 KOG4173|consensus 79.8 0.61 1.3E-05 38.1 0.1 77 171-250 79-170 (253)
78 PF13717 zinc_ribbon_4: zinc-r 79.6 2 4.3E-05 24.8 2.2 33 116-153 3-35 (36)
79 cd00729 rubredoxin_SM Rubredox 76.9 2 4.3E-05 24.5 1.7 9 172-180 3-11 (34)
80 PF09538 FYDLN_acid: Protein o 76.1 1.8 3.9E-05 32.0 1.7 14 226-239 25-38 (108)
81 PF09986 DUF2225: Uncharacteri 75.6 1.4 3.1E-05 37.0 1.2 13 200-212 49-61 (214)
82 COG4049 Uncharacterized protei 75.3 1.6 3.4E-05 27.7 1.0 25 224-248 14-38 (65)
83 PF13719 zinc_ribbon_5: zinc-r 75.0 3 6.5E-05 24.2 2.1 33 116-153 3-35 (37)
84 TIGR00373 conserved hypothetic 74.4 3.6 7.9E-05 32.7 3.2 34 139-181 105-138 (158)
85 TIGR02098 MJ0042_CXXC MJ0042 f 74.3 2.7 5.7E-05 24.4 1.8 10 144-153 26-35 (38)
86 COG1592 Rubrerythrin [Energy p 71.6 2.8 6.1E-05 33.5 1.9 10 143-152 134-143 (166)
87 KOG2186|consensus 70.5 2.8 6E-05 35.6 1.7 47 143-192 3-49 (276)
88 COG1592 Rubrerythrin [Energy p 69.9 2.6 5.6E-05 33.7 1.4 10 225-234 147-156 (166)
89 TIGR02300 FYDLN_acid conserved 68.6 3.9 8.4E-05 30.9 2.0 34 112-156 6-39 (129)
90 smart00531 TFIIE Transcription 68.6 9.4 0.0002 29.9 4.3 38 140-181 96-133 (147)
91 PRK06266 transcription initiat 67.0 5.8 0.00013 32.3 2.9 32 141-181 115-146 (178)
92 KOG1701|consensus 66.2 2.7 5.8E-05 38.4 0.9 44 264-307 396-441 (468)
93 TIGR02605 CxxC_CxxC_SSSS putat 65.8 3.7 8.1E-05 25.7 1.3 10 144-153 6-15 (52)
94 COG2888 Predicted Zn-ribbon RN 64.3 8.5 0.00018 24.9 2.6 10 170-179 49-58 (61)
95 TIGR01384 TFS_arch transcripti 64.0 3.2 6.9E-05 30.4 0.8 13 79-91 16-28 (104)
96 KOG1280|consensus 63.8 7.4 0.00016 34.6 3.1 22 143-164 79-100 (381)
97 smart00659 RPOLCX RNA polymera 63.0 6.5 0.00014 23.9 1.9 10 144-153 3-12 (44)
98 smart00834 CxxC_CXXC_SSSS Puta 63.0 4.7 0.0001 23.6 1.3 10 172-181 6-15 (41)
99 PRK00398 rpoP DNA-directed RNA 62.8 5 0.00011 24.5 1.4 10 144-153 4-13 (46)
100 KOG2879|consensus 62.3 11 0.00024 32.5 3.7 28 16-43 219-246 (298)
101 PRK00464 nrdR transcriptional 62.2 2 4.4E-05 33.9 -0.5 16 200-215 29-44 (154)
102 PRK14890 putative Zn-ribbon RN 60.9 14 0.0003 23.9 3.1 10 170-179 47-56 (59)
103 TIGR00373 conserved hypothetic 60.5 8 0.00017 30.8 2.6 32 168-208 106-137 (158)
104 KOG4752|consensus 60.1 8.1 0.00018 19.6 1.6 11 10-20 7-17 (26)
105 COG1198 PriA Primosomal protei 59.5 7 0.00015 39.0 2.5 12 197-208 473-484 (730)
106 TIGR02300 FYDLN_acid conserved 58.2 7.1 0.00015 29.5 1.7 13 226-238 25-37 (129)
107 smart00614 ZnF_BED BED zinc fi 58.1 9.4 0.0002 23.7 2.1 21 228-248 19-44 (50)
108 smart00734 ZnF_Rad18 Rad18-lik 57.9 9.3 0.0002 20.2 1.7 17 146-163 4-20 (26)
109 PF08188 Protamine_3: Spermato 57.4 9.7 0.00021 22.2 1.8 23 2-24 8-30 (48)
110 smart00531 TFIIE Transcription 57.2 14 0.00029 29.0 3.3 39 112-154 96-134 (147)
111 KOG2785|consensus 55.9 14 0.0003 33.5 3.4 49 116-164 167-241 (390)
112 PRK06266 transcription initiat 54.1 10 0.00023 30.8 2.3 30 170-208 116-145 (178)
113 PF09723 Zn-ribbon_8: Zinc rib 53.9 7.9 0.00017 23.1 1.2 11 172-182 6-16 (42)
114 PF02892 zf-BED: BED zinc fing 53.6 16 0.00035 21.8 2.6 22 226-247 15-40 (45)
115 COG5151 SSL1 RNA polymerase II 53.6 6.1 0.00013 34.6 0.9 23 228-250 389-411 (421)
116 COG1198 PriA Primosomal protei 53.5 10 0.00022 38.0 2.4 40 172-236 445-484 (730)
117 PF10571 UPF0547: Uncharacteri 52.9 7.8 0.00017 20.6 0.9 9 296-304 17-25 (26)
118 KOG2807|consensus 52.5 26 0.00056 31.1 4.5 25 226-250 344-368 (378)
119 PF05129 Elf1: Transcription e 52.4 6.2 0.00014 27.5 0.6 10 35-44 21-30 (81)
120 KOG4377|consensus 52.0 11 0.00025 34.3 2.3 110 142-254 270-430 (480)
121 KOG4602|consensus 51.6 14 0.00031 31.4 2.7 10 143-152 268-277 (318)
122 PF05290 Baculo_IE-1: Baculovi 51.5 8.1 0.00018 29.4 1.1 17 168-184 77-93 (140)
123 PF02176 zf-TRAF: TRAF-type zi 51.3 11 0.00023 24.2 1.6 33 143-176 9-43 (60)
124 PF05443 ROS_MUCR: ROS/MUCR tr 49.4 10 0.00022 29.1 1.4 26 113-141 70-95 (132)
125 PF14446 Prok-RING_1: Prokaryo 49.2 9.3 0.0002 24.3 1.0 12 264-275 7-18 (54)
126 PRK04023 DNA polymerase II lar 48.7 18 0.00039 37.1 3.4 9 144-152 627-635 (1121)
127 PF08274 PhnA_Zn_Ribbon: PhnA 48.6 8.6 0.00019 21.2 0.7 10 142-151 18-27 (30)
128 CHL00152 rpl32 ribosomal prote 48.5 10 0.00022 24.0 1.0 22 2-23 4-25 (53)
129 PF05191 ADK_lid: Adenylate ki 47.8 9.1 0.0002 22.1 0.7 11 264-274 3-13 (36)
130 COG1655 Uncharacterized protei 47.8 3.6 7.7E-05 34.4 -1.3 46 226-271 18-71 (267)
131 KOG2807|consensus 47.3 21 0.00045 31.6 3.1 23 264-286 347-369 (378)
132 cd00730 rubredoxin Rubredoxin; 47.2 25 0.00055 22.0 2.7 9 37-45 2-10 (50)
133 COG1996 RPC10 DNA-directed RNA 47.1 14 0.00031 22.9 1.6 10 144-153 7-16 (49)
134 PF15269 zf-C2H2_7: Zinc-finge 47.0 24 0.00052 21.3 2.4 22 115-136 20-41 (54)
135 PF03604 DNA_RNApol_7kD: DNA d 44.9 20 0.00043 20.1 1.8 7 144-150 18-24 (32)
136 PRK14873 primosome assembly pr 44.8 16 0.00035 36.2 2.4 8 145-152 385-392 (665)
137 COG1997 RPL43A Ribosomal prote 44.2 20 0.00043 25.2 2.1 14 114-127 52-65 (89)
138 COG4530 Uncharacterized protei 44.0 18 0.00039 26.5 1.9 32 112-154 6-37 (129)
139 PF00301 Rubredoxin: Rubredoxi 43.2 23 0.0005 21.8 2.1 13 172-184 2-14 (47)
140 PRK09678 DNA-binding transcrip 41.4 7.3 0.00016 26.5 -0.4 17 170-186 26-44 (72)
141 PF01783 Ribosomal_L32p: Ribos 41.4 9 0.00019 24.6 0.0 31 2-43 3-33 (56)
142 KOG2593|consensus 39.9 36 0.00079 31.5 3.6 39 139-180 124-162 (436)
143 PRK04023 DNA polymerase II lar 37.8 30 0.00066 35.6 3.0 9 144-152 639-647 (1121)
144 TIGR00595 priA primosomal prot 37.7 25 0.00055 33.7 2.5 10 198-207 252-261 (505)
145 PF05443 ROS_MUCR: ROS/MUCR tr 37.6 23 0.00049 27.3 1.7 25 227-254 72-96 (132)
146 COG5151 SSL1 RNA polymerase II 37.4 46 0.00099 29.4 3.7 79 142-222 321-411 (421)
147 KOG2071|consensus 36.8 31 0.00067 33.3 2.8 27 141-167 416-442 (579)
148 PF13878 zf-C2H2_3: zinc-finge 36.6 38 0.00082 20.1 2.2 24 228-251 14-39 (41)
149 KOG4167|consensus 33.9 11 0.00025 36.9 -0.5 27 141-167 790-816 (907)
150 PRK03824 hypA hydrogenase nick 32.5 29 0.00062 26.8 1.6 11 144-154 71-81 (135)
151 COG3677 Transposase and inacti 31.1 64 0.0014 24.7 3.3 17 291-307 51-67 (129)
152 COG4957 Predicted transcriptio 30.5 56 0.0012 25.1 2.7 25 113-140 74-98 (148)
153 KOG4377|consensus 30.2 37 0.00081 31.1 2.1 100 119-223 277-427 (480)
154 KOG0782|consensus 29.6 13 0.00029 35.4 -0.8 53 212-276 238-290 (1004)
155 PHA00626 hypothetical protein 29.3 28 0.00061 22.2 0.9 15 226-240 22-36 (59)
156 PF14353 CpXC: CpXC protein 29.2 16 0.00034 27.8 -0.3 17 143-159 38-54 (128)
157 KOG0847|consensus 29.2 22 0.00048 29.5 0.5 20 13-32 217-236 (288)
158 PF07754 DUF1610: Domain of un 29.0 26 0.00056 18.2 0.5 10 34-43 14-23 (24)
159 PF04959 ARS2: Arsenite-resist 29.0 29 0.00064 29.1 1.2 25 199-223 77-101 (214)
160 PF09845 DUF2072: Zn-ribbon co 28.9 29 0.00063 26.5 1.0 14 144-157 2-15 (131)
161 PF14122 YokU: YokU-like prote 28.8 52 0.0011 23.1 2.2 21 111-131 31-51 (87)
162 PF05495 zf-CHY: CHY zinc fing 28.6 20 0.00044 24.2 0.2 9 145-153 12-20 (71)
163 PF15135 UPF0515: Uncharacteri 28.2 53 0.0011 28.0 2.5 12 142-153 111-122 (278)
164 COG3357 Predicted transcriptio 27.8 37 0.0008 24.0 1.3 13 171-183 58-70 (97)
165 KOG2593|consensus 27.6 39 0.00084 31.3 1.8 36 114-152 127-162 (436)
166 KOG1280|consensus 27.3 52 0.0011 29.5 2.4 36 199-234 79-116 (381)
167 PTZ00255 60S ribosomal protein 27.2 29 0.00064 24.6 0.8 10 200-209 55-64 (90)
168 PF04959 ARS2: Arsenite-resist 27.1 23 0.00051 29.7 0.3 30 169-198 75-104 (214)
169 KOG2906|consensus 27.1 12 0.00026 26.8 -1.2 13 78-90 20-32 (105)
170 PF01363 FYVE: FYVE zinc finge 26.9 35 0.00076 22.5 1.1 9 264-272 11-19 (69)
171 COG1779 C4-type Zn-finger prot 26.7 20 0.00043 29.4 -0.2 11 295-305 45-55 (201)
172 PF01780 Ribosomal_L37ae: Ribo 26.5 25 0.00054 25.0 0.3 10 200-209 54-63 (90)
173 TIGR01206 lysW lysine biosynth 26.4 41 0.00089 21.4 1.2 9 264-272 4-12 (54)
174 KOG4167|consensus 25.4 21 0.00046 35.2 -0.3 31 110-140 787-817 (907)
175 PF01096 TFIIS_C: Transcriptio 25.2 15 0.00033 21.5 -0.8 6 174-179 3-8 (39)
176 PF13453 zf-TFIIB: Transcripti 25.0 45 0.00097 19.6 1.2 17 144-160 20-36 (41)
177 PF10058 DUF2296: Predicted in 24.6 77 0.0017 20.1 2.3 9 143-151 44-52 (54)
178 TIGR00280 L37a ribosomal prote 24.5 31 0.00068 24.5 0.5 11 200-210 54-64 (91)
179 COG0333 RpmF Ribosomal protein 23.8 45 0.00097 21.5 1.1 30 3-43 5-34 (57)
180 TIGR00595 priA primosomal prot 23.1 53 0.0012 31.5 1.9 30 192-236 233-262 (505)
181 PRK00432 30S ribosomal protein 22.9 65 0.0014 20.1 1.6 11 143-153 37-47 (50)
182 PRK14714 DNA polymerase II lar 22.8 1E+02 0.0022 32.9 3.9 8 144-151 668-675 (1337)
183 TIGR01562 FdhE formate dehydro 22.6 58 0.0013 29.0 1.9 12 168-179 207-218 (305)
184 PRK12286 rpmF 50S ribosomal pr 22.4 25 0.00054 22.6 -0.3 9 35-43 26-34 (57)
185 TIGR00686 phnA alkylphosphonat 22.1 54 0.0012 24.1 1.3 15 142-156 18-32 (109)
186 PRK12380 hydrogenase nickel in 22.0 56 0.0012 24.3 1.5 11 144-154 71-81 (113)
187 KOG0782|consensus 22.0 30 0.00065 33.1 0.0 53 240-307 238-290 (1004)
188 PRK03564 formate dehydrogenase 21.8 30 0.00066 30.8 0.0 6 201-206 228-233 (309)
189 PF12647 RNHCP: RNHCP domain; 21.8 3.1E+02 0.0067 19.6 5.0 9 35-43 3-11 (92)
190 KOG3408|consensus 21.7 49 0.0011 24.8 1.1 26 196-221 54-79 (129)
191 TIGR00100 hypA hydrogenase nic 21.5 57 0.0012 24.3 1.4 11 144-154 71-81 (115)
192 PF10013 DUF2256: Uncharacteri 21.5 40 0.00087 20.2 0.5 16 116-131 9-24 (42)
193 COG2331 Uncharacterized protei 21.4 30 0.00064 23.6 -0.1 10 228-237 13-22 (82)
194 KOG3014|consensus 21.4 42 0.00091 28.7 0.8 31 35-68 36-66 (257)
195 TIGR03829 YokU_near_AblA uncha 21.1 54 0.0012 23.3 1.1 16 115-130 35-50 (89)
196 PF09788 Tmemb_55A: Transmembr 21.0 3.7E+02 0.008 23.2 6.2 41 36-81 65-107 (256)
197 PF07295 DUF1451: Protein of u 20.9 65 0.0014 25.2 1.7 11 227-237 112-122 (146)
198 PF07649 C1_3: C1-like domain; 20.6 56 0.0012 17.6 0.9 8 143-150 15-22 (30)
199 PF13451 zf-trcl: Probable zin 20.5 90 0.002 19.4 1.9 18 226-243 3-20 (49)
200 PRK05978 hypothetical protein; 20.0 58 0.0013 25.6 1.2 9 265-273 55-63 (148)
No 1
>KOG2462|consensus
Probab=99.95 E-value=1.3e-28 Score=204.03 Aligned_cols=138 Identities=36% Similarity=0.654 Sum_probs=126.5
Q ss_pred CCCcceeeccCCCCcCCChhHHHHHHhhhCC---CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcch
Q psy8050 110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSG---EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSS 186 (311)
Q Consensus 110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~ 186 (311)
......++|..||+.|.+.++|.+|.+.|-. .+.+.|++|+++|.+...|..|+++|+ .+++|.+||+.|...+.
T Consensus 125 ~~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWL 202 (279)
T KOG2462|consen 125 AAKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWL 202 (279)
T ss_pred cccCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHH
Confidence 3455678999999999999999999998864 467999999999999999999999996 57899999999999999
Q ss_pred hhhhhhhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHH
Q psy8050 187 VTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRV 249 (311)
Q Consensus 187 l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 249 (311)
|+-|+++|+||+||.|+.|+++|...++|+.|+++|.+.+.|+|..|++.|...+.|..|...
T Consensus 203 LQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 203 LQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred hhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 999999999999999999999999999999999999999999999999999999999999764
No 2
>KOG2462|consensus
Probab=99.93 E-value=4e-27 Score=195.13 Aligned_cols=134 Identities=32% Similarity=0.671 Sum_probs=125.5
Q ss_pred CCCeecCcCccccCChhHHHHHHHHhcC---CCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHH
Q psy8050 141 EKPYRCEDCNKSFSQAANLTAHVRTHSG---EKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTK 217 (311)
Q Consensus 141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~ 217 (311)
...|.|..||+.+.+.++|.+|.++|.. .+.+.|.+||+.|.+...|..|+++|+ -+++|.+||+.|.....|+.
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQG 205 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQG 205 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhc
Confidence 3459999999999999999999999864 467999999999999999999999997 57999999999999999999
Q ss_pred HhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhce
Q psy8050 218 HLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEA 283 (311)
Q Consensus 218 H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 283 (311)
|+++|+||+||.|+.|++.|..+++|+.||++|.+.+.. +|..|++.|.....|.+|..
T Consensus 206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~-------qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKH-------QCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred ccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccc-------cCcchhhHHHHHHHHHHhhh
Confidence 999999999999999999999999999999999998853 89999999999999999964
No 3
>KOG3608|consensus
Probab=99.91 E-value=1.7e-25 Score=189.96 Aligned_cols=265 Identities=22% Similarity=0.348 Sum_probs=204.3
Q ss_pred hccCCCCCCCCCCCCCCchhhHhhhccccccCC----------------CCCCCC--------CCCCCCCCCCCC--CCC
Q psy8050 33 IQDRSECPGSGMGKPPHDVADVLLSLKHAVVHP----------------GQMTPP--------PYGDPGAPYQVG--APQ 86 (311)
Q Consensus 33 ~~~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~----------------~~~~p~--------~~~~p~~~~~C~--~c~ 86 (311)
-.|.++|.+=+|++...+ +...|.+|...|- .+..+- ..+.-|.-|.|+ .|+
T Consensus 66 h~de~qC~w~~C~f~~~~--~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe 143 (467)
T KOG3608|consen 66 HTDEHQCTWNSCDFRTEN--SSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCE 143 (467)
T ss_pred cccceeEEeccCCccccc--hHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcC
Confidence 445799998887765444 3344455655442 122111 113334456675 489
Q ss_pred CCCCCCCCccccCCCCCCC-----CCCCCCCcceeeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHH
Q psy8050 87 QFPTASLSYTVHPHQSSRG-----RPLNSGIRLILIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANL 159 (311)
Q Consensus 87 ~~~~~~~~~~~h~~~~~~~-----~~~~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L 159 (311)
..|.+...+..|+..|.-. ...+.+.++.+.|.+ |-+.+.++..|+.|++.|++++...|+.||..|.+...|
T Consensus 144 ~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl 223 (467)
T KOG3608|consen 144 REFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKL 223 (467)
T ss_pred CcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHH
Confidence 9999999999887666443 222334445566654 999999999999999999999999999999999999999
Q ss_pred HHHHHHhc--CCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhh-hhcCCCCeecCccccc
Q psy8050 160 TAHVRTHS--GEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLR-IHSGEKPYQCKLCLLR 236 (311)
Q Consensus 160 ~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~-~h~~~~~~~C~~C~~~ 236 (311)
-.|++..+ ...+|.|..|.+.|.+...|..|+..|. .-|+|+.|+......++|..|++ .|...+||+|+.|+..
T Consensus 224 ~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~ 301 (467)
T KOG3608|consen 224 FDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTR 301 (467)
T ss_pred HHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhh
Confidence 99986543 4568999999999999999999999885 45899999999999999999988 4667899999999999
Q ss_pred cCChHHHHHHHHHhCCCCCCCCccccccccc--ccccccccccchhhceeee-ecccccCCCCCCCCccccccCCCC
Q psy8050 237 FSQSGNLNRHMRVHGSQYNMLPVIYMGVNQV--CSNIVDQRRGIYVVEAVWT-VQSTWIDGRSTMCGHSFIVKLPLS 310 (311)
Q Consensus 237 f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~~~~~~~~kC~~C~~~f~~~~~L~ 310 (311)
|.+.++|.+|..+|.... + .|.. |.+.|.+...|+.|++.++ +..+. .|+|-.|++-|++=.+|.
T Consensus 302 c~~esdL~kH~~~HS~~~----y----~C~h~~C~~s~r~~~q~~~H~~evhEg~np~-~Y~CH~Cdr~ft~G~~L~ 369 (467)
T KOG3608|consen 302 CVRESDLAKHVQVHSKTV----Y----QCEHPDCHYSVRTYTQMRRHFLEVHEGNNPI-LYACHCCDRFFTSGKSLS 369 (467)
T ss_pred hccHHHHHHHHHhccccc----e----ecCCCCCcHHHHHHHHHHHHHHHhccCCCCC-ceeeecchhhhccchhHH
Confidence 999999999999887322 1 6877 9999999999999987654 55543 357999999998876663
No 4
>KOG3608|consensus
Probab=99.87 E-value=7.3e-23 Score=174.09 Aligned_cols=235 Identities=21% Similarity=0.365 Sum_probs=189.9
Q ss_pred cCCCCCCCCCCCCCCchhhHhhhccccccCCCCC---CCCCCCCCCCCCCCC--CCCCCCCCCCCccccCCCCCCCCCCC
Q psy8050 35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQM---TPPPYGDPGAPYQVG--APQQFPTASLSYTVHPHQSSRGRPLN 109 (311)
Q Consensus 35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~---~p~~~~~p~~~~~C~--~c~~~~~~~~~~~~h~~~~~~~~~~~ 109 (311)
+-|-|.+-.|+.+|.+...+ ..|+..|..-. ....+++-..-+.|. .|-+.+.+...|..|.+.|++++.
T Consensus 133 ~~f~C~WedCe~~F~s~~ef---~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKv-- 207 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEF---QDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKV-- 207 (467)
T ss_pred hhhccChhhcCCcccCHHHH---HHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeE--
Confidence 55888887777776665544 45555443110 011223332345564 478888999999999999987764
Q ss_pred CCCcceeeccCCCCcCCChhHHHHHHhhhC--CCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchh
Q psy8050 110 SGIRLILIQFIPGKTYARPSTLKTHLRTHS--GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSV 187 (311)
Q Consensus 110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l 187 (311)
+.|+.||..|.+...|-.|++..+ ...+|.|..|.+.|.+...|..|+..|- ..|+|+.|+.+....++|
T Consensus 208 ------vACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL 279 (467)
T KOG3608|consen 208 ------VACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSL 279 (467)
T ss_pred ------EecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHH
Confidence 568999999999999999986544 4568999999999999999999999885 459999999999999999
Q ss_pred hhhhhh-ccccccccccccccccCChHHHHHHhhhhcCCCCeecCc--cccccCChHHHHHHHHHhCCCCCCCCcccccc
Q psy8050 188 TTHMRT-HSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKL--CLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGV 264 (311)
Q Consensus 188 ~~H~~~-h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~ 264 (311)
..|++. |...+||+|+.|++.|.+.+.|.+|...|. +..|.|+. |..+|.+...|.+|++.++...+..++ .
T Consensus 280 ~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y----~ 354 (467)
T KOG3608|consen 280 TTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILY----A 354 (467)
T ss_pred HHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCce----e
Confidence 999985 777899999999999999999999999988 77799988 999999999999999877655554344 7
Q ss_pred cccccccccccccchhhceeeee
Q psy8050 265 NQVCSNIVDQRRGIYVVEAVWTV 287 (311)
Q Consensus 265 C~~C~~~f~~~~~l~~H~~~h~~ 287 (311)
|..|.+.|++-..|-.|++.-++
T Consensus 355 CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 355 CHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred eecchhhhccchhHHHHHHHhhc
Confidence 99999999999999999855333
No 5
>KOG1074|consensus
Probab=99.87 E-value=1.9e-23 Score=194.87 Aligned_cols=79 Identities=35% Similarity=0.869 Sum_probs=60.8
Q ss_pred ecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhhcCCC----CeecC---ccccccCChHHHHH
Q psy8050 173 RCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEK----PYQCK---LCLLRFSQSGNLNR 245 (311)
Q Consensus 173 ~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~----~~~C~---~C~~~f~~~~~L~~ 245 (311)
+|.+|.++..-...|+.|.++|+||+||+|.+||++|+++.+|+.|+.+|...- .+.|+ +|.+.|...-.|..
T Consensus 607 qCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQ 686 (958)
T KOG1074|consen 607 QCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQ 686 (958)
T ss_pred ceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccc
Confidence 677888888888888888888888888888888888888888888887775432 36677 77777777777777
Q ss_pred HHHHhC
Q psy8050 246 HMRVHG 251 (311)
Q Consensus 246 H~~~H~ 251 (311)
|+++|.
T Consensus 687 hIriH~ 692 (958)
T KOG1074|consen 687 HIRIHL 692 (958)
T ss_pred eEEeec
Confidence 777775
No 6
>KOG1074|consensus
Probab=99.83 E-value=1.2e-21 Score=182.90 Aligned_cols=106 Identities=24% Similarity=0.406 Sum_probs=89.5
Q ss_pred ccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccc---cccccccccc
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQ---VCSNIVDQRR 276 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~---~C~~~f~~~~ 276 (311)
-.|.+|.++..-.+.|+.|+++|+||+||+|.+||+.|+++.+|..|+-+|...-.+.- . +.|+ +|-+.|.+..
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~-q--~ScP~~~ic~~kftn~V 682 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARV-Q--FSCPSTFICQKKFTNAV 682 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccc-c--ccCCchhhhcccccccc
Confidence 57999999999999999999999999999999999999999999999999976554431 1 2799 9999999999
Q ss_pred cchhhceeeeeccc-cc---------CCCCCCCCccccccCC
Q psy8050 277 GIYVVEAVWTVQST-WI---------DGRSTMCGHSFIVKLP 308 (311)
Q Consensus 277 ~l~~H~~~h~~~~~-~~---------~~kC~~C~~~f~~~~~ 308 (311)
.|.+|+++|.+... .. ..+|..|.+.|..-.+
T Consensus 683 ~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~ 724 (958)
T KOG1074|consen 683 TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARS 724 (958)
T ss_pred cccceEEeecCCCCCCCcccccccchhcccchhhhccccccc
Confidence 99999999984332 22 2389999988865443
No 7
>KOG3623|consensus
Probab=99.81 E-value=3.2e-21 Score=177.35 Aligned_cols=117 Identities=21% Similarity=0.297 Sum_probs=96.6
Q ss_pred cCCCCCCCCCCCCCCchhhHhhhccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCC-----C
Q psy8050 35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQMTPPPYGDPGAPYQVGAPQQFPTASLSYTVHPHQSSRGRPL-----N 109 (311)
Q Consensus 35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~~p~~~~~p~~~~~C~~c~~~~~~~~~~~~h~~~~~~~~~~-----~ 109 (311)
....|++|. .....+-.+..|+..-+...+ .-|.|..|.+.|.....|..|+..|....+. .
T Consensus 209 qlltcpycd-----rgykrltslkeHikyrhekne--------~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltq 275 (1007)
T KOG3623|consen 209 QLLTCPYCD-----RGYKRLTSLKEHIKYRHEKNE--------PNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQ 275 (1007)
T ss_pred hhhcchhHH-----HHHHHHHHHHHHHHHHHhhCC--------CCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccc
Confidence 347899999 666777777888875443332 3567999999999999999999887655432 3
Q ss_pred CCCcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHH
Q psy8050 110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVR 164 (311)
Q Consensus 110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 164 (311)
.+....|.|.+||++|..+-.|+.|+++|.|++||.|+.|++.|.....+..||.
T Consensus 276 sa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 276 SALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 4556679999999999999999999999999999999999999999999999974
No 8
>KOG3623|consensus
Probab=99.74 E-value=2.4e-19 Score=165.17 Aligned_cols=108 Identities=29% Similarity=0.498 Sum_probs=88.6
Q ss_pred CCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcceeeccCCCCcCCChhHHHHHHhhhCC-------------CCCeec
Q psy8050 80 YQVGAPQQFPTASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKTHLRTHSG-------------EKPYRC 146 (311)
Q Consensus 80 ~~C~~c~~~~~~~~~~~~h~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~-------------~~~~~C 146 (311)
..|+.|++......+|..|+..-.. .....|-|.+|..+|..+..|.+||.+|.. .+-|+|
T Consensus 211 ltcpycdrgykrltslkeHikyrhe------kne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKC 284 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHE------KNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKC 284 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHh------hCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccc
Confidence 3478887777777777777542111 122346699999999999999999998853 244999
Q ss_pred CcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhh
Q psy8050 147 EDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRT 193 (311)
Q Consensus 147 ~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 193 (311)
..||++|..+..|..|+++|.|++||.|+.|++.|....++..||..
T Consensus 285 tECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 285 TECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred cccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 99999999999999999999999999999999999999999999854
No 9
>KOG3576|consensus
Probab=99.67 E-value=2.3e-17 Score=130.61 Aligned_cols=114 Identities=32% Similarity=0.675 Sum_probs=91.8
Q ss_pred CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhh
Q psy8050 141 EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLR 220 (311)
Q Consensus 141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~ 220 (311)
...|.|.+|++.|.....|.+|+.-|...+.+.|..||+.|...-+|.+|+++|+|.+||+|..|+++|++.-.|..|.+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 34578888888888888888888888777778888888888888888888888888888888888888888888888876
Q ss_pred hhcC-----------CCCeecCccccccCChHHHHHHHHHhCCCC
Q psy8050 221 IHSG-----------EKPYQCKLCLLRFSQSGNLNRHMRVHGSQY 254 (311)
Q Consensus 221 ~h~~-----------~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 254 (311)
.-+| .+.|.|+.||.+-.....+..|+..|+...
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 5444 356899999998888888889988887655
No 10
>KOG3576|consensus
Probab=99.63 E-value=2.6e-16 Score=124.72 Aligned_cols=118 Identities=29% Similarity=0.583 Sum_probs=108.0
Q ss_pred CCCcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhh
Q psy8050 110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTT 189 (311)
Q Consensus 110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 189 (311)
+.....+.|.+|++.|.-...|.+|++.|..-+.+.|..||+.|....+|.+|+++|+|.+||+|..|++.|...-+|..
T Consensus 112 ssd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 112 SSDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred CCCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence 44466899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhccc-----------cccccccccccccCChHHHHHHhhhhcCCCC
Q psy8050 190 HMRTHSG-----------ERPYRCRLCKKAFSDSSTLTKHLRIHSGEKP 227 (311)
Q Consensus 190 H~~~h~~-----------~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 227 (311)
|++.-+| .+.|.|..||.+-.....+..|+..|+...|
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 9875444 4679999999999999999999999886543
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.15 E-value=8.4e-11 Score=109.74 Aligned_cols=143 Identities=13% Similarity=0.283 Sum_probs=94.8
Q ss_pred ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCc--CccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhh
Q psy8050 114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHM 191 (311)
Q Consensus 114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 191 (311)
....|.-|..... ...|..|...- ....-.|+. |+..|. ...+..| +.|+.|+..|. ...|..|+
T Consensus 406 ~~V~C~NC~~~i~-l~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 406 DTVECRNCKHYIP-SRSIALHEAYC-SRHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred CeEECCCCCCccc-hhHHHHHHhhC-CCcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHH
Confidence 3456777776544 34455666332 223456764 888773 3334444 57888888875 56788888
Q ss_pred hhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCC----------hHHHHHHHHHhCCCCCCCCccc
Q psy8050 192 RTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQ----------SGNLNRHMRVHGSQYNMLPVIY 261 (311)
Q Consensus 192 ~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~----------~~~L~~H~~~H~~~~~~~~~~~ 261 (311)
.+++ .++.|+ |+..+ ....|..|+.+|.+++++.|..|+..|.. ...|..|..+. +.. ++
T Consensus 473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~r---t~-- 542 (567)
T PLN03086 473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSR---TA-- 542 (567)
T ss_pred HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCc---ce--
Confidence 8875 678888 88654 56788888888888888888888888742 34677777764 333 22
Q ss_pred ccccccccccccccccchhhc
Q psy8050 262 MGVNQVCSNIVDQRRGIYVVE 282 (311)
Q Consensus 262 ~~~C~~C~~~f~~~~~l~~H~ 282 (311)
.|..||+.+..+ .+..|+
T Consensus 543 --~C~~Cgk~Vrlr-dm~~H~ 560 (567)
T PLN03086 543 --PCDSCGRSVMLK-EMDIHQ 560 (567)
T ss_pred --EccccCCeeeeh-hHHHHH
Confidence 688888888776 556664
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.12 E-value=9.9e-11 Score=109.27 Aligned_cols=118 Identities=19% Similarity=0.435 Sum_probs=95.9
Q ss_pred eeeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050 115 ILIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR 192 (311)
Q Consensus 115 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 192 (311)
...|+. ||..|. +..+. ..+.|+.|++.|. ...|..|+.+++ .++.|+ |+..+ ....|..|+.
T Consensus 433 ~V~Cp~~~Cg~v~~-r~el~---------~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~ 497 (567)
T PLN03086 433 NVVCPHDGCGIVLR-VEEAK---------NHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQA 497 (567)
T ss_pred ceeCCcccccceee-ccccc---------cCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhh
Confidence 456774 999883 33333 3468999999996 688999999985 789999 99765 6689999999
Q ss_pred hccccccccccccccccC----------ChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHH
Q psy8050 193 THSGERPYRCRLCKKAFS----------DSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRV 249 (311)
Q Consensus 193 ~h~~~k~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 249 (311)
.|.+.+++.|..|++.|. ....|..|..+. |.+++.|..||+.+..+ +|..|+..
T Consensus 498 thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 498 STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIA 562 (567)
T ss_pred ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHH
Confidence 999999999999999995 245899999885 89999999999998855 57788643
No 13
>PHA00733 hypothetical protein
Probab=99.02 E-value=2.4e-10 Score=87.49 Aligned_cols=55 Identities=29% Similarity=0.567 Sum_probs=39.4
Q ss_pred ccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCC
Q psy8050 196 GERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGS 252 (311)
Q Consensus 196 ~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 252 (311)
+.++|.|+.|++.|.+...|..|++.+ +.+|.|++|++.|.....|..|+...++
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 466777777777777777777777765 3457777777777777777777766544
No 14
>PHA00733 hypothetical protein
Probab=98.93 E-value=7.5e-10 Score=84.78 Aligned_cols=83 Identities=23% Similarity=0.404 Sum_probs=63.5
Q ss_pred CCCCeecCcCccccCChhHHHHH--H---HHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHH
Q psy8050 140 GEKPYRCEDCNKSFSQAANLTAH--V---RTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSST 214 (311)
Q Consensus 140 ~~~~~~C~~C~~~f~~~~~L~~H--~---~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~ 214 (311)
..+++.|.+|...|.....|..+ + ..++++.+|.|+.|++.|.....|..|+..+ +.+|.|..|++.|.....
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHH
Confidence 34668888888887776666555 1 1234577888999999999888888888876 356899999999999999
Q ss_pred HHHHhhhhcC
Q psy8050 215 LTKHLRIHSG 224 (311)
Q Consensus 215 l~~H~~~h~~ 224 (311)
|..|+...++
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 9999887664
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.75 E-value=4e-09 Score=66.78 Aligned_cols=42 Identities=21% Similarity=0.525 Sum_probs=24.2
Q ss_pred ccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHH
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNL 243 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L 243 (311)
|.|+.||+.|...++|..|+++|+ ++|+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 556666666666666666666655 455566666655554444
No 16
>KOG3993|consensus
Probab=98.74 E-value=9.5e-10 Score=96.76 Aligned_cols=80 Identities=30% Similarity=0.611 Sum_probs=49.9
Q ss_pred eecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccc---------------------------
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSG--------------------------- 196 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~--------------------------- 196 (311)
|.|..|...|.....|.+|.-...-...|+|++|++.|.-..+|..|.+.|-.
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~r 347 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAER 347 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccc
Confidence 67777777777777777775322222347777777777777777777776631
Q ss_pred ------cccccccccccccCChHHHHHHhhhhc
Q psy8050 197 ------ERPYRCRLCKKAFSDSSTLTKHLRIHS 223 (311)
Q Consensus 197 ------~k~~~C~~C~~~f~~~~~l~~H~~~h~ 223 (311)
+..|.|..|++.|.+...|+.|+.+|+
T Consensus 348 sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq 380 (500)
T KOG3993|consen 348 SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQ 380 (500)
T ss_pred cCCcccCceeecHHhhhhhHHHHHHHHhHHhhh
Confidence 123666666666666666666655554
No 17
>KOG3993|consensus
Probab=98.72 E-value=1.4e-09 Score=95.74 Aligned_cols=164 Identities=20% Similarity=0.190 Sum_probs=108.9
Q ss_pred ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcC-------------------------
Q psy8050 114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSG------------------------- 168 (311)
Q Consensus 114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~------------------------- 168 (311)
..++|.+|...|.+.-.|.+|...-.....|.|+.|+++|....+|..|.+.|-.
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 3489999999999999999998654444569999999999999999999988721
Q ss_pred --------CCceecCCCCccccCcchhhhhhhhccccc-----------------cccccccccccCChHHHHHHhhhhc
Q psy8050 169 --------EKPFRCPICDRRFSQSSSVTTHMRTHSGER-----------------PYRCRLCKKAFSDSSTLTKHLRIHS 223 (311)
Q Consensus 169 --------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k-----------------~~~C~~C~~~f~~~~~l~~H~~~h~ 223 (311)
+..|.|.+|++.|.....|+.|+.+|+... .+-|..|+-.+...+.--.+...+.
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a 425 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA 425 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence 123899999999999999999988875321 0234444444443332222222221
Q ss_pred C-CCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhcee
Q psy8050 224 G-EKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAV 284 (311)
Q Consensus 224 ~-~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 284 (311)
+ .....|+.|+..+.++..--.+.+.-..++. +.|.+|.-.|.+...|..|++.
T Consensus 426 ~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~-------f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 426 GSAELELPPYDGSPPSSSGSSGGYGRLGIAEQG-------FTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred ccccccCCCCCCCCcccCCCCCccccccchhhc-------cccccchHhhhcCcchHhHhhh
Confidence 1 1123467777666665443333332222211 1588888888888888888765
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.62 E-value=2.4e-08 Score=63.24 Aligned_cols=42 Identities=19% Similarity=0.521 Sum_probs=22.8
Q ss_pred eecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchh
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSV 187 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l 187 (311)
|.|+.||+.|...++|..|+++|+ ++++|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 455555555555555555555554 345555555555544443
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.51 E-value=8.5e-08 Score=51.81 Aligned_cols=26 Identities=50% Similarity=1.067 Sum_probs=19.9
Q ss_pred HHHHHhhhhcCCCCeecCccccccCC
Q psy8050 214 TLTKHLRIHSGEKPYQCKLCLLRFSQ 239 (311)
Q Consensus 214 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 239 (311)
+|.+|+++|++++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 46778888888888888888887753
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.34 E-value=4.2e-07 Score=49.06 Aligned_cols=24 Identities=67% Similarity=1.334 Sum_probs=13.2
Q ss_pred HHHHHhhhCCCCCeecCcCccccC
Q psy8050 131 LKTHLRTHSGEKPYRCEDCNKSFS 154 (311)
Q Consensus 131 L~~H~~~h~~~~~~~C~~C~~~f~ 154 (311)
|..|++.|+++++|.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555553
No 21
>PHA00616 hypothetical protein
Probab=98.14 E-value=1e-06 Score=53.07 Aligned_cols=32 Identities=19% Similarity=0.407 Sum_probs=16.3
Q ss_pred ccccccccccCChHHHHHHhhhhcCCCCeecC
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCK 231 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 231 (311)
|.|..||+.|...+.|..|++.|++++++.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 44555555555555555555555555554444
No 22
>PHA00732 hypothetical protein
Probab=98.06 E-value=2.9e-06 Score=58.95 Aligned_cols=46 Identities=24% Similarity=0.551 Sum_probs=31.0
Q ss_pred cccccccccccCChHHHHHHhhh-hcCCCCeecCccccccCChHHHHHHHHHh
Q psy8050 199 PYRCRLCKKAFSDSSTLTKHLRI-HSGEKPYQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
||.|..|++.|.+...|..|++. |. ++.|+.|++.|. .|..|.+++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 46777777777777777777764 43 256777777776 366666444
No 23
>PHA00616 hypothetical protein
Probab=98.05 E-value=2.1e-06 Score=51.76 Aligned_cols=33 Identities=15% Similarity=0.399 Sum_probs=19.9
Q ss_pred CeecCcCccccCChhHHHHHHHHhcCCCceecC
Q psy8050 143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCP 175 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~ 175 (311)
||+|+.||+.|...++|..|++.|+++.++.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 355666666666666666666666666655554
No 24
>PHA00732 hypothetical protein
Probab=97.97 E-value=5.5e-06 Score=57.61 Aligned_cols=48 Identities=17% Similarity=0.163 Sum_probs=39.5
Q ss_pred CeecCccccccCChHHHHHHHHH-hCCCCCCCCcccccccccccccccccccchhhceeeee
Q psy8050 227 PYQCKLCLLRFSQSGNLNRHMRV-HGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTV 287 (311)
Q Consensus 227 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 287 (311)
||.|+.|++.|.+...|..|++. |.+ . .|+.|++.|. .|..|.....+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~-~---------~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL-T---------KCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC-C---------ccCCCCCEeC---ChhhhhcccCC
Confidence 58999999999999999999985 543 2 5999999998 48888855443
No 25
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.78 E-value=4.3e-05 Score=49.23 Aligned_cols=47 Identities=32% Similarity=0.564 Sum_probs=19.9
Q ss_pred ccccccccccCChHHHHHHhhhhcC-C-CCeecCccccccCChHHHHHHHHH
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIHSG-E-KPYQCKLCLLRFSQSGNLNRHMRV 249 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h~~-~-~~~~C~~C~~~f~~~~~L~~H~~~ 249 (311)
|.|++|++ ..+...|..|....+. + +.+.|++|...+. .+|..|+..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~ 51 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNS 51 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHH
Confidence 44555555 3334444455433222 1 2344555544332 245555443
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.74 E-value=2.7e-05 Score=40.58 Aligned_cols=23 Identities=52% Similarity=1.013 Sum_probs=15.2
Q ss_pred eecCccccccCChHHHHHHHHHh
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
|.|+.|++.|.+...|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56666777777666666666654
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.53 E-value=7.4e-05 Score=38.89 Aligned_cols=22 Identities=50% Similarity=0.976 Sum_probs=12.5
Q ss_pred eecCcCccccCChhHHHHHHHH
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRT 165 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~ 165 (311)
|.|+.|++.|.+...|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4555555555555555555554
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.52 E-value=0.00019 Score=46.17 Aligned_cols=49 Identities=27% Similarity=0.355 Sum_probs=25.0
Q ss_pred eeccCCCCcCCChhHHHHHHhh-hCCC-CCeecCcCccccCChhHHHHHHHHhc
Q psy8050 116 LIQFIPGKTYARPSTLKTHLRT-HSGE-KPYRCEDCNKSFSQAANLTAHVRTHS 167 (311)
Q Consensus 116 ~~C~~C~~~f~~~~~L~~H~~~-h~~~-~~~~C~~C~~~f~~~~~L~~H~~~h~ 167 (311)
|.|+.|++ ..+...|..|... |..+ +.+.|++|...+. .+|..|+..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 45566666 3444556666533 3222 3466666665433 25566655443
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.46 E-value=0.00013 Score=38.21 Aligned_cols=23 Identities=39% Similarity=0.891 Sum_probs=14.0
Q ss_pred eecCccccccCChHHHHHHHHHh
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
|.|++|+..|.+...|..|+.+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45666777777666676666655
No 30
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.37 E-value=9.7e-05 Score=40.11 Aligned_cols=25 Identities=36% Similarity=0.704 Sum_probs=17.0
Q ss_pred CeecCccccccCChHHHHHHHHHhC
Q psy8050 227 PYQCKLCLLRFSQSGNLNRHMRVHG 251 (311)
Q Consensus 227 ~~~C~~C~~~f~~~~~L~~H~~~H~ 251 (311)
+|.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777766654
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.36 E-value=0.00015 Score=52.92 Aligned_cols=24 Identities=29% Similarity=0.688 Sum_probs=16.4
Q ss_pred CeecCccccccCChHHHHHHHHHh
Q psy8050 227 PYQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 227 ~~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
.+.|..|+..|.+...|..|++.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 477777777777777777777653
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.22 E-value=0.00031 Score=36.63 Aligned_cols=23 Identities=43% Similarity=0.928 Sum_probs=11.5
Q ss_pred eecCcCccccCChhHHHHHHHHh
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRTH 166 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~h 166 (311)
|.|++|+..|.+...|..|+..+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555544
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.22 E-value=0.00018 Score=52.47 Aligned_cols=72 Identities=25% Similarity=0.582 Sum_probs=15.8
Q ss_pred cCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhh
Q psy8050 146 CEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIH 222 (311)
Q Consensus 146 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h 222 (311)
|..|+..|.+...|..|+...++-.. . ....+.....+..+..... ...+.|..|++.|.+...|..|++.+
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred cccccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 55666666666666666644333211 0 0011112222222222111 12466777777777777777777654
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.01 E-value=0.0004 Score=37.64 Aligned_cols=23 Identities=39% Similarity=0.989 Sum_probs=13.2
Q ss_pred eecCcCccccCChhHHHHHHHHh
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRTH 166 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~h 166 (311)
|.|..|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 55555555555555555555444
No 35
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.81 E-value=0.00037 Score=59.89 Aligned_cols=20 Identities=15% Similarity=0.237 Sum_probs=16.1
Q ss_pred ccccccccccccccchhhce
Q psy8050 264 VNQVCSNIVDQRRGIYVVEA 283 (311)
Q Consensus 264 ~C~~C~~~f~~~~~l~~H~~ 283 (311)
.|++|++.|.+..+|..|..
T Consensus 400 rCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 400 RCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred eccccchhhccCccceeccc
Confidence 68888888888888888864
No 36
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.60 E-value=0.0021 Score=33.93 Aligned_cols=23 Identities=39% Similarity=0.784 Sum_probs=13.4
Q ss_pred eecCccccccCChHHHHHHHHHh
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
|.|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34566666666666666665544
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.58 E-value=0.00075 Score=58.08 Aligned_cols=51 Identities=29% Similarity=0.770 Sum_probs=36.9
Q ss_pred cccccccc--cccccCChHHHHHHhhhhc-------------------CCCCeecCccccccCChHHHHHHH
Q psy8050 197 ERPYRCRL--CKKAFSDSSTLTKHLRIHS-------------------GEKPYQCKLCLLRFSQSGNLNRHM 247 (311)
Q Consensus 197 ~k~~~C~~--C~~~f~~~~~l~~H~~~h~-------------------~~~~~~C~~C~~~f~~~~~L~~H~ 247 (311)
++||+|++ |.+.|.+...|+.|+.--+ ..+||.|++|++.+.....|.-|.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 47777766 7777777777777764221 247899999999998888887774
No 38
>KOG2231|consensus
Probab=96.41 E-value=0.0052 Score=59.04 Aligned_cols=100 Identities=24% Similarity=0.550 Sum_probs=61.5
Q ss_pred cCcCccccCChhHHHHHHHHhcCCCceecCCCC---------ccccCcchhhhhhhhccc-cc----cccccccccccCC
Q psy8050 146 CEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD---------RRFSQSSSVTTHMRTHSG-ER----PYRCRLCKKAFSD 211 (311)
Q Consensus 146 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---------~~f~~~~~l~~H~~~h~~-~k----~~~C~~C~~~f~~ 211 (311)
|..| -.|.+...|..|+...|+ .+.|.+|- ....+...|..|+..--. +. .-.|..|...|..
T Consensus 118 ~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld 194 (669)
T KOG2231|consen 118 CLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLD 194 (669)
T ss_pred Cccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhcc
Confidence 4455 445567788888843322 23444432 122344566666654211 11 2468888888888
Q ss_pred hHHHHHHhhhhcCCCCeecCcc------ccccCChHHHHHHHHHhCC
Q psy8050 212 SSTLTKHLRIHSGEKPYQCKLC------LLRFSQSGNLNRHMRVHGS 252 (311)
Q Consensus 212 ~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~H~~ 252 (311)
...|.+|++.++ |.|..| +.-|.....|..|.+.+|-
T Consensus 195 ~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~Hf 237 (669)
T KOG2231|consen 195 DDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHF 237 (669)
T ss_pred HHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcCc
Confidence 888888888765 556666 3457778888888876543
No 39
>KOG2231|consensus
Probab=96.40 E-value=0.0078 Score=57.90 Aligned_cols=122 Identities=22% Similarity=0.473 Sum_probs=79.5
Q ss_pred ccCCCCcCCChhHHHHHHhhhCCCCCeecCcCc---------cccCChhHHHHHHHHhcC-CC----ceecCCCCccccC
Q psy8050 118 QFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCN---------KSFSQAANLTAHVRTHSG-EK----PFRCPICDRRFSQ 183 (311)
Q Consensus 118 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---------~~f~~~~~L~~H~~~h~~-~~----~~~C~~C~~~f~~ 183 (311)
|..| ..|.+...|+.|+...+ +.+.|.+|- ....+...|..|+..-.. +. .-.|..|...|..
T Consensus 118 ~~~c-~~~~s~~~Lk~H~~~~H--~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld 194 (669)
T KOG2231|consen 118 CLHC-TEFKSVENLKNHMRDQH--KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLD 194 (669)
T ss_pred Cccc-cchhHHHHHHHHHHHhh--hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhcc
Confidence 4445 45557889999995433 234454442 223346778888764321 21 2469999999999
Q ss_pred cchhhhhhhhcccccccccccc------ccccCChHHHHHHhhhhcCCCCeecC--ccc-cccCChHHHHHHHHHh
Q psy8050 184 SSSVTTHMRTHSGERPYRCRLC------KKAFSDSSTLTKHLRIHSGEKPYQCK--LCL-LRFSQSGNLNRHMRVH 250 (311)
Q Consensus 184 ~~~l~~H~~~h~~~k~~~C~~C------~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~-~~f~~~~~L~~H~~~H 250 (311)
...|.+|++.++ |.|..| +.-|.....|..|.+.+| |.|+ .|. ..|.....+..+++.|
T Consensus 195 ~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk~~ 262 (669)
T KOG2231|consen 195 DDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEIELKAH 262 (669)
T ss_pred HHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHHHHHhh
Confidence 999999998765 556666 456788899999998877 7787 564 3344443455555433
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.08 E-value=0.0055 Score=32.23 Aligned_cols=24 Identities=46% Similarity=0.962 Sum_probs=17.8
Q ss_pred ccccccccccCChHHHHHHhhhhc
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIHS 223 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h~ 223 (311)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 467778888888888888777553
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.00 E-value=0.0055 Score=32.01 Aligned_cols=23 Identities=39% Similarity=0.803 Sum_probs=12.9
Q ss_pred eecCccccccCChHHHHHHHHHhC
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRVHG 251 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~H~ 251 (311)
|.|+.|+.... ...|..|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666665 666666666543
No 42
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.98 E-value=0.0041 Score=32.85 Aligned_cols=21 Identities=33% Similarity=0.866 Sum_probs=13.2
Q ss_pred eecCccccccCChHHHHHHHH
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMR 248 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~ 248 (311)
|.|.+|+..|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 456666666666666666654
No 43
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.93 E-value=0.011 Score=51.67 Aligned_cols=129 Identities=24% Similarity=0.434 Sum_probs=68.2
Q ss_pred eeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCc---cccC------ChhHHHHHHHHhcCCCc----eecCCCCcc
Q psy8050 116 LIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCN---KSFS------QAANLTAHVRTHSGEKP----FRCPICDRR 180 (311)
Q Consensus 116 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~L~~H~~~h~~~~~----~~C~~C~~~ 180 (311)
|.|+. |..+...-..|..|.+..++. +.|.+|- +.|. +...|..|...-..+.. -.|..|...
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 44443 445555556666676654432 4455542 2232 23445555433222211 247777777
Q ss_pred ccCcchhhhhhhhcccccccccccccc-------ccCChHHHHHHhhhhcCCCCeecCc--cc----cccCChHHHHHHH
Q psy8050 181 FSQSSSVTTHMRTHSGERPYRCRLCKK-------AFSDSSTLTKHLRIHSGEKPYQCKL--CL----LRFSQSGNLNRHM 247 (311)
Q Consensus 181 f~~~~~l~~H~~~h~~~k~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~~~~L~~H~ 247 (311)
|....+|.+|++..+ + .|-+|++ -|.+...|..|.+.-+ |.|.+ |. ..|.+...|..|+
T Consensus 230 FYdDDEL~~HcR~~H-E---~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~ 301 (493)
T COG5236 230 FYDDDELRRHCRLRH-E---ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHL 301 (493)
T ss_pred ecChHHHHHHHHhhh-h---hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHH
Confidence 777777777777544 2 3444443 2666667777665332 65643 42 3577777777775
Q ss_pred -HHhCCCC
Q psy8050 248 -RVHGSQY 254 (311)
Q Consensus 248 -~~H~~~~ 254 (311)
+.|....
T Consensus 302 ~~~h~~~~ 309 (493)
T COG5236 302 TRFHKVNA 309 (493)
T ss_pred HHHhhccc
Confidence 3454333
No 44
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.91 E-value=0.0069 Score=37.48 Aligned_cols=27 Identities=33% Similarity=0.626 Sum_probs=11.8
Q ss_pred CCeecCccccccCChHHHHHHHHHhCC
Q psy8050 226 KPYQCKLCLLRFSQSGNLNRHMRVHGS 252 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L~~H~~~H~~ 252 (311)
.|-.|++|+..+.+..+|.+|+.++++
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhc
Confidence 445555555555555555555554443
No 45
>PRK04860 hypothetical protein; Provisional
Probab=95.70 E-value=0.005 Score=49.00 Aligned_cols=39 Identities=33% Similarity=0.778 Sum_probs=32.3
Q ss_pred ccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCCh
Q psy8050 198 RPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQS 240 (311)
Q Consensus 198 k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 240 (311)
-+|.|. |+. ....+.+|.++|.++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 468897 887 667788999999999999999999887654
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.69 E-value=0.014 Score=36.11 Aligned_cols=29 Identities=28% Similarity=0.490 Sum_probs=14.9
Q ss_pred CCeecCcCccccCChhHHHHHHHHhcCCC
Q psy8050 142 KPYRCEDCNKSFSQAANLTAHVRTHSGEK 170 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 170 (311)
.|-.|++|+..+....+|.+|+...++.+
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 45666666666666666666666555544
No 47
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.50 E-value=0.0075 Score=55.86 Aligned_cols=144 Identities=28% Similarity=0.494 Sum_probs=88.4
Q ss_pred CeecCcCccccCChhHHHHHHH--HhcCC--CceecC--CCCccccCcchhhhhhhhccccccccccc--cccccCChHH
Q psy8050 143 PYRCEDCNKSFSQAANLTAHVR--THSGE--KPFRCP--ICDRRFSQSSSVTTHMRTHSGERPYRCRL--CKKAFSDSST 214 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~k~~~C~~--C~~~f~~~~~ 214 (311)
++.|..|...|.....|..|.. .|.++ .++.|+ .|++.|.....+..|...|.+..++.+.. +...+.....
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLN 368 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccC
Confidence 5677777777777777777777 67777 777777 67777777777777777777766655544 3333332222
Q ss_pred HH-----HHhhhhcCCCCeecC--ccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhceeeee
Q psy8050 215 LT-----KHLRIHSGEKPYQCK--LCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTV 287 (311)
Q Consensus 215 l~-----~H~~~h~~~~~~~C~--~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 287 (311)
-. .....-.....+.+. .|...+.....+..|...|........ .+..|...+.....|..|++.|.-
T Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 443 (467)
T COG5048 369 NEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNC-----KNPPCSKSFNRHYNLIPHKKIHTN 443 (467)
T ss_pred CCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCC-----CCCcchhhccCccccccccccccc
Confidence 11 111111112223332 255556666666666666655443211 578888888888888888887776
Q ss_pred cccc
Q psy8050 288 QSTW 291 (311)
Q Consensus 288 ~~~~ 291 (311)
..+.
T Consensus 444 ~~~~ 447 (467)
T COG5048 444 HAPL 447 (467)
T ss_pred CCce
Confidence 6665
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.40 E-value=0.0086 Score=31.55 Aligned_cols=23 Identities=39% Similarity=0.941 Sum_probs=16.0
Q ss_pred ccccccccccCChHHHHHHhhhh
Q psy8050 200 YRCRLCKKAFSDSSTLTKHLRIH 222 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~h 222 (311)
|.|.+|+..|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 46777777777777777776643
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.36 E-value=0.017 Score=30.12 Aligned_cols=22 Identities=41% Similarity=0.889 Sum_probs=10.7
Q ss_pred eecCcCccccCChhHHHHHHHHh
Q psy8050 144 YRCEDCNKSFSQAANLTAHVRTH 166 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~~H~~~h 166 (311)
|.|+.|+.... ...|..|++.+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 44555555554 55555555544
No 50
>KOG1146|consensus
Probab=94.99 E-value=0.013 Score=59.94 Aligned_cols=51 Identities=22% Similarity=0.303 Sum_probs=30.7
Q ss_pred eeeccCCCCcCCChhHHHHHHh-hhCCCCCeecCcCccccCChhHHHHHHHH
Q psy8050 115 ILIQFIPGKTYARPSTLKTHLR-THSGEKPYRCEDCNKSFSQAANLTAHVRT 165 (311)
Q Consensus 115 ~~~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 165 (311)
...|.-|+..+.....+..|+. .+...+.+.|+.|+..|.....|..||+.
T Consensus 436 l~e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRs 487 (1406)
T KOG1146|consen 436 LVELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRS 487 (1406)
T ss_pred cccccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccc
Confidence 3344444545555444444442 34444667788888877777777777765
No 51
>PRK04860 hypothetical protein; Provisional
Probab=94.86 E-value=0.023 Score=45.26 Aligned_cols=34 Identities=38% Similarity=0.902 Sum_probs=16.1
Q ss_pred eecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050 172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF 209 (311)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f 209 (311)
|.|. |+. ....+.+|.++|.++++|.|..|+..|
T Consensus 120 Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l 153 (160)
T PRK04860 120 YRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETL 153 (160)
T ss_pred EEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCcee
Confidence 4444 443 334444455555544445555554444
No 52
>KOG2482|consensus
Probab=94.85 E-value=0.061 Score=47.12 Aligned_cols=57 Identities=18% Similarity=0.255 Sum_probs=37.0
Q ss_pred eecCccccccCChHHHHHHHHH-hCC-------CC--CCC----------Ccccccccccccccccccccchhhcee
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRV-HGS-------QY--NML----------PVIYMGVNQVCSNIVDQRRGIYVVEAV 284 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~-H~~-------~~--~~~----------~~~~~~~C~~C~~~f~~~~~l~~H~~~ 284 (311)
..|-.|.....+...|..||.+ |.- +. +.. ...+...|-.|.-.|....+|..|+..
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 4788888887778888888763 421 10 000 001223688888899999999999854
No 53
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.45 E-value=0.0093 Score=55.26 Aligned_cols=142 Identities=25% Similarity=0.409 Sum_probs=105.1
Q ss_pred ceeeccCCCCcCCChhHHHHHHh--hhCCC--CCeecC--cCccccCChhHHHHHHHHhcCCCceecCC--CCccccCcc
Q psy8050 114 LILIQFIPGKTYARPSTLKTHLR--THSGE--KPYRCE--DCNKSFSQAANLTAHVRTHSGEKPFRCPI--CDRRFSQSS 185 (311)
Q Consensus 114 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~ 185 (311)
....|..|...|.....|..|.. .|.++ +++.|+ .|++.|.....+..|...|.+..++.+.. +...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 46788999999999999999999 89999 999999 79999999999999999999988777755 444433333
Q ss_pred hhhhhhh-----hccccccccccc--cccccCChHHHHHHhhhhcCCC--CeecCccccccCChHHHHHHHHHhCCCCC
Q psy8050 186 SVTTHMR-----THSGERPYRCRL--CKKAFSDSSTLTKHLRIHSGEK--PYQCKLCLLRFSQSGNLNRHMRVHGSQYN 255 (311)
Q Consensus 186 ~l~~H~~-----~h~~~k~~~C~~--C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~L~~H~~~H~~~~~ 255 (311)
.-..+.. .-.....+.+.. |...+.....+..|...|.... .+.+..|...|.....+..|++.|.....
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence 2111111 112233344433 6667777777777777776655 35678899999999999999888866653
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.37 E-value=0.018 Score=31.03 Aligned_cols=20 Identities=35% Similarity=0.858 Sum_probs=9.1
Q ss_pred ccccccccccCChHHHHHHh
Q psy8050 200 YRCRLCKKAFSDSSTLTKHL 219 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~ 219 (311)
|.|..|++.|.+...|..|+
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~ 21 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHM 21 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCT
T ss_pred CCcccCCCCcCCHHHHHHHH
Confidence 34444444444444444444
No 55
>KOG1146|consensus
Probab=94.18 E-value=0.028 Score=57.58 Aligned_cols=59 Identities=27% Similarity=0.499 Sum_probs=41.2
Q ss_pred CCeecCccccccCChHHHHHHHH--HhCCC-----CC--------------CCC---------------ccccccccccc
Q psy8050 226 KPYQCKLCLLRFSQSGNLNRHMR--VHGSQ-----YN--------------MLP---------------VIYMGVNQVCS 269 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L~~H~~--~H~~~-----~~--------------~~~---------------~~~~~~C~~C~ 269 (311)
++|.|..|..+++...+|..|+. .|..+ ++ ..| ....|.|.+|+
T Consensus 517 ~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~ 596 (1406)
T KOG1146|consen 517 KPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCS 596 (1406)
T ss_pred CcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhc
Confidence 57888899999998888888875 23221 00 000 01236899999
Q ss_pred ccccccccchhhcee
Q psy8050 270 NIVDQRRGIYVVEAV 284 (311)
Q Consensus 270 ~~f~~~~~l~~H~~~ 284 (311)
+.-.-...|+.|+..
T Consensus 597 yetniarnlrihmts 611 (1406)
T KOG1146|consen 597 YETNIARNLRIHMTA 611 (1406)
T ss_pred chhhhhhcccccccc
Confidence 999999999999864
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.93 E-value=0.026 Score=30.37 Aligned_cols=22 Identities=36% Similarity=0.899 Sum_probs=15.9
Q ss_pred eecCCCCccccCcchhhhhhhh
Q psy8050 172 FRCPICDRRFSQSSSVTTHMRT 193 (311)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~ 193 (311)
|.|..|++.|.+...|..|+..
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777654
No 57
>KOG2785|consensus
Probab=93.08 E-value=0.4 Score=42.94 Aligned_cols=23 Identities=35% Similarity=0.681 Sum_probs=17.3
Q ss_pred CCeecCcCccccCChhHHHHHHH
Q psy8050 142 KPYRCEDCNKSFSQAANLTAHVR 164 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~~~L~~H~~ 164 (311)
.++.|.+|.+.|.+......|+.
T Consensus 67 ~~~~c~~c~k~~~s~~a~~~hl~ 89 (390)
T KOG2785|consen 67 SVVYCEACNKSFASPKAHENHLK 89 (390)
T ss_pred cceehHHhhccccChhhHHHHHH
Confidence 45778888888888777777764
No 58
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=93.02 E-value=0.17 Score=37.35 Aligned_cols=48 Identities=19% Similarity=0.351 Sum_probs=27.3
Q ss_pred cccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHh
Q psy8050 201 RCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 201 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
.|-.|...|........- .-.....|.|+.|...|-..-++..|...|
T Consensus 57 ~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred cccCcCCCCCCccccccc--ccccccceeCCCCCCccccccchhhhhhcc
Confidence 377777777653211100 011234577888888877777777776655
No 59
>KOG4173|consensus
Probab=92.72 E-value=0.037 Score=44.90 Aligned_cols=46 Identities=24% Similarity=0.642 Sum_probs=28.3
Q ss_pred eecCc--CccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050 144 YRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR 192 (311)
Q Consensus 144 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 192 (311)
|.|++ |.+.|.+......|..+-++. .|..|.+.|.+...|..|+.
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~ 127 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHIL 127 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHH
Confidence 56654 666666666666665544443 46666666666666666654
No 60
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.18 E-value=0.14 Score=27.07 Aligned_cols=20 Identities=25% Similarity=0.713 Sum_probs=11.8
Q ss_pred eecCccccccCChHHHHHHHH
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMR 248 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~ 248 (311)
..|+.||+.| ..+.|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3566666666 4555666654
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.05 E-value=0.14 Score=27.05 Aligned_cols=19 Identities=26% Similarity=0.690 Sum_probs=9.5
Q ss_pred ecCcCccccCChhHHHHHHH
Q psy8050 145 RCEDCNKSFSQAANLTAHVR 164 (311)
Q Consensus 145 ~C~~C~~~f~~~~~L~~H~~ 164 (311)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 355555555 3445555543
No 62
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=92.02 E-value=0.24 Score=43.58 Aligned_cols=133 Identities=20% Similarity=0.347 Sum_probs=83.7
Q ss_pred CeecCc--CccccCChhHHHHHHHHhcCCCceecCCCC---cccc------Ccchhhhhhhhccccc----ccccccccc
Q psy8050 143 PYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICD---RRFS------QSSSVTTHMRTHSGER----PYRCRLCKK 207 (311)
Q Consensus 143 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~l~~H~~~h~~~k----~~~C~~C~~ 207 (311)
.|.|+. |..+...-..|..|....++. +.|.+|- +.|. +...|..|...-..+. .-.|..|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 377864 666666677899998876654 6787763 2343 3456666665433332 236999999
Q ss_pred ccCChHHHHHHhhhhcCCCCeecCccc----cccCChHHHHHHHHHhCCCCCCCCcccccccccc-cccccccccchhhc
Q psy8050 208 AFSDSSTLTKHLRIHSGEKPYQCKLCL----LRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVC-SNIVDQRRGIYVVE 282 (311)
Q Consensus 208 ~f~~~~~l~~H~~~h~~~~~~~C~~C~----~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C-~~~f~~~~~l~~H~ 282 (311)
.|.....|..|++..+ ++-|.|+.-+ +-|.+..+|..|.+.-+--- .++ .|..= -.+|.....|..|+
T Consensus 229 ~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~c---t~q---tc~~~k~~vf~~~~el~~h~ 301 (493)
T COG5236 229 YFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCC---TFQ---TCRVGKCYVFPYHTELLEHL 301 (493)
T ss_pred eecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcCceEE---EEE---EEecCcEEEeccHHHHHHHH
Confidence 9999999999999755 3323343322 34888899999976321110 010 12211 24688888899998
Q ss_pred ee
Q psy8050 283 AV 284 (311)
Q Consensus 283 ~~ 284 (311)
..
T Consensus 302 ~~ 303 (493)
T COG5236 302 TR 303 (493)
T ss_pred HH
Confidence 54
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.68 E-value=0.23 Score=28.28 Aligned_cols=22 Identities=27% Similarity=0.760 Sum_probs=18.0
Q ss_pred CeecCccccccCChHHHHHHHH
Q psy8050 227 PYQCKLCLLRFSQSGNLNRHMR 248 (311)
Q Consensus 227 ~~~C~~C~~~f~~~~~L~~H~~ 248 (311)
+|.|++|+..|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4778888888888888888874
No 64
>KOG2482|consensus
Probab=89.95 E-value=0.37 Score=42.43 Aligned_cols=51 Identities=20% Similarity=0.409 Sum_probs=38.9
Q ss_pred eecCCCCccccCcchhhhhhhhccc---------------------------cccccccccccccCChHHHHHHhhhh
Q psy8050 172 FRCPICDRRFSQSSSVTTHMRTHSG---------------------------ERPYRCRLCKKAFSDSSTLTKHLRIH 222 (311)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------~k~~~C~~C~~~f~~~~~l~~H~~~h 222 (311)
..|-.|.....+...|..||..-+. ...-.|..|...|-....|..|+..+
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 5788888888888888888875431 12246888999999999999998643
No 65
>KOG2893|consensus
Probab=89.19 E-value=0.1 Score=43.39 Aligned_cols=49 Identities=27% Similarity=0.686 Sum_probs=30.3
Q ss_pred ccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHH-HHHhCCCC
Q psy8050 202 CRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRH-MRVHGSQY 254 (311)
Q Consensus 202 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H-~~~H~~~~ 254 (311)
|=+|++.|.....|.+|++..+ |+|.+|.+..-+-..|..| +++|....
T Consensus 13 cwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhketi 62 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHKETI 62 (341)
T ss_pred eeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhhhhh
Confidence 5567777777777777665433 6777777665555556666 45564443
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.79 E-value=0.37 Score=27.38 Aligned_cols=22 Identities=27% Similarity=0.751 Sum_probs=15.4
Q ss_pred CeecCcCccccCChhHHHHHHH
Q psy8050 143 PYRCEDCNKSFSQAANLTAHVR 164 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~ 164 (311)
+|.|++|+..|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3667777777777777777763
No 67
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=88.73 E-value=1.1 Score=33.16 Aligned_cols=85 Identities=20% Similarity=0.396 Sum_probs=47.2
Q ss_pred CCeecCcCccccCChhHHHHHHHHhcCCC------------ceecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050 142 KPYRCEDCNKSFSQAANLTAHVRTHSGEK------------PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF 209 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~------------~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f 209 (311)
.|..|++||-+......|.+....--.-. .-.|--|...|........- .-.....|.|+.|...|
T Consensus 14 LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~F 91 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVF 91 (112)
T ss_pred CCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccccc--ccccccceeCCCCCCcc
Confidence 45566677666666666655321111111 12377788877654321100 01123468888888888
Q ss_pred CChHHHHHHhhhhcCCCCeecCccc
Q psy8050 210 SDSSTLTKHLRIHSGEKPYQCKLCL 234 (311)
Q Consensus 210 ~~~~~l~~H~~~h~~~~~~~C~~C~ 234 (311)
-..-+.-.|...| .|+.|.
T Consensus 92 C~dCD~fiHe~Lh------~CPGC~ 110 (112)
T TIGR00622 92 CVDCDVFVHESLH------CCPGCI 110 (112)
T ss_pred ccccchhhhhhcc------CCcCCC
Confidence 8777777776655 366664
No 68
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=88.19 E-value=1.2 Score=32.90 Aligned_cols=25 Identities=28% Similarity=0.689 Sum_probs=21.2
Q ss_pred eec----CCCCccccCcchhhhhhhhccc
Q psy8050 172 FRC----PICDRRFSQSSSVTTHMRTHSG 196 (311)
Q Consensus 172 ~~C----~~C~~~f~~~~~l~~H~~~h~~ 196 (311)
|.| ..|++.+.+...+..|++.++|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 788 8899999999999999887664
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.17 E-value=0.39 Score=27.18 Aligned_cols=8 Identities=38% Similarity=1.148 Sum_probs=3.3
Q ss_pred ecCCCCcc
Q psy8050 173 RCPICDRR 180 (311)
Q Consensus 173 ~C~~C~~~ 180 (311)
.|..||..
T Consensus 3 ~C~~CGy~ 10 (33)
T cd00350 3 VCPVCGYI 10 (33)
T ss_pred ECCCCCCE
Confidence 34444433
No 70
>KOG2893|consensus
Probab=86.74 E-value=0.27 Score=40.95 Aligned_cols=41 Identities=17% Similarity=0.458 Sum_probs=21.0
Q ss_pred ccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHH
Q psy8050 118 QFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAH 162 (311)
Q Consensus 118 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H 162 (311)
|..|++.|....-|.+|++.. -|.|.+|.+...+--.|..|
T Consensus 13 cwycnrefddekiliqhqkak----hfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK----HFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhhc----cceeeeehhhhccCCCceee
Confidence 555555555555555555432 25555555554444444444
No 71
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=86.20 E-value=0.55 Score=39.78 Aligned_cols=26 Identities=27% Similarity=0.511 Sum_probs=17.1
Q ss_pred CCCeecCccccccCChHHHHHHHHHh
Q psy8050 225 EKPYQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 225 ~~~~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
.+++.|+.||........|..-.|+|
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCCcccccccceeeeecc
Confidence 35677888887776666665555555
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=85.00 E-value=1.1 Score=33.08 Aligned_cols=24 Identities=17% Similarity=0.564 Sum_probs=17.0
Q ss_pred CCeecCcCccccCChhHHHHHHHHh
Q psy8050 142 KPYRCEDCNKSFSQAANLTAHVRTH 166 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~~~L~~H~~~h 166 (311)
+...|..|+.+..- +++..|++..
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~ 33 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKR 33 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHh
Confidence 45678888877655 7788887743
No 73
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=83.32 E-value=0.2 Score=42.07 Aligned_cols=48 Identities=19% Similarity=0.274 Sum_probs=26.3
Q ss_pred CCeecCccccccCChHHHHHHHHH---------hCCCCCCCCcc-ccccccccccccccc
Q psy8050 226 KPYQCKLCLLRFSQSGNLNRHMRV---------HGSQYNMLPVI-YMGVNQVCSNIVDQR 275 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L~~H~~~---------H~~~~~~~~~~-~~~~C~~C~~~f~~~ 275 (311)
+.+.|++|+..|.+..-+....+. +-++ +.|.. ..++|+.||+.+...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~--vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKG--VNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCC--CCCeeeeEEECCCCCCccccc
Confidence 446777777777665444333322 1111 22322 235799998888754
No 74
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.69 E-value=1 Score=33.35 Aligned_cols=34 Identities=15% Similarity=0.286 Sum_probs=18.9
Q ss_pred CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCCh
Q psy8050 112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQA 156 (311)
Q Consensus 112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 156 (311)
++...+|+.||..|... +..|..|+.||..|.-.
T Consensus 6 lGtKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 6 LGTKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cCCcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 34445566666666432 23556666666666543
No 75
>KOG2186|consensus
Probab=82.43 E-value=0.83 Score=38.61 Aligned_cols=49 Identities=18% Similarity=0.493 Sum_probs=27.6
Q ss_pred eeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHh
Q psy8050 115 ILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTH 166 (311)
Q Consensus 115 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h 166 (311)
.|.|..||....-. .|-.|+..-++ .-|.|-.|+..|.. .+...|..--
T Consensus 3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCI 51 (276)
T KOG2186|consen 3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCI 51 (276)
T ss_pred EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhhhc
Confidence 46666776665443 33346644444 44666667766665 5555665433
No 76
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=80.20 E-value=0.97 Score=28.60 Aligned_cols=29 Identities=28% Similarity=0.576 Sum_probs=19.9
Q ss_pred CCCCCeecCcCccccCChhHHHHHHHHhc
Q psy8050 139 SGEKPYRCEDCNKSFSQAANLTAHVRTHS 167 (311)
Q Consensus 139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 167 (311)
-|+..+.|+-|+..|....+..+|....+
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 34556777777777777777777775443
No 77
>KOG4173|consensus
Probab=79.78 E-value=0.61 Score=38.08 Aligned_cols=77 Identities=32% Similarity=0.680 Sum_probs=52.7
Q ss_pred ceecCC--CCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhh----------cCCCCeecC--ccccc
Q psy8050 171 PFRCPI--CDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIH----------SGEKPYQCK--LCLLR 236 (311)
Q Consensus 171 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~~~C~--~C~~~ 236 (311)
.|.|++ |...|.+...+..|...-++. .|..|.+.|.+...|..|+... .|...|+|- .|+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 466765 667777777777776655543 6888888888887777776532 245567774 48888
Q ss_pred cCChHHHHHHH-HHh
Q psy8050 237 FSQSGNLNRHM-RVH 250 (311)
Q Consensus 237 f~~~~~L~~H~-~~H 250 (311)
|.+...-..|+ ++|
T Consensus 156 FkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhHHHHhc
Confidence 88887777775 455
No 78
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=79.59 E-value=2 Score=24.84 Aligned_cols=33 Identities=15% Similarity=0.254 Sum_probs=15.3
Q ss_pred eeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCcccc
Q psy8050 116 LIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSF 153 (311)
Q Consensus 116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 153 (311)
+.|+.|+..|.-.... +.......+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~-----ip~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEK-----IPPKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHH-----CCCCCcEEECCCCCCEe
Confidence 3455555555444332 11222335566665554
No 79
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.93 E-value=2 Score=24.47 Aligned_cols=9 Identities=33% Similarity=1.154 Sum_probs=3.7
Q ss_pred eecCCCCcc
Q psy8050 172 FRCPICDRR 180 (311)
Q Consensus 172 ~~C~~C~~~ 180 (311)
|.|..||..
T Consensus 3 ~~C~~CG~i 11 (34)
T cd00729 3 WVCPVCGYI 11 (34)
T ss_pred EECCCCCCE
Confidence 344444433
No 80
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=76.15 E-value=1.8 Score=32.01 Aligned_cols=14 Identities=29% Similarity=0.669 Sum_probs=7.4
Q ss_pred CCeecCccccccCC
Q psy8050 226 KPYQCKLCLLRFSQ 239 (311)
Q Consensus 226 ~~~~C~~C~~~f~~ 239 (311)
.|..|+.||..|.-
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 44555555555543
No 81
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.62 E-value=1.4 Score=36.96 Aligned_cols=13 Identities=23% Similarity=0.496 Sum_probs=7.7
Q ss_pred ccccccccccCCh
Q psy8050 200 YRCRLCKKAFSDS 212 (311)
Q Consensus 200 ~~C~~C~~~f~~~ 212 (311)
..|+.||.++...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 4577777665533
No 82
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=75.30 E-value=1.6 Score=27.68 Aligned_cols=25 Identities=28% Similarity=0.596 Sum_probs=14.7
Q ss_pred CCCCeecCccccccCChHHHHHHHH
Q psy8050 224 GEKPYQCKLCLLRFSQSGNLNRHMR 248 (311)
Q Consensus 224 ~~~~~~C~~C~~~f~~~~~L~~H~~ 248 (311)
||.-+.|+.||..|....+..+|..
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVN 38 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVN 38 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhh
Confidence 4444566666666666666666654
No 83
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=74.96 E-value=3 Score=24.22 Aligned_cols=33 Identities=18% Similarity=0.382 Sum_probs=15.0
Q ss_pred eeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCcccc
Q psy8050 116 LIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSF 153 (311)
Q Consensus 116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 153 (311)
..|+.|+..|.-...- ...+.....|+.|+..|
T Consensus 3 i~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVF 35 (37)
T ss_pred EECCCCCceEEcCHHH-----cccCCcEEECCCCCcEe
Confidence 3455555555443321 11122345566665555
No 84
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=74.43 E-value=3.6 Score=32.73 Aligned_cols=34 Identities=18% Similarity=0.478 Sum_probs=22.3
Q ss_pred CCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050 139 SGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF 181 (311)
Q Consensus 139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 181 (311)
....-|.|+.|+..|.....+. ..|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 3445577877877777666664 147777777653
No 85
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=74.29 E-value=2.7 Score=24.41 Aligned_cols=10 Identities=30% Similarity=0.996 Sum_probs=4.8
Q ss_pred eecCcCcccc
Q psy8050 144 YRCEDCNKSF 153 (311)
Q Consensus 144 ~~C~~C~~~f 153 (311)
..|+.|+..|
T Consensus 26 v~C~~C~~~~ 35 (38)
T TIGR02098 26 VRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEE
Confidence 4455554444
No 86
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=71.59 E-value=2.8 Score=33.50 Aligned_cols=10 Identities=20% Similarity=0.657 Sum_probs=5.2
Q ss_pred CeecCcCccc
Q psy8050 143 PYRCEDCNKS 152 (311)
Q Consensus 143 ~~~C~~C~~~ 152 (311)
.|.|++||.+
T Consensus 134 ~~vC~vCGy~ 143 (166)
T COG1592 134 VWVCPVCGYT 143 (166)
T ss_pred EEEcCCCCCc
Confidence 3555555544
No 87
>KOG2186|consensus
Probab=70.49 E-value=2.8 Score=35.56 Aligned_cols=47 Identities=21% Similarity=0.526 Sum_probs=35.8
Q ss_pred CeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050 143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR 192 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 192 (311)
.|.|..||.... +..+.+|+...++ ..|.|-.|+..|.. .++..|..
T Consensus 3 ~FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 3 FFTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EEehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 378888888866 4567778877777 56888889998887 66777754
No 88
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=69.90 E-value=2.6 Score=33.69 Aligned_cols=10 Identities=40% Similarity=0.913 Sum_probs=4.8
Q ss_pred CCCeecCccc
Q psy8050 225 EKPYQCKLCL 234 (311)
Q Consensus 225 ~~~~~C~~C~ 234 (311)
+-|-.|++||
T Consensus 147 e~P~~CPiCg 156 (166)
T COG1592 147 EAPEVCPICG 156 (166)
T ss_pred CCCCcCCCCC
Confidence 4444455554
No 89
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.62 E-value=3.9 Score=30.93 Aligned_cols=34 Identities=12% Similarity=0.108 Sum_probs=19.4
Q ss_pred CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCCh
Q psy8050 112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQA 156 (311)
Q Consensus 112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 156 (311)
......|+.||+.|.. -+..|..|+.||..|.-.
T Consensus 6 lGtKr~Cp~cg~kFYD-----------Lnk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 6 LGTKRICPNTGSKFYD-----------LNRRPAVSPYTGEQFPPE 39 (129)
T ss_pred hCccccCCCcCccccc-----------cCCCCccCCCcCCccCcc
Confidence 3444566677766643 233566677777666443
No 90
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=68.62 E-value=9.4 Score=29.93 Aligned_cols=38 Identities=18% Similarity=0.592 Sum_probs=22.6
Q ss_pred CCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050 140 GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF 181 (311)
Q Consensus 140 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 181 (311)
....|.|+.|+..|.....+.. .. ....|.|+.||...
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEE
Confidence 3456888888888775443322 01 12338888887754
No 91
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=67.04 E-value=5.8 Score=32.26 Aligned_cols=32 Identities=25% Similarity=0.695 Sum_probs=20.0
Q ss_pred CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050 141 EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF 181 (311)
Q Consensus 141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 181 (311)
..-|.|+.|+..|.....+. ..|.|+.||...
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 34577777777776665542 247777777654
No 92
>KOG1701|consensus
Probab=66.15 E-value=2.7 Score=38.40 Aligned_cols=44 Identities=18% Similarity=0.177 Sum_probs=28.8
Q ss_pred ccccccccccccccchhhceeeeecccccC--CCCCCCCccccccC
Q psy8050 264 VNQVCSNIVDQRRGIYVVEAVWTVQSTWID--GRSTMCGHSFIVKL 307 (311)
Q Consensus 264 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~--~kC~~C~~~f~~~~ 307 (311)
+|.+|+...-=...-..=.++...++.|-. |+|+.|+-...++.
T Consensus 396 rCs~C~~PI~P~~G~~etvRvvamdr~fHv~CY~CEDCg~~LS~e~ 441 (468)
T KOG1701|consen 396 RCSVCGNPILPRDGKDETVRVVAMDRDFHVNCYKCEDCGLLLSSEE 441 (468)
T ss_pred chhhccCCccCCCCCcceEEEEEccccccccceehhhcCccccccC
Confidence 799999887666555555565555554322 47889987766543
No 93
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=65.76 E-value=3.7 Score=25.67 Aligned_cols=10 Identities=50% Similarity=1.428 Sum_probs=4.0
Q ss_pred eecCcCcccc
Q psy8050 144 YRCEDCNKSF 153 (311)
Q Consensus 144 ~~C~~C~~~f 153 (311)
|.|..||..|
T Consensus 6 y~C~~Cg~~f 15 (52)
T TIGR02605 6 YRCTACGHRF 15 (52)
T ss_pred EEeCCCCCEe
Confidence 3344444433
No 94
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=64.28 E-value=8.5 Score=24.85 Aligned_cols=10 Identities=50% Similarity=1.451 Sum_probs=5.6
Q ss_pred CceecCCCCc
Q psy8050 170 KPFRCPICDR 179 (311)
Q Consensus 170 ~~~~C~~C~~ 179 (311)
.+|.|+.||.
T Consensus 49 ~~Y~Cp~CGF 58 (61)
T COG2888 49 NPYRCPKCGF 58 (61)
T ss_pred CceECCCcCc
Confidence 3466666654
No 95
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=63.99 E-value=3.2 Score=30.35 Aligned_cols=13 Identities=8% Similarity=-0.171 Sum_probs=9.4
Q ss_pred CCCCCCCCCCCCC
Q psy8050 79 PYQVGAPQQFPTA 91 (311)
Q Consensus 79 ~~~C~~c~~~~~~ 91 (311)
.+.|..|+.....
T Consensus 16 ~~~C~~C~~~~~~ 28 (104)
T TIGR01384 16 VYVCPSCGYEKEK 28 (104)
T ss_pred eEECcCCCCcccc
Confidence 5789999876543
No 96
>KOG1280|consensus
Probab=63.83 E-value=7.4 Score=34.65 Aligned_cols=22 Identities=18% Similarity=0.345 Sum_probs=10.9
Q ss_pred CeecCcCccccCChhHHHHHHH
Q psy8050 143 PYRCEDCNKSFSQAANLTAHVR 164 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~ 164 (311)
.|.|++|+..-.+...|..|..
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~ 100 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVL 100 (381)
T ss_pred cccCCcccccccchhHHHHHhh
Confidence 3555555555444455555543
No 97
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=63.03 E-value=6.5 Score=23.87 Aligned_cols=10 Identities=30% Similarity=1.016 Sum_probs=4.4
Q ss_pred eecCcCcccc
Q psy8050 144 YRCEDCNKSF 153 (311)
Q Consensus 144 ~~C~~C~~~f 153 (311)
|.|..||..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 98
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=62.99 E-value=4.7 Score=23.63 Aligned_cols=10 Identities=40% Similarity=1.198 Sum_probs=4.2
Q ss_pred eecCCCCccc
Q psy8050 172 FRCPICDRRF 181 (311)
Q Consensus 172 ~~C~~C~~~f 181 (311)
|.|..||..|
T Consensus 6 y~C~~Cg~~f 15 (41)
T smart00834 6 YRCEDCGHTF 15 (41)
T ss_pred EEcCCCCCEE
Confidence 3444444433
No 99
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=62.77 E-value=5 Score=24.48 Aligned_cols=10 Identities=30% Similarity=1.218 Sum_probs=4.3
Q ss_pred eecCcCcccc
Q psy8050 144 YRCEDCNKSF 153 (311)
Q Consensus 144 ~~C~~C~~~f 153 (311)
|.|+.||..|
T Consensus 4 y~C~~CG~~~ 13 (46)
T PRK00398 4 YKCARCGREV 13 (46)
T ss_pred EECCCCCCEE
Confidence 3444444443
No 100
>KOG2879|consensus
Probab=62.26 E-value=11 Score=32.54 Aligned_cols=28 Identities=29% Similarity=0.211 Sum_probs=22.2
Q ss_pred HhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050 16 KKRKKKGILANSSSKEKIQDRSECPGSG 43 (311)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~C~~c~ 43 (311)
+.|..+-+.++..+++-.+..-+|+.||
T Consensus 219 ~sw~~~l~~ap~~sss~~t~~~~C~~Cg 246 (298)
T KOG2879|consen 219 KSWKLDLDRAPKFSSSTGTSDTECPVCG 246 (298)
T ss_pred HhhcccccCCCCcccccccCCceeeccC
Confidence 4677776667777788888889999999
No 101
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=62.22 E-value=2 Score=33.92 Aligned_cols=16 Identities=25% Similarity=0.567 Sum_probs=9.3
Q ss_pred ccccccccccCChHHH
Q psy8050 200 YRCRLCKKAFSDSSTL 215 (311)
Q Consensus 200 ~~C~~C~~~f~~~~~l 215 (311)
+.|+.||+.|.+...+
T Consensus 29 ~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLACGKRFTTFERV 44 (154)
T ss_pred eeccccCCcceEeEec
Confidence 5666666666655443
No 102
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=60.86 E-value=14 Score=23.95 Aligned_cols=10 Identities=40% Similarity=1.288 Sum_probs=5.8
Q ss_pred CceecCCCCc
Q psy8050 170 KPFRCPICDR 179 (311)
Q Consensus 170 ~~~~C~~C~~ 179 (311)
.+|.|+.||.
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 3466666654
No 103
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=60.50 E-value=8 Score=30.77 Aligned_cols=32 Identities=22% Similarity=0.466 Sum_probs=18.6
Q ss_pred CCCceecCCCCccccCcchhhhhhhhccccccccccccccc
Q psy8050 168 GEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKA 208 (311)
Q Consensus 168 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~ 208 (311)
+..-|.|+.|+..|.....+. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 344566777776666555543 24666666654
No 104
>KOG4752|consensus
Probab=60.11 E-value=8.1 Score=19.63 Aligned_cols=11 Identities=45% Similarity=0.751 Sum_probs=4.8
Q ss_pred HHHHHHHhhhh
Q psy8050 10 KKKKKKKKRKK 20 (311)
Q Consensus 10 ~~~~~~~~~~~ 20 (311)
+|+||+.||++
T Consensus 7 kkrmrrlkrkr 17 (26)
T KOG4752|consen 7 KKRMRRLKRKR 17 (26)
T ss_pred HHHHHHHHHHH
Confidence 34444444443
No 105
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=59.49 E-value=7 Score=39.02 Aligned_cols=12 Identities=25% Similarity=0.587 Sum_probs=7.1
Q ss_pred cccccccccccc
Q psy8050 197 ERPYRCRLCKKA 208 (311)
Q Consensus 197 ~k~~~C~~C~~~ 208 (311)
..|..|+.||-.
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 455666666644
No 106
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=58.18 E-value=7.1 Score=29.54 Aligned_cols=13 Identities=15% Similarity=0.189 Sum_probs=6.9
Q ss_pred CCeecCccccccC
Q psy8050 226 KPYQCKLCLLRFS 238 (311)
Q Consensus 226 ~~~~C~~C~~~f~ 238 (311)
.|..|+.||..|.
T Consensus 25 ~p~vcP~cg~~~~ 37 (129)
T TIGR02300 25 RPAVSPYTGEQFP 37 (129)
T ss_pred CCccCCCcCCccC
Confidence 4455555555554
No 107
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=58.10 E-value=9.4 Score=23.69 Aligned_cols=21 Identities=48% Similarity=0.893 Sum_probs=11.8
Q ss_pred eecCccccccCCh-----HHHHHHHH
Q psy8050 228 YQCKLCLLRFSQS-----GNLNRHMR 248 (311)
Q Consensus 228 ~~C~~C~~~f~~~-----~~L~~H~~ 248 (311)
-.|..|++.+... +.|.+|++
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 3466666555433 46666665
No 108
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=57.88 E-value=9.3 Score=20.22 Aligned_cols=17 Identities=18% Similarity=0.554 Sum_probs=7.2
Q ss_pred cCcCccccCChhHHHHHH
Q psy8050 146 CEDCNKSFSQAANLTAHV 163 (311)
Q Consensus 146 C~~C~~~f~~~~~L~~H~ 163 (311)
|++|++.+ ....+..|+
T Consensus 4 CPiC~~~v-~~~~in~HL 20 (26)
T smart00734 4 CPVCFREV-PENLINSHL 20 (26)
T ss_pred CCCCcCcc-cHHHHHHHH
Confidence 44444444 233444443
No 109
>PF08188 Protamine_3: Spermatozal protamine family; InterPro: IPR012601 This entry consists of the spermatozal protamines. Spermatozal protamines play an important role in remodelling of the sperm chromatin during mammalian spermiogenesis. Nuclear elongation and chromatin condensation are concomitant with modifications in the basic protein complement associated with DNA. Somatic histones are initially replaced by testis-specific histone variants, then by transitional proteins, and ultimately by protamines [].; GO: 0003677 DNA binding, 0035092 sperm chromatin condensation, 0000228 nuclear chromosome
Probab=57.41 E-value=9.7 Score=22.17 Aligned_cols=23 Identities=43% Similarity=0.723 Sum_probs=12.3
Q ss_pred hhhHHHHHHHHHHHHhhhhcCcc
Q psy8050 2 KKKKKKKKKKKKKKKKRKKKGIL 24 (311)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~ 24 (311)
|+|.|+-+.+|-|+--|++|.++
T Consensus 8 kkkrksvrrrktrknqrkrknsl 30 (48)
T PF08188_consen 8 KKKRKSVRRRKTRKNQRKRKNSL 30 (48)
T ss_pred HHHHHHHHHHHHHhhHHHhhhhh
Confidence 34455555555555556555443
No 110
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=57.21 E-value=14 Score=29.01 Aligned_cols=39 Identities=8% Similarity=0.087 Sum_probs=26.5
Q ss_pred CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccC
Q psy8050 112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFS 154 (311)
Q Consensus 112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 154 (311)
....+.|+.|+..|+....+.. .. ....|.|+.||....
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEELE 134 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEEE
Confidence 3557899999999886544332 01 134499999998764
No 111
>KOG2785|consensus
Probab=55.91 E-value=14 Score=33.48 Aligned_cols=49 Identities=22% Similarity=0.269 Sum_probs=37.3
Q ss_pred eeccCCCCcCCChhHHHHHHhhhCCCC-----------------------CeecCcCc---cccCChhHHHHHHH
Q psy8050 116 LIQFIPGKTYARPSTLKTHLRTHSGEK-----------------------PYRCEDCN---KSFSQAANLTAHVR 164 (311)
Q Consensus 116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~-----------------------~~~C~~C~---~~f~~~~~L~~H~~ 164 (311)
..|..|+..+.+-..-..||..+++-. -|.|-.|+ ..|.+....+.||.
T Consensus 167 t~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 167 TDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred cceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 789999999999999999998877621 15566666 66666677777764
No 112
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=54.05 E-value=10 Score=30.78 Aligned_cols=30 Identities=30% Similarity=0.718 Sum_probs=16.6
Q ss_pred CceecCCCCccccCcchhhhhhhhccccccccccccccc
Q psy8050 170 KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKA 208 (311)
Q Consensus 170 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~ 208 (311)
.-|.|+.|+..|.....+. ..|.|+.||..
T Consensus 116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~ 145 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEM 145 (178)
T ss_pred CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCC
Confidence 4466666666665554432 24666666654
No 113
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=53.88 E-value=7.9 Score=23.12 Aligned_cols=11 Identities=36% Similarity=1.147 Sum_probs=4.7
Q ss_pred eecCCCCcccc
Q psy8050 172 FRCPICDRRFS 182 (311)
Q Consensus 172 ~~C~~C~~~f~ 182 (311)
|.|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 34444444443
No 114
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=53.65 E-value=16 Score=21.79 Aligned_cols=22 Identities=36% Similarity=0.652 Sum_probs=10.3
Q ss_pred CCeecCccccccCC----hHHHHHHH
Q psy8050 226 KPYQCKLCLLRFSQ----SGNLNRHM 247 (311)
Q Consensus 226 ~~~~C~~C~~~f~~----~~~L~~H~ 247 (311)
....|..|++.+.. .+.|..|+
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 33556666665544 25566665
No 115
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=53.64 E-value=6.1 Score=34.58 Aligned_cols=23 Identities=35% Similarity=0.728 Sum_probs=11.6
Q ss_pred eecCccccccCChHHHHHHHHHh
Q psy8050 228 YQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 228 ~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
|.|+.|...|-..-+.-.|...|
T Consensus 389 Y~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 389 YQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred eechhhhhhhhhhhHHHHHHHHh
Confidence 55555555555544444444444
No 116
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=53.52 E-value=10 Score=37.96 Aligned_cols=40 Identities=30% Similarity=0.658 Sum_probs=27.3
Q ss_pred eecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccc
Q psy8050 172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLR 236 (311)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 236 (311)
+.|+.|+-. |. .|.......|.+||.. ...|..|+.||..
T Consensus 445 ~~Cp~Cd~~------lt----~H~~~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 445 AECPNCDSP------LT----LHKATGQLRCHYCGYQ---------------EPIPQSCPECGSE 484 (730)
T ss_pred ccCCCCCcc------eE----EecCCCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence 358888754 22 3334466889999863 3577899999876
No 117
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=52.90 E-value=7.8 Score=20.57 Aligned_cols=9 Identities=33% Similarity=0.726 Sum_probs=6.0
Q ss_pred CCCCCcccc
Q psy8050 296 STMCGHSFI 304 (311)
Q Consensus 296 C~~C~~~f~ 304 (311)
|.+||+.|.
T Consensus 17 Cp~CG~~F~ 25 (26)
T PF10571_consen 17 CPHCGYDFE 25 (26)
T ss_pred CCCCCCCCc
Confidence 677777664
No 118
>KOG2807|consensus
Probab=52.47 E-value=26 Score=31.10 Aligned_cols=25 Identities=24% Similarity=0.486 Sum_probs=19.9
Q ss_pred CCeecCccccccCChHHHHHHHHHh
Q psy8050 226 KPYQCKLCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L~~H~~~H 250 (311)
..|.|+.|...|-..-+...|...|
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhh
Confidence 4588888888888877777787776
No 119
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=52.38 E-value=6.2 Score=27.50 Aligned_cols=10 Identities=20% Similarity=0.384 Sum_probs=3.4
Q ss_pred cCCCCCCCCC
Q psy8050 35 DRSECPGSGM 44 (311)
Q Consensus 35 ~~~~C~~c~~ 44 (311)
..|.||.|+.
T Consensus 21 ~~F~CPfC~~ 30 (81)
T PF05129_consen 21 KVFDCPFCNH 30 (81)
T ss_dssp S----TTT--
T ss_pred ceEcCCcCCC
Confidence 4599999993
No 120
>KOG4377|consensus
Probab=52.00 E-value=11 Score=34.30 Aligned_cols=110 Identities=25% Similarity=0.522 Sum_probs=66.0
Q ss_pred CCeec--CcCccccCChhHHHHHHHHhcCC------------CceecC--CCCccccCcchhhhhhhhcccc-------c
Q psy8050 142 KPYRC--EDCNKSFSQAANLTAHVRTHSGE------------KPFRCP--ICDRRFSQSSSVTTHMRTHSGE-------R 198 (311)
Q Consensus 142 ~~~~C--~~C~~~f~~~~~L~~H~~~h~~~------------~~~~C~--~C~~~f~~~~~l~~H~~~h~~~-------k 198 (311)
..|.| +.|+..+..+..+.+|..+|-.. ..|.|. .|++ +.++...|-..|+.. .
T Consensus 270 Ehyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrt 346 (480)
T KOG4377|consen 270 EHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRT 346 (480)
T ss_pred hhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecc
Confidence 44777 56988888888999998877432 125563 4777 344555666655421 2
Q ss_pred cccccccc--cccCChHHHHHHhhhhcCCC------------------------CeecC--ccccccCChHHHHHHHHHh
Q psy8050 199 PYRCRLCK--KAFSDSSTLTKHLRIHSGEK------------------------PYQCK--LCLLRFSQSGNLNRHMRVH 250 (311)
Q Consensus 199 ~~~C~~C~--~~f~~~~~l~~H~~~h~~~~------------------------~~~C~--~C~~~f~~~~~L~~H~~~H 250 (311)
.|.|..|+ ..|.....-..|.+-+.++. .+.|. -|+..+.+.+.+..|.+.|
T Consensus 347 hfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkh 426 (480)
T KOG4377|consen 347 HFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKH 426 (480)
T ss_pred eeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhhhh
Confidence 36777766 55553333333333332211 14453 3788888888888888877
Q ss_pred CCCC
Q psy8050 251 GSQY 254 (311)
Q Consensus 251 ~~~~ 254 (311)
....
T Consensus 427 eRqe 430 (480)
T KOG4377|consen 427 ERQE 430 (480)
T ss_pred hhhh
Confidence 5443
No 121
>KOG4602|consensus
Probab=51.58 E-value=14 Score=31.43 Aligned_cols=10 Identities=30% Similarity=0.880 Sum_probs=6.4
Q ss_pred CeecCcCccc
Q psy8050 143 PYRCEDCNKS 152 (311)
Q Consensus 143 ~~~C~~C~~~ 152 (311)
.|.|++||.+
T Consensus 268 ~YVCPiCGAT 277 (318)
T KOG4602|consen 268 SYVCPICGAT 277 (318)
T ss_pred hhcCcccccc
Confidence 3667777665
No 122
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=51.49 E-value=8.1 Score=29.45 Aligned_cols=17 Identities=24% Similarity=0.647 Sum_probs=9.7
Q ss_pred CCCceecCCCCccccCc
Q psy8050 168 GEKPFRCPICDRRFSQS 184 (311)
Q Consensus 168 ~~~~~~C~~C~~~f~~~ 184 (311)
+...|+|.+|..+....
T Consensus 77 d~~lYeCnIC~etS~ee 93 (140)
T PF05290_consen 77 DPKLYECNICKETSAEE 93 (140)
T ss_pred CCCceeccCcccccchh
Confidence 34557777776654433
No 123
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=51.33 E-value=11 Score=24.23 Aligned_cols=33 Identities=27% Similarity=0.764 Sum_probs=14.2
Q ss_pred CeecCc--CccccCChhHHHHHHHHhcCCCceecCC
Q psy8050 143 PYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPI 176 (311)
Q Consensus 143 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~ 176 (311)
+..|+. |...+. ...|..|+...-...+..|..
T Consensus 9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS
T ss_pred EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCC
Confidence 345554 333233 345555555444444555555
No 124
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=49.45 E-value=10 Score=29.13 Aligned_cols=26 Identities=42% Similarity=0.554 Sum_probs=15.6
Q ss_pred cceeeccCCCCcCCChhHHHHHHhhhCCC
Q psy8050 113 RLILIQFIPGKTYARPSTLKTHLRTHSGE 141 (311)
Q Consensus 113 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 141 (311)
...++|.+||+.|.. |++|++.|+|-
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S
T ss_pred cCeeEEccCCcccch---HHHHHHHccCC
Confidence 346789999998865 48888888663
No 125
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=49.19 E-value=9.3 Score=24.29 Aligned_cols=12 Identities=17% Similarity=0.302 Sum_probs=6.2
Q ss_pred cccccccccccc
Q psy8050 264 VNQVCSNIVDQR 275 (311)
Q Consensus 264 ~C~~C~~~f~~~ 275 (311)
.|+.|++.|...
T Consensus 7 ~C~~Cg~~~~~~ 18 (54)
T PF14446_consen 7 KCPVCGKKFKDG 18 (54)
T ss_pred cChhhCCcccCC
Confidence 355555555443
No 126
>PRK04023 DNA polymerase II large subunit; Validated
Probab=48.69 E-value=18 Score=37.14 Aligned_cols=9 Identities=33% Similarity=1.025 Sum_probs=4.6
Q ss_pred eecCcCccc
Q psy8050 144 YRCEDCNKS 152 (311)
Q Consensus 144 ~~C~~C~~~ 152 (311)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 445555544
No 127
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=48.63 E-value=8.6 Score=21.20 Aligned_cols=10 Identities=20% Similarity=0.770 Sum_probs=5.0
Q ss_pred CCeecCcCcc
Q psy8050 142 KPYRCEDCNK 151 (311)
Q Consensus 142 ~~~~C~~C~~ 151 (311)
..|.|+.|+.
T Consensus 18 ~~~vCp~C~~ 27 (30)
T PF08274_consen 18 ELLVCPECGH 27 (30)
T ss_dssp SSEEETTTTE
T ss_pred CEEeCCcccc
Confidence 3455555543
No 128
>CHL00152 rpl32 ribosomal protein L32; Validated
Probab=48.51 E-value=10 Score=23.97 Aligned_cols=22 Identities=36% Similarity=0.469 Sum_probs=18.3
Q ss_pred hhhHHHHHHHHHHHHhhhhcCc
Q psy8050 2 KKKKKKKKKKKKKKKKRKKKGI 23 (311)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~ 23 (311)
++|+.|+-+++.|+.-|++|-.
T Consensus 4 PKKRtSksKk~~Rk~~Wk~Ka~ 25 (53)
T CHL00152 4 PKKRTSKSKKRIRKNIWKRKAY 25 (53)
T ss_pred CCCccchHHHHHHHHHHHHHHH
Confidence 4677788889999999999854
No 129
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=47.79 E-value=9.1 Score=22.08 Aligned_cols=11 Identities=18% Similarity=0.395 Sum_probs=6.1
Q ss_pred ccccccccccc
Q psy8050 264 VNQVCSNIVDQ 274 (311)
Q Consensus 264 ~C~~C~~~f~~ 274 (311)
.|+.||..|..
T Consensus 3 ~C~~Cg~~Yh~ 13 (36)
T PF05191_consen 3 ICPKCGRIYHI 13 (36)
T ss_dssp EETTTTEEEET
T ss_pred CcCCCCCcccc
Confidence 36666666553
No 130
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.76 E-value=3.6 Score=34.43 Aligned_cols=46 Identities=20% Similarity=0.421 Sum_probs=23.7
Q ss_pred CCeecCccccccCChHHHHHHHHHhCCCC-------CCCCcc-ccccccccccc
Q psy8050 226 KPYQCKLCLLRFSQSGNLNRHMRVHGSQY-------NMLPVI-YMGVNQVCSNI 271 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~-------~~~~~~-~~~~C~~C~~~ 271 (311)
+.+.|++|+-.|....-+..-.|+-.++. ++.|.+ -.++|++|.+.
T Consensus 18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA 71 (267)
T COG1655 18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA 71 (267)
T ss_pred ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence 34666666666654433333333333322 233433 33679999865
No 131
>KOG2807|consensus
Probab=47.32 E-value=21 Score=31.65 Aligned_cols=23 Identities=13% Similarity=0.346 Sum_probs=18.4
Q ss_pred ccccccccccccccchhhceeee
Q psy8050 264 VNQVCSNIVDQRRGIYVVEAVWT 286 (311)
Q Consensus 264 ~C~~C~~~f~~~~~l~~H~~~h~ 286 (311)
.|+.|...|-.-=+.-.|...|.
T Consensus 347 ~C~~Ck~~FCldCDv~iHesLh~ 369 (378)
T KOG2807|consen 347 RCESCKNVFCLDCDVFIHESLHN 369 (378)
T ss_pred EchhccceeeccchHHHHhhhhc
Confidence 68888888887777788877775
No 132
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=47.25 E-value=25 Score=21.95 Aligned_cols=9 Identities=33% Similarity=0.364 Sum_probs=7.4
Q ss_pred CCCCCCCCC
Q psy8050 37 SECPGSGMG 45 (311)
Q Consensus 37 ~~C~~c~~~ 45 (311)
|+|..||+-
T Consensus 2 y~C~~Cgyi 10 (50)
T cd00730 2 YECRICGYI 10 (50)
T ss_pred cCCCCCCeE
Confidence 899999954
No 133
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=47.07 E-value=14 Score=22.91 Aligned_cols=10 Identities=30% Similarity=1.218 Sum_probs=5.1
Q ss_pred eecCcCcccc
Q psy8050 144 YRCEDCNKSF 153 (311)
Q Consensus 144 ~~C~~C~~~f 153 (311)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 4555555554
No 134
>PF15269 zf-C2H2_7: Zinc-finger
Probab=47.00 E-value=24 Score=21.31 Aligned_cols=22 Identities=23% Similarity=0.102 Sum_probs=14.5
Q ss_pred eeeccCCCCcCCChhHHHHHHh
Q psy8050 115 ILIQFIPGKTYARPSTLKTHLR 136 (311)
Q Consensus 115 ~~~C~~C~~~f~~~~~L~~H~~ 136 (311)
.++|..|..+...++.|-.||.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred cceeecCCcccchHHHHHHHHH
Confidence 4566666666666666666664
No 135
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=44.89 E-value=20 Score=20.10 Aligned_cols=7 Identities=43% Similarity=1.407 Sum_probs=3.1
Q ss_pred eecCcCc
Q psy8050 144 YRCEDCN 150 (311)
Q Consensus 144 ~~C~~C~ 150 (311)
..|..||
T Consensus 18 irC~~CG 24 (32)
T PF03604_consen 18 IRCPECG 24 (32)
T ss_dssp SSBSSSS
T ss_pred EECCcCC
Confidence 3444444
No 136
>PRK14873 primosome assembly protein PriA; Provisional
Probab=44.81 E-value=16 Score=36.24 Aligned_cols=8 Identities=25% Similarity=0.953 Sum_probs=4.6
Q ss_pred ecCcCccc
Q psy8050 145 RCEDCNKS 152 (311)
Q Consensus 145 ~C~~C~~~ 152 (311)
.|..||..
T Consensus 385 ~C~~Cg~~ 392 (665)
T PRK14873 385 ACARCRTP 392 (665)
T ss_pred EhhhCcCe
Confidence 46666654
No 137
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=44.22 E-value=20 Score=25.20 Aligned_cols=14 Identities=21% Similarity=-0.026 Sum_probs=8.4
Q ss_pred ceeeccCCCCcCCC
Q psy8050 114 LILIQFIPGKTYAR 127 (311)
Q Consensus 114 ~~~~C~~C~~~f~~ 127 (311)
..++|..||..|.-
T Consensus 52 GIW~C~kCg~~fAG 65 (89)
T COG1997 52 GIWKCRKCGAKFAG 65 (89)
T ss_pred CeEEcCCCCCeecc
Confidence 35666666666643
No 138
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.96 E-value=18 Score=26.50 Aligned_cols=32 Identities=19% Similarity=0.199 Sum_probs=22.1
Q ss_pred CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccC
Q psy8050 112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFS 154 (311)
Q Consensus 112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 154 (311)
....-+|+.||+.|.. -...|..|+.||++|.
T Consensus 6 LGtKridPetg~KFYD-----------LNrdPiVsPytG~s~P 37 (129)
T COG4530 6 LGTKRIDPETGKKFYD-----------LNRDPIVSPYTGKSYP 37 (129)
T ss_pred ccccccCccccchhhc-----------cCCCccccCcccccch
Confidence 3445577888888743 2446788888888884
No 139
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=43.19 E-value=23 Score=21.80 Aligned_cols=13 Identities=23% Similarity=1.081 Sum_probs=7.2
Q ss_pred eecCCCCccccCc
Q psy8050 172 FRCPICDRRFSQS 184 (311)
Q Consensus 172 ~~C~~C~~~f~~~ 184 (311)
|+|..|+..+...
T Consensus 2 y~C~~CgyvYd~~ 14 (47)
T PF00301_consen 2 YQCPVCGYVYDPE 14 (47)
T ss_dssp EEETTTSBEEETT
T ss_pred cCCCCCCEEEcCC
Confidence 4566666555443
No 140
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=41.42 E-value=7.3 Score=26.46 Aligned_cols=17 Identities=24% Similarity=0.518 Sum_probs=8.5
Q ss_pred CceecC--CCCccccCcch
Q psy8050 170 KPFRCP--ICDRRFSQSSS 186 (311)
Q Consensus 170 ~~~~C~--~C~~~f~~~~~ 186 (311)
.-++|. .||.+|.....
T Consensus 26 ~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 26 RYHQCQNVNCSATFITYES 44 (72)
T ss_pred eeeecCCCCCCCEEEEEEE
Confidence 335554 55555554443
No 141
>PF01783 Ribosomal_L32p: Ribosomal L32p protein family; InterPro: IPR002677 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L32p is part of the 50S ribosomal subunit. This family is found in both prokaryotes and eukaryotes. Ribosomal protein L32 of yeast binds to and regulates the splicing and the translation of the transcript of its own gene [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit; PDB: 3PYT_2 3F1F_5 3PYV_2 3D5B_5 3MRZ_2 3D5D_5 3F1H_5 1VSP_Y 3PYR_2 3MS1_2 ....
Probab=41.37 E-value=9 Score=24.56 Aligned_cols=31 Identities=29% Similarity=0.563 Sum_probs=17.6
Q ss_pred hhhHHHHHHHHHHHHhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43 (311)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~ 43 (311)
+|+|.|+-.+++|+.-|+.+-. +...|+.||
T Consensus 3 PKrk~Sksr~~~Rrs~~~l~~~-----------~l~~c~~cg 33 (56)
T PF01783_consen 3 PKRKTSKSRKRMRRSHWKLKAP-----------NLVKCPNCG 33 (56)
T ss_dssp -SS-SCHHHHHHHTTTTS--TT-----------SEEESSSSS
T ss_pred CCCcCChhHccchhcccccccc-----------ceeeeccCC
Confidence 3566677777777766642221 347788888
No 142
>KOG2593|consensus
Probab=39.87 E-value=36 Score=31.45 Aligned_cols=39 Identities=21% Similarity=0.534 Sum_probs=24.5
Q ss_pred CCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCcc
Q psy8050 139 SGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRR 180 (311)
Q Consensus 139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 180 (311)
+....|.|+.|.+.|.....+ +..-...-.|.|..|+-.
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~---~L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEAL---QLLDNETGEFHCENCGGE 162 (436)
T ss_pred cccccccCCccccchhhhHHH---HhhcccCceEEEecCCCc
Confidence 444568888888888654443 333323345888888754
No 143
>PRK04023 DNA polymerase II large subunit; Validated
Probab=37.81 E-value=30 Score=35.63 Aligned_cols=9 Identities=33% Similarity=0.785 Sum_probs=5.5
Q ss_pred eecCcCccc
Q psy8050 144 YRCEDCNKS 152 (311)
Q Consensus 144 ~~C~~C~~~ 152 (311)
+.|+.||..
T Consensus 639 frCP~CG~~ 647 (1121)
T PRK04023 639 RRCPFCGTH 647 (1121)
T ss_pred ccCCCCCCC
Confidence 566666654
No 144
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.73 E-value=25 Score=33.71 Aligned_cols=10 Identities=30% Similarity=0.747 Sum_probs=5.9
Q ss_pred cccccccccc
Q psy8050 198 RPYRCRLCKK 207 (311)
Q Consensus 198 k~~~C~~C~~ 207 (311)
.+..|+.|+.
T Consensus 252 ~~~~Cp~C~s 261 (505)
T TIGR00595 252 IPKTCPQCGS 261 (505)
T ss_pred CCCCCCCCCC
Confidence 3456666664
No 145
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=37.57 E-value=23 Score=27.27 Aligned_cols=25 Identities=32% Similarity=0.537 Sum_probs=14.6
Q ss_pred CeecCccccccCChHHHHHHHHHhCCCC
Q psy8050 227 PYQCKLCLLRFSQSGNLNRHMRVHGSQY 254 (311)
Q Consensus 227 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 254 (311)
-..|-+||+.|.. |.+|++.|+|-.
T Consensus 72 ~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 72 YIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp -EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eeEEccCCcccch---HHHHHHHccCCC
Confidence 3678888888875 488888886643
No 146
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=37.45 E-value=46 Score=29.37 Aligned_cols=79 Identities=20% Similarity=0.459 Sum_probs=46.3
Q ss_pred CCeecCcCccccCChhHHHHHHHHhcC----------C--CceecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050 142 KPYRCEDCNKSFSQAANLTAHVRTHSG----------E--KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF 209 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~----------~--~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f 209 (311)
.|..|+.|.-.......|.+-...--. + +.--|-.|.-.|.....-..- .-+....|.|+.|...|
T Consensus 321 LPi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~F 398 (421)
T COG5151 321 LPISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCKSTF 398 (421)
T ss_pred CCccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCCCCCccc--ccccccceechhhhhhh
Confidence 467788888776555555443221111 1 112366676666544221111 11234569999999999
Q ss_pred CChHHHHHHhhhh
Q psy8050 210 SDSSTLTKHLRIH 222 (311)
Q Consensus 210 ~~~~~l~~H~~~h 222 (311)
-..-..-.|...|
T Consensus 399 C~dCdvfiHe~Lh 411 (421)
T COG5151 399 CSDCDVFIHETLH 411 (421)
T ss_pred hhhhHHHHHHHHh
Confidence 9988888887766
No 147
>KOG2071|consensus
Probab=36.80 E-value=31 Score=33.26 Aligned_cols=27 Identities=22% Similarity=0.689 Sum_probs=21.4
Q ss_pred CCCeecCcCccccCChhHHHHHHHHhc
Q psy8050 141 EKPYRCEDCNKSFSQAANLTAHVRTHS 167 (311)
Q Consensus 141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~ 167 (311)
..+-.|..||..|........||..|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 456789999999998888877777663
No 148
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=36.61 E-value=38 Score=20.08 Aligned_cols=24 Identities=21% Similarity=0.456 Sum_probs=14.9
Q ss_pred eecCccccccC--ChHHHHHHHHHhC
Q psy8050 228 YQCKLCLLRFS--QSGNLNRHMRVHG 251 (311)
Q Consensus 228 ~~C~~C~~~f~--~~~~L~~H~~~H~ 251 (311)
-.|+.||..|. ...+-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 46777776665 3555666766653
No 149
>KOG4167|consensus
Probab=33.88 E-value=11 Score=36.94 Aligned_cols=27 Identities=30% Similarity=0.664 Sum_probs=23.9
Q ss_pred CCCeecCcCccccCChhHHHHHHHHhc
Q psy8050 141 EKPYRCEDCNKSFSQAANLTAHVRTHS 167 (311)
Q Consensus 141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~ 167 (311)
...|.|..|++.|....++..||..|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 356999999999999999999998884
No 150
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=32.45 E-value=29 Score=26.80 Aligned_cols=11 Identities=27% Similarity=1.075 Sum_probs=5.1
Q ss_pred eecCcCccccC
Q psy8050 144 YRCEDCNKSFS 154 (311)
Q Consensus 144 ~~C~~C~~~f~ 154 (311)
+.|..||..|.
T Consensus 71 ~~C~~CG~~~~ 81 (135)
T PRK03824 71 LKCRNCGNEWS 81 (135)
T ss_pred EECCCCCCEEe
Confidence 44444544443
No 151
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=31.12 E-value=64 Score=24.67 Aligned_cols=17 Identities=24% Similarity=0.464 Sum_probs=12.1
Q ss_pred ccCCCCCCCCccccccC
Q psy8050 291 WIDGRSTMCGHSFIVKL 307 (311)
Q Consensus 291 ~~~~kC~~C~~~f~~~~ 307 (311)
++.|+|..|++.|....
T Consensus 51 ~qRyrC~~C~~tf~~~~ 67 (129)
T COG3677 51 HQRYKCKSCGSTFTVET 67 (129)
T ss_pred ccccccCCcCcceeeec
Confidence 44557888888887643
No 152
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=30.52 E-value=56 Score=25.05 Aligned_cols=25 Identities=40% Similarity=0.581 Sum_probs=16.8
Q ss_pred cceeeccCCCCcCCChhHHHHHHhhhCC
Q psy8050 113 RLILIQFIPGKTYARPSTLKTHLRTHSG 140 (311)
Q Consensus 113 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 140 (311)
...++|.++|+.|. +|++|+.+|.+
T Consensus 74 pD~IicLEDGkkfK---SLKRHL~t~~g 98 (148)
T COG4957 74 PDYIICLEDGKKFK---SLKRHLTTHYG 98 (148)
T ss_pred CCeEEEeccCcchH---HHHHHHhcccC
Confidence 34567777777774 56777777665
No 153
>KOG4377|consensus
Probab=30.16 E-value=37 Score=31.11 Aligned_cols=100 Identities=20% Similarity=0.376 Sum_probs=62.2
Q ss_pred cCCCCcCCChhHHHHHHhhhCCC------------CCeec--CcCccccCChhHHHHHHHHhcCC-------CceecCCC
Q psy8050 119 FIPGKTYARPSTLKTHLRTHSGE------------KPYRC--EDCNKSFSQAANLTAHVRTHSGE-------KPFRCPIC 177 (311)
Q Consensus 119 ~~C~~~f~~~~~L~~H~~~h~~~------------~~~~C--~~C~~~f~~~~~L~~H~~~h~~~-------~~~~C~~C 177 (311)
..|+..+.++..+..|..+|-.. ..|.| ..|.+ +-++...|-..|+.. ..|.|.-|
T Consensus 277 e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~GfrrthfhC~r~ 353 (480)
T KOG4377|consen 277 EYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHFHCQRI 353 (480)
T ss_pred ccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecceeEEecc
Confidence 45888888888999998877432 22567 46877 455666666655432 23778777
Q ss_pred CccccCcchhhhhhhhcc----cc------------------------ccccccc--cccccCChHHHHHHhhhhc
Q psy8050 178 DRRFSQSSSVTTHMRTHS----GE------------------------RPYRCRL--CKKAFSDSSTLTKHLRIHS 223 (311)
Q Consensus 178 ~~~f~~~~~l~~H~~~h~----~~------------------------k~~~C~~--C~~~f~~~~~l~~H~~~h~ 223 (311)
+-++..+ ...|...|. ++ ..|-|.. |+..+.+.+.+..|.+.|.
T Consensus 354 gCTdtfK--~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 354 GCTDTFK--DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKHE 427 (480)
T ss_pred CCccccc--cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhhhhh
Confidence 7333333 334444432 11 1133433 8888999999998888774
No 154
>KOG0782|consensus
Probab=29.63 E-value=13 Score=35.44 Aligned_cols=53 Identities=17% Similarity=0.253 Sum_probs=29.1
Q ss_pred hHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccc
Q psy8050 212 SSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRR 276 (311)
Q Consensus 212 ~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~ 276 (311)
.+.|.+|-..|.....=+|..||+.|..+-.+ |..+. ..+.|.+|...|..+.
T Consensus 238 E~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~F------hsKEi------vAisCSWCKqayH~Kv 290 (1004)
T KOG0782|consen 238 ESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFF------HSKEI------VAISCSWCKQAYHLKV 290 (1004)
T ss_pred cccchHHhHhhHhhhccccchhhhhhhhheee------ccccE------EEEEehHHHHHhhcch
Confidence 34566666555443334677777777654332 33321 2235777777776665
No 155
>PHA00626 hypothetical protein
Probab=29.31 E-value=28 Score=22.21 Aligned_cols=15 Identities=27% Similarity=0.399 Sum_probs=12.1
Q ss_pred CCeecCccccccCCh
Q psy8050 226 KPYQCKLCLLRFSQS 240 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~ 240 (311)
..|.|+.||..|+..
T Consensus 22 nrYkCkdCGY~ft~~ 36 (59)
T PHA00626 22 DDYVCCDCGYNDSKD 36 (59)
T ss_pred cceEcCCCCCeechh
Confidence 459999999999743
No 156
>PF14353 CpXC: CpXC protein
Probab=29.17 E-value=16 Score=27.75 Aligned_cols=17 Identities=24% Similarity=0.696 Sum_probs=10.8
Q ss_pred CeecCcCccccCChhHH
Q psy8050 143 PYRCEDCNKSFSQAANL 159 (311)
Q Consensus 143 ~~~C~~C~~~f~~~~~L 159 (311)
.|.|+.||..|.-...+
T Consensus 38 ~~~CP~Cg~~~~~~~p~ 54 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPL 54 (128)
T ss_pred EEECCCCCCceecCCCE
Confidence 47777777777554443
No 157
>KOG0847|consensus
Probab=29.16 E-value=22 Score=29.54 Aligned_cols=20 Identities=30% Similarity=0.391 Sum_probs=15.1
Q ss_pred HHHHhhhhcCccccCcchhh
Q psy8050 13 KKKKKRKKKGILANSSSKEK 32 (311)
Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~ 32 (311)
-||.||||||-.+..++..+
T Consensus 217 NRRTKWRKkhAaEmasakkk 236 (288)
T KOG0847|consen 217 NRRTKWRKKHAAEMASAKKK 236 (288)
T ss_pred cchhhhhhhhccchhhcccc
Confidence 48899999998766665444
No 158
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=29.01 E-value=26 Score=18.24 Aligned_cols=10 Identities=30% Similarity=0.630 Sum_probs=8.6
Q ss_pred ccCCCCCCCC
Q psy8050 34 QDRSECPGSG 43 (311)
Q Consensus 34 ~~~~~C~~c~ 43 (311)
..+|.||.||
T Consensus 14 ~v~f~CPnCG 23 (24)
T PF07754_consen 14 AVPFPCPNCG 23 (24)
T ss_pred CceEeCCCCC
Confidence 5679999998
No 159
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=28.97 E-value=29 Score=29.07 Aligned_cols=25 Identities=32% Similarity=0.651 Sum_probs=12.2
Q ss_pred cccccccccccCChHHHHHHhhhhc
Q psy8050 199 PYRCRLCKKAFSDSSTLTKHLRIHS 223 (311)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~h~ 223 (311)
.|.|.+|++.|....-...|+..-|
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred EECCCCCCcccCChHHHHHHHhhcC
Confidence 3555555555555555555555433
No 160
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=28.86 E-value=29 Score=26.48 Aligned_cols=14 Identities=21% Similarity=0.821 Sum_probs=7.7
Q ss_pred eecCcCccccCChh
Q psy8050 144 YRCEDCNKSFSQAA 157 (311)
Q Consensus 144 ~~C~~C~~~f~~~~ 157 (311)
++|..||+.|..-+
T Consensus 2 H~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 2 HQCTKCGRVFEDGS 15 (131)
T ss_pred cccCcCCCCcCCCc
Confidence 45666666665443
No 161
>PF14122 YokU: YokU-like protein
Probab=28.83 E-value=52 Score=23.07 Aligned_cols=21 Identities=14% Similarity=0.039 Sum_probs=11.4
Q ss_pred CCcceeeccCCCCcCCChhHH
Q psy8050 111 GIRLILIQFIPGKTYARPSTL 131 (311)
Q Consensus 111 ~~~~~~~C~~C~~~f~~~~~L 131 (311)
...+...|..||.+|-....+
T Consensus 31 ~~tP~i~C~~CgmvYq~d~vi 51 (87)
T PF14122_consen 31 TDTPAIICSNCGMVYQDDEVI 51 (87)
T ss_pred cCCceeeecCCCcEEehhHHH
Confidence 334455666666666554443
No 162
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=28.61 E-value=20 Score=24.16 Aligned_cols=9 Identities=33% Similarity=0.685 Sum_probs=3.6
Q ss_pred ecCcCcccc
Q psy8050 145 RCEDCNKSF 153 (311)
Q Consensus 145 ~C~~C~~~f 153 (311)
+++-|++.|
T Consensus 12 ~~~cC~~~y 20 (71)
T PF05495_consen 12 RFPCCGKYY 20 (71)
T ss_dssp EETTTTEEE
T ss_pred ECCcccCee
Confidence 334444433
No 163
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=28.18 E-value=53 Score=28.04 Aligned_cols=12 Identities=17% Similarity=0.991 Sum_probs=7.8
Q ss_pred CCeecCcCcccc
Q psy8050 142 KPYRCEDCNKSF 153 (311)
Q Consensus 142 ~~~~C~~C~~~f 153 (311)
..|.|..|+..+
T Consensus 111 rqFaC~~Cd~~W 122 (278)
T PF15135_consen 111 RQFACSSCDHMW 122 (278)
T ss_pred eeeeccccchHH
Confidence 557777776543
No 164
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=27.76 E-value=37 Score=24.05 Aligned_cols=13 Identities=38% Similarity=0.904 Sum_probs=6.1
Q ss_pred ceecCCCCccccC
Q psy8050 171 PFRCPICDRRFSQ 183 (311)
Q Consensus 171 ~~~C~~C~~~f~~ 183 (311)
|-.|..||+.|..
T Consensus 58 Pa~CkkCGfef~~ 70 (97)
T COG3357 58 PARCKKCGFEFRD 70 (97)
T ss_pred ChhhcccCccccc
Confidence 3445555555443
No 165
>KOG2593|consensus
Probab=27.56 E-value=39 Score=31.27 Aligned_cols=36 Identities=17% Similarity=0.279 Sum_probs=19.6
Q ss_pred ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccc
Q psy8050 114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKS 152 (311)
Q Consensus 114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 152 (311)
..|.|+.|++.|+....++. .-.....|.|..|+-.
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE 162 (436)
T ss_pred ccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence 44566666666666554442 2222345777777643
No 166
>KOG1280|consensus
Probab=27.27 E-value=52 Score=29.52 Aligned_cols=36 Identities=19% Similarity=0.422 Sum_probs=19.9
Q ss_pred cccccccccccCChHHHHHHhhhhcCCCC--eecCccc
Q psy8050 199 PYRCRLCKKAFSDSSTLTKHLRIHSGEKP--YQCKLCL 234 (311)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~ 234 (311)
.|.|++|+..-.+...|..|....+.+-+ ..|++|+
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 46666666666666666666555444433 2355554
No 167
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=27.19 E-value=29 Score=24.64 Aligned_cols=10 Identities=50% Similarity=1.238 Sum_probs=4.6
Q ss_pred cccccccccc
Q psy8050 200 YRCRLCKKAF 209 (311)
Q Consensus 200 ~~C~~C~~~f 209 (311)
|.|..|++.|
T Consensus 55 W~C~~C~~~~ 64 (90)
T PTZ00255 55 WRCKGCKKTV 64 (90)
T ss_pred EEcCCCCCEE
Confidence 4444444444
No 168
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=27.15 E-value=23 Score=29.65 Aligned_cols=30 Identities=30% Similarity=0.710 Sum_probs=21.1
Q ss_pred CCceecCCCCccccCcchhhhhhhhccccc
Q psy8050 169 EKPFRCPICDRRFSQSSSVTTHMRTHSGER 198 (311)
Q Consensus 169 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k 198 (311)
+..|.|..|++.|.-.....+|+..-|.++
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 345888888888888888888888777664
No 169
>KOG2906|consensus
Probab=27.09 E-value=12 Score=26.83 Aligned_cols=13 Identities=0% Similarity=-0.123 Sum_probs=7.4
Q ss_pred CCCCCCCCCCCCC
Q psy8050 78 APYQVGAPQQFPT 90 (311)
Q Consensus 78 ~~~~C~~c~~~~~ 90 (311)
..|.|..|++.+.
T Consensus 20 ~rf~C~tCpY~~~ 32 (105)
T KOG2906|consen 20 NRFSCRTCPYVFP 32 (105)
T ss_pred eeEEcCCCCceee
Confidence 3456666665543
No 170
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=26.85 E-value=35 Score=22.52 Aligned_cols=9 Identities=11% Similarity=0.327 Sum_probs=3.3
Q ss_pred ccccccccc
Q psy8050 264 VNQVCSNIV 272 (311)
Q Consensus 264 ~C~~C~~~f 272 (311)
.|.+|+..|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 466777776
No 171
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=26.69 E-value=20 Score=29.45 Aligned_cols=11 Identities=18% Similarity=0.175 Sum_probs=5.9
Q ss_pred CCCCCCccccc
Q psy8050 295 RSTMCGHSFIV 305 (311)
Q Consensus 295 kC~~C~~~f~~ 305 (311)
.|..||+++..
T Consensus 45 ~C~~CgYR~~D 55 (201)
T COG1779 45 VCERCGYRSTD 55 (201)
T ss_pred EccccCCcccc
Confidence 45555555543
No 172
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=26.51 E-value=25 Score=25.01 Aligned_cols=10 Identities=40% Similarity=1.301 Sum_probs=5.3
Q ss_pred cccccccccc
Q psy8050 200 YRCRLCKKAF 209 (311)
Q Consensus 200 ~~C~~C~~~f 209 (311)
|.|..|++.|
T Consensus 54 W~C~~C~~~~ 63 (90)
T PF01780_consen 54 WKCKKCGKKF 63 (90)
T ss_dssp EEETTTTEEE
T ss_pred eecCCCCCEE
Confidence 5555555544
No 173
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=26.42 E-value=41 Score=21.40 Aligned_cols=9 Identities=11% Similarity=0.195 Sum_probs=4.1
Q ss_pred ccccccccc
Q psy8050 264 VNQVCSNIV 272 (311)
Q Consensus 264 ~C~~C~~~f 272 (311)
.|+.|+..+
T Consensus 4 ~CP~CG~~i 12 (54)
T TIGR01206 4 ECPDCGAEI 12 (54)
T ss_pred CCCCCCCEE
Confidence 345554443
No 174
>KOG4167|consensus
Probab=25.43 E-value=21 Score=35.19 Aligned_cols=31 Identities=19% Similarity=0.203 Sum_probs=26.9
Q ss_pred CCCcceeeccCCCCcCCChhHHHHHHhhhCC
Q psy8050 110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSG 140 (311)
Q Consensus 110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 140 (311)
.+....|-|.+|++.|....++..||+.|..
T Consensus 787 ~~~~giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 787 TDPTGIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred CCCCceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 3456789999999999999999999998854
No 175
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=25.18 E-value=15 Score=21.50 Aligned_cols=6 Identities=50% Similarity=1.619 Sum_probs=1.8
Q ss_pred cCCCCc
Q psy8050 174 CPICDR 179 (311)
Q Consensus 174 C~~C~~ 179 (311)
|+.||.
T Consensus 3 Cp~Cg~ 8 (39)
T PF01096_consen 3 CPKCGH 8 (39)
T ss_dssp -SSS-S
T ss_pred CcCCCC
Confidence 444443
No 176
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=25.02 E-value=45 Score=19.60 Aligned_cols=17 Identities=24% Similarity=0.493 Sum_probs=9.0
Q ss_pred eecCcCccccCChhHHH
Q psy8050 144 YRCEDCNKSFSQAANLT 160 (311)
Q Consensus 144 ~~C~~C~~~f~~~~~L~ 160 (311)
..|+.|+-.|.....|.
T Consensus 20 d~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGELE 36 (41)
T ss_pred EECCCCCeEEccHHHHH
Confidence 45555555555555443
No 177
>PF10058 DUF2296: Predicted integral membrane metal-binding protein (DUF2296); InterPro: IPR019273 This domain, found mainly in the eukaryotic lunapark proteins, has no known function [].
Probab=24.60 E-value=77 Score=20.12 Aligned_cols=9 Identities=44% Similarity=1.184 Sum_probs=5.5
Q ss_pred CeecCcCcc
Q psy8050 143 PYRCEDCNK 151 (311)
Q Consensus 143 ~~~C~~C~~ 151 (311)
.|.|..|+.
T Consensus 44 ~y~C~~Cg~ 52 (54)
T PF10058_consen 44 QYRCPYCGA 52 (54)
T ss_pred EEEcCCCCC
Confidence 466666653
No 178
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=24.54 E-value=31 Score=24.53 Aligned_cols=11 Identities=36% Similarity=1.081 Sum_probs=5.1
Q ss_pred ccccccccccC
Q psy8050 200 YRCRLCKKAFS 210 (311)
Q Consensus 200 ~~C~~C~~~f~ 210 (311)
|.|..|++.|.
T Consensus 54 W~C~~C~~~~A 64 (91)
T TIGR00280 54 WTCRKCGAKFA 64 (91)
T ss_pred EEcCCCCCEEe
Confidence 44444444443
No 179
>COG0333 RpmF Ribosomal protein L32 [Translation, ribosomal structure and biogenesis]
Probab=23.83 E-value=45 Score=21.49 Aligned_cols=30 Identities=27% Similarity=0.534 Sum_probs=18.5
Q ss_pred hhHHHHHHHHHHHHhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050 3 KKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43 (311)
Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~ 43 (311)
++|.||.+|.+|+.-++.+-. ..-.|+.||
T Consensus 5 krktSksrr~~RRsh~~l~~~-----------~~~~c~~cG 34 (57)
T COG0333 5 KRKTSKSRRRMRRSHDALKAP-----------TLSVCPNCG 34 (57)
T ss_pred cccCChhhhhhhhhhHhhhCc-----------cceeccCCC
Confidence 456677777777766554322 136788888
No 180
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=23.09 E-value=53 Score=31.52 Aligned_cols=30 Identities=27% Similarity=0.484 Sum_probs=20.5
Q ss_pred hhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccc
Q psy8050 192 RTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLR 236 (311)
Q Consensus 192 ~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 236 (311)
..|.......|..||... ..|..|+.|+..
T Consensus 233 ~~h~~~~~l~Ch~Cg~~~---------------~~~~~Cp~C~s~ 262 (505)
T TIGR00595 233 TYHKKEGKLRCHYCGYQE---------------PIPKTCPQCGSE 262 (505)
T ss_pred EEecCCCeEEcCCCcCcC---------------CCCCCCCCCCCC
Confidence 344445667899998644 356789999864
No 181
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=22.85 E-value=65 Score=20.09 Aligned_cols=11 Identities=18% Similarity=0.679 Sum_probs=5.8
Q ss_pred CeecCcCcccc
Q psy8050 143 PYRCEDCNKSF 153 (311)
Q Consensus 143 ~~~C~~C~~~f 153 (311)
.+.|..||.++
T Consensus 37 r~~C~~Cgyt~ 47 (50)
T PRK00432 37 RWHCGKCGYTE 47 (50)
T ss_pred cEECCCcCCEE
Confidence 35566565543
No 182
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=22.75 E-value=1e+02 Score=32.91 Aligned_cols=8 Identities=38% Similarity=1.099 Sum_probs=4.8
Q ss_pred eecCcCcc
Q psy8050 144 YRCEDCNK 151 (311)
Q Consensus 144 ~~C~~C~~ 151 (311)
+.|+.||.
T Consensus 668 rkCPkCG~ 675 (1337)
T PRK14714 668 RRCPSCGT 675 (1337)
T ss_pred EECCCCCC
Confidence 55666664
No 183
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=22.59 E-value=58 Score=28.95 Aligned_cols=12 Identities=25% Similarity=0.858 Sum_probs=5.7
Q ss_pred CCCceecCCCCc
Q psy8050 168 GEKPFRCPICDR 179 (311)
Q Consensus 168 ~~~~~~C~~C~~ 179 (311)
|.+-..|..|+.
T Consensus 207 G~RyL~CslC~t 218 (305)
T TIGR01562 207 GLRYLSCSLCAT 218 (305)
T ss_pred CceEEEcCCCCC
Confidence 334445555543
No 184
>PRK12286 rpmF 50S ribosomal protein L32; Reviewed
Probab=22.43 E-value=25 Score=22.65 Aligned_cols=9 Identities=44% Similarity=0.988 Sum_probs=7.1
Q ss_pred cCCCCCCCC
Q psy8050 35 DRSECPGSG 43 (311)
Q Consensus 35 ~~~~C~~c~ 43 (311)
....|+.||
T Consensus 26 ~l~~C~~CG 34 (57)
T PRK12286 26 GLVECPNCG 34 (57)
T ss_pred cceECCCCC
Confidence 347899998
No 185
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=22.08 E-value=54 Score=24.12 Aligned_cols=15 Identities=13% Similarity=0.479 Sum_probs=8.5
Q ss_pred CCeecCcCccccCCh
Q psy8050 142 KPYRCEDCNKSFSQA 156 (311)
Q Consensus 142 ~~~~C~~C~~~f~~~ 156 (311)
..|.|+.|+..+...
T Consensus 18 ~~~iCpeC~~EW~~~ 32 (109)
T TIGR00686 18 TQLICPSCLYEWNEN 32 (109)
T ss_pred CeeECcccccccccc
Confidence 346666666665433
No 186
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=22.01 E-value=56 Score=24.29 Aligned_cols=11 Identities=27% Similarity=0.646 Sum_probs=5.5
Q ss_pred eecCcCccccC
Q psy8050 144 YRCEDCNKSFS 154 (311)
Q Consensus 144 ~~C~~C~~~f~ 154 (311)
+.|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (113)
T PRK12380 71 AWCWDCSQVVE 81 (113)
T ss_pred EEcccCCCEEe
Confidence 45555554443
No 187
>KOG0782|consensus
Probab=21.99 E-value=30 Score=33.15 Aligned_cols=53 Identities=17% Similarity=0.248 Sum_probs=36.6
Q ss_pred hHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhceeeeecccccCCCCCCCCccccccC
Q psy8050 240 SGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTVQSTWIDGRSTMCGHSFIVKL 307 (311)
Q Consensus 240 ~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~kC~~C~~~f~~~~ 307 (311)
.+.|.+|--+|...... +|..|++.|.++..+. ..+..-+ -|+-|...|..+.
T Consensus 238 E~~fvrHHWVHrrRqeG-------kC~~CgKgFQQKf~Fh------sKEivAi--sCSWCKqayH~Kv 290 (1004)
T KOG0782|consen 238 ESGFVRHHWVHRRRQEG-------KCNTCGKGFQQKFFFH------SKEIVAI--SCSWCKQAYHLKV 290 (1004)
T ss_pred cccchHHhHhhHhhhcc-------ccchhhhhhhhheeec------cccEEEE--EehHHHHHhhcch
Confidence 44788887777655543 7999999999885554 3333322 3888988887765
No 188
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=21.79 E-value=30 Score=30.75 Aligned_cols=6 Identities=33% Similarity=1.315 Sum_probs=3.0
Q ss_pred cccccc
Q psy8050 201 RCRLCK 206 (311)
Q Consensus 201 ~C~~C~ 206 (311)
.|+.||
T Consensus 228 ~C~~Cg 233 (309)
T PRK03564 228 KCSNCE 233 (309)
T ss_pred cCCCCC
Confidence 455555
No 189
>PF12647 RNHCP: RNHCP domain; InterPro: IPR024439 This domain is found in uncharacterised bacterial proteins. It is typically between 94 and 143 amino acids in length and has a conserved RNHCP sequence motif.
Probab=21.76 E-value=3.1e+02 Score=19.62 Aligned_cols=9 Identities=22% Similarity=0.541 Sum_probs=7.7
Q ss_pred cCCCCCCCC
Q psy8050 35 DRSECPGSG 43 (311)
Q Consensus 35 ~~~~C~~c~ 43 (311)
+.|.|..||
T Consensus 3 ~~F~C~~CG 11 (92)
T PF12647_consen 3 ESFTCVHCG 11 (92)
T ss_pred cccCccccC
Confidence 459999999
No 190
>KOG3408|consensus
Probab=21.69 E-value=49 Score=24.83 Aligned_cols=26 Identities=27% Similarity=0.664 Sum_probs=20.1
Q ss_pred ccccccccccccccCChHHHHHHhhh
Q psy8050 196 GERPYRCRLCKKAFSDSSTLTKHLRI 221 (311)
Q Consensus 196 ~~k~~~C~~C~~~f~~~~~l~~H~~~ 221 (311)
|...|.|-.|.+-|.+...|..|.++
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 34457888888888888888888764
No 191
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=21.52 E-value=57 Score=24.35 Aligned_cols=11 Identities=45% Similarity=1.084 Sum_probs=5.8
Q ss_pred eecCcCccccC
Q psy8050 144 YRCEDCNKSFS 154 (311)
Q Consensus 144 ~~C~~C~~~f~ 154 (311)
+.|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (115)
T TIGR00100 71 CECEDCSEEVS 81 (115)
T ss_pred EEcccCCCEEe
Confidence 45555555543
No 192
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=21.52 E-value=40 Score=20.17 Aligned_cols=16 Identities=13% Similarity=0.165 Sum_probs=11.9
Q ss_pred eeccCCCCcCCChhHH
Q psy8050 116 LIQFIPGKTYARPSTL 131 (311)
Q Consensus 116 ~~C~~C~~~f~~~~~L 131 (311)
-+|..||+.|+.+..-
T Consensus 9 K~C~~C~rpf~WRKKW 24 (42)
T PF10013_consen 9 KICPVCGRPFTWRKKW 24 (42)
T ss_pred CcCcccCCcchHHHHH
Confidence 4688899988876544
No 193
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.45 E-value=30 Score=23.63 Aligned_cols=10 Identities=50% Similarity=1.135 Sum_probs=5.7
Q ss_pred eecCcccccc
Q psy8050 228 YQCKLCLLRF 237 (311)
Q Consensus 228 ~~C~~C~~~f 237 (311)
|.|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 5566665544
No 194
>KOG3014|consensus
Probab=21.42 E-value=42 Score=28.66 Aligned_cols=31 Identities=19% Similarity=0.244 Sum_probs=22.9
Q ss_pred cCCCCCCCCCCCCCCchhhHhhhccccccCCCCC
Q psy8050 35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQM 68 (311)
Q Consensus 35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~ 68 (311)
...+|..||+.+++.+-.+. +.|...|..-+
T Consensus 36 g~~~C~~Cgm~Yt~~s~EDe---~~H~~fH~~~l 66 (257)
T KOG3014|consen 36 GAVKCKECGMKYTVTSPEDE---ALHEKFHNRFL 66 (257)
T ss_pred CceehhhcCceecCCCHHHH---HHHHHHHHhhc
Confidence 45799999998877776666 56777777443
No 195
>TIGR03829 YokU_near_AblA uncharacterized protein, YokU family. Members of this protein family occur in various species of the genus Bacillus, always next to the gene (kamA or ablA) for lysine 2,3-aminomutase. Members have a pair of CXXC motifs, and share homology to the amino-terminal region of a family of putative transcription factors for which the C-terminal is modeled by pfam01381, a helix-turn-helix domain model. This family, however, is shorter and lacks the helix-turn-helix region. The function of this protein family is unknown, but a regulatory role in compatible solute biosynthesis is suggested by local genome context.
Probab=21.08 E-value=54 Score=23.26 Aligned_cols=16 Identities=19% Similarity=0.094 Sum_probs=7.5
Q ss_pred eeeccCCCCcCCChhH
Q psy8050 115 ILIQFIPGKTYARPST 130 (311)
Q Consensus 115 ~~~C~~C~~~f~~~~~ 130 (311)
...|..||..|-+...
T Consensus 35 a~~C~~CGe~y~~dev 50 (89)
T TIGR03829 35 SISCSHCGMEYQDDTT 50 (89)
T ss_pred cccccCCCcEeecHHH
Confidence 3445555555444433
No 196
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=20.97 E-value=3.7e+02 Score=23.20 Aligned_cols=41 Identities=20% Similarity=0.136 Sum_probs=22.8
Q ss_pred CCCCCCCCCCCCCCchhhHhhhcccccc--CCCCCCCCCCCCCCCCCC
Q psy8050 36 RSECPGSGMGKPPHDVADVLLSLKHAVV--HPGQMTPPPYGDPGAPYQ 81 (311)
Q Consensus 36 ~~~C~~c~~~~~~~~~~~~~~~~~H~~~--h~~~~~p~~~~~p~~~~~ 81 (311)
.-.|..|+ ....-.....+|.+. +-.+.+|--..+||+-|.
T Consensus 65 ~v~CrVCq-----~~I~i~gk~~QhVVkC~~CnEATPIr~aPpGKKYV 107 (256)
T PF09788_consen 65 VVTCRVCQ-----SLIDIEGKMHQHVVKCSVCNEATPIRNAPPGKKYV 107 (256)
T ss_pred eEEeecCC-----ceecccCccceeeEECCCCCccccccCCCCCCeeE
Confidence 35688887 322222333445442 345666666667778777
No 197
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=20.92 E-value=65 Score=25.23 Aligned_cols=11 Identities=18% Similarity=0.425 Sum_probs=7.2
Q ss_pred CeecCcccccc
Q psy8050 227 PYQCKLCLLRF 237 (311)
Q Consensus 227 ~~~C~~C~~~f 237 (311)
.|.|..||...
T Consensus 112 ~l~C~~Cg~~~ 122 (146)
T PF07295_consen 112 TLVCENCGHEV 122 (146)
T ss_pred eEecccCCCEE
Confidence 37777777554
No 198
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=20.57 E-value=56 Score=17.62 Aligned_cols=8 Identities=50% Similarity=1.531 Sum_probs=2.9
Q ss_pred CeecCcCc
Q psy8050 143 PYRCEDCN 150 (311)
Q Consensus 143 ~~~C~~C~ 150 (311)
.|.|..|+
T Consensus 15 ~Y~C~~Cd 22 (30)
T PF07649_consen 15 FYRCSECD 22 (30)
T ss_dssp EEE-TTT-
T ss_pred eEECccCC
Confidence 34555554
No 199
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=20.45 E-value=90 Score=19.42 Aligned_cols=18 Identities=28% Similarity=0.433 Sum_probs=10.3
Q ss_pred CCeecCccccccCChHHH
Q psy8050 226 KPYQCKLCLLRFSQSGNL 243 (311)
Q Consensus 226 ~~~~C~~C~~~f~~~~~L 243 (311)
+.+.|..||..|.....=
T Consensus 3 k~l~C~dCg~~FvfTa~E 20 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAGE 20 (49)
T ss_pred eeEEcccCCCeEEEehhH
Confidence 446666777666544433
No 200
>PRK05978 hypothetical protein; Provisional
Probab=20.02 E-value=58 Score=25.56 Aligned_cols=9 Identities=11% Similarity=0.268 Sum_probs=4.8
Q ss_pred ccccccccc
Q psy8050 265 NQVCSNIVD 273 (311)
Q Consensus 265 C~~C~~~f~ 273 (311)
|+.||..|.
T Consensus 55 C~~CG~~~~ 63 (148)
T PRK05978 55 CAACGEDFT 63 (148)
T ss_pred ccccCCccc
Confidence 555555543
Done!