Query psy10075
Match_columns 373
No_of_seqs 194 out of 2379
Neff 10.8
Searched_HMMs 46136
Date Fri Aug 16 17:18:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10075.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10075hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3608|consensus 99.9 4.7E-27 1E-31 193.6 11.8 226 112-348 135-382 (467)
2 KOG2462|consensus 99.9 2.1E-27 4.5E-32 190.9 5.7 140 196-341 127-266 (279)
3 KOG3608|consensus 99.9 2.8E-26 6.1E-31 189.0 11.9 257 25-290 69-381 (467)
4 KOG1074|consensus 99.9 4.6E-27 9.9E-32 213.9 2.7 200 140-342 604-931 (958)
5 KOG1074|consensus 99.9 2.1E-26 4.6E-31 209.5 0.6 48 83-131 354-401 (958)
6 KOG2462|consensus 99.9 3.1E-25 6.7E-30 178.5 4.6 126 228-373 128-253 (279)
7 KOG3623|consensus 99.9 2.3E-22 4.9E-27 180.6 5.4 107 23-131 208-329 (1007)
8 KOG3623|consensus 99.8 2.5E-22 5.3E-27 180.3 3.2 78 262-340 895-972 (1007)
9 KOG3576|consensus 99.6 7.1E-17 1.5E-21 123.6 2.4 114 229-346 116-240 (267)
10 KOG3576|consensus 99.6 1.2E-15 2.5E-20 117.0 2.4 131 16-167 108-238 (267)
11 PLN03086 PRLI-interacting fact 99.2 3.6E-11 7.8E-16 109.9 9.3 146 171-343 408-565 (567)
12 PHA00733 hypothetical protein 99.2 3.5E-11 7.7E-16 89.6 5.1 85 259-345 38-126 (128)
13 PLN03086 PRLI-interacting fact 99.1 6E-10 1.3E-14 102.1 8.6 147 141-315 407-565 (567)
14 PHA00733 hypothetical protein 99.0 2E-10 4.3E-15 85.6 4.4 85 228-315 38-124 (128)
15 KOG3993|consensus 98.8 2.5E-09 5.5E-14 91.8 3.6 53 291-343 431-483 (500)
16 PHA02768 hypothetical protein; 98.8 2.6E-09 5.5E-14 65.0 2.0 44 261-307 5-48 (55)
17 KOG3993|consensus 98.7 5E-09 1.1E-13 90.0 1.1 198 112-316 268-484 (500)
18 PHA02768 hypothetical protein; 98.7 1E-08 2.2E-13 62.4 1.5 44 290-335 5-48 (55)
19 PF05605 zf-Di19: Drought indu 98.4 2.6E-07 5.7E-12 57.7 3.5 49 291-342 3-53 (54)
20 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.6E-07 3.4E-12 48.7 1.7 25 276-301 1-25 (26)
21 PHA00732 hypothetical protein 98.2 9.1E-07 2E-11 59.6 3.1 44 261-310 1-44 (79)
22 PF05605 zf-Di19: Drought indu 98.2 1.4E-06 3.1E-11 54.4 3.6 50 262-314 3-53 (54)
23 KOG1146|consensus 98.2 1.2E-06 2.7E-11 86.2 4.7 101 6-106 444-613 (1406)
24 PHA00616 hypothetical protein 98.2 3.8E-07 8.2E-12 52.7 0.7 31 262-293 2-32 (44)
25 PHA00732 hypothetical protein 98.2 1.1E-06 2.4E-11 59.2 2.5 47 290-341 1-47 (79)
26 PHA00616 hypothetical protein 98.1 9.2E-07 2E-11 51.1 1.1 33 290-322 1-33 (44)
27 PF13465 zf-H2C2_2: Zinc-finge 98.0 1E-06 2.2E-11 45.6 0.2 17 76-92 8-24 (26)
28 PF12756 zf-C2H2_2: C2H2 type 97.9 7.8E-06 1.7E-10 58.9 2.1 73 263-341 1-73 (100)
29 PF13894 zf-C2H2_4: C2H2-type 97.8 1.6E-05 3.6E-10 40.3 1.8 24 319-342 1-24 (24)
30 PF00096 zf-C2H2: Zinc finger, 97.7 1.3E-05 2.8E-10 40.3 1.1 22 319-340 1-22 (23)
31 PF00096 zf-C2H2: Zinc finger, 97.6 4E-05 8.6E-10 38.4 1.7 20 262-281 1-20 (23)
32 COG5189 SFP1 Putative transcri 97.6 2.9E-05 6.3E-10 64.8 1.7 52 259-310 347-418 (423)
33 PF13894 zf-C2H2_4: C2H2-type 97.5 0.00011 2.3E-09 37.2 2.2 20 262-281 1-20 (24)
34 KOG1146|consensus 97.4 2.7E-05 5.9E-10 77.1 -0.4 75 56-130 439-537 (1406)
35 PF12756 zf-C2H2_2: C2H2 type 97.4 0.00013 2.8E-09 52.5 2.6 22 261-282 50-71 (100)
36 PF13912 zf-C2H2_6: C2H2-type 97.4 7.1E-05 1.5E-09 39.2 0.8 24 319-342 2-25 (27)
37 PF13912 zf-C2H2_6: C2H2-type 97.3 0.00014 3E-09 38.1 1.7 25 25-49 1-25 (27)
38 COG5189 SFP1 Putative transcri 97.3 0.00014 2.9E-09 60.9 2.3 24 258-281 395-418 (423)
39 PF13909 zf-H2C2_5: C2H2-type 97.1 0.00025 5.5E-09 35.9 1.4 24 319-343 1-24 (24)
40 KOG2482|consensus 96.8 0.0027 5.8E-08 54.0 5.7 185 155-341 129-357 (423)
41 COG5236 Uncharacterized conser 96.8 0.002 4.3E-08 54.8 4.4 132 200-345 152-308 (493)
42 PF09237 GAGA: GAGA factor; I 96.8 0.0009 1.9E-08 39.7 1.7 32 22-53 21-52 (54)
43 PF13909 zf-H2C2_5: C2H2-type 96.7 0.00081 1.8E-08 34.0 1.0 23 291-314 1-23 (24)
44 smart00355 ZnF_C2H2 zinc finge 96.5 0.0023 5.1E-08 32.7 2.3 23 319-341 1-23 (26)
45 PF12874 zf-met: Zinc-finger o 96.5 0.0012 2.5E-08 33.8 0.7 22 319-340 1-22 (25)
46 PF09237 GAGA: GAGA factor; I 96.4 0.0015 3.3E-08 38.7 1.2 22 260-281 23-44 (54)
47 COG5236 Uncharacterized conser 96.3 0.0038 8.2E-08 53.1 3.6 132 25-167 151-307 (493)
48 PF12874 zf-met: Zinc-finger o 96.2 0.0024 5.2E-08 32.5 1.0 22 26-47 1-22 (25)
49 KOG2482|consensus 96.2 0.0092 2E-07 50.9 4.9 149 183-343 128-304 (423)
50 smart00355 ZnF_C2H2 zinc finge 96.1 0.0046 9.9E-08 31.5 2.0 19 263-281 2-20 (26)
51 PF12171 zf-C2H2_jaz: Zinc-fin 96.0 0.0047 1E-07 32.1 1.7 22 26-47 2-23 (27)
52 KOG2231|consensus 95.9 0.022 4.7E-07 54.1 6.5 120 171-314 100-236 (669)
53 PRK04860 hypothetical protein; 95.8 0.0056 1.2E-07 47.6 2.1 39 261-304 119-157 (160)
54 PF12171 zf-C2H2_jaz: Zinc-fin 95.4 0.0095 2.1E-07 31.0 1.3 19 292-310 3-21 (27)
55 PRK04860 hypothetical protein; 94.7 0.014 2.9E-07 45.5 1.0 40 289-332 118-157 (160)
56 PF13913 zf-C2HC_2: zinc-finge 94.6 0.034 7.3E-07 28.2 2.0 19 320-339 4-22 (25)
57 KOG4173|consensus 94.2 0.012 2.5E-07 46.4 -0.3 83 199-287 79-172 (253)
58 PF12013 DUF3505: Protein of u 93.6 0.095 2.1E-06 38.2 3.6 25 319-343 81-109 (109)
59 KOG2785|consensus 93.5 0.14 2.9E-06 44.9 4.8 22 141-162 3-24 (390)
60 smart00451 ZnF_U1 U1-like zinc 93.4 0.054 1.2E-06 30.0 1.6 21 25-45 3-23 (35)
61 KOG2231|consensus 93.2 0.25 5.3E-06 47.3 6.4 19 202-220 185-203 (669)
62 cd00350 rubredoxin_like Rubred 93.2 0.069 1.5E-06 29.2 1.7 10 262-271 2-11 (33)
63 KOG2785|consensus 93.1 0.1 2.3E-06 45.6 3.6 142 199-341 3-243 (390)
64 KOG4173|consensus 92.8 0.039 8.4E-07 43.6 0.5 81 229-315 78-171 (253)
65 smart00451 ZnF_U1 U1-like zinc 92.5 0.11 2.3E-06 28.8 2.0 23 318-340 3-25 (35)
66 KOG2893|consensus 91.6 0.046 9.9E-07 44.2 -0.3 45 85-134 13-57 (341)
67 COG2888 Predicted Zn-ribbon RN 90.9 0.34 7.4E-06 30.0 3.0 11 229-239 26-36 (61)
68 PRK14890 putative Zn-ribbon RN 90.5 0.43 9.4E-06 29.7 3.2 11 229-239 24-34 (59)
69 PF12013 DUF3505: Protein of u 90.3 0.35 7.5E-06 35.2 3.3 56 259-315 9-109 (109)
70 COG5048 FOG: Zn-finger [Genera 88.9 0.24 5.2E-06 46.0 2.0 60 25-85 289-354 (467)
71 KOG2893|consensus 88.9 0.14 3.1E-06 41.5 0.3 42 264-310 13-54 (341)
72 COG4049 Uncharacterized protei 88.8 0.15 3.2E-06 30.9 0.3 24 319-342 18-41 (65)
73 COG4049 Uncharacterized protei 88.3 0.3 6.5E-06 29.6 1.3 30 195-224 13-42 (65)
74 TIGR00622 ssl1 transcription f 87.6 1.8 3.8E-05 31.3 5.1 47 293-341 58-104 (112)
75 PF09986 DUF2225: Uncharacteri 87.3 0.2 4.3E-06 41.4 0.3 22 260-281 4-25 (214)
76 cd00729 rubredoxin_SM Rubredox 87.2 0.47 1E-05 26.1 1.6 9 262-270 3-11 (34)
77 COG2888 Predicted Zn-ribbon RN 85.3 0.76 1.6E-05 28.5 2.0 14 198-211 26-39 (61)
78 PHA00626 hypothetical protein 85.2 0.41 9E-06 29.2 0.8 12 261-272 23-34 (59)
79 COG5048 FOG: Zn-finger [Genera 84.3 0.54 1.2E-05 43.6 1.6 150 169-322 288-450 (467)
80 smart00659 RPOLCX RNA polymera 83.8 0.93 2E-05 26.7 1.9 10 262-271 3-12 (44)
81 PF09986 DUF2225: Uncharacteri 83.8 0.44 9.5E-06 39.4 0.7 23 23-45 3-25 (214)
82 PF13719 zinc_ribbon_5: zinc-r 83.2 0.81 1.8E-05 25.7 1.4 11 141-151 25-35 (37)
83 PRK00464 nrdR transcriptional 83.1 0.38 8.3E-06 37.2 0.1 16 319-334 29-44 (154)
84 PF09538 FYDLN_acid: Protein o 82.8 0.71 1.5E-05 33.3 1.4 15 260-274 25-39 (108)
85 PF13717 zinc_ribbon_4: zinc-r 82.8 0.79 1.7E-05 25.6 1.2 11 141-151 25-35 (36)
86 PRK00398 rpoP DNA-directed RNA 81.9 1.1 2.5E-05 26.6 1.8 10 262-271 4-13 (46)
87 TIGR02098 MJ0042_CXXC MJ0042 f 81.8 0.87 1.9E-05 25.7 1.2 10 142-151 26-35 (38)
88 TIGR00622 ssl1 transcription f 80.6 2.8 6E-05 30.3 3.6 19 141-159 15-33 (112)
89 COG1592 Rubrerythrin [Energy p 80.4 1.2 2.7E-05 34.7 2.0 24 261-298 134-157 (166)
90 smart00834 CxxC_CXXC_SSSS Puta 78.1 1.2 2.5E-05 25.6 1.0 30 25-61 5-34 (41)
91 TIGR02605 CxxC_CxxC_SSSS putat 77.9 1.2 2.6E-05 27.3 1.0 29 26-61 6-34 (52)
92 PF09723 Zn-ribbon_8: Zinc rib 75.7 1.6 3.5E-05 25.4 1.1 29 26-61 6-34 (42)
93 PF09538 FYDLN_acid: Protein o 75.5 2 4.2E-05 31.1 1.7 13 141-153 26-38 (108)
94 KOG2186|consensus 74.4 1.5 3.3E-05 36.3 1.0 45 26-73 4-48 (276)
95 smart00531 TFIIE Transcription 73.2 2.9 6.4E-05 32.2 2.3 18 23-40 97-114 (147)
96 PF02892 zf-BED: BED zinc fing 72.5 3.6 7.7E-05 24.1 2.1 26 259-284 14-43 (45)
97 COG1592 Rubrerythrin [Energy p 72.4 2.8 6.1E-05 32.8 2.0 24 230-269 134-157 (166)
98 TIGR00373 conserved hypothetic 71.5 2.7 5.8E-05 32.9 1.8 15 262-276 110-124 (158)
99 PF03604 DNA_RNApol_7kD: DNA d 71.2 3.9 8.4E-05 22.1 1.8 11 262-272 1-11 (32)
100 KOG2186|consensus 71.2 1.7 3.8E-05 35.9 0.7 45 262-310 4-48 (276)
101 PF15269 zf-C2H2_7: Zinc-finge 71.1 4 8.6E-05 23.6 1.9 23 319-341 21-43 (54)
102 COG0068 HypF Hydrogenase matur 71.0 0.54 1.2E-05 45.0 -2.5 78 231-326 102-181 (750)
103 smart00531 TFIIE Transcription 70.9 3.7 7.9E-05 31.7 2.4 12 230-241 99-110 (147)
104 COG1996 RPC10 DNA-directed RNA 70.6 2.4 5.3E-05 25.4 1.0 29 23-61 4-32 (49)
105 COG1997 RPL43A Ribosomal prote 69.7 3.1 6.7E-05 28.3 1.5 15 259-273 51-65 (89)
106 TIGR00373 conserved hypothetic 69.7 3.2 7E-05 32.4 1.9 32 22-62 106-137 (158)
107 TIGR02300 FYDLN_acid conserved 69.4 3.1 6.6E-05 30.6 1.5 15 260-274 25-39 (129)
108 PRK06266 transcription initiat 69.4 2.9 6.4E-05 33.4 1.6 12 262-273 118-129 (178)
109 PF02176 zf-TRAF: TRAF-type zi 68.7 2.3 5E-05 26.8 0.8 44 21-65 5-54 (60)
110 PF06524 NOA36: NOA36 protein; 67.9 4.1 8.9E-05 33.9 2.1 41 196-238 139-179 (314)
111 PF00301 Rubredoxin: Rubredoxi 67.9 4.1 9E-05 24.3 1.6 8 319-326 35-42 (47)
112 PF08274 PhnA_Zn_Ribbon: PhnA 67.4 1.9 4E-05 22.9 0.1 7 319-325 20-26 (30)
113 PF04959 ARS2: Arsenite-resist 67.2 2.8 6E-05 34.5 1.1 31 315-345 74-104 (214)
114 smart00734 ZnF_Rad18 Rad18-lik 67.1 3.9 8.5E-05 20.9 1.2 18 292-310 3-20 (26)
115 PRK06266 transcription initiat 66.9 4 8.7E-05 32.6 1.9 32 22-62 114-145 (178)
116 PRK09678 DNA-binding transcrip 66.0 2.6 5.7E-05 27.8 0.6 39 291-331 2-42 (72)
117 COG2879 Uncharacterized small 65.7 9 0.00019 24.2 2.8 30 328-357 22-51 (65)
118 smart00614 ZnF_BED BED zinc fi 63.8 6.4 0.00014 23.8 2.0 25 261-285 18-47 (50)
119 PF14353 CpXC: CpXC protein 63.0 1.8 3.9E-05 32.5 -0.7 21 290-310 38-58 (128)
120 PF04959 ARS2: Arsenite-resist 62.5 5.7 0.00012 32.7 2.0 29 258-286 74-102 (214)
121 cd00730 rubredoxin Rubredoxin; 62.0 6.8 0.00015 23.8 1.8 7 319-325 35-41 (50)
122 smart00440 ZnF_C2C2 C2C2 Zinc 60.8 3.8 8.2E-05 23.5 0.6 11 319-329 29-39 (40)
123 KOG1280|consensus 60.4 6.8 0.00015 34.2 2.2 36 290-325 79-116 (381)
124 KOG1280|consensus 59.3 11 0.00024 32.9 3.3 41 196-237 76-116 (381)
125 PF06524 NOA36: NOA36 protein; 57.5 7.2 0.00016 32.5 1.8 91 165-281 137-229 (314)
126 COG5151 SSL1 RNA polymerase II 55.7 16 0.00035 31.4 3.6 83 259-345 320-415 (421)
127 PF07975 C1_4: TFIIH C1-like d 55.6 7.6 0.00017 23.6 1.3 26 23-48 19-44 (51)
128 PF10571 UPF0547: Uncharacteri 55.1 7.4 0.00016 19.9 1.0 8 321-328 17-24 (26)
129 COG3364 Zn-ribbon containing p 54.9 8 0.00017 27.1 1.4 14 261-274 2-15 (112)
130 PF05191 ADK_lid: Adenylate ki 53.9 12 0.00026 20.9 1.8 8 264-271 4-11 (36)
131 PF01096 TFIIS_C: Transcriptio 53.8 3.1 6.7E-05 23.7 -0.6 10 319-328 29-38 (39)
132 PF12760 Zn_Tnp_IS1595: Transp 51.9 8.6 0.00019 22.8 1.1 9 318-326 37-45 (46)
133 PF13878 zf-C2H2_3: zinc-finge 51.4 7 0.00015 22.5 0.6 24 25-48 13-38 (41)
134 PTZ00255 60S ribosomal protein 48.4 8.9 0.00019 26.5 0.8 33 228-273 34-66 (90)
135 KOG2907|consensus 47.6 10 0.00022 27.2 1.0 10 291-300 103-112 (116)
136 PRK00432 30S ribosomal protein 47.2 12 0.00025 22.8 1.1 10 290-299 37-46 (50)
137 PF14369 zf-RING_3: zinc-finge 47.1 12 0.00026 20.7 1.1 8 85-92 24-31 (35)
138 COG4530 Uncharacterized protei 46.7 11 0.00023 26.8 1.1 27 232-272 11-37 (129)
139 PF08790 zf-LYAR: LYAR-type C2 46.3 13 0.00029 19.3 1.1 19 26-45 1-19 (28)
140 TIGR00100 hypA hydrogenase nic 46.1 13 0.00028 27.3 1.5 41 5-61 54-94 (115)
141 PF03811 Zn_Tnp_IS1: InsA N-te 45.7 11 0.00024 21.0 0.8 29 292-324 7-35 (36)
142 PRK12380 hydrogenase nickel in 44.8 12 0.00026 27.3 1.2 41 5-61 54-94 (113)
143 PF07754 DUF1610: Domain of un 44.6 13 0.00028 18.6 0.9 11 23-33 14-24 (24)
144 KOG3408|consensus 44.5 10 0.00022 27.6 0.7 26 22-47 54-79 (129)
145 COG4888 Uncharacterized Zn rib 44.5 10 0.00022 26.6 0.7 37 23-63 20-56 (104)
146 KOG2593|consensus 44.4 17 0.00038 33.0 2.3 36 51-90 126-161 (436)
147 PRK03824 hypA hydrogenase nick 44.0 17 0.00038 27.5 1.9 16 23-38 68-83 (135)
148 PF05495 zf-CHY: CHY zinc fing 43.9 5.4 0.00012 26.3 -0.7 12 230-241 10-21 (71)
149 KOG2593|consensus 43.6 11 0.00023 34.2 0.8 42 18-62 121-162 (436)
150 KOG2272|consensus 43.5 17 0.00038 30.2 2.0 18 111-128 99-116 (332)
151 KOG2807|consensus 43.5 39 0.00085 29.5 4.1 25 318-342 345-369 (378)
152 TIGR00280 L37a ribosomal prote 43.2 9.6 0.00021 26.4 0.4 33 229-274 34-66 (91)
153 KOG4124|consensus 42.0 7.6 0.00017 33.9 -0.3 70 259-338 347-418 (442)
154 PF05443 ROS_MUCR: ROS/MUCR tr 41.8 14 0.00031 27.7 1.2 27 316-345 70-96 (132)
155 KOG2807|consensus 41.7 49 0.0011 28.9 4.4 32 228-268 343-374 (378)
156 TIGR01206 lysW lysine biosynth 41.4 14 0.00031 22.8 0.9 31 25-63 2-32 (54)
157 cd00924 Cyt_c_Oxidase_Vb Cytoc 41.4 19 0.00042 25.4 1.7 26 11-37 66-91 (97)
158 PF09332 Mcm10: Mcm10 replicat 41.3 8 0.00017 34.3 -0.3 58 260-327 251-312 (344)
159 COG5188 PRP9 Splicing factor 3 40.9 38 0.00081 29.9 3.6 21 141-161 238-258 (470)
160 PRK04023 DNA polymerase II lar 40.4 34 0.00073 35.0 3.7 11 319-329 664-674 (1121)
161 PF13240 zinc_ribbon_2: zinc-r 40.4 14 0.00031 18.1 0.7 9 28-36 2-10 (23)
162 COG1198 PriA Primosomal protei 39.8 21 0.00046 35.5 2.3 47 25-91 435-484 (730)
163 PF01780 Ribosomal_L37ae: Ribo 39.7 8.9 0.00019 26.5 -0.2 11 53-63 53-63 (90)
164 PRK14714 DNA polymerase II lar 38.9 39 0.00084 35.5 3.9 29 290-329 692-720 (1337)
165 PF01155 HypA: Hydrogenase exp 38.6 14 0.0003 27.1 0.6 27 24-62 69-95 (113)
166 COG0068 HypF Hydrogenase matur 38.5 11 0.00025 36.5 0.2 55 28-90 126-181 (750)
167 COG3357 Predicted transcriptio 38.4 19 0.00042 24.7 1.2 14 52-65 57-70 (97)
168 PF13451 zf-trcl: Probable zin 38.2 17 0.00038 21.9 0.9 14 23-36 2-15 (49)
169 PRK03976 rpl37ae 50S ribosomal 37.9 12 0.00027 25.8 0.3 32 229-273 35-66 (90)
170 PF11789 zf-Nse: Zinc-finger o 37.3 33 0.00071 21.5 2.1 32 259-295 22-53 (57)
171 PF13453 zf-TFIIB: Transcripti 37.0 20 0.00044 20.5 1.1 17 319-335 20-36 (41)
172 PF04606 Ogr_Delta: Ogr/Delta- 36.7 11 0.00024 22.5 -0.1 36 293-330 2-39 (47)
173 COG5188 PRP9 Splicing factor 3 36.6 24 0.00052 31.0 1.9 27 313-339 369-396 (470)
174 COG3677 Transposase and inacti 36.5 18 0.00038 27.2 0.9 15 316-330 51-65 (129)
175 PRK04023 DNA polymerase II lar 36.5 32 0.00069 35.2 2.9 10 262-271 664-673 (1121)
176 TIGR00244 transcriptional regu 36.2 17 0.00036 27.8 0.8 18 319-336 29-46 (147)
177 KOG3507|consensus 36.2 26 0.00056 21.8 1.4 11 260-270 36-46 (62)
178 PF08271 TF_Zn_Ribbon: TFIIB z 36.0 18 0.00039 20.9 0.8 9 262-270 20-28 (43)
179 smart00661 RPOL9 RNA polymeras 34.7 38 0.00081 20.4 2.1 11 261-271 20-30 (52)
180 PF14446 Prok-RING_1: Prokaryo 34.5 19 0.00041 22.2 0.7 10 263-272 7-16 (54)
181 PF07503 zf-HYPF: HypF finger; 34.0 5.1 0.00011 22.2 -1.7 10 262-271 22-31 (35)
182 COG1656 Uncharacterized conser 33.7 46 0.001 26.0 2.8 46 231-276 98-145 (165)
183 PRK03681 hypA hydrogenase nick 32.9 24 0.00051 25.9 1.1 42 5-61 54-95 (114)
184 PHA02998 RNA polymerase subuni 32.5 23 0.00049 27.9 1.0 11 319-329 172-182 (195)
185 KOG3408|consensus 31.9 21 0.00045 26.1 0.6 22 289-310 56-77 (129)
186 PF07649 C1_3: C1-like domain; 31.7 28 0.0006 18.3 1.0 10 260-269 14-23 (30)
187 PF13824 zf-Mss51: Zinc-finger 31.6 45 0.00098 20.7 2.0 13 318-330 14-26 (55)
188 PF04780 DUF629: Protein of un 31.0 26 0.00056 32.6 1.3 25 319-343 58-82 (466)
189 PF04780 DUF629: Protein of un 30.9 34 0.00074 31.9 2.0 28 141-168 57-84 (466)
190 PF05290 Baculo_IE-1: Baculovi 30.8 38 0.00082 25.3 1.8 13 83-95 122-134 (140)
191 TIGR00686 phnA alkylphosphonat 30.7 32 0.0007 24.6 1.4 13 318-330 19-31 (109)
192 PF14311 DUF4379: Domain of un 30.5 39 0.00086 20.8 1.7 15 262-276 29-43 (55)
193 COG1998 RPS31 Ribosomal protei 30.3 30 0.00066 20.7 1.0 8 291-298 38-45 (51)
194 smart00731 SprT SprT homologue 30.2 33 0.00071 26.4 1.6 11 261-271 112-122 (146)
195 PF05129 Elf1: Transcription e 29.7 7.4 0.00016 26.5 -1.8 13 319-331 47-59 (81)
196 KOG1606|consensus 29.4 27 0.00059 28.3 1.0 30 1-30 103-132 (296)
197 COG1571 Predicted DNA-binding 28.6 38 0.00083 31.0 1.9 14 290-303 367-380 (421)
198 COG3091 SprT Zn-dependent meta 28.5 22 0.00048 27.2 0.3 13 25-38 117-129 (156)
199 PF13821 DUF4187: Domain of un 28.4 47 0.001 20.6 1.7 19 291-309 28-46 (55)
200 COG1773 Rubredoxin [Energy pro 28.2 29 0.00062 21.5 0.7 13 261-273 3-15 (55)
201 COG1327 Predicted transcriptio 27.7 32 0.00069 26.4 1.0 16 319-334 29-44 (156)
202 TIGR00595 priA primosomal prot 27.3 42 0.00091 32.0 2.0 34 26-60 214-247 (505)
203 KOG1842|consensus 27.1 27 0.00058 31.9 0.6 30 259-288 13-42 (505)
204 PRK10220 hypothetical protein; 27.1 47 0.001 23.9 1.7 12 319-330 21-32 (111)
205 KOG4124|consensus 26.9 14 0.00031 32.3 -1.0 24 228-251 347-372 (442)
206 PF15135 UPF0515: Uncharacteri 26.5 46 0.001 27.8 1.8 16 167-182 152-167 (278)
207 PF14205 Cys_rich_KTR: Cystein 26.0 31 0.00066 21.2 0.6 12 260-271 3-14 (55)
208 TIGR00143 hypF [NiFe] hydrogen 25.9 7.9 0.00017 38.4 -3.1 27 293-325 121-147 (711)
209 PF07282 OrfB_Zn_ribbon: Putat 25.8 77 0.0017 20.4 2.6 10 319-328 47-56 (69)
210 KOG4118|consensus 25.7 34 0.00073 21.8 0.7 27 319-345 39-65 (74)
211 smart00154 ZnF_AN1 AN1-like Zi 25.6 40 0.00087 19.1 1.0 14 25-38 12-25 (39)
212 KOG1842|consensus 24.9 54 0.0012 30.0 2.1 29 319-347 16-44 (505)
213 KOG3214|consensus 24.8 31 0.00068 24.1 0.5 40 289-332 22-61 (109)
214 PF01927 Mut7-C: Mut7-C RNAse 24.8 54 0.0012 25.2 1.9 15 261-275 124-138 (147)
215 PF06397 Desulfoferrod_N: Desu 24.6 13 0.00028 20.7 -1.1 11 361-371 4-14 (36)
216 PRK00564 hypA hydrogenase nick 24.3 49 0.0011 24.4 1.5 17 22-38 68-84 (117)
217 PF10013 DUF2256: Uncharacteri 24.1 37 0.0008 19.6 0.6 16 262-277 9-24 (42)
218 PRK04351 hypothetical protein; 24.0 46 0.001 25.7 1.4 12 289-300 131-142 (149)
219 PF14803 Nudix_N_2: Nudix N-te 23.2 46 0.001 18.3 0.9 8 262-269 23-30 (34)
220 KOG4167|consensus 23.2 14 0.0003 35.9 -1.9 27 23-49 790-816 (907)
221 PTZ00448 hypothetical protein; 23.2 57 0.0012 29.2 1.9 24 318-341 314-337 (373)
222 PF14445 Prok-RING_2: Prokaryo 23.0 16 0.00035 21.9 -1.0 15 24-38 6-20 (57)
223 COG5152 Uncharacterized conser 23.0 72 0.0016 25.6 2.2 16 23-38 194-209 (259)
224 COG5112 UFD2 U1-like Zn-finger 23.0 29 0.00063 24.5 0.1 26 22-47 52-77 (126)
225 PF04328 DUF466: Protein of un 22.5 1.1E+02 0.0024 19.8 2.7 30 329-358 23-52 (65)
226 PF01428 zf-AN1: AN1-like Zinc 22.1 45 0.00097 19.3 0.8 14 24-37 12-25 (43)
227 PRK00762 hypA hydrogenase nick 22.0 54 0.0012 24.4 1.4 15 23-38 68-82 (124)
228 TIGR03831 YgiT_finger YgiT-typ 21.9 39 0.00085 19.5 0.5 12 319-330 33-44 (46)
229 PF09963 DUF2197: Uncharacteri 21.8 33 0.00073 21.3 0.2 9 172-180 4-12 (56)
230 PF08209 Sgf11: Sgf11 (transcr 21.2 67 0.0015 17.5 1.3 19 291-310 5-23 (33)
231 PF01363 FYVE: FYVE zinc finge 21.1 36 0.00077 22.0 0.2 9 263-271 11-19 (69)
232 smart00132 LIM Zinc-binding do 21.0 54 0.0012 17.8 1.0 10 54-63 28-37 (39)
233 PF10276 zf-CHCC: Zinc-finger 20.9 88 0.0019 18.0 1.7 18 18-35 22-39 (40)
234 PF03966 Trm112p: Trm112p-like 20.9 49 0.0011 21.5 0.8 18 19-36 47-64 (68)
235 COG4957 Predicted transcriptio 20.8 44 0.00096 25.0 0.7 21 263-287 78-98 (148)
236 cd00065 FYVE FYVE domain; Zinc 20.5 58 0.0013 20.0 1.1 9 264-272 5-13 (57)
237 PF05741 zf-nanos: Nanos RNA b 20.4 21 0.00045 22.2 -0.9 9 319-327 34-42 (55)
238 KOG0717|consensus 20.1 53 0.0011 30.4 1.1 19 262-280 293-311 (508)
No 1
>KOG3608|consensus
Probab=99.94 E-value=4.7e-27 Score=193.62 Aligned_cols=226 Identities=27% Similarity=0.564 Sum_probs=194.3
Q ss_pred ccc--CCCCccccChHHHHHhhhcccCC-----------CC-CCcc--cccccccccCHHHHHHHHhhhccCCCceeccc
Q psy10075 112 FSC--NFCDYKSKTKYDITSHSLRKHSK-----------GD-KPHK--CTICESTYLTKVELKNHILKIHQGQTRFQCSF 175 (373)
Q Consensus 112 ~~C--~~C~~~f~~~~~l~~h~~~~h~~-----------~~-~~~~--C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~ 175 (373)
|.| ..|+..|.+...+.+|+ -.|.. ++ ..+. +..|-+.+.++..|+.|+ +.|++++...|+.
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV-~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~-r~Hs~eKvvACp~ 212 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHV-VKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHI-RTHSNEKVVACPH 212 (467)
T ss_pred hccChhhcCCcccCHHHHHHHH-HHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHH-HhcCCCeEEecch
Confidence 455 45888888888888883 23321 22 2233 456999999999999997 8999999999999
Q ss_pred ccccccchhHHHhhhhhcc-CCCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCcccccChHHHhhhhhccC
Q psy10075 176 CDRRFYFKNECTKHYLEHH-TNEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDYICSSRTILKSHINKKH 254 (373)
Q Consensus 176 C~~~f~~~~~l~~H~~~~h-~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~ 254 (373)
||..|..+..|..|.++.. ....+|+|..|.+.|.+...|..|+ +.| ..-|+|+.|..+....++|..|++..|
T Consensus 213 Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv-~rH----vn~ykCplCdmtc~~~ssL~~H~r~rH 287 (467)
T KOG3608|consen 213 CGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV-VRH----VNCYKCPLCDMTCSSASSLTTHIRYRH 287 (467)
T ss_pred HHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH-HHh----hhcccccccccCCCChHHHHHHHHhhh
Confidence 9999999999999975543 3457999999999999999999998 666 457999999999999999999998888
Q ss_pred CCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCC--CcccccchhhhhhhhhhccCC-Cc--ccCCCCCCCcc
Q psy10075 255 ANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSF--CEYKCTTKFGLKSHVTRIHSN-QK--YFCGSCNFSCT 329 (373)
Q Consensus 255 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~-~~--~~C~~C~~~f~ 329 (373)
.. ++||+|..|++.|.+.+.|..|+ ..|. +..|+|.. |.+.|.+..+|++|++.+|.+ .+ |.|..|++-|.
T Consensus 288 s~--dkpfKCd~Cd~~c~~esdL~kH~-~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft 363 (467)
T KOG3608|consen 288 SK--DKPFKCDECDTRCVRESDLAKHV-QVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFT 363 (467)
T ss_pred cc--CCCccccchhhhhccHHHHHHHH-Hhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhc
Confidence 76 99999999999999999999999 7888 77899999 999999999999999999843 33 99999999999
Q ss_pred ChhHHHHHHHhhccccccc
Q psy10075 330 SNTDLLEHVQTNHIIVGES 348 (373)
Q Consensus 330 ~~~~l~~H~~~~H~~~~~~ 348 (373)
+..+|..|++..|++..+.
T Consensus 364 ~G~~L~~HL~kkH~f~~Ps 382 (467)
T KOG3608|consen 364 SGKSLSAHLMKKHGFRLPS 382 (467)
T ss_pred cchhHHHHHHHhhcccCCC
Confidence 9999999999999865543
No 2
>KOG2462|consensus
Probab=99.94 E-value=2.1e-27 Score=190.93 Aligned_cols=140 Identities=24% Similarity=0.505 Sum_probs=125.7
Q ss_pred CCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccChh
Q psy10075 196 NEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKKY 275 (373)
Q Consensus 196 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~ 275 (373)
....|+|+.||+.+.+.++|.+|.+..-.....+.+.|+.|++.|.+...|..|++++. -+..|.+|||.|...+
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-----l~c~C~iCGKaFSRPW 201 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-----LPCECGICGKAFSRPW 201 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-----CCcccccccccccchH
Confidence 44678999999999999999999744444455778999999999999999999996443 4689999999999999
Q ss_pred HHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhh
Q psy10075 276 DMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 276 ~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
.|+-|+ +.|+||+||.|+.|++.|.++++|+.||++|-+.++|+|+.|++.|....-|-+|...-
T Consensus 202 LLQGHi-RTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~ 266 (279)
T KOG2462|consen 202 LLQGHI-RTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA 266 (279)
T ss_pred Hhhccc-ccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence 999999 99999999999999999999999999999999999999999999999999999997653
No 3
>KOG3608|consensus
Probab=99.94 E-value=2.8e-26 Score=189.04 Aligned_cols=257 Identities=25% Similarity=0.536 Sum_probs=189.2
Q ss_pred ceecCC--CCccccC-hhhHhhccccCCC----------------------------------CCcccc--CcCcccccc
Q psy10075 25 NFICIL--CNYHAEN-FENIYLHLPSHFE----------------------------------QKLFSC--DLCEYQTNT 65 (373)
Q Consensus 25 ~~~C~~--C~~~f~~-~~~l~~H~~~h~~----------------------------------~~~~~C--~~C~~~f~~ 65 (373)
.+.|.. |++...+ ...|.+|+..|.= ...|.| ..|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 567754 8877655 5789999876630 012445 458888888
Q ss_pred hHHHHHHhhhhhcc------------C-CccccCc--cccccCCchHHHHHHhhhcCCCCCcccCCCCccccChHHHHHh
Q psy10075 66 KIYIRRHMLKIHLR------------Q-RNLECSL--CDSTFSDNMDLKKHIQRKHTNDRPFSCNFCDYKSKTKYDITSH 130 (373)
Q Consensus 66 ~~~l~~H~~~~h~~------------~-~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~h 130 (373)
...+..|+ ..|+- + ..+.|.. |-+.+.++..|+.|+ +.|++++...|+.|+.-|.+...|.+|
T Consensus 149 ~~ef~dHV-~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~-r~Hs~eKvvACp~Cg~~F~~~tkl~DH 226 (467)
T KOG3608|consen 149 IVEFQDHV-VKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHI-RTHSNEKVVACPHCGELFRTKTKLFDH 226 (467)
T ss_pred HHHHHHHH-HHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHH-HhcCCCeEEecchHHHHhccccHHHHH
Confidence 88888884 44421 1 2355644 777788888888884 778888888888888888888888888
Q ss_pred hhcccCCCCCCcccccccccccCHHHHHHHHhhhccCCCceecccccccccchhHHHhhhhhccCCCCccccCCCchhcc
Q psy10075 131 SLRKHSKGDKPHKCTICESTYLTKVELKNHILKIHQGQTRFQCSFCDRRFYFKNECTKHYLEHHTNEKPFKCTLCSYKAK 210 (373)
Q Consensus 131 ~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 210 (373)
+++.......+|.|..|.+.|.+...|..|+ ..| ..-|+|+.|+.+....++|..|+...|..++||+|..|++.+.
T Consensus 227 ~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv-~rH--vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~ 303 (467)
T KOG3608|consen 227 LRRQTELNTNSFQCAQCFKRFATEKLLKSHV-VRH--VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCV 303 (467)
T ss_pred HHhhhhhcCCchHHHHHHHHHhHHHHHHHHH-HHh--hhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhc
Confidence 7776666667888888888888888888886 555 4567888888888888888888877788888888888888888
Q ss_pred CcHHHHHHHHHhcCCCCccceecCC--CcccccChHHHhhhhhccCCCCCCCCeecCCCCccccChhHHHHhhhhhccCC
Q psy10075 211 RNFELKQHILRIHQDNIEAEYHCTY--CDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGL 288 (373)
Q Consensus 211 ~~~~l~~H~~~~h~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 288 (373)
+.++|.+|+ ..|. +..|.|+. |...+.+...+..|+.++|.+....+|.|-.|++.|++-.+|..|+.+.|.-.
T Consensus 304 ~esdL~kH~-~~HS---~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 304 RESDLAKHV-QVHS---KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred cHHHHHHHH-Hhcc---ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 888888887 4665 45677766 88888888888888877777777777888888888888888888876666544
Q ss_pred CC
Q psy10075 289 KP 290 (373)
Q Consensus 289 ~~ 290 (373)
.|
T Consensus 380 ~P 381 (467)
T KOG3608|consen 380 LP 381 (467)
T ss_pred CC
Confidence 33
No 4
>KOG1074|consensus
Probab=99.93 E-value=4.6e-27 Score=213.88 Aligned_cols=200 Identities=25% Similarity=0.488 Sum_probs=153.5
Q ss_pred CCcccccccccccCHHHHHHHHhhhccCCCceecccccccccchhHHHhhhhhccCC----CCccccC---CCchhccCc
Q psy10075 140 KPHKCTICESTYLTKVELKNHILKIHQGQTRFQCSFCDRRFYFKNECTKHYLEHHTN----EKPFKCT---LCSYKAKRN 212 (373)
Q Consensus 140 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~----~~~~~C~---~C~~~f~~~ 212 (373)
.+..|-.|.++..-..+|+.|. +.|+|++||+|.+||+.|..+.+|..|+ ..|.. ..++.|| +|.+.|...
T Consensus 604 dPNqCiiC~rVlSC~saLqmHy-rtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~R~q~ScP~~~ic~~kftn~ 681 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHY-RTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPARVQFSCPSTFICQKKFTNA 681 (958)
T ss_pred Cccceeeeeecccchhhhhhhh-hcccCcCccccccccchhccccchhhcc-cccccCccccccccCCchhhhccccccc
Confidence 4678999999999999999998 8999999999999999999999999997 44433 3568899 999999999
Q ss_pred HHHHHHHHHhcCCC-----------CccceecCCCcccccChHHHhhhhhccCCC-------------CCC----CCeec
Q psy10075 213 FELKQHILRIHQDN-----------IEAEYHCTYCDYICSSRTILKSHINKKHAN-------------DED----KPFTC 264 (373)
Q Consensus 213 ~~l~~H~~~~h~~~-----------~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~-------------~~~----~~~~C 264 (373)
..|..|+ +.|... ....-.|..|.+.|.+...+..++..+-.. .++ .+..+
T Consensus 682 V~lpQhI-riH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e 760 (958)
T KOG1074|consen 682 VTLPQHI-RIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPE 760 (958)
T ss_pred ccccceE-EeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccc
Confidence 9999998 777521 112357899999998888888777443110 011 13556
Q ss_pred CCCCccccChhHHHHhhhhh----------------------ccCCC---------------------------------
Q psy10075 265 EHCDSKFCKKYDMKMHMSRA----------------------HYGLK--------------------------------- 289 (373)
Q Consensus 265 ~~C~~~f~~~~~L~~H~~~~----------------------H~~~~--------------------------------- 289 (373)
..|+..+.....+..+-... +++++
T Consensus 761 ~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg 840 (958)
T KOG1074|consen 761 NSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG 840 (958)
T ss_pred cccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence 66776665554443332100 00000
Q ss_pred --------------------------------------CccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccCh
Q psy10075 290 --------------------------------------PFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSN 331 (373)
Q Consensus 290 --------------------------------------~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~ 331 (373)
.-.|..|++.|.+.++|..|+++|.++++|.|.+|++.|..+
T Consensus 841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttr 920 (958)
T KOG1074|consen 841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTR 920 (958)
T ss_pred cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhh
Confidence 056999999999999999999999999999999999999999
Q ss_pred hHHHHHHHhhc
Q psy10075 332 TDLLEHVQTNH 342 (373)
Q Consensus 332 ~~l~~H~~~~H 342 (373)
.+|++||.+|+
T Consensus 921 gnLKvHMgtH~ 931 (958)
T KOG1074|consen 921 GNLKVHMGTHM 931 (958)
T ss_pred hhhhhhhcccc
Confidence 99999999997
No 5
>KOG1074|consensus
Probab=99.92 E-value=2.1e-26 Score=209.54 Aligned_cols=48 Identities=35% Similarity=0.768 Sum_probs=40.0
Q ss_pred cccCccccccCCchHHHHHHhhhcCCCCCcccCCCCccccChHHHHHhh
Q psy10075 83 LECSLCDSTFSDNMDLKKHIQRKHTNDRPFSCNFCDYKSKTKYDITSHS 131 (373)
Q Consensus 83 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~h~ 131 (373)
.+|..|.+.|...+.|+.| .++|++++||+|.+|+..|.++.+|+.|.
T Consensus 354 hkCr~CakvfgS~SaLqiH-lRSHTGERPfqCnvCG~~FSTkGNLKvH~ 401 (958)
T KOG1074|consen 354 HKCRFCAKVFGSDSALQIH-LRSHTGERPFQCNVCGNRFSTKGNLKVHF 401 (958)
T ss_pred chhhhhHhhcCchhhhhhh-hhccCCCCCeeecccccccccccceeeee
Confidence 5688888888888888888 47888888888888888888888888883
No 6
>KOG2462|consensus
Probab=99.91 E-value=3.1e-25 Score=178.49 Aligned_cols=126 Identities=23% Similarity=0.507 Sum_probs=117.1
Q ss_pred ccceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhh
Q psy10075 228 EAEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLK 307 (373)
Q Consensus 228 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~ 307 (373)
...|.|++||+.+.+.++|.+|.+.+-..+..+.+.|+.|++.|.....|.+|+ ++|+ -+.+|.+||+.|...=-|.
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHi-rTH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHI-RTHT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHh-hccC--CCcccccccccccchHHhh
Confidence 456999999999999999999998888888888999999999999999999999 8886 6799999999999999999
Q ss_pred hhhhhccCCCcccCCCCCCCccChhHHHHHHHhhcccccccccchhHHHHHHHhhhhHhhhccccC
Q psy10075 308 SHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTNHIIVGESSSRDFIIHSFVKNAMYHCQITLNAF 373 (373)
Q Consensus 308 ~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f 373 (373)
-|+++|.|++||.|+.|++.|.++.||+.||++| .+ .+.|+|+.|+|+|
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTH---S~--------------~K~~qC~~C~KsF 253 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTH---SD--------------VKKHQCPRCGKSF 253 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhh---cC--------------CccccCcchhhHH
Confidence 9999999999999999999999999999999999 32 3579999999998
No 7
>KOG3623|consensus
Probab=99.86 E-value=2.3e-22 Score=180.59 Aligned_cols=107 Identities=28% Similarity=0.597 Sum_probs=90.9
Q ss_pred CcceecCCCCccccChhhHhhccc-cCC-CCCccccCcCcccccchHHHHHHhhhhhcc-------------CCccccCc
Q psy10075 23 SYNFICILCNYHAENFENIYLHLP-SHF-EQKLFSCDLCEYQTNTKIYIRRHMLKIHLR-------------QRNLECSL 87 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~~-~h~-~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~-------------~~~~~C~~ 87 (373)
...+.|+.|.+.+.....|..|++ .|. .+..|.|..|..+|..+..|.+|| ..|.. .+.|+|..
T Consensus 208 sqlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm-~~hkpg~dqa~sltqsa~lRKFKCtE 286 (1007)
T KOG3623|consen 208 SQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHM-QLHKPGGDQAISLTQSALLRKFKCTE 286 (1007)
T ss_pred hhhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHH-HhhcCCCcccccccchhhhccccccc
Confidence 345789999999999999999965 333 345699999999999999999996 45522 35699999
Q ss_pred cccccCCchHHHHHHhhhcCCCCCcccCCCCccccChHHHHHhh
Q psy10075 88 CDSTFSDNMDLKKHIQRKHTNDRPFSCNFCDYKSKTKYDITSHS 131 (373)
Q Consensus 88 C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~h~ 131 (373)
|++.|..+..|..|+ ++|.|++||.|+.|.+.|.....+..||
T Consensus 287 CgKAFKfKHHLKEHl-RIHSGEKPfeCpnCkKRFSHSGSySSHm 329 (1007)
T KOG3623|consen 287 CGKAFKFKHHLKEHL-RIHSGEKPFECPNCKKRFSHSGSYSSHM 329 (1007)
T ss_pred cchhhhhHHHHHhhh-eeecCCCCcCCcccccccccCCcccccc
Confidence 999999999999995 8999999999999999999999999994
No 8
>KOG3623|consensus
Probab=99.85 E-value=2.5e-22 Score=180.35 Aligned_cols=78 Identities=22% Similarity=0.554 Sum_probs=75.2
Q ss_pred eecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHh
Q psy10075 262 FTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQT 340 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~ 340 (373)
|.|..|+|+|...+.|.+|. ..|+|.+||+|.+|.+.|..+..|..|.+.|-|+++|+|..|++.|......-.||.-
T Consensus 895 yaCDqCDK~FqKqSSLaRHK-YEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHMNH 972 (1007)
T KOG3623|consen 895 YACDQCDKAFQKQSSLARHK-YEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHMNH 972 (1007)
T ss_pred chHHHHHHHHHhhHHHHHhh-hhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhhcc
Confidence 99999999999999999999 9999999999999999999999999999999999999999999999999999999763
No 9
>KOG3576|consensus
Probab=99.64 E-value=7.1e-17 Score=123.61 Aligned_cols=114 Identities=28% Similarity=0.660 Sum_probs=87.7
Q ss_pred cceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhh
Q psy10075 229 AEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKS 308 (373)
Q Consensus 229 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~ 308 (373)
..|.|.+|++.|.-.-.|.+|++- | ...+.|.|..||+.|.....|.+|+ +.|++.+||+|..|++.|+.+-+|..
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kc-h--~~vkr~lct~cgkgfndtfdlkrh~-rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKC-H--SDVKRHLCTFCGKGFNDTFDLKRHT-RTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhh-c--cHHHHHHHhhccCcccchhhhhhhh-ccccCccccchhhhhHHHHhhccHHH
Confidence 346677777777666666666642 2 2256677888888888888888888 78888888888888888888888888
Q ss_pred hhhhccCCC-----------cccCCCCCCCccChhHHHHHHHhhccccc
Q psy10075 309 HVTRIHSNQ-----------KYFCGSCNFSCTSNTDLLEHVQTNHIIVG 346 (373)
Q Consensus 309 H~~~~h~~~-----------~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 346 (373)
|++..|+.. .|.|..||+.-...+.+..|++.+|+...
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 887777643 39999999999999999999999987553
No 10
>KOG3576|consensus
Probab=99.56 E-value=1.2e-15 Score=117.02 Aligned_cols=131 Identities=22% Similarity=0.377 Sum_probs=89.8
Q ss_pred eeecCCCCcceecCCCCccccChhhHhhccccCCCCCccccCcCcccccchHHHHHHhhhhhccCCccccCccccccCCc
Q psy10075 16 FVSRLDDSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQTNTKIYIRRHMLKIHLRQRNLECSLCDSTFSDN 95 (373)
Q Consensus 16 ~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~ 95 (373)
.+....+...|.|.+|++.|.....|.+|++-|..-+.+-|.-||+.|+.-.+|.+| ...|.+-+||+|..|++.|..+
T Consensus 108 t~gsssd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh-~rthtgvrpykc~~c~kaftqr 186 (267)
T KOG3576|consen 108 TIGSSSDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRH-TRTHTGVRPYKCSLCEKAFTQR 186 (267)
T ss_pred cccCCCCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhh-hccccCccccchhhhhHHHHhh
Confidence 334444566677888888887777777888777777777777777777777777777 4677777777777777777777
Q ss_pred hHHHHHHhhhcCCCCCcccCCCCccccChHHHHHhhhcccCCCCCCcccccccccccCHHHHHHHHhhhccC
Q psy10075 96 MDLKKHIQRKHTNDRPFSCNFCDYKSKTKYDITSHSLRKHSKGDKPHKCTICESTYLTKVELKNHILKIHQG 167 (373)
Q Consensus 96 ~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~h~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 167 (373)
-.|..|++.+|.....| .+.+- .++.|.|..||..-.....+..|+...|..
T Consensus 187 csleshl~kvhgv~~~y-------------ayker-------r~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 187 CSLESHLKKVHGVQHQY-------------AYKER-------RAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred ccHHHHHHHHcCchHHH-------------HHHHh-------hhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 77777777777543222 11111 345677777777777777777776444433
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.23 E-value=3.6e-11 Score=109.93 Aligned_cols=146 Identities=20% Similarity=0.396 Sum_probs=108.4
Q ss_pred eecccccccccchhHHHhhhhhccCCCCccccCC--CchhccCcHHHHHHHHHhcCCCCccceecCCCcccccChHHHhh
Q psy10075 171 FQCSFCDRRFYFKNECTKHYLEHHTNEKPFKCTL--CSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDYICSSRTILKS 248 (373)
Q Consensus 171 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~ 248 (373)
-.|+.|..... ...|..|. . +..-..-.|+. |+..|.... + ++.+.|+.|++.|. ...|..
T Consensus 408 V~C~NC~~~i~-l~~l~lHe-~-~C~r~~V~Cp~~~Cg~v~~r~e-l------------~~H~~C~~Cgk~f~-~s~Lek 470 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHE-A-YCSRHNVVCPHDGCGIVLRVEE-A------------KNHVHCEKCGQAFQ-QGEMEK 470 (567)
T ss_pred EECCCCCCccc-hhHHHHHH-h-hCCCcceeCCcccccceeeccc-c------------ccCccCCCCCCccc-hHHHHH
Confidence 46888877766 44566774 3 34445566874 888773322 2 33458999999885 678899
Q ss_pred hhhccCCCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCCCccccc----------chhhhhhhhhhccCCCc
Q psy10075 249 HINKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCT----------TKFGLKSHVTRIHSNQK 318 (373)
Q Consensus 249 H~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~----------~~~~l~~H~~~~h~~~~ 318 (373)
|+...| .++.|+ ||+.+ ....|..|+ ..|...+++.|++|++.|. ..+.|..|.. ..+..+
T Consensus 471 H~~~~H-----kpv~Cp-Cg~~~-~R~~L~~H~-~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~-~CG~rt 541 (567)
T PLN03086 471 HMKVFH-----EPLQCP-CGVVL-EKEQMVQHQ-ASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHES-ICGSRT 541 (567)
T ss_pred HHHhcC-----CCccCC-CCCCc-chhHHHhhh-hccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHH-hcCCcc
Confidence 986543 579999 99765 668999998 7888899999999999884 1347889984 468888
Q ss_pred ccCCCCCCCccChhHHHHHHHhhcc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
+.|..||+.+.. ..|..|+...|.
T Consensus 542 ~~C~~Cgk~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 542 APCDSCGRSVML-KEMDIHQIAVHQ 565 (567)
T ss_pred eEccccCCeeee-hhHHHHHHHhhc
Confidence 999999988776 468888887764
No 12
>PHA00733 hypothetical protein
Probab=99.17 E-value=3.5e-11 Score=89.59 Aligned_cols=85 Identities=16% Similarity=0.333 Sum_probs=62.1
Q ss_pred CCCeecCCCCccccChhHHHHh--hh--hhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHH
Q psy10075 259 DKPFTCEHCDSKFCKKYDMKMH--MS--RAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDL 334 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~~~~L~~H--~~--~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l 334 (373)
.+++.|.+|+..|.....|..+ +. ..+.+.+||.|+.|++.|.+...|..|++.+ ..+|.|..|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 5556666666666665555554 10 1333477888888888888888888888643 3458999999999999999
Q ss_pred HHHHHhhcccc
Q psy10075 335 LEHVQTNHIIV 345 (373)
Q Consensus 335 ~~H~~~~H~~~ 345 (373)
..|+...|+|.
T Consensus 116 ~~H~~~~h~~~ 126 (128)
T PHA00733 116 LDHVCKKHNIC 126 (128)
T ss_pred HHHHHHhcCcc
Confidence 99999998864
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.05 E-value=6e-10 Score=102.09 Aligned_cols=147 Identities=17% Similarity=0.359 Sum_probs=106.2
Q ss_pred CcccccccccccCHHHHHHHHhhhccCCCceeccc--ccccccchhHHHhhhhhccCCCCccccCCCchhccCcHHHHHH
Q psy10075 141 PHKCTICESTYLTKVELKNHILKIHQGQTRFQCSF--CDRRFYFKNECTKHYLEHHTNEKPFKCTLCSYKAKRNFELKQH 218 (373)
Q Consensus 141 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H 218 (373)
...|+.|.+.... ..|..|. .+.....-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|
T Consensus 407 ~V~C~NC~~~i~l-~~l~lHe--~~C~r~~V~Cp~~~Cg~v~~-r~el~~H----------~~C~~Cgk~f~-~s~LekH 471 (567)
T PLN03086 407 TVECRNCKHYIPS-RSIALHE--AYCSRHNVVCPHDGCGIVLR-VEEAKNH----------VHCEKCGQAFQ-QGEMEKH 471 (567)
T ss_pred eEECCCCCCccch-hHHHHHH--hhCCCcceeCCcccccceee-ccccccC----------ccCCCCCCccc-hHHHHHH
Confidence 3578888876553 4566774 444455667874 888885 3333333 47999999985 5778999
Q ss_pred HHHhcCCCCccceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCcccc----------ChhHHHHhhhhhccCC
Q psy10075 219 ILRIHQDNIEAEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFC----------KKYDMKMHMSRAHYGL 288 (373)
Q Consensus 219 ~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~----------~~~~L~~H~~~~H~~~ 288 (373)
+...| .++.|+ ||+.+ .+..|..|+.. +-+.+++.|+.|++.|. ....|..|. ... +.
T Consensus 472 ~~~~H-----kpv~Cp-Cg~~~-~R~~L~~H~~t---hCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE-~~C-G~ 539 (567)
T PLN03086 472 MKVFH-----EPLQCP-CGVVL-EKEQMVQHQAS---TCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHE-SIC-GS 539 (567)
T ss_pred HHhcC-----CCccCC-CCCCc-chhHHHhhhhc---cCCCCceeCCCCCCccccCccccchhhhhhhHHHHH-Hhc-CC
Confidence 85433 678999 99755 66899999742 23478999999999985 245899998 554 78
Q ss_pred CCccCCCCcccccchhhhhhhhhhccC
Q psy10075 289 KPFKCSFCEYKCTTKFGLKSHVTRIHS 315 (373)
Q Consensus 289 ~~~~C~~C~~~f~~~~~l~~H~~~~h~ 315 (373)
+++.|..|++.+ ....|..|+...|.
T Consensus 540 rt~~C~~Cgk~V-rlrdm~~H~~~~h~ 565 (567)
T PLN03086 540 RTAPCDSCGRSV-MLKEMDIHQIAVHQ 565 (567)
T ss_pred cceEccccCCee-eehhHHHHHHHhhc
Confidence 889999999987 46678888866654
No 14
>PHA00733 hypothetical protein
Probab=99.05 E-value=2e-10 Score=85.58 Aligned_cols=85 Identities=16% Similarity=0.300 Sum_probs=61.1
Q ss_pred ccceecCCCcccccChHHHhhh--hhccCCCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhh
Q psy10075 228 EAEYHCTYCDYICSSRTILKSH--INKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFG 305 (373)
Q Consensus 228 ~~~~~C~~C~~~f~~~~~l~~H--~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~ 305 (373)
.+++.|.+|+..|.+...|..+ +..+....+.+||.|+.|++.|.....|..|+ +.| +.+|.|..|++.|.....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~-r~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHI-RYT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHH-hcC--CcCccCCCCCCccCCHHH
Confidence 5667777777777766666555 22222333477888888888888888888888 444 356888888888888888
Q ss_pred hhhhhhhccC
Q psy10075 306 LKSHVTRIHS 315 (373)
Q Consensus 306 l~~H~~~~h~ 315 (373)
|..|+...|+
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 8888866664
No 15
>KOG3993|consensus
Probab=98.82 E-value=2.5e-09 Score=91.76 Aligned_cols=53 Identities=13% Similarity=0.051 Sum_probs=41.8
Q ss_pred ccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhhcc
Q psy10075 291 FKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
-.|++|+..+.++..=..+.+.-+..+.|.|.+|.-.|.+..+|..|+..-|.
T Consensus 431 ~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 431 ELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred cCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 34777777777776666666556666779999999999999999999988774
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.79 E-value=2.6e-09 Score=64.97 Aligned_cols=44 Identities=23% Similarity=0.387 Sum_probs=33.6
Q ss_pred CeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhh
Q psy10075 261 PFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLK 307 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~ 307 (373)
.|.|+.||+.|+..+.|..|+ +.|+ ++|+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~-r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHL-RKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHH-HhcC--CcccCCcccceecccceeE
Confidence 477888888888888888888 5565 5788888888887666654
No 17
>KOG3993|consensus
Probab=98.68 E-value=5e-09 Score=89.98 Aligned_cols=198 Identities=16% Similarity=0.223 Sum_probs=106.7
Q ss_pred cccCCCCccccChHHHHHhhhcccCCCCCCcccccccccccCHHHHHHHHhhhccCCCceec-ccc-cccccchhHHHhh
Q psy10075 112 FSCNFCDYKSKTKYDITSHSLRKHSKGDKPHKCTICESTYLTKVELKNHILKIHQGQTRFQC-SFC-DRRFYFKNECTKH 189 (373)
Q Consensus 112 ~~C~~C~~~f~~~~~l~~h~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C-~~C-~~~f~~~~~l~~H 189 (373)
|.|.+|...|.+...|..| +.-..-...|+|++|+++|+-..+|..|. ++|-....-.= ..= .+.-.....+...
T Consensus 268 yiCqLCK~kYeD~F~LAQH--rC~RIV~vEYrCPEC~KVFsCPANLASHR-RWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQH--RCPRIVHVEYRCPECDKVFSCPANLASHR-RWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHhhhhHHHHhhc--cCCeeEEeeecCCcccccccCchhhhhhh-cccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 7777787777777777777 23222344577888888888777887774 66632211000 000 0000000000000
Q ss_pred hhhc---cCCCCccccCCCchhccCcHHHHHHHHHhcCCCCccc--------------eecCCCcccccChHHHhhhhhc
Q psy10075 190 YLEH---HTNEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAE--------------YHCTYCDYICSSRTILKSHINK 252 (373)
Q Consensus 190 ~~~~---h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~--------------~~C~~C~~~f~~~~~l~~H~~~ 252 (373)
... ...+.-|.|.+|++.|++...|+.|+...|.....+. +.+..+...+.....-..+. .
T Consensus 345 -a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v-l 422 (500)
T KOG3993|consen 345 -AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV-L 422 (500)
T ss_pred -ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce-e
Confidence 000 0223468899999999999999999755553321110 22333332222211111111 0
Q ss_pred cCCCCCCCCeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCC
Q psy10075 253 KHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSN 316 (373)
Q Consensus 253 ~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 316 (373)
.+... .....++.|+..+..+..=..+. +.-.....|.|.+|...|.+...|.+|+..-|..
T Consensus 423 ~~a~s-ael~~pp~~~~ppsss~~sgg~~-rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 423 YVAGS-AELELPPYDGSPPSSSGSSGGYG-RLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred eeecc-ccccCCCCCCCCcccCCCCCccc-cccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 11111 12246788887777765544444 2223345689999999999999999999666643
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.65 E-value=1e-08 Score=62.40 Aligned_cols=44 Identities=9% Similarity=0.319 Sum_probs=39.2
Q ss_pred CccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHH
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLL 335 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~ 335 (373)
-|.|+.||+.|+..++|..|+++|. .+|+|..|++.|....+|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999998876 6899999999999877664
No 19
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.42 E-value=2.6e-07 Score=57.73 Aligned_cols=49 Identities=24% Similarity=0.604 Sum_probs=29.8
Q ss_pred ccCCCCcccccchhhhhhhhhhccCCCc--ccCCCCCCCccChhHHHHHHHhhc
Q psy10075 291 FKCSFCEYKCTTKFGLKSHVTRIHSNQK--YFCGSCNFSCTSNTDLLEHVQTNH 342 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~--~~C~~C~~~f~~~~~l~~H~~~~H 342 (373)
|.||+|++. .+..+|..|+...|..+. +.|++|...+. .+|..|+..+|
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 566666663 345666666666665432 66777766433 36777766665
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.39 E-value=1.6e-07 Score=48.72 Aligned_cols=25 Identities=40% Similarity=0.999 Sum_probs=20.3
Q ss_pred HHHHhhhhhccCCCCccCCCCccccc
Q psy10075 276 DMKMHMSRAHYGLKPFKCSFCEYKCT 301 (373)
Q Consensus 276 ~L~~H~~~~H~~~~~~~C~~C~~~f~ 301 (373)
+|..|+ +.|++++||.|+.|++.|.
T Consensus 1 ~l~~H~-~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHM-RTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHH-HHHSSSSSEEESSSSEEES
T ss_pred CHHHHh-hhcCCCCCCCCCCCcCeeC
Confidence 477888 6788888888888888875
No 21
>PHA00732 hypothetical protein
Probab=98.24 E-value=9.1e-07 Score=59.55 Aligned_cols=44 Identities=27% Similarity=0.531 Sum_probs=27.9
Q ss_pred CeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhh
Q psy10075 261 PFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 310 (373)
||.|+.|++.|.+...|..|++..|.+ +.|+.|++.|. .+..|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhh
Confidence 466777777777777777776334542 46777777764 466666
No 22
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.23 E-value=1.4e-06 Score=54.37 Aligned_cols=50 Identities=26% Similarity=0.456 Sum_probs=28.5
Q ss_pred eecCCCCccccChhHHHHhhhhhccCC-CCccCCCCcccccchhhhhhhhhhcc
Q psy10075 262 FTCEHCDSKFCKKYDMKMHMSRAHYGL-KPFKCSFCEYKCTTKFGLKSHVTRIH 314 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~h 314 (373)
|.||.|++ ..+...|..|....|..+ +.+.||+|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 56666666 334556666665555543 3466666665533 36666665554
No 23
>KOG1146|consensus
Probab=98.22 E-value=1.2e-06 Score=86.16 Aligned_cols=101 Identities=21% Similarity=0.357 Sum_probs=77.4
Q ss_pred hhhhcccccce--eecCCCCcceecCCCCccccChhhHhhccccCC-------------------------CCCccccCc
Q psy10075 6 EILNHSRECAF--VSRLDDSYNFICILCNYHAENFENIYLHLPSHF-------------------------EQKLFSCDL 58 (373)
Q Consensus 6 ~~~~~~~~~~~--~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~-------------------------~~~~~~C~~ 58 (373)
..+.+.+.... +.+++-++.|+|+.|+..|...+.|..|++..+ +.++|.|..
T Consensus 444 ~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~ 523 (1406)
T KOG1146|consen 444 PLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRA 523 (1406)
T ss_pred hhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCccccee
Confidence 34444555444 555666799999999999999999999998722 236799999
Q ss_pred CcccccchHHHHHHhhhh-hc-----------------------------------c------CCccccCccccccCCch
Q psy10075 59 CEYQTNTKIYIRRHMLKI-HL-----------------------------------R------QRNLECSLCDSTFSDNM 96 (373)
Q Consensus 59 C~~~f~~~~~l~~H~~~~-h~-----------------------------------~------~~~~~C~~C~~~f~~~~ 96 (373)
|...+..+..|..|++.. |. + ...|.|..|+....-..
T Consensus 524 C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniar 603 (1406)
T KOG1146|consen 524 CNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIAR 603 (1406)
T ss_pred eeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhh
Confidence 999999999999997432 20 0 12489999999988888
Q ss_pred HHHHHHhhhc
Q psy10075 97 DLKKHIQRKH 106 (373)
Q Consensus 97 ~l~~H~~~~h 106 (373)
.|..||..++
T Consensus 604 nlrihmtss~ 613 (1406)
T KOG1146|consen 604 NLRIHMTASP 613 (1406)
T ss_pred ccccccccCC
Confidence 8988876554
No 24
>PHA00616 hypothetical protein
Probab=98.21 E-value=3.8e-07 Score=52.72 Aligned_cols=31 Identities=19% Similarity=0.380 Sum_probs=15.4
Q ss_pred eecCCCCccccChhHHHHhhhhhccCCCCccC
Q psy10075 262 FTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKC 293 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C 293 (373)
|+|+.||+.|...++|..|+ +.|++++++.|
T Consensus 2 YqC~~CG~~F~~~s~l~~H~-r~~hg~~~~~~ 32 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHL-LSVHKQNKLTL 32 (44)
T ss_pred CccchhhHHHhhHHHHHHHH-HHhcCCCccce
Confidence 45555555555555555555 44444444444
No 25
>PHA00732 hypothetical protein
Probab=98.18 E-value=1.1e-06 Score=59.18 Aligned_cols=47 Identities=36% Similarity=0.638 Sum_probs=40.1
Q ss_pred CccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhh
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
||.|..|++.|.+.++|..|++.+|.+ +.|+.|++.|. .+..|+.+.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~--~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL--TKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC--CccCCCCCEeC---Chhhhhccc
Confidence 589999999999999999999755543 58999999998 588888665
No 26
>PHA00616 hypothetical protein
Probab=98.13 E-value=9.2e-07 Score=51.13 Aligned_cols=33 Identities=18% Similarity=0.291 Sum_probs=28.8
Q ss_pred CccCCCCcccccchhhhhhhhhhccCCCcccCC
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCG 322 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~ 322 (373)
||+|+.||+.|..+++|..|++++|+..++.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 688999999999999999999889988887765
No 27
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.05 E-value=1e-06 Score=45.63 Aligned_cols=17 Identities=24% Similarity=0.782 Sum_probs=7.3
Q ss_pred hhccCCccccCcccccc
Q psy10075 76 IHLRQRNLECSLCDSTF 92 (373)
Q Consensus 76 ~h~~~~~~~C~~C~~~f 92 (373)
.|.++++|.|+.|++.|
T Consensus 8 ~H~~~k~~~C~~C~k~F 24 (26)
T PF13465_consen 8 THTGEKPYKCPYCGKSF 24 (26)
T ss_dssp HHSSSSSEEESSSSEEE
T ss_pred hcCCCCCCCCCCCcCee
Confidence 34444444444444443
No 28
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.86 E-value=7.8e-06 Score=58.93 Aligned_cols=73 Identities=23% Similarity=0.346 Sum_probs=20.5
Q ss_pred ecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhh
Q psy10075 263 TCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 263 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
.|..|+..|.+...|..|+...|.-..+ ....+.....+..++. ......+.|.+|+..|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~-~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLR-KKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccc-cccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 4788888888888888888667754432 1122234455555542 22233588999999999999999998865
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.76 E-value=1.6e-05 Score=40.34 Aligned_cols=24 Identities=33% Similarity=0.693 Sum_probs=15.5
Q ss_pred ccCCCCCCCccChhHHHHHHHhhc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNH 342 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H 342 (373)
|.|++|++.|.+...|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 467777777777777777777664
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.73 E-value=1.3e-05 Score=40.27 Aligned_cols=22 Identities=32% Similarity=0.637 Sum_probs=13.3
Q ss_pred ccCCCCCCCccChhHHHHHHHh
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQT 340 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~ 340 (373)
|.|+.|++.|.++.+|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666554
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.60 E-value=4e-05 Score=38.45 Aligned_cols=20 Identities=35% Similarity=0.851 Sum_probs=11.7
Q ss_pred eecCCCCccccChhHHHHhh
Q psy10075 262 FTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~ 281 (373)
|.|+.|++.|.+...|..|+
T Consensus 1 y~C~~C~~~f~~~~~l~~H~ 20 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHM 20 (23)
T ss_dssp EEETTTTEEESSHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHH
Confidence 45555666666666666555
No 32
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.59 E-value=2.9e-05 Score=64.77 Aligned_cols=52 Identities=33% Similarity=0.742 Sum_probs=39.8
Q ss_pred CCCeecCC--CCccccChhHHHHhhhhhcc------------------CCCCccCCCCcccccchhhhhhhh
Q psy10075 259 DKPFTCEH--CDSKFCKKYDMKMHMSRAHY------------------GLKPFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 259 ~~~~~C~~--C~~~f~~~~~L~~H~~~~H~------------------~~~~~~C~~C~~~f~~~~~l~~H~ 310 (373)
++||+|++ |.|.|++...|..|+.--|- ..+||+|+.|++++.+...|+.|+
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 58999987 99999999999999854452 236677777777777777777775
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.46 E-value=0.00011 Score=37.21 Aligned_cols=20 Identities=35% Similarity=0.855 Sum_probs=9.1
Q ss_pred eecCCCCccccChhHHHHhh
Q psy10075 262 FTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~ 281 (373)
|.|++|++.|.+...|..|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~ 20 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHM 20 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHH
Confidence 34555555555555555554
No 34
>KOG1146|consensus
Probab=97.43 E-value=2.7e-05 Score=77.14 Aligned_cols=75 Identities=21% Similarity=0.471 Sum_probs=48.3
Q ss_pred cCcCcccccchHHHHHHhhhhhccCCccccCccccccCCchHHHHHHhhhcC------------------------CCCC
Q psy10075 56 CDLCEYQTNTKIYIRRHMLKIHLRQRNLECSLCDSTFSDNMDLKKHIQRKHT------------------------NDRP 111 (373)
Q Consensus 56 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~------------------------~~~~ 111 (373)
|.-++..+.+...+.-|+...+...+.++|+.|++.|.....|..||+..|. +..+
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p 518 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP 518 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence 3444455555555555555556666778888888888888888888766553 2345
Q ss_pred cccCCCCccccChHHHHHh
Q psy10075 112 FSCNFCDYKSKTKYDITSH 130 (373)
Q Consensus 112 ~~C~~C~~~f~~~~~l~~h 130 (373)
|.|..|...+....+|..|
T Consensus 519 ~~C~~C~~stttng~Lsih 537 (1406)
T KOG1146|consen 519 YPCRACNYSTTTNGNLSIH 537 (1406)
T ss_pred ccceeeeeeeecchHHHHH
Confidence 6666666666666666666
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.37 E-value=0.00013 Score=52.50 Aligned_cols=22 Identities=27% Similarity=0.711 Sum_probs=13.6
Q ss_pred CeecCCCCccccChhHHHHhhh
Q psy10075 261 PFTCEHCDSKFCKKYDMKMHMS 282 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~~L~~H~~ 282 (373)
.+.|..|++.|.+...|..||+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~ 71 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMR 71 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHH
T ss_pred CCCCCccCCCCcCHHHHHHHHc
Confidence 4666666666666666666664
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.36 E-value=7.1e-05 Score=39.20 Aligned_cols=24 Identities=25% Similarity=0.512 Sum_probs=15.5
Q ss_pred ccCCCCCCCccChhHHHHHHHhhc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNH 342 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H 342 (373)
|+|..|++.|.+..+|..|++.|+
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCccCCccCChhHHHHHhHHhc
Confidence 566666666666666666666554
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.32 E-value=0.00014 Score=38.07 Aligned_cols=25 Identities=24% Similarity=0.363 Sum_probs=18.1
Q ss_pred ceecCCCCccccChhhHhhccccCC
Q psy10075 25 NFICILCNYHAENFENIYLHLPSHF 49 (373)
Q Consensus 25 ~~~C~~C~~~f~~~~~l~~H~~~h~ 49 (373)
||.|..|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 5677777777777777777776653
No 38
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.31 E-value=0.00014 Score=60.91 Aligned_cols=24 Identities=38% Similarity=0.881 Sum_probs=22.2
Q ss_pred CCCCeecCCCCccccChhHHHHhh
Q psy10075 258 EDKPFTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 258 ~~~~~~C~~C~~~f~~~~~L~~H~ 281 (373)
..+||+|++|+|+|++...|..|+
T Consensus 395 ~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 395 KDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCCceeccccchhhccCccceecc
Confidence 469999999999999999999997
No 39
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.13 E-value=0.00025 Score=35.86 Aligned_cols=24 Identities=29% Similarity=0.769 Sum_probs=14.5
Q ss_pred ccCCCCCCCccChhHHHHHHHhhcc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
|+|+.|++... +.+|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777776666 6677777766653
No 40
>KOG2482|consensus
Probab=96.83 E-value=0.0027 Score=53.98 Aligned_cols=185 Identities=16% Similarity=0.234 Sum_probs=87.7
Q ss_pred HHHHHHHhhhccCCCceeccccccccc-chhHHHhhhhhccCCC---------------------CccccCCCchhccCc
Q psy10075 155 VELKNHILKIHQGQTRFQCSFCDRRFY-FKNECTKHYLEHHTNE---------------------KPFKCTLCSYKAKRN 212 (373)
Q Consensus 155 ~~l~~H~~~~h~~~~~~~C~~C~~~f~-~~~~l~~H~~~~h~~~---------------------~~~~C~~C~~~f~~~ 212 (373)
..|+++.+..-......+|-.|...+. .++.+..|+...|.-. ..+.|-+|.+.|+.+
T Consensus 129 eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdk 208 (423)
T KOG2482|consen 129 EALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDK 208 (423)
T ss_pred HHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCc
Confidence 345555432223345578888876654 5677777765554311 236677777777777
Q ss_pred HHHHHHHHHh-cCCCCccc------eecCC--Cccccc-ChHHH--------hhhhhccCC--CCCCCC--eecCCCCcc
Q psy10075 213 FELKQHILRI-HQDNIEAE------YHCTY--CDYICS-SRTIL--------KSHINKKHA--NDEDKP--FTCEHCDSK 270 (373)
Q Consensus 213 ~~l~~H~~~~-h~~~~~~~------~~C~~--C~~~f~-~~~~l--------~~H~~~~~~--~~~~~~--~~C~~C~~~ 270 (373)
..|+.||+.. |.....+. |.=.. =|+... ..+.+ .++-..+.. ..+..+ ..|-.|...
T Consensus 209 ntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~ 288 (423)
T KOG2482|consen 209 NTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNF 288 (423)
T ss_pred HHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccc
Confidence 7777777432 21100100 11100 011110 00000 000000000 011112 578888888
Q ss_pred ccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhh
Q psy10075 271 FCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 271 f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
..+...|..||+.+|.-.---.=+-=+-.|..+-.+..=+++ ....-.|..|..+|..+..|+.||..+
T Consensus 289 ~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRk--q~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 289 YENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRK--QKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred hhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHH--HhhccccccccccccCcchhhhhcccc
Confidence 777888888887767321000000000111111112111111 111246888889999999999997743
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.75 E-value=0.002 Score=54.78 Aligned_cols=132 Identities=26% Similarity=0.461 Sum_probs=78.1
Q ss_pred cccCC--CchhccCcHHHHHHHHHhcCCCCccceecCCC---ccccc------ChHHHhhhhhccCCCCCCCCe-ecCCC
Q psy10075 200 FKCTL--CSYKAKRNFELKQHILRIHQDNIEAEYHCTYC---DYICS------SRTILKSHINKKHANDEDKPF-TCEHC 267 (373)
Q Consensus 200 ~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C---~~~f~------~~~~l~~H~~~~~~~~~~~~~-~C~~C 267 (373)
|.||. |......-..|..|....|.. +.|.+| .+.|. +...|..|...-..+.|-+.. .|..|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~-----~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC 226 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF-----VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFC 226 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc-----EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhc
Confidence 55554 444444445666666554432 555555 23333 344566665322222222222 58888
Q ss_pred CccccChhHHHHhhhhhccCCCCccCCCCcc-------cccchhhhhhhhhhccCCCcccCCC--CC----CCccChhHH
Q psy10075 268 DSKFCKKYDMKMHMSRAHYGLKPFKCSFCEY-------KCTTKFGLKSHVTRIHSNQKYFCGS--CN----FSCTSNTDL 334 (373)
Q Consensus 268 ~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~-------~f~~~~~l~~H~~~~h~~~~~~C~~--C~----~~f~~~~~l 334 (373)
...|..-..|..|++..|. +|-+|.+ .|.+-.+|..|.+..| |.|.+ |. +.|+....|
T Consensus 227 ~~~FYdDDEL~~HcR~~HE-----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el 297 (493)
T COG5236 227 KIYFYDDDELRRHCRLRHE-----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTEL 297 (493)
T ss_pred cceecChHHHHHHHHhhhh-----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHH
Confidence 8888888888888866662 2555543 5666678888874333 44442 32 568888899
Q ss_pred HHHHHhhcccc
Q psy10075 335 LEHVQTNHIIV 345 (373)
Q Consensus 335 ~~H~~~~H~~~ 345 (373)
++|+...|++.
T Consensus 298 ~~h~~~~h~~~ 308 (493)
T COG5236 298 LEHLTRFHKVN 308 (493)
T ss_pred HHHHHHHhhcc
Confidence 99998888854
No 42
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.75 E-value=0.0009 Score=39.65 Aligned_cols=32 Identities=19% Similarity=0.229 Sum_probs=18.5
Q ss_pred CCcceecCCCCccccChhhHhhccccCCCCCc
Q psy10075 22 DSYNFICILCNYHAENFENIYLHLPSHFEQKL 53 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 53 (373)
++.|-+|++|+.+++...+|.+|+.+.++.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 56677777777777777777777766555443
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.68 E-value=0.00081 Score=33.96 Aligned_cols=23 Identities=43% Similarity=0.940 Sum_probs=10.4
Q ss_pred ccCCCCcccccchhhhhhhhhhcc
Q psy10075 291 FKCSFCEYKCTTKFGLKSHVTRIH 314 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~h 314 (373)
|+|+.|++... ...|..|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 34555555444 445555554443
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.55 E-value=0.0023 Score=32.70 Aligned_cols=23 Identities=35% Similarity=0.639 Sum_probs=18.5
Q ss_pred ccCCCCCCCccChhHHHHHHHhh
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
|.|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 57888888888888888888754
No 45
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.45 E-value=0.0012 Score=33.75 Aligned_cols=22 Identities=32% Similarity=0.807 Sum_probs=15.2
Q ss_pred ccCCCCCCCccChhHHHHHHHh
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQT 340 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~ 340 (373)
|.|.+|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4577777777777777777654
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.43 E-value=0.0015 Score=38.74 Aligned_cols=22 Identities=23% Similarity=0.644 Sum_probs=7.2
Q ss_pred CCeecCCCCccccChhHHHHhh
Q psy10075 260 KPFTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 260 ~~~~C~~C~~~f~~~~~L~~H~ 281 (373)
.|-.|++|+..+.+..+|.+|+
T Consensus 23 ~PatCP~C~a~~~~srnLrRHl 44 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHL 44 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHH
T ss_pred CCCCCCcchhhccchhhHHHHH
Confidence 3334444444444444444444
No 47
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.35 E-value=0.0038 Score=53.15 Aligned_cols=132 Identities=21% Similarity=0.426 Sum_probs=84.4
Q ss_pred ceecCC--CCccccChhhHhhccccCCCCCccccCcCc---------ccccchHHHHHHhhhhhccCCc----cccCccc
Q psy10075 25 NFICIL--CNYHAENFENIYLHLPSHFEQKLFSCDLCE---------YQTNTKIYIRRHMLKIHLRQRN----LECSLCD 89 (373)
Q Consensus 25 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---------~~f~~~~~l~~H~~~~h~~~~~----~~C~~C~ 89 (373)
.|.|+. |..+......|..|++..++ .+.|.+|- ....+...|..| ...-..+.. -.|..|+
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H-~~~G~~e~GFKGHP~C~FC~ 227 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDH-KNGGLEEEGFKGHPLCIFCK 227 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeeccccccc-ccCCccccCcCCCchhhhcc
Confidence 366776 77777778888889876554 25565553 223566777888 333332322 4588999
Q ss_pred cccCCchHHHHHHhhhcCCCCCcccCCCC----ccccChHHHHHhhhcccCCCCCCccccc--cc----ccccCHHHHHH
Q psy10075 90 STFSDNMDLKKHIQRKHTNDRPFSCNFCD----YKSKTKYDITSHSLRKHSKGDKPHKCTI--CE----STYLTKVELKN 159 (373)
Q Consensus 90 ~~f~~~~~l~~H~~~~h~~~~~~~C~~C~----~~f~~~~~l~~h~~~~h~~~~~~~~C~~--C~----~~f~~~~~l~~ 159 (373)
..|.+-..|..|++..|. +-+.|+.-+ .-|.+...|..|....| |.|.. |- .+|.....|..
T Consensus 228 ~~FYdDDEL~~HcR~~HE--~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h------y~ct~qtc~~~k~~vf~~~~el~~ 299 (493)
T COG5236 228 IYFYDDDELRRHCRLRHE--ACHICDMVGPIRYQYFKSYEDLEAHFRNAH------YCCTFQTCRVGKCYVFPYHTELLE 299 (493)
T ss_pred ceecChHHHHHHHHhhhh--hhhhhhccCccchhhhhCHHHHHHHhhcCc------eEEEEEEEecCcEEEeccHHHHHH
Confidence 999999999999877763 223333222 23777788888844333 44432 21 57888888888
Q ss_pred HHhhhccC
Q psy10075 160 HILKIHQG 167 (373)
Q Consensus 160 H~~~~h~~ 167 (373)
|+.+.|..
T Consensus 300 h~~~~h~~ 307 (493)
T COG5236 300 HLTRFHKV 307 (493)
T ss_pred HHHHHhhc
Confidence 88777743
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.17 E-value=0.0024 Score=32.54 Aligned_cols=22 Identities=32% Similarity=0.564 Sum_probs=13.2
Q ss_pred eecCCCCccccChhhHhhcccc
Q psy10075 26 FICILCNYHAENFENIYLHLPS 47 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~ 47 (373)
|.|++|+++|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666543
No 49
>KOG2482|consensus
Probab=96.17 E-value=0.0092 Score=50.85 Aligned_cols=149 Identities=19% Similarity=0.297 Sum_probs=79.7
Q ss_pred hhHHHhhhhhccCCCCccccCCCchhcc-CcHHHHHHHHHhcCCCCccceecCCCcccccChHHHhhhhhccCCCCCCCC
Q psy10075 183 KNECTKHYLEHHTNEKPFKCTLCSYKAK-RNFELKQHILRIHQDNIEAEYHCTYCDYICSSRTILKSHINKKHANDEDKP 261 (373)
Q Consensus 183 ~~~l~~H~~~~h~~~~~~~C~~C~~~f~-~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~ 261 (373)
+..|..+++..-......+|-.|+..+. .++.+..|+-..|.-+...|- .......|..|++.. -..
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpD-------niVyvnelLehLkek-----L~r 195 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPD-------NIVYVNELLEHLKEK-----LER 195 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCc-------ceeeHHHHHHHHHHH-----Hhh
Confidence 5677777766545556678999987654 456778888777765443331 111223445554321 123
Q ss_pred eecCCCCccccChhHHHHhhhhh-ccCCCC--------ccCCC--Ccccccc-hhhhhhhhh-------------hccCC
Q psy10075 262 FTCEHCDSKFCKKYDMKMHMSRA-HYGLKP--------FKCSF--CEYKCTT-KFGLKSHVT-------------RIHSN 316 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~~~~-H~~~~~--------~~C~~--C~~~f~~-~~~l~~H~~-------------~~h~~ 316 (373)
+.|-.|.+.|..+..|..||+.. |..-.| |.=.+ =|++..- .+.+.+-.. ..++.
T Consensus 196 ~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a 275 (423)
T KOG2482|consen 196 LRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDA 275 (423)
T ss_pred heeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCC
Confidence 56777777777777777777432 322222 00000 0111110 011100000 00111
Q ss_pred Cc--ccCCCCCCCccChhHHHHHHHhhcc
Q psy10075 317 QK--YFCGSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 317 ~~--~~C~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
.+ .+|-.|.....+...|.+||+..|-
T Consensus 276 ~a~~v~CLfC~~~~en~~~l~eHmk~vHe 304 (423)
T KOG2482|consen 276 EALSVVCLFCTNFYENPVFLFEHMKIVHE 304 (423)
T ss_pred CccceEEEeeccchhhHHHHHHHHHHHHH
Confidence 22 5888888888888888889888875
No 50
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.13 E-value=0.0046 Score=31.55 Aligned_cols=19 Identities=37% Similarity=0.716 Sum_probs=9.9
Q ss_pred ecCCCCccccChhHHHHhh
Q psy10075 263 TCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 263 ~C~~C~~~f~~~~~L~~H~ 281 (373)
.|+.|++.|.....|..|+
T Consensus 2 ~C~~C~~~f~~~~~l~~H~ 20 (26)
T smart00355 2 RCPECGKVFKSKSALKEHM 20 (26)
T ss_pred CCCCCcchhCCHHHHHHHH
Confidence 4555555555555555555
No 51
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.02 E-value=0.0047 Score=32.14 Aligned_cols=22 Identities=23% Similarity=0.428 Sum_probs=17.6
Q ss_pred eecCCCCccccChhhHhhcccc
Q psy10075 26 FICILCNYHAENFENIYLHLPS 47 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~ 47 (373)
|.|.+|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6788888888888888888764
No 52
>KOG2231|consensus
Probab=95.87 E-value=0.022 Score=54.07 Aligned_cols=120 Identities=25% Similarity=0.446 Sum_probs=60.9
Q ss_pred eecccccccccchhHHHhhhhhccCCCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCc---cc------cc
Q psy10075 171 FQCSFCDRRFYFKNECTKHYLEHHTNEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCD---YI------CS 241 (373)
Q Consensus 171 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~---~~------f~ 241 (373)
+.|.+|++.+.-. . ..-.|..| -.|.+...|+.|+...|.. +.|..|- +. ..
T Consensus 100 ~~C~~C~~~~~~~--------~-----~~~~~~~c-~~~~s~~~Lk~H~~~~H~~-----~~c~lC~~~~kif~~e~k~Y 160 (669)
T KOG2231|consen 100 HSCHICDRRFRAL--------Y-----NKKECLHC-TEFKSVENLKNHMRDQHKL-----HLCSLCLQNLKIFINERKLY 160 (669)
T ss_pred hhcCccccchhhh--------c-----ccCCCccc-cchhHHHHHHHHHHHhhhh-----hccccccccceeeeeeeehe
Confidence 6688887776411 0 01125555 5555666677776555533 3333331 11 12
Q ss_pred ChHHHhhhhhccCC-CCCCCC-eecCCCCccccChhHHHHhhhhhccCCCCccCCCCc------ccccchhhhhhhhhhc
Q psy10075 242 SRTILKSHINKKHA-NDEDKP-FTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCE------YKCTTKFGLKSHVTRI 313 (373)
Q Consensus 242 ~~~~l~~H~~~~~~-~~~~~~-~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~------~~f~~~~~l~~H~~~~ 313 (373)
+...|..|++.-.. ....+. -.|..|...|-....|..|++..| |.|..|. ..|.+-..|..|.+..
T Consensus 161 t~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~ 235 (669)
T KOG2231|consen 161 TRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKG 235 (669)
T ss_pred hHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhc
Confidence 44556666543222 111111 356677777777777777775555 4455552 2444556666666433
Q ss_pred c
Q psy10075 314 H 314 (373)
Q Consensus 314 h 314 (373)
|
T Consensus 236 H 236 (669)
T KOG2231|consen 236 H 236 (669)
T ss_pred C
Confidence 3
No 53
>PRK04860 hypothetical protein; Provisional
Probab=95.83 E-value=0.0056 Score=47.57 Aligned_cols=39 Identities=23% Similarity=0.575 Sum_probs=29.7
Q ss_pred CeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchh
Q psy10075 261 PFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKF 304 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~ 304 (373)
+|.|. |+. ....+..|. ++|+++++|.|..|+..|....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~-ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHN-RVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEcC-CCC---eeCHHHHHH-HHhcCCccEECCCCCceeEEec
Confidence 58887 876 666778888 8888888888888888776443
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.35 E-value=0.0095 Score=30.96 Aligned_cols=19 Identities=26% Similarity=0.560 Sum_probs=8.1
Q ss_pred cCCCCcccccchhhhhhhh
Q psy10075 292 KCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 292 ~C~~C~~~f~~~~~l~~H~ 310 (373)
.|..|++.|.+...+..|+
T Consensus 3 ~C~~C~k~f~~~~~~~~H~ 21 (27)
T PF12171_consen 3 YCDACDKYFSSENQLKQHM 21 (27)
T ss_dssp BBTTTTBBBSSHHHHHCCT
T ss_pred CcccCCCCcCCHHHHHHHH
Confidence 3444444444444444443
No 55
>PRK04860 hypothetical protein; Provisional
Probab=94.67 E-value=0.014 Score=45.47 Aligned_cols=40 Identities=18% Similarity=0.543 Sum_probs=34.2
Q ss_pred CCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChh
Q psy10075 289 KPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNT 332 (373)
Q Consensus 289 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~ 332 (373)
-+|.|. |+. ....+.+|.+.+.++.+|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 479998 987 6788999998888889999999999887543
No 56
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=94.59 E-value=0.034 Score=28.24 Aligned_cols=19 Identities=21% Similarity=0.352 Sum_probs=10.8
Q ss_pred cCCCCCCCccChhHHHHHHH
Q psy10075 320 FCGSCNFSCTSNTDLLEHVQ 339 (373)
Q Consensus 320 ~C~~C~~~f~~~~~l~~H~~ 339 (373)
.|++||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 466666666 4455655543
No 57
>KOG4173|consensus
Probab=94.21 E-value=0.012 Score=46.40 Aligned_cols=83 Identities=25% Similarity=0.508 Sum_probs=43.9
Q ss_pred ccccCC--CchhccCcHHHHHHHHHhcCCCCccceecCCCcccccChHHHhhhhhccCC-------CCCCCCeecCC--C
Q psy10075 199 PFKCTL--CSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDYICSSRTILKSHINKKHA-------NDEDKPFTCEH--C 267 (373)
Q Consensus 199 ~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~-------~~~~~~~~C~~--C 267 (373)
.+.|++ |...|.+......|....|.+ .|..|.+.|.+.--|..|+.++|. ..|...|.|-+ |
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 355554 555555555555555444433 566666666666666666554442 12344455532 6
Q ss_pred CccccChhHHHHhhhhhccC
Q psy10075 268 DSKFCKKYDMKMHMSRAHYG 287 (373)
Q Consensus 268 ~~~f~~~~~L~~H~~~~H~~ 287 (373)
+..|.+...-..|+-..|.-
T Consensus 153 t~KFkT~r~RkdH~I~~Hk~ 172 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHMIRMHKY 172 (253)
T ss_pred hhhhhhhhhhhhHHHHhccC
Confidence 66666655555555445543
No 58
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=93.62 E-value=0.095 Score=38.16 Aligned_cols=25 Identities=24% Similarity=0.549 Sum_probs=21.1
Q ss_pred ccC----CCCCCCccChhHHHHHHHhhcc
Q psy10075 319 YFC----GSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 319 ~~C----~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
|.| ..|++...+...+.+|++.+|+
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 888 8888888888888888888874
No 59
>KOG2785|consensus
Probab=93.48 E-value=0.14 Score=44.90 Aligned_cols=22 Identities=18% Similarity=0.403 Sum_probs=18.9
Q ss_pred CcccccccccccCHHHHHHHHh
Q psy10075 141 PHKCTICESTYLTKVELKNHIL 162 (373)
Q Consensus 141 ~~~C~~C~~~f~~~~~l~~H~~ 162 (373)
.|.|..|...|.+...-+.|++
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyK 24 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYK 24 (390)
T ss_pred cceeeceeeeeccHHHHHHHhh
Confidence 5889999999999888888874
No 60
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.43 E-value=0.054 Score=30.04 Aligned_cols=21 Identities=33% Similarity=0.592 Sum_probs=11.6
Q ss_pred ceecCCCCccccChhhHhhcc
Q psy10075 25 NFICILCNYHAENFENIYLHL 45 (373)
Q Consensus 25 ~~~C~~C~~~f~~~~~l~~H~ 45 (373)
+|.|++|++.|.+...+..|+
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~ 23 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHL 23 (35)
T ss_pred CeEccccCCccCCHHHHHHHH
Confidence 455555555555555555554
No 61
>KOG2231|consensus
Probab=93.17 E-value=0.25 Score=47.26 Aligned_cols=19 Identities=26% Similarity=0.553 Sum_probs=9.7
Q ss_pred cCCCchhccCcHHHHHHHH
Q psy10075 202 CTLCSYKAKRNFELKQHIL 220 (373)
Q Consensus 202 C~~C~~~f~~~~~l~~H~~ 220 (373)
|..|...|.....|.+|+.
T Consensus 185 C~~C~~~fld~~el~rH~~ 203 (669)
T KOG2231|consen 185 CKFCHERFLDDDELYRHLR 203 (669)
T ss_pred chhhhhhhccHHHHHHhhc
Confidence 4455555555555555543
No 62
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=93.16 E-value=0.069 Score=29.23 Aligned_cols=10 Identities=20% Similarity=0.893 Sum_probs=4.5
Q ss_pred eecCCCCccc
Q psy10075 262 FTCEHCDSKF 271 (373)
Q Consensus 262 ~~C~~C~~~f 271 (373)
|+|.+||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 3444444443
No 63
>KOG2785|consensus
Probab=93.14 E-value=0.1 Score=45.57 Aligned_cols=142 Identities=18% Similarity=0.273 Sum_probs=89.8
Q ss_pred ccccCCCchhccCcHHHHHHHHH-hcCCC---------------------------------CccceecCCCcccccChH
Q psy10075 199 PFKCTLCSYKAKRNFELKQHILR-IHQDN---------------------------------IEAEYHCTYCDYICSSRT 244 (373)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~-~h~~~---------------------------------~~~~~~C~~C~~~f~~~~ 244 (373)
-|.|.-|...|.+....+.|+.+ .|.-+ ...++.|..|++.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 47899999999998888888743 33111 122478999999999888
Q ss_pred HHhhhhhccC-CCCCC-------------CCee-------------cCCCCccccChhHHHHhhhhh------------c
Q psy10075 245 ILKSHINKKH-ANDED-------------KPFT-------------CEHCDSKFCKKYDMKMHMSRA------------H 285 (373)
Q Consensus 245 ~l~~H~~~~~-~~~~~-------------~~~~-------------C~~C~~~f~~~~~L~~H~~~~------------H 285 (373)
+...|+.... ..... ..+. +..+...+........+. .. -
T Consensus 83 a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~-~dd~~Edi~~d~~~e 161 (390)
T KOG2785|consen 83 AHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEE-EDDEEEDIEEDGDDE 161 (390)
T ss_pred hHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhc-cCcchhhhhhccchh
Confidence 8888875321 10000 1111 222222222211122221 00 0
Q ss_pred cCCCCccCCCCcccccchhhhhhhhhhccCC-----------------------CcccCCCCC---CCccChhHHHHHHH
Q psy10075 286 YGLKPFKCSFCEYKCTTKFGLKSHVTRIHSN-----------------------QKYFCGSCN---FSCTSNTDLLEHVQ 339 (373)
Q Consensus 286 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~ 339 (373)
....|-.|-.|+..+.+...-..||..+|+- .-+.|-.|+ +.|.+....+.||.
T Consensus 162 ~e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 162 DELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred cccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 0112467899999999999999999888872 137888998 99999999999987
Q ss_pred hh
Q psy10075 340 TN 341 (373)
Q Consensus 340 ~~ 341 (373)
..
T Consensus 242 ~K 243 (390)
T KOG2785|consen 242 DK 243 (390)
T ss_pred hc
Confidence 43
No 64
>KOG4173|consensus
Probab=92.78 E-value=0.039 Score=43.58 Aligned_cols=81 Identities=23% Similarity=0.487 Sum_probs=45.7
Q ss_pred cceecCC--CcccccChHHHhhhhhccCCCCCCCCeecCCCCccccChhHHHHhhhhhc---------cCCCCccCCC--
Q psy10075 229 AEYHCTY--CDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKKYDMKMHMSRAH---------YGLKPFKCSF-- 295 (373)
Q Consensus 229 ~~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H---------~~~~~~~C~~-- 295 (373)
..+.|+. |...|.+......|.+.-|.+ .|..|.+.|.+...|..|+...| .|...|+|-.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEg 151 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEG 151 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHh
Confidence 3455554 555666666666665444433 46666666666666666664444 2344566654
Q ss_pred CcccccchhhhhhhhhhccC
Q psy10075 296 CEYKCTTKFGLKSHVTRIHS 315 (373)
Q Consensus 296 C~~~f~~~~~l~~H~~~~h~ 315 (373)
|+..|.+..+-..|+...|.
T Consensus 152 Ct~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 152 CTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhhhHHHHhcc
Confidence 66666666666666654443
No 65
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.47 E-value=0.11 Score=28.78 Aligned_cols=23 Identities=22% Similarity=0.645 Sum_probs=18.2
Q ss_pred cccCCCCCCCccChhHHHHHHHh
Q psy10075 318 KYFCGSCNFSCTSNTDLLEHVQT 340 (373)
Q Consensus 318 ~~~C~~C~~~f~~~~~l~~H~~~ 340 (373)
+|.|.+|+..|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46788888888888888888664
No 66
>KOG2893|consensus
Probab=91.60 E-value=0.046 Score=44.21 Aligned_cols=45 Identities=27% Similarity=0.571 Sum_probs=20.8
Q ss_pred cCccccccCCchHHHHHHhhhcCCCCCcccCCCCccccChHHHHHhhhcc
Q psy10075 85 CSLCDSTFSDNMDLKKHIQRKHTNDRPFSCNFCDYKSKTKYDITSHSLRK 134 (373)
Q Consensus 85 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~h~~~~ 134 (373)
|..|+..|.+..-|++|.+.. -|+|..|.+.+.+---|..|..++
T Consensus 13 cwycnrefddekiliqhqkak-----hfkchichkkl~sgpglsihcmqv 57 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK-----HFKCHICHKKLFSGPGLSIHCMQV 57 (341)
T ss_pred eeecccccchhhhhhhhhhhc-----cceeeeehhhhccCCCceeehhhh
Confidence 555555555555555553222 255555555444443444443333
No 67
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=90.87 E-value=0.34 Score=30.00 Aligned_cols=11 Identities=18% Similarity=0.833 Sum_probs=5.7
Q ss_pred cceecCCCccc
Q psy10075 229 AEYHCTYCDYI 239 (373)
Q Consensus 229 ~~~~C~~C~~~ 239 (373)
..|.|+.||..
T Consensus 26 v~F~CPnCGe~ 36 (61)
T COG2888 26 VKFPCPNCGEV 36 (61)
T ss_pred eEeeCCCCCce
Confidence 34555555543
No 68
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=90.46 E-value=0.43 Score=29.68 Aligned_cols=11 Identities=18% Similarity=0.809 Sum_probs=6.2
Q ss_pred cceecCCCccc
Q psy10075 229 AEYHCTYCDYI 239 (373)
Q Consensus 229 ~~~~C~~C~~~ 239 (373)
..|.|+.||..
T Consensus 24 ~~F~CPnCG~~ 34 (59)
T PRK14890 24 VKFLCPNCGEV 34 (59)
T ss_pred CEeeCCCCCCe
Confidence 44666666654
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=90.28 E-value=0.35 Score=35.19 Aligned_cols=56 Identities=21% Similarity=0.527 Sum_probs=45.2
Q ss_pred CCCeecCCCCccccChhHHHHhhhhhccCCC-----------------------------------------CccC----
Q psy10075 259 DKPFTCEHCDSKFCKKYDMKMHMSRAHYGLK-----------------------------------------PFKC---- 293 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~-----------------------------------------~~~C---- 293 (373)
.+..+|..|+..... +++..|++..|.... -|.|
T Consensus 9 ~~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~ 87 (109)
T PF12013_consen 9 YRVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDP 87 (109)
T ss_pred CCEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCC
Confidence 566899999998866 889999974542110 1899
Q ss_pred CCCcccccchhhhhhhhhhccC
Q psy10075 294 SFCEYKCTTKFGLKSHVTRIHS 315 (373)
Q Consensus 294 ~~C~~~f~~~~~l~~H~~~~h~ 315 (373)
..|++.+.+...|..|++.+|+
T Consensus 88 ~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 88 PHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCCCcEeccHHHHHHHHHHhcC
Confidence 9999999999999999988875
No 70
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=88.88 E-value=0.24 Score=45.99 Aligned_cols=60 Identities=23% Similarity=0.333 Sum_probs=33.0
Q ss_pred ceecCCCCccccChhhHhhccc--cCCCC--CccccC--cCcccccchHHHHHHhhhhhccCCcccc
Q psy10075 25 NFICILCNYHAENFENIYLHLP--SHFEQ--KLFSCD--LCEYQTNTKIYIRRHMLKIHLRQRNLEC 85 (373)
Q Consensus 25 ~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~~C 85 (373)
++.|..|...|.....|.+|.. .|.++ +++.|+ .|++.|.....+..| ...|....++.+
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 354 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRH-ILLHTSISPAKE 354 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCC-cccccCCCcccc
Confidence 4555566666666666666655 45555 555555 455555555555555 344444443333
No 71
>KOG2893|consensus
Probab=88.86 E-value=0.14 Score=41.48 Aligned_cols=42 Identities=31% Similarity=0.575 Sum_probs=19.0
Q ss_pred cCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhh
Q psy10075 264 CEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 264 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 310 (373)
|=.|++.|.....|..|++..| |+|-+|.+..-+-..|..|-
T Consensus 13 cwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsihc 54 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIHC 54 (341)
T ss_pred eeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceeeh
Confidence 4444444444444444443333 44555544444444444443
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.78 E-value=0.15 Score=30.90 Aligned_cols=24 Identities=25% Similarity=0.584 Sum_probs=10.3
Q ss_pred ccCCCCCCCccChhHHHHHHHhhc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNH 342 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H 342 (373)
++|+.|+..|....+...|+...|
T Consensus 18 lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 18 LRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred eeCCchhHHHHHhHHHHHHhhHHh
Confidence 444444444444444444444433
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.30 E-value=0.3 Score=29.63 Aligned_cols=30 Identities=20% Similarity=0.475 Sum_probs=21.0
Q ss_pred CCCCccccCCCchhccCcHHHHHHHHHhcC
Q psy10075 195 TNEKPFKCTLCSYKAKRNFELKQHILRIHQ 224 (373)
Q Consensus 195 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 224 (373)
.++..+.||-|+..|....+..+|....|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 456667777777777777777777766553
No 74
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=87.61 E-value=1.8 Score=31.27 Aligned_cols=47 Identities=15% Similarity=0.318 Sum_probs=24.6
Q ss_pred CCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHHHHHhh
Q psy10075 293 CSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 293 C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
|--|...|........ .......+|+|+.|...|-..=++..|...|
T Consensus 58 C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 58 CFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ccCcCCCCCCcccccc--cccccccceeCCCCCCccccccchhhhhhcc
Confidence 6666666654321110 0011223477777777777666666665554
No 75
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=87.28 E-value=0.2 Score=41.39 Aligned_cols=22 Identities=27% Similarity=0.433 Sum_probs=12.2
Q ss_pred CCeecCCCCccccChhHHHHhh
Q psy10075 260 KPFTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 260 ~~~~C~~C~~~f~~~~~L~~H~ 281 (373)
+.+.||+|+..|..........
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~ 25 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKI 25 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCc
Confidence 3456666666666654443333
No 76
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=87.17 E-value=0.47 Score=26.12 Aligned_cols=9 Identities=22% Similarity=0.748 Sum_probs=3.9
Q ss_pred eecCCCCcc
Q psy10075 262 FTCEHCDSK 270 (373)
Q Consensus 262 ~~C~~C~~~ 270 (373)
|+|.+||..
T Consensus 3 ~~C~~CG~i 11 (34)
T cd00729 3 WVCPVCGYI 11 (34)
T ss_pred EECCCCCCE
Confidence 344444433
No 77
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=85.25 E-value=0.76 Score=28.53 Aligned_cols=14 Identities=21% Similarity=0.389 Sum_probs=7.4
Q ss_pred CccccCCCchhccC
Q psy10075 198 KPFKCTLCSYKAKR 211 (373)
Q Consensus 198 ~~~~C~~C~~~f~~ 211 (373)
..|.||.||.....
T Consensus 26 v~F~CPnCGe~~I~ 39 (61)
T COG2888 26 VKFPCPNCGEVEIY 39 (61)
T ss_pred eEeeCCCCCceeee
Confidence 34566666644433
No 78
>PHA00626 hypothetical protein
Probab=85.25 E-value=0.41 Score=29.17 Aligned_cols=12 Identities=17% Similarity=0.539 Sum_probs=5.7
Q ss_pred CeecCCCCcccc
Q psy10075 261 PFTCEHCDSKFC 272 (373)
Q Consensus 261 ~~~C~~C~~~f~ 272 (373)
.|.|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 344555554443
No 79
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=84.26 E-value=0.54 Score=43.64 Aligned_cols=150 Identities=21% Similarity=0.328 Sum_probs=81.5
Q ss_pred CceecccccccccchhHHHhhhhh-ccCCC--CccccC--CCchhccCcHHHHHHHHHhcCCCCccceec--CCCccccc
Q psy10075 169 TRFQCSFCDRRFYFKNECTKHYLE-HHTNE--KPFKCT--LCSYKAKRNFELKQHILRIHQDNIEAEYHC--TYCDYICS 241 (373)
Q Consensus 169 ~~~~C~~C~~~f~~~~~l~~H~~~-~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C--~~C~~~f~ 241 (373)
.++.|..|...|.....+..|... .|.++ .++.|+ .|++.|.+...+..|. ..|.. ..++.+ ..+...+.
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~~ 364 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHI-LLHTS--ISPAKEKLLNSSSKFS 364 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCc-ccccC--CCccccccccCccccc
Confidence 356777777777777777777532 56666 777777 6777777777777775 45544 222222 22333333
Q ss_pred ChHHHhhhhh--ccCCCCCCCCeecC--CCCccccChhHHHHhhhhhccCCC--CccCCCCcccccchhhhhhhhhhccC
Q psy10075 242 SRTILKSHIN--KKHANDEDKPFTCE--HCDSKFCKKYDMKMHMSRAHYGLK--PFKCSFCEYKCTTKFGLKSHVTRIHS 315 (373)
Q Consensus 242 ~~~~l~~H~~--~~~~~~~~~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~~h~ 315 (373)
....-..+.. ..........+.+. .|...+.....+..|. ..|.... .+.+..|...+.....+..|.+.+..
T Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (467)
T COG5048 365 PLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHI-ITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTN 443 (467)
T ss_pred cccCCCCccchhhccCccCCccccccccchhhhhcccccccccc-ccccccCCcCCCCCcchhhccCccccccccccccc
Confidence 2221100000 00011122333332 2555666666666666 5555444 45666777777777777777755555
Q ss_pred CCcccCC
Q psy10075 316 NQKYFCG 322 (373)
Q Consensus 316 ~~~~~C~ 322 (373)
..++.|.
T Consensus 444 ~~~~~~~ 450 (467)
T COG5048 444 HAPLLCS 450 (467)
T ss_pred CCceeec
Confidence 5544443
No 80
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=83.78 E-value=0.93 Score=26.65 Aligned_cols=10 Identities=20% Similarity=0.836 Sum_probs=4.3
Q ss_pred eecCCCCccc
Q psy10075 262 FTCEHCDSKF 271 (373)
Q Consensus 262 ~~C~~C~~~f 271 (373)
|+|..||..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 81
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=83.77 E-value=0.44 Score=39.40 Aligned_cols=23 Identities=9% Similarity=0.151 Sum_probs=17.0
Q ss_pred CcceecCCCCccccChhhHhhcc
Q psy10075 23 SYNFICILCNYHAENFENIYLHL 45 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~ 45 (373)
++.+.||+|+..|.+........
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~ 25 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKI 25 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCc
Confidence 56789999999998876554433
No 82
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=83.24 E-value=0.81 Score=25.74 Aligned_cols=11 Identities=18% Similarity=0.763 Sum_probs=5.7
Q ss_pred Ccccccccccc
Q psy10075 141 PHKCTICESTY 151 (373)
Q Consensus 141 ~~~C~~C~~~f 151 (373)
..+|+.|+..|
T Consensus 25 ~vrC~~C~~~f 35 (37)
T PF13719_consen 25 KVRCPKCGHVF 35 (37)
T ss_pred EEECCCCCcEe
Confidence 45555555544
No 83
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=83.14 E-value=0.38 Score=37.17 Aligned_cols=16 Identities=19% Similarity=0.260 Sum_probs=10.7
Q ss_pred ccCCCCCCCccChhHH
Q psy10075 319 YFCGSCNFSCTSNTDL 334 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l 334 (373)
|+|+.||+.|.+...+
T Consensus 29 ~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLACGKRFTTFERV 44 (154)
T ss_pred eeccccCCcceEeEec
Confidence 6777777777765543
No 84
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.81 E-value=0.71 Score=33.27 Aligned_cols=15 Identities=27% Similarity=0.877 Sum_probs=7.3
Q ss_pred CCeecCCCCccccCh
Q psy10075 260 KPFTCEHCDSKFCKK 274 (373)
Q Consensus 260 ~~~~C~~C~~~f~~~ 274 (373)
.|.+|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 344555555555433
No 85
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=82.76 E-value=0.79 Score=25.61 Aligned_cols=11 Identities=18% Similarity=0.806 Sum_probs=5.7
Q ss_pred Ccccccccccc
Q psy10075 141 PHKCTICESTY 151 (373)
Q Consensus 141 ~~~C~~C~~~f 151 (373)
..+|+.|+.+|
T Consensus 25 ~v~C~~C~~~f 35 (36)
T PF13717_consen 25 KVRCSKCGHVF 35 (36)
T ss_pred EEECCCCCCEe
Confidence 45555555544
No 86
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=81.91 E-value=1.1 Score=26.56 Aligned_cols=10 Identities=20% Similarity=0.999 Sum_probs=5.3
Q ss_pred eecCCCCccc
Q psy10075 262 FTCEHCDSKF 271 (373)
Q Consensus 262 ~~C~~C~~~f 271 (373)
|+|+.||..|
T Consensus 4 y~C~~CG~~~ 13 (46)
T PRK00398 4 YKCARCGREV 13 (46)
T ss_pred EECCCCCCEE
Confidence 5555555544
No 87
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=81.80 E-value=0.87 Score=25.70 Aligned_cols=10 Identities=20% Similarity=0.786 Sum_probs=4.9
Q ss_pred cccccccccc
Q psy10075 142 HKCTICESTY 151 (373)
Q Consensus 142 ~~C~~C~~~f 151 (373)
..|+.|+..|
T Consensus 26 v~C~~C~~~~ 35 (38)
T TIGR02098 26 VRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEE
Confidence 4455555444
No 88
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=80.56 E-value=2.8 Score=30.28 Aligned_cols=19 Identities=32% Similarity=0.691 Sum_probs=10.2
Q ss_pred CcccccccccccCHHHHHH
Q psy10075 141 PHKCTICESTYLTKVELKN 159 (373)
Q Consensus 141 ~~~C~~C~~~f~~~~~l~~ 159 (373)
|..|+.|+....+...|.+
T Consensus 15 P~~CpiCgLtLVss~HLAR 33 (112)
T TIGR00622 15 PVECPICGLTLILSTHLAR 33 (112)
T ss_pred CCcCCcCCCEEeccchHHH
Confidence 4556666555555544443
No 89
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=80.43 E-value=1.2 Score=34.72 Aligned_cols=24 Identities=29% Similarity=0.704 Sum_probs=14.9
Q ss_pred CeecCCCCccccChhHHHHhhhhhccCCCCccCCCCcc
Q psy10075 261 PFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEY 298 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~ 298 (373)
.|+|++||.. +.++.|-+||+||.
T Consensus 134 ~~vC~vCGy~--------------~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYT--------------HEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCc--------------ccCCCCCcCCCCCC
Confidence 5777777643 33456677777763
No 90
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=78.12 E-value=1.2 Score=25.61 Aligned_cols=30 Identities=23% Similarity=0.420 Sum_probs=16.9
Q ss_pred ceecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 25 NFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 25 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
.|.|+.||..|........ ...-.|+.|+.
T Consensus 5 ~y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (41)
T smart00834 5 EYRCEDCGHTFEVLQKISD-------DPLATCPECGG 34 (41)
T ss_pred EEEcCCCCCEEEEEEecCC-------CCCCCCCCCCC
Confidence 3667777777765433311 34456777764
No 91
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=77.94 E-value=1.2 Score=27.26 Aligned_cols=29 Identities=21% Similarity=0.473 Sum_probs=15.9
Q ss_pred eecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 26 FICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
|.|..|+..|.....+.. +.+-.|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (52)
T TIGR02605 6 YRCTACGHRFEVLQKMSD-------DPLATCPECGG 34 (52)
T ss_pred EEeCCCCCEeEEEEecCC-------CCCCCCCCCCC
Confidence 667777777765432211 23445777763
No 92
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=75.75 E-value=1.6 Score=25.36 Aligned_cols=29 Identities=21% Similarity=0.440 Sum_probs=16.4
Q ss_pred eecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 26 FICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
|.|..||..|.....+.. ..+-.|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (42)
T PF09723_consen 6 YRCEECGHEFEVLQSISE-------DDPVPCPECGS 34 (42)
T ss_pred EEeCCCCCEEEEEEEcCC-------CCCCcCCCCCC
Confidence 667777777655533311 34456666664
No 93
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=75.47 E-value=2 Score=31.05 Aligned_cols=13 Identities=23% Similarity=0.723 Sum_probs=6.7
Q ss_pred CcccccccccccC
Q psy10075 141 PHKCTICESTYLT 153 (373)
Q Consensus 141 ~~~C~~C~~~f~~ 153 (373)
|..||.||..|..
T Consensus 26 PivCP~CG~~~~~ 38 (108)
T PF09538_consen 26 PIVCPKCGTEFPP 38 (108)
T ss_pred CccCCCCCCccCc
Confidence 4455555555543
No 94
>KOG2186|consensus
Probab=74.38 E-value=1.5 Score=36.27 Aligned_cols=45 Identities=20% Similarity=0.424 Sum_probs=33.4
Q ss_pred eecCCCCccccChhhHhhccccCCCCCccccCcCcccccchHHHHHHh
Q psy10075 26 FICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQTNTKIYIRRHM 73 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 73 (373)
|.|.+||-+..-. .+..|+....+ .-|.|..|+..|.. ..+..|.
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhh
Confidence 6788888777654 56678877766 66888888888877 6667774
No 95
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=73.24 E-value=2.9 Score=32.23 Aligned_cols=18 Identities=17% Similarity=0.316 Sum_probs=8.5
Q ss_pred CcceecCCCCccccChhh
Q psy10075 23 SYNFICILCNYHAENFEN 40 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~ 40 (373)
..-|.|+.|+..|...++
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea 114 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEA 114 (147)
T ss_pred CcEEECcCCCCEeeHHHH
Confidence 334555555555544433
No 96
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=72.53 E-value=3.6 Score=24.13 Aligned_cols=26 Identities=19% Similarity=0.429 Sum_probs=13.2
Q ss_pred CCCeecCCCCccccC----hhHHHHhhhhh
Q psy10075 259 DKPFTCEHCDSKFCK----KYDMKMHMSRA 284 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~----~~~L~~H~~~~ 284 (373)
.....|..|++.+.. .+.|..|++..
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~ 43 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKK 43 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHHT
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhhh
Confidence 344567777776654 35666666433
No 97
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=72.42 E-value=2.8 Score=32.80 Aligned_cols=24 Identities=25% Similarity=0.800 Sum_probs=18.9
Q ss_pred ceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCc
Q psy10075 230 EYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDS 269 (373)
Q Consensus 230 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~ 269 (373)
.|.|++||..+ .++.|-+||+||-
T Consensus 134 ~~vC~vCGy~~----------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH----------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc----------------cCCCCCcCCCCCC
Confidence 69999998753 3378889999984
No 98
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=71.53 E-value=2.7 Score=32.88 Aligned_cols=15 Identities=20% Similarity=0.383 Sum_probs=6.5
Q ss_pred eecCCCCccccChhH
Q psy10075 262 FTCEHCDSKFCKKYD 276 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~ 276 (373)
|.|+.|+..|+...+
T Consensus 110 Y~Cp~c~~r~tf~eA 124 (158)
T TIGR00373 110 FICPNMCVRFTFNEA 124 (158)
T ss_pred EECCCCCcEeeHHHH
Confidence 444444444443333
No 99
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=71.19 E-value=3.9 Score=22.12 Aligned_cols=11 Identities=18% Similarity=0.806 Sum_probs=5.3
Q ss_pred eecCCCCcccc
Q psy10075 262 FTCEHCDSKFC 272 (373)
Q Consensus 262 ~~C~~C~~~f~ 272 (373)
|.|..||..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45555555543
No 100
>KOG2186|consensus
Probab=71.17 E-value=1.7 Score=35.91 Aligned_cols=45 Identities=31% Similarity=0.559 Sum_probs=23.4
Q ss_pred eecCCCCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhh
Q psy10075 262 FTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 310 (373)
|.|.+||.+.. ...+..|+.+.+. .-|.|--|+..|.. .++..|.
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn--~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN--AYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC--CeeEEeeccccccc-chhhhhh
Confidence 45555555553 2344556544333 34566666666644 5555555
No 101
>PF15269 zf-C2H2_7: Zinc-finger
Probab=71.12 E-value=4 Score=23.62 Aligned_cols=23 Identities=30% Similarity=0.869 Sum_probs=16.6
Q ss_pred ccCCCCCCCccChhHHHHHHHhh
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
|+|-.|.+....++.|..||+-.
T Consensus 21 ykcfqcpftc~~kshl~nhmky~ 43 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKYS 43 (54)
T ss_pred ceeecCCcccchHHHHHHHHHHH
Confidence 77777777777777777776643
No 102
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=71.02 E-value=0.54 Score=45.02 Aligned_cols=78 Identities=21% Similarity=0.464 Sum_probs=49.7
Q ss_pred eecCCCcccccChHHHhhhhhccCCCCCCCCe-ecCCCCccccChhHHHHhhhhhccCCCCc-cCCCCcccccchhhhhh
Q psy10075 231 YHCTYCDYICSSRTILKSHINKKHANDEDKPF-TCEHCDSKFCKKYDMKMHMSRAHYGLKPF-KCSFCEYKCTTKFGLKS 308 (373)
Q Consensus 231 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~-~C~~C~~~f~~~~~L~~H~~~~H~~~~~~-~C~~C~~~f~~~~~l~~ 308 (373)
-.|+.|-+.+.+..+-+.+. || .|+.||-+|+--..|=.= +..+....| -|+.|.+.+.+..+-+-
T Consensus 102 a~C~~Cl~Ei~dp~~rrY~Y----------PF~~CT~CGPRfTIi~alPYD--R~nTsM~~F~lC~~C~~EY~dP~nRRf 169 (750)
T COG0068 102 ATCEDCLEEIFDPNSRRYLY----------PFINCTNCGPRFTIIEALPYD--RENTSMADFPLCPFCDKEYKDPLNRRF 169 (750)
T ss_pred hhhHHHHHHhcCCCCcceec----------cccccCCCCcceeeeccCCCC--cccCccccCcCCHHHHHHhcCcccccc
Confidence 35666755554444333332 23 588888888765554332 344555544 48888888888877777
Q ss_pred hhhhccCCCcccCCCCCC
Q psy10075 309 HVTRIHSNQKYFCGSCNF 326 (373)
Q Consensus 309 H~~~~h~~~~~~C~~C~~ 326 (373)
|. ++..|+.||-
T Consensus 170 HA------Qp~aCp~CGP 181 (750)
T COG0068 170 HA------QPIACPKCGP 181 (750)
T ss_pred cc------ccccCcccCC
Confidence 76 5778999985
No 103
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=70.94 E-value=3.7 Score=31.71 Aligned_cols=12 Identities=25% Similarity=0.703 Sum_probs=5.4
Q ss_pred ceecCCCccccc
Q psy10075 230 EYHCTYCDYICS 241 (373)
Q Consensus 230 ~~~C~~C~~~f~ 241 (373)
-|.|+.|+..|.
T Consensus 99 ~Y~Cp~C~~~y~ 110 (147)
T smart00531 99 YYKCPNCQSKYT 110 (147)
T ss_pred EEECcCCCCEee
Confidence 344444444444
No 104
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=70.55 E-value=2.4 Score=25.42 Aligned_cols=29 Identities=21% Similarity=0.478 Sum_probs=18.7
Q ss_pred CcceecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 23 SYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
...|+|..|++.|.... ......|+.||.
T Consensus 4 ~~~Y~C~~Cg~~~~~~~----------~~~~irCp~Cg~ 32 (49)
T COG1996 4 MMEYKCARCGREVELDQ----------ETRGIRCPYCGS 32 (49)
T ss_pred eEEEEhhhcCCeeehhh----------ccCceeCCCCCc
Confidence 45688888888883222 234577888874
No 105
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=69.74 E-value=3.1 Score=28.26 Aligned_cols=15 Identities=27% Similarity=0.773 Sum_probs=9.2
Q ss_pred CCCeecCCCCccccC
Q psy10075 259 DKPFTCEHCDSKFCK 273 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~ 273 (373)
...+.|..||..|..
T Consensus 51 ~GIW~C~kCg~~fAG 65 (89)
T COG1997 51 TGIWKCRKCGAKFAG 65 (89)
T ss_pred cCeEEcCCCCCeecc
Confidence 345667777766654
No 106
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=69.68 E-value=3.2 Score=32.44 Aligned_cols=32 Identities=22% Similarity=0.288 Sum_probs=18.5
Q ss_pred CCcceecCCCCccccChhhHhhccccCCCCCccccCcCccc
Q psy10075 22 DSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQ 62 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 62 (373)
+..-|.|+.|+..|+..+++. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 344466666666666666663 13666666643
No 107
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=69.39 E-value=3.1 Score=30.63 Aligned_cols=15 Identities=13% Similarity=0.250 Sum_probs=7.5
Q ss_pred CCeecCCCCccccCh
Q psy10075 260 KPFTCEHCDSKFCKK 274 (373)
Q Consensus 260 ~~~~C~~C~~~f~~~ 274 (373)
.|..|+.||..|...
T Consensus 25 ~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 25 RPAVSPYTGEQFPPE 39 (129)
T ss_pred CCccCCCcCCccCcc
Confidence 345555555555433
No 108
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=69.38 E-value=2.9 Score=33.37 Aligned_cols=12 Identities=33% Similarity=0.916 Sum_probs=5.3
Q ss_pred eecCCCCccccC
Q psy10075 262 FTCEHCDSKFCK 273 (373)
Q Consensus 262 ~~C~~C~~~f~~ 273 (373)
|.|+.|+..|+.
T Consensus 118 Y~Cp~C~~rytf 129 (178)
T PRK06266 118 FFCPNCHIRFTF 129 (178)
T ss_pred EECCCCCcEEeH
Confidence 444444444433
No 109
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=68.69 E-value=2.3 Score=26.79 Aligned_cols=44 Identities=16% Similarity=0.270 Sum_probs=24.7
Q ss_pred CCCcceecCC--CCccccChhhHhhccccCCCCCccccCc----Ccccccc
Q psy10075 21 DDSYNFICIL--CNYHAENFENIYLHLPSHFEQKLFSCDL----CEYQTNT 65 (373)
Q Consensus 21 ~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~ 65 (373)
....+..|+. |...+.. ..|..|+...-...+..|.. |+..+..
T Consensus 5 C~~~~v~C~~~cc~~~i~r-~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 5 CPFRPVPCPNGCCNEMIPR-KELDDHLENECPKRPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp STTSEEE-TT--S-BEEEC-CCHHHHHHTTSTTSEEE-SS----S--EEEH
T ss_pred CCCCEeeCCCCCcccceeH-HHHHHHHHccCCCCcEECCCCCCCCCCccch
Confidence 3556777877 5454553 46778877555556777877 7765543
No 110
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=67.93 E-value=4.1 Score=33.93 Aligned_cols=41 Identities=17% Similarity=0.400 Sum_probs=17.7
Q ss_pred CCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCcc
Q psy10075 196 NEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDY 238 (373)
Q Consensus 196 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~ 238 (373)
+.+.|.|.+|....-. ..--.|+.....-. ...|+|..|++
T Consensus 139 GGrif~CsfC~~flCE-DDQFEHQAsCQvLe-~E~~KC~SCNr 179 (314)
T PF06524_consen 139 GGRIFKCSFCDNFLCE-DDQFEHQASCQVLE-SETFKCQSCNR 179 (314)
T ss_pred CCeEEEeecCCCeeec-cchhhhhhhhhhhh-ccccccccccc
Confidence 4556666666543222 22223332222111 34466666654
No 111
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=67.90 E-value=4.1 Score=24.31 Aligned_cols=8 Identities=25% Similarity=0.945 Sum_probs=3.2
Q ss_pred ccCCCCCC
Q psy10075 319 YFCGSCNF 326 (373)
Q Consensus 319 ~~C~~C~~ 326 (373)
+.|++|+.
T Consensus 35 w~CP~C~a 42 (47)
T PF00301_consen 35 WVCPVCGA 42 (47)
T ss_dssp -B-TTTSS
T ss_pred CcCcCCCC
Confidence 55555553
No 112
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=67.44 E-value=1.9 Score=22.94 Aligned_cols=7 Identities=29% Similarity=1.051 Sum_probs=3.3
Q ss_pred ccCCCCC
Q psy10075 319 YFCGSCN 325 (373)
Q Consensus 319 ~~C~~C~ 325 (373)
|.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 4444444
No 113
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=67.18 E-value=2.8 Score=34.46 Aligned_cols=31 Identities=16% Similarity=0.244 Sum_probs=22.6
Q ss_pred CCCcccCCCCCCCccChhHHHHHHHhhcccc
Q psy10075 315 SNQKYFCGSCNFSCTSNTDLLEHVQTNHIIV 345 (373)
Q Consensus 315 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 345 (373)
.+..|.|++|++.|.-..-+++||.+.|...
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 3445999999999999999999999988743
No 114
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=67.09 E-value=3.9 Score=20.87 Aligned_cols=18 Identities=22% Similarity=0.517 Sum_probs=9.4
Q ss_pred cCCCCcccccchhhhhhhh
Q psy10075 292 KCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 292 ~C~~C~~~f~~~~~l~~H~ 310 (373)
.||+|+..+ ....+..|+
T Consensus 3 ~CPiC~~~v-~~~~in~HL 20 (26)
T smart00734 3 QCPVCFREV-PENLINSHL 20 (26)
T ss_pred cCCCCcCcc-cHHHHHHHH
Confidence 355565555 444555554
No 115
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=66.91 E-value=4 Score=32.60 Aligned_cols=32 Identities=22% Similarity=0.375 Sum_probs=18.2
Q ss_pred CCcceecCCCCccccChhhHhhccccCCCCCccccCcCccc
Q psy10075 22 DSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQ 62 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 62 (373)
+..-|.|+.|+..|+..+++. ..|.|+.||..
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~ 145 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEM 145 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCC
Confidence 344566666666666665552 24666666643
No 116
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=66.01 E-value=2.6 Score=27.79 Aligned_cols=39 Identities=13% Similarity=0.351 Sum_probs=17.6
Q ss_pred ccCCCCcccccchhhhhhhhhhccCCCcccCC--CCCCCccCh
Q psy10075 291 FKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCG--SCNFSCTSN 331 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~--~C~~~f~~~ 331 (373)
+.|+.||.......+-...- ...+..++|. .||..|...
T Consensus 2 m~CP~Cg~~a~irtSr~~s~--~~~~~Y~qC~N~eCg~tF~t~ 42 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITD--TTKERYHQCQNVNCSATFITY 42 (72)
T ss_pred ccCCCCCCccEEEEChhcCh--hhheeeeecCCCCCCCEEEEE
Confidence 45666665543222222221 1222336666 666666643
No 117
>COG2879 Uncharacterized small protein [Function unknown]
Probab=65.74 E-value=9 Score=24.19 Aligned_cols=30 Identities=13% Similarity=0.224 Sum_probs=23.4
Q ss_pred ccChhHHHHHHHhhcccccccccchhHHHH
Q psy10075 328 CTSNTDLLEHVQTNHIIVGESSSRDFIIHS 357 (373)
Q Consensus 328 f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~ 357 (373)
-++.++..+||+.+|+......++.|....
T Consensus 22 vpdYdnYVehmr~~hPd~p~mT~~EFfrec 51 (65)
T COG2879 22 VPDYDNYVEHMRKKHPDKPPMTYEEFFREC 51 (65)
T ss_pred CCcHHHHHHHHHHhCcCCCcccHHHHHHHH
Confidence 467789999999999877666666776665
No 118
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=63.82 E-value=6.4 Score=23.78 Aligned_cols=25 Identities=24% Similarity=0.670 Sum_probs=15.8
Q ss_pred CeecCCCCccccCh-----hHHHHhhhhhc
Q psy10075 261 PFTCEHCDSKFCKK-----YDMKMHMSRAH 285 (373)
Q Consensus 261 ~~~C~~C~~~f~~~-----~~L~~H~~~~H 285 (373)
.-.|..|++.++.. +.|.+|+...|
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 35677777777543 57777774344
No 119
>PF14353 CpXC: CpXC protein
Probab=63.04 E-value=1.8 Score=32.46 Aligned_cols=21 Identities=29% Similarity=0.537 Sum_probs=11.5
Q ss_pred CccCCCCcccccchhhhhhhh
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~ 310 (373)
.|.|+.||..|.-...+..|-
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred EEECCCCCCceecCCCEEEEc
Confidence 356666666665444444443
No 120
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=62.48 E-value=5.7 Score=32.66 Aligned_cols=29 Identities=17% Similarity=0.393 Sum_probs=18.6
Q ss_pred CCCCeecCCCCccccChhHHHHhhhhhcc
Q psy10075 258 EDKPFTCEHCDSKFCKKYDMKMHMSRAHY 286 (373)
Q Consensus 258 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 286 (373)
.+..|.|+.|+|.|.-......|+...|.
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 45557777777777777777777766664
No 121
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=61.96 E-value=6.8 Score=23.77 Aligned_cols=7 Identities=29% Similarity=1.189 Sum_probs=4.0
Q ss_pred ccCCCCC
Q psy10075 319 YFCGSCN 325 (373)
Q Consensus 319 ~~C~~C~ 325 (373)
+.|++|+
T Consensus 35 w~CP~C~ 41 (50)
T cd00730 35 WVCPVCG 41 (50)
T ss_pred CCCCCCC
Confidence 5555555
No 122
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=60.84 E-value=3.8 Score=23.49 Aligned_cols=11 Identities=27% Similarity=0.872 Sum_probs=7.4
Q ss_pred ccCCCCCCCcc
Q psy10075 319 YFCGSCNFSCT 329 (373)
Q Consensus 319 ~~C~~C~~~f~ 329 (373)
|.|..|+..|.
T Consensus 29 y~C~~C~~~w~ 39 (40)
T smart00440 29 YVCTKCGHRWR 39 (40)
T ss_pred EEeCCCCCEeC
Confidence 77777776553
No 123
>KOG1280|consensus
Probab=60.39 E-value=6.8 Score=34.23 Aligned_cols=36 Identities=25% Similarity=0.583 Sum_probs=17.3
Q ss_pred CccCCCCcccccchhhhhhhhhhccCCCc--ccCCCCC
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHVTRIHSNQK--YFCGSCN 325 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~--~~C~~C~ 325 (373)
.|.|++|+..-.+...|..|+...|.+.. -.|++|+
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 35555555544445555555544444443 3444443
No 124
>KOG1280|consensus
Probab=59.34 E-value=11 Score=32.95 Aligned_cols=41 Identities=20% Similarity=0.546 Sum_probs=31.0
Q ss_pred CCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCc
Q psy10075 196 NEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCD 237 (373)
Q Consensus 196 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~ 237 (373)
....|.||+|+..-.+...+..|+...|.... ..-.|+.|+
T Consensus 76 ~~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~-~~~icp~c~ 116 (381)
T KOG1280|consen 76 DPQSFTCPYCGIMGFTERQFGTHVLSQHPEAS-TSVICPLCA 116 (381)
T ss_pred ccccccCCcccccccchhHHHHHhhhcCcccC-cceeeeccc
Confidence 44578899999888888889999888887643 445677765
No 125
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=57.46 E-value=7.2 Score=32.54 Aligned_cols=91 Identities=25% Similarity=0.593 Sum_probs=52.3
Q ss_pred ccCCCceecccccccccchhHHHhhhhhc-cCCCCccccCCCchhccCcHHHHHHHHHhcCCCCccceecCCCcccccCh
Q psy10075 165 HQGQTRFQCSFCDRRFYFKNECTKHYLEH-HTNEKPFKCTLCSYKAKRNFELKQHILRIHQDNIEAEYHCTYCDYICSSR 243 (373)
Q Consensus 165 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~ 243 (373)
..+.+.|+|.+|...+-.-..| .|+..- -.....|+|.-|++. ..|.|..|...|-+
T Consensus 137 ~hGGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrl--------------------Gq~sCLRCK~cfCd- 194 (314)
T PF06524_consen 137 DHGGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNRL--------------------GQYSCLRCKICFCD- 194 (314)
T ss_pred cCCCeEEEeecCCCeeeccchh-hhhhhhhhhhcccccccccccc--------------------cchhhhheeeeehh-
Confidence 3467789999988765433333 343221 123456788877732 34556666555543
Q ss_pred HHHhhhhhccCC-CCCCCCeecCCCCccccChhHHHHhh
Q psy10075 244 TILKSHINKKHA-NDEDKPFTCEHCDSKFCKKYDMKMHM 281 (373)
Q Consensus 244 ~~l~~H~~~~~~-~~~~~~~~C~~C~~~f~~~~~L~~H~ 281 (373)
.|+++.-. -...+++.||.||...+....|..-.
T Consensus 195 ----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmSt 229 (314)
T PF06524_consen 195 ----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMST 229 (314)
T ss_pred ----hhhhhcccccccCCCCCCCCCCCcccccccceeee
Confidence 23322111 12246788888888888877777766
No 126
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=55.68 E-value=16 Score=31.43 Aligned_cols=83 Identities=18% Similarity=0.297 Sum_probs=47.0
Q ss_pred CCCeecCCCCccccChhHHHHhhhhhc-------------cCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCC
Q psy10075 259 DKPFTCEHCDSKFCKKYDMKMHMSRAH-------------YGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCN 325 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~~~~L~~H~~~~H-------------~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~ 325 (373)
.-|..|+.|....-....|.+-. +| +..+.--|-.|.-.|.-...-..- ......+|+|+.|.
T Consensus 320 sLPi~CP~Csl~LilsthLarSy--hhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK 395 (421)
T COG5151 320 SLPISCPICSLQLILSTHLARSY--HHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCK 395 (421)
T ss_pred cCCccCcchhHHHHHHHHHHHHH--HhhccCcccccccCCCCCCCccceeccCCCCCCCCCccc--ccccccceechhhh
Confidence 34667888876544333333221 12 111223477777777543311110 11223359999999
Q ss_pred CCccChhHHHHHHHhhcccc
Q psy10075 326 FSCTSNTDLLEHVQTNHIIV 345 (373)
Q Consensus 326 ~~f~~~~~l~~H~~~~H~~~ 345 (373)
..|-..-+...|...|-.+.
T Consensus 396 ~~FC~dCdvfiHe~Lh~C~g 415 (421)
T COG5151 396 STFCSDCDVFIHETLHFCIG 415 (421)
T ss_pred hhhhhhhHHHHHHHHhhCCC
Confidence 99999888888888775443
No 127
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=55.61 E-value=7.6 Score=23.63 Aligned_cols=26 Identities=23% Similarity=0.409 Sum_probs=12.7
Q ss_pred CcceecCCCCccccChhhHhhccccC
Q psy10075 23 SYNFICILCNYHAENFENIYLHLPSH 48 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~~~h 48 (373)
...|.|+.|+..|=-.-++..|..+|
T Consensus 19 ~~~y~C~~C~~~FC~dCD~fiHE~LH 44 (51)
T PF07975_consen 19 SSRYRCPKCKNHFCIDCDVFIHETLH 44 (51)
T ss_dssp -EEE--TTTT--B-HHHHHTTTTTS-
T ss_pred CCeEECCCCCCccccCcChhhhcccc
Confidence 35677777777776666666665555
No 128
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=55.13 E-value=7.4 Score=19.88 Aligned_cols=8 Identities=25% Similarity=1.099 Sum_probs=4.3
Q ss_pred CCCCCCCc
Q psy10075 321 CGSCNFSC 328 (373)
Q Consensus 321 C~~C~~~f 328 (373)
|+.||+.|
T Consensus 17 Cp~CG~~F 24 (26)
T PF10571_consen 17 CPHCGYDF 24 (26)
T ss_pred CCCCCCCC
Confidence 55555554
No 129
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=54.94 E-value=8 Score=27.10 Aligned_cols=14 Identities=29% Similarity=0.781 Sum_probs=8.1
Q ss_pred CeecCCCCccccCh
Q psy10075 261 PFTCEHCDSKFCKK 274 (373)
Q Consensus 261 ~~~C~~C~~~f~~~ 274 (373)
|++|..||..|..-
T Consensus 2 pH~CtrCG~vf~~g 15 (112)
T COG3364 2 PHQCTRCGEVFDDG 15 (112)
T ss_pred Cceecccccccccc
Confidence 34566666666553
No 130
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=53.93 E-value=12 Score=20.89 Aligned_cols=8 Identities=25% Similarity=1.082 Sum_probs=3.4
Q ss_pred cCCCCccc
Q psy10075 264 CEHCDSKF 271 (373)
Q Consensus 264 C~~C~~~f 271 (373)
|+.||..|
T Consensus 4 C~~Cg~~Y 11 (36)
T PF05191_consen 4 CPKCGRIY 11 (36)
T ss_dssp ETTTTEEE
T ss_pred cCCCCCcc
Confidence 44444443
No 131
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=53.82 E-value=3.1 Score=23.68 Aligned_cols=10 Identities=30% Similarity=0.962 Sum_probs=6.9
Q ss_pred ccCCCCCCCc
Q psy10075 319 YFCGSCNFSC 328 (373)
Q Consensus 319 ~~C~~C~~~f 328 (373)
|.|..|+..|
T Consensus 29 y~C~~C~~~w 38 (39)
T PF01096_consen 29 YVCCNCGHRW 38 (39)
T ss_dssp EEESSSTEEE
T ss_pred EEeCCCCCee
Confidence 7777777654
No 132
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=51.86 E-value=8.6 Score=22.75 Aligned_cols=9 Identities=33% Similarity=1.010 Sum_probs=4.4
Q ss_pred cccCCCCCC
Q psy10075 318 KYFCGSCNF 326 (373)
Q Consensus 318 ~~~C~~C~~ 326 (373)
.|+|..|++
T Consensus 37 ~~~C~~C~~ 45 (46)
T PF12760_consen 37 RYRCKACRK 45 (46)
T ss_pred eEECCCCCC
Confidence 355555543
No 133
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=51.44 E-value=7 Score=22.54 Aligned_cols=24 Identities=25% Similarity=0.290 Sum_probs=14.7
Q ss_pred ceecCCCCccc--cChhhHhhccccC
Q psy10075 25 NFICILCNYHA--ENFENIYLHLPSH 48 (373)
Q Consensus 25 ~~~C~~C~~~f--~~~~~l~~H~~~h 48 (373)
...|+.||++| ...++-..|.+-|
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH 38 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYH 38 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHH
Confidence 45788888877 3344555555444
No 134
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=48.40 E-value=8.9 Score=26.50 Aligned_cols=33 Identities=18% Similarity=0.564 Sum_probs=19.6
Q ss_pred ccceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccC
Q psy10075 228 EAEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCK 273 (373)
Q Consensus 228 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~ 273 (373)
...|.|+.|++.-. ...+...+.|..|++.|..
T Consensus 34 ~a~y~CpfCgk~~v-------------kR~a~GIW~C~~C~~~~AG 66 (90)
T PTZ00255 34 HAKYFCPFCGKHAV-------------KRQAVGIWRCKGCKKTVAG 66 (90)
T ss_pred hCCccCCCCCCCce-------------eeeeeEEEEcCCCCCEEeC
Confidence 35688888875311 1112345788888887765
No 135
>KOG2907|consensus
Probab=47.58 E-value=10 Score=27.23 Aligned_cols=10 Identities=40% Similarity=1.221 Sum_probs=5.2
Q ss_pred ccCCCCcccc
Q psy10075 291 FKCSFCEYKC 300 (373)
Q Consensus 291 ~~C~~C~~~f 300 (373)
|.|+.|++.|
T Consensus 103 YTC~kC~~k~ 112 (116)
T KOG2907|consen 103 YTCPKCKYKF 112 (116)
T ss_pred EEcCccceee
Confidence 4555555544
No 136
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=47.18 E-value=12 Score=22.77 Aligned_cols=10 Identities=30% Similarity=1.016 Sum_probs=5.1
Q ss_pred CccCCCCccc
Q psy10075 290 PFKCSFCEYK 299 (373)
Q Consensus 290 ~~~C~~C~~~ 299 (373)
.+.|+.||+.
T Consensus 37 r~~C~~Cgyt 46 (50)
T PRK00432 37 RWHCGKCGYT 46 (50)
T ss_pred cEECCCcCCE
Confidence 3455555543
No 137
>PF14369 zf-RING_3: zinc-finger
Probab=47.10 E-value=12 Score=20.67 Aligned_cols=8 Identities=38% Similarity=1.161 Sum_probs=4.0
Q ss_pred cCcccccc
Q psy10075 85 CSLCDSTF 92 (373)
Q Consensus 85 C~~C~~~f 92 (373)
|+.|+..|
T Consensus 24 CP~C~~gF 31 (35)
T PF14369_consen 24 CPRCHGGF 31 (35)
T ss_pred CcCCCCcE
Confidence 55555444
No 138
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=46.75 E-value=11 Score=26.83 Aligned_cols=27 Identities=4% Similarity=-0.097 Sum_probs=16.3
Q ss_pred ecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCcccc
Q psy10075 232 HCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFC 272 (373)
Q Consensus 232 ~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~ 272 (373)
.|+.||+.|... +..|..|+.||++|.
T Consensus 11 idPetg~KFYDL--------------NrdPiVsPytG~s~P 37 (129)
T COG4530 11 IDPETGKKFYDL--------------NRDPIVSPYTGKSYP 37 (129)
T ss_pred cCccccchhhcc--------------CCCccccCcccccch
Confidence 566666666441 145667777777773
No 139
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=46.35 E-value=13 Score=19.34 Aligned_cols=19 Identities=26% Similarity=0.448 Sum_probs=11.0
Q ss_pred eecCCCCccccChhhHhhcc
Q psy10075 26 FICILCNYHAENFENIYLHL 45 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~ 45 (373)
|.|..|++.| .......|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 5677788888 334555553
No 140
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=46.08 E-value=13 Score=27.30 Aligned_cols=41 Identities=15% Similarity=0.298 Sum_probs=23.6
Q ss_pred hhhhhcccccceeecCCCCcceecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 5 DEILNHSRECAFVSRLDDSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
+.++.++.|.- ..-...+.|..|+..|..... .+.|+.|+-
T Consensus 54 ~t~~ega~L~I----~~~p~~~~C~~Cg~~~~~~~~------------~~~CP~Cgs 94 (115)
T TIGR00100 54 GTVAEGAKLNI----EDEPVECECEDCSEEVSPEID------------LYRCPKCHG 94 (115)
T ss_pred CCccCCCEEEE----EeeCcEEEcccCCCEEecCCc------------CccCcCCcC
Confidence 34445554443 223346788888877766432 356888873
No 141
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=45.69 E-value=11 Score=21.04 Aligned_cols=29 Identities=28% Similarity=0.661 Sum_probs=15.2
Q ss_pred cCCCCcccccchhhhhhhhhhccCCCcccCCCC
Q psy10075 292 KCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSC 324 (373)
Q Consensus 292 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C 324 (373)
.|+.|+.. ..+..|=+...+.++|.|..|
T Consensus 7 ~CP~C~s~----~~v~k~G~~~~G~qryrC~~C 35 (36)
T PF03811_consen 7 HCPRCQST----EGVKKNGKSPSGHQRYRCKDC 35 (36)
T ss_pred eCCCCCCC----CcceeCCCCCCCCEeEecCcC
Confidence 46666443 223444434444556777766
No 142
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=44.79 E-value=12 Score=27.33 Aligned_cols=41 Identities=12% Similarity=0.153 Sum_probs=24.4
Q ss_pred hhhhhcccccceeecCCCCcceecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 5 DEILNHSRECAFVSRLDDSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
|+++.++.|.... -...+.|..|+..|.... ..+.|+.|+.
T Consensus 54 ~T~~egA~L~I~~----vp~~~~C~~Cg~~~~~~~------------~~~~CP~Cgs 94 (113)
T PRK12380 54 GTVAQGCDLHIVY----KPAQAWCWDCSQVVEIHQ------------HDAQCPHCHG 94 (113)
T ss_pred CCccCCCEEEEEe----eCcEEEcccCCCEEecCC------------cCccCcCCCC
Confidence 3455555554422 334678888887776643 2346888873
No 143
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=44.58 E-value=13 Score=18.58 Aligned_cols=11 Identities=27% Similarity=0.800 Sum_probs=8.9
Q ss_pred CcceecCCCCc
Q psy10075 23 SYNFICILCNY 33 (373)
Q Consensus 23 ~~~~~C~~C~~ 33 (373)
..+|.|+.||.
T Consensus 14 ~v~f~CPnCG~ 24 (24)
T PF07754_consen 14 AVPFPCPNCGF 24 (24)
T ss_pred CceEeCCCCCC
Confidence 56799999984
No 144
>KOG3408|consensus
Probab=44.55 E-value=10 Score=27.60 Aligned_cols=26 Identities=19% Similarity=0.354 Sum_probs=20.6
Q ss_pred CCcceecCCCCccccChhhHhhcccc
Q psy10075 22 DSYNFICILCNYHAENFENIYLHLPS 47 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 47 (373)
+.+.|-|-+|.+.|.+...|..|.+.
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 56778888888888888888888654
No 145
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=44.47 E-value=10 Score=26.60 Aligned_cols=37 Identities=14% Similarity=0.257 Sum_probs=18.7
Q ss_pred CcceecCCCCccccChhhHhhccccCCCCCccccCcCcccc
Q psy10075 23 SYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQT 63 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 63 (373)
++.|.|+.||..-.+...+ +.-.......|..||..|
T Consensus 20 ~k~FtCp~Cghe~vs~ctv----kk~~~~g~~~Cg~CGls~ 56 (104)
T COG4888 20 PKTFTCPRCGHEKVSSCTV----KKTVNIGTAVCGNCGLSF 56 (104)
T ss_pred CceEecCccCCeeeeEEEE----EecCceeEEEcccCcceE
Confidence 3456677777655443332 222223345566666554
No 146
>KOG2593|consensus
Probab=44.44 E-value=17 Score=32.95 Aligned_cols=36 Identities=11% Similarity=0.301 Sum_probs=17.9
Q ss_pred CCccccCcCcccccchHHHHHHhhhhhccCCccccCcccc
Q psy10075 51 QKLFSCDLCEYQTNTKIYIRRHMLKIHLRQRNLECSLCDS 90 (373)
Q Consensus 51 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~ 90 (373)
...|.|+.|.++|.....++.- -.....|.|..|+.
T Consensus 126 ~~~Y~Cp~C~kkyt~Lea~~L~----~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLEALQLL----DNETGEFHCENCGG 161 (436)
T ss_pred cccccCCccccchhhhHHHHhh----cccCceEEEecCCC
Confidence 3446666666666554444432 11234456666654
No 147
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=43.97 E-value=17 Score=27.50 Aligned_cols=16 Identities=13% Similarity=0.065 Sum_probs=11.9
Q ss_pred CcceecCCCCccccCh
Q psy10075 23 SYNFICILCNYHAENF 38 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~ 38 (373)
...+.|..||..|...
T Consensus 68 p~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 68 EAVLKCRNCGNEWSLK 83 (135)
T ss_pred ceEEECCCCCCEEecc
Confidence 3568888998888654
No 148
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=43.92 E-value=5.4 Score=26.32 Aligned_cols=12 Identities=8% Similarity=0.207 Sum_probs=5.9
Q ss_pred ceecCCCccccc
Q psy10075 230 EYHCTYCDYICS 241 (373)
Q Consensus 230 ~~~C~~C~~~f~ 241 (373)
.+.++-|++.|.
T Consensus 10 ~~~~~cC~~~y~ 21 (71)
T PF05495_consen 10 AIRFPCCGKYYP 21 (71)
T ss_dssp EEEETTTTEEES
T ss_pred EEECCcccCeec
Confidence 344555555443
No 149
>KOG2593|consensus
Probab=43.60 E-value=11 Score=34.25 Aligned_cols=42 Identities=19% Similarity=0.398 Sum_probs=31.0
Q ss_pred ecCCCCcceecCCCCccccChhhHhhccccCCCCCccccCcCccc
Q psy10075 18 SRLDDSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQ 62 (373)
Q Consensus 18 ~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 62 (373)
+..+.-.-|.|+.|++.|...+++.. .-.....|.|..|+-.
T Consensus 121 ~d~t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 121 RDDTNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE 162 (436)
T ss_pred hhccccccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence 34445567999999999999887743 3334567999999843
No 150
>KOG2272|consensus
Probab=43.45 E-value=17 Score=30.17 Aligned_cols=18 Identities=17% Similarity=0.394 Sum_probs=11.7
Q ss_pred CcccCCCCccccChHHHH
Q psy10075 111 PFSCNFCDYKSKTKYDIT 128 (373)
Q Consensus 111 ~~~C~~C~~~f~~~~~l~ 128 (373)
-|.|+.|++.+.+...+.
T Consensus 99 CF~Cd~Cn~~Lad~gf~r 116 (332)
T KOG2272|consen 99 CFRCDLCNKHLADQGFYR 116 (332)
T ss_pred cchhHHHHHHHhhhhhHh
Confidence 477888877766555443
No 151
>KOG2807|consensus
Probab=43.45 E-value=39 Score=29.50 Aligned_cols=25 Identities=24% Similarity=0.506 Sum_probs=20.1
Q ss_pred cccCCCCCCCccChhHHHHHHHhhc
Q psy10075 318 KYFCGSCNFSCTSNTDLLEHVQTNH 342 (373)
Q Consensus 318 ~~~C~~C~~~f~~~~~l~~H~~~~H 342 (373)
.|+|..|...|-..=+...|-..|-
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh~ 369 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLHN 369 (378)
T ss_pred cEEchhccceeeccchHHHHhhhhc
Confidence 4899999999988888888876663
No 152
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=43.25 E-value=9.6 Score=26.37 Aligned_cols=33 Identities=27% Similarity=0.695 Sum_probs=19.3
Q ss_pred cceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccCh
Q psy10075 229 AEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCKK 274 (373)
Q Consensus 229 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~ 274 (373)
..|.|+.|++.-.. ..+...+.|..|++.|..-
T Consensus 34 a~y~CpfCgk~~vk-------------R~a~GIW~C~~C~~~~AGG 66 (91)
T TIGR00280 34 AKYVCPFCGKKTVK-------------RGSTGIWTCRKCGAKFAGG 66 (91)
T ss_pred cCccCCCCCCCceE-------------EEeeEEEEcCCCCCEEeCC
Confidence 45788888753110 1123457788888877653
No 153
>KOG4124|consensus
Probab=42.04 E-value=7.6 Score=33.90 Aligned_cols=70 Identities=24% Similarity=0.408 Sum_probs=39.6
Q ss_pred CCCeecCC--CCccccChhHHHHhhhhhccCCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChhHHHH
Q psy10075 259 DKPFTCEH--CDSKFCKKYDMKMHMSRAHYGLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNTDLLE 336 (373)
Q Consensus 259 ~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~ 336 (373)
.++|.|++ |.+.+++...|..|-...|...- -.-...=+-|....-.+++|+|++|.+.+.....|.-
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i----------~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~ 416 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPI----------TTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKY 416 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCC----------CCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCc
Confidence 47788855 99999888888888644443221 0111122233333333455777777777666555544
Q ss_pred HH
Q psy10075 337 HV 338 (373)
Q Consensus 337 H~ 338 (373)
|.
T Consensus 417 ~~ 418 (442)
T KOG4124|consen 417 HR 418 (442)
T ss_pred ee
Confidence 43
No 154
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=41.82 E-value=14 Score=27.74 Aligned_cols=27 Identities=22% Similarity=0.428 Sum_probs=16.8
Q ss_pred CCcccCCCCCCCccChhHHHHHHHhhcccc
Q psy10075 316 NQKYFCGSCNFSCTSNTDLLEHVQTNHIIV 345 (373)
Q Consensus 316 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 345 (373)
.....|-.||+.|.. |..|+++||++.
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred cCeeEEccCCcccch---HHHHHHHccCCC
Confidence 344789999998886 589999998855
No 155
>KOG2807|consensus
Probab=41.71 E-value=49 Score=28.92 Aligned_cols=32 Identities=19% Similarity=0.493 Sum_probs=22.5
Q ss_pred ccceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCC
Q psy10075 228 EAEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCD 268 (373)
Q Consensus 228 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~ 268 (373)
...|.|+.|...|-..-+...|- ....|+-|.
T Consensus 343 ~~~y~C~~Ck~~FCldCDv~iHe---------sLh~CpgCe 374 (378)
T KOG2807|consen 343 SGRYRCESCKNVFCLDCDVFIHE---------SLHNCPGCE 374 (378)
T ss_pred CCcEEchhccceeeccchHHHHh---------hhhcCCCcC
Confidence 45688888888888777777773 334566664
No 156
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=41.44 E-value=14 Score=22.80 Aligned_cols=31 Identities=26% Similarity=0.433 Sum_probs=17.4
Q ss_pred ceecCCCCccccChhhHhhccccCCCCCccccCcCcccc
Q psy10075 25 NFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQT 63 (373)
Q Consensus 25 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 63 (373)
.+.|+.|+..+.-.... .+ ....|+.||-.+
T Consensus 2 ~~~CP~CG~~iev~~~~-------~G-eiV~Cp~CGael 32 (54)
T TIGR01206 2 QFECPDCGAEIELENPE-------LG-ELVICDECGAEL 32 (54)
T ss_pred ccCCCCCCCEEecCCCc-------cC-CEEeCCCCCCEE
Confidence 35788888766443221 12 245777777544
No 157
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=41.43 E-value=19 Score=25.41 Aligned_cols=26 Identities=12% Similarity=0.043 Sum_probs=16.5
Q ss_pred ccccceeecCCCCcceecCCCCccccC
Q psy10075 11 SRECAFVSRLDDSYNFICILCNYHAEN 37 (373)
Q Consensus 11 ~~~~~~~~~~~~~~~~~C~~C~~~f~~ 37 (373)
++...+++.+.+ ++..|++||..|.-
T Consensus 66 ~h~v~W~~l~~g-~~~rC~eCG~~fkL 91 (97)
T cd00924 66 SHDVIWMWLEKG-KPKRCPECGHVFKL 91 (97)
T ss_pred CceEEEEEEeCC-CceeCCCCCcEEEE
Confidence 344456666655 57777777777743
No 158
>PF09332 Mcm10: Mcm10 replication factor; InterPro: IPR015411 Mcm10 is a eukaryotic DNA replication factor that regulates the stability and chromatin association of DNA polymerase alpha []. ; PDB: 2KWQ_A.
Probab=41.29 E-value=8 Score=34.34 Aligned_cols=58 Identities=28% Similarity=0.403 Sum_probs=24.5
Q ss_pred CCeecCCCCccccChhHHH---Hhhhhhcc-CCCCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCC
Q psy10075 260 KPFTCEHCDSKFCKKYDMK---MHMSRAHY-GLKPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFS 327 (373)
Q Consensus 260 ~~~~C~~C~~~f~~~~~L~---~H~~~~H~-~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~ 327 (373)
+.+.|..|.+++......- .|....|. ..+.|+|+.|+++..+...|-. -.|..||-.
T Consensus 251 kav~C~~C~yt~~~~~~~C~~~~H~l~~~~a~KRFFkC~~C~~Rt~sl~r~P~----------~~C~~Cg~~ 312 (344)
T PF09332_consen 251 KAVTCKQCKYTAFKPSDRCKEEGHPLKWHDAVKRFFKCKDCGNRTISLERLPK----------KHCSNCGSS 312 (344)
T ss_dssp EEEEETTT--EESS--HHHHHTT--EEEEEEE-EEEE-T-TS-EEEESSSS------------S--TTT-S-
T ss_pred EEEEcCCCCCcccCcchhHHhcCCceEEeeeeeeeEECCCCCCeeeecccCCC----------CCCCcCCcC
Confidence 4478888887665554443 23322232 3345889999888665433322 248888754
No 159
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=40.94 E-value=38 Score=29.86 Aligned_cols=21 Identities=14% Similarity=0.398 Sum_probs=16.0
Q ss_pred CcccccccccccCHHHHHHHH
Q psy10075 141 PHKCTICESTYLTKVELKNHI 161 (373)
Q Consensus 141 ~~~C~~C~~~f~~~~~l~~H~ 161 (373)
.+-|+.|++.|.....+..|+
T Consensus 238 ~~YC~~C~r~f~~~~VFe~Hl 258 (470)
T COG5188 238 KVYCVKCGREFSRSKVFEYHL 258 (470)
T ss_pred ceeeHhhhhHhhhhHHHHHHH
Confidence 356888888888877777776
No 160
>PRK04023 DNA polymerase II large subunit; Validated
Probab=40.45 E-value=34 Score=35.01 Aligned_cols=11 Identities=27% Similarity=0.377 Sum_probs=7.0
Q ss_pred ccCCCCCCCcc
Q psy10075 319 YFCGSCNFSCT 329 (373)
Q Consensus 319 ~~C~~C~~~f~ 329 (373)
|.|+.|+..-.
T Consensus 664 y~CPKCG~El~ 674 (1121)
T PRK04023 664 DECEKCGREPT 674 (1121)
T ss_pred CcCCCCCCCCC
Confidence 56777775444
No 161
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=40.41 E-value=14 Score=18.13 Aligned_cols=9 Identities=33% Similarity=0.696 Sum_probs=4.3
Q ss_pred cCCCCcccc
Q psy10075 28 CILCNYHAE 36 (373)
Q Consensus 28 C~~C~~~f~ 36 (373)
|+.||....
T Consensus 2 Cp~CG~~~~ 10 (23)
T PF13240_consen 2 CPNCGAEIE 10 (23)
T ss_pred CcccCCCCC
Confidence 445554443
No 162
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=39.80 E-value=21 Score=35.46 Aligned_cols=47 Identities=28% Similarity=0.553 Sum_probs=29.2
Q ss_pred ceecCCCCccccCh---hhHhhccccCCCCCccccCcCcccccchHHHHHHhhhhhccCCccccCccccc
Q psy10075 25 NFICILCNYHAENF---ENIYLHLPSHFEQKLFSCDLCEYQTNTKIYIRRHMLKIHLRQRNLECSLCDST 91 (373)
Q Consensus 25 ~~~C~~C~~~f~~~---~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~ 91 (373)
.+.|..||..+.-. ..|..| ...+...|.+||. ....|..|+.||..
T Consensus 435 ~l~C~~Cg~v~~Cp~Cd~~lt~H----~~~~~L~CH~Cg~----------------~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 435 LLLCRDCGYIAECPNCDSPLTLH----KATGQLRCHYCGY----------------QEPIPQSCPECGSE 484 (730)
T ss_pred eeecccCCCcccCCCCCcceEEe----cCCCeeEeCCCCC----------------CCCCCCCCCCCCCC
Confidence 35688888777443 234344 3345678888882 24557788888764
No 163
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=39.70 E-value=8.9 Score=26.49 Aligned_cols=11 Identities=18% Similarity=0.736 Sum_probs=5.2
Q ss_pred ccccCcCcccc
Q psy10075 53 LFSCDLCEYQT 63 (373)
Q Consensus 53 ~~~C~~C~~~f 63 (373)
.+.|..|++.|
T Consensus 53 IW~C~~C~~~~ 63 (90)
T PF01780_consen 53 IWKCKKCGKKF 63 (90)
T ss_dssp EEEETTTTEEE
T ss_pred EeecCCCCCEE
Confidence 34455555444
No 164
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=38.86 E-value=39 Score=35.55 Aligned_cols=29 Identities=21% Similarity=0.466 Sum_probs=16.5
Q ss_pred CccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCcc
Q psy10075 290 PFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCT 329 (373)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~ 329 (373)
+|.|+.||....... .+ ...|+.|+....
T Consensus 692 vy~CPsCGaev~~de----------s~-a~~CP~CGtplv 720 (1337)
T PRK14714 692 VYVCPDCGAEVPPDE----------SG-RVECPRCDVELT 720 (1337)
T ss_pred ceeCccCCCccCCCc----------cc-cccCCCCCCccc
Confidence 567888887643211 11 345888885443
No 165
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=38.62 E-value=14 Score=27.05 Aligned_cols=27 Identities=30% Similarity=0.342 Sum_probs=15.9
Q ss_pred cceecCCCCccccChhhHhhccccCCCCCccccCcCccc
Q psy10075 24 YNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEYQ 62 (373)
Q Consensus 24 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 62 (373)
..+.|..|+..|.-.... +.|+.|+..
T Consensus 69 ~~~~C~~Cg~~~~~~~~~------------~~CP~Cgs~ 95 (113)
T PF01155_consen 69 ARARCRDCGHEFEPDEFD------------FSCPRCGSP 95 (113)
T ss_dssp -EEEETTTS-EEECHHCC------------HH-SSSSSS
T ss_pred CcEECCCCCCEEecCCCC------------CCCcCCcCC
Confidence 357788888888765321 458888743
No 166
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=38.45 E-value=11 Score=36.55 Aligned_cols=55 Identities=18% Similarity=0.391 Sum_probs=25.9
Q ss_pred cCCCCccccChhhHhhccccCCCCCcc-ccCcCcccccchHHHHHHhhhhhccCCccccCcccc
Q psy10075 28 CILCNYHAENFENIYLHLPSHFEQKLF-SCDLCEYQTNTKIYIRRHMLKIHLRQRNLECSLCDS 90 (373)
Q Consensus 28 C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~ 90 (373)
|..||-.|+-.++|= -=+..+....| .|+.|.+.+....+-+=| ..+.-|+.||-
T Consensus 126 CT~CGPRfTIi~alP-YDR~nTsM~~F~lC~~C~~EY~dP~nRRfH-------AQp~aCp~CGP 181 (750)
T COG0068 126 CTNCGPRFTIIEALP-YDRENTSMADFPLCPFCDKEYKDPLNRRFH-------AQPIACPKCGP 181 (750)
T ss_pred cCCCCcceeeeccCC-CCcccCccccCcCCHHHHHHhcCccccccc-------cccccCcccCC
Confidence 666666665555441 11222223333 266666555554444444 22455666654
No 167
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=38.41 E-value=19 Score=24.69 Aligned_cols=14 Identities=14% Similarity=0.558 Sum_probs=8.8
Q ss_pred CccccCcCcccccc
Q psy10075 52 KLFSCDLCEYQTNT 65 (373)
Q Consensus 52 ~~~~C~~C~~~f~~ 65 (373)
.|-.|..||..|.+
T Consensus 57 ~Pa~CkkCGfef~~ 70 (97)
T COG3357 57 RPARCKKCGFEFRD 70 (97)
T ss_pred cChhhcccCccccc
Confidence 45567777766655
No 168
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=38.19 E-value=17 Score=21.86 Aligned_cols=14 Identities=14% Similarity=0.278 Sum_probs=11.0
Q ss_pred CcceecCCCCcccc
Q psy10075 23 SYNFICILCNYHAE 36 (373)
Q Consensus 23 ~~~~~C~~C~~~f~ 36 (373)
++++.|..||..|.
T Consensus 2 Dk~l~C~dCg~~Fv 15 (49)
T PF13451_consen 2 DKTLTCKDCGAEFV 15 (49)
T ss_pred CeeEEcccCCCeEE
Confidence 56788888888773
No 169
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=37.90 E-value=12 Score=25.81 Aligned_cols=32 Identities=22% Similarity=0.466 Sum_probs=19.0
Q ss_pred cceecCCCcccccChHHHhhhhhccCCCCCCCCeecCCCCccccC
Q psy10075 229 AEYHCTYCDYICSSRTILKSHINKKHANDEDKPFTCEHCDSKFCK 273 (373)
Q Consensus 229 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~ 273 (373)
..|.|+.|++.-.. ..+...+.|..|++.|..
T Consensus 35 a~y~CpfCgk~~vk-------------R~a~GIW~C~~C~~~~AG 66 (90)
T PRK03976 35 AKHVCPVCGRPKVK-------------RVGTGIWECRKCGAKFAG 66 (90)
T ss_pred cCccCCCCCCCceE-------------EEEEEEEEcCCCCCEEeC
Confidence 45788888653111 112345778888887765
No 170
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=37.32 E-value=33 Score=21.47 Aligned_cols=32 Identities=16% Similarity=0.401 Sum_probs=14.0
Q ss_pred CCCeecCCCCccccChhHHHHhhhhhccCCCCccCCC
Q psy10075 259 DKPFTCEHCDSKFCKKYDMKMHMSRAHYGLKPFKCSF 295 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~ 295 (373)
+.|++...|+.+|...+.+.. + ......+|+.
T Consensus 22 ~~PV~s~~C~H~fek~aI~~~-i----~~~~~~~CPv 53 (57)
T PF11789_consen 22 EDPVKSKKCGHTFEKEAILQY-I----QRNGSKRCPV 53 (57)
T ss_dssp SSEEEESSS--EEEHHHHHHH-C----TTTS-EE-SC
T ss_pred hCCcCcCCCCCeecHHHHHHH-H----HhcCCCCCCC
Confidence 446666667766655443332 2 2234455655
No 171
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=37.03 E-value=20 Score=20.48 Aligned_cols=17 Identities=24% Similarity=0.278 Sum_probs=9.7
Q ss_pred ccCCCCCCCccChhHHH
Q psy10075 319 YFCGSCNFSCTSNTDLL 335 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~ 335 (373)
+.|+.|+=.|.....|.
T Consensus 20 d~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGELE 36 (41)
T ss_pred EECCCCCeEEccHHHHH
Confidence 56666665555554443
No 172
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=36.66 E-value=11 Score=22.45 Aligned_cols=36 Identities=14% Similarity=0.380 Sum_probs=17.1
Q ss_pred CCCCcccccchhhhhhhhhhccCCCcccCCC--CCCCccC
Q psy10075 293 CSFCEYKCTTKFGLKSHVTRIHSNQKYFCGS--CNFSCTS 330 (373)
Q Consensus 293 C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~--C~~~f~~ 330 (373)
||.||.......+-..+. .-.+..|+|.. ||..|..
T Consensus 2 CP~Cg~~a~ir~S~~~s~--~~~~~Y~qC~N~~Cg~tfv~ 39 (47)
T PF04606_consen 2 CPHCGSKARIRTSRQLSP--LTRELYCQCTNPECGHTFVA 39 (47)
T ss_pred cCCCCCeeEEEEchhhCc--ceEEEEEEECCCcCCCEEEE
Confidence 666766544333222221 11112367776 7766653
No 173
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=36.61 E-value=24 Score=30.98 Aligned_cols=27 Identities=11% Similarity=0.324 Sum_probs=19.2
Q ss_pred ccCCCcccCCCCC-CCccChhHHHHHHH
Q psy10075 313 IHSNQKYFCGSCN-FSCTSNTDLLEHVQ 339 (373)
Q Consensus 313 ~h~~~~~~C~~C~-~~f~~~~~l~~H~~ 339 (373)
|.-+..|.|.+|| +.+.-...+..|..
T Consensus 369 hgLd~ef~CEICgNyvy~GR~~FdrHF~ 396 (470)
T COG5188 369 HGLDIEFECEICGNYVYYGRDRFDRHFE 396 (470)
T ss_pred cCCCcceeeeecccccccchHHHHhhhh
Confidence 3334458899998 77777777777755
No 174
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=36.51 E-value=18 Score=27.22 Aligned_cols=15 Identities=40% Similarity=0.778 Sum_probs=9.9
Q ss_pred CCcccCCCCCCCccC
Q psy10075 316 NQKYFCGSCNFSCTS 330 (373)
Q Consensus 316 ~~~~~C~~C~~~f~~ 330 (373)
.++|+|+.|++.|..
T Consensus 51 ~qRyrC~~C~~tf~~ 65 (129)
T COG3677 51 HQRYKCKSCGSTFTV 65 (129)
T ss_pred ccccccCCcCcceee
Confidence 345777777777764
No 175
>PRK04023 DNA polymerase II large subunit; Validated
Probab=36.46 E-value=32 Score=35.17 Aligned_cols=10 Identities=30% Similarity=0.664 Sum_probs=5.1
Q ss_pred eecCCCCccc
Q psy10075 262 FTCEHCDSKF 271 (373)
Q Consensus 262 ~~C~~C~~~f 271 (373)
+.|+.||..-
T Consensus 664 y~CPKCG~El 673 (1121)
T PRK04023 664 DECEKCGREP 673 (1121)
T ss_pred CcCCCCCCCC
Confidence 4455555433
No 176
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=36.20 E-value=17 Score=27.77 Aligned_cols=18 Identities=17% Similarity=0.060 Sum_probs=13.2
Q ss_pred ccCCCCCCCccChhHHHH
Q psy10075 319 YFCGSCNFSCTSNTDLLE 336 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~ 336 (373)
-.|..|++.|++...+..
T Consensus 29 ReC~~C~~RFTTyErve~ 46 (147)
T TIGR00244 29 RECLECHERFTTFERAEL 46 (147)
T ss_pred ccCCccCCccceeeeccc
Confidence 678888888887765443
No 177
>KOG3507|consensus
Probab=36.18 E-value=26 Score=21.84 Aligned_cols=11 Identities=18% Similarity=0.733 Sum_probs=6.0
Q ss_pred CCeecCCCCcc
Q psy10075 260 KPFTCEHCDSK 270 (373)
Q Consensus 260 ~~~~C~~C~~~ 270 (373)
..++|..||..
T Consensus 36 D~irCReCG~R 46 (62)
T KOG3507|consen 36 DVIRCRECGYR 46 (62)
T ss_pred CcEehhhcchH
Confidence 34566666654
No 178
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=36.04 E-value=18 Score=20.94 Aligned_cols=9 Identities=22% Similarity=0.814 Sum_probs=3.8
Q ss_pred eecCCCCcc
Q psy10075 262 FTCEHCDSK 270 (373)
Q Consensus 262 ~~C~~C~~~ 270 (373)
+.|+.||..
T Consensus 20 ~vC~~CG~V 28 (43)
T PF08271_consen 20 LVCPNCGLV 28 (43)
T ss_dssp EEETTT-BB
T ss_pred EECCCCCCE
Confidence 445555443
No 179
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=34.73 E-value=38 Score=20.38 Aligned_cols=11 Identities=27% Similarity=0.776 Sum_probs=5.5
Q ss_pred CeecCCCCccc
Q psy10075 261 PFTCEHCDSKF 271 (373)
Q Consensus 261 ~~~C~~C~~~f 271 (373)
.|.|+.||..+
T Consensus 20 ~~vC~~Cg~~~ 30 (52)
T smart00661 20 RFVCRKCGYEE 30 (52)
T ss_pred EEECCcCCCeE
Confidence 34555555444
No 180
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=34.51 E-value=19 Score=22.19 Aligned_cols=10 Identities=40% Similarity=0.939 Sum_probs=5.6
Q ss_pred ecCCCCcccc
Q psy10075 263 TCEHCDSKFC 272 (373)
Q Consensus 263 ~C~~C~~~f~ 272 (373)
+|+.||+.|.
T Consensus 7 ~C~~Cg~~~~ 16 (54)
T PF14446_consen 7 KCPVCGKKFK 16 (54)
T ss_pred cChhhCCccc
Confidence 4556665554
No 181
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=34.02 E-value=5.1 Score=22.17 Aligned_cols=10 Identities=20% Similarity=1.162 Sum_probs=4.0
Q ss_pred eecCCCCccc
Q psy10075 262 FTCEHCDSKF 271 (373)
Q Consensus 262 ~~C~~C~~~f 271 (373)
..|+.||-.+
T Consensus 22 isC~~CGPr~ 31 (35)
T PF07503_consen 22 ISCTNCGPRY 31 (35)
T ss_dssp --BTTCC-SC
T ss_pred ccCCCCCCCE
Confidence 4566666544
No 182
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=33.65 E-value=46 Score=26.01 Aligned_cols=46 Identities=20% Similarity=0.409 Sum_probs=22.2
Q ss_pred eecCCCccccc--ChHHHhhhhhccCCCCCCCCeecCCCCccccChhH
Q psy10075 231 YHCTYCDYICS--SRTILKSHINKKHANDEDKPFTCEHCDSKFCKKYD 276 (373)
Q Consensus 231 ~~C~~C~~~f~--~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~ 276 (373)
-.|+.||-... +......-+-..+......-|.|+.||+.|-.-+.
T Consensus 98 ~RCp~CN~~L~~vs~eev~~~Vp~~~~~~~~~f~~C~~CgkiYW~GsH 145 (165)
T COG1656 98 SRCPECNGELEKVSREEVKEKVPEKVYRNYEEFYRCPKCGKIYWKGSH 145 (165)
T ss_pred ccCcccCCEeccCcHHHHhhccchhhhhcccceeECCCCcccccCchH
Confidence 35777765433 33332222111111122445778888888755443
No 183
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=32.89 E-value=24 Score=25.86 Aligned_cols=42 Identities=10% Similarity=0.237 Sum_probs=22.4
Q ss_pred hhhhhcccccceeecCCCCcceecCCCCccccChhhHhhccccCCCCCccccCcCcc
Q psy10075 5 DEILNHSRECAFVSRLDDSYNFICILCNYHAENFENIYLHLPSHFEQKLFSCDLCEY 61 (373)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 61 (373)
|.++.++.|.- ..-...+.|..|+..|..... ..+.|+.|+.
T Consensus 54 ~t~~egA~L~i----~~~p~~~~C~~Cg~~~~~~~~-----------~~~~CP~Cgs 95 (114)
T PRK03681 54 GTVAEGCKLHL----EEQEAECWCETCQQYVTLLTQ-----------RVRRCPQCHG 95 (114)
T ss_pred CCccCCCEEEE----EeeCcEEEcccCCCeeecCCc-----------cCCcCcCcCC
Confidence 33444444443 223346778888876654321 1146888873
No 184
>PHA02998 RNA polymerase subunit; Provisional
Probab=32.46 E-value=23 Score=27.89 Aligned_cols=11 Identities=18% Similarity=0.443 Sum_probs=5.7
Q ss_pred ccCCCCCCCcc
Q psy10075 319 YFCGSCNFSCT 329 (373)
Q Consensus 319 ~~C~~C~~~f~ 329 (373)
|.|..||..|.
T Consensus 172 YkC~~CG~~wk 182 (195)
T PHA02998 172 HACRDCKKHFK 182 (195)
T ss_pred EEcCCCCCccC
Confidence 55555555443
No 185
>KOG3408|consensus
Probab=31.95 E-value=21 Score=26.09 Aligned_cols=22 Identities=27% Similarity=0.413 Sum_probs=13.0
Q ss_pred CCccCCCCcccccchhhhhhhh
Q psy10075 289 KPFKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 289 ~~~~C~~C~~~f~~~~~l~~H~ 310 (373)
..|.|-.|.+.|.+...|..|.
T Consensus 56 GqfyCi~CaRyFi~~~~l~~H~ 77 (129)
T KOG3408|consen 56 GQFYCIECARYFIDAKALKTHF 77 (129)
T ss_pred ceeehhhhhhhhcchHHHHHHH
Confidence 3455666666666666666655
No 186
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=31.69 E-value=28 Score=18.26 Aligned_cols=10 Identities=30% Similarity=0.770 Sum_probs=3.5
Q ss_pred CCeecCCCCc
Q psy10075 260 KPFTCEHCDS 269 (373)
Q Consensus 260 ~~~~C~~C~~ 269 (373)
..|.|..|+.
T Consensus 14 ~~Y~C~~Cdf 23 (30)
T PF07649_consen 14 WFYRCSECDF 23 (30)
T ss_dssp -EEE-TTT--
T ss_pred ceEECccCCC
Confidence 3456666654
No 187
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=31.56 E-value=45 Score=20.67 Aligned_cols=13 Identities=15% Similarity=0.641 Sum_probs=9.5
Q ss_pred cccCCCCCCCccC
Q psy10075 318 KYFCGSCNFSCTS 330 (373)
Q Consensus 318 ~~~C~~C~~~f~~ 330 (373)
.|.|+.||..+.-
T Consensus 14 ~~~Cp~cGipthc 26 (55)
T PF13824_consen 14 NFECPDCGIPTHC 26 (55)
T ss_pred CCcCCCCCCcCcc
Confidence 4888888877653
No 188
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=30.99 E-value=26 Score=32.61 Aligned_cols=25 Identities=20% Similarity=0.555 Sum_probs=18.3
Q ss_pred ccCCCCCCCccChhHHHHHHHhhcc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNHI 343 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~ 343 (373)
..|+.|.+.|.+...+..|+...|.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~ 82 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHP 82 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhh
Confidence 6777777777777777777776664
No 189
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=30.87 E-value=34 Score=31.87 Aligned_cols=28 Identities=14% Similarity=0.268 Sum_probs=19.5
Q ss_pred CcccccccccccCHHHHHHHHhhhccCC
Q psy10075 141 PHKCTICESTYLTKVELKNHILKIHQGQ 168 (373)
Q Consensus 141 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 168 (373)
-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 3567777777777777777776666543
No 190
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=30.78 E-value=38 Score=25.32 Aligned_cols=13 Identities=23% Similarity=0.846 Sum_probs=7.1
Q ss_pred cccCccccccCCc
Q psy10075 83 LECSLCDSTFSDN 95 (373)
Q Consensus 83 ~~C~~C~~~f~~~ 95 (373)
-+||.|...|.+.
T Consensus 122 pvCPvCkTSFKss 134 (140)
T PF05290_consen 122 PVCPVCKTSFKSS 134 (140)
T ss_pred CCCCccccccccc
Confidence 3466666555543
No 191
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=30.75 E-value=32 Score=24.61 Aligned_cols=13 Identities=23% Similarity=0.720 Sum_probs=6.8
Q ss_pred cccCCCCCCCccC
Q psy10075 318 KYFCGSCNFSCTS 330 (373)
Q Consensus 318 ~~~C~~C~~~f~~ 330 (373)
.|.|+.|+..+..
T Consensus 19 ~~iCpeC~~EW~~ 31 (109)
T TIGR00686 19 QLICPSCLYEWNE 31 (109)
T ss_pred eeECccccccccc
Confidence 3555555555543
No 192
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=30.48 E-value=39 Score=20.76 Aligned_cols=15 Identities=13% Similarity=0.656 Sum_probs=9.2
Q ss_pred eecCCCCccccChhH
Q psy10075 262 FTCEHCDSKFCKKYD 276 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~ 276 (373)
++|+.||..|.....
T Consensus 29 W~C~~Cgh~w~~~v~ 43 (55)
T PF14311_consen 29 WKCPKCGHEWKASVN 43 (55)
T ss_pred EECCCCCCeeEccHh
Confidence 567777766655443
No 193
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=30.29 E-value=30 Score=20.74 Aligned_cols=8 Identities=38% Similarity=1.261 Sum_probs=3.7
Q ss_pred ccCCCCcc
Q psy10075 291 FKCSFCEY 298 (373)
Q Consensus 291 ~~C~~C~~ 298 (373)
+.|..||+
T Consensus 38 ~~CGkCgy 45 (51)
T COG1998 38 WACGKCGY 45 (51)
T ss_pred eEeccccc
Confidence 44444444
No 194
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=30.23 E-value=33 Score=26.36 Aligned_cols=11 Identities=27% Similarity=1.274 Sum_probs=5.5
Q ss_pred CeecCCCCccc
Q psy10075 261 PFTCEHCDSKF 271 (373)
Q Consensus 261 ~~~C~~C~~~f 271 (373)
.|.|..|+..+
T Consensus 112 ~y~C~~C~~~~ 122 (146)
T smart00731 112 PYRCTGCGQRY 122 (146)
T ss_pred EEECCCCCCCC
Confidence 35555555443
No 195
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=29.69 E-value=7.4 Score=26.46 Aligned_cols=13 Identities=23% Similarity=0.473 Sum_probs=7.4
Q ss_pred ccCCCCCCCccCh
Q psy10075 319 YFCGSCNFSCTSN 331 (373)
Q Consensus 319 ~~C~~C~~~f~~~ 331 (373)
..|.+|+..|...
T Consensus 47 ~~C~~Cg~~~~~~ 59 (81)
T PF05129_consen 47 LSCRVCGESFQTK 59 (81)
T ss_dssp EEESSS--EEEEE
T ss_pred EEecCCCCeEEEc
Confidence 6777777776644
No 196
>KOG1606|consensus
Probab=29.40 E-value=27 Score=28.33 Aligned_cols=30 Identities=17% Similarity=0.232 Sum_probs=25.8
Q ss_pred CchhhhhhhcccccceeecCCCCcceecCC
Q psy10075 1 MKQEDEILNHSRECAFVSRLDDSYNFICIL 30 (373)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~ 30 (373)
|-++.+.|+.+++..|+.-|.-.-||.|.-
T Consensus 103 YiDESEvlt~AD~~hhI~KhnFkvPFvCG~ 132 (296)
T KOG1606|consen 103 YIDESEVLTPADWDHHIEKHNFKVPFVCGC 132 (296)
T ss_pred ccchhhhcccccccchhhhhcCcCceeecc
Confidence 457789999999999999999999998753
No 197
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=28.56 E-value=38 Score=31.01 Aligned_cols=14 Identities=21% Similarity=0.672 Sum_probs=9.2
Q ss_pred CccCCCCcccccch
Q psy10075 290 PFKCSFCEYKCTTK 303 (373)
Q Consensus 290 ~~~C~~C~~~f~~~ 303 (373)
-|+|+.|++.+...
T Consensus 367 g~rC~kCg~~~~~~ 380 (421)
T COG1571 367 GFRCKKCGTRARET 380 (421)
T ss_pred CcccccccccCCcc
Confidence 57777777766543
No 198
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=28.48 E-value=22 Score=27.21 Aligned_cols=13 Identities=23% Similarity=0.621 Sum_probs=6.1
Q ss_pred ceecCCCCccccCh
Q psy10075 25 NFICILCNYHAENF 38 (373)
Q Consensus 25 ~~~C~~C~~~f~~~ 38 (373)
+|.|. |+..+.++
T Consensus 117 ~Y~C~-C~q~~l~~ 129 (156)
T COG3091 117 PYRCQ-CQQHYLRI 129 (156)
T ss_pred eEEee-cCCccchh
Confidence 44455 55544443
No 199
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=28.43 E-value=47 Score=20.63 Aligned_cols=19 Identities=21% Similarity=0.391 Sum_probs=14.0
Q ss_pred ccCCCCcccccchhhhhhh
Q psy10075 291 FKCSFCEYKCTTKFGLKSH 309 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H 309 (373)
+.|-+||..|.+...|..+
T Consensus 28 ~YC~~Cg~~Y~d~~dL~~~ 46 (55)
T PF13821_consen 28 NYCFWCGTKYDDEEDLERN 46 (55)
T ss_pred ceeeeeCCccCCHHHHHhC
Confidence 5678888888777777654
No 200
>COG1773 Rubredoxin [Energy production and conversion]
Probab=28.17 E-value=29 Score=21.52 Aligned_cols=13 Identities=15% Similarity=0.552 Sum_probs=8.7
Q ss_pred CeecCCCCccccC
Q psy10075 261 PFTCEHCDSKFCK 273 (373)
Q Consensus 261 ~~~C~~C~~~f~~ 273 (373)
.|+|.+||..|.-
T Consensus 3 ~~~C~~CG~vYd~ 15 (55)
T COG1773 3 RWRCSVCGYVYDP 15 (55)
T ss_pred ceEecCCceEecc
Confidence 4677777777654
No 201
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=27.70 E-value=32 Score=26.35 Aligned_cols=16 Identities=19% Similarity=0.166 Sum_probs=11.3
Q ss_pred ccCCCCCCCccChhHH
Q psy10075 319 YFCGSCNFSCTSNTDL 334 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l 334 (373)
-.|+.|+..|++....
T Consensus 29 ReC~~C~~RFTTfE~~ 44 (156)
T COG1327 29 RECLECGERFTTFERA 44 (156)
T ss_pred hcccccccccchhhee
Confidence 5788888888776543
No 202
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=27.28 E-value=42 Score=32.03 Aligned_cols=34 Identities=21% Similarity=0.463 Sum_probs=14.3
Q ss_pred eecCCCCccccChhhHhhccccCCCCCccccCcCc
Q psy10075 26 FICILCNYHAENFENIYLHLPSHFEQKLFSCDLCE 60 (373)
Q Consensus 26 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 60 (373)
+.|..||....-.. -...+..|.......|..||
T Consensus 214 ~~C~~Cg~~~~C~~-C~~~l~~h~~~~~l~Ch~Cg 247 (505)
T TIGR00595 214 LLCRSCGYILCCPN-CDVSLTYHKKEGKLRCHYCG 247 (505)
T ss_pred eEhhhCcCccCCCC-CCCceEEecCCCeEEcCCCc
Confidence 34555555543321 11122233333445566665
No 203
>KOG1842|consensus
Probab=27.08 E-value=27 Score=31.89 Aligned_cols=30 Identities=17% Similarity=0.299 Sum_probs=23.4
Q ss_pred CCCeecCCCCccccChhHHHHhhhhhccCC
Q psy10075 259 DKPFTCEHCDSKFCKKYDMKMHMSRAHYGL 288 (373)
Q Consensus 259 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 288 (373)
...|.||+|...|.+...|..|.-..|..+
T Consensus 13 ~egflCPiC~~dl~~~~~L~~H~d~eH~~e 42 (505)
T KOG1842|consen 13 LEGFLCPICLLDLPNLSALNDHLDVEHFEE 42 (505)
T ss_pred hhcccCchHhhhhhhHHHHHHHHhhhcccc
Confidence 456888888888888888888886667553
No 204
>PRK10220 hypothetical protein; Provisional
Probab=27.07 E-value=47 Score=23.85 Aligned_cols=12 Identities=25% Similarity=0.838 Sum_probs=6.2
Q ss_pred ccCCCCCCCccC
Q psy10075 319 YFCGSCNFSCTS 330 (373)
Q Consensus 319 ~~C~~C~~~f~~ 330 (373)
|.|+.|+..|..
T Consensus 21 ~vCpeC~hEW~~ 32 (111)
T PRK10220 21 YICPECAHEWND 32 (111)
T ss_pred EECCcccCcCCc
Confidence 555555555543
No 205
>KOG4124|consensus
Probab=26.87 E-value=14 Score=32.29 Aligned_cols=24 Identities=29% Similarity=0.527 Sum_probs=17.8
Q ss_pred ccceecC--CCcccccChHHHhhhhh
Q psy10075 228 EAEYHCT--YCDYICSSRTILKSHIN 251 (373)
Q Consensus 228 ~~~~~C~--~C~~~f~~~~~l~~H~~ 251 (373)
.++|.|. .|.+.+.+...|..|..
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~ 372 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKL 372 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccc
Confidence 3568885 48888888878877754
No 206
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=26.54 E-value=46 Score=27.84 Aligned_cols=16 Identities=31% Similarity=0.721 Sum_probs=8.6
Q ss_pred CCCceecccccccccc
Q psy10075 167 GQTRFQCSFCDRRFYF 182 (373)
Q Consensus 167 ~~~~~~C~~C~~~f~~ 182 (373)
|.-.|.|+.|+..|+.
T Consensus 152 G~aef~C~~C~h~F~G 167 (278)
T PF15135_consen 152 GIAEFHCPKCRHNFRG 167 (278)
T ss_pred ceeeeecccccccchh
Confidence 4445555555555553
No 207
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=26.04 E-value=31 Score=21.24 Aligned_cols=12 Identities=25% Similarity=0.506 Sum_probs=7.1
Q ss_pred CCeecCCCCccc
Q psy10075 260 KPFTCEHCDSKF 271 (373)
Q Consensus 260 ~~~~C~~C~~~f 271 (373)
.+..|++||...
T Consensus 3 ~Wi~CP~CgnKT 14 (55)
T PF14205_consen 3 EWILCPICGNKT 14 (55)
T ss_pred eEEECCCCCCcc
Confidence 355677777543
No 208
>TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF. A previously described regulatory effect of HypF mutatation is attributable to loss of activity of a regulatory hydrogenase. A zinc finger-like region CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported the regulatory hypothesis. However, more recent work (PUBMED:11375153) shows the direct effect is on the activity of expressed hydrogenases with nickel/iron centers, rather than on expression.
Probab=25.92 E-value=7.9 Score=38.43 Aligned_cols=27 Identities=22% Similarity=0.539 Sum_probs=13.7
Q ss_pred CCCCcccccchhhhhhhhhhccCCCcccCCCCC
Q psy10075 293 CSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCN 325 (373)
Q Consensus 293 C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~ 325 (373)
|+.|.+.+.+..+-+.|. ++..|+.||
T Consensus 121 C~~C~~ey~~p~~rr~h~------~~~~C~~Cg 147 (711)
T TIGR00143 121 CPDCAKEYKDPLDRRFHA------QPIACPRCG 147 (711)
T ss_pred CHHHHHHhcCCccccCCC------CCccCCCCC
Confidence 555555555544444443 344566665
No 209
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=25.81 E-value=77 Score=20.43 Aligned_cols=10 Identities=30% Similarity=1.062 Sum_probs=4.9
Q ss_pred ccCCCCCCCc
Q psy10075 319 YFCGSCNFSC 328 (373)
Q Consensus 319 ~~C~~C~~~f 328 (373)
|.|+.||...
T Consensus 47 ~~C~~Cg~~~ 56 (69)
T PF07282_consen 47 FTCPNCGFEM 56 (69)
T ss_pred EEcCCCCCEE
Confidence 4455555443
No 210
>KOG4118|consensus
Probab=25.68 E-value=34 Score=21.82 Aligned_cols=27 Identities=15% Similarity=0.279 Sum_probs=22.9
Q ss_pred ccCCCCCCCccChhHHHHHHHhhcccc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNHIIV 345 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~~~ 345 (373)
|+|.+|-...+.+..+..|..+.|++.
T Consensus 39 ~kCtVC~~~mpdpktfkqhfe~kHpk~ 65 (74)
T KOG4118|consen 39 HKCTVCMVQMPDPKTFKQHFENKHPKE 65 (74)
T ss_pred hhhHhhHhhCCCCchHHHHHhhcCCCC
Confidence 889999988888888999988888644
No 211
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=25.62 E-value=40 Score=19.12 Aligned_cols=14 Identities=21% Similarity=0.294 Sum_probs=11.3
Q ss_pred ceecCCCCccccCh
Q psy10075 25 NFICILCNYHAENF 38 (373)
Q Consensus 25 ~~~C~~C~~~f~~~ 38 (373)
||.|..|+..|=..
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 88999999988443
No 212
>KOG1842|consensus
Probab=24.93 E-value=54 Score=30.02 Aligned_cols=29 Identities=21% Similarity=0.498 Sum_probs=21.5
Q ss_pred ccCCCCCCCccChhHHHHHHHhhcccccc
Q psy10075 319 YFCGSCNFSCTSNTDLLEHVQTNHIIVGE 347 (373)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~ 347 (373)
|.||+|..-|.+...|..|+-..|+.-..
T Consensus 16 flCPiC~~dl~~~~~L~~H~d~eH~~ed~ 44 (505)
T KOG1842|consen 16 FLCPICLLDLPNLSALNDHLDVEHFEEDE 44 (505)
T ss_pred ccCchHhhhhhhHHHHHHHHhhhccccch
Confidence 77888888888877888887777765443
No 213
>KOG3214|consensus
Probab=24.83 E-value=31 Score=24.14 Aligned_cols=40 Identities=23% Similarity=0.497 Sum_probs=20.3
Q ss_pred CCccCCCCcccccchhhhhhhhhhccCCCcccCCCCCCCccChh
Q psy10075 289 KPFKCSFCEYKCTTKFGLKSHVTRIHSNQKYFCGSCNFSCTSNT 332 (373)
Q Consensus 289 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~ 332 (373)
..|.|+.|.-.-+ ..--+...+......|.+|+.+|....
T Consensus 22 t~FnClfcnHek~----v~~~~Dk~~~iG~~sC~iC~esFqt~i 61 (109)
T KOG3214|consen 22 TQFNCLFCNHEKS----VSCTLDKKHNIGKASCRICEESFQTTI 61 (109)
T ss_pred eeeccCccccccc----eeeeehhhcCcceeeeeehhhhhccch
Confidence 3466666644321 111122333344466788877777644
No 214
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=24.80 E-value=54 Score=25.21 Aligned_cols=15 Identities=13% Similarity=0.470 Sum_probs=9.6
Q ss_pred CeecCCCCccccChh
Q psy10075 261 PFTCEHCDSKFCKKY 275 (373)
Q Consensus 261 ~~~C~~C~~~f~~~~ 275 (373)
-+.|+.||+.|-.-+
T Consensus 124 f~~C~~C~kiyW~Gs 138 (147)
T PF01927_consen 124 FWRCPGCGKIYWEGS 138 (147)
T ss_pred EEECCCCCCEecccc
Confidence 457777777775443
No 215
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=24.58 E-value=13 Score=20.72 Aligned_cols=11 Identities=27% Similarity=0.616 Sum_probs=6.5
Q ss_pred hhhhHhhhccc
Q psy10075 361 NAMYHCQITLN 371 (373)
Q Consensus 361 ~~~~~C~~C~~ 371 (373)
...|+|..||+
T Consensus 4 ~~~YkC~~CGn 14 (36)
T PF06397_consen 4 GEFYKCEHCGN 14 (36)
T ss_dssp TEEEE-TTT--
T ss_pred ccEEEccCCCC
Confidence 36799999996
No 216
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=24.32 E-value=49 Score=24.35 Aligned_cols=17 Identities=12% Similarity=0.206 Sum_probs=11.6
Q ss_pred CCcceecCCCCccccCh
Q psy10075 22 DSYNFICILCNYHAENF 38 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~ 38 (373)
....+.|..|+..|...
T Consensus 68 vp~~~~C~~Cg~~~~~~ 84 (117)
T PRK00564 68 EKVELECKDCSHVFKPN 84 (117)
T ss_pred cCCEEEhhhCCCccccC
Confidence 34467788888777654
No 217
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=24.14 E-value=37 Score=19.63 Aligned_cols=16 Identities=19% Similarity=0.397 Sum_probs=11.5
Q ss_pred eecCCCCccccChhHH
Q psy10075 262 FTCEHCDSKFCKKYDM 277 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L 277 (373)
-.|++|++.|+-....
T Consensus 9 K~C~~C~rpf~WRKKW 24 (42)
T PF10013_consen 9 KICPVCGRPFTWRKKW 24 (42)
T ss_pred CcCcccCCcchHHHHH
Confidence 4689999988765443
No 218
>PRK04351 hypothetical protein; Provisional
Probab=24.03 E-value=46 Score=25.72 Aligned_cols=12 Identities=33% Similarity=0.786 Sum_probs=6.3
Q ss_pred CCccCCCCcccc
Q psy10075 289 KPFKCSFCEYKC 300 (373)
Q Consensus 289 ~~~~C~~C~~~f 300 (373)
..|.|..|+-.+
T Consensus 131 ~~yrCg~C~g~L 142 (149)
T PRK04351 131 KRYRCGKCRGKL 142 (149)
T ss_pred CcEEeCCCCcEe
Confidence 445566555444
No 219
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=23.21 E-value=46 Score=18.28 Aligned_cols=8 Identities=25% Similarity=0.874 Sum_probs=4.1
Q ss_pred eecCCCCc
Q psy10075 262 FTCEHCDS 269 (373)
Q Consensus 262 ~~C~~C~~ 269 (373)
+.|+.||.
T Consensus 23 ~vC~~Cg~ 30 (34)
T PF14803_consen 23 LVCPACGF 30 (34)
T ss_dssp EEETTTTE
T ss_pred eECCCCCC
Confidence 45555554
No 220
>KOG4167|consensus
Probab=23.16 E-value=14 Score=35.88 Aligned_cols=27 Identities=19% Similarity=0.229 Sum_probs=20.0
Q ss_pred CcceecCCCCccccChhhHhhccccCC
Q psy10075 23 SYNFICILCNYHAENFENIYLHLPSHF 49 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 49 (373)
++-|-|.+|++.|..+.++..||+.|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 455778888888877777777777774
No 221
>PTZ00448 hypothetical protein; Provisional
Probab=23.16 E-value=57 Score=29.21 Aligned_cols=24 Identities=17% Similarity=0.527 Sum_probs=20.8
Q ss_pred cccCCCCCCCccChhHHHHHHHhh
Q psy10075 318 KYFCGSCNFSCTSNTDLLEHVQTN 341 (373)
Q Consensus 318 ~~~C~~C~~~f~~~~~l~~H~~~~ 341 (373)
.|.|..|+..|.+....+.|+++-
T Consensus 314 ~~tC~~C~v~F~~~~~qR~H~KSD 337 (373)
T PTZ00448 314 MLLCRKCNIQLMDHNAFKQHYRSE 337 (373)
T ss_pred CccccccccccCCHHHHHHHhhhh
Confidence 488999999999889999998865
No 222
>PF14445 Prok-RING_2: Prokaryotic RING finger family 2
Probab=23.03 E-value=16 Score=21.86 Aligned_cols=15 Identities=27% Similarity=0.439 Sum_probs=9.6
Q ss_pred cceecCCCCccccCh
Q psy10075 24 YNFICILCNYHAENF 38 (373)
Q Consensus 24 ~~~~C~~C~~~f~~~ 38 (373)
..|.|++|+-+|+-.
T Consensus 6 sry~CDLCn~~~p~~ 20 (57)
T PF14445_consen 6 SRYSCDLCNSSHPIS 20 (57)
T ss_pred hhHhHHhhcccCcHH
Confidence 456777777666543
No 223
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=22.98 E-value=72 Score=25.63 Aligned_cols=16 Identities=25% Similarity=0.567 Sum_probs=10.0
Q ss_pred CcceecCCCCccccCh
Q psy10075 23 SYNFICILCNYHAENF 38 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~ 38 (373)
+-||.|.+|.+.|.+.
T Consensus 194 ~IPF~C~iCKkdy~sp 209 (259)
T COG5152 194 KIPFLCGICKKDYESP 209 (259)
T ss_pred CCceeehhchhhccch
Confidence 4566677776666554
No 224
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=22.95 E-value=29 Score=24.53 Aligned_cols=26 Identities=15% Similarity=0.128 Sum_probs=18.9
Q ss_pred CCcceecCCCCccccChhhHhhcccc
Q psy10075 22 DSYNFICILCNYHAENFENIYLHLPS 47 (373)
Q Consensus 22 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 47 (373)
+.+.+-|-+|.+.|.+...|..|.+.
T Consensus 52 GlGqhYCieCaryf~t~~aL~~Hkkg 77 (126)
T COG5112 52 GLGQHYCIECARYFITEKALMEHKKG 77 (126)
T ss_pred CCceeeeehhHHHHHHHHHHHHHhcc
Confidence 55667788888888877777777653
No 225
>PF04328 DUF466: Protein of unknown function (DUF466); InterPro: IPR007423 This is a small bacterial protein of unknown function.
Probab=22.51 E-value=1.1e+02 Score=19.78 Aligned_cols=30 Identities=20% Similarity=0.241 Sum_probs=21.5
Q ss_pred cChhHHHHHHHhhcccccccccchhHHHHH
Q psy10075 329 TSNTDLLEHVQTNHIIVGESSSRDFIIHSF 358 (373)
Q Consensus 329 ~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~ 358 (373)
+......+|++.+|+.....+.+.|...+.
T Consensus 23 ~~Ye~Yv~H~~~~HP~~p~ms~~eF~r~r~ 52 (65)
T PF04328_consen 23 PDYERYVEHMRRHHPDEPPMSEREFFRERQ 52 (65)
T ss_pred HHHHHHHHHHHHHCcCCCCCCHHHHHHHHH
Confidence 345678889999998776666667766653
No 226
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=22.10 E-value=45 Score=19.32 Aligned_cols=14 Identities=21% Similarity=0.306 Sum_probs=9.2
Q ss_pred cceecCCCCccccC
Q psy10075 24 YNFICILCNYHAEN 37 (373)
Q Consensus 24 ~~~~C~~C~~~f~~ 37 (373)
-|+.|+.|+..|=.
T Consensus 12 ~~~~C~~C~~~FC~ 25 (43)
T PF01428_consen 12 LPFKCKHCGKSFCL 25 (43)
T ss_dssp SHEE-TTTS-EE-T
T ss_pred CCeECCCCCcccCc
Confidence 58999999999843
No 227
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=21.97 E-value=54 Score=24.41 Aligned_cols=15 Identities=20% Similarity=0.428 Sum_probs=8.8
Q ss_pred CcceecCCCCccccCh
Q psy10075 23 SYNFICILCNYHAENF 38 (373)
Q Consensus 23 ~~~~~C~~C~~~f~~~ 38 (373)
...+.| .|+..|...
T Consensus 68 p~~~~C-~Cg~~~~~~ 82 (124)
T PRK00762 68 PVEIEC-ECGYEGVVD 82 (124)
T ss_pred CeeEEe-eCcCccccc
Confidence 335667 777666543
No 228
>TIGR03831 YgiT_finger YgiT-type zinc finger domain. This domain model describes a small domain with two copies of a putative zinc-binding motif CXXC (usually CXXCG). Most member proteins consist largely of this domain or else carry an additional C-terminal helix-turn-helix domain, resembling that of the phage protein Cro and modeled by pfam01381.
Probab=21.93 E-value=39 Score=19.52 Aligned_cols=12 Identities=17% Similarity=0.476 Sum_probs=7.8
Q ss_pred ccCCCCCCCccC
Q psy10075 319 YFCGSCNFSCTS 330 (373)
Q Consensus 319 ~~C~~C~~~f~~ 330 (373)
+.|+.||..+.+
T Consensus 33 ~~C~~CGE~~~~ 44 (46)
T TIGR03831 33 LVCPQCGEEYLD 44 (46)
T ss_pred cccccCCCEeeC
Confidence 667777766543
No 229
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=21.84 E-value=33 Score=21.31 Aligned_cols=9 Identities=33% Similarity=1.143 Sum_probs=4.2
Q ss_pred ecccccccc
Q psy10075 172 QCSFCDRRF 180 (373)
Q Consensus 172 ~C~~C~~~f 180 (373)
+|.+|++.+
T Consensus 4 kC~lCdk~~ 12 (56)
T PF09963_consen 4 KCILCDKKE 12 (56)
T ss_pred EEEecCCEE
Confidence 344555443
No 230
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=21.18 E-value=67 Score=17.52 Aligned_cols=19 Identities=16% Similarity=0.433 Sum_probs=9.7
Q ss_pred ccCCCCcccccchhhhhhhh
Q psy10075 291 FKCSFCEYKCTTKFGLKSHV 310 (373)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~ 310 (373)
+.|+.|++.. ..+-+..|+
T Consensus 5 ~~C~nC~R~v-~a~RfA~HL 23 (33)
T PF08209_consen 5 VECPNCGRPV-AASRFAPHL 23 (33)
T ss_dssp EE-TTTSSEE-EGGGHHHHH
T ss_pred EECCCCcCCc-chhhhHHHH
Confidence 4566666654 344555555
No 231
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=21.07 E-value=36 Score=22.04 Aligned_cols=9 Identities=44% Similarity=1.099 Sum_probs=2.9
Q ss_pred ecCCCCccc
Q psy10075 263 TCEHCDSKF 271 (373)
Q Consensus 263 ~C~~C~~~f 271 (373)
.|..|++.|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 466666666
No 232
>smart00132 LIM Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
Probab=21.03 E-value=54 Score=17.75 Aligned_cols=10 Identities=30% Similarity=0.803 Sum_probs=5.2
Q ss_pred cccCcCcccc
Q psy10075 54 FSCDLCEYQT 63 (373)
Q Consensus 54 ~~C~~C~~~f 63 (373)
|.|..|+..+
T Consensus 28 f~C~~C~~~L 37 (39)
T smart00132 28 FKCSKCGKPL 37 (39)
T ss_pred CCCcccCCcC
Confidence 5555555443
No 233
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=20.90 E-value=88 Score=17.96 Aligned_cols=18 Identities=17% Similarity=0.337 Sum_probs=10.6
Q ss_pred ecCCCCcceecCCCCccc
Q psy10075 18 SRLDDSYNFICILCNYHA 35 (373)
Q Consensus 18 ~~~~~~~~~~C~~C~~~f 35 (373)
.+..+.++..|+-|+..|
T Consensus 22 ~l~~~~~~~~CpYCg~~y 39 (40)
T PF10276_consen 22 NLDDEPGPVVCPYCGTRY 39 (40)
T ss_dssp E-TTTTCEEEETTTTEEE
T ss_pred ecCCCCCeEECCCCCCEE
Confidence 333333567788887766
No 234
>PF03966 Trm112p: Trm112p-like protein; InterPro: IPR005651 This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest conservation. The function of this family is uncertain. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest conservation. The entry contains 2 families: Trm112, which is required for tRNA methylation in Saccharomyces cerevisiae (Baker's yeast) and is found in complexes with 2 tRNA methylases (TRM9 and TRM11) also with putative methyltransferase YDR140W []. The zinc-finger protein Ynr046w is plurifunctional and a component of the eRF1 methyltransferase in yeast []. The crystal structure of Ynr046w has been determined to 1.7 A resolution. It comprises a zinc-binding domain built from both the N- and C-terminal sequences and an inserted domain, absent from bacterial and archaeal orthologs of the protein, composed of three alpha-helices []. UPF0434, which are proteins that are functionally uncharacterised. ; PDB: 3Q87_A 2KPI_A 2K5R_A 2HF1_A 2JS4_A 2J6A_A 2JR6_A 2PK7_A 2JNY_A.
Probab=20.88 E-value=49 Score=21.46 Aligned_cols=18 Identities=17% Similarity=0.104 Sum_probs=14.7
Q ss_pred cCCCCcceecCCCCcccc
Q psy10075 19 RLDDSYNFICILCNYHAE 36 (373)
Q Consensus 19 ~~~~~~~~~C~~C~~~f~ 36 (373)
.+..++.+.|+.|+..|+
T Consensus 47 ~~i~eg~L~Cp~c~r~YP 64 (68)
T PF03966_consen 47 VEIVEGELICPECGREYP 64 (68)
T ss_dssp EETTTTEEEETTTTEEEE
T ss_pred ccccCCEEEcCCCCCEEe
Confidence 456778999999999885
No 235
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=20.84 E-value=44 Score=24.96 Aligned_cols=21 Identities=38% Similarity=0.751 Sum_probs=10.7
Q ss_pred ecCCCCccccChhHHHHhhhhhccC
Q psy10075 263 TCEHCDSKFCKKYDMKMHMSRAHYG 287 (373)
Q Consensus 263 ~C~~C~~~f~~~~~L~~H~~~~H~~ 287 (373)
+|-.+|+.|+ .|.+|+ ..|.+
T Consensus 78 icLEDGkkfK---SLKRHL-~t~~g 98 (148)
T COG4957 78 ICLEDGKKFK---SLKRHL-TTHYG 98 (148)
T ss_pred EEeccCcchH---HHHHHH-hcccC
Confidence 4555565553 355555 44433
No 236
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=20.53 E-value=58 Score=19.96 Aligned_cols=9 Identities=33% Similarity=0.951 Sum_probs=4.1
Q ss_pred cCCCCcccc
Q psy10075 264 CEHCDSKFC 272 (373)
Q Consensus 264 C~~C~~~f~ 272 (373)
|..|++.|.
T Consensus 5 C~~C~~~F~ 13 (57)
T cd00065 5 CMGCGKPFT 13 (57)
T ss_pred CcccCcccc
Confidence 444444443
No 237
>PF05741 zf-nanos: Nanos RNA binding domain; InterPro: IPR024161 Nanos is a highly conserved RNA-binding protein in higher eukaryotes and functions as a key regulatory protein in translational control using a 3' untranslated region during the development and maintenance of germ cells. Nanos comprises a non-conserved amino-terminus and highly conserved carboxy- terminal regions. The C-terminal region has two conserved Cys-Cys-His-Cys (CCHC)-type zinc-finger motifs that are indispensable for nanos function [, , ]. The structure of the nanos-type zinc finger is composed of two independent zinc-finger (ZF) lobes, the N-terminal ZF1 and the C-terminal ZF2, which are connected by a linker helix []. These lobes create a large cleft. Zinc ions in ZF1 and ZF2 are bound to the CCHC motif by tetrahedral coordination.; PDB: 3ALR_B.
Probab=20.36 E-value=21 Score=22.16 Aligned_cols=9 Identities=33% Similarity=1.003 Sum_probs=2.0
Q ss_pred ccCCCCCCC
Q psy10075 319 YFCGSCNFS 327 (373)
Q Consensus 319 ~~C~~C~~~ 327 (373)
|.|++||..
T Consensus 34 y~Cp~CgAt 42 (55)
T PF05741_consen 34 YVCPICGAT 42 (55)
T ss_dssp ---TTT---
T ss_pred CcCCCCcCc
Confidence 566666643
No 238
>KOG0717|consensus
Probab=20.09 E-value=53 Score=30.36 Aligned_cols=19 Identities=26% Similarity=0.627 Sum_probs=0.0
Q ss_pred eecCCCCccccChhHHHHh
Q psy10075 262 FTCEHCDSKFCKKYDMKMH 280 (373)
Q Consensus 262 ~~C~~C~~~f~~~~~L~~H 280 (373)
+-|.+|+++|.+...|..|
T Consensus 293 lyC~vCnKsFKseKq~kNH 311 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNH 311 (508)
T ss_pred eEEeeccccccchHHHHhh
Done!