Query psy8251
Match_columns 434
No_of_seqs 394 out of 2526
Neff 9.6
Searched_HMMs 46136
Date Fri Aug 16 22:35:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8251.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8251hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 99.9 1.3E-27 2.7E-32 234.7 6.4 124 61-196 605-735 (958)
2 KOG2462|consensus 99.9 7.4E-27 1.6E-31 203.7 3.9 131 1-139 132-265 (279)
3 KOG2462|consensus 99.9 1.8E-25 4E-30 195.0 4.4 138 25-192 128-265 (279)
4 KOG1074|consensus 99.9 3.2E-25 6.9E-30 217.9 3.8 87 89-196 605-694 (958)
5 KOG3608|consensus 99.9 1.5E-23 3.2E-28 187.2 8.1 163 2-194 182-345 (467)
6 KOG3608|consensus 99.9 2.3E-22 4.9E-27 179.7 7.3 167 2-201 139-322 (467)
7 KOG3623|consensus 99.8 1.5E-21 3.2E-26 189.1 5.2 109 1-111 212-331 (1007)
8 KOG3623|consensus 99.8 1.2E-19 2.6E-24 176.0 4.8 107 30-139 212-331 (1007)
9 KOG3576|consensus 99.6 5.7E-16 1.2E-20 128.7 1.7 125 58-197 114-239 (267)
10 KOG3576|consensus 99.6 8.3E-16 1.8E-20 127.7 2.5 116 24-145 114-240 (267)
11 PLN03086 PRLI-interacting fact 99.0 2.9E-10 6.3E-15 112.4 5.0 100 28-140 454-563 (567)
12 PLN03086 PRLI-interacting fact 99.0 1.2E-09 2.7E-14 108.0 8.3 142 1-192 409-562 (567)
13 PHA00733 hypothetical protein 98.9 8.1E-10 1.8E-14 89.2 3.0 83 58-142 37-124 (128)
14 PHA00733 hypothetical protein 98.8 2.8E-09 6.1E-14 86.1 4.6 97 13-114 26-124 (128)
15 PHA02768 hypothetical protein; 98.6 1.2E-08 2.7E-13 67.5 0.9 43 61-105 5-47 (55)
16 PHA02768 hypothetical protein; 98.5 3.6E-08 7.8E-13 65.4 2.1 42 90-133 6-47 (55)
17 KOG3993|consensus 98.4 4.3E-08 9.3E-13 91.0 0.6 140 1-142 269-483 (500)
18 KOG3993|consensus 98.4 1.5E-07 3.2E-12 87.6 3.7 101 89-196 267-382 (500)
19 PF13465 zf-H2C2_2: Zinc-finge 98.4 2E-07 4.4E-12 52.6 1.8 24 105-128 2-25 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.3 5.1E-07 1.1E-11 51.0 2.3 25 76-100 1-25 (26)
21 PHA00616 hypothetical protein 98.2 6.2E-07 1.4E-11 56.5 1.3 31 311-341 1-31 (44)
22 PHA00616 hypothetical protein 98.0 4E-06 8.6E-11 52.9 1.8 32 62-93 2-33 (44)
23 PF00096 zf-C2H2: Zinc finger, 97.9 3.9E-06 8.4E-11 46.0 0.9 23 312-334 1-23 (23)
24 PHA00732 hypothetical protein 97.8 1.2E-05 2.6E-10 58.8 2.9 18 31-48 4-21 (79)
25 PF05605 zf-Di19: Drought indu 97.8 3.7E-05 8E-10 52.1 4.7 51 28-85 3-53 (54)
26 PHA00732 hypothetical protein 97.6 3.7E-05 7.9E-10 56.3 2.6 43 90-138 2-45 (79)
27 PF13894 zf-C2H2_4: C2H2-type 97.6 2.9E-05 6.2E-10 42.8 1.3 24 312-335 1-24 (24)
28 PF00096 zf-C2H2: Zinc finger, 97.5 4.3E-05 9.4E-10 41.7 1.5 23 118-140 1-23 (23)
29 PF13912 zf-C2H2_6: C2H2-type 97.5 3.1E-05 6.8E-10 44.2 0.6 26 311-336 1-26 (27)
30 PF05605 zf-Di19: Drought indu 97.4 0.00018 3.9E-09 48.7 3.4 47 62-111 3-51 (54)
31 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00027 5.8E-09 40.3 2.1 26 117-142 1-26 (27)
32 PF13894 zf-C2H2_4: C2H2-type 97.2 0.00024 5.1E-09 39.0 1.6 24 118-141 1-24 (24)
33 PF12756 zf-C2H2_2: C2H2 type 97.0 0.00049 1.1E-08 53.2 3.1 24 61-84 50-73 (100)
34 smart00355 ZnF_C2H2 zinc finge 97.0 0.00048 1E-08 38.5 2.1 24 312-335 1-24 (26)
35 PF12756 zf-C2H2_2: C2H2 type 97.0 0.00037 8E-09 53.9 2.0 23 117-139 50-72 (100)
36 COG5189 SFP1 Putative transcri 96.9 0.00036 7.8E-09 63.0 1.0 70 59-138 347-419 (423)
37 PF09237 GAGA: GAGA factor; I 96.7 0.0012 2.7E-08 42.6 2.2 33 306-338 19-51 (54)
38 KOG2231|consensus 96.5 0.0052 1.1E-07 62.5 6.4 127 1-138 101-260 (669)
39 PF09237 GAGA: GAGA factor; I 96.5 0.0019 4.1E-08 41.7 2.1 32 114-145 21-52 (54)
40 PF13909 zf-H2C2_5: C2H2-type 96.5 0.0014 3E-08 36.1 1.1 24 312-336 1-24 (24)
41 PF12874 zf-met: Zinc-finger o 96.2 0.0015 3.3E-08 36.3 0.6 23 312-334 1-23 (25)
42 smart00355 ZnF_C2H2 zinc finge 96.0 0.0036 7.8E-08 34.8 1.3 24 118-141 1-24 (26)
43 COG5189 SFP1 Putative transcri 95.8 0.0037 8E-08 56.7 1.2 69 114-191 346-419 (423)
44 PF12874 zf-met: Zinc-finger o 95.8 0.0063 1.4E-07 33.7 1.7 23 118-140 1-23 (25)
45 PF13909 zf-H2C2_5: C2H2-type 95.5 0.0076 1.7E-07 33.0 1.4 24 118-142 1-24 (24)
46 KOG2231|consensus 95.4 0.024 5.2E-07 57.8 5.4 95 37-142 123-237 (669)
47 PF12171 zf-C2H2_jaz: Zinc-fin 95.2 0.0038 8.2E-08 35.4 -0.6 22 312-333 2-23 (27)
48 PF12171 zf-C2H2_jaz: Zinc-fin 95.0 0.015 3.2E-07 32.9 1.4 22 118-139 2-23 (27)
49 PRK04860 hypothetical protein; 94.9 0.015 3.3E-07 48.8 1.9 36 89-128 119-154 (160)
50 PF13913 zf-C2HC_2: zinc-finge 94.9 0.018 3.9E-07 31.9 1.5 21 312-333 3-23 (25)
51 PRK04860 hypothetical protein; 94.7 0.019 4E-07 48.3 2.1 40 60-103 118-157 (160)
52 KOG2482|consensus 93.9 0.17 3.8E-06 46.7 6.4 51 90-140 280-357 (423)
53 KOG1146|consensus 93.5 0.027 5.9E-07 60.9 0.9 137 3-140 440-641 (1406)
54 PF13913 zf-C2HC_2: zinc-finge 93.3 0.079 1.7E-06 29.3 2.1 20 1-21 4-23 (25)
55 COG5236 Uncharacterized conser 92.6 0.13 2.9E-06 47.4 3.8 78 62-141 152-244 (493)
56 smart00451 ZnF_U1 U1-like zinc 92.3 0.078 1.7E-06 31.9 1.3 24 310-333 2-25 (35)
57 KOG2893|consensus 91.3 0.041 8.8E-07 48.0 -1.0 41 31-80 13-53 (341)
58 COG5048 FOG: Zn-finger [Genera 91.3 0.04 8.7E-07 54.7 -1.2 51 95-145 394-446 (467)
59 COG5048 FOG: Zn-finger [Genera 90.6 0.067 1.5E-06 53.1 -0.3 63 60-122 288-356 (467)
60 KOG1146|consensus 90.4 0.08 1.7E-06 57.5 -0.0 92 31-143 1263-1354(1406)
61 KOG2785|consensus 89.8 1.1 2.4E-05 42.4 6.8 137 1-138 5-241 (390)
62 KOG2482|consensus 89.7 0.88 1.9E-05 42.2 6.0 180 13-192 129-356 (423)
63 smart00451 ZnF_U1 U1-like zinc 89.6 0.26 5.5E-06 29.6 1.8 23 117-139 3-25 (35)
64 COG5236 Uncharacterized conser 89.5 0.52 1.1E-05 43.7 4.4 94 11-114 200-306 (493)
65 PF04838 Baculo_LEF5: Baculovi 87.0 0.19 4.1E-06 40.7 0.0 29 397-425 27-55 (159)
66 KOG2893|consensus 86.9 0.26 5.7E-06 43.1 0.8 42 64-109 13-54 (341)
67 KOG2785|consensus 86.2 1.7 3.6E-05 41.2 5.7 161 27-192 3-242 (390)
68 COG4049 Uncharacterized protei 86.2 0.42 9.2E-06 31.5 1.3 31 22-53 12-42 (65)
69 KOG4173|consensus 85.6 0.33 7.2E-06 41.6 0.8 75 31-113 82-171 (253)
70 cd00350 rubredoxin_like Rubred 84.7 0.51 1.1E-05 28.1 1.1 10 116-125 16-25 (33)
71 COG4049 Uncharacterized protei 82.3 0.52 1.1E-05 31.1 0.4 32 111-142 11-42 (65)
72 COG2888 Predicted Zn-ribbon RN 80.3 1.2 2.7E-05 30.0 1.7 46 256-320 11-59 (61)
73 TIGR00622 ssl1 transcription f 79.4 2.1 4.5E-05 33.5 2.9 23 25-48 13-35 (112)
74 PF09538 FYDLN_acid: Protein o 77.9 0.92 2E-05 35.4 0.7 14 61-74 26-39 (108)
75 PF12013 DUF3505: Protein of u 77.0 2.7 5.8E-05 32.9 3.1 82 60-142 10-109 (109)
76 PRK14890 putative Zn-ribbon RN 76.9 2.5 5.4E-05 28.6 2.4 46 256-320 9-57 (59)
77 TIGR00622 ssl1 transcription f 76.7 3.1 6.6E-05 32.5 3.2 81 59-141 13-105 (112)
78 KOG2186|consensus 76.2 1.7 3.6E-05 38.8 1.8 46 90-138 4-49 (276)
79 PF10571 UPF0547: Uncharacteri 74.9 1.1 2.3E-05 25.1 0.2 9 31-39 17-25 (26)
80 PF09538 FYDLN_acid: Protein o 73.2 2.6 5.6E-05 32.9 2.0 31 61-102 9-39 (108)
81 smart00614 ZnF_BED BED zinc fi 73.0 2.3 5E-05 28.0 1.5 25 311-335 18-48 (50)
82 cd00729 rubredoxin_SM Rubredox 72.4 2.1 4.5E-05 25.7 1.0 10 310-319 17-26 (34)
83 KOG4173|consensus 72.1 1.9 4.1E-05 37.1 1.1 79 58-139 76-168 (253)
84 PF09986 DUF2225: Uncharacteri 70.1 1.8 4E-05 38.5 0.7 43 88-130 4-61 (214)
85 PF02892 zf-BED: BED zinc fing 66.8 3.3 7.2E-05 26.3 1.2 26 308-333 13-42 (45)
86 PHA00626 hypothetical protein 66.7 4 8.7E-05 27.2 1.5 15 310-324 22-36 (59)
87 COG1592 Rubrerythrin [Energy p 66.0 3.6 7.9E-05 34.7 1.6 14 306-319 144-157 (166)
88 KOG2186|consensus 65.7 3.7 8.1E-05 36.6 1.7 45 31-83 6-50 (276)
89 smart00734 ZnF_Rad18 Rad18-lik 64.8 4.9 0.00011 22.4 1.4 20 312-332 2-21 (26)
90 KOG2272|consensus 63.2 3.2 7E-05 36.9 0.8 18 167-184 277-294 (332)
91 TIGR02098 MJ0042_CXXC MJ0042 f 61.2 7.2 0.00016 23.8 1.9 33 90-127 3-35 (38)
92 TIGR02300 FYDLN_acid conserved 61.1 3.8 8.3E-05 32.5 0.8 14 61-74 26-39 (129)
93 PF12013 DUF3505: Protein of u 60.9 5.1 0.00011 31.3 1.5 26 311-336 80-109 (109)
94 PF13719 zinc_ribbon_5: zinc-r 60.3 7.3 0.00016 23.8 1.8 32 91-127 4-35 (37)
95 TIGR00373 conserved hypothetic 58.8 9.1 0.0002 32.2 2.8 34 85-127 105-138 (158)
96 smart00834 CxxC_CXXC_SSSS Puta 54.9 5.2 0.00011 24.8 0.5 12 90-101 6-17 (41)
97 smart00531 TFIIE Transcription 54.9 16 0.00034 30.4 3.6 38 86-127 96-133 (147)
98 KOG2071|consensus 54.7 11 0.00023 38.3 2.9 25 171-195 419-443 (579)
99 smart00659 RPOLCX RNA polymera 54.5 9.2 0.0002 24.4 1.6 11 62-72 3-13 (44)
100 PF13717 zinc_ribbon_4: zinc-r 54.4 12 0.00027 22.6 2.1 32 91-127 4-35 (36)
101 PF02176 zf-TRAF: TRAF-type zi 54.4 15 0.00032 24.9 2.8 27 102-128 23-53 (60)
102 PF09986 DUF2225: Uncharacteri 54.2 7.8 0.00017 34.5 1.7 13 312-324 49-61 (214)
103 PF05443 ROS_MUCR: ROS/MUCR tr 53.2 6.5 0.00014 31.9 0.9 26 310-338 71-96 (132)
104 PRK00464 nrdR transcriptional 53.0 5 0.00011 33.5 0.3 15 118-132 29-43 (154)
105 TIGR02605 CxxC_CxxC_SSSS putat 52.1 6 0.00013 26.1 0.5 11 90-100 6-16 (52)
106 PRK06266 transcription initiat 51.3 14 0.00029 31.9 2.7 34 86-128 114-147 (178)
107 smart00531 TFIIE Transcription 48.4 17 0.00037 30.2 2.7 39 58-100 96-134 (147)
108 PF13878 zf-C2H2_3: zinc-finge 46.9 20 0.00043 22.4 2.2 22 1-22 15-38 (41)
109 KOG4167|consensus 46.4 4.4 9.6E-05 41.7 -1.2 28 310-337 791-818 (907)
110 PF09723 Zn-ribbon_8: Zinc rib 45.9 11 0.00024 23.7 0.9 12 90-101 6-17 (42)
111 KOG1701|consensus 45.1 13 0.00029 35.9 1.8 15 168-182 425-439 (468)
112 PRK00398 rpoP DNA-directed RNA 44.5 11 0.00023 24.2 0.8 12 311-322 21-32 (46)
113 PF07754 DUF1610: Domain of un 42.6 10 0.00022 20.7 0.4 10 310-319 15-24 (24)
114 TIGR00373 conserved hypothetic 42.3 17 0.00036 30.7 1.8 34 58-100 106-139 (158)
115 PF05443 ROS_MUCR: ROS/MUCR tr 40.0 15 0.00033 29.8 1.1 19 31-52 75-93 (132)
116 PRK06266 transcription initiat 40.0 15 0.00033 31.6 1.2 34 58-100 114-147 (178)
117 PF15269 zf-C2H2_7: Zinc-finge 39.6 15 0.00033 23.1 0.9 23 311-333 20-42 (54)
118 PF13240 zinc_ribbon_2: zinc-r 39.5 9 0.00019 20.6 -0.1 7 31-37 16-22 (23)
119 PRK09678 DNA-binding transcrip 38.0 13 0.00027 26.7 0.3 41 90-132 2-44 (72)
120 KOG2593|consensus 37.8 32 0.00069 33.6 3.1 39 85-126 124-162 (436)
121 COG1997 RPL43A Ribosomal prote 37.7 18 0.00039 26.7 1.1 15 309-323 51-65 (89)
122 PF03604 DNA_RNApol_7kD: DNA d 37.6 20 0.00043 21.1 1.1 9 63-71 2-10 (32)
123 PF08274 PhnA_Zn_Ribbon: PhnA 36.8 17 0.00037 21.1 0.7 9 117-125 19-27 (30)
124 PF05290 Baculo_IE-1: Baculovi 36.6 31 0.00067 27.8 2.4 13 172-184 123-135 (140)
125 TIGR01384 TFS_arch transcripti 35.4 23 0.0005 27.3 1.5 11 118-128 17-27 (104)
126 PF14353 CpXC: CpXC protein 35.2 4.3 9.4E-05 32.8 -2.7 21 117-137 38-58 (128)
127 COG0068 HypF Hydrogenase matur 35.1 7 0.00015 40.5 -1.8 56 2-69 126-181 (750)
128 COG1996 RPC10 DNA-directed RNA 33.4 26 0.00057 22.9 1.2 11 61-71 6-16 (49)
129 KOG2593|consensus 33.3 46 0.00099 32.6 3.4 14 60-73 127-140 (436)
130 PF01096 TFIIS_C: Transcriptio 33.3 20 0.00044 22.1 0.7 11 311-321 28-38 (39)
131 KOG1280|consensus 33.1 46 0.001 31.4 3.2 92 256-352 10-116 (381)
132 PF12907 zf-met2: Zinc-binding 32.3 39 0.00085 21.1 1.8 22 31-52 4-28 (40)
133 COG4957 Predicted transcriptio 31.7 27 0.00058 28.1 1.3 20 2-24 79-98 (148)
134 smart00440 ZnF_C2C2 C2C2 Zinc 30.8 19 0.00041 22.4 0.3 12 117-128 28-39 (40)
135 COG5151 SSL1 RNA polymerase II 30.7 33 0.00071 31.8 1.9 48 91-140 364-411 (421)
136 COG5151 SSL1 RNA polymerase II 30.5 79 0.0017 29.4 4.2 28 308-335 385-412 (421)
137 COG4530 Uncharacterized protei 30.0 23 0.00049 27.3 0.6 12 61-72 26-37 (129)
138 PF12760 Zn_Tnp_IS1595: Transp 29.9 67 0.0015 20.5 2.8 8 117-124 37-44 (46)
139 PF05191 ADK_lid: Adenylate ki 29.8 21 0.00046 21.6 0.4 10 91-100 3-12 (36)
140 PF04959 ARS2: Arsenite-resist 29.4 17 0.00036 32.2 -0.2 32 306-337 72-103 (214)
141 PF10013 DUF2256: Uncharacteri 28.6 36 0.00078 21.4 1.2 15 1-15 10-24 (42)
142 KOG2807|consensus 28.5 60 0.0013 30.5 3.1 33 256-288 332-364 (378)
143 COG1592 Rubrerythrin [Energy p 28.4 37 0.0008 28.7 1.7 24 255-278 135-158 (166)
144 smart00154 ZnF_AN1 AN1-like Zi 27.7 26 0.00056 21.6 0.5 14 311-324 12-25 (39)
145 KOG2636|consensus 27.6 36 0.00078 33.3 1.6 30 303-332 393-423 (497)
146 PF07975 C1_4: TFIIH C1-like d 26.9 20 0.00043 23.7 -0.1 21 268-288 20-40 (51)
147 KOG4167|consensus 26.7 17 0.00037 37.7 -0.7 22 2-23 795-816 (907)
148 KOG3408|consensus 26.7 44 0.00095 26.4 1.7 25 24-49 54-78 (129)
149 PF15135 UPF0515: Uncharacteri 26.4 82 0.0018 28.3 3.5 60 59-131 110-169 (278)
150 PF14446 Prok-RING_1: Prokaryo 26.1 54 0.0012 22.0 1.8 24 255-278 6-30 (54)
151 PF05495 zf-CHY: CHY zinc fing 25.2 23 0.00051 25.2 -0.0 59 61-127 10-71 (71)
152 KOG3408|consensus 24.6 37 0.0008 26.8 0.9 27 307-333 53-79 (129)
153 PF08790 zf-LYAR: LYAR-type C2 24.6 27 0.00058 19.9 0.1 20 118-138 1-20 (28)
154 COG1198 PriA Primosomal protei 24.3 53 0.0012 35.0 2.3 14 306-319 457-470 (730)
155 COG1198 PriA Primosomal protei 24.0 63 0.0014 34.5 2.8 12 309-320 473-484 (730)
156 PRK03824 hypA hydrogenase nick 23.2 38 0.00082 27.7 0.8 12 90-101 71-82 (135)
157 PF08271 TF_Zn_Ribbon: TFIIB z 23.0 55 0.0012 20.5 1.4 11 117-127 19-29 (43)
158 PF13451 zf-trcl: Probable zin 22.7 30 0.00066 22.6 0.1 24 309-332 2-25 (49)
159 TIGR01206 lysW lysine biosynth 22.6 43 0.00094 22.4 0.9 31 90-128 3-33 (54)
160 PF04959 ARS2: Arsenite-resist 22.3 37 0.00081 30.1 0.7 31 115-145 75-105 (214)
161 PF06524 NOA36: NOA36 protein; 22.3 45 0.00097 30.1 1.1 28 114-141 206-233 (314)
162 PRK04023 DNA polymerase II lar 21.3 1.5E+02 0.0032 32.7 4.8 61 249-363 621-681 (1121)
163 PF13453 zf-TFIIB: Transcripti 21.1 62 0.0013 20.0 1.3 18 117-134 19-36 (41)
164 PF01428 zf-AN1: AN1-like Zinc 20.9 28 0.0006 21.9 -0.3 15 310-324 12-26 (43)
165 COG4888 Uncharacterized Zn rib 20.2 35 0.00075 26.0 0.0 37 88-128 21-57 (104)
No 1
>KOG1074|consensus
Probab=99.94 E-value=1.3e-27 Score=234.68 Aligned_cols=124 Identities=27% Similarity=0.584 Sum_probs=98.6
Q ss_pred CeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhhhcCCCC----ceecc---ccCcccCChhHH
Q psy8251 61 PWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQR----KYNCT---VCGKAFSRKSFV 133 (434)
Q Consensus 61 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~----~~~C~---~C~k~F~~~~~L 133 (434)
|..|-+|.++..-++.|+.|.++|+||+||+|.+||+.|.++++|+.|+.+|...- .+.|+ +|-+.|...-.|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 35699999999999999999999999999999999999999999999999886543 48899 999999999999
Q ss_pred HhhhccccCCcccccccccccccccccchhhhhhhhceecccCCcccCChhhHHHHhhhccCC
Q psy8251 134 TTHMRVHTGKLTGKFGQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVTTHMRVHTGK 196 (434)
Q Consensus 134 ~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~H~~~ 196 (434)
..|+++|.+.....-.. . . ..... -..|..| .+.|.....+..++..|.+.
T Consensus 685 pQhIriH~~~~~s~g~~--a-~--e~~~~-adq~~~~------qk~~~~a~~f~~~~se~~~~ 735 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGT--A-A--EGILA-ADQCSSC------QKTFSDARSFSQQISEQPSP 735 (958)
T ss_pred cceEEeecCCCCCCCcc--c-c--cccch-hcccchh------hhcccccccchhhhhccCCc
Confidence 99999998543311100 0 0 11111 1126666 99999999999999887543
No 2
>KOG2462|consensus
Probab=99.93 E-value=7.4e-27 Score=203.67 Aligned_cols=131 Identities=30% Similarity=0.641 Sum_probs=123.3
Q ss_pred CCCccccccCCHHHHHHHHHHcCC---CCCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHH
Q psy8251 1 MWPRGFRGFRQVINLLKHRREHGG---NLSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSL 77 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~h~~---~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L 77 (434)
+|+.||+.+.+..+|.+|.++|-. .+.+. |++||+.|.+...|+.|++ .| .-+.+|.+||+.|.....|
T Consensus 132 ~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~-C~~C~K~YvSmpALkMHir-TH------~l~c~C~iCGKaFSRPWLL 203 (279)
T KOG2462|consen 132 KCPECGKSYSTSSNLSRHKQTHRSLDSKKAFS-CKYCGKVYVSMPALKMHIR-TH------TLPCECGICGKAFSRPWLL 203 (279)
T ss_pred eccccccccccccccchhhccccccccccccc-CCCCCceeeehHHHhhHhh-cc------CCCcccccccccccchHHh
Confidence 599999999999999999999953 45666 9999999999999999999 55 3589999999999999999
Q ss_pred HHHHHhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhcc
Q psy8251 78 GIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRV 139 (434)
Q Consensus 78 ~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~ 139 (434)
+-|+|+|+|||||.|+.|++.|.++++|+.|+++|.+.|+|+|..|+|.|...+.|..|...
T Consensus 204 QGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 204 QGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred hcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 99999999999999999999999999999999999999999999999999999999999764
No 3
>KOG2462|consensus
Probab=99.91 E-value=1.8e-25 Score=194.96 Aligned_cols=138 Identities=30% Similarity=0.648 Sum_probs=125.6
Q ss_pred CCCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchH
Q psy8251 25 NLSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKAN 104 (434)
Q Consensus 25 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~ 104 (434)
...|. |++||+.+.+...|.+|.+ .|... ...+.+.|++|++.|.+-..|+.|+++|+ -++.|.+|||.|...+.
T Consensus 128 ~~r~~-c~eCgk~ysT~snLsrHkQ-~H~~~-~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWL 202 (279)
T KOG2462|consen 128 HPRYK-CPECGKSYSTSSNLSRHKQ-THRSL-DSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWL 202 (279)
T ss_pred CCcee-ccccccccccccccchhhc-ccccc-cccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHH
Confidence 44577 9999999999999999998 55322 34678999999999999999999999997 57999999999999999
Q ss_pred HHchhhhcCCCCceeccccCcccCChhHHHhhhccccCCcccccccccccccccccchhhhhhhhceecccCCcccCChh
Q psy8251 105 LLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLTGKFGQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKS 184 (434)
Q Consensus 105 l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~ 184 (434)
|+-|+|+|+|||||.|+.|++.|..+++|+.||++|.+.+. |. |..| +|.|..++
T Consensus 203 LQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~------------------~q-C~~C------~KsFsl~S 257 (279)
T KOG2462|consen 203 LQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKK------------------HQ-CPRC------GKSFALKS 257 (279)
T ss_pred hhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCcc------------------cc-Ccch------hhHHHHHH
Confidence 99999999999999999999999999999999999999988 66 7777 99999999
Q ss_pred hHHHHhhh
Q psy8251 185 FVTTHMRV 192 (434)
Q Consensus 185 ~L~~H~~~ 192 (434)
-|.+|...
T Consensus 258 yLnKH~ES 265 (279)
T KOG2462|consen 258 YLNKHSES 265 (279)
T ss_pred HHHHhhhh
Confidence 99999864
No 4
>KOG1074|consensus
Probab=99.91 E-value=3.2e-25 Score=217.94 Aligned_cols=87 Identities=30% Similarity=0.536 Sum_probs=76.6
Q ss_pred ceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhccccCCcccccccccccccccccchhhhhhh
Q psy8251 89 PYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLTGKFGQKMSALSGTAFAVEYEIIQ 168 (434)
Q Consensus 89 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~ 168 (434)
|-+|.+|.+...-++.|+.|.++|+||+||+|.+||+.|+++.+|+.|+.+|...-+ ++
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~------------------~R--- 663 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPP------------------AR--- 663 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCcc------------------cc---
Confidence 378999999999999999999999999999999999999999999999999987665 22
Q ss_pred hceecc---cCCcccCChhhHHHHhhhccCC
Q psy8251 169 ECLFAK---NGGKAFSRKSFVTTHMRVHTGK 196 (434)
Q Consensus 169 ~C~~C~---~C~~~F~~~~~L~~H~~~H~~~ 196 (434)
.++.|+ +|.+.|.+.-.|..|+++|.+.
T Consensus 664 ~q~ScP~~~ic~~kftn~V~lpQhIriH~~~ 694 (958)
T KOG1074|consen 664 VQFSCPSTFICQKKFTNAVTLPQHIRIHLGG 694 (958)
T ss_pred ccccCCchhhhcccccccccccceEEeecCC
Confidence 234445 5599999999999999999853
No 5
>KOG3608|consensus
Probab=99.89 E-value=1.5e-23 Score=187.19 Aligned_cols=163 Identities=23% Similarity=0.434 Sum_probs=133.8
Q ss_pred CCccccccCCHHHHHHHHHHcCCCCCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHH
Q psy8251 2 WPRGFRGFRQVINLLKHRREHGGNLSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHE 81 (434)
Q Consensus 2 C~~C~~~F~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~ 81 (434)
+..|-+.|.++..|++|+++|++++... |+.||.-|.++..|..|+++.-. ....+|.|..|.+.|.++..|..|+
T Consensus 182 W~~Ct~~~~~k~~LreH~r~Hs~eKvvA-Cp~Cg~~F~~~tkl~DH~rRqt~---l~~n~fqC~~C~KrFaTeklL~~Hv 257 (467)
T KOG3608|consen 182 WAMCTKHMGNKYRLREHIRTHSNEKVVA-CPHCGELFRTKTKLFDHLRRQTE---LNTNSFQCAQCFKRFATEKLLKSHV 257 (467)
T ss_pred chhhhhhhccHHHHHHHHHhcCCCeEEe-cchHHHHhccccHHHHHHHhhhh---hcCCchHHHHHHHHHhHHHHHHHHH
Confidence 3568888999999999999999999988 99999999999999999885542 3456899999999999999999998
Q ss_pred HhccCCCceeccccccccccchHHHchhhh-cCCCCceeccccCcccCChhHHHhhhccccCCccccccccccccccccc
Q psy8251 82 RIHTGQRPYICDWCGVGFRSKANLLQHQPV-HTGQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLTGKFGQKMSALSGTAF 160 (434)
Q Consensus 82 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~ 160 (434)
..|.. -|+|++|+.+....++|..|++. |..++||+|+.|++.|.+.+.|.+|...|. +..
T Consensus 258 ~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~--------------- 319 (467)
T KOG3608|consen 258 VRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTV--------------- 319 (467)
T ss_pred HHhhh--cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccc---------------
Confidence 88864 58899999999999999999874 777889999999999999999999998887 333
Q ss_pred chhhhhhhhceecccCCcccCChhhHHHHhhhcc
Q psy8251 161 AVEYEIIQECLFAKNGGKAFSRKSFVTTHMRVHT 194 (434)
Q Consensus 161 ~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~H~ 194 (434)
|. |.. +.|..+|...-.|++|++.++
T Consensus 320 ---y~-C~h----~~C~~s~r~~~q~~~H~~evh 345 (467)
T KOG3608|consen 320 ---YQ-CEH----PDCHYSVRTYTQMRRHFLEVH 345 (467)
T ss_pred ---ee-cCC----CCCcHHHHHHHHHHHHHHHhc
Confidence 33 544 124888888888999887644
No 6
>KOG3608|consensus
Probab=99.86 E-value=2.3e-22 Score=179.70 Aligned_cols=167 Identities=25% Similarity=0.449 Sum_probs=120.3
Q ss_pred CCccccccCCHHHHHHHHHHcCC------------CCCCCCcc--chhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcC
Q psy8251 2 WPRGFRGFRQVINLLKHRREHGG------------NLSFDNCH--ICKETFYKHSELVRHLEREHNIPRSESSPWCCRFC 67 (434)
Q Consensus 2 C~~C~~~F~~~~~l~~H~~~h~~------------~~~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C 67 (434)
+..|++.|.+...|..|.-.|.. ++|...|. .|-+.|.+++.|++|++ +| ++++...|+.|
T Consensus 139 WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r-~H----s~eKvvACp~C 213 (467)
T KOG3608|consen 139 WEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR-TH----SNEKVVACPHC 213 (467)
T ss_pred hhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH-hc----CCCeEEecchH
Confidence 45788888888888888777641 22333364 47778888888888887 66 67778888888
Q ss_pred CcccCCHHHHHHHHHhc--cCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhcc-ccCCc
Q psy8251 68 NKRMMTKLSLGIHERIH--TGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRV-HTGKL 144 (434)
Q Consensus 68 ~~~f~~~~~L~~H~~~h--~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~-h~~~~ 144 (434)
|..|.++..|-.|++.. ....+|.|..|.|.|.+...|..|++.|.. -|+|+.|+.+...+++|..|++. |...+
T Consensus 214 g~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dk 291 (467)
T KOG3608|consen 214 GELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDK 291 (467)
T ss_pred HHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhccCC
Confidence 88888888888887633 345678888888888888888888877754 37888888888888888888774 55556
Q ss_pred ccccccccccccccccchhhhhhhhceecccCCcccCChhhHHHHhhhccCCCCCCC
Q psy8251 145 TGKFGQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVTTHMRVHTGKLTGKF 201 (434)
Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~C 201 (434)
| +. |+.| ++.|...++|.+|..+|+ +..|.|
T Consensus 292 p------------------fK-Cd~C------d~~c~~esdL~kH~~~HS-~~~y~C 322 (467)
T KOG3608|consen 292 P------------------FK-CDEC------DTRCVRESDLAKHVQVHS-KTVYQC 322 (467)
T ss_pred C------------------cc-ccch------hhhhccHHHHHHHHHhcc-ccceec
Confidence 6 33 4444 888888888888887777 444444
No 7
>KOG3623|consensus
Probab=99.84 E-value=1.5e-21 Score=189.07 Aligned_cols=109 Identities=27% Similarity=0.502 Sum_probs=94.1
Q ss_pred CCCccccccCCHHHHHHHHHHcC--CCCCCCCccchhhhcCCHHHHHHHHHhhcCCCC---------CCCCCeecCcCCc
Q psy8251 1 MWPRGFRGFRQVINLLKHRREHG--GNLSFDNCHICKETFYKHSELVRHLEREHNIPR---------SESSPWCCRFCNK 69 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~h~--~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~---------~~~~~~~C~~C~~ 69 (434)
+|+.|.+.+.....|+.|++..+ .+..|. |..|..+|..+..|.+||. .|..-. ...+.|+|.+||+
T Consensus 212 tcpycdrgykrltslkeHikyrhekne~nfs-C~lCsytFAyRtQLErhm~-~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 212 TCPYCDRGYKRLTSLKEHIKYRHEKNEPNFS-CMLCSYTFAYRTQLERHMQ-LHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHhhCCCCCc-chhhhhhhhhHHHHHHHHH-hhcCCCcccccccchhhhccccccccch
Confidence 59999999999999999998643 345576 9999999999999999998 553111 1235699999999
Q ss_pred ccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhhh
Q psy8251 70 RMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPV 111 (434)
Q Consensus 70 ~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 111 (434)
+|+.+..|+.|+|+|.|+|||.|+.|+|.|+..+++..|+..
T Consensus 290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 999999999999999999999999999999999999999764
No 8
>KOG3623|consensus
Probab=99.78 E-value=1.2e-19 Score=176.00 Aligned_cols=107 Identities=25% Similarity=0.628 Sum_probs=97.6
Q ss_pred CccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCC-------------Cceeccccc
Q psy8251 30 NCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQ-------------RPYICDWCG 96 (434)
Q Consensus 30 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~-------------~~~~C~~C~ 96 (434)
.|++|...+.....|+.|++..|. ..+..|.|..|..+|.....|.+|+.+|... +.|+|.+||
T Consensus 212 tcpycdrgykrltslkeHikyrhe---kne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECg 288 (1007)
T KOG3623|consen 212 TCPYCDRGYKRLTSLKEHIKYRHE---KNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECG 288 (1007)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHh---hCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccc
Confidence 499999999999999999997774 4466799999999999999999999998532 459999999
Q ss_pred cccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhcc
Q psy8251 97 VGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRV 139 (434)
Q Consensus 97 ~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~ 139 (434)
|.|..+-.|+.|+|+|.|+|||.|+.|+|.|....++..||..
T Consensus 289 KAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 289 KAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 9999999999999999999999999999999999999999853
No 9
>KOG3576|consensus
Probab=99.57 E-value=5.7e-16 Score=128.66 Aligned_cols=125 Identities=24% Similarity=0.545 Sum_probs=103.5
Q ss_pred CCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhh
Q psy8251 58 ESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHM 137 (434)
Q Consensus 58 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~ 137 (434)
+...|.|.+|++.|.....|.+|++.|...+.|.|..||+.|.+.-.|++|.++|+|.+||+|..|++.|+++-+|..|+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 34569999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCcc-cccccccccccccccchhhhhhhhceecccCCcccCChhhHHHHhhhccCCC
Q psy8251 138 RVHTGKLT-GKFGQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVTTHMRVHTGKL 197 (434)
Q Consensus 138 ~~h~~~~~-~~~~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~H~~~~ 197 (434)
+.-+|... +.+..+ =..=+.|+.||..-.....+..|++.|+...
T Consensus 194 ~kvhgv~~~yayker---------------r~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKER---------------RAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHcCchHHHHHHHh---------------hhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 86666422 111000 0001445555999999999999999887643
No 10
>KOG3576|consensus
Probab=99.57 E-value=8.3e-16 Score=127.69 Aligned_cols=116 Identities=32% Similarity=0.607 Sum_probs=103.0
Q ss_pred CCCCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccch
Q psy8251 24 GNLSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKA 103 (434)
Q Consensus 24 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~ 103 (434)
+...+. |.+|++.|.-...|.+|++ -| ...+.|-|..||+.|...-+|++|+++|+|.+||+|..|++.|..+-
T Consensus 114 d~d~ft-CrvCgK~F~lQRmlnrh~k-ch----~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrc 187 (267)
T KOG3576|consen 114 DQDSFT-CRVCGKKFGLQRMLNRHLK-CH----SDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRC 187 (267)
T ss_pred CCCeee-eehhhhhhhHHHHHHHHhh-hc----cHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhc
Confidence 345676 9999999999999999998 66 56778999999999999999999999999999999999999999999
Q ss_pred HHHchhhhcCC-----------CCceeccccCcccCChhHHHhhhccccCCcc
Q psy8251 104 NLLQHQPVHTG-----------QRKYNCTVCGKAFSRKSFVTTHMRVHTGKLT 145 (434)
Q Consensus 104 ~l~~H~~~h~~-----------~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~ 145 (434)
+|..|++.-+| ++.|.|..||.+-.....+..|+..|+..-+
T Consensus 188 sleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 188 SLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred cHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 99999875444 3569999999999999999999999987544
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.00 E-value=2.9e-10 Score=112.36 Aligned_cols=100 Identities=23% Similarity=0.506 Sum_probs=76.0
Q ss_pred CCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCCCceecccccccccc------
Q psy8251 28 FDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRS------ 101 (434)
Q Consensus 28 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~------ 101 (434)
+. |+.|++.|. ...|..|+..+| .++.|+ ||+.+ .+..|..|+.+|.+.+++.|+.|++.|..
T Consensus 454 ~~-C~~Cgk~f~-~s~LekH~~~~H-------kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d 522 (567)
T PLN03086 454 VH-CEKCGQAFQ-QGEMEKHMKVFH-------EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMD 522 (567)
T ss_pred cc-CCCCCCccc-hHHHHHHHHhcC-------CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccc
Confidence 45 888888885 577888887443 578888 88755 56788889888888888999999888841
Q ss_pred ----chHHHchhhhcCCCCceeccccCcccCChhHHHhhhccc
Q psy8251 102 ----KANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 102 ----~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
...|..|...+ |.+++.|..||+.+..+ .|..|+..-
T Consensus 523 ~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~ 563 (567)
T PLN03086 523 VRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIAV 563 (567)
T ss_pred hhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHHh
Confidence 34788887775 78889999998888754 566676543
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.98 E-value=1.2e-09 Score=107.99 Aligned_cols=142 Identities=19% Similarity=0.317 Sum_probs=107.2
Q ss_pred CCCccccccCCHHHHHHHHHHcCCCCCCCCccc--hhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHH
Q psy8251 1 MWPRGFRGFRQVINLLKHRREHGGNLSFDNCHI--CKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLG 78 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~h~~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~ 78 (434)
.|+.|..... ...|..|...... .... |+. ||..|.. ..+ +..+.|+.|++.|. ...|.
T Consensus 409 ~C~NC~~~i~-l~~l~lHe~~C~r-~~V~-Cp~~~Cg~v~~r-~el--------------~~H~~C~~Cgk~f~-~s~Le 469 (567)
T PLN03086 409 ECRNCKHYIP-SRSIALHEAYCSR-HNVV-CPHDGCGIVLRV-EEA--------------KNHVHCEKCGQAFQ-QGEME 469 (567)
T ss_pred ECCCCCCccc-hhHHHHHHhhCCC-ccee-CCcccccceeec-ccc--------------ccCccCCCCCCccc-hHHHH
Confidence 3778887654 4567788866543 3344 874 9988732 222 33468999999996 68899
Q ss_pred HHHHhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCC----------hhHHHhhhccccCCccccc
Q psy8251 79 IHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSR----------KSFVTTHMRVHTGKLTGKF 148 (434)
Q Consensus 79 ~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~----------~~~L~~H~~~h~~~~~~~~ 148 (434)
.|+.+++ +++.|+ ||+.+ .+..|..|+.+|.+.+++.|..|++.|.. ...|..|.... |.++
T Consensus 470 kH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt--- 541 (567)
T PLN03086 470 KHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRT--- 541 (567)
T ss_pred HHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcc---
Confidence 9999985 789999 99765 66899999999999999999999999952 35799998885 7777
Q ss_pred ccccccccccccchhhhhhhhceecccCCcccCChhhHHHHhhh
Q psy8251 149 GQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVTTHMRV 192 (434)
Q Consensus 149 ~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~ 192 (434)
+. |..| |+.+..+ +|..|+..
T Consensus 542 ---------------~~-C~~C------gk~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 542 ---------------AP-CDSC------GRSVMLK-EMDIHQIA 562 (567)
T ss_pred ---------------eE-cccc------CCeeeeh-hHHHHHHH
Confidence 44 7666 9888654 56777654
No 13
>PHA00733 hypothetical protein
Probab=98.90 E-value=8.1e-10 Score=89.23 Aligned_cols=83 Identities=22% Similarity=0.341 Sum_probs=63.5
Q ss_pred CCCCeecCcCCcccCCHHHHHHH--H---HhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhH
Q psy8251 58 ESSPWCCRFCNKRMMTKLSLGIH--E---RIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSF 132 (434)
Q Consensus 58 ~~~~~~C~~C~~~f~~~~~L~~H--~---~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~ 132 (434)
.++++.|.+|.+.|.....|..+ + ..+.++++|.|+.|++.|.+...|..|++.+ +.+|.|+.|++.|.....
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHH
Confidence 46778888888777776666554 1 1234477888888888888888888888876 356899999999999999
Q ss_pred HHhhhccccC
Q psy8251 133 VTTHMRVHTG 142 (434)
Q Consensus 133 L~~H~~~h~~ 142 (434)
|..|+...|+
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 9999887665
No 14
>PHA00733 hypothetical protein
Probab=98.85 E-value=2.8e-09 Score=86.11 Aligned_cols=97 Identities=21% Similarity=0.328 Sum_probs=75.3
Q ss_pred HHHHHHHHHcCCCCCCCCccchhhhcCCHHHHHHH--HHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCCCce
Q psy8251 13 INLLKHRREHGGNLSFDNCHICKETFYKHSELVRH--LEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPY 90 (434)
Q Consensus 13 ~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H--~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~ 90 (434)
..|..+-..-...+++. |.+|.+.|.+...|..| +. .+ ....+..+|.|+.|++.|.+...|..|++.+ +.+|
T Consensus 26 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~l~-~~-~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~ 100 (128)
T PHA00733 26 EELKRYHSLTPEQKRLI-RAVVKTLIYNPQLLDESSYLY-KL-LTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSK 100 (128)
T ss_pred HHhhhhhcCChhhhhHH-HHHHhhhccChhhhcchHHHH-hh-cccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCc
Confidence 33444333333456787 99999999998888877 33 22 1224578999999999999999999999976 3579
Q ss_pred eccccccccccchHHHchhhhcCC
Q psy8251 91 ICDWCGVGFRSKANLLQHQPVHTG 114 (434)
Q Consensus 91 ~C~~C~~~f~~~~~l~~H~~~h~~ 114 (434)
.|+.|++.|.....|..|+...++
T Consensus 101 ~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 101 VCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred cCCCCCCccCCHHHHHHHHHHhcC
Confidence 999999999999999999886654
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.60 E-value=1.2e-08 Score=67.53 Aligned_cols=43 Identities=16% Similarity=0.230 Sum_probs=25.0
Q ss_pred CeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHH
Q psy8251 61 PWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANL 105 (434)
Q Consensus 61 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l 105 (434)
.|.|+.||+.|...+.|..|+++|+ ++|+|..|++.|...+.|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 3556666666666666666666665 455666666655544443
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.55 E-value=3.6e-08 Score=65.35 Aligned_cols=42 Identities=19% Similarity=0.418 Sum_probs=22.4
Q ss_pred eeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHH
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFV 133 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L 133 (434)
|.|+.||+.|...++|..|+++|+ ++|+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 455555555555555555555555 345555555555544443
No 17
>KOG3993|consensus
Probab=98.45 E-value=4.3e-08 Score=91.04 Aligned_cols=140 Identities=17% Similarity=0.261 Sum_probs=93.4
Q ss_pred CCCccccccCCHHHHHHHHHHcCCCCCCCCccchhhhcCCHHHHHHHHHhhcCCCC------------------------
Q psy8251 1 MWPRGFRGFRQVINLLKHRREHGGNLSFDNCHICKETFYKHSELVRHLEREHNIPR------------------------ 56 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~------------------------ 56 (434)
+|..|...|.+.-.|.+|.-..--...|. |++|+|.|+-..+|..|.+ .|....
T Consensus 269 iCqLCK~kYeD~F~LAQHrC~RIV~vEYr-CPEC~KVFsCPANLASHRR-WHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 269 ICQLCKEKYEDAFALAQHRCPRIVHVEYR-CPECDKVFSCPANLASHRR-WHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHhhhhHHHHhhccCCeeEEeeec-CCcccccccCchhhhhhhc-ccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 47778888888888888864332234577 8888888888888888877 552110
Q ss_pred -----CCCCCeecCcCCcccCCHHHHHHHHHhccCCC-------------------------------------------
Q psy8251 57 -----SESSPWCCRFCNKRMMTKLSLGIHERIHTGQR------------------------------------------- 88 (434)
Q Consensus 57 -----~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~------------------------------------------- 88 (434)
..+..|.|.+|++.|.+...|+.|+.+|....
T Consensus 347 rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~ 426 (500)
T KOG3993|consen 347 RSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAG 426 (500)
T ss_pred ccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeec
Confidence 12235888888888888888888877663210
Q ss_pred ---ceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhccccC
Q psy8251 89 ---PYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 89 ---~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~ 142 (434)
...|+.|+..+.++..-..+.+.-..+..|.|.+|...|.+...|.+|+...|.
T Consensus 427 sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 427 SAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred cccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 022334444444433333333333334468999999999999999999886654
No 18
>KOG3993|consensus
Probab=98.43 E-value=1.5e-07 Score=87.58 Aligned_cols=101 Identities=26% Similarity=0.485 Sum_probs=69.7
Q ss_pred ceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhccccCCcccc-ccccccc-------------
Q psy8251 89 PYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLTGK-FGQKMSA------------- 154 (434)
Q Consensus 89 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~-~~~~~~~------------- 154 (434)
.|.|.+|...|.+.-.|.+|.-.-.-...|+|+.|+|+|.-..+|..|.|.|....... .......
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 39999999999999999999643222234999999999999999999999996432111 0000000
Q ss_pred cccc-ccchhhhhhhhceecccCCcccCChhhHHHHhhhccCC
Q psy8251 155 LSGT-AFAVEYEIIQECLFAKNGGKAFSRKSFVTTHMRVHTGK 196 (434)
Q Consensus 155 ~~~~-~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~H~~~H~~~ 196 (434)
.++. .....|. |..| ++.|.....|+.|+.+|+..
T Consensus 347 rsg~dss~gi~~-C~~C------~KkFrRqAYLrKHqlthq~~ 382 (500)
T KOG3993|consen 347 RSGDDSSSGIFS-CHTC------GKKFRRQAYLRKHQLTHQRA 382 (500)
T ss_pred ccCCcccCceee-cHHh------hhhhHHHHHHHHhHHhhhcc
Confidence 0000 1111244 6666 99999999999999888753
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.36 E-value=2e-07 Score=52.65 Aligned_cols=24 Identities=50% Similarity=1.068 Sum_probs=12.4
Q ss_pred HHchhhhcCCCCceeccccCcccC
Q psy8251 105 LLQHQPVHTGQRKYNCTVCGKAFS 128 (434)
Q Consensus 105 l~~H~~~h~~~~~~~C~~C~k~F~ 128 (434)
|..|+++|++++||.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 445555555555555555555553
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.29 E-value=5.1e-07 Score=50.98 Aligned_cols=25 Identities=48% Similarity=1.077 Sum_probs=19.3
Q ss_pred HHHHHHHhccCCCceeccccccccc
Q psy8251 76 SLGIHERIHTGQRPYICDWCGVGFR 100 (434)
Q Consensus 76 ~L~~H~~~h~~~~~~~C~~C~~~f~ 100 (434)
+|.+|+++|++++||.|+.|++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4677888888888888888887775
No 21
>PHA00616 hypothetical protein
Probab=98.18 E-value=6.2e-07 Score=56.48 Aligned_cols=31 Identities=19% Similarity=0.388 Sum_probs=28.2
Q ss_pred cccCCccccccCCchhHHHHHHhhccCCCCC
Q psy8251 311 PYDCDVCGAKFTQVGDMRRHRKKHDENGGSA 341 (434)
Q Consensus 311 ~f~C~~Cgk~F~~~~~L~~H~r~H~~~~~~~ 341 (434)
||.|+.||+.|.++++|.+|++.|+|++++.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~ 31 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLT 31 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccc
Confidence 7999999999999999999999999996443
No 22
>PHA00616 hypothetical protein
Probab=97.97 E-value=4e-06 Score=52.89 Aligned_cols=32 Identities=16% Similarity=0.116 Sum_probs=16.3
Q ss_pred eecCcCCcccCCHHHHHHHHHhccCCCceecc
Q psy8251 62 WCCRFCNKRMMTKLSLGIHERIHTGQRPYICD 93 (434)
Q Consensus 62 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~ 93 (434)
|+|..||+.|..+++|..|++.|++++++.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 44555555555555555555555555554443
No 23
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.90 E-value=3.9e-06 Score=46.00 Aligned_cols=23 Identities=35% Similarity=1.093 Sum_probs=21.8
Q ss_pred ccCCccccccCCchhHHHHHHhh
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKKH 334 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~H 334 (434)
|.|+.||+.|.+...|.+|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999985
No 24
>PHA00732 hypothetical protein
Probab=97.85 E-value=1.2e-05 Score=58.80 Aligned_cols=18 Identities=33% Similarity=0.447 Sum_probs=7.4
Q ss_pred ccchhhhcCCHHHHHHHH
Q psy8251 31 CHICKETFYKHSELVRHL 48 (434)
Q Consensus 31 C~~C~~~f~~~~~L~~H~ 48 (434)
|+.||+.|.+...|+.|+
T Consensus 4 C~~Cgk~F~s~s~Lk~H~ 21 (79)
T PHA00732 4 CPICGFTTVTLFALKQHA 21 (79)
T ss_pred CCCCCCccCCHHHHHHHh
Confidence 444444444444444443
No 25
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.81 E-value=3.7e-05 Score=52.10 Aligned_cols=51 Identities=35% Similarity=0.573 Sum_probs=34.6
Q ss_pred CCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhcc
Q psy8251 28 FDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHT 85 (434)
Q Consensus 28 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 85 (434)
|. |+.|++ ..+...|..|+...|. .+.+.+.|++|...+. .+|..|+..++
T Consensus 3 f~-CP~C~~-~~~~~~L~~H~~~~H~---~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FT-CPYCGK-GFSESSLVEHCEDEHR---SESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cC-CCCCCC-ccCHHHHHHHHHhHCc---CCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 55 888888 4556778888777774 2334677888877544 37777777654
No 26
>PHA00732 hypothetical protein
Probab=97.63 E-value=3.7e-05 Score=56.26 Aligned_cols=43 Identities=30% Similarity=0.586 Sum_probs=21.2
Q ss_pred eeccccccccccchHHHchhhh-cCCCCceeccccCcccCChhHHHhhhc
Q psy8251 90 YICDWCGVGFRSKANLLQHQPV-HTGQRKYNCTVCGKAFSRKSFVTTHMR 138 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~k~F~~~~~L~~H~~ 138 (434)
|.|..|++.|.+...|..|++. |. ++.|+.|++.|. .|..|++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~ 45 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFY 45 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhc
Confidence 4555555555555555555543 32 134555555554 2444443
No 27
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.60 E-value=2.9e-05 Score=42.84 Aligned_cols=24 Identities=29% Similarity=0.952 Sum_probs=20.6
Q ss_pred ccCCccccccCCchhHHHHHHhhc
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKKHD 335 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~H~ 335 (434)
|.|++|++.|.+...|+.|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999999875
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.54 E-value=4.3e-05 Score=41.73 Aligned_cols=23 Identities=57% Similarity=1.125 Sum_probs=17.5
Q ss_pred eeccccCcccCChhHHHhhhccc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
|.|+.|++.|.++..|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 67788888888888888887653
No 29
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.50 E-value=3.1e-05 Score=44.20 Aligned_cols=26 Identities=35% Similarity=1.031 Sum_probs=24.2
Q ss_pred cccCCccccccCCchhHHHHHHhhcc
Q psy8251 311 PYDCDVCGAKFTQVGDMRRHRKKHDE 336 (434)
Q Consensus 311 ~f~C~~Cgk~F~~~~~L~~H~r~H~~ 336 (434)
||.|+.|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68999999999999999999998863
No 30
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.40 E-value=0.00018 Score=48.73 Aligned_cols=47 Identities=23% Similarity=0.508 Sum_probs=22.4
Q ss_pred eecCcCCcccCCHHHHHHHHHhcc-CC-CceeccccccccccchHHHchhhh
Q psy8251 62 WCCRFCNKRMMTKLSLGIHERIHT-GQ-RPYICDWCGVGFRSKANLLQHQPV 111 (434)
Q Consensus 62 ~~C~~C~~~f~~~~~L~~H~~~h~-~~-~~~~C~~C~~~f~~~~~l~~H~~~ 111 (434)
|.|++|++ ..+...|..|....+ .+ +.+.|++|...+. .+|..|+..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~ 51 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNS 51 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHH
Confidence 55666666 334455666654322 21 3455555554332 244445443
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.18 E-value=0.00027 Score=40.25 Aligned_cols=26 Identities=38% Similarity=0.761 Sum_probs=20.8
Q ss_pred ceeccccCcccCChhHHHhhhccccC
Q psy8251 117 KYNCTVCGKAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 117 ~~~C~~C~k~F~~~~~L~~H~~~h~~ 142 (434)
||.|..|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 57888888888888888888877753
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.16 E-value=0.00024 Score=39.03 Aligned_cols=24 Identities=46% Similarity=1.017 Sum_probs=16.4
Q ss_pred eeccccCcccCChhHHHhhhcccc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRVHT 141 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~h~ 141 (434)
|.|++|++.|.+...|..|++.|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 577778888888888888777653
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.05 E-value=0.00049 Score=53.24 Aligned_cols=24 Identities=29% Similarity=0.639 Sum_probs=13.7
Q ss_pred CeecCcCCcccCCHHHHHHHHHhc
Q psy8251 61 PWCCRFCNKRMMTKLSLGIHERIH 84 (434)
Q Consensus 61 ~~~C~~C~~~f~~~~~L~~H~~~h 84 (434)
.+.|..|++.|.+...|..|++.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 456666666666666666666543
No 34
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.02 E-value=0.00048 Score=38.49 Aligned_cols=24 Identities=29% Similarity=0.800 Sum_probs=22.4
Q ss_pred ccCCccccccCCchhHHHHHHhhc
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKKHD 335 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~H~ 335 (434)
|.|+.|+++|.+...|..|+++|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 689999999999999999999875
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.01 E-value=0.00037 Score=53.91 Aligned_cols=23 Identities=30% Similarity=0.791 Sum_probs=15.3
Q ss_pred ceeccccCcccCChhHHHhhhcc
Q psy8251 117 KYNCTVCGKAFSRKSFVTTHMRV 139 (434)
Q Consensus 117 ~~~C~~C~k~F~~~~~L~~H~~~ 139 (434)
.+.|..|++.|.+...|..|++.
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHH
T ss_pred CCCCCccCCCCcCHHHHHHHHcC
Confidence 46777777777777777777764
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.87 E-value=0.00036 Score=63.02 Aligned_cols=70 Identities=21% Similarity=0.412 Sum_probs=41.5
Q ss_pred CCCeecCc--CCcccCCHHHHHHHHHh-ccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHh
Q psy8251 59 SSPWCCRF--CNKRMMTKLSLGIHERI-HTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTT 135 (434)
Q Consensus 59 ~~~~~C~~--C~~~f~~~~~L~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~ 135 (434)
++||+|++ |.+.++....|+-|+.. |-..+... ...-..|.-.-...|||.|++|+|.|.....|+-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~----------~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHE----------NPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCC----------CCCccccccccccCCceeccccchhhccCcccee
Confidence 46777766 77777777777777542 11111100 0011111122345689999999999999999888
Q ss_pred hhc
Q psy8251 136 HMR 138 (434)
Q Consensus 136 H~~ 138 (434)
|+.
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 864
No 37
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.71 E-value=0.0012 Score=42.57 Aligned_cols=33 Identities=27% Similarity=0.596 Sum_probs=25.0
Q ss_pred CCCCCcccCCccccccCCchhHHHHHHhhccCC
Q psy8251 306 HHCDRPYDCDVCGAKFTQVGDMRRHRKKHDENG 338 (434)
Q Consensus 306 H~~~k~f~C~~Cgk~F~~~~~L~~H~r~H~~~~ 338 (434)
+..+.|-.|++|+..+.+..+|++|+.++++.+
T Consensus 19 ~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 19 SQSEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp CTTS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred hccCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 556789999999999999999999999988774
No 38
>KOG2231|consensus
Probab=96.51 E-value=0.0052 Score=62.46 Aligned_cols=127 Identities=24% Similarity=0.385 Sum_probs=78.1
Q ss_pred CCCccccccC---------------CHHHHHHHHHHcCCCCCCCCccchh---------hhcCCHHHHHHHHHhhcCCCC
Q psy8251 1 MWPRGFRGFR---------------QVINLLKHRREHGGNLSFDNCHICK---------ETFYKHSELVRHLEREHNIPR 56 (434)
Q Consensus 1 ~C~~C~~~F~---------------~~~~l~~H~~~h~~~~~~~~C~~C~---------~~f~~~~~L~~H~~~~H~~~~ 56 (434)
+|.+|++.|. +...|+.|+...++ .+. |..|- ....+...|..|+..--....
T Consensus 101 ~C~~C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H~--~~~-c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~ 177 (669)
T KOG2231|consen 101 SCHICDRRFRALYNKKECLHCTEFKSVENLKNHMRDQHK--LHL-CSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDE 177 (669)
T ss_pred hcCccccchhhhcccCCCccccchhHHHHHHHHHHHhhh--hhc-cccccccceeeeeeeehehHHHHHHHHhcCCCccc
Confidence 4777777773 78899999954332 233 55442 223456677788762221011
Q ss_pred CCCCCeecCcCCcccCCHHHHHHHHHhccCCCceecccc------ccccccchHHHchhhhcCCCCceecc--ccCc-cc
Q psy8251 57 SESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWC------GVGFRSKANLLQHQPVHTGQRKYNCT--VCGK-AF 127 (434)
Q Consensus 57 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C------~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~k-~F 127 (434)
.-..--.|..|...|-....|.+|++.++ |.|..| +..|.....|..|.+.+| |.|. .|.- .|
T Consensus 178 s~rGhp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f 249 (669)
T KOG2231|consen 178 SCRGHPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKF 249 (669)
T ss_pred cccCCccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----cccccccccccee
Confidence 11122469999999999999999988765 566666 356777888888888655 7776 5542 33
Q ss_pred CChhHHHhhhc
Q psy8251 128 SRKSFVTTHMR 138 (434)
Q Consensus 128 ~~~~~L~~H~~ 138 (434)
.....+..|++
T Consensus 250 ~~~~~~ei~lk 260 (669)
T KOG2231|consen 250 YVAFELEIELK 260 (669)
T ss_pred eehhHHHHHHH
Confidence 33334444444
No 39
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.51 E-value=0.0019 Score=41.72 Aligned_cols=32 Identities=16% Similarity=0.448 Sum_probs=23.1
Q ss_pred CCCceeccccCcccCChhHHHhhhccccCCcc
Q psy8251 114 GQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLT 145 (434)
Q Consensus 114 ~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~ 145 (434)
.+.|-.|++|+..+++..+|++|+.+.|+.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 46788999999999999999999998888776
No 40
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.46 E-value=0.0014 Score=36.08 Aligned_cols=24 Identities=29% Similarity=0.735 Sum_probs=19.9
Q ss_pred ccCCccccccCCchhHHHHHHhhcc
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKKHDE 336 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~H~~ 336 (434)
|.|+.|+.... ...|.+|+++|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 78999999998 9999999999764
No 41
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.25 E-value=0.0015 Score=36.26 Aligned_cols=23 Identities=26% Similarity=0.827 Sum_probs=21.3
Q ss_pred ccCCccccccCCchhHHHHHHhh
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKKH 334 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~H 334 (434)
|.|++|++.|.+...|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 68999999999999999999865
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.01 E-value=0.0036 Score=34.75 Aligned_cols=24 Identities=50% Similarity=0.949 Sum_probs=18.3
Q ss_pred eeccccCcccCChhHHHhhhcccc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRVHT 141 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~h~ 141 (434)
|.|+.|++.|.+...|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 567888888888888888877553
No 43
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.82 E-value=0.0037 Score=56.69 Aligned_cols=69 Identities=20% Similarity=0.374 Sum_probs=43.9
Q ss_pred CCCceeccc--cCcccCChhHHHhhhcccc-CCcccc--cccccccccccccchhhhhhhhceecccCCcccCChhhHHH
Q psy8251 114 GQRKYNCTV--CGKAFSRKSFVTTHMRVHT-GKLTGK--FGQKMSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVTT 188 (434)
Q Consensus 114 ~~~~~~C~~--C~k~F~~~~~L~~H~~~h~-~~~~~~--~~~~~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~~ 188 (434)
+++||+|++ |+|.|++...|+-|+.--| ..+..+ ...++..+ .+....|. |+.| ++.+.+...|..
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F--~~~~KPYr-CevC------~KRYKNlNGLKY 416 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF--SAKDKPYR-CEVC------DKRYKNLNGLKY 416 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc--cccCCcee-cccc------chhhccCcccee
Confidence 358999986 9999999999999987432 222111 11111111 11222244 5555 999999999999
Q ss_pred Hhh
Q psy8251 189 HMR 191 (434)
Q Consensus 189 H~~ 191 (434)
|..
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 875
No 44
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.79 E-value=0.0063 Score=33.71 Aligned_cols=23 Identities=30% Similarity=0.833 Sum_probs=18.2
Q ss_pred eeccccCcccCChhHHHhhhccc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67888888888888888887653
No 45
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.53 E-value=0.0076 Score=33.03 Aligned_cols=24 Identities=33% Similarity=0.649 Sum_probs=16.1
Q ss_pred eeccccCcccCChhHHHhhhccccC
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~h~~ 142 (434)
|+|+.|+.... +..|.+|++.+++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 67788887777 7778888777653
No 46
>KOG2231|consensus
Probab=95.40 E-value=0.024 Score=57.77 Aligned_cols=95 Identities=19% Similarity=0.420 Sum_probs=64.0
Q ss_pred hcCCHHHHHHHHHhhcCCCCCCCCCeecCcCC---------cccCCHHHHHHHHHhccC-CC----ceeccccccccccc
Q psy8251 37 TFYKHSELVRHLEREHNIPRSESSPWCCRFCN---------KRMMTKLSLGIHERIHTG-QR----PYICDWCGVGFRSK 102 (434)
Q Consensus 37 ~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~---------~~f~~~~~L~~H~~~h~~-~~----~~~C~~C~~~f~~~ 102 (434)
.|.+...|+.|+...|. .+.|..|- ...-+...|..|++.-.. ++ .-.|..|...|...
T Consensus 123 ~~~s~~~Lk~H~~~~H~-------~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~ 195 (669)
T KOG2231|consen 123 EFKSVENLKNHMRDQHK-------LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDD 195 (669)
T ss_pred chhHHHHHHHHHHHhhh-------hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccH
Confidence 33478899999987773 33344433 222345677888764322 11 24688899999999
Q ss_pred hHHHchhhhcCCCCceeccccC------cccCChhHHHhhhccccC
Q psy8251 103 ANLLQHQPVHTGQRKYNCTVCG------KAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 103 ~~l~~H~~~h~~~~~~~C~~C~------k~F~~~~~L~~H~~~h~~ 142 (434)
..|.+|++.++ |.|..|+ ..|.....|..|.+.+|-
T Consensus 196 ~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~Hf 237 (669)
T KOG2231|consen 196 DELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHF 237 (669)
T ss_pred HHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcCc
Confidence 99999988665 6666664 457778889999887763
No 47
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.18 E-value=0.0038 Score=35.43 Aligned_cols=22 Identities=27% Similarity=0.764 Sum_probs=20.7
Q ss_pred ccCCccccccCCchhHHHHHHh
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKK 333 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~ 333 (434)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999875
No 48
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.97 E-value=0.015 Score=32.94 Aligned_cols=22 Identities=32% Similarity=0.762 Sum_probs=17.5
Q ss_pred eeccccCcccCChhHHHhhhcc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMRV 139 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~~ 139 (434)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6788888888888888888764
No 49
>PRK04860 hypothetical protein; Provisional
Probab=94.88 E-value=0.015 Score=48.82 Aligned_cols=36 Identities=31% Similarity=0.802 Sum_probs=21.1
Q ss_pred ceeccccccccccchHHHchhhhcCCCCceeccccCcccC
Q psy8251 89 PYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFS 128 (434)
Q Consensus 89 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~ 128 (434)
+|.|. |+. ....+.+|.++|.++++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 46665 655 4445556666666666666666666554
No 50
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=94.85 E-value=0.018 Score=31.93 Aligned_cols=21 Identities=33% Similarity=0.823 Sum_probs=18.6
Q ss_pred ccCCccccccCCchhHHHHHHh
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRKK 333 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r~ 333 (434)
..|+.||++| ....|.+|+++
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4799999999 88999999865
No 51
>PRK04860 hypothetical protein; Provisional
Probab=94.73 E-value=0.019 Score=48.31 Aligned_cols=40 Identities=30% Similarity=0.740 Sum_probs=33.6
Q ss_pred CCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccch
Q psy8251 60 SPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKA 103 (434)
Q Consensus 60 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~ 103 (434)
-+|.|. |+. ....+.+|.++|.++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 469998 987 5678899999999999999999998886543
No 52
>KOG2482|consensus
Probab=93.87 E-value=0.17 Score=46.69 Aligned_cols=51 Identities=20% Similarity=0.439 Sum_probs=37.3
Q ss_pred eeccccccccccchHHHchhhhcCC---------------------------CCceeccccCcccCChhHHHhhhccc
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTG---------------------------QRKYNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------~~~~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
..|-.|.....+...|..|+..-+. ...-.|-.|...|..+..|..||..+
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 4788888888888888888875321 01235788888999999999998743
No 53
>KOG1146|consensus
Probab=93.52 E-value=0.027 Score=60.87 Aligned_cols=137 Identities=18% Similarity=0.228 Sum_probs=88.5
Q ss_pred CccccccCCHHHHHHHHHH-cCCCCCCCCccchhhhcCCHHHHHHHHHhhcCCCC--------------------CCCCC
Q psy8251 3 PRGFRGFRQVINLLKHRRE-HGGNLSFDNCHICKETFYKHSELVRHLEREHNIPR--------------------SESSP 61 (434)
Q Consensus 3 ~~C~~~F~~~~~l~~H~~~-h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~--------------------~~~~~ 61 (434)
..|+..+.....+..|+.. +.-.+.+. |+.|+..|.....|-.|++..|.... .+.++
T Consensus 440 ~~~e~~~~s~r~~~~~t~~L~S~~kt~~-cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p 518 (1406)
T KOG1146|consen 440 TKAEPLLESKRSLEGQTVVLHSFFKTLK-CPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP 518 (1406)
T ss_pred cchhhhhhhhcccccceeeeeccccccc-CCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence 3445555555555555443 44456677 99999999999999999886552111 12457
Q ss_pred eecCcCCcccCCHHHHHHHHHh--ccC-----------------------------------------CCceeccccccc
Q psy8251 62 WCCRFCNKRMMTKLSLGIHERI--HTG-----------------------------------------QRPYICDWCGVG 98 (434)
Q Consensus 62 ~~C~~C~~~f~~~~~L~~H~~~--h~~-----------------------------------------~~~~~C~~C~~~ 98 (434)
|.|..|...+..+.+|.+|++. |.. .-.+.|.+|+..
T Consensus 519 ~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~ye 598 (1406)
T KOG1146|consen 519 YPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYE 598 (1406)
T ss_pred ccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcch
Confidence 8888888888888888888753 100 012677777777
Q ss_pred cccchHHHchhhhc-CCCCceeccccCcccCChhHHHhhhccc
Q psy8251 99 FRSKANLLQHQPVH-TGQRKYNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 99 f~~~~~l~~H~~~h-~~~~~~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
-.-..+|..|+..- +...|..|..++..+.....+..+.+.+
T Consensus 599 tniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 599 TNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred hhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCCCC
Confidence 77777777776543 3333466666776666666666666555
No 54
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.30 E-value=0.079 Score=29.32 Aligned_cols=20 Identities=30% Similarity=0.192 Sum_probs=12.5
Q ss_pred CCCccccccCCHHHHHHHHHH
Q psy8251 1 MWPRGFRGFRQVINLLKHRRE 21 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~ 21 (434)
.|+.||+.| ....|..|+..
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHHh
Confidence 366777777 55666666543
No 55
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=92.62 E-value=0.13 Score=47.43 Aligned_cols=78 Identities=21% Similarity=0.366 Sum_probs=44.0
Q ss_pred eecCc--CCcccCCHHHHHHHHHhccCCCceeccccc---cccc------cchHHHchhhhcCCCC----ceeccccCcc
Q psy8251 62 WCCRF--CNKRMMTKLSLGIHERIHTGQRPYICDWCG---VGFR------SKANLLQHQPVHTGQR----KYNCTVCGKA 126 (434)
Q Consensus 62 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~l~~H~~~h~~~~----~~~C~~C~k~ 126 (434)
|.|+. |.........|..|.+..++. +.|.+|- +.|. +...|..|...-..+. .-.|..|...
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 55665 555555566777777655542 5566653 2222 3445555544322221 1357777777
Q ss_pred cCChhHHHhhhcccc
Q psy8251 127 FSRKSFVTTHMRVHT 141 (434)
Q Consensus 127 F~~~~~L~~H~~~h~ 141 (434)
|-+-..|.+|++..|
T Consensus 230 FYdDDEL~~HcR~~H 244 (493)
T COG5236 230 FYDDDELRRHCRLRH 244 (493)
T ss_pred ecChHHHHHHHHhhh
Confidence 777777777777544
No 56
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.25 E-value=0.078 Score=31.93 Aligned_cols=24 Identities=25% Similarity=0.520 Sum_probs=21.5
Q ss_pred CcccCCccccccCCchhHHHHHHh
Q psy8251 310 RPYDCDVCGAKFTQVGDMRRHRKK 333 (434)
Q Consensus 310 k~f~C~~Cgk~F~~~~~L~~H~r~ 333 (434)
.+|.|++|++.|.+...+..|++.
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHCh
Confidence 368999999999999999999864
No 57
>KOG2893|consensus
Probab=91.31 E-value=0.041 Score=47.98 Aligned_cols=41 Identities=27% Similarity=0.720 Sum_probs=26.9
Q ss_pred ccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHH
Q psy8251 31 CHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIH 80 (434)
Q Consensus 31 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H 80 (434)
|.+|+..|.+...|..|++.. -|+|.+|.+.+.+--.|..|
T Consensus 13 cwycnrefddekiliqhqkak---------hfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK---------HFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhhc---------cceeeeehhhhccCCCceee
Confidence 777777777777777776533 37777777665555555555
No 58
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=91.31 E-value=0.04 Score=54.71 Aligned_cols=51 Identities=27% Similarity=0.468 Sum_probs=33.3
Q ss_pred cccccccchHHHchhhhcCCCC--ceeccccCcccCChhHHHhhhccccCCcc
Q psy8251 95 CGVGFRSKANLLQHQPVHTGQR--KYNCTVCGKAFSRKSFVTTHMRVHTGKLT 145 (434)
Q Consensus 95 C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~k~F~~~~~L~~H~~~h~~~~~ 145 (434)
|...+.....+..|...|.... .+.+..|.+.|.....+..|++.+....+
T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 394 CIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred hhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence 4444455555555555444433 46677888888888888888888776554
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.61 E-value=0.067 Score=53.07 Aligned_cols=63 Identities=27% Similarity=0.462 Sum_probs=49.1
Q ss_pred CCeecCcCCcccCCHHHHHHHHH--hccCC--Cceecc--ccccccccchHHHchhhhcCCCCceeccc
Q psy8251 60 SPWCCRFCNKRMMTKLSLGIHER--IHTGQ--RPYICD--WCGVGFRSKANLLQHQPVHTGQRKYNCTV 122 (434)
Q Consensus 60 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 122 (434)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+..++.+..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKL 356 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCcccccc
Confidence 46778888888888888888888 78888 788888 68888888888888888887777666643
No 60
>KOG1146|consensus
Probab=90.42 E-value=0.08 Score=57.49 Aligned_cols=92 Identities=16% Similarity=0.250 Sum_probs=59.7
Q ss_pred ccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhh
Q psy8251 31 CHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQP 110 (434)
Q Consensus 31 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 110 (434)
|..|++.|...-.+. |+-. ...|.|..|...|.....|..|. ..|-+.+.......-|..
T Consensus 1263 c~~~~~~~~~~~~~~-~l~~--------~~~~~~~~~~~~~~~~~~l~~~~-----------~k~~~~~~~~~~~~~~~l 1322 (1406)
T KOG1146|consen 1263 CGAVDELLTPSFGIS-TLDV--------THRYLCRQCKMAFDGEAPLTAHQ-----------RKFCFAGRGSGGSMPPPL 1322 (1406)
T ss_pred hhhccccccCcccee-eccc--------chhHHHHHHHhhhcchhHHHHHH-----------HHHHhccCccccCCCCcc
Confidence 666666666655555 4432 23566666666666666666664 223344444455555555
Q ss_pred hcCCCCceeccccCcccCChhHHHhhhccccCC
Q psy8251 111 VHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTGK 143 (434)
Q Consensus 111 ~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~ 143 (434)
.+....+| |..|...|.....|+.|++.-+++
T Consensus 1323 ~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1323 RVPDCTYH-CLACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred cCcccccc-chHHHhhcchhHHHHHHHHHhhhc
Confidence 56666667 999999999999999999964443
No 61
>KOG2785|consensus
Probab=89.80 E-value=1.1 Score=42.44 Aligned_cols=137 Identities=21% Similarity=0.226 Sum_probs=86.3
Q ss_pred CCCccccccCCHHHHHHHHHH--cC-----------------------------------CCCCCCCccchhhhcCCHHH
Q psy8251 1 MWPRGFRGFRQVINLLKHRRE--HG-----------------------------------GNLSFDNCHICKETFYKHSE 43 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l~~H~~~--h~-----------------------------------~~~~~~~C~~C~~~f~~~~~ 43 (434)
+|..|...|.+...-+.|+++ |. ++.++. |.+|.+.|.+...
T Consensus 5 tC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~-c~~c~k~~~s~~a 83 (390)
T KOG2785|consen 5 TCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVY-CEACNKSFASPKA 83 (390)
T ss_pred eeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhccccee-hHHhhccccChhh
Confidence 599999999999999999765 21 234566 9999999999998
Q ss_pred HHHHHHhh-cCCCC---C--C------CCCee-------------cCcCCcccCCHHHHHHHH------Hhc------cC
Q psy8251 44 LVRHLERE-HNIPR---S--E------SSPWC-------------CRFCNKRMMTKLSLGIHE------RIH------TG 86 (434)
Q Consensus 44 L~~H~~~~-H~~~~---~--~------~~~~~-------------C~~C~~~f~~~~~L~~H~------~~h------~~ 86 (434)
...|+... |.... . + ..... +..+-...........+. .+- ..
T Consensus 84 ~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~e 163 (390)
T KOG2785|consen 84 HENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDEDE 163 (390)
T ss_pred HHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhcc
Confidence 88887522 10000 0 0 00011 111211221111111111 111 11
Q ss_pred CCceeccccccccccchHHHchhhhcCCC-----------------------CceeccccC---cccCChhHHHhhhc
Q psy8251 87 QRPYICDWCGVGFRSKANLLQHQPVHTGQ-----------------------RKYNCTVCG---KAFSRKSFVTTHMR 138 (434)
Q Consensus 87 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---k~F~~~~~L~~H~~ 138 (434)
..|-.|-.|++.+.+...-..||..++|- ..|.|-.|+ +.|.+-...+.||.
T Consensus 164 ~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 164 LIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred cCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 13467888999998888888888877652 147899999 99999999999987
No 62
>KOG2482|consensus
Probab=89.73 E-value=0.88 Score=42.22 Aligned_cols=180 Identities=18% Similarity=0.235 Sum_probs=89.9
Q ss_pred HHHHHHHHHcCCCCCCCCccchhhhc-CCHHHHHHHHHhhcCCCCCC-----------------CCCeecCcCCcccCCH
Q psy8251 13 INLLKHRREHGGNLSFDNCHICKETF-YKHSELVRHLEREHNIPRSE-----------------SSPWCCRFCNKRMMTK 74 (434)
Q Consensus 13 ~~l~~H~~~h~~~~~~~~C~~C~~~f-~~~~~L~~H~~~~H~~~~~~-----------------~~~~~C~~C~~~f~~~ 74 (434)
..|..|++...+..-...|-.|...+ .+++....|+-..|+..... -..+.|-.|.+.|..+
T Consensus 129 eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdk 208 (423)
T KOG2482|consen 129 EALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDK 208 (423)
T ss_pred HHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCc
Confidence 34556665543332222388887664 56677778887667532110 1136788888999888
Q ss_pred HHHHHHHHhc--cCCCc--------eeccc--ccccccc-chHHH-----chhhhc--------CCCCc--eeccccCcc
Q psy8251 75 LSLGIHERIH--TGQRP--------YICDW--CGVGFRS-KANLL-----QHQPVH--------TGQRK--YNCTVCGKA 126 (434)
Q Consensus 75 ~~L~~H~~~h--~~~~~--------~~C~~--C~~~f~~-~~~l~-----~H~~~h--------~~~~~--~~C~~C~k~ 126 (434)
..|+.||+.- ..-.| |.=.. =|++... .+.+. .+...- .+..+ .+|-.|...
T Consensus 209 ntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~ 288 (423)
T KOG2482|consen 209 NTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNF 288 (423)
T ss_pred HHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccc
Confidence 8888888743 22212 22111 1222111 01110 000000 01112 589999999
Q ss_pred cCChhHHHhhhccccCCccccccccccc-ccccccchhhhhhhhc-eecccCCcccCChhhHHHHhhh
Q psy8251 127 FSRKSFVTTHMRVHTGKLTGKFGQKMSA-LSGTAFAVEYEIIQEC-LFAKNGGKAFSRKSFVTTHMRV 192 (434)
Q Consensus 127 F~~~~~L~~H~~~h~~~~~~~~~~~~~~-~~~~~~~~~~~~C~~C-~~C~~C~~~F~~~~~L~~H~~~ 192 (434)
..+...|..||..-|.-.-.+....... .=+.....+|.+=..| ..|-.|.-.|.....|..||--
T Consensus 289 ~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 289 YENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred hhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 9999999999986543211110000000 0000011111100000 1223338889999999988864
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.64 E-value=0.26 Score=29.60 Aligned_cols=23 Identities=22% Similarity=0.634 Sum_probs=19.9
Q ss_pred ceeccccCcccCChhHHHhhhcc
Q psy8251 117 KYNCTVCGKAFSRKSFVTTHMRV 139 (434)
Q Consensus 117 ~~~C~~C~k~F~~~~~L~~H~~~ 139 (434)
+|.|+.|++.|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999764
No 64
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=89.47 E-value=0.52 Score=43.69 Aligned_cols=94 Identities=24% Similarity=0.395 Sum_probs=61.4
Q ss_pred CHHHHHHHHHHcCCC---CCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCC----cccCCHHHHHHHHHh
Q psy8251 11 QVINLLKHRREHGGN---LSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCN----KRMMTKLSLGIHERI 83 (434)
Q Consensus 11 ~~~~l~~H~~~h~~~---~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~----~~f~~~~~L~~H~~~ 83 (434)
+...|+.|...-..+ +....|..|...|.+-..|.+|++..| ++-|.|..-+ .-|.+...|..|.+.
T Consensus 200 ~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H------E~ChICD~v~p~~~QYFK~Y~~Le~HF~~ 273 (493)
T COG5236 200 RSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH------EACHICDMVGPIRYQYFKSYEDLEAHFRN 273 (493)
T ss_pred ecccccccccCCccccCcCCCchhhhccceecChHHHHHHHHhhh------hhhhhhhccCccchhhhhCHHHHHHHhhc
Confidence 345677775432222 223349999999999999999999777 3444444332 237788888888764
Q ss_pred ccCCCceeccc--cc----cccccchHHHchhhhcCC
Q psy8251 84 HTGQRPYICDW--CG----VGFRSKANLLQHQPVHTG 114 (434)
Q Consensus 84 h~~~~~~~C~~--C~----~~f~~~~~l~~H~~~h~~ 114 (434)
-+ |.|.. |. ..|.....|..|+..-++
T Consensus 274 ~h----y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~ 306 (493)
T COG5236 274 AH----YCCTFQTCRVGKCYVFPYHTELLEHLTRFHK 306 (493)
T ss_pred Cc----eEEEEEEEecCcEEEeccHHHHHHHHHHHhh
Confidence 33 66654 42 367888888899765443
No 65
>PF04838 Baculo_LEF5: Baculoviridae late expression factor 5 ; InterPro: IPR006923 This is a family of Baculoviridae late expression factor 5, required for late and very late gene expression.; GO: 0006355 regulation of transcription, DNA-dependent
Probab=87.04 E-value=0.19 Score=40.66 Aligned_cols=29 Identities=41% Similarity=0.458 Sum_probs=25.3
Q ss_pred eecccceeeeeeccceeeecccccccccc
Q psy8251 397 FTNFNVKTQILNLTSTFHLFHPLTKYNLS 425 (434)
Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (434)
..+.|||-.-+|..||||+||-|..|.-+
T Consensus 27 nyp~nVKNkTFNF~nTGHlFHsLYAYvP~ 55 (159)
T PF04838_consen 27 NYPKNVKNKTFNFANTGHLFHSLYAYVPS 55 (159)
T ss_pred hcccccccCeeecCCCchhhhhhhhccCC
Confidence 45669999999999999999999999754
No 66
>KOG2893|consensus
Probab=86.88 E-value=0.26 Score=43.10 Aligned_cols=42 Identities=24% Similarity=0.488 Sum_probs=33.7
Q ss_pred cCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchh
Q psy8251 64 CRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQ 109 (434)
Q Consensus 64 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 109 (434)
|.+|++.|....-|.+|++..+ |+|.+|-+.+.+-..|..|-
T Consensus 13 cwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihc 54 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHC 54 (341)
T ss_pred eeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeeh
Confidence 8889999998888888877543 88988888877777777773
No 67
>KOG2785|consensus
Probab=86.22 E-value=1.7 Score=41.23 Aligned_cols=161 Identities=20% Similarity=0.275 Sum_probs=92.0
Q ss_pred CCCCccchhhhcCCHHHHHHHHHhh-c---------CCC----------------------CCCCCCeecCcCCcccCCH
Q psy8251 27 SFDNCHICKETFYKHSELVRHLERE-H---------NIP----------------------RSESSPWCCRFCNKRMMTK 74 (434)
Q Consensus 27 ~~~~C~~C~~~f~~~~~L~~H~~~~-H---------~~~----------------------~~~~~~~~C~~C~~~f~~~ 74 (434)
.|. |.-|...|.+...-+.|++.. | ..+ .....++.|..|.+.|.+.
T Consensus 3 ~ft-C~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~ 81 (390)
T KOG2785|consen 3 GFT-CNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASP 81 (390)
T ss_pred cce-eeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccCh
Confidence 355 999999999999999998621 2 000 0123468899999999999
Q ss_pred HHHHHHHHhccCC-----------------Ccee-------------ccccccccccchHHHchhhh------------c
Q psy8251 75 LSLGIHERIHTGQ-----------------RPYI-------------CDWCGVGFRSKANLLQHQPV------------H 112 (434)
Q Consensus 75 ~~L~~H~~~h~~~-----------------~~~~-------------C~~C~~~f~~~~~l~~H~~~------------h 112 (434)
.....|+..-... .... +..+............+... -
T Consensus 82 ~a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e 161 (390)
T KOG2785|consen 82 KAHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDE 161 (390)
T ss_pred hhHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchh
Confidence 9888887532110 0011 11111111111111111110 0
Q ss_pred CCCCceeccccCcccCChhHHHhhhccccCCccccc----ccc-cccccccccchhhhhhhhceecccCCcccCChhhHH
Q psy8251 113 TGQRKYNCTVCGKAFSRKSFVTTHMRVHTGKLTGKF----GQK-MSALSGTAFAVEYEIIQECLFAKNGGKAFSRKSFVT 187 (434)
Q Consensus 113 ~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~~----~~~-~~~~~~~~~~~~~~~C~~C~~C~~C~~~F~~~~~L~ 187 (434)
....|-.|-.|++.+.+-..-..||..+||-..... ... ....-+......+ .|++|..=++.|.+....+
T Consensus 162 ~e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~----~CL~CN~~~~~f~sleavr 237 (390)
T KOG2785|consen 162 DELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGF----ICLFCNELGRPFSSLEAVR 237 (390)
T ss_pred cccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCc----eEEEeccccCcccccHHHH
Confidence 011257799999999999999999999887433110 000 0000111111222 2444443349999999999
Q ss_pred HHhhh
Q psy8251 188 THMRV 192 (434)
Q Consensus 188 ~H~~~ 192 (434)
.||+.
T Consensus 238 ~HM~~ 242 (390)
T KOG2785|consen 238 AHMRD 242 (390)
T ss_pred HHHhh
Confidence 99974
No 68
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=86.20 E-value=0.42 Score=31.55 Aligned_cols=31 Identities=23% Similarity=0.620 Sum_probs=23.5
Q ss_pred cCCCCCCCCccchhhhcCCHHHHHHHHHhhcC
Q psy8251 22 HGGNLSFDNCHICKETFYKHSELVRHLEREHN 53 (434)
Q Consensus 22 h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 53 (434)
..|+.-+. |+.||..|....++.+|....|+
T Consensus 12 RDGE~~lr-CPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 12 RDGEEFLR-CPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred cCCceeee-CCchhHHHHHhHHHHHHhhHHhh
Confidence 34565566 88888888888888888877774
No 69
>KOG4173|consensus
Probab=85.62 E-value=0.33 Score=41.55 Aligned_cols=75 Identities=25% Similarity=0.489 Sum_probs=48.2
Q ss_pred ccc--hhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHh----------ccCCCceeccc--cc
Q psy8251 31 CHI--CKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERI----------HTGQRPYICDW--CG 96 (434)
Q Consensus 31 C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~----------h~~~~~~~C~~--C~ 96 (434)
|++ |...|.+......|...-| ...|..|.+.|.+...|..|+.. -.|...|.|-+ |+
T Consensus 82 cqvagc~~~~d~lD~~E~hY~~~h--------~~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 82 CQVAGCCQVFDALDDYEHHYHTLH--------GNSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccchHHHHhhhhhHHHhhhhcc--------cchhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 654 6666777666666654444 34577777777777777777642 23445677854 88
Q ss_pred cccccchHHHchhh-hcC
Q psy8251 97 VGFRSKANLLQHQP-VHT 113 (434)
Q Consensus 97 ~~f~~~~~l~~H~~-~h~ 113 (434)
..|.+...-+.|+- +|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 88888777777753 443
No 70
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=84.67 E-value=0.51 Score=28.06 Aligned_cols=10 Identities=40% Similarity=1.112 Sum_probs=4.9
Q ss_pred CceeccccCc
Q psy8251 116 RKYNCTVCGK 125 (434)
Q Consensus 116 ~~~~C~~C~k 125 (434)
.++.|++|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 3455555543
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=82.29 E-value=0.52 Score=31.15 Aligned_cols=32 Identities=25% Similarity=0.515 Sum_probs=25.7
Q ss_pred hcCCCCceeccccCcccCChhHHHhhhccccC
Q psy8251 111 VHTGQRKYNCTVCGKAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 111 ~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~ 142 (434)
.-.|+.-+.|+-|++.|....++.+|+..-|+
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 34567778999999999999999999876554
No 72
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=80.30 E-value=1.2 Score=29.98 Aligned_cols=46 Identities=22% Similarity=0.366 Sum_probs=30.3
Q ss_pred cccccCCCC-CCC--CCcccccCCCCCCCcchhhhhccccchhhhhcccccccCCCCCcccCCccccc
Q psy8251 256 SCISQGKGS-KEK--KDDEENEYGKPEDFIPDVFQAEFSKENDKLLSYLEFCDHHCDRPYDCDVCGAK 320 (434)
Q Consensus 256 ~C~~c~~~~-~~~--~~~~c~~c~~~~~~~~~~f~~~~~~~~~~~~~~l~~~~H~~~k~f~C~~Cgk~ 320 (434)
.|+.||..- ..+ ..|.|+.||....+-+..++. -..+|.|+.||..
T Consensus 11 ~CtSCg~~i~p~e~~v~F~CPnCGe~~I~Rc~~CRk-------------------~g~~Y~Cp~CGF~ 59 (61)
T COG2888 11 VCTSCGREIAPGETAVKFPCPNCGEVEIYRCAKCRK-------------------LGNPYRCPKCGFE 59 (61)
T ss_pred eeccCCCEeccCCceeEeeCCCCCceeeehhhhHHH-------------------cCCceECCCcCcc
Confidence 788888876 333 368899999764333333322 1368999999963
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.43 E-value=2.1 Score=33.45 Aligned_cols=23 Identities=30% Similarity=0.611 Sum_probs=18.3
Q ss_pred CCCCCCccchhhhcCCHHHHHHHH
Q psy8251 25 NLSFDNCHICKETFYKHSELVRHL 48 (434)
Q Consensus 25 ~~~~~~C~~C~~~f~~~~~L~~H~ 48 (434)
+.|.. |++||-...+...|.+-.
T Consensus 13 ~LP~~-CpiCgLtLVss~HLARSy 35 (112)
T TIGR00622 13 ELPVE-CPICGLTLILSTHLARSY 35 (112)
T ss_pred CCCCc-CCcCCCEEeccchHHHhh
Confidence 45777 999999988888888763
No 74
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=77.92 E-value=0.92 Score=35.36 Aligned_cols=14 Identities=21% Similarity=0.619 Sum_probs=8.1
Q ss_pred CeecCcCCcccCCH
Q psy8251 61 PWCCRFCNKRMMTK 74 (434)
Q Consensus 61 ~~~C~~C~~~f~~~ 74 (434)
|-.|+.||..|.-.
T Consensus 26 PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 26 PIVCPKCGTEFPPE 39 (108)
T ss_pred CccCCCCCCccCcc
Confidence 55566666666543
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=77.01 E-value=2.7 Score=32.90 Aligned_cols=82 Identities=20% Similarity=0.245 Sum_probs=46.4
Q ss_pred CCeecCcCCcccCCHHHHHHHHHhccCCC-ceeccccccccccchHHHc-h--hhh-cCC---------CCceec----c
Q psy8251 60 SPWCCRFCNKRMMTKLSLGIHERIHTGQR-PYICDWCGVGFRSKANLLQ-H--QPV-HTG---------QRKYNC----T 121 (434)
Q Consensus 60 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~-~~~C~~C~~~f~~~~~l~~-H--~~~-h~~---------~~~~~C----~ 121 (434)
+...|..|+.+..- +++..|++..+... ...-..=...+.....|.. . +.. ... ..-|.| .
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~~ 88 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDPP 88 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCCC
Confidence 45779999987765 88999998443221 1101000001111110100 0 000 000 023899 9
Q ss_pred ccCcccCChhHHHhhhccccC
Q psy8251 122 VCGKAFSRKSFVTTHMRVHTG 142 (434)
Q Consensus 122 ~C~k~F~~~~~L~~H~~~h~~ 142 (434)
.|+....+...+..|.+.+||
T Consensus 89 ~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 89 HCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCCcEeccHHHHHHHHHHhcC
Confidence 999999999999999998775
No 76
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=76.91 E-value=2.5 Score=28.64 Aligned_cols=46 Identities=20% Similarity=0.312 Sum_probs=27.8
Q ss_pred cccccCCCCCCC---CCcccccCCCCCCCcchhhhhccccchhhhhcccccccCCCCCcccCCccccc
Q psy8251 256 SCISQGKGSKEK---KDDEENEYGKPEDFIPDVFQAEFSKENDKLLSYLEFCDHHCDRPYDCDVCGAK 320 (434)
Q Consensus 256 ~C~~c~~~~~~~---~~~~c~~c~~~~~~~~~~f~~~~~~~~~~~~~~l~~~~H~~~k~f~C~~Cgk~ 320 (434)
.|+.|+..-... ..|.|+.||+...+-+..++. -..+|.|+.||..
T Consensus 9 ~CtSCg~~i~~~~~~~~F~CPnCG~~~I~RC~~CRk-------------------~~~~Y~CP~CGF~ 57 (59)
T PRK14890 9 KCTSCGIEIAPREKAVKFLCPNCGEVIIYRCEKCRK-------------------QSNPYTCPKCGFE 57 (59)
T ss_pred cccCCCCcccCCCccCEeeCCCCCCeeEeechhHHh-------------------cCCceECCCCCCc
Confidence 788888765432 368888888752222222222 1258999999863
No 77
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=76.71 E-value=3.1 Score=32.50 Aligned_cols=81 Identities=19% Similarity=0.281 Sum_probs=53.6
Q ss_pred CCCeecCcCCcccCCHHHHHHHHHhccCCC------------ceeccccccccccchHHHchhhhcCCCCceeccccCcc
Q psy8251 59 SSPWCCRFCNKRMMTKLSLGIHERIHTGQR------------PYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKA 126 (434)
Q Consensus 59 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~------------~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~ 126 (434)
+-|-.|+.||-..-....|.+...---.-+ .-.|--|...|........- .-.....|.|+.|...
T Consensus 13 ~LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~ 90 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNV 90 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccccc--ccccccceeCCCCCCc
Confidence 347889999999888888876522111111 12388899988765421100 0122346999999999
Q ss_pred cCChhHHHhhhcccc
Q psy8251 127 FSRKSFVTTHMRVHT 141 (434)
Q Consensus 127 F~~~~~L~~H~~~h~ 141 (434)
|-..-+.-.|...|.
T Consensus 91 FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 91 FCVDCDVFVHESLHC 105 (112)
T ss_pred cccccchhhhhhccC
Confidence 998888888887764
No 78
>KOG2186|consensus
Probab=76.19 E-value=1.7 Score=38.77 Aligned_cols=46 Identities=26% Similarity=0.659 Sum_probs=26.6
Q ss_pred eeccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhc
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMR 138 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~ 138 (434)
|.|..||.... +..+.+|+...++ .-|.|-.|++.|.. .++..|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 55666666554 3345556666555 33666666666665 55555654
No 79
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=74.94 E-value=1.1 Score=25.08 Aligned_cols=9 Identities=33% Similarity=0.748 Sum_probs=7.3
Q ss_pred ccchhhhcC
Q psy8251 31 CHICKETFY 39 (434)
Q Consensus 31 C~~C~~~f~ 39 (434)
|+.||..|.
T Consensus 17 Cp~CG~~F~ 25 (26)
T PF10571_consen 17 CPHCGYDFE 25 (26)
T ss_pred CCCCCCCCc
Confidence 888888875
No 80
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=73.22 E-value=2.6 Score=32.88 Aligned_cols=31 Identities=23% Similarity=0.672 Sum_probs=24.3
Q ss_pred CeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccc
Q psy8251 61 PWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSK 102 (434)
Q Consensus 61 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 102 (434)
...|+.||+.|... +..|..|+.||..|...
T Consensus 9 KR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 45799999998763 34678899999988765
No 81
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=72.96 E-value=2.3 Score=27.97 Aligned_cols=25 Identities=36% Similarity=0.840 Sum_probs=20.3
Q ss_pred cccCCccccccCCc-----hhHHHHHH-hhc
Q psy8251 311 PYDCDVCGAKFTQV-----GDMRRHRK-KHD 335 (434)
Q Consensus 311 ~f~C~~Cgk~F~~~-----~~L~~H~r-~H~ 335 (434)
.=.|..|++.+... ++|.+|++ +|.
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 35799999998766 69999998 564
No 82
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=72.40 E-value=2.1 Score=25.65 Aligned_cols=10 Identities=50% Similarity=1.305 Sum_probs=6.8
Q ss_pred CcccCCcccc
Q psy8251 310 RPYDCDVCGA 319 (434)
Q Consensus 310 k~f~C~~Cgk 319 (434)
.|-.|++||.
T Consensus 17 ~p~~CP~Cg~ 26 (34)
T cd00729 17 APEKCPICGA 26 (34)
T ss_pred CCCcCcCCCC
Confidence 4567777775
No 83
>KOG4173|consensus
Probab=72.06 E-value=1.9 Score=37.09 Aligned_cols=79 Identities=22% Similarity=0.378 Sum_probs=61.6
Q ss_pred CCCCeecCc--CCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhhhc----------CCCCceec--ccc
Q psy8251 58 ESSPWCCRF--CNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVH----------TGQRKYNC--TVC 123 (434)
Q Consensus 58 ~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~~~C--~~C 123 (434)
....|.|++ |-..|........|..+-++ -.|..|.+.|.+.-.|..|+.-. .|..-|.| ..|
T Consensus 76 ~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 76 RVPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred ccccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 345588888 77888888888888766555 36999999999999999997643 23445889 459
Q ss_pred CcccCChhHHHhhhcc
Q psy8251 124 GKAFSRKSFVTTHMRV 139 (434)
Q Consensus 124 ~k~F~~~~~L~~H~~~ 139 (434)
+..|.+...-..|+-.
T Consensus 153 t~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHMIR 168 (253)
T ss_pred hhhhhhhhhhhhHHHH
Confidence 9999999999999754
No 84
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=70.09 E-value=1.8 Score=38.47 Aligned_cols=43 Identities=26% Similarity=0.477 Sum_probs=24.6
Q ss_pred CceeccccccccccchHHHchhhh----------cCCCCc-----eeccccCcccCCh
Q psy8251 88 RPYICDWCGVGFRSKANLLQHQPV----------HTGQRK-----YNCTVCGKAFSRK 130 (434)
Q Consensus 88 ~~~~C~~C~~~f~~~~~l~~H~~~----------h~~~~~-----~~C~~C~k~F~~~ 130 (434)
+...|++|++.|.+..-.....+. ..+..| ..|+.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 445677777666655433333321 112233 5799999988755
No 85
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=66.80 E-value=3.3 Score=26.35 Aligned_cols=26 Identities=27% Similarity=0.590 Sum_probs=17.3
Q ss_pred CCCcccCCccccccCCc----hhHHHHHHh
Q psy8251 308 CDRPYDCDVCGAKFTQV----GDMRRHRKK 333 (434)
Q Consensus 308 ~~k~f~C~~Cgk~F~~~----~~L~~H~r~ 333 (434)
+.....|..|++.+... ++|.+|++.
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 34567999999998874 899999943
No 86
>PHA00626 hypothetical protein
Probab=66.74 E-value=4 Score=27.19 Aligned_cols=15 Identities=27% Similarity=0.481 Sum_probs=13.0
Q ss_pred CcccCCccccccCCc
Q psy8251 310 RPYDCDVCGAKFTQV 324 (434)
Q Consensus 310 k~f~C~~Cgk~F~~~ 324 (434)
..|+|+.||..|+..
T Consensus 22 nrYkCkdCGY~ft~~ 36 (59)
T PHA00626 22 DDYVCCDCGYNDSKD 36 (59)
T ss_pred cceEcCCCCCeechh
Confidence 579999999999764
No 87
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=65.98 E-value=3.6 Score=34.68 Aligned_cols=14 Identities=43% Similarity=1.021 Sum_probs=12.1
Q ss_pred CCCCCcccCCcccc
Q psy8251 306 HHCDRPYDCDVCGA 319 (434)
Q Consensus 306 H~~~k~f~C~~Cgk 319 (434)
+.++.|-+|++||.
T Consensus 144 ~~ge~P~~CPiCga 157 (166)
T COG1592 144 HEGEAPEVCPICGA 157 (166)
T ss_pred ccCCCCCcCCCCCC
Confidence 67789999999994
No 88
>KOG2186|consensus
Probab=65.69 E-value=3.7 Score=36.63 Aligned_cols=45 Identities=24% Similarity=0.564 Sum_probs=23.9
Q ss_pred ccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCcccCCHHHHHHHHHh
Q psy8251 31 CHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNKRMMTKLSLGIHERI 83 (434)
Q Consensus 31 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 83 (434)
|..||...... .+.+|+.+-+ ..-|.|-.|++.|.. .++..|..-
T Consensus 6 CnvCgEsvKKp-~vekH~srCr------n~~fSCIDC~k~F~~-~sYknH~kC 50 (276)
T KOG2186|consen 6 CNVCGESVKKP-QVEKHMSRCR------NAYFSCIDCGKTFER-VSYKNHTKC 50 (276)
T ss_pred hhhhhhhcccc-chHHHHHhcc------CCeeEEeeccccccc-chhhhhhhh
Confidence 66666554443 3444655322 245666666666655 455555443
No 89
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=64.79 E-value=4.9 Score=22.39 Aligned_cols=20 Identities=20% Similarity=0.371 Sum_probs=16.8
Q ss_pred ccCCccccccCCchhHHHHHH
Q psy8251 312 YDCDVCGAKFTQVGDMRRHRK 332 (434)
Q Consensus 312 f~C~~Cgk~F~~~~~L~~H~r 332 (434)
..|++|++.+ ....+..|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3699999999 6688888875
No 90
>KOG2272|consensus
Probab=63.15 E-value=3.2 Score=36.89 Aligned_cols=18 Identities=22% Similarity=0.423 Sum_probs=14.2
Q ss_pred hhhceecccCCcccCChh
Q psy8251 167 IQECLFAKNGGKAFSRKS 184 (434)
Q Consensus 167 C~~C~~C~~C~~~F~~~~ 184 (434)
|..|+.|+.|++...++.
T Consensus 277 Cv~cf~Cs~Cdkkl~~K~ 294 (332)
T KOG2272|consen 277 CVECFSCSTCDKKLTQKN 294 (332)
T ss_pred cccccccccccccccccc
Confidence 888999999988775543
No 91
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=61.19 E-value=7.2 Score=23.75 Aligned_cols=33 Identities=27% Similarity=0.595 Sum_probs=15.4
Q ss_pred eeccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
+.|+.|+..|.-...... .......|+.|+..|
T Consensus 3 ~~CP~C~~~~~v~~~~~~-----~~~~~v~C~~C~~~~ 35 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG-----ANGGKVRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEEEeCHHHcC-----CCCCEEECCCCCCEE
Confidence 356666665554432211 111235666666555
No 92
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=61.14 E-value=3.8 Score=32.54 Aligned_cols=14 Identities=7% Similarity=0.273 Sum_probs=8.4
Q ss_pred CeecCcCCcccCCH
Q psy8251 61 PWCCRFCNKRMMTK 74 (434)
Q Consensus 61 ~~~C~~C~~~f~~~ 74 (434)
|-.|+.||..|.-.
T Consensus 26 p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 26 PAVSPYTGEQFPPE 39 (129)
T ss_pred CccCCCcCCccCcc
Confidence 55566666666444
No 93
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=60.85 E-value=5.1 Score=31.33 Aligned_cols=26 Identities=31% Similarity=0.584 Sum_probs=23.9
Q ss_pred cccC----CccccccCCchhHHHHHHhhcc
Q psy8251 311 PYDC----DVCGAKFTQVGDMRRHRKKHDE 336 (434)
Q Consensus 311 ~f~C----~~Cgk~F~~~~~L~~H~r~H~~ 336 (434)
-|.| ..|++.+.+...+++|.+.++|
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 3899 9999999999999999998875
No 94
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=60.30 E-value=7.3 Score=23.75 Aligned_cols=32 Identities=34% Similarity=0.788 Sum_probs=15.0
Q ss_pred eccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 91 ICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 91 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
.|+.|+..|.-...- .-.+....+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHH-----cccCCcEEECCCCCcEe
Confidence 455565555544321 11222345666666555
No 95
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=58.79 E-value=9.1 Score=32.22 Aligned_cols=34 Identities=24% Similarity=0.503 Sum_probs=22.3
Q ss_pred cCCCceeccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 85 TGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 85 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
.+..-|.|+.|+..|+....+. ..|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 3445577877777777666654 247888887654
No 96
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=54.90 E-value=5.2 Score=24.76 Aligned_cols=12 Identities=42% Similarity=0.994 Sum_probs=6.6
Q ss_pred eecccccccccc
Q psy8251 90 YICDWCGVGFRS 101 (434)
Q Consensus 90 ~~C~~C~~~f~~ 101 (434)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (41)
T smart00834 6 YRCEDCGHTFEV 17 (41)
T ss_pred EEcCCCCCEEEE
Confidence 555666655543
No 97
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=54.89 E-value=16 Score=30.36 Aligned_cols=38 Identities=16% Similarity=0.410 Sum_probs=21.2
Q ss_pred CCCceeccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 86 GQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 86 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
+...|.|+.|+..|.....+..- .. ...|.|+.||...
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~~---d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQLL---DM-DGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHhc---CC-CCcEECCCCCCEE
Confidence 33457777777777654433220 11 2337777777654
No 98
>KOG2071|consensus
Probab=54.73 E-value=11 Score=38.27 Aligned_cols=25 Identities=24% Similarity=0.312 Sum_probs=20.5
Q ss_pred eecccCCcccCChhhHHHHhhhccC
Q psy8251 171 LFAKNGGKAFSRKSFVTTHMRVHTG 195 (434)
Q Consensus 171 ~~C~~C~~~F~~~~~L~~H~~~H~~ 195 (434)
..|..||..|.+......||.+|..
T Consensus 419 nqC~~CG~R~~~~ee~sk~md~H~d 443 (579)
T KOG2071|consen 419 NQCKSCGLRFDDSEERSKHMDIHDD 443 (579)
T ss_pred chhcccccccccchhhhhHhhhhhh
Confidence 3445559999999999999998875
No 99
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=54.54 E-value=9.2 Score=24.41 Aligned_cols=11 Identities=18% Similarity=0.619 Sum_probs=5.7
Q ss_pred eecCcCCcccC
Q psy8251 62 WCCRFCNKRMM 72 (434)
Q Consensus 62 ~~C~~C~~~f~ 72 (434)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 55555555443
No 100
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=54.43 E-value=12 Score=22.61 Aligned_cols=32 Identities=25% Similarity=0.653 Sum_probs=15.3
Q ss_pred eccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 91 ICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 91 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
.|+.|+..|.-..... -......+|+.|+..|
T Consensus 4 ~Cp~C~~~y~i~d~~i-----p~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKI-----PPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHC-----CCCCcEEECCCCCCEe
Confidence 4566665555443321 1222335666666555
No 101
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=54.36 E-value=15 Score=24.89 Aligned_cols=27 Identities=19% Similarity=0.368 Sum_probs=12.5
Q ss_pred chHHHchhhhcCCCCceeccc----cCcccC
Q psy8251 102 KANLLQHQPVHTGQRKYNCTV----CGKAFS 128 (434)
Q Consensus 102 ~~~l~~H~~~h~~~~~~~C~~----C~k~F~ 128 (434)
+..|..|+...=..++..|+. |+..+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 445555655444444556666 655554
No 102
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=54.17 E-value=7.8 Score=34.50 Aligned_cols=13 Identities=23% Similarity=0.534 Sum_probs=11.1
Q ss_pred ccCCccccccCCc
Q psy8251 312 YDCDVCGAKFTQV 324 (434)
Q Consensus 312 f~C~~Cgk~F~~~ 324 (434)
-.|+.||.+|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 4799999998866
No 103
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=53.22 E-value=6.5 Score=31.86 Aligned_cols=26 Identities=31% Similarity=0.565 Sum_probs=17.1
Q ss_pred CcccCCccccccCCchhHHHHHHhhccCC
Q psy8251 310 RPYDCDVCGAKFTQVGDMRRHRKKHDENG 338 (434)
Q Consensus 310 k~f~C~~Cgk~F~~~~~L~~H~r~H~~~~ 338 (434)
.--.|-+||+.|.. |.+|++.|+|..
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred CeeEEccCCcccch---HHHHHHHccCCC
Confidence 44689999999975 599999998874
No 104
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=53.02 E-value=5 Score=33.53 Aligned_cols=15 Identities=33% Similarity=0.713 Sum_probs=11.1
Q ss_pred eeccccCcccCChhH
Q psy8251 118 YNCTVCGKAFSRKSF 132 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~ 132 (434)
++|+.||++|.+...
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 788888888876544
No 105
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=52.05 E-value=6 Score=26.11 Aligned_cols=11 Identities=45% Similarity=1.084 Sum_probs=5.5
Q ss_pred eeccccccccc
Q psy8251 90 YICDWCGVGFR 100 (434)
Q Consensus 90 ~~C~~C~~~f~ 100 (434)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44555555444
No 106
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=51.28 E-value=14 Score=31.89 Aligned_cols=34 Identities=18% Similarity=0.576 Sum_probs=24.0
Q ss_pred CCCceeccccccccccchHHHchhhhcCCCCceeccccCcccC
Q psy8251 86 GQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFS 128 (434)
Q Consensus 86 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~ 128 (434)
...-|.|+.|+..|+....+. ..|.|+.||....
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 345688888888887766553 2488888887654
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=48.42 E-value=17 Score=30.18 Aligned_cols=39 Identities=13% Similarity=0.353 Sum_probs=22.0
Q ss_pred CCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccc
Q psy8251 58 ESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFR 100 (434)
Q Consensus 58 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 100 (434)
....|.|+.|+..|.....+.. . . ....|.|+.||....
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~-~--d-~~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL-L--D-MDGTFTCPRCGEELE 134 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh-c--C-CCCcEECCCCCCEEE
Confidence 3456778888877764433221 0 1 133378888876653
No 108
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=46.92 E-value=20 Score=22.43 Aligned_cols=22 Identities=14% Similarity=0.036 Sum_probs=10.5
Q ss_pred CCCcccccc--CCHHHHHHHHHHc
Q psy8251 1 MWPRGFRGF--RQVINLLKHRREH 22 (434)
Q Consensus 1 ~C~~C~~~F--~~~~~l~~H~~~h 22 (434)
+|+.||..| .....-..|.+.|
T Consensus 15 ~C~~CgM~Y~~~~~eD~~~H~~yH 38 (41)
T PF13878_consen 15 TCPTCGMLYSPGSPEDEKLHKKYH 38 (41)
T ss_pred CCCCCCCEECCCCHHHHHHHHHHH
Confidence 355555554 2334444454444
No 109
>KOG4167|consensus
Probab=46.36 E-value=4.4 Score=41.74 Aligned_cols=28 Identities=25% Similarity=0.579 Sum_probs=25.3
Q ss_pred CcccCCccccccCCchhHHHHHHhhccC
Q psy8251 310 RPYDCDVCGAKFTQVGDMRRHRKKHDEN 337 (434)
Q Consensus 310 k~f~C~~Cgk~F~~~~~L~~H~r~H~~~ 337 (434)
.-|.|..|||.|....++..||++|.-.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~q 818 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQQ 818 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHHH
Confidence 5699999999999999999999999643
No 110
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=45.88 E-value=11 Score=23.72 Aligned_cols=12 Identities=42% Similarity=1.027 Sum_probs=6.4
Q ss_pred eecccccccccc
Q psy8251 90 YICDWCGVGFRS 101 (434)
Q Consensus 90 ~~C~~C~~~f~~ 101 (434)
|.|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (42)
T PF09723_consen 6 YRCEECGHEFEV 17 (42)
T ss_pred EEeCCCCCEEEE
Confidence 555555555543
No 111
>KOG1701|consensus
Probab=45.12 E-value=13 Score=35.92 Aligned_cols=15 Identities=20% Similarity=0.375 Sum_probs=11.7
Q ss_pred hhceecccCCcccCC
Q psy8251 168 QECLFAKNGGKAFSR 182 (434)
Q Consensus 168 ~~C~~C~~C~~~F~~ 182 (434)
..||+|+.|+.....
T Consensus 425 v~CY~CEDCg~~LS~ 439 (468)
T KOG1701|consen 425 VNCYKCEDCGLLLSS 439 (468)
T ss_pred ccceehhhcCccccc
Confidence 468888999987764
No 112
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=44.46 E-value=11 Score=24.24 Aligned_cols=12 Identities=25% Similarity=0.694 Sum_probs=8.0
Q ss_pred cccCCccccccC
Q psy8251 311 PYDCDVCGAKFT 322 (434)
Q Consensus 311 ~f~C~~Cgk~F~ 322 (434)
...|+.||..+.
T Consensus 21 ~~~Cp~CG~~~~ 32 (46)
T PRK00398 21 GVRCPYCGYRIL 32 (46)
T ss_pred ceECCCCCCeEE
Confidence 567777776543
No 113
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=42.62 E-value=10 Score=20.67 Aligned_cols=10 Identities=40% Similarity=1.145 Sum_probs=8.4
Q ss_pred CcccCCcccc
Q psy8251 310 RPYDCDVCGA 319 (434)
Q Consensus 310 k~f~C~~Cgk 319 (434)
-+|.|+.||+
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 4799999985
No 114
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=42.32 E-value=17 Score=30.65 Aligned_cols=34 Identities=15% Similarity=0.333 Sum_probs=25.6
Q ss_pred CCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccc
Q psy8251 58 ESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFR 100 (434)
Q Consensus 58 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 100 (434)
+..-|.|+.|+..|+....+. ..|.|+.||....
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~ 139 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAMLD 139 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence 456789999998888776664 2589999987653
No 115
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=40.04 E-value=15 Score=29.78 Aligned_cols=19 Identities=42% Similarity=0.828 Sum_probs=7.2
Q ss_pred ccchhhhcCCHHHHHHHHHhhc
Q psy8251 31 CHICKETFYKHSELVRHLEREH 52 (434)
Q Consensus 31 C~~C~~~f~~~~~L~~H~~~~H 52 (434)
|-+||+.|.. |.+|++.+|
T Consensus 75 clecGk~~k~---LkrHL~~~~ 93 (132)
T PF05443_consen 75 CLECGKKFKT---LKRHLRTHH 93 (132)
T ss_dssp -TBT--EESB---HHHHHHHTT
T ss_pred EccCCcccch---HHHHHHHcc
Confidence 4445554444 244554333
No 116
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=40.02 E-value=15 Score=31.59 Aligned_cols=34 Identities=18% Similarity=0.435 Sum_probs=25.6
Q ss_pred CCCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccc
Q psy8251 58 ESSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFR 100 (434)
Q Consensus 58 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 100 (434)
...-|.|+.|+..|+....+. ..|.|+.||....
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 345799999999888766542 2599999997654
No 117
>PF15269 zf-C2H2_7: Zinc-finger
Probab=39.57 E-value=15 Score=23.14 Aligned_cols=23 Identities=22% Similarity=0.388 Sum_probs=20.8
Q ss_pred cccCCccccccCCchhHHHHHHh
Q psy8251 311 PYDCDVCGAKFTQVGDMRRHRKK 333 (434)
Q Consensus 311 ~f~C~~Cgk~F~~~~~L~~H~r~ 333 (434)
.|+|-+|..+...++.|..||+-
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 47899999999999999999975
No 118
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=39.54 E-value=9 Score=20.61 Aligned_cols=7 Identities=29% Similarity=0.857 Sum_probs=4.7
Q ss_pred ccchhhh
Q psy8251 31 CHICKET 37 (434)
Q Consensus 31 C~~C~~~ 37 (434)
|+.||..
T Consensus 16 C~~CG~~ 22 (23)
T PF13240_consen 16 CPNCGTP 22 (23)
T ss_pred hhhhCCc
Confidence 7777754
No 119
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=37.96 E-value=13 Score=26.67 Aligned_cols=41 Identities=17% Similarity=0.392 Sum_probs=22.1
Q ss_pred eeccccccccccchHHHchhhhcCCCCceecc--ccCcccCChhH
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTGQRKYNCT--VCGKAFSRKSF 132 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~k~F~~~~~ 132 (434)
+.|+.||..-....+-..... ..+.-+.|. .||.+|.....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEE
Confidence 457777665533333222211 334557776 78888875443
No 120
>KOG2593|consensus
Probab=37.76 E-value=32 Score=33.58 Aligned_cols=39 Identities=23% Similarity=0.551 Sum_probs=25.1
Q ss_pred cCCCceeccccccccccchHHHchhhhcCCCCceeccccCcc
Q psy8251 85 TGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKA 126 (434)
Q Consensus 85 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~ 126 (434)
+...-|.|+.|.+.|.+...++ ..-.....|.|..|+-.
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGGE 162 (436)
T ss_pred cccccccCCccccchhhhHHHH---hhcccCceEEEecCCCc
Confidence 3445688888888888766554 22223345888888754
No 121
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=37.69 E-value=18 Score=26.71 Aligned_cols=15 Identities=40% Similarity=0.906 Sum_probs=10.8
Q ss_pred CCcccCCccccccCC
Q psy8251 309 DRPYDCDVCGAKFTQ 323 (434)
Q Consensus 309 ~k~f~C~~Cgk~F~~ 323 (434)
..-..|..||.+|.-
T Consensus 51 ~GIW~C~kCg~~fAG 65 (89)
T COG1997 51 TGIWKCRKCGAKFAG 65 (89)
T ss_pred cCeEEcCCCCCeecc
Confidence 356788888888764
No 122
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=37.57 E-value=20 Score=21.11 Aligned_cols=9 Identities=22% Similarity=0.704 Sum_probs=3.9
Q ss_pred ecCcCCccc
Q psy8251 63 CCRFCNKRM 71 (434)
Q Consensus 63 ~C~~C~~~f 71 (434)
.|..||..+
T Consensus 2 ~C~~Cg~~~ 10 (32)
T PF03604_consen 2 ICGECGAEV 10 (32)
T ss_dssp BESSSSSSE
T ss_pred CCCcCCCee
Confidence 344444443
No 123
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=36.85 E-value=17 Score=21.06 Aligned_cols=9 Identities=33% Similarity=0.892 Sum_probs=4.8
Q ss_pred ceeccccCc
Q psy8251 117 KYNCTVCGK 125 (434)
Q Consensus 117 ~~~C~~C~k 125 (434)
.|.|+.|+.
T Consensus 19 ~~vCp~C~~ 27 (30)
T PF08274_consen 19 LLVCPECGH 27 (30)
T ss_dssp SEEETTTTE
T ss_pred EEeCCcccc
Confidence 355555553
No 124
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=36.59 E-value=31 Score=27.77 Aligned_cols=13 Identities=15% Similarity=0.133 Sum_probs=8.3
Q ss_pred ecccCCcccCChh
Q psy8251 172 FAKNGGKAFSRKS 184 (434)
Q Consensus 172 ~C~~C~~~F~~~~ 184 (434)
.||.|.-+|.+.+
T Consensus 123 vCPvCkTSFKss~ 135 (140)
T PF05290_consen 123 VCPVCKTSFKSSS 135 (140)
T ss_pred CCCcccccccccc
Confidence 4455588887654
No 125
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=35.41 E-value=23 Score=27.27 Aligned_cols=11 Identities=36% Similarity=0.881 Sum_probs=6.5
Q ss_pred eeccccCcccC
Q psy8251 118 YNCTVCGKAFS 128 (434)
Q Consensus 118 ~~C~~C~k~F~ 128 (434)
+.|+.|+..+.
T Consensus 17 ~~C~~C~~~~~ 27 (104)
T TIGR01384 17 YVCPSCGYEKE 27 (104)
T ss_pred EECcCCCCccc
Confidence 56666665544
No 126
>PF14353 CpXC: CpXC protein
Probab=35.23 E-value=4.3 Score=32.77 Aligned_cols=21 Identities=24% Similarity=0.577 Sum_probs=14.0
Q ss_pred ceeccccCcccCChhHHHhhh
Q psy8251 117 KYNCTVCGKAFSRKSFVTTHM 137 (434)
Q Consensus 117 ~~~C~~C~k~F~~~~~L~~H~ 137 (434)
.|.|+.||..|.-...+.-|-
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred EEECCCCCCceecCCCEEEEc
Confidence 477888888877665555443
No 127
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=35.11 E-value=7 Score=40.54 Aligned_cols=56 Identities=18% Similarity=0.296 Sum_probs=28.7
Q ss_pred CCccccccCCHHHHHHHHHHcCCCCCCCCccchhhhcCCHHHHHHHHHhhcCCCCCCCCCeecCcCCc
Q psy8251 2 WPRGFRGFRQVINLLKHRREHGGNLSFDNCHICKETFYKHSELVRHLEREHNIPRSESSPWCCRFCNK 69 (434)
Q Consensus 2 C~~C~~~F~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~C~~C~~ 69 (434)
|..||-+|+-..+|--=.. .+.-..|..|+.|.+.|.+..+-+-|. .|..|+.||-
T Consensus 126 CT~CGPRfTIi~alPYDR~-nTsM~~F~lC~~C~~EY~dP~nRRfHA-----------Qp~aCp~CGP 181 (750)
T COG0068 126 CTNCGPRFTIIEALPYDRE-NTSMADFPLCPFCDKEYKDPLNRRFHA-----------QPIACPKCGP 181 (750)
T ss_pred cCCCCcceeeeccCCCCcc-cCccccCcCCHHHHHHhcCcccccccc-----------ccccCcccCC
Confidence 6666666655444321111 111122333777777777766544332 2566777774
No 128
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=33.40 E-value=26 Score=22.90 Aligned_cols=11 Identities=18% Similarity=0.760 Sum_probs=6.7
Q ss_pred CeecCcCCccc
Q psy8251 61 PWCCRFCNKRM 71 (434)
Q Consensus 61 ~~~C~~C~~~f 71 (434)
.|.|..||+.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 46666666655
No 129
>KOG2593|consensus
Probab=33.29 E-value=46 Score=32.55 Aligned_cols=14 Identities=21% Similarity=0.712 Sum_probs=7.1
Q ss_pred CCeecCcCCcccCC
Q psy8251 60 SPWCCRFCNKRMMT 73 (434)
Q Consensus 60 ~~~~C~~C~~~f~~ 73 (434)
..|.|+.|.+.|..
T Consensus 127 ~~Y~Cp~C~kkyt~ 140 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTS 140 (436)
T ss_pred ccccCCccccchhh
Confidence 34555555555543
No 130
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=33.27 E-value=20 Score=22.11 Aligned_cols=11 Identities=36% Similarity=0.932 Sum_probs=9.1
Q ss_pred cccCCcccccc
Q psy8251 311 PYDCDVCGAKF 321 (434)
Q Consensus 311 ~f~C~~Cgk~F 321 (434)
-|.|..||..|
T Consensus 28 fy~C~~C~~~w 38 (39)
T PF01096_consen 28 FYVCCNCGHRW 38 (39)
T ss_dssp EEEESSSTEEE
T ss_pred EEEeCCCCCee
Confidence 38999999876
No 131
>KOG1280|consensus
Probab=33.15 E-value=46 Score=31.44 Aligned_cols=92 Identities=11% Similarity=0.071 Sum_probs=52.6
Q ss_pred cccccCCCCCCCCCcccccCCCCCCCcchhhhhccccchhhhhcccc---------------cccCCCCCcccCCccccc
Q psy8251 256 SCISQGKGSKEKKDDEENEYGKPEDFIPDVFQAEFSKENDKLLSYLE---------------FCDHHCDRPYDCDVCGAK 320 (434)
Q Consensus 256 ~C~~c~~~~~~~~~~~c~~c~~~~~~~~~~f~~~~~~~~~~~~~~l~---------------~~~H~~~k~f~C~~Cgk~ 320 (434)
.|+-|++..-.-..|+|-.|... +...+.|.+.+....+....-+. ...|-....|.|+.||+.
T Consensus 10 ~CdgC~k~~~t~rrYkCL~C~Dy-DlC~sCyen~~tt~~H~~dHPmqcil~~~dfeL~f~Ge~i~~y~~qSftCPyC~~~ 88 (381)
T KOG1280|consen 10 SCDGCGKTAFTFRRYKCLRCSDY-DLCFSCYENGATTPIHDEDHPMQCILSRVDFELYFGGEPISHYDPQSFTCPYCGIM 88 (381)
T ss_pred eeccccccceeeeeeEeeeecch-hHHHHHhhcCCCCcccCCCCceeEEeeccceeeEecCccccccccccccCCccccc
Confidence 78888887666667777777442 22222233332222221111111 112334457999999999
Q ss_pred cCCchhHHHHHHhhccCCCCCCCCCCCCCCCC
Q psy8251 321 FTQVGDMRRHRKKHDENGGSAKDKPGAYNKTD 352 (434)
Q Consensus 321 F~~~~~L~~H~r~H~~~~~~~~~~~~~c~~~~ 352 (434)
=.+...|..|+..-+.+.. -.-.|+.|+
T Consensus 89 Gfte~~f~~Hv~s~Hpda~----~~~icp~c~ 116 (381)
T KOG1280|consen 89 GFTERQFGTHVLSQHPEAS----TSVICPLCA 116 (381)
T ss_pred ccchhHHHHHhhhcCcccC----cceeeeccc
Confidence 9999999999765444432 222566555
No 132
>PF12907 zf-met2: Zinc-binding
Probab=32.34 E-value=39 Score=21.06 Aligned_cols=22 Identities=41% Similarity=0.931 Sum_probs=15.5
Q ss_pred ccchhhhc---CCHHHHHHHHHhhc
Q psy8251 31 CHICKETF---YKHSELVRHLEREH 52 (434)
Q Consensus 31 C~~C~~~f---~~~~~L~~H~~~~H 52 (434)
|.+|-.+| .+...|..|....|
T Consensus 4 C~iC~qtF~~t~~~~~L~eH~enKH 28 (40)
T PF12907_consen 4 CKICRQTFMQTTNEPQLKEHAENKH 28 (40)
T ss_pred cHHhhHHHHhcCCHHHHHHHHHccC
Confidence 88887544 55677888877666
No 133
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=31.75 E-value=27 Score=28.11 Aligned_cols=20 Identities=25% Similarity=0.270 Sum_probs=10.2
Q ss_pred CCccccccCCHHHHHHHHHHcCC
Q psy8251 2 WPRGFRGFRQVINLLKHRREHGG 24 (434)
Q Consensus 2 C~~C~~~F~~~~~l~~H~~~h~~ 24 (434)
|-++|+.| .+|++|+.+|.+
T Consensus 79 cLEDGkkf---KSLKRHL~t~~g 98 (148)
T COG4957 79 CLEDGKKF---KSLKRHLTTHYG 98 (148)
T ss_pred EeccCcch---HHHHHHHhcccC
Confidence 44555555 345555555543
No 134
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=30.84 E-value=19 Score=22.36 Aligned_cols=12 Identities=42% Similarity=1.002 Sum_probs=8.5
Q ss_pred ceeccccCcccC
Q psy8251 117 KYNCTVCGKAFS 128 (434)
Q Consensus 117 ~~~C~~C~k~F~ 128 (434)
-|.|..|+..++
T Consensus 28 fy~C~~C~~~w~ 39 (40)
T smart00440 28 FYVCTKCGHRWR 39 (40)
T ss_pred EEEeCCCCCEeC
Confidence 477888877654
No 135
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=30.72 E-value=33 Score=31.77 Aligned_cols=48 Identities=23% Similarity=0.412 Sum_probs=30.6
Q ss_pred eccccccccccchHHHchhhhcCCCCceeccccCcccCChhHHHhhhccc
Q psy8251 91 ICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKSFVTTHMRVH 140 (434)
Q Consensus 91 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~h 140 (434)
.|-.|...|.-...-..-. -+....|.|+.|...|-..-..-.|...|
T Consensus 364 ~Cf~CQ~~fp~~~~~~~~~--~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 364 HCFVCQGPFPKPPVSPFDE--STSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceeccCCCCCCCCCcccc--cccccceechhhhhhhhhhhHHHHHHHHh
Confidence 4777777776443211111 11234599999999999888888887765
No 136
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=30.46 E-value=79 Score=29.37 Aligned_cols=28 Identities=25% Similarity=0.517 Sum_probs=24.4
Q ss_pred CCCcccCCccccccCCchhHHHHHHhhc
Q psy8251 308 CDRPYDCDVCGAKFTQVGDMRRHRKKHD 335 (434)
Q Consensus 308 ~~k~f~C~~Cgk~F~~~~~L~~H~r~H~ 335 (434)
....|.|+.|...|-..-+.-.|...|.
T Consensus 385 ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 385 SSGRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred cccceechhhhhhhhhhhHHHHHHHHhh
Confidence 3467999999999999999999998884
No 137
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.97 E-value=23 Score=27.31 Aligned_cols=12 Identities=17% Similarity=0.290 Sum_probs=7.5
Q ss_pred CeecCcCCcccC
Q psy8251 61 PWCCRFCNKRMM 72 (434)
Q Consensus 61 ~~~C~~C~~~f~ 72 (434)
|..|+.||++|+
T Consensus 26 PiVsPytG~s~P 37 (129)
T COG4530 26 PIVSPYTGKSYP 37 (129)
T ss_pred ccccCcccccch
Confidence 556666666663
No 138
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=29.87 E-value=67 Score=20.49 Aligned_cols=8 Identities=38% Similarity=1.373 Sum_probs=4.0
Q ss_pred ceeccccC
Q psy8251 117 KYNCTVCG 124 (434)
Q Consensus 117 ~~~C~~C~ 124 (434)
.|+|..|+
T Consensus 37 ~~~C~~C~ 44 (46)
T PF12760_consen 37 RYRCKACR 44 (46)
T ss_pred eEECCCCC
Confidence 35555554
No 139
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=29.84 E-value=21 Score=21.60 Aligned_cols=10 Identities=40% Similarity=0.953 Sum_probs=5.0
Q ss_pred eccccccccc
Q psy8251 91 ICDWCGVGFR 100 (434)
Q Consensus 91 ~C~~C~~~f~ 100 (434)
.|+.||..|.
T Consensus 3 ~C~~Cg~~Yh 12 (36)
T PF05191_consen 3 ICPKCGRIYH 12 (36)
T ss_dssp EETTTTEEEE
T ss_pred CcCCCCCccc
Confidence 4555555543
No 140
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=29.36 E-value=17 Score=32.23 Aligned_cols=32 Identities=19% Similarity=0.437 Sum_probs=24.3
Q ss_pred CCCCCcccCCccccccCCchhHHHHHHhhccC
Q psy8251 306 HHCDRPYDCDVCGAKFTQVGDMRRHRKKHDEN 337 (434)
Q Consensus 306 H~~~k~f~C~~Cgk~F~~~~~L~~H~r~H~~~ 337 (434)
-..+..|.|..|+|.|.-..-.++|+..-|.+
T Consensus 72 e~~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 72 EEDEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSSSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred HHcCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 34556799999999999999999999875554
No 141
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.58 E-value=36 Score=21.39 Aligned_cols=15 Identities=20% Similarity=0.071 Sum_probs=9.1
Q ss_pred CCCccccccCCHHHH
Q psy8251 1 MWPRGFRGFRQVINL 15 (434)
Q Consensus 1 ~C~~C~~~F~~~~~l 15 (434)
+|+.||+.|+...-.
T Consensus 10 ~C~~C~rpf~WRKKW 24 (42)
T PF10013_consen 10 ICPVCGRPFTWRKKW 24 (42)
T ss_pred cCcccCCcchHHHHH
Confidence 366777777665443
No 142
>KOG2807|consensus
Probab=28.54 E-value=60 Score=30.46 Aligned_cols=33 Identities=12% Similarity=-0.061 Sum_probs=26.5
Q ss_pred cccccCCCCCCCCCcccccCCCCCCCcchhhhh
Q psy8251 256 SCISQGKGSKEKKDDEENEYGKPEDFIPDVFQA 288 (434)
Q Consensus 256 ~C~~c~~~~~~~~~~~c~~c~~~~~~~~~~f~~ 288 (434)
.|-.|+..-.....|.|..|...|-..+|+|..
T Consensus 332 ~Cf~C~~~~~~~~~y~C~~Ck~~FCldCDv~iH 364 (378)
T KOG2807|consen 332 FCFACQGELLSSGRYRCESCKNVFCLDCDVFIH 364 (378)
T ss_pred ceeeeccccCCCCcEEchhccceeeccchHHHH
Confidence 477776666677789999999998888888876
No 143
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=28.38 E-value=37 Score=28.75 Aligned_cols=24 Identities=17% Similarity=0.210 Sum_probs=21.9
Q ss_pred ccccccCCCCCCCCCcccccCCCC
Q psy8251 255 LSCISQGKGSKEKKDDEENEYGKP 278 (434)
Q Consensus 255 ~~C~~c~~~~~~~~~~~c~~c~~~ 278 (434)
+.|..||.....+.|..|++||.+
T Consensus 135 ~vC~vCGy~~~ge~P~~CPiCga~ 158 (166)
T COG1592 135 WVCPVCGYTHEGEAPEVCPICGAP 158 (166)
T ss_pred EEcCCCCCcccCCCCCcCCCCCCh
Confidence 399999999999999999999965
No 144
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=27.72 E-value=26 Score=21.64 Aligned_cols=14 Identities=29% Similarity=0.774 Sum_probs=12.2
Q ss_pred cccCCccccccCCc
Q psy8251 311 PYDCDVCGAKFTQV 324 (434)
Q Consensus 311 ~f~C~~Cgk~F~~~ 324 (434)
||.|+.|++.|-..
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 89999999999654
No 145
>KOG2636|consensus
Probab=27.57 E-value=36 Score=33.30 Aligned_cols=30 Identities=27% Similarity=0.613 Sum_probs=26.0
Q ss_pred cccCCCCCcccCCccc-cccCCchhHHHHHH
Q psy8251 303 FCDHHCDRPYDCDVCG-AKFTQVGDMRRHRK 332 (434)
Q Consensus 303 ~~~H~~~k~f~C~~Cg-k~F~~~~~L~~H~r 332 (434)
.+.|.-...|.|.+|| ++|.-...+.+|-.
T Consensus 393 yKLHGL~~ey~CEICGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 393 YKLHGLDIEYNCEICGNYVYKGRKAFDRHFN 423 (497)
T ss_pred HhhcCCCcccceeeccCccccCcHHHHHHhH
Confidence 3458888899999999 99999999999954
No 146
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=26.92 E-value=20 Score=23.71 Aligned_cols=21 Identities=14% Similarity=-0.121 Sum_probs=12.3
Q ss_pred CCcccccCCCCCCCcchhhhh
Q psy8251 268 KDDEENEYGKPEDFIPDVFQA 288 (434)
Q Consensus 268 ~~~~c~~c~~~~~~~~~~f~~ 288 (434)
..|.|+.|+..|=+.+|+|..
T Consensus 20 ~~y~C~~C~~~FC~dCD~fiH 40 (51)
T PF07975_consen 20 SRYRCPKCKNHFCIDCDVFIH 40 (51)
T ss_dssp EEE--TTTT--B-HHHHHTTT
T ss_pred CeEECCCCCCccccCcChhhh
Confidence 467888888887777888766
No 147
>KOG4167|consensus
Probab=26.72 E-value=17 Score=37.74 Aligned_cols=22 Identities=14% Similarity=0.066 Sum_probs=17.4
Q ss_pred CCccccccCCHHHHHHHHHHcC
Q psy8251 2 WPRGFRGFRQVINLLKHRREHG 23 (434)
Q Consensus 2 C~~C~~~F~~~~~l~~H~~~h~ 23 (434)
|.+|+++|....++..||++|.
T Consensus 795 CreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 795 CRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred hHHHHHHHHHHhhhhHHHHHHH
Confidence 7778888888888888888774
No 148
>KOG3408|consensus
Probab=26.66 E-value=44 Score=26.40 Aligned_cols=25 Identities=28% Similarity=0.542 Sum_probs=20.2
Q ss_pred CCCCCCCccchhhhcCCHHHHHHHHH
Q psy8251 24 GNLSFDNCHICKETFYKHSELVRHLE 49 (434)
Q Consensus 24 ~~~~~~~C~~C~~~f~~~~~L~~H~~ 49 (434)
|...|. |-.|.+-|.+...|+.|.+
T Consensus 54 G~Gqfy-Ci~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 54 GGGQFY-CIECARYFIDAKALKTHFK 78 (129)
T ss_pred CCceee-hhhhhhhhcchHHHHHHHh
Confidence 344566 9999999999999998876
No 149
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=26.45 E-value=82 Score=28.35 Aligned_cols=60 Identities=15% Similarity=0.395 Sum_probs=36.6
Q ss_pred CCCeecCcCCcccCCHHHHHHHHHhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCcccCChh
Q psy8251 59 SSPWCCRFCNKRMMTKLSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFSRKS 131 (434)
Q Consensus 59 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~~~~ 131 (434)
...|.|..|....=. +.-.....-.|..|.+.|.-... ..--|...|.|+.|+..|+...
T Consensus 110 drqFaC~~Cd~~WwR--------rvp~rKeVSRCr~C~~rYDPVP~-----dkmwG~aef~C~~C~h~F~G~~ 169 (278)
T PF15135_consen 110 DRQFACSSCDHMWWR--------RVPQRKEVSRCRKCRKRYDPVPC-----DKMWGIAEFHCPKCRHNFRGFA 169 (278)
T ss_pred ceeeeccccchHHHh--------ccCcccccccccccccccCCCcc-----ccccceeeeecccccccchhhh
Confidence 467889988654211 22223334578888888765431 1112344599999999998553
No 150
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=26.07 E-value=54 Score=21.96 Aligned_cols=24 Identities=29% Similarity=0.249 Sum_probs=20.4
Q ss_pred ccccccCCCCC-CCCCcccccCCCC
Q psy8251 255 LSCISQGKGSK-EKKDDEENEYGKP 278 (434)
Q Consensus 255 ~~C~~c~~~~~-~~~~~~c~~c~~~ 278 (434)
..|..|++.|. ......|+.|+.+
T Consensus 6 ~~C~~Cg~~~~~~dDiVvCp~Cgap 30 (54)
T PF14446_consen 6 CKCPVCGKKFKDGDDIVVCPECGAP 30 (54)
T ss_pred ccChhhCCcccCCCCEEECCCCCCc
Confidence 38999999997 5677899999987
No 151
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=25.16 E-value=23 Score=25.19 Aligned_cols=59 Identities=22% Similarity=0.324 Sum_probs=26.8
Q ss_pred CeecCcCCcccCCH---HHHHHHHHhccCCCceeccccccccccchHHHchhhhcCCCCceeccccCccc
Q psy8251 61 PWCCRFCNKRMMTK---LSLGIHERIHTGQRPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAF 127 (434)
Q Consensus 61 ~~~C~~C~~~f~~~---~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F 127 (434)
.++++-|++.|.=. ..+..|...........|..|+..+..... .-+ ..|.|+.|+..|
T Consensus 10 ~~~~~cC~~~y~C~~CHde~~~H~~~~~~~~~v~Cg~C~~~~~~~~~-------~c~-~~~~C~~C~~~~ 71 (71)
T PF05495_consen 10 AIRFPCCGKYYPCRFCHDELEDHPFDRWPVKRVICGKCRTEQPIDEY-------SCG-ADYFCPICGLYF 71 (71)
T ss_dssp EEEETTTTEEESSHHHHHHCSSS---TTT--EEEETTT--EEES-SB-------TT---SEEETTTTEEE
T ss_pred EEECCcccCeecHHHHHHHhccCccccccccCeECCCCCCccChhhh-------hcC-CCccCcCcCCCC
Confidence 35556666665543 122233333333345677778776655433 111 457888887654
No 152
>KOG3408|consensus
Probab=24.63 E-value=37 Score=26.80 Aligned_cols=27 Identities=19% Similarity=0.379 Sum_probs=23.9
Q ss_pred CCCCcccCCccccccCCchhHHHHHHh
Q psy8251 307 HCDRPYDCDVCGAKFTQVGDMRRHRKK 333 (434)
Q Consensus 307 ~~~k~f~C~~Cgk~F~~~~~L~~H~r~ 333 (434)
.|...|.|-.|.+-|.+...|..|.++
T Consensus 53 PG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 53 PGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCceeehhhhhhhhcchHHHHHHHhc
Confidence 455779999999999999999999874
No 153
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=24.61 E-value=27 Score=19.88 Aligned_cols=20 Identities=25% Similarity=0.630 Sum_probs=11.1
Q ss_pred eeccccCcccCChhHHHhhhc
Q psy8251 118 YNCTVCGKAFSRKSFVTTHMR 138 (434)
Q Consensus 118 ~~C~~C~k~F~~~~~L~~H~~ 138 (434)
|.|-.|++.| .....+.|..
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht~ 20 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHTS 20 (28)
T ss_dssp EEETTTTEEE-EGGGTTT---
T ss_pred CeeecCCCCc-CcCCcCCCCc
Confidence 4677788888 4455555543
No 154
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=24.27 E-value=53 Score=34.98 Aligned_cols=14 Identities=29% Similarity=0.769 Sum_probs=7.4
Q ss_pred CCCCCcccCCcccc
Q psy8251 306 HHCDRPYDCDVCGA 319 (434)
Q Consensus 306 H~~~k~f~C~~Cgk 319 (434)
|...+...|..||.
T Consensus 457 H~~~~~L~CH~Cg~ 470 (730)
T COG1198 457 HKATGQLRCHYCGY 470 (730)
T ss_pred ecCCCeeEeCCCCC
Confidence 44445555555554
No 155
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=23.96 E-value=63 Score=34.46 Aligned_cols=12 Identities=33% Similarity=0.905 Sum_probs=7.7
Q ss_pred CCcccCCccccc
Q psy8251 309 DRPYDCDVCGAK 320 (434)
Q Consensus 309 ~k~f~C~~Cgk~ 320 (434)
..|..|+.||-.
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 456677777655
No 156
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.22 E-value=38 Score=27.66 Aligned_cols=12 Identities=25% Similarity=0.609 Sum_probs=6.4
Q ss_pred eecccccccccc
Q psy8251 90 YICDWCGVGFRS 101 (434)
Q Consensus 90 ~~C~~C~~~f~~ 101 (434)
+.|..||..|..
T Consensus 71 ~~C~~CG~~~~~ 82 (135)
T PRK03824 71 LKCRNCGNEWSL 82 (135)
T ss_pred EECCCCCCEEec
Confidence 555555555543
No 157
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=22.96 E-value=55 Score=20.52 Aligned_cols=11 Identities=27% Similarity=0.803 Sum_probs=5.7
Q ss_pred ceeccccCccc
Q psy8251 117 KYNCTVCGKAF 127 (434)
Q Consensus 117 ~~~C~~C~k~F 127 (434)
.+.|..||...
T Consensus 19 ~~vC~~CG~Vl 29 (43)
T PF08271_consen 19 ELVCPNCGLVL 29 (43)
T ss_dssp EEEETTT-BBE
T ss_pred eEECCCCCCEe
Confidence 35666666543
No 158
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=22.73 E-value=30 Score=22.62 Aligned_cols=24 Identities=25% Similarity=0.600 Sum_probs=18.7
Q ss_pred CCcccCCccccccCCchhHHHHHH
Q psy8251 309 DRPYDCDVCGAKFTQVGDMRRHRK 332 (434)
Q Consensus 309 ~k~f~C~~Cgk~F~~~~~L~~H~r 332 (434)
++.+.|..||..|.....=+.--.
T Consensus 2 Dk~l~C~dCg~~FvfTa~EQ~fy~ 25 (49)
T PF13451_consen 2 DKTLTCKDCGAEFVFTAGEQKFYA 25 (49)
T ss_pred CeeEEcccCCCeEEEehhHHHHHH
Confidence 578899999999998776665543
No 159
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=22.61 E-value=43 Score=22.43 Aligned_cols=31 Identities=23% Similarity=0.564 Sum_probs=15.6
Q ss_pred eeccccccccccchHHHchhhhcCCCCceeccccCcccC
Q psy8251 90 YICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFS 128 (434)
Q Consensus 90 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~ 128 (434)
+.|+.||..+.-.... .|+ ...|+.||..+.
T Consensus 3 ~~CP~CG~~iev~~~~-------~Ge-iV~Cp~CGaele 33 (54)
T TIGR01206 3 FECPDCGAEIELENPE-------LGE-LVICDECGAELE 33 (54)
T ss_pred cCCCCCCCEEecCCCc-------cCC-EEeCCCCCCEEE
Confidence 4566666655432211 122 356777766553
No 160
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=22.31 E-value=37 Score=30.10 Aligned_cols=31 Identities=29% Similarity=0.639 Sum_probs=23.0
Q ss_pred CCceeccccCcccCChhHHHhhhccccCCcc
Q psy8251 115 QRKYNCTVCGKAFSRKSFVTTHMRVHTGKLT 145 (434)
Q Consensus 115 ~~~~~C~~C~k~F~~~~~L~~H~~~h~~~~~ 145 (434)
+.-|.|+.|+|.|.-..-...|+..-|++..
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~v 105 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEKV 105 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHHH
Confidence 3459999999999999999999987776644
No 161
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=22.29 E-value=45 Score=30.14 Aligned_cols=28 Identities=18% Similarity=0.360 Sum_probs=21.4
Q ss_pred CCCceeccccCcccCChhHHHhhhcccc
Q psy8251 114 GQRKYNCTVCGKAFSRKSFVTTHMRVHT 141 (434)
Q Consensus 114 ~~~~~~C~~C~k~F~~~~~L~~H~~~h~ 141 (434)
..+++.|+.|+.....-..|..=.+.|.
T Consensus 206 k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 206 KGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred cCCCCCCCCCCCcccccccceeeeecch
Confidence 4478999999988887777776666664
No 162
>PRK04023 DNA polymerase II large subunit; Validated
Probab=21.26 E-value=1.5e+02 Score=32.65 Aligned_cols=61 Identities=21% Similarity=0.295 Sum_probs=0.0
Q ss_pred CcccCcccccccCCCCCCCCCcccccCCCCCCCcchhhhhccccchhhhhcccccccCCCCCcccCCccccccCCchhHH
Q psy8251 249 NVSINKLSCISQGKGSKEKKDDEENEYGKPEDFIPDVFQAEFSKENDKLLSYLEFCDHHCDRPYDCDVCGAKFTQVGDMR 328 (434)
Q Consensus 249 ~~~~~~~~C~~c~~~~~~~~~~~c~~c~~~~~~~~~~f~~~~~~~~~~~~~~l~~~~H~~~k~f~C~~Cgk~F~~~~~L~ 328 (434)
........|..||... -.+.|+.||.. .+..+.|+.||...
T Consensus 621 eVEVg~RfCpsCG~~t---~~frCP~CG~~-----------------------------Te~i~fCP~CG~~~------- 661 (1121)
T PRK04023 621 EVEIGRRKCPSCGKET---FYRRCPFCGTH-----------------------------TEPVYRCPRCGIEV------- 661 (1121)
T ss_pred eecccCccCCCCCCcC---CcccCCCCCCC-----------------------------CCcceeCccccCcC-------
Q ss_pred HHHHhhccCCCCCCCCCCCCCCCCCcccceeeEEe
Q psy8251 329 RHRKKHDENGGSAKDKPGAYNKTDRYSKLTYLLTV 363 (434)
Q Consensus 329 ~H~r~H~~~~~~~~~~~~~c~~~~~~~~~~~~~~l 363 (434)
.++.|+.|+....-.+...+
T Consensus 662 ---------------~~y~CPKCG~El~~~s~~~i 681 (1121)
T PRK04023 662 ---------------EEDECEKCGREPTPYSKRKI 681 (1121)
T ss_pred ---------------CCCcCCCCCCCCCccceEEe
No 163
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=21.11 E-value=62 Score=20.03 Aligned_cols=18 Identities=17% Similarity=0.335 Sum_probs=9.6
Q ss_pred ceeccccCcccCChhHHH
Q psy8251 117 KYNCTVCGKAFSRKSFVT 134 (434)
Q Consensus 117 ~~~C~~C~k~F~~~~~L~ 134 (434)
.+.|+.|+-.+-....|.
T Consensus 19 id~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 19 IDVCPSCGGIWFDAGELE 36 (41)
T ss_pred EEECCCCCeEEccHHHHH
Confidence 355666665555554443
No 164
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=20.86 E-value=28 Score=21.95 Aligned_cols=15 Identities=33% Similarity=0.933 Sum_probs=10.0
Q ss_pred CcccCCccccccCCc
Q psy8251 310 RPYDCDVCGAKFTQV 324 (434)
Q Consensus 310 k~f~C~~Cgk~F~~~ 324 (434)
-||.|+.|++.|=..
T Consensus 12 ~~~~C~~C~~~FC~~ 26 (43)
T PF01428_consen 12 LPFKCKHCGKSFCLK 26 (43)
T ss_dssp SHEE-TTTS-EE-TT
T ss_pred CCeECCCCCcccCcc
Confidence 489999999999654
No 165
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=20.19 E-value=35 Score=25.98 Aligned_cols=37 Identities=22% Similarity=0.509 Sum_probs=19.7
Q ss_pred CceeccccccccccchHHHchhhhcCCCCceeccccCcccC
Q psy8251 88 RPYICDWCGVGFRSKANLLQHQPVHTGQRKYNCTVCGKAFS 128 (434)
Q Consensus 88 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~k~F~ 128 (434)
+.|.|+.||..-.+.-.++. -.......|..||.+|.
T Consensus 21 k~FtCp~Cghe~vs~ctvkk----~~~~g~~~Cg~CGls~e 57 (104)
T COG4888 21 KTFTCPRCGHEKVSSCTVKK----TVNIGTAVCGNCGLSFE 57 (104)
T ss_pred ceEecCccCCeeeeEEEEEe----cCceeEEEcccCcceEE
Confidence 44677777655444332221 11222357888887775
Done!