Query psy14799
Match_columns 230
No_of_seqs 300 out of 3115
Neff 10.2
Searched_HMMs 46136
Date Fri Aug 16 21:08:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14799.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14799hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 100.0 7.3E-31 1.6E-35 197.7 3.6 138 33-197 128-265 (279)
2 KOG2462|consensus 100.0 8.8E-30 1.9E-34 191.9 4.8 130 9-144 132-264 (279)
3 KOG1074|consensus 99.9 4.2E-26 9E-31 193.4 4.0 75 67-142 606-687 (958)
4 KOG3608|consensus 99.9 5E-25 1.1E-29 170.7 5.7 187 10-205 182-382 (467)
5 KOG3608|consensus 99.9 4.5E-22 9.8E-27 154.4 6.5 152 10-171 210-368 (467)
6 KOG3623|consensus 99.8 3.9E-20 8.4E-25 155.4 3.9 77 95-196 895-971 (1007)
7 KOG1074|consensus 99.8 4.6E-20 1E-24 157.1 3.0 94 94-205 605-698 (958)
8 KOG3576|consensus 99.7 4E-19 8.8E-24 128.2 2.1 123 65-201 116-238 (267)
9 KOG3576|consensus 99.7 4.5E-18 9.7E-23 122.9 3.4 115 33-150 115-239 (267)
10 KOG3623|consensus 99.5 9E-15 2E-19 123.6 2.8 79 65-144 893-971 (1007)
11 PLN03086 PRLI-interacting fact 99.4 9E-13 2E-17 111.5 5.8 102 36-148 454-565 (567)
12 PHA00733 hypothetical protein 99.2 1.3E-11 2.9E-16 85.9 5.7 82 64-148 38-124 (128)
13 PLN03086 PRLI-interacting fact 99.2 3E-11 6.5E-16 102.4 6.4 143 10-199 410-564 (567)
14 PHA00733 hypothetical protein 99.2 5.2E-11 1.1E-15 82.9 5.7 97 21-119 26-124 (128)
15 KOG3993|consensus 99.0 1E-10 2.3E-15 93.7 1.8 145 2-150 265-485 (500)
16 PHA02768 hypothetical protein; 98.8 2.1E-09 4.5E-14 62.1 2.0 39 67-107 6-44 (55)
17 KOG3993|consensus 98.8 1.1E-09 2.3E-14 88.1 1.0 162 36-200 268-483 (500)
18 PHA02768 hypothetical protein; 98.8 2E-09 4.2E-14 62.2 1.7 42 95-139 6-47 (55)
19 PF13465 zf-H2C2_2: Zinc-finge 98.7 1.7E-08 3.7E-13 49.8 2.2 26 22-47 1-26 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.6 3.1E-08 6.7E-13 48.9 2.6 24 82-105 2-25 (26)
21 PF05605 zf-Di19: Drought indu 98.4 5.4E-07 1.2E-11 53.0 4.6 51 94-147 2-53 (54)
22 PHA00616 hypothetical protein 98.4 1.8E-07 3.9E-12 51.5 2.0 33 66-98 1-33 (44)
23 PHA00732 hypothetical protein 98.3 6E-07 1.3E-11 56.9 2.5 21 36-56 2-22 (79)
24 PHA00732 hypothetical protein 98.2 7.5E-07 1.6E-11 56.4 2.7 37 95-134 2-38 (79)
25 PF05605 zf-Di19: Drought indu 98.2 3.2E-06 6.9E-11 49.7 4.1 37 36-74 3-39 (54)
26 PHA00616 hypothetical protein 98.2 8.2E-07 1.8E-11 48.9 1.3 27 13-39 7-33 (44)
27 PF00096 zf-C2H2: Zinc finger, 97.9 6.3E-06 1.4E-10 39.3 1.8 22 124-145 1-22 (23)
28 PF00096 zf-C2H2: Zinc finger, 97.8 1.5E-05 3.2E-10 38.0 2.2 16 98-113 4-19 (23)
29 PF13894 zf-C2H2_4: C2H2-type 97.8 1.8E-05 3.9E-10 37.9 2.4 24 124-147 1-24 (24)
30 PF12756 zf-C2H2_2: C2H2 type 97.8 2E-05 4.4E-10 52.5 2.8 72 69-146 2-73 (100)
31 PF12756 zf-C2H2_2: C2H2 type 97.7 4.4E-05 9.6E-10 50.8 3.2 70 38-115 2-71 (100)
32 COG5189 SFP1 Putative transcri 97.6 1.4E-05 3.1E-10 62.5 0.1 52 92-143 347-418 (423)
33 PF13894 zf-C2H2_4: C2H2-type 97.6 7E-05 1.5E-09 35.8 2.3 14 72-85 6-19 (24)
34 PF13912 zf-C2H2_6: C2H2-type 97.5 5.3E-05 1.1E-09 37.6 1.7 23 36-58 2-24 (27)
35 COG5189 SFP1 Putative transcri 97.5 5.3E-05 1.2E-09 59.4 1.5 72 120-196 346-419 (423)
36 PF13912 zf-C2H2_6: C2H2-type 97.4 8.7E-05 1.9E-09 36.8 1.6 20 123-142 1-20 (27)
37 COG5236 Uncharacterized conser 97.3 0.00053 1.2E-08 54.6 5.8 138 2-148 149-306 (493)
38 KOG2231|consensus 97.3 0.00034 7.4E-09 61.0 4.8 92 46-147 125-236 (669)
39 COG5236 Uncharacterized conser 97.1 0.00019 4.1E-09 57.0 0.8 135 35-201 151-307 (493)
40 PF09237 GAGA: GAGA factor; I 96.8 0.0015 3.2E-08 36.9 2.5 29 65-93 23-51 (54)
41 PF13909 zf-H2C2_5: C2H2-type 96.7 0.0016 3.4E-08 31.1 2.0 23 124-147 1-23 (24)
42 smart00355 ZnF_C2H2 zinc finge 96.7 0.0019 4.1E-08 31.1 2.4 21 124-144 1-21 (26)
43 PF09237 GAGA: GAGA factor; I 96.7 0.0014 3.1E-08 37.0 2.0 31 117-147 18-48 (54)
44 KOG2231|consensus 96.6 0.0064 1.4E-07 53.4 6.3 115 17-142 124-258 (669)
45 PF12874 zf-met: Zinc-finger o 96.6 0.0017 3.6E-08 31.3 1.7 21 124-144 1-21 (25)
46 KOG2482|consensus 96.5 0.01 2.2E-07 47.4 6.6 24 95-118 280-303 (423)
47 KOG4173|consensus 96.4 0.0012 2.7E-08 48.7 1.0 84 65-151 78-174 (253)
48 KOG4173|consensus 96.3 0.0027 5.9E-08 46.9 2.3 84 35-124 79-176 (253)
49 PF12874 zf-met: Zinc-finger o 96.2 0.0038 8.3E-08 30.0 1.9 21 95-115 1-21 (25)
50 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0041 8.8E-08 29.7 1.9 7 108-114 13-19 (24)
51 KOG1146|consensus 96.2 0.0041 8.9E-08 57.9 3.3 52 124-198 590-641 (1406)
52 KOG1146|consensus 96.0 0.0039 8.4E-08 58.0 2.2 127 72-199 442-613 (1406)
53 PF12171 zf-C2H2_jaz: Zinc-fin 95.8 0.0052 1.1E-07 30.3 1.3 21 36-56 2-22 (27)
54 PF12171 zf-C2H2_jaz: Zinc-fin 95.8 0.0025 5.3E-08 31.5 0.1 20 124-143 2-21 (27)
55 PRK04860 hypothetical protein; 95.7 0.0067 1.4E-07 44.0 2.0 38 94-136 119-156 (160)
56 KOG2482|consensus 95.7 0.037 8E-07 44.3 6.1 73 124-198 280-357 (423)
57 PRK04860 hypothetical protein; 95.7 0.0051 1.1E-07 44.6 1.2 36 66-105 119-154 (160)
58 PF13913 zf-C2HC_2: zinc-finge 94.6 0.033 7.2E-07 26.8 2.0 18 124-142 3-20 (25)
59 PF13913 zf-C2HC_2: zinc-finge 94.5 0.039 8.5E-07 26.6 2.0 19 68-87 4-22 (25)
60 KOG2785|consensus 94.4 0.17 3.7E-06 41.4 6.6 54 94-147 166-244 (390)
61 smart00451 ZnF_U1 U1-like zinc 94.3 0.037 7.9E-07 28.9 1.8 23 123-145 3-25 (35)
62 KOG2785|consensus 94.2 0.19 4.1E-06 41.1 6.5 51 66-116 166-242 (390)
63 KOG2893|consensus 93.7 0.018 4E-07 43.7 -0.1 49 96-149 12-60 (341)
64 smart00451 ZnF_U1 U1-like zinc 93.3 0.079 1.7E-06 27.6 2.1 22 94-115 3-24 (35)
65 PF12013 DUF3505: Protein of u 92.8 0.26 5.6E-06 33.3 4.5 24 124-147 81-108 (109)
66 COG5048 FOG: Zn-finger [Genera 92.7 0.033 7.1E-07 46.9 -0.0 62 66-128 289-356 (467)
67 COG4049 Uncharacterized protei 92.0 0.088 1.9E-06 30.3 1.2 29 119-147 13-41 (65)
68 KOG2893|consensus 91.5 0.036 7.7E-07 42.2 -1.0 43 9-55 12-54 (341)
69 cd00350 rubredoxin_like Rubred 91.0 0.11 2.4E-06 26.8 0.9 8 123-130 17-24 (33)
70 COG5048 FOG: Zn-finger [Genera 90.9 0.019 4.2E-07 48.3 -3.3 49 13-61 295-349 (467)
71 COG4049 Uncharacterized protei 90.7 0.14 2.9E-06 29.6 1.1 28 91-118 14-41 (65)
72 PF12013 DUF3505: Protein of u 85.4 2 4.3E-05 29.0 4.4 24 95-118 81-108 (109)
73 PF13717 zinc_ribbon_4: zinc-r 85.3 0.72 1.6E-05 24.3 1.7 33 95-133 3-35 (36)
74 PF13719 zinc_ribbon_5: zinc-r 85.1 0.68 1.5E-05 24.6 1.6 32 96-133 4-35 (37)
75 TIGR00622 ssl1 transcription f 84.9 1.3 2.8E-05 30.0 3.1 24 123-147 81-104 (112)
76 COG2888 Predicted Zn-ribbon RN 84.7 0.59 1.3E-05 27.5 1.3 10 93-102 49-58 (61)
77 PF09986 DUF2225: Uncharacteri 84.7 0.15 3.2E-06 39.1 -1.6 13 124-136 49-61 (214)
78 KOG2186|consensus 84.6 0.67 1.4E-05 35.8 1.9 45 36-86 4-48 (276)
79 PF12907 zf-met2: Zinc-binding 84.3 0.98 2.1E-05 24.4 1.9 32 124-155 2-36 (40)
80 PF02892 zf-BED: BED zinc fing 84.2 0.99 2.1E-05 24.9 2.1 24 124-147 17-44 (45)
81 PF09986 DUF2225: Uncharacteri 82.6 0.3 6.5E-06 37.4 -0.7 22 34-55 4-25 (214)
82 KOG2186|consensus 82.4 0.95 2.1E-05 35.0 1.9 46 67-115 4-49 (276)
83 TIGR02098 MJ0042_CXXC MJ0042 f 81.9 0.98 2.1E-05 23.9 1.4 13 95-107 3-15 (38)
84 PF09538 FYDLN_acid: Protein o 80.7 1.1 2.3E-05 30.3 1.5 29 160-188 11-39 (108)
85 cd00729 rubredoxin_SM Rubredox 79.9 0.93 2E-05 23.6 0.8 9 95-103 3-11 (34)
86 smart00614 ZnF_BED BED zinc fi 79.2 1.8 4E-05 24.6 2.0 24 124-147 19-47 (50)
87 TIGR00622 ssl1 transcription f 77.7 2.7 5.8E-05 28.5 2.7 89 33-130 13-110 (112)
88 smart00734 ZnF_Rad18 Rad18-lik 77.0 2.3 5E-05 20.6 1.7 19 124-143 2-20 (26)
89 TIGR00373 conserved hypothetic 76.7 3.6 7.8E-05 29.9 3.4 32 91-132 106-137 (158)
90 smart00531 TFIIE Transcription 74.9 2.9 6.2E-05 30.0 2.5 12 35-46 99-110 (147)
91 PF04959 ARS2: Arsenite-resist 74.4 2.5 5.4E-05 32.3 2.1 30 120-149 74-103 (214)
92 PRK14890 putative Zn-ribbon RN 73.7 1.7 3.6E-05 25.7 0.8 10 93-102 47-56 (59)
93 smart00531 TFIIE Transcription 73.5 5.5 0.00012 28.5 3.6 38 91-133 96-133 (147)
94 TIGR00373 conserved hypothetic 73.3 3.7 8E-05 29.8 2.7 17 66-82 109-125 (158)
95 PRK06266 transcription initiat 71.1 4.7 0.0001 29.9 2.9 32 92-133 115-146 (178)
96 PF09723 Zn-ribbon_8: Zinc rib 69.8 0.66 1.4E-05 25.4 -1.4 12 36-47 6-17 (42)
97 PRK00398 rpoP DNA-directed RNA 68.4 2.9 6.3E-05 23.2 1.0 10 36-45 4-13 (46)
98 smart00659 RPOLCX RNA polymera 67.9 3.1 6.6E-05 23.1 1.0 10 36-45 3-12 (44)
99 PRK00464 nrdR transcriptional 67.0 0.88 1.9E-05 32.8 -1.6 14 124-137 29-42 (154)
100 PF02176 zf-TRAF: TRAF-type zi 66.7 3.4 7.3E-05 24.2 1.1 19 81-99 25-43 (60)
101 TIGR02300 FYDLN_acid conserved 66.6 4.2 9E-05 28.1 1.7 33 160-192 11-43 (129)
102 PF13451 zf-trcl: Probable zin 65.9 4.3 9.2E-05 23.1 1.3 38 93-130 3-40 (49)
103 PRK06266 transcription initiat 65.8 5.6 0.00012 29.5 2.4 15 66-80 117-131 (178)
104 PF03604 DNA_RNApol_7kD: DNA d 65.2 2.4 5.2E-05 21.7 0.2 11 36-46 1-11 (32)
105 PHA00626 hypothetical protein 63.7 1.8 3.9E-05 25.1 -0.4 12 66-77 23-34 (59)
106 TIGR02605 CxxC_CxxC_SSSS putat 62.8 1.3 2.7E-05 25.4 -1.2 11 36-46 6-16 (52)
107 PF01363 FYVE: FYVE zinc finge 59.5 4.9 0.00011 24.4 0.9 29 159-188 10-38 (69)
108 PF05290 Baculo_IE-1: Baculovi 59.3 11 0.00023 26.4 2.6 13 125-137 123-135 (140)
109 COG1592 Rubrerythrin [Energy p 59.1 5.5 0.00012 29.1 1.3 24 35-74 134-157 (166)
110 KOG2593|consensus 59.0 11 0.00025 31.7 3.2 37 91-131 125-161 (436)
111 KOG4167|consensus 58.3 4.9 0.00011 36.1 1.1 23 10-32 795-817 (907)
112 PF06524 NOA36: NOA36 protein; 56.7 5.4 0.00012 31.2 0.9 23 120-142 206-228 (314)
113 COG1996 RPC10 DNA-directed RNA 56.4 5.8 0.00013 22.5 0.8 10 67-76 7-16 (49)
114 COG5188 PRP9 Splicing factor 3 56.3 11 0.00025 30.8 2.7 23 65-87 237-259 (470)
115 PF14353 CpXC: CpXC protein 54.9 11 0.00025 26.0 2.3 17 123-139 38-54 (128)
116 PF13240 zinc_ribbon_2: zinc-r 54.8 2.4 5.1E-05 19.8 -0.8 6 38-43 16-21 (23)
117 PF04780 DUF629: Protein of un 54.7 9.2 0.0002 32.9 2.1 28 123-150 57-84 (466)
118 PF04959 ARS2: Arsenite-resist 53.5 10 0.00022 29.0 2.0 26 94-119 77-102 (214)
119 PF07754 DUF1610: Domain of un 52.8 6.1 0.00013 18.7 0.4 9 122-130 15-23 (24)
120 COG4957 Predicted transcriptio 52.2 13 0.00028 26.0 2.1 21 180-203 81-101 (148)
121 PF10571 UPF0547: Uncharacteri 51.5 4 8.8E-05 19.7 -0.3 10 37-46 16-25 (26)
122 KOG4167|consensus 50.8 4.6 9.9E-05 36.3 -0.3 26 66-91 792-817 (907)
123 PF05443 ROS_MUCR: ROS/MUCR tr 49.4 11 0.00023 26.6 1.4 25 66-93 72-96 (132)
124 PF15269 zf-C2H2_7: Zinc-finge 49.2 15 0.00033 20.2 1.7 21 95-115 21-41 (54)
125 PRK09678 DNA-binding transcrip 49.0 3.3 7.2E-05 25.7 -1.0 8 128-135 34-41 (72)
126 KOG1280|consensus 47.1 18 0.00038 29.7 2.4 26 36-61 80-105 (381)
127 COG2331 Uncharacterized protei 46.6 7.8 0.00017 24.1 0.3 10 36-45 13-22 (82)
128 PF08274 PhnA_Zn_Ribbon: PhnA 46.5 9 0.00019 19.3 0.5 7 67-73 20-26 (30)
129 PF04780 DUF629: Protein of un 45.8 16 0.00035 31.4 2.2 29 93-121 56-84 (466)
130 COG0068 HypF Hydrogenase matur 44.6 2.4 5.3E-05 37.9 -2.9 56 68-131 125-181 (750)
131 KOG0717|consensus 44.1 9.8 0.00021 32.4 0.6 22 8-29 293-314 (508)
132 cd00730 rubredoxin Rubredoxin; 43.7 13 0.00028 21.2 0.9 11 95-105 2-12 (50)
133 PRK00432 30S ribosomal protein 43.7 18 0.00038 20.6 1.5 25 161-185 23-47 (50)
134 KOG1842|consensus 43.3 15 0.00032 31.2 1.5 29 123-151 15-43 (505)
135 COG4530 Uncharacterized protei 43.2 14 0.0003 24.8 1.1 27 161-187 12-38 (129)
136 COG5151 SSL1 RNA polymerase II 43.1 18 0.00038 29.2 1.9 47 97-147 365-411 (421)
137 PRK04023 DNA polymerase II lar 42.3 15 0.00032 34.7 1.5 9 95-103 664-672 (1121)
138 PF07800 DUF1644: Protein of u 42.2 56 0.0012 23.7 4.1 57 94-154 80-139 (162)
139 PRK03824 hypA hydrogenase nick 42.2 8.8 0.00019 27.1 0.1 12 95-106 71-82 (135)
140 KOG3408|consensus 42.1 17 0.00038 24.9 1.5 26 120-145 54-79 (129)
141 KOG4377|consensus 40.1 20 0.00042 30.2 1.8 124 67-199 272-427 (480)
142 smart00834 CxxC_CXXC_SSSS Puta 39.7 12 0.00027 19.8 0.4 14 123-136 5-18 (41)
143 COG3364 Zn-ribbon containing p 38.7 19 0.00041 23.8 1.2 12 124-135 3-14 (112)
144 PF08790 zf-LYAR: LYAR-type C2 38.1 6 0.00013 19.5 -0.9 18 124-142 1-18 (28)
145 PF07975 C1_4: TFIIH C1-like d 38.0 18 0.00038 20.8 0.9 20 123-142 21-40 (51)
146 PF13878 zf-C2H2_3: zinc-finge 37.9 34 0.00073 18.5 2.0 23 124-147 14-38 (41)
147 PF14803 Nudix_N_2: Nudix N-te 37.6 3.2 6.9E-05 21.6 -2.0 8 36-43 23-30 (34)
148 PF15135 UPF0515: Uncharacteri 36.9 25 0.00054 27.4 1.8 58 34-107 111-168 (278)
149 PF13453 zf-TFIIB: Transcripti 35.5 15 0.00033 19.7 0.4 17 124-140 20-36 (41)
150 PF12760 Zn_Tnp_IS1595: Transp 35.2 25 0.00054 19.4 1.2 7 95-101 19-25 (46)
151 PF13358 DDE_3: DDE superfamil 34.5 31 0.00068 23.7 2.0 24 201-225 112-135 (146)
152 KOG2593|consensus 34.4 41 0.00089 28.5 2.8 17 33-49 126-142 (436)
153 TIGR00244 transcriptional regu 34.4 30 0.00064 24.8 1.7 33 174-206 27-62 (147)
154 KOG2907|consensus 33.9 19 0.00041 24.4 0.6 33 6-48 6-38 (116)
155 KOG2272|consensus 32.5 50 0.0011 25.9 2.8 21 118-138 216-236 (332)
156 KOG3408|consensus 32.5 31 0.00068 23.7 1.5 25 91-115 54-78 (129)
157 KOG4124|consensus 32.4 13 0.00027 30.6 -0.4 71 122-197 348-420 (442)
158 PF09963 DUF2197: Uncharacteri 32.4 17 0.00037 21.3 0.2 10 122-131 30-39 (56)
159 KOG0717|consensus 32.3 27 0.00059 29.9 1.5 31 174-204 459-490 (508)
160 KOG4216|consensus 32.1 49 0.0011 27.8 2.8 68 159-226 47-115 (479)
161 PF07282 OrfB_Zn_ribbon: Putat 31.9 31 0.00068 20.8 1.4 26 160-185 30-56 (69)
162 smart00154 ZnF_AN1 AN1-like Zi 31.4 22 0.00047 19.0 0.5 13 123-135 12-24 (39)
163 PF05191 ADK_lid: Adenylate ki 31.3 26 0.00056 18.4 0.8 10 96-105 3-12 (36)
164 KOG1280|consensus 30.9 37 0.0008 27.9 2.0 26 123-148 79-104 (381)
165 PF05495 zf-CHY: CHY zinc fing 30.8 7.2 0.00016 24.1 -1.6 12 94-105 41-52 (71)
166 COG4888 Uncharacterized Zn rib 30.8 7.6 0.00016 25.6 -1.5 11 124-134 47-57 (104)
167 cd00065 FYVE FYVE domain; Zinc 30.7 34 0.00073 19.6 1.4 26 161-187 5-30 (57)
168 COG1655 Uncharacterized protei 30.5 13 0.00028 28.6 -0.6 26 33-58 17-42 (267)
169 COG1198 PriA Primosomal protei 29.2 13 0.00029 33.9 -0.8 10 93-102 474-483 (730)
170 COG1327 Predicted transcriptio 28.4 40 0.00088 24.2 1.6 33 174-206 27-62 (156)
171 KOG2636|consensus 28.0 41 0.0009 28.7 1.8 28 117-144 395-423 (497)
172 PRK12380 hydrogenase nickel in 27.8 15 0.00033 25.0 -0.6 26 34-74 69-94 (113)
173 PF01428 zf-AN1: AN1-like Zinc 27.5 22 0.00047 19.4 0.1 14 122-135 12-25 (43)
174 COG4896 Uncharacterized protei 27.3 41 0.00089 20.0 1.2 40 95-135 3-43 (68)
175 KOG1842|consensus 27.0 38 0.00082 28.9 1.4 27 65-91 14-40 (505)
176 smart00440 ZnF_C2C2 C2C2 Zinc 26.9 4.9 0.00011 21.7 -2.5 10 124-133 29-38 (40)
177 COG2879 Uncharacterized small 26.6 32 0.0007 20.6 0.7 18 188-205 25-42 (65)
178 COG1773 Rubredoxin [Energy pro 26.1 29 0.00063 20.2 0.5 41 123-165 3-43 (55)
179 smart00064 FYVE Protein presen 25.4 46 0.00099 19.9 1.3 10 125-134 12-21 (68)
180 PF06220 zf-U1: U1 zinc finger 25.4 56 0.0012 17.3 1.5 8 126-133 6-13 (38)
181 PF08209 Sgf11: Sgf11 (transcr 25.3 57 0.0012 16.8 1.4 19 124-143 5-23 (33)
182 PF07535 zf-DBF: DBF zinc fing 24.7 30 0.00065 19.6 0.3 19 8-29 6-24 (49)
183 KOG2807|consensus 24.2 88 0.0019 25.6 2.9 68 65-147 289-368 (378)
184 TIGR00100 hypA hydrogenase nic 24.2 29 0.00064 23.6 0.3 11 95-105 71-81 (115)
185 PF04423 Rad50_zn_hook: Rad50 23.6 30 0.00066 19.8 0.2 13 125-137 22-34 (54)
186 PRK00420 hypothetical protein; 23.4 57 0.0012 22.2 1.5 28 160-187 25-52 (112)
187 PLN03238 probable histone acet 23.0 49 0.0011 26.6 1.3 21 5-28 49-69 (290)
188 PF07503 zf-HYPF: HypF finger; 23.0 10 0.00022 19.8 -1.6 6 68-73 23-28 (35)
189 smart00586 ZnF_DBF Zinc finger 22.8 43 0.00094 19.0 0.8 19 8-29 6-24 (49)
190 PTZ00448 hypothetical protein; 22.3 62 0.0013 27.0 1.8 25 123-147 314-338 (373)
191 PF10013 DUF2256: Uncharacteri 22.0 57 0.0012 17.8 1.1 17 67-83 9-25 (42)
192 PF13821 DUF4187: Domain of un 21.9 1.1E+02 0.0024 17.8 2.4 17 95-111 28-44 (55)
193 PF04216 FdhE: Protein involve 21.8 5.6 0.00012 32.0 -4.2 54 65-134 196-249 (290)
194 COG3091 SprT Zn-dependent meta 21.5 48 0.001 23.8 0.9 32 94-131 117-148 (156)
195 COG0068 HypF Hydrogenase matur 21.4 9.6 0.00021 34.4 -3.1 56 37-101 125-180 (750)
196 COG1675 TFA1 Transcription ini 21.3 1.3E+02 0.0027 22.4 3.1 34 89-132 108-141 (176)
197 PF04606 Ogr_Delta: Ogr/Delta- 21.2 18 0.00039 20.2 -1.0 6 38-43 2-7 (47)
198 COG1997 RPL43A Ribosomal prote 21.2 70 0.0015 20.6 1.5 14 174-187 52-65 (89)
199 PF09845 DUF2072: Zn-ribbon co 20.6 43 0.00093 23.4 0.5 15 123-137 1-15 (131)
200 PF04810 zf-Sec23_Sec24: Sec23 20.3 31 0.00068 18.4 -0.1 11 121-131 22-32 (40)
201 PF12773 DZR: Double zinc ribb 20.2 40 0.00088 18.7 0.3 8 38-45 15-22 (50)
202 COG5188 PRP9 Splicing factor 3 20.0 61 0.0013 26.8 1.3 22 8-29 239-260 (470)
203 KOG2071|consensus 20.0 59 0.0013 28.8 1.4 27 33-59 416-442 (579)
No 1
>KOG2462|consensus
Probab=99.96 E-value=7.3e-31 Score=197.67 Aligned_cols=138 Identities=28% Similarity=0.577 Sum_probs=128.8
Q ss_pred CCcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHH
Q psy14799 33 DKSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVR 112 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 112 (230)
...|+|+.|++.+.+.++|.+|.++|......+.+.|.+||+.|.+...|..|+++|+ -+.+|.+||+.|...+-|+.
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQG 205 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQG 205 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhc
Confidence 3459999999999999999999999998887788999999999999999999999997 57899999999999999999
Q ss_pred HHHhhcCCCCcccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccccccccCCccccccccCCccc
Q psy14799 113 HMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHP 192 (230)
Q Consensus 113 H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~ 192 (230)
|+| .|+|+|||.|+.|++.|.++++|+.| +.+|.+.++|.|. .|+|+|..++.|.
T Consensus 206 HiR-THTGEKPF~C~hC~kAFADRSNLRAH-mQTHS~~K~~qC~-----------------------~C~KsFsl~SyLn 260 (279)
T KOG2462|consen 206 HIR-THTGEKPFSCPHCGKAFADRSNLRAH-MQTHSDVKKHQCP-----------------------RCGKSFALKSYLN 260 (279)
T ss_pred ccc-cccCCCCccCCcccchhcchHHHHHH-HHhhcCCccccCc-----------------------chhhHHHHHHHHH
Confidence 999 99999999999999999999999999 7889999999986 8999999999999
Q ss_pred ccccc
Q psy14799 193 CHLLT 197 (230)
Q Consensus 193 ~H~~~ 197 (230)
+|...
T Consensus 261 KH~ES 265 (279)
T KOG2462|consen 261 KHSES 265 (279)
T ss_pred Hhhhh
Confidence 99874
No 2
>KOG2462|consensus
Probab=99.96 E-value=8.8e-30 Score=191.87 Aligned_cols=130 Identities=28% Similarity=0.525 Sum_probs=122.4
Q ss_pred hhhhhhhhccCHHHHHHHHHhhcC---CCcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHh
Q psy14799 9 YWNKNYFFFLMGLKLEAHITTVHG---DKSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYH 85 (230)
Q Consensus 9 ~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H 85 (230)
-|++||+.+.+...|.+|.++|-. .+.+.|+.|++.|.+...|+.|+++|.. +..|.+||+.|...+.|+-|
T Consensus 132 ~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l-----~c~C~iCGKaFSRPWLLQGH 206 (279)
T KOG2462|consen 132 KCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTL-----PCECGICGKAFSRPWLLQGH 206 (279)
T ss_pred eccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCC-----CcccccccccccchHHhhcc
Confidence 466699999999999999999853 5679999999999999999999999874 59999999999999999999
Q ss_pred HHHhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHH
Q psy14799 86 VKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLL 144 (230)
Q Consensus 86 ~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 144 (230)
+|+|+||+||.|+.|++.|.++++|+.|++ .|.+.++|+|..|+++|...+-|.+|..
T Consensus 207 iRTHTGEKPF~C~hC~kAFADRSNLRAHmQ-THS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 207 IRTHTGEKPFSCPHCGKAFADRSNLRAHMQ-THSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred cccccCCCCccCCcccchhcchHHHHHHHH-hhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 999999999999999999999999999999 9999999999999999999999999944
No 3
>KOG1074|consensus
Probab=99.92 E-value=4.2e-26 Score=193.42 Aligned_cols=75 Identities=35% Similarity=0.627 Sum_probs=62.2
Q ss_pred ccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhcCCC----Cccccc---ccccccCChHHH
Q psy14799 67 FPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNI----RSFTCH---LCNKGFFRKSYL 139 (230)
Q Consensus 67 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~----~~~~C~---~C~~~f~~~~~l 139 (230)
..|.+|.+...-.+.|+.|.++|+|++||+|.+|++.|.++.+|+.|+- .|... .++.|+ +|.+.|...-.|
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhccc-ccccCccccccccCCchhhhcccccccccc
Confidence 7899999999999999999999999999999999999999999999998 66433 235565 566666665555
Q ss_pred HHH
Q psy14799 140 KVH 142 (230)
Q Consensus 140 ~~H 142 (230)
..|
T Consensus 685 pQh 687 (958)
T KOG1074|consen 685 PQH 687 (958)
T ss_pred cce
Confidence 555
No 4
>KOG3608|consensus
Probab=99.91 E-value=5e-25 Score=170.70 Aligned_cols=187 Identities=24% Similarity=0.397 Sum_probs=153.0
Q ss_pred hhhhhhhccCHHHHHHHHHhhcCCCcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHh
Q psy14799 10 WNKNYFFFLMGLKLEAHITTVHGDKSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTH 89 (230)
Q Consensus 10 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h 89 (230)
|..|-+.|.++..|++|+++|++++...|+.||..|.++..|--|.+....... .+|.|..|.+.|.+...|..|+..|
T Consensus 182 W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~-n~fqC~~C~KrFaTeklL~~Hv~rH 260 (467)
T KOG3608|consen 182 WAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNT-NSFQCAQCFKRFATEKLLKSHVVRH 260 (467)
T ss_pred chhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcC-CchHHHHHHHHHhHHHHHHHHHHHh
Confidence 477889999999999999999999999999999999999999999986654433 5899999999999999999999988
Q ss_pred cCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCc
Q psy14799 90 TGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGH 169 (230)
Q Consensus 90 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~ 169 (230)
. ..|+|+.|+.+....+.|..|++-.|...+||+|..|++.|.+.++|.+| ...|+ +..|.|.. +.|-..+.
T Consensus 261 v--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH-~~~HS-~~~y~C~h----~~C~~s~r 332 (467)
T KOG3608|consen 261 V--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKH-VQVHS-KTVYQCEH----PDCHYSVR 332 (467)
T ss_pred h--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHH-HHhcc-ccceecCC----CCCcHHHH
Confidence 6 45999999999999999999999999999999999999999999999999 55887 55677653 22433332
Q ss_pred cc-----cc-ccccC--------CccccccccCCccccccccccCCCCCC
Q psy14799 170 CL-----QI-GTHSG--------GICGSMWRVETWHPCHLLTERNRQPPY 205 (230)
Q Consensus 170 ~~-----~~-~~h~~--------~~C~~~f~~~~~l~~H~~~h~~~~~~~ 205 (230)
+. |+ .+|.+ ..|++.|++-.+|..|++.-|+=+-|+
T Consensus 333 ~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~Ps 382 (467)
T KOG3608|consen 333 TYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRLPS 382 (467)
T ss_pred HHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccCCC
Confidence 22 22 33323 377777777777777776665555443
No 5
>KOG3608|consensus
Probab=99.86 E-value=4.5e-22 Score=154.42 Aligned_cols=152 Identities=24% Similarity=0.422 Sum_probs=116.9
Q ss_pred hhhhhhhccCHHHHHHHHHhhcC--CCcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHH
Q psy14799 10 WNKNYFFFLMGLKLEAHITTVHG--DKSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVK 87 (230)
Q Consensus 10 C~~C~~~f~~~~~l~~H~~~h~~--~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 87 (230)
|+-|+..|.++..|-.|++..+. ..+|.|..|-+.|.+...|..|+..|-. -|+|+.|+.+....++|.+|++
T Consensus 210 Cp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-----~ykCplCdmtc~~~ssL~~H~r 284 (467)
T KOG3608|consen 210 CPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-----CYKCPLCDMTCSSASSLTTHIR 284 (467)
T ss_pred cchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh-----cccccccccCCCChHHHHHHHH
Confidence 66688888888888888876653 4578888888888888888888886654 3888888888888888888877
Q ss_pred -HhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccc--cccccCChHHHHHHHHhhcCC--CChhhhhhccCCC
Q psy14799 88 -THTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHL--CNKGFFRKSYLKVHLLTIHMQ--PADEAQNAIHRCY 162 (230)
Q Consensus 88 -~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~--~~~~~c~~~~~C~ 162 (230)
.|..++||+|+.|++.|.+.++|.+|.. .|+ +..|+|.. |..+|.+...+++|++.+|.+ +.+|.|-. |.
T Consensus 285 ~rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~---Cd 359 (467)
T KOG3608|consen 285 YRHSKDKPFKCDECDTRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHC---CD 359 (467)
T ss_pred hhhccCCCccccchhhhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeec---ch
Confidence 5667788888888888888888888888 787 55688877 888888888888888888843 45577665 44
Q ss_pred CCCCCCccc
Q psy14799 163 ACRCAGHCL 171 (230)
Q Consensus 163 ~c~~~~~~~ 171 (230)
.++.+...+
T Consensus 360 r~ft~G~~L 368 (467)
T KOG3608|consen 360 RFFTSGKSL 368 (467)
T ss_pred hhhccchhH
Confidence 444444333
No 6
>KOG3623|consensus
Probab=99.79 E-value=3.9e-20 Score=155.37 Aligned_cols=77 Identities=23% Similarity=0.407 Sum_probs=71.8
Q ss_pred eecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccccc
Q psy14799 95 VTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQIG 174 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~ 174 (230)
|.|+.|+++|...+.|.+|.. .|+|.+||+|.+|.+.|..+-.|..| ++.|.|++||.|.
T Consensus 895 yaCDqCDK~FqKqSSLaRHKY-EHsGqRPyqC~iCkKAFKHKHHLtEH-kRLHSGEKPfQCd------------------ 954 (1007)
T KOG3623|consen 895 YACDQCDKAFQKQSSLARHKY-EHSGQRPYQCIICKKAFKHKHHLTEH-KRLHSGEKPFQCD------------------ 954 (1007)
T ss_pred chHHHHHHHHHhhHHHHHhhh-hhcCCCCcccchhhHhhhhhhhhhhh-hhhccCCCcchhh------------------
Confidence 889999999999999999998 89999999999999999999999999 8899999999986
Q ss_pred cccCCccccccccCCccccccc
Q psy14799 175 THSGGICGSMWRVETWHPCHLL 196 (230)
Q Consensus 175 ~h~~~~C~~~f~~~~~l~~H~~ 196 (230)
.|+|+|...+....||.
T Consensus 955 -----KClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 955 -----KCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred -----hhhhhcccccchHhhhc
Confidence 89999999999999995
No 7
>KOG1074|consensus
Probab=99.78 E-value=4.6e-20 Score=157.08 Aligned_cols=94 Identities=22% Similarity=0.451 Sum_probs=79.8
Q ss_pred ceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCccccc
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQI 173 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~ 173 (230)
+-+|-+|.++..-.++|+-|.+ .|+|++||+|.+||+.|.++.+|+.| ..+|....+..-. ..|+.-+
T Consensus 605 PNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH-~~vHka~p~~R~q--~ScP~~~-------- 672 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAH-MSVHKAKPPARVQ--FSCPSTF-------- 672 (958)
T ss_pred ccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhc-ccccccCcccccc--ccCCchh--------
Confidence 4789999999999999999999 99999999999999999999999999 8889777665521 2222111
Q ss_pred ccccCCccccccccCCccccccccccCCCCCC
Q psy14799 174 GTHSGGICGSMWRVETWHPCHLLTERNRQPPY 205 (230)
Q Consensus 174 ~~h~~~~C~~~f~~~~~l~~H~~~h~~~~~~~ 205 (230)
+|-+.|.+...|..|+++|.+.+-|.
T Consensus 673 ------ic~~kftn~V~lpQhIriH~~~~~s~ 698 (958)
T KOG1074|consen 673 ------ICQKKFTNAVTLPQHIRIHLGGQISN 698 (958)
T ss_pred ------hhcccccccccccceEEeecCCCCCC
Confidence 99999999999999999999776543
No 8
>KOG3576|consensus
Probab=99.74 E-value=4e-19 Score=128.25 Aligned_cols=123 Identities=25% Similarity=0.423 Sum_probs=109.4
Q ss_pred CcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHH
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLL 144 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 144 (230)
..|.|.+|++.|.-...|.+|++-|...+.+-|..||+.|.+..+|++|++ .|++.+||+|..|++.|+..-.|..|++
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~r-thtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTR-THTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhc-cccCccccchhhhhHHHHhhccHHHHHH
Confidence 359999999999999999999999999999999999999999999999999 9999999999999999999999999999
Q ss_pred hhcCCCChhhhhhccCCCCCCCCCcccccccccCCccccccccCCccccccccccCC
Q psy14799 145 TIHMQPADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLTERNR 201 (230)
Q Consensus 145 ~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~h~~~ 201 (230)
.+|.-...|.-. ....+.+.|+.||.+-.....+..|++.|+..
T Consensus 195 kvhgv~~~yayk-------------err~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 195 KVHGVQHQYAYK-------------ERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHcCchHHHHHH-------------HhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 999766655432 12235677789999999999999999999743
No 9
>KOG3576|consensus
Probab=99.71 E-value=4.5e-18 Score=122.89 Aligned_cols=115 Identities=30% Similarity=0.561 Sum_probs=103.5
Q ss_pred CCcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHH
Q psy14799 33 DKSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVR 112 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 112 (230)
...|.|.+|++.|.-...|.+|++-|... +.+.|..||+.|...-+|++|+++|+|.+||+|..|++.|...-.|..
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~v---kr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDV---KRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHH---HHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence 34599999999999999999999977666 789999999999999999999999999999999999999999999999
Q ss_pred HHHhhcC----------CCCcccccccccccCChHHHHHHHHhhcCCC
Q psy14799 113 HMKGRHL----------NIRSFTCHLCNKGFFRKSYLKVHLLTIHMQP 150 (230)
Q Consensus 113 H~~~~H~----------~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 150 (230)
|.+..|. ..+.|.|..||.+-.....+..|+...|...
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 9998885 4467999999999999999999977777544
No 10
>KOG3623|consensus
Probab=99.49 E-value=9e-15 Score=123.55 Aligned_cols=79 Identities=30% Similarity=0.610 Sum_probs=76.2
Q ss_pred CcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHH
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLL 144 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 144 (230)
-.|.|+.|++.|...++|.+|.-.|+|.+||+|.+|.+.|..+-.|..|+| .|.|+|||+|..|++.|.....+.+||.
T Consensus 893 gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkR-LHSGEKPfQCdKClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 893 GMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKR-LHSGEKPFQCDKCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred ccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhh-hccCCCcchhhhhhhhcccccchHhhhc
Confidence 569999999999999999999999999999999999999999999999999 9999999999999999999999999953
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.36 E-value=9e-13 Score=111.51 Aligned_cols=102 Identities=21% Similarity=0.515 Sum_probs=71.6
Q ss_pred ccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceeccccccccc----------
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFY---------- 105 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~---------- 105 (230)
+.|+.|+..|. ...|..|+.++| +++.|+ |+..+ ....|..|+.+|.+.+++.|+.|++.+.
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H-----kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d 525 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH-----EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRD 525 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC-----CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhh
Confidence 56777877775 466778877764 357777 77644 4567777877777777888888877774
Q ss_pred CHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcC
Q psy14799 106 SKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHM 148 (230)
Q Consensus 106 ~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 148 (230)
....|..|.. .. +.+++.|..||+.+... ++..|+...|.
T Consensus 526 ~~s~Lt~HE~-~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 526 RLRGMSEHES-IC-GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred hhhhHHHHHH-hc-CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence 1346777776 43 66777887787777665 67777666663
No 12
>PHA00733 hypothetical protein
Probab=99.25 E-value=1.3e-11 Score=85.93 Aligned_cols=82 Identities=21% Similarity=0.308 Sum_probs=59.8
Q ss_pred CCcccccccccccCCHHHHHHh--HH---HhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHH
Q psy14799 64 LRPFPCDLCEQRFLNKSHLAYH--VK---THTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSY 138 (230)
Q Consensus 64 ~~~~~C~~C~~~f~~~~~l~~H--~~---~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~ 138 (230)
.+++.|.+|...|.....|..+ +. .+.++++|.|+.|++.|.+...|..|++ .| ..+|.|+.|++.|.....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r-~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIR-YT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHh-cC--CcCccCCCCCCccCCHHH
Confidence 3667777777777766655554 11 2334678888888888888888888887 55 346888888888888888
Q ss_pred HHHHHHhhcC
Q psy14799 139 LKVHLLTIHM 148 (230)
Q Consensus 139 l~~H~~~~H~ 148 (230)
|..|+...|.
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 8888777663
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.19 E-value=3e-11 Score=102.43 Aligned_cols=143 Identities=17% Similarity=0.340 Sum_probs=101.8
Q ss_pred hhhhhhhccCHHHHHHHHHhhcCCCcccccc--cccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHH
Q psy14799 10 WNKNYFFFLMGLKLEAHITTVHGDKSFQCDE--CDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVK 87 (230)
Q Consensus 10 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 87 (230)
|+.|..... ...|..|..... ...-.|+. |+..|.. ..+. ..+.|+.|+..|. ...|..|+.
T Consensus 410 C~NC~~~i~-l~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~r-~el~------------~H~~C~~Cgk~f~-~s~LekH~~ 473 (567)
T PLN03086 410 CRNCKHYIP-SRSIALHEAYCS-RHNVVCPHDGCGIVLRV-EEAK------------NHVHCEKCGQAFQ-QGEMEKHMK 473 (567)
T ss_pred CCCCCCccc-hhHHHHHHhhCC-CcceeCCcccccceeec-cccc------------cCccCCCCCCccc-hHHHHHHHH
Confidence 444554433 344556664432 23345764 8877732 2222 3468999999996 678999999
Q ss_pred HhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccC----------ChHHHHHHHHhhcCCCChhhhhh
Q psy14799 88 THTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFF----------RKSYLKVHLLTIHMQPADEAQNA 157 (230)
Q Consensus 88 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~----------~~~~l~~H~~~~H~~~~~~~c~~ 157 (230)
+++ .++.|+ |++.+ ....|..|+. .|...+++.|+.|++.|. ..+.|..| .... +.+++.|.
T Consensus 474 ~~H--kpv~Cp-Cg~~~-~R~~L~~H~~-thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~H-E~~C-G~rt~~C~- 545 (567)
T PLN03086 474 VFH--EPLQCP-CGVVL-EKEQMVQHQA-STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEH-ESIC-GSRTAPCD- 545 (567)
T ss_pred hcC--CCccCC-CCCCc-chhHHHhhhh-ccCCCCceeCCCCCCccccCccccchhhhhhhHHHH-HHhc-CCcceEcc-
Confidence 886 789999 99755 6689999988 899999999999999985 24589999 4444 77788874
Q ss_pred ccCCCCCCCCCcccccccccCCccccccccCCcccccccccc
Q psy14799 158 IHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLTER 199 (230)
Q Consensus 158 ~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~h~ 199 (230)
.||+.++. ..|..|+..-|
T Consensus 546 ----------------------~Cgk~Vrl-rdm~~H~~~~h 564 (567)
T PLN03086 546 ----------------------SCGRSVML-KEMDIHQIAVH 564 (567)
T ss_pred ----------------------ccCCeeee-hhHHHHHHHhh
Confidence 78887776 56777776544
No 14
>PHA00733 hypothetical protein
Probab=99.17 E-value=5.2e-11 Score=82.94 Aligned_cols=97 Identities=18% Similarity=0.255 Sum_probs=74.8
Q ss_pred HHHHHHHHhhcCCCcccccccccccCChHHHHHh--HhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceecc
Q psy14799 21 LKLEAHITTVHGDKSFQCDECDMSFGTHQALGRH--KQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCG 98 (230)
Q Consensus 21 ~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H--~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 98 (230)
.+|.++-..-...+++.|.+|...|.+...|..+ +..+....+.++|.|+.|++.|.+...|..|++.+ +.++.|.
T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~ 103 (128)
T PHA00733 26 EELKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCP 103 (128)
T ss_pred HHhhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCC
Confidence 3443333322335779999999999998887777 33222222347999999999999999999999976 3579999
Q ss_pred cccccccCHHHHHHHHHhhcC
Q psy14799 99 LCFQSFYSKADLVRHMKGRHL 119 (230)
Q Consensus 99 ~C~~~f~~~~~l~~H~~~~H~ 119 (230)
.|++.|.....|..|+...|.
T Consensus 104 ~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 104 VCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCccCCHHHHHHHHHHhcC
Confidence 999999999999999986663
No 15
>KOG3993|consensus
Probab=99.01 E-value=1e-10 Score=93.71 Aligned_cols=145 Identities=19% Similarity=0.262 Sum_probs=101.6
Q ss_pred hhhhhhhhhhhhhhhccCHHHHHHHHHhhcCCCcccccccccccCChHHHHHhHhccCCCCC------------------
Q psy14799 2 LLSFILDYWNKNYFFFLMGLKLEAHITTVHGDKSFQCDECDMSFGTHQALGRHKQCKHMDIS------------------ 63 (230)
Q Consensus 2 ~~~~~c~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~------------------ 63 (230)
|=.|||..|.+ .|.+...|.+|.-.-.--.-|+|++|+++|....+|..|.+.|.....
T Consensus 265 iGdyiCqLCK~---kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae 341 (500)
T KOG3993|consen 265 IGDYICQLCKE---KYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAE 341 (500)
T ss_pred HHHHHHHHHHH---hhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhh
Confidence 34689999655 999999999997543333459999999999999999999998854311
Q ss_pred ------------CCcccccccccccCCHHHHHHhHHHhcCCC--------------------------------------
Q psy14799 64 ------------LRPFPCDLCEQRFLNKSHLAYHVKTHTGER-------------------------------------- 93 (230)
Q Consensus 64 ------------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-------------------------------------- 93 (230)
...|.|.+|++.|.....|++|+.+|+...
T Consensus 342 ~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 342 VQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV 421 (500)
T ss_pred hhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce
Confidence 123889999999999999999988775310
Q ss_pred --------ceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcCCC
Q psy14799 94 --------DVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQP 150 (230)
Q Consensus 94 --------~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 150 (230)
-..+++++..+.+...-..+.+ .-..+..|.|.+|.-+|.+...|.+|+...|..+
T Consensus 422 l~~a~sael~~pp~~~~ppsss~~sgg~~r-lg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 422 LYVAGSAELELPPYDGSPPSSSGSSGGYGR-LGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred eeeeccccccCCCCCCCCcccCCCCCcccc-ccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 0123334433333222222222 1112234899999999999999999998888543
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.82 E-value=2.1e-09 Score=62.12 Aligned_cols=39 Identities=13% Similarity=0.526 Sum_probs=18.5
Q ss_pred ccccccccccCCHHHHHHhHHHhcCCCceecccccccccCH
Q psy14799 67 FPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSK 107 (230)
Q Consensus 67 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 107 (230)
|.|+.||+.|...+.|..|+++|+ ++++|..|++.|...
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~ 44 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRT 44 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccc
Confidence 444444444444444444444444 344444444444433
No 17
>KOG3993|consensus
Probab=98.81 E-value=1.1e-09 Score=88.06 Aligned_cols=162 Identities=19% Similarity=0.299 Sum_probs=99.5
Q ss_pred ccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCcee---cccccccccCHHHHHH
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVT---CGLCFQSFYSKADLVR 112 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~---C~~C~~~f~~~~~l~~ 112 (230)
|.|..|...|.+...|.+|.-..... ..|+|++|++.|.-..+|..|.++|.....-. -+-=.+...+....+.
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~---vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVH---VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEE---eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 89999999999999999997533222 45999999999999999999999996422100 0000000011111111
Q ss_pred HHH-hhcCCCCcccccccccccCChHHHHHHHHhhcCCCChhh-------------------------------------
Q psy14799 113 HMK-GRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQPADEA------------------------------------- 154 (230)
Q Consensus 113 H~~-~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~------------------------------------- 154 (230)
-.+ ..-.....|.|..|++.|....-|+.|..++|.....-.
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~ 424 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV 424 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee
Confidence 000 000122358888888888888888888555553221110
Q ss_pred --hhhccCCCCCCCCCccc-----------ccccccCCccccccccCCccccccccccC
Q psy14799 155 --QNAIHRCYACRCAGHCL-----------QIGTHSGGICGSMWRVETWHPCHLLTERN 200 (230)
Q Consensus 155 --c~~~~~C~~c~~~~~~~-----------~~~~h~~~~C~~~f~~~~~l~~H~~~h~~ 200 (230)
-.....++.++..+++. -..+.+|++|.-.|.+..+|.+|+..-|-
T Consensus 425 a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 425 AGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred eccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 01113455555444332 13777788999999999999999887653
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.81 E-value=2e-09 Score=62.23 Aligned_cols=42 Identities=14% Similarity=0.463 Sum_probs=28.9
Q ss_pred eecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHH
Q psy14799 95 VTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYL 139 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l 139 (230)
|.|+.||+.|...+.|..|++ .|. ++++|..|++.|...++|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r-~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLR-KHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHH-hcC--CcccCCcccceeccccee
Confidence 667777777777777777777 565 467777777777666554
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.66 E-value=1.7e-08 Score=49.83 Aligned_cols=26 Identities=35% Similarity=0.782 Sum_probs=22.3
Q ss_pred HHHHHHHhhcCCCcccccccccccCC
Q psy14799 22 KLEAHITTVHGDKSFQCDECDMSFGT 47 (230)
Q Consensus 22 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 47 (230)
+|.+|+++|++++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47889999999999999999988863
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.62 E-value=3.1e-08 Score=48.91 Aligned_cols=24 Identities=46% Similarity=0.820 Sum_probs=13.0
Q ss_pred HHHhHHHhcCCCceeccccccccc
Q psy14799 82 LAYHVKTHTGERDVTCGLCFQSFY 105 (230)
Q Consensus 82 l~~H~~~h~~~~~~~C~~C~~~f~ 105 (230)
|..|+++|++++||.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555553
No 21
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.42 E-value=5.4e-07 Score=53.02 Aligned_cols=51 Identities=27% Similarity=0.525 Sum_probs=35.6
Q ss_pred ceecccccccccCHHHHHHHHHhhcCCC-CcccccccccccCChHHHHHHHHhhc
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHLNI-RSFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
.|.|++|++ ..+..+|..|....|..+ +.+.||+|...+. .+|.+|+...|
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 367888888 455677888877677654 4577888877544 37778876666
No 22
>PHA00616 hypothetical protein
Probab=98.39 E-value=1.8e-07 Score=51.46 Aligned_cols=33 Identities=18% Similarity=0.314 Sum_probs=19.3
Q ss_pred cccccccccccCCHHHHHHhHHHhcCCCceecc
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCG 98 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 98 (230)
||.|+.||..|...++|..|++.|++++++.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 355666666666666666666666665555554
No 23
>PHA00732 hypothetical protein
Probab=98.27 E-value=6e-07 Score=56.88 Aligned_cols=21 Identities=33% Similarity=0.599 Sum_probs=9.7
Q ss_pred ccccccccccCChHHHHHhHh
Q psy14799 36 FQCDECDMSFGTHQALGRHKQ 56 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~ 56 (230)
|.|+.|++.|.+...|..|++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r 22 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHAR 22 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhh
Confidence 444444444444444444444
No 24
>PHA00732 hypothetical protein
Probab=98.25 E-value=7.5e-07 Score=56.43 Aligned_cols=37 Identities=24% Similarity=0.507 Sum_probs=17.5
Q ss_pred eecccccccccCHHHHHHHHHhhcCCCCcccccccccccC
Q psy14799 95 VTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFF 134 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 134 (230)
|.|..|++.|.+...|..|++..|. ++.|+.|++.|.
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~ 38 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR 38 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC
Confidence 4455555555555555555442232 234555555554
No 25
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.17 E-value=3.2e-06 Score=49.72 Aligned_cols=37 Identities=24% Similarity=0.469 Sum_probs=15.5
Q ss_pred ccccccccccCChHHHHHhHhccCCCCCCCccccccccc
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQ 74 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~ 74 (230)
|.|+.|++ ..+...|..|....|.... +.+.||+|..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~-~~v~CPiC~~ 39 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSES-KNVVCPICSS 39 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCC-CCccCCCchh
Confidence 44444444 2333444444444433321 2344444444
No 26
>PHA00616 hypothetical protein
Probab=98.17 E-value=8.2e-07 Score=48.88 Aligned_cols=27 Identities=15% Similarity=0.160 Sum_probs=13.5
Q ss_pred hhhhccCHHHHHHHHHhhcCCCccccc
Q psy14799 13 NYFFFLMGLKLEAHITTVHGDKSFQCD 39 (230)
Q Consensus 13 C~~~f~~~~~l~~H~~~h~~~~~~~C~ 39 (230)
||+.|....+|.+|++.|++++++.|+
T Consensus 7 CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 7 CGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred hhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 444555555555555555555554443
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.93 E-value=6.3e-06 Score=39.31 Aligned_cols=22 Identities=41% Similarity=0.810 Sum_probs=14.0
Q ss_pred ccccccccccCChHHHHHHHHh
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLLT 145 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~~ 145 (230)
|.|+.|++.|.+.+.|.+|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666443
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.84 E-value=1.5e-05 Score=37.99 Aligned_cols=16 Identities=50% Similarity=0.791 Sum_probs=5.7
Q ss_pred ccccccccCHHHHHHH
Q psy14799 98 GLCFQSFYSKADLVRH 113 (230)
Q Consensus 98 ~~C~~~f~~~~~l~~H 113 (230)
+.|++.|.+...|..|
T Consensus 4 ~~C~~~f~~~~~l~~H 19 (23)
T PF00096_consen 4 PICGKSFSSKSNLKRH 19 (23)
T ss_dssp TTTTEEESSHHHHHHH
T ss_pred CCCCCccCCHHHHHHH
Confidence 3333333333333333
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.82 E-value=1.8e-05 Score=37.91 Aligned_cols=24 Identities=46% Similarity=0.834 Sum_probs=14.2
Q ss_pred ccccccccccCChHHHHHHHHhhc
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
|.|++|++.|.+..+|..|+.+.|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456777777777777777755443
No 30
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.77 E-value=2e-05 Score=52.51 Aligned_cols=72 Identities=24% Similarity=0.419 Sum_probs=15.4
Q ss_pred ccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhh
Q psy14799 69 CDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTI 146 (230)
Q Consensus 69 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 146 (230)
|..|+..|.+...|..|+...++-.. + ....+.....+..+++ . .....+.|..|++.|.+...|..|+...
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~-~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLR-K-KVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred cccccccccccccccccccccccccc---c-cccccccccccccccc-c-ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 55555555555556666543332110 0 1111122333333333 1 1112466777777777777777776554
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.66 E-value=4.4e-05 Score=50.83 Aligned_cols=70 Identities=26% Similarity=0.460 Sum_probs=14.5
Q ss_pred ccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHH
Q psy14799 38 CDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 38 C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 115 (230)
|..|+..|.+...|..|+...|+.. .. ....+.....+..+++... ...+.|..|++.|.+...|..|++
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~----~~---~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~ 71 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFD----IP---DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMR 71 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHH
T ss_pred ccccccccccccccccccccccccc----cc---ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHc
Confidence 5666666666666666665555431 00 0111112223333332211 124667777766766666777766
No 32
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.60 E-value=1.4e-05 Score=62.49 Aligned_cols=52 Identities=23% Similarity=0.511 Sum_probs=40.4
Q ss_pred CCceeccc--ccccccCHHHHHHHHHhhc------------------CCCCcccccccccccCChHHHHHHH
Q psy14799 92 ERDVTCGL--CFQSFYSKADLVRHMKGRH------------------LNIRSFTCHLCNKGFFRKSYLKVHL 143 (230)
Q Consensus 92 ~~~~~C~~--C~~~f~~~~~l~~H~~~~H------------------~~~~~~~C~~C~~~f~~~~~l~~H~ 143 (230)
++||+|++ |.+.+.+...|+-|+..-| ...|||+|++|++.|.....|+-|+
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 47788876 8888888888888876333 1337899999999999999999983
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.56 E-value=7e-05 Score=35.77 Aligned_cols=14 Identities=43% Similarity=0.743 Sum_probs=4.6
Q ss_pred cccccCCHHHHHHh
Q psy14799 72 CEQRFLNKSHLAYH 85 (230)
Q Consensus 72 C~~~f~~~~~l~~H 85 (230)
|++.|.+...|..|
T Consensus 6 C~~~~~~~~~l~~H 19 (24)
T PF13894_consen 6 CGKSFRSKSELRQH 19 (24)
T ss_dssp TS-EESSHHHHHHH
T ss_pred CCCcCCcHHHHHHH
Confidence 33333333333333
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.54 E-value=5.3e-05 Score=37.56 Aligned_cols=23 Identities=43% Similarity=0.936 Sum_probs=13.0
Q ss_pred ccccccccccCChHHHHHhHhcc
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCK 58 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~ 58 (230)
|.|..|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 55555555555555555555544
No 35
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.45 E-value=5.3e-05 Score=59.40 Aligned_cols=72 Identities=15% Similarity=0.223 Sum_probs=47.8
Q ss_pred CCCcccccc--cccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccccccccCCccccccccCCccccccc
Q psy14799 120 NIRSFTCHL--CNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLL 196 (230)
Q Consensus 120 ~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~ 196 (230)
+++||+|++ |.+.|++...|+-|++.-|...+..+-+... ........-+...|++|+|+|.+...|+-|..
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~-----~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPE-----KMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCcc-----ccccccccCCceeccccchhhccCccceeccc
Confidence 458999998 9999999999999987777332221111000 00001112244455799999999999999865
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.42 E-value=8.7e-05 Score=36.75 Aligned_cols=20 Identities=40% Similarity=0.718 Sum_probs=12.0
Q ss_pred cccccccccccCChHHHHHH
Q psy14799 123 SFTCHLCNKGFFRKSYLKVH 142 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H 142 (230)
||.|..|++.|.+...|..|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H 20 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREH 20 (27)
T ss_dssp SEEETTTTEEESSHHHHHHH
T ss_pred CCCCCccCCccCChhHHHHH
Confidence 35566666666666666666
No 37
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.33 E-value=0.00053 Score=54.57 Aligned_cols=138 Identities=26% Similarity=0.355 Sum_probs=91.9
Q ss_pred hhhhhhhhhhhhhhhccCHHHHHHHHHhhcCCCcccccccc---------cccCChHHHHHhHhccCCCCCCCc-ccccc
Q psy14799 2 LLSFILDYWNKNYFFFLMGLKLEAHITTVHGDKSFQCDECD---------MSFGTHQALGRHKQCKHMDISLRP-FPCDL 71 (230)
Q Consensus 2 ~~~~~c~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---------~~f~~~~~l~~H~~~~~~~~~~~~-~~C~~ 71 (230)
|+.|.|.. ..|..+-..-..|..|.+.-++. +.|.+|- -..-+...|..|...-....+-+. -.|..
T Consensus 149 LL~F~CP~-skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F 225 (493)
T COG5236 149 LLSFKCPK-SKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF 225 (493)
T ss_pred HHHhcCCc-hhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhh
Confidence 45666643 23555666677899999876653 4566553 223344667778653222212121 35999
Q ss_pred cccccCCHHHHHHhHHHhcCCCceeccccc----ccccCHHHHHHHHHhhcCCCCcccccc--cc----cccCChHHHHH
Q psy14799 72 CEQRFLNKSHLAYHVKTHTGERDVTCGLCF----QSFYSKADLVRHMKGRHLNIRSFTCHL--CN----KGFFRKSYLKV 141 (230)
Q Consensus 72 C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~----~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~----~~f~~~~~l~~ 141 (230)
|...|-+-..|..|++..+ ++-+.|..-+ .-|.+...|.+|.+..| |.|.+ |- ..|.+..+|..
T Consensus 226 C~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~el~~ 299 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHTELLE 299 (493)
T ss_pred ccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHHHHHH
Confidence 9999999999999999665 3334443322 34888899999998555 66654 32 57999999999
Q ss_pred HHHhhcC
Q psy14799 142 HLLTIHM 148 (230)
Q Consensus 142 H~~~~H~ 148 (230)
|+...|.
T Consensus 300 h~~~~h~ 306 (493)
T COG5236 300 HLTRFHK 306 (493)
T ss_pred HHHHHhh
Confidence 9999984
No 38
>KOG2231|consensus
Probab=97.30 E-value=0.00034 Score=60.99 Aligned_cols=92 Identities=24% Similarity=0.492 Sum_probs=49.1
Q ss_pred CChHHHHHhHhccCCCCCCCcccccccc---------cccCCHHHHHHhHHHhcC-CC----ceecccccccccCHHHHH
Q psy14799 46 GTHQALGRHKQCKHMDISLRPFPCDLCE---------QRFLNKSHLAYHVKTHTG-ER----DVTCGLCFQSFYSKADLV 111 (230)
Q Consensus 46 ~~~~~l~~H~~~~~~~~~~~~~~C~~C~---------~~f~~~~~l~~H~~~h~~-~~----~~~C~~C~~~f~~~~~l~ 111 (230)
.+...|+.|+...|.. +.|.+|- ...-+...|..|+..--. ++ .-.|..|...|.....|.
T Consensus 125 ~s~~~Lk~H~~~~H~~-----~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~ 199 (669)
T KOG2231|consen 125 KSVENLKNHMRDQHKL-----HLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELY 199 (669)
T ss_pred hHHHHHHHHHHHhhhh-----hccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHH
Confidence 3556677776544433 4444432 122234456666552211 11 135666777777777777
Q ss_pred HHHHhhcCCCCccccccc------ccccCChHHHHHHHHhhc
Q psy14799 112 RHMKGRHLNIRSFTCHLC------NKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 112 ~H~~~~H~~~~~~~C~~C------~~~f~~~~~l~~H~~~~H 147 (230)
+|++..| |.|..| +.-|.....|..|.+..|
T Consensus 200 rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 200 RHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred Hhhccce-----eheeecCcccccchhcccchHHHHHhhhcC
Confidence 7776444 445555 244556667777766666
No 39
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.06 E-value=0.00019 Score=57.02 Aligned_cols=135 Identities=27% Similarity=0.446 Sum_probs=89.8
Q ss_pred cccccc--cccccCChHHHHHhHhccCCCCCCCccccccccc---cc------CCHHHHHHhHHHhcCCCc----eeccc
Q psy14799 35 SFQCDE--CDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQ---RF------LNKSHLAYHVKTHTGERD----VTCGL 99 (230)
Q Consensus 35 ~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~---~f------~~~~~l~~H~~~h~~~~~----~~C~~ 99 (230)
.|.|+. |..+......|+.|....|+. +.|.+|-. .| -+...|..|...-..+.. -.|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~-----~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F 225 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF-----VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF 225 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc-----EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhh
Confidence 378865 677766678899999887765 67777742 22 234456666543322222 26999
Q ss_pred ccccccCHHHHHHHHHhhcCCCCcccccccccc-------cCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccc
Q psy14799 100 CFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKG-------FFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQ 172 (230)
Q Consensus 100 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~-------f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~ 172 (230)
|...|-+...|..|++..| -.|.+|++. |.+..+|..|.+..| |.|.. ..|.-
T Consensus 226 C~~~FYdDDEL~~HcR~~H-----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~----qtc~~------ 285 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRH-----EACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTF----QTCRV------ 285 (493)
T ss_pred ccceecChHHHHHHHHhhh-----hhhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEE----EEEec------
Confidence 9999999999999998666 247777654 788888999977766 65542 11100
Q ss_pred cccccCCccccccccCCccccccccccCC
Q psy14799 173 IGTHSGGICGSMWRVETWHPCHLLTERNR 201 (230)
Q Consensus 173 ~~~h~~~~C~~~f~~~~~l~~H~~~h~~~ 201 (230)
.. -..|.....|..|+..-|+.
T Consensus 286 ------~k-~~vf~~~~el~~h~~~~h~~ 307 (493)
T COG5236 286 ------GK-CYVFPYHTELLEHLTRFHKV 307 (493)
T ss_pred ------Cc-EEEeccHHHHHHHHHHHhhc
Confidence 01 15788888999998766543
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.75 E-value=0.0015 Score=36.92 Aligned_cols=29 Identities=17% Similarity=0.445 Sum_probs=12.1
Q ss_pred CcccccccccccCCHHHHHHhHHHhcCCC
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTGER 93 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 93 (230)
.|..|++|+..+.+..+|.+|+...++.+
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 34455555555555555555554444433
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.68 E-value=0.0016 Score=31.14 Aligned_cols=23 Identities=39% Similarity=0.582 Sum_probs=12.9
Q ss_pred ccccccccccCChHHHHHHHHhhc
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
|+|+.|+.... .+.|.+|++..|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666665 666666655554
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.68 E-value=0.0019 Score=31.06 Aligned_cols=21 Identities=43% Similarity=0.681 Sum_probs=13.6
Q ss_pred ccccccccccCChHHHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLL 144 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~ 144 (230)
|.|+.|++.|.+.+.|..|++
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHH
Confidence 456667777777767766643
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.66 E-value=0.0014 Score=37.00 Aligned_cols=31 Identities=23% Similarity=0.381 Sum_probs=18.5
Q ss_pred hcCCCCcccccccccccCChHHHHHHHHhhc
Q psy14799 117 RHLNIRSFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 117 ~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
.+..+.|-.|++|+..+....+|++|+...|
T Consensus 18 ~~~S~~PatCP~C~a~~~~srnLrRHle~~H 48 (54)
T PF09237_consen 18 KSQSEQPATCPICGAVIRQSRNLRRHLEIRH 48 (54)
T ss_dssp CCTTS--EE-TTT--EESSHHHHHHHHHHHT
T ss_pred hhccCCCCCCCcchhhccchhhHHHHHHHHh
Confidence 3445567778888888888888888866666
No 44
>KOG2231|consensus
Probab=96.57 E-value=0.0064 Score=53.36 Aligned_cols=115 Identities=25% Similarity=0.457 Sum_probs=74.8
Q ss_pred ccCHHHHHHHHHhhcCCCcccccc---------cccccCChHHHHHhHhccCC-CCCCCc-ccccccccccCCHHHHHHh
Q psy14799 17 FLMGLKLEAHITTVHGDKSFQCDE---------CDMSFGTHQALGRHKQCKHM-DISLRP-FPCDLCEQRFLNKSHLAYH 85 (230)
Q Consensus 17 f~~~~~l~~H~~~h~~~~~~~C~~---------C~~~f~~~~~l~~H~~~~~~-~~~~~~-~~C~~C~~~f~~~~~l~~H 85 (230)
|.+...|+.|+...|.. +.|.+ +.....+...|..|++.--. +...+. -.|..|...|.....|.+|
T Consensus 124 ~~s~~~Lk~H~~~~H~~--~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH 201 (669)
T KOG2231|consen 124 FKSVENLKNHMRDQHKL--HLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRH 201 (669)
T ss_pred hhHHHHHHHHHHHhhhh--hccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHh
Confidence 34889999999654432 33433 23334456778888774333 211122 4699999999999999999
Q ss_pred HHHhcCCCceecccc------cccccCHHHHHHHHHhhcCCCCccccc--cccc-ccCChHHHHHH
Q psy14799 86 VKTHTGERDVTCGLC------FQSFYSKADLVRHMKGRHLNIRSFTCH--LCNK-GFFRKSYLKVH 142 (230)
Q Consensus 86 ~~~h~~~~~~~C~~C------~~~f~~~~~l~~H~~~~H~~~~~~~C~--~C~~-~f~~~~~l~~H 142 (230)
++.++ |.|..| +..|.....|..|.+..| |.|. .|.- .|.....+..+
T Consensus 202 ~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~~~C~~~~f~~~~~~ei~ 258 (669)
T KOG2231|consen 202 LRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEEEFCRTKKFYVAFELEIE 258 (669)
T ss_pred hccce----eheeecCcccccchhcccchHHHHHhhhcC-----ccccccccccceeeehhHHHHH
Confidence 98766 566666 356788899999999666 7777 4542 33333344444
No 45
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.56 E-value=0.0017 Score=31.35 Aligned_cols=21 Identities=43% Similarity=0.887 Sum_probs=13.0
Q ss_pred ccccccccccCChHHHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLL 144 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~ 144 (230)
|.|.+|++.|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 456666666666666666643
No 46
>KOG2482|consensus
Probab=96.51 E-value=0.01 Score=47.40 Aligned_cols=24 Identities=33% Similarity=0.494 Sum_probs=14.9
Q ss_pred eecccccccccCHHHHHHHHHhhc
Q psy14799 95 VTCGLCFQSFYSKADLVRHMKGRH 118 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~~H 118 (230)
..|-.|.....+...|..||...|
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vH 303 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVH 303 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHH
Confidence 356666666666666666666555
No 47
>KOG4173|consensus
Probab=96.42 E-value=0.0012 Score=48.67 Aligned_cols=84 Identities=26% Similarity=0.495 Sum_probs=54.9
Q ss_pred Ccccccc--cccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhc---------CCCCcccccc--ccc
Q psy14799 65 RPFPCDL--CEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRH---------LNIRSFTCHL--CNK 131 (230)
Q Consensus 65 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H---------~~~~~~~C~~--C~~ 131 (230)
..+.|++ |...|.+......|..+-++ -.|+.|.+.|.+..-|..|+...| .|...|.|-+ |+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 3466766 66777777666666554443 257777777777777777776444 2455677744 777
Q ss_pred ccCChHHHHHHHHhhcCCCC
Q psy14799 132 GFFRKSYLKVHLLTIHMQPA 151 (230)
Q Consensus 132 ~f~~~~~l~~H~~~~H~~~~ 151 (230)
.|.+....+.|+...|.-..
T Consensus 155 KFkT~r~RkdH~I~~Hk~Pa 174 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHKYPA 174 (253)
T ss_pred hhhhhhhhhhHHHHhccCCc
Confidence 77777777777777775443
No 48
>KOG4173|consensus
Probab=96.32 E-value=0.0027 Score=46.92 Aligned_cols=84 Identities=24% Similarity=0.470 Sum_probs=66.8
Q ss_pred cccccc--cccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHh----------cCCCceeccc--c
Q psy14799 35 SFQCDE--CDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTH----------TGERDVTCGL--C 100 (230)
Q Consensus 35 ~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~~~C~~--C 100 (230)
.|.|++ |...|.+......|...-|+. .|..|.+.|.+...|..|+..- .|...|.|-+ |
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 377876 778888888888887665554 8999999999999999998633 2456689965 9
Q ss_pred cccccCHHHHHHHHHhhcCCCCcc
Q psy14799 101 FQSFYSKADLVRHMKGRHLNIRSF 124 (230)
Q Consensus 101 ~~~f~~~~~l~~H~~~~H~~~~~~ 124 (230)
+..|.+..+...|+...|.-...|
T Consensus 153 t~KFkT~r~RkdH~I~~Hk~Pa~f 176 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHMIRMHKYPADF 176 (253)
T ss_pred hhhhhhhhhhhhHHHHhccCCcce
Confidence 999999999999998788655444
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.24 E-value=0.0038 Score=30.03 Aligned_cols=21 Identities=33% Similarity=0.768 Sum_probs=16.7
Q ss_pred eecccccccccCHHHHHHHHH
Q psy14799 95 VTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~ 115 (230)
|.|.+|++.|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 568888888888888888876
No 50
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.21 E-value=0.0041 Score=29.65 Aligned_cols=7 Identities=43% Similarity=0.909 Sum_probs=2.6
Q ss_pred HHHHHHH
Q psy14799 108 ADLVRHM 114 (230)
Q Consensus 108 ~~l~~H~ 114 (230)
..|.+|+
T Consensus 13 ~~l~~H~ 19 (24)
T PF13909_consen 13 SNLKRHL 19 (24)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 3333333
No 51
>KOG1146|consensus
Probab=96.21 E-value=0.0041 Score=57.89 Aligned_cols=52 Identities=8% Similarity=0.036 Sum_probs=34.7
Q ss_pred ccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccccccccCCccccccccCCccccccccc
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLTE 198 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~h 198 (230)
+.|.+|++.-.-..+|+.|+...+....|-.+- .++-.+.....+..+-+.+
T Consensus 590 ~~C~vc~yetniarnlrihmtss~~s~~p~~~L-----------------------q~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 590 WRCEVCSYETNIARNLRIHMTASPSSSPPSLVL-----------------------QQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred cchhhhcchhhhhhccccccccCCCCCChHHHh-----------------------hhcchhhccccccCcCCCC
Confidence 778888887777778888866665555544443 5566666666666666655
No 52
>KOG1146|consensus
Probab=96.01 E-value=0.0039 Score=58.04 Aligned_cols=127 Identities=17% Similarity=0.188 Sum_probs=70.6
Q ss_pred cccccCCHHHHHHhHH-HhcCCCceecccccccccCHHHHHHHHHhhc------------------------CCCCcccc
Q psy14799 72 CEQRFLNKSHLAYHVK-THTGERDVTCGLCFQSFYSKADLVRHMKGRH------------------------LNIRSFTC 126 (230)
Q Consensus 72 C~~~f~~~~~l~~H~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H------------------------~~~~~~~C 126 (230)
++..+.+...+..|+. .+.-.+.++|+.|+..|.....|..|++..| .+.++|.|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C 521 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPC 521 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccc
Confidence 3333444444444432 2333455666666666666666666666433 23467999
Q ss_pred cccccccCChHHHHHHHHhh-cCCC----------ChhhhhhccCCCCCCCCC------cc---cccccccCCccccccc
Q psy14799 127 HLCNKGFFRKSYLKVHLLTI-HMQP----------ADEAQNAIHRCYACRCAG------HC---LQIGTHSGGICGSMWR 186 (230)
Q Consensus 127 ~~C~~~f~~~~~l~~H~~~~-H~~~----------~~~~c~~~~~C~~c~~~~------~~---~~~~~h~~~~C~~~f~ 186 (230)
..|..++.+..+|.+|++.. |..+ ....-.. ..+..+.... .+ .-.-.-.|++|+..-.
T Consensus 522 ~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~-~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetn 600 (1406)
T KOG1146|consen 522 RACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPA-SVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETN 600 (1406)
T ss_pred eeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhh-hhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhh
Confidence 99999999999999998754 3222 0000000 0011111111 11 1123445579999999
Q ss_pred cCCcccccccccc
Q psy14799 187 VETWHPCHLLTER 199 (230)
Q Consensus 187 ~~~~l~~H~~~h~ 199 (230)
-..+|+.||..-.
T Consensus 601 iarnlrihmtss~ 613 (1406)
T KOG1146|consen 601 IARNLRIHMTASP 613 (1406)
T ss_pred hhhccccccccCC
Confidence 9999999987543
No 53
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.84 E-value=0.0052 Score=30.25 Aligned_cols=21 Identities=38% Similarity=0.868 Sum_probs=14.8
Q ss_pred ccccccccccCChHHHHHhHh
Q psy14799 36 FQCDECDMSFGTHQALGRHKQ 56 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~ 56 (230)
|.|..|++.|.+...|..|+.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 567777777777777777765
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.84 E-value=0.0025 Score=31.45 Aligned_cols=20 Identities=40% Similarity=0.793 Sum_probs=12.7
Q ss_pred ccccccccccCChHHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVHL 143 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~ 143 (230)
|.|..|++.|.+...|..|+
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~ 21 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHM 21 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCT
T ss_pred CCcccCCCCcCCHHHHHHHH
Confidence 45666666666666666663
No 55
>PRK04860 hypothetical protein; Provisional
Probab=95.71 E-value=0.0067 Score=44.00 Aligned_cols=38 Identities=18% Similarity=0.430 Sum_probs=24.4
Q ss_pred ceecccccccccCHHHHHHHHHhhcCCCCcccccccccccCCh
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRK 136 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~ 136 (230)
+|.|. |+. ....+.+|.+ .|.++++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~r-i~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNR-VVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHH-HhcCCccEECCCCCceeEEe
Confidence 46776 665 3445666766 67777777777777666543
No 56
>KOG2482|consensus
Probab=95.69 E-value=0.037 Score=44.34 Aligned_cols=73 Identities=16% Similarity=0.183 Sum_probs=43.5
Q ss_pred ccccccccccCChHHHHHHHHhhcCCC-----ChhhhhhccCCCCCCCCCcccccccccCCccccccccCCccccccccc
Q psy14799 124 FTCHLCNKGFFRKSYLKVHLLTIHMQP-----ADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLTE 198 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~~~~H~~~-----~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~h 198 (230)
..|-.|.....+...|..|+..+|.-. +.|.-....+-.. -.+....+..-.|-.|.-.|-....|..||.-+
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~Yqrvrv--iNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRV--INYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhH--HHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 589999999999999999999999311 1110000000000 000000112223358999999999999999754
No 57
>PRK04860 hypothetical protein; Provisional
Probab=95.66 E-value=0.0051 Score=44.62 Aligned_cols=36 Identities=22% Similarity=0.619 Sum_probs=18.4
Q ss_pred cccccccccccCCHHHHHHhHHHhcCCCceeccccccccc
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFY 105 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 105 (230)
+|.|. |+. ....+..|.++|.++++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 35554 554 3334455555555555555555555443
No 58
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=94.63 E-value=0.033 Score=26.83 Aligned_cols=18 Identities=28% Similarity=0.774 Sum_probs=11.2
Q ss_pred ccccccccccCChHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVH 142 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H 142 (230)
..|+.||+.| ..+.|.+|
T Consensus 3 ~~C~~CgR~F-~~~~l~~H 20 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKH 20 (25)
T ss_pred CcCCCCCCEE-CHHHHHHH
Confidence 3566677776 44566666
No 59
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=94.45 E-value=0.039 Score=26.58 Aligned_cols=19 Identities=42% Similarity=0.992 Sum_probs=9.2
Q ss_pred cccccccccCCHHHHHHhHH
Q psy14799 68 PCDLCEQRFLNKSHLAYHVK 87 (230)
Q Consensus 68 ~C~~C~~~f~~~~~l~~H~~ 87 (230)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 3344555543
No 60
>KOG2785|consensus
Probab=94.42 E-value=0.17 Score=41.37 Aligned_cols=54 Identities=22% Similarity=0.385 Sum_probs=42.7
Q ss_pred ceecccccccccCHHHHHHHHHhhcCCC----------------------Ccccccccc---cccCChHHHHHHHHhhc
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHLNI----------------------RSFTCHLCN---KGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~----------------------~~~~C~~C~---~~f~~~~~l~~H~~~~H 147 (230)
|-.|-.|++.+.+...-..||...|.-- ..+.|-.|+ +.|.+....+.||...-
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K~ 244 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDKG 244 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhcc
Confidence 4679999999999999999998666422 227788888 89999999999987653
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=94.26 E-value=0.037 Score=28.89 Aligned_cols=23 Identities=35% Similarity=0.667 Sum_probs=16.7
Q ss_pred cccccccccccCChHHHHHHHHh
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLT 145 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~ 145 (230)
+|.|.+|+..|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46788888888877777777543
No 62
>KOG2785|consensus
Probab=94.22 E-value=0.19 Score=41.10 Aligned_cols=51 Identities=22% Similarity=0.405 Sum_probs=44.4
Q ss_pred cccccccccccCCHHHHHHhHHHhcCC-----------------------Cceeccccc---ccccCHHHHHHHHHh
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVKTHTGE-----------------------RDVTCGLCF---QSFYSKADLVRHMKG 116 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~~ 116 (230)
|-.|..|+..+.+...-..||..++|- ..+.|-.|+ +.|.+..+.+.||..
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 367999999999999999999988762 347899999 999999999999983
No 63
>KOG2893|consensus
Probab=93.68 E-value=0.018 Score=43.69 Aligned_cols=49 Identities=33% Similarity=0.762 Sum_probs=38.3
Q ss_pred ecccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhcCC
Q psy14799 96 TCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIHMQ 149 (230)
Q Consensus 96 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 149 (230)
.|=+|++.|.+..-|.+|++..| |+|.+|.+...+--.|..|-..+|.+
T Consensus 12 wcwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsihcmqvhke 60 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIHCMQVHKE 60 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceeehhhhhhh
Confidence 47788888888888888877444 88888888777777888887777743
No 64
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.33 E-value=0.079 Score=27.57 Aligned_cols=22 Identities=27% Similarity=0.590 Sum_probs=17.2
Q ss_pred ceecccccccccCHHHHHHHHH
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~ 115 (230)
+|.|.+|+..|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4778888888888888877775
No 65
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=92.76 E-value=0.26 Score=33.34 Aligned_cols=24 Identities=21% Similarity=0.346 Sum_probs=22.2
Q ss_pred ccc----cccccccCChHHHHHHHHhhc
Q psy14799 124 FTC----HLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C----~~C~~~f~~~~~l~~H~~~~H 147 (230)
|.| ..|+..+.+...+.+|+...|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 889 999999999999999988877
No 66
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.68 E-value=0.033 Score=46.90 Aligned_cols=62 Identities=31% Similarity=0.559 Sum_probs=35.8
Q ss_pred cccccccccccCCHHHHHHhHH--HhcCC--Cceecc--cccccccCHHHHHHHHHhhcCCCCcccccc
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVK--THTGE--RDVTCG--LCFQSFYSKADLVRHMKGRHLNIRSFTCHL 128 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~ 128 (230)
++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+.+|.. .|.+..++.+..
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 356 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHIL-LHTSISPAKEKL 356 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcc-cccCCCcccccc
Confidence 4555566666666666666665 55555 566666 46666666666666655 555555444433
No 67
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=91.97 E-value=0.088 Score=30.33 Aligned_cols=29 Identities=21% Similarity=0.266 Sum_probs=23.2
Q ss_pred CCCCcccccccccccCChHHHHHHHHhhc
Q psy14799 119 LNIRSFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 119 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
.++.-+.||-|+..|....++.+|....|
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 45566788888888888888888877777
No 68
>KOG2893|consensus
Probab=91.50 E-value=0.036 Score=42.17 Aligned_cols=43 Identities=21% Similarity=0.170 Sum_probs=22.1
Q ss_pred hhhhhhhhccCHHHHHHHHHhhcCCCcccccccccccCChHHHHHhH
Q psy14799 9 YWNKNYFFFLMGLKLEAHITTVHGDKSFQCDECDMSFGTHQALGRHK 55 (230)
Q Consensus 9 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 55 (230)
||--|++.|.+..-|..|++. +-|+|.+|.+...+--.|..|.
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihc 54 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHC 54 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeeh
Confidence 334455566665555555543 2355666655544444455443
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=91.02 E-value=0.11 Score=26.85 Aligned_cols=8 Identities=25% Similarity=1.169 Sum_probs=4.2
Q ss_pred cccccccc
Q psy14799 123 SFTCHLCN 130 (230)
Q Consensus 123 ~~~C~~C~ 130 (230)
++.|++|+
T Consensus 17 ~~~CP~Cg 24 (33)
T cd00350 17 PWVCPVCG 24 (33)
T ss_pred CCcCcCCC
Confidence 45555555
No 70
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.91 E-value=0.019 Score=48.30 Aligned_cols=49 Identities=27% Similarity=0.392 Sum_probs=27.1
Q ss_pred hhhhccCHHHHHHHHH--hhcCC--Cccccc--ccccccCChHHHHHhHhccCCC
Q psy14799 13 NYFFFLMGLKLEAHIT--TVHGD--KSFQCD--ECDMSFGTHQALGRHKQCKHMD 61 (230)
Q Consensus 13 C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~~~~ 61 (230)
|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...+...
T Consensus 295 ~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 349 (467)
T COG5048 295 CNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSI 349 (467)
T ss_pred ccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCC
Confidence 4445555555555555 45555 555555 4555555555555555554444
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=90.70 E-value=0.14 Score=29.58 Aligned_cols=28 Identities=32% Similarity=0.622 Sum_probs=20.0
Q ss_pred CCCceecccccccccCHHHHHHHHHhhc
Q psy14799 91 GERDVTCGLCFQSFYSKADLVRHMKGRH 118 (230)
Q Consensus 91 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H 118 (230)
|+.-+.|+-|+..|....++.+|....|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4555677777777777777777777555
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=85.44 E-value=2 Score=28.98 Aligned_cols=24 Identities=25% Similarity=0.376 Sum_probs=21.7
Q ss_pred eec----ccccccccCHHHHHHHHHhhc
Q psy14799 95 VTC----GLCFQSFYSKADLVRHMKGRH 118 (230)
Q Consensus 95 ~~C----~~C~~~f~~~~~l~~H~~~~H 118 (230)
|.| ..|+....+...+..|++..|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 899 999999999999999999655
No 73
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=85.33 E-value=0.72 Score=24.32 Aligned_cols=33 Identities=21% Similarity=0.438 Sum_probs=16.4
Q ss_pred eecccccccccCHHHHHHHHHhhcCCCCccccccccccc
Q psy14799 95 VTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGF 133 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 133 (230)
..|+.|+..|.-..+. .-......+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~------ip~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEK------IPPKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHH------CCCCCcEEECCCCCCEe
Confidence 3566666666554442 11222345666666555
No 74
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=85.09 E-value=0.68 Score=24.57 Aligned_cols=32 Identities=28% Similarity=0.466 Sum_probs=14.9
Q ss_pred ecccccccccCHHHHHHHHHhhcCCCCccccccccccc
Q psy14799 96 TCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGF 133 (230)
Q Consensus 96 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 133 (230)
.|+.|+..|.-..+- .-.+....+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~~------l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDK------LPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHH------cccCCcEEECCCCCcEe
Confidence 455565555544331 11222345566665554
No 75
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=84.86 E-value=1.3 Score=29.98 Aligned_cols=24 Identities=25% Similarity=0.620 Sum_probs=14.8
Q ss_pred cccccccccccCChHHHHHHHHhhc
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
.|.|+.|...|=..-+.-.| ...|
T Consensus 81 ~y~C~~C~~~FC~dCD~fiH-e~Lh 104 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVH-ESLH 104 (112)
T ss_pred ceeCCCCCCccccccchhhh-hhcc
Confidence 46677777666666666666 4444
No 76
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=84.73 E-value=0.59 Score=27.45 Aligned_cols=10 Identities=20% Similarity=0.328 Sum_probs=6.0
Q ss_pred Cceecccccc
Q psy14799 93 RDVTCGLCFQ 102 (230)
Q Consensus 93 ~~~~C~~C~~ 102 (230)
.+|.|+.||+
T Consensus 49 ~~Y~Cp~CGF 58 (61)
T COG2888 49 NPYRCPKCGF 58 (61)
T ss_pred CceECCCcCc
Confidence 4566666664
No 77
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=84.70 E-value=0.15 Score=39.10 Aligned_cols=13 Identities=23% Similarity=0.736 Sum_probs=10.3
Q ss_pred ccccccccccCCh
Q psy14799 124 FTCHLCNKGFFRK 136 (230)
Q Consensus 124 ~~C~~C~~~f~~~ 136 (230)
..||.||.+|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 5799999887754
No 78
>KOG2186|consensus
Probab=84.64 E-value=0.67 Score=35.84 Aligned_cols=45 Identities=22% Similarity=0.434 Sum_probs=21.0
Q ss_pred ccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhH
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHV 86 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 86 (230)
|.|..||....- ..+.+|+..-++. -|.|..|+..|.. .+...|.
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn~----~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRNA----YFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccCC----eeEEeeccccccc-chhhhhh
Confidence 445555554433 2344455444332 3555555555554 3444443
No 79
>PF12907 zf-met2: Zinc-binding
Probab=84.29 E-value=0.98 Score=24.43 Aligned_cols=32 Identities=25% Similarity=0.454 Sum_probs=21.8
Q ss_pred ccccccccccC---ChHHHHHHHHhhcCCCChhhh
Q psy14799 124 FTCHLCNKGFF---RKSYLKVHLLTIHMQPADEAQ 155 (230)
Q Consensus 124 ~~C~~C~~~f~---~~~~l~~H~~~~H~~~~~~~c 155 (230)
+.|.+|..+|. +...|..|....|....+..|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 56888885544 456788887788877655554
No 80
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=84.15 E-value=0.99 Score=24.89 Aligned_cols=24 Identities=38% Similarity=0.495 Sum_probs=10.5
Q ss_pred ccccccccccCCh----HHHHHHHHhhc
Q psy14799 124 FTCHLCNKGFFRK----SYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C~~C~~~f~~~----~~l~~H~~~~H 147 (230)
..|..|++.+... +.|.+|++..|
T Consensus 17 a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 17 AKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred EEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 4555555555442 45556644433
No 81
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=82.58 E-value=0.3 Score=37.39 Aligned_cols=22 Identities=23% Similarity=0.454 Sum_probs=13.2
Q ss_pred CcccccccccccCChHHHHHhH
Q psy14799 34 KSFQCDECDMSFGTHQALGRHK 55 (230)
Q Consensus 34 ~~~~C~~C~~~f~~~~~l~~H~ 55 (230)
+...|+.|+..|.++.-.....
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~ 25 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKI 25 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCc
Confidence 4466777777777664444443
No 82
>KOG2186|consensus
Probab=82.36 E-value=0.95 Score=35.03 Aligned_cols=46 Identities=20% Similarity=0.455 Sum_probs=26.3
Q ss_pred ccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHH
Q psy14799 67 FPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 67 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 115 (230)
|.|..||...... .+.+|+..-++ .-|.|-.|+..|.. ..+..|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 5666666655543 35556655555 34666666666665 45555554
No 83
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=81.91 E-value=0.98 Score=23.94 Aligned_cols=13 Identities=31% Similarity=0.587 Sum_probs=6.4
Q ss_pred eecccccccccCH
Q psy14799 95 VTCGLCFQSFYSK 107 (230)
Q Consensus 95 ~~C~~C~~~f~~~ 107 (230)
+.|+.|+..|.-.
T Consensus 3 ~~CP~C~~~~~v~ 15 (38)
T TIGR02098 3 IQCPNCKTSFRVV 15 (38)
T ss_pred EECCCCCCEEEeC
Confidence 3455555555443
No 84
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=80.67 E-value=1.1 Score=30.28 Aligned_cols=29 Identities=21% Similarity=0.292 Sum_probs=20.7
Q ss_pred CCCCCCCCCcccccccccCCccccccccC
Q psy14799 160 RCYACRCAGHCLQIGTHSGGICGSMWRVE 188 (230)
Q Consensus 160 ~C~~c~~~~~~~~~~~h~~~~C~~~f~~~ 188 (230)
.|+.||+.|..+....-.|+.||..|.-.
T Consensus 11 ~Cp~CG~kFYDLnk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 11 TCPSCGAKFYDLNKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cCCCCcchhccCCCCCccCCCCCCccCcc
Confidence 36666666666666666677899888876
No 85
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=79.85 E-value=0.93 Score=23.56 Aligned_cols=9 Identities=22% Similarity=0.442 Sum_probs=4.1
Q ss_pred eeccccccc
Q psy14799 95 VTCGLCFQS 103 (230)
Q Consensus 95 ~~C~~C~~~ 103 (230)
|.|..||..
T Consensus 3 ~~C~~CG~i 11 (34)
T cd00729 3 WVCPVCGYI 11 (34)
T ss_pred EECCCCCCE
Confidence 344444443
No 86
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=79.23 E-value=1.8 Score=24.56 Aligned_cols=24 Identities=38% Similarity=0.624 Sum_probs=13.8
Q ss_pred ccccccccccCCh-----HHHHHHHHhhc
Q psy14799 124 FTCHLCNKGFFRK-----SYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C~~C~~~f~~~-----~~l~~H~~~~H 147 (230)
-.|..|++.+... +.|.+|+...|
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 4566666665443 46666655444
No 87
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=77.75 E-value=2.7 Score=28.46 Aligned_cols=89 Identities=17% Similarity=0.309 Sum_probs=56.2
Q ss_pred CCcccccccccccCChHHHHHhHhccCCC--C-------CCCcccccccccccCCHHHHHHhHHHhcCCCceeccccccc
Q psy14799 33 DKSFQCDECDMSFGTHQALGRHKQCKHMD--I-------SLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQS 103 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~--~-------~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 103 (230)
+.|..|+.|+-+......|.+-.-.--.. - ......|--|...|........ ..-.....|.|+.|...
T Consensus 13 ~LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~ 90 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNV 90 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCcccccc--cccccccceeCCCCCCc
Confidence 56788999999988888887753210011 0 0011248889988876531110 00122346999999999
Q ss_pred ccCHHHHHHHHHhhcCCCCcccccccc
Q psy14799 104 FYSKADLVRHMKGRHLNIRSFTCHLCN 130 (230)
Q Consensus 104 f~~~~~l~~H~~~~H~~~~~~~C~~C~ 130 (230)
|-..-+...|.. .| .|+-|.
T Consensus 91 FC~dCD~fiHe~-Lh------~CPGC~ 110 (112)
T TIGR00622 91 FCVDCDVFVHES-LH------CCPGCI 110 (112)
T ss_pred cccccchhhhhh-cc------CCcCCC
Confidence 998888888876 55 466665
No 88
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=77.04 E-value=2.3 Score=20.56 Aligned_cols=19 Identities=21% Similarity=0.641 Sum_probs=13.9
Q ss_pred ccccccccccCChHHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVHL 143 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~ 143 (230)
..||+|++.+ ....+..|+
T Consensus 2 v~CPiC~~~v-~~~~in~HL 20 (26)
T smart00734 2 VQCPVCFREV-PENLINSHL 20 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHH
Confidence 3588888887 556777774
No 89
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=76.66 E-value=3.6 Score=29.86 Aligned_cols=32 Identities=19% Similarity=0.274 Sum_probs=22.7
Q ss_pred CCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccc
Q psy14799 91 GERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKG 132 (230)
Q Consensus 91 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 132 (230)
+..-|.|+.|+..|+...++. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME----------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH----------cCCcCCCCCCE
Confidence 445588888888887776663 14888888864
No 90
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=74.86 E-value=2.9 Score=29.96 Aligned_cols=12 Identities=17% Similarity=0.875 Sum_probs=6.0
Q ss_pred cccccccccccC
Q psy14799 35 SFQCDECDMSFG 46 (230)
Q Consensus 35 ~~~C~~C~~~f~ 46 (230)
-|.|+.|+..|.
T Consensus 99 ~Y~Cp~C~~~y~ 110 (147)
T smart00531 99 YYKCPNCQSKYT 110 (147)
T ss_pred EEECcCCCCEee
Confidence 355555555544
No 91
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=74.41 E-value=2.5 Score=32.31 Aligned_cols=30 Identities=23% Similarity=0.584 Sum_probs=22.7
Q ss_pred CCCcccccccccccCChHHHHHHHHhhcCC
Q psy14799 120 NIRSFTCHLCNKGFFRKSYLKVHLLTIHMQ 149 (230)
Q Consensus 120 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 149 (230)
.+..|.|+.|++.|.-..-...||...|.+
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 344599999999999999999999999953
No 92
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=73.68 E-value=1.7 Score=25.66 Aligned_cols=10 Identities=30% Similarity=0.418 Sum_probs=6.2
Q ss_pred Cceecccccc
Q psy14799 93 RDVTCGLCFQ 102 (230)
Q Consensus 93 ~~~~C~~C~~ 102 (230)
.+|.|+.||+
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 4566666664
No 93
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=73.46 E-value=5.5 Score=28.50 Aligned_cols=38 Identities=16% Similarity=0.300 Sum_probs=22.2
Q ss_pred CCCceecccccccccCHHHHHHHHHhhcCCCCccccccccccc
Q psy14799 91 GERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGF 133 (230)
Q Consensus 91 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 133 (230)
+...|.|+.|+..|....++.. ... ...|.|+.||...
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~----~d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL----LDM-DGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh----cCC-CCcEECCCCCCEE
Confidence 3445788888877775443321 112 2338888888654
No 94
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=73.32 E-value=3.7 Score=29.81 Aligned_cols=17 Identities=24% Similarity=0.262 Sum_probs=7.9
Q ss_pred cccccccccccCCHHHH
Q psy14799 66 PFPCDLCEQRFLNKSHL 82 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l 82 (230)
-|.|+.|+..|.....+
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 34455555444444433
No 95
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=71.07 E-value=4.7 Score=29.92 Aligned_cols=32 Identities=19% Similarity=0.397 Sum_probs=22.7
Q ss_pred CCceecccccccccCHHHHHHHHHhhcCCCCccccccccccc
Q psy14799 92 ERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGF 133 (230)
Q Consensus 92 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 133 (230)
..-|.|+.|+..|+...++. ..|.|+.||...
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~----------~~F~Cp~Cg~~L 146 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME----------YGFRCPQCGEML 146 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh----------cCCcCCCCCCCC
Confidence 45588888888887766642 248888888653
No 96
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=69.76 E-value=0.66 Score=25.39 Aligned_cols=12 Identities=33% Similarity=1.140 Sum_probs=7.2
Q ss_pred ccccccccccCC
Q psy14799 36 FQCDECDMSFGT 47 (230)
Q Consensus 36 ~~C~~C~~~f~~ 47 (230)
|.|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (42)
T PF09723_consen 6 YRCEECGHEFEV 17 (42)
T ss_pred EEeCCCCCEEEE
Confidence 566666666543
No 97
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=68.44 E-value=2.9 Score=23.23 Aligned_cols=10 Identities=20% Similarity=0.946 Sum_probs=5.3
Q ss_pred cccccccccc
Q psy14799 36 FQCDECDMSF 45 (230)
Q Consensus 36 ~~C~~C~~~f 45 (230)
|.|+.||..+
T Consensus 4 y~C~~CG~~~ 13 (46)
T PRK00398 4 YKCARCGREV 13 (46)
T ss_pred EECCCCCCEE
Confidence 5555555544
No 98
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=67.86 E-value=3.1 Score=23.06 Aligned_cols=10 Identities=30% Similarity=0.846 Sum_probs=4.4
Q ss_pred cccccccccc
Q psy14799 36 FQCDECDMSF 45 (230)
Q Consensus 36 ~~C~~C~~~f 45 (230)
|.|..||..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 99
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=66.98 E-value=0.88 Score=32.82 Aligned_cols=14 Identities=29% Similarity=0.494 Sum_probs=8.5
Q ss_pred ccccccccccCChH
Q psy14799 124 FTCHLCNKGFFRKS 137 (230)
Q Consensus 124 ~~C~~C~~~f~~~~ 137 (230)
++|+.||.+|.+..
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 56666666666543
No 100
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=66.71 E-value=3.4 Score=24.24 Aligned_cols=19 Identities=26% Similarity=0.487 Sum_probs=8.8
Q ss_pred HHHHhHHHhcCCCceeccc
Q psy14799 81 HLAYHVKTHTGERDVTCGL 99 (230)
Q Consensus 81 ~l~~H~~~h~~~~~~~C~~ 99 (230)
.|..|+...-..++..|++
T Consensus 25 ~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 25 ELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHccCCCCcEECCC
Confidence 4555555444444455555
No 101
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=66.61 E-value=4.2 Score=28.12 Aligned_cols=33 Identities=12% Similarity=-0.052 Sum_probs=21.9
Q ss_pred CCCCCCCCCcccccccccCCccccccccCCccc
Q psy14799 160 RCYACRCAGHCLQIGTHSGGICGSMWRVETWHP 192 (230)
Q Consensus 160 ~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~ 192 (230)
.|+.|++.|..+....-.|+.||..|.-...++
T Consensus 11 ~Cp~cg~kFYDLnk~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 11 ICPNTGSKFYDLNRRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred cCCCcCccccccCCCCccCCCcCCccCcchhhc
Confidence 366666666666666666678998887664443
No 102
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=65.91 E-value=4.3 Score=23.06 Aligned_cols=38 Identities=18% Similarity=0.373 Sum_probs=16.8
Q ss_pred CceecccccccccCHHHHHHHHHhhcCCCCcccccccc
Q psy14799 93 RDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCN 130 (230)
Q Consensus 93 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~ 130 (230)
+.+.|..||..|.....=+......--...|-.|+.|-
T Consensus 3 k~l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~CR 40 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSCR 40 (49)
T ss_pred eeEEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHHH
Confidence 44566666665554333333332111112345565554
No 103
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=65.85 E-value=5.6 Score=29.52 Aligned_cols=15 Identities=33% Similarity=0.561 Sum_probs=7.1
Q ss_pred cccccccccccCCHH
Q psy14799 66 PFPCDLCEQRFLNKS 80 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~ 80 (230)
-|.|+.|+..|....
T Consensus 117 ~Y~Cp~C~~rytf~e 131 (178)
T PRK06266 117 FFFCPNCHIRFTFDE 131 (178)
T ss_pred EEECCCCCcEEeHHH
Confidence 344555554444443
No 104
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=65.19 E-value=2.4 Score=21.68 Aligned_cols=11 Identities=27% Similarity=0.872 Sum_probs=6.3
Q ss_pred ccccccccccC
Q psy14799 36 FQCDECDMSFG 46 (230)
Q Consensus 36 ~~C~~C~~~f~ 46 (230)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45666766654
No 105
>PHA00626 hypothetical protein
Probab=63.70 E-value=1.8 Score=25.10 Aligned_cols=12 Identities=17% Similarity=0.268 Sum_probs=5.8
Q ss_pred cccccccccccC
Q psy14799 66 PFPCDLCEQRFL 77 (230)
Q Consensus 66 ~~~C~~C~~~f~ 77 (230)
.|.|+.||..|.
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 355555554444
No 106
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=62.78 E-value=1.3 Score=25.40 Aligned_cols=11 Identities=27% Similarity=1.041 Sum_probs=4.8
Q ss_pred ccccccccccC
Q psy14799 36 FQCDECDMSFG 46 (230)
Q Consensus 36 ~~C~~C~~~f~ 46 (230)
|+|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44444444443
No 107
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=59.51 E-value=4.9 Score=24.41 Aligned_cols=29 Identities=21% Similarity=0.280 Sum_probs=13.1
Q ss_pred cCCCCCCCCCcccccccccCCccccccccC
Q psy14799 159 HRCYACRCAGHCLQIGTHSGGICGSMWRVE 188 (230)
Q Consensus 159 ~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~ 188 (230)
..|..|++.|. +-.+.|.|..||..|=..
T Consensus 10 ~~C~~C~~~F~-~~~rrhhCr~CG~~vC~~ 38 (69)
T PF01363_consen 10 SNCMICGKKFS-LFRRRHHCRNCGRVVCSS 38 (69)
T ss_dssp SB-TTT--B-B-SSS-EEE-TTT--EEECC
T ss_pred CcCcCcCCcCC-CceeeEccCCCCCEECCc
Confidence 34777777773 346777777777777543
No 108
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=59.28 E-value=11 Score=26.39 Aligned_cols=13 Identities=31% Similarity=0.790 Sum_probs=7.3
Q ss_pred cccccccccCChH
Q psy14799 125 TCHLCNKGFFRKS 137 (230)
Q Consensus 125 ~C~~C~~~f~~~~ 137 (230)
.||.|.-+|.+++
T Consensus 123 vCPvCkTSFKss~ 135 (140)
T PF05290_consen 123 VCPVCKTSFKSSS 135 (140)
T ss_pred CCCcccccccccc
Confidence 3666666665543
No 109
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=59.12 E-value=5.5 Score=29.10 Aligned_cols=24 Identities=21% Similarity=0.647 Sum_probs=15.2
Q ss_pred cccccccccccCChHHHHHhHhccCCCCCCCccccccccc
Q psy14799 35 SFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQ 74 (230)
Q Consensus 35 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~ 74 (230)
-|.|++||.++. +. .|-.||+||.
T Consensus 134 ~~vC~vCGy~~~-------------ge---~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTHE-------------GE---APEVCPICGA 157 (166)
T ss_pred EEEcCCCCCccc-------------CC---CCCcCCCCCC
Confidence 477777776542 21 5677777773
No 110
>KOG2593|consensus
Probab=58.95 E-value=11 Score=31.71 Aligned_cols=37 Identities=19% Similarity=0.350 Sum_probs=24.4
Q ss_pred CCCceecccccccccCHHHHHHHHHhhcCCCCccccccccc
Q psy14799 91 GERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNK 131 (230)
Q Consensus 91 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 131 (230)
....|.|+.|.+.|.....+.- .-.....|.|..|+-
T Consensus 125 ~~~~Y~Cp~C~kkyt~Lea~~L----~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 125 NVAGYVCPNCQKKYTSLEALQL----LDNETGEFHCENCGG 161 (436)
T ss_pred ccccccCCccccchhhhHHHHh----hcccCceEEEecCCC
Confidence 3456888888888887666542 223334588888874
No 111
>KOG4167|consensus
Probab=58.31 E-value=4.9 Score=36.12 Aligned_cols=23 Identities=17% Similarity=0.037 Sum_probs=19.5
Q ss_pred hhhhhhhccCHHHHHHHHHhhcC
Q psy14799 10 WNKNYFFFLMGLKLEAHITTVHG 32 (230)
Q Consensus 10 C~~C~~~f~~~~~l~~H~~~h~~ 32 (230)
|..|+++|....++.+||++|.-
T Consensus 795 CreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 795 CRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred hHHHHHHHHHHhhhhHHHHHHHH
Confidence 45588899999999999999853
No 112
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=56.67 E-value=5.4 Score=31.16 Aligned_cols=23 Identities=13% Similarity=0.238 Sum_probs=17.1
Q ss_pred CCCcccccccccccCChHHHHHH
Q psy14799 120 NIRSFTCHLCNKGFFRKSYLKVH 142 (230)
Q Consensus 120 ~~~~~~C~~C~~~f~~~~~l~~H 142 (230)
..++++||.|+........|..=
T Consensus 206 k~k~~PCPKCg~et~eTkdLSmS 228 (314)
T PF06524_consen 206 KGKPIPCPKCGYETQETKDLSMS 228 (314)
T ss_pred cCCCCCCCCCCCcccccccceee
Confidence 34789999999887776666554
No 113
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=56.42 E-value=5.8 Score=22.49 Aligned_cols=10 Identities=20% Similarity=0.873 Sum_probs=4.0
Q ss_pred cccccccccc
Q psy14799 67 FPCDLCEQRF 76 (230)
Q Consensus 67 ~~C~~C~~~f 76 (230)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 3344444333
No 114
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=56.30 E-value=11 Score=30.80 Aligned_cols=23 Identities=22% Similarity=0.534 Sum_probs=19.0
Q ss_pred CcccccccccccCCHHHHHHhHH
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVK 87 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~ 87 (230)
..+-|+.|++.|....-+..|..
T Consensus 237 ~~~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 237 PKVYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred cceeeHhhhhHhhhhHHHHHHHh
Confidence 34789999999999888888865
No 115
>PF14353 CpXC: CpXC protein
Probab=54.94 E-value=11 Score=26.02 Aligned_cols=17 Identities=41% Similarity=0.651 Sum_probs=9.0
Q ss_pred cccccccccccCChHHH
Q psy14799 123 SFTCHLCNKGFFRKSYL 139 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l 139 (230)
.+.|+.||..|.-...+
T Consensus 38 ~~~CP~Cg~~~~~~~p~ 54 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPL 54 (128)
T ss_pred EEECCCCCCceecCCCE
Confidence 35566666555544443
No 116
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=54.80 E-value=2.4 Score=19.85 Aligned_cols=6 Identities=33% Similarity=1.259 Sum_probs=3.2
Q ss_pred cccccc
Q psy14799 38 CDECDM 43 (230)
Q Consensus 38 C~~C~~ 43 (230)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555554
No 117
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=54.75 E-value=9.2 Score=32.86 Aligned_cols=28 Identities=21% Similarity=0.301 Sum_probs=20.5
Q ss_pred cccccccccccCChHHHHHHHHhhcCCC
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIHMQP 150 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 150 (230)
-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 4667778888888888888877777543
No 118
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=53.48 E-value=10 Score=28.99 Aligned_cols=26 Identities=23% Similarity=0.411 Sum_probs=13.6
Q ss_pred ceecccccccccCHHHHHHHHHhhcC
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHL 119 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 119 (230)
.|.|+.|++.|....-+..|+...|.
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred EECCCCCCcccCChHHHHHHHhhcCH
Confidence 35666666666666666666654553
No 119
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=52.75 E-value=6.1 Score=18.73 Aligned_cols=9 Identities=33% Similarity=0.918 Sum_probs=5.3
Q ss_pred Ccccccccc
Q psy14799 122 RSFTCHLCN 130 (230)
Q Consensus 122 ~~~~C~~C~ 130 (230)
.+|.|+.||
T Consensus 15 v~f~CPnCG 23 (24)
T PF07754_consen 15 VPFPCPNCG 23 (24)
T ss_pred ceEeCCCCC
Confidence 346666665
No 120
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=52.25 E-value=13 Score=25.96 Aligned_cols=21 Identities=24% Similarity=0.249 Sum_probs=18.1
Q ss_pred ccccccccCCccccccccccCCCC
Q psy14799 180 ICGSMWRVETWHPCHLLTERNRQP 203 (230)
Q Consensus 180 ~C~~~f~~~~~l~~H~~~h~~~~~ 203 (230)
.+|+.|.+ |++|+.+|.|=.|
T Consensus 81 EDGkkfKS---LKRHL~t~~gmTP 101 (148)
T COG4957 81 EDGKKFKS---LKRHLTTHYGLTP 101 (148)
T ss_pred ccCcchHH---HHHHHhcccCCCH
Confidence 77999987 9999999987765
No 121
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=51.45 E-value=4 Score=19.73 Aligned_cols=10 Identities=30% Similarity=0.889 Sum_probs=6.5
Q ss_pred cccccccccC
Q psy14799 37 QCDECDMSFG 46 (230)
Q Consensus 37 ~C~~C~~~f~ 46 (230)
.|+.||..|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4777776664
No 122
>KOG4167|consensus
Probab=50.79 E-value=4.6 Score=36.31 Aligned_cols=26 Identities=35% Similarity=0.662 Sum_probs=23.6
Q ss_pred cccccccccccCCHHHHHHhHHHhcC
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVKTHTG 91 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~~h~~ 91 (230)
.|.|.+|++.|....++..||++|.-
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 39999999999999999999999863
No 123
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=49.45 E-value=11 Score=26.55 Aligned_cols=25 Identities=32% Similarity=0.592 Sum_probs=15.7
Q ss_pred cccccccccccCCHHHHHHhHHHhcCCC
Q psy14799 66 PFPCDLCEQRFLNKSHLAYHVKTHTGER 93 (230)
Q Consensus 66 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 93 (230)
...|-+||+.|.. |.+|++.|+|-.
T Consensus 72 ~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 72 YIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp -EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eeEEccCCcccch---HHHHHHHccCCC
Confidence 3679999999987 478999987754
No 124
>PF15269 zf-C2H2_7: Zinc-finger
Probab=49.23 E-value=15 Score=20.24 Aligned_cols=21 Identities=33% Similarity=0.491 Sum_probs=13.7
Q ss_pred eecccccccccCHHHHHHHHH
Q psy14799 95 VTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~ 115 (230)
|+|-.|..+..-.+.|..||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 566666666666666666665
No 125
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=49.03 E-value=3.3 Score=25.66 Aligned_cols=8 Identities=25% Similarity=0.654 Sum_probs=3.6
Q ss_pred ccccccCC
Q psy14799 128 LCNKGFFR 135 (230)
Q Consensus 128 ~C~~~f~~ 135 (230)
.||.+|.+
T Consensus 34 eCg~tF~t 41 (72)
T PRK09678 34 NCSATFIT 41 (72)
T ss_pred CCCCEEEE
Confidence 44444443
No 126
>KOG1280|consensus
Probab=47.10 E-value=18 Score=29.69 Aligned_cols=26 Identities=27% Similarity=0.521 Sum_probs=13.2
Q ss_pred ccccccccccCChHHHHHhHhccCCC
Q psy14799 36 FQCDECDMSFGTHQALGRHKQCKHMD 61 (230)
Q Consensus 36 ~~C~~C~~~f~~~~~l~~H~~~~~~~ 61 (230)
|.|+.|+.+-.+...|..|+...|.+
T Consensus 80 ftCPyC~~~Gfte~~f~~Hv~s~Hpd 105 (381)
T KOG1280|consen 80 FTCPYCGIMGFTERQFGTHVLSQHPE 105 (381)
T ss_pred ccCCcccccccchhHHHHHhhhcCcc
Confidence 55555555544455555555444443
No 127
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=46.56 E-value=7.8 Score=24.09 Aligned_cols=10 Identities=40% Similarity=1.268 Sum_probs=5.7
Q ss_pred cccccccccc
Q psy14799 36 FQCDECDMSF 45 (230)
Q Consensus 36 ~~C~~C~~~f 45 (230)
|+|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 5666666543
No 128
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=46.48 E-value=9 Score=19.26 Aligned_cols=7 Identities=29% Similarity=0.899 Sum_probs=2.8
Q ss_pred ccccccc
Q psy14799 67 FPCDLCE 73 (230)
Q Consensus 67 ~~C~~C~ 73 (230)
+.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 3444443
No 129
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=45.77 E-value=16 Score=31.44 Aligned_cols=29 Identities=24% Similarity=0.353 Sum_probs=21.6
Q ss_pred CceecccccccccCHHHHHHHHHhhcCCC
Q psy14799 93 RDVTCGLCFQSFYSKADLVRHMKGRHLNI 121 (230)
Q Consensus 93 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 121 (230)
+-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 34678888888888888888887677544
No 130
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=44.57 E-value=2.4 Score=37.94 Aligned_cols=56 Identities=23% Similarity=0.500 Sum_probs=29.1
Q ss_pred cccccccccCCHHHHHHhHHHhcCCCce-ecccccccccCHHHHHHHHHhhcCCCCccccccccc
Q psy14799 68 PCDLCEQRFLNKSHLAYHVKTHTGERDV-TCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNK 131 (230)
Q Consensus 68 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 131 (230)
.|..||-+|.-...|-.-. ..+.-+.| -|+.|.+.+.+..+-+-|.+ |..|+.||-
T Consensus 125 ~CT~CGPRfTIi~alPYDR-~nTsM~~F~lC~~C~~EY~dP~nRRfHAQ-------p~aCp~CGP 181 (750)
T COG0068 125 NCTNCGPRFTIIEALPYDR-ENTSMADFPLCPFCDKEYKDPLNRRFHAQ-------PIACPKCGP 181 (750)
T ss_pred ccCCCCcceeeeccCCCCc-ccCccccCcCCHHHHHHhcCccccccccc-------cccCcccCC
Confidence 3666776666554443221 11111222 47777777776655433332 567777774
No 131
>KOG0717|consensus
Probab=44.10 E-value=9.8 Score=32.45 Aligned_cols=22 Identities=18% Similarity=0.031 Sum_probs=19.0
Q ss_pred hhhhhhhhhccCHHHHHHHHHh
Q psy14799 8 DYWNKNYFFFLMGLKLEAHITT 29 (230)
Q Consensus 8 ~~C~~C~~~f~~~~~l~~H~~~ 29 (230)
.||.+|+++|.+...|..|..+
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 4788999999999999999765
No 132
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=43.69 E-value=13 Score=21.23 Aligned_cols=11 Identities=18% Similarity=0.522 Sum_probs=5.5
Q ss_pred eeccccccccc
Q psy14799 95 VTCGLCFQSFY 105 (230)
Q Consensus 95 ~~C~~C~~~f~ 105 (230)
|+|..|+..+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 44555555444
No 133
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=43.68 E-value=18 Score=20.64 Aligned_cols=25 Identities=20% Similarity=0.359 Sum_probs=10.9
Q ss_pred CCCCCCCCcccccccccCCcccccc
Q psy14799 161 CYACRCAGHCLQIGTHSGGICGSMW 185 (230)
Q Consensus 161 C~~c~~~~~~~~~~~h~~~~C~~~f 185 (230)
|+.|+..+...+.....|..||.++
T Consensus 23 CP~Cg~~~m~~~~~r~~C~~Cgyt~ 47 (50)
T PRK00432 23 CPRCGSGFMAEHLDRWHCGKCGYTE 47 (50)
T ss_pred CcCCCcchheccCCcEECCCcCCEE
Confidence 4444433222233444445666554
No 134
>KOG1842|consensus
Probab=43.26 E-value=15 Score=31.21 Aligned_cols=29 Identities=28% Similarity=0.414 Sum_probs=22.0
Q ss_pred cccccccccccCChHHHHHHHHhhcCCCC
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIHMQPA 151 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 151 (230)
.|.||+|...|.+...|..|+-..|.++.
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~~ed 43 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHFEED 43 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhccccc
Confidence 47788888888888888888777776554
No 135
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.16 E-value=14 Score=24.79 Aligned_cols=27 Identities=7% Similarity=-0.221 Sum_probs=19.2
Q ss_pred CCCCCCCCcccccccccCCcccccccc
Q psy14799 161 CYACRCAGHCLQIGTHSGGICGSMWRV 187 (230)
Q Consensus 161 C~~c~~~~~~~~~~~h~~~~C~~~f~~ 187 (230)
|+.|++.|..+..+.-.++.||++|+.
T Consensus 12 dPetg~KFYDLNrdPiVsPytG~s~P~ 38 (129)
T COG4530 12 DPETGKKFYDLNRDPIVSPYTGKSYPR 38 (129)
T ss_pred CccccchhhccCCCccccCcccccchH
Confidence 566666666666666667788888853
No 136
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=43.13 E-value=18 Score=29.21 Aligned_cols=47 Identities=21% Similarity=0.429 Sum_probs=25.9
Q ss_pred cccccccccCHHHHHHHHHhhcCCCCcccccccccccCChHHHHHHHHhhc
Q psy14799 97 CGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 97 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
|-.|.-.|.-... |....-.....|+|+.|...|-.--+.-.| .+.|
T Consensus 365 Cf~CQ~~fp~~~~---~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiH-e~Lh 411 (421)
T COG5151 365 CFVCQGPFPKPPV---SPFDESTSSGRYQCELCKSTFCSDCDVFIH-ETLH 411 (421)
T ss_pred ceeccCCCCCCCC---CcccccccccceechhhhhhhhhhhHHHHH-HHHh
Confidence 6666655554211 111111223458888888887777777777 4444
No 137
>PRK04023 DNA polymerase II large subunit; Validated
Probab=42.35 E-value=15 Score=34.71 Aligned_cols=9 Identities=22% Similarity=0.504 Sum_probs=4.7
Q ss_pred eeccccccc
Q psy14799 95 VTCGLCFQS 103 (230)
Q Consensus 95 ~~C~~C~~~ 103 (230)
+.|+.|+..
T Consensus 664 y~CPKCG~E 672 (1121)
T PRK04023 664 DECEKCGRE 672 (1121)
T ss_pred CcCCCCCCC
Confidence 446666543
No 138
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=42.24 E-value=56 Score=23.68 Aligned_cols=57 Identities=23% Similarity=0.336 Sum_probs=34.5
Q ss_pred ceeccccc-ccccCHHHHHHHHHhhcCCCCcccccc--cccccCChHHHHHHHHhhcCCCChhh
Q psy14799 94 DVTCGLCF-QSFYSKADLVRHMKGRHLNIRSFTCHL--CNKGFFRKSYLKVHLLTIHMQPADEA 154 (230)
Q Consensus 94 ~~~C~~C~-~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~ 154 (230)
...|+.|. .++.+. ...-.+ .+-+.|+-.|.. |... .+-.+|++|.+..|...+|-+
T Consensus 80 ~L~CPLCRG~V~GWt--vve~AR-~~LN~K~RsC~~e~C~F~-GtY~eLrKHar~~HP~~rP~~ 139 (162)
T PF07800_consen 80 ELACPLCRGEVKGWT--VVEPAR-RFLNAKKRSCSQESCSFS-GTYSELRKHARSEHPSARPSE 139 (162)
T ss_pred cccCccccCceeceE--EchHHH-HHhccCCccCcccccccc-cCHHHHHHHHHhhCCCCCCcc
Confidence 46899985 333221 112223 444556666765 6543 356699999999997776644
No 139
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=42.21 E-value=8.8 Score=27.05 Aligned_cols=12 Identities=17% Similarity=0.451 Sum_probs=6.4
Q ss_pred eecccccccccC
Q psy14799 95 VTCGLCFQSFYS 106 (230)
Q Consensus 95 ~~C~~C~~~f~~ 106 (230)
+.|..|+..|..
T Consensus 71 ~~C~~CG~~~~~ 82 (135)
T PRK03824 71 LKCRNCGNEWSL 82 (135)
T ss_pred EECCCCCCEEec
Confidence 555555555543
No 140
>KOG3408|consensus
Probab=42.14 E-value=17 Score=24.86 Aligned_cols=26 Identities=31% Similarity=0.480 Sum_probs=18.4
Q ss_pred CCCcccccccccccCChHHHHHHHHh
Q psy14799 120 NIRSFTCHLCNKGFFRKSYLKVHLLT 145 (230)
Q Consensus 120 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 145 (230)
|...|-|-.|.+-|.+...|..|.++
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 33457788888888888888887553
No 141
>KOG4377|consensus
Probab=40.10 E-value=20 Score=30.16 Aligned_cols=124 Identities=17% Similarity=0.211 Sum_probs=65.9
Q ss_pred ccccc--cccccCCHHHHHHhHHHhcCC------------Cceecc--cccccccCHHHHHHHHHhhcCCC-------Cc
Q psy14799 67 FPCDL--CEQRFLNKSHLAYHVKTHTGE------------RDVTCG--LCFQSFYSKADLVRHMKGRHLNI-------RS 123 (230)
Q Consensus 67 ~~C~~--C~~~f~~~~~l~~H~~~h~~~------------~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~-------~~ 123 (230)
|.|.. |+..+.++.++.+|..+|... ..|-|. .|.+ +-++..-|-. .|+.. .-
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~n-Fht~~~n~Gfrrth 347 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDN-FHTDKRNNGFRRTH 347 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCc-cccccccCceecce
Confidence 44533 776666677778887776432 124553 3766 2334445555 45322 13
Q ss_pred ccccccc--cccCChHHHHHHHHhhcCCCChhhh------hhccCCCCCCCCCccccc-ccccCCccccccccCCccccc
Q psy14799 124 FTCHLCN--KGFFRKSYLKVHLLTIHMQPADEAQ------NAIHRCYACRCAGHCLQI-GTHSGGICGSMWRVETWHPCH 194 (230)
Q Consensus 124 ~~C~~C~--~~f~~~~~l~~H~~~~H~~~~~~~c------~~~~~C~~c~~~~~~~~~-~~h~~~~C~~~f~~~~~l~~H 194 (230)
|.|..++ -+|. ...| +..|..+..++- .....|.+-+-.++.+=. -.-..+.|+.++.+.+.+..|
T Consensus 348 fhC~r~gCTdtfK----~~kh-k~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~sh 422 (480)
T KOG4377|consen 348 FHCQRIGCTDTFK----DSKH-KPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASH 422 (480)
T ss_pred eEEeccCCccccc----cccc-cccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhh
Confidence 6687766 4444 3445 444432221111 011223333333333311 111225999999999999999
Q ss_pred ccccc
Q psy14799 195 LLTER 199 (230)
Q Consensus 195 ~~~h~ 199 (230)
.+.|.
T Consensus 423 krkhe 427 (480)
T KOG4377|consen 423 KRKHE 427 (480)
T ss_pred hhhhh
Confidence 99885
No 142
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=39.71 E-value=12 Score=19.78 Aligned_cols=14 Identities=21% Similarity=0.558 Sum_probs=10.7
Q ss_pred cccccccccccCCh
Q psy14799 123 SFTCHLCNKGFFRK 136 (230)
Q Consensus 123 ~~~C~~C~~~f~~~ 136 (230)
.|.|+.||..|...
T Consensus 5 ~y~C~~Cg~~fe~~ 18 (41)
T smart00834 5 EYRCEDCGHTFEVL 18 (41)
T ss_pred EEEcCCCCCEEEEE
Confidence 37899999888744
No 143
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=38.65 E-value=19 Score=23.83 Aligned_cols=12 Identities=25% Similarity=0.581 Sum_probs=5.6
Q ss_pred ccccccccccCC
Q psy14799 124 FTCHLCNKGFFR 135 (230)
Q Consensus 124 ~~C~~C~~~f~~ 135 (230)
++|..||..|.+
T Consensus 3 H~CtrCG~vf~~ 14 (112)
T COG3364 3 HQCTRCGEVFDD 14 (112)
T ss_pred ceeccccccccc
Confidence 344444444444
No 144
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=38.12 E-value=6 Score=19.53 Aligned_cols=18 Identities=39% Similarity=0.776 Sum_probs=10.9
Q ss_pred ccccccccccCChHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVH 142 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H 142 (230)
|.|-.|++.| ...+.+.|
T Consensus 1 ~sCiDC~~~F-~~~~y~~H 18 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSH 18 (28)
T ss_dssp EEETTTTEEE-EGGGTTT-
T ss_pred CeeecCCCCc-CcCCcCCC
Confidence 4577788888 44455566
No 145
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=37.99 E-value=18 Score=20.76 Aligned_cols=20 Identities=20% Similarity=0.565 Sum_probs=6.3
Q ss_pred cccccccccccCChHHHHHH
Q psy14799 123 SFTCHLCNKGFFRKSYLKVH 142 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H 142 (230)
.|.|+.|+..|=-.=+.-.|
T Consensus 21 ~y~C~~C~~~FC~dCD~fiH 40 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIH 40 (51)
T ss_dssp EE--TTTT--B-HHHHHTTT
T ss_pred eEECCCCCCccccCcChhhh
Confidence 35555555444433333333
No 146
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=37.93 E-value=34 Score=18.50 Aligned_cols=23 Identities=26% Similarity=0.366 Sum_probs=13.3
Q ss_pred ccccccccccCC--hHHHHHHHHhhc
Q psy14799 124 FTCHLCNKGFFR--KSYLKVHLLTIH 147 (230)
Q Consensus 124 ~~C~~C~~~f~~--~~~l~~H~~~~H 147 (230)
-.|+.||..|.. ..+-..| ++.|
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H-~~yH 38 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLH-KKYH 38 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHH-HHHH
Confidence 467777766653 4455566 4444
No 147
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=37.57 E-value=3.2 Score=21.58 Aligned_cols=8 Identities=25% Similarity=1.020 Sum_probs=3.8
Q ss_pred cccccccc
Q psy14799 36 FQCDECDM 43 (230)
Q Consensus 36 ~~C~~C~~ 43 (230)
+.|+.|+.
T Consensus 23 ~vC~~Cg~ 30 (34)
T PF14803_consen 23 LVCPACGF 30 (34)
T ss_dssp EEETTTTE
T ss_pred eECCCCCC
Confidence 44555543
No 148
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=36.86 E-value=25 Score=27.44 Aligned_cols=58 Identities=19% Similarity=0.405 Sum_probs=32.0
Q ss_pred CcccccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCH
Q psy14799 34 KSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSK 107 (230)
Q Consensus 34 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 107 (230)
..|.|..|+..+..+ . ......-.|..|.+.|.-... ..-.|-..|.|+.|+..|...
T Consensus 111 rqFaC~~Cd~~WwRr-----v------p~rKeVSRCr~C~~rYDPVP~-----dkmwG~aef~C~~C~h~F~G~ 168 (278)
T PF15135_consen 111 RQFACSSCDHMWWRR-----V------PQRKEVSRCRKCRKRYDPVPC-----DKMWGIAEFHCPKCRHNFRGF 168 (278)
T ss_pred eeeeccccchHHHhc-----c------CcccccccccccccccCCCcc-----ccccceeeeecccccccchhh
Confidence 568888887654321 1 011123567777777654321 111244457888888888754
No 149
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=35.51 E-value=15 Score=19.71 Aligned_cols=17 Identities=24% Similarity=0.450 Sum_probs=9.1
Q ss_pred ccccccccccCChHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLK 140 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~ 140 (230)
..|+.|+-.+-+..+|.
T Consensus 20 d~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGELE 36 (41)
T ss_pred EECCCCCeEEccHHHHH
Confidence 44555555555555544
No 150
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=35.20 E-value=25 Score=19.43 Aligned_cols=7 Identities=29% Similarity=0.686 Sum_probs=3.5
Q ss_pred eeccccc
Q psy14799 95 VTCGLCF 101 (230)
Q Consensus 95 ~~C~~C~ 101 (230)
+.|+.|+
T Consensus 19 ~~CP~Cg 25 (46)
T PF12760_consen 19 FVCPHCG 25 (46)
T ss_pred CCCCCCC
Confidence 4455555
No 151
>PF13358 DDE_3: DDE superfamily endonuclease
Probab=34.54 E-value=31 Score=23.68 Aligned_cols=24 Identities=38% Similarity=0.643 Sum_probs=16.7
Q ss_pred CCCCCCCCCCchhHHhHHHhhhhhc
Q psy14799 201 RQPPYSQDLSPCDIFLLQKLKNQLN 225 (230)
Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (230)
--|||+|+++|.+. ....++..+.
T Consensus 112 ~~P~~sPdLNpiE~-~w~~lk~~~~ 135 (146)
T PF13358_consen 112 FLPPYSPDLNPIEN-VWGYLKRRIR 135 (146)
T ss_pred cccCcCCccCHHHH-HHHHHHHHHH
Confidence 34999999999994 4444444443
No 152
>KOG2593|consensus
Probab=34.45 E-value=41 Score=28.54 Aligned_cols=17 Identities=12% Similarity=0.565 Sum_probs=10.1
Q ss_pred CCcccccccccccCChH
Q psy14799 33 DKSFQCDECDMSFGTHQ 49 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~ 49 (230)
...|.|+.|.+.|....
T Consensus 126 ~~~Y~Cp~C~kkyt~Le 142 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLE 142 (436)
T ss_pred cccccCCccccchhhhH
Confidence 34467777776665443
No 153
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=34.44 E-value=30 Score=24.75 Aligned_cols=33 Identities=6% Similarity=0.116 Sum_probs=21.6
Q ss_pred ccccCCccccccccCCccccc---cccccCCCCCCC
Q psy14799 174 GTHSGGICGSMWRVETWHPCH---LLTERNRQPPYS 206 (230)
Q Consensus 174 ~~h~~~~C~~~f~~~~~l~~H---~~~h~~~~~~~~ 206 (230)
|.-.|..||+.|++-.....- +....|..-||+
T Consensus 27 RRReC~~C~~RFTTyErve~~~l~ViKkdG~re~Fd 62 (147)
T TIGR00244 27 RRRECLECHERFTTFERAELLPPTVIKQDGVREPFN 62 (147)
T ss_pred ecccCCccCCccceeeeccccccEEEcCCCCCCCCC
Confidence 566778999999987665443 223346666665
No 154
>KOG2907|consensus
Probab=33.88 E-value=19 Score=24.35 Aligned_cols=33 Identities=12% Similarity=0.253 Sum_probs=20.3
Q ss_pred hhhhhhhhhhhccCHHHHHHHHHhhcCCCcccccccccccCCh
Q psy14799 6 ILDYWNKNYFFFLMGLKLEAHITTVHGDKSFQCDECDMSFGTH 48 (230)
Q Consensus 6 ~c~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 48 (230)
.-+||+.||.....+. ......|..|...|...
T Consensus 6 ~~~FC~~CG~ll~~~~----------~~~~~~C~~Ck~~~~v~ 38 (116)
T KOG2907|consen 6 DLDFCSDCGSLLEEPS----------AQSTVLCIRCKIEYPVS 38 (116)
T ss_pred Ccchhhhhhhhccccc----------ccCceEeccccccCCHH
Confidence 4478888987543322 22335588888877654
No 155
>KOG2272|consensus
Probab=32.49 E-value=50 Score=25.90 Aligned_cols=21 Identities=29% Similarity=0.724 Sum_probs=13.9
Q ss_pred cCCCCcccccccccccCChHH
Q psy14799 118 HLNIRSFTCHLCNKGFFRKSY 138 (230)
Q Consensus 118 H~~~~~~~C~~C~~~f~~~~~ 138 (230)
|....-|.|..|.+-|.....
T Consensus 216 hWHveHFvCa~CekPFlGHrH 236 (332)
T KOG2272|consen 216 HWHVEHFVCAKCEKPFLGHRH 236 (332)
T ss_pred ccchhheeehhcCCcccchhh
Confidence 444445888888888876533
No 156
>KOG3408|consensus
Probab=32.45 E-value=31 Score=23.66 Aligned_cols=25 Identities=28% Similarity=0.474 Sum_probs=20.9
Q ss_pred CCCceecccccccccCHHHHHHHHH
Q psy14799 91 GERDVTCGLCFQSFYSKADLVRHMK 115 (230)
Q Consensus 91 ~~~~~~C~~C~~~f~~~~~l~~H~~ 115 (230)
|...|.|-.|.+-|.+...|..|.+
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHh
Confidence 3445889999999999999999987
No 157
>KOG4124|consensus
Probab=32.41 E-value=13 Score=30.55 Aligned_cols=71 Identities=18% Similarity=0.330 Sum_probs=43.6
Q ss_pred Ccccccc--cccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCCCCCcccccccccCCccccccccCCcccccccc
Q psy14799 122 RSFTCHL--CNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLT 197 (230)
Q Consensus 122 ~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~ 197 (230)
++|.|++ |++.+.....|+.|....|.....-.-++ --.-.+....-+...+++|.+++.....|.-|+..
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~-----~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~ 420 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPA-----PIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTH 420 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCC-----CCCcceeeeccCcccChhhhhhhccCCCCCceeeh
Confidence 6789976 99999999999999666673221100000 00001111122555667999999998888777643
No 158
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=32.35 E-value=17 Score=21.26 Aligned_cols=10 Identities=20% Similarity=0.972 Sum_probs=5.8
Q ss_pred Cccccccccc
Q psy14799 122 RSFTCHLCNK 131 (230)
Q Consensus 122 ~~~~C~~C~~ 131 (230)
..|-|..|..
T Consensus 30 ~tYmC~eC~~ 39 (56)
T PF09963_consen 30 HTYMCDECKE 39 (56)
T ss_pred cceeChhHHH
Confidence 4466666654
No 159
>KOG0717|consensus
Probab=32.26 E-value=27 Score=29.92 Aligned_cols=31 Identities=19% Similarity=0.195 Sum_probs=24.5
Q ss_pred ccccCCccccccccCCcccccccc-ccCCCCC
Q psy14799 174 GTHSGGICGSMWRVETWHPCHLLT-ERNRQPP 204 (230)
Q Consensus 174 ~~h~~~~C~~~f~~~~~l~~H~~~-h~~~~~~ 204 (230)
.++.|..|+.+|.++.-|..|++. -|+.-++
T Consensus 459 a~~~C~tCr~~FdSRnkLF~Hlk~tgHa~~~~ 490 (508)
T KOG0717|consen 459 ALISCTTCRESFDSRNKLFAHLKKTGHARLPS 490 (508)
T ss_pred hhHhhhhhhhhccchhHHHHHhhhcCCeeccc
Confidence 557788999999999999999975 3455443
No 160
>KOG4216|consensus
Probab=32.06 E-value=49 Score=27.75 Aligned_cols=68 Identities=15% Similarity=0.173 Sum_probs=43.7
Q ss_pred cCCCCCCCCCcccccccccCCccccccccCCccccccccccCCCCCCC-CCCCchhHHhHHHhhhhhcc
Q psy14799 159 HRCYACRCAGHCLQIGTHSGGICGSMWRVETWHPCHLLTERNRQPPYS-QDLSPCDIFLLQKLKNQLNG 226 (230)
Q Consensus 159 ~~C~~c~~~~~~~~~~~h~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 226 (230)
..|.+||-..+.+|..+-+|+.|.=.|+....-..--.--....-+.+ .+-+.|.+.++++-....+.
T Consensus 47 IPCKiCGDKSSGiHYGVITCEGCKGFFRRSQ~s~a~YsCpRqknC~iDRtnRNRCQ~CRLqKClaLGMS 115 (479)
T KOG4216|consen 47 IPCKICGDKSSGIHYGVITCEGCKGFFRRSQQSNANYSCPRQKNCLIDRTNRNRCQHCRLQKCLALGMS 115 (479)
T ss_pred EeeeeccCCCCcceeeeEeeccchHhhhhhhhccccccCCcccCCcccccccchhhHHHHHHHHHhccc
Confidence 459999999999999999999998777765433332222212221111 23467888888776655443
No 161
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=31.85 E-value=31 Score=20.75 Aligned_cols=26 Identities=15% Similarity=0.184 Sum_probs=16.2
Q ss_pred CCCCCCCCCcc-cccccccCCcccccc
Q psy14799 160 RCYACRCAGHC-LQIGTHSGGICGSMW 185 (230)
Q Consensus 160 ~C~~c~~~~~~-~~~~~h~~~~C~~~f 185 (230)
.|+.|+..... ...+.+.|+.||..+
T Consensus 30 ~C~~CG~~~~~~~~~r~~~C~~Cg~~~ 56 (69)
T PF07282_consen 30 TCPRCGHRNKKRRSGRVFTCPNCGFEM 56 (69)
T ss_pred CccCcccccccccccceEEcCCCCCEE
Confidence 36666655555 455777777777654
No 162
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=31.45 E-value=22 Score=19.02 Aligned_cols=13 Identities=31% Similarity=0.800 Sum_probs=10.0
Q ss_pred cccccccccccCC
Q psy14799 123 SFTCHLCNKGFFR 135 (230)
Q Consensus 123 ~~~C~~C~~~f~~ 135 (230)
|+.|..|+..|=.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 6788888887764
No 163
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=31.32 E-value=26 Score=18.38 Aligned_cols=10 Identities=20% Similarity=0.537 Sum_probs=5.1
Q ss_pred eccccccccc
Q psy14799 96 TCGLCFQSFY 105 (230)
Q Consensus 96 ~C~~C~~~f~ 105 (230)
.|+.|+++|.
T Consensus 3 ~C~~Cg~~Yh 12 (36)
T PF05191_consen 3 ICPKCGRIYH 12 (36)
T ss_dssp EETTTTEEEE
T ss_pred CcCCCCCccc
Confidence 4555555554
No 164
>KOG1280|consensus
Probab=30.91 E-value=37 Score=27.91 Aligned_cols=26 Identities=35% Similarity=0.580 Sum_probs=15.3
Q ss_pred cccccccccccCChHHHHHHHHhhcC
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIHM 148 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 148 (230)
.|.|++|+..=.+...|..|+...|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hp 104 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHP 104 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCc
Confidence 46666666655555566666555553
No 165
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=30.80 E-value=7.2 Score=24.05 Aligned_cols=12 Identities=33% Similarity=0.592 Sum_probs=4.9
Q ss_pred ceeccccccccc
Q psy14799 94 DVTCGLCFQSFY 105 (230)
Q Consensus 94 ~~~C~~C~~~f~ 105 (230)
...|..|+....
T Consensus 41 ~v~Cg~C~~~~~ 52 (71)
T PF05495_consen 41 RVICGKCRTEQP 52 (71)
T ss_dssp EEEETTT--EEE
T ss_pred CeECCCCCCccC
Confidence 445555554443
No 166
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=30.77 E-value=7.6 Score=25.61 Aligned_cols=11 Identities=27% Similarity=0.549 Sum_probs=5.9
Q ss_pred ccccccccccC
Q psy14799 124 FTCHLCNKGFF 134 (230)
Q Consensus 124 ~~C~~C~~~f~ 134 (230)
..|..||..|.
T Consensus 47 ~~Cg~CGls~e 57 (104)
T COG4888 47 AVCGNCGLSFE 57 (104)
T ss_pred EEcccCcceEE
Confidence 44555655544
No 167
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=30.71 E-value=34 Score=19.59 Aligned_cols=26 Identities=19% Similarity=0.386 Sum_probs=11.7
Q ss_pred CCCCCCCCcccccccccCCcccccccc
Q psy14799 161 CYACRCAGHCLQIGTHSGGICGSMWRV 187 (230)
Q Consensus 161 C~~c~~~~~~~~~~~h~~~~C~~~f~~ 187 (230)
|..|.+.|.. -.+.|.+..||+.|=.
T Consensus 5 C~~C~~~F~~-~~rk~~Cr~Cg~~~C~ 30 (57)
T cd00065 5 CMGCGKPFTL-TRRRHHCRNCGRIFCS 30 (57)
T ss_pred CcccCccccC-CccccccCcCcCCcCh
Confidence 4444444433 2244444555555543
No 168
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.54 E-value=13 Score=28.60 Aligned_cols=26 Identities=15% Similarity=0.386 Sum_probs=16.8
Q ss_pred CCcccccccccccCChHHHHHhHhcc
Q psy14799 33 DKSFQCDECDMSFGTHQALGRHKQCK 58 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~~l~~H~~~~ 58 (230)
++.+.|++|+..|.....+.--.++.
T Consensus 17 kk~ieCPvC~tkFkkeev~tgsiRii 42 (267)
T COG1655 17 KKTIECPVCNTKFKKEEVKTGSIRII 42 (267)
T ss_pred hceeccCcccchhhhhheeccceeEe
Confidence 45678888888887765544444433
No 169
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=29.24 E-value=13 Score=33.95 Aligned_cols=10 Identities=20% Similarity=0.351 Sum_probs=5.4
Q ss_pred Cceecccccc
Q psy14799 93 RDVTCGLCFQ 102 (230)
Q Consensus 93 ~~~~C~~C~~ 102 (230)
.|..|+.|+.
T Consensus 474 ~p~~Cp~Cgs 483 (730)
T COG1198 474 IPQSCPECGS 483 (730)
T ss_pred CCCCCCCCCC
Confidence 4455666653
No 170
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=28.43 E-value=40 Score=24.18 Aligned_cols=33 Identities=12% Similarity=0.270 Sum_probs=20.9
Q ss_pred ccccCCccccccccCCccccc---cccccCCCCCCC
Q psy14799 174 GTHSGGICGSMWRVETWHPCH---LLTERNRQPPYS 206 (230)
Q Consensus 174 ~~h~~~~C~~~f~~~~~l~~H---~~~h~~~~~~~~ 206 (230)
|.-.|+.||..|++-.....= +-.-.|...||+
T Consensus 27 RRReC~~C~~RFTTfE~~El~~~~VvKkdg~Re~F~ 62 (156)
T COG1327 27 RRRECLECGERFTTFERAELRPLIVVKKDGRREPFD 62 (156)
T ss_pred hhhcccccccccchhheeeeccceEECcCCCcCCCC
Confidence 566678999999986554443 222336666665
No 171
>KOG2636|consensus
Probab=28.03 E-value=41 Score=28.67 Aligned_cols=28 Identities=14% Similarity=0.288 Sum_probs=24.2
Q ss_pred hcCCCCcccccccc-cccCChHHHHHHHH
Q psy14799 117 RHLNIRSFTCHLCN-KGFFRKSYLKVHLL 144 (230)
Q Consensus 117 ~H~~~~~~~C~~C~-~~f~~~~~l~~H~~ 144 (230)
.|.-...|.|.+|| +++.-...+.+|..
T Consensus 395 LHGL~~ey~CEICGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 395 LHGLDIEYNCEICGNYVYKGRKAFDRHFN 423 (497)
T ss_pred hcCCCcccceeeccCccccCcHHHHHHhH
Confidence 56667779999999 89999999999954
No 172
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=27.81 E-value=15 Score=24.97 Aligned_cols=26 Identities=23% Similarity=0.493 Sum_probs=17.7
Q ss_pred CcccccccccccCChHHHHHhHhccCCCCCCCccccccccc
Q psy14799 34 KSFQCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQ 74 (230)
Q Consensus 34 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~ 74 (230)
....|..|+..|.... ..+.||.||.
T Consensus 69 ~~~~C~~Cg~~~~~~~---------------~~~~CP~Cgs 94 (113)
T PRK12380 69 AQAWCWDCSQVVEIHQ---------------HDAQCPHCHG 94 (113)
T ss_pred cEEEcccCCCEEecCC---------------cCccCcCCCC
Confidence 3478999997776532 2356888884
No 173
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=27.53 E-value=22 Score=19.38 Aligned_cols=14 Identities=36% Similarity=0.833 Sum_probs=8.4
Q ss_pred CcccccccccccCC
Q psy14799 122 RSFTCHLCNKGFFR 135 (230)
Q Consensus 122 ~~~~C~~C~~~f~~ 135 (230)
.|+.|+.|+..|=.
T Consensus 12 ~~~~C~~C~~~FC~ 25 (43)
T PF01428_consen 12 LPFKCKHCGKSFCL 25 (43)
T ss_dssp SHEE-TTTS-EE-T
T ss_pred CCeECCCCCcccCc
Confidence 47888888888764
No 174
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.26 E-value=41 Score=19.98 Aligned_cols=40 Identities=20% Similarity=0.388 Sum_probs=20.8
Q ss_pred eeccccccc-ccCHHHHHHHHHhhcCCCCcccccccccccCC
Q psy14799 95 VTCGLCFQS-FYSKADLVRHMKGRHLNIRSFTCHLCNKGFFR 135 (230)
Q Consensus 95 ~~C~~C~~~-f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~ 135 (230)
.+|.+|++. +-+...+..-.. .....+.|.|+.|......
T Consensus 3 vkCiiCd~v~~iD~rt~~tKrL-rN~PIrtymC~eC~~Rva~ 43 (68)
T COG4896 3 VKCIICDRVDEIDNRTFKTKRL-RNKPIRTYMCPECEHRVAI 43 (68)
T ss_pred ceEEEecceeeecchhHHHHHh-hCCCceeEechhhHhhhch
Confidence 356677643 333344433322 3444456778777655443
No 175
>KOG1842|consensus
Probab=26.98 E-value=38 Score=28.89 Aligned_cols=27 Identities=26% Similarity=0.381 Sum_probs=20.5
Q ss_pred CcccccccccccCCHHHHHHhHHHhcC
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTG 91 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~ 91 (230)
..|.||+|...|.+...|..|...-++
T Consensus 14 egflCPiC~~dl~~~~~L~~H~d~eH~ 40 (505)
T KOG1842|consen 14 EGFLCPICLLDLPNLSALNDHLDVEHF 40 (505)
T ss_pred hcccCchHhhhhhhHHHHHHHHhhhcc
Confidence 457888888888888888888765543
No 176
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=26.90 E-value=4.9 Score=21.68 Aligned_cols=10 Identities=20% Similarity=0.976 Sum_probs=4.9
Q ss_pred cccccccccc
Q psy14799 124 FTCHLCNKGF 133 (230)
Q Consensus 124 ~~C~~C~~~f 133 (230)
|.|..|+..+
T Consensus 29 y~C~~C~~~w 38 (40)
T smart00440 29 YVCTKCGHRW 38 (40)
T ss_pred EEeCCCCCEe
Confidence 5555555443
No 177
>COG2879 Uncharacterized small protein [Function unknown]
Probab=26.62 E-value=32 Score=20.57 Aligned_cols=18 Identities=17% Similarity=0.196 Sum_probs=14.2
Q ss_pred CCccccccccccCCCCCC
Q psy14799 188 ETWHPCHLLTERNRQPPY 205 (230)
Q Consensus 188 ~~~l~~H~~~h~~~~~~~ 205 (230)
-.+...||+.+|.++||.
T Consensus 25 YdnYVehmr~~hPd~p~m 42 (65)
T COG2879 25 YDNYVEHMRKKHPDKPPM 42 (65)
T ss_pred HHHHHHHHHHhCcCCCcc
Confidence 356789999999888764
No 178
>COG1773 Rubredoxin [Energy production and conversion]
Probab=26.08 E-value=29 Score=20.23 Aligned_cols=41 Identities=15% Similarity=0.159 Sum_probs=21.8
Q ss_pred cccccccccccCChHHHHHHHHhhcCCCChhhhhhccCCCCCC
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIHMQPADEAQNAIHRCYACR 165 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~c~~~~~C~~c~ 165 (230)
.|+|..||..|.-...-... -.-.+.+-..-+....|+.|+
T Consensus 3 ~~~C~~CG~vYd~e~Gdp~~--gi~pgT~fedlPd~w~CP~Cg 43 (55)
T COG1773 3 RWRCSVCGYVYDPEKGDPRC--GIAPGTPFEDLPDDWVCPECG 43 (55)
T ss_pred ceEecCCceEeccccCCccC--CCCCCCchhhCCCccCCCCCC
Confidence 58999999998765432222 111222222334445666665
No 179
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=25.38 E-value=56 Score=17.32 Aligned_cols=8 Identities=38% Similarity=1.057 Sum_probs=1.7
Q ss_pred cccccccc
Q psy14799 126 CHLCNKGF 133 (230)
Q Consensus 126 C~~C~~~f 133 (230)
|..|+..|
T Consensus 6 CdyC~~~~ 13 (38)
T PF06220_consen 6 CDYCKKYL 13 (38)
T ss_dssp -TTT--B-
T ss_pred ccccccee
Confidence 44444444
No 181
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=25.34 E-value=57 Score=16.78 Aligned_cols=19 Identities=26% Similarity=0.488 Sum_probs=10.2
Q ss_pred ccccccccccCChHHHHHHH
Q psy14799 124 FTCHLCNKGFFRKSYLKVHL 143 (230)
Q Consensus 124 ~~C~~C~~~f~~~~~l~~H~ 143 (230)
+.|+.|++.+.. +-+..|+
T Consensus 5 ~~C~nC~R~v~a-~RfA~HL 23 (33)
T PF08209_consen 5 VECPNCGRPVAA-SRFAPHL 23 (33)
T ss_dssp EE-TTTSSEEEG-GGHHHHH
T ss_pred EECCCCcCCcch-hhhHHHH
Confidence 567777765543 3555663
No 182
>PF07535 zf-DBF: DBF zinc finger; InterPro: IPR006572 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. In eukaryotes, initiation of DNA replication requires the assembly of pre-replication complexes (pre-RCs) on chromatin during the G1 phase. In the S phase, pre-RCs are activated by two protein kinases, Cdk2 and Cdc7, which results in the loading of replication factors and the unwinding of replication origins by the MCM helicase complex []. Cdc7 is a serine/threonine kinase that is conserved from yeast to human. It is regulated by its association with a regulatory subunit, the Dbf4 protein. This complex is often referred to as DDK (Dbf4-dependent kinase) []. DBF4 contains an N-terminal BRCT domain and a C-terminal conserved region that could potentially coordinate one zinc atom, the DBF4-type zinc finger. This entry represents the zinc finger, which is important for the interaction with Cdc7 [, ]. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding
Probab=24.72 E-value=30 Score=19.65 Aligned_cols=19 Identities=26% Similarity=0.396 Sum_probs=12.1
Q ss_pred hhhhhhhhhccCHHHHHHHHHh
Q psy14799 8 DYWNKNYFFFLMGLKLEAHITT 29 (230)
Q Consensus 8 ~~C~~C~~~f~~~~~l~~H~~~ 29 (230)
-||..|...|. +|..|+.+
T Consensus 6 GYCE~C~~ky~---~l~~Hi~s 24 (49)
T PF07535_consen 6 GYCENCRVKYD---DLEEHIQS 24 (49)
T ss_pred ccCccccchhh---hHHHHhCC
Confidence 36777777776 46667643
No 183
>KOG2807|consensus
Probab=24.23 E-value=88 Score=25.64 Aligned_cols=68 Identities=21% Similarity=0.434 Sum_probs=38.2
Q ss_pred CcccccccccccCCHHHHHHhHHHhcCCCc------------eecccccccccCHHHHHHHHHhhcCCCCcccccccccc
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTGERD------------VTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKG 132 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~------------~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 132 (230)
-|..|++|+-...+...|.+-..---.-++ -.|-.|+. .-.....|+|..|...
T Consensus 289 LP~eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~C~~--------------~~~~~~~y~C~~Ck~~ 354 (378)
T KOG2807|consen 289 LPIECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFACQG--------------ELLSSGRYRCESCKNV 354 (378)
T ss_pred CCccCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceeeecc--------------ccCCCCcEEchhccce
Confidence 367888888877776666543221111111 12544510 1122234888888888
Q ss_pred cCChHHHHHHHHhhc
Q psy14799 133 FFRKSYLKVHLLTIH 147 (230)
Q Consensus 133 f~~~~~l~~H~~~~H 147 (230)
|-.--+.-.| ...|
T Consensus 355 FCldCDv~iH-esLh 368 (378)
T KOG2807|consen 355 FCLDCDVFIH-ESLH 368 (378)
T ss_pred eeccchHHHH-hhhh
Confidence 8777777777 5555
No 184
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=24.22 E-value=29 Score=23.63 Aligned_cols=11 Identities=18% Similarity=0.468 Sum_probs=5.1
Q ss_pred eeccccccccc
Q psy14799 95 VTCGLCFQSFY 105 (230)
Q Consensus 95 ~~C~~C~~~f~ 105 (230)
..|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (115)
T TIGR00100 71 CECEDCSEEVS 81 (115)
T ss_pred EEcccCCCEEe
Confidence 44555554443
No 185
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=23.56 E-value=30 Score=19.79 Aligned_cols=13 Identities=23% Similarity=0.662 Sum_probs=6.9
Q ss_pred cccccccccCChH
Q psy14799 125 TCHLCNKGFFRKS 137 (230)
Q Consensus 125 ~C~~C~~~f~~~~ 137 (230)
.||+|++.|....
T Consensus 22 ~CPlC~r~l~~e~ 34 (54)
T PF04423_consen 22 CCPLCGRPLDEEH 34 (54)
T ss_dssp E-TTT--EE-HHH
T ss_pred cCCCCCCCCCHHH
Confidence 8999999988643
No 186
>PRK00420 hypothetical protein; Validated
Probab=23.40 E-value=57 Score=22.21 Aligned_cols=28 Identities=21% Similarity=0.331 Sum_probs=17.5
Q ss_pred CCCCCCCCCcccccccccCCcccccccc
Q psy14799 160 RCYACRCAGHCLQIGTHSGGICGSMWRV 187 (230)
Q Consensus 160 ~C~~c~~~~~~~~~~~h~~~~C~~~f~~ 187 (230)
.|+.|+.....+....-.|+.||....-
T Consensus 25 ~CP~Cg~pLf~lk~g~~~Cp~Cg~~~~v 52 (112)
T PRK00420 25 HCPVCGLPLFELKDGEVVCPVHGKVYIV 52 (112)
T ss_pred CCCCCCCcceecCCCceECCCCCCeeee
Confidence 3666766555555566666788875543
No 187
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=23.04 E-value=49 Score=26.58 Aligned_cols=21 Identities=19% Similarity=0.366 Sum_probs=12.6
Q ss_pred hhhhhhhhhhhhccCHHHHHHHHH
Q psy14799 5 FILDYWNKNYFFFLMGLKLEAHIT 28 (230)
Q Consensus 5 ~~c~~C~~C~~~f~~~~~l~~H~~ 28 (230)
|+|++ |-+-|.+...|.+|+.
T Consensus 49 yiCe~---Clky~~~~~~l~~H~~ 69 (290)
T PLN03238 49 YICEY---CLKYMRKKKSLLRHLA 69 (290)
T ss_pred EEcCC---CcchhCCHHHHHHHHH
Confidence 55555 4446666666666654
No 188
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=22.99 E-value=10 Score=19.80 Aligned_cols=6 Identities=33% Similarity=1.088 Sum_probs=2.1
Q ss_pred cccccc
Q psy14799 68 PCDLCE 73 (230)
Q Consensus 68 ~C~~C~ 73 (230)
.|..||
T Consensus 23 sC~~CG 28 (35)
T PF07503_consen 23 SCTNCG 28 (35)
T ss_dssp -BTTCC
T ss_pred cCCCCC
Confidence 344444
No 189
>smart00586 ZnF_DBF Zinc finger in DBF-like proteins.
Probab=22.84 E-value=43 Score=19.00 Aligned_cols=19 Identities=21% Similarity=0.293 Sum_probs=13.0
Q ss_pred hhhhhhhhhccCHHHHHHHHHh
Q psy14799 8 DYWNKNYFFFLMGLKLEAHITT 29 (230)
Q Consensus 8 ~~C~~C~~~f~~~~~l~~H~~~ 29 (230)
-||..|...|. .|..|+.+
T Consensus 6 GYCE~Cr~kfd---~l~~Hi~s 24 (49)
T smart00586 6 GYCENCREKYD---DLETHLLS 24 (49)
T ss_pred cccccHhHHHh---hHHHHhcc
Confidence 47888887876 46666653
No 190
>PTZ00448 hypothetical protein; Provisional
Probab=22.35 E-value=62 Score=26.98 Aligned_cols=25 Identities=20% Similarity=0.396 Sum_probs=21.6
Q ss_pred cccccccccccCChHHHHHHHHhhc
Q psy14799 123 SFTCHLCNKGFFRKSYLKVHLLTIH 147 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~~l~~H~~~~H 147 (230)
.|.|..|+..|.+....+.|+++..
T Consensus 314 ~~tC~~C~v~F~~~~~qR~H~KSDw 338 (373)
T PTZ00448 314 MLLCRKCNIQLMDHNAFKQHYRSEW 338 (373)
T ss_pred CccccccccccCCHHHHHHHhhhhH
Confidence 5889999999999999999977653
No 191
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=22.01 E-value=57 Score=17.83 Aligned_cols=17 Identities=24% Similarity=0.495 Sum_probs=11.6
Q ss_pred ccccccccccCCHHHHH
Q psy14799 67 FPCDLCEQRFLNKSHLA 83 (230)
Q Consensus 67 ~~C~~C~~~f~~~~~l~ 83 (230)
=.|+.|++.|.......
T Consensus 9 K~C~~C~rpf~WRKKW~ 25 (42)
T PF10013_consen 9 KICPVCGRPFTWRKKWA 25 (42)
T ss_pred CcCcccCCcchHHHHHH
Confidence 46888888887665543
No 192
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=21.94 E-value=1.1e+02 Score=17.76 Aligned_cols=17 Identities=24% Similarity=0.368 Sum_probs=8.0
Q ss_pred eecccccccccCHHHHH
Q psy14799 95 VTCGLCFQSFYSKADLV 111 (230)
Q Consensus 95 ~~C~~C~~~f~~~~~l~ 111 (230)
+-|-+|+..|.+...|.
T Consensus 28 ~YC~~Cg~~Y~d~~dL~ 44 (55)
T PF13821_consen 28 NYCFWCGTKYDDEEDLE 44 (55)
T ss_pred ceeeeeCCccCCHHHHH
Confidence 34444555554444443
No 193
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=21.84 E-value=5.6 Score=32.01 Aligned_cols=54 Identities=19% Similarity=0.376 Sum_probs=21.8
Q ss_pred CcccccccccccCCHHHHHHhHHHhcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccccC
Q psy14799 65 RPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKGFF 134 (230)
Q Consensus 65 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 134 (230)
+...|..|+...... ...|+.||..-...-.+.. .. .-.+.+...|..|+.-++
T Consensus 196 R~L~Cs~C~t~W~~~--------------R~~Cp~Cg~~~~~~l~~~~-~e-~~~~~rve~C~~C~~YlK 249 (290)
T PF04216_consen 196 RYLHCSLCGTEWRFV--------------RIKCPYCGNTDHEKLEYFT-VE-GEPAYRVEVCESCGSYLK 249 (290)
T ss_dssp EEEEETTT--EEE----------------TTS-TTT---SS-EEE----------SEEEEEETTTTEEEE
T ss_pred EEEEcCCCCCeeeec--------------CCCCcCCCCCCCcceeeEe-cC-CCCcEEEEECCcccchHH
Confidence 567788887666543 1468888755433222110 01 223445567777774433
No 194
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=21.55 E-value=48 Score=23.85 Aligned_cols=32 Identities=22% Similarity=0.481 Sum_probs=14.7
Q ss_pred ceecccccccccCHHHHHHHHHhhcCCCCccccccccc
Q psy14799 94 DVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNK 131 (230)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 131 (230)
+|.|. |+..|.+. .+|-. .-.|+ .|.|..|+-
T Consensus 117 ~Y~C~-C~q~~l~~---RRhn~-~~~g~-~YrC~~C~g 148 (156)
T COG3091 117 PYRCQ-CQQHYLRI---RRHNT-VRRGE-VYRCGKCGG 148 (156)
T ss_pred eEEee-cCCccchh---hhccc-ccccc-eEEeccCCc
Confidence 45666 66555442 22222 22233 466666653
No 195
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=21.39 E-value=9.6 Score=34.38 Aligned_cols=56 Identities=20% Similarity=0.513 Sum_probs=28.1
Q ss_pred cccccccccCChHHHHHhHhccCCCCCCCcccccccccccCCHHHHHHhHHHhcCCCceeccccc
Q psy14799 37 QCDECDMSFGTHQALGRHKQCKHMDISLRPFPCDLCEQRFLNKSHLAYHVKTHTGERDVTCGLCF 101 (230)
Q Consensus 37 ~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 101 (230)
.|..||-.|.-...|---...-... .--.|+.|.+.+.+..+-.-| ..+..|+.||
T Consensus 125 ~CT~CGPRfTIi~alPYDR~nTsM~---~F~lC~~C~~EY~dP~nRRfH------AQp~aCp~CG 180 (750)
T COG0068 125 NCTNCGPRFTIIEALPYDRENTSMA---DFPLCPFCDKEYKDPLNRRFH------AQPIACPKCG 180 (750)
T ss_pred ccCCCCcceeeeccCCCCcccCccc---cCcCCHHHHHHhcCccccccc------cccccCcccC
Confidence 3777777776655543332211111 112466666666666543322 2345666666
No 196
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=21.32 E-value=1.3e+02 Score=22.42 Aligned_cols=34 Identities=24% Similarity=0.545 Sum_probs=20.8
Q ss_pred hcCCCceecccccccccCHHHHHHHHHhhcCCCCcccccccccc
Q psy14799 89 HTGERDVTCGLCFQSFYSKADLVRHMKGRHLNIRSFTCHLCNKG 132 (230)
Q Consensus 89 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 132 (230)
-....-|.|+.|...++...++ ..+ |.||.||..
T Consensus 108 ~~~~~~y~C~~~~~r~sfdeA~-------~~~---F~Cp~Cg~~ 141 (176)
T COG1675 108 ETENNYYVCPNCHVKYSFDEAM-------ELG---FTCPKCGED 141 (176)
T ss_pred hccCCceeCCCCCCcccHHHHH-------HhC---CCCCCCCch
Confidence 3445668887777666554442 112 788888854
No 197
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=21.20 E-value=18 Score=20.19 Aligned_cols=6 Identities=33% Similarity=1.187 Sum_probs=2.6
Q ss_pred cccccc
Q psy14799 38 CDECDM 43 (230)
Q Consensus 38 C~~C~~ 43 (230)
|+.||.
T Consensus 2 CP~Cg~ 7 (47)
T PF04606_consen 2 CPHCGS 7 (47)
T ss_pred cCCCCC
Confidence 444443
No 198
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=21.19 E-value=70 Score=20.63 Aligned_cols=14 Identities=21% Similarity=0.423 Sum_probs=8.0
Q ss_pred ccccCCcccccccc
Q psy14799 174 GTHSGGICGSMWRV 187 (230)
Q Consensus 174 ~~h~~~~C~~~f~~ 187 (230)
.+-.|..||..|.-
T Consensus 52 GIW~C~kCg~~fAG 65 (89)
T COG1997 52 GIWKCRKCGAKFAG 65 (89)
T ss_pred CeEEcCCCCCeecc
Confidence 33444577777754
No 199
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=20.62 E-value=43 Score=23.42 Aligned_cols=15 Identities=27% Similarity=0.547 Sum_probs=11.2
Q ss_pred cccccccccccCChH
Q psy14799 123 SFTCHLCNKGFFRKS 137 (230)
Q Consensus 123 ~~~C~~C~~~f~~~~ 137 (230)
|++|..||+.|.+-+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 467888888888754
No 200
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=20.27 E-value=31 Score=18.43 Aligned_cols=11 Identities=18% Similarity=0.890 Sum_probs=4.9
Q ss_pred CCccccccccc
Q psy14799 121 IRSFTCHLCNK 131 (230)
Q Consensus 121 ~~~~~C~~C~~ 131 (230)
.+.+.|.+|+.
T Consensus 22 ~~~w~C~~C~~ 32 (40)
T PF04810_consen 22 GKTWICNFCGT 32 (40)
T ss_dssp TTEEEETTT--
T ss_pred CCEEECcCCCC
Confidence 34456666654
No 201
>PF12773 DZR: Double zinc ribbon
Probab=20.22 E-value=40 Score=18.73 Aligned_cols=8 Identities=25% Similarity=1.024 Sum_probs=3.5
Q ss_pred cccccccc
Q psy14799 38 CDECDMSF 45 (230)
Q Consensus 38 C~~C~~~f 45 (230)
|+.||...
T Consensus 15 C~~CG~~l 22 (50)
T PF12773_consen 15 CPHCGTPL 22 (50)
T ss_pred ChhhcCCh
Confidence 44444443
No 202
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=20.02 E-value=61 Score=26.81 Aligned_cols=22 Identities=23% Similarity=0.045 Sum_probs=19.2
Q ss_pred hhhhhhhhhccCHHHHHHHHHh
Q psy14799 8 DYWNKNYFFFLMGLKLEAHITT 29 (230)
Q Consensus 8 ~~C~~C~~~f~~~~~l~~H~~~ 29 (230)
-||..|++-|.+...+..|...
T Consensus 239 ~YC~~C~r~f~~~~VFe~Hl~g 260 (470)
T COG5188 239 VYCVKCGREFSRSKVFEYHLEG 260 (470)
T ss_pred eeeHhhhhHhhhhHHHHHHHhh
Confidence 4899999999999999999753
No 203
>KOG2071|consensus
Probab=20.02 E-value=59 Score=28.84 Aligned_cols=27 Identities=22% Similarity=0.570 Sum_probs=22.3
Q ss_pred CCcccccccccccCChHHHHHhHhccC
Q psy14799 33 DKSFQCDECDMSFGTHQALGRHKQCKH 59 (230)
Q Consensus 33 ~~~~~C~~C~~~f~~~~~l~~H~~~~~ 59 (230)
..|.+|..||..|.+......||..|-
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 566889999999999988888887664
Done!