Query psy14939
Match_columns 971
No_of_seqs 471 out of 4452
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 16:19:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14939.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14939hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 9.6E-33 2.1E-37 300.3 13.9 58 761-819 878-935 (958)
2 KOG3623|consensus 100.0 2E-31 4.4E-36 283.9 21.0 107 198-304 211-332 (1007)
3 KOG1074|consensus 99.9 1.1E-26 2.5E-31 253.3 13.4 83 253-335 605-693 (958)
4 KOG3608|consensus 99.9 6.6E-25 1.4E-29 215.6 11.7 227 30-338 133-380 (467)
5 KOG2462|consensus 99.9 5.4E-24 1.2E-28 205.6 6.8 134 195-332 128-265 (279)
6 KOG3608|consensus 99.9 9.5E-23 2.1E-27 200.5 6.7 232 5-321 142-397 (467)
7 KOG2462|consensus 99.9 9.4E-23 2E-27 197.1 5.6 104 198-303 162-265 (279)
8 KOG3623|consensus 99.9 1.7E-20 3.7E-25 201.1 18.8 107 168-275 211-331 (1007)
9 KOG3576|consensus 99.6 7.4E-16 1.6E-20 140.4 4.2 86 28-114 114-199 (267)
10 KOG3576|consensus 99.6 1E-15 2.2E-20 139.4 2.7 114 223-337 116-239 (267)
11 PHA00733 hypothetical protein 99.0 1.3E-10 2.8E-15 104.7 4.2 85 757-844 35-124 (128)
12 PLN03086 PRLI-interacting fact 99.0 1.1E-09 2.3E-14 121.6 8.2 120 198-335 434-565 (567)
13 PLN03086 PRLI-interacting fact 98.9 1.5E-09 3.4E-14 120.3 8.4 101 198-305 454-564 (567)
14 PHA02768 hypothetical protein; 98.9 5.9E-10 1.3E-14 80.8 2.6 43 763-807 6-48 (55)
15 PHA02768 hypothetical protein; 98.9 6.6E-10 1.4E-14 80.6 1.5 43 790-835 5-47 (55)
16 PHA00733 hypothetical protein 98.8 3.4E-09 7.3E-14 95.5 4.3 82 29-113 38-124 (128)
17 KOG3993|consensus 98.8 2.9E-09 6.3E-14 109.1 2.2 54 59-113 267-320 (500)
18 PF13465 zf-H2C2_2: Zinc-finge 98.7 7.6E-09 1.6E-13 64.2 1.9 25 777-801 1-25 (26)
19 KOG3993|consensus 98.6 1.2E-08 2.6E-13 104.7 -0.0 84 30-114 266-382 (500)
20 KOG1146|consensus 98.4 1.4E-06 3.1E-11 103.0 12.0 86 760-850 1258-1359(1406)
21 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.7E-07 3.6E-12 58.2 2.4 25 46-70 1-25 (26)
22 PHA00616 hypothetical protein 98.2 5.6E-07 1.2E-11 62.1 1.8 39 31-69 1-39 (44)
23 PHA00732 hypothetical protein 98.2 8E-07 1.7E-11 72.0 2.7 43 763-811 2-45 (79)
24 PHA00616 hypothetical protein 98.1 9.2E-07 2E-11 61.0 1.3 34 762-795 1-34 (44)
25 PHA00732 hypothetical protein 98.0 5.6E-06 1.2E-10 67.1 3.4 47 790-842 1-47 (79)
26 PF05605 zf-Di19: Drought indu 97.9 1.2E-05 2.7E-10 60.5 4.3 50 60-112 3-53 (54)
27 PF05605 zf-Di19: Drought indu 97.9 1.4E-05 3E-10 60.2 3.7 50 791-843 3-53 (54)
28 KOG2231|consensus 97.7 0.00029 6.2E-09 79.9 11.4 51 226-285 184-240 (669)
29 COG5189 SFP1 Putative transcri 97.6 2E-05 4.4E-10 78.1 0.8 70 760-840 347-419 (423)
30 PF00096 zf-C2H2: Zinc finger, 97.5 7.6E-05 1.7E-09 45.0 2.2 23 32-54 1-23 (23)
31 PF00096 zf-C2H2: Zinc finger, 97.5 6.2E-05 1.3E-09 45.4 1.6 22 820-841 1-22 (23)
32 PF13894 zf-C2H2_4: C2H2-type 97.1 0.00031 6.6E-09 42.9 2.0 24 820-843 1-24 (24)
33 COG5189 SFP1 Putative transcri 97.1 0.00017 3.7E-09 71.8 0.8 53 57-109 347-419 (423)
34 KOG1146|consensus 97.0 0.00021 4.5E-09 85.3 1.2 54 61-114 438-491 (1406)
35 PF12756 zf-C2H2_2: C2H2 type 97.0 0.0005 1.1E-08 60.3 2.7 74 33-112 1-74 (100)
36 PF13894 zf-C2H2_4: C2H2-type 96.9 0.00071 1.5E-08 41.2 2.4 19 62-80 3-21 (24)
37 PF12756 zf-C2H2_2: C2H2 type 96.8 0.00095 2.1E-08 58.5 2.8 74 226-305 1-74 (100)
38 PF13912 zf-C2H2_6: C2H2-type 96.7 0.0008 1.7E-08 42.4 1.6 23 32-54 2-24 (27)
39 PF13912 zf-C2H2_6: C2H2-type 96.7 0.00087 1.9E-08 42.2 1.7 25 819-843 1-25 (27)
40 PF09237 GAGA: GAGA factor; I 96.4 0.0028 6.1E-08 44.8 2.7 39 807-845 12-50 (54)
41 PRK04860 hypothetical protein; 96.1 0.0029 6.3E-08 59.4 2.2 41 761-805 118-158 (160)
42 KOG2231|consensus 96.0 0.012 2.5E-07 67.3 6.5 71 198-276 183-262 (669)
43 PRK04860 hypothetical protein; 95.9 0.0032 7E-08 59.1 1.4 41 789-834 118-158 (160)
44 PF13909 zf-H2C2_5: C2H2-type 95.9 0.0068 1.5E-07 36.8 2.3 23 89-112 1-23 (24)
45 smart00355 ZnF_C2H2 zinc finge 95.7 0.0081 1.8E-07 37.2 2.2 23 32-54 1-23 (26)
46 smart00355 ZnF_C2H2 zinc finge 95.7 0.0086 1.9E-07 37.0 2.3 23 820-842 1-23 (26)
47 PF09237 GAGA: GAGA factor; I 95.7 0.0071 1.5E-07 42.9 1.8 30 308-337 22-51 (54)
48 KOG2482|consensus 95.5 0.022 4.7E-07 58.0 5.5 135 196-332 143-356 (423)
49 COG5236 Uncharacterized conser 95.5 0.013 2.8E-07 59.5 3.8 75 198-277 221-305 (493)
50 KOG2785|consensus 95.5 0.015 3.2E-07 60.8 4.2 135 198-332 4-242 (390)
51 PF13909 zf-H2C2_5: C2H2-type 95.4 0.014 3E-07 35.4 2.2 23 60-83 1-23 (24)
52 KOG2785|consensus 95.3 0.06 1.3E-06 56.4 7.9 75 196-275 165-242 (390)
53 KOG2893|consensus 95.2 0.0066 1.4E-07 58.1 0.6 46 34-83 13-58 (341)
54 PF12874 zf-met: Zinc-finger o 95.0 0.014 3.1E-07 35.8 1.6 21 32-52 1-21 (25)
55 KOG2482|consensus 95.0 0.023 5E-07 57.8 3.8 82 30-111 194-357 (423)
56 PF06524 NOA36: NOA36 protein; 94.9 0.033 7.1E-07 54.5 4.4 84 758-843 138-233 (314)
57 PF12874 zf-met: Zinc-finger o 94.6 0.02 4.4E-07 35.1 1.5 23 820-842 1-23 (25)
58 COG5236 Uncharacterized conser 94.1 0.045 9.8E-07 55.7 3.5 129 198-336 152-307 (493)
59 PF12171 zf-C2H2_jaz: Zinc-fin 93.3 0.041 9E-07 34.5 1.1 21 791-811 2-22 (27)
60 COG5048 FOG: Zn-finger [Genera 93.2 0.021 4.6E-07 65.9 -0.5 61 31-92 289-355 (467)
61 COG5048 FOG: Zn-finger [Genera 93.1 0.04 8.7E-07 63.6 1.6 73 761-834 288-368 (467)
62 PF12171 zf-C2H2_jaz: Zinc-fin 93.1 0.048 1E-06 34.2 1.2 22 225-246 2-23 (27)
63 KOG4173|consensus 92.4 0.048 1E-06 51.3 0.6 87 757-846 74-173 (253)
64 PF13913 zf-C2HC_2: zinc-finge 92.2 0.13 2.8E-06 31.4 2.2 21 820-841 3-23 (25)
65 KOG2893|consensus 91.8 0.05 1.1E-06 52.3 0.0 48 198-248 11-58 (341)
66 PF13913 zf-C2HC_2: zinc-finge 89.4 0.26 5.6E-06 30.1 1.7 22 311-333 3-24 (25)
67 PF12013 DUF3505: Protein of u 89.2 0.53 1.1E-05 41.7 4.2 73 30-113 10-109 (109)
68 smart00451 ZnF_U1 U1-like zinc 87.5 0.43 9.3E-06 32.0 2.0 23 762-784 3-25 (35)
69 smart00451 ZnF_U1 U1-like zinc 87.0 0.47 1E-05 31.8 1.9 22 31-52 3-24 (35)
70 KOG4173|consensus 86.3 0.48 1E-05 44.9 2.2 82 29-113 77-171 (253)
71 cd00350 rubredoxin_like Rubred 84.2 0.63 1.4E-05 30.7 1.4 24 763-798 2-25 (33)
72 COG4049 Uncharacterized protei 83.6 0.49 1.1E-05 34.3 0.8 31 814-844 12-42 (65)
73 COG4049 Uncharacterized protei 82.1 0.69 1.5E-05 33.5 1.0 28 28-55 14-41 (65)
74 cd00350 rubredoxin_like Rubred 81.9 0.77 1.7E-05 30.3 1.2 10 280-289 16-25 (33)
75 PF12013 DUF3505: Protein of u 81.4 5.6 0.00012 35.1 6.9 25 87-112 10-34 (109)
76 PF09538 FYDLN_acid: Protein o 78.9 1.2 2.5E-05 38.9 1.6 30 32-72 10-39 (108)
77 PF09986 DUF2225: Uncharacteri 78.0 0.99 2.2E-05 45.4 1.1 44 789-832 4-61 (214)
78 PRK00464 nrdR transcriptional 75.9 1.1 2.3E-05 42.0 0.6 15 31-45 28-42 (154)
79 PF09538 FYDLN_acid: Protein o 74.5 2.3 5.1E-05 37.0 2.3 30 254-294 10-39 (108)
80 PF10571 UPF0547: Uncharacteri 71.7 1.8 4E-05 26.7 0.7 9 61-69 16-24 (26)
81 KOG2186|consensus 70.7 2.5 5.3E-05 42.0 1.7 45 763-810 4-48 (276)
82 COG5188 PRP9 Splicing factor 3 70.3 5.4 0.00012 41.4 4.1 29 813-841 368-397 (470)
83 TIGR02098 MJ0042_CXXC MJ0042 f 70.3 3.1 6.7E-05 28.5 1.7 34 791-830 3-36 (38)
84 PF02892 zf-BED: BED zinc fing 70.1 3.4 7.4E-05 29.5 2.0 24 29-52 14-41 (45)
85 cd00729 rubredoxin_SM Rubredox 69.0 2.8 6.1E-05 27.9 1.2 25 762-798 2-26 (34)
86 KOG4377|consensus 68.4 7.7 0.00017 41.4 4.8 29 311-339 402-432 (480)
87 TIGR02300 FYDLN_acid conserved 67.5 4.1 8.9E-05 36.0 2.2 14 819-832 26-39 (129)
88 PF02892 zf-BED: BED zinc fing 67.1 5 0.00011 28.7 2.3 27 57-83 14-44 (45)
89 COG5188 PRP9 Splicing factor 3 67.0 5.7 0.00012 41.2 3.4 23 30-52 237-259 (470)
90 PF13717 zinc_ribbon_4: zinc-r 66.4 4.8 0.0001 27.2 1.9 33 791-829 3-35 (36)
91 COG4530 Uncharacterized protei 66.3 4.5 9.8E-05 34.2 2.1 26 33-69 11-36 (129)
92 PF05443 ROS_MUCR: ROS/MUCR tr 64.8 5.1 0.00011 36.3 2.4 28 28-58 69-96 (132)
93 PF09986 DUF2225: Uncharacteri 64.5 2.3 5E-05 42.8 0.2 12 282-293 49-60 (214)
94 PF05443 ROS_MUCR: ROS/MUCR tr 63.0 3.7 8E-05 37.2 1.2 29 759-790 69-97 (132)
95 smart00659 RPOLCX RNA polymera 62.8 4.5 9.9E-05 28.7 1.3 11 32-42 3-13 (44)
96 TIGR02300 FYDLN_acid conserved 62.7 6.3 0.00014 34.9 2.5 30 254-294 10-39 (129)
97 PF06524 NOA36: NOA36 protein; 62.5 4.3 9.4E-05 40.3 1.6 96 219-337 137-236 (314)
98 COG1996 RPC10 DNA-directed RNA 61.7 4.3 9.2E-05 29.4 1.0 28 31-68 6-33 (49)
99 smart00614 ZnF_BED BED zinc fi 61.6 6 0.00013 29.1 1.9 25 762-786 18-48 (50)
100 KOG4167|consensus 61.1 10 0.00023 43.6 4.4 29 758-786 788-816 (907)
101 KOG2186|consensus 61.1 5.9 0.00013 39.4 2.2 49 60-112 4-52 (276)
102 PF13719 zinc_ribbon_5: zinc-r 61.0 6.5 0.00014 26.8 1.8 33 792-830 4-36 (37)
103 COG1997 RPL43A Ribosomal prote 60.8 4.7 0.0001 32.9 1.3 31 789-830 34-64 (89)
104 smart00614 ZnF_BED BED zinc fi 60.7 6.2 0.00013 29.0 1.8 22 31-52 18-44 (50)
105 PHA00626 hypothetical protein 60.4 4.8 0.0001 29.6 1.1 13 819-831 23-35 (59)
106 PRK14890 putative Zn-ribbon RN 60.4 7.6 0.00017 29.2 2.2 11 309-319 47-57 (59)
107 COG1592 Rubrerythrin [Energy p 60.3 5.6 0.00012 37.6 1.9 14 813-826 143-156 (166)
108 COG1592 Rubrerythrin [Energy p 59.2 5.7 0.00012 37.6 1.7 23 59-95 134-156 (166)
109 KOG4167|consensus 58.6 2.1 4.5E-05 48.9 -1.4 25 819-843 792-816 (907)
110 COG2888 Predicted Zn-ribbon RN 58.2 7.8 0.00017 29.1 1.9 10 87-96 49-58 (61)
111 PRK00464 nrdR transcriptional 54.0 3.6 7.8E-05 38.5 -0.5 17 820-836 29-45 (154)
112 TIGR00373 conserved hypothetic 53.9 10 0.00022 36.1 2.5 38 784-831 103-140 (158)
113 COG4957 Predicted transcriptio 53.8 8.2 0.00018 34.3 1.7 27 761-790 75-101 (148)
114 smart00531 TFIIE Transcription 53.1 11 0.00024 35.4 2.6 39 787-830 96-134 (147)
115 TIGR00622 ssl1 transcription f 51.3 26 0.00056 30.7 4.3 79 762-843 15-105 (112)
116 PF12907 zf-met2: Zinc-binding 50.8 9.7 0.00021 26.4 1.3 32 60-91 2-36 (40)
117 smart00531 TFIIE Transcription 50.3 14 0.00031 34.6 2.9 40 55-99 95-134 (147)
118 PF03604 DNA_RNApol_7kD: DNA d 50.2 6.6 0.00014 25.7 0.4 11 32-42 1-11 (32)
119 PRK00398 rpoP DNA-directed RNA 50.2 8.5 0.00018 27.7 1.1 10 819-828 21-30 (46)
120 PF15269 zf-C2H2_7: Zinc-finge 50.1 11 0.00024 26.2 1.4 22 89-110 21-42 (54)
121 COG4957 Predicted transcriptio 50.0 13 0.00027 33.2 2.2 24 31-57 76-99 (148)
122 TIGR02605 CxxC_CxxC_SSSS putat 49.8 3.8 8.3E-05 30.4 -0.8 11 60-70 6-16 (52)
123 TIGR00373 conserved hypothetic 48.8 18 0.0004 34.3 3.4 36 54-99 104-139 (158)
124 smart00734 ZnF_Rad18 Rad18-lik 48.2 17 0.00038 22.5 2.0 20 820-840 2-21 (26)
125 PRK06266 transcription initiat 47.8 15 0.00032 35.7 2.6 36 786-831 113-148 (178)
126 KOG4124|consensus 47.2 4.8 0.0001 41.8 -0.8 71 758-839 345-418 (442)
127 smart00834 CxxC_CXXC_SSSS Puta 47.1 4.2 9.2E-05 28.3 -0.9 11 32-42 6-16 (41)
128 PF02176 zf-TRAF: TRAF-type zi 46.7 13 0.00029 28.4 1.8 40 30-70 8-53 (60)
129 TIGR00622 ssl1 transcription f 46.5 29 0.00062 30.4 3.8 83 27-112 11-105 (112)
130 COG4530 Uncharacterized protei 45.3 13 0.00027 31.7 1.4 11 819-829 26-36 (129)
131 PRK09678 DNA-binding transcrip 43.3 6.6 0.00014 31.3 -0.4 40 60-102 2-43 (72)
132 PF02176 zf-TRAF: TRAF-type zi 43.3 14 0.0003 28.3 1.4 42 788-831 7-54 (60)
133 COG1198 PriA Primosomal protei 42.5 17 0.00037 43.5 2.6 26 250-290 459-484 (730)
134 PF04959 ARS2: Arsenite-resist 42.2 11 0.00024 37.4 0.9 25 29-53 75-99 (214)
135 PF15269 zf-C2H2_7: Zinc-finge 41.6 19 0.0004 25.1 1.5 21 60-80 21-41 (54)
136 PRK06266 transcription initiat 41.5 24 0.00052 34.3 3.0 35 56-100 114-148 (178)
137 PRK04023 DNA polymerase II lar 39.8 22 0.00047 43.2 2.8 17 757-773 1032-1048(1121)
138 KOG4124|consensus 38.0 8.5 0.00018 40.0 -0.7 25 787-811 395-419 (442)
139 PF04959 ARS2: Arsenite-resist 38.0 23 0.0005 35.3 2.3 28 57-84 75-102 (214)
140 KOG1280|consensus 37.9 11 0.00023 39.6 -0.0 25 280-304 78-102 (381)
141 PF14353 CpXC: CpXC protein 37.9 4.7 0.0001 36.9 -2.4 20 59-78 38-57 (128)
142 KOG2071|consensus 37.6 27 0.00059 39.8 3.0 42 759-800 415-523 (579)
143 PF14353 CpXC: CpXC protein 37.4 13 0.00028 34.0 0.4 17 253-269 38-54 (128)
144 KOG2593|consensus 36.7 22 0.00048 38.8 2.1 40 54-97 123-162 (436)
145 PF05191 ADK_lid: Adenylate ki 36.3 9.7 0.00021 25.7 -0.4 10 763-772 2-11 (36)
146 KOG4377|consensus 35.7 22 0.00047 38.2 1.8 106 198-306 272-428 (480)
147 COG4888 Uncharacterized Zn rib 35.4 5.2 0.00011 33.5 -2.1 11 89-99 47-57 (104)
148 TIGR00280 L37a ribosomal prote 35.0 25 0.00055 29.4 1.7 32 789-831 34-65 (91)
149 PTZ00255 60S ribosomal protein 34.3 29 0.00063 29.0 1.9 32 789-831 35-66 (90)
150 PF09723 Zn-ribbon_8: Zinc rib 34.0 20 0.00043 25.2 0.8 10 254-263 6-15 (42)
151 KOG2593|consensus 33.3 31 0.00067 37.7 2.5 37 786-826 124-160 (436)
152 PF07800 DUF1644: Protein of u 33.2 1.1E+02 0.0023 28.7 5.5 57 760-819 78-137 (162)
153 PF01780 Ribosomal_L37ae: Ribo 33.1 20 0.00043 29.9 0.8 31 789-830 34-64 (90)
154 PRK09678 DNA-binding transcrip 32.8 14 0.0003 29.5 -0.1 21 815-835 23-45 (72)
155 PF13240 zinc_ribbon_2: zinc-r 32.4 16 0.00035 21.8 0.1 7 34-40 2-8 (23)
156 COG3677 Transposase and inacti 32.1 25 0.00054 32.1 1.4 12 59-70 53-64 (129)
157 cd00730 rubredoxin Rubredoxin; 31.2 23 0.0005 26.0 0.8 11 254-264 2-12 (50)
158 COG0068 HypF Hydrogenase matur 31.1 11 0.00024 43.8 -1.2 56 764-827 125-181 (750)
159 PRK03976 rpl37ae 50S ribosomal 30.9 31 0.00068 28.8 1.6 32 789-831 35-66 (90)
160 KOG1701|consensus 30.2 11 0.00023 40.8 -1.5 10 225-234 361-370 (468)
161 KOG3408|consensus 29.9 31 0.00067 30.2 1.5 26 28-53 54-79 (129)
162 PRK04023 DNA polymerase II lar 29.8 45 0.00097 40.7 3.2 14 789-802 1036-1049(1121)
163 COG4896 Uncharacterized protei 29.3 35 0.00076 25.7 1.5 40 32-71 3-43 (68)
164 COG1645 Uncharacterized Zn-fin 29.0 23 0.00049 31.9 0.5 27 32-70 29-55 (131)
165 PF01927 Mut7-C: Mut7-C RNAse 29.0 36 0.00078 31.9 1.9 46 32-77 92-142 (147)
166 COG1656 Uncharacterized conser 28.3 48 0.001 31.2 2.5 16 60-75 131-146 (165)
167 COG1655 Uncharacterized protei 27.8 13 0.00028 36.5 -1.2 18 29-46 17-34 (267)
168 COG5151 SSL1 RNA polymerase II 25.9 24 0.00052 36.2 0.2 27 86-112 386-412 (421)
169 PF12760 Zn_Tnp_IS1595: Transp 25.7 23 0.00051 25.4 0.1 27 32-67 19-45 (46)
170 COG1198 PriA Primosomal protei 25.5 47 0.001 39.9 2.5 29 784-828 456-484 (730)
171 PF13878 zf-C2H2_3: zinc-finge 25.4 67 0.0015 22.5 2.3 24 763-786 14-39 (41)
172 PF08274 PhnA_Zn_Ribbon: PhnA 24.5 31 0.00067 22.2 0.4 7 60-66 20-26 (30)
173 PF08790 zf-LYAR: LYAR-type C2 24.1 16 0.00035 23.0 -0.9 23 32-55 1-23 (28)
174 PRK14714 DNA polymerase II lar 23.8 51 0.0011 41.4 2.4 15 758-772 1249-1263(1337)
175 KOG2807|consensus 23.6 1.1E+02 0.0024 32.1 4.3 68 29-112 274-369 (378)
176 PF11789 zf-Nse: Zinc-finger o 23.5 57 0.0012 24.8 1.8 44 761-824 10-53 (57)
177 PRK03824 hypA hydrogenase nick 23.5 25 0.00054 32.4 -0.2 15 30-44 69-83 (135)
178 COG1655 Uncharacterized protei 22.9 31 0.00068 34.0 0.3 10 282-291 63-72 (267)
179 PRK14873 primosome assembly pr 22.7 53 0.0011 39.4 2.2 46 198-288 384-429 (665)
180 KOG1280|consensus 22.4 72 0.0016 33.7 2.8 29 88-116 79-107 (381)
181 PF04810 zf-Sec23_Sec24: Sec23 21.9 17 0.00036 25.3 -1.3 12 816-827 21-32 (40)
182 PF03811 Zn_Tnp_IS1: InsA N-te 21.8 56 0.0012 22.1 1.3 16 22-37 20-35 (36)
183 COG1571 Predicted DNA-binding 21.2 57 0.0012 36.0 1.9 28 1-43 352-379 (421)
184 PF07295 DUF1451: Protein of u 21.2 39 0.00084 31.5 0.6 13 758-770 108-120 (146)
185 TIGR00595 priA primosomal prot 21.0 30 0.00065 40.1 -0.2 47 32-95 214-260 (505)
186 COG3091 SprT Zn-dependent meta 20.9 46 0.001 30.7 0.9 33 31-68 117-149 (156)
187 PRK03824 hypA hydrogenase nick 20.7 35 0.00077 31.4 0.2 18 758-775 66-83 (135)
188 PF05495 zf-CHY: CHY zinc fing 20.6 25 0.00055 28.1 -0.7 59 31-98 10-71 (71)
189 PF05290 Baculo_IE-1: Baculovi 20.1 52 0.0011 29.6 1.1 55 251-324 78-135 (140)
No 1
>KOG1074|consensus
Probab=99.98 E-value=9.6e-33 Score=300.31 Aligned_cols=58 Identities=41% Similarity=0.848 Sum_probs=52.8
Q ss_pred CCcccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhhhhhhhccCCCCC
Q psy14939 761 DRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKP 819 (971)
Q Consensus 761 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~ 819 (971)
.++.|.+||+.|....+|+.|+|+|+|+|||.|.+|+++|..+.+|+.||. +|....+
T Consensus 878 n~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMg-tH~w~q~ 935 (958)
T KOG1074|consen 878 NAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMG-THMWVQP 935 (958)
T ss_pred chhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhc-cccccCC
Confidence 357899999999999999999999999999999999999999999999999 6765544
No 2
>KOG3623|consensus
Probab=99.98 E-value=2e-31 Score=283.95 Aligned_cols=107 Identities=31% Similarity=0.627 Sum_probs=99.0
Q ss_pred cccCccchhccChhHHHhhHhhcCCC---CcccCCCCCccCChhHHHhhHhhhcc------------CCcccccCcCCCc
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGR---KFPCENCTKVFSDPSNLQRHIRTHHV------------GARQHACQECGKT 262 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~---~~~C~~C~~~f~~~~~L~~H~~~hh~------------~~~~~~C~~C~~~ 262 (971)
..|.+|.+.+.....|+.|++..|.+ .|.|.+|.++|..+..|.+|+.+|.. ..+.|+|..||+.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 78999999999999999999998873 49999999999999999999998842 3477999999999
Q ss_pred cCChHHHhhCccccCCCCcccccccchhcCCHHHHHHHHhhh
Q psy14939 263 FATSSGLKQHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKRMH 304 (971)
Q Consensus 263 f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h 304 (971)
|+.+.+|+.|+++|.|+|||.|+.|++.|...+.+..||...
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSSK 332 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSSK 332 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccccc
Confidence 999999999999999999999999999999999999998753
No 3
>KOG1074|consensus
Probab=99.94 E-value=1.1e-26 Score=253.29 Aligned_cols=83 Identities=29% Similarity=0.653 Sum_probs=74.7
Q ss_pred ccccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCHHHHHHHHhhhcC---CCCceecc---ccccccCCHHHH
Q psy14939 253 QHACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKRMHAN---CRMQIKCV---KCDQSFSTVTSL 326 (971)
Q Consensus 253 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h~~---~~~~~~C~---~C~~~f~~~~~l 326 (971)
+-+|-+|-+....++.|+.|+++|+|++||+|.+|+++|+++++|+.||-+|.. .+.+|.|+ +|-+.|.....|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 367999999999999999999999999999999999999999999999988864 34578999 999999999999
Q ss_pred HHhhhhcCC
Q psy14939 327 SKHKRFCDS 335 (971)
Q Consensus 327 ~~H~~~~~~ 335 (971)
..|+|+|.+
T Consensus 685 pQhIriH~~ 693 (958)
T KOG1074|consen 685 PQHIRIHLG 693 (958)
T ss_pred cceEEeecC
Confidence 999999874
No 4
>KOG3608|consensus
Probab=99.92 E-value=6.6e-25 Score=215.61 Aligned_cols=227 Identities=26% Similarity=0.545 Sum_probs=201.3
Q ss_pred Cceeccc--cccccCChHHHHHHHHHhcC------------Ccc-eeccc--cccccCCHHHHHHHHHhhcCCCCCeecC
Q psy14939 30 DRYACKY--CGKVFPRSANLTRHLRTHTG------------EQP-YKCKY--CERSFSISSNLQRHVRNIHNKEKPFKCP 92 (971)
Q Consensus 30 ~~~~C~~--C~k~F~~~~~L~~H~~~H~~------------~~~-~~C~~--C~~~f~~~~~L~~H~~~~H~~~k~~~C~ 92 (971)
.-|.|.+ |++.|.+...|..|+-.|.- ++| +.|+. |.+.|.++..|++|++ +|+++|...|+
T Consensus 133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r-~Hs~eKvvACp 211 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR-THSNEKVVACP 211 (467)
T ss_pred hhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH-hcCCCeEEecc
Confidence 3577755 99999999999999988832 222 67876 9999999999999999 89999999999
Q ss_pred CCCcccCchhhHHHHHhhhhccCCCCccccCCCCCCcccchhhhhhHHHHhhhcCCccccCCcccccccccCCCCCCCCC
Q psy14939 93 LCDRCFGQQTNLDRHLKKHEADDGSGLVSMADSPESSNENEREDAYFDEIRSFMGKVTYSGDVAYNQENLYTGNNPPSPL 172 (971)
Q Consensus 93 ~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (971)
.|+.-|+++..|.-|++......
T Consensus 212 ~Cg~~F~~~tkl~DH~rRqt~l~--------------------------------------------------------- 234 (467)
T KOG3608|consen 212 HCGELFRTKTKLFDHLRRQTELN--------------------------------------------------------- 234 (467)
T ss_pred hHHHHhccccHHHHHHHhhhhhc---------------------------------------------------------
Confidence 99999999999999998764321
Q ss_pred CcCCCCChHHHHHHHhhcCCCCCCCcccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCc
Q psy14939 173 IDVKDDDDNELEDHLITGHRYPPDQYRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGAR 252 (971)
Q Consensus 173 ~~~~~~~~~~l~~h~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~ 252 (971)
..+|.|..|.+.|.+...|..|+..|- .-|+|++|..+...++.|.+|++..|...+
T Consensus 235 ----------------------~n~fqC~~C~KrFaTeklL~~Hv~rHv-n~ykCplCdmtc~~~ssL~~H~r~rHs~dk 291 (467)
T KOG3608|consen 235 ----------------------TNSFQCAQCFKRFATEKLLKSHVVRHV-NCYKCPLCDMTCSSASSLTTHIRYRHSKDK 291 (467)
T ss_pred ----------------------CCchHHHHHHHHHhHHHHHHHHHHHhh-hcccccccccCCCChHHHHHHHHhhhccCC
Confidence 124999999999999999999998764 569999999999999999999999999999
Q ss_pred ccccCcCCCccCChHHHhhCccccCCCCcccccc--cchhcCCHHHHHHHHhhhcCC--CCceeccccccccCCHHHHHH
Q psy14939 253 QHACQECGKTFATSSGLKQHTHIHSSVKPFRCEV--CHKAYTQFSNLCRHKRMHANC--RMQIKCVKCDQSFSTVTSLSK 328 (971)
Q Consensus 253 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~~~~~~~l~~H~~~h~~~--~~~~~C~~C~~~f~~~~~l~~ 328 (971)
+|+|..|.+.|.+.+.|.+|+.+|+ +..|.|.. |.+.+.....|++|++.++.. ..+|.|..|++.|.+..+|..
T Consensus 292 pfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~ 370 (467)
T KOG3608|consen 292 PFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSA 370 (467)
T ss_pred CccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHH
Confidence 9999999999999999999999887 67899988 999999999999999888732 446999999999999999999
Q ss_pred hhhhcCCCCC
Q psy14939 329 HKRFCDSTTP 338 (971)
Q Consensus 329 H~~~~~~~~~ 338 (971)
|++..|+-+.
T Consensus 371 HL~kkH~f~~ 380 (467)
T KOG3608|consen 371 HLMKKHGFRL 380 (467)
T ss_pred HHHHhhcccC
Confidence 9998887654
No 5
>KOG2462|consensus
Probab=99.89 E-value=5.4e-24 Score=205.63 Aligned_cols=134 Identities=32% Similarity=0.678 Sum_probs=126.3
Q ss_pred CCCcccCccchhccChhHHHhhHhhcCC----CCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHh
Q psy14939 195 PDQYRCESCSQTFCWRPHLNFHQAQVHG----RKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLK 270 (971)
Q Consensus 195 ~~~~~C~~C~~~f~~~~~L~~H~~~~~~----~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~ 270 (971)
...|+|..|++.+.+..+|..|...|-. +.+.|.+|+|.|.+.-.|..|+|+|. -+++|.+||+.|....-|+
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~---l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT---LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC---CCcccccccccccchHHhh
Confidence 3569999999999999999999998875 56999999999999999999999993 6799999999999999999
Q ss_pred hCccccCCCCcccccccchhcCCHHHHHHHHhhhcCCCCceeccccccccCCHHHHHHhhhh
Q psy14939 271 QHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKRMHANCRMQIKCVKCDQSFSTVTSLSKHKRF 332 (971)
Q Consensus 271 ~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 332 (971)
.|+++|+|||||.|..|+++|..+++|+.||++|.+.+ .|+|..|++.|...+.|.+|...
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K-~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVK-KHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCc-cccCcchhhHHHHHHHHHHhhhh
Confidence 99999999999999999999999999999999999855 89999999999999999999865
No 6
>KOG3608|consensus
Probab=99.87 E-value=9.5e-23 Score=200.50 Aligned_cols=232 Identities=23% Similarity=0.460 Sum_probs=193.4
Q ss_pred CCchHHHHHHhccccccCCCC------CCCC---Cceeccc--cccccCChHHHHHHHHHhcCCcceeccccccccCCHH
Q psy14939 5 ANKPFQEILTSQNNAVNGSSS------KTTK---DRYACKY--CGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISS 73 (971)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~------~~~~---~~~~C~~--C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 73 (971)
|.++|..+.+...+.+.|+-- ++++ ..+.|.+ |-+.|.+++.|++|+|.|+++|-..|+.||.-|+++.
T Consensus 142 Ce~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~t 221 (467)
T KOG3608|consen 142 CEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKT 221 (467)
T ss_pred cCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhcccc
Confidence 567777777777776655432 2222 2367866 9999999999999999999999999999999999999
Q ss_pred HHHHHHHhhc-CCCCCeecCCCCcccCchhhHHHHHhhhhccCCCCccccCCCCCCcccchhhhhhHHHHhhhcCCcccc
Q psy14939 74 NLQRHVRNIH-NKEKPFKCPLCDRCFGQQTNLDRHLKKHEADDGSGLVSMADSPESSNENEREDAYFDEIRSFMGKVTYS 152 (971)
Q Consensus 74 ~L~~H~~~~H-~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (971)
.|-.|+++.- ....+|.|..|.+.|.+...|..|+..|-.
T Consensus 222 kl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--------------------------------------- 262 (467)
T KOG3608|consen 222 KLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--------------------------------------- 262 (467)
T ss_pred HHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh---------------------------------------
Confidence 9999998432 234589999999999999999999988831
Q ss_pred CCcccccccccCCCCCCCCCCcCCCCChHHHHHHHhhcCCCCCCCcccCccchhccChhHHHhhHhhcCC--CCcccCCC
Q psy14939 153 GDVAYNQENLYTGNNPPSPLIDVKDDDDNELEDHLITGHRYPPDQYRCESCSQTFCWRPHLNFHQAQVHG--RKFPCENC 230 (971)
Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~--~~~~C~~C 230 (971)
-|+|+.|+.+....+.|..|++..|. ++|+|..|
T Consensus 263 --------------------------------------------~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~C 298 (467)
T KOG3608|consen 263 --------------------------------------------CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDEC 298 (467)
T ss_pred --------------------------------------------cccccccccCCCChHHHHHHHHhhhccCCCccccch
Confidence 29999999999999999999998775 78999999
Q ss_pred CCccCChhHHHhhHhhhccCCcccccCc--CCCccCChHHHhhCccccC-C--CCcccccccchhcCCHHHHHHHHhhhc
Q psy14939 231 TKVFSDPSNLQRHIRTHHVGARQHACQE--CGKTFATSSGLKQHTHIHS-S--VKPFRCEVCHKAYTQFSNLCRHKRMHA 305 (971)
Q Consensus 231 ~~~f~~~~~L~~H~~~hh~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~--~~~~~C~~C~~~~~~~~~l~~H~~~h~ 305 (971)
.+.|.+.+.|.+|+..| . +-.|.|.. |.+.|.....++.|++.|+ + +-+|.|..|.+.|++-.+|..|+...+
T Consensus 299 d~~c~~esdL~kH~~~H-S-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH 376 (467)
T KOG3608|consen 299 DTRCVRESDLAKHVQVH-S-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKH 376 (467)
T ss_pred hhhhccHHHHHHHHHhc-c-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhh
Confidence 99999999999999988 3 77899998 9999999999999987665 3 346999999999999999999987777
Q ss_pred CCCC-----ceeccccccccC
Q psy14939 306 NCRM-----QIKCVKCDQSFS 321 (971)
Q Consensus 306 ~~~~-----~~~C~~C~~~f~ 321 (971)
+.+. .|...+|.-+|-
T Consensus 377 ~f~~PsGh~RFtYk~~edG~m 397 (467)
T KOG3608|consen 377 GFRLPSGHKRFTYKVDEDGFM 397 (467)
T ss_pred cccCCCCCCceeeeeccCcee
Confidence 6443 355555555543
No 7
>KOG2462|consensus
Probab=99.87 E-value=9.4e-23 Score=197.09 Aligned_cols=104 Identities=37% Similarity=0.703 Sum_probs=98.7
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccccC
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHIHS 277 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 277 (971)
+.|..|++.|.+...|..|++.|. -++.|.+|||.|...=.|+.|+|+| +|||||.|+.|++.|..+++|+.||.+|.
T Consensus 162 ~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQGHiRTH-TGEKPF~C~hC~kAFADRSNLRAHmQTHS 239 (279)
T KOG2462|consen 162 FSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQGHIRTH-TGEKPFSCPHCGKAFADRSNLRAHMQTHS 239 (279)
T ss_pred ccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhcccccc-cCCCCccCCcccchhcchHHHHHHHHhhc
Confidence 889999999999999999998875 6789999999999999999999999 89999999999999999999999999999
Q ss_pred CCCcccccccchhcCCHHHHHHHHhh
Q psy14939 278 SVKPFRCEVCHKAYTQFSNLCRHKRM 303 (971)
Q Consensus 278 ~~~~~~C~~C~~~~~~~~~l~~H~~~ 303 (971)
+.|+|.|..|++.|...+.|.+|...
T Consensus 240 ~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 240 DVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred CCccccCcchhhHHHHHHHHHHhhhh
Confidence 99999999999999999999999765
No 8
>KOG3623|consensus
Probab=99.85 E-value=1.7e-20 Score=201.12 Aligned_cols=107 Identities=29% Similarity=0.553 Sum_probs=98.8
Q ss_pred CCCCCCcCCCCChHHHHHHHhhcCCCCCCCcccCccchhccChhHHHhhHhhcCC--------------CCcccCCCCCc
Q psy14939 168 PPSPLIDVKDDDDNELEDHLITGHRYPPDQYRCESCSQTFCWRPHLNFHQAQVHG--------------RKFPCENCTKV 233 (971)
Q Consensus 168 ~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~--------------~~~~C~~C~~~ 233 (971)
..|+.|......-..|..|+.-.|...+..|.|..|.++|+.+..|.+|+..|.. ++|+|..|+|.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 4678888888889999999999998888889999999999999999999988753 67999999999
Q ss_pred cCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccc
Q psy14939 234 FSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHI 275 (971)
Q Consensus 234 f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~ 275 (971)
|+.+..|+.|+|+| .|++||.|+.|++.|.....+-.||..
T Consensus 291 FKfKHHLKEHlRIH-SGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIH-SGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhHHHHHhhheee-cCCCCcCCcccccccccCCcccccccc
Confidence 99999999999999 899999999999999999999999743
No 9
>KOG3576|consensus
Probab=99.59 E-value=7.4e-16 Score=140.37 Aligned_cols=86 Identities=36% Similarity=0.810 Sum_probs=82.2
Q ss_pred CCCceeccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhhHHHH
Q psy14939 28 TKDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRH 107 (971)
Q Consensus 28 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H 107 (971)
+...|.|.+|+|.|.-...|.+|++.|...+.|-|..||+.|...-.|++|++ +|+|.+||+|..|++.|..+-.|..|
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~r-thtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTR-THTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhc-cccCccccchhhhhHHHHhhccHHHH
Confidence 35689999999999999999999999999999999999999999999999999 79999999999999999999999999
Q ss_pred Hhhhhcc
Q psy14939 108 LKKHEAD 114 (971)
Q Consensus 108 ~~~h~~~ 114 (971)
+++.|+.
T Consensus 193 l~kvhgv 199 (267)
T KOG3576|consen 193 LKKVHGV 199 (267)
T ss_pred HHHHcCc
Confidence 9998864
No 10
>KOG3576|consensus
Probab=99.56 E-value=1e-15 Score=139.45 Aligned_cols=114 Identities=29% Similarity=0.591 Sum_probs=97.6
Q ss_pred CCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCHHHHHHHHh
Q psy14939 223 RKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKR 302 (971)
Q Consensus 223 ~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~ 302 (971)
..|.|.+|+|.|.....|.+|++.| ..-+.|.|..||+.|.....|++|+++|+|.+||+|..|+++|+++-.|..|.+
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kch-~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKCH-SDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhhc-cHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 4588999999999999999999888 677889999999999999999999999999999999999999999999999987
Q ss_pred hhcCCC----------CceeccccccccCCHHHHHHhhhhcCCCC
Q psy14939 303 MHANCR----------MQIKCVKCDQSFSTVTSLSKHKRFCDSTT 337 (971)
Q Consensus 303 ~h~~~~----------~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 337 (971)
.-++.. +.|.|..||+.-.....+..|++.+|...
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 655532 35889999999999999999999887544
No 11
>PHA00733 hypothetical protein
Probab=99.05 E-value=1.3e-10 Score=104.71 Aligned_cols=85 Identities=25% Similarity=0.433 Sum_probs=73.2
Q ss_pred CCCCCCcccCCCCCccCChhhHHHH--HH---hhcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 757 KTTKDRYACKYCGKVFPRSANLTRH--LR---THTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 757 ~~~~~~~~C~~C~~~f~~~~~l~~H--~~---~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
....+++.|.+|.+.|.....|..| ++ .+.+++||.|+.|++.|.+...|.+|++ +| +.+|.|..|++.|..
T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r-~h--~~~~~C~~CgK~F~~ 111 (128)
T PHA00733 35 TPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIR-YT--EHSKVCPVCGKEFRN 111 (128)
T ss_pred ChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHh-cC--CcCccCCCCCCccCC
Confidence 3556789999999999988887766 22 3345899999999999999999999998 56 458999999999999
Q ss_pred hhHHHHHHhhhcC
Q psy14939 832 QTNLDRHLKKHEA 844 (971)
Q Consensus 832 ~~~l~~H~~~~~~ 844 (971)
...|.+|+...|+
T Consensus 112 ~~sL~~H~~~~h~ 124 (128)
T PHA00733 112 TDSTLDHVCKKHN 124 (128)
T ss_pred HHHHHHHHHHhcC
Confidence 9999999998875
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.97 E-value=1.1e-09 Score=121.64 Aligned_cols=120 Identities=18% Similarity=0.487 Sum_probs=84.6
Q ss_pred cccCc--cchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccc
Q psy14939 198 YRCES--CSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHI 275 (971)
Q Consensus 198 ~~C~~--C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~ 275 (971)
..|+. |+..| .+..+..| +.|..|++.|. ...|..|+++|| .++.|+ |++.+ .+..|..|+.+
T Consensus 434 V~Cp~~~Cg~v~-~r~el~~H--------~~C~~Cgk~f~-~s~LekH~~~~H---kpv~Cp-Cg~~~-~R~~L~~H~~t 498 (567)
T PLN03086 434 VVCPHDGCGIVL-RVEEAKNH--------VHCEKCGQAFQ-QGEMEKHMKVFH---EPLQCP-CGVVL-EKEQMVQHQAS 498 (567)
T ss_pred eeCCccccccee-eccccccC--------ccCCCCCCccc-hHHHHHHHHhcC---CCccCC-CCCCc-chhHHHhhhhc
Confidence 44653 77766 23334444 46888888885 577888888875 678888 88644 56788888888
Q ss_pred cCCCCcccccccchhcC----------CHHHHHHHHhhhcCCCCceeccccccccCCHHHHHHhhhhcCC
Q psy14939 276 HSSVKPFRCEVCHKAYT----------QFSNLCRHKRMHANCRMQIKCVKCDQSFSTVTSLSKHKRFCDS 335 (971)
Q Consensus 276 h~~~~~~~C~~C~~~~~----------~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 335 (971)
|...+++.|.+|++.|. ....|..|...+. ...+.|..|++.+.. ..|..|+..+|.
T Consensus 499 hCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG--~rt~~C~~Cgk~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 499 TCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICG--SRTAPCDSCGRSVML-KEMDIHQIAVHQ 565 (567)
T ss_pred cCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhcC--CcceEccccCCeeee-hhHHHHHHHhhc
Confidence 88888888888888774 2347888887753 346888888887664 457777776664
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.94 E-value=1.5e-09 Score=120.33 Aligned_cols=101 Identities=21% Similarity=0.477 Sum_probs=84.9
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCC----------hH
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFAT----------SS 267 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~----------~~ 267 (971)
+.|+.|++.|. ...|..|+..+| .++.|+ |++.+ .+..|..|+.+| ...+++.|.+|++.|.. ..
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H-kpv~Cp-Cg~~~-~R~~L~~H~~th-Cp~Kpi~C~fC~~~v~~g~~~~d~~d~~s 528 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH-EPLQCP-CGVVL-EKEQMVQHQAST-CPLRLITCRFCGDMVQAGGSAMDVRDRLR 528 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC-CCccCC-CCCCc-chhHHHhhhhcc-CCCCceeCCCCCCccccCccccchhhhhh
Confidence 78999999996 678999999986 789999 99765 678999999887 78999999999999842 35
Q ss_pred HHhhCccccCCCCcccccccchhcCCHHHHHHHHhhhc
Q psy14939 268 GLKQHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKRMHA 305 (971)
Q Consensus 268 ~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h~ 305 (971)
.|..|.... +.+++.|..|++.+..+ .|..|+...|
T Consensus 529 ~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h 564 (567)
T PLN03086 529 GMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIAVH 564 (567)
T ss_pred hHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHHhh
Confidence 789998875 88999999999998654 5778876544
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.92 E-value=5.9e-10 Score=80.81 Aligned_cols=43 Identities=26% Similarity=0.604 Sum_probs=32.6
Q ss_pred cccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhh
Q psy14939 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQ 807 (971)
Q Consensus 763 ~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~ 807 (971)
|.|+.||+.|.+..+|.+|+++|+ +||+|..|+|.|.+++.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 677777777777777777777777 6777777777777776664
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.87 E-value=6.6e-10 Score=80.57 Aligned_cols=43 Identities=23% Similarity=0.464 Sum_probs=39.7
Q ss_pred ccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCChhHH
Q psy14939 790 PYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNL 835 (971)
Q Consensus 790 p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l 835 (971)
-|.|+.||+.|++.++|.+||| +|+ ++|+|..|++.|.+...|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r-~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLR-KHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHH-hcC--CcccCCcccceeccccee
Confidence 3899999999999999999999 798 699999999999988766
No 16
>PHA00733 hypothetical protein
Probab=98.81 E-value=3.4e-09 Score=95.54 Aligned_cols=82 Identities=24% Similarity=0.438 Sum_probs=70.9
Q ss_pred CCceeccccccccCChHHHHHH--HHH---hcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhh
Q psy14939 29 KDRYACKYCGKVFPRSANLTRH--LRT---HTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTN 103 (971)
Q Consensus 29 ~~~~~C~~C~k~F~~~~~L~~H--~~~---H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~ 103 (971)
.+++.|.+|.+.|.....|..| ++. +.+.+||.|+.|++.|.+...|..|++ .| ..+|.|..|++.|.....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r-~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIR-YT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHh-cC--CcCccCCCCCCccCCHHH
Confidence 5789999999999988877776 332 345789999999999999999999998 45 457999999999999999
Q ss_pred HHHHHhhhhc
Q psy14939 104 LDRHLKKHEA 113 (971)
Q Consensus 104 L~~H~~~h~~ 113 (971)
|.+|+...|+
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 9999998875
No 17
>KOG3993|consensus
Probab=98.75 E-value=2.9e-09 Score=109.13 Aligned_cols=54 Identities=26% Similarity=0.602 Sum_probs=43.0
Q ss_pred ceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhhHHHHHhhhhc
Q psy14939 59 PYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKHEA 113 (971)
Q Consensus 59 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~ 113 (971)
.|.|.+|...|...-.|.+|... --....|+|+.|+|.|.--.+|..|.+.|..
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~-RIV~vEYrCPEC~KVFsCPANLASHRRWHKP 320 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCP-RIVHVEYRCPECDKVFSCPANLASHRRWHKP 320 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCC-eeEEeeecCCcccccccCchhhhhhhcccCC
Confidence 38999999999999999988541 1122349999999999999999999998854
No 18
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.69 E-value=7.6e-09 Score=64.16 Aligned_cols=25 Identities=76% Similarity=1.472 Sum_probs=17.8
Q ss_pred hHHHHHHhhcCCCccccCCCCCccc
Q psy14939 777 NLTRHLRTHTGEQPYKCKYCERSFS 801 (971)
Q Consensus 777 ~l~~H~~~h~~~~p~~C~~C~~~f~ 801 (971)
+|.+|+++|+|+|||.|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4667777777777777777777765
No 19
>KOG3993|consensus
Probab=98.56 E-value=1.2e-08 Score=104.73 Aligned_cols=84 Identities=27% Similarity=0.593 Sum_probs=69.8
Q ss_pred CceeccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcCC------------------------
Q psy14939 30 DRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNK------------------------ 85 (971)
Q Consensus 30 ~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~------------------------ 85 (971)
.-|.|..|...|...-.|.+|.-.--.--.|+|+.|+|.|+...+|..|.| +|-.
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRR-WHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRR-WHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhc-ccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 358999999999999999999632222234999999999999999999987 7731
Q ss_pred ---------CCCeecCCCCcccCchhhHHHHHhhhhcc
Q psy14939 86 ---------EKPFKCPLCDRCFGQQTNLDRHLKKHEAD 114 (971)
Q Consensus 86 ---------~k~~~C~~C~~~f~~~~~L~~H~~~h~~~ 114 (971)
+-.|.|.+|++.|+.+..|++|+.+|+..
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~ 382 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRA 382 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhcc
Confidence 12499999999999999999999998754
No 20
>KOG1146|consensus
Probab=98.41 E-value=1.4e-06 Score=102.95 Aligned_cols=86 Identities=16% Similarity=0.334 Sum_probs=63.5
Q ss_pred CCCcccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhhhhhhh----------------ccCCCCCccCC
Q psy14939 760 KDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRN----------------IHNKEKPFKCP 823 (971)
Q Consensus 760 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~~~----------------~h~~~~~~~C~ 823 (971)
.+.+.|..|.+.|.-.-.+. |+-. +++|.|..|...|.....|..|.+. .|..-+.| |.
T Consensus 1258 sGe~~c~~~~~~~~~~~~~~-~l~~---~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~ 1332 (1406)
T KOG1146|consen 1258 SGEGECGAVDELLTPSFGIS-TLDV---THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CL 1332 (1406)
T ss_pred CCcchhhhccccccCcccee-eccc---chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-ch
Confidence 45578888888887766666 5432 3678888888888888888888652 23334456 99
Q ss_pred CCCcccCChhHHHHHHhhhcCCCCCCC
Q psy14939 824 LCDRCFGQQTNLDRHLKKHEADDGSGL 850 (971)
Q Consensus 824 ~C~~~f~~~~~l~~H~~~~~~~~~~~~ 850 (971)
.|...|.....|..||+..++......
T Consensus 1333 ~c~~~~~~~~alqihm~~~~~~~kt~~ 1359 (1406)
T KOG1146|consen 1333 ACEVLLSGREALQIHMRSSAHRRKTAP 1359 (1406)
T ss_pred HHHhhcchhHHHHHHHHHhhhcccCCC
Confidence 999999999999999999766555443
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.41 E-value=1.7e-07 Score=58.17 Aligned_cols=25 Identities=76% Similarity=1.472 Sum_probs=18.3
Q ss_pred HHHHHHHHhcCCcceeccccccccC
Q psy14939 46 NLTRHLRTHTGEQPYKCKYCERSFS 70 (971)
Q Consensus 46 ~L~~H~~~H~~~~~~~C~~C~~~f~ 70 (971)
.|.+|+++|++++||.|++|++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 3677777777777777777777775
No 22
>PHA00616 hypothetical protein
Probab=98.22 E-value=5.6e-07 Score=62.08 Aligned_cols=39 Identities=23% Similarity=0.506 Sum_probs=30.9
Q ss_pred ceeccccccccCChHHHHHHHHHhcCCcceecccccccc
Q psy14939 31 RYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSF 69 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f 69 (971)
+|+|+.||+.|...+.|..|++.|+|+++|.|++=-..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEE
Confidence 578888888888888888888888888888887644333
No 23
>PHA00732 hypothetical protein
Probab=98.20 E-value=8e-07 Score=72.00 Aligned_cols=43 Identities=33% Similarity=0.667 Sum_probs=25.4
Q ss_pred cccCCCCCccCChhhHHHHHHh-hcCCCccccCCCCCccccchhhhhhhh
Q psy14939 763 YACKYCGKVFPRSANLTRHLRT-HTGEQPYKCKYCERSFSISSNLQRHVR 811 (971)
Q Consensus 763 ~~C~~C~~~f~~~~~l~~H~~~-h~~~~p~~C~~C~~~f~~~~~l~~H~~ 811 (971)
|.|..||+.|.+...|+.|++. |+ ++.|+.|+++|. .|.+|.+
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~ 45 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFY 45 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhc
Confidence 5566666666666666666653 43 245666666665 3556654
No 24
>PHA00616 hypothetical protein
Probab=98.13 E-value=9.2e-07 Score=61.03 Aligned_cols=34 Identities=24% Similarity=0.588 Sum_probs=23.9
Q ss_pred CcccCCCCCccCChhhHHHHHHhhcCCCccccCC
Q psy14939 762 RYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKY 795 (971)
Q Consensus 762 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~ 795 (971)
+|.|..||+.|..++.|.+|++.|+|++|+.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 3667777777777777777777777777776654
No 25
>PHA00732 hypothetical protein
Probab=97.98 E-value=5.6e-06 Score=67.12 Aligned_cols=47 Identities=28% Similarity=0.662 Sum_probs=41.0
Q ss_pred ccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCChhHHHHHHhhh
Q psy14939 790 PYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKH 842 (971)
Q Consensus 790 p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 842 (971)
||.|..|++.|.+...|++|++..|. ++.|+.|++.|. .|..|+++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 68999999999999999999984465 468999999998 588998654
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.93 E-value=1.2e-05 Score=60.48 Aligned_cols=50 Identities=34% Similarity=0.717 Sum_probs=30.5
Q ss_pred eeccccccccCCHHHHHHHHHhhcCCC-CCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 60 YKCKYCERSFSISSNLQRHVRNIHNKE-KPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L~~H~~~~H~~~-k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
|.|++|++. .+...|..|+...|..+ +.+.|++|...+. .+|..||..+|
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 666667663 34566666766666543 3566777766544 36667766654
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.87 E-value=1.4e-05 Score=60.19 Aligned_cols=50 Identities=38% Similarity=0.809 Sum_probs=27.9
Q ss_pred cccCCCCCccccchhhhhhhhhccCCC-CCccCCCCCcccCChhHHHHHHhhhc
Q psy14939 791 YKCKYCERSFSISSNLQRHVRNIHNKE-KPFKCPLCDRCFGQQTNLDRHLKKHE 843 (971)
Q Consensus 791 ~~C~~C~~~f~~~~~l~~H~~~~h~~~-~~~~C~~C~~~f~~~~~l~~H~~~~~ 843 (971)
|.|++|++.|. ...|.+|....|..+ +.+.||+|...+. .+|.+||+.+|
T Consensus 3 f~CP~C~~~~~-~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKGFS-ESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCccC-HHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 55666666333 345666655555543 3466666666543 36666666554
No 28
>KOG2231|consensus
Probab=97.66 E-value=0.00029 Score=79.86 Aligned_cols=51 Identities=31% Similarity=0.664 Sum_probs=27.7
Q ss_pred ccCCCCCccCChhHHHhhHhhhccCCcccccCcCC------CccCChHHHhhCccccCCCCccccc
Q psy14939 226 PCENCTKVFSDPSNLQRHIRTHHVGARQHACQECG------KTFATSSGLKQHTHIHSSVKPFRCE 285 (971)
Q Consensus 226 ~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~------~~f~~~~~l~~H~~~h~~~~~~~C~ 285 (971)
.|.+|...|.....|.+|++.+| |.|.+|. .-|.....|..|.+.+| |.|.
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE 240 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCE 240 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC----cccc
Confidence 46666666666666666665554 4555542 33444555555555444 5555
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.58 E-value=2e-05 Score=78.13 Aligned_cols=70 Identities=23% Similarity=0.528 Sum_probs=38.8
Q ss_pred CCCcccCC--CCCccCChhhHHHHHH-hhcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCChhHHH
Q psy14939 760 KDRYACKY--CGKVFPRSANLTRHLR-THTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLD 836 (971)
Q Consensus 760 ~~~~~C~~--C~~~f~~~~~l~~H~~-~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~ 836 (971)
.|+|+|++ |.|.|+++..|+-|+. .|...+...=+ .-..|. ..-.+.|||.|++|+|+++....|+
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~-~F~~~~KPYrCevC~KRYKNlNGLK 415 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMN-IFSAKDKPYRCEVCDKRYKNLNGLK 415 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------Cccccc-cccccCCceeccccchhhccCccce
Confidence 47777876 7777877777777776 34222211100 000000 0112456777777777777777777
Q ss_pred HHHh
Q psy14939 837 RHLK 840 (971)
Q Consensus 837 ~H~~ 840 (971)
-|.+
T Consensus 416 YHr~ 419 (423)
T COG5189 416 YHRK 419 (423)
T ss_pred eccc
Confidence 6654
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.49 E-value=7.6e-05 Score=45.01 Aligned_cols=23 Identities=52% Similarity=1.132 Sum_probs=13.0
Q ss_pred eeccccccccCChHHHHHHHHHh
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTH 54 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H 54 (971)
|+|+.|++.|.+...|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45555555555555555555543
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.47 E-value=6.2e-05 Score=45.40 Aligned_cols=22 Identities=41% Similarity=1.247 Sum_probs=12.0
Q ss_pred ccCCCCCcccCChhHHHHHHhh
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLKK 841 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~~ 841 (971)
|+|+.|++.|.++..|.+|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555554
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.12 E-value=0.00031 Score=42.87 Aligned_cols=24 Identities=33% Similarity=1.083 Sum_probs=13.5
Q ss_pred ccCCCCCcccCChhHHHHHHhhhc
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLKKHE 843 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~~~~ 843 (971)
|.|++|++.|.....|.+|+++||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666553
No 33
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.07 E-value=0.00017 Score=71.76 Aligned_cols=53 Identities=32% Similarity=0.801 Sum_probs=41.7
Q ss_pred Ccceeccc--cccccCCHHHHHHHHHhhc------------------CCCCCeecCCCCcccCchhhHHHHHh
Q psy14939 57 EQPYKCKY--CERSFSISSNLQRHVRNIH------------------NKEKPFKCPLCDRCFGQQTNLDRHLK 109 (971)
Q Consensus 57 ~~~~~C~~--C~~~f~~~~~L~~H~~~~H------------------~~~k~~~C~~C~~~f~~~~~L~~H~~ 109 (971)
+|||+|++ |.|.++....|+.|+.--| .+.|||+|.+|++.+.....|+.|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 47888877 8888888888888876323 12589999999999999999998864
No 34
>KOG1146|consensus
Probab=97.04 E-value=0.00021 Score=85.34 Aligned_cols=54 Identities=19% Similarity=0.377 Sum_probs=43.3
Q ss_pred eccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhhHHHHHhhhhcc
Q psy14939 61 KCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKHEAD 114 (971)
Q Consensus 61 ~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~ 114 (971)
.|.-|+..+..+..+--|+...|.-.+.|+|+.|+..|.....|..|||..|.+
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~ 491 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPE 491 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccc
Confidence 455667777777777778777788788899999999999999999999886644
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.96 E-value=0.0005 Score=60.28 Aligned_cols=74 Identities=20% Similarity=0.586 Sum_probs=22.5
Q ss_pred eccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 33 ACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 33 ~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
+|.+|+..|.+...|..||...++-. .. ....+.....|..+++. .....+.|.+|++.|.+...|..||+.++
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~---~~-~~~~l~~~~~~~~~~~~--~~~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFD---IP-DQKYLVDPNRLLNYLRK--KVKESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred Cccccccccccccccccccccccccc---cc-ccccccccccccccccc--ccCCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 59999999999999999997765432 11 22223345555555542 22336999999999999999999999763
No 36
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.93 E-value=0.00071 Score=41.21 Aligned_cols=19 Identities=42% Similarity=0.934 Sum_probs=6.6
Q ss_pred ccccccccCCHHHHHHHHH
Q psy14939 62 CKYCERSFSISSNLQRHVR 80 (971)
Q Consensus 62 C~~C~~~f~~~~~L~~H~~ 80 (971)
|++|++.|.+...|..|++
T Consensus 3 C~~C~~~~~~~~~l~~H~~ 21 (24)
T PF13894_consen 3 CPICGKSFRSKSELRQHMR 21 (24)
T ss_dssp -SSTS-EESSHHHHHHHHH
T ss_pred CcCCCCcCCcHHHHHHHHH
Confidence 3333333333333333333
No 37
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.75 E-value=0.00095 Score=58.46 Aligned_cols=74 Identities=23% Similarity=0.381 Sum_probs=16.7
Q ss_pred ccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCHHHHHHHHhhhc
Q psy14939 226 PCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQFSNLCRHKRMHA 305 (971)
Q Consensus 226 ~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h~ 305 (971)
+|.+|+..|.+...|..|+...|.-..+ ....+.....+..+++... ...|.|.+|++.|.+...|..||+.++
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred Cccccccccccccccccccccccccccc-----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 3667777777777777777655532111 1111112222333322211 124677777777777777777776543
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.75 E-value=0.0008 Score=42.37 Aligned_cols=23 Identities=39% Similarity=0.897 Sum_probs=12.5
Q ss_pred eeccccccccCChHHHHHHHHHh
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTH 54 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H 54 (971)
|+|..|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 45555555555555555555544
No 39
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.73 E-value=0.00087 Score=42.21 Aligned_cols=25 Identities=32% Similarity=0.781 Sum_probs=15.5
Q ss_pred CccCCCCCcccCChhHHHHHHhhhc
Q psy14939 819 PFKCPLCDRCFGQQTNLDRHLKKHE 843 (971)
Q Consensus 819 ~~~C~~C~~~f~~~~~l~~H~~~~~ 843 (971)
||.|..|++.|.....|..|++.|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4566666666666666666666554
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.42 E-value=0.0028 Score=44.85 Aligned_cols=39 Identities=28% Similarity=0.600 Sum_probs=20.7
Q ss_pred hhhhhhccCCCCCccCCCCCcccCChhHHHHHHhhhcCC
Q psy14939 807 QRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKHEAD 845 (971)
Q Consensus 807 ~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 845 (971)
..+.+..+..+.|-.|++|+..+.+..+|++|+.++|..
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~ 50 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFK 50 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTT
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcc
Confidence 344443445566777888888888888888887776654
No 41
>PRK04860 hypothetical protein; Provisional
Probab=96.14 E-value=0.0029 Score=59.41 Aligned_cols=41 Identities=29% Similarity=0.709 Sum_probs=32.0
Q ss_pred CCcccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchh
Q psy14939 761 DRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSN 805 (971)
Q Consensus 761 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~ 805 (971)
-+|.|. |++ ....+.+|.++|+++++|.|..|++.|.....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEecc
Confidence 358887 887 55788888888888888888888888776543
No 42
>KOG2231|consensus
Probab=96.01 E-value=0.012 Score=67.30 Aligned_cols=71 Identities=25% Similarity=0.452 Sum_probs=49.0
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCC------CCccCChhHHHhhHhhhccCCcccccC--cCCC-ccCChHH
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENC------TKVFSDPSNLQRHIRTHHVGARQHACQ--ECGK-TFATSSG 268 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C------~~~f~~~~~L~~H~~~hh~~~~~~~C~--~C~~-~f~~~~~ 268 (971)
-.|..|...|.....|..|++.+| |.|.+| +..|.....|..|.|.+| |.|. .|.- .|.....
T Consensus 183 p~C~~C~~~fld~~el~rH~~~~h---~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~~~C~~~~f~~~~~ 254 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHLRFDH---EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEEEFCRTKKFYVAFE 254 (669)
T ss_pred ccchhhhhhhccHHHHHHhhccce---eheeecCcccccchhcccchHHHHHhhhcC-----ccccccccccceeeehhH
Confidence 458888888888888888887766 455555 456778899999999887 8888 5753 3433334
Q ss_pred HhhCcccc
Q psy14939 269 LKQHTHIH 276 (971)
Q Consensus 269 l~~H~~~h 276 (971)
+..|++.|
T Consensus 255 ~ei~lk~~ 262 (669)
T KOG2231|consen 255 LEIELKAH 262 (669)
T ss_pred HHHHHHhh
Confidence 44444433
No 43
>PRK04860 hypothetical protein; Provisional
Probab=95.93 E-value=0.0032 Score=59.11 Aligned_cols=41 Identities=27% Similarity=0.679 Sum_probs=36.5
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCChhH
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTN 834 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~ 834 (971)
-+|.|. |++ ....+++|.+ +|+++++|.|..|+..|.....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~r-i~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNR-VVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHH-HhcCCccEECCCCCceeEEecc
Confidence 479998 998 8889999999 8999999999999999986553
No 44
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.91 E-value=0.0068 Score=36.82 Aligned_cols=23 Identities=48% Similarity=1.047 Sum_probs=11.8
Q ss_pred eecCCCCcccCchhhHHHHHhhhh
Q psy14939 89 FKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 89 ~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
|+|+.|++... ...|.+|++.||
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45555655555 555556655554
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.71 E-value=0.0081 Score=37.15 Aligned_cols=23 Identities=52% Similarity=1.067 Sum_probs=12.9
Q ss_pred eeccccccccCChHHHHHHHHHh
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTH 54 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H 54 (971)
|+|..|++.|.....|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34555555555555555555544
No 46
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.71 E-value=0.0086 Score=37.03 Aligned_cols=23 Identities=30% Similarity=1.009 Sum_probs=17.8
Q ss_pred ccCCCCCcccCChhHHHHHHhhh
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLKKH 842 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~~~ 842 (971)
|.|..|++.|.....|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 56778888888888888887765
No 47
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.65 E-value=0.0071 Score=42.89 Aligned_cols=30 Identities=13% Similarity=0.273 Sum_probs=11.8
Q ss_pred CCceeccccccccCCHHHHHHhhhhcCCCC
Q psy14939 308 RMQIKCVKCDQSFSTVTSLSKHKRFCDSTT 337 (971)
Q Consensus 308 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 337 (971)
..+-.|++|+..+....+|++|+.+.|+..
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 334444444444444444444444444443
No 48
>KOG2482|consensus
Probab=95.53 E-value=0.022 Score=57.99 Aligned_cols=135 Identities=23% Similarity=0.388 Sum_probs=87.9
Q ss_pred CCcccCccchhcc-ChhHHHhhHhhcCC-----------------------CCcccCCCCCccCChhHHHhhHhhh-ccC
Q psy14939 196 DQYRCESCSQTFC-WRPHLNFHQAQVHG-----------------------RKFPCENCTKVFSDPSNLQRHIRTH-HVG 250 (971)
Q Consensus 196 ~~~~C~~C~~~f~-~~~~L~~H~~~~~~-----------------------~~~~C~~C~~~f~~~~~L~~H~~~h-h~~ 250 (971)
...+|-+|...+. .++....|+...|+ ..+.|-+|.+.|+.+..|+.|||.. |..
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrr 222 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRR 222 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCcc
Confidence 3478989988764 66777788776664 2478999999999999999999754 221
Q ss_pred Ccc--------cccCc--CCCccCC-hHHHhhCcccc--C-------------CCCc--ccccccchhcCCHHHHHHHHh
Q psy14939 251 ARQ--------HACQE--CGKTFAT-SSGLKQHTHIH--S-------------SVKP--FRCEVCHKAYTQFSNLCRHKR 302 (971)
Q Consensus 251 ~~~--------~~C~~--C~~~f~~-~~~l~~H~~~h--~-------------~~~~--~~C~~C~~~~~~~~~l~~H~~ 302 (971)
-.| |.-.+ -|+.... .+.+. +.+- . +..+ .+|..|....-+...|..||+
T Consensus 223 inPknreYDkfyiINY~ev~ks~t~~~~e~d--ret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk 300 (423)
T KOG2482|consen 223 INPKNREYDKFYIINYLEVGKSWTIVHSEDD--RETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMK 300 (423)
T ss_pred cCCCccccceEEEEeHhhcCCccchhhhhhh--hhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHH
Confidence 111 21111 1222111 11110 1110 0 1112 589999999999999999998
Q ss_pred hhcCCC--------------------------CceeccccccccCCHHHHHHhhhh
Q psy14939 303 MHANCR--------------------------MQIKCVKCDQSFSTVTSLSKHKRF 332 (971)
Q Consensus 303 ~h~~~~--------------------------~~~~C~~C~~~f~~~~~l~~H~~~ 332 (971)
.-|... ..-.|..|...|.....|..|+.-
T Consensus 301 ~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 301 IVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred HHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 765310 124589999999999999999864
No 49
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.51 E-value=0.013 Score=59.49 Aligned_cols=75 Identities=23% Similarity=0.331 Sum_probs=53.3
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCC----CCccCChhHHHhhHhhhccCCcccccCc--CC----CccCChH
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENC----TKVFSDPSNLQRHIRTHHVGARQHACQE--CG----KTFATSS 267 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C----~~~f~~~~~L~~H~~~hh~~~~~~~C~~--C~----~~f~~~~ 267 (971)
-.|.+|...|-....|..|++..|.+-|.|..= ..-|++...|.+|.+.-| |.|.+ |- ..|....
T Consensus 221 P~C~FC~~~FYdDDEL~~HcR~~HE~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~ 295 (493)
T COG5236 221 PLCIFCKIYFYDDDELRRHCRLRHEACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHT 295 (493)
T ss_pred chhhhccceecChHHHHHHHHhhhhhhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHH
Confidence 468888888888888888888877753333222 235888889999987654 77764 53 4688888
Q ss_pred HHhhCccccC
Q psy14939 268 GLKQHTHIHS 277 (971)
Q Consensus 268 ~l~~H~~~h~ 277 (971)
.|..|+..-+
T Consensus 296 el~~h~~~~h 305 (493)
T COG5236 296 ELLEHLTRFH 305 (493)
T ss_pred HHHHHHHHHh
Confidence 8888875443
No 50
>KOG2785|consensus
Probab=95.48 E-value=0.015 Score=60.79 Aligned_cols=135 Identities=17% Similarity=0.333 Sum_probs=92.1
Q ss_pred cccCccchhccChhHHHhhHhhcC--------------------------------------CCCcccCCCCCccCChhH
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVH--------------------------------------GRKFPCENCTKVFSDPSN 239 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~--------------------------------------~~~~~C~~C~~~f~~~~~ 239 (971)
|.|..|...|.+...-+.|+++-- +..+.|..|.+.|.+...
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~a 83 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPKA 83 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChhh
Confidence 999999999999888888876521 024789999999999999
Q ss_pred HHhhHhhhc--------c--C------Ccccc-------------cCcCCCccCChHHHhhCccc------c------CC
Q psy14939 240 LQRHIRTHH--------V--G------ARQHA-------------CQECGKTFATSSGLKQHTHI------H------SS 278 (971)
Q Consensus 240 L~~H~~~hh--------~--~------~~~~~-------------C~~C~~~f~~~~~l~~H~~~------h------~~ 278 (971)
-..|+.... . + ...+. +..+-..+........+... - ..
T Consensus 84 ~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~e 163 (390)
T KOG2785|consen 84 HENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDEDE 163 (390)
T ss_pred HHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhcc
Confidence 999986431 0 0 01122 22232222221111111111 0 00
Q ss_pred CCcccccccchhcCCHHHHHHHHhhhcCCCC----------------------ceeccccc---cccCCHHHHHHhhhh
Q psy14939 279 VKPFRCEVCHKAYTQFSNLCRHKRMHANCRM----------------------QIKCVKCD---QSFSTVTSLSKHKRF 332 (971)
Q Consensus 279 ~~~~~C~~C~~~~~~~~~l~~H~~~h~~~~~----------------------~~~C~~C~---~~f~~~~~l~~H~~~ 332 (971)
.-|-.|..|++.+.+...-..||..+|+... -|.|-.|+ +.|.+....+.||+-
T Consensus 164 ~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 164 LIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred cCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 1246799999999999999999999987432 26788898 999999999999975
No 51
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.37 E-value=0.014 Score=35.45 Aligned_cols=23 Identities=52% Similarity=0.908 Sum_probs=12.8
Q ss_pred eeccccccccCCHHHHHHHHHhhc
Q psy14939 60 YKCKYCERSFSISSNLQRHVRNIH 83 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L~~H~~~~H 83 (971)
|+|+.|++... ...|..|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 55666666665 666666666444
No 52
>KOG2785|consensus
Probab=95.29 E-value=0.06 Score=56.40 Aligned_cols=75 Identities=16% Similarity=0.240 Sum_probs=46.5
Q ss_pred CCcccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCC---CccCChHHHhhC
Q psy14939 196 DQYRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECG---KTFATSSGLKQH 272 (971)
Q Consensus 196 ~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~---~~f~~~~~l~~H 272 (971)
.+-.|-+|++.+.+......||..+|+.-.. . ..-......|..-+... -..-+.|.+|+ +.|.+-...+.|
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIP--d-reYL~D~~GLl~YLgeK--V~~~~~CL~CN~~~~~f~sleavr~H 239 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIP--D-REYLTDEKGLLKYLGEK--VGIGFICLFCNELGRPFSSLEAVRAH 239 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCcCC--c-hHhhhchhHHHHHHHHH--hccCceEEEeccccCcccccHHHHHH
Confidence 3477889999999988889999888872100 0 00112233333333222 12347788887 788888888888
Q ss_pred ccc
Q psy14939 273 THI 275 (971)
Q Consensus 273 ~~~ 275 (971)
|..
T Consensus 240 M~~ 242 (390)
T KOG2785|consen 240 MRD 242 (390)
T ss_pred Hhh
Confidence 753
No 53
>KOG2893|consensus
Probab=95.19 E-value=0.0066 Score=58.12 Aligned_cols=46 Identities=26% Similarity=0.676 Sum_probs=33.7
Q ss_pred ccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhc
Q psy14939 34 CKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIH 83 (971)
Q Consensus 34 C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H 83 (971)
|=+|++.|....-|.+|++..+ |+|.+|.+...+--.|..|-..+|
T Consensus 13 cwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhh
Confidence 7888888888888888887644 888888777766666766644344
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.02 E-value=0.014 Score=35.82 Aligned_cols=21 Identities=38% Similarity=0.893 Sum_probs=10.8
Q ss_pred eeccccccccCChHHHHHHHH
Q psy14939 32 YACKYCGKVFPRSANLTRHLR 52 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~ 52 (971)
|.|..|++.|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 345555555555555555554
No 55
>KOG2482|consensus
Probab=95.00 E-value=0.023 Score=57.84 Aligned_cols=82 Identities=27% Similarity=0.599 Sum_probs=66.4
Q ss_pred CceeccccccccCChHHHHHHHHH--hcC---------------------------------------------------
Q psy14939 30 DRYACKYCGKVFPRSANLTRHLRT--HTG--------------------------------------------------- 56 (971)
Q Consensus 30 ~~~~C~~C~k~F~~~~~L~~H~~~--H~~--------------------------------------------------- 56 (971)
.+++|-+|.|.|+.+..|+.|||. |..
T Consensus 194 ~r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed 273 (423)
T KOG2482|consen 194 ERLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNED 273 (423)
T ss_pred hhheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcC
Confidence 478999999999999999999985 310
Q ss_pred -Ccc--eeccccccccCCHHHHHHHHHhhcCC--------------------------CCCeecCCCCcccCchhhHHHH
Q psy14939 57 -EQP--YKCKYCERSFSISSNLQRHVRNIHNK--------------------------EKPFKCPLCDRCFGQQTNLDRH 107 (971)
Q Consensus 57 -~~~--~~C~~C~~~f~~~~~L~~H~~~~H~~--------------------------~k~~~C~~C~~~f~~~~~L~~H 107 (971)
..+ ..|-.|.....+...|..||+.+|.- .+.-.|..|+..|.....|..|
T Consensus 274 ~a~a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~h 353 (423)
T KOG2482|consen 274 DAEALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIH 353 (423)
T ss_pred CCCccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhh
Confidence 011 58999999999999999999988842 1224689999999999999999
Q ss_pred Hhhh
Q psy14939 108 LKKH 111 (971)
Q Consensus 108 ~~~h 111 (971)
|..+
T Consensus 354 m~e~ 357 (423)
T KOG2482|consen 354 MVED 357 (423)
T ss_pred cccc
Confidence 8655
No 56
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=94.90 E-value=0.033 Score=54.50 Aligned_cols=84 Identities=24% Similarity=0.481 Sum_probs=49.5
Q ss_pred CCCCCcccCCCCCccCChhhHHHHHHhhc--CCCccccCCCCCccccchhh-------hhhhhh---ccCCCCCccCCCC
Q psy14939 758 TTKDRYACKYCGKVFPRSANLTRHLRTHT--GEQPYKCKYCERSFSISSNL-------QRHVRN---IHNKEKPFKCPLC 825 (971)
Q Consensus 758 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~--~~~p~~C~~C~~~f~~~~~l-------~~H~~~---~h~~~~~~~C~~C 825 (971)
.+-+.|.|.+|... .-..+--.|+..-. ....|+|.-|+|.=. -+-| .-|.|. .-...+++.||.|
T Consensus 138 hGGrif~CsfC~~f-lCEDDQFEHQAsCQvLe~E~~KC~SCNrlGq-~sCLRCK~cfCddHvrrKg~ky~k~k~~PCPKC 215 (314)
T PF06524_consen 138 HGGRIFKCSFCDNF-LCEDDQFEHQASCQVLESETFKCQSCNRLGQ-YSCLRCKICFCDDHVRRKGFKYEKGKPIPCPKC 215 (314)
T ss_pred CCCeEEEeecCCCe-eeccchhhhhhhhhhhhcccccccccccccc-hhhhheeeeehhhhhhhcccccccCCCCCCCCC
Confidence 45567899998863 33444455665432 234577766664211 1111 123331 1234579999999
Q ss_pred CcccCChhHHHHHHhhhc
Q psy14939 826 DRCFGQQTNLDRHLKKHE 843 (971)
Q Consensus 826 ~~~f~~~~~l~~H~~~~~ 843 (971)
+.....-..|.--.|+|.
T Consensus 216 g~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 216 GYETQETKDLSMSTRSHK 233 (314)
T ss_pred CCcccccccceeeeecch
Confidence 998888888877666664
No 57
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.58 E-value=0.02 Score=35.11 Aligned_cols=23 Identities=30% Similarity=0.931 Sum_probs=15.6
Q ss_pred ccCCCCCcccCChhHHHHHHhhh
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLKKH 842 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~~~ 842 (971)
|.|.+|++.|.....|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 55677777777777777776653
No 58
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.10 E-value=0.045 Score=55.68 Aligned_cols=129 Identities=22% Similarity=0.415 Sum_probs=89.9
Q ss_pred cccCc--cchhccChhHHHhhHhhcCCCCcccCCCC---CccC------ChhHHHhhHhhhccCC---cccccCcCCCcc
Q psy14939 198 YRCES--CSQTFCWRPHLNFHQAQVHGRKFPCENCT---KVFS------DPSNLQRHIRTHHVGA---RQHACQECGKTF 263 (971)
Q Consensus 198 ~~C~~--C~~~f~~~~~L~~H~~~~~~~~~~C~~C~---~~f~------~~~~L~~H~~~hh~~~---~~~~C~~C~~~f 263 (971)
|.|+. |.........|..|.+..|++ +.|.+|- +.|. +...|+.|...--.++ ..-.|.+|...|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~-~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~F 230 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF-VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYF 230 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc-EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhcccee
Confidence 77864 777777888999999988764 5677763 2343 4466777764331111 123699999999
Q ss_pred CChHHHhhCccccCCCCcccccccc-------hhcCCHHHHHHHHhhhcCCCCceeccc--cc----cccCCHHHHHHhh
Q psy14939 264 ATSSGLKQHTHIHSSVKPFRCEVCH-------KAYTQFSNLCRHKRMHANCRMQIKCVK--CD----QSFSTVTSLSKHK 330 (971)
Q Consensus 264 ~~~~~l~~H~~~h~~~~~~~C~~C~-------~~~~~~~~l~~H~~~h~~~~~~~~C~~--C~----~~f~~~~~l~~H~ 330 (971)
..-..|..|++..+ -.|-+|. ..|.....|..|.+.-| |.|.. |- +.|.....|..|+
T Consensus 231 YdDDEL~~HcR~~H----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~el~~h~ 301 (493)
T COG5236 231 YDDDELRRHCRLRH----EACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHTELLEHL 301 (493)
T ss_pred cChHHHHHHHHhhh----hhhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHHHHHHHH
Confidence 99999999987654 3577776 46788888999987433 55533 22 5799999999998
Q ss_pred hhcCCC
Q psy14939 331 RFCDST 336 (971)
Q Consensus 331 ~~~~~~ 336 (971)
-.-|+.
T Consensus 302 ~~~h~~ 307 (493)
T COG5236 302 TRFHKV 307 (493)
T ss_pred HHHhhc
Confidence 665544
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.30 E-value=0.041 Score=34.46 Aligned_cols=21 Identities=29% Similarity=0.861 Sum_probs=14.3
Q ss_pred cccCCCCCccccchhhhhhhh
Q psy14939 791 YKCKYCERSFSISSNLQRHVR 811 (971)
Q Consensus 791 ~~C~~C~~~f~~~~~l~~H~~ 811 (971)
|.|..|++.|.....|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 566667777777777766665
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.22 E-value=0.021 Score=65.87 Aligned_cols=61 Identities=41% Similarity=0.766 Sum_probs=36.7
Q ss_pred ceeccccccccCChHHHHHHHH--HhcCC--cceecc--ccccccCCHHHHHHHHHhhcCCCCCeecC
Q psy14939 31 RYACKYCGKVFPRSANLTRHLR--THTGE--QPYKCK--YCERSFSISSNLQRHVRNIHNKEKPFKCP 92 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~k~~~C~ 92 (971)
++.|..|...|.....|..|++ .|+++ +++.|+ .|++.|.....|..|.. .|.+.+++.|.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 355 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHIL-LHTSISPAKEK 355 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcc-cccCCCccccc
Confidence 4566666666666666666666 56666 666666 56666666666666655 45555555443
No 61
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.13 E-value=0.04 Score=63.57 Aligned_cols=73 Identities=38% Similarity=0.666 Sum_probs=64.0
Q ss_pred CCcccCCCCCccCChhhHHHHHH--hhcCC--CccccC--CCCCccccchhhhhhhhhccCCCCCccCCC--CCcccCCh
Q psy14939 761 DRYACKYCGKVFPRSANLTRHLR--THTGE--QPYKCK--YCERSFSISSNLQRHVRNIHNKEKPFKCPL--CDRCFGQQ 832 (971)
Q Consensus 761 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~p~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~--C~~~f~~~ 832 (971)
..+.|..|...|.+...|.+|.+ .|+++ +|+.|+ .|++.|.+.+.+.+|.. +|.+.+++.|.. |.+.+...
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 366 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHIL-LHTSISPAKEKLLNSSSKFSPL 366 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcc-cccCCCccccccccCccccccc
Confidence 57899999999999999999999 89999 999999 79999999999999998 899999998844 55555544
Q ss_pred hH
Q psy14939 833 TN 834 (971)
Q Consensus 833 ~~ 834 (971)
..
T Consensus 367 ~~ 368 (467)
T COG5048 367 LN 368 (467)
T ss_pred cC
Confidence 44
No 62
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.10 E-value=0.048 Score=34.17 Aligned_cols=22 Identities=36% Similarity=0.937 Sum_probs=13.9
Q ss_pred cccCCCCCccCChhHHHhhHhh
Q psy14939 225 FPCENCTKVFSDPSNLQRHIRT 246 (971)
Q Consensus 225 ~~C~~C~~~f~~~~~L~~H~~~ 246 (971)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666653
No 63
>KOG4173|consensus
Probab=92.38 E-value=0.048 Score=51.35 Aligned_cols=87 Identities=23% Similarity=0.552 Sum_probs=71.2
Q ss_pred CCCCCCcccCC--CCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhhhhhhhccC---------CCCCccC--C
Q psy14939 757 KTTKDRYACKY--CGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHN---------KEKPFKC--P 823 (971)
Q Consensus 757 ~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~---------~~~~~~C--~ 823 (971)
+.....|.|.+ |..+|....+...|..+-++. .|.+|.|.|.+...|..|+.-.|- |...|.| .
T Consensus 74 ~~~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvE 150 (253)
T KOG4173|consen 74 KPRVPAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVE 150 (253)
T ss_pred ccccccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHH
Confidence 34456788988 889999999888998654433 699999999999999999876663 5668999 6
Q ss_pred CCCcccCChhHHHHHHhhhcCCC
Q psy14939 824 LCDRCFGQQTNLDRHLKKHEADD 846 (971)
Q Consensus 824 ~C~~~f~~~~~l~~H~~~~~~~~ 846 (971)
.|+..|......+.||-+.|...
T Consensus 151 gCt~KFkT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 151 GCTEKFKTSRDRKDHMIRMHKYP 173 (253)
T ss_pred hhhhhhhhhhhhhhHHHHhccCC
Confidence 79999999999999988776544
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.22 E-value=0.13 Score=31.42 Aligned_cols=21 Identities=38% Similarity=0.895 Sum_probs=13.0
Q ss_pred ccCCCCCcccCChhHHHHHHhh
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLKK 841 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~~ 841 (971)
..|+.|++.| ..+.|.+|++.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4566666666 66666666553
No 65
>KOG2893|consensus
Probab=91.77 E-value=0.05 Score=52.31 Aligned_cols=48 Identities=29% Similarity=0.402 Sum_probs=38.6
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhc
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHH 248 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh 248 (971)
-+|.+|++.|.....|++|++..| |+|.+|.+.+.+--.|..|....|
T Consensus 11 pwcwycnrefddekiliqhqkakh---fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 11 PWCWYCNREFDDEKILIQHQKAKH---FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred ceeeecccccchhhhhhhhhhhcc---ceeeeehhhhccCCCceeehhhhh
Confidence 579999999999999999987654 889999888877777777765444
No 66
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=89.43 E-value=0.26 Score=30.14 Aligned_cols=22 Identities=32% Similarity=0.887 Sum_probs=14.8
Q ss_pred eeccccccccCCHHHHHHhhhhc
Q psy14939 311 IKCVKCDQSFSTVTSLSKHKRFC 333 (971)
Q Consensus 311 ~~C~~C~~~f~~~~~l~~H~~~~ 333 (971)
..|+.|++.| ....|.+|+++|
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~C 24 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKIC 24 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHhc
Confidence 4577777777 566677776654
No 67
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=89.21 E-value=0.53 Score=41.68 Aligned_cols=73 Identities=16% Similarity=0.413 Sum_probs=46.1
Q ss_pred CceeccccccccCChHHHHHHHHH-hcCCcceeccccccccCCHHHHHHHHHhhc-------------CC---------C
Q psy14939 30 DRYACKYCGKVFPRSANLTRHLRT-HTGEQPYKCKYCERSFSISSNLQRHVRNIH-------------NK---------E 86 (971)
Q Consensus 30 ~~~~C~~C~k~F~~~~~L~~H~~~-H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H-------------~~---------~ 86 (971)
+-..|..|...... +.+..|++. |...+... ...|..-++..- .. .
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~----------~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~ 78 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQE----------RQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVY 78 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhcccccHHH----------HHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCC
Confidence 45689999987776 779999995 43221110 111211111100 00 1
Q ss_pred CCeec----CCCCcccCchhhHHHHHhhhhc
Q psy14939 87 KPFKC----PLCDRCFGQQTNLDRHLKKHEA 113 (971)
Q Consensus 87 k~~~C----~~C~~~f~~~~~L~~H~~~h~~ 113 (971)
.-|.| ..|++.+.+...|.+|++.+|+
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 12899 9999999999999999998874
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=87.52 E-value=0.43 Score=31.99 Aligned_cols=23 Identities=26% Similarity=0.676 Sum_probs=15.4
Q ss_pred CcccCCCCCccCChhhHHHHHHh
Q psy14939 762 RYACKYCGKVFPRSANLTRHLRT 784 (971)
Q Consensus 762 ~~~C~~C~~~f~~~~~l~~H~~~ 784 (971)
+|.|++|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46677777777766666666653
No 69
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.99 E-value=0.47 Score=31.81 Aligned_cols=22 Identities=27% Similarity=0.723 Sum_probs=14.0
Q ss_pred ceeccccccccCChHHHHHHHH
Q psy14939 31 RYACKYCGKVFPRSANLTRHLR 52 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~ 52 (971)
+|.|.+|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4566666666666666666654
No 70
>KOG4173|consensus
Probab=86.29 E-value=0.48 Score=44.90 Aligned_cols=82 Identities=23% Similarity=0.577 Sum_probs=67.6
Q ss_pred CCceeccc--cccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcC---------CCCCeec--CCCC
Q psy14939 29 KDRYACKY--CGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHN---------KEKPFKC--PLCD 95 (971)
Q Consensus 29 ~~~~~C~~--C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~---------~~k~~~C--~~C~ 95 (971)
...|.|.+ |..+|.....+..|...-++ -.|..|.+.|.+...|..|+...|. |..-|.| .-|+
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 35688987 88999999999999876444 3799999999999999999986663 4667899 5699
Q ss_pred cccCchhhHHHHHhhhhc
Q psy14939 96 RCFGQQTNLDRHLKKHEA 113 (971)
Q Consensus 96 ~~f~~~~~L~~H~~~h~~ 113 (971)
..|.+...-+.||-..|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 999999999999866553
No 71
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=84.21 E-value=0.63 Score=30.73 Aligned_cols=24 Identities=29% Similarity=0.973 Sum_probs=13.8
Q ss_pred cccCCCCCccCChhhHHHHHHhhcCCCccccCCCCC
Q psy14939 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCER 798 (971)
Q Consensus 763 ~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~ 798 (971)
|.|..||..+.-.. .|+.|++|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 56777776653221 5666666664
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=83.62 E-value=0.49 Score=34.29 Aligned_cols=31 Identities=26% Similarity=0.627 Sum_probs=21.8
Q ss_pred cCCCCCccCCCCCcccCChhHHHHHHhhhcC
Q psy14939 814 HNKEKPFKCPLCDRCFGQQTNLDRHLKKHEA 844 (971)
Q Consensus 814 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 844 (971)
--||--+.||-|++.|.+...+.+|....|.
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 3466677777777777777777777766543
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=82.10 E-value=0.69 Score=33.54 Aligned_cols=28 Identities=29% Similarity=0.617 Sum_probs=21.3
Q ss_pred CCCceeccccccccCChHHHHHHHHHhc
Q psy14939 28 TKDRYACKYCGKVFPRSANLTRHLRTHT 55 (971)
Q Consensus 28 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~ 55 (971)
|+--++|+-|++.|..+..+.+|...-+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4556788888888888888888876543
No 74
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=81.90 E-value=0.77 Score=30.33 Aligned_cols=10 Identities=40% Similarity=1.095 Sum_probs=5.4
Q ss_pred Ccccccccch
Q psy14939 280 KPFRCEVCHK 289 (971)
Q Consensus 280 ~~~~C~~C~~ 289 (971)
.++.|++|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4455666653
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=81.35 E-value=5.6 Score=35.12 Aligned_cols=25 Identities=20% Similarity=0.579 Sum_probs=16.4
Q ss_pred CCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 87 KPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 87 k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
+...|..|...... +.+..|++..|
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H 34 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRH 34 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhc
Confidence 44567777776665 66777777443
No 76
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=78.88 E-value=1.2 Score=38.93 Aligned_cols=30 Identities=27% Similarity=0.631 Sum_probs=24.5
Q ss_pred eeccccccccCChHHHHHHHHHhcCCcceeccccccccCCH
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSIS 72 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~ 72 (971)
..|+.||+.|.... ..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYDLn-----------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDLN-----------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccCC-----------CCCccCCCCCCccCcc
Confidence 57999999998742 4688999999999755
No 77
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.05 E-value=0.99 Score=45.37 Aligned_cols=44 Identities=20% Similarity=0.512 Sum_probs=26.5
Q ss_pred CccccCCCCCccccchhhhhhhhhcc---------CCCCC-----ccCCCCCcccCCh
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIH---------NKEKP-----FKCPLCDRCFGQQ 832 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h---------~~~~~-----~~C~~C~~~f~~~ 832 (971)
|.+.|++|++.|.++.-+.+..|... .+..| ..||.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 45566667666666655555554222 12333 5799999888744
No 78
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=75.91 E-value=1.1 Score=42.01 Aligned_cols=15 Identities=40% Similarity=0.744 Sum_probs=9.2
Q ss_pred ceeccccccccCChH
Q psy14939 31 RYACKYCGKVFPRSA 45 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~ 45 (971)
.++|+.||++|.+..
T Consensus 28 ~~~c~~c~~~f~~~e 42 (154)
T PRK00464 28 RRECLACGKRFTTFE 42 (154)
T ss_pred eeeccccCCcceEeE
Confidence 466666666666543
No 79
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.52 E-value=2.3 Score=37.05 Aligned_cols=30 Identities=23% Similarity=0.668 Sum_probs=18.4
Q ss_pred cccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCH
Q psy14939 254 HACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQF 294 (971)
Q Consensus 254 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~ 294 (971)
..|+.||..|..- +..|..|++||..|...
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 4677777766532 12567777777777544
No 80
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=71.72 E-value=1.8 Score=26.69 Aligned_cols=9 Identities=33% Similarity=0.807 Sum_probs=4.4
Q ss_pred ecccccccc
Q psy14939 61 KCKYCERSF 69 (971)
Q Consensus 61 ~C~~C~~~f 69 (971)
.|++||+.|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 455555444
No 81
>KOG2186|consensus
Probab=70.72 E-value=2.5 Score=41.97 Aligned_cols=45 Identities=18% Similarity=0.637 Sum_probs=20.9
Q ss_pred cccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhhhhh
Q psy14939 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHV 810 (971)
Q Consensus 763 ~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~ 810 (971)
|.|..||...+. -.|.+|+-..++ .-|.|--|++.|.+ ...+.|.
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhh
Confidence 455555554432 233445544444 34455445555554 4444443
No 82
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=70.30 E-value=5.4 Score=41.38 Aligned_cols=29 Identities=24% Similarity=0.721 Sum_probs=24.4
Q ss_pred ccCCCCCccCCCCC-cccCChhHHHHHHhh
Q psy14939 813 IHNKEKPFKCPLCD-RCFGQQTNLDRHLKK 841 (971)
Q Consensus 813 ~h~~~~~~~C~~C~-~~f~~~~~l~~H~~~ 841 (971)
.|.-.+-|.|.+|| +.+.-.+.+.+|..-
T Consensus 368 lhgLd~ef~CEICgNyvy~GR~~FdrHF~E 397 (470)
T COG5188 368 LHGLDIEFECEICGNYVYYGRDRFDRHFEE 397 (470)
T ss_pred hcCCCcceeeeecccccccchHHHHhhhhh
Confidence 46667789999999 899999999999654
No 83
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=70.30 E-value=3.1 Score=28.49 Aligned_cols=34 Identities=18% Similarity=0.506 Sum_probs=17.7
Q ss_pred cccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccC
Q psy14939 791 YKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 830 (971)
Q Consensus 791 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 830 (971)
+.|+.|+..|.-..+... .......|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~~------~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG------ANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHcC------CCCCEEECCCCCCEEE
Confidence 456666666655543321 1122466777766653
No 84
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=70.11 E-value=3.4 Score=29.52 Aligned_cols=24 Identities=58% Similarity=1.085 Sum_probs=14.0
Q ss_pred CCceeccccccccCCh----HHHHHHHH
Q psy14939 29 KDRYACKYCGKVFPRS----ANLTRHLR 52 (971)
Q Consensus 29 ~~~~~C~~C~k~F~~~----~~L~~H~~ 52 (971)
+...+|.+|++.+... +.|.+|++
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHH
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhh
Confidence 4567888888887763 66777773
No 85
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=68.99 E-value=2.8 Score=27.89 Aligned_cols=25 Identities=32% Similarity=0.820 Sum_probs=14.5
Q ss_pred CcccCCCCCccCChhhHHHHHHhhcCCCccccCCCCC
Q psy14939 762 RYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCER 798 (971)
Q Consensus 762 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~ 798 (971)
.|.|..||..+... +.|..|++|+.
T Consensus 2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred eEECCCCCCEeECC------------cCCCcCcCCCC
Confidence 36677777654211 24667777764
No 86
>KOG4377|consensus
Probab=68.41 E-value=7.7 Score=41.41 Aligned_cols=29 Identities=24% Similarity=0.658 Sum_probs=23.1
Q ss_pred eeccc--cccccCCHHHHHHhhhhcCCCCCC
Q psy14939 311 IKCVK--CDQSFSTVTSLSKHKRFCDSTTPG 339 (971)
Q Consensus 311 ~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~ 339 (971)
|.|.. |+..+.+.+++..|.|.|....++
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkheRqeqg 432 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKHERQEQG 432 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhhhhhhhc
Confidence 44554 889999999999999998876544
No 87
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=67.55 E-value=4.1 Score=36.01 Aligned_cols=14 Identities=21% Similarity=0.377 Sum_probs=8.4
Q ss_pred CccCCCCCcccCCh
Q psy14939 819 PFKCPLCDRCFGQQ 832 (971)
Q Consensus 819 ~~~C~~C~~~f~~~ 832 (971)
|-.|++||..|...
T Consensus 26 p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 26 PAVSPYTGEQFPPE 39 (129)
T ss_pred CccCCCcCCccCcc
Confidence 66666666665444
No 88
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=67.10 E-value=5 Score=28.67 Aligned_cols=27 Identities=44% Similarity=0.888 Sum_probs=14.2
Q ss_pred CcceeccccccccCCH----HHHHHHHHhhc
Q psy14939 57 EQPYKCKYCERSFSIS----SNLQRHVRNIH 83 (971)
Q Consensus 57 ~~~~~C~~C~~~f~~~----~~L~~H~~~~H 83 (971)
.....|.+|++.+... .+|..|++..|
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 4456777777766653 66777775343
No 89
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=66.99 E-value=5.7 Score=41.24 Aligned_cols=23 Identities=35% Similarity=0.688 Sum_probs=20.3
Q ss_pred CceeccccccccCChHHHHHHHH
Q psy14939 30 DRYACKYCGKVFPRSANLTRHLR 52 (971)
Q Consensus 30 ~~~~C~~C~k~F~~~~~L~~H~~ 52 (971)
+.+-|.+|++.|.+..-+..|+.
T Consensus 237 ~~~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 237 PKVYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred cceeeHhhhhHhhhhHHHHHHHh
Confidence 45789999999999999999975
No 90
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=66.45 E-value=4.8 Score=27.21 Aligned_cols=33 Identities=21% Similarity=0.640 Sum_probs=21.6
Q ss_pred cccCCCCCccccchhhhhhhhhccCCCCCccCCCCCccc
Q psy14939 791 YKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCF 829 (971)
Q Consensus 791 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f 829 (971)
..|+.|+..|.-.+.... .+.+..+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip------~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIP------PKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCC------CCCcEEECCCCCCEe
Confidence 357778777777765422 334467788887776
No 91
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=66.33 E-value=4.5 Score=34.22 Aligned_cols=26 Identities=23% Similarity=0.318 Sum_probs=17.1
Q ss_pred eccccccccCChHHHHHHHHHhcCCcceecccccccc
Q psy14939 33 ACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSF 69 (971)
Q Consensus 33 ~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f 69 (971)
.|+.|++.|... +..|..|++||+.|
T Consensus 11 idPetg~KFYDL-----------NrdPiVsPytG~s~ 36 (129)
T COG4530 11 IDPETGKKFYDL-----------NRDPIVSPYTGKSY 36 (129)
T ss_pred cCccccchhhcc-----------CCCccccCcccccc
Confidence 577777777653 24567777777777
No 92
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=64.83 E-value=5.1 Score=36.35 Aligned_cols=28 Identities=54% Similarity=0.848 Sum_probs=16.0
Q ss_pred CCCceeccccccccCChHHHHHHHHHhcCCc
Q psy14939 28 TKDRYACKYCGKVFPRSANLTRHLRTHTGEQ 58 (971)
Q Consensus 28 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~ 58 (971)
.++...|-.|||.|+. |++|++.|+|-.
T Consensus 69 ~~d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 69 TPDYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp -SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred ccCeeEEccCCcccch---HHHHHHHccCCC
Confidence 3455678888888886 588888887653
No 93
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=64.48 E-value=2.3 Score=42.78 Aligned_cols=12 Identities=25% Similarity=0.509 Sum_probs=7.7
Q ss_pred ccccccchhcCC
Q psy14939 282 FRCEVCHKAYTQ 293 (971)
Q Consensus 282 ~~C~~C~~~~~~ 293 (971)
+.|+.||+++..
T Consensus 49 ~vCP~CgyA~~~ 60 (214)
T PF09986_consen 49 WVCPHCGYAAFE 60 (214)
T ss_pred EECCCCCCcccc
Confidence 567777766653
No 94
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=63.02 E-value=3.7 Score=37.24 Aligned_cols=29 Identities=55% Similarity=0.917 Sum_probs=18.6
Q ss_pred CCCCcccCCCCCccCChhhHHHHHHhhcCCCc
Q psy14939 759 TKDRYACKYCGKVFPRSANLTRHLRTHTGEQP 790 (971)
Q Consensus 759 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p 790 (971)
..+.-.|-.|||.| ..|++|++.|.|--|
T Consensus 69 ~~d~i~clecGk~~---k~LkrHL~~~~gltp 97 (132)
T PF05443_consen 69 TPDYIICLECGKKF---KTLKRHLRTHHGLTP 97 (132)
T ss_dssp -SS-EE-TBT--EE---SBHHHHHHHTT-S-H
T ss_pred ccCeeEEccCCccc---chHHHHHHHccCCCH
Confidence 44567899999999 677999999987654
No 95
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=62.79 E-value=4.5 Score=28.74 Aligned_cols=11 Identities=36% Similarity=0.909 Sum_probs=6.0
Q ss_pred eeccccccccC
Q psy14939 32 YACKYCGKVFP 42 (971)
Q Consensus 32 ~~C~~C~k~F~ 42 (971)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 55555555554
No 96
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=62.74 E-value=6.3 Score=34.90 Aligned_cols=30 Identities=13% Similarity=0.275 Sum_probs=18.7
Q ss_pred cccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCH
Q psy14939 254 HACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQF 294 (971)
Q Consensus 254 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~ 294 (971)
..|+.||+.|..- +..|..|++||..|...
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 4677777776532 22567777777776554
No 97
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=62.50 E-value=4.3 Score=40.32 Aligned_cols=96 Identities=21% Similarity=0.454 Sum_probs=56.7
Q ss_pred hcCCCCcccCCCCCccCChhHHHhhHhhh-ccCCcccccCcCCCccCChHHHhhCccccCCCCcccccccchhcCCHHHH
Q psy14939 219 QVHGRKFPCENCTKVFSDPSNLQRHIRTH-HVGARQHACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKAYTQFSNL 297 (971)
Q Consensus 219 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h-h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~~~l 297 (971)
.|.++.|+|.+|...+-.-..| .|.... -.....|+|.-|++. -.|.|..|.-.|-.
T Consensus 137 ~hGGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfCd---- 194 (314)
T PF06524_consen 137 DHGGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFCD---- 194 (314)
T ss_pred cCCCeEEEeecCCCeeeccchh-hhhhhhhhhhcccccccccccc-----------------cchhhhheeeeehh----
Confidence 3566788999987544333333 344322 123456888888763 13667666666532
Q ss_pred HHHHhhhc---CCCCceeccccccccCCHHHHHHhhhhcCCCC
Q psy14939 298 CRHKRMHA---NCRMQIKCVKCDQSFSTVTSLSKHKRFCDSTT 337 (971)
Q Consensus 298 ~~H~~~h~---~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 337 (971)
.|++... ....++.|+.|++.......|..-.|.|.=.+
T Consensus 195 -dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hkyGR 236 (314)
T PF06524_consen 195 -DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHKYGR 236 (314)
T ss_pred -hhhhhcccccccCCCCCCCCCCCcccccccceeeeecchhcc
Confidence 3444321 12357889999988877777766666654333
No 98
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=61.69 E-value=4.3 Score=29.40 Aligned_cols=28 Identities=25% Similarity=0.761 Sum_probs=15.6
Q ss_pred ceeccccccccCChHHHHHHHHHhcCCcceeccccccc
Q psy14939 31 RYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERS 68 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~ 68 (971)
.|+|..||+.|... .......|++||..
T Consensus 6 ~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 6 EYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 46777777776211 12334667777654
No 99
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=61.63 E-value=6 Score=29.08 Aligned_cols=25 Identities=64% Similarity=1.169 Sum_probs=19.2
Q ss_pred CcccCCCCCccCCh-----hhHHHHHH-hhc
Q psy14939 762 RYACKYCGKVFPRS-----ANLTRHLR-THT 786 (971)
Q Consensus 762 ~~~C~~C~~~f~~~-----~~l~~H~~-~h~ 786 (971)
.-.|.+|++.+... ++|.+|++ .|.
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 45799999999765 58888887 554
No 100
>KOG4167|consensus
Probab=61.10 E-value=10 Score=43.62 Aligned_cols=29 Identities=31% Similarity=0.708 Sum_probs=25.8
Q ss_pred CCCCCcccCCCCCccCChhhHHHHHHhhc
Q psy14939 758 TTKDRYACKYCGKVFPRSANLTRHLRTHT 786 (971)
Q Consensus 758 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 786 (971)
...+.|.|..|+|+|.....++.||++|-
T Consensus 788 ~~~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 788 DPTGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CCCceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 34678999999999999999999999994
No 101
>KOG2186|consensus
Probab=61.09 E-value=5.9 Score=39.43 Aligned_cols=49 Identities=22% Similarity=0.564 Sum_probs=30.1
Q ss_pred eeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 60 YKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
|.|+.||-... +..|.+|+-..+. .-|.|-.|+..|.. ..+..|.+--.
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn--~~fSCIDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN--AYFSCIDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC--CeeEEeeccccccc-chhhhhhhhcc
Confidence 66777776654 3455557664333 45777777777776 56666765443
No 102
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=60.95 E-value=6.5 Score=26.75 Aligned_cols=33 Identities=21% Similarity=0.736 Sum_probs=19.5
Q ss_pred ccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccC
Q psy14939 792 KCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 830 (971)
Q Consensus 792 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 830 (971)
.|+.|+..|.-.++. +-.+.+..+|+.|+-.|.
T Consensus 4 ~CP~C~~~f~v~~~~------l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDK------LPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred ECCCCCceEEcCHHH------cccCCcEEECCCCCcEee
Confidence 577777777665542 123334677777776663
No 103
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=60.82 E-value=4.7 Score=32.87 Aligned_cols=31 Identities=29% Similarity=0.645 Sum_probs=17.0
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccC
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 830 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 830 (971)
.+|.|+.|++. .+.| +-+| -|.|..|+..|.
T Consensus 34 ~~~~Cp~C~~~--------~VkR-~a~G--IW~C~kCg~~fA 64 (89)
T COG1997 34 AKHVCPFCGRT--------TVKR-IATG--IWKCRKCGAKFA 64 (89)
T ss_pred cCCcCCCCCCc--------ceee-eccC--eEEcCCCCCeec
Confidence 45666666653 2223 2222 577777777665
No 104
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=60.71 E-value=6.2 Score=29.01 Aligned_cols=22 Identities=68% Similarity=1.235 Sum_probs=13.4
Q ss_pred ceeccccccccCCh-----HHHHHHHH
Q psy14939 31 RYACKYCGKVFPRS-----ANLTRHLR 52 (971)
Q Consensus 31 ~~~C~~C~k~F~~~-----~~L~~H~~ 52 (971)
.-.|.+|++.+... +.|.+|++
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 45677777776543 45666655
No 105
>PHA00626 hypothetical protein
Probab=60.45 E-value=4.8 Score=29.63 Aligned_cols=13 Identities=15% Similarity=0.322 Sum_probs=9.9
Q ss_pred CccCCCCCcccCC
Q psy14939 819 PFKCPLCDRCFGQ 831 (971)
Q Consensus 819 ~~~C~~C~~~f~~ 831 (971)
.|+|+.|+..|+.
T Consensus 23 rYkCkdCGY~ft~ 35 (59)
T PHA00626 23 DYVCCDCGYNDSK 35 (59)
T ss_pred ceEcCCCCCeech
Confidence 6888888887753
No 106
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=60.37 E-value=7.6 Score=29.24 Aligned_cols=11 Identities=27% Similarity=0.579 Sum_probs=7.5
Q ss_pred Cceeccccccc
Q psy14939 309 MQIKCVKCDQS 319 (971)
Q Consensus 309 ~~~~C~~C~~~ 319 (971)
..|+|+.||+.
T Consensus 47 ~~Y~CP~CGF~ 57 (59)
T PRK14890 47 NPYTCPKCGFE 57 (59)
T ss_pred CceECCCCCCc
Confidence 36778877763
No 107
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=60.26 E-value=5.6 Score=37.60 Aligned_cols=14 Identities=43% Similarity=1.044 Sum_probs=8.6
Q ss_pred ccCCCCCccCCCCC
Q psy14939 813 IHNKEKPFKCPLCD 826 (971)
Q Consensus 813 ~h~~~~~~~C~~C~ 826 (971)
+|-++-|-+||+|+
T Consensus 143 ~~~ge~P~~CPiCg 156 (166)
T COG1592 143 THEGEAPEVCPICG 156 (166)
T ss_pred cccCCCCCcCCCCC
Confidence 34556666666666
No 108
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=59.24 E-value=5.7 Score=37.57 Aligned_cols=23 Identities=35% Similarity=0.946 Sum_probs=19.4
Q ss_pred ceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCC
Q psy14939 59 PYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCD 95 (971)
Q Consensus 59 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~ 95 (971)
-|.|++||+. |-++.|-+||+|+
T Consensus 134 ~~vC~vCGy~--------------~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYT--------------HEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCc--------------ccCCCCCcCCCCC
Confidence 5999999984 4458899999998
No 109
>KOG4167|consensus
Probab=58.59 E-value=2.1 Score=48.93 Aligned_cols=25 Identities=28% Similarity=0.616 Sum_probs=22.9
Q ss_pred CccCCCCCcccCChhHHHHHHhhhc
Q psy14939 819 PFKCPLCDRCFGQQTNLDRHLKKHE 843 (971)
Q Consensus 819 ~~~C~~C~~~f~~~~~l~~H~~~~~ 843 (971)
-|.|..|+|.|....++.-|||+|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 5899999999999999999999995
No 110
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=58.18 E-value=7.8 Score=29.07 Aligned_cols=10 Identities=40% Similarity=1.384 Sum_probs=6.1
Q ss_pred CCeecCCCCc
Q psy14939 87 KPFKCPLCDR 96 (971)
Q Consensus 87 k~~~C~~C~~ 96 (971)
.+|+|+.||+
T Consensus 49 ~~Y~Cp~CGF 58 (61)
T COG2888 49 NPYRCPKCGF 58 (61)
T ss_pred CceECCCcCc
Confidence 4566666664
No 111
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=54.02 E-value=3.6 Score=38.55 Aligned_cols=17 Identities=18% Similarity=0.513 Sum_probs=9.4
Q ss_pred ccCCCCCcccCChhHHH
Q psy14939 820 FKCPLCDRCFGQQTNLD 836 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~ 836 (971)
++|+-||++|.....+.
T Consensus 29 ~~c~~c~~~f~~~e~~~ 45 (154)
T PRK00464 29 RECLACGKRFTTFERVE 45 (154)
T ss_pred eeccccCCcceEeEecc
Confidence 55666666665444443
No 112
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=53.95 E-value=10 Score=36.09 Aligned_cols=38 Identities=18% Similarity=0.439 Sum_probs=28.4
Q ss_pred hhcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 784 THTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 784 ~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
...+..-|.|+.|+..|+.-.++. .-|.|+.||.....
T Consensus 103 ~e~~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~L~~ 140 (158)
T TIGR00373 103 FETNNMFFICPNMCVRFTFNEAME----------LNFTCPRCGAMLDY 140 (158)
T ss_pred hccCCCeEECCCCCcEeeHHHHHH----------cCCcCCCCCCEeee
Confidence 344556788999999988888874 16999999976543
No 113
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=53.80 E-value=8.2 Score=34.29 Aligned_cols=27 Identities=48% Similarity=0.705 Sum_probs=23.0
Q ss_pred CCcccCCCCCccCChhhHHHHHHhhcCCCc
Q psy14939 761 DRYACKYCGKVFPRSANLTRHLRTHTGEQP 790 (971)
Q Consensus 761 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p 790 (971)
.-..|-.|||.| ..|+||+.+|.|--|
T Consensus 75 D~IicLEDGkkf---KSLKRHL~t~~gmTP 101 (148)
T COG4957 75 DYIICLEDGKKF---KSLKRHLTTHYGLTP 101 (148)
T ss_pred CeEEEeccCcch---HHHHHHHhcccCCCH
Confidence 445899999999 899999999987655
No 114
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.13 E-value=11 Score=35.36 Aligned_cols=39 Identities=23% Similarity=0.611 Sum_probs=24.5
Q ss_pred CCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccC
Q psy14939 787 GEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 830 (971)
Q Consensus 787 ~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 830 (971)
+..-|.|+.|++.|.....+.- .+. ...|.|+.||....
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~----~d~-~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL----LDM-DGTFTCPRCGEELE 134 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh----cCC-CCcEECCCCCCEEE
Confidence 4456888888888876554432 111 23488888887654
No 115
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=51.32 E-value=26 Score=30.71 Aligned_cols=79 Identities=18% Similarity=0.326 Sum_probs=51.4
Q ss_pred CcccCCCCCccCChhhHHHHHHhhcCCCcc------------ccCCCCCccccchhhhhhhhhccCCCCCccCCCCCccc
Q psy14939 762 RYACKYCGKVFPRSANLTRHLRTHTGEQPY------------KCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCF 829 (971)
Q Consensus 762 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~------------~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f 829 (971)
+-.|.+||-.....-+|.|-..-=---++| .|--|.+.|........- .-.....|+|+.|...|
T Consensus 15 P~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~---~~~~~~~y~C~~C~~~F 91 (112)
T TIGR00622 15 PVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD---ELKDSHRYVCAVCKNVF 91 (112)
T ss_pred CCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccccc---ccccccceeCCCCCCcc
Confidence 456899999888888888753211111222 388899988765422211 01223479999999999
Q ss_pred CChhHHHHHHhhhc
Q psy14939 830 GQQTNLDRHLKKHE 843 (971)
Q Consensus 830 ~~~~~l~~H~~~~~ 843 (971)
=-.-....|...|.
T Consensus 92 C~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 92 CVDCDVFVHESLHC 105 (112)
T ss_pred ccccchhhhhhccC
Confidence 88888888887774
No 116
>PF12907 zf-met2: Zinc-binding
Probab=50.77 E-value=9.7 Score=26.36 Aligned_cols=32 Identities=31% Similarity=0.641 Sum_probs=19.8
Q ss_pred eecccccccc---CCHHHHHHHHHhhcCCCCCeec
Q psy14939 60 YKCKYCERSF---SISSNLQRHVRNIHNKEKPFKC 91 (971)
Q Consensus 60 ~~C~~C~~~f---~~~~~L~~H~~~~H~~~k~~~C 91 (971)
+.|.+|-..| .....|..|..+.|.+..+-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 5677777433 3556677777777776544444
No 117
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=50.35 E-value=14 Score=34.58 Aligned_cols=40 Identities=25% Similarity=0.616 Sum_probs=25.7
Q ss_pred cCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccC
Q psy14939 55 TGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 99 (971)
Q Consensus 55 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~ 99 (971)
....-|.|+.|+..|.....+.. .+. ...|.|+.|+....
T Consensus 95 ~~~~~Y~Cp~C~~~y~~~ea~~~----~d~-~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 95 TNNAYYKCPNCQSKYTFLEANQL----LDM-DGTFTCPRCGEELE 134 (147)
T ss_pred cCCcEEECcCCCCEeeHHHHHHh----cCC-CCcEECCCCCCEEE
Confidence 34456999999988875443332 122 33499999987654
No 118
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=50.17 E-value=6.6 Score=25.69 Aligned_cols=11 Identities=36% Similarity=0.911 Sum_probs=5.0
Q ss_pred eeccccccccC
Q psy14939 32 YACKYCGKVFP 42 (971)
Q Consensus 32 ~~C~~C~k~F~ 42 (971)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 34555555444
No 119
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=50.15 E-value=8.5 Score=27.70 Aligned_cols=10 Identities=30% Similarity=0.929 Sum_probs=4.7
Q ss_pred CccCCCCCcc
Q psy14939 819 PFKCPLCDRC 828 (971)
Q Consensus 819 ~~~C~~C~~~ 828 (971)
.+.|+.||..
T Consensus 21 ~~~Cp~CG~~ 30 (46)
T PRK00398 21 GVRCPYCGYR 30 (46)
T ss_pred ceECCCCCCe
Confidence 3445555543
No 120
>PF15269 zf-C2H2_7: Zinc-finger
Probab=50.08 E-value=11 Score=26.18 Aligned_cols=22 Identities=27% Similarity=0.557 Sum_probs=17.4
Q ss_pred eecCCCCcccCchhhHHHHHhh
Q psy14939 89 FKCPLCDRCFGQQTNLDRHLKK 110 (971)
Q Consensus 89 ~~C~~C~~~f~~~~~L~~H~~~ 110 (971)
|+|-.|..+...++.|..||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 6788888888888888888764
No 121
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=50.01 E-value=13 Score=33.17 Aligned_cols=24 Identities=46% Similarity=0.605 Sum_probs=17.1
Q ss_pred ceeccccccccCChHHHHHHHHHhcCC
Q psy14939 31 RYACKYCGKVFPRSANLTRHLRTHTGE 57 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~~H~~~ 57 (971)
-..|-.|||.|++ |++|+.+|.+-
T Consensus 76 ~IicLEDGkkfKS---LKRHL~t~~gm 99 (148)
T COG4957 76 YIICLEDGKKFKS---LKRHLTTHYGL 99 (148)
T ss_pred eEEEeccCcchHH---HHHHHhcccCC
Confidence 3468888887774 77787777664
No 122
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=49.84 E-value=3.8 Score=30.41 Aligned_cols=11 Identities=36% Similarity=1.156 Sum_probs=5.5
Q ss_pred eeccccccccC
Q psy14939 60 YKCKYCERSFS 70 (971)
Q Consensus 60 ~~C~~C~~~f~ 70 (971)
|.|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 45555555443
No 123
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=48.80 E-value=18 Score=34.33 Aligned_cols=36 Identities=19% Similarity=0.499 Sum_probs=27.6
Q ss_pred hcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccC
Q psy14939 54 HTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 99 (971)
Q Consensus 54 H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~ 99 (971)
.....-|.|+.|+.+|+.-..+. .-|.|+.||....
T Consensus 104 e~~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~L~ 139 (158)
T TIGR00373 104 ETNNMFFICPNMCVRFTFNEAME----------LNFTCPRCGAMLD 139 (158)
T ss_pred ccCCCeEECCCCCcEeeHHHHHH----------cCCcCCCCCCEee
Confidence 34456799999999998888775 2599999987643
No 124
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=48.18 E-value=17 Score=22.46 Aligned_cols=20 Identities=30% Similarity=0.760 Sum_probs=13.6
Q ss_pred ccCCCCCcccCChhHHHHHHh
Q psy14939 820 FKCPLCDRCFGQQTNLDRHLK 840 (971)
Q Consensus 820 ~~C~~C~~~f~~~~~l~~H~~ 840 (971)
..||+|++.+ ....+.+|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3577777777 5566777765
No 125
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=47.78 E-value=15 Score=35.68 Aligned_cols=36 Identities=19% Similarity=0.638 Sum_probs=26.8
Q ss_pred cCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 786 TGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 786 ~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
....-|.|+.|++.|+.-.++. .-|.|+.||.....
T Consensus 113 ~~~~~Y~Cp~C~~rytf~eA~~----------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 113 ENNMFFFCPNCHIRFTFDEAME----------YGFRCPQCGEMLEE 148 (178)
T ss_pred cCCCEEECCCCCcEEeHHHHhh----------cCCcCCCCCCCCee
Confidence 3445688999998888887763 26999999976654
No 126
>KOG4124|consensus
Probab=47.20 E-value=4.8 Score=41.76 Aligned_cols=71 Identities=20% Similarity=0.456 Sum_probs=40.9
Q ss_pred CCCCCcccCC--CCCccCChhhHHHHHHh-hcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCChhH
Q psy14939 758 TTKDRYACKY--CGKVFPRSANLTRHLRT-HTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQQTN 834 (971)
Q Consensus 758 ~~~~~~~C~~--C~~~f~~~~~l~~H~~~-h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~ 834 (971)
...++|.|.+ |.+.++....|+.|... |... + .. ..+ .-+-|.- .-...|+|+|++|.++.+....
T Consensus 345 ~~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~------i-~~--~s~-~~~ph~~-~~~~nk~~r~~i~~~~~k~~~~ 413 (442)
T KOG4124|consen 345 VVDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSP------I-TT--PTP-APIPHQG-FVVENKPYRCEVCSKRYKNLNG 413 (442)
T ss_pred EecCCCCCCCCcchhhcccCcceeeccccCcCCC------C-CC--CCC-CCCCcce-eeeccCcccChhhhhhhccCCC
Confidence 3456788865 88888888888877653 2110 0 00 000 1111222 2234679999999988876666
Q ss_pred HHHHH
Q psy14939 835 LDRHL 839 (971)
Q Consensus 835 l~~H~ 839 (971)
|+-|.
T Consensus 414 l~~~~ 418 (442)
T KOG4124|consen 414 LKYHR 418 (442)
T ss_pred CCcee
Confidence 55553
No 127
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=47.09 E-value=4.2 Score=28.28 Aligned_cols=11 Identities=45% Similarity=1.280 Sum_probs=5.8
Q ss_pred eeccccccccC
Q psy14939 32 YACKYCGKVFP 42 (971)
Q Consensus 32 ~~C~~C~k~F~ 42 (971)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 45555555554
No 128
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=46.74 E-value=13 Score=28.40 Aligned_cols=40 Identities=28% Similarity=0.747 Sum_probs=21.5
Q ss_pred Cceeccc--cccccCChHHHHHHHHHhcCCcceeccc----cccccC
Q psy14939 30 DRYACKY--CGKVFPRSANLTRHLRTHTGEQPYKCKY----CERSFS 70 (971)
Q Consensus 30 ~~~~C~~--C~k~F~~~~~L~~H~~~H~~~~~~~C~~----C~~~f~ 70 (971)
.+..|+. |...+.. ..|..|+...-..++..|.+ |...+.
T Consensus 8 ~~v~C~~~cc~~~i~r-~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIPR-KELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEEC-CCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCcccceeH-HHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 3456766 5554553 45777777555566677777 766653
No 129
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=46.55 E-value=29 Score=30.43 Aligned_cols=83 Identities=17% Similarity=0.270 Sum_probs=54.2
Q ss_pred CCCCceeccccccccCChHHHHHHHHHhcCCc------------ceeccccccccCCHHHHHHHHHhhcCCCCCeecCCC
Q psy14939 27 TTKDRYACKYCGKVFPRSANLTRHLRTHTGEQ------------PYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLC 94 (971)
Q Consensus 27 ~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~------------~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C 94 (971)
.-+=|-.|+.|+-......+|.+-..-=-.-+ .-.|--|...|........- .-.....|.|+.|
T Consensus 11 vC~LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~---~~~~~~~y~C~~C 87 (112)
T TIGR00622 11 VCELPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD---ELKDSHRYVCAVC 87 (112)
T ss_pred ccCCCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccccc---ccccccceeCCCC
Confidence 34456789999988888887766532111111 12488899988765321100 0122457999999
Q ss_pred CcccCchhhHHHHHhhhh
Q psy14939 95 DRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 95 ~~~f~~~~~L~~H~~~h~ 112 (971)
...|=.--++..|...|.
T Consensus 88 ~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 88 KNVFCVDCDVFVHESLHC 105 (112)
T ss_pred CCccccccchhhhhhccC
Confidence 999998888888988774
No 130
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.26 E-value=13 Score=31.67 Aligned_cols=11 Identities=18% Similarity=0.670 Sum_probs=6.5
Q ss_pred CccCCCCCccc
Q psy14939 819 PFKCPLCDRCF 829 (971)
Q Consensus 819 ~~~C~~C~~~f 829 (971)
|..||+||+.|
T Consensus 26 PiVsPytG~s~ 36 (129)
T COG4530 26 PIVSPYTGKSY 36 (129)
T ss_pred ccccCcccccc
Confidence 55566666655
No 131
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=43.26 E-value=6.6 Score=31.31 Aligned_cols=40 Identities=18% Similarity=0.408 Sum_probs=19.7
Q ss_pred eeccccccccCCHHHHHHHHHhhcCCCCCeecC--CCCcccCchh
Q psy14939 60 YKCKYCERSFSISSNLQRHVRNIHNKEKPFKCP--LCDRCFGQQT 102 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~--~C~~~f~~~~ 102 (971)
+.|+.|+.......+-..+. -..+.-+.|. .|+.+|....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~---~~~~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITD---TTKERYHQCQNVNCSATFITYE 43 (72)
T ss_pred ccCCCCCCccEEEEChhcCh---hhheeeeecCCCCCCCEEEEEE
Confidence 45666666542222211111 1234556776 7777776544
No 132
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=43.26 E-value=14 Score=28.28 Aligned_cols=42 Identities=21% Similarity=0.595 Sum_probs=18.1
Q ss_pred CCccccCC--CCCccccchhhhhhhhhccCCCCCccCCC----CCcccCC
Q psy14939 788 EQPYKCKY--CERSFSISSNLQRHVRNIHNKEKPFKCPL----CDRCFGQ 831 (971)
Q Consensus 788 ~~p~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~----C~~~f~~ 831 (971)
..+..|+. |...+. +..|..|.. .--..++..|++ |+..+.+
T Consensus 7 ~~~v~C~~~cc~~~i~-r~~l~~H~~-~~C~~~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 7 FRPVPCPNGCCNEMIP-RKELDDHLE-NECPKRPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp TSEEE-TT--S-BEEE-CCCHHHHHH-TTSTTSEEE-SS----S--EEEH
T ss_pred CCEeeCCCCCccccee-HHHHHHHHH-ccCCCCcEECCCCCCCCCCccch
Confidence 34455555 333333 445566655 234445556666 6655543
No 133
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=42.52 E-value=17 Score=43.48 Aligned_cols=26 Identities=27% Similarity=0.599 Sum_probs=14.8
Q ss_pred CCcccccCcCCCccCChHHHhhCccccCCCCcccccccchh
Q psy14939 250 GARQHACQECGKTFATSSGLKQHTHIHSSVKPFRCEVCHKA 290 (971)
Q Consensus 250 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 290 (971)
..+...|.+||+. ...|..|+.|+..
T Consensus 459 ~~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 459 ATGQLRCHYCGYQ---------------EPIPQSCPECGSE 484 (730)
T ss_pred CCCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence 3345667666642 2345667777644
No 134
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=42.24 E-value=11 Score=37.44 Aligned_cols=25 Identities=28% Similarity=0.789 Sum_probs=11.7
Q ss_pred CCceeccccccccCChHHHHHHHHH
Q psy14939 29 KDRYACKYCGKVFPRSANLTRHLRT 53 (971)
Q Consensus 29 ~~~~~C~~C~k~F~~~~~L~~H~~~ 53 (971)
+..|.|+.|+|.|+-..-.+.|+..
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~n 99 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFN 99 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhh
Confidence 3445555555555555555555544
No 135
>PF15269 zf-C2H2_7: Zinc-finger
Probab=41.60 E-value=19 Score=25.09 Aligned_cols=21 Identities=33% Similarity=0.630 Sum_probs=16.3
Q ss_pred eeccccccccCCHHHHHHHHH
Q psy14939 60 YKCKYCERSFSISSNLQRHVR 80 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L~~H~~ 80 (971)
|+|-.|......++.|-.||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 677777777777788887776
No 136
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=41.49 E-value=24 Score=34.28 Aligned_cols=35 Identities=20% Similarity=0.657 Sum_probs=26.6
Q ss_pred CCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcccCc
Q psy14939 56 GEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 100 (971)
Q Consensus 56 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f~~ 100 (971)
...-|.|+.|+.+|+.-..+. .-|.|+.||.....
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~----------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME----------YGFRCPQCGEMLEE 148 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh----------cCCcCCCCCCCCee
Confidence 345699999999998777653 25999999876543
No 137
>PRK04023 DNA polymerase II large subunit; Validated
Probab=39.79 E-value=22 Score=43.22 Aligned_cols=17 Identities=18% Similarity=0.727 Sum_probs=11.5
Q ss_pred CCCCCCcccCCCCCccC
Q psy14939 757 KTTKDRYACKYCGKVFP 773 (971)
Q Consensus 757 ~~~~~~~~C~~C~~~f~ 773 (971)
.-.+..|+|..||..|.
T Consensus 1032 aFsrQ~fRC~kC~~kYR 1048 (1121)
T PRK04023 1032 AFSRQEFRCTKCGAKYR 1048 (1121)
T ss_pred hhcccceeecccCcccc
Confidence 34455678888887773
No 138
>KOG4124|consensus
Probab=37.98 E-value=8.5 Score=40.05 Aligned_cols=25 Identities=24% Similarity=0.728 Sum_probs=21.2
Q ss_pred CCCccccCCCCCccccchhhhhhhh
Q psy14939 787 GEQPYKCKYCERSFSISSNLQRHVR 811 (971)
Q Consensus 787 ~~~p~~C~~C~~~f~~~~~l~~H~~ 811 (971)
..|||.|++|.++++.-..|+-|..
T Consensus 395 ~nk~~r~~i~~~~~k~~~~l~~~~~ 419 (442)
T KOG4124|consen 395 ENKPYRCEVCSKRYKNLNGLKYHRT 419 (442)
T ss_pred ccCcccChhhhhhhccCCCCCceee
Confidence 3589999999999998888888854
No 139
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=37.96 E-value=23 Score=35.27 Aligned_cols=28 Identities=25% Similarity=0.616 Sum_probs=18.3
Q ss_pred CcceeccccccccCCHHHHHHHHHhhcC
Q psy14939 57 EQPYKCKYCERSFSISSNLQRHVRNIHN 84 (971)
Q Consensus 57 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 84 (971)
+..|.|.+|+|.|.-..-...|+...|.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 3457788888888877777778776665
No 140
>KOG1280|consensus
Probab=37.92 E-value=11 Score=39.56 Aligned_cols=25 Identities=16% Similarity=0.242 Sum_probs=15.8
Q ss_pred CcccccccchhcCCHHHHHHHHhhh
Q psy14939 280 KPFRCEVCHKAYTQFSNLCRHKRMH 304 (971)
Q Consensus 280 ~~~~C~~C~~~~~~~~~l~~H~~~h 304 (971)
.-|.|++|+..=.+...|..|....
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~ 102 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQ 102 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhc
Confidence 3567777776666666666776443
No 141
>PF14353 CpXC: CpXC protein
Probab=37.85 E-value=4.7 Score=36.86 Aligned_cols=20 Identities=25% Similarity=0.660 Sum_probs=12.4
Q ss_pred ceeccccccccCCHHHHHHH
Q psy14939 59 PYKCKYCERSFSISSNLQRH 78 (971)
Q Consensus 59 ~~~C~~C~~~f~~~~~L~~H 78 (971)
.|.|+.||..|.-...+..|
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EEECCCCCCceecCCCEEEE
Confidence 47777777777554444444
No 142
>KOG2071|consensus
Probab=37.56 E-value=27 Score=39.84 Aligned_cols=42 Identities=29% Similarity=0.671 Sum_probs=0.0
Q ss_pred CCCCcccCCCCCccCChhhHHHHHHhh-----------------------------------------------------
Q psy14939 759 TKDRYACKYCGKVFPRSANLTRHLRTH----------------------------------------------------- 785 (971)
Q Consensus 759 ~~~~~~C~~C~~~f~~~~~l~~H~~~h----------------------------------------------------- 785 (971)
...+..|..||..|........||..|
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~dwh~r~n~~~r~s~~~~~sR~Wf~s~s~W~~~K~~e~t~e~~~~~~~~~~~~ 494 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHDDWHRRKNTTIRGSSKWQKSRSWFPSKSGWLAAKAGEETDEKEKVEHEELQIK 494 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhhhhhhhhhhhcccccccccccceeechhhhccccccccccccccccchhhcch
Q ss_pred --------------cCCCccccCCCCCcc
Q psy14939 786 --------------TGEQPYKCKYCERSF 800 (971)
Q Consensus 786 --------------~~~~p~~C~~C~~~f 800 (971)
++++++.|.+|+-.|
T Consensus 495 ~~~s~~~k~~~Vp~d~e~~~~C~IC~EkF 523 (579)
T KOG2071|consen 495 KELSLRSKYELVPADSERQASCPICQEKF 523 (579)
T ss_pred hhhhhhccceecccCcccccCCccccccc
No 143
>PF14353 CpXC: CpXC protein
Probab=37.42 E-value=13 Score=33.98 Aligned_cols=17 Identities=29% Similarity=0.722 Sum_probs=8.3
Q ss_pred ccccCcCCCccCChHHH
Q psy14939 253 QHACQECGKTFATSSGL 269 (971)
Q Consensus 253 ~~~C~~C~~~f~~~~~l 269 (971)
.|.|+.||..|.-...+
T Consensus 38 ~~~CP~Cg~~~~~~~p~ 54 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPL 54 (128)
T ss_pred EEECCCCCCceecCCCE
Confidence 45555555555443333
No 144
>KOG2593|consensus
Probab=36.74 E-value=22 Score=38.77 Aligned_cols=40 Identities=25% Similarity=0.567 Sum_probs=29.0
Q ss_pred hcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCcc
Q psy14939 54 HTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRC 97 (971)
Q Consensus 54 H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~ 97 (971)
-+....|.|+.|.+.|.....|+. .-...-.|.|..|+-.
T Consensus 123 ~t~~~~Y~Cp~C~kkyt~Lea~~L----~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 123 DTNVAGYVCPNCQKKYTSLEALQL----LDNETGEFHCENCGGE 162 (436)
T ss_pred ccccccccCCccccchhhhHHHHh----hcccCceEEEecCCCc
Confidence 355567999999999987776653 2333456999999654
No 145
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=36.33 E-value=9.7 Score=25.75 Aligned_cols=10 Identities=30% Similarity=1.172 Sum_probs=6.0
Q ss_pred cccCCCCCcc
Q psy14939 763 YACKYCGKVF 772 (971)
Q Consensus 763 ~~C~~C~~~f 772 (971)
..|..||++|
T Consensus 2 r~C~~Cg~~Y 11 (36)
T PF05191_consen 2 RICPKCGRIY 11 (36)
T ss_dssp EEETTTTEEE
T ss_pred cCcCCCCCcc
Confidence 3466666666
No 146
>KOG4377|consensus
Probab=35.75 E-value=22 Score=38.19 Aligned_cols=106 Identities=26% Similarity=0.448 Sum_probs=65.6
Q ss_pred ccc--CccchhccChhHHHhhHhhcCCC-------------Cccc--CCCCCccCChhHHHhhHhhhcc----CC--ccc
Q psy14939 198 YRC--ESCSQTFCWRPHLNFHQAQVHGR-------------KFPC--ENCTKVFSDPSNLQRHIRTHHV----GA--RQH 254 (971)
Q Consensus 198 ~~C--~~C~~~f~~~~~L~~H~~~~~~~-------------~~~C--~~C~~~f~~~~~L~~H~~~hh~----~~--~~~ 254 (971)
|.| +.|...+..+..+..|...|..+ -|.| ..|.+ +-+....|-.-|.. |- -.|
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrthf 348 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHF 348 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceeccee
Confidence 667 46888888888888888776542 2667 46887 45556666655521 11 237
Q ss_pred ccCcCC--CccCChHHHhhCccccCCCC------------------------cccccc--cchhcCCHHHHHHHHhhhcC
Q psy14939 255 ACQECG--KTFATSSGLKQHTHIHSSVK------------------------PFRCEV--CHKAYTQFSNLCRHKRMHAN 306 (971)
Q Consensus 255 ~C~~C~--~~f~~~~~l~~H~~~h~~~~------------------------~~~C~~--C~~~~~~~~~l~~H~~~h~~ 306 (971)
.|.-|+ .+|+....-..|++-+.++. .|.|.+ |+..+...+.+..|.+.|..
T Consensus 349 hC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 349 HCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred EEeccCCccccccccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhhhhhh
Confidence 888877 44543333333333333221 245643 88999999999999887754
No 147
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=35.43 E-value=5.2 Score=33.55 Aligned_cols=11 Identities=27% Similarity=0.564 Sum_probs=5.9
Q ss_pred eecCCCCcccC
Q psy14939 89 FKCPLCDRCFG 99 (971)
Q Consensus 89 ~~C~~C~~~f~ 99 (971)
..|..|+.+|.
T Consensus 47 ~~Cg~CGls~e 57 (104)
T COG4888 47 AVCGNCGLSFE 57 (104)
T ss_pred EEcccCcceEE
Confidence 44555555554
No 148
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=35.03 E-value=25 Score=29.39 Aligned_cols=32 Identities=22% Similarity=0.621 Sum_probs=21.3
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
..|.|++|++.- ++| .-.| .|.|..|++.|.-
T Consensus 34 a~y~CpfCgk~~-----vkR----~a~G--IW~C~~C~~~~AG 65 (91)
T TIGR00280 34 AKYVCPFCGKKT-----VKR----GSTG--IWTCRKCGAKFAG 65 (91)
T ss_pred cCccCCCCCCCc-----eEE----EeeE--EEEcCCCCCEEeC
Confidence 568999998531 111 2233 7899999998863
No 149
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=34.30 E-value=29 Score=28.98 Aligned_cols=32 Identities=19% Similarity=0.611 Sum_probs=21.3
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
..|.|++|++. .++| .-. -.|.|..|++.|.-
T Consensus 35 a~y~CpfCgk~-----~vkR----~a~--GIW~C~~C~~~~AG 66 (90)
T PTZ00255 35 AKYFCPFCGKH-----AVKR----QAV--GIWRCKGCKKTVAG 66 (90)
T ss_pred CCccCCCCCCC-----ceee----eee--EEEEcCCCCCEEeC
Confidence 57899999853 1222 222 37899999998864
No 150
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=33.99 E-value=20 Score=25.25 Aligned_cols=10 Identities=50% Similarity=1.318 Sum_probs=4.6
Q ss_pred cccCcCCCcc
Q psy14939 254 HACQECGKTF 263 (971)
Q Consensus 254 ~~C~~C~~~f 263 (971)
|.|..||..|
T Consensus 6 y~C~~Cg~~f 15 (42)
T PF09723_consen 6 YRCEECGHEF 15 (42)
T ss_pred EEeCCCCCEE
Confidence 4444444444
No 151
>KOG2593|consensus
Probab=33.33 E-value=31 Score=37.71 Aligned_cols=37 Identities=27% Similarity=0.645 Sum_probs=24.1
Q ss_pred cCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCC
Q psy14939 786 TGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCD 826 (971)
Q Consensus 786 ~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~ 826 (971)
+...-|.|+.|.+.|+.-..|+- +-...-.|.|..|+
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~L----~~~~~~~F~C~~C~ 160 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQL----LDNETGEFHCENCG 160 (436)
T ss_pred cccccccCCccccchhhhHHHHh----hcccCceEEEecCC
Confidence 44556888888888877776633 22223468888877
No 152
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=33.17 E-value=1.1e+02 Score=28.71 Aligned_cols=57 Identities=26% Similarity=0.466 Sum_probs=33.8
Q ss_pred CCCcccCCC-CCccCChhhHHHHHHhhcCCCccccCC--CCCccccchhhhhhhhhccCCCCC
Q psy14939 760 KDRYACKYC-GKVFPRSANLTRHLRTHTGEQPYKCKY--CERSFSISSNLQRHVRNIHNKEKP 819 (971)
Q Consensus 760 ~~~~~C~~C-~~~f~~~~~l~~H~~~h~~~~p~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~ 819 (971)
.....|++| |.++.+..- .-.|.+...|+-.|.. |...- +-..|++|.|..|...+|
T Consensus 78 ~~~L~CPLCRG~V~GWtvv--e~AR~~LN~K~RsC~~e~C~F~G-tY~eLrKHar~~HP~~rP 137 (162)
T PF07800_consen 78 QPELACPLCRGEVKGWTVV--EPARRFLNAKKRSCSQESCSFSG-TYSELRKHARSEHPSARP 137 (162)
T ss_pred cccccCccccCceeceEEc--hHHHHHhccCCccCccccccccc-CHHHHHHHHHhhCCCCCC
Confidence 345678888 565543332 2256666777777765 65432 234577777766766555
No 153
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=33.07 E-value=20 Score=29.93 Aligned_cols=31 Identities=26% Similarity=0.826 Sum_probs=21.5
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccC
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFG 830 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~ 830 (971)
..|.|++|++.- ++| .-.| -|.|..|++.|.
T Consensus 34 ~ky~Cp~Cgk~~-----vkR----~a~G--IW~C~~C~~~~A 64 (90)
T PF01780_consen 34 AKYTCPFCGKTS-----VKR----VATG--IWKCKKCGKKFA 64 (90)
T ss_dssp S-BEESSSSSSE-----EEE----EETT--EEEETTTTEEEE
T ss_pred CCCcCCCCCCce-----eEE----eeeE--EeecCCCCCEEe
Confidence 679999999642 222 3444 599999999886
No 154
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=32.77 E-value=14 Score=29.50 Aligned_cols=21 Identities=24% Similarity=0.491 Sum_probs=12.1
Q ss_pred CCCCCccCC--CCCcccCChhHH
Q psy14939 815 NKEKPFKCP--LCDRCFGQQTNL 835 (971)
Q Consensus 815 ~~~~~~~C~--~C~~~f~~~~~l 835 (971)
..++-+.|. .||.+|.....+
T Consensus 23 ~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 23 TKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred hheeeeecCCCCCCCEEEEEEEE
Confidence 344556675 677777654443
No 155
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=32.37 E-value=16 Score=21.84 Aligned_cols=7 Identities=43% Similarity=1.203 Sum_probs=2.9
Q ss_pred ccccccc
Q psy14939 34 CKYCGKV 40 (971)
Q Consensus 34 C~~C~k~ 40 (971)
|+.||..
T Consensus 2 Cp~CG~~ 8 (23)
T PF13240_consen 2 CPNCGAE 8 (23)
T ss_pred CcccCCC
Confidence 4444433
No 156
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=32.11 E-value=25 Score=32.05 Aligned_cols=12 Identities=50% Similarity=1.204 Sum_probs=7.6
Q ss_pred ceeccccccccC
Q psy14939 59 PYKCKYCERSFS 70 (971)
Q Consensus 59 ~~~C~~C~~~f~ 70 (971)
.|.|..|++.|.
T Consensus 53 RyrC~~C~~tf~ 64 (129)
T COG3677 53 RYKCKSCGSTFT 64 (129)
T ss_pred ccccCCcCccee
Confidence 366666666664
No 157
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=31.17 E-value=23 Score=26.02 Aligned_cols=11 Identities=27% Similarity=0.857 Sum_probs=5.3
Q ss_pred cccCcCCCccC
Q psy14939 254 HACQECGKTFA 264 (971)
Q Consensus 254 ~~C~~C~~~f~ 264 (971)
|+|..||+.+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 44555554444
No 158
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=31.07 E-value=11 Score=43.78 Aligned_cols=56 Identities=27% Similarity=0.698 Sum_probs=41.0
Q ss_pred ccCCCCCccCChhhHHHHHHhhcCCCccc-cCCCCCccccchhhhhhhhhccCCCCCccCCCCCc
Q psy14939 764 ACKYCGKVFPRSANLTRHLRTHTGEQPYK-CKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDR 827 (971)
Q Consensus 764 ~C~~C~~~f~~~~~l~~H~~~h~~~~p~~-C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~ 827 (971)
.|..||-.|+-...|--- |-+|.-+.|. |+.|.+-|....+-+-|.. |..|+.||-
T Consensus 125 ~CT~CGPRfTIi~alPYD-R~nTsM~~F~lC~~C~~EY~dP~nRRfHAQ-------p~aCp~CGP 181 (750)
T COG0068 125 NCTNCGPRFTIIEALPYD-RENTSMADFPLCPFCDKEYKDPLNRRFHAQ-------PIACPKCGP 181 (750)
T ss_pred ccCCCCcceeeeccCCCC-cccCccccCcCCHHHHHHhcCccccccccc-------cccCcccCC
Confidence 699999999877776544 4566666664 8999987776666555543 899999994
No 159
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=30.91 E-value=31 Score=28.82 Aligned_cols=32 Identities=22% Similarity=0.543 Sum_probs=21.1
Q ss_pred CccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcccCC
Q psy14939 789 QPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCFGQ 831 (971)
Q Consensus 789 ~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~ 831 (971)
..|.|++|++.- ++ | .-.| .|.|..|++.|.-
T Consensus 35 a~y~CpfCgk~~-----vk---R-~a~G--IW~C~~C~~~~AG 66 (90)
T PRK03976 35 AKHVCPVCGRPK-----VK---R-VGTG--IWECRKCGAKFAG 66 (90)
T ss_pred cCccCCCCCCCc-----eE---E-EEEE--EEEcCCCCCEEeC
Confidence 568999997531 11 1 2233 7899999998864
No 160
>KOG1701|consensus
Probab=30.23 E-value=11 Score=40.79 Aligned_cols=10 Identities=30% Similarity=1.042 Sum_probs=5.8
Q ss_pred cccCCCCCcc
Q psy14939 225 FPCENCTKVF 234 (971)
Q Consensus 225 ~~C~~C~~~f 234 (971)
|.|-.|.+..
T Consensus 361 F~Cv~C~r~l 370 (468)
T KOG1701|consen 361 FTCVVCARCL 370 (468)
T ss_pred eEEEEecccc
Confidence 5666665543
No 161
>KOG3408|consensus
Probab=29.88 E-value=31 Score=30.23 Aligned_cols=26 Identities=23% Similarity=0.567 Sum_probs=20.7
Q ss_pred CCCceeccccccccCChHHHHHHHHH
Q psy14939 28 TKDRYACKYCGKVFPRSANLTRHLRT 53 (971)
Q Consensus 28 ~~~~~~C~~C~k~F~~~~~L~~H~~~ 53 (971)
|-..|.|-.|.+.|.+...|..|.+.
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 45678888888888888888888764
No 162
>PRK04023 DNA polymerase II large subunit; Validated
Probab=29.78 E-value=45 Score=40.75 Aligned_cols=14 Identities=21% Similarity=0.766 Sum_probs=11.0
Q ss_pred CccccCCCCCcccc
Q psy14939 789 QPYKCKYCERSFSI 802 (971)
Q Consensus 789 ~p~~C~~C~~~f~~ 802 (971)
.-|.|..|+..|.|
T Consensus 1036 Q~fRC~kC~~kYRR 1049 (1121)
T PRK04023 1036 QEFRCTKCGAKYRR 1049 (1121)
T ss_pred cceeecccCccccc
Confidence 45899999988864
No 163
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.31 E-value=35 Score=25.70 Aligned_cols=40 Identities=20% Similarity=0.423 Sum_probs=24.2
Q ss_pred eeccccccc-cCChHHHHHHHHHhcCCcceeccccccccCC
Q psy14939 32 YACKYCGKV-FPRSANLTRHLRTHTGEQPYKCKYCERSFSI 71 (971)
Q Consensus 32 ~~C~~C~k~-F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~ 71 (971)
.+|-.|++. |.....|..-...-...+.|.|+.|..+...
T Consensus 3 vkCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rva~ 43 (68)
T COG4896 3 VKCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRVAI 43 (68)
T ss_pred ceEEEecceeeecchhHHHHHhhCCCceeEechhhHhhhch
Confidence 467777765 4455555554444455566888888765543
No 164
>COG1645 Uncharacterized Zn-finger containing protein [General function prediction only]
Probab=29.02 E-value=23 Score=31.94 Aligned_cols=27 Identities=30% Similarity=0.559 Sum_probs=15.7
Q ss_pred eeccccccccCChHHHHHHHHHhcCCcceeccccccccC
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFS 70 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~ 70 (971)
+-|+.||.-... ++|+ ..|++|++.+.
T Consensus 29 ~hCp~Cg~PLF~----------KdG~--v~CPvC~~~~~ 55 (131)
T COG1645 29 KHCPKCGTPLFR----------KDGE--VFCPVCGYREV 55 (131)
T ss_pred hhCcccCCccee----------eCCe--EECCCCCceEE
Confidence 458888864322 1233 67888886553
No 165
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=29.00 E-value=36 Score=31.94 Aligned_cols=46 Identities=17% Similarity=0.335 Sum_probs=23.7
Q ss_pred eeccccccccCChHHHHHHHHH-----hcCCcceeccccccccCCHHHHHH
Q psy14939 32 YACKYCGKVFPRSANLTRHLRT-----HTGEQPYKCKYCERSFSISSNLQR 77 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~-----H~~~~~~~C~~C~~~f~~~~~L~~ 77 (971)
-.|..||..+.....-..--+. -..+.-+.|+.|++.|..-++..+
T Consensus 92 sRC~~CN~~L~~v~~~~v~~~vp~~v~~~~~~f~~C~~C~kiyW~GsH~~~ 142 (147)
T PF01927_consen 92 SRCPKCNGPLRPVSKEEVKDRVPPYVYETYDEFWRCPGCGKIYWEGSHWRR 142 (147)
T ss_pred CccCCCCcEeeechhhccccccCccccccCCeEEECCCCCCEecccccHHH
Confidence 3688888765543211110000 011235788888888876655543
No 166
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=28.27 E-value=48 Score=31.22 Aligned_cols=16 Identities=25% Similarity=0.702 Sum_probs=9.0
Q ss_pred eeccccccccCCHHHH
Q psy14939 60 YKCKYCERSFSISSNL 75 (971)
Q Consensus 60 ~~C~~C~~~f~~~~~L 75 (971)
|.|+.|++.|.--++.
T Consensus 131 ~~C~~CgkiYW~GsHw 146 (165)
T COG1656 131 YRCPKCGKIYWKGSHW 146 (165)
T ss_pred eECCCCcccccCchHH
Confidence 5566666666544433
No 167
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.77 E-value=13 Score=36.47 Aligned_cols=18 Identities=22% Similarity=0.556 Sum_probs=12.0
Q ss_pred CCceeccccccccCChHH
Q psy14939 29 KDRYACKYCGKVFPRSAN 46 (971)
Q Consensus 29 ~~~~~C~~C~k~F~~~~~ 46 (971)
++.+.|++|+..|....-
T Consensus 17 kk~ieCPvC~tkFkkeev 34 (267)
T COG1655 17 KKTIECPVCNTKFKKEEV 34 (267)
T ss_pred hceeccCcccchhhhhhe
Confidence 456777777777776543
No 168
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=25.91 E-value=24 Score=36.21 Aligned_cols=27 Identities=22% Similarity=0.459 Sum_probs=22.9
Q ss_pred CCCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 86 EKPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 86 ~k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
...|.|..|...|-.--+...|-..|.
T Consensus 386 s~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 386 SGRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred ccceechhhhhhhhhhhHHHHHHHHhh
Confidence 345999999999999888899988875
No 169
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=25.72 E-value=23 Score=25.42 Aligned_cols=27 Identities=30% Similarity=0.897 Sum_probs=16.6
Q ss_pred eeccccccccCChHHHHHHHHHhcCCcceecccccc
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCER 67 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~ 67 (971)
+.|+.||.. ....|.. ...|.|..|.+
T Consensus 19 ~~CP~Cg~~--~~~~~~~-------~~~~~C~~C~~ 45 (46)
T PF12760_consen 19 FVCPHCGST--KHYRLKT-------RGRYRCKACRK 45 (46)
T ss_pred CCCCCCCCe--eeEEeCC-------CCeEECCCCCC
Confidence 779999964 1111111 55688988875
No 170
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=25.46 E-value=47 Score=39.90 Aligned_cols=29 Identities=28% Similarity=0.738 Sum_probs=18.8
Q ss_pred hhcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCCCCcc
Q psy14939 784 THTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRC 828 (971)
Q Consensus 784 ~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~ 828 (971)
.|...+-..|.+|| |....|+.|+.||..
T Consensus 456 ~H~~~~~L~CH~Cg----------------~~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 456 LHKATGQLRCHYCG----------------YQEPIPQSCPECGSE 484 (730)
T ss_pred EecCCCeeEeCCCC----------------CCCCCCCCCCCCCCC
Confidence 34444556777776 224558999999965
No 171
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=25.37 E-value=67 Score=22.48 Aligned_cols=24 Identities=21% Similarity=0.555 Sum_probs=18.9
Q ss_pred cccCCCCCccC--ChhhHHHHHHhhc
Q psy14939 763 YACKYCGKVFP--RSANLTRHLRTHT 786 (971)
Q Consensus 763 ~~C~~C~~~f~--~~~~l~~H~~~h~ 786 (971)
-.|..||..|. ...+-+.|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 47999999885 4567888888874
No 172
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=24.53 E-value=31 Score=22.24 Aligned_cols=7 Identities=29% Similarity=0.819 Sum_probs=3.1
Q ss_pred eeccccc
Q psy14939 60 YKCKYCE 66 (971)
Q Consensus 60 ~~C~~C~ 66 (971)
|.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 4444444
No 173
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=24.05 E-value=16 Score=22.99 Aligned_cols=23 Identities=26% Similarity=0.530 Sum_probs=11.9
Q ss_pred eeccccccccCChHHHHHHHHHhc
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTHT 55 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H~ 55 (971)
|.|-.|++.|. ....+.|...-+
T Consensus 1 ~sCiDC~~~F~-~~~y~~Ht~CIt 23 (28)
T PF08790_consen 1 FSCIDCSKDFD-GDSYKSHTSCIT 23 (28)
T ss_dssp EEETTTTEEEE-GGGTTT-----S
T ss_pred CeeecCCCCcC-cCCcCCCCcccC
Confidence 46778888884 455666655444
No 174
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=23.81 E-value=51 Score=41.42 Aligned_cols=15 Identities=20% Similarity=0.815 Sum_probs=8.9
Q ss_pred CCCCCcccCCCCCcc
Q psy14939 758 TTKDRYACKYCGKVF 772 (971)
Q Consensus 758 ~~~~~~~C~~C~~~f 772 (971)
-.+..|+|..||..|
T Consensus 1249 FsrQ~~RC~kC~~ky 1263 (1337)
T PRK14714 1249 FSRQEFRCLKCGTKY 1263 (1337)
T ss_pred hhccceeecccCccc
Confidence 344556666666666
No 175
>KOG2807|consensus
Probab=23.58 E-value=1.1e+02 Score=32.08 Aligned_cols=68 Identities=22% Similarity=0.471 Sum_probs=50.1
Q ss_pred CCceeccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcC------------------------
Q psy14939 29 KDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHN------------------------ 84 (971)
Q Consensus 29 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~------------------------ 84 (971)
...|.|+.|....- .-|..|++|+-..-...+|.+-++ |-
T Consensus 274 ~~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARSyh--hL~PL~~F~Eip~~~~~~~~~Cf~C~ 337 (378)
T KOG2807|consen 274 GGGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARSYH--HLFPLKPFVEIPETEYNGSRFCFACQ 337 (378)
T ss_pred cCceeCCcccCeee--------------cCCccCCccceeEecchHHHHHHH--hhcCCcchhhccccccCCCcceeeec
Confidence 35688888875443 246789999988888888877553 21
Q ss_pred ----CCCCeecCCCCcccCchhhHHHHHhhhh
Q psy14939 85 ----KEKPFKCPLCDRCFGQQTNLDRHLKKHE 112 (971)
Q Consensus 85 ----~~k~~~C~~C~~~f~~~~~L~~H~~~h~ 112 (971)
....|.|..|...|-.--+...|-..|.
T Consensus 338 ~~~~~~~~y~C~~Ck~~FCldCDv~iHesLh~ 369 (378)
T KOG2807|consen 338 GELLSSGRYRCESCKNVFCLDCDVFIHESLHN 369 (378)
T ss_pred cccCCCCcEEchhccceeeccchHHHHhhhhc
Confidence 1234899999999988888888888875
No 176
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=23.47 E-value=57 Score=24.78 Aligned_cols=44 Identities=27% Similarity=0.541 Sum_probs=24.9
Q ss_pred CCcccCCCCCccCChhhHHHHHHhhcCCCccccCCCCCccccchhhhhhhhhccCCCCCccCCC
Q psy14939 761 DRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPL 824 (971)
Q Consensus 761 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~ 824 (971)
-.++|++..+.| +.|++...|+..|.+..-+.- + ...+..+||+
T Consensus 10 ~~~~CPiT~~~~---------------~~PV~s~~C~H~fek~aI~~~-i----~~~~~~~CPv 53 (57)
T PF11789_consen 10 ISLKCPITLQPF---------------EDPVKSKKCGHTFEKEAILQY-I----QRNGSKRCPV 53 (57)
T ss_dssp --SB-TTTSSB----------------SSEEEESSS--EEEHHHHHHH-C----TTTS-EE-SC
T ss_pred eccCCCCcCChh---------------hCCcCcCCCCCeecHHHHHHH-H----HhcCCCCCCC
Confidence 457788888777 578888888888876655533 2 2345788887
No 177
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.45 E-value=25 Score=32.36 Aligned_cols=15 Identities=20% Similarity=0.645 Sum_probs=11.4
Q ss_pred CceeccccccccCCh
Q psy14939 30 DRYACKYCGKVFPRS 44 (971)
Q Consensus 30 ~~~~C~~C~k~F~~~ 44 (971)
..+.|..||..|...
T Consensus 69 ~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 69 AVLKCRNCGNEWSLK 83 (135)
T ss_pred eEEECCCCCCEEecc
Confidence 568888888887653
No 178
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.86 E-value=31 Score=33.97 Aligned_cols=10 Identities=30% Similarity=0.653 Sum_probs=7.1
Q ss_pred ccccccchhc
Q psy14939 282 FRCEVCHKAY 291 (971)
Q Consensus 282 ~~C~~C~~~~ 291 (971)
+.|+.|.++.
T Consensus 63 vvCP~C~yAa 72 (267)
T COG1655 63 VVCPICYYAA 72 (267)
T ss_pred EEcchhhHHH
Confidence 5788888653
No 179
>PRK14873 primosome assembly protein PriA; Provisional
Probab=22.73 E-value=53 Score=39.41 Aligned_cols=46 Identities=24% Similarity=0.658 Sum_probs=0.0
Q ss_pred cccCccchhccChhHHHhhHhhcCCCCcccCCCCCccCChhHHHhhHhhhccCCcccccCcCCCccCChHHHhhCccccC
Q psy14939 198 YRCESCSQTFCWRPHLNFHQAQVHGRKFPCENCTKVFSDPSNLQRHIRTHHVGARQHACQECGKTFATSSGLKQHTHIHS 277 (971)
Q Consensus 198 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~hh~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 277 (971)
..|..|+ ....|+.|+-.... |...+...|.+||+..
T Consensus 384 l~C~~Cg------------------~~~~C~~C~~~L~~-----------h~~~~~l~Ch~CG~~~-------------- 420 (665)
T PRK14873 384 LACARCR------------------TPARCRHCTGPLGL-----------PSAGGTPRCRWCGRAA-------------- 420 (665)
T ss_pred eEhhhCc------------------CeeECCCCCCceeE-----------ecCCCeeECCCCcCCC--------------
Q ss_pred CCCcccccccc
Q psy14939 278 SVKPFRCEVCH 288 (971)
Q Consensus 278 ~~~~~~C~~C~ 288 (971)
.++.|+.|+
T Consensus 421 --~p~~Cp~Cg 429 (665)
T PRK14873 421 --PDWRCPRCG 429 (665)
T ss_pred --cCccCCCCc
No 180
>KOG1280|consensus
Probab=22.37 E-value=72 Score=33.70 Aligned_cols=29 Identities=17% Similarity=0.450 Sum_probs=19.5
Q ss_pred CeecCCCCcccCchhhHHHHHhhhhccCC
Q psy14939 88 PFKCPLCDRCFGQQTNLDRHLKKHEADDG 116 (971)
Q Consensus 88 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 116 (971)
-|.|+.|+..-.+...|..|....|.+..
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~ 107 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEAS 107 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccC
Confidence 47777777766666677777766665554
No 181
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=21.88 E-value=17 Score=25.27 Aligned_cols=12 Identities=25% Similarity=0.747 Sum_probs=5.1
Q ss_pred CCCCccCCCCCc
Q psy14939 816 KEKPFKCPLCDR 827 (971)
Q Consensus 816 ~~~~~~C~~C~~ 827 (971)
+.+.|.|.+|+.
T Consensus 21 ~~~~w~C~~C~~ 32 (40)
T PF04810_consen 21 GGKTWICNFCGT 32 (40)
T ss_dssp TTTEEEETTT--
T ss_pred CCCEEECcCCCC
Confidence 334555555554
No 182
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=21.84 E-value=56 Score=22.14 Aligned_cols=16 Identities=38% Similarity=0.827 Sum_probs=7.4
Q ss_pred CCCCCCCCCceecccc
Q psy14939 22 GSSSKTTKDRYACKYC 37 (971)
Q Consensus 22 ~~~~~~~~~~~~C~~C 37 (971)
+.....|..+|.|..|
T Consensus 20 ~G~~~~G~qryrC~~C 35 (36)
T PF03811_consen 20 NGKSPSGHQRYRCKDC 35 (36)
T ss_pred CCCCCCCCEeEecCcC
Confidence 3333344455555554
No 183
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=21.24 E-value=57 Score=35.98 Aligned_cols=28 Identities=18% Similarity=0.538 Sum_probs=0.0
Q ss_pred CCCCCCchHHHHHHhccccccCCCCCCCCCceeccccccccCC
Q psy14939 1 ISPYANKPFQEILTSQNNAVNGSSSKTTKDRYACKYCGKVFPR 43 (971)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~k~F~~ 43 (971)
+||+|+...-- .|++.|+|+.|++.+..
T Consensus 352 ~Cp~Cg~~m~S---------------~G~~g~rC~kCg~~~~~ 379 (421)
T COG1571 352 VCPRCGGRMKS---------------AGRNGFRCKKCGTRARE 379 (421)
T ss_pred CCCccCCchhh---------------cCCCCcccccccccCCc
No 184
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=21.23 E-value=39 Score=31.48 Aligned_cols=13 Identities=23% Similarity=0.726 Sum_probs=6.1
Q ss_pred CCCCCcccCCCCC
Q psy14939 758 TTKDRYACKYCGK 770 (971)
Q Consensus 758 ~~~~~~~C~~C~~ 770 (971)
.+.+.|.|..||.
T Consensus 108 ~g~G~l~C~~Cg~ 120 (146)
T PF07295_consen 108 VGPGTLVCENCGH 120 (146)
T ss_pred ecCceEecccCCC
Confidence 3444455555554
No 185
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=21.00 E-value=30 Score=40.13 Aligned_cols=47 Identities=28% Similarity=0.645 Sum_probs=0.0
Q ss_pred eeccccccccCChHHHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCC
Q psy14939 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCD 95 (971)
Q Consensus 32 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~ 95 (971)
..|..||....-.. -...+..|.......|.+||+.. ..+..|+.|+
T Consensus 214 ~~C~~Cg~~~~C~~-C~~~l~~h~~~~~l~Ch~Cg~~~----------------~~~~~Cp~C~ 260 (505)
T TIGR00595 214 LLCRSCGYILCCPN-CDVSLTYHKKEGKLRCHYCGYQE----------------PIPKTCPQCG 260 (505)
T ss_pred eEhhhCcCccCCCC-CCCceEEecCCCeEEcCCCcCcC----------------CCCCCCCCCC
No 186
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=20.94 E-value=46 Score=30.73 Aligned_cols=33 Identities=33% Similarity=0.851 Sum_probs=19.8
Q ss_pred ceeccccccccCChHHHHHHHHHhcCCcceeccccccc
Q psy14939 31 RYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERS 68 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~ 68 (971)
.|.|. |+..|.+ .++|-..-.|+ .|.|..|+-.
T Consensus 117 ~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 117 PYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred eEEee-cCCccch---hhhcccccccc-eEEeccCCce
Confidence 56777 7776654 33555544555 5777777543
No 187
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.67 E-value=35 Score=31.39 Aligned_cols=18 Identities=17% Similarity=0.556 Sum_probs=13.3
Q ss_pred CCCCCcccCCCCCccCCh
Q psy14939 758 TTKDRYACKYCGKVFPRS 775 (971)
Q Consensus 758 ~~~~~~~C~~C~~~f~~~ 775 (971)
.....+.|..||..|...
T Consensus 66 ~~p~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 66 EEEAVLKCRNCGNEWSLK 83 (135)
T ss_pred ecceEEECCCCCCEEecc
Confidence 334678999999888654
No 188
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=20.57 E-value=25 Score=28.10 Aligned_cols=59 Identities=22% Similarity=0.437 Sum_probs=27.8
Q ss_pred ceeccccccccCChH---HHHHHHHHhcCCcceeccccccccCCHHHHHHHHHhhcCCCCCeecCCCCccc
Q psy14939 31 RYACKYCGKVFPRSA---NLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLCDRCF 98 (971)
Q Consensus 31 ~~~C~~C~k~F~~~~---~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~k~~~C~~C~~~f 98 (971)
.++|+-|++.|.=.. .+..|-......+...|..|...+.-... ..+ ..|.|++|+..|
T Consensus 10 ~~~~~cC~~~y~C~~CHde~~~H~~~~~~~~~v~Cg~C~~~~~~~~~--------~c~-~~~~C~~C~~~~ 71 (71)
T PF05495_consen 10 AIRFPCCGKYYPCRFCHDELEDHPFDRWPVKRVICGKCRTEQPIDEY--------SCG-ADYFCPICGLYF 71 (71)
T ss_dssp EEEETTTTEEESSHHHHHHCSSS---TTT--EEEETTT--EEES-SB--------TT---SEEETTTTEEE
T ss_pred EEECCcccCeecHHHHHHHhccCccccccccCeECCCCCCccChhhh--------hcC-CCccCcCcCCCC
Confidence 456677777666431 12233222233346788888877654433 111 457888887654
No 189
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=20.10 E-value=52 Score=29.56 Aligned_cols=55 Identities=29% Similarity=0.608 Sum_probs=30.0
Q ss_pred CcccccCcCCCccCChHHHhhCccccCCCCcccc---cccchhcCCHHHHHHHHhhhcCCCCceeccccccccCCHH
Q psy14939 251 ARQHACQECGKTFATSSGLKQHTHIHSSVKPFRC---EVCHKAYTQFSNLCRHKRMHANCRMQIKCVKCDQSFSTVT 324 (971)
Q Consensus 251 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C---~~C~~~~~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~ 324 (971)
.+.|+|.+|..+...+.-|+ |-.| .+|+.-+ .+|..|-..| -+|+.|.-+|.+..
T Consensus 78 ~~lYeCnIC~etS~ee~FLK----------PneCCgY~iCn~Cy---a~LWK~~~~y------pvCPvCkTSFKss~ 135 (140)
T PF05290_consen 78 PKLYECNICKETSAEERFLK----------PNECCGYSICNACY---ANLWKFCNLY------PVCPVCKTSFKSSS 135 (140)
T ss_pred CCceeccCcccccchhhcCC----------cccccchHHHHHHH---HHHHHHcccC------CCCCcccccccccc
Confidence 36677777777665544332 3333 2233332 4455554433 26888887776654
Done!