Query psy16830
Match_columns 712
No_of_seqs 596 out of 4049
Neff 10.5
Searched_HMMs 46136
Date Fri Aug 16 23:08:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16830.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16830hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.4E-32 2.9E-37 281.7 4.1 208 377-628 604-935 (958)
2 KOG1074|consensus 100.0 1.3E-31 2.8E-36 274.6 5.4 110 318-427 604-730 (958)
3 KOG2462|consensus 99.9 1.4E-27 3E-32 216.2 6.9 104 510-621 131-237 (279)
4 KOG3608|consensus 99.9 2.1E-26 4.6E-31 212.9 12.1 243 288-581 132-396 (467)
5 KOG3608|consensus 99.9 6E-26 1.3E-30 209.9 13.9 253 319-625 69-378 (467)
6 KOG2462|consensus 99.9 1.4E-26 3E-31 209.8 5.7 136 478-621 127-265 (279)
7 KOG3623|consensus 99.9 3.6E-24 7.8E-29 216.0 8.2 81 478-558 891-971 (1007)
8 KOG3623|consensus 99.9 2.4E-23 5.2E-28 210.1 8.9 78 537-620 894-971 (1007)
9 KOG3576|consensus 99.7 4.5E-18 9.7E-23 145.5 1.3 123 479-624 115-237 (267)
10 KOG3576|consensus 99.7 1.8E-17 4E-22 141.8 1.3 119 94-212 114-238 (267)
11 PLN03086 PRLI-interacting fact 99.2 3.5E-11 7.6E-16 125.0 7.1 143 482-696 408-562 (567)
12 PLN03086 PRLI-interacting fact 99.1 8.1E-11 1.8E-15 122.3 6.6 120 482-624 434-565 (567)
13 PHA00733 hypothetical protein 99.0 3.6E-10 7.8E-15 95.3 4.6 83 535-624 38-124 (128)
14 PHA00733 hypothetical protein 98.9 1.6E-09 3.4E-14 91.4 5.3 53 185-239 72-124 (128)
15 PHA02768 hypothetical protein; 98.9 8.3E-10 1.8E-14 75.0 2.2 44 571-616 5-48 (55)
16 KOG3993|consensus 98.9 4.7E-10 1E-14 108.1 1.2 147 480-626 266-485 (500)
17 PHA02768 hypothetical protein; 98.7 4E-09 8.8E-14 71.7 2.0 42 510-553 6-47 (55)
18 KOG3993|consensus 98.7 1.2E-09 2.6E-14 105.4 -1.0 218 185-433 266-485 (500)
19 PF13465 zf-H2C2_2: Zinc-finge 98.7 1.2E-08 2.5E-13 59.2 1.9 26 586-611 1-26 (26)
20 KOG1146|consensus 98.7 1.3E-08 2.8E-13 112.6 3.4 174 155-340 439-639 (1406)
21 PF13465 zf-H2C2_2: Zinc-finge 98.6 1.6E-08 3.6E-13 58.6 0.9 24 689-712 1-24 (26)
22 PHA00616 hypothetical protein 98.4 1.2E-07 2.7E-12 61.1 2.4 34 674-707 1-34 (44)
23 PHA00616 hypothetical protein 98.3 1.8E-07 3.9E-12 60.4 1.1 39 571-609 1-39 (44)
24 KOG1146|consensus 98.2 9.9E-07 2.1E-11 98.2 4.5 80 538-628 1261-1357(1406)
25 PHA00732 hypothetical protein 98.1 1.8E-06 4E-11 65.4 2.7 45 571-621 1-46 (79)
26 PHA00732 hypothetical protein 98.0 3.5E-06 7.5E-11 63.9 2.9 45 509-559 1-46 (79)
27 PF05605 zf-Di19: Drought indu 97.9 1.3E-05 2.9E-10 56.4 3.7 49 572-623 3-53 (54)
28 PF05605 zf-Di19: Drought indu 97.8 1.6E-05 3.4E-10 56.0 2.7 49 187-238 3-53 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.7 8.9E-06 1.9E-10 45.9 0.5 23 675-697 1-23 (23)
30 PF00096 zf-C2H2: Zinc finger, 97.6 5.3E-05 1.1E-09 42.7 2.4 23 187-209 1-23 (23)
31 COG5189 SFP1 Putative transcri 97.5 3.7E-05 8.1E-10 71.9 2.0 72 596-695 346-419 (423)
32 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00012 2.7E-09 41.7 1.6 24 600-623 1-24 (24)
33 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00013 2.9E-09 59.8 1.9 75 539-624 1-75 (100)
34 COG5189 SFP1 Putative transcri 97.2 0.00011 2.4E-09 68.8 1.3 25 596-620 395-419 (423)
35 PF13894 zf-C2H2_4: C2H2-type 97.2 0.00033 7.2E-09 39.8 2.5 23 187-209 1-23 (24)
36 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00011 2.4E-09 43.2 0.4 26 674-699 1-26 (27)
37 PF13912 zf-C2H2_6: C2H2-type 97.0 0.00039 8.4E-09 40.9 1.9 25 186-210 1-25 (27)
38 PF12756 zf-C2H2_2: C2H2 type 97.0 0.00049 1.1E-08 56.4 2.4 24 378-401 50-73 (100)
39 smart00355 ZnF_C2H2 zinc finge 96.8 0.00084 1.8E-08 39.0 2.0 25 675-699 1-25 (26)
40 KOG2231|consensus 96.7 0.0014 2.9E-08 70.1 4.2 20 479-498 403-422 (669)
41 KOG2231|consensus 96.6 0.0054 1.2E-07 65.7 7.2 62 153-236 100-171 (669)
42 smart00355 ZnF_C2H2 zinc finge 96.3 0.0027 5.8E-08 36.7 2.0 22 573-594 2-23 (26)
43 PRK04860 hypothetical protein; 96.3 0.0022 4.7E-08 56.4 1.9 40 571-614 119-158 (160)
44 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0026 5.5E-08 36.1 1.4 23 600-623 1-23 (24)
45 PF09237 GAGA: GAGA factor; I 96.2 0.0039 8.5E-08 41.3 2.1 31 316-346 21-51 (54)
46 PF09237 GAGA: GAGA factor; I 96.1 0.0047 1E-07 41.0 2.4 32 506-537 21-52 (54)
47 COG5236 Uncharacterized conser 96.1 0.009 2E-07 57.1 5.0 133 97-244 151-311 (493)
48 PRK04860 hypothetical protein; 96.1 0.0026 5.7E-08 55.9 1.3 36 673-712 118-153 (160)
49 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0066 1.4E-07 34.4 2.3 23 187-210 1-23 (24)
50 PF12874 zf-met: Zinc-finger o 96.0 0.0052 1.1E-07 35.2 1.9 23 187-209 1-23 (25)
51 COG5048 FOG: Zn-finger [Genera 95.9 0.028 6E-07 60.7 8.9 139 571-710 289-454 (467)
52 KOG2785|consensus 95.7 0.031 6.7E-07 54.9 7.2 75 289-369 165-242 (390)
53 PF12874 zf-met: Zinc-finger o 95.7 0.0033 7.1E-08 36.1 0.3 23 675-697 1-23 (25)
54 KOG2482|consensus 95.5 0.027 6E-07 54.0 5.8 178 185-370 143-357 (423)
55 COG5236 Uncharacterized conser 95.2 0.012 2.7E-07 56.2 2.4 73 292-373 222-307 (493)
56 PF12171 zf-C2H2_jaz: Zinc-fin 95.1 0.0059 1.3E-07 35.8 -0.0 23 675-697 2-24 (27)
57 COG5048 FOG: Zn-finger [Genera 95.1 0.015 3.3E-07 62.8 3.1 147 480-628 288-446 (467)
58 KOG2785|consensus 94.7 0.038 8.2E-07 54.4 4.3 82 319-400 3-90 (390)
59 PF12171 zf-C2H2_jaz: Zinc-fin 94.7 0.022 4.7E-07 33.3 1.7 21 510-530 2-22 (27)
60 KOG4173|consensus 94.4 0.013 2.9E-07 51.7 0.4 81 536-625 78-172 (253)
61 KOG2482|consensus 92.7 0.32 6.9E-06 47.0 6.5 138 289-427 143-355 (423)
62 PF13913 zf-C2HC_2: zinc-finge 91.9 0.087 1.9E-06 30.0 1.1 21 675-696 3-23 (25)
63 smart00451 ZnF_U1 U1-like zinc 91.5 0.086 1.9E-06 33.1 0.9 24 673-696 2-25 (35)
64 PF13913 zf-C2HC_2: zinc-finge 91.2 0.15 3.3E-06 29.0 1.6 19 573-592 4-22 (25)
65 cd00350 rubredoxin_like Rubred 89.9 0.19 4.1E-06 30.9 1.4 11 510-520 2-12 (33)
66 smart00451 ZnF_U1 U1-like zinc 89.1 0.31 6.8E-06 30.4 2.0 23 186-208 3-25 (35)
67 KOG4173|consensus 88.4 0.25 5.4E-06 43.9 1.5 78 348-431 80-171 (253)
68 cd00350 rubredoxin_like Rubred 88.3 0.31 6.6E-06 30.0 1.5 10 598-607 16-25 (33)
69 PF12013 DUF3505: Protein of u 88.2 0.31 6.7E-06 40.3 1.9 27 673-699 79-109 (109)
70 KOG2893|consensus 87.9 0.13 2.7E-06 46.7 -0.6 41 293-338 13-53 (341)
71 KOG2893|consensus 86.1 0.26 5.6E-06 44.8 0.4 51 570-625 10-60 (341)
72 COG4049 Uncharacterized protei 85.6 0.33 7.2E-06 32.9 0.6 31 593-623 11-41 (65)
73 TIGR00622 ssl1 transcription f 83.5 2.4 5.3E-05 34.5 4.7 24 571-594 81-104 (112)
74 PF15269 zf-C2H2_7: Zinc-finge 80.8 1.4 3.1E-05 28.3 2.0 22 675-696 21-42 (54)
75 PF12013 DUF3505: Protein of u 80.2 1.4 3.1E-05 36.4 2.5 23 185-208 10-32 (109)
76 cd00729 rubredoxin_SM Rubredox 79.8 1.1 2.3E-05 27.8 1.2 10 510-519 3-12 (34)
77 PF09986 DUF2225: Uncharacteri 78.3 1.1 2.3E-05 42.2 1.4 25 151-175 4-28 (214)
78 PF09986 DUF2225: Uncharacteri 77.8 0.77 1.7E-05 43.2 0.2 12 572-583 49-60 (214)
79 COG4049 Uncharacterized protei 77.0 1.1 2.4E-05 30.5 0.7 28 211-238 14-41 (65)
80 PF09538 FYDLN_acid: Protein o 74.9 1.6 3.5E-05 35.6 1.3 27 675-712 10-36 (108)
81 PF09538 FYDLN_acid: Protein o 73.5 2.3 5E-05 34.7 1.9 14 151-164 25-38 (108)
82 PF02892 zf-BED: BED zinc fing 72.5 2.1 4.6E-05 28.5 1.2 27 597-623 14-44 (45)
83 KOG2186|consensus 71.6 1.7 3.7E-05 40.4 0.8 49 125-176 4-52 (276)
84 COG1592 Rubrerythrin [Energy p 71.0 2.6 5.6E-05 37.3 1.7 23 509-544 134-156 (166)
85 PF10571 UPF0547: Uncharacteri 70.6 2.3 5E-05 24.5 0.9 9 511-519 16-24 (26)
86 TIGR00622 ssl1 transcription f 70.4 5.8 0.00013 32.4 3.5 78 480-559 14-103 (112)
87 COG2888 Predicted Zn-ribbon RN 69.5 3.1 6.7E-05 29.1 1.5 8 510-517 28-35 (61)
88 PF13717 zinc_ribbon_4: zinc-r 66.7 4.2 9.1E-05 25.6 1.6 33 153-196 3-35 (36)
89 PRK00464 nrdR transcriptional 66.5 1.9 4.1E-05 37.8 0.0 14 600-613 29-42 (154)
90 PHA00626 hypothetical protein 65.9 3.4 7.5E-05 28.4 1.1 13 186-198 23-35 (59)
91 TIGR02098 MJ0042_CXXC MJ0042 f 65.8 3.4 7.5E-05 26.3 1.1 14 153-166 3-16 (38)
92 PF13719 zinc_ribbon_5: zinc-r 62.7 5.2 0.00011 25.4 1.5 9 573-581 27-35 (37)
93 smart00614 ZnF_BED BED zinc fi 62.0 5.6 0.00012 27.3 1.7 22 602-623 21-47 (50)
94 KOG2186|consensus 61.3 3 6.4E-05 38.9 0.2 51 482-535 4-54 (276)
95 smart00659 RPOLCX RNA polymera 61.2 5.4 0.00012 26.5 1.4 25 572-607 3-27 (44)
96 PRK09678 DNA-binding transcrip 61.0 1.9 4E-05 32.1 -0.9 13 674-686 27-41 (72)
97 PRK00464 nrdR transcriptional 60.7 3.8 8.3E-05 35.9 0.8 15 186-200 28-42 (154)
98 PRK00398 rpoP DNA-directed RNA 60.1 3.5 7.7E-05 27.7 0.4 12 571-582 3-14 (46)
99 smart00734 ZnF_Rad18 Rad18-lik 60.1 5.5 0.00012 22.9 1.1 20 676-696 3-22 (26)
100 PRK14890 putative Zn-ribbon RN 59.5 5.7 0.00012 27.9 1.3 9 510-518 26-34 (59)
101 smart00531 TFIIE Transcription 57.9 4.7 0.0001 35.4 1.0 13 571-583 99-111 (147)
102 smart00834 CxxC_CXXC_SSSS Puta 57.9 4.2 9.1E-05 26.4 0.5 11 572-582 6-16 (41)
103 TIGR02300 FYDLN_acid conserved 57.5 7 0.00015 32.4 1.7 14 151-164 25-38 (129)
104 PF09723 Zn-ribbon_8: Zinc rib 56.3 1.9 4E-05 28.3 -1.4 29 675-710 6-34 (42)
105 TIGR02605 CxxC_CxxC_SSSS putat 56.2 4.5 9.8E-05 28.0 0.4 12 572-583 6-17 (52)
106 TIGR00373 conserved hypothetic 55.2 8.6 0.00019 34.2 2.1 33 505-546 105-137 (158)
107 COG1996 RPC10 DNA-directed RNA 55.1 5.8 0.00013 26.8 0.8 29 185-223 5-33 (49)
108 COG1592 Rubrerythrin [Energy p 53.5 8.6 0.00019 34.1 1.8 23 319-354 134-156 (166)
109 KOG2071|consensus 53.3 17 0.00037 38.9 4.2 23 187-209 419-441 (579)
110 COG5151 SSL1 RNA polymerase II 52.6 13 0.00029 35.7 3.0 48 540-595 365-412 (421)
111 PF15269 zf-C2H2_7: Zinc-finge 52.5 8.3 0.00018 25.0 1.1 23 571-593 20-42 (54)
112 TIGR02300 FYDLN_acid conserved 52.4 9 0.0002 31.8 1.6 30 482-522 10-39 (129)
113 PRK09678 DNA-binding transcrip 52.4 4.3 9.3E-05 30.2 -0.2 42 572-615 2-45 (72)
114 PF05443 ROS_MUCR: ROS/MUCR tr 51.7 6.7 0.00014 33.3 0.8 21 97-120 72-92 (132)
115 TIGR00373 conserved hypothetic 50.4 9.8 0.00021 33.8 1.7 42 140-196 97-138 (158)
116 PF03604 DNA_RNApol_7kD: DNA d 50.4 10 0.00022 23.2 1.2 10 153-162 1-10 (32)
117 smart00531 TFIIE Transcription 49.3 14 0.00031 32.4 2.5 37 506-546 96-132 (147)
118 PRK06266 transcription initiat 48.8 12 0.00027 33.9 2.2 33 506-547 114-146 (178)
119 COG1198 PriA Primosomal protei 48.2 13 0.00029 41.7 2.6 9 511-519 437-445 (730)
120 PF02176 zf-TRAF: TRAF-type zi 48.1 8.5 0.00018 27.5 0.8 21 522-542 23-43 (60)
121 PRK06266 transcription initiat 47.7 10 0.00022 34.4 1.4 16 151-166 116-131 (178)
122 PF12907 zf-met2: Zinc-binding 47.6 15 0.00033 23.7 1.8 32 291-322 2-36 (40)
123 KOG4124|consensus 47.3 5.5 0.00012 39.0 -0.4 72 535-621 347-420 (442)
124 PF00301 Rubredoxin: Rubredoxi 45.8 14 0.00031 24.9 1.5 13 153-165 2-14 (47)
125 PRK04023 DNA polymerase II lar 45.7 18 0.0004 41.4 3.2 12 571-582 663-674 (1121)
126 KOG4124|consensus 44.4 8.9 0.00019 37.6 0.5 71 597-695 347-419 (442)
127 KOG2807|consensus 43.4 32 0.00068 33.7 3.9 25 571-595 345-369 (378)
128 cd00730 rubredoxin Rubredoxin; 42.1 16 0.00036 25.0 1.4 12 153-164 2-13 (50)
129 PF12760 Zn_Tnp_IS1595: Transp 41.7 21 0.00045 24.0 1.8 21 307-328 7-27 (46)
130 COG5151 SSL1 RNA polymerase II 41.6 50 0.0011 32.0 4.9 49 153-209 363-411 (421)
131 KOG4167|consensus 41.1 10 0.00022 41.3 0.4 26 123-148 791-816 (907)
132 COG5188 PRP9 Splicing factor 3 40.0 23 0.00049 34.9 2.5 25 186-210 238-264 (470)
133 TIGR01384 TFS_arch transcripti 39.0 11 0.00023 30.9 0.1 37 571-610 62-101 (104)
134 PF08274 PhnA_Zn_Ribbon: PhnA 38.9 13 0.00029 22.2 0.5 24 573-607 4-27 (30)
135 COG1997 RPL43A Ribosomal prote 38.4 20 0.00043 27.6 1.4 11 572-582 54-64 (89)
136 PF14353 CpXC: CpXC protein 37.2 3.3 7.1E-05 35.4 -3.3 8 512-519 4-11 (128)
137 PF01363 FYVE: FYVE zinc finge 36.0 12 0.00025 27.8 -0.1 26 675-712 10-35 (69)
138 PF06524 NOA36: NOA36 protein; 35.8 20 0.00042 33.8 1.3 50 320-370 183-232 (314)
139 PF05290 Baculo_IE-1: Baculovi 35.6 23 0.00049 29.8 1.5 58 147-228 75-135 (140)
140 PF04959 ARS2: Arsenite-resist 35.4 17 0.00038 33.9 1.0 28 597-624 75-102 (214)
141 KOG2807|consensus 34.8 81 0.0018 31.1 5.2 83 330-426 223-310 (378)
142 PF06524 NOA36: NOA36 protein; 34.4 11 0.00024 35.4 -0.5 12 95-106 36-47 (314)
143 PF13240 zinc_ribbon_2: zinc-r 34.2 13 0.00029 20.6 -0.0 6 677-682 16-21 (23)
144 PRK04023 DNA polymerase II lar 33.7 28 0.0006 40.0 2.3 11 572-582 1038-1048(1121)
145 KOG1280|consensus 33.0 52 0.0011 32.7 3.7 29 599-627 79-107 (381)
146 KOG2593|consensus 32.7 37 0.0008 34.9 2.8 21 148-168 124-144 (436)
147 PF05443 ROS_MUCR: ROS/MUCR tr 31.2 26 0.00056 29.9 1.2 25 186-213 72-96 (132)
148 KOG3408|consensus 30.8 33 0.00071 28.2 1.6 28 669-696 52-79 (129)
149 PF05191 ADK_lid: Adenylate ki 30.8 30 0.00066 21.8 1.2 8 512-519 4-11 (36)
150 smart00440 ZnF_C2C2 C2C2 Zinc 30.7 16 0.00034 23.7 -0.1 10 510-519 29-38 (40)
151 PF01096 TFIIS_C: Transcriptio 29.6 8.9 0.00019 24.6 -1.4 10 510-519 29-38 (39)
152 cd00065 FYVE FYVE domain; Zinc 29.3 29 0.00063 24.3 1.1 13 673-685 17-29 (57)
153 PF13878 zf-C2H2_3: zinc-finge 29.0 54 0.0012 21.4 2.2 25 186-210 13-39 (41)
154 KOG2593|consensus 28.9 29 0.00062 35.7 1.3 35 673-710 127-161 (436)
155 COG3357 Predicted transcriptio 28.2 37 0.00081 26.2 1.5 13 319-331 58-70 (97)
156 PF07754 DUF1610: Domain of un 28.2 24 0.00052 19.9 0.3 9 152-160 16-24 (24)
157 KOG2272|consensus 27.4 45 0.00097 31.3 2.1 71 122-200 97-177 (332)
158 PF09845 DUF2072: Zn-ribbon co 27.1 32 0.0007 29.0 1.1 13 510-522 2-14 (131)
159 smart00064 FYVE Protein presen 26.8 32 0.00069 25.3 0.9 26 675-712 11-36 (68)
160 PF04959 ARS2: Arsenite-resist 26.2 34 0.00074 32.0 1.2 25 379-403 78-102 (214)
161 KOG4167|consensus 26.0 32 0.0007 37.7 1.1 27 672-698 790-816 (907)
162 KOG3408|consensus 25.7 43 0.00094 27.5 1.5 22 124-145 57-78 (129)
163 KOG1280|consensus 25.0 44 0.00094 33.2 1.7 37 673-709 78-116 (381)
164 KOG2071|consensus 24.9 34 0.00074 36.8 1.1 29 671-699 415-443 (579)
165 COG4957 Predicted transcriptio 24.7 31 0.00067 28.9 0.6 20 98-120 77-96 (148)
166 COG2331 Uncharacterized protei 24.4 28 0.0006 25.9 0.2 29 674-710 12-41 (82)
167 KOG4377|consensus 24.4 43 0.00093 34.0 1.6 109 509-624 271-427 (480)
168 PF14446 Prok-RING_1: Prokaryo 24.0 53 0.0011 22.9 1.5 12 572-583 6-17 (54)
169 PTZ00255 60S ribosomal protein 23.8 38 0.00082 26.5 0.9 12 571-582 54-65 (90)
170 COG4530 Uncharacterized protei 23.7 38 0.00083 27.1 0.9 11 186-196 26-36 (129)
171 COG1198 PriA Primosomal protei 23.3 53 0.0011 37.2 2.2 26 321-356 446-471 (730)
172 PF01780 Ribosomal_L37ae: Ribo 23.2 39 0.00085 26.4 0.9 31 151-197 34-64 (90)
173 PF13453 zf-TFIIB: Transcripti 23.1 41 0.0009 21.8 0.8 16 600-615 20-35 (41)
174 smart00154 ZnF_AN1 AN1-like Zi 22.5 40 0.00088 21.7 0.7 11 674-684 12-22 (39)
175 COG4888 Uncharacterized Zn rib 22.2 34 0.00073 27.1 0.3 11 151-161 21-31 (104)
176 COG4957 Predicted transcriptio 22.0 57 0.0012 27.4 1.6 25 509-536 76-100 (148)
177 PF10013 DUF2256: Uncharacteri 21.9 75 0.0016 20.8 1.7 15 511-525 10-24 (42)
178 TIGR01206 lysW lysine biosynth 21.6 47 0.001 23.2 0.9 11 153-163 3-13 (54)
179 TIGR00280 L37a ribosomal prote 21.3 41 0.00089 26.4 0.6 12 571-582 53-64 (91)
180 PRK03824 hypA hydrogenase nick 21.1 47 0.001 28.6 1.0 11 482-492 71-81 (135)
181 PF13451 zf-trcl: Probable zin 21.1 44 0.00095 22.8 0.6 12 290-301 4-15 (49)
182 COG0068 HypF Hydrogenase matur 20.6 30 0.00066 38.1 -0.3 56 484-545 126-181 (750)
183 PF08790 zf-LYAR: LYAR-type C2 20.0 46 0.00099 19.6 0.5 8 512-519 3-10 (28)
No 1
>KOG1074|consensus
Probab=99.97 E-value=1.4e-32 Score=281.72 Aligned_cols=208 Identities=26% Similarity=0.517 Sum_probs=157.2
Q ss_pred CceecCCChhhccChHHHHHHHhhhcCCCcccCCCCccccCChHHHHhHHHHhhcCCCCccccCCCCCCCcccccccccc
Q psy16830 377 KSYKCSLCPRVFTHLAALEDHQFVHTGQKTRLCTMCPAIFASIGGLRSHMKIVHHGLKTQQSCQSSHESKTCSVDEINKE 456 (712)
Q Consensus 377 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (712)
.|-+|-+|.++..-.+.|+.|.++|+|++||+|++|+++|+++.+|+.|+ .+|....+.
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~-------------------- 662 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPA-------------------- 662 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcc-cccccCccc--------------------
Confidence 45789999999999999999999999999999999999999999999999 556432221
Q ss_pred ccccccccccCCCCCchhhhccCCCCCCc---chhhhhcchhhHhhccccccCCC-------------ccccCCCCcccC
Q psy16830 457 NEEGATIRKVKPHEDSKEYMALNKPFKCD---DCDQRFLTKILLNKHKLLHFDKD-------------YFSCNYCGKTFL 520 (712)
Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~C~---~C~~~f~~~~~L~~H~~~h~~~~-------------~~~C~~C~k~f~ 520 (712)
.-++.|+ +|.+.|.+...|..|+++|.+.. .-+|..|.+.|.
T Consensus 663 ----------------------R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~ 720 (958)
T KOG1074|consen 663 ----------------------RVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFS 720 (958)
T ss_pred ----------------------cccccCCchhhhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhccc
Confidence 2457888 88888888888888888887321 146888888887
Q ss_pred ChHHHhhhhhhcC----------------CCC----cccCcccccccCChhhHHhhhhhc--c-----------------
Q psy16830 521 HKYKLRRHIVIHS----------------TAR----PFSCPVCKKQFKRKQDVAGHDRVV--H----------------- 561 (712)
Q Consensus 521 ~~~~L~~H~~~H~----------------~~~----p~~C~~C~~~f~~~~~l~~H~~~~--h----------------- 561 (712)
....+..++..|. ++. +..+..|+..+.....+..+--.. +
T Consensus 721 ~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~ 800 (958)
T KOG1074|consen 721 DARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGE 800 (958)
T ss_pred ccccchhhhhccCCcccCCcccccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhh
Confidence 7777777765551 222 455666777666555544442211 0
Q ss_pred ---cCcc------------------------------------------------------------------ccccCCc
Q psy16830 562 ---QGKK------------------------------------------------------------------LIVYKSY 572 (712)
Q Consensus 562 ---~~~~------------------------------------------------------------------~~~~~~~ 572 (712)
++++ ..++...
T Consensus 801 ~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h 880 (958)
T KOG1074|consen 801 EDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAH 880 (958)
T ss_pred hcccCCCCcccccCCCcCCccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchh
Confidence 0000 0011226
Q ss_pred ccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhhhccCCCC
Q psy16830 573 KCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKIVHHGLKT 628 (712)
Q Consensus 573 ~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~ 628 (712)
.|.+||+.|...++|+.|+++|+|+|||.|.+|++.|.++.+|+.||.+ |....+
T Consensus 881 ~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgt-H~w~q~ 935 (958)
T KOG1074|consen 881 VCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGT-HMWVQP 935 (958)
T ss_pred hhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhcc-ccccCC
Confidence 7999999999999999999999999999999999999999999999997 555443
No 2
>KOG1074|consensus
Probab=99.97 E-value=1.3e-31 Score=274.58 Aligned_cols=110 Identities=24% Similarity=0.576 Sum_probs=97.9
Q ss_pred CCcccCcCccccCChhHHHHHHHhcCCCCCccCCcccccccChHHHHHHHHHhcCCCCC-CceecC---CChhhccChHH
Q psy16830 318 GGFTCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCGKRFKRENGVINHQRLHQTKKVY-KSYKCS---LCPRVFTHLAA 393 (712)
Q Consensus 318 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~~~~C~---~C~~~f~~~~~ 393 (712)
.+.+|-+|.++.+-++.|+.|+++|+|++||+|.+||+.|.++.+|+.|+-+|...-+. .++.|+ +|.+.|.+...
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 46789999999999999999999999999999999999999999999999998765433 368999 99999999999
Q ss_pred HHHHHhhhcCC-C------------cccCCCCccccCChHHHHhHHH
Q psy16830 394 LEDHQFVHTGQ-K------------TRLCTMCPAIFASIGGLRSHMK 427 (712)
Q Consensus 394 l~~H~~~h~~~-~------------~~~C~~C~~~f~~~~~l~~H~~ 427 (712)
|..|+++|.+. . .-+|..|.+.|.....+..++.
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~s 730 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQIS 730 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhh
Confidence 99999999842 1 2469999999999999998884
No 3
>KOG2462|consensus
Probab=99.94 E-value=1.4e-27 Score=216.24 Aligned_cols=104 Identities=32% Similarity=0.634 Sum_probs=53.7
Q ss_pred cccCCCCcccCChHHHhhhhhhcCC---CCcccCcccccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchh
Q psy16830 510 FSCNYCGKTFLHKYKLRRHIVIHST---ARPFSCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGA 586 (712)
Q Consensus 510 ~~C~~C~k~f~~~~~L~~H~~~H~~---~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~ 586 (712)
|.|+.||+.+.+.++|.+|.++|.. .+.+.|++|++.|.+...|..|+|+ |+- +.+|.+|||.|...--
T Consensus 131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirT-H~l-------~c~C~iCGKaFSRPWL 202 (279)
T KOG2462|consen 131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRT-HTL-------PCECGICGKAFSRPWL 202 (279)
T ss_pred eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhc-cCC-------CcccccccccccchHH
Confidence 4444444444444444444444422 2334455555555555555555554 321 3555555555555555
Q ss_pred HHhHHhhhcCCCcccCCCCccccCCchhhhhHHhh
Q psy16830 587 LEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKI 621 (712)
Q Consensus 587 L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~ 621 (712)
|+-|+|+|||||||.|..|+++|.++++|+.||++
T Consensus 203 LQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQT 237 (279)
T KOG2462|consen 203 LQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQT 237 (279)
T ss_pred hhcccccccCCCCccCCcccchhcchHHHHHHHHh
Confidence 55555555555555555555555555555555555
No 4
>KOG3608|consensus
Probab=99.94 E-value=2.1e-26 Score=212.85 Aligned_cols=243 Identities=26% Similarity=0.518 Sum_probs=189.9
Q ss_pred CCCccC--CcchhhccCHHHHHHHHHhhccC------------C-CCccc--CcCccccCChhHHHHHHHhcCCCCCccC
Q psy16830 288 NKPFKC--DDCFQCFLSKILLNQHKKQMHFN------------K-GGFTC--NSCGLIFSEKKYLRNHMISHSTQKPFSC 350 (712)
Q Consensus 288 ~~~~~C--~~C~~~f~~~~~l~~H~~~~h~~------------~-~~~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~C 350 (712)
...|.| ..|+..|.+...+..|+.. |.. + ..+.| ..|.+.|.+++.|++|++.|++++...|
T Consensus 132 g~~f~C~WedCe~~F~s~~ef~dHV~~-H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvAC 210 (467)
T KOG3608|consen 132 GQNFRCGWEDCEREFVSIVEFQDHVVK-HALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVAC 210 (467)
T ss_pred hhhhccChhhcCCcccCHHHHHHHHHH-hhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEec
Confidence 345777 6899999999999999864 421 1 12445 4688999999999999999999999999
Q ss_pred CcccccccChHHHHHHHHHhcCCCCCCceecCCChhhccChHHHHHHHhhhcCCCcccCCCCccccCChHHHHhHHHHhh
Q psy16830 351 SLCGKRFKRENGVINHQRLHQTKKVYKSYKCSLCPRVFTHLAALEDHQFVHTGQKTRLCTMCPAIFASIGGLRSHMKIVH 430 (712)
Q Consensus 351 ~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 430 (712)
+.||.-|+++..|..|++..+.-. ..+|+|..|.+.|.+...|..|+..|.. -|+|+.|+.+....++|..|++..|
T Consensus 211 p~Cg~~F~~~tkl~DH~rRqt~l~-~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rH 287 (467)
T KOG3608|consen 211 PHCGELFRTKTKLFDHLRRQTELN-TNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRH 287 (467)
T ss_pred chHHHHhccccHHHHHHHhhhhhc-CCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhh
Confidence 999999999999999988765433 2389999999999999999999998864 5889999999999999999998766
Q ss_pred cCCCCccccCCCCCCCccccccccccccccccccccCCCCCchhhhccCCCCCCcchhhhhcchhhHhhccccccCCCcc
Q psy16830 431 HGLKTQQSCQSSHESKTCSVDEINKENEEGATIRKVKPHEDSKEYMALNKPFKCDDCDQRFLTKILLNKHKLLHFDKDYF 510 (712)
Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~ 510 (712)
.. .+||+|+.|++.|.+.+.|.+|...|. +..|
T Consensus 288 s~----------------------------------------------dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y 320 (467)
T KOG3608|consen 288 SK----------------------------------------------DKPFKCDECDTRCVRESDLAKHVQVHS-KTVY 320 (467)
T ss_pred cc----------------------------------------------CCCccccchhhhhccHHHHHHHHHhcc-ccce
Confidence 32 577888888888888888888888776 6678
Q ss_pred ccCC--CCcccCChHHHhhhhhhcC-C--CCcccCcccccccCChhhHHhhhhhcccCccccccCCcccCcCCccc
Q psy16830 511 SCNY--CGKTFLHKYKLRRHIVIHS-T--ARPFSCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAF 581 (712)
Q Consensus 511 ~C~~--C~k~f~~~~~L~~H~~~H~-~--~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f 581 (712)
+|.. |.++|++...|++|++.++ | +.+|.|..|++.|++-.+|..|++..|.-.-+.+.+.|.=.+|.-.|
T Consensus 321 ~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~ 396 (467)
T KOG3608|consen 321 QCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGF 396 (467)
T ss_pred ecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCce
Confidence 8877 8888888888888876544 3 34577777777777777777777777766555555555555555444
No 5
>KOG3608|consensus
Probab=99.93 E-value=6e-26 Score=209.86 Aligned_cols=253 Identities=25% Similarity=0.462 Sum_probs=213.1
Q ss_pred Cccc--CcCccccCC-hhHHHHHHHhcCC----------------------------------CCCccCC--cccccccC
Q psy16830 319 GFTC--NSCGLIFSE-KKYLRNHMISHST----------------------------------QKPFSCS--LCGKRFKR 359 (712)
Q Consensus 319 ~~~C--~~C~~~f~~-~~~l~~H~~~h~~----------------------------------~~~~~C~--~C~~~f~~ 359 (712)
.++| ..|++...+ ...|.+|.-.|-- ...|.|. .|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 4666 568877766 5889999877620 0125664 59999999
Q ss_pred hHHHHHHHHHhcCC----------CCCCceecC--CChhhccChHHHHHHHhhhcCCCcccCCCCccccCChHHHHhHHH
Q psy16830 360 ENGVINHQRLHQTK----------KVYKSYKCS--LCPRVFTHLAALEDHQFVHTGQKTRLCTMCPAIFASIGGLRSHMK 427 (712)
Q Consensus 360 ~~~l~~H~~~h~~~----------~~~~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 427 (712)
...+..|+..|..- .....+.|. .|.+.|.+++.|..|++.|++++...|+.|+..|.++..|..|++
T Consensus 149 ~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~r 228 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLR 228 (467)
T ss_pred HHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHH
Confidence 99999999876421 111235564 599999999999999999999999999999999999999999997
Q ss_pred HhhcCCCCccccCCCCCCCccccccccccccccccccccCCCCCchhhhccCCCCCCcchhhhhcchhhHhhccccccCC
Q psy16830 428 IVHHGLKTQQSCQSSHESKTCSVDEINKENEEGATIRKVKPHEDSKEYMALNKPFKCDDCDQRFLTKILLNKHKLLHFDK 507 (712)
Q Consensus 428 ~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 507 (712)
+.- .....+|.|..|.+.|.+...|..|+..|..
T Consensus 229 Rqt---------------------------------------------~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn- 262 (467)
T KOG3608|consen 229 RQT---------------------------------------------ELNTNSFQCAQCFKRFATEKLLKSHVVRHVN- 262 (467)
T ss_pred hhh---------------------------------------------hhcCCchHHHHHHHHHhHHHHHHHHHHHhhh-
Confidence 521 1113479999999999999999999999864
Q ss_pred CccccCCCCcccCChHHHhhhhh-hcCCCCcccCcccccccCChhhHHhhhhhcccCccccccCCcccCc--CCcccCCc
Q psy16830 508 DYFSCNYCGKTFLHKYKLRRHIV-IHSTARPFSCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSL--CPKAFTYL 584 (712)
Q Consensus 508 ~~~~C~~C~k~f~~~~~L~~H~~-~H~~~~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~--C~~~f~~~ 584 (712)
-|+|++|+.+...+++|.+||+ .|+..+||+|+.|++.|.+.++|++|..+ |+.. .|+|.. |..+|.+.
T Consensus 263 -~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~-HS~~------~y~C~h~~C~~s~r~~ 334 (467)
T KOG3608|consen 263 -CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQV-HSKT------VYQCEHPDCHYSVRTY 334 (467)
T ss_pred -cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHh-cccc------ceecCCCCCcHHHHHH
Confidence 5899999999999999999987 68889999999999999999999999996 6632 699999 99999999
Q ss_pred hhHHhHHhhhc-C--CCcccCCCCccccCCchhhhhHHhhhccC
Q psy16830 585 GALEDHQFVHT-G--QKTHLCTMCPASYATIGGLRSHIKIVHHG 625 (712)
Q Consensus 585 ~~L~~H~~~H~-~--~k~~~C~~C~~~f~~~~~L~~H~~~~H~~ 625 (712)
.+|..|++.+. | +-+|.|..|++.|++-.+|.+|++..|.-
T Consensus 335 ~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f 378 (467)
T KOG3608|consen 335 TQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGF 378 (467)
T ss_pred HHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcc
Confidence 99999998655 5 45699999999999999999999998853
No 6
>KOG2462|consensus
Probab=99.93 E-value=1.4e-26 Score=209.82 Aligned_cols=136 Identities=35% Similarity=0.677 Sum_probs=128.4
Q ss_pred cCCCCCCcchhhhhcchhhHhhccccccC---CCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhHH
Q psy16830 478 LNKPFKCDDCDQRFLTKILLNKHKLLHFD---KDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDVA 554 (712)
Q Consensus 478 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l~ 554 (712)
....|+|+.||+.+.+.++|.+|..+|.. .+.+.|++|||.|.+..+|.+|+++|+ -|+.|.+|||.|.+.=-|.
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhh
Confidence 45679999999999999999999999964 567999999999999999999999997 6899999999999999999
Q ss_pred hhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhh
Q psy16830 555 GHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKI 621 (712)
Q Consensus 555 ~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~ 621 (712)
.|+|+ |+||+ ||.|+.|+++|.++++|+.||++|.+.|+|+|..|++.|...+.|.+|...
T Consensus 205 GHiRT-HTGEK-----PF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRT-HTGEK-----PFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred ccccc-ccCCC-----CccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 99999 88885 999999999999999999999999999999999999999999999999864
No 7
>KOG3623|consensus
Probab=99.90 E-value=3.6e-24 Score=216.00 Aligned_cols=81 Identities=32% Similarity=0.652 Sum_probs=68.6
Q ss_pred cCCCCCCcchhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhHHhhh
Q psy16830 478 LNKPFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDVAGHD 557 (712)
Q Consensus 478 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l~~H~ 557 (712)
.+.+|.|+.|++.|...++|.+|...|+|.+||+|.+|.|.|..+..|..|+|.|+|++||+|+.|+|.|........||
T Consensus 891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 35678888888888888888888888888888888888888888888888888888888888888888888888888887
Q ss_pred h
Q psy16830 558 R 558 (712)
Q Consensus 558 ~ 558 (712)
.
T Consensus 971 N 971 (1007)
T KOG3623|consen 971 N 971 (1007)
T ss_pred c
Confidence 5
No 8
>KOG3623|consensus
Probab=99.89 E-value=2.4e-23 Score=210.10 Aligned_cols=78 Identities=29% Similarity=0.739 Sum_probs=73.5
Q ss_pred cccCcccccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhh
Q psy16830 537 PFSCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLR 616 (712)
Q Consensus 537 p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~ 616 (712)
+|.|+.|+|.|...+.|.+|... |+|.+ ||+|.+|.|+|..+..|..|+|.|.|||||+|..|++.|...+.+.
T Consensus 894 myaCDqCDK~FqKqSSLaRHKYE-HsGqR-----PyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYS 967 (1007)
T KOG3623|consen 894 MYACDQCDKAFQKQSSLARHKYE-HSGQR-----PYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYS 967 (1007)
T ss_pred cchHHHHHHHHHhhHHHHHhhhh-hcCCC-----CcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchH
Confidence 59999999999999999999886 88875 9999999999999999999999999999999999999999999999
Q ss_pred hHHh
Q psy16830 617 SHIK 620 (712)
Q Consensus 617 ~H~~ 620 (712)
.||.
T Consensus 968 QHMN 971 (1007)
T KOG3623|consen 968 QHMN 971 (1007)
T ss_pred hhhc
Confidence 9986
No 9
>KOG3576|consensus
Probab=99.69 E-value=4.5e-18 Score=145.53 Aligned_cols=123 Identities=27% Similarity=0.565 Sum_probs=98.7
Q ss_pred CCCCCCcchhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhHHhhhh
Q psy16830 479 NKPFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDVAGHDR 558 (712)
Q Consensus 479 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l~~H~~ 558 (712)
...|.|.+|++.|.-...|.+|++.|...+.|.|..||+.|.....|++|+++|+|.+||+|..|++.|+.+-.|..|.+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 45689999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhhhcc
Q psy16830 559 VVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKIVHH 624 (712)
Q Consensus 559 ~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~ 624 (712)
.+|+...... .. -..++.|.|..||+.-.....+..|++.+|.
T Consensus 195 kvhgv~~~ya------------------yk-----err~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 195 KVHGVQHQYA------------------YK-----ERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred HHcCchHHHH------------------HH-----HhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 9887532110 00 0123556666666666666666666665553
No 10
>KOG3576|consensus
Probab=99.66 E-value=1.8e-17 Score=141.79 Aligned_cols=119 Identities=25% Similarity=0.497 Sum_probs=100.9
Q ss_pred CCCCccCccChhccCCHHHHHHhHhhcC-CCCCcCcchhHhhcCHHHHHHHHhhcCCCCCccccccccccCChhhhhccc
Q psy16830 94 LNKPFKCDDCDQCFLTKILLNKHKLLHT-EGHLTCNLCGQTFRQKFRLLNHMISHSTEKPFSCPICKKQFKRKHEVSKHD 172 (712)
Q Consensus 94 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 172 (712)
....|.|.+|++.|.....|.+|++-|. .+.|-|..||+.|....+|++|+++|+|.+||+|..|++.|...-.|..|.
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 3567999999999999999999999995 467889999999999999999999999999999999999999999999999
Q ss_pred cccccCcc-----cccccceecCCcccccCChHHHHHHHHHhcCC
Q psy16830 173 RIVHQGKK-----YNVYKSYKCSLCPKAFTHLAALEDHQYVHTGQ 212 (712)
Q Consensus 173 ~~~h~~~~-----~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~ 212 (712)
+.+|+... ....+.|.|..||.+-.....+..|++.|+..
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 98887432 22335678888888887777788888777643
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.17 E-value=3.5e-11 Score=124.97 Aligned_cols=143 Identities=18% Similarity=0.467 Sum_probs=103.6
Q ss_pred CCCcchhhhhcchhhHhhccccccCCCccccCC--CCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhHHhhhhh
Q psy16830 482 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNY--CGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDVAGHDRV 559 (712)
Q Consensus 482 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l~~H~~~ 559 (712)
..|+.|...... ..|..|.... ....-.|+. ||..|. +..|..| +.|+.|++.|. ...|..|+..
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~ 474 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYC-SRHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKV 474 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhC-CCcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHh
Confidence 468888876654 4455776433 334456874 998883 3444444 58999999985 6789999988
Q ss_pred cccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCC----------chhhhhHHhhhccCCCCC
Q psy16830 560 VHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYAT----------IGGLRSHIKIVHHGLKTQ 629 (712)
Q Consensus 560 ~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~----------~~~L~~H~~~~H~~~~~~ 629 (712)
+| ++|.|+ ||+.+ ....|..|+.+|.+++|+.|.+|++.|.. ...|..|...
T Consensus 475 ~H--------kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~-------- 536 (567)
T PLN03086 475 FH--------EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESI-------- 536 (567)
T ss_pred cC--------CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHh--------
Confidence 65 278999 99755 66899999999999999999999998842 2367777543
Q ss_pred ccccCCCCCCccchhhccccccccccccccCCCccccchhccCCCccCChhhhhhhChHHHHhHHhh
Q psy16830 630 QSCQSSHESKTCSVDEINKENEEGATIRKVKPHEDSKEYMALNKPFKCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 630 ~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~C~~C~~~f~~~~~L~~H~~~ 696 (712)
.|.+++.|..||+.+.. .+|..|+-.
T Consensus 537 ----------------------------------------CG~rt~~C~~Cgk~Vrl-rdm~~H~~~ 562 (567)
T PLN03086 537 ----------------------------------------CGSRTAPCDSCGRSVML-KEMDIHQIA 562 (567)
T ss_pred ----------------------------------------cCCcceEccccCCeeee-hhHHHHHHH
Confidence 35788999999987754 467777643
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.11 E-value=8.1e-11 Score=122.33 Aligned_cols=120 Identities=22% Similarity=0.517 Sum_probs=97.7
Q ss_pred CCCcc--hhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhHHhhhhh
Q psy16830 482 FKCDD--CDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDVAGHDRV 559 (712)
Q Consensus 482 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l~~H~~~ 559 (712)
..|+. ||..|. +..+..| +.|+.|++.|. ...|..|+.+|+ +|+.|+ |++.+ .+..|..|+..
T Consensus 434 V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~t 498 (567)
T PLN03086 434 VVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQAS 498 (567)
T ss_pred eeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhc
Confidence 35874 999884 4444555 47999999996 688999999985 899999 99765 67899999876
Q ss_pred cccCccccccCCcccCcCCcccC----------CchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhhhcc
Q psy16830 560 VHQGKKLIVYKSYKCSLCPKAFT----------YLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKIVHH 624 (712)
Q Consensus 560 ~h~~~~~~~~~~~~C~~C~~~f~----------~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~ 624 (712)
|.++ +++.|++|++.|. ..+.|..|..++ |.+++.|..||+.+.. .+|..|+..+|.
T Consensus 499 -hCp~-----Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 499 -TCPL-----RLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVML-KEMDIHQIAVHQ 565 (567)
T ss_pred -cCCC-----CceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeee-hhHHHHHHHhhc
Confidence 5555 3899999999995 245899999986 8999999999998876 679999988884
No 13
>PHA00733 hypothetical protein
Probab=99.00 E-value=3.6e-10 Score=95.28 Aligned_cols=83 Identities=17% Similarity=0.301 Sum_probs=56.6
Q ss_pred CCcccCcccccccCChhhHHhh--hhh--cccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccC
Q psy16830 535 ARPFSCPVCKKQFKRKQDVAGH--DRV--VHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYA 610 (712)
Q Consensus 535 ~~p~~C~~C~~~f~~~~~l~~H--~~~--~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~ 610 (712)
.+++.|.+|.+.|.....|..| ++. .+.+. +||.|+.|++.|.+...|..|++.| +.+|.|.+|++.|.
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~-----kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~ 110 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAV-----SPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFR 110 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCC-----CCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccC
Confidence 3455555555555555544444 111 11112 4788888888888888888888876 35788888888888
Q ss_pred CchhhhhHHhhhcc
Q psy16830 611 TIGGLRSHIKIVHH 624 (712)
Q Consensus 611 ~~~~L~~H~~~~H~ 624 (712)
....|.+|+...|.
T Consensus 111 ~~~sL~~H~~~~h~ 124 (128)
T PHA00733 111 NTDSTLDHVCKKHN 124 (128)
T ss_pred CHHHHHHHHHHhcC
Confidence 88888888888663
No 14
>PHA00733 hypothetical protein
Probab=98.91 E-value=1.6e-09 Score=91.42 Aligned_cols=53 Identities=26% Similarity=0.595 Sum_probs=33.8
Q ss_pred cceecCCcccccCChHHHHHHHHHhcCCCCcCCccccccCCChhHHHHHHHHhcc
Q psy16830 185 KSYKCSLCPKAFTHLAALEDHQYVHTGQKSHMCNLCSARFTCKGALLGHIKVLHE 239 (712)
Q Consensus 185 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~ 239 (712)
++|.|+.|++.|.+...|..|++.| +.+|.|+.|++.|.....|..|+...|.
T Consensus 72 kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 72 SPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 3566666666666666666666654 3456666666666666666666666553
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.88 E-value=8.3e-10 Score=74.98 Aligned_cols=44 Identities=16% Similarity=0.328 Sum_probs=40.3
Q ss_pred CcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLR 616 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~ 616 (712)
.|+|+.||+.|...+.|..|+++|+ ++|+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999999 7999999999999888775
No 16
>KOG3993|consensus
Probab=98.88 E-value=4.7e-10 Score=108.06 Aligned_cols=147 Identities=20% Similarity=0.337 Sum_probs=113.8
Q ss_pred CCCCCcchhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCC--------C----------------
Q psy16830 480 KPFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHST--------A---------------- 535 (712)
Q Consensus 480 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~--------~---------------- 535 (712)
..|.|..|...|.+...|.+|.-...-.--|.|+.|+|.|+...+|..|.|.|.. .
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 4589999999999999999997544444569999999999999999999999931 1
Q ss_pred ---------CcccCcccccccCChhhHHhhhhhcccCccccc--------------------------------------
Q psy16830 536 ---------RPFSCPVCKKQFKRKQDVAGHDRVVHQGKKLIV-------------------------------------- 568 (712)
Q Consensus 536 ---------~p~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~-------------------------------------- 568 (712)
.-|.|.+|++.|.+...|+.|..++|.......
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a 425 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA 425 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence 138899999999999999999887664332110
Q ss_pred --cCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhhhccCC
Q psy16830 569 --YKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKIVHHGL 626 (712)
Q Consensus 569 --~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 626 (712)
...-.|++|+-.+.++..---+.+.-.-+..|.|.+|...|.+..+|.+|+...|..+
T Consensus 426 ~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 426 GSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred ccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 0112466677666666555555554445677999999999999999999999988654
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.75 E-value=4e-09 Score=71.66 Aligned_cols=42 Identities=19% Similarity=0.545 Sum_probs=28.1
Q ss_pred cccCCCCcccCChHHHhhhhhhcCCCCcccCcccccccCChhhH
Q psy16830 510 FSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQFKRKQDV 553 (712)
Q Consensus 510 ~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f~~~~~l 553 (712)
|.|+.||+.|...++|..|+++|+ +||+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 567777777777777777777766 566666677666655544
No 18
>KOG3993|consensus
Probab=98.74 E-value=1.2e-09 Score=105.37 Aligned_cols=218 Identities=16% Similarity=0.188 Sum_probs=108.8
Q ss_pred cceecCCcccccCChHHHHHHHHHhcCCCCcCCccccccCCChhHHHHHHHHhccCCCCCCcccccccCCc-cccccccc
Q psy16830 185 KSYKCSLCPKAFTHLAALEDHQYVHTGQKSHMCNLCSARFTCKGALLGHIKVLHEGYKRPQESCQSSHESK-TCSVDEIN 263 (712)
Q Consensus 185 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~-~~~~~~~~ 263 (712)
..|.|..|...|.+.-.|.+|.-.-..--.|+|+.|++.|+.-.+|..|.+.|........ ......+ ........
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~---a~~~P~k~~~~~rae~ 342 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAK---AGSPPPKQAVETRAEV 342 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhh---cCCCChhhhhhhhhhh
Confidence 3477777777777777777775322222347778888888777788777776332211100 0000000 00000000
Q ss_pred cccc-ccccccccCCCcchhhhhccCCCccCCcchhhccCHHHHHHHHHhhccCCCCcccCcCccccCChhHHHHHHHhc
Q psy16830 264 KENE-EGATIRKVKPHEDSKEYMALNKPFKCDDCFQCFLSKILLNQHKKQMHFNKGGFTCNSCGLIFSEKKYLRNHMISH 342 (712)
Q Consensus 264 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 342 (712)
++.+ .++. ..+..|.|..|++.|.....|+.|+.++|.....-.= .-.|.....
T Consensus 343 ~ea~rsg~d--------------ss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~---a~~f~~s~~-------- 397 (500)
T KOG3993|consen 343 QEAERSGDD--------------SSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK---APKFLLSRV-------- 397 (500)
T ss_pred hhccccCCc--------------ccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc---ccCcchhhc--------
Confidence 0000 0000 1123577777777777777777776654432211000 000100000
Q ss_pred CCCCCccCCcccccccChHHHHHHHHHhcCCCCCCceecCCChhhccChHHHHHHHhhhcCCCcccCCCCccccCChHHH
Q psy16830 343 STQKPFSCSLCGKRFKRENGVINHQRLHQTKKVYKSYKCSLCPRVFTHLAALEDHQFVHTGQKTRLCTMCPAIFASIGGL 422 (712)
Q Consensus 343 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 422 (712)
..-.+-|..|+-.+...+.-..+...+.+.. .-..|++|+-.+.++..-..+.+.-..+..|.|.+|..+|.+...|
T Consensus 398 -~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sa--el~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~l 474 (500)
T KOG3993|consen 398 -IPLMHFNQAVATHSSASDSHGDEVLYVAGSA--ELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGL 474 (500)
T ss_pred -ccccccccccccccccccccccceeeeeccc--cccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcch
Confidence 0011234444444443332222222222211 1245667776666666555555555666778888888888888888
Q ss_pred HhHHHHhhcCC
Q psy16830 423 RSHMKIVHHGL 433 (712)
Q Consensus 423 ~~H~~~~H~~~ 433 (712)
.+|+..-|..+
T Consensus 475 trhin~~Hpse 485 (500)
T KOG3993|consen 475 TRHINKCHPSE 485 (500)
T ss_pred HhHhhhcChHH
Confidence 88888777654
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.66 E-value=1.2e-08 Score=59.21 Aligned_cols=26 Identities=35% Similarity=0.657 Sum_probs=23.9
Q ss_pred hHHhHHhhhcCCCcccCCCCccccCC
Q psy16830 586 ALEDHQFVHTGQKTHLCTMCPASYAT 611 (712)
Q Consensus 586 ~L~~H~~~H~~~k~~~C~~C~~~f~~ 611 (712)
+|..|+++|+|++||.|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 48899999999999999999999974
No 20
>KOG1146|consensus
Probab=98.65 E-value=1.3e-08 Score=112.65 Aligned_cols=174 Identities=15% Similarity=0.098 Sum_probs=99.5
Q ss_pred ccccccccCChhhhhccccccccCcccccccceecCCcccccCChHHHHHHHHH-hc-----------------------
Q psy16830 155 CPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAFTHLAALEDHQYV-HT----------------------- 210 (712)
Q Consensus 155 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~-H~----------------------- 210 (712)
|.-|+..+.....+..|+...+.-. +.|+|+.|+..|+....|..|||. |.
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~-----kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~ 513 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFF-----KTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYR 513 (1406)
T ss_pred ccchhhhhhhhcccccceeeeeccc-----ccccCCccchhhhhHHHhhhcccccccccchhHhHhcccccccccccccc
Confidence 4445556666666666666655543 578899999999999999999886 21
Q ss_pred -CCCCcCCccccccCCChhHHHHHHHHhccCCCCCCcccccc--cCCcccccccccccccccccccccCCCcchhhhhcc
Q psy16830 211 -GQKSHMCNLCSARFTCKGALLGHIKVLHEGYKRPQESCQSS--HESKTCSVDEINKENEEGATIRKVKPHEDSKEYMAL 287 (712)
Q Consensus 211 -~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (712)
+.++|.|..|...+..+.+|..|++..-............. .......+.... .++.......-. .....
T Consensus 514 ~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~--P~~ag~~~~ags-----~~pkt 586 (1406)
T KOG1146|consen 514 CPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAV--PEEAGLGPSAGS-----SGPKT 586 (1406)
T ss_pred CCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccC--cccccCCCCCCC-----CCCCC
Confidence 23679999999999999999999976321111000000000 000000000000 000000000000 00112
Q ss_pred CCCccCCcchhhccCHHHHHHHHHhhccCCCCcccCcCccccCChhHHHHHHH
Q psy16830 288 NKPFKCDDCFQCFLSKILLNQHKKQMHFNKGGFTCNSCGLIFSEKKYLRNHMI 340 (712)
Q Consensus 288 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 340 (712)
...+.|.+|++.-.-..+|+.|+...+....+..|-.|+..+.....+..+-+
T Consensus 587 kP~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~ 639 (1406)
T KOG1146|consen 587 KPSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGR 639 (1406)
T ss_pred CCCcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCC
Confidence 34689999999999999999999866655555555555555554444444433
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.58 E-value=1.6e-08 Score=58.59 Aligned_cols=24 Identities=38% Similarity=0.847 Sum_probs=22.8
Q ss_pred HHHhHHhhhcCCCccccccccCCC
Q psy16830 689 LLNKHKLLHFDKDYFSCNYCGKTF 712 (712)
Q Consensus 689 ~L~~H~~~H~~~~~~~C~~C~~~F 712 (712)
+|.+||++|+|++||.|++|+++|
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F 24 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSF 24 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEE
T ss_pred CHHHHhhhcCCCCCCCCCCCcCee
Confidence 589999999999999999999987
No 22
>PHA00616 hypothetical protein
Probab=98.44 E-value=1.2e-07 Score=61.09 Aligned_cols=34 Identities=21% Similarity=0.388 Sum_probs=32.3
Q ss_pred CccCChhhhhhhChHHHHhHHhhhcCCCcccccc
Q psy16830 674 PFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNY 707 (712)
Q Consensus 674 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~ 707 (712)
||+|..||+.|...++|..|++.|+|++++.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 6999999999999999999999999999999874
No 23
>PHA00616 hypothetical protein
Probab=98.33 E-value=1.8e-07 Score=60.36 Aligned_cols=39 Identities=15% Similarity=0.220 Sum_probs=31.8
Q ss_pred CcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCcccc
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASY 609 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f 609 (712)
||+|+.||+.|.+.+.|..|++.|+|++++.|+.=-..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEE
Confidence 588999999999999999999999999998887543333
No 24
>KOG1146|consensus
Probab=98.22 E-value=9.9e-07 Score=98.21 Aligned_cols=80 Identities=24% Similarity=0.351 Sum_probs=61.5
Q ss_pred ccCcccccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhh-----------------hcCCCcc
Q psy16830 538 FSCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFV-----------------HTGQKTH 600 (712)
Q Consensus 538 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~-----------------H~~~k~~ 600 (712)
+.|..|++.|.....+. |+-..| +|.|..|...|.....|..|.+. |...++|
T Consensus 1261 ~~c~~~~~~~~~~~~~~-~l~~~~---------~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~ 1330 (1406)
T KOG1146|consen 1261 GECGAVDELLTPSFGIS-TLDVTH---------RYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH 1330 (1406)
T ss_pred chhhhccccccCcccee-ecccch---------hHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc
Confidence 66888888777766666 554432 68899999999888888888743 2234556
Q ss_pred cCCCCccccCCchhhhhHHhhhccCCCC
Q psy16830 601 LCTMCPASYATIGGLRSHIKIVHHGLKT 628 (712)
Q Consensus 601 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~ 628 (712)
|.+|...|.....|..||++.|+..++
T Consensus 1331 -c~~c~~~~~~~~alqihm~~~~~~~kt 1357 (1406)
T KOG1146|consen 1331 -CLACEVLLSGREALQIHMRSSAHRRKT 1357 (1406)
T ss_pred -chHHHhhcchhHHHHHHHHHhhhcccC
Confidence 999999999999999999998876554
No 25
>PHA00732 hypothetical protein
Probab=98.11 E-value=1.8e-06 Score=65.42 Aligned_cols=45 Identities=33% Similarity=0.569 Sum_probs=32.1
Q ss_pred CcccCcCCcccCCchhHHhHHhh-hcCCCcccCCCCccccCCchhhhhHHhh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFV-HTGQKTHLCTMCPASYATIGGLRSHIKI 621 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~k~~~C~~C~~~f~~~~~L~~H~~~ 621 (712)
||.|..|++.|.+...|..|++. |+ ++.|+.|++.|. .|..|+++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 46777788888777778888773 54 356888888776 57777644
No 26
>PHA00732 hypothetical protein
Probab=98.03 E-value=3.5e-06 Score=63.93 Aligned_cols=45 Identities=31% Similarity=0.689 Sum_probs=29.6
Q ss_pred ccccCCCCcccCChHHHhhhhhh-cCCCCcccCcccccccCChhhHHhhhhh
Q psy16830 509 YFSCNYCGKTFLHKYKLRRHIVI-HSTARPFSCPVCKKQFKRKQDVAGHDRV 559 (712)
Q Consensus 509 ~~~C~~C~k~f~~~~~L~~H~~~-H~~~~p~~C~~C~~~f~~~~~l~~H~~~ 559 (712)
||.|+.||+.|.+..+|..|++. |. ++.|+.|++.|. .|..|++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 46677777777777777777763 43 346777777775 46666654
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.90 E-value=1.3e-05 Score=56.39 Aligned_cols=49 Identities=24% Similarity=0.544 Sum_probs=25.3
Q ss_pred cccCcCCcccCCchhHHhHHh-hhcCC-CcccCCCCccccCCchhhhhHHhhhc
Q psy16830 572 YKCSLCPKAFTYLGALEDHQF-VHTGQ-KTHLCTMCPASYATIGGLRSHIKIVH 623 (712)
Q Consensus 572 ~~C~~C~~~f~~~~~L~~H~~-~H~~~-k~~~C~~C~~~f~~~~~L~~H~~~~H 623 (712)
|.|++|++. .+...|..|.. .|.++ +.+.|++|...+. .+|.+|+...|
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 556666663 33455555644 33332 3456666665433 36666666544
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.79 E-value=1.6e-05 Score=56.03 Aligned_cols=49 Identities=31% Similarity=0.623 Sum_probs=24.6
Q ss_pred eecCCcccccCChHHHHHHHH-HhcCC-CCcCCccccccCCChhHHHHHHHHhc
Q psy16830 187 YKCSLCPKAFTHLAALEDHQY-VHTGQ-KSHMCNLCSARFTCKGALLGHIKVLH 238 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~-~H~~~-~~~~C~~C~~~f~~~~~L~~H~~~~h 238 (712)
|.|++|++ ..+...|..|.. .|..+ +.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 55555555 333455555544 23322 3455666655433 25666665544
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.73 E-value=8.9e-06 Score=45.90 Aligned_cols=23 Identities=35% Similarity=0.677 Sum_probs=21.8
Q ss_pred ccCChhhhhhhChHHHHhHHhhh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLH 697 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H 697 (712)
|.|++|++.|.+++.|..|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999999986
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.60 E-value=5.3e-05 Score=42.67 Aligned_cols=23 Identities=39% Similarity=0.855 Sum_probs=16.2
Q ss_pred eecCCcccccCChHHHHHHHHHh
Q psy16830 187 YKCSLCPKAFTHLAALEDHQYVH 209 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~~H 209 (712)
|+|+.|++.|.++..|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56777777777777777777654
No 31
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.55 E-value=3.7e-05 Score=71.90 Aligned_cols=72 Identities=26% Similarity=0.528 Sum_probs=51.3
Q ss_pred CCCcccCCC--CccccCCchhhhhHHhhhccCCCCCccccCCCCCCccchhhccccccccccccccCCCccccchhccCC
Q psy16830 596 GQKTHLCTM--CPASYATIGGLRSHIKIVHHGLKTQQSCQSSHESKTCSVDEINKENEEGATIRKVKPHEDSKEYMALNK 673 (712)
Q Consensus 596 ~~k~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 673 (712)
++|||+|++ |.+.|.+...|+-|+.--|...+... .. .......+...+|
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~----------------------~p------~p~~~~~F~~~~K 397 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHE----------------------NP------SPEKMNIFSAKDK 397 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCC----------------------CC------CccccccccccCC
Confidence 359999987 99999999999999876554322110 00 0001112245689
Q ss_pred CccCChhhhhhhChHHHHhHHh
Q psy16830 674 PFKCDDCDQRFLTKILLNKHKL 695 (712)
Q Consensus 674 ~~~C~~C~~~f~~~~~L~~H~~ 695 (712)
||.|++|+|++.+...|+-|..
T Consensus 398 PYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 398 PYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred ceeccccchhhccCccceeccc
Confidence 9999999999999999998863
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.31 E-value=0.00012 Score=41.66 Aligned_cols=24 Identities=29% Similarity=0.710 Sum_probs=12.8
Q ss_pred ccCCCCccccCCchhhhhHHhhhc
Q psy16830 600 HLCTMCPASYATIGGLRSHIKIVH 623 (712)
Q Consensus 600 ~~C~~C~~~f~~~~~L~~H~~~~H 623 (712)
|.|++|++.|.+...|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666665533
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.26 E-value=0.00013 Score=59.79 Aligned_cols=75 Identities=19% Similarity=0.346 Sum_probs=20.9
Q ss_pred cCcccccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhH
Q psy16830 539 SCPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSH 618 (712)
Q Consensus 539 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H 618 (712)
+|..|+..|.+...|..|+...|.... + ....+.....+..+.+.-. ...+.|.+|++.|.+...|..|
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---------~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~H 69 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDI---------P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEH 69 (100)
T ss_dssp -----------------------------------------------------------SSEEBSSSS-EESSHHHHHHH
T ss_pred Ccccccccccccccccccccccccccc---------c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHH
Confidence 367777777777777777776665321 1 1122224444444444322 2368899999999888889999
Q ss_pred Hhhhcc
Q psy16830 619 IKIVHH 624 (712)
Q Consensus 619 ~~~~H~ 624 (712)
|+..++
T Consensus 70 m~~~~H 75 (100)
T PF12756_consen 70 MRSKHH 75 (100)
T ss_dssp HHHTTT
T ss_pred HcCccC
Confidence 887543
No 34
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.24 E-value=0.00011 Score=68.84 Aligned_cols=25 Identities=32% Similarity=0.683 Sum_probs=16.7
Q ss_pred CCCcccCCCCccccCCchhhhhHHh
Q psy16830 596 GQKTHLCTMCPASYATIGGLRSHIK 620 (712)
Q Consensus 596 ~~k~~~C~~C~~~f~~~~~L~~H~~ 620 (712)
..|||+|++|++.|.+...|+-|..
T Consensus 395 ~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 395 KDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCCceeccccchhhccCccceeccc
Confidence 3567777777777777777766643
No 35
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.19 E-value=0.00033 Score=39.84 Aligned_cols=23 Identities=30% Similarity=0.805 Sum_probs=13.9
Q ss_pred eecCCcccccCChHHHHHHHHHh
Q psy16830 187 YKCSLCPKAFTHLAALEDHQYVH 209 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~~H 209 (712)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45666666666666666666655
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.17 E-value=0.00011 Score=43.23 Aligned_cols=26 Identities=38% Similarity=0.833 Sum_probs=24.0
Q ss_pred CccCChhhhhhhChHHHHhHHhhhcC
Q psy16830 674 PFKCDDCDQRFLTKILLNKHKLLHFD 699 (712)
Q Consensus 674 ~~~C~~C~~~f~~~~~L~~H~~~H~~ 699 (712)
||.|..|++.|.+..+|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68999999999999999999998853
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.05 E-value=0.00039 Score=40.90 Aligned_cols=25 Identities=36% Similarity=0.836 Sum_probs=17.8
Q ss_pred ceecCCcccccCChHHHHHHHHHhc
Q psy16830 186 SYKCSLCPKAFTHLAALEDHQYVHT 210 (712)
Q Consensus 186 ~~~C~~C~k~f~~~~~L~~H~~~H~ 210 (712)
+|.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4677777777777777777776664
No 38
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.96 E-value=0.00049 Score=56.39 Aligned_cols=24 Identities=29% Similarity=0.719 Sum_probs=13.3
Q ss_pred ceecCCChhhccChHHHHHHHhhh
Q psy16830 378 SYKCSLCPRVFTHLAALEDHQFVH 401 (712)
Q Consensus 378 ~~~C~~C~~~f~~~~~l~~H~~~h 401 (712)
.+.|.+|++.|.+...|..|++.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 355666666666666666665543
No 39
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.81 E-value=0.00084 Score=38.96 Aligned_cols=25 Identities=28% Similarity=0.705 Sum_probs=22.9
Q ss_pred ccCChhhhhhhChHHHHhHHhhhcC
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLHFD 699 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H~~ 699 (712)
|.|..|++.|.+.+.|..|++.|..
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~~ 25 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHXX 25 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhcc
Confidence 6899999999999999999998853
No 40
>KOG2231|consensus
Probab=96.74 E-value=0.0014 Score=70.11 Aligned_cols=20 Identities=10% Similarity=0.047 Sum_probs=11.1
Q ss_pred CCCCCCcchhhhhcchhhHh
Q psy16830 479 NKPFKCDDCDQRFLTKILLN 498 (712)
Q Consensus 479 ~~~~~C~~C~~~f~~~~~L~ 498 (712)
..+|.+..++..+....++.
T Consensus 403 ~~~~~~~~~~~~~~~~~s~~ 422 (669)
T KOG2231|consen 403 AAPYPAPNRGPGENSIKSLR 422 (669)
T ss_pred ccCccccccCcccccccccc
Confidence 34566666666555555443
No 41
>KOG2231|consensus
Probab=96.56 E-value=0.0054 Score=65.69 Aligned_cols=62 Identities=27% Similarity=0.683 Sum_probs=38.2
Q ss_pred ccccccccccCChhhhhccccccccCcccccccceecCCcccccCChHHHHHHHH-HhcCCCCcCCccccc---------
Q psy16830 153 FSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAFTHLAALEDHQY-VHTGQKSHMCNLCSA--------- 222 (712)
Q Consensus 153 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~-~H~~~~~~~C~~C~~--------- 222 (712)
+.|.+|++.|.- . - ..-.|..| -.|.+...|+.|+. .|. .+.|.+|-.
T Consensus 100 ~~C~~C~~~~~~---------~-~--------~~~~~~~c-~~~~s~~~Lk~H~~~~H~---~~~c~lC~~~~kif~~e~ 157 (669)
T KOG2231|consen 100 HSCHICDRRFRA---------L-Y--------NKKECLHC-TEFKSVENLKNHMRDQHK---LHLCSLCLQNLKIFINER 157 (669)
T ss_pred hhcCccccchhh---------h-c--------ccCCCccc-cchhHHHHHHHHHHHhhh---hhccccccccceeeeeee
Confidence 678888887631 0 0 12358888 78888888999985 553 356666643
Q ss_pred cCCChhHHHHHHHH
Q psy16830 223 RFTCKGALLGHIKV 236 (712)
Q Consensus 223 ~f~~~~~L~~H~~~ 236 (712)
..-+...|..|++.
T Consensus 158 k~Yt~~el~~h~~~ 171 (669)
T KOG2231|consen 158 KLYTRAELNLHLMF 171 (669)
T ss_pred ehehHHHHHHHHhc
Confidence 22244556666543
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.34 E-value=0.0027 Score=36.74 Aligned_cols=22 Identities=36% Similarity=0.706 Sum_probs=10.8
Q ss_pred ccCcCCcccCCchhHHhHHhhh
Q psy16830 573 KCSLCPKAFTYLGALEDHQFVH 594 (712)
Q Consensus 573 ~C~~C~~~f~~~~~L~~H~~~H 594 (712)
+|..|++.|.+...|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 4445555555555555554444
No 43
>PRK04860 hypothetical protein; Provisional
Probab=96.26 E-value=0.0022 Score=56.41 Aligned_cols=40 Identities=23% Similarity=0.502 Sum_probs=35.6
Q ss_pred CcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGG 614 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~ 614 (712)
+|.|. |+. ....+.+|.++|+|+++|.|..|+..|.....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEecc
Confidence 69998 997 77889999999999999999999999876544
No 44
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.23 E-value=0.0026 Score=36.09 Aligned_cols=23 Identities=30% Similarity=0.521 Sum_probs=13.5
Q ss_pred ccCCCCccccCCchhhhhHHhhhc
Q psy16830 600 HLCTMCPASYATIGGLRSHIKIVH 623 (712)
Q Consensus 600 ~~C~~C~~~f~~~~~L~~H~~~~H 623 (712)
|+|+.|++... ..+|.+|++++|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56666666666 666666666654
No 45
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.15 E-value=0.0039 Score=41.32 Aligned_cols=31 Identities=26% Similarity=0.523 Sum_probs=12.7
Q ss_pred CCCCcccCcCccccCChhHHHHHHHhcCCCC
Q psy16830 316 NKGGFTCNSCGLIFSEKKYLRNHMISHSTQK 346 (712)
Q Consensus 316 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 346 (712)
.+.|-.|++|+..++...+|++|+..+++.+
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 3344455555555555555555554444433
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.13 E-value=0.0047 Score=40.98 Aligned_cols=32 Identities=28% Similarity=0.602 Sum_probs=17.8
Q ss_pred CCCccccCCCCcccCChHHHhhhhhhcCCCCc
Q psy16830 506 DKDYFSCNYCGKTFLHKYKLRRHIVIHSTARP 537 (712)
Q Consensus 506 ~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p 537 (712)
...|-.|++|+..+.+..+|++|+.++++.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34566777777777777777777766666554
No 47
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.08 E-value=0.009 Score=57.06 Aligned_cols=133 Identities=27% Similarity=0.476 Sum_probs=93.2
Q ss_pred CccCcc--ChhccCCHHHHHHhHhhcCCCCCcCcchh---Hhhc------CHHHHHHHHhhcCCCCCc----cccccccc
Q psy16830 97 PFKCDD--CDQCFLTKILLNKHKLLHTEGHLTCNLCG---QTFR------QKFRLLNHMISHSTEKPF----SCPICKKQ 161 (712)
Q Consensus 97 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~C~~C~---~~f~------~~~~l~~H~~~h~~~~~~----~C~~C~~~ 161 (712)
.|.|+. |..+......|..|.+..+.. +.|.+|- +.|. ++..|+.|...-..+.-| .|..|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~-~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF-VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc-EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 488865 888888888999998764432 5577773 2343 345677776532222122 59999999
Q ss_pred cCChhhhhccccccccCcccccccceecCCcccc-------cCChHHHHHHHHHhcCCCCcCCcc--cc----ccCCChh
Q psy16830 162 FKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKA-------FTHLAALEDHQYVHTGQKSHMCNL--CS----ARFTCKG 228 (712)
Q Consensus 162 f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~-------f~~~~~L~~H~~~H~~~~~~~C~~--C~----~~f~~~~ 228 (712)
|-.-.+|..|+|..|. +|.+|+++ |.+-..|..|.+.- -|.|.+ |- ..|....
T Consensus 230 FYdDDEL~~HcR~~HE----------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~~ 295 (493)
T COG5236 230 FYDDDELRRHCRLRHE----------ACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYHT 295 (493)
T ss_pred ecChHHHHHHHHhhhh----------hhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccHH
Confidence 9999999999998664 37777654 77888899988732 266644 43 3589999
Q ss_pred HHHHHHHHhccCCCCC
Q psy16830 229 ALLGHIKVLHEGYKRP 244 (712)
Q Consensus 229 ~L~~H~~~~h~~~~~~ 244 (712)
.|..|+...|......
T Consensus 296 el~~h~~~~h~~~~~~ 311 (493)
T COG5236 296 ELLEHLTRFHKVNARL 311 (493)
T ss_pred HHHHHHHHHhhccccc
Confidence 9999999888665443
No 48
>PRK04860 hypothetical protein; Provisional
Probab=96.05 E-value=0.0026 Score=55.93 Aligned_cols=36 Identities=22% Similarity=0.791 Sum_probs=33.1
Q ss_pred CCccCChhhhhhhChHHHHhHHhhhcCCCccccccccCCC
Q psy16830 673 KPFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTF 712 (712)
Q Consensus 673 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F 712 (712)
-+|.|. |+. ....+.+|.++|.++++|.|..|+..|
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l 153 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETL 153 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCcee
Confidence 379998 998 889999999999999999999999875
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.97 E-value=0.0066 Score=34.37 Aligned_cols=23 Identities=30% Similarity=0.592 Sum_probs=13.3
Q ss_pred eecCCcccccCChHHHHHHHHHhc
Q psy16830 187 YKCSLCPKAFTHLAALEDHQYVHT 210 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~~H~ 210 (712)
|+|+.|++... ...|.+|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56666666666 666666666553
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.96 E-value=0.0052 Score=35.21 Aligned_cols=23 Identities=26% Similarity=0.674 Sum_probs=17.5
Q ss_pred eecCCcccccCChHHHHHHHHHh
Q psy16830 187 YKCSLCPKAFTHLAALEDHQYVH 209 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~~H 209 (712)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 56888888888888888887654
No 51
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.91 E-value=0.028 Score=60.70 Aligned_cols=139 Identities=19% Similarity=0.281 Sum_probs=75.7
Q ss_pred CcccCcCCcccCCchhHHhHHh--hhcCC--CcccCC--CCccccCCchhhhhHHhhhccCCCCCcc----ccCCCCCCc
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQF--VHTGQ--KTHLCT--MCPASYATIGGLRSHIKIVHHGLKTQQS----CQSSHESKT 640 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~--~H~~~--k~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~~~~~----c~~~~~~~~ 640 (712)
++.|..|...|.....|..|.+ .|+++ +|+.|+ .|++.|.....+..|... |.+..+... +...+....
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILL-HTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccc-ccCCCccccccccCcccccccc
Confidence 5667777777777777777777 67777 777777 577777777777777665 554443322 111110000
Q ss_pred c-----chhhc---cc----cccccccc---cccCCCccccchh--ccCCCccCChhhhhhhChHHHHhHHhhhcCCCcc
Q psy16830 641 C-----SVDEI---NK----ENEEGATI---RKVKPHEDSKEYM--ALNKPFKCDDCDQRFLTKILLNKHKLLHFDKDYF 703 (712)
Q Consensus 641 ~-----~~~~~---~~----~~~~~~~~---~~~~~~~~~~~~~--~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~ 703 (712)
- ..... .. .......+ ............+ .....+.+..|.+.|.....|..|++.|....++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 0 00000 00 00000000 0000000001111 1223578899999999999999999999888887
Q ss_pred ccccccC
Q psy16830 704 SCNYCGK 710 (712)
Q Consensus 704 ~C~~C~~ 710 (712)
.|..+..
T Consensus 448 ~~~~~~~ 454 (467)
T COG5048 448 LCSILKS 454 (467)
T ss_pred eeccccc
Confidence 7766543
No 52
>KOG2785|consensus
Probab=95.74 E-value=0.031 Score=54.92 Aligned_cols=75 Identities=20% Similarity=0.110 Sum_probs=39.8
Q ss_pred CCccCCcchhhccCHHHHHHHHHhhccCCCCcccCcCccccCChhHHHHHHHhcCCCCCccCCccc---ccccChHHHHH
Q psy16830 289 KPFKCDDCFQCFLSKILLNQHKKQMHFNKGGFTCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCG---KRFKRENGVIN 365 (712)
Q Consensus 289 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---~~f~~~~~l~~ 365 (712)
.|-.|-.|++.+.+...-..||...|.---| .= .-..+...|..=+..-- ..-+.|-.|+ +.|.+....+.
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIP----dr-eYL~D~~GLl~YLgeKV-~~~~~CL~CN~~~~~f~sleavr~ 238 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIP----DR-EYLTDEKGLLKYLGEKV-GIGFICLFCNELGRPFSSLEAVRA 238 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCcCC----ch-HhhhchhHHHHHHHHHh-ccCceEEEeccccCcccccHHHHH
Confidence 3466777777777777777777665521100 00 00111222222211111 1236677777 88888888888
Q ss_pred HHHH
Q psy16830 366 HQRL 369 (712)
Q Consensus 366 H~~~ 369 (712)
||..
T Consensus 239 HM~~ 242 (390)
T KOG2785|consen 239 HMRD 242 (390)
T ss_pred HHhh
Confidence 8864
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.69 E-value=0.0033 Score=36.08 Aligned_cols=23 Identities=30% Similarity=0.595 Sum_probs=21.3
Q ss_pred ccCChhhhhhhChHHHHhHHhhh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLH 697 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H 697 (712)
|.|++|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67999999999999999999865
No 54
>KOG2482|consensus
Probab=95.53 E-value=0.027 Score=53.95 Aligned_cols=178 Identities=22% Similarity=0.256 Sum_probs=94.4
Q ss_pred cceecCCccccc-CChHHHHHHHH-HhcC---------------------CCCcCCccccccCCChhHHHHHHHHh-ccC
Q psy16830 185 KSYKCSLCPKAF-THLAALEDHQY-VHTG---------------------QKSHMCNLCSARFTCKGALLGHIKVL-HEG 240 (712)
Q Consensus 185 ~~~~C~~C~k~f-~~~~~L~~H~~-~H~~---------------------~~~~~C~~C~~~f~~~~~L~~H~~~~-h~~ 240 (712)
...+|-+|+..+ -+++....|+. .|.- -..++|-.|.+.|..+..|+.||+.. |..
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrr 222 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRR 222 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCcc
Confidence 356899998888 46677777875 4421 12478999999999999999999864 222
Q ss_pred CCCCCccccc--------ccCCcccccc----ccc-ccccccccccccCCCcchhhhhccCCCccCCcchhhccCHHHHH
Q psy16830 241 YKRPQESCQS--------SHESKTCSVD----EIN-KENEEGATIRKVKPHEDSKEYMALNKPFKCDDCFQCFLSKILLN 307 (712)
Q Consensus 241 ~~~~~~~~~~--------~~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~ 307 (712)
..+....-+. ...+...... ++. ..++..... .+..+..+......|-.|.....+...|.
T Consensus 223 inPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~w------sDw~ed~a~a~~v~CLfC~~~~en~~~l~ 296 (423)
T KOG2482|consen 223 INPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTW------SDWNEDDAEALSVVCLFCTNFYENPVFLF 296 (423)
T ss_pred cCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccch------hhhhcCCCCccceEEEeeccchhhHHHHH
Confidence 2111110000 0000000000 000 000000000 00011111112268999999999999999
Q ss_pred HHHHhhccCCCCcccCcCccccCChhHHHHHHHhcCCCCCccCCcccccccChHHHHHHHHHh
Q psy16830 308 QHKKQMHFNKGGFTCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCGKRFKRENGVINHQRLH 370 (712)
Q Consensus 308 ~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 370 (712)
.||+.+|.-+-.-.=..=+..|-.+-.+..-+|. ......|-.|.-.|.....|..|+..+
T Consensus 297 eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRk--q~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 297 EHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRK--QKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred HHHHHHHHhhHHhhccccccchhhhhhHHHHHHH--HhhccccccccccccCcchhhhhcccc
Confidence 9999888432100000011222223233332222 123456788888999999999998654
No 55
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.18 E-value=0.012 Score=56.17 Aligned_cols=73 Identities=21% Similarity=0.431 Sum_probs=41.8
Q ss_pred cCCcchhhccCHHHHHHHHHhhccCCCCcccCcCcc-------ccCChhHHHHHHHhcCCCCCccCCc--cc----cccc
Q psy16830 292 KCDDCFQCFLSKILLNQHKKQMHFNKGGFTCNSCGL-------IFSEKKYLRNHMISHSTQKPFSCSL--CG----KRFK 358 (712)
Q Consensus 292 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~ 358 (712)
.|..|+..|-+...|.+|++..|. .|.+|++ -|.+...|..|.+.- -|.|.. |. ..|.
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE-----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~ 292 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE-----ACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFP 292 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh-----hhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEec
Confidence 466666666666666666665442 3444443 345556666665532 244432 22 3577
Q ss_pred ChHHHHHHHHHhcCC
Q psy16830 359 RENGVINHQRLHQTK 373 (712)
Q Consensus 359 ~~~~l~~H~~~h~~~ 373 (712)
....|..|+..-|+.
T Consensus 293 ~~~el~~h~~~~h~~ 307 (493)
T COG5236 293 YHTELLEHLTRFHKV 307 (493)
T ss_pred cHHHHHHHHHHHhhc
Confidence 777788887766654
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.07 E-value=0.0059 Score=35.75 Aligned_cols=23 Identities=35% Similarity=0.684 Sum_probs=21.2
Q ss_pred ccCChhhhhhhChHHHHhHHhhh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLH 697 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H 697 (712)
|.|..|++.|.+..+|..|++..
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~sk 24 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKSK 24 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTSH
T ss_pred CCcccCCCCcCCHHHHHHHHccC
Confidence 78999999999999999999763
No 57
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.06 E-value=0.015 Score=62.76 Aligned_cols=147 Identities=24% Similarity=0.355 Sum_probs=108.1
Q ss_pred CCCCCcchhhhhcchhhHhhccc--cccCC--CccccC--CCCcccCChHHHhhhhhhcCCCCcccCcc--cccccCChh
Q psy16830 480 KPFKCDDCDQRFLTKILLNKHKL--LHFDK--DYFSCN--YCGKTFLHKYKLRRHIVIHSTARPFSCPV--CKKQFKRKQ 551 (712)
Q Consensus 480 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~k~f~~~~~L~~H~~~H~~~~p~~C~~--C~~~f~~~~ 551 (712)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.+...+..|..+|.+..++.+.. +...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 47899999999999999999999 89999 999999 89999999999999999999998888755 444444333
Q ss_pred hHHhhhhhcccCccccccCCcccCc--CCcccCCchhHHhHHhhhcCCC--cccCCCCccccCCchhhhhHHhhhccCCC
Q psy16830 552 DVAGHDRVVHQGKKLIVYKSYKCSL--CPKAFTYLGALEDHQFVHTGQK--THLCTMCPASYATIGGLRSHIKIVHHGLK 627 (712)
Q Consensus 552 ~l~~H~~~~h~~~~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~~k--~~~C~~C~~~f~~~~~L~~H~~~~H~~~~ 627 (712)
.-..+..... .........+.+.. |-..+.....+..|...|...+ .+.+..|.+.|.....|..|++. |....
T Consensus 368 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 445 (467)
T COG5048 368 NNEPPQSLQQ-YKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKI-HTNHA 445 (467)
T ss_pred CCCCccchhh-ccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccc-cccCC
Confidence 3211111111 00111112344433 7778888888888888888777 45667899999999999999877 54443
Q ss_pred C
Q psy16830 628 T 628 (712)
Q Consensus 628 ~ 628 (712)
+
T Consensus 446 ~ 446 (467)
T COG5048 446 P 446 (467)
T ss_pred c
Confidence 3
No 58
>KOG2785|consensus
Probab=94.70 E-value=0.038 Score=54.36 Aligned_cols=82 Identities=24% Similarity=0.406 Sum_probs=41.4
Q ss_pred CcccCcCccccCChhHHHHHHHh--cCC---CCCcc-CCcccccccChHHHHHHHHHhcCCCCCCceecCCChhhccChH
Q psy16830 319 GFTCNSCGLIFSEKKYLRNHMIS--HST---QKPFS-CSLCGKRFKRENGVINHQRLHQTKKVYKSYKCSLCPRVFTHLA 392 (712)
Q Consensus 319 ~~~C~~C~~~f~~~~~l~~H~~~--h~~---~~~~~-C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~ 392 (712)
.|.|..|...|.+...-+.|+++ |.- .+.+. =++--..|..+..-..-...-..+....++.|.+|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 58899999999998888888874 310 00000 0111111221111111000000111123677777777777777
Q ss_pred HHHHHHhh
Q psy16830 393 ALEDHQFV 400 (712)
Q Consensus 393 ~l~~H~~~ 400 (712)
....|+..
T Consensus 83 a~~~hl~S 90 (390)
T KOG2785|consen 83 AHENHLKS 90 (390)
T ss_pred hHHHHHHH
Confidence 77777653
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.67 E-value=0.022 Score=33.32 Aligned_cols=21 Identities=33% Similarity=0.801 Sum_probs=13.1
Q ss_pred cccCCCCcccCChHHHhhhhh
Q psy16830 510 FSCNYCGKTFLHKYKLRRHIV 530 (712)
Q Consensus 510 ~~C~~C~k~f~~~~~L~~H~~ 530 (712)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 456666666666666666654
No 60
>KOG4173|consensus
Probab=94.40 E-value=0.013 Score=51.67 Aligned_cols=81 Identities=28% Similarity=0.572 Sum_probs=63.9
Q ss_pred CcccCcc--cccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhh-h---------cCCCcccCC
Q psy16830 536 RPFSCPV--CKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFV-H---------TGQKTHLCT 603 (712)
Q Consensus 536 ~p~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~-H---------~~~k~~~C~ 603 (712)
+.|.|++ |...|.+......|....|++ .|.+|.+.|.+..-|..|+.. | .|...|+|-
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~Cl 148 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN---------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCL 148 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc---------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHH
Confidence 4577876 777888888888888777765 488888888888888888642 2 467789995
Q ss_pred C--CccccCCchhhhhHHhhhccC
Q psy16830 604 M--CPASYATIGGLRSHIKIVHHG 625 (712)
Q Consensus 604 ~--C~~~f~~~~~L~~H~~~~H~~ 625 (712)
+ |+..|.+...-..|+-+.|.-
T Consensus 149 vEgCt~KFkT~r~RkdH~I~~Hk~ 172 (253)
T KOG4173|consen 149 VEGCTEKFKTSRDRKDHMIRMHKY 172 (253)
T ss_pred HHhhhhhhhhhhhhhhHHHHhccC
Confidence 4 999999999999999988853
No 61
>KOG2482|consensus
Probab=92.68 E-value=0.32 Score=46.98 Aligned_cols=138 Identities=23% Similarity=0.320 Sum_probs=83.2
Q ss_pred CCccCCcchhhcc-CHHHHHHHHHhhcc-CC--------------------CCcccCcCccccCChhHHHHHHHh--cCC
Q psy16830 289 KPFKCDDCFQCFL-SKILLNQHKKQMHF-NK--------------------GGFTCNSCGLIFSEKKYLRNHMIS--HST 344 (712)
Q Consensus 289 ~~~~C~~C~~~f~-~~~~l~~H~~~~h~-~~--------------------~~~~C~~C~~~f~~~~~l~~H~~~--h~~ 344 (712)
...+|-.|+..+. .++....|+-..|. +- ..+.|-.|.+.|+++..|+.||+. |..
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrr 222 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRR 222 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCcc
Confidence 3467999987764 56777777655553 11 236799999999999999999975 322
Q ss_pred CCCccCCcccccc------cChHHHHHHHHH----h--cCC----------CCCC--ceecCCChhhccChHHHHHHHhh
Q psy16830 345 QKPFSCSLCGKRF------KRENGVINHQRL----H--QTK----------KVYK--SYKCSLCPRVFTHLAALEDHQFV 400 (712)
Q Consensus 345 ~~~~~C~~C~~~f------~~~~~l~~H~~~----h--~~~----------~~~~--~~~C~~C~~~f~~~~~l~~H~~~ 400 (712)
-.|-.= .-++-| ..++....|... - .++ .... ...|-.|.....+...|..||..
T Consensus 223 inPknr-eYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~ 301 (423)
T KOG2482|consen 223 INPKNR-EYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMKI 301 (423)
T ss_pred cCCCcc-ccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHHH
Confidence 222000 000000 011111111111 0 000 0111 25899999988889999999876
Q ss_pred hcC---------------------------CCcccCCCCccccCChHHHHhHHH
Q psy16830 401 HTG---------------------------QKTRLCTMCPAIFASIGGLRSHMK 427 (712)
Q Consensus 401 h~~---------------------------~~~~~C~~C~~~f~~~~~l~~H~~ 427 (712)
-+. .....|-.|.-.|.....|..||.
T Consensus 302 vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 302 VHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred HHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcc
Confidence 432 122468899999999999999984
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.93 E-value=0.087 Score=30.03 Aligned_cols=21 Identities=29% Similarity=0.623 Sum_probs=18.4
Q ss_pred ccCChhhhhhhChHHHHhHHhh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~ 696 (712)
..|+.||+.| ....|.+|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4799999999 88999999864
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.54 E-value=0.086 Score=33.05 Aligned_cols=24 Identities=25% Similarity=0.427 Sum_probs=21.6
Q ss_pred CCccCChhhhhhhChHHHHhHHhh
Q psy16830 673 KPFKCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 673 k~~~C~~C~~~f~~~~~L~~H~~~ 696 (712)
.+|.|++|+..|.+...+..|+..
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHCh
Confidence 368899999999999999999864
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.17 E-value=0.15 Score=29.04 Aligned_cols=19 Identities=32% Similarity=0.730 Sum_probs=11.6
Q ss_pred ccCcCCcccCCchhHHhHHh
Q psy16830 573 KCSLCPKAFTYLGALEDHQF 592 (712)
Q Consensus 573 ~C~~C~~~f~~~~~L~~H~~ 592 (712)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 466666666 4556666654
No 65
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=89.89 E-value=0.19 Score=30.95 Aligned_cols=11 Identities=27% Similarity=0.911 Sum_probs=6.1
Q ss_pred cccCCCCcccC
Q psy16830 510 FSCNYCGKTFL 520 (712)
Q Consensus 510 ~~C~~C~k~f~ 520 (712)
|.|..||+.+.
T Consensus 2 ~~C~~CGy~y~ 12 (33)
T cd00350 2 YVCPVCGYIYD 12 (33)
T ss_pred EECCCCCCEEC
Confidence 45666665543
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.11 E-value=0.31 Score=30.42 Aligned_cols=23 Identities=30% Similarity=0.580 Sum_probs=17.9
Q ss_pred ceecCCcccccCChHHHHHHHHH
Q psy16830 186 SYKCSLCPKAFTHLAALEDHQYV 208 (712)
Q Consensus 186 ~~~C~~C~k~f~~~~~L~~H~~~ 208 (712)
+|.|.+|++.|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888888888888763
No 67
>KOG4173|consensus
Probab=88.44 E-value=0.25 Score=43.94 Aligned_cols=78 Identities=26% Similarity=0.522 Sum_probs=42.3
Q ss_pred ccCCc--ccccccChHHHHHHHHHhcCCCCCCceecCCChhhccChHHHHHHHhh----------hcCCCcccCC--CCc
Q psy16830 348 FSCSL--CGKRFKRENGVINHQRLHQTKKVYKSYKCSLCPRVFTHLAALEDHQFV----------HTGQKTRLCT--MCP 413 (712)
Q Consensus 348 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~----------h~~~~~~~C~--~C~ 413 (712)
|.|++ |...|........|...-|+. .|..|.+.|++...|..|+.. ..|...|.|- .|+
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHGN------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhcccc------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 44444 444555444444454444432 255555555555555555432 2345556663 477
Q ss_pred cccCChHHHHhHHHHhhc
Q psy16830 414 AIFASIGGLRSHMKIVHH 431 (712)
Q Consensus 414 ~~f~~~~~l~~H~~~~H~ 431 (712)
..|.+....+.|+-..|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 777777777777766664
No 68
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.33 E-value=0.31 Score=30.04 Aligned_cols=10 Identities=30% Similarity=0.873 Sum_probs=5.0
Q ss_pred CcccCCCCcc
Q psy16830 598 KTHLCTMCPA 607 (712)
Q Consensus 598 k~~~C~~C~~ 607 (712)
.++.|++|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4455555543
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=88.16 E-value=0.31 Score=40.34 Aligned_cols=27 Identities=19% Similarity=0.337 Sum_probs=24.7
Q ss_pred CCccC----ChhhhhhhChHHHHhHHhhhcC
Q psy16830 673 KPFKC----DDCDQRFLTKILLNKHKLLHFD 699 (712)
Q Consensus 673 k~~~C----~~C~~~f~~~~~L~~H~~~H~~ 699 (712)
..|.| ..|++.+.+...+.+|++.++|
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 56999 9999999999999999998764
No 70
>KOG2893|consensus
Probab=87.92 E-value=0.13 Score=46.73 Aligned_cols=41 Identities=32% Similarity=0.484 Sum_probs=27.9
Q ss_pred CCcchhhccCHHHHHHHHHhhccCCCCcccCcCccccCChhHHHHH
Q psy16830 293 CDDCFQCFLSKILLNQHKKQMHFNKGGFTCNSCGLIFSEKKYLRNH 338 (712)
Q Consensus 293 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H 338 (712)
|-+|++.|..+..|..|++..| |+|.+|.+...+--.|..|
T Consensus 13 cwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceee
Confidence 6677777777777777776644 6777777666666556655
No 71
>KOG2893|consensus
Probab=86.09 E-value=0.26 Score=44.79 Aligned_cols=51 Identities=29% Similarity=0.481 Sum_probs=40.8
Q ss_pred CCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCchhhhhHHhhhccC
Q psy16830 570 KSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATIGGLRSHIKIVHHG 625 (712)
Q Consensus 570 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~~ 625 (712)
|+| |=+|++.|.+...|.+|++. |-|+|.+|.+...+--.|..|-..+|..
T Consensus 10 kpw-cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhke 60 (341)
T KOG2893|consen 10 KPW-CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHKE 60 (341)
T ss_pred Cce-eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhhh
Confidence 344 88899999988888888764 5588999999888888888887777753
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=85.56 E-value=0.33 Score=32.89 Aligned_cols=31 Identities=16% Similarity=0.337 Sum_probs=26.5
Q ss_pred hhcCCCcccCCCCccccCCchhhhhHHhhhc
Q psy16830 593 VHTGQKTHLCTMCPASYATIGGLRSHIKIVH 623 (712)
Q Consensus 593 ~H~~~k~~~C~~C~~~f~~~~~L~~H~~~~H 623 (712)
.-.||-.++|+-|++.|+...+..+|+...|
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 3457788899999999999999999998866
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=83.54 E-value=2.4 Score=34.48 Aligned_cols=24 Identities=25% Similarity=0.541 Sum_probs=12.6
Q ss_pred CcccCcCCcccCCchhHHhHHhhh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVH 594 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H 594 (712)
.|+|+.|...|-..-++..|...|
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ceeCCCCCCccccccchhhhhhcc
Confidence 455555555555555555554444
No 74
>PF15269 zf-C2H2_7: Zinc-finger
Probab=80.85 E-value=1.4 Score=28.35 Aligned_cols=22 Identities=27% Similarity=0.365 Sum_probs=20.4
Q ss_pred ccCChhhhhhhChHHHHhHHhh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~ 696 (712)
|+|..|.++...++.|-.||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7999999999999999999974
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=80.20 E-value=1.4 Score=36.36 Aligned_cols=23 Identities=22% Similarity=0.404 Sum_probs=15.6
Q ss_pred cceecCCcccccCChHHHHHHHHH
Q psy16830 185 KSYKCSLCPKAFTHLAALEDHQYV 208 (712)
Q Consensus 185 ~~~~C~~C~k~f~~~~~L~~H~~~ 208 (712)
+...|..|+....- +.+..|++.
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~ 32 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRK 32 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHH
Confidence 34667777777655 677777773
No 76
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=79.82 E-value=1.1 Score=27.84 Aligned_cols=10 Identities=30% Similarity=0.913 Sum_probs=5.6
Q ss_pred cccCCCCccc
Q psy16830 510 FSCNYCGKTF 519 (712)
Q Consensus 510 ~~C~~C~k~f 519 (712)
|.|..||+.+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 4566666554
No 77
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.33 E-value=1.1 Score=42.23 Aligned_cols=25 Identities=40% Similarity=0.817 Sum_probs=18.2
Q ss_pred CCccccccccccCChhhhhcccccc
Q psy16830 151 KPFSCPICKKQFKRKHEVSKHDRIV 175 (712)
Q Consensus 151 ~~~~C~~C~~~f~~~~~l~~H~~~~ 175 (712)
+.+.||+|+..|.+..-+....+..
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~ 28 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVI 28 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEe
Confidence 4578888888888877666666543
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=77.76 E-value=0.77 Score=43.19 Aligned_cols=12 Identities=25% Similarity=0.429 Sum_probs=7.2
Q ss_pred cccCcCCcccCC
Q psy16830 572 YKCSLCPKAFTY 583 (712)
Q Consensus 572 ~~C~~C~~~f~~ 583 (712)
..|+.||.+|..
T Consensus 49 ~vCP~CgyA~~~ 60 (214)
T PF09986_consen 49 WVCPHCGYAAFE 60 (214)
T ss_pred EECCCCCCcccc
Confidence 356677666653
No 79
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=76.95 E-value=1.1 Score=30.47 Aligned_cols=28 Identities=21% Similarity=0.494 Sum_probs=14.4
Q ss_pred CCCCcCCccccccCCChhHHHHHHHHhc
Q psy16830 211 GQKSHMCNLCSARFTCKGALLGHIKVLH 238 (712)
Q Consensus 211 ~~~~~~C~~C~~~f~~~~~L~~H~~~~h 238 (712)
|+..+.|+-|++.|....+..+|....|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 3444555555555555555555554444
No 80
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.87 E-value=1.6 Score=35.56 Aligned_cols=27 Identities=33% Similarity=0.837 Sum_probs=21.9
Q ss_pred ccCChhhhhhhChHHHHhHHhhhcCCCccccccccCCC
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTF 712 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F 712 (712)
..|+-||++|... +..|..|+.||..|
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~ 36 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEF 36 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCcc
Confidence 4699999999653 45889999999876
No 81
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=73.47 E-value=2.3 Score=34.70 Aligned_cols=14 Identities=36% Similarity=1.018 Sum_probs=7.0
Q ss_pred CCccccccccccCC
Q psy16830 151 KPFSCPICKKQFKR 164 (712)
Q Consensus 151 ~~~~C~~C~~~f~~ 164 (712)
.|..|+.||..|.-
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 44455555555543
No 82
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=72.47 E-value=2.1 Score=28.52 Aligned_cols=27 Identities=26% Similarity=0.476 Sum_probs=12.1
Q ss_pred CCcccCCCCccccCCc----hhhhhHHhhhc
Q psy16830 597 QKTHLCTMCPASYATI----GGLRSHIKIVH 623 (712)
Q Consensus 597 ~k~~~C~~C~~~f~~~----~~L~~H~~~~H 623 (712)
....+|.+|++.+... +.|.+|++..|
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 3344566666655543 45666665444
No 83
>KOG2186|consensus
Probab=71.64 E-value=1.7 Score=40.42 Aligned_cols=49 Identities=31% Similarity=0.665 Sum_probs=27.8
Q ss_pred CcCcchhHhhcCHHHHHHHHhhcCCCCCccccccccccCChhhhhccccccc
Q psy16830 125 LTCNLCGQTFRQKFRLLNHMISHSTEKPFSCPICKKQFKRKHEVSKHDRIVH 176 (712)
Q Consensus 125 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h 176 (712)
|+|..||..... ..|.+||-..++ .-|.|-.|++.|.. .++..|..-+.
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcc
Confidence 456666665543 345556665555 44666666666665 55556655433
No 84
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=71.00 E-value=2.6 Score=37.28 Aligned_cols=23 Identities=35% Similarity=0.915 Sum_probs=14.3
Q ss_pred ccccCCCCcccCChHHHhhhhhhcCCCCcccCcccc
Q psy16830 509 YFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCK 544 (712)
Q Consensus 509 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~ 544 (712)
.|.|++||+++ -|+.|-.||+||
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCC
Confidence 46677776654 245666677776
No 85
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=70.64 E-value=2.3 Score=24.47 Aligned_cols=9 Identities=44% Similarity=1.058 Sum_probs=5.2
Q ss_pred ccCCCCccc
Q psy16830 511 SCNYCGKTF 519 (712)
Q Consensus 511 ~C~~C~k~f 519 (712)
.|+.||+.|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 366666555
No 86
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=70.44 E-value=5.8 Score=32.36 Aligned_cols=78 Identities=21% Similarity=0.291 Sum_probs=51.0
Q ss_pred CCCCCcchhhhhcchhhHhhccccc-----c-------CCCccccCCCCcccCChHHHhhhhhhcCCCCcccCccccccc
Q psy16830 480 KPFKCDDCDQRFLTKILLNKHKLLH-----F-------DKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQF 547 (712)
Q Consensus 480 ~~~~C~~C~~~f~~~~~L~~H~~~h-----~-------~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f 547 (712)
-|..|+.||.+......|.+-..-= . ....-.|--|...|........ ..-.....|.|+.|...|
T Consensus 14 LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~F 91 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVF 91 (112)
T ss_pred CCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCcccccc--cccccccceeCCCCCCcc
Confidence 4678999999888877776642100 0 1111249999999976432110 001234579999999999
Q ss_pred CChhhHHhhhhh
Q psy16830 548 KRKQDVAGHDRV 559 (712)
Q Consensus 548 ~~~~~l~~H~~~ 559 (712)
-..-++-.|...
T Consensus 92 C~dCD~fiHe~L 103 (112)
T TIGR00622 92 CVDCDVFVHESL 103 (112)
T ss_pred ccccchhhhhhc
Confidence 988888888765
No 87
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=69.50 E-value=3.1 Score=29.09 Aligned_cols=8 Identities=50% Similarity=1.493 Sum_probs=3.5
Q ss_pred cccCCCCc
Q psy16830 510 FSCNYCGK 517 (712)
Q Consensus 510 ~~C~~C~k 517 (712)
|.|+.||.
T Consensus 28 F~CPnCGe 35 (61)
T COG2888 28 FPCPNCGE 35 (61)
T ss_pred eeCCCCCc
Confidence 44444443
No 88
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=66.74 E-value=4.2 Score=25.61 Aligned_cols=33 Identities=27% Similarity=0.786 Sum_probs=18.6
Q ss_pred ccccccccccCChhhhhccccccccCcccccccceecCCccccc
Q psy16830 153 FSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAF 196 (712)
Q Consensus 153 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f 196 (712)
..|+.|+..|.-..+...= .+ ...+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip~-----~g------~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIPP-----KG------RKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCCC-----CC------cEEECCCCCCEe
Confidence 3577777777655543221 11 246777777666
No 89
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=66.54 E-value=1.9 Score=37.80 Aligned_cols=14 Identities=21% Similarity=0.596 Sum_probs=6.8
Q ss_pred ccCCCCccccCCch
Q psy16830 600 HLCTMCPASYATIG 613 (712)
Q Consensus 600 ~~C~~C~~~f~~~~ 613 (712)
|+|+.||+.|.+..
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 44555555554443
No 90
>PHA00626 hypothetical protein
Probab=65.87 E-value=3.4 Score=28.37 Aligned_cols=13 Identities=23% Similarity=0.496 Sum_probs=8.3
Q ss_pred ceecCCcccccCC
Q psy16830 186 SYKCSLCPKAFTH 198 (712)
Q Consensus 186 ~~~C~~C~k~f~~ 198 (712)
.|+|+.||+.|+.
T Consensus 23 rYkCkdCGY~ft~ 35 (59)
T PHA00626 23 DYVCCDCGYNDSK 35 (59)
T ss_pred ceEcCCCCCeech
Confidence 4666666666654
No 91
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=65.78 E-value=3.4 Score=26.30 Aligned_cols=14 Identities=36% Similarity=0.925 Sum_probs=7.9
Q ss_pred ccccccccccCChh
Q psy16830 153 FSCPICKKQFKRKH 166 (712)
Q Consensus 153 ~~C~~C~~~f~~~~ 166 (712)
+.|+.|+..|.-..
T Consensus 3 ~~CP~C~~~~~v~~ 16 (38)
T TIGR02098 3 IQCPNCKTSFRVVD 16 (38)
T ss_pred EECCCCCCEEEeCH
Confidence 45666666665443
No 92
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=62.67 E-value=5.2 Score=25.36 Aligned_cols=9 Identities=33% Similarity=1.113 Sum_probs=3.7
Q ss_pred ccCcCCccc
Q psy16830 573 KCSLCPKAF 581 (712)
Q Consensus 573 ~C~~C~~~f 581 (712)
+|+.|+..|
T Consensus 27 rC~~C~~~f 35 (37)
T PF13719_consen 27 RCPKCGHVF 35 (37)
T ss_pred ECCCCCcEe
Confidence 344444333
No 93
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=61.97 E-value=5.6 Score=27.25 Aligned_cols=22 Identities=27% Similarity=0.649 Sum_probs=10.8
Q ss_pred CCCCccccCCc-----hhhhhHHhhhc
Q psy16830 602 CTMCPASYATI-----GGLRSHIKIVH 623 (712)
Q Consensus 602 C~~C~~~f~~~-----~~L~~H~~~~H 623 (712)
|.+|++.+... +.|.+|++..|
T Consensus 21 C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 21 CKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred ecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 45555544333 35666655434
No 94
>KOG2186|consensus
Probab=61.27 E-value=3 Score=38.94 Aligned_cols=51 Identities=31% Similarity=0.577 Sum_probs=36.5
Q ss_pred CCCcchhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCCC
Q psy16830 482 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHSTA 535 (712)
Q Consensus 482 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~ 535 (712)
|.|..||.....+ .+.+|+....+ ..|.|-.|++.|.. ..+..|..--+..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEa 54 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEA 54 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchH
Confidence 6788888776544 45668777666 77888888888876 7777887654443
No 95
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=61.23 E-value=5.4 Score=26.46 Aligned_cols=25 Identities=20% Similarity=0.489 Sum_probs=13.5
Q ss_pred cccCcCCcccCCchhHHhHHhhhcCCCcccCCCCcc
Q psy16830 572 YKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPA 607 (712)
Q Consensus 572 ~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~ 607 (712)
|.|..||..|... ...+.+|+.||.
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~ 27 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGY 27 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCc
Confidence 5666666655422 224456666664
No 96
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=60.98 E-value=1.9 Score=32.09 Aligned_cols=13 Identities=31% Similarity=0.943 Sum_probs=6.3
Q ss_pred CccCC--hhhhhhhC
Q psy16830 674 PFKCD--DCDQRFLT 686 (712)
Q Consensus 674 ~~~C~--~C~~~f~~ 686 (712)
-++|. .||.+|.+
T Consensus 27 Y~qC~N~eCg~tF~t 41 (72)
T PRK09678 27 YHQCQNVNCSATFIT 41 (72)
T ss_pred eeecCCCCCCCEEEE
Confidence 34554 55555543
No 97
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=60.72 E-value=3.8 Score=35.93 Aligned_cols=15 Identities=33% Similarity=0.691 Sum_probs=9.8
Q ss_pred ceecCCcccccCChH
Q psy16830 186 SYKCSLCPKAFTHLA 200 (712)
Q Consensus 186 ~~~C~~C~k~f~~~~ 200 (712)
.++|+.||++|.+..
T Consensus 28 ~~~c~~c~~~f~~~e 42 (154)
T PRK00464 28 RRECLACGKRFTTFE 42 (154)
T ss_pred eeeccccCCcceEeE
Confidence 467777777776543
No 98
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=60.14 E-value=3.5 Score=27.66 Aligned_cols=12 Identities=33% Similarity=0.949 Sum_probs=6.7
Q ss_pred CcccCcCCcccC
Q psy16830 571 SYKCSLCPKAFT 582 (712)
Q Consensus 571 ~~~C~~C~~~f~ 582 (712)
.|+|+-||..|.
T Consensus 3 ~y~C~~CG~~~~ 14 (46)
T PRK00398 3 EYKCARCGREVE 14 (46)
T ss_pred EEECCCCCCEEE
Confidence 355666665554
No 99
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=60.14 E-value=5.5 Score=22.92 Aligned_cols=20 Identities=25% Similarity=0.424 Sum_probs=17.3
Q ss_pred cCChhhhhhhChHHHHhHHhh
Q psy16830 676 KCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 676 ~C~~C~~~f~~~~~L~~H~~~ 696 (712)
.|++|++.+ ....++.|+..
T Consensus 3 ~CPiC~~~v-~~~~in~HLD~ 22 (26)
T smart00734 3 QCPVCFREV-PENLINSHLDS 22 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHHH
Confidence 699999998 77899999863
No 100
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=59.48 E-value=5.7 Score=27.95 Aligned_cols=9 Identities=44% Similarity=1.235 Sum_probs=4.3
Q ss_pred cccCCCCcc
Q psy16830 510 FSCNYCGKT 518 (712)
Q Consensus 510 ~~C~~C~k~ 518 (712)
|.|+.||..
T Consensus 26 F~CPnCG~~ 34 (59)
T PRK14890 26 FLCPNCGEV 34 (59)
T ss_pred eeCCCCCCe
Confidence 445555443
No 101
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=57.95 E-value=4.7 Score=35.36 Aligned_cols=13 Identities=38% Similarity=1.068 Sum_probs=7.0
Q ss_pred CcccCcCCcccCC
Q psy16830 571 SYKCSLCPKAFTY 583 (712)
Q Consensus 571 ~~~C~~C~~~f~~ 583 (712)
-|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3555555555553
No 102
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=57.92 E-value=4.2 Score=26.35 Aligned_cols=11 Identities=36% Similarity=1.096 Sum_probs=6.3
Q ss_pred cccCcCCcccC
Q psy16830 572 YKCSLCPKAFT 582 (712)
Q Consensus 572 ~~C~~C~~~f~ 582 (712)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 55666665554
No 103
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=57.52 E-value=7 Score=32.45 Aligned_cols=14 Identities=29% Similarity=0.520 Sum_probs=7.2
Q ss_pred CCccccccccccCC
Q psy16830 151 KPFSCPICKKQFKR 164 (712)
Q Consensus 151 ~~~~C~~C~~~f~~ 164 (712)
.|..|+.||..|..
T Consensus 25 ~p~vcP~cg~~~~~ 38 (129)
T TIGR02300 25 RPAVSPYTGEQFPP 38 (129)
T ss_pred CCccCCCcCCccCc
Confidence 45555555555543
No 104
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=56.29 E-value=1.9 Score=28.33 Aligned_cols=29 Identities=24% Similarity=0.729 Sum_probs=20.2
Q ss_pred ccCChhhhhhhChHHHHhHHhhhcCCCccccccccC
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGK 710 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 710 (712)
|.|..||..|..... +.. ..+-.|+.||.
T Consensus 6 y~C~~Cg~~fe~~~~------~~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 6 YRCEECGHEFEVLQS------ISE-DDPVPCPECGS 34 (42)
T ss_pred EEeCCCCCEEEEEEE------cCC-CCCCcCCCCCC
Confidence 889999988854332 232 56778999875
No 105
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=56.22 E-value=4.5 Score=27.96 Aligned_cols=12 Identities=33% Similarity=1.054 Sum_probs=7.3
Q ss_pred cccCcCCcccCC
Q psy16830 572 YKCSLCPKAFTY 583 (712)
Q Consensus 572 ~~C~~C~~~f~~ 583 (712)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 566666666653
No 106
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=55.16 E-value=8.6 Score=34.17 Aligned_cols=33 Identities=21% Similarity=0.511 Sum_probs=20.0
Q ss_pred cCCCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccc
Q psy16830 505 FDKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQ 546 (712)
Q Consensus 505 ~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~ 546 (712)
.+...|.|+.|+..|+...++. --|.||.||..
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 3445566777777666666553 14666666654
No 107
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=55.11 E-value=5.8 Score=26.83 Aligned_cols=29 Identities=24% Similarity=0.617 Sum_probs=18.6
Q ss_pred cceecCCcccccCChHHHHHHHHHhcCCCCcCCcccccc
Q psy16830 185 KSYKCSLCPKAFTHLAALEDHQYVHTGQKSHMCNLCSAR 223 (712)
Q Consensus 185 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~ 223 (712)
..|+|..|++.|... .......|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 358888888887211 13456778888764
No 108
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=53.48 E-value=8.6 Score=34.10 Aligned_cols=23 Identities=30% Similarity=0.915 Sum_probs=13.5
Q ss_pred CcccCcCccccCChhHHHHHHHhcCCCCCccCCccc
Q psy16830 319 GFTCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCG 354 (712)
Q Consensus 319 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 354 (712)
.|.|+.||..+. ++.|-+||+||
T Consensus 134 ~~vC~vCGy~~~-------------ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTHE-------------GEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCccc-------------CCCCCcCCCCC
Confidence 466666666443 34566666666
No 109
>KOG2071|consensus
Probab=53.30 E-value=17 Score=38.90 Aligned_cols=23 Identities=22% Similarity=0.463 Sum_probs=12.4
Q ss_pred eecCCcccccCChHHHHHHHHHh
Q psy16830 187 YKCSLCPKAFTHLAALEDHQYVH 209 (712)
Q Consensus 187 ~~C~~C~k~f~~~~~L~~H~~~H 209 (712)
-+|..||..|........||..|
T Consensus 419 nqC~~CG~R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 419 NQCKSCGLRFDDSEERSKHMDIH 441 (579)
T ss_pred chhcccccccccchhhhhHhhhh
Confidence 45555555555555555555544
No 110
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=52.59 E-value=13 Score=35.73 Aligned_cols=48 Identities=25% Similarity=0.444 Sum_probs=26.7
Q ss_pred CcccccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhhhc
Q psy16830 540 CPVCKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHT 595 (712)
Q Consensus 540 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~ 595 (712)
|-.|.-.|.... ++..........|+|+.|...|-..-..-.|...|.
T Consensus 365 Cf~CQ~~fp~~~--------~~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 365 CFVCQGPFPKPP--------VSPFDESTSSGRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred ceeccCCCCCCC--------CCcccccccccceechhhhhhhhhhhHHHHHHHHhh
Confidence 666666665322 122222233346777777777776666666666553
No 111
>PF15269 zf-C2H2_7: Zinc-finger
Probab=52.45 E-value=8.3 Score=24.99 Aligned_cols=23 Identities=30% Similarity=0.465 Sum_probs=19.8
Q ss_pred CcccCcCCcccCCchhHHhHHhh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFV 593 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~ 593 (712)
.|+|-.|+.+...++.|..||+-
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 37899999999999999999874
No 112
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=52.44 E-value=9 Score=31.82 Aligned_cols=30 Identities=17% Similarity=0.361 Sum_probs=15.8
Q ss_pred CCCcchhhhhcchhhHhhccccccCCCccccCCCCcccCCh
Q psy16830 482 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHK 522 (712)
Q Consensus 482 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~ 522 (712)
..|+.||+.|... +..|..|++||..|.-.
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 3566666665431 33455566666655443
No 113
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=52.36 E-value=4.3 Score=30.22 Aligned_cols=42 Identities=24% Similarity=0.473 Sum_probs=23.2
Q ss_pred cccCcCCcccCCchhHHhHHhhhcCCCcccCC--CCccccCCchhh
Q psy16830 572 YKCSLCPKAFTYLGALEDHQFVHTGQKTHLCT--MCPASYATIGGL 615 (712)
Q Consensus 572 ~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~--~C~~~f~~~~~L 615 (712)
+.|+.||.......+-..... ..++-++|. .||..|.+...+
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEE
Confidence 457777766543333222222 345566776 788777765543
No 114
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=51.66 E-value=6.7 Score=33.33 Aligned_cols=21 Identities=29% Similarity=0.501 Sum_probs=9.0
Q ss_pred CccCccChhccCCHHHHHHhHhhc
Q psy16830 97 PFKCDDCDQCFLTKILLNKHKLLH 120 (712)
Q Consensus 97 ~~~C~~C~~~f~~~~~l~~H~~~h 120 (712)
...|-+||+.|... .+|++.|
T Consensus 72 ~i~clecGk~~k~L---krHL~~~ 92 (132)
T PF05443_consen 72 YIICLECGKKFKTL---KRHLRTH 92 (132)
T ss_dssp -EE-TBT--EESBH---HHHHHHT
T ss_pred eeEEccCCcccchH---HHHHHHc
Confidence 45566666666542 4555544
No 115
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=50.43 E-value=9.8 Score=33.82 Aligned_cols=42 Identities=17% Similarity=0.380 Sum_probs=24.4
Q ss_pred HHHHHhhcCCCCCccccccccccCChhhhhccccccccCcccccccceecCCccccc
Q psy16830 140 LLNHMISHSTEKPFSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAF 196 (712)
Q Consensus 140 l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f 196 (712)
|..=+....+..-|.|+.|+..|+...++. -.|.|+.||...
T Consensus 97 lk~~l~~e~~~~~Y~Cp~c~~r~tf~eA~~---------------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 97 LREKLEFETNNMFFICPNMCVRFTFNEAME---------------LNFTCPRCGAML 138 (158)
T ss_pred HHHHHhhccCCCeEECCCCCcEeeHHHHHH---------------cCCcCCCCCCEe
Confidence 333344444555677777777776666652 136677776654
No 116
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=50.42 E-value=10 Score=23.16 Aligned_cols=10 Identities=20% Similarity=0.743 Sum_probs=4.7
Q ss_pred cccccccccc
Q psy16830 153 FSCPICKKQF 162 (712)
Q Consensus 153 ~~C~~C~~~f 162 (712)
|.|..||..+
T Consensus 1 Y~C~~Cg~~~ 10 (32)
T PF03604_consen 1 YICGECGAEV 10 (32)
T ss_dssp EBESSSSSSE
T ss_pred CCCCcCCCee
Confidence 3445555443
No 117
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=49.32 E-value=14 Score=32.39 Aligned_cols=37 Identities=19% Similarity=0.637 Sum_probs=18.5
Q ss_pred CCCccccCCCCcccCChHHHhhhhhhcCCCCcccCcccccc
Q psy16830 506 DKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQ 546 (712)
Q Consensus 506 ~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~ 546 (712)
+...|.|+.|+..|.....+.. . . ....|.||.||..
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~-~--d-~~~~f~Cp~Cg~~ 132 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL-L--D-MDGTFTCPRCGEE 132 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh-c--C-CCCcEECCCCCCE
Confidence 3445667777666664433322 0 0 1223666666654
No 118
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=48.85 E-value=12 Score=33.88 Aligned_cols=33 Identities=24% Similarity=0.668 Sum_probs=20.2
Q ss_pred CCCccccCCCCcccCChHHHhhhhhhcCCCCcccCccccccc
Q psy16830 506 DKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKKQF 547 (712)
Q Consensus 506 ~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~~f 547 (712)
+...|.|+.|+..|+...++. --|.|+.||...
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 345677777777776665542 246777776543
No 119
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=48.22 E-value=13 Score=41.75 Aligned_cols=9 Identities=33% Similarity=0.741 Sum_probs=5.5
Q ss_pred ccCCCCccc
Q psy16830 511 SCNYCGKTF 519 (712)
Q Consensus 511 ~C~~C~k~f 519 (712)
.|..||+.+
T Consensus 437 ~C~~Cg~v~ 445 (730)
T COG1198 437 LCRDCGYIA 445 (730)
T ss_pred ecccCCCcc
Confidence 466666654
No 120
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=48.08 E-value=8.5 Score=27.49 Aligned_cols=21 Identities=29% Similarity=0.640 Sum_probs=9.0
Q ss_pred hHHHhhhhhhcCCCCcccCcc
Q psy16830 522 KYKLRRHIVIHSTARPFSCPV 542 (712)
Q Consensus 522 ~~~L~~H~~~H~~~~p~~C~~ 542 (712)
+..|..|+...-..++..|+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHHccCCCCcEECCC
Confidence 344555554333344444444
No 121
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=47.65 E-value=10 Score=34.43 Aligned_cols=16 Identities=31% Similarity=0.658 Sum_probs=7.7
Q ss_pred CCccccccccccCChh
Q psy16830 151 KPFSCPICKKQFKRKH 166 (712)
Q Consensus 151 ~~~~C~~C~~~f~~~~ 166 (712)
.-|.|+.|+..|+...
T Consensus 116 ~~Y~Cp~C~~rytf~e 131 (178)
T PRK06266 116 MFFFCPNCHIRFTFDE 131 (178)
T ss_pred CEEECCCCCcEEeHHH
Confidence 3455555555544433
No 122
>PF12907 zf-met2: Zinc-binding
Probab=47.61 E-value=15 Score=23.70 Aligned_cols=32 Identities=25% Similarity=0.405 Sum_probs=22.2
Q ss_pred ccCCcchhhc---cCHHHHHHHHHhhccCCCCccc
Q psy16830 291 FKCDDCFQCF---LSKILLNQHKKQMHFNKGGFTC 322 (712)
Q Consensus 291 ~~C~~C~~~f---~~~~~l~~H~~~~h~~~~~~~C 322 (712)
+.|.+|..+| .+...|..|....|.....-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 5788888655 5567788888877776544344
No 123
>KOG4124|consensus
Probab=47.29 E-value=5.5 Score=38.99 Aligned_cols=72 Identities=26% Similarity=0.467 Sum_probs=43.0
Q ss_pred CCcccCcc--cccccCChhhHHhhhhhcccCccccccCCcccCcCCcccCCchhHHhHHhhhcCCCcccCCCCccccCCc
Q psy16830 535 ARPFSCPV--CKKQFKRKQDVAGHDRVVHQGKKLIVYKSYKCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPASYATI 612 (712)
Q Consensus 535 ~~p~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~f~~~ 612 (712)
.+||+|++ |.+.+.+...|+.|...-|...-.. .++ .-.-|...-.-.|+|+|++|.+++...
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~--------------~s~-~~~ph~~~~~~nk~~r~~i~~~~~k~~ 411 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITT--------------PTP-APIPHQGFVVENKPYRCEVCSKRYKNL 411 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCC--------------CCC-CCCCcceeeeccCcccChhhhhhhccC
Confidence 46778854 8888777777777755433221100 000 111122222346899999999999988
Q ss_pred hhhhhHHhh
Q psy16830 613 GGLRSHIKI 621 (712)
Q Consensus 613 ~~L~~H~~~ 621 (712)
..|.-|...
T Consensus 412 ~~l~~~~~~ 420 (442)
T KOG4124|consen 412 NGLKYHRTH 420 (442)
T ss_pred CCCCceeeh
Confidence 888877554
No 124
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=45.82 E-value=14 Score=24.89 Aligned_cols=13 Identities=23% Similarity=1.030 Sum_probs=8.5
Q ss_pred ccccccccccCCh
Q psy16830 153 FSCPICKKQFKRK 165 (712)
Q Consensus 153 ~~C~~C~~~f~~~ 165 (712)
|+|..|+..+...
T Consensus 2 y~C~~CgyvYd~~ 14 (47)
T PF00301_consen 2 YQCPVCGYVYDPE 14 (47)
T ss_dssp EEETTTSBEEETT
T ss_pred cCCCCCCEEEcCC
Confidence 5677777766543
No 125
>PRK04023 DNA polymerase II large subunit; Validated
Probab=45.72 E-value=18 Score=41.36 Aligned_cols=12 Identities=25% Similarity=0.542 Sum_probs=7.4
Q ss_pred CcccCcCCcccC
Q psy16830 571 SYKCSLCPKAFT 582 (712)
Q Consensus 571 ~~~C~~C~~~f~ 582 (712)
+|.|+.||..-.
T Consensus 663 ~y~CPKCG~El~ 674 (1121)
T PRK04023 663 EDECEKCGREPT 674 (1121)
T ss_pred CCcCCCCCCCCC
Confidence 466777776543
No 126
>KOG4124|consensus
Probab=44.37 E-value=8.9 Score=37.61 Aligned_cols=71 Identities=28% Similarity=0.545 Sum_probs=43.8
Q ss_pred CCcccCCC--CccccCCchhhhhHHhhhccCCCCCccccCCCCCCccchhhccccccccccccccCCCccccchhccCCC
Q psy16830 597 QKTHLCTM--CPASYATIGGLRSHIKIVHHGLKTQQSCQSSHESKTCSVDEINKENEEGATIRKVKPHEDSKEYMALNKP 674 (712)
Q Consensus 597 ~k~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~ 674 (712)
.++|+|.+ |.+.+.+..+|+.|-..-|...-... .. .. .++ .-....+|+
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~--------------------s~-~~----~ph---~~~~~~nk~ 398 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTP--------------------TP-AP----IPH---QGFVVENKP 398 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCC--------------------CC-CC----CCc---ceeeeccCc
Confidence 46788854 88888888888888665442211000 00 00 011 112456899
Q ss_pred ccCChhhhhhhChHHHHhHHh
Q psy16830 675 FKCDDCDQRFLTKILLNKHKL 695 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~ 695 (712)
|.|++|.+++.....|.-|+.
T Consensus 399 ~r~~i~~~~~k~~~~l~~~~~ 419 (442)
T KOG4124|consen 399 YRCEVCSKRYKNLNGLKYHRT 419 (442)
T ss_pred ccChhhhhhhccCCCCCceee
Confidence 999999999987766665543
No 127
>KOG2807|consensus
Probab=43.37 E-value=32 Score=33.75 Aligned_cols=25 Identities=24% Similarity=0.523 Sum_probs=17.7
Q ss_pred CcccCcCCcccCCchhHHhHHhhhc
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVHT 595 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H~ 595 (712)
.|+|..|...|-.--+...|...|.
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh~ 369 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLHN 369 (378)
T ss_pred cEEchhccceeeccchHHHHhhhhc
Confidence 5788888888776666666766653
No 128
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=42.09 E-value=16 Score=24.98 Aligned_cols=12 Identities=25% Similarity=0.861 Sum_probs=7.1
Q ss_pred ccccccccccCC
Q psy16830 153 FSCPICKKQFKR 164 (712)
Q Consensus 153 ~~C~~C~~~f~~ 164 (712)
|+|..||..+..
T Consensus 2 y~C~~CgyiYd~ 13 (50)
T cd00730 2 YECRICGYIYDP 13 (50)
T ss_pred cCCCCCCeEECC
Confidence 556666666543
No 129
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=41.72 E-value=21 Score=23.95 Aligned_cols=21 Identities=24% Similarity=0.735 Sum_probs=10.8
Q ss_pred HHHHHhhccCCCCcccCcCccc
Q psy16830 307 NQHKKQMHFNKGGFTCNSCGLI 328 (712)
Q Consensus 307 ~~H~~~~h~~~~~~~C~~C~~~ 328 (712)
..++..+-..++ +.|+.||..
T Consensus 7 ~~~l~~~RW~~g-~~CP~Cg~~ 27 (46)
T PF12760_consen 7 REYLEEIRWPDG-FVCPHCGST 27 (46)
T ss_pred HHHHHHhcCCCC-CCCCCCCCe
Confidence 334444334443 567777753
No 130
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=41.61 E-value=50 Score=32.02 Aligned_cols=49 Identities=20% Similarity=0.429 Sum_probs=30.7
Q ss_pred ccccccccccCChhhhhccccccccCcccccccceecCCcccccCChHHHHHHHHHh
Q psy16830 153 FSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAFTHLAALEDHQYVH 209 (712)
Q Consensus 153 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~H 209 (712)
-.|-.|.-.|+.... +...+......|+|+.|...|-..-..-.|...|
T Consensus 363 ~~Cf~CQ~~fp~~~~--------~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 363 THCFVCQGPFPKPPV--------SPFDESTSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ccceeccCCCCCCCC--------CcccccccccceechhhhhhhhhhhHHHHHHHHh
Confidence 347777777765321 2222222224688888888887777777777766
No 131
>KOG4167|consensus
Probab=41.10 E-value=10 Score=41.25 Aligned_cols=26 Identities=31% Similarity=0.575 Sum_probs=23.1
Q ss_pred CCCcCcchhHhhcCHHHHHHHHhhcC
Q psy16830 123 GHLTCNLCGQTFRQKFRLLNHMISHS 148 (712)
Q Consensus 123 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 148 (712)
+.|.|.+|++.|....++..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 45889999999999999999999984
No 132
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=39.98 E-value=23 Score=34.91 Aligned_cols=25 Identities=24% Similarity=0.436 Sum_probs=20.7
Q ss_pred ceecCCcccccCChHHHHHHHH--Hhc
Q psy16830 186 SYKCSLCPKAFTHLAALEDHQY--VHT 210 (712)
Q Consensus 186 ~~~C~~C~k~f~~~~~L~~H~~--~H~ 210 (712)
.+-|..|++.|....-+..|+. .|.
T Consensus 238 ~~YC~~C~r~f~~~~VFe~Hl~gK~H~ 264 (470)
T COG5188 238 KVYCVKCGREFSRSKVFEYHLEGKRHC 264 (470)
T ss_pred ceeeHhhhhHhhhhHHHHHHHhhhhhh
Confidence 4679999999999999999975 454
No 133
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=39.00 E-value=11 Score=30.85 Aligned_cols=37 Identities=19% Similarity=0.371 Sum_probs=20.8
Q ss_pred CcccCcCCcccCCchhHHhHHhhhcCC---CcccCCCCccccC
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVHTGQ---KTHLCTMCPASYA 610 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H~~~---k~~~C~~C~~~f~ 610 (712)
..+|+.||..-.....++ +|. .++ .-|.|..|+..++
T Consensus 62 ~~~Cp~Cg~~~a~f~~~Q--~Rs-adE~~T~fy~C~~C~~~w~ 101 (104)
T TIGR01384 62 RVECPKCGHKEAYYWLLQ--TRR-ADEPETRFYKCTKCGYVWR 101 (104)
T ss_pred cCCCCCCCCCeeEEEEec--cCC-CCCCcEEEEEeCCCCCeeE
Confidence 367888886544333332 111 111 2388988887664
No 134
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=38.90 E-value=13 Score=22.22 Aligned_cols=24 Identities=33% Similarity=0.861 Sum_probs=8.7
Q ss_pred ccCcCCcccCCchhHHhHHhhhcCCCcccCCCCcc
Q psy16830 573 KCSLCPKAFTYLGALEDHQFVHTGQKTHLCTMCPA 607 (712)
Q Consensus 573 ~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~ 607 (712)
+|+.|+..+.. .+.-.|.|+.|+.
T Consensus 4 ~Cp~C~se~~y-----------~D~~~~vCp~C~~ 27 (30)
T PF08274_consen 4 KCPLCGSEYTY-----------EDGELLVCPECGH 27 (30)
T ss_dssp --TTT-----E-----------E-SSSEEETTTTE
T ss_pred CCCCCCCccee-----------ccCCEEeCCcccc
Confidence 36666655544 2334566666654
No 135
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=38.36 E-value=20 Score=27.57 Aligned_cols=11 Identities=36% Similarity=1.005 Sum_probs=5.6
Q ss_pred cccCcCCcccC
Q psy16830 572 YKCSLCPKAFT 582 (712)
Q Consensus 572 ~~C~~C~~~f~ 582 (712)
++|..||..|.
T Consensus 54 W~C~kCg~~fA 64 (89)
T COG1997 54 WKCRKCGAKFA 64 (89)
T ss_pred EEcCCCCCeec
Confidence 45555555553
No 136
>PF14353 CpXC: CpXC protein
Probab=37.25 E-value=3.3 Score=35.41 Aligned_cols=8 Identities=50% Similarity=1.414 Sum_probs=3.7
Q ss_pred cCCCCccc
Q psy16830 512 CNYCGKTF 519 (712)
Q Consensus 512 C~~C~k~f 519 (712)
|+.|+..|
T Consensus 4 CP~C~~~~ 11 (128)
T PF14353_consen 4 CPHCGHEF 11 (128)
T ss_pred CCCCCCee
Confidence 44444444
No 137
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=36.05 E-value=12 Score=27.76 Aligned_cols=26 Identities=23% Similarity=0.631 Sum_probs=12.2
Q ss_pred ccCChhhhhhhChHHHHhHHhhhcCCCccccccccCCC
Q psy16830 675 FKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTF 712 (712)
Q Consensus 675 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F 712 (712)
-.|..|++.|.. ..+.+.|+.||..|
T Consensus 10 ~~C~~C~~~F~~------------~~rrhhCr~CG~~v 35 (69)
T PF01363_consen 10 SNCMICGKKFSL------------FRRRHHCRNCGRVV 35 (69)
T ss_dssp SB-TTT--B-BS------------SS-EEE-TTT--EE
T ss_pred CcCcCcCCcCCC------------ceeeEccCCCCCEE
Confidence 369999999943 35667888888754
No 138
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=35.78 E-value=20 Score=33.81 Aligned_cols=50 Identities=22% Similarity=0.537 Sum_probs=25.9
Q ss_pred cccCcCccccCChhHHHHHHHhcCCCCCccCCcccccccChHHHHHHHHHh
Q psy16830 320 FTCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCGKRFKRENGVINHQRLH 370 (712)
Q Consensus 320 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 370 (712)
|.|-.|...|-.....+.=.+ ....+++.|+.|+........|..-.++|
T Consensus 183 ~sCLRCK~cfCddHvrrKg~k-y~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 183 YSCLRCKICFCDDHVRRKGFK-YEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred hhhhheeeeehhhhhhhcccc-cccCCCCCCCCCCCcccccccceeeeecc
Confidence 455555554543222111111 22346788888887776666665544444
No 139
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=35.57 E-value=23 Score=29.76 Aligned_cols=58 Identities=24% Similarity=0.537 Sum_probs=35.9
Q ss_pred cCCCCCccccccccccCChhhhhccccccccCcccccccceec---CCcccccCChHHHHHHHHHhcCCCCcCCcccccc
Q psy16830 147 HSTEKPFSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKC---SLCPKAFTHLAALEDHQYVHTGQKSHMCNLCSAR 223 (712)
Q Consensus 147 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C---~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~ 223 (712)
-.+.+.|.|++|..+.....-| +|-.| .+|+.- -..|..|-..|. .|+.|..+
T Consensus 75 F~d~~lYeCnIC~etS~ee~FL----------------KPneCCgY~iCn~C---ya~LWK~~~~yp-----vCPvCkTS 130 (140)
T PF05290_consen 75 FLDPKLYECNICKETSAEERFL----------------KPNECCGYSICNAC---YANLWKFCNLYP-----VCPVCKTS 130 (140)
T ss_pred ecCCCceeccCcccccchhhcC----------------CcccccchHHHHHH---HHHHHHHcccCC-----CCCccccc
Confidence 3556789999998887655433 23333 123222 245677666553 68999888
Q ss_pred CCChh
Q psy16830 224 FTCKG 228 (712)
Q Consensus 224 f~~~~ 228 (712)
|.+.+
T Consensus 131 FKss~ 135 (140)
T PF05290_consen 131 FKSSS 135 (140)
T ss_pred ccccc
Confidence 87654
No 140
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=35.44 E-value=17 Score=33.86 Aligned_cols=28 Identities=21% Similarity=0.356 Sum_probs=17.1
Q ss_pred CCcccCCCCccccCCchhhhhHHhhhcc
Q psy16830 597 QKTHLCTMCPASYATIGGLRSHIKIVHH 624 (712)
Q Consensus 597 ~k~~~C~~C~~~f~~~~~L~~H~~~~H~ 624 (712)
+..|.|..|++.|.-..-.+.|+...|.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 3456777777777777777777777664
No 141
>KOG2807|consensus
Probab=34.82 E-value=81 Score=31.08 Aligned_cols=83 Identities=17% Similarity=0.270 Sum_probs=46.0
Q ss_pred CChhHHHHHHHhcCCCCC----ccCCcccccccChHHHHH-HHHHhcCCCCCCceecCCChhhccChHHHHHHHhhhcCC
Q psy16830 330 SEKKYLRNHMISHSTQKP----FSCSLCGKRFKRENGVIN-HQRLHQTKKVYKSYKCSLCPRVFTHLAALEDHQFVHTGQ 404 (712)
Q Consensus 330 ~~~~~l~~H~~~h~~~~~----~~C~~C~~~f~~~~~l~~-H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 404 (712)
.+...|+.=+..|+...| ..|..=...|.+...... -.-.-|.......|.|+.|....- .
T Consensus 223 lDe~HlkeLl~e~~~Pp~~~~~~~~sLvkmGFP~~~~e~~ps~C~CH~~~~~~Gy~CP~CkakvC--------------s 288 (378)
T KOG2807|consen 223 LDEGHLKELLLEHTHPPPANKSKECSLVKMGFPSRSPEDTPSFCACHSELSGGGYFCPQCKAKVC--------------S 288 (378)
T ss_pred eCHHHHHHHHHhcCCCCCcccccCCceEEecCCCcccccCcchheeccccccCceeCCcccCeee--------------c
Confidence 344555555555543222 235555555653322111 011122333344799999976543 2
Q ss_pred CcccCCCCccccCChHHHHhHH
Q psy16830 405 KTRLCTMCPAIFASIGGLRSHM 426 (712)
Q Consensus 405 ~~~~C~~C~~~f~~~~~l~~H~ 426 (712)
-|..|+.|+.+..+...|.+-.
T Consensus 289 LP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 289 LPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred CCccCCccceeEecchHHHHHH
Confidence 4778999999998888887654
No 142
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=34.37 E-value=11 Score=35.36 Aligned_cols=12 Identities=33% Similarity=0.880 Sum_probs=7.4
Q ss_pred CCCccCccChhc
Q psy16830 95 NKPFKCDDCDQC 106 (712)
Q Consensus 95 ~~~~~C~~C~~~ 106 (712)
+-...|+.|++.
T Consensus 36 N~~MeCdkC~r~ 47 (314)
T PF06524_consen 36 NALMECDKCQRK 47 (314)
T ss_pred Cccccchhhhhh
Confidence 445667777654
No 143
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=34.15 E-value=13 Score=20.64 Aligned_cols=6 Identities=33% Similarity=1.365 Sum_probs=2.9
Q ss_pred CChhhh
Q psy16830 677 CDDCDQ 682 (712)
Q Consensus 677 C~~C~~ 682 (712)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555543
No 144
>PRK04023 DNA polymerase II large subunit; Validated
Probab=33.67 E-value=28 Score=40.00 Aligned_cols=11 Identities=18% Similarity=0.990 Sum_probs=6.3
Q ss_pred cccCcCCcccC
Q psy16830 572 YKCSLCPKAFT 582 (712)
Q Consensus 572 ~~C~~C~~~f~ 582 (712)
|+|..|+..|.
T Consensus 1038 fRC~kC~~kYR 1048 (1121)
T PRK04023 1038 FRCTKCGAKYR 1048 (1121)
T ss_pred eeecccCcccc
Confidence 55666665554
No 145
>KOG1280|consensus
Probab=32.97 E-value=52 Score=32.67 Aligned_cols=29 Identities=17% Similarity=0.230 Sum_probs=17.7
Q ss_pred cccCCCCccccCCchhhhhHHhhhccCCC
Q psy16830 599 THLCTMCPASYATIGGLRSHIKIVHHGLK 627 (712)
Q Consensus 599 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~ 627 (712)
.|.|++|+..=.+...|..|+...|....
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~ 107 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEAS 107 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccC
Confidence 56666666666666666666666665443
No 146
>KOG2593|consensus
Probab=32.72 E-value=37 Score=34.94 Aligned_cols=21 Identities=19% Similarity=0.656 Sum_probs=12.3
Q ss_pred CCCCCccccccccccCChhhh
Q psy16830 148 STEKPFSCPICKKQFKRKHEV 168 (712)
Q Consensus 148 ~~~~~~~C~~C~~~f~~~~~l 168 (712)
+...-|.|+.|.+.|....++
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~ 144 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEAL 144 (436)
T ss_pred cccccccCCccccchhhhHHH
Confidence 344456666666666655554
No 147
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=31.24 E-value=26 Score=29.87 Aligned_cols=25 Identities=32% Similarity=0.530 Sum_probs=15.8
Q ss_pred ceecCCcccccCChHHHHHHHHHhcCCC
Q psy16830 186 SYKCSLCPKAFTHLAALEDHQYVHTGQK 213 (712)
Q Consensus 186 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~ 213 (712)
...|-+||+.|.. |.+|++.|+|-.
T Consensus 72 ~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 72 YIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp -EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eeEEccCCcccch---HHHHHHHccCCC
Confidence 4679999999976 589999997753
No 148
>KOG3408|consensus
Probab=30.82 E-value=33 Score=28.21 Aligned_cols=28 Identities=21% Similarity=0.409 Sum_probs=24.5
Q ss_pred hccCCCccCChhhhhhhChHHHHhHHhh
Q psy16830 669 MALNKPFKCDDCDQRFLTKILLNKHKLL 696 (712)
Q Consensus 669 ~~~~k~~~C~~C~~~f~~~~~L~~H~~~ 696 (712)
..|...|.|-.|.+-|.+..+|..|.+.
T Consensus 52 lPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 52 LPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 3456779999999999999999999875
No 149
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=30.78 E-value=30 Score=21.78 Aligned_cols=8 Identities=38% Similarity=1.406 Sum_probs=3.4
Q ss_pred cCCCCccc
Q psy16830 512 CNYCGKTF 519 (712)
Q Consensus 512 C~~C~k~f 519 (712)
|+.||..|
T Consensus 4 C~~Cg~~Y 11 (36)
T PF05191_consen 4 CPKCGRIY 11 (36)
T ss_dssp ETTTTEEE
T ss_pred cCCCCCcc
Confidence 44444443
No 150
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=30.73 E-value=16 Score=23.69 Aligned_cols=10 Identities=30% Similarity=1.202 Sum_probs=5.4
Q ss_pred cccCCCCccc
Q psy16830 510 FSCNYCGKTF 519 (712)
Q Consensus 510 ~~C~~C~k~f 519 (712)
|.|..|+..+
T Consensus 29 y~C~~C~~~w 38 (40)
T smart00440 29 YVCTKCGHRW 38 (40)
T ss_pred EEeCCCCCEe
Confidence 5555555544
No 151
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=29.60 E-value=8.9 Score=24.65 Aligned_cols=10 Identities=30% Similarity=1.149 Sum_probs=5.4
Q ss_pred cccCCCCccc
Q psy16830 510 FSCNYCGKTF 519 (712)
Q Consensus 510 ~~C~~C~k~f 519 (712)
|.|..||..|
T Consensus 29 y~C~~C~~~w 38 (39)
T PF01096_consen 29 YVCCNCGHRW 38 (39)
T ss_dssp EEESSSTEEE
T ss_pred EEeCCCCCee
Confidence 5555555544
No 152
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=29.33 E-value=29 Score=24.33 Aligned_cols=13 Identities=23% Similarity=0.818 Sum_probs=6.3
Q ss_pred CCccCChhhhhhh
Q psy16830 673 KPFKCDDCDQRFL 685 (712)
Q Consensus 673 k~~~C~~C~~~f~ 685 (712)
+.+.|..||..|-
T Consensus 17 rk~~Cr~Cg~~~C 29 (57)
T cd00065 17 RRHHCRNCGRIFC 29 (57)
T ss_pred cccccCcCcCCcC
Confidence 3445555555543
No 153
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=29.03 E-value=54 Score=21.35 Aligned_cols=25 Identities=16% Similarity=0.344 Sum_probs=17.8
Q ss_pred ceecCCcccccC--ChHHHHHHHHHhc
Q psy16830 186 SYKCSLCPKAFT--HLAALEDHQYVHT 210 (712)
Q Consensus 186 ~~~C~~C~k~f~--~~~~L~~H~~~H~ 210 (712)
.-.|+.||..|. ....-..|.+.|.
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 357888888884 4666777877764
No 154
>KOG2593|consensus
Probab=28.90 E-value=29 Score=35.70 Aligned_cols=35 Identities=26% Similarity=0.717 Sum_probs=27.4
Q ss_pred CCccCChhhhhhhChHHHHhHHhhhcCCCccccccccC
Q psy16830 673 KPFKCDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGK 710 (712)
Q Consensus 673 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 710 (712)
..|.|+.|++.|....+|+ ++-.....|.|..|+-
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGG 161 (436)
T ss_pred ccccCCccccchhhhHHHH---hhcccCceEEEecCCC
Confidence 5699999999998776664 4555456799999973
No 155
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=28.25 E-value=37 Score=26.25 Aligned_cols=13 Identities=31% Similarity=0.902 Sum_probs=7.6
Q ss_pred CcccCcCccccCC
Q psy16830 319 GFTCNSCGLIFSE 331 (712)
Q Consensus 319 ~~~C~~C~~~f~~ 331 (712)
|-.|..||+.|.+
T Consensus 58 Pa~CkkCGfef~~ 70 (97)
T COG3357 58 PARCKKCGFEFRD 70 (97)
T ss_pred ChhhcccCccccc
Confidence 4456666666654
No 156
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=28.18 E-value=24 Score=19.90 Aligned_cols=9 Identities=56% Similarity=1.420 Sum_probs=5.5
Q ss_pred Ccccccccc
Q psy16830 152 PFSCPICKK 160 (712)
Q Consensus 152 ~~~C~~C~~ 160 (712)
+|.|+.||+
T Consensus 16 ~f~CPnCG~ 24 (24)
T PF07754_consen 16 PFPCPNCGF 24 (24)
T ss_pred eEeCCCCCC
Confidence 566666663
No 157
>KOG2272|consensus
Probab=27.41 E-value=45 Score=31.27 Aligned_cols=71 Identities=18% Similarity=0.399 Sum_probs=38.8
Q ss_pred CCCCcCcchhHhhcCHHHHHHHHh---------hc-CCCCCccccccccccCChhhhhccccccccCcccccccceecCC
Q psy16830 122 EGHLTCNLCGQTFRQKFRLLNHMI---------SH-STEKPFSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSL 191 (712)
Q Consensus 122 ~~~~~C~~C~~~f~~~~~l~~H~~---------~h-~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~ 191 (712)
..-|.|++|++...+...++.--+ .. .+.-.|.|..|...... ..|.---...|.. -|+|..
T Consensus 97 p~CF~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn~k~Ka~~~g~YvC~KCh~~iD~-~~l~fr~d~yH~y-------HFkCt~ 168 (332)
T KOG2272|consen 97 PACFRCDLCNKHLADQGFYRNQGRALCRECNQKEKAKGRGRYVCQKCHAHIDE-QPLTFRGDPYHPY-------HFKCTT 168 (332)
T ss_pred cccchhHHHHHHHhhhhhHhhcchHHhhhhhhhhcccccceeehhhhhhhccc-ccccccCCCCCcc-------ceeccc
Confidence 445788888887777665554211 11 11225888888665554 2222111111221 478999
Q ss_pred cccccCChH
Q psy16830 192 CPKAFTHLA 200 (712)
Q Consensus 192 C~k~f~~~~ 200 (712)
|++...+.+
T Consensus 169 C~keL~sda 177 (332)
T KOG2272|consen 169 CGKELTSDA 177 (332)
T ss_pred ccccccchh
Confidence 998876543
No 158
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=27.11 E-value=32 Score=28.97 Aligned_cols=13 Identities=31% Similarity=0.879 Sum_probs=6.5
Q ss_pred cccCCCCcccCCh
Q psy16830 510 FSCNYCGKTFLHK 522 (712)
Q Consensus 510 ~~C~~C~k~f~~~ 522 (712)
++|..||+.|..-
T Consensus 2 H~Ct~Cg~~f~dg 14 (131)
T PF09845_consen 2 HQCTKCGRVFEDG 14 (131)
T ss_pred cccCcCCCCcCCC
Confidence 4455555555543
No 159
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=26.24 E-value=34 Score=31.98 Aligned_cols=25 Identities=24% Similarity=0.647 Sum_probs=10.6
Q ss_pred eecCCChhhccChHHHHHHHhhhcC
Q psy16830 379 YKCSLCPRVFTHLAALEDHQFVHTG 403 (712)
Q Consensus 379 ~~C~~C~~~f~~~~~l~~H~~~h~~ 403 (712)
|.|+.|+|.|....-...|+..-|.
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred ECCCCCCcccCChHHHHHHHhhcCH
Confidence 4455555555544444444444443
No 161
>KOG4167|consensus
Probab=25.95 E-value=32 Score=37.69 Aligned_cols=27 Identities=26% Similarity=0.448 Sum_probs=24.6
Q ss_pred CCCccCChhhhhhhChHHHHhHHhhhc
Q psy16830 672 NKPFKCDDCDQRFLTKILLNKHKLLHF 698 (712)
Q Consensus 672 ~k~~~C~~C~~~f~~~~~L~~H~~~H~ 698 (712)
..-|.|..|++.|..-..++.||++|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 455999999999999999999999994
No 162
>KOG3408|consensus
Probab=25.68 E-value=43 Score=27.53 Aligned_cols=22 Identities=23% Similarity=0.424 Sum_probs=11.2
Q ss_pred CCcCcchhHhhcCHHHHHHHHh
Q psy16830 124 HLTCNLCGQTFRQKFRLLNHMI 145 (712)
Q Consensus 124 ~~~C~~C~~~f~~~~~l~~H~~ 145 (712)
.|.|-.|.+-|.+...|..|.+
T Consensus 57 qfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 57 QFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred eeehhhhhhhhcchHHHHHHHh
Confidence 3445555555555555555544
No 163
>KOG1280|consensus
Probab=24.99 E-value=44 Score=33.16 Aligned_cols=37 Identities=22% Similarity=0.435 Sum_probs=29.0
Q ss_pred CCccCChhhhhhhChHHHHhHHhhhcCCCcc--cccccc
Q psy16830 673 KPFKCDDCDQRFLTKILLNKHKLLHFDKDYF--SCNYCG 709 (712)
Q Consensus 673 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~--~C~~C~ 709 (712)
..|.|++|+..-.+...|..|+..-+-+.++ .|++|+
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 4799999999999999999999765444443 377764
No 164
>KOG2071|consensus
Probab=24.89 E-value=34 Score=36.78 Aligned_cols=29 Identities=31% Similarity=0.489 Sum_probs=25.3
Q ss_pred cCCCccCChhhhhhhChHHHHhHHhhhcC
Q psy16830 671 LNKPFKCDDCDQRFLTKILLNKHKLLHFD 699 (712)
Q Consensus 671 ~~k~~~C~~C~~~f~~~~~L~~H~~~H~~ 699 (712)
.+.|-+|..||.+|.+......||.+|.+
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~d 443 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHDD 443 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhhh
Confidence 45778999999999999999999888853
No 165
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=24.68 E-value=31 Score=28.92 Aligned_cols=20 Identities=25% Similarity=0.373 Sum_probs=10.9
Q ss_pred ccCccChhccCCHHHHHHhHhhc
Q psy16830 98 FKCDDCDQCFLTKILLNKHKLLH 120 (712)
Q Consensus 98 ~~C~~C~~~f~~~~~l~~H~~~h 120 (712)
..|-++|+.|+ +|++|+.+|
T Consensus 77 IicLEDGkkfK---SLKRHL~t~ 96 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTH 96 (148)
T ss_pred EEEeccCcchH---HHHHHHhcc
Confidence 45666666653 345555554
No 166
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.38 E-value=28 Score=25.86 Aligned_cols=29 Identities=28% Similarity=0.836 Sum_probs=19.7
Q ss_pred CccCChhhhhhhChHHHHhHHhhhcCCCccc-cccccC
Q psy16830 674 PFKCDDCDQRFLTKILLNKHKLLHFDKDYFS-CNYCGK 710 (712)
Q Consensus 674 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~-C~~C~~ 710 (712)
.|.|..||..|. +..|| .+.|+. |+.|+.
T Consensus 12 ~Y~c~~cg~~~d----vvq~~----~ddplt~ce~c~a 41 (82)
T COG2331 12 SYECTECGNRFD----VVQAM----TDDPLTTCEECGA 41 (82)
T ss_pred EEeecccchHHH----HHHhc----ccCccccChhhCh
Confidence 499999999883 44444 344544 888875
No 167
>KOG4377|consensus
Probab=24.35 E-value=43 Score=33.99 Aligned_cols=109 Identities=26% Similarity=0.509 Sum_probs=65.4
Q ss_pred cccc--CCCCcccCChHHHhhhhhhcCCC------------CcccC--cccccccCChhhHHhhhhhcccCccccccC--
Q psy16830 509 YFSC--NYCGKTFLHKYKLRRHIVIHSTA------------RPFSC--PVCKKQFKRKQDVAGHDRVVHQGKKLIVYK-- 570 (712)
Q Consensus 509 ~~~C--~~C~k~f~~~~~L~~H~~~H~~~------------~p~~C--~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~-- 570 (712)
-|.| +.|++.+-++..+.+|..+|... ..|.| .+|.+ +-+++..|... |+.....+..
T Consensus 271 hyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nF-ht~~~n~Gfrrt 346 (480)
T KOG4377|consen 271 HYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNF-HTDKRNNGFRRT 346 (480)
T ss_pred hhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCcc-ccccccCceecc
Confidence 3556 45888777788888888777422 12566 35777 55667777666 4433332222
Q ss_pred CcccCcCCcccCCchhHHhHHhhh----cCCC------------c------------ccCC--CCccccCCchhhhhHHh
Q psy16830 571 SYKCSLCPKAFTYLGALEDHQFVH----TGQK------------T------------HLCT--MCPASYATIGGLRSHIK 620 (712)
Q Consensus 571 ~~~C~~C~~~f~~~~~L~~H~~~H----~~~k------------~------------~~C~--~C~~~f~~~~~L~~H~~ 620 (712)
-|.|..||-++..+ ...|...| .++. + |.|. =|++.+...+.+..|.+
T Consensus 347 hfhC~r~gCTdtfK--~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkr 424 (480)
T KOG4377|consen 347 HFHCQRIGCTDTFK--DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKR 424 (480)
T ss_pred eeEEeccCCccccc--cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhh
Confidence 36788888444433 33443333 2211 1 3353 38999999999999977
Q ss_pred hhcc
Q psy16830 621 IVHH 624 (712)
Q Consensus 621 ~~H~ 624 (712)
. |.
T Consensus 425 k-he 427 (480)
T KOG4377|consen 425 K-HE 427 (480)
T ss_pred h-hh
Confidence 6 53
No 168
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=23.97 E-value=53 Score=22.90 Aligned_cols=12 Identities=42% Similarity=0.927 Sum_probs=7.3
Q ss_pred cccCcCCcccCC
Q psy16830 572 YKCSLCPKAFTY 583 (712)
Q Consensus 572 ~~C~~C~~~f~~ 583 (712)
.+|.+||+.|..
T Consensus 6 ~~C~~Cg~~~~~ 17 (54)
T PF14446_consen 6 CKCPVCGKKFKD 17 (54)
T ss_pred ccChhhCCcccC
Confidence 356677766653
No 169
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=23.78 E-value=38 Score=26.51 Aligned_cols=12 Identities=25% Similarity=0.766 Sum_probs=7.1
Q ss_pred CcccCcCCcccC
Q psy16830 571 SYKCSLCPKAFT 582 (712)
Q Consensus 571 ~~~C~~C~~~f~ 582 (712)
.+.|..|++.|.
T Consensus 54 IW~C~~C~~~~A 65 (90)
T PTZ00255 54 IWRCKGCKKTVA 65 (90)
T ss_pred EEEcCCCCCEEe
Confidence 356666666653
No 170
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.70 E-value=38 Score=27.12 Aligned_cols=11 Identities=9% Similarity=0.178 Sum_probs=6.5
Q ss_pred ceecCCccccc
Q psy16830 186 SYKCSLCPKAF 196 (712)
Q Consensus 186 ~~~C~~C~k~f 196 (712)
|..|+.||++|
T Consensus 26 PiVsPytG~s~ 36 (129)
T COG4530 26 PIVSPYTGKSY 36 (129)
T ss_pred ccccCcccccc
Confidence 45566666666
No 171
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=23.26 E-value=53 Score=37.17 Aligned_cols=26 Identities=23% Similarity=0.687 Sum_probs=14.9
Q ss_pred ccCcCccccCChhHHHHHHHhcCCCCCccCCccccc
Q psy16830 321 TCNSCGLIFSEKKYLRNHMISHSTQKPFSCSLCGKR 356 (712)
Q Consensus 321 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 356 (712)
.|+.|+..+ ..|...+...|..||..
T Consensus 446 ~Cp~Cd~~l----------t~H~~~~~L~CH~Cg~~ 471 (730)
T COG1198 446 ECPNCDSPL----------TLHKATGQLRCHYCGYQ 471 (730)
T ss_pred cCCCCCcce----------EEecCCCeeEeCCCCCC
Confidence 466666542 23334456677777765
No 172
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=23.24 E-value=39 Score=26.41 Aligned_cols=31 Identities=35% Similarity=0.855 Sum_probs=19.8
Q ss_pred CCccccccccccCChhhhhccccccccCcccccccceecCCcccccC
Q psy16830 151 KPFSCPICKKQFKRKHEVSKHDRIVHQGKKYNVYKSYKCSLCPKAFT 197 (712)
Q Consensus 151 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~k~f~ 197 (712)
..|.|+.|++.-. +....+ .+.|.-|++.|.
T Consensus 34 ~ky~Cp~Cgk~~v---------kR~a~G-------IW~C~~C~~~~A 64 (90)
T PF01780_consen 34 AKYTCPFCGKTSV---------KRVATG-------IWKCKKCGKKFA 64 (90)
T ss_dssp S-BEESSSSSSEE---------EEEETT-------EEEETTTTEEEE
T ss_pred CCCcCCCCCCcee---------EEeeeE-------EeecCCCCCEEe
Confidence 3588999987632 222222 588999988875
No 173
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=23.13 E-value=41 Score=21.79 Aligned_cols=16 Identities=25% Similarity=0.663 Sum_probs=7.3
Q ss_pred ccCCCCccccCCchhh
Q psy16830 600 HLCTMCPASYATIGGL 615 (712)
Q Consensus 600 ~~C~~C~~~f~~~~~L 615 (712)
+.|+.|+-.+.....|
T Consensus 20 d~C~~C~G~W~d~~el 35 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGEL 35 (41)
T ss_pred EECCCCCeEEccHHHH
Confidence 4444444444444443
No 174
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=22.48 E-value=40 Score=21.66 Aligned_cols=11 Identities=45% Similarity=1.129 Sum_probs=5.2
Q ss_pred CccCChhhhhh
Q psy16830 674 PFKCDDCDQRF 684 (712)
Q Consensus 674 ~~~C~~C~~~f 684 (712)
||+|..|+..|
T Consensus 12 ~f~C~~C~~~F 22 (39)
T smart00154 12 GFKCRHCGNLF 22 (39)
T ss_pred CeECCccCCcc
Confidence 45555554443
No 175
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=22.20 E-value=34 Score=27.11 Aligned_cols=11 Identities=45% Similarity=1.247 Sum_probs=5.7
Q ss_pred CCccccccccc
Q psy16830 151 KPFSCPICKKQ 161 (712)
Q Consensus 151 ~~~~C~~C~~~ 161 (712)
+.|.|+.||..
T Consensus 21 k~FtCp~Cghe 31 (104)
T COG4888 21 KTFTCPRCGHE 31 (104)
T ss_pred ceEecCccCCe
Confidence 34555555544
No 176
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=22.01 E-value=57 Score=27.41 Aligned_cols=25 Identities=36% Similarity=0.610 Sum_probs=15.1
Q ss_pred ccccCCCCcccCChHHHhhhhhhcCCCC
Q psy16830 509 YFSCNYCGKTFLHKYKLRRHIVIHSTAR 536 (712)
Q Consensus 509 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~ 536 (712)
...|-.+|+.|. +|++|+.+|.+--
T Consensus 76 ~IicLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 76 YIICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred eEEEeccCcchH---HHHHHHhcccCCC
Confidence 345666666663 4667777665543
No 177
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=21.87 E-value=75 Score=20.75 Aligned_cols=15 Identities=33% Similarity=0.813 Sum_probs=8.3
Q ss_pred ccCCCCcccCChHHH
Q psy16830 511 SCNYCGKTFLHKYKL 525 (712)
Q Consensus 511 ~C~~C~k~f~~~~~L 525 (712)
.|..||..|+.....
T Consensus 10 ~C~~C~rpf~WRKKW 24 (42)
T PF10013_consen 10 ICPVCGRPFTWRKKW 24 (42)
T ss_pred cCcccCCcchHHHHH
Confidence 366666666554443
No 178
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=21.62 E-value=47 Score=23.22 Aligned_cols=11 Identities=36% Similarity=1.102 Sum_probs=7.2
Q ss_pred ccccccccccC
Q psy16830 153 FSCPICKKQFK 163 (712)
Q Consensus 153 ~~C~~C~~~f~ 163 (712)
|.|+.||..+.
T Consensus 3 ~~CP~CG~~ie 13 (54)
T TIGR01206 3 FECPDCGAEIE 13 (54)
T ss_pred cCCCCCCCEEe
Confidence 56777777653
No 179
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=21.27 E-value=41 Score=26.35 Aligned_cols=12 Identities=25% Similarity=0.786 Sum_probs=7.1
Q ss_pred CcccCcCCcccC
Q psy16830 571 SYKCSLCPKAFT 582 (712)
Q Consensus 571 ~~~C~~C~~~f~ 582 (712)
.+.|..|++.|.
T Consensus 53 IW~C~~C~~~~A 64 (91)
T TIGR00280 53 IWTCRKCGAKFA 64 (91)
T ss_pred EEEcCCCCCEEe
Confidence 356666666653
No 180
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=21.14 E-value=47 Score=28.62 Aligned_cols=11 Identities=27% Similarity=0.993 Sum_probs=4.9
Q ss_pred CCCcchhhhhc
Q psy16830 482 FKCDDCDQRFL 492 (712)
Q Consensus 482 ~~C~~C~~~f~ 492 (712)
+.|..||..|.
T Consensus 71 ~~C~~CG~~~~ 81 (135)
T PRK03824 71 LKCRNCGNEWS 81 (135)
T ss_pred EECCCCCCEEe
Confidence 44444444443
No 181
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=21.06 E-value=44 Score=22.77 Aligned_cols=12 Identities=33% Similarity=0.855 Sum_probs=6.0
Q ss_pred CccCCcchhhcc
Q psy16830 290 PFKCDDCFQCFL 301 (712)
Q Consensus 290 ~~~C~~C~~~f~ 301 (712)
.+.|..||..|.
T Consensus 4 ~l~C~dCg~~Fv 15 (49)
T PF13451_consen 4 TLTCKDCGAEFV 15 (49)
T ss_pred eEEcccCCCeEE
Confidence 345555555443
No 182
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=20.62 E-value=30 Score=38.10 Aligned_cols=56 Identities=29% Similarity=0.624 Sum_probs=23.9
Q ss_pred CcchhhhhcchhhHhhccccccCCCccccCCCCcccCChHHHhhhhhhcCCCCcccCccccc
Q psy16830 484 CDDCDQRFLTKILLNKHKLLHFDKDYFSCNYCGKTFLHKYKLRRHIVIHSTARPFSCPVCKK 545 (712)
Q Consensus 484 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~p~~C~~C~~ 545 (712)
|..||-+|+--..|-.-+..-+-...-.|+.|.+.|....+-+-| ..|..|+.||-
T Consensus 126 CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP~nRRfH------AQp~aCp~CGP 181 (750)
T COG0068 126 CTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDPLNRRFH------AQPIACPKCGP 181 (750)
T ss_pred cCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCccccccc------cccccCcccCC
Confidence 555555555444443322111111112355555555554443333 23455555554
No 183
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=20.01 E-value=46 Score=19.61 Aligned_cols=8 Identities=50% Similarity=1.165 Sum_probs=3.6
Q ss_pred cCCCCccc
Q psy16830 512 CNYCGKTF 519 (712)
Q Consensus 512 C~~C~k~f 519 (712)
|-.|++.|
T Consensus 3 CiDC~~~F 10 (28)
T PF08790_consen 3 CIDCSKDF 10 (28)
T ss_dssp ETTTTEEE
T ss_pred eecCCCCc
Confidence 44444444
Done!