Query psy13090
Match_columns 771
No_of_seqs 625 out of 4393
Neff 10.2
Searched_HMMs 46136
Date Fri Aug 16 19:18:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13090.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13090hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 8.3E-30 1.8E-34 267.6 3.8 200 544-770 605-933 (958)
2 KOG2462|consensus 99.9 4.7E-28 1E-32 224.9 5.4 136 627-765 127-265 (279)
3 KOG3608|consensus 99.9 8.7E-27 1.9E-31 220.6 12.9 231 506-768 135-377 (467)
4 KOG2462|consensus 99.9 3.3E-27 7.2E-32 219.3 5.7 138 569-736 127-264 (279)
5 KOG1074|consensus 99.9 9.1E-27 2E-31 244.9 3.7 254 447-711 605-932 (958)
6 KOG3608|consensus 99.9 8.4E-25 1.8E-29 207.3 14.6 237 447-755 134-398 (467)
7 KOG3623|consensus 99.9 1.9E-23 4.1E-28 215.7 12.2 113 385-498 210-331 (1007)
8 KOG3623|consensus 99.9 1.4E-23 3.1E-28 216.6 9.6 121 446-593 209-330 (1007)
9 KOG3576|consensus 99.7 3.2E-18 7E-23 150.1 3.7 118 627-744 114-241 (267)
10 KOG3576|consensus 99.7 1.5E-17 3.3E-22 145.9 2.2 112 655-767 114-236 (267)
11 PLN03086 PRLI-interacting fact 99.3 2.3E-12 5E-17 136.9 7.6 122 631-770 434-567 (567)
12 PHA00733 hypothetical protein 99.1 3E-11 6.6E-16 104.4 4.4 54 713-768 71-124 (128)
13 PLN03086 PRLI-interacting fact 99.1 1.5E-10 3.3E-15 123.3 9.0 144 545-738 408-564 (567)
14 PHA00733 hypothetical protein 99.0 1.7E-10 3.8E-15 99.7 4.7 83 655-740 37-124 (128)
15 KOG1146|consensus 99.0 2.9E-11 6.2E-16 136.0 -0.6 104 180-284 438-611 (1406)
16 PHA02768 hypothetical protein; 99.0 1.6E-10 3.4E-15 80.6 1.6 45 33-79 5-49 (55)
17 PHA02768 hypothetical protein; 98.9 5.6E-10 1.2E-14 77.9 2.1 44 715-760 5-48 (55)
18 KOG3993|consensus 98.8 5E-10 1.1E-14 110.5 0.1 141 629-769 266-484 (500)
19 KOG1146|consensus 98.8 1.4E-08 2.9E-13 115.0 7.9 58 26-83 458-540 (1406)
20 PF13465 zf-H2C2_2: Zinc-finge 98.8 3.5E-09 7.5E-14 63.2 1.7 25 48-72 1-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.6 1.3E-08 2.9E-13 60.6 1.6 24 731-754 2-25 (26)
22 PHA00616 hypothetical protein 98.6 2.6E-08 5.7E-13 65.9 1.8 40 33-72 1-40 (44)
23 KOG3993|consensus 98.5 1.4E-08 3.1E-13 100.4 -0.9 109 290-409 267-380 (500)
24 PHA00616 hypothetical protein 98.3 2.1E-07 4.6E-12 61.7 1.1 33 715-747 1-33 (44)
25 PHA00732 hypothetical protein 98.2 9.1E-07 2E-11 68.9 2.7 45 659-709 2-47 (79)
26 PHA00732 hypothetical protein 98.1 2.4E-06 5.3E-11 66.5 3.4 48 686-740 1-49 (79)
27 PF05605 zf-Di19: Drought indu 97.9 2.1E-05 4.6E-10 56.9 4.4 50 448-500 3-53 (54)
28 PF05605 zf-Di19: Drought indu 97.8 1.3E-05 2.8E-10 58.0 1.9 51 179-232 3-53 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.7 3E-05 6.6E-10 44.9 2.1 20 717-736 2-21 (23)
30 PF00096 zf-C2H2: Zinc finger, 97.6 3.1E-05 6.8E-10 44.9 1.4 22 687-708 1-22 (23)
31 PF12756 zf-C2H2_2: C2H2 type 97.4 6.4E-05 1.4E-09 63.2 2.0 73 660-738 1-73 (100)
32 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00012 2.6E-09 42.9 1.7 23 716-738 1-23 (24)
33 COG5189 SFP1 Putative transcri 97.3 6.9E-05 1.5E-09 71.9 0.5 75 683-767 346-422 (423)
34 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00019 4.2E-09 42.0 2.3 24 744-767 1-24 (24)
35 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00016 3.5E-09 60.8 2.6 73 180-259 1-73 (100)
36 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00032 6.9E-09 42.4 2.1 24 715-738 1-24 (27)
37 COG5189 SFP1 Putative transcri 97.0 0.00031 6.7E-09 67.5 2.0 28 175-202 346-375 (423)
38 PF13912 zf-C2H2_6: C2H2-type 97.0 0.0003 6.6E-09 42.5 1.0 25 686-710 1-25 (27)
39 PF09237 GAGA: GAGA factor; I 96.8 0.001 2.2E-08 45.2 2.2 41 703-743 12-52 (54)
40 KOG2231|consensus 96.6 0.004 8.8E-08 68.2 6.4 47 478-529 184-236 (669)
41 PRK04860 hypothetical protein; 96.4 0.0014 3E-08 59.1 1.5 41 32-76 118-158 (160)
42 smart00355 ZnF_C2H2 zinc finge 96.4 0.0027 5.8E-08 37.8 2.1 19 718-736 3-21 (26)
43 COG5236 Uncharacterized conser 96.3 0.0039 8.4E-08 60.9 4.1 135 385-534 151-310 (493)
44 PRK04860 hypothetical protein; 96.2 0.0026 5.6E-08 57.4 2.2 40 714-757 118-157 (160)
45 KOG2785|consensus 96.2 0.015 3.4E-07 58.3 7.6 28 384-411 165-192 (390)
46 PF09237 GAGA: GAGA factor; I 96.2 0.0032 6.9E-08 42.9 1.9 32 30-61 21-52 (54)
47 smart00355 ZnF_C2H2 zinc finge 96.2 0.0034 7.4E-08 37.3 2.0 23 34-56 1-23 (26)
48 PF13909 zf-H2C2_5: C2H2-type 96.1 0.004 8.7E-08 36.3 1.9 23 744-767 1-23 (24)
49 PF13909 zf-H2C2_5: C2H2-type 96.1 0.0041 8.9E-08 36.2 2.0 23 716-739 1-23 (24)
50 KOG2231|consensus 96.1 0.012 2.6E-07 64.5 7.0 62 263-337 100-170 (669)
51 COG5236 Uncharacterized conser 96.0 0.0075 1.6E-07 59.0 4.0 129 630-769 151-307 (493)
52 KOG2482|consensus 95.8 0.022 4.8E-07 55.9 6.6 188 263-468 145-355 (423)
53 PF12874 zf-met: Zinc-finger o 95.6 0.0049 1.1E-07 36.4 0.9 20 717-736 2-21 (25)
54 PF12874 zf-met: Zinc-finger o 95.6 0.0089 1.9E-07 35.2 1.9 21 687-707 1-21 (25)
55 KOG2785|consensus 94.9 0.021 4.4E-07 57.5 3.3 109 382-502 65-192 (390)
56 KOG2482|consensus 94.6 0.079 1.7E-06 52.2 6.1 52 477-528 280-357 (423)
57 PF12171 zf-C2H2_jaz: Zinc-fin 94.5 0.02 4.2E-07 34.5 1.2 22 62-83 2-23 (27)
58 COG5048 FOG: Zn-finger [Genera 94.1 0.026 5.6E-07 62.3 2.1 58 32-89 288-351 (467)
59 PF12171 zf-C2H2_jaz: Zinc-fin 93.7 0.038 8.3E-07 33.2 1.3 22 209-230 2-23 (27)
60 COG5048 FOG: Zn-finger [Genera 92.2 0.032 6.9E-07 61.5 -0.9 123 629-751 288-454 (467)
61 KOG2893|consensus 92.0 0.054 1.2E-06 50.2 0.5 49 718-770 13-61 (341)
62 PF13913 zf-C2HC_2: zinc-finge 91.8 0.12 2.7E-06 30.3 1.7 19 62-81 3-21 (25)
63 KOG4173|consensus 91.8 0.12 2.6E-06 47.0 2.4 80 446-529 78-170 (253)
64 KOG2893|consensus 91.4 0.08 1.7E-06 49.1 0.9 54 479-537 13-66 (341)
65 PF13913 zf-C2HC_2: zinc-finge 91.4 0.16 3.4E-06 29.8 1.8 18 717-735 4-21 (25)
66 smart00451 ZnF_U1 U1-like zinc 89.5 0.26 5.6E-06 31.7 1.9 20 716-735 4-23 (35)
67 KOG4173|consensus 89.3 0.22 4.7E-06 45.4 1.8 82 385-503 79-173 (253)
68 smart00451 ZnF_U1 U1-like zinc 89.1 0.3 6.5E-06 31.4 2.0 22 744-765 4-25 (35)
69 cd00350 rubredoxin_like Rubred 88.1 0.22 4.7E-06 31.6 0.8 25 34-70 2-26 (33)
70 cd00350 rubredoxin_like Rubred 87.8 0.37 7.9E-06 30.5 1.6 10 742-751 16-25 (33)
71 COG4049 Uncharacterized protei 87.1 0.33 7.2E-06 33.8 1.2 30 740-769 14-43 (65)
72 PF12013 DUF3505: Protein of u 80.4 0.69 1.5E-05 39.2 0.7 26 384-409 79-108 (109)
73 TIGR00622 ssl1 transcription f 80.0 3.2 6.9E-05 34.7 4.4 33 209-255 2-34 (112)
74 cd00729 rubredoxin_SM Rubredox 79.3 1.3 2.8E-05 28.2 1.5 10 716-725 3-12 (34)
75 PF12013 DUF3505: Protein of u 79.0 3.1 6.7E-05 35.2 4.3 24 744-767 81-108 (109)
76 PF09986 DUF2225: Uncharacteri 78.1 0.64 1.4E-05 44.8 -0.2 43 714-756 4-61 (214)
77 PF09986 DUF2225: Uncharacteri 75.5 1.8 4E-05 41.7 2.1 42 208-249 5-61 (214)
78 PF09538 FYDLN_acid: Protein o 74.6 2.4 5.1E-05 35.5 2.2 35 540-585 5-39 (108)
79 TIGR00622 ssl1 transcription f 74.1 4.3 9.4E-05 33.9 3.6 54 688-750 57-110 (112)
80 COG4049 Uncharacterized protei 73.9 2.1 4.6E-05 29.9 1.5 30 472-501 13-42 (65)
81 PF09538 FYDLN_acid: Protein o 69.0 3.2 7E-05 34.7 1.8 30 34-74 10-39 (108)
82 COG1592 Rubrerythrin [Energy p 68.6 2.8 6.2E-05 38.0 1.5 23 33-68 134-156 (166)
83 PF13719 zinc_ribbon_5: zinc-r 67.3 3.1 6.7E-05 27.1 1.1 34 34-72 3-36 (37)
84 PF13717 zinc_ribbon_4: zinc-r 66.1 4.3 9.2E-05 26.3 1.6 11 715-725 25-35 (36)
85 PRK00398 rpoP DNA-directed RNA 66.1 3.2 7E-05 28.6 1.1 31 32-72 2-32 (46)
86 TIGR02098 MJ0042_CXXC MJ0042 f 64.8 5.6 0.00012 26.0 2.0 13 210-222 4-16 (38)
87 PF02892 zf-BED: BED zinc fing 64.0 4.4 9.5E-05 27.7 1.5 22 714-735 15-40 (45)
88 COG1592 Rubrerythrin [Energy p 63.0 5.1 0.00011 36.4 2.0 10 713-722 147-156 (166)
89 PF10571 UPF0547: Uncharacteri 62.9 4.9 0.00011 23.8 1.3 10 318-327 16-25 (26)
90 smart00659 RPOLCX RNA polymera 62.8 5.9 0.00013 27.0 1.8 11 716-726 3-13 (44)
91 PRK06266 transcription initiat 60.7 4.7 0.0001 37.5 1.4 41 22-71 106-146 (178)
92 smart00531 TFIIE Transcription 60.5 5 0.00011 36.1 1.6 42 26-71 92-133 (147)
93 PHA00626 hypothetical protein 60.1 7.1 0.00015 27.6 1.8 12 236-247 23-34 (59)
94 PF04959 ARS2: Arsenite-resist 59.0 5.3 0.00011 38.1 1.5 26 742-767 76-101 (214)
95 TIGR02300 FYDLN_acid conserved 58.8 7.7 0.00017 33.0 2.2 38 540-588 5-42 (129)
96 smart00834 CxxC_CXXC_SSSS Puta 57.6 3 6.4E-05 27.8 -0.3 31 33-70 5-35 (41)
97 TIGR02300 FYDLN_acid conserved 57.6 7 0.00015 33.3 1.8 35 33-78 9-43 (129)
98 PF03604 DNA_RNApol_7kD: DNA d 55.5 6.8 0.00015 24.6 1.1 27 34-71 1-27 (32)
99 PRK06266 transcription initiat 54.9 6.6 0.00014 36.5 1.4 17 686-702 117-133 (178)
100 TIGR00373 conserved hypothetic 54.4 13 0.00028 33.9 3.2 17 686-702 109-125 (158)
101 COG2888 Predicted Zn-ribbon RN 54.4 11 0.00024 27.2 2.1 9 743-751 50-58 (61)
102 PRK00464 nrdR transcriptional 53.6 7.1 0.00015 35.2 1.3 15 237-251 29-43 (154)
103 TIGR00373 conserved hypothetic 53.3 13 0.00028 33.9 3.0 34 381-420 105-138 (158)
104 KOG2186|consensus 53.2 9 0.00019 36.8 2.0 46 631-679 4-49 (276)
105 PF12907 zf-met2: Zinc-binding 53.1 11 0.00024 25.0 1.8 29 448-476 2-33 (40)
106 TIGR02605 CxxC_CxxC_SSSS putat 52.9 5.2 0.00011 28.4 0.3 10 209-218 6-15 (52)
107 COG1996 RPC10 DNA-directed RNA 51.8 9 0.0002 26.7 1.3 10 687-696 7-16 (49)
108 KOG1280|consensus 51.2 11 0.00023 38.1 2.2 22 236-257 79-100 (381)
109 smart00734 ZnF_Rad18 Rad18-lik 50.6 13 0.00027 22.1 1.6 19 717-736 3-21 (26)
110 smart00614 ZnF_BED BED zinc fi 50.3 10 0.00022 26.7 1.5 24 744-767 19-47 (50)
111 PF02176 zf-TRAF: TRAF-type zi 49.9 12 0.00026 27.4 1.9 41 714-754 8-53 (60)
112 PF09723 Zn-ribbon_8: Zinc rib 49.6 8.1 0.00018 26.0 0.8 11 179-189 6-16 (42)
113 KOG2186|consensus 48.3 12 0.00026 36.0 1.9 49 658-709 3-51 (276)
114 PRK14890 putative Zn-ribbon RN 47.9 17 0.00036 26.4 2.1 7 716-722 49-55 (59)
115 KOG2071|consensus 47.5 15 0.00033 40.2 2.8 25 629-653 417-441 (579)
116 PRK09678 DNA-binding transcrip 46.7 10 0.00023 29.0 1.1 39 291-329 2-42 (72)
117 PF00301 Rubredoxin: Rubredoxi 46.3 13 0.00028 25.8 1.4 13 506-518 2-14 (47)
118 PRK09678 DNA-binding transcrip 45.7 6.4 0.00014 30.1 -0.2 42 34-77 2-45 (72)
119 PF15269 zf-C2H2_7: Zinc-finge 45.6 14 0.00031 24.6 1.4 22 33-54 20-41 (54)
120 smart00531 TFIIE Transcription 45.1 17 0.00037 32.7 2.5 36 382-419 96-132 (147)
121 PF12907 zf-met2: Zinc-binding 43.9 17 0.00037 24.1 1.6 28 477-504 2-32 (40)
122 COG5188 PRP9 Splicing factor 3 43.0 51 0.0011 33.4 5.4 26 32-57 237-264 (470)
123 PRK00464 nrdR transcriptional 42.9 7.1 0.00015 35.2 -0.4 13 573-585 29-41 (154)
124 PF12760 Zn_Tnp_IS1595: Transp 41.4 10 0.00022 26.2 0.3 40 18-69 6-45 (46)
125 PF15269 zf-C2H2_7: Zinc-finge 40.9 19 0.00041 24.1 1.4 21 744-764 21-41 (54)
126 cd00730 rubredoxin Rubredoxin; 39.4 17 0.00037 25.6 1.2 11 506-516 2-12 (50)
127 PF06524 NOA36: NOA36 protein; 39.0 17 0.00036 35.0 1.4 85 232-339 138-232 (314)
128 PF04959 ARS2: Arsenite-resist 38.7 17 0.00037 34.8 1.4 27 446-472 76-102 (214)
129 COG4888 Uncharacterized Zn rib 38.6 17 0.00037 29.4 1.2 37 207-247 21-57 (104)
130 COG2331 Uncharacterized protei 37.4 18 0.00039 27.5 1.1 32 32-71 11-43 (82)
131 PF02176 zf-TRAF: TRAF-type zi 36.0 21 0.00046 26.1 1.3 44 30-74 6-55 (60)
132 COG4957 Predicted transcriptio 35.8 22 0.00048 30.5 1.5 25 34-61 77-101 (148)
133 PF08274 PhnA_Zn_Ribbon: PhnA 34.5 17 0.00037 22.4 0.5 10 714-723 18-27 (30)
134 PRK04023 DNA polymerase II lar 34.3 39 0.00085 39.7 3.6 9 687-695 639-647 (1121)
135 COG1997 RPL43A Ribosomal prote 32.0 34 0.00073 27.0 1.8 31 290-328 35-65 (89)
136 COG1198 PriA Primosomal protei 31.8 31 0.00066 39.8 2.3 11 235-245 474-484 (730)
137 COG5151 SSL1 RNA polymerase II 31.0 29 0.00063 34.3 1.7 50 291-340 363-412 (421)
138 KOG2593|consensus 30.7 34 0.00075 35.9 2.2 36 383-418 126-161 (436)
139 PF09845 DUF2072: Zn-ribbon co 29.5 34 0.00073 29.6 1.6 12 687-698 2-13 (131)
140 PF04606 Ogr_Delta: Ogr/Delta- 28.9 27 0.00059 24.2 0.8 36 293-328 2-39 (47)
141 PF05290 Baculo_IE-1: Baculovi 28.8 37 0.00079 29.3 1.7 12 718-729 124-135 (140)
142 PF07754 DUF1610: Domain of un 27.8 29 0.00064 20.1 0.7 8 715-722 16-23 (24)
143 PF14353 CpXC: CpXC protein 27.7 27 0.00059 30.5 0.8 21 208-228 38-58 (128)
144 cd00924 Cyt_c_Oxidase_Vb Cytoc 26.4 27 0.00058 28.6 0.5 19 25-44 72-90 (97)
145 COG4888 Uncharacterized Zn rib 24.8 28 0.0006 28.3 0.3 39 176-220 20-58 (104)
146 TIGR01206 lysW lysine biosynth 24.8 50 0.0011 23.7 1.6 9 180-188 4-12 (54)
147 PRK04023 DNA polymerase II lar 24.7 61 0.0013 38.2 3.0 10 317-326 664-673 (1121)
148 PF05443 ROS_MUCR: ROS/MUCR tr 24.0 32 0.0007 30.0 0.6 28 31-61 70-97 (132)
149 COG4530 Uncharacterized protei 23.7 54 0.0012 27.0 1.7 32 35-78 11-42 (129)
150 COG1655 Uncharacterized protei 23.5 18 0.00039 34.2 -1.1 40 714-753 18-72 (267)
151 smart00440 ZnF_C2C2 C2C2 Zinc 22.8 68 0.0015 21.3 1.8 11 236-246 28-38 (40)
152 KOG2593|consensus 22.8 85 0.0018 33.2 3.4 15 658-672 128-142 (436)
153 COG3357 Predicted transcriptio 22.6 47 0.001 26.4 1.2 27 32-68 57-83 (97)
154 KOG2807|consensus 22.5 96 0.0021 31.3 3.5 37 542-592 274-310 (378)
155 PF05443 ROS_MUCR: ROS/MUCR tr 22.0 49 0.0011 29.0 1.3 22 716-740 73-94 (132)
156 PF05129 Elf1: Transcription e 21.9 39 0.00084 26.7 0.6 12 237-248 47-58 (81)
157 PF05191 ADK_lid: Adenylate ki 21.8 78 0.0017 20.5 1.9 32 545-584 2-33 (36)
158 PRK03824 hypA hydrogenase nick 21.0 34 0.00073 30.2 0.1 44 27-70 64-116 (135)
159 KOG4167|consensus 20.9 27 0.0006 39.0 -0.5 25 658-682 792-816 (907)
160 KOG4167|consensus 20.9 37 0.0008 38.0 0.4 27 32-58 791-817 (907)
161 COG0068 HypF Hydrogenase matur 20.9 16 0.00034 41.0 -2.3 74 265-392 104-180 (750)
162 KOG3408|consensus 20.7 78 0.0017 26.8 2.1 29 624-652 51-79 (129)
163 smart00154 ZnF_AN1 AN1-like Zi 20.5 47 0.001 22.0 0.7 13 61-73 12-24 (39)
164 PF08271 TF_Zn_Ribbon: TFIIB z 20.4 76 0.0016 21.4 1.7 9 237-245 20-28 (43)
165 COG4896 Uncharacterized protei 20.4 54 0.0012 23.8 1.0 40 631-670 3-43 (68)
166 PF01096 TFIIS_C: Transcriptio 20.4 53 0.0011 21.7 0.9 11 236-246 28-38 (39)
167 PF03107 C1_2: C1 domain; Int 20.2 93 0.002 19.1 1.9 9 544-552 15-23 (30)
No 1
>KOG1074|consensus
Probab=99.95 E-value=8.3e-30 Score=267.58 Aligned_cols=200 Identities=18% Similarity=0.317 Sum_probs=162.8
Q ss_pred ceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCchhHHHHHHHhcCCCCCCchhhhhhhhhhHHhhcCCCcc
Q psy13090 544 KYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQSDNNTHQRRVHGVLKPTPLEMQVEKRDQVALRMKPSAF 623 (771)
Q Consensus 544 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~ 623 (771)
+-+|-+|-++..-+++|+.|.++|+||+||+|.+||+.|.++.+|+.|+-. |....+.
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~v-Hka~p~~--------------------- 662 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSV-HKAKPPA--------------------- 662 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccc-cccCccc---------------------
Confidence 478999999999999999999999999999999999999999999999977 5432221
Q ss_pred ccCCCCccccc---cchhhccChHHHHHHHhhccCC-C------------ccccccccccccChhHHHHHHHHhc-----
Q psy13090 624 EMQSESQLTCD---ICDRVFQTDKQWKLHMGFHNTK-R------------RLPCKLCGFRLKNHKQLKLHQRYHL----- 682 (771)
Q Consensus 624 ~~~~~~~~~C~---~C~~~f~~~~~L~~H~~~H~~~-~------------~~~C~~C~~~f~~~~~L~~H~~~H~----- 682 (771)
..++.|+ +|.+.|.+...|..|+++|.+. . .-+|..|.+.|.....+..++..|.
T Consensus 663 ----R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~ 738 (958)
T KOG1074|consen 663 ----RVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESE 738 (958)
T ss_pred ----cccccCCchhhhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccC
Confidence 2468899 9999999999999999999832 1 2469999999988888888876662
Q ss_pred -----------CCC----CccCCcCCccCCCchhHHHHHHHhc--C--------------------CC------------
Q psy13090 683 -----------SSN----LFRCEKCNLGYSRKDSFEYHMTSHN--Q--------------------KE------------ 713 (771)
Q Consensus 683 -----------~~~----~~~C~~C~~~F~~~~~L~~H~~~H~--~--------------------~~------------ 713 (771)
++. +..+..|+..+.....+..+-..+. . .+
T Consensus 739 ~~~~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~ 818 (958)
T KOG1074|consen 739 PDEQMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEI 818 (958)
T ss_pred CcccccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcC
Confidence 222 5678888888877666555432220 0 00
Q ss_pred -----------------------------------------------------------CeeecccCCcccCChhHHHHH
Q psy13090 714 -----------------------------------------------------------QTFMCTQCDFTSSSKDTLNTH 734 (771)
Q Consensus 714 -----------------------------------------------------------~~~~C~~C~~~f~~~~~L~~H 734 (771)
....|.+||+.|.+.++|+.|
T Consensus 819 ~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH 898 (958)
T KOG1074|consen 819 LAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIH 898 (958)
T ss_pred CccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHh
Confidence 016799999999999999999
Q ss_pred HhhccCCCccccCCCCCccchhhhhhhHHHHHhhhc
Q psy13090 735 AMVHADELKFSCQNCDFKCSKLSCVRNVISQVAQKT 770 (771)
Q Consensus 735 ~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 770 (771)
+++|++++||.|.+|++.|..+.+|+.||. -|+|+
T Consensus 899 ~rTHtg~KPF~C~fC~~aFttrgnLKvHMg-tH~w~ 933 (958)
T KOG1074|consen 899 MRTHTGPKPFFCHFCEEAFTTRGNLKVHMG-THMWV 933 (958)
T ss_pred hhcCCCCCCccchhhhhhhhhhhhhhhhhc-ccccc
Confidence 999999999999999999999999999999 55554
No 2
>KOG2462|consensus
Probab=99.94 E-value=4.7e-28 Score=224.87 Aligned_cols=136 Identities=24% Similarity=0.486 Sum_probs=129.1
Q ss_pred CCCccccccchhhccChHHHHHHHhhccC---CCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHH
Q psy13090 627 SESQLTCDICDRVFQTDKQWKLHMGFHNT---KRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFE 703 (771)
Q Consensus 627 ~~~~~~C~~C~~~f~~~~~L~~H~~~H~~---~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~ 703 (771)
....|+|+.||+.+.+.++|.+|..+|-. .+.+.|++|||.|.+..+|+.|+|+|+ -+++|.+|||.|.+.-.|+
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhh
Confidence 35679999999999999999999999864 678999999999999999999999997 6899999999999999999
Q ss_pred HHHHHhcCCCCeeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhhhHHHH
Q psy13090 704 YHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVRNVISQ 765 (771)
Q Consensus 704 ~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~ 765 (771)
-|+|+|+ |||||.|+.|++.|..+++|+.||++|++.++|+|..|+|.|+.++.|.+|...
T Consensus 205 GHiRTHT-GEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHT-GEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred ccccccc-CCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999 699999999999999999999999999999999999999999999999999753
No 3
>KOG3608|consensus
Probab=99.94 E-value=8.7e-27 Score=220.64 Aligned_cols=231 Identities=20% Similarity=0.412 Sum_probs=191.7
Q ss_pred cccC--CCCccCCChhhhhhhHHhhhcc-ccCCCcccccCCc-eec--CCCCcccCChHHHHccccccCCCCCCCCcccc
Q psy13090 506 YSCN--ICFRRNSKYTHLSRHKLLVHQK-QIDGVGSVEIGEK-YSC--TQCSYQTYHKHALDNHMPLHSLDRKHQCPVCL 579 (771)
Q Consensus 506 ~~C~--~C~~~f~~~~~l~~H~~~~h~~-~~~~~~~~~~~~~-~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 579 (771)
|.|. .|+..|.+...+.+|+..+-.- ..+.........+ +.| ..|.+.+.++..|+.|++.|++++...|+.||
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 4443 3777777777777775433221 1110111111112 556 56999999999999999999999999999999
Q ss_pred cccCCchhHHHHHHHhcCCCCCCchhhhhhhhhhHHhhcCCCccccCCCCccccccchhhccChHHHHHHHhhccCCCcc
Q psy13090 580 RRFSRQSDNNTHQRRVHGVLKPTPLEMQVEKRDQVALRMKPSAFEMQSESQLTCDICDRVFQTDKQWKLHMGFHNTKRRL 659 (771)
Q Consensus 580 ~~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~ 659 (771)
.-|.++..|..|+++.... ...+|+|..|.+.|.+...|+.|+..|.. -|
T Consensus 215 ~~F~~~tkl~DH~rRqt~l----------------------------~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~y 264 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQTEL----------------------------NTNSFQCAQCFKRFATEKLLKSHVVRHVN--CY 264 (467)
T ss_pred HHhccccHHHHHHHhhhhh----------------------------cCCchHHHHHHHHHhHHHHHHHHHHHhhh--cc
Confidence 9999999999999873221 13479999999999999999999999975 59
Q ss_pred ccccccccccChhHHHHHHH-HhcCCCCccCCcCCccCCCchhHHHHHHHhcCCCCeeeccc--CCcccCChhHHHHHHh
Q psy13090 660 PCKLCGFRLKNHKQLKLHQR-YHLSSNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQ--CDFTSSSKDTLNTHAM 736 (771)
Q Consensus 660 ~C~~C~~~f~~~~~L~~H~~-~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~--C~~~f~~~~~L~~H~~ 736 (771)
+|+.|+.+....++|..|++ .|...+||+|+.|++.|.+.+.|.+|+.+|. +..|+|+. |.++|.+...|.+|++
T Consensus 265 kCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS--~~~y~C~h~~C~~s~r~~~q~~~H~~ 342 (467)
T KOG3608|consen 265 KCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS--KTVYQCEHPDCHYSVRTYTQMRRHFL 342 (467)
T ss_pred cccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc--ccceecCCCCCcHHHHHHHHHHHHHH
Confidence 99999999999999999998 6888999999999999999999999999997 57899998 9999999999999998
Q ss_pred hcc-C--CCccccCCCCCccchhhhhhhHHHHHhh
Q psy13090 737 VHA-D--ELKFSCQNCDFKCSKLSCVRNVISQVAQ 768 (771)
Q Consensus 737 ~H~-~--e~~~~C~~C~~~f~~~~~l~~H~~~~H~ 768 (771)
.++ | +.+|.|..|++.|++-.+|-.|++..|.
T Consensus 343 evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 343 EVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred HhccCCCCCceeeecchhhhccchhHHHHHHHhhc
Confidence 544 4 4579999999999999999999998885
No 4
>KOG2462|consensus
Probab=99.93 E-value=3.3e-27 Score=219.26 Aligned_cols=138 Identities=25% Similarity=0.552 Sum_probs=120.4
Q ss_pred CCCCCCCcccccccCCchhHHHHHHHhcCCCCCCchhhhhhhhhhHHhhcCCCccccCCCCccccccchhhccChHHHHH
Q psy13090 569 LDRKHQCPVCLRRFSRQSDNNTHQRRVHGVLKPTPLEMQVEKRDQVALRMKPSAFEMQSESQLTCDICDRVFQTDKQWKL 648 (771)
Q Consensus 569 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~ 648 (771)
....|+|+.||+.+.+.++|.+|... |-.. ...+.+.|++||+.+.+...|+.
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~-H~~~--------------------------~s~ka~~C~~C~K~YvSmpALkM 179 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQT-HRSL--------------------------DSKKAFSCKYCGKVYVSMPALKM 179 (279)
T ss_pred cCCceeccccccccccccccchhhcc-cccc--------------------------cccccccCCCCCceeeehHHHhh
Confidence 34568888888888888889888877 4211 11456889999999999999999
Q ss_pred HHhhccCCCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHHHHhcCCCCeeecccCCcccCCh
Q psy13090 649 HMGFHNTKRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSK 728 (771)
Q Consensus 649 H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~ 728 (771)
|+++|+ -+++|.+|||.|....-|+-|+|+|||||||.|+.|+++|.++++|+.||++|. +.|+|+|+.|+|+|..+
T Consensus 180 HirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS-~~K~~qC~~C~KsFsl~ 256 (279)
T KOG2462|consen 180 HIRTHT--LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHS-DVKKHQCPRCGKSFALK 256 (279)
T ss_pred HhhccC--CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhc-CCccccCcchhhHHHHH
Confidence 999997 679999999999999999999999999999999999999999999999999998 57899999999999999
Q ss_pred hHHHHHHh
Q psy13090 729 DTLNTHAM 736 (771)
Q Consensus 729 ~~L~~H~~ 736 (771)
+.|.+|..
T Consensus 257 SyLnKH~E 264 (279)
T KOG2462|consen 257 SYLNKHSE 264 (279)
T ss_pred HHHHHhhh
Confidence 99999965
No 5
>KOG1074|consensus
Probab=99.93 E-value=9.1e-27 Score=244.88 Aligned_cols=254 Identities=19% Similarity=0.380 Sum_probs=166.4
Q ss_pred ccccccchhhccCHHHHHHHHhhccCCCCCcccCcchhhcCCHHHHHHHHHhhhcCC----CccccC---CCCccCCChh
Q psy13090 447 KRQCEYCSQLFRRASAVEKHIALKHTGDKPYSCYLCEEKFENSESLRTHMNSLHALL----FPYSCN---ICFRRNSKYT 519 (771)
Q Consensus 447 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~----~~~~C~---~C~~~f~~~~ 519 (771)
+-+|-+|-++..-+++|+.|++ .|+|++||+|.+|++.|+++.+|+.||- +|... .++.|+ +|.+.|....
T Consensus 605 PNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~~kftn~V 682 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQKKFTNAV 682 (958)
T ss_pred ccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhccc-ccccCccccccccCCchhhhcccccccc
Confidence 4579999999999999999999 7999999999999999999999999997 67543 468899 9999999999
Q ss_pred hhhhhHHhhhccccCCCccc--ccCCceecCCCCcccCChHHHHcccccc----------------CCCC----CCCCcc
Q psy13090 520 HLSRHKLLVHQKQIDGVGSV--EIGEKYSCTQCSYQTYHKHALDNHMPLH----------------SLDR----KHQCPV 577 (771)
Q Consensus 520 ~l~~H~~~~h~~~~~~~~~~--~~~~~~~C~~C~~~f~~~~~l~~H~~~h----------------~~~~----~~~C~~ 577 (771)
.|..|+++|-...+++.+.. ..+..-+|..|.+.|.....+..++..| +++. +..+..
T Consensus 683 ~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~ 762 (958)
T KOG1074|consen 683 TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENS 762 (958)
T ss_pred cccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccc
Confidence 99999988775555443222 1222367999999997777776665544 2222 567888
Q ss_pred cccccCCchhHHHHHHHh--cCCCCCCchhhhhhhhhhHHhhcCCCccccCCCCcc-ccccchhhccChH----------
Q psy13090 578 CLRRFSRQSDNNTHQRRV--HGVLKPTPLEMQVEKRDQVALRMKPSAFEMQSESQL-TCDICDRVFQTDK---------- 644 (771)
Q Consensus 578 C~~~f~~~~~L~~H~~~~--H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~C~~C~~~f~~~~---------- 644 (771)
|+..+.....+..+-... +...+. .+.. ...........+++++. .+.+++..-...-
T Consensus 763 ~~~~~~~e~~i~~~g~te~asa~~~~------vg~~---s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~ 833 (958)
T KOG1074|consen 763 CGRELEGEMAISVRGSTEEASANLDE------VGTV---SAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQE 833 (958)
T ss_pred cccccCcccccccccchhhhhcChhh------hcCc---cccchhhhhcccCCCCcccccCCCcCCccccccCchhhccc
Confidence 888888776665543221 100000 0000 00000000000111111 2222221111000
Q ss_pred ----------------------HHHHHHhhcc--------C--CCccccccccccccChhHHHHHHHHhcCCCCccCCcC
Q psy13090 645 ----------------------QWKLHMGFHN--------T--KRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKC 692 (771)
Q Consensus 645 ----------------------~L~~H~~~H~--------~--~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C 692 (771)
.+..--.+-. + .....|.+||+.|.+.++|..|+|+|+++|||.|.+|
T Consensus 834 ~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC 913 (958)
T KOG1074|consen 834 TSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFC 913 (958)
T ss_pred ccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhh
Confidence 0000000000 0 1126799999999999999999999999999999999
Q ss_pred CccCCCchhHHHHHHHhcC
Q psy13090 693 NLGYSRKDSFEYHMTSHNQ 711 (771)
Q Consensus 693 ~~~F~~~~~L~~H~~~H~~ 711 (771)
++.|..+.+|+.||.+|.+
T Consensus 914 ~~aFttrgnLKvHMgtH~w 932 (958)
T KOG1074|consen 914 EEAFTTRGNLKVHMGTHMW 932 (958)
T ss_pred hhhhhhhhhhhhhhccccc
Confidence 9999999999999999874
No 6
>KOG3608|consensus
Probab=99.92 E-value=8.4e-25 Score=207.26 Aligned_cols=237 Identities=22% Similarity=0.435 Sum_probs=184.6
Q ss_pred cccc--ccchhhccCHHHHHHHHhhccC-----------C-CCC-cccC--cchhhcCCHHHHHHHHHhhhcCCCccccC
Q psy13090 447 KRQC--EYCSQLFRRASAVEKHIALKHT-----------G-DKP-YSCY--LCEEKFENSESLRTHMNSLHALLFPYSCN 509 (771)
Q Consensus 447 ~~~C--~~C~~~f~~~~~L~~H~~~~H~-----------~-~~~-~~C~--~C~~~f~~~~~L~~H~~~~H~~~~~~~C~ 509 (771)
.|.| +.|+..|.+...+..|+. .|. + ++| +.|. .|-+.|.+++.|++|++ .|++++...|+
T Consensus 134 ~f~C~WedCe~~F~s~~ef~dHV~-~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r-~Hs~eKvvACp 211 (467)
T KOG3608|consen 134 NFRCGWEDCEREFVSIVEFQDHVV-KHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR-THSNEKVVACP 211 (467)
T ss_pred hhccChhhcCCcccCHHHHHHHHH-HhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH-hcCCCeEEecc
Confidence 3556 458888888888888876 442 1 122 4564 58888999999999988 78888888888
Q ss_pred CCCccCCChhhhhhhHHhhhccccCCCcccccCCceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCchhHH
Q psy13090 510 ICFRRNSKYTHLSRHKLLVHQKQIDGVGSVEIGEKYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQSDNN 589 (771)
Q Consensus 510 ~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 589 (771)
.|+.-|++...|..|.+... -....+|+|..|.+.|.++..|.
T Consensus 212 ~Cg~~F~~~tkl~DH~rRqt-------------------------------------~l~~n~fqC~~C~KrFaTeklL~ 254 (467)
T KOG3608|consen 212 HCGELFRTKTKLFDHLRRQT-------------------------------------ELNTNSFQCAQCFKRFATEKLLK 254 (467)
T ss_pred hHHHHhccccHHHHHHHhhh-------------------------------------hhcCCchHHHHHHHHHhHHHHHH
Confidence 88888888888888876321 11234788888888888888888
Q ss_pred HHHHHhcCCCCCCchhhhhhhhhhHHhhcCCCccccCCCCccccccchhhccChHHHHHHHhh-ccCCCccccccccccc
Q psy13090 590 THQRRVHGVLKPTPLEMQVEKRDQVALRMKPSAFEMQSESQLTCDICDRVFQTDKQWKLHMGF-HNTKRRLPCKLCGFRL 668 (771)
Q Consensus 590 ~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~f 668 (771)
.|+.++- .-|+|+.|+.+....++|..|++. |...+||+|+.|++.|
T Consensus 255 ~Hv~rHv--------------------------------n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c 302 (467)
T KOG3608|consen 255 SHVVRHV--------------------------------NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRC 302 (467)
T ss_pred HHHHHhh--------------------------------hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhh
Confidence 8888732 238899999999999999999875 8888999999999999
Q ss_pred cChhHHHHHHHHhcCCCCccCCc--CCccCCCchhHHHHHHHhcCC--CCeeecccCCcccCChhHHHHHHh-hcc----
Q psy13090 669 KNHKQLKLHQRYHLSSNLFRCEK--CNLGYSRKDSFEYHMTSHNQK--EQTFMCTQCDFTSSSKDTLNTHAM-VHA---- 739 (771)
Q Consensus 669 ~~~~~L~~H~~~H~~~~~~~C~~--C~~~F~~~~~L~~H~~~H~~~--~~~~~C~~C~~~f~~~~~L~~H~~-~H~---- 739 (771)
.+.+.|.+|..+|+ +..|.|+. |.++|.+...|++|++.++.+ +-+|.|..|++.|++..+|.+|++ .|.
T Consensus 303 ~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~P 381 (467)
T KOG3608|consen 303 VRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRLP 381 (467)
T ss_pred ccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccCC
Confidence 99999999999987 77899987 999999999999999987633 457999999999999999999976 453
Q ss_pred -CCCccccCCCCCccch
Q psy13090 740 -DELKFSCQNCDFKCSK 755 (771)
Q Consensus 740 -~e~~~~C~~C~~~f~~ 755 (771)
|-+.|.=.+|.-.|..
T Consensus 382 sGh~RFtYk~~edG~mR 398 (467)
T KOG3608|consen 382 SGHKRFTYKVDEDGFMR 398 (467)
T ss_pred CCCCceeeeeccCceee
Confidence 3333444555555544
No 7
>KOG3623|consensus
Probab=99.90 E-value=1.9e-23 Score=215.74 Aligned_cols=113 Identities=28% Similarity=0.575 Sum_probs=99.9
Q ss_pred ccccccccccccChHHHHHHHhhcCC-----ccCCCCCCCCCCHHHHHHHHHhcCCCCCC----CCccCcCccccccchh
Q psy13090 385 NYKCNICDCFFKTHSILEDHMKALHS-----FPCDKCDYVGGNLSMLTSHQNSHNSQTEV----PPILSKNKRQCEYCSQ 455 (771)
Q Consensus 385 ~~~C~~C~~~f~~~~~L~~H~~~~h~-----~~C~~C~~~f~~~~~l~~H~~~h~~~~~~----~~~~~~~~~~C~~C~~ 455 (771)
...|+.|+..+.....|+.|++..|. |.|.+|.++|..+..|.+|+.+|...... +.....+.|+|..|+|
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence 46899999999999999999987764 99999999999999999999999755432 2223347899999999
Q ss_pred hccCHHHHHHHHhhccCCCCCcccCcchhhcCCHHHHHHHHHh
Q psy13090 456 LFRRASAVEKHIALKHTGDKPYSCYLCEEKFENSESLRTHMNS 498 (771)
Q Consensus 456 ~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~ 498 (771)
.|..+-.|+.|++ .|.|++||.|+.|+++|....++..||.+
T Consensus 290 AFKfKHHLKEHlR-IHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLR-IHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhHHHHHhhhe-eecCCCCcCCcccccccccCCcccccccc
Confidence 9999999999999 69999999999999999999999999853
No 8
>KOG3623|consensus
Probab=99.89 E-value=1.4e-23 Score=216.64 Aligned_cols=121 Identities=28% Similarity=0.602 Sum_probs=97.5
Q ss_pred CccccccchhhccCHHHHHHHHhhccCC-CCCcccCcchhhcCCHHHHHHHHHhhhcCCCccccCCCCccCCChhhhhhh
Q psy13090 446 NKRQCEYCSQLFRRASAVEKHIALKHTG-DKPYSCYLCEEKFENSESLRTHMNSLHALLFPYSCNICFRRNSKYTHLSRH 524 (771)
Q Consensus 446 ~~~~C~~C~~~f~~~~~L~~H~~~~H~~-~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H 524 (771)
....|++|++.+.....|+.|++..|.. +..|.|.+|.++|+.+..|.+||. .|....+- ...
T Consensus 209 qlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~-~hkpg~dq-----------a~s---- 272 (1007)
T KOG3623|consen 209 QLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQ-LHKPGGDQ-----------AIS---- 272 (1007)
T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHH-hhcCCCcc-----------ccc----
Confidence 3467999999999999999999877754 456999999999999999999998 55432210 000
Q ss_pred HHhhhccccCCCcccccCCceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCchhHHHHHH
Q psy13090 525 KLLVHQKQIDGVGSVEIGEKYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQSDNNTHQR 593 (771)
Q Consensus 525 ~~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 593 (771)
.......++|+|+.||+.|..+..|+.|+|+|.|++||.|+.|++.|.....+..||.
T Consensus 273 -----------ltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 273 -----------LTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred -----------ccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 0112334569999999999999999999999999999999999999999999988874
No 9
>KOG3576|consensus
Probab=99.71 E-value=3.2e-18 Score=150.07 Aligned_cols=118 Identities=22% Similarity=0.545 Sum_probs=108.1
Q ss_pred CCCccccccchhhccChHHHHHHHhhccCCCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHH
Q psy13090 627 SESQLTCDICDRVFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHM 706 (771)
Q Consensus 627 ~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~ 706 (771)
+...|.|.+|++.|.-...|.+|++-|...++|.|..||+.|.....|++|+|+|+|.+||+|..|+++|+..-+|..|+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 35679999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHhcC----------CCCeeecccCCcccCChhHHHHHHhhccCCCcc
Q psy13090 707 TSHNQ----------KEQTFMCTQCDFTSSSKDTLNTHAMVHADELKF 744 (771)
Q Consensus 707 ~~H~~----------~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~ 744 (771)
+.-+| .++.|.|+.||++-.....+..|++.|+..-|+
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPA 241 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCHH
Confidence 87654 246799999999999999999999988776553
No 10
>KOG3576|consensus
Probab=99.67 E-value=1.5e-17 Score=145.87 Aligned_cols=112 Identities=17% Similarity=0.463 Sum_probs=104.3
Q ss_pred CCCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHHHHhcCCCCeeecccCCcccCChhHHHHH
Q psy13090 655 TKRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTH 734 (771)
Q Consensus 655 ~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H 734 (771)
+...|.|.+|+|.|.-..-|.+|++-|...+.|-|..||+.|.+--+|++|+|+|+ +.+||+|..|++.|+++-+|..|
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rtht-gvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHT-GVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhcccc-CccccchhhhhHHHHhhccHHHH
Confidence 45679999999999999999999999999999999999999999999999999999 69999999999999999999999
Q ss_pred Hh-hcc----------CCCccccCCCCCccchhhhhhhHHHHHh
Q psy13090 735 AM-VHA----------DELKFSCQNCDFKCSKLSCVRNVISQVA 767 (771)
Q Consensus 735 ~~-~H~----------~e~~~~C~~C~~~f~~~~~l~~H~~~~H 767 (771)
++ +|. .++.|.|+.||+.-.....+..|+..+|
T Consensus 193 l~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 193 LKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHH 236 (267)
T ss_pred HHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcC
Confidence 87 554 2567999999999999999999999887
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.32 E-value=2.3e-12 Score=136.92 Aligned_cols=122 Identities=18% Similarity=0.371 Sum_probs=102.6
Q ss_pred ccccc--chhhccChHHHHHHHhhccCCCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHHHH
Q psy13090 631 LTCDI--CDRVFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHMTS 708 (771)
Q Consensus 631 ~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~~~ 708 (771)
-.|+. ||..|. +..|..| +.|+.|++.|. ...|..|+++|+ +++.|+ ||+.+ .+..|..|+++
T Consensus 434 V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~t 498 (567)
T PLN03086 434 VVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQAS 498 (567)
T ss_pred eeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhc
Confidence 35874 999883 4444444 59999999996 688999999985 899999 99755 67999999999
Q ss_pred hcCCCCeeecccCCcccCC----------hhHHHHHHhhccCCCccccCCCCCccchhhhhhhHHHHHhhhc
Q psy13090 709 HNQKEQTFMCTQCDFTSSS----------KDTLNTHAMVHADELKFSCQNCDFKCSKLSCVRNVISQVAQKT 770 (771)
Q Consensus 709 H~~~~~~~~C~~C~~~f~~----------~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 770 (771)
|. +++++.|+.|++.|.. .+.|..|..+. |.+++.|..||+.+..+. +..|+..+|+|+
T Consensus 499 hC-p~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlrd-m~~H~~~~h~~~ 567 (567)
T PLN03086 499 TC-PLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLKE-MDIHQIAVHQKS 567 (567)
T ss_pred cC-CCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeehh-HHHHHHHhhcCC
Confidence 98 5899999999999952 35899999986 999999999999998665 899999999885
No 12
>PHA00733 hypothetical protein
Probab=99.15 E-value=3e-11 Score=104.44 Aligned_cols=54 Identities=17% Similarity=0.295 Sum_probs=37.9
Q ss_pred CCeeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhhhHHHHHhh
Q psy13090 713 EQTFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVRNVISQVAQ 768 (771)
Q Consensus 713 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~~H~ 768 (771)
++||.|+.||+.|.+...|..|++.| +.+|.|++|++.|.....|..|+.+.|.
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 56777777777777777777777665 3467777777777777777777776663
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.12 E-value=1.5e-10 Score=123.28 Aligned_cols=144 Identities=22% Similarity=0.430 Sum_probs=110.1
Q ss_pred eecCCCCcccCChHHHHccccccCCCCCCCCcc--cccccCCchhHHHHHHHhcCCCCCCchhhhhhhhhhHHhhcCCCc
Q psy13090 545 YSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPV--CLRRFSRQSDNNTHQRRVHGVLKPTPLEMQVEKRDQVALRMKPSA 622 (771)
Q Consensus 545 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~ 622 (771)
-.|+-|...... ..|..|...- .-..-.|+. |+..|.. ..|
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~r-~el---------------------------------- 450 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYC-SRHNVVCPHDGCGIVLRV-EEA---------------------------------- 450 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhC-CCcceeCCcccccceeec-ccc----------------------------------
Confidence 568888766554 4555776433 223446774 8887732 222
Q ss_pred cccCCCCccccccchhhccChHHHHHHHhhccCCCccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCC----
Q psy13090 623 FEMQSESQLTCDICDRVFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSR---- 698 (771)
Q Consensus 623 ~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~---- 698 (771)
+..+.|+.|++.|. ...|..|+..++ +++.|+ ||+.+ .+..|..|+++|.+++++.|+.|++.|..
T Consensus 451 -----~~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~ 520 (567)
T PLN03086 451 -----KNHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSA 520 (567)
T ss_pred -----ccCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccc
Confidence 23467999999996 688999999985 789999 99765 67999999999999999999999999952
Q ss_pred ------chhHHHHHHHhcCCCCeeecccCCcccCChhHHHHHHh-hc
Q psy13090 699 ------KDSFEYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAM-VH 738 (771)
Q Consensus 699 ------~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~-~H 738 (771)
.+.|..|+... +.+++.|..||+.|..+ .|..|+. .|
T Consensus 521 ~d~~d~~s~Lt~HE~~C--G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h 564 (567)
T PLN03086 521 MDVRDRLRGMSEHESIC--GSRTAPCDSCGRSVMLK-EMDIHQIAVH 564 (567)
T ss_pred cchhhhhhhHHHHHHhc--CCcceEccccCCeeeeh-hHHHHHHHhh
Confidence 35899999986 58999999999998655 5788865 44
No 14
>PHA00733 hypothetical protein
Probab=99.05 E-value=1.7e-10 Score=99.74 Aligned_cols=83 Identities=16% Similarity=0.247 Sum_probs=54.3
Q ss_pred CCCccccccccccccChhHHHHH--HH---HhcCCCCccCCcCCccCCCchhHHHHHHHhcCCCCeeecccCCcccCChh
Q psy13090 655 TKRRLPCKLCGFRLKNHKQLKLH--QR---YHLSSNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSKD 729 (771)
Q Consensus 655 ~~~~~~C~~C~~~f~~~~~L~~H--~~---~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~ 729 (771)
..+++.|.+|++.|.....|..+ ++ .+.+++||.|+.|++.|.+...|..|++.|. .+|.|++|++.|....
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~---~~~~C~~CgK~F~~~~ 113 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE---HSKVCPVCGKEFRNTD 113 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC---cCccCCCCCCccCCHH
Confidence 34556666666666665555554 11 2234667777777777777777777777652 3577777777777777
Q ss_pred HHHHHHhhccC
Q psy13090 730 TLNTHAMVHAD 740 (771)
Q Consensus 730 ~L~~H~~~H~~ 740 (771)
.|..|+...++
T Consensus 114 sL~~H~~~~h~ 124 (128)
T PHA00733 114 STLDHVCKKHN 124 (128)
T ss_pred HHHHHHHHhcC
Confidence 77777765443
No 15
>KOG1146|consensus
Probab=99.04 E-value=2.9e-11 Score=135.96 Aligned_cols=104 Identities=19% Similarity=0.366 Sum_probs=84.8
Q ss_pred cccccccccccchhhhhhhhhhcCCCCCcccCCCCCccccchhhhccccccccC-------------------------C
Q psy13090 180 KCKLCNYQAVNIKGIRAHVFRKHLQNTVTYSCKKCPFETQFKVTLGDHYKTCHK-------------------------G 234 (771)
Q Consensus 180 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~-------------------------~ 234 (771)
.|.-|+..+.+...+..|+...+. -.+.|.|+.|+..|+....|..|||..|. +
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S-~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~ 516 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHS-FFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPG 516 (1406)
T ss_pred cccchhhhhhhhcccccceeeeec-ccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCC
Confidence 345566677777788888766776 45889999999999999999999999662 2
Q ss_pred cceecCCCCCcccchhhHHHHHHhc----C-----------------------------------------Ccccccccc
Q psy13090 235 ISFTCDECHLCFNTFRRFDSHIRAH----E-----------------------------------------YTEQCELCN 269 (771)
Q Consensus 235 ~~~~C~~C~~~F~~~~~L~~H~~~H----~-----------------------------------------~~~~C~~C~ 269 (771)
++|.|..|..++.++.+|..||+.- + ..+.|.+|+
T Consensus 517 ~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~ 596 (1406)
T KOG1146|consen 517 KPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCS 596 (1406)
T ss_pred CcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhc
Confidence 6799999999999999999998632 0 145799999
Q ss_pred ccccChhhhhhhhcc
Q psy13090 270 EKVIGKNQIYFHYYH 284 (771)
Q Consensus 270 ~~f~~~~~L~~H~~~ 284 (771)
+...-..+|+.|+..
T Consensus 597 yetniarnlrihmts 611 (1406)
T KOG1146|consen 597 YETNIARNLRIHMTA 611 (1406)
T ss_pred chhhhhhcccccccc
Confidence 999999999999754
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.98 E-value=1.6e-10 Score=80.61 Aligned_cols=45 Identities=16% Similarity=0.209 Sum_probs=42.0
Q ss_pred cccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccchhhHHHhh
Q psy13090 33 KFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGASSQALKY 79 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~ 79 (771)
-|+|++||+.|+.+++|..|+++|+ +||+|..|++.|+..+.|..
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~~ 49 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYIE 49 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeEE
Confidence 4899999999999999999999999 79999999999999888763
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.91 E-value=5.6e-10 Score=77.88 Aligned_cols=44 Identities=16% Similarity=0.256 Sum_probs=31.7
Q ss_pred eeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhh
Q psy13090 715 TFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVR 760 (771)
Q Consensus 715 ~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~ 760 (771)
.|+|+.||+.|++.++|..|+++|+ +||+|..|++.|++.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 3677777777777777777777777 5777777777777666653
No 18
>KOG3993|consensus
Probab=98.84 E-value=5e-10 Score=110.50 Aligned_cols=141 Identities=16% Similarity=0.299 Sum_probs=113.5
Q ss_pred CccccccchhhccChHHHHHHHhhccCCCccccccccccccChhHHHHHHHHhcC-------------------------
Q psy13090 629 SQLTCDICDRVFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKNHKQLKLHQRYHLS------------------------- 683 (771)
Q Consensus 629 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~------------------------- 683 (771)
.-|.|..|...|.+.-.|.+|.-.-.---.|+|+.|+|.|.-..+|..|.|+|--
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 3499999999999999999996322222349999999999999999999999831
Q ss_pred --------CCCccCCcCCccCCCchhHHHHHHHhcCCCC-----------------------------------------
Q psy13090 684 --------SNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQ----------------------------------------- 714 (771)
Q Consensus 684 --------~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~----------------------------------------- 714 (771)
+..|.|.+|++.|.++..|++|+.+|+..+.
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a 425 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA 425 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence 1249999999999999999999988863211
Q ss_pred ----eeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhhhHHHHHhhh
Q psy13090 715 ----TFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVRNVISQVAQK 769 (771)
Q Consensus 715 ----~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 769 (771)
.-.|++|+-.+.++..-..+.+.=..+.-|.|.+|.-.|.+.-.|.+||..-|..
T Consensus 426 ~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 426 GSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred ccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 1347778877777777666666666677799999999999999999999988864
No 19
>KOG1146|consensus
Probab=98.76 E-value=1.4e-08 Score=115.01 Aligned_cols=58 Identities=21% Similarity=0.563 Sum_probs=50.9
Q ss_pred ccCCCCCcccccccccchhhhhhHHHHHHh-h------------------------cCCCCccCCCCcccchhhHHHhhh
Q psy13090 26 RIHPHKDKFMCYACYYSTYISTNMKNHIMT-H------------------------TGSKPYNCSHCTFKGASSQALKYH 80 (771)
Q Consensus 26 r~H~~~k~~~C~~C~~~f~~~~~l~~H~~~-H------------------------~~~~~~~C~~C~~~f~~~~~L~~H 80 (771)
-+|+..|-|.|+.|+..|+....|..|||. | -+-+||.|..|.+++..+.+|..|
T Consensus 458 ~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Lsih 537 (1406)
T KOG1146|consen 458 VLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIH 537 (1406)
T ss_pred eeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHH
Confidence 367778999999999999999999999998 1 123689999999999999999999
Q ss_pred hhH
Q psy13090 81 MHV 83 (771)
Q Consensus 81 ~~~ 83 (771)
|+.
T Consensus 538 lqS 540 (1406)
T KOG1146|consen 538 LQS 540 (1406)
T ss_pred HHH
Confidence 974
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.76 E-value=3.5e-09 Score=63.17 Aligned_cols=25 Identities=44% Similarity=0.885 Sum_probs=20.1
Q ss_pred hHHHHHHhhcCCCCccCCCCcccch
Q psy13090 48 NMKNHIMTHTGSKPYNCSHCTFKGA 72 (771)
Q Consensus 48 ~l~~H~~~H~~~~~~~C~~C~~~f~ 72 (771)
+|.+||++|+|++||+|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4778888888888888888888875
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.63 E-value=1.3e-08 Score=60.64 Aligned_cols=24 Identities=29% Similarity=0.559 Sum_probs=15.1
Q ss_pred HHHHHhhccCCCccccCCCCCccc
Q psy13090 731 LNTHAMVHADELKFSCQNCDFKCS 754 (771)
Q Consensus 731 L~~H~~~H~~e~~~~C~~C~~~f~ 754 (771)
|.+|+++|+|++||.|++|+++|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 566666666666666666666664
No 22
>PHA00616 hypothetical protein
Probab=98.57 E-value=2.6e-08 Score=65.94 Aligned_cols=40 Identities=8% Similarity=0.070 Sum_probs=35.5
Q ss_pred cccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccch
Q psy13090 33 KFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGA 72 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 72 (771)
||+|+.||+.|...+.|+.|++.|+|++||.|+.==-.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f~ 40 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYFR 40 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEEE
Confidence 7999999999999999999999999999999987544443
No 23
>KOG3993|consensus
Probab=98.51 E-value=1.4e-08 Score=100.45 Aligned_cols=109 Identities=21% Similarity=0.387 Sum_probs=68.7
Q ss_pred cccCCCCCcccCCchhHHHhhh--cCccceecCCCCccccchHHHHHHHhhccccccccccCCCCcccCCCCCCCCCCch
Q psy13090 290 DFNCKQCSFTTKTPKKFIEHKE--AGHFTWNCEHCDFSASKKSVLLSHLKQHVTRVTRTQHMTPDEETNEIPIEPDLSPD 367 (771)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~--~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (771)
.|.|..|...|.+.-.|.+|.- ..+..|+|+.|+|.|+-..+|..|.+.|..+........+ |....
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~-------P~k~~---- 335 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSP-------PPKQA---- 335 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCC-------Chhhh----
Confidence 4677777777777777777754 5677899999999999999999999999765322210000 00000
Q ss_pred hhhHHHHHhh---hcccCCCccccccccccccChHHHHHHHhhcC
Q psy13090 368 LQKEELVRET---RSMKMYDNYKCNICDCFFKTHSILEDHMKALH 409 (771)
Q Consensus 368 ~~~~~~~~~~---~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h 409 (771)
......+.+. ........|.|.+|++.|.....|+.|..+.+
T Consensus 336 ~~~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq 380 (500)
T KOG3993|consen 336 VETRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQ 380 (500)
T ss_pred hhhhhhhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhh
Confidence 0000111110 01122246999999999999999999974443
No 24
>PHA00616 hypothetical protein
Probab=98.30 E-value=2.1e-07 Score=61.69 Aligned_cols=33 Identities=15% Similarity=0.310 Sum_probs=21.5
Q ss_pred eeecccCCcccCChhHHHHHHhhccCCCccccC
Q psy13090 715 TFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQ 747 (771)
Q Consensus 715 ~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~ 747 (771)
||+|+.||+.|..++.|..|++.|+|+.||.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 466666666666666666666666666666654
No 25
>PHA00732 hypothetical protein
Probab=98.20 E-value=9.1e-07 Score=68.89 Aligned_cols=45 Identities=36% Similarity=0.713 Sum_probs=21.4
Q ss_pred cccccccccccChhHHHHHHHH-hcCCCCccCCcCCccCCCchhHHHHHHHh
Q psy13090 659 LPCKLCGFRLKNHKQLKLHQRY-HLSSNLFRCEKCNLGYSRKDSFEYHMTSH 709 (771)
Q Consensus 659 ~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~F~~~~~L~~H~~~H 709 (771)
|.|+.||+.|.+..+|+.|++. |. ++.|+.||+.|. .|..|+.+.
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 4455555555555555555542 32 234555555554 344444443
No 26
>PHA00732 hypothetical protein
Probab=98.10 E-value=2.4e-06 Score=66.50 Aligned_cols=48 Identities=23% Similarity=0.468 Sum_probs=32.8
Q ss_pred CccCCcCCccCCCchhHHHHHHH-hcCCCCeeecccCCcccCChhHHHHHHhhccC
Q psy13090 686 LFRCEKCNLGYSRKDSFEYHMTS-HNQKEQTFMCTQCDFTSSSKDTLNTHAMVHAD 740 (771)
Q Consensus 686 ~~~C~~C~~~F~~~~~L~~H~~~-H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 740 (771)
||.|+.|++.|.+..+|+.|++. |. ++.|+.||++|. .|..|+++..+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~----~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT----LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC----CCccCCCCCEeC---ChhhhhcccCC
Confidence 56777777777777777777774 43 246777777776 46677766544
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.86 E-value=2.1e-05 Score=56.85 Aligned_cols=50 Identities=28% Similarity=0.574 Sum_probs=33.2
Q ss_pred cccccchhhccCHHHHHHHHhhccCCC-CCcccCcchhhcCCHHHHHHHHHhhh
Q psy13090 448 RQCEYCSQLFRRASAVEKHIALKHTGD-KPYSCYLCEEKFENSESLRTHMNSLH 500 (771)
Q Consensus 448 ~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~L~~H~~~~H 500 (771)
|.|++|++ ..+...|..|+...|..+ +.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 66777777 455667777777667664 4577777776544 37777776544
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.77 E-value=1.3e-05 Score=57.99 Aligned_cols=51 Identities=18% Similarity=0.367 Sum_probs=30.6
Q ss_pred ccccccccccccchhhhhhhhhhcCCCCCcccCCCCCccccchhhhcccccccc
Q psy13090 179 LKCKLCNYQAVNIKGIRAHVFRKHLQNTVTYSCKKCPFETQFKVTLGDHYKTCH 232 (771)
Q Consensus 179 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h 232 (771)
|.|+.|++ ..+..+|..|+...|..+.+.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 66777777 44456677776666664445566666666433 26666665544
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.66 E-value=3e-05 Score=44.92 Aligned_cols=20 Identities=35% Similarity=0.536 Sum_probs=8.6
Q ss_pred ecccCCcccCChhHHHHHHh
Q psy13090 717 MCTQCDFTSSSKDTLNTHAM 736 (771)
Q Consensus 717 ~C~~C~~~f~~~~~L~~H~~ 736 (771)
+|+.|++.|.++..|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 2 KCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCccCCHHHHHHHHh
Confidence 34444444444444444443
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.59 E-value=3.1e-05 Score=44.86 Aligned_cols=22 Identities=27% Similarity=0.756 Sum_probs=12.1
Q ss_pred ccCCcCCccCCCchhHHHHHHH
Q psy13090 687 FRCEKCNLGYSRKDSFEYHMTS 708 (771)
Q Consensus 687 ~~C~~C~~~F~~~~~L~~H~~~ 708 (771)
|+|++|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555554
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.44 E-value=6.4e-05 Score=63.24 Aligned_cols=73 Identities=15% Similarity=0.282 Sum_probs=17.2
Q ss_pred ccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHHHHhcCCCCeeecccCCcccCChhHHHHHHhhc
Q psy13090 660 PCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAMVH 738 (771)
Q Consensus 660 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H 738 (771)
+|..|+..|.+...|..|+...++-.. + ....+.....+..+++.-. ...+.|.+|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~--~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV--KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Ccccccccccccccccccccccccccc---c-ccccccccccccccccccc--CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 477777777777777777764443211 1 1122223444444443322 23577777777777777777777743
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.33 E-value=0.00012 Score=42.91 Aligned_cols=23 Identities=35% Similarity=0.612 Sum_probs=10.2
Q ss_pred eecccCCcccCChhHHHHHHhhc
Q psy13090 716 FMCTQCDFTSSSKDTLNTHAMVH 738 (771)
Q Consensus 716 ~~C~~C~~~f~~~~~L~~H~~~H 738 (771)
|.|++|++.|.+...|+.|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555554443
No 33
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.29 E-value=6.9e-05 Score=71.86 Aligned_cols=75 Identities=17% Similarity=0.441 Sum_probs=48.2
Q ss_pred CCCCccCCc--CCccCCCchhHHHHHHHhcCCCCeeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhh
Q psy13090 683 SSNLFRCEK--CNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVR 760 (771)
Q Consensus 683 ~~~~~~C~~--C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~ 760 (771)
++|||+|++ |+|.+.+...|+.||.--+...+...=+ +-..|.-.-...+||.|++|+|++..+..|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLK 415 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLK 415 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCccce
Confidence 469999976 9999999999999987432111111000 0001111123468888888888888888888
Q ss_pred hHHHHHh
Q psy13090 761 NVISQVA 767 (771)
Q Consensus 761 ~H~~~~H 767 (771)
-|+...|
T Consensus 416 YHr~Hsh 422 (423)
T COG5189 416 YHRKHSH 422 (423)
T ss_pred ecccccC
Confidence 8876555
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.28 E-value=0.00019 Score=41.98 Aligned_cols=24 Identities=21% Similarity=0.348 Sum_probs=11.7
Q ss_pred cccCCCCCccchhhhhhhHHHHHh
Q psy13090 744 FSCQNCDFKCSKLSCVRNVISQVA 767 (771)
Q Consensus 744 ~~C~~C~~~f~~~~~l~~H~~~~H 767 (771)
|.|++|++.|.++..|+.|+.++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 345555555555555555555544
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.28 E-value=0.00016 Score=60.78 Aligned_cols=73 Identities=18% Similarity=0.303 Sum_probs=20.6
Q ss_pred cccccccccccchhhhhhhhhhcCCCCCcccCCCCCccccchhhhccccccccCCcceecCCCCCcccchhhHHHHHHhc
Q psy13090 180 KCKLCNYQAVNIKGIRAHVFRKHLQNTVTYSCKKCPFETQFKVTLGDHYKTCHKGISFTCDECHLCFNTFRRFDSHIRAH 259 (771)
Q Consensus 180 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~F~~~~~L~~H~~~H 259 (771)
+|.+|+..|.+...|..|+...|. -..+ ....+.....|..+++. -....+.|..|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~-~~~~-----~~~~l~~~~~~~~~~~~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHG-FDIP-----DQKYLVDPNRLLNYLRK-KVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccc-cccc-----ccccccccccccccccc-ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 488999999999999999877775 2211 12222344445455443 2224689999999999999999999864
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.12 E-value=0.00032 Score=42.44 Aligned_cols=24 Identities=38% Similarity=0.595 Sum_probs=13.1
Q ss_pred eeecccCCcccCChhHHHHHHhhc
Q psy13090 715 TFMCTQCDFTSSSKDTLNTHAMVH 738 (771)
Q Consensus 715 ~~~C~~C~~~f~~~~~L~~H~~~H 738 (771)
||.|+.|++.|.+...|..|++.|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCCccCCccCChhHHHHHhHHh
Confidence 355555555555555555555544
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.02 E-value=0.00031 Score=67.50 Aligned_cols=28 Identities=29% Similarity=0.692 Sum_probs=23.1
Q ss_pred ccccccccc--cccccccchhhhhhhhhhc
Q psy13090 175 EAEDLKCKL--CNYQAVNIKGIRAHVFRKH 202 (771)
Q Consensus 175 ~~~~~~C~~--C~~~f~~~~~l~~H~~~~h 202 (771)
++++|+|++ |++.+.+...|+.|++--|
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH 375 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGH 375 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccc
Confidence 358999987 9999999999999985444
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.98 E-value=0.0003 Score=42.53 Aligned_cols=25 Identities=28% Similarity=0.618 Sum_probs=14.2
Q ss_pred CccCCcCCccCCCchhHHHHHHHhc
Q psy13090 686 LFRCEKCNLGYSRKDSFEYHMTSHN 710 (771)
Q Consensus 686 ~~~C~~C~~~F~~~~~L~~H~~~H~ 710 (771)
||+|+.|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 3555555555555555555555553
No 39
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.76 E-value=0.001 Score=45.21 Aligned_cols=41 Identities=20% Similarity=0.224 Sum_probs=19.8
Q ss_pred HHHHHHhcCCCCeeecccCCcccCChhHHHHHHhhccCCCc
Q psy13090 703 EYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAMVHADELK 743 (771)
Q Consensus 703 ~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~ 743 (771)
..+.+.+...+.|-.|++|+..+.+..+|++|+.++++.+|
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 44444454456666777777777777777777766655544
No 40
>KOG2231|consensus
Probab=96.56 E-value=0.004 Score=68.16 Aligned_cols=47 Identities=28% Similarity=0.545 Sum_probs=24.2
Q ss_pred ccCcchhhcCCHHHHHHHHHhhhcCCCccccCCC------CccCCChhhhhhhHHhhh
Q psy13090 478 SCYLCEEKFENSESLRTHMNSLHALLFPYSCNIC------FRRNSKYTHLSRHKLLVH 529 (771)
Q Consensus 478 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C------~~~f~~~~~l~~H~~~~h 529 (771)
.|..|...|-....|.+|++..| |.|..| +.-|.....|..|.+..|
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC
Confidence 35555556666666655555333 334444 234555555666655444
No 41
>PRK04860 hypothetical protein; Provisional
Probab=96.41 E-value=0.0014 Score=59.11 Aligned_cols=41 Identities=17% Similarity=0.319 Sum_probs=36.3
Q ss_pred CcccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccchhhHH
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGASSQA 76 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~ 76 (771)
-||.|. |+. ...++.+|.++|+|+++|.|..|+..|.....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEecc
Confidence 489998 998 77889999999999999999999999876543
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.36 E-value=0.0027 Score=37.78 Aligned_cols=19 Identities=32% Similarity=0.592 Sum_probs=7.6
Q ss_pred cccCCcccCChhHHHHHHh
Q psy13090 718 CTQCDFTSSSKDTLNTHAM 736 (771)
Q Consensus 718 C~~C~~~f~~~~~L~~H~~ 736 (771)
|+.|++.|.+...|..|++
T Consensus 3 C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 3 CPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCcchhCCHHHHHHHHH
Confidence 3334444444444444433
No 43
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.34 E-value=0.0039 Score=60.88 Aligned_cols=135 Identities=26% Similarity=0.478 Sum_probs=94.6
Q ss_pred cccccc--ccccccChHHHHHHHhhcCC-ccCCCCC---CCCC------CHHHHHHHHHhcCCCCCCCCccCcCcccccc
Q psy13090 385 NYKCNI--CDCFFKTHSILEDHMKALHS-FPCDKCD---YVGG------NLSMLTSHQNSHNSQTEVPPILSKNKRQCEY 452 (771)
Q Consensus 385 ~~~C~~--C~~~f~~~~~L~~H~~~~h~-~~C~~C~---~~f~------~~~~l~~H~~~h~~~~~~~~~~~~~~~~C~~ 452 (771)
.|.|+. |.........|..|.+..|. +-|.+|- +.|. +...|+.|...-..+..+ +..-.|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GF-----KGHP~C~F 225 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGF-----KGHPLCIF 225 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccccCCccccCc-----CCCchhhh
Confidence 477864 77777778899999998776 6788874 3343 345566665433322111 12235999
Q ss_pred chhhccCHHHHHHHHhhccCCCCCcccCcchh-------hcCCHHHHHHHHHhhhcCCCccccCC--CC----ccCCChh
Q psy13090 453 CSQLFRRASAVEKHIALKHTGDKPYSCYLCEE-------KFENSESLRTHMNSLHALLFPYSCNI--CF----RRNSKYT 519 (771)
Q Consensus 453 C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~-------~f~~~~~L~~H~~~~H~~~~~~~C~~--C~----~~f~~~~ 519 (771)
|...|-+-..|.+|++..|. .|-+|++ -|.+-.+|..|.+..| |.|.+ |. ..|....
T Consensus 226 C~~~FYdDDEL~~HcR~~HE-----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~ 295 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRHE-----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHT 295 (493)
T ss_pred ccceecChHHHHHHHHhhhh-----hhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHH
Confidence 99999999999999998774 4777765 4888899999987555 44543 32 3688889
Q ss_pred hhhhhHHhhhccccC
Q psy13090 520 HLSRHKLLVHQKQID 534 (771)
Q Consensus 520 ~l~~H~~~~h~~~~~ 534 (771)
.|..|+...|.....
T Consensus 296 el~~h~~~~h~~~~~ 310 (493)
T COG5236 296 ELLEHLTRFHKVNAR 310 (493)
T ss_pred HHHHHHHHHhhcccc
Confidence 999999988876543
No 44
>PRK04860 hypothetical protein; Provisional
Probab=96.23 E-value=0.0026 Score=57.40 Aligned_cols=40 Identities=20% Similarity=0.391 Sum_probs=31.7
Q ss_pred CeeecccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhh
Q psy13090 714 QTFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLS 757 (771)
Q Consensus 714 ~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~ 757 (771)
-+|.|. |+. ....+++|.++|+++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 468887 887 6677888888888888888888888876544
No 45
>KOG2785|consensus
Probab=96.20 E-value=0.015 Score=58.32 Aligned_cols=28 Identities=29% Similarity=0.274 Sum_probs=21.4
Q ss_pred CccccccccccccChHHHHHHHhhcCCc
Q psy13090 384 DNYKCNICDCFFKTHSILEDHMKALHSF 411 (771)
Q Consensus 384 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~ 411 (771)
.|-.|-.|++.+.+...-..||...|+|
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~Hgf 192 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGF 192 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCC
Confidence 3467888888888888888888777764
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.20 E-value=0.0032 Score=42.88 Aligned_cols=32 Identities=22% Similarity=0.312 Sum_probs=18.2
Q ss_pred CCCcccccccccchhhhhhHHHHHHhhcCCCC
Q psy13090 30 HKDKFMCYACYYSTYISTNMKNHIMTHTGSKP 61 (771)
Q Consensus 30 ~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~ 61 (771)
.+.|-.||+|+..+++..+|.+|+.++.+.||
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 45677777777777777777777766666554
No 47
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.19 E-value=0.0034 Score=37.29 Aligned_cols=23 Identities=26% Similarity=0.488 Sum_probs=12.4
Q ss_pred ccccccccchhhhhhHHHHHHhh
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTH 56 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H 56 (771)
|+|+.|+++|.....|..|+++|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34555555555555555555544
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.13 E-value=0.004 Score=36.29 Aligned_cols=23 Identities=17% Similarity=0.570 Sum_probs=13.6
Q ss_pred cccCCCCCccchhhhhhhHHHHHh
Q psy13090 744 FSCQNCDFKCSKLSCVRNVISQVA 767 (771)
Q Consensus 744 ~~C~~C~~~f~~~~~l~~H~~~~H 767 (771)
|+|+.|++..+ ++.|.+|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56666666666 666666666665
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.12 E-value=0.0041 Score=36.24 Aligned_cols=23 Identities=30% Similarity=0.785 Sum_probs=11.8
Q ss_pred eecccCCcccCChhHHHHHHhhcc
Q psy13090 716 FMCTQCDFTSSSKDTLNTHAMVHA 739 (771)
Q Consensus 716 ~~C~~C~~~f~~~~~L~~H~~~H~ 739 (771)
|+|+.|++... +..|.+|+++|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666665555 555666655543
No 50
>KOG2231|consensus
Probab=96.12 E-value=0.012 Score=64.54 Aligned_cols=62 Identities=19% Similarity=0.378 Sum_probs=40.5
Q ss_pred cccccccccccChhhhhhhhccccCCCcccCCCCCcccCCchhHHHhhhcCccceecCCCCcc---------ccchHHHH
Q psy13090 263 EQCELCNEKVIGKNQIYFHYYHEHEGYDFNCKQCSFTTKTPKKFIEHKEAGHFTWNCEHCDFS---------ASKKSVLL 333 (771)
Q Consensus 263 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~---------f~~~~~L~ 333 (771)
+.|.+|+..|.-.. ..-.|..| -.|.+...|..|+...|..+.|.+|-.. ..+...|.
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~e~k~Yt~~el~ 166 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFINERKLYTRAELN 166 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeeeeeehehHHHHH
Confidence 57888887664221 02257777 7778888888888877777778777331 23445666
Q ss_pred HHHh
Q psy13090 334 SHLK 337 (771)
Q Consensus 334 ~H~~ 337 (771)
.|+.
T Consensus 167 ~h~~ 170 (669)
T KOG2231|consen 167 LHLM 170 (669)
T ss_pred HHHh
Confidence 6654
No 51
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.96 E-value=0.0075 Score=58.96 Aligned_cols=129 Identities=19% Similarity=0.367 Sum_probs=92.3
Q ss_pred cccccc--chhhccChHHHHHHHhhccCCCcccccccc---cccc------ChhHHHHHHHHhcCCCCc----cCCcCCc
Q psy13090 630 QLTCDI--CDRVFQTDKQWKLHMGFHNTKRRLPCKLCG---FRLK------NHKQLKLHQRYHLSSNLF----RCEKCNL 694 (771)
Q Consensus 630 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~---~~f~------~~~~L~~H~~~H~~~~~~----~C~~C~~ 694 (771)
.|.|+. |..+...-..|+.|.+.-++ .+.|.+|- +.|. ++..|+.|...-..+..| .|..|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 377765 77776677889999887655 36777773 3443 466777776654333233 5999999
Q ss_pred cCCCchhHHHHHHHhcCCCCeeecccCCc-------ccCChhHHHHHHhhccCCCccccC--CCC----Cccchhhhhhh
Q psy13090 695 GYSRKDSFEYHMTSHNQKEQTFMCTQCDF-------TSSSKDTLNTHAMVHADELKFSCQ--NCD----FKCSKLSCVRN 761 (771)
Q Consensus 695 ~F~~~~~L~~H~~~H~~~~~~~~C~~C~~-------~f~~~~~L~~H~~~H~~e~~~~C~--~C~----~~f~~~~~l~~ 761 (771)
.|.+-..|..|+|.-+ |+ |-+|++ -|.+-.+|..|.+.- -|.|. .|- +.|...-.|..
T Consensus 229 ~FYdDDEL~~HcR~~H--E~---ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~~el~~ 299 (493)
T COG5236 229 YFYDDDELRRHCRLRH--EA---CHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYHTELLE 299 (493)
T ss_pred eecChHHHHHHHHhhh--hh---hhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccHHHHHH
Confidence 9999999999999876 44 666654 477888999997632 26663 342 56888889999
Q ss_pred HHHHHhhh
Q psy13090 762 VISQVAQK 769 (771)
Q Consensus 762 H~~~~H~~ 769 (771)
|+.+.|..
T Consensus 300 h~~~~h~~ 307 (493)
T COG5236 300 HLTRFHKV 307 (493)
T ss_pred HHHHHhhc
Confidence 99999863
No 52
>KOG2482|consensus
Probab=95.83 E-value=0.022 Score=55.87 Aligned_cols=188 Identities=19% Similarity=0.294 Sum_probs=93.8
Q ss_pred cccccccccccC-hhhhhhhhccccCCCcccCCCCCcccCCchhHHHhhhcCccceecCCCCccccchHHHHHHHhh--c
Q psy13090 263 EQCELCNEKVIG-KNQIYFHYYHEHEGYDFNCKQCSFTTKTPKKFIEHKEAGHFTWNCEHCDFSASKKSVLLSHLKQ--H 339 (771)
Q Consensus 263 ~~C~~C~~~f~~-~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~L~~H~~~--h 339 (771)
.+|-.|+..+.. ++.+..|+...|+-.. .=+.....-..|..|++..-..++|-.|.+.|..+..|+.||+. |
T Consensus 145 lqClFCn~e~lgnRs~~l~Hlf~~H~lni----GlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~H 220 (423)
T KOG2482|consen 145 LQCLFCNNEGLGNRSEILEHLFHVHGLNI----GLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRH 220 (423)
T ss_pred eEEEEecchhcccHHHHHHHHHHHhhhcc----CCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccC
Confidence 468888877654 4667777766664220 00011122345677776555567899999999999999999864 3
Q ss_pred cccccccccC---------------CCCcccCCCCCCCCCC-chhhhHHHHHhhhcccCCCccccccccccccChHHHHH
Q psy13090 340 VTRVTRTQHM---------------TPDEETNEIPIEPDLS-PDLQKEELVRETRSMKMYDNYKCNICDCFFKTHSILED 403 (771)
Q Consensus 340 ~~~~~~~~~~---------------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~ 403 (771)
....+....- ++.......+...+.. .+..-.+...+. .......|-.|.....+...|..
T Consensus 221 rrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~---a~a~~v~CLfC~~~~en~~~l~e 297 (423)
T KOG2482|consen 221 RRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDD---AEALSVVCLFCTNFYENPVFLFE 297 (423)
T ss_pred cccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCC---CCccceEEEeeccchhhHHHHHH
Confidence 2211110000 0000000000000000 000000000000 00112689999999999999999
Q ss_pred HHhhcCCccC--CCCC--CCCCCHHHHHHHHHhcCCCCCCCCccCcCccccccchhhccCHHHHHHHHh
Q psy13090 404 HMKALHSFPC--DKCD--YVGGNLSMLTSHQNSHNSQTEVPPILSKNKRQCEYCSQLFRRASAVEKHIA 468 (771)
Q Consensus 404 H~~~~h~~~C--~~C~--~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~ 468 (771)
||+.+|.|-= ..=+ .-|...-.+..-++... ..-.|..|+-.|.....|..||.
T Consensus 298 Hmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~-----------~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 298 HMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQK-----------KKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred HHHHHHHhhHHhhccccccchhhhhhHHHHHHHHh-----------hccccccccccccCcchhhhhcc
Confidence 9999886310 0001 11111111111111111 22357888888888888888875
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.64 E-value=0.0049 Score=36.36 Aligned_cols=20 Identities=30% Similarity=0.587 Sum_probs=9.4
Q ss_pred ecccCCcccCChhHHHHHHh
Q psy13090 717 MCTQCDFTSSSKDTLNTHAM 736 (771)
Q Consensus 717 ~C~~C~~~f~~~~~L~~H~~ 736 (771)
.|++|++.|.+...|+.|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 34444444444444444443
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.60 E-value=0.0089 Score=35.22 Aligned_cols=21 Identities=33% Similarity=0.839 Sum_probs=10.5
Q ss_pred ccCCcCCccCCCchhHHHHHH
Q psy13090 687 FRCEKCNLGYSRKDSFEYHMT 707 (771)
Q Consensus 687 ~~C~~C~~~F~~~~~L~~H~~ 707 (771)
|.|++|++.|.+...|+.|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 345555555555555555544
No 55
>KOG2785|consensus
Probab=94.95 E-value=0.021 Score=57.48 Aligned_cols=109 Identities=18% Similarity=0.276 Sum_probs=57.5
Q ss_pred CCCccccccccccccChHHHHHHHhhcCCccCCCCCCCCCCHHHHHHHHHhcCCCCC-CC-------CccCcCccccccc
Q psy13090 382 MYDNYKCNICDCFFKTHSILEDHMKALHSFPCDKCDYVGGNLSMLTSHQNSHNSQTE-VP-------PILSKNKRQCEYC 453 (771)
Q Consensus 382 ~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~C~~C~~~f~~~~~l~~H~~~h~~~~~-~~-------~~~~~~~~~C~~C 453 (771)
...++.|.+|.+.|.+..+...|++. ......+..|.+.-.+... .. ..+..+.-.+..+
T Consensus 65 ~~~~~~c~~c~k~~~s~~a~~~hl~S------------k~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~ 132 (390)
T KOG2785|consen 65 AESVVYCEACNKSFASPKAHENHLKS------------KKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKW 132 (390)
T ss_pred cccceehHHhhccccChhhHHHHHHH------------hhcchhhhhhhccccccchhhhhccccCccccccCCCcccch
Confidence 34678999999999999999999953 2334445555542211111 00 0011111113333
Q ss_pred hhhccCHHHHHHHHhh-----------ccCCCCCcccCcchhhcCCHHHHHHHHHhhhcC
Q psy13090 454 SQLFRRASAVEKHIAL-----------KHTGDKPYSCYLCEEKFENSESLRTHMNSLHAL 502 (771)
Q Consensus 454 ~~~f~~~~~L~~H~~~-----------~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~ 502 (771)
............+... .-....|-.|-.|+..+.+...-..||...|.-
T Consensus 133 ~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~e~~Pt~CLfC~~~~k~~e~~~~HM~~~Hgf 192 (390)
T KOG2785|consen 133 YEVDSDEDSSEEEEEDDEEEDIEEDGDDEDELIPTDCLFCDKKSKSLEENLKHMFKEHGF 192 (390)
T ss_pred hhcccccccchhhccCcchhhhhhccchhcccCCcceeecCCCcccHHHHHHHHhhccCC
Confidence 3333332222222110 001123466888888888888888888766653
No 56
>KOG2482|consensus
Probab=94.56 E-value=0.079 Score=52.15 Aligned_cols=52 Identities=25% Similarity=0.287 Sum_probs=38.3
Q ss_pred cccCcchhhcCCHHHHHHHHHhhhcCC--------------------------CccccCCCCccCCChhhhhhhHHhh
Q psy13090 477 YSCYLCEEKFENSESLRTHMNSLHALL--------------------------FPYSCNICFRRNSKYTHLSRHKLLV 528 (771)
Q Consensus 477 ~~C~~C~~~f~~~~~L~~H~~~~H~~~--------------------------~~~~C~~C~~~f~~~~~l~~H~~~~ 528 (771)
..|-.|.....+...|..||..+|.-. ..-.|-.|...|.....|..|+...
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 467778777777888888887777311 1235888889999999999998743
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.49 E-value=0.02 Score=34.48 Aligned_cols=22 Identities=32% Similarity=0.534 Sum_probs=19.1
Q ss_pred ccCCCCcccchhhHHHhhhhhH
Q psy13090 62 YNCSHCTFKGASSQALKYHMHV 83 (771)
Q Consensus 62 ~~C~~C~~~f~~~~~L~~H~~~ 83 (771)
|-|++|++.|.+..+|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7799999999999999999865
No 58
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.12 E-value=0.026 Score=62.30 Aligned_cols=58 Identities=24% Similarity=0.363 Sum_probs=53.6
Q ss_pred CcccccccccchhhhhhHHHHHH--hhcCC--CCccCC--CCcccchhhHHHhhhhhHhHHHHH
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIM--THTGS--KPYNCS--HCTFKGASSQALKYHMHVDMLQYL 89 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~ 89 (771)
.++.|..|...|+....|.+|.+ .|+++ +|+.|+ .|++.|.+...|..|..+|++-.+
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISP 351 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCc
Confidence 58999999999999999999999 89999 999999 799999999999999988876543
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.65 E-value=0.038 Score=33.20 Aligned_cols=22 Identities=23% Similarity=0.427 Sum_probs=14.4
Q ss_pred ccCCCCCccccchhhhcccccc
Q psy13090 209 YSCKKCPFETQFKVTLGDHYKT 230 (771)
Q Consensus 209 ~~C~~C~~~f~~~~~L~~H~~~ 230 (771)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5566666666666666666654
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.15 E-value=0.032 Score=61.54 Aligned_cols=123 Identities=17% Similarity=0.291 Sum_probs=88.3
Q ss_pred CccccccchhhccChHHHHHHHh--hccCC--Cccccc--cccccccChhHHHHHHHHhcCCCCccCCc--CCccCCCch
Q psy13090 629 SQLTCDICDRVFQTDKQWKLHMG--FHNTK--RRLPCK--LCGFRLKNHKQLKLHQRYHLSSNLFRCEK--CNLGYSRKD 700 (771)
Q Consensus 629 ~~~~C~~C~~~f~~~~~L~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~~~F~~~~ 700 (771)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....|..|...|++..++.+.. +...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 46899999999999999999999 89999 999999 89999999999999999998877666633 334433332
Q ss_pred h-----------------------------------HHHHHHHhcC-CCCeeecccCCcccCChhHHHHHHhhccCCCcc
Q psy13090 701 S-----------------------------------FEYHMTSHNQ-KEQTFMCTQCDFTSSSKDTLNTHAMVHADELKF 744 (771)
Q Consensus 701 ~-----------------------------------L~~H~~~H~~-~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~ 744 (771)
. +..|...|-. +...+.+..|.+.|.....|..|++.|....++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 2 2222222210 011355677777777777777777777777766
Q ss_pred ccCCCCC
Q psy13090 745 SCQNCDF 751 (771)
Q Consensus 745 ~C~~C~~ 751 (771)
.|..++.
T Consensus 448 ~~~~~~~ 454 (467)
T COG5048 448 LCSILKS 454 (467)
T ss_pred eeccccc
Confidence 6654443
No 61
>KOG2893|consensus
Probab=92.01 E-value=0.054 Score=50.22 Aligned_cols=49 Identities=22% Similarity=0.298 Sum_probs=38.4
Q ss_pred cccCCcccCChhHHHHHHhhccCCCccccCCCCCccchhhhhhhHHHHHhhhc
Q psy13090 718 CTQCDFTSSSKDTLNTHAMVHADELKFSCQNCDFKCSKLSCVRNVISQVAQKT 770 (771)
Q Consensus 718 C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 770 (771)
|=+|++.|.....|..|++. +-|+|.+|-|+..+--.|..|..+||+-|
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhket 61 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHKET 61 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhhhh
Confidence 77888888888888887663 34888888888888888888888888754
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.84 E-value=0.12 Score=30.27 Aligned_cols=19 Identities=26% Similarity=0.380 Sum_probs=10.9
Q ss_pred ccCCCCcccchhhHHHhhhh
Q psy13090 62 YNCSHCTFKGASSQALKYHM 81 (771)
Q Consensus 62 ~~C~~C~~~f~~~~~L~~H~ 81 (771)
..|++|+++| ....|..|+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHH
Confidence 3566666666 455555554
No 63
>KOG4173|consensus
Probab=91.79 E-value=0.12 Score=47.02 Aligned_cols=80 Identities=24% Similarity=0.460 Sum_probs=50.8
Q ss_pred Ccccccc--chhhccCHHHHHHHHhhccCCCCCcccCcchhhcCCHHHHHHHHHhhhc---------CCCccccCC--CC
Q psy13090 446 NKRQCEY--CSQLFRRASAVEKHIALKHTGDKPYSCYLCEEKFENSESLRTHMNSLHA---------LLFPYSCNI--CF 512 (771)
Q Consensus 446 ~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~---------~~~~~~C~~--C~ 512 (771)
..+.|++ |...|.+....+.|....|+. .|.+|.+.|.+...|..|+...|. +...|+|-+ |+
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 4566765 667777777777776555543 477777777777777777766663 233455543 66
Q ss_pred ccCCChhhhhhhHHhhh
Q psy13090 513 RRNSKYTHLSRHKLLVH 529 (771)
Q Consensus 513 ~~f~~~~~l~~H~~~~h 529 (771)
.+|.+...-..|+...|
T Consensus 154 ~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhhhHHHHhc
Confidence 66666666666665554
No 64
>KOG2893|consensus
Probab=91.37 E-value=0.08 Score=49.12 Aligned_cols=54 Identities=28% Similarity=0.639 Sum_probs=36.2
Q ss_pred cCcchhhcCCHHHHHHHHHhhhcCCCccccCCCCccCCChhhhhhhHHhhhccccCCCc
Q psy13090 479 CYLCEEKFENSESLRTHMNSLHALLFPYSCNICFRRNSKYTHLSRHKLLVHQKQIDGVG 537 (771)
Q Consensus 479 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~ 537 (771)
|-+|+..|.....|..|++.. -|+|.||.+..-+-..|..|...+|.+.++.+.
T Consensus 13 cwycnrefddekiliqhqkak-----hfkchichkkl~sgpglsihcmqvhketid~ip 66 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK-----HFKCHICHKKLFSGPGLSIHCMQVHKETIDKIP 66 (341)
T ss_pred eeecccccchhhhhhhhhhhc-----cceeeeehhhhccCCCceeehhhhhhhhhhccc
Confidence 677777777777776666543 367777777666667777777777766655443
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.35 E-value=0.16 Score=29.82 Aligned_cols=18 Identities=28% Similarity=0.534 Sum_probs=8.7
Q ss_pred ecccCCcccCChhHHHHHH
Q psy13090 717 MCTQCDFTSSSKDTLNTHA 735 (771)
Q Consensus 717 ~C~~C~~~f~~~~~L~~H~ 735 (771)
.|+.||+.| ..+.|..|+
T Consensus 4 ~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred cCCCCCCEE-CHHHHHHHH
Confidence 355555555 444444444
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.54 E-value=0.26 Score=31.68 Aligned_cols=20 Identities=25% Similarity=0.655 Sum_probs=9.7
Q ss_pred eecccCCcccCChhHHHHHH
Q psy13090 716 FMCTQCDFTSSSKDTLNTHA 735 (771)
Q Consensus 716 ~~C~~C~~~f~~~~~L~~H~ 735 (771)
|.|++|+..|.+...+..|+
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~ 23 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHL 23 (35)
T ss_pred eEccccCCccCCHHHHHHHH
Confidence 44555555554444444444
No 67
>KOG4173|consensus
Probab=89.29 E-value=0.22 Score=45.37 Aligned_cols=82 Identities=29% Similarity=0.643 Sum_probs=50.5
Q ss_pred cccccc--ccccccChHHHHHHHhhcCCccCCCCCCCCCCHHHHHHHHHhcCCCCCCCCccCcCccccccchhhccCHHH
Q psy13090 385 NYKCNI--CDCFFKTHSILEDHMKALHSFPCDKCDYVGGNLSMLTSHQNSHNSQTEVPPILSKNKRQCEYCSQLFRRASA 462 (771)
Q Consensus 385 ~~~C~~--C~~~f~~~~~L~~H~~~~h~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~C~~C~~~f~~~~~ 462 (771)
.+.|++ |...|........|..+.|... |.+|.+.|.+...
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~s-------------------------------------Cs~C~r~~Pt~hL 121 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGNS-------------------------------------CSFCKRAFPTGHL 121 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccch-------------------------------------hHHHHHhCCchhh
Confidence 356664 6667777666666665555444 5555555555555
Q ss_pred HHHHHhhcc---------CCCCCcccC--cchhhcCCHHHHHHHHHhhhcCC
Q psy13090 463 VEKHIALKH---------TGDKPYSCY--LCEEKFENSESLRTHMNSLHALL 503 (771)
Q Consensus 463 L~~H~~~~H---------~~~~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~ 503 (771)
|..|+...| .|..-|.|- .|+..|.+...-+.||-..|.-.
T Consensus 122 Ld~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 122 LDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHKYP 173 (253)
T ss_pred hhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhccCC
Confidence 555554333 344557774 48888888888888877666543
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.15 E-value=0.3 Score=31.39 Aligned_cols=22 Identities=18% Similarity=0.442 Sum_probs=14.2
Q ss_pred cccCCCCCccchhhhhhhHHHH
Q psy13090 744 FSCQNCDFKCSKLSCVRNVISQ 765 (771)
Q Consensus 744 ~~C~~C~~~f~~~~~l~~H~~~ 765 (771)
|.|++|++.|.....+..|+..
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred eEccccCCccCCHHHHHHHHCh
Confidence 5566666666666666666553
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.12 E-value=0.22 Score=31.56 Aligned_cols=25 Identities=24% Similarity=0.615 Sum_probs=19.4
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCCccCCCCccc
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFK 70 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~ 70 (771)
|+|..||+.+.... .|+.||+|+..
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~~ 26 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGAP 26 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCCc
Confidence 78999998876432 78999999753
No 70
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=87.79 E-value=0.37 Score=30.55 Aligned_cols=10 Identities=20% Similarity=0.640 Sum_probs=5.1
Q ss_pred CccccCCCCC
Q psy13090 742 LKFSCQNCDF 751 (771)
Q Consensus 742 ~~~~C~~C~~ 751 (771)
.|+.|++||.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4455555553
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.10 E-value=0.33 Score=33.77 Aligned_cols=30 Identities=10% Similarity=0.158 Sum_probs=15.2
Q ss_pred CCCccccCCCCCccchhhhhhhHHHHHhhh
Q psy13090 740 DELKFSCQNCDFKCSKLSCVRNVISQVAQK 769 (771)
Q Consensus 740 ~e~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 769 (771)
||.-+.|+-||.-|....+..+|+...|.|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~ 43 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHGW 43 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhhh
Confidence 444455555555555555555555555443
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=80.39 E-value=0.69 Score=39.22 Aligned_cols=26 Identities=23% Similarity=0.672 Sum_probs=20.3
Q ss_pred Ccccc----ccccccccChHHHHHHHhhcC
Q psy13090 384 DNYKC----NICDCFFKTHSILEDHMKALH 409 (771)
Q Consensus 384 ~~~~C----~~C~~~f~~~~~L~~H~~~~h 409 (771)
..|.| ..|+....+...+..|++..|
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 55888 888888888888888886655
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.98 E-value=3.2 Score=34.70 Aligned_cols=33 Identities=18% Similarity=0.455 Sum_probs=23.4
Q ss_pred ccCCCCCccccchhhhccccccccCCcceecCCCCCcccchhhHHHH
Q psy13090 209 YSCKKCPFETQFKVTLGDHYKTCHKGISFTCDECHLCFNTFRRFDSH 255 (771)
Q Consensus 209 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~F~~~~~L~~H 255 (771)
|.|+.|+... -..|-.|++||.+......|.+-
T Consensus 2 Y~CPrC~skv--------------C~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 2 YFCPQCRAKV--------------CELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred ccCCCCCCCc--------------cCCCCcCCcCCCEEeccchHHHh
Confidence 6778775433 22577899998888887777654
No 74
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=79.25 E-value=1.3 Score=28.25 Aligned_cols=10 Identities=20% Similarity=1.039 Sum_probs=5.4
Q ss_pred eecccCCccc
Q psy13090 716 FMCTQCDFTS 725 (771)
Q Consensus 716 ~~C~~C~~~f 725 (771)
|+|.+||..+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 5555555544
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=78.99 E-value=3.1 Score=35.21 Aligned_cols=24 Identities=13% Similarity=0.429 Sum_probs=18.2
Q ss_pred ccc----CCCCCccchhhhhhhHHHHHh
Q psy13090 744 FSC----QNCDFKCSKLSCVRNVISQVA 767 (771)
Q Consensus 744 ~~C----~~C~~~f~~~~~l~~H~~~~H 767 (771)
|.| ..|++.+.++.++++|++..|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 777 777777777777777777776
No 76
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.14 E-value=0.64 Score=44.80 Aligned_cols=43 Identities=16% Similarity=0.390 Sum_probs=25.5
Q ss_pred CeeecccCCcccCChhHHHHHHhh---ccC-------CCc-----cccCCCCCccchh
Q psy13090 714 QTFMCTQCDFTSSSKDTLNTHAMV---HAD-------ELK-----FSCQNCDFKCSKL 756 (771)
Q Consensus 714 ~~~~C~~C~~~f~~~~~L~~H~~~---H~~-------e~~-----~~C~~C~~~f~~~ 756 (771)
+.+.||+|+..|.++..+....++ .++ ..| ..|+.||++|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 456677777777666555444432 222 222 4788888888754
No 77
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.45 E-value=1.8 Score=41.68 Aligned_cols=42 Identities=17% Similarity=0.325 Sum_probs=22.6
Q ss_pred cccCCCCCccccchhhhccccccccC---------C-c-----ceecCCCCCcccch
Q psy13090 208 TYSCKKCPFETQFKVTLGDHYKTCHK---------G-I-----SFTCDECHLCFNTF 249 (771)
Q Consensus 208 ~~~C~~C~~~f~~~~~L~~H~~~~h~---------~-~-----~~~C~~C~~~F~~~ 249 (771)
.+.||+|+..|.+...+....+.... + . ...|+.||.+|...
T Consensus 5 ~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred ceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 35566666666555554444433111 1 1 24788888777643
No 78
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.61 E-value=2.4 Score=35.55 Aligned_cols=35 Identities=29% Similarity=0.653 Sum_probs=28.7
Q ss_pred ccCCceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCc
Q psy13090 540 EIGEKYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQ 585 (771)
Q Consensus 540 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 585 (771)
..|.+..|+.||..|+.. +..|-.|+.||..|.-.
T Consensus 5 elGtKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 5 ELGTKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 446678899999999874 34788999999999876
No 79
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=74.06 E-value=4.3 Score=33.93 Aligned_cols=54 Identities=15% Similarity=0.428 Sum_probs=31.0
Q ss_pred cCCcCCccCCCchhHHHHHHHhcCCCCeeecccCCcccCChhHHHHHHhhccCCCccccCCCC
Q psy13090 688 RCEKCNLGYSRKDSFEYHMTSHNQKEQTFMCTQCDFTSSSKDTLNTHAMVHADELKFSCQNCD 750 (771)
Q Consensus 688 ~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~e~~~~C~~C~ 750 (771)
.|--|...|........= .-. ....|+|+.|...|--.-++-.|...|. |+.|.
T Consensus 57 ~C~~C~~~f~~~~~~~~~--~~~-~~~~y~C~~C~~~FC~dCD~fiHe~Lh~------CPGC~ 110 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPFD--ELK-DSHRYVCAVCKNVFCVDCDVFVHESLHC------CPGCI 110 (112)
T ss_pred cccCcCCCCCCccccccc--ccc-cccceeCCCCCCccccccchhhhhhccC------CcCCC
Confidence 377777777654311100 001 2345778888877777777777766664 66554
No 80
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=73.93 E-value=2.1 Score=29.94 Aligned_cols=30 Identities=23% Similarity=0.480 Sum_probs=22.4
Q ss_pred CCCCCcccCcchhhcCCHHHHHHHHHhhhc
Q psy13090 472 TGDKPYSCYLCEEKFENSESLRTHMNSLHA 501 (771)
Q Consensus 472 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 501 (771)
.|+.-+.|+-|+..|....+..+|+...|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 455667788888888888888888776664
No 81
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.99 E-value=3.2 Score=34.73 Aligned_cols=30 Identities=20% Similarity=0.380 Sum_probs=25.6
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccchhh
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGASS 74 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~ 74 (771)
..|+.||+.|... +..|-.||.||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 4899999999953 46799999999999866
No 82
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=68.62 E-value=2.8 Score=37.96 Aligned_cols=23 Identities=30% Similarity=0.797 Sum_probs=20.2
Q ss_pred cccccccccchhhhhhHHHHHHhhcCCCCccCCCCc
Q psy13090 33 KFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCT 68 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~ 68 (771)
-|.|++||+.+. |+-|-+||+|+
T Consensus 134 ~~vC~vCGy~~~-------------ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTHE-------------GEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCccc-------------CCCCCcCCCCC
Confidence 699999998754 68899999997
No 83
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=67.33 E-value=3.1 Score=27.14 Aligned_cols=34 Identities=15% Similarity=0.377 Sum_probs=25.0
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccch
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGA 72 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 72 (771)
.+||.|+..|.....- +-.+.+..+|+.|+..|.
T Consensus 3 i~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred EECCCCCceEEcCHHH-----cccCCcEEECCCCCcEee
Confidence 4799999999866532 223456789999998874
No 84
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=66.14 E-value=4.3 Score=26.30 Aligned_cols=11 Identities=18% Similarity=0.658 Sum_probs=5.2
Q ss_pred eeecccCCccc
Q psy13090 715 TFMCTQCDFTS 725 (771)
Q Consensus 715 ~~~C~~C~~~f 725 (771)
..+|+.|+..|
T Consensus 25 ~v~C~~C~~~f 35 (36)
T PF13717_consen 25 KVRCSKCGHVF 35 (36)
T ss_pred EEECCCCCCEe
Confidence 34455554443
No 85
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=66.06 E-value=3.2 Score=28.63 Aligned_cols=31 Identities=13% Similarity=0.388 Sum_probs=22.3
Q ss_pred CcccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccch
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGA 72 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 72 (771)
..|.|+.||..|.... ......|+.||..+.
T Consensus 2 ~~y~C~~CG~~~~~~~----------~~~~~~Cp~CG~~~~ 32 (46)
T PRK00398 2 AEYKCARCGREVELDE----------YGTGVRCPYCGYRIL 32 (46)
T ss_pred CEEECCCCCCEEEECC----------CCCceECCCCCCeEE
Confidence 3689999999887432 112789999987654
No 86
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=64.78 E-value=5.6 Score=26.01 Aligned_cols=13 Identities=15% Similarity=0.401 Sum_probs=6.5
Q ss_pred cCCCCCccccchh
Q psy13090 210 SCKKCPFETQFKV 222 (771)
Q Consensus 210 ~C~~C~~~f~~~~ 222 (771)
.|+.|+..|.-..
T Consensus 4 ~CP~C~~~~~v~~ 16 (38)
T TIGR02098 4 QCPNCKTSFRVVD 16 (38)
T ss_pred ECCCCCCEEEeCH
Confidence 4555555554433
No 87
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=64.01 E-value=4.4 Score=27.72 Aligned_cols=22 Identities=18% Similarity=0.350 Sum_probs=8.4
Q ss_pred CeeecccCCcccCCh----hHHHHHH
Q psy13090 714 QTFMCTQCDFTSSSK----DTLNTHA 735 (771)
Q Consensus 714 ~~~~C~~C~~~f~~~----~~L~~H~ 735 (771)
....|.+|++.+... +.|.+|+
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 334555555555432 4444444
No 88
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=63.02 E-value=5.1 Score=36.38 Aligned_cols=10 Identities=30% Similarity=0.750 Sum_probs=4.8
Q ss_pred CCeeecccCC
Q psy13090 713 EQTFMCTQCD 722 (771)
Q Consensus 713 ~~~~~C~~C~ 722 (771)
+-|-+||+||
T Consensus 147 e~P~~CPiCg 156 (166)
T COG1592 147 EAPEVCPICG 156 (166)
T ss_pred CCCCcCCCCC
Confidence 4444455554
No 89
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=62.93 E-value=4.9 Score=23.81 Aligned_cols=10 Identities=30% Similarity=0.966 Sum_probs=5.9
Q ss_pred ecCCCCcccc
Q psy13090 318 NCEHCDFSAS 327 (771)
Q Consensus 318 ~C~~C~~~f~ 327 (771)
.|+.||+.|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 3666666653
No 90
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=62.80 E-value=5.9 Score=27.02 Aligned_cols=11 Identities=18% Similarity=0.881 Sum_probs=5.3
Q ss_pred eecccCCcccC
Q psy13090 716 FMCTQCDFTSS 726 (771)
Q Consensus 716 ~~C~~C~~~f~ 726 (771)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 44555554443
No 91
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=60.68 E-value=4.7 Score=37.52 Aligned_cols=41 Identities=15% Similarity=0.257 Sum_probs=30.0
Q ss_pred hcccccCCCCCcccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccc
Q psy13090 22 VAMPRIHPHKDKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKG 71 (771)
Q Consensus 22 ~~h~r~H~~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 71 (771)
..-+.--+...-|.||.|+..|+.-..+. ..|.||.||-..
T Consensus 106 k~~l~~e~~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 106 KEQLEEEENNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred HHHhhhccCCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 33334444557899999999999887763 369999998654
No 92
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=60.55 E-value=5 Score=36.12 Aligned_cols=42 Identities=10% Similarity=0.215 Sum_probs=28.9
Q ss_pred ccCCCCCcccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccc
Q psy13090 26 RIHPHKDKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKG 71 (771)
Q Consensus 26 r~H~~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 71 (771)
+--++..-|.||.|+..|+....+.. .+. +..|.||.||...
T Consensus 92 ~~e~~~~~Y~Cp~C~~~y~~~ea~~~---~d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 92 EDETNNAYYKCPNCQSKYTFLEANQL---LDM-DGTFTCPRCGEEL 133 (147)
T ss_pred hcccCCcEEECcCCCCEeeHHHHHHh---cCC-CCcEECCCCCCEE
Confidence 34456678999999999996554432 012 3449999998765
No 93
>PHA00626 hypothetical protein
Probab=60.12 E-value=7.1 Score=27.65 Aligned_cols=12 Identities=17% Similarity=0.592 Sum_probs=5.4
Q ss_pred ceecCCCCCccc
Q psy13090 236 SFTCDECHLCFN 247 (771)
Q Consensus 236 ~~~C~~C~~~F~ 247 (771)
.|.|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 344444444443
No 94
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=59.01 E-value=5.3 Score=38.12 Aligned_cols=26 Identities=23% Similarity=0.241 Sum_probs=13.7
Q ss_pred CccccCCCCCccchhhhhhhHHHHHh
Q psy13090 742 LKFSCQNCDFKCSKLSCVRNVISQVA 767 (771)
Q Consensus 742 ~~~~C~~C~~~f~~~~~l~~H~~~~H 767 (771)
-.|.|..|+|.|.-..-+++||.+.|
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred CEECCCCCCcccCChHHHHHHHhhcC
Confidence 34666666666666666666666555
No 95
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=58.85 E-value=7.7 Score=33.04 Aligned_cols=38 Identities=18% Similarity=0.304 Sum_probs=29.3
Q ss_pred ccCCceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCchhH
Q psy13090 540 EIGEKYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQSDN 588 (771)
Q Consensus 540 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L 588 (771)
..|.+..|+.||..|+.. +..|-.|+.||..|.....+
T Consensus 5 elGtKr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 5 DLGTKRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred hhCccccCCCcCcccccc-----------CCCCccCCCcCCccCcchhh
Confidence 345668899999999874 34789999999998765333
No 96
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=57.63 E-value=3 Score=27.84 Aligned_cols=31 Identities=13% Similarity=0.356 Sum_probs=21.7
Q ss_pred cccccccccchhhhhhHHHHHHhhcCCCCccCCCCccc
Q psy13090 33 KFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFK 70 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~ 70 (771)
-|+|+.||..|...... .. ..+-.|+.|+..
T Consensus 5 ~y~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~~ 35 (41)
T smart00834 5 EYRCEDCGHTFEVLQKI------SD-DPLATCPECGGD 35 (41)
T ss_pred EEEcCCCCCEEEEEEec------CC-CCCCCCCCCCCc
Confidence 48999999998754322 11 456789999873
No 97
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=57.61 E-value=7 Score=33.28 Aligned_cols=35 Identities=20% Similarity=0.111 Sum_probs=27.9
Q ss_pred cccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccchhhHHHh
Q psy13090 33 KFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGASSQALK 78 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~ 78 (771)
...|+.||+.|... +..|-.|+.||..|.-...++
T Consensus 9 Kr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 9 KRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred cccCCCcCcccccc-----------CCCCccCCCcCCccCcchhhc
Confidence 34899999999853 567999999999987665554
No 98
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=55.46 E-value=6.8 Score=24.57 Aligned_cols=27 Identities=19% Similarity=0.594 Sum_probs=17.8
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCCccCCCCcccc
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKG 71 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 71 (771)
|.|..|+..+... ...+-+|+.||..-
T Consensus 1 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~RI 27 (32)
T PF03604_consen 1 YICGECGAEVELK-----------PGDPIRCPECGHRI 27 (32)
T ss_dssp EBESSSSSSE-BS-----------TSSTSSBSSSS-SE
T ss_pred CCCCcCCCeeEcC-----------CCCcEECCcCCCeE
Confidence 6788898888732 13467899998654
No 99
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=54.89 E-value=6.6 Score=36.52 Aligned_cols=17 Identities=18% Similarity=0.811 Sum_probs=8.7
Q ss_pred CccCCcCCccCCCchhH
Q psy13090 686 LFRCEKCNLGYSRKDSF 702 (771)
Q Consensus 686 ~~~C~~C~~~F~~~~~L 702 (771)
-|.|+.|+..|+....+
T Consensus 117 ~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAM 133 (178)
T ss_pred EEECCCCCcEEeHHHHh
Confidence 35555555555554443
No 100
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=54.44 E-value=13 Score=33.92 Aligned_cols=17 Identities=12% Similarity=0.512 Sum_probs=8.1
Q ss_pred CccCCcCCccCCCchhH
Q psy13090 686 LFRCEKCNLGYSRKDSF 702 (771)
Q Consensus 686 ~~~C~~C~~~F~~~~~L 702 (771)
-|.|+.|+..|+...++
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 34455555544444444
No 101
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=54.41 E-value=11 Score=27.17 Aligned_cols=9 Identities=33% Similarity=1.294 Sum_probs=4.6
Q ss_pred ccccCCCCC
Q psy13090 743 KFSCQNCDF 751 (771)
Q Consensus 743 ~~~C~~C~~ 751 (771)
+|.|+.||+
T Consensus 50 ~Y~Cp~CGF 58 (61)
T COG2888 50 PYRCPKCGF 58 (61)
T ss_pred ceECCCcCc
Confidence 455555554
No 102
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=53.61 E-value=7.1 Score=35.16 Aligned_cols=15 Identities=40% Similarity=0.713 Sum_probs=8.6
Q ss_pred eecCCCCCcccchhh
Q psy13090 237 FTCDECHLCFNTFRR 251 (771)
Q Consensus 237 ~~C~~C~~~F~~~~~ 251 (771)
+.|+.||.+|.+...
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 556666666665443
No 103
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=53.26 E-value=13 Score=33.89 Aligned_cols=34 Identities=18% Similarity=0.315 Sum_probs=26.4
Q ss_pred cCCCccccccccccccChHHHHHHHhhcCCccCCCCCCCC
Q psy13090 381 KMYDNYKCNICDCFFKTHSILEDHMKALHSFPCDKCDYVG 420 (771)
Q Consensus 381 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~C~~C~~~f 420 (771)
....-|.|+.|+..|+...++. ..|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH------cCCcCCCCCCEe
Confidence 3345689999999999888875 368888888654
No 104
>KOG2186|consensus
Probab=53.19 E-value=9 Score=36.76 Aligned_cols=46 Identities=24% Similarity=0.511 Sum_probs=26.0
Q ss_pred cccccchhhccChHHHHHHHhhccCCCccccccccccccChhHHHHHHH
Q psy13090 631 LTCDICDRVFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKNHKQLKLHQR 679 (771)
Q Consensus 631 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~ 679 (771)
|.|.+||.+...+ .+..|+..-.+ .-|.|-.|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 5666666665433 44446655544 44666666666655 55555544
No 105
>PF12907 zf-met2: Zinc-binding
Probab=53.13 E-value=11 Score=24.98 Aligned_cols=29 Identities=24% Similarity=0.470 Sum_probs=16.1
Q ss_pred cccccchhhc---cCHHHHHHHHhhccCCCCC
Q psy13090 448 RQCEYCSQLF---RRASAVEKHIALKHTGDKP 476 (771)
Q Consensus 448 ~~C~~C~~~f---~~~~~L~~H~~~~H~~~~~ 476 (771)
++|.+|-..| .+...|+.|....|....+
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~ 33 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTF 33 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCH
Confidence 3566666444 3445566666666655433
No 106
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=52.95 E-value=5.2 Score=28.42 Aligned_cols=10 Identities=30% Similarity=0.893 Sum_probs=4.5
Q ss_pred ccCCCCCccc
Q psy13090 209 YSCKKCPFET 218 (771)
Q Consensus 209 ~~C~~C~~~f 218 (771)
|+|..|+..|
T Consensus 6 y~C~~Cg~~f 15 (52)
T TIGR02605 6 YRCTACGHRF 15 (52)
T ss_pred EEeCCCCCEe
Confidence 4444444444
No 107
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=51.77 E-value=9 Score=26.66 Aligned_cols=10 Identities=20% Similarity=0.992 Sum_probs=5.4
Q ss_pred ccCCcCCccC
Q psy13090 687 FRCEKCNLGY 696 (771)
Q Consensus 687 ~~C~~C~~~F 696 (771)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 5555555555
No 108
>KOG1280|consensus
Probab=51.16 E-value=11 Score=38.06 Aligned_cols=22 Identities=41% Similarity=0.664 Sum_probs=12.0
Q ss_pred ceecCCCCCcccchhhHHHHHH
Q psy13090 236 SFTCDECHLCFNTFRRFDSHIR 257 (771)
Q Consensus 236 ~~~C~~C~~~F~~~~~L~~H~~ 257 (771)
.|.|++|++.=-+...|..|+.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~ 100 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVL 100 (381)
T ss_pred cccCCcccccccchhHHHHHhh
Confidence 4555555555555555555543
No 109
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=50.60 E-value=13 Score=22.07 Aligned_cols=19 Identities=21% Similarity=0.426 Sum_probs=11.0
Q ss_pred ecccCCcccCChhHHHHHHh
Q psy13090 717 MCTQCDFTSSSKDTLNTHAM 736 (771)
Q Consensus 717 ~C~~C~~~f~~~~~L~~H~~ 736 (771)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 466666666 4455556553
No 110
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=50.30 E-value=10 Score=26.69 Aligned_cols=24 Identities=25% Similarity=0.404 Sum_probs=12.2
Q ss_pred cccCCCCCccchh-----hhhhhHHHHHh
Q psy13090 744 FSCQNCDFKCSKL-----SCVRNVISQVA 767 (771)
Q Consensus 744 ~~C~~C~~~f~~~-----~~l~~H~~~~H 767 (771)
-.|..|++.++.. ++|.+|+...|
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 3455555555443 45666655444
No 111
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=49.88 E-value=12 Score=27.40 Aligned_cols=41 Identities=15% Similarity=0.359 Sum_probs=17.1
Q ss_pred Ceeeccc-CCcccCChhHHHHHHhhccCCCccccCC----CCCccc
Q psy13090 714 QTFMCTQ-CDFTSSSKDTLNTHAMVHADELKFSCQN----CDFKCS 754 (771)
Q Consensus 714 ~~~~C~~-C~~~f~~~~~L~~H~~~H~~e~~~~C~~----C~~~f~ 754 (771)
.+..|+. |+..-..+..|..|+..-=..++..|++ |+..+.
T Consensus 8 ~~v~C~~~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 3455555 2222222445556655444455555666 555544
No 112
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=49.57 E-value=8.1 Score=26.02 Aligned_cols=11 Identities=18% Similarity=0.600 Sum_probs=4.7
Q ss_pred ccccccccccc
Q psy13090 179 LKCKLCNYQAV 189 (771)
Q Consensus 179 ~~C~~C~~~f~ 189 (771)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 44444444443
No 113
>KOG2186|consensus
Probab=48.33 E-value=12 Score=35.99 Aligned_cols=49 Identities=24% Similarity=0.533 Sum_probs=39.9
Q ss_pred ccccccccccccChhHHHHHHHHhcCCCCccCCcCCccCCCchhHHHHHHHh
Q psy13090 658 RLPCKLCGFRLKNHKQLKLHQRYHLSSNLFRCEKCNLGYSRKDSFEYHMTSH 709 (771)
Q Consensus 658 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~L~~H~~~H 709 (771)
-|.|.+||...+- ..|.+|+-.-++ .-|.|-.|++.|.. .++..|..--
T Consensus 3 ~FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCI 51 (276)
T KOG2186|consen 3 FFTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCI 51 (276)
T ss_pred EEehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhc
Confidence 3789999998765 456779988777 67999999999998 7888887643
No 114
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=47.92 E-value=17 Score=26.42 Aligned_cols=7 Identities=29% Similarity=1.388 Sum_probs=2.8
Q ss_pred eecccCC
Q psy13090 716 FMCTQCD 722 (771)
Q Consensus 716 ~~C~~C~ 722 (771)
|.|+.||
T Consensus 49 Y~CP~CG 55 (59)
T PRK14890 49 YTCPKCG 55 (59)
T ss_pred eECCCCC
Confidence 3444443
No 115
>KOG2071|consensus
Probab=47.49 E-value=15 Score=40.18 Aligned_cols=25 Identities=28% Similarity=0.557 Sum_probs=13.7
Q ss_pred CccccccchhhccChHHHHHHHhhc
Q psy13090 629 SQLTCDICDRVFQTDKQWKLHMGFH 653 (771)
Q Consensus 629 ~~~~C~~C~~~f~~~~~L~~H~~~H 653 (771)
.+-+|..||..|........||.+|
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIH 441 (579)
T ss_pred CcchhcccccccccchhhhhHhhhh
Confidence 3455666666665555555555444
No 116
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=46.73 E-value=10 Score=28.97 Aligned_cols=39 Identities=13% Similarity=0.330 Sum_probs=24.8
Q ss_pred ccCCCCCcccCCchhHHHhhhcCccceecC--CCCccccch
Q psy13090 291 FNCKQCSFTTKTPKKFIEHKEAGHFTWNCE--HCDFSASKK 329 (771)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~C~--~C~~~f~~~ 329 (771)
+.|+.|+.......+-.......+.-++|. .|+.+|...
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~~~~~Y~qC~N~eCg~tF~t~ 42 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDTTKERYHQCQNVNCSATFITY 42 (72)
T ss_pred ccCCCCCCccEEEEChhcChhhheeeeecCCCCCCCEEEEE
Confidence 467777766544444333333556678888 899888764
No 117
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=46.29 E-value=13 Score=25.77 Aligned_cols=13 Identities=23% Similarity=0.513 Sum_probs=8.4
Q ss_pred cccCCCCccCCCh
Q psy13090 506 YSCNICFRRNSKY 518 (771)
Q Consensus 506 ~~C~~C~~~f~~~ 518 (771)
|+|.+|+..+.-.
T Consensus 2 y~C~~CgyvYd~~ 14 (47)
T PF00301_consen 2 YQCPVCGYVYDPE 14 (47)
T ss_dssp EEETTTSBEEETT
T ss_pred cCCCCCCEEEcCC
Confidence 6677777666543
No 118
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=45.69 E-value=6.4 Score=30.12 Aligned_cols=42 Identities=17% Similarity=0.261 Sum_probs=28.6
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCCccCC--CCcccchhhHHH
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKPYNCS--HCTFKGASSQAL 77 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~--~C~~~f~~~~~L 77 (771)
+.||.||.....+.+-...-. +.++-++|. .||.+|..-..+
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEE
Confidence 579999988854443332222 557789999 999999765543
No 119
>PF15269 zf-C2H2_7: Zinc-finger
Probab=45.63 E-value=14 Score=24.61 Aligned_cols=22 Identities=23% Similarity=0.711 Sum_probs=17.0
Q ss_pred cccccccccchhhhhhHHHHHH
Q psy13090 33 KFMCYACYYSTYISTNMKNHIM 54 (771)
Q Consensus 33 ~~~C~~C~~~f~~~~~l~~H~~ 54 (771)
.|+|-.|+.+...+++|-+||+
T Consensus 20 ~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred cceeecCCcccchHHHHHHHHH
Confidence 3578888888888888888875
No 120
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=45.10 E-value=17 Score=32.67 Aligned_cols=36 Identities=22% Similarity=0.430 Sum_probs=20.8
Q ss_pred CCCccccccccccccChHHHHH-HHhhcCCccCCCCCCC
Q psy13090 382 MYDNYKCNICDCFFKTHSILED-HMKALHSFPCDKCDYV 419 (771)
Q Consensus 382 ~~~~~~C~~C~~~f~~~~~L~~-H~~~~h~~~C~~C~~~ 419 (771)
....|.|+.|+..|....++.. .+ ...|.|+.|+..
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~~d~--~~~f~Cp~Cg~~ 132 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQLLDM--DGTFTCPRCGEE 132 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHhcCC--CCcEECCCCCCE
Confidence 3356888888888876555433 21 222666666544
No 121
>PF12907 zf-met2: Zinc-binding
Probab=43.88 E-value=17 Score=24.13 Aligned_cols=28 Identities=21% Similarity=0.422 Sum_probs=19.0
Q ss_pred cccCcchhh---cCCHHHHHHHHHhhhcCCC
Q psy13090 477 YSCYLCEEK---FENSESLRTHMNSLHALLF 504 (771)
Q Consensus 477 ~~C~~C~~~---f~~~~~L~~H~~~~H~~~~ 504 (771)
+.|.+|-.+ ..+...|+.|..+.|....
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~ 32 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNT 32 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCC
Confidence 568888843 4556778888877776543
No 122
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=43.05 E-value=51 Score=33.36 Aligned_cols=26 Identities=23% Similarity=0.327 Sum_probs=20.9
Q ss_pred CcccccccccchhhhhhHHHHHH--hhc
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIM--THT 57 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~--~H~ 57 (771)
+.+=|+.|++.|+.+.-+..|+. .|.
T Consensus 237 ~~~YC~~C~r~f~~~~VFe~Hl~gK~H~ 264 (470)
T COG5188 237 PKVYCVKCGREFSRSKVFEYHLEGKRHC 264 (470)
T ss_pred cceeeHhhhhHhhhhHHHHHHHhhhhhh
Confidence 45679999999999988888874 554
No 123
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=42.93 E-value=7.1 Score=35.17 Aligned_cols=13 Identities=31% Similarity=0.815 Sum_probs=6.7
Q ss_pred CCCcccccccCCc
Q psy13090 573 HQCPVCLRRFSRQ 585 (771)
Q Consensus 573 ~~C~~C~~~f~~~ 585 (771)
++|+.||++|.+.
T Consensus 29 ~~c~~c~~~f~~~ 41 (154)
T PRK00464 29 RECLACGKRFTTF 41 (154)
T ss_pred eeccccCCcceEe
Confidence 4555555555443
No 124
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=41.37 E-value=10 Score=26.19 Aligned_cols=40 Identities=25% Similarity=0.514 Sum_probs=24.1
Q ss_pred hhhhhcccccCCCCCcccccccccchhhhhhHHHHHHhhcCCCCccCCCCcc
Q psy13090 18 CRACVAMPRIHPHKDKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTF 69 (771)
Q Consensus 18 ~~~~~~h~r~H~~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 69 (771)
|.++...+|--.| |.||.||.. ....|. +...|+|..|++
T Consensus 6 c~~~l~~~RW~~g---~~CP~Cg~~--~~~~~~-------~~~~~~C~~C~~ 45 (46)
T PF12760_consen 6 CREYLEEIRWPDG---FVCPHCGST--KHYRLK-------TRGRYRCKACRK 45 (46)
T ss_pred HHHHHHHhcCCCC---CCCCCCCCe--eeEEeC-------CCCeEECCCCCC
Confidence 4444444555444 779999975 122111 156899999975
No 125
>PF15269 zf-C2H2_7: Zinc-finger
Probab=40.85 E-value=19 Score=24.07 Aligned_cols=21 Identities=29% Similarity=0.763 Sum_probs=12.5
Q ss_pred cccCCCCCccchhhhhhhHHH
Q psy13090 744 FSCQNCDFKCSKLSCVRNVIS 764 (771)
Q Consensus 744 ~~C~~C~~~f~~~~~l~~H~~ 764 (771)
|+|-.|.+.+.-++.|.+||.
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 556666666666666666653
No 126
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=39.40 E-value=17 Score=25.58 Aligned_cols=11 Identities=36% Similarity=0.757 Sum_probs=6.8
Q ss_pred cccCCCCccCC
Q psy13090 506 YSCNICFRRNS 516 (771)
Q Consensus 506 ~~C~~C~~~f~ 516 (771)
|+|.+|+..+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 56666666554
No 127
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=38.95 E-value=17 Score=35.05 Aligned_cols=85 Identities=21% Similarity=0.414 Sum_probs=47.3
Q ss_pred cCCcceecCCCCCcccchhhHHHHHHhc----CCccccccccccccChhhhhhhhccccCCCcccCCCCCcccCCchhHH
Q psy13090 232 HKGISFTCDECHLCFNTFRRFDSHIRAH----EYTEQCELCNEKVIGKNQIYFHYYHEHEGYDFNCKQCSFTTKTPKKFI 307 (771)
Q Consensus 232 h~~~~~~C~~C~~~F~~~~~L~~H~~~H----~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~ 307 (771)
|+++.|.|..|+. |.-...-..|+.+- ...|+|.-|++.= .|.|-.|-..|-.
T Consensus 138 hGGrif~CsfC~~-flCEDDQFEHQAsCQvLe~E~~KC~SCNrlG-----------------q~sCLRCK~cfCd----- 194 (314)
T PF06524_consen 138 HGGRIFKCSFCDN-FLCEDDQFEHQASCQVLESETFKCQSCNRLG-----------------QYSCLRCKICFCD----- 194 (314)
T ss_pred CCCeEEEeecCCC-eeeccchhhhhhhhhhhhccccccccccccc-----------------chhhhheeeeehh-----
Confidence 7888999999964 44444445666543 4577888776521 1233333322322
Q ss_pred Hhhh------cCccceecCCCCccccchHHHHHHHhhc
Q psy13090 308 EHKE------AGHFTWNCEHCDFSASKKSVLLSHLKQH 339 (771)
Q Consensus 308 ~H~~------~~~~~~~C~~C~~~f~~~~~L~~H~~~h 339 (771)
.|.+ ...+++.||.|++.......|..=.++|
T Consensus 195 dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 195 DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred hhhhhcccccccCCCCCCCCCCCcccccccceeeeecc
Confidence 2222 2335667777777666655555444444
No 128
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=38.68 E-value=17 Score=34.78 Aligned_cols=27 Identities=41% Similarity=0.760 Sum_probs=18.3
Q ss_pred CccccccchhhccCHHHHHHHHhhccC
Q psy13090 446 NKRQCEYCSQLFRRASAVEKHIALKHT 472 (771)
Q Consensus 446 ~~~~C~~C~~~f~~~~~L~~H~~~~H~ 472 (771)
..|.|.+|+|.|....-+.+|+...|.
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred CEECCCCCCcccCChHHHHHHHhhcCH
Confidence 567788888888888888888777664
No 129
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=38.59 E-value=17 Score=29.41 Aligned_cols=37 Identities=24% Similarity=0.560 Sum_probs=16.1
Q ss_pred CcccCCCCCccccchhhhccccccccCCcceecCCCCCccc
Q psy13090 207 VTYSCKKCPFETQFKVTLGDHYKTCHKGISFTCDECHLCFN 247 (771)
Q Consensus 207 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~F~ 247 (771)
+.|.|+.|+..-...-.|+.-... -...|..||.+|.
T Consensus 21 k~FtCp~Cghe~vs~ctvkk~~~~----g~~~Cg~CGls~e 57 (104)
T COG4888 21 KTFTCPRCGHEKVSSCTVKKTVNI----GTAVCGNCGLSFE 57 (104)
T ss_pred ceEecCccCCeeeeEEEEEecCce----eEEEcccCcceEE
Confidence 345555555544433332211111 2345666666554
No 130
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.35 E-value=18 Score=27.52 Aligned_cols=32 Identities=16% Similarity=0.398 Sum_probs=21.5
Q ss_pred CcccccccccchhhhhhHHHHHHhhcCCCCc-cCCCCcccc
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIMTHTGSKPY-NCSHCTFKG 71 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~-~C~~C~~~f 71 (771)
..|.|.+|+..|.... +..+-|+ .|+.|+-.+
T Consensus 11 Y~Y~c~~cg~~~dvvq--------~~~ddplt~ce~c~a~~ 43 (82)
T COG2331 11 YSYECTECGNRFDVVQ--------AMTDDPLTTCEECGARL 43 (82)
T ss_pred eEEeecccchHHHHHH--------hcccCccccChhhChHH
Confidence 3589999999987432 3334454 599997643
No 131
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=36.02 E-value=21 Score=26.06 Aligned_cols=44 Identities=14% Similarity=0.306 Sum_probs=29.6
Q ss_pred CCCcccccc--cccchhhhhhHHHHHHhhcCCCCccCCC----Ccccchhh
Q psy13090 30 HKDKFMCYA--CYYSTYISTNMKNHIMTHTGSKPYNCSH----CTFKGASS 74 (771)
Q Consensus 30 ~~k~~~C~~--C~~~f~~~~~l~~H~~~H~~~~~~~C~~----C~~~f~~~ 74 (771)
...+..|+. |...+. +..|..|+...=..++..|++ |+..+...
T Consensus 6 ~~~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~~ 55 (60)
T PF02176_consen 6 PFRPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVPRE 55 (60)
T ss_dssp TTSEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEEHH
T ss_pred CCCEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccchh
Confidence 345778988 545455 778999998777788999999 98877643
No 132
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=35.76 E-value=22 Score=30.46 Aligned_cols=25 Identities=28% Similarity=0.477 Sum_probs=21.3
Q ss_pred ccccccccchhhhhhHHHHHHhhcCCCC
Q psy13090 34 FMCYACYYSTYISTNMKNHIMTHTGSKP 61 (771)
Q Consensus 34 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~ 61 (771)
..|-++|+.|+ +|++|+.+|.|.-|
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 36999999998 48999999988765
No 133
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=34.52 E-value=17 Score=22.41 Aligned_cols=10 Identities=20% Similarity=0.959 Sum_probs=5.3
Q ss_pred CeeecccCCc
Q psy13090 714 QTFMCTQCDF 723 (771)
Q Consensus 714 ~~~~C~~C~~ 723 (771)
..|.|+.|+.
T Consensus 18 ~~~vCp~C~~ 27 (30)
T PF08274_consen 18 ELLVCPECGH 27 (30)
T ss_dssp SSEEETTTTE
T ss_pred CEEeCCcccc
Confidence 3455555553
No 134
>PRK04023 DNA polymerase II large subunit; Validated
Probab=34.29 E-value=39 Score=39.68 Aligned_cols=9 Identities=33% Similarity=0.681 Sum_probs=4.7
Q ss_pred ccCCcCCcc
Q psy13090 687 FRCEKCNLG 695 (771)
Q Consensus 687 ~~C~~C~~~ 695 (771)
+.|+.||..
T Consensus 639 frCP~CG~~ 647 (1121)
T PRK04023 639 RRCPFCGTH 647 (1121)
T ss_pred ccCCCCCCC
Confidence 455555544
No 135
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=31.97 E-value=34 Score=27.04 Aligned_cols=31 Identities=26% Similarity=0.538 Sum_probs=21.4
Q ss_pred cccCCCCCcccCCchhHHHhhhcCccceecCCCCccccc
Q psy13090 290 DFNCKQCSFTTKTPKKFIEHKEAGHFTWNCEHCDFSASK 328 (771)
Q Consensus 290 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~ 328 (771)
+|.|+.|++.. +.+.+...|.|..|+..|..
T Consensus 35 ~~~Cp~C~~~~--------VkR~a~GIW~C~kCg~~fAG 65 (89)
T COG1997 35 KHVCPFCGRTT--------VKRIATGIWKCRKCGAKFAG 65 (89)
T ss_pred CCcCCCCCCcc--------eeeeccCeEEcCCCCCeecc
Confidence 57777777542 23455667888888888864
No 136
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=31.83 E-value=31 Score=39.84 Aligned_cols=11 Identities=36% Similarity=0.688 Sum_probs=6.1
Q ss_pred cceecCCCCCc
Q psy13090 235 ISFTCDECHLC 245 (771)
Q Consensus 235 ~~~~C~~C~~~ 245 (771)
.|..|+.||..
T Consensus 474 ~p~~Cp~Cgs~ 484 (730)
T COG1198 474 IPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCC
Confidence 45556666544
No 137
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=31.03 E-value=29 Score=34.31 Aligned_cols=50 Identities=16% Similarity=0.248 Sum_probs=34.8
Q ss_pred ccCCCCCcccCCchhHHHhhhcCccceecCCCCccccchHHHHHHHhhcc
Q psy13090 291 FNCKQCSFTTKTPKKFIEHKEAGHFTWNCEHCDFSASKKSVLLSHLKQHV 340 (771)
Q Consensus 291 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 340 (771)
-.|-.|.-.|+....-..-..+....|+|+.|...|-.--..-.|...|.
T Consensus 363 ~~Cf~CQ~~fp~~~~~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 363 THCFVCQGPFPKPPVSPFDESTSSGRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred ccceeccCCCCCCCCCcccccccccceechhhhhhhhhhhHHHHHHHHhh
Confidence 35888887777654322222244567999999999988888888877664
No 138
>KOG2593|consensus
Probab=30.71 E-value=34 Score=35.93 Aligned_cols=36 Identities=19% Similarity=0.436 Sum_probs=24.6
Q ss_pred CCccccccccccccChHHHHHHHhhcCCccCCCCCC
Q psy13090 383 YDNYKCNICDCFFKTHSILEDHMKALHSFPCDKCDY 418 (771)
Q Consensus 383 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~C~~C~~ 418 (771)
...|.|+.|++.|.....++.=--..-.|.|..|+-
T Consensus 126 ~~~Y~Cp~C~kkyt~Lea~~L~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLEALQLLDNETGEFHCENCGG 161 (436)
T ss_pred cccccCCccccchhhhHHHHhhcccCceEEEecCCC
Confidence 356889999888888877765332234577777764
No 139
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=29.53 E-value=34 Score=29.57 Aligned_cols=12 Identities=25% Similarity=0.761 Sum_probs=5.8
Q ss_pred ccCCcCCccCCC
Q psy13090 687 FRCEKCNLGYSR 698 (771)
Q Consensus 687 ~~C~~C~~~F~~ 698 (771)
++|..||+.|.+
T Consensus 2 H~Ct~Cg~~f~d 13 (131)
T PF09845_consen 2 HQCTKCGRVFED 13 (131)
T ss_pred cccCcCCCCcCC
Confidence 445555555543
No 140
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=28.87 E-value=27 Score=24.16 Aligned_cols=36 Identities=11% Similarity=0.312 Sum_probs=21.0
Q ss_pred CCCCCcccCCchhHHHhhhcCccceecCC--CCccccc
Q psy13090 293 CKQCSFTTKTPKKFIEHKEAGHFTWNCEH--CDFSASK 328 (771)
Q Consensus 293 C~~C~~~f~~~~~l~~H~~~~~~~~~C~~--C~~~f~~ 328 (771)
|+.||...........+....+.-|+|.. ||.+|..
T Consensus 2 CP~Cg~~a~ir~S~~~s~~~~~~Y~qC~N~~Cg~tfv~ 39 (47)
T PF04606_consen 2 CPHCGSKARIRTSRQLSPLTRELYCQCTNPECGHTFVA 39 (47)
T ss_pred cCCCCCeeEEEEchhhCcceEEEEEEECCCcCCCEEEE
Confidence 66666555544444444445555677765 7777754
No 141
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=28.84 E-value=37 Score=29.30 Aligned_cols=12 Identities=25% Similarity=0.598 Sum_probs=7.1
Q ss_pred cccCCcccCChh
Q psy13090 718 CTQCDFTSSSKD 729 (771)
Q Consensus 718 C~~C~~~f~~~~ 729 (771)
||+|.-+|.+.+
T Consensus 124 CPvCkTSFKss~ 135 (140)
T PF05290_consen 124 CPVCKTSFKSSS 135 (140)
T ss_pred CCcccccccccc
Confidence 666666665543
No 142
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=27.81 E-value=29 Score=20.13 Aligned_cols=8 Identities=38% Similarity=1.190 Sum_probs=3.8
Q ss_pred eeecccCC
Q psy13090 715 TFMCTQCD 722 (771)
Q Consensus 715 ~~~C~~C~ 722 (771)
+|.|+.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 34455444
No 143
>PF14353 CpXC: CpXC protein
Probab=27.68 E-value=27 Score=30.48 Aligned_cols=21 Identities=19% Similarity=0.502 Sum_probs=10.8
Q ss_pred cccCCCCCccccchhhhcccc
Q psy13090 208 TYSCKKCPFETQFKVTLGDHY 228 (771)
Q Consensus 208 ~~~C~~C~~~f~~~~~L~~H~ 228 (771)
.|.|+.||..|.-...+.-|-
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred EEECCCCCCceecCCCEEEEc
Confidence 355555555555444444443
No 144
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=26.41 E-value=27 Score=28.64 Aligned_cols=19 Identities=11% Similarity=0.011 Sum_probs=16.1
Q ss_pred cccCCCCCcccccccccchh
Q psy13090 25 PRIHPHKDKFMCYACYYSTY 44 (771)
Q Consensus 25 ~r~H~~~k~~~C~~C~~~f~ 44 (771)
+.++.| +|++|++||..|.
T Consensus 72 ~~l~~g-~~~rC~eCG~~fk 90 (97)
T cd00924 72 MWLEKG-KPKRCPECGHVFK 90 (97)
T ss_pred EEEeCC-CceeCCCCCcEEE
Confidence 677888 6999999998886
No 145
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=24.85 E-value=28 Score=28.28 Aligned_cols=39 Identities=21% Similarity=0.442 Sum_probs=26.0
Q ss_pred cccccccccccccccchhhhhhhhhhcCCCCCcccCCCCCccccc
Q psy13090 176 AEDLKCKLCNYQAVNIKGIRAHVFRKHLQNTVTYSCKKCPFETQF 220 (771)
Q Consensus 176 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~ 220 (771)
.+.|.|+.|+..-.+.-.++.-. +.....|..||..|..
T Consensus 20 ~k~FtCp~Cghe~vs~ctvkk~~------~~g~~~Cg~CGls~e~ 58 (104)
T COG4888 20 PKTFTCPRCGHEKVSSCTVKKTV------NIGTAVCGNCGLSFEC 58 (104)
T ss_pred CceEecCccCCeeeeEEEEEecC------ceeEEEcccCcceEEE
Confidence 45688888888777666544332 3345678899887754
No 146
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=24.84 E-value=50 Score=23.73 Aligned_cols=9 Identities=22% Similarity=0.693 Sum_probs=4.2
Q ss_pred ccccccccc
Q psy13090 180 KCKLCNYQA 188 (771)
Q Consensus 180 ~C~~C~~~f 188 (771)
+|+.|+..+
T Consensus 4 ~CP~CG~~i 12 (54)
T TIGR01206 4 ECPDCGAEI 12 (54)
T ss_pred CCCCCCCEE
Confidence 455555433
No 147
>PRK04023 DNA polymerase II large subunit; Validated
Probab=24.67 E-value=61 Score=38.21 Aligned_cols=10 Identities=30% Similarity=0.677 Sum_probs=6.1
Q ss_pred eecCCCCccc
Q psy13090 317 WNCEHCDFSA 326 (771)
Q Consensus 317 ~~C~~C~~~f 326 (771)
+.|+.|+..-
T Consensus 664 y~CPKCG~El 673 (1121)
T PRK04023 664 DECEKCGREP 673 (1121)
T ss_pred CcCCCCCCCC
Confidence 5577776543
No 148
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=23.98 E-value=32 Score=30.03 Aligned_cols=28 Identities=32% Similarity=0.612 Sum_probs=17.2
Q ss_pred CCcccccccccchhhhhhHHHHHHhhcCCCC
Q psy13090 31 KDKFMCYACYYSTYISTNMKNHIMTHTGSKP 61 (771)
Q Consensus 31 ~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~ 61 (771)
+.-..|-+||+.|.. |++|++.|.|..|
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred cCeeEEccCCcccch---HHHHHHHccCCCH
Confidence 345579999999984 5999999977654
No 149
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.74 E-value=54 Score=26.96 Aligned_cols=32 Identities=6% Similarity=-0.147 Sum_probs=25.2
Q ss_pred cccccccchhhhhhHHHHHHhhcCCCCccCCCCcccchhhHHHh
Q psy13090 35 MCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCTFKGASSQALK 78 (771)
Q Consensus 35 ~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~ 78 (771)
.||.|++.|... +..|-.||+||++|. .+.|.
T Consensus 11 idPetg~KFYDL-----------NrdPiVsPytG~s~P-~s~fe 42 (129)
T COG4530 11 IDPETGKKFYDL-----------NRDPIVSPYTGKSYP-RSYFE 42 (129)
T ss_pred cCccccchhhcc-----------CCCccccCcccccch-HHHHH
Confidence 699999999843 568999999999994 44444
No 150
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.53 E-value=18 Score=34.21 Aligned_cols=40 Identities=18% Similarity=0.431 Sum_probs=22.0
Q ss_pred CeeecccCCcccCChhHHHHHHhhccCCC----------c-----cccCCCCCcc
Q psy13090 714 QTFMCTQCDFTSSSKDTLNTHAMVHADEL----------K-----FSCQNCDFKC 753 (771)
Q Consensus 714 ~~~~C~~C~~~f~~~~~L~~H~~~H~~e~----------~-----~~C~~C~~~f 753 (771)
+.+.||+|+..|.....+..=.|+-.|+. | ..|++|+++.
T Consensus 18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yAa 72 (267)
T COG1655 18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYAA 72 (267)
T ss_pred ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHHH
Confidence 45666666666665544444444433332 2 4688887654
No 151
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=22.82 E-value=68 Score=21.34 Aligned_cols=11 Identities=18% Similarity=0.745 Sum_probs=6.5
Q ss_pred ceecCCCCCcc
Q psy13090 236 SFTCDECHLCF 246 (771)
Q Consensus 236 ~~~C~~C~~~F 246 (771)
-|.|..|+..|
T Consensus 28 fy~C~~C~~~w 38 (40)
T smart00440 28 FYVCTKCGHRW 38 (40)
T ss_pred EEEeCCCCCEe
Confidence 46666666554
No 152
>KOG2593|consensus
Probab=22.75 E-value=85 Score=33.17 Aligned_cols=15 Identities=13% Similarity=0.341 Sum_probs=6.8
Q ss_pred ccccccccccccChh
Q psy13090 658 RLPCKLCGFRLKNHK 672 (771)
Q Consensus 658 ~~~C~~C~~~f~~~~ 672 (771)
-|.|+.|++.|....
T Consensus 128 ~Y~Cp~C~kkyt~Le 142 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLE 142 (436)
T ss_pred cccCCccccchhhhH
Confidence 344444444444433
No 153
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=22.60 E-value=47 Score=26.38 Aligned_cols=27 Identities=22% Similarity=0.477 Sum_probs=20.6
Q ss_pred CcccccccccchhhhhhHHHHHHhhcCCCCccCCCCc
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIMTHTGSKPYNCSHCT 68 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~ 68 (771)
+|=+|.-||+.|.+- .-.+|-+||.|.
T Consensus 57 ~Pa~CkkCGfef~~~----------~ik~pSRCP~CK 83 (97)
T COG3357 57 RPARCKKCGFEFRDD----------KIKKPSRCPKCK 83 (97)
T ss_pred cChhhcccCcccccc----------ccCCcccCCcch
Confidence 577899999999851 124688999994
No 154
>KOG2807|consensus
Probab=22.54 E-value=96 Score=31.29 Aligned_cols=37 Identities=22% Similarity=0.507 Sum_probs=28.2
Q ss_pred CCceecCCCCcccCChHHHHccccccCCCCCCCCcccccccCCchhHHHHH
Q psy13090 542 GEKYSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSRQSDNNTHQ 592 (771)
Q Consensus 542 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 592 (771)
+..|.|+.|.... -.-|-.|++|+-+..+...|.+-.
T Consensus 274 ~~Gy~CP~Ckakv--------------CsLP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 274 GGGYFCPQCKAKV--------------CSLPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred cCceeCCcccCee--------------ecCCccCCccceeEecchHHHHHH
Confidence 4459999997543 245889999999999888887654
No 155
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=22.01 E-value=49 Score=28.95 Aligned_cols=22 Identities=23% Similarity=0.540 Sum_probs=8.2
Q ss_pred eecccCCcccCChhHHHHHHhhccC
Q psy13090 716 FMCTQCDFTSSSKDTLNTHAMVHAD 740 (771)
Q Consensus 716 ~~C~~C~~~f~~~~~L~~H~~~H~~ 740 (771)
..|-+||+.|.. |++|++.|+|
T Consensus 73 i~clecGk~~k~---LkrHL~~~~g 94 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHG 94 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-
T ss_pred eEEccCCcccch---HHHHHHHccC
Confidence 345555554432 2455555543
No 156
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=21.92 E-value=39 Score=26.73 Aligned_cols=12 Identities=33% Similarity=0.844 Sum_probs=5.2
Q ss_pred eecCCCCCcccc
Q psy13090 237 FTCDECHLCFNT 248 (771)
Q Consensus 237 ~~C~~C~~~F~~ 248 (771)
..|.+||..|..
T Consensus 47 ~~C~~Cg~~~~~ 58 (81)
T PF05129_consen 47 LSCRVCGESFQT 58 (81)
T ss_dssp EEESSS--EEEE
T ss_pred EEecCCCCeEEE
Confidence 455555555543
No 157
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=21.80 E-value=78 Score=20.53 Aligned_cols=32 Identities=13% Similarity=0.279 Sum_probs=16.8
Q ss_pred eecCCCCcccCChHHHHccccccCCCCCCCCcccccccCC
Q psy13090 545 YSCTQCSYQTYHKHALDNHMPLHSLDRKHQCPVCLRRFSR 584 (771)
Q Consensus 545 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 584 (771)
..|+.||..|.. ..-....+-.|+.||..+..
T Consensus 2 r~C~~Cg~~Yh~--------~~~pP~~~~~Cd~cg~~L~q 33 (36)
T PF05191_consen 2 RICPKCGRIYHI--------EFNPPKVEGVCDNCGGELVQ 33 (36)
T ss_dssp EEETTTTEEEET--------TTB--SSTTBCTTTTEBEBE
T ss_pred cCcCCCCCcccc--------ccCCCCCCCccCCCCCeeEe
Confidence 357777776642 22223344567777765543
No 158
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=21.05 E-value=34 Score=30.22 Aligned_cols=44 Identities=11% Similarity=0.309 Sum_probs=26.6
Q ss_pred cCCCCCcccccccccchhhhh---hH------HHHHHhhcCCCCccCCCCccc
Q psy13090 27 IHPHKDKFMCYACYYSTYIST---NM------KNHIMTHTGSKPYNCSHCTFK 70 (771)
Q Consensus 27 ~H~~~k~~~C~~C~~~f~~~~---~l------~~H~~~H~~~~~~~C~~C~~~ 70 (771)
+-+....+.|..||+.|.... .| ..|..--+....+.||.||..
T Consensus 64 i~~~p~~~~C~~CG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~CP~Cgs~ 116 (135)
T PRK03824 64 FEEEEAVLKCRNCGNEWSLKEVKESLDEEIREAIHFIPEVVHAFLKCPKCGSR 116 (135)
T ss_pred EEecceEEECCCCCCEEecccccccccccccccccccccccccCcCCcCCCCC
Confidence 444457789999999988641 11 122222222455889999843
No 159
>KOG4167|consensus
Probab=20.93 E-value=27 Score=38.99 Aligned_cols=25 Identities=28% Similarity=0.522 Sum_probs=20.3
Q ss_pred ccccccccccccChhHHHHHHHHhc
Q psy13090 658 RLPCKLCGFRLKNHKQLKLHQRYHL 682 (771)
Q Consensus 658 ~~~C~~C~~~f~~~~~L~~H~~~H~ 682 (771)
-|.|.+|++.|-...+++.||++|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 4888888888888888888888874
No 160
>KOG4167|consensus
Probab=20.92 E-value=37 Score=38.04 Aligned_cols=27 Identities=22% Similarity=0.370 Sum_probs=23.8
Q ss_pred CcccccccccchhhhhhHHHHHHhhcC
Q psy13090 32 DKFMCYACYYSTYISTNMKNHIMTHTG 58 (771)
Q Consensus 32 k~~~C~~C~~~f~~~~~l~~H~~~H~~ 58 (771)
.-|-|-+|++.|.....+..||++|.-
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 468999999999999999999999863
No 161
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=20.92 E-value=16 Score=41.02 Aligned_cols=74 Identities=23% Similarity=0.380 Sum_probs=0.0
Q ss_pred cccccccccChhhhhhhhccccCCCcc-cCCCCCcccCCchhHHHhhh--cCccceecCCCCccccchHHHHHHHhhccc
Q psy13090 265 CELCNEKVIGKNQIYFHYYHEHEGYDF-NCKQCSFTTKTPKKFIEHKE--AGHFTWNCEHCDFSASKKSVLLSHLKQHVT 341 (771)
Q Consensus 265 C~~C~~~f~~~~~L~~H~~~~h~~~~~-~C~~C~~~f~~~~~l~~H~~--~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 341 (771)
|+.|-+.+.+..+-+.++ || .|..||-+|.-...|---.. +...--.|+.|.+.|....+-+-|.
T Consensus 104 C~~Cl~Ei~dp~~rrY~Y-------PF~~CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP~nRRfHA----- 171 (750)
T COG0068 104 CEDCLEEIFDPNSRRYLY-------PFINCTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDPLNRRFHA----- 171 (750)
T ss_pred hHHHHHHhcCCCCcceec-------cccccCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCcccccccc-----
Q ss_pred cccccccCCCCcccCCCCCCCCCCchhhhHHHHHhhhcccCCCcccccccc
Q psy13090 342 RVTRTQHMTPDEETNEIPIEPDLSPDLQKEELVRETRSMKMYDNYKCNICD 392 (771)
Q Consensus 342 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~ 392 (771)
.+..|+.||
T Consensus 172 ------------------------------------------Qp~aCp~CG 180 (750)
T COG0068 172 ------------------------------------------QPIACPKCG 180 (750)
T ss_pred ------------------------------------------ccccCcccC
No 162
>KOG3408|consensus
Probab=20.74 E-value=78 Score=26.80 Aligned_cols=29 Identities=28% Similarity=0.475 Sum_probs=24.0
Q ss_pred ccCCCCccccccchhhccChHHHHHHHhh
Q psy13090 624 EMQSESQLTCDICDRVFQTDKQWKLHMGF 652 (771)
Q Consensus 624 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 652 (771)
...|-..|.|-.|.+-|.+...|+.|.++
T Consensus 51 dlPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 51 DLPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 33566789999999999999999999764
No 163
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=20.46 E-value=47 Score=21.96 Aligned_cols=13 Identities=23% Similarity=0.577 Sum_probs=10.6
Q ss_pred CccCCCCcccchh
Q psy13090 61 PYNCSHCTFKGAS 73 (771)
Q Consensus 61 ~~~C~~C~~~f~~ 73 (771)
||+|+.|++.|=.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 8899999888853
No 164
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=20.37 E-value=76 Score=21.39 Aligned_cols=9 Identities=33% Similarity=1.029 Sum_probs=4.2
Q ss_pred eecCCCCCc
Q psy13090 237 FTCDECHLC 245 (771)
Q Consensus 237 ~~C~~C~~~ 245 (771)
+.|..||..
T Consensus 20 ~vC~~CG~V 28 (43)
T PF08271_consen 20 LVCPNCGLV 28 (43)
T ss_dssp EEETTT-BB
T ss_pred EECCCCCCE
Confidence 455555544
No 165
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.36 E-value=54 Score=23.78 Aligned_cols=40 Identities=25% Similarity=0.431 Sum_probs=25.0
Q ss_pred cccccchh-hccChHHHHHHHhhccCCCccccccccccccC
Q psy13090 631 LTCDICDR-VFQTDKQWKLHMGFHNTKRRLPCKLCGFRLKN 670 (771)
Q Consensus 631 ~~C~~C~~-~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~ 670 (771)
.+|.+|++ .|.....+..-.......+.|.|+.|.-+...
T Consensus 3 vkCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rva~ 43 (68)
T COG4896 3 VKCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRVAI 43 (68)
T ss_pred ceEEEecceeeecchhHHHHHhhCCCceeEechhhHhhhch
Confidence 35777775 35555666665555566667888887655443
No 166
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=20.35 E-value=53 Score=21.69 Aligned_cols=11 Identities=18% Similarity=0.640 Sum_probs=6.4
Q ss_pred ceecCCCCCcc
Q psy13090 236 SFTCDECHLCF 246 (771)
Q Consensus 236 ~~~C~~C~~~F 246 (771)
-|.|..|+..|
T Consensus 28 fy~C~~C~~~w 38 (39)
T PF01096_consen 28 FYVCCNCGHRW 38 (39)
T ss_dssp EEEESSSTEEE
T ss_pred EEEeCCCCCee
Confidence 36666666554
No 167
>PF03107 C1_2: C1 domain; InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=20.16 E-value=93 Score=19.05 Aligned_cols=9 Identities=33% Similarity=1.320 Sum_probs=6.0
Q ss_pred ceecCCCCc
Q psy13090 544 KYSCTQCSY 552 (771)
Q Consensus 544 ~~~C~~C~~ 552 (771)
.|.|..|+.
T Consensus 15 ~Y~C~~c~f 23 (30)
T PF03107_consen 15 FYHCSECCF 23 (30)
T ss_pred eEEeCCCCC
Confidence 477777763
Done!