Query psy13793
Match_columns 337
No_of_seqs 232 out of 2832
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 20:38:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13793.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13793hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 100.0 1.2E-30 2.5E-35 213.6 6.0 137 177-314 127-266 (279)
2 KOG1074|consensus 100.0 8.6E-29 1.9E-33 228.3 6.8 144 179-323 604-938 (958)
3 KOG2462|consensus 99.9 6.7E-27 1.5E-31 191.6 7.1 138 73-285 127-265 (279)
4 KOG3623|consensus 99.9 8.2E-26 1.8E-30 205.5 0.6 77 235-311 894-970 (1007)
5 KOG3608|consensus 99.9 4.2E-23 9E-28 173.6 6.6 190 80-320 183-381 (467)
6 KOG3608|consensus 99.8 4.4E-21 9.6E-26 161.5 9.7 157 162-320 189-350 (467)
7 KOG1074|consensus 99.8 3.1E-22 6.7E-27 185.4 0.7 85 236-321 606-697 (958)
8 KOG3576|consensus 99.7 2.9E-18 6.4E-23 133.9 3.8 108 209-316 118-237 (267)
9 KOG3576|consensus 99.7 2.8E-17 6.1E-22 128.5 3.0 114 177-290 114-239 (267)
10 KOG3623|consensus 99.6 1E-15 2.2E-20 140.4 3.2 118 181-312 211-330 (1007)
11 PLN03086 PRLI-interacting fact 99.4 2.3E-12 5.1E-17 119.2 8.6 104 207-316 452-565 (567)
12 PHA00733 hypothetical protein 99.2 1.3E-11 2.8E-16 93.6 5.2 55 259-315 69-123 (128)
13 PLN03086 PRLI-interacting fact 99.1 6.1E-10 1.3E-14 103.4 10.5 98 181-285 454-562 (567)
14 PHA00733 hypothetical protein 99.1 2.3E-10 4.9E-15 86.8 5.5 60 227-288 65-124 (128)
15 PHA02768 hypothetical protein; 99.1 7.9E-11 1.7E-15 73.3 2.1 43 264-308 6-48 (55)
16 PF13465 zf-H2C2_2: Zinc-finge 98.8 2.3E-09 5E-14 57.1 2.7 26 278-303 1-26 (26)
17 KOG3993|consensus 98.8 3.5E-10 7.5E-15 98.5 -1.0 52 181-232 268-319 (500)
18 PHA02768 hypothetical protein; 98.8 1.6E-09 3.4E-14 67.5 1.8 44 235-280 5-48 (55)
19 PF13465 zf-H2C2_2: Zinc-finge 98.6 2.9E-08 6.2E-13 52.8 2.2 26 250-275 1-26 (26)
20 KOG3993|consensus 98.6 4.4E-09 9.6E-14 91.8 -2.0 215 77-318 268-485 (500)
21 PHA00616 hypothetical protein 98.6 3.6E-08 7.8E-13 58.4 1.9 39 263-301 1-39 (44)
22 PHA00732 hypothetical protein 98.4 2.2E-07 4.8E-12 63.8 2.7 43 263-311 1-44 (79)
23 PHA00616 hypothetical protein 98.3 4.1E-07 8.9E-12 53.9 1.8 39 180-218 1-39 (44)
24 PHA00732 hypothetical protein 98.1 2.1E-06 4.5E-11 59.0 2.9 48 235-288 1-49 (79)
25 PF05605 zf-Di19: Drought indu 98.0 1.1E-05 2.4E-10 51.4 4.4 49 264-315 3-53 (54)
26 PF05605 zf-Di19: Drought indu 97.8 4E-05 8.7E-10 48.8 5.0 50 235-287 2-53 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.8 9.8E-06 2.1E-10 41.7 1.6 21 292-312 1-21 (23)
28 PF00096 zf-C2H2: Zinc finger, 97.8 8.5E-06 1.8E-10 41.9 1.3 23 77-99 1-23 (23)
29 COG5189 SFP1 Putative transcri 97.7 1.4E-05 2.9E-10 67.8 0.9 52 260-311 346-418 (423)
30 PF13912 zf-C2H2_6: C2H2-type 97.6 2.7E-05 5.9E-10 41.8 1.5 25 76-100 1-25 (27)
31 PF13894 zf-C2H2_4: C2H2-type 97.6 4.9E-05 1.1E-09 39.3 2.4 24 292-315 1-24 (24)
32 PF13894 zf-C2H2_4: C2H2-type 97.6 4.9E-05 1.1E-09 39.3 2.0 24 77-100 1-24 (24)
33 PF12756 zf-C2H2_2: C2H2 type 97.5 8.3E-05 1.8E-09 54.1 3.2 72 238-314 2-73 (100)
34 PF12756 zf-C2H2_2: C2H2 type 97.5 7.4E-05 1.6E-09 54.3 2.4 23 263-285 50-72 (100)
35 PF13912 zf-C2H2_6: C2H2-type 97.5 7.5E-05 1.6E-09 40.0 1.7 21 291-311 1-21 (27)
36 COG5189 SFP1 Putative transcri 96.9 0.00012 2.5E-09 62.3 -1.5 69 206-284 347-419 (423)
37 smart00355 ZnF_C2H2 zinc finge 96.8 0.0011 2.3E-08 34.7 2.3 19 293-311 2-20 (26)
38 PRK04860 hypothetical protein; 96.8 0.00053 1.1E-08 54.0 1.4 40 262-305 118-157 (160)
39 smart00355 ZnF_C2H2 zinc finge 96.8 0.00091 2E-08 35.0 1.9 24 77-100 1-24 (26)
40 PF09237 GAGA: GAGA factor; I 96.7 0.0015 3.3E-08 39.7 2.5 28 262-289 23-50 (54)
41 PF12874 zf-met: Zinc-finger o 96.6 0.001 2.2E-08 34.7 1.2 23 77-99 1-23 (25)
42 PF09237 GAGA: GAGA factor; I 96.5 0.0035 7.7E-08 38.1 3.1 32 177-208 21-52 (54)
43 PF13909 zf-H2C2_5: C2H2-type 96.5 0.0021 4.6E-08 33.2 1.9 23 292-315 1-23 (24)
44 KOG2231|consensus 96.4 0.011 2.4E-07 56.6 7.2 32 77-108 100-146 (669)
45 PF12171 zf-C2H2_jaz: Zinc-fin 96.2 0.0019 4.2E-08 34.4 0.8 22 77-98 2-23 (27)
46 PF12874 zf-met: Zinc-finger o 96.1 0.0032 7E-08 32.8 1.4 20 265-284 2-21 (25)
47 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0048 1E-07 31.8 1.6 23 264-287 1-23 (24)
48 PRK04860 hypothetical protein; 95.6 0.0068 1.5E-07 47.8 1.8 38 180-221 119-156 (160)
49 KOG1146|consensus 95.2 0.0068 1.5E-07 61.5 0.8 147 173-325 458-651 (1406)
50 KOG2231|consensus 94.9 0.039 8.5E-07 52.9 4.9 121 181-311 100-259 (669)
51 COG5236 Uncharacterized conser 94.8 0.042 9.1E-07 47.7 4.4 127 180-316 151-306 (493)
52 KOG1146|consensus 94.8 0.0091 2E-07 60.6 0.4 57 260-316 515-614 (1406)
53 PF12171 zf-C2H2_jaz: Zinc-fin 94.7 0.0089 1.9E-07 31.8 0.0 21 264-284 2-22 (27)
54 smart00451 ZnF_U1 U1-like zinc 93.8 0.054 1.2E-06 30.6 2.1 23 76-98 3-25 (35)
55 KOG2785|consensus 93.7 0.15 3.2E-06 45.3 5.5 133 180-312 3-241 (390)
56 COG5236 Uncharacterized conser 93.5 0.11 2.5E-06 45.1 4.4 34 76-109 151-186 (493)
57 PF13913 zf-C2HC_2: zinc-finge 93.5 0.061 1.3E-06 28.0 1.8 20 77-97 3-22 (25)
58 PF13913 zf-C2HC_2: zinc-finge 92.7 0.1 2.2E-06 27.1 1.9 8 266-273 5-12 (25)
59 COG5048 FOG: Zn-finger [Genera 92.6 0.015 3.2E-07 54.1 -2.5 55 179-233 288-348 (467)
60 KOG2893|consensus 92.1 0.038 8.1E-07 45.4 -0.4 41 238-282 13-53 (341)
61 smart00451 ZnF_U1 U1-like zinc 91.2 0.15 3.3E-06 28.7 1.7 22 263-284 3-24 (35)
62 COG5048 FOG: Zn-finger [Genera 90.5 0.084 1.8E-06 49.0 0.2 57 45-101 30-88 (467)
63 KOG2482|consensus 90.3 0.49 1.1E-05 41.3 4.6 134 179-312 143-355 (423)
64 KOG4173|consensus 90.1 0.12 2.6E-06 41.6 0.8 79 208-288 79-172 (253)
65 cd00350 rubredoxin_like Rubred 89.9 0.12 2.6E-06 28.9 0.5 9 290-298 16-24 (33)
66 KOG2482|consensus 89.7 0.46 1E-05 41.5 4.0 106 180-285 195-356 (423)
67 TIGR00622 ssl1 transcription f 89.6 0.41 8.9E-06 35.0 3.1 33 181-227 2-34 (112)
68 COG4049 Uncharacterized protei 89.0 0.21 4.5E-06 31.0 1.0 28 260-287 14-41 (65)
69 PF02892 zf-BED: BED zinc fing 88.9 0.32 7E-06 29.2 1.9 28 288-315 13-44 (45)
70 smart00614 ZnF_BED BED zinc fi 88.8 0.39 8.5E-06 29.7 2.3 27 291-317 18-49 (50)
71 PF09538 FYDLN_acid: Protein o 87.7 0.35 7.5E-06 35.4 1.7 30 264-304 10-39 (108)
72 COG4049 Uncharacterized protei 87.2 0.32 6.9E-06 30.2 1.1 30 286-315 12-41 (65)
73 PF12013 DUF3505: Protein of u 85.6 1.6 3.5E-05 32.1 4.4 24 292-315 81-108 (109)
74 TIGR00622 ssl1 transcription f 85.3 1.1 2.4E-05 32.8 3.2 92 209-316 2-105 (112)
75 KOG2893|consensus 85.1 0.3 6.5E-06 40.3 0.3 50 262-316 10-59 (341)
76 PF12013 DUF3505: Protein of u 84.6 1.6 3.5E-05 32.0 4.0 51 235-286 11-107 (109)
77 KOG4173|consensus 82.5 0.87 1.9E-05 36.8 1.9 78 179-259 78-171 (253)
78 KOG2186|consensus 81.6 0.91 2E-05 38.0 1.7 50 181-233 4-53 (276)
79 PF12907 zf-met2: Zinc-binding 81.0 0.37 7.9E-06 28.2 -0.5 29 292-320 2-33 (40)
80 cd00729 rubredoxin_SM Rubredox 80.9 0.65 1.4E-05 26.1 0.5 9 291-299 18-26 (34)
81 TIGR00373 conserved hypothetic 78.2 2.1 4.5E-05 33.9 2.7 32 260-300 106-137 (158)
82 TIGR02098 MJ0042_CXXC MJ0042 f 78.0 1.3 2.8E-05 25.5 1.1 34 264-302 3-36 (38)
83 smart00659 RPOLCX RNA polymera 77.9 1.2 2.6E-05 26.7 1.0 26 264-300 3-28 (44)
84 TIGR02300 FYDLN_acid conserved 77.6 1.5 3.2E-05 32.8 1.6 30 264-304 10-39 (129)
85 COG1198 PriA Primosomal protei 77.3 1.6 3.5E-05 43.2 2.2 28 258-300 457-484 (730)
86 COG1996 RPC10 DNA-directed RNA 77.2 1.2 2.6E-05 27.3 0.8 28 263-300 6-33 (49)
87 PF09986 DUF2225: Uncharacteri 77.2 0.46 1E-05 39.7 -1.3 43 262-304 4-61 (214)
88 PF13719 zinc_ribbon_5: zinc-r 77.1 1.6 3.4E-05 25.1 1.3 32 265-301 4-35 (37)
89 PF13717 zinc_ribbon_4: zinc-r 76.2 1.8 3.9E-05 24.7 1.4 32 265-301 4-35 (36)
90 KOG2186|consensus 75.4 2 4.3E-05 36.1 2.0 46 209-256 4-49 (276)
91 COG2888 Predicted Zn-ribbon RN 75.3 2 4.4E-05 27.3 1.5 9 236-244 28-36 (61)
92 COG5188 PRP9 Splicing factor 3 73.4 8 0.00017 34.2 5.2 77 256-333 367-453 (470)
93 PRK06266 transcription initiat 72.7 3 6.6E-05 33.7 2.4 31 262-301 116-146 (178)
94 smart00531 TFIIE Transcription 72.5 3.5 7.6E-05 32.2 2.6 36 262-301 98-133 (147)
95 COG1592 Rubrerythrin [Energy p 71.5 2.1 4.5E-05 34.0 1.1 23 263-298 134-156 (166)
96 COG1592 Rubrerythrin [Energy p 70.5 2.6 5.6E-05 33.5 1.5 24 235-271 134-157 (166)
97 smart00734 ZnF_Rad18 Rad18-lik 68.6 4 8.7E-05 21.3 1.5 20 292-312 2-21 (26)
98 PF03604 DNA_RNApol_7kD: DNA d 67.5 3.8 8.3E-05 22.7 1.3 26 264-300 1-26 (32)
99 PRK00398 rpoP DNA-directed RNA 67.3 3.5 7.5E-05 24.9 1.3 30 263-302 3-32 (46)
100 PF09538 FYDLN_acid: Protein o 67.0 3.5 7.5E-05 30.2 1.5 29 50-89 11-39 (108)
101 PRK09678 DNA-binding transcrip 67.0 1.1 2.4E-05 30.0 -1.0 41 264-306 2-44 (72)
102 PF08274 PhnA_Zn_Ribbon: PhnA 67.0 2.7 5.8E-05 22.9 0.7 25 265-300 4-28 (30)
103 PRK14890 putative Zn-ribbon RN 66.9 4.1 8.9E-05 26.0 1.6 11 234-244 24-34 (59)
104 KOG2807|consensus 66.8 6.2 0.00013 34.6 3.1 34 179-226 275-308 (378)
105 smart00834 CxxC_CXXC_SSSS Puta 66.2 1.3 2.9E-05 25.8 -0.7 10 181-190 6-15 (41)
106 PF04959 ARS2: Arsenite-resist 65.4 4.5 9.7E-05 33.6 2.0 29 288-316 74-102 (214)
107 PRK04023 DNA polymerase II lar 65.3 5.8 0.00012 40.4 3.0 11 291-301 663-673 (1121)
108 PF06524 NOA36: NOA36 protein; 64.4 2.6 5.5E-05 35.6 0.4 25 262-286 208-232 (314)
109 TIGR02605 CxxC_CxxC_SSSS putat 64.3 1.6 3.4E-05 27.2 -0.7 11 181-191 6-16 (52)
110 TIGR02300 FYDLN_acid conserved 64.1 4.2 9.1E-05 30.4 1.4 30 181-221 10-39 (129)
111 PRK00464 nrdR transcriptional 63.0 1.2 2.5E-05 35.0 -1.8 13 292-304 29-41 (154)
112 TIGR00373 conserved hypothetic 61.9 9.4 0.0002 30.2 3.2 16 208-223 109-124 (158)
113 PF09986 DUF2225: Uncharacteri 61.3 1.2 2.6E-05 37.2 -2.1 21 179-199 4-24 (214)
114 KOG1280|consensus 61.2 12 0.00025 33.2 3.8 36 235-270 79-116 (381)
115 PF05443 ROS_MUCR: ROS/MUCR tr 59.4 4.7 0.0001 30.7 1.0 24 264-290 73-96 (132)
116 PF07800 DUF1644: Protein of u 58.5 29 0.00063 27.2 5.1 60 263-324 80-141 (162)
117 COG5151 SSL1 RNA polymerase II 57.1 6.5 0.00014 34.1 1.6 92 179-286 307-411 (421)
118 smart00531 TFIIE Transcription 55.8 13 0.00028 29.0 3.0 12 208-219 99-110 (147)
119 PF05290 Baculo_IE-1: Baculovi 55.8 7.1 0.00015 29.5 1.4 18 176-193 76-93 (140)
120 PRK06266 transcription initiat 55.6 13 0.00028 30.1 3.0 15 208-222 117-131 (178)
121 KOG2071|consensus 53.5 10 0.00022 36.2 2.3 26 234-259 417-442 (579)
122 COG4530 Uncharacterized protei 53.4 7.7 0.00017 28.0 1.2 28 264-302 10-37 (129)
123 PF15269 zf-C2H2_7: Zinc-finge 52.6 10 0.00022 22.5 1.4 22 77-98 21-42 (54)
124 KOG2785|consensus 52.2 19 0.00042 32.4 3.7 26 73-98 65-90 (390)
125 PF09723 Zn-ribbon_8: Zinc rib 52.0 16 0.00034 21.6 2.2 11 74-84 24-34 (42)
126 COG1198 PriA Primosomal protei 51.3 7.7 0.00017 38.6 1.3 14 259-272 471-484 (730)
127 PHA00626 hypothetical protein 50.9 8 0.00017 24.3 0.8 16 289-304 21-36 (59)
128 KOG1842|consensus 50.8 7.5 0.00016 35.6 1.0 21 264-284 16-36 (505)
129 PF10571 UPF0547: Uncharacteri 50.6 9 0.00019 20.0 0.9 11 77-87 15-25 (26)
130 PF02176 zf-TRAF: TRAF-type zi 49.9 4.3 9.4E-05 25.9 -0.5 41 262-303 8-54 (60)
131 KOG3408|consensus 49.2 4.2 9E-05 30.0 -0.7 26 260-285 54-79 (129)
132 PF05191 ADK_lid: Adenylate ki 47.5 3.9 8.6E-05 23.3 -0.8 29 182-218 3-31 (36)
133 KOG3408|consensus 47.0 11 0.00024 27.8 1.2 28 71-98 52-79 (129)
134 PF13878 zf-C2H2_3: zinc-finge 46.3 17 0.00037 21.3 1.8 25 76-100 13-39 (41)
135 PRK04023 DNA polymerase II lar 45.9 17 0.00036 37.3 2.6 48 180-273 626-673 (1121)
136 KOG2593|consensus 45.7 15 0.00033 33.6 2.1 37 178-217 126-162 (436)
137 PF04780 DUF629: Protein of un 45.5 15 0.00032 34.5 2.1 33 290-322 56-88 (466)
138 KOG4167|consensus 45.2 3.9 8.4E-05 39.8 -1.7 27 75-101 791-817 (907)
139 TIGR00595 priA primosomal prot 44.2 16 0.00035 35.0 2.2 9 291-299 253-261 (505)
140 PRK14873 primosome assembly pr 44.1 15 0.00031 36.5 1.9 11 290-300 421-431 (665)
141 PF07754 DUF1610: Domain of un 43.3 13 0.00028 19.0 0.8 10 75-84 15-24 (24)
142 KOG4167|consensus 42.8 7.7 0.00017 37.9 -0.1 25 263-287 792-816 (907)
143 KOG4118|consensus 42.8 15 0.00033 23.8 1.2 31 291-321 38-68 (74)
144 smart00154 ZnF_AN1 AN1-like Zi 42.4 14 0.00031 21.4 1.0 14 76-89 12-25 (39)
145 PF14353 CpXC: CpXC protein 41.9 22 0.00048 26.8 2.3 20 76-95 38-57 (128)
146 COG1997 RPL43A Ribosomal prote 41.2 11 0.00024 26.1 0.5 33 75-115 34-66 (89)
147 PF04959 ARS2: Arsenite-resist 37.1 15 0.00032 30.7 0.7 30 177-206 74-103 (214)
148 KOG1280|consensus 35.7 29 0.00064 30.8 2.3 62 261-322 77-147 (381)
149 PRK14714 DNA polymerase II lar 35.6 35 0.00076 36.0 3.1 28 263-301 692-719 (1337)
150 KOG4377|consensus 35.2 11 0.00023 34.2 -0.4 26 179-204 270-297 (480)
151 KOG2593|consensus 34.8 44 0.00096 30.8 3.3 16 235-250 128-143 (436)
152 cd00065 FYVE FYVE domain; Zinc 34.2 25 0.00055 22.0 1.3 14 290-303 17-30 (57)
153 KOG4124|consensus 33.8 6 0.00013 35.0 -2.2 49 262-310 348-417 (442)
154 COG3091 SprT Zn-dependent meta 33.3 19 0.00041 28.0 0.7 34 262-300 116-149 (156)
155 KOG2807|consensus 33.3 62 0.0013 28.6 3.8 26 289-315 343-368 (378)
156 COG4957 Predicted transcriptio 33.1 21 0.00045 27.0 0.8 25 264-291 77-101 (148)
157 PF01428 zf-AN1: AN1-like Zinc 31.9 17 0.00038 21.5 0.2 15 290-304 12-26 (43)
158 PF10276 zf-CHCC: Zinc-finger 31.4 26 0.00056 20.5 0.9 18 69-86 22-39 (40)
159 PRK05580 primosome assembly pr 31.4 33 0.00072 34.3 2.2 9 291-299 421-429 (679)
160 KOG0696|consensus 31.2 21 0.00045 32.9 0.7 60 204-271 69-129 (683)
161 KOG2636|consensus 31.1 32 0.00069 31.7 1.8 37 69-105 394-433 (497)
162 PF01363 FYVE: FYVE zinc finge 30.6 30 0.00065 22.7 1.3 11 48-58 9-19 (69)
163 PF01780 Ribosomal_L37ae: Ribo 29.8 13 0.00028 26.2 -0.7 31 179-219 34-64 (90)
164 PTZ00255 60S ribosomal protein 29.4 18 0.0004 25.4 0.0 31 179-219 35-65 (90)
165 COG1571 Predicted DNA-binding 29.2 32 0.00069 31.8 1.5 13 291-303 367-379 (421)
166 PRK00432 30S ribosomal protein 28.6 38 0.00083 20.9 1.4 11 101-111 36-46 (50)
167 smart00440 ZnF_C2C2 C2C2 Zinc 28.2 41 0.0009 19.5 1.4 11 77-87 29-39 (40)
168 KOG2071|consensus 28.1 37 0.0008 32.6 1.8 28 73-100 415-442 (579)
169 PF14311 DUF4379: Domain of un 28.1 33 0.00072 21.4 1.1 12 264-275 29-40 (55)
170 KOG2907|consensus 27.4 30 0.00065 25.3 0.8 11 292-302 103-113 (116)
171 KOG4377|consensus 27.2 20 0.00044 32.6 -0.1 103 208-316 271-427 (480)
172 PF12760 Zn_Tnp_IS1595: Transp 26.9 43 0.00093 20.1 1.4 9 290-298 36-44 (46)
173 smart00064 FYVE Protein presen 26.8 29 0.00063 22.7 0.7 10 237-246 12-21 (68)
174 TIGR00280 L37a ribosomal prote 26.7 19 0.0004 25.4 -0.3 31 179-219 34-64 (91)
175 KOG2636|consensus 26.3 45 0.00098 30.8 1.9 77 256-333 394-480 (497)
176 TIGR00686 phnA alkylphosphonat 26.1 36 0.00078 24.8 1.1 29 265-304 4-32 (109)
177 PF11931 DUF3449: Domain of un 26.0 23 0.00049 29.1 0.0 36 70-105 95-133 (196)
178 PF04606 Ogr_Delta: Ogr/Delta- 25.7 24 0.00051 21.4 0.1 37 79-115 2-40 (47)
179 PRK10220 hypothetical protein; 25.5 44 0.00095 24.4 1.4 29 265-304 5-33 (111)
180 PF08209 Sgf11: Sgf11 (transcr 25.5 67 0.0015 17.9 1.8 21 291-312 4-24 (33)
181 PRK03824 hypA hydrogenase nick 25.4 20 0.00044 27.5 -0.4 7 292-298 108-114 (135)
182 PF10013 DUF2256: Uncharacteri 25.4 38 0.00083 20.0 0.9 13 293-305 10-22 (42)
183 PF08271 TF_Zn_Ribbon: TFIIB z 25.3 29 0.00063 20.4 0.4 11 290-300 18-28 (43)
184 PF07282 OrfB_Zn_ribbon: Putat 24.9 47 0.001 21.8 1.4 10 263-272 46-55 (69)
185 PRK12496 hypothetical protein; 24.7 42 0.0009 26.7 1.3 11 264-274 128-138 (164)
186 KOG2272|consensus 24.0 35 0.00075 28.9 0.7 68 181-249 100-177 (332)
187 COG4640 Predicted membrane pro 23.9 49 0.0011 30.1 1.7 19 289-307 13-31 (465)
188 COG3357 Predicted transcriptio 23.8 33 0.00073 24.0 0.5 27 235-271 58-84 (97)
189 cd00924 Cyt_c_Oxidase_Vb Cytoc 23.6 40 0.00086 24.2 0.9 17 285-302 74-90 (97)
190 PF09845 DUF2072: Zn-ribbon co 23.4 42 0.00091 25.4 1.0 27 76-110 1-27 (131)
191 PF13451 zf-trcl: Probable zin 23.1 38 0.00083 20.8 0.6 14 74-87 2-15 (49)
192 PF04423 Rad50_zn_hook: Rad50 22.9 35 0.00077 21.2 0.5 12 293-304 22-33 (54)
193 PF13824 zf-Mss51: Zinc-finger 22.9 57 0.0012 20.6 1.4 15 289-303 12-26 (55)
194 KOG0717|consensus 22.7 48 0.001 30.9 1.4 22 77-98 293-314 (508)
195 TIGR00100 hypA hydrogenase nic 22.4 27 0.00058 25.9 -0.2 12 236-247 71-82 (115)
196 COG1571 Predicted DNA-binding 21.9 47 0.001 30.7 1.2 29 78-116 352-381 (421)
197 smart00504 Ubox Modified RING 21.9 1.2E+02 0.0026 19.1 2.9 10 293-302 37-46 (63)
198 COG1773 Rubredoxin [Energy pro 21.3 47 0.001 21.0 0.7 13 291-303 3-15 (55)
199 COG3364 Zn-ribbon containing p 21.2 58 0.0013 23.4 1.3 27 76-110 2-28 (112)
200 PRK03976 rpl37ae 50S ribosomal 21.1 26 0.00056 24.7 -0.4 31 179-219 35-65 (90)
201 COG4896 Uncharacterized protei 21.0 64 0.0014 20.7 1.3 35 182-216 4-39 (68)
202 COG1675 TFA1 Transcription ini 20.5 1.1E+02 0.0023 24.7 2.8 7 292-298 133-139 (176)
203 PF10263 SprT-like: SprT-like 20.3 27 0.00059 27.3 -0.6 32 263-302 123-154 (157)
204 PF08790 zf-LYAR: LYAR-type C2 20.2 30 0.00066 18.4 -0.2 18 292-310 1-18 (28)
205 PF14787 zf-CCHC_5: GAG-polypr 20.1 48 0.001 18.8 0.6 16 293-308 4-19 (36)
206 PF06524 NOA36: NOA36 protein; 20.1 39 0.00085 28.8 0.3 81 176-258 138-232 (314)
No 1
>KOG2462|consensus
Probab=99.96 E-value=1.2e-30 Score=213.65 Aligned_cols=137 Identities=31% Similarity=0.694 Sum_probs=129.1
Q ss_pred CCCcccccccccccCCHHHHHHHHHHcCC---CCccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHH
Q psy13793 177 SGTRYQCPLCNKKINTKSNLRRHLQAHKP---KGKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRA 253 (337)
Q Consensus 177 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~ 253 (337)
....|.|+.|++.+.+.++|.+|...|.. .+.+.|..||+.|.+..+|..|+++|. .+++|.+|||.|..++.|+-
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQG 205 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQG 205 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhc
Confidence 34569999999999999999999999864 567999999999999999999999996 67899999999999999999
Q ss_pred HHHHhCCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhh
Q psy13793 254 HMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRM 314 (337)
Q Consensus 254 H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~ 314 (337)
|++.|+|||||.|..|+++|...++|+.|+++|.+.++|+|..|+|.|...+.|.+|....
T Consensus 206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~ 266 (279)
T KOG2462|consen 206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA 266 (279)
T ss_pred ccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence 9999999999999999999999999999999999999999999999999999999997543
No 2
>KOG1074|consensus
Probab=99.95 E-value=8.6e-29 Score=228.32 Aligned_cols=144 Identities=31% Similarity=0.627 Sum_probs=128.8
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCCc-----cccCC---ccCCCCCCHHH
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDTV-----ELSCP---DCNRVYPTNST 250 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-----~~~C~---~C~~~f~~~~~ 250 (337)
.+-.|-+|-++..-++.|+.|.++|+|++||+|.+||+.|.++.+|+.|+.+|..+ .+.|+ +|.+.|...-.
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 46789999999999999999999999999999999999999999999999998643 47888 88888888888
Q ss_pred HHHHHHHhCCC---------------------------------------------------------------------
Q psy13793 251 LRAHMISHSTE--------------------------------------------------------------------- 261 (337)
Q Consensus 251 l~~H~~~h~~~--------------------------------------------------------------------- 261 (337)
|.+|+++|.+.
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccc
Confidence 88888887621
Q ss_pred --------------------------------------------------------------------------------
Q psy13793 262 -------------------------------------------------------------------------------- 261 (337)
Q Consensus 262 -------------------------------------------------------------------------------- 261 (337)
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence
Q ss_pred ----------------------------------CCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHH
Q psy13793 262 ----------------------------------RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNC 307 (337)
Q Consensus 262 ----------------------------------k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L 307 (337)
....|.+|++.|...+.|..|+|+|+|++||.|.+|++.|+.+.+|
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnL 923 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNL 923 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhh
Confidence 1267999999999999999999999999999999999999999999
Q ss_pred HHHHHhhCCCCCCCCC
Q psy13793 308 FSHRKRMHEGETLPDI 323 (337)
Q Consensus 308 ~~H~~~~H~~~~~~~~ 323 (337)
..|| ..|....++..
T Consensus 924 KvHM-gtH~w~q~~sr 938 (958)
T KOG1074|consen 924 KVHM-GTHMWVQPPSR 938 (958)
T ss_pred hhhh-ccccccCCCcc
Confidence 9997 69987766643
No 3
>KOG2462|consensus
Probab=99.94 E-value=6.7e-27 Score=191.64 Aligned_cols=138 Identities=30% Similarity=0.639 Sum_probs=126.4
Q ss_pred CCCceecCCCCCcCCChhHHHhHHhhccCccccCcccceeeecccccccccccccccccccCCCCCCccCCCCCcccCcc
Q psy13793 73 GPKPYQCAICPRGYYSDIGLQNHLWSHRKILKNSNVGIYYTFTEDLREREDGEKRKHWIDENGGGENEMETPTPKKIGIR 152 (337)
Q Consensus 73 ~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~c~~c~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (337)
....|+|+.||+.+.+.++|.+|.++|...
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~-------------------------------------------------- 156 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSL-------------------------------------------------- 156 (279)
T ss_pred cCCceeccccccccccccccchhhcccccc--------------------------------------------------
Confidence 445699999999999999999999999431
Q ss_pred ccccccccccCccccccccccccCCCCcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcC
Q psy13793 153 EKVNWKKTKITASNLLSYNNAMFSSGTRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHD 232 (337)
Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 232 (337)
...+.+.|++|++.|.....|..|+++|+ -+++|.+||+.|+....|.-|+|+|+
T Consensus 157 -----------------------~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHT 211 (279)
T KOG2462|consen 157 -----------------------DSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHT 211 (279)
T ss_pred -----------------------cccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhccccccc
Confidence 01244889999999999999999999997 68999999999999999999999998
Q ss_pred C-ccccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHh
Q psy13793 233 T-VELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQ 285 (337)
Q Consensus 233 ~-~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~ 285 (337)
+ +||.|+.|+++|.++++|+.||.+|.+.|+|.|..|+|+|...+.|.+|...
T Consensus 212 GEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 212 GEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred CCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 7 8999999999999999999999999999999999999999999999999764
No 4
>KOG3623|consensus
Probab=99.91 E-value=8.2e-26 Score=205.48 Aligned_cols=77 Identities=34% Similarity=0.757 Sum_probs=74.8
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHH
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHR 311 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~ 311 (337)
.|.|..|+|+|...++|.+|..-|+|.+||.|.+|.++|..+..|..|+|.|.|++||+|..|+|+|+..++...||
T Consensus 894 myaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 894 MYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred cchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 37899999999999999999999999999999999999999999999999999999999999999999999999995
No 5
>KOG3608|consensus
Probab=99.88 E-value=4.2e-23 Score=173.62 Aligned_cols=190 Identities=22% Similarity=0.435 Sum_probs=151.6
Q ss_pred CCCCCcCCChhHHHhHHhhccC--ccccCcccceeeecccccccccccccccccccCCCCCCccCCCCCcccCccccccc
Q psy13793 80 AICPRGYYSDIGLQNHLWSHRK--ILKNSNVGIYYTFTEDLREREDGEKRKHWIDENGGGENEMETPTPKKIGIREKVNW 157 (337)
Q Consensus 80 ~~C~~~F~~~~~L~~H~~~H~~--~~~c~~c~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (337)
..|.+.|.++..|++|++.|.+ ...|+.||.-|.....|..|.....
T Consensus 183 ~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt------------------------------- 231 (467)
T KOG3608|consen 183 AMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQT------------------------------- 231 (467)
T ss_pred hhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhh-------------------------------
Confidence 4566666666666666666655 3466666666665555544421110
Q ss_pred cccccCccccccccccccCCCCcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCC--cc
Q psy13793 158 KKTKITASNLLSYNNAMFSSGTRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDT--VE 235 (337)
Q Consensus 158 ~~~~~~~~~~~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~--~~ 235 (337)
.....+|.|..|.+.|.+...|..|+..|.. -|+|..|..+....++|.+|++..+. ++
T Consensus 232 -----------------~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkp 292 (467)
T KOG3608|consen 232 -----------------ELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKP 292 (467)
T ss_pred -----------------hhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhccCCC
Confidence 0012369999999999999999999999854 69999999999999999999987544 78
Q ss_pred ccCCccCCCCCCHHHHHHHHHHhCCCCCccccc--cchhccCHHHHHHHHHh-ccCC--CCeecCCCccccCCchHHHHH
Q psy13793 236 LSCPDCNRVYPTNSTLRAHMISHSTERPHNCPH--CEKTFKRNQDLKFHLNQ-HTGE--RPYKCPFCPRTFASSGNCFSH 310 (337)
Q Consensus 236 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~--C~~~f~~~~~L~~H~~~-H~~~--~~~~C~~C~k~f~~~~~L~~H 310 (337)
|+|..|.+.|...+.|.+|...|. +-.|.|.. |..+|.+...|.+|++. |.|. .+|.|..|.+.|++-.+|..|
T Consensus 293 fKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~H 371 (467)
T KOG3608|consen 293 FKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAH 371 (467)
T ss_pred ccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHH
Confidence 999999999999999999999887 67799988 99999999999999985 5454 469999999999999999999
Q ss_pred HHhhCCCCCC
Q psy13793 311 RKRMHEGETL 320 (337)
Q Consensus 311 ~~~~H~~~~~ 320 (337)
+.+.|.=.=|
T Consensus 372 L~kkH~f~~P 381 (467)
T KOG3608|consen 372 LMKKHGFRLP 381 (467)
T ss_pred HHHhhcccCC
Confidence 9999864433
No 6
>KOG3608|consensus
Probab=99.85 E-value=4.4e-21 Score=161.48 Aligned_cols=157 Identities=27% Similarity=0.511 Sum_probs=141.7
Q ss_pred cCccccccccccccCCCCcccccccccccCCHHHHHHHHHHcC--CCCccccccccccccCHHHHHHHHHhcCCccccCC
Q psy13793 162 ITASNLLSYNNAMFSSGTRYQCPLCNKKINTKSNLRRHLQAHK--PKGKHTCEICFRIFKTEINLTRHLECHDTVELSCP 239 (337)
Q Consensus 162 ~~~~~~~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~ 239 (337)
+.....|..|.+.|++++-..|+.|+.-|.++..|-.|++..+ ...+|.|..|.+.|.+...|..|+..|- .-|+|+
T Consensus 189 ~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv-n~ykCp 267 (467)
T KOG3608|consen 189 MGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV-NCYKCP 267 (467)
T ss_pred hccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh-hccccc
Confidence 4444567778999999999999999999999999999997543 5679999999999999999999999874 458999
Q ss_pred ccCCCCCCHHHHHHHHH-HhCCCCCccccccchhccCHHHHHHHHHhccCCCCeecCC--CccccCCchHHHHHHHhhCC
Q psy13793 240 DCNRVYPTNSTLRAHMI-SHSTERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPF--CPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 240 ~C~~~f~~~~~l~~H~~-~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~k~f~~~~~L~~H~~~~H~ 316 (337)
.|......+++|..|++ .|...|||+|..|++.|.+.++|.+|..+|+ +-.|+|.. |...|.+...|.+|++.+|.
T Consensus 268 lCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhE 346 (467)
T KOG3608|consen 268 LCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHE 346 (467)
T ss_pred ccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhcc
Confidence 99999999999999987 5888999999999999999999999999998 78899998 99999999999999989997
Q ss_pred CCCC
Q psy13793 317 GETL 320 (337)
Q Consensus 317 ~~~~ 320 (337)
|..+
T Consensus 347 g~np 350 (467)
T KOG3608|consen 347 GNNP 350 (467)
T ss_pred CCCC
Confidence 7643
No 7
>KOG1074|consensus
Probab=99.84 E-value=3.1e-22 Score=185.35 Aligned_cols=85 Identities=31% Similarity=0.658 Sum_probs=76.8
Q ss_pred ccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhccCC----CCeecC---CCccccCCchHHH
Q psy13793 236 LSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHTGE----RPYKCP---FCPRTFASSGNCF 308 (337)
Q Consensus 236 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~----~~~~C~---~C~k~f~~~~~L~ 308 (337)
-.|-+|.++..-++.|+.|++.|+||+||+|.+||++|+++.+|+.|+-+|-.. -+|.|+ +|-+.|+..-.|.
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lp 685 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLP 685 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccccc
Confidence 459999999999999999999999999999999999999999999999998665 358999 9999999999999
Q ss_pred HHHHhhCCCCCCC
Q psy13793 309 SHRKRMHEGETLP 321 (337)
Q Consensus 309 ~H~~~~H~~~~~~ 321 (337)
.|+ ++|.+-.++
T Consensus 686 QhI-riH~~~~~s 697 (958)
T KOG1074|consen 686 QHI-RIHLGGQIS 697 (958)
T ss_pred ceE-EeecCCCCC
Confidence 997 999854433
No 8
>KOG3576|consensus
Probab=99.72 E-value=2.9e-18 Score=133.91 Aligned_cols=108 Identities=31% Similarity=0.586 Sum_probs=61.2
Q ss_pred cccccccccccCHHHHHHHHHhcCC-ccccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhcc
Q psy13793 209 HTCEICFRIFKTEINLTRHLECHDT-VELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHT 287 (337)
Q Consensus 209 ~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~ 287 (337)
|.|.+||+.|.....|.+|++.|.. +.+-|..||+.|.+...|.+|+++|+|.+||+|..|+++|++.-.|..|++.-+
T Consensus 118 ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvh 197 (267)
T KOG3576|consen 118 FTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVH 197 (267)
T ss_pred eeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHc
Confidence 4555555555555555555555443 334555555555555556666666666666666666666666666666655322
Q ss_pred C-----------CCCeecCCCccccCCchHHHHHHHhhCC
Q psy13793 288 G-----------ERPYKCPFCPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 288 ~-----------~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 316 (337)
| .+.|.|+.||.+-.....+..|++..|+
T Consensus 198 gv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 198 GVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred CchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 2 2456666666666666666666655554
No 9
>KOG3576|consensus
Probab=99.66 E-value=2.8e-17 Score=128.46 Aligned_cols=114 Identities=25% Similarity=0.518 Sum_probs=104.4
Q ss_pred CCCcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCC-ccccCCccCCCCCCHHHHHHHH
Q psy13793 177 SGTRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDT-VELSCPDCNRVYPTNSTLRAHM 255 (337)
Q Consensus 177 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~~~~l~~H~ 255 (337)
+...|.|.+|++.|.....|.+|++-|...+.|.|..||+.|.....|++|+++|++ ++|+|..|+++|+.+-+|..|+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 356699999999999999999999999999999999999999999999999999998 8999999999999999999998
Q ss_pred HHhCC-----------CCCccccccchhccCHHHHHHHHHhccCCC
Q psy13793 256 ISHST-----------ERPHNCPHCEKTFKRNQDLKFHLNQHTGER 290 (337)
Q Consensus 256 ~~h~~-----------~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 290 (337)
..-+| .|.|.|+.||.+-.....+..|++.|+..-
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 75443 467999999999999999999999876543
No 10
>KOG3623|consensus
Probab=99.57 E-value=1e-15 Score=140.39 Aligned_cols=118 Identities=29% Similarity=0.631 Sum_probs=77.4
Q ss_pred ccccccccccCCHHHHHHHHHHcC--CCCccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHHHh
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHK--PKGKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMISH 258 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h 258 (337)
..|++|.+.+.....|+.|++..+ .+..|.|..|..+|.++..|.+|+.+|....- .- .+...-
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~d-------qa-------~sltqs 276 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGD-------QA-------ISLTQS 276 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCc-------cc-------ccccch
Confidence 446666666655566666654321 23345555555555555555555555532110 00 011111
Q ss_pred CCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHH
Q psy13793 259 STERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRK 312 (337)
Q Consensus 259 ~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~ 312 (337)
.+.+.|+|.+||++|..+..|+.|+|+|.|++||.|+-|+|+|+..++...||.
T Consensus 277 a~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 277 ALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 133669999999999999999999999999999999999999999999888874
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.35 E-value=2.3e-12 Score=119.15 Aligned_cols=104 Identities=23% Similarity=0.465 Sum_probs=91.4
Q ss_pred CccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhcc----------CH
Q psy13793 207 GKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFK----------RN 276 (337)
Q Consensus 207 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~----------~~ 276 (337)
.++.|..|++.|. ...|..|+..++ .++.|+ ||+.+ .+..|..|+..|.+.+|+.|..|++.|. ..
T Consensus 452 ~H~~C~~Cgk~f~-~s~LekH~~~~H-kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~ 527 (567)
T PLN03086 452 NHVHCEKCGQAFQ-QGEMEKHMKVFH-EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRL 527 (567)
T ss_pred cCccCCCCCCccc-hHHHHHHHHhcC-CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhh
Confidence 4578999999996 678999999986 789999 99755 6799999999999999999999999985 24
Q ss_pred HHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhhCC
Q psy13793 277 QDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 277 ~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 316 (337)
+.|..|..++ |.+++.|..||+.|..+ .|..|+..+|.
T Consensus 528 s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 528 RGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred hhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence 6899999986 99999999999999877 68999988885
No 12
>PHA00733 hypothetical protein
Probab=99.23 E-value=1.3e-11 Score=93.58 Aligned_cols=55 Identities=27% Similarity=0.593 Sum_probs=41.9
Q ss_pred CCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhhC
Q psy13793 259 STERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 259 ~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
.+.+||.|..|++.|.+.+.|..|++.| +.+|.|++|++.|.....|..|+...|
T Consensus 69 ~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 69 KAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred CCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence 3467788888888888888888887765 346788888888888888888876666
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.09 E-value=6.1e-10 Score=103.37 Aligned_cols=98 Identities=13% Similarity=0.388 Sum_probs=82.8
Q ss_pred ccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCC-ccccCCccCCCCCC----------HH
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDT-VELSCPDCNRVYPT----------NS 249 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~----------~~ 249 (337)
+.|+.|++.|. ...|..|+..++ .++.|. ||..+ ....|..|+..|.. +++.|..|++.|.. .+
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s 528 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLR 528 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhh
Confidence 68999999996 688999999986 789999 99765 67899999998875 78899999999952 45
Q ss_pred HHHHHHHHhCCCCCccccccchhccCHHHHHHHHHh
Q psy13793 250 TLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQ 285 (337)
Q Consensus 250 ~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~ 285 (337)
.|..|.... |.+++.|..|++.+... .|..|+..
T Consensus 529 ~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 529 GMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIA 562 (567)
T ss_pred hHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHH
Confidence 899998885 99999999999888764 57778764
No 14
>PHA00733 hypothetical protein
Probab=99.07 E-value=2.3e-10 Score=86.76 Aligned_cols=60 Identities=22% Similarity=0.473 Sum_probs=40.9
Q ss_pred HHHhcCCccccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhccC
Q psy13793 227 HLECHDTVELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHTG 288 (337)
Q Consensus 227 H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~ 288 (337)
|+..+...+|.|+.|++.|.+...|..|++.| +.+|.|.+|++.|.....|..|++..++
T Consensus 65 ~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 65 LLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred hcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 33444456677777777777777777777655 3467777777777777777777776554
No 15
>PHA02768 hypothetical protein; Provisional
Probab=99.05 E-value=7.9e-11 Score=73.29 Aligned_cols=43 Identities=23% Similarity=0.542 Sum_probs=36.8
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHH
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCF 308 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~ 308 (337)
|.|+.||+.|.+.+.|..|+++|+ ++|+|..|++.|.+.+.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 789999999999999999999888 6888999999888777654
No 16
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.85 E-value=2.3e-09 Score=57.08 Aligned_cols=26 Identities=50% Similarity=1.307 Sum_probs=20.6
Q ss_pred HHHHHHHhccCCCCeecCCCccccCC
Q psy13793 278 DLKFHLNQHTGERPYKCPFCPRTFAS 303 (337)
Q Consensus 278 ~L~~H~~~H~~~~~~~C~~C~k~f~~ 303 (337)
+|.+|+++|+|++||.|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47788888888888888888888863
No 17
>KOG3993|consensus
Probab=98.84 E-value=3.5e-10 Score=98.52 Aligned_cols=52 Identities=27% Similarity=0.503 Sum_probs=43.6
Q ss_pred ccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcC
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHD 232 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 232 (337)
|.|..|...|.+...|-+|.-.-.-..-|.|.+|++.|+-..+|..|.++|-
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK 319 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK 319 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence 8999999999999999999854333345899999999999999999999884
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.82 E-value=1.6e-09 Score=67.51 Aligned_cols=44 Identities=18% Similarity=0.488 Sum_probs=35.0
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHH
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLK 280 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~ 280 (337)
.|.|+.||+.|...+.|..|+++|+ ++|+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4678888888888888888888887 6888888888888766654
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.60 E-value=2.9e-08 Score=52.83 Aligned_cols=26 Identities=46% Similarity=0.981 Sum_probs=23.9
Q ss_pred HHHHHHHHhCCCCCccccccchhccC
Q psy13793 250 TLRAHMISHSTERPHNCPHCEKTFKR 275 (337)
Q Consensus 250 ~l~~H~~~h~~~k~~~C~~C~~~f~~ 275 (337)
+|..|+++|++++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58899999999999999999999964
No 20
>KOG3993|consensus
Probab=98.58 E-value=4.4e-09 Score=91.77 Aligned_cols=215 Identities=16% Similarity=0.204 Sum_probs=114.4
Q ss_pred eecCCCCCcCCChhHHHhHH--hhccCccccCcccceeeecccccccccccccccccccCCCCCCccCCCCCcccCcccc
Q psy13793 77 YQCAICPRGYYSDIGLQNHL--WSHRKILKNSNVGIYYTFTEDLREREDGEKRKHWIDENGGGENEMETPTPKKIGIREK 154 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~--~~H~~~~~c~~c~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (337)
|.|..|...|.....|.+|. ++=+.+|+|++|++.|.....|..|..=.+....-...+... ++.. ...+
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P-------~k~~-~~~r 339 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPP-------PKQA-VETR 339 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCC-------hhhh-hhhh
Confidence 99999999999999999997 445668999999999999999988763222211111101000 0000 0000
Q ss_pred ccccccccCccccccccccccCCCCcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCCc
Q psy13793 155 VNWKKTKITASNLLSYNNAMFSSGTRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDTV 234 (337)
Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 234 (337)
.........- .-+.+..|.|..|++.|.....|+.|+..|+....-.=. ...|..... ...
T Consensus 340 ae~~ea~rsg---------~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~--a~~f~~s~~--------~~l 400 (500)
T KOG3993|consen 340 AEVQEAERSG---------DDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK--APKFLLSRV--------IPL 400 (500)
T ss_pred hhhhhccccC---------CcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc--ccCcchhhc--------ccc
Confidence 0000000000 002234688999999999999999998777532110000 011111000 001
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCC-CCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHh
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHST-ERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKR 313 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~-~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~ 313 (337)
-+.|..|+-.+.....-..+...+.+ ..-..|..|+..+.++..=..+.+.-..+-.|.|.+|.-.|.+...|.+|+..
T Consensus 401 ~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 401 MHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred cccccccccccccccccccceeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhh
Confidence 12233333333222211111111111 11134555665555554444444443445578888898888888889999888
Q ss_pred hCCCC
Q psy13793 314 MHEGE 318 (337)
Q Consensus 314 ~H~~~ 318 (337)
.|..+
T Consensus 481 ~Hpse 485 (500)
T KOG3993|consen 481 CHPSE 485 (500)
T ss_pred cChHH
Confidence 88755
No 21
>PHA00616 hypothetical protein
Probab=98.55 E-value=3.6e-08 Score=58.38 Aligned_cols=39 Identities=15% Similarity=0.303 Sum_probs=31.4
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
||.|+.||+.|...+.|..|++.|+|++++.|++--..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEE
Confidence 688888888888888888888888888888887644443
No 22
>PHA00732 hypothetical protein
Probab=98.38 E-value=2.2e-07 Score=63.78 Aligned_cols=43 Identities=33% Similarity=0.551 Sum_probs=23.8
Q ss_pred CccccccchhccCHHHHHHHHHh-ccCCCCeecCCCccccCCchHHHHHH
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQ-HTGERPYKCPFCPRTFASSGNCFSHR 311 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~C~~C~k~f~~~~~L~~H~ 311 (337)
||.|..|++.|.+.+.|..|++. |+ ++.|+.||+.|. .|..|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhh
Confidence 45566666666666666666553 33 235666666665 245554
No 23
>PHA00616 hypothetical protein
Probab=98.28 E-value=4.1e-07 Score=53.93 Aligned_cols=39 Identities=28% Similarity=0.451 Sum_probs=33.9
Q ss_pred cccccccccccCCHHHHHHHHHHcCCCCccccccccccc
Q psy13793 180 RYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIF 218 (337)
Q Consensus 180 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 218 (337)
||+|+.||+.|.+++.|..|++.|++++++.|+.--..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEE
Confidence 589999999999999999999999999999987644444
No 24
>PHA00732 hypothetical protein
Probab=98.12 E-value=2.1e-06 Score=58.99 Aligned_cols=48 Identities=27% Similarity=0.575 Sum_probs=32.1
Q ss_pred cccCCccCCCCCCHHHHHHHHHH-hCCCCCccccccchhccCHHHHHHHHHhccC
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMIS-HSTERPHNCPHCEKTFKRNQDLKFHLNQHTG 288 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~ 288 (337)
+|.|..|++.|.....|..|++. |. ++.|+.|++.|. .|..|.+++.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccCC
Confidence 35677777777777777777764 43 356777877777 36667765443
No 25
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.00 E-value=1.1e-05 Score=51.38 Aligned_cols=49 Identities=29% Similarity=0.567 Sum_probs=28.5
Q ss_pred ccccccchhccCHHHHHHHHH-hccCC-CCeecCCCccccCCchHHHHHHHhhC
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLN-QHTGE-RPYKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~-~H~~~-~~~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
|.|+.|++ ..+...|..|.. .|..+ +.+.|++|...++ .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 56666666 344556666655 34443 3566777766544 36677765555
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.85 E-value=4e-05 Score=48.79 Aligned_cols=50 Identities=28% Similarity=0.589 Sum_probs=35.1
Q ss_pred cccCCccCCCCCCHHHHHHHHHH-hCC-CCCccccccchhccCHHHHHHHHHhcc
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMIS-HST-ERPHNCPHCEKTFKRNQDLKFHLNQHT 287 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~-h~~-~k~~~C~~C~~~f~~~~~L~~H~~~H~ 287 (337)
.|.|+.|++ ..+...|..|... |.. .+.+.|++|...+. .+|..|+..++
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 477888888 4556778888654 444 35688888887654 38888887654
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.83 E-value=9.8e-06 Score=41.69 Aligned_cols=21 Identities=43% Similarity=0.980 Sum_probs=13.8
Q ss_pred eecCCCccccCCchHHHHHHH
Q psy13793 292 YKCPFCPRTFASSGNCFSHRK 312 (337)
Q Consensus 292 ~~C~~C~k~f~~~~~L~~H~~ 312 (337)
|+|++|++.|.++..|..|++
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHh
Confidence 566666666666666666653
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.82 E-value=8.5e-06 Score=41.94 Aligned_cols=23 Identities=35% Similarity=0.889 Sum_probs=21.5
Q ss_pred eecCCCCCcCCChhHHHhHHhhc
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWSH 99 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~H 99 (337)
|.|+.|++.|.+...|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999874
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.67 E-value=1.4e-05 Score=67.80 Aligned_cols=52 Identities=31% Similarity=0.842 Sum_probs=44.7
Q ss_pred CCCCccccc--cchhccCHHHHHHHHHh-cc------------------CCCCeecCCCccccCCchHHHHHH
Q psy13793 260 TERPHNCPH--CEKTFKRNQDLKFHLNQ-HT------------------GERPYKCPFCPRTFASSGNCFSHR 311 (337)
Q Consensus 260 ~~k~~~C~~--C~~~f~~~~~L~~H~~~-H~------------------~~~~~~C~~C~k~f~~~~~L~~H~ 311 (337)
++|||+|++ |.+.|.....|+.|+.- |- ..+||+|++|+|.|..-..|.-|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 359999987 99999999999999763 41 248999999999999999999994
No 30
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.63 E-value=2.7e-05 Score=41.76 Aligned_cols=25 Identities=32% Similarity=0.656 Sum_probs=23.4
Q ss_pred ceecCCCCCcCCChhHHHhHHhhcc
Q psy13793 76 PYQCAICPRGYYSDIGLQNHLWSHR 100 (337)
Q Consensus 76 ~~~C~~C~~~F~~~~~L~~H~~~H~ 100 (337)
||.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 6899999999999999999998875
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.62 E-value=4.9e-05 Score=39.32 Aligned_cols=24 Identities=29% Similarity=0.802 Sum_probs=13.5
Q ss_pred eecCCCccccCCchHHHHHHHhhC
Q psy13793 292 YKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 292 ~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
|.|++|++.|.+...|..|+...|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 556666666666666666654433
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.57 E-value=4.9e-05 Score=39.34 Aligned_cols=24 Identities=33% Similarity=0.854 Sum_probs=20.1
Q ss_pred eecCCCCCcCCChhHHHhHHhhcc
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWSHR 100 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~H~ 100 (337)
|.|++|++.|.+...|..|+..|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 789999999999999999998763
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.51 E-value=8.3e-05 Score=54.05 Aligned_cols=72 Identities=22% Similarity=0.436 Sum_probs=17.1
Q ss_pred CCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhh
Q psy13793 238 CPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRM 314 (337)
Q Consensus 238 C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~ 314 (337)
|..|+..|.+...|..|+...++-.. . ....+.....+..+.+.-. ...+.|.+|++.|.+...|..|++..
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred cccccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 66677777777777777654333211 0 1112223333333433211 22677777777777777777777544
No 34
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.47 E-value=7.4e-05 Score=54.33 Aligned_cols=23 Identities=35% Similarity=0.836 Sum_probs=15.7
Q ss_pred CccccccchhccCHHHHHHHHHh
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQ 285 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~ 285 (337)
.+.|..|++.|.+...|..|++.
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHH
T ss_pred CCCCCccCCCCcCHHHHHHHHcC
Confidence 56777777777777777777764
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.45 E-value=7.5e-05 Score=40.02 Aligned_cols=21 Identities=33% Similarity=0.768 Sum_probs=11.2
Q ss_pred CeecCCCccccCCchHHHHHH
Q psy13793 291 PYKCPFCPRTFASSGNCFSHR 311 (337)
Q Consensus 291 ~~~C~~C~k~f~~~~~L~~H~ 311 (337)
||+|..|++.|.+...|..|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~ 21 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHK 21 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHH
T ss_pred CCCCCccCCccCChhHHHHHh
Confidence 355555555555555555553
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.93 E-value=0.00012 Score=62.30 Aligned_cols=69 Identities=22% Similarity=0.457 Sum_probs=41.8
Q ss_pred CCcccccc--ccccccCHHHHHHHHHhcCCcc--ccCCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHH
Q psy13793 206 KGKHTCEI--CFRIFKTEINLTRHLECHDTVE--LSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKF 281 (337)
Q Consensus 206 ~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~--~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~ 281 (337)
++||+|++ |.+.+.....|+.|+.--+..+ ..-+ .-..|...-...|||+|++|++.|.....|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence 36677765 7777777777777664332111 0000 01111111235699999999999999999998
Q ss_pred HHH
Q psy13793 282 HLN 284 (337)
Q Consensus 282 H~~ 284 (337)
|..
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 854
No 37
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.83 E-value=0.0011 Score=34.72 Aligned_cols=19 Identities=32% Similarity=0.688 Sum_probs=9.7
Q ss_pred ecCCCccccCCchHHHHHH
Q psy13793 293 KCPFCPRTFASSGNCFSHR 311 (337)
Q Consensus 293 ~C~~C~k~f~~~~~L~~H~ 311 (337)
+|+.|++.|.....|..|+
T Consensus 2 ~C~~C~~~f~~~~~l~~H~ 20 (26)
T smart00355 2 RCPECGKVFKSKSALKEHM 20 (26)
T ss_pred CCCCCcchhCCHHHHHHHH
Confidence 4555555555555555553
No 38
>PRK04860 hypothetical protein; Provisional
Probab=96.82 E-value=0.00053 Score=54.05 Aligned_cols=40 Identities=28% Similarity=0.608 Sum_probs=33.1
Q ss_pred CCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCch
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSG 305 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~ 305 (337)
-+|.|. |+. ....+.+|.++|+++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 368998 887 6777889999999999999999998886543
No 39
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.80 E-value=0.00091 Score=35.01 Aligned_cols=24 Identities=29% Similarity=0.735 Sum_probs=21.9
Q ss_pred eecCCCCCcCCChhHHHhHHhhcc
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWSHR 100 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~H~ 100 (337)
|.|+.|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 679999999999999999998764
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.71 E-value=0.0015 Score=39.67 Aligned_cols=28 Identities=25% Similarity=0.650 Sum_probs=10.4
Q ss_pred CCccccccchhccCHHHHHHHHHhccCC
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGE 289 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~ 289 (337)
.|-.|++|+..+.+..+|.+|+..+++.
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~ 50 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFK 50 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTT
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhcc
Confidence 3444444444444444444444443333
No 41
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.61 E-value=0.001 Score=34.73 Aligned_cols=23 Identities=35% Similarity=0.803 Sum_probs=21.1
Q ss_pred eecCCCCCcCCChhHHHhHHhhc
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWSH 99 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~H 99 (337)
|.|++|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 68999999999999999999754
No 42
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.50 E-value=0.0035 Score=38.13 Aligned_cols=32 Identities=38% Similarity=0.638 Sum_probs=18.9
Q ss_pred CCCcccccccccccCCHHHHHHHHHHcCCCCc
Q psy13793 177 SGTRYQCPLCNKKINTKSNLRRHLQAHKPKGK 208 (337)
Q Consensus 177 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 208 (337)
...|-.|++|+..+.+..+|++|+.++++.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34566788888888888888888777666554
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.48 E-value=0.0021 Score=33.16 Aligned_cols=23 Identities=43% Similarity=0.814 Sum_probs=13.2
Q ss_pred eecCCCccccCCchHHHHHHHhhC
Q psy13793 292 YKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 292 ~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
|+|+.|+.... ...|.+|++..|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56666666665 666666665544
No 44
>KOG2231|consensus
Probab=96.36 E-value=0.011 Score=56.57 Aligned_cols=32 Identities=31% Similarity=0.477 Sum_probs=22.6
Q ss_pred eecCCCCCcCC---------------ChhHHHhHHhhccCccccCcc
Q psy13793 77 YQCAICPRGYY---------------SDIGLQNHLWSHRKILKNSNV 108 (337)
Q Consensus 77 ~~C~~C~~~F~---------------~~~~L~~H~~~H~~~~~c~~c 108 (337)
+.|.+|++.|. +...|+.|+..-+..+.|..|
T Consensus 100 ~~C~~C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H~~~~c~lC 146 (669)
T KOG2231|consen 100 HSCHICDRRFRALYNKKECLHCTEFKSVENLKNHMRDQHKLHLCSLC 146 (669)
T ss_pred hhcCccccchhhhcccCCCccccchhHHHHHHHHHHHhhhhhccccc
Confidence 66777777773 788999999655555555555
No 45
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.20 E-value=0.0019 Score=34.42 Aligned_cols=22 Identities=27% Similarity=0.626 Sum_probs=20.4
Q ss_pred eecCCCCCcCCChhHHHhHHhh
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
|.|..|++.|.+...|..|+..
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999865
No 46
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.11 E-value=0.0032 Score=32.78 Aligned_cols=20 Identities=35% Similarity=0.795 Sum_probs=9.6
Q ss_pred cccccchhccCHHHHHHHHH
Q psy13793 265 NCPHCEKTFKRNQDLKFHLN 284 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~ 284 (337)
.|.+|++.|.+...|..|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 34444444444444444444
No 47
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.96 E-value=0.0048 Score=31.79 Aligned_cols=23 Identities=39% Similarity=0.839 Sum_probs=14.0
Q ss_pred ccccccchhccCHHHHHHHHHhcc
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHT 287 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~ 287 (337)
|.|+.|+.... ...|..|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56777776666 667777776654
No 48
>PRK04860 hypothetical protein; Provisional
Probab=95.59 E-value=0.0068 Score=47.84 Aligned_cols=38 Identities=21% Similarity=0.524 Sum_probs=33.1
Q ss_pred cccccccccccCCHHHHHHHHHHcCCCCccccccccccccCH
Q psy13793 180 RYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTE 221 (337)
Q Consensus 180 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 221 (337)
+|.|. |+. ....+.+|.++|+++++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 59998 987 678889999999999999999999988653
No 49
>KOG1146|consensus
Probab=95.22 E-value=0.0068 Score=61.52 Aligned_cols=147 Identities=16% Similarity=0.192 Sum_probs=91.8
Q ss_pred cccCCCCcccccccccccCCHHHHHHHHHHcC-------------------------CCCccccccccccccCHHHHHHH
Q psy13793 173 AMFSSGTRYQCPLCNKKINTKSNLRRHLQAHK-------------------------PKGKHTCEICFRIFKTEINLTRH 227 (337)
Q Consensus 173 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~-------------------------~~~~~~C~~C~~~f~~~~~l~~H 227 (337)
.+++-.+-|+|+.|++.|.....|..|++..+ +.++|.|..|..+++...+|..|
T Consensus 458 ~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Lsih 537 (1406)
T KOG1146|consen 458 VLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIH 537 (1406)
T ss_pred eeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHH
Confidence 34455578999999999999999999998733 22457777777777777777777
Q ss_pred HHhc--CCc---------------cc-cCCccCCCCC--CHHHHHHHHHHhCCC-CCccccccchhccCHHHHHHHHH-h
Q psy13793 228 LECH--DTV---------------EL-SCPDCNRVYP--TNSTLRAHMISHSTE-RPHNCPHCEKTFKRNQDLKFHLN-Q 285 (337)
Q Consensus 228 ~~~h--~~~---------------~~-~C~~C~~~f~--~~~~l~~H~~~h~~~-k~~~C~~C~~~f~~~~~L~~H~~-~ 285 (337)
+..- ... +. .|..+...-. ...... .... -.|.|.+|++.-+-...|..|+. .
T Consensus 538 lqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~-----~pktkP~~~C~vc~yetniarnlrihmtss 612 (1406)
T KOG1146|consen 538 LQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSS-----GPKTKPSWRCEVCSYETNIARNLRIHMTAS 612 (1406)
T ss_pred HHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCC-----CCCCCCCcchhhhcchhhhhhccccccccC
Confidence 6532 000 00 0111110000 000000 0111 23889999999998899999987 5
Q ss_pred ccCCCCeecCCCccccCCchHHHHHHHhhCCCCCCCCCCc
Q psy13793 286 HTGERPYKCPFCPRTFASSGNCFSHRKRMHEGETLPDISL 325 (337)
Q Consensus 286 H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H~~~~~~~~~~ 325 (337)
++...|..|-.|+-.+.....|..+ .+.+.....+..+-
T Consensus 613 ~~s~~p~~~Lq~~it~~l~~~~~~~-~~lp~tps~~~~s~ 651 (1406)
T KOG1146|consen 613 PSSSPPSLVLQQNITSSLASLLGGQ-GRLPFTPSSPPSSD 651 (1406)
T ss_pred CCCCChHHHhhhcchhhccccccCc-CCCCCCCCCCcccc
Confidence 6666778888888888887777777 46774444444333
No 50
>KOG2231|consensus
Probab=94.92 E-value=0.039 Score=52.94 Aligned_cols=121 Identities=22% Similarity=0.511 Sum_probs=77.0
Q ss_pred ccccccccccC---------------CHHHHHHHHHHcCCCCccccccc---------cccccCHHHHHHHHHhcCC---
Q psy13793 181 YQCPLCNKKIN---------------TKSNLRRHLQAHKPKGKHTCEIC---------FRIFKTEINLTRHLECHDT--- 233 (337)
Q Consensus 181 ~~C~~C~~~f~---------------~~~~l~~H~~~h~~~~~~~C~~C---------~~~f~~~~~l~~H~~~h~~--- 233 (337)
+.|.+|+..|. ....|+.|+..-+. .+.|.+| .....+...|..|+..-..
T Consensus 100 ~~C~~C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~ 177 (669)
T KOG2231|consen 100 HSCHICDRRFRALYNKKECLHCTEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDE 177 (669)
T ss_pred hhcCccccchhhhcccCCCccccchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccc
Confidence 56777776663 66788899854222 2344443 2233456777777765322
Q ss_pred --c-cccCCccCCCCCCHHHHHHHHHHhCCCCCccccccc------hhccCHHHHHHHHHhccCCCCeecC--CCc-ccc
Q psy13793 234 --V-ELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCE------KTFKRNQDLKFHLNQHTGERPYKCP--FCP-RTF 301 (337)
Q Consensus 234 --~-~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~------~~f~~~~~L~~H~~~H~~~~~~~C~--~C~-k~f 301 (337)
+ ...|..|...|.....|..|++.++ |.|..|. .-|.....|..|.|.+| |.|. .|. +.|
T Consensus 178 s~rGhp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f 249 (669)
T KOG2231|consen 178 SCRGHPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKF 249 (669)
T ss_pred cccCCccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----cccccccccccee
Confidence 1 2469999999999999999988765 6676664 46788889999988654 4565 454 334
Q ss_pred CCchHHHHHH
Q psy13793 302 ASSGNCFSHR 311 (337)
Q Consensus 302 ~~~~~L~~H~ 311 (337)
.....+..|+
T Consensus 250 ~~~~~~ei~l 259 (669)
T KOG2231|consen 250 YVAFELEIEL 259 (669)
T ss_pred eehhHHHHHH
Confidence 3333344443
No 51
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.84 E-value=0.042 Score=47.73 Aligned_cols=127 Identities=24% Similarity=0.508 Sum_probs=89.7
Q ss_pred cccccc--cccccCCHHHHHHHHHHcCCCCccccccccc---ccc------CHHHHHHHHHhcCCc-c----ccCCccCC
Q psy13793 180 RYQCPL--CNKKINTKSNLRRHLQAHKPKGKHTCEICFR---IFK------TEINLTRHLECHDTV-E----LSCPDCNR 243 (337)
Q Consensus 180 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~---~f~------~~~~l~~H~~~h~~~-~----~~C~~C~~ 243 (337)
.|.|+. |..+......|..|.+..++. +.|.+|-. .|. +...|..|...-.+. . ..|..|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 378875 777777788999999876653 67777642 333 345666666543322 2 36999999
Q ss_pred CCCCHHHHHHHHHHhCCCCCccccccch-------hccCHHHHHHHHHhccCCCCeecCC--Cc----cccCCchHHHHH
Q psy13793 244 VYPTNSTLRAHMISHSTERPHNCPHCEK-------TFKRNQDLKFHLNQHTGERPYKCPF--CP----RTFASSGNCFSH 310 (337)
Q Consensus 244 ~f~~~~~l~~H~~~h~~~k~~~C~~C~~-------~f~~~~~L~~H~~~H~~~~~~~C~~--C~----k~f~~~~~L~~H 310 (337)
.|.+...|..|++..+ -.|-+|++ -|....+|..|.+.- -|.|.+ |- ..|.....|+.|
T Consensus 229 ~FYdDDEL~~HcR~~H----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~~el~~h 300 (493)
T COG5236 229 YFYDDDELRRHCRLRH----EACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYHTELLEH 300 (493)
T ss_pred eecChHHHHHHHHhhh----hhhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccHHHHHHH
Confidence 9999999999988543 35777765 478888999998742 244543 32 468888899999
Q ss_pred HHhhCC
Q psy13793 311 RKRMHE 316 (337)
Q Consensus 311 ~~~~H~ 316 (337)
+-+.|.
T Consensus 301 ~~~~h~ 306 (493)
T COG5236 301 LTRFHK 306 (493)
T ss_pred HHHHhh
Confidence 988885
No 52
>KOG1146|consensus
Probab=94.78 E-value=0.0091 Score=60.63 Aligned_cols=57 Identities=21% Similarity=0.447 Sum_probs=43.8
Q ss_pred CCCCccccccchhccCHHHHHHHHHh--ccC-----------------------------------------CCCeecCC
Q psy13793 260 TERPHNCPHCEKTFKRNQDLKFHLNQ--HTG-----------------------------------------ERPYKCPF 296 (337)
Q Consensus 260 ~~k~~~C~~C~~~f~~~~~L~~H~~~--H~~-----------------------------------------~~~~~C~~ 296 (337)
+.+||.|..|..+++....|.+|+.. |-. +-.+.|.+
T Consensus 515 ~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~v 594 (1406)
T KOG1146|consen 515 PGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEV 594 (1406)
T ss_pred CCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhh
Confidence 34789999999999999999999873 411 11489999
Q ss_pred CccccCCchHHHHHHHhhCC
Q psy13793 297 CPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 297 C~k~f~~~~~L~~H~~~~H~ 316 (337)
|+..-+-..+|+.|+...|.
T Consensus 595 c~yetniarnlrihmtss~~ 614 (1406)
T KOG1146|consen 595 CSYETNIARNLRIHMTASPS 614 (1406)
T ss_pred hcchhhhhhccccccccCCC
Confidence 99998888888888765544
No 53
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.68 E-value=0.0089 Score=31.80 Aligned_cols=21 Identities=33% Similarity=0.756 Sum_probs=11.5
Q ss_pred ccccccchhccCHHHHHHHHH
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLN 284 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~ 284 (337)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 445555555555555555544
No 54
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.78 E-value=0.054 Score=30.63 Aligned_cols=23 Identities=22% Similarity=0.640 Sum_probs=20.8
Q ss_pred ceecCCCCCcCCChhHHHhHHhh
Q psy13793 76 PYQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 76 ~~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
+|.|++|++.|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 68999999999999999999853
No 55
>KOG2785|consensus
Probab=93.71 E-value=0.15 Score=45.28 Aligned_cols=133 Identities=21% Similarity=0.407 Sum_probs=91.0
Q ss_pred cccccccccccCCHHHHHHHHHH--cC-----------------------------------CCCccccccccccccCHH
Q psy13793 180 RYQCPLCNKKINTKSNLRRHLQA--HK-----------------------------------PKGKHTCEICFRIFKTEI 222 (337)
Q Consensus 180 ~~~C~~C~~~f~~~~~l~~H~~~--h~-----------------------------------~~~~~~C~~C~~~f~~~~ 222 (337)
.|.|..|...|.+...-+.|.++ |. ++.++.|..|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 38899999999999888888764 31 123488999999999998
Q ss_pred HHHHHHHhcCC------------------cccc-------------CCccCCCCCCHHHHHHHHHHh------------C
Q psy13793 223 NLTRHLECHDT------------------VELS-------------CPDCNRVYPTNSTLRAHMISH------------S 259 (337)
Q Consensus 223 ~l~~H~~~h~~------------------~~~~-------------C~~C~~~f~~~~~l~~H~~~h------------~ 259 (337)
+...|+..... .... +..+-..+........+...- .
T Consensus 83 a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~ 162 (390)
T KOG2785|consen 83 AHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDED 162 (390)
T ss_pred hHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhc
Confidence 88888754210 0111 222322333222222221110 0
Q ss_pred CCCCccccccchhccCHHHHHHHHHhccCC-----------------------CCeecCCCc---cccCCchHHHHHHH
Q psy13793 260 TERPHNCPHCEKTFKRNQDLKFHLNQHTGE-----------------------RPYKCPFCP---RTFASSGNCFSHRK 312 (337)
Q Consensus 260 ~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~-----------------------~~~~C~~C~---k~f~~~~~L~~H~~ 312 (337)
..-|-.|-.|++.+.+...-..||-.++|- .-|.|-.|+ +.|.+-...+.||.
T Consensus 163 e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 163 ELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred ccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 123567999999999999999999987762 348899999 99999999999986
No 56
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=93.52 E-value=0.11 Score=45.13 Aligned_cols=34 Identities=21% Similarity=0.293 Sum_probs=25.5
Q ss_pred ceecCC--CCCcCCChhHHHhHHhhccCccccCccc
Q psy13793 76 PYQCAI--CPRGYYSDIGLQNHLWSHRKILKNSNVG 109 (337)
Q Consensus 76 ~~~C~~--C~~~F~~~~~L~~H~~~H~~~~~c~~c~ 109 (337)
.|.|+. |.........|..|.+..++.+-|..|.
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~ 186 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECI 186 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhh
Confidence 488864 7777777889999988777766666664
No 57
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.49 E-value=0.061 Score=27.96 Aligned_cols=20 Identities=30% Similarity=0.579 Sum_probs=17.3
Q ss_pred eecCCCCCcCCChhHHHhHHh
Q psy13793 77 YQCAICPRGYYSDIGLQNHLW 97 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~ 97 (337)
..|+.||+.| ....|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 5799999999 7888988974
No 58
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.67 E-value=0.1 Score=27.10 Aligned_cols=8 Identities=50% Similarity=1.555 Sum_probs=3.3
Q ss_pred ccccchhc
Q psy13793 266 CPHCEKTF 273 (337)
Q Consensus 266 C~~C~~~f 273 (337)
|+.||+.|
T Consensus 5 C~~CgR~F 12 (25)
T PF13913_consen 5 CPICGRKF 12 (25)
T ss_pred CCCCCCEE
Confidence 44444444
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.58 E-value=0.015 Score=54.09 Aligned_cols=55 Identities=27% Similarity=0.535 Sum_probs=50.7
Q ss_pred CcccccccccccCCHHHHHHHHH--HcCCC--Cccccc--cccccccCHHHHHHHHHhcCC
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQ--AHKPK--GKHTCE--ICFRIFKTEINLTRHLECHDT 233 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~ 233 (337)
.++.|..|...|.....|..|.+ .|.++ +++.|. .|++.|.....+..|...|..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 348 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccC
Confidence 46899999999999999999999 89999 999999 799999999999999988865
No 60
>KOG2893|consensus
Probab=92.06 E-value=0.038 Score=45.45 Aligned_cols=41 Identities=27% Similarity=0.550 Sum_probs=26.2
Q ss_pred CCccCCCCCCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHH
Q psy13793 238 CPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFH 282 (337)
Q Consensus 238 C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H 282 (337)
|..|.+.|.+..-|.+|++. |-|+|.+|.+.+.+--.|..|
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence 66677777777777766553 337777777666555555555
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.20 E-value=0.15 Score=28.70 Aligned_cols=22 Identities=27% Similarity=0.560 Sum_probs=13.8
Q ss_pred CccccccchhccCHHHHHHHHH
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLN 284 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~ 284 (337)
+|.|.+|+..|.....+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3556666666666666666654
No 62
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.50 E-value=0.084 Score=49.01 Aligned_cols=57 Identities=21% Similarity=0.348 Sum_probs=45.3
Q ss_pred CcceeehhhhcccceeceeeEEeeecccCCCceecCC--CCCcCCChhHHHhHHhhccC
Q psy13793 45 GSIVVKSLEHLKDLKVTKKKTVYDRRKNGPKPYQCAI--CPRGYYSDIGLQNHLWSHRK 101 (337)
Q Consensus 45 ~~~~~~~~~~~~~~~~~~~~~~h~~~h~~~~~~~C~~--C~~~F~~~~~L~~H~~~H~~ 101 (337)
......|..|...|........|.+.+++++++.|.. |...|.....+..|...+..
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (467)
T COG5048 30 APRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88 (467)
T ss_pred CCchhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhcccccc
Confidence 3345577778888888888888999999999999976 55778888888888877754
No 63
>KOG2482|consensus
Probab=90.27 E-value=0.49 Score=41.33 Aligned_cols=134 Identities=18% Similarity=0.300 Sum_probs=80.7
Q ss_pred Cccccccccccc-CCHHHHHHHHHH-cCC--CC-------------------ccccccccccccCHHHHHHHHHhcCCc-
Q psy13793 179 TRYQCPLCNKKI-NTKSNLRRHLQA-HKP--KG-------------------KHTCEICFRIFKTEINLTRHLECHDTV- 234 (337)
Q Consensus 179 ~~~~C~~C~~~f-~~~~~l~~H~~~-h~~--~~-------------------~~~C~~C~~~f~~~~~l~~H~~~h~~~- 234 (337)
....|-.|..-+ ..++.+..|+.. |.- .. .+.|-.|.+.|..+..|+.||+...-+
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrr 222 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRR 222 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCcc
Confidence 346798998654 456777778743 421 11 278999999999999999999764211
Q ss_pred --c--------cc--CCccCCCCCC-HHHHHHHHHHhC-------------CCCC--ccccccchhccCHHHHHHHHHh-
Q psy13793 235 --E--------LS--CPDCNRVYPT-NSTLRAHMISHS-------------TERP--HNCPHCEKTFKRNQDLKFHLNQ- 285 (337)
Q Consensus 235 --~--------~~--C~~C~~~f~~-~~~l~~H~~~h~-------------~~k~--~~C~~C~~~f~~~~~L~~H~~~- 285 (337)
| |. -..-|++... .+.+.+-..... +..+ .+|-.|.........|..||.+
T Consensus 223 inPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~v 302 (423)
T KOG2482|consen 223 INPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMKIV 302 (423)
T ss_pred cCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHHHH
Confidence 0 00 0111222211 111100000000 1122 4788899888889999999884
Q ss_pred ccC--------------------------CCCeecCCCccccCCchHHHHHHH
Q psy13793 286 HTG--------------------------ERPYKCPFCPRTFASSGNCFSHRK 312 (337)
Q Consensus 286 H~~--------------------------~~~~~C~~C~k~f~~~~~L~~H~~ 312 (337)
|.- ...-.|-.|.-.|-....|..|+.
T Consensus 303 He~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 303 HEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred HHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcc
Confidence 510 123568888899999999999973
No 64
>KOG4173|consensus
Probab=90.11 E-value=0.12 Score=41.57 Aligned_cols=79 Identities=28% Similarity=0.579 Sum_probs=57.3
Q ss_pred cccccc--ccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHH-HhC---------CCCCccc--cccchhc
Q psy13793 208 KHTCEI--CFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMI-SHS---------TERPHNC--PHCEKTF 273 (337)
Q Consensus 208 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~-~h~---------~~k~~~C--~~C~~~f 273 (337)
.|.|.+ |...|........|...-++ -.|..|.+.|.+...|.-|+. +|. |.-.|.| +.|+..|
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KF 156 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKF 156 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhh
Confidence 477876 66778777666677655443 368999999999988888875 342 4456888 4588889
Q ss_pred cCHHHHHHHHH-hccC
Q psy13793 274 KRNQDLKFHLN-QHTG 288 (337)
Q Consensus 274 ~~~~~L~~H~~-~H~~ 288 (337)
.+...-..|+- +|.-
T Consensus 157 kT~r~RkdH~I~~Hk~ 172 (253)
T KOG4173|consen 157 KTSRDRKDHMIRMHKY 172 (253)
T ss_pred hhhhhhhhHHHHhccC
Confidence 88888888874 6643
No 65
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=89.95 E-value=0.12 Score=28.87 Aligned_cols=9 Identities=44% Similarity=1.423 Sum_probs=4.5
Q ss_pred CCeecCCCc
Q psy13793 290 RPYKCPFCP 298 (337)
Q Consensus 290 ~~~~C~~C~ 298 (337)
.+..|+.||
T Consensus 16 ~~~~CP~Cg 24 (33)
T cd00350 16 APWVCPVCG 24 (33)
T ss_pred CCCcCcCCC
Confidence 344555554
No 66
>KOG2482|consensus
Probab=89.73 E-value=0.46 Score=41.49 Aligned_cols=106 Identities=20% Similarity=0.368 Sum_probs=68.1
Q ss_pred cccccccccccCCHHHHHHHHHH--cCCCCc--------ccc--ccccccccCH-HHHHHHHH-hc-------------C
Q psy13793 180 RYQCPLCNKKINTKSNLRRHLQA--HKPKGK--------HTC--EICFRIFKTE-INLTRHLE-CH-------------D 232 (337)
Q Consensus 180 ~~~C~~C~~~f~~~~~l~~H~~~--h~~~~~--------~~C--~~C~~~f~~~-~~l~~H~~-~h-------------~ 232 (337)
.+.|-+|.+.|.++..|+.||+. |..-.| |.- ..-|++..-. +.+.+-.. .+ .
T Consensus 195 r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~ 274 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDD 274 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCC
Confidence 48899999999999999999975 432222 111 1233333221 11110000 00 1
Q ss_pred Ccc--ccCCccCCCCCCHHHHHHHHHHhCC---------------------------CCCccccccchhccCHHHHHHHH
Q psy13793 233 TVE--LSCPDCNRVYPTNSTLRAHMISHST---------------------------ERPHNCPHCEKTFKRNQDLKFHL 283 (337)
Q Consensus 233 ~~~--~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------~k~~~C~~C~~~f~~~~~L~~H~ 283 (337)
..+ ..|-.|....-+...|..||..-+. .+.-.|-.|.-.|-....|..|+
T Consensus 275 a~a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm 354 (423)
T KOG2482|consen 275 AEALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHM 354 (423)
T ss_pred CCccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhc
Confidence 112 5788999888889999999976431 02346889999999999999999
Q ss_pred Hh
Q psy13793 284 NQ 285 (337)
Q Consensus 284 ~~ 285 (337)
..
T Consensus 355 ~e 356 (423)
T KOG2482|consen 355 VE 356 (423)
T ss_pred cc
Confidence 74
No 67
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.65 E-value=0.41 Score=35.03 Aligned_cols=33 Identities=30% Similarity=0.603 Sum_probs=23.4
Q ss_pred ccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHH
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRH 227 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 227 (337)
|.|+.|+...- +-|-.|..||-+..+...|.+.
T Consensus 2 Y~CPrC~skvC--------------~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 2 YFCPQCRAKVC--------------ELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred ccCCCCCCCcc--------------CCCCcCCcCCCEEeccchHHHh
Confidence 67888765542 3467888888888877777764
No 68
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.99 E-value=0.21 Score=31.03 Aligned_cols=28 Identities=32% Similarity=0.644 Sum_probs=14.4
Q ss_pred CCCCccccccchhccCHHHHHHHHHhcc
Q psy13793 260 TERPHNCPHCEKTFKRNQDLKFHLNQHT 287 (337)
Q Consensus 260 ~~k~~~C~~C~~~f~~~~~L~~H~~~H~ 287 (337)
|+.-+.|+.|+..|....+..+|...-+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 3444555555555555555555554433
No 69
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=88.88 E-value=0.32 Score=29.23 Aligned_cols=28 Identities=29% Similarity=0.565 Sum_probs=17.2
Q ss_pred CCCCeecCCCccccCCc----hHHHHHHHhhC
Q psy13793 288 GERPYKCPFCPRTFASS----GNCFSHRKRMH 315 (337)
Q Consensus 288 ~~~~~~C~~C~k~f~~~----~~L~~H~~~~H 315 (337)
+....+|.+|++.+... +.|.+|+++.|
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 45567788888877663 67888877766
No 70
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=88.82 E-value=0.39 Score=29.72 Aligned_cols=27 Identities=30% Similarity=0.605 Sum_probs=19.8
Q ss_pred CeecCCCccccCCc-----hHHHHHHHhhCCC
Q psy13793 291 PYKCPFCPRTFASS-----GNCFSHRKRMHEG 317 (337)
Q Consensus 291 ~~~C~~C~k~f~~~-----~~L~~H~~~~H~~ 317 (337)
.-.|.+|++.++.. ++|.+|++..|+.
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~~ 49 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHPA 49 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHCcC
Confidence 35688888877665 5888888767764
No 71
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=87.71 E-value=0.35 Score=35.44 Aligned_cols=30 Identities=30% Similarity=0.816 Sum_probs=24.2
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCc
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASS 304 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~ 304 (337)
..|+.||+.|.- .+..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYD-----------Lnk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYD-----------LNKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhcc-----------CCCCCccCCCCCCccCcc
Confidence 579999988876 345788999999988766
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.22 E-value=0.32 Score=30.23 Aligned_cols=30 Identities=27% Similarity=0.605 Sum_probs=21.0
Q ss_pred ccCCCCeecCCCccccCCchHHHHHHHhhC
Q psy13793 286 HTGERPYKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 286 H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
-.|+..+.|+-||..|....+..+|+-..|
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 356667777777777777777777766655
No 73
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=85.64 E-value=1.6 Score=32.08 Aligned_cols=24 Identities=29% Similarity=0.635 Sum_probs=15.3
Q ss_pred eec----CCCccccCCchHHHHHHHhhC
Q psy13793 292 YKC----PFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 292 ~~C----~~C~k~f~~~~~L~~H~~~~H 315 (337)
|.| ..|+..+.+...+.+|.+..|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 666 666666666666666665544
No 74
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=85.33 E-value=1.1 Score=32.82 Aligned_cols=92 Identities=17% Similarity=0.387 Sum_probs=59.9
Q ss_pred cccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHHHhCCCCCc------------cccccchhccCH
Q psy13793 209 HTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMISHSTERPH------------NCPHCEKTFKRN 276 (337)
Q Consensus 209 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~------------~C~~C~~~f~~~ 276 (337)
|.|..|+...- ..|..|++||-.......|.+-.----..++| .|-.|...|...
T Consensus 2 Y~CPrC~skvC-------------~LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~ 68 (112)
T TIGR00622 2 YFCPQCRAKVC-------------ELPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKP 68 (112)
T ss_pred ccCCCCCCCcc-------------CCCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCc
Confidence 66777764332 35678999999988887777632111112222 389999988764
Q ss_pred HHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhhCC
Q psy13793 277 QDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 277 ~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 316 (337)
.... ...-.....|+|+.|...|-..-++..| ...|.
T Consensus 69 ~~~~--~~~~~~~~~y~C~~C~~~FC~dCD~fiH-e~Lh~ 105 (112)
T TIGR00622 69 PVSP--FDELKDSHRYVCAVCKNVFCVDCDVFVH-ESLHC 105 (112)
T ss_pred cccc--ccccccccceeCCCCCCccccccchhhh-hhccC
Confidence 3111 0001234579999999999998888889 57774
No 75
>KOG2893|consensus
Probab=85.12 E-value=0.3 Score=40.31 Aligned_cols=50 Identities=24% Similarity=0.541 Sum_probs=40.2
Q ss_pred CCccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCchHHHHHHHhhCC
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 316 (337)
||| |=.|++.|.....|+.|++. +-|+|.+|.|..-+--.|..|-..+|.
T Consensus 10 kpw-cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 10 KPW-CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred Cce-eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhh
Confidence 454 88899999999999999874 469999999877666677777666763
No 76
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=84.60 E-value=1.6 Score=32.04 Aligned_cols=51 Identities=22% Similarity=0.504 Sum_probs=40.4
Q ss_pred cccCCccCCCCCCHHHHHHHHHH-hCCC-----------------------------------------CCccc----cc
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMIS-HSTE-----------------------------------------RPHNC----PH 268 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~-----------------------------------------k~~~C----~~ 268 (337)
...|..|+.+..- +.+..|++. |... .-|.| ..
T Consensus 11 vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~~~ 89 (109)
T PF12013_consen 11 VLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDPPH 89 (109)
T ss_pred EEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCCCC
Confidence 3579999988776 889999884 4311 12889 99
Q ss_pred cchhccCHHHHHHHHHhc
Q psy13793 269 CEKTFKRNQDLKFHLNQH 286 (337)
Q Consensus 269 C~~~f~~~~~L~~H~~~H 286 (337)
|++.+.+...+.+|++.+
T Consensus 90 C~y~~~~~~~m~~H~~~~ 107 (109)
T PF12013_consen 90 CGYITRSKKTMRKHWRKE 107 (109)
T ss_pred CCcEeccHHHHHHHHHHh
Confidence 999999999999999865
No 77
>KOG4173|consensus
Probab=82.53 E-value=0.87 Score=36.84 Aligned_cols=78 Identities=22% Similarity=0.497 Sum_probs=57.2
Q ss_pred Ccccccc--cccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcC----------C-ccccC--CccCC
Q psy13793 179 TRYQCPL--CNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHD----------T-VELSC--PDCNR 243 (337)
Q Consensus 179 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~----------~-~~~~C--~~C~~ 243 (337)
..+.|+. |...|.....+..|...-++. .|..|.+.|.+...|..|+..-+ + .-|.| ..|+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 3478876 778888888888887554443 79999999999888888875432 1 23677 56888
Q ss_pred CCCCHHHHHHHHH-HhC
Q psy13793 244 VYPTNSTLRAHMI-SHS 259 (337)
Q Consensus 244 ~f~~~~~l~~H~~-~h~ 259 (337)
.|.....-..|+. .|.
T Consensus 155 KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhHHHHhcc
Confidence 8888888888874 454
No 78
>KOG2186|consensus
Probab=81.55 E-value=0.91 Score=38.03 Aligned_cols=50 Identities=16% Similarity=0.602 Sum_probs=38.7
Q ss_pred ccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCC
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDT 233 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 233 (337)
|.|..||..... ..+.+|+...++ ..|.|-.||.+|.. .....|...-++
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITE 53 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITE 53 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcch
Confidence 789999988664 456669988777 67899999999987 667778765544
No 79
>PF12907 zf-met2: Zinc-binding
Probab=81.00 E-value=0.37 Score=28.17 Aligned_cols=29 Identities=24% Similarity=0.463 Sum_probs=18.3
Q ss_pred eecCCCccccCC---chHHHHHHHhhCCCCCC
Q psy13793 292 YKCPFCPRTFAS---SGNCFSHRKRMHEGETL 320 (337)
Q Consensus 292 ~~C~~C~k~f~~---~~~L~~H~~~~H~~~~~ 320 (337)
++|.+|-..|.. ...|..|....|+...+
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~ 33 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTF 33 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCH
Confidence 567777755543 35577777777776544
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=80.92 E-value=0.65 Score=26.13 Aligned_cols=9 Identities=56% Similarity=1.387 Sum_probs=4.8
Q ss_pred CeecCCCcc
Q psy13793 291 PYKCPFCPR 299 (337)
Q Consensus 291 ~~~C~~C~k 299 (337)
|-.|++||-
T Consensus 18 p~~CP~Cg~ 26 (34)
T cd00729 18 PEKCPICGA 26 (34)
T ss_pred CCcCcCCCC
Confidence 445555553
No 81
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=78.21 E-value=2.1 Score=33.90 Aligned_cols=32 Identities=22% Similarity=0.493 Sum_probs=17.9
Q ss_pred CCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 260 TERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 260 ~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
...-|.|+.|+..|+....+. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 334466666666666555553 24666666644
No 82
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=78.02 E-value=1.3 Score=25.50 Aligned_cols=34 Identities=18% Similarity=0.526 Sum_probs=16.7
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccccC
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFA 302 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~ 302 (337)
+.|+.|+..|.-..... ..+.....|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~-----~~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQL-----GANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHc-----CCCCCEEECCCCCCEEE
Confidence 35666666655443321 11122466666666553
No 83
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=77.88 E-value=1.2 Score=26.74 Aligned_cols=26 Identities=15% Similarity=0.396 Sum_probs=16.2
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
|.|..||..|.-. ...+.+|+.||..
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCce
Confidence 6677777666532 2356777777743
No 84
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=77.58 E-value=1.5 Score=32.76 Aligned_cols=30 Identities=23% Similarity=0.540 Sum_probs=22.7
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCc
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASS 304 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~ 304 (337)
..|+.||+.|.. .+..|..|++||..|.-.
T Consensus 10 r~Cp~cg~kFYD-----------Lnk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYD-----------LNRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCccccc-----------cCCCCccCCCcCCccCcc
Confidence 578888888765 335788899998887655
No 85
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=77.34 E-value=1.6 Score=43.23 Aligned_cols=28 Identities=25% Similarity=0.568 Sum_probs=18.8
Q ss_pred hCCCCCccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 258 HSTERPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 258 h~~~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
|...+...|..||+ ....|..|+.||-.
T Consensus 457 H~~~~~L~CH~Cg~---------------~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 457 HKATGQLRCHYCGY---------------QEPIPQSCPECGSE 484 (730)
T ss_pred ecCCCeeEeCCCCC---------------CCCCCCCCCCCCCC
Confidence 44445677888873 44568888888854
No 86
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=77.20 E-value=1.2 Score=27.30 Aligned_cols=28 Identities=21% Similarity=0.750 Sum_probs=17.9
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
-|.|..||+.|... .....++|++||..
T Consensus 6 ~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 6 EYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 47788888777211 23456788888853
No 87
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=77.16 E-value=0.46 Score=39.69 Aligned_cols=43 Identities=26% Similarity=0.579 Sum_probs=25.6
Q ss_pred CCccccccchhccCHHHHHHHHHh---c-------cCCCC-----eecCCCccccCCc
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQ---H-------TGERP-----YKCPFCPRTFASS 304 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~---H-------~~~~~-----~~C~~C~k~f~~~ 304 (337)
|.+.|++|+..|.+..-+....++ . .+..| ..|+.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 456677777777665444333332 1 12233 6899999887765
No 88
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=77.07 E-value=1.6 Score=25.08 Aligned_cols=32 Identities=34% Similarity=0.905 Sum_probs=14.4
Q ss_pred cccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 265 NCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
.|+.|+..|.-..+- +-.+.+..+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHH-----cccCCcEEECCCCCcEe
Confidence 355555555443321 11233445555555554
No 89
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=76.16 E-value=1.8 Score=24.68 Aligned_cols=32 Identities=22% Similarity=0.709 Sum_probs=17.1
Q ss_pred cccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 265 NCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
.|+.|+..|.-..... -...+..+|+.|+..|
T Consensus 4 ~Cp~C~~~y~i~d~~i-----p~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKI-----PPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHC-----CCCCcEEECCCCCCEe
Confidence 4666666665544421 1233456666666655
No 90
>KOG2186|consensus
Probab=75.41 E-value=2 Score=36.09 Aligned_cols=46 Identities=22% Similarity=0.543 Sum_probs=34.8
Q ss_pred cccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHH
Q psy13793 209 HTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMI 256 (337)
Q Consensus 209 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~ 256 (337)
|.|..||.+..- ..+.+|+...++..|.|-.|++.|.. .+...|..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccCCeeEEeeccccccc-chhhhhhh
Confidence 678888887754 45677888877777888888888887 56666654
No 91
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=75.26 E-value=2 Score=27.29 Aligned_cols=9 Identities=44% Similarity=1.457 Sum_probs=4.4
Q ss_pred ccCCccCCC
Q psy13793 236 LSCPDCNRV 244 (337)
Q Consensus 236 ~~C~~C~~~ 244 (337)
|.|+.||+.
T Consensus 28 F~CPnCGe~ 36 (61)
T COG2888 28 FPCPNCGEV 36 (61)
T ss_pred eeCCCCCce
Confidence 455555543
No 92
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=73.44 E-value=8 Score=34.23 Aligned_cols=77 Identities=14% Similarity=0.256 Sum_probs=44.3
Q ss_pred HHhCCCCCccccccc-hhccCHHHHHHHHHhccCCCCeec---CCCccccCCc------hHHHHHHHhhCCCCCCCCCCc
Q psy13793 256 ISHSTERPHNCPHCE-KTFKRNQDLKFHLNQHTGERPYKC---PFCPRTFASS------GNCFSHRKRMHEGETLPDISL 325 (337)
Q Consensus 256 ~~h~~~k~~~C~~C~-~~f~~~~~L~~H~~~H~~~~~~~C---~~C~k~f~~~------~~L~~H~~~~H~~~~~~~~~~ 325 (337)
+.|.-.+-|.|.+|| +++.-...+-+|..--....-..| ..| ..|... -.|+.-++..-+.-++|..+.
T Consensus 367 klhgLd~ef~CEICgNyvy~GR~~FdrHF~E~rHiygl~clGi~ps-~vfkgIT~I~ea~~lw~~m~~~ss~~kv~~e~~ 445 (470)
T COG5188 367 KLHGLDIEFECEICGNYVYYGRDRFDRHFEEDRHIYGLECLGIKPS-RVFKGITRIGEAMKLWNRMEESSSSLKVPTEYS 445 (470)
T ss_pred HhcCCCcceeeeecccccccchHHHHhhhhhhhhhhheeeccccch-HHHhhhhhHHHHHHHHHHhhhhhhhcccchhhh
Confidence 456677889999999 888888888888652111112223 112 233332 234555433334556677666
Q ss_pred ccCCccch
Q psy13793 326 DEMSDNET 333 (337)
Q Consensus 326 ~~~~~~~~ 333 (337)
.+.+++|.
T Consensus 446 ~E~EDeEG 453 (470)
T COG5188 446 EEFEDEEG 453 (470)
T ss_pred hhhhcccc
Confidence 66666554
No 93
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=72.70 E-value=3 Score=33.67 Aligned_cols=31 Identities=23% Similarity=0.702 Sum_probs=16.6
Q ss_pred CCccccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
.-|.|+.|+..|+....+. ..|.|+.||-..
T Consensus 116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 3455666666555555442 246666666443
No 94
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=72.48 E-value=3.5 Score=32.18 Aligned_cols=36 Identities=19% Similarity=0.607 Sum_probs=16.2
Q ss_pred CCccccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
.-|.|+.|+..|.....+.. . . ....|.|+.||...
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~-~--d-~~~~f~Cp~Cg~~l 133 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQL-L--D-MDGTFTCPRCGEEL 133 (147)
T ss_pred cEEECcCCCCEeeHHHHHHh-c--C-CCCcEECCCCCCEE
Confidence 34556666655554332221 0 0 12226666666543
No 95
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=71.48 E-value=2.1 Score=33.98 Aligned_cols=23 Identities=43% Similarity=1.008 Sum_probs=16.7
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCc
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCP 298 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~ 298 (337)
-|.|.+||+ +|.|+.|-+||+||
T Consensus 134 ~~vC~vCGy-------------~~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGY-------------THEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCC-------------cccCCCCCcCCCCC
Confidence 588888874 34557788888887
No 96
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=70.51 E-value=2.6 Score=33.46 Aligned_cols=24 Identities=38% Similarity=0.886 Sum_probs=17.6
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCCCCCccccccch
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEK 271 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~ 271 (337)
.|.|++||.. +.|+-|-+|++||.
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCCC
Confidence 5788888765 34567888888883
No 97
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=68.61 E-value=4 Score=21.31 Aligned_cols=20 Identities=30% Similarity=0.539 Sum_probs=14.2
Q ss_pred eecCCCccccCCchHHHHHHH
Q psy13793 292 YKCPFCPRTFASSGNCFSHRK 312 (337)
Q Consensus 292 ~~C~~C~k~f~~~~~L~~H~~ 312 (337)
..|++|++.+ ....++.|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3688888887 4456777763
No 98
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=67.52 E-value=3.8 Score=22.65 Aligned_cols=26 Identities=23% Similarity=0.604 Sum_probs=14.0
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
|.|..|+..+.- ....+.+|+.||..
T Consensus 1 Y~C~~Cg~~~~~-----------~~~~~irC~~CG~R 26 (32)
T PF03604_consen 1 YICGECGAEVEL-----------KPGDPIRCPECGHR 26 (32)
T ss_dssp EBESSSSSSE-B-----------STSSTSSBSSSS-S
T ss_pred CCCCcCCCeeEc-----------CCCCcEECCcCCCe
Confidence 566677765541 12345677777754
No 99
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=67.31 E-value=3.5 Score=24.91 Aligned_cols=30 Identities=17% Similarity=0.610 Sum_probs=18.8
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCccccC
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFA 302 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~ 302 (337)
.|.|+.||..|..... ...+.|+.||..+.
T Consensus 3 ~y~C~~CG~~~~~~~~----------~~~~~Cp~CG~~~~ 32 (46)
T PRK00398 3 EYKCARCGREVELDEY----------GTGVRCPYCGYRIL 32 (46)
T ss_pred EEECCCCCCEEEECCC----------CCceECCCCCCeEE
Confidence 4778888876654211 11678888886554
No 100
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=67.04 E-value=3.5 Score=30.25 Aligned_cols=29 Identities=10% Similarity=0.204 Sum_probs=18.6
Q ss_pred ehhhhcccceeceeeEEeeecccCCCceecCCCCCcCCCh
Q psy13793 50 KSLEHLKDLKVTKKKTVYDRRKNGPKPYQCAICPRGYYSD 89 (337)
Q Consensus 50 ~~~~~~~~~~~~~~~~~h~~~h~~~~~~~C~~C~~~F~~~ 89 (337)
.|..|++.|.-- +..|-.|+.||..|.-.
T Consensus 11 ~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 11 TCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 566777777432 33577788887777655
No 101
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=67.03 E-value=1.1 Score=30.02 Aligned_cols=41 Identities=24% Similarity=0.472 Sum_probs=25.4
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecC--CCccccCCchH
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCP--FCPRTFASSGN 306 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~--~C~k~f~~~~~ 306 (337)
+.|+.||....-..+-..... ..++-++|. .||.+|+....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEE
Confidence 578888865533222222211 446778998 89999987654
No 102
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=66.99 E-value=2.7 Score=22.86 Aligned_cols=25 Identities=24% Similarity=0.711 Sum_probs=10.1
Q ss_pred cccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 265 NCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
+|+.|+..+.. .+...|.|+.|+..
T Consensus 4 ~Cp~C~se~~y-----------~D~~~~vCp~C~~e 28 (30)
T PF08274_consen 4 KCPLCGSEYTY-----------EDGELLVCPECGHE 28 (30)
T ss_dssp --TTT-----E-----------E-SSSEEETTTTEE
T ss_pred CCCCCCCccee-----------ccCCEEeCCccccc
Confidence 46666655544 34456667766643
No 103
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=66.89 E-value=4.1 Score=25.97 Aligned_cols=11 Identities=45% Similarity=1.256 Sum_probs=7.2
Q ss_pred ccccCCccCCC
Q psy13793 234 VELSCPDCNRV 244 (337)
Q Consensus 234 ~~~~C~~C~~~ 244 (337)
..|.|+.||..
T Consensus 24 ~~F~CPnCG~~ 34 (59)
T PRK14890 24 VKFLCPNCGEV 34 (59)
T ss_pred CEeeCCCCCCe
Confidence 44677777765
No 104
>KOG2807|consensus
Probab=66.85 E-value=6.2 Score=34.57 Aligned_cols=34 Identities=29% Similarity=0.595 Sum_probs=22.3
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHH
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTR 226 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 226 (337)
..|.|+.|..... .-|-.|..|+-+......|.+
T Consensus 275 ~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 275 GGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLAR 308 (378)
T ss_pred CceeCCcccCeee--------------cCCccCCccceeEecchHHHH
Confidence 4477777765432 236677788777777766665
No 105
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=66.25 E-value=1.3 Score=25.82 Aligned_cols=10 Identities=30% Similarity=0.996 Sum_probs=4.5
Q ss_pred cccccccccc
Q psy13793 181 YQCPLCNKKI 190 (337)
Q Consensus 181 ~~C~~C~~~f 190 (337)
|+|..|+..|
T Consensus 6 y~C~~Cg~~f 15 (41)
T smart00834 6 YRCEDCGHTF 15 (41)
T ss_pred EEcCCCCCEE
Confidence 4444444444
No 106
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=65.44 E-value=4.5 Score=33.65 Aligned_cols=29 Identities=24% Similarity=0.542 Sum_probs=20.3
Q ss_pred CCCCeecCCCccccCCchHHHHHHHhhCC
Q psy13793 288 GERPYKCPFCPRTFASSGNCFSHRKRMHE 316 (337)
Q Consensus 288 ~~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 316 (337)
++..|.|+.|+|.|.-..-..+|+...|+
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 34458888888888888888888888886
No 107
>PRK04023 DNA polymerase II large subunit; Validated
Probab=65.32 E-value=5.8 Score=40.36 Aligned_cols=11 Identities=27% Similarity=0.495 Sum_probs=7.0
Q ss_pred CeecCCCcccc
Q psy13793 291 PYKCPFCPRTF 301 (337)
Q Consensus 291 ~~~C~~C~k~f 301 (337)
++.|+.||..-
T Consensus 663 ~y~CPKCG~El 673 (1121)
T PRK04023 663 EDECEKCGREP 673 (1121)
T ss_pred CCcCCCCCCCC
Confidence 46677777543
No 108
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=64.42 E-value=2.6 Score=35.58 Aligned_cols=25 Identities=28% Similarity=0.700 Sum_probs=12.0
Q ss_pred CCccccccchhccCHHHHHHHHHhc
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQH 286 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H 286 (337)
++++|+.|+........|..-.|+|
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCcccccccceeeeecc
Confidence 4455555555444444444444443
No 109
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=64.31 E-value=1.6 Score=27.15 Aligned_cols=11 Identities=27% Similarity=1.114 Sum_probs=5.4
Q ss_pred ccccccccccC
Q psy13793 181 YQCPLCNKKIN 191 (337)
Q Consensus 181 ~~C~~C~~~f~ 191 (337)
|+|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44555555443
No 110
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=64.05 E-value=4.2 Score=30.43 Aligned_cols=30 Identities=17% Similarity=0.161 Sum_probs=19.2
Q ss_pred ccccccccccCCHHHHHHHHHHcCCCCccccccccccccCH
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTE 221 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 221 (337)
..|+.|+++|... +..|..|..||..|...
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 4577777777432 33566777777776554
No 111
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=62.97 E-value=1.2 Score=35.04 Aligned_cols=13 Identities=23% Similarity=0.641 Sum_probs=7.3
Q ss_pred eecCCCccccCCc
Q psy13793 292 YKCPFCPRTFASS 304 (337)
Q Consensus 292 ~~C~~C~k~f~~~ 304 (337)
++|+.||++|...
T Consensus 29 ~~c~~c~~~f~~~ 41 (154)
T PRK00464 29 RECLACGKRFTTF 41 (154)
T ss_pred eeccccCCcceEe
Confidence 5555666555543
No 112
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=61.88 E-value=9.4 Score=30.19 Aligned_cols=16 Identities=13% Similarity=0.021 Sum_probs=7.2
Q ss_pred ccccccccccccCHHH
Q psy13793 208 KHTCEICFRIFKTEIN 223 (337)
Q Consensus 208 ~~~C~~C~~~f~~~~~ 223 (337)
-|.|+.|+..|+...+
T Consensus 109 ~Y~Cp~c~~r~tf~eA 124 (158)
T TIGR00373 109 FFICPNMCVRFTFNEA 124 (158)
T ss_pred eEECCCCCcEeeHHHH
Confidence 3444444444444333
No 113
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=61.34 E-value=1.2 Score=37.19 Aligned_cols=21 Identities=29% Similarity=0.665 Sum_probs=14.6
Q ss_pred CcccccccccccCCHHHHHHH
Q psy13793 179 TRYQCPLCNKKINTKSNLRRH 199 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H 199 (337)
+.+.||+|+..|.........
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~ 24 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGK 24 (214)
T ss_pred CceECCCCCCeeeeeEEEcCC
Confidence 457799999888876554433
No 114
>KOG1280|consensus
Probab=61.18 E-value=12 Score=33.24 Aligned_cols=36 Identities=28% Similarity=0.640 Sum_probs=21.4
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCCCCC--ccccccc
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHSTERP--HNCPHCE 270 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~--~~C~~C~ 270 (337)
.|.|+.|+.+-.+...|..|....+.+-+ ..|++|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 35677777766666777777655444433 2355554
No 115
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=59.39 E-value=4.7 Score=30.70 Aligned_cols=24 Identities=46% Similarity=0.725 Sum_probs=13.8
Q ss_pred ccccccchhccCHHHHHHHHHhccCCC
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGER 290 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~ 290 (337)
..|-+||+.|.. |.+|++.|+|-.
T Consensus 73 i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eEEccCCcccch---HHHHHHHccCCC
Confidence 567788887776 477888877654
No 116
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=58.47 E-value=29 Score=27.19 Aligned_cols=60 Identities=13% Similarity=0.287 Sum_probs=36.5
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCC--CccccCCchHHHHHHHhhCCCCCCCCCC
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPF--CPRTFASSGNCFSHRKRMHEGETLPDIS 324 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~k~f~~~~~L~~H~~~~H~~~~~~~~~ 324 (337)
...|+.|.-.-.-.. ...-.|.+-+.|+-.|.. |... .+...|.+|.+..|+..+|..+-
T Consensus 80 ~L~CPLCRG~V~GWt-vve~AR~~LN~K~RsC~~e~C~F~-GtY~eLrKHar~~HP~~rP~~vD 141 (162)
T PF07800_consen 80 ELACPLCRGEVKGWT-VVEPARRFLNAKKRSCSQESCSFS-GTYSELRKHARSEHPSARPSEVD 141 (162)
T ss_pred cccCccccCceeceE-EchHHHHHhccCCccCcccccccc-cCHHHHHHHHHhhCCCCCCccCC
Confidence 367888864322211 112345555566655544 6654 34567999999999988887653
No 117
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=57.07 E-value=6.5 Score=34.13 Aligned_cols=92 Identities=22% Similarity=0.424 Sum_probs=47.2
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhc------CC-------ccccCCccCCCC
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECH------DT-------VELSCPDCNRVY 245 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h------~~-------~~~~C~~C~~~f 245 (337)
..|.|+.|..... .-|..|+.|.-.......|.+-..-- .+ +.-.|-.|.-.|
T Consensus 307 gGy~CP~CktkVC--------------sLPi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~f 372 (421)
T COG5151 307 GGYECPVCKTKVC--------------SLPISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPF 372 (421)
T ss_pred CceeCCcccceee--------------cCCccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCC
Confidence 4478888854332 23667888876665555554422110 01 112366666655
Q ss_pred CCHHHHHHHHHHhCCCCCccccccchhccCHHHHHHHHHhc
Q psy13793 246 PTNSTLRAHMISHSTERPHNCPHCEKTFKRNQDLKFHLNQH 286 (337)
Q Consensus 246 ~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~H 286 (337)
.-+..-.. ..-+....|.|+.|...|-..-+.-.|...|
T Consensus 373 p~~~~~~~--~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 373 PKPPVSPF--DESTSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred CCCCCCcc--cccccccceechhhhhhhhhhhHHHHHHHHh
Confidence 54321000 0011123477777777777766666666555
No 118
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=55.77 E-value=13 Score=28.99 Aligned_cols=12 Identities=17% Similarity=0.440 Sum_probs=5.6
Q ss_pred cccccccccccc
Q psy13793 208 KHTCEICFRIFK 219 (337)
Q Consensus 208 ~~~C~~C~~~f~ 219 (337)
-|.|+.|+..|.
T Consensus 99 ~Y~Cp~C~~~y~ 110 (147)
T smart00531 99 YYKCPNCQSKYT 110 (147)
T ss_pred EEECcCCCCEee
Confidence 344444444444
No 119
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=55.76 E-value=7.1 Score=29.47 Aligned_cols=18 Identities=17% Similarity=0.560 Sum_probs=10.3
Q ss_pred CCCCcccccccccccCCH
Q psy13793 176 SSGTRYQCPLCNKKINTK 193 (337)
Q Consensus 176 ~~~~~~~C~~C~~~f~~~ 193 (337)
...+-|+|++|..+....
T Consensus 76 ~d~~lYeCnIC~etS~ee 93 (140)
T PF05290_consen 76 LDPKLYECNICKETSAEE 93 (140)
T ss_pred cCCCceeccCcccccchh
Confidence 344557777776655443
No 120
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=55.63 E-value=13 Score=30.07 Aligned_cols=15 Identities=20% Similarity=0.314 Sum_probs=7.2
Q ss_pred ccccccccccccCHH
Q psy13793 208 KHTCEICFRIFKTEI 222 (337)
Q Consensus 208 ~~~C~~C~~~f~~~~ 222 (337)
-|.|+.|+..|+...
T Consensus 117 ~Y~Cp~C~~rytf~e 131 (178)
T PRK06266 117 FFFCPNCHIRFTFDE 131 (178)
T ss_pred EEECCCCCcEEeHHH
Confidence 355555555444443
No 121
>KOG2071|consensus
Probab=53.47 E-value=10 Score=36.24 Aligned_cols=26 Identities=19% Similarity=0.368 Sum_probs=21.8
Q ss_pred ccccCCccCCCCCCHHHHHHHHHHhC
Q psy13793 234 VELSCPDCNRVYPTNSTLRAHMISHS 259 (337)
Q Consensus 234 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 259 (337)
.+..|..||..|.+......||..|.
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CcchhcccccccccchhhhhHhhhhh
Confidence 45689999999999988888887775
No 122
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.40 E-value=7.7 Score=28.04 Aligned_cols=28 Identities=18% Similarity=0.405 Sum_probs=19.7
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCCeecCCCccccC
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFA 302 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~ 302 (337)
-.|+.|++.|.. .+..|..|++||+.|.
T Consensus 10 ridPetg~KFYD-----------LNrdPiVsPytG~s~P 37 (129)
T COG4530 10 RIDPETGKKFYD-----------LNRDPIVSPYTGKSYP 37 (129)
T ss_pred ccCccccchhhc-----------cCCCccccCcccccch
Confidence 357777777754 2356888888888884
No 123
>PF15269 zf-C2H2_7: Zinc-finger
Probab=52.65 E-value=10 Score=22.51 Aligned_cols=22 Identities=32% Similarity=0.540 Sum_probs=19.7
Q ss_pred eecCCCCCcCCChhHHHhHHhh
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
|+|-+|.......+.|-.||+.
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7899999999999999999865
No 124
>KOG2785|consensus
Probab=52.18 E-value=19 Score=32.36 Aligned_cols=26 Identities=27% Similarity=0.450 Sum_probs=22.9
Q ss_pred CCCceecCCCCCcCCChhHHHhHHhh
Q psy13793 73 GPKPYQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 73 ~~~~~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
++-++.|.+|.+.|.+..+...|+..
T Consensus 65 ~~~~~~c~~c~k~~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 65 AESVVYCEACNKSFASPKAHENHLKS 90 (390)
T ss_pred cccceehHHhhccccChhhHHHHHHH
Confidence 45689999999999999999999864
No 125
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=52.01 E-value=16 Score=21.56 Aligned_cols=11 Identities=27% Similarity=0.754 Sum_probs=6.0
Q ss_pred CCceecCCCCC
Q psy13793 74 PKPYQCAICPR 84 (337)
Q Consensus 74 ~~~~~C~~C~~ 84 (337)
+.+-.|+.||.
T Consensus 24 ~~~~~CP~Cg~ 34 (42)
T PF09723_consen 24 DDPVPCPECGS 34 (42)
T ss_pred CCCCcCCCCCC
Confidence 34555666654
No 126
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=51.26 E-value=7.7 Score=38.64 Aligned_cols=14 Identities=29% Similarity=0.883 Sum_probs=9.5
Q ss_pred CCCCCccccccchh
Q psy13793 259 STERPHNCPHCEKT 272 (337)
Q Consensus 259 ~~~k~~~C~~C~~~ 272 (337)
....|..|+.||..
T Consensus 471 ~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 471 QEPIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCCCC
Confidence 34567788888754
No 127
>PHA00626 hypothetical protein
Probab=50.87 E-value=8 Score=24.29 Aligned_cols=16 Identities=19% Similarity=0.314 Sum_probs=12.9
Q ss_pred CCCeecCCCccccCCc
Q psy13793 289 ERPYKCPFCPRTFASS 304 (337)
Q Consensus 289 ~~~~~C~~C~k~f~~~ 304 (337)
...|.|+.||..|+..
T Consensus 21 snrYkCkdCGY~ft~~ 36 (59)
T PHA00626 21 SDDYVCCDCGYNDSKD 36 (59)
T ss_pred CcceEcCCCCCeechh
Confidence 3579999999999754
No 128
>KOG1842|consensus
Probab=50.80 E-value=7.5 Score=35.62 Aligned_cols=21 Identities=29% Similarity=0.501 Sum_probs=11.4
Q ss_pred ccccccchhccCHHHHHHHHH
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLN 284 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~ 284 (337)
|.|++|...|.....|..|..
T Consensus 16 flCPiC~~dl~~~~~L~~H~d 36 (505)
T KOG1842|consen 16 FLCPICLLDLPNLSALNDHLD 36 (505)
T ss_pred ccCchHhhhhhhHHHHHHHHh
Confidence 455555555555555555554
No 129
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=50.63 E-value=9 Score=20.03 Aligned_cols=11 Identities=18% Similarity=0.471 Sum_probs=8.9
Q ss_pred eecCCCCCcCC
Q psy13793 77 YQCAICPRGYY 87 (337)
Q Consensus 77 ~~C~~C~~~F~ 87 (337)
-.|+.||..|.
T Consensus 15 ~~Cp~CG~~F~ 25 (26)
T PF10571_consen 15 KFCPHCGYDFE 25 (26)
T ss_pred CcCCCCCCCCc
Confidence 45999999885
No 130
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=49.89 E-value=4.3 Score=25.88 Aligned_cols=41 Identities=32% Similarity=0.852 Sum_probs=19.8
Q ss_pred CCccccc--cchhccCHHHHHHHHHhccCCCCeecCC----CccccCC
Q psy13793 262 RPHNCPH--CEKTFKRNQDLKFHLNQHTGERPYKCPF----CPRTFAS 303 (337)
Q Consensus 262 k~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~----C~k~f~~ 303 (337)
.+..|+. |...+. ...|..|...-=..++..|++ |+..+..
T Consensus 8 ~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp SEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEEH
T ss_pred CEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccch
Confidence 3455655 333333 345666766444456677777 7766643
No 131
>KOG3408|consensus
Probab=49.24 E-value=4.2 Score=30.04 Aligned_cols=26 Identities=23% Similarity=0.466 Sum_probs=21.0
Q ss_pred CCCCccccccchhccCHHHHHHHHHh
Q psy13793 260 TERPHNCPHCEKTFKRNQDLKFHLNQ 285 (337)
Q Consensus 260 ~~k~~~C~~C~~~f~~~~~L~~H~~~ 285 (337)
|...|.|-.|.+-|.....|..|.++
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 44558888899999988888888774
No 132
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=47.52 E-value=3.9 Score=23.29 Aligned_cols=29 Identities=24% Similarity=0.584 Sum_probs=14.1
Q ss_pred cccccccccCCHHHHHHHHHHcCCCCccccccccccc
Q psy13793 182 QCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIF 218 (337)
Q Consensus 182 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 218 (337)
.|+.|+..|.. ....+...-.|+.||..+
T Consensus 3 ~C~~Cg~~Yh~--------~~~pP~~~~~Cd~cg~~L 31 (36)
T PF05191_consen 3 ICPKCGRIYHI--------EFNPPKVEGVCDNCGGEL 31 (36)
T ss_dssp EETTTTEEEET--------TTB--SSTTBCTTTTEBE
T ss_pred CcCCCCCcccc--------ccCCCCCCCccCCCCCee
Confidence 46666666632 222233445566666544
No 133
>KOG3408|consensus
Probab=46.99 E-value=11 Score=27.85 Aligned_cols=28 Identities=21% Similarity=0.472 Sum_probs=24.4
Q ss_pred ccCCCceecCCCCCcCCChhHHHhHHhh
Q psy13793 71 KNGPKPYQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 71 h~~~~~~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
-.|...|.|-.|.+-|.+...|..|.++
T Consensus 52 lPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 52 LPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 3566679999999999999999999864
No 134
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=46.25 E-value=17 Score=21.32 Aligned_cols=25 Identities=20% Similarity=0.219 Sum_probs=18.1
Q ss_pred ceecCCCCCcCCC--hhHHHhHHhhcc
Q psy13793 76 PYQCAICPRGYYS--DIGLQNHLWSHR 100 (337)
Q Consensus 76 ~~~C~~C~~~F~~--~~~L~~H~~~H~ 100 (337)
.-+|+.||..|.. ...-..|.+.|.
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 3589999998874 456667877663
No 135
>PRK04023 DNA polymerase II large subunit; Validated
Probab=45.95 E-value=17 Score=37.26 Aligned_cols=48 Identities=25% Similarity=0.623 Sum_probs=29.0
Q ss_pred cccccccccccCCHHHHHHHHHHcCCCCccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHHHHhC
Q psy13793 180 RYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHMISHS 259 (337)
Q Consensus 180 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 259 (337)
...|+.||... ..+.|..||.. +...+.|+.|+...
T Consensus 626 ~RfCpsCG~~t----------------~~frCP~CG~~--------------Te~i~fCP~CG~~~-------------- 661 (1121)
T PRK04023 626 RRKCPSCGKET----------------FYRRCPFCGTH--------------TEPVYRCPRCGIEV-------------- 661 (1121)
T ss_pred CccCCCCCCcC----------------CcccCCCCCCC--------------CCcceeCccccCcC--------------
Confidence 45688888763 23578778765 23345677774332
Q ss_pred CCCCccccccchhc
Q psy13793 260 TERPHNCPHCEKTF 273 (337)
Q Consensus 260 ~~k~~~C~~C~~~f 273 (337)
.++.|+.|+...
T Consensus 662 --~~y~CPKCG~El 673 (1121)
T PRK04023 662 --EEDECEKCGREP 673 (1121)
T ss_pred --CCCcCCCCCCCC
Confidence 236788887643
No 136
>KOG2593|consensus
Probab=45.71 E-value=15 Score=33.64 Aligned_cols=37 Identities=30% Similarity=0.561 Sum_probs=21.8
Q ss_pred CCcccccccccccCCHHHHHHHHHHcCCCCcccccccccc
Q psy13793 178 GTRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRI 217 (337)
Q Consensus 178 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 217 (337)
...|.|+.|.+.|.....++ ..-.....|.|..|+-.
T Consensus 126 ~~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGGE 162 (436)
T ss_pred cccccCCccccchhhhHHHH---hhcccCceEEEecCCCc
Confidence 45577777777776555543 22222345777777653
No 137
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=45.50 E-value=15 Score=34.46 Aligned_cols=33 Identities=30% Similarity=0.746 Sum_probs=23.9
Q ss_pred CCeecCCCccccCCchHHHHHHHhhCCCCCCCC
Q psy13793 290 RPYKCPFCPRTFASSGNCFSHRKRMHEGETLPD 322 (337)
Q Consensus 290 ~~~~C~~C~k~f~~~~~L~~H~~~~H~~~~~~~ 322 (337)
+-..|+.|.+.|.....+..|+...|.+.-.|.
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~~l~P~ 88 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPAGLKPK 88 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhhhcChh
Confidence 346777888888888888888877887665443
No 138
>KOG4167|consensus
Probab=45.21 E-value=3.9 Score=39.79 Aligned_cols=27 Identities=19% Similarity=0.503 Sum_probs=24.5
Q ss_pred CceecCCCCCcCCChhHHHhHHhhccC
Q psy13793 75 KPYQCAICPRGYYSDIGLQNHLWSHRK 101 (337)
Q Consensus 75 ~~~~C~~C~~~F~~~~~L~~H~~~H~~ 101 (337)
.-|.|..|++.|.....+..||++|+.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 359999999999999999999999964
No 139
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=44.18 E-value=16 Score=34.97 Aligned_cols=9 Identities=44% Similarity=1.176 Sum_probs=5.2
Q ss_pred CeecCCCcc
Q psy13793 291 PYKCPFCPR 299 (337)
Q Consensus 291 ~~~C~~C~k 299 (337)
|..|+.|+-
T Consensus 253 ~~~Cp~C~s 261 (505)
T TIGR00595 253 PKTCPQCGS 261 (505)
T ss_pred CCCCCCCCC
Confidence 555666654
No 140
>PRK14873 primosome assembly protein PriA; Provisional
Probab=44.14 E-value=15 Score=36.52 Aligned_cols=11 Identities=27% Similarity=0.933 Sum_probs=7.5
Q ss_pred CCeecCCCccc
Q psy13793 290 RPYKCPFCPRT 300 (337)
Q Consensus 290 ~~~~C~~C~k~ 300 (337)
.|..|+.||-.
T Consensus 421 ~p~~Cp~Cgs~ 431 (665)
T PRK14873 421 PDWRCPRCGSD 431 (665)
T ss_pred cCccCCCCcCC
Confidence 36678888754
No 141
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=43.27 E-value=13 Score=19.05 Aligned_cols=10 Identities=30% Similarity=0.883 Sum_probs=8.1
Q ss_pred CceecCCCCC
Q psy13793 75 KPYQCAICPR 84 (337)
Q Consensus 75 ~~~~C~~C~~ 84 (337)
.+|.|+.||.
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 4799999984
No 142
>KOG4167|consensus
Probab=42.81 E-value=7.7 Score=37.87 Aligned_cols=25 Identities=24% Similarity=0.515 Sum_probs=23.2
Q ss_pred CccccccchhccCHHHHHHHHHhcc
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHT 287 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~ 287 (337)
-|.|.+|++.|....++..||++|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 3999999999999999999999984
No 143
>KOG4118|consensus
Probab=42.77 E-value=15 Score=23.80 Aligned_cols=31 Identities=26% Similarity=0.427 Sum_probs=25.3
Q ss_pred CeecCCCccccCCchHHHHHHHhhCCCCCCC
Q psy13793 291 PYKCPFCPRTFASSGNCFSHRKRMHEGETLP 321 (337)
Q Consensus 291 ~~~C~~C~k~f~~~~~L~~H~~~~H~~~~~~ 321 (337)
.|.|.+|--.-.....+..|....|+.+.+|
T Consensus 38 ~~kCtVC~~~mpdpktfkqhfe~kHpk~~~P 68 (74)
T KOG4118|consen 38 HHKCTVCMVQMPDPKTFKQHFENKHPKEPLP 68 (74)
T ss_pred HhhhHhhHhhCCCCchHHHHHhhcCCCCCCC
Confidence 3788888877777788888988899887776
No 144
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=42.43 E-value=14 Score=21.37 Aligned_cols=14 Identities=14% Similarity=0.648 Sum_probs=11.6
Q ss_pred ceecCCCCCcCCCh
Q psy13793 76 PYQCAICPRGYYSD 89 (337)
Q Consensus 76 ~~~C~~C~~~F~~~ 89 (337)
||.|..|++.|=..
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 89999999988543
No 145
>PF14353 CpXC: CpXC protein
Probab=41.95 E-value=22 Score=26.82 Aligned_cols=20 Identities=20% Similarity=0.559 Sum_probs=12.6
Q ss_pred ceecCCCCCcCCChhHHHhH
Q psy13793 76 PYQCAICPRGYYSDIGLQNH 95 (337)
Q Consensus 76 ~~~C~~C~~~F~~~~~L~~H 95 (337)
.|.|+.||..|.-...+.-|
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EEECCCCCCceecCCCEEEE
Confidence 37777777777655555444
No 146
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=41.18 E-value=11 Score=26.10 Aligned_cols=33 Identities=18% Similarity=0.213 Sum_probs=24.5
Q ss_pred CceecCCCCCcCCChhHHHhHHhhccCccccCcccceeeec
Q psy13793 75 KPYQCAICPRGYYSDIGLQNHLWSHRKILKNSNVGIYYTFT 115 (337)
Q Consensus 75 ~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~c~~c~~~~~~~ 115 (337)
.+|.|+.|++. .+.+.-.+...|..|+..|.-.
T Consensus 34 ~~~~Cp~C~~~--------~VkR~a~GIW~C~kCg~~fAGg 66 (89)
T COG1997 34 AKHVCPFCGRT--------TVKRIATGIWKCRKCGAKFAGG 66 (89)
T ss_pred cCCcCCCCCCc--------ceeeeccCeEEcCCCCCeeccc
Confidence 46889999884 3445667788999999887543
No 147
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=37.07 E-value=15 Score=30.66 Aligned_cols=30 Identities=27% Similarity=0.738 Sum_probs=21.7
Q ss_pred CCCcccccccccccCCHHHHHHHHHHcCCC
Q psy13793 177 SGTRYQCPLCNKKINTKSNLRRHLQAHKPK 206 (337)
Q Consensus 177 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 206 (337)
.+..|.|+.|++.|.-..-...|+...|++
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 345699999999999999999999765544
No 148
>KOG1280|consensus
Probab=35.72 E-value=29 Score=30.82 Aligned_cols=62 Identities=23% Similarity=0.406 Sum_probs=37.6
Q ss_pred CCCccccccchhccCHHHHHHHHH-hccCCC-CeecCCCcc-ccCCc------hHHHHHHHhhCCCCCCCC
Q psy13793 261 ERPHNCPHCEKTFKRNQDLKFHLN-QHTGER-PYKCPFCPR-TFASS------GNCFSHRKRMHEGETLPD 322 (337)
Q Consensus 261 ~k~~~C~~C~~~f~~~~~L~~H~~-~H~~~~-~~~C~~C~k-~f~~~------~~L~~H~~~~H~~~~~~~ 322 (337)
..-|.|+.|+..=.+...|..|.- .|-... ...|++|+- .+.+. .++..|+..+|+++...+
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~~~~~~qp~~~~~~~~~~~~~~~~a~~r~~~e 147 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCAANPEMQPIHSKETENLSVHWTEIALGRPLQD 147 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccccCcccCchhhhhhhhhhhhhhhhcccccchh
Confidence 346999999998888899999986 453322 234555541 22221 334455556677665544
No 149
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=35.61 E-value=35 Score=36.02 Aligned_cols=28 Identities=25% Similarity=0.713 Sum_probs=15.8
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCcccc
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTF 301 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f 301 (337)
+|.|+.||....... ++ ...|+.|+...
T Consensus 692 vy~CPsCGaev~~de---------s~--a~~CP~CGtpl 719 (1337)
T PRK14714 692 VYVCPDCGAEVPPDE---------SG--RVECPRCDVEL 719 (1337)
T ss_pred ceeCccCCCccCCCc---------cc--cccCCCCCCcc
Confidence 577888876543310 11 44688888543
No 150
>KOG4377|consensus
Probab=35.21 E-value=11 Score=34.21 Aligned_cols=26 Identities=35% Similarity=0.722 Sum_probs=18.1
Q ss_pred Ccccc--cccccccCCHHHHHHHHHHcC
Q psy13793 179 TRYQC--PLCNKKINTKSNLRRHLQAHK 204 (337)
Q Consensus 179 ~~~~C--~~C~~~f~~~~~l~~H~~~h~ 204 (337)
.-|.| +.|...+.++..+.+|..+|.
T Consensus 270 Ehyhcl~e~C~ykr~~k~DvirH~~~hk 297 (480)
T KOG4377|consen 270 EHYHCLNEYCFYKRGQKNDVIRHVEIHK 297 (480)
T ss_pred hhhcccCccccccccchhhhHHHHHHHh
Confidence 34556 458877777878888877764
No 151
>KOG2593|consensus
Probab=34.82 E-value=44 Score=30.77 Aligned_cols=16 Identities=25% Similarity=0.831 Sum_probs=8.4
Q ss_pred cccCCccCCCCCCHHH
Q psy13793 235 ELSCPDCNRVYPTNST 250 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~ 250 (337)
.|.|+.|.+.|.....
T Consensus 128 ~Y~Cp~C~kkyt~Lea 143 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEA 143 (436)
T ss_pred cccCCccccchhhhHH
Confidence 3556666555554433
No 152
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=34.21 E-value=25 Score=21.96 Aligned_cols=14 Identities=43% Similarity=0.795 Sum_probs=7.3
Q ss_pred CCeecCCCccccCC
Q psy13793 290 RPYKCPFCPRTFAS 303 (337)
Q Consensus 290 ~~~~C~~C~k~f~~ 303 (337)
+.+.|..||+.|-.
T Consensus 17 rk~~Cr~Cg~~~C~ 30 (57)
T cd00065 17 RRHHCRNCGRIFCS 30 (57)
T ss_pred cccccCcCcCCcCh
Confidence 34455566655543
No 153
>KOG4124|consensus
Probab=33.83 E-value=6 Score=34.97 Aligned_cols=49 Identities=31% Similarity=0.843 Sum_probs=37.3
Q ss_pred CCccccc--cchhccCHHHHHHHHHh-cc------------------CCCCeecCCCccccCCchHHHHH
Q psy13793 262 RPHNCPH--CEKTFKRNQDLKFHLNQ-HT------------------GERPYKCPFCPRTFASSGNCFSH 310 (337)
Q Consensus 262 k~~~C~~--C~~~f~~~~~L~~H~~~-H~------------------~~~~~~C~~C~k~f~~~~~L~~H 310 (337)
++|.|.+ |++.+.....|..|... |- -.|+|+|++|.+.+.....|.-|
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~ 417 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYH 417 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCce
Confidence 6788865 88888887788877653 31 14789999999998887777666
No 154
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=33.29 E-value=19 Score=27.96 Aligned_cols=34 Identities=29% Similarity=0.695 Sum_probs=20.5
Q ss_pred CCccccccchhccCHHHHHHHHHhccCCCCeecCCCccc
Q psy13793 262 RPHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRT 300 (337)
Q Consensus 262 k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~ 300 (337)
-+|.|. |+..|.+. .+|-.+-.|+ .|.|..|+-.
T Consensus 116 ~~Y~C~-C~q~~l~~---RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 116 YPYRCQ-CQQHYLRI---RRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred eeEEee-cCCccchh---hhcccccccc-eEEeccCCce
Confidence 467787 87765542 2343444455 7888887743
No 155
>KOG2807|consensus
Probab=33.27 E-value=62 Score=28.64 Aligned_cols=26 Identities=27% Similarity=0.709 Sum_probs=20.4
Q ss_pred CCCeecCCCccccCCchHHHHHHHhhC
Q psy13793 289 ERPYKCPFCPRTFASSGNCFSHRKRMH 315 (337)
Q Consensus 289 ~~~~~C~~C~k~f~~~~~L~~H~~~~H 315 (337)
.-.|+|..|...|-..-+...| ...|
T Consensus 343 ~~~y~C~~Ck~~FCldCDv~iH-esLh 368 (378)
T KOG2807|consen 343 SGRYRCESCKNVFCLDCDVFIH-ESLH 368 (378)
T ss_pred CCcEEchhccceeeccchHHHH-hhhh
Confidence 3469999999999888887778 4666
No 156
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=33.14 E-value=21 Score=27.04 Aligned_cols=25 Identities=44% Similarity=0.583 Sum_probs=18.1
Q ss_pred ccccccchhccCHHHHHHHHHhccCCCC
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQHTGERP 291 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~ 291 (337)
..|-++|+.|.+ |++|+.+|.|--|
T Consensus 77 IicLEDGkkfKS---LKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFKS---LKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchHH---HHHHHhcccCCCH
Confidence 468888887764 7888888877543
No 157
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=31.88 E-value=17 Score=21.47 Aligned_cols=15 Identities=33% Similarity=0.979 Sum_probs=9.9
Q ss_pred CCeecCCCccccCCc
Q psy13793 290 RPYKCPFCPRTFASS 304 (337)
Q Consensus 290 ~~~~C~~C~k~f~~~ 304 (337)
-||.|+.|++.|-..
T Consensus 12 ~~~~C~~C~~~FC~~ 26 (43)
T PF01428_consen 12 LPFKCKHCGKSFCLK 26 (43)
T ss_dssp SHEE-TTTS-EE-TT
T ss_pred CCeECCCCCcccCcc
Confidence 489999999999644
No 158
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=31.42 E-value=26 Score=20.50 Aligned_cols=18 Identities=28% Similarity=0.724 Sum_probs=12.0
Q ss_pred ecccCCCceecCCCCCcC
Q psy13793 69 RRKNGPKPYQCAICPRGY 86 (337)
Q Consensus 69 ~~h~~~~~~~C~~C~~~F 86 (337)
.+....++-.|++|+..|
T Consensus 22 ~l~~~~~~~~CpYCg~~y 39 (40)
T PF10276_consen 22 NLDDEPGPVVCPYCGTRY 39 (40)
T ss_dssp E-TTTTCEEEETTTTEEE
T ss_pred ecCCCCCeEECCCCCCEE
Confidence 333333478999999877
No 159
>PRK05580 primosome assembly protein PriA; Validated
Probab=31.38 E-value=33 Score=34.25 Aligned_cols=9 Identities=44% Similarity=1.110 Sum_probs=5.0
Q ss_pred CeecCCCcc
Q psy13793 291 PYKCPFCPR 299 (337)
Q Consensus 291 ~~~C~~C~k 299 (337)
|..|+.||.
T Consensus 421 ~~~Cp~Cg~ 429 (679)
T PRK05580 421 PKACPECGS 429 (679)
T ss_pred CCCCCCCcC
Confidence 445666654
No 160
>KOG0696|consensus
Probab=31.19 E-value=21 Score=32.92 Aligned_cols=60 Identities=28% Similarity=0.617 Sum_probs=44.5
Q ss_pred CCCCccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHHHHHHHH-HHhCCCCCccccccch
Q psy13793 204 KPKGKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNSTLRAHM-ISHSTERPHNCPHCEK 271 (337)
Q Consensus 204 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~-~~h~~~k~~~C~~C~~ 271 (337)
-|..-|.|++|. +..|.+.|.-..|.|+--++.+.....-.+|. +.|+-.-|--|..||.
T Consensus 69 fgKQGfQCqvC~--------fvvHkrChefVtF~CPGadkg~dtDdpr~kHkf~~~tYssPTFCDhCGs 129 (683)
T KOG0696|consen 69 FGKQGFQCQVCC--------FVVHKRCHEFVTFSCPGADKGPDTDDPRSKHKFKIHTYSSPTFCDHCGS 129 (683)
T ss_pred cccCceeeeEEe--------ehhhhhhcceEEEECCCCCCCCCCCCcccccceeeeecCCCchhhhHHH
Confidence 355668888885 44577877777888988888887777767774 5677777778888884
No 161
>KOG2636|consensus
Probab=31.15 E-value=32 Score=31.73 Aligned_cols=37 Identities=22% Similarity=0.516 Sum_probs=28.8
Q ss_pred ecccCCCceecCCCC-CcCCChhHHHhHH--hhccCcccc
Q psy13793 69 RRKNGPKPYQCAICP-RGYYSDIGLQNHL--WSHRKILKN 105 (337)
Q Consensus 69 ~~h~~~~~~~C~~C~-~~F~~~~~L~~H~--~~H~~~~~c 105 (337)
+.|.=+..|.|.+|| +++..+..+.+|. +.|.-.++|
T Consensus 394 KLHGL~~ey~CEICGNy~Y~GrkaF~RHF~EwRH~hGmrC 433 (497)
T KOG2636|consen 394 KLHGLDIEYNCEICGNYVYKGRKAFDRHFNEWRHAHGMRC 433 (497)
T ss_pred hhcCCCcccceeeccCccccCcHHHHHHhHHHHHhhccee
Confidence 446667789999999 9999999999997 455444444
No 162
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=30.57 E-value=30 Score=22.69 Aligned_cols=11 Identities=9% Similarity=0.006 Sum_probs=2.9
Q ss_pred eeehhhhcccc
Q psy13793 48 VVKSLEHLKDL 58 (337)
Q Consensus 48 ~~~~~~~~~~~ 58 (337)
...|..|.+.|
T Consensus 9 ~~~C~~C~~~F 19 (69)
T PF01363_consen 9 ASNCMICGKKF 19 (69)
T ss_dssp -SB-TTT--B-
T ss_pred CCcCcCcCCcC
Confidence 33444555555
No 163
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=29.76 E-value=13 Score=26.16 Aligned_cols=31 Identities=29% Similarity=0.783 Sum_probs=18.1
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCcccccccccccc
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFK 219 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 219 (337)
..|.|+.|++.-..+ .....|.|..|++.|.
T Consensus 34 ~ky~Cp~Cgk~~vkR----------~a~GIW~C~~C~~~~A 64 (90)
T PF01780_consen 34 AKYTCPFCGKTSVKR----------VATGIWKCKKCGKKFA 64 (90)
T ss_dssp S-BEESSSSSSEEEE----------EETTEEEETTTTEEEE
T ss_pred CCCcCCCCCCceeEE----------eeeEEeecCCCCCEEe
Confidence 347788887653211 1223578888887774
No 164
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=29.38 E-value=18 Score=25.41 Aligned_cols=31 Identities=26% Similarity=0.697 Sum_probs=17.5
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCcccccccccccc
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFK 219 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 219 (337)
..|.|+.|++.-.. -.....+.|..|++.|.
T Consensus 35 a~y~CpfCgk~~vk----------R~a~GIW~C~~C~~~~A 65 (90)
T PTZ00255 35 AKYFCPFCGKHAVK----------RQAVGIWRCKGCKKTVA 65 (90)
T ss_pred CCccCCCCCCCcee----------eeeeEEEEcCCCCCEEe
Confidence 34777777653211 11234577777777764
No 165
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=29.22 E-value=32 Score=31.78 Aligned_cols=13 Identities=15% Similarity=0.529 Sum_probs=6.7
Q ss_pred CeecCCCccccCC
Q psy13793 291 PYKCPFCPRTFAS 303 (337)
Q Consensus 291 ~~~C~~C~k~f~~ 303 (337)
-|+|+.||..+..
T Consensus 367 g~rC~kCg~~~~~ 379 (421)
T COG1571 367 GFRCKKCGTRARE 379 (421)
T ss_pred CcccccccccCCc
Confidence 5555555555443
No 166
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=28.58 E-value=38 Score=20.87 Aligned_cols=11 Identities=9% Similarity=0.045 Sum_probs=5.8
Q ss_pred CccccCcccce
Q psy13793 101 KILKNSNVGIY 111 (337)
Q Consensus 101 ~~~~c~~c~~~ 111 (337)
+.+.|..|+.+
T Consensus 36 ~r~~C~~Cgyt 46 (50)
T PRK00432 36 DRWHCGKCGYT 46 (50)
T ss_pred CcEECCCcCCE
Confidence 44555555544
No 167
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=28.23 E-value=41 Score=19.54 Aligned_cols=11 Identities=27% Similarity=0.851 Sum_probs=8.9
Q ss_pred eecCCCCCcCC
Q psy13793 77 YQCAICPRGYY 87 (337)
Q Consensus 77 ~~C~~C~~~F~ 87 (337)
|.|..|+..|.
T Consensus 29 y~C~~C~~~w~ 39 (40)
T smart00440 29 YVCTKCGHRWR 39 (40)
T ss_pred EEeCCCCCEeC
Confidence 88998988764
No 168
>KOG2071|consensus
Probab=28.09 E-value=37 Score=32.60 Aligned_cols=28 Identities=21% Similarity=0.350 Sum_probs=24.3
Q ss_pred CCCceecCCCCCcCCChhHHHhHHhhcc
Q psy13793 73 GPKPYQCAICPRGYYSDIGLQNHLWSHR 100 (337)
Q Consensus 73 ~~~~~~C~~C~~~F~~~~~L~~H~~~H~ 100 (337)
.+.|-.|..||..|........||.+|.
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhh
Confidence 4667999999999999999988888773
No 169
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=28.08 E-value=33 Score=21.43 Aligned_cols=12 Identities=33% Similarity=1.030 Sum_probs=5.7
Q ss_pred ccccccchhccC
Q psy13793 264 HNCPHCEKTFKR 275 (337)
Q Consensus 264 ~~C~~C~~~f~~ 275 (337)
++|+.|+..|..
T Consensus 29 W~C~~Cgh~w~~ 40 (55)
T PF14311_consen 29 WKCPKCGHEWKA 40 (55)
T ss_pred EECCCCCCeeEc
Confidence 445555544443
No 170
>KOG2907|consensus
Probab=27.39 E-value=30 Score=25.34 Aligned_cols=11 Identities=45% Similarity=1.225 Sum_probs=6.9
Q ss_pred eecCCCccccC
Q psy13793 292 YKCPFCPRTFA 302 (337)
Q Consensus 292 ~~C~~C~k~f~ 302 (337)
|.|+.|++.|+
T Consensus 103 YTC~kC~~k~~ 113 (116)
T KOG2907|consen 103 YTCPKCKYKFT 113 (116)
T ss_pred EEcCccceeee
Confidence 66666666654
No 171
>KOG4377|consensus
Probab=27.20 E-value=20 Score=32.55 Aligned_cols=103 Identities=27% Similarity=0.417 Sum_probs=64.3
Q ss_pred ccccc--cccccccCHHHHHHHHHhcCCc-------------cccC--CccCCCCCCHHHHHHHHHHhCCC-------CC
Q psy13793 208 KHTCE--ICFRIFKTEINLTRHLECHDTV-------------ELSC--PDCNRVYPTNSTLRAHMISHSTE-------RP 263 (337)
Q Consensus 208 ~~~C~--~C~~~f~~~~~l~~H~~~h~~~-------------~~~C--~~C~~~f~~~~~l~~H~~~h~~~-------k~ 263 (337)
-|.|. .|+..+..+..+.+|..+|..+ .|.| .+|.+ +-+....|...|+.. --
T Consensus 271 hyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrth 347 (480)
T KOG4377|consen 271 HYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTH 347 (480)
T ss_pred hhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecce
Confidence 46674 5988888899999999888421 2455 46777 445566666666421 13
Q ss_pred ccccccchhccCHHHHHHHHHhc----cCCC------------------------CeecCC--CccccCCchHHHHHHHh
Q psy13793 264 HNCPHCEKTFKRNQDLKFHLNQH----TGER------------------------PYKCPF--CPRTFASSGNCFSHRKR 313 (337)
Q Consensus 264 ~~C~~C~~~f~~~~~L~~H~~~H----~~~~------------------------~~~C~~--C~k~f~~~~~L~~H~~~ 313 (337)
|.|..|+-+++.. ...|...| .++. -|-|.. |+..|.+.+.+..| ++
T Consensus 348 fhC~r~gCTdtfK--~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~sh-kr 424 (480)
T KOG4377|consen 348 FHCQRIGCTDTFK--DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASH-KR 424 (480)
T ss_pred eEEeccCCccccc--cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhh-hh
Confidence 6788777333332 33444333 2211 155543 88889999999999 68
Q ss_pred hCC
Q psy13793 314 MHE 316 (337)
Q Consensus 314 ~H~ 316 (337)
.|-
T Consensus 425 khe 427 (480)
T KOG4377|consen 425 KHE 427 (480)
T ss_pred hhh
Confidence 885
No 172
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=26.90 E-value=43 Score=20.06 Aligned_cols=9 Identities=33% Similarity=1.028 Sum_probs=4.6
Q ss_pred CCeecCCCc
Q psy13793 290 RPYKCPFCP 298 (337)
Q Consensus 290 ~~~~C~~C~ 298 (337)
..|+|..|+
T Consensus 36 ~~~~C~~C~ 44 (46)
T PF12760_consen 36 GRYRCKACR 44 (46)
T ss_pred CeEECCCCC
Confidence 445555554
No 173
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=26.72 E-value=19 Score=25.41 Aligned_cols=31 Identities=35% Similarity=0.779 Sum_probs=17.2
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCcccccccccccc
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFK 219 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 219 (337)
..|.|+.|++.-.. -.....|.|..|++.|.
T Consensus 34 a~y~CpfCgk~~vk----------R~a~GIW~C~~C~~~~A 64 (91)
T TIGR00280 34 AKYVCPFCGKKTVK----------RGSTGIWTCRKCGAKFA 64 (91)
T ss_pred cCccCCCCCCCceE----------EEeeEEEEcCCCCCEEe
Confidence 34777777653211 11234577777777764
No 175
>KOG2636|consensus
Probab=26.28 E-value=45 Score=30.81 Aligned_cols=77 Identities=23% Similarity=0.340 Sum_probs=46.0
Q ss_pred HHhCCCCCccccccc-hhccCHHHHHHHHH--hc-cCCCCeecCCCccccCCc------hHHHHHHHhhCCCCCCCCCCc
Q psy13793 256 ISHSTERPHNCPHCE-KTFKRNQDLKFHLN--QH-TGERPYKCPFCPRTFASS------GNCFSHRKRMHEGETLPDISL 325 (337)
Q Consensus 256 ~~h~~~k~~~C~~C~-~~f~~~~~L~~H~~--~H-~~~~~~~C~~C~k~f~~~------~~L~~H~~~~H~~~~~~~~~~ 325 (337)
+.|.-..-|.|.+|| +++.-...|.+|.. .| +|-+-+-=+.|. .|... -.|+.-++.--..++.+.-.+
T Consensus 394 KLHGL~~ey~CEICGNy~Y~GrkaF~RHF~EwRH~hGmrCLGIpnt~-~F~~IT~I~eA~~LW~k~k~q~~~~kw~~~~e 472 (497)
T KOG2636|consen 394 KLHGLDIEYNCEICGNYVYKGRKAFDRHFNEWRHAHGMRCLGIPNTS-VFKGITKIEEALELWKKMKEQSQSEKWPPDLE 472 (497)
T ss_pred hhcCCCcccceeeccCccccCcHHHHHHhHHHHHhhcceecCCCCcH-HhcccccHHHHHHHHHHHHHhhhhccCCchhH
Confidence 456667889999999 89999999999965 23 333322222222 34332 346666554444455555555
Q ss_pred ccCCccch
Q psy13793 326 DEMSDNET 333 (337)
Q Consensus 326 ~~~~~~~~ 333 (337)
+++++++.
T Consensus 473 eE~ED~eG 480 (497)
T KOG2636|consen 473 EEYEDEEG 480 (497)
T ss_pred hhhhcccc
Confidence 55555543
No 176
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=26.14 E-value=36 Score=24.75 Aligned_cols=29 Identities=21% Similarity=0.697 Sum_probs=17.8
Q ss_pred cccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCc
Q psy13793 265 NCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASS 304 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~ 304 (337)
.|+.|+..|++. ....|.|+.|+..+...
T Consensus 4 ~CP~C~seytY~-----------dg~~~iCpeC~~EW~~~ 32 (109)
T TIGR00686 4 PCPKCNSEYTYH-----------DGTQLICPSCLYEWNEN 32 (109)
T ss_pred cCCcCCCcceEe-----------cCCeeECcccccccccc
Confidence 466776666552 23357777777776544
No 177
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=25.96 E-value=23 Score=29.05 Aligned_cols=36 Identities=22% Similarity=0.415 Sum_probs=0.0
Q ss_pred cccCCCceecCCCC-CcCCChhHHHhHHh--hccCcccc
Q psy13793 70 RKNGPKPYQCAICP-RGYYSDIGLQNHLW--SHRKILKN 105 (337)
Q Consensus 70 ~h~~~~~~~C~~C~-~~F~~~~~L~~H~~--~H~~~~~c 105 (337)
.|.=...|.|.+|| .+|..+..+..|.. .|...++|
T Consensus 95 LhGL~~ey~CEICGN~~Y~GrkaFekHF~E~rH~~Glrc 133 (196)
T PF11931_consen 95 LHGLGVEYKCEICGNQSYKGRKAFEKHFQEWRHAYGLRC 133 (196)
T ss_dssp ---------------------------------------
T ss_pred HhCCCCeeeeEeCCCcceecHHHHHHhcChhHHHccChh
Confidence 34455689999998 67779999999974 56544554
No 178
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=25.71 E-value=24 Score=21.38 Aligned_cols=37 Identities=11% Similarity=0.113 Sum_probs=15.5
Q ss_pred cCCCCCcCCChhHHHhHHhhccCccccC--cccceeeec
Q psy13793 79 CAICPRGYYSDIGLQNHLWSHRKILKNS--NVGIYYTFT 115 (337)
Q Consensus 79 C~~C~~~F~~~~~L~~H~~~H~~~~~c~--~c~~~~~~~ 115 (337)
|+.||...........+...-..-+.|. +||..|...
T Consensus 2 CP~Cg~~a~ir~S~~~s~~~~~~Y~qC~N~~Cg~tfv~~ 40 (47)
T PF04606_consen 2 CPHCGSKARIRTSRQLSPLTRELYCQCTNPECGHTFVAN 40 (47)
T ss_pred cCCCCCeeEEEEchhhCcceEEEEEEECCCcCCCEEEEE
Confidence 5556554444443333322222223333 355555443
No 179
>PRK10220 hypothetical protein; Provisional
Probab=25.53 E-value=44 Score=24.36 Aligned_cols=29 Identities=24% Similarity=0.842 Sum_probs=17.3
Q ss_pred cccccchhccCHHHHHHHHHhccCCCCeecCCCccccCCc
Q psy13793 265 NCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFASS 304 (337)
Q Consensus 265 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~ 304 (337)
.|+.|...|++. ....|.|+.|+..|+..
T Consensus 5 ~CP~C~seytY~-----------d~~~~vCpeC~hEW~~~ 33 (111)
T PRK10220 5 HCPKCNSEYTYE-----------DNGMYICPECAHEWNDA 33 (111)
T ss_pred cCCCCCCcceEc-----------CCCeEECCcccCcCCcc
Confidence 466676665552 23357777777766544
No 180
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=25.52 E-value=67 Score=17.89 Aligned_cols=21 Identities=33% Similarity=0.573 Sum_probs=12.8
Q ss_pred CeecCCCccccCCchHHHHHHH
Q psy13793 291 PYKCPFCPRTFASSGNCFSHRK 312 (337)
Q Consensus 291 ~~~C~~C~k~f~~~~~L~~H~~ 312 (337)
-+.|+.|++..... -+..|+.
T Consensus 4 ~~~C~nC~R~v~a~-RfA~HLe 24 (33)
T PF08209_consen 4 YVECPNCGRPVAAS-RFAPHLE 24 (33)
T ss_dssp EEE-TTTSSEEEGG-GHHHHHH
T ss_pred eEECCCCcCCcchh-hhHHHHH
Confidence 46788888776443 4667764
No 181
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=25.43 E-value=20 Score=27.47 Aligned_cols=7 Identities=57% Similarity=1.374 Sum_probs=3.9
Q ss_pred eecCCCc
Q psy13793 292 YKCPFCP 298 (337)
Q Consensus 292 ~~C~~C~ 298 (337)
+.|+.||
T Consensus 108 ~~CP~Cg 114 (135)
T PRK03824 108 LKCPKCG 114 (135)
T ss_pred cCCcCCC
Confidence 4455555
No 182
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=25.39 E-value=38 Score=19.97 Aligned_cols=13 Identities=38% Similarity=0.891 Sum_probs=8.3
Q ss_pred ecCCCccccCCch
Q psy13793 293 KCPFCPRTFASSG 305 (337)
Q Consensus 293 ~C~~C~k~f~~~~ 305 (337)
.|.+||+.|+.+.
T Consensus 10 ~C~~C~rpf~WRK 22 (42)
T PF10013_consen 10 ICPVCGRPFTWRK 22 (42)
T ss_pred cCcccCCcchHHH
Confidence 3667777776654
No 183
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=25.31 E-value=29 Score=20.43 Aligned_cols=11 Identities=27% Similarity=0.622 Sum_probs=5.4
Q ss_pred CCeecCCCccc
Q psy13793 290 RPYKCPFCPRT 300 (337)
Q Consensus 290 ~~~~C~~C~k~ 300 (337)
..+.|+.||..
T Consensus 18 g~~vC~~CG~V 28 (43)
T PF08271_consen 18 GELVCPNCGLV 28 (43)
T ss_dssp TEEEETTT-BB
T ss_pred CeEECCCCCCE
Confidence 34566666643
No 184
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=24.92 E-value=47 Score=21.77 Aligned_cols=10 Identities=30% Similarity=0.873 Sum_probs=4.8
Q ss_pred Cccccccchh
Q psy13793 263 PHNCPHCEKT 272 (337)
Q Consensus 263 ~~~C~~C~~~ 272 (337)
.|.|+.||..
T Consensus 46 ~~~C~~Cg~~ 55 (69)
T PF07282_consen 46 VFTCPNCGFE 55 (69)
T ss_pred eEEcCCCCCE
Confidence 3455555544
No 185
>PRK12496 hypothetical protein; Provisional
Probab=24.69 E-value=42 Score=26.74 Aligned_cols=11 Identities=27% Similarity=0.860 Sum_probs=6.8
Q ss_pred ccccccchhcc
Q psy13793 264 HNCPHCEKTFK 274 (337)
Q Consensus 264 ~~C~~C~~~f~ 274 (337)
|.|..|++.|.
T Consensus 128 ~~C~gC~~~~~ 138 (164)
T PRK12496 128 KVCKGCKKKYP 138 (164)
T ss_pred EECCCCCcccc
Confidence 55666766554
No 186
>KOG2272|consensus
Probab=24.04 E-value=35 Score=28.89 Aligned_cols=68 Identities=21% Similarity=0.499 Sum_probs=33.4
Q ss_pred ccccccccccCCHHHHHHHHH----------HcCCCCccccccccccccCHHHHHHHHHhcCCccccCCccCCCCCCHH
Q psy13793 181 YQCPLCNKKINTKSNLRRHLQ----------AHKPKGKHTCEICFRIFKTEINLTRHLECHDTVELSCPDCNRVYPTNS 249 (337)
Q Consensus 181 ~~C~~C~~~f~~~~~l~~H~~----------~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~ 249 (337)
|.|+.|++...+...++..=+ .-.+...|.|..|-..... ..|.-.-...+...|.|..|++...+.+
T Consensus 100 F~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn~k~Ka~~~g~YvC~KCh~~iD~-~~l~fr~d~yH~yHFkCt~C~keL~sda 177 (332)
T KOG2272|consen 100 FRCDLCNKHLADQGFYRNQGRALCRECNQKEKAKGRGRYVCQKCHAHIDE-QPLTFRGDPYHPYHFKCTTCGKELTSDA 177 (332)
T ss_pred chhHHHHHHHhhhhhHhhcchHHhhhhhhhhcccccceeehhhhhhhccc-ccccccCCCCCccceecccccccccchh
Confidence 778888877666555442111 0112234666666433322 2222222222334467777777766543
No 187
>COG4640 Predicted membrane protein [Function unknown]
Probab=23.87 E-value=49 Score=30.09 Aligned_cols=19 Identities=26% Similarity=0.522 Sum_probs=9.9
Q ss_pred CCCeecCCCccccCCchHH
Q psy13793 289 ERPYKCPFCPRTFASSGNC 307 (337)
Q Consensus 289 ~~~~~C~~C~k~f~~~~~L 307 (337)
+..++|+.||..|+....+
T Consensus 13 Ed~~qC~qCG~~~t~~~sq 31 (465)
T COG4640 13 EDDVQCTQCGHKFTSRQSQ 31 (465)
T ss_pred cccccccccCCcCCchhhh
Confidence 3344466666666555443
No 188
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=23.80 E-value=33 Score=24.00 Aligned_cols=27 Identities=22% Similarity=0.818 Sum_probs=15.5
Q ss_pred cccCCccCCCCCCHHHHHHHHHHhCCCCCccccccch
Q psy13793 235 ELSCPDCNRVYPTNSTLRAHMISHSTERPHNCPHCEK 271 (337)
Q Consensus 235 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~ 271 (337)
|-.|..||+.|.+. .-.+|-.|+.|..
T Consensus 58 Pa~CkkCGfef~~~----------~ik~pSRCP~CKS 84 (97)
T COG3357 58 PARCKKCGFEFRDD----------KIKKPSRCPKCKS 84 (97)
T ss_pred ChhhcccCcccccc----------ccCCcccCCcchh
Confidence 45677777777651 1124666777653
No 189
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=23.64 E-value=40 Score=24.20 Aligned_cols=17 Identities=35% Similarity=0.954 Sum_probs=10.6
Q ss_pred hccCCCCeecCCCccccC
Q psy13793 285 QHTGERPYKCPFCPRTFA 302 (337)
Q Consensus 285 ~H~~~~~~~C~~C~k~f~ 302 (337)
.+.| ++++|..||..|.
T Consensus 74 l~~g-~~~rC~eCG~~fk 90 (97)
T cd00924 74 LEKG-KPKRCPECGHVFK 90 (97)
T ss_pred EeCC-CceeCCCCCcEEE
Confidence 3444 5777777776664
No 190
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=23.40 E-value=42 Score=25.43 Aligned_cols=27 Identities=22% Similarity=0.337 Sum_probs=19.3
Q ss_pred ceecCCCCCcCCChhHHHhHHhhccCccccCcccc
Q psy13793 76 PYQCAICPRGYYSDIGLQNHLWSHRKILKNSNVGI 110 (337)
Q Consensus 76 ~~~C~~C~~~F~~~~~L~~H~~~H~~~~~c~~c~~ 110 (337)
|++|..||+.|..-+.- |. .-|+.||.
T Consensus 1 PH~Ct~Cg~~f~dgs~e---il-----~GCP~CGg 27 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGSKE---IL-----SGCPECGG 27 (131)
T ss_pred CcccCcCCCCcCCCcHH---HH-----ccCcccCC
Confidence 68899999999876632 11 34788874
No 191
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=23.12 E-value=38 Score=20.81 Aligned_cols=14 Identities=21% Similarity=0.534 Sum_probs=8.7
Q ss_pred CCceecCCCCCcCC
Q psy13793 74 PKPYQCAICPRGYY 87 (337)
Q Consensus 74 ~~~~~C~~C~~~F~ 87 (337)
++.+.|..||..|.
T Consensus 2 Dk~l~C~dCg~~Fv 15 (49)
T PF13451_consen 2 DKTLTCKDCGAEFV 15 (49)
T ss_pred CeeEEcccCCCeEE
Confidence 34566777776664
No 192
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=22.90 E-value=35 Score=21.23 Aligned_cols=12 Identities=33% Similarity=0.974 Sum_probs=6.6
Q ss_pred ecCCCccccCCc
Q psy13793 293 KCPFCPRTFASS 304 (337)
Q Consensus 293 ~C~~C~k~f~~~ 304 (337)
.||+|++.|...
T Consensus 22 ~CPlC~r~l~~e 33 (54)
T PF04423_consen 22 CCPLCGRPLDEE 33 (54)
T ss_dssp E-TTT--EE-HH
T ss_pred cCCCCCCCCCHH
Confidence 899999998764
No 193
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=22.90 E-value=57 Score=20.61 Aligned_cols=15 Identities=20% Similarity=0.514 Sum_probs=8.5
Q ss_pred CCCeecCCCccccCC
Q psy13793 289 ERPYKCPFCPRTFAS 303 (337)
Q Consensus 289 ~~~~~C~~C~k~f~~ 303 (337)
.-.|.|+.||..+..
T Consensus 12 ~v~~~Cp~cGipthc 26 (55)
T PF13824_consen 12 HVNFECPDCGIPTHC 26 (55)
T ss_pred ccCCcCCCCCCcCcc
Confidence 345666666655443
No 194
>KOG0717|consensus
Probab=22.73 E-value=48 Score=30.88 Aligned_cols=22 Identities=32% Similarity=0.646 Sum_probs=19.8
Q ss_pred eecCCCCCcCCChhHHHhHHhh
Q psy13793 77 YQCAICPRGYYSDIGLQNHLWS 98 (337)
Q Consensus 77 ~~C~~C~~~F~~~~~L~~H~~~ 98 (337)
+.|.+|.+.|.+..+|..|..+
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 6799999999999999999753
No 195
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=22.43 E-value=27 Score=25.92 Aligned_cols=12 Identities=25% Similarity=0.686 Sum_probs=6.1
Q ss_pred ccCCccCCCCCC
Q psy13793 236 LSCPDCNRVYPT 247 (337)
Q Consensus 236 ~~C~~C~~~f~~ 247 (337)
+.|..|+..|..
T Consensus 71 ~~C~~Cg~~~~~ 82 (115)
T TIGR00100 71 CECEDCSEEVSP 82 (115)
T ss_pred EEcccCCCEEec
Confidence 345555555443
No 196
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=21.95 E-value=47 Score=30.68 Aligned_cols=29 Identities=17% Similarity=0.141 Sum_probs=18.6
Q ss_pred ecCCCCCcCCChhHHHhHHhhccC-ccccCcccceeeecc
Q psy13793 78 QCAICPRGYYSDIGLQNHLWSHRK-ILKNSNVGIYYTFTE 116 (337)
Q Consensus 78 ~C~~C~~~F~~~~~L~~H~~~H~~-~~~c~~c~~~~~~~~ 116 (337)
.|+.||... .+-.. .|+|+.|+..+....
T Consensus 352 ~Cp~Cg~~m----------~S~G~~g~rC~kCg~~~~~~~ 381 (421)
T COG1571 352 VCPRCGGRM----------KSAGRNGFRCKKCGTRARETL 381 (421)
T ss_pred CCCccCCch----------hhcCCCCcccccccccCCccc
Confidence 588888753 33222 688888887765543
No 197
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=21.90 E-value=1.2e+02 Score=19.11 Aligned_cols=10 Identities=10% Similarity=0.225 Sum_probs=6.6
Q ss_pred ecCCCccccC
Q psy13793 293 KCPFCPRTFA 302 (337)
Q Consensus 293 ~C~~C~k~f~ 302 (337)
.|+.|++.++
T Consensus 37 ~cP~~~~~~~ 46 (63)
T smart00504 37 TDPVTGQPLT 46 (63)
T ss_pred CCCCCcCCCC
Confidence 5777776664
No 198
>COG1773 Rubredoxin [Energy production and conversion]
Probab=21.27 E-value=47 Score=20.99 Aligned_cols=13 Identities=15% Similarity=0.800 Sum_probs=9.4
Q ss_pred CeecCCCccccCC
Q psy13793 291 PYKCPFCPRTFAS 303 (337)
Q Consensus 291 ~~~C~~C~k~f~~ 303 (337)
.|+|..||..|.-
T Consensus 3 ~~~C~~CG~vYd~ 15 (55)
T COG1773 3 RWRCSVCGYVYDP 15 (55)
T ss_pred ceEecCCceEecc
Confidence 5778888877754
No 199
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=21.16 E-value=58 Score=23.38 Aligned_cols=27 Identities=19% Similarity=0.238 Sum_probs=19.6
Q ss_pred ceecCCCCCcCCChhHHHhHHhhccCccccCcccc
Q psy13793 76 PYQCAICPRGYYSDIGLQNHLWSHRKILKNSNVGI 110 (337)
Q Consensus 76 ~~~C~~C~~~F~~~~~L~~H~~~H~~~~~c~~c~~ 110 (337)
|+.|..||..|..-+.+.. .-|+.||.
T Consensus 2 pH~CtrCG~vf~~g~~~il--------~GCp~CG~ 28 (112)
T COG3364 2 PHQCTRCGEVFDDGSEEIL--------SGCPKCGC 28 (112)
T ss_pred CceecccccccccccHHHH--------ccCccccc
Confidence 6889999999998654432 24777775
No 200
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=21.10 E-value=26 Score=24.65 Aligned_cols=31 Identities=23% Similarity=0.552 Sum_probs=16.8
Q ss_pred CcccccccccccCCHHHHHHHHHHcCCCCcccccccccccc
Q psy13793 179 TRYQCPLCNKKINTKSNLRRHLQAHKPKGKHTCEICFRIFK 219 (337)
Q Consensus 179 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 219 (337)
..|.|+.|++.-.. -.+...+.|..|++.|.
T Consensus 35 a~y~CpfCgk~~vk----------R~a~GIW~C~~C~~~~A 65 (90)
T PRK03976 35 AKHVCPVCGRPKVK----------RVGTGIWECRKCGAKFA 65 (90)
T ss_pred cCccCCCCCCCceE----------EEEEEEEEcCCCCCEEe
Confidence 34777777654211 11233577777777664
No 201
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.95 E-value=64 Score=20.74 Aligned_cols=35 Identities=17% Similarity=0.448 Sum_probs=17.8
Q ss_pred cccccccc-cCCHHHHHHHHHHcCCCCccccccccc
Q psy13793 182 QCPLCNKK-INTKSNLRRHLQAHKPKGKHTCEICFR 216 (337)
Q Consensus 182 ~C~~C~~~-f~~~~~l~~H~~~h~~~~~~~C~~C~~ 216 (337)
+|-+|++. |.+...+..-+....+-+.|.|+.|..
T Consensus 4 kCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~ 39 (68)
T COG4896 4 KCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEH 39 (68)
T ss_pred eEEEecceeeecchhHHHHHhhCCCceeEechhhHh
Confidence 46666543 444444444444444445566666643
No 202
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=20.51 E-value=1.1e+02 Score=24.72 Aligned_cols=7 Identities=43% Similarity=1.469 Sum_probs=3.5
Q ss_pred eecCCCc
Q psy13793 292 YKCPFCP 298 (337)
Q Consensus 292 ~~C~~C~ 298 (337)
|.|+.||
T Consensus 133 F~Cp~Cg 139 (176)
T COG1675 133 FTCPKCG 139 (176)
T ss_pred CCCCCCC
Confidence 4455555
No 203
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=20.26 E-value=27 Score=27.28 Aligned_cols=32 Identities=28% Similarity=0.783 Sum_probs=17.9
Q ss_pred CccccccchhccCHHHHHHHHHhccCCCCeecCCCccccC
Q psy13793 263 PHNCPHCEKTFKRNQDLKFHLNQHTGERPYKCPFCPRTFA 302 (337)
Q Consensus 263 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~ 302 (337)
.|.|..|+..+.. |.+. ....|.|..|+-.|.
T Consensus 123 ~~~C~~C~~~~~r------~~~~--~~~~~~C~~C~~~l~ 154 (157)
T PF10263_consen 123 VYRCPSCGREYKR------HRRS--KRKRYRCGRCGGPLV 154 (157)
T ss_pred EEEcCCCCCEeee------eccc--chhhEECCCCCCEEE
Confidence 3677777766533 2222 223477777776553
No 204
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=20.19 E-value=30 Score=18.44 Aligned_cols=18 Identities=28% Similarity=0.680 Sum_probs=10.5
Q ss_pred eecCCCccccCCchHHHHH
Q psy13793 292 YKCPFCPRTFASSGNCFSH 310 (337)
Q Consensus 292 ~~C~~C~k~f~~~~~L~~H 310 (337)
|.|-.|++.|.. .....|
T Consensus 1 ~sCiDC~~~F~~-~~y~~H 18 (28)
T PF08790_consen 1 FSCIDCSKDFDG-DSYKSH 18 (28)
T ss_dssp EEETTTTEEEEG-GGTTT-
T ss_pred CeeecCCCCcCc-CCcCCC
Confidence 567788888843 333444
No 205
>PF14787 zf-CCHC_5: GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=20.15 E-value=48 Score=18.82 Aligned_cols=16 Identities=31% Similarity=0.810 Sum_probs=10.0
Q ss_pred ecCCCccccCCchHHH
Q psy13793 293 KCPFCPRTFASSGNCF 308 (337)
Q Consensus 293 ~C~~C~k~f~~~~~L~ 308 (337)
.|+.|++.|.+.++..
T Consensus 4 ~CprC~kg~Hwa~~C~ 19 (36)
T PF14787_consen 4 LCPRCGKGFHWASECR 19 (36)
T ss_dssp C-TTTSSSCS-TTT--
T ss_pred cCcccCCCcchhhhhh
Confidence 5888999998887653
No 206
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=20.12 E-value=39 Score=28.76 Aligned_cols=81 Identities=19% Similarity=0.426 Sum_probs=45.2
Q ss_pred CCCCcccccccccccCCHHHHHHHHHH--cCCCCccccccccccccCHHHH-------HHHHHhc-----CCccccCCcc
Q psy13793 176 SSGTRYQCPLCNKKINTKSNLRRHLQA--HKPKGKHTCEICFRIFKTEINL-------TRHLECH-----DTVELSCPDC 241 (337)
Q Consensus 176 ~~~~~~~C~~C~~~f~~~~~l~~H~~~--h~~~~~~~C~~C~~~f~~~~~l-------~~H~~~h-----~~~~~~C~~C 241 (337)
.+.+.|+|..|..-...-..+ .|+.. -.....|+|..|++.=. .+-| ..|++.. .+.++.|+.|
T Consensus 138 hGGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrlGq-~sCLRCK~cfCddHvrrKg~ky~k~k~~PCPKC 215 (314)
T PF06524_consen 138 HGGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNRLGQ-YSCLRCKICFCDDHVRRKGFKYEKGKPIPCPKC 215 (314)
T ss_pred CCCeEEEeecCCCeeeccchh-hhhhhhhhhhcccccccccccccc-hhhhheeeeehhhhhhhcccccccCCCCCCCCC
Confidence 366779999997654443333 34432 11223466665554211 1111 1244432 2467899999
Q ss_pred CCCCCCHHHHHHHHHHh
Q psy13793 242 NRVYPTNSTLRAHMISH 258 (337)
Q Consensus 242 ~~~f~~~~~l~~H~~~h 258 (337)
|........|..-.+.|
T Consensus 216 g~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 216 GYETQETKDLSMSTRSH 232 (314)
T ss_pred CCcccccccceeeeecc
Confidence 99888777776655554
Done!