Query psy13347
Match_columns 339
No_of_seqs 436 out of 2683
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 17:05:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13347.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13347hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 1.5E-26 3.3E-31 196.9 6.0 137 102-258 125-267 (279)
2 KOG2462|consensus 99.9 6.2E-25 1.3E-29 187.1 5.6 107 104-230 158-264 (279)
3 KOG1074|consensus 99.8 2.9E-19 6.2E-24 171.6 5.3 66 197-262 621-696 (958)
4 KOG3608|consensus 99.7 4.8E-19 1E-23 154.9 0.3 169 50-260 201-378 (467)
5 KOG3576|consensus 99.7 4.5E-18 9.8E-23 138.2 2.0 110 133-260 115-238 (267)
6 KOG3608|consensus 99.7 8E-18 1.7E-22 147.4 1.2 187 71-261 135-348 (467)
7 KOG3576|consensus 99.7 8.7E-18 1.9E-22 136.6 -0.6 133 96-235 106-238 (267)
8 KOG1074|consensus 99.6 2.1E-16 4.5E-21 152.2 3.2 57 208-264 604-663 (958)
9 KOG3623|consensus 99.5 5.8E-15 1.3E-19 140.1 1.3 80 133-230 892-971 (1007)
10 KOG3623|consensus 99.4 8.5E-14 1.8E-18 132.3 2.5 119 135-258 210-333 (1007)
11 PLN03086 PRLI-interacting fact 99.2 3.2E-11 7E-16 115.5 7.4 110 54-177 405-514 (567)
12 PHA00733 hypothetical protein 99.0 1.7E-10 3.6E-15 90.8 3.3 85 130-234 35-124 (128)
13 PHA00733 hypothetical protein 99.0 2.1E-10 4.5E-15 90.2 1.8 88 162-260 37-125 (128)
14 PLN03086 PRLI-interacting fact 98.9 1.6E-09 3.4E-14 104.1 7.5 119 105-258 431-564 (567)
15 PHA02768 hypothetical protein; 98.9 9E-10 1.9E-14 71.6 2.8 44 209-252 5-49 (55)
16 PHA02768 hypothetical protein; 98.6 1.1E-08 2.4E-13 66.5 1.4 44 135-184 5-48 (55)
17 KOG3993|consensus 98.5 2.4E-08 5.1E-13 90.5 -0.6 154 103-260 263-484 (500)
18 PF13465 zf-H2C2_2: Zinc-finge 98.5 1E-07 2.2E-12 53.0 1.9 25 152-178 1-25 (26)
19 PHA00732 hypothetical protein 98.4 2.1E-07 4.5E-12 66.5 2.8 47 209-258 1-48 (79)
20 KOG3993|consensus 98.3 1.6E-07 3.6E-12 85.1 -0.1 96 135-234 267-381 (500)
21 PF13465 zf-H2C2_2: Zinc-finge 98.1 8.7E-07 1.9E-11 49.1 0.7 25 123-149 2-26 (26)
22 COG5189 SFP1 Putative transcri 98.1 1.3E-06 2.9E-11 76.5 1.7 25 206-230 395-419 (423)
23 PHA00616 hypothetical protein 98.0 2.1E-06 4.6E-11 53.2 1.0 33 135-169 1-33 (44)
24 PHA00732 hypothetical protein 98.0 5.1E-06 1.1E-10 59.3 3.0 51 165-236 1-51 (79)
25 PHA00616 hypothetical protein 97.9 4.8E-06 1E-10 51.6 1.7 28 209-236 1-28 (44)
26 PF00096 zf-C2H2: Zinc finger, 97.9 5.4E-06 1.2E-10 44.5 1.1 23 210-232 1-23 (23)
27 PF05605 zf-Di19: Drought indu 97.8 3E-05 6.4E-10 51.3 3.7 53 165-234 2-54 (54)
28 PF13894 zf-C2H2_4: C2H2-type 97.6 3.7E-05 8E-10 41.4 1.5 24 210-233 1-24 (24)
29 PF05605 zf-Di19: Drought indu 97.5 5.2E-05 1.1E-09 50.2 2.2 47 209-258 2-53 (54)
30 PF00096 zf-C2H2: Zinc finger, 97.5 7.3E-05 1.6E-09 40.0 2.1 23 136-160 1-23 (23)
31 PF13912 zf-C2H2_6: C2H2-type 97.4 5.6E-05 1.2E-09 42.2 1.0 26 209-234 1-26 (27)
32 KOG1146|consensus 97.3 0.00038 8.3E-09 72.2 5.6 126 129-258 459-642 (1406)
33 PF12756 zf-C2H2_2: C2H2 type 97.2 0.00021 4.6E-09 53.5 1.8 73 137-232 1-73 (100)
34 smart00355 ZnF_C2H2 zinc finge 97.0 0.00064 1.4E-08 37.0 2.4 24 210-233 1-24 (26)
35 PF13894 zf-C2H2_4: C2H2-type 97.0 0.00066 1.4E-08 36.3 2.2 23 136-160 1-23 (24)
36 PF13912 zf-C2H2_6: C2H2-type 96.8 0.00089 1.9E-08 37.2 2.0 25 135-161 1-25 (27)
37 KOG2231|consensus 96.8 0.013 2.9E-07 57.8 10.9 44 137-188 184-233 (669)
38 PF12756 zf-C2H2_2: C2H2 type 96.7 0.00034 7.3E-09 52.4 -0.6 48 211-258 1-74 (100)
39 COG5048 FOG: Zn-finger [Genera 96.6 0.0013 2.8E-08 62.6 2.5 144 106-258 288-442 (467)
40 PF09237 GAGA: GAGA factor; I 96.3 0.0031 6.6E-08 40.1 2.1 32 205-236 20-51 (54)
41 COG5236 Uncharacterized conser 96.2 0.0057 1.2E-07 54.8 4.3 105 135-258 151-275 (493)
42 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0024 5.3E-08 34.3 1.2 24 210-234 1-24 (24)
43 PF12874 zf-met: Zinc-finger o 96.1 0.0023 5E-08 34.8 0.8 23 210-232 1-23 (25)
44 PF09237 GAGA: GAGA factor; I 95.7 0.012 2.5E-07 37.5 2.9 32 132-165 21-52 (54)
45 COG5189 SFP1 Putative transcri 95.7 0.0045 9.7E-08 54.8 1.2 23 163-187 396-418 (423)
46 smart00355 ZnF_C2H2 zinc finge 95.5 0.0067 1.4E-07 32.8 1.1 24 136-161 1-24 (26)
47 COG5048 FOG: Zn-finger [Genera 95.5 0.0055 1.2E-07 58.3 1.1 112 134-261 288-415 (467)
48 PF12874 zf-met: Zinc-finger o 95.5 0.011 2.4E-07 32.0 1.9 23 136-160 1-23 (25)
49 PF12171 zf-C2H2_jaz: Zinc-fin 95.1 0.0052 1.1E-07 34.1 -0.2 22 210-231 2-23 (27)
50 KOG2231|consensus 95.0 0.089 1.9E-06 52.2 7.8 100 108-233 116-236 (669)
51 PRK04860 hypothetical protein; 94.4 0.026 5.7E-07 46.1 2.2 40 134-181 118-157 (160)
52 PF13909 zf-H2C2_5: C2H2-type 94.1 0.034 7.3E-07 29.8 1.5 22 136-160 1-22 (24)
53 PF13913 zf-C2HC_2: zinc-finge 94.0 0.046 1E-06 29.7 1.9 21 210-231 3-23 (25)
54 PRK04860 hypothetical protein; 93.4 0.043 9.2E-07 44.9 1.6 37 208-248 118-157 (160)
55 PF12171 zf-C2H2_jaz: Zinc-fin 93.2 0.026 5.7E-07 31.2 0.1 22 136-159 2-23 (27)
56 KOG1146|consensus 92.5 0.066 1.4E-06 56.3 1.9 115 143-259 442-614 (1406)
57 KOG2893|consensus 92.5 0.049 1.1E-06 46.4 0.8 44 172-233 15-59 (341)
58 smart00451 ZnF_U1 U1-like zinc 92.1 0.093 2E-06 30.8 1.5 22 209-230 3-24 (35)
59 KOG4124|consensus 91.9 0.057 1.2E-06 48.6 0.5 66 163-228 347-417 (442)
60 KOG2482|consensus 91.7 0.2 4.3E-06 45.2 3.7 134 122-259 130-359 (423)
61 KOG2785|consensus 89.7 0.49 1.1E-05 43.5 4.4 49 209-257 166-243 (390)
62 KOG2482|consensus 89.7 0.21 4.5E-06 45.0 2.0 94 135-232 195-357 (423)
63 KOG4173|consensus 88.6 0.051 1.1E-06 45.3 -2.4 51 137-189 108-168 (253)
64 smart00451 ZnF_U1 U1-like zinc 87.2 0.43 9.4E-06 27.8 1.7 22 135-158 3-24 (35)
65 KOG4173|consensus 85.9 0.14 3.1E-06 42.7 -1.2 92 133-233 77-170 (253)
66 PF09986 DUF2225: Uncharacteri 84.3 0.18 4E-06 43.4 -1.4 41 207-247 3-61 (214)
67 COG4049 Uncharacterized protei 84.2 0.43 9.4E-06 31.0 0.6 30 205-234 13-42 (65)
68 COG5236 Uncharacterized conser 84.0 2.1 4.5E-05 38.9 5.0 117 120-260 166-307 (493)
69 PF09538 FYDLN_acid: Protein o 78.7 1.5 3.2E-05 33.4 1.9 15 208-222 25-39 (108)
70 KOG4377|consensus 78.6 1.5 3.3E-05 40.7 2.3 116 133-260 269-429 (480)
71 COG2888 Predicted Zn-ribbon RN 78.3 1.4 3E-05 29.2 1.4 16 50-65 21-36 (61)
72 PF07754 DUF1610: Domain of un 78.1 1.6 3.5E-05 23.4 1.4 14 51-64 11-24 (24)
73 PHA00626 hypothetical protein 77.3 2.1 4.5E-05 27.9 2.0 14 134-149 22-35 (59)
74 COG4049 Uncharacterized protei 77.0 1.3 2.7E-05 28.9 0.9 31 128-160 10-40 (65)
75 PF13717 zinc_ribbon_4: zinc-r 76.5 1.7 3.7E-05 25.8 1.4 34 166-220 3-36 (36)
76 TIGR00622 ssl1 transcription f 73.1 6.3 0.00014 30.0 4.0 45 137-187 57-101 (112)
77 KOG0696|consensus 72.6 4 8.8E-05 38.5 3.4 78 132-243 70-147 (683)
78 TIGR02098 MJ0042_CXXC MJ0042 f 71.9 2 4.3E-05 25.7 0.9 34 166-220 3-36 (38)
79 KOG2186|consensus 71.8 1.9 4E-05 37.5 1.0 47 210-258 4-52 (276)
80 KOG2893|consensus 70.9 1.6 3.5E-05 37.5 0.4 42 138-187 13-54 (341)
81 cd00350 rubredoxin_like Rubred 70.8 2.9 6.2E-05 24.2 1.4 24 136-175 2-25 (33)
82 KOG2186|consensus 69.6 3.4 7.4E-05 35.9 2.1 46 135-187 3-48 (276)
83 KOG4124|consensus 69.5 0.88 1.9E-05 41.3 -1.5 53 133-187 347-418 (442)
84 smart00614 ZnF_BED BED zinc fi 69.4 3.1 6.7E-05 26.6 1.5 25 209-233 18-48 (50)
85 PF10571 UPF0547: Uncharacteri 69.3 3.8 8.3E-05 22.4 1.6 10 236-245 16-25 (26)
86 PF13719 zinc_ribbon_5: zinc-r 68.2 3.2 6.9E-05 24.8 1.2 34 166-220 3-36 (37)
87 PRK00398 rpoP DNA-directed RNA 67.9 4.3 9.4E-05 25.4 1.9 9 55-63 2-10 (46)
88 PF05443 ROS_MUCR: ROS/MUCR tr 65.7 3.5 7.6E-05 32.5 1.4 26 208-236 71-96 (132)
89 cd00729 rubredoxin_SM Rubredox 62.9 5.2 0.00011 23.4 1.4 10 208-217 17-26 (34)
90 COG1996 RPC10 DNA-directed RNA 62.3 2.5 5.4E-05 27.0 -0.0 29 208-243 5-33 (49)
91 PF12013 DUF3505: Protein of u 61.9 5.4 0.00012 30.2 1.8 26 209-234 80-109 (109)
92 PF02892 zf-BED: BED zinc fing 61.5 8.3 0.00018 23.8 2.3 25 132-158 13-41 (45)
93 COG5151 SSL1 RNA polymerase II 61.2 12 0.00027 33.5 4.0 103 49-160 301-411 (421)
94 TIGR00373 conserved hypothetic 60.9 8.3 0.00018 31.5 2.8 34 161-219 105-138 (158)
95 PF15269 zf-C2H2_7: Zinc-finge 60.5 5.2 0.00011 24.8 1.1 23 209-231 20-42 (54)
96 PF05290 Baculo_IE-1: Baculovi 60.1 5.7 0.00012 31.1 1.6 57 161-248 76-135 (140)
97 smart00734 ZnF_Rad18 Rad18-lik 59.9 8 0.00017 21.1 1.7 20 210-230 2-21 (26)
98 TIGR02300 FYDLN_acid conserved 59.6 6.4 0.00014 30.6 1.8 14 208-221 25-38 (129)
99 KOG4377|consensus 58.0 5.1 0.00011 37.3 1.2 126 100-234 264-428 (480)
100 COG1198 PriA Primosomal protei 56.8 7.2 0.00016 39.9 2.1 8 56-63 435-442 (730)
101 PRK06266 transcription initiat 55.4 11 0.00023 31.5 2.6 19 163-183 115-133 (178)
102 PRK14892 putative transcriptio 54.7 13 0.00028 27.7 2.7 13 51-63 16-28 (99)
103 PRK14890 putative Zn-ribbon RN 53.6 9.5 0.00021 25.3 1.6 10 208-217 47-56 (59)
104 COG1198 PriA Primosomal protei 53.3 13 0.00029 38.0 3.4 56 98-176 426-484 (730)
105 smart00834 CxxC_CXXC_SSSS Puta 53.2 2.8 6.2E-05 25.3 -0.9 30 210-243 6-35 (41)
106 PF09538 FYDLN_acid: Protein o 52.8 7.5 0.00016 29.5 1.2 31 135-180 9-39 (108)
107 smart00531 TFIIE Transcription 51.7 15 0.00032 29.6 2.8 19 163-183 97-115 (147)
108 KOG4167|consensus 51.0 3.7 8E-05 41.1 -0.9 26 208-233 791-816 (907)
109 PRK05978 hypothetical protein; 50.5 8 0.00017 31.1 1.0 15 51-65 28-42 (148)
110 PF06524 NOA36: NOA36 protein; 49.9 1E+02 0.0022 27.3 7.6 24 237-260 212-235 (314)
111 COG1592 Rubrerythrin [Energy p 49.4 12 0.00026 30.7 2.0 23 165-216 134-156 (166)
112 PF09986 DUF2225: Uncharacteri 49.0 5 0.00011 34.6 -0.4 44 133-180 3-61 (214)
113 PRK14873 primosome assembly pr 47.5 19 0.00041 36.7 3.4 8 56-63 383-390 (665)
114 PF12013 DUF3505: Protein of u 46.4 15 0.00033 27.7 2.0 27 134-162 79-109 (109)
115 PF05129 Elf1: Transcription e 45.5 7.5 0.00016 27.8 0.2 12 52-63 18-29 (81)
116 TIGR00622 ssl1 transcription f 45.4 23 0.0005 27.0 2.8 31 208-241 80-110 (112)
117 PRK06266 transcription initiat 44.7 13 0.00028 31.1 1.5 31 106-147 116-146 (178)
118 COG4957 Predicted transcriptio 43.0 10 0.00022 29.7 0.6 24 210-236 77-100 (148)
119 KOG4167|consensus 42.4 6.6 0.00014 39.4 -0.6 27 133-161 790-816 (907)
120 TIGR00595 priA primosomal prot 42.3 26 0.00056 34.5 3.4 15 130-146 235-249 (505)
121 KOG2807|consensus 41.2 65 0.0014 29.4 5.3 95 52-174 272-374 (378)
122 COG4530 Uncharacterized protei 40.9 15 0.00032 27.7 1.1 15 207-221 24-38 (129)
123 PF12760 Zn_Tnp_IS1595: Transp 40.1 23 0.0005 22.1 1.8 8 56-63 18-25 (46)
124 PF04959 ARS2: Arsenite-resist 39.9 9.2 0.0002 32.9 -0.1 30 206-235 74-103 (214)
125 PF05443 ROS_MUCR: ROS/MUCR tr 39.0 23 0.00049 28.0 2.0 28 133-165 70-97 (132)
126 COG5188 PRP9 Splicing factor 3 38.5 43 0.00093 30.8 3.8 28 203-230 368-396 (470)
127 KOG2593|consensus 38.2 14 0.00031 34.8 0.9 38 161-217 124-161 (436)
128 COG1997 RPL43A Ribosomal prote 37.2 22 0.00048 25.6 1.5 15 133-149 51-65 (89)
129 KOG2785|consensus 36.3 37 0.0008 31.6 3.1 50 135-188 166-241 (390)
130 PF14353 CpXC: CpXC protein 35.4 4.4 9.5E-05 31.7 -2.6 46 210-257 2-61 (128)
131 KOG2636|consensus 34.9 26 0.00056 33.3 2.0 35 203-237 395-433 (497)
132 PRK00464 nrdR transcriptional 34.9 10 0.00022 30.8 -0.6 16 166-183 29-44 (154)
133 PF05290 Baculo_IE-1: Baculovi 34.7 34 0.00073 26.9 2.3 55 106-181 79-135 (140)
134 COG3478 Predicted nucleic-acid 32.8 31 0.00067 23.3 1.5 25 96-120 29-53 (68)
135 COG1655 Uncharacterized protei 32.8 8.6 0.00019 33.1 -1.3 28 207-234 17-44 (267)
136 PF14311 DUF4379: Domain of un 32.6 24 0.00053 22.9 1.1 28 209-240 28-55 (55)
137 PF08271 TF_Zn_Ribbon: TFIIB z 32.6 30 0.00064 21.2 1.4 7 57-63 1-7 (43)
138 KOG2593|consensus 32.1 25 0.00053 33.3 1.3 43 128-177 121-163 (436)
139 PRK05580 primosome assembly pr 31.8 50 0.0011 33.9 3.6 7 57-63 382-388 (679)
140 smart00661 RPOL9 RNA polymeras 31.7 41 0.00088 21.3 2.0 6 58-63 2-7 (52)
141 PF09723 Zn-ribbon_8: Zinc rib 30.8 38 0.00082 20.7 1.6 8 56-63 5-12 (42)
142 KOG1280|consensus 30.7 46 0.001 30.6 2.8 106 103-241 4-116 (381)
143 COG2331 Uncharacterized protei 30.7 14 0.00031 25.8 -0.3 34 209-246 12-45 (82)
144 PF09845 DUF2072: Zn-ribbon co 30.6 33 0.00071 26.9 1.6 15 165-181 1-15 (131)
145 COG5349 Uncharacterized protei 29.9 22 0.00047 27.4 0.5 31 49-79 14-44 (126)
146 PF02176 zf-TRAF: TRAF-type zi 29.7 42 0.0009 21.9 1.9 45 133-180 7-55 (60)
147 PF10013 DUF2256: Uncharacteri 29.0 13 0.00028 22.8 -0.7 15 211-225 10-24 (42)
148 TIGR00244 transcriptional regu 28.8 17 0.00037 29.1 -0.2 8 58-65 2-9 (147)
149 PRK00464 nrdR transcriptional 28.5 34 0.00074 27.8 1.5 17 135-153 28-44 (154)
150 COG1779 C4-type Zn-finger prot 28.3 11 0.00024 31.7 -1.4 37 209-251 14-60 (201)
151 TIGR00373 conserved hypothetic 28.3 30 0.00066 28.2 1.2 38 128-178 102-139 (158)
152 smart00531 TFIIE Transcription 28.2 68 0.0015 25.7 3.2 40 131-178 95-134 (147)
153 KOG3002|consensus 28.1 30 0.00064 31.5 1.1 79 105-188 78-160 (299)
154 PRK09678 DNA-binding transcrip 27.5 35 0.00075 23.8 1.2 19 163-181 25-43 (72)
155 KOG3408|consensus 27.2 33 0.00072 26.4 1.1 26 206-231 54-79 (129)
156 PF08209 Sgf11: Sgf11 (transcr 27.1 12 0.00025 21.8 -1.1 18 55-72 3-20 (33)
157 COG1592 Rubrerythrin [Energy p 27.0 40 0.00087 27.7 1.6 25 134-175 133-157 (166)
158 KOG2071|consensus 26.7 39 0.00084 33.4 1.7 29 207-235 416-444 (579)
159 smart00154 ZnF_AN1 AN1-like Zi 26.2 27 0.00058 21.1 0.4 14 209-222 12-25 (39)
160 PF04959 ARS2: Arsenite-resist 26.1 56 0.0012 28.1 2.4 25 133-159 75-99 (214)
161 KOG3214|consensus 25.5 27 0.00058 25.8 0.3 15 134-150 46-60 (109)
162 TIGR00595 priA primosomal prot 25.3 53 0.0011 32.3 2.4 19 157-177 232-250 (505)
163 PF03811 Zn_Tnp_IS1: InsA N-te 25.2 65 0.0014 19.1 1.9 11 55-65 4-14 (36)
164 PRK14873 primosome assembly pr 24.7 56 0.0012 33.4 2.5 10 209-218 422-431 (665)
165 PF13240 zinc_ribbon_2: zinc-r 24.5 35 0.00077 17.9 0.6 6 237-242 16-21 (23)
166 PF03604 DNA_RNApol_7kD: DNA d 24.1 69 0.0015 18.4 1.8 7 57-63 1-7 (32)
167 PF11931 DUF3449: Domain of un 22.4 29 0.00063 29.4 0.0 34 204-237 96-133 (196)
168 PF07975 C1_4: TFIIH C1-like d 21.9 58 0.0013 21.0 1.3 22 164-187 20-41 (51)
169 KOG0717|consensus 21.2 45 0.00097 32.0 1.0 21 210-230 293-313 (508)
170 PF09855 DUF2082: Nucleic-acid 21.2 55 0.0012 22.3 1.1 10 57-66 1-10 (64)
171 TIGR02605 CxxC_CxxC_SSSS putat 20.7 37 0.00081 21.6 0.3 30 135-175 5-34 (52)
172 COG1571 Predicted DNA-binding 20.6 58 0.0013 31.0 1.6 32 211-250 352-383 (421)
173 cd00924 Cyt_c_Oxidase_Vb Cytoc 20.5 41 0.0009 25.0 0.5 18 203-221 74-91 (97)
174 COG3357 Predicted transcriptio 20.1 46 0.00099 24.2 0.6 14 164-179 57-70 (97)
175 TIGR01206 lysW lysine biosynth 20.1 28 0.0006 22.8 -0.5 10 210-219 23-32 (54)
No 1
>KOG2462|consensus
Probab=99.93 E-value=1.5e-26 Score=196.88 Aligned_cols=137 Identities=30% Similarity=0.458 Sum_probs=126.8
Q ss_pred ccccCccccccccccccccccccccccccCC---CCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccC
Q psy13347 102 AEVNNKIRLSSYFYPALISTLFSCSLRTADR---KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFT 178 (339)
Q Consensus 102 ~~~~~~~~c~~c~~~~~~~~~l~~h~~~h~~---~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~ 178 (339)
......+.|..|+........|.+|.++|-. ++.+.|.+ |+|.|.....|++|+|+|+ -+++|. +|||.|.
T Consensus 125 ~~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~--C~K~YvSmpALkMHirTH~--l~c~C~--iCGKaFS 198 (279)
T KOG2462|consen 125 AAKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKY--CGKVYVSMPALKMHIRTHT--LPCECG--ICGKAFS 198 (279)
T ss_pred cccCCceeccccccccccccccchhhcccccccccccccCCC--CCceeeehHHHhhHhhccC--CCcccc--ccccccc
Confidence 3356788999999999999999999999954 68899977 9999999999999999997 679999 9999999
Q ss_pred ChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcCCC---CCccccccCCccccccccc
Q psy13347 179 RSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKTR---CTSLVMNFSDNTVVTEDSK 255 (339)
Q Consensus 179 ~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~---C~~C~~~f~~~~~l~~H~~ 255 (339)
+...|.-|+| +|+|||||.|..|+|.|..+++|+.||++|.+.+ |..|+|.|..++.|.+|..
T Consensus 199 RPWLLQGHiR--------------THTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 199 RPWLLQGHIR--------------THTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred chHHhhcccc--------------cccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 9999999986 8999999999999999999999999999999888 9999999999999999987
Q ss_pred ccc
Q psy13347 256 LEL 258 (339)
Q Consensus 256 ~~~ 258 (339)
..-
T Consensus 265 S~C 267 (279)
T KOG2462|consen 265 SAC 267 (279)
T ss_pred hcc
Confidence 654
No 2
>KOG2462|consensus
Probab=99.91 E-value=6.2e-25 Score=187.12 Aligned_cols=107 Identities=29% Similarity=0.504 Sum_probs=100.8
Q ss_pred ccCccccccccccccccccccccccccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHH
Q psy13347 104 VNNKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDEL 183 (339)
Q Consensus 104 ~~~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L 183 (339)
...-+.|..|++.++....|+.|+++|+ -+++|.+ |||.|.+.+.|+-|+|+|+|||||.|+ .|+|.|..+++|
T Consensus 158 s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~i--CGKaFSRPWLLQGHiRTHTGEKPF~C~--hC~kAFADRSNL 231 (279)
T KOG2462|consen 158 SKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGI--CGKAFSRPWLLQGHIRTHTGEKPFSCP--HCGKAFADRSNL 231 (279)
T ss_pred ccccccCCCCCceeeehHHHhhHhhccC--CCccccc--ccccccchHHhhcccccccCCCCccCC--cccchhcchHHH
Confidence 3566789999999999999999999996 7899988 999999999999999999999999999 999999999999
Q ss_pred HhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHH
Q psy13347 184 QVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 184 ~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~ 230 (339)
+.||. +|.+.|+|+|..|+|.|..++.|.+|..
T Consensus 232 RAHmQ--------------THS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 232 RAHMQ--------------THSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred HHHHH--------------hhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 99996 7888999999999999999999999975
No 3
>KOG1074|consensus
Probab=99.76 E-value=2.9e-19 Score=171.63 Aligned_cols=66 Identities=20% Similarity=0.474 Sum_probs=58.2
Q ss_pred hccccceecCCCCcCCCccccccCChhHHHHHHHhhcCCC-------CC---ccccccCCccccccccccccCCCC
Q psy13347 197 VRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKTR-------CT---SLVMNFSDNTVVTEDSKLELSGGS 262 (339)
Q Consensus 197 ~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~-------C~---~C~~~f~~~~~l~~H~~~~~~~~~ 262 (339)
++-|.++|+|||||+|.+||+.|.++.+|+.||-+|.... |+ +|-+.|...-.|..|++.|..+..
T Consensus 621 LqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~ 696 (958)
T KOG1074|consen 621 LQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQI 696 (958)
T ss_pred hhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccceEEeecCCCC
Confidence 4445569999999999999999999999999999997543 99 999999999999999999985543
No 4
>KOG3608|consensus
Probab=99.73 E-value=4.8e-19 Score=154.93 Aligned_cols=169 Identities=18% Similarity=0.345 Sum_probs=140.9
Q ss_pred CCCCccceeCCCCCCCCccccccccccCCCCCCchhHHHHHHhhhhhcccccccccCccccccccccccccccccccccc
Q psy13347 50 PRSRRVACTCPNCTEGGEREMFSYMLHGCLWGNTMEYALAARVLKLKIYKTTAEVNNKIRLSSYFYPALISTLFSCSLRT 129 (339)
Q Consensus 50 ~~~~~~~~~C~~C~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~h~~~ 129 (339)
..+..+..-||.|++.+.++. .+.-++.+......++|+|..|++.|.....|+.|++.
T Consensus 201 ~Hs~eKvvACp~Cg~~F~~~t---------------------kl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~r 259 (467)
T KOG3608|consen 201 THSNEKVVACPHCGELFRTKT---------------------KLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVR 259 (467)
T ss_pred hcCCCeEEecchHHHHhcccc---------------------HHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHH
Confidence 345667777888885443322 22224455666677799999999999999999999999
Q ss_pred cCCCCeEEeeccccccccCCchhHHHHHh-hhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCC
Q psy13347 130 ADRKKVHICHIEGCNKVYGKTSHLRAHLR-WHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREK 208 (339)
Q Consensus 130 h~~~~~~~C~~~~C~k~F~~~~~L~~H~~-~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k 208 (339)
| ..-|+|++ |+......+.|.+|++ .|...|||+|+ .|.+.|.+.+.|.+|.. +|+ +-
T Consensus 260 H--vn~ykCpl--Cdmtc~~~ssL~~H~r~rHs~dkpfKCd--~Cd~~c~~esdL~kH~~--------------~HS-~~ 318 (467)
T KOG3608|consen 260 H--VNCYKCPL--CDMTCSSASSLTTHIRYRHSKDKPFKCD--ECDTRCVRESDLAKHVQ--------------VHS-KT 318 (467)
T ss_pred h--hhcccccc--cccCCCChHHHHHHHHhhhccCCCcccc--chhhhhccHHHHHHHHH--------------hcc-cc
Confidence 9 56799988 9999999999999999 59999999999 99999999999999985 444 56
Q ss_pred CcCCCc--cccccCChhHHHHHHHhhc-CCC-----CCccccccCCccccccccccccCC
Q psy13347 209 RFQCSE--CHKKFMRSDHLQKHIKTHS-KTR-----CTSLVMNFSDNTVVTEDSKLELSG 260 (339)
Q Consensus 209 ~~~C~~--C~k~F~~~~~L~~H~~~h~-~~~-----C~~C~~~f~~~~~l~~H~~~~~~~ 260 (339)
.|.|+. |..+|++...|++|++.++ |.. |..|++.|++-.+|..|+...|.-
T Consensus 319 ~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f 378 (467)
T KOG3608|consen 319 VYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGF 378 (467)
T ss_pred ceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcc
Confidence 699988 9999999999999998766 443 999999999999999998877753
No 5
>KOG3576|consensus
Probab=99.69 E-value=4.5e-18 Score=138.21 Aligned_cols=110 Identities=26% Similarity=0.502 Sum_probs=65.3
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCC
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQC 212 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C 212 (339)
...|.|.+ |+|.|....-|.+|++.|...+.|.|. .||+.|....+|++|++ +|+|.+||+|
T Consensus 115 ~d~ftCrv--CgK~F~lQRmlnrh~kch~~vkr~lct--~cgkgfndtfdlkrh~r--------------thtgvrpykc 176 (267)
T KOG3576|consen 115 QDSFTCRV--CGKKFGLQRMLNRHLKCHSDVKRHLCT--FCGKGFNDTFDLKRHTR--------------THTGVRPYKC 176 (267)
T ss_pred CCeeeeeh--hhhhhhHHHHHHHHhhhccHHHHHHHh--hccCcccchhhhhhhhc--------------cccCccccch
Confidence 44566655 666666666666666666666666666 66666666666666654 5556666666
Q ss_pred CccccccCChhHHHHHHHhhcCCC--------------CCccccccCCccccccccccccCC
Q psy13347 213 SECHKKFMRSDHLQKHIKTHSKTR--------------CTSLVMNFSDNTVVTEDSKLELSG 260 (339)
Q Consensus 213 ~~C~k~F~~~~~L~~H~~~h~~~~--------------C~~C~~~f~~~~~l~~H~~~~~~~ 260 (339)
..|+|.|..+-.|..|++.-+|.. |+.||..-.....+..|++.||..
T Consensus 177 ~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 177 SLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred hhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 666666666666666655333321 666666666666666666666543
No 6
>KOG3608|consensus
Probab=99.68 E-value=8e-18 Score=147.36 Aligned_cols=187 Identities=21% Similarity=0.310 Sum_probs=133.3
Q ss_pred ccccccCCCCCCchhHHHHHHhhhhhccc-----cccccc-C--ccccccccccccccccccccccccCCCCeEEeeccc
Q psy13347 71 FSYMLHGCLWGNTMEYALAARVLKLKIYK-----TTAEVN-N--KIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEG 142 (339)
Q Consensus 71 ~~~~~~~C~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~--~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~ 142 (339)
+.|.=..|...|.......+++-+...+. .....+ . .....-|...+.++..|+.|++.|+++|...|+.
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~-- 212 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH-- 212 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch--
Confidence 33444456666666666666555544441 111111 1 2233566777777778888888888888888855
Q ss_pred cccccCCchhHHHHHhhh--CCCCceecCccccCCccCChHHHHhhhcccCcc-------------cchhccccceecCC
Q psy13347 143 CNKVYGKTSHLRAHLRWH--TGERPFVCTWQFCGKRFTRSDELQVNGGRFGIE-------------RTVVRILPMTWCRE 207 (339)
Q Consensus 143 C~k~F~~~~~L~~H~~~H--~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~-------------~~~~~~~~~~h~~~ 207 (339)
||..|.++..|..|++.. ....+|.|. .|.|.|.....|+.|+.+|-.- ..+.++.+..|...
T Consensus 213 Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~--~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~d 290 (467)
T KOG3608|consen 213 CGELFRTKTKLFDHLRRQTELNTNSFQCA--QCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSKD 290 (467)
T ss_pred HHHHhccccHHHHHHHhhhhhcCCchHHH--HHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhccC
Confidence 888888888887776643 345677777 7777777777777777665322 12234455688889
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCC--CCc--cccccCCccccccccccccCCC
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTR--CTS--LVMNFSDNTVVTEDSKLELSGG 261 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~--C~~--C~~~f~~~~~l~~H~~~~~~~~ 261 (339)
|||+|+.|++.|.+.+.|.+|..+|.... |.. |.++|.+...|++|++.+|.+.
T Consensus 291 kpfKCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~ 348 (467)
T KOG3608|consen 291 KPFKCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGN 348 (467)
T ss_pred CCccccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccCC
Confidence 99999999999999999999999998554 988 9999999999999999999665
No 7
>KOG3576|consensus
Probab=99.66 E-value=8.7e-18 Score=136.55 Aligned_cols=133 Identities=23% Similarity=0.351 Sum_probs=112.9
Q ss_pred hcccccccccCccccccccccccccccccccccccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCC
Q psy13347 96 KIYKTTAEVNNKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGK 175 (339)
Q Consensus 96 ~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~ 175 (339)
+.-.........+.|++|++.|..+..|.+|++-|...+.|.|.+ |||.|....+|++|+|+|+|.+||+|. .|++
T Consensus 106 k~t~gsssd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~--~c~k 181 (267)
T KOG3576|consen 106 KSTIGSSSDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCS--LCEK 181 (267)
T ss_pred cccccCCCCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchh--hhhH
Confidence 444455556788999999999999999999999999999999988 999999999999999999999999999 9999
Q ss_pred ccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcCC
Q psy13347 176 RFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKT 235 (339)
Q Consensus 176 ~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~ 235 (339)
.|.+.-.|..|++.-++...-. .+....+|.|.|+.||..-.+...+..|++.|+..
T Consensus 182 aftqrcsleshl~kvhgv~~~y---aykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 182 AFTQRCSLESHLKKVHGVQHQY---AYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHhhccHHHHHHHHcCchHHH---HHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 9999999999986543322111 11222468899999999999999999999998753
No 8
>KOG1074|consensus
Probab=99.61 E-value=2.1e-16 Score=152.21 Aligned_cols=57 Identities=19% Similarity=0.297 Sum_probs=51.8
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCC---CCccccccCCccccccccccccCCCCCC
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTR---CTSLVMNFSDNTVVTEDSKLELSGGSTG 264 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~---C~~C~~~f~~~~~l~~H~~~~~~~~~~~ 264 (339)
-|-+|-+|-+...-++.|+.|.|+|+|++ |.+|++.|.++.+|+.||..|...-...
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R 663 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPAR 663 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCcccc
Confidence 36789999999999999999999999999 9999999999999999999997654333
No 9
>KOG3623|consensus
Probab=99.49 E-value=5.8e-15 Score=140.11 Aligned_cols=80 Identities=33% Similarity=0.675 Sum_probs=74.9
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCC
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQC 212 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C 212 (339)
+.+|.|+. |+|+|...++|.+|.--|+|.+||+|. +|.|.|+.+..|..|+| .|.|+|||+|
T Consensus 892 ~gmyaCDq--CDK~FqKqSSLaRHKYEHsGqRPyqC~--iCkKAFKHKHHLtEHkR--------------LHSGEKPfQC 953 (1007)
T KOG3623|consen 892 DGMYACDQ--CDKAFQKQSSLARHKYEHSGQRPYQCI--ICKKAFKHKHHLTEHKR--------------LHSGEKPFQC 953 (1007)
T ss_pred cccchHHH--HHHHHHhhHHHHHhhhhhcCCCCcccc--hhhHhhhhhhhhhhhhh--------------hccCCCcchh
Confidence 56799977 999999999999999999999999999 99999999999999985 8999999999
Q ss_pred CccccccCChhHHHHHHH
Q psy13347 213 SECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 213 ~~C~k~F~~~~~L~~H~~ 230 (339)
+.|+|+|.....+..||.
T Consensus 954 dKClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 954 DKCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred hhhhhhcccccchHhhhc
Confidence 999999999999999986
No 10
>KOG3623|consensus
Probab=99.40 E-value=8.5e-14 Score=132.35 Aligned_cols=119 Identities=23% Similarity=0.487 Sum_probs=97.6
Q ss_pred eEEeeccccccccCCchhHHHHHhh-h-CCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCC
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRW-H-TGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQC 212 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~-H-~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C 212 (339)
...|.+ |.+.+.+...|+.|++. | ..+..|.|. .|.++|.++..|.+||..|.....-.+.+ ..-...|.|+|
T Consensus 210 lltcpy--cdrgykrltslkeHikyrhekne~nfsC~--lCsytFAyRtQLErhm~~hkpg~dqa~sl-tqsa~lRKFKC 284 (1007)
T KOG3623|consen 210 LLTCPY--CDRGYKRLTSLKEHIKYRHEKNEPNFSCM--LCSYTFAYRTQLERHMQLHKPGGDQAISL-TQSALLRKFKC 284 (1007)
T ss_pred hhcchh--HHHHHHHHHHHHHHHHHHHhhCCCCCcch--hhhhhhhhHHHHHHHHHhhcCCCcccccc-cchhhhccccc
Confidence 467977 99999999999999884 4 345679999 99999999999999998775433211111 11123578999
Q ss_pred CccccccCChhHHHHHHHhhcCCC---CCccccccCCcccccccccccc
Q psy13347 213 SECHKKFMRSDHLQKHIKTHSKTR---CTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 213 ~~C~k~F~~~~~L~~H~~~h~~~~---C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
..|||.|+.+.+|+.|+|+|.|++ |+.|+|+|+....+..||...-
T Consensus 285 tECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSSKK 333 (1007)
T KOG3623|consen 285 TECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSSKK 333 (1007)
T ss_pred cccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccccc
Confidence 999999999999999999999999 9999999999999999987543
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.19 E-value=3.2e-11 Score=115.53 Aligned_cols=110 Identities=15% Similarity=0.162 Sum_probs=47.9
Q ss_pred ccceeCCCCCCCCccccccccccCCCCCCchhHHHHHHhhhhhcccccccccCccccccccccccccccccccccccCCC
Q psy13347 54 RVACTCPNCTEGGEREMFSYMLHGCLWGNTMEYALAARVLKLKIYKTTAEVNNKIRLSSYFYPALISTLFSCSLRTADRK 133 (339)
Q Consensus 54 ~~~~~C~~C~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~h~~~h~~~ 133 (339)
.....|++|........+..+.-.|.+.....-. --.-.++ .+..+.+|+.|..|+..|. ...|..|++.|+
T Consensus 405 ~~~V~C~NC~~~i~l~~l~lHe~~C~r~~V~Cp~----~~Cg~v~-~r~el~~H~~C~~Cgk~f~-~s~LekH~~~~H-- 476 (567)
T PLN03086 405 VDTVECRNCKHYIPSRSIALHEAYCSRHNVVCPH----DGCGIVL-RVEEAKNHVHCEKCGQAFQ-QGEMEKHMKVFH-- 476 (567)
T ss_pred CCeEECCCCCCccchhHHHHHHhhCCCcceeCCc----cccccee-eccccccCccCCCCCCccc-hHHHHHHHHhcC--
Confidence 3446899999654444433333333321100000 0000222 3334444555555554443 334455554442
Q ss_pred CeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCcc
Q psy13347 134 KVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRF 177 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f 177 (339)
+++.| + |++.| .+..|..|+++|...+++.|. .|++.|
T Consensus 477 kpv~C--p-Cg~~~-~R~~L~~H~~thCp~Kpi~C~--fC~~~v 514 (567)
T PLN03086 477 EPLQC--P-CGVVL-EKEQMVQHQASTCPLRLITCR--FCGDMV 514 (567)
T ss_pred CCccC--C-CCCCc-chhHHHhhhhccCCCCceeCC--CCCCcc
Confidence 44555 3 55433 334455555555555555554 455544
No 12
>PHA00733 hypothetical protein
Probab=99.02 E-value=1.7e-10 Score=90.76 Aligned_cols=85 Identities=21% Similarity=0.316 Sum_probs=65.1
Q ss_pred cCCCCeEEeeccccccccCCchhHHHH--Hh---hhCCCCceecCccccCCccCChHHHHhhhcccCcccchhcccccee
Q psy13347 130 ADRKKVHICHIEGCNKVYGKTSHLRAH--LR---WHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTW 204 (339)
Q Consensus 130 h~~~~~~~C~~~~C~k~F~~~~~L~~H--~~---~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h 204 (339)
....+++.|.+ |.+.|.....|..| ++ .+.+.++|.|+ .|++.|.....|..|++. |
T Consensus 35 ~~~~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~--~Cgk~Fss~s~L~~H~r~--------------h 96 (128)
T PHA00733 35 TPEQKRLIRAV--VKTLIYNPQLLDESSYLYKLLTSKAVSPYVCP--LCLMPFSSSVSLKQHIRY--------------T 96 (128)
T ss_pred ChhhhhHHHHH--HhhhccChhhhcchHHHHhhcccCCCCCccCC--CCCCcCCCHHHHHHHHhc--------------C
Confidence 33467888977 99888887766655 22 23447889998 999999999999988752 2
Q ss_pred cCCCCcCCCccccccCChhHHHHHHHhhcC
Q psy13347 205 CREKRFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 205 ~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
..+|.|.+|++.|.....|..|+...++
T Consensus 97 --~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 97 --EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred --CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 2468999999999999999999887665
No 13
>PHA00733 hypothetical protein
Probab=98.96 E-value=2.1e-10 Score=90.24 Aligned_cols=88 Identities=14% Similarity=0.161 Sum_probs=71.4
Q ss_pred CCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcCC-CCCcc
Q psy13347 162 GERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKT-RCTSL 240 (339)
Q Consensus 162 ~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~-~C~~C 240 (339)
..+++.|. +|.+.|.....|..|.- ++.++ .+.+.+||.|+.|++.|.+...|..|++.|... .|..|
T Consensus 37 ~~~~~~~~--~~~~~~~~~~~l~~~~~--------l~~~~-~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~~~~~C~~C 105 (128)
T PHA00733 37 EQKRLIRA--VVKTLIYNPQLLDESSY--------LYKLL-TSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKVCPVC 105 (128)
T ss_pred hhhhHHHH--HHhhhccChhhhcchHH--------HHhhc-ccCCCCCccCCCCCCcCCCHHHHHHHHhcCCcCccCCCC
Confidence 35789999 99999999888877621 12222 333578999999999999999999999976433 39999
Q ss_pred ccccCCccccccccccccCC
Q psy13347 241 VMNFSDNTVVTEDSKLELSG 260 (339)
Q Consensus 241 ~~~f~~~~~l~~H~~~~~~~ 260 (339)
++.|.....|..|+...|+.
T Consensus 106 gK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 106 GKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred CCccCCHHHHHHHHHHhcCc
Confidence 99999999999999988854
No 14
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.95 E-value=1.6e-09 Score=104.12 Aligned_cols=119 Identities=13% Similarity=0.288 Sum_probs=89.7
Q ss_pred cCcccccc--ccccccccccccccccccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHH
Q psy13347 105 NNKIRLSS--YFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDE 182 (339)
Q Consensus 105 ~~~~~c~~--c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~ 182 (339)
.+...|.. |+..| +...-++.+.|.+ |++.|. ...|..|++.|+ ++|.|+ |++.+ .+..
T Consensus 431 r~~V~Cp~~~Cg~v~----------~r~el~~H~~C~~--Cgk~f~-~s~LekH~~~~H--kpv~Cp---Cg~~~-~R~~ 491 (567)
T PLN03086 431 RHNVVCPHDGCGIVL----------RVEEAKNHVHCEK--CGQAFQ-QGEMEKHMKVFH--EPLQCP---CGVVL-EKEQ 491 (567)
T ss_pred CcceeCCccccccee----------eccccccCccCCC--CCCccc-hHHHHHHHHhcC--CCccCC---CCCCc-chhH
Confidence 34555663 66665 2222244469977 999996 688999999975 789995 99765 5688
Q ss_pred HHhhhcccCcccchhccccceecCCCCcCCCccccccCC----------hhHHHHHHHhhcCCC---CCccccccCCccc
Q psy13347 183 LQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMR----------SDHLQKHIKTHSKTR---CTSLVMNFSDNTV 249 (339)
Q Consensus 183 L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~----------~~~L~~H~~~h~~~~---C~~C~~~f~~~~~ 249 (339)
|..|+. .|...+++.|.+|++.|.. ...|..|+..+ +.+ |..|++.|..+ .
T Consensus 492 L~~H~~--------------thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-d 555 (567)
T PLN03086 492 MVQHQA--------------STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-E 555 (567)
T ss_pred HHhhhh--------------ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-h
Confidence 999974 7888999999999999952 35899999986 444 99999988654 6
Q ss_pred ccccccccc
Q psy13347 250 VTEDSKLEL 258 (339)
Q Consensus 250 l~~H~~~~~ 258 (339)
|..|+..-|
T Consensus 556 m~~H~~~~h 564 (567)
T PLN03086 556 MDIHQIAVH 564 (567)
T ss_pred HHHHHHHhh
Confidence 677766554
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.91 E-value=9e-10 Score=71.62 Aligned_cols=44 Identities=11% Similarity=0.332 Sum_probs=40.4
Q ss_pred CcCCCccccccCChhHHHHHHHhhc-CCCCCccccccCCcccccc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHS-KTRCTSLVMNFSDNTVVTE 252 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~-~~~C~~C~~~f~~~~~l~~ 252 (339)
.|.|+.||+.|.+..+|..|+++|+ +.+|..|++.|...+.|..
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~k~~kc~~C~k~f~~~s~l~~ 49 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHNTNLKLSNCKRISLRTGEYIE 49 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcCCcccCCcccceecccceeEE
Confidence 4999999999999999999999999 6669999999998888764
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.63 E-value=1.1e-08 Score=66.52 Aligned_cols=44 Identities=27% Similarity=0.440 Sum_probs=39.6
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHH
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQ 184 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~ 184 (339)
.|.|+. ||+.|.+..+|..|+++|+ ++|+|. .|++.|...+.|.
T Consensus 5 ~y~C~~--CGK~Fs~~~~L~~H~r~H~--k~~kc~--~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPI--CGEIYIKRKSMITHLRKHN--TNLKLS--NCKRISLRTGEYI 48 (55)
T ss_pred ccCcch--hCCeeccHHHHHHHHHhcC--CcccCC--cccceecccceeE
Confidence 489966 9999999999999999998 799999 9999999877663
No 17
>KOG3993|consensus
Probab=98.46 E-value=2.4e-08 Score=90.46 Aligned_cols=154 Identities=16% Similarity=0.114 Sum_probs=107.9
Q ss_pred cccCccccccccccccccccccccccccCCCCeEEeeccccccccCCchhHHHHHhhhCC--------C-----------
Q psy13347 103 EVNNKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTG--------E----------- 163 (339)
Q Consensus 103 ~~~~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~--------~----------- 163 (339)
+....|.|..|-..+..-..|-.|.-.---.-.|+| ++|+|+|.-..+|..|.|+|.- .
T Consensus 263 n~iGdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrC--PEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra 340 (500)
T KOG3993|consen 263 NVIGDYICQLCKEKYEDAFALAQHRCPRIVHVEYRC--PECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA 340 (500)
T ss_pred ccHHHHHHHHHHHhhhhHHHHhhccCCeeEEeeecC--CcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh
Confidence 344567899998888888888888633333456999 6699999999999999999932 1
Q ss_pred --------------CceecCccccCCccCChHHHHhhhcccCcccchh---------cccccee---------------c
Q psy13347 164 --------------RPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVV---------RILPMTW---------------C 205 (339)
Q Consensus 164 --------------k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~---------~~~~~~h---------------~ 205 (339)
--|.|. +|+|.|.+...|+.|+.+|+...... +.....| .
T Consensus 341 e~~ea~rsg~dss~gi~~C~--~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g 418 (500)
T KOG3993|consen 341 EVQEAERSGDDSSSGIFSCH--TCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHG 418 (500)
T ss_pred hhhhccccCCcccCceeecH--HhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccc
Confidence 138999 99999999999999977665332211 0000011 0
Q ss_pred C--------CCCcCCCccccccCChhHHHHHHHhhc---CCCCCccccccCCccccccccccccCC
Q psy13347 206 R--------EKRFQCSECHKKFMRSDHLQKHIKTHS---KTRCTSLVMNFSDNTVVTEDSKLELSG 260 (339)
Q Consensus 206 ~--------~k~~~C~~C~k~F~~~~~L~~H~~~h~---~~~C~~C~~~f~~~~~l~~H~~~~~~~ 260 (339)
. .....|+.|+..+..+..--.|.+.-. +..|.+|.-.|.+...|.+|+...|..
T Consensus 419 ~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 419 DEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred cceeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 0 122457778877777765555554332 222999999999999999998887743
No 18
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.45 E-value=1e-07 Score=52.95 Aligned_cols=25 Identities=56% Similarity=1.305 Sum_probs=19.4
Q ss_pred hHHHHHhhhCCCCceecCccccCCccC
Q psy13347 152 HLRAHLRWHTGERPFVCTWQFCGKRFT 178 (339)
Q Consensus 152 ~L~~H~~~H~~~k~~~C~~~~C~~~f~ 178 (339)
+|.+|+++|++++||.|+ .|++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~--~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCP--YCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEES--SSSEEES
T ss_pred CHHHHhhhcCCCCCCCCC--CCcCeeC
Confidence 477788888888888887 8887775
No 19
>PHA00732 hypothetical protein
Probab=98.39 E-value=2.1e-07 Score=66.48 Aligned_cols=47 Identities=17% Similarity=0.349 Sum_probs=41.2
Q ss_pred CcCCCccccccCChhHHHHHHH-hhcCCCCCccccccCCcccccccccccc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIK-THSKTRCTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~-~h~~~~C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
||.|+.|++.|.+...|..|++ .|++..|+.|++.|. .|..|.++..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTLTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCCCccCCCCCEeC---ChhhhhcccC
Confidence 5899999999999999999998 577777999999998 5888886553
No 20
>KOG3993|consensus
Probab=98.27 E-value=1.6e-07 Score=85.10 Aligned_cols=96 Identities=24% Similarity=0.354 Sum_probs=70.6
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchh---------c-------
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVV---------R------- 198 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~---------~------- 198 (339)
-|.|.+ |...|.....|.+|.-.-.-..-|+|+ .|+|.|.-..+|..|.|.|.-....- .
T Consensus 267 dyiCqL--CK~kYeD~F~LAQHrC~RIV~vEYrCP--EC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~ 342 (500)
T KOG3993|consen 267 DYICQL--CKEKYEDAFALAQHRCPRIVHVEYRCP--ECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEV 342 (500)
T ss_pred HHHHHH--HHHhhhhHHHHhhccCCeeEEeeecCC--cccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhh
Confidence 489999 999999999999996432222359999 99999999999999998774322111 0
Q ss_pred -cccc--eecCCCCcCCCccccccCChhHHHHHHHhhcC
Q psy13347 199 -ILPM--TWCREKRFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 199 -~~~~--~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
.-.+ ....+-.|.|.+|+|.|++...|+.|+.+|+.
T Consensus 343 ~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~ 381 (500)
T KOG3993|consen 343 QEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQR 381 (500)
T ss_pred hhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhc
Confidence 0000 00223469999999999999999999888874
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.11 E-value=8.7e-07 Score=49.14 Aligned_cols=25 Identities=24% Similarity=0.481 Sum_probs=22.7
Q ss_pred ccccccccCCCCeEEeeccccccccCC
Q psy13347 123 FSCSLRTADRKKVHICHIEGCNKVYGK 149 (339)
Q Consensus 123 l~~h~~~h~~~~~~~C~~~~C~k~F~~ 149 (339)
|.+|+++|+++++|.|++ |++.|.+
T Consensus 2 l~~H~~~H~~~k~~~C~~--C~k~F~~ 26 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPY--CGKSFSN 26 (26)
T ss_dssp HHHHHHHHSSSSSEEESS--SSEEESS
T ss_pred HHHHhhhcCCCCCCCCCC--CcCeeCc
Confidence 788999999999999977 9999964
No 22
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=98.08 E-value=1.3e-06 Score=76.48 Aligned_cols=25 Identities=28% Similarity=0.731 Sum_probs=21.0
Q ss_pred CCCCcCCCccccccCChhHHHHHHH
Q psy13347 206 REKRFQCSECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 206 ~~k~~~C~~C~k~F~~~~~L~~H~~ 230 (339)
..|||.|++|+|+|+....|+.|+.
T Consensus 395 ~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 395 KDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCCceeccccchhhccCccceeccc
Confidence 3689999999999999888888864
No 23
>PHA00616 hypothetical protein
Probab=98.00 E-value=2.1e-06 Score=53.15 Aligned_cols=33 Identities=15% Similarity=0.292 Sum_probs=30.6
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecC
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCT 169 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~ 169 (339)
+|+|.. ||+.|..++.|.+|++.|+++++|.|+
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 589966 999999999999999999999999987
No 24
>PHA00732 hypothetical protein
Probab=97.99 E-value=5.1e-06 Score=59.33 Aligned_cols=51 Identities=22% Similarity=0.428 Sum_probs=40.5
Q ss_pred ceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcCCC
Q psy13347 165 PFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKTR 236 (339)
Q Consensus 165 ~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~ 236 (339)
||.|. .|++.|.+...|..|++.+ |. ++.|+.|++.|. .|..|++.+.+..
T Consensus 1 py~C~--~Cgk~F~s~s~Lk~H~r~~-------------H~---~~~C~~CgKsF~---~l~~H~~~~~~~~ 51 (79)
T PHA00732 1 MFKCP--ICGFTTVTLFALKQHARRN-------------HT---LTKCPVCNKSYR---RLNQHFYSQYDIE 51 (79)
T ss_pred CccCC--CCCCccCCHHHHHHHhhcc-------------cC---CCccCCCCCEeC---ChhhhhcccCCcc
Confidence 58898 9999999999999998531 22 368999999998 4888987766544
No 25
>PHA00616 hypothetical protein
Probab=97.93 E-value=4.8e-06 Score=51.59 Aligned_cols=28 Identities=21% Similarity=0.506 Sum_probs=26.7
Q ss_pred CcCCCccccccCChhHHHHHHHhhcCCC
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHSKTR 236 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~~~~ 236 (339)
||+|..||+.|..+..|..|++.|++..
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~ 28 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQN 28 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCC
Confidence 6999999999999999999999999986
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.88 E-value=5.4e-06 Score=44.51 Aligned_cols=23 Identities=30% Similarity=0.892 Sum_probs=21.2
Q ss_pred cCCCccccccCChhHHHHHHHhh
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTH 232 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h 232 (339)
|.|+.|++.|.++..|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999999875
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.78 E-value=3e-05 Score=51.33 Aligned_cols=53 Identities=23% Similarity=0.510 Sum_probs=37.2
Q ss_pred ceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcC
Q psy13347 165 PFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 165 ~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
.|.|+ +|++ ..+...|..|....+.. ..+.+.|++|...+. .+|..|+..+++
T Consensus 2 ~f~CP--~C~~-~~~~~~L~~H~~~~H~~------------~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCP--YCGK-GFSESSLVEHCEDEHRS------------ESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCC--CCCC-ccCHHHHHHHHHhHCcC------------CCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 47898 8999 45567888887542221 135689999998655 388899887653
No 28
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.57 E-value=3.7e-05 Score=41.42 Aligned_cols=24 Identities=42% Similarity=0.901 Sum_probs=19.9
Q ss_pred cCCCccccccCChhHHHHHHHhhc
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHS 233 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~ 233 (339)
|.|++|++.|.+...|..|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999999874
No 29
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.54 E-value=5.2e-05 Score=50.16 Aligned_cols=47 Identities=17% Similarity=0.323 Sum_probs=37.1
Q ss_pred CcCCCccccccCChhHHHHHHHh-hcCCC----CCccccccCCcccccccccccc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKT-HSKTR----CTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~-h~~~~----C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
.|.|++|++ ..+...|..|... |..+. |++|...+. .+|..|+..+|
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 389999999 4556789999775 54433 999998765 48999998876
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.51 E-value=7.3e-05 Score=40.00 Aligned_cols=23 Identities=39% Similarity=0.830 Sum_probs=18.2
Q ss_pred EEeeccccccccCCchhHHHHHhhh
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
|.|.+ |++.|.++..|..|++.|
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhHC
Confidence 67855 888888888888888765
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.43 E-value=5.6e-05 Score=42.22 Aligned_cols=26 Identities=35% Similarity=0.779 Sum_probs=23.3
Q ss_pred CcCCCccccccCChhHHHHHHHhhcC
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
+|.|..|++.|.+...|..|++.|..
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 58999999999999999999998854
No 32
>KOG1146|consensus
Probab=97.27 E-value=0.00038 Score=72.18 Aligned_cols=126 Identities=17% Similarity=0.231 Sum_probs=84.0
Q ss_pred ccCCCCeEEeeccccccccCCchhHHHHHhh-h------------------------CCCCceecCccccCCccCChHHH
Q psy13347 129 TADRKKVHICHIEGCNKVYGKTSHLRAHLRW-H------------------------TGERPFVCTWQFCGKRFTRSDEL 183 (339)
Q Consensus 129 ~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~-H------------------------~~~k~~~C~~~~C~~~f~~~~~L 183 (339)
.|+..+.|+|.. |+..|+....|..|||. | .+.++|.|. .|...+..+..|
T Consensus 459 L~S~~kt~~cpk--c~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~--~C~~stttng~L 534 (1406)
T KOG1146|consen 459 LHSFFKTLKCPK--CNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCR--ACNYSTTTNGNL 534 (1406)
T ss_pred eecccccccCCc--cchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccce--eeeeeeecchHH
Confidence 344557778855 88888888888888775 2 124679998 999999999999
Q ss_pred HhhhcccCcccch------hcccccee-----------------------cCCCCcCCCccccccCChhHHHHHHHh--h
Q psy13347 184 QVNGGRFGIERTV------VRILPMTW-----------------------CREKRFQCSECHKKFMRSDHLQKHIKT--H 232 (339)
Q Consensus 184 ~~H~~~~~~~~~~------~~~~~~~h-----------------------~~~k~~~C~~C~k~F~~~~~L~~H~~~--h 232 (339)
.+|+..-.....+ ...+.+.+ ...-.|.|.+|++.-.-..+|+.||.. |
T Consensus 535 sihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~ 614 (1406)
T KOG1146|consen 535 SIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPS 614 (1406)
T ss_pred HHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCC
Confidence 9997531111110 00000000 002358899999988888899999873 3
Q ss_pred cCC-C-CCccccccCCcccccccccccc
Q psy13347 233 SKT-R-CTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 233 ~~~-~-C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
... . |-.|...+.....+..+.+.+-
T Consensus 615 s~~p~~~Lq~~it~~l~~~~~~~~~lp~ 642 (1406)
T KOG1146|consen 615 SSPPSLVLQQNITSSLASLLGGQGRLPF 642 (1406)
T ss_pred CCChHHHhhhcchhhccccccCcCCCCC
Confidence 222 1 8888888888888888777763
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.15 E-value=0.00021 Score=53.51 Aligned_cols=73 Identities=21% Similarity=0.400 Sum_probs=20.6
Q ss_pred EeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccc
Q psy13347 137 ICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECH 216 (339)
Q Consensus 137 ~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~ 216 (339)
+|.+ |+..|.....|..|+...++-. . .....+.....+..+.+ ..-...+.|.+|+
T Consensus 1 ~C~~--C~~~f~~~~~l~~H~~~~H~~~---~---~~~~~l~~~~~~~~~~~---------------~~~~~~~~C~~C~ 57 (100)
T PF12756_consen 1 QCLF--CDESFSSVDDLLQHMKKKHGFD---I---PDQKYLVDPNRLLNYLR---------------KKVKESFRCPYCN 57 (100)
T ss_dssp -----------------------------------------------------------------------SSEEBSSSS
T ss_pred Cccc--cccccccccccccccccccccc---c---ccccccccccccccccc---------------cccCCCCCCCccC
Confidence 4767 8888888888888886533321 1 11111222233332221 0112358899999
Q ss_pred cccCChhHHHHHHHhh
Q psy13347 217 KKFMRSDHLQKHIKTH 232 (339)
Q Consensus 217 k~F~~~~~L~~H~~~h 232 (339)
+.|.+...|..||+.+
T Consensus 58 ~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 58 KTFRSREALQEHMRSK 73 (100)
T ss_dssp -EESSHHHHHHHHHHT
T ss_pred CCCcCHHHHHHHHcCc
Confidence 9999999999999864
No 34
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.00 E-value=0.00064 Score=37.00 Aligned_cols=24 Identities=38% Similarity=0.915 Sum_probs=21.8
Q ss_pred cCCCccccccCChhHHHHHHHhhc
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHS 233 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~ 233 (339)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 689999999999999999999775
No 35
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.96 E-value=0.00066 Score=36.31 Aligned_cols=23 Identities=43% Similarity=0.839 Sum_probs=15.4
Q ss_pred EEeeccccccccCCchhHHHHHhhh
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
|.|++ |++.|.+...|..|++.|
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHH
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhh
Confidence 56766 777777777777777765
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.82 E-value=0.00089 Score=37.22 Aligned_cols=25 Identities=36% Similarity=0.599 Sum_probs=18.5
Q ss_pred eEEeeccccccccCCchhHHHHHhhhC
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHT 161 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~ 161 (339)
+|.|.. |++.|.+...|..|++.|.
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhc
Confidence 467765 8888888888888877664
No 37
>KOG2231|consensus
Probab=96.77 E-value=0.013 Score=57.81 Aligned_cols=44 Identities=25% Similarity=0.646 Sum_probs=26.3
Q ss_pred EeeccccccccCCchhHHHHHhhhCCCCceecCcccc------CCccCChHHHHhhhc
Q psy13347 137 ICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFC------GKRFTRSDELQVNGG 188 (339)
Q Consensus 137 ~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C------~~~f~~~~~L~~H~~ 188 (339)
.|.+ |...|.....|.+|++.++ |.|. .| +..|.....|..|-+
T Consensus 184 ~C~~--C~~~fld~~el~rH~~~~h----~~ch--fC~~~~~~neyy~~~~dLe~HfR 233 (669)
T KOG2231|consen 184 LCKF--CHERFLDDDELYRHLRFDH----EFCH--FCDYKTGQNEYYNDYDDLEEHFR 233 (669)
T ss_pred cchh--hhhhhccHHHHHHhhccce----ehee--ecCcccccchhcccchHHHHHhh
Confidence 4655 7777777777777766543 4444 44 344556666666654
No 38
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.67 E-value=0.00034 Score=52.38 Aligned_cols=48 Identities=23% Similarity=0.421 Sum_probs=19.6
Q ss_pred CCCccccccCChhHHHHHHHhhcCCC--------------------------CCccccccCCcccccccccccc
Q psy13347 211 QCSECHKKFMRSDHLQKHIKTHSKTR--------------------------CTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 211 ~C~~C~k~F~~~~~L~~H~~~h~~~~--------------------------C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
+|.+|+..|.....|..||...++.. |..|++.|.+...|..|++.++
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp ------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred CccccccccccccccccccccccccccccccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 48888888888888888886544311 9999999999999999999764
No 39
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=96.57 E-value=0.0013 Score=62.57 Aligned_cols=144 Identities=18% Similarity=0.215 Sum_probs=89.2
Q ss_pred Ccccccccccccccccccccccc--ccCCC--CeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChH
Q psy13347 106 NKIRLSSYFYPALISTLFSCSLR--TADRK--KVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSD 181 (339)
Q Consensus 106 ~~~~c~~c~~~~~~~~~l~~h~~--~h~~~--~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~ 181 (339)
..+.|..|...+.....|.+|.+ .|+.+ +++.|.+..|++.|.+...+..|...|.+.+++.|.+..|.+.+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 46788889999999999999999 79998 899997224999999999999999999988888887666666665444
Q ss_pred HHHhhhcccCcccchhccccceecCCCCcCCCc--cccccCChhHHHHHHHhhcCCC-----CCccccccCCcccccccc
Q psy13347 182 ELQVNGGRFGIERTVVRILPMTWCREKRFQCSE--CHKKFMRSDHLQKHIKTHSKTR-----CTSLVMNFSDNTVVTEDS 254 (339)
Q Consensus 182 ~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~--C~k~F~~~~~L~~H~~~h~~~~-----C~~C~~~f~~~~~l~~H~ 254 (339)
.-..+...... ..-...+.+.|.. |-..+.+...+..|...|.... +..|.+.|.....|..|+
T Consensus 368 ~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (467)
T COG5048 368 NNEPPQSLQQY---------KDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHK 438 (467)
T ss_pred CCCCccchhhc---------cCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccc
Confidence 32111100000 0011122233322 4445555555555544443322 555555555555555555
Q ss_pred cccc
Q psy13347 255 KLEL 258 (339)
Q Consensus 255 ~~~~ 258 (339)
+.+.
T Consensus 439 ~~~~ 442 (467)
T COG5048 439 KIHT 442 (467)
T ss_pred cccc
Confidence 5554
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.29 E-value=0.0031 Score=40.10 Aligned_cols=32 Identities=19% Similarity=0.396 Sum_probs=22.8
Q ss_pred cCCCCcCCCccccccCChhHHHHHHHhhcCCC
Q psy13347 205 CREKRFQCSECHKKFMRSDHLQKHIKTHSKTR 236 (339)
Q Consensus 205 ~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~ 236 (339)
..+.|-.|++|+..+.+..+|++|+.++++.+
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 35688999999999999999999998777643
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.24 E-value=0.0057 Score=54.77 Aligned_cols=105 Identities=23% Similarity=0.393 Sum_probs=69.8
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccC---CccC------ChHHHHhhhcccCcccchhccccceec
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCG---KRFT------RSDELQVNGGRFGIERTVVRILPMTWC 205 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~---~~f~------~~~~L~~H~~~~~~~~~~~~~~~~~h~ 205 (339)
.|.|+...|.........|+.|.+.-++ .+-|. +|- +.|. ++..|+.|.. .-.
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~--~C~~nKk~F~~E~~lF~~~~Lr~H~~--------------~G~ 212 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCS--ECIGNKKDFWNEIRLFRSSTLRDHKN--------------GGL 212 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhH--hhhcCcccCccceeeeeccccccccc--------------CCc
Confidence 4889877788888778899999886443 36676 663 2333 3344555532 111
Q ss_pred CCCCc----CCCccccccCChhHHHHHHHhhcCCCCCccccc-------cCCcccccccccccc
Q psy13347 206 REKRF----QCSECHKKFMRSDHLQKHIKTHSKTRCTSLVMN-------FSDNTVVTEDSKLEL 258 (339)
Q Consensus 206 ~~k~~----~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~-------f~~~~~l~~H~~~~~ 258 (339)
.+..| .|.+|.+.|-.-..|..|+|..+ ++|.+|++. |++...|..|-+..|
T Consensus 213 ~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h 275 (493)
T COG5236 213 EEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNAH 275 (493)
T ss_pred cccCcCCCchhhhccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcCc
Confidence 12122 49999999999999999998544 458888764 666666777655433
No 42
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.20 E-value=0.0024 Score=34.34 Aligned_cols=24 Identities=25% Similarity=0.717 Sum_probs=18.7
Q ss_pred cCCCccccccCChhHHHHHHHhhcC
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
|+|+.|+.... +..|..|+++|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 68999998888 8899999998753
No 43
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.11 E-value=0.0023 Score=34.76 Aligned_cols=23 Identities=30% Similarity=0.852 Sum_probs=20.1
Q ss_pred cCCCccccccCChhHHHHHHHhh
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTH 232 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h 232 (339)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 57999999999999999998754
No 44
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.74 E-value=0.012 Score=37.50 Aligned_cols=32 Identities=28% Similarity=0.565 Sum_probs=20.4
Q ss_pred CCCeEEeeccccccccCCchhHHHHHhhhCCCCc
Q psy13347 132 RKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERP 165 (339)
Q Consensus 132 ~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~ 165 (339)
.+.+..|++ |+..+....+|++|+..+++.||
T Consensus 21 S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 357788866 88888888888888887666655
No 45
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.69 E-value=0.0045 Score=54.82 Aligned_cols=23 Identities=26% Similarity=0.819 Sum_probs=11.8
Q ss_pred CCceecCccccCCccCChHHHHhhh
Q psy13347 163 ERPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 163 ~k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
.|||+|+ +|+|+|+....|+.|+
T Consensus 396 ~KPYrCe--vC~KRYKNlNGLKYHr 418 (423)
T COG5189 396 DKPYRCE--VCDKRYKNLNGLKYHR 418 (423)
T ss_pred CCceecc--ccchhhccCccceecc
Confidence 3555554 5555555555555543
No 46
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.50 E-value=0.0067 Score=32.76 Aligned_cols=24 Identities=38% Similarity=0.643 Sum_probs=15.4
Q ss_pred EEeeccccccccCCchhHHHHHhhhC
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWHT 161 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H~ 161 (339)
|.|.. |++.|.....|..|++.|.
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHhc
Confidence 35644 7777777777777766553
No 47
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.48 E-value=0.0055 Score=58.26 Aligned_cols=112 Identities=24% Similarity=0.342 Sum_probs=79.3
Q ss_pred CeEEeeccccccccCCchhHHHHHh--hhCCC--CceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCC
Q psy13347 134 KVHICHIEGCNKVYGKTSHLRAHLR--WHTGE--RPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKR 209 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~~~~L~~H~~--~H~~~--k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~ 209 (339)
.++.|.. |...|.....|.+|.+ .|.++ +++.|++..|++.|.+...+..|.. .|.+.++
T Consensus 288 ~~~~~~~--~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~--------------~~~~~~~ 351 (467)
T COG5048 288 LPIKSKQ--CNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHIL--------------LHTSISP 351 (467)
T ss_pred cCCCCcc--ccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcc--------------cccCCCc
Confidence 4688866 9999999999999999 89999 9999986679999999999999986 4555555
Q ss_pred cCCCc--cccccCChhHHH----HHH--HhhcCCC----CCccccccCCccccccccccccCCC
Q psy13347 210 FQCSE--CHKKFMRSDHLQ----KHI--KTHSKTR----CTSLVMNFSDNTVVTEDSKLELSGG 261 (339)
Q Consensus 210 ~~C~~--C~k~F~~~~~L~----~H~--~~h~~~~----C~~C~~~f~~~~~l~~H~~~~~~~~ 261 (339)
+.|.. |.+.+.....-. .+. ..+.... =..|...+.+...+..|...+-...
T Consensus 352 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (467)
T COG5048 352 AKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFR 415 (467)
T ss_pred cccccccCccccccccCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccC
Confidence 66544 555554443311 111 1111111 3467778888888888887776543
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.48 E-value=0.011 Score=31.97 Aligned_cols=23 Identities=43% Similarity=0.859 Sum_probs=17.2
Q ss_pred EEeeccccccccCCchhHHHHHhhh
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
|.|.+ |++.|.....|..|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 56777 888888888888887653
No 49
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.12 E-value=0.0052 Score=34.13 Aligned_cols=22 Identities=36% Similarity=0.901 Sum_probs=17.9
Q ss_pred cCCCccccccCChhHHHHHHHh
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKT 231 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~ 231 (339)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6788888888888888888764
No 50
>KOG2231|consensus
Probab=95.03 E-value=0.089 Score=52.19 Aligned_cols=100 Identities=21% Similarity=0.289 Sum_probs=59.9
Q ss_pred cccccccccccccccccccccc-cCC---------CCeEEeeccccccccCCchhHHHHHhhh-CCCC----ceecCccc
Q psy13347 108 IRLSSYFYPALISTLFSCSLRT-ADR---------KKVHICHIEGCNKVYGKTSHLRAHLRWH-TGER----PFVCTWQF 172 (339)
Q Consensus 108 ~~c~~c~~~~~~~~~l~~h~~~-h~~---------~~~~~C~~~~C~k~F~~~~~L~~H~~~H-~~~k----~~~C~~~~ 172 (339)
-.|..| -.|.....|+.|++. |.. .+.|.| +.|.|. ...|..|++.- .+++ .-.|. .
T Consensus 116 ~~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~----e~k~Yt-~~el~~h~~~gd~d~~s~rGhp~C~--~ 187 (669)
T KOG2231|consen 116 KECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFIN----ERKLYT-RAELNLHLMFGDPDDESCRGHPLCK--F 187 (669)
T ss_pred CCCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeee----eeeheh-HHHHHHHHhcCCCccccccCCccch--h
Confidence 344444 444456677777733 321 233333 445554 45677787642 1121 24688 9
Q ss_pred cCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCcc------ccccCChhHHHHHHHhhc
Q psy13347 173 CGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSEC------HKKFMRSDHLQKHIKTHS 233 (339)
Q Consensus 173 C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C------~k~F~~~~~L~~H~~~h~ 233 (339)
|...|.....|.+|++..+ |.|.+| +..|.....|..|.|.+|
T Consensus 188 C~~~fld~~el~rH~~~~h------------------~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 188 CHERFLDDDELYRHLRFDH------------------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred hhhhhccHHHHHHhhccce------------------eheeecCcccccchhcccchHHHHHhhhcC
Confidence 9999999999999986422 555555 345666777777777655
No 51
>PRK04860 hypothetical protein; Provisional
Probab=94.43 E-value=0.026 Score=46.11 Aligned_cols=40 Identities=25% Similarity=0.505 Sum_probs=34.2
Q ss_pred CeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChH
Q psy13347 134 KVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSD 181 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~ 181 (339)
-+|.| . |++ ....+++|.++|.++++|.|. .|+..|....
T Consensus 118 ~~Y~C--~-C~~---~~~~~rrH~ri~~g~~~YrC~--~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRC--K-CQE---HQLTVRRHNRVVRGEAVYRCR--RCGETLVFKG 157 (160)
T ss_pred EEEEc--C-CCC---eeCHHHHHHHHhcCCccEECC--CCCceeEEec
Confidence 36999 4 987 677899999999999999999 9999887543
No 52
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.12 E-value=0.034 Score=29.76 Aligned_cols=22 Identities=32% Similarity=0.582 Sum_probs=13.9
Q ss_pred EEeeccccccccCCchhHHHHHhhh
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
|+|.. |+.... ...|..|++.|
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHH
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhh
Confidence 56755 777776 77777777764
No 53
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.95 E-value=0.046 Score=29.70 Aligned_cols=21 Identities=43% Similarity=0.839 Sum_probs=17.6
Q ss_pred cCCCccccccCChhHHHHHHHh
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKT 231 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~ 231 (339)
..|+.||+.| ....|..|+++
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 5799999999 66788899875
No 54
>PRK04860 hypothetical protein; Provisional
Probab=93.41 E-value=0.043 Score=44.89 Aligned_cols=37 Identities=11% Similarity=0.301 Sum_probs=31.5
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCC---CCccccccCCcc
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTR---CTSLVMNFSDNT 248 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~---C~~C~~~f~~~~ 248 (339)
-+|.|. |++ ....+++|.++|++++ |..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 469998 998 7778999999999877 999999987643
No 55
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.19 E-value=0.026 Score=31.22 Aligned_cols=22 Identities=23% Similarity=0.733 Sum_probs=13.7
Q ss_pred EEeeccccccccCCchhHHHHHhh
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRW 159 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~ 159 (339)
|.|.+ |++.|.+...|..|++.
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 45655 66666666666666653
No 56
>KOG1146|consensus
Probab=92.54 E-value=0.066 Score=56.35 Aligned_cols=115 Identities=15% Similarity=0.247 Sum_probs=83.5
Q ss_pred cccccCCchhHHHHHh-hhCCCCceecCccccCCccCChHHHHhhhcccCcc-cch----hccccc------eecCCCCc
Q psy13347 143 CNKVYGKTSHLRAHLR-WHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIE-RTV----VRILPM------TWCREKRF 210 (339)
Q Consensus 143 C~k~F~~~~~L~~H~~-~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~-~~~----~~~~~~------~h~~~k~~ 210 (339)
|+..|.++..+..|+. .|.-.+.|+|+ .|+..|+....|..|||..+.. +.. ...|.+ .-.+.++|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cp--kc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~ 519 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCP--KCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPY 519 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCC--ccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcc
Confidence 5556666666666655 46666889999 9999999999999999873222 111 111111 12345789
Q ss_pred CCCccccccCChhHHHHHHHh--hcCC--------------------------------------C------CCcccccc
Q psy13347 211 QCSECHKKFMRSDHLQKHIKT--HSKT--------------------------------------R------CTSLVMNF 244 (339)
Q Consensus 211 ~C~~C~k~F~~~~~L~~H~~~--h~~~--------------------------------------~------C~~C~~~f 244 (339)
.|..|...|..+.+|..|+.. |..+ + |..|++.-
T Consensus 520 ~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yet 599 (1406)
T KOG1146|consen 520 PCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYET 599 (1406)
T ss_pred cceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchh
Confidence 999999999999999999873 3211 0 99999999
Q ss_pred CCccccccccccccC
Q psy13347 245 SDNTVVTEDSKLELS 259 (339)
Q Consensus 245 ~~~~~l~~H~~~~~~ 259 (339)
.-..+|+.||...+.
T Consensus 600 niarnlrihmtss~~ 614 (1406)
T KOG1146|consen 600 NIARNLRIHMTASPS 614 (1406)
T ss_pred hhhhccccccccCCC
Confidence 999999999886543
No 57
>KOG2893|consensus
Probab=92.49 E-value=0.049 Score=46.43 Aligned_cols=44 Identities=30% Similarity=0.676 Sum_probs=31.9
Q ss_pred ccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHH-HHhhc
Q psy13347 172 FCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKH-IKTHS 233 (339)
Q Consensus 172 ~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H-~~~h~ 233 (339)
+|++.|....-|..|++ .|-|+|.+|.|..-+--.|..| |++|.
T Consensus 15 ycnrefddekiliqhqk------------------akhfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 15 YCNREFDDEKILIQHQK------------------AKHFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred ecccccchhhhhhhhhh------------------hccceeeeehhhhccCCCceeehhhhhh
Confidence 78888888888887774 3568888888776666667666 45553
No 58
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.08 E-value=0.093 Score=30.81 Aligned_cols=22 Identities=27% Similarity=0.631 Sum_probs=20.0
Q ss_pred CcCCCccccccCChhHHHHHHH
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~ 230 (339)
+|.|++|++.|.....+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4889999999999999999986
No 59
>KOG4124|consensus
Probab=91.88 E-value=0.057 Score=48.64 Aligned_cols=66 Identities=17% Similarity=0.345 Sum_probs=45.4
Q ss_pred CCceecCccccCCccCChHHHHhhhcccCc-c----cchhccccceecCCCCcCCCccccccCChhHHHHH
Q psy13347 163 ERPFVCTWQFCGKRFTRSDELQVNGGRFGI-E----RTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKH 228 (339)
Q Consensus 163 ~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~-~----~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H 228 (339)
.++|+|+++.|.+.++....|+.|...-|. . ...-+.|.......|+|.|++|.++++....|.-|
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~ 417 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYH 417 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCce
Confidence 457899989999999998888888643221 1 11234444455567889999998888776666544
No 60
>KOG2482|consensus
Probab=91.73 E-value=0.2 Score=45.15 Aligned_cols=134 Identities=17% Similarity=0.196 Sum_probs=85.9
Q ss_pred cccccccccCC-CCeEEeeccccccccC-CchhHHHHHh-hhCC---------------------CCceecCccccCCcc
Q psy13347 122 LFSCSLRTADR-KKVHICHIEGCNKVYG-KTSHLRAHLR-WHTG---------------------ERPFVCTWQFCGKRF 177 (339)
Q Consensus 122 ~l~~h~~~h~~-~~~~~C~~~~C~k~F~-~~~~L~~H~~-~H~~---------------------~k~~~C~~~~C~~~f 177 (339)
.|..+++.-.+ ....+|-+ |...+. ..+.+..|+- +|.- -..+.|- .|.+.|
T Consensus 130 aLeqqQ~Eredt~fslqClF--Cn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CL--yCekif 205 (423)
T KOG2482|consen 130 ALEQQQKEREDTIFSLQCLF--CNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCL--YCEKIF 205 (423)
T ss_pred HHHHHHHHhcCCeeeeEEEE--ecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheee--eecccc
Confidence 44444443322 24567988 987664 3455666654 3421 1147999 999999
Q ss_pred CChHHHHhhhcccCccc--ch---------------------hcccc--ce---e------------cCCCC--cCCCcc
Q psy13347 178 TRSDELQVNGGRFGIER--TV---------------------VRILP--MT---W------------CREKR--FQCSEC 215 (339)
Q Consensus 178 ~~~~~L~~H~~~~~~~~--~~---------------------~~~~~--~~---h------------~~~k~--~~C~~C 215 (339)
..+..|+.||+.....+ +. ...+- -+ + .+..+ .+|-+|
T Consensus 206 rdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC 285 (423)
T KOG2482|consen 206 RDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFC 285 (423)
T ss_pred CCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEee
Confidence 99999999996321111 00 00000 00 0 00122 489999
Q ss_pred ccccCChhHHHHHHHhhcCC------------------------------CCCccccccCCccccccccccccC
Q psy13347 216 HKKFMRSDHLQKHIKTHSKT------------------------------RCTSLVMNFSDNTVVTEDSKLELS 259 (339)
Q Consensus 216 ~k~F~~~~~L~~H~~~h~~~------------------------------~C~~C~~~f~~~~~l~~H~~~~~~ 259 (339)
....-+...|..||++-+.- +|..|...|-....|..||..+..
T Consensus 286 ~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~k~ 359 (423)
T KOG2482|consen 286 TNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVEDKH 359 (423)
T ss_pred ccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccccc
Confidence 99999999999999864421 099999999999999999987643
No 61
>KOG2785|consensus
Probab=89.72 E-value=0.49 Score=43.49 Aligned_cols=49 Identities=20% Similarity=0.177 Sum_probs=42.6
Q ss_pred CcCCCccccccCChhHHHHHHHhhcCCC--------------------------CCccc---cccCCccccccccccc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHSKTR--------------------------CTSLV---MNFSDNTVVTEDSKLE 257 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~~~~--------------------------C~~C~---~~f~~~~~l~~H~~~~ 257 (339)
|-.|-+|++.|..-..-..||..+++.. |-.|. +.|.+....+.||...
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K 243 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDK 243 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhc
Confidence 4679999999999999999999998854 88888 9999999999998753
No 62
>KOG2482|consensus
Probab=89.68 E-value=0.21 Score=45.03 Aligned_cols=94 Identities=22% Similarity=0.409 Sum_probs=64.4
Q ss_pred eEEeeccccccccCCchhHHHHHhh--hCCCC------------------------------------------------
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRW--HTGER------------------------------------------------ 164 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~--H~~~k------------------------------------------------ 164 (339)
.+.|-+ |.|.|..+..|+.|||. |....
T Consensus 195 r~~CLy--CekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~e 272 (423)
T KOG2482|consen 195 RLRCLY--CEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNE 272 (423)
T ss_pred hheeee--eccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhc
Confidence 478988 99999999999999984 53210
Q ss_pred ----c--eecCccccCCccCChHHHHhhhcccCcccch-------------hccccceecCCCCcCCCccccccCChhHH
Q psy13347 165 ----P--FVCTWQFCGKRFTRSDELQVNGGRFGIERTV-------------VRILPMTWCREKRFQCSECHKKFMRSDHL 225 (339)
Q Consensus 165 ----~--~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~-------------~~~~~~~h~~~k~~~C~~C~k~F~~~~~L 225 (339)
+ .+|- .|.....+...|..||+.-|.-.-. ++....+-...+.-.|-.|.-.|.....|
T Consensus 273 d~a~a~~v~CL--fC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l 350 (423)
T KOG2482|consen 273 DDAEALSVVCL--FCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGL 350 (423)
T ss_pred CCCCccceEEE--eeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchh
Confidence 1 4788 9999999999999999753321100 00000011112335678899999999999
Q ss_pred HHHHHhh
Q psy13347 226 QKHIKTH 232 (339)
Q Consensus 226 ~~H~~~h 232 (339)
..||..+
T Consensus 351 ~~hm~e~ 357 (423)
T KOG2482|consen 351 LIHMVED 357 (423)
T ss_pred hhhcccc
Confidence 9999754
No 63
>KOG4173|consensus
Probab=88.62 E-value=0.051 Score=45.29 Aligned_cols=51 Identities=25% Similarity=0.602 Sum_probs=35.6
Q ss_pred EeeccccccccCCchhHHHHHh-hh---------CCCCceecCccccCCccCChHHHHhhhcc
Q psy13347 137 ICHIEGCNKVYGKTSHLRAHLR-WH---------TGERPFVCTWQFCGKRFTRSDELQVNGGR 189 (339)
Q Consensus 137 ~C~~~~C~k~F~~~~~L~~H~~-~H---------~~~k~~~C~~~~C~~~f~~~~~L~~H~~~ 189 (339)
.|.+ |.+.|.+...|..|+. .| .|..-|.|-.+.|+..|.+....+.|+-.
T Consensus 108 sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 108 SCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIR 168 (253)
T ss_pred hhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence 5766 7788877777777765 33 33445777777788888877777777643
No 64
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=87.24 E-value=0.43 Score=27.85 Aligned_cols=22 Identities=27% Similarity=0.668 Sum_probs=14.6
Q ss_pred eEEeeccccccccCCchhHHHHHh
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLR 158 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~ 158 (339)
+|.|.+ |++.|.....+..|++
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHC
Confidence 366766 7777776666666665
No 65
>KOG4173|consensus
Probab=85.95 E-value=0.14 Score=42.67 Aligned_cols=92 Identities=20% Similarity=0.393 Sum_probs=66.4
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCC
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQC 212 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C 212 (339)
...|.|.+.+|-..|........|...-++ -.|. .|.+.|.+...|..|+...|...- ...+..|.-.|+|
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs--~C~r~~Pt~hLLd~HI~E~HDs~F----qa~veRG~dMy~C 147 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCS--FCKRAFPTGHLLDAHILEWHDSLF----QALVERGQDMYQC 147 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhccc---chhH--HHHHhCCchhhhhHHHHHHHHHHH----HHHHHcCccHHHH
Confidence 356889999999999988888888764322 3798 999999999999999743221100 0012234567999
Q ss_pred Cc--cccccCChhHHHHHHHhhc
Q psy13347 213 SE--CHKKFMRSDHLQKHIKTHS 233 (339)
Q Consensus 213 ~~--C~k~F~~~~~L~~H~~~h~ 233 (339)
-+ |+..|.+...-+.|+-.-+
T Consensus 148 lvEgCt~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 148 LVEGCTEKFKTSRDRKDHMIRMH 170 (253)
T ss_pred HHHhhhhhhhhhhhhhhHHHHhc
Confidence 65 9999999999999976433
No 66
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=84.35 E-value=0.18 Score=43.43 Aligned_cols=41 Identities=20% Similarity=0.372 Sum_probs=29.3
Q ss_pred CCCcCCCccccccCChhHHHHHHHh---hcCCC---------------CCccccccCCc
Q psy13347 207 EKRFQCSECHKKFMRSDHLQKHIKT---HSKTR---------------CTSLVMNFSDN 247 (339)
Q Consensus 207 ~k~~~C~~C~k~F~~~~~L~~H~~~---h~~~~---------------C~~C~~~f~~~ 247 (339)
+|.+.|++|++.|.++.-+....++ ..... |+.||..|...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 3668999999999988766655543 11111 99999988754
No 67
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=84.16 E-value=0.43 Score=30.96 Aligned_cols=30 Identities=20% Similarity=0.441 Sum_probs=25.4
Q ss_pred cCCCCcCCCccccccCChhHHHHHHHhhcC
Q psy13347 205 CREKRFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 205 ~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
-|+.-+.|+-||..|....++.+|...-++
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 467779999999999999999999876554
No 68
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=84.03 E-value=2.1 Score=38.89 Aligned_cols=117 Identities=21% Similarity=0.288 Sum_probs=67.3
Q ss_pred cccccccccccCCCCeEEeeccccc---cccC------CchhHHHHHhhhCCCCc----eecCccccCCccCChHHHHhh
Q psy13347 120 STLFSCSLRTADRKKVHICHIEGCN---KVYG------KTSHLRAHLRWHTGERP----FVCTWQFCGKRFTRSDELQVN 186 (339)
Q Consensus 120 ~~~l~~h~~~h~~~~~~~C~~~~C~---k~F~------~~~~L~~H~~~H~~~k~----~~C~~~~C~~~f~~~~~L~~H 186 (339)
-..|+.|.+..++ .+.|.+ |- +.|. ++..|+.|...-..+.- -.|. .|...|...+.|.+|
T Consensus 166 ~k~lk~H~K~~H~--~~~C~~--C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~--FC~~~FYdDDEL~~H 239 (493)
T COG5236 166 LKELKKHYKAQHG--FVLCSE--CIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCI--FCKIYFYDDDELRRH 239 (493)
T ss_pred HHHHHHHHHhhcC--cEEhHh--hhcCcccCccceeeeecccccccccCCccccCcCCCchhh--hccceecChHHHHHH
Confidence 4455666555332 355655 63 2232 34567777664322212 2577 888888888888888
Q ss_pred hcccCcccchhccccceecCCCCcCCCcccc-------ccCChhHHHHHHHhhc-CC---CCCcc-ccccCCcccccccc
Q psy13347 187 GGRFGIERTVVRILPMTWCREKRFQCSECHK-------KFMRSDHLQKHIKTHS-KT---RCTSL-VMNFSDNTVVTEDS 254 (339)
Q Consensus 187 ~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k-------~F~~~~~L~~H~~~h~-~~---~C~~C-~~~f~~~~~l~~H~ 254 (339)
++. +--.|-+|++ .|.....|..|.+.-+ -- .|-.- -..|.....|..|+
T Consensus 240 cR~------------------~HE~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~ct~qtc~~~k~~vf~~~~el~~h~ 301 (493)
T COG5236 240 CRL------------------RHEACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCCTFQTCRVGKCYVFPYHTELLEHL 301 (493)
T ss_pred HHh------------------hhhhhhhhhccCccchhhhhCHHHHHHHhhcCceEEEEEEEecCcEEEeccHHHHHHHH
Confidence 852 2234555544 4777788888876322 10 02211 24688888888887
Q ss_pred ccccCC
Q psy13347 255 KLELSG 260 (339)
Q Consensus 255 ~~~~~~ 260 (339)
..-|..
T Consensus 302 ~~~h~~ 307 (493)
T COG5236 302 TRFHKV 307 (493)
T ss_pred HHHhhc
Confidence 776654
No 69
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=78.66 E-value=1.5 Score=33.36 Aligned_cols=15 Identities=20% Similarity=0.612 Sum_probs=11.4
Q ss_pred CCcCCCccccccCCh
Q psy13347 208 KRFQCSECHKKFMRS 222 (339)
Q Consensus 208 k~~~C~~C~k~F~~~ 222 (339)
.|..|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 677788888877665
No 70
>KOG4377|consensus
Probab=78.64 E-value=1.5 Score=40.71 Aligned_cols=116 Identities=16% Similarity=0.269 Sum_probs=68.9
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCC------------ceecCccccCCccCChHHHHhhhcccCcccchhccc
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGER------------PFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRIL 200 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k------------~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~ 200 (339)
...|.|--+.|+..+..+.++.+|..+|.... -|.|..+.|.+ +-++...|--.|...
T Consensus 269 rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~------- 338 (480)
T KOG4377|consen 269 REHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDK------- 338 (480)
T ss_pred chhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCcccccc-------
Confidence 34577766779888777888999988885422 26787788888 344555554333111
Q ss_pred cceecC--CCCcCCCccc--cccCChhHHHHHHHhhcCCC----------------C-------------CccccccCCc
Q psy13347 201 PMTWCR--EKRFQCSECH--KKFMRSDHLQKHIKTHSKTR----------------C-------------TSLVMNFSDN 247 (339)
Q Consensus 201 ~~~h~~--~k~~~C~~C~--k~F~~~~~L~~H~~~h~~~~----------------C-------------~~C~~~f~~~ 247 (339)
...| ..-|.|..|| -.|.-...-..|.+-+.++. | .-|+..|...
T Consensus 339 --~n~GfrrthfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~ 416 (480)
T KOG4377|consen 339 --RNNGFRRTHFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSV 416 (480)
T ss_pred --ccCceecceeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEeh
Confidence 1111 1247777776 44442222222333222221 1 3577888899
Q ss_pred cccccccccccCC
Q psy13347 248 TVVTEDSKLELSG 260 (339)
Q Consensus 248 ~~l~~H~~~~~~~ 260 (339)
+.+..|.+.|...
T Consensus 417 sqm~shkrkheRq 429 (480)
T KOG4377|consen 417 SQMASHKRKHERQ 429 (480)
T ss_pred hhhhhhhhhhhhh
Confidence 9999999988654
No 71
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=78.27 E-value=1.4 Score=29.16 Aligned_cols=16 Identities=44% Similarity=0.872 Sum_probs=11.8
Q ss_pred CCCCccceeCCCCCCC
Q psy13347 50 PRSRRVACTCPNCTEG 65 (339)
Q Consensus 50 ~~~~~~~~~C~~C~~~ 65 (339)
+..+-..|.||+|++.
T Consensus 21 p~e~~v~F~CPnCGe~ 36 (61)
T COG2888 21 PGETAVKFPCPNCGEV 36 (61)
T ss_pred cCCceeEeeCCCCCce
Confidence 3445677999999943
No 72
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=78.15 E-value=1.6 Score=23.36 Aligned_cols=14 Identities=43% Similarity=0.909 Sum_probs=10.3
Q ss_pred CCCccceeCCCCCC
Q psy13347 51 RSRRVACTCPNCTE 64 (339)
Q Consensus 51 ~~~~~~~~C~~C~~ 64 (339)
+..-..|.||+|++
T Consensus 11 r~~~v~f~CPnCG~ 24 (24)
T PF07754_consen 11 REQAVPFPCPNCGF 24 (24)
T ss_pred cccCceEeCCCCCC
Confidence 33456799999983
No 73
>PHA00626 hypothetical protein
Probab=77.32 E-value=2.1 Score=27.95 Aligned_cols=14 Identities=21% Similarity=0.370 Sum_probs=8.0
Q ss_pred CeEEeeccccccccCC
Q psy13347 134 KVHICHIEGCNKVYGK 149 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~ 149 (339)
..|+| +.||..|+.
T Consensus 22 nrYkC--kdCGY~ft~ 35 (59)
T PHA00626 22 DDYVC--CDCGYNDSK 35 (59)
T ss_pred cceEc--CCCCCeech
Confidence 45667 337666653
No 74
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=76.95 E-value=1.3 Score=28.86 Aligned_cols=31 Identities=19% Similarity=0.386 Sum_probs=24.6
Q ss_pred cccCCCCeEEeeccccccccCCchhHHHHHhhh
Q psy13347 128 RTADRKKVHICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 128 ~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
+.-.++..+.|+- |+..|....++.+|+..-
T Consensus 10 ~~RDGE~~lrCPR--C~~~FR~~K~Y~RHVNKa 40 (65)
T COG4049 10 RDRDGEEFLRCPR--CGMVFRRRKDYIRHVNKA 40 (65)
T ss_pred eccCCceeeeCCc--hhHHHHHhHHHHHHhhHH
Confidence 4456788889955 999999999999998753
No 75
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=76.47 E-value=1.7 Score=25.79 Aligned_cols=34 Identities=24% Similarity=0.649 Sum_probs=24.9
Q ss_pred eecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccC
Q psy13347 166 FVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFM 220 (339)
Q Consensus 166 ~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~ 220 (339)
+.|+ .|+..|.-...... ......+|+.|+..|.
T Consensus 3 i~Cp--~C~~~y~i~d~~ip-------------------~~g~~v~C~~C~~~f~ 36 (36)
T PF13717_consen 3 ITCP--NCQAKYEIDDEKIP-------------------PKGRKVRCSKCGHVFF 36 (36)
T ss_pred EECC--CCCCEEeCCHHHCC-------------------CCCcEEECCCCCCEeC
Confidence 5788 99999988776521 2235589999998874
No 76
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=73.11 E-value=6.3 Score=30.02 Aligned_cols=45 Identities=18% Similarity=0.358 Sum_probs=25.0
Q ss_pred EeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhh
Q psy13347 137 ICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 137 ~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
.|-- |.+.|........ ..-.....|.|+ .|...|-..-++..|.
T Consensus 57 ~C~~--C~~~f~~~~~~~~--~~~~~~~~y~C~--~C~~~FC~dCD~fiHe 101 (112)
T TIGR00622 57 FCFG--CQGPFPKPPVSPF--DELKDSHRYVCA--VCKNVFCVDCDVFVHE 101 (112)
T ss_pred cccC--cCCCCCCcccccc--cccccccceeCC--CCCCccccccchhhhh
Confidence 3733 7777765431110 001223457787 7887777777777664
No 77
>KOG0696|consensus
Probab=72.59 E-value=4 Score=38.50 Aligned_cols=78 Identities=21% Similarity=0.265 Sum_probs=56.2
Q ss_pred CCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcC
Q psy13347 132 RKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQ 211 (339)
Q Consensus 132 ~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~ 211 (339)
++..|+|.+ |. +..|.|.|. --.|.|+ .=++.+.....-..|.- +.|+-.-|--
T Consensus 70 gKQGfQCqv--C~--------fvvHkrChe-fVtF~CP--Gadkg~dtDdpr~kHkf-------------~~~tYssPTF 123 (683)
T KOG0696|consen 70 GKQGFQCQV--CC--------FVVHKRCHE-FVTFSCP--GADKGPDTDDPRSKHKF-------------KIHTYSSPTF 123 (683)
T ss_pred ccCceeeeE--Ee--------ehhhhhhcc-eEEEECC--CCCCCCCCCCcccccce-------------eeeecCCCch
Confidence 567799988 86 456888874 3468999 88888888776666642 3566566667
Q ss_pred CCccccccCChhHHHHHHHhhcCCCCCccccc
Q psy13347 212 CSECHKKFMRSDHLQKHIKTHSKTRCTSLVMN 243 (339)
Q Consensus 212 C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~ 243 (339)
|+.||. ..|-..|.|.+|..|...
T Consensus 124 CDhCGs--------LLyGl~HQGmKC~~C~mN 147 (683)
T KOG0696|consen 124 CDHCGS--------LLYGLIHQGMKCDTCDMN 147 (683)
T ss_pred hhhHHH--------HHHHHHhcccccccccch
Confidence 999983 235567899998888654
No 78
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=71.90 E-value=2 Score=25.65 Aligned_cols=34 Identities=18% Similarity=0.475 Sum_probs=23.6
Q ss_pred eecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccC
Q psy13347 166 FVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFM 220 (339)
Q Consensus 166 ~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~ 220 (339)
+.|+ .|+..|.-......- ......|+.|+..|.
T Consensus 3 ~~CP--~C~~~~~v~~~~~~~-------------------~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCP--NCKTSFRVVDSQLGA-------------------NGGKVRCGKCGHVWY 36 (38)
T ss_pred EECC--CCCCEEEeCHHHcCC-------------------CCCEEECCCCCCEEE
Confidence 6798 999998877654211 122478999998774
No 79
>KOG2186|consensus
Probab=71.79 E-value=1.9 Score=37.50 Aligned_cols=47 Identities=17% Similarity=0.400 Sum_probs=38.3
Q ss_pred cCCCccccccCChhHHHHHHHhhcCCC--CCccccccCCcccccccccccc
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHSKTR--CTSLVMNFSDNTVVTEDSKLEL 258 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~~~~--C~~C~~~f~~~~~l~~H~~~~~ 258 (339)
|.|.+||...... .|.+|+...++.. |-.|++.|.. ..++.|.+--.
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCCeeEEeeccccccc-chhhhhhhhcc
Confidence 8999999887654 5778999888866 9999999988 78888866443
No 80
>KOG2893|consensus
Probab=70.90 E-value=1.6 Score=37.47 Aligned_cols=42 Identities=24% Similarity=0.593 Sum_probs=34.3
Q ss_pred eeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhh
Q psy13347 138 CHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 138 C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
|-+ |++.|....-|..|++. |.|+|. +|-|.+.+--.|..|.
T Consensus 13 cwy--cnrefddekiliqhqka----khfkch--ichkkl~sgpglsihc 54 (341)
T KOG2893|consen 13 CWY--CNREFDDEKILIQHQKA----KHFKCH--ICHKKLFSGPGLSIHC 54 (341)
T ss_pred eee--cccccchhhhhhhhhhh----ccceee--eehhhhccCCCceeeh
Confidence 756 99999999999999874 349998 9998887777787774
No 81
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=70.77 E-value=2.9 Score=24.24 Aligned_cols=24 Identities=33% Similarity=1.077 Sum_probs=17.2
Q ss_pred EEeeccccccccCCchhHHHHHhhhCCCCceecCccccCC
Q psy13347 136 HICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGK 175 (339)
Q Consensus 136 ~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~ 175 (339)
|+|.+ ||..+.-.. .++.|+ .|+.
T Consensus 2 ~~C~~--CGy~y~~~~------------~~~~CP--~Cg~ 25 (33)
T cd00350 2 YVCPV--CGYIYDGEE------------APWVCP--VCGA 25 (33)
T ss_pred EECCC--CCCEECCCc------------CCCcCc--CCCC
Confidence 78877 888765332 678888 8875
No 82
>KOG2186|consensus
Probab=69.58 E-value=3.4 Score=35.93 Aligned_cols=46 Identities=22% Similarity=0.527 Sum_probs=34.3
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCChHHHHhhh
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
.|.|.+ ||.... +..|.+|+-..++ .-|.|. .|++.|.. .+++.|.
T Consensus 3 ~FtCnv--CgEsvK-Kp~vekH~srCrn-~~fSCI--DC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 3 FFTCNV--CGESVK-KPQVEKHMSRCRN-AYFSCI--DCGKTFER-VSYKNHT 48 (276)
T ss_pred EEehhh--hhhhcc-ccchHHHHHhccC-CeeEEe--eccccccc-chhhhhh
Confidence 478877 888765 4457778877666 568898 89999987 6677775
No 83
>KOG4124|consensus
Probab=69.52 E-value=0.88 Score=41.27 Aligned_cols=53 Identities=26% Similarity=0.630 Sum_probs=43.1
Q ss_pred CCeEEeeccccccccCCchhHHHHHhh-hC------------------CCCceecCccccCCccCChHHHHhhh
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRW-HT------------------GERPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~-H~------------------~~k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
.++|+|.++.|.+.++....|+.|... |. ..|+|+|+ +|.++++....|+-|+
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~--i~~~~~k~~~~l~~~~ 418 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCE--VCSKRYKNLNGLKYHR 418 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccCh--hhhhhhccCCCCCcee
Confidence 578999999999999999888888653 31 14789999 9999998887777665
No 84
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=69.37 E-value=3.1 Score=26.63 Aligned_cols=25 Identities=40% Similarity=0.862 Sum_probs=19.3
Q ss_pred CcCCCccccccCCh-----hHHHHHHH-hhc
Q psy13347 209 RFQCSECHKKFMRS-----DHLQKHIK-THS 233 (339)
Q Consensus 209 ~~~C~~C~k~F~~~-----~~L~~H~~-~h~ 233 (339)
.-.|..|++.+... ++|.+|++ +|.
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 35699999988765 58999988 553
No 85
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=69.31 E-value=3.8 Score=22.39 Aligned_cols=10 Identities=20% Similarity=0.421 Sum_probs=7.1
Q ss_pred CCCccccccC
Q psy13347 236 RCTSLVMNFS 245 (339)
Q Consensus 236 ~C~~C~~~f~ 245 (339)
.|+.||..|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 3888887774
No 86
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=68.22 E-value=3.2 Score=24.78 Aligned_cols=34 Identities=24% Similarity=0.606 Sum_probs=24.5
Q ss_pred eecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccccccC
Q psy13347 166 FVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFM 220 (339)
Q Consensus 166 ~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~ 220 (339)
..|+ .|+..|.-..... -.+.+..+|+.|+..|.
T Consensus 3 i~CP--~C~~~f~v~~~~l-------------------~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 3 ITCP--NCQTRFRVPDDKL-------------------PAGGRKVRCPKCGHVFR 36 (37)
T ss_pred EECC--CCCceEEcCHHHc-------------------ccCCcEEECCCCCcEee
Confidence 5788 9999998776531 12345689999998874
No 87
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=67.88 E-value=4.3 Score=25.43 Aligned_cols=9 Identities=22% Similarity=0.822 Sum_probs=6.9
Q ss_pred cceeCCCCC
Q psy13347 55 VACTCPNCT 63 (339)
Q Consensus 55 ~~~~C~~C~ 63 (339)
..|+|+.|+
T Consensus 2 ~~y~C~~CG 10 (46)
T PRK00398 2 AEYKCARCG 10 (46)
T ss_pred CEEECCCCC
Confidence 358888888
No 88
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=65.71 E-value=3.5 Score=32.50 Aligned_cols=26 Identities=38% Similarity=0.745 Sum_probs=16.7
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCC
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTR 236 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~ 236 (339)
.-..|-+|||.|.. |++|++.|+|..
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred CeeEEccCCcccch---HHHHHHHccCCC
Confidence 33679999999976 589999998765
No 89
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=62.93 E-value=5.2 Score=23.38 Aligned_cols=10 Identities=20% Similarity=0.507 Sum_probs=8.2
Q ss_pred CCcCCCcccc
Q psy13347 208 KRFQCSECHK 217 (339)
Q Consensus 208 k~~~C~~C~k 217 (339)
.|..|++|+.
T Consensus 17 ~p~~CP~Cg~ 26 (34)
T cd00729 17 APEKCPICGA 26 (34)
T ss_pred CCCcCcCCCC
Confidence 5679999985
No 90
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=62.31 E-value=2.5 Score=26.97 Aligned_cols=29 Identities=14% Similarity=0.374 Sum_probs=20.1
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCCCCccccc
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTRCTSLVMN 243 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~ 243 (339)
..|.|..||+.|.. ...-.+.+|++|+..
T Consensus 5 ~~Y~C~~Cg~~~~~-------~~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVEL-------DQETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeeh-------hhccCceeCCCCCcE
Confidence 34999999999921 122345569999865
No 91
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=61.94 E-value=5.4 Score=30.23 Aligned_cols=26 Identities=15% Similarity=0.432 Sum_probs=22.3
Q ss_pred CcCC----CccccccCChhHHHHHHHhhcC
Q psy13347 209 RFQC----SECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 209 ~~~C----~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
-|.| ..|+..+.+...+++|++.++|
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 3889 8899999999999999987764
No 92
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=61.54 E-value=8.3 Score=23.78 Aligned_cols=25 Identities=40% Similarity=0.683 Sum_probs=16.3
Q ss_pred CCCeEEeeccccccccCCc----hhHHHHHh
Q psy13347 132 RKKVHICHIEGCNKVYGKT----SHLRAHLR 158 (339)
Q Consensus 132 ~~~~~~C~~~~C~k~F~~~----~~L~~H~~ 158 (339)
+....+|.+ |++.+... +.|.+|++
T Consensus 13 ~~~~a~C~~--C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 13 DKKKAKCKY--CGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp CSS-EEETT--TTEE-----SSTHHHHHHHH
T ss_pred CcCeEEeCC--CCeEEeeCCCcHHHHHHhhh
Confidence 456789988 99988764 78999984
No 93
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=61.21 E-value=12 Score=33.55 Aligned_cols=103 Identities=13% Similarity=0.158 Sum_probs=53.8
Q ss_pred CCCCCccceeCCCCCCCCccccccccccCCCCCCchhH---H---HHHHhhhhhcccccccccCc--ccccccccccccc
Q psy13347 49 KPRSRRVACTCPNCTEGGEREMFSYMLHGCLWGNTMEY---A---LAARVLKLKIYKTTAEVNNK--IRLSSYFYPALIS 120 (339)
Q Consensus 49 ~~~~~~~~~~C~~C~~~~~~~~~~~~~~~C~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~~--~~c~~c~~~~~~~ 120 (339)
.+......|.||.|.-.--. ... .|..|..+.. . .-.+++.++.|.......+. -.|-.|.-+|...
T Consensus 301 Hs~~~~gGy~CP~CktkVCs--LPi---~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~ 375 (421)
T COG5151 301 HSEVKGGGYECPVCKTKVCS--LPI---SCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKP 375 (421)
T ss_pred eeeeccCceeCCcccceeec--CCc---cCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCC
Confidence 34566778999999832111 111 3333333222 1 11234445666555444432 2356666666443
Q ss_pred ccccccccccCCCCeEEeeccccccccCCchhHHHHHhhh
Q psy13347 121 TLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWH 160 (339)
Q Consensus 121 ~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H 160 (339)
..-.- -.-+....|.|+. |...|-..-+.-.|...|
T Consensus 376 ~~~~~--~~~~ss~rY~Ce~--CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 376 PVSPF--DESTSSGRYQCEL--CKSTFCSDCDVFIHETLH 411 (421)
T ss_pred CCCcc--cccccccceechh--hhhhhhhhhHHHHHHHHh
Confidence 21111 0112245688855 888888888888887766
No 94
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=60.86 E-value=8.3 Score=31.49 Aligned_cols=34 Identities=26% Similarity=0.581 Sum_probs=24.1
Q ss_pred CCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCcccccc
Q psy13347 161 TGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKF 219 (339)
Q Consensus 161 ~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F 219 (339)
.+..-|.|+ .|+..|.....+. ..|.|+.||...
T Consensus 105 ~~~~~Y~Cp--~c~~r~tf~eA~~-----------------------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICP--NMCVRFTFNEAME-----------------------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECC--CCCcEeeHHHHHH-----------------------cCCcCCCCCCEe
Confidence 344568888 8888888777663 248888888654
No 95
>PF15269 zf-C2H2_7: Zinc-finger
Probab=60.55 E-value=5.2 Score=24.77 Aligned_cols=23 Identities=26% Similarity=0.622 Sum_probs=19.9
Q ss_pred CcCCCccccccCChhHHHHHHHh
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKT 231 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~ 231 (339)
.|+|-.|......++.|-.||+-
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 36799999999999999999974
No 96
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=60.14 E-value=5.7 Score=31.07 Aligned_cols=57 Identities=19% Similarity=0.401 Sum_probs=34.6
Q ss_pred CCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCC---CccccccCChhHHHHHHHhhcCCCC
Q psy13347 161 TGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQC---SECHKKFMRSDHLQKHIKTHSKTRC 237 (339)
Q Consensus 161 ~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C---~~C~k~F~~~~~L~~H~~~h~~~~C 237 (339)
...+-|+|+ +|..+......| ||-.| .+|+.-+ ..|..|-.+| ..|
T Consensus 76 ~d~~lYeCn--IC~etS~ee~FL------------------------KPneCCgY~iCn~Cy---a~LWK~~~~y--pvC 124 (140)
T PF05290_consen 76 LDPKLYECN--ICKETSAEERFL------------------------KPNECCGYSICNACY---ANLWKFCNLY--PVC 124 (140)
T ss_pred cCCCceecc--CcccccchhhcC------------------------CcccccchHHHHHHH---HHHHHHcccC--CCC
Confidence 455778888 888877665444 45555 2233322 4455555444 348
Q ss_pred CccccccCCcc
Q psy13347 238 TSLVMNFSDNT 248 (339)
Q Consensus 238 ~~C~~~f~~~~ 248 (339)
+.|.-+|++.+
T Consensus 125 PvCkTSFKss~ 135 (140)
T PF05290_consen 125 PVCKTSFKSSS 135 (140)
T ss_pred Ccccccccccc
Confidence 89988887654
No 97
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=59.91 E-value=8 Score=21.07 Aligned_cols=20 Identities=20% Similarity=0.679 Sum_probs=15.2
Q ss_pred cCCCccccccCChhHHHHHHH
Q psy13347 210 FQCSECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~ 230 (339)
..|++|++.+ ....+..|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 4689999888 5567777875
No 98
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=59.65 E-value=6.4 Score=30.55 Aligned_cols=14 Identities=7% Similarity=0.010 Sum_probs=6.6
Q ss_pred CCcCCCccccccCC
Q psy13347 208 KRFQCSECHKKFMR 221 (339)
Q Consensus 208 k~~~C~~C~k~F~~ 221 (339)
.|..|+.||..|.-
T Consensus 25 ~p~vcP~cg~~~~~ 38 (129)
T TIGR02300 25 RPAVSPYTGEQFPP 38 (129)
T ss_pred CCccCCCcCCccCc
Confidence 34445555544433
No 99
>KOG4377|consensus
Probab=57.96 E-value=5.1 Score=37.34 Aligned_cols=126 Identities=15% Similarity=0.249 Sum_probs=75.3
Q ss_pred ccccccCcccc--ccccccccccccccccccccCCC------------CeEEeeccccccccCCchhHHHHHhhhCCC--
Q psy13347 100 TTAEVNNKIRL--SSYFYPALISTLFSCSLRTADRK------------KVHICHIEGCNKVYGKTSHLRAHLRWHTGE-- 163 (339)
Q Consensus 100 ~~~~~~~~~~c--~~c~~~~~~~~~l~~h~~~h~~~------------~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~-- 163 (339)
....+..||.| ..|++.+..+..+.+|.+.|... ..|.|....|.| +.++...|-..|+..
T Consensus 264 ~~e~~rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n 340 (480)
T KOG4377|consen 264 HVEAGREHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRN 340 (480)
T ss_pred eeeccchhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCcccccccc
Confidence 33446677777 57888888888888998888543 236788788998 566777787777432
Q ss_pred -----CceecCccccC--CccCChHHHHhhhcccCcccch--------hccccceecCC------CCcCCCc--cccccC
Q psy13347 164 -----RPFVCTWQFCG--KRFTRSDELQVNGGRFGIERTV--------VRILPMTWCRE------KRFQCSE--CHKKFM 220 (339)
Q Consensus 164 -----k~~~C~~~~C~--~~f~~~~~L~~H~~~~~~~~~~--------~~~~~~~h~~~------k~~~C~~--C~k~F~ 220 (339)
..|.|- .|+ -+|+ ...|...|...... .+.......+= .-|.|.. |+..|.
T Consensus 341 ~GfrrthfhC~--r~gCTdtfK----~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~ 414 (480)
T KOG4377|consen 341 NGFRRTHFHCQ--RIGCTDTFK----DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLY 414 (480)
T ss_pred CceecceeEEe--ccCCccccc----cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEE
Confidence 247787 566 5555 33343222211110 00000011110 1134443 888999
Q ss_pred ChhHHHHHHHhhcC
Q psy13347 221 RSDHLQKHIKTHSK 234 (339)
Q Consensus 221 ~~~~L~~H~~~h~~ 234 (339)
..+.+..|.|.|..
T Consensus 415 s~sqm~shkrkheR 428 (480)
T KOG4377|consen 415 SVSQMASHKRKHER 428 (480)
T ss_pred ehhhhhhhhhhhhh
Confidence 99999999888854
No 100
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=56.77 E-value=7.2 Score=39.92 Aligned_cols=8 Identities=25% Similarity=0.800 Sum_probs=6.0
Q ss_pred ceeCCCCC
Q psy13347 56 ACTCPNCT 63 (339)
Q Consensus 56 ~~~C~~C~ 63 (339)
...|..|+
T Consensus 435 ~l~C~~Cg 442 (730)
T COG1198 435 LLLCRDCG 442 (730)
T ss_pred eeecccCC
Confidence 45688888
No 101
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=55.43 E-value=11 Score=31.52 Aligned_cols=19 Identities=32% Similarity=0.632 Sum_probs=13.5
Q ss_pred CCceecCccccCCccCChHHH
Q psy13347 163 ERPFVCTWQFCGKRFTRSDEL 183 (339)
Q Consensus 163 ~k~~~C~~~~C~~~f~~~~~L 183 (339)
..-|.|+ .|+..|.....+
T Consensus 115 ~~~Y~Cp--~C~~rytf~eA~ 133 (178)
T PRK06266 115 NMFFFCP--NCHIRFTFDEAM 133 (178)
T ss_pred CCEEECC--CCCcEEeHHHHh
Confidence 3458887 788888776654
No 102
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=54.71 E-value=13 Score=27.72 Aligned_cols=13 Identities=23% Similarity=0.577 Sum_probs=9.8
Q ss_pred CCCccceeCCCCC
Q psy13347 51 RSRRVACTCPNCT 63 (339)
Q Consensus 51 ~~~~~~~~C~~C~ 63 (339)
...+..|.||+|+
T Consensus 16 ~klpt~f~CP~Cg 28 (99)
T PRK14892 16 PKLPKIFECPRCG 28 (99)
T ss_pred cCCCcEeECCCCC
Confidence 3446679999999
No 103
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=53.55 E-value=9.5 Score=25.34 Aligned_cols=10 Identities=20% Similarity=0.830 Sum_probs=8.4
Q ss_pred CCcCCCcccc
Q psy13347 208 KRFQCSECHK 217 (339)
Q Consensus 208 k~~~C~~C~k 217 (339)
.+|.|+.||.
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 5799999985
No 104
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=53.30 E-value=13 Score=38.02 Aligned_cols=56 Identities=16% Similarity=0.238 Sum_probs=31.9
Q ss_pred ccccccccCccccccccccccccc---cccccccccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccC
Q psy13347 98 YKTTAEVNNKIRLSSYFYPALIST---LFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCG 174 (339)
Q Consensus 98 ~~~~~~~~~~~~c~~c~~~~~~~~---~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~ 174 (339)
|-.+.....-+.|..|++.+.... .|. .|.......|.+ ||+. ...|..|+ .||
T Consensus 426 flnRRGys~~l~C~~Cg~v~~Cp~Cd~~lt----~H~~~~~L~CH~--Cg~~---------------~~~p~~Cp--~Cg 482 (730)
T COG1198 426 FLNRRGYAPLLLCRDCGYIAECPNCDSPLT----LHKATGQLRCHY--CGYQ---------------EPIPQSCP--ECG 482 (730)
T ss_pred EEccCCccceeecccCCCcccCCCCCcceE----EecCCCeeEeCC--CCCC---------------CCCCCCCC--CCC
Confidence 333444444555666655543322 233 344446678877 8843 34678998 998
Q ss_pred Cc
Q psy13347 175 KR 176 (339)
Q Consensus 175 ~~ 176 (339)
..
T Consensus 483 s~ 484 (730)
T COG1198 483 SE 484 (730)
T ss_pred CC
Confidence 66
No 105
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=53.19 E-value=2.8 Score=25.29 Aligned_cols=30 Identities=13% Similarity=0.473 Sum_probs=19.2
Q ss_pred cCCCccccccCChhHHHHHHHhhcCCCCCccccc
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHSKTRCTSLVMN 243 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~ 243 (339)
|.|..||..|........ -....|+.|+..
T Consensus 6 y~C~~Cg~~fe~~~~~~~----~~~~~CP~Cg~~ 35 (41)
T smart00834 6 YRCEDCGHTFEVLQKISD----DPLATCPECGGD 35 (41)
T ss_pred EEcCCCCCEEEEEEecCC----CCCCCCCCCCCc
Confidence 899999998865443211 122239999874
No 106
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=52.81 E-value=7.5 Score=29.51 Aligned_cols=31 Identities=26% Similarity=0.714 Sum_probs=23.6
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccCCh
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFTRS 180 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~ 180 (339)
...| +.||+.|.-. +..|..|+ .||..|.-.
T Consensus 9 KR~C--p~CG~kFYDL-----------nk~PivCP--~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTC--PSCGAKFYDL-----------NKDPIVCP--KCGTEFPPE 39 (108)
T ss_pred cccC--CCCcchhccC-----------CCCCccCC--CCCCccCcc
Confidence 3479 4599999753 34688999 999999766
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=51.74 E-value=15 Score=29.57 Aligned_cols=19 Identities=16% Similarity=0.429 Sum_probs=12.5
Q ss_pred CCceecCccccCCccCChHHH
Q psy13347 163 ERPFVCTWQFCGKRFTRSDEL 183 (339)
Q Consensus 163 ~k~~~C~~~~C~~~f~~~~~L 183 (339)
..-|.|+ .|+..|.....+
T Consensus 97 ~~~Y~Cp--~C~~~y~~~ea~ 115 (147)
T smart00531 97 NAYYKCP--NCQSKYTFLEAN 115 (147)
T ss_pred CcEEECc--CCCCEeeHHHHH
Confidence 3457787 788777755443
No 108
>KOG4167|consensus
Probab=50.95 E-value=3.7 Score=41.11 Aligned_cols=26 Identities=38% Similarity=0.740 Sum_probs=23.4
Q ss_pred CCcCCCccccccCChhHHHHHHHhhc
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHS 233 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~ 233 (339)
-.|.|..|+|.|-.-.++..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 35999999999999999999999985
No 109
>PRK05978 hypothetical protein; Provisional
Probab=50.49 E-value=8 Score=31.14 Aligned_cols=15 Identities=40% Similarity=0.786 Sum_probs=10.7
Q ss_pred CCCccceeCCCCCCC
Q psy13347 51 RSRRVACTCPNCTEG 65 (339)
Q Consensus 51 ~~~~~~~~C~~C~~~ 65 (339)
..+-...+||.|++.
T Consensus 28 ~~rGl~grCP~CG~G 42 (148)
T PRK05978 28 MWRGFRGRCPACGEG 42 (148)
T ss_pred HHHHHcCcCCCCCCC
Confidence 344556899999954
No 110
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=49.90 E-value=1e+02 Score=27.30 Aligned_cols=24 Identities=8% Similarity=0.181 Sum_probs=18.3
Q ss_pred CCccccccCCccccccccccccCC
Q psy13347 237 CTSLVMNFSDNTVVTEDSKLELSG 260 (339)
Q Consensus 237 C~~C~~~f~~~~~l~~H~~~~~~~ 260 (339)
|+.|+........|..-.++|.-+
T Consensus 212 CPKCg~et~eTkdLSmStR~hkyG 235 (314)
T PF06524_consen 212 CPKCGYETQETKDLSMSTRSHKYG 235 (314)
T ss_pred CCCCCCcccccccceeeeecchhc
Confidence 888888888777787777777544
No 111
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.45 E-value=12 Score=30.73 Aligned_cols=23 Identities=30% Similarity=0.691 Sum_probs=19.0
Q ss_pred ceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCccc
Q psy13347 165 PFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECH 216 (339)
Q Consensus 165 ~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~ 216 (339)
-|.|+ +||..+. ++-|-+|++||
T Consensus 134 ~~vC~--vCGy~~~---------------------------ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCP--VCGYTHE---------------------------GEAPEVCPICG 156 (166)
T ss_pred EEEcC--CCCCccc---------------------------CCCCCcCCCCC
Confidence 59999 9986643 56889999999
No 112
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=49.00 E-value=5 Score=34.60 Aligned_cols=44 Identities=25% Similarity=0.568 Sum_probs=28.3
Q ss_pred CCeEEeeccccccccCCchhHHHHHhh---h-------CCCCc-----eecCccccCCccCCh
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRW---H-------TGERP-----FVCTWQFCGKRFTRS 180 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~---H-------~~~k~-----~~C~~~~C~~~f~~~ 180 (339)
++.+.|++ |++.|.++.-+....+. . .+..| ..|+ .||.+|...
T Consensus 3 ~k~~~CPv--C~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP--~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPV--CGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCP--HCGYAAFEE 61 (214)
T ss_pred CCceECCC--CCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECC--CCCCccccc
Confidence 45678877 88888877554444432 1 12223 4799 999998755
No 113
>PRK14873 primosome assembly protein PriA; Provisional
Probab=47.53 E-value=19 Score=36.72 Aligned_cols=8 Identities=25% Similarity=0.982 Sum_probs=5.8
Q ss_pred ceeCCCCC
Q psy13347 56 ACTCPNCT 63 (339)
Q Consensus 56 ~~~C~~C~ 63 (339)
...|..|+
T Consensus 383 ~l~C~~Cg 390 (665)
T PRK14873 383 SLACARCR 390 (665)
T ss_pred eeEhhhCc
Confidence 34788888
No 114
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=46.41 E-value=15 Score=27.70 Aligned_cols=27 Identities=22% Similarity=0.385 Sum_probs=22.6
Q ss_pred CeEEe----eccccccccCCchhHHHHHhhhCC
Q psy13347 134 KVHIC----HIEGCNKVYGKTSHLRAHLRWHTG 162 (339)
Q Consensus 134 ~~~~C----~~~~C~k~F~~~~~L~~H~~~H~~ 162 (339)
..|.| .. |++.+.+...+.+|++.++|
T Consensus 79 ~G~~C~~~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCC--CCcEeccHHHHHHHHHHhcC
Confidence 45899 66 99999999999999987654
No 115
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=45.47 E-value=7.5 Score=27.83 Aligned_cols=12 Identities=25% Similarity=0.628 Sum_probs=4.2
Q ss_pred CCccceeCCCCC
Q psy13347 52 SRRVACTCPNCT 63 (339)
Q Consensus 52 ~~~~~~~C~~C~ 63 (339)
.-+..|.||+|+
T Consensus 18 ~l~~~F~CPfC~ 29 (81)
T PF05129_consen 18 KLPKVFDCPFCN 29 (81)
T ss_dssp --SS----TTT-
T ss_pred CCCceEcCCcCC
Confidence 445779999999
No 116
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=45.43 E-value=23 Score=27.01 Aligned_cols=31 Identities=23% Similarity=0.559 Sum_probs=24.3
Q ss_pred CCcCCCccccccCChhHHHHHHHhhcCCCCCccc
Q psy13347 208 KRFQCSECHKKFMRSDHLQKHIKTHSKTRCTSLV 241 (339)
Q Consensus 208 k~~~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~ 241 (339)
..|.|+.|...|-..-+.-.|...|. |+-|.
T Consensus 80 ~~y~C~~C~~~FC~dCD~fiHe~Lh~---CPGC~ 110 (112)
T TIGR00622 80 HRYVCAVCKNVFCVDCDVFVHESLHC---CPGCI 110 (112)
T ss_pred cceeCCCCCCccccccchhhhhhccC---CcCCC
Confidence 45999999999988888888888776 55553
No 117
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=44.74 E-value=13 Score=31.05 Aligned_cols=31 Identities=6% Similarity=0.052 Sum_probs=17.3
Q ss_pred CccccccccccccccccccccccccCCCCeEEeecccccccc
Q psy13347 106 NKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVY 147 (339)
Q Consensus 106 ~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F 147 (339)
..|.|..|...+....++ +..|.|+. ||...
T Consensus 116 ~~Y~Cp~C~~rytf~eA~---------~~~F~Cp~--Cg~~L 146 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAM---------EYGFRCPQ--CGEML 146 (178)
T ss_pred CEEECCCCCcEEeHHHHh---------hcCCcCCC--CCCCC
Confidence 455566665555555443 23577855 87653
No 118
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=43.02 E-value=10 Score=29.66 Aligned_cols=24 Identities=38% Similarity=0.562 Sum_probs=20.0
Q ss_pred cCCCccccccCChhHHHHHHHhhcCCC
Q psy13347 210 FQCSECHKKFMRSDHLQKHIKTHSKTR 236 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~~h~~~~ 236 (339)
..|-.+||.|+ +|++|+.+|.+..
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCC
Confidence 46889999996 5899999998754
No 119
>KOG4167|consensus
Probab=42.37 E-value=6.6 Score=39.40 Aligned_cols=27 Identities=30% Similarity=0.544 Sum_probs=24.1
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhC
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHT 161 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~ 161 (339)
...|.|.. |+|+|....++..||++|.
T Consensus 790 ~giFpCre--C~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRE--CGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHH--HHHHHHHHhhhhHHHHHHH
Confidence 46799977 9999999999999999995
No 120
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=42.35 E-value=26 Score=34.48 Aligned_cols=15 Identities=20% Similarity=0.481 Sum_probs=8.1
Q ss_pred cCCCCeEEeeccccccc
Q psy13347 130 ADRKKVHICHIEGCNKV 146 (339)
Q Consensus 130 h~~~~~~~C~~~~C~k~ 146 (339)
|.......|.+ ||..
T Consensus 235 h~~~~~l~Ch~--Cg~~ 249 (505)
T TIGR00595 235 HKKEGKLRCHY--CGYQ 249 (505)
T ss_pred ecCCCeEEcCC--CcCc
Confidence 43445566666 6643
No 121
>KOG2807|consensus
Probab=41.20 E-value=65 Score=29.41 Aligned_cols=95 Identities=15% Similarity=0.146 Sum_probs=52.8
Q ss_pred CCccceeCCCCCCCCccccccccccCCCCCCchhH------HHHHHhhhhhcccccccccCcc--ccccccccccccccc
Q psy13347 52 SRRVACTCPNCTEGGEREMFSYMLHGCLWGNTMEY------ALAARVLKLKIYKTTAEVNNKI--RLSSYFYPALISTLF 123 (339)
Q Consensus 52 ~~~~~~~C~~C~~~~~~~~~~~~~~~C~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~--~c~~c~~~~~~~~~l 123 (339)
.....|.||.|...-=.-+. .|..|...-. ....+++.++.|.........- .|-.|+
T Consensus 272 ~~~~Gy~CP~CkakvCsLP~-----eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~C~--------- 337 (378)
T KOG2807|consen 272 LSGGGYFCPQCKAKVCSLPI-----ECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFACQ--------- 337 (378)
T ss_pred cccCceeCCcccCeeecCCc-----cCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceeeec---------
Confidence 45667999999932111111 2333332221 2223445566665555444211 244442
Q ss_pred cccccccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccC
Q psy13347 124 SCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCG 174 (339)
Q Consensus 124 ~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~ 174 (339)
..-.....|.|.. |...|-..-+...|...| .|+ .|.
T Consensus 338 ----~~~~~~~~y~C~~--Ck~~FCldCDv~iHesLh------~Cp--gCe 374 (378)
T KOG2807|consen 338 ----GELLSSGRYRCES--CKNVFCLDCDVFIHESLH------NCP--GCE 374 (378)
T ss_pred ----cccCCCCcEEchh--ccceeeccchHHHHhhhh------cCC--CcC
Confidence 1123355799966 999998888888887776 476 775
No 122
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.93 E-value=15 Score=27.69 Aligned_cols=15 Identities=13% Similarity=0.138 Sum_probs=11.3
Q ss_pred CCCcCCCccccccCC
Q psy13347 207 EKRFQCSECHKKFMR 221 (339)
Q Consensus 207 ~k~~~C~~C~k~F~~ 221 (339)
..|..|+.||++|..
T Consensus 24 rdPiVsPytG~s~P~ 38 (129)
T COG4530 24 RDPIVSPYTGKSYPR 38 (129)
T ss_pred CCccccCcccccchH
Confidence 367888888888844
No 123
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=40.14 E-value=23 Score=22.11 Aligned_cols=8 Identities=38% Similarity=1.265 Sum_probs=6.2
Q ss_pred ceeCCCCC
Q psy13347 56 ACTCPNCT 63 (339)
Q Consensus 56 ~~~C~~C~ 63 (339)
.|.||.|+
T Consensus 18 g~~CP~Cg 25 (46)
T PF12760_consen 18 GFVCPHCG 25 (46)
T ss_pred CCCCCCCC
Confidence 37788888
No 124
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=39.92 E-value=9.2 Score=32.86 Aligned_cols=30 Identities=27% Similarity=0.588 Sum_probs=22.2
Q ss_pred CCCCcCCCccccccCChhHHHHHHHhhcCC
Q psy13347 206 REKRFQCSECHKKFMRSDHLQKHIKTHSKT 235 (339)
Q Consensus 206 ~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~ 235 (339)
.+..|.|..|+|.|.-..-.++|+..-|.+
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 345699999999999999999998854443
No 125
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=38.96 E-value=23 Score=27.97 Aligned_cols=28 Identities=39% Similarity=0.705 Sum_probs=16.8
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCCc
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERP 165 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~ 165 (339)
+....|-. ||+.|.. |++|++.|+|-.|
T Consensus 70 ~d~i~cle--cGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 70 PDYIICLE--CGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp SS-EE-TB--T--EESB---HHHHHHHTT-S-H
T ss_pred cCeeEEcc--CCcccch---HHHHHHHccCCCH
Confidence 45578966 9999974 6999999977543
No 126
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=38.54 E-value=43 Score=30.81 Aligned_cols=28 Identities=18% Similarity=0.495 Sum_probs=22.7
Q ss_pred eecCCCCcCCCccc-cccCChhHHHHHHH
Q psy13347 203 TWCREKRFQCSECH-KKFMRSDHLQKHIK 230 (339)
Q Consensus 203 ~h~~~k~~~C~~C~-k~F~~~~~L~~H~~ 230 (339)
.|.-.+-|.|.+|| +.+.-+..+.+|..
T Consensus 368 lhgLd~ef~CEICgNyvy~GR~~FdrHF~ 396 (470)
T COG5188 368 LHGLDIEFECEICGNYVYYGRDRFDRHFE 396 (470)
T ss_pred hcCCCcceeeeecccccccchHHHHhhhh
Confidence 45557789999999 88888888888854
No 127
>KOG2593|consensus
Probab=38.22 E-value=14 Score=34.81 Aligned_cols=38 Identities=29% Similarity=0.598 Sum_probs=25.9
Q ss_pred CCCCceecCccccCCccCChHHHHhhhcccCcccchhccccceecCCCCcCCCcccc
Q psy13347 161 TGERPFVCTWQFCGKRFTRSDELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHK 217 (339)
Q Consensus 161 ~~~k~~~C~~~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k 217 (339)
+...-|.|+ .|.+.|.....|..- -...-.|.|..|+-
T Consensus 124 t~~~~Y~Cp--~C~kkyt~Lea~~L~-----------------~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 124 TNVAGYVCP--NCQKKYTSLEALQLL-----------------DNETGEFHCENCGG 161 (436)
T ss_pred cccccccCC--ccccchhhhHHHHhh-----------------cccCceEEEecCCC
Confidence 445569999 999999887766421 11123489999874
No 128
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=37.19 E-value=22 Score=25.65 Aligned_cols=15 Identities=13% Similarity=0.523 Sum_probs=10.0
Q ss_pred CCeEEeeccccccccCC
Q psy13347 133 KKVHICHIEGCNKVYGK 149 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~ 149 (339)
...+.|.. |+..|.-
T Consensus 51 ~GIW~C~k--Cg~~fAG 65 (89)
T COG1997 51 TGIWKCRK--CGAKFAG 65 (89)
T ss_pred cCeEEcCC--CCCeecc
Confidence 45678855 8887753
No 129
>KOG2785|consensus
Probab=36.31 E-value=37 Score=31.62 Aligned_cols=50 Identities=20% Similarity=0.432 Sum_probs=42.2
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCC-----------------------CCceecCccccC---CccCChHHHHhhhc
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTG-----------------------ERPFVCTWQFCG---KRFTRSDELQVNGG 188 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~-----------------------~k~~~C~~~~C~---~~f~~~~~L~~H~~ 188 (339)
|-.|-+ |++.|.....-..||..++| ..-|.|. .|+ +.|.+....+.||.
T Consensus 166 Pt~CLf--C~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL--~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 166 PTDCLF--CDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICL--FCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred Ccceee--cCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEE--EeccccCcccccHHHHHHHh
Confidence 467988 99999999999999998776 2247888 898 99999999999984
No 130
>PF14353 CpXC: CpXC protein
Probab=35.44 E-value=4.4 Score=31.70 Aligned_cols=46 Identities=17% Similarity=0.277 Sum_probs=29.4
Q ss_pred cCCCccccccCCh----------hHHHHHHHhhcCCC----CCccccccCCccccccccccc
Q psy13347 210 FQCSECHKKFMRS----------DHLQKHIKTHSKTR----CTSLVMNFSDNTVVTEDSKLE 257 (339)
Q Consensus 210 ~~C~~C~k~F~~~----------~~L~~H~~~h~~~~----C~~C~~~f~~~~~l~~H~~~~ 257 (339)
..|+.|+..|... ..|+. ++-.|.- |+.||..|.-...+.-|-..+
T Consensus 2 itCP~C~~~~~~~v~~~I~~~~~p~l~e--~il~g~l~~~~CP~Cg~~~~~~~p~lY~D~~~ 61 (128)
T PF14353_consen 2 ITCPHCGHEFEFEVWTSINADEDPELKE--KILDGSLFSFTCPSCGHKFRLEYPLLYHDPEK 61 (128)
T ss_pred cCCCCCCCeeEEEEEeEEcCcCCHHHHH--HHHcCCcCEEECCCCCCceecCCCEEEEcCCC
Confidence 4799999888533 22222 2234433 999999998777776665443
No 131
>KOG2636|consensus
Probab=34.92 E-value=26 Score=33.27 Aligned_cols=35 Identities=17% Similarity=0.491 Sum_probs=27.3
Q ss_pred eecCCCCcCCCccc-cccCChhHHHHHHH--hh-cCCCC
Q psy13347 203 TWCREKRFQCSECH-KKFMRSDHLQKHIK--TH-SKTRC 237 (339)
Q Consensus 203 ~h~~~k~~~C~~C~-k~F~~~~~L~~H~~--~h-~~~~C 237 (339)
.|.-.+.|.|.+|| +++.-+..+.+|.. .| +|.+|
T Consensus 395 LHGL~~ey~CEICGNy~Y~GrkaF~RHF~EwRH~hGmrC 433 (497)
T KOG2636|consen 395 LHGLDIEYNCEICGNYVYKGRKAFDRHFNEWRHAHGMRC 433 (497)
T ss_pred hcCCCcccceeeccCccccCcHHHHHHhHHHHHhhccee
Confidence 45556789999999 99999999999964 34 35555
No 132
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=34.91 E-value=10 Score=30.84 Aligned_cols=16 Identities=44% Similarity=0.837 Sum_probs=9.6
Q ss_pred eecCccccCCccCChHHH
Q psy13347 166 FVCTWQFCGKRFTRSDEL 183 (339)
Q Consensus 166 ~~C~~~~C~~~f~~~~~L 183 (339)
++|+ .||++|.....+
T Consensus 29 ~~c~--~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECL--ACGKRFTTFERV 44 (154)
T ss_pred eecc--ccCCcceEeEec
Confidence 5666 666666655443
No 133
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=34.74 E-value=34 Score=26.88 Aligned_cols=55 Identities=16% Similarity=0.274 Sum_probs=35.0
Q ss_pred CccccccccccccccccccccccccCCCCeEEeeccccccccCCc--hhHHHHHhhhCCCCceecCccccCCccCChH
Q psy13347 106 NKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKT--SHLRAHLRWHTGERPFVCTWQFCGKRFTRSD 181 (339)
Q Consensus 106 ~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~--~~L~~H~~~H~~~k~~~C~~~~C~~~f~~~~ 181 (339)
.-|+|..|...+.....| ||-.| ||+..-.. ..|..|-..|. +|| .|..+|++..
T Consensus 79 ~lYeCnIC~etS~ee~FL----------KPneC----CgY~iCn~Cya~LWK~~~~yp-----vCP--vCkTSFKss~ 135 (140)
T PF05290_consen 79 KLYECNICKETSAEERFL----------KPNEC----CGYSICNACYANLWKFCNLYP-----VCP--VCKTSFKSSS 135 (140)
T ss_pred CceeccCcccccchhhcC----------Ccccc----cchHHHHHHHHHHHHHcccCC-----CCC--cccccccccc
Confidence 678888887666544433 44556 77543222 45666655552 799 9999998765
No 134
>COG3478 Predicted nucleic-acid-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=32.79 E-value=31 Score=23.31 Aligned_cols=25 Identities=20% Similarity=0.177 Sum_probs=14.3
Q ss_pred hcccccccccCcccccccccccccc
Q psy13347 96 KIYKTTAEVNNKIRLSSYFYPALIS 120 (339)
Q Consensus 96 ~~~~~~~~~~~~~~c~~c~~~~~~~ 120 (339)
++|....+.--++.|..|+++-.++
T Consensus 29 kifdvq~n~f~~itCk~CgYtEfY~ 53 (68)
T COG3478 29 KIFDVQNNKFIVITCKNCGYTEFYS 53 (68)
T ss_pred eeEEecccEEEEEEeccCCchhhee
Confidence 5555555555566666666654443
No 135
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.78 E-value=8.6 Score=33.05 Aligned_cols=28 Identities=21% Similarity=0.508 Sum_probs=19.9
Q ss_pred CCCcCCCccccccCChhHHHHHHHhhcC
Q psy13347 207 EKRFQCSECHKKFMRSDHLQKHIKTHSK 234 (339)
Q Consensus 207 ~k~~~C~~C~k~F~~~~~L~~H~~~h~~ 234 (339)
++.+.|++|+-.|....-+..-.|+-.+
T Consensus 17 kk~ieCPvC~tkFkkeev~tgsiRiiag 44 (267)
T COG1655 17 KKTIECPVCNTKFKKEEVKTGSIRIIAG 44 (267)
T ss_pred hceeccCcccchhhhhheeccceeEecc
Confidence 4669999999999887665544554433
No 136
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=32.63 E-value=24 Score=22.91 Aligned_cols=28 Identities=14% Similarity=0.415 Sum_probs=18.9
Q ss_pred CcCCCccccccCChhHHHHHHHhhcCCCCCcc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHSKTRCTSL 240 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C 240 (339)
-++|+.||..|......+. ..+..|+.|
T Consensus 28 ~W~C~~Cgh~w~~~v~~R~----~~~~~CP~C 55 (55)
T PF14311_consen 28 WWKCPKCGHEWKASVNDRT----RRGKGCPYC 55 (55)
T ss_pred EEECCCCCCeeEccHhhhc----cCCCCCCCC
Confidence 3889999988877655433 333348777
No 137
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=32.56 E-value=30 Score=21.21 Aligned_cols=7 Identities=57% Similarity=1.545 Sum_probs=5.4
Q ss_pred eeCCCCC
Q psy13347 57 CTCPNCT 63 (339)
Q Consensus 57 ~~C~~C~ 63 (339)
|+||.|+
T Consensus 1 m~Cp~Cg 7 (43)
T PF08271_consen 1 MKCPNCG 7 (43)
T ss_dssp ESBTTTS
T ss_pred CCCcCCc
Confidence 5788888
No 138
>KOG2593|consensus
Probab=32.08 E-value=25 Score=33.33 Aligned_cols=43 Identities=23% Similarity=0.468 Sum_probs=28.4
Q ss_pred cccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCcc
Q psy13347 128 RTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRF 177 (339)
Q Consensus 128 ~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f 177 (339)
+.-+....|.|+. |.+.|.....++ ..-...-.|.|. .|+--.
T Consensus 121 ~d~t~~~~Y~Cp~--C~kkyt~Lea~~---L~~~~~~~F~C~--~C~gel 163 (436)
T KOG2593|consen 121 RDDTNVAGYVCPN--CQKKYTSLEALQ---LLDNETGEFHCE--NCGGEL 163 (436)
T ss_pred hhccccccccCCc--cccchhhhHHHH---hhcccCceEEEe--cCCCch
Confidence 4445567899966 999998655443 333223459998 998543
No 139
>PRK05580 primosome assembly protein PriA; Validated
Probab=31.76 E-value=50 Score=33.86 Aligned_cols=7 Identities=29% Similarity=0.971 Sum_probs=4.3
Q ss_pred eeCCCCC
Q psy13347 57 CTCPNCT 63 (339)
Q Consensus 57 ~~C~~C~ 63 (339)
..|..|+
T Consensus 382 ~~C~~Cg 388 (679)
T PRK05580 382 LLCRDCG 388 (679)
T ss_pred eEhhhCc
Confidence 4566666
No 140
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=31.69 E-value=41 Score=21.28 Aligned_cols=6 Identities=50% Similarity=1.558 Sum_probs=4.0
Q ss_pred eCCCCC
Q psy13347 58 TCPNCT 63 (339)
Q Consensus 58 ~C~~C~ 63 (339)
.||.|+
T Consensus 2 FCp~Cg 7 (52)
T smart00661 2 FCPKCG 7 (52)
T ss_pred CCCCCC
Confidence 377777
No 141
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=30.78 E-value=38 Score=20.70 Aligned_cols=8 Identities=25% Similarity=0.899 Sum_probs=5.8
Q ss_pred ceeCCCCC
Q psy13347 56 ACTCPNCT 63 (339)
Q Consensus 56 ~~~C~~C~ 63 (339)
.|+|+.|+
T Consensus 5 ey~C~~Cg 12 (42)
T PF09723_consen 5 EYRCEECG 12 (42)
T ss_pred EEEeCCCC
Confidence 47777777
No 142
>KOG1280|consensus
Probab=30.74 E-value=46 Score=30.56 Aligned_cols=106 Identities=12% Similarity=0.108 Sum_probs=0.0
Q ss_pred cccCccccccccccccccccccccccccCCCCeEEeeccccccccCCchhHHHHHh--hhCCCCceecCccccCCccCCh
Q psy13347 103 EVNNKIRLSSYFYPALISTLFSCSLRTADRKKVHICHIEGCNKVYGKTSHLRAHLR--WHTGERPFVCTWQFCGKRFTRS 180 (339)
Q Consensus 103 ~~~~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~--~H~~~k~~~C~~~~C~~~f~~~ 180 (339)
+....+.|+-|+ +..-.-+.|+|-. |.-.-.-.+-+..-.. +|..+-|..| +|.+.=..-
T Consensus 4 ~rHe~v~CdgC~-------------k~~~t~rrYkCL~--C~DyDlC~sCyen~~tt~~H~~dHPmqc---il~~~dfeL 65 (381)
T KOG1280|consen 4 SRHEGVSCDGCG-------------KTAFTFRRYKCLR--CSDYDLCFSCYENGATTPIHDEDHPMQC---ILSRVDFEL 65 (381)
T ss_pred CCcCCceecccc-------------ccceeeeeeEeee--ecchhHHHHHhhcCCCCcccCCCCceeE---Eeeccceee
Q ss_pred HHHHhhhcccCcccchhccccceecCCCCcCCCccccccCChhHHHHHHHhhcCCC-----CCccc
Q psy13347 181 DELQVNGGRFGIERTVVRILPMTWCREKRFQCSECHKKFMRSDHLQKHIKTHSKTR-----CTSLV 241 (339)
Q Consensus 181 ~~L~~H~~~~~~~~~~~~~~~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~-----C~~C~ 241 (339)
..-..-+ .|-...-|.|++|++.=.+...|..|....+.+. |+.|.
T Consensus 66 ~f~Ge~i---------------~~y~~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 66 YFGGEPI---------------SHYDPQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred EecCccc---------------cccccccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
No 143
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.69 E-value=14 Score=25.79 Aligned_cols=34 Identities=18% Similarity=0.567 Sum_probs=22.2
Q ss_pred CcCCCccccccCChhHHHHHHHhhcCCCCCccccccCC
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKTHSKTRCTSLVMNFSD 246 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~f~~ 246 (339)
.|+|..|+..| .+..||.--.-..|+.|+-.|+.
T Consensus 12 ~Y~c~~cg~~~----dvvq~~~ddplt~ce~c~a~~kk 45 (82)
T COG2331 12 SYECTECGNRF----DVVQAMTDDPLTTCEECGARLKK 45 (82)
T ss_pred EEeecccchHH----HHHHhcccCccccChhhChHHHH
Confidence 49999999766 45555553322339999876543
No 144
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=30.58 E-value=33 Score=26.90 Aligned_cols=15 Identities=40% Similarity=0.955 Sum_probs=12.1
Q ss_pred ceecCccccCCccCChH
Q psy13347 165 PFVCTWQFCGKRFTRSD 181 (339)
Q Consensus 165 ~~~C~~~~C~~~f~~~~ 181 (339)
|++|. .||+.|..-+
T Consensus 1 PH~Ct--~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCT--KCGRVFEDGS 15 (131)
T ss_pred CcccC--cCCCCcCCCc
Confidence 57898 9999998655
No 145
>COG5349 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.94 E-value=22 Score=27.38 Aligned_cols=31 Identities=29% Similarity=0.563 Sum_probs=19.0
Q ss_pred CCCCCccceeCCCCCCCCccccccccccCCC
Q psy13347 49 KPRSRRVACTCPNCTEGGEREMFSYMLHGCL 79 (339)
Q Consensus 49 ~~~~~~~~~~C~~C~~~~~~~~~~~~~~~C~ 79 (339)
.++.+-...+||.|+|...-.-|.+....|.
T Consensus 14 ~pi~~Gl~grCP~CGeGrLF~gFLK~~p~C~ 44 (126)
T COG5349 14 TPIKRGLRGRCPRCGEGRLFRGFLKVVPACE 44 (126)
T ss_pred cHHHHHhcCCCCCCCCchhhhhhcccCchhh
Confidence 4566677789999997644444444444333
No 146
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=29.65 E-value=42 Score=21.92 Aligned_cols=45 Identities=27% Similarity=0.536 Sum_probs=25.8
Q ss_pred CCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccc----cCCccCCh
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQF----CGKRFTRS 180 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~----C~~~f~~~ 180 (339)
..+..|+...|...+. +..|..|+...-..++..|+ . |...+...
T Consensus 7 ~~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~--~~~~GC~~~~~~~ 55 (60)
T PF02176_consen 7 FRPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCP--YSPYGCKERVPRE 55 (60)
T ss_dssp TSEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-S--S----S--EEEHH
T ss_pred CCEeeCCCCCccccee-HHHHHHHHHccCCCCcEECC--CCCCCCCCccchh
Confidence 3467785322555555 55799999866667788898 7 87776543
No 147
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=29.02 E-value=13 Score=22.83 Aligned_cols=15 Identities=20% Similarity=0.563 Sum_probs=9.0
Q ss_pred CCCccccccCChhHH
Q psy13347 211 QCSECHKKFMRSDHL 225 (339)
Q Consensus 211 ~C~~C~k~F~~~~~L 225 (339)
.|.+|++.|..+...
T Consensus 10 ~C~~C~rpf~WRKKW 24 (42)
T PF10013_consen 10 ICPVCGRPFTWRKKW 24 (42)
T ss_pred cCcccCCcchHHHHH
Confidence 466677666655433
No 148
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=28.77 E-value=17 Score=29.12 Aligned_cols=8 Identities=38% Similarity=1.090 Sum_probs=6.1
Q ss_pred eCCCCCCC
Q psy13347 58 TCPNCTEG 65 (339)
Q Consensus 58 ~C~~C~~~ 65 (339)
+||+|+..
T Consensus 2 ~CP~C~~~ 9 (147)
T TIGR00244 2 HCPFCQHH 9 (147)
T ss_pred CCCCCCCC
Confidence 69999853
No 149
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=28.52 E-value=34 Score=27.79 Aligned_cols=17 Identities=18% Similarity=0.464 Sum_probs=11.1
Q ss_pred eEEeeccccccccCCchhH
Q psy13347 135 VHICHIEGCNKVYGKTSHL 153 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L 153 (339)
.++|. .||+.|.+...+
T Consensus 28 ~~~c~--~c~~~f~~~e~~ 44 (154)
T PRK00464 28 RRECL--ACGKRFTTFERV 44 (154)
T ss_pred eeecc--ccCCcceEeEec
Confidence 37884 488888765443
No 150
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=28.31 E-value=11 Score=31.65 Aligned_cols=37 Identities=19% Similarity=0.333 Sum_probs=24.6
Q ss_pred CcCCCccccccCChhHHHHHHHh----hcCCC------CCccccccCCccccc
Q psy13347 209 RFQCSECHKKFMRSDHLQKHIKT----HSKTR------CTSLVMNFSDNTVVT 251 (339)
Q Consensus 209 ~~~C~~C~k~F~~~~~L~~H~~~----h~~~~------C~~C~~~f~~~~~l~ 251 (339)
.-.|++||. .|..|++. |-|+- |..||+++..-..|.
T Consensus 14 ~~~CPvCg~------~l~~~~~~~~IPyFG~V~i~t~~C~~CgYR~~DV~~~e 60 (201)
T COG1779 14 RIDCPVCGG------TLKAHMYLYDIPYFGEVLISTGVCERCGYRSTDVKTLE 60 (201)
T ss_pred eecCCcccc------eeeEEEeeecCCccceEEEEEEEccccCCcccceeecc
Confidence 367999997 34455543 33333 999999987755443
No 151
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=28.28 E-value=30 Score=28.18 Aligned_cols=38 Identities=16% Similarity=0.328 Sum_probs=28.1
Q ss_pred cccCCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccC
Q psy13347 128 RTADRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFT 178 (339)
Q Consensus 128 ~~h~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~ 178 (339)
.......-|.|+. |+..|+....+. .-|.|+ .||....
T Consensus 102 ~~e~~~~~Y~Cp~--c~~r~tf~eA~~---------~~F~Cp--~Cg~~L~ 139 (158)
T TIGR00373 102 EFETNNMFFICPN--MCVRFTFNEAME---------LNFTCP--RCGAMLD 139 (158)
T ss_pred hhccCCCeEECCC--CCcEeeHHHHHH---------cCCcCC--CCCCEee
Confidence 3345567899954 999998777764 259999 9998754
No 152
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=28.19 E-value=68 Score=25.70 Aligned_cols=40 Identities=18% Similarity=0.419 Sum_probs=26.2
Q ss_pred CCCCeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCCccC
Q psy13347 131 DRKKVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGKRFT 178 (339)
Q Consensus 131 ~~~~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~~f~ 178 (339)
.....|.|+. |+..|.....+.. ... ...|.|+ .||....
T Consensus 95 ~~~~~Y~Cp~--C~~~y~~~ea~~~---~d~-~~~f~Cp--~Cg~~l~ 134 (147)
T smart00531 95 TNNAYYKCPN--CQSKYTFLEANQL---LDM-DGTFTCP--RCGEELE 134 (147)
T ss_pred cCCcEEECcC--CCCEeeHHHHHHh---cCC-CCcEECC--CCCCEEE
Confidence 4456899955 9999985443322 111 3349999 9998764
No 153
>KOG3002|consensus
Probab=28.13 E-value=30 Score=31.49 Aligned_cols=79 Identities=14% Similarity=0.133 Sum_probs=45.0
Q ss_pred cCccccccccccccccccccccccccCCCCeEEeec--cccccccCCchhHHHHHhhhCCCCceecCcc--ccCCccCCh
Q psy13347 105 NNKIRLSSYFYPALISTLFSCSLRTADRKKVHICHI--EGCNKVYGKTSHLRAHLRWHTGERPFVCTWQ--FCGKRFTRS 180 (339)
Q Consensus 105 ~~~~~c~~c~~~~~~~~~l~~h~~~h~~~~~~~C~~--~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~--~C~~~f~~~ 180 (339)
+..-+|..|..++... ..++|..--....+.|.. -+|.+.|..... ..|.+.-.- .||.|+.+ .|...= ..
T Consensus 78 ~~~~~CP~Cr~~~g~~--R~~amEkV~e~~~vpC~~~~~GC~~~~~Y~~~-~~HE~~C~f-~~~~CP~p~~~C~~~G-~~ 152 (299)
T KOG3002|consen 78 KVSNKCPTCRLPIGNI--RCRAMEKVAEAVLVPCKNAKLGCTKSFPYGEK-SKHEKVCEF-RPCSCPVPGAECKYTG-SY 152 (299)
T ss_pred hhcccCCccccccccH--HHHHHHHHHHhceecccccccCCceeeccccc-ccccccccc-CCcCCCCCcccCCccC-cH
Confidence 3345566666555532 233333334455677754 368888877766 556655433 67888865 555443 33
Q ss_pred HHHHhhhc
Q psy13347 181 DELQVNGG 188 (339)
Q Consensus 181 ~~L~~H~~ 188 (339)
..|..|.+
T Consensus 153 ~~l~~H~~ 160 (299)
T KOG3002|consen 153 KDLYAHLN 160 (299)
T ss_pred HHHHHHHH
Confidence 45666654
No 154
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=27.54 E-value=35 Score=23.83 Aligned_cols=19 Identities=26% Similarity=0.443 Sum_probs=10.0
Q ss_pred CCceecCccccCCccCChH
Q psy13347 163 ERPFVCTWQFCGKRFTRSD 181 (339)
Q Consensus 163 ~k~~~C~~~~C~~~f~~~~ 181 (339)
+.-+.|....||.+|....
T Consensus 25 ~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 25 ERYHQCQNVNCSATFITYE 43 (72)
T ss_pred eeeeecCCCCCCCEEEEEE
Confidence 3445565445666665443
No 155
>KOG3408|consensus
Probab=27.23 E-value=33 Score=26.39 Aligned_cols=26 Identities=35% Similarity=0.728 Sum_probs=22.1
Q ss_pred CCCCcCCCccccccCChhHHHHHHHh
Q psy13347 206 REKRFQCSECHKKFMRSDHLQKHIKT 231 (339)
Q Consensus 206 ~~k~~~C~~C~k~F~~~~~L~~H~~~ 231 (339)
|.-.|-|-.|.+-|.....|..|.++
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 33458899999999999999999873
No 156
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=27.11 E-value=12 Score=21.81 Aligned_cols=18 Identities=28% Similarity=0.569 Sum_probs=10.2
Q ss_pred cceeCCCCCCCCcccccc
Q psy13347 55 VACTCPNCTEGGEREMFS 72 (339)
Q Consensus 55 ~~~~C~~C~~~~~~~~~~ 72 (339)
..|.|++|+..-....|.
T Consensus 3 ~~~~C~nC~R~v~a~RfA 20 (33)
T PF08209_consen 3 PYVECPNCGRPVAASRFA 20 (33)
T ss_dssp -EEE-TTTSSEEEGGGHH
T ss_pred CeEECCCCcCCcchhhhH
Confidence 458999999654444333
No 157
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=27.01 E-value=40 Score=27.72 Aligned_cols=25 Identities=44% Similarity=1.024 Sum_probs=19.4
Q ss_pred CeEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCC
Q psy13347 134 KVHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGK 175 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~ 175 (339)
+.|.|.+ ||..+ .++.|-+|| +||-
T Consensus 133 ~~~vC~v--CGy~~-------------~ge~P~~CP--iCga 157 (166)
T COG1592 133 KVWVCPV--CGYTH-------------EGEAPEVCP--ICGA 157 (166)
T ss_pred CEEEcCC--CCCcc-------------cCCCCCcCC--CCCC
Confidence 3799988 98764 457788999 9984
No 158
>KOG2071|consensus
Probab=26.73 E-value=39 Score=33.41 Aligned_cols=29 Identities=28% Similarity=0.531 Sum_probs=25.1
Q ss_pred CCCcCCCccccccCChhHHHHHHHhhcCC
Q psy13347 207 EKRFQCSECHKKFMRSDHLQKHIKTHSKT 235 (339)
Q Consensus 207 ~k~~~C~~C~k~F~~~~~L~~H~~~h~~~ 235 (339)
.++-+|..||.+|........||-.|...
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~dw 444 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHDDW 444 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhhhh
Confidence 56789999999999999999999888653
No 159
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=26.21 E-value=27 Score=21.06 Aligned_cols=14 Identities=29% Similarity=0.793 Sum_probs=12.2
Q ss_pred CcCCCccccccCCh
Q psy13347 209 RFQCSECHKKFMRS 222 (339)
Q Consensus 209 ~~~C~~C~k~F~~~ 222 (339)
||.|..|++.|-..
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 79999999999765
No 160
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=26.07 E-value=56 Score=28.10 Aligned_cols=25 Identities=20% Similarity=0.509 Sum_probs=15.1
Q ss_pred CCeEEeeccccccccCCchhHHHHHhh
Q psy13347 133 KKVHICHIEGCNKVYGKTSHLRAHLRW 159 (339)
Q Consensus 133 ~~~~~C~~~~C~k~F~~~~~L~~H~~~ 159 (339)
+..|.|.+ |+|.|.-..-.+.|+..
T Consensus 75 ~~K~~C~l--c~KlFkg~eFV~KHI~n 99 (214)
T PF04959_consen 75 EDKWRCPL--CGKLFKGPEFVRKHIFN 99 (214)
T ss_dssp SEEEEE-S--SS-EESSHHHHHHHHHH
T ss_pred CCEECCCC--CCcccCChHHHHHHHhh
Confidence 44577755 77777777666777653
No 161
>KOG3214|consensus
Probab=25.49 E-value=27 Score=25.83 Aligned_cols=15 Identities=27% Similarity=0.667 Sum_probs=10.3
Q ss_pred CeEEeeccccccccCCc
Q psy13347 134 KVHICHIEGCNKVYGKT 150 (339)
Q Consensus 134 ~~~~C~~~~C~k~F~~~ 150 (339)
....|.+ |+..|...
T Consensus 46 G~~sC~i--C~esFqt~ 60 (109)
T KOG3214|consen 46 GKASCRI--CEESFQTT 60 (109)
T ss_pred ceeeeee--hhhhhccc
Confidence 3456877 88888664
No 162
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=25.30 E-value=53 Score=32.34 Aligned_cols=19 Identities=32% Similarity=0.786 Sum_probs=12.3
Q ss_pred HhhhCCCCceecCccccCCcc
Q psy13347 157 LRWHTGERPFVCTWQFCGKRF 177 (339)
Q Consensus 157 ~~~H~~~k~~~C~~~~C~~~f 177 (339)
+..|.......|. .||...
T Consensus 232 l~~h~~~~~l~Ch--~Cg~~~ 250 (505)
T TIGR00595 232 LTYHKKEGKLRCH--YCGYQE 250 (505)
T ss_pred eEEecCCCeEEcC--CCcCcC
Confidence 3445555678888 888653
No 163
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=25.21 E-value=65 Score=19.09 Aligned_cols=11 Identities=27% Similarity=0.984 Sum_probs=8.0
Q ss_pred cceeCCCCCCC
Q psy13347 55 VACTCPNCTEG 65 (339)
Q Consensus 55 ~~~~C~~C~~~ 65 (339)
....||.|+..
T Consensus 4 i~v~CP~C~s~ 14 (36)
T PF03811_consen 4 IDVHCPRCQST 14 (36)
T ss_pred EeeeCCCCCCC
Confidence 34689999954
No 164
>PRK14873 primosome assembly protein PriA; Provisional
Probab=24.75 E-value=56 Score=33.40 Aligned_cols=10 Identities=20% Similarity=0.926 Sum_probs=7.3
Q ss_pred CcCCCccccc
Q psy13347 209 RFQCSECHKK 218 (339)
Q Consensus 209 ~~~C~~C~k~ 218 (339)
++.|+.||..
T Consensus 422 p~~Cp~Cgs~ 431 (665)
T PRK14873 422 DWRCPRCGSD 431 (665)
T ss_pred CccCCCCcCC
Confidence 5788888754
No 165
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=24.54 E-value=35 Score=17.89 Aligned_cols=6 Identities=17% Similarity=0.395 Sum_probs=3.0
Q ss_pred CCcccc
Q psy13347 237 CTSLVM 242 (339)
Q Consensus 237 C~~C~~ 242 (339)
|..||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555544
No 166
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=24.11 E-value=69 Score=18.41 Aligned_cols=7 Identities=29% Similarity=0.956 Sum_probs=4.3
Q ss_pred eeCCCCC
Q psy13347 57 CTCPNCT 63 (339)
Q Consensus 57 ~~C~~C~ 63 (339)
|.|..|+
T Consensus 1 Y~C~~Cg 7 (32)
T PF03604_consen 1 YICGECG 7 (32)
T ss_dssp EBESSSS
T ss_pred CCCCcCC
Confidence 4566666
No 167
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=22.36 E-value=29 Score=29.35 Aligned_cols=34 Identities=21% Similarity=0.574 Sum_probs=0.0
Q ss_pred ecCCCCcCCCccc-cccCChhHHHHHHH--hhc-CCCC
Q psy13347 204 WCREKRFQCSECH-KKFMRSDHLQKHIK--THS-KTRC 237 (339)
Q Consensus 204 h~~~k~~~C~~C~-k~F~~~~~L~~H~~--~h~-~~~C 237 (339)
|--.+.|.|.+|| ..|.-+..+.+|.. .|. |.+|
T Consensus 96 hGL~~ey~CEICGN~~Y~GrkaFekHF~E~rH~~Glrc 133 (196)
T PF11931_consen 96 HGLGVEYKCEICGNQSYKGRKAFEKHFQEWRHAYGLRC 133 (196)
T ss_dssp --------------------------------------
T ss_pred hCCCCeeeeEeCCCcceecHHHHHHhcChhHHHccChh
Confidence 3345679999999 67778888999964 454 4444
No 168
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=21.94 E-value=58 Score=21.03 Aligned_cols=22 Identities=14% Similarity=0.439 Sum_probs=9.1
Q ss_pred CceecCccccCCccCChHHHHhhh
Q psy13347 164 RPFVCTWQFCGKRFTRSDELQVNG 187 (339)
Q Consensus 164 k~~~C~~~~C~~~f~~~~~L~~H~ 187 (339)
..|+|+ .|...|-..-++..|.
T Consensus 20 ~~y~C~--~C~~~FC~dCD~fiHE 41 (51)
T PF07975_consen 20 SRYRCP--KCKNHFCIDCDVFIHE 41 (51)
T ss_dssp EEE--T--TTT--B-HHHHHTTTT
T ss_pred CeEECC--CCCCccccCcChhhhc
Confidence 345665 6666665555555553
No 169
>KOG0717|consensus
Probab=21.23 E-value=45 Score=32.00 Aligned_cols=21 Identities=29% Similarity=0.676 Sum_probs=19.7
Q ss_pred cCCCccccccCChhHHHHHHH
Q psy13347 210 FQCSECHKKFMRSDHLQKHIK 230 (339)
Q Consensus 210 ~~C~~C~k~F~~~~~L~~H~~ 230 (339)
+-|.+|.|.|.+...|..|..
T Consensus 293 lyC~vCnKsFKseKq~kNHEn 313 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHEN 313 (508)
T ss_pred eEEeeccccccchHHHHhhHH
Confidence 779999999999999999975
No 170
>PF09855 DUF2082: Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082); InterPro: IPR018652 This family of proteins contains various hypothetical prokaryotic proteins as well as some Zn-ribbon nucleic-acid-binding proteins.
Probab=21.17 E-value=55 Score=22.25 Aligned_cols=10 Identities=30% Similarity=0.973 Sum_probs=7.6
Q ss_pred eeCCCCCCCC
Q psy13347 57 CTCPNCTEGG 66 (339)
Q Consensus 57 ~~C~~C~~~~ 66 (339)
|.||.|+..+
T Consensus 1 y~C~KCg~~~ 10 (64)
T PF09855_consen 1 YKCPKCGNEE 10 (64)
T ss_pred CCCCCCCCcc
Confidence 6899999643
No 171
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=20.71 E-value=37 Score=21.58 Aligned_cols=30 Identities=17% Similarity=0.471 Sum_probs=18.6
Q ss_pred eEEeeccccccccCCchhHHHHHhhhCCCCceecCccccCC
Q psy13347 135 VHICHIEGCNKVYGKTSHLRAHLRWHTGERPFVCTWQFCGK 175 (339)
Q Consensus 135 ~~~C~~~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~~~C~~ 175 (339)
.|+|. +|+..|.....+ .. ...-.|+ .||.
T Consensus 5 ey~C~--~Cg~~fe~~~~~------~~-~~~~~CP--~Cg~ 34 (52)
T TIGR02605 5 EYRCT--ACGHRFEVLQKM------SD-DPLATCP--ECGG 34 (52)
T ss_pred EEEeC--CCCCEeEEEEec------CC-CCCCCCC--CCCC
Confidence 48894 499988744221 11 3445788 8986
No 172
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=20.59 E-value=58 Score=31.02 Aligned_cols=32 Identities=13% Similarity=0.355 Sum_probs=22.1
Q ss_pred CCCccccccCChhHHHHHHHhhcCCCCCccccccCCcccc
Q psy13347 211 QCSECHKKFMRSDHLQKHIKTHSKTRCTSLVMNFSDNTVV 250 (339)
Q Consensus 211 ~C~~C~k~F~~~~~L~~H~~~h~~~~C~~C~~~f~~~~~l 250 (339)
.|+.||.+..++.. .+.+|..|++.+......
T Consensus 352 ~Cp~Cg~~m~S~G~--------~g~rC~kCg~~~~~~~~~ 383 (421)
T COG1571 352 VCPRCGGRMKSAGR--------NGFRCKKCGTRARETLIK 383 (421)
T ss_pred CCCccCCchhhcCC--------CCcccccccccCCccccc
Confidence 68888888877762 266688888877665433
No 173
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=20.48 E-value=41 Score=24.96 Aligned_cols=18 Identities=28% Similarity=0.353 Sum_probs=14.2
Q ss_pred eecCCCCcCCCccccccCC
Q psy13347 203 TWCREKRFQCSECHKKFMR 221 (339)
Q Consensus 203 ~h~~~k~~~C~~C~k~F~~ 221 (339)
.+.+ +++.|..||..|.-
T Consensus 74 l~~g-~~~rC~eCG~~fkL 91 (97)
T cd00924 74 LEKG-KPKRCPECGHVFKL 91 (97)
T ss_pred EeCC-CceeCCCCCcEEEE
Confidence 4445 79999999998864
No 174
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=20.08 E-value=46 Score=24.23 Aligned_cols=14 Identities=43% Similarity=0.904 Sum_probs=10.2
Q ss_pred CceecCccccCCccCC
Q psy13347 164 RPFVCTWQFCGKRFTR 179 (339)
Q Consensus 164 k~~~C~~~~C~~~f~~ 179 (339)
+|-.|. .||+.|..
T Consensus 57 ~Pa~Ck--kCGfef~~ 70 (97)
T COG3357 57 RPARCK--KCGFEFRD 70 (97)
T ss_pred cChhhc--ccCccccc
Confidence 466787 88888765
No 175
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=20.05 E-value=28 Score=22.80 Aligned_cols=10 Identities=30% Similarity=0.889 Sum_probs=5.1
Q ss_pred cCCCcccccc
Q psy13347 210 FQCSECHKKF 219 (339)
Q Consensus 210 ~~C~~C~k~F 219 (339)
..|+.||..|
T Consensus 23 V~Cp~CGael 32 (54)
T TIGR01206 23 VICDECGAEL 32 (54)
T ss_pred EeCCCCCCEE
Confidence 4555555433
Done!