Query psy11590
Match_columns 555
No_of_seqs 326 out of 3774
Neff 9.9
Searched_HMMs 46136
Date Fri Aug 16 22:35:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11590.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11590hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 2.2E-30 4.8E-35 261.3 3.8 147 382-528 604-939 (958)
2 KOG2462|consensus 99.9 1E-27 2.3E-32 214.5 6.0 135 381-518 128-265 (279)
3 KOG1074|consensus 99.9 7.1E-27 1.5E-31 236.1 4.1 107 187-298 618-733 (958)
4 KOG3608|consensus 99.9 1.6E-25 3.4E-30 204.8 11.9 295 99-506 69-396 (467)
5 KOG2462|consensus 99.9 7.9E-26 1.7E-30 202.5 5.3 132 409-544 128-265 (279)
6 KOG3608|consensus 99.9 4.1E-24 8.9E-29 195.6 11.9 260 203-522 69-378 (467)
7 KOG3623|consensus 99.9 2.3E-23 5E-28 207.1 4.3 117 203-404 210-330 (1007)
8 KOG3623|consensus 99.8 4.5E-21 9.8E-26 190.9 5.5 109 153-297 210-332 (1007)
9 KOG3576|consensus 99.7 3.6E-17 7.8E-22 138.9 3.5 125 381-524 115-240 (267)
10 KOG3576|consensus 99.6 4E-16 8.7E-21 132.6 3.4 95 438-534 117-222 (267)
11 PLN03086 PRLI-interacting fact 99.1 1.5E-10 3.3E-15 118.0 7.0 102 412-521 454-565 (567)
12 PHA00733 hypothetical protein 99.0 2.6E-10 5.5E-15 94.8 4.7 53 467-521 72-124 (128)
13 PLN03086 PRLI-interacting fact 99.0 7.1E-10 1.5E-14 113.2 7.3 131 384-533 408-550 (567)
14 PHA00733 hypothetical protein 98.9 1.3E-09 2.9E-14 90.5 4.6 81 381-462 38-123 (128)
15 PHA02768 hypothetical protein; 98.9 1E-09 2.3E-14 74.1 1.8 44 468-513 5-48 (55)
16 KOG3993|consensus 98.7 5.3E-09 1.2E-13 99.6 0.5 140 384-523 268-485 (500)
17 PF13465 zf-H2C2_2: Zinc-finge 98.6 2.1E-08 4.6E-13 57.9 2.4 26 483-508 1-26 (26)
18 PHA02768 hypothetical protein; 98.6 2.3E-08 5E-13 67.7 1.8 42 384-427 6-47 (55)
19 KOG3993|consensus 98.5 2.2E-08 4.8E-13 95.4 1.5 181 70-255 266-483 (500)
20 PHA00616 hypothetical protein 98.4 1.1E-07 2.3E-12 61.1 1.3 34 468-501 1-34 (44)
21 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.8E-07 3.8E-12 54.0 2.0 26 398-423 1-26 (26)
22 PHA00616 hypothetical protein 98.3 2.7E-07 5.8E-12 59.3 1.7 33 496-528 1-33 (44)
23 PF05605 zf-Di19: Drought indu 98.2 2.2E-06 4.8E-11 59.7 4.5 50 439-491 3-52 (54)
24 PHA00732 hypothetical protein 98.1 1.5E-06 3.2E-11 65.2 2.7 36 469-507 2-38 (79)
25 KOG1146|consensus 98.1 1.2E-05 2.5E-10 88.2 9.3 148 73-225 438-611 (1406)
26 PHA00732 hypothetical protein 98.0 3.3E-06 7.2E-11 63.3 2.9 45 439-490 2-46 (79)
27 KOG1146|consensus 98.0 5.5E-06 1.2E-10 90.6 4.2 77 439-522 1261-1354(1406)
28 PF05605 zf-Di19: Drought indu 97.8 2.6E-05 5.6E-10 54.3 3.7 51 468-521 2-54 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.7 2E-05 4.4E-10 44.1 1.5 22 497-518 1-22 (23)
30 PF12756 zf-C2H2_2: C2H2 type 97.6 3.3E-05 7.2E-10 62.0 2.7 73 440-519 1-73 (100)
31 PF00096 zf-C2H2: Zinc finger, 97.6 2.8E-05 6.1E-10 43.5 1.4 22 469-490 1-22 (23)
32 PF13894 zf-C2H2_4: C2H2-type 97.5 7.2E-05 1.6E-09 42.3 1.8 24 497-520 1-24 (24)
33 PF13912 zf-C2H2_6: C2H2-type 97.4 9.3E-05 2E-09 43.2 2.1 26 496-521 1-26 (27)
34 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00016 3.5E-09 58.0 2.4 71 386-460 2-72 (100)
35 PF13894 zf-C2H2_4: C2H2-type 97.2 0.00024 5.2E-09 40.0 2.0 21 440-460 2-22 (24)
36 COG5189 SFP1 Putative transcri 97.2 0.00011 2.3E-09 68.0 0.8 27 436-462 347-375 (423)
37 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00022 4.9E-09 41.6 1.3 25 468-492 1-25 (27)
38 PF09237 GAGA: GAGA factor; I 96.7 0.0013 2.9E-08 43.3 2.1 32 493-524 21-52 (54)
39 PF09237 GAGA: GAGA factor; I 96.6 0.0011 2.4E-08 43.6 1.3 41 455-496 12-52 (54)
40 COG5189 SFP1 Putative transcri 96.5 0.0019 4.1E-08 60.1 3.2 67 228-294 347-418 (423)
41 PF13909 zf-H2C2_5: C2H2-type 96.4 0.0014 3.1E-08 36.9 1.1 24 497-521 1-24 (24)
42 smart00355 ZnF_C2H2 zinc finge 96.4 0.0025 5.4E-08 36.4 1.9 22 498-519 2-23 (26)
43 smart00355 ZnF_C2H2 zinc finge 96.4 0.0028 6.1E-08 36.2 2.1 23 469-491 1-23 (26)
44 KOG2231|consensus 96.2 0.0073 1.6E-07 63.4 5.8 48 204-254 183-236 (669)
45 COG5236 Uncharacterized conser 96.1 0.012 2.7E-07 55.4 6.1 101 153-254 151-275 (493)
46 PRK04860 hypothetical protein; 96.1 0.0029 6.2E-08 54.8 1.8 40 467-510 118-157 (160)
47 PF12874 zf-met: Zinc-finger o 95.8 0.005 1.1E-07 35.0 1.3 22 497-518 1-22 (25)
48 PF12874 zf-met: Zinc-finger o 95.7 0.0051 1.1E-07 34.9 1.1 22 469-490 1-22 (25)
49 PF13909 zf-H2C2_5: C2H2-type 95.6 0.0098 2.1E-07 33.4 2.0 20 440-460 2-21 (24)
50 KOG2231|consensus 95.5 0.039 8.6E-07 58.1 7.4 110 319-462 116-236 (669)
51 KOG2482|consensus 95.4 0.12 2.7E-06 48.9 9.7 65 189-253 130-218 (423)
52 KOG2785|consensus 95.3 0.032 6.9E-07 53.9 5.5 136 383-518 3-242 (390)
53 KOG2785|consensus 95.0 0.083 1.8E-06 51.2 7.2 172 318-490 3-242 (390)
54 PF12171 zf-C2H2_jaz: Zinc-fin 94.9 0.0087 1.9E-07 34.7 0.3 22 497-518 2-23 (27)
55 PF12171 zf-C2H2_jaz: Zinc-fin 94.6 0.016 3.4E-07 33.7 0.9 22 72-93 2-23 (27)
56 PRK04860 hypothetical protein; 94.6 0.021 4.5E-07 49.5 2.0 39 382-424 118-156 (160)
57 COG5048 FOG: Zn-finger [Genera 93.7 0.014 2.9E-07 60.8 -1.1 151 382-533 288-455 (467)
58 KOG4173|consensus 93.6 0.038 8.2E-07 48.3 1.7 75 411-488 79-166 (253)
59 COG5236 Uncharacterized conser 93.2 0.11 2.4E-06 49.3 4.2 78 73-177 222-305 (493)
60 PF13913 zf-C2HC_2: zinc-finge 91.8 0.13 2.9E-06 29.1 1.8 20 498-518 4-23 (25)
61 smart00451 ZnF_U1 U1-like zinc 91.7 0.14 3E-06 31.7 2.0 23 496-518 3-25 (35)
62 KOG2482|consensus 91.3 0.44 9.6E-06 45.3 5.7 124 396-520 128-303 (423)
63 PF13913 zf-C2HC_2: zinc-finge 91.2 0.16 3.4E-06 28.8 1.7 19 470-489 4-22 (25)
64 KOG2893|consensus 90.0 0.099 2.1E-06 46.8 0.2 41 441-487 13-53 (341)
65 KOG4173|consensus 89.4 0.22 4.8E-06 43.7 1.9 78 383-463 79-171 (253)
66 smart00451 ZnF_U1 U1-like zinc 88.8 0.3 6.4E-06 30.1 1.7 23 438-460 3-25 (35)
67 cd00350 rubredoxin_like Rubred 88.8 0.21 4.6E-06 30.5 1.0 11 469-479 2-12 (33)
68 COG5048 FOG: Zn-finger [Genera 88.4 0.16 3.5E-06 52.7 0.5 66 468-533 289-360 (467)
69 KOG2893|consensus 86.5 0.19 4.2E-06 45.0 -0.2 46 470-519 12-58 (341)
70 PF12013 DUF3505: Protein of u 84.9 1.3 2.9E-05 35.8 4.0 25 497-521 81-109 (109)
71 TIGR00622 ssl1 transcription f 82.9 1.9 4.1E-05 34.6 3.8 22 409-430 13-34 (112)
72 COG4049 Uncharacterized protei 81.9 0.8 1.7E-05 30.9 1.2 27 438-464 17-43 (65)
73 COG4049 Uncharacterized protei 79.5 0.83 1.8E-05 30.9 0.7 28 492-519 13-40 (65)
74 TIGR00622 ssl1 transcription f 78.9 6.4 0.00014 31.7 5.6 20 229-248 14-33 (112)
75 cd00729 rubredoxin_SM Rubredox 78.8 0.99 2.2E-05 27.7 0.8 11 469-479 3-13 (34)
76 PF13717 zinc_ribbon_4: zinc-r 78.7 0.95 2.1E-05 28.2 0.7 33 132-164 4-36 (36)
77 PF12013 DUF3505: Protein of u 78.1 3.9 8.4E-05 33.1 4.4 25 439-463 81-109 (109)
78 PF13719 zinc_ribbon_5: zinc-r 77.6 0.94 2E-05 28.4 0.5 33 132-164 4-36 (37)
79 KOG2186|consensus 73.1 2.1 4.6E-05 39.2 1.7 55 469-526 4-58 (276)
80 PHA00626 hypothetical protein 71.6 2.4 5.1E-05 28.9 1.2 16 151-166 21-36 (59)
81 smart00659 RPOLCX RNA polymera 69.4 3 6.6E-05 27.3 1.4 25 469-504 3-27 (44)
82 PF09986 DUF2225: Uncharacteri 69.3 1.8 3.8E-05 39.9 0.4 14 275-288 49-62 (214)
83 PF09986 DUF2225: Uncharacteri 68.6 2.1 4.6E-05 39.4 0.7 43 201-243 3-61 (214)
84 PF09538 FYDLN_acid: Protein o 67.9 3.6 7.9E-05 33.0 1.9 30 204-243 10-39 (108)
85 TIGR02098 MJ0042_CXXC MJ0042 f 67.0 2.3 4.9E-05 26.8 0.4 32 133-164 5-36 (38)
86 PF09538 FYDLN_acid: Protein o 66.6 3.8 8.2E-05 32.9 1.7 17 200-216 23-39 (108)
87 KOG2186|consensus 66.3 3.8 8.2E-05 37.6 1.8 48 203-252 3-50 (276)
88 COG2888 Predicted Zn-ribbon RN 64.8 4.9 0.00011 27.9 1.7 33 71-108 27-59 (61)
89 PF10571 UPF0547: Uncharacteri 64.8 4.9 0.00011 23.0 1.5 9 321-329 17-25 (26)
90 PF02892 zf-BED: BED zinc fing 63.9 6.9 0.00015 25.6 2.4 23 495-517 15-41 (45)
91 TIGR00373 conserved hypothetic 63.2 7 0.00015 34.0 2.9 16 384-399 110-125 (158)
92 COG0068 HypF Hydrogenase matur 62.5 5.1 0.00011 42.8 2.2 74 25-108 102-182 (750)
93 smart00614 ZnF_BED BED zinc fi 62.1 6.4 0.00014 26.6 2.0 20 498-517 20-44 (50)
94 smart00531 TFIIE Transcription 60.9 8.2 0.00018 33.1 2.9 38 199-240 95-133 (147)
95 PRK00398 rpoP DNA-directed RNA 60.8 3 6.4E-05 27.7 0.1 29 468-506 3-31 (46)
96 COG1592 Rubrerythrin [Energy p 60.5 5.4 0.00012 34.7 1.7 24 383-419 134-157 (166)
97 smart00834 CxxC_CXXC_SSSS Puta 57.7 6.3 0.00014 25.1 1.3 31 203-239 5-35 (41)
98 COG1996 RPC10 DNA-directed RNA 56.8 5.4 0.00012 26.7 0.8 29 467-505 5-33 (49)
99 TIGR00373 conserved hypothetic 56.0 10 0.00022 33.1 2.6 34 199-240 105-138 (158)
100 PRK06266 transcription initiat 56.0 9.8 0.00021 33.8 2.6 16 383-398 117-132 (178)
101 PF06524 NOA36: NOA36 protein; 55.2 7.6 0.00016 35.8 1.8 25 228-252 207-231 (314)
102 smart00531 TFIIE Transcription 54.7 12 0.00027 32.0 3.0 12 383-394 99-110 (147)
103 TIGR02605 CxxC_CxxC_SSSS putat 54.1 7.2 0.00016 26.5 1.2 30 203-238 5-34 (52)
104 COG1198 PriA Primosomal protei 53.6 8.7 0.00019 42.1 2.3 49 412-505 436-484 (730)
105 PF05443 ROS_MUCR: ROS/MUCR tr 53.0 8.1 0.00018 32.3 1.5 34 497-533 73-112 (132)
106 smart00734 ZnF_Rad18 Rad18-lik 52.5 12 0.00026 21.3 1.7 19 440-459 3-21 (26)
107 PRK06266 transcription initiat 52.3 11 0.00025 33.4 2.4 35 199-241 113-147 (178)
108 KOG1940|consensus 51.5 14 0.0003 35.2 2.9 36 8-43 58-93 (276)
109 COG1592 Rubrerythrin [Energy p 49.8 11 0.00023 32.9 1.8 10 230-239 134-143 (166)
110 TIGR02300 FYDLN_acid conserved 49.2 12 0.00026 30.6 1.9 34 204-247 10-43 (129)
111 KOG2807|consensus 48.4 19 0.00041 34.6 3.3 36 202-250 275-310 (378)
112 PF00301 Rubredoxin: Rubredoxi 47.6 19 0.00041 24.0 2.3 37 203-239 1-43 (47)
113 TIGR02300 FYDLN_acid conserved 46.0 14 0.0003 30.3 1.7 34 153-219 9-42 (129)
114 PF05290 Baculo_IE-1: Baculovi 45.8 3.3 7.2E-05 34.0 -1.7 54 21-85 77-135 (140)
115 PF09723 Zn-ribbon_8: Zinc rib 45.2 16 0.00034 23.6 1.6 18 275-292 6-23 (42)
116 COG3677 Transposase and inacti 43.6 27 0.00058 29.2 3.2 46 121-167 22-67 (129)
117 PRK14890 putative Zn-ribbon RN 42.7 17 0.00037 25.4 1.5 33 71-108 25-57 (59)
118 PRK09678 DNA-binding transcrip 42.6 8.1 0.00018 28.4 -0.0 41 469-511 2-44 (72)
119 cd00730 rubredoxin Rubredoxin; 42.5 19 0.0004 24.4 1.7 13 230-242 1-13 (50)
120 KOG2071|consensus 41.8 18 0.00039 37.9 2.3 23 438-460 418-440 (579)
121 KOG4167|consensus 40.9 7 0.00015 41.6 -0.8 28 495-522 791-818 (907)
122 PF03604 DNA_RNApol_7kD: DNA d 40.8 27 0.00059 21.1 2.0 26 204-239 1-26 (32)
123 PF14446 Prok-RING_1: Prokaryo 40.8 12 0.00025 25.7 0.5 30 23-52 4-33 (54)
124 COG5152 Uncharacterized conser 40.5 7.8 0.00017 34.1 -0.4 20 19-38 191-210 (259)
125 PF15269 zf-C2H2_7: Zinc-finge 39.4 21 0.00045 23.1 1.4 23 383-405 20-42 (54)
126 PRK04023 DNA polymerase II lar 39.1 24 0.00052 39.6 2.8 13 467-479 662-674 (1121)
127 KOG2593|consensus 38.1 29 0.00063 34.9 3.0 36 438-478 128-163 (436)
128 PF08274 PhnA_Zn_Ribbon: PhnA 37.6 14 0.00031 21.9 0.5 8 441-448 5-12 (30)
129 PF04959 ARS2: Arsenite-resist 37.0 12 0.00026 34.2 0.2 27 496-522 77-103 (214)
130 PF04959 ARS2: Arsenite-resist 36.9 21 0.00046 32.6 1.7 28 438-465 77-104 (214)
131 PF12907 zf-met2: Zinc-binding 36.8 26 0.00057 22.4 1.6 19 470-488 3-24 (40)
132 KOG1280|consensus 36.6 26 0.00056 34.0 2.3 27 439-465 80-106 (381)
133 KOG2807|consensus 36.1 72 0.0016 30.8 5.0 22 152-173 289-310 (378)
134 KOG2071|consensus 35.4 26 0.00056 36.8 2.3 28 380-407 415-442 (579)
135 PF05443 ROS_MUCR: ROS/MUCR tr 35.4 17 0.00037 30.4 0.8 27 152-181 71-97 (132)
136 COG1997 RPL43A Ribosomal prote 35.0 25 0.00054 26.7 1.5 17 199-215 49-65 (89)
137 KOG4124|consensus 34.9 8.5 0.00018 37.1 -1.2 26 437-462 348-375 (442)
138 KOG4124|consensus 34.6 23 0.0005 34.3 1.6 73 16-92 341-419 (442)
139 PF02176 zf-TRAF: TRAF-type zi 34.3 19 0.00041 25.2 0.8 39 383-422 9-53 (60)
140 PF13451 zf-trcl: Probable zin 33.5 28 0.00062 23.4 1.5 16 467-482 3-18 (49)
141 PF13878 zf-C2H2_3: zinc-finge 33.1 39 0.00085 21.7 2.0 25 153-177 13-39 (41)
142 KOG2593|consensus 31.2 26 0.00057 35.2 1.4 41 65-108 122-162 (436)
143 COG5188 PRP9 Splicing factor 3 30.5 35 0.00076 33.1 2.1 48 410-457 237-394 (470)
144 PF15135 UPF0515: Uncharacteri 30.5 47 0.001 30.7 2.8 57 98-168 111-170 (278)
145 PF14353 CpXC: CpXC protein 30.4 16 0.00035 30.4 -0.1 23 274-296 38-60 (128)
146 KOG3408|consensus 30.2 31 0.00067 27.9 1.4 25 494-518 55-79 (129)
147 COG4957 Predicted transcriptio 30.0 37 0.0008 28.1 1.8 25 497-524 77-101 (148)
148 KOG4167|consensus 29.7 16 0.00036 39.0 -0.2 27 70-96 791-817 (907)
149 PRK00464 nrdR transcriptional 29.2 25 0.00053 30.4 0.8 17 153-169 28-44 (154)
150 PF06524 NOA36: NOA36 protein; 28.8 78 0.0017 29.5 3.9 85 199-297 138-232 (314)
151 PF01363 FYVE: FYVE zinc finge 28.8 31 0.00067 25.0 1.2 26 469-506 10-35 (69)
152 PF03107 C1_2: C1 domain; Int 27.1 15 0.00032 21.7 -0.6 22 25-47 1-22 (30)
153 PF07754 DUF1610: Domain of un 26.5 28 0.0006 19.5 0.4 11 151-161 14-24 (24)
154 PF07649 C1_3: C1-like domain; 25.9 22 0.00048 20.9 -0.0 7 72-78 16-22 (30)
155 COG5151 SSL1 RNA polymerase II 25.7 79 0.0017 30.2 3.5 24 229-252 387-410 (421)
156 PF12760 Zn_Tnp_IS1595: Transp 25.3 34 0.00073 22.6 0.8 29 70-107 17-45 (46)
157 PF05191 ADK_lid: Adenylate ki 25.0 32 0.00069 21.4 0.5 10 384-393 2-11 (36)
158 COG3357 Predicted transcriptio 24.1 37 0.00081 25.9 0.8 12 438-449 58-69 (97)
159 PF04780 DUF629: Protein of un 24.0 51 0.0011 34.0 2.1 27 439-465 58-84 (466)
160 TIGR00143 hypF [NiFe] hydrogen 23.0 48 0.001 36.7 1.8 78 21-108 65-149 (711)
161 PRK00432 30S ribosomal protein 23.0 45 0.00097 22.6 1.0 12 495-506 36-47 (50)
162 COG1198 PriA Primosomal protei 22.7 36 0.00077 37.5 0.7 12 97-108 473-484 (730)
163 PF07975 C1_4: TFIIH C1-like d 22.6 42 0.00091 22.8 0.8 30 273-302 20-49 (51)
164 TIGR01206 lysW lysine biosynth 22.2 19 0.00042 24.8 -0.9 13 24-36 2-14 (54)
165 TIGR00595 priA primosomal prot 22.1 54 0.0012 34.7 1.9 47 412-503 214-260 (505)
166 PRK04023 DNA polymerase II lar 21.9 80 0.0017 35.7 3.1 52 70-166 625-676 (1121)
167 cd00065 FYVE FYVE domain; Zinc 21.2 56 0.0012 22.4 1.3 9 471-479 5-13 (57)
168 PF02891 zf-MIZ: MIZ/SP-RING z 21.1 21 0.00047 24.1 -0.8 23 10-32 27-49 (50)
169 smart00154 ZnF_AN1 AN1-like Zi 21.1 50 0.0011 20.9 0.9 13 496-508 12-24 (39)
170 COG2879 Uncharacterized small 21.0 71 0.0015 22.6 1.6 19 507-525 23-41 (65)
171 PF08790 zf-LYAR: LYAR-type C2 20.8 31 0.00067 20.1 -0.1 18 498-516 2-19 (28)
172 KOG3408|consensus 20.6 54 0.0012 26.6 1.2 28 199-226 53-80 (129)
173 COG5151 SSL1 RNA polymerase II 20.4 77 0.0017 30.3 2.3 30 202-244 307-336 (421)
No 1
>KOG1074|consensus
Probab=99.96 E-value=2.2e-30 Score=261.31 Aligned_cols=147 Identities=29% Similarity=0.560 Sum_probs=132.7
Q ss_pred cceecCCCcccccCHHHHHHHHhhccCCCccccccccccccCHHHHHHHHHhcCCC-----CccCC---CChhhcCChHH
Q psy11590 382 MRYQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHSNV-----KFKCD---FCEKVYSHRNR 453 (555)
Q Consensus 382 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~-----~~~C~---~C~~~f~~~~~ 453 (555)
.|.+|-+|-+....++.|+.|+++|+|+|||+|.+||+.|+++.+|+.|+-+|..+ +|.|+ +|.+.|.....
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 46899999999999999999999999999999999999999999999999999765 58999 99999999999
Q ss_pred HHHHHhhhcCCCc-------------------------------------------------------------------
Q psy11590 454 LRYHISTVHENKW------------------------------------------------------------------- 466 (555)
Q Consensus 454 L~~H~~~~H~~~~------------------------------------------------------------------- 466 (555)
|..|++.|-++..
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccc
Confidence 9999999331110
Q ss_pred --------------------------------------------------------------------------------
Q psy11590 467 -------------------------------------------------------------------------------- 466 (555)
Q Consensus 467 -------------------------------------------------------------------------------- 466 (555)
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence
Q ss_pred ----------------------------------cccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhH
Q psy11590 467 ----------------------------------KSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNL 512 (555)
Q Consensus 467 ----------------------------------~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L 512 (555)
....|.+||+.|...++|..|+++|+|++||.|.+|++.|..+.+|
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnL 923 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNL 923 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhh
Confidence 0146999999999999999999999999999999999999999999
Q ss_pred HHHHHhCCCccccCcc
Q psy11590 513 VRHQKNCKASAKCSLM 528 (555)
Q Consensus 513 ~~H~~~h~~~~~~~~~ 528 (555)
+.||.+|+...|..=+
T Consensus 924 KvHMgtH~w~q~~srr 939 (958)
T KOG1074|consen 924 KVHMGTHMWVQPPSRR 939 (958)
T ss_pred hhhhccccccCCCccC
Confidence 9999999998876544
No 2
>KOG2462|consensus
Probab=99.94 E-value=1e-27 Score=214.50 Aligned_cols=135 Identities=31% Similarity=0.670 Sum_probs=128.4
Q ss_pred CcceecCCCcccccCHHHHHHHHhhccC---CCccccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHH
Q psy11590 381 TMRYQCPHCPSLLKTFWTLKEHFNIHTS---EKKYVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYH 457 (555)
Q Consensus 381 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~---~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H 457 (555)
...|+|+.||+.+.+.++|.+|.++|-. .+.+.|++|++.|.+..+|..|+++|+ .+++|.+||+.|...+.|+-|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQGH 206 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQGH 206 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhcc
Confidence 4469999999999999999999999964 678999999999999999999999998 589999999999999999999
Q ss_pred HhhhcCCCccccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHh
Q psy11590 458 ISTVHENKWKSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 458 ~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
+|+ |+|+ |||.|..|+|+|.+.++|+.||++|.+.++|+|..|+|+|..++-|.+|...
T Consensus 207 iRT-HTGE-KPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 207 IRT-HTGE-KPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred ccc-ccCC-CCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 999 8888 9999999999999999999999999999999999999999999999999764
No 3
>KOG1074|consensus
Probab=99.93 E-value=7.1e-27 Score=236.14 Aligned_cols=107 Identities=25% Similarity=0.417 Sum_probs=81.3
Q ss_pred ccchhhhhhccccCCceeecCccCccccchhhhhhhhhhcC-----CCcccCC---CCCccCCCchHHHhHHHhhcCCCC
Q psy11590 187 NSNLSEHIRMHHTEEKKYICEVCGTSFLTWSSLNSHKYSHD-----DRKFSCS---HCGNSYKNPIFLKRHFRESHLKHS 258 (555)
Q Consensus 187 ~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~-----~~~~~C~---~C~~~f~~~~~L~~H~~~~H~~h~ 258 (555)
.+.|+-|.|+| +||+||+|.+|++.|.++..|+.||-+|. ...|.|+ +|.+.|.+...|..|+++ |.
T Consensus 618 ~saLqmHyrtH-tGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQhIri----H~ 692 (958)
T KOG1074|consen 618 PSALQMHYRTH-TGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQHIRI----HL 692 (958)
T ss_pred hhhhhhhhhcc-cCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccceEEe----ec
Confidence 35566677777 88999999999999999999999999996 4569999 999999999999999998 55
Q ss_pred CC-CCcccccccccCceeeCCCcccccccccccccCCCCCC
Q psy11590 259 KS-NPEQYAVKSLQDIMYWCNVCSQRCYLSATCSIHRPSNC 298 (555)
Q Consensus 259 ~~-~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~ 298 (555)
+. .+............-+|..|.+.|.....+..++..+.
T Consensus 693 ~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~ 733 (958)
T KOG1074|consen 693 GGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQP 733 (958)
T ss_pred CCCCCCCcccccccchhcccchhhhcccccccchhhhhccC
Confidence 22 22111111111122579999999988888877766553
No 4
>KOG3608|consensus
Probab=99.93 E-value=1.6e-25 Score=204.84 Aligned_cols=295 Identities=23% Similarity=0.473 Sum_probs=207.4
Q ss_pred cccCC--CCCCChhhHHHH-HHHHHHHHHhhcccc--------------ccccccccccccccccccCCCCcccC--CCC
Q psy11590 99 DFVCN--LCPPDSKIMIKY-AHLLVRHMKQCHTMQ--------------VHIPAVYKNFRSVTSFTMNNITQYKC--PDC 159 (555)
Q Consensus 99 ~~~C~--~C~~~~~~~f~~-~~~l~~H~~~~H~~~--------------~~c~~c~~~~~~~~~~~~~~~~~~~C--~~C 159 (555)
.+.|. .|++. ..+ ...|.+|.-. |--. ...+.|.-.|.....+...+. .|.| ..|
T Consensus 69 e~qC~w~~C~f~----~~~~s~~l~RHvy~-H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~-~f~C~WedC 142 (467)
T KOG3608|consen 69 EHQCTWNSCDFR----TENSSADLIRHVYF-HCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQ-NFRCGWEDC 142 (467)
T ss_pred ceeEEeccCCcc----ccchHHHHHhhhhh-hhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchh-hhccChhhc
Confidence 36663 57765 433 4567777655 3100 034555555655554443332 4666 669
Q ss_pred hhhcCChHHHHHHHHhccCCCccccCCccchhhhhhccccCCc-eeec--CccCccccchhhhhhhhhhcC-CCcccCCC
Q psy11590 160 PAILQTFKNLKCHLDIHSGEKDFSCHINSNLSEHIRMHHTEEK-KYIC--EVCGTSFLTWSSLNSHKYSHD-DRKFSCSH 235 (555)
Q Consensus 160 ~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~l~~H~~~h~~~~~-~~~C--~~C~~~f~~~~~L~~H~~~h~-~~~~~C~~ 235 (555)
+..|.+...|..|+..|..--.|. -..+- .+++ .+.| ..|.+.|.+++.|++|++.|. ++...|+.
T Consensus 143 e~~F~s~~ef~dHV~~H~l~ceyd---------~~~~~-~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~ 212 (467)
T KOG3608|consen 143 EREFVSIVEFQDHVVKHALFCEYD---------IQKTP-EDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH 212 (467)
T ss_pred CCcccCHHHHHHHHHHhhhhhhhh---------hhhCC-CCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch
Confidence 999999999999998775321111 00111 2222 2445 458888888888888888887 77888888
Q ss_pred CCccCCCchHHHhHHHhhcCCCCCCCCcccccccccCceeeCCCcccccccccccccCCCCCCcccccccccchhhcccc
Q psy11590 236 CGNSYKNPIFLKRHFRESHLKHSKSNPEQYAVKSLQDIMYWCNVCSQRCYLSATCSIHRPSNCKDQYKMSVSISNECDQH 315 (555)
Q Consensus 236 C~~~f~~~~~L~~H~~~~H~~h~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~c~~~~~~~~~~~~l~~h 315 (555)
||.-|.++..|-.|++.. +.. ...+|+|
T Consensus 213 Cg~~F~~~tkl~DH~rRq----t~l----------~~n~fqC-------------------------------------- 240 (467)
T KOG3608|consen 213 CGELFRTKTKLFDHLRRQ----TEL----------NTNSFQC-------------------------------------- 240 (467)
T ss_pred HHHHhccccHHHHHHHhh----hhh----------cCCchHH--------------------------------------
Confidence 888888888888887751 110 0011444
Q ss_pred ccccccCccCccccChhHHhHHHHhhhcccccccCCCCchhhhhHHHHhhhhHhHHHHHHhhhcCCcceecCCCcccccC
Q psy11590 316 MECHTCNECDSKHNDCQSLWDHVFSEHKNMLAFSKPDDSENKISKVAEKQYWKDVEEKCRIIVEDTMRYQCPHCPSLLKT 395 (555)
Q Consensus 316 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~c~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~f~~ 395 (555)
..|.+.|.+...|..|+ ..|.+ -|+|+.|+.+.+.
T Consensus 241 ------~~C~KrFaTeklL~~Hv-~rHvn--------------------------------------~ykCplCdmtc~~ 275 (467)
T KOG3608|consen 241 ------AQCFKRFATEKLLKSHV-VRHVN--------------------------------------CYKCPLCDMTCSS 275 (467)
T ss_pred ------HHHHHHHhHHHHHHHHH-HHhhh--------------------------------------cccccccccCCCC
Confidence 44444455555565555 33432 3899999999999
Q ss_pred HHHHHHHHhh-ccCCCccccccccccccCHHHHHHHHHhcCCCCccCCC--ChhhcCChHHHHHHHhhhcCCC-cccccc
Q psy11590 396 FWTLKEHFNI-HTSEKKYVCELCGAYFIHKSSLVAHTTTHSNVKFKCDF--CEKVYSHRNRLRYHISTVHENK-WKSAAC 471 (555)
Q Consensus 396 ~~~L~~H~~~-H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~-~~~~~C 471 (555)
.+.|.+|++. |...+||+|+.|++.|.+.+.|..|+.+|++..|.|.. |...|.+...|++|++.+|.+. .-+|.|
T Consensus 276 ~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~C 355 (467)
T KOG3608|consen 276 ASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYAC 355 (467)
T ss_pred hHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceee
Confidence 9999999875 78899999999999999999999999999987999977 9999999999999999999554 356999
Q ss_pred ccchhcccCchHHHHHHH-hc-----CCCCCeecccccccc
Q psy11590 472 DICGRVFLDSKNMKKHAA-VH-----STERPFVCKLCGIAY 506 (555)
Q Consensus 472 ~~C~k~f~~~~~L~~H~~-~H-----~~~~~~~C~~C~~~f 506 (555)
..|++.|.+..+|.+|++ .| .|-+.|.=.+|.-+|
T Consensus 356 H~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~ 396 (467)
T KOG3608|consen 356 HCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGF 396 (467)
T ss_pred ecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCce
Confidence 999999999999999964 45 344556656665555
No 5
>KOG2462|consensus
Probab=99.92 E-value=7.9e-26 Score=202.50 Aligned_cols=132 Identities=30% Similarity=0.592 Sum_probs=118.7
Q ss_pred CCccccccccccccCHHHHHHHHHhcCCC----CccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHH
Q psy11590 409 EKKYVCELCGAYFIHKSSLVAHTTTHSNV----KFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNM 484 (555)
Q Consensus 409 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~~----~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L 484 (555)
...|+|+.||+.+.+..+|.+|.++|-.. .+.|++|+++|.+..+|+.|+++ |+ -+.+|.+|||.|.....|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirT-H~---l~c~C~iCGKaFSRPWLL 203 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRT-HT---LPCECGICGKAFSRPWLL 203 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhc-cC---CCcccccccccccchHHh
Confidence 45699999999999999999999999643 79999999999999999999999 66 469999999999999999
Q ss_pred HHHHHhcCCCCCeecccccccccChhhHHHHHHhCCCccccCcccCCCC--ccccccceeee
Q psy11590 485 KKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKASAKCSLMTTSPL--SVKFAMPHTRF 544 (555)
Q Consensus 485 ~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~--~~~~~~~~~~~ 544 (555)
+.|+|+||||+||.|+.|++.|..++||+.||.+|.+.|+|.|..|.=. -+-.+.+|.+.
T Consensus 204 QGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 204 QGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred hcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999999999999999632 22333466554
No 6
>KOG3608|consensus
Probab=99.91 E-value=4.1e-24 Score=195.58 Aligned_cols=260 Identities=21% Similarity=0.353 Sum_probs=196.5
Q ss_pred eeec--CccCccccc-hhhhhhhhhhcC-----------------------------------CCcccCC--CCCccCCC
Q psy11590 203 KYIC--EVCGTSFLT-WSSLNSHKYSHD-----------------------------------DRKFSCS--HCGNSYKN 242 (555)
Q Consensus 203 ~~~C--~~C~~~f~~-~~~L~~H~~~h~-----------------------------------~~~~~C~--~C~~~f~~ 242 (555)
.++| ..|++...+ ..+|.+|...|. ...|.|. .|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 3555 457777666 478999976651 1127885 59999999
Q ss_pred chHHHhHHHhhcCCCCCCCCc-ccccccccCceeeCCC--cccccccccccccCCCCCCcccccccccchhhcccccccc
Q psy11590 243 PIFLKRHFRESHLKHSKSNPE-QYAVKSLQDIMYWCNV--CSQRCYLSATCSIHRPSNCKDQYKMSVSISNECDQHMECH 319 (555)
Q Consensus 243 ~~~L~~H~~~~H~~h~~~~~~-~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~~~c~~~~~~~~~~~~l~~h~~~~ 319 (555)
..++..|+.. |..--.. +..++.++...+.|.+ |.+.|.+++.|+.|.+.|. ..++.
T Consensus 149 ~~ef~dHV~~----H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs----------------~eKvv 208 (467)
T KOG3608|consen 149 IVEFQDHVVK----HALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHS----------------NEKVV 208 (467)
T ss_pred HHHHHHHHHH----hhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcC----------------CCeEE
Confidence 9999999986 3322111 1112222223466654 7777777775555544432 11223
Q ss_pred ccCccCccccChhHHhHHHHhhhcccccccCCCCchhhhhHHHHhhhhHhHHHHHHhhhcCCcceecCCCcccccCHHHH
Q psy11590 320 TCNECDSKHNDCQSLWDHVFSEHKNMLAFSKPDDSENKISKVAEKQYWKDVEEKCRIIVEDTMRYQCPHCPSLLKTFWTL 399 (555)
Q Consensus 320 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~c~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~f~~~~~L 399 (555)
.|+.||.-|.+...|.+|+...- .....+|+|..|.|.|.++..|
T Consensus 209 ACp~Cg~~F~~~tkl~DH~rRqt-----------------------------------~l~~n~fqC~~C~KrFaTeklL 253 (467)
T KOG3608|consen 209 ACPHCGELFRTKTKLFDHLRRQT-----------------------------------ELNTNSFQCAQCFKRFATEKLL 253 (467)
T ss_pred ecchHHHHhccccHHHHHHHhhh-----------------------------------hhcCCchHHHHHHHHHhHHHHH
Confidence 38888888888888877764321 1223479999999999999999
Q ss_pred HHHHhhccCCCccccccccccccCHHHHHHHHHhcCCC--CccCCCChhhcCChHHHHHHHhhhcCCCcccccccc--ch
Q psy11590 400 KEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHSNV--KFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDI--CG 475 (555)
Q Consensus 400 ~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~--~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~--C~ 475 (555)
..|+..|. .-|+|+.|+.+....++|..|++..+.+ ||+|+.|++.|.+...|.+|+.. |.. ..|.|+. |.
T Consensus 254 ~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~-HS~--~~y~C~h~~C~ 328 (467)
T KOG3608|consen 254 KSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQV-HSK--TVYQCEHPDCH 328 (467)
T ss_pred HHHHHHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHh-ccc--cceecCCCCCc
Confidence 99999886 5699999999999999999999875543 99999999999999999999997 774 6799998 99
Q ss_pred hcccCchHHHHHHHhcC-C--CCCeecccccccccChhhHHHHHHhCCCc
Q psy11590 476 RVFLDSKNMKKHAAVHS-T--ERPFVCKLCGIAYKWRKNLVRHQKNCKAS 522 (555)
Q Consensus 476 k~f~~~~~L~~H~~~H~-~--~~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 522 (555)
.+|.+...|++|++.++ | +.+|.|-.|++.|++..+|..|++.-|+-
T Consensus 329 ~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f 378 (467)
T KOG3608|consen 329 YSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGF 378 (467)
T ss_pred HHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcc
Confidence 99999999999998765 4 45699999999999999999997766554
No 7
>KOG3623|consensus
Probab=99.87 E-value=2.3e-23 Score=207.08 Aligned_cols=117 Identities=20% Similarity=0.402 Sum_probs=85.9
Q ss_pred eeecCccCccccchhhhhhhhhhcC---CCcccCCCCCccCCCchHHHhHHHhhcCCCCCCCCcc-cccccccCceeeCC
Q psy11590 203 KYICEVCGTSFLTWSSLNSHKYSHD---DRKFSCSHCGNSYKNPIFLKRHFRESHLKHSKSNPEQ-YAVKSLQDIMYWCN 278 (555)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~h~---~~~~~C~~C~~~f~~~~~L~~H~~~~H~~h~~~~~~~-~~~~~~~~~~~~C~ 278 (555)
..+|++|++.+.....|+.|++.-+ +..|.|.+|.++|..+..|.+||.+ |....+.. ....+...+.|+|.
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~----hkpg~dqa~sltqsa~lRKFKCt 285 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQL----HKPGGDQAISLTQSALLRKFKCT 285 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHh----hcCCCcccccccchhhhcccccc
Confidence 3789999999999999999987654 6679999999999999999999998 54443321 11122223335555
Q ss_pred CcccccccccccccCCCCCCcccccccccchhhccccccccccCccCccccChhHHhHHHHhhhcccccccCCCCchhhh
Q psy11590 279 VCSQRCYLSATCSIHRPSNCKDQYKMSVSISNECDQHMECHTCNECDSKHNDCQSLWDHVFSEHKNMLAFSKPDDSENKI 358 (555)
Q Consensus 279 ~C~~~f~~~~~l~~H~~~~~c~~~~~~~~~~~~l~~h~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~ 358 (555)
+|++.|..+. .| .+
T Consensus 286 ECgKAFKfKH--------------------------------------------HL-----KE----------------- 299 (1007)
T KOG3623|consen 286 ECGKAFKFKH--------------------------------------------HL-----KE----------------- 299 (1007)
T ss_pred ccchhhhhHH--------------------------------------------HH-----Hh-----------------
Confidence 5555544333 33 33
Q ss_pred hHHHHhhhhHhHHHHHHhhhcCCcceecCCCcccccCHHHHHHHHh
Q psy11590 359 SKVAEKQYWKDVEEKCRIIVEDTMRYQCPHCPSLLKTFWTLKEHFN 404 (555)
Q Consensus 359 ~~~c~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~ 404 (555)
|.++|.|++||.|+.|+|+|.....+..||.
T Consensus 300 ---------------HlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 300 ---------------HLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred ---------------hheeecCCCCcCCcccccccccCCccccccc
Confidence 4456677888999999999999999999985
No 8
>KOG3623|consensus
Probab=99.82 E-value=4.5e-21 Score=190.90 Aligned_cols=109 Identities=31% Similarity=0.601 Sum_probs=96.0
Q ss_pred cccCCCChhhcCChHHHHHHHHhccCCCccccCCccchhhhhhccccCCceeecCccCccccchhhhhhhhhhcC-----
Q psy11590 153 QYKCPDCPAILQTFKNLKCHLDIHSGEKDFSCHINSNLSEHIRMHHTEEKKYICEVCGTSFLTWSSLNSHKYSHD----- 227 (555)
Q Consensus 153 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~----- 227 (555)
...|+.|...+.....|+.|++--+ . ..+..|.|..|..+|..+..|.+|+.+|.
T Consensus 210 lltcpycdrgykrltslkeHikyrh----------------e----kne~nfsC~lCsytFAyRtQLErhm~~hkpg~dq 269 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRH----------------E----KNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQ 269 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHH----------------h----hCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcc
Confidence 4689999999999999999997321 1 34567899999999999999999999884
Q ss_pred ---------CCcccCCCCCccCCCchHHHhHHHhhcCCCCCCCCcccccccccCceeeCCCcccccccccccccCCCCC
Q psy11590 228 ---------DRKFSCSHCGNSYKNPIFLKRHFRESHLKHSKSNPEQYAVKSLQDIMYWCNVCSQRCYLSATCSIHRPSN 297 (555)
Q Consensus 228 ---------~~~~~C~~C~~~f~~~~~L~~H~~~~H~~h~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 297 (555)
.+.|+|..|+|.|..+..|+.|+|+ |.|++| |.|+.|.+.|..+.++..|+...
T Consensus 270 a~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRI----HSGEKP------------feCpnCkKRFSHSGSySSHmSSK 332 (1007)
T KOG3623|consen 270 AISLTQSALLRKFKCTECGKAFKFKHHLKEHLRI----HSGEKP------------FECPNCKKRFSHSGSYSSHMSSK 332 (1007)
T ss_pred cccccchhhhccccccccchhhhhHHHHHhhhee----ecCCCC------------cCCcccccccccCCccccccccc
Confidence 2569999999999999999999999 999999 99999999999999999997544
No 9
>KOG3576|consensus
Probab=99.66 E-value=3.6e-17 Score=138.88 Aligned_cols=125 Identities=26% Similarity=0.501 Sum_probs=101.9
Q ss_pred CcceecCCCcccccCHHHHHHHHhhccCCCccccccccccccCHHHHHHHHHhcCCC-CccCCCChhhcCChHHHHHHHh
Q psy11590 381 TMRYQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHSNV-KFKCDFCEKVYSHRNRLRYHIS 459 (555)
Q Consensus 381 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~-~~~C~~C~~~f~~~~~L~~H~~ 459 (555)
...|.|.+|++.|.-...|.+|++-|...+.|-|..||+.|.....|++|+++|++. ||+|..|+++|+.+-.|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 446999999999999999999999999999999999999999999999999999998 9999999999999999999999
Q ss_pred hhcCCCccccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhCCCccc
Q psy11590 460 TVHENKWKSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKASAK 524 (555)
Q Consensus 460 ~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 524 (555)
.+|+.. ..| ++ ....++.|.|+.||..-.....+..|++.||+..|
T Consensus 195 kvhgv~-~~y-------------ay-----kerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 195 KVHGVQ-HQY-------------AY-----KERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHcCch-HHH-------------HH-----HHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 999864 111 00 01234556666666666666666666666665544
No 10
>KOG3576|consensus
Probab=99.60 E-value=4e-16 Score=132.56 Aligned_cols=95 Identities=28% Similarity=0.560 Sum_probs=85.7
Q ss_pred CccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHH
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQK 517 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~ 517 (555)
.|.|.+|++.|.....|.+|++- |... +.|.|..|||.|.....|++|+|+|+|.+||+|..|+++|+++-.|..|++
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~kc-h~~v-kr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLKC-HSDV-KRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhhh-ccHH-HHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 68899999999999999999988 7776 899999999999999999999999999999999999999999999999998
Q ss_pred hCCCc-----------cccCcccCCCCc
Q psy11590 518 NCKAS-----------AKCSLMTTSPLS 534 (555)
Q Consensus 518 ~h~~~-----------~~~~~~~~~~~~ 534 (555)
.-||. |-|.|.+|.+-+
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~ 222 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTS 222 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCC
Confidence 77764 456799998875
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.08 E-value=1.5e-10 Score=117.97 Aligned_cols=102 Identities=21% Similarity=0.473 Sum_probs=69.0
Q ss_pred cccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccC----------c
Q psy11590 412 YVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLD----------S 481 (555)
Q Consensus 412 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~----------~ 481 (555)
+.|+.|++.|. ...|..|+..++ .++.|+ |++.+ .+..|..|+.+ |... +++.|.+|++.|.. .
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H-kpv~Cp-Cg~~~-~R~~L~~H~~t-hCp~-Kpi~C~fC~~~v~~g~~~~d~~d~~ 527 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH-EPLQCP-CGVVL-EKEQMVQHQAS-TCPL-RLITCRFCGDMVQAGGSAMDVRDRL 527 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC-CCccCC-CCCCc-chhHHHhhhhc-cCCC-CceeCCCCCCccccCccccchhhhh
Confidence 46777777774 566777777764 577777 77644 55777777766 5555 77777777777741 3
Q ss_pred hHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhCCC
Q psy11590 482 KNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 482 ~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
+.|..|.... |.+++.|..||+.+..+ .|..|+...|.
T Consensus 528 s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 528 RGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred hhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence 4677777764 77777777777776544 56677665543
No 12
>PHA00733 hypothetical protein
Probab=99.02 E-value=2.6e-10 Score=94.81 Aligned_cols=53 Identities=23% Similarity=0.454 Sum_probs=39.6
Q ss_pred cccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhCCC
Q psy11590 467 KSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 467 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
+||.|+.||+.|.+...|..|++.| +.+|.|+.|++.|.....|..|+...|+
T Consensus 72 kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 72 SPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 6777777777777777777777765 3467777787777777777777777665
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.99 E-value=7.1e-10 Score=113.22 Aligned_cols=131 Identities=20% Similarity=0.504 Sum_probs=105.1
Q ss_pred eecCCCcccccCHHHHHHHHhhccCCCcccccc--ccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhhh
Q psy11590 384 YQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCEL--CGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHISTV 461 (555)
Q Consensus 384 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~~ 461 (555)
-.|+.|....+. ..|..|..... ...-.|+. |+..|. ...|..| +.|+.|++.|. ...|..|+..+
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H--------~~C~~Cgk~f~-~s~LekH~~~~ 475 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYCS-RHNVVCPHDGCGIVLR-VEEAKNH--------VHCEKCGQAFQ-QGEMEKHMKVF 475 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhCC-CcceeCCcccccceee-ccccccC--------ccCCCCCCccc-hHHHHHHHHhc
Confidence 579999887654 55778875543 23456874 999993 3344444 69999999996 68899999995
Q ss_pred cCCCccccccccchhcccCchHHHHHHHhcCCCCCeecccccccccC----------hhhHHHHHHhCCCccccCcccCC
Q psy11590 462 HENKWKSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKW----------RKNLVRHQKNCKASAKCSLMTTS 531 (555)
Q Consensus 462 H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~----------~~~L~~H~~~h~~~~~~~~~~~~ 531 (555)
| +++.|+ ||+.+ .+..|..|+.+|.+++|+.|+.|++.|.. .+.|..|...+ |.+++.|..|.
T Consensus 476 H----kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cg 548 (567)
T PLN03086 476 H----EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCG 548 (567)
T ss_pred C----CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccC
Confidence 4 689999 99765 67999999999999999999999999952 45899999997 99999999996
Q ss_pred CC
Q psy11590 532 PL 533 (555)
Q Consensus 532 ~~ 533 (555)
-.
T Consensus 549 k~ 550 (567)
T PLN03086 549 RS 550 (567)
T ss_pred Ce
Confidence 43
No 14
>PHA00733 hypothetical protein
Probab=98.90 E-value=1.3e-09 Score=90.55 Aligned_cols=81 Identities=22% Similarity=0.311 Sum_probs=40.5
Q ss_pred CcceecCCCcccccCHHHHHHH--Hh---hccCCCccccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHH
Q psy11590 381 TMRYQCPHCPSLLKTFWTLKEH--FN---IHTSEKKYVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLR 455 (555)
Q Consensus 381 ~~~~~C~~C~~~f~~~~~L~~H--~~---~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~ 455 (555)
.+++.|.+|...|.....|..+ ++ .+.+.+||.|+.|++.|.+...|..|+..| ..+|.|+.|++.|.....|.
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h-~~~~~C~~CgK~F~~~~sL~ 116 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT-EHSKVCPVCGKEFRNTDSTL 116 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC-CcCccCCCCCCccCCHHHHH
Confidence 4456666666666655555444 11 122344555555555555555555555443 12455555555555555555
Q ss_pred HHHhhhc
Q psy11590 456 YHISTVH 462 (555)
Q Consensus 456 ~H~~~~H 462 (555)
.|+...|
T Consensus 117 ~H~~~~h 123 (128)
T PHA00733 117 DHVCKKH 123 (128)
T ss_pred HHHHHhc
Confidence 5554433
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.85 E-value=1e-09 Score=74.15 Aligned_cols=44 Identities=20% Similarity=0.438 Sum_probs=35.8
Q ss_pred ccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHH
Q psy11590 468 SAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLV 513 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~ 513 (555)
-|.|+.||+.|...++|..|+++|+ +||+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 3788888888888888888888887 6888888888888777653
No 16
>KOG3993|consensus
Probab=98.65 E-value=5.3e-09 Score=99.59 Aligned_cols=140 Identities=23% Similarity=0.376 Sum_probs=92.7
Q ss_pred eecCCCcccccCHHHHHHHHhhccCCCccccccccccccCHHHHHHHHHhcCC---------------------------
Q psy11590 384 YQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHSN--------------------------- 436 (555)
Q Consensus 384 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~--------------------------- 436 (555)
|.|..|...|.+...|.+|.-.---.-.|+|+.|++.|.-..+|..|.++|..
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~r 347 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAER 347 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccc
Confidence 66666666666666666664211111236666666666666666666666620
Q ss_pred -------CCccCCCChhhcCChHHHHHHHhhhcCCCc-------------------------------------------
Q psy11590 437 -------VKFKCDFCEKVYSHRNRLRYHISTVHENKW------------------------------------------- 466 (555)
Q Consensus 437 -------~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~------------------------------------------- 466 (555)
..|.|.+|++.|.....|++|..++|....
T Consensus 348 sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~s 427 (500)
T KOG3993|consen 348 SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGS 427 (500)
T ss_pred cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeecc
Confidence 037788888888888888888777552110
Q ss_pred -cccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhCCCcc
Q psy11590 467 -KSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKASA 523 (555)
Q Consensus 467 -~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 523 (555)
..-.|++||-.+.++..--.+.+.-..+..|.|.+|...|.+..+|.+|+..+|...
T Consensus 428 ael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 428 AELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred ccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 112477777777766665556555556677999999999999999999998877543
No 17
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.63 E-value=2.1e-08 Score=57.87 Aligned_cols=26 Identities=31% Similarity=0.897 Sum_probs=23.1
Q ss_pred HHHHHHHhcCCCCCeecccccccccC
Q psy11590 483 NMKKHAAVHSTERPFVCKLCGIAYKW 508 (555)
Q Consensus 483 ~L~~H~~~H~~~~~~~C~~C~~~f~~ 508 (555)
+|..|+++|+|++||+|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58899999999999999999999863
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.58 E-value=2.3e-08 Score=67.70 Aligned_cols=42 Identities=17% Similarity=0.403 Sum_probs=26.1
Q ss_pred eecCCCcccccCHHHHHHHHhhccCCCccccccccccccCHHHH
Q psy11590 384 YQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSL 427 (555)
Q Consensus 384 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L 427 (555)
|.|+.||+.|.....|..|+++|+ ++|+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 566666666666666666666665 466666666666555443
No 19
>KOG3993|consensus
Probab=98.55 E-value=2.2e-08 Score=95.45 Aligned_cols=181 Identities=15% Similarity=0.218 Sum_probs=119.6
Q ss_pred CCCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCCChhhHHHHHHHHHHHHHhhcccccccccccc-----cccc--
Q psy11590 70 KCHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPDSKIMIKYAHLLVRHMKQCHTMQVHIPAVYK-----NFRS-- 142 (555)
Q Consensus 70 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~f~~~~~l~~H~~~~H~~~~~c~~c~~-----~~~~-- 142 (555)
..|.|..|...|.+...|.+|.-..--...|+|+.|++. |.-..+|..|.|. |.....-..-+- ...+
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KV----FsCPANLASHRRW-HKPR~eaa~a~~~P~k~~~~~ra 340 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKV----FSCPANLASHRRW-HKPRPEAAKAGSPPPKQAVETRA 340 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCccccc----ccCchhhhhhhcc-cCCchhhhhcCCCChhhhhhhhh
Confidence 458999999999999999999743222235999999999 9999999999999 754431110000 0000
Q ss_pred -cccc----ccCCCCcccCCCChhhcCChHHHHHHHHhccCCCc--------cccC------Cccchhhhhhccc-----
Q psy11590 143 -VTSF----TMNNITQYKCPDCPAILQTFKNLKCHLDIHSGEKD--------FSCH------INSNLSEHIRMHH----- 198 (555)
Q Consensus 143 -~~~~----~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~--------~~C~------~~~~l~~H~~~h~----- 198 (555)
.... .......|.|.+|++.|.+...|+.|+.+|..... +.-. ....+..|...-+
T Consensus 341 e~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~ 420 (500)
T KOG3993|consen 341 EVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDE 420 (500)
T ss_pred hhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccc
Confidence 0000 00234589999999999999999999988763211 1100 1111222221110
Q ss_pred -----cCCceeecCccCccccchhhhhhhhhhcC-CCcccCCCCCccCCCchHHHhHHHhhcC
Q psy11590 199 -----TEEKKYICEVCGTSFLTWSSLNSHKYSHD-DRKFSCSHCGNSYKNPIFLKRHFRESHL 255 (555)
Q Consensus 199 -----~~~~~~~C~~C~~~f~~~~~L~~H~~~h~-~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 255 (555)
.......|++|+..+.++..--.+.+.-. +..|.|.+|..+|.+...|.+|+...|+
T Consensus 421 vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 421 VLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred eeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 11122458888888877766666655444 6679999999999999999999998764
No 20
>PHA00616 hypothetical protein
Probab=98.40 E-value=1.1e-07 Score=61.14 Aligned_cols=34 Identities=18% Similarity=0.347 Sum_probs=19.8
Q ss_pred ccccccchhcccCchHHHHHHHhcCCCCCeeccc
Q psy11590 468 SAACDICGRVFLDSKNMKKHAAVHSTERPFVCKL 501 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~ 501 (555)
||+|..||+.|..++.|..|++.|+|+++|.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4556666666666666666665555555555554
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.39 E-value=1.8e-07 Score=54.04 Aligned_cols=26 Identities=42% Similarity=0.888 Sum_probs=21.5
Q ss_pred HHHHHHhhccCCCccccccccccccC
Q psy11590 398 TLKEHFNIHTSEKKYVCELCGAYFIH 423 (555)
Q Consensus 398 ~L~~H~~~H~~~~~~~C~~C~~~f~~ 423 (555)
+|.+|+++|+|++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47888888888888888888888853
No 22
>PHA00616 hypothetical protein
Probab=98.31 E-value=2.7e-07 Score=59.34 Aligned_cols=33 Identities=21% Similarity=0.467 Sum_probs=27.4
Q ss_pred CeecccccccccChhhHHHHHHhCCCccccCcc
Q psy11590 496 PFVCKLCGIAYKWRKNLVRHQKNCKASAKCSLM 528 (555)
Q Consensus 496 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~ 528 (555)
||+|+.||+.|..+++|..|++.|||++|+.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 678888888888888888888888888887765
No 23
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.20 E-value=2.2e-06 Score=59.66 Aligned_cols=50 Identities=26% Similarity=0.525 Sum_probs=28.3
Q ss_pred ccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhc
Q psy11590 439 FKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVH 491 (555)
Q Consensus 439 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H 491 (555)
|.||+|++ .-+...|..|+...|..+.+.+.|++|...+. .+|..|+..+
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~ 52 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQ 52 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHh
Confidence 56666666 34455666666666655545566666665433 2555555443
No 24
>PHA00732 hypothetical protein
Probab=98.14 E-value=1.5e-06 Score=65.21 Aligned_cols=36 Identities=31% Similarity=0.654 Sum_probs=16.5
Q ss_pred cccccchhcccCchHHHHHHHh-cCCCCCeeccccccccc
Q psy11590 469 AACDICGRVFLDSKNMKKHAAV-HSTERPFVCKLCGIAYK 507 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~f~ 507 (555)
|.|+.||+.|.+...|..|++. |.+ +.|+.||+.|.
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~ 38 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR 38 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC
Confidence 4444455555444445544442 321 34555555444
No 25
>KOG1146|consensus
Probab=98.09 E-value=1.2e-05 Score=88.16 Aligned_cols=148 Identities=19% Similarity=0.187 Sum_probs=93.2
Q ss_pred CCccchhhccChhhHHhhhhh-hhcCCcccCCCCCCChhhHHHHHHHHHHHHHhhccccccccccccccccccccc----
Q psy11590 73 SCQKCEESFDNCNNLWSHMFI-KHENSDFVCNLCPPDSKIMIKYAHLLVRHMKQCHTMQVHIPAVYKNFRSVTSFT---- 147 (555)
Q Consensus 73 ~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~~~~~f~~~~~l~~H~~~~H~~~~~c~~c~~~~~~~~~~~---- 147 (555)
.|..|+..+.+...+..|+.. +.-.+.|+|+.|++. ++....|..|||..|..... ..|.-.-.......
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~----yk~a~~L~vhmRskhp~~~~-~~c~~gq~~~~~arg~~~ 512 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWH----YKLAQTLGVHMRSKHPESQS-AYCKAGQNHPRLARGEVY 512 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchh----hhhHHHhhhcccccccccch-hHhHhccccccccccccc
Confidence 366777888888888888765 444578999999999 99999999999997744332 22221111111111
Q ss_pred cCCCCcccCCCChhhcCChHHHHHHHHh--ccCC----------Ccc-----ccC---Cccchhhhhhcc-ccCCceeec
Q psy11590 148 MNNITQYKCPDCPAILQTFKNLKCHLDI--HSGE----------KDF-----SCH---INSNLSEHIRMH-HTEEKKYIC 206 (555)
Q Consensus 148 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~--h~~~----------~~~-----~C~---~~~~l~~H~~~h-~~~~~~~~C 206 (555)
..+.++|.|..|...++.+.+|.+|++. |..+ ... .+. ..+.+..-.-.- ...+..|.|
T Consensus 513 ~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C 592 (1406)
T KOG1146|consen 513 RCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRC 592 (1406)
T ss_pred cCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcch
Confidence 1346789999999999999999999974 3211 000 010 000000000000 011234889
Q ss_pred CccCccccchhhhhhhhhh
Q psy11590 207 EVCGTSFLTWSSLNSHKYS 225 (555)
Q Consensus 207 ~~C~~~f~~~~~L~~H~~~ 225 (555)
.+|+..-+-...|+.||..
T Consensus 593 ~vc~yetniarnlrihmts 611 (1406)
T KOG1146|consen 593 EVCSYETNIARNLRIHMTA 611 (1406)
T ss_pred hhhcchhhhhhcccccccc
Confidence 9999988888888888753
No 26
>PHA00732 hypothetical protein
Probab=98.03 E-value=3.3e-06 Score=63.33 Aligned_cols=45 Identities=22% Similarity=0.506 Sum_probs=21.1
Q ss_pred ccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHh
Q psy11590 439 FKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAV 490 (555)
Q Consensus 439 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~ 490 (555)
|.|+.|++.|.+...|+.|++.+|. ++.|+.||+.|. .|..|+++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~----~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT----LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC----CCccCCCCCEeC---Chhhhhcc
Confidence 4455555555555555555543232 134555555554 34445433
No 27
>KOG1146|consensus
Probab=97.97 E-value=5.5e-06 Score=90.60 Aligned_cols=77 Identities=13% Similarity=0.245 Sum_probs=63.4
Q ss_pred ccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhc-----------------CCCCCeeccc
Q psy11590 439 FKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVH-----------------STERPFVCKL 501 (555)
Q Consensus 439 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H-----------------~~~~~~~C~~ 501 (555)
+.|..|+..|.....+. |+-. ..+|.|..|...|.....|..|.+.- ..-.+| |..
T Consensus 1261 ~~c~~~~~~~~~~~~~~-~l~~-----~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~ 1333 (1406)
T KOG1146|consen 1261 GECGAVDELLTPSFGIS-TLDV-----THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLA 1333 (1406)
T ss_pred chhhhccccccCcccee-eccc-----chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-chH
Confidence 89999999999988887 6544 16799999999999999999998532 122346 999
Q ss_pred ccccccChhhHHHHHHhCCCc
Q psy11590 502 CGIAYKWRKNLVRHQKNCKAS 522 (555)
Q Consensus 502 C~~~f~~~~~L~~H~~~h~~~ 522 (555)
|...|..+..|..||+.-++.
T Consensus 1334 c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1334 CEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred HHhhcchhHHHHHHHHHhhhc
Confidence 999999999999999975544
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.79 E-value=2.6e-05 Score=54.29 Aligned_cols=51 Identities=27% Similarity=0.621 Sum_probs=41.2
Q ss_pred ccccccchhcccCchHHHHHHH-hcCCC-CCeecccccccccChhhHHHHHHhCCC
Q psy11590 468 SAACDICGRVFLDSKNMKKHAA-VHSTE-RPFVCKLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~-~H~~~-~~~~C~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
.|.|++|++ ..+...|..|.. .|.++ +.+.|++|...+. .+|..|+..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 489999999 556789999965 56554 4699999998755 599999998774
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.69 E-value=2e-05 Score=44.10 Aligned_cols=22 Identities=32% Similarity=0.874 Sum_probs=13.3
Q ss_pred eecccccccccChhhHHHHHHh
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
|.|++|++.|.++.+|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666654
No 30
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.63 E-value=3.3e-05 Score=62.01 Aligned_cols=73 Identities=19% Similarity=0.441 Sum_probs=19.0
Q ss_pred cCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhC
Q psy11590 440 KCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNC 519 (555)
Q Consensus 440 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h 519 (555)
+|..|+..|.+...|..|+...|+.. -+ ....+.....|..+++.- ....+.|.+|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~-~~-----~~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFD-IP-----DQKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccc-cc-----cccccccccccccccccc-cCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 47778888888888888877766543 11 112222344444443322 122678888888888888888888864
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.61 E-value=2.8e-05 Score=43.49 Aligned_cols=22 Identities=36% Similarity=0.800 Sum_probs=12.2
Q ss_pred cccccchhcccCchHHHHHHHh
Q psy11590 469 AACDICGRVFLDSKNMKKHAAV 490 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~ 490 (555)
|.|++|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555554
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.46 E-value=7.2e-05 Score=42.26 Aligned_cols=24 Identities=25% Similarity=0.771 Sum_probs=13.3
Q ss_pred eecccccccccChhhHHHHHHhCC
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKNCK 520 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~h~ 520 (555)
|.|++|++.|.+...|+.|+++||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666543
No 33
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.44 E-value=9.3e-05 Score=43.22 Aligned_cols=26 Identities=27% Similarity=0.622 Sum_probs=18.9
Q ss_pred CeecccccccccChhhHHHHHHhCCC
Q psy11590 496 PFVCKLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 496 ~~~C~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
||+|..|++.|.+..+|..|++.|++
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 57777777777777777777777653
No 34
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.27 E-value=0.00016 Score=57.96 Aligned_cols=71 Identities=18% Similarity=0.430 Sum_probs=12.4
Q ss_pred cCCCcccccCHHHHHHHHhhccCCCccccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhh
Q psy11590 386 CPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHIST 460 (555)
Q Consensus 386 C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~ 460 (555)
|..|+..|.+...|..||...++...- ....+.....+..++..-...++.|..|++.|.+...|..|++.
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp ---------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccc----cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcC
Confidence 555555565555565565443321100 11111122223333322222245555555555555555555554
No 35
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.20 E-value=0.00024 Score=40.01 Aligned_cols=21 Identities=33% Similarity=0.836 Sum_probs=7.6
Q ss_pred cCCCChhhcCChHHHHHHHhh
Q psy11590 440 KCDFCEKVYSHRNRLRYHIST 460 (555)
Q Consensus 440 ~C~~C~~~f~~~~~L~~H~~~ 460 (555)
.|++|++.|.+...|+.|+++
T Consensus 2 ~C~~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 2 QCPICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp E-SSTS-EESSHHHHHHHHHH
T ss_pred CCcCCCCcCCcHHHHHHHHHh
Confidence 344444444444444444333
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.20 E-value=0.00011 Score=68.04 Aligned_cols=27 Identities=37% Similarity=0.838 Sum_probs=18.5
Q ss_pred CCCccCCC--ChhhcCChHHHHHHHhhhc
Q psy11590 436 NVKFKCDF--CEKVYSHRNRLRYHISTVH 462 (555)
Q Consensus 436 ~~~~~C~~--C~~~f~~~~~L~~H~~~~H 462 (555)
++||+|++ |++.|+....|+.|+.-.|
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH 375 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGH 375 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccc
Confidence 45777765 7777777777777776544
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.11 E-value=0.00022 Score=41.57 Aligned_cols=25 Identities=28% Similarity=0.697 Sum_probs=15.7
Q ss_pred ccccccchhcccCchHHHHHHHhcC
Q psy11590 468 SAACDICGRVFLDSKNMKKHAAVHS 492 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~~H~ 492 (555)
||.|+.|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4566666666666666666666553
No 38
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.66 E-value=0.0013 Score=43.31 Aligned_cols=32 Identities=28% Similarity=0.556 Sum_probs=16.8
Q ss_pred CCCCeecccccccccChhhHHHHHHhCCCccc
Q psy11590 493 TERPFVCKLCGIAYKWRKNLVRHQKNCKASAK 524 (555)
Q Consensus 493 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 524 (555)
.+.|-.|++|+..+.+..+|++|+...|+.||
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34566666666666666666666666666654
No 39
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.55 E-value=0.0011 Score=43.65 Aligned_cols=41 Identities=24% Similarity=0.602 Sum_probs=19.9
Q ss_pred HHHHhhhcCCCccccccccchhcccCchHHHHHHHhcCCCCC
Q psy11590 455 RYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVHSTERP 496 (555)
Q Consensus 455 ~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~ 496 (555)
..+.+.++..+ .|-.|++|+..+....+|++|+.++++.+|
T Consensus 12 ~~~~k~~~~S~-~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSE-QPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS---EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccC-CCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34444433333 667777777777777777777766666554
No 40
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.55 E-value=0.0019 Score=60.07 Aligned_cols=67 Identities=31% Similarity=0.564 Sum_probs=50.3
Q ss_pred CCcccCCC--CCccCCCchHHHhHHHhhcC---CCCCCCCcccccccccCceeeCCCcccccccccccccCC
Q psy11590 228 DRKFSCSH--CGNSYKNPIFLKRHFRESHL---KHSKSNPEQYAVKSLQDIMYWCNVCSQRCYLSATCSIHR 294 (555)
Q Consensus 228 ~~~~~C~~--C~~~f~~~~~L~~H~~~~H~---~h~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 294 (555)
++||+|++ |+|.|.+...|+-|+..-|. .+....|+....=+..++||+|+.|++.+.....|+.|+
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 58999987 99999999999999986552 122222322222345667899999999999999888885
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.44 E-value=0.0014 Score=36.86 Aligned_cols=24 Identities=29% Similarity=0.612 Sum_probs=14.5
Q ss_pred eecccccccccChhhHHHHHHhCCC
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
|+|+.|++... ..+|..|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777776666 6677777766553
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.36 E-value=0.0025 Score=36.41 Aligned_cols=22 Identities=27% Similarity=0.596 Sum_probs=10.8
Q ss_pred ecccccccccChhhHHHHHHhC
Q psy11590 498 VCKLCGIAYKWRKNLVRHQKNC 519 (555)
Q Consensus 498 ~C~~C~~~f~~~~~L~~H~~~h 519 (555)
+|..|++.|.....|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 4455555555555555554433
No 43
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.36 E-value=0.0028 Score=36.20 Aligned_cols=23 Identities=35% Similarity=0.764 Sum_probs=13.2
Q ss_pred cccccchhcccCchHHHHHHHhc
Q psy11590 469 AACDICGRVFLDSKNMKKHAAVH 491 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~H 491 (555)
|.|+.|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34555666666666666665544
No 44
>KOG2231|consensus
Probab=96.24 E-value=0.0073 Score=63.39 Aligned_cols=48 Identities=29% Similarity=0.535 Sum_probs=40.5
Q ss_pred eecCccCccccchhhhhhhhhhcCCCcccCCCC------CccCCCchHHHhHHHhhc
Q psy11590 204 YICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHC------GNSYKNPIFLKRHFRESH 254 (555)
Q Consensus 204 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C------~~~f~~~~~L~~H~~~~H 254 (555)
-.|..|...|.....|.+|++.++ |.|-.| +..|.....|..|.+..|
T Consensus 183 p~C~~C~~~fld~~el~rH~~~~h---~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHLRFDH---EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred ccchhhhhhhccHHHHHHhhccce---eheeecCcccccchhcccchHHHHHhhhcC
Confidence 469999999999999999999876 556665 456888899999999876
No 45
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.13 E-value=0.012 Score=55.38 Aligned_cols=101 Identities=30% Similarity=0.519 Sum_probs=69.9
Q ss_pred cccCCC--ChhhcCChHHHHHHHHhccC----------CCccccC----CccchhhhhhccccCCcee----ecCccCcc
Q psy11590 153 QYKCPD--CPAILQTFKNLKCHLDIHSG----------EKDFSCH----INSNLSEHIRMHHTEEKKY----ICEVCGTS 212 (555)
Q Consensus 153 ~~~C~~--C~~~f~~~~~l~~H~~~h~~----------~~~~~C~----~~~~l~~H~~~h~~~~~~~----~C~~C~~~ 212 (555)
.|.|+. |.........|..|.+.-++ .+.|.|. ....|+.|...- ..+..| .|..|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G-~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGG-LEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccccCC-ccccCcCCCchhhhccce
Confidence 588976 88888889999999986432 3445554 466777776543 222233 58999999
Q ss_pred ccchhhhhhhhhhcCCCcccCCCCC----ccCCCchHHHhHHHhhc
Q psy11590 213 FLTWSSLNSHKYSHDDRKFSCSHCG----NSYKNPIFLKRHFRESH 254 (555)
Q Consensus 213 f~~~~~L~~H~~~h~~~~~~C~~C~----~~f~~~~~L~~H~~~~H 254 (555)
|.+...|.+|++..+++-|.|+.=+ +-|.+..+|.+|.+..|
T Consensus 230 FYdDDEL~~HcR~~HE~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h 275 (493)
T COG5236 230 FYDDDELRRHCRLRHEACHICDMVGPIRYQYFKSYEDLEAHFRNAH 275 (493)
T ss_pred ecChHHHHHHHHhhhhhhhhhhccCccchhhhhCHHHHHHHhhcCc
Confidence 9999999999988776544444322 24778888888887643
No 46
>PRK04860 hypothetical protein; Provisional
Probab=96.11 E-value=0.0029 Score=54.76 Aligned_cols=40 Identities=20% Similarity=0.536 Sum_probs=33.4
Q ss_pred cccccccchhcccCchHHHHHHHhcCCCCCeecccccccccChh
Q psy11590 467 KSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRK 510 (555)
Q Consensus 467 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 510 (555)
-+|.|. |+. ....+++|.++|+|+++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 468998 887 7788899999999999999999998886543
No 47
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.79 E-value=0.005 Score=35.00 Aligned_cols=22 Identities=23% Similarity=0.714 Sum_probs=13.5
Q ss_pred eecccccccccChhhHHHHHHh
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
|.|.+|++.|.+...|+.|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666553
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.68 E-value=0.0051 Score=34.95 Aligned_cols=22 Identities=27% Similarity=0.709 Sum_probs=12.9
Q ss_pred cccccchhcccCchHHHHHHHh
Q psy11590 469 AACDICGRVFLDSKNMKKHAAV 490 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~ 490 (555)
|.|++|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666543
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.58 E-value=0.0098 Score=33.39 Aligned_cols=20 Identities=30% Similarity=0.672 Sum_probs=7.3
Q ss_pred cCCCChhhcCChHHHHHHHhh
Q psy11590 440 KCDFCEKVYSHRNRLRYHIST 460 (555)
Q Consensus 440 ~C~~C~~~f~~~~~L~~H~~~ 460 (555)
+|+.|+.... ...|..|++.
T Consensus 2 ~C~~C~y~t~-~~~l~~H~~~ 21 (24)
T PF13909_consen 2 KCPHCSYSTS-KSNLKRHLKR 21 (24)
T ss_dssp E-SSSS-EES-HHHHHHHHHH
T ss_pred CCCCCCCcCC-HHHHHHHHHh
Confidence 3444444443 3344444443
No 50
>KOG2231|consensus
Probab=95.47 E-value=0.039 Score=58.10 Aligned_cols=110 Identities=21% Similarity=0.378 Sum_probs=66.3
Q ss_pred cccCccCccccChhHHhHHHHhhhcccccccCCCCchhhhhHHHHhhhhHhHHHHHHhhhcCCcceecCCCcccccCHHH
Q psy11590 319 HTCNECDSKHNDCQSLWDHVFSEHKNMLAFSKPDDSENKISKVAEKQYWKDVEEKCRIIVEDTMRYQCPHCPSLLKTFWT 398 (555)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~c~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~f~~~~~ 398 (555)
..|..| -.|.+...|.+|+...|....... |- .+.+ .+..+.+. .+...
T Consensus 116 ~~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~l------------C~---------------~~~k--if~~e~k~-Yt~~e 164 (669)
T KOG2231|consen 116 KECLHC-TEFKSVENLKNHMRDQHKLHLCSL------------CL---------------QNLK--IFINERKL-YTRAE 164 (669)
T ss_pred CCCccc-cchhHHHHHHHHHHHhhhhhcccc------------cc---------------ccce--eeeeeeeh-ehHHH
Confidence 348888 788888999999888886532110 00 0011 12222333 34566
Q ss_pred HHHHHhhcc-CCCc----cccccccccccCHHHHHHHHHhcCCCCccCCCC------hhhcCChHHHHHHHhhhc
Q psy11590 399 LKEHFNIHT-SEKK----YVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFC------EKVYSHRNRLRYHISTVH 462 (555)
Q Consensus 399 L~~H~~~H~-~~~~----~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C------~~~f~~~~~L~~H~~~~H 462 (555)
|..|++.-- +++. -.|..|...|.....|..|++.++ |.|..| +.-|.....|..|.+..|
T Consensus 165 l~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h---~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 165 LNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH---EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred HHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce---eheeecCcccccchhcccchHHHHHhhhcC
Confidence 777765421 1221 347788888888888888887665 344444 455777788888888844
No 51
>KOG2482|consensus
Probab=95.44 E-value=0.12 Score=48.95 Aligned_cols=65 Identities=20% Similarity=0.417 Sum_probs=44.2
Q ss_pred chhhhhhccccCCceeecCccCccccc-hhhhhhhhhh-cC---C-------------------CcccCCCCCccCCCch
Q psy11590 189 NLSEHIRMHHTEEKKYICEVCGTSFLT-WSSLNSHKYS-HD---D-------------------RKFSCSHCGNSYKNPI 244 (555)
Q Consensus 189 ~l~~H~~~h~~~~~~~~C~~C~~~f~~-~~~L~~H~~~-h~---~-------------------~~~~C~~C~~~f~~~~ 244 (555)
.|.++++.........+|-.|+..+.. ++....|+.. |+ + ..+.|-.|.+.|+.+.
T Consensus 130 aLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkn 209 (423)
T KOG2482|consen 130 ALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKN 209 (423)
T ss_pred HHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcH
Confidence 344555444233455789999877654 4555666542 32 1 2389999999999999
Q ss_pred HHHhHHHhh
Q psy11590 245 FLKRHFRES 253 (555)
Q Consensus 245 ~L~~H~~~~ 253 (555)
.|+.||+..
T Consensus 210 tLkeHMrkK 218 (423)
T KOG2482|consen 210 TLKEHMRKK 218 (423)
T ss_pred HHHHHHHhc
Confidence 999999863
No 52
>KOG2785|consensus
Probab=95.31 E-value=0.032 Score=53.93 Aligned_cols=136 Identities=16% Similarity=0.277 Sum_probs=93.4
Q ss_pred ceecCCCcccccCHHHHHHHHhh--cc-----------------------------------CCCccccccccccccCHH
Q psy11590 383 RYQCPHCPSLLKTFWTLKEHFNI--HT-----------------------------------SEKKYVCELCGAYFIHKS 425 (555)
Q Consensus 383 ~~~C~~C~~~f~~~~~L~~H~~~--H~-----------------------------------~~~~~~C~~C~~~f~~~~ 425 (555)
.|.|.-|...|.+...-+.|+++ |. ++-++.|.+|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 37899999999999888888863 42 234588999999999999
Q ss_pred HHHHHHHhcCCC------------------Ccc-------------CCCChhhcCChHHHHHHHhhhc----------CC
Q psy11590 426 SLVAHTTTHSNV------------------KFK-------------CDFCEKVYSHRNRLRYHISTVH----------EN 464 (555)
Q Consensus 426 ~L~~H~~~H~~~------------------~~~-------------C~~C~~~f~~~~~L~~H~~~~H----------~~ 464 (555)
+...|+..-... .+. +..+...+........+...-- ..
T Consensus 83 a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~ 162 (390)
T KOG2785|consen 83 AHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDED 162 (390)
T ss_pred hHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhc
Confidence 998887643110 122 3333333333222222221100 00
Q ss_pred CccccccccchhcccCchHHHHHHHhcCCC-----------------------CCeeccccc---ccccChhhHHHHHHh
Q psy11590 465 KWKSAACDICGRVFLDSKNMKKHAAVHSTE-----------------------RPFVCKLCG---IAYKWRKNLVRHQKN 518 (555)
Q Consensus 465 ~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~-----------------------~~~~C~~C~---~~f~~~~~L~~H~~~ 518 (555)
+.-|-.|-+|++.+.+-..-..||..++|- .-|.|-.|+ +.|.+-...+.||..
T Consensus 163 e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 163 ELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred ccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 113467999999999999999999888762 238999999 999999999999974
No 53
>KOG2785|consensus
Probab=94.96 E-value=0.083 Score=51.16 Aligned_cols=172 Identities=17% Similarity=0.188 Sum_probs=102.2
Q ss_pred ccccCccCccccChhHHhHHHHhhhcccccccCCCC-chhhhhHHHHhhhhHhHHHHHHhhhcCCcceecCCCcccccCH
Q psy11590 318 CHTCNECDSKHNDCQSLWDHVFSEHKNMLAFSKPDD-SENKISKVAEKQYWKDVEEKCRIIVEDTMRYQCPHCPSLLKTF 396 (555)
Q Consensus 318 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~-~~~~~~~~c~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~f~~~ 396 (555)
.++|.-|...|.+...-+.|++..--+.-..-+... .+.......++....+... ....-.+..++.|..|.+.|.+.
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~-~~~~e~~~~~~~c~~c~k~~~s~ 81 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEK-EENLEEAESVVYCEACNKSFASP 81 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhh-hhhhhhcccceehHHhhccccCh
Confidence 466999999999999998888765322111111111 1111111112211111111 11123456679999999999999
Q ss_pred HHHHHHHhh--ccC---------C------Cccc-------------cccccccccCHHHHHHHHHhc--------C---
Q psy11590 397 WTLKEHFNI--HTS---------E------KKYV-------------CELCGAYFIHKSSLVAHTTTH--------S--- 435 (555)
Q Consensus 397 ~~L~~H~~~--H~~---------~------~~~~-------------C~~C~~~f~~~~~L~~H~~~H--------~--- 435 (555)
.....|+.. |.. + .... +..+...+........+...- .
T Consensus 82 ~a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e 161 (390)
T KOG2785|consen 82 KAHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDE 161 (390)
T ss_pred hhHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchh
Confidence 999999753 310 0 1111 333333333322222221110 0
Q ss_pred --CCCccCCCChhhcCChHHHHHHHhhhcCCCc---------------------cccccccch---hcccCchHHHHHHH
Q psy11590 436 --NVKFKCDFCEKVYSHRNRLRYHISTVHENKW---------------------KSAACDICG---RVFLDSKNMKKHAA 489 (555)
Q Consensus 436 --~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~---------------------~~~~C~~C~---k~f~~~~~L~~H~~ 489 (555)
..|-.|-.|++.+.+...-..||...|+.-+ .-|.|-.|+ +.|.+..+.+.||.
T Consensus 162 ~e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 162 DELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred cccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 0167899999999999999999999886321 237899998 99999999999996
Q ss_pred h
Q psy11590 490 V 490 (555)
Q Consensus 490 ~ 490 (555)
.
T Consensus 242 ~ 242 (390)
T KOG2785|consen 242 D 242 (390)
T ss_pred h
Confidence 4
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.87 E-value=0.0087 Score=34.73 Aligned_cols=22 Identities=27% Similarity=0.617 Sum_probs=14.7
Q ss_pred eecccccccccChhhHHHHHHh
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
|.|.+|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5567777777777777666653
No 55
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.61 E-value=0.016 Score=33.66 Aligned_cols=22 Identities=32% Similarity=0.596 Sum_probs=20.0
Q ss_pred CCCccchhhccChhhHHhhhhh
Q psy11590 72 HSCQKCEESFDNCNNLWSHMFI 93 (555)
Q Consensus 72 ~~C~~C~~~f~~~~~l~~H~~~ 93 (555)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6799999999999999999875
No 56
>PRK04860 hypothetical protein; Provisional
Probab=94.56 E-value=0.021 Score=49.47 Aligned_cols=39 Identities=28% Similarity=0.702 Sum_probs=30.0
Q ss_pred cceecCCCcccccCHHHHHHHHhhccCCCccccccccccccCH
Q psy11590 382 MRYQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHK 424 (555)
Q Consensus 382 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~ 424 (555)
-+|.|. |+. ....+.+|.++|+++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 358887 877 566778888888888888888888877543
No 57
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.66 E-value=0.014 Score=60.79 Aligned_cols=151 Identities=20% Similarity=0.279 Sum_probs=111.9
Q ss_pred cceecCCCcccccCHHHHHHHHh--hccCC--Cccccc--cccccccCHHHHHHHHHhcCCC-CccCCC--ChhhcCChH
Q psy11590 382 MRYQCPHCPSLLKTFWTLKEHFN--IHTSE--KKYVCE--LCGAYFIHKSSLVAHTTTHSNV-KFKCDF--CEKVYSHRN 452 (555)
Q Consensus 382 ~~~~C~~C~~~f~~~~~L~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~L~~H~~~H~~~-~~~C~~--C~~~f~~~~ 452 (555)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+. ++.+.. +...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 46899999999999999999999 89999 999999 8999999999999999999887 455533 555544433
Q ss_pred HHH----HHHhhhcCCCcccccccc--chhcccCchHHHHHHHhcCCCC--CeecccccccccChhhHHHHHHhCCCccc
Q psy11590 453 RLR----YHISTVHENKWKSAACDI--CGRVFLDSKNMKKHAAVHSTER--PFVCKLCGIAYKWRKNLVRHQKNCKASAK 524 (555)
Q Consensus 453 ~L~----~H~~~~H~~~~~~~~C~~--C~k~f~~~~~L~~H~~~H~~~~--~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 524 (555)
.-. ..... -......+.+.. |-..+.....+..|...|...+ .+.+..|.+.|.....|..|++.+....+
T Consensus 368 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 368 NNEPPQSLQQYK-DLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred CCCCccchhhcc-CccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence 311 11111 111114455543 7777788888888887777666 46778899999999999999998888887
Q ss_pred cCcccCCCC
Q psy11590 525 CSLMTTSPL 533 (555)
Q Consensus 525 ~~~~~~~~~ 533 (555)
+.+......
T Consensus 447 ~~~~~~~~~ 455 (467)
T COG5048 447 LLCSILKSF 455 (467)
T ss_pred eeecccccc
Confidence 766555544
No 58
>KOG4173|consensus
Probab=93.57 E-value=0.038 Score=48.30 Aligned_cols=75 Identities=24% Similarity=0.529 Sum_probs=38.3
Q ss_pred cccccc--ccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhhhcC---------CCcccccccc--chhc
Q psy11590 411 KYVCEL--CGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHISTVHE---------NKWKSAACDI--CGRV 477 (555)
Q Consensus 411 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~~H~---------~~~~~~~C~~--C~k~ 477 (555)
.|.|++ |...|.....+..|..+-++ -.|.+|.+.|.+.-.|..|+..-|. |. .-|+|-+ |+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~-dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQ-DMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCc-cHHHHHHHhhhhh
Confidence 355554 55555555555555433222 2455666666666666665554441 11 3355533 5555
Q ss_pred ccCchHHHHHH
Q psy11590 478 FLDSKNMKKHA 488 (555)
Q Consensus 478 f~~~~~L~~H~ 488 (555)
|.+...-..|+
T Consensus 156 FkT~r~RkdH~ 166 (253)
T KOG4173|consen 156 FKTSRDRKDHM 166 (253)
T ss_pred hhhhhhhhhHH
Confidence 55555555554
No 59
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=93.20 E-value=0.11 Score=49.26 Aligned_cols=78 Identities=24% Similarity=0.415 Sum_probs=57.5
Q ss_pred CCccchhhccChhhHHhhhhhhhcCCcccCCCCCCChhhHHHHHHHHHHHHHhhccccccccccccccccccccccCCCC
Q psy11590 73 SCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPDSKIMIKYAHLLVRHMKQCHTMQVHIPAVYKNFRSVTSFTMNNIT 152 (555)
Q Consensus 73 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~f~~~~~l~~H~~~~H~~~~~c~~c~~~~~~~~~~~~~~~~ 152 (555)
.|..|...|.+-..|..|++..|. +-|.|+.=+......|++-..|..|.+..|
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE-~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h------------------------- 275 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE-ACHICDMVGPIRYQYFKSYEDLEAHFRNAH------------------------- 275 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh-hhhhhhccCccchhhhhCHHHHHHHhhcCc-------------------------
Confidence 499999999999999999998763 334444444446788999999999998755
Q ss_pred cccCCC--C----hhhcCChHHHHHHHHhcc
Q psy11590 153 QYKCPD--C----PAILQTFKNLKCHLDIHS 177 (555)
Q Consensus 153 ~~~C~~--C----~~~f~~~~~l~~H~~~h~ 177 (555)
|.|.. | -.+|....+|..|+..-+
T Consensus 276 -y~ct~qtc~~~k~~vf~~~~el~~h~~~~h 305 (493)
T COG5236 276 -YCCTFQTCRVGKCYVFPYHTELLEHLTRFH 305 (493)
T ss_pred -eEEEEEEEecCcEEEeccHHHHHHHHHHHh
Confidence 33321 2 157889999999986543
No 60
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.78 E-value=0.13 Score=29.08 Aligned_cols=20 Identities=30% Similarity=0.692 Sum_probs=11.1
Q ss_pred ecccccccccChhhHHHHHHh
Q psy11590 498 VCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 498 ~C~~C~~~f~~~~~L~~H~~~ 518 (555)
.|+.||+.| ....|..|++.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHHh
Confidence 455666665 44555555544
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.68 E-value=0.14 Score=31.68 Aligned_cols=23 Identities=30% Similarity=0.654 Sum_probs=17.4
Q ss_pred CeecccccccccChhhHHHHHHh
Q psy11590 496 PFVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 496 ~~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
+|.|++|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888877778777653
No 62
>KOG2482|consensus
Probab=91.28 E-value=0.44 Score=45.33 Aligned_cols=124 Identities=17% Similarity=0.352 Sum_probs=70.8
Q ss_pred HHHHHHHHhhccC-CCccccccccccccC-HHHHHHHHH-hcC---CC-------------------CccCCCChhhcCC
Q psy11590 396 FWTLKEHFNIHTS-EKKYVCELCGAYFIH-KSSLVAHTT-THS---NV-------------------KFKCDFCEKVYSH 450 (555)
Q Consensus 396 ~~~L~~H~~~H~~-~~~~~C~~C~~~f~~-~~~L~~H~~-~H~---~~-------------------~~~C~~C~~~f~~ 450 (555)
+..|.++++--.+ ....+|-.|...+.. .+.+..|+- +|. |. .+.|-.|.+.|..
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrd 207 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRD 207 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCC
Confidence 3456666554332 345679999887654 445666653 332 11 3889999999999
Q ss_pred hHHHHHHHhhhcCCCccccccccchhcc------cCchHHHHH--H--Hhc---------------CCCCC--eeccccc
Q psy11590 451 RNRLRYHISTVHENKWKSAACDICGRVF------LDSKNMKKH--A--AVH---------------STERP--FVCKLCG 503 (555)
Q Consensus 451 ~~~L~~H~~~~H~~~~~~~~C~~C~k~f------~~~~~L~~H--~--~~H---------------~~~~~--~~C~~C~ 503 (555)
+..|+.||+..-.....|-. ..=++.| ..++-...| . .+- .++.+ -+|-.|.
T Consensus 208 kntLkeHMrkK~HrrinPkn-reYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~ 286 (423)
T KOG2482|consen 208 KNTLKEHMRKKRHRRINPKN-REYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCT 286 (423)
T ss_pred cHHHHHHHHhccCcccCCCc-cccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeec
Confidence 99999999873322212200 0000000 001111111 1 110 12233 5899999
Q ss_pred ccccChhhHHHHHHhCC
Q psy11590 504 IAYKWRKNLVRHQKNCK 520 (555)
Q Consensus 504 ~~f~~~~~L~~H~~~h~ 520 (555)
....+...|..||+.-|
T Consensus 287 ~~~en~~~l~eHmk~vH 303 (423)
T KOG2482|consen 287 NFYENPVFLFEHMKIVH 303 (423)
T ss_pred cchhhHHHHHHHHHHHH
Confidence 99999999999999765
No 63
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.20 E-value=0.16 Score=28.80 Aligned_cols=19 Identities=42% Similarity=1.041 Sum_probs=10.8
Q ss_pred ccccchhcccCchHHHHHHH
Q psy11590 470 ACDICGRVFLDSKNMKKHAA 489 (555)
Q Consensus 470 ~C~~C~k~f~~~~~L~~H~~ 489 (555)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 456666666 4555555543
No 64
>KOG2893|consensus
Probab=90.04 E-value=0.099 Score=46.82 Aligned_cols=41 Identities=22% Similarity=0.593 Sum_probs=21.9
Q ss_pred CCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHH
Q psy11590 441 CDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKH 487 (555)
Q Consensus 441 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H 487 (555)
|=+|++.|....-|.+|.+. +-|+|.+|.|...+-..|..|
T Consensus 13 cwycnrefddekiliqhqka------khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA------KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhh------ccceeeeehhhhccCCCceee
Confidence 44555555555555555544 445555555555555555544
No 65
>KOG4173|consensus
Probab=89.40 E-value=0.22 Score=43.66 Aligned_cols=78 Identities=17% Similarity=0.423 Sum_probs=46.5
Q ss_pred ceecCC--CcccccCHHHHHHHHhhccCCCccccccccccccCHHHHHHHHHhcC----------CC-CccCC--CChhh
Q psy11590 383 RYQCPH--CPSLLKTFWTLKEHFNIHTSEKKYVCELCGAYFIHKSSLVAHTTTHS----------NV-KFKCD--FCEKV 447 (555)
Q Consensus 383 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~----------~~-~~~C~--~C~~~ 447 (555)
.+.|++ |...|.+......|..+-++. .|..|.+.|.+...|..|+..-+ |. -|.|- .|+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 466665 666666666666665443332 46777777777777777754221 22 36663 37777
Q ss_pred cCChHHHHHHHhhhcC
Q psy11590 448 YSHRNRLRYHISTVHE 463 (555)
Q Consensus 448 f~~~~~L~~H~~~~H~ 463 (555)
|.+...-..|+..-|.
T Consensus 156 FkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhHHHHhcc
Confidence 7777777777666554
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.83 E-value=0.3 Score=30.15 Aligned_cols=23 Identities=17% Similarity=0.590 Sum_probs=12.0
Q ss_pred CccCCCChhhcCChHHHHHHHhh
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHIST 460 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~ 460 (555)
+|.|++|+..|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 34555555555555555555443
No 67
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.80 E-value=0.21 Score=30.48 Aligned_cols=11 Identities=27% Similarity=0.978 Sum_probs=5.9
Q ss_pred cccccchhccc
Q psy11590 469 AACDICGRVFL 479 (555)
Q Consensus 469 ~~C~~C~k~f~ 479 (555)
|+|.+||..+.
T Consensus 2 ~~C~~CGy~y~ 12 (33)
T cd00350 2 YVCPVCGYIYD 12 (33)
T ss_pred EECCCCCCEEC
Confidence 45555655543
No 68
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=88.41 E-value=0.16 Score=52.69 Aligned_cols=66 Identities=24% Similarity=0.397 Sum_probs=50.3
Q ss_pred ccccccchhcccCchHHHHHHH--hcCCC--CCeecc--cccccccChhhHHHHHHhCCCccccCcccCCCC
Q psy11590 468 SAACDICGRVFLDSKNMKKHAA--VHSTE--RPFVCK--LCGIAYKWRKNLVRHQKNCKASAKCSLMTTSPL 533 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~ 533 (555)
++.|..|...|.....|..|.+ .|+++ +|+.|+ .|++.|.+...+..|..+|.+..++.+......
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSS 360 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCc
Confidence 5777888888888888888877 78888 788888 688888888888888888887777665544443
No 69
>KOG2893|consensus
Probab=86.50 E-value=0.19 Score=45.02 Aligned_cols=46 Identities=26% Similarity=0.420 Sum_probs=39.9
Q ss_pred ccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHH-HHhC
Q psy11590 470 ACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRH-QKNC 519 (555)
Q Consensus 470 ~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H-~~~h 519 (555)
-|-+|++.|.....|.+|.+ .+-|+|.+|.+...+.-.|..| |.+|
T Consensus 12 wcwycnrefddekiliqhqk----akhfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQK----AKHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhh----hccceeeeehhhhccCCCceeehhhhh
Confidence 49999999999999999975 3569999999999999999988 4444
No 70
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=84.88 E-value=1.3 Score=35.83 Aligned_cols=25 Identities=20% Similarity=0.547 Sum_probs=23.3
Q ss_pred eec----ccccccccChhhHHHHHHhCCC
Q psy11590 497 FVC----KLCGIAYKWRKNLVRHQKNCKA 521 (555)
Q Consensus 497 ~~C----~~C~~~f~~~~~L~~H~~~h~~ 521 (555)
|.| ..|++...+...++.|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 899 9999999999999999999886
No 71
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=82.86 E-value=1.9 Score=34.62 Aligned_cols=22 Identities=27% Similarity=0.538 Sum_probs=15.0
Q ss_pred CCccccccccccccCHHHHHHH
Q psy11590 409 EKKYVCELCGAYFIHKSSLVAH 430 (555)
Q Consensus 409 ~~~~~C~~C~~~f~~~~~L~~H 430 (555)
+-|..|++||-.......|.+-
T Consensus 13 ~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHh
Confidence 3567778887777776666653
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=81.90 E-value=0.8 Score=30.95 Aligned_cols=27 Identities=22% Similarity=0.495 Sum_probs=22.4
Q ss_pred CccCCCChhhcCChHHHHHHHhhhcCC
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHISTVHEN 464 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~~H~~ 464 (555)
-+.||.|+..|.....+..|+...|+.
T Consensus 17 ~lrCPRC~~~FR~~K~Y~RHVNKaH~~ 43 (65)
T COG4049 17 FLRCPRCGMVFRRRKDYIRHVNKAHGW 43 (65)
T ss_pred eeeCCchhHHHHHhHHHHHHhhHHhhh
Confidence 578999999999998888888876753
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=79.47 E-value=0.83 Score=30.86 Aligned_cols=28 Identities=29% Similarity=0.711 Sum_probs=16.2
Q ss_pred CCCCCeecccccccccChhhHHHHHHhC
Q psy11590 492 STERPFVCKLCGIAYKWRKNLVRHQKNC 519 (555)
Q Consensus 492 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h 519 (555)
.||.-+.|+.||..|....+..+|...-
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKa 40 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKA 40 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHH
Confidence 3455566666666666666666665443
No 74
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=78.90 E-value=6.4 Score=31.65 Aligned_cols=20 Identities=25% Similarity=0.446 Sum_probs=10.2
Q ss_pred CcccCCCCCccCCCchHHHh
Q psy11590 229 RKFSCSHCGNSYKNPIFLKR 248 (555)
Q Consensus 229 ~~~~C~~C~~~f~~~~~L~~ 248 (555)
.|..|++|+-...+...|.+
T Consensus 14 LP~~CpiCgLtLVss~HLAR 33 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLAR 33 (112)
T ss_pred CCCcCCcCCCEEeccchHHH
Confidence 34555555555555444444
No 75
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=78.78 E-value=0.99 Score=27.72 Aligned_cols=11 Identities=27% Similarity=0.815 Sum_probs=6.1
Q ss_pred cccccchhccc
Q psy11590 469 AACDICGRVFL 479 (555)
Q Consensus 469 ~~C~~C~k~f~ 479 (555)
|+|.+||..+.
T Consensus 3 ~~C~~CG~i~~ 13 (34)
T cd00729 3 WVCPVCGYIHE 13 (34)
T ss_pred EECCCCCCEeE
Confidence 55666665543
No 76
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=78.68 E-value=0.95 Score=28.23 Aligned_cols=33 Identities=9% Similarity=0.220 Sum_probs=23.9
Q ss_pred cccccccccccccccccCCCCcccCCCChhhcC
Q psy11590 132 HIPAVYKNFRSVTSFTMNNITQYKCPDCPAILQ 164 (555)
Q Consensus 132 ~c~~c~~~~~~~~~~~~~~~~~~~C~~C~~~f~ 164 (555)
.|+.|...|...+..........+|+.|+..|.
T Consensus 4 ~Cp~C~~~y~i~d~~ip~~g~~v~C~~C~~~f~ 36 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKIPPKGRKVRCSKCGHVFF 36 (36)
T ss_pred ECCCCCCEEeCCHHHCCCCCcEEECCCCCCEeC
Confidence 466777777766665555566789999998873
No 77
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=78.05 E-value=3.9 Score=33.06 Aligned_cols=25 Identities=20% Similarity=0.587 Sum_probs=23.1
Q ss_pred ccC----CCChhhcCChHHHHHHHhhhcC
Q psy11590 439 FKC----DFCEKVYSHRNRLRYHISTVHE 463 (555)
Q Consensus 439 ~~C----~~C~~~f~~~~~L~~H~~~~H~ 463 (555)
|.| +.|++.+.+...++.|++.+|+
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 999 9999999999999999998774
No 78
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=77.61 E-value=0.94 Score=28.45 Aligned_cols=33 Identities=18% Similarity=0.413 Sum_probs=23.1
Q ss_pred cccccccccccccccccCCCCcccCCCChhhcC
Q psy11590 132 HIPAVYKNFRSVTSFTMNNITQYKCPDCPAILQ 164 (555)
Q Consensus 132 ~c~~c~~~~~~~~~~~~~~~~~~~C~~C~~~f~ 164 (555)
.||.|...|....+....+....+|+.|+..|.
T Consensus 4 ~CP~C~~~f~v~~~~l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDKLPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred ECCCCCceEEcCHHHcccCCcEEECCCCCcEee
Confidence 466677777666654445556889999998874
No 79
>KOG2186|consensus
Probab=73.09 E-value=2.1 Score=39.21 Aligned_cols=55 Identities=22% Similarity=0.470 Sum_probs=37.7
Q ss_pred cccccchhcccCchHHHHHHHhcCCCCCeecccccccccChhhHHHHHHhCCCccccC
Q psy11590 469 AACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAYKWRKNLVRHQKNCKASAKCS 526 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~ 526 (555)
|.|.+||.... +..|.+|+..-++ .-|.|-.|++.|.. .+...|.+--+..+.|.
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEaQKYg 58 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEAQKYG 58 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchHHHhh
Confidence 67777877765 4556667766666 56778888888766 67777777666555554
No 80
>PHA00626 hypothetical protein
Probab=71.62 E-value=2.4 Score=28.91 Aligned_cols=16 Identities=25% Similarity=0.314 Sum_probs=13.2
Q ss_pred CCcccCCCChhhcCCh
Q psy11590 151 ITQYKCPDCPAILQTF 166 (555)
Q Consensus 151 ~~~~~C~~C~~~f~~~ 166 (555)
...|+|+.||..|+.-
T Consensus 21 snrYkCkdCGY~ft~~ 36 (59)
T PHA00626 21 SDDYVCCDCGYNDSKD 36 (59)
T ss_pred CcceEcCCCCCeechh
Confidence 4679999999999753
No 81
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=69.45 E-value=3 Score=27.31 Aligned_cols=25 Identities=28% Similarity=0.597 Sum_probs=15.2
Q ss_pred cccccchhcccCchHHHHHHHhcCCCCCeecccccc
Q psy11590 469 AACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGI 504 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 504 (555)
|.|..||..|... ...+.+|+.||.
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~ 27 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGY 27 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCc
Confidence 6677777766543 234566777764
No 82
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=69.32 E-value=1.8 Score=39.93 Aligned_cols=14 Identities=14% Similarity=0.534 Sum_probs=11.4
Q ss_pred eeCCCccccccccc
Q psy11590 275 YWCNVCSQRCYLSA 288 (555)
Q Consensus 275 ~~C~~C~~~f~~~~ 288 (555)
+.|+.|+..+....
T Consensus 49 ~vCP~CgyA~~~~~ 62 (214)
T PF09986_consen 49 WVCPHCGYAAFEED 62 (214)
T ss_pred EECCCCCCcccccc
Confidence 78999999877654
No 83
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=68.59 E-value=2.1 Score=39.41 Aligned_cols=43 Identities=28% Similarity=0.503 Sum_probs=30.6
Q ss_pred CceeecCccCccccchhhhhhhhhhcC-----------CC-----cccCCCCCccCCCc
Q psy11590 201 EKKYICEVCGTSFLTWSSLNSHKYSHD-----------DR-----KFSCSHCGNSYKNP 243 (555)
Q Consensus 201 ~~~~~C~~C~~~f~~~~~L~~H~~~h~-----------~~-----~~~C~~C~~~f~~~ 243 (555)
++.+.||+|+..|.+..-+....+.-. .. ...|+.||..+...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 456899999999998876665554311 11 26899999987644
No 84
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=67.91 E-value=3.6 Score=33.02 Aligned_cols=30 Identities=23% Similarity=0.751 Sum_probs=19.9
Q ss_pred eecCccCccccchhhhhhhhhhcCCCcccCCCCCccCCCc
Q psy11590 204 YICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYKNP 243 (555)
Q Consensus 204 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~ 243 (555)
..|+.||+.|.... ..|..|+.||..|.-.
T Consensus 10 R~Cp~CG~kFYDLn----------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDLN----------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccCC----------CCCccCCCCCCccCcc
Confidence 45777777777743 2566777777777655
No 85
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=67.00 E-value=2.3 Score=26.81 Aligned_cols=32 Identities=25% Similarity=0.368 Sum_probs=17.8
Q ss_pred ccccccccccccccccCCCCcccCCCChhhcC
Q psy11590 133 IPAVYKNFRSVTSFTMNNITQYKCPDCPAILQ 164 (555)
Q Consensus 133 c~~c~~~~~~~~~~~~~~~~~~~C~~C~~~f~ 164 (555)
|+.|+..|...............|+.|+..|.
T Consensus 5 CP~C~~~~~v~~~~~~~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 5 CPNCKTSFRVVDSQLGANGGKVRCGKCGHVWY 36 (38)
T ss_pred CCCCCCEEEeCHHHcCCCCCEEECCCCCCEEE
Confidence 45555544443332222334688999988774
No 86
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=66.64 E-value=3.8 Score=32.94 Aligned_cols=17 Identities=29% Similarity=0.687 Sum_probs=11.7
Q ss_pred CCceeecCccCccccch
Q psy11590 200 EEKKYICEVCGTSFLTW 216 (555)
Q Consensus 200 ~~~~~~C~~C~~~f~~~ 216 (555)
+..|..|++||..|.-.
T Consensus 23 nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 23 NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred CCCCccCCCCCCccCcc
Confidence 34566788888777655
No 87
>KOG2186|consensus
Probab=66.26 E-value=3.8 Score=37.65 Aligned_cols=48 Identities=27% Similarity=0.619 Sum_probs=39.8
Q ss_pred eeecCccCccccchhhhhhhhhhcCCCcccCCCCCccCCCchHHHhHHHh
Q psy11590 203 KYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYKNPIFLKRHFRE 252 (555)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~ 252 (555)
-|.|.+||.+.. +..|.+|+..-....|.|-.|++.|.. .+++.|..-
T Consensus 3 ~FtCnvCgEsvK-Kp~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~kC 50 (276)
T KOG2186|consen 3 FFTCNVCGESVK-KPQVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTKC 50 (276)
T ss_pred EEehhhhhhhcc-ccchHHHHHhccCCeeEEeeccccccc-chhhhhhhh
Confidence 488999999876 455677998888888999999999998 778888753
No 88
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=64.83 E-value=4.9 Score=27.86 Aligned_cols=33 Identities=15% Similarity=0.283 Sum_probs=21.0
Q ss_pred CCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCCC
Q psy11590 71 CHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPD 108 (555)
Q Consensus 71 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 108 (555)
.|.|+.||..-.-+..-- ..-..+|.|+.||+.
T Consensus 27 ~F~CPnCGe~~I~Rc~~C-----Rk~g~~Y~Cp~CGF~ 59 (61)
T COG2888 27 KFPCPNCGEVEIYRCAKC-----RKLGNPYRCPKCGFE 59 (61)
T ss_pred EeeCCCCCceeeehhhhH-----HHcCCceECCCcCcc
Confidence 478999996654432211 112468999999864
No 89
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=64.79 E-value=4.9 Score=22.96 Aligned_cols=9 Identities=22% Similarity=0.814 Sum_probs=7.4
Q ss_pred cCccCcccc
Q psy11590 321 CNECDSKHN 329 (555)
Q Consensus 321 C~~C~~~f~ 329 (555)
|+.||..|.
T Consensus 17 Cp~CG~~F~ 25 (26)
T PF10571_consen 17 CPHCGYDFE 25 (26)
T ss_pred CCCCCCCCc
Confidence 888988875
No 90
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=63.95 E-value=6.9 Score=25.58 Aligned_cols=23 Identities=43% Similarity=0.779 Sum_probs=10.3
Q ss_pred CCeecccccccccC----hhhHHHHHH
Q psy11590 495 RPFVCKLCGIAYKW----RKNLVRHQK 517 (555)
Q Consensus 495 ~~~~C~~C~~~f~~----~~~L~~H~~ 517 (555)
...+|.+|++.+.. .++|..|++
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhhh
Confidence 34455555555544 255666653
No 91
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=63.18 E-value=7 Score=34.03 Aligned_cols=16 Identities=13% Similarity=0.256 Sum_probs=7.5
Q ss_pred eecCCCcccccCHHHH
Q psy11590 384 YQCPHCPSLLKTFWTL 399 (555)
Q Consensus 384 ~~C~~C~~~f~~~~~L 399 (555)
|.|+.|+..|+....+
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 4455555444444444
No 92
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=62.45 E-value=5.1 Score=42.77 Aligned_cols=74 Identities=19% Similarity=0.367 Sum_probs=54.9
Q ss_pred echhhHHHHHhhccc------eecccccccccCCCCCCCCcccccccccccCCCC-CccchhhccChhhHHhhhhhhhcC
Q psy11590 25 FPCEECKELCVLSKY------CIKHKDCSKAMSTPAPSSESVCTEHSNLFPKCHS-CQKCEESFDNCNNLWSHMFIKHEN 97 (555)
Q Consensus 25 ~~C~~C~~~f~~~~~------l~~h~~C~~~~~~~~~~~~~~~h~~~~~~~~~~~-C~~C~~~f~~~~~l~~H~~~h~~~ 97 (555)
=.|+.|-+.+.+..+ +..|..||-.|+-...+.- -|.++....|. |+.|.+.|.+..+-+-|.+
T Consensus 102 a~C~~Cl~Ei~dp~~rrY~YPF~~CT~CGPRfTIi~alPY----DR~nTsM~~F~lC~~C~~EY~dP~nRRfHAQ----- 172 (750)
T COG0068 102 ATCEDCLEEIFDPNSRRYLYPFINCTNCGPRFTIIEALPY----DRENTSMADFPLCPFCDKEYKDPLNRRFHAQ----- 172 (750)
T ss_pred hhhHHHHHHhcCCCCcceeccccccCCCCcceeeeccCCC----CcccCccccCcCCHHHHHHhcCccccccccc-----
Confidence 469999998877755 3446899999988654432 23455555665 9999999999988766653
Q ss_pred CcccCCCCCCC
Q psy11590 98 SDFVCNLCPPD 108 (555)
Q Consensus 98 ~~~~C~~C~~~ 108 (555)
|..|+.||..
T Consensus 173 -p~aCp~CGP~ 182 (750)
T COG0068 173 -PIACPKCGPH 182 (750)
T ss_pred -cccCcccCCC
Confidence 6899999975
No 93
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=62.06 E-value=6.4 Score=26.58 Aligned_cols=20 Identities=40% Similarity=0.743 Sum_probs=10.3
Q ss_pred ecccccccccCh-----hhHHHHHH
Q psy11590 498 VCKLCGIAYKWR-----KNLVRHQK 517 (555)
Q Consensus 498 ~C~~C~~~f~~~-----~~L~~H~~ 517 (555)
.|.+|++.++.. ++|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 355555555333 45666655
No 94
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=60.92 E-value=8.2 Score=33.14 Aligned_cols=38 Identities=24% Similarity=0.593 Sum_probs=28.0
Q ss_pred cCCceeecCccCccccchhhhhh-hhhhcCCCcccCCCCCccC
Q psy11590 199 TEEKKYICEVCGTSFLTWSSLNS-HKYSHDDRKFSCSHCGNSY 240 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~~~~L~~-H~~~h~~~~~~C~~C~~~f 240 (555)
.+...|.|+.|+..|.....+.. . ....|.|+.||...
T Consensus 95 ~~~~~Y~Cp~C~~~y~~~ea~~~~d----~~~~f~Cp~Cg~~l 133 (147)
T smart00531 95 TNNAYYKCPNCQSKYTFLEANQLLD----MDGTFTCPRCGEEL 133 (147)
T ss_pred cCCcEEECcCCCCEeeHHHHHHhcC----CCCcEECCCCCCEE
Confidence 45567999999999997655432 2 24559999999864
No 95
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=60.77 E-value=3 Score=27.65 Aligned_cols=29 Identities=24% Similarity=0.500 Sum_probs=16.7
Q ss_pred ccccccchhcccCchHHHHHHHhcCCCCCeecccccccc
Q psy11590 468 SAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAY 506 (555)
Q Consensus 468 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 506 (555)
.|.|+.||..|..... ...+.|+.||..+
T Consensus 3 ~y~C~~CG~~~~~~~~----------~~~~~Cp~CG~~~ 31 (46)
T PRK00398 3 EYKCARCGREVELDEY----------GTGVRCPYCGYRI 31 (46)
T ss_pred EEECCCCCCEEEECCC----------CCceECCCCCCeE
Confidence 4677777776654321 1156777777544
No 96
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=60.46 E-value=5.4 Score=34.70 Aligned_cols=24 Identities=38% Similarity=0.838 Sum_probs=17.4
Q ss_pred ceecCCCcccccCHHHHHHHHhhccCCCccccccccc
Q psy11590 383 RYQCPHCPSLLKTFWTLKEHFNIHTSEKKYVCELCGA 419 (555)
Q Consensus 383 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 419 (555)
.|.|++||+++ -|+.|-.||+||.
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 58888887753 4567788888873
No 97
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=57.68 E-value=6.3 Score=25.10 Aligned_cols=31 Identities=32% Similarity=0.842 Sum_probs=22.0
Q ss_pred eeecCccCccccchhhhhhhhhhcCCCcccCCCCCcc
Q psy11590 203 KYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNS 239 (555)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~ 239 (555)
.|+|..||..|........ ...-.|+.||..
T Consensus 5 ~y~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~~ 35 (41)
T smart00834 5 EYRCEDCGHTFEVLQKISD------DPLATCPECGGD 35 (41)
T ss_pred EEEcCCCCCEEEEEEecCC------CCCCCCCCCCCc
Confidence 4899999999875443311 456789999873
No 98
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=56.84 E-value=5.4 Score=26.71 Aligned_cols=29 Identities=28% Similarity=0.557 Sum_probs=16.9
Q ss_pred cccccccchhcccCchHHHHHHHhcCCCCCeeccccccc
Q psy11590 467 KSAACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIA 505 (555)
Q Consensus 467 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~ 505 (555)
..|.|..||+.|... .......|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 357777777777211 12445677777743
No 99
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=56.00 E-value=10 Score=33.07 Aligned_cols=34 Identities=24% Similarity=0.543 Sum_probs=28.3
Q ss_pred cCCceeecCccCccccchhhhhhhhhhcCCCcccCCCCCccC
Q psy11590 199 TEEKKYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSY 240 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f 240 (555)
....-|.|+.|+..|+....+. .-|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~--------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME--------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH--------cCCcCCCCCCEe
Confidence 4567799999999999888885 369999999864
No 100
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=55.96 E-value=9.8 Score=33.83 Aligned_cols=16 Identities=19% Similarity=0.370 Sum_probs=7.9
Q ss_pred ceecCCCcccccCHHH
Q psy11590 383 RYQCPHCPSLLKTFWT 398 (555)
Q Consensus 383 ~~~C~~C~~~f~~~~~ 398 (555)
-|.|+.|+..|+....
T Consensus 117 ~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPNCHIRFTFDEA 132 (178)
T ss_pred EEECCCCCcEEeHHHH
Confidence 3555555555544443
No 101
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=55.18 E-value=7.6 Score=35.84 Aligned_cols=25 Identities=20% Similarity=0.426 Sum_probs=16.1
Q ss_pred CCcccCCCCCccCCCchHHHhHHHh
Q psy11590 228 DRKFSCSHCGNSYKNPIFLKRHFRE 252 (555)
Q Consensus 228 ~~~~~C~~C~~~f~~~~~L~~H~~~ 252 (555)
.+++.|+.|+........|..-.|.
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR~ 231 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTRS 231 (314)
T ss_pred CCCCCCCCCCCcccccccceeeeec
Confidence 5678888888776655555444333
No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=54.66 E-value=12 Score=32.03 Aligned_cols=12 Identities=42% Similarity=1.090 Sum_probs=5.8
Q ss_pred ceecCCCccccc
Q psy11590 383 RYQCPHCPSLLK 394 (555)
Q Consensus 383 ~~~C~~C~~~f~ 394 (555)
-|.|+.|+..|.
T Consensus 99 ~Y~Cp~C~~~y~ 110 (147)
T smart00531 99 YYKCPNCQSKYT 110 (147)
T ss_pred EEECcCCCCEee
Confidence 344555554444
No 103
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=54.10 E-value=7.2 Score=26.51 Aligned_cols=30 Identities=30% Similarity=0.837 Sum_probs=21.2
Q ss_pred eeecCccCccccchhhhhhhhhhcCCCcccCCCCCc
Q psy11590 203 KYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGN 238 (555)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~ 238 (555)
.|+|..|+..|........ +..-.|+.||.
T Consensus 5 ey~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~ 34 (52)
T TIGR02605 5 EYRCTACGHRFEVLQKMSD------DPLATCPECGG 34 (52)
T ss_pred EEEeCCCCCEeEEEEecCC------CCCCCCCCCCC
Confidence 4899999999985432211 45567999986
No 104
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=53.59 E-value=8.7 Score=42.07 Aligned_cols=49 Identities=27% Similarity=0.689 Sum_probs=32.9
Q ss_pred cccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhc
Q psy11590 412 YVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVH 491 (555)
Q Consensus 412 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H 491 (555)
..|..||..+ +|+.|+..++ . |... +...|..||..
T Consensus 436 l~C~~Cg~v~------------------~Cp~Cd~~lt------~-----H~~~-~~L~CH~Cg~~-------------- 471 (730)
T COG1198 436 LLCRDCGYIA------------------ECPNCDSPLT------L-----HKAT-GQLRCHYCGYQ-------------- 471 (730)
T ss_pred eecccCCCcc------------------cCCCCCcceE------E-----ecCC-CeeEeCCCCCC--------------
Confidence 4588888765 7888887642 2 4333 66788888743
Q ss_pred CCCCCeeccccccc
Q psy11590 492 STERPFVCKLCGIA 505 (555)
Q Consensus 492 ~~~~~~~C~~C~~~ 505 (555)
...|..|+.||-.
T Consensus 472 -~~~p~~Cp~Cgs~ 484 (730)
T COG1198 472 -EPIPQSCPECGSE 484 (730)
T ss_pred -CCCCCCCCCCCCC
Confidence 3467788888854
No 105
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=53.00 E-value=8.1 Score=32.26 Aligned_cols=34 Identities=29% Similarity=0.571 Sum_probs=18.6
Q ss_pred eecccccccccChhhHHHHHHhCCCccc------cCcccCCCC
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKNCKASAK------CSLMTTSPL 533 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~------~~~~~~~~~ 533 (555)
..|-+||+.|. .|++|++.|||.-| |..+.-.|+
T Consensus 73 i~clecGk~~k---~LkrHL~~~~gltp~eYR~kwGlp~dypl 112 (132)
T PF05443_consen 73 IICLECGKKFK---TLKRHLRTHHGLTPEEYRAKWGLPKDYPL 112 (132)
T ss_dssp EE-TBT--EES---BHHHHHHHTT-S-HHHHHHHTT-GGG--S
T ss_pred eEEccCCcccc---hHHHHHHHccCCCHHHHHHHhCcCCCCcc
Confidence 67888998885 56899999987644 445544444
No 106
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=52.54 E-value=12 Score=21.31 Aligned_cols=19 Identities=21% Similarity=0.546 Sum_probs=10.0
Q ss_pred cCCCChhhcCChHHHHHHHh
Q psy11590 440 KCDFCEKVYSHRNRLRYHIS 459 (555)
Q Consensus 440 ~C~~C~~~f~~~~~L~~H~~ 459 (555)
.||+|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 455665555 4445555543
No 107
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=52.34 E-value=11 Score=33.41 Aligned_cols=35 Identities=20% Similarity=0.575 Sum_probs=28.0
Q ss_pred cCCceeecCccCccccchhhhhhhhhhcCCCcccCCCCCccCC
Q psy11590 199 TEEKKYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYK 241 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~ 241 (555)
....-|.|+.|+..|+....+. .-|.|+.||....
T Consensus 113 ~~~~~Y~Cp~C~~rytf~eA~~--------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 113 ENNMFFFCPNCHIRFTFDEAME--------YGFRCPQCGEMLE 147 (178)
T ss_pred cCCCEEECCCCCcEEeHHHHhh--------cCCcCCCCCCCCe
Confidence 4456799999999999887763 4699999998643
No 108
>KOG1940|consensus
Probab=51.48 E-value=14 Score=35.17 Aligned_cols=36 Identities=17% Similarity=0.410 Sum_probs=29.1
Q ss_pred eeecccchhccccccceechhhHHHHHhhccceecc
Q psy11590 8 FHFDVESCAVSRLQDITFPCEECKELCVLSKYCIKH 43 (555)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~h 43 (555)
+|++.+...+.|.+..+-.|..|.+.+.....+..+
T Consensus 58 CH~~s~~h~~~r~~v~~~~C~~C~~~q~~~~~c~~c 93 (276)
T KOG1940|consen 58 CHNESEDHDLDRKTVYELLCMKCRKIQPVGQICSNC 93 (276)
T ss_pred ecChhhhcccchhhhhhhhhhhHHhhhhhhhccccc
Confidence 577777778888899999999999998877765544
No 109
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.77 E-value=11 Score=32.92 Aligned_cols=10 Identities=30% Similarity=0.966 Sum_probs=5.3
Q ss_pred cccCCCCCcc
Q psy11590 230 KFSCSHCGNS 239 (555)
Q Consensus 230 ~~~C~~C~~~ 239 (555)
.|.|++||..
T Consensus 134 ~~vC~vCGy~ 143 (166)
T COG1592 134 VWVCPVCGYT 143 (166)
T ss_pred EEEcCCCCCc
Confidence 3555555544
No 110
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=49.23 E-value=12 Score=30.64 Aligned_cols=34 Identities=24% Similarity=0.366 Sum_probs=26.4
Q ss_pred eecCccCccccchhhhhhhhhhcCCCcccCCCCCccCCCchHHH
Q psy11590 204 YICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYKNPIFLK 247 (555)
Q Consensus 204 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~ 247 (555)
..|+.||+.|.... ..|..|+.||..|.-...++
T Consensus 10 r~Cp~cg~kFYDLn----------k~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 10 RICPNTGSKFYDLN----------RRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred ccCCCcCccccccC----------CCCccCCCcCCccCcchhhc
Confidence 57999999997532 36899999999987664444
No 111
>KOG2807|consensus
Probab=48.45 E-value=19 Score=34.60 Aligned_cols=36 Identities=19% Similarity=0.509 Sum_probs=28.0
Q ss_pred ceeecCccCccccchhhhhhhhhhcCCCcccCCCCCccCCCchHHHhHH
Q psy11590 202 KKYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYKNPIFLKRHF 250 (555)
Q Consensus 202 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~ 250 (555)
..|.|+.|+...- ..|..|++|+-+..+...|.+=.
T Consensus 275 ~Gy~CP~CkakvC-------------sLP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 275 GGYFCPQCKAKVC-------------SLPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred CceeCCcccCeee-------------cCCccCCccceeEecchHHHHHH
Confidence 4588999986543 46889999999988888887643
No 112
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=47.56 E-value=19 Score=24.01 Aligned_cols=37 Identities=24% Similarity=0.522 Sum_probs=19.9
Q ss_pred eeecCccCccccchhhhhhhhhh------cCCCcccCCCCCcc
Q psy11590 203 KYICEVCGTSFLTWSSLNSHKYS------HDDRKFSCSHCGNS 239 (555)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~------h~~~~~~C~~C~~~ 239 (555)
.|+|.+|+..|.-...--..-.. ....-|.|++|+..
T Consensus 1 ky~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~w~CP~C~a~ 43 (47)
T PF00301_consen 1 KYQCPVCGYVYDPEKGDPENGIPPGTPFEDLPDDWVCPVCGAP 43 (47)
T ss_dssp EEEETTTSBEEETTTBBGGGTB-TT--GGGS-TT-B-TTTSSB
T ss_pred CcCCCCCCEEEcCCcCCcccCcCCCCCHHHCCCCCcCcCCCCc
Confidence 37899999887655432222111 11345899999853
No 113
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=46.02 E-value=14 Score=30.33 Aligned_cols=34 Identities=15% Similarity=0.292 Sum_probs=26.1
Q ss_pred cccCCCChhhcCChHHHHHHHHhccCCCccccCCccchhhhhhccccCCceeecCccCccccchhhh
Q psy11590 153 QYKCPDCPAILQTFKNLKCHLDIHSGEKDFSCHINSNLSEHIRMHHTEEKKYICEVCGTSFLTWSSL 219 (555)
Q Consensus 153 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L 219 (555)
...|+.||+.|.. .+..|..|++||..|.-...+
T Consensus 9 Kr~Cp~cg~kFYD---------------------------------Lnk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 9 KRICPNTGSKFYD---------------------------------LNRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred cccCCCcCccccc---------------------------------cCCCCccCCCcCCccCcchhh
Confidence 4679999999865 334688999999998766443
No 114
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=45.81 E-value=3.3 Score=33.99 Aligned_cols=54 Identities=17% Similarity=0.348 Sum_probs=32.5
Q ss_pred ccceechhhHHHHHhhccceecc-----cccccccCCCCCCCCcccccccccccCCCCCccchhhccChh
Q psy11590 21 QDITFPCEECKELCVLSKYCIKH-----KDCSKAMSTPAPSSESVCTEHSNLFPKCHSCQKCEESFDNCN 85 (555)
Q Consensus 21 ~~~~~~C~~C~~~f~~~~~l~~h-----~~C~~~~~~~~~~~~~~~h~~~~~~~~~~~C~~C~~~f~~~~ 85 (555)
..+-|.|++|.++.+.+..|+-- .+|+..+.. |..+-..+. +|+.|..+|.+.+
T Consensus 77 d~~lYeCnIC~etS~ee~FLKPneCCgY~iCn~Cya~------LWK~~~~yp-----vCPvCkTSFKss~ 135 (140)
T PF05290_consen 77 DPKLYECNICKETSAEERFLKPNECCGYSICNACYAN------LWKFCNLYP-----VCPVCKTSFKSSS 135 (140)
T ss_pred CCCceeccCcccccchhhcCCcccccchHHHHHHHHH------HHHHcccCC-----CCCcccccccccc
Confidence 34678899998888877766653 344444443 333333221 3777777776653
No 115
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=45.22 E-value=16 Score=23.65 Aligned_cols=18 Identities=22% Similarity=0.484 Sum_probs=14.1
Q ss_pred eeCCCccccccccccccc
Q psy11590 275 YWCNVCSQRCYLSATCSI 292 (555)
Q Consensus 275 ~~C~~C~~~f~~~~~l~~ 292 (555)
|+|..|+..|.....+..
T Consensus 6 y~C~~Cg~~fe~~~~~~~ 23 (42)
T PF09723_consen 6 YRCEECGHEFEVLQSISE 23 (42)
T ss_pred EEeCCCCCEEEEEEEcCC
Confidence 999999998887664444
No 116
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=43.58 E-value=27 Score=29.20 Aligned_cols=46 Identities=13% Similarity=0.205 Sum_probs=27.6
Q ss_pred HHHHhhccccccccccccccccccccccCCCCcccCCCChhhcCChH
Q psy11590 121 RHMKQCHTMQVHIPAVYKNFRSVTSFTMNNITQYKCPDCPAILQTFK 167 (555)
Q Consensus 121 ~H~~~~H~~~~~c~~c~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~ 167 (555)
.+++. ......||.|.............+...|+|..|++.|....
T Consensus 22 ~~~~~-~~~~~~cP~C~s~~~~k~g~~~~~~qRyrC~~C~~tf~~~~ 67 (129)
T COG3677 22 YAIRM-QITKVNCPRCKSSNVVKIGGIRRGHQRYKCKSCGSTFTVET 67 (129)
T ss_pred HHHhh-hcccCcCCCCCccceeeECCccccccccccCCcCcceeeec
Confidence 34444 44444666666555333333335567899999999996543
No 117
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=42.69 E-value=17 Score=25.39 Aligned_cols=33 Identities=15% Similarity=0.377 Sum_probs=19.1
Q ss_pred CCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCCC
Q psy11590 71 CHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPD 108 (555)
Q Consensus 71 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 108 (555)
.|.|+.||+.-..+-. +-..-..+|.|+.||+.
T Consensus 25 ~F~CPnCG~~~I~RC~-----~CRk~~~~Y~CP~CGF~ 57 (59)
T PRK14890 25 KFLCPNCGEVIIYRCE-----KCRKQSNPYTCPKCGFE 57 (59)
T ss_pred EeeCCCCCCeeEeech-----hHHhcCCceECCCCCCc
Confidence 3778888876322211 11112357999999864
No 118
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=42.62 E-value=8.1 Score=28.40 Aligned_cols=41 Identities=15% Similarity=0.317 Sum_probs=22.1
Q ss_pred cccccchhcccCchHHHHHHHhcCCCCCeecc--cccccccChhh
Q psy11590 469 AACDICGRVFLDSKNMKKHAAVHSTERPFVCK--LCGIAYKWRKN 511 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~--~C~~~f~~~~~ 511 (555)
+.|+.||........-...-. ..++-++|. .||..|.....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEE
Confidence 456777665533333222211 345567777 78877765543
No 119
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=42.45 E-value=19 Score=24.41 Aligned_cols=13 Identities=38% Similarity=1.022 Sum_probs=10.1
Q ss_pred cccCCCCCccCCC
Q psy11590 230 KFSCSHCGNSYKN 242 (555)
Q Consensus 230 ~~~C~~C~~~f~~ 242 (555)
.|+|..||..|.-
T Consensus 1 ~y~C~~CgyiYd~ 13 (50)
T cd00730 1 KYECRICGYIYDP 13 (50)
T ss_pred CcCCCCCCeEECC
Confidence 3789999988764
No 120
>KOG2071|consensus
Probab=41.77 E-value=18 Score=37.93 Aligned_cols=23 Identities=13% Similarity=0.358 Sum_probs=19.7
Q ss_pred CccCCCChhhcCChHHHHHHHhh
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHIST 460 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~ 460 (555)
|-+|..||..|........|+..
T Consensus 418 pnqC~~CG~R~~~~ee~sk~md~ 440 (579)
T KOG2071|consen 418 PNQCKSCGLRFDDSEERSKHMDI 440 (579)
T ss_pred cchhcccccccccchhhhhHhhh
Confidence 68899999999998888777776
No 121
>KOG4167|consensus
Probab=40.89 E-value=7 Score=41.60 Aligned_cols=28 Identities=25% Similarity=0.408 Sum_probs=24.7
Q ss_pred CCeecccccccccChhhHHHHHHhCCCc
Q psy11590 495 RPFVCKLCGIAYKWRKNLVRHQKNCKAS 522 (555)
Q Consensus 495 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 522 (555)
..|-|.+|++.|..-..+..||++|--.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~q 818 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQQ 818 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHHH
Confidence 3499999999999999999999999643
No 122
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=40.85 E-value=27 Score=21.08 Aligned_cols=26 Identities=31% Similarity=0.818 Sum_probs=15.5
Q ss_pred eecCccCccccchhhhhhhhhhcCCCcccCCCCCcc
Q psy11590 204 YICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNS 239 (555)
Q Consensus 204 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~ 239 (555)
|.|..|+..+.... ..+.+|+.||..
T Consensus 1 Y~C~~Cg~~~~~~~----------~~~irC~~CG~R 26 (32)
T PF03604_consen 1 YICGECGAEVELKP----------GDPIRCPECGHR 26 (32)
T ss_dssp EBESSSSSSE-BST----------SSTSSBSSSS-S
T ss_pred CCCCcCCCeeEcCC----------CCcEECCcCCCe
Confidence 56777887765221 345788888754
No 123
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=40.82 E-value=12 Score=25.74 Aligned_cols=30 Identities=10% Similarity=0.194 Sum_probs=22.4
Q ss_pred ceechhhHHHHHhhccceecccccccccCC
Q psy11590 23 ITFPCEECKELCVLSKYCIKHKDCSKAMST 52 (555)
Q Consensus 23 ~~~~C~~C~~~f~~~~~l~~h~~C~~~~~~ 52 (555)
..-.|++||+.|......-.+++|+.-+-+
T Consensus 4 ~~~~C~~Cg~~~~~~dDiVvCp~CgapyHR 33 (54)
T PF14446_consen 4 EGCKCPVCGKKFKDGDDIVVCPECGAPYHR 33 (54)
T ss_pred cCccChhhCCcccCCCCEEECCCCCCcccH
Confidence 345688999999888888888877765543
No 124
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=40.47 E-value=7.8 Score=34.07 Aligned_cols=20 Identities=25% Similarity=0.331 Sum_probs=16.5
Q ss_pred ccccceechhhHHHHHhhcc
Q psy11590 19 RLQDITFPCEECKELCVLSK 38 (555)
Q Consensus 19 ~~~~~~~~C~~C~~~f~~~~ 38 (555)
+..+.||.|.+|.+.|.+.-
T Consensus 191 ~~e~IPF~C~iCKkdy~spv 210 (259)
T COG5152 191 PGEKIPFLCGICKKDYESPV 210 (259)
T ss_pred CCCCCceeehhchhhccchh
Confidence 34678999999999988764
No 125
>PF15269 zf-C2H2_7: Zinc-finger
Probab=39.42 E-value=21 Score=23.11 Aligned_cols=23 Identities=26% Similarity=0.544 Sum_probs=20.5
Q ss_pred ceecCCCcccccCHHHHHHHHhh
Q psy11590 383 RYQCPHCPSLLKTFWTLKEHFNI 405 (555)
Q Consensus 383 ~~~C~~C~~~f~~~~~L~~H~~~ 405 (555)
.|+|-+|+.+...++.|-.||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 48999999999999999999875
No 126
>PRK04023 DNA polymerase II large subunit; Validated
Probab=39.07 E-value=24 Score=39.60 Aligned_cols=13 Identities=31% Similarity=0.744 Sum_probs=7.8
Q ss_pred cccccccchhccc
Q psy11590 467 KSAACDICGRVFL 479 (555)
Q Consensus 467 ~~~~C~~C~k~f~ 479 (555)
.++.|+.||..-.
T Consensus 662 ~~y~CPKCG~El~ 674 (1121)
T PRK04023 662 EEDECEKCGREPT 674 (1121)
T ss_pred CCCcCCCCCCCCC
Confidence 3466777776543
No 127
>KOG2593|consensus
Probab=38.13 E-value=29 Score=34.92 Aligned_cols=36 Identities=22% Similarity=0.578 Sum_probs=23.3
Q ss_pred CccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcc
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVF 478 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f 478 (555)
.|.||.|++.|+....++.= -... ..|.|..|+-..
T Consensus 128 ~Y~Cp~C~kkyt~Lea~~L~----~~~~-~~F~C~~C~gel 163 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEALQLL----DNET-GEFHCENCGGEL 163 (436)
T ss_pred cccCCccccchhhhHHHHhh----cccC-ceEEEecCCCch
Confidence 58888888888777776541 1212 458888886544
No 128
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=37.65 E-value=14 Score=21.91 Aligned_cols=8 Identities=38% Similarity=1.227 Sum_probs=1.4
Q ss_pred CCCChhhc
Q psy11590 441 CDFCEKVY 448 (555)
Q Consensus 441 C~~C~~~f 448 (555)
|+.|+..+
T Consensus 5 Cp~C~se~ 12 (30)
T PF08274_consen 5 CPLCGSEY 12 (30)
T ss_dssp -TTT----
T ss_pred CCCCCCcc
Confidence 44444444
No 129
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=36.99 E-value=12 Score=34.19 Aligned_cols=27 Identities=26% Similarity=0.465 Sum_probs=12.0
Q ss_pred CeecccccccccChhhHHHHHHhCCCc
Q psy11590 496 PFVCKLCGIAYKWRKNLVRHQKNCKAS 522 (555)
Q Consensus 496 ~~~C~~C~~~f~~~~~L~~H~~~h~~~ 522 (555)
.|.|.+|+|.|.-..=+++|+.+-|++
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred EECCCCCCcccCChHHHHHHHhhcCHH
Confidence 355555555555555555555544433
No 130
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=36.87 E-value=21 Score=32.62 Aligned_cols=28 Identities=32% Similarity=0.698 Sum_probs=21.6
Q ss_pred CccCCCChhhcCChHHHHHHHhhhcCCC
Q psy11590 438 KFKCDFCEKVYSHRNRLRYHISTVHENK 465 (555)
Q Consensus 438 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 465 (555)
.|.|+.|+|.|.-..-+++|+...|...
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred EECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 6899999999999999999999888654
No 131
>PF12907 zf-met2: Zinc-binding
Probab=36.81 E-value=26 Score=22.39 Aligned_cols=19 Identities=37% Similarity=0.894 Sum_probs=8.2
Q ss_pred ccccchhcc---cCchHHHHHH
Q psy11590 470 ACDICGRVF---LDSKNMKKHA 488 (555)
Q Consensus 470 ~C~~C~k~f---~~~~~L~~H~ 488 (555)
+|.+|...| .+...|..|.
T Consensus 3 ~C~iC~qtF~~t~~~~~L~eH~ 24 (40)
T PF12907_consen 3 ICKICRQTFMQTTNEPQLKEHA 24 (40)
T ss_pred CcHHhhHHHHhcCCHHHHHHHH
Confidence 455554333 2233455553
No 132
>KOG1280|consensus
Probab=36.60 E-value=26 Score=34.01 Aligned_cols=27 Identities=19% Similarity=0.500 Sum_probs=15.2
Q ss_pred ccCCCChhhcCChHHHHHHHhhhcCCC
Q psy11590 439 FKCDFCEKVYSHRNRLRYHISTVHENK 465 (555)
Q Consensus 439 ~~C~~C~~~f~~~~~L~~H~~~~H~~~ 465 (555)
|.|++|+..=.+...|..|+...|...
T Consensus 80 ftCPyC~~~Gfte~~f~~Hv~s~Hpda 106 (381)
T KOG1280|consen 80 FTCPYCGIMGFTERQFGTHVLSQHPEA 106 (381)
T ss_pred ccCCcccccccchhHHHHHhhhcCccc
Confidence 555555555555555555655555443
No 133
>KOG2807|consensus
Probab=36.13 E-value=72 Score=30.84 Aligned_cols=22 Identities=23% Similarity=0.416 Sum_probs=17.1
Q ss_pred CcccCCCChhhcCChHHHHHHH
Q psy11590 152 TQYKCPDCPAILQTFKNLKCHL 173 (555)
Q Consensus 152 ~~~~C~~C~~~f~~~~~l~~H~ 173 (555)
-|-.|++|+-+..+...|.+=.
T Consensus 289 LP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 289 LPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred CCccCCccceeEecchHHHHHH
Confidence 4778999999888887776543
No 134
>KOG2071|consensus
Probab=35.41 E-value=26 Score=36.79 Aligned_cols=28 Identities=21% Similarity=0.359 Sum_probs=23.5
Q ss_pred CCcceecCCCcccccCHHHHHHHHhhcc
Q psy11590 380 DTMRYQCPHCPSLLKTFWTLKEHFNIHT 407 (555)
Q Consensus 380 ~~~~~~C~~C~~~f~~~~~L~~H~~~H~ 407 (555)
...+-+|..||.+|........||.+|.
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhh
Confidence 4567899999999999988888887763
No 135
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=35.37 E-value=17 Score=30.40 Aligned_cols=27 Identities=33% Similarity=0.543 Sum_probs=17.6
Q ss_pred CcccCCCChhhcCChHHHHHHHHhccCCCc
Q psy11590 152 TQYKCPDCPAILQTFKNLKCHLDIHSGEKD 181 (555)
Q Consensus 152 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 181 (555)
....|-+||+.|.. |++|++.|+|..|
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred CeeEEccCCcccch---HHHHHHHccCCCH
Confidence 45789999999977 4899999988655
No 136
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=35.03 E-value=25 Score=26.68 Aligned_cols=17 Identities=24% Similarity=0.624 Sum_probs=12.7
Q ss_pred cCCceeecCccCccccc
Q psy11590 199 TEEKKYICEVCGTSFLT 215 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~ 215 (555)
.+...|.|..|+..|.-
T Consensus 49 ~a~GIW~C~kCg~~fAG 65 (89)
T COG1997 49 IATGIWKCRKCGAKFAG 65 (89)
T ss_pred eccCeEEcCCCCCeecc
Confidence 34567899999988864
No 137
>KOG4124|consensus
Probab=34.85 E-value=8.5 Score=37.11 Aligned_cols=26 Identities=38% Similarity=0.852 Sum_probs=16.8
Q ss_pred CCccCCC--ChhhcCChHHHHHHHhhhc
Q psy11590 437 VKFKCDF--CEKVYSHRNRLRYHISTVH 462 (555)
Q Consensus 437 ~~~~C~~--C~~~f~~~~~L~~H~~~~H 462 (555)
.+|+|++ |++.+.....|+.|....|
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h 375 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGH 375 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCc
Confidence 3667743 7777777777777665544
No 138
>KOG4124|consensus
Probab=34.57 E-value=23 Score=34.31 Aligned_cols=73 Identities=23% Similarity=0.331 Sum_probs=47.0
Q ss_pred hccccccceechhh--HHHHHhhccceeccc---cccc-ccCCCCCCCCcccccccccccCCCCCccchhhccChhhHHh
Q psy11590 16 AVSRLQDITFPCEE--CKELCVLSKYCIKHK---DCSK-AMSTPAPSSESVCTEHSNLFPKCHSCQKCEESFDNCNNLWS 89 (555)
Q Consensus 16 ~~~~~~~~~~~C~~--C~~~f~~~~~l~~h~---~C~~-~~~~~~~~~~~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~ 89 (555)
++.-..++||.|.+ |++.+.....|..|. .|.- ...++. -+-|...-...|+|+|++|.+++.....|..
T Consensus 341 ~~~~~~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~----~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~ 416 (442)
T KOG4124|consen 341 SNLVVVDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPA----PIPHQGFVVENKPYRCEVCSKRYKNLNGLKY 416 (442)
T ss_pred CeEEEecCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCC----CCCcceeeeccCcccChhhhhhhccCCCCCc
Confidence 34445789999987 999999999998873 3322 122211 1112222234578889999988887777766
Q ss_pred hhh
Q psy11590 90 HMF 92 (555)
Q Consensus 90 H~~ 92 (555)
|..
T Consensus 417 ~~~ 419 (442)
T KOG4124|consen 417 HRT 419 (442)
T ss_pred eee
Confidence 654
No 139
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=34.35 E-value=19 Score=25.16 Aligned_cols=39 Identities=18% Similarity=0.498 Sum_probs=20.0
Q ss_pred ceecCC--CcccccCHHHHHHHHhhccCCCcccccc----cccccc
Q psy11590 383 RYQCPH--CPSLLKTFWTLKEHFNIHTSEKKYVCEL----CGAYFI 422 (555)
Q Consensus 383 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~----C~~~f~ 422 (555)
+..|+. |...+. +..|..|+...=..++..|+. |+..+.
T Consensus 9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 456766 444343 446777776555556666776 666653
No 140
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=33.53 E-value=28 Score=23.36 Aligned_cols=16 Identities=31% Similarity=0.694 Sum_probs=8.2
Q ss_pred cccccccchhcccCch
Q psy11590 467 KSAACDICGRVFLDSK 482 (555)
Q Consensus 467 ~~~~C~~C~k~f~~~~ 482 (555)
+.+.|..||..|....
T Consensus 3 k~l~C~dCg~~FvfTa 18 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTA 18 (49)
T ss_pred eeEEcccCCCeEEEeh
Confidence 4455555555554433
No 141
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=33.07 E-value=39 Score=21.69 Aligned_cols=25 Identities=24% Similarity=0.527 Sum_probs=18.7
Q ss_pred cccCCCChhhcC--ChHHHHHHHHhcc
Q psy11590 153 QYKCPDCPAILQ--TFKNLKCHLDIHS 177 (555)
Q Consensus 153 ~~~C~~C~~~f~--~~~~l~~H~~~h~ 177 (555)
.-.|+.||..|. ...+-+.|.+-|.
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 458999999995 5567777877663
No 142
>KOG2593|consensus
Probab=31.18 E-value=26 Score=35.21 Aligned_cols=41 Identities=17% Similarity=0.513 Sum_probs=26.2
Q ss_pred cccccCCCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCCC
Q psy11590 65 SNLFPKCHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPD 108 (555)
Q Consensus 65 ~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 108 (555)
..+....|.|+.|.+.|.+...+. ..-.....|.|..|+-.
T Consensus 122 d~t~~~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 122 DDTNVAGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGGE 162 (436)
T ss_pred hccccccccCCccccchhhhHHHH---hhcccCceEEEecCCCc
Confidence 334556788888888887766654 22223446888888754
No 143
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=30.53 E-value=35 Score=33.11 Aligned_cols=48 Identities=29% Similarity=0.636 Sum_probs=0.0
Q ss_pred CccccccccccccCHHHHHHHHHhc-------------------------------------------------------
Q psy11590 410 KKYVCELCGAYFIHKSSLVAHTTTH------------------------------------------------------- 434 (555)
Q Consensus 410 ~~~~C~~C~~~f~~~~~L~~H~~~H------------------------------------------------------- 434 (555)
+.+.|+.|++-|.+..-+..|+...
T Consensus 237 ~~~YC~~C~r~f~~~~VFe~Hl~gK~H~k~~~~~~~~v~~Ey~l~r~~kyl~d~~s~trs~V~r~la~ta~ER~aei~~l 316 (470)
T COG5188 237 PKVYCVKCGREFSRSKVFEYHLEGKRHCKEGQGKEEFVYSEYVLHRYLKYLGDPVSETRSLVLRSLAITAKERKAEISLL 316 (470)
T ss_pred cceeeHhhhhHhhhhHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHhCChhHHHHHHHHHHHHHHHHHHHHHhHHH
Q ss_pred ------------------------------------------------------CCCCccCCCCh-hhcCChHHHHHH
Q psy11590 435 ------------------------------------------------------SNVKFKCDFCE-KVYSHRNRLRYH 457 (555)
Q Consensus 435 ------------------------------------------------------~~~~~~C~~C~-~~f~~~~~L~~H 457 (555)
.+..|.|.+|| +++..+..+.+|
T Consensus 317 ~r~~~~~at~S~e~EGaeq~d~eQ~DE~~~~k~fdmPLG~DG~PmP~WL~klhgLd~ef~CEICgNyvy~GR~~FdrH 394 (470)
T COG5188 317 SRRKKQPATKSSEKEGAEQVDGEQRDEHVSGKSFDMPLGPDGLPMPRWLCKLHGLDIEFECEICGNYVYYGRDRFDRH 394 (470)
T ss_pred HHHhhccCCCchhhcccccccccccchhhccCcccCCCCCCCCCCchHHHHhcCCCcceeeeecccccccchHHHHhh
No 144
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=30.46 E-value=47 Score=30.69 Aligned_cols=57 Identities=19% Similarity=0.451 Sum_probs=40.1
Q ss_pred CcccCCCCCCChhhHHHHHHHHHHHHHhhccccc---cccccccccccccccccCCCCcccCCCChhhcCChHH
Q psy11590 98 SDFVCNLCPPDSKIMIKYAHLLVRHMKQCHTMQV---HIPAVYKNFRSVTSFTMNNITQYKCPDCPAILQTFKN 168 (555)
Q Consensus 98 ~~~~C~~C~~~~~~~f~~~~~l~~H~~~~H~~~~---~c~~c~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~ 168 (555)
.-|.|..|+.. .++ . ....+ .|.-|.+.|.-...-..-|...|.|+.|+..|.....
T Consensus 111 rqFaC~~Cd~~-----WwR--------r-vp~rKeVSRCr~C~~rYDPVP~dkmwG~aef~C~~C~h~F~G~~q 170 (278)
T PF15135_consen 111 RQFACSSCDHM-----WWR--------R-VPQRKEVSRCRKCRKRYDPVPCDKMWGIAEFHCPKCRHNFRGFAQ 170 (278)
T ss_pred eeeeccccchH-----HHh--------c-cCcccccccccccccccCCCccccccceeeeecccccccchhhhh
Confidence 55999999653 221 1 22222 7888888887766555567778999999999987654
No 145
>PF14353 CpXC: CpXC protein
Probab=30.43 E-value=16 Score=30.44 Aligned_cols=23 Identities=22% Similarity=0.498 Sum_probs=18.8
Q ss_pred eeeCCCcccccccccccccCCCC
Q psy11590 274 MYWCNVCSQRCYLSATCSIHRPS 296 (555)
Q Consensus 274 ~~~C~~C~~~f~~~~~l~~H~~~ 296 (555)
.|.|+.||..|.-...+..|-..
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~~ 60 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDPE 60 (128)
T ss_pred EEECCCCCCceecCCCEEEEcCC
Confidence 39999999999888887777544
No 146
>KOG3408|consensus
Probab=30.18 E-value=31 Score=27.94 Aligned_cols=25 Identities=28% Similarity=0.364 Sum_probs=20.2
Q ss_pred CCCeecccccccccChhhHHHHHHh
Q psy11590 494 ERPFVCKLCGIAYKWRKNLVRHQKN 518 (555)
Q Consensus 494 ~~~~~C~~C~~~f~~~~~L~~H~~~ 518 (555)
-..|-|-.|.+.|.+...|..|.++
T Consensus 55 ~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 55 GGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CceeehhhhhhhhcchHHHHHHHhc
Confidence 3448888888888888888888875
No 147
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=30.00 E-value=37 Score=28.10 Aligned_cols=25 Identities=24% Similarity=0.372 Sum_probs=18.0
Q ss_pred eecccccccccChhhHHHHHHhCCCccc
Q psy11590 497 FVCKLCGIAYKWRKNLVRHQKNCKASAK 524 (555)
Q Consensus 497 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 524 (555)
..|-.+|+.|. +|++|+.+|+|.-|
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 46888887774 67788888877544
No 148
>KOG4167|consensus
Probab=29.69 E-value=16 Score=38.98 Aligned_cols=27 Identities=19% Similarity=0.393 Sum_probs=24.3
Q ss_pred CCCCCccchhhccChhhHHhhhhhhhc
Q psy11590 70 KCHSCQKCEESFDNCNNLWSHMFIKHE 96 (555)
Q Consensus 70 ~~~~C~~C~~~f~~~~~l~~H~~~h~~ 96 (555)
..|.|.+|++.|....++..||+.|.-
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 458999999999999999999999863
No 149
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=29.17 E-value=25 Score=30.39 Aligned_cols=17 Identities=24% Similarity=0.622 Sum_probs=14.2
Q ss_pred cccCCCChhhcCChHHH
Q psy11590 153 QYKCPDCPAILQTFKNL 169 (555)
Q Consensus 153 ~~~C~~C~~~f~~~~~l 169 (555)
.++|+.||+.|.+...+
T Consensus 28 ~~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 28 RRECLACGKRFTTFERV 44 (154)
T ss_pred eeeccccCCcceEeEec
Confidence 48999999999887654
No 150
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=28.80 E-value=78 Score=29.52 Aligned_cols=85 Identities=22% Similarity=0.396 Sum_probs=48.9
Q ss_pred cCCceeecCccCccccchhhhhhhhhhcC---CCcccCCCCCccCCCchHH-------HhHHHhhcCCCCCCCCcccccc
Q psy11590 199 TEEKKYICEVCGTSFLTWSSLNSHKYSHD---DRKFSCSHCGNSYKNPIFL-------KRHFRESHLKHSKSNPEQYAVK 268 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~---~~~~~C~~C~~~f~~~~~L-------~~H~~~~H~~h~~~~~~~~~~~ 268 (555)
.|.+.|+|.+|+. |.-..+--.|+..-- ...|+|..|++.= ..+-| ..|++..-.+-...++
T Consensus 138 hGGrif~CsfC~~-flCEDDQFEHQAsCQvLe~E~~KC~SCNrlG-q~sCLRCK~cfCddHvrrKg~ky~k~k~------ 209 (314)
T PF06524_consen 138 HGGRIFKCSFCDN-FLCEDDQFEHQASCQVLESETFKCQSCNRLG-QYSCLRCKICFCDDHVRRKGFKYEKGKP------ 209 (314)
T ss_pred CCCeEEEeecCCC-eeeccchhhhhhhhhhhhccccccccccccc-chhhhheeeeehhhhhhhcccccccCCC------
Confidence 4567899999985 444444456665432 4467777776531 11222 3566542222222333
Q ss_pred cccCceeeCCCcccccccccccccCCCCC
Q psy11590 269 SLQDIMYWCNVCSQRCYLSATCSIHRPSN 297 (555)
Q Consensus 269 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 297 (555)
+.|+.|+........|..-.+.|
T Consensus 210 ------~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 210 ------IPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred ------CCCCCCCCcccccccceeeeecc
Confidence 89999998877666665544444
No 151
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=28.79 E-value=31 Score=24.95 Aligned_cols=26 Identities=35% Similarity=0.757 Sum_probs=9.8
Q ss_pred cccccchhcccCchHHHHHHHhcCCCCCeecccccccc
Q psy11590 469 AACDICGRVFLDSKNMKKHAAVHSTERPFVCKLCGIAY 506 (555)
Q Consensus 469 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 506 (555)
-.|.+|++.|.. -.+.+.|..||..|
T Consensus 10 ~~C~~C~~~F~~------------~~rrhhCr~CG~~v 35 (69)
T PF01363_consen 10 SNCMICGKKFSL------------FRRRHHCRNCGRVV 35 (69)
T ss_dssp SB-TTT--B-BS------------SS-EEE-TTT--EE
T ss_pred CcCcCcCCcCCC------------ceeeEccCCCCCEE
Confidence 356667777743 13445666666654
No 152
>PF03107 C1_2: C1 domain; InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=27.15 E-value=15 Score=21.70 Aligned_cols=22 Identities=18% Similarity=0.391 Sum_probs=11.9
Q ss_pred echhhHHHHHhhccceecccccc
Q psy11590 25 FPCEECKELCVLSKYCIKHKDCS 47 (555)
Q Consensus 25 ~~C~~C~~~f~~~~~l~~h~~C~ 47 (555)
|.|++|++...... ++.++.|+
T Consensus 1 ~~C~~C~~~~~~~~-~Y~C~~c~ 22 (30)
T PF03107_consen 1 FWCDVCRRKIDGFY-FYHCSECC 22 (30)
T ss_pred CCCCCCCCCcCCCE-eEEeCCCC
Confidence 45666666555554 55555444
No 153
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=26.54 E-value=28 Score=19.50 Aligned_cols=11 Identities=27% Similarity=0.981 Sum_probs=8.9
Q ss_pred CCcccCCCChh
Q psy11590 151 ITQYKCPDCPA 161 (555)
Q Consensus 151 ~~~~~C~~C~~ 161 (555)
...|.|+.||+
T Consensus 14 ~v~f~CPnCG~ 24 (24)
T PF07754_consen 14 AVPFPCPNCGF 24 (24)
T ss_pred CceEeCCCCCC
Confidence 46799999985
No 154
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=25.93 E-value=22 Score=20.90 Aligned_cols=7 Identities=29% Similarity=1.175 Sum_probs=2.1
Q ss_pred CCCccch
Q psy11590 72 HSCQKCE 78 (555)
Q Consensus 72 ~~C~~C~ 78 (555)
|.|.+|+
T Consensus 16 Y~C~~Cd 22 (30)
T PF07649_consen 16 YRCSECD 22 (30)
T ss_dssp EE-TTT-
T ss_pred EECccCC
Confidence 3344443
No 155
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=25.74 E-value=79 Score=30.21 Aligned_cols=24 Identities=13% Similarity=0.303 Sum_probs=14.8
Q ss_pred CcccCCCCCccCCCchHHHhHHHh
Q psy11590 229 RKFSCSHCGNSYKNPIFLKRHFRE 252 (555)
Q Consensus 229 ~~~~C~~C~~~f~~~~~L~~H~~~ 252 (555)
..|+|+.|...|-..-+.-.|...
T Consensus 387 ~rY~Ce~CK~~FC~dCdvfiHe~L 410 (421)
T COG5151 387 GRYQCELCKSTFCSDCDVFIHETL 410 (421)
T ss_pred cceechhhhhhhhhhhHHHHHHHH
Confidence 456666666666666666666544
No 156
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=25.29 E-value=34 Score=22.55 Aligned_cols=29 Identities=17% Similarity=0.538 Sum_probs=17.6
Q ss_pred CCCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCC
Q psy11590 70 KCHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPP 107 (555)
Q Consensus 70 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 107 (555)
..+.|+.|+.. ....|.. ...|+|..|.+
T Consensus 17 ~g~~CP~Cg~~--~~~~~~~-------~~~~~C~~C~~ 45 (46)
T PF12760_consen 17 DGFVCPHCGST--KHYRLKT-------RGRYRCKACRK 45 (46)
T ss_pred CCCCCCCCCCe--eeEEeCC-------CCeEECCCCCC
Confidence 34779999875 1111111 45689998875
No 157
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=24.97 E-value=32 Score=21.43 Aligned_cols=10 Identities=30% Similarity=0.949 Sum_probs=5.3
Q ss_pred eecCCCcccc
Q psy11590 384 YQCPHCPSLL 393 (555)
Q Consensus 384 ~~C~~C~~~f 393 (555)
+.|+.||..|
T Consensus 2 r~C~~Cg~~Y 11 (36)
T PF05191_consen 2 RICPKCGRIY 11 (36)
T ss_dssp EEETTTTEEE
T ss_pred cCcCCCCCcc
Confidence 3455555554
No 158
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=24.10 E-value=37 Score=25.93 Aligned_cols=12 Identities=17% Similarity=0.526 Sum_probs=6.7
Q ss_pred CccCCCChhhcC
Q psy11590 438 KFKCDFCEKVYS 449 (555)
Q Consensus 438 ~~~C~~C~~~f~ 449 (555)
|-+|..||+.|.
T Consensus 58 Pa~CkkCGfef~ 69 (97)
T COG3357 58 PARCKKCGFEFR 69 (97)
T ss_pred ChhhcccCcccc
Confidence 455555555554
No 159
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=23.99 E-value=51 Score=34.01 Aligned_cols=27 Identities=22% Similarity=0.438 Sum_probs=16.4
Q ss_pred ccCCCChhhcCChHHHHHHHhhhcCCC
Q psy11590 439 FKCDFCEKVYSHRNRLRYHISTVHENK 465 (555)
Q Consensus 439 ~~C~~C~~~f~~~~~L~~H~~~~H~~~ 465 (555)
+.|+.|.+.|.+...+..|+...|.+.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 456666666666666666666555544
No 160
>TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF. A previously described regulatory effect of HypF mutatation is attributable to loss of activity of a regulatory hydrogenase. A zinc finger-like region CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported the regulatory hypothesis. However, more recent work (PUBMED:11375153) shows the direct effect is on the activity of expressed hydrogenases with nickel/iron centers, rather than on expression.
Probab=22.98 E-value=48 Score=36.70 Aligned_cols=78 Identities=15% Similarity=0.285 Sum_probs=54.7
Q ss_pred ccceechhhHHHHHhhccc------eecccccccccCCCCCCCCcccccccccccCCC-CCccchhhccChhhHHhhhhh
Q psy11590 21 QDITFPCEECKELCVLSKY------CIKHKDCSKAMSTPAPSSESVCTEHSNLFPKCH-SCQKCEESFDNCNNLWSHMFI 93 (555)
Q Consensus 21 ~~~~~~C~~C~~~f~~~~~------l~~h~~C~~~~~~~~~~~~~~~h~~~~~~~~~~-~C~~C~~~f~~~~~l~~H~~~ 93 (555)
...--.|+.|-+.+.+..+ +..|..||-.|+-..++.. . +.++....| .|+.|-+.+.+..+..-|..
T Consensus 65 ppD~a~C~~Cl~E~~dp~~Rry~YpF~nCt~CGPr~~i~~~lpy---d-r~~t~m~~f~~C~~C~~ey~~p~~rr~h~~- 139 (711)
T TIGR00143 65 PADVATCSDCLEEMLDKNDRRYLYPFISCTHCGPRFTIIEALPY---D-RENTSMADFPLCPDCAKEYKDPLDRRFHAQ- 139 (711)
T ss_pred CCchhhHHHHHHHhcCCCcccccCCcccccCCCCCeEEeecCCC---C-CCCcCCCCCcCCHHHHHHhcCCccccCCCC-
Confidence 3344679999998876654 4456899999987654332 1 223444555 49999999998877666653
Q ss_pred hhcCCcccCCCCCCC
Q psy11590 94 KHENSDFVCNLCPPD 108 (555)
Q Consensus 94 h~~~~~~~C~~C~~~ 108 (555)
+..|+.||-.
T Consensus 140 -----~~~C~~Cgp~ 149 (711)
T TIGR00143 140 -----PIACPRCGPQ 149 (711)
T ss_pred -----CccCCCCCcE
Confidence 5789999976
No 161
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=22.96 E-value=45 Score=22.58 Aligned_cols=12 Identities=25% Similarity=0.667 Sum_probs=7.2
Q ss_pred CCeecccccccc
Q psy11590 495 RPFVCKLCGIAY 506 (555)
Q Consensus 495 ~~~~C~~C~~~f 506 (555)
..+.|..||..+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 456666666654
No 162
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=22.65 E-value=36 Score=37.51 Aligned_cols=12 Identities=17% Similarity=0.385 Sum_probs=8.9
Q ss_pred CCcccCCCCCCC
Q psy11590 97 NSDFVCNLCPPD 108 (555)
Q Consensus 97 ~~~~~C~~C~~~ 108 (555)
..|..|+.|+-.
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 457888888765
No 163
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=22.56 E-value=42 Score=22.81 Aligned_cols=30 Identities=17% Similarity=0.302 Sum_probs=16.4
Q ss_pred ceeeCCCcccccccccccccCCCCCCcccc
Q psy11590 273 IMYWCNVCSQRCYLSATCSIHRPSNCKDQY 302 (555)
Q Consensus 273 ~~~~C~~C~~~f~~~~~l~~H~~~~~c~~~ 302 (555)
..|+|+.|...|-.--..-+|...|.|+-|
T Consensus 20 ~~y~C~~C~~~FC~dCD~fiHE~LH~CPGC 49 (51)
T PF07975_consen 20 SRYRCPKCKNHFCIDCDVFIHETLHNCPGC 49 (51)
T ss_dssp EEE--TTTT--B-HHHHHTTTTTS-SSSTT
T ss_pred CeEECCCCCCccccCcChhhhccccCCcCC
Confidence 349999999998777766666666555444
No 164
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=22.18 E-value=19 Score=24.78 Aligned_cols=13 Identities=31% Similarity=0.738 Sum_probs=7.7
Q ss_pred eechhhHHHHHhh
Q psy11590 24 TFPCEECKELCVL 36 (555)
Q Consensus 24 ~~~C~~C~~~f~~ 36 (555)
.|.|+.||..+..
T Consensus 2 ~~~CP~CG~~iev 14 (54)
T TIGR01206 2 QFECPDCGAEIEL 14 (54)
T ss_pred ccCCCCCCCEEec
Confidence 3567777665543
No 165
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.11 E-value=54 Score=34.71 Aligned_cols=47 Identities=28% Similarity=0.722 Sum_probs=0.0
Q ss_pred cccccccccccCHHHHHHHHHhcCCCCccCCCChhhcCChHHHHHHHhhhcCCCccccccccchhcccCchHHHHHHHhc
Q psy11590 412 YVCELCGAYFIHKSSLVAHTTTHSNVKFKCDFCEKVYSHRNRLRYHISTVHENKWKSAACDICGRVFLDSKNMKKHAAVH 491 (555)
Q Consensus 412 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~C~~C~k~f~~~~~L~~H~~~H 491 (555)
..|..||... +|+.|+-..+. |... ....|..||...
T Consensus 214 ~~C~~Cg~~~------------------~C~~C~~~l~~-----------h~~~-~~l~Ch~Cg~~~------------- 250 (505)
T TIGR00595 214 LLCRSCGYIL------------------CCPNCDVSLTY-----------HKKE-GKLRCHYCGYQE------------- 250 (505)
T ss_pred eEhhhCcCcc------------------CCCCCCCceEE-----------ecCC-CeEEcCCCcCcC-------------
Q ss_pred CCCCCeeccccc
Q psy11590 492 STERPFVCKLCG 503 (555)
Q Consensus 492 ~~~~~~~C~~C~ 503 (555)
.-|..|+.|+
T Consensus 251 --~~~~~Cp~C~ 260 (505)
T TIGR00595 251 --PIPKTCPQCG 260 (505)
T ss_pred --CCCCCCCCCC
No 166
>PRK04023 DNA polymerase II large subunit; Validated
Probab=21.88 E-value=80 Score=35.72 Aligned_cols=52 Identities=13% Similarity=0.328 Sum_probs=31.7
Q ss_pred CCCCCccchhhccChhhHHhhhhhhhcCCcccCCCCCCChhhHHHHHHHHHHHHHhhccccccccccccccccccccccC
Q psy11590 70 KCHSCQKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPDSKIMIKYAHLLVRHMKQCHTMQVHIPAVYKNFRSVTSFTMN 149 (555)
Q Consensus 70 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~~~~~f~~~~~l~~H~~~~H~~~~~c~~c~~~~~~~~~~~~~ 149 (555)
....|+.||... ..+.|+.||.. -.....|+.|+..
T Consensus 625 g~RfCpsCG~~t----------------~~frCP~CG~~------------------Te~i~fCP~CG~~---------- 660 (1121)
T PRK04023 625 GRRKCPSCGKET----------------FYRRCPFCGTH------------------TEPVYRCPRCGIE---------- 660 (1121)
T ss_pred cCccCCCCCCcC----------------CcccCCCCCCC------------------CCcceeCccccCc----------
Confidence 345688888773 23678888764 1222367777533
Q ss_pred CCCcccCCCChhhcCCh
Q psy11590 150 NITQYKCPDCPAILQTF 166 (555)
Q Consensus 150 ~~~~~~C~~C~~~f~~~ 166 (555)
..++.|+.||..-...
T Consensus 661 -~~~y~CPKCG~El~~~ 676 (1121)
T PRK04023 661 -VEEDECEKCGREPTPY 676 (1121)
T ss_pred -CCCCcCCCCCCCCCcc
Confidence 2357799998766543
No 167
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=21.24 E-value=56 Score=22.41 Aligned_cols=9 Identities=44% Similarity=1.062 Sum_probs=4.6
Q ss_pred cccchhccc
Q psy11590 471 CDICGRVFL 479 (555)
Q Consensus 471 C~~C~k~f~ 479 (555)
|..|++.|.
T Consensus 5 C~~C~~~F~ 13 (57)
T cd00065 5 CMGCGKPFT 13 (57)
T ss_pred CcccCcccc
Confidence 455555554
No 168
>PF02891 zf-MIZ: MIZ/SP-RING zinc finger; InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=21.10 E-value=21 Score=24.08 Aligned_cols=23 Identities=30% Similarity=0.582 Sum_probs=10.4
Q ss_pred ecccchhccccccceechhhHHH
Q psy11590 10 FDVESCAVSRLQDITFPCEECKE 32 (555)
Q Consensus 10 ~~~~~~~~~~~~~~~~~C~~C~~ 32 (555)
||.++--........|.|++|++
T Consensus 27 FDl~~fl~~~~~~~~W~CPiC~~ 49 (50)
T PF02891_consen 27 FDLESFLESNQRTPKWKCPICNK 49 (50)
T ss_dssp EEHHHHHHHHHHS---B-TTT--
T ss_pred ECHHHHHHHhhccCCeECcCCcC
Confidence 45554444444566799999985
No 169
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=21.06 E-value=50 Score=20.95 Aligned_cols=13 Identities=31% Similarity=0.728 Sum_probs=10.4
Q ss_pred CeecccccccccC
Q psy11590 496 PFVCKLCGIAYKW 508 (555)
Q Consensus 496 ~~~C~~C~~~f~~ 508 (555)
||+|..|+..|=.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 7889999888853
No 170
>COG2879 Uncharacterized small protein [Function unknown]
Probab=21.02 E-value=71 Score=22.59 Aligned_cols=19 Identities=16% Similarity=0.064 Sum_probs=15.1
Q ss_pred cChhhHHHHHHhCCCcccc
Q psy11590 507 KWRKNLVRHQKNCKASAKC 525 (555)
Q Consensus 507 ~~~~~L~~H~~~h~~~~~~ 525 (555)
..-.+...|||.+|+++|.
T Consensus 23 pdYdnYVehmr~~hPd~p~ 41 (65)
T COG2879 23 PDYDNYVEHMRKKHPDKPP 41 (65)
T ss_pred CcHHHHHHHHHHhCcCCCc
Confidence 3456788999999998885
No 171
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=20.83 E-value=31 Score=20.12 Aligned_cols=18 Identities=17% Similarity=0.340 Sum_probs=7.6
Q ss_pred ecccccccccChhhHHHHH
Q psy11590 498 VCKLCGIAYKWRKNLVRHQ 516 (555)
Q Consensus 498 ~C~~C~~~f~~~~~L~~H~ 516 (555)
.|-.|++.| .....+.|.
T Consensus 2 sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 2 SCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EETTTTEEE-EGGGTTT--
T ss_pred eeecCCCCc-CcCCcCCCC
Confidence 455555555 333444443
No 172
>KOG3408|consensus
Probab=20.56 E-value=54 Score=26.62 Aligned_cols=28 Identities=18% Similarity=0.435 Sum_probs=23.0
Q ss_pred cCCceeecCccCccccchhhhhhhhhhc
Q psy11590 199 TEEKKYICEVCGTSFLTWSSLNSHKYSH 226 (555)
Q Consensus 199 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h 226 (555)
.|-..|.|-.|.+-|.+...|..|.++-
T Consensus 53 PG~GqfyCi~CaRyFi~~~~l~~H~ktK 80 (129)
T KOG3408|consen 53 PGGGQFYCIECARYFIDAKALKTHFKTK 80 (129)
T ss_pred CCCceeehhhhhhhhcchHHHHHHHhcc
Confidence 3445699999999999999999998753
No 173
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=20.40 E-value=77 Score=30.30 Aligned_cols=30 Identities=27% Similarity=0.482 Sum_probs=18.5
Q ss_pred ceeecCccCccccchhhhhhhhhhcCCCcccCCCCCccCCCch
Q psy11590 202 KKYICEVCGTSFLTWSSLNSHKYSHDDRKFSCSHCGNSYKNPI 244 (555)
Q Consensus 202 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~ 244 (555)
..|.|+.|...-- ..|..|+.|.-......
T Consensus 307 gGy~CP~CktkVC-------------sLPi~CP~Csl~Lilst 336 (421)
T COG5151 307 GGYECPVCKTKVC-------------SLPISCPICSLQLILST 336 (421)
T ss_pred CceeCCcccceee-------------cCCccCcchhHHHHHHH
Confidence 3488888865432 35778888876544333
Done!