Query psy2973
Match_columns 514
No_of_seqs 587 out of 3187
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 22:18:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2973.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2973hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3608|consensus 99.9 8E-27 1.7E-31 215.7 12.6 219 229-452 140-377 (467)
2 KOG1074|consensus 99.9 5.8E-28 1.3E-32 246.0 4.6 226 253-482 605-931 (958)
3 KOG1074|consensus 99.9 2.1E-28 4.6E-33 249.1 0.3 229 282-511 606-936 (958)
4 KOG3608|consensus 99.9 7.8E-26 1.7E-30 209.2 12.4 216 253-481 134-375 (467)
5 KOG2462|consensus 99.9 1.5E-25 3.2E-30 203.1 5.7 107 367-480 159-265 (279)
6 KOG2462|consensus 99.9 3.1E-25 6.8E-30 201.0 6.1 134 165-302 128-264 (279)
7 KOG3623|consensus 99.8 7.1E-22 1.5E-26 198.3 5.5 108 167-275 210-331 (1007)
8 KOG3623|consensus 99.8 3.1E-20 6.8E-25 186.6 6.4 108 196-303 210-331 (1007)
9 KOG3576|consensus 99.6 7.6E-16 1.6E-20 132.7 2.6 113 193-306 114-237 (267)
10 KOG3576|consensus 99.6 1E-15 2.2E-20 132.0 3.2 118 162-281 112-240 (267)
11 PLN03086 PRLI-interacting fact 99.1 1.4E-10 3E-15 119.1 7.0 116 370-506 434-564 (567)
12 PLN03086 PRLI-interacting fact 99.1 5E-10 1.1E-14 115.1 10.1 107 196-320 407-515 (567)
13 PHA00733 hypothetical protein 98.9 1.3E-09 2.7E-14 91.6 5.3 82 279-360 38-124 (128)
14 PHA00733 hypothetical protein 98.9 2.5E-09 5.3E-14 89.9 4.5 85 396-485 40-126 (128)
15 PHA02768 hypothetical protein; 98.7 7.9E-09 1.7E-13 71.1 1.6 44 427-475 5-48 (55)
16 KOG3993|consensus 98.6 3.4E-09 7.3E-14 101.9 -2.2 111 194-305 265-380 (500)
17 KOG3993|consensus 98.6 9.2E-09 2E-13 99.0 0.7 53 253-305 267-319 (500)
18 PHA02768 hypothetical protein; 98.5 2.5E-08 5.4E-13 68.7 1.2 36 255-292 7-42 (55)
19 PHA00616 hypothetical protein 98.3 2.8E-07 6.1E-12 60.3 1.4 32 483-514 2-33 (44)
20 PF13465 zf-H2C2_2: Zinc-finge 98.2 5.9E-07 1.3E-11 52.7 1.9 24 269-292 2-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.2 8.2E-07 1.8E-11 52.1 2.0 25 442-469 1-25 (26)
22 PF05605 zf-Di19: Drought indu 98.1 4E-06 8.6E-11 59.2 4.6 51 397-451 3-53 (54)
23 PHA00732 hypothetical protein 98.0 4E-06 8.6E-11 63.8 2.7 44 428-480 2-46 (79)
24 PHA00616 hypothetical protein 98.0 3.4E-06 7.3E-11 55.3 1.9 29 427-455 1-29 (44)
25 PHA00732 hypothetical protein 97.9 6E-06 1.3E-10 62.8 2.7 48 397-452 2-49 (79)
26 PF05605 zf-Di19: Drought indu 97.8 3.4E-05 7.3E-10 54.4 4.9 51 336-393 3-53 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.6 3.7E-05 7.9E-10 43.7 2.0 23 428-450 1-23 (23)
28 PF12756 zf-C2H2_2: C2H2 type 97.6 3.8E-05 8.2E-10 62.0 2.4 72 372-450 2-73 (100)
29 PF00096 zf-C2H2: Zinc finger, 97.5 6.7E-05 1.5E-09 42.6 1.7 22 483-504 1-22 (23)
30 KOG1146|consensus 97.3 6.6E-05 1.4E-09 83.0 1.1 77 170-246 439-539 (1406)
31 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00016 3.5E-09 41.3 2.1 23 428-450 1-23 (24)
32 KOG2231|consensus 97.3 0.00047 1E-08 72.5 6.8 54 429-489 184-245 (669)
33 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00014 3E-09 58.6 2.4 73 337-419 1-73 (100)
34 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00015 3.2E-09 41.5 1.7 24 483-506 1-24 (24)
35 PF13912 zf-C2H2_6: C2H2-type 97.3 0.00016 3.4E-09 42.9 1.7 25 427-451 1-25 (27)
36 PF13912 zf-C2H2_6: C2H2-type 96.7 0.0012 2.6E-08 39.0 1.8 23 282-304 2-24 (27)
37 COG5189 SFP1 Putative transcri 96.5 0.0013 2.8E-08 61.8 1.6 70 194-273 347-418 (423)
38 KOG2231|consensus 96.4 0.0086 1.9E-07 63.3 7.7 36 254-302 100-135 (669)
39 PF13909 zf-H2C2_5: C2H2-type 96.4 0.0021 4.6E-08 36.8 1.7 24 483-507 1-24 (24)
40 PF09237 GAGA: GAGA factor; I 96.4 0.0024 5.3E-08 42.8 2.0 30 482-511 24-53 (54)
41 PF09237 GAGA: GAGA factor; I 96.3 0.0022 4.7E-08 43.1 1.6 30 426-455 23-52 (54)
42 COG5189 SFP1 Putative transcri 96.3 0.0011 2.4E-08 62.2 0.1 56 334-389 348-418 (423)
43 smart00355 ZnF_C2H2 zinc finge 96.2 0.0036 7.9E-08 36.1 2.2 23 428-450 1-23 (26)
44 PF12874 zf-met: Zinc-finger o 96.2 0.0032 6.9E-08 36.4 1.6 23 428-450 1-23 (25)
45 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0045 9.8E-08 35.4 1.9 23 428-451 1-23 (24)
46 smart00355 ZnF_C2H2 zinc finge 96.0 0.0046 9.9E-08 35.7 1.9 23 483-505 1-23 (26)
47 KOG2482|consensus 96.0 0.022 4.7E-07 54.4 7.1 71 428-503 280-355 (423)
48 KOG2482|consensus 95.7 0.031 6.8E-07 53.3 6.9 89 348-449 128-217 (423)
49 COG5236 Uncharacterized conser 95.7 0.024 5.3E-07 54.0 6.1 144 196-370 151-316 (493)
50 PF12874 zf-met: Zinc-finger o 95.6 0.0074 1.6E-07 34.8 1.6 23 483-505 1-23 (25)
51 KOG2785|consensus 95.5 0.028 6.1E-07 54.7 6.0 26 366-391 65-90 (390)
52 COG5236 Uncharacterized conser 95.4 0.022 4.8E-07 54.3 4.8 70 430-506 223-305 (493)
53 KOG1146|consensus 95.3 0.0058 1.3E-07 68.2 0.7 134 372-505 439-641 (1406)
54 PF12171 zf-C2H2_jaz: Zinc-fin 94.9 0.0078 1.7E-07 35.5 0.1 22 428-449 2-23 (27)
55 PRK04860 hypothetical protein; 94.5 0.022 4.7E-07 49.8 2.1 37 253-293 119-155 (160)
56 PF12171 zf-C2H2_jaz: Zinc-fin 94.4 0.021 4.5E-07 33.6 1.2 21 484-504 3-23 (27)
57 KOG2785|consensus 94.4 0.061 1.3E-06 52.5 4.9 26 278-303 65-90 (390)
58 PRK04860 hypothetical protein; 94.3 0.018 3.8E-07 50.4 1.0 39 426-471 118-156 (160)
59 COG5048 FOG: Zn-finger [Genera 93.5 0.026 5.6E-07 58.4 0.8 62 196-257 289-355 (467)
60 COG5048 FOG: Zn-finger [Genera 91.8 0.051 1.1E-06 56.2 0.2 61 225-285 289-355 (467)
61 PF13913 zf-C2HC_2: zinc-finge 91.7 0.12 2.6E-06 29.8 1.6 19 460-479 4-22 (25)
62 smart00451 ZnF_U1 U1-like zinc 90.9 0.16 3.5E-06 31.8 1.8 23 427-449 3-25 (35)
63 PF01352 KRAB: KRAB box; Inte 90.6 0.082 1.8E-06 34.6 0.2 20 100-119 18-37 (41)
64 PF13913 zf-C2HC_2: zinc-finge 90.2 0.24 5.3E-06 28.5 1.9 19 429-448 4-22 (25)
65 TIGR00622 ssl1 transcription f 89.4 0.57 1.2E-05 37.9 4.1 23 367-389 13-35 (112)
66 KOG4173|consensus 88.4 0.3 6.5E-06 43.4 2.0 22 371-392 108-129 (253)
67 KOG2893|consensus 88.1 0.13 2.8E-06 46.6 -0.4 47 169-220 12-58 (341)
68 COG4049 Uncharacterized protei 87.6 0.27 5.9E-06 33.7 1.0 30 365-394 13-42 (65)
69 KOG4173|consensus 87.1 0.37 8.1E-06 42.8 1.9 83 368-452 78-171 (253)
70 COG4049 Uncharacterized protei 86.8 0.27 5.9E-06 33.7 0.6 29 164-192 14-42 (65)
71 KOG2893|consensus 86.7 0.18 3.9E-06 45.7 -0.3 41 399-446 13-53 (341)
72 PF12013 DUF3505: Protein of u 86.5 1.1 2.4E-05 36.6 4.3 27 426-452 79-109 (109)
73 TIGR00622 ssl1 transcription f 86.1 1.2 2.6E-05 36.1 4.1 24 309-332 81-104 (112)
74 smart00451 ZnF_U1 U1-like zinc 85.9 0.51 1.1E-05 29.4 1.6 21 483-503 4-24 (35)
75 PF12013 DUF3505: Protein of u 84.2 1.9 4.1E-05 35.2 4.6 25 458-482 80-108 (109)
76 cd00350 rubredoxin_like Rubred 83.1 0.79 1.7E-05 28.4 1.5 10 226-235 2-11 (33)
77 TIGR02098 MJ0042_CXXC MJ0042 f 73.9 0.93 2E-05 29.0 -0.2 12 460-471 4-15 (38)
78 PF10571 UPF0547: Uncharacteri 72.7 1.8 3.8E-05 25.2 0.8 9 484-492 16-24 (26)
79 COG2888 Predicted Zn-ribbon RN 72.5 2.9 6.2E-05 29.4 1.9 8 369-376 50-57 (61)
80 PF09986 DUF2225: Uncharacteri 72.0 1.9 4.1E-05 40.0 1.2 22 224-245 4-25 (214)
81 PRK00398 rpoP DNA-directed RNA 69.8 1 2.3E-05 30.2 -0.7 11 428-438 4-14 (46)
82 smart00531 TFIIE Transcription 69.1 1.9 4.2E-05 37.3 0.6 35 426-468 98-133 (147)
83 PF09723 Zn-ribbon_8: Zinc rib 67.6 0.83 1.8E-05 30.1 -1.5 12 428-439 6-17 (42)
84 smart00834 CxxC_CXXC_SSSS Puta 65.8 1.1 2.4E-05 29.1 -1.2 11 428-438 6-16 (41)
85 PRK00464 nrdR transcriptional 65.4 1.4 3E-05 38.3 -1.0 13 459-471 29-41 (154)
86 PF09986 DUF2225: Uncharacteri 65.4 2.1 4.5E-05 39.7 0.0 22 165-186 3-24 (214)
87 KOG2186|consensus 65.1 3.4 7.3E-05 38.3 1.3 46 254-302 4-49 (276)
88 KOG2186|consensus 64.5 3.8 8.3E-05 38.0 1.5 45 336-389 4-48 (276)
89 PF13717 zinc_ribbon_4: zinc-r 64.0 2.6 5.6E-05 26.7 0.3 32 460-492 4-35 (36)
90 cd00729 rubredoxin_SM Rubredox 63.5 4.9 0.00011 25.0 1.5 8 226-233 3-10 (34)
91 PF09538 FYDLN_acid: Protein o 62.2 5.1 0.00011 32.5 1.7 14 427-440 26-39 (108)
92 TIGR00373 conserved hypothetic 61.9 6 0.00013 34.7 2.3 30 426-467 108-137 (158)
93 PRK06266 transcription initiat 60.2 3.5 7.6E-05 36.9 0.5 31 426-468 116-146 (178)
94 PF13719 zinc_ribbon_5: zinc-r 58.7 3.9 8.4E-05 26.0 0.4 13 460-472 4-16 (37)
95 PRK00464 nrdR transcriptional 56.7 4.4 9.6E-05 35.2 0.5 14 254-267 29-42 (154)
96 PRK14890 putative Zn-ribbon RN 56.6 6.4 0.00014 27.8 1.2 9 369-377 48-56 (59)
97 TIGR02605 CxxC_CxxC_SSSS putat 55.3 6.5 0.00014 27.0 1.1 11 226-236 6-16 (52)
98 smart00734 ZnF_Rad18 Rad18-lik 55.3 9.8 0.00021 22.1 1.7 19 429-448 3-21 (26)
99 smart00659 RPOLCX RNA polymera 54.9 8.8 0.00019 25.5 1.6 10 226-235 3-12 (44)
100 PRK06266 transcription initiat 54.8 5.2 0.00011 35.8 0.7 27 459-490 118-144 (178)
101 PF02892 zf-BED: BED zinc fing 53.8 12 0.00026 24.7 2.2 25 367-391 14-42 (45)
102 COG1592 Rubrerythrin [Energy p 53.6 7.6 0.00016 34.1 1.5 23 225-260 134-156 (166)
103 smart00614 ZnF_BED BED zinc fi 52.9 9.7 0.00021 26.0 1.7 21 428-448 19-44 (50)
104 smart00531 TFIIE Transcription 51.3 12 0.00026 32.4 2.3 15 280-294 98-112 (147)
105 PF14353 CpXC: CpXC protein 51.1 2.5 5.4E-05 35.6 -1.9 25 458-482 38-62 (128)
106 COG1198 PriA Primosomal protei 50.4 11 0.00024 41.4 2.4 8 311-318 437-444 (730)
107 COG1592 Rubrerythrin [Energy p 49.8 10 0.00022 33.3 1.7 13 220-232 144-156 (166)
108 PF02176 zf-TRAF: TRAF-type zi 49.7 20 0.00042 25.3 2.9 27 266-292 23-53 (60)
109 COG1996 RPC10 DNA-directed RNA 48.0 6.8 0.00015 26.6 0.2 10 459-468 7-16 (49)
110 PHA00626 hypothetical protein 47.6 4.9 0.00011 27.9 -0.5 12 369-380 23-34 (59)
111 TIGR00373 conserved hypothetic 47.2 18 0.0004 31.6 2.9 33 277-318 105-137 (158)
112 KOG2807|consensus 47.2 23 0.00051 34.3 3.7 23 367-389 288-310 (378)
113 COG1997 RPL43A Ribosomal prote 46.9 10 0.00022 29.1 1.1 12 427-438 53-64 (89)
114 PF12907 zf-met2: Zinc-binding 46.5 9 0.00019 24.9 0.6 14 209-222 17-30 (40)
115 TIGR02300 FYDLN_acid conserved 45.1 14 0.0003 30.7 1.6 7 283-289 28-34 (129)
116 PF06524 NOA36: NOA36 protein; 45.1 13 0.00029 34.6 1.7 24 459-482 210-233 (314)
117 PF07754 DUF1610: Domain of un 45.0 14 0.00031 21.0 1.2 8 427-434 16-23 (24)
118 PRK09678 DNA-binding transcrip 44.4 8 0.00017 28.8 0.2 13 253-265 27-41 (72)
119 PF15269 zf-C2H2_7: Zinc-finge 43.8 14 0.0003 24.3 1.2 23 458-480 20-42 (54)
120 PF04959 ARS2: Arsenite-resist 43.7 8.7 0.00019 35.3 0.4 26 483-508 78-103 (214)
121 TIGR02300 FYDLN_acid conserved 43.7 15 0.00033 30.4 1.7 15 426-440 25-39 (129)
122 PRK09678 DNA-binding transcrip 42.0 9.3 0.0002 28.4 0.2 17 224-240 26-44 (72)
123 PF05443 ROS_MUCR: ROS/MUCR tr 42.0 12 0.00026 31.6 0.8 24 483-509 73-96 (132)
124 KOG2272|consensus 41.5 11 0.00023 35.0 0.5 16 481-496 279-294 (332)
125 KOG2807|consensus 40.9 40 0.00086 32.7 4.2 34 252-299 275-308 (378)
126 PF15269 zf-C2H2_7: Zinc-finge 40.5 18 0.00038 23.8 1.3 22 483-504 21-42 (54)
127 PF12907 zf-met2: Zinc-binding 39.8 22 0.00047 23.1 1.6 12 410-421 18-29 (40)
128 COG2331 Uncharacterized protei 39.4 9.2 0.0002 28.4 -0.1 9 428-436 13-21 (82)
129 PF13240 zinc_ribbon_2: zinc-r 38.1 14 0.0003 20.8 0.4 7 461-467 2-8 (23)
130 cd00730 rubredoxin Rubredoxin; 37.3 11 0.00023 25.9 -0.1 11 428-438 2-12 (50)
131 PRK04023 DNA polymerase II lar 37.3 28 0.00062 39.2 3.0 24 459-490 1038-1061(1121)
132 PF08274 PhnA_Zn_Ribbon: PhnA 37.0 15 0.00034 22.1 0.6 10 252-261 18-27 (30)
133 PF05443 ROS_MUCR: ROS/MUCR tr 36.8 20 0.00043 30.3 1.4 25 428-455 73-97 (132)
134 PRK04023 DNA polymerase II lar 36.3 32 0.0007 38.8 3.2 9 282-290 627-635 (1121)
135 PF03604 DNA_RNApol_7kD: DNA d 36.1 18 0.00039 22.2 0.8 11 368-378 16-26 (32)
136 PF06524 NOA36: NOA36 protein; 36.0 18 0.00038 33.8 1.1 40 193-233 139-179 (314)
137 PF04959 ARS2: Arsenite-resist 35.7 26 0.00057 32.3 2.1 26 335-360 77-102 (214)
138 PF05290 Baculo_IE-1: Baculovi 34.5 26 0.00056 29.3 1.7 12 224-235 79-90 (140)
139 COG5151 SSL1 RNA polymerase II 33.5 23 0.00049 34.0 1.4 19 369-387 322-340 (421)
140 PF00301 Rubredoxin: Rubredoxi 33.1 48 0.001 22.4 2.5 13 226-238 2-14 (47)
141 COG5151 SSL1 RNA polymerase II 32.5 38 0.00083 32.5 2.7 13 334-346 321-333 (421)
142 KOG3362|consensus 31.6 19 0.00041 30.4 0.5 21 483-503 130-150 (156)
143 KOG4167|consensus 30.3 15 0.00032 39.5 -0.4 26 427-452 792-817 (907)
144 KOG1280|consensus 29.3 48 0.001 32.5 2.8 25 397-421 80-104 (381)
145 KOG2907|consensus 29.1 31 0.00067 27.9 1.3 11 483-493 103-113 (116)
146 KOG1842|consensus 28.1 31 0.00068 34.9 1.4 29 396-424 15-43 (505)
147 PF04606 Ogr_Delta: Ogr/Delta- 28.0 16 0.00034 24.6 -0.5 34 461-494 2-39 (47)
148 KOG4167|consensus 27.8 17 0.00037 39.0 -0.5 25 483-507 793-817 (907)
149 TIGR00595 priA primosomal prot 27.5 39 0.00084 35.8 2.1 45 312-403 216-260 (505)
150 KOG2593|consensus 27.4 52 0.0011 33.4 2.8 16 427-442 128-143 (436)
151 PF10013 DUF2256: Uncharacteri 26.6 27 0.00058 22.8 0.4 13 461-473 11-23 (42)
152 COG1198 PriA Primosomal protei 25.9 43 0.00094 37.0 2.1 10 197-206 436-445 (730)
153 TIGR01206 lysW lysine biosynth 25.9 19 0.00041 25.2 -0.4 10 428-437 3-12 (54)
154 PF12760 Zn_Tnp_IS1595: Transp 25.6 97 0.0021 20.6 3.0 11 251-261 35-45 (46)
155 PF01096 TFIIS_C: Transcriptio 25.1 20 0.00043 23.1 -0.4 10 254-263 29-38 (39)
156 COG3364 Zn-ribbon containing p 25.0 44 0.00096 26.4 1.4 12 428-439 3-14 (112)
157 COG4888 Uncharacterized Zn rib 24.4 11 0.00024 29.6 -1.9 14 426-439 21-34 (104)
158 PRK03824 hypA hydrogenase nick 23.9 45 0.00098 28.3 1.4 16 194-209 68-83 (135)
159 KOG1280|consensus 23.8 63 0.0014 31.7 2.5 38 367-404 77-117 (381)
160 KOG0782|consensus 22.9 21 0.00045 37.2 -0.9 51 442-496 240-290 (1004)
161 PF08209 Sgf11: Sgf11 (transcr 22.5 49 0.0011 20.5 1.0 24 483-507 5-28 (33)
162 KOG3408|consensus 22.4 55 0.0012 26.9 1.6 24 426-449 56-79 (129)
163 PF04780 DUF629: Protein of un 22.3 64 0.0014 33.5 2.4 29 195-223 56-84 (466)
164 PRK14714 DNA polymerase II lar 21.7 77 0.0017 36.9 3.1 9 397-405 693-701 (1337)
165 PF13821 DUF4187: Domain of un 21.7 81 0.0018 22.1 2.1 26 474-499 19-44 (55)
166 PF13451 zf-trcl: Probable zin 21.6 77 0.0017 21.6 1.9 13 196-208 4-16 (49)
167 PF09845 DUF2072: Zn-ribbon co 21.5 55 0.0012 27.4 1.4 15 427-441 1-15 (131)
168 KOG2272|consensus 21.1 48 0.001 30.8 1.1 12 370-381 222-233 (332)
169 PRK14714 DNA polymerase II lar 20.9 84 0.0018 36.7 3.1 8 310-317 668-675 (1337)
170 KOG2593|consensus 20.7 70 0.0015 32.5 2.3 35 224-261 127-161 (436)
171 PF05191 ADK_lid: Adenylate ki 20.2 92 0.002 19.6 2.0 8 228-235 4-11 (36)
172 PF13453 zf-TFIIB: Transcripti 20.2 75 0.0016 20.5 1.6 14 283-296 21-34 (41)
173 PF08271 TF_Zn_Ribbon: TFIIB z 20.2 62 0.0013 21.1 1.2 10 253-262 19-28 (43)
174 COG1655 Uncharacterized protei 20.2 37 0.0008 31.2 0.2 23 225-247 19-41 (267)
No 1
>KOG3608|consensus
Probab=99.94 E-value=8e-27 Score=215.74 Aligned_cols=219 Identities=25% Similarity=0.520 Sum_probs=154.3
Q ss_pred ccccccccChhhhhhhhhhccC------------C-cccccCC--CcccccccccHHHHHHHhcCCCcccccccccccCC
Q psy2973 229 EVCGKSFSNNSTLKVHSTTHAD------------Q-GRFKCPI--CGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLY 293 (514)
Q Consensus 229 ~~C~~~f~~~~~L~~H~~~h~~------------~-~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~ 293 (514)
..|+..|.+...|.+|...|.. + ..+.|.. |-+.|.+++.|+.|.+.|++++...|+.||..|.+
T Consensus 140 edCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~ 219 (467)
T KOG3608|consen 140 EDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRT 219 (467)
T ss_pred hhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhcc
Confidence 4566666666666666655521 1 1245543 77777777777777777777777777777777777
Q ss_pred hHHHHHhhhhcc--cCCcccCCcCCCcCCCHHHHHHHHHhcCCccccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcc
Q psy2973 294 KKSLKHHLFVHS--SEFTSKCSVCDRLYKSDYLRRKHERSHSSIFQCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQ 371 (514)
Q Consensus 294 ~~~L~~H~~~H~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~ 371 (514)
+..|..|++.-+ ...+|.|..|.+.|.+...|..|+..|-.-|+|+.|+...+..++|.+|++..|.. .+||+
T Consensus 220 ~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~-----dkpfK 294 (467)
T KOG3608|consen 220 KTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSK-----DKPFK 294 (467)
T ss_pred ccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhcc-----CCCcc
Confidence 777777776543 34567777777777777777777777777777777777777777777777777764 66777
Q ss_pred cCcCCcccCChHHHHHHHHhhcCCCccccc--cccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHHHHh
Q psy2973 372 CNDCGSTFRLKRHLNSHINRLHKLKLNCVH--CQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHMKV 449 (514)
Q Consensus 372 C~~C~~~f~~~~~L~~H~~~~h~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~~~ 449 (514)
|..|+..|.+.+.|.+|+..+....|+|.. |..+|.+...|..|++.+|.|.+..+|.|..|++.|++..+|..|++.
T Consensus 295 Cd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~k 374 (467)
T KOG3608|consen 295 CDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMK 374 (467)
T ss_pred ccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHH
Confidence 777777777777777777633334677766 777777777777777777766666677777777777777777777765
Q ss_pred hcC
Q psy2973 450 HEK 452 (514)
Q Consensus 450 H~~ 452 (514)
.++
T Consensus 375 kH~ 377 (467)
T KOG3608|consen 375 KHG 377 (467)
T ss_pred hhc
Confidence 554
No 2
>KOG1074|consensus
Probab=99.94 E-value=5.8e-28 Score=246.01 Aligned_cols=226 Identities=20% Similarity=0.403 Sum_probs=168.1
Q ss_pred ccccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhhhcccC----CcccCC---cCCCcCCCHHHH
Q psy2973 253 RFKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLFVHSSE----FTSKCS---VCDRLYKSDYLR 325 (514)
Q Consensus 253 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~----~~~~C~---~C~~~f~~~~~l 325 (514)
|-+|-+|.++..-++.|+.|.++|+|++||+|.+||+.|.++.+|+.||-.|... .++.|+ +|-+.|...-.|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 3689999999999999999999999999999999999999999999999998654 468899 999999999999
Q ss_pred HHHHHhcCC---------------ccccCccccccCCHHHHHHHHHHhcCCCC-----------CCC----CCCcccCcC
Q psy2973 326 RKHERSHSS---------------IFQCRICDKILSNIQGIRRHMENMHNSDK-----------DNT----ERPFQCNDC 375 (514)
Q Consensus 326 ~~H~~~h~~---------------~~~C~~C~~~f~~~~~l~~H~~~~H~~~~-----------~~~----~~~~~C~~C 375 (514)
..|+++|.. .-+|..|.+.|.....+..++ ..|.+.. ... ..+..+..|
T Consensus 685 pQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~-se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQI-SEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred cceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhh-hccCCcccCCcccccccccccccccCCCcccccc
Confidence 999999872 147999999999999999888 4442110 111 226788889
Q ss_pred CcccCChHHHHHHHHhh----------------------cC--CCc-cccccccccCChHHHHHHHH-------------
Q psy2973 376 GSTFRLKRHLNSHINRL----------------------HK--LKL-NCVHCQAIFTNKQRLNYHIL------------- 417 (514)
Q Consensus 376 ~~~f~~~~~L~~H~~~~----------------------h~--~~~-~C~~C~~~f~~~~~L~~H~~------------- 417 (514)
+..+.....+..+-... +. .+. .+.+++-.-...-.+.--..
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence 98886544433321100 00 123 56666544332211110000
Q ss_pred ----hhcC----------------------CCCCCcccccccccccCChHHHHHHHHhhcCCCCcccccccccccccCCh
Q psy2973 418 ----TKHS----------------------GPNYKAWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQYKWE 471 (514)
Q Consensus 418 ----~~H~----------------------~~~~k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~ 471 (514)
..+. .+......|.+|++.|.+.++|..|+|+|++++| |.|.+|++.|..+
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KP---F~C~fC~~aFttr 920 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKP---FFCHFCEEAFTTR 920 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCC---ccchhhhhhhhhh
Confidence 0000 0001226799999999999999999999999999 8999999999999
Q ss_pred hhHHHHhcccc
Q psy2973 472 SFLQKHKKICT 482 (514)
Q Consensus 472 ~~L~~H~~~h~ 482 (514)
..|+.||.+|.
T Consensus 921 gnLKvHMgtH~ 931 (958)
T KOG1074|consen 921 GNLKVHMGTHM 931 (958)
T ss_pred hhhhhhhcccc
Confidence 99999999987
No 3
>KOG1074|consensus
Probab=99.94 E-value=2.1e-28 Score=249.15 Aligned_cols=229 Identities=21% Similarity=0.346 Sum_probs=164.9
Q ss_pred ccccccccccCChHHHHHhhhhcccCCcccCCcCCCcCCCHHHHHHHHHhcC------CccccC---ccccccCCHHHHH
Q psy2973 282 YTCTYCEKSFLYKKSLKHHLFVHSSEFTSKCSVCDRLYKSDYLRRKHERSHS------SIFQCR---ICDKILSNIQGIR 352 (514)
Q Consensus 282 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~------~~~~C~---~C~~~f~~~~~l~ 352 (514)
-+|-+|.+....++.|+.|.++|+|++||+|.+||++|.++.+|+.|+-+|. ..+.|+ +|-+.|.+.-.|.
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lp 685 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLP 685 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccccc
Confidence 5799999999999999999999999999999999999999999999999886 258899 9999999999999
Q ss_pred HHHHHhcC-CCCCCC----C---CCcccCcCCcccCChHHHHHHHHhh-------c--------C--C----Cccccccc
Q psy2973 353 RHMENMHN-SDKDNT----E---RPFQCNDCGSTFRLKRHLNSHINRL-------H--------K--L----KLNCVHCQ 403 (514)
Q Consensus 353 ~H~~~~H~-~~~~~~----~---~~~~C~~C~~~f~~~~~L~~H~~~~-------h--------~--~----~~~C~~C~ 403 (514)
.|+ .+|. +....+ + ..-+|..|.+.|.....+..++..+ + . . +..+..|+
T Consensus 686 QhI-riH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~ 764 (958)
T KOG1074|consen 686 QHI-RIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCG 764 (958)
T ss_pred ceE-EeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccc
Confidence 999 5565 333322 2 2357888999998777777776533 1 0 0 34678888
Q ss_pred cccCChHHHHHHHHhh--cC------------------CCCCCcc-cccccccccCChHHHHHHHH--------------
Q psy2973 404 AIFTNKQRLNYHILTK--HS------------------GPNYKAW-KCEKCDMRFVTNKNLKRHMK-------------- 448 (514)
Q Consensus 404 ~~f~~~~~L~~H~~~~--H~------------------~~~~k~~-~C~~C~~~F~~~~~L~~H~~-------------- 448 (514)
..+.....+..+-.+. +. ..+.++. .+..++..-...-.+.-=+.
T Consensus 765 ~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~ 844 (958)
T KOG1074|consen 765 RELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATK 844 (958)
T ss_pred cccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccccc
Confidence 8877666655543211 00 0112445 56665543332221110000
Q ss_pred ----hhcCCC-------------------Cc--ccccccccccccCChhhHHHHhcccc----cccccccccccCHHHHH
Q psy2973 449 ----VHEKNR-------------------NM--LDSSCNECGKQYKWESFLQKHKKICT----TTCKSCGKRYSSIYVLR 499 (514)
Q Consensus 449 ----~H~~~~-------------------~~--~~~~C~~C~~~f~~~~~L~~H~~~h~----~~C~~C~~~f~~~~~l~ 499 (514)
++.+.. .. ....|..||+.|.+.++|..|+|+|+ |.|.+|++.|..+..|+
T Consensus 845 ~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLK 924 (958)
T KOG1074|consen 845 TNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLK 924 (958)
T ss_pred cccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhh
Confidence 000000 00 13579999999999999999999998 89999999999999999
Q ss_pred HhhhhhhCCCCC
Q psy2973 500 NHVKSCYGQQKN 511 (514)
Q Consensus 500 ~H~~~h~ge~~~ 511 (514)
.||.+|++..|+
T Consensus 925 vHMgtH~w~q~~ 936 (958)
T KOG1074|consen 925 VHMGTHMWVQPP 936 (958)
T ss_pred hhhccccccCCC
Confidence 999999998875
No 4
>KOG3608|consensus
Probab=99.93 E-value=7.8e-26 Score=209.24 Aligned_cols=216 Identities=24% Similarity=0.459 Sum_probs=188.8
Q ss_pred ccccC--CCcccccccccHHHHHHHhcC------------C-Ccccccc--cccccCChHHHHHhhhhcccCCcccCCcC
Q psy2973 253 RFKCP--ICGKHIRHKQTFDDHMNMHSG------------D-VKYTCTY--CEKSFLYKKSLKHHLFVHSSEFTSKCSVC 315 (514)
Q Consensus 253 ~~~C~--~C~~~f~~~~~L~~H~~~h~~------------~-~~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~C~~C 315 (514)
.|.|. .|+..|.+...|..|+-.|.- + ..+.|.. |.+.|.++..|+.|++.|++++...|+.|
T Consensus 134 ~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~C 213 (467)
T KOG3608|consen 134 NFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHC 213 (467)
T ss_pred hhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchH
Confidence 36774 599999999999999987742 2 2356765 99999999999999999999999999999
Q ss_pred CCcCCCHHHHHHHHHhcC----CccccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcccCcCCcccCChHHHHHHHHh
Q psy2973 316 DRLYKSDYLRRKHERSHS----SIFQCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQCNDCGSTFRLKRHLNSHINR 391 (514)
Q Consensus 316 ~~~f~~~~~l~~H~~~h~----~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 391 (514)
|.-|.++..|-.|.+..+ .+|+|..|.+.|.+...|..|+.++- .-|+|+.|+.+....+.|..|++.
T Consensus 214 g~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--------n~ykCplCdmtc~~~ssL~~H~r~ 285 (467)
T KOG3608|consen 214 GELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--------NCYKCPLCDMTCSSASSLTTHIRY 285 (467)
T ss_pred HHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--------hcccccccccCCCChHHHHHHHHh
Confidence 999999999999988765 58999999999999999999995543 359999999999999999999998
Q ss_pred hcC--CCccccccccccCChHHHHHHHHhhcCCCCCCcccccc--cccccCChHHHHHHHHhhc-CCCCccccccccccc
Q psy2973 392 LHK--LKLNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEK--CDMRFVTNKNLKRHMKVHE-KNRNMLDSSCNECGK 466 (514)
Q Consensus 392 ~h~--~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~--C~~~F~~~~~L~~H~~~H~-~~~~~~~~~C~~C~~ 466 (514)
.|. .||+|+.|++.|.+.+.|..|+. .|+. ..|.|.. |..+|.+..+|++|++.++ |+.| +.|.|-.|.+
T Consensus 286 rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS~---~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np-~~Y~CH~Cdr 360 (467)
T KOG3608|consen 286 RHSKDKPFKCDECDTRCVRESDLAKHVQ-VHSK---TVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNP-ILYACHCCDR 360 (467)
T ss_pred hhccCCCccccchhhhhccHHHHHHHHH-hccc---cceecCCCCCcHHHHHHHHHHHHHHHhccCCCC-Cceeeecchh
Confidence 887 59999999999999999999998 6773 5699988 9999999999999999766 5555 5699999999
Q ss_pred ccCChhhHHHHhccc
Q psy2973 467 QYKWESFLQKHKKIC 481 (514)
Q Consensus 467 ~f~~~~~L~~H~~~h 481 (514)
.|++..+|..|++..
T Consensus 361 ~ft~G~~L~~HL~kk 375 (467)
T KOG3608|consen 361 FFTSGKSLSAHLMKK 375 (467)
T ss_pred hhccchhHHHHHHHh
Confidence 999999999996543
No 5
>KOG2462|consensus
Probab=99.92 E-value=1.5e-25 Score=203.14 Aligned_cols=107 Identities=32% Similarity=0.695 Sum_probs=100.0
Q ss_pred CCCcccCcCCcccCChHHHHHHHHhhcCCCccccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHH
Q psy2973 367 ERPFQCNDCGSTFRLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRH 446 (514)
Q Consensus 367 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H 446 (514)
.+.+.|++||+.|.+...|..|++ .|+.+..|.+||+.|...-.|+-|++ .|+|+ |||.|+.|+++|.++++|+.|
T Consensus 159 ~ka~~C~~C~K~YvSmpALkMHir-TH~l~c~C~iCGKaFSRPWLLQGHiR-THTGE--KPF~C~hC~kAFADRSNLRAH 234 (279)
T KOG2462|consen 159 KKAFSCKYCGKVYVSMPALKMHIR-THTLPCECGICGKAFSRPWLLQGHIR-THTGE--KPFSCPHCGKAFADRSNLRAH 234 (279)
T ss_pred cccccCCCCCceeeehHHHhhHhh-ccCCCcccccccccccchHHhhcccc-cccCC--CCccCCcccchhcchHHHHHH
Confidence 677999999999999999999998 88899999999999999999999998 79998 999999999999999999999
Q ss_pred HHhhcCCCCcccccccccccccCChhhHHHHhcc
Q psy2973 447 MKVHEKNRNMLDSSCNECGKQYKWESFLQKHKKI 480 (514)
Q Consensus 447 ~~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 480 (514)
|+||.+.++ |+|..|+|+|...+.|.+|...
T Consensus 235 mQTHS~~K~---~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 235 MQTHSDVKK---HQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred HHhhcCCcc---ccCcchhhHHHHHHHHHHhhhh
Confidence 999999997 9999999999999999999765
No 6
>KOG2462|consensus
Probab=99.91 E-value=3.1e-25 Score=201.03 Aligned_cols=134 Identities=33% Similarity=0.652 Sum_probs=84.9
Q ss_pred CCccccCcccccccChHHHHhhHhcccCC---CCceeccCCCcccccHHHHHHHHHHhhcCCCCeecccccccccChhhh
Q psy2973 165 ENKFKCQFCPMWFEQRGKRRKHVNGVHLK---IKRFKCEVCPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGKSFSNNSTL 241 (514)
Q Consensus 165 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~---~k~~~C~~C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~~f~~~~~L 241 (514)
...|+|+.||+.+.+.+.|.+|.+ .|.. .+.+.|++|++.|.+...|+.|+ ++|+ -++.|.+||+.|+....|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ-~H~~~~s~ka~~C~~C~K~YvSmpALkMHi-rTH~--l~c~C~iCGKaFSRPWLL 203 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQ-THRSLDSKKAFSCKYCGKVYVSMPALKMHI-RTHT--LPCECGICGKAFSRPWLL 203 (279)
T ss_pred CCceeccccccccccccccchhhc-ccccccccccccCCCCCceeeehHHHhhHh-hccC--CCcccccccccccchHHh
Confidence 345677777777777777777765 4532 34566666666666666666665 4554 356666666666666666
Q ss_pred hhhhhhccCCcccccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhh
Q psy2973 242 KVHSTTHADQGRFKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLF 302 (514)
Q Consensus 242 ~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 302 (514)
+-|+|+|+|++||.|+.|+++|.++++|+.||++|.+.++|+|..|+|+|...+.|.+|..
T Consensus 204 QGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 204 QGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred hcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 6666666666666666666666666666666666666666666666666666666666654
No 7
>KOG3623|consensus
Probab=99.84 E-value=7.1e-22 Score=198.31 Aligned_cols=108 Identities=28% Similarity=0.653 Sum_probs=88.9
Q ss_pred ccccCcccccccChHHHHhhHhcccC-CCCceeccCCCcccccHHHHHHHHHHhhcC-------------CCCeeccccc
Q psy2973 167 KFKCQFCPMWFEQRGKRRKHVNGVHL-KIKRFKCEVCPMKFHRQVSLKKHVEAVHLK-------------LKPCKCEVCG 232 (514)
Q Consensus 167 ~~~C~~C~~~f~~~~~l~~H~~~~h~-~~k~~~C~~C~~~f~~~~~l~~H~~~~H~~-------------~k~~~C~~C~ 232 (514)
...|++|++.+.....|+.|+...|. .+..|.|..|.++|..+..|.+|| ..|.. .+.|+|.+||
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm-~~hkpg~dqa~sltqsa~lRKFKCtECg 288 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHM-QLHKPGGDQAISLTQSALLRKFKCTECG 288 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHH-HhhcCCCcccccccchhhhccccccccc
Confidence 46799999999988899999876664 356688999999999999999998 45532 2568888888
Q ss_pred ccccChhhhhhhhhhccCCcccccCCCcccccccccHHHHHHH
Q psy2973 233 KSFSNNSTLKVHSTTHADQGRFKCPICGKHIRHKQTFDDHMNM 275 (514)
Q Consensus 233 ~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 275 (514)
+.|..+..|+.|+|+|.|++||.|+.|+|.|.....+..||..
T Consensus 289 KAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 289 KAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 8888888888888888888888888888888888888888753
No 8
>KOG3623|consensus
Probab=99.80 E-value=3.1e-20 Score=186.60 Aligned_cols=108 Identities=30% Similarity=0.637 Sum_probs=94.1
Q ss_pred ceeccCCCcccccHHHHHHHHHHhhcC-CCCeecccccccccChhhhhhhhhhccC-------------CcccccCCCcc
Q psy2973 196 RFKCEVCPMKFHRQVSLKKHVEAVHLK-LKPCKCEVCGKSFSNNSTLKVHSTTHAD-------------QGRFKCPICGK 261 (514)
Q Consensus 196 ~~~C~~C~~~f~~~~~l~~H~~~~H~~-~k~~~C~~C~~~f~~~~~L~~H~~~h~~-------------~~~~~C~~C~~ 261 (514)
...|++|++.+.....|+.|+...|.. +..|.|..|.++|.++..|.+||.+|.. .+.|+|.+|||
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence 368999999999999999998777754 5568999999999999999999998843 34589999999
Q ss_pred cccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhhh
Q psy2973 262 HIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLFV 303 (514)
Q Consensus 262 ~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 303 (514)
+|..+..|+.|+|+|.|++||.|+.|++.|....++..||..
T Consensus 290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 999999999999999999999999999999998888888864
No 9
>KOG3576|consensus
Probab=99.57 E-value=7.6e-16 Score=132.72 Aligned_cols=113 Identities=29% Similarity=0.577 Sum_probs=78.1
Q ss_pred CCCceeccCCCcccccHHHHHHHHHHhhcCCCCeecccccccccChhhhhhhhhhccCCcccccCCCcccccccccHHHH
Q psy2973 193 KIKRFKCEVCPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGKSFSNNSTLKVHSTTHADQGRFKCPICGKHIRHKQTFDDH 272 (514)
Q Consensus 193 ~~k~~~C~~C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H 272 (514)
+...|.|.+|++.|.-..-|.+|+ ..|...+.|-|..||+.|...-+|++|+++|++.+||+|..|+++|..+-.|..|
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~-kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHL-KCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHh-hhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 345678888888888888888887 6777777777888888887777788888888777778777777777777777777
Q ss_pred HHHhcCC-----------CcccccccccccCChHHHHHhhhhccc
Q psy2973 273 MNMHSGD-----------VKYTCTYCEKSFLYKKSLKHHLFVHSS 306 (514)
Q Consensus 273 ~~~h~~~-----------~~~~C~~C~~~f~~~~~L~~H~~~H~~ 306 (514)
++.-+|. +.|.|..||.+-.....+..|+..|+.
T Consensus 193 l~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 193 LKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred HHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 7655442 334555555554444444455444443
No 10
>KOG3576|consensus
Probab=99.57 E-value=1e-15 Score=131.97 Aligned_cols=118 Identities=25% Similarity=0.546 Sum_probs=93.2
Q ss_pred ccCCCccccCcccccccChHHHHhhHhcccCCCCceeccCCCcccccHHHHHHHHHHhhcCCCCeecccccccccChhhh
Q psy2973 162 SFGENKFKCQFCPMWFEQRGKRRKHVNGVHLKIKRFKCEVCPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGKSFSNNSTL 241 (514)
Q Consensus 162 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~~f~~~~~L 241 (514)
..+...|.|.+|++.|.-...|.+|++ .|...+.+.|..|++.|.+..+|++|+ +.|+|.+||+|..|+++|..+-.|
T Consensus 112 ssd~d~ftCrvCgK~F~lQRmlnrh~k-ch~~vkr~lct~cgkgfndtfdlkrh~-rthtgvrpykc~~c~kaftqrcsl 189 (267)
T KOG3576|consen 112 SSDQDSFTCRVCGKKFGLQRMLNRHLK-CHSDVKRHLCTFCGKGFNDTFDLKRHT-RTHTGVRPYKCSLCEKAFTQRCSL 189 (267)
T ss_pred CCCCCeeeeehhhhhhhHHHHHHHHhh-hccHHHHHHHhhccCcccchhhhhhhh-ccccCccccchhhhhHHHHhhccH
Confidence 335567888888888888888888887 688888888888888888888888887 678888888888888888888888
Q ss_pred hhhhhhccC-----------CcccccCCCcccccccccHHHHHHHhcCCCc
Q psy2973 242 KVHSTTHAD-----------QGRFKCPICGKHIRHKQTFDDHMNMHSGDVK 281 (514)
Q Consensus 242 ~~H~~~h~~-----------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 281 (514)
..|++.-++ .+.|.|..||.+-.....+..|++.|+...|
T Consensus 190 eshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 190 ESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 888754333 3558888888888888888888888876544
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.09 E-value=1.4e-10 Score=119.10 Aligned_cols=116 Identities=21% Similarity=0.488 Sum_probs=71.6
Q ss_pred cccCc--CCcccCChHHHHHHHHhhcCCCccccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHHH
Q psy2973 370 FQCND--CGSTFRLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHM 447 (514)
Q Consensus 370 ~~C~~--C~~~f~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~ 447 (514)
..|+. ||..| .+..|..| +.|+.|++.|. ...|..|+...| ++|.|+ |++.+ .+..|..|+
T Consensus 434 V~Cp~~~Cg~v~-~r~el~~H--------~~C~~Cgk~f~-~s~LekH~~~~H-----kpv~Cp-Cg~~~-~R~~L~~H~ 496 (567)
T PLN03086 434 VVCPHDGCGIVL-RVEEAKNH--------VHCEKCGQAFQ-QGEMEKHMKVFH-----EPLQCP-CGVVL-EKEQMVQHQ 496 (567)
T ss_pred eeCCccccccee-eccccccC--------ccCCCCCCccc-hHHHHHHHHhcC-----CCccCC-CCCCc-chhHHHhhh
Confidence 34553 66666 23333444 24777776664 455677766543 457777 77544 556777777
Q ss_pred HhhcCCCCcccccccccccccCC----------hhhHHHHhcccc---cccccccccccCHHHHHHhhhhhh
Q psy2973 448 KVHEKNRNMLDSSCNECGKQYKW----------ESFLQKHKKICT---TTCKSCGKRYSSIYVLRNHVKSCY 506 (514)
Q Consensus 448 ~~H~~~~~~~~~~C~~C~~~f~~----------~~~L~~H~~~h~---~~C~~C~~~f~~~~~l~~H~~~h~ 506 (514)
.+|.+.++ +.|..|++.|.. .+.|..|...+. +.|..||+.|..+. +..|+...|
T Consensus 497 ~thCp~Kp---i~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG~rt~~C~~Cgk~Vrlrd-m~~H~~~~h 564 (567)
T PLN03086 497 ASTCPLRL---ITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICGSRTAPCDSCGRSVMLKE-MDIHQIAVH 564 (567)
T ss_pred hccCCCCc---eeCCCCCCccccCccccchhhhhhhHHHHHHhcCCcceEccccCCeeeehh-HHHHHHHhh
Confidence 77777666 677777777741 346777776654 67777777776554 666766554
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.07 E-value=5e-10 Score=115.08 Aligned_cols=107 Identities=20% Similarity=0.424 Sum_probs=73.4
Q ss_pred ceeccCCCcccccHHHHHHHHHHhhcCCCCeeccc--ccccccChhhhhhhhhhccCCcccccCCCcccccccccHHHHH
Q psy2973 196 RFKCEVCPMKFHRQVSLKKHVEAVHLKLKPCKCEV--CGKSFSNNSTLKVHSTTHADQGRFKCPICGKHIRHKQTFDDHM 273 (514)
Q Consensus 196 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~k~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 273 (514)
.-.|+.|...... ..|..|. .... -..-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|+
T Consensus 407 ~V~C~NC~~~i~l-~~l~lHe-~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 407 TVECRNCKHYIPS-RSIALHE-AYCS-RHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred eEECCCCCCccch-hHHHHHH-hhCC-CcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHH
Confidence 3568888876653 4455674 2222 12345763 888773 3444444 47888888885 56788888
Q ss_pred HHhcCCCcccccccccccCChHHHHHhhhhcccCCcccCCcCCCcCC
Q psy2973 274 NMHSGDVKYTCTYCEKSFLYKKSLKHHLFVHSSEFTSKCSVCDRLYK 320 (514)
Q Consensus 274 ~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~ 320 (514)
.+++ .++.|+ ||+.+ .+..|..|+.+|.+.+++.|+.|++.|.
T Consensus 473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~ 515 (567)
T PLN03086 473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQ 515 (567)
T ss_pred HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccc
Confidence 8875 678888 88654 5678888888888888888888888774
No 13
>PHA00733 hypothetical protein
Probab=98.93 E-value=1.3e-09 Score=91.64 Aligned_cols=82 Identities=17% Similarity=0.268 Sum_probs=49.3
Q ss_pred CCcccccccccccCChHHHHHh--h---hhcccCCcccCCcCCCcCCCHHHHHHHHHhcCCccccCccccccCCHHHHHH
Q psy2973 279 DVKYTCTYCEKSFLYKKSLKHH--L---FVHSSEFTSKCSVCDRLYKSDYLRRKHERSHSSIFQCRICDKILSNIQGIRR 353 (514)
Q Consensus 279 ~~~~~C~~C~~~f~~~~~L~~H--~---~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~l~~ 353 (514)
.+++.|.+|.+.|.....|..| + ..+++.+||.|+.|++.|.+...|..|++.+..+|.|+.|++.|.....|.+
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~~~~~C~~CgK~F~~~~sL~~ 117 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTLD 117 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCCcCccCCCCCCccCCHHHHHH
Confidence 3455666666555555444443 1 1223456677777777777777777776665555666666666666666666
Q ss_pred HHHHhcC
Q psy2973 354 HMENMHN 360 (514)
Q Consensus 354 H~~~~H~ 360 (514)
|+...|+
T Consensus 118 H~~~~h~ 124 (128)
T PHA00733 118 HVCKKHN 124 (128)
T ss_pred HHHHhcC
Confidence 6666654
No 14
>PHA00733 hypothetical protein
Probab=98.85 E-value=2.5e-09 Score=89.88 Aligned_cols=85 Identities=21% Similarity=0.398 Sum_probs=48.5
Q ss_pred CccccccccccCChHHHHHH--HHhhcCCCCCCcccccccccccCChHHHHHHHHhhcCCCCcccccccccccccCChhh
Q psy2973 396 KLNCVHCQAIFTNKQRLNYH--ILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQYKWESF 473 (514)
Q Consensus 396 ~~~C~~C~~~f~~~~~L~~H--~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~ 473 (514)
++.|.+|...|.+...|..| +..+......+||.|+.|++.|.+.+.|..|++.|. .+ |.|+.|+++|.+...
T Consensus 40 ~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~--~~---~~C~~CgK~F~~~~s 114 (128)
T PHA00733 40 RLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE--HS---KVCPVCGKEFRNTDS 114 (128)
T ss_pred hHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC--cC---ccCCCCCCccCCHHH
Confidence 56677777666666665555 211111111256666666666666666666666552 22 566666666666666
Q ss_pred HHHHhccccccc
Q psy2973 474 LQKHKKICTTTC 485 (514)
Q Consensus 474 L~~H~~~h~~~C 485 (514)
|..|+...+..|
T Consensus 115 L~~H~~~~h~~~ 126 (128)
T PHA00733 115 TLDHVCKKHNIC 126 (128)
T ss_pred HHHHHHHhcCcc
Confidence 666665554443
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.67 E-value=7.9e-09 Score=71.13 Aligned_cols=44 Identities=16% Similarity=0.409 Sum_probs=35.1
Q ss_pred cccccccccccCChHHHHHHHHhhcCCCCcccccccccccccCChhhHH
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQYKWESFLQ 475 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~L~ 475 (514)
.|+|+.||+.|...++|..|+++|+ ++ |+|..|++.|...+.|+
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~---~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TN---LKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--Cc---ccCCcccceecccceeE
Confidence 4788888888888888888888887 34 78888888888776654
No 16
>KOG3993|consensus
Probab=98.61 E-value=3.4e-09 Score=101.93 Aligned_cols=111 Identities=21% Similarity=0.418 Sum_probs=66.3
Q ss_pred CCceeccCCCcccccHHHHHHHHHHhhcCCCCeecccccccccChhhhhhhhhhccCCccccc---CCCcccccccccHH
Q psy2973 194 IKRFKCEVCPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGKSFSNNSTLKVHSTTHADQGRFKC---PICGKHIRHKQTFD 270 (514)
Q Consensus 194 ~k~~~C~~C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C---~~C~~~f~~~~~L~ 270 (514)
...|.|..|...|.+...|.+|. -.-...-.|+|++|+|.|+-..+|..|.|.|....--.= +-=.+...+....+
T Consensus 265 iGdyiCqLCK~kYeD~F~LAQHr-C~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ 343 (500)
T KOG3993|consen 265 IGDYICQLCKEKYEDAFALAQHR-CPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQ 343 (500)
T ss_pred HHHHHHHHHHHhhhhHHHHhhcc-CCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence 34588999999999999999884 222222358888888888888888888888843211000 00000000000000
Q ss_pred HHHHH--hcCCCcccccccccccCChHHHHHhhhhcc
Q psy2973 271 DHMNM--HSGDVKYTCTYCEKSFLYKKSLKHHLFVHS 305 (514)
Q Consensus 271 ~H~~~--h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~ 305 (514)
.--+. -..+.-|.|..|++.|++...|+.|+.+|.
T Consensus 344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq 380 (500)
T KOG3993|consen 344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQ 380 (500)
T ss_pred hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhh
Confidence 00000 012235888888888888888888887775
No 17
>KOG3993|consensus
Probab=98.61 E-value=9.2e-09 Score=98.96 Aligned_cols=53 Identities=23% Similarity=0.470 Sum_probs=31.4
Q ss_pred ccccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhhhcc
Q psy2973 253 RFKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLFVHS 305 (514)
Q Consensus 253 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~ 305 (514)
.|.|..|...|.+.-.|.+|.-..---.-|+|++|+|.|+...+|..|.+.|.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK 319 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK 319 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence 36666666666666666666422111123666666666666666666666664
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.54 E-value=2.5e-08 Score=68.69 Aligned_cols=36 Identities=25% Similarity=0.660 Sum_probs=14.3
Q ss_pred ccCCCcccccccccHHHHHHHhcCCCcccccccccccC
Q psy2973 255 KCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFL 292 (514)
Q Consensus 255 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 292 (514)
.|+.||+.|...+.|..||++|+ ++|.|..|++.|.
T Consensus 7 ~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~ 42 (55)
T PHA02768 7 ECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISL 42 (55)
T ss_pred CcchhCCeeccHHHHHHHHHhcC--CcccCCcccceec
Confidence 34444444444444444444433 2334444444333
No 19
>PHA00616 hypothetical protein
Probab=98.29 E-value=2.8e-07 Score=60.31 Aligned_cols=32 Identities=22% Similarity=0.276 Sum_probs=30.2
Q ss_pred cccccccccccCHHHHHHhhhhhhCCCCCCCC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYGQQKNLSH 514 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~ge~~~~c~ 514 (514)
|+|+.||+.|..++.|+.|+++|||++|+.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 78999999999999999999999999999874
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.24 E-value=5.9e-07 Score=52.75 Aligned_cols=24 Identities=46% Similarity=1.029 Sum_probs=13.5
Q ss_pred HHHHHHHhcCCCcccccccccccC
Q psy2973 269 FDDHMNMHSGDVKYTCTYCEKSFL 292 (514)
Q Consensus 269 L~~H~~~h~~~~~~~C~~C~~~f~ 292 (514)
|..|+++|++++||.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555554
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.20 E-value=8.2e-07 Score=52.14 Aligned_cols=25 Identities=40% Similarity=0.913 Sum_probs=18.1
Q ss_pred HHHHHHHhhcCCCCcccccccccccccC
Q psy2973 442 NLKRHMKVHEKNRNMLDSSCNECGKQYK 469 (514)
Q Consensus 442 ~L~~H~~~H~~~~~~~~~~C~~C~~~f~ 469 (514)
+|..||++|++++| |.|+.|+++|.
T Consensus 1 ~l~~H~~~H~~~k~---~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKP---YKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSS---EEESSSSEEES
T ss_pred CHHHHhhhcCCCCC---CCCCCCcCeeC
Confidence 46777777777777 67777777775
No 22
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.13 E-value=4e-06 Score=59.16 Aligned_cols=51 Identities=25% Similarity=0.444 Sum_probs=33.4
Q ss_pred ccccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHHHHhhc
Q psy2973 397 LNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHMKVHE 451 (514)
Q Consensus 397 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~~~H~ 451 (514)
|.|++|++ ..+...|..|+...|..+. +.+.|++|...+. .+|..|+..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~-~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSES-KNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCC-CCccCCCchhhhh--hHHHHHHHHhc
Confidence 56677776 4445667777666776643 4577888877544 37777777665
No 23
>PHA00732 hypothetical protein
Probab=98.00 E-value=4e-06 Score=63.76 Aligned_cols=44 Identities=34% Similarity=0.623 Sum_probs=21.0
Q ss_pred ccccccccccCChHHHHHHHHh-hcCCCCcccccccccccccCChhhHHHHhcc
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKV-HEKNRNMLDSSCNECGKQYKWESFLQKHKKI 480 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~-H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 480 (514)
|.|+.|++.|.+.++|..|++. |.+ +.|+.||++|. .|..|.++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~~------~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHTL------TKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccCC------CccCCCCCEeC---Chhhhhcc
Confidence 4455555555555555555543 321 34555555554 24444433
No 24
>PHA00616 hypothetical protein
Probab=97.99 E-value=3.4e-06 Score=55.30 Aligned_cols=29 Identities=21% Similarity=0.474 Sum_probs=23.7
Q ss_pred cccccccccccCChHHHHHHHHhhcCCCC
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKVHEKNRN 455 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~H~~~~~ 455 (514)
||+|+.||+.|...++|..|++.|+++++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~ 29 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNK 29 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCc
Confidence 57888888888888888888888888777
No 25
>PHA00732 hypothetical protein
Probab=97.94 E-value=6e-06 Score=62.79 Aligned_cols=48 Identities=21% Similarity=0.405 Sum_probs=26.2
Q ss_pred ccccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHHHHhhcC
Q psy2973 397 LNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHMKVHEK 452 (514)
Q Consensus 397 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~~~H~~ 452 (514)
|.|..|++.|.+...|..|++..|. ++.|+.|++.|. .|..|++++..
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~-----~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT-----LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC-----CCccCCCCCEeC---ChhhhhcccCC
Confidence 4555555555555555555543332 235666666665 46666655443
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.83 E-value=3.4e-05 Score=54.38 Aligned_cols=51 Identities=29% Similarity=0.553 Sum_probs=29.6
Q ss_pred cccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcccCcCCcccCChHHHHHHHHhhc
Q psy2973 336 FQCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQCNDCGSTFRLKRHLNSHINRLH 393 (514)
Q Consensus 336 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h 393 (514)
|.|+.|++ ..+...|..|....|..+ .+.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~----~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSE----SKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCC----CCCccCCCchhhhh--hHHHHHHHHhc
Confidence 56666666 444566666666666542 33566777765433 36666665444
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.62 E-value=3.7e-05 Score=43.73 Aligned_cols=23 Identities=48% Similarity=0.974 Sum_probs=13.5
Q ss_pred ccccccccccCChHHHHHHHHhh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVH 450 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H 450 (514)
|.|+.|++.|.+...|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45666666666666666666553
No 28
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.59 E-value=3.8e-05 Score=61.99 Aligned_cols=72 Identities=22% Similarity=0.476 Sum_probs=18.3
Q ss_pred cCcCCcccCChHHHHHHHHhhcCCCccccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHHHHhh
Q psy2973 372 CNDCGSTFRLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRHMKVH 450 (514)
Q Consensus 372 C~~C~~~f~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H~~~H 450 (514)
|..|+..|.+...|..|+...|+.... ....+.....+..+++..- . ..+.|.+|++.|.+...|..||+.+
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~---~~~~l~~~~~~~~~~~~~~--~--~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP---DQKYLVDPNRLLNYLRKKV--K--ESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred ccccccccccccccccccccccccccc---ccccccccccccccccccc--C--CCCCCCccCCCCcCHHHHHHHHcCc
Confidence 556666666666666666555543221 1111223333333333111 0 2477888888888888888888765
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.46 E-value=6.7e-05 Score=42.62 Aligned_cols=22 Identities=36% Similarity=0.739 Sum_probs=13.6
Q ss_pred cccccccccccCHHHHHHhhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKS 504 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~ 504 (514)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666655
No 30
>KOG1146|consensus
Probab=97.34 E-value=6.6e-05 Score=82.95 Aligned_cols=77 Identities=22% Similarity=0.380 Sum_probs=57.5
Q ss_pred cCcccccccChHHHHhhHhcccCCCCceeccCCCcccccHHHHHHHHHHhhcCC------------------------CC
Q psy2973 170 CQFCPMWFEQRGKRRKHVNGVHLKIKRFKCEVCPMKFHRQVSLKKHVEAVHLKL------------------------KP 225 (514)
Q Consensus 170 C~~C~~~f~~~~~l~~H~~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~~H~~~------------------------k~ 225 (514)
|..|+..+.....+..|+..+|...+.|.|+.|+..|.....|-.||+..|... ++
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p 518 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP 518 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence 566777777777777787777877888999999999999999999997766432 45
Q ss_pred eecccccccccChhhhhhhhh
Q psy2973 226 CKCEVCGKSFSNNSTLKVHST 246 (514)
Q Consensus 226 ~~C~~C~~~f~~~~~L~~H~~ 246 (514)
|.|..|..++.....|.+|+.
T Consensus 519 ~~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 519 YPCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred ccceeeeeeeecchHHHHHHH
Confidence 666666666666666666653
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.31 E-value=0.00016 Score=41.34 Aligned_cols=23 Identities=30% Similarity=0.862 Sum_probs=13.1
Q ss_pred ccccccccccCChHHHHHHHHhh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVH 450 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H 450 (514)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45666666666666666666655
No 32
>KOG2231|consensus
Probab=97.30 E-value=0.00047 Score=72.48 Aligned_cols=54 Identities=26% Similarity=0.571 Sum_probs=42.4
Q ss_pred cccccccccCChHHHHHHHHhhcCCCCccccccccc------ccccCChhhHHHHhcccccccc--ccc
Q psy2973 429 KCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNEC------GKQYKWESFLQKHKKICTTTCK--SCG 489 (514)
Q Consensus 429 ~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C------~~~f~~~~~L~~H~~~h~~~C~--~C~ 489 (514)
.|..|...|.....|.+|++.++ |.|-+| +.-|.....|..|-|..+|.|. .|.
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h-------~~chfC~~~~~~neyy~~~~dLe~HfR~~HflCE~~~C~ 245 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH-------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHFLCEEEFCR 245 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce-------eheeecCcccccchhcccchHHHHHhhhcCccccccccc
Confidence 68889889999999999988776 556666 3556777888999888888877 565
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.30 E-value=0.00014 Score=58.60 Aligned_cols=73 Identities=23% Similarity=0.430 Sum_probs=17.4
Q ss_pred ccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcccCcCCcccCChHHHHHHHHhhcCCCccccccccccCChHHHHHHH
Q psy2973 337 QCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQCNDCGSTFRLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHI 416 (514)
Q Consensus 337 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~ 416 (514)
+|..|+..|.+...|..|+...|..... ....+.....+..+++..-...+.|.+|++.|.+...|..||
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----------~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm 70 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----------DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHM 70 (100)
T ss_dssp ----------------------------------------------------------SSEEBSSSS-EESSHHHHHHHH
T ss_pred Cccccccccccccccccccccccccccc----------cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHH
Confidence 4677777777777777777766653211 111122333344444322223577777777777777777777
Q ss_pred Hhh
Q psy2973 417 LTK 419 (514)
Q Consensus 417 ~~~ 419 (514)
+..
T Consensus 71 ~~~ 73 (100)
T PF12756_consen 71 RSK 73 (100)
T ss_dssp HHT
T ss_pred cCc
Confidence 753
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.28 E-value=0.00015 Score=41.51 Aligned_cols=24 Identities=33% Similarity=0.652 Sum_probs=16.4
Q ss_pred cccccccccccCHHHHHHhhhhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCY 506 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~ 506 (514)
|.|++|++.|.+...|+.|+++||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 467777777777777777777765
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.27 E-value=0.00016 Score=42.88 Aligned_cols=25 Identities=24% Similarity=0.683 Sum_probs=16.8
Q ss_pred cccccccccccCChHHHHHHHHhhc
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKVHE 451 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~H~ 451 (514)
||+|..|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777766654
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.66 E-value=0.0012 Score=39.04 Aligned_cols=23 Identities=30% Similarity=0.639 Sum_probs=13.1
Q ss_pred ccccccccccCChHHHHHhhhhc
Q psy2973 282 YTCTYCEKSFLYKKSLKHHLFVH 304 (514)
Q Consensus 282 ~~C~~C~~~f~~~~~L~~H~~~H 304 (514)
|.|..|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 55555555555555555555544
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.45 E-value=0.0013 Score=61.76 Aligned_cols=70 Identities=24% Similarity=0.414 Sum_probs=36.9
Q ss_pred CCceeccC--CCcccccHHHHHHHHHHhhcCCCCeecccccccccChhhhhhhhhhccCCcccccCCCcccccccccHHH
Q psy2973 194 IKRFKCEV--CPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGKSFSNNSTLKVHSTTHADQGRFKCPICGKHIRHKQTFDD 271 (514)
Q Consensus 194 ~k~~~C~~--C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~ 271 (514)
+|||+|++ |++.+.....|+-|+.--|-..+...=+ .-..|...-...+||.|++|+|.+.+...|+-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence 47888865 8888888888888864333211111000 01111111234456666666666666665555
Q ss_pred HH
Q psy2973 272 HM 273 (514)
Q Consensus 272 H~ 273 (514)
|.
T Consensus 417 Hr 418 (423)
T COG5189 417 HR 418 (423)
T ss_pred cc
Confidence 53
No 38
>KOG2231|consensus
Probab=96.43 E-value=0.0086 Score=63.25 Aligned_cols=36 Identities=31% Similarity=0.615 Sum_probs=20.8
Q ss_pred cccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhh
Q psy2973 254 FKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLF 302 (514)
Q Consensus 254 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 302 (514)
+.|.+|+..|.... ..-.|..| -.|.+...|+.|+.
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~ 135 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMR 135 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHH
Confidence 56777766664222 11256666 66666667777764
No 39
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.40 E-value=0.0021 Score=36.76 Aligned_cols=24 Identities=25% Similarity=0.353 Sum_probs=18.1
Q ss_pred cccccccccccCHHHHHHhhhhhhC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYG 507 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~g 507 (514)
|+|+.|++... ...|..|++.|||
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 57888888887 8888888888775
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.35 E-value=0.0024 Score=42.84 Aligned_cols=30 Identities=23% Similarity=0.345 Sum_probs=22.7
Q ss_pred ccccccccccccCHHHHHHhhhhhhCCCCC
Q psy2973 482 TTTCKSCGKRYSSIYVLRNHVKSCYGQQKN 511 (514)
Q Consensus 482 ~~~C~~C~~~f~~~~~l~~H~~~h~ge~~~ 511 (514)
+..|++|+..+.+...|+.|+.++|+.||-
T Consensus 24 PatCP~C~a~~~~srnLrRHle~~H~~k~~ 53 (54)
T PF09237_consen 24 PATCPICGAVIRQSRNLRRHLEIRHFKKPG 53 (54)
T ss_dssp -EE-TTT--EESSHHHHHHHHHHHTTTS--
T ss_pred CCCCCcchhhccchhhHHHHHHHHhcccCC
Confidence 378999999999999999999999998873
No 41
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.31 E-value=0.0022 Score=43.07 Aligned_cols=30 Identities=20% Similarity=0.526 Sum_probs=21.6
Q ss_pred CcccccccccccCChHHHHHHHHhhcCCCC
Q psy2973 426 KAWKCEKCDMRFVTNKNLKRHMKVHEKNRN 455 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~ 455 (514)
.|-.|++|+..+.+..+|++|+.++|+.+|
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 788999999999999999999988888776
No 42
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.29 E-value=0.0011 Score=62.21 Aligned_cols=56 Identities=25% Similarity=0.637 Sum_probs=35.1
Q ss_pred CccccCc--cccccCCHHHHHHHHHHhcCCCCC-------------CCCCCcccCcCCcccCChHHHHHHH
Q psy2973 334 SIFQCRI--CDKILSNIQGIRRHMENMHNSDKD-------------NTERPFQCNDCGSTFRLKRHLNSHI 389 (514)
Q Consensus 334 ~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~-------------~~~~~~~C~~C~~~f~~~~~L~~H~ 389 (514)
+||+|++ |++.+.+...|+.|+..-|..... ...+||.|+.|++.+.....|+-|.
T Consensus 348 KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 348 KPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred ceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 5788876 888888888888888766633211 1235555555555555555555553
No 43
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.25 E-value=0.0036 Score=36.15 Aligned_cols=23 Identities=35% Similarity=0.886 Sum_probs=13.6
Q ss_pred ccccccccccCChHHHHHHHHhh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVH 450 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H 450 (514)
|.|+.|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 35666666666666666666544
No 44
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.16 E-value=0.0032 Score=36.36 Aligned_cols=23 Identities=22% Similarity=0.798 Sum_probs=14.9
Q ss_pred ccccccccccCChHHHHHHHHhh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVH 450 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H 450 (514)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 45666666666666666666543
No 45
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.04 E-value=0.0045 Score=35.35 Aligned_cols=23 Identities=43% Similarity=0.873 Sum_probs=12.9
Q ss_pred ccccccccccCChHHHHHHHHhhc
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVHE 451 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H~ 451 (514)
|+|+.|+.... ...|..|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666665 666666666654
No 46
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.03 E-value=0.0046 Score=35.71 Aligned_cols=23 Identities=30% Similarity=0.723 Sum_probs=13.6
Q ss_pred cccccccccccCHHHHHHhhhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSC 505 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h 505 (514)
|.|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34566666666666666666544
No 47
>KOG2482|consensus
Probab=95.98 E-value=0.022 Score=54.35 Aligned_cols=71 Identities=15% Similarity=0.162 Sum_probs=37.8
Q ss_pred ccccccccccCChHHHHHHHHhhcC-C--CCcccccccccccccCChhhHHHHhccc--ccccccccccccCHHHHHHhh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVHEK-N--RNMLDSSCNECGKQYKWESFLQKHKKIC--TTTCKSCGKRYSSIYVLRNHV 502 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H~~-~--~~~~~~~C~~C~~~f~~~~~L~~H~~~h--~~~C~~C~~~f~~~~~l~~H~ 502 (514)
..|-.|.........|..||++-|. + +....| +.-|.+.-.+..-.|.. .-.|-.|+-+|.....|..||
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~-----~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm 354 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDY-----SLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHM 354 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhcccc-----ccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhc
Confidence 5788888887788888888885443 1 000000 11122222222222221 135666666677766666666
Q ss_pred h
Q psy2973 503 K 503 (514)
Q Consensus 503 ~ 503 (514)
.
T Consensus 355 ~ 355 (423)
T KOG2482|consen 355 V 355 (423)
T ss_pred c
Confidence 4
No 48
>KOG2482|consensus
Probab=95.70 E-value=0.031 Score=53.31 Aligned_cols=89 Identities=18% Similarity=0.328 Sum_probs=52.9
Q ss_pred HHHHHHHHHHhcCCCCCCCCCCcccCcCCccc-CChHHHHHHHHhhcCCCccccccccccCChHHHHHHHHhhcCCCCCC
Q psy2973 348 IQGIRRHMENMHNSDKDNTERPFQCNDCGSTF-RLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHILTKHSGPNYK 426 (514)
Q Consensus 348 ~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f-~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~k 426 (514)
...|.+|++.. . ......+|-.|...+ -.++....|+-..|....-= .........|..|++. ...
T Consensus 128 ~eaLeqqQ~Er-e----dt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGl---pDniVyvnelLehLke-----kL~ 194 (423)
T KOG2482|consen 128 KEALEQQQKER-E----DTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGL---PDNIVYVNELLEHLKE-----KLE 194 (423)
T ss_pred HHHHHHHHHHh-c----CCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCC---CcceeeHHHHHHHHHH-----HHh
Confidence 45566666443 2 124557899998765 46778888888777521000 0111223445666653 223
Q ss_pred cccccccccccCChHHHHHHHHh
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKV 449 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~ 449 (514)
.+.|-.|.+.|..+..|+.||+.
T Consensus 195 r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHh
Confidence 46788888888888888888774
No 49
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.68 E-value=0.024 Score=53.99 Aligned_cols=144 Identities=25% Similarity=0.434 Sum_probs=76.4
Q ss_pred ceeccC--CCcccccHHHHHHHHHHhhcCCCCeecccccc---cccChhhhhhhhhhccCCcccccCCCcccccccccHH
Q psy2973 196 RFKCEV--CPMKFHRQVSLKKHVEAVHLKLKPCKCEVCGK---SFSNNSTLKVHSTTHADQGRFKCPICGKHIRHKQTFD 270 (514)
Q Consensus 196 ~~~C~~--C~~~f~~~~~l~~H~~~~H~~~k~~~C~~C~~---~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 270 (514)
.|.|+. |..+...+..|+.|....|. .+.|.+|-. .|... -..-+...|.
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~---~~~C~~C~~nKk~F~~E----------------------~~lF~~~~Lr 205 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG---FVLCSECIGNKKDFWNE----------------------IRLFRSSTLR 205 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC---cEEhHhhhcCcccCccc----------------------eeeeeccccc
Confidence 366754 66666667778888766553 356666632 22111 1111223333
Q ss_pred HHHHHhcCCCc----ccccccccccCChHHHHHhhhhcccCCcccCCcCCC-------cCCCHHHHHHHHHhcCCccccC
Q psy2973 271 DHMNMHSGDVK----YTCTYCEKSFLYKKSLKHHLFVHSSEFTSKCSVCDR-------LYKSDYLRRKHERSHSSIFQCR 339 (514)
Q Consensus 271 ~H~~~h~~~~~----~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~C~ 339 (514)
.|...-..+.. -.|..|...|.+-..|..|++.-+. .|-+|++ -|.+-.+|..|.+..+ |.|.
T Consensus 206 ~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h--y~ct 279 (493)
T COG5236 206 DHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH--YCCT 279 (493)
T ss_pred ccccCCccccCcCCCchhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc--eEEE
Confidence 33332211111 2566777777777777777664332 2444442 3556666777766544 3332
Q ss_pred c--c----ccccCCHHHHHHHHHHhcCCCCCCCCCCc
Q psy2973 340 I--C----DKILSNIQGIRRHMENMHNSDKDNTERPF 370 (514)
Q Consensus 340 ~--C----~~~f~~~~~l~~H~~~~H~~~~~~~~~~~ 370 (514)
. | -.+|.....|..|+...|.......++++
T Consensus 280 ~qtc~~~k~~vf~~~~el~~h~~~~h~~~~~~~~~~~ 316 (493)
T COG5236 280 FQTCRVGKCYVFPYHTELLEHLTRFHKVNARLSEIPR 316 (493)
T ss_pred EEEEecCcEEEeccHHHHHHHHHHHhhcccccCcCCC
Confidence 1 2 14678888888888888765444334333
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.60 E-value=0.0074 Score=34.79 Aligned_cols=23 Identities=39% Similarity=0.648 Sum_probs=15.6
Q ss_pred cccccccccccCHHHHHHhhhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSC 505 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h 505 (514)
|.|..|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 45677777777777777777653
No 51
>KOG2785|consensus
Probab=95.50 E-value=0.028 Score=54.74 Aligned_cols=26 Identities=19% Similarity=0.528 Sum_probs=21.5
Q ss_pred CCCCcccCcCCcccCChHHHHHHHHh
Q psy2973 366 TERPFQCNDCGSTFRLKRHLNSHINR 391 (514)
Q Consensus 366 ~~~~~~C~~C~~~f~~~~~L~~H~~~ 391 (514)
.+.++.|..|.+.|.+......|+..
T Consensus 65 ~~~~~~c~~c~k~~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 65 AESVVYCEACNKSFASPKAHENHLKS 90 (390)
T ss_pred cccceehHHhhccccChhhHHHHHHH
Confidence 45678999999999998888888764
No 52
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.40 E-value=0.022 Score=54.30 Aligned_cols=70 Identities=21% Similarity=0.471 Sum_probs=37.6
Q ss_pred ccccccccCChHHHHHHHHhhcCCCCccccccccccc-------ccCChhhHHHHhcccccccc--ccc----ccccCHH
Q psy2973 430 CEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGK-------QYKWESFLQKHKKICTTTCK--SCG----KRYSSIY 496 (514)
Q Consensus 430 C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~-------~f~~~~~L~~H~~~h~~~C~--~C~----~~f~~~~ 496 (514)
|..|...|-+-..|..|+|.-|. .|-+|++ -|.+-..|..|.+.-+|.|. .|- ..|....
T Consensus 223 C~FC~~~FYdDDEL~~HcR~~HE-------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~ct~qtc~~~k~~vf~~~~ 295 (493)
T COG5236 223 CIFCKIYFYDDDELRRHCRLRHE-------ACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCCTFQTCRVGKCYVFPYHT 295 (493)
T ss_pred hhhccceecChHHHHHHHHhhhh-------hhhhhhccCccchhhhhCHHHHHHHhhcCceEEEEEEEecCcEEEeccHH
Confidence 55566556666666666554332 2444432 25555566666665555543 232 3566666
Q ss_pred HHHHhhhhhh
Q psy2973 497 VLRNHVKSCY 506 (514)
Q Consensus 497 ~l~~H~~~h~ 506 (514)
.|..|+..-|
T Consensus 296 el~~h~~~~h 305 (493)
T COG5236 296 ELLEHLTRFH 305 (493)
T ss_pred HHHHHHHHHh
Confidence 6777765444
No 53
>KOG1146|consensus
Probab=95.29 E-value=0.0058 Score=68.20 Aligned_cols=134 Identities=21% Similarity=0.299 Sum_probs=85.1
Q ss_pred cCcCCcccCChHHHHHHHHhhcC--CCccccccccccCChHHHHHHHHhhcCC----------------------CCCCc
Q psy2973 372 CNDCGSTFRLKRHLNSHINRLHK--LKLNCVHCQAIFTNKQRLNYHILTKHSG----------------------PNYKA 427 (514)
Q Consensus 372 C~~C~~~f~~~~~L~~H~~~~h~--~~~~C~~C~~~f~~~~~L~~H~~~~H~~----------------------~~~k~ 427 (514)
|.-|+..+.....+..|+...+. ..|.|+.|+..|.....|..||+..|.. ..+++
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p 518 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP 518 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence 33344444455555555544444 3567777777777777777777765541 12478
Q ss_pred ccccccccccCChHHHHHHHHh--hcC--------------------------------------CCCcccccccccccc
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKV--HEK--------------------------------------NRNMLDSSCNECGKQ 467 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~--H~~--------------------------------------~~~~~~~~C~~C~~~ 467 (514)
|.|..|..++....+|..||.. |.. .+++..+.|..|+..
T Consensus 519 ~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~ye 598 (1406)
T KOG1146|consen 519 YPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYE 598 (1406)
T ss_pred ccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcch
Confidence 9999999999999999999873 311 112234678888888
Q ss_pred cCChhhHHHHhcccc-----cccccccccccCHHHHHHhhhhh
Q psy2973 468 YKWESFLQKHKKICT-----TTCKSCGKRYSSIYVLRNHVKSC 505 (514)
Q Consensus 468 f~~~~~L~~H~~~h~-----~~C~~C~~~f~~~~~l~~H~~~h 505 (514)
.+-..+|..|+..-+ --|-.|.-.+.....+..+-+-+
T Consensus 599 tniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 599 TNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred hhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCCCC
Confidence 777777888876432 34555666666555555555544
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.86 E-value=0.0078 Score=35.50 Aligned_cols=22 Identities=41% Similarity=0.899 Sum_probs=12.7
Q ss_pred ccccccccccCChHHHHHHHHh
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKV 449 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~ 449 (514)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4556666666666666555543
No 55
>PRK04860 hypothetical protein; Provisional
Probab=94.54 E-value=0.022 Score=49.81 Aligned_cols=37 Identities=24% Similarity=0.793 Sum_probs=24.0
Q ss_pred ccccCCCcccccccccHHHHHHHhcCCCcccccccccccCC
Q psy2973 253 RFKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLY 293 (514)
Q Consensus 253 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~ 293 (514)
+|.|. |+. ....+..|.++|.++++|.|..|+..|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence 56666 665 55566667777766666777767666543
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.43 E-value=0.021 Score=33.64 Aligned_cols=21 Identities=43% Similarity=0.770 Sum_probs=11.3
Q ss_pred ccccccccccCHHHHHHhhhh
Q psy2973 484 TCKSCGKRYSSIYVLRNHVKS 504 (514)
Q Consensus 484 ~C~~C~~~f~~~~~l~~H~~~ 504 (514)
.|..|++.|.+...|..|+++
T Consensus 3 ~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 3 YCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp BBTTTTBBBSSHHHHHCCTTS
T ss_pred CcccCCCCcCCHHHHHHHHcc
Confidence 455555555555555555543
No 57
>KOG2785|consensus
Probab=94.37 E-value=0.061 Score=52.46 Aligned_cols=26 Identities=31% Similarity=0.492 Sum_probs=20.0
Q ss_pred CCCcccccccccccCChHHHHHhhhh
Q psy2973 278 GDVKYTCTYCEKSFLYKKSLKHHLFV 303 (514)
Q Consensus 278 ~~~~~~C~~C~~~f~~~~~L~~H~~~ 303 (514)
++-++.|..|.+.|.+......|+..
T Consensus 65 ~~~~~~c~~c~k~~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 65 AESVVYCEACNKSFASPKAHENHLKS 90 (390)
T ss_pred cccceehHHhhccccChhhHHHHHHH
Confidence 34567888888888888888888763
No 58
>PRK04860 hypothetical protein; Provisional
Probab=94.30 E-value=0.018 Score=50.39 Aligned_cols=39 Identities=21% Similarity=0.622 Sum_probs=32.1
Q ss_pred CcccccccccccCChHHHHHHHHhhcCCCCcccccccccccccCCh
Q psy2973 426 KAWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQYKWE 471 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~ 471 (514)
-+|.|. |+. ....+.+|.++|.++++ |.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~---YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAV---YRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCcc---EECCCCCceeEEe
Confidence 368998 887 77788999999999887 8899998887644
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.54 E-value=0.026 Score=58.38 Aligned_cols=62 Identities=34% Similarity=0.574 Sum_probs=35.1
Q ss_pred ceeccCCCcccccHHHHHHHHHH-hhcCC--CCeecc--cccccccChhhhhhhhhhccCCcccccC
Q psy2973 196 RFKCEVCPMKFHRQVSLKKHVEA-VHLKL--KPCKCE--VCGKSFSNNSTLKVHSTTHADQGRFKCP 257 (514)
Q Consensus 196 ~~~C~~C~~~f~~~~~l~~H~~~-~H~~~--k~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~ 257 (514)
++.|..|...|.....|..|... .|.++ +++.|+ .|++.|.....+..|...|.+..++.+.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEK 355 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccc
Confidence 45556666666666666666522 55555 556665 4666666666666666655555554443
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=91.85 E-value=0.051 Score=56.17 Aligned_cols=61 Identities=30% Similarity=0.588 Sum_probs=34.2
Q ss_pred CeecccccccccChhhhhhhhh--hccCC--cccccC--CCcccccccccHHHHHHHhcCCCccccc
Q psy2973 225 PCKCEVCGKSFSNNSTLKVHST--THADQ--GRFKCP--ICGKHIRHKQTFDDHMNMHSGDVKYTCT 285 (514)
Q Consensus 225 ~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~ 285 (514)
++.|..|...|.....|..|.+ .|.++ .++.|+ .|++.|.....+..|...|.+..++.+.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEK 355 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccc
Confidence 4555555555555555555555 45555 555555 4555555555555555555555444444
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.70 E-value=0.12 Score=29.78 Aligned_cols=19 Identities=37% Similarity=0.845 Sum_probs=10.0
Q ss_pred cccccccccCChhhHHHHhc
Q psy2973 460 SCNECGKQYKWESFLQKHKK 479 (514)
Q Consensus 460 ~C~~C~~~f~~~~~L~~H~~ 479 (514)
.|+.||+.| ....|..|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4445555544
No 62
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.92 E-value=0.16 Score=31.75 Aligned_cols=23 Identities=22% Similarity=0.660 Sum_probs=18.4
Q ss_pred cccccccccccCChHHHHHHHHh
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKV 449 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~ 449 (514)
+|.|.+|+..|.+..++..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888888888888764
No 63
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=90.62 E-value=0.082 Score=34.59 Aligned_cols=20 Identities=25% Similarity=0.303 Sum_probs=15.6
Q ss_pred eccCCchhhhhhhhhccccc
Q psy2973 100 IFDCQDSEWRDVLKKKQGVL 119 (514)
Q Consensus 100 ~~~~q~~~~~~v~~e~~~~~ 119 (514)
..+.|+++|+|||+||++++
T Consensus 18 L~~~Qk~ly~dvm~Eny~~l 37 (41)
T PF01352_consen 18 LDPAQKNLYRDVMLENYRNL 37 (41)
T ss_dssp S-HHHHHHHHHHHHHTTTS-
T ss_pred ccceecccchhHHHHhhccc
Confidence 34568899999999999854
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.15 E-value=0.24 Score=28.49 Aligned_cols=19 Identities=32% Similarity=0.761 Sum_probs=11.6
Q ss_pred cccccccccCChHHHHHHHH
Q psy2973 429 KCEKCDMRFVTNKNLKRHMK 448 (514)
Q Consensus 429 ~C~~C~~~F~~~~~L~~H~~ 448 (514)
.|+.|++.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 566666666 4555666654
No 65
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.45 E-value=0.57 Score=37.95 Aligned_cols=23 Identities=39% Similarity=0.699 Sum_probs=15.8
Q ss_pred CCCcccCcCCcccCChHHHHHHH
Q psy2973 367 ERPFQCNDCGSTFRLKRHLNSHI 389 (514)
Q Consensus 367 ~~~~~C~~C~~~f~~~~~L~~H~ 389 (514)
+-|..|+.||.+..+..+|.+-.
T Consensus 13 ~LP~~CpiCgLtLVss~HLARSy 35 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARSY 35 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHhh
Confidence 44667777777777777777653
No 66
>KOG4173|consensus
Probab=88.38 E-value=0.3 Score=43.37 Aligned_cols=22 Identities=27% Similarity=0.535 Sum_probs=10.6
Q ss_pred ccCcCCcccCChHHHHHHHHhh
Q psy2973 371 QCNDCGSTFRLKRHLNSHINRL 392 (514)
Q Consensus 371 ~C~~C~~~f~~~~~L~~H~~~~ 392 (514)
.|..|.+.|.+...|..|+...
T Consensus 108 sCs~C~r~~Pt~hLLd~HI~E~ 129 (253)
T KOG4173|consen 108 SCSFCKRAFPTGHLLDAHILEW 129 (253)
T ss_pred hhHHHHHhCCchhhhhHHHHHH
Confidence 3555555555554454444433
No 67
>KOG2893|consensus
Probab=88.10 E-value=0.13 Score=46.60 Aligned_cols=47 Identities=30% Similarity=0.576 Sum_probs=37.3
Q ss_pred ccCcccccccChHHHHhhHhcccCCCCceeccCCCcccccHHHHHHHHHHhh
Q psy2973 169 KCQFCPMWFEQRGKRRKHVNGVHLKIKRFKCEVCPMKFHRQVSLKKHVEAVH 220 (514)
Q Consensus 169 ~C~~C~~~f~~~~~l~~H~~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~~H 220 (514)
.|..|++.|.....|..|++. |-|+|.+|.+...+--.|..|-..+|
T Consensus 12 wcwycnrefddekiliqhqka-----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA-----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhhh-----ccceeeeehhhhccCCCceeehhhhh
Confidence 499999999999999998774 44899999888777777777765555
No 68
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.57 E-value=0.27 Score=33.68 Aligned_cols=30 Identities=30% Similarity=0.705 Sum_probs=25.2
Q ss_pred CCCCCcccCcCCcccCChHHHHHHHHhhcC
Q psy2973 365 NTERPFQCNDCGSTFRLKRHLNSHINRLHK 394 (514)
Q Consensus 365 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~ 394 (514)
.++..+.|+.||..|.......+|+...|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 357788999999999999999999887774
No 69
>KOG4173|consensus
Probab=87.11 E-value=0.37 Score=42.79 Aligned_cols=83 Identities=29% Similarity=0.577 Sum_probs=64.2
Q ss_pred CCcccCc--CCcccCChHHHHHHHHhhcCCCccccccccccCChHHHHHHHHhhcCC-------CCCCcccc--cccccc
Q psy2973 368 RPFQCND--CGSTFRLKRHLNSHINRLHKLKLNCVHCQAIFTNKQRLNYHILTKHSG-------PNYKAWKC--EKCDMR 436 (514)
Q Consensus 368 ~~~~C~~--C~~~f~~~~~L~~H~~~~h~~~~~C~~C~~~f~~~~~L~~H~~~~H~~-------~~~k~~~C--~~C~~~ 436 (514)
..|.|++ |...|........|....|+ -.|..|.+.|.+...|..|+...|.. .+.-.|.| ..|+..
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 4577876 77788888888888765554 47999999999999999999876641 12356999 569999
Q ss_pred cCChHHHHHHHHhhcC
Q psy2973 437 FVTNKNLKRHMKVHEK 452 (514)
Q Consensus 437 F~~~~~L~~H~~~H~~ 452 (514)
|.+..+-+.||-.-|.
T Consensus 156 FkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhHHHHhcc
Confidence 9999999999865443
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=86.76 E-value=0.27 Score=33.70 Aligned_cols=29 Identities=38% Similarity=0.661 Sum_probs=22.9
Q ss_pred CCCccccCcccccccChHHHHhhHhcccC
Q psy2973 164 GENKFKCQFCPMWFEQRGKRRKHVNGVHL 192 (514)
Q Consensus 164 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 192 (514)
||.-+.|+.||..|.......+|+...|.
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 66778888888888888888888876664
No 71
>KOG2893|consensus
Probab=86.72 E-value=0.18 Score=45.74 Aligned_cols=41 Identities=27% Similarity=0.577 Sum_probs=23.1
Q ss_pred ccccccccCChHHHHHHHHhhcCCCCCCcccccccccccCChHHHHHH
Q psy2973 399 CVHCQAIFTNKQRLNYHILTKHSGPNYKAWKCEKCDMRFVTNKNLKRH 446 (514)
Q Consensus 399 C~~C~~~f~~~~~L~~H~~~~H~~~~~k~~~C~~C~~~F~~~~~L~~H 446 (514)
|.+|++-|.+..-|.+|++. |-|+|.+|.+..-+.-.|..|
T Consensus 13 cwycnrefddekiliqhqka-------khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA-------KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhh-------ccceeeeehhhhccCCCceee
Confidence 55555555555555555543 336666666655555555555
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=86.53 E-value=1.1 Score=36.57 Aligned_cols=27 Identities=15% Similarity=0.545 Sum_probs=22.6
Q ss_pred Ccccc----cccccccCChHHHHHHHHhhcC
Q psy2973 426 KAWKC----EKCDMRFVTNKNLKRHMKVHEK 452 (514)
Q Consensus 426 k~~~C----~~C~~~F~~~~~L~~H~~~H~~ 452 (514)
..|.| ..|++.+.+...|..|++.+||
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 34889 8899999999999999988775
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=86.14 E-value=1.2 Score=36.13 Aligned_cols=24 Identities=29% Similarity=0.476 Sum_probs=12.4
Q ss_pred cccCCcCCCcCCCHHHHHHHHHhc
Q psy2973 309 TSKCSVCDRLYKSDYLRRKHERSH 332 (514)
Q Consensus 309 ~~~C~~C~~~f~~~~~l~~H~~~h 332 (514)
.|.|+.|...|=-.-+.-.|...|
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ceeCCCCCCccccccchhhhhhcc
Confidence 455555555555544444444444
No 74
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=85.93 E-value=0.51 Score=29.41 Aligned_cols=21 Identities=24% Similarity=0.542 Sum_probs=12.6
Q ss_pred cccccccccccCHHHHHHhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVK 503 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~ 503 (514)
|.|.+|+..|.+...+..|++
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred eEccccCCccCCHHHHHHHHC
Confidence 446666666666666666654
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=84.23 E-value=1.9 Score=35.19 Aligned_cols=25 Identities=20% Similarity=0.361 Sum_probs=21.6
Q ss_pred cccc----cccccccCChhhHHHHhcccc
Q psy2973 458 DSSC----NECGKQYKWESFLQKHKKICT 482 (514)
Q Consensus 458 ~~~C----~~C~~~f~~~~~L~~H~~~h~ 482 (514)
.|.| ..|+..+.+...|..|.+.++
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 4889 999999999999999987654
No 76
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=83.11 E-value=0.79 Score=28.35 Aligned_cols=10 Identities=40% Similarity=1.049 Sum_probs=4.5
Q ss_pred eecccccccc
Q psy2973 226 CKCEVCGKSF 235 (514)
Q Consensus 226 ~~C~~C~~~f 235 (514)
|.|..||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 3444454443
No 77
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=73.91 E-value=0.93 Score=28.97 Aligned_cols=12 Identities=17% Similarity=0.714 Sum_probs=5.6
Q ss_pred cccccccccCCh
Q psy2973 460 SCNECGKQYKWE 471 (514)
Q Consensus 460 ~C~~C~~~f~~~ 471 (514)
.|+.|+..|.-.
T Consensus 4 ~CP~C~~~~~v~ 15 (38)
T TIGR02098 4 QCPNCKTSFRVV 15 (38)
T ss_pred ECCCCCCEEEeC
Confidence 455555444433
No 78
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=72.75 E-value=1.8 Score=25.19 Aligned_cols=9 Identities=33% Similarity=1.084 Sum_probs=5.0
Q ss_pred ccccccccc
Q psy2973 484 TCKSCGKRY 492 (514)
Q Consensus 484 ~C~~C~~~f 492 (514)
.|+.||+.|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 455555555
No 79
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=72.48 E-value=2.9 Score=29.41 Aligned_cols=8 Identities=50% Similarity=1.805 Sum_probs=3.6
Q ss_pred CcccCcCC
Q psy2973 369 PFQCNDCG 376 (514)
Q Consensus 369 ~~~C~~C~ 376 (514)
+|.|+.||
T Consensus 50 ~Y~Cp~CG 57 (61)
T COG2888 50 PYRCPKCG 57 (61)
T ss_pred ceECCCcC
Confidence 34444444
No 80
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=72.03 E-value=1.9 Score=39.96 Aligned_cols=22 Identities=32% Similarity=0.597 Sum_probs=12.7
Q ss_pred CCeecccccccccChhhhhhhh
Q psy2973 224 KPCKCEVCGKSFSNNSTLKVHS 245 (514)
Q Consensus 224 k~~~C~~C~~~f~~~~~L~~H~ 245 (514)
+.+.|++|+..|.+...+....
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~ 25 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKI 25 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCc
Confidence 4456777776666655444443
No 81
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=69.82 E-value=1 Score=30.22 Aligned_cols=11 Identities=27% Similarity=0.909 Sum_probs=5.2
Q ss_pred ccccccccccC
Q psy2973 428 WKCEKCDMRFV 438 (514)
Q Consensus 428 ~~C~~C~~~F~ 438 (514)
|.|+.||..|.
T Consensus 4 y~C~~CG~~~~ 14 (46)
T PRK00398 4 YKCARCGREVE 14 (46)
T ss_pred EECCCCCCEEE
Confidence 44555554443
No 82
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=69.09 E-value=1.9 Score=37.27 Aligned_cols=35 Identities=17% Similarity=0.555 Sum_probs=18.9
Q ss_pred CcccccccccccCChHHHHH-HHHhhcCCCCccccccccccccc
Q psy2973 426 KAWKCEKCDMRFVTNKNLKR-HMKVHEKNRNMLDSSCNECGKQY 468 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~-H~~~H~~~~~~~~~~C~~C~~~f 468 (514)
..|.|+.|+..|....++.. + ... .|.|+.||...
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~~d-----~~~---~f~Cp~Cg~~l 133 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQLLD-----MDG---TFTCPRCGEEL 133 (147)
T ss_pred cEEECcCCCCEeeHHHHHHhcC-----CCC---cEECCCCCCEE
Confidence 45777777777765444332 1 011 16677776553
No 83
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=67.55 E-value=0.83 Score=30.06 Aligned_cols=12 Identities=33% Similarity=1.079 Sum_probs=6.0
Q ss_pred ccccccccccCC
Q psy2973 428 WKCEKCDMRFVT 439 (514)
Q Consensus 428 ~~C~~C~~~F~~ 439 (514)
|.|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (42)
T PF09723_consen 6 YRCEECGHEFEV 17 (42)
T ss_pred EEeCCCCCEEEE
Confidence 455555554443
No 84
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=65.77 E-value=1.1 Score=29.07 Aligned_cols=11 Identities=36% Similarity=1.138 Sum_probs=5.8
Q ss_pred ccccccccccC
Q psy2973 428 WKCEKCDMRFV 438 (514)
Q Consensus 428 ~~C~~C~~~F~ 438 (514)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 45555555554
No 85
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=65.41 E-value=1.4 Score=38.27 Aligned_cols=13 Identities=31% Similarity=0.746 Sum_probs=6.6
Q ss_pred ccccccccccCCh
Q psy2973 459 SSCNECGKQYKWE 471 (514)
Q Consensus 459 ~~C~~C~~~f~~~ 471 (514)
|+|+.||++|.+.
T Consensus 29 ~~c~~c~~~f~~~ 41 (154)
T PRK00464 29 RECLACGKRFTTF 41 (154)
T ss_pred eeccccCCcceEe
Confidence 4455555555443
No 86
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=65.40 E-value=2.1 Score=39.67 Aligned_cols=22 Identities=23% Similarity=0.539 Sum_probs=15.0
Q ss_pred CCccccCcccccccChHHHHhh
Q psy2973 165 ENKFKCQFCPMWFEQRGKRRKH 186 (514)
Q Consensus 165 ~~~~~C~~C~~~f~~~~~l~~H 186 (514)
++.+.|++|+..|.+...+...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~ 24 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGK 24 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCC
Confidence 3567888888888766544443
No 87
>KOG2186|consensus
Probab=65.09 E-value=3.4 Score=38.30 Aligned_cols=46 Identities=30% Similarity=0.704 Sum_probs=21.1
Q ss_pred cccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHHhhh
Q psy2973 254 FKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKHHLF 302 (514)
Q Consensus 254 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 302 (514)
|.|..||.... +..+..|+...++ .-|.|-.|++.|.. .++..|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 44445544443 2234445544444 33555555555544 44444443
No 88
>KOG2186|consensus
Probab=64.47 E-value=3.8 Score=37.97 Aligned_cols=45 Identities=31% Similarity=0.742 Sum_probs=20.5
Q ss_pred cccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcccCcCCcccCChHHHHHHH
Q psy2973 336 FQCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQCNDCGSTFRLKRHLNSHI 389 (514)
Q Consensus 336 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~ 389 (514)
|.|..||...... .+.+|+...++ .-|.|-.|+..|-. .....|.
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-------~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-------AYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-------CeeEEeeccccccc-chhhhhh
Confidence 4555555544332 23335544442 23555555555544 3344443
No 89
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=64.01 E-value=2.6 Score=26.67 Aligned_cols=32 Identities=22% Similarity=0.582 Sum_probs=13.6
Q ss_pred cccccccccCChhhHHHHhcccccccccccccc
Q psy2973 460 SCNECGKQYKWESFLQKHKKICTTTCKSCGKRY 492 (514)
Q Consensus 460 ~C~~C~~~f~~~~~L~~H~~~h~~~C~~C~~~f 492 (514)
.|+.|+..|.-...... -.-...+|+.|+..|
T Consensus 4 ~Cp~C~~~y~i~d~~ip-~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKIP-PKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHCC-CCCcEEECCCCCCEe
Confidence 45555555554443322 111124555555544
No 90
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=63.52 E-value=4.9 Score=25.03 Aligned_cols=8 Identities=50% Similarity=1.152 Sum_probs=3.4
Q ss_pred eecccccc
Q psy2973 226 CKCEVCGK 233 (514)
Q Consensus 226 ~~C~~C~~ 233 (514)
|.|..||.
T Consensus 3 ~~C~~CG~ 10 (34)
T cd00729 3 WVCPVCGY 10 (34)
T ss_pred EECCCCCC
Confidence 34444443
No 91
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=62.17 E-value=5.1 Score=32.50 Aligned_cols=14 Identities=29% Similarity=0.681 Sum_probs=6.9
Q ss_pred cccccccccccCCh
Q psy2973 427 AWKCEKCDMRFVTN 440 (514)
Q Consensus 427 ~~~C~~C~~~F~~~ 440 (514)
|-.|+.||..|.-.
T Consensus 26 PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 26 PIVCPKCGTEFPPE 39 (108)
T ss_pred CccCCCCCCccCcc
Confidence 44455555555433
No 92
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=61.87 E-value=6 Score=34.68 Aligned_cols=30 Identities=23% Similarity=0.575 Sum_probs=19.4
Q ss_pred CcccccccccccCChHHHHHHHHhhcCCCCcccccccccccc
Q psy2973 426 KAWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQ 467 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~ 467 (514)
.-|.|+.|+..|+...+|.. .|.|+.||..
T Consensus 108 ~~Y~Cp~c~~r~tf~eA~~~------------~F~Cp~Cg~~ 137 (158)
T TIGR00373 108 MFFICPNMCVRFTFNEAMEL------------NFTCPRCGAM 137 (158)
T ss_pred CeEECCCCCcEeeHHHHHHc------------CCcCCCCCCE
Confidence 44677777777777776641 1677777754
No 93
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=60.20 E-value=3.5 Score=36.91 Aligned_cols=31 Identities=23% Similarity=0.702 Sum_probs=20.2
Q ss_pred CcccccccccccCChHHHHHHHHhhcCCCCccccccccccccc
Q psy2973 426 KAWKCEKCDMRFVTNKNLKRHMKVHEKNRNMLDSSCNECGKQY 468 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f 468 (514)
.-|.|+.|+..|+...++.. .|.|+.||-..
T Consensus 116 ~~Y~Cp~C~~rytf~eA~~~------------~F~Cp~Cg~~L 146 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAMEY------------GFRCPQCGEML 146 (178)
T ss_pred CEEECCCCCcEEeHHHHhhc------------CCcCCCCCCCC
Confidence 44777777777777766631 16777777553
No 94
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=58.71 E-value=3.9 Score=26.02 Aligned_cols=13 Identities=15% Similarity=0.777 Sum_probs=5.9
Q ss_pred cccccccccCChh
Q psy2973 460 SCNECGKQYKWES 472 (514)
Q Consensus 460 ~C~~C~~~f~~~~ 472 (514)
.|+.|+..|.-..
T Consensus 4 ~CP~C~~~f~v~~ 16 (37)
T PF13719_consen 4 TCPNCQTRFRVPD 16 (37)
T ss_pred ECCCCCceEEcCH
Confidence 3444544444333
No 95
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=56.70 E-value=4.4 Score=35.19 Aligned_cols=14 Identities=29% Similarity=0.648 Sum_probs=6.6
Q ss_pred cccCCCcccccccc
Q psy2973 254 FKCPICGKHIRHKQ 267 (514)
Q Consensus 254 ~~C~~C~~~f~~~~ 267 (514)
++|+.||++|.+..
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 44555555544433
No 96
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=56.64 E-value=6.4 Score=27.79 Aligned_cols=9 Identities=44% Similarity=1.446 Sum_probs=4.5
Q ss_pred CcccCcCCc
Q psy2973 369 PFQCNDCGS 377 (514)
Q Consensus 369 ~~~C~~C~~ 377 (514)
+|.|+.||+
T Consensus 48 ~Y~CP~CGF 56 (59)
T PRK14890 48 PYTCPKCGF 56 (59)
T ss_pred ceECCCCCC
Confidence 455555543
No 97
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=55.28 E-value=6.5 Score=27.03 Aligned_cols=11 Identities=36% Similarity=1.192 Sum_probs=5.3
Q ss_pred eeccccccccc
Q psy2973 226 CKCEVCGKSFS 236 (514)
Q Consensus 226 ~~C~~C~~~f~ 236 (514)
|.|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44555554443
No 98
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=55.27 E-value=9.8 Score=22.05 Aligned_cols=19 Identities=16% Similarity=0.494 Sum_probs=9.5
Q ss_pred cccccccccCChHHHHHHHH
Q psy2973 429 KCEKCDMRFVTNKNLKRHMK 448 (514)
Q Consensus 429 ~C~~C~~~F~~~~~L~~H~~ 448 (514)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 355555555 3445555543
No 99
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=54.88 E-value=8.8 Score=25.51 Aligned_cols=10 Identities=30% Similarity=0.803 Sum_probs=4.2
Q ss_pred eecccccccc
Q psy2973 226 CKCEVCGKSF 235 (514)
Q Consensus 226 ~~C~~C~~~f 235 (514)
|.|..||..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 3444444433
No 100
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=54.83 E-value=5.2 Score=35.79 Aligned_cols=27 Identities=19% Similarity=0.688 Sum_probs=12.5
Q ss_pred ccccccccccCChhhHHHHhcccccccccccc
Q psy2973 459 SSCNECGKQYKWESFLQKHKKICTTTCKSCGK 490 (514)
Q Consensus 459 ~~C~~C~~~f~~~~~L~~H~~~h~~~C~~C~~ 490 (514)
|.|+.|+..|+...++. ..|.|+.||.
T Consensus 118 Y~Cp~C~~rytf~eA~~-----~~F~Cp~Cg~ 144 (178)
T PRK06266 118 FFCPNCHIRFTFDEAME-----YGFRCPQCGE 144 (178)
T ss_pred EECCCCCcEEeHHHHhh-----cCCcCCCCCC
Confidence 44555555544444432 2345555553
No 101
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=53.75 E-value=12 Score=24.69 Aligned_cols=25 Identities=20% Similarity=0.692 Sum_probs=12.7
Q ss_pred CCCcccCcCCcccCC----hHHHHHHHHh
Q psy2973 367 ERPFQCNDCGSTFRL----KRHLNSHINR 391 (514)
Q Consensus 367 ~~~~~C~~C~~~f~~----~~~L~~H~~~ 391 (514)
.....|..|++.+.. -+.|.+|++.
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 445667777766644 3566666643
No 102
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=53.60 E-value=7.6 Score=34.10 Aligned_cols=23 Identities=43% Similarity=1.019 Sum_probs=13.8
Q ss_pred CeecccccccccChhhhhhhhhhccCCcccccCCCc
Q psy2973 225 PCKCEVCGKSFSNNSTLKVHSTTHADQGRFKCPICG 260 (514)
Q Consensus 225 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~ 260 (514)
.|.|++||+. +.++.|-.||+||
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCC
Confidence 4666666653 2345666666666
No 103
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=52.88 E-value=9.7 Score=25.98 Aligned_cols=21 Identities=33% Similarity=0.819 Sum_probs=14.0
Q ss_pred ccccccccccCCh-----HHHHHHHH
Q psy2973 428 WKCEKCDMRFVTN-----KNLKRHMK 448 (514)
Q Consensus 428 ~~C~~C~~~F~~~-----~~L~~H~~ 448 (514)
-.|..|++.+... +.|.+|++
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 4577777766544 57777776
No 104
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=51.28 E-value=12 Score=32.36 Aligned_cols=15 Identities=20% Similarity=0.806 Sum_probs=8.0
Q ss_pred CcccccccccccCCh
Q psy2973 280 VKYTCTYCEKSFLYK 294 (514)
Q Consensus 280 ~~~~C~~C~~~f~~~ 294 (514)
.-|.|+.|+..|...
T Consensus 98 ~~Y~Cp~C~~~y~~~ 112 (147)
T smart00531 98 AYYKCPNCQSKYTFL 112 (147)
T ss_pred cEEECcCCCCEeeHH
Confidence 345566665555543
No 105
>PF14353 CpXC: CpXC protein
Probab=51.10 E-value=2.5 Score=35.57 Aligned_cols=25 Identities=24% Similarity=0.481 Sum_probs=13.9
Q ss_pred cccccccccccCChhhHHHHhcccc
Q psy2973 458 DSSCNECGKQYKWESFLQKHKKICT 482 (514)
Q Consensus 458 ~~~C~~C~~~f~~~~~L~~H~~~h~ 482 (514)
.|.|+.||..|.-...+.-|-..+.
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~~~~ 62 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDPEKK 62 (128)
T ss_pred EEECCCCCCceecCCCEEEEcCCCC
Confidence 3566666666665555555544443
No 106
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=50.35 E-value=11 Score=41.42 Aligned_cols=8 Identities=25% Similarity=0.787 Sum_probs=4.3
Q ss_pred cCCcCCCc
Q psy2973 311 KCSVCDRL 318 (514)
Q Consensus 311 ~C~~C~~~ 318 (514)
.|..||..
T Consensus 437 ~C~~Cg~v 444 (730)
T COG1198 437 LCRDCGYI 444 (730)
T ss_pred ecccCCCc
Confidence 35666554
No 107
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.78 E-value=10 Score=33.29 Aligned_cols=13 Identities=38% Similarity=0.871 Sum_probs=6.8
Q ss_pred hcCCCCeeccccc
Q psy2973 220 HLKLKPCKCEVCG 232 (514)
Q Consensus 220 H~~~k~~~C~~C~ 232 (514)
+.++.|-.||+||
T Consensus 144 ~~ge~P~~CPiCg 156 (166)
T COG1592 144 HEGEAPEVCPICG 156 (166)
T ss_pred ccCCCCCcCCCCC
Confidence 4445555555555
No 108
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=49.75 E-value=20 Score=25.30 Aligned_cols=27 Identities=22% Similarity=0.614 Sum_probs=11.9
Q ss_pred cccHHHHHHHhcCCCcccccc----cccccC
Q psy2973 266 KQTFDDHMNMHSGDVKYTCTY----CEKSFL 292 (514)
Q Consensus 266 ~~~L~~H~~~h~~~~~~~C~~----C~~~f~ 292 (514)
+..|..|+...-...+..|.. |+..+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 445555555444444455555 555443
No 109
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=47.98 E-value=6.8 Score=26.63 Aligned_cols=10 Identities=30% Similarity=1.039 Sum_probs=4.9
Q ss_pred cccccccccc
Q psy2973 459 SSCNECGKQY 468 (514)
Q Consensus 459 ~~C~~C~~~f 468 (514)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 4455555444
No 110
>PHA00626 hypothetical protein
Probab=47.63 E-value=4.9 Score=27.85 Aligned_cols=12 Identities=33% Similarity=0.814 Sum_probs=6.2
Q ss_pred CcccCcCCcccC
Q psy2973 369 PFQCNDCGSTFR 380 (514)
Q Consensus 369 ~~~C~~C~~~f~ 380 (514)
.|.|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 455555555543
No 111
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=47.18 E-value=18 Score=31.62 Aligned_cols=33 Identities=12% Similarity=0.533 Sum_probs=18.4
Q ss_pred cCCCcccccccccccCChHHHHHhhhhcccCCcccCCcCCCc
Q psy2973 277 SGDVKYTCTYCEKSFLYKKSLKHHLFVHSSEFTSKCSVCDRL 318 (514)
Q Consensus 277 ~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~ 318 (514)
.+..-|.|+.|+..|+....+. .-|.|+.||..
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 3344466666666666555553 13666666644
No 112
>KOG2807|consensus
Probab=47.16 E-value=23 Score=34.25 Aligned_cols=23 Identities=26% Similarity=0.481 Sum_probs=15.9
Q ss_pred CCCcccCcCCcccCChHHHHHHH
Q psy2973 367 ERPFQCNDCGSTFRLKRHLNSHI 389 (514)
Q Consensus 367 ~~~~~C~~C~~~f~~~~~L~~H~ 389 (514)
.-|..|++|+.+.....+|.+-.
T Consensus 288 sLP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 288 SLPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred cCCccCCccceeEecchHHHHHH
Confidence 34677777777777777776654
No 113
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=46.87 E-value=10 Score=29.07 Aligned_cols=12 Identities=50% Similarity=1.453 Sum_probs=6.1
Q ss_pred cccccccccccC
Q psy2973 427 AWKCEKCDMRFV 438 (514)
Q Consensus 427 ~~~C~~C~~~F~ 438 (514)
.|.|..|+..|.
T Consensus 53 IW~C~kCg~~fA 64 (89)
T COG1997 53 IWKCRKCGAKFA 64 (89)
T ss_pred eEEcCCCCCeec
Confidence 355555555543
No 114
>PF12907 zf-met2: Zinc-binding
Probab=46.51 E-value=9 Score=24.85 Aligned_cols=14 Identities=43% Similarity=0.551 Sum_probs=6.5
Q ss_pred HHHHHHHHHHhhcC
Q psy2973 209 QVSLKKHVEAVHLK 222 (514)
Q Consensus 209 ~~~l~~H~~~~H~~ 222 (514)
...|..|....|.+
T Consensus 17 ~~~L~eH~enKHpK 30 (40)
T PF12907_consen 17 EPQLKEHAENKHPK 30 (40)
T ss_pred HHHHHHHHHccCCC
Confidence 34455555444444
No 115
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=45.12 E-value=14 Score=30.68 Aligned_cols=7 Identities=14% Similarity=0.377 Sum_probs=2.6
Q ss_pred ccccccc
Q psy2973 283 TCTYCEK 289 (514)
Q Consensus 283 ~C~~C~~ 289 (514)
.|+.||.
T Consensus 28 vcP~cg~ 34 (129)
T TIGR02300 28 VSPYTGE 34 (129)
T ss_pred cCCCcCC
Confidence 3333333
No 116
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=45.10 E-value=13 Score=34.59 Aligned_cols=24 Identities=17% Similarity=0.297 Sum_probs=15.1
Q ss_pred ccccccccccCChhhHHHHhcccc
Q psy2973 459 SSCNECGKQYKWESFLQKHKKICT 482 (514)
Q Consensus 459 ~~C~~C~~~f~~~~~L~~H~~~h~ 482 (514)
+.|+.||........|..-.|+|.
T Consensus 210 ~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 210 IPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred CCCCCCCCcccccccceeeeecch
Confidence 567777766666666665555554
No 117
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=44.99 E-value=14 Score=20.96 Aligned_cols=8 Identities=25% Similarity=1.111 Sum_probs=4.2
Q ss_pred cccccccc
Q psy2973 427 AWKCEKCD 434 (514)
Q Consensus 427 ~~~C~~C~ 434 (514)
+|.|+.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 45555554
No 118
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=44.41 E-value=8 Score=28.79 Aligned_cols=13 Identities=15% Similarity=0.452 Sum_probs=5.8
Q ss_pred ccccC--CCcccccc
Q psy2973 253 RFKCP--ICGKHIRH 265 (514)
Q Consensus 253 ~~~C~--~C~~~f~~ 265 (514)
.++|. .||.+|..
T Consensus 27 Y~qC~N~eCg~tF~t 41 (72)
T PRK09678 27 YHQCQNVNCSATFIT 41 (72)
T ss_pred eeecCCCCCCCEEEE
Confidence 34444 44444443
No 119
>PF15269 zf-C2H2_7: Zinc-finger
Probab=43.82 E-value=14 Score=24.30 Aligned_cols=23 Identities=26% Similarity=0.349 Sum_probs=15.7
Q ss_pred cccccccccccCChhhHHHHhcc
Q psy2973 458 DSSCNECGKQYKWESFLQKHKKI 480 (514)
Q Consensus 458 ~~~C~~C~~~f~~~~~L~~H~~~ 480 (514)
+|+|-.|..+...++.|-.||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 36677777777777777777654
No 120
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=43.75 E-value=8.7 Score=35.34 Aligned_cols=26 Identities=23% Similarity=0.433 Sum_probs=14.8
Q ss_pred cccccccccccCHHHHHHhhhhhhCC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYGQ 508 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~ge 508 (514)
|.|++|++.|.-..-+++||.+-|+|
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred ECCCCCCcccCChHHHHHHHhhcCHH
Confidence 56666666666666666666665543
No 121
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=43.66 E-value=15 Score=30.41 Aligned_cols=15 Identities=7% Similarity=0.014 Sum_probs=7.3
Q ss_pred CcccccccccccCCh
Q psy2973 426 KAWKCEKCDMRFVTN 440 (514)
Q Consensus 426 k~~~C~~C~~~F~~~ 440 (514)
.|..|+.||..|.-.
T Consensus 25 ~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 25 RPAVSPYTGEQFPPE 39 (129)
T ss_pred CCccCCCcCCccCcc
Confidence 344555555554433
No 122
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=42.03 E-value=9.3 Score=28.44 Aligned_cols=17 Identities=18% Similarity=0.585 Sum_probs=8.8
Q ss_pred CCeecc--cccccccChhh
Q psy2973 224 KPCKCE--VCGKSFSNNST 240 (514)
Q Consensus 224 k~~~C~--~C~~~f~~~~~ 240 (514)
.-+.|. .||.+|.....
T Consensus 26 ~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 26 RYHQCQNVNCSATFITYES 44 (72)
T ss_pred eeeecCCCCCCCEEEEEEE
Confidence 344555 56666554433
No 123
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=41.99 E-value=12 Score=31.59 Aligned_cols=24 Identities=29% Similarity=0.827 Sum_probs=14.0
Q ss_pred cccccccccccCHHHHHHhhhhhhCCC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYGQQ 509 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~ge~ 509 (514)
..|-+||+.|.+ |+.|++.|||--
T Consensus 73 i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eEEccCCcccch---HHHHHHHccCCC
Confidence 468888888765 478888887753
No 124
>KOG2272|consensus
Probab=41.50 E-value=11 Score=35.01 Aligned_cols=16 Identities=19% Similarity=0.513 Sum_probs=11.3
Q ss_pred cccccccccccccCHH
Q psy2973 481 CTTTCKSCGKRYSSIY 496 (514)
Q Consensus 481 h~~~C~~C~~~f~~~~ 496 (514)
+.|.|+.|++..+.+.
T Consensus 279 ~cf~Cs~Cdkkl~~K~ 294 (332)
T KOG2272|consen 279 ECFSCSTCDKKLTQKN 294 (332)
T ss_pred cccccccccccccccc
Confidence 3388888888766554
No 125
>KOG2807|consensus
Probab=40.93 E-value=40 Score=32.74 Aligned_cols=34 Identities=21% Similarity=0.632 Sum_probs=21.8
Q ss_pred cccccCCCcccccccccHHHHHHHhcCCCcccccccccccCChHHHHH
Q psy2973 252 GRFKCPICGKHIRHKQTFDDHMNMHSGDVKYTCTYCEKSFLYKKSLKH 299 (514)
Q Consensus 252 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~ 299 (514)
.-|.|+.|....- .-|..|+.|+-+..+...|.+
T Consensus 275 ~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 275 GGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLAR 308 (378)
T ss_pred CceeCCcccCeee--------------cCCccCCccceeEecchHHHH
Confidence 3477888765432 346778888877776666643
No 126
>PF15269 zf-C2H2_7: Zinc-finger
Probab=40.54 E-value=18 Score=23.82 Aligned_cols=22 Identities=27% Similarity=0.321 Sum_probs=20.2
Q ss_pred cccccccccccCHHHHHHhhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKS 504 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~ 504 (514)
|+|-+|+.....++.|..||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7899999999999999999973
No 127
>PF12907 zf-met2: Zinc-binding
Probab=39.84 E-value=22 Score=23.10 Aligned_cols=12 Identities=33% Similarity=0.443 Sum_probs=5.3
Q ss_pred HHHHHHHHhhcC
Q psy2973 410 QRLNYHILTKHS 421 (514)
Q Consensus 410 ~~L~~H~~~~H~ 421 (514)
..|..|....|.
T Consensus 18 ~~L~eH~enKHp 29 (40)
T PF12907_consen 18 PQLKEHAENKHP 29 (40)
T ss_pred HHHHHHHHccCC
Confidence 344444444444
No 128
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.45 E-value=9.2 Score=28.35 Aligned_cols=9 Identities=33% Similarity=1.184 Sum_probs=4.5
Q ss_pred ccccccccc
Q psy2973 428 WKCEKCDMR 436 (514)
Q Consensus 428 ~~C~~C~~~ 436 (514)
|+|..|+..
T Consensus 13 Y~c~~cg~~ 21 (82)
T COG2331 13 YECTECGNR 21 (82)
T ss_pred EeecccchH
Confidence 455555543
No 129
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=38.12 E-value=14 Score=20.75 Aligned_cols=7 Identities=43% Similarity=1.469 Sum_probs=3.2
Q ss_pred ccccccc
Q psy2973 461 CNECGKQ 467 (514)
Q Consensus 461 C~~C~~~ 467 (514)
|+.||..
T Consensus 2 Cp~CG~~ 8 (23)
T PF13240_consen 2 CPNCGAE 8 (23)
T ss_pred CcccCCC
Confidence 4444443
No 130
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=37.35 E-value=11 Score=25.88 Aligned_cols=11 Identities=18% Similarity=0.797 Sum_probs=6.1
Q ss_pred ccccccccccC
Q psy2973 428 WKCEKCDMRFV 438 (514)
Q Consensus 428 ~~C~~C~~~F~ 438 (514)
|+|..|+..+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 45556655554
No 131
>PRK04023 DNA polymerase II large subunit; Validated
Probab=37.26 E-value=28 Score=39.16 Aligned_cols=24 Identities=29% Similarity=0.905 Sum_probs=14.6
Q ss_pred ccccccccccCChhhHHHHhcccccccccccc
Q psy2973 459 SSCNECGKQYKWESFLQKHKKICTTTCKSCGK 490 (514)
Q Consensus 459 ~~C~~C~~~f~~~~~L~~H~~~h~~~C~~C~~ 490 (514)
|.|..||..|.+.. =..+|+.||-
T Consensus 1038 fRC~kC~~kYRR~P--------L~G~C~kCGg 1061 (1121)
T PRK04023 1038 FRCTKCGAKYRRPP--------LSGKCPKCGG 1061 (1121)
T ss_pred eeecccCcccccCC--------CCCcCccCCC
Confidence 67777777765331 1256777764
No 132
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=37.01 E-value=15 Score=22.12 Aligned_cols=10 Identities=40% Similarity=1.079 Sum_probs=4.8
Q ss_pred cccccCCCcc
Q psy2973 252 GRFKCPICGK 261 (514)
Q Consensus 252 ~~~~C~~C~~ 261 (514)
..|.|+.|+.
T Consensus 18 ~~~vCp~C~~ 27 (30)
T PF08274_consen 18 ELLVCPECGH 27 (30)
T ss_dssp SSEEETTTTE
T ss_pred CEEeCCcccc
Confidence 3455555543
No 133
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=36.79 E-value=20 Score=30.27 Aligned_cols=25 Identities=36% Similarity=0.684 Sum_probs=16.7
Q ss_pred ccccccccccCChHHHHHHHHhhcCCCC
Q psy2973 428 WKCEKCDMRFVTNKNLKRHMKVHEKNRN 455 (514)
Q Consensus 428 ~~C~~C~~~F~~~~~L~~H~~~H~~~~~ 455 (514)
..|-+||+.|.. |++|+++|+|-.|
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred eEEccCCcccch---HHHHHHHccCCCH
Confidence 689999999986 4999999987544
No 134
>PRK04023 DNA polymerase II large subunit; Validated
Probab=36.32 E-value=32 Score=38.76 Aligned_cols=9 Identities=33% Similarity=0.818 Sum_probs=5.2
Q ss_pred ccccccccc
Q psy2973 282 YTCTYCEKS 290 (514)
Q Consensus 282 ~~C~~C~~~ 290 (514)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 456666654
No 135
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=36.11 E-value=18 Score=22.18 Aligned_cols=11 Identities=36% Similarity=1.235 Sum_probs=6.1
Q ss_pred CCcccCcCCcc
Q psy2973 368 RPFQCNDCGST 378 (514)
Q Consensus 368 ~~~~C~~C~~~ 378 (514)
.+.+|+.||..
T Consensus 16 ~~irC~~CG~R 26 (32)
T PF03604_consen 16 DPIRCPECGHR 26 (32)
T ss_dssp STSSBSSSS-S
T ss_pred CcEECCcCCCe
Confidence 35667777653
No 136
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=35.99 E-value=18 Score=33.84 Aligned_cols=40 Identities=23% Similarity=0.456 Sum_probs=20.3
Q ss_pred CCCceeccCCCcccccHHHHHHHHHHh-hcCCCCeecccccc
Q psy2973 193 KIKRFKCEVCPMKFHRQVSLKKHVEAV-HLKLKPCKCEVCGK 233 (514)
Q Consensus 193 ~~k~~~C~~C~~~f~~~~~l~~H~~~~-H~~~k~~~C~~C~~ 233 (514)
+.+.|.|..|..-. -...--.|+-.- -.....|+|.-|++
T Consensus 139 GGrif~CsfC~~fl-CEDDQFEHQAsCQvLe~E~~KC~SCNr 179 (314)
T PF06524_consen 139 GGRIFKCSFCDNFL-CEDDQFEHQASCQVLESETFKCQSCNR 179 (314)
T ss_pred CCeEEEeecCCCee-eccchhhhhhhhhhhhccccccccccc
Confidence 45678888887532 222222342111 12234577777765
No 137
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=35.73 E-value=26 Score=32.26 Aligned_cols=26 Identities=27% Similarity=0.680 Sum_probs=18.4
Q ss_pred ccccCccccccCCHHHHHHHHHHhcC
Q psy2973 335 IFQCRICDKILSNIQGIRRHMENMHN 360 (514)
Q Consensus 335 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 360 (514)
.|.|.+|+|.|....-..+|+...|.
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred EECCCCCCcccCChHHHHHHHhhcCH
Confidence 47788888888888888888888775
No 138
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=34.52 E-value=26 Score=29.31 Aligned_cols=12 Identities=25% Similarity=0.761 Sum_probs=5.8
Q ss_pred CCeecccccccc
Q psy2973 224 KPCKCEVCGKSF 235 (514)
Q Consensus 224 k~~~C~~C~~~f 235 (514)
+.|.|.+|..+.
T Consensus 79 ~lYeCnIC~etS 90 (140)
T PF05290_consen 79 KLYECNICKETS 90 (140)
T ss_pred CceeccCccccc
Confidence 445555554443
No 139
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=33.52 E-value=23 Score=33.98 Aligned_cols=19 Identities=32% Similarity=0.674 Sum_probs=9.2
Q ss_pred CcccCcCCcccCChHHHHH
Q psy2973 369 PFQCNDCGSTFRLKRHLNS 387 (514)
Q Consensus 369 ~~~C~~C~~~f~~~~~L~~ 387 (514)
|..|+.|........+|.+
T Consensus 322 Pi~CP~Csl~LilsthLar 340 (421)
T COG5151 322 PISCPICSLQLILSTHLAR 340 (421)
T ss_pred CccCcchhHHHHHHHHHHH
Confidence 4555555554444444443
No 140
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=33.14 E-value=48 Score=22.38 Aligned_cols=13 Identities=31% Similarity=0.922 Sum_probs=7.4
Q ss_pred eecccccccccCh
Q psy2973 226 CKCEVCGKSFSNN 238 (514)
Q Consensus 226 ~~C~~C~~~f~~~ 238 (514)
|.|..|++.+.-.
T Consensus 2 y~C~~CgyvYd~~ 14 (47)
T PF00301_consen 2 YQCPVCGYVYDPE 14 (47)
T ss_dssp EEETTTSBEEETT
T ss_pred cCCCCCCEEEcCC
Confidence 5566666655443
No 141
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=32.47 E-value=38 Score=32.51 Aligned_cols=13 Identities=31% Similarity=0.547 Sum_probs=6.3
Q ss_pred CccccCccccccC
Q psy2973 334 SIFQCRICDKILS 346 (514)
Q Consensus 334 ~~~~C~~C~~~f~ 346 (514)
-|..|+.|.....
T Consensus 321 LPi~CP~Csl~Li 333 (421)
T COG5151 321 LPISCPICSLQLI 333 (421)
T ss_pred CCccCcchhHHHH
Confidence 3555555554443
No 142
>KOG3362|consensus
Probab=31.61 E-value=19 Score=30.41 Aligned_cols=21 Identities=33% Similarity=0.869 Sum_probs=12.7
Q ss_pred cccccccccccCHHHHHHhhh
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVK 503 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~ 503 (514)
|.|--||..+-+...|..|..
T Consensus 130 ysC~~CG~kyCsv~C~~~Hne 150 (156)
T KOG3362|consen 130 YSCVNCGTKYCSVRCLKTHNE 150 (156)
T ss_pred hHHHhcCCceeechhhhhccc
Confidence 556666666666666666654
No 143
>KOG4167|consensus
Probab=30.27 E-value=15 Score=39.49 Aligned_cols=26 Identities=31% Similarity=0.659 Sum_probs=23.8
Q ss_pred cccccccccccCChHHHHHHHHhhcC
Q psy2973 427 AWKCEKCDMRFVTNKNLKRHMKVHEK 452 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~L~~H~~~H~~ 452 (514)
.|.|..|++.|.-..++..||++|.-
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 49999999999999999999999964
No 144
>KOG1280|consensus
Probab=29.29 E-value=48 Score=32.50 Aligned_cols=25 Identities=20% Similarity=0.567 Sum_probs=11.7
Q ss_pred ccccccccccCChHHHHHHHHhhcC
Q psy2973 397 LNCVHCQAIFTNKQRLNYHILTKHS 421 (514)
Q Consensus 397 ~~C~~C~~~f~~~~~L~~H~~~~H~ 421 (514)
|.|++|+..=.+...|..|+...|.
T Consensus 80 ftCPyC~~~Gfte~~f~~Hv~s~Hp 104 (381)
T KOG1280|consen 80 FTCPYCGIMGFTERQFGTHVLSQHP 104 (381)
T ss_pred ccCCcccccccchhHHHHHhhhcCc
Confidence 4444444444444444444444443
No 145
>KOG2907|consensus
Probab=29.11 E-value=31 Score=27.88 Aligned_cols=11 Identities=27% Similarity=0.884 Sum_probs=5.4
Q ss_pred ccccccccccc
Q psy2973 483 TTCKSCGKRYS 493 (514)
Q Consensus 483 ~~C~~C~~~f~ 493 (514)
|.|+.|++.|+
T Consensus 103 YTC~kC~~k~~ 113 (116)
T KOG2907|consen 103 YTCPKCKYKFT 113 (116)
T ss_pred EEcCccceeee
Confidence 44555555443
No 146
>KOG1842|consensus
Probab=28.08 E-value=31 Score=34.92 Aligned_cols=29 Identities=24% Similarity=0.332 Sum_probs=21.3
Q ss_pred CccccccccccCChHHHHHHHHhhcCCCC
Q psy2973 396 KLNCVHCQAIFTNKQRLNYHILTKHSGPN 424 (514)
Q Consensus 396 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~ 424 (514)
.|.|++|..-|.+.+.|..|+...|.+++
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~~ed 43 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHFEED 43 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhccccc
Confidence 56777777777777777777777777664
No 147
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=27.97 E-value=16 Score=24.64 Aligned_cols=34 Identities=21% Similarity=0.557 Sum_probs=17.3
Q ss_pred ccccccccCChhhHHHHhcccc--ccccc--ccccccC
Q psy2973 461 CNECGKQYKWESFLQKHKKICT--TTCKS--CGKRYSS 494 (514)
Q Consensus 461 C~~C~~~f~~~~~L~~H~~~h~--~~C~~--C~~~f~~ 494 (514)
|+.||.......+...+..... ++|.. ||..|..
T Consensus 2 CP~Cg~~a~ir~S~~~s~~~~~~Y~qC~N~~Cg~tfv~ 39 (47)
T PF04606_consen 2 CPHCGSKARIRTSRQLSPLTRELYCQCTNPECGHTFVA 39 (47)
T ss_pred cCCCCCeeEEEEchhhCcceEEEEEEECCCcCCCEEEE
Confidence 6666655444444333322222 56664 7777653
No 148
>KOG4167|consensus
Probab=27.77 E-value=17 Score=39.03 Aligned_cols=25 Identities=28% Similarity=0.546 Sum_probs=22.7
Q ss_pred cccccccccccCHHHHHHhhhhhhC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYG 507 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~g 507 (514)
|-|.+|++.|....++..||++|.-
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHHH
Confidence 6799999999999999999999863
No 149
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=27.51 E-value=39 Score=35.83 Aligned_cols=45 Identities=29% Similarity=0.785 Sum_probs=0.0
Q ss_pred CCcCCCcCCCHHHHHHHHHhcCCccccCccccccCCHHHHHHHHHHhcCCCCCCCCCCcccCcCCcccCChHHHHHHHHh
Q psy2973 312 CSVCDRLYKSDYLRRKHERSHSSIFQCRICDKILSNIQGIRRHMENMHNSDKDNTERPFQCNDCGSTFRLKRHLNSHINR 391 (514)
Q Consensus 312 C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 391 (514)
|..||.. ..|+.|+-.. ..|.. .....|..||.
T Consensus 216 C~~Cg~~-----------------~~C~~C~~~l-----------~~h~~-----~~~l~Ch~Cg~-------------- 248 (505)
T TIGR00595 216 CRSCGYI-----------------LCCPNCDVSL-----------TYHKK-----EGKLRCHYCGY-------------- 248 (505)
T ss_pred hhhCcCc-----------------cCCCCCCCce-----------EEecC-----CCeEEcCCCcC--------------
Q ss_pred hcCCCccccccc
Q psy2973 392 LHKLKLNCVHCQ 403 (514)
Q Consensus 392 ~h~~~~~C~~C~ 403 (514)
....+..|+.|+
T Consensus 249 ~~~~~~~Cp~C~ 260 (505)
T TIGR00595 249 QEPIPKTCPQCG 260 (505)
T ss_pred cCCCCCCCCCCC
No 150
>KOG2593|consensus
Probab=27.37 E-value=52 Score=33.37 Aligned_cols=16 Identities=13% Similarity=0.686 Sum_probs=7.7
Q ss_pred cccccccccccCChHH
Q psy2973 427 AWKCEKCDMRFVTNKN 442 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~~ 442 (514)
.|.|+.|++.|+...+
T Consensus 128 ~Y~Cp~C~kkyt~Lea 143 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEA 143 (436)
T ss_pred cccCCccccchhhhHH
Confidence 3555555555544443
No 151
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=26.63 E-value=27 Score=22.83 Aligned_cols=13 Identities=31% Similarity=1.175 Sum_probs=7.7
Q ss_pred ccccccccCChhh
Q psy2973 461 CNECGKQYKWESF 473 (514)
Q Consensus 461 C~~C~~~f~~~~~ 473 (514)
|+.||+.|++...
T Consensus 11 C~~C~rpf~WRKK 23 (42)
T PF10013_consen 11 CPVCGRPFTWRKK 23 (42)
T ss_pred CcccCCcchHHHH
Confidence 6666666665543
No 152
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=25.93 E-value=43 Score=36.97 Aligned_cols=10 Identities=20% Similarity=0.474 Sum_probs=6.3
Q ss_pred eeccCCCccc
Q psy2973 197 FKCEVCPMKF 206 (514)
Q Consensus 197 ~~C~~C~~~f 206 (514)
..|..|+..+
T Consensus 436 l~C~~Cg~v~ 445 (730)
T COG1198 436 LLCRDCGYIA 445 (730)
T ss_pred eecccCCCcc
Confidence 3577777654
No 153
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=25.92 E-value=19 Score=25.15 Aligned_cols=10 Identities=20% Similarity=0.953 Sum_probs=4.7
Q ss_pred cccccccccc
Q psy2973 428 WKCEKCDMRF 437 (514)
Q Consensus 428 ~~C~~C~~~F 437 (514)
|+|+.|+..+
T Consensus 3 ~~CP~CG~~i 12 (54)
T TIGR01206 3 FECPDCGAEI 12 (54)
T ss_pred cCCCCCCCEE
Confidence 3455555443
No 154
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=25.57 E-value=97 Score=20.58 Aligned_cols=11 Identities=45% Similarity=1.343 Sum_probs=5.4
Q ss_pred CcccccCCCcc
Q psy2973 251 QGRFKCPICGK 261 (514)
Q Consensus 251 ~~~~~C~~C~~ 261 (514)
...|.|..|++
T Consensus 35 ~~~~~C~~C~~ 45 (46)
T PF12760_consen 35 RGRYRCKACRK 45 (46)
T ss_pred CCeEECCCCCC
Confidence 34455555543
No 155
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=25.08 E-value=20 Score=23.07 Aligned_cols=10 Identities=30% Similarity=0.883 Sum_probs=5.7
Q ss_pred cccCCCcccc
Q psy2973 254 FKCPICGKHI 263 (514)
Q Consensus 254 ~~C~~C~~~f 263 (514)
|.|..|+..|
T Consensus 29 y~C~~C~~~w 38 (39)
T PF01096_consen 29 YVCCNCGHRW 38 (39)
T ss_dssp EEESSSTEEE
T ss_pred EEeCCCCCee
Confidence 5666666544
No 156
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=24.96 E-value=44 Score=26.44 Aligned_cols=12 Identities=25% Similarity=0.736 Sum_probs=6.3
Q ss_pred ccccccccccCC
Q psy2973 428 WKCEKCDMRFVT 439 (514)
Q Consensus 428 ~~C~~C~~~F~~ 439 (514)
++|..||..|.+
T Consensus 3 H~CtrCG~vf~~ 14 (112)
T COG3364 3 HQCTRCGEVFDD 14 (112)
T ss_pred ceeccccccccc
Confidence 455555555554
No 157
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=24.40 E-value=11 Score=29.65 Aligned_cols=14 Identities=29% Similarity=0.797 Sum_probs=8.6
Q ss_pred CcccccccccccCC
Q psy2973 426 KAWKCEKCDMRFVT 439 (514)
Q Consensus 426 k~~~C~~C~~~F~~ 439 (514)
+.|.|+.|+..-.+
T Consensus 21 k~FtCp~Cghe~vs 34 (104)
T COG4888 21 KTFTCPRCGHEKVS 34 (104)
T ss_pred ceEecCccCCeeee
Confidence 55777777754433
No 158
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.85 E-value=45 Score=28.31 Aligned_cols=16 Identities=19% Similarity=0.447 Sum_probs=10.7
Q ss_pred CCceeccCCCcccccH
Q psy2973 194 IKRFKCEVCPMKFHRQ 209 (514)
Q Consensus 194 ~k~~~C~~C~~~f~~~ 209 (514)
...+.|..|+..|...
T Consensus 68 p~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 68 EAVLKCRNCGNEWSLK 83 (135)
T ss_pred ceEEECCCCCCEEecc
Confidence 3457888888777543
No 159
>KOG1280|consensus
Probab=23.78 E-value=63 Score=31.70 Aligned_cols=38 Identities=26% Similarity=0.529 Sum_probs=30.1
Q ss_pred CCCcccCcCCcccCChHHHHHHHHhhcCCC---cccccccc
Q psy2973 367 ERPFQCNDCGSTFRLKRHLNSHINRLHKLK---LNCVHCQA 404 (514)
Q Consensus 367 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~---~~C~~C~~ 404 (514)
...|.|++|+..-.+...|..|+...|... ..|++|+.
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~~ 117 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCAA 117 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeecccc
Confidence 347999999998888899999999888753 35777753
No 160
>KOG0782|consensus
Probab=22.95 E-value=21 Score=37.20 Aligned_cols=51 Identities=27% Similarity=0.651 Sum_probs=28.5
Q ss_pred HHHHHHHhhcCCCCcccccccccccccCChhhHHHHhcccccccccccccccCHH
Q psy2973 442 NLKRHMKVHEKNRNMLDSSCNECGKQYKWESFLQKHKKICTTTCKSCGKRYSSIY 496 (514)
Q Consensus 442 ~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~C~~C~~~f~~~~ 496 (514)
.|.+|-.+|..... =+|..||+.|.++..+..-. +-...|+.|...|..+.
T Consensus 240 ~fvrHHWVHrrRqe---GkC~~CgKgFQQKf~FhsKE-ivAisCSWCKqayH~Kv 290 (1004)
T KOG0782|consen 240 GFVRHHWVHRRRQE---GKCNTCGKGFQQKFFFHSKE-IVAISCSWCKQAYHLKV 290 (1004)
T ss_pred cchHHhHhhHhhhc---cccchhhhhhhhheeecccc-EEEEEehHHHHHhhcch
Confidence 55566555544333 25777777776654432211 11256777777777665
No 161
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=22.51 E-value=49 Score=20.49 Aligned_cols=24 Identities=25% Similarity=0.755 Sum_probs=13.6
Q ss_pred cccccccccccCHHHHHHhhhhhhC
Q psy2973 483 TTCKSCGKRYSSIYVLRNHVKSCYG 507 (514)
Q Consensus 483 ~~C~~C~~~f~~~~~l~~H~~~h~g 507 (514)
+.|+.|++.. ..+.+..|+..+.|
T Consensus 5 ~~C~nC~R~v-~a~RfA~HLekCmg 28 (33)
T PF08209_consen 5 VECPNCGRPV-AASRFAPHLEKCMG 28 (33)
T ss_dssp EE-TTTSSEE-EGGGHHHHHHHHTC
T ss_pred EECCCCcCCc-chhhhHHHHHHHHc
Confidence 4566777655 34456666665544
No 162
>KOG3408|consensus
Probab=22.37 E-value=55 Score=26.86 Aligned_cols=24 Identities=33% Similarity=0.672 Sum_probs=19.4
Q ss_pred CcccccccccccCChHHHHHHHHh
Q psy2973 426 KAWKCEKCDMRFVTNKNLKRHMKV 449 (514)
Q Consensus 426 k~~~C~~C~~~F~~~~~L~~H~~~ 449 (514)
..|.|-.|.+-|.+...|..|.++
T Consensus 56 GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 56 GQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred ceeehhhhhhhhcchHHHHHHHhc
Confidence 448888888888888888888774
No 163
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=22.25 E-value=64 Score=33.49 Aligned_cols=29 Identities=28% Similarity=0.401 Sum_probs=23.2
Q ss_pred CceeccCCCcccccHHHHHHHHHHhhcCC
Q psy2973 195 KRFKCEVCPMKFHRQVSLKKHVEAVHLKL 223 (514)
Q Consensus 195 k~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 223 (514)
+-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 34678888888888888888888888654
No 164
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=21.75 E-value=77 Score=36.94 Aligned_cols=9 Identities=33% Similarity=0.670 Sum_probs=4.6
Q ss_pred ccccccccc
Q psy2973 397 LNCVHCQAI 405 (514)
Q Consensus 397 ~~C~~C~~~ 405 (514)
|.|+.||..
T Consensus 693 y~CPsCGae 701 (1337)
T PRK14714 693 YVCPDCGAE 701 (1337)
T ss_pred eeCccCCCc
Confidence 455555543
No 165
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=21.69 E-value=81 Score=22.08 Aligned_cols=26 Identities=23% Similarity=0.312 Sum_probs=12.6
Q ss_pred HHHHhcccccccccccccccCHHHHH
Q psy2973 474 LQKHKKICTTTCKSCGKRYSSIYVLR 499 (514)
Q Consensus 474 L~~H~~~h~~~C~~C~~~f~~~~~l~ 499 (514)
|...+|..++-|-+||..|.+...|.
T Consensus 19 l~~YLR~~~~YC~~Cg~~Y~d~~dL~ 44 (55)
T PF13821_consen 19 LLSYLREEHNYCFWCGTKYDDEEDLE 44 (55)
T ss_pred HHHHHHhhCceeeeeCCccCCHHHHH
Confidence 34444444444555555555554443
No 166
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=21.61 E-value=77 Score=21.64 Aligned_cols=13 Identities=23% Similarity=0.613 Sum_probs=6.1
Q ss_pred ceeccCCCccccc
Q psy2973 196 RFKCEVCPMKFHR 208 (514)
Q Consensus 196 ~~~C~~C~~~f~~ 208 (514)
.+.|..|+..|..
T Consensus 4 ~l~C~dCg~~Fvf 16 (49)
T PF13451_consen 4 TLTCKDCGAEFVF 16 (49)
T ss_pred eEEcccCCCeEEE
Confidence 3445555544433
No 167
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=21.49 E-value=55 Score=27.42 Aligned_cols=15 Identities=27% Similarity=0.647 Sum_probs=8.4
Q ss_pred cccccccccccCChH
Q psy2973 427 AWKCEKCDMRFVTNK 441 (514)
Q Consensus 427 ~~~C~~C~~~F~~~~ 441 (514)
|++|..||+.|.+.+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 345666666665543
No 168
>KOG2272|consensus
Probab=21.10 E-value=48 Score=30.84 Aligned_cols=12 Identities=33% Similarity=0.791 Sum_probs=6.7
Q ss_pred cccCcCCcccCC
Q psy2973 370 FQCNDCGSTFRL 381 (514)
Q Consensus 370 ~~C~~C~~~f~~ 381 (514)
|.|..|.+-|--
T Consensus 222 FvCa~CekPFlG 233 (332)
T KOG2272|consen 222 FVCAKCEKPFLG 233 (332)
T ss_pred eeehhcCCcccc
Confidence 556666655543
No 169
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=20.90 E-value=84 Score=36.65 Aligned_cols=8 Identities=25% Similarity=1.032 Sum_probs=3.9
Q ss_pred ccCCcCCC
Q psy2973 310 SKCSVCDR 317 (514)
Q Consensus 310 ~~C~~C~~ 317 (514)
+.|+.||.
T Consensus 668 rkCPkCG~ 675 (1337)
T PRK14714 668 RRCPSCGT 675 (1337)
T ss_pred EECCCCCC
Confidence 44555554
No 170
>KOG2593|consensus
Probab=20.73 E-value=70 Score=32.48 Aligned_cols=35 Identities=26% Similarity=0.659 Sum_probs=16.6
Q ss_pred CCeecccccccccChhhhhhhhhhccCCcccccCCCcc
Q psy2973 224 KPCKCEVCGKSFSNNSTLKVHSTTHADQGRFKCPICGK 261 (514)
Q Consensus 224 k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~ 261 (514)
..|.|+.|.+.|+....++ ..-.....|.|..|+-
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGG 161 (436)
T ss_pred ccccCCccccchhhhHHHH---hhcccCceEEEecCCC
Confidence 3466666666665443332 2222234456666653
No 171
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=20.24 E-value=92 Score=19.64 Aligned_cols=8 Identities=38% Similarity=1.389 Sum_probs=3.2
Q ss_pred cccccccc
Q psy2973 228 CEVCGKSF 235 (514)
Q Consensus 228 C~~C~~~f 235 (514)
|+.||..|
T Consensus 4 C~~Cg~~Y 11 (36)
T PF05191_consen 4 CPKCGRIY 11 (36)
T ss_dssp ETTTTEEE
T ss_pred cCCCCCcc
Confidence 34444433
No 172
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=20.21 E-value=75 Score=20.50 Aligned_cols=14 Identities=14% Similarity=0.453 Sum_probs=5.8
Q ss_pred cccccccccCChHH
Q psy2973 283 TCTYCEKSFLYKKS 296 (514)
Q Consensus 283 ~C~~C~~~f~~~~~ 296 (514)
.|+.|+-.+.....
T Consensus 21 ~C~~C~G~W~d~~e 34 (41)
T PF13453_consen 21 VCPSCGGIWFDAGE 34 (41)
T ss_pred ECCCCCeEEccHHH
Confidence 34444444443333
No 173
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=20.21 E-value=62 Score=21.14 Aligned_cols=10 Identities=40% Similarity=1.032 Sum_probs=4.4
Q ss_pred ccccCCCccc
Q psy2973 253 RFKCPICGKH 262 (514)
Q Consensus 253 ~~~C~~C~~~ 262 (514)
.+.|+.||..
T Consensus 19 ~~vC~~CG~V 28 (43)
T PF08271_consen 19 ELVCPNCGLV 28 (43)
T ss_dssp EEEETTT-BB
T ss_pred eEECCCCCCE
Confidence 3455555544
No 174
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.19 E-value=37 Score=31.24 Aligned_cols=23 Identities=17% Similarity=0.401 Sum_probs=10.9
Q ss_pred CeecccccccccChhhhhhhhhh
Q psy2973 225 PCKCEVCGKSFSNNSTLKVHSTT 247 (514)
Q Consensus 225 ~~~C~~C~~~f~~~~~L~~H~~~ 247 (514)
.+.|++|+..|.....+..-.++
T Consensus 19 ~ieCPvC~tkFkkeev~tgsiRi 41 (267)
T COG1655 19 TIECPVCNTKFKKEEVKTGSIRI 41 (267)
T ss_pred eeccCcccchhhhhheeccceeE
Confidence 34555555555544443333333
Done!