Query psy18077
Match_columns 451
No_of_seqs 568 out of 3944
Neff 10.2
Searched_HMMs 46136
Date Fri Aug 16 22:03:18 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18077.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18077hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.3E-31 2.9E-36 258.3 3.3 54 398-451 880-935 (958)
2 KOG2462|consensus 100.0 2E-30 4.2E-35 221.7 7.3 145 142-286 126-270 (279)
3 KOG1074|consensus 100.0 1.6E-30 3.4E-35 250.9 5.4 214 206-422 608-932 (958)
4 KOG3608|consensus 100.0 2.6E-28 5.5E-33 213.5 11.9 217 206-448 139-377 (467)
5 KOG3608|consensus 100.0 6.2E-28 1.3E-32 211.1 12.4 264 146-435 69-398 (467)
6 KOG2462|consensus 99.9 7.6E-28 1.7E-32 205.8 5.6 136 205-350 132-270 (279)
7 KOG3623|consensus 99.9 2.3E-24 5.1E-29 205.0 5.1 107 145-252 209-330 (1007)
8 KOG3623|consensus 99.9 1.3E-23 2.8E-28 200.0 9.1 104 177-280 210-330 (1007)
9 KOG3576|consensus 99.7 1.8E-17 4E-22 134.7 0.8 124 175-323 115-240 (267)
10 KOG3576|consensus 99.6 6.8E-17 1.5E-21 131.4 3.3 123 257-391 115-237 (267)
11 PLN03086 PRLI-interacting fact 99.3 9.5E-12 2.1E-16 120.9 7.4 97 232-343 454-560 (567)
12 PLN03086 PRLI-interacting fact 99.2 1.5E-11 3.2E-16 119.6 8.3 145 146-318 407-563 (567)
13 PHA00733 hypothetical protein 99.2 1.1E-11 2.4E-16 98.3 5.3 84 365-448 39-124 (128)
14 PHA00733 hypothetical protein 99.1 1.1E-10 2.3E-15 92.7 4.8 86 321-423 38-125 (128)
15 PHA02768 hypothetical protein; 99.0 3.6E-10 7.7E-15 73.1 2.3 42 398-439 6-47 (55)
16 KOG3993|consensus 98.9 2E-10 4.4E-15 104.3 -0.6 194 203-419 267-480 (500)
17 KOG3993|consensus 98.8 8.2E-10 1.8E-14 100.5 1.2 210 231-447 267-482 (500)
18 PHA02768 hypothetical protein; 98.8 1.5E-09 3.2E-14 70.2 2.0 44 295-340 5-48 (55)
19 PF13465 zf-H2C2_2: Zinc-finge 98.7 2.1E-08 4.5E-13 55.4 2.7 26 310-335 1-26 (26)
20 PHA00732 hypothetical protein 98.5 5.9E-08 1.3E-12 69.5 3.2 50 366-423 1-50 (79)
21 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.2E-07 2.6E-12 52.3 2.2 26 381-409 1-26 (26)
22 PHA00732 hypothetical protein 98.3 3.7E-07 8E-12 65.4 2.9 48 397-447 1-48 (79)
23 PHA00616 hypothetical protein 98.3 2.3E-07 4.9E-12 57.0 1.5 39 366-407 1-39 (44)
24 PHA00616 hypothetical protein 98.3 3.7E-07 8.1E-12 56.1 1.9 34 295-328 1-34 (44)
25 PF05605 zf-Di19: Drought indu 98.2 2.8E-06 6E-11 56.5 4.6 51 367-421 3-53 (54)
26 PF05605 zf-Di19: Drought indu 98.2 2.3E-06 5.1E-11 56.9 4.1 49 397-448 2-54 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.9 4.5E-06 9.7E-11 44.7 1.4 22 424-445 1-22 (23)
28 PF12756 zf-C2H2_2: C2H2 type 97.9 6.6E-06 1.4E-10 63.1 1.9 72 368-446 1-73 (100)
29 PF00096 zf-C2H2: Zinc finger, 97.9 7.9E-06 1.7E-10 43.7 1.6 22 367-388 1-22 (23)
30 PF13894 zf-C2H2_4: C2H2-type 97.7 1.9E-05 4.1E-10 42.7 1.6 24 424-447 1-24 (24)
31 PF13894 zf-C2H2_4: C2H2-type 97.6 3.8E-05 8.3E-10 41.4 1.9 23 367-389 1-23 (24)
32 PF12756 zf-C2H2_2: C2H2 type 97.6 4.2E-05 9.2E-10 58.5 2.7 24 366-389 50-73 (100)
33 PF13912 zf-C2H2_6: C2H2-type 97.4 6.9E-05 1.5E-09 41.8 1.5 17 400-416 4-20 (27)
34 PF13912 zf-C2H2_6: C2H2-type 97.4 8.2E-05 1.8E-09 41.5 1.5 26 366-391 1-26 (27)
35 COG5189 SFP1 Putative transcri 97.3 7.6E-05 1.6E-09 66.1 1.4 53 292-344 346-419 (423)
36 KOG2231|consensus 97.2 0.00051 1.1E-08 68.6 5.8 20 233-253 117-136 (669)
37 COG5189 SFP1 Putative transcri 97.2 0.00014 3.1E-09 64.4 1.4 54 363-416 346-417 (423)
38 KOG1146|consensus 97.2 0.00013 2.8E-09 76.9 0.9 51 294-344 588-639 (1406)
39 KOG2231|consensus 97.1 0.0011 2.3E-08 66.3 7.0 97 179-280 117-233 (669)
40 KOG1146|consensus 97.1 0.00019 4.2E-09 75.6 1.1 128 206-344 439-610 (1406)
41 PF13909 zf-H2C2_5: C2H2-type 97.0 0.00034 7.3E-09 37.7 1.3 24 424-448 1-24 (24)
42 smart00355 ZnF_C2H2 zinc finge 97.0 0.00065 1.4E-08 37.2 2.3 23 424-446 1-23 (26)
43 PF09237 GAGA: GAGA factor; I 96.9 0.00043 9.4E-09 43.6 1.1 28 424-451 25-52 (54)
44 smart00355 ZnF_C2H2 zinc finge 96.8 0.001 2.2E-08 36.4 2.1 23 367-389 1-23 (26)
45 COG5236 Uncharacterized conser 96.7 0.0022 4.8E-08 57.7 4.6 127 177-319 151-305 (493)
46 PF09237 GAGA: GAGA factor; I 96.7 0.0031 6.8E-08 39.8 3.7 33 291-323 20-52 (54)
47 KOG2785|consensus 96.6 0.0031 6.8E-08 58.0 5.1 49 397-445 166-242 (390)
48 PF13909 zf-H2C2_5: C2H2-type 96.5 0.0014 3.1E-08 35.2 1.5 23 367-390 1-23 (24)
49 PF12874 zf-met: Zinc-finger o 96.5 0.0012 2.6E-08 35.9 1.2 22 424-445 1-22 (25)
50 PF12874 zf-met: Zinc-finger o 96.5 0.0013 2.7E-08 35.8 1.1 22 398-419 1-22 (25)
51 PRK04860 hypothetical protein; 96.2 0.0028 6E-08 52.4 2.1 39 294-336 118-156 (160)
52 KOG2785|consensus 96.2 0.012 2.7E-07 54.2 6.2 54 366-419 166-242 (390)
53 COG5236 Uncharacterized conser 96.1 0.008 1.7E-07 54.3 4.6 128 146-286 151-308 (493)
54 KOG2482|consensus 96.0 0.013 2.8E-07 53.0 5.2 49 296-344 280-355 (423)
55 PRK04860 hypothetical protein; 95.8 0.0057 1.2E-07 50.5 2.0 38 366-410 119-156 (160)
56 KOG2482|consensus 95.7 0.031 6.6E-07 50.6 6.4 162 147-318 145-357 (423)
57 PF12171 zf-C2H2_jaz: Zinc-fin 95.7 0.0034 7.3E-08 34.8 0.3 22 424-445 2-23 (27)
58 PF12171 zf-C2H2_jaz: Zinc-fin 95.3 0.0041 8.8E-08 34.5 -0.2 21 398-418 2-22 (27)
59 COG5048 FOG: Zn-finger [Genera 93.8 0.016 3.4E-07 57.3 -0.3 145 294-439 288-461 (467)
60 PF13913 zf-C2HC_2: zinc-finge 93.8 0.048 1E-06 29.5 1.7 19 399-418 4-22 (25)
61 PF13913 zf-C2HC_2: zinc-finge 93.7 0.056 1.2E-06 29.2 1.9 21 367-388 3-23 (25)
62 COG5048 FOG: Zn-finger [Genera 93.3 0.028 6.1E-07 55.5 0.5 157 231-412 289-460 (467)
63 smart00451 ZnF_U1 U1-like zinc 92.7 0.086 1.9E-06 31.1 1.8 22 397-418 3-24 (35)
64 TIGR00622 ssl1 transcription f 92.6 0.13 2.7E-06 39.3 3.0 23 424-446 82-104 (112)
65 smart00451 ZnF_U1 U1-like zinc 92.1 0.11 2.5E-06 30.6 1.8 23 366-388 3-25 (35)
66 TIGR00622 ssl1 transcription f 91.8 0.33 7.1E-06 37.1 4.5 26 396-421 80-105 (112)
67 KOG2893|consensus 91.4 0.059 1.3E-06 46.2 0.1 46 368-420 12-58 (341)
68 PF12013 DUF3505: Protein of u 89.9 0.51 1.1E-05 36.5 4.2 26 366-391 80-109 (109)
69 cd00350 rubredoxin_like Rubred 89.9 0.18 3.9E-06 29.3 1.2 24 296-331 2-25 (33)
70 KOG2893|consensus 89.9 0.093 2E-06 45.0 -0.0 43 205-251 12-54 (341)
71 KOG4173|consensus 89.0 0.26 5.7E-06 41.4 2.0 49 205-253 108-168 (253)
72 PF12013 DUF3505: Protein of u 88.0 1 2.3E-05 34.7 4.7 25 424-448 81-109 (109)
73 KOG4173|consensus 86.6 0.18 4E-06 42.3 -0.3 86 294-387 78-167 (253)
74 PF09538 FYDLN_acid: Protein o 84.4 0.5 1.1E-05 36.2 1.2 30 367-410 10-39 (108)
75 COG4049 Uncharacterized protei 83.9 0.66 1.4E-05 30.0 1.4 30 290-319 12-41 (65)
76 PF09538 FYDLN_acid: Protein o 82.5 1 2.2E-05 34.5 2.2 11 148-158 11-21 (108)
77 COG4049 Uncharacterized protei 82.3 0.64 1.4E-05 30.0 0.8 27 364-390 15-41 (65)
78 PF09986 DUF2225: Uncharacteri 82.1 0.67 1.5E-05 40.7 1.2 57 322-379 4-61 (214)
79 COG1997 RPL43A Ribosomal prote 81.9 0.48 1E-05 34.0 0.2 30 397-434 35-64 (89)
80 cd00729 rubredoxin_SM Rubredox 80.9 0.92 2E-05 26.6 1.1 10 296-305 3-12 (34)
81 smart00531 TFIIE Transcription 80.6 0.69 1.5E-05 37.9 0.7 12 367-378 100-111 (147)
82 PF10571 UPF0547: Uncharacteri 79.7 0.95 2.1E-05 24.7 0.8 9 205-213 16-24 (26)
83 COG2888 Predicted Zn-ribbon RN 78.1 1.7 3.6E-05 28.8 1.7 13 366-378 27-39 (61)
84 PF09986 DUF2225: Uncharacteri 77.5 0.47 1E-05 41.6 -1.2 22 366-387 5-26 (214)
85 PF13717 zinc_ribbon_4: zinc-r 76.6 1.1 2.5E-05 26.6 0.6 12 368-379 4-15 (36)
86 PHA00626 hypothetical protein 75.5 1.5 3.3E-05 28.5 1.0 13 366-378 23-35 (59)
87 PF15269 zf-C2H2_7: Zinc-finge 74.4 2 4.3E-05 26.4 1.3 21 424-444 21-41 (54)
88 TIGR00373 conserved hypothetic 74.2 3.2 6.9E-05 34.5 2.9 19 321-339 107-125 (158)
89 TIGR00373 conserved hypothetic 73.8 1.8 3.9E-05 36.0 1.4 16 367-382 110-125 (158)
90 COG5151 SSL1 RNA polymerase II 73.5 1.7 3.6E-05 39.2 1.2 48 399-446 364-411 (421)
91 TIGR02300 FYDLN_acid conserved 73.3 2 4.2E-05 33.5 1.3 31 366-410 9-39 (129)
92 PF06524 NOA36: NOA36 protein; 73.0 1.2 2.6E-05 39.1 0.1 14 318-331 137-150 (314)
93 COG5151 SSL1 RNA polymerase II 72.7 4.3 9.3E-05 36.7 3.5 23 397-419 388-410 (421)
94 PRK06266 transcription initiat 72.1 2 4.3E-05 36.5 1.3 17 366-382 117-133 (178)
95 TIGR02098 MJ0042_CXXC MJ0042 f 71.8 2.9 6.2E-05 25.1 1.6 11 295-305 25-35 (38)
96 KOG2186|consensus 71.7 2.6 5.5E-05 37.0 1.8 47 366-418 3-49 (276)
97 PF13719 zinc_ribbon_5: zinc-r 71.2 3.3 7.2E-05 24.8 1.7 32 297-333 4-35 (37)
98 smart00734 ZnF_Rad18 Rad18-lik 70.7 3.8 8.3E-05 22.3 1.8 19 368-387 3-21 (26)
99 COG1592 Rubrerythrin [Energy p 70.2 2.6 5.7E-05 35.0 1.5 23 295-330 134-156 (166)
100 smart00659 RPOLCX RNA polymera 69.6 3.6 7.8E-05 25.7 1.7 26 296-332 3-28 (44)
101 PRK00464 nrdR transcriptional 69.6 0.86 1.9E-05 37.4 -1.4 16 397-412 28-43 (154)
102 PRK06266 transcription initiat 69.5 4.1 8.9E-05 34.6 2.6 17 322-338 116-132 (178)
103 COG1592 Rubrerythrin [Energy p 69.1 2.9 6.3E-05 34.7 1.6 13 291-303 145-157 (166)
104 TIGR02300 FYDLN_acid conserved 68.4 3.4 7.4E-05 32.3 1.7 11 148-158 11-21 (129)
105 PRK00398 rpoP DNA-directed RNA 68.1 3.7 8E-05 25.9 1.6 10 260-269 4-13 (46)
106 KOG2807|consensus 67.5 7 0.00015 35.8 3.7 36 294-343 275-310 (378)
107 PF02892 zf-BED: BED zinc fing 66.8 4.5 9.6E-05 25.3 1.8 10 367-376 17-26 (45)
108 PRK14890 putative Zn-ribbon RN 65.5 4.3 9.3E-05 27.0 1.5 10 366-375 25-34 (59)
109 smart00834 CxxC_CXXC_SSSS Puta 64.7 1.6 3.5E-05 26.6 -0.6 12 367-378 6-17 (41)
110 PF03604 DNA_RNApol_7kD: DNA d 64.2 3.7 8.1E-05 23.6 0.9 11 396-406 16-26 (32)
111 COG1996 RPC10 DNA-directed RNA 64.0 4.1 8.9E-05 26.0 1.2 30 293-332 4-33 (49)
112 smart00531 TFIIE Transcription 63.8 9.8 0.00021 31.2 3.7 11 232-242 100-110 (147)
113 KOG2186|consensus 63.7 3.3 7.1E-05 36.4 0.9 46 204-252 4-49 (276)
114 COG1198 PriA Primosomal protei 63.4 3.8 8.1E-05 42.8 1.4 49 178-268 436-484 (730)
115 TIGR02605 CxxC_CxxC_SSSS putat 63.4 1.9 4E-05 28.1 -0.5 12 367-378 6-17 (52)
116 KOG2807|consensus 63.0 9 0.00019 35.1 3.5 125 160-319 226-369 (378)
117 PF01780 Ribosomal_L37ae: Ribo 62.6 1.2 2.7E-05 32.5 -1.5 30 397-434 35-64 (90)
118 smart00614 ZnF_BED BED zinc fi 62.5 5.1 0.00011 25.8 1.5 24 366-389 18-47 (50)
119 PF09723 Zn-ribbon_8: Zinc rib 61.4 4.4 9.4E-05 25.0 0.9 14 323-336 5-18 (42)
120 KOG4167|consensus 61.3 9.7 0.00021 38.9 3.8 26 145-170 791-816 (907)
121 KOG2593|consensus 60.4 7.1 0.00015 37.4 2.6 19 321-339 126-144 (436)
122 TIGR01384 TFS_arch transcripti 60.1 1.5 3.3E-05 33.4 -1.5 13 323-335 16-28 (104)
123 PF15269 zf-C2H2_7: Zinc-finge 58.8 7 0.00015 24.1 1.5 21 398-418 21-41 (54)
124 PF05443 ROS_MUCR: ROS/MUCR tr 56.8 4.6 0.0001 32.2 0.6 22 398-422 73-94 (132)
125 PRK09678 DNA-binding transcrip 56.7 2.3 5.1E-05 29.8 -0.9 15 395-409 25-41 (72)
126 TIGR00280 L37a ribosomal prote 56.1 2.9 6.3E-05 30.6 -0.6 31 397-435 35-65 (91)
127 KOG4377|consensus 55.9 8.7 0.00019 36.3 2.3 117 293-422 269-428 (480)
128 PF00301 Rubredoxin: Rubredoxi 55.2 6.6 0.00014 25.0 1.0 13 324-336 2-14 (47)
129 cd00730 rubredoxin Rubredoxin; 54.9 6.4 0.00014 25.4 0.9 12 324-335 2-13 (50)
130 PTZ00255 60S ribosomal protein 54.3 3.7 8.1E-05 30.0 -0.2 13 422-434 53-65 (90)
131 KOG1280|consensus 54.0 8 0.00017 35.8 1.7 38 366-404 79-116 (381)
132 PRK00432 30S ribosomal protein 51.7 7.3 0.00016 25.2 0.8 12 396-407 36-47 (50)
133 KOG2272|consensus 50.7 2.1 4.5E-05 37.4 -2.4 17 322-338 279-295 (332)
134 PF04959 ARS2: Arsenite-resist 50.0 4.6 0.0001 35.2 -0.4 24 396-419 76-99 (214)
135 COG4530 Uncharacterized protei 49.9 8.1 0.00018 29.1 0.9 28 367-408 10-37 (129)
136 PRK03976 rpl37ae 50S ribosomal 49.5 4.2 9.1E-05 29.8 -0.6 31 397-435 36-66 (90)
137 PF05443 ROS_MUCR: ROS/MUCR tr 49.3 10 0.00022 30.2 1.5 23 366-391 72-94 (132)
138 PF14353 CpXC: CpXC protein 49.1 13 0.00028 29.6 2.1 24 396-419 37-60 (128)
139 PRK04023 DNA polymerase II lar 47.8 14 0.0003 39.7 2.5 10 323-332 626-635 (1121)
140 PRK04023 DNA polymerase II lar 47.2 16 0.00035 39.2 2.9 10 424-433 1038-1047(1121)
141 KOG2272|consensus 47.1 9.2 0.0002 33.6 1.0 77 258-336 98-176 (332)
142 KOG4167|consensus 46.1 5 0.00011 40.9 -0.9 26 396-421 791-816 (907)
143 PF09845 DUF2072: Zn-ribbon co 44.1 11 0.00025 29.7 1.0 15 366-380 1-15 (131)
144 COG0068 HypF Hydrogenase matur 43.2 2.6 5.5E-05 43.0 -3.4 44 298-342 126-170 (750)
145 COG3357 Predicted transcriptio 42.5 14 0.00031 26.9 1.2 14 258-271 57-70 (97)
146 PF02176 zf-TRAF: TRAF-type zi 42.1 15 0.00033 24.4 1.3 6 295-300 38-43 (60)
147 KOG2593|consensus 41.1 27 0.00058 33.6 3.1 14 231-244 128-141 (436)
148 TIGR00595 priA primosomal prot 40.1 13 0.00027 37.6 0.9 48 178-267 214-261 (505)
149 KOG2071|consensus 39.8 15 0.00033 36.8 1.3 27 229-255 416-442 (579)
150 COG4957 Predicted transcriptio 37.8 18 0.00039 28.5 1.2 21 398-421 77-97 (148)
151 PF08274 PhnA_Zn_Ribbon: PhnA 37.8 17 0.00037 20.6 0.8 11 321-331 17-27 (30)
152 PRK14714 DNA polymerase II lar 37.5 32 0.00069 38.1 3.3 10 367-376 710-719 (1337)
153 PF13240 zinc_ribbon_2: zinc-r 36.9 15 0.00032 19.3 0.4 6 400-405 16-21 (23)
154 KOG4377|consensus 36.8 18 0.00039 34.4 1.2 24 367-390 402-427 (480)
155 PF12907 zf-met2: Zinc-binding 36.7 13 0.00027 22.7 0.2 7 425-431 3-9 (40)
156 PF07754 DUF1610: Domain of un 36.7 16 0.00034 19.5 0.5 9 294-302 15-23 (24)
157 PRK14873 primosome assembly pr 36.5 16 0.00035 38.1 1.0 38 205-268 394-431 (665)
158 PF08271 TF_Zn_Ribbon: TFIIB z 36.1 17 0.00036 22.5 0.6 11 366-376 19-29 (43)
159 PF15135 UPF0515: Uncharacteri 34.8 33 0.00072 30.3 2.4 19 362-380 151-169 (278)
160 PRK14714 DNA polymerase II lar 34.7 30 0.00065 38.3 2.6 9 232-240 693-701 (1337)
161 COG3364 Zn-ribbon containing p 34.6 25 0.00054 26.2 1.4 13 177-189 2-14 (112)
162 KOG3408|consensus 34.1 23 0.00049 27.4 1.2 22 397-418 57-78 (129)
163 PF10446 DUF2457: Protein of u 33.3 58 0.0013 31.6 4.0 11 3-13 37-47 (458)
164 PF04959 ARS2: Arsenite-resist 33.2 27 0.00059 30.5 1.7 26 365-390 76-101 (214)
165 KOG1280|consensus 32.2 35 0.00077 31.7 2.3 36 395-430 77-116 (381)
166 COG1998 RPS31 Ribosomal protei 32.1 27 0.00059 22.2 1.1 10 397-406 37-46 (51)
167 KOG3408|consensus 31.6 31 0.00068 26.7 1.6 27 291-317 53-79 (129)
168 PF14446 Prok-RING_1: Prokaryo 31.4 27 0.00059 22.9 1.1 10 148-157 7-16 (54)
169 PF14311 DUF4379: Domain of un 31.3 31 0.00068 22.6 1.4 14 366-379 28-41 (55)
170 PF05290 Baculo_IE-1: Baculovi 31.1 19 0.00042 28.4 0.4 11 400-410 124-134 (140)
171 COG1198 PriA Primosomal protei 30.8 36 0.00078 35.8 2.4 10 177-186 462-471 (730)
172 PF13878 zf-C2H2_3: zinc-finge 30.6 48 0.001 20.3 2.0 24 147-170 14-39 (41)
173 TIGR01206 lysW lysine biosynth 28.2 30 0.00066 22.7 0.9 9 296-304 23-31 (54)
174 PF07282 OrfB_Zn_ribbon: Putat 28.1 28 0.00061 24.0 0.8 30 367-409 29-58 (69)
175 PF05191 ADK_lid: Adenylate ki 28.1 15 0.00033 21.8 -0.4 31 296-334 2-32 (36)
176 smart00440 ZnF_C2C2 C2C2 Zinc 28.0 20 0.00043 21.8 0.0 14 364-377 26-39 (40)
177 KOG0957|consensus 27.6 97 0.0021 30.5 4.4 11 366-376 544-554 (707)
178 KOG4124|consensus 26.9 13 0.00027 34.5 -1.4 51 293-343 347-418 (442)
179 PF07649 C1_3: C1-like domain; 26.6 31 0.00068 19.3 0.7 11 294-304 14-24 (30)
180 PRK05580 primosome assembly pr 26.5 32 0.00069 36.2 1.2 47 178-266 382-428 (679)
181 TIGR00100 hypA hydrogenase nic 26.3 27 0.00058 27.2 0.5 13 366-378 70-82 (115)
182 PRK12380 hydrogenase nickel in 25.8 33 0.00071 26.6 0.9 26 294-331 69-94 (113)
183 smart00154 ZnF_AN1 AN1-like Zi 25.5 33 0.00071 20.8 0.6 13 397-409 12-24 (39)
184 PF04931 DNA_pol_phi: DNA poly 25.5 68 0.0015 34.5 3.5 6 3-8 628-633 (784)
185 COG4957 Predicted transcriptio 25.3 46 0.001 26.3 1.6 23 205-230 78-100 (148)
186 COG3677 Transposase and inacti 25.2 36 0.00077 27.2 1.0 13 231-243 53-65 (129)
187 COG3091 SprT Zn-dependent meta 25.0 34 0.00075 27.8 0.9 35 293-332 115-149 (156)
188 PF07975 C1_4: TFIIH C1-like d 25.0 16 0.00035 23.7 -0.8 23 366-388 21-43 (51)
189 PRK03824 hypA hydrogenase nick 24.8 25 0.00054 28.3 0.1 14 294-307 69-82 (135)
190 COG1571 Predicted DNA-binding 24.2 39 0.00085 32.7 1.2 31 367-411 351-381 (421)
191 PF04606 Ogr_Delta: Ogr/Delta- 24.2 12 0.00026 23.7 -1.5 6 429-434 33-38 (47)
192 PF04216 FdhE: Protein involve 24.0 6.2 0.00014 36.5 -4.1 14 320-333 194-207 (290)
193 PF12760 Zn_Tnp_IS1595: Transp 24.0 61 0.0013 20.3 1.7 7 232-238 19-25 (46)
194 COG1675 TFA1 Transcription ini 23.8 92 0.002 26.3 3.2 17 319-335 109-125 (176)
195 KOG0717|consensus 23.4 40 0.00087 32.8 1.1 21 398-418 293-313 (508)
196 PF01096 TFIIS_C: Transcriptio 23.3 12 0.00027 22.6 -1.5 13 364-376 26-38 (39)
197 PRK05978 hypothetical protein; 22.8 38 0.00083 27.6 0.8 10 368-377 54-63 (148)
198 KOG0402|consensus 22.7 29 0.00063 24.8 0.1 13 422-434 53-65 (92)
199 COG2331 Uncharacterized protei 22.5 38 0.00083 23.8 0.6 10 296-305 13-22 (82)
200 KOG0943|consensus 22.4 84 0.0018 34.8 3.2 9 155-163 1926-1934(3015)
201 PF09963 DUF2197: Uncharacteri 22.4 27 0.00059 23.1 -0.1 37 367-406 3-40 (56)
202 PF01363 FYVE: FYVE zinc finge 22.3 48 0.001 22.8 1.1 28 295-334 9-36 (69)
203 KOG0978|consensus 22.3 40 0.00087 35.0 1.0 20 424-443 679-698 (698)
204 COG1779 C4-type Zn-finger prot 22.2 15 0.00032 31.3 -1.7 8 367-374 15-22 (201)
205 PF13451 zf-trcl: Probable zin 22.0 64 0.0014 20.7 1.5 15 294-308 3-17 (49)
206 KOG4124|consensus 22.0 21 0.00046 33.1 -0.9 58 258-315 348-418 (442)
207 PF09332 Mcm10: Mcm10 replicat 21.9 7.6 0.00017 36.5 -3.9 58 366-432 252-312 (344)
208 KOG0717|consensus 21.8 49 0.0011 32.3 1.4 21 367-387 293-313 (508)
209 smart00661 RPOL9 RNA polymeras 21.6 55 0.0012 20.9 1.2 8 296-303 21-28 (52)
210 PF04780 DUF629: Protein of un 21.5 56 0.0012 32.3 1.7 21 398-418 58-78 (466)
211 COG4888 Uncharacterized Zn rib 21.2 21 0.00046 26.6 -0.9 10 324-333 47-56 (104)
212 COG4896 Uncharacterized protei 21.2 68 0.0015 21.5 1.5 41 296-336 3-44 (68)
213 TIGR00595 priA primosomal prot 20.7 72 0.0016 32.3 2.4 23 176-211 239-261 (505)
214 PF13453 zf-TFIIB: Transcripti 20.3 44 0.00096 20.3 0.5 18 294-311 18-35 (41)
215 COG1571 Predicted DNA-binding 20.1 57 0.0012 31.6 1.4 14 323-336 367-380 (421)
216 PRK14873 primosome assembly pr 20.0 70 0.0015 33.5 2.2 23 176-212 409-431 (665)
No 1
>KOG1074|consensus
Probab=99.97 E-value=1.3e-31 Score=258.28 Aligned_cols=54 Identities=31% Similarity=0.552 Sum_probs=50.7
Q ss_pred cccCccccccCChHHHHHHHHHcCCc--cccccccccCCChhhHHHHHhhhcCCCC
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLRTHNQI--CQCPHCSKSFSRPWLLQGHIRTHTGGTN 451 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~~H~~~--~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 451 (451)
..|..|++.|.+...|..|+|+|+++ |.|.+|++.|..+..|+.||.+|....|
T Consensus 880 h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH~w~q~ 935 (958)
T KOG1074|consen 880 HVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTHMWVQP 935 (958)
T ss_pred hhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhccccccCC
Confidence 78999999999999999999999876 9999999999999999999999987653
No 2
>KOG2462|consensus
Probab=99.96 E-value=2e-30 Score=221.66 Aligned_cols=145 Identities=66% Similarity=1.180 Sum_probs=138.0
Q ss_pred CCCccccccccccccCChHhHHHHHhhhcCCCCCCCcccccccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHH
Q psy18077 142 NNNESFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQG 221 (451)
Q Consensus 142 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~ 221 (451)
.....|+|..||+.|.+..+|.+|.+.|..-+..+.+.|+.|++.|.+...|..|+++|.-++.|.+||+.|.+++.|+-
T Consensus 126 ~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l~c~C~iCGKaFSRPWLLQG 205 (279)
T KOG2462|consen 126 AKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTLPCECGICGKAFSRPWLLQG 205 (279)
T ss_pred ccCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCCCcccccccccccchHHhhc
Confidence 34567999999999999999999999999888889999999999999999999999999999999999999999999999
Q ss_pred HHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchhhcCCCC
Q psy18077 222 HIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKS 286 (451)
Q Consensus 222 H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 286 (451)
|+|+|+||+||.|+.|+++|..+++|+.||++|.+.++|+|..|++.|..++.|.+|...-|.+.
T Consensus 206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~C~~~ 270 (279)
T KOG2462|consen 206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESACLKY 270 (279)
T ss_pred ccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhccccc
Confidence 99999999999999999999999999999999999999999999999999999999988765543
No 3
>KOG1074|consensus
Probab=99.96 E-value=1.6e-30 Score=250.92 Aligned_cols=214 Identities=29% Similarity=0.502 Sum_probs=152.2
Q ss_pred cCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCC----CCeecc---cccccccChHHHHhc
Q psy18077 206 CPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKV----KPFVCE---KCGKAFALKSYLYKH 278 (451)
Q Consensus 206 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----~~~~C~---~C~~~f~~~~~l~~H 278 (451)
|-+|-++..-+..|+.|.++|+|++||+|.+||+.|.++.+|+.||-.|... .++.|+ +|.+.|.....|..|
T Consensus 608 CiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQh 687 (958)
T KOG1074|consen 608 CIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQH 687 (958)
T ss_pred eeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccce
Confidence 8888888888888888888888888888888888888888888888877543 347788 888888888888888
Q ss_pred hhhcCCCCcCCC-----CCCcCeeccccccccCChHHHHHHHHHhc----------------CCC----ceecccCcccc
Q psy18077 279 EDSSCLKSYDSS-----SSEKPFKCKVCDKAFADKSNLRAHIQTHS----------------KVK----PFVCEKCGKAF 333 (451)
Q Consensus 279 ~~~~~~~~~~~~-----~~~~~~~C~~C~~~f~~~~~l~~H~~~h~----------------~~~----~~~C~~C~~~f 333 (451)
+++|.+...... ...-.-+|..|.+.|.....+..++..|. .+. +..+..|+..+
T Consensus 688 IriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~~~ 767 (958)
T KOG1074|consen 688 IRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCGREL 767 (958)
T ss_pred EEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccccccc
Confidence 888764332211 11122468888888877777777776552 112 46677787777
Q ss_pred cChhhHHhcccc------------------------cCCCcc-----C-----CCCC--------------c-------c
Q psy18077 334 ALKSYLYKHEDS------------------------SCLKSY-----D-----SSSN--------------G-------H 358 (451)
Q Consensus 334 ~~~~~L~~H~~~------------------------~~~~~~-----~-----~~~~--------------~-------~ 358 (451)
.....+..+-.. .-++.+ . ...+ . .
T Consensus 768 ~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~ 847 (958)
T KOG1074|consen 768 EGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKTNE 847 (958)
T ss_pred CcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccccccc
Confidence 766555444320 000000 0 0000 0 0
Q ss_pred cC----------------------ccc--ccccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHH
Q psy18077 359 LN----------------------KFT--TISFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYS 414 (451)
Q Consensus 359 ~~----------------------~~~--~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~ 414 (451)
+. ... .....|..|++.|...++|..|+++|++ +|||.|.+|++.|..+..|+
T Consensus 848 ~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg---~KPF~C~fC~~aFttrgnLK 924 (958)
T KOG1074|consen 848 ITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTG---PKPFFCHFCEEAFTTRGNLK 924 (958)
T ss_pred ccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCC---CCCccchhhhhhhhhhhhhh
Confidence 00 000 0048899999999999999999999997 99999999999999999999
Q ss_pred HHHHHcCC
Q psy18077 415 MHLRTHNQ 422 (451)
Q Consensus 415 ~H~~~H~~ 422 (451)
.||.+|..
T Consensus 925 vHMgtH~w 932 (958)
T KOG1074|consen 925 VHMGTHMW 932 (958)
T ss_pred hhhccccc
Confidence 99999964
No 4
>KOG3608|consensus
Probab=99.95 E-value=2.6e-28 Score=213.49 Aligned_cols=217 Identities=27% Similarity=0.527 Sum_probs=139.1
Q ss_pred cCcCccccCCHHHHHHHHHHhcC------------CCC-cccc--cchhhcCChHhHHHHHHHcCCCCCeeccccccccc
Q psy18077 206 CPHCSKSFSRPWLLQGHIRTHTG------------EKP-FKCK--VCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFA 270 (451)
Q Consensus 206 C~~C~~~f~~~~~L~~H~~~h~~------------~~~-~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 270 (451)
...|+..|.+...|..|+..|.- ++| +.|. .|.+.|.++..|+.|++.|.+++...|+.||..|.
T Consensus 139 WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~ 218 (467)
T KOG3608|consen 139 WEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFR 218 (467)
T ss_pred hhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhc
Confidence 34566666666666666554421 111 3343 36677777777777777777777777777777777
Q ss_pred ChHHHHhchhhcCCCCcCCCCCCcCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccChhhHHhcccccCCCc
Q psy18077 271 LKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKS 350 (451)
Q Consensus 271 ~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~ 350 (451)
++..|..|.+.. +.....+|.|..|.+.|.+...|+.|+..|- .-|+|++|+.+....+.|..|++.
T Consensus 219 ~~tkl~DH~rRq------t~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~----- 285 (467)
T KOG3608|consen 219 TKTKLFDHLRRQ------TELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRY----- 285 (467)
T ss_pred cccHHHHHHHhh------hhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHh-----
Confidence 777777776543 1133446777777777777777777777663 357777777777777777777652
Q ss_pred cCCCCCcccCcccccccccccccccCCChHHHHHHHHhccCCCCCCCcccCc--cccccCChHHHHHHHHHcC-Cc----
Q psy18077 351 YDSSSNGHLNKFTTISFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQ--CGKLYVSMPAYSMHLRTHN-QI---- 423 (451)
Q Consensus 351 ~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~H~-~~---- 423 (451)
+....+||+|..|++.|.+.+.|.+|+.+|. +..|+|.. |.++|.+..+|++|++.++ +.
T Consensus 286 ---------rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS----~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~ 352 (467)
T KOG3608|consen 286 ---------RHSKDKPFKCDECDTRCVRESDLAKHVQVHS----KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPIL 352 (467)
T ss_pred ---------hhccCCCccccchhhhhccHHHHHHHHHhcc----ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCc
Confidence 2223446777777777777777777777665 45677766 7777777777777776554 21
Q ss_pred cccccccccCCChhhHHHHHhhhcC
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTHTG 448 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h~~ 448 (451)
|+|..|++.|++-.+|..|++..||
T Consensus 353 Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 353 YACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred eeeecchhhhccchhHHHHHHHhhc
Confidence 7777777777777777777655443
No 5
>KOG3608|consensus
Probab=99.95 E-value=6.2e-28 Score=211.08 Aligned_cols=264 Identities=28% Similarity=0.528 Sum_probs=221.4
Q ss_pred cccccc--cccccCC-hHhHHHHHhhhc-------------------C------------CCCCCCccc--ccccccccC
Q psy18077 146 SFQCRD--CGKHYST-SSNLARHRQIHR-------------------S------------PDDTKARTC--PQCGKLYVS 189 (451)
Q Consensus 146 ~~~C~~--C~~~f~~-~~~l~~H~~~h~-------------------~------------~~~~~~~~C--~~C~~~f~~ 189 (451)
.++|.. |++...+ ...|.+|.-.|. + |.....|.| ..|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 567754 8887766 588999976551 0 011223455 679999999
Q ss_pred HHHHHHHHHHcCC-----------------CcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHH
Q psy18077 190 MPAYSMHLRTHNQ-----------------ICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQ 252 (451)
Q Consensus 190 ~~~l~~H~~~h~~-----------------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 252 (451)
...|..|...|.- ++....|.+.|.++..|++|++.|++++...|+.||..|.++..|..|++
T Consensus 149 ~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~r 228 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLR 228 (467)
T ss_pred HHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHH
Confidence 9999999877632 34466799999999999999999999999999999999999999999987
Q ss_pred Hc--CCCCCeecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccccccccCChHHHHHHHHH-hcCCCceecccC
Q psy18077 253 TH--SKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQT-HSKVKPFVCEKC 329 (451)
Q Consensus 253 ~h--~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C 329 (451)
.- ....+|+|..|.+.|.+...|..|+..| -.-|+|+.|+.+....+.|..|++. |...+||+|+.|
T Consensus 229 Rqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rH----------vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~C 298 (467)
T KOG3608|consen 229 RQTELNTNSFQCAQCFKRFATEKLLKSHVVRH----------VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDEC 298 (467)
T ss_pred hhhhhcCCchHHHHHHHHHhHHHHHHHHHHHh----------hhcccccccccCCCChHHHHHHHHhhhccCCCccccch
Confidence 64 3456899999999999999999999877 4579999999999999999999986 888999999999
Q ss_pred cccccChhhHHhcccccCCCccCCCCCcccCccccccccccc--ccccCCChHHHHHHHHhccCCCCCCCcccCcccccc
Q psy18077 330 GKAFALKSYLYKHEDSSCLKSYDSSSNGHLNKFTTISFQCRD--CGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLY 407 (451)
Q Consensus 330 ~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f 407 (451)
++.|.+.+.|.+|. ..|....|+|.. |..+|.+...|++|++.++.+.++-+|.|-.|++.|
T Consensus 299 d~~c~~esdL~kH~----------------~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~f 362 (467)
T KOG3608|consen 299 DTRCVRESDLAKHV----------------QVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFF 362 (467)
T ss_pred hhhhccHHHHHHHH----------------HhccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhh
Confidence 99999999999995 345566899988 999999999999999999976778899999999999
Q ss_pred CChHHHHHHHHHcCCc--------cccccccccCCC
Q psy18077 408 VSMPAYSMHLRTHNQI--------CQCPHCSKSFSR 435 (451)
Q Consensus 408 ~~~~~l~~H~~~H~~~--------~~C~~C~~~f~~ 435 (451)
++-.+|..|++.-++- |.=..|.-+|.+
T Consensus 363 t~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~mR 398 (467)
T KOG3608|consen 363 TSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFMR 398 (467)
T ss_pred ccchhHHHHHHHhhcccCCCCCCceeeeeccCceee
Confidence 9999999998765431 444555555543
No 6
>KOG2462|consensus
Probab=99.94 E-value=7.6e-28 Score=205.83 Aligned_cols=136 Identities=51% Similarity=0.929 Sum_probs=109.8
Q ss_pred ccCcCccccCCHHHHHHHHHHhcC---CCCcccccchhhcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchhh
Q psy18077 205 QCPHCSKSFSRPWLLQGHIRTHTG---EKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDS 281 (451)
Q Consensus 205 ~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 281 (451)
+|+.||+.+.+...|.+|..+|.. .+.+.|.+|++.|.+...|+.|+++|. .++.|.+||+.|...+-|+-|++.
T Consensus 132 ~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRT 209 (279)
T KOG2462|consen 132 KCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRT 209 (279)
T ss_pred eccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhccccc
Confidence 344444444444444444444432 455788888888888888888888886 678889999999999999999887
Q ss_pred cCCCCcCCCCCCcCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccChhhHHhcccccCCCc
Q psy18077 282 SCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKS 350 (451)
Q Consensus 282 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~ 350 (451)
| +|||||.|..|++.|.++++|+.||++|.+.|+|+|..|+|.|..++.|.+|..+-|.+.
T Consensus 210 H--------TGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~C~~~ 270 (279)
T KOG2462|consen 210 H--------TGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESACLKY 270 (279)
T ss_pred c--------cCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhccccc
Confidence 7 999999999999999999999999999999999999999999999999999998777654
No 7
>KOG3623|consensus
Probab=99.90 E-value=2.3e-24 Score=204.96 Aligned_cols=107 Identities=25% Similarity=0.612 Sum_probs=96.1
Q ss_pred ccccccccccccCChHhHHHHHhhhcCCCCCCCcccccccccccCHHHHHHHHHHcCC---------------CcccCcC
Q psy18077 145 ESFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQ---------------ICQCPHC 209 (451)
Q Consensus 145 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~---------------~~~C~~C 209 (451)
....|++|.+.+.....|+.|++.-+. .....|.|.+|.++|.....|..||.+|.. .|+|..|
T Consensus 209 qlltcpycdrgykrltslkeHikyrhe-kne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtEC 287 (1007)
T KOG3623|consen 209 QLLTCPYCDRGYKRLTSLKEHIKYRHE-KNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTEC 287 (1007)
T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHh-hCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhcccccccc
Confidence 346899999999999999999986542 245679999999999999999999998842 5899999
Q ss_pred ccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHH
Q psy18077 210 SKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQ 252 (451)
Q Consensus 210 ~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 252 (451)
++.|..+..|+.|+|+|.|++||.|+.|+++|.....+..||.
T Consensus 288 gKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 288 GKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred chhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 9999999999999999999999999999999999999988874
No 8
>KOG3623|consensus
Probab=99.89 E-value=1.3e-23 Score=199.96 Aligned_cols=104 Identities=32% Similarity=0.626 Sum_probs=94.9
Q ss_pred CcccccccccccCHHHHHHHHHHcCC----CcccCcCccccCCHHHHHHHHHHhcC-------------CCCcccccchh
Q psy18077 177 ARTCPQCGKLYVSMPAYSMHLRTHNQ----ICQCPHCSKSFSRPWLLQGHIRTHTG-------------EKPFKCKVCDK 239 (451)
Q Consensus 177 ~~~C~~C~~~f~~~~~l~~H~~~h~~----~~~C~~C~~~f~~~~~L~~H~~~h~~-------------~~~~~C~~C~~ 239 (451)
...|++|.+.+.....|..|++..+. .|.|..|.++|..+..|.+||.+|.. .+.|+|..|++
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence 46899999999999999999876432 78999999999999999999998853 25699999999
Q ss_pred hcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchh
Q psy18077 240 AFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHED 280 (451)
Q Consensus 240 ~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 280 (451)
+|+.+-.|+.|+|+|.|++||.|+.|++.|.....+..||.
T Consensus 290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 99999999999999999999999999999999999988875
No 9
>KOG3576|consensus
Probab=99.65 E-value=1.8e-17 Score=134.67 Aligned_cols=124 Identities=28% Similarity=0.589 Sum_probs=86.5
Q ss_pred CCCcccccccccccCHHHHHHHHHHcCC--CcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHH
Q psy18077 175 TKARTCPQCGKLYVSMPAYSMHLRTHNQ--ICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQ 252 (451)
Q Consensus 175 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~--~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 252 (451)
...|.|.+|++.|.-..-|.+|++-|.. .+-|..||+.|.+...|++|+++|+|.+||+|..|++.|.....|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 4567888888888888778888777754 56777777777777777777777777777777777777777777777776
Q ss_pred HcCCCCCeecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccccccccCChHHHHHHHHHhcCCCc
Q psy18077 253 THSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKP 323 (451)
Q Consensus 253 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 323 (451)
.-+|... ....+. ...+.|.|..||.+-.....+..|+..|+...|
T Consensus 195 kvhgv~~------------~yayke-------------rr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 195 KVHGVQH------------QYAYKE-------------RRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHcCchH------------HHHHHH-------------hhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 5544310 000000 234678888888887777788888877765443
No 10
>KOG3576|consensus
Probab=99.65 E-value=6.8e-17 Score=131.41 Aligned_cols=123 Identities=28% Similarity=0.539 Sum_probs=88.8
Q ss_pred CCCeecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccCh
Q psy18077 257 VKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALK 336 (451)
Q Consensus 257 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 336 (451)
...|.|.+|++.|....-|.+|++-| ...+.|.|..||+.|.+...|++|+++|+|.+||+|..|++.|.++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch--------~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqr 186 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCH--------SDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQR 186 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhc--------cHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhh
Confidence 45688888888888888888888776 6667888888888888888888888888888888888888888888
Q ss_pred hhHHhcccccCCCccCCCCCcccCcccccccccccccccCCChHHHHHHHHhccC
Q psy18077 337 SYLYKHEDSSCLKSYDSSSNGHLNKFTTISFQCRDCGKHYSTSSNLARHRQIHRS 391 (451)
Q Consensus 337 ~~L~~H~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 391 (451)
-.|..|++..+|-.-. ........+.|.|..||.+-.....+..|++.|+.
T Consensus 187 csleshl~kvhgv~~~----yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 187 CSLESHLKKVHGVQHQ----YAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred ccHHHHHHHHcCchHH----HHHHHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 8888887654443200 00111222356666666666666666666666654
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.26 E-value=9.5e-12 Score=120.92 Aligned_cols=97 Identities=22% Similarity=0.586 Sum_probs=52.7
Q ss_pred cccccchhhcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccccccccC-----
Q psy18077 232 FKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFA----- 306 (451)
Q Consensus 232 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~----- 306 (451)
+.|+.|++.|. ...|..|++.++ +++.|+ |++.+ .+..|..|+..+ .+.+++.|.+|++.|.
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~th--------Cp~Kpi~C~fC~~~v~~g~~~ 520 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQAST--------CPLRLITCRFCGDMVQAGGSA 520 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhcc--------CCCCceeCCCCCCccccCccc
Confidence 35555555553 445555555543 455555 55433 345555555444 3345566666665553
Q ss_pred -----ChHHHHHHHHHhcCCCceecccCcccccChhhHHhcc
Q psy18077 307 -----DKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHE 343 (451)
Q Consensus 307 -----~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 343 (451)
..+.|..|..+. |.+++.|..||+.+..+ .|..|+
T Consensus 521 ~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~ 560 (567)
T PLN03086 521 MDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQ 560 (567)
T ss_pred cchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHH
Confidence 134566666664 66667777776666543 344554
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.25 E-value=1.5e-11 Score=119.59 Aligned_cols=145 Identities=19% Similarity=0.524 Sum_probs=114.2
Q ss_pred cccccccccccCChHhHHHHHhhhcCCCCCCCccccc--ccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHHHH
Q psy18077 146 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQ--CGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHI 223 (451)
Q Consensus 146 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~H~ 223 (451)
.-.|..|.+.... ..|..|.... ......|+. |+..|. ...+..| +.|+.|++.|. ...|..|+
T Consensus 407 ~V~C~NC~~~i~l-~~l~lHe~~C----~r~~V~Cp~~~Cg~v~~-r~el~~H-------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 407 TVECRNCKHYIPS-RSIALHEAYC----SRHNVVCPHDGCGIVLR-VEEAKNH-------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred eEECCCCCCccch-hHHHHHHhhC----CCcceeCCcccccceee-ccccccC-------ccCCCCCCccc-hHHHHHHH
Confidence 3479999887655 4566777544 455677984 999984 3444444 47999999996 68899999
Q ss_pred HHhcCCCCcccccchhhcCChHhHHHHHHHcCCCCCeecccccccccC----------hHHHHhchhhcCCCCcCCCCCC
Q psy18077 224 RTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFAL----------KSYLYKHEDSSCLKSYDSSSSE 293 (451)
Q Consensus 224 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~----------~~~l~~H~~~~~~~~~~~~~~~ 293 (451)
++++ .++.|+ |++.+ .+..|..|+..|.+.+++.|.+|++.|.. ...|..|+.. .+.
T Consensus 473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~---------CG~ 539 (567)
T PLN03086 473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESI---------CGS 539 (567)
T ss_pred HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHh---------cCC
Confidence 9986 789999 99755 67899999999999999999999999952 3578889765 478
Q ss_pred cCeeccccccccCChHHHHHHHHHh
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQTH 318 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~~h 318 (451)
+++.|..|++.+..+ .|..|+..-
T Consensus 540 rt~~C~~Cgk~Vrlr-dm~~H~~~~ 563 (567)
T PLN03086 540 RTAPCDSCGRSVMLK-EMDIHQIAV 563 (567)
T ss_pred cceEccccCCeeeeh-hHHHHHHHh
Confidence 999999999998766 577886543
No 13
>PHA00733 hypothetical protein
Probab=99.23 E-value=1.1e-11 Score=98.27 Aligned_cols=84 Identities=15% Similarity=0.192 Sum_probs=73.0
Q ss_pred cccccccccccCCChHHHHHH--HHhccCCCCCCCcccCccccccCChHHHHHHHHHcCCccccccccccCCChhhHHHH
Q psy18077 365 ISFQCRDCGKHYSTSSNLARH--RQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGH 442 (451)
Q Consensus 365 ~~~~C~~C~~~f~~~~~L~~H--~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~~~~l~~H 442 (451)
+++.|.+|.+.|.....|..| ++.|....+.+||.|+.|++.|.+...|..|++.|..+|.|++|+++|.....|..|
T Consensus 39 ~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~~~~~C~~CgK~F~~~~sL~~H 118 (128)
T PHA00733 39 KRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTLDH 118 (128)
T ss_pred hhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCCcCccCCCCCCccCCHHHHHHH
Confidence 479999999999998888777 443333234789999999999999999999999887789999999999999999999
Q ss_pred HhhhcC
Q psy18077 443 IRTHTG 448 (451)
Q Consensus 443 ~~~h~~ 448 (451)
++..|+
T Consensus 119 ~~~~h~ 124 (128)
T PHA00733 119 VCKKHN 124 (128)
T ss_pred HHHhcC
Confidence 998876
No 14
>PHA00733 hypothetical protein
Probab=99.09 E-value=1.1e-10 Score=92.69 Aligned_cols=86 Identities=19% Similarity=0.293 Sum_probs=52.7
Q ss_pred CCceecccCcccccChhhHHhc--ccccCCCccCCCCCcccCcccccccccccccccCCChHHHHHHHHhccCCCCCCCc
Q psy18077 321 VKPFVCEKCGKAFALKSYLYKH--EDSSCLKSYDSSSNGHLNKFTTISFQCRDCGKHYSTSSNLARHRQIHRSPDDTKAR 398 (451)
Q Consensus 321 ~~~~~C~~C~~~f~~~~~L~~H--~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~ 398 (451)
.+++.|.+|...|..+..|..| ++. ++..+...+|.|+.|++.|.+...|..|++.|. .+|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~------------~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~-----~~~ 100 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYK------------LLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE-----HSK 100 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHh------------hcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC-----cCc
Confidence 4556666666666655555444 111 022333446777777777777777777776652 357
Q ss_pred ccCccccccCChHHHHHHHHHcCCc
Q psy18077 399 TCPQCGKLYVSMPAYSMHLRTHNQI 423 (451)
Q Consensus 399 ~C~~C~~~f~~~~~l~~H~~~H~~~ 423 (451)
.|..|++.|.....|..|+...++.
T Consensus 101 ~C~~CgK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 101 VCPVCGKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred cCCCCCCccCCHHHHHHHHHHhcCc
Confidence 7777777777777777777766553
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.96 E-value=3.6e-10 Score=73.06 Aligned_cols=42 Identities=29% Similarity=0.573 Sum_probs=21.6
Q ss_pred cccCccccccCChHHHHHHHHHcCCccccccccccCCChhhH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLL 439 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~~~~l 439 (451)
|.|+.||+.|.....|..|+++|+++|+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHNTNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcCCcccCCcccceeccccee
Confidence 455555555555555555555555445555555555544443
No 16
>KOG3993|consensus
Probab=98.89 E-value=2e-10 Score=104.35 Aligned_cols=194 Identities=19% Similarity=0.246 Sum_probs=101.3
Q ss_pred CcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCCCCeec--cc-ccccccChHHHHhch
Q psy18077 203 ICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVC--EK-CGKAFALKSYLYKHE 279 (451)
Q Consensus 203 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C--~~-C~~~f~~~~~l~~H~ 279 (451)
.|.|..|...|.+...|.+|.-.-.--..|+|+.|+|.|.-..+|..|.|+|.....-.= .. =.+...+....+.-.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 567888888888888888875221112238888888888888888888888854221000 00 000000011111110
Q ss_pred hhcCCCCcCCCCCCcCeeccccccccCChHHHHHHHHHhcCCC-----------------ceecccCcccccChhhHHhc
Q psy18077 280 DSSCLKSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVK-----------------PFVCEKCGKAFALKSYLYKH 342 (451)
Q Consensus 280 ~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-----------------~~~C~~C~~~f~~~~~L~~H 342 (451)
+ . .....+..|.|..|++.|.+...|+.|+.+|+..- .+-|..|.-.+.....--.+
T Consensus 347 r--s----g~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~ 420 (500)
T KOG3993|consen 347 R--S----GDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDE 420 (500)
T ss_pred c--c----CCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccc
Confidence 0 0 00123347888888888888888888877765310 13455555544433222222
Q ss_pred ccccCCCccCCCCCcccCcccccccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHHH
Q psy18077 343 EDSSCLKSYDSSSNGHLNKFTTISFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRT 419 (451)
Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 419 (451)
...+-+ ......|++|+..+.++..--.+.+.-.. +..|.|.+|...|.+...|.+|+..
T Consensus 421 vl~~a~--------------sael~~pp~~~~ppsss~~sgg~~rlg~~---~q~f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 421 VLYVAG--------------SAELELPPYDGSPPSSSGSSGGYGRLGIA---EQGFTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred eeeeec--------------cccccCCCCCCCCcccCCCCCccccccch---hhccccccchHhhhcCcchHhHhhh
Confidence 110000 00023455566555554444444433222 4556666666666666666666554
No 17
>KOG3993|consensus
Probab=98.84 E-value=8.2e-10 Score=100.48 Aligned_cols=210 Identities=19% Similarity=0.266 Sum_probs=135.2
Q ss_pred CcccccchhhcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccccccccCChHH
Q psy18077 231 PFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKVCDKAFADKSN 310 (451)
Q Consensus 231 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~ 310 (451)
.|.|..|...|...-.|.+|.-.-.-...|+|+.|+|.|.-..+|..|.+-|-.+.-....+..|-+= ...+...
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~-----~~~~rae 341 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQ-----AVETRAE 341 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhh-----hhhhhhh
Confidence 39999999999999999999743333345999999999999999999999885544333223222110 0001111
Q ss_pred HHHHHHH--hcCCCceecccCcccccChhhHHhcccccCCCccCCCCCc--ccCcccccccccccccccCCChHHHHHHH
Q psy18077 311 LRAHIQT--HSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSNG--HLNKFTTISFQCRDCGKHYSTSSNLARHR 386 (451)
Q Consensus 311 l~~H~~~--h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~--~~~~~~~~~~~C~~C~~~f~~~~~L~~H~ 386 (451)
.+.=-+. -.....|.|.+|++.|++...|+.|+.+|........... .+.......+.|..|.-.+.....--.+.
T Consensus 342 ~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 342 VQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV 421 (500)
T ss_pred hhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce
Confidence 1110000 0223479999999999999999999876654432221100 01112223577888888877766555555
Q ss_pred HhccCCCCCCCcccCccccccCChHHHHHHHHHc--CCccccccccccCCChhhHHHHHhhhc
Q psy18077 387 QIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTH--NQICQCPHCSKSFSRPWLLQGHIRTHT 447 (451)
Q Consensus 387 ~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H--~~~~~C~~C~~~f~~~~~l~~H~~~h~ 447 (451)
..+.+ ....-.|++|+-.+.+...--.+.+-- ...|.|.+|.-+|.....|.+|+..-|
T Consensus 422 l~~a~--sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 422 LYVAG--SAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCH 482 (500)
T ss_pred eeeec--cccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcC
Confidence 55544 234466888887777765544443322 233999999999999999999987644
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.83 E-value=1.5e-09 Score=70.21 Aligned_cols=44 Identities=11% Similarity=0.453 Sum_probs=39.7
Q ss_pred CeeccccccccCChHHHHHHHHHhcCCCceecccCcccccChhhHH
Q psy18077 295 PFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLY 340 (451)
Q Consensus 295 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 340 (451)
-|.|+.||+.|...+.|..|+++|+ ++|+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999998 7999999999999877664
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.65 E-value=2.1e-08 Score=55.39 Aligned_cols=26 Identities=50% Similarity=1.018 Sum_probs=23.7
Q ss_pred HHHHHHHHhcCCCceecccCcccccC
Q psy18077 310 NLRAHIQTHSKVKPFVCEKCGKAFAL 335 (451)
Q Consensus 310 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 335 (451)
+|.+|+++|++++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 48899999999999999999999964
No 20
>PHA00732 hypothetical protein
Probab=98.54 E-value=5.9e-08 Score=69.45 Aligned_cols=50 Identities=28% Similarity=0.551 Sum_probs=40.2
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHHHcCCc
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQI 423 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~ 423 (451)
||.|..|++.|.+...|..|++.++. ++.|+.|++.|. .|..|++++.-+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-----~~~C~~CgKsF~---~l~~H~~~~~~~ 50 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT-----LTKCPVCNKSYR---RLNQHFYSQYDI 50 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC-----CCccCCCCCEeC---ChhhhhcccCCc
Confidence 58899999999999999999985332 358999999997 588888776655
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.45 E-value=1.2e-07 Score=52.32 Aligned_cols=26 Identities=42% Similarity=0.853 Sum_probs=22.3
Q ss_pred HHHHHHHhccCCCCCCCcccCccccccCC
Q psy18077 381 NLARHRQIHRSPDDTKARTCPQCGKLYVS 409 (451)
Q Consensus 381 ~L~~H~~~H~~~~~~~~~~C~~C~~~f~~ 409 (451)
+|.+|+++|++ ++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~---~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTG---EKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSS---SSSEEESSSSEEESS
T ss_pred CHHHHhhhcCC---CCCCCCCCCcCeeCc
Confidence 48899999987 899999999998863
No 22
>PHA00732 hypothetical protein
Probab=98.33 E-value=3.7e-07 Score=65.38 Aligned_cols=48 Identities=31% Similarity=0.647 Sum_probs=42.0
Q ss_pred CcccCccccccCChHHHHHHHHHcCCccccccccccCCChhhHHHHHhhhc
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRTHT 447 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~~~~l~~H~~~h~ 447 (451)
||.|..|++.|.+...|+.|++.++..+.|+.|++.|. .|..|++++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTLTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCCCccCCCCCEeC---ChhhhhcccC
Confidence 68999999999999999999996555689999999998 5888887653
No 23
>PHA00616 hypothetical protein
Probab=98.33 E-value=2.3e-07 Score=57.05 Aligned_cols=39 Identities=15% Similarity=0.275 Sum_probs=31.7
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCcccccc
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLY 407 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f 407 (451)
||+|+.||+.|..++.|..|++.|++ ++++.|.+=-..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg---~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHK---QNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcC---CCccceeEEEEEE
Confidence 58899999999999999999999997 7888887543333
No 24
>PHA00616 hypothetical protein
Probab=98.29 E-value=3.7e-07 Score=56.12 Aligned_cols=34 Identities=21% Similarity=0.415 Sum_probs=31.6
Q ss_pred CeeccccccccCChHHHHHHHHHhcCCCceeccc
Q psy18077 295 PFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEK 328 (451)
Q Consensus 295 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 328 (451)
||+|..||+.|...+.|..|++.|++++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 6899999999999999999999999999998875
No 25
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.19 E-value=2.8e-06 Score=56.51 Aligned_cols=51 Identities=27% Similarity=0.538 Sum_probs=32.3
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHHHcC
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHN 421 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~ 421 (451)
|.|++|++ ..+...|..|....|..+ .+.+.|++|...+. ..|..||..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~-~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSE-SKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCC-CCCccCCCchhhhh--hHHHHHHHHhc
Confidence 67777777 455667777766555432 44677777776543 36666766554
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.18 E-value=2.3e-06 Score=56.86 Aligned_cols=49 Identities=29% Similarity=0.498 Sum_probs=40.2
Q ss_pred CcccCccccccCChHHHHHHHHHcCC----ccccccccccCCChhhHHHHHhhhcC
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQ----ICQCPHCSKSFSRPWLLQGHIRTHTG 448 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~----~~~C~~C~~~f~~~~~l~~H~~~h~~ 448 (451)
.|.|++|++ ..+...|..|+...+. .+.|++|...+. ..|..||+.+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 489999999 5667899999876543 389999999766 499999998875
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.93 E-value=4.5e-06 Score=44.72 Aligned_cols=22 Identities=45% Similarity=0.830 Sum_probs=12.2
Q ss_pred cccccccccCCChhhHHHHHhh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRT 445 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~ 445 (451)
|+|++|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555554
No 28
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.86 E-value=6.6e-06 Score=63.08 Aligned_cols=72 Identities=29% Similarity=0.478 Sum_probs=20.0
Q ss_pred ccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHHHcCC-ccccccccccCCChhhHHHHHhhh
Q psy18077 368 QCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQ-ICQCPHCSKSFSRPWLLQGHIRTH 446 (451)
Q Consensus 368 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~-~~~C~~C~~~f~~~~~l~~H~~~h 446 (451)
+|.+|+..|.+...|..|+...|+.. -. ....+.....+..+++.-.. .+.|.+|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~------~~-~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFD------IP-DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccc------cc-cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 59999999999999999998777521 11 12223344555556554433 488999999999999999999864
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.86 E-value=7.9e-06 Score=43.73 Aligned_cols=22 Identities=45% Similarity=0.925 Sum_probs=11.6
Q ss_pred cccccccccCCChHHHHHHHHh
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQI 388 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~ 388 (451)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555544
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.71 E-value=1.9e-05 Score=42.70 Aligned_cols=24 Identities=50% Similarity=0.859 Sum_probs=12.3
Q ss_pred cccccccccCCChhhHHHHHhhhc
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTHT 447 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h~ 447 (451)
|.|++|++.|.+...|..|+++||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 445555555555555555555543
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.61 E-value=3.8e-05 Score=41.44 Aligned_cols=23 Identities=43% Similarity=0.917 Sum_probs=11.2
Q ss_pred cccccccccCCChHHHHHHHHhc
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIH 389 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H 389 (451)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555544
No 32
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.59 E-value=4.2e-05 Score=58.52 Aligned_cols=24 Identities=29% Similarity=0.670 Sum_probs=14.5
Q ss_pred ccccccccccCCChHHHHHHHHhc
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIH 389 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H 389 (451)
.+.|.+|++.|.+...|..||+.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 366666666666666666666653
No 33
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.44 E-value=6.9e-05 Score=41.82 Aligned_cols=17 Identities=41% Similarity=0.944 Sum_probs=6.0
Q ss_pred cCccccccCChHHHHHH
Q psy18077 400 CPQCGKLYVSMPAYSMH 416 (451)
Q Consensus 400 C~~C~~~f~~~~~l~~H 416 (451)
|..|++.|.+...|..|
T Consensus 4 C~~C~~~F~~~~~l~~H 20 (27)
T PF13912_consen 4 CDECGKTFSSLSALREH 20 (27)
T ss_dssp ETTTTEEESSHHHHHHH
T ss_pred CCccCCccCChhHHHHH
Confidence 33333333333333333
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.40 E-value=8.2e-05 Score=41.50 Aligned_cols=26 Identities=42% Similarity=0.907 Sum_probs=24.0
Q ss_pred ccccccccccCCChHHHHHHHHhccC
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRS 391 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~ 391 (451)
+|.|..|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 58999999999999999999998864
No 35
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.35 E-value=7.6e-05 Score=66.10 Aligned_cols=53 Identities=36% Similarity=0.770 Sum_probs=41.7
Q ss_pred CCcCeeccc--cccccCChHHHHHHHHH-h------------------cCCCceecccCcccccChhhHHhccc
Q psy18077 292 SEKPFKCKV--CDKAFADKSNLRAHIQT-H------------------SKVKPFVCEKCGKAFALKSYLYKHED 344 (451)
Q Consensus 292 ~~~~~~C~~--C~~~f~~~~~l~~H~~~-h------------------~~~~~~~C~~C~~~f~~~~~L~~H~~ 344 (451)
+++||+|++ |.++|.....|+.|+.- | ...|||+|+.|+|+|.....|+-|+.
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 358888877 88888888888888753 2 13489999999999999999988864
No 36
>KOG2231|consensus
Probab=97.23 E-value=0.00051 Score=68.61 Aligned_cols=20 Identities=30% Similarity=0.767 Sum_probs=13.4
Q ss_pred ccccchhhcCChHhHHHHHHH
Q psy18077 233 KCKVCDKAFADKSNLRAHIQT 253 (451)
Q Consensus 233 ~C~~C~~~f~~~~~l~~H~~~ 253 (451)
.|..| ..|.+...|+.|+..
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~ 136 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRD 136 (669)
T ss_pred CCccc-cchhHHHHHHHHHHH
Confidence 56666 666677777777744
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.19 E-value=0.00014 Score=64.39 Aligned_cols=54 Identities=24% Similarity=0.533 Sum_probs=34.0
Q ss_pred ccccccccc--ccccCCChHHHHHHHHhccC-------CC---------CCCCcccCccccccCChHHHHHH
Q psy18077 363 TTISFQCRD--CGKHYSTSSNLARHRQIHRS-------PD---------DTKARTCPQCGKLYVSMPAYSMH 416 (451)
Q Consensus 363 ~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~-------~~---------~~~~~~C~~C~~~f~~~~~l~~H 416 (451)
..+||+|++ |.|.+.+...|+.|+.--|. +. ..|||+|.+|+|.+.....|+.|
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYH 417 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH 417 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceec
Confidence 346899976 88999998899998753221 00 13555555555555555555555
No 38
>KOG1146|consensus
Probab=97.16 E-value=0.00013 Score=76.87 Aligned_cols=51 Identities=24% Similarity=0.331 Sum_probs=35.1
Q ss_pred cCeeccccccccCChHHHHHHHHH-hcCCCceecccCcccccChhhHHhccc
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQT-HSKVKPFVCEKCGKAFALKSYLYKHED 344 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~ 344 (451)
..+.|.+|++.-+-..+|+.||.. ++...|.-|-.|.-.+.....+..+.+
T Consensus 588 P~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~ 639 (1406)
T KOG1146|consen 588 PSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGR 639 (1406)
T ss_pred CCcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCC
Confidence 458899999888888888888875 333444666666666666665555554
No 39
>KOG2231|consensus
Probab=97.14 E-value=0.0011 Score=66.32 Aligned_cols=97 Identities=21% Similarity=0.410 Sum_probs=48.2
Q ss_pred ccccccccccCHHHHHHHHHHcCCCcccCcCcc---c------cCCHHHHHHHHHHhc-CCC----CcccccchhhcCCh
Q psy18077 179 TCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSK---S------FSRPWLLQGHIRTHT-GEK----PFKCKVCDKAFADK 244 (451)
Q Consensus 179 ~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~---~------f~~~~~L~~H~~~h~-~~~----~~~C~~C~~~f~~~ 244 (451)
.|..| ..|.+...|..|+...+..+.|.+|-. . ..+...|..|++.-- +++ .-.|..|...|...
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~ 195 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDD 195 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccH
Confidence 35555 555555555555544444344443311 1 123345555554311 111 13466666666666
Q ss_pred HhHHHHHHHcCCCCCeeccccc------ccccChHHHHhchh
Q psy18077 245 SNLRAHIQTHSKVKPFVCEKCG------KAFALKSYLYKHED 280 (451)
Q Consensus 245 ~~l~~H~~~h~~~~~~~C~~C~------~~f~~~~~l~~H~~ 280 (451)
..|..|++.++ |.|..|. ..|.....|..|.+
T Consensus 196 ~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR 233 (669)
T KOG2231|consen 196 DELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFR 233 (669)
T ss_pred HHHHHhhccce----eheeecCcccccchhcccchHHHHHhh
Confidence 66666666544 4555553 33455566666654
No 40
>KOG1146|consensus
Probab=97.06 E-value=0.00019 Score=75.60 Aligned_cols=128 Identities=20% Similarity=0.339 Sum_probs=73.2
Q ss_pred cCcCccccCCHHHHHHHHHH-hcCCCCcccccchhhcCChHhHHHHHHHcCCCCCeecccccccccChHHHHhchhhcCC
Q psy18077 206 CPHCSKSFSRPWLLQGHIRT-HTGEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCL 284 (451)
Q Consensus 206 C~~C~~~f~~~~~L~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 284 (451)
|..|+..+.+...+..|+.. +...+.|+|+.|+..|+....|-.|||..+.+-.- .+|. ..+.|.+.--+
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~-------~gq~~~~~arg 509 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCK-------AGQNHPRLARG 509 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhH-------hcccccccccc
Confidence 44445555555555554433 34446677777777777777777777763322110 1111 11112111100
Q ss_pred CCcCCCCCCcCeeccccccccCChHHHHHHHHH--hc----------C-------------------------------C
Q psy18077 285 KSYDSSSSEKPFKCKVCDKAFADKSNLRAHIQT--HS----------K-------------------------------V 321 (451)
Q Consensus 285 ~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~--h~----------~-------------------------------~ 321 (451)
.. -..+.++|.|..|...+.....|.+|+.. |. | +
T Consensus 510 ~~--~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktk 587 (1406)
T KOG1146|consen 510 EV--YRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTK 587 (1406)
T ss_pred cc--ccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCC
Confidence 00 01345678888888888888888888752 10 0 1
Q ss_pred CceecccCcccccChhhHHhccc
Q psy18077 322 KPFVCEKCGKAFALKSYLYKHED 344 (451)
Q Consensus 322 ~~~~C~~C~~~f~~~~~L~~H~~ 344 (451)
-.+.|..|++.-.-...|..|+.
T Consensus 588 P~~~C~vc~yetniarnlrihmt 610 (1406)
T KOG1146|consen 588 PSWRCEVCSYETNIARNLRIHMT 610 (1406)
T ss_pred CCcchhhhcchhhhhhccccccc
Confidence 24889999999998999999987
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.02 E-value=0.00034 Score=37.71 Aligned_cols=24 Identities=42% Similarity=0.735 Sum_probs=13.2
Q ss_pred cccccccccCCChhhHHHHHhhhcC
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTHTG 448 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h~~ 448 (451)
|+|+.|++... ...|..|++.||+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 45666666665 5666666666553
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.98 E-value=0.00065 Score=37.16 Aligned_cols=23 Identities=43% Similarity=0.909 Sum_probs=13.1
Q ss_pred cccccccccCCChhhHHHHHhhh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTH 446 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h 446 (451)
|.|..|++.|.....|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 34555555555555555555544
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.89 E-value=0.00043 Score=43.56 Aligned_cols=28 Identities=18% Similarity=0.386 Sum_probs=13.9
Q ss_pred cccccccccCCChhhHHHHHhhhcCCCC
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTHTGGTN 451 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 451 (451)
-.|++|+..+.+...|+.|+.++|+.||
T Consensus 25 atCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 25 ATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCcchhhccchhhHHHHHHHHhcccC
Confidence 4455566666666666666655555544
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.79 E-value=0.001 Score=36.36 Aligned_cols=23 Identities=35% Similarity=0.914 Sum_probs=13.6
Q ss_pred cccccccccCCChHHHHHHHHhc
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIH 389 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H 389 (451)
|.|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 35666666666666666666544
No 45
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.70 E-value=0.0022 Score=57.70 Aligned_cols=127 Identities=24% Similarity=0.488 Sum_probs=86.1
Q ss_pred Cccccc--ccccccCHHHHHHHHHHcCCCcccCcCc---cccC------CHHHHHHHHHHhcCCCCc----ccccchhhc
Q psy18077 177 ARTCPQ--CGKLYVSMPAYSMHLRTHNQICQCPHCS---KSFS------RPWLLQGHIRTHTGEKPF----KCKVCDKAF 241 (451)
Q Consensus 177 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~C~~C~---~~f~------~~~~L~~H~~~h~~~~~~----~C~~C~~~f 241 (451)
.|.|+. |..+......|..|.+..++.+.|.+|- +.|. +...|..|...-..+..| .|..|...|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~F 230 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYF 230 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhcccee
Confidence 466754 7777777888888888877777777773 3343 335566665443222223 599999999
Q ss_pred CChHhHHHHHHHcCCCCCeecccccc-------cccChHHHHhchhhcCCCCcCCCCCCcCeeccc--cc----cccCCh
Q psy18077 242 ADKSNLRAHIQTHSKVKPFVCEKCGK-------AFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKV--CD----KAFADK 308 (451)
Q Consensus 242 ~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~--C~----~~f~~~ 308 (451)
.....|..|++.-+. .|-+|++ -|.+...|..|.+. ..|.|.+ |. ..|...
T Consensus 231 YdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~------------~hy~ct~qtc~~~k~~vf~~~ 294 (493)
T COG5236 231 YDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRN------------AHYCCTFQTCRVGKCYVFPYH 294 (493)
T ss_pred cChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhc------------CceEEEEEEEecCcEEEeccH
Confidence 999999999987552 4666664 47777888888642 3466654 42 468889
Q ss_pred HHHHHHHHHhc
Q psy18077 309 SNLRAHIQTHS 319 (451)
Q Consensus 309 ~~l~~H~~~h~ 319 (451)
..|..|+..-+
T Consensus 295 ~el~~h~~~~h 305 (493)
T COG5236 295 TELLEHLTRFH 305 (493)
T ss_pred HHHHHHHHHHh
Confidence 99999976543
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.67 E-value=0.0031 Score=39.77 Aligned_cols=33 Identities=33% Similarity=0.681 Sum_probs=23.3
Q ss_pred CCCcCeeccccccccCChHHHHHHHHHhcCCCc
Q psy18077 291 SSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKP 323 (451)
Q Consensus 291 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 323 (451)
..+.|-.|++|+..+....+|++|+.++++.+|
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 456888999999999999999999988887775
No 47
>KOG2785|consensus
Probab=96.63 E-value=0.0031 Score=57.97 Aligned_cols=49 Identities=22% Similarity=0.428 Sum_probs=37.1
Q ss_pred CcccCccccccCChHHHHHHHHHcCCc-------------------------ccccccc---ccCCChhhHHHHHhh
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQI-------------------------CQCPHCS---KSFSRPWLLQGHIRT 445 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~-------------------------~~C~~C~---~~f~~~~~l~~H~~~ 445 (451)
|-.|-.|++.|.+...-..||..+++. |.|-.|+ +.|.+....+.||+.
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 456777777777777777777666542 6799998 999999999999964
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.55 E-value=0.0014 Score=35.18 Aligned_cols=23 Identities=39% Similarity=0.745 Sum_probs=10.3
Q ss_pred cccccccccCCChHHHHHHHHhcc
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHR 390 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~ 390 (451)
|+|+.|++... ...|.+|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 34555555444 445555555443
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.54 E-value=0.0012 Score=35.88 Aligned_cols=22 Identities=41% Similarity=0.700 Sum_probs=12.7
Q ss_pred cccccccccCCChhhHHHHHhh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRT 445 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~ 445 (451)
|.|.+|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 3455666666666666666554
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.50 E-value=0.0013 Score=35.81 Aligned_cols=22 Identities=32% Similarity=0.551 Sum_probs=13.1
Q ss_pred cccCccccccCChHHHHHHHHH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLRT 419 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~~ 419 (451)
|.|.+|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666553
No 51
>PRK04860 hypothetical protein; Provisional
Probab=96.21 E-value=0.0028 Score=52.36 Aligned_cols=39 Identities=26% Similarity=0.759 Sum_probs=34.1
Q ss_pred cCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccCh
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALK 336 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 336 (451)
-+|.|. |+. ....+++|.++|+++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 479998 997 667889999999999999999999988654
No 52
>KOG2785|consensus
Probab=96.17 E-value=0.012 Score=54.21 Aligned_cols=54 Identities=19% Similarity=0.256 Sum_probs=46.6
Q ss_pred ccccccccccCCChHHHHHHHHhccCCC--------------------CCCCcccCccc---cccCChHHHHHHHHH
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPD--------------------DTKARTCPQCG---KLYVSMPAYSMHLRT 419 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~--------------------~~~~~~C~~C~---~~f~~~~~l~~H~~~ 419 (451)
|-.|-.|++.+.+...-..||..+||.- ...-|.|-.|+ +.|.+..+.+.||+.
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 6789999999999999999999998732 12358999999 999999999999974
No 53
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.10 E-value=0.008 Score=54.26 Aligned_cols=128 Identities=21% Similarity=0.391 Sum_probs=90.9
Q ss_pred cccccc--cccccCChHhHHHHHhhhcCCCCCCCcccccccc---------cccCHHHHHHHHHHcCC------CcccCc
Q psy18077 146 SFQCRD--CGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGK---------LYVSMPAYSMHLRTHNQ------ICQCPH 208 (451)
Q Consensus 146 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~---------~f~~~~~l~~H~~~h~~------~~~C~~ 208 (451)
.|.|+. |.........|..|.+..+ ..+.|.+|-. ...+...|..|...-.. --.|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H-----~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F 225 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQH-----GFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF 225 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhc-----CcEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhh
Confidence 377875 7777778899999998865 2567777742 23455667777654321 236999
Q ss_pred CccccCCHHHHHHHHHHhcCCCCcccccchh-------hcCChHhHHHHHHHcCCCCCeeccc--cc----ccccChHHH
Q psy18077 209 CSKSFSRPWLLQGHIRTHTGEKPFKCKVCDK-------AFADKSNLRAHIQTHSKVKPFVCEK--CG----KAFALKSYL 275 (451)
Q Consensus 209 C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~~~~l 275 (451)
|...|.+-..|..|++..+ -.|.+|++ -|++...|..|.+..+ |.|.. |. ..|.....|
T Consensus 226 C~~~FYdDDEL~~HcR~~H----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el 297 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRH----EACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTEL 297 (493)
T ss_pred ccceecChHHHHHHHHhhh----hhhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHH
Confidence 9999999999999998754 35777754 4788888888887544 66643 43 468888999
Q ss_pred HhchhhcCCCC
Q psy18077 276 YKHEDSSCLKS 286 (451)
Q Consensus 276 ~~H~~~~~~~~ 286 (451)
..|+...|+..
T Consensus 298 ~~h~~~~h~~~ 308 (493)
T COG5236 298 LEHLTRFHKVN 308 (493)
T ss_pred HHHHHHHhhcc
Confidence 99987665543
No 54
>KOG2482|consensus
Probab=95.95 E-value=0.013 Score=52.95 Aligned_cols=49 Identities=20% Similarity=0.348 Sum_probs=39.1
Q ss_pred eeccccccccCChHHHHHHHHHhcC---------------------------CCceecccCcccccChhhHHhccc
Q psy18077 296 FKCKVCDKAFADKSNLRAHIQTHSK---------------------------VKPFVCEKCGKAFALKSYLYKHED 344 (451)
Q Consensus 296 ~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------~~~~~C~~C~~~f~~~~~L~~H~~ 344 (451)
..|-.|.....+...|..||.+-+. .+.-.|-.|...|.....|..|+.
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcc
Confidence 5899999888888899999875331 112458889999999999999986
No 55
>PRK04860 hypothetical protein; Provisional
Probab=95.76 E-value=0.0057 Score=50.55 Aligned_cols=38 Identities=24% Similarity=0.684 Sum_probs=32.4
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCCh
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSM 410 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~ 410 (451)
+|.|. |+. ....+.+|.++|.+ +++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g---~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRG---EAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcC---CccEECCCCCceeEEe
Confidence 79998 987 67788999999987 8899999999887643
No 56
>KOG2482|consensus
Probab=95.70 E-value=0.031 Score=50.63 Aligned_cols=162 Identities=20% Similarity=0.289 Sum_probs=85.4
Q ss_pred ccccccccccC-ChHhHHHHHhhhcCCCCCCCcccccccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHHHHHH
Q psy18077 147 FQCRDCGKHYS-TSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRT 225 (451)
Q Consensus 147 ~~C~~C~~~f~-~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~H~~~ 225 (451)
.+|-.|+..+. +++....|+-.-++-.-..| -.......|..|++..-..+.|-.|.+.|+.+..|+.||+.
T Consensus 145 lqClFCn~e~lgnRs~~l~Hlf~~H~lniGlp-------DniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 145 LQCLFCNNEGLGNRSEILEHLFHVHGLNIGLP-------DNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred eEEEEecchhcccHHHHHHHHHHHhhhccCCC-------cceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHh
Confidence 46777776663 34555556533221000000 11233345555655555566788888888888888888765
Q ss_pred --hcCCCC--------cccccchhhcCChHhHHHHHH----Hc---------------CCCCC--eecccccccccChHH
Q psy18077 226 --HTGEKP--------FKCKVCDKAFADKSNLRAHIQ----TH---------------SKVKP--FVCEKCGKAFALKSY 274 (451)
Q Consensus 226 --h~~~~~--------~~C~~C~~~f~~~~~l~~H~~----~h---------------~~~~~--~~C~~C~~~f~~~~~ 274 (451)
|....| |.=.+ .-..++-...|.. +. .+..+ ..|-.|.....+...
T Consensus 218 K~HrrinPknreYDkfyiINY---~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~ 294 (423)
T KOG2482|consen 218 KRHRRINPKNREYDKFYIINY---LEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVF 294 (423)
T ss_pred ccCcccCCCccccceEEEEeH---hhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHH
Confidence 433322 11000 0000111111110 00 01112 578888887777888
Q ss_pred HHhchhhcCCCCcCCC-------------------CCCcCeeccccccccCChHHHHHHHHHh
Q psy18077 275 LYKHEDSSCLKSYDSS-------------------SSEKPFKCKVCDKAFADKSNLRAHIQTH 318 (451)
Q Consensus 275 l~~H~~~~~~~~~~~~-------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~h 318 (451)
|..||+..|.-.+... .....-.|-.|...|.....|..||.-+
T Consensus 295 l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 295 LFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred HHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 8888775543221110 1123457888999999999999998753
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.69 E-value=0.0034 Score=34.82 Aligned_cols=22 Identities=32% Similarity=0.626 Sum_probs=12.7
Q ss_pred cccccccccCCChhhHHHHHhh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRT 445 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~ 445 (451)
|.|.+|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4456666666666666655544
No 58
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.31 E-value=0.0041 Score=34.51 Aligned_cols=21 Identities=24% Similarity=0.542 Sum_probs=11.3
Q ss_pred cccCccccccCChHHHHHHHH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~ 418 (451)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 445555555555555555544
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.81 E-value=0.016 Score=57.34 Aligned_cols=145 Identities=27% Similarity=0.327 Sum_probs=86.8
Q ss_pred cCeeccccccccCChHHHHHHHH--HhcCC--Cceecc--cCcccccChhhHHhcccccCCCccCCCC------------
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQ--THSKV--KPFVCE--KCGKAFALKSYLYKHEDSSCLKSYDSSS------------ 355 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~~~~~~~~~~~------------ 355 (451)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...+.+.......
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 35777777777777777777777 67777 777777 6777777777777776644333211000
Q ss_pred Cc--------ccCcccccccccc--cccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHHHcCCc-c
Q psy18077 356 NG--------HLNKFTTISFQCR--DCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQI-C 424 (451)
Q Consensus 356 ~~--------~~~~~~~~~~~C~--~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~-~ 424 (451)
.. .........+.+. .|-..+.....+..|...|... ....+.+..|.+.|.....+..|++.|... .
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSF-RPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhcccccccccccccccc-CCcCCCCCcchhhccCcccccccccccccCCc
Confidence 00 0001111133442 3666667777777777666542 133577788888888888888888887543 3
Q ss_pred ccccccccCCChhhH
Q psy18077 425 QCPHCSKSFSRPWLL 439 (451)
Q Consensus 425 ~C~~C~~~f~~~~~l 439 (451)
-+..+-+.|.....+
T Consensus 447 ~~~~~~~~~~~~~~~ 461 (467)
T COG5048 447 LLCSILKSFRRDLDL 461 (467)
T ss_pred eeeccccccchhhhh
Confidence 344444444444443
No 60
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.79 E-value=0.048 Score=29.48 Aligned_cols=19 Identities=26% Similarity=0.754 Sum_probs=9.5
Q ss_pred ccCccccccCChHHHHHHHH
Q psy18077 399 TCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 399 ~C~~C~~~f~~~~~l~~H~~ 418 (451)
.|+.|+++| ....|..|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4445555543
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.74 E-value=0.056 Score=29.21 Aligned_cols=21 Identities=29% Similarity=0.849 Sum_probs=18.1
Q ss_pred cccccccccCCChHHHHHHHHh
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQI 388 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~ 388 (451)
..|+.||+.| ....|.+|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 5799999999 77889999864
No 62
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.28 E-value=0.028 Score=55.54 Aligned_cols=157 Identities=27% Similarity=0.385 Sum_probs=78.9
Q ss_pred CcccccchhhcCChHhHHHHHH--HcCCC--CCeecc--cccccccChHHHHhchhhcCCCCcCCCCCCcCeeccc--cc
Q psy18077 231 PFKCKVCDKAFADKSNLRAHIQ--THSKV--KPFVCE--KCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKV--CD 302 (451)
Q Consensus 231 ~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~--C~ 302 (451)
++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.+...+..|...+ .+..++.+.. |.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~ 360 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLH--------TSISPAKEKLLNSS 360 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccc--------cCCCccccccccCc
Confidence 4666666666666666666666 56666 666666 5666666666666665544 2333333322 22
Q ss_pred cccCChHHH-----HHHHHHhcCCCceecc--cCcccccChhhHHhcccccCCCccCCCCCcccCccccccccccccccc
Q psy18077 303 KAFADKSNL-----RAHIQTHSKVKPFVCE--KCGKAFALKSYLYKHEDSSCLKSYDSSSNGHLNKFTTISFQCRDCGKH 375 (451)
Q Consensus 303 ~~f~~~~~l-----~~H~~~h~~~~~~~C~--~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~ 375 (451)
..+.....- ......-...+.+.+. .|...+.....+..|...+. ......+.+..|.+.
T Consensus 361 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~ 427 (467)
T COG5048 361 SKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHL-------------SFRPYNCKNPPCSKS 427 (467)
T ss_pred cccccccCCCCccchhhccCccCCccccccccchhhhhcccccccccccccc-------------ccCCcCCCCCcchhh
Confidence 222221110 0000111112233332 24444444444444432111 111125778889999
Q ss_pred CCChHHHHHHHHhccCCCCCCCcccCccccccCChHH
Q psy18077 376 YSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPA 412 (451)
Q Consensus 376 f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~ 412 (451)
|.....+..|++.|.. ..++-|..+ +.|.....
T Consensus 428 ~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~ 460 (467)
T COG5048 428 FNRHYNLIPHKKIHTN---HAPLLCSIL-KSFRRDLD 460 (467)
T ss_pred ccCccccccccccccc---CCceeeccc-cccchhhh
Confidence 9999999999988874 344444333 33433333
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.65 E-value=0.086 Score=31.10 Aligned_cols=22 Identities=18% Similarity=0.407 Sum_probs=13.9
Q ss_pred CcccCccccccCChHHHHHHHH
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~ 418 (451)
+|.|.+|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4566666666666666666654
No 64
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.56 E-value=0.13 Score=39.33 Aligned_cols=23 Identities=22% Similarity=0.325 Sum_probs=12.2
Q ss_pred cccccccccCCChhhHHHHHhhh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIRTH 446 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~~h 446 (451)
|+|+.|...|-..-.+-.|...|
T Consensus 82 y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 82 YVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred eeCCCCCCccccccchhhhhhcc
Confidence 55555555555555555554444
No 65
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.08 E-value=0.11 Score=30.58 Aligned_cols=23 Identities=17% Similarity=0.489 Sum_probs=20.7
Q ss_pred ccccccccccCCChHHHHHHHHh
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQI 388 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~ 388 (451)
+|.|.+|++.|.+...+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999854
No 66
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=91.83 E-value=0.33 Score=37.10 Aligned_cols=26 Identities=15% Similarity=0.294 Sum_probs=15.0
Q ss_pred CCcccCccccccCChHHHHHHHHHcC
Q psy18077 396 KARTCPQCGKLYVSMPAYSMHLRTHN 421 (451)
Q Consensus 396 ~~~~C~~C~~~f~~~~~l~~H~~~H~ 421 (451)
..|+|+.|...|-..-.+-.|...|+
T Consensus 80 ~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 80 HRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred cceeCCCCCCccccccchhhhhhccC
Confidence 35666666666655555555555444
No 67
>KOG2893|consensus
Probab=91.39 E-value=0.059 Score=46.18 Aligned_cols=46 Identities=28% Similarity=0.597 Sum_probs=34.5
Q ss_pred ccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHH-HHHc
Q psy18077 368 QCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMH-LRTH 420 (451)
Q Consensus 368 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H-~~~H 420 (451)
=|-+|++.|....-|..|++. |-|+|-+|.+...+-..|..| |++|
T Consensus 12 wcwycnrefddekiliqhqka-------khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA-------KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhhh-------ccceeeeehhhhccCCCceeehhhhh
Confidence 388899999998888888764 448899998887776666666 3444
No 68
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=89.95 E-value=0.51 Score=36.49 Aligned_cols=26 Identities=15% Similarity=0.539 Sum_probs=19.4
Q ss_pred cccc----ccccccCCChHHHHHHHHhccC
Q psy18077 366 SFQC----RDCGKHYSTSSNLARHRQIHRS 391 (451)
Q Consensus 366 ~~~C----~~C~~~f~~~~~L~~H~~~H~~ 391 (451)
.|.| ..|++.+.+...|..|++.+||
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 4777 7787777777778777777663
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=89.94 E-value=0.18 Score=29.35 Aligned_cols=24 Identities=33% Similarity=1.025 Sum_probs=13.7
Q ss_pred eeccccccccCChHHHHHHHHHhcCCCceecccCcc
Q psy18077 296 FKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGK 331 (451)
Q Consensus 296 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 331 (451)
|+|..||..+.... .++.|+.|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 56666666554322 5566666654
No 70
>KOG2893|consensus
Probab=89.89 E-value=0.093 Score=45.01 Aligned_cols=43 Identities=30% Similarity=0.631 Sum_probs=32.1
Q ss_pred ccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHH
Q psy18077 205 QCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHI 251 (451)
Q Consensus 205 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 251 (451)
.|-.|++.|....-|.+|++. +.|+|.+|.+.+.+--.|..|.
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihc 54 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHC 54 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeeh
Confidence 477788888888888888765 3388888888877776776664
No 71
>KOG4173|consensus
Probab=89.05 E-value=0.26 Score=41.41 Aligned_cols=49 Identities=27% Similarity=0.692 Sum_probs=29.9
Q ss_pred ccCcCccccCCHHHHHHHHHHh----------cCCCCccccc--chhhcCChHhHHHHHHH
Q psy18077 205 QCPHCSKSFSRPWLLQGHIRTH----------TGEKPFKCKV--CDKAFADKSNLRAHIQT 253 (451)
Q Consensus 205 ~C~~C~~~f~~~~~L~~H~~~h----------~~~~~~~C~~--C~~~f~~~~~l~~H~~~ 253 (451)
.|..|.+.|.+...|..|+... .|...|+|-+ |+..|.+...-+.|+-.
T Consensus 108 sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 108 SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIR 168 (253)
T ss_pred hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence 3555555555555555554332 2444577765 88888888887777754
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=87.96 E-value=1 Score=34.73 Aligned_cols=25 Identities=28% Similarity=0.601 Sum_probs=23.2
Q ss_pred ccc----ccccccCCChhhHHHHHhhhcC
Q psy18077 424 CQC----PHCSKSFSRPWLLQGHIRTHTG 448 (451)
Q Consensus 424 ~~C----~~C~~~f~~~~~l~~H~~~h~~ 448 (451)
|.| ..|++.+.+...+.+|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 889 9999999999999999999886
No 73
>KOG4173|consensus
Probab=86.61 E-value=0.18 Score=42.31 Aligned_cols=86 Identities=23% Similarity=0.401 Sum_probs=46.9
Q ss_pred cCeeccc--cccccCChHHHHHHHHHhcCCCceecccCcccccChhhHHhcccccCCCccCCCCCcccCccccccccc--
Q psy18077 294 KPFKCKV--CDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHEDSSCLKSYDSSSNGHLNKFTTISFQC-- 369 (451)
Q Consensus 294 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~~~~~~~~~~~~~C-- 369 (451)
..|.|++ |...|.....+..|..+-++ -.|..|.+.|.+...|..|+...+..-|.. +...+...|.|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa-----~veRG~dMy~Clv 149 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQA-----LVERGQDMYQCLV 149 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHH-----HHHcCccHHHHHH
Confidence 3455655 55666666656666544333 246666666666666666653222111110 11111225777
Q ss_pred ccccccCCChHHHHHHHH
Q psy18077 370 RDCGKHYSTSSNLARHRQ 387 (451)
Q Consensus 370 ~~C~~~f~~~~~L~~H~~ 387 (451)
..|+..|.+...-..|+.
T Consensus 150 EgCt~KFkT~r~RkdH~I 167 (253)
T KOG4173|consen 150 EGCTEKFKTSRDRKDHMI 167 (253)
T ss_pred HhhhhhhhhhhhhhhHHH
Confidence 458888888887788864
No 74
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=84.39 E-value=0.5 Score=36.21 Aligned_cols=30 Identities=23% Similarity=0.716 Sum_probs=22.5
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCCh
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSM 410 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~ 410 (451)
..|+.||++|.... ..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYDLn--------------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDLN--------------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccCC--------------CCCccCCCCCCccCcc
Confidence 57888888887642 5678888888887654
No 75
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=83.87 E-value=0.66 Score=29.95 Aligned_cols=30 Identities=17% Similarity=0.474 Sum_probs=21.8
Q ss_pred CCCCcCeeccccccccCChHHHHHHHHHhc
Q psy18077 290 SSSEKPFKCKVCDKAFADKSNLRAHIQTHS 319 (451)
Q Consensus 290 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 319 (451)
..|+.-+.|+.|+..|.......+|+..-+
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 356777788888888888777777776533
No 76
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.51 E-value=1 Score=34.50 Aligned_cols=11 Identities=27% Similarity=0.987 Sum_probs=5.9
Q ss_pred cccccccccCC
Q psy18077 148 QCRDCGKHYST 158 (451)
Q Consensus 148 ~C~~C~~~f~~ 158 (451)
.|+.||+.|..
T Consensus 11 ~Cp~CG~kFYD 21 (108)
T PF09538_consen 11 TCPSCGAKFYD 21 (108)
T ss_pred cCCCCcchhcc
Confidence 45555555544
No 77
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=82.32 E-value=0.64 Score=30.03 Aligned_cols=27 Identities=19% Similarity=0.445 Sum_probs=16.2
Q ss_pred ccccccccccccCCChHHHHHHHHhcc
Q psy18077 364 TISFQCRDCGKHYSTSSNLARHRQIHR 390 (451)
Q Consensus 364 ~~~~~C~~C~~~f~~~~~L~~H~~~H~ 390 (451)
+.-++|+.|+..|....++.+|+..-|
T Consensus 15 E~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 15 EEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred ceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 335566666666666666666665444
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=82.09 E-value=0.67 Score=40.69 Aligned_cols=57 Identities=19% Similarity=0.282 Sum_probs=28.6
Q ss_pred CceecccCcccccChhhHHhccccc-CCCccCCCCCcccCcccccccccccccccCCCh
Q psy18077 322 KPFVCEKCGKAFALKSYLYKHEDSS-CLKSYDSSSNGHLNKFTTISFQCRDCGKHYSTS 379 (451)
Q Consensus 322 ~~~~C~~C~~~f~~~~~L~~H~~~~-~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~ 379 (451)
+.+.||+|+..|.++.-+....+.. ....|...-. .++..-=....|+.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~-~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYK-GVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccC-CCCCeeeeEEECCCCCCccccc
Confidence 5578888888888776555554321 0000000000 0111001146899999987644
No 79
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=81.95 E-value=0.48 Score=34.00 Aligned_cols=30 Identities=30% Similarity=0.774 Sum_probs=15.3
Q ss_pred CcccCccccccCChHHHHHHHHHcCCccccccccccCC
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFS 434 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~ 434 (451)
.|.|+.|++. .+.|.-++.+.|..|++.|+
T Consensus 35 ~~~Cp~C~~~--------~VkR~a~GIW~C~kCg~~fA 64 (89)
T COG1997 35 KHVCPFCGRT--------TVKRIATGIWKCRKCGAKFA 64 (89)
T ss_pred CCcCCCCCCc--------ceeeeccCeEEcCCCCCeec
Confidence 4556666553 12333445556666665554
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=80.89 E-value=0.92 Score=26.58 Aligned_cols=10 Identities=30% Similarity=0.826 Sum_probs=5.8
Q ss_pred eecccccccc
Q psy18077 296 FKCKVCDKAF 305 (451)
Q Consensus 296 ~~C~~C~~~f 305 (451)
|+|..||..+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 5566666554
No 81
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=80.57 E-value=0.69 Score=37.95 Aligned_cols=12 Identities=25% Similarity=1.046 Sum_probs=6.0
Q ss_pred cccccccccCCC
Q psy18077 367 FQCRDCGKHYST 378 (451)
Q Consensus 367 ~~C~~C~~~f~~ 378 (451)
|.|+.|+..|..
T Consensus 100 Y~Cp~C~~~y~~ 111 (147)
T smart00531 100 YKCPNCQSKYTF 111 (147)
T ss_pred EECcCCCCEeeH
Confidence 555555555443
No 82
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=79.72 E-value=0.95 Score=24.67 Aligned_cols=9 Identities=56% Similarity=1.472 Sum_probs=4.2
Q ss_pred ccCcCcccc
Q psy18077 205 QCPHCSKSF 213 (451)
Q Consensus 205 ~C~~C~~~f 213 (451)
.|+.||..|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 344454444
No 83
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=78.09 E-value=1.7 Score=28.75 Aligned_cols=13 Identities=31% Similarity=0.784 Sum_probs=7.5
Q ss_pred ccccccccccCCC
Q psy18077 366 SFQCRDCGKHYST 378 (451)
Q Consensus 366 ~~~C~~C~~~f~~ 378 (451)
.|.|+.||.....
T Consensus 27 ~F~CPnCGe~~I~ 39 (61)
T COG2888 27 KFPCPNCGEVEIY 39 (61)
T ss_pred EeeCCCCCceeee
Confidence 4667777654433
No 84
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=77.53 E-value=0.47 Score=41.65 Aligned_cols=22 Identities=23% Similarity=0.513 Sum_probs=14.4
Q ss_pred ccccccccccCCChHHHHHHHH
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQ 387 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~ 387 (451)
.+.||+|++.|.+..-+....+
T Consensus 5 ~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred ceECCCCCCeeeeeEEEcCCce
Confidence 4678888888877655544443
No 85
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=76.58 E-value=1.1 Score=26.59 Aligned_cols=12 Identities=25% Similarity=0.872 Sum_probs=6.1
Q ss_pred ccccccccCCCh
Q psy18077 368 QCRDCGKHYSTS 379 (451)
Q Consensus 368 ~C~~C~~~f~~~ 379 (451)
.|+.|+..|.-.
T Consensus 4 ~Cp~C~~~y~i~ 15 (36)
T PF13717_consen 4 TCPNCQAKYEID 15 (36)
T ss_pred ECCCCCCEEeCC
Confidence 455555555443
No 86
>PHA00626 hypothetical protein
Probab=75.47 E-value=1.5 Score=28.49 Aligned_cols=13 Identities=38% Similarity=0.892 Sum_probs=10.0
Q ss_pred ccccccccccCCC
Q psy18077 366 SFQCRDCGKHYST 378 (451)
Q Consensus 366 ~~~C~~C~~~f~~ 378 (451)
.|.|+.||+.|+.
T Consensus 23 rYkCkdCGY~ft~ 35 (59)
T PHA00626 23 DYVCCDCGYNDSK 35 (59)
T ss_pred ceEcCCCCCeech
Confidence 5888888887764
No 87
>PF15269 zf-C2H2_7: Zinc-finger
Probab=74.36 E-value=2 Score=26.38 Aligned_cols=21 Identities=19% Similarity=0.441 Sum_probs=13.2
Q ss_pred cccccccccCCChhhHHHHHh
Q psy18077 424 CQCPHCSKSFSRPWLLQGHIR 444 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~~ 444 (451)
|+|-.|..+...++.|-+||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 556666666666666666664
No 88
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=74.22 E-value=3.2 Score=34.50 Aligned_cols=19 Identities=16% Similarity=0.361 Sum_probs=11.2
Q ss_pred CCceecccCcccccChhhH
Q psy18077 321 VKPFVCEKCGKAFALKSYL 339 (451)
Q Consensus 321 ~~~~~C~~C~~~f~~~~~L 339 (451)
..-|.|+.|+..|+.-..+
T Consensus 107 ~~~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 107 NMFFICPNMCVRFTFNEAM 125 (158)
T ss_pred CCeEECCCCCcEeeHHHHH
Confidence 3456666666666655554
No 89
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=73.81 E-value=1.8 Score=35.95 Aligned_cols=16 Identities=13% Similarity=0.457 Sum_probs=7.7
Q ss_pred cccccccccCCChHHH
Q psy18077 367 FQCRDCGKHYSTSSNL 382 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L 382 (451)
|.|+.|+..|+...++
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 4455555444444444
No 90
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=73.52 E-value=1.7 Score=39.23 Aligned_cols=48 Identities=21% Similarity=0.329 Sum_probs=26.6
Q ss_pred ccCccccccCChHHHHHHHHHcCCccccccccccCCChhhHHHHHhhh
Q psy18077 399 TCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRTH 446 (451)
Q Consensus 399 ~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~~~~l~~H~~~h 446 (451)
.|-.|.-.|.....-..-..+-.+.|+|+.|...|-.-...-.|-..|
T Consensus 364 ~Cf~CQ~~fp~~~~~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 364 HCFVCQGPFPKPPVSPFDESTSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceeccCCCCCCCCCcccccccccceechhhhhhhhhhhHHHHHHHHh
Confidence 366676666543221111112233477777777777777766666555
No 91
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=73.32 E-value=2 Score=33.54 Aligned_cols=31 Identities=13% Similarity=0.348 Sum_probs=22.9
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCCh
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSM 410 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~ 410 (451)
...|+.||++|.... ..|..|++||..|.-.
T Consensus 9 Kr~Cp~cg~kFYDLn--------------k~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 9 KRICPNTGSKFYDLN--------------RRPAVSPYTGEQFPPE 39 (129)
T ss_pred cccCCCcCccccccC--------------CCCccCCCcCCccCcc
Confidence 357888888887642 5678888888887554
No 92
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=73.01 E-value=1.2 Score=39.14 Aligned_cols=14 Identities=21% Similarity=0.525 Sum_probs=9.2
Q ss_pred hcCCCceecccCcc
Q psy18077 318 HSKVKPFVCEKCGK 331 (451)
Q Consensus 318 h~~~~~~~C~~C~~ 331 (451)
.+|-+.|+|..|..
T Consensus 137 ~hGGrif~CsfC~~ 150 (314)
T PF06524_consen 137 DHGGRIFKCSFCDN 150 (314)
T ss_pred cCCCeEEEeecCCC
Confidence 34567777777764
No 93
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=72.73 E-value=4.3 Score=36.72 Aligned_cols=23 Identities=17% Similarity=0.254 Sum_probs=10.5
Q ss_pred CcccCccccccCChHHHHHHHHH
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRT 419 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~ 419 (451)
.|+|..|...|-.--....|...
T Consensus 388 rY~Ce~CK~~FC~dCdvfiHe~L 410 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIHETL 410 (421)
T ss_pred ceechhhhhhhhhhhHHHHHHHH
Confidence 35555555544444444444333
No 94
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=72.08 E-value=2 Score=36.45 Aligned_cols=17 Identities=18% Similarity=0.600 Sum_probs=9.7
Q ss_pred ccccccccccCCChHHH
Q psy18077 366 SFQCRDCGKHYSTSSNL 382 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L 382 (451)
-|.|+.|+..|+...++
T Consensus 117 ~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAM 133 (178)
T ss_pred EEECCCCCcEEeHHHHh
Confidence 36666666666555544
No 95
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=71.75 E-value=2.9 Score=25.08 Aligned_cols=11 Identities=18% Similarity=0.812 Sum_probs=6.0
Q ss_pred Ceecccccccc
Q psy18077 295 PFKCKVCDKAF 305 (451)
Q Consensus 295 ~~~C~~C~~~f 305 (451)
...|+.|+..|
T Consensus 25 ~v~C~~C~~~~ 35 (38)
T TIGR02098 25 KVRCGKCGHVW 35 (38)
T ss_pred EEECCCCCCEE
Confidence 35566665554
No 96
>KOG2186|consensus
Probab=71.65 E-value=2.6 Score=37.05 Aligned_cols=47 Identities=26% Similarity=0.598 Sum_probs=39.1
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChHHHHHHHH
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 418 (451)
.|.|..||....- ..|.+|+-.-++ .-|.|-.|++.|.. .++..|..
T Consensus 3 ~FtCnvCgEsvKK-p~vekH~srCrn----~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 3 FFTCNVCGESVKK-PQVEKHMSRCRN----AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EEehhhhhhhccc-cchHHHHHhccC----CeeEEeeccccccc-chhhhhhh
Confidence 4889999998765 457789988875 67999999999988 88999975
No 97
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=71.20 E-value=3.3 Score=24.76 Aligned_cols=32 Identities=25% Similarity=0.649 Sum_probs=15.5
Q ss_pred eccccccccCChHHHHHHHHHhcCCCceecccCcccc
Q psy18077 297 KCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAF 333 (451)
Q Consensus 297 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 333 (451)
.|+.|+..|.-... ++-.+....+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~-----~l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDD-----KLPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHH-----HcccCCcEEECCCCCcEe
Confidence 45666665554432 111233445666666554
No 98
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=70.71 E-value=3.8 Score=22.29 Aligned_cols=19 Identities=21% Similarity=0.420 Sum_probs=11.8
Q ss_pred ccccccccCCChHHHHHHHH
Q psy18077 368 QCRDCGKHYSTSSNLARHRQ 387 (451)
Q Consensus 368 ~C~~C~~~f~~~~~L~~H~~ 387 (451)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 567777766 4456666654
No 99
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=70.19 E-value=2.6 Score=34.96 Aligned_cols=23 Identities=39% Similarity=0.889 Sum_probs=16.3
Q ss_pred CeeccccccccCChHHHHHHHHHhcCCCceecccCc
Q psy18077 295 PFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCG 330 (451)
Q Consensus 295 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 330 (451)
.|.|++||.. +.++.|-+||+||
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCC
Confidence 6778888764 3456777788887
No 100
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=69.63 E-value=3.6 Score=25.74 Aligned_cols=26 Identities=23% Similarity=0.588 Sum_probs=16.9
Q ss_pred eeccccccccCChHHHHHHHHHhcCCCceecccCccc
Q psy18077 296 FKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKA 332 (451)
Q Consensus 296 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 332 (451)
|.|..||..|... ...+..|+.||..
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCce
Confidence 6777777766533 2355777777754
No 101
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=69.62 E-value=0.86 Score=37.43 Aligned_cols=16 Identities=31% Similarity=0.816 Sum_probs=10.8
Q ss_pred CcccCccccccCChHH
Q psy18077 397 ARTCPQCGKLYVSMPA 412 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~ 412 (451)
.++|+.||++|.+...
T Consensus 28 ~~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 28 RRECLACGKRFTTFER 43 (154)
T ss_pred eeeccccCCcceEeEe
Confidence 4777777777766543
No 102
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=69.51 E-value=4.1 Score=34.57 Aligned_cols=17 Identities=24% Similarity=0.573 Sum_probs=9.5
Q ss_pred CceecccCcccccChhh
Q psy18077 322 KPFVCEKCGKAFALKSY 338 (451)
Q Consensus 322 ~~~~C~~C~~~f~~~~~ 338 (451)
.-|.|+.|+..|+.-..
T Consensus 116 ~~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEA 132 (178)
T ss_pred CEEECCCCCcEEeHHHH
Confidence 34666666666655444
No 103
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=69.10 E-value=2.9 Score=34.70 Aligned_cols=13 Identities=31% Similarity=0.795 Sum_probs=8.6
Q ss_pred CCCcCeecccccc
Q psy18077 291 SSEKPFKCKVCDK 303 (451)
Q Consensus 291 ~~~~~~~C~~C~~ 303 (451)
.++.|-.|++||.
T Consensus 145 ~ge~P~~CPiCga 157 (166)
T COG1592 145 EGEAPEVCPICGA 157 (166)
T ss_pred cCCCCCcCCCCCC
Confidence 3466777777773
No 104
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.40 E-value=3.4 Score=32.25 Aligned_cols=11 Identities=18% Similarity=0.634 Sum_probs=5.5
Q ss_pred cccccccccCC
Q psy18077 148 QCRDCGKHYST 158 (451)
Q Consensus 148 ~C~~C~~~f~~ 158 (451)
.|+.||+.|..
T Consensus 11 ~Cp~cg~kFYD 21 (129)
T TIGR02300 11 ICPNTGSKFYD 21 (129)
T ss_pred cCCCcCccccc
Confidence 45555555533
No 105
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=68.07 E-value=3.7 Score=25.91 Aligned_cols=10 Identities=30% Similarity=1.281 Sum_probs=4.3
Q ss_pred eecccccccc
Q psy18077 260 FVCEKCGKAF 269 (451)
Q Consensus 260 ~~C~~C~~~f 269 (451)
|.|+.||..|
T Consensus 4 y~C~~CG~~~ 13 (46)
T PRK00398 4 YKCARCGREV 13 (46)
T ss_pred EECCCCCCEE
Confidence 4444444433
No 106
>KOG2807|consensus
Probab=67.46 E-value=7 Score=35.80 Aligned_cols=36 Identities=17% Similarity=0.400 Sum_probs=25.4
Q ss_pred cCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccChhhHHhcc
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFALKSYLYKHE 343 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 343 (451)
.-|.|+.|+...- .-|-.|+.|+........|.+-.
T Consensus 275 ~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 275 GGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred CceeCCcccCeee--------------cCCccCCccceeEecchHHHHHH
Confidence 4588888876532 34778888988888777776543
No 107
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=66.80 E-value=4.5 Score=25.26 Aligned_cols=10 Identities=40% Similarity=0.972 Sum_probs=4.1
Q ss_pred cccccccccC
Q psy18077 367 FQCRDCGKHY 376 (451)
Q Consensus 367 ~~C~~C~~~f 376 (451)
.+|.+|++.+
T Consensus 17 a~C~~C~~~~ 26 (45)
T PF02892_consen 17 AKCKYCGKVI 26 (45)
T ss_dssp EEETTTTEE-
T ss_pred EEeCCCCeEE
Confidence 3444444444
No 108
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=65.49 E-value=4.3 Score=26.97 Aligned_cols=10 Identities=40% Similarity=1.185 Sum_probs=6.6
Q ss_pred cccccccccc
Q psy18077 366 SFQCRDCGKH 375 (451)
Q Consensus 366 ~~~C~~C~~~ 375 (451)
.|.|+.||..
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 4677777764
No 109
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=64.67 E-value=1.6 Score=26.62 Aligned_cols=12 Identities=33% Similarity=1.301 Sum_probs=7.6
Q ss_pred cccccccccCCC
Q psy18077 367 FQCRDCGKHYST 378 (451)
Q Consensus 367 ~~C~~C~~~f~~ 378 (451)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (41)
T smart00834 6 YRCEDCGHTFEV 17 (41)
T ss_pred EEcCCCCCEEEE
Confidence 666667666643
No 110
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=64.18 E-value=3.7 Score=23.64 Aligned_cols=11 Identities=36% Similarity=1.075 Sum_probs=6.0
Q ss_pred CCcccCccccc
Q psy18077 396 KARTCPQCGKL 406 (451)
Q Consensus 396 ~~~~C~~C~~~ 406 (451)
.+.+|+.||..
T Consensus 16 ~~irC~~CG~R 26 (32)
T PF03604_consen 16 DPIRCPECGHR 26 (32)
T ss_dssp STSSBSSSS-S
T ss_pred CcEECCcCCCe
Confidence 34567776654
No 111
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=63.99 E-value=4.1 Score=26.02 Aligned_cols=30 Identities=20% Similarity=0.583 Sum_probs=20.1
Q ss_pred CcCeeccccccccCChHHHHHHHHHhcCCCceecccCccc
Q psy18077 293 EKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKA 332 (451)
Q Consensus 293 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 332 (451)
...|.|..||+.|... .......|+.||..
T Consensus 4 ~~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 4 MMEYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred eEEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 3468888888887211 23456788888865
No 112
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=63.83 E-value=9.8 Score=31.16 Aligned_cols=11 Identities=27% Similarity=1.053 Sum_probs=4.7
Q ss_pred cccccchhhcC
Q psy18077 232 FKCKVCDKAFA 242 (451)
Q Consensus 232 ~~C~~C~~~f~ 242 (451)
|.|+.|+..|.
T Consensus 100 Y~Cp~C~~~y~ 110 (147)
T smart00531 100 YKCPNCQSKYT 110 (147)
T ss_pred EECcCCCCEee
Confidence 44444444443
No 113
>KOG2186|consensus
Probab=63.71 E-value=3.3 Score=36.40 Aligned_cols=46 Identities=22% Similarity=0.509 Sum_probs=24.0
Q ss_pred cccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHH
Q psy18077 204 CQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQ 252 (451)
Q Consensus 204 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 252 (451)
|.|..||....- ..+.+|+...++ .-|.|--|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 345555554443 234446555555 33666666666655 45555544
No 114
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=63.38 E-value=3.8 Score=42.82 Aligned_cols=49 Identities=27% Similarity=0.828 Sum_probs=33.0
Q ss_pred cccccccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCC
Q psy18077 178 RTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKV 257 (451)
Q Consensus 178 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 257 (451)
..|..||..+ .|+.|+..+ . .|...+...|.+||.. ..
T Consensus 436 l~C~~Cg~v~-----------------~Cp~Cd~~l------t----~H~~~~~L~CH~Cg~~---------------~~ 473 (730)
T COG1198 436 LLCRDCGYIA-----------------ECPNCDSPL------T----LHKATGQLRCHYCGYQ---------------EP 473 (730)
T ss_pred eecccCCCcc-----------------cCCCCCcce------E----EecCCCeeEeCCCCCC---------------CC
Confidence 4688888764 688887643 2 2444456788888753 45
Q ss_pred CCeeccccccc
Q psy18077 258 KPFVCEKCGKA 268 (451)
Q Consensus 258 ~~~~C~~C~~~ 268 (451)
.|..|+.||..
T Consensus 474 ~p~~Cp~Cgs~ 484 (730)
T COG1198 474 IPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCC
Confidence 67788888854
No 115
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=63.37 E-value=1.9 Score=28.10 Aligned_cols=12 Identities=25% Similarity=1.171 Sum_probs=8.1
Q ss_pred cccccccccCCC
Q psy18077 367 FQCRDCGKHYST 378 (451)
Q Consensus 367 ~~C~~C~~~f~~ 378 (451)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 677777776653
No 116
>KOG2807|consensus
Probab=62.95 E-value=9 Score=35.13 Aligned_cols=125 Identities=20% Similarity=0.428 Sum_probs=70.1
Q ss_pred HhHHHHHhhhcCCCC-CCCcccccccccccCHHHHH------HHHHHcCCCcccCcCccccCCHHHHHHHHHHhcCCCCc
Q psy18077 160 SNLARHRQIHRSPDD-TKARTCPQCGKLYVSMPAYS------MHLRTHNQICQCPHCSKSFSRPWLLQGHIRTHTGEKPF 232 (451)
Q Consensus 160 ~~l~~H~~~h~~~~~-~~~~~C~~C~~~f~~~~~l~------~H~~~h~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~ 232 (451)
..|+.=+..|..|.. .+...|..=...|.+..... -|...-.+-|.|+.|+...- .-|.
T Consensus 226 ~HlkeLl~e~~~Pp~~~~~~~~sLvkmGFP~~~~e~~ps~C~CH~~~~~~Gy~CP~CkakvC--------------sLP~ 291 (378)
T KOG2807|consen 226 GHLKELLLEHTHPPPANKSKECSLVKMGFPSRSPEDTPSFCACHSELSGGGYFCPQCKAKVC--------------SLPI 291 (378)
T ss_pred HHHHHHHHhcCCCCCcccccCCceEEecCCCcccccCcchheeccccccCceeCCcccCeee--------------cCCc
Confidence 444444444543322 23335666666677433221 23222234678888875432 3578
Q ss_pred ccccchhhcCChHhHHHHHHHcCCCCCe------------ecccccccccChHHHHhchhhcCCCCcCCCCCCcCeeccc
Q psy18077 233 KCKVCDKAFADKSNLRAHIQTHSKVKPF------------VCEKCGKAFALKSYLYKHEDSSCLKSYDSSSSEKPFKCKV 300 (451)
Q Consensus 233 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~------------~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~C~~ 300 (451)
.|++|+-+......|.+-.+---.-++| .|-.|+- ...+...|+|..
T Consensus 292 eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~C~~---------------------~~~~~~~y~C~~ 350 (378)
T KOG2807|consen 292 ECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFACQG---------------------ELLSSGRYRCES 350 (378)
T ss_pred cCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceeeecc---------------------ccCCCCcEEchh
Confidence 8999988887777776543221111111 1555511 113456788999
Q ss_pred cccccCChHHHHHHHHHhc
Q psy18077 301 CDKAFADKSNLRAHIQTHS 319 (451)
Q Consensus 301 C~~~f~~~~~l~~H~~~h~ 319 (451)
|...|-..-..-.|-..|.
T Consensus 351 Ck~~FCldCDv~iHesLh~ 369 (378)
T KOG2807|consen 351 CKNVFCLDCDVFIHESLHN 369 (378)
T ss_pred ccceeeccchHHHHhhhhc
Confidence 9988887777778877774
No 117
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=62.58 E-value=1.2 Score=32.49 Aligned_cols=30 Identities=43% Similarity=0.900 Sum_probs=17.4
Q ss_pred CcccCccccccCChHHHHHHHHHcCCccccccccccCC
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFS 434 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~ 434 (451)
.|.|+.|++.-.. |.-.+.+.|..|++.|+
T Consensus 35 ky~Cp~Cgk~~vk--------R~a~GIW~C~~C~~~~A 64 (90)
T PF01780_consen 35 KYTCPFCGKTSVK--------RVATGIWKCKKCGKKFA 64 (90)
T ss_dssp -BEESSSSSSEEE--------EEETTEEEETTTTEEEE
T ss_pred CCcCCCCCCceeE--------EeeeEEeecCCCCCEEe
Confidence 4777777765321 22345577777777665
No 118
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=62.53 E-value=5.1 Score=25.79 Aligned_cols=24 Identities=50% Similarity=0.830 Sum_probs=18.5
Q ss_pred ccccccccccCCCh-----HHHHHHHH-hc
Q psy18077 366 SFQCRDCGKHYSTS-----SNLARHRQ-IH 389 (451)
Q Consensus 366 ~~~C~~C~~~f~~~-----~~L~~H~~-~H 389 (451)
.-.|..|++.+... +.|.+|++ .|
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 35799999988654 68888887 45
No 119
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=61.39 E-value=4.4 Score=25.04 Aligned_cols=14 Identities=36% Similarity=1.054 Sum_probs=11.2
Q ss_pred ceecccCcccccCh
Q psy18077 323 PFVCEKCGKAFALK 336 (451)
Q Consensus 323 ~~~C~~C~~~f~~~ 336 (451)
.|.|..||..|...
T Consensus 5 ey~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 5 EYRCEECGHEFEVL 18 (42)
T ss_pred EEEeCCCCCEEEEE
Confidence 48899999888654
No 120
>KOG4167|consensus
Probab=61.33 E-value=9.7 Score=38.92 Aligned_cols=26 Identities=35% Similarity=0.728 Sum_probs=14.6
Q ss_pred ccccccccccccCChHhHHHHHhhhc
Q psy18077 145 ESFQCRDCGKHYSTSSNLARHRQIHR 170 (451)
Q Consensus 145 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 170 (451)
..|.|..|++.|....++..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 34555556655555555555555553
No 121
>KOG2593|consensus
Probab=60.41 E-value=7.1 Score=37.38 Aligned_cols=19 Identities=32% Similarity=0.676 Sum_probs=9.8
Q ss_pred CCceecccCcccccChhhH
Q psy18077 321 VKPFVCEKCGKAFALKSYL 339 (451)
Q Consensus 321 ~~~~~C~~C~~~f~~~~~L 339 (451)
..-|.|+.|.+.|.....+
T Consensus 126 ~~~Y~Cp~C~kkyt~Lea~ 144 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLEAL 144 (436)
T ss_pred cccccCCccccchhhhHHH
Confidence 3445555555555554443
No 122
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=60.11 E-value=1.5 Score=33.44 Aligned_cols=13 Identities=31% Similarity=0.728 Sum_probs=7.8
Q ss_pred ceecccCcccccC
Q psy18077 323 PFVCEKCGKAFAL 335 (451)
Q Consensus 323 ~~~C~~C~~~f~~ 335 (451)
.+.|+.|++.+..
T Consensus 16 ~~~C~~C~~~~~~ 28 (104)
T TIGR01384 16 VYVCPSCGYEKEK 28 (104)
T ss_pred eEECcCCCCcccc
Confidence 4667777665543
No 123
>PF15269 zf-C2H2_7: Zinc-finger
Probab=58.76 E-value=7 Score=24.06 Aligned_cols=21 Identities=19% Similarity=0.259 Sum_probs=14.9
Q ss_pred cccCccccccCChHHHHHHHH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~ 418 (451)
|+|-.|.++...++.|-.||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 567777777777777777765
No 124
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=56.78 E-value=4.6 Score=32.18 Aligned_cols=22 Identities=41% Similarity=0.815 Sum_probs=9.4
Q ss_pred cccCccccccCChHHHHHHHHHcCC
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLRTHNQ 422 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~~H~~ 422 (451)
..|-.||+.|.. |++|++.|++
T Consensus 73 i~clecGk~~k~---LkrHL~~~~g 94 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHG 94 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-
T ss_pred eEEccCCcccch---HHHHHHHccC
Confidence 455566665543 3555555543
No 125
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=56.65 E-value=2.3 Score=29.77 Aligned_cols=15 Identities=13% Similarity=0.731 Sum_probs=8.0
Q ss_pred CCCcccC--ccccccCC
Q psy18077 395 TKARTCP--QCGKLYVS 409 (451)
Q Consensus 395 ~~~~~C~--~C~~~f~~ 409 (451)
+.-++|. .||.+|..
T Consensus 25 ~~Y~qC~N~eCg~tF~t 41 (72)
T PRK09678 25 ERYHQCQNVNCSATFIT 41 (72)
T ss_pred eeeeecCCCCCCCEEEE
Confidence 3445565 56665554
No 126
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=56.05 E-value=2.9 Score=30.63 Aligned_cols=31 Identities=35% Similarity=0.739 Sum_probs=16.2
Q ss_pred CcccCccccccCChHHHHHHHHHcCCccccccccccCCC
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSR 435 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~ 435 (451)
.|.|+.|++.-. .|.-.+++.|..|++.|+-
T Consensus 35 ~y~CpfCgk~~v--------kR~a~GIW~C~~C~~~~AG 65 (91)
T TIGR00280 35 KYVCPFCGKKTV--------KRGSTGIWTCRKCGAKFAG 65 (91)
T ss_pred CccCCCCCCCce--------EEEeeEEEEcCCCCCEEeC
Confidence 466666664321 1223455666666666653
No 127
>KOG4377|consensus
Probab=55.94 E-value=8.7 Score=36.35 Aligned_cols=117 Identities=23% Similarity=0.430 Sum_probs=63.7
Q ss_pred CcCeecc--ccccccCChHHHHHHHHHhcCC------------Cceec--ccCcccccChhhHHhcccccCCCccCCCCC
Q psy18077 293 EKPFKCK--VCDKAFADKSNLRAHIQTHSKV------------KPFVC--EKCGKAFALKSYLYKHEDSSCLKSYDSSSN 356 (451)
Q Consensus 293 ~~~~~C~--~C~~~f~~~~~l~~H~~~h~~~------------~~~~C--~~C~~~f~~~~~L~~H~~~~~~~~~~~~~~ 356 (451)
...|.|. .|+..+-.+..+.+|+.+|... ..|.| ..|.+ +.+.+..|..-|..+.+
T Consensus 269 rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n----- 340 (480)
T KOG4377|consen 269 REHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRN----- 340 (480)
T ss_pred chhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCcccccccc-----
Confidence 3446663 4888777788888888887532 12566 35777 44455556543322221
Q ss_pred cccCcccccccccccccccCCChHHHHHHHHhccCCC-----CC-C-------------------CcccC--ccccccCC
Q psy18077 357 GHLNKFTTISFQCRDCGKHYSTSSNLARHRQIHRSPD-----DT-K-------------------ARTCP--QCGKLYVS 409 (451)
Q Consensus 357 ~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~-----~~-~-------------------~~~C~--~C~~~f~~ 409 (451)
....+.-|-|..|+=++..+ ...|...|-.++ +. + -|.|. -|+..+.+
T Consensus 341 ---~GfrrthfhC~r~gCTdtfK--~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s 415 (480)
T KOG4377|consen 341 ---NGFRRTHFHCQRIGCTDTFK--DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYS 415 (480)
T ss_pred ---CceecceeEEeccCCccccc--cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEe
Confidence 11222357888877443333 444544443221 00 0 13332 26677777
Q ss_pred hHHHHHHHHHcCC
Q psy18077 410 MPAYSMHLRTHNQ 422 (451)
Q Consensus 410 ~~~l~~H~~~H~~ 422 (451)
.+++..|.|.|.+
T Consensus 416 ~sqm~shkrkheR 428 (480)
T KOG4377|consen 416 VSQMASHKRKHER 428 (480)
T ss_pred hhhhhhhhhhhhh
Confidence 7777777777754
No 128
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=55.18 E-value=6.6 Score=24.95 Aligned_cols=13 Identities=23% Similarity=0.838 Sum_probs=9.9
Q ss_pred eecccCcccccCh
Q psy18077 324 FVCEKCGKAFALK 336 (451)
Q Consensus 324 ~~C~~C~~~f~~~ 336 (451)
|+|..|++.|.-.
T Consensus 2 y~C~~CgyvYd~~ 14 (47)
T PF00301_consen 2 YQCPVCGYVYDPE 14 (47)
T ss_dssp EEETTTSBEEETT
T ss_pred cCCCCCCEEEcCC
Confidence 7888888877644
No 129
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=54.85 E-value=6.4 Score=25.39 Aligned_cols=12 Identities=25% Similarity=0.852 Sum_probs=9.2
Q ss_pred eecccCcccccC
Q psy18077 324 FVCEKCGKAFAL 335 (451)
Q Consensus 324 ~~C~~C~~~f~~ 335 (451)
|+|..||+.|.-
T Consensus 2 y~C~~CgyiYd~ 13 (50)
T cd00730 2 YECRICGYIYDP 13 (50)
T ss_pred cCCCCCCeEECC
Confidence 788888887764
No 130
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=54.31 E-value=3.7 Score=30.05 Aligned_cols=13 Identities=31% Similarity=0.782 Sum_probs=7.6
Q ss_pred CccccccccccCC
Q psy18077 422 QICQCPHCSKSFS 434 (451)
Q Consensus 422 ~~~~C~~C~~~f~ 434 (451)
+++.|..|++.|+
T Consensus 53 GIW~C~~C~~~~A 65 (90)
T PTZ00255 53 GIWRCKGCKKTVA 65 (90)
T ss_pred EEEEcCCCCCEEe
Confidence 4456666666554
No 131
>KOG1280|consensus
Probab=54.03 E-value=8 Score=35.76 Aligned_cols=38 Identities=26% Similarity=0.544 Sum_probs=25.7
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCCCcccCccc
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCG 404 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~ 404 (451)
.|.|++|++.=.+...|..|+..-|... .....|++|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda-~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEA-STSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCccc-Ccceeeeccc
Confidence 6888888888888888888876655421 2334566664
No 132
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=51.73 E-value=7.3 Score=25.16 Aligned_cols=12 Identities=25% Similarity=0.595 Sum_probs=7.3
Q ss_pred CCcccCcccccc
Q psy18077 396 KARTCPQCGKLY 407 (451)
Q Consensus 396 ~~~~C~~C~~~f 407 (451)
..+.|..|+..+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 356677776654
No 133
>KOG2272|consensus
Probab=50.67 E-value=2.1 Score=37.44 Aligned_cols=17 Identities=29% Similarity=0.653 Sum_probs=11.2
Q ss_pred CceecccCcccccChhh
Q psy18077 322 KPFVCEKCGKAFALKSY 338 (451)
Q Consensus 322 ~~~~C~~C~~~f~~~~~ 338 (451)
.-|.|..|++....+..
T Consensus 279 ~cf~Cs~Cdkkl~~K~K 295 (332)
T KOG2272|consen 279 ECFSCSTCDKKLTQKNK 295 (332)
T ss_pred ccccccccccccccccc
Confidence 34788888877665543
No 134
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=49.95 E-value=4.6 Score=35.20 Aligned_cols=24 Identities=29% Similarity=0.599 Sum_probs=13.0
Q ss_pred CCcccCccccccCChHHHHHHHHH
Q psy18077 396 KARTCPQCGKLYVSMPAYSMHLRT 419 (451)
Q Consensus 396 ~~~~C~~C~~~f~~~~~l~~H~~~ 419 (451)
..|.|..|+|.|.-..-.++|+..
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~n 99 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFN 99 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHH
T ss_pred CEECCCCCCcccCChHHHHHHHhh
Confidence 346666666666666666666554
No 135
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=49.94 E-value=8.1 Score=29.07 Aligned_cols=28 Identities=21% Similarity=0.273 Sum_probs=20.6
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccC
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYV 408 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~ 408 (451)
-.|+.|++.|.... ..|..|++||++|.
T Consensus 10 ridPetg~KFYDLN--------------rdPiVsPytG~s~P 37 (129)
T COG4530 10 RIDPETGKKFYDLN--------------RDPIVSPYTGKSYP 37 (129)
T ss_pred ccCccccchhhccC--------------CCccccCcccccch
Confidence 36888888886532 56888888888883
No 136
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=49.54 E-value=4.2 Score=29.80 Aligned_cols=31 Identities=32% Similarity=0.784 Sum_probs=16.9
Q ss_pred CcccCccccccCChHHHHHHHHHcCCccccccccccCCC
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSR 435 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~f~~ 435 (451)
.|.|+.|++.-. .|.-.+++.|..|++.|+-
T Consensus 36 ~y~CpfCgk~~v--------kR~a~GIW~C~~C~~~~AG 66 (90)
T PRK03976 36 KHVCPVCGRPKV--------KRVGTGIWECRKCGAKFAG 66 (90)
T ss_pred CccCCCCCCCce--------EEEEEEEEEcCCCCCEEeC
Confidence 466777754321 1223455667777766653
No 137
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=49.25 E-value=10 Score=30.22 Aligned_cols=23 Identities=39% Similarity=0.852 Sum_probs=15.8
Q ss_pred ccccccccccCCChHHHHHHHHhccC
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQIHRS 391 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~H~~ 391 (451)
...|-+||+.|.. |++|++.|+|
T Consensus 72 ~i~clecGk~~k~---LkrHL~~~~g 94 (132)
T PF05443_consen 72 YIICLECGKKFKT---LKRHLRTHHG 94 (132)
T ss_dssp -EE-TBT--EESB---HHHHHHHTT-
T ss_pred eeEEccCCcccch---HHHHHHHccC
Confidence 4789999999876 5899999986
No 138
>PF14353 CpXC: CpXC protein
Probab=49.08 E-value=13 Score=29.56 Aligned_cols=24 Identities=25% Similarity=0.389 Sum_probs=13.9
Q ss_pred CCcccCccccccCChHHHHHHHHH
Q psy18077 396 KARTCPQCGKLYVSMPAYSMHLRT 419 (451)
Q Consensus 396 ~~~~C~~C~~~f~~~~~l~~H~~~ 419 (451)
..|.|+.||..|.-...+..|-..
T Consensus 37 ~~~~CP~Cg~~~~~~~p~lY~D~~ 60 (128)
T PF14353_consen 37 FSFTCPSCGHKFRLEYPLLYHDPE 60 (128)
T ss_pred CEEECCCCCCceecCCCEEEEcCC
Confidence 456777777776655555444433
No 139
>PRK04023 DNA polymerase II large subunit; Validated
Probab=47.84 E-value=14 Score=39.66 Aligned_cols=10 Identities=40% Similarity=0.886 Sum_probs=6.9
Q ss_pred ceecccCccc
Q psy18077 323 PFVCEKCGKA 332 (451)
Q Consensus 323 ~~~C~~C~~~ 332 (451)
...|+.||..
T Consensus 626 ~RfCpsCG~~ 635 (1121)
T PRK04023 626 RRKCPSCGKE 635 (1121)
T ss_pred CccCCCCCCc
Confidence 4567777765
No 140
>PRK04023 DNA polymerase II large subunit; Validated
Probab=47.21 E-value=16 Score=39.16 Aligned_cols=10 Identities=20% Similarity=0.986 Sum_probs=4.3
Q ss_pred cccccccccC
Q psy18077 424 CQCPHCSKSF 433 (451)
Q Consensus 424 ~~C~~C~~~f 433 (451)
|+|.-|+..|
T Consensus 1038 fRC~kC~~kY 1047 (1121)
T PRK04023 1038 FRCTKCGAKY 1047 (1121)
T ss_pred eeecccCccc
Confidence 3444444443
No 141
>KOG2272|consensus
Probab=47.14 E-value=9.2 Score=33.57 Aligned_cols=77 Identities=18% Similarity=0.450 Sum_probs=38.0
Q ss_pred CCeecccccccccChHHHHhchhhcCCCCc--CCCCCCcCeeccccccccCChHHHHHHHHHhcCCCceecccCcccccC
Q psy18077 258 KPFVCEKCGKAFALKSYLYKHEDSSCLKSY--DSSSSEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFAL 335 (451)
Q Consensus 258 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~--~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 335 (451)
.-|.|.+|.+...+...++.--+..|..-- ....+...|.|..|...... ..|.---..-+ .--|+|..|++...+
T Consensus 98 ~CF~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn~k~Ka~~~g~YvC~KCh~~iD~-~~l~fr~d~yH-~yHFkCt~C~keL~s 175 (332)
T KOG2272|consen 98 ACFRCDLCNKHLADQGFYRNQGRALCRECNQKEKAKGRGRYVCQKCHAHIDE-QPLTFRGDPYH-PYHFKCTTCGKELTS 175 (332)
T ss_pred ccchhHHHHHHHhhhhhHhhcchHHhhhhhhhhcccccceeehhhhhhhccc-ccccccCCCCC-ccceecccccccccc
Confidence 348888888888777766654433221110 01133446777777654332 11111000000 123678888776554
Q ss_pred h
Q psy18077 336 K 336 (451)
Q Consensus 336 ~ 336 (451)
.
T Consensus 176 d 176 (332)
T KOG2272|consen 176 D 176 (332)
T ss_pred h
Confidence 3
No 142
>KOG4167|consensus
Probab=46.09 E-value=5 Score=40.88 Aligned_cols=26 Identities=27% Similarity=0.736 Sum_probs=23.8
Q ss_pred CCcccCccccccCChHHHHHHHHHcC
Q psy18077 396 KARTCPQCGKLYVSMPAYSMHLRTHN 421 (451)
Q Consensus 396 ~~~~C~~C~~~f~~~~~l~~H~~~H~ 421 (451)
..|.|..|++.|..-.++..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 35999999999999999999999985
No 143
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=44.11 E-value=11 Score=29.66 Aligned_cols=15 Identities=33% Similarity=0.922 Sum_probs=9.5
Q ss_pred ccccccccccCCChH
Q psy18077 366 SFQCRDCGKHYSTSS 380 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~ 380 (451)
|++|..||+.|..-+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 456777777776544
No 144
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=43.22 E-value=2.6 Score=43.03 Aligned_cols=44 Identities=20% Similarity=0.410 Sum_probs=20.7
Q ss_pred ccccccccCChHHHHHHHHHhcCCCce-ecccCcccccChhhHHhc
Q psy18077 298 CKVCDKAFADKSNLRAHIQTHSKVKPF-VCEKCGKAFALKSYLYKH 342 (451)
Q Consensus 298 C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~L~~H 342 (451)
|..||-+|+-...|-.- |-++.-+.| .|+.|.+.|.++.+-+-|
T Consensus 126 CT~CGPRfTIi~alPYD-R~nTsM~~F~lC~~C~~EY~dP~nRRfH 170 (750)
T COG0068 126 CTNCGPRFTIIEALPYD-RENTSMADFPLCPFCDKEYKDPLNRRFH 170 (750)
T ss_pred cCCCCcceeeeccCCCC-cccCccccCcCCHHHHHHhcCccccccc
Confidence 66666666544333111 112222222 366676666665544444
No 145
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=42.48 E-value=14 Score=26.86 Aligned_cols=14 Identities=43% Similarity=0.930 Sum_probs=7.7
Q ss_pred CCeecccccccccC
Q psy18077 258 KPFVCEKCGKAFAL 271 (451)
Q Consensus 258 ~~~~C~~C~~~f~~ 271 (451)
.|-.|..||+.|..
T Consensus 57 ~Pa~CkkCGfef~~ 70 (97)
T COG3357 57 RPARCKKCGFEFRD 70 (97)
T ss_pred cChhhcccCccccc
Confidence 34556666665544
No 146
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=42.07 E-value=15 Score=24.45 Aligned_cols=6 Identities=33% Similarity=0.938 Sum_probs=2.1
Q ss_pred Ceeccc
Q psy18077 295 PFKCKV 300 (451)
Q Consensus 295 ~~~C~~ 300 (451)
+..|..
T Consensus 38 ~v~C~~ 43 (60)
T PF02176_consen 38 PVPCPY 43 (60)
T ss_dssp EEE-SS
T ss_pred cEECCC
Confidence 334444
No 147
>KOG2593|consensus
Probab=41.06 E-value=27 Score=33.65 Aligned_cols=14 Identities=21% Similarity=0.710 Sum_probs=6.5
Q ss_pred CcccccchhhcCCh
Q psy18077 231 PFKCKVCDKAFADK 244 (451)
Q Consensus 231 ~~~C~~C~~~f~~~ 244 (451)
.|.|+.|.+.|...
T Consensus 128 ~Y~Cp~C~kkyt~L 141 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSL 141 (436)
T ss_pred cccCCccccchhhh
Confidence 34455554444443
No 148
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=40.13 E-value=13 Score=37.62 Aligned_cols=48 Identities=31% Similarity=0.802 Sum_probs=31.1
Q ss_pred cccccccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCC
Q psy18077 178 RTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKV 257 (451)
Q Consensus 178 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 257 (451)
..|..||... .|+.|+-.. ..|...+...|..||... .
T Consensus 214 ~~C~~Cg~~~-----------------~C~~C~~~l----------~~h~~~~~l~Ch~Cg~~~---------------~ 251 (505)
T TIGR00595 214 LLCRSCGYIL-----------------CCPNCDVSL----------TYHKKEGKLRCHYCGYQE---------------P 251 (505)
T ss_pred eEhhhCcCcc-----------------CCCCCCCce----------EEecCCCeEEcCCCcCcC---------------C
Confidence 4788888753 688887532 234445567888887543 3
Q ss_pred CCeecccccc
Q psy18077 258 KPFVCEKCGK 267 (451)
Q Consensus 258 ~~~~C~~C~~ 267 (451)
.|..|+.|+.
T Consensus 252 ~~~~Cp~C~s 261 (505)
T TIGR00595 252 IPKTCPQCGS 261 (505)
T ss_pred CCCCCCCCCC
Confidence 4567888875
No 149
>KOG2071|consensus
Probab=39.83 E-value=15 Score=36.79 Aligned_cols=27 Identities=30% Similarity=0.698 Sum_probs=19.2
Q ss_pred CCCcccccchhhcCChHhHHHHHHHcC
Q psy18077 229 EKPFKCKVCDKAFADKSNLRAHIQTHS 255 (451)
Q Consensus 229 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 255 (451)
..+-+|..||.+|........||..|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 345678888888888777777766553
No 150
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=37.81 E-value=18 Score=28.54 Aligned_cols=21 Identities=33% Similarity=0.512 Sum_probs=11.2
Q ss_pred cccCccccccCChHHHHHHHHHcC
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLRTHN 421 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~~H~ 421 (451)
..|-.+|+.|. +|++|+.+|.
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~ 97 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHY 97 (148)
T ss_pred EEEeccCcchH---HHHHHHhccc
Confidence 34555565553 4555555554
No 151
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=37.76 E-value=17 Score=20.58 Aligned_cols=11 Identities=36% Similarity=0.966 Sum_probs=6.1
Q ss_pred CCceecccCcc
Q psy18077 321 VKPFVCEKCGK 331 (451)
Q Consensus 321 ~~~~~C~~C~~ 331 (451)
...|.|+.|+.
T Consensus 17 ~~~~vCp~C~~ 27 (30)
T PF08274_consen 17 GELLVCPECGH 27 (30)
T ss_dssp SSSEEETTTTE
T ss_pred CCEEeCCcccc
Confidence 34466666654
No 152
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=37.45 E-value=32 Score=38.07 Aligned_cols=10 Identities=20% Similarity=0.747 Sum_probs=7.1
Q ss_pred cccccccccC
Q psy18077 367 FQCRDCGKHY 376 (451)
Q Consensus 367 ~~C~~C~~~f 376 (451)
..|+.|+...
T Consensus 710 ~~CP~CGtpl 719 (1337)
T PRK14714 710 VECPRCDVEL 719 (1337)
T ss_pred ccCCCCCCcc
Confidence 4688888544
No 153
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=36.90 E-value=15 Score=19.30 Aligned_cols=6 Identities=67% Similarity=2.106 Sum_probs=2.6
Q ss_pred cCcccc
Q psy18077 400 CPQCGK 405 (451)
Q Consensus 400 C~~C~~ 405 (451)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 444443
No 154
>KOG4377|consensus
Probab=36.85 E-value=18 Score=34.38 Aligned_cols=24 Identities=29% Similarity=0.701 Sum_probs=19.4
Q ss_pred cccc--cccccCCChHHHHHHHHhcc
Q psy18077 367 FQCR--DCGKHYSTSSNLARHRQIHR 390 (451)
Q Consensus 367 ~~C~--~C~~~f~~~~~L~~H~~~H~ 390 (451)
|-|. -|+..|.+.+.+..|.+.|-
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKHE 427 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhhh
Confidence 5564 48899999999999988874
No 155
>PF12907 zf-met2: Zinc-binding
Probab=36.68 E-value=13 Score=22.75 Aligned_cols=7 Identities=29% Similarity=0.933 Sum_probs=3.2
Q ss_pred ccccccc
Q psy18077 425 QCPHCSK 431 (451)
Q Consensus 425 ~C~~C~~ 431 (451)
.|.+|-.
T Consensus 3 ~C~iC~q 9 (40)
T PF12907_consen 3 ICKICRQ 9 (40)
T ss_pred CcHHhhH
Confidence 3444443
No 156
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=36.68 E-value=16 Score=19.48 Aligned_cols=9 Identities=44% Similarity=1.206 Sum_probs=4.7
Q ss_pred cCeeccccc
Q psy18077 294 KPFKCKVCD 302 (451)
Q Consensus 294 ~~~~C~~C~ 302 (451)
.+|.|+.||
T Consensus 15 v~f~CPnCG 23 (24)
T PF07754_consen 15 VPFPCPNCG 23 (24)
T ss_pred ceEeCCCCC
Confidence 345555554
No 157
>PRK14873 primosome assembly protein PriA; Provisional
Probab=36.47 E-value=16 Score=38.13 Aligned_cols=38 Identities=24% Similarity=0.788 Sum_probs=22.5
Q ss_pred ccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCCCCeeccccccc
Q psy18077 205 QCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKA 268 (451)
Q Consensus 205 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 268 (451)
.|+.|+-... .|...+...|..||.. ..|+.|+.|+..
T Consensus 394 ~C~~C~~~L~----------~h~~~~~l~Ch~CG~~----------------~~p~~Cp~Cgs~ 431 (665)
T PRK14873 394 RCRHCTGPLG----------LPSAGGTPRCRWCGRA----------------APDWRCPRCGSD 431 (665)
T ss_pred ECCCCCCcee----------EecCCCeeECCCCcCC----------------CcCccCCCCcCC
Confidence 5777765432 2334455778888752 135678888754
No 158
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=36.08 E-value=17 Score=22.50 Aligned_cols=11 Identities=27% Similarity=0.833 Sum_probs=7.2
Q ss_pred ccccccccccC
Q psy18077 366 SFQCRDCGKHY 376 (451)
Q Consensus 366 ~~~C~~C~~~f 376 (451)
.+.|+.||...
T Consensus 19 ~~vC~~CG~Vl 29 (43)
T PF08271_consen 19 ELVCPNCGLVL 29 (43)
T ss_dssp EEEETTT-BBE
T ss_pred eEECCCCCCEe
Confidence 47888888654
No 159
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=34.76 E-value=33 Score=30.29 Aligned_cols=19 Identities=16% Similarity=0.614 Sum_probs=14.3
Q ss_pred ccccccccccccccCCChH
Q psy18077 362 FTTISFQCRDCGKHYSTSS 380 (451)
Q Consensus 362 ~~~~~~~C~~C~~~f~~~~ 380 (451)
.+...|.|+.|+..|....
T Consensus 151 wG~aef~C~~C~h~F~G~~ 169 (278)
T PF15135_consen 151 WGIAEFHCPKCRHNFRGFA 169 (278)
T ss_pred cceeeeecccccccchhhh
Confidence 3344699999999998654
No 160
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=34.71 E-value=30 Score=38.29 Aligned_cols=9 Identities=22% Similarity=0.837 Sum_probs=4.4
Q ss_pred cccccchhh
Q psy18077 232 FKCKVCDKA 240 (451)
Q Consensus 232 ~~C~~C~~~ 240 (451)
|.|+.|+..
T Consensus 693 y~CPsCGae 701 (1337)
T PRK14714 693 YVCPDCGAE 701 (1337)
T ss_pred eeCccCCCc
Confidence 445555543
No 161
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=34.64 E-value=25 Score=26.24 Aligned_cols=13 Identities=23% Similarity=0.968 Sum_probs=6.8
Q ss_pred CcccccccccccC
Q psy18077 177 ARTCPQCGKLYVS 189 (451)
Q Consensus 177 ~~~C~~C~~~f~~ 189 (451)
+++|..||..|.+
T Consensus 2 pH~CtrCG~vf~~ 14 (112)
T COG3364 2 PHQCTRCGEVFDD 14 (112)
T ss_pred Cceeccccccccc
Confidence 3455555555554
No 162
>KOG3408|consensus
Probab=34.10 E-value=23 Score=27.43 Aligned_cols=22 Identities=18% Similarity=0.606 Sum_probs=13.2
Q ss_pred CcccCccccccCChHHHHHHHH
Q psy18077 397 ARTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 397 ~~~C~~C~~~f~~~~~l~~H~~ 418 (451)
.|.|-.|.+-|.+...|..|.+
T Consensus 57 qfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 57 QFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred eeehhhhhhhhcchHHHHHHHh
Confidence 4556666666666666666654
No 163
>PF10446 DUF2457: Protein of unknown function (DUF2457); InterPro: IPR018853 This entry represents a family of uncharacterised proteins.
Probab=33.26 E-value=58 Score=31.57 Aligned_cols=11 Identities=18% Similarity=0.262 Sum_probs=6.0
Q ss_pred hHHHHHhhccc
Q psy18077 3 VIDQLVSASNR 13 (451)
Q Consensus 3 ~~~~~~~~~~~ 13 (451)
.|-.|+++...
T Consensus 37 ~IRkLgeEaEE 47 (458)
T PF10446_consen 37 AIRKLGEEAEE 47 (458)
T ss_pred HHhhhhHHHHH
Confidence 35567665443
No 164
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=33.23 E-value=27 Score=30.49 Aligned_cols=26 Identities=19% Similarity=0.415 Sum_probs=20.4
Q ss_pred cccccccccccCCChHHHHHHHHhcc
Q psy18077 365 ISFQCRDCGKHYSTSSNLARHRQIHR 390 (451)
Q Consensus 365 ~~~~C~~C~~~f~~~~~L~~H~~~H~ 390 (451)
..|.|++|+|.|.-..-..+|+..-|
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred CEECCCCCCcccCChHHHHHHHhhcC
Confidence 36999999999999999999987644
No 165
>KOG1280|consensus
Probab=32.18 E-value=35 Score=31.73 Aligned_cols=36 Identities=25% Similarity=0.668 Sum_probs=27.9
Q ss_pred CCCcccCccccccCChHHHHHHHHHcCCc----ccccccc
Q psy18077 395 TKARTCPQCGKLYVSMPAYSMHLRTHNQI----CQCPHCS 430 (451)
Q Consensus 395 ~~~~~C~~C~~~f~~~~~l~~H~~~H~~~----~~C~~C~ 430 (451)
...|.|++|++.=.+...|..|+..-+.. -.|++|.
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 45799999999988999999998765533 4566664
No 166
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=32.11 E-value=27 Score=22.25 Aligned_cols=10 Identities=30% Similarity=0.817 Sum_probs=6.5
Q ss_pred CcccCccccc
Q psy18077 397 ARTCPQCGKL 406 (451)
Q Consensus 397 ~~~C~~C~~~ 406 (451)
.|.|..|+++
T Consensus 37 R~~CGkCgyT 46 (51)
T COG1998 37 RWACGKCGYT 46 (51)
T ss_pred eeEeccccce
Confidence 5667777654
No 167
>KOG3408|consensus
Probab=31.60 E-value=31 Score=26.70 Aligned_cols=27 Identities=30% Similarity=0.635 Sum_probs=23.6
Q ss_pred CCCcCeeccccccccCChHHHHHHHHH
Q psy18077 291 SSEKPFKCKVCDKAFADKSNLRAHIQT 317 (451)
Q Consensus 291 ~~~~~~~C~~C~~~f~~~~~l~~H~~~ 317 (451)
.|...|.|-.|.+-|.+...|..|.++
T Consensus 53 PG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 53 PGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCceeehhhhhhhhcchHHHHHHHhc
Confidence 566779999999999999999999875
No 168
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=31.37 E-value=27 Score=22.90 Aligned_cols=10 Identities=40% Similarity=1.208 Sum_probs=4.9
Q ss_pred cccccccccC
Q psy18077 148 QCRDCGKHYS 157 (451)
Q Consensus 148 ~C~~C~~~f~ 157 (451)
.|..|++.|.
T Consensus 7 ~C~~Cg~~~~ 16 (54)
T PF14446_consen 7 KCPVCGKKFK 16 (54)
T ss_pred cChhhCCccc
Confidence 4555555543
No 169
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=31.31 E-value=31 Score=22.60 Aligned_cols=14 Identities=29% Similarity=0.928 Sum_probs=7.8
Q ss_pred ccccccccccCCCh
Q psy18077 366 SFQCRDCGKHYSTS 379 (451)
Q Consensus 366 ~~~C~~C~~~f~~~ 379 (451)
-++|+.||..|...
T Consensus 28 ~W~C~~Cgh~w~~~ 41 (55)
T PF14311_consen 28 WWKCPKCGHEWKAS 41 (55)
T ss_pred EEECCCCCCeeEcc
Confidence 35666666555443
No 170
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=31.09 E-value=19 Score=28.40 Aligned_cols=11 Identities=36% Similarity=0.915 Sum_probs=6.2
Q ss_pred cCccccccCCh
Q psy18077 400 CPQCGKLYVSM 410 (451)
Q Consensus 400 C~~C~~~f~~~ 410 (451)
|+.|..+|.+.
T Consensus 124 CPvCkTSFKss 134 (140)
T PF05290_consen 124 CPVCKTSFKSS 134 (140)
T ss_pred CCccccccccc
Confidence 66666655543
No 171
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=30.81 E-value=36 Score=35.84 Aligned_cols=10 Identities=30% Similarity=0.683 Sum_probs=4.8
Q ss_pred Cccccccccc
Q psy18077 177 ARTCPQCGKL 186 (451)
Q Consensus 177 ~~~C~~C~~~ 186 (451)
...|.+||..
T Consensus 462 ~L~CH~Cg~~ 471 (730)
T COG1198 462 QLRCHYCGYQ 471 (730)
T ss_pred eeEeCCCCCC
Confidence 3445555544
No 172
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=30.57 E-value=48 Score=20.31 Aligned_cols=24 Identities=29% Similarity=0.638 Sum_probs=18.0
Q ss_pred ccccccccccCC--hHhHHHHHhhhc
Q psy18077 147 FQCRDCGKHYST--SSNLARHRQIHR 170 (451)
Q Consensus 147 ~~C~~C~~~f~~--~~~l~~H~~~h~ 170 (451)
-.|+.||..|.. ..+-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 589999998854 566777877764
No 173
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=28.22 E-value=30 Score=22.72 Aligned_cols=9 Identities=22% Similarity=0.715 Sum_probs=4.2
Q ss_pred eeccccccc
Q psy18077 296 FKCKVCDKA 304 (451)
Q Consensus 296 ~~C~~C~~~ 304 (451)
..|+.||..
T Consensus 23 V~Cp~CGae 31 (54)
T TIGR01206 23 VICDECGAE 31 (54)
T ss_pred EeCCCCCCE
Confidence 345555443
No 174
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=28.13 E-value=28 Score=23.99 Aligned_cols=30 Identities=30% Similarity=0.669 Sum_probs=18.1
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCC
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVS 409 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~ 409 (451)
-.|+.||..-.. . ...+.|.|+.||..+..
T Consensus 29 q~C~~CG~~~~~-----------~--~~~r~~~C~~Cg~~~~r 58 (69)
T PF07282_consen 29 QTCPRCGHRNKK-----------R--RSGRVFTCPNCGFEMDR 58 (69)
T ss_pred cCccCccccccc-----------c--cccceEEcCCCCCEECc
Confidence 467777765443 0 12456778888776543
No 175
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=28.11 E-value=15 Score=21.77 Aligned_cols=31 Identities=19% Similarity=0.492 Sum_probs=15.5
Q ss_pred eeccccccccCChHHHHHHHHHhcCCCceecccCccccc
Q psy18077 296 FKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFA 334 (451)
Q Consensus 296 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 334 (451)
+.|+.||..|.... -.....-.|+.||-.+.
T Consensus 2 r~C~~Cg~~Yh~~~--------~pP~~~~~Cd~cg~~L~ 32 (36)
T PF05191_consen 2 RICPKCGRIYHIEF--------NPPKVEGVCDNCGGELV 32 (36)
T ss_dssp EEETTTTEEEETTT--------B--SSTTBCTTTTEBEB
T ss_pred cCcCCCCCcccccc--------CCCCCCCccCCCCCeeE
Confidence 35666766664321 12334456666665444
No 176
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=28.02 E-value=20 Score=21.82 Aligned_cols=14 Identities=29% Similarity=0.985 Sum_probs=10.2
Q ss_pred ccccccccccccCC
Q psy18077 364 TISFQCRDCGKHYS 377 (451)
Q Consensus 364 ~~~~~C~~C~~~f~ 377 (451)
.+-|.|..|+..+.
T Consensus 26 T~fy~C~~C~~~w~ 39 (40)
T smart00440 26 TVFYVCTKCGHRWR 39 (40)
T ss_pred eEEEEeCCCCCEeC
Confidence 34688999987764
No 177
>KOG0957|consensus
Probab=27.58 E-value=97 Score=30.52 Aligned_cols=11 Identities=27% Similarity=0.739 Sum_probs=7.9
Q ss_pred ccccccccccC
Q psy18077 366 SFQCRDCGKHY 376 (451)
Q Consensus 366 ~~~C~~C~~~f 376 (451)
.|.|.+|.+.-
T Consensus 544 ~ysCgiCkks~ 554 (707)
T KOG0957|consen 544 NYSCGICKKST 554 (707)
T ss_pred ceeeeeeccch
Confidence 57888887753
No 178
>KOG4124|consensus
Probab=26.91 E-value=13 Score=34.51 Aligned_cols=51 Identities=39% Similarity=0.801 Sum_probs=34.9
Q ss_pred CcCeeccc--cccccCChHHHHHHHHH-h------------------cCCCceecccCcccccChhhHHhcc
Q psy18077 293 EKPFKCKV--CDKAFADKSNLRAHIQT-H------------------SKVKPFVCEKCGKAFALKSYLYKHE 343 (451)
Q Consensus 293 ~~~~~C~~--C~~~f~~~~~l~~H~~~-h------------------~~~~~~~C~~C~~~f~~~~~L~~H~ 343 (451)
.++|+|.+ |++.+.....|+.|... | ...|+|+|++|.+.+.....|.-|.
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~ 418 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHR 418 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCcee
Confidence 45677755 77777776666666432 1 1247899999999988777766664
No 179
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=26.57 E-value=31 Score=19.25 Aligned_cols=11 Identities=27% Similarity=0.691 Sum_probs=3.7
Q ss_pred cCeeccccccc
Q psy18077 294 KPFKCKVCDKA 304 (451)
Q Consensus 294 ~~~~C~~C~~~ 304 (451)
..|.|..|+..
T Consensus 14 ~~Y~C~~Cdf~ 24 (30)
T PF07649_consen 14 WFYRCSECDFD 24 (30)
T ss_dssp -EEE-TTT---
T ss_pred ceEECccCCCc
Confidence 45666666543
No 180
>PRK05580 primosome assembly protein PriA; Validated
Probab=26.54 E-value=32 Score=36.24 Aligned_cols=47 Identities=32% Similarity=0.875 Sum_probs=0.0
Q ss_pred cccccccccccCHHHHHHHHHHcCCCcccCcCccccCCHHHHHHHHHHhcCCCCcccccchhhcCChHhHHHHHHHcCCC
Q psy18077 178 RTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKSFSRPWLLQGHIRTHTGEKPFKCKVCDKAFADKSNLRAHIQTHSKV 257 (451)
Q Consensus 178 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 257 (451)
..|..||... .|+.|+... ..|...+...|..||.. ..
T Consensus 382 ~~C~~Cg~~~-----------------~C~~C~~~l----------~~h~~~~~l~Ch~Cg~~---------------~~ 419 (679)
T PRK05580 382 LLCRDCGWVA-----------------ECPHCDASL----------TLHRFQRRLRCHHCGYQ---------------EP 419 (679)
T ss_pred eEhhhCcCcc-----------------CCCCCCCce----------eEECCCCeEECCCCcCC---------------CC
Q ss_pred CCeeccccc
Q psy18077 258 KPFVCEKCG 266 (451)
Q Consensus 258 ~~~~C~~C~ 266 (451)
.|..|+.|+
T Consensus 420 ~~~~Cp~Cg 428 (679)
T PRK05580 420 IPKACPECG 428 (679)
T ss_pred CCCCCCCCc
No 181
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=26.34 E-value=27 Score=27.20 Aligned_cols=13 Identities=31% Similarity=0.830 Sum_probs=8.5
Q ss_pred ccccccccccCCC
Q psy18077 366 SFQCRDCGKHYST 378 (451)
Q Consensus 366 ~~~C~~C~~~f~~ 378 (451)
.+.|..|+..|..
T Consensus 70 ~~~C~~Cg~~~~~ 82 (115)
T TIGR00100 70 ECECEDCSEEVSP 82 (115)
T ss_pred EEEcccCCCEEec
Confidence 4677777766654
No 182
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=25.82 E-value=33 Score=26.62 Aligned_cols=26 Identities=15% Similarity=0.336 Sum_probs=15.8
Q ss_pred cCeeccccccccCChHHHHHHHHHhcCCCceecccCcc
Q psy18077 294 KPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGK 331 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 331 (451)
..+.|..|+..|.... ..+.||.||.
T Consensus 69 ~~~~C~~Cg~~~~~~~------------~~~~CP~Cgs 94 (113)
T PRK12380 69 AQAWCWDCSQVVEIHQ------------HDAQCPHCHG 94 (113)
T ss_pred cEEEcccCCCEEecCC------------cCccCcCCCC
Confidence 3467777776665432 3356887773
No 183
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=25.47 E-value=33 Score=20.78 Aligned_cols=13 Identities=31% Similarity=0.963 Sum_probs=10.1
Q ss_pred CcccCccccccCC
Q psy18077 397 ARTCPQCGKLYVS 409 (451)
Q Consensus 397 ~~~C~~C~~~f~~ 409 (451)
||+|..|+..|=.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 7888888888743
No 184
>PF04931 DNA_pol_phi: DNA polymerase phi; InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=25.46 E-value=68 Score=34.54 Aligned_cols=6 Identities=33% Similarity=0.495 Sum_probs=2.7
Q ss_pred hHHHHH
Q psy18077 3 VIDQLV 8 (451)
Q Consensus 3 ~~~~~~ 8 (451)
|+..|.
T Consensus 628 ll~vl~ 633 (784)
T PF04931_consen 628 LLDVLD 633 (784)
T ss_pred HHHHhc
Confidence 344444
No 185
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=25.25 E-value=46 Score=26.35 Aligned_cols=23 Identities=35% Similarity=0.482 Sum_probs=16.7
Q ss_pred ccCcCccccCCHHHHHHHHHHhcCCC
Q psy18077 205 QCPHCSKSFSRPWLLQGHIRTHTGEK 230 (451)
Q Consensus 205 ~C~~C~~~f~~~~~L~~H~~~h~~~~ 230 (451)
.|-.+|+.| ..|++|+.+|.+-.
T Consensus 78 icLEDGkkf---KSLKRHL~t~~gmT 100 (148)
T COG4957 78 ICLEDGKKF---KSLKRHLTTHYGLT 100 (148)
T ss_pred EEeccCcch---HHHHHHHhcccCCC
Confidence 577777777 45888888877643
No 186
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=25.20 E-value=36 Score=27.16 Aligned_cols=13 Identities=38% Similarity=0.976 Sum_probs=7.2
Q ss_pred CcccccchhhcCC
Q psy18077 231 PFKCKVCDKAFAD 243 (451)
Q Consensus 231 ~~~C~~C~~~f~~ 243 (451)
.|+|+.|++.|..
T Consensus 53 RyrC~~C~~tf~~ 65 (129)
T COG3677 53 RYKCKSCGSTFTV 65 (129)
T ss_pred ccccCCcCcceee
Confidence 3556666555543
No 187
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=25.05 E-value=34 Score=27.79 Aligned_cols=35 Identities=26% Similarity=0.736 Sum_probs=20.3
Q ss_pred CcCeeccccccccCChHHHHHHHHHhcCCCceecccCccc
Q psy18077 293 EKPFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKA 332 (451)
Q Consensus 293 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 332 (451)
.-+|.|. |+..|.+. ++|-.+-.|+ .|.|..|+-.
T Consensus 115 ~~~Y~C~-C~q~~l~~---RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 115 TYPYRCQ-CQQHYLRI---RRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred ceeEEee-cCCccchh---hhcccccccc-eEEeccCCce
Confidence 3467777 77765433 3444444455 6777777644
No 188
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=25.01 E-value=16 Score=23.67 Aligned_cols=23 Identities=17% Similarity=0.701 Sum_probs=11.1
Q ss_pred ccccccccccCCChHHHHHHHHh
Q psy18077 366 SFQCRDCGKHYSTSSNLARHRQI 388 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~~H~~~ 388 (451)
.|+|+.|+..|=---.+-.|...
T Consensus 21 ~y~C~~C~~~FC~dCD~fiHE~L 43 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIHETL 43 (51)
T ss_dssp EE--TTTT--B-HHHHHTTTTTS
T ss_pred eEECCCCCCccccCcChhhhccc
Confidence 57788877777666655555333
No 189
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=24.80 E-value=25 Score=28.28 Aligned_cols=14 Identities=21% Similarity=0.669 Sum_probs=9.0
Q ss_pred cCeeccccccccCC
Q psy18077 294 KPFKCKVCDKAFAD 307 (451)
Q Consensus 294 ~~~~C~~C~~~f~~ 307 (451)
..+.|..||..|..
T Consensus 69 ~~~~C~~CG~~~~~ 82 (135)
T PRK03824 69 AVLKCRNCGNEWSL 82 (135)
T ss_pred eEEECCCCCCEEec
Confidence 44677777766654
No 190
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=24.21 E-value=39 Score=32.71 Aligned_cols=31 Identities=19% Similarity=0.482 Sum_probs=22.0
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCCCcccCccccccCChH
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQIHRSPDDTKARTCPQCGKLYVSMP 411 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~C~~C~~~f~~~~ 411 (451)
=.|+.||.+..+.. ..-|+|+.|++.+....
T Consensus 351 p~Cp~Cg~~m~S~G--------------~~g~rC~kCg~~~~~~~ 381 (421)
T COG1571 351 PVCPRCGGRMKSAG--------------RNGFRCKKCGTRARETL 381 (421)
T ss_pred CCCCccCCchhhcC--------------CCCcccccccccCCccc
Confidence 37999998654432 33799999998886543
No 191
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=24.18 E-value=12 Score=23.70 Aligned_cols=6 Identities=33% Similarity=1.110 Sum_probs=2.8
Q ss_pred ccccCC
Q psy18077 429 CSKSFS 434 (451)
Q Consensus 429 C~~~f~ 434 (451)
||.+|.
T Consensus 33 Cg~tfv 38 (47)
T PF04606_consen 33 CGHTFV 38 (47)
T ss_pred CCCEEE
Confidence 444443
No 192
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=24.04 E-value=6.2 Score=36.55 Aligned_cols=14 Identities=21% Similarity=0.672 Sum_probs=7.6
Q ss_pred CCCceecccCcccc
Q psy18077 320 KVKPFVCEKCGKAF 333 (451)
Q Consensus 320 ~~~~~~C~~C~~~f 333 (451)
|.+-..|..|+...
T Consensus 194 G~R~L~Cs~C~t~W 207 (290)
T PF04216_consen 194 GKRYLHCSLCGTEW 207 (290)
T ss_dssp -EEEEEETTT--EE
T ss_pred ccEEEEcCCCCCee
Confidence 55667788887543
No 193
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=23.96 E-value=61 Score=20.30 Aligned_cols=7 Identities=43% Similarity=1.227 Sum_probs=3.2
Q ss_pred cccccch
Q psy18077 232 FKCKVCD 238 (451)
Q Consensus 232 ~~C~~C~ 238 (451)
+.|+.|+
T Consensus 19 ~~CP~Cg 25 (46)
T PF12760_consen 19 FVCPHCG 25 (46)
T ss_pred CCCCCCC
Confidence 3444444
No 194
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=23.76 E-value=92 Score=26.32 Aligned_cols=17 Identities=18% Similarity=0.680 Sum_probs=8.9
Q ss_pred cCCCceecccCcccccC
Q psy18077 319 SKVKPFVCEKCGKAFAL 335 (451)
Q Consensus 319 ~~~~~~~C~~C~~~f~~ 335 (451)
....-|.|+.|.-.|+.
T Consensus 109 ~~~~~y~C~~~~~r~sf 125 (176)
T COG1675 109 TENNYYVCPNCHVKYSF 125 (176)
T ss_pred ccCCceeCCCCCCcccH
Confidence 34445666666555543
No 195
>KOG0717|consensus
Probab=23.44 E-value=40 Score=32.80 Aligned_cols=21 Identities=24% Similarity=0.339 Sum_probs=12.0
Q ss_pred cccCccccccCChHHHHHHHH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~ 418 (451)
+.|..|+++|.+..+|..|..
T Consensus 293 lyC~vCnKsFKseKq~kNHEn 313 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHEN 313 (508)
T ss_pred eEEeeccccccchHHHHhhHH
Confidence 556666666666555555543
No 196
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=23.29 E-value=12 Score=22.58 Aligned_cols=13 Identities=31% Similarity=1.050 Sum_probs=9.6
Q ss_pred ccccccccccccC
Q psy18077 364 TISFQCRDCGKHY 376 (451)
Q Consensus 364 ~~~~~C~~C~~~f 376 (451)
..-|.|..|+..|
T Consensus 26 T~fy~C~~C~~~w 38 (39)
T PF01096_consen 26 TLFYVCCNCGHRW 38 (39)
T ss_dssp EEEEEESSSTEEE
T ss_pred eEEEEeCCCCCee
Confidence 3358899998765
No 197
>PRK05978 hypothetical protein; Provisional
Probab=22.78 E-value=38 Score=27.63 Aligned_cols=10 Identities=30% Similarity=1.268 Sum_probs=6.4
Q ss_pred ccccccccCC
Q psy18077 368 QCRDCGKHYS 377 (451)
Q Consensus 368 ~C~~C~~~f~ 377 (451)
.|+.||..|.
T Consensus 54 ~C~~CG~~~~ 63 (148)
T PRK05978 54 HCAACGEDFT 63 (148)
T ss_pred CccccCCccc
Confidence 5777776654
No 198
>KOG0402|consensus
Probab=22.70 E-value=29 Score=24.76 Aligned_cols=13 Identities=31% Similarity=0.723 Sum_probs=6.6
Q ss_pred CccccccccccCC
Q psy18077 422 QICQCPHCSKSFS 434 (451)
Q Consensus 422 ~~~~C~~C~~~f~ 434 (451)
+.+.|.-|.+.|+
T Consensus 53 GiW~C~~C~kv~a 65 (92)
T KOG0402|consen 53 GIWKCGSCKKVVA 65 (92)
T ss_pred eEEecCCccceec
Confidence 4455555555443
No 199
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.49 E-value=38 Score=23.79 Aligned_cols=10 Identities=30% Similarity=1.158 Sum_probs=5.2
Q ss_pred eecccccccc
Q psy18077 296 FKCKVCDKAF 305 (451)
Q Consensus 296 ~~C~~C~~~f 305 (451)
|.|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 5555555444
No 200
>KOG0943|consensus
Probab=22.43 E-value=84 Score=34.80 Aligned_cols=9 Identities=22% Similarity=0.486 Sum_probs=3.7
Q ss_pred ccCChHhHH
Q psy18077 155 HYSTSSNLA 163 (451)
Q Consensus 155 ~f~~~~~l~ 163 (451)
.|++++++.
T Consensus 1926 rfssrssfl 1934 (3015)
T KOG0943|consen 1926 RFSSRSSFL 1934 (3015)
T ss_pred cccchhhhh
Confidence 344444433
No 201
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=22.38 E-value=27 Score=23.07 Aligned_cols=37 Identities=19% Similarity=0.407 Sum_probs=16.5
Q ss_pred cccccccccCCC-hHHHHHHHHhccCCCCCCCcccCccccc
Q psy18077 367 FQCRDCGKHYST-SSNLARHRQIHRSPDDTKARTCPQCGKL 406 (451)
Q Consensus 367 ~~C~~C~~~f~~-~~~L~~H~~~H~~~~~~~~~~C~~C~~~ 406 (451)
.+|-+|++.+.- ...+..-...-.. ...|-|..|...
T Consensus 3 vkC~lCdk~~~Id~~~~~aKrLrnrP---i~tYmC~eC~~R 40 (56)
T PF09963_consen 3 VKCILCDKKEEIDEDTPEAKRLRNRP---IHTYMCDECKER 40 (56)
T ss_pred eEEEecCCEEEeccCCHHHHHhhcCC---CcceeChhHHHH
Confidence 356777765432 2222222211111 445667766543
No 202
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=22.34 E-value=48 Score=22.78 Aligned_cols=28 Identities=25% Similarity=0.698 Sum_probs=10.9
Q ss_pred CeeccccccccCChHHHHHHHHHhcCCCceecccCccccc
Q psy18077 295 PFKCKVCDKAFADKSNLRAHIQTHSKVKPFVCEKCGKAFA 334 (451)
Q Consensus 295 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 334 (451)
.-.|..|++.|.. -.+.+.|..||..|=
T Consensus 9 ~~~C~~C~~~F~~------------~~rrhhCr~CG~~vC 36 (69)
T PF01363_consen 9 ASNCMICGKKFSL------------FRRRHHCRNCGRVVC 36 (69)
T ss_dssp -SB-TTT--B-BS------------SS-EEE-TTT--EEE
T ss_pred CCcCcCcCCcCCC------------ceeeEccCCCCCEEC
Confidence 3457777777742 144566777776553
No 203
>KOG0978|consensus
Probab=22.33 E-value=40 Score=34.96 Aligned_cols=20 Identities=25% Similarity=0.642 Sum_probs=15.2
Q ss_pred cccccccccCCChhhHHHHH
Q psy18077 424 CQCPHCSKSFSRPWLLQGHI 443 (451)
Q Consensus 424 ~~C~~C~~~f~~~~~l~~H~ 443 (451)
=+||.|+.+|.....+..|+
T Consensus 679 RKCP~Cn~aFganDv~~I~l 698 (698)
T KOG0978|consen 679 RKCPKCNAAFGANDVHRIHL 698 (698)
T ss_pred CCCCCCCCCCCcccccccCC
Confidence 46999999998887766553
No 204
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=22.19 E-value=15 Score=31.31 Aligned_cols=8 Identities=38% Similarity=1.107 Sum_probs=4.4
Q ss_pred cccccccc
Q psy18077 367 FQCRDCGK 374 (451)
Q Consensus 367 ~~C~~C~~ 374 (451)
..||.||.
T Consensus 15 ~~CPvCg~ 22 (201)
T COG1779 15 IDCPVCGG 22 (201)
T ss_pred ecCCcccc
Confidence 44666654
No 205
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=22.01 E-value=64 Score=20.72 Aligned_cols=15 Identities=33% Similarity=0.691 Sum_probs=8.0
Q ss_pred cCeeccccccccCCh
Q psy18077 294 KPFKCKVCDKAFADK 308 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~ 308 (451)
+.+.|..||..|.-.
T Consensus 3 k~l~C~dCg~~FvfT 17 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFT 17 (49)
T ss_pred eeEEcccCCCeEEEe
Confidence 445566666555433
No 206
>KOG4124|consensus
Probab=22.00 E-value=21 Score=33.08 Aligned_cols=58 Identities=28% Similarity=0.594 Sum_probs=34.1
Q ss_pred CCeecc--cccccccChHHHHhchhhcCCCCcC-----------CCCCCcCeeccccccccCChHHHHHHH
Q psy18077 258 KPFVCE--KCGKAFALKSYLYKHEDSSCLKSYD-----------SSSSEKPFKCKVCDKAFADKSNLRAHI 315 (451)
Q Consensus 258 ~~~~C~--~C~~~f~~~~~l~~H~~~~~~~~~~-----------~~~~~~~~~C~~C~~~f~~~~~l~~H~ 315 (451)
++|.|+ .|.+.+.....|..|...-|...+. .....|+|+|++|.+.+.....|..|.
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~ 418 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHR 418 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCcee
Confidence 456664 3666666666665554322211111 113468999999999988776666554
No 207
>PF09332 Mcm10: Mcm10 replication factor; InterPro: IPR015411 Mcm10 is a eukaryotic DNA replication factor that regulates the stability and chromatin association of DNA polymerase alpha []. ; PDB: 2KWQ_A.
Probab=21.87 E-value=7.6 Score=36.52 Aligned_cols=58 Identities=19% Similarity=0.436 Sum_probs=26.4
Q ss_pred ccccccccccCCChHHHH---HHHHhccCCCCCCCcccCccccccCChHHHHHHHHHcCCcccccccccc
Q psy18077 366 SFQCRDCGKHYSTSSNLA---RHRQIHRSPDDTKARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKS 432 (451)
Q Consensus 366 ~~~C~~C~~~f~~~~~L~---~H~~~H~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~C~~C~~~ 432 (451)
.+.|..|.++.......- .|....+. ...+-|+|..|++...+...|-. -.|..||..
T Consensus 252 av~C~~C~yt~~~~~~~C~~~~H~l~~~~-a~KRFFkC~~C~~Rt~sl~r~P~--------~~C~~Cg~~ 312 (344)
T PF09332_consen 252 AVTCKQCKYTAFKPSDRCKEEGHPLKWHD-AVKRFFKCKDCGNRTISLERLPK--------KHCSNCGSS 312 (344)
T ss_dssp EEEETTT--EESS--HHHHHTT--EEEEE-EE-EEEE-T-TS-EEEESSSS----------S--TTT-S-
T ss_pred EEEcCCCCCcccCcchhHHhcCCceEEee-eeeeeEECCCCCCeeeecccCCC--------CCCCcCCcC
Confidence 689999998766655543 34322211 13567999999998765533321 258889853
No 208
>KOG0717|consensus
Probab=21.84 E-value=49 Score=32.27 Aligned_cols=21 Identities=24% Similarity=0.643 Sum_probs=19.3
Q ss_pred cccccccccCCChHHHHHHHH
Q psy18077 367 FQCRDCGKHYSTSSNLARHRQ 387 (451)
Q Consensus 367 ~~C~~C~~~f~~~~~L~~H~~ 387 (451)
+.|.+|+++|.+...|..|..
T Consensus 293 lyC~vCnKsFKseKq~kNHEn 313 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHEN 313 (508)
T ss_pred eEEeeccccccchHHHHhhHH
Confidence 899999999999999999964
No 209
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=21.62 E-value=55 Score=20.89 Aligned_cols=8 Identities=38% Similarity=1.136 Sum_probs=3.4
Q ss_pred eecccccc
Q psy18077 296 FKCKVCDK 303 (451)
Q Consensus 296 ~~C~~C~~ 303 (451)
|.|+.||.
T Consensus 21 ~vC~~Cg~ 28 (52)
T smart00661 21 FVCRKCGY 28 (52)
T ss_pred EECCcCCC
Confidence 44444443
No 210
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=21.49 E-value=56 Score=32.25 Aligned_cols=21 Identities=24% Similarity=0.653 Sum_probs=12.1
Q ss_pred cccCccccccCChHHHHHHHH
Q psy18077 398 RTCPQCGKLYVSMPAYSMHLR 418 (451)
Q Consensus 398 ~~C~~C~~~f~~~~~l~~H~~ 418 (451)
|.|+.|.+.|.....+..|+.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~ 78 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHME 78 (466)
T ss_pred eeCCcccceeCCHHHHHHHHH
Confidence 455555555555555555554
No 211
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=21.20 E-value=21 Score=26.60 Aligned_cols=10 Identities=50% Similarity=1.222 Sum_probs=4.7
Q ss_pred eecccCcccc
Q psy18077 324 FVCEKCGKAF 333 (451)
Q Consensus 324 ~~C~~C~~~f 333 (451)
..|..||..|
T Consensus 47 ~~Cg~CGls~ 56 (104)
T COG4888 47 AVCGNCGLSF 56 (104)
T ss_pred EEcccCcceE
Confidence 3444454444
No 212
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.19 E-value=68 Score=21.47 Aligned_cols=41 Identities=22% Similarity=0.669 Sum_probs=22.0
Q ss_pred eeccccccc-cCChHHHHHHHHHhcCCCceecccCcccccCh
Q psy18077 296 FKCKVCDKA-FADKSNLRAHIQTHSKVKPFVCEKCGKAFALK 336 (451)
Q Consensus 296 ~~C~~C~~~-f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 336 (451)
.+|.+|++. +.+...+..-.......+.|-|+.|......+
T Consensus 3 vkCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rva~k 44 (68)
T COG4896 3 VKCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRVAIK 44 (68)
T ss_pred ceEEEecceeeecchhHHHHHhhCCCceeEechhhHhhhchh
Confidence 356666653 34444444443334445567777776554443
No 213
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=20.67 E-value=72 Score=32.28 Aligned_cols=23 Identities=26% Similarity=0.577 Sum_probs=13.8
Q ss_pred CCcccccccccccCHHHHHHHHHHcCCCcccCcCcc
Q psy18077 176 KARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSK 211 (451)
Q Consensus 176 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~ 211 (451)
....|..||....- +..|+.|+.
T Consensus 239 ~~l~Ch~Cg~~~~~-------------~~~Cp~C~s 261 (505)
T TIGR00595 239 GKLRCHYCGYQEPI-------------PKTCPQCGS 261 (505)
T ss_pred CeEEcCCCcCcCCC-------------CCCCCCCCC
Confidence 35567777766432 346777765
No 214
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=20.31 E-value=44 Score=20.32 Aligned_cols=18 Identities=22% Similarity=0.425 Sum_probs=8.9
Q ss_pred cCeeccccccccCChHHH
Q psy18077 294 KPFKCKVCDKAFADKSNL 311 (451)
Q Consensus 294 ~~~~C~~C~~~f~~~~~l 311 (451)
..+.|+.|+-.+.+...|
T Consensus 18 ~id~C~~C~G~W~d~~el 35 (41)
T PF13453_consen 18 EIDVCPSCGGIWFDAGEL 35 (41)
T ss_pred EEEECCCCCeEEccHHHH
Confidence 334455555555544444
No 215
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=20.07 E-value=57 Score=31.65 Aligned_cols=14 Identities=36% Similarity=0.745 Sum_probs=7.8
Q ss_pred ceecccCcccccCh
Q psy18077 323 PFVCEKCGKAFALK 336 (451)
Q Consensus 323 ~~~C~~C~~~f~~~ 336 (451)
-|+|+.|++.+...
T Consensus 367 g~rC~kCg~~~~~~ 380 (421)
T COG1571 367 GFRCKKCGTRARET 380 (421)
T ss_pred CcccccccccCCcc
Confidence 45566666555543
No 216
>PRK14873 primosome assembly protein PriA; Provisional
Probab=20.03 E-value=70 Score=33.54 Aligned_cols=23 Identities=26% Similarity=0.759 Sum_probs=13.8
Q ss_pred CCcccccccccccCHHHHHHHHHHcCCCcccCcCccc
Q psy18077 176 KARTCPQCGKLYVSMPAYSMHLRTHNQICQCPHCSKS 212 (451)
Q Consensus 176 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~C~~C~~~ 212 (451)
....|..||... .++.|+.|+..
T Consensus 409 ~~l~Ch~CG~~~--------------~p~~Cp~Cgs~ 431 (665)
T PRK14873 409 GTPRCRWCGRAA--------------PDWRCPRCGSD 431 (665)
T ss_pred CeeECCCCcCCC--------------cCccCCCCcCC
Confidence 356677777632 14467777654
Done!