Query psy11032
Match_columns 514
No_of_seqs 325 out of 3277
Neff 10.5
Searched_HMMs 46136
Date Fri Aug 16 16:23:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11032.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11032hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 3.7E-32 8.1E-37 264.1 6.7 230 273-502 606-933 (958)
2 KOG1074|consensus 100.0 3.7E-31 8.1E-36 257.2 7.0 145 169-319 605-776 (958)
3 KOG3608|consensus 99.9 3.2E-27 6.9E-32 207.6 9.3 261 197-502 69-378 (467)
4 KOG3608|consensus 99.9 1.1E-26 2.3E-31 204.3 11.2 249 170-475 70-379 (467)
5 KOG2462|consensus 99.9 2.3E-27 5E-32 203.9 6.3 134 299-470 129-265 (279)
6 KOG2462|consensus 99.9 1.4E-27 2.9E-32 205.3 3.9 137 326-497 128-264 (279)
7 KOG3623|consensus 99.9 9.9E-23 2.1E-27 195.4 12.3 119 85-219 211-331 (1007)
8 KOG3623|consensus 99.9 1.3E-22 2.9E-27 194.5 5.7 77 421-497 895-971 (1007)
9 KOG3576|consensus 99.8 1.1E-19 2.3E-24 148.0 1.9 112 389-500 114-236 (267)
10 KOG3576|consensus 99.6 2.8E-16 6.1E-21 128.2 2.7 126 167-328 115-240 (267)
11 PLN03086 PRLI-interacting fact 99.2 2.4E-11 5.1E-16 119.3 8.2 146 273-473 408-565 (567)
12 PLN03086 PRLI-interacting fact 99.2 2.9E-11 6.2E-16 118.7 5.7 102 392-500 453-564 (567)
13 PHA00733 hypothetical protein 99.0 1.2E-10 2.6E-15 93.0 3.3 83 390-474 38-125 (128)
14 KOG3993|consensus 99.0 5.5E-11 1.2E-15 108.7 0.9 193 300-497 267-479 (500)
15 KOG3993|consensus 98.9 2.3E-10 5.1E-15 104.6 1.6 158 196-353 266-483 (500)
16 PHA00733 hypothetical protein 98.9 2.9E-10 6.2E-15 90.8 1.5 95 405-501 25-124 (128)
17 PHA02768 hypothetical protein; 98.8 2.7E-09 5.8E-14 69.1 1.4 22 422-443 7-28 (55)
18 PHA02768 hypothetical protein; 98.7 1.1E-08 2.4E-13 66.2 2.3 44 448-493 5-48 (55)
19 PHA00616 hypothetical protein 98.5 6.1E-08 1.3E-12 59.6 1.7 42 448-489 1-42 (44)
20 PF13465 zf-H2C2_2: Zinc-finge 98.5 5.5E-08 1.2E-12 53.7 1.2 24 436-459 2-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.4 4.3E-08 9.4E-13 54.1 0.2 26 407-432 1-26 (26)
22 PHA00616 hypothetical protein 98.4 1E-07 2.3E-12 58.5 0.5 34 420-453 1-34 (44)
23 KOG1146|consensus 98.3 1.2E-06 2.7E-11 92.3 6.5 130 62-191 442-611 (1406)
24 PHA00732 hypothetical protein 98.1 1.3E-06 2.8E-11 62.8 2.5 43 420-468 1-44 (79)
25 PHA00732 hypothetical protein 98.1 1.8E-06 3.8E-11 62.1 2.2 46 448-498 1-46 (79)
26 PF05605 zf-Di19: Drought indu 98.0 6.7E-06 1.4E-10 54.9 3.7 50 421-473 3-54 (54)
27 KOG1146|consensus 97.9 1.7E-05 3.7E-10 84.0 5.8 158 87-248 439-641 (1406)
28 PF05605 zf-Di19: Drought indu 97.6 3.1E-05 6.8E-10 51.6 2.1 51 448-501 2-54 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.5 2.5E-05 5.5E-10 41.8 0.4 20 450-469 2-21 (23)
30 PF13894 zf-C2H2_4: C2H2-type 97.4 6.6E-05 1.4E-09 40.6 1.4 24 449-472 1-24 (24)
31 PF00096 zf-C2H2: Zinc finger, 97.4 8.7E-05 1.9E-09 39.7 1.5 22 198-219 1-22 (23)
32 PF13894 zf-C2H2_4: C2H2-type 97.3 9.8E-05 2.1E-09 39.9 0.9 23 421-443 1-23 (24)
33 COG5189 SFP1 Putative transcri 97.2 0.00012 2.7E-09 65.1 1.2 53 418-470 347-420 (423)
34 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00015 3.2E-09 40.5 1.1 25 448-472 1-25 (27)
35 PF12756 zf-C2H2_2: C2H2 type 97.2 0.00016 3.6E-09 55.8 1.7 72 394-471 1-73 (100)
36 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00021 4.5E-09 55.2 1.7 22 115-136 2-23 (100)
37 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00016 3.4E-09 40.4 0.7 25 420-444 1-25 (27)
38 PF13909 zf-H2C2_5: C2H2-type 96.9 0.00043 9.2E-09 37.4 1.2 24 449-473 1-24 (24)
39 KOG2231|consensus 96.8 0.0013 2.9E-08 66.4 4.4 140 301-500 100-263 (669)
40 KOG2231|consensus 96.8 0.0024 5.1E-08 64.6 6.0 46 171-220 184-235 (669)
41 PF13909 zf-H2C2_5: C2H2-type 96.6 0.00067 1.4E-08 36.6 0.6 24 421-445 1-24 (24)
42 COG5189 SFP1 Putative transcri 96.6 0.0013 2.8E-08 58.9 2.7 71 325-412 346-418 (423)
43 PF09237 GAGA: GAGA factor; I 96.6 0.0014 3E-08 41.4 1.7 30 418-447 22-51 (54)
44 smart00355 ZnF_C2H2 zinc finge 96.3 0.0028 6E-08 34.7 1.8 19 451-469 3-21 (26)
45 PF09237 GAGA: GAGA factor; I 96.2 0.0033 7.1E-08 39.7 1.9 40 437-476 12-52 (54)
46 PRK04860 hypothetical protein; 96.1 0.003 6.5E-08 52.5 1.7 36 392-431 119-154 (160)
47 smart00355 ZnF_C2H2 zinc finge 95.9 0.0047 1E-07 33.7 1.6 22 421-442 1-22 (26)
48 KOG2785|consensus 95.8 0.015 3.3E-07 54.0 5.1 78 273-350 4-90 (390)
49 KOG2785|consensus 95.7 0.024 5.1E-07 52.8 6.0 51 420-470 166-242 (390)
50 PF12874 zf-met: Zinc-finger o 95.7 0.0055 1.2E-07 33.3 1.1 21 58-78 2-22 (25)
51 KOG2482|consensus 95.6 0.042 9.1E-07 50.1 7.1 180 169-350 144-356 (423)
52 COG5236 Uncharacterized conser 95.6 0.015 3.3E-07 52.9 4.2 72 171-250 222-306 (493)
53 PRK04860 hypothetical protein; 95.5 0.0054 1.2E-07 51.0 1.1 40 419-462 118-157 (160)
54 PF12874 zf-met: Zinc-finger o 95.5 0.0059 1.3E-07 33.2 0.9 21 449-469 1-21 (25)
55 COG5048 FOG: Zn-finger [Genera 95.3 0.032 7E-07 56.2 6.0 115 391-505 288-447 (467)
56 PF12171 zf-C2H2_jaz: Zinc-fin 95.1 0.011 2.5E-07 32.7 1.2 22 393-414 2-23 (27)
57 COG5236 Uncharacterized conser 94.9 0.025 5.3E-07 51.6 3.4 20 200-219 223-242 (493)
58 COG5048 FOG: Zn-finger [Genera 94.9 0.019 4.2E-07 57.8 3.1 156 299-455 288-453 (467)
59 PF12171 zf-C2H2_jaz: Zinc-fin 94.7 0.01 2.2E-07 32.9 0.4 21 449-469 2-22 (27)
60 KOG2482|consensus 93.6 0.18 3.9E-06 46.2 5.9 50 85-134 145-217 (423)
61 cd00350 rubredoxin_like Rubred 92.6 0.089 1.9E-06 30.7 1.8 24 393-428 2-25 (33)
62 PF13913 zf-C2HC_2: zinc-finge 92.4 0.11 2.5E-06 28.1 1.9 21 57-78 3-23 (25)
63 PF12013 DUF3505: Protein of u 92.3 0.15 3.3E-06 39.8 3.3 25 449-473 81-109 (109)
64 COG4049 Uncharacterized protei 92.0 0.074 1.6E-06 34.2 1.0 32 442-473 11-42 (65)
65 PF13913 zf-C2HC_2: zinc-finge 91.6 0.15 3.3E-06 27.5 1.8 20 329-349 3-22 (25)
66 smart00451 ZnF_U1 U1-like zinc 90.5 0.17 3.6E-06 30.0 1.5 22 420-441 3-24 (35)
67 smart00451 ZnF_U1 U1-like zinc 89.8 0.19 4.1E-06 29.8 1.3 23 448-470 3-25 (35)
68 TIGR00622 ssl1 transcription f 89.1 1.1 2.5E-05 34.4 5.3 25 392-416 81-105 (112)
69 PF12013 DUF3505: Protein of u 88.9 0.4 8.7E-06 37.4 2.9 24 393-416 81-108 (109)
70 KOG4173|consensus 88.3 0.34 7.5E-06 40.9 2.2 79 56-137 79-171 (253)
71 TIGR00622 ssl1 transcription f 87.8 2 4.3E-05 33.2 5.8 49 394-444 57-105 (112)
72 COG4049 Uncharacterized protei 87.8 0.25 5.4E-06 31.9 0.9 28 51-78 12-39 (65)
73 KOG2893|consensus 87.3 0.36 7.8E-06 41.8 1.8 49 450-502 12-60 (341)
74 cd00729 rubredoxin_SM Rubredox 86.3 0.52 1.1E-05 27.7 1.6 25 392-428 2-26 (34)
75 KOG2893|consensus 84.4 0.3 6.5E-06 42.2 -0.0 42 58-103 12-53 (341)
76 PF09986 DUF2225: Uncharacteri 83.9 0.31 6.8E-06 43.2 -0.1 13 449-461 49-61 (214)
77 KOG4173|consensus 83.0 0.42 9E-06 40.5 0.2 79 391-472 78-170 (253)
78 PF09538 FYDLN_acid: Protein o 82.7 0.93 2E-05 35.0 2.0 32 55-97 8-39 (108)
79 PF06524 NOA36: NOA36 protein; 80.6 1.1 2.4E-05 39.6 1.9 88 106-243 136-227 (314)
80 COG1592 Rubrerythrin [Energy p 79.6 1.3 2.7E-05 37.1 1.9 24 392-428 134-157 (166)
81 PF09986 DUF2225: Uncharacteri 78.2 1.4 3.1E-05 39.0 2.1 43 391-433 4-61 (214)
82 PF10571 UPF0547: Uncharacteri 77.0 1.8 3.9E-05 23.6 1.4 10 422-431 16-25 (26)
83 PF06524 NOA36: NOA36 protein; 76.2 1.1 2.5E-05 39.5 0.8 94 219-352 136-233 (314)
84 PF09538 FYDLN_acid: Protein o 76.1 2.4 5.1E-05 32.8 2.4 15 327-341 25-39 (108)
85 PRK00464 nrdR transcriptional 74.5 1.3 2.7E-05 36.8 0.6 14 421-434 29-42 (154)
86 smart00659 RPOLCX RNA polymera 74.3 1.6 3.4E-05 27.4 0.9 11 421-431 3-13 (44)
87 PRK00398 rpoP DNA-directed RNA 73.7 1.7 3.7E-05 27.6 1.0 13 392-404 3-15 (46)
88 PF15269 zf-C2H2_7: Zinc-finge 73.3 3.4 7.4E-05 25.4 2.1 27 165-191 16-42 (54)
89 PRK14890 putative Zn-ribbon RN 72.8 4 8.7E-05 27.2 2.5 9 448-456 48-56 (59)
90 KOG2186|consensus 72.4 2 4.4E-05 37.9 1.4 46 449-497 4-49 (276)
91 COG1592 Rubrerythrin [Energy p 71.7 2.4 5.3E-05 35.4 1.7 24 420-456 134-157 (166)
92 COG1996 RPC10 DNA-directed RNA 69.3 1.6 3.4E-05 28.0 0.1 29 55-93 5-33 (49)
93 TIGR02300 FYDLN_acid conserved 69.1 3.5 7.5E-05 32.4 1.9 33 54-97 7-39 (129)
94 PRK00464 nrdR transcriptional 69.0 1.5 3.2E-05 36.4 -0.2 17 449-465 29-45 (154)
95 PF13717 zinc_ribbon_4: zinc-r 68.3 5.3 0.00012 23.8 2.2 33 393-430 3-35 (36)
96 COG2888 Predicted Zn-ribbon RN 67.6 4.6 9.9E-05 26.9 1.9 16 328-343 27-42 (61)
97 KOG2186|consensus 67.6 3.2 6.9E-05 36.8 1.5 38 198-237 4-41 (276)
98 PF02176 zf-TRAF: TRAF-type zi 67.2 3.7 8.1E-05 27.7 1.6 40 196-236 8-53 (60)
99 PF03604 DNA_RNApol_7kD: DNA d 66.5 3.3 7.2E-05 23.9 1.0 26 57-93 1-26 (32)
100 COG1198 PriA Primosomal protei 66.3 5.2 0.00011 42.3 3.1 10 393-402 436-445 (730)
101 PF13719 zinc_ribbon_5: zinc-r 66.1 4.9 0.00011 24.1 1.8 12 394-405 4-15 (37)
102 PF09723 Zn-ribbon_8: Zinc rib 65.8 4.2 9.1E-05 25.2 1.5 30 392-428 5-34 (42)
103 PF02176 zf-TRAF: TRAF-type zi 65.7 3.6 7.9E-05 27.8 1.3 40 83-123 8-53 (60)
104 TIGR00373 conserved hypothetic 64.7 5.7 0.00012 33.3 2.5 32 194-234 106-137 (158)
105 TIGR02605 CxxC_CxxC_SSSS putat 64.6 3.7 8E-05 26.8 1.1 30 392-428 5-34 (52)
106 TIGR02098 MJ0042_CXXC MJ0042 f 64.5 5.1 0.00011 24.1 1.6 13 393-405 3-15 (38)
107 smart00531 TFIIE Transcription 64.5 5.8 0.00013 32.8 2.5 36 390-430 97-133 (147)
108 smart00531 TFIIE Transcription 63.5 4.5 9.9E-05 33.5 1.7 34 170-207 100-133 (147)
109 smart00834 CxxC_CXXC_SSSS Puta 63.4 2.7 5.8E-05 25.8 0.3 11 449-459 6-16 (41)
110 PHA00626 hypothetical protein 63.1 4.9 0.00011 26.3 1.4 12 448-459 23-34 (59)
111 PRK06266 transcription initiat 63.0 3.9 8.4E-05 35.0 1.3 16 197-212 117-132 (178)
112 PF05290 Baculo_IE-1: Baculovi 62.5 7.5 0.00016 30.8 2.6 23 386-408 74-96 (140)
113 TIGR00373 conserved hypothetic 60.0 8.3 0.00018 32.3 2.7 34 296-338 105-138 (158)
114 TIGR02300 FYDLN_acid conserved 59.4 8.2 0.00018 30.4 2.3 12 273-284 10-21 (129)
115 PF02892 zf-BED: BED zinc fing 59.4 6 0.00013 24.8 1.4 22 419-440 15-40 (45)
116 TIGR01384 TFS_arch transcripti 59.4 3.2 6.9E-05 32.0 0.1 39 393-431 63-101 (104)
117 smart00734 ZnF_Rad18 Rad18-lik 58.8 9.1 0.0002 20.9 1.8 20 329-349 2-21 (26)
118 PRK09678 DNA-binding transcrip 58.2 2.5 5.3E-05 29.8 -0.6 17 447-463 26-44 (72)
119 PRK06266 transcription initiat 58.0 7.7 0.00017 33.3 2.2 33 167-208 115-147 (178)
120 COG1773 Rubredoxin [Energy pro 56.2 8.4 0.00018 25.4 1.6 12 112-123 3-14 (55)
121 COG5188 PRP9 Splicing factor 3 55.0 16 0.00035 34.1 3.8 28 192-219 369-397 (470)
122 PF00301 Rubredoxin: Rubredoxi 52.9 13 0.00028 23.7 2.1 14 393-406 2-15 (47)
123 COG5151 SSL1 RNA polymerase II 52.3 10 0.00022 34.7 2.1 23 273-295 389-411 (421)
124 PF14353 CpXC: CpXC protein 51.0 2.5 5.5E-05 34.0 -1.8 48 393-440 2-58 (128)
125 COG5151 SSL1 RNA polymerase II 50.0 13 0.00029 33.9 2.5 18 328-345 322-339 (421)
126 COG1997 RPL43A Ribosomal prote 49.3 9.4 0.0002 27.7 1.1 30 392-431 35-64 (89)
127 smart00614 ZnF_BED BED zinc fi 48.7 11 0.00025 24.3 1.4 11 393-403 19-29 (50)
128 PF05443 ROS_MUCR: ROS/MUCR tr 47.9 9.4 0.0002 30.7 1.1 24 449-475 73-96 (132)
129 KOG2807|consensus 47.5 29 0.00062 32.3 4.1 22 111-132 289-310 (378)
130 PF04959 ARS2: Arsenite-resist 47.3 9 0.0002 33.7 1.0 26 449-474 78-103 (214)
131 cd00730 rubredoxin Rubredoxin; 46.8 12 0.00027 24.2 1.3 11 113-123 2-12 (50)
132 COG1594 RPB9 DNA-directed RNA 46.8 4 8.6E-05 31.9 -1.2 39 273-311 73-111 (113)
133 KOG2807|consensus 46.4 27 0.00059 32.4 3.8 34 300-347 276-309 (378)
134 COG3357 Predicted transcriptio 44.9 14 0.0003 27.0 1.4 27 392-428 58-84 (97)
135 PF13240 zinc_ribbon_2: zinc-r 44.4 14 0.0003 19.5 1.0 6 423-428 16-21 (23)
136 KOG4377|consensus 42.8 21 0.00046 34.2 2.7 25 329-353 272-298 (480)
137 KOG2461|consensus 41.8 37 0.0008 33.3 4.3 55 281-335 191-245 (396)
138 KOG2593|consensus 41.8 21 0.00046 34.7 2.5 38 193-233 124-161 (436)
139 PF08274 PhnA_Zn_Ribbon: PhnA 41.8 8.6 0.00019 21.8 0.0 7 449-455 20-26 (30)
140 PRK04023 DNA polymerase II lar 40.1 27 0.00058 38.0 3.2 22 392-429 626-647 (1121)
141 KOG4377|consensus 40.1 21 0.00046 34.2 2.3 65 273-340 272-359 (480)
142 PF12760 Zn_Tnp_IS1595: Transp 38.5 22 0.00047 22.5 1.5 12 195-206 16-27 (46)
143 PF09845 DUF2072: Zn-ribbon co 38.5 20 0.00043 28.6 1.5 14 393-406 2-15 (131)
144 KOG1280|consensus 37.9 30 0.00066 32.4 2.8 23 420-442 79-101 (381)
145 PF05443 ROS_MUCR: ROS/MUCR tr 37.5 16 0.00035 29.4 0.9 25 55-82 71-95 (132)
146 KOG2071|consensus 37.3 25 0.00053 35.8 2.3 26 390-415 416-441 (579)
147 TIGR00595 priA primosomal prot 35.8 30 0.00064 35.5 2.8 49 393-457 214-262 (505)
148 PRK03824 hypA hydrogenase nick 35.8 24 0.00051 28.7 1.7 14 392-405 70-83 (135)
149 PF09332 Mcm10: Mcm10 replicat 34.5 8.9 0.00019 36.5 -1.1 56 392-457 252-312 (344)
150 KOG2071|consensus 34.2 46 0.001 33.9 3.6 24 328-351 418-441 (579)
151 COG4530 Uncharacterized protei 34.2 23 0.00049 26.9 1.2 27 57-94 10-36 (129)
152 PRK14714 DNA polymerase II lar 34.0 40 0.00086 37.9 3.4 56 393-490 668-723 (1337)
153 PF03811 Zn_Tnp_IS1: InsA N-te 33.6 18 0.00039 21.6 0.5 30 393-426 6-35 (36)
154 PF07800 DUF1644: Protein of u 32.8 92 0.002 25.9 4.5 19 109-130 26-44 (162)
155 KOG2907|consensus 32.4 29 0.00063 26.7 1.5 40 393-432 75-114 (116)
156 PF13453 zf-TFIIB: Transcripti 31.8 25 0.00055 21.5 1.0 20 446-465 17-36 (41)
157 PF07754 DUF1610: Domain of un 31.7 23 0.0005 18.9 0.6 8 392-399 16-23 (24)
158 PF12907 zf-met2: Zinc-binding 30.5 34 0.00073 21.0 1.3 26 449-474 2-30 (40)
159 smart00440 ZnF_C2C2 C2C2 Zinc 30.2 23 0.0005 21.7 0.6 8 421-428 29-36 (40)
160 TIGR01206 lysW lysine biosynth 29.6 18 0.00039 23.9 0.0 9 421-429 3-11 (54)
161 KOG2593|consensus 29.4 31 0.00066 33.6 1.5 36 418-456 126-161 (436)
162 PF13878 zf-C2H2_3: zinc-finge 29.3 66 0.0014 19.8 2.5 23 170-192 14-38 (41)
163 PF14446 Prok-RING_1: Prokaryo 29.2 28 0.0006 23.0 0.8 14 393-406 6-19 (54)
164 PF04959 ARS2: Arsenite-resist 29.1 19 0.00041 31.8 0.1 29 327-355 76-104 (214)
165 COG4957 Predicted transcriptio 29.0 25 0.00054 28.0 0.7 26 55-83 75-100 (148)
166 KOG4167|consensus 28.7 17 0.00037 37.6 -0.3 26 196-221 791-816 (907)
167 COG4888 Uncharacterized Zn rib 28.4 19 0.00042 26.9 0.0 13 392-404 22-34 (104)
168 KOG4167|consensus 28.0 16 0.00034 37.9 -0.6 26 55-80 791-816 (907)
169 PF04810 zf-Sec23_Sec24: Sec23 27.5 40 0.00087 20.6 1.3 12 417-428 21-32 (40)
170 PRK12380 hydrogenase nickel in 27.1 37 0.0008 26.6 1.4 25 392-428 70-94 (113)
171 PF07975 C1_4: TFIIH C1-like d 27.0 38 0.00082 22.1 1.2 25 392-416 21-45 (51)
172 TIGR00100 hypA hydrogenase nic 26.9 37 0.0008 26.7 1.4 26 392-429 70-95 (115)
173 PRK00432 30S ribosomal protein 26.6 34 0.00073 22.2 0.9 10 420-429 37-46 (50)
174 COG4391 Uncharacterized protei 26.4 83 0.0018 21.3 2.7 42 26-67 15-59 (62)
175 PF14311 DUF4379: Domain of un 25.7 53 0.0012 21.6 1.8 16 393-408 29-44 (55)
176 KOG3408|consensus 25.5 36 0.00079 26.5 1.0 24 55-78 56-79 (129)
177 KOG1280|consensus 25.3 47 0.001 31.3 1.9 28 447-474 78-105 (381)
178 COG3364 Zn-ribbon containing p 25.1 43 0.00093 25.2 1.3 14 392-405 2-15 (112)
179 PF01780 Ribosomal_L37ae: Ribo 25.0 19 0.00041 26.6 -0.5 10 421-430 54-63 (90)
180 PRK14714 DNA polymerase II lar 24.9 70 0.0015 36.1 3.3 11 300-310 692-702 (1337)
181 smart00154 ZnF_AN1 AN1-like Zi 24.8 36 0.00077 20.7 0.7 13 420-432 12-24 (39)
182 PF10276 zf-CHCC: Zinc-finger 24.4 42 0.00092 20.6 1.0 12 55-66 28-39 (40)
183 KOG3408|consensus 24.2 44 0.00096 26.1 1.2 21 449-469 58-78 (129)
184 KOG2932|consensus 24.2 81 0.0017 29.2 3.1 73 391-473 89-172 (389)
185 PF13248 zf-ribbon_3: zinc-rib 23.9 34 0.00073 18.5 0.4 8 58-65 4-11 (26)
186 COG3091 SprT Zn-dependent meta 23.7 37 0.00081 27.8 0.8 32 391-427 116-147 (156)
187 COG1655 Uncharacterized protei 23.5 26 0.00057 30.8 -0.1 9 449-457 63-71 (267)
188 PF07649 C1_3: C1-like domain; 23.4 45 0.00098 18.7 0.9 7 421-427 16-22 (30)
189 PRK14873 primosome assembly pr 23.2 65 0.0014 34.3 2.7 45 393-457 384-431 (665)
190 PTZ00255 60S ribosomal protein 23.1 37 0.00081 25.1 0.7 11 420-430 54-64 (90)
191 PF05191 ADK_lid: Adenylate ki 23.1 33 0.00072 20.4 0.4 9 302-310 3-11 (36)
192 TIGR00280 L37a ribosomal prote 22.9 34 0.00075 25.3 0.5 11 420-430 53-63 (91)
193 PF13451 zf-trcl: Probable zin 22.8 59 0.0013 21.0 1.4 38 447-484 3-41 (49)
194 PF01096 TFIIS_C: Transcriptio 22.7 19 0.00041 21.9 -0.8 7 421-427 29-35 (39)
195 PF08271 TF_Zn_Ribbon: TFIIB z 22.5 30 0.00064 21.5 0.1 12 196-207 18-29 (43)
196 PF01286 XPA_N: XPA protein N- 22.4 41 0.00089 19.8 0.6 16 393-408 4-19 (34)
197 COG4896 Uncharacterized protei 22.3 45 0.00098 22.3 0.9 7 58-64 4-10 (68)
198 PRK00564 hypA hydrogenase nick 21.9 50 0.0011 26.0 1.3 26 392-429 71-97 (117)
199 PF07282 OrfB_Zn_ribbon: Putat 21.7 51 0.0011 22.9 1.1 11 419-429 45-55 (69)
200 KOG2272|consensus 21.5 75 0.0016 28.3 2.3 19 323-341 216-234 (332)
201 PRK14559 putative protein seri 20.9 88 0.0019 33.0 3.1 10 450-459 43-52 (645)
202 PRK03681 hypA hydrogenase nick 20.7 53 0.0012 25.7 1.2 27 392-429 70-96 (114)
203 PF05129 Elf1: Transcription e 20.6 16 0.00035 26.5 -1.6 8 392-399 22-29 (81)
204 PF01363 FYVE: FYVE zinc finge 20.2 50 0.0011 22.9 0.8 26 393-430 10-35 (69)
205 PRK05580 primosome assembly pr 20.1 78 0.0017 33.9 2.6 50 390-455 379-428 (679)
No 1
>KOG1074|consensus
Probab=99.97 E-value=3.7e-32 Score=264.09 Aligned_cols=230 Identities=24% Similarity=0.409 Sum_probs=154.7
Q ss_pred ccCCcccCCCCChhHHHHHHHhhCCCCCcccccccccccCchHHHhHhhhccCC----CCccCC---cCcccCCChHHHH
Q psy11032 273 HECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGMHNGI----HPFTCH---FCNKSFTQKTGLE 345 (514)
Q Consensus 273 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----~~~~C~---~C~~~f~~~~~l~ 345 (514)
-+|-+|-+...-.+.|+.|+++|+|++||+|.+||+.|+++.+|+.|+-+|... ..+.|+ +|.+.|.+...|.
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lp 685 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLP 685 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccccc
Confidence 689999999999999999999999999999999999999999999999888654 348899 9999999999999
Q ss_pred HHhhhhcCCCCC-------------Ccchhhhhhccccccc--------Cccchhhhh----hcccCCCCCceeCCcCCC
Q psy11032 346 RHLTIHNNEKKH-------------QSGRRQKRNQQHDRQR--------SESDTTLAE----EDNLFHSTPIFQCHLCGR 400 (514)
Q Consensus 346 ~H~~~h~~~~~~-------------~~~~~~~~~~~~~~~~--------~~~~~~~~~----~~~~~~~~~~~~C~~C~~ 400 (514)
.|+++|.+.... ++..+.+.+.....-. ......... +....+..++..+..|+.
T Consensus 686 QhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~ 765 (958)
T KOG1074|consen 686 QHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCGR 765 (958)
T ss_pred ceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccccc
Confidence 999999854321 1233333332211100 000001111 111233445778888888
Q ss_pred ccCCchhhhhhcccc-----------------------cCCccc-ccCCCcCCCCCch----hHHhH--------h----
Q psy11032 401 CFNSPTTLTQHKIVH-----------------------TGERRF-KCDICGQALTTKP----KLNDH--------M---- 440 (514)
Q Consensus 401 ~f~~~~~l~~H~~~H-----------------------~~~~~~-~C~~C~~~f~~~~----~l~~H--------~---- 440 (514)
.+.....+..+-..+ +++++. .+..++..-...- .|..= .
T Consensus 766 ~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~ 845 (958)
T KOG1074|consen 766 ELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKT 845 (958)
T ss_pred ccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccccc
Confidence 888777666653222 234555 5555543221110 00000 0
Q ss_pred --hhccC----------------------C--ccccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccc
Q psy11032 441 --LLHTG----------------------E--KPHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEK 494 (514)
Q Consensus 441 --~~h~~----------------------~--~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~ 494 (514)
.++-+ . ....|.+||+.|...++|+.|+|+|.|++||+|..|+++|..+.+|..
T Consensus 846 n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKv 925 (958)
T KOG1074|consen 846 NEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKV 925 (958)
T ss_pred ccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhh
Confidence 00000 0 026788888888888888888888888888888888888888888888
Q ss_pred hhhhHhhH
Q psy11032 495 KDRAVTKQ 502 (514)
Q Consensus 495 h~~~~~~~ 502 (514)
||++|...
T Consensus 926 HMgtH~w~ 933 (958)
T KOG1074|consen 926 HMGTHMWV 933 (958)
T ss_pred hhcccccc
Confidence 88877643
No 2
>KOG1074|consensus
Probab=99.97 E-value=3.7e-31 Score=257.16 Aligned_cols=145 Identities=26% Similarity=0.544 Sum_probs=112.5
Q ss_pred ccccccccccCCChHHHHhHhhhhCCCCCcccccccccccChhHHHHHhhhhcCC----CCccCC---cchhhccChhhh
Q psy11032 169 IHECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGMHNGI----HPFTCH---FCNKSFTQKTGL 241 (514)
Q Consensus 169 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----~~~~C~---~C~~~f~~~~~l 241 (514)
.-+|-+|.++..-.+.|+.|.++|.+++||+|.+||+.|.++.+|+.||..|... ..|.|+ +|.+.|.+.-.|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 3579999999999999999999999999999999999999999999999988643 458899 999999999999
Q ss_pred hhhhhhhcCCCccccCCcccccccccCCCccccCCcccCCCCChhHHHHHHHhhCCC----------------C----Cc
Q psy11032 242 ERHLTIHNNEKKHQVSGRFSFNRAREGMPLIHECPTCGKKWRTRSELNKHTKTHSDL----------------R----PF 301 (514)
Q Consensus 242 ~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----------------~----~~ 301 (514)
..|+++|.+..-...... .+..-..-+|..|.+.|.+...+..++..+.+. . +.
T Consensus 685 pQhIriH~~~~~s~g~~a------~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~ 758 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTA------AEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPP 758 (958)
T ss_pred cceEEeecCCCCCCCccc------ccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCc
Confidence 999999986543222111 022222357999999998888887777665322 1 35
Q ss_pred ccccccccccCchHHHhH
Q psy11032 302 VCEICGQGYRTKTHLLVH 319 (514)
Q Consensus 302 ~C~~C~~~f~~~~~l~~H 319 (514)
.+..|+..+.....+..+
T Consensus 759 ~e~~~~~~~~~e~~i~~~ 776 (958)
T KOG1074|consen 759 PENSCGRELEGEMAISVR 776 (958)
T ss_pred cccccccccCcccccccc
Confidence 667777776665555444
No 3
>KOG3608|consensus
Probab=99.94 E-value=3.2e-27 Score=207.63 Aligned_cols=261 Identities=24% Similarity=0.418 Sum_probs=216.9
Q ss_pred Ccccc--cccccccC-hhHHHHHhhhhcC----------------------------------CCCccC--CcchhhccC
Q psy11032 197 PFVCE--ICGQGYRT-KTHLLVHVGMHNG----------------------------------IHPFTC--HFCNKSFTQ 237 (514)
Q Consensus 197 ~~~C~--~C~~~f~~-~~~l~~H~~~h~~----------------------------------~~~~~C--~~C~~~f~~ 237 (514)
.+.|. .|++...+ ...|.+|...|.- ...|.| ..|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 47775 47777666 5889999877620 012667 469999999
Q ss_pred hhhhhhhhhhhcCCCccccCCcccccccccCCCccccC--CcccCCCCChhHHHHHHHhhCCCCCcccccccccccCchH
Q psy11032 238 KTGLERHLTIHNNEKKHQVSGRFSFNRAREGMPLIHEC--PTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTH 315 (514)
Q Consensus 238 ~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 315 (514)
...+..|+..|.---.+. .+.........+.| ..|.+.+.++..|+.|++.|.+++...|+.||..|.++..
T Consensus 149 ~~ef~dHV~~H~l~ceyd------~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tk 222 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYD------IQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTK 222 (467)
T ss_pred HHHHHHHHHHhhhhhhhh------hhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccH
Confidence 999999998776432211 11111111112445 4599999999999999999999999999999999999999
Q ss_pred HHhHhhhc--cCCCCccCCcCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCccchhhhhhcccCCCCCce
Q psy11032 316 LLVHVGMH--NGIHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSESDTTLAEEDNLFHSTPIF 393 (514)
Q Consensus 316 l~~H~~~h--~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (514)
|..|.+.- ....+|.|..|.+.|.+...|..|+..|-. -|
T Consensus 223 l~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--------------------------------------~y 264 (467)
T KOG3608|consen 223 LFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--------------------------------------CY 264 (467)
T ss_pred HHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--------------------------------------cc
Confidence 99998754 345789999999999999999999998854 49
Q ss_pred eCCcCCCccCCchhhhhhcc-cccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCc--cchhcccchhHHHHhhh
Q psy11032 394 QCHLCGRCFNSPTTLTQHKI-VHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNV--CGARFAKKWNMQVHKQK 470 (514)
Q Consensus 394 ~C~~C~~~f~~~~~l~~H~~-~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~ 470 (514)
+|+.|+-+....++|..|++ .|..++||+|..|++.|...+.|.+|+.+|. +..|.|.. |.+.|.+..+|..|++.
T Consensus 265 kCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~e 343 (467)
T KOG3608|consen 265 KCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLE 343 (467)
T ss_pred cccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHH
Confidence 99999999999999999987 6889999999999999999999999999998 66799988 99999999999999999
Q ss_pred hc-CC--CCCCCccccccccccCCccchhhhHhhH
Q psy11032 471 VH-GP--DVRGTTESQLGMQVSSGGEKKDRAVTKQ 502 (514)
Q Consensus 471 ~h-~~--~~~~c~~c~~~~~~~~~~~~h~~~~~~~ 502 (514)
+| |. -+|+|..|++.|..--+|..|++..|+-
T Consensus 344 vhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f 378 (467)
T KOG3608|consen 344 VHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGF 378 (467)
T ss_pred hccCCCCCceeeecchhhhccchhHHHHHHHhhcc
Confidence 99 33 4699999999999999999998876653
No 4
>KOG3608|consensus
Probab=99.94 E-value=1.1e-26 Score=204.33 Aligned_cols=249 Identities=25% Similarity=0.523 Sum_probs=210.7
Q ss_pred ccc--ccccccCCC-hHHHHhHhhhhC-----------------C-----------------CCCcccc--cccccccCh
Q psy11032 170 HEC--PTCGKKWRT-RSELNKHTKTHS-----------------D-----------------LRPFVCE--ICGQGYRTK 210 (514)
Q Consensus 170 ~~C--~~C~~~f~~-~~~l~~H~~~h~-----------------~-----------------~~~~~C~--~C~~~f~~~ 210 (514)
++| ..|++...+ ..+|.+|...|- + ...|.|. .|+..|.+.
T Consensus 70 ~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s~ 149 (467)
T KOG3608|consen 70 HQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVSI 149 (467)
T ss_pred eeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccCH
Confidence 556 457777666 467999976652 0 0126775 599999999
Q ss_pred hHHHHHhhhhcC------------CC-CccC--CcchhhccChhhhhhhhhhhcCCCccccCCcccccccccCCCccccC
Q psy11032 211 THLLVHVGMHNG------------IH-PFTC--HFCNKSFTQKTGLERHLTIHNNEKKHQVSGRFSFNRAREGMPLIHEC 275 (514)
Q Consensus 211 ~~l~~H~~~h~~------------~~-~~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C 275 (514)
..+..|+..|.- ++ .+.| ..|...+.++..|++|++.|.+++. ..|
T Consensus 150 ~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKv-------------------vAC 210 (467)
T KOG3608|consen 150 VEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKV-------------------VAC 210 (467)
T ss_pred HHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeE-------------------Eec
Confidence 999999987631 11 2456 4599999999999999999999987 789
Q ss_pred CcccCCCCChhHHHHHHHh--hCCCCCcccccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhhhcC
Q psy11032 276 PTCGKKWRTRSELNKHTKT--HSDLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTIHNN 353 (514)
Q Consensus 276 ~~C~~~f~~~~~l~~H~~~--h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 353 (514)
+.||..|.+...|..|.+. .....+|.|..|.+.|.+...|..|+..|.. -|+|+.|+......++|..||+..+.
T Consensus 211 p~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs 288 (467)
T KOG3608|consen 211 PHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHS 288 (467)
T ss_pred chHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhc
Confidence 9999999999999999864 4456789999999999999999999998865 49999999999999999999998554
Q ss_pred CCCCCcchhhhhhcccccccCccchhhhhhcccCCCCCceeCCcCCCccCCchhhhhhcccccCCcccccCC--CcCCCC
Q psy11032 354 EKKHQSGRRQKRNQQHDRQRSESDTTLAEEDNLFHSTPIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDI--CGQALT 431 (514)
Q Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~--C~~~f~ 431 (514)
+. +||+|+.|++.|...+.|.+|+..|. +-.|+|.. |.++|.
T Consensus 289 ~d-----------------------------------kpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r 332 (467)
T KOG3608|consen 289 KD-----------------------------------KPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVR 332 (467)
T ss_pred cC-----------------------------------CCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHH
Confidence 32 27999999999999999999999998 67799999 999999
Q ss_pred CchhHHhHhhhcc-CC--ccccCCccchhcccchhHHHHhhhhcCCC
Q psy11032 432 TKPKLNDHMLLHT-GE--KPHECNVCGARFAKKWNMQVHKQKVHGPD 475 (514)
Q Consensus 432 ~~~~l~~H~~~h~-~~--~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 475 (514)
+...|++|++.++ |. .+|.|..|++.|++-.+|..|+.+.|+=+
T Consensus 333 ~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 333 TYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred HHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 9999999998766 43 46999999999999999999999998643
No 5
>KOG2462|consensus
Probab=99.94 E-value=2.3e-27 Score=203.89 Aligned_cols=134 Identities=33% Similarity=0.725 Sum_probs=112.1
Q ss_pred CCcccccccccccCchHHHhHhhhccC---CCCccCCcCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCc
Q psy11032 299 RPFVCEICGQGYRTKTHLLVHVGMHNG---IHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSE 375 (514)
Q Consensus 299 ~~~~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~ 375 (514)
..|+|..||+.+.+.++|.+|.+.|-. .+.+.|++|++.|.+-..|.+|+++|+-
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l---------------------- 186 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTL---------------------- 186 (279)
T ss_pred CceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCC----------------------
Confidence 347888888888888888888877754 4568888888888888888888888863
Q ss_pred cchhhhhhcccCCCCCceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccc
Q psy11032 376 SDTTLAEEDNLFHSTPIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCG 455 (514)
Q Consensus 376 ~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 455 (514)
+..|.+||+.|...+.|+-|+|+|+|||||.|+.|+++|...++|+.||++|.+.|.|.|..|+
T Consensus 187 ----------------~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~ 250 (279)
T KOG2462|consen 187 ----------------PCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCG 250 (279)
T ss_pred ----------------CcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchh
Confidence 5788888888888888888888888888888888888888888888888888888888888888
Q ss_pred hhcccchhHHHHhhh
Q psy11032 456 ARFAKKWNMQVHKQK 470 (514)
Q Consensus 456 ~~f~~~~~l~~H~~~ 470 (514)
++|..++-|.+|...
T Consensus 251 KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 251 KSFALKSYLNKHSES 265 (279)
T ss_pred hHHHHHHHHHHhhhh
Confidence 888888888888765
No 6
>KOG2462|consensus
Probab=99.94 E-value=1.4e-27 Score=205.30 Aligned_cols=137 Identities=30% Similarity=0.543 Sum_probs=127.9
Q ss_pred CCCccCCcCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCccchhhhhhcccCCCCCceeCCcCCCccCCc
Q psy11032 326 IHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSESDTTLAEEDNLFHSTPIFQCHLCGRCFNSP 405 (514)
Q Consensus 326 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~ 405 (514)
...|+|+.|++.+.+.+.|.+|..+|-... ..+.+.|+.|++.|.+.
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~---------------------------------s~ka~~C~~C~K~YvSm 174 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLD---------------------------------SKKAFSCKYCGKVYVSM 174 (279)
T ss_pred CCceeccccccccccccccchhhccccccc---------------------------------ccccccCCCCCceeeeh
Confidence 345999999999999999999999986532 13369999999999999
Q ss_pred hhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhHHHHhhhhcCCCCCCCcccccc
Q psy11032 406 TTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLG 485 (514)
Q Consensus 406 ~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~ 485 (514)
.+|..|+++|+ -+++|.+||+.|...+.|+-|+|+|+|||||.|+.|+++|.++++|+.|+++|.+.+.|+|..|+++
T Consensus 175 pALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~Ks 252 (279)
T KOG2462|consen 175 PALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKS 252 (279)
T ss_pred HHHhhHhhccC--CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhH
Confidence 99999999998 6799999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCccchhh
Q psy11032 486 MQVSSGGEKKDR 497 (514)
Q Consensus 486 ~~~~~~~~~h~~ 497 (514)
|...+.|..|..
T Consensus 253 Fsl~SyLnKH~E 264 (279)
T KOG2462|consen 253 FALKSYLNKHSE 264 (279)
T ss_pred HHHHHHHHHhhh
Confidence 999999999975
No 7
>KOG3623|consensus
Probab=99.89 E-value=9.9e-23 Score=195.36 Aligned_cols=119 Identities=23% Similarity=0.513 Sum_probs=94.0
Q ss_pred cccccchhhcCCchHHHHHHhhhc--CCCCccccccchhcCChhHHHHHHHHhhhcCCCCCCCCCchhhhhcccccccCc
Q psy11032 85 IKCEVCSLALLSKSHLKQHMRVHT--GERPHECEVCGKRFAKRSNLNAHKKKVTRHTGRDEVPSIKPLKRKRNISFLTGA 162 (514)
Q Consensus 85 ~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (514)
..|++|+..+.....|+.|++..+ .+..|.|..|.++|..+..|.+||..+..-... .... .
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dq--a~sl--------------t 274 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQ--AISL--------------T 274 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcc--cccc--------------c
Confidence 479999999999999999987543 345699999999999999999999854221111 0000 0
Q ss_pred CCCCCcccccccccccCCChHHHHhHhhhhCCCCCcccccccccccChhHHHHHhhh
Q psy11032 163 REGMPLIHECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGM 219 (514)
Q Consensus 163 ~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 219 (514)
..+...-|+|..|++.|..+-.|..|+|+|.|++||.|+.|+++|+....+..||..
T Consensus 275 qsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 275 QSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred chhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 011122399999999999999999999999999999999999999999999999854
No 8
>KOG3623|consensus
Probab=99.86 E-value=1.3e-22 Score=194.47 Aligned_cols=77 Identities=22% Similarity=0.450 Sum_probs=75.2
Q ss_pred cccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccchhh
Q psy11032 421 FKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDR 497 (514)
Q Consensus 421 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~ 497 (514)
|.|..|++.|...++|.+|.--|+|.+||+|.+|.++|..+--|..|.|.|.|++||.|..|++.|.+......||-
T Consensus 895 yaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 895 YACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred chHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhhc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999984
No 9
>KOG3576|consensus
Probab=99.76 E-value=1.1e-19 Score=147.99 Aligned_cols=112 Identities=23% Similarity=0.484 Sum_probs=107.0
Q ss_pred CCCceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhHHHHh
Q psy11032 389 STPIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHK 468 (514)
Q Consensus 389 ~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 468 (514)
....|.|.+|++.|.....|.+|++-|...+.|-|..||+.|.....|++|+++|+|.+||+|..|+++|+.+.+|..|+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 34579999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhcCC-----------CCCCCccccccccccCCccchhhhHh
Q psy11032 469 QKVHGP-----------DVRGTTESQLGMQVSSGGEKKDRAVT 500 (514)
Q Consensus 469 ~~~h~~-----------~~~~c~~c~~~~~~~~~~~~h~~~~~ 500 (514)
++.||. ++|.|+.||++.......-.|+..|+
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHH 236 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcC
Confidence 999976 57999999999999999999999887
No 10
>KOG3576|consensus
Probab=99.60 E-value=2.8e-16 Score=128.22 Aligned_cols=126 Identities=31% Similarity=0.677 Sum_probs=104.0
Q ss_pred CcccccccccccCCChHHHHhHhhhhCCCCCcccccccccccChhHHHHHhhhhcCCCCccCCcchhhccChhhhhhhhh
Q psy11032 167 PLIHECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLT 246 (514)
Q Consensus 167 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 246 (514)
...|.|.+|++.|.-...|.+|++-|...+.+.|..||+.|.....|++|+++|+|.+||+|..|++.|+..-.|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 45699999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred hhcCCCccccCCcccccccccCCCccccCCcccCCCCChhHHHHHHHhhCCCCCcccccccccccCchHHHhHhhhccCC
Q psy11032 247 IHNNEKKHQVSGRFSFNRAREGMPLIHECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGMHNGI 326 (514)
Q Consensus 247 ~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 326 (514)
.-|+.... | .. .....+.|+|+.||++-.....+..|++.|+..
T Consensus 195 kvhgv~~~------------------y-------------ay-----kerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 195 KVHGVQHQ------------------Y-------------AY-----KERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHcCchHH------------------H-------------HH-----HHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 87775421 0 00 011235577777777777777777777776655
Q ss_pred CC
Q psy11032 327 HP 328 (514)
Q Consensus 327 ~~ 328 (514)
.|
T Consensus 239 Sp 240 (267)
T KOG3576|consen 239 SP 240 (267)
T ss_pred CH
Confidence 44
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.22 E-value=2.4e-11 Score=119.28 Aligned_cols=146 Identities=24% Similarity=0.539 Sum_probs=95.5
Q ss_pred ccCCcccCCCCChhHHHHHHHhhCCCCCcccc--cccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhh
Q psy11032 273 HECPTCGKKWRTRSELNKHTKTHSDLRPFVCE--ICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTI 350 (514)
Q Consensus 273 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 350 (514)
-.|+.|...... ..|..|...-. ...-.|+ .|+..|. +..+..| +.|+.|++.|. ...|..|+..
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~ 474 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYCS-RHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKV 474 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhCC-CcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHh
Confidence 468888776553 34556654321 1224566 3777773 2333444 37888888775 5667888777
Q ss_pred hcCCCCCCcchhhhhhcccccccCccchhhhhhcccCCCCCceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCC
Q psy11032 351 HNNEKKHQSGRRQKRNQQHDRQRSESDTTLAEEDNLFHSTPIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQAL 430 (514)
Q Consensus 351 h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 430 (514)
++ . ++.|+ |+..+ ....|..|+.+|.+.+++.|++|++.|
T Consensus 475 ~H--k------------------------------------pv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v 514 (567)
T PLN03086 475 FH--E------------------------------------PLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMV 514 (567)
T ss_pred cC--C------------------------------------CccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCcc
Confidence 64 2 47787 87654 557777888778888888888888777
Q ss_pred CC----------chhHHhHhhhccCCccccCCccchhcccchhHHHHhhhhcC
Q psy11032 431 TT----------KPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHKQKVHG 473 (514)
Q Consensus 431 ~~----------~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 473 (514)
.. .+.|..|.... |.+++.|..||+.+. ...|..|+...|.
T Consensus 515 ~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vr-lrdm~~H~~~~h~ 565 (567)
T PLN03086 515 QAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVM-LKEMDIHQIAVHQ 565 (567)
T ss_pred ccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeee-ehhHHHHHHHhhc
Confidence 41 34677777774 777788888887764 4567777777664
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.17 E-value=2.9e-11 Score=118.71 Aligned_cols=102 Identities=20% Similarity=0.378 Sum_probs=87.5
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhccc----------c
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAK----------K 461 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~----------~ 461 (514)
.+.|+.|++.|. ...|..|+.+|+ .++.|+ |++.+ ....|..|+.+|.+.+++.|++|++.|.. .
T Consensus 453 H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~ 527 (567)
T PLN03086 453 HVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRL 527 (567)
T ss_pred CccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhh
Confidence 478999999996 678999999985 889999 99755 67999999999999999999999999952 4
Q ss_pred hhHHHHhhhhcCCCCCCCccccccccccCCccchhhhHh
Q psy11032 462 WNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDRAVT 500 (514)
Q Consensus 462 ~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~~~~ 500 (514)
..|..|... .|.+++.|..||+.++.+ +|..|+-..|
T Consensus 528 s~Lt~HE~~-CG~rt~~C~~Cgk~Vrlr-dm~~H~~~~h 564 (567)
T PLN03086 528 RGMSEHESI-CGSRTAPCDSCGRSVMLK-EMDIHQIAVH 564 (567)
T ss_pred hhHHHHHHh-cCCcceEccccCCeeeeh-hHHHHHHHhh
Confidence 589999887 599999999999997765 5777876555
No 13
>PHA00733 hypothetical protein
Probab=99.04 E-value=1.2e-10 Score=92.96 Aligned_cols=83 Identities=18% Similarity=0.319 Sum_probs=66.8
Q ss_pred CCceeCCcCCCccCCchhhhhh--cc---cccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhH
Q psy11032 390 TPIFQCHLCGRCFNSPTTLTQH--KI---VHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNM 464 (514)
Q Consensus 390 ~~~~~C~~C~~~f~~~~~l~~H--~~---~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 464 (514)
.+.+.|.+|+..|.....|..| +. .+.+.+||.|+.|++.|.+...|..|++.| ..+|.|..|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 4468899999888887777665 22 334578899999999999999999998876 3568999999999999999
Q ss_pred HHHhhhhcCC
Q psy11032 465 QVHKQKVHGP 474 (514)
Q Consensus 465 ~~H~~~~h~~ 474 (514)
..|+++.||-
T Consensus 116 ~~H~~~~h~~ 125 (128)
T PHA00733 116 LDHVCKKHNI 125 (128)
T ss_pred HHHHHHhcCc
Confidence 9999888863
No 14
>KOG3993|consensus
Probab=99.03 E-value=5.5e-11 Score=108.66 Aligned_cols=193 Identities=17% Similarity=0.216 Sum_probs=112.7
Q ss_pred CcccccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCccchh
Q psy11032 300 PFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSESDTT 379 (514)
Q Consensus 300 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 379 (514)
.|+|.+|...|.+...|..|.-.-.--..|+|+.|++.|.-..+|..|.|+|................. ++....
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~-----~~~rae 341 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQA-----VETRAE 341 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhh-----hhhhhh
Confidence 499999999999999999996322222349999999999999999999999975432221000000000 000000
Q ss_pred hhhh--cccCCCCCceeCCcCCCccCCchhhhhhcccccCCc------c-----------cccCCCcCCCCCchhHHhHh
Q psy11032 380 LAEE--DNLFHSTPIFQCHLCGRCFNSPTTLTQHKIVHTGER------R-----------FKCDICGQALTTKPKLNDHM 440 (514)
Q Consensus 380 ~~~~--~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~------~-----------~~C~~C~~~f~~~~~l~~H~ 440 (514)
..+. ...-..+..|.|.+|++.|+....|+.|+.+|...- + +-|..|.-.+.....--.+.
T Consensus 342 ~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 342 VQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV 421 (500)
T ss_pred hhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce
Confidence 0000 001122337999999999999999999988775310 0 23444433332222111111
Q ss_pred hhccC-CccccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccchhh
Q psy11032 441 LLHTG-EKPHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDR 497 (514)
Q Consensus 441 ~~h~~-~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~ 497 (514)
..+.+ .....|++|+.-+.++..-.-+.+.-+-...|.|.+|--+|.+.-+|..|+-
T Consensus 422 l~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin 479 (500)
T KOG3993|consen 422 LYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHIN 479 (500)
T ss_pred eeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhh
Confidence 11111 1124566666666666655555555555566777777777777777777655
No 15
>KOG3993|consensus
Probab=98.94 E-value=2.3e-10 Score=104.60 Aligned_cols=158 Identities=23% Similarity=0.360 Sum_probs=108.7
Q ss_pred CCcccccccccccChhHHHHHhhhhcCCCCccCCcchhhccChhhhhhhhhhhcCCCcccc----------CCc----cc
Q psy11032 196 RPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQV----------SGR----FS 261 (514)
Q Consensus 196 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~----------~~~----~~ 261 (514)
..|+|.+|...|.+...|.+|.-..--...|+|+.|++.|....+|..|.+.|........ +.. ..
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 3599999999999999999997543334469999999999999999999998865432210 000 00
Q ss_pred ccccccCCCccccCCcccCCCCChhHHHHHHHhhCCCC------------------------------------------
Q psy11032 262 FNRAREGMPLIHECPTCGKKWRTRSELNKHTKTHSDLR------------------------------------------ 299 (514)
Q Consensus 262 ~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~------------------------------------------ 299 (514)
.....+..+.+|.|..|+++|....-|+.|+.+|+...
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a 425 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA 425 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence 11222344557999999999999999999987764311
Q ss_pred ----CcccccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhhhcC
Q psy11032 300 ----PFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTIHNN 353 (514)
Q Consensus 300 ----~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 353 (514)
.-.|..|+-.+.++..-..+.+.-..+.-|.|.+|.-.|.+...|.+|+...|.
T Consensus 426 ~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 426 GSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred ccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 012444444444433333333333344558999999999999999999877553
No 16
>PHA00733 hypothetical protein
Probab=98.93 E-value=2.9e-10 Score=90.76 Aligned_cols=95 Identities=9% Similarity=0.140 Sum_probs=75.3
Q ss_pred chhhhhhcccccCCcccccCCCcCCCCCchhHHhH--h---hhccCCccccCCccchhcccchhHHHHhhhhcCCCCCCC
Q psy11032 405 PTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDH--M---LLHTGEKPHECNVCGARFAKKWNMQVHKQKVHGPDVRGT 479 (514)
Q Consensus 405 ~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H--~---~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c 479 (514)
.+.|..+-..-...+++.|.+|+..|.....|..| + ..+++.+||.|+.|++.|.....|..|++.+ ..+|.|
T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C 102 (128)
T PHA00733 25 LEELKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVC 102 (128)
T ss_pred HHHhhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccC
Confidence 34444433333345789999999999888777766 2 2345688999999999999999999999864 457999
Q ss_pred ccccccccccCCccchhhhHhh
Q psy11032 480 TESQLGMQVSSGGEKKDRAVTK 501 (514)
Q Consensus 480 ~~c~~~~~~~~~~~~h~~~~~~ 501 (514)
..|+++|.....|..|++..|+
T Consensus 103 ~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 103 PVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCCccCCHHHHHHHHHHhcC
Confidence 9999999999999999987664
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.76 E-value=2.7e-09 Score=69.07 Aligned_cols=22 Identities=27% Similarity=0.715 Sum_probs=8.3
Q ss_pred ccCCCcCCCCCchhHHhHhhhc
Q psy11032 422 KCDICGQALTTKPKLNDHMLLH 443 (514)
Q Consensus 422 ~C~~C~~~f~~~~~l~~H~~~h 443 (514)
.|+.||+.|+..+.|..|+++|
T Consensus 7 ~C~~CGK~Fs~~~~L~~H~r~H 28 (55)
T PHA02768 7 ECPICGEIYIKRKSMITHLRKH 28 (55)
T ss_pred CcchhCCeeccHHHHHHHHHhc
Confidence 3333333333333333333333
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.67 E-value=1.1e-08 Score=66.23 Aligned_cols=44 Identities=18% Similarity=0.206 Sum_probs=40.9
Q ss_pred cccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCcc
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGE 493 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~ 493 (514)
.|.|+.||+.|+..+.|..|+++|+ ++++|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999988 6999999999999888874
No 19
>PHA00616 hypothetical protein
Probab=98.48 E-value=6.1e-08 Score=59.56 Aligned_cols=42 Identities=24% Similarity=0.256 Sum_probs=33.2
Q ss_pred cccCCccchhcccchhHHHHhhhhcCCCCCCCcccccccccc
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVS 489 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~ 489 (514)
||+|+.||+.|.+++.|.+|+++|||.+++.|++==..|+++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f~v~ 42 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYFRVR 42 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEEEee
Confidence 578888888888888888888888888888887765555543
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.47 E-value=5.5e-08 Score=53.71 Aligned_cols=24 Identities=54% Similarity=1.032 Sum_probs=12.8
Q ss_pred HHhHhhhccCCccccCCccchhcc
Q psy11032 436 LNDHMLLHTGEKPHECNVCGARFA 459 (514)
Q Consensus 436 l~~H~~~h~~~~~~~C~~C~~~f~ 459 (514)
|.+|+++|+|++||.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555553
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.44 E-value=4.3e-08 Score=54.12 Aligned_cols=26 Identities=38% Similarity=0.917 Sum_probs=23.8
Q ss_pred hhhhhcccccCCcccccCCCcCCCCC
Q psy11032 407 TLTQHKIVHTGERRFKCDICGQALTT 432 (514)
Q Consensus 407 ~l~~H~~~H~~~~~~~C~~C~~~f~~ 432 (514)
+|.+|+++|++++||+|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 48899999999999999999999863
No 22
>PHA00616 hypothetical protein
Probab=98.36 E-value=1e-07 Score=58.54 Aligned_cols=34 Identities=18% Similarity=0.385 Sum_probs=27.8
Q ss_pred ccccCCCcCCCCCchhHHhHhhhccCCccccCCc
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLLHTGEKPHECNV 453 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 453 (514)
||+|+.||+.|...+.|.+|++.|+|++++.|..
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5788888888888888888888888888887754
No 23
>KOG1146|consensus
Probab=98.28 E-value=1.2e-06 Score=92.27 Aligned_cols=130 Identities=17% Similarity=0.256 Sum_probs=77.1
Q ss_pred ccccCCChhHHHHHHh-hccCCCCcccccchhhcCCchHHHHHHhhhc-------------------------CCCCccc
Q psy11032 62 CGKGFIHLKSYQIHKM-IHSGERHIKCEVCSLALLSKSHLKQHMRVHT-------------------------GERPHEC 115 (514)
Q Consensus 62 C~~~f~~~~~l~~H~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~-------------------------~~~~~~C 115 (514)
|+..+.....+..|+. .++..+.++|+.|++.++....|..||+..+ +..+|.|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C 521 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPC 521 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccc
Confidence 3333444444444433 3445577788888888888888888887622 2367999
Q ss_pred cccchhcCChhHHHHHHHHhhhcCCCCCC---------CCC----chhhhhcccccccCcC-CCCCcccccccccccCCC
Q psy11032 116 EVCGKRFAKRSNLNAHKKKVTRHTGRDEV---------PSI----KPLKRKRNISFLTGAR-EGMPLIHECPTCGKKWRT 181 (514)
Q Consensus 116 ~~C~~~f~~~~~l~~H~~~~~~~~~~~~~---------~~~----~~~~~~~~~~~~~~~~-~~~~~~~~C~~C~~~f~~ 181 (514)
..|+..+.....|..|++..-......+. ... .......+.....+.. ......|.|.+|++.-+-
T Consensus 522 ~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetni 601 (1406)
T KOG1146|consen 522 RACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNI 601 (1406)
T ss_pred eeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhh
Confidence 99999999999999999854221110000 000 0000011111111111 223345899999999999
Q ss_pred hHHHHhHhhh
Q psy11032 182 RSELNKHTKT 191 (514)
Q Consensus 182 ~~~l~~H~~~ 191 (514)
...|+.||..
T Consensus 602 arnlrihmts 611 (1406)
T KOG1146|consen 602 ARNLRIHMTA 611 (1406)
T ss_pred hhcccccccc
Confidence 9999999854
No 24
>PHA00732 hypothetical protein
Probab=98.15 E-value=1.3e-06 Score=62.84 Aligned_cols=43 Identities=35% Similarity=0.642 Sum_probs=23.0
Q ss_pred ccccCCCcCCCCCchhHHhHhhh-ccCCccccCCccchhcccchhHHHHh
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLL-HTGEKPHECNVCGARFAKKWNMQVHK 468 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~ 468 (514)
||.|+.|++.|.+...|..|++. |. ++.|+.||+.|. .+..|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhh
Confidence 35555555555555555555553 33 235666665554 345555
No 25
>PHA00732 hypothetical protein
Probab=98.09 E-value=1.8e-06 Score=62.15 Aligned_cols=46 Identities=11% Similarity=0.244 Sum_probs=40.0
Q ss_pred cccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccchhhh
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDRA 498 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~~ 498 (514)
||.|..|++.|.....|+.|++.+|. ++.|+.|++.|. .+..|.++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT--LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC--CCccCCCCCEeC---Chhhhhcc
Confidence 68999999999999999999986554 368999999997 68888754
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.01 E-value=6.7e-06 Score=54.90 Aligned_cols=50 Identities=28% Similarity=0.543 Sum_probs=30.7
Q ss_pred cccCCCcCCCCCchhHHhHhhhcc-CC-ccccCCccchhcccchhHHHHhhhhcC
Q psy11032 421 FKCDICGQALTTKPKLNDHMLLHT-GE-KPHECNVCGARFAKKWNMQVHKQKVHG 473 (514)
Q Consensus 421 ~~C~~C~~~f~~~~~l~~H~~~h~-~~-~~~~C~~C~~~f~~~~~l~~H~~~~h~ 473 (514)
|.||+|++. .+...|..|+...| .+ +.+.|++|...+. .+|..|+...|+
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 667777773 44566777765433 22 3577777776544 377777776664
No 27
>KOG1146|consensus
Probab=97.88 E-value=1.7e-05 Score=84.00 Aligned_cols=158 Identities=15% Similarity=0.145 Sum_probs=99.4
Q ss_pred cccchhhcCCchHHHHHHh-hhcCCCCccccccchhcCChhHHHHHHHHhhhcCCCCCCCCCchhhhhcccccccCcCCC
Q psy11032 87 CEVCSLALLSKSHLKQHMR-VHTGERPHECEVCGKRFAKRSNLNAHKKKVTRHTGRDEVPSIKPLKRKRNISFLTGAREG 165 (514)
Q Consensus 87 C~~C~~~f~~~~~l~~H~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (514)
|.-|+..+.+...+..|+. .+.-.+.|+|+.|++.|.....|..|||..|......-.....+....... .....
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg----~~~~~ 514 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARG----EVYRC 514 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhcccccccccc----ccccC
Confidence 4445556666666666654 344457899999999999999999999976544332111111111111110 01111
Q ss_pred CCcccccccccccCCChHHHHhHhhhhC-------------------------------------------CCCCccccc
Q psy11032 166 MPLIHECPTCGKKWRTRSELNKHTKTHS-------------------------------------------DLRPFVCEI 202 (514)
Q Consensus 166 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~-------------------------------------------~~~~~~C~~ 202 (514)
....|.|..|.+.+.+...|-+|+..-. ....|.|.+
T Consensus 515 ~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~v 594 (1406)
T KOG1146|consen 515 PGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEV 594 (1406)
T ss_pred CCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhh
Confidence 2234899999999999999999985421 011388999
Q ss_pred ccccccChhHHHHHhhhh-cCCCCccCCcchhhccChhhhhhhhhhh
Q psy11032 203 CGQGYRTKTHLLVHVGMH-NGIHPFTCHFCNKSFTQKTGLERHLTIH 248 (514)
Q Consensus 203 C~~~f~~~~~l~~H~~~h-~~~~~~~C~~C~~~f~~~~~l~~H~~~h 248 (514)
|++..+-..+|+.||..- +...|.-+-.++-.+.....+..+.+.+
T Consensus 595 c~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 595 CSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred hcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCCCC
Confidence 999988888999998753 3333366666666666655555555544
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.63 E-value=3.1e-05 Score=51.64 Aligned_cols=51 Identities=10% Similarity=0.075 Sum_probs=42.3
Q ss_pred cccCCccchhcccchhHHHHhhhhcCC--CCCCCccccccccccCCccchhhhHhh
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQKVHGP--DVRGTTESQLGMQVSSGGEKKDRAVTK 501 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~~h~~--~~~~c~~c~~~~~~~~~~~~h~~~~~~ 501 (514)
.|.|++|++ ..+..+|..|+...|.. +.+.|+.|...+. .+|..|+..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 489999999 55678999999999875 4699999998644 599999987764
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.53 E-value=2.5e-05 Score=41.79 Aligned_cols=20 Identities=35% Similarity=0.878 Sum_probs=8.9
Q ss_pred cCCccchhcccchhHHHHhh
Q psy11032 450 ECNVCGARFAKKWNMQVHKQ 469 (514)
Q Consensus 450 ~C~~C~~~f~~~~~l~~H~~ 469 (514)
.|+.|++.|.++..|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 2 KCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCccCCHHHHHHHHh
Confidence 34444444444444444443
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.44 E-value=6.6e-05 Score=40.63 Aligned_cols=24 Identities=29% Similarity=0.737 Sum_probs=13.5
Q ss_pred ccCCccchhcccchhHHHHhhhhc
Q psy11032 449 HECNVCGARFAKKWNMQVHKQKVH 472 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~~~h 472 (514)
|.|++|++.|.+...|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666554
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.39 E-value=8.7e-05 Score=39.68 Aligned_cols=22 Identities=32% Similarity=0.841 Sum_probs=10.9
Q ss_pred cccccccccccChhHHHHHhhh
Q psy11032 198 FVCEICGQGYRTKTHLLVHVGM 219 (514)
Q Consensus 198 ~~C~~C~~~f~~~~~l~~H~~~ 219 (514)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555443
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.28 E-value=9.8e-05 Score=39.91 Aligned_cols=23 Identities=43% Similarity=1.015 Sum_probs=13.3
Q ss_pred cccCCCcCCCCCchhHHhHhhhc
Q psy11032 421 FKCDICGQALTTKPKLNDHMLLH 443 (514)
Q Consensus 421 ~~C~~C~~~f~~~~~l~~H~~~h 443 (514)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45666666666666666666654
No 33
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.21 E-value=0.00012 Score=65.15 Aligned_cols=53 Identities=28% Similarity=0.716 Sum_probs=35.7
Q ss_pred CcccccCC--CcCCCCCchhHHhHhhh-c------------------cCCccccCCccchhcccchhHHHHhhh
Q psy11032 418 ERRFKCDI--CGQALTTKPKLNDHMLL-H------------------TGEKPHECNVCGARFAKKWNMQVHKQK 470 (514)
Q Consensus 418 ~~~~~C~~--C~~~f~~~~~l~~H~~~-h------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~ 470 (514)
+|||+|++ |+++++....|+-|+.- | ...|||+|++|++++.+...|+-|...
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 36666666 66666666666666542 2 123678888888888888888877643
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.21 E-value=0.00015 Score=40.55 Aligned_cols=25 Identities=32% Similarity=0.704 Sum_probs=15.6
Q ss_pred cccCCccchhcccchhHHHHhhhhc
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQKVH 472 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~~h 472 (514)
||.|..|++.|.+..+|..|++.++
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4566666666666666666665554
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.20 E-value=0.00016 Score=55.80 Aligned_cols=72 Identities=22% Similarity=0.380 Sum_probs=20.1
Q ss_pred eCCcCCCccCCchhhhhhcccccC-CcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhHHHHhhhh
Q psy11032 394 QCHLCGRCFNSPTTLTQHKIVHTG-ERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHKQKV 471 (514)
Q Consensus 394 ~C~~C~~~f~~~~~l~~H~~~H~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 471 (514)
+|..|+..|.+...|..|+...++ ..+ ....+.....+..+++.-.. ..+.|..|++.|.+..+|..|++.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~~~-~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKKVK-ESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccccccC-CCCCCCccCCCCcCHHHHHHHHcCc
Confidence 488888888888888888764433 221 11122233444444433222 2588999999999999999998865
No 36
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.14 E-value=0.00021 Score=55.23 Aligned_cols=22 Identities=32% Similarity=0.608 Sum_probs=0.0
Q ss_pred ccccchhcCChhHHHHHHHHhh
Q psy11032 115 CEVCGKRFAKRSNLNAHKKKVT 136 (514)
Q Consensus 115 C~~C~~~f~~~~~l~~H~~~~~ 136 (514)
|..|+..|.+...|..|+...|
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H 23 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKH 23 (100)
T ss_dssp ----------------------
T ss_pred cccccccccccccccccccccc
Confidence 5566666666666666665443
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.13 E-value=0.00016 Score=40.43 Aligned_cols=25 Identities=32% Similarity=0.756 Sum_probs=14.6
Q ss_pred ccccCCCcCCCCCchhHHhHhhhcc
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLLHT 444 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~h~ 444 (514)
||.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 3556666666666666666665554
No 38
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.92 E-value=0.00043 Score=37.36 Aligned_cols=24 Identities=25% Similarity=0.452 Sum_probs=13.7
Q ss_pred ccCCccchhcccchhHHHHhhhhcC
Q psy11032 449 HECNVCGARFAKKWNMQVHKQKVHG 473 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~~~h~ 473 (514)
|+|+.|++... ...|..|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 45666666665 6666666666653
No 39
>KOG2231|consensus
Probab=96.80 E-value=0.0013 Score=66.38 Aligned_cols=140 Identities=18% Similarity=0.366 Sum_probs=90.4
Q ss_pred cccccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCccchhh
Q psy11032 301 FVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSESDTTL 380 (514)
Q Consensus 301 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 380 (514)
+.|.+|++.|.-.. ..-.|..| -.|.+...|+.|+..-|..
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~~-------------------------- 140 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHKL-------------------------- 140 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhhh--------------------------
Confidence 78999998874321 11368888 7888999999999654432
Q ss_pred hhhcccCCCCCceeCCcC----------CCccCCchhhhhhcccc-cCCcc----cccCCCcCCCCCchhHHhHhhhccC
Q psy11032 381 AEEDNLFHSTPIFQCHLC----------GRCFNSPTTLTQHKIVH-TGERR----FKCDICGQALTTKPKLNDHMLLHTG 445 (514)
Q Consensus 381 ~~~~~~~~~~~~~~C~~C----------~~~f~~~~~l~~H~~~H-~~~~~----~~C~~C~~~f~~~~~l~~H~~~h~~ 445 (514)
+.|.+| .+.+ +...|..|+..- .+++. -.|..|...|.....|.+|++.++
T Consensus 141 ------------~~c~lC~~~~kif~~e~k~Y-t~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h- 206 (669)
T KOG2231|consen 141 ------------HLCSLCLQNLKIFINERKLY-TRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH- 206 (669)
T ss_pred ------------hccccccccceeeeeeeehe-hHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce-
Confidence 333333 2222 334555664421 12222 368888888888888888887766
Q ss_pred CccccCCccc------hhcccchhHHHHhhhhcCCCCCCCc--cccccccccC-CccchhhhHh
Q psy11032 446 EKPHECNVCG------ARFAKKWNMQVHKQKVHGPDVRGTT--ESQLGMQVSS-GGEKKDRAVT 500 (514)
Q Consensus 446 ~~~~~C~~C~------~~f~~~~~l~~H~~~~h~~~~~~c~--~c~~~~~~~~-~~~~h~~~~~ 500 (514)
|.|.+|+ ..|.+-..|..|.|..| |.|+ .|.-...... .+..|++.|+
T Consensus 207 ---~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk~~~ 263 (669)
T KOG2231|consen 207 ---EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEIELKAHN 263 (669)
T ss_pred ---eheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHHHHHhhc
Confidence 5677763 55777888888888887 7777 6765544444 6666666444
No 40
>KOG2231|consensus
Probab=96.79 E-value=0.0024 Score=64.62 Aligned_cols=46 Identities=22% Similarity=0.452 Sum_probs=25.1
Q ss_pred ccccccccCCChHHHHhHhhhhCCCCCccccccc------ccccChhHHHHHhhhh
Q psy11032 171 ECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICG------QGYRTKTHLLVHVGMH 220 (514)
Q Consensus 171 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~------~~f~~~~~l~~H~~~h 220 (514)
.|..|...|-....|.+|++.++ |.|..|. .-|.....|..|.+.+
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~ 235 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKG 235 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhc
Confidence 46666666666666666665443 3444442 2344455565665543
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.64 E-value=0.00067 Score=36.58 Aligned_cols=24 Identities=29% Similarity=0.638 Sum_probs=15.6
Q ss_pred cccCCCcCCCCCchhHHhHhhhccC
Q psy11032 421 FKCDICGQALTTKPKLNDHMLLHTG 445 (514)
Q Consensus 421 ~~C~~C~~~f~~~~~l~~H~~~h~~ 445 (514)
|+|+.|++... ...|.+|++.|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777777776 7777777776653
No 42
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.64 E-value=0.0013 Score=58.90 Aligned_cols=71 Identities=18% Similarity=0.516 Sum_probs=42.6
Q ss_pred CCCCccCCc--CcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccccCccchhhhhhcccCCCCCceeCCcCCCcc
Q psy11032 325 GIHPFTCHF--CNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQRSESDTTLAEEDNLFHSTPIFQCHLCGRCF 402 (514)
Q Consensus 325 ~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f 402 (514)
+++||+|++ |.+.+.+...|+-|+.--|..+.... ..............+||+|.+|++.+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~-----------------~p~p~~~~~F~~~~KPYrCevC~KRY 408 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHE-----------------NPSPEKMNIFSAKDKPYRCEVCDKRY 408 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCC-----------------CCCccccccccccCCceeccccchhh
Confidence 458999987 99999999999999875332211000 00000111112334567777777777
Q ss_pred CCchhhhhhc
Q psy11032 403 NSPTTLTQHK 412 (514)
Q Consensus 403 ~~~~~l~~H~ 412 (514)
.....|+.|.
T Consensus 409 KNlNGLKYHr 418 (423)
T COG5189 409 KNLNGLKYHR 418 (423)
T ss_pred ccCccceecc
Confidence 7777777664
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.56 E-value=0.0014 Score=41.36 Aligned_cols=30 Identities=27% Similarity=0.553 Sum_probs=11.9
Q ss_pred CcccccCCCcCCCCCchhHHhHhhhccCCc
Q psy11032 418 ERRFKCDICGQALTTKPKLNDHMLLHTGEK 447 (514)
Q Consensus 418 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 447 (514)
+.|-.|++|+..+.+..+|++|+.++|+.+
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 344445555555555555555554444433
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.29 E-value=0.0028 Score=34.71 Aligned_cols=19 Identities=32% Similarity=0.688 Sum_probs=7.9
Q ss_pred CCccchhcccchhHHHHhh
Q psy11032 451 CNVCGARFAKKWNMQVHKQ 469 (514)
Q Consensus 451 C~~C~~~f~~~~~l~~H~~ 469 (514)
|..|++.|.....|..|++
T Consensus 3 C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 3 CPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCcchhCCHHHHHHHHH
Confidence 4444444444444444433
No 45
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.22 E-value=0.0033 Score=39.71 Aligned_cols=40 Identities=23% Similarity=0.392 Sum_probs=24.2
Q ss_pred HhHhhhc-cCCccccCCccchhcccchhHHHHhhhhcCCCC
Q psy11032 437 NDHMLLH-TGEKPHECNVCGARFAKKWNMQVHKQKVHGPDV 476 (514)
Q Consensus 437 ~~H~~~h-~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~ 476 (514)
..+.+.+ ..+.|-.|++|+..+....+|++|+...|+.+|
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 3444433 345788999999999999999999999998764
No 46
>PRK04860 hypothetical protein; Provisional
Probab=96.07 E-value=0.003 Score=52.55 Aligned_cols=36 Identities=31% Similarity=0.788 Sum_probs=18.6
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALT 431 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 431 (514)
+|.|. |+. ....+.+|.++|+++++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 45554 554 4444555555555555555555555443
No 47
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.95 E-value=0.0047 Score=33.75 Aligned_cols=22 Identities=32% Similarity=0.859 Sum_probs=14.1
Q ss_pred cccCCCcCCCCCchhHHhHhhh
Q psy11032 421 FKCDICGQALTTKPKLNDHMLL 442 (514)
Q Consensus 421 ~~C~~C~~~f~~~~~l~~H~~~ 442 (514)
|.|+.|++.|.....|..|++.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~ 22 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRT 22 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHH
Confidence 4566666666666666666653
No 48
>KOG2785|consensus
Probab=95.79 E-value=0.015 Score=54.03 Aligned_cols=78 Identities=24% Similarity=0.398 Sum_probs=43.5
Q ss_pred ccCCcccCCCCChhHHHHHHHh--hC-CCC--Cc-ccccccccccCchHHHhHhh---hccCCCCccCCcCcccCCChHH
Q psy11032 273 HECPTCGKKWRTRSELNKHTKT--HS-DLR--PF-VCEICGQGYRTKTHLLVHVG---MHNGIHPFTCHFCNKSFTQKTG 343 (514)
Q Consensus 273 ~~C~~C~~~f~~~~~l~~H~~~--h~-~~~--~~-~C~~C~~~f~~~~~l~~H~~---~h~~~~~~~C~~C~~~f~~~~~ 343 (514)
|.|.-|...|.+....+.|+++ |. +.+ .+ .-++--..|..+-....-.. .-.+.-++.|..|.+.|.+...
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~a 83 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPKA 83 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChhh
Confidence 8999999999999999999864 21 100 00 01111112222111100000 1233445778888888888887
Q ss_pred HHHHhhh
Q psy11032 344 LERHLTI 350 (514)
Q Consensus 344 l~~H~~~ 350 (514)
...|+..
T Consensus 84 ~~~hl~S 90 (390)
T KOG2785|consen 84 HENHLKS 90 (390)
T ss_pred HHHHHHH
Confidence 7777654
No 49
>KOG2785|consensus
Probab=95.73 E-value=0.024 Score=52.81 Aligned_cols=51 Identities=18% Similarity=0.315 Sum_probs=34.4
Q ss_pred ccccCCCcCCCCCchhHHhHhhhccCC-----------------------ccccCCccc---hhcccchhHHHHhhh
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLLHTGE-----------------------KPHECNVCG---ARFAKKWNMQVHKQK 470 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~~ 470 (514)
|-.|-+|+..|.+...-..||..+||- ..|.|-.|+ +.|.+..++++||..
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 345566666666666666666655552 137888888 888888999998764
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.67 E-value=0.0055 Score=33.31 Aligned_cols=21 Identities=33% Similarity=0.602 Sum_probs=11.3
Q ss_pred cCCcccccCCChhHHHHHHhh
Q psy11032 58 RCDLCGKGFIHLKSYQIHKMI 78 (514)
Q Consensus 58 ~C~~C~~~f~~~~~l~~H~~~ 78 (514)
.|.+|++.|.+...|+.|++.
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCcCCHHHHHHHHCc
Confidence 455555555555555555543
No 51
>KOG2482|consensus
Probab=95.64 E-value=0.042 Score=50.10 Aligned_cols=180 Identities=19% Similarity=0.250 Sum_probs=86.6
Q ss_pred ccccccccccCC-ChHHHHhHhhhhCCC----------------------CCcccccccccccChhHHHHHhhh--hcCC
Q psy11032 169 IHECPTCGKKWR-TRSELNKHTKTHSDL----------------------RPFVCEICGQGYRTKTHLLVHVGM--HNGI 223 (514)
Q Consensus 169 ~~~C~~C~~~f~-~~~~l~~H~~~h~~~----------------------~~~~C~~C~~~f~~~~~l~~H~~~--h~~~ 223 (514)
..+|-.|...+. +++....|+-.-++. ..+.|-.|.+.|+.+..|+.||+. |...
T Consensus 144 slqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrri 223 (423)
T KOG2482|consen 144 SLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRRI 223 (423)
T ss_pred eeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCccc
Confidence 478999988775 467788887543321 237788888888888888888864 4333
Q ss_pred CCccCCc-----chhhccChhhhhhhhhh--hcCCCccccCCcccccccccCCCccccCCcccCCCCChhHHHHHHHhhC
Q psy11032 224 HPFTCHF-----CNKSFTQKTGLERHLTI--HNNEKKHQVSGRFSFNRAREGMPLIHECPTCGKKWRTRSELNKHTKTHS 296 (514)
Q Consensus 224 ~~~~C~~-----C~~~f~~~~~l~~H~~~--h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 296 (514)
.|-.=.+ =++.-..++....|... -..+......+..+--......+....|-.|.....+...|..||.+-+
T Consensus 224 nPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~vH 303 (423)
T KOG2482|consen 224 NPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMKIVH 303 (423)
T ss_pred CCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHHHHH
Confidence 3310000 01111111222222221 0100000001111111111122233678889888888888999987643
Q ss_pred CCCCccc-ccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHHhhh
Q psy11032 297 DLRPFVC-EICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERHLTI 350 (514)
Q Consensus 297 ~~~~~~C-~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 350 (514)
.-.-.+= ..=+..|...-.+..-.|. ......|-.|.-.|.....|..||..
T Consensus 304 e~Dl~Ki~sd~~Ln~YqrvrviNyiRk--q~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 304 EFDLLKIQSDYSLNFYQRVRVINYIRK--QKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred HhhHHhhccccccchhhhhhHHHHHHH--HhhccccccccccccCcchhhhhccc
Confidence 2100000 0000011111111111111 11224566777788888888888764
No 52
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.61 E-value=0.015 Score=52.89 Aligned_cols=72 Identities=26% Similarity=0.546 Sum_probs=43.3
Q ss_pred ccccccccCCChHHHHhHhhhhCCCCCcccccccc-------cccChhHHHHHhhhhcCCCCccCCc--ch----hhccC
Q psy11032 171 ECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQ-------GYRTKTHLLVHVGMHNGIHPFTCHF--CN----KSFTQ 237 (514)
Q Consensus 171 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~ 237 (514)
.|..|...|-+.+.|..|++..+. .|-+|++ -|.+...|..|.+.. -|.|.+ |. ..|+.
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~ 293 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPY 293 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEecc
Confidence 477777777777777777765542 3444543 255666677776532 244432 32 46777
Q ss_pred hhhhhhhhhhhcC
Q psy11032 238 KTGLERHLTIHNN 250 (514)
Q Consensus 238 ~~~l~~H~~~h~~ 250 (514)
...|..|+...|+
T Consensus 294 ~~el~~h~~~~h~ 306 (493)
T COG5236 294 HTELLEHLTRFHK 306 (493)
T ss_pred HHHHHHHHHHHhh
Confidence 7777777765554
No 53
>PRK04860 hypothetical protein; Provisional
Probab=95.53 E-value=0.0054 Score=51.03 Aligned_cols=40 Identities=23% Similarity=0.516 Sum_probs=35.8
Q ss_pred cccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccch
Q psy11032 419 RRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKW 462 (514)
Q Consensus 419 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 462 (514)
-+|.|. |+. ....+.+|.++|.++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 479998 987 7889999999999999999999999987654
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.53 E-value=0.0059 Score=33.18 Aligned_cols=21 Identities=19% Similarity=0.638 Sum_probs=12.4
Q ss_pred ccCCccchhcccchhHHHHhh
Q psy11032 449 HECNVCGARFAKKWNMQVHKQ 469 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~ 469 (514)
|.|.+|+..|.+..+|+.|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 455666666666666666654
No 55
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.27 E-value=0.032 Score=56.20 Aligned_cols=115 Identities=21% Similarity=0.351 Sum_probs=87.1
Q ss_pred CceeCCcCCCccCCchhhhhhcc--cccCC--cccccC--CCcCCCCCchhHHhHhhhccCCccccCCc-----------
Q psy11032 391 PIFQCHLCGRCFNSPTTLTQHKI--VHTGE--RRFKCD--ICGQALTTKPKLNDHMLLHTGEKPHECNV----------- 453 (514)
Q Consensus 391 ~~~~C~~C~~~f~~~~~l~~H~~--~H~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----------- 453 (514)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.+...+..|..+|.+..++.+..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 36899999999999999999999 89999 999999 79999999999999999987765444422
Q ss_pred --------------------------cchhcccchhHHHHhhhhcCCC--CCCCccccccccccCCccchhhhHhhHHHH
Q psy11032 454 --------------------------CGARFAKKWNMQVHKQKVHGPD--VRGTTESQLGMQVSSGGEKKDRAVTKQKFW 505 (514)
Q Consensus 454 --------------------------C~~~f~~~~~l~~H~~~~h~~~--~~~c~~c~~~~~~~~~~~~h~~~~~~~~~~ 505 (514)
|...+.....+..|...+-... .+.+..|...|.....+..|++.+....+.
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 3333334444444433333322 577888999999999999888877755443
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.09 E-value=0.011 Score=32.74 Aligned_cols=22 Identities=36% Similarity=0.730 Sum_probs=15.6
Q ss_pred eeCCcCCCccCCchhhhhhccc
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIV 414 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~ 414 (514)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777654
No 57
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.94 E-value=0.025 Score=51.56 Aligned_cols=20 Identities=20% Similarity=0.436 Sum_probs=10.4
Q ss_pred cccccccccChhHHHHHhhh
Q psy11032 200 CEICGQGYRTKTHLLVHVGM 219 (514)
Q Consensus 200 C~~C~~~f~~~~~l~~H~~~ 219 (514)
|..|...|.+-..|..|++.
T Consensus 223 C~FC~~~FYdDDEL~~HcR~ 242 (493)
T COG5236 223 CIFCKIYFYDDDELRRHCRL 242 (493)
T ss_pred hhhccceecChHHHHHHHHh
Confidence 55555555555555555544
No 58
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.89 E-value=0.019 Score=57.80 Aligned_cols=156 Identities=22% Similarity=0.397 Sum_probs=91.0
Q ss_pred CCcccccccccccCchHHHhHhh--hccCC--CCccCC--cCcccCCChHHHHHHhhhhcCCCCCCcchhhhhhcccccc
Q psy11032 299 RPFVCEICGQGYRTKTHLLVHVG--MHNGI--HPFTCH--FCNKSFTQKTGLERHLTIHNNEKKHQSGRRQKRNQQHDRQ 372 (514)
Q Consensus 299 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~ 372 (514)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+..+................
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 45778888888888888888888 68888 788888 6888888888888888877766543322111111000000
Q ss_pred cCccchhhhhhcccCCCCCceeCC--cCCCccCCchhhhhhcccccCCc--ccccCCCcCCCCCchhHHhHhhhccCCcc
Q psy11032 373 RSESDTTLAEEDNLFHSTPIFQCH--LCGRCFNSPTTLTQHKIVHTGER--RFKCDICGQALTTKPKLNDHMLLHTGEKP 448 (514)
Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~C~--~C~~~f~~~~~l~~H~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 448 (514)
.......... .........+.+. .|...+.....+..|...|...+ .+.+..|...|.....+..|++.|....+
T Consensus 368 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 368 NNEPPQSLQQ-YKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred CCCCccchhh-ccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence 0000000000 0001111122222 25566666667777777666555 45677788888877788888777776665
Q ss_pred ccCCccc
Q psy11032 449 HECNVCG 455 (514)
Q Consensus 449 ~~C~~C~ 455 (514)
+.|..++
T Consensus 447 ~~~~~~~ 453 (467)
T COG5048 447 LLCSILK 453 (467)
T ss_pred eeecccc
Confidence 5554443
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.73 E-value=0.01 Score=32.94 Aligned_cols=21 Identities=19% Similarity=0.569 Sum_probs=10.1
Q ss_pred ccCCccchhcccchhHHHHhh
Q psy11032 449 HECNVCGARFAKKWNMQVHKQ 469 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~ 469 (514)
|.|..|++.|.+..+|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 344445555555555544443
No 60
>KOG2482|consensus
Probab=93.57 E-value=0.18 Score=46.18 Aligned_cols=50 Identities=26% Similarity=0.542 Sum_probs=37.3
Q ss_pred cccccchhhcC-CchHHHHHHhhhcC---C-------------------CCccccccchhcCChhHHHHHHHH
Q psy11032 85 IKCEVCSLALL-SKSHLKQHMRVHTG---E-------------------RPHECEVCGKRFAKRSNLNAHKKK 134 (514)
Q Consensus 85 ~~C~~C~~~f~-~~~~l~~H~~~h~~---~-------------------~~~~C~~C~~~f~~~~~l~~H~~~ 134 (514)
.+|-.|+..+. +.+....|+..-|+ . ..+.|-+|.+.|..+..|+.||++
T Consensus 145 lqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 145 LQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred eEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHh
Confidence 57888887654 55667777754322 0 237899999999999999999985
No 61
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=92.60 E-value=0.089 Score=30.74 Aligned_cols=24 Identities=25% Similarity=0.976 Sum_probs=15.3
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
|+|..||+.+.... .++.||.|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 67777777654332 5667777764
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.38 E-value=0.11 Score=28.05 Aligned_cols=21 Identities=29% Similarity=0.762 Sum_probs=14.0
Q ss_pred ccCCcccccCCChhHHHHHHhh
Q psy11032 57 TRCDLCGKGFIHLKSYQIHKMI 78 (514)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~ 78 (514)
..|+.||+.| ....|..|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4677777777 56667777654
No 63
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=92.30 E-value=0.15 Score=39.78 Aligned_cols=25 Identities=36% Similarity=0.531 Sum_probs=22.9
Q ss_pred ccC----CccchhcccchhHHHHhhhhcC
Q psy11032 449 HEC----NVCGARFAKKWNMQVHKQKVHG 473 (514)
Q Consensus 449 ~~C----~~C~~~f~~~~~l~~H~~~~h~ 473 (514)
|.| ..|++.+.+...|..|.+.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 899 9999999999999999999886
No 64
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=92.03 E-value=0.074 Score=34.21 Aligned_cols=32 Identities=31% Similarity=0.563 Sum_probs=25.9
Q ss_pred hccCCccccCCccchhcccchhHHHHhhhhcC
Q psy11032 442 LHTGEKPHECNVCGARFAKKWNMQVHKQKVHG 473 (514)
Q Consensus 442 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 473 (514)
+-.|+.-++|+-||..|....+..+|..+.||
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 34566678888888888888888888888886
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.63 E-value=0.15 Score=27.54 Aligned_cols=20 Identities=30% Similarity=0.767 Sum_probs=11.8
Q ss_pred ccCCcCcccCCChHHHHHHhh
Q psy11032 329 FTCHFCNKSFTQKTGLERHLT 349 (514)
Q Consensus 329 ~~C~~C~~~f~~~~~l~~H~~ 349 (514)
..|+.|++.| ..+.|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3566666666 4556666654
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.51 E-value=0.17 Score=29.98 Aligned_cols=22 Identities=23% Similarity=0.637 Sum_probs=14.1
Q ss_pred ccccCCCcCCCCCchhHHhHhh
Q psy11032 420 RFKCDICGQALTTKPKLNDHML 441 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~ 441 (514)
+|.|.+|+..|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3566666666666666666654
No 67
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.81 E-value=0.19 Score=29.77 Aligned_cols=23 Identities=17% Similarity=0.546 Sum_probs=17.8
Q ss_pred cccCCccchhcccchhHHHHhhh
Q psy11032 448 PHECNVCGARFAKKWNMQVHKQK 470 (514)
Q Consensus 448 ~~~C~~C~~~f~~~~~l~~H~~~ 470 (514)
+|.|.+|+..|.+..++..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 47788888888888888888653
No 68
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.15 E-value=1.1 Score=34.44 Aligned_cols=25 Identities=20% Similarity=0.510 Sum_probs=14.7
Q ss_pred ceeCCcCCCccCCchhhhhhccccc
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHT 416 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~ 416 (514)
.|+|+.|...|=..-.+..|...|.
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred ceeCCCCCCccccccchhhhhhccC
Confidence 4666666666655555555555543
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=88.93 E-value=0.4 Score=37.40 Aligned_cols=24 Identities=25% Similarity=0.490 Sum_probs=14.6
Q ss_pred eeC----CcCCCccCCchhhhhhccccc
Q psy11032 393 FQC----HLCGRCFNSPTTLTQHKIVHT 416 (514)
Q Consensus 393 ~~C----~~C~~~f~~~~~l~~H~~~H~ 416 (514)
|.| ..|++.+.+...+..|++.++
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 666 666666666666666655443
No 70
>KOG4173|consensus
Probab=88.28 E-value=0.34 Score=40.95 Aligned_cols=79 Identities=24% Similarity=0.502 Sum_probs=64.2
Q ss_pred cccCCc--ccccCCChhHHHHHHhhccCCCCcccccchhhcCCchHHHHHHhh-h---------cCCCCccc--cccchh
Q psy11032 56 YTRCDL--CGKGFIHLKSYQIHKMIHSGERHIKCEVCSLALLSKSHLKQHMRV-H---------TGERPHEC--EVCGKR 121 (514)
Q Consensus 56 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~-h---------~~~~~~~C--~~C~~~ 121 (514)
-+.|++ |...|.....+..|..+-++ -.|..|.+.|.+...|..|+.. | -|...|.| .-|+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 477888 88899999999999866443 3599999999999999999753 3 24567889 459999
Q ss_pred cCChhHHHHHHHHhhh
Q psy11032 122 FAKRSNLNAHKKKVTR 137 (514)
Q Consensus 122 f~~~~~l~~H~~~~~~ 137 (514)
|.+...-..|+-..|.
T Consensus 156 FkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhHHHHhcc
Confidence 9999999999976554
No 71
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=87.83 E-value=2 Score=33.15 Aligned_cols=49 Identities=20% Similarity=0.367 Sum_probs=28.9
Q ss_pred eCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhcc
Q psy11032 394 QCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHT 444 (514)
Q Consensus 394 ~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 444 (514)
.|--|+..|........ ..-.....|+|+.|...|-..=++-.|...|.
T Consensus 57 ~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred cccCcCCCCCCcccccc--cccccccceeCCCCCCccccccchhhhhhccC
Confidence 48888888876532110 00112335778888877776666666665553
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.78 E-value=0.25 Score=31.87 Aligned_cols=28 Identities=29% Similarity=0.482 Sum_probs=22.2
Q ss_pred ccCcccccCCcccccCCChhHHHHHHhh
Q psy11032 51 NEIAKYTRCDLCGKGFIHLKSYQIHKMI 78 (514)
Q Consensus 51 ~~~~k~~~C~~C~~~f~~~~~l~~H~~~ 78 (514)
..++..++|+.|+..|....++.+|...
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhH
Confidence 3456778888898888888888888764
No 73
>KOG2893|consensus
Probab=87.34 E-value=0.36 Score=41.78 Aligned_cols=49 Identities=16% Similarity=0.157 Sum_probs=28.0
Q ss_pred cCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccchhhhHhhH
Q psy11032 450 ECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDRAVTKQ 502 (514)
Q Consensus 450 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~~~~~~ 502 (514)
.|=+|++.|.+..-|..|++..| |+|..|-+....--.|..|--.+|+|
T Consensus 12 wcwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhke 60 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHKE 60 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhhh
Confidence 45556666666666666665444 66666666666656666664444444
No 74
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=86.33 E-value=0.52 Score=27.71 Aligned_cols=25 Identities=36% Similarity=0.898 Sum_probs=15.6
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
.|+|..||+.+... +.|..|++|+.
T Consensus 2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred eEECCCCCCEeECC------------cCCCcCcCCCC
Confidence 37788887765332 14567777765
No 75
>KOG2893|consensus
Probab=84.39 E-value=0.3 Score=42.23 Aligned_cols=42 Identities=31% Similarity=0.582 Sum_probs=28.1
Q ss_pred cCCcccccCCChhHHHHHHhhccCCCCcccccchhhcCCchHHHHH
Q psy11032 58 RCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLALLSKSHLKQH 103 (514)
Q Consensus 58 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 103 (514)
.|=+|++.|.++.-|.+|.+. +-|+|.+|.+.+-+-..|..|
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence 477777777777777777654 347777777666666655555
No 76
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=83.90 E-value=0.31 Score=43.20 Aligned_cols=13 Identities=23% Similarity=0.488 Sum_probs=9.0
Q ss_pred ccCCccchhcccc
Q psy11032 449 HECNVCGARFAKK 461 (514)
Q Consensus 449 ~~C~~C~~~f~~~ 461 (514)
..|+.||++|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 5788888776643
No 77
>KOG4173|consensus
Probab=82.97 E-value=0.42 Score=40.47 Aligned_cols=79 Identities=27% Similarity=0.632 Sum_probs=57.5
Q ss_pred CceeCCc--CCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhh-c---------cCCccccC--Cccch
Q psy11032 391 PIFQCHL--CGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLL-H---------TGEKPHEC--NVCGA 456 (514)
Q Consensus 391 ~~~~C~~--C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~-h---------~~~~~~~C--~~C~~ 456 (514)
+.+.|++ |-..|........|..+-++. .|..|.+.|.+...|..|+.- | -|..-|.| ..|+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 4577876 667777777777776543333 599999999988888888742 2 34445888 44999
Q ss_pred hcccchhHHHHhhhhc
Q psy11032 457 RFAKKWNMQVHKQKVH 472 (514)
Q Consensus 457 ~f~~~~~l~~H~~~~h 472 (514)
.|.+..+-..|+-..|
T Consensus 155 KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhhHHHHhc
Confidence 9999988888977666
No 78
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.71 E-value=0.93 Score=34.98 Aligned_cols=32 Identities=25% Similarity=0.597 Sum_probs=20.5
Q ss_pred ccccCCcccccCCChhHHHHHHhhccCCCCcccccchhhcCCc
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLALLSK 97 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 97 (514)
....|+.||+.|... +..|..|+.||..|.-.
T Consensus 8 tKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 8 TKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred CcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 445677777777653 23567777777766543
No 79
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=80.58 E-value=1.1 Score=39.61 Aligned_cols=88 Identities=17% Similarity=0.377 Sum_probs=48.1
Q ss_pred hhcCCCCccccccchhcCChhHHHHHHHHhhhcCCCCCCCCCchhhhhcccccccCcCCCCCcccccccccccCCChHHH
Q psy11032 106 VHTGERPHECEVCGKRFAKRSNLNAHKKKVTRHTGRDEVPSIKPLKRKRNISFLTGAREGMPLIHECPTCGKKWRTRSEL 185 (514)
Q Consensus 106 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l 185 (514)
+.++.+.|.|++|+... -...--.|+......... .|+|..|++.
T Consensus 136 w~hGGrif~CsfC~~fl-CEDDQFEHQAsCQvLe~E---------------------------~~KC~SCNrl------- 180 (314)
T PF06524_consen 136 WDHGGRIFKCSFCDNFL-CEDDQFEHQASCQVLESE---------------------------TFKCQSCNRL------- 180 (314)
T ss_pred ccCCCeEEEeecCCCee-eccchhhhhhhhhhhhcc---------------------------cccccccccc-------
Confidence 34577888888887433 333333555433221111 2788888652
Q ss_pred HhHhhhhCCCCCcccccccccccChhHHHHHhhh----hcCCCCccCCcchhhccChhhhhh
Q psy11032 186 NKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGM----HNGIHPFTCHFCNKSFTQKTGLER 243 (514)
Q Consensus 186 ~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~----h~~~~~~~C~~C~~~f~~~~~l~~ 243 (514)
..|.|--|..-|-. .|++. ....+++.||.|++.......|-.
T Consensus 181 ----------Gq~sCLRCK~cfCd-----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSm 227 (314)
T PF06524_consen 181 ----------GQYSCLRCKICFCD-----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSM 227 (314)
T ss_pred ----------cchhhhheeeeehh-----hhhhhcccccccCCCCCCCCCCCccccccccee
Confidence 23445555544432 33322 234578899999987665555433
No 80
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=79.62 E-value=1.3 Score=37.09 Aligned_cols=24 Identities=38% Similarity=0.955 Sum_probs=19.6
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
.|.|++||+.+ .++.|-+||+|+-
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 69999999864 5678899999983
No 81
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.21 E-value=1.4 Score=39.02 Aligned_cols=43 Identities=23% Similarity=0.527 Sum_probs=30.5
Q ss_pred CceeCCcCCCccCCchhhhhhccc---cc-------CCcc-----cccCCCcCCCCCc
Q psy11032 391 PIFQCHLCGRCFNSPTTLTQHKIV---HT-------GERR-----FKCDICGQALTTK 433 (514)
Q Consensus 391 ~~~~C~~C~~~f~~~~~l~~H~~~---H~-------~~~~-----~~C~~C~~~f~~~ 433 (514)
+.+.|++|+..|....-.....+. .. +..| ..||.||++|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 358999999999988766655542 22 2233 4899999998654
No 82
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=76.95 E-value=1.8 Score=23.62 Aligned_cols=10 Identities=30% Similarity=0.776 Sum_probs=5.5
Q ss_pred ccCCCcCCCC
Q psy11032 422 KCDICGQALT 431 (514)
Q Consensus 422 ~C~~C~~~f~ 431 (514)
.|+.||+.|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 3566665553
No 83
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=76.22 E-value=1.1 Score=39.49 Aligned_cols=94 Identities=19% Similarity=0.391 Sum_probs=56.2
Q ss_pred hhcCCCCccCCcchhhccChhhhhhhhhhhcCCCccccCCcccccccccCCCccccCCcccCCCCChhHHHHHHHhhCCC
Q psy11032 219 MHNGIHPFTCHFCNKSFTQKTGLERHLTIHNNEKKHQVSGRFSFNRAREGMPLIHECPTCGKKWRTRSELNKHTKTHSDL 298 (514)
Q Consensus 219 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 298 (514)
+.+|.+.|+|.+|+ .|.-...--.|+.+..--.. ..|+|..|++.-
T Consensus 136 w~hGGrif~CsfC~-~flCEDDQFEHQAsCQvLe~-----------------E~~KC~SCNrlG---------------- 181 (314)
T PF06524_consen 136 WDHGGRIFKCSFCD-NFLCEDDQFEHQASCQVLES-----------------ETFKCQSCNRLG---------------- 181 (314)
T ss_pred ccCCCeEEEeecCC-Ceeeccchhhhhhhhhhhhc-----------------cccccccccccc----------------
Confidence 34677889999997 35555555566654331110 128888887642
Q ss_pred CCcccccccccccCchHHHhHhhh----ccCCCCccCCcCcccCCChHHHHHHhhhhc
Q psy11032 299 RPFVCEICGQGYRTKTHLLVHVGM----HNGIHPFTCHFCNKSFTQKTGLERHLTIHN 352 (514)
Q Consensus 299 ~~~~C~~C~~~f~~~~~l~~H~~~----h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 352 (514)
.|.|-.|...|-.. |++. ....+++.||.|++.......|..-.++|.
T Consensus 182 -q~sCLRCK~cfCdd-----HvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 182 -QYSCLRCKICFCDD-----HVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred -chhhhheeeeehhh-----hhhhcccccccCCCCCCCCCCCcccccccceeeeecch
Confidence 25565565555432 4432 223467888888887777666666655553
No 84
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=76.12 E-value=2.4 Score=32.77 Aligned_cols=15 Identities=27% Similarity=0.809 Sum_probs=8.7
Q ss_pred CCccCCcCcccCCCh
Q psy11032 327 HPFTCHFCNKSFTQK 341 (514)
Q Consensus 327 ~~~~C~~C~~~f~~~ 341 (514)
.|..||+||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 455666666665544
No 85
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=74.53 E-value=1.3 Score=36.76 Aligned_cols=14 Identities=36% Similarity=0.641 Sum_probs=8.8
Q ss_pred cccCCCcCCCCCch
Q psy11032 421 FKCDICGQALTTKP 434 (514)
Q Consensus 421 ~~C~~C~~~f~~~~ 434 (514)
++|+-||++|+...
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 66777776665543
No 86
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=74.29 E-value=1.6 Score=27.43 Aligned_cols=11 Identities=27% Similarity=0.860 Sum_probs=5.5
Q ss_pred cccCCCcCCCC
Q psy11032 421 FKCDICGQALT 431 (514)
Q Consensus 421 ~~C~~C~~~f~ 431 (514)
|.|..|+..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 45555555443
No 87
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=73.74 E-value=1.7 Score=27.61 Aligned_cols=13 Identities=31% Similarity=0.797 Sum_probs=8.0
Q ss_pred ceeCCcCCCccCC
Q psy11032 392 IFQCHLCGRCFNS 404 (514)
Q Consensus 392 ~~~C~~C~~~f~~ 404 (514)
.|.|+.||..|..
T Consensus 3 ~y~C~~CG~~~~~ 15 (46)
T PRK00398 3 EYKCARCGREVEL 15 (46)
T ss_pred EEECCCCCCEEEE
Confidence 4667777766543
No 88
>PF15269 zf-C2H2_7: Zinc-finger
Probab=73.33 E-value=3.4 Score=25.40 Aligned_cols=27 Identities=30% Similarity=0.482 Sum_probs=21.8
Q ss_pred CCCcccccccccccCCChHHHHhHhhh
Q psy11032 165 GMPLIHECPTCGKKWRTRSELNKHTKT 191 (514)
Q Consensus 165 ~~~~~~~C~~C~~~f~~~~~l~~H~~~ 191 (514)
+.+..|+|-.|.++....+.|-.||+-
T Consensus 16 gkp~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 16 GKPFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CCCccceeecCCcccchHHHHHHHHHH
Confidence 445568999999999999988888864
No 89
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=72.79 E-value=4 Score=27.21 Aligned_cols=9 Identities=44% Similarity=1.110 Sum_probs=5.3
Q ss_pred cccCCccch
Q psy11032 448 PHECNVCGA 456 (514)
Q Consensus 448 ~~~C~~C~~ 456 (514)
+|.|+.||+
T Consensus 48 ~Y~CP~CGF 56 (59)
T PRK14890 48 PYTCPKCGF 56 (59)
T ss_pred ceECCCCCC
Confidence 466666654
No 90
>KOG2186|consensus
Probab=72.42 E-value=2 Score=37.92 Aligned_cols=46 Identities=17% Similarity=0.320 Sum_probs=23.0
Q ss_pred ccCCccchhcccchhHHHHhhhhcCCCCCCCccccccccccCCccchhh
Q psy11032 449 HECNVCGARFAKKWNMQVHKQKVHGPDVRGTTESQLGMQVSSGGEKKDR 497 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~~h~~ 497 (514)
|.|..||... .+..+.+|+-..+| .-|.|..|+..|.. .+...|..
T Consensus 4 FtCnvCgEsv-KKp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESV-KKPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhc-cccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 4555555553 23344455555554 33555555555554 44444443
No 91
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=71.74 E-value=2.4 Score=35.42 Aligned_cols=24 Identities=42% Similarity=1.095 Sum_probs=19.6
Q ss_pred ccccCCCcCCCCCchhHHhHhhhccCCccccCCccch
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGA 456 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 456 (514)
-|.|++||+.+ -|+.|-.||.||.
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 59999999864 3678899999994
No 92
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=69.27 E-value=1.6 Score=27.95 Aligned_cols=29 Identities=24% Similarity=0.525 Sum_probs=18.9
Q ss_pred ccccCCcccccCCChhHHHHHHhhccCCCCcccccchhh
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLA 93 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 93 (514)
..|+|..|++.|... .......|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 468888888887322 13355778888754
No 93
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=69.08 E-value=3.5 Score=32.40 Aligned_cols=33 Identities=15% Similarity=0.185 Sum_probs=21.7
Q ss_pred cccccCCcccccCCChhHHHHHHhhccCCCCcccccchhhcCCc
Q psy11032 54 AKYTRCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLALLSK 97 (514)
Q Consensus 54 ~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 97 (514)
.....|+.||+.|... +..|..|+.||..|.-.
T Consensus 7 GtKr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 7 GTKRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred CccccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 3445788888887753 34577788877776544
No 94
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=69.01 E-value=1.5 Score=36.38 Aligned_cols=17 Identities=35% Similarity=0.763 Sum_probs=12.8
Q ss_pred ccCCccchhcccchhHH
Q psy11032 449 HECNVCGARFAKKWNMQ 465 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~ 465 (514)
+.|+.||++|.+...+.
T Consensus 29 ~~c~~c~~~f~~~e~~~ 45 (154)
T PRK00464 29 RECLACGKRFTTFERVE 45 (154)
T ss_pred eeccccCCcceEeEecc
Confidence 78999998888655443
No 95
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=68.28 E-value=5.3 Score=23.80 Aligned_cols=33 Identities=18% Similarity=0.563 Sum_probs=16.0
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQAL 430 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 430 (514)
..|+.|+..|.-...... ......+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip-----~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIP-----PKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCC-----CCCcEEECCCCCCEe
Confidence 356666666544443321 222345666666554
No 96
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=67.65 E-value=4.6 Score=26.85 Aligned_cols=16 Identities=19% Similarity=0.468 Sum_probs=10.5
Q ss_pred CccCCcCcccCCChHH
Q psy11032 328 PFTCHFCNKSFTQKTG 343 (514)
Q Consensus 328 ~~~C~~C~~~f~~~~~ 343 (514)
.|.|+.||..-..+..
T Consensus 27 ~F~CPnCGe~~I~Rc~ 42 (61)
T COG2888 27 KFPCPNCGEVEIYRCA 42 (61)
T ss_pred EeeCCCCCceeeehhh
Confidence 4788888866554433
No 97
>KOG2186|consensus
Probab=67.57 E-value=3.2 Score=36.76 Aligned_cols=38 Identities=26% Similarity=0.723 Sum_probs=15.7
Q ss_pred cccccccccccChhHHHHHhhhhcCCCCccCCcchhhccC
Q psy11032 198 FVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQ 237 (514)
Q Consensus 198 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 237 (514)
|.|..||-...-+ .+.+|+..-++ .-|.|--|+..|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~ 41 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER 41 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc
Confidence 3444444433322 23334444333 33444444444443
No 98
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=67.17 E-value=3.7 Score=27.71 Aligned_cols=40 Identities=23% Similarity=0.540 Sum_probs=20.9
Q ss_pred CCccccc--ccccccChhHHHHHhhhhcCCCCccCCc----chhhcc
Q psy11032 196 RPFVCEI--CGQGYRTKTHLLVHVGMHNGIHPFTCHF----CNKSFT 236 (514)
Q Consensus 196 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 236 (514)
.+..|+. |...+. +..|..|....-...+..|++ |+..+.
T Consensus 8 ~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 3456666 433333 446777777666666677777 766554
No 99
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=66.48 E-value=3.3 Score=23.90 Aligned_cols=26 Identities=23% Similarity=0.493 Sum_probs=13.1
Q ss_pred ccCCcccccCCChhHHHHHHhhccCCCCcccccchhh
Q psy11032 57 TRCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLA 93 (514)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 93 (514)
|.|..|+..+.. ....+.+|+.||..
T Consensus 1 Y~C~~Cg~~~~~-----------~~~~~irC~~CG~R 26 (32)
T PF03604_consen 1 YICGECGAEVEL-----------KPGDPIRCPECGHR 26 (32)
T ss_dssp EBESSSSSSE-B-----------STSSTSSBSSSS-S
T ss_pred CCCCcCCCeeEc-----------CCCCcEECCcCCCe
Confidence 456666665542 12345667777654
No 100
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=66.30 E-value=5.2 Score=42.33 Aligned_cols=10 Identities=30% Similarity=0.713 Sum_probs=7.5
Q ss_pred eeCCcCCCcc
Q psy11032 393 FQCHLCGRCF 402 (514)
Q Consensus 393 ~~C~~C~~~f 402 (514)
..|..||..+
T Consensus 436 l~C~~Cg~v~ 445 (730)
T COG1198 436 LLCRDCGYIA 445 (730)
T ss_pred eecccCCCcc
Confidence 5688888775
No 101
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=66.12 E-value=4.9 Score=24.11 Aligned_cols=12 Identities=33% Similarity=0.816 Sum_probs=6.1
Q ss_pred eCCcCCCccCCc
Q psy11032 394 QCHLCGRCFNSP 405 (514)
Q Consensus 394 ~C~~C~~~f~~~ 405 (514)
.|+.|+..|.-.
T Consensus 4 ~CP~C~~~f~v~ 15 (37)
T PF13719_consen 4 TCPNCQTRFRVP 15 (37)
T ss_pred ECCCCCceEEcC
Confidence 455555555433
No 102
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=65.84 E-value=4.2 Score=25.25 Aligned_cols=30 Identities=23% Similarity=0.719 Sum_probs=17.4
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
.|+|..||..|.-...+ .. ..+-.|+.|+.
T Consensus 5 ey~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 5 EYRCEECGHEFEVLQSI------SE-DDPVPCPECGS 34 (42)
T ss_pred EEEeCCCCCEEEEEEEc------CC-CCCCcCCCCCC
Confidence 37788888776433222 11 34456777775
No 103
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=65.72 E-value=3.6 Score=27.76 Aligned_cols=40 Identities=25% Similarity=0.632 Sum_probs=19.2
Q ss_pred CCccccc--chhhcCCchHHHHHHhhhcCCCCccccc----cchhcC
Q psy11032 83 RHIKCEV--CSLALLSKSHLKQHMRVHTGERPHECEV----CGKRFA 123 (514)
Q Consensus 83 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 123 (514)
.+..|+. |...+. ...|..|+...-..++-.|++ |+..+.
T Consensus 8 ~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 3455666 333333 445666666555555666666 666554
No 104
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=64.66 E-value=5.7 Score=33.31 Aligned_cols=32 Identities=19% Similarity=0.347 Sum_probs=19.9
Q ss_pred CCCCcccccccccccChhHHHHHhhhhcCCCCccCCcchhh
Q psy11032 194 DLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKS 234 (514)
Q Consensus 194 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 234 (514)
+..-|.|+.|+..|+.-.++. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 344567777777666666553 24777777654
No 105
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=64.56 E-value=3.7 Score=26.83 Aligned_cols=30 Identities=23% Similarity=0.699 Sum_probs=17.3
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
.|+|..|+..|.....+.. +..-.|+.|+.
T Consensus 5 ey~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (52)
T TIGR02605 5 EYRCTACGHRFEVLQKMSD-------DPLATCPECGG 34 (52)
T ss_pred EEEeCCCCCEeEEEEecCC-------CCCCCCCCCCC
Confidence 3778888877754322111 33456777775
No 106
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=64.54 E-value=5.1 Score=24.11 Aligned_cols=13 Identities=31% Similarity=0.664 Sum_probs=6.6
Q ss_pred eeCCcCCCccCCc
Q psy11032 393 FQCHLCGRCFNSP 405 (514)
Q Consensus 393 ~~C~~C~~~f~~~ 405 (514)
+.|+.|+..|.-.
T Consensus 3 ~~CP~C~~~~~v~ 15 (38)
T TIGR02098 3 IQCPNCKTSFRVV 15 (38)
T ss_pred EECCCCCCEEEeC
Confidence 4555555555433
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=64.49 E-value=5.8 Score=32.82 Aligned_cols=36 Identities=19% Similarity=0.543 Sum_probs=22.8
Q ss_pred CCceeCCcCCCccCCchhhhh-hcccccCCcccccCCCcCCC
Q psy11032 390 TPIFQCHLCGRCFNSPTTLTQ-HKIVHTGERRFKCDICGQAL 430 (514)
Q Consensus 390 ~~~~~C~~C~~~f~~~~~l~~-H~~~H~~~~~~~C~~C~~~f 430 (514)
...|.|+.|+..|.....+.. . . ...|.||.|+...
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea~~~~d----~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEANQLLD----M-DGTFTCPRCGEEL 133 (147)
T ss_pred CcEEECcCCCCEeeHHHHHHhcC----C-CCcEECCCCCCEE
Confidence 335888888888876554433 2 1 2338888887653
No 108
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=63.54 E-value=4.5 Score=33.46 Aligned_cols=34 Identities=24% Similarity=0.541 Sum_probs=18.4
Q ss_pred cccccccccCCChHHHHhHhhhhCCCCCcccccccccc
Q psy11032 170 HECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGY 207 (514)
Q Consensus 170 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 207 (514)
|.|+.|+..|.....+.. . .. ...|.|+.||...
T Consensus 100 Y~Cp~C~~~y~~~ea~~~-~--d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 100 YKCPNCQSKYTFLEANQL-L--DM-DGTFTCPRCGEEL 133 (147)
T ss_pred EECcCCCCEeeHHHHHHh-c--CC-CCcEECCCCCCEE
Confidence 677777776665443322 0 01 2336777776654
No 109
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=63.35 E-value=2.7 Score=25.76 Aligned_cols=11 Identities=36% Similarity=0.963 Sum_probs=5.4
Q ss_pred ccCCccchhcc
Q psy11032 449 HECNVCGARFA 459 (514)
Q Consensus 449 ~~C~~C~~~f~ 459 (514)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 44555555444
No 110
>PHA00626 hypothetical protein
Probab=63.13 E-value=4.9 Score=26.27 Aligned_cols=12 Identities=25% Similarity=0.412 Sum_probs=6.2
Q ss_pred cccCCccchhcc
Q psy11032 448 PHECNVCGARFA 459 (514)
Q Consensus 448 ~~~C~~C~~~f~ 459 (514)
.|+|+.||+.|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 355555555553
No 111
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=63.04 E-value=3.9 Score=35.04 Aligned_cols=16 Identities=19% Similarity=0.438 Sum_probs=7.4
Q ss_pred CcccccccccccChhH
Q psy11032 197 PFVCEICGQGYRTKTH 212 (514)
Q Consensus 197 ~~~C~~C~~~f~~~~~ 212 (514)
-|.|+.|+..|+.-.+
T Consensus 117 ~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPNCHIRFTFDEA 132 (178)
T ss_pred EEECCCCCcEEeHHHH
Confidence 3445555544444433
No 112
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=62.47 E-value=7.5 Score=30.80 Aligned_cols=23 Identities=17% Similarity=0.587 Sum_probs=16.7
Q ss_pred cCCCCCceeCCcCCCccCCchhh
Q psy11032 386 LFHSTPIFQCHLCGRCFNSPTTL 408 (514)
Q Consensus 386 ~~~~~~~~~C~~C~~~f~~~~~l 408 (514)
++...+.|.|.+|..+...+.-|
T Consensus 74 vF~d~~lYeCnIC~etS~ee~FL 96 (140)
T PF05290_consen 74 VFLDPKLYECNICKETSAEERFL 96 (140)
T ss_pred eecCCCceeccCcccccchhhcC
Confidence 34445789999999987666544
No 113
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=59.98 E-value=8.3 Score=32.33 Aligned_cols=34 Identities=18% Similarity=0.361 Sum_probs=23.6
Q ss_pred CCCCCcccccccccccCchHHHhHhhhccCCCCccCCcCcccC
Q psy11032 296 SDLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSF 338 (514)
Q Consensus 296 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 338 (514)
.+..-|.|+.|+..|+...++. ..|.||.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 3445588888888887776664 248888888753
No 114
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=59.42 E-value=8.2 Score=30.40 Aligned_cols=12 Identities=33% Similarity=0.797 Sum_probs=6.6
Q ss_pred ccCCcccCCCCC
Q psy11032 273 HECPTCGKKWRT 284 (514)
Q Consensus 273 ~~C~~C~~~f~~ 284 (514)
..|+.||.+|..
T Consensus 10 r~Cp~cg~kFYD 21 (129)
T TIGR02300 10 RICPNTGSKFYD 21 (129)
T ss_pred ccCCCcCccccc
Confidence 456666655543
No 115
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=59.41 E-value=6 Score=24.83 Aligned_cols=22 Identities=27% Similarity=0.801 Sum_probs=9.1
Q ss_pred cccccCCCcCCCCC----chhHHhHh
Q psy11032 419 RRFKCDICGQALTT----KPKLNDHM 440 (514)
Q Consensus 419 ~~~~C~~C~~~f~~----~~~l~~H~ 440 (514)
...+|.+|++.+.. .+.|.+|+
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 34556666555443 24444444
No 116
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=59.37 E-value=3.2 Score=31.97 Aligned_cols=39 Identities=23% Similarity=0.428 Sum_probs=19.3
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALT 431 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 431 (514)
..|+.||..-.....++.-...-....-|.|..|+..+.
T Consensus 63 ~~Cp~Cg~~~a~f~~~Q~RsadE~~T~fy~C~~C~~~w~ 101 (104)
T TIGR01384 63 VECPKCGHKEAYYWLLQTRRADEPETRFYKCTKCGYVWR 101 (104)
T ss_pred CCCCCCCCCeeEEEEeccCCCCCCcEEEEEeCCCCCeeE
Confidence 688888864332222221111111123478888877653
No 117
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=58.81 E-value=9.1 Score=20.86 Aligned_cols=20 Identities=20% Similarity=0.544 Sum_probs=14.7
Q ss_pred ccCCcCcccCCChHHHHHHhh
Q psy11032 329 FTCHFCNKSFTQKTGLERHLT 349 (514)
Q Consensus 329 ~~C~~C~~~f~~~~~l~~H~~ 349 (514)
..||+|++.+ ....+..|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3688888887 5567777765
No 118
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=58.23 E-value=2.5 Score=29.83 Aligned_cols=17 Identities=29% Similarity=0.632 Sum_probs=8.5
Q ss_pred ccccCC--ccchhcccchh
Q psy11032 447 KPHECN--VCGARFAKKWN 463 (514)
Q Consensus 447 ~~~~C~--~C~~~f~~~~~ 463 (514)
.-+.|. .||.+|....+
T Consensus 26 ~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 26 RYHQCQNVNCSATFITYES 44 (72)
T ss_pred eeeecCCCCCCCEEEEEEE
Confidence 335554 55555554433
No 119
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=58.02 E-value=7.7 Score=33.26 Aligned_cols=33 Identities=21% Similarity=0.620 Sum_probs=26.6
Q ss_pred CcccccccccccCCChHHHHhHhhhhCCCCCccccccccccc
Q psy11032 167 PLIHECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYR 208 (514)
Q Consensus 167 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 208 (514)
+..|.|+.|+..|+...++. ..|.|+.||....
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 34599999999999888763 3699999998654
No 120
>COG1773 Rubredoxin [Energy production and conversion]
Probab=56.16 E-value=8.4 Score=25.40 Aligned_cols=12 Identities=33% Similarity=0.905 Sum_probs=10.3
Q ss_pred CccccccchhcC
Q psy11032 112 PHECEVCGKRFA 123 (514)
Q Consensus 112 ~~~C~~C~~~f~ 123 (514)
.|+|..|++.|.
T Consensus 3 ~~~C~~CG~vYd 14 (55)
T COG1773 3 RWRCSVCGYVYD 14 (55)
T ss_pred ceEecCCceEec
Confidence 589999999885
No 121
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=55.02 E-value=16 Score=34.12 Aligned_cols=28 Identities=32% Similarity=0.481 Sum_probs=20.0
Q ss_pred hCCCCCccccccc-ccccChhHHHHHhhh
Q psy11032 192 HSDLRPFVCEICG-QGYRTKTHLLVHVGM 219 (514)
Q Consensus 192 h~~~~~~~C~~C~-~~f~~~~~l~~H~~~ 219 (514)
|--.+.|.|.+|| +++..+..+.+|...
T Consensus 369 hgLd~ef~CEICgNyvy~GR~~FdrHF~E 397 (470)
T COG5188 369 HGLDIEFECEICGNYVYYGRDRFDRHFEE 397 (470)
T ss_pred cCCCcceeeeecccccccchHHHHhhhhh
Confidence 3335568888887 777788888887643
No 122
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=52.92 E-value=13 Score=23.74 Aligned_cols=14 Identities=29% Similarity=1.037 Sum_probs=8.8
Q ss_pred eeCCcCCCccCCch
Q psy11032 393 FQCHLCGRCFNSPT 406 (514)
Q Consensus 393 ~~C~~C~~~f~~~~ 406 (514)
|+|..|++.+....
T Consensus 2 y~C~~CgyvYd~~~ 15 (47)
T PF00301_consen 2 YQCPVCGYVYDPEK 15 (47)
T ss_dssp EEETTTSBEEETTT
T ss_pred cCCCCCCEEEcCCc
Confidence 66777776665443
No 123
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=52.35 E-value=10 Score=34.65 Aligned_cols=23 Identities=17% Similarity=0.472 Sum_probs=14.8
Q ss_pred ccCCcccCCCCChhHHHHHHHhh
Q psy11032 273 HECPTCGKKWRTRSELNKHTKTH 295 (514)
Q Consensus 273 ~~C~~C~~~f~~~~~l~~H~~~h 295 (514)
|+|+.|...|-..-+.-.|...|
T Consensus 389 Y~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 389 YQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred eechhhhhhhhhhhHHHHHHHHh
Confidence 67777777776666666665543
No 124
>PF14353 CpXC: CpXC protein
Probab=50.99 E-value=2.5 Score=34.03 Aligned_cols=48 Identities=21% Similarity=0.331 Sum_probs=24.5
Q ss_pred eeCCcCCCccCCchhhhhhcccc---------cCCcccccCCCcCCCCCchhHHhHh
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVH---------TGERRFKCDICGQALTTKPKLNDHM 440 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H---------~~~~~~~C~~C~~~f~~~~~l~~H~ 440 (514)
..|+.|+..|............. ..--.|.||.||..|.-...+.-|-
T Consensus 2 itCP~C~~~~~~~v~~~I~~~~~p~l~e~il~g~l~~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 2 ITCPHCGHEFEFEVWTSINADEDPELKEKILDGSLFSFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred cCCCCCCCeeEEEEEeEEcCcCCHHHHHHHHcCCcCEEECCCCCCceecCCCEEEEc
Confidence 35777777765443222211111 1113467777777776555555443
No 125
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=49.98 E-value=13 Score=33.92 Aligned_cols=18 Identities=28% Similarity=0.685 Sum_probs=9.8
Q ss_pred CccCCcCcccCCChHHHH
Q psy11032 328 PFTCHFCNKSFTQKTGLE 345 (514)
Q Consensus 328 ~~~C~~C~~~f~~~~~l~ 345 (514)
|..||.|.-.......|.
T Consensus 322 Pi~CP~Csl~LilsthLa 339 (421)
T COG5151 322 PISCPICSLQLILSTHLA 339 (421)
T ss_pred CccCcchhHHHHHHHHHH
Confidence 566777765544433333
No 126
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=49.31 E-value=9.4 Score=27.75 Aligned_cols=30 Identities=30% Similarity=0.708 Sum_probs=16.3
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALT 431 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~ 431 (514)
.|.|+.|++.- +.+. +..-|.|..|++.|.
T Consensus 35 ~~~Cp~C~~~~--------VkR~--a~GIW~C~kCg~~fA 64 (89)
T COG1997 35 KHVCPFCGRTT--------VKRI--ATGIWKCRKCGAKFA 64 (89)
T ss_pred CCcCCCCCCcc--------eeee--ccCeEEcCCCCCeec
Confidence 47777777642 1111 223467777776664
No 127
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=48.69 E-value=11 Score=24.31 Aligned_cols=11 Identities=27% Similarity=0.975 Sum_probs=5.6
Q ss_pred eeCCcCCCccC
Q psy11032 393 FQCHLCGRCFN 403 (514)
Q Consensus 393 ~~C~~C~~~f~ 403 (514)
-.|..|++.+.
T Consensus 19 a~C~~C~~~l~ 29 (50)
T smart00614 19 AKCKYCGKKLS 29 (50)
T ss_pred EEecCCCCEee
Confidence 34555555543
No 128
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=47.86 E-value=9.4 Score=30.68 Aligned_cols=24 Identities=29% Similarity=0.624 Sum_probs=14.2
Q ss_pred ccCCccchhcccchhHHHHhhhhcCCC
Q psy11032 449 HECNVCGARFAKKWNMQVHKQKVHGPD 475 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~~~h~~~ 475 (514)
..|-.||+.|.. |.+|++.|||-.
T Consensus 73 i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eEEccCCcccch---HHHHHHHccCCC
Confidence 678888888754 478888888754
No 129
>KOG2807|consensus
Probab=47.51 E-value=29 Score=32.27 Aligned_cols=22 Identities=23% Similarity=0.529 Sum_probs=16.2
Q ss_pred CCccccccchhcCChhHHHHHH
Q psy11032 111 RPHECEVCGKRFAKRSNLNAHK 132 (514)
Q Consensus 111 ~~~~C~~C~~~f~~~~~l~~H~ 132 (514)
-|-.|+.|+.+..+...|.+-.
T Consensus 289 LP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 289 LPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred CCccCCccceeEecchHHHHHH
Confidence 4678888988888777776543
No 130
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=47.31 E-value=9 Score=33.75 Aligned_cols=26 Identities=23% Similarity=0.395 Sum_probs=15.3
Q ss_pred ccCCccchhcccchhHHHHhhhhcCC
Q psy11032 449 HECNVCGARFAKKWNMQVHKQKVHGP 474 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~~~h~~ 474 (514)
|.|++|++.|.-..-+.+|+.+.|.+
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred ECCCCCCcccCChHHHHHHHhhcCHH
Confidence 66666666666666666776666643
No 131
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=46.82 E-value=12 Score=24.23 Aligned_cols=11 Identities=36% Similarity=1.053 Sum_probs=9.1
Q ss_pred ccccccchhcC
Q psy11032 113 HECEVCGKRFA 123 (514)
Q Consensus 113 ~~C~~C~~~f~ 123 (514)
|.|..|++.+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 78999998875
No 132
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=46.76 E-value=4 Score=31.92 Aligned_cols=39 Identities=28% Similarity=0.675 Sum_probs=18.6
Q ss_pred ccCCcccCCCCChhHHHHHHHhhCCCCCccccccccccc
Q psy11032 273 HECPTCGKKWRTRSELNKHTKTHSDLRPFVCEICGQGYR 311 (514)
Q Consensus 273 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 311 (514)
..|+.|+..=.....|+.-..-.....-|.|..||+.|+
T Consensus 73 ~~CpkCg~~ea~y~~~QtRsaDEp~T~Fy~C~~Cg~~wr 111 (113)
T COG1594 73 EKCPKCGNKEAYYWQLQTRSADEPETRFYKCTRCGYRWR 111 (113)
T ss_pred ccCCCCCCceeEEEeeehhccCCCceEEEEecccCCEee
Confidence 568888754322222221111111112377888887764
No 133
>KOG2807|consensus
Probab=46.36 E-value=27 Score=32.40 Aligned_cols=34 Identities=21% Similarity=0.550 Sum_probs=21.4
Q ss_pred CcccccccccccCchHHHhHhhhccCCCCccCCcCcccCCChHHHHHH
Q psy11032 300 PFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNKSFTQKTGLERH 347 (514)
Q Consensus 300 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 347 (514)
-|.|+.|+...- .-|..|++|+-...+...|.+-
T Consensus 276 Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARS 309 (378)
T KOG2807|consen 276 GYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARS 309 (378)
T ss_pred ceeCCcccCeee--------------cCCccCCccceeEecchHHHHH
Confidence 377777754332 3456788888777766666554
No 134
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=44.88 E-value=14 Score=27.02 Aligned_cols=27 Identities=22% Similarity=0.536 Sum_probs=17.2
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
|-+|..||+.|.+. .-.+|-.||.|..
T Consensus 58 Pa~CkkCGfef~~~----------~ik~pSRCP~CKS 84 (97)
T COG3357 58 PARCKKCGFEFRDD----------KIKKPSRCPKCKS 84 (97)
T ss_pred ChhhcccCcccccc----------ccCCcccCCcchh
Confidence 56788888887652 1224667887753
No 135
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=44.40 E-value=14 Score=19.47 Aligned_cols=6 Identities=50% Similarity=1.448 Sum_probs=2.9
Q ss_pred cCCCcC
Q psy11032 423 CDICGQ 428 (514)
Q Consensus 423 C~~C~~ 428 (514)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555543
No 136
>KOG4377|consensus
Probab=42.76 E-value=21 Score=34.20 Aligned_cols=25 Identities=32% Similarity=0.713 Sum_probs=19.3
Q ss_pred ccC--CcCcccCCChHHHHHHhhhhcC
Q psy11032 329 FTC--HFCNKSFTQKTGLERHLTIHNN 353 (514)
Q Consensus 329 ~~C--~~C~~~f~~~~~l~~H~~~h~~ 353 (514)
|.| ++|++.+..+..+.+|...|..
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkk 298 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKK 298 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhh
Confidence 455 4688888888999999888754
No 137
>KOG2461|consensus
Probab=41.82 E-value=37 Score=33.29 Aligned_cols=55 Identities=11% Similarity=0.067 Sum_probs=33.4
Q ss_pred CCCChhHHHHHHHhhCCCCCcccccccccccCchHHHhHhhhccCCCCccCCcCc
Q psy11032 281 KWRTRSELNKHTKTHSDLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCN 335 (514)
Q Consensus 281 ~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 335 (514)
.+.+......+...........+..++..+.....+..+.....+.++..+..+.
T Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (396)
T KOG2461|consen 191 SILTSAMSSNSAEVLDSDSHNLADEESVSLKEQGQLLEPVEVASGPQPVKAGTGH 245 (396)
T ss_pred ccccccCCCcchhhcccccCCccccccccccccccccccccccCcccccccCCCC
Confidence 3333333334444455556667777777777777777777777777666555443
No 138
>KOG2593|consensus
Probab=41.82 E-value=21 Score=34.67 Aligned_cols=38 Identities=24% Similarity=0.625 Sum_probs=24.4
Q ss_pred CCCCCcccccccccccChhHHHHHhhhhcCCCCccCCcchh
Q psy11032 193 SDLRPFVCEICGQGYRTKTHLLVHVGMHNGIHPFTCHFCNK 233 (514)
Q Consensus 193 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 233 (514)
.....|.|+.|.+.|+....++. .-.....|.|..|+-
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGG 161 (436)
T ss_pred cccccccCCccccchhhhHHHHh---hcccCceEEEecCCC
Confidence 34556888888888877655532 222344688888874
No 139
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=41.79 E-value=8.6 Score=21.83 Aligned_cols=7 Identities=43% Similarity=1.094 Sum_probs=2.8
Q ss_pred ccCCccc
Q psy11032 449 HECNVCG 455 (514)
Q Consensus 449 ~~C~~C~ 455 (514)
|.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 3444443
No 140
>PRK04023 DNA polymerase II large subunit; Validated
Probab=40.14 E-value=27 Score=38.04 Aligned_cols=22 Identities=27% Similarity=0.658 Sum_probs=15.8
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQA 429 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~ 429 (514)
...|+.||... -.+.|+.||..
T Consensus 626 ~RfCpsCG~~t----------------~~frCP~CG~~ 647 (1121)
T PRK04023 626 RRKCPSCGKET----------------FYRRCPFCGTH 647 (1121)
T ss_pred CccCCCCCCcC----------------CcccCCCCCCC
Confidence 47899999873 23678888765
No 141
>KOG4377|consensus
Probab=40.07 E-value=21 Score=34.17 Aligned_cols=65 Identities=22% Similarity=0.480 Sum_probs=42.1
Q ss_pred ccC--CcccCCCCChhHHHHHHHhhCCCC------------Cccc--ccccccccCchHHHhHhhhccCCC-------Cc
Q psy11032 273 HEC--PTCGKKWRTRSELNKHTKTHSDLR------------PFVC--EICGQGYRTKTHLLVHVGMHNGIH-------PF 329 (514)
Q Consensus 273 ~~C--~~C~~~f~~~~~l~~H~~~h~~~~------------~~~C--~~C~~~f~~~~~l~~H~~~h~~~~-------~~ 329 (514)
|.| +.|+..+-.+.++..|..+|...+ .|.| .+|.+ +.++...|..-|+... -|
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrthf 348 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHF 348 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceeccee
Confidence 555 458888888899999998875432 2667 46877 4455666776664322 27
Q ss_pred cCCcCcccCCC
Q psy11032 330 TCHFCNKSFTQ 340 (514)
Q Consensus 330 ~C~~C~~~f~~ 340 (514)
.|..|+-.+..
T Consensus 349 hC~r~gCTdtf 359 (480)
T KOG4377|consen 349 HCQRIGCTDTF 359 (480)
T ss_pred EEeccCCcccc
Confidence 78877743333
No 142
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=38.54 E-value=22 Score=22.51 Aligned_cols=12 Identities=42% Similarity=0.894 Sum_probs=6.8
Q ss_pred CCCccccccccc
Q psy11032 195 LRPFVCEICGQG 206 (514)
Q Consensus 195 ~~~~~C~~C~~~ 206 (514)
...+.|+.||..
T Consensus 16 ~~g~~CP~Cg~~ 27 (46)
T PF12760_consen 16 PDGFVCPHCGST 27 (46)
T ss_pred CCCCCCCCCCCe
Confidence 334667777653
No 143
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=38.48 E-value=20 Score=28.59 Aligned_cols=14 Identities=43% Similarity=1.018 Sum_probs=7.8
Q ss_pred eeCCcCCCccCCch
Q psy11032 393 FQCHLCGRCFNSPT 406 (514)
Q Consensus 393 ~~C~~C~~~f~~~~ 406 (514)
++|..||+.|.+.+
T Consensus 2 H~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 2 HQCTKCGRVFEDGS 15 (131)
T ss_pred cccCcCCCCcCCCc
Confidence 45666666665443
No 144
>KOG1280|consensus
Probab=37.92 E-value=30 Score=32.43 Aligned_cols=23 Identities=30% Similarity=0.610 Sum_probs=12.7
Q ss_pred ccccCCCcCCCCCchhHHhHhhh
Q psy11032 420 RFKCDICGQALTTKPKLNDHMLL 442 (514)
Q Consensus 420 ~~~C~~C~~~f~~~~~l~~H~~~ 442 (514)
.|.|++|+..-.+...|..|+..
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s 101 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLS 101 (381)
T ss_pred cccCCcccccccchhHHHHHhhh
Confidence 45555555555555555555543
No 145
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=37.53 E-value=16 Score=29.41 Aligned_cols=25 Identities=36% Similarity=0.602 Sum_probs=14.1
Q ss_pred ccccCCcccccCCChhHHHHHHhhccCC
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMIHSGE 82 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~h~~~ 82 (514)
....|-+||+.|.. |.+|++.|++-
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S
T ss_pred CeeEEccCCcccch---HHHHHHHccCC
Confidence 34678888888775 47788777654
No 146
>KOG2071|consensus
Probab=37.32 E-value=25 Score=35.76 Aligned_cols=26 Identities=27% Similarity=0.535 Sum_probs=18.5
Q ss_pred CCceeCCcCCCccCCchhhhhhcccc
Q psy11032 390 TPIFQCHLCGRCFNSPTTLTQHKIVH 415 (514)
Q Consensus 390 ~~~~~C~~C~~~f~~~~~l~~H~~~H 415 (514)
..+-+|..||..|........|+..|
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIH 441 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhh
Confidence 34567888888888777777666655
No 147
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=35.79 E-value=30 Score=35.48 Aligned_cols=49 Identities=27% Similarity=0.697 Sum_probs=27.7
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchh
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGAR 457 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 457 (514)
..|..||....-... ...+..|.......|.+||+... .|..|+.|+..
T Consensus 214 ~~C~~Cg~~~~C~~C-~~~l~~h~~~~~l~Ch~Cg~~~~---------------~~~~Cp~C~s~ 262 (505)
T TIGR00595 214 LLCRSCGYILCCPNC-DVSLTYHKKEGKLRCHYCGYQEP---------------IPKTCPQCGSE 262 (505)
T ss_pred eEhhhCcCccCCCCC-CCceEEecCCCeEEcCCCcCcCC---------------CCCCCCCCCCC
Confidence 467777766543321 12233444555677888875532 35678888753
No 148
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=35.79 E-value=24 Score=28.71 Aligned_cols=14 Identities=21% Similarity=0.783 Sum_probs=11.5
Q ss_pred ceeCCcCCCccCCc
Q psy11032 392 IFQCHLCGRCFNSP 405 (514)
Q Consensus 392 ~~~C~~C~~~f~~~ 405 (514)
.+.|..||..|...
T Consensus 70 ~~~C~~CG~~~~~~ 83 (135)
T PRK03824 70 VLKCRNCGNEWSLK 83 (135)
T ss_pred EEECCCCCCEEecc
Confidence 58999999888664
No 149
>PF09332 Mcm10: Mcm10 replication factor; InterPro: IPR015411 Mcm10 is a eukaryotic DNA replication factor that regulates the stability and chromatin association of DNA polymerase alpha []. ; PDB: 2KWQ_A.
Probab=34.52 E-value=8.9 Score=36.47 Aligned_cols=56 Identities=27% Similarity=0.573 Sum_probs=24.9
Q ss_pred ceeCCcCCCccCCchhh---hhh-cccccC-CcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchh
Q psy11032 392 IFQCHLCGRCFNSPTTL---TQH-KIVHTG-ERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGAR 457 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l---~~H-~~~H~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 457 (514)
.+.|..|.++.-..... ..| +..|.+ .+-|+|.-|++.+.+...| ..-.|..||..
T Consensus 252 av~C~~C~yt~~~~~~~C~~~~H~l~~~~a~KRFFkC~~C~~Rt~sl~r~----------P~~~C~~Cg~~ 312 (344)
T PF09332_consen 252 AVTCKQCKYTAFKPSDRCKEEGHPLKWHDAVKRFFKCKDCGNRTISLERL----------PKKHCSNCGSS 312 (344)
T ss_dssp EEEETTT--EESS--HHHHHTT--EEEEEEE-EEEE-T-TS-EEEESSSS------------S--TTT-S-
T ss_pred EEEcCCCCCcccCcchhHHhcCCceEEeeeeeeeEECCCCCCeeeecccC----------CCCCCCcCCcC
Confidence 58899998876655544 345 334444 3457899998875533211 11368888854
No 150
>KOG2071|consensus
Probab=34.23 E-value=46 Score=33.92 Aligned_cols=24 Identities=29% Similarity=0.695 Sum_probs=13.1
Q ss_pred CccCCcCcccCCChHHHHHHhhhh
Q psy11032 328 PFTCHFCNKSFTQKTGLERHLTIH 351 (514)
Q Consensus 328 ~~~C~~C~~~f~~~~~l~~H~~~h 351 (514)
|-+|..||..|........||..|
T Consensus 418 pnqC~~CG~R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 418 PNQCKSCGLRFDDSEERSKHMDIH 441 (579)
T ss_pred cchhcccccccccchhhhhHhhhh
Confidence 455666666666555555554444
No 151
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.22 E-value=23 Score=26.94 Aligned_cols=27 Identities=19% Similarity=0.110 Sum_probs=19.6
Q ss_pred ccCCcccccCCChhHHHHHHhhccCCCCcccccchhhc
Q psy11032 57 TRCDLCGKGFIHLKSYQIHKMIHSGERHIKCEVCSLAL 94 (514)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 94 (514)
-.|+.|++.|... +..|..|++|++.|
T Consensus 10 ridPetg~KFYDL-----------NrdPiVsPytG~s~ 36 (129)
T COG4530 10 RIDPETGKKFYDL-----------NRDPIVSPYTGKSY 36 (129)
T ss_pred ccCccccchhhcc-----------CCCccccCcccccc
Confidence 4678888887652 45677888888777
No 152
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=33.99 E-value=40 Score=37.85 Aligned_cols=56 Identities=21% Similarity=0.390 Sum_probs=32.9
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchhcccchhHHHHhhhhc
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGARFAKKWNMQVHKQKVH 472 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h 472 (514)
++|+.||..... ..|+.||... ..+|.|+.||....... +
T Consensus 668 rkCPkCG~~t~~----------------~fCP~CGs~t---------------e~vy~CPsCGaev~~de---------s 707 (1337)
T PRK14714 668 RRCPSCGTETYE----------------NRCPDCGTHT---------------EPVYVCPDCGAEVPPDE---------S 707 (1337)
T ss_pred EECCCCCCcccc----------------ccCcccCCcC---------------CCceeCccCCCccCCCc---------c
Confidence 789999974211 2588887553 22467888887643211 1
Q ss_pred CCCCCCCccccccccccC
Q psy11032 473 GPDVRGTTESQLGMQVSS 490 (514)
Q Consensus 473 ~~~~~~c~~c~~~~~~~~ 490 (514)
+ ...|++|+.......
T Consensus 708 ~--a~~CP~CGtplv~~~ 723 (1337)
T PRK14714 708 G--RVECPRCDVELTPYQ 723 (1337)
T ss_pred c--cccCCCCCCcccccc
Confidence 2 346777776544433
No 153
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=33.56 E-value=18 Score=21.57 Aligned_cols=30 Identities=20% Similarity=0.663 Sum_probs=14.2
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDIC 426 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C 426 (514)
..|+.|+.. ..+.+|-+...|...|.|..|
T Consensus 6 v~CP~C~s~----~~v~k~G~~~~G~qryrC~~C 35 (36)
T PF03811_consen 6 VHCPRCQST----EGVKKNGKSPSGHQRYRCKDC 35 (36)
T ss_pred eeCCCCCCC----CcceeCCCCCCCCEeEecCcC
Confidence 345555432 224444444445555555555
No 154
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=32.79 E-value=92 Score=25.85 Aligned_cols=19 Identities=26% Similarity=0.583 Sum_probs=9.2
Q ss_pred CCCCccccccchhcCChhHHHH
Q psy11032 109 GERPHECEVCGKRFAKRSNLNA 130 (514)
Q Consensus 109 ~~~~~~C~~C~~~f~~~~~l~~ 130 (514)
|-+||.| +-.+....-|.+
T Consensus 26 gcRpymc---~Ts~rhSNCLdq 44 (162)
T PF07800_consen 26 GCRPYMC---DTSYRHSNCLDQ 44 (162)
T ss_pred Ccccccc---CCccchhHHHHH
Confidence 4466654 344444444443
No 155
>KOG2907|consensus
Probab=32.38 E-value=29 Score=26.65 Aligned_cols=40 Identities=23% Similarity=0.331 Sum_probs=21.1
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTT 432 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~ 432 (514)
.+|+.||..=.....|+.-...-..---|.|+.|++.|+.
T Consensus 75 ~kCpkCghe~m~Y~T~QlRSADEGQTVFYTC~kC~~k~~e 114 (116)
T KOG2907|consen 75 HKCPKCGHEEMSYHTLQLRSADEGQTVFYTCPKCKYKFTE 114 (116)
T ss_pred ccCcccCCchhhhhhhhcccccCCceEEEEcCccceeeec
Confidence 6899998654333333221111111123788888887764
No 156
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=31.78 E-value=25 Score=21.52 Aligned_cols=20 Identities=15% Similarity=0.398 Sum_probs=11.3
Q ss_pred CccccCCccchhcccchhHH
Q psy11032 446 EKPHECNVCGARFAKKWNMQ 465 (514)
Q Consensus 446 ~~~~~C~~C~~~f~~~~~l~ 465 (514)
...+.|+.|+-.+-+...|.
T Consensus 17 ~~id~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 17 VEIDVCPSCGGIWFDAGELE 36 (41)
T ss_pred EEEEECCCCCeEEccHHHHH
Confidence 34456666666555555544
No 157
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=31.67 E-value=23 Score=18.92 Aligned_cols=8 Identities=50% Similarity=1.236 Sum_probs=4.1
Q ss_pred ceeCCcCC
Q psy11032 392 IFQCHLCG 399 (514)
Q Consensus 392 ~~~C~~C~ 399 (514)
+|.|+.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 35555554
No 158
>PF12907 zf-met2: Zinc-binding
Probab=30.46 E-value=34 Score=20.99 Aligned_cols=26 Identities=19% Similarity=0.478 Sum_probs=14.4
Q ss_pred ccCCccchhcc---cchhHHHHhhhhcCC
Q psy11032 449 HECNVCGARFA---KKWNMQVHKQKVHGP 474 (514)
Q Consensus 449 ~~C~~C~~~f~---~~~~l~~H~~~~h~~ 474 (514)
+.|.+|..+|. +...|..|....|..
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK 30 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPK 30 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCC
Confidence 45666664443 345566666655543
No 159
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=30.21 E-value=23 Score=21.65 Aligned_cols=8 Identities=38% Similarity=1.335 Sum_probs=3.6
Q ss_pred cccCCCcC
Q psy11032 421 FKCDICGQ 428 (514)
Q Consensus 421 ~~C~~C~~ 428 (514)
|.|..|+.
T Consensus 29 y~C~~C~~ 36 (40)
T smart00440 29 YVCTKCGH 36 (40)
T ss_pred EEeCCCCC
Confidence 44444443
No 160
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=29.60 E-value=18 Score=23.91 Aligned_cols=9 Identities=44% Similarity=1.224 Sum_probs=4.3
Q ss_pred cccCCCcCC
Q psy11032 421 FKCDICGQA 429 (514)
Q Consensus 421 ~~C~~C~~~ 429 (514)
|.|+.|+..
T Consensus 3 ~~CP~CG~~ 11 (54)
T TIGR01206 3 FECPDCGAE 11 (54)
T ss_pred cCCCCCCCE
Confidence 345555543
No 161
>KOG2593|consensus
Probab=29.41 E-value=31 Score=33.61 Aligned_cols=36 Identities=22% Similarity=0.489 Sum_probs=18.3
Q ss_pred CcccccCCCcCCCCCchhHHhHhhhccCCccccCCccch
Q psy11032 418 ERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGA 456 (514)
Q Consensus 418 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 456 (514)
..-|.||.|.++|++...++- .-...-.|.|..|+-
T Consensus 126 ~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 126 VAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGG 161 (436)
T ss_pred cccccCCccccchhhhHHHHh---hcccCceEEEecCCC
Confidence 344677777766655443332 111122366666663
No 162
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=29.30 E-value=66 Score=19.79 Aligned_cols=23 Identities=35% Similarity=0.741 Sum_probs=14.9
Q ss_pred cccccccccCCC--hHHHHhHhhhh
Q psy11032 170 HECPTCGKKWRT--RSELNKHTKTH 192 (514)
Q Consensus 170 ~~C~~C~~~f~~--~~~l~~H~~~h 192 (514)
-.|+.|++.|.. ..+-..|.+-|
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH 38 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYH 38 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHH
Confidence 468888877754 45566666555
No 163
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=29.15 E-value=28 Score=22.97 Aligned_cols=14 Identities=29% Similarity=0.895 Sum_probs=8.1
Q ss_pred eeCCcCCCccCCch
Q psy11032 393 FQCHLCGRCFNSPT 406 (514)
Q Consensus 393 ~~C~~C~~~f~~~~ 406 (514)
.+|..|+..|....
T Consensus 6 ~~C~~Cg~~~~~~d 19 (54)
T PF14446_consen 6 CKCPVCGKKFKDGD 19 (54)
T ss_pred ccChhhCCcccCCC
Confidence 45666666665443
No 164
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=29.10 E-value=19 Score=31.76 Aligned_cols=29 Identities=24% Similarity=0.520 Sum_probs=19.2
Q ss_pred CCccCCcCcccCCChHHHHHHhhhhcCCC
Q psy11032 327 HPFTCHFCNKSFTQKTGLERHLTIHNNEK 355 (514)
Q Consensus 327 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 355 (514)
..|.|++|+|.|.-...+..|+...|.+.
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 34778888888888888888887766543
No 165
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=29.01 E-value=25 Score=27.96 Aligned_cols=26 Identities=35% Similarity=0.413 Sum_probs=19.5
Q ss_pred ccccCCcccccCCChhHHHHHHhhccCCC
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMIHSGER 83 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 83 (514)
....|-++|+.|. +|.+|+.+|.+--
T Consensus 75 D~IicLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 75 DYIICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred CeEEEeccCcchH---HHHHHHhcccCCC
Confidence 3577899998886 4788888876643
No 166
>KOG4167|consensus
Probab=28.73 E-value=17 Score=37.63 Aligned_cols=26 Identities=23% Similarity=0.437 Sum_probs=16.4
Q ss_pred CCcccccccccccChhHHHHHhhhhc
Q psy11032 196 RPFVCEICGQGYRTKTHLLVHVGMHN 221 (514)
Q Consensus 196 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 221 (514)
..|-|..|++.|.....+..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 34666666666666666666666654
No 167
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=28.42 E-value=19 Score=26.93 Aligned_cols=13 Identities=38% Similarity=0.692 Sum_probs=6.9
Q ss_pred ceeCCcCCCccCC
Q psy11032 392 IFQCHLCGRCFNS 404 (514)
Q Consensus 392 ~~~C~~C~~~f~~ 404 (514)
.|.|+.||..-.+
T Consensus 22 ~FtCp~Cghe~vs 34 (104)
T COG4888 22 TFTCPRCGHEKVS 34 (104)
T ss_pred eEecCccCCeeee
Confidence 3566666654433
No 168
>KOG4167|consensus
Probab=27.97 E-value=16 Score=37.86 Aligned_cols=26 Identities=35% Similarity=0.575 Sum_probs=16.9
Q ss_pred ccccCCcccccCCChhHHHHHHhhcc
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMIHS 80 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~h~ 80 (514)
..|-|.+|++.|....++..||++|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 35666666666666666666666664
No 169
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=27.51 E-value=40 Score=20.58 Aligned_cols=12 Identities=33% Similarity=1.182 Sum_probs=6.0
Q ss_pred CCcccccCCCcC
Q psy11032 417 GERRFKCDICGQ 428 (514)
Q Consensus 417 ~~~~~~C~~C~~ 428 (514)
+.+.|.|++|+.
T Consensus 21 ~~~~w~C~~C~~ 32 (40)
T PF04810_consen 21 GGKTWICNFCGT 32 (40)
T ss_dssp TTTEEEETTT--
T ss_pred CCCEEECcCCCC
Confidence 445566666654
No 170
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=27.14 E-value=37 Score=26.59 Aligned_cols=25 Identities=16% Similarity=0.381 Sum_probs=17.1
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQ 428 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 428 (514)
.+.|..|+..|.... ..|.||.|+.
T Consensus 70 ~~~C~~Cg~~~~~~~------------~~~~CP~Cgs 94 (113)
T PRK12380 70 QAWCWDCSQVVEIHQ------------HDAQCPHCHG 94 (113)
T ss_pred EEEcccCCCEEecCC------------cCccCcCCCC
Confidence 478999987775432 3356888874
No 171
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=27.00 E-value=38 Score=22.09 Aligned_cols=25 Identities=20% Similarity=0.490 Sum_probs=13.7
Q ss_pred ceeCCcCCCccCCchhhhhhccccc
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHT 416 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~ 416 (514)
.|+|+.|+..|=-.-.+..|...|.
T Consensus 21 ~y~C~~C~~~FC~dCD~fiHE~LH~ 45 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIHETLHN 45 (51)
T ss_dssp EE--TTTT--B-HHHHHTTTTTS-S
T ss_pred eEECCCCCCccccCcChhhhccccC
Confidence 5888888888877777777766654
No 172
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=26.86 E-value=37 Score=26.68 Aligned_cols=26 Identities=15% Similarity=0.492 Sum_probs=17.9
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQA 429 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~ 429 (514)
...|..|+..|..... .+.||.|+..
T Consensus 70 ~~~C~~Cg~~~~~~~~------------~~~CP~Cgs~ 95 (115)
T TIGR00100 70 ECECEDCSEEVSPEID------------LYRCPKCHGI 95 (115)
T ss_pred EEEcccCCCEEecCCc------------CccCcCCcCC
Confidence 4789999887755432 3678888743
No 173
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=26.58 E-value=34 Score=22.22 Aligned_cols=10 Identities=40% Similarity=1.178 Sum_probs=5.0
Q ss_pred ccccCCCcCC
Q psy11032 420 RFKCDICGQA 429 (514)
Q Consensus 420 ~~~C~~C~~~ 429 (514)
.+.|..|++.
T Consensus 37 r~~C~~Cgyt 46 (50)
T PRK00432 37 RWHCGKCGYT 46 (50)
T ss_pred cEECCCcCCE
Confidence 3455555543
No 174
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.44 E-value=83 Score=21.29 Aligned_cols=42 Identities=12% Similarity=0.226 Sum_probs=24.4
Q ss_pred EEeeeCCcceeeccCCCCCCCccccc---cCcccccCCcccccCC
Q psy11032 26 RKLVDNGHKIFLCNTEPGMPDSLLIN---EIAKYTRCDLCGKGFI 67 (514)
Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~k~~~C~~C~~~f~ 67 (514)
...++.+..++.|.-...-..|-++. ..+....|++|+..|.
T Consensus 15 ~~~I~~~~~~l~C~g~~~p~~HPrV~L~mg~~gev~CPYC~t~y~ 59 (62)
T COG4391 15 HETIEIGDLPLMCPGPEPPNDHPRVFLDMGDEGEVVCPYCSTRYR 59 (62)
T ss_pred ceEEEeCCeeEEcCCCCCCCCCCEEEEEcCCCCcEecCccccEEE
Confidence 34566666777776443334443221 3345677888888775
No 175
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=25.67 E-value=53 Score=21.64 Aligned_cols=16 Identities=19% Similarity=0.657 Sum_probs=10.7
Q ss_pred eeCCcCCCccCCchhh
Q psy11032 393 FQCHLCGRCFNSPTTL 408 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l 408 (514)
++|..||..|...-..
T Consensus 29 W~C~~Cgh~w~~~v~~ 44 (55)
T PF14311_consen 29 WKCPKCGHEWKASVND 44 (55)
T ss_pred EECCCCCCeeEccHhh
Confidence 7888887776554433
No 176
>KOG3408|consensus
Probab=25.52 E-value=36 Score=26.52 Aligned_cols=24 Identities=25% Similarity=0.347 Sum_probs=21.2
Q ss_pred ccccCCcccccCCChhHHHHHHhh
Q psy11032 55 KYTRCDLCGKGFIHLKSYQIHKMI 78 (514)
Q Consensus 55 k~~~C~~C~~~f~~~~~l~~H~~~ 78 (514)
-.|.|-+|.+-|.+...|..|.++
T Consensus 56 GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 56 GQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred ceeehhhhhhhhcchHHHHHHHhc
Confidence 358899999999999999999875
No 177
>KOG1280|consensus
Probab=25.34 E-value=47 Score=31.26 Aligned_cols=28 Identities=18% Similarity=0.305 Sum_probs=25.4
Q ss_pred ccccCCccchhcccchhHHHHhhhhcCC
Q psy11032 447 KPHECNVCGARFAKKWNMQVHKQKVHGP 474 (514)
Q Consensus 447 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 474 (514)
..|.|++|+..-.+...|..|+...|-+
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpd 105 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPE 105 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcc
Confidence 3699999999999999999999999965
No 178
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=25.07 E-value=43 Score=25.18 Aligned_cols=14 Identities=36% Similarity=0.847 Sum_probs=8.8
Q ss_pred ceeCCcCCCccCCc
Q psy11032 392 IFQCHLCGRCFNSP 405 (514)
Q Consensus 392 ~~~C~~C~~~f~~~ 405 (514)
|.+|..||..|.+-
T Consensus 2 pH~CtrCG~vf~~g 15 (112)
T COG3364 2 PHQCTRCGEVFDDG 15 (112)
T ss_pred Cceecccccccccc
Confidence 45666666666654
No 179
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=25.04 E-value=19 Score=26.61 Aligned_cols=10 Identities=40% Similarity=1.261 Sum_probs=5.9
Q ss_pred cccCCCcCCC
Q psy11032 421 FKCDICGQAL 430 (514)
Q Consensus 421 ~~C~~C~~~f 430 (514)
|+|..|++.|
T Consensus 54 W~C~~C~~~~ 63 (90)
T PF01780_consen 54 WKCKKCGKKF 63 (90)
T ss_dssp EEETTTTEEE
T ss_pred eecCCCCCEE
Confidence 5666666554
No 180
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=24.88 E-value=70 Score=36.07 Aligned_cols=11 Identities=36% Similarity=0.905 Sum_probs=6.8
Q ss_pred Ccccccccccc
Q psy11032 300 PFVCEICGQGY 310 (514)
Q Consensus 300 ~~~C~~C~~~f 310 (514)
+|.|+.||...
T Consensus 692 vy~CPsCGaev 702 (1337)
T PRK14714 692 VYVCPDCGAEV 702 (1337)
T ss_pred ceeCccCCCcc
Confidence 36677776654
No 181
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=24.80 E-value=36 Score=20.71 Aligned_cols=13 Identities=38% Similarity=0.928 Sum_probs=10.3
Q ss_pred ccccCCCcCCCCC
Q psy11032 420 RFKCDICGQALTT 432 (514)
Q Consensus 420 ~~~C~~C~~~f~~ 432 (514)
||.|+.|+..|-.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 7888888888854
No 182
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=24.36 E-value=42 Score=20.57 Aligned_cols=12 Identities=25% Similarity=0.747 Sum_probs=8.7
Q ss_pred ccccCCcccccC
Q psy11032 55 KYTRCDLCGKGF 66 (514)
Q Consensus 55 k~~~C~~C~~~f 66 (514)
+...|++|+..|
T Consensus 28 ~~~~CpYCg~~y 39 (40)
T PF10276_consen 28 GPVVCPYCGTRY 39 (40)
T ss_dssp CEEEETTTTEEE
T ss_pred CeEECCCCCCEE
Confidence 457788888766
No 183
>KOG3408|consensus
Probab=24.15 E-value=44 Score=26.09 Aligned_cols=21 Identities=19% Similarity=0.392 Sum_probs=14.3
Q ss_pred ccCCccchhcccchhHHHHhh
Q psy11032 449 HECNVCGARFAKKWNMQVHKQ 469 (514)
Q Consensus 449 ~~C~~C~~~f~~~~~l~~H~~ 469 (514)
|.|-.|.+.|.+..+|..|.+
T Consensus 58 fyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 58 FYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred eehhhhhhhhcchHHHHHHHh
Confidence 666677777777777776654
No 184
>KOG2932|consensus
Probab=24.15 E-value=81 Score=29.21 Aligned_cols=73 Identities=15% Similarity=0.322 Sum_probs=48.0
Q ss_pred CceeCCcCCCccCCchhhh--hhc------ccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCc---cchhcc
Q psy11032 391 PIFQCHLCGRCFNSPTTLT--QHK------IVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNV---CGARFA 459 (514)
Q Consensus 391 ~~~~C~~C~~~f~~~~~l~--~H~------~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~---C~~~f~ 459 (514)
.+..|+.|++-..-+..|. .|. +.+. + -.|+.|+...-. +..-.-...|.|.. |.++|-
T Consensus 89 ~VHfCd~Cd~PI~IYGRmIPCkHvFCl~CAr~~~-d--K~Cp~C~d~Vqr-------Ieq~~~g~iFmC~~~~GC~RTyL 158 (389)
T KOG2932|consen 89 RVHFCDRCDFPIAIYGRMIPCKHVFCLECARSDS-D--KICPLCDDRVQR-------IEQIMMGGIFMCAAPHGCLRTYL 158 (389)
T ss_pred ceEeecccCCcceeeecccccchhhhhhhhhcCc-c--ccCcCcccHHHH-------HHHhcccceEEeecchhHHHHHh
Confidence 3688999998766555553 342 1222 2 258888654311 11123345699954 999999
Q ss_pred cchhHHHHhhhhcC
Q psy11032 460 KKWNMQVHKQKVHG 473 (514)
Q Consensus 460 ~~~~l~~H~~~~h~ 473 (514)
+..+|+.|+...|+
T Consensus 159 sqrDlqAHInhrH~ 172 (389)
T KOG2932|consen 159 SQRDLQAHINHRHG 172 (389)
T ss_pred hHHHHHHHhhhhhc
Confidence 99999999999887
No 185
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=23.91 E-value=34 Score=18.54 Aligned_cols=8 Identities=38% Similarity=0.895 Sum_probs=5.0
Q ss_pred cCCccccc
Q psy11032 58 RCDLCGKG 65 (514)
Q Consensus 58 ~C~~C~~~ 65 (514)
.|+.||..
T Consensus 4 ~Cp~Cg~~ 11 (26)
T PF13248_consen 4 FCPNCGAE 11 (26)
T ss_pred CCcccCCc
Confidence 56667663
No 186
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=23.72 E-value=37 Score=27.80 Aligned_cols=32 Identities=28% Similarity=0.817 Sum_probs=14.4
Q ss_pred CceeCCcCCCccCCchhhhhhcccccCCcccccCCCc
Q psy11032 391 PIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICG 427 (514)
Q Consensus 391 ~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~ 427 (514)
-+|.|. |+..+.+. ++|-..-.|+ .|.|..|+
T Consensus 116 ~~Y~C~-C~q~~l~~---RRhn~~~~g~-~YrC~~C~ 147 (156)
T COG3091 116 YPYRCQ-CQQHYLRI---RRHNTVRRGE-VYRCGKCG 147 (156)
T ss_pred eeEEee-cCCccchh---hhcccccccc-eEEeccCC
Confidence 356666 66553222 2332222333 45555554
No 187
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.54 E-value=26 Score=30.77 Aligned_cols=9 Identities=22% Similarity=0.538 Sum_probs=6.5
Q ss_pred ccCCccchh
Q psy11032 449 HECNVCGAR 457 (514)
Q Consensus 449 ~~C~~C~~~ 457 (514)
..||.|.++
T Consensus 63 vvCP~C~yA 71 (267)
T COG1655 63 VVCPICYYA 71 (267)
T ss_pred EEcchhhHH
Confidence 478888765
No 188
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=23.36 E-value=45 Score=18.69 Aligned_cols=7 Identities=29% Similarity=1.364 Sum_probs=2.1
Q ss_pred cccCCCc
Q psy11032 421 FKCDICG 427 (514)
Q Consensus 421 ~~C~~C~ 427 (514)
|.|..|+
T Consensus 16 Y~C~~Cd 22 (30)
T PF07649_consen 16 YRCSECD 22 (30)
T ss_dssp EE-TTT-
T ss_pred EECccCC
Confidence 3444443
No 189
>PRK14873 primosome assembly protein PriA; Provisional
Probab=23.18 E-value=65 Score=34.26 Aligned_cols=45 Identities=22% Similarity=0.519 Sum_probs=24.3
Q ss_pred eeCCcCCCccCCc---hhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccchh
Q psy11032 393 FQCHLCGRCFNSP---TTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCGAR 457 (514)
Q Consensus 393 ~~C~~C~~~f~~~---~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 457 (514)
..|..||..+.-. ..|.. |...+...|.+||+. ..+..|+.||..
T Consensus 384 l~C~~Cg~~~~C~~C~~~L~~----h~~~~~l~Ch~CG~~----------------~~p~~Cp~Cgs~ 431 (665)
T PRK14873 384 LACARCRTPARCRHCTGPLGL----PSAGGTPRCRWCGRA----------------APDWRCPRCGSD 431 (665)
T ss_pred eEhhhCcCeeECCCCCCceeE----ecCCCeeECCCCcCC----------------CcCccCCCCcCC
Confidence 4677777665332 12222 334445667777642 135677777754
No 190
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=23.13 E-value=37 Score=25.12 Aligned_cols=11 Identities=18% Similarity=0.779 Sum_probs=6.6
Q ss_pred ccccCCCcCCC
Q psy11032 420 RFKCDICGQAL 430 (514)
Q Consensus 420 ~~~C~~C~~~f 430 (514)
.|.|..|++.|
T Consensus 54 IW~C~~C~~~~ 64 (90)
T PTZ00255 54 IWRCKGCKKTV 64 (90)
T ss_pred EEEcCCCCCEE
Confidence 46666666655
No 191
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=23.06 E-value=33 Score=20.43 Aligned_cols=9 Identities=44% Similarity=1.291 Sum_probs=4.3
Q ss_pred ccccccccc
Q psy11032 302 VCEICGQGY 310 (514)
Q Consensus 302 ~C~~C~~~f 310 (514)
.|+.||..|
T Consensus 3 ~C~~Cg~~Y 11 (36)
T PF05191_consen 3 ICPKCGRIY 11 (36)
T ss_dssp EETTTTEEE
T ss_pred CcCCCCCcc
Confidence 344455444
No 192
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=22.85 E-value=34 Score=25.35 Aligned_cols=11 Identities=27% Similarity=0.905 Sum_probs=6.8
Q ss_pred ccccCCCcCCC
Q psy11032 420 RFKCDICGQAL 430 (514)
Q Consensus 420 ~~~C~~C~~~f 430 (514)
.|.|..|++.|
T Consensus 53 IW~C~~C~~~~ 63 (91)
T TIGR00280 53 IWTCRKCGAKF 63 (91)
T ss_pred EEEcCCCCCEE
Confidence 36666666655
No 193
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=22.76 E-value=59 Score=20.97 Aligned_cols=38 Identities=18% Similarity=0.253 Sum_probs=16.2
Q ss_pred ccccCCccchhcccchhHHHHhhhhc-CCCCCCCccccc
Q psy11032 447 KPHECNVCGARFAKKWNMQVHKQKVH-GPDVRGTTESQL 484 (514)
Q Consensus 447 ~~~~C~~C~~~f~~~~~l~~H~~~~h-~~~~~~c~~c~~ 484 (514)
+.+.|..||..|.....=+.-...-- -..|-.|..|..
T Consensus 3 k~l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~CR~ 41 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSCRQ 41 (49)
T ss_pred eeEEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHHHH
Confidence 34555556555554333333322210 123555555543
No 194
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=22.73 E-value=19 Score=21.91 Aligned_cols=7 Identities=43% Similarity=1.284 Sum_probs=2.8
Q ss_pred cccCCCc
Q psy11032 421 FKCDICG 427 (514)
Q Consensus 421 ~~C~~C~ 427 (514)
|.|..|+
T Consensus 29 y~C~~C~ 35 (39)
T PF01096_consen 29 YVCCNCG 35 (39)
T ss_dssp EEESSST
T ss_pred EEeCCCC
Confidence 3444443
No 195
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=22.51 E-value=30 Score=21.49 Aligned_cols=12 Identities=33% Similarity=0.703 Sum_probs=5.8
Q ss_pred CCcccccccccc
Q psy11032 196 RPFVCEICGQGY 207 (514)
Q Consensus 196 ~~~~C~~C~~~f 207 (514)
..+.|..||.+.
T Consensus 18 g~~vC~~CG~Vl 29 (43)
T PF08271_consen 18 GELVCPNCGLVL 29 (43)
T ss_dssp TEEEETTT-BBE
T ss_pred CeEECCCCCCEe
Confidence 345566666543
No 196
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=22.42 E-value=41 Score=19.77 Aligned_cols=16 Identities=31% Similarity=0.646 Sum_probs=7.3
Q ss_pred eeCCcCCCccCCchhh
Q psy11032 393 FQCHLCGRCFNSPTTL 408 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l 408 (514)
-.|..|++.|.....+
T Consensus 4 ~~C~eC~~~f~dSyL~ 19 (34)
T PF01286_consen 4 PKCDECGKPFMDSYLL 19 (34)
T ss_dssp EE-TTT--EES-SSCC
T ss_pred chHhHhCCHHHHHHHH
Confidence 3577777777655433
No 197
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.30 E-value=45 Score=22.35 Aligned_cols=7 Identities=29% Similarity=1.308 Sum_probs=3.2
Q ss_pred cCCcccc
Q psy11032 58 RCDLCGK 64 (514)
Q Consensus 58 ~C~~C~~ 64 (514)
+|.+|++
T Consensus 4 kCiiCd~ 10 (68)
T COG4896 4 KCIICDR 10 (68)
T ss_pred eEEEecc
Confidence 3444444
No 198
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=21.89 E-value=50 Score=26.02 Aligned_cols=26 Identities=19% Similarity=0.683 Sum_probs=16.8
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCccc-ccCCCcCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRF-KCDICGQA 429 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~-~C~~C~~~ 429 (514)
.+.|..||..|..... .| .||.|+..
T Consensus 71 ~~~C~~Cg~~~~~~~~------------~~~~CP~Cgs~ 97 (117)
T PRK00564 71 ELECKDCSHVFKPNAL------------DYGVCEKCHSK 97 (117)
T ss_pred EEEhhhCCCccccCCc------------cCCcCcCCCCC
Confidence 4788888877754421 23 48888743
No 199
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=21.66 E-value=51 Score=22.88 Aligned_cols=11 Identities=45% Similarity=1.008 Sum_probs=5.9
Q ss_pred cccccCCCcCC
Q psy11032 419 RRFKCDICGQA 429 (514)
Q Consensus 419 ~~~~C~~C~~~ 429 (514)
+.|.|+.||..
T Consensus 45 r~~~C~~Cg~~ 55 (69)
T PF07282_consen 45 RVFTCPNCGFE 55 (69)
T ss_pred ceEEcCCCCCE
Confidence 34556666554
No 200
>KOG2272|consensus
Probab=21.46 E-value=75 Score=28.35 Aligned_cols=19 Identities=32% Similarity=0.686 Sum_probs=12.3
Q ss_pred ccCCCCccCCcCcccCCCh
Q psy11032 323 HNGIHPFTCHFCNKSFTQK 341 (514)
Q Consensus 323 h~~~~~~~C~~C~~~f~~~ 341 (514)
|.-..-|.|..|.+-|.--
T Consensus 216 hWHveHFvCa~CekPFlGH 234 (332)
T KOG2272|consen 216 HWHVEHFVCAKCEKPFLGH 234 (332)
T ss_pred ccchhheeehhcCCcccch
Confidence 3334448888888877653
No 201
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=20.91 E-value=88 Score=33.04 Aligned_cols=10 Identities=40% Similarity=0.933 Sum_probs=5.8
Q ss_pred cCCccchhcc
Q psy11032 450 ECNVCGARFA 459 (514)
Q Consensus 450 ~C~~C~~~f~ 459 (514)
.|+.||....
T Consensus 43 fC~~CG~~~~ 52 (645)
T PRK14559 43 HCPNCGAETG 52 (645)
T ss_pred cccccCCccc
Confidence 5666665544
No 202
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=20.71 E-value=53 Score=25.73 Aligned_cols=27 Identities=22% Similarity=0.427 Sum_probs=16.8
Q ss_pred ceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCC
Q psy11032 392 IFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQA 429 (514)
Q Consensus 392 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~ 429 (514)
.+.|..|+..|..... ..+.||.|+..
T Consensus 70 ~~~C~~Cg~~~~~~~~-----------~~~~CP~Cgs~ 96 (114)
T PRK03681 70 ECWCETCQQYVTLLTQ-----------RVRRCPQCHGD 96 (114)
T ss_pred EEEcccCCCeeecCCc-----------cCCcCcCcCCC
Confidence 4789999876643311 11568888843
No 203
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=20.57 E-value=16 Score=26.55 Aligned_cols=8 Identities=38% Similarity=1.468 Sum_probs=1.7
Q ss_pred ceeCCcCC
Q psy11032 392 IFQCHLCG 399 (514)
Q Consensus 392 ~~~C~~C~ 399 (514)
.|.|+.|+
T Consensus 22 ~F~CPfC~ 29 (81)
T PF05129_consen 22 VFDCPFCN 29 (81)
T ss_dssp ----TTT-
T ss_pred eEcCCcCC
Confidence 35555555
No 204
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=20.17 E-value=50 Score=22.90 Aligned_cols=26 Identities=35% Similarity=0.893 Sum_probs=10.4
Q ss_pred eeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCC
Q psy11032 393 FQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQAL 430 (514)
Q Consensus 393 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f 430 (514)
-.|..|++.|.- ..+.+.|..||..|
T Consensus 10 ~~C~~C~~~F~~------------~~rrhhCr~CG~~v 35 (69)
T PF01363_consen 10 SNCMICGKKFSL------------FRRRHHCRNCGRVV 35 (69)
T ss_dssp SB-TTT--B-BS------------SS-EEE-TTT--EE
T ss_pred CcCcCcCCcCCC------------ceeeEccCCCCCEE
Confidence 468888888832 12445677777654
No 205
>PRK05580 primosome assembly protein PriA; Validated
Probab=20.05 E-value=78 Score=33.90 Aligned_cols=50 Identities=30% Similarity=0.718 Sum_probs=0.0
Q ss_pred CCceeCCcCCCccCCchhhhhhcccccCCcccccCCCcCCCCCchhHHhHhhhccCCccccCCccc
Q psy11032 390 TPIFQCHLCGRCFNSPTTLTQHKIVHTGERRFKCDICGQALTTKPKLNDHMLLHTGEKPHECNVCG 455 (514)
Q Consensus 390 ~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 455 (514)
.+...|..||....-...=. -+..|...+...|.+||+... .+..|+.||
T Consensus 379 ~~~~~C~~Cg~~~~C~~C~~-~l~~h~~~~~l~Ch~Cg~~~~---------------~~~~Cp~Cg 428 (679)
T PRK05580 379 APFLLCRDCGWVAECPHCDA-SLTLHRFQRRLRCHHCGYQEP---------------IPKACPECG 428 (679)
T ss_pred CCceEhhhCcCccCCCCCCC-ceeEECCCCeEECCCCcCCCC---------------CCCCCCCCc
Done!