Query psy359
Match_columns 405
No_of_seqs 161 out of 2408
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 20:29:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy359.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/359hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 7.7E-27 1.7E-31 193.9 4.9 133 216-392 130-265 (279)
2 KOG2462|consensus 99.9 1.3E-24 2.9E-29 180.7 10.0 141 123-301 125-265 (279)
3 KOG3608|consensus 99.9 2.1E-24 4.5E-29 183.8 11.0 230 127-397 133-379 (467)
4 KOG1074|consensus 99.9 1.4E-24 3E-29 204.2 8.1 213 187-399 604-936 (958)
5 KOG1074|consensus 99.9 2.7E-22 5.8E-27 189.0 9.7 211 160-370 603-935 (958)
6 KOG3623|consensus 99.9 3.5E-22 7.6E-27 185.0 9.0 107 128-238 210-331 (1007)
7 KOG3608|consensus 99.8 4.1E-20 8.8E-25 157.9 9.7 204 128-382 177-398 (467)
8 KOG3576|consensus 99.7 4.8E-16 1E-20 123.0 9.1 125 126-306 115-239 (267)
9 KOG3576|consensus 99.6 1.8E-16 4E-21 125.3 0.7 126 185-369 114-239 (267)
10 KOG3623|consensus 99.5 5.7E-15 1.2E-19 137.8 2.8 122 217-393 211-332 (1007)
11 PHA00733 hypothetical protein 99.2 1.6E-11 3.5E-16 94.5 4.2 89 276-396 37-125 (128)
12 PLN03086 PRLI-interacting fact 99.2 8E-11 1.7E-15 111.2 9.2 103 278-396 452-566 (567)
13 PLN03086 PRLI-interacting fact 99.0 2.9E-10 6.3E-15 107.5 6.5 102 279-405 433-548 (567)
14 PHA02768 hypothetical protein; 98.9 5.1E-10 1.1E-14 70.3 1.6 45 342-388 5-49 (55)
15 PHA00733 hypothetical protein 98.8 7.6E-09 1.6E-13 79.7 4.1 82 186-304 38-124 (128)
16 KOG3993|consensus 98.7 3.4E-09 7.4E-14 93.9 0.1 203 188-396 267-484 (500)
17 PF13465 zf-H2C2_2: Zinc-finge 98.6 4.3E-08 9.3E-13 52.6 2.1 24 358-381 2-25 (26)
18 PF13465 zf-H2C2_2: Zinc-finge 98.6 3.4E-08 7.4E-13 53.0 1.7 26 329-354 1-26 (26)
19 PHA02768 hypothetical protein; 98.5 3.4E-08 7.3E-13 62.2 1.3 43 188-232 5-47 (55)
20 PHA00616 hypothetical protein 98.5 6E-08 1.3E-12 57.9 1.6 34 342-375 1-34 (44)
21 PHA00616 hypothetical protein 98.4 7.7E-08 1.7E-12 57.4 1.3 33 370-402 1-33 (44)
22 KOG3993|consensus 98.4 8.5E-08 1.8E-12 85.2 1.2 110 128-244 267-384 (500)
23 PHA00732 hypothetical protein 98.3 2.6E-07 5.5E-12 64.3 2.1 46 342-393 1-47 (79)
24 PF05605 zf-Di19: Drought indu 98.3 8E-07 1.7E-11 57.4 3.8 50 343-395 3-54 (54)
25 PHA00732 hypothetical protein 97.9 7.2E-06 1.6E-10 57.1 3.0 49 279-367 1-49 (79)
26 PF00096 zf-C2H2: Zinc finger, 97.9 1E-05 2.2E-10 42.1 1.9 22 371-392 1-22 (23)
27 PF13894 zf-C2H2_4: C2H2-type 97.8 1.8E-05 3.8E-10 41.6 2.2 24 371-394 1-24 (24)
28 PF00096 zf-C2H2: Zinc finger, 97.8 1.3E-05 2.9E-10 41.6 1.4 23 343-365 1-23 (23)
29 PF05605 zf-Di19: Drought indu 97.8 5.1E-05 1.1E-09 49.0 4.4 53 279-367 2-54 (54)
30 PF12756 zf-C2H2_2: C2H2 type 97.7 1.9E-05 4.1E-10 58.7 2.3 54 344-397 1-78 (100)
31 PF12756 zf-C2H2_2: C2H2 type 97.6 3E-05 6.5E-10 57.6 1.9 73 281-365 1-73 (100)
32 PF13912 zf-C2H2_6: C2H2-type 97.6 2.7E-05 5.9E-10 42.2 1.2 26 370-395 1-26 (27)
33 PF13894 zf-C2H2_4: C2H2-type 97.5 6.2E-05 1.4E-09 39.4 1.6 24 343-366 1-24 (24)
34 PF13912 zf-C2H2_6: C2H2-type 97.4 7.3E-05 1.6E-09 40.5 1.4 26 342-367 1-26 (27)
35 PF09237 GAGA: GAGA factor; I 97.2 0.00028 6E-09 43.2 2.3 42 357-398 10-52 (54)
36 COG5189 SFP1 Putative transcri 97.1 0.00022 4.7E-09 61.6 1.7 72 276-363 346-419 (423)
37 COG5189 SFP1 Putative transcri 97.1 0.00014 3E-09 62.7 0.4 53 339-391 346-419 (423)
38 KOG2231|consensus 97.0 0.0018 4E-08 62.9 6.8 81 218-303 117-206 (669)
39 PF13909 zf-H2C2_5: C2H2-type 96.9 0.00071 1.5E-08 35.3 2.1 24 371-395 1-24 (24)
40 smart00355 ZnF_C2H2 zinc finge 96.8 0.0012 2.7E-08 34.9 2.2 23 371-393 1-23 (26)
41 KOG2785|consensus 96.6 0.0066 1.4E-07 54.4 6.9 54 339-392 163-242 (390)
42 smart00355 ZnF_C2H2 zinc finge 96.6 0.0015 3.2E-08 34.6 1.9 24 343-366 1-24 (26)
43 KOG2231|consensus 96.5 0.0047 1E-07 60.2 5.7 101 281-398 117-240 (669)
44 PF12874 zf-met: Zinc-finger o 96.5 0.0014 3.1E-08 34.5 1.2 22 371-392 1-22 (25)
45 COG5236 Uncharacterized conser 96.5 0.0054 1.2E-07 53.8 5.2 82 217-303 152-244 (493)
46 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0042 9E-08 32.4 2.0 24 129-153 1-24 (24)
47 PRK04860 hypothetical protein; 96.1 0.0025 5.4E-08 51.1 1.5 38 342-383 119-156 (160)
48 PF12874 zf-met: Zinc-finger o 96.0 0.0034 7.3E-08 33.1 1.2 23 343-365 1-23 (25)
49 COG5236 Uncharacterized conser 95.8 0.0086 1.9E-07 52.6 3.3 88 128-240 151-244 (493)
50 PF09237 GAGA: GAGA factor; I 95.7 0.012 2.5E-07 36.2 2.7 31 186-216 22-52 (54)
51 PF12171 zf-C2H2_jaz: Zinc-fin 95.4 0.0037 8E-08 33.7 -0.2 22 371-392 2-23 (27)
52 PRK04860 hypothetical protein; 95.4 0.0061 1.3E-07 48.9 0.9 38 318-356 120-157 (160)
53 KOG2482|consensus 95.4 0.03 6.6E-07 49.3 5.1 147 128-300 144-355 (423)
54 PF13913 zf-C2HC_2: zinc-finge 95.2 0.017 3.6E-07 30.4 2.0 21 371-392 3-23 (25)
55 KOG4173|consensus 95.0 0.018 3.9E-07 46.8 2.6 80 127-241 78-171 (253)
56 PF12171 zf-C2H2_jaz: Zinc-fin 94.9 0.013 2.7E-07 31.5 1.0 21 280-300 2-22 (27)
57 KOG2482|consensus 94.8 0.22 4.8E-06 44.1 8.7 107 276-395 141-304 (423)
58 KOG2893|consensus 94.6 0.0074 1.6E-07 50.1 -0.6 46 219-303 13-58 (341)
59 KOG1146|consensus 94.5 0.018 3.9E-07 59.8 1.7 107 276-398 462-617 (1406)
60 KOG2785|consensus 93.3 0.16 3.4E-06 45.9 5.0 73 129-210 167-242 (390)
61 smart00451 ZnF_U1 U1-like zinc 93.1 0.065 1.4E-06 30.7 1.7 23 370-392 3-25 (35)
62 PF13913 zf-C2HC_2: zinc-finge 93.0 0.088 1.9E-06 27.6 1.9 20 280-300 3-22 (25)
63 KOG4173|consensus 92.7 0.055 1.2E-06 44.1 1.3 51 345-395 109-171 (253)
64 smart00451 ZnF_U1 U1-like zinc 92.0 0.13 2.9E-06 29.4 2.0 23 279-301 3-25 (35)
65 KOG1146|consensus 92.0 0.046 9.9E-07 57.0 0.0 106 245-395 1231-1353(1406)
66 PF12013 DUF3505: Protein of u 91.4 0.3 6.5E-06 36.7 3.9 26 370-395 80-109 (109)
67 TIGR00622 ssl1 transcription f 91.1 0.22 4.7E-06 37.0 2.7 26 276-301 78-103 (112)
68 TIGR00622 ssl1 transcription f 90.4 0.46 1E-05 35.3 3.9 88 277-366 13-105 (112)
69 KOG2893|consensus 89.4 0.1 2.2E-06 43.6 -0.2 48 190-241 12-59 (341)
70 cd00350 rubredoxin_like Rubred 89.3 0.22 4.7E-06 28.2 1.2 24 343-378 2-25 (33)
71 COG4049 Uncharacterized protei 87.4 0.42 9.2E-06 30.0 1.7 31 211-241 12-42 (65)
72 COG4049 Uncharacterized protei 86.6 0.26 5.5E-06 30.9 0.4 30 337-366 12-41 (65)
73 PF09538 FYDLN_acid: Protein o 86.2 0.32 6.9E-06 36.2 0.8 13 215-227 25-37 (108)
74 COG5048 FOG: Zn-finger [Genera 86.1 0.18 3.9E-06 48.3 -0.7 61 341-401 288-354 (467)
75 PF12013 DUF3505: Protein of u 84.8 1 2.2E-05 33.8 3.0 84 277-367 9-109 (109)
76 COG5048 FOG: Zn-finger [Genera 83.7 0.25 5.5E-06 47.3 -0.9 57 187-243 288-350 (467)
77 PF04959 ARS2: Arsenite-resist 82.5 0.55 1.2E-05 39.6 0.8 30 368-397 75-104 (214)
78 PRK00464 nrdR transcriptional 82.4 0.34 7.4E-06 38.6 -0.5 15 343-357 29-43 (154)
79 PF02892 zf-BED: BED zinc fing 80.7 1.3 2.8E-05 27.0 1.8 27 368-394 14-44 (45)
80 COG2888 Predicted Zn-ribbon RN 80.4 1.3 2.8E-05 28.5 1.7 13 343-355 28-40 (61)
81 PF10571 UPF0547: Uncharacteri 79.1 1 2.2E-05 23.8 0.8 8 373-380 17-24 (26)
82 TIGR02098 MJ0042_CXXC MJ0042 f 78.5 1.1 2.4E-05 26.1 1.0 33 343-380 3-35 (38)
83 KOG2186|consensus 77.6 1.1 2.5E-05 38.1 1.1 47 189-238 4-50 (276)
84 cd00729 rubredoxin_SM Rubredox 76.9 1.4 3E-05 25.1 1.0 24 343-378 3-26 (34)
85 COG1198 PriA Primosomal protei 76.6 1.7 3.7E-05 44.0 2.3 8 318-325 436-443 (730)
86 PF06524 NOA36: NOA36 protein; 76.0 1 2.3E-05 38.4 0.5 90 276-393 139-232 (314)
87 PF09986 DUF2225: Uncharacteri 75.8 0.67 1.4E-05 39.5 -0.7 43 341-383 4-61 (214)
88 smart00659 RPOLCX RNA polymera 75.5 1.8 3.8E-05 26.3 1.3 26 343-379 3-28 (44)
89 smart00734 ZnF_Rad18 Rad18-lik 74.6 2.8 6E-05 22.1 1.7 19 372-391 3-21 (26)
90 PF12907 zf-met2: Zinc-binding 74.5 0.74 1.6E-05 27.2 -0.5 27 371-397 2-31 (40)
91 TIGR02300 FYDLN_acid conserved 74.3 1.3 2.9E-05 33.5 0.6 14 215-228 25-38 (129)
92 TIGR00373 conserved hypothetic 74.0 2.6 5.7E-05 34.0 2.3 44 330-382 97-140 (158)
93 PF13717 zinc_ribbon_4: zinc-r 73.6 1.9 4.2E-05 24.8 1.1 33 343-380 3-35 (36)
94 PRK14890 putative Zn-ribbon RN 73.6 1.1 2.4E-05 28.9 0.0 10 341-350 47-56 (59)
95 PF09538 FYDLN_acid: Protein o 72.8 2.2 4.8E-05 31.8 1.5 14 369-382 25-38 (108)
96 PRK00398 rpoP DNA-directed RNA 72.3 1.5 3.4E-05 26.9 0.5 29 343-381 4-32 (46)
97 PHA00626 hypothetical protein 72.2 1.4 3E-05 27.8 0.3 12 343-354 24-35 (59)
98 COG1996 RPC10 DNA-directed RNA 71.3 2 4.4E-05 26.6 0.8 27 342-378 6-32 (49)
99 KOG2807|consensus 70.6 6.6 0.00014 35.0 4.1 81 277-377 288-374 (378)
100 PF13719 zinc_ribbon_5: zinc-r 70.4 2 4.2E-05 25.0 0.6 32 190-226 4-35 (37)
101 PRK06266 transcription initiat 70.2 3 6.6E-05 34.3 1.9 42 332-382 107-148 (178)
102 TIGR02605 CxxC_CxxC_SSSS putat 69.7 1.6 3.4E-05 27.6 0.1 29 343-378 6-34 (52)
103 smart00834 CxxC_CXXC_SSSS Puta 68.8 1.7 3.7E-05 25.7 0.1 29 343-378 6-34 (41)
104 smart00614 ZnF_BED BED zinc fi 68.4 4.3 9.4E-05 25.4 1.9 21 372-392 20-45 (50)
105 PF03604 DNA_RNApol_7kD: DNA d 67.9 3.7 8E-05 23.0 1.3 11 343-353 1-11 (32)
106 smart00531 TFIIE Transcription 67.0 7.2 0.00016 31.0 3.4 41 337-381 94-134 (147)
107 PF09723 Zn-ribbon_8: Zinc rib 66.6 2.6 5.6E-05 25.3 0.6 29 343-378 6-34 (42)
108 PRK09678 DNA-binding transcrip 66.4 1.7 3.6E-05 29.6 -0.3 20 339-358 24-45 (72)
109 PRK04023 DNA polymerase II lar 66.2 4.3 9.4E-05 42.0 2.4 8 371-378 664-671 (1121)
110 COG1198 PriA Primosomal protei 65.9 4.5 9.7E-05 41.1 2.5 38 319-378 446-483 (730)
111 KOG2807|consensus 65.7 7.4 0.00016 34.7 3.4 26 276-301 342-367 (378)
112 PF08274 PhnA_Zn_Ribbon: PhnA 64.7 3 6.5E-05 22.9 0.6 24 344-378 4-27 (30)
113 TIGR00373 conserved hypothetic 62.8 6.8 0.00015 31.6 2.5 31 366-405 105-135 (158)
114 PF09986 DUF2225: Uncharacteri 62.4 2 4.3E-05 36.7 -0.6 23 278-300 4-26 (214)
115 KOG2186|consensus 61.7 3.9 8.4E-05 35.0 1.0 32 330-363 18-49 (276)
116 PF02176 zf-TRAF: TRAF-type zi 60.8 6.7 0.00014 25.4 1.8 40 341-381 8-53 (60)
117 PF15269 zf-C2H2_7: Zinc-finge 60.5 9.3 0.0002 22.9 2.1 25 126-150 18-42 (54)
118 PF05443 ROS_MUCR: ROS/MUCR tr 58.5 6.5 0.00014 30.5 1.6 26 187-215 71-96 (132)
119 PF05443 ROS_MUCR: ROS/MUCR tr 58.4 6.8 0.00015 30.4 1.7 24 343-369 73-96 (132)
120 KOG2272|consensus 57.9 4 8.7E-05 34.8 0.4 41 343-391 281-321 (332)
121 PRK06266 transcription initiat 57.1 8.2 0.00018 31.8 2.2 30 367-405 114-143 (178)
122 TIGR02300 FYDLN_acid conserved 56.9 7 0.00015 29.7 1.5 15 341-355 25-39 (129)
123 cd00730 rubredoxin Rubredoxin; 56.9 5.7 0.00012 24.9 0.9 13 280-292 2-14 (50)
124 TIGR00595 priA primosomal prot 56.6 6.2 0.00013 38.7 1.6 28 337-379 235-262 (505)
125 COG1997 RPL43A Ribosomal prote 56.4 5.2 0.00011 28.1 0.7 14 369-382 52-65 (89)
126 smart00531 TFIIE Transcription 55.9 10 0.00022 30.2 2.5 35 367-405 96-130 (147)
127 PF06524 NOA36: NOA36 protein; 53.5 8.6 0.00019 33.1 1.7 89 124-233 138-226 (314)
128 COG1998 RPS31 Ribosomal protei 52.8 8.3 0.00018 23.8 1.1 9 370-378 37-45 (51)
129 PF09845 DUF2072: Zn-ribbon co 52.7 8.1 0.00018 29.6 1.3 15 342-356 1-15 (131)
130 PF13240 zinc_ribbon_2: zinc-r 52.7 6.3 0.00014 20.1 0.5 6 373-378 16-21 (23)
131 KOG2593|consensus 52.3 11 0.00024 35.1 2.4 48 330-380 116-163 (436)
132 PRK14873 primosome assembly pr 52.2 7.7 0.00017 39.3 1.5 29 318-350 384-418 (665)
133 COG4530 Uncharacterized protei 50.9 5.9 0.00013 29.0 0.3 11 188-198 26-36 (129)
134 PF04216 FdhE: Protein involve 50.9 1.7 3.7E-05 39.1 -3.0 73 318-405 173-245 (290)
135 PF07754 DUF1610: Domain of un 50.2 8.6 0.00019 19.9 0.8 9 397-405 15-23 (24)
136 COG3364 Zn-ribbon containing p 49.6 11 0.00023 27.3 1.4 18 342-359 2-19 (112)
137 PRK04023 DNA polymerase II lar 49.2 16 0.00035 38.1 3.1 9 371-379 652-660 (1121)
138 PF07975 C1_4: TFIIH C1-like d 48.9 5 0.00011 25.2 -0.3 26 341-366 20-45 (51)
139 PRK14714 DNA polymerase II lar 46.9 13 0.00028 39.9 2.1 29 342-381 692-720 (1337)
140 COG5151 SSL1 RNA polymerase II 45.9 15 0.00033 32.4 2.1 26 341-366 387-412 (421)
141 COG5151 SSL1 RNA polymerase II 45.3 11 0.00025 33.2 1.3 26 276-301 385-410 (421)
142 KOG1280|consensus 44.4 17 0.00038 32.8 2.2 37 341-377 78-116 (381)
143 PF01363 FYVE: FYVE zinc finge 44.4 10 0.00023 25.4 0.7 26 343-380 10-35 (69)
144 PF13696 zf-CCHC_2: Zinc knuck 44.2 13 0.00029 20.7 1.0 16 2-17 2-17 (32)
145 COG3091 SprT Zn-dependent meta 43.4 9.5 0.00021 30.0 0.5 34 341-379 116-149 (156)
146 KOG4167|consensus 43.3 31 0.00068 34.6 3.9 28 126-153 790-817 (907)
147 PF12760 Zn_Tnp_IS1595: Transp 43.2 17 0.00037 22.2 1.5 11 368-378 35-45 (46)
148 KOG1701|consensus 43.1 8.1 0.00018 35.8 0.1 12 189-200 303-314 (468)
149 COG3357 Predicted transcriptio 42.2 13 0.00029 26.3 0.9 31 340-380 56-86 (97)
150 KOG2907|consensus 41.5 19 0.00042 26.7 1.7 11 343-353 103-113 (116)
151 PF05495 zf-CHY: CHY zinc fing 41.3 7.7 0.00017 26.4 -0.3 35 338-380 37-71 (71)
152 KOG2071|consensus 41.2 19 0.00041 35.2 2.1 29 276-305 415-443 (579)
153 KOG1280|consensus 40.4 25 0.00055 31.8 2.6 39 128-170 79-117 (381)
154 PRK14714 DNA polymerase II lar 40.1 30 0.00066 37.3 3.5 9 371-379 693-701 (1337)
155 PRK03564 formate dehydrogenase 39.8 18 0.00039 32.6 1.7 33 318-350 188-220 (309)
156 PF13453 zf-TFIIB: Transcripti 39.3 18 0.00039 21.4 1.2 20 369-388 18-37 (41)
157 COG1592 Rubrerythrin [Energy p 38.3 15 0.00033 29.7 0.9 24 188-224 134-157 (166)
158 PF04959 ARS2: Arsenite-resist 37.5 22 0.00048 30.2 1.8 31 276-306 74-104 (214)
159 PRK03824 hypA hydrogenase nick 37.2 13 0.00029 29.0 0.4 14 342-355 70-83 (135)
160 cd00065 FYVE FYVE domain; Zinc 37.1 20 0.00044 22.8 1.2 26 344-381 4-29 (57)
161 PF14353 CpXC: CpXC protein 37.0 20 0.00044 27.6 1.4 33 8-40 38-70 (128)
162 PF01096 TFIIS_C: Transcriptio 36.1 4.8 0.0001 23.7 -1.7 10 343-352 29-38 (39)
163 PTZ00255 60S ribosomal protein 36.1 13 0.00028 26.6 0.1 14 369-382 53-66 (90)
164 TIGR01206 lysW lysine biosynth 35.5 16 0.00035 23.3 0.5 30 343-380 3-32 (54)
165 PF15135 UPF0515: Uncharacteri 35.2 29 0.00063 29.8 2.1 57 161-228 111-167 (278)
166 PF01780 Ribosomal_L37ae: Ribo 35.0 11 0.00023 27.0 -0.4 12 370-381 53-64 (90)
167 TIGR01562 FdhE formate dehydro 34.8 29 0.00062 31.4 2.1 33 318-350 185-218 (305)
168 KOG3408|consensus 34.3 31 0.00067 26.0 1.9 28 123-150 52-79 (129)
169 TIGR00280 L37a ribosomal prote 33.4 13 0.00028 26.6 -0.2 12 370-381 53-64 (91)
170 KOG2071|consensus 33.2 28 0.00061 34.1 1.9 27 186-212 416-442 (579)
171 PF04780 DUF629: Protein of un 32.9 30 0.00065 33.1 2.1 26 371-396 58-83 (466)
172 PRK12380 hydrogenase nickel in 31.4 24 0.00052 26.6 0.9 25 342-378 70-94 (113)
173 smart00440 ZnF_C2C2 C2C2 Zinc 31.3 9 0.00019 22.7 -1.1 10 371-380 29-38 (40)
174 COG4888 Uncharacterized Zn rib 30.5 15 0.00033 26.6 -0.2 11 371-381 47-57 (104)
175 PF03833 PolC_DP2: DNA polymer 30.1 17 0.00037 37.2 0.0 11 372-382 694-704 (900)
176 TIGR00100 hypA hydrogenase nic 29.8 25 0.00054 26.6 0.8 26 342-379 70-95 (115)
177 PF01155 HypA: Hydrogenase exp 29.7 21 0.00046 26.9 0.5 25 343-379 71-95 (113)
178 KOG4124|consensus 29.6 9.1 0.0002 34.4 -1.7 55 340-394 347-422 (442)
179 PF14446 Prok-RING_1: Prokaryo 29.4 30 0.00064 22.1 1.0 28 343-382 6-33 (54)
180 COG1327 Predicted transcriptio 29.3 16 0.00034 28.8 -0.3 42 319-360 2-46 (156)
181 TIGR00244 transcriptional regu 28.8 16 0.00034 28.8 -0.4 42 319-360 2-46 (147)
182 PF10013 DUF2256: Uncharacteri 28.7 34 0.00074 20.4 1.1 16 372-387 10-25 (42)
183 KOG3408|consensus 28.7 33 0.00071 25.9 1.2 22 342-363 57-78 (129)
184 PRK00564 hypA hydrogenase nick 28.3 24 0.00053 26.7 0.5 27 341-379 70-97 (117)
185 PF09416 UPF1_Zn_bind: RNA hel 28.1 49 0.0011 26.3 2.2 40 339-378 11-68 (152)
186 KOG4167|consensus 27.7 25 0.00055 35.2 0.7 26 216-241 792-817 (907)
187 smart00064 FYVE Protein presen 27.6 35 0.00075 22.7 1.2 11 344-354 12-22 (68)
188 PRK03976 rpl37ae 50S ribosomal 27.5 18 0.00039 25.8 -0.3 12 370-381 54-65 (90)
189 PF12773 DZR: Double zinc ribb 27.2 35 0.00076 21.0 1.1 26 343-379 13-38 (50)
190 COG2879 Uncharacterized small 26.9 65 0.0014 21.2 2.2 19 381-399 23-41 (65)
191 PF07295 DUF1451: Protein of u 26.7 17 0.00038 28.8 -0.5 9 343-351 113-121 (146)
192 KOG2593|consensus 26.6 40 0.00086 31.7 1.7 21 276-296 125-145 (436)
193 PF13248 zf-ribbon_3: zinc-rib 26.6 30 0.00066 18.1 0.6 7 344-350 4-10 (26)
194 COG1571 Predicted DNA-binding 25.5 39 0.00085 31.8 1.4 12 370-381 367-378 (421)
195 PRK04351 hypothetical protein; 25.4 34 0.00073 27.3 0.9 32 343-382 113-144 (149)
196 PF06397 Desulfoferrod_N: Desu 24.4 31 0.00068 19.9 0.4 16 4-19 2-17 (36)
197 KOG0717|consensus 24.4 35 0.00075 32.4 0.9 22 217-238 293-314 (508)
198 KOG2906|consensus 24.4 6.3 0.00014 28.3 -2.9 17 278-294 20-36 (105)
199 KOG4124|consensus 24.0 21 0.00045 32.3 -0.6 70 277-362 347-418 (442)
200 COG3677 Transposase and inacti 24.0 34 0.00073 26.5 0.6 14 342-355 53-66 (129)
201 PF13878 zf-C2H2_3: zinc-finge 23.7 82 0.0018 18.7 2.1 24 129-152 14-39 (41)
202 PF13824 zf-Mss51: Zinc-finger 23.7 75 0.0016 20.4 2.0 17 368-384 12-28 (55)
203 PF08271 TF_Zn_Ribbon: TFIIB z 23.6 30 0.00065 20.7 0.2 8 371-378 20-27 (43)
204 smart00154 ZnF_AN1 AN1-like Zi 23.6 37 0.00081 19.9 0.6 12 370-381 12-23 (39)
205 PF14311 DUF4379: Domain of un 23.5 50 0.0011 20.9 1.3 16 343-358 29-44 (55)
206 COG1594 RPB9 DNA-directed RNA 23.0 23 0.0005 26.7 -0.4 12 342-353 100-111 (113)
207 TIGR03826 YvyF flagellar opero 22.9 43 0.00094 26.2 1.0 16 276-291 78-93 (137)
208 PF08209 Sgf11: Sgf11 (transcr 22.4 48 0.001 18.7 0.9 21 371-392 5-25 (33)
209 PF04423 Rad50_zn_hook: Rad50 22.4 32 0.00069 21.8 0.2 12 372-383 22-33 (54)
210 PRK03681 hypA hydrogenase nick 22.2 45 0.00097 25.2 1.0 26 342-378 70-95 (114)
211 COG0068 HypF Hydrogenase matur 22.2 22 0.00047 35.7 -0.9 30 344-379 153-182 (750)
212 PF14205 Cys_rich_KTR: Cystein 22.1 39 0.00084 21.5 0.5 13 276-288 25-37 (55)
213 PF10276 zf-CHCC: Zinc-finger 21.4 48 0.001 19.6 0.8 13 6-18 27-39 (40)
214 PF08790 zf-LYAR: LYAR-type C2 21.4 42 0.00091 18.1 0.5 19 9-28 1-19 (28)
215 PF14787 zf-CCHC_5: GAG-polypr 21.2 40 0.00087 19.3 0.4 15 372-386 4-18 (36)
216 cd00924 Cyt_c_Oxidase_Vb Cytoc 21.1 44 0.00095 24.4 0.7 16 340-355 77-92 (97)
217 KOG1842|consensus 21.1 48 0.001 31.2 1.1 26 341-366 14-39 (505)
218 PF04780 DUF629: Protein of un 21.0 70 0.0015 30.8 2.2 28 279-306 57-84 (466)
219 COG1655 Uncharacterized protei 20.9 20 0.00044 30.4 -1.2 39 341-379 18-71 (267)
220 PLN02294 cytochrome c oxidase 20.6 46 0.001 26.9 0.8 17 339-355 138-154 (174)
221 smart00731 SprT SprT homologue 20.4 42 0.00092 26.5 0.6 32 342-380 112-143 (146)
222 smart00661 RPOL9 RNA polymeras 20.4 44 0.00095 20.7 0.5 10 343-352 21-30 (52)
223 KOG4118|consensus 20.0 57 0.0012 21.5 1.0 28 371-398 39-66 (74)
No 1
>KOG2462|consensus
Probab=99.93 E-value=7.7e-27 Score=193.94 Aligned_cols=133 Identities=31% Similarity=0.655 Sum_probs=111.4
Q ss_pred CcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecCCCcccCCChHHH
Q psy359 216 PYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNILSSKHIL 295 (405)
Q Consensus 216 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l 295 (405)
.|+|..|++.+.+.+.|.+|.+.|-..+ ..+.+.|+.|++.|.+...|
T Consensus 130 r~~c~eCgk~ysT~snLsrHkQ~H~~~~--------------------------------s~ka~~C~~C~K~YvSmpAL 177 (279)
T KOG2462|consen 130 RYKCPECGKSYSTSSNLSRHKQTHRSLD--------------------------------SKKAFSCKYCGKVYVSMPAL 177 (279)
T ss_pred ceeccccccccccccccchhhccccccc--------------------------------ccccccCCCCCceeeehHHH
Confidence 3666666666666666666666643222 35578899999999999999
Q ss_pred HHHHhhhccCCCccccccCCCCCcCCCcccc---hHHHHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCccc
Q psy359 296 QEHVRVVHMGLSRKYHYEYKPDGVCDVCGEY---KKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQ 372 (405)
Q Consensus 296 ~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~---~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~ 372 (405)
..|++. |... +.|.+||+. .=.|+.|+|+|||||||.|+.|+++|+.++.|+.||+.|.+.+.|+
T Consensus 178 kMHirT-H~l~-----------c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~q 245 (279)
T KOG2462|consen 178 KMHIRT-HTLP-----------CECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQ 245 (279)
T ss_pred hhHhhc-cCCC-----------cccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCcccc
Confidence 999994 5432 789999997 4489999999999999999999999999999999999999999999
Q ss_pred CCccccccCChHHHHHHHHh
Q psy359 373 CDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 373 C~~C~~~f~~~~~l~~H~~~ 392 (405)
|..|+|.|...+.|.+|...
T Consensus 246 C~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 246 CPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred CcchhhHHHHHHHHHHhhhh
Confidence 99999999999999999765
No 2
>KOG2462|consensus
Probab=99.91 E-value=1.3e-24 Score=180.65 Aligned_cols=141 Identities=30% Similarity=0.597 Sum_probs=124.1
Q ss_pred CCcCcccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChH
Q psy359 123 KTLPLTYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKL 202 (405)
Q Consensus 123 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~ 202 (405)
......|.|..|++.+.+..+|-+|.++|..... .+.+.|..|++++.+|...-|.+-+...+..|.+|||.|....
T Consensus 125 ~~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s---~ka~~C~~C~K~YvSmpALkMHirTH~l~c~C~iCGKaFSRPW 201 (279)
T KOG2462|consen 125 AAKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDS---KKAFSCKYCGKVYVSMPALKMHIRTHTLPCECGICGKAFSRPW 201 (279)
T ss_pred cccCCceeccccccccccccccchhhcccccccc---cccccCCCCCceeeehHHHhhHhhccCCCcccccccccccchH
Confidence 4455679999999999999999999999988654 5678999999999977664444444447899999999999999
Q ss_pred HHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceec
Q psy359 203 HLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKC 282 (405)
Q Consensus 203 ~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C 282 (405)
.|+.|+++|+|+|||.|+.|++.|..++.|+.||++|- +.+.|.|
T Consensus 202 LLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS-----------------------------------~~K~~qC 246 (279)
T KOG2462|consen 202 LLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHS-----------------------------------DVKKHQC 246 (279)
T ss_pred HhhcccccccCCCCccCCcccchhcchHHHHHHHHhhc-----------------------------------CCccccC
Confidence 99999999999999999999999999999999999965 4568999
Q ss_pred CCCcccCCChHHHHHHHhh
Q psy359 283 DLCGNILSSKHILQEHVRV 301 (405)
Q Consensus 283 ~~C~~~f~~~~~l~~H~~~ 301 (405)
..|+++|...+-|.+|...
T Consensus 247 ~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 247 PRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cchhhHHHHHHHHHHhhhh
Confidence 9999999999999999873
No 3
>KOG3608|consensus
Probab=99.91 E-value=2.1e-24 Score=183.84 Aligned_cols=230 Identities=25% Similarity=0.475 Sum_probs=179.8
Q ss_pred ccccc--CCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCC-CccCcc--chhccCCh
Q psy359 127 LTYPC--DQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHER-KFPCTF--CEKSFKRK 201 (405)
Q Consensus 127 ~~~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~-~~~C~~--C~~~f~~~ 201 (405)
.-|.| ..|+..|.+..+|.+|...|.--. ++.. ...-..++ .+.|.+ |-+.|.++
T Consensus 133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~c-eyd~-------------------~~~~~D~~pv~~C~W~~Ct~~~~~k 192 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFC-EYDI-------------------QKTPEDERPVTMCNWAMCTKHMGNK 192 (467)
T ss_pred hhhccChhhcCCcccCHHHHHHHHHHhhhhh-hhhh-------------------hhCCCCCCceeeccchhhhhhhccH
Confidence 34788 679999999999999998775410 0000 01111111 244543 77788888
Q ss_pred HHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCCc----ccccCCCCcchHHhhhhhcccCcchhhhhhhcc
Q psy359 202 LHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDF----ESVKNKTTPLTRYLNKHLHDAHPEAIKTERERK 277 (405)
Q Consensus 202 ~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~----c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~ 277 (405)
+.|+.|++.|++++..-|+.|+..|.+...|..|++.-..... |..|.++|.+...|..|+.+-.
T Consensus 193 ~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv----------- 261 (467)
T KOG3608|consen 193 YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV----------- 261 (467)
T ss_pred HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh-----------
Confidence 8888888888888888888888888888888888876444332 7778888888888877773322
Q ss_pred cceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccc---hHHHHhhhhhcCCCCcccccc--ccccc
Q psy359 278 LIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEY---KKQLLQHKRLHFPLRPYACTQ--CDKTF 352 (405)
Q Consensus 278 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~---~~~l~~H~~~h~~~~~~~C~~--C~~~f 352 (405)
..|+|+.|+.+...+++|.+|++.-|... +| |+|+.|... .++|..|+.+|. +-.|+|.. |...|
T Consensus 262 n~ykCplCdmtc~~~ssL~~H~r~rHs~d--------kp-fKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~ 331 (467)
T KOG3608|consen 262 NCYKCPLCDMTCSSASSLTTHIRYRHSKD--------KP-FKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSV 331 (467)
T ss_pred hcccccccccCCCChHHHHHHHHhhhccC--------CC-ccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHH
Confidence 35999999999999999999999888764 55 999999886 899999999888 56799988 99999
Q ss_pred cCchhHHHHhhhhC-CC--CcccCCccccccCChHHHHHHHHhhcccc
Q psy359 353 KKKNHLTTHYRIHT-GE--KPYQCDICGRGFAQSNDMKKHRRTVHKAQ 397 (405)
Q Consensus 353 ~~~~~l~~H~~~h~-~~--~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 397 (405)
.+...|+.|++.++ |. -+|.|-.|.+.|++..+|..|++..|+-+
T Consensus 332 r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 332 RTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred HHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 99999999998766 54 46999999999999999999999999986
No 4
>KOG1074|consensus
Probab=99.91 E-value=1.4e-24 Score=204.18 Aligned_cols=213 Identities=21% Similarity=0.374 Sum_probs=154.3
Q ss_pred CCccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCC------ccc---ccCCCCcchHH
Q psy359 187 RKFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFD------FES---VKNKTTPLTRY 257 (405)
Q Consensus 187 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~------~c~---~c~~~~~~~~~ 257 (405)
.+-.|-+|.+....++.|+-|.++|+|++||+|.+|++.|.++..|+.|+-.|-..- .|+ +|-+.|...-.
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 569999999999999999999999999999999999999999999999998754322 288 99999998888
Q ss_pred hhhhhcccCcchhhhh----hhcccceecCCCcccCCChHHHHHHHhhh-------ccCCCcccccc---CCCCCcCCCc
Q psy359 258 LNKHLHDAHPEAIKTE----RERKLIFKCDLCGNILSSKHILQEHVRVV-------HMGLSRKYHYE---YKPDGVCDVC 323 (405)
Q Consensus 258 l~~h~~~~~~~~~~~~----~~~~~~~~C~~C~~~f~~~~~l~~H~~~~-------h~~~~~~~~~~---~~~~~~C~~C 323 (405)
|.+|+.......+... ......-+|..|.+.|.....+..++..+ +..+......+ .-|...+..|
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccc
Confidence 8888744322222211 22334457999999998888888888743 11110000000 1112334444
Q ss_pred ccc---hHHHHhhhh-----------------------hcCCCC------------------------------------
Q psy359 324 GEY---KKQLLQHKR-----------------------LHFPLR------------------------------------ 341 (405)
Q Consensus 324 ~~~---~~~l~~H~~-----------------------~h~~~~------------------------------------ 341 (405)
+.. ...+..+-. .++.++
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence 432 000000000 000000
Q ss_pred -----------------------------------ccccccccccccCchhHHHHhhhhCCCCcccCCccccccCChHHH
Q psy359 342 -----------------------------------PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQSNDM 386 (405)
Q Consensus 342 -----------------------------------~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 386 (405)
...|..|++.|....+|..|++.|.+++||.|.+|++.|+...+|
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnL 923 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNL 923 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhh
Confidence 189999999999999999999999999999999999999999999
Q ss_pred HHHHHhhcccccc
Q psy359 387 KKHRRTVHKAQIH 399 (405)
Q Consensus 387 ~~H~~~~h~~~~~ 399 (405)
..||.+|+..+|.
T Consensus 924 KvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 924 KVHMGTHMWVQPP 936 (958)
T ss_pred hhhhccccccCCC
Confidence 9999999998753
No 5
>KOG1074|consensus
Probab=99.87 E-value=2.7e-22 Score=188.98 Aligned_cols=211 Identities=22% Similarity=0.434 Sum_probs=154.8
Q ss_pred CCCCcCCCchHhhh--cHHHHHHhhccCCCCccCccchhccCChHHHHHHHhhhcCC----CCcccc---cchHhhhChH
Q psy359 160 SKTDRCELCDKVVT--NLAAHHNEVHAHERKFPCTFCEKSFKRKLHLKVHTRTHTGE----KPYACY---LCDKRFAQIS 230 (405)
Q Consensus 160 ~~~~~C~~C~~~f~--~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----~~~~C~---~C~~~f~~~~ 230 (405)
..+-+|-+|.+++. +.+..|.+.|.|++||+|.+|++.|.++.+|+.|+-+|... ..+.|+ +|.+.|.+..
T Consensus 603 TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V 682 (958)
T KOG1074|consen 603 TDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAV 682 (958)
T ss_pred CCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccc
Confidence 34578999999887 66677899999999999999999999999999999887642 458999 9999999999
Q ss_pred HHHHHHhhhCCC---------------CcccccCCCCcchHHhhhhhcccCcch------hh-hhhhccc----ceecCC
Q psy359 231 DRIKHLKSSHNF---------------DFESVKNKTTPLTRYLNKHLHDAHPEA------IK-TERERKL----IFKCDL 284 (405)
Q Consensus 231 ~l~~H~~~~h~~---------------~~c~~c~~~~~~~~~l~~h~~~~~~~~------~~-~~~~~~~----~~~C~~ 284 (405)
.|-+|++.|.+. +.|..|.+.|.....+-.++...-... +. .+..++. +..+..
T Consensus 683 ~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~ 762 (958)
T KOG1074|consen 683 TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENS 762 (958)
T ss_pred cccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccc
Confidence 999999997742 238889999988888877774442111 10 1111222 566777
Q ss_pred CcccCCChHHHHHHHhhh----------------------ccCCCcc---------------------------------
Q psy359 285 CGNILSSKHILQEHVRVV----------------------HMGLSRK--------------------------------- 309 (405)
Q Consensus 285 C~~~f~~~~~l~~H~~~~----------------------h~~~~~~--------------------------------- 309 (405)
|+..+.....+..+-... +....+.
T Consensus 763 ~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~ 842 (958)
T KOG1074|consen 763 CGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLA 842 (958)
T ss_pred cccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccc
Confidence 887776655554442211 0000000
Q ss_pred -----cc------------------------ccCCCCCcCCCcccc---hHHHHhhhhhcCCCCccccccccccccCchh
Q psy359 310 -----YH------------------------YEYKPDGVCDVCGEY---KKQLLQHKRLHFPLRPYACTQCDKTFKKKNH 357 (405)
Q Consensus 310 -----~~------------------------~~~~~~~~C~~C~~~---~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 357 (405)
.+ ...+-...|..||+. .+.|..|+++|+++|||.|.+|+++|..+..
T Consensus 843 t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgn 922 (958)
T KOG1074|consen 843 TKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGN 922 (958)
T ss_pred cccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhh
Confidence 00 000000459999997 8899999999999999999999999999999
Q ss_pred HHHHhhhhCCCCc
Q psy359 358 LTTHYRIHTGEKP 370 (405)
Q Consensus 358 l~~H~~~h~~~~~ 370 (405)
|..||..|+...+
T Consensus 923 LKvHMgtH~w~q~ 935 (958)
T KOG1074|consen 923 LKVHMGTHMWVQP 935 (958)
T ss_pred hhhhhccccccCC
Confidence 9999999986543
No 6
>KOG3623|consensus
Probab=99.87 E-value=3.5e-22 Score=185.01 Aligned_cols=107 Identities=37% Similarity=0.636 Sum_probs=92.5
Q ss_pred ccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhh--cHHHHHHhhccC-------------CCCccCc
Q psy359 128 TYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVT--NLAAHHNEVHAH-------------ERKFPCT 192 (405)
Q Consensus 128 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~--~l~~~~~~~h~~-------------~~~~~C~ 192 (405)
...|++|++.+.....|+.|++..|... ...|.|..|..+|- ..+..|+..|.. .+.|+|.
T Consensus 210 lltcpycdrgykrltslkeHikyrhekn----e~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCt 285 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKN----EPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCT 285 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhC----CCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhcccccc
Confidence 4679999999999999999999877755 67899999999998 333334666532 2569999
Q ss_pred cchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhh
Q psy359 193 FCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKS 238 (405)
Q Consensus 193 ~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 238 (405)
.|+|+|..+..|+.|+++|.|++||.|+.|++.|....++..||-.
T Consensus 286 ECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 286 ECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred ccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 9999999999999999999999999999999999999999999854
No 7
>KOG3608|consensus
Probab=99.82 E-value=4.1e-20 Score=157.85 Aligned_cols=204 Identities=26% Similarity=0.414 Sum_probs=148.1
Q ss_pred cccc--CCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHHHH
Q psy359 128 TYPC--DQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLHLK 205 (405)
Q Consensus 128 ~~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~ 205 (405)
.+.| ..|.+.|.+++.|+.|++.|.+ + +..-|+.|+.-|.+...|-
T Consensus 177 v~~C~W~~Ct~~~~~k~~LreH~r~Hs~------e--------------------------KvvACp~Cg~~F~~~tkl~ 224 (467)
T KOG3608|consen 177 VTMCNWAMCTKHMGNKYRLREHIRTHSN------E--------------------------KVVACPHCGELFRTKTKLF 224 (467)
T ss_pred eeeccchhhhhhhccHHHHHHHHHhcCC------C--------------------------eEEecchHHHHhccccHHH
Confidence 3567 5699999999999999999887 3 3455666666666666666
Q ss_pred HHHhhhc--CCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecC
Q psy359 206 VHTRTHT--GEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCD 283 (405)
Q Consensus 206 ~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~ 283 (405)
+|++.-+ ...+|+|..|.+.|.++..|..|+..|.....|+.|......++.|..|+...|. ..++|+|+
T Consensus 225 DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs--------~dkpfKCd 296 (467)
T KOG3608|consen 225 DHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHS--------KDKPFKCD 296 (467)
T ss_pred HHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhc--------cCCCcccc
Confidence 6654422 2345666666666666666666666666666666666666666666666666663 56799999
Q ss_pred CCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCC--cccc---hHHHHhhhhhcCC---CCccccccccccccCc
Q psy359 284 LCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDV--CGEY---KKQLLQHKRLHFP---LRPYACTQCDKTFKKK 355 (405)
Q Consensus 284 ~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~--C~~~---~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~ 355 (405)
.|++.|.+.+.|.+|.. .|. ++.|+|+. |-.. ..++.+|++.+++ +-+|.|-.|.+.|++.
T Consensus 297 ~Cd~~c~~esdL~kH~~-~HS----------~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G 365 (467)
T KOG3608|consen 297 ECDTRCVRESDLAKHVQ-VHS----------KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSG 365 (467)
T ss_pred chhhhhccHHHHHHHHH-hcc----------ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccc
Confidence 99999999999999999 564 34589977 8664 7889999987763 4569999999999999
Q ss_pred hhHHHHhhhhCC------CCcccCCccccccCC
Q psy359 356 NHLTTHYRIHTG------EKPYQCDICGRGFAQ 382 (405)
Q Consensus 356 ~~l~~H~~~h~~------~~~~~C~~C~~~f~~ 382 (405)
.+|..|++.-++ -..|.=..|.-.|..
T Consensus 366 ~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~mR 398 (467)
T KOG3608|consen 366 KSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFMR 398 (467)
T ss_pred hhHHHHHHHhhcccCCCCCCceeeeeccCceee
Confidence 999999865544 234555666666654
No 8
>KOG3576|consensus
Probab=99.66 E-value=4.8e-16 Score=123.00 Aligned_cols=125 Identities=31% Similarity=0.621 Sum_probs=103.5
Q ss_pred CcccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHHHH
Q psy359 126 PLTYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLHLK 205 (405)
Q Consensus 126 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~ 205 (405)
...|.|.+|++.|.-...|.+|++-|.. .+.|-|..||+.|.+..+|+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~--------------------------------vkr~lct~cgkgfndtfdlk 162 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSD--------------------------------VKRHLCTFCGKGFNDTFDLK 162 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccH--------------------------------HHHHHHhhccCcccchhhhh
Confidence 3457888888888888888888876655 35689999999999999999
Q ss_pred HHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecCCC
Q psy359 206 VHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCDLC 285 (405)
Q Consensus 206 ~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C 285 (405)
+|.++|+|.+||+|..|++.|..+-+|..|++..|+..- .-+ -.+...+.|.|..|
T Consensus 163 rh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~----------------------~ya--ykerr~kl~vcedc 218 (267)
T KOG3576|consen 163 RHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQH----------------------QYA--YKERRAKLYVCEDC 218 (267)
T ss_pred hhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchH----------------------HHH--HHHhhhheeeeccc
Confidence 999999999999999999999999999999998887331 001 11125678999999
Q ss_pred cccCCChHHHHHHHhhhccCC
Q psy359 286 GNILSSKHILQEHVRVVHMGL 306 (405)
Q Consensus 286 ~~~f~~~~~l~~H~~~~h~~~ 306 (405)
|.+-.....+..|+..+|...
T Consensus 219 g~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 219 GYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred CCCCCChhHHHHHHHhcCCCC
Confidence 999999999999999877664
No 9
>KOG3576|consensus
Probab=99.59 E-value=1.8e-16 Score=125.32 Aligned_cols=126 Identities=24% Similarity=0.450 Sum_probs=104.1
Q ss_pred CCCCccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcc
Q psy359 185 HERKFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHD 264 (405)
Q Consensus 185 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~ 264 (405)
+...|.|.+|++.|..+..|.+|++-|...+.|-|..|++.|+...+|++|+++|.
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rtht------------------------ 169 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHT------------------------ 169 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhcccc------------------------
Confidence 34679999999999999999999999999999999999999999999999999976
Q ss_pred cCcchhhhhhhcccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccc
Q psy359 265 AHPEAIKTERERKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYA 344 (405)
Q Consensus 265 ~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~ 344 (405)
+-+||+|..|++.|..+-+|..|.+.+|+......+-| ...+.|.
T Consensus 170 -----------gvrpykc~~c~kaftqrcsleshl~kvhgv~~~yayke------------------------rr~kl~v 214 (267)
T KOG3576|consen 170 -----------GVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKE------------------------RRAKLYV 214 (267)
T ss_pred -----------CccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHH------------------------hhhheee
Confidence 46699999999999999999999998887652222111 1245677
Q ss_pred cccccccccCchhHHHHhhhhCCCC
Q psy359 345 CTQCDKTFKKKNHLTTHYRIHTGEK 369 (405)
Q Consensus 345 C~~C~~~f~~~~~l~~H~~~h~~~~ 369 (405)
|..||.+-.....+..|++.|+...
T Consensus 215 cedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 215 CEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred ecccCCCCCChhHHHHHHHhcCCCC
Confidence 7777777777777777777776543
No 10
>KOG3623|consensus
Probab=99.51 E-value=5.7e-15 Score=137.76 Aligned_cols=122 Identities=30% Similarity=0.634 Sum_probs=100.4
Q ss_pred cccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecCCCcccCCChHHHH
Q psy359 217 YACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNILSSKHILQ 296 (405)
Q Consensus 217 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~ 296 (405)
..|++|.+.+.....|+.|++-.|.. .+..|.|..|..+|..+..|.
T Consensus 211 ltcpycdrgykrltslkeHikyrhek---------------------------------ne~nfsC~lCsytFAyRtQLE 257 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEK---------------------------------NEPNFSCMLCSYTFAYRTQLE 257 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhh---------------------------------CCCCCcchhhhhhhhhHHHHH
Confidence 79999999999999999999875531 344588999999999999999
Q ss_pred HHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCcc
Q psy359 297 EHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDIC 376 (405)
Q Consensus 297 ~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C 376 (405)
+|+..+-.+. +. ..++-.-...|.|+|+.|+|+|..+-.|..|+|+|.|++||.|+.|
T Consensus 258 rhm~~hkpg~-dq---------------------a~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnC 315 (1007)
T KOG3623|consen 258 RHMQLHKPGG-DQ---------------------AISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNC 315 (1007)
T ss_pred HHHHhhcCCC-cc---------------------cccccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCccc
Confidence 9999543332 11 0111122345779999999999999999999999999999999999
Q ss_pred ccccCChHHHHHHHHhh
Q psy359 377 GRGFAQSNDMKKHRRTV 393 (405)
Q Consensus 377 ~~~f~~~~~l~~H~~~~ 393 (405)
+|.|...+++..||.++
T Consensus 316 kKRFSHSGSySSHmSSK 332 (1007)
T KOG3623|consen 316 KKRFSHSGSYSSHMSSK 332 (1007)
T ss_pred ccccccCCccccccccc
Confidence 99999999999998763
No 11
>PHA00733 hypothetical protein
Probab=99.19 E-value=1.6e-11 Score=94.47 Aligned_cols=89 Identities=20% Similarity=0.403 Sum_probs=74.8
Q ss_pred cccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCc
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKK 355 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 355 (405)
..+.+.|.+|.+.|.....|..|.- |.+|+.. .+.+||.|+.|++.|.+.
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~-----------------------------l~~~~~~-~~~kPy~C~~Cgk~Fss~ 86 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSY-----------------------------LYKLLTS-KAVSPYVCPLCLMPFSSS 86 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHH-----------------------------HHhhccc-CCCCCccCCCCCCcCCCH
Confidence 3567899999999998888777643 2334333 458899999999999999
Q ss_pred hhHHHHhhhhCCCCcccCCccccccCChHHHHHHHHhhccc
Q psy359 356 NHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKKHRRTVHKA 396 (405)
Q Consensus 356 ~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 396 (405)
..|..|++.| ..+|.|+.|++.|.....|..|+..+|+.
T Consensus 87 s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 87 VSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred HHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcCc
Confidence 9999999976 35799999999999999999999999975
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.18 E-value=8e-11 Score=111.23 Aligned_cols=103 Identities=29% Similarity=0.623 Sum_probs=85.9
Q ss_pred cceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccc--hHHHHhhhhhcCCCCccccccccccccC-
Q psy359 278 LIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEY--KKQLLQHKRLHFPLRPYACTQCDKTFKK- 354 (405)
Q Consensus 278 ~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~--~~~l~~H~~~h~~~~~~~C~~C~~~f~~- 354 (405)
..+.|+.|++.|. ...|..|+...| .. +.|+ |+.. +..|..|+.+|.+.+++.|.+|++.|..
T Consensus 452 ~H~~C~~Cgk~f~-~s~LekH~~~~H-kp-----------v~Cp-Cg~~~~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g 517 (567)
T PLN03086 452 NHVHCEKCGQAFQ-QGEMEKHMKVFH-EP-----------LQCP-CGVVLEKEQMVQHQASTCPLRLITCRFCGDMVQAG 517 (567)
T ss_pred cCccCCCCCCccc-hHHHHHHHHhcC-CC-----------ccCC-CCCCcchhHHHhhhhccCCCCceeCCCCCCccccC
Confidence 3468999999996 678999999765 22 7899 9865 8899999999999999999999999952
Q ss_pred ---------chhHHHHhhhhCCCCcccCCccccccCChHHHHHHHHhhccc
Q psy359 355 ---------KNHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKKHRRTVHKA 396 (405)
Q Consensus 355 ---------~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 396 (405)
...|..|.... |.+++.|..|++.|. ...|..|+...|+.
T Consensus 518 ~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vr-lrdm~~H~~~~h~~ 566 (567)
T PLN03086 518 GSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVM-LKEMDIHQIAVHQK 566 (567)
T ss_pred ccccchhhhhhhHHHHHHhc-CCcceEccccCCeee-ehhHHHHHHHhhcC
Confidence 35799998885 889999999998866 56788999887763
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.05 E-value=2.9e-10 Score=107.50 Aligned_cols=102 Identities=20% Similarity=0.480 Sum_probs=84.4
Q ss_pred ceecCC--CcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccc--hHHHHhhhhhcCCCCccccccccccccC
Q psy359 279 IFKCDL--CGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEY--KKQLLQHKRLHFPLRPYACTQCDKTFKK 354 (405)
Q Consensus 279 ~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~--~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 354 (405)
...|+. |+..|.. ..|..| +.|+.|++. .+.|..|+.+++ ++|.|+ |++.+ .
T Consensus 433 ~V~Cp~~~Cg~v~~r-~el~~H-------------------~~C~~Cgk~f~~s~LekH~~~~H--kpv~Cp-Cg~~~-~ 488 (567)
T PLN03086 433 NVVCPHDGCGIVLRV-EEAKNH-------------------VHCEKCGQAFQQGEMEKHMKVFH--EPLQCP-CGVVL-E 488 (567)
T ss_pred ceeCCcccccceeec-cccccC-------------------ccCCCCCCccchHHHHHHHHhcC--CCccCC-CCCCc-c
Confidence 356874 9998843 333433 459999886 788999999986 789999 99765 6
Q ss_pred chhHHHHhhhhCCCCcccCCccccccCC----------hHHHHHHHHhhccccccccccCC
Q psy359 355 KNHLTTHYRIHTGEKPYQCDICGRGFAQ----------SNDMKKHRRTVHKAQIHAVEDKG 405 (405)
Q Consensus 355 ~~~l~~H~~~h~~~~~~~C~~C~~~f~~----------~~~l~~H~~~~h~~~~~~~~~~~ 405 (405)
+..|..|+..|.+++++.|.+|++.|.. ...|..|..+ .|.+++.|..||
T Consensus 489 R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~-CG~rt~~C~~Cg 548 (567)
T PLN03086 489 KEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESI-CGSRTAPCDSCG 548 (567)
T ss_pred hhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHh-cCCcceEccccC
Confidence 6899999999999999999999999952 4589999988 599999999997
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.91 E-value=5.1e-10 Score=70.30 Aligned_cols=45 Identities=22% Similarity=0.489 Sum_probs=41.2
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCccccccCChHHHHH
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKK 388 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 388 (405)
.|+|+.||+.|.....|..|++.|. ++|+|..|++.|.+.+.|..
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~~ 49 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYIE 49 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeEE
Confidence 3899999999999999999999999 79999999999998887753
No 15
>PHA00733 hypothetical protein
Probab=98.76 E-value=7.6e-09 Score=79.72 Aligned_cols=82 Identities=18% Similarity=0.315 Sum_probs=63.8
Q ss_pred CCCccCccchhccCChHHHHHH--Hh---hhcCCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhh
Q psy359 186 ERKFPCTFCEKSFKRKLHLKVH--TR---THTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNK 260 (405)
Q Consensus 186 ~~~~~C~~C~~~f~~~~~l~~H--~~---~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~ 260 (405)
.+++.|.+|.+.|.....|..+ +. .+++.+||.|..|++.|.+...|..|++.+
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--------------------- 96 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--------------------- 96 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC---------------------
Confidence 3568888888888877766665 21 234578899999999999999999998752
Q ss_pred hhcccCcchhhhhhhcccceecCCCcccCCChHHHHHHHhhhcc
Q psy359 261 HLHDAHPEAIKTERERKLIFKCDLCGNILSSKHILQEHVRVVHM 304 (405)
Q Consensus 261 h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 304 (405)
+.+|.|..|++.|.....|..|+...|.
T Consensus 97 ----------------~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 97 ----------------EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred ----------------CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 1248899999999999999999987664
No 16
>KOG3993|consensus
Probab=98.68 E-value=3.4e-09 Score=93.88 Aligned_cols=203 Identities=15% Similarity=0.182 Sum_probs=125.1
Q ss_pred CccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCCCCcccccCCC----CcchHHhhhhhc
Q psy359 188 KFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKT----TPLTRYLNKHLH 263 (405)
Q Consensus 188 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~----~~~~~~l~~h~~ 263 (405)
-|+|.+|...|.+...|.+|.-......-|+|+.|++.|+....|..|.+.|-....-...+-. ......+.+.-.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 4999999999999999999954333333499999999999999999999986542211100000 000111111110
Q ss_pred ccCcchhhhhhhcccceecCCCcccCCChHHHHHHHhhhccCCCccc-----cccC-CC-CCcCCCcccchHH---HHhh
Q psy359 264 DAHPEAIKTERERKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKY-----HYEY-KP-DGVCDVCGEYKKQ---LLQH 333 (405)
Q Consensus 264 ~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~-----~~~~-~~-~~~C~~C~~~~~~---l~~H 333 (405)
+ .+. + +....|.|.+|+++|....-|+.|+..+|....... ..+. .+ .+-|+.+.-..+. -..+
T Consensus 347 r---sg~--d-ss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~ 420 (500)
T KOG3993|consen 347 R---SGD--D-SSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDE 420 (500)
T ss_pred c---cCC--c-ccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccc
Confidence 1 111 1 234489999999999999999999987776654331 1110 11 1234444433111 0111
Q ss_pred hhhcCC-CCccccccccccccCchhHHHHhhhhCCCCcccCCccccccCChHHHHHHHHhhccc
Q psy359 334 KRLHFP-LRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKKHRRTVHKA 396 (405)
Q Consensus 334 ~~~h~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 396 (405)
+.-+.+ .....|++|+..+.++..=--+.+....+..|.|.+|.-.|.+...|.+|+..-|..
T Consensus 421 vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 421 VLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred eeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 111111 122577888877777665555555455566799999999999999999998887754
No 17
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.56 E-value=4.3e-08 Score=52.57 Aligned_cols=24 Identities=58% Similarity=1.272 Sum_probs=13.5
Q ss_pred HHHHhhhhCCCCcccCCccccccC
Q psy359 358 LTTHYRIHTGEKPYQCDICGRGFA 381 (405)
Q Consensus 358 l~~H~~~h~~~~~~~C~~C~~~f~ 381 (405)
|..|++.|.+++||+|++|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555554
No 18
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.55 E-value=3.4e-08 Score=52.96 Aligned_cols=26 Identities=38% Similarity=0.810 Sum_probs=23.6
Q ss_pred HHHhhhhhcCCCCccccccccccccC
Q psy359 329 QLLQHKRLHFPLRPYACTQCDKTFKK 354 (405)
Q Consensus 329 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 354 (405)
+|.+|+++|++++||+|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47889999999999999999999974
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.52 E-value=3.4e-08 Score=62.15 Aligned_cols=43 Identities=14% Similarity=0.403 Sum_probs=38.2
Q ss_pred CccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHH
Q psy359 188 KFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDR 232 (405)
Q Consensus 188 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 232 (405)
.|.|+.|++.|.....|..|+++|+ ++|+|..|++.|...+.|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 4899999999999999999999998 689999999999865544
No 20
>PHA00616 hypothetical protein
Probab=98.48 E-value=6e-08 Score=57.88 Aligned_cols=34 Identities=21% Similarity=0.454 Sum_probs=18.5
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDI 375 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 375 (405)
||+|+.||+.|.+.+.|..|++.|+|++++.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 3555555555555555555555555555555543
No 21
>PHA00616 hypothetical protein
Probab=98.43 E-value=7.7e-08 Score=57.42 Aligned_cols=33 Identities=33% Similarity=0.568 Sum_probs=30.8
Q ss_pred cccCCccccccCChHHHHHHHHhhccccccccc
Q psy359 370 PYQCDICGRGFAQSNDMKKHRRTVHKAQIHAVE 402 (405)
Q Consensus 370 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~ 402 (405)
||+|+.||+.|.+.++|.+|++.||+++++-|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 699999999999999999999999999987764
No 22
>KOG3993|consensus
Probab=98.40 E-value=8.5e-08 Score=85.23 Aligned_cols=110 Identities=23% Similarity=0.423 Sum_probs=63.2
Q ss_pred ccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhh---cHHHHHHhhccCCCC-ccCcc--chhccCCh
Q psy359 128 TYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVT---NLAAHHNEVHAHERK-FPCTF--CEKSFKRK 201 (405)
Q Consensus 128 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~---~l~~~~~~~h~~~~~-~~C~~--C~~~f~~~ 201 (405)
.|.|..|...|.+-..|..|.-.... -..|.|++|+++|+ +|..| +++|.-... -+-.. =.+...+.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV------~vEYrCPEC~KVFsCPANLASH-RRWHKPR~eaa~a~~~P~k~~~~~r 339 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIV------HVEYRCPECDKVFSCPANLASH-RRWHKPRPEAAKAGSPPPKQAVETR 339 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeE------EeeecCCcccccccCchhhhhh-hcccCCchhhhhcCCCChhhhhhhh
Confidence 48899999999999999888654333 34577777777776 45544 555532110 00000 00000000
Q ss_pred HHHHHHHhh--hcCCCCcccccchHhhhChHHHHHHHhhhCCCCc
Q psy359 202 LHLKVHTRT--HTGEKPYACYLCDKRFAQISDRIKHLKSSHNFDF 244 (405)
Q Consensus 202 ~~l~~H~~~--h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~ 244 (405)
...+.--+. ...+..|.|.+|++.|.....|+.|+..|+....
T Consensus 340 ae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~ 384 (500)
T KOG3993|consen 340 AEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPL 384 (500)
T ss_pred hhhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhcccc
Confidence 000000000 0112359999999999999999999999886554
No 23
>PHA00732 hypothetical protein
Probab=98.33 E-value=2.6e-07 Score=64.29 Aligned_cols=46 Identities=24% Similarity=0.495 Sum_probs=38.7
Q ss_pred ccccccccccccCchhHHHHhhh-hCCCCcccCCccccccCChHHHHHHHHhh
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRI-HTGEKPYQCDICGRGFAQSNDMKKHRRTV 393 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 393 (405)
||.|..|++.|.+...|..|++. |. ++.|+.|++.|. .|..|++++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 58999999999999999999985 54 358999999998 577888553
No 24
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.31 E-value=8e-07 Score=57.41 Aligned_cols=50 Identities=22% Similarity=0.515 Sum_probs=41.2
Q ss_pred cccccccccccCchhHHHHhhhhC-C-CCcccCCccccccCChHHHHHHHHhhcc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHT-G-EKPYQCDICGRGFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~-~-~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 395 (405)
|.|++|++ ..+...|..|+...| . .+.+.|++|...+. .+|..|+..+|+
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 89999999 666789999986554 3 35699999998765 599999999885
No 25
>PHA00732 hypothetical protein
Probab=97.94 E-value=7.2e-06 Score=57.08 Aligned_cols=49 Identities=29% Similarity=0.679 Sum_probs=36.7
Q ss_pred ceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCchhH
Q psy359 279 IFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHL 358 (405)
Q Consensus 279 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 358 (405)
+|.|..|++.|.+.+.|..|++..|. ++.|+.|++.|. .|
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-------------------------------------~~~C~~CgKsF~---~l 40 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT-------------------------------------LTKCPVCNKSYR---RL 40 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC-------------------------------------CCccCCCCCEeC---Ch
Confidence 47899999999999999999874332 246888888887 46
Q ss_pred HHHhhhhCC
Q psy359 359 TTHYRIHTG 367 (405)
Q Consensus 359 ~~H~~~h~~ 367 (405)
..|+....+
T Consensus 41 ~~H~~~~~~ 49 (79)
T PHA00732 41 NQHFYSQYD 49 (79)
T ss_pred hhhhcccCC
Confidence 777765544
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.86 E-value=1e-05 Score=42.08 Aligned_cols=22 Identities=41% Similarity=1.124 Sum_probs=15.1
Q ss_pred ccCCccccccCChHHHHHHHHh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
|+|+.|++.|.+...|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 5667777777777777777665
No 27
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.78 E-value=1.8e-05 Score=41.56 Aligned_cols=24 Identities=42% Similarity=1.063 Sum_probs=16.1
Q ss_pred ccCCccccccCChHHHHHHHHhhc
Q psy359 371 YQCDICGRGFAQSNDMKKHRRTVH 394 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~~h 394 (405)
|.|++|++.|.+...|..|+.++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 567777777777777777777765
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.76 E-value=1.3e-05 Score=41.58 Aligned_cols=23 Identities=43% Similarity=0.925 Sum_probs=21.5
Q ss_pred cccccccccccCchhHHHHhhhh
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIH 365 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h 365 (405)
|+|++|++.|.+...|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999999875
No 29
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.75 E-value=5.1e-05 Score=48.96 Aligned_cols=53 Identities=25% Similarity=0.432 Sum_probs=37.5
Q ss_pred ceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCchhH
Q psy359 279 IFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHL 358 (405)
Q Consensus 279 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 358 (405)
.|.|+.|++ ..+...|..|....|... .+.+.|++|...+.. .|
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~---------------------------------~~~v~CPiC~~~~~~--~l 45 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSE---------------------------------SKNVVCPICSSRVTD--NL 45 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCC---------------------------------CCCccCCCchhhhhh--HH
Confidence 378999999 455678888888777553 234778888776553 78
Q ss_pred HHHhhhhCC
Q psy359 359 TTHYRIHTG 367 (405)
Q Consensus 359 ~~H~~~h~~ 367 (405)
..|+..+|+
T Consensus 46 ~~Hl~~~H~ 54 (54)
T PF05605_consen 46 IRHLNSQHR 54 (54)
T ss_pred HHHHHHhcC
Confidence 888776653
No 30
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.72 E-value=1.9e-05 Score=58.70 Aligned_cols=54 Identities=26% Similarity=0.710 Sum_probs=23.3
Q ss_pred ccccccccccCchhHHHHhhhhCC-----------------------CCcccCCccccccCChHHHHHHHHhh-cccc
Q psy359 344 ACTQCDKTFKKKNHLTTHYRIHTG-----------------------EKPYQCDICGRGFAQSNDMKKHRRTV-HKAQ 397 (405)
Q Consensus 344 ~C~~C~~~f~~~~~l~~H~~~h~~-----------------------~~~~~C~~C~~~f~~~~~l~~H~~~~-h~~~ 397 (405)
+|.+|+..|.+...|..|+...|+ ...+.|.+|++.|.+...|..|++.+ |...
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~~H~~~ 78 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSKHHKKR 78 (100)
T ss_dssp ------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTTTTC-
T ss_pred CccccccccccccccccccccccccccccccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCccCCCc
Confidence 466666666666666666654332 12589999999999999999999985 5443
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.61 E-value=3e-05 Score=57.62 Aligned_cols=73 Identities=25% Similarity=0.366 Sum_probs=21.8
Q ss_pred ecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCchhHHH
Q psy359 281 KCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTT 360 (405)
Q Consensus 281 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 360 (405)
.|..|+..|.+...|..|+...|...-+.. .--.....+..++..-. ...+.|.+|++.|.+...|..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~-----------~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~ 68 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQ-----------KYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQE 68 (100)
T ss_dssp ------------------------------------------------------------SSEEBSSSS-EESSHHHHHH
T ss_pred Cccccccccccccccccccccccccccccc-----------ccccccccccccccccc-CCCCCCCccCCCCcCHHHHHH
Confidence 488999999999999999988886541100 00001222333332221 226999999999999999999
Q ss_pred Hhhhh
Q psy359 361 HYRIH 365 (405)
Q Consensus 361 H~~~h 365 (405)
||+.+
T Consensus 69 Hm~~~ 73 (100)
T PF12756_consen 69 HMRSK 73 (100)
T ss_dssp HHHHT
T ss_pred HHcCc
Confidence 99865
No 32
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.60 E-value=2.7e-05 Score=42.22 Aligned_cols=26 Identities=31% Similarity=0.933 Sum_probs=15.4
Q ss_pred cccCCccccccCChHHHHHHHHhhcc
Q psy359 370 PYQCDICGRGFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 370 ~~~C~~C~~~f~~~~~l~~H~~~~h~ 395 (405)
+|+|..|++.|.+...|..|++.|++
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 35566666666666666666655543
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.48 E-value=6.2e-05 Score=39.39 Aligned_cols=24 Identities=38% Similarity=0.897 Sum_probs=20.4
Q ss_pred cccccccccccCchhHHHHhhhhC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
|.|++|++.|.+...|..|+..|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999998875
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.42 E-value=7.3e-05 Score=40.49 Aligned_cols=26 Identities=38% Similarity=0.830 Sum_probs=23.8
Q ss_pred ccccccccccccCchhHHHHhhhhCC
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTG 367 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~ 367 (405)
||+|..|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 58999999999999999999988764
No 35
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.21 E-value=0.00028 Score=43.15 Aligned_cols=42 Identities=26% Similarity=0.454 Sum_probs=26.3
Q ss_pred hHHHHhhhh-CCCCcccCCccccccCChHHHHHHHHhhccccc
Q psy359 357 HLTTHYRIH-TGEKPYQCDICGRGFAQSNDMKKHRRTVHKAQI 398 (405)
Q Consensus 357 ~l~~H~~~h-~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~ 398 (405)
.+..+.+.+ ..+.|..|++|+..+.+..+|+.|+...|+.+|
T Consensus 10 ~~~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 10 PSTKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp S-----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhhhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 345555444 356789999999999999999999999999886
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.11 E-value=0.00022 Score=61.57 Aligned_cols=72 Identities=26% Similarity=0.460 Sum_probs=48.4
Q ss_pred cccceecCC--CcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCcccccccccccc
Q psy359 276 RKLIFKCDL--CGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFK 353 (405)
Q Consensus 276 ~~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 353 (405)
++++|+|++ |.+++.....|+-|+.-.|... ......-| ..|..--...|||+|.+|+|.|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~--~~~~~p~p--------------~~~~~F~~~~KPYrCevC~KRYK 409 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQ--KLHENPSP--------------EKMNIFSAKDKPYRCEVCDKRYK 409 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCc--ccCCCCCc--------------cccccccccCCceeccccchhhc
Confidence 358999986 9999999999999998555221 11100001 11222234568899999999999
Q ss_pred CchhHHHHhh
Q psy359 354 KKNHLTTHYR 363 (405)
Q Consensus 354 ~~~~l~~H~~ 363 (405)
+...|.-|..
T Consensus 410 NlNGLKYHr~ 419 (423)
T COG5189 410 NLNGLKYHRK 419 (423)
T ss_pred cCccceeccc
Confidence 8888887753
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.10 E-value=0.00014 Score=62.73 Aligned_cols=53 Identities=36% Similarity=0.860 Sum_probs=45.6
Q ss_pred CCCcccccc--ccccccCchhHHHHhh-hhC------------------CCCcccCCccccccCChHHHHHHHH
Q psy359 339 PLRPYACTQ--CDKTFKKKNHLTTHYR-IHT------------------GEKPYQCDICGRGFAQSNDMKKHRR 391 (405)
Q Consensus 339 ~~~~~~C~~--C~~~f~~~~~l~~H~~-~h~------------------~~~~~~C~~C~~~f~~~~~l~~H~~ 391 (405)
++|||+|++ |.|+|++...|.-|+. .|. ..+||+|++|+|.+.+..-|..|+.
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 359999998 9999999999999986 441 2479999999999999999988854
No 38
>KOG2231|consensus
Probab=96.98 E-value=0.0018 Score=62.94 Aligned_cols=81 Identities=21% Similarity=0.387 Sum_probs=43.8
Q ss_pred ccccchHhhhChHHHHHHHhhhCCCCcccccCCCC---------cchHHhhhhhcccCcchhhhhhhcccceecCCCccc
Q psy359 218 ACYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTT---------PLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNI 288 (405)
Q Consensus 218 ~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~---------~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~ 288 (405)
.|..| ..|.+...|++|+..-|+...|..|-... -+...|+.|++..-++ +.+....-.|..|...
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d----~~s~rGhp~C~~C~~~ 191 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPD----DESCRGHPLCKFCHER 191 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeeeeeehehHHHHHHHHhcCCCc----cccccCCccchhhhhh
Confidence 34455 45555555555555555555544442221 1233455555444331 0111113468889999
Q ss_pred CCChHHHHHHHhhhc
Q psy359 289 LSSKHILQEHVRVVH 303 (405)
Q Consensus 289 f~~~~~l~~H~~~~h 303 (405)
|-....|..|++..|
T Consensus 192 fld~~el~rH~~~~h 206 (669)
T KOG2231|consen 192 FLDDDELYRHLRFDH 206 (669)
T ss_pred hccHHHHHHhhccce
Confidence 999999998888655
No 39
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.93 E-value=0.00071 Score=35.35 Aligned_cols=24 Identities=25% Similarity=0.679 Sum_probs=16.2
Q ss_pred ccCCccccccCChHHHHHHHHhhcc
Q psy359 371 YQCDICGRGFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~~h~ 395 (405)
|+|+.|+.... ...|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777777776 7777777777764
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.75 E-value=0.0012 Score=34.93 Aligned_cols=23 Identities=39% Similarity=0.856 Sum_probs=18.2
Q ss_pred ccCCccccccCChHHHHHHHHhh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRTV 393 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~~ 393 (405)
|+|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 57888888888888888888754
No 41
>KOG2785|consensus
Probab=96.63 E-value=0.0066 Score=54.39 Aligned_cols=54 Identities=28% Similarity=0.464 Sum_probs=46.1
Q ss_pred CCCccccccccccccCchhHHHHhhhhCCC-----------------------CcccCCccc---cccCChHHHHHHHHh
Q psy359 339 PLRPYACTQCDKTFKKKNHLTTHYRIHTGE-----------------------KPYQCDICG---RGFAQSNDMKKHRRT 392 (405)
Q Consensus 339 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~~ 392 (405)
...|-.|.+|++.|.+...-..||..+||- ..+-|-.|+ +.|.+..+.+.||..
T Consensus 163 e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 163 ELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred ccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 345688999999999999999999888862 148899999 999999999999987
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.60 E-value=0.0015 Score=34.61 Aligned_cols=24 Identities=46% Similarity=0.905 Sum_probs=21.8
Q ss_pred cccccccccccCchhHHHHhhhhC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
|+|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 689999999999999999998765
No 43
>KOG2231|consensus
Probab=96.50 E-value=0.0047 Score=60.17 Aligned_cols=101 Identities=24% Similarity=0.426 Sum_probs=78.0
Q ss_pred ecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccc------------hHHHHhhhhhcCC-CC---c-c
Q psy359 281 KCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEY------------KKQLLQHKRLHFP-LR---P-Y 343 (405)
Q Consensus 281 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~------------~~~l~~H~~~h~~-~~---~-~ 343 (405)
.|..| ..|.+-..|+.|+...|.. +.|..|... ...|.+|+..-.. ++ . -
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~------------~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp 183 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKL------------HLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHP 183 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhh------------hccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCc
Confidence 57788 7788999999999877766 567777542 7788888765543 22 2 5
Q ss_pred ccccccccccCchhHHHHhhhhCCCCcccCCcc------ccccCChHHHHHHHHhhccccc
Q psy359 344 ACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDIC------GRGFAQSNDMKKHRRTVHKAQI 398 (405)
Q Consensus 344 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~l~~H~~~~h~~~~ 398 (405)
.|..|...|.....|..|++.++ |.|-+| +..|..-..|..|.+..|-.-.
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~HflCE 240 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHFLCE 240 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcCcccc
Confidence 89999999999999999988766 566666 5677888999999999886543
No 44
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.47 E-value=0.0014 Score=34.52 Aligned_cols=22 Identities=36% Similarity=1.035 Sum_probs=15.4
Q ss_pred ccCCccccccCChHHHHHHHHh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
|.|.+|++.|.+...|+.|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4577777777777777777655
No 45
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.45 E-value=0.0054 Score=53.82 Aligned_cols=82 Identities=23% Similarity=0.306 Sum_probs=58.2
Q ss_pred ccccc--chHhhhChHHHHHHHhhhCCCCcccccCCC---Cc------chHHhhhhhcccCcchhhhhhhcccceecCCC
Q psy359 217 YACYL--CDKRFAQISDRIKHLKSSHNFDFESVKNKT---TP------LTRYLNKHLHDAHPEAIKTERERKLIFKCDLC 285 (405)
Q Consensus 217 ~~C~~--C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~---~~------~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C 285 (405)
|.|+. |.........|..|.+..|+.-.|.+|-.. |. ....|++|......++ .-+..=.|..|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~-----GFKGHP~C~FC 226 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEE-----GFKGHPLCIFC 226 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccccCCcccc-----CcCCCchhhhc
Confidence 78864 777777778899999999998888888433 22 2456777764333221 11222359999
Q ss_pred cccCCChHHHHHHHhhhc
Q psy359 286 GNILSSKHILQEHVRVVH 303 (405)
Q Consensus 286 ~~~f~~~~~l~~H~~~~h 303 (405)
...|-...+|..|++..|
T Consensus 227 ~~~FYdDDEL~~HcR~~H 244 (493)
T COG5236 227 KIYFYDDDELRRHCRLRH 244 (493)
T ss_pred cceecChHHHHHHHHhhh
Confidence 999999999999999766
No 46
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.18 E-value=0.0042 Score=32.36 Aligned_cols=24 Identities=29% Similarity=0.580 Sum_probs=19.6
Q ss_pred cccCCCccccCChhhHHHHHHhhcC
Q psy359 129 YPCDQCDRTYQTKKSLYVHRRAHLG 153 (405)
Q Consensus 129 ~~C~~C~~~f~~~~~l~~H~~~h~~ 153 (405)
|+|+.|++... ...|..|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 68999999998 9999999998764
No 47
>PRK04860 hypothetical protein; Provisional
Probab=96.14 E-value=0.0025 Score=51.10 Aligned_cols=38 Identities=32% Similarity=0.820 Sum_probs=28.5
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCccccccCCh
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQS 383 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 383 (405)
+|.|. |+. ....+..|.++|.++++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 57887 776 666677888888888888888888777644
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.03 E-value=0.0034 Score=33.07 Aligned_cols=23 Identities=35% Similarity=0.814 Sum_probs=21.0
Q ss_pred cccccccccccCchhHHHHhhhh
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIH 365 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h 365 (405)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67999999999999999999764
No 49
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.79 E-value=0.0086 Score=52.61 Aligned_cols=88 Identities=24% Similarity=0.408 Sum_probs=52.3
Q ss_pred ccccCC--CccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHHHH
Q psy359 128 TYPCDQ--CDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLHLK 205 (405)
Q Consensus 128 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~ 205 (405)
.|.|+. |..++.....|..|.+..|+ .+.|.+|-. ..+.|. |.-..-++..|+
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--------~~~C~~C~~--------------nKk~F~---~E~~lF~~~~Lr 205 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--------FVLCSECIG--------------NKKDFW---NEIRLFRSSTLR 205 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--------cEEhHhhhc--------------CcccCc---cceeeeeccccc
Confidence 477865 78888888888888887665 234444422 011122 222233345566
Q ss_pred HHHhhhcCCCC----cccccchHhhhChHHHHHHHhhhC
Q psy359 206 VHTRTHTGEKP----YACYLCDKRFAQISDRIKHLKSSH 240 (405)
Q Consensus 206 ~H~~~h~~~~~----~~C~~C~~~f~~~~~l~~H~~~~h 240 (405)
.|...-..+.. =.|..|...|-+-..|..|++..|
T Consensus 206 ~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H 244 (493)
T COG5236 206 DHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH 244 (493)
T ss_pred ccccCCccccCcCCCchhhhccceecChHHHHHHHHhhh
Confidence 66544222211 268888888888888888888755
No 50
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.70 E-value=0.012 Score=36.21 Aligned_cols=31 Identities=23% Similarity=0.483 Sum_probs=17.3
Q ss_pred CCCccCccchhccCChHHHHHHHhhhcCCCC
Q psy359 186 ERKFPCTFCEKSFKRKLHLKVHTRTHTGEKP 216 (405)
Q Consensus 186 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 216 (405)
+.|-.|++|+..+....+|++|+.++|+.+|
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 3466777777777777777777777666543
No 51
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.40 E-value=0.0037 Score=33.65 Aligned_cols=22 Identities=32% Similarity=0.883 Sum_probs=14.2
Q ss_pred ccCCccccccCChHHHHHHHHh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
|-|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666554
No 52
>PRK04860 hypothetical protein; Provisional
Probab=95.38 E-value=0.0061 Score=48.89 Aligned_cols=38 Identities=26% Similarity=0.488 Sum_probs=28.3
Q ss_pred CcCCCcccchHHHHhhhhhcCCCCccccccccccccCch
Q psy359 318 GVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKN 356 (405)
Q Consensus 318 ~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 356 (405)
|.|. |+.....+.+|.++|+++++|+|..|+..|....
T Consensus 120 Y~C~-C~~~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 120 YRCK-CQEHQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEcC-CCCeeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 4444 4444445667888899999999999999987654
No 53
>KOG2482|consensus
Probab=95.37 E-value=0.03 Score=49.27 Aligned_cols=147 Identities=22% Similarity=0.397 Sum_probs=86.7
Q ss_pred ccccCCCcccc-CChhhHHHHHHhhcCcccccCCCCCcCCCchHhhh-cHHHHHHhhccCCCCccCccchhccCChHHHH
Q psy359 128 TYPCDQCDRTY-QTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVT-NLAAHHNEVHAHERKFPCTFCEKSFKRKLHLK 205 (405)
Q Consensus 128 ~~~C~~C~~~f-~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~-~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~ 205 (405)
...|-.|...+ -+++.+..|+-..|+.+.-.+.. .++. .+..+..+. -..+.|-.|.+.|+.+..|+
T Consensus 144 slqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDn--------iVyvnelLehLkek---L~r~~CLyCekifrdkntLk 212 (423)
T KOG2482|consen 144 SLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDN--------IVYVNELLEHLKEK---LERLRCLYCEKIFRDKNTLK 212 (423)
T ss_pred eeEEEEecchhcccHHHHHHHHHHHhhhccCCCcc--------eeeHHHHHHHHHHH---HhhheeeeeccccCCcHHHH
Confidence 35688887776 56677777876655532211111 1111 222221111 12378888999999988899
Q ss_pred HHHhh--hcCC----------------------------------------------------CC--cccccchHhhhCh
Q psy359 206 VHTRT--HTGE----------------------------------------------------KP--YACYLCDKRFAQI 229 (405)
Q Consensus 206 ~H~~~--h~~~----------------------------------------------------~~--~~C~~C~~~f~~~ 229 (405)
.||+. |..- .+ ..|.-|.....+.
T Consensus 213 eHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~ 292 (423)
T KOG2482|consen 213 EHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENP 292 (423)
T ss_pred HHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhH
Confidence 88754 3210 11 4788888888888
Q ss_pred HHHHHHHhhhCCCCcccccCC---CC----cchHHhhhhhcccCcchhhhhhhcccceecCCCcccCCChHHHHHHHh
Q psy359 230 SDRIKHLKSSHNFDFESVKNK---TT----PLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNILSSKHILQEHVR 300 (405)
Q Consensus 230 ~~l~~H~~~~h~~~~c~~c~~---~~----~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 300 (405)
..|..||+..|..+.-.+-.. .| +.-.+++. ......|-.|.-.|-.+..|..|+.
T Consensus 293 ~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRk---------------q~~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 293 VFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRK---------------QKKKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred HHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHH---------------HhhccccccccccccCcchhhhhcc
Confidence 888888888886553211111 00 00011111 1234578999999999999999986
No 54
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=95.24 E-value=0.017 Score=30.39 Aligned_cols=21 Identities=38% Similarity=0.966 Sum_probs=14.8
Q ss_pred ccCCccccccCChHHHHHHHHh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
.+|+.||+.| .+..|..|+..
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4677888887 66777777653
No 55
>KOG4173|consensus
Probab=95.04 E-value=0.018 Score=46.77 Aligned_cols=80 Identities=24% Similarity=0.567 Sum_probs=63.2
Q ss_pred cccccCC--CccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHHH
Q psy359 127 LTYPCDQ--CDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLHL 204 (405)
Q Consensus 127 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l 204 (405)
..|.|.+ |...|.+...+..|-.+-|+ -.|..|.+.|++...|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~-----------------------------------~sCs~C~r~~Pt~hLL 122 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG-----------------------------------NSCSFCKRAFPTGHLL 122 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc-----------------------------------chhHHHHHhCCchhhh
Confidence 4477865 88888888888777766555 3799999999999999
Q ss_pred HHHHhhhc----------CCCCcccc--cchHhhhChHHHHHHHhhhCC
Q psy359 205 KVHTRTHT----------GEKPYACY--LCDKRFAQISDRIKHLKSSHN 241 (405)
Q Consensus 205 ~~H~~~h~----------~~~~~~C~--~C~~~f~~~~~l~~H~~~~h~ 241 (405)
..|+...| |.-+|+|. .|+..|.+...-+.|+-..|.
T Consensus 123 d~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 123 DAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 99975543 45579996 499999999999999987775
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.92 E-value=0.013 Score=31.53 Aligned_cols=21 Identities=38% Similarity=0.888 Sum_probs=19.6
Q ss_pred eecCCCcccCCChHHHHHHHh
Q psy359 280 FKCDLCGNILSSKHILQEHVR 300 (405)
Q Consensus 280 ~~C~~C~~~f~~~~~l~~H~~ 300 (405)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 789999999999999999987
No 57
>KOG2482|consensus
Probab=94.78 E-value=0.22 Score=44.08 Aligned_cols=107 Identities=24% Similarity=0.350 Sum_probs=71.6
Q ss_pred cccceecCCCcccCC-ChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccC
Q psy359 276 RKLIFKCDLCGNILS-SKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKK 354 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~-~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 354 (405)
..+...|-.|+..+. .++.+..|+-..|.-. ..=++.-..-..|..|+..-.. .++|-+|.+.|..
T Consensus 141 t~fslqClFCn~e~lgnRs~~l~Hlf~~H~ln-----------iGlpDniVyvnelLehLkekL~--r~~CLyCekifrd 207 (423)
T KOG2482|consen 141 TIFSLQCLFCNNEGLGNRSEILEHLFHVHGLN-----------IGLPDNIVYVNELLEHLKEKLE--RLRCLYCEKIFRD 207 (423)
T ss_pred CeeeeEEEEecchhcccHHHHHHHHHHHhhhc-----------cCCCcceeeHHHHHHHHHHHHh--hheeeeeccccCC
Confidence 345567888887654 4677777776555432 1111111223445555544333 3899999999999
Q ss_pred chhHHHHhhh--hCCCC----------------------------------------------------c--ccCCcccc
Q psy359 355 KNHLTTHYRI--HTGEK----------------------------------------------------P--YQCDICGR 378 (405)
Q Consensus 355 ~~~l~~H~~~--h~~~~----------------------------------------------------~--~~C~~C~~ 378 (405)
+..|..||+. |-... + ..|-+|..
T Consensus 208 kntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~ 287 (423)
T KOG2482|consen 208 KNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTN 287 (423)
T ss_pred cHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeecc
Confidence 9999999964 32110 2 58999999
Q ss_pred ccCChHHHHHHHHhhcc
Q psy359 379 GFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 379 ~f~~~~~l~~H~~~~h~ 395 (405)
..-++..|-.|+..-|-
T Consensus 288 ~~en~~~l~eHmk~vHe 304 (423)
T KOG2482|consen 288 FYENPVFLFEHMKIVHE 304 (423)
T ss_pred chhhHHHHHHHHHHHHH
Confidence 88889999999998774
No 58
>KOG2893|consensus
Probab=94.62 E-value=0.0074 Score=50.11 Aligned_cols=46 Identities=37% Similarity=0.694 Sum_probs=37.4
Q ss_pred cccchHhhhChHHHHHHHhhhCCCCcccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecCCCcccCCChHHHHHH
Q psy359 219 CYLCDKRFAQISDRIKHLKSSHNFDFESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNILSSKHILQEH 298 (405)
Q Consensus 219 C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H 298 (405)
|.+|++.|..+.-|.+|++..| |+|.+|-+.+-+-..|..|
T Consensus 13 cwycnrefddekiliqhqkakh---------------------------------------fkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH---------------------------------------FKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhhcc---------------------------------------ceeeeehhhhccCCCceee
Confidence 8899999999999999998754 8888888887776677777
Q ss_pred Hhhhc
Q psy359 299 VRVVH 303 (405)
Q Consensus 299 ~~~~h 303 (405)
...+|
T Consensus 54 cmqvh 58 (341)
T KOG2893|consen 54 CMQVH 58 (341)
T ss_pred hhhhh
Confidence 66555
No 59
>KOG1146|consensus
Probab=94.51 E-value=0.018 Score=59.77 Aligned_cols=107 Identities=21% Similarity=0.440 Sum_probs=77.8
Q ss_pred cccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhc------CCCCcccccccc
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLH------FPLRPYACTQCD 349 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h------~~~~~~~C~~C~ 349 (405)
..+.|+|+.|+..|.....|..|+|..|..... . .|- ..+.|.+.- -+.++|.|..|.
T Consensus 462 ~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~-~-----------~c~----~gq~~~~~arg~~~~~~~~p~~C~~C~ 525 (1406)
T KOG1146|consen 462 FFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS-A-----------YCK----AGQNHPRLARGEVYRCPGKPYPCRACN 525 (1406)
T ss_pred ccccccCCccchhhhhHHHhhhcccccccccch-h-----------HhH----hccccccccccccccCCCCcccceeee
Confidence 457899999999999999999999987765422 1 110 111121111 235789999999
Q ss_pred ccccCchhHHHHhhh--hC----------C-------------------------------CCcccCCccccccCChHHH
Q psy359 350 KTFKKKNHLTTHYRI--HT----------G-------------------------------EKPYQCDICGRGFAQSNDM 386 (405)
Q Consensus 350 ~~f~~~~~l~~H~~~--h~----------~-------------------------------~~~~~C~~C~~~f~~~~~l 386 (405)
..++....|..|+.. |- | +-.+.|.+|++.-+-..+|
T Consensus 526 ~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnl 605 (1406)
T KOG1146|consen 526 YSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNL 605 (1406)
T ss_pred eeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhcc
Confidence 999999999999864 20 0 0148999999999999999
Q ss_pred HHHHHhhccccc
Q psy359 387 KKHRRTVHKAQI 398 (405)
Q Consensus 387 ~~H~~~~h~~~~ 398 (405)
+-|+...+..++
T Consensus 606 rihmtss~~s~~ 617 (1406)
T KOG1146|consen 606 RIHMTASPSSSP 617 (1406)
T ss_pred ccccccCCCCCC
Confidence 999988666554
No 60
>KOG2785|consensus
Probab=93.26 E-value=0.16 Score=45.91 Aligned_cols=73 Identities=19% Similarity=0.314 Sum_probs=44.9
Q ss_pred cccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccch---hccCChHHHH
Q psy359 129 YPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCE---KSFKRKLHLK 205 (405)
Q Consensus 129 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~---~~f~~~~~l~ 205 (405)
-.|=.|+..+.+...-+.||-.+|+-..+ ...|.-. ...|.......- ..-+.|-.|+ +.|.+-...+
T Consensus 167 t~CLfC~~~~k~~e~~~~HM~~~HgffIP--dreYL~D-----~~GLl~YLgeKV--~~~~~CL~CN~~~~~f~sleavr 237 (390)
T KOG2785|consen 167 TDCLFCDKKSKSLEENLKHMFKEHGFFIP--DREYLTD-----EKGLLKYLGEKV--GIGFICLFCNELGRPFSSLEAVR 237 (390)
T ss_pred cceeecCCCcccHHHHHHHHhhccCCcCC--chHhhhc-----hhHHHHHHHHHh--ccCceEEEeccccCcccccHHHH
Confidence 46888999999999999999999983221 1111000 001222211111 1247888888 7888888888
Q ss_pred HHHhh
Q psy359 206 VHTRT 210 (405)
Q Consensus 206 ~H~~~ 210 (405)
.||..
T Consensus 238 ~HM~~ 242 (390)
T KOG2785|consen 238 AHMRD 242 (390)
T ss_pred HHHhh
Confidence 88755
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.07 E-value=0.065 Score=30.71 Aligned_cols=23 Identities=17% Similarity=0.574 Sum_probs=17.4
Q ss_pred cccCCccccccCChHHHHHHHHh
Q psy359 370 PYQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 370 ~~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
+|.|.+|++.|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46788888888877777777765
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.99 E-value=0.088 Score=27.61 Aligned_cols=20 Identities=25% Similarity=0.723 Sum_probs=16.7
Q ss_pred eecCCCcccCCChHHHHHHHh
Q psy359 280 FKCDLCGNILSSKHILQEHVR 300 (405)
Q Consensus 280 ~~C~~C~~~f~~~~~l~~H~~ 300 (405)
..|+.||+.| ..+.|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 5799999999 6778888876
No 63
>KOG4173|consensus
Probab=92.73 E-value=0.055 Score=44.09 Aligned_cols=51 Identities=33% Similarity=0.653 Sum_probs=26.6
Q ss_pred cccccccccCchhHHHHhhhhC----------CCCcccCCc--cccccCChHHHHHHHHhhcc
Q psy359 345 CTQCDKTFKKKNHLTTHYRIHT----------GEKPYQCDI--CGRGFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 345 C~~C~~~f~~~~~l~~H~~~h~----------~~~~~~C~~--C~~~f~~~~~l~~H~~~~h~ 395 (405)
|..|.+.|.+.--|..|+..-| |...|+|-+ |+-.|.+...-..|+...|+
T Consensus 109 Cs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 109 CSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 5555555555555555554322 233455544 55556555555555555554
No 64
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.97 E-value=0.13 Score=29.35 Aligned_cols=23 Identities=22% Similarity=0.543 Sum_probs=20.8
Q ss_pred ceecCCCcccCCChHHHHHHHhh
Q psy359 279 IFKCDLCGNILSSKHILQEHVRV 301 (405)
Q Consensus 279 ~~~C~~C~~~f~~~~~l~~H~~~ 301 (405)
+|.|++|+..|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999873
No 65
>KOG1146|consensus
Probab=91.96 E-value=0.046 Score=56.97 Aligned_cols=106 Identities=18% Similarity=0.252 Sum_probs=65.2
Q ss_pred ccccCCCCcchHHhhhhhcccCcchhhhhhhcccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcc
Q psy359 245 ESVKNKTTPLTRYLNKHLHDAHPEAIKTERERKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCG 324 (405)
Q Consensus 245 c~~c~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~ 324 (405)
+..+..+|........+-.....+ ....-.+.|..|++.|...-.+. |+..
T Consensus 1231 ~~~~e~~f~~~~~~~~~a~~~~~~-----~~~sGe~~c~~~~~~~~~~~~~~-~l~~----------------------- 1281 (1406)
T KOG1146|consen 1231 PNALEQPFPQEPEPTATAPPKPPE-----LPASGEGECGAVDELLTPSFGIS-TLDV----------------------- 1281 (1406)
T ss_pred HHhhhcCccCcccccccCCCCCCc-----CcCCCcchhhhccccccCcccee-eccc-----------------------
Confidence 777777777665555443222211 11333577888888776665444 4432
Q ss_pred cchHHHHhhhhhcCCCCccccccccccccCchhHHHHhhh-----------------hCCCCcccCCccccccCChHHHH
Q psy359 325 EYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRI-----------------HTGEKPYQCDICGRGFAQSNDMK 387 (405)
Q Consensus 325 ~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~-----------------h~~~~~~~C~~C~~~f~~~~~l~ 387 (405)
..+|.|..|...|.....|..|.+. +-...+| |..|...|.....|+
T Consensus 1282 ---------------~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alq 1345 (1406)
T KOG1146|consen 1282 ---------------THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQ 1345 (1406)
T ss_pred ---------------chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHH
Confidence 2336666666666666666666532 2233456 999999999999999
Q ss_pred HHHHhhcc
Q psy359 388 KHRRTVHK 395 (405)
Q Consensus 388 ~H~~~~h~ 395 (405)
.|+++-+.
T Consensus 1346 ihm~~~~~ 1353 (1406)
T KOG1146|consen 1346 IHMRSSAH 1353 (1406)
T ss_pred HHHHHhhh
Confidence 99887443
No 66
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=91.37 E-value=0.3 Score=36.66 Aligned_cols=26 Identities=35% Similarity=0.626 Sum_probs=24.4
Q ss_pred cccC----CccccccCChHHHHHHHHhhcc
Q psy359 370 PYQC----DICGRGFAQSNDMKKHRRTVHK 395 (405)
Q Consensus 370 ~~~C----~~C~~~f~~~~~l~~H~~~~h~ 395 (405)
.|.| ..|++.+.+...++.|++.+||
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 4999 9999999999999999999997
No 67
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=91.07 E-value=0.22 Score=36.98 Aligned_cols=26 Identities=15% Similarity=0.369 Sum_probs=19.9
Q ss_pred cccceecCCCcccCCChHHHHHHHhh
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRV 301 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 301 (405)
....|.|+.|...|-..-.+..|...
T Consensus 78 ~~~~y~C~~C~~~FC~dCD~fiHe~L 103 (112)
T TIGR00622 78 DSHRYVCAVCKNVFCVDCDVFVHESL 103 (112)
T ss_pred cccceeCCCCCCccccccchhhhhhc
Confidence 34579999999999888777777663
No 68
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=90.41 E-value=0.46 Score=35.30 Aligned_cols=88 Identities=18% Similarity=0.277 Sum_probs=44.0
Q ss_pred ccceecCCCcccCCChHHHHHHHhhhccCCCccccc-c---CCCCCcCCCcccchHHHHhh-hhhcCCCCcccccccccc
Q psy359 277 KLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHY-E---YKPDGVCDVCGEYKKQLLQH-KRLHFPLRPYACTQCDKT 351 (405)
Q Consensus 277 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~-~---~~~~~~C~~C~~~~~~l~~H-~~~h~~~~~~~C~~C~~~ 351 (405)
+.|..|++|+-+..+...|.+-. +|+..-+.+.. . ......|-.|.......... ...-.....|+|+.|+..
T Consensus 13 ~LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~~~~~~~~y~C~~C~~~ 90 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFDELKDSHRYVCAVCKNV 90 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHhh--hccCCCcccccccccccCCCCcccCcCCCCCCcccccccccccccceeCCCCCCc
Confidence 34678888888888887777654 34433222210 0 01111244444420000000 000122335777777777
Q ss_pred ccCchhHHHHhhhhC
Q psy359 352 FKKKNHLTTHYRIHT 366 (405)
Q Consensus 352 f~~~~~l~~H~~~h~ 366 (405)
|-..-+...|...|.
T Consensus 91 FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 91 FCVDCDVFVHESLHC 105 (112)
T ss_pred cccccchhhhhhccC
Confidence 777777777755544
No 69
>KOG2893|consensus
Probab=89.35 E-value=0.1 Score=43.57 Aligned_cols=48 Identities=21% Similarity=0.494 Sum_probs=36.3
Q ss_pred cCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhhhCC
Q psy359 190 PCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKSSHN 241 (405)
Q Consensus 190 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 241 (405)
.|=.|++.|.+..-|.+|++..| |+|.+|.+.+.+-.-|..|....|.
T Consensus 12 wcwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhh
Confidence 46678888888888888876644 8888888888777777777766554
No 70
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=89.33 E-value=0.22 Score=28.16 Aligned_cols=24 Identities=29% Similarity=1.036 Sum_probs=14.0
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
|+|..||..+.... .++.|++|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 56666666554432 5566766664
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.43 E-value=0.42 Score=29.96 Aligned_cols=31 Identities=29% Similarity=0.575 Sum_probs=24.4
Q ss_pred hcCCCCcccccchHhhhChHHHHHHHhhhCC
Q psy359 211 HTGEKPYACYLCDKRFAQISDRIKHLKSSHN 241 (405)
Q Consensus 211 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 241 (405)
..|+..++|+.|+..|....++.+|+...|+
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 3466678888888888888888888877665
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=86.65 E-value=0.26 Score=30.90 Aligned_cols=30 Identities=13% Similarity=0.318 Sum_probs=23.0
Q ss_pred cCCCCccccccccccccCchhHHHHhhhhC
Q psy359 337 HFPLRPYACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 337 h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
-.+|..++|+.|+..|...-++..|...-|
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 456777888888888888888888875443
No 73
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=86.25 E-value=0.32 Score=36.20 Aligned_cols=13 Identities=31% Similarity=0.851 Sum_probs=6.4
Q ss_pred CCcccccchHhhh
Q psy359 215 KPYACYLCDKRFA 227 (405)
Q Consensus 215 ~~~~C~~C~~~f~ 227 (405)
.|..|+.||..|.
T Consensus 25 ~PivCP~CG~~~~ 37 (108)
T PF09538_consen 25 DPIVCPKCGTEFP 37 (108)
T ss_pred CCccCCCCCCccC
Confidence 3444555555554
No 74
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=86.15 E-value=0.18 Score=48.31 Aligned_cols=61 Identities=31% Similarity=0.660 Sum_probs=53.3
Q ss_pred CccccccccccccCchhHHHHhh--hhCCC--CcccCC--ccccccCChHHHHHHHHhhcccccccc
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYR--IHTGE--KPYQCD--ICGRGFAQSNDMKKHRRTVHKAQIHAV 401 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~~ 401 (405)
.++.|..|...|.....|..|.+ .|.++ +++.|+ +|++.|.....|..|...|.+..+..+
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKE 354 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCcccc
Confidence 46889999999999999999998 79888 899999 799999999999999999888876544
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=84.79 E-value=1 Score=33.79 Aligned_cols=84 Identities=18% Similarity=0.286 Sum_probs=48.7
Q ss_pred ccceecCCCcccCCChHHHHHHHhhhccCCCcccc----------c--cCCCC-CcCCCcccchHHHHhhhhhcCCCCcc
Q psy359 277 KLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYH----------Y--EYKPD-GVCDVCGEYKKQLLQHKRLHFPLRPY 343 (405)
Q Consensus 277 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~----------~--~~~~~-~~C~~C~~~~~~l~~H~~~h~~~~~~ 343 (405)
-+...|..|+..... +++..|++..|........ . ...+. ..=+. ........+.++ ..|
T Consensus 9 ~~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~---~~~~Pi~gLp~~---~G~ 81 (109)
T PF12013_consen 9 YRVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPP---DPSPPIPGLPVY---DGY 81 (109)
T ss_pred CCEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCC---CCCCcCCCCCCC---CCe
Confidence 346899999987766 8999999966654421100 0 00000 00000 000000011111 239
Q ss_pred cc----ccccccccCchhHHHHhhhhCC
Q psy359 344 AC----TQCDKTFKKKNHLTTHYRIHTG 367 (405)
Q Consensus 344 ~C----~~C~~~f~~~~~l~~H~~~h~~ 367 (405)
.| ..|+..+.+...+..|++.+||
T Consensus 82 ~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 82 RCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 99 9999999999999999998875
No 76
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=83.75 E-value=0.25 Score=47.26 Aligned_cols=57 Identities=32% Similarity=0.585 Sum_probs=50.4
Q ss_pred CCccCccchhccCChHHHHHHHh--hhcCC--CCcccc--cchHhhhChHHHHHHHhhhCCCC
Q psy359 187 RKFPCTFCEKSFKRKLHLKVHTR--THTGE--KPYACY--LCDKRFAQISDRIKHLKSSHNFD 243 (405)
Q Consensus 187 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~ 243 (405)
.++.|..|...|.....|.+|.. .|.++ +++.|+ .|++.|.....+..|...+.+..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSIS 350 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCC
Confidence 36899999999999999999999 79999 999999 79999999999999998866543
No 77
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=82.48 E-value=0.55 Score=39.64 Aligned_cols=30 Identities=27% Similarity=0.613 Sum_probs=20.2
Q ss_pred CCcccCCccccccCChHHHHHHHHhhcccc
Q psy359 368 EKPYQCDICGRGFAQSNDMKKHRRTVHKAQ 397 (405)
Q Consensus 368 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 397 (405)
+..|.|.+|+|.|.-..-.++|+...|.+.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 445777778888888888888888877765
No 78
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=82.35 E-value=0.34 Score=38.62 Aligned_cols=15 Identities=27% Similarity=0.627 Sum_probs=8.2
Q ss_pred cccccccccccCchh
Q psy359 343 YACTQCDKTFKKKNH 357 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~ 357 (405)
++|+-||+.|.....
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 555566655555433
No 79
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=80.70 E-value=1.3 Score=26.97 Aligned_cols=27 Identities=30% Similarity=0.688 Sum_probs=13.6
Q ss_pred CCcccCCccccccCCh----HHHHHHHHhhc
Q psy359 368 EKPYQCDICGRGFAQS----NDMKKHRRTVH 394 (405)
Q Consensus 368 ~~~~~C~~C~~~f~~~----~~l~~H~~~~h 394 (405)
....+|.+|++.+... +.|..|++..|
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 3445666666666553 66777775544
No 80
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=80.42 E-value=1.3 Score=28.46 Aligned_cols=13 Identities=15% Similarity=0.644 Sum_probs=7.7
Q ss_pred cccccccccccCc
Q psy359 343 YACTQCDKTFKKK 355 (405)
Q Consensus 343 ~~C~~C~~~f~~~ 355 (405)
|.|+.||..--..
T Consensus 28 F~CPnCGe~~I~R 40 (61)
T COG2888 28 FPCPNCGEVEIYR 40 (61)
T ss_pred eeCCCCCceeeeh
Confidence 6777777544443
No 81
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=79.13 E-value=1 Score=23.85 Aligned_cols=8 Identities=50% Similarity=1.331 Sum_probs=3.8
Q ss_pred CCcccccc
Q psy359 373 CDICGRGF 380 (405)
Q Consensus 373 C~~C~~~f 380 (405)
|+.||..|
T Consensus 17 Cp~CG~~F 24 (26)
T PF10571_consen 17 CPHCGYDF 24 (26)
T ss_pred CCCCCCCC
Confidence 44444444
No 82
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=78.54 E-value=1.1 Score=26.11 Aligned_cols=33 Identities=18% Similarity=0.573 Sum_probs=17.8
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
++|+.|+..|.-...... .+.....|+.|+..|
T Consensus 3 ~~CP~C~~~~~v~~~~~~-----~~~~~v~C~~C~~~~ 35 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG-----ANGGKVRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEEEeCHHHcC-----CCCCEEECCCCCCEE
Confidence 466777766665543321 111235677777655
No 83
>KOG2186|consensus
Probab=77.61 E-value=1.1 Score=38.09 Aligned_cols=47 Identities=28% Similarity=0.574 Sum_probs=30.9
Q ss_pred ccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhChHHHHHHHhh
Q psy359 189 FPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQISDRIKHLKS 238 (405)
Q Consensus 189 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 238 (405)
|.|..||.... +..+.+|+-..++ .-|.|..|+++|-. .++..|.+-
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kC 50 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKC 50 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhhh
Confidence 56666766554 3456667766666 45778888888775 667777653
No 84
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.92 E-value=1.4 Score=25.10 Aligned_cols=24 Identities=33% Similarity=0.920 Sum_probs=17.4
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
|+|..||..+... +-|..|++|+.
T Consensus 3 ~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 3 WVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred EECCCCCCEeECC------------cCCCcCcCCCC
Confidence 7888888776543 24568888885
No 85
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=76.57 E-value=1.7 Score=43.97 Aligned_cols=8 Identities=38% Similarity=0.879 Sum_probs=4.3
Q ss_pred CcCCCccc
Q psy359 318 GVCDVCGE 325 (405)
Q Consensus 318 ~~C~~C~~ 325 (405)
..|..||.
T Consensus 436 l~C~~Cg~ 443 (730)
T COG1198 436 LLCRDCGY 443 (730)
T ss_pred eecccCCC
Confidence 34666654
No 86
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=76.03 E-value=1 Score=38.40 Aligned_cols=90 Identities=27% Similarity=0.553 Sum_probs=53.4
Q ss_pred cccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccCc
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKK 355 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 355 (405)
+.+.|+|..|...+-....+ .|+..-.... -..|+|..|.+. ..|.|..|.-.|-.
T Consensus 139 GGrif~CsfC~~flCEDDQF-EHQAsCQvLe--------~E~~KC~SCNrl--------------Gq~sCLRCK~cfCd- 194 (314)
T PF06524_consen 139 GGRIFKCSFCDNFLCEDDQF-EHQASCQVLE--------SETFKCQSCNRL--------------GQYSCLRCKICFCD- 194 (314)
T ss_pred CCeEEEeecCCCeeeccchh-hhhhhhhhhh--------cccccccccccc--------------cchhhhheeeeehh-
Confidence 67889999998765554443 4655322221 112777777654 12556666655544
Q ss_pred hhHHHHhhhh----CCCCcccCCccccccCChHHHHHHHHhh
Q psy359 356 NHLTTHYRIH----TGEKPYQCDICGRGFAQSNDMKKHRRTV 393 (405)
Q Consensus 356 ~~l~~H~~~h----~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 393 (405)
.|.+.- ...+++.|+.|+..-.....|..-.++|
T Consensus 195 ----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 195 ----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred ----hhhhhcccccccCCCCCCCCCCCcccccccceeeeecc
Confidence 344332 1236788999998877777776655543
No 87
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.84 E-value=0.67 Score=39.54 Aligned_cols=43 Identities=28% Similarity=0.631 Sum_probs=30.6
Q ss_pred CccccccccccccCchhHHHHhhhh----------CCCCc-----ccCCccccccCCh
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIH----------TGEKP-----YQCDICGRGFAQS 383 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~-----~~C~~C~~~f~~~ 383 (405)
+.+.||+|++.|....-+....+.- .+.+| ..|+.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 4578999999999886666655431 12233 5899999998754
No 88
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=75.51 E-value=1.8 Score=26.34 Aligned_cols=26 Identities=27% Similarity=0.731 Sum_probs=18.9
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
|.|..||..|... ...+.+|+.||..
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCce
Confidence 7888888887754 2356788888854
No 89
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=74.55 E-value=2.8 Score=22.15 Aligned_cols=19 Identities=32% Similarity=0.590 Sum_probs=13.9
Q ss_pred cCCccccccCChHHHHHHHH
Q psy359 372 QCDICGRGFAQSNDMKKHRR 391 (405)
Q Consensus 372 ~C~~C~~~f~~~~~l~~H~~ 391 (405)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 588888887 5567777765
No 90
>PF12907 zf-met2: Zinc-binding
Probab=74.52 E-value=0.74 Score=27.18 Aligned_cols=27 Identities=30% Similarity=0.630 Sum_probs=18.2
Q ss_pred ccCCccccccCC---hHHHHHHHHhhcccc
Q psy359 371 YQCDICGRGFAQ---SNDMKKHRRTVHKAQ 397 (405)
Q Consensus 371 ~~C~~C~~~f~~---~~~l~~H~~~~h~~~ 397 (405)
+.|.+|...|.. ...|.+|-.+.|...
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~ 31 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKN 31 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence 467777755543 456888887777765
No 91
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.25 E-value=1.3 Score=33.49 Aligned_cols=14 Identities=14% Similarity=0.349 Sum_probs=7.1
Q ss_pred CCcccccchHhhhC
Q psy359 215 KPYACYLCDKRFAQ 228 (405)
Q Consensus 215 ~~~~C~~C~~~f~~ 228 (405)
.|..|+.|+..|.-
T Consensus 25 ~p~vcP~cg~~~~~ 38 (129)
T TIGR02300 25 RPAVSPYTGEQFPP 38 (129)
T ss_pred CCccCCCcCCccCc
Confidence 34555555555543
No 92
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=73.99 E-value=2.6 Score=33.96 Aligned_cols=44 Identities=14% Similarity=0.272 Sum_probs=31.9
Q ss_pred HHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCccccccCC
Q psy359 330 LLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQ 382 (405)
Q Consensus 330 l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 382 (405)
|..-+....+...|.|+.|+..|+...++. .-|.|+.||.....
T Consensus 97 lk~~l~~e~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~~ 140 (158)
T TIGR00373 97 LREKLEFETNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAMLDY 140 (158)
T ss_pred HHHHHhhccCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEeee
Confidence 333344445566799999999999988885 35999999976543
No 93
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=73.63 E-value=1.9 Score=24.83 Aligned_cols=33 Identities=18% Similarity=0.627 Sum_probs=18.4
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
..|+.|+..|.-...... .+-...+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip-----~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIP-----PKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCC-----CCCcEEECCCCCCEe
Confidence 356677777666554321 122346777777665
No 94
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=73.63 E-value=1.1 Score=28.87 Aligned_cols=10 Identities=40% Similarity=1.255 Sum_probs=7.0
Q ss_pred Cccccccccc
Q psy359 341 RPYACTQCDK 350 (405)
Q Consensus 341 ~~~~C~~C~~ 350 (405)
.+|+|+.||+
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 4578877775
No 95
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=72.77 E-value=2.2 Score=31.78 Aligned_cols=14 Identities=36% Similarity=0.923 Sum_probs=7.4
Q ss_pred CcccCCccccccCC
Q psy359 369 KPYQCDICGRGFAQ 382 (405)
Q Consensus 369 ~~~~C~~C~~~f~~ 382 (405)
.|..|++||..|.-
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 45555555555543
No 96
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=72.25 E-value=1.5 Score=26.85 Aligned_cols=29 Identities=21% Similarity=0.613 Sum_probs=17.7
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccccC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFA 381 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 381 (405)
|+|+.||..|...... ....|+.||..+.
T Consensus 4 y~C~~CG~~~~~~~~~----------~~~~Cp~CG~~~~ 32 (46)
T PRK00398 4 YKCARCGREVELDEYG----------TGVRCPYCGYRIL 32 (46)
T ss_pred EECCCCCCEEEECCCC----------CceECCCCCCeEE
Confidence 6777777776654311 1467777776544
No 97
>PHA00626 hypothetical protein
Probab=72.20 E-value=1.4 Score=27.84 Aligned_cols=12 Identities=33% Similarity=0.794 Sum_probs=6.8
Q ss_pred cccccccccccC
Q psy359 343 YACTQCDKTFKK 354 (405)
Q Consensus 343 ~~C~~C~~~f~~ 354 (405)
|+|+.||..|+.
T Consensus 24 YkCkdCGY~ft~ 35 (59)
T PHA00626 24 YVCCDCGYNDSK 35 (59)
T ss_pred eEcCCCCCeech
Confidence 555555555544
No 98
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=71.26 E-value=2 Score=26.59 Aligned_cols=27 Identities=22% Similarity=0.783 Sum_probs=16.4
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
.|.|..||+.|... .......|+.||.
T Consensus 6 ~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~ 32 (49)
T COG1996 6 EYKCARCGREVELD----------QETRGIRCPYCGS 32 (49)
T ss_pred EEEhhhcCCeeehh----------hccCceeCCCCCc
Confidence 37777788777211 1235567777774
No 99
>KOG2807|consensus
Probab=70.56 E-value=6.6 Score=34.96 Aligned_cols=81 Identities=22% Similarity=0.439 Sum_probs=51.4
Q ss_pred ccceecCCCcccCCChHHHHHHHhhhccCCCccccccCCCCC------cCCCcccchHHHHhhhhhcCCCCccccccccc
Q psy359 277 KLIFKCDLCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDG------VCDVCGEYKKQLLQHKRLHFPLRPYACTQCDK 350 (405)
Q Consensus 277 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~------~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~ 350 (405)
..|..|++|+-+..+...|.+-.+ |...- ..+...|.- .|-.|+. .-.+...|+|..|.-
T Consensus 288 sLP~eCpiC~ltLVss~hLARSyh--hL~PL--~~F~Eip~~~~~~~~~Cf~C~~----------~~~~~~~y~C~~Ck~ 353 (378)
T KOG2807|consen 288 SLPIECPICSLTLVSSPHLARSYH--HLFPL--KPFVEIPETEYNGSRFCFACQG----------ELLSSGRYRCESCKN 353 (378)
T ss_pred cCCccCCccceeEecchHHHHHHH--hhcCC--cchhhccccccCCCcceeeecc----------ccCCCCcEEchhccc
Confidence 457889999999999888877554 33221 122222211 1444411 112234599999999
Q ss_pred cccCchhHHHHhhhhCCCCcccCCccc
Q psy359 351 TFKKKNHLTTHYRIHTGEKPYQCDICG 377 (405)
Q Consensus 351 ~f~~~~~l~~H~~~h~~~~~~~C~~C~ 377 (405)
.|-.-.+..-|...|. |+.|.
T Consensus 354 ~FCldCDv~iHesLh~------CpgCe 374 (378)
T KOG2807|consen 354 VFCLDCDVFIHESLHN------CPGCE 374 (378)
T ss_pred eeeccchHHHHhhhhc------CCCcC
Confidence 9999999999866654 77776
No 100
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=70.40 E-value=2 Score=25.00 Aligned_cols=32 Identities=22% Similarity=0.582 Sum_probs=15.0
Q ss_pred cCccchhccCChHHHHHHHhhhcCCCCcccccchHhh
Q psy359 190 PCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRF 226 (405)
Q Consensus 190 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 226 (405)
.|+.|+..|.-... ++-.+....+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~-----~l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDD-----KLPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHH-----HcccCCcEEECCCCCcEe
Confidence 45556555544332 111223345666666554
No 101
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=70.20 E-value=3 Score=34.34 Aligned_cols=42 Identities=14% Similarity=0.381 Sum_probs=30.1
Q ss_pred hhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCccccccCC
Q psy359 332 QHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQ 382 (405)
Q Consensus 332 ~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 382 (405)
.-+........|.|+.|+..|+...++. ..|.|+.||-....
T Consensus 107 ~~l~~e~~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 107 EQLEEEENNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLEE 148 (178)
T ss_pred HHhhhccCCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence 3333344556689999999998888763 36999999976654
No 102
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=69.72 E-value=1.6 Score=27.60 Aligned_cols=29 Identities=28% Similarity=0.756 Sum_probs=16.2
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
|+|..|+..|.....+.. ...-.|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (52)
T TIGR02605 6 YRCTACGHRFEVLQKMSD-------DPLATCPECGG 34 (52)
T ss_pred EEeCCCCCEeEEEEecCC-------CCCCCCCCCCC
Confidence 677777776765432211 23445777774
No 103
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=68.78 E-value=1.7 Score=25.74 Aligned_cols=29 Identities=28% Similarity=0.770 Sum_probs=16.2
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
|+|+.||..|........ ...-.|+.||.
T Consensus 6 y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (41)
T smart00834 6 YRCEDCGHTFEVLQKISD-------DPLATCPECGG 34 (41)
T ss_pred EEcCCCCCEEEEEEecCC-------CCCCCCCCCCC
Confidence 677777776665433211 23456777765
No 104
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=68.42 E-value=4.3 Score=25.37 Aligned_cols=21 Identities=29% Similarity=0.934 Sum_probs=10.1
Q ss_pred cCCccccccCCh-----HHHHHHHHh
Q psy359 372 QCDICGRGFAQS-----NDMKKHRRT 392 (405)
Q Consensus 372 ~C~~C~~~f~~~-----~~l~~H~~~ 392 (405)
.|.+|++.+... +.|..|+..
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~~ 45 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLRR 45 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHHh
Confidence 444444444332 455555553
No 105
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=67.89 E-value=3.7 Score=22.96 Aligned_cols=11 Identities=27% Similarity=0.954 Sum_probs=5.4
Q ss_pred ccccccccccc
Q psy359 343 YACTQCDKTFK 353 (405)
Q Consensus 343 ~~C~~C~~~f~ 353 (405)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 44555555554
No 106
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=66.97 E-value=7.2 Score=31.02 Aligned_cols=41 Identities=15% Similarity=0.433 Sum_probs=26.9
Q ss_pred cCCCCccccccccccccCchhHHHHhhhhCCCCcccCCccccccC
Q psy359 337 HFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFA 381 (405)
Q Consensus 337 h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 381 (405)
-.+...|.|+.|+..|.....+.. .. ....|.|+.||....
T Consensus 94 e~~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 94 ETNNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEELE 134 (147)
T ss_pred ccCCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEEE
Confidence 345556899999988886554432 01 134499999997653
No 107
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=66.63 E-value=2.6 Score=25.29 Aligned_cols=29 Identities=28% Similarity=0.885 Sum_probs=18.0
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
|+|..||..|.....+.. ...-.|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (42)
T PF09723_consen 6 YRCEECGHEFEVLQSISE-------DDPVPCPECGS 34 (42)
T ss_pred EEeCCCCCEEEEEEEcCC-------CCCCcCCCCCC
Confidence 677777777766544321 34456777775
No 108
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=66.36 E-value=1.7 Score=29.64 Aligned_cols=20 Identities=25% Similarity=0.459 Sum_probs=11.8
Q ss_pred CCCccccc--cccccccCchhH
Q psy359 339 PLRPYACT--QCDKTFKKKNHL 358 (405)
Q Consensus 339 ~~~~~~C~--~C~~~f~~~~~l 358 (405)
.++-++|. .||..|.....+
T Consensus 24 ~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 24 KERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred heeeeecCCCCCCCEEEEEEEE
Confidence 34446776 677777665443
No 109
>PRK04023 DNA polymerase II large subunit; Validated
Probab=66.23 E-value=4.3 Score=42.03 Aligned_cols=8 Identities=50% Similarity=1.406 Sum_probs=4.0
Q ss_pred ccCCcccc
Q psy359 371 YQCDICGR 378 (405)
Q Consensus 371 ~~C~~C~~ 378 (405)
+.|+.|+.
T Consensus 664 y~CPKCG~ 671 (1121)
T PRK04023 664 DECEKCGR 671 (1121)
T ss_pred CcCCCCCC
Confidence 44555554
No 110
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=65.88 E-value=4.5 Score=41.11 Aligned_cols=38 Identities=29% Similarity=0.621 Sum_probs=20.6
Q ss_pred cCCCcccchHHHHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 319 VCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 319 ~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
+|+.|..+ | ..|-..+..+|.+||. ....|..|+.||-
T Consensus 446 ~Cp~Cd~~---l----t~H~~~~~L~CH~Cg~---------------~~~~p~~Cp~Cgs 483 (730)
T COG1198 446 ECPNCDSP---L----TLHKATGQLRCHYCGY---------------QEPIPQSCPECGS 483 (730)
T ss_pred cCCCCCcc---e----EEecCCCeeEeCCCCC---------------CCCCCCCCCCCCC
Confidence 56666654 1 1122334467777773 3345667777774
No 111
>KOG2807|consensus
Probab=65.70 E-value=7.4 Score=34.69 Aligned_cols=26 Identities=15% Similarity=0.359 Sum_probs=19.6
Q ss_pred cccceecCCCcccCCChHHHHHHHhh
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRV 301 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 301 (405)
....|.|..|...|-..-....|...
T Consensus 342 ~~~~y~C~~Ck~~FCldCDv~iHesL 367 (378)
T KOG2807|consen 342 SSGRYRCESCKNVFCLDCDVFIHESL 367 (378)
T ss_pred CCCcEEchhccceeeccchHHHHhhh
Confidence 34568999999888888777777663
No 112
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=64.68 E-value=3 Score=22.90 Aligned_cols=24 Identities=21% Similarity=0.663 Sum_probs=9.8
Q ss_pred ccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 344 ACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 344 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
+|+.|+..|.. .+...|.|+.|+.
T Consensus 4 ~Cp~C~se~~y-----------~D~~~~vCp~C~~ 27 (30)
T PF08274_consen 4 KCPLCGSEYTY-----------EDGELLVCPECGH 27 (30)
T ss_dssp --TTT-----E-----------E-SSSEEETTTTE
T ss_pred CCCCCCCccee-----------ccCCEEeCCcccc
Confidence 57777766666 3344566776664
No 113
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=62.78 E-value=6.8 Score=31.61 Aligned_cols=31 Identities=16% Similarity=0.158 Sum_probs=25.8
Q ss_pred CCCCcccCCccccccCChHHHHHHHHhhccccccccccCC
Q psy359 366 TGEKPYQCDICGRGFAQSNDMKKHRRTVHKAQIHAVEDKG 405 (405)
Q Consensus 366 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~ 405 (405)
.+..-|.|+.|+..|+....+.. -|.||.||
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~~---------~F~Cp~Cg 135 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAMEL---------NFTCPRCG 135 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHHc---------CCcCCCCC
Confidence 34456999999999999998862 59999998
No 114
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=62.36 E-value=2 Score=36.67 Aligned_cols=23 Identities=22% Similarity=0.606 Sum_probs=16.8
Q ss_pred cceecCCCcccCCChHHHHHHHh
Q psy359 278 LIFKCDLCGNILSSKHILQEHVR 300 (405)
Q Consensus 278 ~~~~C~~C~~~f~~~~~l~~H~~ 300 (405)
+.+.|++|+..|.+..-.....+
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCce
Confidence 45889999999988765444444
No 115
>KOG2186|consensus
Probab=61.66 E-value=3.9 Score=35.00 Aligned_cols=32 Identities=22% Similarity=0.467 Sum_probs=18.6
Q ss_pred HHhhhhhcCCCCccccccccccccCchhHHHHhh
Q psy359 330 LLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYR 363 (405)
Q Consensus 330 l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 363 (405)
+.+|+-.-++ .-|.|-.|++.|.. ..+..|..
T Consensus 18 vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 18 VEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred hHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 3344433333 34777778877777 55666643
No 116
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=60.80 E-value=6.7 Score=25.44 Aligned_cols=40 Identities=23% Similarity=0.657 Sum_probs=23.6
Q ss_pred Ccccccc--ccccccCchhHHHHhhhhCCCCcccCCc----cccccC
Q psy359 341 RPYACTQ--CDKTFKKKNHLTTHYRIHTGEKPYQCDI----CGRGFA 381 (405)
Q Consensus 341 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 381 (405)
.+..|+. |...+. +..|..|+...-..++..|++ |+..+.
T Consensus 8 ~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 4567777 444444 556888887766777778888 877654
No 117
>PF15269 zf-C2H2_7: Zinc-finger
Probab=60.46 E-value=9.3 Score=22.88 Aligned_cols=25 Identities=36% Similarity=0.688 Sum_probs=21.8
Q ss_pred CcccccCCCccccCChhhHHHHHHh
Q psy359 126 PLTYPCDQCDRTYQTKKSLYVHRRA 150 (405)
Q Consensus 126 ~~~~~C~~C~~~f~~~~~l~~H~~~ 150 (405)
+..|+|-+|..+...++.|-.||+.
T Consensus 18 p~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 18 PFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CccceeecCCcccchHHHHHHHHHH
Confidence 3458899999999999999999975
No 118
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=58.51 E-value=6.5 Score=30.46 Aligned_cols=26 Identities=46% Similarity=0.680 Sum_probs=15.7
Q ss_pred CCccCccchhccCChHHHHHHHhhhcCCC
Q psy359 187 RKFPCTFCEKSFKRKLHLKVHTRTHTGEK 215 (405)
Q Consensus 187 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 215 (405)
....|.+||+.|.. |.+|+..|+|-.
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred CeeEEccCCcccch---HHHHHHHccCCC
Confidence 45788999998865 588988887643
No 119
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=58.44 E-value=6.8 Score=30.36 Aligned_cols=24 Identities=38% Similarity=0.737 Sum_probs=11.2
Q ss_pred cccccccccccCchhHHHHhhhhCCCC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEK 369 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~ 369 (405)
..|-+||+.|... ..|++.|+|..
T Consensus 73 i~clecGk~~k~L---krHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKTL---KRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESBH---HHHHHHTT-S-
T ss_pred eEEccCCcccchH---HHHHHHccCCC
Confidence 5666666666553 56666665554
No 120
>KOG2272|consensus
Probab=57.86 E-value=4 Score=34.77 Aligned_cols=41 Identities=32% Similarity=0.737 Sum_probs=25.2
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccccCChHHHHHHHH
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKKHRR 391 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 391 (405)
|.|..|++....+..+..- +.+| .|..|--.|+ ..|.+-++
T Consensus 281 f~Cs~Cdkkl~~K~Kf~E~-----DmkP-~CKkCy~rfp--~elkkrl~ 321 (332)
T KOG2272|consen 281 FSCSTCDKKLTQKNKFYEF-----DMKP-VCKKCYDRFP--LELKKRLK 321 (332)
T ss_pred ccccccccccccccceeee-----ccch-HHHHHHhhcc--HHHHHHHH
Confidence 8888888887777655432 2233 5777777666 44444433
No 121
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=57.14 E-value=8.2 Score=31.81 Aligned_cols=30 Identities=17% Similarity=0.248 Sum_probs=24.6
Q ss_pred CCCcccCCccccccCChHHHHHHHHhhccccccccccCC
Q psy359 367 GEKPYQCDICGRGFAQSNDMKKHRRTVHKAQIHAVEDKG 405 (405)
Q Consensus 367 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~ 405 (405)
...-|.|+.|+..|+....+. --|.||.||
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg 143 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCG 143 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCC
Confidence 345699999999999988764 259999998
No 122
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=56.94 E-value=7 Score=29.73 Aligned_cols=15 Identities=20% Similarity=0.368 Sum_probs=7.0
Q ss_pred CccccccccccccCc
Q psy359 341 RPYACTQCDKTFKKK 355 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~ 355 (405)
.|..|++||..|...
T Consensus 25 ~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 25 RPAVSPYTGEQFPPE 39 (129)
T ss_pred CCccCCCcCCccCcc
Confidence 344455555544443
No 123
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=56.94 E-value=5.7 Score=24.87 Aligned_cols=13 Identities=31% Similarity=1.063 Sum_probs=9.8
Q ss_pred eecCCCcccCCCh
Q psy359 280 FKCDLCGNILSSK 292 (405)
Q Consensus 280 ~~C~~C~~~f~~~ 292 (405)
|+|..|+..+...
T Consensus 2 y~C~~CgyiYd~~ 14 (50)
T cd00730 2 YECRICGYIYDPA 14 (50)
T ss_pred cCCCCCCeEECCC
Confidence 7888888877643
No 124
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=56.65 E-value=6.2 Score=38.67 Aligned_cols=28 Identities=25% Similarity=0.489 Sum_probs=17.8
Q ss_pred cCCCCccccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 337 HFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 337 h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
|.......|.+||.... -|..|+.|+..
T Consensus 235 h~~~~~l~Ch~Cg~~~~---------------~~~~Cp~C~s~ 262 (505)
T TIGR00595 235 HKKEGKLRCHYCGYQEP---------------IPKTCPQCGSE 262 (505)
T ss_pred ecCCCeEEcCCCcCcCC---------------CCCCCCCCCCC
Confidence 33455678888885443 34578888763
No 125
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=56.41 E-value=5.2 Score=28.09 Aligned_cols=14 Identities=36% Similarity=0.852 Sum_probs=8.6
Q ss_pred CcccCCccccccCC
Q psy359 369 KPYQCDICGRGFAQ 382 (405)
Q Consensus 369 ~~~~C~~C~~~f~~ 382 (405)
..+.|..|+..|.-
T Consensus 52 GIW~C~kCg~~fAG 65 (89)
T COG1997 52 GIWKCRKCGAKFAG 65 (89)
T ss_pred CeEEcCCCCCeecc
Confidence 44666667666653
No 126
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=55.91 E-value=10 Score=30.16 Aligned_cols=35 Identities=11% Similarity=0.203 Sum_probs=24.2
Q ss_pred CCCcccCCccccccCChHHHHHHHHhhccccccccccCC
Q psy359 367 GEKPYQCDICGRGFAQSNDMKKHRRTVHKAQIHAVEDKG 405 (405)
Q Consensus 367 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~ 405 (405)
+..-|.|+.|+..|.....+.- .. .+.-|.||.||
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~--~d--~~~~f~Cp~Cg 130 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQL--LD--MDGTFTCPRCG 130 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHh--cC--CCCcEECCCCC
Confidence 3456999999999996554432 11 23349999998
No 127
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=53.54 E-value=8.6 Score=33.07 Aligned_cols=89 Identities=18% Similarity=0.381 Sum_probs=45.9
Q ss_pred CcCcccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHH
Q psy359 124 TLPLTYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLH 203 (405)
Q Consensus 124 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~ 203 (405)
.+...|+|..|....---.+| .|+..--.+ ....|.|..|++. ..|.|..|.-.|-.-..
T Consensus 138 hGGrif~CsfC~~flCEDDQF-EHQAsCQvL----e~E~~KC~SCNrl---------------Gq~sCLRCK~cfCddHv 197 (314)
T PF06524_consen 138 HGGRIFKCSFCDNFLCEDDQF-EHQASCQVL----ESETFKCQSCNRL---------------GQYSCLRCKICFCDDHV 197 (314)
T ss_pred CCCeEEEeecCCCeeeccchh-hhhhhhhhh----hcccccccccccc---------------cchhhhheeeeehhhhh
Confidence 344557788777554333333 343332111 1455666666652 34666666666654222
Q ss_pred HHHHHhhhcCCCCcccccchHhhhChHHHH
Q psy359 204 LKVHTRTHTGEKPYACYLCDKRFAQISDRI 233 (405)
Q Consensus 204 l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~ 233 (405)
.+.=++. ...+++.|+.|+........|.
T Consensus 198 rrKg~ky-~k~k~~PCPKCg~et~eTkdLS 226 (314)
T PF06524_consen 198 RRKGFKY-EKGKPIPCPKCGYETQETKDLS 226 (314)
T ss_pred hhccccc-ccCCCCCCCCCCCcccccccce
Confidence 2211222 2347899999997765444433
No 128
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=52.77 E-value=8.3 Score=23.79 Aligned_cols=9 Identities=33% Similarity=1.065 Sum_probs=5.5
Q ss_pred cccCCcccc
Q psy359 370 PYQCDICGR 378 (405)
Q Consensus 370 ~~~C~~C~~ 378 (405)
.+-|..|+.
T Consensus 37 R~~CGkCgy 45 (51)
T COG1998 37 RWACGKCGY 45 (51)
T ss_pred eeEeccccc
Confidence 466666664
No 129
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=52.72 E-value=8.1 Score=29.63 Aligned_cols=15 Identities=33% Similarity=0.966 Sum_probs=12.2
Q ss_pred ccccccccccccCch
Q psy359 342 PYACTQCDKTFKKKN 356 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~ 356 (405)
|++|..||+.|...+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 468888999888876
No 130
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=52.66 E-value=6.3 Score=20.10 Aligned_cols=6 Identities=50% Similarity=1.436 Sum_probs=2.8
Q ss_pred CCcccc
Q psy359 373 CDICGR 378 (405)
Q Consensus 373 C~~C~~ 378 (405)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 444443
No 131
>KOG2593|consensus
Probab=52.30 E-value=11 Score=35.13 Aligned_cols=48 Identities=21% Similarity=0.505 Sum_probs=32.7
Q ss_pred HHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 330 LLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 330 l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
|..-++..+....|.|+.|++.|.....++.= -....-|.|..|+-..
T Consensus 116 led~~~d~t~~~~Y~Cp~C~kkyt~Lea~~L~---~~~~~~F~C~~C~gel 163 (436)
T KOG2593|consen 116 LEDRLRDDTNVAGYVCPNCQKKYTSLEALQLL---DNETGEFHCENCGGEL 163 (436)
T ss_pred HHHHhhhccccccccCCccccchhhhHHHHhh---cccCceEEEecCCCch
Confidence 44445556667779999999999887776542 2224568898887543
No 132
>PRK14873 primosome assembly protein PriA; Provisional
Probab=52.21 E-value=7.7 Score=39.33 Aligned_cols=29 Identities=17% Similarity=0.224 Sum_probs=14.5
Q ss_pred CcCCCcccc------hHHHHhhhhhcCCCCccccccccc
Q psy359 318 GVCDVCGEY------KKQLLQHKRLHFPLRPYACTQCDK 350 (405)
Q Consensus 318 ~~C~~C~~~------~~~l~~H~~~h~~~~~~~C~~C~~ 350 (405)
..|..|+.. ...|.- |...+...|.+||.
T Consensus 384 l~C~~Cg~~~~C~~C~~~L~~----h~~~~~l~Ch~CG~ 418 (665)
T PRK14873 384 LACARCRTPARCRHCTGPLGL----PSAGGTPRCRWCGR 418 (665)
T ss_pred eEhhhCcCeeECCCCCCceeE----ecCCCeeECCCCcC
Confidence 467777664 222222 22334456666663
No 133
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=50.94 E-value=5.9 Score=28.97 Aligned_cols=11 Identities=18% Similarity=0.398 Sum_probs=5.5
Q ss_pred CccCccchhcc
Q psy359 188 KFPCTFCEKSF 198 (405)
Q Consensus 188 ~~~C~~C~~~f 198 (405)
|..|+.||+.|
T Consensus 26 PiVsPytG~s~ 36 (129)
T COG4530 26 PIVSPYTGKSY 36 (129)
T ss_pred ccccCcccccc
Confidence 44555555544
No 134
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=50.85 E-value=1.7 Score=39.08 Aligned_cols=73 Identities=18% Similarity=0.311 Sum_probs=25.1
Q ss_pred CcCCCcccchHHHHhhhhhcCCCCccccccccccccCchhHHHHhhhhCCCCcccCCccccccCChHHHHHHHHhhcccc
Q psy359 318 GVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQSNDMKKHRRTVHKAQ 397 (405)
Q Consensus 318 ~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 397 (405)
..||.||..+..-........|.|-..|..|+...... ...|+.||..-...-..-. ....-+.+
T Consensus 173 g~CPvCGs~P~~s~l~~~~~~G~R~L~Cs~C~t~W~~~--------------R~~Cp~Cg~~~~~~l~~~~-~e~~~~~r 237 (290)
T PF04216_consen 173 GYCPVCGSPPVLSVLRGGEREGKRYLHCSLCGTEWRFV--------------RIKCPYCGNTDHEKLEYFT-VEGEPAYR 237 (290)
T ss_dssp SS-TTT---EEEEEEE------EEEEEETTT--EEE----------------TTS-TTT---SS-EEE---------SEE
T ss_pred CcCCCCCCcCceEEEecCCCCccEEEEcCCCCCeeeec--------------CCCCcCCCCCCCcceeeEe-cCCCCcEE
Confidence 56888877511100000000245667888887655442 2468888865443332211 12222445
Q ss_pred ccccccCC
Q psy359 398 IHAVEDKG 405 (405)
Q Consensus 398 ~~~~~~~~ 405 (405)
++.|..|+
T Consensus 238 ve~C~~C~ 245 (290)
T PF04216_consen 238 VEVCESCG 245 (290)
T ss_dssp EEEETTTT
T ss_pred EEECCccc
Confidence 67777775
No 135
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=50.22 E-value=8.6 Score=19.88 Aligned_cols=9 Identities=11% Similarity=0.062 Sum_probs=5.1
Q ss_pred cccccccCC
Q psy359 397 QIHAVEDKG 405 (405)
Q Consensus 397 ~~~~~~~~~ 405 (405)
.+|.||+||
T Consensus 15 v~f~CPnCG 23 (24)
T PF07754_consen 15 VPFPCPNCG 23 (24)
T ss_pred ceEeCCCCC
Confidence 346666665
No 136
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=49.63 E-value=11 Score=27.33 Aligned_cols=18 Identities=28% Similarity=0.709 Sum_probs=13.8
Q ss_pred ccccccccccccCchhHH
Q psy359 342 PYACTQCDKTFKKKNHLT 359 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~ 359 (405)
|++|..||..|...+.+.
T Consensus 2 pH~CtrCG~vf~~g~~~i 19 (112)
T COG3364 2 PHQCTRCGEVFDDGSEEI 19 (112)
T ss_pred CceecccccccccccHHH
Confidence 578999999998865443
No 137
>PRK04023 DNA polymerase II large subunit; Validated
Probab=49.17 E-value=16 Score=38.11 Aligned_cols=9 Identities=44% Similarity=1.206 Sum_probs=4.3
Q ss_pred ccCCccccc
Q psy359 371 YQCDICGRG 379 (405)
Q Consensus 371 ~~C~~C~~~ 379 (405)
+.|+.|+..
T Consensus 652 ~fCP~CG~~ 660 (1121)
T PRK04023 652 YRCPRCGIE 660 (1121)
T ss_pred eeCccccCc
Confidence 445555443
No 138
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=48.95 E-value=5 Score=25.23 Aligned_cols=26 Identities=23% Similarity=0.568 Sum_probs=15.5
Q ss_pred CccccccccccccCchhHHHHhhhhC
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
..|+|+.|+..|-.--++..|...|.
T Consensus 20 ~~y~C~~C~~~FC~dCD~fiHE~LH~ 45 (51)
T PF07975_consen 20 SRYRCPKCKNHFCIDCDVFIHETLHN 45 (51)
T ss_dssp EEE--TTTT--B-HHHHHTTTTTS-S
T ss_pred CeEECCCCCCccccCcChhhhccccC
Confidence 45899999999988888888855554
No 139
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=46.88 E-value=13 Score=39.89 Aligned_cols=29 Identities=21% Similarity=0.662 Sum_probs=15.9
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCccccccC
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFA 381 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 381 (405)
+|.|+.||....... ++ ...|+.|+....
T Consensus 692 vy~CPsCGaev~~de---------s~--a~~CP~CGtplv 720 (1337)
T PRK14714 692 VYVCPDCGAEVPPDE---------SG--RVECPRCDVELT 720 (1337)
T ss_pred ceeCccCCCccCCCc---------cc--cccCCCCCCccc
Confidence 467777776544311 11 346777775544
No 140
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=45.88 E-value=15 Score=32.43 Aligned_cols=26 Identities=27% Similarity=0.542 Sum_probs=18.8
Q ss_pred CccccccccccccCchhHHHHhhhhC
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
..|+|..|...|-.-....-|...|.
T Consensus 387 ~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 387 GRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred cceechhhhhhhhhhhHHHHHHHHhh
Confidence 44788888888887777777765553
No 141
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=45.35 E-value=11 Score=33.22 Aligned_cols=26 Identities=19% Similarity=0.392 Sum_probs=20.8
Q ss_pred cccceecCCCcccCCChHHHHHHHhh
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRV 301 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 301 (405)
....|.|+.|...|-..-....|...
T Consensus 385 ss~rY~Ce~CK~~FC~dCdvfiHe~L 410 (421)
T COG5151 385 SSGRYQCELCKSTFCSDCDVFIHETL 410 (421)
T ss_pred cccceechhhhhhhhhhhHHHHHHHH
Confidence 34579999999999888888877763
No 142
>KOG1280|consensus
Probab=44.39 E-value=17 Score=32.75 Aligned_cols=37 Identities=19% Similarity=0.456 Sum_probs=28.5
Q ss_pred CccccccccccccCchhHHHHhhhhCCCCc--ccCCccc
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHTGEKP--YQCDICG 377 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~ 377 (405)
..|.|++|+.+=.+...|..|....|.+-+ ..|++|.
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 458999999998888899999876665443 4677776
No 143
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=44.35 E-value=10 Score=25.42 Aligned_cols=26 Identities=31% Similarity=0.674 Sum_probs=10.4
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
-.|..|++.|... .+.+.|..||..|
T Consensus 10 ~~C~~C~~~F~~~------------~rrhhCr~CG~~v 35 (69)
T PF01363_consen 10 SNCMICGKKFSLF------------RRRHHCRNCGRVV 35 (69)
T ss_dssp SB-TTT--B-BSS------------S-EEE-TTT--EE
T ss_pred CcCcCcCCcCCCc------------eeeEccCCCCCEE
Confidence 3688888888432 2445566666654
No 144
>PF13696 zf-CCHC_2: Zinc knuckle
Probab=44.22 E-value=13 Score=20.73 Aligned_cols=16 Identities=19% Similarity=0.542 Sum_probs=12.1
Q ss_pred CCCcccceeccccccc
Q psy359 2 KPKETTSIKCEACHRT 17 (405)
Q Consensus 2 ~~~~~~~~~C~~C~~~ 17 (405)
+..-+..|+|.+|+..
T Consensus 2 ~k~pP~~Y~C~~C~~~ 17 (32)
T PF13696_consen 2 RKKPPPGYVCHRCGQK 17 (32)
T ss_pred CCCCCCCCEeecCCCC
Confidence 3456788999999864
No 145
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=43.44 E-value=9.5 Score=30.02 Aligned_cols=34 Identities=32% Similarity=0.905 Sum_probs=20.9
Q ss_pred CccccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
-+|.|. |+..|.+. +.|=..-.|+ .|.|..|+-.
T Consensus 116 ~~Y~C~-C~q~~l~~---RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 116 YPYRCQ-CQQHYLRI---RRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred eeEEee-cCCccchh---hhcccccccc-eEEeccCCce
Confidence 358888 88876654 2333333344 6888888754
No 146
>KOG4167|consensus
Probab=43.32 E-value=31 Score=34.60 Aligned_cols=28 Identities=25% Similarity=0.570 Sum_probs=24.6
Q ss_pred CcccccCCCccccCChhhHHHHHHhhcC
Q psy359 126 PLTYPCDQCDRTYQTKKSLYVHRRAHLG 153 (405)
Q Consensus 126 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 153 (405)
..-|.|.+|++.|.....+..||++|..
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 3458999999999999999999999876
No 147
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=43.25 E-value=17 Score=22.21 Aligned_cols=11 Identities=27% Similarity=0.972 Sum_probs=6.3
Q ss_pred CCcccCCcccc
Q psy359 368 EKPYQCDICGR 378 (405)
Q Consensus 368 ~~~~~C~~C~~ 378 (405)
...|+|..|++
T Consensus 35 ~~~~~C~~C~~ 45 (46)
T PF12760_consen 35 RGRYRCKACRK 45 (46)
T ss_pred CCeEECCCCCC
Confidence 44566666654
No 148
>KOG1701|consensus
Probab=43.13 E-value=8.1 Score=35.84 Aligned_cols=12 Identities=25% Similarity=0.877 Sum_probs=7.3
Q ss_pred ccCccchhccCC
Q psy359 189 FPCTFCEKSFKR 200 (405)
Q Consensus 189 ~~C~~C~~~f~~ 200 (405)
|.|..|++.+.-
T Consensus 303 FtC~~C~r~L~G 314 (468)
T KOG1701|consen 303 FTCRTCRRQLAG 314 (468)
T ss_pred eehHhhhhhhcc
Confidence 666666665544
No 149
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=42.23 E-value=13 Score=26.27 Aligned_cols=31 Identities=29% Similarity=0.784 Sum_probs=22.1
Q ss_pred CCccccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 340 LRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 340 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
.+|-+|..||+.|.... -..|-.|+.|.--.
T Consensus 56 v~Pa~CkkCGfef~~~~----------ik~pSRCP~CKSE~ 86 (97)
T COG3357 56 VRPARCKKCGFEFRDDK----------IKKPSRCPKCKSEW 86 (97)
T ss_pred ecChhhcccCccccccc----------cCCcccCCcchhhc
Confidence 35778999999998721 13578899997543
No 150
>KOG2907|consensus
Probab=41.53 E-value=19 Score=26.68 Aligned_cols=11 Identities=36% Similarity=1.168 Sum_probs=5.3
Q ss_pred ccccccccccc
Q psy359 343 YACTQCDKTFK 353 (405)
Q Consensus 343 ~~C~~C~~~f~ 353 (405)
|.|+-|++.|.
T Consensus 103 YTC~kC~~k~~ 113 (116)
T KOG2907|consen 103 YTCPKCKYKFT 113 (116)
T ss_pred EEcCccceeee
Confidence 45555554443
No 151
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=41.33 E-value=7.7 Score=26.43 Aligned_cols=35 Identities=29% Similarity=0.701 Sum_probs=16.5
Q ss_pred CCCCccccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 338 FPLRPYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 338 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
.......|..|+........ ..+ ..|.|+.|+-.|
T Consensus 37 ~~~~~v~Cg~C~~~~~~~~~-------~c~-~~~~C~~C~~~~ 71 (71)
T PF05495_consen 37 WPVKRVICGKCRTEQPIDEY-------SCG-ADYFCPICGLYF 71 (71)
T ss_dssp TT--EEEETTT--EEES-SB-------TT---SEEETTTTEEE
T ss_pred ccccCeECCCCCCccChhhh-------hcC-CCccCcCcCCCC
Confidence 33445677777777666543 111 456777776543
No 152
>KOG2071|consensus
Probab=41.23 E-value=19 Score=35.21 Aligned_cols=29 Identities=17% Similarity=0.381 Sum_probs=23.2
Q ss_pred cccceecCCCcccCCChHHHHHHHhhhccC
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRVVHMG 305 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 305 (405)
...+..|..||.+|........|+. .|..
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md-~H~d 443 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMD-IHDD 443 (579)
T ss_pred cCCcchhcccccccccchhhhhHhh-hhhh
Confidence 3456789999999999999888888 4443
No 153
>KOG1280|consensus
Probab=40.35 E-value=25 Score=31.77 Aligned_cols=39 Identities=21% Similarity=0.383 Sum_probs=29.9
Q ss_pred ccccCCCccccCChhhHHHHHHhhcCcccccCCCCCcCCCchH
Q psy359 128 TYPCDQCDRTYQTKKSLYVHRRAHLGIVYRYKSKTDRCELCDK 170 (405)
Q Consensus 128 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~C~~C~~ 170 (405)
.|.|++|+.+=.+...|+.|....|... .....|+.|..
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda----~~~~icp~c~~ 117 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEA----STSVICPLCAA 117 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCccc----Ccceeeecccc
Confidence 4999999999999999999988766643 34455666654
No 154
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=40.08 E-value=30 Score=37.26 Aligned_cols=9 Identities=44% Similarity=1.110 Sum_probs=4.6
Q ss_pred ccCCccccc
Q psy359 371 YQCDICGRG 379 (405)
Q Consensus 371 ~~C~~C~~~ 379 (405)
|.|+.|+..
T Consensus 693 y~CPsCGae 701 (1337)
T PRK14714 693 YVCPDCGAE 701 (1337)
T ss_pred eeCccCCCc
Confidence 455555544
No 155
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=39.78 E-value=18 Score=32.63 Aligned_cols=33 Identities=24% Similarity=0.499 Sum_probs=16.8
Q ss_pred CcCCCcccchHHHHhhhhhcCCCCccccccccc
Q psy359 318 GVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDK 350 (405)
Q Consensus 318 ~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~ 350 (405)
..||.||..+..-..+.....|.|-..|..|+.
T Consensus 188 ~~CPvCGs~P~~s~v~~~~~~G~RyL~CslC~t 220 (309)
T PRK03564 188 QFCPVCGSMPVSSVVQIGTTQGLRYLHCNLCES 220 (309)
T ss_pred CCCCCCCCcchhheeeccCCCCceEEEcCCCCC
Confidence 567777776321112333344555556666653
No 156
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=39.33 E-value=18 Score=21.40 Aligned_cols=20 Identities=20% Similarity=0.493 Sum_probs=11.3
Q ss_pred CcccCCccccccCChHHHHH
Q psy359 369 KPYQCDICGRGFAQSNDMKK 388 (405)
Q Consensus 369 ~~~~C~~C~~~f~~~~~l~~ 388 (405)
..+.|+.|+-.|.....|.+
T Consensus 18 ~id~C~~C~G~W~d~~el~~ 37 (41)
T PF13453_consen 18 EIDVCPSCGGIWFDAGELEK 37 (41)
T ss_pred EEEECCCCCeEEccHHHHHH
Confidence 44556666666665555543
No 157
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=38.25 E-value=15 Score=29.69 Aligned_cols=24 Identities=29% Similarity=0.818 Sum_probs=16.4
Q ss_pred CccCccchhccCChHHHHHHHhhhcCCCCcccccchH
Q psy359 188 KFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDK 224 (405)
Q Consensus 188 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 224 (405)
.|+|.+||..+ -++.|-+||+|+.
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 57888887643 3466777888873
No 158
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=37.52 E-value=22 Score=30.21 Aligned_cols=31 Identities=19% Similarity=0.518 Sum_probs=24.0
Q ss_pred cccceecCCCcccCCChHHHHHHHhhhccCC
Q psy359 276 RKLIFKCDLCGNILSSKHILQEHVRVVHMGL 306 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 306 (405)
.+..|.|..|++.|.-...+..|+..-|...
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 4556999999999999999999999888654
No 159
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=37.21 E-value=13 Score=29.02 Aligned_cols=14 Identities=21% Similarity=0.641 Sum_probs=9.6
Q ss_pred ccccccccccccCc
Q psy359 342 PYACTQCDKTFKKK 355 (405)
Q Consensus 342 ~~~C~~C~~~f~~~ 355 (405)
.+.|..||..|...
T Consensus 70 ~~~C~~CG~~~~~~ 83 (135)
T PRK03824 70 VLKCRNCGNEWSLK 83 (135)
T ss_pred EEECCCCCCEEecc
Confidence 37777777777654
No 160
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=37.12 E-value=20 Score=22.82 Aligned_cols=26 Identities=35% Similarity=0.851 Sum_probs=14.9
Q ss_pred ccccccccccCchhHHHHhhhhCCCCcccCCccccccC
Q psy359 344 ACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFA 381 (405)
Q Consensus 344 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 381 (405)
.|..|++.|... .+.+.|..||+.|=
T Consensus 4 ~C~~C~~~F~~~------------~rk~~Cr~Cg~~~C 29 (57)
T cd00065 4 SCMGCGKPFTLT------------RRRHHCRNCGRIFC 29 (57)
T ss_pred cCcccCccccCC------------ccccccCcCcCCcC
Confidence 466677777652 23445666666544
No 161
>PF14353 CpXC: CpXC protein
Probab=37.01 E-value=20 Score=27.61 Aligned_cols=33 Identities=21% Similarity=0.394 Sum_probs=26.9
Q ss_pred ceeccccccccCCCcccCCceeeeeEEEEeecC
Q psy359 8 SIKCEACHRTFSSPYEYGGHKLKCNLFLVQTTD 40 (405)
Q Consensus 8 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~ 40 (405)
.|+|+.|+..|...+.+..|=..++.++.-.++
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~~~~~~i~~~P~ 70 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDPEKKFMIYYFPD 70 (128)
T ss_pred EEECCCCCCceecCCCEEEEcCCCCEEEEEcCC
Confidence 689999999999999999997777665555444
No 162
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=36.14 E-value=4.8 Score=23.68 Aligned_cols=10 Identities=30% Similarity=1.172 Sum_probs=5.4
Q ss_pred cccccccccc
Q psy359 343 YACTQCDKTF 352 (405)
Q Consensus 343 ~~C~~C~~~f 352 (405)
|.|..|+..|
T Consensus 29 y~C~~C~~~w 38 (39)
T PF01096_consen 29 YVCCNCGHRW 38 (39)
T ss_dssp EEESSSTEEE
T ss_pred EEeCCCCCee
Confidence 5555555543
No 163
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=36.06 E-value=13 Score=26.56 Aligned_cols=14 Identities=21% Similarity=0.546 Sum_probs=7.9
Q ss_pred CcccCCccccccCC
Q psy359 369 KPYQCDICGRGFAQ 382 (405)
Q Consensus 369 ~~~~C~~C~~~f~~ 382 (405)
..+.|..|++.|.-
T Consensus 53 GIW~C~~C~~~~AG 66 (90)
T PTZ00255 53 GIWRCKGCKKTVAG 66 (90)
T ss_pred EEEEcCCCCCEEeC
Confidence 34566666665543
No 164
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=35.54 E-value=16 Score=23.28 Aligned_cols=30 Identities=27% Similarity=0.703 Sum_probs=17.9
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
++|+.|+..+.-.... .| ....|+.||..|
T Consensus 3 ~~CP~CG~~iev~~~~-------~G-eiV~Cp~CGael 32 (54)
T TIGR01206 3 FECPDCGAEIELENPE-------LG-ELVICDECGAEL 32 (54)
T ss_pred cCCCCCCCEEecCCCc-------cC-CEEeCCCCCCEE
Confidence 6788888776543322 12 235788887654
No 165
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=35.25 E-value=29 Score=29.80 Aligned_cols=57 Identities=19% Similarity=0.419 Sum_probs=31.8
Q ss_pred CCCcCCCchHhhhcHHHHHHhhccCCCCccCccchhccCChHHHHHHHhhhcCCCCcccccchHhhhC
Q psy359 161 KTDRCELCDKVVTNLAAHHNEVHAHERKFPCTFCEKSFKRKLHLKVHTRTHTGEKPYACYLCDKRFAQ 228 (405)
Q Consensus 161 ~~~~C~~C~~~f~~l~~~~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 228 (405)
..|.|..|...+.. ++...+..-.|..|.+.|.-... - +. -|...|.|+.|+..|.-
T Consensus 111 rqFaC~~Cd~~WwR------rvp~rKeVSRCr~C~~rYDPVP~---d-km-wG~aef~C~~C~h~F~G 167 (278)
T PF15135_consen 111 RQFACSSCDHMWWR------RVPQRKEVSRCRKCRKRYDPVPC---D-KM-WGIAEFHCPKCRHNFRG 167 (278)
T ss_pred eeeeccccchHHHh------ccCcccccccccccccccCCCcc---c-cc-cceeeeecccccccchh
Confidence 45666666553321 11122334678888887764331 0 11 23345888888888874
No 166
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=34.97 E-value=11 Score=26.95 Aligned_cols=12 Identities=42% Similarity=1.165 Sum_probs=6.6
Q ss_pred cccCCccccccC
Q psy359 370 PYQCDICGRGFA 381 (405)
Q Consensus 370 ~~~C~~C~~~f~ 381 (405)
.++|..|++.|.
T Consensus 53 IW~C~~C~~~~A 64 (90)
T PF01780_consen 53 IWKCKKCGKKFA 64 (90)
T ss_dssp EEEETTTTEEEE
T ss_pred EeecCCCCCEEe
Confidence 355666665554
No 167
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=34.76 E-value=29 Score=31.39 Aligned_cols=33 Identities=21% Similarity=0.557 Sum_probs=15.0
Q ss_pred CcCCCcccch-HHHHhhhhhcCCCCccccccccc
Q psy359 318 GVCDVCGEYK-KQLLQHKRLHFPLRPYACTQCDK 350 (405)
Q Consensus 318 ~~C~~C~~~~-~~l~~H~~~h~~~~~~~C~~C~~ 350 (405)
..|+.||..+ .++...-....|.|-..|..|+.
T Consensus 185 ~~CPvCGs~P~~s~~~~~~~~~G~RyL~CslC~t 218 (305)
T TIGR01562 185 TLCPACGSPPVASMVRQGGKETGLRYLSCSLCAT 218 (305)
T ss_pred CcCCCCCChhhhhhhcccCCCCCceEEEcCCCCC
Confidence 4566666663 22222111234445555555553
No 168
>KOG3408|consensus
Probab=34.26 E-value=31 Score=26.00 Aligned_cols=28 Identities=21% Similarity=0.336 Sum_probs=24.1
Q ss_pred CCcCcccccCCCccccCChhhHHHHHHh
Q psy359 123 KTLPLTYPCDQCDRTYQTKKSLYVHRRA 150 (405)
Q Consensus 123 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 150 (405)
..+...|.|-+|.+.|.+...|+.|.++
T Consensus 52 lPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 52 LPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 3445679999999999999999999876
No 169
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=33.39 E-value=13 Score=26.59 Aligned_cols=12 Identities=42% Similarity=1.010 Sum_probs=6.5
Q ss_pred cccCCccccccC
Q psy359 370 PYQCDICGRGFA 381 (405)
Q Consensus 370 ~~~C~~C~~~f~ 381 (405)
.+.|..|++.|.
T Consensus 53 IW~C~~C~~~~A 64 (91)
T TIGR00280 53 IWTCRKCGAKFA 64 (91)
T ss_pred EEEcCCCCCEEe
Confidence 455555555554
No 170
>KOG2071|consensus
Probab=33.20 E-value=28 Score=34.06 Aligned_cols=27 Identities=19% Similarity=0.340 Sum_probs=21.0
Q ss_pred CCCccCccchhccCChHHHHHHHhhhc
Q psy359 186 ERKFPCTFCEKSFKRKLHLKVHTRTHT 212 (405)
Q Consensus 186 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 212 (405)
..+..|..||.+|........|+..|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 356889999999988888777776664
No 171
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=32.89 E-value=30 Score=33.14 Aligned_cols=26 Identities=23% Similarity=0.556 Sum_probs=15.7
Q ss_pred ccCCccccccCChHHHHHHHHhhccc
Q psy359 371 YQCDICGRGFAQSNDMKKHRRTVHKA 396 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~~h~~ 396 (405)
..|+.|.+.|.....+..|+...|..
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~~ 83 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHPA 83 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhhh
Confidence 45666666666666666666665544
No 172
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=31.36 E-value=24 Score=26.61 Aligned_cols=25 Identities=20% Similarity=0.468 Sum_probs=16.6
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
.++|..|+..|.... ..+.|+.||.
T Consensus 70 ~~~C~~Cg~~~~~~~------------~~~~CP~Cgs 94 (113)
T PRK12380 70 QAWCWDCSQVVEIHQ------------HDAQCPHCHG 94 (113)
T ss_pred EEEcccCCCEEecCC------------cCccCcCCCC
Confidence 378888887776643 3356888873
No 173
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=31.34 E-value=9 Score=22.67 Aligned_cols=10 Identities=40% Similarity=1.232 Sum_probs=4.6
Q ss_pred ccCCcccccc
Q psy359 371 YQCDICGRGF 380 (405)
Q Consensus 371 ~~C~~C~~~f 380 (405)
|.|..|+..|
T Consensus 29 y~C~~C~~~w 38 (40)
T smart00440 29 YVCTKCGHRW 38 (40)
T ss_pred EEeCCCCCEe
Confidence 4444444433
No 174
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=30.46 E-value=15 Score=26.58 Aligned_cols=11 Identities=36% Similarity=0.860 Sum_probs=6.3
Q ss_pred ccCCccccccC
Q psy359 371 YQCDICGRGFA 381 (405)
Q Consensus 371 ~~C~~C~~~f~ 381 (405)
..|..||.+|-
T Consensus 47 ~~Cg~CGls~e 57 (104)
T COG4888 47 AVCGNCGLSFE 57 (104)
T ss_pred EEcccCcceEE
Confidence 45666666553
No 175
>PF03833 PolC_DP2: DNA polymerase II large subunit DP2; InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=30.14 E-value=17 Score=37.19 Aligned_cols=11 Identities=36% Similarity=0.939 Sum_probs=0.0
Q ss_pred cCCccccccCC
Q psy359 372 QCDICGRGFAQ 382 (405)
Q Consensus 372 ~C~~C~~~f~~ 382 (405)
.|+.|+.....
T Consensus 694 ~C~~C~~~~~~ 704 (900)
T PF03833_consen 694 ECPKCGRETTS 704 (900)
T ss_dssp -----------
T ss_pred ccccccccCcc
Confidence 57777765443
No 176
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=29.81 E-value=25 Score=26.61 Aligned_cols=26 Identities=19% Similarity=0.553 Sum_probs=17.5
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
..+|..|+..|..... .+.|+.|+..
T Consensus 70 ~~~C~~Cg~~~~~~~~------------~~~CP~Cgs~ 95 (115)
T TIGR00100 70 ECECEDCSEEVSPEID------------LYRCPKCHGI 95 (115)
T ss_pred EEEcccCCCEEecCCc------------CccCcCCcCC
Confidence 3788888877766532 3568888853
No 177
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=29.72 E-value=21 Score=26.85 Aligned_cols=25 Identities=24% Similarity=0.796 Sum_probs=16.1
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
++|..|+..|..... .+.|+.|+..
T Consensus 71 ~~C~~Cg~~~~~~~~------------~~~CP~Cgs~ 95 (113)
T PF01155_consen 71 ARCRDCGHEFEPDEF------------DFSCPRCGSP 95 (113)
T ss_dssp EEETTTS-EEECHHC------------CHH-SSSSSS
T ss_pred EECCCCCCEEecCCC------------CCCCcCCcCC
Confidence 788888888876542 2558888864
No 178
>KOG4124|consensus
Probab=29.61 E-value=9.1 Score=34.43 Aligned_cols=55 Identities=38% Similarity=0.850 Sum_probs=40.5
Q ss_pred CCcccccc--ccccccCchhHHHHhh-hhCC------------------CCcccCCccccccCChHHHHHHHHhhc
Q psy359 340 LRPYACTQ--CDKTFKKKNHLTTHYR-IHTG------------------EKPYQCDICGRGFAQSNDMKKHRRTVH 394 (405)
Q Consensus 340 ~~~~~C~~--C~~~f~~~~~l~~H~~-~h~~------------------~~~~~C~~C~~~f~~~~~l~~H~~~~h 394 (405)
.++|+|++ |++++.+...|.-|.. .|.. .++|+|++|.+.+....-|..|....|
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~~~ 422 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTHSH 422 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeehhh
Confidence 36789966 9999999888887754 3321 358999999999888776766655443
No 179
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=29.36 E-value=30 Score=22.06 Aligned_cols=28 Identities=29% Similarity=0.726 Sum_probs=17.5
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccccCC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQ 382 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 382 (405)
.+|..|++.|.... ..-.|+.|+..+.+
T Consensus 6 ~~C~~Cg~~~~~~d------------DiVvCp~CgapyHR 33 (54)
T PF14446_consen 6 CKCPVCGKKFKDGD------------DIVVCPECGAPYHR 33 (54)
T ss_pred ccChhhCCcccCCC------------CEEECCCCCCcccH
Confidence 46777777776553 23457777766543
No 180
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=29.25 E-value=16 Score=28.79 Aligned_cols=42 Identities=17% Similarity=0.438 Sum_probs=22.2
Q ss_pred cCCCcccchHHHHhhhhhcCC---CCccccccccccccCchhHHH
Q psy359 319 VCDVCGEYKKQLLQHKRLHFP---LRPYACTQCDKTFKKKNHLTT 360 (405)
Q Consensus 319 ~C~~C~~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~ 360 (405)
.||.|+...+....-.....| -+.-.|+.|+..|++......
T Consensus 2 ~CPfC~~~~tkViDSR~~edg~aIRRRReC~~C~~RFTTfE~~El 46 (156)
T COG1327 2 KCPFCGHEDTKVIDSRPAEEGNAIRRRRECLECGERFTTFERAEL 46 (156)
T ss_pred CCCCCCCCCCeeeecccccccchhhhhhcccccccccchhheeee
Confidence 488887652222222111111 133578888888887654433
No 181
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=28.76 E-value=16 Score=28.78 Aligned_cols=42 Identities=17% Similarity=0.441 Sum_probs=22.9
Q ss_pred cCCCcccchHHHHhhhhhcCC--C-CccccccccccccCchhHHH
Q psy359 319 VCDVCGEYKKQLLQHKRLHFP--L-RPYACTQCDKTFKKKNHLTT 360 (405)
Q Consensus 319 ~C~~C~~~~~~l~~H~~~h~~--~-~~~~C~~C~~~f~~~~~l~~ 360 (405)
+||.|+...+....-.....| . +.-+|..|++.|++......
T Consensus 2 ~CP~C~~~dtkViDSR~~~dg~~IRRRReC~~C~~RFTTyErve~ 46 (147)
T TIGR00244 2 HCPFCQHHNTRVLDSRLVEDGQSIRRRRECLECHERFTTFERAEL 46 (147)
T ss_pred CCCCCCCCCCEeeeccccCCCCeeeecccCCccCCccceeeeccc
Confidence 588888752222221111111 1 33688888888888765543
No 182
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.75 E-value=34 Score=20.43 Aligned_cols=16 Identities=31% Similarity=0.787 Sum_probs=10.8
Q ss_pred cCCccccccCChHHHH
Q psy359 372 QCDICGRGFAQSNDMK 387 (405)
Q Consensus 372 ~C~~C~~~f~~~~~l~ 387 (405)
.|..|++.|+...-..
T Consensus 10 ~C~~C~rpf~WRKKW~ 25 (42)
T PF10013_consen 10 ICPVCGRPFTWRKKWA 25 (42)
T ss_pred cCcccCCcchHHHHHH
Confidence 5777888877655443
No 183
>KOG3408|consensus
Probab=28.71 E-value=33 Score=25.85 Aligned_cols=22 Identities=27% Similarity=0.798 Sum_probs=12.2
Q ss_pred ccccccccccccCchhHHHHhh
Q psy359 342 PYACTQCDKTFKKKNHLTTHYR 363 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~ 363 (405)
.|.|-.|.+.|.+...|..|.+
T Consensus 57 qfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 57 QFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred eeehhhhhhhhcchHHHHHHHh
Confidence 3555555555555555555543
No 184
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=28.29 E-value=24 Score=26.75 Aligned_cols=27 Identities=26% Similarity=0.707 Sum_probs=19.3
Q ss_pred CccccccccccccCchhHHHHhhhhCCCCcc-cCCccccc
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHTGEKPY-QCDICGRG 379 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~ 379 (405)
-.++|..|+..|..... .| .|+.|+..
T Consensus 70 ~~~~C~~Cg~~~~~~~~------------~~~~CP~Cgs~ 97 (117)
T PRK00564 70 VELECKDCSHVFKPNAL------------DYGVCEKCHSK 97 (117)
T ss_pred CEEEhhhCCCccccCCc------------cCCcCcCCCCC
Confidence 34899999988876531 33 49999954
No 185
>PF09416 UPF1_Zn_bind: RNA helicase (UPF2 interacting domain); InterPro: IPR018999 UPF1 (or regulator of nonsense transcripts 1 homologue) is an essential RNA helicase that detects mRNAs containing premature stop codons and triggers their degradation. This domain contains 3 zinc binding motifs and forms interactions with another protein (UPF2) that is also involved nonsense-mediated mRNA decay (NMD) []. ; GO: 0003677 DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, 0005737 cytoplasm; PDB: 2IYK_B 2WJY_A 2WJV_A 2XZL_A.
Probab=28.12 E-value=49 Score=26.31 Aligned_cols=40 Identities=28% Similarity=0.626 Sum_probs=20.6
Q ss_pred CCCccccccccccccC------chhHHHHhh-hhC-----------CCCcccCCcccc
Q psy359 339 PLRPYACTQCDKTFKK------KNHLTTHYR-IHT-----------GEKPYQCDICGR 378 (405)
Q Consensus 339 ~~~~~~C~~C~~~f~~------~~~l~~H~~-~h~-----------~~~~~~C~~C~~ 378 (405)
..-..+|..|++.|-+ .+.+..|+. ..| |+..++|..||-
T Consensus 11 p~~vv~C~~c~kWFCNg~~~~s~SHIv~HLv~srh~ev~LH~~s~lgdt~leCy~Cg~ 68 (152)
T PF09416_consen 11 PSCVVKCNTCNKWFCNGRGNTSGSHIVNHLVRSRHKEVSLHPDSPLGDTVLECYNCGS 68 (152)
T ss_dssp CCCEEEETTTTEEEES--TTSSS-HHHHHHHHHT---EEE-TTSTT-S-B---TTT--
T ss_pred cccEeEcCCCCcEeecCCCCCcccHHHHHHHHccCCceeeCCCCCCCCcEEEEEecCC
Confidence 3345789999999965 556666653 222 234578888874
No 186
>KOG4167|consensus
Probab=27.70 E-value=25 Score=35.19 Aligned_cols=26 Identities=31% Similarity=0.634 Sum_probs=23.4
Q ss_pred CcccccchHhhhChHHHHHHHhhhCC
Q psy359 216 PYACYLCDKRFAQISDRIKHLKSSHN 241 (405)
Q Consensus 216 ~~~C~~C~~~f~~~~~l~~H~~~~h~ 241 (405)
.|.|..|++.|....++..||++|..
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 39999999999999999999999653
No 187
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=27.47 E-value=18 Score=25.85 Aligned_cols=12 Identities=42% Similarity=1.057 Sum_probs=6.5
Q ss_pred cccCCccccccC
Q psy359 370 PYQCDICGRGFA 381 (405)
Q Consensus 370 ~~~C~~C~~~f~ 381 (405)
.+.|..|++.|.
T Consensus 54 IW~C~~C~~~~A 65 (90)
T PRK03976 54 IWECRKCGAKFA 65 (90)
T ss_pred EEEcCCCCCEEe
Confidence 355555555554
No 189
>PF12773 DZR: Double zinc ribbon
Probab=27.21 E-value=35 Score=21.04 Aligned_cols=26 Identities=19% Similarity=0.547 Sum_probs=17.5
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
-.|+.||.... -.......|+.|+..
T Consensus 13 ~fC~~CG~~l~-----------~~~~~~~~C~~Cg~~ 38 (50)
T PF12773_consen 13 KFCPHCGTPLP-----------PPDQSKKICPNCGAE 38 (50)
T ss_pred cCChhhcCChh-----------hccCCCCCCcCCcCC
Confidence 46888887777 122345679888874
No 190
>COG2879 Uncharacterized small protein [Function unknown]
Probab=26.86 E-value=65 Score=21.15 Aligned_cols=19 Identities=16% Similarity=0.235 Sum_probs=14.4
Q ss_pred CChHHHHHHHHhhcccccc
Q psy359 381 AQSNDMKKHRRTVHKAQIH 399 (405)
Q Consensus 381 ~~~~~l~~H~~~~h~~~~~ 399 (405)
+.-.+...|++.+|+++|.
T Consensus 23 pdYdnYVehmr~~hPd~p~ 41 (65)
T COG2879 23 PDYDNYVEHMRKKHPDKPP 41 (65)
T ss_pred CcHHHHHHHHHHhCcCCCc
Confidence 4556778888888888874
No 191
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=26.74 E-value=17 Score=28.75 Aligned_cols=9 Identities=22% Similarity=0.899 Sum_probs=5.5
Q ss_pred ccccccccc
Q psy359 343 YACTQCDKT 351 (405)
Q Consensus 343 ~~C~~C~~~ 351 (405)
|.|..||..
T Consensus 113 l~C~~Cg~~ 121 (146)
T PF07295_consen 113 LVCENCGHE 121 (146)
T ss_pred EecccCCCE
Confidence 666666644
No 192
>KOG2593|consensus
Probab=26.65 E-value=40 Score=31.66 Aligned_cols=21 Identities=24% Similarity=0.338 Sum_probs=16.3
Q ss_pred cccceecCCCcccCCChHHHH
Q psy359 276 RKLIFKCDLCGNILSSKHILQ 296 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~~~~l~ 296 (405)
....|.|+.|.++|+....+.
T Consensus 125 ~~~~Y~Cp~C~kkyt~Lea~~ 145 (436)
T KOG2593|consen 125 NVAGYVCPNCQKKYTSLEALQ 145 (436)
T ss_pred ccccccCCccccchhhhHHHH
Confidence 455699999999998876655
No 193
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=26.62 E-value=30 Score=18.07 Aligned_cols=7 Identities=29% Similarity=1.008 Sum_probs=3.2
Q ss_pred ccccccc
Q psy359 344 ACTQCDK 350 (405)
Q Consensus 344 ~C~~C~~ 350 (405)
.|+.||.
T Consensus 4 ~Cp~Cg~ 10 (26)
T PF13248_consen 4 FCPNCGA 10 (26)
T ss_pred CCcccCC
Confidence 3444444
No 194
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=25.46 E-value=39 Score=31.84 Aligned_cols=12 Identities=25% Similarity=0.802 Sum_probs=6.0
Q ss_pred cccCCccccccC
Q psy359 370 PYQCDICGRGFA 381 (405)
Q Consensus 370 ~~~C~~C~~~f~ 381 (405)
.|+|+.|++.+.
T Consensus 367 g~rC~kCg~~~~ 378 (421)
T COG1571 367 GFRCKKCGTRAR 378 (421)
T ss_pred CcccccccccCC
Confidence 455555554444
No 195
>PRK04351 hypothetical protein; Provisional
Probab=25.39 E-value=34 Score=27.28 Aligned_cols=32 Identities=25% Similarity=0.698 Sum_probs=19.1
Q ss_pred cccccccccccCchhHHHHhhhhCCCCcccCCccccccCC
Q psy359 343 YACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGFAQ 382 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 382 (405)
|+|..|+..+.... .+ +...|.|..|+-.+..
T Consensus 113 Y~C~~Cg~~~~r~R-------r~-n~~~yrCg~C~g~L~~ 144 (149)
T PRK04351 113 YECQSCGQQYLRKR-------RI-NTKRYRCGKCRGKLKL 144 (149)
T ss_pred EECCCCCCEeeeee-------ec-CCCcEEeCCCCcEeee
Confidence 77777776664421 11 3466778888765543
No 196
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=24.43 E-value=31 Score=19.88 Aligned_cols=16 Identities=31% Similarity=0.569 Sum_probs=9.0
Q ss_pred CcccceeccccccccC
Q psy359 4 KETTSIKCEACHRTFS 19 (405)
Q Consensus 4 ~~~~~~~C~~C~~~f~ 19 (405)
+.-+.|+|..|+..+.
T Consensus 2 ~~~~~YkC~~CGniVe 17 (36)
T PF06397_consen 2 KKGEFYKCEHCGNIVE 17 (36)
T ss_dssp -TTEEEE-TTT--EEE
T ss_pred CcccEEEccCCCCEEE
Confidence 4567899999987654
No 197
>KOG0717|consensus
Probab=24.41 E-value=35 Score=32.35 Aligned_cols=22 Identities=27% Similarity=0.523 Sum_probs=20.3
Q ss_pred cccccchHhhhChHHHHHHHhh
Q psy359 217 YACYLCDKRFAQISDRIKHLKS 238 (405)
Q Consensus 217 ~~C~~C~~~f~~~~~l~~H~~~ 238 (405)
+.|.+|+++|.+...|+.|...
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 8999999999999999999864
No 198
>KOG2906|consensus
Probab=24.38 E-value=6.3 Score=28.25 Aligned_cols=17 Identities=18% Similarity=0.428 Sum_probs=12.4
Q ss_pred cceecCCCcccCCChHH
Q psy359 278 LIFKCDLCGNILSSKHI 294 (405)
Q Consensus 278 ~~~~C~~C~~~f~~~~~ 294 (405)
..|.|..|+..|.-...
T Consensus 20 ~rf~C~tCpY~~~I~~e 36 (105)
T KOG2906|consen 20 NRFSCRTCPYVFPISRE 36 (105)
T ss_pred eeEEcCCCCceeeEeee
Confidence 35889999988766533
No 199
>KOG4124|consensus
Probab=24.01 E-value=21 Score=32.26 Aligned_cols=70 Identities=24% Similarity=0.386 Sum_probs=45.1
Q ss_pred ccceecC--CCcccCCChHHHHHHHhhhccCCCccccccCCCCCcCCCcccchHHHHhhhhhcCCCCccccccccccccC
Q psy359 277 KLIFKCD--LCGNILSSKHILQEHVRVVHMGLSRKYHYEYKPDGVCDVCGEYKKQLLQHKRLHFPLRPYACTQCDKTFKK 354 (405)
Q Consensus 277 ~~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~C~~C~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 354 (405)
.++|+|. .|++.+.....|..|-...|-..-.-......| |.......|+|+|++|.+.++.
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~p----------------h~~~~~~nk~~r~~i~~~~~k~ 410 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIP----------------HQGFVVENKPYRCEVCSKRYKN 410 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCC----------------cceeeeccCcccChhhhhhhcc
Confidence 4567885 599999888888888765443221000111122 3334456789999999999998
Q ss_pred chhHHHHh
Q psy359 355 KNHLTTHY 362 (405)
Q Consensus 355 ~~~l~~H~ 362 (405)
...|.-|.
T Consensus 411 ~~~l~~~~ 418 (442)
T KOG4124|consen 411 LNGLKYHR 418 (442)
T ss_pred CCCCCcee
Confidence 87766553
No 200
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=23.96 E-value=34 Score=26.52 Aligned_cols=14 Identities=36% Similarity=0.930 Sum_probs=8.3
Q ss_pred ccccccccccccCc
Q psy359 342 PYACTQCDKTFKKK 355 (405)
Q Consensus 342 ~~~C~~C~~~f~~~ 355 (405)
.|+|+.|++.|...
T Consensus 53 RyrC~~C~~tf~~~ 66 (129)
T COG3677 53 RYKCKSCGSTFTVE 66 (129)
T ss_pred ccccCCcCcceeee
Confidence 36666666666553
No 201
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=23.72 E-value=82 Score=18.68 Aligned_cols=24 Identities=21% Similarity=0.505 Sum_probs=17.0
Q ss_pred cccCCCccccC--ChhhHHHHHHhhc
Q psy359 129 YPCDQCDRTYQ--TKKSLYVHRRAHL 152 (405)
Q Consensus 129 ~~C~~C~~~f~--~~~~l~~H~~~h~ 152 (405)
-.|+.|+-.|. ...+-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 47999998884 4555566777653
No 202
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=23.71 E-value=75 Score=20.37 Aligned_cols=17 Identities=24% Similarity=0.460 Sum_probs=10.9
Q ss_pred CCcccCCccccccCChH
Q psy359 368 EKPYQCDICGRGFAQSN 384 (405)
Q Consensus 368 ~~~~~C~~C~~~f~~~~ 384 (405)
...|.|+.||-.+.-+.
T Consensus 12 ~v~~~Cp~cGipthcS~ 28 (55)
T PF13824_consen 12 HVNFECPDCGIPTHCSE 28 (55)
T ss_pred ccCCcCCCCCCcCccCH
Confidence 34577888877665443
No 203
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=23.59 E-value=30 Score=20.69 Aligned_cols=8 Identities=38% Similarity=0.933 Sum_probs=3.6
Q ss_pred ccCCcccc
Q psy359 371 YQCDICGR 378 (405)
Q Consensus 371 ~~C~~C~~ 378 (405)
+.|+.||.
T Consensus 20 ~vC~~CG~ 27 (43)
T PF08271_consen 20 LVCPNCGL 27 (43)
T ss_dssp EEETTT-B
T ss_pred EECCCCCC
Confidence 45555553
No 204
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=23.56 E-value=37 Score=19.90 Aligned_cols=12 Identities=33% Similarity=0.985 Sum_probs=8.0
Q ss_pred cccCCccccccC
Q psy359 370 PYQCDICGRGFA 381 (405)
Q Consensus 370 ~~~C~~C~~~f~ 381 (405)
|++|..|++.|=
T Consensus 12 ~f~C~~C~~~FC 23 (39)
T smart00154 12 GFKCRHCGNLFC 23 (39)
T ss_pred CeECCccCCccc
Confidence 567777776664
No 205
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=23.53 E-value=50 Score=20.95 Aligned_cols=16 Identities=19% Similarity=0.634 Sum_probs=9.6
Q ss_pred cccccccccccCchhH
Q psy359 343 YACTQCDKTFKKKNHL 358 (405)
Q Consensus 343 ~~C~~C~~~f~~~~~l 358 (405)
.+|+.||..|.....-
T Consensus 29 W~C~~Cgh~w~~~v~~ 44 (55)
T PF14311_consen 29 WKCPKCGHEWKASVND 44 (55)
T ss_pred EECCCCCCeeEccHhh
Confidence 5677776666655433
No 206
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=22.99 E-value=23 Score=26.70 Aligned_cols=12 Identities=33% Similarity=0.963 Sum_probs=7.5
Q ss_pred cccccccccccc
Q psy359 342 PYACTQCDKTFK 353 (405)
Q Consensus 342 ~~~C~~C~~~f~ 353 (405)
.|+|..||..|.
T Consensus 100 Fy~C~~Cg~~wr 111 (113)
T COG1594 100 FYKCTRCGYRWR 111 (113)
T ss_pred EEEecccCCEee
Confidence 467777766554
No 207
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=22.86 E-value=43 Score=26.20 Aligned_cols=16 Identities=25% Similarity=0.740 Sum_probs=11.2
Q ss_pred cccceecCCCcccCCC
Q psy359 276 RKLIFKCDLCGNILSS 291 (405)
Q Consensus 276 ~~~~~~C~~C~~~f~~ 291 (405)
....|.|..||....+
T Consensus 78 ~nl~~~CE~CG~~I~~ 93 (137)
T TIGR03826 78 PNLGYPCERCGTSIRE 93 (137)
T ss_pred CCCcCcccccCCcCCC
Confidence 4556889999875533
No 208
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=22.44 E-value=48 Score=18.70 Aligned_cols=21 Identities=24% Similarity=0.577 Sum_probs=13.1
Q ss_pred ccCCccccccCChHHHHHHHHh
Q psy359 371 YQCDICGRGFAQSNDMKKHRRT 392 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~ 392 (405)
+.|+.|++.+ ..+-+..|+..
T Consensus 5 ~~C~nC~R~v-~a~RfA~HLek 25 (33)
T PF08209_consen 5 VECPNCGRPV-AASRFAPHLEK 25 (33)
T ss_dssp EE-TTTSSEE-EGGGHHHHHHH
T ss_pred EECCCCcCCc-chhhhHHHHHH
Confidence 6788888764 35556677653
No 209
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=22.38 E-value=32 Score=21.80 Aligned_cols=12 Identities=33% Similarity=0.999 Sum_probs=6.5
Q ss_pred cCCccccccCCh
Q psy359 372 QCDICGRGFAQS 383 (405)
Q Consensus 372 ~C~~C~~~f~~~ 383 (405)
.|++|++.|...
T Consensus 22 ~CPlC~r~l~~e 33 (54)
T PF04423_consen 22 CCPLCGRPLDEE 33 (54)
T ss_dssp E-TTT--EE-HH
T ss_pred cCCCCCCCCCHH
Confidence 899999998863
No 210
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=22.21 E-value=45 Score=25.18 Aligned_cols=26 Identities=15% Similarity=0.451 Sum_probs=16.0
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCcccc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGR 378 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 378 (405)
.++|..|+..|..... ..+.|+.||.
T Consensus 70 ~~~C~~Cg~~~~~~~~-----------~~~~CP~Cgs 95 (114)
T PRK03681 70 ECWCETCQQYVTLLTQ-----------RVRRCPQCHG 95 (114)
T ss_pred EEEcccCCCeeecCCc-----------cCCcCcCcCC
Confidence 3788888876655321 1156888884
No 211
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=22.20 E-value=22 Score=35.73 Aligned_cols=30 Identities=33% Similarity=0.751 Sum_probs=24.7
Q ss_pred ccccccccccCchhHHHHhhhhCCCCcccCCccccc
Q psy359 344 ACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRG 379 (405)
Q Consensus 344 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 379 (405)
.|+.|.+.|......+.| ..|.-|+.||-.
T Consensus 153 lC~~C~~EY~dP~nRRfH------AQp~aCp~CGP~ 182 (750)
T COG0068 153 LCPFCDKEYKDPLNRRFH------AQPIACPKCGPH 182 (750)
T ss_pred CCHHHHHHhcCccccccc------cccccCcccCCC
Confidence 799999999998888777 456789999864
No 212
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=22.11 E-value=39 Score=21.48 Aligned_cols=13 Identities=15% Similarity=0.511 Sum_probs=10.0
Q ss_pred cccceecCCCccc
Q psy359 276 RKLIFKCDLCGNI 288 (405)
Q Consensus 276 ~~~~~~C~~C~~~ 288 (405)
.+.|+.|+.|.+.
T Consensus 25 kNfPlyCpKCK~E 37 (55)
T PF14205_consen 25 KNFPLYCPKCKQE 37 (55)
T ss_pred ccccccCCCCCce
Confidence 4678889999764
No 213
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=21.45 E-value=48 Score=19.65 Aligned_cols=13 Identities=15% Similarity=0.626 Sum_probs=10.9
Q ss_pred ccceecccccccc
Q psy359 6 TTSIKCEACHRTF 18 (405)
Q Consensus 6 ~~~~~C~~C~~~f 18 (405)
++...|+.|+.+|
T Consensus 27 ~~~~~CpYCg~~y 39 (40)
T PF10276_consen 27 PGPVVCPYCGTRY 39 (40)
T ss_dssp TCEEEETTTTEEE
T ss_pred CCeEECCCCCCEE
Confidence 3578999999887
No 214
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=21.37 E-value=42 Score=18.12 Aligned_cols=19 Identities=26% Similarity=0.704 Sum_probs=14.3
Q ss_pred eeccccccccCCCcccCCce
Q psy359 9 IKCEACHRTFSSPYEYGGHK 28 (405)
Q Consensus 9 ~~C~~C~~~f~~~~~l~~H~ 28 (405)
|.|--|++.| ....+..|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 5788999999 667888883
No 215
>PF14787 zf-CCHC_5: GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=21.17 E-value=40 Score=19.34 Aligned_cols=15 Identities=33% Similarity=0.922 Sum_probs=9.1
Q ss_pred cCCccccccCChHHH
Q psy359 372 QCDICGRGFAQSNDM 386 (405)
Q Consensus 372 ~C~~C~~~f~~~~~l 386 (405)
.|+.|++.|..+++.
T Consensus 4 ~CprC~kg~Hwa~~C 18 (36)
T PF14787_consen 4 LCPRCGKGFHWASEC 18 (36)
T ss_dssp C-TTTSSSCS-TTT-
T ss_pred cCcccCCCcchhhhh
Confidence 588888888776654
No 216
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=21.15 E-value=44 Score=24.39 Aligned_cols=16 Identities=31% Similarity=0.731 Sum_probs=12.9
Q ss_pred CCccccccccccccCc
Q psy359 340 LRPYACTQCDKTFKKK 355 (405)
Q Consensus 340 ~~~~~C~~C~~~f~~~ 355 (405)
.++++|..||..|.-.
T Consensus 77 g~~~rC~eCG~~fkL~ 92 (97)
T cd00924 77 GKPKRCPECGHVFKLV 92 (97)
T ss_pred CCceeCCCCCcEEEEE
Confidence 3789999999998753
No 217
>KOG1842|consensus
Probab=21.09 E-value=48 Score=31.23 Aligned_cols=26 Identities=15% Similarity=0.404 Sum_probs=22.3
Q ss_pred CccccccccccccCchhHHHHhhhhC
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHT 366 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 366 (405)
..|.||+|...|.+.+.|..|.-..|
T Consensus 14 egflCPiC~~dl~~~~~L~~H~d~eH 39 (505)
T KOG1842|consen 14 EGFLCPICLLDLPNLSALNDHLDVEH 39 (505)
T ss_pred hcccCchHhhhhhhHHHHHHHHhhhc
Confidence 34899999999999999999986544
No 218
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=21.03 E-value=70 Score=30.75 Aligned_cols=28 Identities=18% Similarity=0.311 Sum_probs=25.3
Q ss_pred ceecCCCcccCCChHHHHHHHhhhccCC
Q psy359 279 IFKCDLCGNILSSKHILQEHVRVVHMGL 306 (405)
Q Consensus 279 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 306 (405)
-+.|+.|.+.|.....+..|+...|.+.
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 3679999999999999999999889776
No 219
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.91 E-value=20 Score=30.37 Aligned_cols=39 Identities=33% Similarity=0.694 Sum_probs=27.5
Q ss_pred CccccccccccccCchhHHHHhhhhCCC----------Cc-----ccCCccccc
Q psy359 341 RPYACTQCDKTFKKKNHLTTHYRIHTGE----------KP-----YQCDICGRG 379 (405)
Q Consensus 341 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~----------~~-----~~C~~C~~~ 379 (405)
+.+.||+|+..|.....+..-+++-.|+ .| ..|+.|..+
T Consensus 18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA 71 (267)
T COG1655 18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA 71 (267)
T ss_pred ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence 4478999999998887666655554442 12 489999864
No 220
>PLN02294 cytochrome c oxidase subunit Vb
Probab=20.62 E-value=46 Score=26.89 Aligned_cols=17 Identities=18% Similarity=0.622 Sum_probs=13.9
Q ss_pred CCCccccccccccccCc
Q psy359 339 PLRPYACTQCDKTFKKK 355 (405)
Q Consensus 339 ~~~~~~C~~C~~~f~~~ 355 (405)
..+|++|+.||..|.-.
T Consensus 138 kGkp~RCpeCG~~fkL~ 154 (174)
T PLN02294 138 KGKSFECPVCTQYFELE 154 (174)
T ss_pred CCCceeCCCCCCEEEEE
Confidence 45789999999999764
No 221
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=20.45 E-value=42 Score=26.54 Aligned_cols=32 Identities=31% Similarity=0.852 Sum_probs=17.3
Q ss_pred ccccccccccccCchhHHHHhhhhCCCCcccCCcccccc
Q psy359 342 PYACTQCDKTFKKKNHLTTHYRIHTGEKPYQCDICGRGF 380 (405)
Q Consensus 342 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 380 (405)
.|+|..|+..+... +.+.....|.|..|+-.|
T Consensus 112 ~y~C~~C~~~~~~~-------rr~~~~~~y~C~~C~g~l 143 (146)
T smart00731 112 PYRCTGCGQRYLRV-------RRSNNVSRYRCGKCGGKL 143 (146)
T ss_pred EEECCCCCCCCceE-------ccccCcceEEcCCCCCEE
Confidence 46777777665421 112222557777777554
No 222
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=20.42 E-value=44 Score=20.74 Aligned_cols=10 Identities=20% Similarity=0.783 Sum_probs=4.7
Q ss_pred cccccccccc
Q psy359 343 YACTQCDKTF 352 (405)
Q Consensus 343 ~~C~~C~~~f 352 (405)
|.|+.||..+
T Consensus 21 ~vC~~Cg~~~ 30 (52)
T smart00661 21 FVCRKCGYEE 30 (52)
T ss_pred EECCcCCCeE
Confidence 4455555433
No 223
>KOG4118|consensus
Probab=20.05 E-value=57 Score=21.49 Aligned_cols=28 Identities=18% Similarity=0.421 Sum_probs=16.5
Q ss_pred ccCCccccccCChHHHHHHHHhhccccc
Q psy359 371 YQCDICGRGFAQSNDMKKHRRTVHKAQI 398 (405)
Q Consensus 371 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~ 398 (405)
|.|.+|--..+.+..+.+|..+.|+..|
T Consensus 39 ~kCtVC~~~mpdpktfkqhfe~kHpk~~ 66 (74)
T KOG4118|consen 39 HKCTVCMVQMPDPKTFKQHFENKHPKEP 66 (74)
T ss_pred hhhHhhHhhCCCCchHHHHHhhcCCCCC
Confidence 5566666666666666666666665543
Done!