Query psy12647
Match_columns 410
No_of_seqs 210 out of 2745
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 15:57:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12647.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12647hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 3.2E-28 6.9E-33 203.2 4.6 136 248-403 127-264 (279)
2 KOG3608|consensus 99.9 5.8E-27 1.2E-31 200.3 6.7 218 165-407 133-377 (467)
3 KOG1074|consensus 99.9 1E-26 2.2E-31 219.5 4.7 194 194-408 605-933 (958)
4 KOG3608|consensus 99.9 5.9E-26 1.3E-30 194.2 5.9 313 10-394 69-398 (467)
5 KOG1074|consensus 99.9 2.4E-25 5.2E-30 210.3 3.4 208 165-383 604-936 (958)
6 KOG3623|consensus 99.9 9.4E-24 2E-28 196.3 5.4 98 165-265 209-323 (1007)
7 KOG2462|consensus 99.9 6E-23 1.3E-27 171.7 6.5 128 117-265 128-257 (279)
8 KOG3623|consensus 99.8 4.1E-20 8.9E-25 172.3 4.0 86 318-404 887-972 (1007)
9 KOG3576|consensus 99.6 3.1E-16 6.7E-21 124.7 0.6 111 295-408 115-238 (267)
10 KOG3576|consensus 99.5 2.2E-14 4.8E-19 114.2 2.2 71 194-265 117-187 (267)
11 PLN03086 PRLI-interacting fact 99.3 4.5E-12 9.7E-17 120.2 7.3 147 224-408 408-566 (567)
12 PLN03086 PRLI-interacting fact 99.1 1.5E-10 3.3E-15 109.9 7.6 136 88-264 407-554 (567)
13 PHA00733 hypothetical protein 99.1 3.2E-11 6.9E-16 93.3 1.5 82 324-407 39-124 (128)
14 PHA02768 hypothetical protein; 99.0 1.1E-10 2.4E-15 73.6 2.5 45 354-400 5-49 (55)
15 PHA00733 hypothetical protein 98.9 2.1E-09 4.6E-14 83.2 4.8 80 295-379 38-124 (128)
16 KOG3993|consensus 98.8 7.3E-10 1.6E-14 98.5 -1.0 196 194-409 267-485 (500)
17 PF13465 zf-H2C2_2: Zinc-finge 98.8 3.4E-09 7.4E-14 57.2 2.0 26 369-394 1-26 (26)
18 PHA02768 hypothetical protein; 98.7 6.8E-09 1.5E-13 65.6 2.1 44 325-371 5-48 (55)
19 KOG3993|consensus 98.7 3.4E-09 7.5E-14 94.3 0.5 173 165-352 266-485 (500)
20 PHA00616 hypothetical protein 98.5 4E-08 8.8E-13 58.9 1.7 40 354-393 1-40 (44)
21 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.4E-07 3.1E-12 50.8 1.7 26 340-366 1-26 (26)
22 PHA00732 hypothetical protein 98.3 5.2E-07 1.1E-11 63.1 3.1 48 325-378 1-48 (79)
23 PF05605 zf-Di19: Drought indu 98.2 2.2E-06 4.8E-11 55.6 4.5 52 325-379 2-54 (54)
24 PHA00732 hypothetical protein 98.2 1.5E-06 3.2E-11 60.8 3.0 43 354-402 1-44 (79)
25 PHA00616 hypothetical protein 98.0 1.9E-06 4.1E-11 51.8 1.3 36 325-361 1-36 (44)
26 PF05605 zf-Di19: Drought indu 97.9 2E-05 4.3E-10 51.2 4.3 51 194-247 2-53 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.8 1.4E-05 3.1E-10 41.7 1.6 23 355-377 1-23 (23)
28 PF12756 zf-C2H2_2: C2H2 type 97.5 1.1E-05 2.4E-10 60.3 -2.1 72 300-377 2-73 (100)
29 PF13894 zf-C2H2_4: C2H2-type 97.4 9.8E-05 2.1E-09 38.8 1.9 23 355-377 1-23 (24)
30 PF13912 zf-C2H2_6: C2H2-type 97.4 8E-05 1.7E-09 40.5 1.3 25 354-378 1-25 (27)
31 COG5189 SFP1 Putative transcri 97.3 0.00017 3.6E-09 62.6 2.6 28 379-406 395-422 (423)
32 PF00096 zf-C2H2: Zinc finger, 97.3 0.00011 2.4E-09 38.2 1.0 20 326-345 1-20 (23)
33 PF13894 zf-C2H2_4: C2H2-type 97.1 0.00038 8.1E-09 36.5 1.8 22 326-347 1-22 (24)
34 COG5189 SFP1 Putative transcri 97.0 0.0002 4.4E-09 62.1 0.8 27 296-322 348-374 (423)
35 PF09237 GAGA: GAGA factor; I 97.0 0.00051 1.1E-08 42.3 1.9 41 342-382 12-52 (54)
36 smart00355 ZnF_C2H2 zinc finge 96.8 0.0012 2.6E-08 35.1 2.4 24 355-378 1-24 (26)
37 PRK04860 hypothetical protein; 96.7 0.001 2.2E-08 53.6 2.5 39 354-396 119-157 (160)
38 PF12756 zf-C2H2_2: C2H2 type 96.7 0.00098 2.1E-08 49.6 2.3 73 327-405 1-73 (100)
39 PF13909 zf-H2C2_5: C2H2-type 96.6 0.0013 2.8E-08 34.5 1.6 23 355-378 1-23 (24)
40 PF13912 zf-C2H2_6: C2H2-type 96.6 0.0013 2.9E-08 35.6 1.6 24 325-349 1-24 (27)
41 KOG2231|consensus 96.4 0.0026 5.6E-08 62.2 3.2 20 326-345 183-202 (669)
42 PF12874 zf-met: Zinc-finger o 96.2 0.0024 5.3E-08 33.8 1.2 22 355-376 1-22 (25)
43 PF09237 GAGA: GAGA factor; I 95.9 0.0068 1.5E-07 37.4 2.1 40 212-251 13-52 (54)
44 KOG2482|consensus 95.6 0.059 1.3E-06 47.7 7.8 23 13-35 31-53 (423)
45 smart00355 ZnF_C2H2 zinc finge 95.6 0.011 2.3E-07 31.3 2.1 24 326-350 1-24 (26)
46 PF12874 zf-met: Zinc-finger o 95.4 0.0084 1.8E-07 31.7 1.1 22 89-110 1-22 (25)
47 PRK04860 hypothetical protein; 95.3 0.008 1.7E-07 48.5 1.4 39 325-368 119-157 (160)
48 PF13909 zf-H2C2_5: C2H2-type 95.3 0.011 2.3E-07 30.9 1.4 23 326-349 1-23 (24)
49 PF12171 zf-C2H2_jaz: Zinc-fin 95.2 0.0048 1E-07 33.4 -0.1 21 355-375 2-22 (27)
50 PF12171 zf-C2H2_jaz: Zinc-fin 95.2 0.0058 1.3E-07 33.1 0.2 22 383-404 2-23 (27)
51 COG5236 Uncharacterized conser 94.8 0.01 2.2E-07 52.4 0.8 105 297-406 151-275 (493)
52 KOG1146|consensus 94.8 0.01 2.3E-07 61.7 0.8 97 197-321 439-540 (1406)
53 KOG2231|consensus 94.1 0.085 1.8E-06 52.0 5.3 47 299-349 184-236 (669)
54 KOG2785|consensus 94.0 0.067 1.5E-06 48.4 4.0 52 194-245 166-242 (390)
55 KOG1146|consensus 92.8 0.015 3.2E-07 60.7 -2.2 77 328-404 439-540 (1406)
56 KOG2785|consensus 92.7 0.49 1.1E-05 43.0 7.4 25 165-191 2-26 (390)
57 KOG2482|consensus 92.7 0.17 3.7E-06 45.0 4.3 25 326-350 280-304 (423)
58 PF13913 zf-C2HC_2: zinc-finge 92.2 0.12 2.7E-06 27.2 1.9 19 356-375 4-22 (25)
59 COG5236 Uncharacterized conser 92.2 0.084 1.8E-06 46.9 1.8 79 166-245 151-242 (493)
60 KOG4124|consensus 91.7 0.078 1.7E-06 47.1 1.2 57 353-409 348-425 (442)
61 smart00451 ZnF_U1 U1-like zinc 90.7 0.17 3.8E-06 29.0 1.6 22 354-375 3-24 (35)
62 cd00350 rubredoxin_like Rubred 90.5 0.15 3.3E-06 29.0 1.2 10 353-362 16-25 (33)
63 smart00451 ZnF_U1 U1-like zinc 90.5 0.21 4.5E-06 28.7 1.9 21 325-345 3-23 (35)
64 COG5048 FOG: Zn-finger [Genera 90.2 0.07 1.5E-06 51.4 -0.6 61 194-254 289-354 (467)
65 PF12013 DUF3505: Protein of u 90.2 0.43 9.3E-06 36.0 3.8 25 355-379 81-109 (109)
66 COG5048 FOG: Zn-finger [Genera 89.0 0.13 2.7E-06 49.6 0.1 151 222-391 288-455 (467)
67 KOG2893|consensus 88.8 0.13 2.9E-06 43.1 0.1 48 168-219 12-59 (341)
68 KOG2893|consensus 87.7 0.19 4.2E-06 42.2 0.5 45 328-377 13-58 (341)
69 KOG4173|consensus 86.9 0.31 6.6E-06 40.1 1.1 80 165-245 78-168 (253)
70 COG4049 Uncharacterized protei 84.7 0.11 2.3E-06 32.7 -1.9 30 350-379 13-42 (65)
71 KOG4173|consensus 84.6 0.41 8.8E-06 39.4 0.8 80 296-376 78-168 (253)
72 cd00729 rubredoxin_SM Rubredox 83.1 0.68 1.5E-05 26.5 1.1 11 326-336 3-13 (34)
73 PF09986 DUF2225: Uncharacteri 81.9 0.33 7.1E-06 41.6 -0.8 43 353-395 4-61 (214)
74 PF12013 DUF3505: Protein of u 81.7 2.5 5.5E-05 31.8 4.1 74 165-248 10-109 (109)
75 COG4049 Uncharacterized protei 79.6 0.88 1.9E-05 28.7 0.8 25 325-349 17-41 (65)
76 COG2888 Predicted Zn-ribbon RN 77.0 1.9 4.1E-05 27.9 1.7 10 353-362 49-58 (61)
77 PF09538 FYDLN_acid: Protein o 75.8 1.5 3.3E-05 32.8 1.3 14 381-394 25-38 (108)
78 TIGR00622 ssl1 transcription f 74.7 4.9 0.00011 30.1 3.6 54 299-361 57-110 (112)
79 PF13719 zinc_ribbon_5: zinc-r 74.7 2.3 4.9E-05 24.8 1.6 7 173-179 7-13 (37)
80 PHA00626 hypothetical protein 74.2 0.92 2E-05 28.8 -0.2 9 355-363 24-32 (59)
81 smart00659 RPOLCX RNA polymera 74.0 2 4.3E-05 26.2 1.3 11 355-365 3-13 (44)
82 PF13717 zinc_ribbon_4: zinc-r 73.9 2.7 5.8E-05 24.4 1.7 7 173-179 7-13 (36)
83 PRK00464 nrdR transcriptional 73.8 0.79 1.7E-05 36.8 -0.7 13 383-395 29-41 (154)
84 PRK14890 putative Zn-ribbon RN 73.6 1.9 4.2E-05 27.9 1.1 10 353-362 47-56 (59)
85 PRK09678 DNA-binding transcrip 73.2 1.1 2.3E-05 30.7 -0.1 42 355-398 2-45 (72)
86 PF09986 DUF2225: Uncharacteri 72.9 0.78 1.7E-05 39.3 -1.0 43 325-367 5-61 (214)
87 TIGR00373 conserved hypothetic 72.6 3.1 6.7E-05 33.7 2.4 18 353-370 108-125 (158)
88 TIGR02098 MJ0042_CXXC MJ0042 f 72.4 2.5 5.3E-05 24.8 1.4 9 224-232 26-34 (38)
89 PF12907 zf-met2: Zinc-binding 72.1 2.8 6E-05 24.9 1.5 32 195-226 2-36 (40)
90 KOG2186|consensus 72.1 2 4.3E-05 36.9 1.2 45 89-139 4-48 (276)
91 PRK06266 transcription initiat 70.7 1.9 4E-05 35.8 0.8 30 325-364 117-146 (178)
92 smart00614 ZnF_BED BED zinc fi 70.3 3.6 7.8E-05 25.9 1.9 12 339-350 37-48 (50)
93 PF12907 zf-met2: Zinc-binding 68.2 3 6.6E-05 24.8 1.1 29 326-354 2-33 (40)
94 COG1592 Rubrerythrin [Energy p 68.0 3.1 6.8E-05 33.7 1.5 11 351-361 146-156 (166)
95 PF02892 zf-BED: BED zinc fing 67.5 2.8 6E-05 25.6 0.9 22 296-319 15-40 (45)
96 COG1592 Rubrerythrin [Energy p 65.2 3.7 8E-05 33.3 1.4 24 354-390 134-157 (166)
97 PF06524 NOA36: NOA36 protein; 63.3 1.5 3.2E-05 37.7 -1.2 64 296-378 170-233 (314)
98 PF10571 UPF0547: Uncharacteri 62.8 4.8 0.0001 21.4 1.1 10 326-335 15-24 (26)
99 smart00834 CxxC_CXXC_SSSS Puta 62.6 2.2 4.8E-05 25.4 -0.2 11 326-336 6-16 (41)
100 COG1997 RPL43A Ribosomal prote 62.6 2.8 6E-05 29.5 0.2 11 383-393 54-64 (89)
101 PF09723 Zn-ribbon_8: Zinc rib 62.2 1.6 3.5E-05 26.3 -0.9 13 326-338 6-18 (42)
102 COG1996 RPC10 DNA-directed RNA 62.2 3.5 7.6E-05 25.7 0.6 11 354-364 6-16 (49)
103 KOG4377|consensus 61.9 3.9 8.5E-05 37.7 1.1 28 296-323 400-427 (480)
104 TIGR02605 CxxC_CxxC_SSSS putat 60.5 2.6 5.6E-05 26.7 -0.2 12 326-337 6-17 (52)
105 smart00734 ZnF_Rad18 Rad18-lik 60.0 8.5 0.00018 20.4 1.8 19 356-375 3-21 (26)
106 TIGR02300 FYDLN_acid conserved 58.6 5.8 0.00013 30.3 1.4 13 353-365 25-37 (129)
107 smart00531 TFIIE Transcription 56.1 6.3 0.00014 31.5 1.3 13 325-337 99-111 (147)
108 PF09845 DUF2072: Zn-ribbon co 52.9 8.2 0.00018 29.7 1.4 15 325-339 1-15 (131)
109 COG3364 Zn-ribbon containing p 52.6 8.3 0.00018 28.0 1.3 16 325-340 2-17 (112)
110 PF06524 NOA36: NOA36 protein; 52.5 11 0.00025 32.5 2.3 19 222-240 208-226 (314)
111 PF08274 PhnA_Zn_Ribbon: PhnA 50.6 6.4 0.00014 21.8 0.4 25 356-391 4-28 (30)
112 PRK00398 rpoP DNA-directed RNA 50.3 11 0.00024 23.2 1.4 15 119-133 3-17 (46)
113 PF09538 FYDLN_acid: Protein o 48.1 11 0.00024 28.2 1.4 31 119-181 9-39 (108)
114 PF02176 zf-TRAF: TRAF-type zi 48.1 7.1 0.00015 25.5 0.3 33 25-57 23-60 (60)
115 TIGR00622 ssl1 transcription f 46.9 43 0.00092 25.2 4.3 21 87-107 14-34 (112)
116 PF15269 zf-C2H2_7: Zinc-finge 46.8 13 0.00028 22.4 1.3 22 355-376 21-42 (54)
117 TIGR00373 conserved hypothetic 45.6 14 0.00031 29.9 1.8 30 165-203 108-137 (158)
118 KOG2593|consensus 44.6 12 0.00025 35.2 1.3 35 324-362 127-161 (436)
119 PF08790 zf-LYAR: LYAR-type C2 44.3 11 0.00023 20.5 0.6 19 383-402 1-19 (28)
120 PF05290 Baculo_IE-1: Baculovi 42.2 26 0.00056 27.1 2.5 43 84-134 76-136 (140)
121 KOG1044|consensus 41.5 7.9 0.00017 37.6 -0.3 42 326-370 193-234 (670)
122 PTZ00255 60S ribosomal protein 40.7 14 0.0003 26.5 0.9 14 87-100 53-66 (90)
123 PRK06266 transcription initiat 40.1 20 0.00044 29.6 2.0 29 165-202 116-144 (178)
124 COG1198 PriA Primosomal protei 39.7 20 0.00043 36.8 2.1 30 347-391 455-484 (730)
125 PF04959 ARS2: Arsenite-resist 39.6 18 0.00038 30.9 1.5 25 326-350 78-102 (214)
126 PF04606 Ogr_Delta: Ogr/Delta- 38.9 20 0.00044 22.1 1.3 17 165-181 24-40 (47)
127 PF07754 DUF1610: Domain of un 38.9 15 0.00032 19.1 0.6 9 118-126 15-23 (24)
128 TIGR01384 TFS_arch transcripti 38.6 6.7 0.00015 29.2 -1.0 13 251-263 16-28 (104)
129 PF14353 CpXC: CpXC protein 37.5 23 0.0005 27.4 1.8 57 89-145 2-64 (128)
130 TIGR00280 L37a ribosomal prote 37.4 15 0.00033 26.3 0.7 14 87-100 52-65 (91)
131 smart00531 TFIIE Transcription 37.1 19 0.00042 28.7 1.4 15 165-181 98-112 (147)
132 COG3357 Predicted transcriptio 37.1 24 0.00052 25.1 1.6 29 324-363 57-85 (97)
133 PF08271 TF_Zn_Ribbon: TFIIB z 37.1 23 0.0005 21.3 1.4 9 89-97 20-28 (43)
134 COG4530 Uncharacterized protei 36.6 19 0.0004 26.6 1.0 12 381-392 25-36 (129)
135 KOG2593|consensus 36.2 40 0.00086 31.8 3.3 15 36-50 124-138 (436)
136 COG0068 HypF Hydrogenase matur 35.9 7.3 0.00016 39.0 -1.5 80 89-202 102-181 (750)
137 KOG2186|consensus 34.4 16 0.00035 31.5 0.5 30 315-345 18-48 (276)
138 COG5151 SSL1 RNA polymerase II 33.2 27 0.00058 31.2 1.6 22 324-345 387-408 (421)
139 PF04959 ARS2: Arsenite-resist 32.3 29 0.00062 29.7 1.7 27 194-220 77-103 (214)
140 PRK00432 30S ribosomal protein 32.2 25 0.00054 22.1 1.0 11 382-392 37-47 (50)
141 PF03604 DNA_RNApol_7kD: DNA d 32.2 26 0.00057 19.7 1.0 12 120-131 1-12 (32)
142 KOG1280|consensus 31.4 37 0.0008 30.9 2.2 37 194-230 79-116 (381)
143 PF01363 FYVE: FYVE zinc finge 31.4 24 0.00053 23.7 0.9 9 356-364 11-19 (69)
144 PRK03976 rpl37ae 50S ribosomal 31.3 21 0.00044 25.7 0.5 14 87-100 53-66 (90)
145 COG3677 Transposase and inacti 30.1 22 0.00048 27.7 0.6 14 353-366 52-65 (129)
146 KOG1280|consensus 29.8 47 0.001 30.3 2.6 36 326-361 80-116 (381)
147 PF08209 Sgf11: Sgf11 (transcr 29.2 21 0.00045 20.3 0.2 20 39-58 3-23 (33)
148 PRK03824 hypA hydrogenase nick 28.8 22 0.00047 28.0 0.3 8 355-362 108-115 (135)
149 cd00065 FYVE FYVE domain; Zinc 28.7 38 0.00082 21.6 1.5 9 328-336 5-13 (57)
150 KOG1842|consensus 28.4 31 0.00066 32.6 1.3 30 324-353 14-43 (505)
151 KOG4377|consensus 28.1 35 0.00077 31.8 1.6 25 355-379 402-428 (480)
152 TIGR00595 priA primosomal prot 28.1 38 0.00082 33.4 2.0 27 326-363 223-249 (505)
153 PF12760 Zn_Tnp_IS1595: Transp 27.8 1.3E+02 0.0029 18.2 3.8 10 39-48 17-26 (46)
154 KOG3408|consensus 27.2 35 0.00076 25.8 1.2 24 353-376 56-79 (129)
155 PRK14873 primosome assembly pr 27.1 37 0.00081 34.7 1.8 14 349-362 405-418 (665)
156 PF05191 ADK_lid: Adenylate ki 27.1 21 0.00045 20.7 -0.0 9 384-392 23-31 (36)
157 smart00154 ZnF_AN1 AN1-like Zi 27.0 28 0.00061 20.5 0.6 14 382-395 12-25 (39)
158 smart00440 ZnF_C2C2 C2C2 Zinc 26.7 52 0.0011 19.5 1.7 14 164-179 26-39 (40)
159 smart00064 FYVE Protein presen 26.7 36 0.00078 22.8 1.1 10 327-336 12-21 (68)
160 PF05443 ROS_MUCR: ROS/MUCR tr 26.6 38 0.00083 26.4 1.4 24 354-380 72-95 (132)
161 PF01096 TFIIS_C: Transcriptio 26.5 42 0.00092 19.7 1.2 13 164-178 26-38 (39)
162 KOG4124|consensus 26.5 22 0.00048 32.3 0.0 67 118-187 348-417 (442)
163 COG2331 Uncharacterized protei 26.0 26 0.00056 24.0 0.3 30 326-363 13-42 (82)
164 PRK04023 DNA polymerase II lar 26.0 61 0.0013 34.3 3.0 11 383-393 664-674 (1121)
165 PF03811 Zn_Tnp_IS1: InsA N-te 25.6 27 0.00059 20.2 0.3 14 81-94 22-35 (36)
166 PF14446 Prok-RING_1: Prokaryo 25.4 32 0.0007 22.0 0.6 14 252-265 6-19 (54)
167 COG1773 Rubredoxin [Energy pro 25.1 30 0.00065 22.2 0.4 12 382-393 3-14 (55)
168 PF13451 zf-trcl: Probable zin 24.9 71 0.0015 20.0 2.0 41 221-261 2-43 (49)
169 PF07282 OrfB_Zn_ribbon: Putat 24.8 42 0.00091 22.5 1.2 11 381-391 45-55 (69)
170 KOG4727|consensus 24.3 35 0.00077 27.5 0.8 23 353-375 74-96 (193)
171 KOG2071|consensus 22.9 72 0.0016 31.5 2.7 78 164-261 416-523 (579)
172 PF04780 DUF629: Protein of un 22.4 53 0.0012 31.7 1.7 29 324-352 56-84 (466)
173 TIGR00100 hypA hydrogenase nic 21.8 50 0.0011 25.1 1.2 28 86-128 68-95 (115)
174 PF01286 XPA_N: XPA protein N- 21.7 35 0.00076 19.5 0.2 18 40-57 3-21 (34)
175 KOG1701|consensus 21.5 12 0.00026 34.9 -2.5 73 324-396 359-441 (468)
176 KOG3408|consensus 21.0 84 0.0018 23.9 2.1 29 160-190 51-79 (129)
177 PF04780 DUF629: Protein of un 20.8 61 0.0013 31.3 1.7 27 195-221 58-84 (466)
178 PF13824 zf-Mss51: Zinc-finger 20.5 75 0.0016 20.5 1.6 16 350-365 10-25 (55)
179 PF13453 zf-TFIIB: Transcripti 20.2 48 0.001 19.7 0.6 17 355-371 20-36 (41)
No 1
>KOG2462|consensus
Probab=99.94 E-value=3.2e-28 Score=203.21 Aligned_cols=136 Identities=26% Similarity=0.657 Sum_probs=128.2
Q ss_pred CCCCCCCccchhhhhhcCCcccccccchhcccCchhhcccccccCC--CCcccccCcccccccccChhhHHHHHHcCCCC
Q psy12647 248 DERPFPCPVCQRTFRRKGLESFTFSLRLFCLYFPQELISHSSYHGG--EERRYVCNHMQCTRRFHSASLLARHVSTHAIQ 325 (410)
Q Consensus 248 ~~~~~~C~~C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~~--~~~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~ 325 (410)
....|.|++|++.+.+.+ +|..|..+|-. +.+-+.|+ .|++.|.+...|+.|+++|.-+
T Consensus 127 ~~~r~~c~eCgk~ysT~s-----------------nLsrHkQ~H~~~~s~ka~~C~--~C~K~YvSmpALkMHirTH~l~ 187 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSS-----------------NLSRHKQTHRSLDSKKAFSCK--YCGKVYVSMPALKMHIRTHTLP 187 (279)
T ss_pred cCCceecccccccccccc-----------------ccchhhcccccccccccccCC--CCCceeeehHHHhhHhhccCCC
Confidence 345689999999999998 99999999864 34778899 9999999999999999999999
Q ss_pred cccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccccCccCCC
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVCTTPSPQ 403 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 403 (410)
.+|.+||+.|+..+.|+.|+ ++|+|||||.|+.|+++|.+.+||+.||++|.+.|.|+|..|+|+|+.++-|..|.-
T Consensus 188 c~C~iCGKaFSRPWLLQGHi-RTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 188 CECGICGKAFSRPWLLQGHI-RTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred cccccccccccchHHhhccc-ccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 99999999999999999999 999999999999999999999999999999999999999999999999999999864
No 2
>KOG3608|consensus
Probab=99.93 E-value=5.8e-27 Score=200.34 Aligned_cols=218 Identities=25% Similarity=0.516 Sum_probs=196.7
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCC---------------Cccc--ccccccccChhHHHHHHhhhcCCCCccccC
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAI---------------QYTC--HMCGHMFTCKRRLTQHLARLHTGSLSYPCT 227 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~---------------~~~C--~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~ 227 (410)
..|.|.++.|+..|.+...|..|+..|.. .+.| ..|.+.+.++..|++|+ +.|++++...|+
T Consensus 133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~-r~Hs~eKvvACp 211 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHI-RTHSNEKVVACP 211 (467)
T ss_pred hhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHH-HhcCCCeEEecc
Confidence 56899999999999999999999987742 2445 45899999999999999 999999999999
Q ss_pred CCCcccCCHHHHHHHHHHh--cCCCCCCCccchhhhhhcCCcccccccchhcccCchhhcccccccCCCCcccccCcccc
Q psy12647 228 QCPGKFYSAYELRQHVRQH--TDERPFPCPVCQRTFRRKGLESFTFSLRLFCLYFPQELISHSSYHGGEERRYVCNHMQC 305 (410)
Q Consensus 228 ~C~~~f~~~~~l~~H~~~~--~~~~~~~C~~C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~~~~~~~~C~~~~C 305 (410)
.|+..|.++..|.-|.+.- ....+|.|..|.+.|.++. .|..|+..|. ..|.|+ .|
T Consensus 212 ~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTek-----------------lL~~Hv~rHv---n~ykCp--lC 269 (467)
T KOG3608|consen 212 HCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEK-----------------LLKSHVVRHV---NCYKCP--LC 269 (467)
T ss_pred hHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHH-----------------HHHHHHHHhh---hccccc--cc
Confidence 9999999999999998754 4456899999999999998 9999998885 679999 99
Q ss_pred cccccChhhHHHHHHcCC---CCcccccccCccCChhhHHHHHHhhcCCCCCcccCc--CcccccCcHhHHHHHHhhc-C
Q psy12647 306 TRRFHSASLLARHVSTHA---IQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQ--CPGKFYSAYELRQHVRQHT-D 379 (410)
Q Consensus 306 ~~~f~~~~~l~~H~~~~~---~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~ 379 (410)
+.+....++|..|++..+ +||+|+.|.+.|.+.++|.+|+ .+|+ +..|.|.. |.++|.+..+|.+|++-++ |
T Consensus 270 dmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~-~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg 347 (467)
T KOG3608|consen 270 DMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHV-QVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEG 347 (467)
T ss_pred ccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHH-Hhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccC
Confidence 999999999999999877 8999999999999999999998 6998 77899988 9999999999999998555 5
Q ss_pred CC--CccCccccccccccccCccCCCCCCC
Q psy12647 380 ER--PFPCPVCQRTFRRKVCTTPSPQPQHL 407 (410)
Q Consensus 380 ~~--~~~C~~C~~~f~~~~~l~~H~~~~H~ 407 (410)
.. +|.|..|++.|++--+|.+|+++.|+
T Consensus 348 ~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 348 NNPILYACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred CCCCceeeecchhhhccchhHHHHHHHhhc
Confidence 54 59999999999999999999999986
No 3
>KOG1074|consensus
Probab=99.93 E-value=1e-26 Score=219.46 Aligned_cols=194 Identities=24% Similarity=0.498 Sum_probs=161.4
Q ss_pred CcccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCC----CCCCc---cchhhhhhcCC
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDER----PFPCP---VCQRTFRRKGL 266 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~----~~~C~---~C~~~f~~~~~ 266 (410)
+..|-+|.++.....+|+-|. ++|+||+||+|.+|++.|.++.+|+.|+-.|.... .+.|+ +|-+.|.+.-
T Consensus 605 PNqCiiC~rVlSC~saLqmHy-rtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V- 682 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHY-RTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAV- 682 (958)
T ss_pred ccceeeeeecccchhhhhhhh-hcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccc-
Confidence 689999999999999999998 99999999999999999999999999999886654 47899 9999999988
Q ss_pred cccccccchhcccCchhhcccccccCCCC------------cccccCcccccccccChhhHHHHHHcCC-----------
Q psy12647 267 ESFTFSLRLFCLYFPQELISHSSYHGGEE------------RRYVCNHMQCTRRFHSASLLARHVSTHA----------- 323 (410)
Q Consensus 267 ~~~~~~~~~~~~~~~~~l~~H~~~~~~~~------------~~~~C~~~~C~~~f~~~~~l~~H~~~~~----------- 323 (410)
.|..|++.|.+.. ..-.|. .|.+.|.+...+..++..+.
T Consensus 683 ----------------~lpQhIriH~~~~~s~g~~a~e~~~~adq~~--~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~ 744 (958)
T KOG1074|consen 683 ----------------TLPQHIRIHLGGQISNGGTAAEGILAADQCS--SCQKTFSDARSFSQQISEQPSPESEPDEQMD 744 (958)
T ss_pred ----------------cccceEEeecCCCCCCCcccccccchhcccc--hhhhcccccccchhhhhccCCcccCCccccc
Confidence 8888888887421 123477 89999988888888877661
Q ss_pred -----------CCcccccccCccCChhhHHHHHHhhcC-----------------------CC-----------------
Q psy12647 324 -----------IQYTCHMCGHMFTCKRRLTQHLARLHT-----------------------GS----------------- 352 (410)
Q Consensus 324 -----------~~~~C~~C~~~f~~~~~l~~H~~~~H~-----------------------~~----------------- 352 (410)
.+.-+..|+..+.....+..+- ..+. ++
T Consensus 745 ~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g-~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v 823 (958)
T KOG1074|consen 745 ERTETEELDVTPPPPENSCGRELEGEMAISVRG-STEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSV 823 (958)
T ss_pred ccccccccccCCCccccccccccCccccccccc-chhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCcccc
Confidence 2466777888777766655553 2220 00
Q ss_pred ------------------------------------------------------CCcccCcCcccccCcHhHHHHHHhhc
Q psy12647 353 ------------------------------------------------------LSYPCTQCPGKFYSAYELRQHVRQHT 378 (410)
Q Consensus 353 ------------------------------------------------------~~~~C~~C~~~f~~~~~l~~H~~~h~ 378 (410)
....|.+|++.|...++|+.|+|+|+
T Consensus 824 ~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHt 903 (958)
T KOG1074|consen 824 NMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHT 903 (958)
T ss_pred ccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCC
Confidence 11569999999999999999999999
Q ss_pred CCCCccCccccccccccccCccCCCCCCCC
Q psy12647 379 DERPFPCPVCQRTFRRKVCTTPSPQPQHLD 408 (410)
Q Consensus 379 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 408 (410)
++|||.|.+|++.|..+..|+.||..++|.
T Consensus 904 g~KPF~C~fC~~aFttrgnLKvHMgtH~w~ 933 (958)
T KOG1074|consen 904 GPKPFFCHFCEEAFTTRGNLKVHMGTHMWV 933 (958)
T ss_pred CCCCccchhhhhhhhhhhhhhhhhcccccc
Confidence 999999999999999999999999977764
No 4
>KOG3608|consensus
Probab=99.92 E-value=5.9e-26 Score=194.20 Aligned_cols=313 Identities=25% Similarity=0.457 Sum_probs=214.8
Q ss_pred HHHHHHHHhhcCCCc-HHHHHHHHhhhcCCCcccCCCCCCccccchhceeeeeeecCCCcccchhhHHHHhhhhcCCCCc
Q psy12647 10 VILMLMIMTRYSLPC-VDFIQVHLRLHSGERRFPCEYCSLLVYPVLILFRFIFVCTQGSADSHKHHLKDHVNKIHLHPKT 88 (410)
Q Consensus 10 ~~~c~~~~c~~~~~~-~~~l~~H~~~h~~~~~~~C~~C~~~f~~~l~~h~~~~~c~~~~~~~~~~~l~~H~~~~h~~~~~ 88 (410)
.+.|.+-.|++...+ ..+|++|+..|--.. +|..=|+ ..|+.|..+-.|. ..|... .++-.....
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~--~l~q~G~---~al~~~~dig~c~--~~f~~~-------~~ip~~g~~ 134 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHT--KLKQQGK---LALDLHPDIGACT--APFRLM-------EKIPALGQN 134 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHH--HHHHHHH---HHHhcCCCcCccc--CCcchh-------hccccchhh
Confidence 678888899888888 578999987762100 0000000 0022222211111 111111 222223344
Q ss_pred eecc--cccccccCchhHHHHHHHhhhcCCCCcccCCcCcccccCcccccccccccccccccchhhhhccccccCCCcce
Q psy12647 89 FSCN--SCGQTWLGYTSWKQHVAACREKRRKETFPCKRCDERFSTRDGLESFTFSLRLFCLYFPQELISHSSYHGGEERR 166 (410)
Q Consensus 89 ~~C~--~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~l~~h~~~~~~~~~~ 166 (410)
|.|. .|+..|.+...|.+|+.. +.. -|. =+..+...+ +...
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~---H~l----~ce-yd~~~~~~D-----------------------------~~pv 177 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVK---HAL----FCE-YDIQKTPED-----------------------------ERPV 177 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHH---hhh----hhh-hhhhhCCCC-----------------------------CCce
Confidence 8886 599999999999999876 221 110 001111111 1144
Q ss_pred eecCcccccccccChhHHHHHHhhCCC--CcccccccccccChhHHHHHHhhhcC-CCCccccCCCCcccCCHHHHHHHH
Q psy12647 167 YVCNHMQCTRRFHSASLLARHVSTHAI--QYTCHMCGHMFTCKRRLTQHLARLHT-GSLSYPCTQCPGKFYSAYELRQHV 243 (410)
Q Consensus 167 ~~C~~~~C~~~f~~~~~l~~H~~~~~~--~~~C~~C~~~f~~~~~l~~H~~~~H~-~~~~~~C~~C~~~f~~~~~l~~H~ 243 (410)
+.|.+..|.+.+.++..|..|++.|.. ...|+.||.-|+++..|-.|+++.-. ...+|+|..|.+.|.++..|..|+
T Consensus 178 ~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv 257 (467)
T KOG3608|consen 178 TMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV 257 (467)
T ss_pred eeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH
Confidence 667777888888888888888888765 56788888888888888888754432 345788888888888888888888
Q ss_pred HHhcCCCCCCCccchhhhhhcCCcccccccchhcccCchhhcccccccCCCCcccccCcccccccccChhhHHHHHHcCC
Q psy12647 244 RQHTDERPFPCPVCQRTFRRKGLESFTFSLRLFCLYFPQELISHSSYHGGEERRYVCNHMQCTRRFHSASLLARHVSTHA 323 (410)
Q Consensus 244 ~~~~~~~~~~C~~C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~~ 323 (410)
..|-. .|+|+.|+-.....+ +|..|++..+.+.+||+|+ .|++.+.+.++|.+|...|.
T Consensus 258 ~rHvn--~ykCplCdmtc~~~s-----------------sL~~H~r~rHs~dkpfKCd--~Cd~~c~~esdL~kH~~~HS 316 (467)
T KOG3608|consen 258 VRHVN--CYKCPLCDMTCSSAS-----------------SLTTHIRYRHSKDKPFKCD--ECDTRCVRESDLAKHVQVHS 316 (467)
T ss_pred HHhhh--cccccccccCCCChH-----------------HHHHHHHhhhccCCCcccc--chhhhhccHHHHHHHHHhcc
Confidence 87754 378888887777766 8888887766666899999 89999999999999998887
Q ss_pred -CCccccc--ccCccCChhhHHHHHHhhcCCCC--CcccCcCcccccCcHhHHHHHHhhcC------CCCccCccccccc
Q psy12647 324 -IQYTCHM--CGHMFTCKRRLTQHLARLHTGSL--SYPCTQCPGKFYSAYELRQHVRQHTD------ERPFPCPVCQRTF 392 (410)
Q Consensus 324 -~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~h~~------~~~~~C~~C~~~f 392 (410)
..|+|.. |..+|.....|++|++.+|.|.. +|.|-.|++.|+...+|..|+..-++ -+.|.-.+|.-.|
T Consensus 317 ~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~ 396 (467)
T KOG3608|consen 317 KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGF 396 (467)
T ss_pred ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCce
Confidence 6799988 99999999999999988886554 59999999999999999999765443 3446666666665
Q ss_pred cc
Q psy12647 393 RR 394 (410)
Q Consensus 393 ~~ 394 (410)
-.
T Consensus 397 mR 398 (467)
T KOG3608|consen 397 MR 398 (467)
T ss_pred ee
Confidence 43
No 5
>KOG1074|consensus
Probab=99.91 E-value=2.4e-25 Score=210.30 Aligned_cols=208 Identities=27% Similarity=0.457 Sum_probs=136.5
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCC--CcccccccccccChhHHHHHHhhhcCCC----CccccC---CCCcccCC
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAI--QYTCHMCGHMFTCKRRLTQHLARLHTGS----LSYPCT---QCPGKFYS 235 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~--~~~C~~C~~~f~~~~~l~~H~~~~H~~~----~~~~C~---~C~~~f~~ 235 (410)
.+.+|. +|-++..-.+.|+.|.++|.+ ||+|.+||+.|.++..|+.|| .+|... .++.|+ +|.+.|.+
T Consensus 604 dPNqCi--iC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 604 DPNQCI--ICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred Ccccee--eeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcc-cccccCccccccccCCchhhhcccccc
Confidence 467888 888888888888888888877 888888888888888888888 777654 447788 88888888
Q ss_pred HHHHHHHHHHhcCCC-------------CCCCccchhhhhhcCC----cccccccchhcccCchhhcccccccCCC---C
Q psy12647 236 AYELRQHVRQHTDER-------------PFPCPVCQRTFRRKGL----ESFTFSLRLFCLYFPQELISHSSYHGGE---E 295 (410)
Q Consensus 236 ~~~l~~H~~~~~~~~-------------~~~C~~C~~~f~~~~~----~~~~~~~~~~~~~~~~~l~~H~~~~~~~---~ 295 (410)
.-.|..|+++|.+.. .-+|..|.+.|..... .....+-..++ ....+.+...++ .
T Consensus 681 ~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~-----~~~~~~~t~t~~~~~t 755 (958)
T KOG1074|consen 681 AVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEP-----DEQMDERTETEELDVT 755 (958)
T ss_pred cccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCC-----cccccccccccccccC
Confidence 888888888876321 1357777777643320 00000000000 000000111110 0
Q ss_pred cccccCcccccccccChhhHHHHHHcC-----------------------------------------------------
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHVSTH----------------------------------------------------- 322 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~~~~----------------------------------------------------- 322 (410)
.+..+. .|+..+.....+..+-...
T Consensus 756 p~~~e~--~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~ 833 (958)
T KOG1074|consen 756 PPPPEN--SCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQE 833 (958)
T ss_pred CCcccc--ccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccc
Confidence 011122 2222221111111110000
Q ss_pred -------------------------------------------CCCcccccccCccCChhhHHHHHHhhcCCCCCcccCc
Q psy12647 323 -------------------------------------------AIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQ 359 (410)
Q Consensus 323 -------------------------------------------~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~ 359 (410)
.....|..|++.|.+.+.|..|+ ++|++++||.|.+
T Consensus 834 ~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~-rTHtg~KPF~C~f 912 (958)
T KOG1074|consen 834 TSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHM-RTHTGPKPFFCHF 912 (958)
T ss_pred ccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhh-hcCCCCCCccchh
Confidence 01278999999999999999999 9999999999999
Q ss_pred CcccccCcHhHHHHHHhhcCCCCc
Q psy12647 360 CPGKFYSAYELRQHVRQHTDERPF 383 (410)
Q Consensus 360 C~~~f~~~~~l~~H~~~h~~~~~~ 383 (410)
|++.|+.+.+|..||.+|.+..++
T Consensus 913 C~~aFttrgnLKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 913 CEEAFTTRGNLKVHMGTHMWVQPP 936 (958)
T ss_pred hhhhhhhhhhhhhhhccccccCCC
Confidence 999999999999999999987654
No 6
>KOG3623|consensus
Probab=99.89 E-value=9.4e-24 Score=196.26 Aligned_cols=98 Identities=29% Similarity=0.750 Sum_probs=88.0
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCC----CcccccccccccChhHHHHHHhhhcCCC-------------CccccC
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAI----QYTCHMCGHMFTCKRRLTQHLARLHTGS-------------LSYPCT 227 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~----~~~C~~C~~~f~~~~~l~~H~~~~H~~~-------------~~~~C~ 227 (410)
....|+ .|++.+.....|+.|++..+. .|.|..|.+.|.....|.+|| .+|... +.|+|+
T Consensus 209 qlltcp--ycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm-~~hkpg~dqa~sltqsa~lRKFKCt 285 (1007)
T KOG3623|consen 209 QLLTCP--YCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHM-QLHKPGGDQAISLTQSALLRKFKCT 285 (1007)
T ss_pred hhhcch--hHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHH-HhhcCCCcccccccchhhhcccccc
Confidence 457899 999999999999999987554 799999999999999999999 777532 459999
Q ss_pred CCCcccCCHHHHHHHHHHhcCCCCCCCccchhhhhhcC
Q psy12647 228 QCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKG 265 (410)
Q Consensus 228 ~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~ 265 (410)
.|++.|..+..|+.|+++|.|++||.|+.|++.|....
T Consensus 286 ECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSG 323 (1007)
T KOG3623|consen 286 ECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSG 323 (1007)
T ss_pred ccchhhhhHHHHHhhheeecCCCCcCCcccccccccCC
Confidence 99999999999999999999999999999999997543
No 7
>KOG2462|consensus
Probab=99.88 E-value=6e-23 Score=171.75 Aligned_cols=128 Identities=29% Similarity=0.687 Sum_probs=118.6
Q ss_pred CCcccCCcCcccccCcccccccccccccccccchhhhhccccccCCC--cceeecCcccccccccChhHHHHHHhhCCCC
Q psy12647 117 KETFPCKRCDERFSTRDGLESFTFSLRLFCLYFPQELISHSSYHGGE--ERRYVCNHMQCTRRFHSASLLARHVSTHAIQ 194 (410)
Q Consensus 117 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~l~~h~~~~~~~--~~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~ 194 (410)
...|.|+.||+.+.+..+ |..|..+|-.- .+.+.|+ .|++.|.+..+|..|+++|.-+
T Consensus 128 ~~r~~c~eCgk~ysT~sn------------------LsrHkQ~H~~~~s~ka~~C~--~C~K~YvSmpALkMHirTH~l~ 187 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSN------------------LSRHKQTHRSLDSKKAFSCK--YCGKVYVSMPALKMHIRTHTLP 187 (279)
T ss_pred CCceeccccccccccccc------------------cchhhcccccccccccccCC--CCCceeeehHHHhhHhhccCCC
Confidence 346899999999999999 66666666442 3668899 9999999999999999999999
Q ss_pred cccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCCCCCCccchhhhhhcC
Q psy12647 195 YTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKG 265 (410)
Q Consensus 195 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~ 265 (410)
..|.+||+.|...=-|+-|+ ++|+||+||.|+.|++.|..+++|+.|+++|.+.+.|+|..|++.|...+
T Consensus 188 c~C~iCGKaFSRPWLLQGHi-RTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~S 257 (279)
T KOG2462|consen 188 CECGICGKAFSRPWLLQGHI-RTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKS 257 (279)
T ss_pred cccccccccccchHHhhccc-ccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHH
Confidence 99999999999999999999 99999999999999999999999999999999999999999999999887
No 8
>KOG3623|consensus
Probab=99.79 E-value=4.1e-20 Score=172.30 Aligned_cols=86 Identities=28% Similarity=0.579 Sum_probs=80.0
Q ss_pred HHHcCCCCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCcccccccccccc
Q psy12647 318 HVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVC 397 (410)
Q Consensus 318 H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 397 (410)
|.+....+|.|+.|++.|.-.+.|.+|. ..|+|.+||+|.+|.+.|..+..|.+|+|.|.|+|||+|+-|+|.|+-..+
T Consensus 887 ~~kte~gmyaCDqCDK~FqKqSSLaRHK-YEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGS 965 (1007)
T KOG3623|consen 887 HAKTEDGMYACDQCDKAFQKQSSLARHK-YEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGS 965 (1007)
T ss_pred cccCccccchHHHHHHHHHhhHHHHHhh-hhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccc
Confidence 4444558999999999999999999996 999999999999999999999999999999999999999999999999999
Q ss_pred CccCCCC
Q psy12647 398 TTPSPQP 404 (410)
Q Consensus 398 l~~H~~~ 404 (410)
..+||.-
T Consensus 966 YSQHMNH 972 (1007)
T KOG3623|consen 966 YSQHMNH 972 (1007)
T ss_pred hHhhhcc
Confidence 9999864
No 9
>KOG3576|consensus
Probab=99.58 E-value=3.1e-16 Score=124.68 Aligned_cols=111 Identities=25% Similarity=0.519 Sum_probs=102.8
Q ss_pred CcccccCcccccccccChhhHHHHHHcCC--CCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHH
Q psy12647 295 ERRYVCNHMQCTRRFHSASLLARHVSTHA--IQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQ 372 (410)
Q Consensus 295 ~~~~~C~~~~C~~~f~~~~~l~~H~~~~~--~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~ 372 (410)
...|.|. .|++.|.-...|.+|++-|. +.|-|..||+.|.+--+|++|+ ++|+|.+||+|..|+++|++...|..
T Consensus 115 ~d~ftCr--vCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~-rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 115 QDSFTCR--VCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHT-RTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred CCeeeee--hhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhh-ccccCccccchhhhhHHHHhhccHHH
Confidence 3789999 99999999999999999998 8899999999999999999998 99999999999999999999999999
Q ss_pred HHHhhcC-----------CCCccCccccccccccccCccCCCCCCCC
Q psy12647 373 HVRQHTD-----------ERPFPCPVCQRTFRRKVCTTPSPQPQHLD 408 (410)
Q Consensus 373 H~~~h~~-----------~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 408 (410)
|++.-+| ++.|.|..||++-.....+..|++..|+.
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 9875443 45699999999999999999999998864
No 10
>KOG3576|consensus
Probab=99.45 E-value=2.2e-14 Score=114.21 Aligned_cols=71 Identities=34% Similarity=0.844 Sum_probs=51.3
Q ss_pred CcccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCCCCCCccchhhhhhcC
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKG 265 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~ 265 (410)
.+.|.+|++.|.-..-|.+|+ .-|...+.+.|..|++.|+....|++|+++|++.+||+|..|++.|+++-
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~-kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrc 187 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHL-KCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRC 187 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHh-hhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhc
Confidence 466777777777777777776 66776677777777777777777777777777777777777777777766
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.30 E-value=4.5e-12 Score=120.18 Aligned_cols=147 Identities=20% Similarity=0.390 Sum_probs=119.3
Q ss_pred cccCCCCcccCCHHHHHHHHHHhcCCCCCCCcc--chhhhhhcCCcccccccchhcccCchhhcccccccCCCCcccccC
Q psy12647 224 YPCTQCPGKFYSAYELRQHVRQHTDERPFPCPV--CQRTFRRKGLESFTFSLRLFCLYFPQELISHSSYHGGEERRYVCN 301 (410)
Q Consensus 224 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~--C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~~~~~~~~C~ 301 (410)
..|+.|.+... ...|..|...... ....|+. |+..|... .+.+| +.|+
T Consensus 408 V~C~NC~~~i~-l~~l~lHe~~C~r-~~V~Cp~~~Cg~v~~r~------------------el~~H----------~~C~ 457 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHEAYCSR-HNVVCPHDGCGIVLRVE------------------EAKNH----------VHCE 457 (567)
T ss_pred EECCCCCCccc-hhHHHHHHhhCCC-cceeCCcccccceeecc------------------ccccC----------ccCC
Confidence 57999988765 4556678765433 3456874 99888655 44444 5899
Q ss_pred cccccccccChhhHHHHHHcCCCCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccC----------cHhHH
Q psy12647 302 HMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYS----------AYELR 371 (410)
Q Consensus 302 ~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~----------~~~l~ 371 (410)
.|++.|. ...|..|+..++.++.|+ |++.+ .+..|..|+ .+|..++|+.|+.|++.|.. .+.|.
T Consensus 458 --~Cgk~f~-~s~LekH~~~~Hkpv~Cp-Cg~~~-~R~~L~~H~-~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt 531 (567)
T PLN03086 458 --KCGQAFQ-QGEMEKHMKVFHEPLQCP-CGVVL-EKEQMVQHQ-ASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMS 531 (567)
T ss_pred --CCCCccc-hHHHHHHHHhcCCCccCC-CCCCc-chhHHHhhh-hccCCCCceeCCCCCCccccCccccchhhhhhhHH
Confidence 9999996 688999999999999999 99765 678999997 88999999999999999952 46899
Q ss_pred HHHHhhcCCCCccCccccccccccccCccCCCCCCCC
Q psy12647 372 QHVRQHTDERPFPCPVCQRTFRRKVCTTPSPQPQHLD 408 (410)
Q Consensus 372 ~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 408 (410)
.|..+. |.+++.|..||+.|..+. +..|+...|.+
T Consensus 532 ~HE~~C-G~rt~~C~~Cgk~Vrlrd-m~~H~~~~h~~ 566 (567)
T PLN03086 532 EHESIC-GSRTAPCDSCGRSVMLKE-MDIHQIAVHQK 566 (567)
T ss_pred HHHHhc-CCcceEccccCCeeeehh-HHHHHHHhhcC
Confidence 999985 889999999999998875 88998888864
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.11 E-value=1.5e-10 Score=109.91 Aligned_cols=136 Identities=21% Similarity=0.441 Sum_probs=110.1
Q ss_pred ceecccccccccCchhHHHHHHHhhhcCCCCcccCCc--CcccccCcccccccccccccccccchhhhhccccccCCCcc
Q psy12647 88 TFSCNSCGQTWLGYTSWKQHVAACREKRRKETFPCKR--CDERFSTRDGLESFTFSLRLFCLYFPQELISHSSYHGGEER 165 (410)
Q Consensus 88 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~~~~~l~~h~~~~~~~~~ 165 (410)
...|+.|.+.... ..|..|...|. -....|+. |+..|.... +. .
T Consensus 407 ~V~C~NC~~~i~l-~~l~lHe~~C~----r~~V~Cp~~~Cg~v~~r~e-------------------l~----------~ 452 (567)
T PLN03086 407 TVECRNCKHYIPS-RSIALHEAYCS----RHNVVCPHDGCGIVLRVEE-------------------AK----------N 452 (567)
T ss_pred eEECCCCCCccch-hHHHHHHhhCC----CcceeCCcccccceeeccc-------------------cc----------c
Confidence 4679999887764 45778887642 23456884 999985544 33 3
Q ss_pred eeecCcccccccccChhHHHHHHhhCCCCcccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCC----------
Q psy12647 166 RYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYS---------- 235 (410)
Q Consensus 166 ~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~---------- 235 (410)
.+.|+ .|++.|. ...|..|+.+++.++.|+ ||+.+ ....|..|+ ..|.+.+++.|++|++.+..
T Consensus 453 H~~C~--~Cgk~f~-~s~LekH~~~~Hkpv~Cp-Cg~~~-~R~~L~~H~-~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~ 526 (567)
T PLN03086 453 HVHCE--KCGQAFQ-QGEMEKHMKVFHEPLQCP-CGVVL-EKEQMVQHQ-ASTCPLRLITCRFCGDMVQAGGSAMDVRDR 526 (567)
T ss_pred CccCC--CCCCccc-hHHHHHHHHhcCCCccCC-CCCCc-chhHHHhhh-hccCCCCceeCCCCCCccccCccccchhhh
Confidence 35899 9999996 678999999998999999 99755 679999998 88999999999999999852
Q ss_pred HHHHHHHHHHhcCCCCCCCccchhhhhhc
Q psy12647 236 AYELRQHVRQHTDERPFPCPVCQRTFRRK 264 (410)
Q Consensus 236 ~~~l~~H~~~~~~~~~~~C~~C~~~f~~~ 264 (410)
...|..|.... +.+++.|..|++.+..+
T Consensus 527 ~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 527 LRGMSEHESIC-GSRTAPCDSCGRSVMLK 554 (567)
T ss_pred hhhHHHHHHhc-CCcceEccccCCeeeeh
Confidence 35799998885 89999999999999877
No 13
>PHA00733 hypothetical protein
Probab=99.08 E-value=3.2e-11 Score=93.34 Aligned_cols=82 Identities=24% Similarity=0.373 Sum_probs=52.0
Q ss_pred CCcccccccCccCChhhHHHH--H--HhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccccCc
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQH--L--ARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVCTT 399 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H--~--~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~ 399 (410)
+++.|.+|+..|.....|..+ + ...+.+++||.|+.|++.|.....|..|++.| +.+|.|.+|+++|.....|.
T Consensus 39 ~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~ 116 (128)
T PHA00733 39 KRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTL 116 (128)
T ss_pred hhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHH
Confidence 455555555555555544444 1 01233466777777777777777777777765 24577777777777777777
Q ss_pred cCCCCCCC
Q psy12647 400 PSPQPQHL 407 (410)
Q Consensus 400 ~H~~~~H~ 407 (410)
.|+...|.
T Consensus 117 ~H~~~~h~ 124 (128)
T PHA00733 117 DHVCKKHN 124 (128)
T ss_pred HHHHHhcC
Confidence 77777665
No 14
>PHA02768 hypothetical protein; Provisional
Probab=99.04 E-value=1.1e-10 Score=73.61 Aligned_cols=45 Identities=20% Similarity=0.386 Sum_probs=39.3
Q ss_pred CcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccccCcc
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVCTTP 400 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 400 (410)
.|.|+.||+.|+..++|..|+++|+ ++|+|..|+++|..+++|..
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~~ 49 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYIE 49 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeEE
Confidence 5889999999999999999999988 68899999999998887753
No 15
>PHA00733 hypothetical protein
Probab=98.89 E-value=2.1e-09 Score=83.20 Aligned_cols=80 Identities=23% Similarity=0.398 Sum_probs=68.4
Q ss_pred CcccccCcccccccccChhhHHHH--HHc----CC-CCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCc
Q psy12647 295 ERRYVCNHMQCTRRFHSASLLARH--VST----HA-IQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSA 367 (410)
Q Consensus 295 ~~~~~C~~~~C~~~f~~~~~l~~H--~~~----~~-~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~ 367 (410)
.+++.|. .|...|.....|..+ +.. +. ++|.|+.|++.|.+.+.|..|+ ..| +.+|.|+.|++.|...
T Consensus 38 ~~~~~~~--~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~-r~h--~~~~~C~~CgK~F~~~ 112 (128)
T PHA00733 38 QKRLIRA--VVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHI-RYT--EHSKVCPVCGKEFRNT 112 (128)
T ss_pred hhhHHHH--HHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHH-hcC--CcCccCCCCCCccCCH
Confidence 3889999 999999888777776 222 22 7999999999999999999998 655 4679999999999999
Q ss_pred HhHHHHHHhhcC
Q psy12647 368 YELRQHVRQHTD 379 (410)
Q Consensus 368 ~~l~~H~~~h~~ 379 (410)
..|.+|++..++
T Consensus 113 ~sL~~H~~~~h~ 124 (128)
T PHA00733 113 DSTLDHVCKKHN 124 (128)
T ss_pred HHHHHHHHHhcC
Confidence 999999988775
No 16
>KOG3993|consensus
Probab=98.78 E-value=7.3e-10 Score=98.50 Aligned_cols=196 Identities=16% Similarity=0.207 Sum_probs=123.9
Q ss_pred CcccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCCCCC--Cccchh-hhhhcCCcccc
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFP--CPVCQR-TFRRKGLESFT 270 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~--C~~C~~-~f~~~~~~~~~ 270 (410)
.|.|..|...|.+...|.+|. =.-.....|+|+.|++.|+-..+|..|.++|.+...-. -..=.+ ......
T Consensus 267 dyiCqLCK~kYeD~F~LAQHr-C~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra----- 340 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHR-CPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA----- 340 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhcc-CCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh-----
Confidence 588999999999999999885 22223346899999999999999999999886543211 000000 000000
Q ss_pred cccchhcccCchhhccccc-ccCCCCcccccCcccccccccChhhHHHHHHcCCCC-------------------ccccc
Q psy12647 271 FSLRLFCLYFPQELISHSS-YHGGEERRYVCNHMQCTRRFHSASLLARHVSTHAIQ-------------------YTCHM 330 (410)
Q Consensus 271 ~~~~~~~~~~~~~l~~H~~-~~~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~-------------------~~C~~ 330 (410)
...+-.+ -...+...|.|. .|++.|.....|+.|+.+|++. +.+..
T Consensus 341 ------------e~~ea~rsg~dss~gi~~C~--~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~ 406 (500)
T KOG3993|consen 341 ------------EVQEAERSGDDSSSGIFSCH--TCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQA 406 (500)
T ss_pred ------------hhhhccccCCcccCceeecH--HhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccc
Confidence 0000000 001223689999 9999999999999998777611 33444
Q ss_pred ccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccccCccCCCCCCCCC
Q psy12647 331 CGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVCTTPSPQPQHLDV 409 (410)
Q Consensus 331 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 409 (410)
|.-.+.....-..+.-.+-..+..-.|+.|+..+..+..-.-+.+.-..+..|.|.+|.-+|.+...|.+|+++-|...
T Consensus 407 ~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 407 VATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred cccccccccccccceeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 4433333222222210111112224688888888777766666666566677999999999999999999999999764
No 17
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.78 E-value=3.4e-09 Score=57.18 Aligned_cols=26 Identities=46% Similarity=1.155 Sum_probs=20.9
Q ss_pred hHHHHHHhhcCCCCccCccccccccc
Q psy12647 369 ELRQHVRQHTDERPFPCPVCQRTFRR 394 (410)
Q Consensus 369 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 394 (410)
+|.+|+++|+|++||+|++|+++|++
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47788888888888888888888863
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.71 E-value=6.8e-09 Score=65.61 Aligned_cols=44 Identities=23% Similarity=0.539 Sum_probs=34.5
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHH
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELR 371 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~ 371 (410)
.|+|+.||+.|+..+.|..|+ ++|. +||+|..|++.|...++|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~-r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHL-RKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHH-HhcC--CcccCCcccceecccceeE
Confidence 478888888888888888887 6677 6788888888888777664
No 19
>KOG3993|consensus
Probab=98.70 E-value=3.4e-09 Score=94.30 Aligned_cols=173 Identities=24% Similarity=0.401 Sum_probs=109.8
Q ss_pred ceeecCcccccccccChhHHHHHHh--hCCCCcccccccccccChhHHHHHHhhhcCC----------------------
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVS--THAIQYTCHMCGHMFTCKRRLTQHLARLHTG---------------------- 220 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~--~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~---------------------- 220 (410)
..|.|. .|...|.+.-.|.+|.= +-+..|+|+.|+++|.....|..|. ++|..
T Consensus 266 GdyiCq--LCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHR-RWHKPR~eaa~a~~~P~k~~~~~rae~ 342 (500)
T KOG3993|consen 266 GDYICQ--LCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHR-RWHKPRPEAAKAGSPPPKQAVETRAEV 342 (500)
T ss_pred HHHHHH--HHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhh-cccCCchhhhhcCCCChhhhhhhhhhh
Confidence 348899 99999999999999963 3344899999999999999999997 77742
Q ss_pred -----------CCccccCCCCcccCCHHHHHHHHHHhcCCCCCCCccchhhhhhcCCcccccccchhcccCchhhcccc-
Q psy12647 221 -----------SLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKGLESFTFSLRLFCLYFPQELISHS- 288 (410)
Q Consensus 221 -----------~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~- 288 (410)
+..|.|..|+++|....-|+.|+.+|.....-+-. .-.|....+ ......+..+..|.
T Consensus 343 ~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~--a~~f~~s~~--------~~l~~~~~~~a~h~~ 412 (500)
T KOG3993|consen 343 QEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK--APKFLLSRV--------IPLMHFNQAVATHSS 412 (500)
T ss_pred hhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc--ccCcchhhc--------ccccccccccccccc
Confidence 12388999999999999999998887643211100 001111100 00000111222222
Q ss_pred ---------cccCCCCcccccCcccccccccChhhHHHHHHcCC--CCcccccccCccCChhhHHHHHHhhcCCC
Q psy12647 289 ---------SYHGGEERRYVCNHMQCTRRFHSASLLARHVSTHA--IQYTCHMCGHMFTCKRRLTQHLARLHTGS 352 (410)
Q Consensus 289 ---------~~~~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~~--~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 352 (410)
..+.+....-.++ .++..+.++..-..+.+... ..|.|.+|.-.|.+...|.+|+...|..+
T Consensus 413 a~~~~g~~vl~~a~sael~~pp--~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 413 ASDSHGDEVLYVAGSAELELPP--YDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred cccccccceeeeeccccccCCC--CCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 2222222333445 66666666655444443333 56888888888888888888876666654
No 20
>PHA00616 hypothetical protein
Probab=98.53 E-value=4e-08 Score=58.93 Aligned_cols=40 Identities=23% Similarity=0.410 Sum_probs=33.6
Q ss_pred CcccCcCcccccCcHhHHHHHHhhcCCCCccCcccccccc
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFR 393 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 393 (410)
||+|+.||+.|.++++|.+|++.|+|++++.|++=-..|.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f~ 40 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYFR 40 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEEE
Confidence 6899999999999999999999999999988876544444
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.40 E-value=1.4e-07 Score=50.82 Aligned_cols=26 Identities=38% Similarity=0.812 Sum_probs=23.4
Q ss_pred hHHHHHHhhcCCCCCcccCcCcccccC
Q psy12647 340 RLTQHLARLHTGSLSYPCTQCPGKFYS 366 (410)
Q Consensus 340 ~l~~H~~~~H~~~~~~~C~~C~~~f~~ 366 (410)
+|..|+ ++|++++||.|+.|+++|.+
T Consensus 1 ~l~~H~-~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHM-RTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHH-HHHSSSSSEEESSSSEEESS
T ss_pred CHHHHh-hhcCCCCCCCCCCCcCeeCc
Confidence 488998 78999999999999999964
No 22
>PHA00732 hypothetical protein
Probab=98.30 E-value=5.2e-07 Score=63.09 Aligned_cols=48 Identities=29% Similarity=0.533 Sum_probs=30.3
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHT 378 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 378 (410)
||.|..|++.|.+.+.|..|++..|. ++.|+.|++.|. .|..|+++..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccC
Confidence 46677777777777777777633454 246777777776 4666665543
No 23
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.22 E-value=2.2e-06 Score=55.64 Aligned_cols=52 Identities=21% Similarity=0.477 Sum_probs=40.8
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCC-CCcccCcCcccccCcHhHHHHHHhhcC
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGS-LSYPCTQCPGKFYSAYELRQHVRQHTD 379 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~h~~ 379 (410)
.|.|++|++. .+...|..|+...|..+ +.+.||+|...+. .+|..|+..+++
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 4889999994 55678999988888876 4689999998655 488999887653
No 24
>PHA00732 hypothetical protein
Probab=98.16 E-value=1.5e-06 Score=60.78 Aligned_cols=43 Identities=30% Similarity=0.656 Sum_probs=22.3
Q ss_pred CcccCcCcccccCcHhHHHHHHh-hcCCCCccCccccccccccccCccCC
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQ-HTDERPFPCPVCQRTFRRKVCTTPSP 402 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~ 402 (410)
||.|+.|++.|....+|..|++. |.+ +.|+.|+++|. .|.+|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhh
Confidence 35555555555555555555553 332 35555555555 244444
No 25
>PHA00616 hypothetical protein
Probab=98.04 E-value=1.9e-06 Score=51.81 Aligned_cols=36 Identities=22% Similarity=0.368 Sum_probs=31.4
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCCCCcccCcCc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCP 361 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~ 361 (410)
||+|+.||+.|...+.|..|+ +.|++++++.|+.=-
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~-r~~hg~~~~~~~~~y 36 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHL-LSVHKQNKLTLEYFY 36 (44)
T ss_pred CCccchhhHHHhhHHHHHHHH-HHhcCCCccceeEEE
Confidence 689999999999999999998 888888999887533
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.90 E-value=2e-05 Score=51.15 Aligned_cols=51 Identities=22% Similarity=0.477 Sum_probs=40.7
Q ss_pred CcccccccccccChhHHHHHHhhhcCCC-CccccCCCCcccCCHHHHHHHHHHhc
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTGS-LSYPCTQCPGKFYSAYELRQHVRQHT 247 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~~ 247 (410)
.|.|++|++ .-+...|..|+...|..+ +.+.||+|...+. ..|..|+...+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 478999999 566788999998888875 4689999988644 48888888765
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.77 E-value=1.4e-05 Score=41.71 Aligned_cols=23 Identities=39% Similarity=0.865 Sum_probs=14.3
Q ss_pred cccCcCcccccCcHhHHHHHHhh
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQH 377 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~h 377 (410)
|.|+.|++.|.+..+|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45666666666666666666553
No 28
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.47 E-value=1.1e-05 Score=60.27 Aligned_cols=72 Identities=24% Similarity=0.344 Sum_probs=19.4
Q ss_pred cCcccccccccChhhHHHHHHcCCCCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhh
Q psy12647 300 CNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQH 377 (410)
Q Consensus 300 C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h 377 (410)
|. .|+..|.+...|..|+...+... -+ ....+.....+..+++. .....+.|..|++.|.+...|..||+.+
T Consensus 2 C~--~C~~~f~~~~~l~~H~~~~H~~~-~~-~~~~l~~~~~~~~~~~~--~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CL--FCDESFSSVDDLLQHMKKKHGFD-IP-DQKYLVDPNRLLNYLRK--KVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred cc--ccccccccccccccccccccccc-cc-ccccccccccccccccc--ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 55 77777777777777776655211 01 12222233344444311 1223678888888888888888888854
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.42 E-value=9.8e-05 Score=38.82 Aligned_cols=23 Identities=48% Similarity=0.896 Sum_probs=12.2
Q ss_pred cccCcCcccccCcHhHHHHHHhh
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQH 377 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~h 377 (410)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45566666666666666665554
No 30
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.39 E-value=8e-05 Score=40.55 Aligned_cols=25 Identities=36% Similarity=0.752 Sum_probs=15.7
Q ss_pred CcccCcCcccccCcHhHHHHHHhhc
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHT 378 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~ 378 (410)
||.|+.|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4566666666666666666666554
No 31
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.27 E-value=0.00017 Score=62.56 Aligned_cols=28 Identities=18% Similarity=0.596 Sum_probs=23.4
Q ss_pred CCCCccCccccccccccccCccCCCCCC
Q psy12647 379 DERPFPCPVCQRTFRRKVCTTPSPQPQH 406 (410)
Q Consensus 379 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H 406 (410)
.+|||.|.+|+|.+++.--|+-|++-.|
T Consensus 395 ~~KPYrCevC~KRYKNlNGLKYHr~Hsh 422 (423)
T COG5189 395 KDKPYRCEVCDKRYKNLNGLKYHRKHSH 422 (423)
T ss_pred cCCceeccccchhhccCccceecccccC
Confidence 4588999999999999988988887655
No 32
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.27 E-value=0.00011 Score=38.24 Aligned_cols=20 Identities=40% Similarity=1.072 Sum_probs=14.4
Q ss_pred cccccccCccCChhhHHHHH
Q psy12647 326 YTCHMCGHMFTCKRRLTQHL 345 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~ 345 (410)
|.|++|++.|.+...|..|+
T Consensus 1 y~C~~C~~~f~~~~~l~~H~ 20 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHM 20 (23)
T ss_dssp EEETTTTEEESSHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHH
Confidence 56777777777777777776
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.06 E-value=0.00038 Score=36.50 Aligned_cols=22 Identities=36% Similarity=0.910 Sum_probs=15.1
Q ss_pred cccccccCccCChhhHHHHHHh
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLAR 347 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~ 347 (410)
|.|++|++.|.+...|+.|+..
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHh
Confidence 5678888888888888888743
No 34
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.03 E-value=0.0002 Score=62.06 Aligned_cols=27 Identities=19% Similarity=0.487 Sum_probs=21.4
Q ss_pred cccccCcccccccccChhhHHHHHHcC
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHVSTH 322 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~~~~ 322 (410)
+||+|+.+.|++.+++...|+.|+.-.
T Consensus 348 KpykCpV~gC~K~YknqnGLKYH~lhG 374 (423)
T COG5189 348 KPYKCPVEGCNKKYKNQNGLKYHMLHG 374 (423)
T ss_pred ceecCCCCCchhhhccccchhhhhhcc
Confidence 888888888888888888888886543
No 35
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.98 E-value=0.00051 Score=42.26 Aligned_cols=41 Identities=17% Similarity=0.314 Sum_probs=25.4
Q ss_pred HHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCC
Q psy12647 342 TQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERP 382 (410)
Q Consensus 342 ~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 382 (410)
..+.+..+..+.|-.|++|+..+....+|++|+.++++.+|
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34444455667888999999999999999999988887765
No 36
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.79 E-value=0.0012 Score=35.15 Aligned_cols=24 Identities=38% Similarity=0.767 Sum_probs=15.2
Q ss_pred cccCcCcccccCcHhHHHHHHhhc
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQHT 378 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~h~ 378 (410)
|.|+.|++.|...+.|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 456666666666666666666543
No 37
>PRK04860 hypothetical protein; Provisional
Probab=96.73 E-value=0.001 Score=53.65 Aligned_cols=39 Identities=28% Similarity=0.564 Sum_probs=25.6
Q ss_pred CcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccc
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKV 396 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 396 (410)
+|.|. |+. ....+.+|.++|+++++|.|..|+..|....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 46676 665 5566667777777777777777777666543
No 38
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.72 E-value=0.00098 Score=49.60 Aligned_cols=73 Identities=22% Similarity=0.473 Sum_probs=19.5
Q ss_pred ccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCccccccccccccCccCCCCC
Q psy12647 327 TCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKVCTTPSPQPQ 405 (410)
Q Consensus 327 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 405 (410)
+|..|+..|.+...|..|+...|.-.-+ ....+.....+..+++.-.. ..+.|.+|++.|.+...|..||+..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~~~-~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKKVK-ESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccccccC-CCCCCCccCCCCcCHHHHHHHHcCc
Confidence 5888888888888888888777764432 22223355555555544322 3588888888888888888888854
No 39
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.62 E-value=0.0013 Score=34.53 Aligned_cols=23 Identities=26% Similarity=0.701 Sum_probs=14.5
Q ss_pred cccCcCcccccCcHhHHHHHHhhc
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQHT 378 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~h~ 378 (410)
|+|+.|++... ..+|.+|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56777777766 777777777654
No 40
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.60 E-value=0.0013 Score=35.62 Aligned_cols=24 Identities=33% Similarity=0.785 Sum_probs=16.3
Q ss_pred CcccccccCccCChhhHHHHHHhhc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLH 349 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H 349 (410)
+|+|..|++.|.+...|..|+ ..|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~-~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHK-RSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHH-CTT
T ss_pred CCCCCccCCccCChhHHHHHh-HHh
Confidence 466777777777777777776 444
No 41
>KOG2231|consensus
Probab=96.38 E-value=0.0026 Score=62.23 Aligned_cols=20 Identities=30% Similarity=0.580 Sum_probs=11.2
Q ss_pred cccccccCccCChhhHHHHH
Q psy12647 326 YTCHMCGHMFTCKRRLTQHL 345 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~ 345 (410)
-.|..|...|-....|..|+
T Consensus 183 p~C~~C~~~fld~~el~rH~ 202 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHL 202 (669)
T ss_pred ccchhhhhhhccHHHHHHhh
Confidence 44555555555555555555
No 42
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.20 E-value=0.0024 Score=33.82 Aligned_cols=22 Identities=41% Similarity=0.779 Sum_probs=13.7
Q ss_pred cccCcCcccccCcHhHHHHHHh
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQ 376 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~ 376 (410)
|.|++|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666654
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.87 E-value=0.0068 Score=37.41 Aligned_cols=40 Identities=15% Similarity=0.301 Sum_probs=21.3
Q ss_pred HHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCCC
Q psy12647 212 QHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERP 251 (410)
Q Consensus 212 ~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~ 251 (410)
.+.+.....+.|-.|++|+..+....+|+.|+...++.+|
T Consensus 13 ~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 13 KKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ---CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 3333344456677777887777777778777777666554
No 44
>KOG2482|consensus
Probab=95.65 E-value=0.059 Score=47.75 Aligned_cols=23 Identities=17% Similarity=0.059 Sum_probs=14.9
Q ss_pred HHHHHhhcCCCcHHHHHHHHhhh
Q psy12647 13 MLMIMTRYSLPCVDFIQVHLRLH 35 (410)
Q Consensus 13 c~~~~c~~~~~~~~~l~~H~~~h 35 (410)
|..|.=-+-|+....+..|+..-
T Consensus 31 C~~C~e~f~~peq~~~l~Hl~~e 53 (423)
T KOG2482|consen 31 CLKCDEVFIFPEQKDCLAHLMSE 53 (423)
T ss_pred eeehhhcccCcchHHHHHHHHhh
Confidence 55555455666767788898543
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.56 E-value=0.011 Score=31.29 Aligned_cols=24 Identities=42% Similarity=0.920 Sum_probs=16.9
Q ss_pred cccccccCccCChhhHHHHHHhhcC
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHT 350 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~ 350 (410)
|+|..|++.|.+...|..|+ ..|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~-~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHM-RTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHH-HHhc
Confidence 46777777777777777776 4553
No 46
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.35 E-value=0.0084 Score=31.67 Aligned_cols=22 Identities=27% Similarity=0.709 Sum_probs=19.5
Q ss_pred eecccccccccCchhHHHHHHH
Q psy12647 89 FSCNSCGQTWLGYTSWKQHVAA 110 (410)
Q Consensus 89 ~~C~~C~~~f~~~~~l~~H~~~ 110 (410)
|.|.+|+..|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 6799999999999999999865
No 47
>PRK04860 hypothetical protein; Provisional
Probab=95.32 E-value=0.008 Score=48.47 Aligned_cols=39 Identities=23% Similarity=0.563 Sum_probs=33.0
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcH
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAY 368 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~ 368 (410)
+|.|. |+. ....+..|. .+|.++++|.|..|+..|....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~-ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHN-RVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEcC-CCC---eeCHHHHHH-HHhcCCccEECCCCCceeEEec
Confidence 68898 887 667788997 9999999999999999887654
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.30 E-value=0.011 Score=30.94 Aligned_cols=23 Identities=39% Similarity=0.752 Sum_probs=14.6
Q ss_pred cccccccCccCChhhHHHHHHhhc
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLH 349 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H 349 (410)
|+|+.|+.... ...|..|++..|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56777777776 777777775444
No 49
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.23 E-value=0.0048 Score=33.40 Aligned_cols=21 Identities=33% Similarity=0.765 Sum_probs=10.1
Q ss_pred cccCcCcccccCcHhHHHHHH
Q psy12647 355 YPCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~ 375 (410)
|.|..|++.|.+..+|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 344455555555444444443
No 50
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.21 E-value=0.0058 Score=33.06 Aligned_cols=22 Identities=18% Similarity=0.442 Sum_probs=20.7
Q ss_pred ccCccccccccccccCccCCCC
Q psy12647 383 FPCPVCQRTFRRKVCTTPSPQP 404 (410)
Q Consensus 383 ~~C~~C~~~f~~~~~l~~H~~~ 404 (410)
|.|..|++.|++...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999975
No 51
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.85 E-value=0.01 Score=52.42 Aligned_cols=105 Identities=24% Similarity=0.406 Sum_probs=60.7
Q ss_pred ccccCcccccccccChhhHHHHHHcCCCCccccccc---CccC------ChhhHHHHHHhhcCCCCCc----ccCcCccc
Q psy12647 297 RYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCG---HMFT------CKRRLTQHLARLHTGSLSY----PCTQCPGK 363 (410)
Q Consensus 297 ~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~---~~f~------~~~~l~~H~~~~H~~~~~~----~C~~C~~~ 363 (410)
.|.|+...|..+....-.|..|.+..+..+-|.+|- +.|. +...|..|. ..-..+..| .|-.|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~-~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHK-NGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccc-cCCccccCcCCCchhhhccce
Confidence 466666667777777777888887777666666664 2333 233455553 221112122 47777777
Q ss_pred ccCcHhHHHHHHhhcCCCCccCccccc-------cccccccCccCCCCCC
Q psy12647 364 FYSAYELRQHVRQHTDERPFPCPVCQR-------TFRRKVCTTPSPQPQH 406 (410)
Q Consensus 364 f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~~H 406 (410)
|.+-..|..|+|.-+. .|-+|++ .|++-.+|..|.++.|
T Consensus 230 FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h 275 (493)
T COG5236 230 FYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH 275 (493)
T ss_pred ecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc
Confidence 7777777777765332 4444443 3555556666665555
No 52
>KOG1146|consensus
Probab=94.79 E-value=0.01 Score=61.75 Aligned_cols=97 Identities=20% Similarity=0.312 Sum_probs=67.7
Q ss_pred cccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHHHHHHHhcCCCCCCCccchhhhhhcCCcccccccchh
Q psy12647 197 CHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRTFRRKGLESFTFSLRLF 276 (410)
Q Consensus 197 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~~~~~~~~~~~~ 276 (410)
|.-|+..+.....+.-|+...++-.+.++|+.|++.|.....|..|++.-+.+... ..|.
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~------------------ 498 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCK------------------ 498 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhH------------------
Confidence 44455666666777777777777778899999999999999999999985554322 2221
Q ss_pred cccCchhhcccc-----cccCCCCcccccCcccccccccChhhHHHHHHc
Q psy12647 277 CLYFPQELISHS-----SYHGGEERRYVCNHMQCTRRFHSASLLARHVST 321 (410)
Q Consensus 277 ~~~~~~~l~~H~-----~~~~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 321 (410)
....|. .+.....++|.|. .|...+..+.+|.+|+..
T Consensus 499 ------~gq~~~~~arg~~~~~~~~p~~C~--~C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 499 ------AGQNHPRLARGEVYRCPGKPYPCR--ACNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred ------hccccccccccccccCCCCcccce--eeeeeeecchHHHHHHHH
Confidence 112222 1222234889999 999999999999999765
No 53
>KOG2231|consensus
Probab=94.13 E-value=0.085 Score=52.02 Aligned_cols=47 Identities=28% Similarity=0.607 Sum_probs=29.7
Q ss_pred ccCcccccccccChhhHHHHHHcCCCCcccccc------cCccCChhhHHHHHHhhc
Q psy12647 299 VCNHMQCTRRFHSASLLARHVSTHAIQYTCHMC------GHMFTCKRRLTQHLARLH 349 (410)
Q Consensus 299 ~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C------~~~f~~~~~l~~H~~~~H 349 (410)
.|. .|...|.....|.+|++..+ |.|..| +..|.....|..|.+.-|
T Consensus 184 ~C~--~C~~~fld~~el~rH~~~~h--~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 184 LCK--FCHERFLDDDELYRHLRFDH--EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cch--hhhhhhccHHHHHHhhccce--eheeecCcccccchhcccchHHHHHhhhcC
Confidence 455 67777777777777776664 444444 355666677777764444
No 54
>KOG2785|consensus
Probab=93.99 E-value=0.067 Score=48.38 Aligned_cols=52 Identities=21% Similarity=0.417 Sum_probs=39.5
Q ss_pred CcccccccccccChhHHHHHHhhhcCC----------------------CCccccCCCC---cccCCHHHHHHHHHH
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTG----------------------SLSYPCTQCP---GKFYSAYELRQHVRQ 245 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~----------------------~~~~~C~~C~---~~f~~~~~l~~H~~~ 245 (410)
|-.|-.|++.+.+...-..||...|+- ..-+.|-+|+ +.|.+..+.+.||..
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 567777888888888888888555542 1236788888 999999999999876
No 55
>KOG1146|consensus
Probab=92.84 E-value=0.015 Score=60.72 Aligned_cols=77 Identities=22% Similarity=0.381 Sum_probs=50.4
Q ss_pred cccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHHh-h------------------------cCCCC
Q psy12647 328 CHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVRQ-H------------------------TDERP 382 (410)
Q Consensus 328 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~-h------------------------~~~~~ 382 (410)
|.-|+..+.+...+..|+...|...+.|+|+.|+-.|.+..+|..|||. | -+.++
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p 518 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP 518 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence 3334444555555555555555555666666666666666666666665 1 13467
Q ss_pred ccCccccccccccccCccCCCC
Q psy12647 383 FPCPVCQRTFRRKVCTTPSPQP 404 (410)
Q Consensus 383 ~~C~~C~~~f~~~~~l~~H~~~ 404 (410)
|.|..|.++++.+.+|..|++.
T Consensus 519 ~~C~~C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 519 YPCRACNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred ccceeeeeeeecchHHHHHHHH
Confidence 9999999999999999988764
No 56
>KOG2785|consensus
Probab=92.74 E-value=0.49 Score=43.01 Aligned_cols=25 Identities=24% Similarity=0.557 Sum_probs=20.3
Q ss_pred ceeecCcccccccccChhHHHHHHhhC
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTH 191 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~ 191 (410)
..|.|. .|...|.+...-..|+++.
T Consensus 2 t~ftC~--tC~v~F~~ad~Qr~HyKSd 26 (390)
T KOG2785|consen 2 TGFTCN--TCNVEFDDADEQRAHYKSD 26 (390)
T ss_pred Ccceee--ceeeeeccHHHHHHHhhhh
Confidence 357888 9999999998888888753
No 57
>KOG2482|consensus
Probab=92.68 E-value=0.17 Score=44.98 Aligned_cols=25 Identities=20% Similarity=0.506 Sum_probs=17.7
Q ss_pred cccccccCccCChhhHHHHHHhhcC
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHT 350 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~ 350 (410)
..|-.|.....+...|..||..+|.
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe 304 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHE 304 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHH
Confidence 4677777777777777777766663
No 58
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.24 E-value=0.12 Score=27.19 Aligned_cols=19 Identities=37% Similarity=0.840 Sum_probs=10.2
Q ss_pred ccCcCcccccCcHhHHHHHH
Q psy12647 356 PCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 356 ~C~~C~~~f~~~~~l~~H~~ 375 (410)
.|+.|++.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4455555543
No 59
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=92.18 E-value=0.084 Score=46.86 Aligned_cols=79 Identities=27% Similarity=0.417 Sum_probs=44.2
Q ss_pred eeecCcccccccccChhHHHHHHhhCCCCccccccc---ccccC------hhHHHHHHhhhcCCCCcc----ccCCCCcc
Q psy12647 166 RYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCG---HMFTC------KRRLTQHLARLHTGSLSY----PCTQCPGK 232 (410)
Q Consensus 166 ~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~---~~f~~------~~~l~~H~~~~H~~~~~~----~C~~C~~~ 232 (410)
.|.|+...|+........|..|.++.+..+-|.+|- +.|.. ...|..|. ..-..+.-| .|.+|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~-~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHK-NGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccc-cCCccccCcCCCchhhhccce
Confidence 467776667777666777888888877777777774 23322 23444443 221111111 35555555
Q ss_pred cCCHHHHHHHHHH
Q psy12647 233 FYSAYELRQHVRQ 245 (410)
Q Consensus 233 f~~~~~l~~H~~~ 245 (410)
|.....|..|++.
T Consensus 230 FYdDDEL~~HcR~ 242 (493)
T COG5236 230 FYDDDELRRHCRL 242 (493)
T ss_pred ecChHHHHHHHHh
Confidence 5555555555554
No 60
>KOG4124|consensus
Probab=91.74 E-value=0.078 Score=47.12 Aligned_cols=57 Identities=23% Similarity=0.554 Sum_probs=45.1
Q ss_pred CCcccCc--CcccccCcHhHHHHHHh-h--------------cC----CCCccCccccccccccccCccCCCCCCCCC
Q psy12647 353 LSYPCTQ--CPGKFYSAYELRQHVRQ-H--------------TD----ERPFPCPVCQRTFRRKVCTTPSPQPQHLDV 409 (410)
Q Consensus 353 ~~~~C~~--C~~~f~~~~~l~~H~~~-h--------------~~----~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 409 (410)
++|.|++ |++...+.+.|..|... | .+ .|+|.|++|.++++..-.|..|..-.|.++
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~~~~~~ 425 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTHSHLQV 425 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeehhhhhh
Confidence 6688966 99999998888888643 2 11 368999999999998888888887777654
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.69 E-value=0.17 Score=29.01 Aligned_cols=22 Identities=18% Similarity=0.567 Sum_probs=14.0
Q ss_pred CcccCcCcccccCcHhHHHHHH
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~ 375 (410)
+|.|.+|+..|.+..++..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4566666666666666666654
No 62
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=90.54 E-value=0.15 Score=28.97 Aligned_cols=10 Identities=20% Similarity=0.733 Sum_probs=5.3
Q ss_pred CCcccCcCcc
Q psy12647 353 LSYPCTQCPG 362 (410)
Q Consensus 353 ~~~~C~~C~~ 362 (410)
.++.||+|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4455555553
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.53 E-value=0.21 Score=28.67 Aligned_cols=21 Identities=29% Similarity=0.679 Sum_probs=18.4
Q ss_pred CcccccccCccCChhhHHHHH
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHL 345 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~ 345 (410)
+|.|.+|+..|.+...+..|+
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~ 23 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHL 23 (35)
T ss_pred CeEccccCCccCCHHHHHHHH
Confidence 578999999999888888887
No 64
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.23 E-value=0.07 Score=51.39 Aligned_cols=61 Identities=26% Similarity=0.463 Sum_probs=37.6
Q ss_pred CcccccccccccChhHHHHHHhh-hcCCC--CccccC--CCCcccCCHHHHHHHHHHhcCCCCCCC
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLAR-LHTGS--LSYPCT--QCPGKFYSAYELRQHVRQHTDERPFPC 254 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~-~H~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~~~~~~~~C 254 (410)
++.|..|...|.....|..|.+. .|.++ .++.|+ .|++.|.....+..|...+.+..++.+
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKE 354 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCcccc
Confidence 45666666666666666666632 56666 666666 566666666666666666665555444
No 65
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=90.20 E-value=0.43 Score=36.01 Aligned_cols=25 Identities=28% Similarity=0.506 Sum_probs=18.1
Q ss_pred ccc----CcCcccccCcHhHHHHHHhhcC
Q psy12647 355 YPC----TQCPGKFYSAYELRQHVRQHTD 379 (410)
Q Consensus 355 ~~C----~~C~~~f~~~~~l~~H~~~h~~ 379 (410)
|.| +.|++.+.+...|.+|++.++|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 677 7777777777777777776654
No 66
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=88.96 E-value=0.13 Score=49.60 Aligned_cols=151 Identities=23% Similarity=0.398 Sum_probs=113.2
Q ss_pred CccccCCCCcccCCHHHHHHHHH--HhcCC--CCCCCc--cchhhhhhcCCcccccccchhcccCchhhcccccccCCCC
Q psy12647 222 LSYPCTQCPGKFYSAYELRQHVR--QHTDE--RPFPCP--VCQRTFRRKGLESFTFSLRLFCLYFPQELISHSSYHGGEE 295 (410)
Q Consensus 222 ~~~~C~~C~~~f~~~~~l~~H~~--~~~~~--~~~~C~--~C~~~f~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~~~~ 295 (410)
.++.|..|...|.....|..|.. .|..+ +++.|+ .|++.|.... .+..|...+.+.
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~-----------------~~~~~~~~~~~~- 349 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRND-----------------ALKRHILLHTSI- 349 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccc-----------------cccCCcccccCC-
Confidence 46899999999999999999999 79999 899999 7999999988 899999988885
Q ss_pred cccccCcccccccccChhhHHHHH------HcCC-CCccccc--ccCccCChhhHHHHHHhhcCCCC--CcccCcCcccc
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHV------STHA-IQYTCHM--CGHMFTCKRRLTQHLARLHTGSL--SYPCTQCPGKF 364 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~------~~~~-~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~--~~~C~~C~~~f 364 (410)
.++.+....+...+.....-..+. .... ..+.+.. |...+.....+..|. ..|...+ .+.+..|.+.|
T Consensus 350 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 428 (467)
T COG5048 350 SPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHI-ITHLSFRPYNCKNPPCSKSF 428 (467)
T ss_pred CccccccccCccccccccCCCCccchhhccCccCCccccccccchhhhhcccccccccc-ccccccCCcCCCCCcchhhc
Confidence 666666556665555444321111 1111 4444444 667778888888886 7777766 46778899999
Q ss_pred cCcHhHHHHHHhhcCCCCccCcccccc
Q psy12647 365 YSAYELRQHVRQHTDERPFPCPVCQRT 391 (410)
Q Consensus 365 ~~~~~l~~H~~~h~~~~~~~C~~C~~~ 391 (410)
.....+..|++.|....++.|..++..
T Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (467)
T COG5048 429 NRHYNLIPHKKIHTNHAPLLCSILKSF 455 (467)
T ss_pred cCcccccccccccccCCceeecccccc
Confidence 999999999999988777666544443
No 67
>KOG2893|consensus
Probab=88.82 E-value=0.13 Score=43.11 Aligned_cols=48 Identities=25% Similarity=0.621 Sum_probs=35.6
Q ss_pred ecCcccccccccChhHHHHHHhhCCCCcccccccccccChhHHHHHHhhhcC
Q psy12647 168 VCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHT 219 (410)
Q Consensus 168 ~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 219 (410)
-|+ .|++.|.+...|+.|++.-+ |+|.+|.+..-+--.|..|.-.+|.
T Consensus 12 wcw--ycnrefddekiliqhqkakh--fkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 12 WCW--YCNREFDDEKILIQHQKAKH--FKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eee--ecccccchhhhhhhhhhhcc--ceeeeehhhhccCCCceeehhhhhh
Confidence 366 78888888888888887655 8888888877776667666545553
No 68
>KOG2893|consensus
Probab=87.74 E-value=0.19 Score=42.16 Aligned_cols=45 Identities=29% Similarity=0.602 Sum_probs=34.0
Q ss_pred cccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHH-HHhh
Q psy12647 328 CHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQH-VRQH 377 (410)
Q Consensus 328 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H-~~~h 377 (410)
|.+|++.|.+..-|.+|++. +-|+|.+|-+..-+.-.|..| |++|
T Consensus 13 cwycnrefddekiliqhqka-----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA-----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhh-----ccceeeeehhhhccCCCceeehhhhh
Confidence 88888888888888888744 447888888877777777777 4444
No 69
>KOG4173|consensus
Probab=86.91 E-value=0.31 Score=40.10 Aligned_cols=80 Identities=23% Similarity=0.516 Sum_probs=66.5
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCCCcccccccccccChhHHHHHHhhhcC---------CCCccccC--CCCccc
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHT---------GSLSYPCT--QCPGKF 233 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~---------~~~~~~C~--~C~~~f 233 (410)
..+.|+...|...|.+......|..+-+ ...|..|.+.|++..-|..|+...|. |...|+|- -|+..|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h-~~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KF 156 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLH-GNSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKF 156 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcc-cchhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhh
Confidence 5688998889999999999999876655 37899999999999999999977774 45568884 599999
Q ss_pred CCHHHHHHHHHH
Q psy12647 234 YSAYELRQHVRQ 245 (410)
Q Consensus 234 ~~~~~l~~H~~~ 245 (410)
.+...-..|+-.
T Consensus 157 kT~r~RkdH~I~ 168 (253)
T KOG4173|consen 157 KTSRDRKDHMIR 168 (253)
T ss_pred hhhhhhhhHHHH
Confidence 998888888654
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=84.67 E-value=0.11 Score=32.70 Aligned_cols=30 Identities=20% Similarity=0.361 Sum_probs=21.2
Q ss_pred CCCCCcccCcCcccccCcHhHHHHHHhhcC
Q psy12647 350 TGSLSYPCTQCPGKFYSAYELRQHVRQHTD 379 (410)
Q Consensus 350 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 379 (410)
.||--+.||.|+..|.+.....+|.+.-++
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 355567788888888777777777765554
No 71
>KOG4173|consensus
Probab=84.61 E-value=0.41 Score=39.40 Aligned_cols=80 Identities=23% Similarity=0.513 Sum_probs=63.7
Q ss_pred cccccCcccccccccChhhHHHHHHcCCCCcccccccCccCChhhHHHHHHhhc---------CCCCCcccCc--Ccccc
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLH---------TGSLSYPCTQ--CPGKF 364 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H---------~~~~~~~C~~--C~~~f 364 (410)
..+.|....|...|.+......|....+ .-.|..|.+.|.+...|..|+..-| .|...|.|-+ |+..|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h-~~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KF 156 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLH-GNSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKF 156 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcc-cchhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhh
Confidence 5678888889999999988888854433 2479999999999999999984444 4567899954 99999
Q ss_pred cCcHhHHHHHHh
Q psy12647 365 YSAYELRQHVRQ 376 (410)
Q Consensus 365 ~~~~~l~~H~~~ 376 (410)
.+...-..|+..
T Consensus 157 kT~r~RkdH~I~ 168 (253)
T KOG4173|consen 157 KTSRDRKDHMIR 168 (253)
T ss_pred hhhhhhhhHHHH
Confidence 999999999753
No 72
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=83.06 E-value=0.68 Score=26.48 Aligned_cols=11 Identities=27% Similarity=1.038 Sum_probs=6.6
Q ss_pred cccccccCccC
Q psy12647 326 YTCHMCGHMFT 336 (410)
Q Consensus 326 ~~C~~C~~~f~ 336 (410)
|.|..||..+.
T Consensus 3 ~~C~~CG~i~~ 13 (34)
T cd00729 3 WVCPVCGYIHE 13 (34)
T ss_pred EECCCCCCEeE
Confidence 56666665544
No 73
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=81.93 E-value=0.33 Score=41.61 Aligned_cols=43 Identities=19% Similarity=0.419 Sum_probs=24.3
Q ss_pred CCcccCcCcccccCcHhHHHHHHh---h-------cCCCC-----ccCcccccccccc
Q psy12647 353 LSYPCTQCPGKFYSAYELRQHVRQ---H-------TDERP-----FPCPVCQRTFRRK 395 (410)
Q Consensus 353 ~~~~C~~C~~~f~~~~~l~~H~~~---h-------~~~~~-----~~C~~C~~~f~~~ 395 (410)
+.+.||+|+..|..+.-+....+. . .+..| ..|+.||++|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 345666666666665444444332 1 12233 5788888888755
No 74
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=81.73 E-value=2.5 Score=31.79 Aligned_cols=74 Identities=19% Similarity=0.353 Sum_probs=44.7
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCCCcccccccccccChhHHHHHHhhhc-------------CC---------CC
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLH-------------TG---------SL 222 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H-------------~~---------~~ 222 (410)
+...|. .|+..... +.+..|++..+....- .....+.+-++..- .+ -.
T Consensus 10 ~vlIC~--~C~~av~~-~~v~~HL~~~H~~~~~-------~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~ 79 (109)
T PF12013_consen 10 RVLICR--QCQYAVQP-SEVESHLRKRHHILKS-------QERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYD 79 (109)
T ss_pred CEEEeC--CCCcccCc-hHHHHHHHHhcccccH-------HHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCC
Confidence 567899 89988776 7899999854421100 00111111111100 00 02
Q ss_pred cccc----CCCCcccCCHHHHHHHHHHhcC
Q psy12647 223 SYPC----TQCPGKFYSAYELRQHVRQHTD 248 (410)
Q Consensus 223 ~~~C----~~C~~~f~~~~~l~~H~~~~~~ 248 (410)
-|.| +.|++...+...|..|+..+++
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 2789 9999999999999999887764
No 75
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=79.62 E-value=0.88 Score=28.73 Aligned_cols=25 Identities=24% Similarity=0.666 Sum_probs=15.7
Q ss_pred CcccccccCccCChhhHHHHHHhhc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLH 349 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H 349 (410)
.++|+.|+..|....++.+|+...|
T Consensus 17 ~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 17 FLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred eeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4666666666666666666664444
No 76
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=77.05 E-value=1.9 Score=27.86 Aligned_cols=10 Identities=30% Similarity=0.736 Sum_probs=6.1
Q ss_pred CCcccCcCcc
Q psy12647 353 LSYPCTQCPG 362 (410)
Q Consensus 353 ~~~~C~~C~~ 362 (410)
.+|+|+.||+
T Consensus 49 ~~Y~Cp~CGF 58 (61)
T COG2888 49 NPYRCPKCGF 58 (61)
T ss_pred CceECCCcCc
Confidence 5566666664
No 77
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=75.77 E-value=1.5 Score=32.75 Aligned_cols=14 Identities=36% Similarity=0.935 Sum_probs=6.7
Q ss_pred CCccCccccccccc
Q psy12647 381 RPFPCPVCQRTFRR 394 (410)
Q Consensus 381 ~~~~C~~C~~~f~~ 394 (410)
.|..|+.||..|..
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 44445555554443
No 78
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=74.72 E-value=4.9 Score=30.15 Aligned_cols=54 Identities=22% Similarity=0.514 Sum_probs=30.5
Q ss_pred ccCcccccccccChhhHHHHHHcCCCCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCc
Q psy12647 299 VCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCP 361 (410)
Q Consensus 299 ~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~ 361 (410)
.|- .|...|........-.......|+|+.|...|-..-++-.|- ..| .|+.|.
T Consensus 57 ~C~--~C~~~f~~~~~~~~~~~~~~~~y~C~~C~~~FC~dCD~fiHe-~Lh------~CPGC~ 110 (112)
T TIGR00622 57 FCF--GCQGPFPKPPVSPFDELKDSHRYVCAVCKNVFCVDCDVFVHE-SLH------CCPGCI 110 (112)
T ss_pred ccc--CcCCCCCCcccccccccccccceeCCCCCCccccccchhhhh-hcc------CCcCCC
Confidence 366 777777654311100011114677777777777777776674 545 366665
No 79
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=74.66 E-value=2.3 Score=24.85 Aligned_cols=7 Identities=43% Similarity=1.156 Sum_probs=2.8
Q ss_pred ccccccc
Q psy12647 173 QCTRRFH 179 (410)
Q Consensus 173 ~C~~~f~ 179 (410)
.|+..|.
T Consensus 7 ~C~~~f~ 13 (37)
T PF13719_consen 7 NCQTRFR 13 (37)
T ss_pred CCCceEE
Confidence 4444443
No 80
>PHA00626 hypothetical protein
Probab=74.17 E-value=0.92 Score=28.77 Aligned_cols=9 Identities=33% Similarity=0.910 Sum_probs=3.5
Q ss_pred cccCcCccc
Q psy12647 355 YPCTQCPGK 363 (410)
Q Consensus 355 ~~C~~C~~~ 363 (410)
|.|+.|++.
T Consensus 24 YkCkdCGY~ 32 (59)
T PHA00626 24 YVCCDCGYN 32 (59)
T ss_pred eEcCCCCCe
Confidence 333333333
No 81
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=74.02 E-value=2 Score=26.23 Aligned_cols=11 Identities=27% Similarity=0.655 Sum_probs=5.5
Q ss_pred cccCcCccccc
Q psy12647 355 YPCTQCPGKFY 365 (410)
Q Consensus 355 ~~C~~C~~~f~ 365 (410)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 45555555443
No 82
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=73.94 E-value=2.7 Score=24.37 Aligned_cols=7 Identities=14% Similarity=0.942 Sum_probs=2.9
Q ss_pred ccccccc
Q psy12647 173 QCTRRFH 179 (410)
Q Consensus 173 ~C~~~f~ 179 (410)
.|+..|.
T Consensus 7 ~C~~~y~ 13 (36)
T PF13717_consen 7 NCQAKYE 13 (36)
T ss_pred CCCCEEe
Confidence 4444443
No 83
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=73.79 E-value=0.79 Score=36.76 Aligned_cols=13 Identities=23% Similarity=0.562 Sum_probs=6.9
Q ss_pred ccCcccccccccc
Q psy12647 383 FPCPVCQRTFRRK 395 (410)
Q Consensus 383 ~~C~~C~~~f~~~ 395 (410)
++|+.||++|..-
T Consensus 29 ~~c~~c~~~f~~~ 41 (154)
T PRK00464 29 RECLACGKRFTTF 41 (154)
T ss_pred eeccccCCcceEe
Confidence 5555555555543
No 84
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=73.60 E-value=1.9 Score=27.90 Aligned_cols=10 Identities=30% Similarity=0.770 Sum_probs=6.0
Q ss_pred CCcccCcCcc
Q psy12647 353 LSYPCTQCPG 362 (410)
Q Consensus 353 ~~~~C~~C~~ 362 (410)
.+|.|+.||+
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 5566666664
No 85
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=73.17 E-value=1.1 Score=30.72 Aligned_cols=42 Identities=21% Similarity=0.345 Sum_probs=22.5
Q ss_pred cccCcCcccccCcHhHHHHHHhhcCCCCccCc--cccccccccccC
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQHTDERPFPCP--VCQRTFRRKVCT 398 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~~l 398 (410)
+.|+.||........-... .-..++.++|. +||..|....++
T Consensus 2 m~CP~Cg~~a~irtSr~~s--~~~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYIT--DTTKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred ccCCCCCCccEEEEChhcC--hhhheeeeecCCCCCCCEEEEEEEE
Confidence 5677777654222211111 11344557887 788888766544
No 86
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=72.94 E-value=0.78 Score=39.32 Aligned_cols=43 Identities=21% Similarity=0.556 Sum_probs=26.1
Q ss_pred CcccccccCccCChhhHHHHHHhhc---------CCCCC-----cccCcCcccccCc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLH---------TGSLS-----YPCTQCPGKFYSA 367 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H---------~~~~~-----~~C~~C~~~f~~~ 367 (410)
.+.|++|+..|.+........+... .+..| ..||.||++|...
T Consensus 5 ~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred ceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 4667777777776655444442222 12344 3799999988754
No 87
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=72.63 E-value=3.1 Score=33.74 Aligned_cols=18 Identities=11% Similarity=0.229 Sum_probs=9.2
Q ss_pred CCcccCcCcccccCcHhH
Q psy12647 353 LSYPCTQCPGKFYSAYEL 370 (410)
Q Consensus 353 ~~~~C~~C~~~f~~~~~l 370 (410)
.-|.|+.|+..|+...++
T Consensus 108 ~~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 108 MFFICPNMCVRFTFNEAM 125 (158)
T ss_pred CeEECCCCCcEeeHHHHH
Confidence 345555555555554444
No 88
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=72.42 E-value=2.5 Score=24.76 Aligned_cols=9 Identities=22% Similarity=0.619 Sum_probs=3.7
Q ss_pred cccCCCCcc
Q psy12647 224 YPCTQCPGK 232 (410)
Q Consensus 224 ~~C~~C~~~ 232 (410)
..|+.|+..
T Consensus 26 v~C~~C~~~ 34 (38)
T TIGR02098 26 VRCGKCGHV 34 (38)
T ss_pred EECCCCCCE
Confidence 344444433
No 89
>PF12907 zf-met2: Zinc-binding
Probab=72.06 E-value=2.8 Score=24.95 Aligned_cols=32 Identities=22% Similarity=0.451 Sum_probs=21.3
Q ss_pred cccccccc---cccChhHHHHHHhhhcCCCCcccc
Q psy12647 195 YTCHMCGH---MFTCKRRLTQHLARLHTGSLSYPC 226 (410)
Q Consensus 195 ~~C~~C~~---~f~~~~~l~~H~~~~H~~~~~~~C 226 (410)
+.|.+|-. ...+...|..|....|++...-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 56778874 345557788888778887544333
No 90
>KOG2186|consensus
Probab=72.05 E-value=2 Score=36.87 Aligned_cols=45 Identities=29% Similarity=0.716 Sum_probs=34.9
Q ss_pred eecccccccccCchhHHHHHHHhhhcCCCCcccCCcCcccccCcccccccc
Q psy12647 89 FSCNSCGQTWLGYTSWKQHVAACREKRRKETFPCKRCDERFSTRDGLESFT 139 (410)
Q Consensus 89 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 139 (410)
|.|..||.+..-.. +.+|+..|+. .-|.|-.|+..|.. ...+.|.
T Consensus 4 FtCnvCgEsvKKp~-vekH~srCrn----~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQ-VEKHMSRCRN----AYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhccccc-hHHHHHhccC----CeeEEeeccccccc-chhhhhh
Confidence 78999999887554 7789988644 57999999999998 4545553
No 91
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=70.67 E-value=1.9 Score=35.75 Aligned_cols=30 Identities=23% Similarity=0.611 Sum_probs=16.5
Q ss_pred CcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccc
Q psy12647 325 QYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKF 364 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 364 (410)
.|.|+.|+..|+....+. ..|.||.||...
T Consensus 117 ~Y~Cp~C~~rytf~eA~~----------~~F~Cp~Cg~~L 146 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAME----------YGFRCPQCGEML 146 (178)
T ss_pred EEECCCCCcEEeHHHHhh----------cCCcCCCCCCCC
Confidence 466666666666554432 246666666543
No 92
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=70.30 E-value=3.6 Score=25.85 Aligned_cols=12 Identities=42% Similarity=0.653 Sum_probs=5.9
Q ss_pred hhHHHHHHhhcC
Q psy12647 339 RRLTQHLARLHT 350 (410)
Q Consensus 339 ~~l~~H~~~~H~ 350 (410)
+.|.+|+...|+
T Consensus 37 s~L~rHl~~~h~ 48 (50)
T smart00614 37 SNLRRHLRRKHP 48 (50)
T ss_pred HHHHHHHHhHCc
Confidence 455566533443
No 93
>PF12907 zf-met2: Zinc-binding
Probab=68.23 E-value=3 Score=24.77 Aligned_cols=29 Identities=21% Similarity=0.402 Sum_probs=14.8
Q ss_pred cccccccC---ccCChhhHHHHHHhhcCCCCC
Q psy12647 326 YTCHMCGH---MFTCKRRLTQHLARLHTGSLS 354 (410)
Q Consensus 326 ~~C~~C~~---~f~~~~~l~~H~~~~H~~~~~ 354 (410)
+.|.+|.. ...+...|..|....|+...+
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~ 33 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTF 33 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCH
Confidence 34555553 333344566666556655433
No 94
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=68.00 E-value=3.1 Score=33.70 Aligned_cols=11 Identities=27% Similarity=0.640 Sum_probs=6.6
Q ss_pred CCCCcccCcCc
Q psy12647 351 GSLSYPCTQCP 361 (410)
Q Consensus 351 ~~~~~~C~~C~ 361 (410)
++.|-+||+||
T Consensus 146 ge~P~~CPiCg 156 (166)
T COG1592 146 GEAPEVCPICG 156 (166)
T ss_pred CCCCCcCCCCC
Confidence 35566666666
No 95
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=67.51 E-value=2.8 Score=25.56 Aligned_cols=22 Identities=27% Similarity=0.588 Sum_probs=8.9
Q ss_pred cccccCcccccccccC----hhhHHHHH
Q psy12647 296 RRYVCNHMQCTRRFHS----ASLLARHV 319 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~----~~~l~~H~ 319 (410)
....|. .|++.+.. .+.|.+|+
T Consensus 15 ~~a~C~--~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCK--YCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEET--TTTEE-----SSTHHHHHHH
T ss_pred CeEEeC--CCCeEEeeCCCcHHHHHHhh
Confidence 445555 55554443 24444444
No 96
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=65.21 E-value=3.7 Score=33.31 Aligned_cols=24 Identities=33% Similarity=0.838 Sum_probs=20.0
Q ss_pred CcccCcCcccccCcHhHHHHHHhhcCCCCccCccccc
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQR 390 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 390 (410)
.|.|++||+. +.|+.|-+|++||-
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCCC
Confidence 6999999975 45678999999993
No 97
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=63.32 E-value=1.5 Score=37.68 Aligned_cols=64 Identities=22% Similarity=0.489 Sum_probs=41.5
Q ss_pred cccccCcccccccccChhhHHHHHHcCCCCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhHHHHHH
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 375 (410)
..|+|. .|++. ..|.|..|...|-.......=+ .. ...+++.||.|+........|..-.|
T Consensus 170 E~~KC~--SCNrl---------------Gq~sCLRCK~cfCddHvrrKg~-ky-~k~k~~PCPKCg~et~eTkdLSmStR 230 (314)
T PF06524_consen 170 ETFKCQ--SCNRL---------------GQYSCLRCKICFCDDHVRRKGF-KY-EKGKPIPCPKCGYETQETKDLSMSTR 230 (314)
T ss_pred cccccc--ccccc---------------cchhhhheeeeehhhhhhhccc-cc-ccCCCCCCCCCCCcccccccceeeee
Confidence 677787 77643 3578888888776542222111 11 23478999999998888777776666
Q ss_pred hhc
Q psy12647 376 QHT 378 (410)
Q Consensus 376 ~h~ 378 (410)
+|.
T Consensus 231 ~hk 233 (314)
T PF06524_consen 231 SHK 233 (314)
T ss_pred cch
Confidence 653
No 98
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=62.81 E-value=4.8 Score=21.40 Aligned_cols=10 Identities=40% Similarity=0.823 Sum_probs=4.9
Q ss_pred cccccccCcc
Q psy12647 326 YTCHMCGHMF 335 (410)
Q Consensus 326 ~~C~~C~~~f 335 (410)
-.|+.||..|
T Consensus 15 ~~Cp~CG~~F 24 (26)
T PF10571_consen 15 KFCPHCGYDF 24 (26)
T ss_pred CcCCCCCCCC
Confidence 3455555554
No 99
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=62.61 E-value=2.2 Score=25.36 Aligned_cols=11 Identities=55% Similarity=1.419 Sum_probs=5.6
Q ss_pred cccccccCccC
Q psy12647 326 YTCHMCGHMFT 336 (410)
Q Consensus 326 ~~C~~C~~~f~ 336 (410)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 45555555544
No 100
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=62.60 E-value=2.8 Score=29.53 Aligned_cols=11 Identities=27% Similarity=0.857 Sum_probs=6.2
Q ss_pred ccCcccccccc
Q psy12647 383 FPCPVCQRTFR 393 (410)
Q Consensus 383 ~~C~~C~~~f~ 393 (410)
++|.-|+..|+
T Consensus 54 W~C~kCg~~fA 64 (89)
T COG1997 54 WKCRKCGAKFA 64 (89)
T ss_pred EEcCCCCCeec
Confidence 55555555554
No 101
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=62.19 E-value=1.6 Score=26.34 Aligned_cols=13 Identities=46% Similarity=1.137 Sum_probs=7.0
Q ss_pred cccccccCccCCh
Q psy12647 326 YTCHMCGHMFTCK 338 (410)
Q Consensus 326 ~~C~~C~~~f~~~ 338 (410)
|+|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 6 YRCEECGHEFEVL 18 (42)
T ss_pred EEeCCCCCEEEEE
Confidence 5555555555543
No 102
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=62.18 E-value=3.5 Score=25.70 Aligned_cols=11 Identities=27% Similarity=0.945 Sum_probs=6.8
Q ss_pred CcccCcCcccc
Q psy12647 354 SYPCTQCPGKF 364 (410)
Q Consensus 354 ~~~C~~C~~~f 364 (410)
.|.|..||+.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 36666666665
No 103
>KOG4377|consensus
Probab=61.87 E-value=3.9 Score=37.70 Aligned_cols=28 Identities=25% Similarity=0.579 Sum_probs=22.4
Q ss_pred cccccCcccccccccChhhHHHHHHcCC
Q psy12647 296 RRYVCNHMQCTRRFHSASLLARHVSTHA 323 (410)
Q Consensus 296 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~ 323 (410)
.-+.|.+..|+.++.+.+.+..|.+.|.
T Consensus 400 nhfhc~r~Gc~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 400 NHFHCDRLGCEATLYSVSQMASHKRKHE 427 (480)
T ss_pred eeeeecccCCceEEEehhhhhhhhhhhh
Confidence 3466778888888888888888887775
No 104
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=60.47 E-value=2.6 Score=26.74 Aligned_cols=12 Identities=50% Similarity=1.315 Sum_probs=6.9
Q ss_pred cccccccCccCC
Q psy12647 326 YTCHMCGHMFTC 337 (410)
Q Consensus 326 ~~C~~C~~~f~~ 337 (410)
|+|..|+..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 556666655553
No 105
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=60.02 E-value=8.5 Score=20.40 Aligned_cols=19 Identities=16% Similarity=0.438 Sum_probs=11.3
Q ss_pred ccCcCcccccCcHhHHHHHH
Q psy12647 356 PCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 356 ~C~~C~~~f~~~~~l~~H~~ 375 (410)
.||+|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 467777665 4455555654
No 106
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=58.65 E-value=5.8 Score=30.30 Aligned_cols=13 Identities=8% Similarity=-0.120 Sum_probs=6.6
Q ss_pred CCcccCcCccccc
Q psy12647 353 LSYPCTQCPGKFY 365 (410)
Q Consensus 353 ~~~~C~~C~~~f~ 365 (410)
.|-.|+.||..|.
T Consensus 25 ~p~vcP~cg~~~~ 37 (129)
T TIGR02300 25 RPAVSPYTGEQFP 37 (129)
T ss_pred CCccCCCcCCccC
Confidence 4445555555543
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=56.13 E-value=6.3 Score=31.49 Aligned_cols=13 Identities=31% Similarity=0.772 Sum_probs=6.7
Q ss_pred CcccccccCccCC
Q psy12647 325 QYTCHMCGHMFTC 337 (410)
Q Consensus 325 ~~~C~~C~~~f~~ 337 (410)
.|.|+.|+..|+.
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3555555555553
No 108
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=52.93 E-value=8.2 Score=29.74 Aligned_cols=15 Identities=27% Similarity=0.669 Sum_probs=9.9
Q ss_pred CcccccccCccCChh
Q psy12647 325 QYTCHMCGHMFTCKR 339 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~ 339 (410)
|++|..||+.|.+-+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 456777777777654
No 109
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=52.64 E-value=8.3 Score=28.00 Aligned_cols=16 Identities=25% Similarity=0.627 Sum_probs=10.0
Q ss_pred CcccccccCccCChhh
Q psy12647 325 QYTCHMCGHMFTCKRR 340 (410)
Q Consensus 325 ~~~C~~C~~~f~~~~~ 340 (410)
|++|..||..|.+-+.
T Consensus 2 pH~CtrCG~vf~~g~~ 17 (112)
T COG3364 2 PHQCTRCGEVFDDGSE 17 (112)
T ss_pred CceecccccccccccH
Confidence 4567777777766443
No 110
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=52.54 E-value=11 Score=32.53 Aligned_cols=19 Identities=21% Similarity=0.469 Sum_probs=9.5
Q ss_pred CccccCCCCcccCCHHHHH
Q psy12647 222 LSYPCTQCPGKFYSAYELR 240 (410)
Q Consensus 222 ~~~~C~~C~~~f~~~~~l~ 240 (410)
+++.||.|+........|.
T Consensus 208 k~~PCPKCg~et~eTkdLS 226 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLS 226 (314)
T ss_pred CCCCCCCCCCcccccccce
Confidence 4455555555544444443
No 111
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=50.62 E-value=6.4 Score=21.76 Aligned_cols=25 Identities=24% Similarity=0.700 Sum_probs=10.0
Q ss_pred ccCcCcccccCcHhHHHHHHhhcCCCCccCcccccc
Q psy12647 356 PCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRT 391 (410)
Q Consensus 356 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 391 (410)
+|+.|+..++. .+.-.|.|+.|+..
T Consensus 4 ~Cp~C~se~~y-----------~D~~~~vCp~C~~e 28 (30)
T PF08274_consen 4 KCPLCGSEYTY-----------EDGELLVCPECGHE 28 (30)
T ss_dssp --TTT-----E-----------E-SSSEEETTTTEE
T ss_pred CCCCCCCccee-----------ccCCEEeCCccccc
Confidence 46667666554 33345677777643
No 112
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=50.27 E-value=11 Score=23.15 Aligned_cols=15 Identities=20% Similarity=0.756 Sum_probs=11.0
Q ss_pred cccCCcCcccccCcc
Q psy12647 119 TFPCKRCDERFSTRD 133 (410)
Q Consensus 119 ~~~C~~C~~~f~~~~ 133 (410)
.|.|+.||..|.-..
T Consensus 3 ~y~C~~CG~~~~~~~ 17 (46)
T PRK00398 3 EYKCARCGREVELDE 17 (46)
T ss_pred EEECCCCCCEEEECC
Confidence 578888888776544
No 113
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=48.08 E-value=11 Score=28.23 Aligned_cols=31 Identities=23% Similarity=0.608 Sum_probs=23.7
Q ss_pred cccCCcCcccccCcccccccccccccccccchhhhhccccccCCCcceeecCcccccccccCh
Q psy12647 119 TFPCKRCDERFSTRDGLESFTFSLRLFCLYFPQELISHSSYHGGEERRYVCNHMQCTRRFHSA 181 (410)
Q Consensus 119 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~l~~h~~~~~~~~~~~~C~~~~C~~~f~~~ 181 (410)
...|+.||..|-.... .+..|+ .||..|.-.
T Consensus 9 KR~Cp~CG~kFYDLnk------------------------------~PivCP--~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDLNK------------------------------DPIVCP--KCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccCCC------------------------------CCccCC--CCCCccCcc
Confidence 3579999999988543 667888 888888755
No 114
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=48.06 E-value=7.1 Score=25.45 Aligned_cols=33 Identities=27% Similarity=0.362 Sum_probs=22.2
Q ss_pred HHHHHHHHhhhcCCCcccCCC----CCCccccc-hhce
Q psy12647 25 VDFIQVHLRLHSGERRFPCEY----CSLLVYPV-LILF 57 (410)
Q Consensus 25 ~~~l~~H~~~h~~~~~~~C~~----C~~~f~~~-l~~h 57 (410)
+..|..|+...-+.++..|++ |+..+... |..|
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~~~l~~H 60 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVPREDLEEH 60 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEEHHHHHHC
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccchhHHhCC
Confidence 567899998776778889999 87776555 5443
No 115
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=46.90 E-value=43 Score=25.24 Aligned_cols=21 Identities=24% Similarity=0.389 Sum_probs=15.3
Q ss_pred CceecccccccccCchhHHHH
Q psy12647 87 KTFSCNSCGQTWLGYTSWKQH 107 (410)
Q Consensus 87 ~~~~C~~C~~~f~~~~~l~~H 107 (410)
-|..|+.|+.+..+...|.+-
T Consensus 14 LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCcCCcCCCEEeccchHHHh
Confidence 467788888888776666664
No 116
>PF15269 zf-C2H2_7: Zinc-finger
Probab=46.84 E-value=13 Score=22.38 Aligned_cols=22 Identities=32% Similarity=0.597 Sum_probs=15.7
Q ss_pred cccCcCcccccCcHhHHHHHHh
Q psy12647 355 YPCTQCPGKFYSAYELRQHVRQ 376 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~~H~~~ 376 (410)
|+|=+|......++.|-+||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 5677777777777777777653
No 117
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=45.61 E-value=14 Score=29.89 Aligned_cols=30 Identities=27% Similarity=0.697 Sum_probs=18.5
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCCCccccccccc
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHM 203 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~ 203 (410)
..|.|+ .|+..|+...++. ..|.|+.||..
T Consensus 108 ~~Y~Cp--~c~~r~tf~eA~~-------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 108 MFFICP--NMCVRFTFNEAME-------LNFTCPRCGAM 137 (158)
T ss_pred CeEECC--CCCcEeeHHHHHH-------cCCcCCCCCCE
Confidence 566676 6777666665553 14666666653
No 118
>KOG2593|consensus
Probab=44.61 E-value=12 Score=35.17 Aligned_cols=35 Identities=26% Similarity=0.620 Sum_probs=19.3
Q ss_pred CCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCcc
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPG 362 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 362 (410)
..|.|+.|++.|+....+. .+-.....|.|..|+-
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~----L~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQ----LLDNETGEFHCENCGG 161 (436)
T ss_pred ccccCCccccchhhhHHHH----hhcccCceEEEecCCC
Confidence 4466666666666655442 3333344566666663
No 119
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=44.34 E-value=11 Score=20.48 Aligned_cols=19 Identities=21% Similarity=0.511 Sum_probs=12.4
Q ss_pred ccCccccccccccccCccCC
Q psy12647 383 FPCPVCQRTFRRKVCTTPSP 402 (410)
Q Consensus 383 ~~C~~C~~~f~~~~~l~~H~ 402 (410)
|.|-.|++.| ...+.+.|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 5688899999 555566664
No 120
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=42.16 E-value=26 Score=27.11 Aligned_cols=43 Identities=23% Similarity=0.450 Sum_probs=29.0
Q ss_pred CCCCceecccccccccCch------------------hHHHHHHHhhhcCCCCcccCCcCcccccCccc
Q psy12647 84 LHPKTFSCNSCGQTWLGYT------------------SWKQHVAACREKRRKETFPCKRCDERFSTRDG 134 (410)
Q Consensus 84 ~~~~~~~C~~C~~~f~~~~------------------~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~ 134 (410)
.+.+.|.|++|..+..... .|.+|... + =.||.|.-+|++...
T Consensus 76 ~d~~lYeCnIC~etS~ee~FLKPneCCgY~iCn~Cya~LWK~~~~---y-----pvCPvCkTSFKss~~ 136 (140)
T PF05290_consen 76 LDPKLYECNICKETSAEERFLKPNECCGYSICNACYANLWKFCNL---Y-----PVCPVCKTSFKSSSS 136 (140)
T ss_pred cCCCceeccCcccccchhhcCCcccccchHHHHHHHHHHHHHccc---C-----CCCCccccccccccc
Confidence 4567899999987776543 23444322 1 269999999988764
No 121
>KOG1044|consensus
Probab=41.47 E-value=7.9 Score=37.56 Aligned_cols=42 Identities=19% Similarity=0.424 Sum_probs=22.1
Q ss_pred cccccccCccCChhhHHHHHHhhcCCCCCcccCcCcccccCcHhH
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYEL 370 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l 370 (410)
.+|..|.+-++.+.....= ...|+.- -.|..|+.-|.....|
T Consensus 193 vkc~~c~~fisgkvLqag~-kh~HPtC--ARCsRCgqmF~eGEEM 234 (670)
T KOG1044|consen 193 VKCEECEKFISGKVLQAGD-KHFHPTC--ARCSRCGQMFGEGEEM 234 (670)
T ss_pred eehHHhhhhhhhhhhhccC-cccCcch--hhhhhhccccccchhe
Confidence 5677777766665322211 1222221 2577777777766544
No 122
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=40.67 E-value=14 Score=26.51 Aligned_cols=14 Identities=29% Similarity=0.752 Sum_probs=8.0
Q ss_pred CceecccccccccC
Q psy12647 87 KTFSCNSCGQTWLG 100 (410)
Q Consensus 87 ~~~~C~~C~~~f~~ 100 (410)
..+.|..|+++|..
T Consensus 53 GIW~C~~C~~~~AG 66 (90)
T PTZ00255 53 GIWRCKGCKKTVAG 66 (90)
T ss_pred EEEEcCCCCCEEeC
Confidence 44666666666543
No 123
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=40.07 E-value=20 Score=29.64 Aligned_cols=29 Identities=24% Similarity=0.708 Sum_probs=16.2
Q ss_pred ceeecCcccccccccChhHHHHHHhhCCCCcccccccc
Q psy12647 165 RRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGH 202 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~ 202 (410)
..|.|+ .|+..|+...++. ..|.|+.||.
T Consensus 116 ~~Y~Cp--~C~~rytf~eA~~-------~~F~Cp~Cg~ 144 (178)
T PRK06266 116 MFFFCP--NCHIRFTFDEAME-------YGFRCPQCGE 144 (178)
T ss_pred CEEECC--CCCcEEeHHHHhh-------cCCcCCCCCC
Confidence 556666 6666666554432 1466666654
No 124
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=39.72 E-value=20 Score=36.83 Aligned_cols=30 Identities=27% Similarity=0.626 Sum_probs=18.6
Q ss_pred hhcCCCCCcccCcCcccccCcHhHHHHHHhhcCCCCccCcccccc
Q psy12647 347 RLHTGSLSYPCTQCPGKFYSAYELRQHVRQHTDERPFPCPVCQRT 391 (410)
Q Consensus 347 ~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 391 (410)
..|...+...|..||.. ...|..|+.||-.
T Consensus 455 t~H~~~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 455 TLHKATGQLRCHYCGYQ---------------EPIPQSCPECGSE 484 (730)
T ss_pred EEecCCCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence 34445566777777753 2366778887754
No 125
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=39.62 E-value=18 Score=30.94 Aligned_cols=25 Identities=24% Similarity=0.723 Sum_probs=14.0
Q ss_pred cccccccCccCChhhHHHHHHhhcC
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHT 350 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~ 350 (410)
|.|.+|+|.|....-+.+|+...|.
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred ECCCCCCcccCChHHHHHHHhhcCH
Confidence 5666666666666666666656554
No 126
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=38.93 E-value=20 Score=22.13 Aligned_cols=17 Identities=18% Similarity=0.454 Sum_probs=13.5
Q ss_pred ceeecCcccccccccCh
Q psy12647 165 RRYVCNHMQCTRRFHSA 181 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~ 181 (410)
.-++|.+..||..|...
T Consensus 24 ~Y~qC~N~~Cg~tfv~~ 40 (47)
T PF04606_consen 24 LYCQCTNPECGHTFVAN 40 (47)
T ss_pred EEEEECCCcCCCEEEEE
Confidence 66889988899988643
No 127
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=38.91 E-value=15 Score=19.11 Aligned_cols=9 Identities=44% Similarity=1.316 Sum_probs=5.8
Q ss_pred CcccCCcCc
Q psy12647 118 ETFPCKRCD 126 (410)
Q Consensus 118 ~~~~C~~C~ 126 (410)
..|.||.||
T Consensus 15 v~f~CPnCG 23 (24)
T PF07754_consen 15 VPFPCPNCG 23 (24)
T ss_pred ceEeCCCCC
Confidence 456777766
No 128
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=38.56 E-value=6.7 Score=29.16 Aligned_cols=13 Identities=23% Similarity=0.718 Sum_probs=7.6
Q ss_pred CCCCccchhhhhh
Q psy12647 251 PFPCPVCQRTFRR 263 (410)
Q Consensus 251 ~~~C~~C~~~f~~ 263 (410)
.+.|+.|+..+..
T Consensus 16 ~~~C~~C~~~~~~ 28 (104)
T TIGR01384 16 VYVCPSCGYEKEK 28 (104)
T ss_pred eEECcCCCCcccc
Confidence 4667677665443
No 129
>PF14353 CpXC: CpXC protein
Probab=37.50 E-value=23 Score=27.41 Aligned_cols=57 Identities=14% Similarity=0.233 Sum_probs=33.2
Q ss_pred eecccccccccCchhHHHH------HHHhhhcCCCCcccCCcCcccccCcccccccccccccc
Q psy12647 89 FSCNSCGQTWLGYTSWKQH------VAACREKRRKETFPCKRCDERFSTRDGLESFTFSLRLF 145 (410)
Q Consensus 89 ~~C~~C~~~f~~~~~l~~H------~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 145 (410)
..|+.|+..|.......-. ++..-....-..+.||.||..|.-...+..|-......
T Consensus 2 itCP~C~~~~~~~v~~~I~~~~~p~l~e~il~g~l~~~~CP~Cg~~~~~~~p~lY~D~~~~~~ 64 (128)
T PF14353_consen 2 ITCPHCGHEFEFEVWTSINADEDPELKEKILDGSLFSFTCPSCGHKFRLEYPLLYHDPEKKFM 64 (128)
T ss_pred cCCCCCCCeeEEEEEeEEcCcCCHHHHHHHHcCCcCEEECCCCCCceecCCCEEEEcCCCCEE
Confidence 4689998888543321111 11100122235689999999999888877665444433
No 130
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=37.37 E-value=15 Score=26.34 Aligned_cols=14 Identities=29% Similarity=1.006 Sum_probs=7.9
Q ss_pred CceecccccccccC
Q psy12647 87 KTFSCNSCGQTWLG 100 (410)
Q Consensus 87 ~~~~C~~C~~~f~~ 100 (410)
..+.|..|+++|..
T Consensus 52 GIW~C~~C~~~~AG 65 (91)
T TIGR00280 52 GIWTCRKCGAKFAG 65 (91)
T ss_pred EEEEcCCCCCEEeC
Confidence 34666666666543
No 131
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=37.11 E-value=19 Score=28.70 Aligned_cols=15 Identities=20% Similarity=0.662 Sum_probs=8.5
Q ss_pred ceeecCcccccccccCh
Q psy12647 165 RRYVCNHMQCTRRFHSA 181 (410)
Q Consensus 165 ~~~~C~~~~C~~~f~~~ 181 (410)
..|.|+ .|+..|...
T Consensus 98 ~~Y~Cp--~C~~~y~~~ 112 (147)
T smart00531 98 AYYKCP--NCQSKYTFL 112 (147)
T ss_pred cEEECc--CCCCEeeHH
Confidence 456666 666665543
No 132
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=37.08 E-value=24 Score=25.13 Aligned_cols=29 Identities=21% Similarity=0.520 Sum_probs=17.5
Q ss_pred CCcccccccCccCChhhHHHHHHhhcCCCCCcccCcCccc
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGK 363 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 363 (410)
.|-+|..||..|.+.. -..|-.||.|...
T Consensus 57 ~Pa~CkkCGfef~~~~-----------ik~pSRCP~CKSE 85 (97)
T COG3357 57 RPARCKKCGFEFRDDK-----------IKKPSRCPKCKSE 85 (97)
T ss_pred cChhhcccCccccccc-----------cCCcccCCcchhh
Confidence 5667777777776521 1345667777643
No 133
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=37.05 E-value=23 Score=21.32 Aligned_cols=9 Identities=33% Similarity=1.073 Sum_probs=3.6
Q ss_pred eeccccccc
Q psy12647 89 FSCNSCGQT 97 (410)
Q Consensus 89 ~~C~~C~~~ 97 (410)
+.|..||..
T Consensus 20 ~vC~~CG~V 28 (43)
T PF08271_consen 20 LVCPNCGLV 28 (43)
T ss_dssp EEETTT-BB
T ss_pred EECCCCCCE
Confidence 444444443
No 134
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.60 E-value=19 Score=26.61 Aligned_cols=12 Identities=17% Similarity=0.581 Sum_probs=5.9
Q ss_pred CCccCccccccc
Q psy12647 381 RPFPCPVCQRTF 392 (410)
Q Consensus 381 ~~~~C~~C~~~f 392 (410)
.|..|++||++|
T Consensus 25 dPiVsPytG~s~ 36 (129)
T COG4530 25 DPIVSPYTGKSY 36 (129)
T ss_pred CccccCcccccc
Confidence 444555555544
No 135
>KOG2593|consensus
Probab=36.20 E-value=40 Score=31.81 Aligned_cols=15 Identities=13% Similarity=0.394 Sum_probs=9.1
Q ss_pred cCCCcccCCCCCCcc
Q psy12647 36 SGERRFPCEYCSLLV 50 (410)
Q Consensus 36 ~~~~~~~C~~C~~~f 50 (410)
+....|+|+.|++.|
T Consensus 124 t~~~~Y~Cp~C~kky 138 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKY 138 (436)
T ss_pred cccccccCCccccch
Confidence 344567777775444
No 136
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=35.91 E-value=7.3 Score=39.04 Aligned_cols=80 Identities=19% Similarity=0.402 Sum_probs=52.1
Q ss_pred eecccccccccCchhHHHHHHHhhhcCCCCcccCCcCcccccCcccccccccccccccccchhhhhccccccCCCcceee
Q psy12647 89 FSCNSCGQTWLGYTSWKQHVAACREKRRKETFPCKRCDERFSTRDGLESFTFSLRLFCLYFPQELISHSSYHGGEERRYV 168 (410)
Q Consensus 89 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~l~~h~~~~~~~~~~~~ 168 (410)
=.|+.|-+...+..+-+-+.- ...|..||-+|+-...|- -- +..+.-...-.
T Consensus 102 a~C~~Cl~Ei~dp~~rrY~YP---------F~~CT~CGPRfTIi~alP------------------YD-R~nTsM~~F~l 153 (750)
T COG0068 102 ATCEDCLEEIFDPNSRRYLYP---------FINCTNCGPRFTIIEALP------------------YD-RENTSMADFPL 153 (750)
T ss_pred hhhHHHHHHhcCCCCcceecc---------ccccCCCCcceeeeccCC------------------CC-cccCccccCcC
Confidence 458888777666655433321 236999999999877732 11 22222224457
Q ss_pred cCcccccccccChhHHHHHHhhCCCCcccccccc
Q psy12647 169 CNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGH 202 (410)
Q Consensus 169 C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~ 202 (410)
|+ .|.+.|.+..+ .+.|..+..|+.||=
T Consensus 154 C~--~C~~EY~dP~n----RRfHAQp~aCp~CGP 181 (750)
T COG0068 154 CP--FCDKEYKDPLN----RRFHAQPIACPKCGP 181 (750)
T ss_pred CH--HHHHHhcCccc----cccccccccCcccCC
Confidence 88 99999988876 344556899999984
No 137
>KOG2186|consensus
Probab=34.39 E-value=16 Score=31.53 Aligned_cols=30 Identities=23% Similarity=0.542 Sum_probs=16.1
Q ss_pred HHHHHHcCC-CCcccccccCccCChhhHHHHH
Q psy12647 315 LARHVSTHA-IQYTCHMCGHMFTCKRRLTQHL 345 (410)
Q Consensus 315 l~~H~~~~~-~~~~C~~C~~~f~~~~~l~~H~ 345 (410)
+.+|+...+ .-|.|-.|+.+|.. ..+..|.
T Consensus 18 vekH~srCrn~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 18 VEKHMSRCRNAYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred hHHHHHhccCCeeEEeeccccccc-chhhhhh
Confidence 333444433 34666666666665 4555665
No 138
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=33.21 E-value=27 Score=31.15 Aligned_cols=22 Identities=23% Similarity=0.525 Sum_probs=14.9
Q ss_pred CCcccccccCccCChhhHHHHH
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHL 345 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~ 345 (410)
..|+|+.|...|-..-+.-.|-
T Consensus 387 ~rY~Ce~CK~~FC~dCdvfiHe 408 (421)
T COG5151 387 GRYQCELCKSTFCSDCDVFIHE 408 (421)
T ss_pred cceechhhhhhhhhhhHHHHHH
Confidence 4577777777777766666664
No 139
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=32.34 E-value=29 Score=29.68 Aligned_cols=27 Identities=22% Similarity=0.678 Sum_probs=19.0
Q ss_pred CcccccccccccChhHHHHHHhhhcCC
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTG 220 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~ 220 (410)
.|.|..|++.|....-+.+|+...|..
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred EECCCCCCcccCChHHHHHHHhhcCHH
Confidence 577888888888888888888777763
No 140
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=32.15 E-value=25 Score=22.14 Aligned_cols=11 Identities=27% Similarity=0.519 Sum_probs=6.5
Q ss_pred CccCccccccc
Q psy12647 382 PFPCPVCQRTF 392 (410)
Q Consensus 382 ~~~C~~C~~~f 392 (410)
.+.|..||+.+
T Consensus 37 r~~C~~Cgyt~ 47 (50)
T PRK00432 37 RWHCGKCGYTE 47 (50)
T ss_pred cEECCCcCCEE
Confidence 45666666554
No 141
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=32.15 E-value=26 Score=19.66 Aligned_cols=12 Identities=17% Similarity=0.675 Sum_probs=7.3
Q ss_pred ccCCcCcccccC
Q psy12647 120 FPCKRCDERFST 131 (410)
Q Consensus 120 ~~C~~C~~~f~~ 131 (410)
|.|..|+..+..
T Consensus 1 Y~C~~Cg~~~~~ 12 (32)
T PF03604_consen 1 YICGECGAEVEL 12 (32)
T ss_dssp EBESSSSSSE-B
T ss_pred CCCCcCCCeeEc
Confidence 467777776653
No 142
>KOG1280|consensus
Probab=31.45 E-value=37 Score=30.93 Aligned_cols=37 Identities=27% Similarity=0.607 Sum_probs=25.7
Q ss_pred CcccccccccccChhHHHHHHhhhcCCCCc-cccCCCC
Q psy12647 194 QYTCHMCGHMFTCKRRLTQHLARLHTGSLS-YPCTQCP 230 (410)
Q Consensus 194 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~-~~C~~C~ 230 (410)
.|.|++|+..=-++..|..|+...|+.-.+ ..|++|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 478888888888888888888777764332 3455553
No 143
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=31.44 E-value=24 Score=23.70 Aligned_cols=9 Identities=44% Similarity=1.003 Sum_probs=2.5
Q ss_pred ccCcCcccc
Q psy12647 356 PCTQCPGKF 364 (410)
Q Consensus 356 ~C~~C~~~f 364 (410)
.|..|++.|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 355555555
No 144
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=31.32 E-value=21 Score=25.67 Aligned_cols=14 Identities=29% Similarity=0.975 Sum_probs=8.0
Q ss_pred CceecccccccccC
Q psy12647 87 KTFSCNSCGQTWLG 100 (410)
Q Consensus 87 ~~~~C~~C~~~f~~ 100 (410)
..+.|..|+++|..
T Consensus 53 GIW~C~~C~~~~AG 66 (90)
T PRK03976 53 GIWECRKCGAKFAG 66 (90)
T ss_pred EEEEcCCCCCEEeC
Confidence 34666666666543
No 145
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=30.15 E-value=22 Score=27.65 Aligned_cols=14 Identities=29% Similarity=0.755 Sum_probs=7.2
Q ss_pred CCcccCcCcccccC
Q psy12647 353 LSYPCTQCPGKFYS 366 (410)
Q Consensus 353 ~~~~C~~C~~~f~~ 366 (410)
..|.|+.|++.|+.
T Consensus 52 qRyrC~~C~~tf~~ 65 (129)
T COG3677 52 QRYKCKSCGSTFTV 65 (129)
T ss_pred cccccCCcCcceee
Confidence 34555555555543
No 146
>KOG1280|consensus
Probab=29.80 E-value=47 Score=30.25 Aligned_cols=36 Identities=28% Similarity=0.622 Sum_probs=26.6
Q ss_pred cccccccCccCChhhHHHHHHhhcCCCCC-cccCcCc
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHTGSLS-YPCTQCP 361 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~-~~C~~C~ 361 (410)
|.|++|+..=.+...|..|+.+.|....+ -.|++|+
T Consensus 80 ftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 80 FTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred ccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 88888888888888888888888875432 2556655
No 147
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=29.16 E-value=21 Score=20.26 Aligned_cols=20 Identities=15% Similarity=0.185 Sum_probs=12.8
Q ss_pred CcccCCCCCCccccc-hhcee
Q psy12647 39 RRFPCEYCSLLVYPV-LILFR 58 (410)
Q Consensus 39 ~~~~C~~C~~~f~~~-l~~h~ 58 (410)
..+.|+.|+..+... ++.|+
T Consensus 3 ~~~~C~nC~R~v~a~RfA~HL 23 (33)
T PF08209_consen 3 PYVECPNCGRPVAASRFAPHL 23 (33)
T ss_dssp -EEE-TTTSSEEEGGGHHHHH
T ss_pred CeEECCCCcCCcchhhhHHHH
Confidence 357899999877655 55554
No 148
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=28.77 E-value=22 Score=27.97 Aligned_cols=8 Identities=25% Similarity=0.949 Sum_probs=4.5
Q ss_pred cccCcCcc
Q psy12647 355 YPCTQCPG 362 (410)
Q Consensus 355 ~~C~~C~~ 362 (410)
+.||.||.
T Consensus 108 ~~CP~Cgs 115 (135)
T PRK03824 108 LKCPKCGS 115 (135)
T ss_pred cCCcCCCC
Confidence 45666663
No 149
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=28.67 E-value=38 Score=21.63 Aligned_cols=9 Identities=56% Similarity=1.202 Sum_probs=4.6
Q ss_pred cccccCccC
Q psy12647 328 CHMCGHMFT 336 (410)
Q Consensus 328 C~~C~~~f~ 336 (410)
|..|++.|.
T Consensus 5 C~~C~~~F~ 13 (57)
T cd00065 5 CMGCGKPFT 13 (57)
T ss_pred CcccCcccc
Confidence 455555554
No 150
>KOG1842|consensus
Probab=28.39 E-value=31 Score=32.58 Aligned_cols=30 Identities=20% Similarity=0.247 Sum_probs=26.0
Q ss_pred CCcccccccCccCChhhHHHHHHhhcCCCC
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHLARLHTGSL 353 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 353 (410)
..|.|++|...|.+...|..|+...|.++.
T Consensus 14 egflCPiC~~dl~~~~~L~~H~d~eH~~ed 43 (505)
T KOG1842|consen 14 EGFLCPICLLDLPNLSALNDHLDVEHFEED 43 (505)
T ss_pred hcccCchHhhhhhhHHHHHHHHhhhccccc
Confidence 468999999999999999999988898763
No 151
>KOG4377|consensus
Probab=28.09 E-value=35 Score=31.78 Aligned_cols=25 Identities=28% Similarity=0.786 Sum_probs=20.1
Q ss_pred cccCc--CcccccCcHhHHHHHHhhcC
Q psy12647 355 YPCTQ--CPGKFYSAYELRQHVRQHTD 379 (410)
Q Consensus 355 ~~C~~--C~~~f~~~~~l~~H~~~h~~ 379 (410)
|.|.+ |+..+...+++..|.|.|..
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhhhh
Confidence 45654 88999999999999888753
No 152
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=28.06 E-value=38 Score=33.41 Aligned_cols=27 Identities=19% Similarity=0.428 Sum_probs=14.1
Q ss_pred cccccccCccCChhhHHHHHHhhcCCCCCcccCcCccc
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGK 363 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 363 (410)
.+|+.|+-.. ..|...+...|..||..
T Consensus 223 ~~C~~C~~~l-----------~~h~~~~~l~Ch~Cg~~ 249 (505)
T TIGR00595 223 LCCPNCDVSL-----------TYHKKEGKLRCHYCGYQ 249 (505)
T ss_pred cCCCCCCCce-----------EEecCCCeEEcCCCcCc
Confidence 6677776432 23334445566666643
No 153
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=27.84 E-value=1.3e+02 Score=18.25 Aligned_cols=10 Identities=30% Similarity=0.750 Sum_probs=6.6
Q ss_pred CcccCCCCCC
Q psy12647 39 RRFPCEYCSL 48 (410)
Q Consensus 39 ~~~~C~~C~~ 48 (410)
..+.|+.|+.
T Consensus 17 ~g~~CP~Cg~ 26 (46)
T PF12760_consen 17 DGFVCPHCGS 26 (46)
T ss_pred CCCCCCCCCC
Confidence 3367888864
No 154
>KOG3408|consensus
Probab=27.16 E-value=35 Score=25.85 Aligned_cols=24 Identities=25% Similarity=0.580 Sum_probs=15.7
Q ss_pred CCcccCcCcccccCcHhHHHHHHh
Q psy12647 353 LSYPCTQCPGKFYSAYELRQHVRQ 376 (410)
Q Consensus 353 ~~~~C~~C~~~f~~~~~l~~H~~~ 376 (410)
..|.|-.|.+-|.+..+|..|.++
T Consensus 56 GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 56 GQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred ceeehhhhhhhhcchHHHHHHHhc
Confidence 346677777777777777776653
No 155
>PRK14873 primosome assembly protein PriA; Provisional
Probab=27.12 E-value=37 Score=34.70 Aligned_cols=14 Identities=14% Similarity=0.285 Sum_probs=8.1
Q ss_pred cCCCCCcccCcCcc
Q psy12647 349 HTGSLSYPCTQCPG 362 (410)
Q Consensus 349 H~~~~~~~C~~C~~ 362 (410)
|...+...|.+||+
T Consensus 405 h~~~~~l~Ch~CG~ 418 (665)
T PRK14873 405 PSAGGTPRCRWCGR 418 (665)
T ss_pred ecCCCeeECCCCcC
Confidence 33445566777774
No 156
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=27.09 E-value=21 Score=20.70 Aligned_cols=9 Identities=22% Similarity=0.674 Sum_probs=4.1
Q ss_pred cCccccccc
Q psy12647 384 PCPVCQRTF 392 (410)
Q Consensus 384 ~C~~C~~~f 392 (410)
.|..||-.+
T Consensus 23 ~Cd~cg~~L 31 (36)
T PF05191_consen 23 VCDNCGGEL 31 (36)
T ss_dssp BCTTTTEBE
T ss_pred ccCCCCCee
Confidence 454454433
No 157
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=27.03 E-value=28 Score=20.52 Aligned_cols=14 Identities=29% Similarity=0.664 Sum_probs=10.8
Q ss_pred CccCcccccccccc
Q psy12647 382 PFPCPVCQRTFRRK 395 (410)
Q Consensus 382 ~~~C~~C~~~f~~~ 395 (410)
||+|..|+..|=.+
T Consensus 12 ~f~C~~C~~~FC~~ 25 (39)
T smart00154 12 GFKCRHCGNLFCGE 25 (39)
T ss_pred CeECCccCCccccc
Confidence 78888888888544
No 158
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=26.67 E-value=52 Score=19.50 Aligned_cols=14 Identities=36% Similarity=0.935 Sum_probs=10.1
Q ss_pred cceeecCccccccccc
Q psy12647 164 ERRYVCNHMQCTRRFH 179 (410)
Q Consensus 164 ~~~~~C~~~~C~~~f~ 179 (410)
...|.|. .|+..|.
T Consensus 26 T~fy~C~--~C~~~w~ 39 (40)
T smart00440 26 TVFYVCT--KCGHRWR 39 (40)
T ss_pred eEEEEeC--CCCCEeC
Confidence 3568888 8887764
No 159
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=26.56 E-value=38 Score=26.41 Aligned_cols=24 Identities=33% Similarity=0.678 Sum_probs=13.4
Q ss_pred CcccCcCcccccCcHhHHHHHHhhcCC
Q psy12647 354 SYPCTQCPGKFYSAYELRQHVRQHTDE 380 (410)
Q Consensus 354 ~~~C~~C~~~f~~~~~l~~H~~~h~~~ 380 (410)
-..|-+||+.|.. |.+|++.|+|.
T Consensus 72 ~i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 72 YIICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp -EE-TBT--EESB---HHHHHHHTT-S
T ss_pred eeEEccCCcccch---HHHHHHHccCC
Confidence 3568888887764 47788887664
No 161
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=26.52 E-value=42 Score=19.74 Aligned_cols=13 Identities=38% Similarity=0.907 Sum_probs=9.6
Q ss_pred cceeecCcccccccc
Q psy12647 164 ERRYVCNHMQCTRRF 178 (410)
Q Consensus 164 ~~~~~C~~~~C~~~f 178 (410)
...|.|. .|+..|
T Consensus 26 T~fy~C~--~C~~~w 38 (39)
T PF01096_consen 26 TLFYVCC--NCGHRW 38 (39)
T ss_dssp EEEEEES--SSTEEE
T ss_pred eEEEEeC--CCCCee
Confidence 3568888 888765
No 162
>KOG4124|consensus
Probab=26.47 E-value=22 Score=32.26 Aligned_cols=67 Identities=22% Similarity=0.556 Sum_probs=39.0
Q ss_pred CcccC--CcCcccccCcccccccccccccccccchh-hhhccccccCCCcceeecCcccccccccChhHHHHH
Q psy12647 118 ETFPC--KRCDERFSTRDGLESFTFSLRLFCLYFPQ-ELISHSSYHGGEERRYVCNHMQCTRRFHSASLLARH 187 (410)
Q Consensus 118 ~~~~C--~~C~~~f~~~~~l~~H~~~~~~~~~~~~~-~l~~h~~~~~~~~~~~~C~~~~C~~~f~~~~~l~~H 187 (410)
++|+| +.|.+..+...+|+.|....|-.....+. .-+-|..... ..++|.|+ +|.+.+.....|..|
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~-~nk~~r~~--i~~~~~k~~~~l~~~ 417 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVV-ENKPYRCE--VCSKRYKNLNGLKYH 417 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeee-ccCcccCh--hhhhhhccCCCCCce
Confidence 46777 67999999999999887665543333222 1222333222 23666666 666666655554443
No 163
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.99 E-value=26 Score=24.03 Aligned_cols=30 Identities=30% Similarity=0.790 Sum_probs=14.8
Q ss_pred cccccccCccCChhhHHHHHHhhcCCCCCcccCcCccc
Q psy12647 326 YTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGK 363 (410)
Q Consensus 326 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 363 (410)
|+|..|+..|. +.+++ +....-.|+.|+..
T Consensus 13 Y~c~~cg~~~d----vvq~~----~ddplt~ce~c~a~ 42 (82)
T COG2331 13 YECTECGNRFD----VVQAM----TDDPLTTCEECGAR 42 (82)
T ss_pred EeecccchHHH----HHHhc----ccCccccChhhChH
Confidence 66666665543 44444 12222356666653
No 164
>PRK04023 DNA polymerase II large subunit; Validated
Probab=25.98 E-value=61 Score=34.32 Aligned_cols=11 Identities=27% Similarity=0.467 Sum_probs=6.6
Q ss_pred ccCcccccccc
Q psy12647 383 FPCPVCQRTFR 393 (410)
Q Consensus 383 ~~C~~C~~~f~ 393 (410)
+.|+.||..-.
T Consensus 664 y~CPKCG~El~ 674 (1121)
T PRK04023 664 DECEKCGREPT 674 (1121)
T ss_pred CcCCCCCCCCC
Confidence 56777765443
No 165
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=25.60 E-value=27 Score=20.23 Aligned_cols=14 Identities=21% Similarity=0.624 Sum_probs=8.2
Q ss_pred hhcCCCCceecccc
Q psy12647 81 KIHLHPKTFSCNSC 94 (410)
Q Consensus 81 ~~h~~~~~~~C~~C 94 (410)
....|...|+|..|
T Consensus 22 ~~~~G~qryrC~~C 35 (36)
T PF03811_consen 22 KSPSGHQRYRCKDC 35 (36)
T ss_pred CCCCCCEeEecCcC
Confidence 33445556777766
No 166
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=25.38 E-value=32 Score=22.02 Aligned_cols=14 Identities=36% Similarity=1.018 Sum_probs=10.7
Q ss_pred CCCccchhhhhhcC
Q psy12647 252 FPCPVCQRTFRRKG 265 (410)
Q Consensus 252 ~~C~~C~~~f~~~~ 265 (410)
..|+.|+..|....
T Consensus 6 ~~C~~Cg~~~~~~d 19 (54)
T PF14446_consen 6 CKCPVCGKKFKDGD 19 (54)
T ss_pred ccChhhCCcccCCC
Confidence 46889998887654
No 167
>COG1773 Rubredoxin [Energy production and conversion]
Probab=25.09 E-value=30 Score=22.24 Aligned_cols=12 Identities=25% Similarity=0.850 Sum_probs=8.1
Q ss_pred CccCcccccccc
Q psy12647 382 PFPCPVCQRTFR 393 (410)
Q Consensus 382 ~~~C~~C~~~f~ 393 (410)
.|+|.+||+.|-
T Consensus 3 ~~~C~~CG~vYd 14 (55)
T COG1773 3 RWRCSVCGYVYD 14 (55)
T ss_pred ceEecCCceEec
Confidence 467777777664
No 168
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=24.88 E-value=71 Score=20.03 Aligned_cols=41 Identities=20% Similarity=0.494 Sum_probs=24.0
Q ss_pred CCccccCCCCcccCCHHHHHHHHHH-hcCCCCCCCccchhhh
Q psy12647 221 SLSYPCTQCPGKFYSAYELRQHVRQ-HTDERPFPCPVCQRTF 261 (410)
Q Consensus 221 ~~~~~C~~C~~~f~~~~~l~~H~~~-~~~~~~~~C~~C~~~f 261 (410)
++.+.|..|+..|.....=...... .....|-.|+.|...-
T Consensus 2 Dk~l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~CR~~r 43 (49)
T PF13451_consen 2 DKTLTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSCRQAR 43 (49)
T ss_pred CeeEEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHHHHHH
Confidence 4567788888877655443333322 2234567788876544
No 169
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=24.77 E-value=42 Score=22.54 Aligned_cols=11 Identities=45% Similarity=1.032 Sum_probs=5.9
Q ss_pred CCccCcccccc
Q psy12647 381 RPFPCPVCQRT 391 (410)
Q Consensus 381 ~~~~C~~C~~~ 391 (410)
+.|.|+.||..
T Consensus 45 r~~~C~~Cg~~ 55 (69)
T PF07282_consen 45 RVFTCPNCGFE 55 (69)
T ss_pred ceEEcCCCCCE
Confidence 44555555554
No 170
>KOG4727|consensus
Probab=24.29 E-value=35 Score=27.52 Aligned_cols=23 Identities=17% Similarity=0.403 Sum_probs=18.7
Q ss_pred CCcccCcCcccccCcHhHHHHHH
Q psy12647 353 LSYPCTQCPGKFYSAYELRQHVR 375 (410)
Q Consensus 353 ~~~~C~~C~~~f~~~~~l~~H~~ 375 (410)
..|.|.+|.-.|.+..++..|++
T Consensus 74 ~GyyCdVCdcvvKDSinflDHiN 96 (193)
T KOG4727|consen 74 GGYYCDVCDCVVKDSINFLDHIN 96 (193)
T ss_pred CceeeeecceeehhhHHHHHHhc
Confidence 34888888888888888888875
No 171
>KOG2071|consensus
Probab=22.85 E-value=72 Score=31.52 Aligned_cols=78 Identities=23% Similarity=0.356 Sum_probs=0.0
Q ss_pred cceeecCcccccccccChhHHHHHHhhCCCCcccccccccccChhHHHHHHhhhcCCCCccccCCCCcccCCHHHHH---
Q psy12647 164 ERRYVCNHMQCTRRFHSASLLARHVSTHAIQYTCHMCGHMFTCKRRLTQHLARLHTGSLSYPCTQCPGKFYSAYELR--- 240 (410)
Q Consensus 164 ~~~~~C~~~~C~~~f~~~~~l~~H~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~--- 240 (410)
+.+-+|. .||..|.+......|+..|. .+|++..-.....-+-..+...|.+.+.+.
T Consensus 416 ~~pnqC~--~CG~R~~~~ee~sk~md~H~------------------dwh~r~n~~~r~s~~~~~sR~Wf~s~s~W~~~K 475 (579)
T KOG2071|consen 416 DSPNQCK--SCGLRFDDSEERSKHMDIHD------------------DWHRRKNTTIRGSSKWQKSRSWFPSKSGWLAAK 475 (579)
T ss_pred CCcchhc--ccccccccchhhhhHhhhhh------------------hhhhhhhhhcccccccccccceeechhhhcccc
Q ss_pred ---------------------------HHHHHhcCCCCCCCccchhhh
Q psy12647 241 ---------------------------QHVRQHTDERPFPCPVCQRTF 261 (410)
Q Consensus 241 ---------------------------~H~~~~~~~~~~~C~~C~~~f 261 (410)
.=+....++....|++|+..|
T Consensus 476 ~~e~t~e~~~~~~~~~~~~~~~s~~~k~~~Vp~d~e~~~~C~IC~EkF 523 (579)
T KOG2071|consen 476 AGEETDEKEKVEHEELQIKKELSLRSKYELVPADSERQASCPICQEKF 523 (579)
T ss_pred ccccccccccccchhhcchhhhhhhccceecccCcccccCCccccccc
No 172
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=22.41 E-value=53 Score=31.67 Aligned_cols=29 Identities=17% Similarity=0.407 Sum_probs=24.9
Q ss_pred CCcccccccCccCChhhHHHHHHhhcCCC
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHLARLHTGS 352 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 352 (410)
+-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 45889999999999999999998888765
No 173
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=21.79 E-value=50 Score=25.08 Aligned_cols=28 Identities=18% Similarity=0.568 Sum_probs=20.3
Q ss_pred CCceecccccccccCchhHHHHHHHhhhcCCCCcccCCcCccc
Q psy12647 86 PKTFSCNSCGQTWLGYTSWKQHVAACREKRRKETFPCKRCDER 128 (410)
Q Consensus 86 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~C~~C~~~ 128 (410)
+-...|..|+..|..... .+.||.||..
T Consensus 68 p~~~~C~~Cg~~~~~~~~---------------~~~CP~Cgs~ 95 (115)
T TIGR00100 68 PVECECEDCSEEVSPEID---------------LYRCPKCHGI 95 (115)
T ss_pred CcEEEcccCCCEEecCCc---------------CccCcCCcCC
Confidence 345999999988865321 3679999854
No 174
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=21.66 E-value=35 Score=19.51 Aligned_cols=18 Identities=17% Similarity=0.218 Sum_probs=9.3
Q ss_pred cccCCCCCCccccc-hhce
Q psy12647 40 RFPCEYCSLLVYPV-LILF 57 (410)
Q Consensus 40 ~~~C~~C~~~f~~~-l~~h 57 (410)
...|..|++.|.++ |..+
T Consensus 3 ~~~C~eC~~~f~dSyL~~~ 21 (34)
T PF01286_consen 3 YPKCDECGKPFMDSYLLNN 21 (34)
T ss_dssp -EE-TTT--EES-SSCCCC
T ss_pred CchHhHhCCHHHHHHHHHh
Confidence 34799999998877 5443
No 175
>KOG1701|consensus
Probab=21.55 E-value=12 Score=34.90 Aligned_cols=73 Identities=12% Similarity=0.246 Sum_probs=36.0
Q ss_pred CCcccccccCccCChhhHHHHHHhhcCCC---CCc--ccCcCcccccCcHhHHHHHHhhcCC-----CCccCcccccccc
Q psy12647 324 IQYTCHMCGHMFTCKRRLTQHLARLHTGS---LSY--PCTQCPGKFYSAYELRQHVRQHTDE-----RPFPCPVCQRTFR 393 (410)
Q Consensus 324 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~---~~~--~C~~C~~~f~~~~~l~~H~~~h~~~-----~~~~C~~C~~~f~ 393 (410)
.=|.|-.|.+..........-...+|-.. +.| +|.+|+....-...-.+=+|+-.-+ .-|+|..|+.-.+
T Consensus 359 ~CF~Cv~C~r~ldgipFtvd~~n~v~Cv~dfh~kfAPrCs~C~~PI~P~~G~~etvRvvamdr~fHv~CY~CEDCg~~LS 438 (468)
T KOG1701|consen 359 GCFTCVVCARCLDGIPFTVDSQNNVYCVPDFHKKFAPRCSVCGNPILPRDGKDETVRVVAMDRDFHVNCYKCEDCGLLLS 438 (468)
T ss_pred CceEEEEeccccCCccccccCCCceeeehhhhhhcCcchhhccCCccCCCCCcceEEEEEccccccccceehhhcCcccc
Confidence 45788888776554432111111222111 112 6888886543332222222221111 2389999998887
Q ss_pred ccc
Q psy12647 394 RKV 396 (410)
Q Consensus 394 ~~~ 396 (410)
.+.
T Consensus 439 ~e~ 441 (468)
T KOG1701|consen 439 SEE 441 (468)
T ss_pred ccC
Confidence 554
No 176
>KOG3408|consensus
Probab=21.00 E-value=84 Score=23.89 Aligned_cols=29 Identities=28% Similarity=0.477 Sum_probs=19.3
Q ss_pred cCCCcceeecCcccccccccChhHHHHHHhh
Q psy12647 160 HGGEERRYVCNHMQCTRRFHSASLLARHVST 190 (410)
Q Consensus 160 ~~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 190 (410)
..++...|+|- .|.+-|.+...|..|.++
T Consensus 51 dlPG~GqfyCi--~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 51 DLPGGGQFYCI--ECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCCceeehh--hhhhhhcchHHHHHHHhc
Confidence 33344567777 777777777777777653
No 177
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=20.82 E-value=61 Score=31.31 Aligned_cols=27 Identities=19% Similarity=0.498 Sum_probs=18.8
Q ss_pred cccccccccccChhHHHHHHhhhcCCC
Q psy12647 195 YTCHMCGHMFTCKRRLTQHLARLHTGS 221 (410)
Q Consensus 195 ~~C~~C~~~f~~~~~l~~H~~~~H~~~ 221 (410)
..|+.|++.|.+...+..|+...|.+.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 357777777777777777777777653
No 178
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=20.51 E-value=75 Score=20.47 Aligned_cols=16 Identities=13% Similarity=0.430 Sum_probs=9.9
Q ss_pred CCCCCcccCcCccccc
Q psy12647 350 TGSLSYPCTQCPGKFY 365 (410)
Q Consensus 350 ~~~~~~~C~~C~~~f~ 365 (410)
.+.-.|.|+.||..+-
T Consensus 10 ~~~v~~~Cp~cGipth 25 (55)
T PF13824_consen 10 PAHVNFECPDCGIPTH 25 (55)
T ss_pred ccccCCcCCCCCCcCc
Confidence 3445577777776543
No 179
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=20.23 E-value=48 Score=19.66 Aligned_cols=17 Identities=35% Similarity=0.729 Sum_probs=8.5
Q ss_pred cccCcCcccccCcHhHH
Q psy12647 355 YPCTQCPGKFYSAYELR 371 (410)
Q Consensus 355 ~~C~~C~~~f~~~~~l~ 371 (410)
+.|+.|+-.+-+...|.
T Consensus 20 d~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 20 DVCPSCGGIWFDAGELE 36 (41)
T ss_pred EECCCCCeEEccHHHHH
Confidence 44555555555444443
Done!