Query psy10074
Match_columns 420
No_of_seqs 211 out of 2516
Neff 10.3
Searched_HMMs 46136
Date Fri Aug 16 17:17:01 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10074.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10074hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.5E-28 3.3E-33 233.1 8.5 206 169-405 603-934 (958)
2 KOG3608|consensus 99.9 3.6E-27 7.9E-32 202.7 8.9 229 125-404 135-379 (467)
3 KOG2462|consensus 99.9 5.8E-26 1.3E-30 190.9 5.9 135 255-398 127-264 (279)
4 KOG2462|consensus 99.9 2.3E-25 4.9E-30 187.3 4.5 130 284-419 127-256 (279)
5 KOG3608|consensus 99.9 2.4E-24 5.3E-29 185.4 9.2 238 171-418 69-364 (467)
6 KOG1074|consensus 99.9 1.1E-24 2.3E-29 207.2 4.1 109 259-369 606-730 (958)
7 KOG3623|consensus 99.9 3E-22 6.4E-27 187.5 8.8 111 13-137 209-322 (1007)
8 KOG3623|consensus 99.8 3.5E-21 7.6E-26 180.4 5.3 68 349-418 895-962 (1007)
9 KOG3576|consensus 99.6 8.5E-17 1.8E-21 128.6 1.3 118 283-405 113-240 (267)
10 KOG3576|consensus 99.5 1.8E-15 4E-20 121.0 2.4 125 119-282 112-236 (267)
11 PLN03086 PRLI-interacting fact 99.2 3.4E-11 7.4E-16 115.0 6.0 136 258-419 407-554 (567)
12 PLN03086 PRLI-interacting fact 99.1 2.1E-10 4.6E-15 109.6 7.9 121 258-402 433-565 (567)
13 PHA00733 hypothetical protein 99.0 1.7E-10 3.7E-15 89.8 3.5 82 318-402 38-124 (128)
14 PHA00733 hypothetical protein 98.9 2.4E-09 5.2E-14 83.4 5.4 85 283-373 36-124 (128)
15 PHA02768 hypothetical protein; 98.7 1.1E-08 2.3E-13 65.1 1.9 44 348-394 5-48 (55)
16 KOG3993|consensus 98.7 4.6E-09 9.9E-14 94.1 0.0 145 257-404 266-485 (500)
17 KOG3993|consensus 98.6 9.3E-09 2E-13 92.1 1.6 202 169-374 265-484 (500)
18 PHA02768 hypothetical protein; 98.6 2.1E-08 4.6E-13 63.8 1.5 44 320-365 5-48 (55)
19 KOG1146|consensus 98.4 2.2E-07 4.7E-12 95.1 3.3 93 13-114 464-613 (1406)
20 PF13465 zf-H2C2_2: Zinc-finge 98.3 2.8E-07 6E-12 50.0 1.9 24 364-388 2-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.3 4.9E-07 1.1E-11 49.0 2.2 24 336-359 2-25 (26)
22 PF05605 zf-Di19: Drought indu 98.2 9.8E-07 2.1E-11 57.6 3.1 52 348-402 2-54 (54)
23 PHA00732 hypothetical protein 98.1 1.8E-06 4E-11 60.7 2.7 44 348-397 1-44 (79)
24 PHA00616 hypothetical protein 98.0 1.8E-06 4E-11 52.2 1.3 21 349-369 2-22 (44)
25 PHA00732 hypothetical protein 97.9 6.8E-06 1.5E-10 57.8 2.0 39 377-418 1-39 (79)
26 PF05605 zf-Di19: Drought indu 97.9 1.3E-05 2.9E-10 52.3 3.1 49 321-372 3-53 (54)
27 PHA00616 hypothetical protein 97.9 7.1E-06 1.5E-10 49.6 1.6 37 171-207 1-37 (44)
28 KOG1146|consensus 97.7 1.4E-05 3E-10 82.3 1.4 87 17-113 439-541 (1406)
29 PF00096 zf-C2H2: Zinc finger, 97.6 5.9E-05 1.3E-09 39.5 2.3 22 172-193 1-22 (23)
30 PF00096 zf-C2H2: Zinc finger, 97.5 8.3E-05 1.8E-09 39.0 1.9 21 349-369 1-21 (23)
31 PF12756 zf-C2H2_2: C2H2 type 97.4 0.00012 2.6E-09 55.0 2.6 71 290-370 2-72 (100)
32 PF13894 zf-C2H2_4: C2H2-type 97.4 0.00018 3.9E-09 38.0 2.5 21 349-369 1-21 (24)
33 PF12756 zf-C2H2_2: C2H2 type 97.3 8.2E-05 1.8E-09 55.9 1.3 75 322-402 1-75 (100)
34 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00021 4.7E-09 37.7 2.2 23 172-194 1-23 (24)
35 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00018 3.9E-09 39.3 1.2 20 378-397 2-21 (27)
36 COG5189 SFP1 Putative transcri 97.0 0.00029 6.3E-09 61.4 1.7 55 345-399 346-420 (423)
37 PF13912 zf-C2H2_6: C2H2-type 96.9 0.00075 1.6E-08 36.8 2.2 23 200-222 2-24 (27)
38 COG5189 SFP1 Putative transcri 96.9 0.00034 7.3E-09 61.1 0.8 72 284-369 346-419 (423)
39 PF09237 GAGA: GAGA factor; I 96.2 0.0013 2.7E-08 40.8 0.1 28 346-373 22-49 (54)
40 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0038 8.3E-08 32.9 1.9 23 200-223 1-23 (24)
41 COG5236 Uncharacterized conser 96.1 0.0037 8E-08 55.4 2.4 137 50-225 151-307 (493)
42 smart00355 ZnF_C2H2 zinc finge 96.0 0.0061 1.3E-07 32.5 2.3 23 172-194 1-23 (26)
43 PF09237 GAGA: GAGA factor; I 96.0 0.0057 1.2E-07 38.0 2.3 31 318-348 22-52 (54)
44 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0063 1.4E-07 32.0 2.2 24 172-196 1-24 (24)
45 smart00355 ZnF_C2H2 zinc finge 96.0 0.007 1.5E-07 32.2 2.3 20 350-369 2-21 (26)
46 KOG2231|consensus 95.8 0.0064 1.4E-07 59.9 2.8 134 52-220 117-260 (669)
47 KOG2482|consensus 95.7 0.015 3.3E-07 51.6 4.6 55 139-193 147-217 (423)
48 PF12874 zf-met: Zinc-finger o 95.7 0.0066 1.4E-07 32.3 1.5 20 173-192 2-21 (25)
49 PF12874 zf-met: Zinc-finger o 95.7 0.009 1.9E-07 31.8 2.0 23 200-222 1-23 (25)
50 KOG2482|consensus 95.6 0.019 4.1E-07 51.0 4.6 155 245-399 131-356 (423)
51 KOG2785|consensus 95.5 0.045 9.8E-07 49.7 6.8 53 348-400 166-243 (390)
52 PF12171 zf-C2H2_jaz: Zinc-fin 95.2 0.017 3.6E-07 31.4 2.0 22 378-399 2-23 (27)
53 PF12171 zf-C2H2_jaz: Zinc-fin 95.0 0.018 3.9E-07 31.3 1.7 22 321-342 2-23 (27)
54 PRK04860 hypothetical protein; 94.6 0.029 6.3E-07 45.6 2.7 36 348-388 119-154 (160)
55 KOG2231|consensus 94.5 0.051 1.1E-06 53.8 4.7 112 260-384 117-245 (669)
56 PRK04860 hypothetical protein; 94.5 0.034 7.4E-07 45.2 2.9 39 170-212 118-156 (160)
57 KOG4173|consensus 94.2 0.012 2.5E-07 48.4 -0.3 79 286-372 78-170 (253)
58 KOG2785|consensus 93.6 0.13 2.8E-06 46.9 5.0 25 169-193 66-90 (390)
59 KOG2893|consensus 92.7 0.053 1.2E-06 45.7 1.2 46 17-74 13-59 (341)
60 COG5048 FOG: Zn-finger [Genera 92.6 0.072 1.6E-06 51.6 2.3 61 14-82 289-355 (467)
61 PF13913 zf-C2HC_2: zinc-finge 92.5 0.11 2.4E-06 27.5 1.9 20 349-369 3-22 (25)
62 PF12013 DUF3505: Protein of u 92.1 0.16 3.5E-06 38.6 3.2 25 378-402 81-109 (109)
63 COG5236 Uncharacterized conser 91.9 0.48 1E-05 42.5 6.1 139 93-253 157-305 (493)
64 KOG4173|consensus 91.8 0.07 1.5E-06 44.0 0.9 84 257-344 78-170 (253)
65 PF13913 zf-C2HC_2: zinc-finge 91.3 0.19 4.2E-06 26.6 2.0 18 173-191 4-21 (25)
66 cd00350 rubredoxin_like Rubred 90.5 0.19 4E-06 28.8 1.6 10 321-330 2-11 (33)
67 smart00451 ZnF_U1 U1-like zinc 89.8 0.24 5.1E-06 28.6 1.7 22 348-369 3-24 (35)
68 smart00451 ZnF_U1 U1-like zinc 89.7 0.28 6.1E-06 28.3 2.0 23 320-342 3-25 (35)
69 TIGR00622 ssl1 transcription f 89.1 0.66 1.4E-05 34.8 3.9 21 257-277 14-34 (112)
70 KOG2893|consensus 89.1 0.13 2.8E-06 43.4 0.2 46 351-401 13-58 (341)
71 COG4049 Uncharacterized protei 88.4 0.13 2.9E-06 32.5 -0.1 26 377-402 17-42 (65)
72 COG4049 Uncharacterized protei 87.6 0.3 6.5E-06 30.9 1.1 30 344-373 13-42 (65)
73 PF12013 DUF3505: Protein of u 86.9 1.3 2.7E-05 33.7 4.5 27 347-373 79-109 (109)
74 COG5048 FOG: Zn-finger [Genera 86.7 0.25 5.5E-06 47.8 0.7 64 170-233 288-357 (467)
75 TIGR00622 ssl1 transcription f 84.9 1.3 2.8E-05 33.3 3.5 24 171-194 81-104 (112)
76 cd00729 rubredoxin_SM Rubredox 83.3 0.85 1.8E-05 26.2 1.5 10 349-358 3-12 (34)
77 TIGR02098 MJ0042_CXXC MJ0042 f 82.1 1.1 2.3E-05 26.5 1.8 8 351-358 5-12 (38)
78 PF09986 DUF2225: Uncharacteri 81.7 0.52 1.1E-05 40.6 0.4 13 349-361 49-61 (214)
79 COG5151 SSL1 RNA polymerase II 81.4 3.5 7.6E-05 36.6 5.3 96 77-194 307-411 (421)
80 PF09986 DUF2225: Uncharacteri 81.1 0.32 7E-06 41.9 -1.1 12 378-389 49-60 (214)
81 PF09538 FYDLN_acid: Protein o 80.6 0.87 1.9E-05 34.3 1.2 14 347-360 25-38 (108)
82 COG2888 Predicted Zn-ribbon RN 79.3 1.3 2.9E-05 28.7 1.5 8 377-384 50-57 (61)
83 PF10571 UPF0547: Uncharacteri 77.8 1.3 2.9E-05 23.6 1.0 11 172-182 15-25 (26)
84 PF13719 zinc_ribbon_5: zinc-r 76.9 2 4.4E-05 25.2 1.8 6 379-384 27-32 (37)
85 PHA00626 hypothetical protein 74.9 1.7 3.6E-05 27.8 1.0 12 171-182 23-34 (59)
86 TIGR00373 conserved hypothetic 74.8 2.7 5.9E-05 34.2 2.6 16 321-336 110-125 (158)
87 smart00834 CxxC_CXXC_SSSS Puta 73.8 1.3 2.8E-05 26.6 0.4 12 15-26 6-17 (41)
88 PF02892 zf-BED: BED zinc fing 73.5 1.2 2.7E-05 27.3 0.3 23 347-369 15-41 (45)
89 PF09538 FYDLN_acid: Protein o 72.9 1.8 3.9E-05 32.6 1.1 30 349-390 10-39 (108)
90 PF09723 Zn-ribbon_8: Zinc rib 72.6 1.4 3E-05 26.8 0.3 12 15-26 6-17 (42)
91 PF12907 zf-met2: Zinc-binding 72.5 1.4 2.9E-05 26.4 0.3 25 379-403 3-30 (40)
92 smart00531 TFIIE Transcription 72.0 4.3 9.4E-05 32.7 3.2 13 320-332 99-111 (147)
93 PRK14890 putative Zn-ribbon RN 71.8 1.9 4.1E-05 28.1 0.8 10 376-385 47-56 (59)
94 KOG2186|consensus 71.3 1.8 3.8E-05 37.4 0.8 46 15-71 4-49 (276)
95 PF12907 zf-met2: Zinc-binding 70.7 2.2 4.9E-05 25.5 0.9 32 349-380 2-36 (40)
96 smart00734 ZnF_Rad18 Rad18-lik 69.8 4.5 9.8E-05 21.6 1.9 19 350-369 3-21 (26)
97 TIGR02605 CxxC_CxxC_SSSS putat 69.6 1.8 4E-05 27.6 0.4 12 15-26 6-17 (52)
98 PRK00464 nrdR transcriptional 69.5 1.1 2.4E-05 36.2 -0.8 23 49-71 27-49 (154)
99 PRK00398 rpoP DNA-directed RNA 68.7 1.9 4.1E-05 26.7 0.3 13 14-26 3-15 (46)
100 PF02176 zf-TRAF: TRAF-type zi 68.5 3.6 7.9E-05 27.0 1.7 46 171-217 9-59 (60)
101 PRK06266 transcription initiat 68.4 4.1 9E-05 33.9 2.4 17 320-336 117-133 (178)
102 PRK00464 nrdR transcriptional 67.9 1.6 3.4E-05 35.3 -0.2 18 321-338 29-46 (154)
103 PF06524 NOA36: NOA36 protein; 67.4 3.1 6.8E-05 36.0 1.4 82 47-137 139-222 (314)
104 TIGR00373 conserved hypothetic 66.3 3.5 7.6E-05 33.6 1.5 34 344-387 105-138 (158)
105 smart00659 RPOLCX RNA polymera 65.4 5.1 0.00011 24.6 1.7 27 171-208 2-28 (44)
106 smart00531 TFIIE Transcription 64.7 3.2 6.9E-05 33.4 1.0 12 15-26 100-111 (147)
107 TIGR02300 FYDLN_acid conserved 63.7 4.1 8.9E-05 31.3 1.3 13 347-359 25-37 (129)
108 PF05290 Baculo_IE-1: Baculovi 62.8 6 0.00013 30.6 2.1 55 318-391 78-135 (140)
109 PRK06266 transcription initiat 62.1 4.3 9.4E-05 33.8 1.4 15 348-362 117-131 (178)
110 COG1592 Rubrerythrin [Energy p 62.0 5.4 0.00012 32.6 1.8 24 320-356 134-157 (166)
111 PRK09678 DNA-binding transcrip 61.3 1.9 4.1E-05 29.7 -0.8 44 15-68 2-47 (72)
112 COG1997 RPL43A Ribosomal prote 61.0 4.2 9.2E-05 28.8 0.9 32 13-62 34-65 (89)
113 COG1198 PriA Primosomal protei 60.7 5.4 0.00012 41.0 1.9 49 125-208 436-484 (730)
114 PF04959 ARS2: Arsenite-resist 58.7 3.2 6.9E-05 35.5 0.0 25 378-402 78-102 (214)
115 smart00614 ZnF_BED BED zinc fi 58.5 9.6 0.00021 24.0 2.2 23 350-372 20-47 (50)
116 PF14803 Nudix_N_2: Nudix N-te 58.4 4.4 9.5E-05 23.3 0.5 10 171-180 22-31 (34)
117 KOG2272|consensus 58.0 19 0.0004 31.2 4.4 16 88-103 98-113 (332)
118 PF00301 Rubredoxin: Rubredoxi 57.0 6.6 0.00014 24.5 1.2 11 288-298 2-12 (47)
119 COG1996 RPC10 DNA-directed RNA 56.9 7.4 0.00016 24.4 1.4 11 320-330 6-16 (49)
120 PF13240 zinc_ribbon_2: zinc-r 56.1 6.9 0.00015 20.2 1.0 6 174-179 16-21 (23)
121 PF08274 PhnA_Zn_Ribbon: PhnA 55.2 5.7 0.00012 22.1 0.6 9 348-356 19-27 (30)
122 COG5151 SSL1 RNA polymerase II 54.3 11 0.00024 33.6 2.6 20 348-367 388-407 (421)
123 TIGR02300 FYDLN_acid conserved 53.7 7.2 0.00016 30.0 1.2 33 349-393 10-42 (129)
124 PF13878 zf-C2H2_3: zinc-finge 53.6 13 0.00029 22.3 2.1 25 171-195 13-39 (41)
125 PF13248 zf-ribbon_3: zinc-rib 53.5 8 0.00017 20.5 1.0 7 126-132 4-10 (26)
126 KOG4124|consensus 52.2 5.2 0.00011 36.3 0.3 117 284-401 175-422 (442)
127 KOG2593|consensus 51.9 6.9 0.00015 36.8 1.0 35 50-87 128-162 (436)
128 KOG2272|consensus 51.6 15 0.00033 31.7 2.9 20 193-212 215-234 (332)
129 PF06524 NOA36: NOA36 protein; 51.3 5.5 0.00012 34.6 0.2 18 119-136 33-50 (314)
130 COG1592 Rubrerythrin [Energy p 51.1 9.5 0.00021 31.2 1.6 25 171-208 134-158 (166)
131 cd00730 rubredoxin Rubredoxin; 49.4 8.9 0.00019 24.3 0.9 9 289-297 3-11 (50)
132 KOG2186|consensus 49.0 14 0.00031 32.0 2.4 50 171-223 3-52 (276)
133 PF14353 CpXC: CpXC protein 48.1 3.1 6.7E-05 32.6 -1.6 22 50-71 38-59 (128)
134 PRK14714 DNA polymerase II lar 46.6 17 0.00037 39.4 2.9 22 125-146 668-689 (1337)
135 KOG2807|consensus 46.2 23 0.0005 32.0 3.3 20 348-367 345-364 (378)
136 smart00154 ZnF_AN1 AN1-like Zi 45.0 8.8 0.00019 22.8 0.4 15 406-420 12-26 (39)
137 PF08271 TF_Zn_Ribbon: TFIIB z 44.6 9.5 0.00021 23.2 0.5 15 47-61 16-30 (43)
138 PRK00432 30S ribosomal protein 43.6 11 0.00024 23.9 0.7 13 49-61 36-48 (50)
139 PF04959 ARS2: Arsenite-resist 43.5 18 0.00039 31.0 2.2 26 348-373 77-102 (214)
140 PF07754 DUF1610: Domain of un 43.4 13 0.00028 19.5 0.8 8 348-355 16-23 (24)
141 PF03604 DNA_RNApol_7kD: DNA d 41.9 11 0.00024 21.3 0.4 10 171-180 17-26 (32)
142 KOG2593|consensus 41.5 21 0.00046 33.7 2.5 17 347-363 127-143 (436)
143 TIGR00595 priA primosomal prot 41.0 16 0.00036 36.1 1.8 49 125-208 214-262 (505)
144 COG4306 Uncharacterized protei 38.9 20 0.00043 27.3 1.5 39 378-416 40-78 (160)
145 COG4896 Uncharacterized protei 38.9 16 0.00034 23.9 0.9 39 16-62 4-43 (68)
146 PF04810 zf-Sec23_Sec24: Sec23 38.3 17 0.00037 21.7 0.9 32 14-59 2-33 (40)
147 KOG1280|consensus 38.3 22 0.00048 32.4 2.0 40 345-384 76-116 (381)
148 PF08790 zf-LYAR: LYAR-type C2 38.0 13 0.00028 20.3 0.3 9 52-60 2-10 (28)
149 PTZ00255 60S ribosomal protein 38.0 6.3 0.00014 28.4 -1.2 14 48-61 52-65 (90)
150 PF03811 Zn_Tnp_IS1: InsA N-te 37.0 7.3 0.00016 22.7 -0.8 15 42-56 21-35 (36)
151 PF15269 zf-C2H2_7: Zinc-finge 36.7 26 0.00057 21.3 1.5 21 200-220 21-41 (54)
152 PF01927 Mut7-C: Mut7-C RNAse 35.6 23 0.00051 28.4 1.6 10 172-181 125-134 (147)
153 PF04780 DUF629: Protein of un 35.5 27 0.00058 33.8 2.2 29 347-375 56-84 (466)
154 COG4888 Uncharacterized Zn rib 34.9 9.4 0.0002 27.9 -0.7 11 407-417 47-57 (104)
155 KOG1701|consensus 34.5 9.8 0.00021 35.7 -0.8 47 50-103 302-348 (468)
156 PF05443 ROS_MUCR: ROS/MUCR tr 34.4 16 0.00035 28.6 0.5 28 48-78 70-97 (132)
157 smart00661 RPOL9 RNA polymeras 33.8 23 0.0005 22.3 1.1 12 171-182 20-31 (52)
158 PF01428 zf-AN1: AN1-like Zinc 33.7 11 0.00025 22.9 -0.4 15 406-420 13-27 (43)
159 KOG4167|consensus 33.6 12 0.00025 37.8 -0.5 28 48-75 790-817 (907)
160 KOG1280|consensus 33.6 37 0.00079 31.1 2.6 38 318-355 77-116 (381)
161 PF05495 zf-CHY: CHY zinc fing 33.5 17 0.00037 25.0 0.5 8 289-296 12-19 (71)
162 TIGR01206 lysW lysine biosynth 33.3 21 0.00045 23.1 0.8 8 350-357 24-31 (54)
163 KOG2907|consensus 32.8 21 0.00047 26.7 0.9 11 378-388 103-113 (116)
164 PF01363 FYVE: FYVE zinc finge 32.6 18 0.0004 24.5 0.5 9 350-358 11-19 (69)
165 COG0068 HypF Hydrogenase matur 32.5 14 0.00031 37.3 -0.1 79 90-207 102-181 (750)
166 COG4530 Uncharacterized protei 32.5 22 0.00048 26.4 0.9 12 376-387 25-36 (129)
167 PRK03564 formate dehydrogenase 32.3 32 0.0007 31.4 2.1 14 344-357 208-221 (309)
168 PRK14873 primosome assembly pr 32.2 24 0.00051 36.3 1.4 46 125-206 384-429 (665)
169 PF05129 Elf1: Transcription e 32.0 8.9 0.00019 27.2 -1.2 14 49-62 45-58 (81)
170 COG3091 SprT Zn-dependent meta 31.9 18 0.00038 28.9 0.3 34 13-59 116-149 (156)
171 PF09845 DUF2072: Zn-ribbon co 31.6 31 0.00066 26.9 1.6 9 351-359 4-12 (131)
172 PF04780 DUF629: Protein of un 31.4 20 0.00042 34.7 0.6 33 372-404 52-84 (466)
173 COG3364 Zn-ribbon containing p 31.2 25 0.00055 25.8 1.0 13 14-26 2-14 (112)
174 PRK14714 DNA polymerase II lar 30.9 32 0.0007 37.4 2.1 11 378-388 1254-1264(1337)
175 PF12773 DZR: Double zinc ribb 30.7 35 0.00077 21.3 1.5 9 172-180 30-38 (50)
176 PRK04023 DNA polymerase II lar 30.3 39 0.00085 35.9 2.5 23 125-147 652-674 (1121)
177 PF04606 Ogr_Delta: Ogr/Delta- 30.2 9.6 0.00021 23.7 -1.1 19 48-66 23-43 (47)
178 KOG2807|consensus 29.7 70 0.0015 29.1 3.6 19 171-189 290-308 (378)
179 PF08209 Sgf11: Sgf11 (transcr 29.2 58 0.0012 18.6 2.0 21 348-369 4-24 (33)
180 COG1655 Uncharacterized protei 29.0 17 0.00036 31.2 -0.3 39 319-357 18-71 (267)
181 PF07975 C1_4: TFIIH C1-like d 28.3 36 0.00078 21.7 1.2 24 320-343 21-44 (51)
182 PF07649 C1_3: C1-like domain; 28.2 31 0.00067 18.9 0.8 11 171-181 15-25 (30)
183 PF13453 zf-TFIIB: Transcripti 28.1 35 0.00075 20.4 1.1 18 171-188 19-36 (41)
184 KOG1044|consensus 28.0 30 0.00065 34.0 1.2 37 13-59 15-51 (670)
185 PRK12496 hypothetical protein; 27.9 29 0.00063 28.5 1.0 22 125-146 128-153 (164)
186 PF07503 zf-HYPF: HypF finger; 27.6 45 0.00098 19.3 1.4 31 92-135 2-32 (35)
187 PRK14892 putative transcriptio 27.4 19 0.00041 26.6 -0.2 12 378-389 43-54 (99)
188 PRK05580 primosome assembly pr 27.4 36 0.00077 35.3 1.7 49 125-208 382-430 (679)
189 PRK14559 putative protein seri 26.9 38 0.00082 34.6 1.7 22 127-148 30-53 (645)
190 PRK03824 hypA hydrogenase nick 26.8 32 0.0007 27.1 1.0 11 288-298 71-81 (135)
191 TIGR03826 YvyF flagellar opero 26.5 18 0.00039 28.6 -0.4 21 126-146 5-28 (137)
192 PF13824 zf-Mss51: Zinc-finger 26.4 32 0.00069 22.3 0.7 12 171-182 14-25 (55)
193 PF05191 ADK_lid: Adenylate ki 26.3 38 0.00083 19.7 1.0 9 322-330 3-11 (36)
194 PF14446 Prok-RING_1: Prokaryo 26.1 33 0.00073 22.1 0.8 9 126-134 7-15 (54)
195 TIGR01562 FdhE formate dehydro 25.9 51 0.0011 30.1 2.2 14 344-357 206-219 (305)
196 PF13451 zf-trcl: Probable zin 25.4 46 0.001 21.0 1.3 16 348-363 4-19 (49)
197 PF07282 OrfB_Zn_ribbon: Putat 25.4 52 0.0011 22.2 1.7 9 349-357 47-55 (69)
198 PF01096 TFIIS_C: Transcriptio 24.8 9.4 0.0002 22.7 -1.8 7 321-327 29-35 (39)
199 COG2331 Uncharacterized protei 24.6 32 0.00069 23.8 0.5 10 349-358 13-22 (82)
200 KOG1842|consensus 24.5 39 0.00085 32.1 1.2 24 321-344 16-39 (505)
201 cd00065 FYVE FYVE domain; Zinc 24.5 44 0.00095 21.4 1.2 27 350-389 4-30 (57)
202 PF08792 A2L_zn_ribbon: A2L zi 24.3 27 0.00058 19.9 0.1 10 125-134 22-31 (33)
203 KOG4167|consensus 24.0 23 0.0005 35.8 -0.3 26 199-224 792-817 (907)
204 PF09963 DUF2197: Uncharacteri 23.7 35 0.00075 22.2 0.5 13 47-59 28-40 (56)
205 smart00064 FYVE Protein presen 23.7 44 0.00095 22.5 1.1 28 349-389 11-38 (68)
206 COG1998 RPS31 Ribosomal protei 23.6 34 0.00073 21.5 0.4 11 49-59 36-46 (51)
207 PF01780 Ribosomal_L37ae: Ribo 22.3 24 0.00053 25.5 -0.4 12 348-359 53-64 (90)
208 PF04438 zf-HIT: HIT zinc fing 22.0 29 0.00062 19.3 -0.1 18 81-98 5-22 (30)
209 TIGR00280 L37a ribosomal prote 21.9 35 0.00075 24.7 0.3 12 348-359 53-64 (91)
210 COG0068 HypF Hydrogenase matur 21.7 36 0.00078 34.6 0.5 60 51-134 124-183 (750)
211 COG1773 Rubredoxin [Energy pro 21.5 34 0.00074 22.1 0.2 14 50-63 3-16 (55)
212 TIGR00244 transcriptional regu 21.3 22 0.00048 28.3 -0.9 22 48-69 26-47 (147)
213 COG1327 Predicted transcriptio 20.8 24 0.00052 28.1 -0.7 22 48-69 26-47 (156)
214 PF14311 DUF4379: Domain of un 20.7 80 0.0017 20.2 1.8 12 349-360 29-40 (55)
215 COG1656 Uncharacterized conser 20.7 63 0.0014 26.3 1.6 11 200-210 131-141 (165)
216 PRK04023 DNA polymerase II lar 20.7 66 0.0014 34.3 2.1 22 124-145 626-647 (1121)
217 PRK12380 hydrogenase nickel in 20.6 56 0.0012 24.9 1.2 11 321-331 71-81 (113)
218 TIGR00100 hypA hydrogenase nic 20.4 53 0.0012 25.1 1.1 11 321-331 71-81 (115)
219 KOG3408|consensus 20.3 71 0.0015 24.4 1.6 23 199-221 57-79 (129)
220 KOG0717|consensus 20.2 45 0.00097 32.0 0.8 19 15-34 293-311 (508)
221 PF10083 DUF2321: Uncharacteri 20.0 36 0.00077 27.3 0.1 52 126-184 30-81 (158)
No 1
>KOG1074|consensus
Probab=99.95 E-value=1.5e-28 Score=233.15 Aligned_cols=206 Identities=27% Similarity=0.482 Sum_probs=149.8
Q ss_pred CcceeccccCccccChhHHHHHHHHhcCCCceecccccccccCHHHHHHHHhhccCCCcceeceeecccccChHHHHHHH
Q psy10074 169 SYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYRCTQKSDLKKHLRKKHEVKHCFVNIVMNSYGTNIKKCMKRH 248 (420)
Q Consensus 169 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~l~~h 248 (420)
..+.+|-+|.++..-+..|+.|.++|+|++||+|.+|++.|+++.+|+.||-+|....+
T Consensus 603 TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~--------------------- 661 (958)
T KOG1074|consen 603 TDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPP--------------------- 661 (958)
T ss_pred CCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCcc---------------------
Confidence 34678999999999999999999999999999999999999999999999987654332
Q ss_pred HhhhhcCccccccc---ccccccCChHHHHhchhhcCCCC-C------------CccCCccchhccChhHHhhhhhcccC
Q psy10074 249 VERIHLNITKTRCE---LCDLRFFDKVGLKMHVMRKHTGE-K------------PFHCHYCDYICRTKYDLSSHIGRKHT 312 (420)
Q Consensus 249 ~~~~~~~~~~~~C~---~C~~~f~~~~~l~~H~~~~H~~~-~------------~~~C~~C~~~f~~~~~l~~H~~~~H~ 312 (420)
-...+.|+ +|.+.|.+...|..| .++|.+. . .-+|..|.+.|.....+..++ ..|.
T Consensus 662 ------~R~q~ScP~~~ic~~kftn~V~lpQh-IriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~-se~~ 733 (958)
T KOG1074|consen 662 ------ARVQFSCPSTFICQKKFTNAVTLPQH-IRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQI-SEQP 733 (958)
T ss_pred ------ccccccCCchhhhcccccccccccce-EEeecCCCCCCCcccccccchhcccchhhhcccccccchhhh-hccC
Confidence 22456677 788888888778887 5666521 1 135777777777777776663 3332
Q ss_pred Ccc------------cccC----ceecCCCCCccCChHHHhhhhhhhc-----------------------CC-------
Q psy10074 313 DKE------------VLEK----RYVCQLCPKQFLTSGELKYHQTTVH-----------------------RK------- 346 (420)
Q Consensus 313 ~~~------------~~~~----~~~C~~C~~~f~~~~~l~~H~~~h~-----------------------~~------- 346 (420)
... ..+. +..+..|+..+.....+..+-..+. ++
T Consensus 734 ~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~ 813 (958)
T KOG1074|consen 734 SPESEPDEQMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASS 813 (958)
T ss_pred CcccCCcccccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCccccc
Confidence 100 0011 3444555555544444333322110 00
Q ss_pred ----------------------------------------------------------------cceecCcCCccccChh
Q psy10074 347 ----------------------------------------------------------------RVHECSECGFTFFLEK 362 (420)
Q Consensus 347 ----------------------------------------------------------------~~~~C~~C~~~f~~~~ 362 (420)
....|..||+.|...+
T Consensus 814 ~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSs 893 (958)
T KOG1074|consen 814 FPGEILAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSA 893 (958)
T ss_pred CCCcCCccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchH
Confidence 1167999999999999
Q ss_pred hHHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccccCCC
Q psy10074 363 TLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKHTFED 405 (420)
Q Consensus 363 ~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 405 (420)
.|..|++ .|++++||.|..|++.|..+.+|+.||. .|.+..
T Consensus 894 ALqiH~r-THtg~KPF~C~fC~~aFttrgnLKvHMg-tH~w~q 934 (958)
T KOG1074|consen 894 ALEIHMR-THTGPKPFFCHFCEEAFTTRGNLKVHMG-THMWVQ 934 (958)
T ss_pred HHHHhhh-cCCCCCCccchhhhhhhhhhhhhhhhhc-cccccC
Confidence 9999999 9999999999999999999999999995 665443
No 2
>KOG3608|consensus
Probab=99.94 E-value=3.6e-27 Score=202.72 Aligned_cols=229 Identities=25% Similarity=0.477 Sum_probs=196.0
Q ss_pred ccc--ccccCCCCCccccccccccCCchhhhhhcccccccccCC-------CCC-ccee--ccccCccccChhHHHHHHH
Q psy10074 125 FVC--ILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRP-------DVS-YRYV--CFTCDYHSFCSSHMKDHIR 192 (420)
Q Consensus 125 ~~C--~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~-------~~~-~~~~--C~~C~~~f~~~~~l~~H~~ 192 (420)
|.| ..|+..|.+. ..|..|+ ..|..-.. ..+ ..+. ...|-+.+.++..|++|++
T Consensus 135 f~C~WedCe~~F~s~-------------~ef~dHV-~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r 200 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSI-------------VEFQDHV-VKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR 200 (467)
T ss_pred hccChhhcCCcccCH-------------HHHHHHH-HHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH
Confidence 555 4588888888 8888887 66654311 111 1244 4579999999999999999
Q ss_pred HhcCCCceecccccccccCHHHHHHHHhhccCCCcceeceeecccccChHHHHHHHHhhhhcCcccccccccccccCChH
Q psy10074 193 VHLGEKPYMCIYCAYRCTQKSDLKKHLRKKHEVKHCFVNIVMNSYGTNIKKCMKRHVERIHLNITKTRCELCDLRFFDKV 272 (420)
Q Consensus 193 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~l~~h~~~~~~~~~~~~C~~C~~~f~~~~ 272 (420)
.|.+++...|+.|+..|.++..|..|++. .......+|.|..|.+.|.+..
T Consensus 201 ~Hs~eKvvACp~Cg~~F~~~tkl~DH~rR-----------------------------qt~l~~n~fqC~~C~KrFaTek 251 (467)
T KOG3608|consen 201 THSNEKVVACPHCGELFRTKTKLFDHLRR-----------------------------QTELNTNSFQCAQCFKRFATEK 251 (467)
T ss_pred hcCCCeEEecchHHHHhccccHHHHHHHh-----------------------------hhhhcCCchHHHHHHHHHhHHH
Confidence 99999999999999999999999999864 2233345899999999999999
Q ss_pred HHHhchhhcCCCCCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecC
Q psy10074 273 GLKMHVMRKHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECS 352 (420)
Q Consensus 273 ~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 352 (420)
.|..| ...|.+ -|+|+.|+.+....+.|.+||+..|+. .+||+|+.|+..|.+.+.|.+|+..|. +-.|.|.
T Consensus 252 lL~~H-v~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~----dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~ 323 (467)
T KOG3608|consen 252 LLKSH-VVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSK----DKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCE 323 (467)
T ss_pred HHHHH-HHHhhh--cccccccccCCCChHHHHHHHHhhhcc----CCCccccchhhhhccHHHHHHHHHhcc-ccceecC
Confidence 99999 566754 599999999999999999999999998 999999999999999999999999887 6789998
Q ss_pred c--CCccccChhhHHHHHHhhcCCC--CCcccCcCccccCCHHHHHccccccccCC
Q psy10074 353 E--CGFTFFLEKTLRAHIRETHRKE--TPFPCPFCEYKAKRKFELKSHMKRKHTFE 404 (420)
Q Consensus 353 ~--C~~~f~~~~~L~~H~~~~H~~~--~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 404 (420)
. |.++|.+...|.+|++.+|.+. .+|.|..|.+.|++-.+|..|+.+.|+-.
T Consensus 324 h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 324 HPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred CCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 8 9999999999999999888654 45999999999999999999999999754
No 3
>KOG2462|consensus
Probab=99.92 E-value=5.8e-26 Score=190.86 Aligned_cols=135 Identities=29% Similarity=0.522 Sum_probs=124.7
Q ss_pred CcccccccccccccCChHHHHhchhhcCCC---CCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccC
Q psy10074 255 NITKTRCELCDLRFFDKVGLKMHVMRKHTG---EKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFL 331 (420)
Q Consensus 255 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~---~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~ 331 (420)
....|+|+.||+.+.+...|.+| ++.|-. .+.+.|++|++.+.+...|..|+ +.|. -+.+|.+||+.|.
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrH-kQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHi-rTH~------l~c~C~iCGKaFS 198 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRH-KQTHRSLDSKKAFSCKYCGKVYVSMPALKMHI-RTHT------LPCECGICGKAFS 198 (279)
T ss_pred cCCceeccccccccccccccchh-hcccccccccccccCCCCCceeeehHHHhhHh-hccC------CCccccccccccc
Confidence 34568999999999999999999 788864 56799999999999999999995 6664 3699999999999
Q ss_pred ChHHHhhhhhhhcCCcceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHcccc
Q psy10074 332 TSGELKYHQTTVHRKRVHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMK 398 (420)
Q Consensus 332 ~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 398 (420)
..+.|+.|+++|+|||||.|+.|+++|++++.|+.||+ .|++.++|+|+.|+++|..++.|.+|..
T Consensus 199 RPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQ-THS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 199 RPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQ-THSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred chHHhhcccccccCCCCccCCcccchhcchHHHHHHHH-hhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 99999999999999999999999999999999999999 9999999999999999999999999974
No 4
>KOG2462|consensus
Probab=99.91 E-value=2.3e-25 Score=187.29 Aligned_cols=130 Identities=28% Similarity=0.499 Sum_probs=121.2
Q ss_pred CCCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChhh
Q psy10074 284 GEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEKT 363 (420)
Q Consensus 284 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 363 (420)
....|+|+.|++.+.+..+|.+| +..|...+. .+.+.|..|++.|.+..+|..|+++|. -+.+|++||+.|.....
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrH-kQ~H~~~~s-~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWL 202 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRH-KQTHRSLDS-KKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWL 202 (279)
T ss_pred cCCceeccccccccccccccchh-hcccccccc-cccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHH
Confidence 34569999999999999999999 788876544 778999999999999999999999997 68999999999999999
Q ss_pred HHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccccCCCCccccccchhhhcc
Q psy10074 364 LRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKHTFEDSLTTEETEYYKILE 419 (420)
Q Consensus 364 L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~ 419 (420)
|+-|+| .|+|++||.|+.|++.|.++++|+.|| +.|++.+.|+|..|++.|..+
T Consensus 203 LQGHiR-THTGEKPF~C~hC~kAFADRSNLRAHm-QTHS~~K~~qC~~C~KsFsl~ 256 (279)
T KOG2462|consen 203 LQGHIR-THTGEKPFSCPHCGKAFADRSNLRAHM-QTHSDVKKHQCPRCGKSFALK 256 (279)
T ss_pred hhcccc-cccCCCCccCCcccchhcchHHHHHHH-HhhcCCccccCcchhhHHHHH
Confidence 999999 999999999999999999999999999 999999999999999999854
No 5
>KOG3608|consensus
Probab=99.91 E-value=2.4e-24 Score=185.41 Aligned_cols=238 Identities=22% Similarity=0.390 Sum_probs=200.8
Q ss_pred ceec--cccCccccC-hhHHHHHHHHhcC----------------------------------CCceeccc--ccccccC
Q psy10074 171 RYVC--FTCDYHSFC-SSHMKDHIRVHLG----------------------------------EKPYMCIY--CAYRCTQ 211 (420)
Q Consensus 171 ~~~C--~~C~~~f~~-~~~l~~H~~~h~~----------------------------------~~~~~C~~--C~~~f~~ 211 (420)
.++| ..|++...+ ...|.+|+-.|.= ...|.|.+ |+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 4566 468887776 5889999866520 01278866 9999999
Q ss_pred HHHHHHHHhhccCC-------------CcceeceeecccccChHHHHHHHHhhhhcCcccccccccccccCChHHHHhch
Q psy10074 212 KSDLKKHLRKKHEV-------------KHCFVNIVMNSYGTNIKKCMKRHVERIHLNITKTRCELCDLRFFDKVGLKMHV 278 (420)
Q Consensus 212 ~~~l~~H~~~~h~~-------------~~~~c~~c~~~~~~~~~~~l~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 278 (420)
...+..|+..|.-. ....|..-.+...+.++..|.+|+ +.|++++...|+.||..|.+...|..|.
T Consensus 149 ~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~-r~Hs~eKvvACp~Cg~~F~~~tkl~DH~ 227 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHI-RTHSNEKVVACPHCGELFRTKTKLFDHL 227 (467)
T ss_pred HHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHH-HhcCCCeEEecchHHHHhccccHHHHHH
Confidence 99999998766421 224566666688899999999999 9999999999999999999999999995
Q ss_pred hh-cCCCCCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhh-hcCCcceecCcCCc
Q psy10074 279 MR-KHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTT-VHRKRVHECSECGF 356 (420)
Q Consensus 279 ~~-~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~ 356 (420)
++ .-.+..+|.|..|.+.|.+...|..|+.+ | ...|+|++|+-+....++|..|++. |...+||+|+.|..
T Consensus 228 rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~r-H------vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~ 300 (467)
T KOG3608|consen 228 RRQTELNTNSFQCAQCFKRFATEKLLKSHVVR-H------VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDT 300 (467)
T ss_pred HhhhhhcCCchHHHHHHHHHhHHHHHHHHHHH-h------hhcccccccccCCCChHHHHHHHHhhhccCCCccccchhh
Confidence 43 22356789999999999999999999644 4 3469999999999999999999985 66889999999999
Q ss_pred cccChhhHHHHHHhhcCCCCCcccCc--CccccCCHHHHHccccccccCC--CCccccccchhhhc
Q psy10074 357 TFFLEKTLRAHIRETHRKETPFPCPF--CEYKAKRKFELKSHMKRKHTFE--DSLTTEETEYYKIL 418 (420)
Q Consensus 357 ~f~~~~~L~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~--~~~~C~~C~~~f~~ 418 (420)
.|.+.+.|.+|+. .|+ +..|+|.. |.++|....+|++|++.+|-|. .+|.|..|++.||+
T Consensus 301 ~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~ 364 (467)
T KOG3608|consen 301 RCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTS 364 (467)
T ss_pred hhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhcc
Confidence 9999999999999 998 67799988 9999999999999999999664 45999999999985
No 6
>KOG1074|consensus
Probab=99.90 E-value=1.1e-24 Score=207.23 Aligned_cols=109 Identities=29% Similarity=0.554 Sum_probs=91.1
Q ss_pred ccccccccccCChHHHHhchhhcCCCCCCccCCccchhccChhHHhhhhhcccCCcccccCceecC---CCCCccCChHH
Q psy10074 259 TRCELCDLRFFDKVGLKMHVMRKHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQ---LCPKQFLTSGE 335 (420)
Q Consensus 259 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~---~C~~~f~~~~~ 335 (420)
..|-+|.++..-...|+.| ++.|+|++||+|.+|++.|.++.+|+.|+ ..|........++.|+ +|-+.|+..-.
T Consensus 606 NqCiiC~rVlSC~saLqmH-yrtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMH-YRTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred cceeeeeecccchhhhhhh-hhcccCcCccccccccchhccccchhhcc-cccccCccccccccCCchhhhccccccccc
Confidence 6799999999999999999 79999999999999999999999999994 7777655445679999 99999999999
Q ss_pred HhhhhhhhcCC-c------------ceecCcCCccccChhhHHHHHH
Q psy10074 336 LKYHQTTVHRK-R------------VHECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 336 l~~H~~~h~~~-~------------~~~C~~C~~~f~~~~~L~~H~~ 369 (420)
|..|+++|.+. . .=+|..|.+.|.....+..++.
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~s 730 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQIS 730 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhh
Confidence 99999999832 1 1357777777766555555544
No 7
>KOG3623|consensus
Probab=99.87 E-value=3e-22 Score=187.49 Aligned_cols=111 Identities=19% Similarity=0.367 Sum_probs=91.2
Q ss_pred cccccccchhhhccChHHHHHhcccccchhccc-ccCCcccccchhhccccccccchhhhhccCCCCc--cccccccccc
Q psy10074 13 ADIKCCHCNKYISSDLRDLLEHCRLCPAMTRTD-LLKHKYVCYACSYATYYNGNMKKHLRIHLGDRPF--MCTLLICQYC 89 (420)
Q Consensus 13 ~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~-~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~--~c~~c~~~~~ 89 (420)
...+|++|.+.+.. ...|..|+ +..| ..+..|.|..|.++|..+..|.+||.+|.....- .-..-.....
T Consensus 209 qlltcpycdrgykr-ltslkeHi------kyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRK 281 (1007)
T KOG3623|consen 209 QLLTCPYCDRGYKR-LTSLKEHI------KYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRK 281 (1007)
T ss_pred hhhcchhHHHHHHH-HHHHHHHH------HHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhcc
Confidence 56899999999999 99999994 2233 3456699999999999999999999999654321 1111112256
Q ss_pred chhhhhhhhhhhhhhhhhhhcccccccccCCCCcccccccccCCCCCc
Q psy10074 90 LICQYCNQYFVRSTQDILQHGKVCNFAPRSDANFNFVCILCNYHIPCK 137 (420)
Q Consensus 90 ~~C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~~C~~C~~~f~~~ 137 (420)
|+|..||+.|..+- .|..|+|+ |.|+++|.|+-|+++|...
T Consensus 282 FKCtECgKAFKfKH-HLKEHlRI------HSGEKPfeCpnCkKRFSHS 322 (1007)
T KOG3623|consen 282 FKCTECGKAFKFKH-HLKEHLRI------HSGEKPFECPNCKKRFSHS 322 (1007)
T ss_pred ccccccchhhhhHH-HHHhhhee------ecCCCCcCCcccccccccC
Confidence 78899999999999 99999999 9999999999999999877
No 8
>KOG3623|consensus
Probab=99.83 E-value=3.5e-21 Score=180.39 Aligned_cols=68 Identities=24% Similarity=0.413 Sum_probs=62.0
Q ss_pred eecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccccCCCCccccccchhhhc
Q psy10074 349 HECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKHTFEDSLTTEETEYYKIL 418 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~ 418 (420)
|.|+.|.++|...++|.+|.- .|+|.+||+|.+|.+.|..+..|..|+ +.|.|++||+|..|++-|+.
T Consensus 895 yaCDqCDK~FqKqSSLaRHKY-EHsGqRPyqC~iCkKAFKHKHHLtEHk-RLHSGEKPfQCdKClKRFSH 962 (1007)
T KOG3623|consen 895 YACDQCDKAFQKQSSLARHKY-EHSGQRPYQCIICKKAFKHKHHLTEHK-RLHSGEKPFQCDKCLKRFSH 962 (1007)
T ss_pred chHHHHHHHHHhhHHHHHhhh-hhcCCCCcccchhhHhhhhhhhhhhhh-hhccCCCcchhhhhhhhccc
Confidence 889999999999999999976 999999999999999999999999998 89999999999999998873
No 9
>KOG3576|consensus
Probab=99.62 E-value=8.5e-17 Score=128.58 Aligned_cols=118 Identities=28% Similarity=0.498 Sum_probs=106.6
Q ss_pred CCCCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChh
Q psy10074 283 TGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEK 362 (420)
Q Consensus 283 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 362 (420)
.+...|.|.+|++.|.-...|.+| ...|++ -+.+.|..||+.|.+.-.|++|+++|+|.+||+|..|+++|++.-
T Consensus 113 sd~d~ftCrvCgK~F~lQRmlnrh-~kch~~----vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrc 187 (267)
T KOG3576|consen 113 SDQDSFTCRVCGKKFGLQRMLNRH-LKCHSD----VKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRC 187 (267)
T ss_pred CCCCeeeeehhhhhhhHHHHHHHH-hhhccH----HHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhc
Confidence 345679999999999999999999 477777 788999999999999999999999999999999999999999999
Q ss_pred hHHHHHHhhcCCC----------CCcccCcCccccCCHHHHHccccccccCCC
Q psy10074 363 TLRAHIRETHRKE----------TPFPCPFCEYKAKRKFELKSHMKRKHTFED 405 (420)
Q Consensus 363 ~L~~H~~~~H~~~----------~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 405 (420)
+|..|++.+|+.. +.|.|..||..-.....+..|+...|....
T Consensus 188 sleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 188 SLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred cHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 9999999999753 569999999999999999999988887654
No 10
>KOG3576|consensus
Probab=99.54 E-value=1.8e-15 Score=121.02 Aligned_cols=125 Identities=26% Similarity=0.445 Sum_probs=103.5
Q ss_pred CCCCcccccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHhcCCC
Q psy10074 119 SDANFNFVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVHLGEK 198 (420)
Q Consensus 119 h~~~~~~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 198 (420)
..+...|.|.+|++.|... ..|+.|+ ..|+..+ .+-|..||+.|...-+|++|+++|++.+
T Consensus 112 ssd~d~ftCrvCgK~F~lQ-------------Rmlnrh~-kch~~vk-----r~lct~cgkgfndtfdlkrh~rthtgvr 172 (267)
T KOG3576|consen 112 SSDQDSFTCRVCGKKFGLQ-------------RMLNRHL-KCHSDVK-----RHLCTFCGKGFNDTFDLKRHTRTHTGVR 172 (267)
T ss_pred CCCCCeeeeehhhhhhhHH-------------HHHHHHh-hhccHHH-----HHHHhhccCcccchhhhhhhhccccCcc
Confidence 3445679999999999998 9999999 9999988 8999999999999999999999999999
Q ss_pred ceecccccccccCHHHHHHHHhhccCCCcceeceeecccccChHHHHHHHHhhhhcCcccccccccccccCChHHHHhch
Q psy10074 199 PYMCIYCAYRCTQKSDLKKHLRKKHEVKHCFVNIVMNSYGTNIKKCMKRHVERIHLNITKTRCELCDLRFFDKVGLKMHV 278 (420)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~~~l~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 278 (420)
||+|..|++.|+.+-.|..|++..|+... .+. -.....+.|.|..||.+-.....+..|+
T Consensus 173 pykc~~c~kaftqrcsleshl~kvhgv~~----------~ya----------ykerr~kl~vcedcg~t~~~~e~~~~h~ 232 (267)
T KOG3576|consen 173 PYKCSLCEKAFTQRCSLESHLKKVHGVQH----------QYA----------YKERRAKLYVCEDCGYTSERPEVYYLHL 232 (267)
T ss_pred ccchhhhhHHHHhhccHHHHHHHHcCchH----------HHH----------HHHhhhheeeecccCCCCCChhHHHHHH
Confidence 99999999999999999999998887531 010 1122345688888888888888888885
Q ss_pred hhcC
Q psy10074 279 MRKH 282 (420)
Q Consensus 279 ~~~H 282 (420)
...|
T Consensus 233 ~~~h 236 (267)
T KOG3576|consen 233 KLHH 236 (267)
T ss_pred HhcC
Confidence 4444
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.16 E-value=3.4e-11 Score=114.98 Aligned_cols=136 Identities=17% Similarity=0.369 Sum_probs=100.7
Q ss_pred cccccccccccCChHHHHhchhhcCCCCCCccCCc--cchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHH
Q psy10074 258 KTRCELCDLRFFDKVGLKMHVMRKHTGEKPFHCHY--CDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGE 335 (420)
Q Consensus 258 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~ 335 (420)
...|+.|....... .|..| . .+-.-..-.|+. |+..|. ...+.+| +.|+.|++.|. ...
T Consensus 407 ~V~C~NC~~~i~l~-~l~lH-e-~~C~r~~V~Cp~~~Cg~v~~-r~el~~H--------------~~C~~Cgk~f~-~s~ 467 (567)
T PLN03086 407 TVECRNCKHYIPSR-SIALH-E-AYCSRHNVVCPHDGCGIVLR-VEEAKNH--------------VHCEKCGQAFQ-QGE 467 (567)
T ss_pred eEECCCCCCccchh-HHHHH-H-hhCCCcceeCCcccccceee-ccccccC--------------ccCCCCCCccc-hHH
Confidence 34688888877654 56677 2 333334456874 888773 2233333 57999999886 677
Q ss_pred HhhhhhhhcCCcceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCC----------HHHHHccccccccCCC
Q psy10074 336 LKYHQTTVHRKRVHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKR----------KFELKSHMKRKHTFED 405 (420)
Q Consensus 336 l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~----------~~~l~~H~~~~H~~~~ 405 (420)
|..|+.+++ +++.|+ ||..+ ....|..|+. .|...+++.|+.|++.|.. .+.|..|. ... |.+
T Consensus 468 LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~-thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE-~~C-G~r 540 (567)
T PLN03086 468 MEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQA-STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHE-SIC-GSR 540 (567)
T ss_pred HHHHHHhcC--CCccCC-CCCCc-chhHHHhhhh-ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHH-Hhc-CCc
Confidence 899999876 789999 99755 5688999987 8888899999999998852 45788998 444 788
Q ss_pred Cccccccchhhhcc
Q psy10074 406 SLTTEETEYYKILE 419 (420)
Q Consensus 406 ~~~C~~C~~~f~~~ 419 (420)
++.|..||+.|..+
T Consensus 541 t~~C~~Cgk~Vrlr 554 (567)
T PLN03086 541 TAPCDSCGRSVMLK 554 (567)
T ss_pred ceEccccCCeeeeh
Confidence 89999999988754
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.09 E-value=2.1e-10 Score=109.65 Aligned_cols=121 Identities=21% Similarity=0.382 Sum_probs=94.2
Q ss_pred cccccc--cccccCChHHHHhchhhcCCCCCCccCCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHH
Q psy10074 258 KTRCEL--CDLRFFDKVGLKMHVMRKHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGE 335 (420)
Q Consensus 258 ~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~ 335 (420)
...|+. ||..|. +..+..| +.|+.|++.|. ...|..|+...| .++.|+ |+..+ .+..
T Consensus 433 ~V~Cp~~~Cg~v~~-r~el~~H----------~~C~~Cgk~f~-~s~LekH~~~~H-------kpv~Cp-Cg~~~-~R~~ 491 (567)
T PLN03086 433 NVVCPHDGCGIVLR-VEEAKNH----------VHCEKCGQAFQ-QGEMEKHMKVFH-------EPLQCP-CGVVL-EKEQ 491 (567)
T ss_pred ceeCCcccccceee-ccccccC----------ccCCCCCCccc-hHHHHHHHHhcC-------CCccCC-CCCCc-chhH
Confidence 446874 998884 3344444 47999999886 677899976654 358899 99755 6789
Q ss_pred HhhhhhhhcCCcceecCcCCcccc----------ChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHcccccccc
Q psy10074 336 LKYHQTTVHRKRVHECSECGFTFF----------LEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKHT 402 (420)
Q Consensus 336 l~~H~~~h~~~~~~~C~~C~~~f~----------~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 402 (420)
|..|+.+|.+.+++.|+.|+..|. ..+.|..|.. .+ +.+++.|..|++.+..+ .|..|+..+|.
T Consensus 492 L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~-~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 492 MVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHES-IC-GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred HHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHH-hc-CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence 999999999999999999999985 2457999988 55 77889999999988764 57888888875
No 13
>PHA00733 hypothetical protein
Probab=99.03 E-value=1.7e-10 Score=89.82 Aligned_cols=82 Identities=22% Similarity=0.318 Sum_probs=56.9
Q ss_pred cCceecCCCCCccCChHHHhhh--h---hhhcCCcceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHH
Q psy10074 318 EKRYVCQLCPKQFLTSGELKYH--Q---TTVHRKRVHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFE 392 (420)
Q Consensus 318 ~~~~~C~~C~~~f~~~~~l~~H--~---~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~ 392 (420)
.+++.|.+|+..|.....|..+ + ..+++.+||.|+.|++.|.+...|..|++ .| ..+|.|+.|++.|.....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r-~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIR-YT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHh-cC--CcCccCCCCCCccCCHHH
Confidence 4456666666666555544444 1 12334678888888888888888888877 44 346888888888888888
Q ss_pred HHcccccccc
Q psy10074 393 LKSHMKRKHT 402 (420)
Q Consensus 393 l~~H~~~~H~ 402 (420)
|..|+...|+
T Consensus 115 L~~H~~~~h~ 124 (128)
T PHA00733 115 TLDHVCKKHN 124 (128)
T ss_pred HHHHHHHhcC
Confidence 8888877775
No 14
>PHA00733 hypothetical protein
Probab=98.89 E-value=2.4e-09 Score=83.39 Aligned_cols=85 Identities=21% Similarity=0.313 Sum_probs=63.5
Q ss_pred CCCCCccCCccchhccChhHHhhh--hhc--ccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccc
Q psy10074 283 TGEKPFHCHYCDYICRTKYDLSSH--IGR--KHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTF 358 (420)
Q Consensus 283 ~~~~~~~C~~C~~~f~~~~~l~~H--~~~--~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 358 (420)
...+++.|.+|...|.....|..+ +.+ .+.+ .++|.|+.|++.|.+...|..|+..+ +.+|.|+.|++.|
T Consensus 36 ~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~----~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F 109 (128)
T PHA00733 36 PEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKA----VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEF 109 (128)
T ss_pred hhhhhHHHHHHhhhccChhhhcchHHHHhhcccCC----CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCcc
Confidence 345667777777777666655555 111 2222 67899999999999999999998875 3578999999999
Q ss_pred cChhhHHHHHHhhcC
Q psy10074 359 FLEKTLRAHIRETHR 373 (420)
Q Consensus 359 ~~~~~L~~H~~~~H~ 373 (420)
.....|..|+...|.
T Consensus 110 ~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 110 RNTDSTLDHVCKKHN 124 (128)
T ss_pred CCHHHHHHHHHHhcC
Confidence 999999999886664
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.66 E-value=1.1e-08 Score=65.12 Aligned_cols=44 Identities=27% Similarity=0.445 Sum_probs=32.6
Q ss_pred ceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHH
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELK 394 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~ 394 (420)
.|+|+.||+.|+....|..|++ .|. ++|+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r-~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLR-KHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHH-hcC--CcccCCcccceecccceeE
Confidence 4778888888888888888877 566 5777888887777666653
No 16
>KOG3993|consensus
Probab=98.65 E-value=4.6e-09 Score=94.07 Aligned_cols=145 Identities=23% Similarity=0.332 Sum_probs=100.9
Q ss_pred ccccccccccccCChHHHHhchhhcCCCCCCccCCccchhccChhHHhhhhhcccCCcc---------------------
Q psy10074 257 TKTRCELCDLRFFDKVGLKMHVMRKHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTDKE--------------------- 315 (420)
Q Consensus 257 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~--------------------- 315 (420)
..|+|.+|...|.+...|.+| +-.-.-..-|+|+.|++.|.-..+|..| ++.|....
T Consensus 266 GdyiCqLCK~kYeD~F~LAQH-rC~RIV~vEYrCPEC~KVFsCPANLASH-RRWHKPR~eaa~a~~~P~k~~~~~rae~~ 343 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQH-RCPRIVHVEYRCPECDKVFSCPANLASH-RRWHKPRPEAAKAGSPPPKQAVETRAEVQ 343 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhc-cCCeeEEeeecCCcccccccCchhhhhh-hcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence 348899999999999999999 4322233458999999999999999999 67774311
Q ss_pred --------cccCceecCCCCCccCChHHHhhhhhhhcCCc----------------------------------------
Q psy10074 316 --------VLEKRYVCQLCPKQFLTSGELKYHQTTVHRKR---------------------------------------- 347 (420)
Q Consensus 316 --------~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~---------------------------------------- 347 (420)
..+.-|.|..|++.|....-|+.|+.+|+...
T Consensus 344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~ 423 (500)
T KOG3993|consen 344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLY 423 (500)
T ss_pred hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceee
Confidence 11334888888888888888888887776210
Q ss_pred ------ceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccccCC
Q psy10074 348 ------VHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKHTFE 404 (420)
Q Consensus 348 ------~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 404 (420)
.--++.++..+.++..--.+.+ --.....|.|.+|...|.+.-.|.+|+...|..+
T Consensus 424 ~a~sael~~pp~~~~ppsss~~sgg~~r-lg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 424 VAGSAELELPPYDGSPPSSSGSSGGYGR-LGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred eeccccccCCCCCCCCcccCCCCCcccc-ccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 0224555544444433222222 1112345899999999999999999999999754
No 17
>KOG3993|consensus
Probab=98.64 E-value=9.3e-09 Score=92.15 Aligned_cols=202 Identities=12% Similarity=0.091 Sum_probs=124.4
Q ss_pred CcceeccccCccccChhHHHHHHHHhcCCCceecccccccccCHHHHHHHHhhccCCCcceeceeeccc-ccChHHHHHH
Q psy10074 169 SYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYRCTQKSDLKKHLRKKHEVKHCFVNIVMNSY-GTNIKKCMKR 247 (420)
Q Consensus 169 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~-~~~~~~~l~~ 247 (420)
...|.|..|...|.+.-.|.+|.=.......|+|+.|++.|.-..+|..|.++|......--.--.... ...+..+.++
T Consensus 265 iGdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 265 IGDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 345999999999999999999954333335699999999999999999999998754321100000000 0011111111
Q ss_pred HH-hhhhcCcccccccccccccCChHHHHhchhhcCCCCCC----------------ccCCccchhccChhHHhhhhhcc
Q psy10074 248 HV-ERIHLNITKTRCELCDLRFFDKVGLKMHVMRKHTGEKP----------------FHCHYCDYICRTKYDLSSHIGRK 310 (420)
Q Consensus 248 h~-~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~----------------~~C~~C~~~f~~~~~l~~H~~~~ 310 (420)
-. ......+..|.|..|++.|.....|+.|+...|..... +.+..+.-.+.....-..+ ...
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~-vl~ 423 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDE-VLY 423 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccc-eee
Confidence 11 00112334699999999999999999995433322111 1233333222222221122 111
Q ss_pred cCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChhhHHHHHHhhcCC
Q psy10074 311 HTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEKTLRAHIRETHRK 374 (420)
Q Consensus 311 H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~ 374 (420)
+.+. +..-.+++++..+.++..-..+.+.-..+..|.|.+|...|.+...|.+|+...|..
T Consensus 424 ~a~s---ael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 424 VAGS---AELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred eecc---ccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 1111 223467888877777766666666556667899999999999999999999877754
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.58 E-value=2.1e-08 Score=63.77 Aligned_cols=44 Identities=14% Similarity=0.273 Sum_probs=39.0
Q ss_pred ceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChhhHH
Q psy10074 320 RYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEKTLR 365 (420)
Q Consensus 320 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 365 (420)
.|.|+.||+.|...+.|..|+++|+ ++|+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999988 6899999999998877663
No 19
>KOG1146|consensus
Probab=98.37 E-value=2.2e-07 Score=95.12 Aligned_cols=93 Identities=25% Similarity=0.408 Sum_probs=69.7
Q ss_pred cccccccchhhhccChHHHHHhcccccchhcccc-------------------------cCCcccccchhhccccccccc
Q psy10074 13 ADIKCCHCNKYISSDLRDLLEHCRLCPAMTRTDL-------------------------LKHKYVCYACSYATYYNGNMK 67 (420)
Q Consensus 13 ~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~-------------------------~~~~~~C~~C~~~f~~~~~l~ 67 (420)
+.|.|+.|+..|.. ...|..| ++.-+ +.++|.|..|++.+.++.+|.
T Consensus 464 kt~~cpkc~~~yk~-a~~L~vh-------mRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Ls 535 (1406)
T KOG1146|consen 464 KTLKCPKCNWHYKL-AQTLGVH-------MRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLS 535 (1406)
T ss_pred ccccCCccchhhhh-HHHhhhc-------ccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHH
Confidence 89999999999999 9999999 66511 235799999999999999999
Q ss_pred hhhhhcc--CC----------C----Cc-cccccccc---------------ccchhhhhhhhhhhhhhhhhhhccccc
Q psy10074 68 KHLRIHL--GD----------R----PF-MCTLLICQ---------------YCLICQYCNQYFVRSTQDILQHGKVCN 114 (420)
Q Consensus 68 ~H~~~h~--~~----------~----~~-~c~~c~~~---------------~~~~C~~C~~~f~~~~~~l~~H~~~~~ 114 (420)
.||+.-. .+ . +. .+..++.. ..+.|..|++...-.. .|+.||....
T Consensus 536 ihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniar-nlrihmtss~ 613 (1406)
T KOG1146|consen 536 IHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIAR-NLRIHMTASP 613 (1406)
T ss_pred HHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhh-ccccccccCC
Confidence 9987421 10 0 00 12222222 6789999999988888 8999987743
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.34 E-value=2.8e-07 Score=49.95 Aligned_cols=24 Identities=46% Similarity=1.041 Sum_probs=14.0
Q ss_pred HHHHHHhhcCCCCCcccCcCccccC
Q psy10074 364 LRAHIRETHRKETPFPCPFCEYKAK 388 (420)
Q Consensus 364 L~~H~~~~H~~~~~~~C~~C~~~f~ 388 (420)
|.+|++ .|++++||+|++|+++|.
T Consensus 2 l~~H~~-~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMR-THTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHH-HHSSSSSEEESSSSEEES
T ss_pred HHHHhh-hcCCCCCCCCCCCcCeeC
Confidence 555655 555666666666665554
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.29 E-value=4.9e-07 Score=48.98 Aligned_cols=24 Identities=29% Similarity=0.552 Sum_probs=16.2
Q ss_pred HhhhhhhhcCCcceecCcCCcccc
Q psy10074 336 LKYHQTTVHRKRVHECSECGFTFF 359 (420)
Q Consensus 336 l~~H~~~h~~~~~~~C~~C~~~f~ 359 (420)
|..|+++|++++||.|+.|+++|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 566666666666666766666664
No 22
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.24 E-value=9.8e-07 Score=57.62 Aligned_cols=52 Identities=31% Similarity=0.649 Sum_probs=37.8
Q ss_pred ceecCcCCccccChhhHHHHHHhhcCCC-CCcccCcCccccCCHHHHHcccccccc
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHRKE-TPFPCPFCEYKAKRKFELKSHMKRKHT 402 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~H~ 402 (420)
.|.||.|++. .+...|..|+...|..+ +.+.||+|...+. .+|..|+...|+
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 3778888884 45677888888777765 4588888887654 478888876663
No 23
>PHA00732 hypothetical protein
Probab=98.12 E-value=1.8e-06 Score=60.69 Aligned_cols=44 Identities=32% Similarity=0.696 Sum_probs=26.8
Q ss_pred ceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHccc
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHM 397 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 397 (420)
||.|+.|++.|.+...|..|++..|.+ +.|+.|+++|. .|..|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhh
Confidence 456666666666666666666533442 45666666665 355665
No 24
>PHA00616 hypothetical protein
Probab=98.05 E-value=1.8e-06 Score=52.15 Aligned_cols=21 Identities=29% Similarity=0.534 Sum_probs=9.3
Q ss_pred eecCcCCccccChhhHHHHHH
Q psy10074 349 HECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~ 369 (420)
|+|+.||..|...+.|..|++
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r 22 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLL 22 (44)
T ss_pred CccchhhHHHhhHHHHHHHHH
Confidence 344444444444444444444
No 25
>PHA00732 hypothetical protein
Probab=97.88 E-value=6.8e-06 Score=57.84 Aligned_cols=39 Identities=28% Similarity=0.519 Sum_probs=34.5
Q ss_pred CcccCcCccccCCHHHHHccccccccCCCCccccccchhhhc
Q psy10074 377 PFPCPFCEYKAKRKFELKSHMKRKHTFEDSLTTEETEYYKIL 418 (420)
Q Consensus 377 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~ 418 (420)
||.|+.|++.|.+.+.|..|++..|++ +.|+.||+.|+.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~~ 39 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYRR 39 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeCC
Confidence 589999999999999999999656764 589999999974
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.87 E-value=1.3e-05 Score=52.27 Aligned_cols=49 Identities=24% Similarity=0.480 Sum_probs=19.6
Q ss_pred eecCCCCCccCChHHHhhhhhhhcC--CcceecCcCCccccChhhHHHHHHhhc
Q psy10074 321 YVCQLCPKQFLTSGELKYHQTTVHR--KRVHECSECGFTFFLEKTLRAHIRETH 372 (420)
Q Consensus 321 ~~C~~C~~~f~~~~~l~~H~~~h~~--~~~~~C~~C~~~f~~~~~L~~H~~~~H 372 (420)
|.||+|++ -.+...|..|....+. .+.+.||+|...+. ..|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 34444444 2233344444433221 12344444444332 24455544333
No 27
>PHA00616 hypothetical protein
Probab=97.86 E-value=7.1e-06 Score=49.64 Aligned_cols=37 Identities=14% Similarity=0.174 Sum_probs=23.0
Q ss_pred ceeccccCccccChhHHHHHHHHhcCCCceecccccc
Q psy10074 171 RYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAY 207 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 207 (420)
+|+|+.||+.|...+.|..|++.|++++++.|..=..
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~ 37 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYI 37 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEE
Confidence 3566666666666666666666666666666654333
No 28
>KOG1146|consensus
Probab=97.68 E-value=1.4e-05 Score=82.33 Aligned_cols=87 Identities=13% Similarity=0.144 Sum_probs=62.8
Q ss_pred cccchhhhccChHHHHHhcccccchhcccccCCcccccchhhccccccccchhhhhccCCCCccccccccc---------
Q psy10074 17 CCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQ--------- 87 (420)
Q Consensus 17 C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~--------- 87 (420)
|.-|+..+.+ .+.+..|. ...+.-.+.+.|+.|++.|.....|..||++-+.+... .+|...
T Consensus 439 ~~~~e~~~~s-~r~~~~~t------~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~~gq~~~~~arg 509 (1406)
T KOG1146|consen 439 LTKAEPLLES-KRSLEGQT------VVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCKAGQNHPRLARG 509 (1406)
T ss_pred ccchhhhhhh-hcccccce------eeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhHhcccccccccc
Confidence 4445555666 56666662 44555568899999999999999999999974433221 122211
Q ss_pred -------ccchhhhhhhhhhhhhhhhhhhcccc
Q psy10074 88 -------YCLICQYCNQYFVRSTQDILQHGKVC 113 (420)
Q Consensus 88 -------~~~~C~~C~~~f~~~~~~l~~H~~~~ 113 (420)
.+|.|..|+.++.++. .|.+|+...
T Consensus 510 ~~~~~~~~p~~C~~C~~stttng-~LsihlqS~ 541 (1406)
T KOG1146|consen 510 EVYRCPGKPYPCRACNYSTTTNG-NLSIHLQSD 541 (1406)
T ss_pred ccccCCCCcccceeeeeeeecch-HHHHHHHHH
Confidence 7889999999999999 899998764
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.58 E-value=5.9e-05 Score=39.53 Aligned_cols=22 Identities=32% Similarity=0.567 Sum_probs=13.5
Q ss_pred eeccccCccccChhHHHHHHHH
Q psy10074 172 YVCFTCDYHSFCSSHMKDHIRV 193 (420)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~ 193 (420)
|.|+.|++.|.+...|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666654
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.46 E-value=8.3e-05 Score=38.95 Aligned_cols=21 Identities=33% Similarity=0.757 Sum_probs=10.8
Q ss_pred eecCcCCccccChhhHHHHHH
Q psy10074 349 HECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~ 369 (420)
|+|+.|++.|.+...|..|++
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHh
Confidence 345555555555555555544
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.38 E-value=0.00012 Score=54.97 Aligned_cols=71 Identities=25% Similarity=0.398 Sum_probs=14.1
Q ss_pred CCccchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChhhHHHHHH
Q psy10074 290 CHYCDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 290 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 369 (420)
|..|+..|.+...|..|+...|.. ..+ ....+.....+..++..- ....+.|..|++.|.+...|..|++
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~----~~~-----~~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s~~~l~~Hm~ 71 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGF----DIP-----DQKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRSREALQEHMR 71 (100)
T ss_dssp ---------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHH
T ss_pred cccccccccccccccccccccccc----ccc-----cccccccccccccccccc-cCCCCCCCccCCCCcCHHHHHHHHc
Confidence 566666666666666666555543 111 111111222233332221 1124666666666666666666666
Q ss_pred h
Q psy10074 370 E 370 (420)
Q Consensus 370 ~ 370 (420)
.
T Consensus 72 ~ 72 (100)
T PF12756_consen 72 S 72 (100)
T ss_dssp H
T ss_pred C
Confidence 3
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.36 E-value=0.00018 Score=38.00 Aligned_cols=21 Identities=38% Similarity=0.820 Sum_probs=9.5
Q ss_pred eecCcCCccccChhhHHHHHH
Q psy10074 349 HECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~ 369 (420)
|.|++|++.|.+...|..|+.
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~ 21 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMR 21 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHH
Confidence 345555555555555555544
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.33 E-value=8.2e-05 Score=55.87 Aligned_cols=75 Identities=23% Similarity=0.453 Sum_probs=22.6
Q ss_pred ecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccc
Q psy10074 322 VCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKH 401 (420)
Q Consensus 322 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 401 (420)
.|..|+..|.+...|..|+...++-..- ....+.....+..+++.. ....+.|.+|++.|.+...|..||+..+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~--~~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKK--VKESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred Cccccccccccccccccccccccccccc----cccccccccccccccccc--cCCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 4889999999999999999776643211 112223444455554411 1236999999999999999999997654
Q ss_pred c
Q psy10074 402 T 402 (420)
Q Consensus 402 ~ 402 (420)
.
T Consensus 75 H 75 (100)
T PF12756_consen 75 H 75 (100)
T ss_dssp T
T ss_pred C
Confidence 3
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.28 E-value=0.00021 Score=37.69 Aligned_cols=23 Identities=26% Similarity=0.615 Sum_probs=10.5
Q ss_pred eeccccCccccChhHHHHHHHHh
Q psy10074 172 YVCFTCDYHSFCSSHMKDHIRVH 194 (420)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h 194 (420)
|.|++|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555443
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.17 E-value=0.00018 Score=39.34 Aligned_cols=20 Identities=25% Similarity=0.439 Sum_probs=11.0
Q ss_pred cccCcCccccCCHHHHHccc
Q psy10074 378 FPCPFCEYKAKRKFELKSHM 397 (420)
Q Consensus 378 ~~C~~C~~~f~~~~~l~~H~ 397 (420)
|+|..|++.|.+...|..|+
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~ 21 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHK 21 (27)
T ss_dssp EEETTTTEEESSHHHHHHHH
T ss_pred CCCCccCCccCChhHHHHHh
Confidence 45555555555555555555
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.02 E-value=0.00029 Score=61.44 Aligned_cols=55 Identities=25% Similarity=0.518 Sum_probs=44.3
Q ss_pred CCcceecCc--CCccccChhhHHHHHHhhc------------------CCCCCcccCcCccccCCHHHHHccccc
Q psy10074 345 RKRVHECSE--CGFTFFLEKTLRAHIRETH------------------RKETPFPCPFCEYKAKRKFELKSHMKR 399 (420)
Q Consensus 345 ~~~~~~C~~--C~~~f~~~~~L~~H~~~~H------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~ 399 (420)
++|||+|++ |.+.+.....|+-|+.--| ...+||.|++|++.|.+...|+.|+..
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 358888877 8888888888888877444 124789999999999999999999743
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.92 E-value=0.00075 Score=36.84 Aligned_cols=23 Identities=26% Similarity=0.518 Sum_probs=12.4
Q ss_pred eecccccccccCHHHHHHHHhhc
Q psy10074 200 YMCIYCAYRCTQKSDLKKHLRKK 222 (420)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~~ 222 (420)
|.|..|++.|.+...|..|++.+
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 45555555555555555555444
No 38
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.87 E-value=0.00034 Score=61.06 Aligned_cols=72 Identities=17% Similarity=0.344 Sum_probs=48.7
Q ss_pred CCCCccCCc--cchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCccccCh
Q psy10074 284 GEKPFHCHY--CDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGFTFFLE 361 (420)
Q Consensus 284 ~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 361 (420)
+++||+|++ |.+.+++...|+.|+..-|......+.| -+. .|.......|||.|++|++.+...
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p--~p~------------~~~~F~~~~KPYrCevC~KRYKNl 411 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP--SPE------------KMNIFSAKDKPYRCEVCDKRYKNL 411 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCC--Ccc------------ccccccccCCceeccccchhhccC
Confidence 358999987 9999999999999987777553322222 111 111123456788888888888888
Q ss_pred hhHHHHHH
Q psy10074 362 KTLRAHIR 369 (420)
Q Consensus 362 ~~L~~H~~ 369 (420)
..|+-|..
T Consensus 412 NGLKYHr~ 419 (423)
T COG5189 412 NGLKYHRK 419 (423)
T ss_pred ccceeccc
Confidence 88877744
No 39
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.22 E-value=0.0013 Score=40.80 Aligned_cols=28 Identities=25% Similarity=0.498 Sum_probs=13.5
Q ss_pred CcceecCcCCccccChhhHHHHHHhhcC
Q psy10074 346 KRVHECSECGFTFFLEKTLRAHIRETHR 373 (420)
Q Consensus 346 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 373 (420)
+.|-.||+|+..+.+...|++|+...|.
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhc
Confidence 3455666666666666666666654443
No 40
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.20 E-value=0.0038 Score=32.87 Aligned_cols=23 Identities=48% Similarity=1.068 Sum_probs=9.9
Q ss_pred eecccccccccCHHHHHHHHhhcc
Q psy10074 200 YMCIYCAYRCTQKSDLKKHLRKKH 223 (420)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~~h 223 (420)
|+|+.|++... +..|..|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 34455554444 445555544433
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.11 E-value=0.0037 Score=55.41 Aligned_cols=137 Identities=24% Similarity=0.469 Sum_probs=87.0
Q ss_pred ccccc--chhhccccccccchhhhhccCCCCccccccccc-ccchhhhhhhhhhhhhhhhhhhcccccccccCCCCccc-
Q psy10074 50 KYVCY--ACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQ-YCLICQYCNQYFVRSTQDILQHGKVCNFAPRSDANFNF- 125 (420)
Q Consensus 50 ~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~-~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~- 125 (420)
.|.|+ .|..+......|..|.+..++ .+.|..|-+. ..|. |.....+.. .|+.|... ...+..|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~---~E~~lF~~~-~Lr~H~~~------G~~e~GFK 218 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFW---NEIRLFRSS-TLRDHKNG------GLEEEGFK 218 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCc---cceeeeecc-cccccccC------CccccCcC
Confidence 48886 488888888899999886553 3444332211 1111 112223344 67777655 3323222
Q ss_pred ---ccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCc-------cccChhHHHHHHHHhc
Q psy10074 126 ---VCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDY-------HSFCSSHMKDHIRVHL 195 (420)
Q Consensus 126 ---~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~ 195 (420)
.|..|+..|.+. .+|..|++..|. +|.+|++ -|++...|.+|.+.-
T Consensus 219 GHP~C~FC~~~FYdD-------------DEL~~HcR~~HE----------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~- 274 (493)
T COG5236 219 GHPLCIFCKIYFYDD-------------DELRRHCRLRHE----------ACHICDMVGPIRYQYFKSYEDLEAHFRNA- 274 (493)
T ss_pred CCchhhhccceecCh-------------HHHHHHHHhhhh----------hhhhhhccCccchhhhhCHHHHHHHhhcC-
Confidence 478888888877 899999855553 5777764 467788888887643
Q ss_pred CCCceeccc--cc----ccccCHHHHHHHHhhccCC
Q psy10074 196 GEKPYMCIY--CA----YRCTQKSDLKKHLRKKHEV 225 (420)
Q Consensus 196 ~~~~~~C~~--C~----~~f~~~~~l~~H~~~~h~~ 225 (420)
-|.|.+ |. ..|....+|..|+...|+.
T Consensus 275 ---hy~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~ 307 (493)
T COG5236 275 ---HYCCTFQTCRVGKCYVFPYHTELLEHLTRFHKV 307 (493)
T ss_pred ---ceEEEEEEEecCcEEEeccHHHHHHHHHHHhhc
Confidence 267755 42 4688889999998877754
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.04 E-value=0.0061 Score=32.48 Aligned_cols=23 Identities=35% Similarity=0.642 Sum_probs=14.7
Q ss_pred eeccccCccccChhHHHHHHHHh
Q psy10074 172 YVCFTCDYHSFCSSHMKDHIRVH 194 (420)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h 194 (420)
|.|+.|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 35666666666666666666654
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.03 E-value=0.0057 Score=37.95 Aligned_cols=31 Identities=26% Similarity=0.415 Sum_probs=17.4
Q ss_pred cCceecCCCCCccCChHHHhhhhhhhcCCcc
Q psy10074 318 EKRYVCQLCPKQFLTSGELKYHQTTVHRKRV 348 (420)
Q Consensus 318 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 348 (420)
+.|-.|++|+..+....+|++|+...|+.+|
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 5566777777777777777777766665554
No 44
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.02 E-value=0.0063 Score=32.02 Aligned_cols=24 Identities=33% Similarity=0.736 Sum_probs=19.4
Q ss_pred eeccccCccccChhHHHHHHHHhcC
Q psy10074 172 YVCFTCDYHSFCSSHMKDHIRVHLG 196 (420)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~h~~ 196 (420)
|+|+.|++... ...|.+|++.+++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 78999999998 9999999998753
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.95 E-value=0.007 Score=32.24 Aligned_cols=20 Identities=40% Similarity=0.901 Sum_probs=9.8
Q ss_pred ecCcCCccccChhhHHHHHH
Q psy10074 350 ECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 350 ~C~~C~~~f~~~~~L~~H~~ 369 (420)
.|+.|++.|.+...|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCCcchhCCHHHHHHHHH
Confidence 44445555555555555544
No 46
>KOG2231|consensus
Probab=95.80 E-value=0.0064 Score=59.93 Aligned_cols=134 Identities=17% Similarity=0.330 Sum_probs=60.4
Q ss_pred cccchhhccccccccchhhhhccCCCCcccccccccccchhhhhhhhhhhhhhhhhhhccc-ccccccCCCCcccccccc
Q psy10074 52 VCYACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQYCLICQYCNQYFVRSTQDILQHGKV-CNFAPRSDANFNFVCILC 130 (420)
Q Consensus 52 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~~~~~C~~C~~~f~~~~~~l~~H~~~-~~~~~~h~~~~~~~C~~C 130 (420)
.|..| -.|.+...|..||..-+ +.+.|.+|-.. .-.+..+.+.|... .|..|+.. +.-..++.| -..|..|
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H--~~~~c~lC~~~-~kif~~e~k~Yt~~--el~~h~~~gd~d~~s~rG--hp~C~~C 188 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQH--KLHLCSLCLQN-LKIFINERKLYTRA--ELNLHLMFGDPDDESCRG--HPLCKFC 188 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhh--hhhcccccccc-ceeeeeeeehehHH--HHHHHHhcCCCccccccC--Cccchhh
Confidence 45555 55557777777774322 22333332111 00112334444443 34455433 110011111 1235555
Q ss_pred cCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceecccc------CccccChhHHHHHHHHhcCCCceecc-
Q psy10074 131 NYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTC------DYHSFCSSHMKDHIRVHLGEKPYMCI- 203 (420)
Q Consensus 131 ~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C------~~~f~~~~~l~~H~~~h~~~~~~~C~- 203 (420)
...|-.. ..|..|+...| |.|..| +.-|.....|..|.+.++ |.|.
T Consensus 189 ~~~fld~-------------~el~rH~~~~h----------~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~ 241 (669)
T KOG2231|consen 189 HERFLDD-------------DELYRHLRFDH----------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEE 241 (669)
T ss_pred hhhhccH-------------HHHHHhhccce----------eheeecCcccccchhcccchHHHHHhhhcC----ccccc
Confidence 5555444 66666663333 344444 344566777777776654 5565
Q ss_pred -ccc-ccccCHHHHHHHHh
Q psy10074 204 -YCA-YRCTQKSDLKKHLR 220 (420)
Q Consensus 204 -~C~-~~f~~~~~l~~H~~ 220 (420)
.|. +.|.....+..+++
T Consensus 242 ~~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 242 EFCRTKKFYVAFELEIELK 260 (669)
T ss_pred cccccceeeehhHHHHHHH
Confidence 453 22333334444444
No 47
>KOG2482|consensus
Probab=95.73 E-value=0.015 Score=51.61 Aligned_cols=55 Identities=20% Similarity=0.298 Sum_probs=39.0
Q ss_pred ccccccccCCchhhhhhcccccccccCCCC----------------CcceeccccCccccChhHHHHHHHH
Q psy10074 139 CKYCKTKLDNNTEVLLQHCKLCVNVIRPDV----------------SYRYVCFTCDYHSFCSSHMKDHIRV 193 (420)
Q Consensus 139 c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~----------------~~~~~C~~C~~~f~~~~~l~~H~~~ 193 (420)
|-.|+..+.++++.+..|+-..|......+ -..+.|-.|.+.|+.+..|+.||+.
T Consensus 147 ClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 147 CLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred EEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHh
Confidence 555555555566888888745554433222 2258999999999999999999986
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.69 E-value=0.0066 Score=32.28 Aligned_cols=20 Identities=20% Similarity=0.534 Sum_probs=9.9
Q ss_pred eccccCccccChhHHHHHHH
Q psy10074 173 VCFTCDYHSFCSSHMKDHIR 192 (420)
Q Consensus 173 ~C~~C~~~f~~~~~l~~H~~ 192 (420)
.|.+|++.|.+...|+.|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 44555555555555555543
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.68 E-value=0.009 Score=31.75 Aligned_cols=23 Identities=30% Similarity=0.732 Sum_probs=20.9
Q ss_pred eecccccccccCHHHHHHHHhhc
Q psy10074 200 YMCIYCAYRCTQKSDLKKHLRKK 222 (420)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~~~ 222 (420)
|.|.+|+..|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 68999999999999999999753
No 50
>KOG2482|consensus
Probab=95.57 E-value=0.019 Score=51.04 Aligned_cols=155 Identities=19% Similarity=0.271 Sum_probs=93.0
Q ss_pred HHHHHhhhhcCccccccccccccc-CChHHHHhchhhcCCC---------------------CCCccCCccchhccChhH
Q psy10074 245 MKRHVERIHLNITKTRCELCDLRF-FDKVGLKMHVMRKHTG---------------------EKPFHCHYCDYICRTKYD 302 (420)
Q Consensus 245 l~~h~~~~~~~~~~~~C~~C~~~f-~~~~~l~~H~~~~H~~---------------------~~~~~C~~C~~~f~~~~~ 302 (420)
|.+++...........|-.|...+ .+++.+..|+...|.- -..+.|-+|.+.|+.+..
T Consensus 131 LeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdknt 210 (423)
T KOG2482|consen 131 LEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNT 210 (423)
T ss_pred HHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHH
Confidence 444443333344457899998765 4567788887777731 124789999999999999
Q ss_pred Hhhhhhcc-cCCcccccCce----ec--CCCCCccCCh-HHHh-----hhhhhh--------cCCcc--eecCcCCcccc
Q psy10074 303 LSSHIGRK-HTDKEVLEKRY----VC--QLCPKQFLTS-GELK-----YHQTTV--------HRKRV--HECSECGFTFF 359 (420)
Q Consensus 303 l~~H~~~~-H~~~~~~~~~~----~C--~~C~~~f~~~-~~l~-----~H~~~h--------~~~~~--~~C~~C~~~f~ 359 (420)
|+.||+.. |...+-..+-| .= ..=+++.+.. +.+. .+.... .+..+ .+|-.|.....
T Consensus 211 LkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~e 290 (423)
T KOG2482|consen 211 LKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYE 290 (423)
T ss_pred HHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchh
Confidence 99997553 33321111111 00 0011111111 1110 000000 01122 68999999999
Q ss_pred ChhhHHHHHHhhcCC--------------------------CCCcccCcCccccCCHHHHHccccc
Q psy10074 360 LEKTLRAHIRETHRK--------------------------ETPFPCPFCEYKAKRKFELKSHMKR 399 (420)
Q Consensus 360 ~~~~L~~H~~~~H~~--------------------------~~~~~C~~C~~~f~~~~~l~~H~~~ 399 (420)
+...|..||..+|.= ...-.|-.|.-+|.....|..||..
T Consensus 291 n~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 291 NPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred hHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 999999999988831 1124488899999999999999953
No 51
>KOG2785|consensus
Probab=95.49 E-value=0.045 Score=49.74 Aligned_cols=53 Identities=21% Similarity=0.283 Sum_probs=39.0
Q ss_pred ceecCcCCccccChhhHHHHHHhhcCCC----------------------CCcccCcCc---cccCCHHHHHcccccc
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHRKE----------------------TPFPCPFCE---YKAKRKFELKSHMKRK 400 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~----------------------~~~~C~~C~---~~f~~~~~l~~H~~~~ 400 (420)
|-.|-.|+..+.+...-..||...|+-. .-|.|-.|+ +.|.+..+++.||+.+
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K 243 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDK 243 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhc
Confidence 4567777777777777777777666432 227788888 8888888888888653
No 52
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.20 E-value=0.017 Score=31.41 Aligned_cols=22 Identities=36% Similarity=0.614 Sum_probs=18.6
Q ss_pred cccCcCccccCCHHHHHccccc
Q psy10074 378 FPCPFCEYKAKRKFELKSHMKR 399 (420)
Q Consensus 378 ~~C~~C~~~f~~~~~l~~H~~~ 399 (420)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6789999999999999998853
No 53
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.95 E-value=0.018 Score=31.26 Aligned_cols=22 Identities=32% Similarity=0.732 Sum_probs=14.7
Q ss_pred eecCCCCCccCChHHHhhhhhh
Q psy10074 321 YVCQLCPKQFLTSGELKYHQTT 342 (420)
Q Consensus 321 ~~C~~C~~~f~~~~~l~~H~~~ 342 (420)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5667777777777777666654
No 54
>PRK04860 hypothetical protein; Provisional
Probab=94.55 E-value=0.029 Score=45.58 Aligned_cols=36 Identities=28% Similarity=0.458 Sum_probs=17.9
Q ss_pred ceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccC
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAK 388 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~ 388 (420)
+|.|. |+. ....+.+|.+ .|++.++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~r-i~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNR-VVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHH-HhcCCccEECCCCCceeE
Confidence 35554 544 3344455555 555555555555554443
No 55
>KOG2231|consensus
Probab=94.51 E-value=0.051 Score=53.82 Aligned_cols=112 Identities=29% Similarity=0.502 Sum_probs=66.2
Q ss_pred cccccccccCChHHHHhchhhcCCCCCCccCCccch---------hccChhHHhhhhhcccCCcccccCceecCCCCCcc
Q psy10074 260 RCELCDLRFFDKVGLKMHVMRKHTGEKPFHCHYCDY---------ICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQF 330 (420)
Q Consensus 260 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~---------~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f 330 (420)
.|..| ..|.+...|+.|+...|.. +.|..|-. ..-+...|..|+..--.+...-..--.|..|...|
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~---~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~f 192 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKL---HLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERF 192 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhh---hccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhh
Confidence 36666 6677777777776555643 33333321 11235566677543333211112224678888888
Q ss_pred CChHHHhhhhhhhcCCcceecCcC------CccccChhhHHHHHHhhcCCCCCcccC--cCc
Q psy10074 331 LTSGELKYHQTTVHRKRVHECSEC------GFTFFLEKTLRAHIRETHRKETPFPCP--FCE 384 (420)
Q Consensus 331 ~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~~H~~~~~~~C~--~C~ 384 (420)
.+...|..|++.++ |.|-.| +.-|..-..|..|.+..| |.|. -|.
T Consensus 193 ld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~~~C~ 245 (669)
T KOG2231|consen 193 LDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEEEFCR 245 (669)
T ss_pred ccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC-----ccccccccc
Confidence 88888888887765 444444 456777788888888666 5666 454
No 56
>PRK04860 hypothetical protein; Provisional
Probab=94.47 E-value=0.034 Score=45.15 Aligned_cols=39 Identities=31% Similarity=0.557 Sum_probs=34.6
Q ss_pred cceeccccCccccChhHHHHHHHHhcCCCceecccccccccCH
Q psy10074 170 YRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYRCTQK 212 (420)
Q Consensus 170 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 212 (420)
.+|.|. |+. ....+++|.++|.++.+|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 479998 987 788999999999999999999999988643
No 57
>KOG4173|consensus
Probab=94.20 E-value=0.012 Score=48.36 Aligned_cols=79 Identities=23% Similarity=0.452 Sum_probs=50.9
Q ss_pred CCccCCc--cchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhc----------CCcceecC-
Q psy10074 286 KPFHCHY--CDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVH----------RKRVHECS- 352 (420)
Q Consensus 286 ~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~----------~~~~~~C~- 352 (420)
..+.|+. |...|.+......|....|.+ .|..|.+.|.+...|..|+...| |...|+|-
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~--------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~Clv 149 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN--------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLV 149 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc--------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHH
Confidence 3466765 667777777777776555554 57777777777777777776544 33446663
Q ss_pred -cCCccccChhhHHHHHHhhc
Q psy10074 353 -ECGFTFFLEKTLRAHIRETH 372 (420)
Q Consensus 353 -~C~~~f~~~~~L~~H~~~~H 372 (420)
.|+..|.+...-..|+...|
T Consensus 150 EgCt~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 150 EGCTEKFKTSRDRKDHMIRMH 170 (253)
T ss_pred HhhhhhhhhhhhhhhHHHHhc
Confidence 36666666666666665555
No 58
>KOG2785|consensus
Probab=93.58 E-value=0.13 Score=46.92 Aligned_cols=25 Identities=12% Similarity=0.281 Sum_probs=21.3
Q ss_pred CcceeccccCccccChhHHHHHHHH
Q psy10074 169 SYRYVCFTCDYHSFCSSHMKDHIRV 193 (420)
Q Consensus 169 ~~~~~C~~C~~~f~~~~~l~~H~~~ 193 (420)
..++.|.+|.+.|.+......|++.
T Consensus 66 ~~~~~c~~c~k~~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 66 ESVVYCEACNKSFASPKAHENHLKS 90 (390)
T ss_pred ccceehHHhhccccChhhHHHHHHH
Confidence 3378999999999999999999875
No 59
>KOG2893|consensus
Probab=92.68 E-value=0.053 Score=45.67 Aligned_cols=46 Identities=20% Similarity=0.584 Sum_probs=39.7
Q ss_pred cccchhhhccChHHHHHhcccccchhcccccCCcccccchhhccccccccchh-hhhcc
Q psy10074 17 CCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYATYYNGNMKKH-LRIHL 74 (420)
Q Consensus 17 C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H-~~~h~ 74 (420)
|.+|++.|.+ +.-|.+| .+- +-|+|-+|.+...+--.|..| |++|.
T Consensus 13 cwycnrefdd-ekiliqh-------qka----khfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 13 CWYCNREFDD-EKILIQH-------QKA----KHFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeecccccch-hhhhhhh-------hhh----ccceeeeehhhhccCCCceeehhhhhh
Confidence 9999999999 9999999 432 349999999999999999999 77774
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.59 E-value=0.072 Score=51.62 Aligned_cols=61 Identities=25% Similarity=0.341 Sum_probs=44.5
Q ss_pred ccccccchhhhccChHHHHHhcccccchhc--ccccC--Cccccc--chhhccccccccchhhhhccCCCCcccc
Q psy10074 14 DIKCCHCNKYISSDLRDLLEHCRLCPAMTR--TDLLK--HKYVCY--ACSYATYYNGNMKKHLRIHLGDRPFMCT 82 (420)
Q Consensus 14 ~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~--~~~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~c~ 82 (420)
++.|..|...|.. ...|..| .+ .|.++ +++.|+ .|++.|.....+..|...|.+..+..+.
T Consensus 289 ~~~~~~~~~~~s~-~~~l~~~-------~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (467)
T COG5048 289 PIKSKQCNISFSR-SSPLTRH-------LRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEK 355 (467)
T ss_pred CCCCccccCCccc-ccccccc-------ccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccc
Confidence 5777777777777 7777777 66 57777 777777 6777777777777777777776665543
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.47 E-value=0.11 Score=27.53 Aligned_cols=20 Identities=30% Similarity=0.592 Sum_probs=13.2
Q ss_pred eecCcCCccccChhhHHHHHH
Q psy10074 349 HECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~ 369 (420)
..|+.||+.| ....|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3577777777 4566777755
No 62
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=92.14 E-value=0.16 Score=38.57 Aligned_cols=25 Identities=28% Similarity=0.861 Sum_probs=21.8
Q ss_pred ccc----CcCccccCCHHHHHcccccccc
Q psy10074 378 FPC----PFCEYKAKRKFELKSHMKRKHT 402 (420)
Q Consensus 378 ~~C----~~C~~~f~~~~~l~~H~~~~H~ 402 (420)
|.| +.|++...+...+.+|++..|+
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 889 8999999999999999987774
No 63
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=91.90 E-value=0.48 Score=42.51 Aligned_cols=139 Identities=17% Similarity=0.223 Sum_probs=78.8
Q ss_pred hhhhhhhhhhhhhhhhhcccccccccCCCCcccccccccC---CCCCccccccccccCCchhhhhhcccccccccCCCCC
Q psy10074 93 QYCNQYFVRSTQDILQHGKVCNFAPRSDANFNFVCILCNY---HIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVS 169 (420)
Q Consensus 93 ~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~~C~~C~~---~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~ 169 (420)
..|..+..... .|..|.+.. .+ .+.|.+|-. .|... ..+... ..|..|. ....... .-.
T Consensus 157 skc~~~C~~~k-~lk~H~K~~------H~--~~~C~~C~~nKk~F~~E------~~lF~~-~~Lr~H~-~~G~~e~-GFK 218 (493)
T COG5236 157 SKCHRRCGSLK-ELKKHYKAQ------HG--FVLCSECIGNKKDFWNE------IRLFRS-STLRDHK-NGGLEEE-GFK 218 (493)
T ss_pred hhhhhhhhhHH-HHHHHHHhh------cC--cEEhHhhhcCcccCccc------eeeeec-ccccccc-cCCcccc-CcC
Confidence 56777666666 888898773 22 255666643 33322 111111 6677776 4443332 001
Q ss_pred cceeccccCccccChhHHHHHHHHhcCCCceecccccc-------cccCHHHHHHHHhhccCCCcceeceeecccccChH
Q psy10074 170 YRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAY-------RCTQKSDLKKHLRKKHEVKHCFVNIVMNSYGTNIK 242 (420)
Q Consensus 170 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~~h~~~~~~c~~c~~~~~~~~~ 242 (420)
.--.|..|...|-+...|..|++..+. .|.+|+. -|.+..+|..|.+..|-.-.++--.-..-..|...
T Consensus 219 GHP~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~ct~qtc~~~k~~vf~~~ 294 (493)
T COG5236 219 GHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCCTFQTCRVGKCYVFPYH 294 (493)
T ss_pred CCchhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCceEEEEEEEecCcEEEeccH
Confidence 123699999999999999999987543 5777764 46667778888765542211111111112345555
Q ss_pred HHHHHHHhhhh
Q psy10074 243 KCMKRHVERIH 253 (420)
Q Consensus 243 ~~l~~h~~~~~ 253 (420)
.+|..|+...|
T Consensus 295 ~el~~h~~~~h 305 (493)
T COG5236 295 TELLEHLTRFH 305 (493)
T ss_pred HHHHHHHHHHh
Confidence 55666654444
No 64
>KOG4173|consensus
Probab=91.82 E-value=0.07 Score=43.96 Aligned_cols=84 Identities=26% Similarity=0.413 Sum_probs=58.3
Q ss_pred ccccccc--cccccCChHHHHhchhhcCCCCCCccCCccchhccChhHHhhhhhcccCC-----cccccCceecCC--CC
Q psy10074 257 TKTRCEL--CDLRFFDKVGLKMHVMRKHTGEKPFHCHYCDYICRTKYDLSSHIGRKHTD-----KEVLEKRYVCQL--CP 327 (420)
Q Consensus 257 ~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~-----~~~~~~~~~C~~--C~ 327 (420)
..+.|++ |...|.+...+..|.-..|.+ .|..|.+.|.+...|..||...|.. .+.+...|.|-+ |+
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 3456754 667888777777775455544 4888888888888888887777643 222255678854 88
Q ss_pred CccCChHHHhhhhhhhc
Q psy10074 328 KQFLTSGELKYHQTTVH 344 (420)
Q Consensus 328 ~~f~~~~~l~~H~~~h~ 344 (420)
..|.+...-..|+..-|
T Consensus 154 ~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhhhHHHHhc
Confidence 88888888888876443
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.30 E-value=0.19 Score=26.62 Aligned_cols=18 Identities=17% Similarity=0.379 Sum_probs=8.6
Q ss_pred eccccCccccChhHHHHHH
Q psy10074 173 VCFTCDYHSFCSSHMKDHI 191 (420)
Q Consensus 173 ~C~~C~~~f~~~~~l~~H~ 191 (420)
.|+.|+..| ..+.|..|+
T Consensus 4 ~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred cCCCCCCEE-CHHHHHHHH
Confidence 445555554 444444444
No 66
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=90.53 E-value=0.19 Score=28.77 Aligned_cols=10 Identities=40% Similarity=1.175 Sum_probs=4.9
Q ss_pred eecCCCCCcc
Q psy10074 321 YVCQLCPKQF 330 (420)
Q Consensus 321 ~~C~~C~~~f 330 (420)
|.|.+||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 4455555444
No 67
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.78 E-value=0.24 Score=28.61 Aligned_cols=22 Identities=32% Similarity=0.563 Sum_probs=13.0
Q ss_pred ceecCcCCccccChhhHHHHHH
Q psy10074 348 VHECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~ 369 (420)
+|.|++|+..|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3556666666666666666544
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.69 E-value=0.28 Score=28.28 Aligned_cols=23 Identities=22% Similarity=0.504 Sum_probs=18.2
Q ss_pred ceecCCCCCccCChHHHhhhhhh
Q psy10074 320 RYVCQLCPKQFLTSGELKYHQTT 342 (420)
Q Consensus 320 ~~~C~~C~~~f~~~~~l~~H~~~ 342 (420)
+|.|++|+..|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 47788888888888888888764
No 69
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.10 E-value=0.66 Score=34.85 Aligned_cols=21 Identities=19% Similarity=0.431 Sum_probs=13.4
Q ss_pred ccccccccccccCChHHHHhc
Q psy10074 257 TKTRCELCDLRFFDKVGLKMH 277 (420)
Q Consensus 257 ~~~~C~~C~~~f~~~~~l~~H 277 (420)
-|..|++||-+..+...|.+-
T Consensus 14 LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCcCCcCCCEEeccchHHHh
Confidence 355677777777666665544
No 70
>KOG2893|consensus
Probab=89.06 E-value=0.13 Score=43.44 Aligned_cols=46 Identities=33% Similarity=0.474 Sum_probs=30.0
Q ss_pred cCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHHHccccccc
Q psy10074 351 CSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFELKSHMKRKH 401 (420)
Q Consensus 351 C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 401 (420)
|=+|++.|.....|.+|++..| |+|.+|.++.-+--.|..|--.+|
T Consensus 13 cwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceeehhhhh
Confidence 5566777777777777766433 777777766666555666655555
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.40 E-value=0.13 Score=32.50 Aligned_cols=26 Identities=23% Similarity=0.545 Sum_probs=12.3
Q ss_pred CcccCcCccccCCHHHHHcccccccc
Q psy10074 377 PFPCPFCEYKAKRKFELKSHMKRKHT 402 (420)
Q Consensus 377 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 402 (420)
.+.||.|+..|...-...+|+...|+
T Consensus 17 ~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 17 FLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred eeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 34444444444444444444444443
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.59 E-value=0.3 Score=30.95 Aligned_cols=30 Identities=23% Similarity=0.556 Sum_probs=22.5
Q ss_pred cCCcceecCcCCccccChhhHHHHHHhhcC
Q psy10074 344 HRKRVHECSECGFTFFLEKTLRAHIRETHR 373 (420)
Q Consensus 344 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~ 373 (420)
.|+..+.||.|+..|....+..+|+...|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 356667888888888888888888776663
No 73
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=86.90 E-value=1.3 Score=33.65 Aligned_cols=27 Identities=33% Similarity=0.630 Sum_probs=24.0
Q ss_pred cceec----CcCCccccChhhHHHHHHhhcC
Q psy10074 347 RVHEC----SECGFTFFLEKTLRAHIRETHR 373 (420)
Q Consensus 347 ~~~~C----~~C~~~f~~~~~L~~H~~~~H~ 373 (420)
..|.| +.|++.+.+...+..|++..|+
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 45999 9999999999999999997774
No 74
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=86.73 E-value=0.25 Score=47.79 Aligned_cols=64 Identities=23% Similarity=0.427 Sum_probs=56.4
Q ss_pred cceeccccCccccChhHHHHHHH--HhcCC--Cceecc--cccccccCHHHHHHHHhhccCCCcceecee
Q psy10074 170 YRYVCFTCDYHSFCSSHMKDHIR--VHLGE--KPYMCI--YCAYRCTQKSDLKKHLRKKHEVKHCFVNIV 233 (420)
Q Consensus 170 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~h~~~~~~c~~c 233 (420)
..+.|..|...|.....+..|.+ .|.++ .++.|+ .|++.|.....+..|...+....+..+...
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLL 357 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccc
Confidence 36899999999999999999999 89999 899999 799999999999999998888776555443
No 75
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=84.91 E-value=1.3 Score=33.33 Aligned_cols=24 Identities=29% Similarity=0.520 Sum_probs=17.9
Q ss_pred ceeccccCccccChhHHHHHHHHh
Q psy10074 171 RYVCFTCDYHSFCSSHMKDHIRVH 194 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~H~~~h 194 (420)
.|+|+.|...|--.-+.-.|...|
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ceeCCCCCCccccccchhhhhhcc
Confidence 678888888887777777776665
No 76
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=83.26 E-value=0.85 Score=26.25 Aligned_cols=10 Identities=30% Similarity=0.992 Sum_probs=4.5
Q ss_pred eecCcCCccc
Q psy10074 349 HECSECGFTF 358 (420)
Q Consensus 349 ~~C~~C~~~f 358 (420)
|+|..||..+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 77
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=82.06 E-value=1.1 Score=26.48 Aligned_cols=8 Identities=38% Similarity=1.227 Sum_probs=3.1
Q ss_pred cCcCCccc
Q psy10074 351 CSECGFTF 358 (420)
Q Consensus 351 C~~C~~~f 358 (420)
||.|+..|
T Consensus 5 CP~C~~~~ 12 (38)
T TIGR02098 5 CPNCKTSF 12 (38)
T ss_pred CCCCCCEE
Confidence 33333333
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=81.71 E-value=0.52 Score=40.65 Aligned_cols=13 Identities=38% Similarity=0.997 Sum_probs=9.5
Q ss_pred eecCcCCccccCh
Q psy10074 349 HECSECGFTFFLE 361 (420)
Q Consensus 349 ~~C~~C~~~f~~~ 361 (420)
..||.||.+|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 6788888877643
No 79
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=81.39 E-value=3.5 Score=36.64 Aligned_cols=96 Identities=18% Similarity=0.268 Sum_probs=60.5
Q ss_pred CCccccccccc---ccchhhhhhhhhhhhhhhhhhh------cccccccccCCCCcccccccccCCCCCccccccccccC
Q psy10074 77 RPFMCTLLICQ---YCLICQYCNQYFVRSTQDILQH------GKVCNFAPRSDANFNFVCILCNYHIPCKWCKYCKTKLD 147 (420)
Q Consensus 77 ~~~~c~~c~~~---~~~~C~~C~~~f~~~~~~l~~H------~~~~~~~~~h~~~~~~~C~~C~~~f~~~~c~~~~~~~~ 147 (420)
..|.|+.|... -|..|+.|+....... .|.+- ++...+.|...+.+.-.|..|.-.|+..
T Consensus 307 gGy~CP~CktkVCsLPi~CP~Csl~Lilst-hLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~---------- 375 (421)
T COG5151 307 GGYECPVCKTKVCSLPISCPICSLQLILST-HLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKP---------- 375 (421)
T ss_pred CceeCCcccceeecCCccCcchhHHHHHHH-HHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCC----------
Confidence 34677777665 6778899987764444 33321 1111222323334445588888888776
Q ss_pred CchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHh
Q psy10074 148 NNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVH 194 (420)
Q Consensus 148 ~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h 194 (420)
.++..........|+|+.|...|-..-..-.|...|
T Consensus 376 -----------~~~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 376 -----------PVSPFDESTSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred -----------CCCcccccccccceechhhhhhhhhhhHHHHHHHHh
Confidence 444444434445799999999998888888887766
No 80
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=81.11 E-value=0.32 Score=41.90 Aligned_cols=12 Identities=42% Similarity=1.065 Sum_probs=8.1
Q ss_pred cccCcCccccCC
Q psy10074 378 FPCPFCEYKAKR 389 (420)
Q Consensus 378 ~~C~~C~~~f~~ 389 (420)
..||.||.+|..
T Consensus 49 ~vCP~CgyA~~~ 60 (214)
T PF09986_consen 49 WVCPHCGYAAFE 60 (214)
T ss_pred EECCCCCCcccc
Confidence 457777777664
No 81
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=80.60 E-value=0.87 Score=34.25 Aligned_cols=14 Identities=29% Similarity=0.562 Sum_probs=6.6
Q ss_pred cceecCcCCccccC
Q psy10074 347 RVHECSECGFTFFL 360 (420)
Q Consensus 347 ~~~~C~~C~~~f~~ 360 (420)
.|..||.||..|.-
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 34445555554443
No 82
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=79.35 E-value=1.3 Score=28.70 Aligned_cols=8 Identities=50% Similarity=1.593 Sum_probs=4.3
Q ss_pred CcccCcCc
Q psy10074 377 PFPCPFCE 384 (420)
Q Consensus 377 ~~~C~~C~ 384 (420)
+|.|+.||
T Consensus 50 ~Y~Cp~CG 57 (61)
T COG2888 50 PYRCPKCG 57 (61)
T ss_pred ceECCCcC
Confidence 45555555
No 83
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=77.77 E-value=1.3 Score=23.64 Aligned_cols=11 Identities=27% Similarity=0.561 Sum_probs=8.3
Q ss_pred eeccccCcccc
Q psy10074 172 YVCFTCDYHSF 182 (420)
Q Consensus 172 ~~C~~C~~~f~ 182 (420)
-.|+.||..|.
T Consensus 15 ~~Cp~CG~~F~ 25 (26)
T PF10571_consen 15 KFCPHCGYDFE 25 (26)
T ss_pred CcCCCCCCCCc
Confidence 46888888774
No 84
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=76.94 E-value=2 Score=25.20 Aligned_cols=6 Identities=50% Similarity=1.420 Sum_probs=2.2
Q ss_pred ccCcCc
Q psy10074 379 PCPFCE 384 (420)
Q Consensus 379 ~C~~C~ 384 (420)
.|+.|+
T Consensus 27 rC~~C~ 32 (37)
T PF13719_consen 27 RCPKCG 32 (37)
T ss_pred ECCCCC
Confidence 333333
No 85
>PHA00626 hypothetical protein
Probab=74.85 E-value=1.7 Score=27.83 Aligned_cols=12 Identities=42% Similarity=1.107 Sum_probs=8.1
Q ss_pred ceeccccCcccc
Q psy10074 171 RYVCFTCDYHSF 182 (420)
Q Consensus 171 ~~~C~~C~~~f~ 182 (420)
.|+|+.|++.|.
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 577777776664
No 86
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=74.80 E-value=2.7 Score=34.25 Aligned_cols=16 Identities=13% Similarity=0.391 Sum_probs=7.9
Q ss_pred eecCCCCCccCChHHH
Q psy10074 321 YVCQLCPKQFLTSGEL 336 (420)
Q Consensus 321 ~~C~~C~~~f~~~~~l 336 (420)
|.|+.|+..|+...++
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 4455555555444444
No 87
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=73.79 E-value=1.3 Score=26.55 Aligned_cols=12 Identities=17% Similarity=0.603 Sum_probs=6.7
Q ss_pred cccccchhhhcc
Q psy10074 15 IKCCHCNKYISS 26 (420)
Q Consensus 15 ~~C~~C~~~f~~ 26 (420)
|+|+.|+..|.-
T Consensus 6 y~C~~Cg~~fe~ 17 (41)
T smart00834 6 YRCEDCGHTFEV 17 (41)
T ss_pred EEcCCCCCEEEE
Confidence 455555555554
No 88
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=73.54 E-value=1.2 Score=27.34 Aligned_cols=23 Identities=22% Similarity=0.598 Sum_probs=9.8
Q ss_pred cceecCcCCccccCh----hhHHHHHH
Q psy10074 347 RVHECSECGFTFFLE----KTLRAHIR 369 (420)
Q Consensus 347 ~~~~C~~C~~~f~~~----~~L~~H~~ 369 (420)
...+|..|++.+... +.|..|++
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhhh
Confidence 334566666555442 45555553
No 89
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=72.88 E-value=1.8 Score=32.58 Aligned_cols=30 Identities=27% Similarity=0.731 Sum_probs=21.7
Q ss_pred eecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCH
Q psy10074 349 HECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRK 390 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~ 390 (420)
..|+.||..|.... ..|..||+||..|.-.
T Consensus 10 R~Cp~CG~kFYDLn------------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDLN------------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccCC------------CCCccCCCCCCccCcc
Confidence 47888888887642 2577788888888755
No 90
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=72.57 E-value=1.4 Score=26.75 Aligned_cols=12 Identities=17% Similarity=0.581 Sum_probs=6.0
Q ss_pred cccccchhhhcc
Q psy10074 15 IKCCHCNKYISS 26 (420)
Q Consensus 15 ~~C~~C~~~f~~ 26 (420)
|+|..||..|.-
T Consensus 6 y~C~~Cg~~fe~ 17 (42)
T PF09723_consen 6 YRCEECGHEFEV 17 (42)
T ss_pred EEeCCCCCEEEE
Confidence 445555555444
No 91
>PF12907 zf-met2: Zinc-binding
Probab=72.54 E-value=1.4 Score=26.39 Aligned_cols=25 Identities=28% Similarity=0.474 Sum_probs=11.3
Q ss_pred ccCcCccccC---CHHHHHccccccccC
Q psy10074 379 PCPFCEYKAK---RKFELKSHMKRKHTF 403 (420)
Q Consensus 379 ~C~~C~~~f~---~~~~l~~H~~~~H~~ 403 (420)
+|.+|..+|. +...|..|....|+.
T Consensus 3 ~C~iC~qtF~~t~~~~~L~eH~enKHpK 30 (40)
T PF12907_consen 3 ICKICRQTFMQTTNEPQLKEHAENKHPK 30 (40)
T ss_pred CcHHhhHHHHhcCCHHHHHHHHHccCCC
Confidence 4555553332 234455555555544
No 92
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=72.03 E-value=4.3 Score=32.65 Aligned_cols=13 Identities=23% Similarity=0.695 Sum_probs=6.2
Q ss_pred ceecCCCCCccCC
Q psy10074 320 RYVCQLCPKQFLT 332 (420)
Q Consensus 320 ~~~C~~C~~~f~~ 332 (420)
.|.|+.|+..|+.
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3445555544443
No 93
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=71.84 E-value=1.9 Score=28.08 Aligned_cols=10 Identities=40% Similarity=1.494 Sum_probs=5.3
Q ss_pred CCcccCcCcc
Q psy10074 376 TPFPCPFCEY 385 (420)
Q Consensus 376 ~~~~C~~C~~ 385 (420)
.+|.|+.||+
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 3455555553
No 94
>KOG2186|consensus
Probab=71.27 E-value=1.8 Score=37.37 Aligned_cols=46 Identities=15% Similarity=0.322 Sum_probs=33.2
Q ss_pred cccccchhhhccChHHHHHhcccccchhcccccCCcccccchhhccccccccchhhh
Q psy10074 15 IKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYATYYNGNMKKHLR 71 (420)
Q Consensus 15 ~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 71 (420)
|.|..||....- ..+.+| +..-.+ .-|.|..||..|.. .++..|..
T Consensus 4 FtCnvCgEsvKK--p~vekH-------~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK--PQVEKH-------MSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc--cchHHH-------HHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 678888877776 677788 444443 56888888888887 66667753
No 95
>PF12907 zf-met2: Zinc-binding
Probab=70.68 E-value=2.2 Score=25.47 Aligned_cols=32 Identities=34% Similarity=0.496 Sum_probs=22.6
Q ss_pred eecCcCCccc---cChhhHHHHHHhhcCCCCCccc
Q psy10074 349 HECSECGFTF---FLEKTLRAHIRETHRKETPFPC 380 (420)
Q Consensus 349 ~~C~~C~~~f---~~~~~L~~H~~~~H~~~~~~~C 380 (420)
++|.+|..+| .+...|..|..+.|....+-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 5788888544 4557799998888887654444
No 96
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=69.82 E-value=4.5 Score=21.59 Aligned_cols=19 Identities=21% Similarity=0.497 Sum_probs=11.8
Q ss_pred ecCcCCccccChhhHHHHHH
Q psy10074 350 ECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 350 ~C~~C~~~f~~~~~L~~H~~ 369 (420)
.||+|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 466676666 4556666654
No 97
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=69.55 E-value=1.8 Score=27.59 Aligned_cols=12 Identities=17% Similarity=0.642 Sum_probs=5.8
Q ss_pred cccccchhhhcc
Q psy10074 15 IKCCHCNKYISS 26 (420)
Q Consensus 15 ~~C~~C~~~f~~ 26 (420)
|+|..|+.+|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 445555544444
No 98
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=69.48 E-value=1.1 Score=36.15 Aligned_cols=23 Identities=13% Similarity=0.108 Sum_probs=18.0
Q ss_pred Ccccccchhhccccccccchhhh
Q psy10074 49 HKYVCYACSYATYYNGNMKKHLR 71 (420)
Q Consensus 49 ~~~~C~~C~~~f~~~~~l~~H~~ 71 (420)
+.++|+.||..|.+...+..=|.
T Consensus 27 ~~~~c~~c~~~f~~~e~~~~~~i 49 (154)
T PRK00464 27 RRRECLACGKRFTTFERVELVPL 49 (154)
T ss_pred eeeeccccCCcceEeEeccCccc
Confidence 34899999999998877766553
No 99
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=68.66 E-value=1.9 Score=26.75 Aligned_cols=13 Identities=23% Similarity=0.792 Sum_probs=9.5
Q ss_pred ccccccchhhhcc
Q psy10074 14 DIKCCHCNKYISS 26 (420)
Q Consensus 14 ~~~C~~C~~~f~~ 26 (420)
.|.|+.|+..|..
T Consensus 3 ~y~C~~CG~~~~~ 15 (46)
T PRK00398 3 EYKCARCGREVEL 15 (46)
T ss_pred EEECCCCCCEEEE
Confidence 4778888877766
No 100
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=68.51 E-value=3.6 Score=27.03 Aligned_cols=46 Identities=24% Similarity=0.527 Sum_probs=27.0
Q ss_pred ceeccc-cCccccChhHHHHHHHHhcCCCceeccc----ccccccCHHHHHH
Q psy10074 171 RYVCFT-CDYHSFCSSHMKDHIRVHLGEKPYMCIY----CAYRCTQKSDLKK 217 (420)
Q Consensus 171 ~~~C~~-C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~~~~l~~ 217 (420)
+..|+. |+..-.....|..|+.......+..|++ |...+. +..|..
T Consensus 9 ~v~C~~~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~-~~~l~~ 59 (60)
T PF02176_consen 9 PVPCPNGCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVP-REDLEE 59 (60)
T ss_dssp EEE-TT--S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEE-HHHHHH
T ss_pred EeeCCCCCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccc-hhHHhC
Confidence 567877 3333334668888888777778888888 888765 344443
No 101
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=68.42 E-value=4.1 Score=33.91 Aligned_cols=17 Identities=18% Similarity=0.512 Sum_probs=8.9
Q ss_pred ceecCCCCCccCChHHH
Q psy10074 320 RYVCQLCPKQFLTSGEL 336 (420)
Q Consensus 320 ~~~C~~C~~~f~~~~~l 336 (420)
-|.|+.|+..|+...++
T Consensus 117 ~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAM 133 (178)
T ss_pred EEECCCCCcEEeHHHHh
Confidence 35555555555555444
No 102
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=67.86 E-value=1.6 Score=35.26 Aligned_cols=18 Identities=28% Similarity=0.512 Sum_probs=11.0
Q ss_pred eecCCCCCccCChHHHhh
Q psy10074 321 YVCQLCPKQFLTSGELKY 338 (420)
Q Consensus 321 ~~C~~C~~~f~~~~~l~~ 338 (420)
+.|+.||.+|.+...+..
T Consensus 29 ~~c~~c~~~f~~~e~~~~ 46 (154)
T PRK00464 29 RECLACGKRFTTFERVEL 46 (154)
T ss_pred eeccccCCcceEeEeccC
Confidence 666677766666554443
No 103
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=67.40 E-value=3.1 Score=36.00 Aligned_cols=82 Identities=17% Similarity=0.282 Sum_probs=52.0
Q ss_pred cCCcccccchhhccccccccchhhhhc--cCCCCcccccccccccchhhhhhhhhhhhhhhhhhhcccccccccCCCCcc
Q psy10074 47 LKHKYVCYACSYATYYNGNMKKHLRIH--LGDRPFMCTLLICQYCLICQYCNQYFVRSTQDILQHGKVCNFAPRSDANFN 124 (420)
Q Consensus 47 ~~~~~~C~~C~~~f~~~~~l~~H~~~h--~~~~~~~c~~c~~~~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~ 124 (420)
+-+.|+|..|+...---..| .|+-+- .....|+|.-|+....|.|--|..-|=+ .|.+..... -....+
T Consensus 139 GGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrlGq~sCLRCK~cfCd------dHvrrKg~k--y~k~k~ 209 (314)
T PF06524_consen 139 GGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNRLGQYSCLRCKICFCD------DHVRRKGFK--YEKGKP 209 (314)
T ss_pred CCeEEEeecCCCeeeccchh-hhhhhhhhhhcccccccccccccchhhhheeeeehh------hhhhhcccc--cccCCC
Confidence 66789999998654433444 454321 2235688888888888888777766643 233332111 244578
Q ss_pred cccccccCCCCCc
Q psy10074 125 FVCILCNYHIPCK 137 (420)
Q Consensus 125 ~~C~~C~~~f~~~ 137 (420)
+.|+.|+......
T Consensus 210 ~PCPKCg~et~eT 222 (314)
T PF06524_consen 210 IPCPKCGYETQET 222 (314)
T ss_pred CCCCCCCCccccc
Confidence 9999999877666
No 104
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=66.27 E-value=3.5 Score=33.61 Aligned_cols=34 Identities=18% Similarity=0.448 Sum_probs=19.3
Q ss_pred cCCcceecCcCCccccChhhHHHHHHhhcCCCCCcccCcCcccc
Q psy10074 344 HRKRVHECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKA 387 (420)
Q Consensus 344 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f 387 (420)
.+..-|.||.|+..|+....+. .-|.||.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME----------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH----------cCCcCCCCCCEe
Confidence 3445566666666666655553 136666666543
No 105
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=65.37 E-value=5.1 Score=24.60 Aligned_cols=27 Identities=26% Similarity=0.609 Sum_probs=18.0
Q ss_pred ceeccccCccccChhHHHHHHHHhcCCCceeccccccc
Q psy10074 171 RYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYR 208 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 208 (420)
.|.|..|+..|.-. ...+.+|+.|+..
T Consensus 2 ~Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 2 IYICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EEECCCCCCEeecC-----------CCCceECCCCCce
Confidence 47788888776533 3356788888754
No 106
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=64.68 E-value=3.2 Score=33.42 Aligned_cols=12 Identities=25% Similarity=0.564 Sum_probs=6.3
Q ss_pred cccccchhhhcc
Q psy10074 15 IKCCHCNKYISS 26 (420)
Q Consensus 15 ~~C~~C~~~f~~ 26 (420)
|.|+.|+..|..
T Consensus 100 Y~Cp~C~~~y~~ 111 (147)
T smart00531 100 YKCPNCQSKYTF 111 (147)
T ss_pred EECcCCCCEeeH
Confidence 455555555554
No 107
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=63.74 E-value=4.1 Score=31.27 Aligned_cols=13 Identities=23% Similarity=0.089 Sum_probs=6.5
Q ss_pred cceecCcCCcccc
Q psy10074 347 RVHECSECGFTFF 359 (420)
Q Consensus 347 ~~~~C~~C~~~f~ 359 (420)
.|..||.||..|.
T Consensus 25 ~p~vcP~cg~~~~ 37 (129)
T TIGR02300 25 RPAVSPYTGEQFP 37 (129)
T ss_pred CCccCCCcCCccC
Confidence 3445555555543
No 108
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=62.85 E-value=6 Score=30.60 Aligned_cols=55 Identities=24% Similarity=0.442 Sum_probs=29.3
Q ss_pred cCceecCCCCCccCChHHHhhhhhhhcCCcceecC---cCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHH
Q psy10074 318 EKRYVCQLCPKQFLTSGELKYHQTTVHRKRVHECS---ECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKF 391 (420)
Q Consensus 318 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~---~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~ 391 (420)
.+.|+|.+|..+.....-| +|-+|- +|+.- -..|+.|-. .|. .||+|.-+|.+.+
T Consensus 78 ~~lYeCnIC~etS~ee~FL----------KPneCCgY~iCn~C---ya~LWK~~~-~yp-----vCPvCkTSFKss~ 135 (140)
T PF05290_consen 78 PKLYECNICKETSAEERFL----------KPNECCGYSICNAC---YANLWKFCN-LYP-----VCPVCKTSFKSSS 135 (140)
T ss_pred CCceeccCcccccchhhcC----------CcccccchHHHHHH---HHHHHHHcc-cCC-----CCCcccccccccc
Confidence 4557777777666554433 233331 12222 245666644 332 4888887777643
No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=62.13 E-value=4.3 Score=33.79 Aligned_cols=15 Identities=20% Similarity=0.678 Sum_probs=6.8
Q ss_pred ceecCcCCccccChh
Q psy10074 348 VHECSECGFTFFLEK 362 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~ 362 (420)
-|.||.|+..|+...
T Consensus 117 ~Y~Cp~C~~rytf~e 131 (178)
T PRK06266 117 FFFCPNCHIRFTFDE 131 (178)
T ss_pred EEECCCCCcEEeHHH
Confidence 344444444444433
No 110
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.99 E-value=5.4 Score=32.58 Aligned_cols=24 Identities=25% Similarity=0.623 Sum_probs=15.4
Q ss_pred ceecCCCCCccCChHHHhhhhhhhcCCcceecCcCCc
Q psy10074 320 RYVCQLCPKQFLTSGELKYHQTTVHRKRVHECSECGF 356 (420)
Q Consensus 320 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 356 (420)
.|.|++||..+. ++.|-+||+||.
T Consensus 134 ~~vC~vCGy~~~-------------ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTHE-------------GEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCccc-------------CCCCCcCCCCCC
Confidence 477777776443 345677777774
No 111
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=61.28 E-value=1.9 Score=29.71 Aligned_cols=44 Identities=11% Similarity=0.196 Sum_probs=27.5
Q ss_pred cccccchhhhccChHHHHHhcccccchhcccccCCccccc--chhhccccccccch
Q psy10074 15 IKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCY--ACSYATYYNGNMKK 68 (420)
Q Consensus 15 ~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~--~C~~~f~~~~~l~~ 68 (420)
+.|+.|+....- ...-... ....+..++|. +||..|.+..++..
T Consensus 2 m~CP~Cg~~a~i-rtSr~~s---------~~~~~~Y~qC~N~eCg~tF~t~es~s~ 47 (72)
T PRK09678 2 FHCPLCQHAAHA-RTSRYIT---------DTTKERYHQCQNVNCSATFITYESVQR 47 (72)
T ss_pred ccCCCCCCccEE-EEChhcC---------hhhheeeeecCCCCCCCEEEEEEEEEE
Confidence 468888866533 2222222 22456678898 89999888777643
No 112
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=61.03 E-value=4.2 Score=28.81 Aligned_cols=32 Identities=16% Similarity=0.315 Sum_probs=21.6
Q ss_pred cccccccchhhhccChHHHHHhcccccchhcccccCCcccccchhhcccc
Q psy10074 13 ADIKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYATYY 62 (420)
Q Consensus 13 ~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~f~~ 62 (420)
.+|.|+.|++. . + .......|.|..|++.|.-
T Consensus 34 ~~~~Cp~C~~~--~-------V---------kR~a~GIW~C~kCg~~fAG 65 (89)
T COG1997 34 AKHVCPFCGRT--T-------V---------KRIATGIWKCRKCGAKFAG 65 (89)
T ss_pred cCCcCCCCCCc--c-------e---------eeeccCeEEcCCCCCeecc
Confidence 67888888855 2 1 2224557889999888753
No 113
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=60.67 E-value=5.4 Score=41.00 Aligned_cols=49 Identities=27% Similarity=0.614 Sum_probs=37.2
Q ss_pred cccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHhcCCCceeccc
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIY 204 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 204 (420)
..|..||..+ .||.|. ..|..|. .. . ...|..||.. ...|..|+.
T Consensus 436 l~C~~Cg~v~---~Cp~Cd-------~~lt~H~-~~----~-----~L~CH~Cg~~---------------~~~p~~Cp~ 480 (730)
T COG1198 436 LLCRDCGYIA---ECPNCD-------SPLTLHK-AT----G-----QLRCHYCGYQ---------------EPIPQSCPE 480 (730)
T ss_pred eecccCCCcc---cCCCCC-------cceEEec-CC----C-----eeEeCCCCCC---------------CCCCCCCCC
Confidence 4699999988 588888 6677776 22 2 6899999864 457889999
Q ss_pred cccc
Q psy10074 205 CAYR 208 (420)
Q Consensus 205 C~~~ 208 (420)
|+-.
T Consensus 481 Cgs~ 484 (730)
T COG1198 481 CGSE 484 (730)
T ss_pred CCCC
Confidence 9865
No 114
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=58.71 E-value=3.2 Score=35.53 Aligned_cols=25 Identities=28% Similarity=0.550 Sum_probs=12.1
Q ss_pred cccCcCccccCCHHHHHcccccccc
Q psy10074 378 FPCPFCEYKAKRKFELKSHMKRKHT 402 (420)
Q Consensus 378 ~~C~~C~~~f~~~~~l~~H~~~~H~ 402 (420)
|.|+.|++.|.....++.||..+|+
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred ECCCCCCcccCChHHHHHHHhhcCH
Confidence 5555555555555555555555554
No 115
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=58.52 E-value=9.6 Score=24.04 Aligned_cols=23 Identities=35% Similarity=0.738 Sum_probs=12.3
Q ss_pred ecCcCCccccCh-----hhHHHHHHhhc
Q psy10074 350 ECSECGFTFFLE-----KTLRAHIRETH 372 (420)
Q Consensus 350 ~C~~C~~~f~~~-----~~L~~H~~~~H 372 (420)
.|..|++.++.. +.|.+|++..|
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 455555555433 46666666433
No 116
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=58.37 E-value=4.4 Score=23.30 Aligned_cols=10 Identities=40% Similarity=1.029 Sum_probs=7.9
Q ss_pred ceeccccCcc
Q psy10074 171 RYVCFTCDYH 180 (420)
Q Consensus 171 ~~~C~~C~~~ 180 (420)
++.|+.|+..
T Consensus 22 R~vC~~Cg~I 31 (34)
T PF14803_consen 22 RLVCPACGFI 31 (34)
T ss_dssp EEEETTTTEE
T ss_pred ceECCCCCCE
Confidence 6889988864
No 117
>KOG2272|consensus
Probab=57.95 E-value=19 Score=31.22 Aligned_cols=16 Identities=25% Similarity=0.766 Sum_probs=12.2
Q ss_pred ccchhhhhhhhhhhhh
Q psy10074 88 YCLICQYCNQYFVRST 103 (420)
Q Consensus 88 ~~~~C~~C~~~f~~~~ 103 (420)
.-|.|..|++...+..
T Consensus 98 ~CF~Cd~Cn~~Lad~g 113 (332)
T KOG2272|consen 98 ACFRCDLCNKHLADQG 113 (332)
T ss_pred ccchhHHHHHHHhhhh
Confidence 4577888888877766
No 118
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=57.03 E-value=6.6 Score=24.52 Aligned_cols=11 Identities=27% Similarity=1.129 Sum_probs=4.6
Q ss_pred ccCCccchhcc
Q psy10074 288 FHCHYCDYICR 298 (420)
Q Consensus 288 ~~C~~C~~~f~ 298 (420)
|+|..|+..+.
T Consensus 2 y~C~~CgyvYd 12 (47)
T PF00301_consen 2 YQCPVCGYVYD 12 (47)
T ss_dssp EEETTTSBEEE
T ss_pred cCCCCCCEEEc
Confidence 34444444433
No 119
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=56.93 E-value=7.4 Score=24.43 Aligned_cols=11 Identities=27% Similarity=0.908 Sum_probs=5.9
Q ss_pred ceecCCCCCcc
Q psy10074 320 RYVCQLCPKQF 330 (420)
Q Consensus 320 ~~~C~~C~~~f 330 (420)
.|.|..|++.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 35555555555
No 120
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=56.09 E-value=6.9 Score=20.19 Aligned_cols=6 Identities=33% Similarity=0.993 Sum_probs=2.7
Q ss_pred ccccCc
Q psy10074 174 CFTCDY 179 (420)
Q Consensus 174 C~~C~~ 179 (420)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 444443
No 121
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=55.15 E-value=5.7 Score=22.08 Aligned_cols=9 Identities=44% Similarity=1.165 Sum_probs=4.3
Q ss_pred ceecCcCCc
Q psy10074 348 VHECSECGF 356 (420)
Q Consensus 348 ~~~C~~C~~ 356 (420)
.|.|+.|+.
T Consensus 19 ~~vCp~C~~ 27 (30)
T PF08274_consen 19 LLVCPECGH 27 (30)
T ss_dssp SEEETTTTE
T ss_pred EEeCCcccc
Confidence 445555543
No 122
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=54.32 E-value=11 Score=33.61 Aligned_cols=20 Identities=25% Similarity=0.406 Sum_probs=10.5
Q ss_pred ceecCcCCccccChhhHHHH
Q psy10074 348 VHECSECGFTFFLEKTLRAH 367 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H 367 (420)
.|+|+.|...|-.--....|
T Consensus 388 rY~Ce~CK~~FC~dCdvfiH 407 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIH 407 (421)
T ss_pred ceechhhhhhhhhhhHHHHH
Confidence 35555555555554444444
No 123
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=53.68 E-value=7.2 Score=29.99 Aligned_cols=33 Identities=18% Similarity=0.322 Sum_probs=22.2
Q ss_pred eecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCCHHHH
Q psy10074 349 HECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKRKFEL 393 (420)
Q Consensus 349 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l 393 (420)
..|+.||..|... +..|..|++||..|.-...+
T Consensus 10 r~Cp~cg~kFYDL------------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 10 RICPNTGSKFYDL------------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred ccCCCcCcccccc------------CCCCccCCCcCCccCcchhh
Confidence 4688888887653 22577788888887655333
No 124
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=53.58 E-value=13 Score=22.32 Aligned_cols=25 Identities=24% Similarity=0.246 Sum_probs=20.2
Q ss_pred ceeccccCcccc--ChhHHHHHHHHhc
Q psy10074 171 RYVCFTCDYHSF--CSSHMKDHIRVHL 195 (420)
Q Consensus 171 ~~~C~~C~~~f~--~~~~l~~H~~~h~ 195 (420)
...|+.|++.|. .++.-..|.+.|.
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 358999999886 5778888988874
No 125
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=53.53 E-value=8 Score=20.55 Aligned_cols=7 Identities=29% Similarity=0.748 Sum_probs=3.3
Q ss_pred ccccccC
Q psy10074 126 VCILCNY 132 (420)
Q Consensus 126 ~C~~C~~ 132 (420)
.|+.||.
T Consensus 4 ~Cp~Cg~ 10 (26)
T PF13248_consen 4 FCPNCGA 10 (26)
T ss_pred CCcccCC
Confidence 3445554
No 126
>KOG4124|consensus
Probab=52.16 E-value=5.2 Score=36.26 Aligned_cols=117 Identities=16% Similarity=0.190 Sum_probs=0.0
Q ss_pred CCCCccCCc--cchhccChhHHhhhhhcccCCcccccCceecCCCCCccCChHHHhhhhhhhcCC---------------
Q psy10074 284 GEKPFHCHY--CDYICRTKYDLSSHIGRKHTDKEVLEKRYVCQLCPKQFLTSGELKYHQTTVHRK--------------- 346 (420)
Q Consensus 284 ~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~--------------- 346 (420)
.+.++.|+. |.+..........|-...|...+....+|+|+ |+....++..|..|-..-+..
T Consensus 175 EE~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~-~~~~~~T~~~l~~HS~N~~~~~Sa~n~~~~~Q~~~~ 253 (442)
T KOG4124|consen 175 EEYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP-ESLVMDTSSPLSDHSMNIDVGESAANNVFAFQQKDM 253 (442)
T ss_pred ccccccccCchhhhhhhccccccccccccccccccccCCccCc-ccccccccchhhhccccCCCCccHHHHHHHHhhccc
Q ss_pred --------------------------------------------------------------------------------
Q psy10074 347 -------------------------------------------------------------------------------- 346 (420)
Q Consensus 347 -------------------------------------------------------------------------------- 346 (420)
T Consensus 254 ~~st~~~~~~~~h~~~~~~~~~~s~~~~~~s~~~ptp~v~~~a~~~s~~~~p~v~~~s~~~~~~~s~~~~s~~~~~~~~~ 333 (442)
T KOG4124|consen 254 VDSTYLPPFNYDHDVFSFAPSVASADQFTESSMSPTPEVVSPAATNSAISSPFVRKSSSDLEAKPSKKQRSTPAFSHDSP 333 (442)
T ss_pred ccccccCCcCccchhhhhhhhhhccccccccCCCCCcccccccccccccCCCceeeccCCcccCcchhccCcccccCCCc
Q ss_pred --------------cceecCc--CCccccChhhHHHHHHhhc------------------CCCCCcccCcCccccCCHHH
Q psy10074 347 --------------RVHECSE--CGFTFFLEKTLRAHIRETH------------------RKETPFPCPFCEYKAKRKFE 392 (420)
Q Consensus 347 --------------~~~~C~~--C~~~f~~~~~L~~H~~~~H------------------~~~~~~~C~~C~~~f~~~~~ 392 (420)
++|.|++ |.+.+.....|..|....| .-.++|.|++|.+++...-.
T Consensus 334 ~~~~~~~~~~~~~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~ 413 (442)
T KOG4124|consen 334 LTIDYPGSNLVVVDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNG 413 (442)
T ss_pred eeeecCCCeEEEecCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCC
Q ss_pred HHccccccc
Q psy10074 393 LKSHMKRKH 401 (420)
Q Consensus 393 l~~H~~~~H 401 (420)
|..|+...|
T Consensus 414 l~~~~~~~~ 422 (442)
T KOG4124|consen 414 LKYHRTHSH 422 (442)
T ss_pred CCceeehhh
No 127
>KOG2593|consensus
Probab=51.92 E-value=6.9 Score=36.82 Aligned_cols=35 Identities=17% Similarity=0.214 Sum_probs=16.1
Q ss_pred cccccchhhccccccccchhhhhccCCCCccccccccc
Q psy10074 50 KYVCYACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQ 87 (420)
Q Consensus 50 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~ 87 (420)
.|.|+.|++.|.....+ ...-.....|.|..|++.
T Consensus 128 ~Y~Cp~C~kkyt~Lea~---~L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEAL---QLLDNETGEFHCENCGGE 162 (436)
T ss_pred cccCCccccchhhhHHH---HhhcccCceEEEecCCCc
Confidence 35666666655544433 222222334555444443
No 128
>KOG2272|consensus
Probab=51.57 E-value=15 Score=31.73 Aligned_cols=20 Identities=15% Similarity=0.363 Sum_probs=12.5
Q ss_pred HhcCCCceecccccccccCH
Q psy10074 193 VHLGEKPYMCIYCAYRCTQK 212 (420)
Q Consensus 193 ~h~~~~~~~C~~C~~~f~~~ 212 (420)
.|.-..-|.|..|.+.|-..
T Consensus 215 KhWHveHFvCa~CekPFlGH 234 (332)
T KOG2272|consen 215 KHWHVEHFVCAKCEKPFLGH 234 (332)
T ss_pred cccchhheeehhcCCcccch
Confidence 34444557788787777653
No 129
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=51.30 E-value=5.5 Score=34.56 Aligned_cols=18 Identities=17% Similarity=0.189 Sum_probs=9.1
Q ss_pred CCCCcccccccccCCCCC
Q psy10074 119 SDANFNFVCILCNYHIPC 136 (420)
Q Consensus 119 h~~~~~~~C~~C~~~f~~ 136 (420)
|.-+....|..|++.-.+
T Consensus 33 ~pcN~~MeCdkC~r~QKn 50 (314)
T PF06524_consen 33 HPCNALMECDKCQRKQKN 50 (314)
T ss_pred CCCCccccchhhhhhccc
Confidence 444444556666554433
No 130
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=51.10 E-value=9.5 Score=31.17 Aligned_cols=25 Identities=36% Similarity=0.836 Sum_probs=19.2
Q ss_pred ceeccccCccccChhHHHHHHHHhcCCCceeccccccc
Q psy10074 171 RYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYR 208 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 208 (420)
.+.|++||..+ .++.|-+||+|+..
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga~ 158 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGAP 158 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCCh
Confidence 58899998764 36778899999843
No 131
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=49.36 E-value=8.9 Score=24.30 Aligned_cols=9 Identities=44% Similarity=1.283 Sum_probs=3.6
Q ss_pred cCCccchhc
Q psy10074 289 HCHYCDYIC 297 (420)
Q Consensus 289 ~C~~C~~~f 297 (420)
.|..|+..+
T Consensus 3 ~C~~CgyiY 11 (50)
T cd00730 3 ECRICGYIY 11 (50)
T ss_pred CCCCCCeEE
Confidence 344444333
No 132
>KOG2186|consensus
Probab=49.01 E-value=14 Score=32.00 Aligned_cols=50 Identities=16% Similarity=0.327 Sum_probs=34.7
Q ss_pred ceeccccCccccChhHHHHHHHHhcCCCceecccccccccCHHHHHHHHhhcc
Q psy10074 171 RYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAYRCTQKSDLKKHLRKKH 223 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h 223 (420)
-|.|..||....-+ .+.+|+-.-++ ..|.|--|+..|.. .++..|.+--.
T Consensus 3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcc
Confidence 47788888766544 34448776655 66888888888876 77777865433
No 133
>PF14353 CpXC: CpXC protein
Probab=48.12 E-value=3.1 Score=32.57 Aligned_cols=22 Identities=14% Similarity=0.239 Sum_probs=16.0
Q ss_pred cccccchhhccccccccchhhh
Q psy10074 50 KYVCYACSYATYYNGNMKKHLR 71 (420)
Q Consensus 50 ~~~C~~C~~~f~~~~~l~~H~~ 71 (420)
.+.|+.||..|.-...+.-|-.
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~ 59 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDP 59 (128)
T ss_pred EEECCCCCCceecCCCEEEEcC
Confidence 4888888888877777666643
No 134
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=46.55 E-value=17 Score=39.39 Aligned_cols=22 Identities=23% Similarity=0.377 Sum_probs=17.7
Q ss_pred cccccccCCCCCcccccccccc
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKL 146 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~ 146 (420)
+.|+.||.......|+.||...
T Consensus 668 rkCPkCG~~t~~~fCP~CGs~t 689 (1337)
T PRK14714 668 RRCPSCGTETYENRCPDCGTHT 689 (1337)
T ss_pred EECCCCCCccccccCcccCCcC
Confidence 7899999876666899999553
No 135
>KOG2807|consensus
Probab=46.24 E-value=23 Score=32.02 Aligned_cols=20 Identities=25% Similarity=0.489 Sum_probs=10.6
Q ss_pred ceecCcCCccccChhhHHHH
Q psy10074 348 VHECSECGFTFFLEKTLRAH 367 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H 367 (420)
.|.|..|...|-.--....|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iH 364 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIH 364 (378)
T ss_pred cEEchhccceeeccchHHHH
Confidence 45566665555544444444
No 136
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=44.97 E-value=8.8 Score=22.83 Aligned_cols=15 Identities=13% Similarity=0.022 Sum_probs=9.6
Q ss_pred CccccccchhhhccC
Q psy10074 406 SLTTEETEYYKILEH 420 (420)
Q Consensus 406 ~~~C~~C~~~f~~~~ 420 (420)
+++|..|++.|-++|
T Consensus 12 ~f~C~~C~~~FC~~H 26 (39)
T smart00154 12 GFKCRHCGNLFCGEH 26 (39)
T ss_pred CeECCccCCcccccc
Confidence 566777777666554
No 137
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=44.61 E-value=9.5 Score=23.17 Aligned_cols=15 Identities=20% Similarity=0.428 Sum_probs=9.0
Q ss_pred cCCcccccchhhccc
Q psy10074 47 LKHKYVCYACSYATY 61 (420)
Q Consensus 47 ~~~~~~C~~C~~~f~ 61 (420)
....+.|+.||.+..
T Consensus 16 ~~g~~vC~~CG~Vl~ 30 (43)
T PF08271_consen 16 ERGELVCPNCGLVLE 30 (43)
T ss_dssp TTTEEEETTT-BBEE
T ss_pred CCCeEECCCCCCEee
Confidence 345578888876643
No 138
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=43.59 E-value=11 Score=23.87 Aligned_cols=13 Identities=23% Similarity=0.963 Sum_probs=10.5
Q ss_pred Ccccccchhhccc
Q psy10074 49 HKYVCYACSYATY 61 (420)
Q Consensus 49 ~~~~C~~C~~~f~ 61 (420)
..+.|..||+++.
T Consensus 36 ~r~~C~~Cgyt~~ 48 (50)
T PRK00432 36 DRWHCGKCGYTEF 48 (50)
T ss_pred CcEECCCcCCEEe
Confidence 5789999998764
No 139
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=43.53 E-value=18 Score=31.03 Aligned_cols=26 Identities=31% Similarity=0.479 Sum_probs=15.5
Q ss_pred ceecCcCCccccChhhHHHHHHhhcC
Q psy10074 348 VHECSECGFTFFLEKTLRAHIRETHR 373 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~~~H~ 373 (420)
-|.|+.|++.|........|+...|.
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred EECCCCCCcccCChHHHHHHHhhcCH
Confidence 46677777777777777777766664
No 140
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=43.45 E-value=13 Score=19.47 Aligned_cols=8 Identities=38% Similarity=1.186 Sum_probs=4.0
Q ss_pred ceecCcCC
Q psy10074 348 VHECSECG 355 (420)
Q Consensus 348 ~~~C~~C~ 355 (420)
+|.||.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 35555554
No 141
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=41.93 E-value=11 Score=21.32 Aligned_cols=10 Identities=20% Similarity=0.467 Sum_probs=5.3
Q ss_pred ceeccccCcc
Q psy10074 171 RYVCFTCDYH 180 (420)
Q Consensus 171 ~~~C~~C~~~ 180 (420)
+.+|+.||..
T Consensus 17 ~irC~~CG~R 26 (32)
T PF03604_consen 17 PIRCPECGHR 26 (32)
T ss_dssp TSSBSSSS-S
T ss_pred cEECCcCCCe
Confidence 3466666654
No 142
>KOG2593|consensus
Probab=41.53 E-value=21 Score=33.71 Aligned_cols=17 Identities=12% Similarity=0.235 Sum_probs=8.0
Q ss_pred cceecCcCCccccChhh
Q psy10074 347 RVHECSECGFTFFLEKT 363 (420)
Q Consensus 347 ~~~~C~~C~~~f~~~~~ 363 (420)
..|.||.|.+.|+....
T Consensus 127 ~~Y~Cp~C~kkyt~Lea 143 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEA 143 (436)
T ss_pred ccccCCccccchhhhHH
Confidence 34555555555544433
No 143
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=40.98 E-value=16 Score=36.14 Aligned_cols=49 Identities=27% Similarity=0.670 Sum_probs=34.6
Q ss_pred cccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHhcCCCceeccc
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIY 204 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 204 (420)
..|..||... .|+.|+ ..|..|. ... ...|..||... ..+..|+.
T Consensus 214 ~~C~~Cg~~~---~C~~C~-------~~l~~h~-----~~~-----~l~Ch~Cg~~~---------------~~~~~Cp~ 258 (505)
T TIGR00595 214 LLCRSCGYIL---CCPNCD-------VSLTYHK-----KEG-----KLRCHYCGYQE---------------PIPKTCPQ 258 (505)
T ss_pred eEhhhCcCcc---CCCCCC-------CceEEec-----CCC-----eEEcCCCcCcC---------------CCCCCCCC
Confidence 4699999887 588887 5666664 222 57899998653 35678998
Q ss_pred cccc
Q psy10074 205 CAYR 208 (420)
Q Consensus 205 C~~~ 208 (420)
|+..
T Consensus 259 C~s~ 262 (505)
T TIGR00595 259 CGSE 262 (505)
T ss_pred CCCC
Confidence 8753
No 144
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.92 E-value=20 Score=27.30 Aligned_cols=39 Identities=10% Similarity=0.225 Sum_probs=17.4
Q ss_pred cccCcCccccCCHHHHHccccccccCCCCccccccchhh
Q psy10074 378 FPCPFCEYKAKRKFELKSHMKRKHTFEDSLTTEETEYYK 416 (420)
Q Consensus 378 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 416 (420)
.+||+|..+-.....+.-=+---|..+.|..|..||.-|
T Consensus 40 ~qcp~csasirgd~~vegvlglg~dye~psfchncgs~f 78 (160)
T COG4306 40 TQCPICSASIRGDYYVEGVLGLGGDYEPPSFCHNCGSRF 78 (160)
T ss_pred hcCCccCCcccccceeeeeeccCCCCCCcchhhcCCCCC
Confidence 456666655443222211111223334455566666655
No 145
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.91 E-value=16 Score=23.92 Aligned_cols=39 Identities=18% Similarity=0.364 Sum_probs=24.6
Q ss_pred ccccchhh-hccChHHHHHhcccccchhcccccCCcccccchhhcccc
Q psy10074 16 KCCHCNKY-ISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYATYY 62 (420)
Q Consensus 16 ~C~~C~~~-f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~f~~ 62 (420)
+|.+|++. +.+ ...|..- .......+.|.|++|......
T Consensus 4 kCiiCd~v~~iD-~rt~~tK-------rLrN~PIrtymC~eC~~Rva~ 43 (68)
T COG4896 4 KCIICDRVDEID-NRTFKTK-------RLRNKPIRTYMCPECEHRVAI 43 (68)
T ss_pred eEEEecceeeec-chhHHHH-------HhhCCCceeEechhhHhhhch
Confidence 58888876 444 5555554 334445667889988765543
No 146
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=38.33 E-value=17 Score=21.71 Aligned_cols=32 Identities=19% Similarity=0.621 Sum_probs=16.0
Q ss_pred ccccccchhhhccChHHHHHhcccccchhcccccCCcccccchhhc
Q psy10074 14 DIKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYA 59 (420)
Q Consensus 14 ~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~ 59 (420)
|.+|..|+..+.- . .......+.|.|..|+..
T Consensus 2 p~rC~~C~aylNp-~-------------~~~~~~~~~w~C~~C~~~ 33 (40)
T PF04810_consen 2 PVRCRRCRAYLNP-F-------------CQFDDGGKTWICNFCGTK 33 (40)
T ss_dssp S-B-TTT--BS-T-T-------------SEEETTTTEEEETTT--E
T ss_pred ccccCCCCCEECC-c-------------ceEcCCCCEEECcCCCCc
Confidence 5677777766665 2 334444567888888865
No 147
>KOG1280|consensus
Probab=38.26 E-value=22 Score=32.44 Aligned_cols=40 Identities=25% Similarity=0.672 Sum_probs=31.2
Q ss_pred CCcceecCcCCccccChhhHHHHHHhhcCCCCC-cccCcCc
Q psy10074 345 RKRVHECSECGFTFFLEKTLRAHIRETHRKETP-FPCPFCE 384 (420)
Q Consensus 345 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~-~~C~~C~ 384 (420)
....|.|++|+..=.+...|..|+...|....+ ..|++|.
T Consensus 76 ~~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 76 DPQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred ccccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 345799999999888899999999988876543 4466664
No 148
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=38.00 E-value=13 Score=20.29 Aligned_cols=9 Identities=33% Similarity=0.689 Sum_probs=4.4
Q ss_pred cccchhhcc
Q psy10074 52 VCYACSYAT 60 (420)
Q Consensus 52 ~C~~C~~~f 60 (420)
.|-.|++.|
T Consensus 2 sCiDC~~~F 10 (28)
T PF08790_consen 2 SCIDCSKDF 10 (28)
T ss_dssp EETTTTEEE
T ss_pred eeecCCCCc
Confidence 344555555
No 149
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=37.97 E-value=6.3 Score=28.39 Aligned_cols=14 Identities=14% Similarity=0.351 Sum_probs=10.5
Q ss_pred CCcccccchhhccc
Q psy10074 48 KHKYVCYACSYATY 61 (420)
Q Consensus 48 ~~~~~C~~C~~~f~ 61 (420)
...|.|..|++.|.
T Consensus 52 ~GIW~C~~C~~~~A 65 (90)
T PTZ00255 52 VGIWRCKGCKKTVA 65 (90)
T ss_pred eEEEEcCCCCCEEe
Confidence 45788888888775
No 150
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=37.05 E-value=7.3 Score=22.73 Aligned_cols=15 Identities=20% Similarity=0.596 Sum_probs=9.3
Q ss_pred hcccccCCcccccch
Q psy10074 42 TRTDLLKHKYVCYAC 56 (420)
Q Consensus 42 ~~~~~~~~~~~C~~C 56 (420)
=....+..+|.|..|
T Consensus 21 G~~~~G~qryrC~~C 35 (36)
T PF03811_consen 21 GKSPSGHQRYRCKDC 35 (36)
T ss_pred CCCCCCCEeEecCcC
Confidence 445555666777766
No 151
>PF15269 zf-C2H2_7: Zinc-finger
Probab=36.67 E-value=26 Score=21.26 Aligned_cols=21 Identities=38% Similarity=1.053 Sum_probs=14.2
Q ss_pred eecccccccccCHHHHHHHHh
Q psy10074 200 YMCIYCAYRCTQKSDLKKHLR 220 (420)
Q Consensus 200 ~~C~~C~~~f~~~~~l~~H~~ 220 (420)
|+|-.|.+....++.|..||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 566666666666777777765
No 152
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=35.65 E-value=23 Score=28.41 Aligned_cols=10 Identities=20% Similarity=0.471 Sum_probs=4.9
Q ss_pred eeccccCccc
Q psy10074 172 YVCFTCDYHS 181 (420)
Q Consensus 172 ~~C~~C~~~f 181 (420)
+.|+.|++.|
T Consensus 125 ~~C~~C~kiy 134 (147)
T PF01927_consen 125 WRCPGCGKIY 134 (147)
T ss_pred EECCCCCCEe
Confidence 4455555444
No 153
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=35.53 E-value=27 Score=33.82 Aligned_cols=29 Identities=21% Similarity=0.426 Sum_probs=24.1
Q ss_pred cceecCcCCccccChhhHHHHHHhhcCCC
Q psy10074 347 RVHECSECGFTFFLEKTLRAHIRETHRKE 375 (420)
Q Consensus 347 ~~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 375 (420)
+-|.|+.|.+.|.+...+..|+...|.+.
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 34789999999999999999998888654
No 154
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=34.90 E-value=9.4 Score=27.88 Aligned_cols=11 Identities=0% Similarity=-0.324 Sum_probs=5.4
Q ss_pred ccccccchhhh
Q psy10074 407 LTTEETEYYKI 417 (420)
Q Consensus 407 ~~C~~C~~~f~ 417 (420)
..|..||..|.
T Consensus 47 ~~Cg~CGls~e 57 (104)
T COG4888 47 AVCGNCGLSFE 57 (104)
T ss_pred EEcccCcceEE
Confidence 44555555443
No 155
>KOG1701|consensus
Probab=34.52 E-value=9.8 Score=35.70 Aligned_cols=47 Identities=13% Similarity=0.283 Sum_probs=23.1
Q ss_pred cccccchhhccccccccchhhhhccCCCCcccccccccccchhhhhhhhhhhhh
Q psy10074 50 KYVCYACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQYCLICQYCNQYFVRST 103 (420)
Q Consensus 50 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~~~~~C~~C~~~f~~~~ 103 (420)
=|+|..|++...-++-+..- .++ .|+-|-....-+|..||....++.
T Consensus 302 CFtC~~C~r~L~Gq~FY~v~------~k~-~CE~cyq~tlekC~~Cg~~I~d~i 348 (468)
T KOG1701|consen 302 CFTCRTCRRQLAGQSFYQVD------GKP-YCEGCYQDTLEKCNKCGEPIMDRI 348 (468)
T ss_pred ceehHhhhhhhccccccccC------Ccc-cchHHHHHHHHHHhhhhhHHHHHH
Confidence 37777776654333222211 122 233333334456677777666553
No 156
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=34.45 E-value=16 Score=28.65 Aligned_cols=28 Identities=32% Similarity=0.644 Sum_probs=18.0
Q ss_pred CCcccccchhhccccccccchhhhhccCCCC
Q psy10074 48 KHKYVCYACSYATYYNGNMKKHLRIHLGDRP 78 (420)
Q Consensus 48 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 78 (420)
.....|-+||+.|. .|.+|+..|++..|
T Consensus 70 ~d~i~clecGk~~k---~LkrHL~~~~gltp 97 (132)
T PF05443_consen 70 PDYIICLECGKKFK---TLKRHLRTHHGLTP 97 (132)
T ss_dssp SS-EE-TBT--EES---BHHHHHHHTT-S-H
T ss_pred cCeeEEccCCcccc---hHHHHHHHccCCCH
Confidence 34589999999998 45899999987665
No 157
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=33.78 E-value=23 Score=22.29 Aligned_cols=12 Identities=42% Similarity=1.049 Sum_probs=9.0
Q ss_pred ceeccccCcccc
Q psy10074 171 RYVCFTCDYHSF 182 (420)
Q Consensus 171 ~~~C~~C~~~f~ 182 (420)
.+.|+.|+..+.
T Consensus 20 ~~vC~~Cg~~~~ 31 (52)
T smart00661 20 RFVCRKCGYEEP 31 (52)
T ss_pred EEECCcCCCeEE
Confidence 578888887654
No 158
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=33.72 E-value=11 Score=22.85 Aligned_cols=15 Identities=13% Similarity=0.115 Sum_probs=8.0
Q ss_pred CccccccchhhhccC
Q psy10074 406 SLTTEETEYYKILEH 420 (420)
Q Consensus 406 ~~~C~~C~~~f~~~~ 420 (420)
++.|..|+..|-.+|
T Consensus 13 ~~~C~~C~~~FC~~H 27 (43)
T PF01428_consen 13 PFKCKHCGKSFCLKH 27 (43)
T ss_dssp HEE-TTTS-EE-TTT
T ss_pred CeECCCCCcccCccc
Confidence 566777777776654
No 159
>KOG4167|consensus
Probab=33.63 E-value=12 Score=37.80 Aligned_cols=28 Identities=14% Similarity=0.355 Sum_probs=24.9
Q ss_pred CCcccccchhhccccccccchhhhhccC
Q psy10074 48 KHKYVCYACSYATYYNGNMKKHLRIHLG 75 (420)
Q Consensus 48 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 75 (420)
...|.|.+|++.|....++..||+.|..
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 3469999999999999999999999963
No 160
>KOG1280|consensus
Probab=33.59 E-value=37 Score=31.11 Aligned_cols=38 Identities=18% Similarity=0.418 Sum_probs=30.0
Q ss_pred cCceecCCCCCccCChHHHhhhhhhhcCCcc--eecCcCC
Q psy10074 318 EKRYVCQLCPKQFLTSGELKYHQTTVHRKRV--HECSECG 355 (420)
Q Consensus 318 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~ 355 (420)
...|.|++|+..-.+...|..|+...|.+.+ ..|++|+
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 4579999999999999999999988775543 3566664
No 161
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=33.50 E-value=17 Score=24.98 Aligned_cols=8 Identities=13% Similarity=0.293 Sum_probs=3.1
Q ss_pred cCCccchh
Q psy10074 289 HCHYCDYI 296 (420)
Q Consensus 289 ~C~~C~~~ 296 (420)
+++-|++.
T Consensus 12 ~~~cC~~~ 19 (71)
T PF05495_consen 12 RFPCCGKY 19 (71)
T ss_dssp EETTTTEE
T ss_pred ECCcccCe
Confidence 33334433
No 162
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=33.26 E-value=21 Score=23.05 Aligned_cols=8 Identities=50% Similarity=1.190 Sum_probs=3.6
Q ss_pred ecCcCCcc
Q psy10074 350 ECSECGFT 357 (420)
Q Consensus 350 ~C~~C~~~ 357 (420)
.|+.||..
T Consensus 24 ~Cp~CGae 31 (54)
T TIGR01206 24 ICDECGAE 31 (54)
T ss_pred eCCCCCCE
Confidence 44444443
No 163
>KOG2907|consensus
Probab=32.75 E-value=21 Score=26.71 Aligned_cols=11 Identities=45% Similarity=1.286 Sum_probs=6.3
Q ss_pred cccCcCccccC
Q psy10074 378 FPCPFCEYKAK 388 (420)
Q Consensus 378 ~~C~~C~~~f~ 388 (420)
|.|+.|++.|+
T Consensus 103 YTC~kC~~k~~ 113 (116)
T KOG2907|consen 103 YTCPKCKYKFT 113 (116)
T ss_pred EEcCccceeee
Confidence 55666665554
No 164
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=32.64 E-value=18 Score=24.46 Aligned_cols=9 Identities=44% Similarity=1.117 Sum_probs=3.0
Q ss_pred ecCcCCccc
Q psy10074 350 ECSECGFTF 358 (420)
Q Consensus 350 ~C~~C~~~f 358 (420)
.|..|++.|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 566666666
No 165
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=32.51 E-value=14 Score=37.33 Aligned_cols=79 Identities=16% Similarity=0.243 Sum_probs=51.1
Q ss_pred chhhhhhhhhhhhhhhhhhhcccccccccCCCCcccccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCC
Q psy10074 90 LICQYCNQYFVRSTQDILQHGKVCNFAPRSDANFNFVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVS 169 (420)
Q Consensus 90 ~~C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~ 169 (420)
-.|..|-+...+.. +-+.+.- -..|..||-+|..- ..|..= +......
T Consensus 102 a~C~~Cl~Ei~dp~-~rrY~YP------------F~~CT~CGPRfTIi-------------~alPYD--R~nTsM~---- 149 (750)
T COG0068 102 ATCEDCLEEIFDPN-SRRYLYP------------FINCTNCGPRFTII-------------EALPYD--RENTSMA---- 149 (750)
T ss_pred hhhHHHHHHhcCCC-Ccceecc------------ccccCCCCcceeee-------------ccCCCC--cccCccc----
Confidence 35688888777765 2222211 13499999999877 555443 2222222
Q ss_pred cce-eccccCccccChhHHHHHHHHhcCCCceecccccc
Q psy10074 170 YRY-VCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIYCAY 207 (420)
Q Consensus 170 ~~~-~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~ 207 (420)
.| -|+.|.+.|.++.+-+-|. ++..|+.|+-
T Consensus 150 -~F~lC~~C~~EY~dP~nRRfHA------Qp~aCp~CGP 181 (750)
T COG0068 150 -DFPLCPFCDKEYKDPLNRRFHA------QPIACPKCGP 181 (750)
T ss_pred -cCcCCHHHHHHhcCcccccccc------ccccCcccCC
Confidence 23 6999999998888877764 6788999975
No 166
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.50 E-value=22 Score=26.38 Aligned_cols=12 Identities=17% Similarity=0.393 Sum_probs=5.8
Q ss_pred CCcccCcCcccc
Q psy10074 376 TPFPCPFCEYKA 387 (420)
Q Consensus 376 ~~~~C~~C~~~f 387 (420)
.|..||+||++|
T Consensus 25 dPiVsPytG~s~ 36 (129)
T COG4530 25 DPIVSPYTGKSY 36 (129)
T ss_pred CccccCcccccc
Confidence 344455555544
No 167
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=32.28 E-value=32 Score=31.43 Aligned_cols=14 Identities=21% Similarity=0.420 Sum_probs=7.4
Q ss_pred cCCcceecCcCCcc
Q psy10074 344 HRKRVHECSECGFT 357 (420)
Q Consensus 344 ~~~~~~~C~~C~~~ 357 (420)
.|.+-..|..|+-.
T Consensus 208 ~G~RyL~CslC~te 221 (309)
T PRK03564 208 QGLRYLHCNLCESE 221 (309)
T ss_pred CCceEEEcCCCCCc
Confidence 34455566666544
No 168
>PRK14873 primosome assembly protein PriA; Provisional
Probab=32.23 E-value=24 Score=36.29 Aligned_cols=46 Identities=20% Similarity=0.424 Sum_probs=0.0
Q ss_pred cccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHhcCCCceeccc
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIY 204 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 204 (420)
..|..||..+ .|+.|+ ..|..|. ... ...|..||... .+..|+.
T Consensus 384 l~C~~Cg~~~---~C~~C~-------~~L~~h~-~~~---------~l~Ch~CG~~~----------------~p~~Cp~ 427 (665)
T PRK14873 384 LACARCRTPA---RCRHCT-------GPLGLPS-AGG---------TPRCRWCGRAA----------------PDWRCPR 427 (665)
T ss_pred eEhhhCcCee---ECCCCC-------CceeEec-CCC---------eeECCCCcCCC----------------cCccCCC
Q ss_pred cc
Q psy10074 205 CA 206 (420)
Q Consensus 205 C~ 206 (420)
|+
T Consensus 428 Cg 429 (665)
T PRK14873 428 CG 429 (665)
T ss_pred Cc
No 169
>PF05129 Elf1: Transcription elongation factor Elf1 like; InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=32.03 E-value=8.9 Score=27.19 Aligned_cols=14 Identities=14% Similarity=0.235 Sum_probs=6.3
Q ss_pred Ccccccchhhcccc
Q psy10074 49 HKYVCYACSYATYY 62 (420)
Q Consensus 49 ~~~~C~~C~~~f~~ 62 (420)
....|..|+..|.+
T Consensus 45 ~~~~C~~Cg~~~~~ 58 (81)
T PF05129_consen 45 GILSCRVCGESFQT 58 (81)
T ss_dssp EEEEESSS--EEEE
T ss_pred EEEEecCCCCeEEE
Confidence 34556666555543
No 170
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=31.94 E-value=18 Score=28.87 Aligned_cols=34 Identities=21% Similarity=0.431 Sum_probs=18.7
Q ss_pred cccccccchhhhccChHHHHHhcccccchhcccccCCcccccchhhc
Q psy10074 13 ADIKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYA 59 (420)
Q Consensus 13 ~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~ 59 (420)
-+|.|. |+..|.+ +. +| ...-.++ .|.|..|+-.
T Consensus 116 ~~Y~C~-C~q~~l~-~R---Rh-------n~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 116 YPYRCQ-CQQHYLR-IR---RH-------NTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred eeEEee-cCCccch-hh---hc-------ccccccc-eEEeccCCce
Confidence 456777 7766666 32 34 3333344 5777777643
No 171
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=31.62 E-value=31 Score=26.86 Aligned_cols=9 Identities=44% Similarity=1.191 Sum_probs=3.4
Q ss_pred cCcCCcccc
Q psy10074 351 CSECGFTFF 359 (420)
Q Consensus 351 C~~C~~~f~ 359 (420)
|..||+.|.
T Consensus 4 Ct~Cg~~f~ 12 (131)
T PF09845_consen 4 CTKCGRVFE 12 (131)
T ss_pred cCcCCCCcC
Confidence 333333333
No 172
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=31.40 E-value=20 Score=34.71 Aligned_cols=33 Identities=24% Similarity=0.351 Sum_probs=28.4
Q ss_pred cCCCCCcccCcCccccCCHHHHHccccccccCC
Q psy10074 372 HRKETPFPCPFCEYKAKRKFELKSHMKRKHTFE 404 (420)
Q Consensus 372 H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 404 (420)
+..-+-+.|+.|.+.|.+...+..|+...|.+.
T Consensus 52 n~sWrFWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 52 NKSWRFWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred cCceeEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 444556889999999999999999999999765
No 173
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=31.19 E-value=25 Score=25.75 Aligned_cols=13 Identities=15% Similarity=0.621 Sum_probs=10.2
Q ss_pred ccccccchhhhcc
Q psy10074 14 DIKCCHCNKYISS 26 (420)
Q Consensus 14 ~~~C~~C~~~f~~ 26 (420)
|.+|.-||..|.+
T Consensus 2 pH~CtrCG~vf~~ 14 (112)
T COG3364 2 PHQCTRCGEVFDD 14 (112)
T ss_pred Cceeccccccccc
Confidence 5678888888887
No 174
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=30.91 E-value=32 Score=37.42 Aligned_cols=11 Identities=36% Similarity=0.742 Sum_probs=6.0
Q ss_pred cccCcCccccC
Q psy10074 378 FPCPFCEYKAK 388 (420)
Q Consensus 378 ~~C~~C~~~f~ 388 (420)
|.|..|+.+|.
T Consensus 1254 ~RC~kC~~kyR 1264 (1337)
T PRK14714 1254 FRCLKCGTKYR 1264 (1337)
T ss_pred eeecccCcccc
Confidence 55555555553
No 175
>PF12773 DZR: Double zinc ribbon
Probab=30.65 E-value=35 Score=21.27 Aligned_cols=9 Identities=22% Similarity=0.704 Sum_probs=4.7
Q ss_pred eeccccCcc
Q psy10074 172 YVCFTCDYH 180 (420)
Q Consensus 172 ~~C~~C~~~ 180 (420)
..|+.|+..
T Consensus 30 ~~C~~Cg~~ 38 (50)
T PF12773_consen 30 KICPNCGAE 38 (50)
T ss_pred CCCcCCcCC
Confidence 455555543
No 176
>PRK04023 DNA polymerase II large subunit; Validated
Probab=30.34 E-value=39 Score=35.86 Aligned_cols=23 Identities=17% Similarity=0.536 Sum_probs=11.4
Q ss_pred cccccccCCCCCccccccccccC
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKLD 147 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~~ 147 (420)
+.|+.|+.......||.||....
T Consensus 652 ~fCP~CG~~~~~y~CPKCG~El~ 674 (1121)
T PRK04023 652 YRCPRCGIEVEEDECEKCGREPT 674 (1121)
T ss_pred eeCccccCcCCCCcCCCCCCCCC
Confidence 45555555444444555554443
No 177
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=30.21 E-value=9.6 Score=23.73 Aligned_cols=19 Identities=16% Similarity=0.198 Sum_probs=11.4
Q ss_pred CCcccccc--hhhcccccccc
Q psy10074 48 KHKYVCYA--CSYATYYNGNM 66 (420)
Q Consensus 48 ~~~~~C~~--C~~~f~~~~~l 66 (420)
+.-++|.. ||..|....++
T Consensus 23 ~~Y~qC~N~~Cg~tfv~~~~~ 43 (47)
T PF04606_consen 23 ELYCQCTNPECGHTFVANLEF 43 (47)
T ss_pred EEEEEECCCcCCCEEEEEEEE
Confidence 34467765 77777655443
No 178
>KOG2807|consensus
Probab=29.69 E-value=70 Score=29.12 Aligned_cols=19 Identities=21% Similarity=0.424 Sum_probs=11.6
Q ss_pred ceeccccCccccChhHHHH
Q psy10074 171 RYVCFTCDYHSFCSSHMKD 189 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~~ 189 (420)
|..|++|+........|.+
T Consensus 290 P~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 290 PIECPICSLTLVSSPHLAR 308 (378)
T ss_pred CccCCccceeEecchHHHH
Confidence 4566666666666665554
No 179
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=29.22 E-value=58 Score=18.59 Aligned_cols=21 Identities=24% Similarity=0.586 Sum_probs=14.2
Q ss_pred ceecCcCCccccChhhHHHHHH
Q psy10074 348 VHECSECGFTFFLEKTLRAHIR 369 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~L~~H~~ 369 (420)
.+.|+.|++.+. ...+..|+.
T Consensus 4 ~~~C~nC~R~v~-a~RfA~HLe 24 (33)
T PF08209_consen 4 YVECPNCGRPVA-ASRFAPHLE 24 (33)
T ss_dssp EEE-TTTSSEEE-GGGHHHHHH
T ss_pred eEECCCCcCCcc-hhhhHHHHH
Confidence 467888888776 455777776
No 180
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.03 E-value=17 Score=31.18 Aligned_cols=39 Identities=18% Similarity=0.366 Sum_probs=20.2
Q ss_pred CceecCCCCCccCChHHHhhhhhhhcCC---------------cceecCcCCcc
Q psy10074 319 KRYVCQLCPKQFLTSGELKYHQTTVHRK---------------RVHECSECGFT 357 (420)
Q Consensus 319 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~---------------~~~~C~~C~~~ 357 (420)
+.+.||+|+-.|....-+..-+++-.++ ..+.||.|..+
T Consensus 18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA 71 (267)
T COG1655 18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA 71 (267)
T ss_pred ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence 3456666666665544433333333222 12578888864
No 181
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=28.33 E-value=36 Score=21.67 Aligned_cols=24 Identities=29% Similarity=0.603 Sum_probs=10.0
Q ss_pred ceecCCCCCccCChHHHhhhhhhh
Q psy10074 320 RYVCQLCPKQFLTSGELKYHQTTV 343 (420)
Q Consensus 320 ~~~C~~C~~~f~~~~~l~~H~~~h 343 (420)
.|.|+.|...|=..=.+..|...|
T Consensus 21 ~y~C~~C~~~FC~dCD~fiHE~LH 44 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIHETLH 44 (51)
T ss_dssp EE--TTTT--B-HHHHHTTTTTS-
T ss_pred eEECCCCCCccccCcChhhhcccc
Confidence 466666666665555555554443
No 182
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=28.19 E-value=31 Score=18.91 Aligned_cols=11 Identities=36% Similarity=0.933 Sum_probs=5.0
Q ss_pred ceeccccCccc
Q psy10074 171 RYVCFTCDYHS 181 (420)
Q Consensus 171 ~~~C~~C~~~f 181 (420)
.|.|..|++.+
T Consensus 15 ~Y~C~~Cdf~l 25 (30)
T PF07649_consen 15 FYRCSECDFDL 25 (30)
T ss_dssp EEE-TTT----
T ss_pred eEECccCCCcc
Confidence 68888887653
No 183
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=28.10 E-value=35 Score=20.39 Aligned_cols=18 Identities=22% Similarity=0.357 Sum_probs=10.7
Q ss_pred ceeccccCccccChhHHH
Q psy10074 171 RYVCFTCDYHSFCSSHMK 188 (420)
Q Consensus 171 ~~~C~~C~~~f~~~~~l~ 188 (420)
...|+.|+-.+.....|.
T Consensus 19 id~C~~C~G~W~d~~el~ 36 (41)
T PF13453_consen 19 IDVCPSCGGIWFDAGELE 36 (41)
T ss_pred EEECCCCCeEEccHHHHH
Confidence 456666666665555554
No 184
>KOG1044|consensus
Probab=28.00 E-value=30 Score=34.04 Aligned_cols=37 Identities=24% Similarity=0.437 Sum_probs=20.5
Q ss_pred cccccccchhhhccChHHHHHhcccccchhcccccCCcccccchhhc
Q psy10074 13 ADIKCCHCNKYISSDLRDLLEHCRLCPAMTRTDLLKHKYVCYACSYA 59 (420)
Q Consensus 13 ~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~~~~~~~~~~C~~C~~~ 59 (420)
....|..|.+.-.- .-|..+ +.|....=|+|..|+..
T Consensus 15 ~~i~c~~c~~kc~g--evlrv~--------d~~fhi~cf~c~~cg~~ 51 (670)
T KOG1044|consen 15 QGIKCDKCRKKCSG--EVLRVN--------DNHFHINCFQCKKCGRN 51 (670)
T ss_pred cceehhhhCCcccc--ceeEee--------ccccceeeeeccccCCC
Confidence 33567777766655 122222 23334455888888754
No 185
>PRK12496 hypothetical protein; Provisional
Probab=27.89 E-value=29 Score=28.46 Aligned_cols=22 Identities=27% Similarity=0.649 Sum_probs=15.8
Q ss_pred cccccccCCCCCc----ccccccccc
Q psy10074 125 FVCILCNYHIPCK----WCKYCKTKL 146 (420)
Q Consensus 125 ~~C~~C~~~f~~~----~c~~~~~~~ 146 (420)
+.|.-|++.|+.. .||.||..+
T Consensus 128 ~~C~gC~~~~~~~~~~~~C~~CG~~~ 153 (164)
T PRK12496 128 KVCKGCKKKYPEDYPDDVCEICGSPV 153 (164)
T ss_pred EECCCCCccccCCCCCCcCCCCCChh
Confidence 6799999999542 477777443
No 186
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=27.57 E-value=45 Score=19.30 Aligned_cols=31 Identities=16% Similarity=0.424 Sum_probs=16.2
Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccCCCCcccccccccCCCC
Q psy10074 92 CQYCNQYFVRSTQDILQHGKVCNFAPRSDANFNFVCILCNYHIP 135 (420)
Q Consensus 92 C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~~C~~C~~~f~ 135 (420)
|..|.+.+.+.. +-+-|... ..|..||-.+.
T Consensus 2 C~~C~~Ey~~p~-~RR~~~~~------------isC~~CGPr~~ 32 (35)
T PF07503_consen 2 CDDCLKEYFDPS-NRRFHYQF------------ISCTNCGPRYS 32 (35)
T ss_dssp -HHHHHHHCSTT-STTTT-TT--------------BTTCC-SCC
T ss_pred CHHHHHHHcCCC-CCcccCcC------------ccCCCCCCCEE
Confidence 577877777665 43333322 56777876554
No 187
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=27.43 E-value=19 Score=26.64 Aligned_cols=12 Identities=17% Similarity=0.420 Sum_probs=7.0
Q ss_pred cccCcCccccCC
Q psy10074 378 FPCPFCEYKAKR 389 (420)
Q Consensus 378 ~~C~~C~~~f~~ 389 (420)
..|+.|+..+..
T Consensus 43 ~~C~~CG~y~~~ 54 (99)
T PRK14892 43 ITCGNCGLYTEF 54 (99)
T ss_pred EECCCCCCccCE
Confidence 456666665543
No 188
>PRK05580 primosome assembly protein PriA; Validated
Probab=27.37 E-value=36 Score=35.26 Aligned_cols=49 Identities=29% Similarity=0.668 Sum_probs=32.8
Q ss_pred cccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccChhHHHHHHHHhcCCCceeccc
Q psy10074 125 FVCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCSSHMKDHIRVHLGEKPYMCIY 204 (420)
Q Consensus 125 ~~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 204 (420)
..|..||... .|+.|+ ..|..|. ... ...|..||... ..+..|+.
T Consensus 382 ~~C~~Cg~~~---~C~~C~-------~~l~~h~-----~~~-----~l~Ch~Cg~~~---------------~~~~~Cp~ 426 (679)
T PRK05580 382 LLCRDCGWVA---ECPHCD-------ASLTLHR-----FQR-----RLRCHHCGYQE---------------PIPKACPE 426 (679)
T ss_pred eEhhhCcCcc---CCCCCC-------CceeEEC-----CCC-----eEECCCCcCCC---------------CCCCCCCC
Confidence 5699999888 578888 5555553 122 57888888653 24567888
Q ss_pred cccc
Q psy10074 205 CAYR 208 (420)
Q Consensus 205 C~~~ 208 (420)
|+..
T Consensus 427 Cg~~ 430 (679)
T PRK05580 427 CGST 430 (679)
T ss_pred CcCC
Confidence 8653
No 189
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=26.87 E-value=38 Score=34.62 Aligned_cols=22 Identities=23% Similarity=0.744 Sum_probs=9.2
Q ss_pred cccccCCCCCc--cccccccccCC
Q psy10074 127 CILCNYHIPCK--WCKYCKTKLDN 148 (420)
Q Consensus 127 C~~C~~~f~~~--~c~~~~~~~~~ 148 (420)
|+.||...+.. -|+.||..+..
T Consensus 30 Cp~CG~~~~~~~~fC~~CG~~~~~ 53 (645)
T PRK14559 30 CPQCGTEVPVDEAHCPNCGAETGT 53 (645)
T ss_pred CCCCCCCCCcccccccccCCcccc
Confidence 44444433222 34444444443
No 190
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=26.79 E-value=32 Score=27.15 Aligned_cols=11 Identities=18% Similarity=0.594 Sum_probs=5.2
Q ss_pred ccCCccchhcc
Q psy10074 288 FHCHYCDYICR 298 (420)
Q Consensus 288 ~~C~~C~~~f~ 298 (420)
+.|..|+..|.
T Consensus 71 ~~C~~CG~~~~ 81 (135)
T PRK03824 71 LKCRNCGNEWS 81 (135)
T ss_pred EECCCCCCEEe
Confidence 44555554443
No 191
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=26.46 E-value=18 Score=28.56 Aligned_cols=21 Identities=19% Similarity=0.373 Sum_probs=10.8
Q ss_pred ccccccCCCCCc---ccccccccc
Q psy10074 126 VCILCNYHIPCK---WCKYCKTKL 146 (420)
Q Consensus 126 ~C~~C~~~f~~~---~c~~~~~~~ 146 (420)
.|+.||+.|... .|+.|..+.
T Consensus 5 nC~~CgklF~~~~~~iCp~C~~~~ 28 (137)
T TIGR03826 5 NCPKCGRLFVKTGRDVCPSCYEEE 28 (137)
T ss_pred cccccchhhhhcCCccCHHHhHHH
Confidence 355555555432 566666433
No 192
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=26.41 E-value=32 Score=22.26 Aligned_cols=12 Identities=17% Similarity=0.622 Sum_probs=9.8
Q ss_pred ceeccccCcccc
Q psy10074 171 RYVCFTCDYHSF 182 (420)
Q Consensus 171 ~~~C~~C~~~f~ 182 (420)
.|.|+.||..+-
T Consensus 14 ~~~Cp~cGipth 25 (55)
T PF13824_consen 14 NFECPDCGIPTH 25 (55)
T ss_pred CCcCCCCCCcCc
Confidence 789999998764
No 193
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=26.29 E-value=38 Score=19.71 Aligned_cols=9 Identities=22% Similarity=1.040 Sum_probs=4.1
Q ss_pred ecCCCCCcc
Q psy10074 322 VCQLCPKQF 330 (420)
Q Consensus 322 ~C~~C~~~f 330 (420)
.|+.|+.+|
T Consensus 3 ~C~~Cg~~Y 11 (36)
T PF05191_consen 3 ICPKCGRIY 11 (36)
T ss_dssp EETTTTEEE
T ss_pred CcCCCCCcc
Confidence 344444444
No 194
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=26.08 E-value=33 Score=22.07 Aligned_cols=9 Identities=22% Similarity=0.755 Sum_probs=4.4
Q ss_pred ccccccCCC
Q psy10074 126 VCILCNYHI 134 (420)
Q Consensus 126 ~C~~C~~~f 134 (420)
.|..|++.|
T Consensus 7 ~C~~Cg~~~ 15 (54)
T PF14446_consen 7 KCPVCGKKF 15 (54)
T ss_pred cChhhCCcc
Confidence 345555444
No 195
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=25.89 E-value=51 Score=30.12 Aligned_cols=14 Identities=29% Similarity=0.456 Sum_probs=7.5
Q ss_pred cCCcceecCcCCcc
Q psy10074 344 HRKRVHECSECGFT 357 (420)
Q Consensus 344 ~~~~~~~C~~C~~~ 357 (420)
.|.+-..|+.|+-.
T Consensus 206 ~G~RyL~CslC~te 219 (305)
T TIGR01562 206 TGLRYLSCSLCATE 219 (305)
T ss_pred CCceEEEcCCCCCc
Confidence 34455566666544
No 196
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=25.41 E-value=46 Score=20.97 Aligned_cols=16 Identities=25% Similarity=0.740 Sum_probs=7.5
Q ss_pred ceecCcCCccccChhh
Q psy10074 348 VHECSECGFTFFLEKT 363 (420)
Q Consensus 348 ~~~C~~C~~~f~~~~~ 363 (420)
.+.|..||..|..-..
T Consensus 4 ~l~C~dCg~~FvfTa~ 19 (49)
T PF13451_consen 4 TLTCKDCGAEFVFTAG 19 (49)
T ss_pred eEEcccCCCeEEEehh
Confidence 3455555554443333
No 197
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=25.35 E-value=52 Score=22.21 Aligned_cols=9 Identities=44% Similarity=1.390 Sum_probs=3.9
Q ss_pred eecCcCCcc
Q psy10074 349 HECSECGFT 357 (420)
Q Consensus 349 ~~C~~C~~~ 357 (420)
|.|+.||..
T Consensus 47 ~~C~~Cg~~ 55 (69)
T PF07282_consen 47 FTCPNCGFE 55 (69)
T ss_pred EEcCCCCCE
Confidence 444444443
No 198
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=24.77 E-value=9.4 Score=22.69 Aligned_cols=7 Identities=57% Similarity=1.293 Sum_probs=2.7
Q ss_pred eecCCCC
Q psy10074 321 YVCQLCP 327 (420)
Q Consensus 321 ~~C~~C~ 327 (420)
|.|..|+
T Consensus 29 y~C~~C~ 35 (39)
T PF01096_consen 29 YVCCNCG 35 (39)
T ss_dssp EEESSST
T ss_pred EEeCCCC
Confidence 3333333
No 199
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.58 E-value=32 Score=23.75 Aligned_cols=10 Identities=60% Similarity=1.325 Sum_probs=5.1
Q ss_pred eecCcCCccc
Q psy10074 349 HECSECGFTF 358 (420)
Q Consensus 349 ~~C~~C~~~f 358 (420)
|+|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 4555555444
No 200
>KOG1842|consensus
Probab=24.50 E-value=39 Score=32.07 Aligned_cols=24 Identities=21% Similarity=0.528 Sum_probs=11.6
Q ss_pred eecCCCCCccCChHHHhhhhhhhc
Q psy10074 321 YVCQLCPKQFLTSGELKYHQTTVH 344 (420)
Q Consensus 321 ~~C~~C~~~f~~~~~l~~H~~~h~ 344 (420)
|.||+|...|.+...|..|....|
T Consensus 16 flCPiC~~dl~~~~~L~~H~d~eH 39 (505)
T KOG1842|consen 16 FLCPICLLDLPNLSALNDHLDVEH 39 (505)
T ss_pred ccCchHhhhhhhHHHHHHHHhhhc
Confidence 445555555555555555444333
No 201
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=24.49 E-value=44 Score=21.44 Aligned_cols=27 Identities=26% Similarity=0.500 Sum_probs=16.3
Q ss_pred ecCcCCccccChhhHHHHHHhhcCCCCCcccCcCccccCC
Q psy10074 350 ECSECGFTFFLEKTLRAHIRETHRKETPFPCPFCEYKAKR 389 (420)
Q Consensus 350 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~ 389 (420)
.|..|++.|.. ..+.+.|..||..|=.
T Consensus 4 ~C~~C~~~F~~-------------~~rk~~Cr~Cg~~~C~ 30 (57)
T cd00065 4 SCMGCGKPFTL-------------TRRRHHCRNCGRIFCS 30 (57)
T ss_pred cCcccCccccC-------------CccccccCcCcCCcCh
Confidence 46677777763 1234557777766654
No 202
>PF08792 A2L_zn_ribbon: A2L zinc ribbon domain; InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors [].
Probab=24.33 E-value=27 Score=19.90 Aligned_cols=10 Identities=40% Similarity=0.926 Sum_probs=4.4
Q ss_pred cccccccCCC
Q psy10074 125 FVCILCNYHI 134 (420)
Q Consensus 125 ~~C~~C~~~f 134 (420)
+.|..|+.+|
T Consensus 22 ~~C~~Cg~~~ 31 (33)
T PF08792_consen 22 EVCIFCGSSF 31 (33)
T ss_pred EEcccCCcEe
Confidence 3444444443
No 203
>KOG4167|consensus
Probab=24.02 E-value=23 Score=35.82 Aligned_cols=26 Identities=12% Similarity=0.310 Sum_probs=23.0
Q ss_pred ceecccccccccCHHHHHHHHhhccC
Q psy10074 199 PYMCIYCAYRCTQKSDLKKHLRKKHE 224 (420)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~~h~ 224 (420)
.|.|+.|++.|....++..||+.|..
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 49999999999999999999998753
No 204
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=23.72 E-value=35 Score=22.18 Aligned_cols=13 Identities=31% Similarity=0.720 Sum_probs=7.1
Q ss_pred cCCcccccchhhc
Q psy10074 47 LKHKYVCYACSYA 59 (420)
Q Consensus 47 ~~~~~~C~~C~~~ 59 (420)
....|.|+.|...
T Consensus 28 Pi~tYmC~eC~~R 40 (56)
T PF09963_consen 28 PIHTYMCDECKER 40 (56)
T ss_pred CCcceeChhHHHH
Confidence 3444666666543
No 205
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=23.58 E-value=34 Score=21.47 Aligned_cols=11 Identities=27% Similarity=1.050 Sum_probs=9.2
Q ss_pred Ccccccchhhc
Q psy10074 49 HKYVCYACSYA 59 (420)
Q Consensus 49 ~~~~C~~C~~~ 59 (420)
.+|.|..||++
T Consensus 36 dR~~CGkCgyT 46 (51)
T COG1998 36 DRWACGKCGYT 46 (51)
T ss_pred ceeEeccccce
Confidence 37999999986
No 207
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=22.27 E-value=24 Score=25.46 Aligned_cols=12 Identities=33% Similarity=0.996 Sum_probs=7.5
Q ss_pred ceecCcCCcccc
Q psy10074 348 VHECSECGFTFF 359 (420)
Q Consensus 348 ~~~C~~C~~~f~ 359 (420)
.|.|..|++.|+
T Consensus 53 IW~C~~C~~~~A 64 (90)
T PF01780_consen 53 IWKCKKCGKKFA 64 (90)
T ss_dssp EEEETTTTEEEE
T ss_pred EeecCCCCCEEe
Confidence 466666666665
No 208
>PF04438 zf-HIT: HIT zinc finger; InterPro: IPR007529 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the HIT-type zinc finger, which contains 7 conserved cysteines and one histidine that can potentially coordinate two zinc atoms. It has been named after the first protein that originally defined the domain: the yeast HIT1 protein (P46973 from SWISSPROT) []. The HIT-type zinc finger displays some sequence similarities to the MYND-type zinc finger. The function of this domain is unknown but it is mainly found in nuclear proteins involved in gene regulation and chromatin remodeling. This domain is also found in the thyroid receptor interacting protein 3 (TRIP-3) Q15649 from SWISSPROT, that specifically interacts with the ligand binding domain of the thyroid receptor. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2YQP_A 2YQQ_A 1X4S_A.
Probab=22.04 E-value=29 Score=19.26 Aligned_cols=18 Identities=17% Similarity=0.349 Sum_probs=7.9
Q ss_pred cccccccccchhhhhhhh
Q psy10074 81 CTLLICQYCLICQYCNQY 98 (420)
Q Consensus 81 c~~c~~~~~~~C~~C~~~ 98 (420)
|.+|+....|+|+.|+..
T Consensus 5 C~vC~~~~kY~Cp~C~~~ 22 (30)
T PF04438_consen 5 CSVCGNPAKYRCPRCGAR 22 (30)
T ss_dssp ETSSSSEESEE-TTT--E
T ss_pred CccCcCCCEEECCCcCCc
Confidence 444555555556666544
No 209
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=21.86 E-value=35 Score=24.71 Aligned_cols=12 Identities=33% Similarity=0.980 Sum_probs=7.1
Q ss_pred ceecCcCCcccc
Q psy10074 348 VHECSECGFTFF 359 (420)
Q Consensus 348 ~~~C~~C~~~f~ 359 (420)
.|.|..|++.|+
T Consensus 53 IW~C~~C~~~~A 64 (91)
T TIGR00280 53 IWTCRKCGAKFA 64 (91)
T ss_pred EEEcCCCCCEEe
Confidence 456666666555
No 210
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=21.65 E-value=36 Score=34.60 Aligned_cols=60 Identities=17% Similarity=0.388 Sum_probs=39.2
Q ss_pred ccccchhhccccccccchhhhhccCCCCcccccccccccchhhhhhhhhhhhhhhhhhhcccccccccCCCCcccccccc
Q psy10074 51 YVCYACSYATYYNGNMKKHLRIHLGDRPFMCTLLICQYCLICQYCNQYFVRSTQDILQHGKVCNFAPRSDANFNFVCILC 130 (420)
Q Consensus 51 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~c~~c~~~~~~~C~~C~~~f~~~~~~l~~H~~~~~~~~~h~~~~~~~C~~C 130 (420)
..|+.||=.|+-...|--- +..+....|. .|+.|.+++.+.. +-+-|.+. ..|+.|
T Consensus 124 ~~CT~CGPRfTIi~alPYD-R~nTsM~~F~----------lC~~C~~EY~dP~-nRRfHAQp------------~aCp~C 179 (750)
T COG0068 124 INCTNCGPRFTIIEALPYD-RENTSMADFP----------LCPFCDKEYKDPL-NRRFHAQP------------IACPKC 179 (750)
T ss_pred cccCCCCcceeeeccCCCC-cccCccccCc----------CCHHHHHHhcCcc-cccccccc------------ccCccc
Confidence 4688888888777666443 3344444442 3588999998877 55555544 678888
Q ss_pred cCCC
Q psy10074 131 NYHI 134 (420)
Q Consensus 131 ~~~f 134 (420)
|-..
T Consensus 180 GP~~ 183 (750)
T COG0068 180 GPHL 183 (750)
T ss_pred CCCe
Confidence 7644
No 211
>COG1773 Rubredoxin [Energy production and conversion]
Probab=21.50 E-value=34 Score=22.12 Aligned_cols=14 Identities=21% Similarity=0.653 Sum_probs=7.8
Q ss_pred cccccchhhccccc
Q psy10074 50 KYVCYACSYATYYN 63 (420)
Q Consensus 50 ~~~C~~C~~~f~~~ 63 (420)
.|+|..||+.|.-.
T Consensus 3 ~~~C~~CG~vYd~e 16 (55)
T COG1773 3 RWRCSVCGYVYDPE 16 (55)
T ss_pred ceEecCCceEeccc
Confidence 35666666665433
No 212
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=21.30 E-value=22 Score=28.29 Aligned_cols=22 Identities=9% Similarity=0.134 Sum_probs=17.6
Q ss_pred CCcccccchhhccccccccchh
Q psy10074 48 KHKYVCYACSYATYYNGNMKKH 69 (420)
Q Consensus 48 ~~~~~C~~C~~~f~~~~~l~~H 69 (420)
.++-.|..||+.|++-..+..-
T Consensus 26 RRRReC~~C~~RFTTyErve~~ 47 (147)
T TIGR00244 26 RRRRECLECHERFTTFERAELL 47 (147)
T ss_pred eecccCCccCCccceeeecccc
Confidence 3557899999999998877665
No 213
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=20.78 E-value=24 Score=28.13 Aligned_cols=22 Identities=9% Similarity=0.181 Sum_probs=17.1
Q ss_pred CCcccccchhhccccccccchh
Q psy10074 48 KHKYVCYACSYATYYNGNMKKH 69 (420)
Q Consensus 48 ~~~~~C~~C~~~f~~~~~l~~H 69 (420)
.++-.|..|+..|++......=
T Consensus 26 RRRReC~~C~~RFTTfE~~El~ 47 (156)
T COG1327 26 RRRRECLECGERFTTFERAELR 47 (156)
T ss_pred hhhhcccccccccchhheeeec
Confidence 4567899999999988776554
No 214
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=20.75 E-value=80 Score=20.23 Aligned_cols=12 Identities=25% Similarity=0.847 Sum_probs=5.9
Q ss_pred eecCcCCccccC
Q psy10074 349 HECSECGFTFFL 360 (420)
Q Consensus 349 ~~C~~C~~~f~~ 360 (420)
++|+.||..|..
T Consensus 29 W~C~~Cgh~w~~ 40 (55)
T PF14311_consen 29 WKCPKCGHEWKA 40 (55)
T ss_pred EECCCCCCeeEc
Confidence 455555554443
No 215
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=20.70 E-value=63 Score=26.32 Aligned_cols=11 Identities=27% Similarity=0.498 Sum_probs=6.0
Q ss_pred eeccccccccc
Q psy10074 200 YMCIYCAYRCT 210 (420)
Q Consensus 200 ~~C~~C~~~f~ 210 (420)
+.|+.|++.|-
T Consensus 131 ~~C~~CgkiYW 141 (165)
T COG1656 131 YRCPKCGKIYW 141 (165)
T ss_pred eECCCCccccc
Confidence 45555555543
No 216
>PRK04023 DNA polymerase II large subunit; Validated
Probab=20.69 E-value=66 Score=34.30 Aligned_cols=22 Identities=23% Similarity=0.578 Sum_probs=18.1
Q ss_pred ccccccccCCCCCccccccccc
Q psy10074 124 NFVCILCNYHIPCKWCKYCKTK 145 (420)
Q Consensus 124 ~~~C~~C~~~f~~~~c~~~~~~ 145 (420)
...|+.||.......||.||..
T Consensus 626 ~RfCpsCG~~t~~frCP~CG~~ 647 (1121)
T PRK04023 626 RRKCPSCGKETFYRRCPFCGTH 647 (1121)
T ss_pred CccCCCCCCcCCcccCCCCCCC
Confidence 4679999998877789999965
No 217
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=20.57 E-value=56 Score=24.89 Aligned_cols=11 Identities=18% Similarity=0.280 Sum_probs=5.4
Q ss_pred eecCCCCCccC
Q psy10074 321 YVCQLCPKQFL 331 (420)
Q Consensus 321 ~~C~~C~~~f~ 331 (420)
+.|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (113)
T PRK12380 71 AWCWDCSQVVE 81 (113)
T ss_pred EEcccCCCEEe
Confidence 45555554443
No 218
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=20.44 E-value=53 Score=25.08 Aligned_cols=11 Identities=18% Similarity=0.594 Sum_probs=5.5
Q ss_pred eecCCCCCccC
Q psy10074 321 YVCQLCPKQFL 331 (420)
Q Consensus 321 ~~C~~C~~~f~ 331 (420)
..|..|+..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (115)
T TIGR00100 71 CECEDCSEEVS 81 (115)
T ss_pred EEcccCCCEEe
Confidence 45555554443
No 219
>KOG3408|consensus
Probab=20.26 E-value=71 Score=24.40 Aligned_cols=23 Identities=30% Similarity=0.649 Sum_probs=18.3
Q ss_pred ceecccccccccCHHHHHHHHhh
Q psy10074 199 PYMCIYCAYRCTQKSDLKKHLRK 221 (420)
Q Consensus 199 ~~~C~~C~~~f~~~~~l~~H~~~ 221 (420)
.|-|-.|.+.|.+...|..|.++
T Consensus 57 qfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 57 QFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred eeehhhhhhhhcchHHHHHHHhc
Confidence 47888888888888888888764
No 220
>KOG0717|consensus
Probab=20.24 E-value=45 Score=31.96 Aligned_cols=19 Identities=37% Similarity=0.697 Sum_probs=17.8
Q ss_pred cccccchhhhccChHHHHHh
Q psy10074 15 IKCCHCNKYISSDLRDLLEH 34 (420)
Q Consensus 15 ~~C~~C~~~f~~~~~~l~~H 34 (420)
+.|.+|+++|.+ ..+|..|
T Consensus 293 lyC~vCnKsFKs-eKq~kNH 311 (508)
T KOG0717|consen 293 LYCVVCNKSFKS-EKQLKNH 311 (508)
T ss_pred eEEeeccccccc-hHHHHhh
Confidence 789999999999 9999999
No 221
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.03 E-value=36 Score=27.33 Aligned_cols=52 Identities=15% Similarity=0.340 Sum_probs=27.3
Q ss_pred ccccccCCCCCccccccccccCCchhhhhhcccccccccCCCCCcceeccccCccccCh
Q psy10074 126 VCILCNYHIPCKWCKYCKTKLDNNTEVLLQHCKLCVNVIRPDVSYRYVCFTCDYHSFCS 184 (420)
Q Consensus 126 ~C~~C~~~f~~~~c~~~~~~~~~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~ 184 (420)
-|..||...-.. ||.|+..+.+. .|..-+-. ....-..|.-|..||+.|+.-
T Consensus 30 fC~kCG~~tI~~-Cp~C~~~IrG~-----y~v~gv~~-~g~~~~~PsYC~~CGkpyPWt 81 (158)
T PF10083_consen 30 FCSKCGAKTITS-CPNCSTPIRGD-----YHVEGVFG-LGGHYEAPSYCHNCGKPYPWT 81 (158)
T ss_pred HHHHhhHHHHHH-CcCCCCCCCCc-----eecCCeee-eCCCCCCChhHHhCCCCCchH
Confidence 377777766664 88888777663 22211110 001111255677777777643
Done!