Query psy3491
Match_columns 108
No_of_seqs 118 out of 1296
Neff 10.5
Searched_HMMs 46136
Date Fri Aug 16 18:04:25 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3491.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3491hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.8 4E-19 8.7E-24 110.3 3.1 86 13-99 163-264 (279)
2 KOG2462|consensus 99.7 1.3E-18 2.9E-23 108.0 4.4 97 11-108 130-245 (279)
3 KOG3623|consensus 99.4 1.1E-13 2.4E-18 95.8 2.5 91 8-99 237-330 (1007)
4 KOG3623|consensus 99.3 3.2E-13 6.9E-18 93.6 1.6 75 13-99 896-971 (1007)
5 KOG3576|consensus 99.3 1.7E-13 3.7E-18 82.4 0.1 81 12-104 118-199 (267)
6 KOG3576|consensus 99.1 3.5E-11 7.7E-16 72.6 1.3 62 46-108 114-175 (267)
7 PHA02768 hypothetical protein; 99.0 1.1E-10 2.3E-15 56.6 1.7 44 49-95 5-48 (55)
8 PHA00733 hypothetical protein 99.0 3.3E-10 7.1E-15 64.8 3.7 55 46-103 70-124 (128)
9 KOG1074|consensus 98.9 1.5E-10 3.3E-15 81.5 0.3 58 49-107 353-410 (958)
10 KOG3608|consensus 98.8 5.2E-09 1.1E-13 68.0 4.0 83 8-102 259-345 (467)
11 KOG1074|consensus 98.8 2.1E-09 4.6E-14 75.9 1.4 58 49-107 879-936 (958)
12 PHA00616 hypothetical protein 98.8 4.9E-09 1.1E-13 48.5 1.7 32 50-82 2-33 (44)
13 KOG3608|consensus 98.6 1.3E-08 2.9E-13 66.1 1.0 90 13-102 209-316 (467)
14 PF13465 zf-H2C2_2: Zinc-finge 98.6 4.8E-08 1E-12 40.6 2.1 26 64-90 1-26 (26)
15 PHA00732 hypothetical protein 98.5 6.1E-08 1.3E-12 50.9 1.6 45 49-99 1-45 (79)
16 PHA00616 hypothetical protein 98.4 1.3E-07 2.8E-12 43.8 1.7 30 78-107 1-30 (44)
17 KOG3993|consensus 98.4 4.2E-08 9E-13 65.0 -0.5 80 13-104 269-382 (500)
18 PHA02768 hypothetical protein; 98.3 3.3E-07 7.2E-12 44.5 2.0 24 79-102 6-29 (55)
19 PF05605 zf-Di19: Drought indu 98.3 9.8E-07 2.1E-11 43.1 3.6 52 49-103 2-54 (54)
20 PLN03086 PRLI-interacting fact 98.3 2E-06 4.3E-11 59.8 5.2 59 13-88 455-514 (567)
21 PF00096 zf-C2H2: Zinc finger, 98.2 7.5E-07 1.6E-11 35.8 1.7 22 79-100 1-22 (23)
22 PF13465 zf-H2C2_2: Zinc-finge 98.1 2E-06 4.4E-11 35.6 1.5 25 26-61 2-26 (26)
23 PF13894 zf-C2H2_4: C2H2-type 98.1 3E-06 6.6E-11 34.0 2.0 24 79-102 1-24 (24)
24 PF00096 zf-C2H2: Zinc finger, 98.0 3.7E-06 8E-11 33.7 1.8 23 50-72 1-23 (23)
25 PF09237 GAGA: GAGA factor; I 98.0 4.1E-06 8.8E-11 39.7 1.5 31 77-107 23-53 (54)
26 PF13912 zf-C2H2_6: C2H2-type 98.0 2.8E-06 6.2E-11 35.4 0.7 24 79-102 2-25 (27)
27 PLN03086 PRLI-interacting fact 97.9 3.1E-05 6.7E-10 54.1 5.4 77 12-103 479-565 (567)
28 PHA00733 hypothetical protein 97.9 8E-06 1.7E-10 46.8 1.9 55 46-101 37-96 (128)
29 PF13912 zf-C2H2_6: C2H2-type 97.8 1.1E-05 2.5E-10 33.5 1.0 26 49-74 1-26 (27)
30 PF12756 zf-C2H2_2: C2H2 type 97.7 2.2E-05 4.7E-10 42.6 2.1 72 13-101 1-73 (100)
31 PF13894 zf-C2H2_4: C2H2-type 97.7 3.3E-05 7.1E-10 30.8 1.9 24 50-73 1-24 (24)
32 smart00355 ZnF_C2H2 zinc finge 97.6 9.5E-05 2.1E-09 29.9 2.5 24 79-102 1-24 (26)
33 PF05605 zf-Di19: Drought indu 97.4 0.00058 1.3E-08 33.2 4.4 52 11-73 2-53 (54)
34 COG5189 SFP1 Putative transcri 97.4 4.8E-05 1E-09 49.4 0.5 53 46-99 346-419 (423)
35 KOG3993|consensus 97.3 3.8E-05 8.2E-10 51.5 -0.8 53 50-103 268-320 (500)
36 smart00355 ZnF_C2H2 zinc finge 97.0 0.00045 9.7E-09 27.7 1.5 24 50-73 1-24 (26)
37 PF13909 zf-H2C2_5: C2H2-type 96.9 0.00081 1.8E-08 26.9 1.8 23 79-102 1-23 (24)
38 PHA00732 hypothetical protein 96.9 0.00077 1.7E-08 35.4 2.0 25 78-102 1-26 (79)
39 PF12874 zf-met: Zinc-finger o 96.9 0.0005 1.1E-08 27.8 0.9 22 79-100 1-22 (25)
40 PF09237 GAGA: GAGA factor; I 96.7 0.0024 5.1E-08 30.5 2.5 32 46-77 21-52 (54)
41 PRK04860 hypothetical protein; 96.6 0.0021 4.5E-08 38.3 2.7 38 49-91 119-156 (160)
42 PF12874 zf-met: Zinc-finger o 96.5 0.0018 3.8E-08 26.1 1.3 23 50-72 1-23 (25)
43 PF13909 zf-H2C2_5: C2H2-type 96.2 0.0049 1.1E-07 24.6 1.9 23 50-73 1-23 (24)
44 PF12171 zf-C2H2_jaz: Zinc-fin 96.1 0.0012 2.7E-08 27.3 -0.3 21 79-99 2-22 (27)
45 PF12756 zf-C2H2_2: C2H2 type 96.0 0.0045 9.7E-08 33.4 1.8 25 49-73 50-74 (100)
46 COG5189 SFP1 Putative transcri 96.0 0.0041 9E-08 40.7 1.7 59 12-70 350-419 (423)
47 PF13913 zf-C2HC_2: zinc-finge 95.9 0.0091 2E-07 24.2 1.9 20 79-99 3-22 (25)
48 smart00451 ZnF_U1 U1-like zinc 94.8 0.021 4.6E-07 24.8 1.4 21 79-99 4-24 (35)
49 KOG2893|consensus 93.3 0.018 4E-07 36.4 -0.5 49 48-102 10-58 (341)
50 PRK04860 hypothetical protein; 92.4 0.085 1.8E-06 31.5 1.5 27 78-108 119-145 (160)
51 cd00350 rubredoxin_like Rubred 92.1 0.12 2.5E-06 22.4 1.4 24 50-86 2-25 (33)
52 KOG2186|consensus 92.1 0.13 2.9E-06 32.8 2.1 54 11-77 3-56 (276)
53 COG5048 FOG: Zn-finger [Genera 91.4 0.029 6.2E-07 37.6 -1.4 58 48-106 288-351 (467)
54 COG4049 Uncharacterized protei 89.0 0.38 8.3E-06 23.4 1.7 31 44-74 12-42 (65)
55 PF04959 ARS2: Arsenite-resist 88.5 0.15 3.2E-06 32.0 0.1 26 79-104 78-103 (214)
56 PF13719 zinc_ribbon_5: zinc-r 88.1 0.3 6.5E-06 21.7 1.0 34 50-89 3-36 (37)
57 cd00729 rubredoxin_SM Rubredox 87.7 0.27 5.9E-06 21.4 0.7 25 49-86 2-26 (34)
58 PF05443 ROS_MUCR: ROS/MUCR tr 86.2 0.37 8E-06 27.9 0.9 24 79-105 73-96 (132)
59 smart00614 ZnF_BED BED zinc fi 86.1 0.71 1.5E-05 21.8 1.8 21 79-99 19-44 (50)
60 PF13717 zinc_ribbon_4: zinc-r 85.8 0.42 9.1E-06 21.1 0.8 33 50-88 3-35 (36)
61 TIGR00373 conserved hypothetic 85.6 0.43 9.3E-06 28.4 1.0 34 47-90 107-140 (158)
62 TIGR02098 MJ0042_CXXC MJ0042 f 85.3 0.46 1E-05 21.0 0.8 33 50-88 3-35 (38)
63 COG2888 Predicted Zn-ribbon RN 84.8 0.93 2E-05 22.4 1.8 32 49-86 27-58 (61)
64 PRK06266 transcription initiat 84.3 0.51 1.1E-05 28.7 0.9 34 47-90 115-148 (178)
65 PRK00464 nrdR transcriptional 84.3 0.14 3E-06 30.4 -1.5 15 50-64 29-43 (154)
66 PF09538 FYDLN_acid: Protein o 83.4 0.79 1.7E-05 25.6 1.4 30 50-91 10-39 (108)
67 COG1592 Rubrerythrin [Energy p 83.0 0.77 1.7E-05 27.7 1.3 24 49-86 134-157 (166)
68 TIGR02605 CxxC_CxxC_SSSS putat 82.8 0.41 8.9E-06 22.7 0.1 30 49-86 5-34 (52)
69 PF02892 zf-BED: BED zinc fing 82.6 1.4 3E-05 20.1 1.9 22 79-100 17-42 (45)
70 smart00531 TFIIE Transcription 82.5 1 2.2E-05 26.5 1.7 39 46-89 96-134 (147)
71 KOG1146|consensus 82.5 0.93 2E-05 35.5 1.8 52 47-99 463-539 (1406)
72 smart00834 CxxC_CXXC_SSSS Puta 81.9 0.47 1E-05 21.2 0.1 30 49-86 5-34 (41)
73 TIGR00622 ssl1 transcription f 81.7 2 4.4E-05 24.1 2.6 84 13-102 17-105 (112)
74 smart00734 ZnF_Rad18 Rad18-lik 81.4 1.7 3.8E-05 17.6 1.7 20 79-99 2-21 (26)
75 PF12013 DUF3505: Protein of u 79.8 1.7 3.8E-05 24.0 2.0 25 79-103 81-109 (109)
76 KOG4173|consensus 79.1 1.2 2.6E-05 27.8 1.2 53 50-103 107-171 (253)
77 smart00659 RPOLCX RNA polymera 78.0 1.8 3.9E-05 20.0 1.4 27 49-87 2-28 (44)
78 KOG2186|consensus 77.8 1.1 2.5E-05 28.7 0.9 46 50-99 4-49 (276)
79 PF09986 DUF2225: Uncharacteri 77.3 0.34 7.4E-06 30.3 -1.5 11 79-89 49-59 (214)
80 PF13878 zf-C2H2_3: zinc-finge 77.2 2.2 4.7E-05 19.4 1.5 24 50-73 14-39 (41)
81 PF09723 Zn-ribbon_8: Zinc rib 76.1 1.1 2.4E-05 20.4 0.4 30 49-86 5-34 (42)
82 PF04959 ARS2: Arsenite-resist 75.7 2.6 5.5E-05 26.6 2.0 29 46-74 74-102 (214)
83 PF08790 zf-LYAR: LYAR-type C2 75.1 0.57 1.2E-05 19.5 -0.7 11 50-60 1-11 (28)
84 PF07754 DUF1610: Domain of un 73.9 1.6 3.4E-05 17.5 0.5 10 48-57 15-24 (24)
85 KOG2785|consensus 72.4 4 8.6E-05 27.9 2.4 51 49-100 166-242 (390)
86 PRK00398 rpoP DNA-directed RNA 72.2 1.4 2.9E-05 20.4 0.2 29 49-88 3-31 (46)
87 TIGR02300 FYDLN_acid conserved 71.5 2.8 6.1E-05 24.1 1.3 30 50-91 10-39 (129)
88 PRK09678 DNA-binding transcrip 68.4 1.4 2.9E-05 22.8 -0.3 15 49-63 27-43 (72)
89 KOG4167|consensus 68.4 2 4.3E-05 31.9 0.4 25 49-73 792-816 (907)
90 KOG1146|consensus 67.9 3.9 8.4E-05 32.4 1.8 56 47-103 1282-1353(1406)
91 PRK14890 putative Zn-ribbon RN 67.8 4.1 8.9E-05 20.2 1.3 32 49-86 25-56 (59)
92 PF13451 zf-trcl: Probable zin 67.8 3.5 7.5E-05 19.6 1.0 19 47-65 2-20 (49)
93 PF12013 DUF3505: Protein of u 66.0 5 0.00011 22.1 1.7 26 49-74 80-109 (109)
94 COG1996 RPC10 DNA-directed RNA 65.1 4 8.7E-05 19.4 1.0 28 48-86 5-32 (49)
95 PF14353 CpXC: CpXC protein 65.0 6.7 0.00015 22.3 2.1 17 49-65 38-54 (128)
96 PF09963 DUF2197: Uncharacteri 64.8 2.7 5.7E-05 20.6 0.3 36 51-86 4-39 (56)
97 KOG4173|consensus 64.5 4.9 0.00011 25.2 1.5 51 48-102 78-130 (253)
98 PF06524 NOA36: NOA36 protein; 63.6 7 0.00015 25.5 2.1 71 13-87 99-180 (314)
99 COG4957 Predicted transcriptio 61.6 3.6 7.8E-05 23.9 0.5 24 79-105 77-100 (148)
100 COG1571 Predicted DNA-binding 61.2 4.3 9.4E-05 28.2 0.9 15 49-63 367-381 (421)
101 KOG2593|consensus 60.9 4.4 9.6E-05 28.2 0.9 39 45-87 124-162 (436)
102 COG5151 SSL1 RNA polymerase II 60.9 4.2 9E-05 27.2 0.8 48 51-101 364-411 (421)
103 PHA00626 hypothetical protein 60.1 4.5 9.7E-05 19.8 0.6 17 47-63 21-37 (59)
104 COG5048 FOG: Zn-finger [Genera 59.8 7 0.00015 26.2 1.8 58 47-105 31-90 (467)
105 PF10571 UPF0547: Uncharacteri 58.7 6 0.00013 16.0 0.8 11 79-89 15-25 (26)
106 COG2331 Uncharacterized protei 57.7 5.2 0.00011 20.9 0.7 31 49-87 12-42 (82)
107 PF02176 zf-TRAF: TRAF-type zi 56.5 5 0.00011 19.3 0.5 39 49-89 9-53 (60)
108 PRK03824 hypA hydrogenase nick 54.7 4.8 0.0001 23.4 0.3 15 48-62 69-83 (135)
109 KOG2071|consensus 54.0 7.5 0.00016 28.2 1.2 28 47-74 416-443 (579)
110 KOG4167|consensus 54.0 2.6 5.6E-05 31.3 -1.0 25 79-103 793-817 (907)
111 KOG2231|consensus 52.7 14 0.00029 27.5 2.3 48 51-103 184-237 (669)
112 PF10537 WAC_Acf1_DNA_bd: ATP- 51.7 22 0.00047 19.7 2.5 38 48-87 2-39 (102)
113 PF08274 PhnA_Zn_Ribbon: PhnA 51.1 6 0.00013 16.7 0.2 7 79-85 20-26 (30)
114 smart00154 ZnF_AN1 AN1-like Zi 50.7 7.2 0.00016 17.4 0.5 13 49-61 12-24 (39)
115 PF09845 DUF2072: Zn-ribbon co 50.6 6.3 0.00014 22.8 0.4 15 49-63 1-15 (131)
116 KOG3408|consensus 50.6 14 0.00031 21.1 1.7 26 46-71 54-79 (129)
117 PF10013 DUF2256: Uncharacteri 47.1 12 0.00027 17.1 1.0 14 79-92 9-22 (42)
118 COG4306 Uncharacterized protei 46.8 5.1 0.00011 23.1 -0.4 15 48-62 67-81 (160)
119 KOG2636|consensus 46.7 15 0.00032 25.9 1.7 29 70-99 394-423 (497)
120 TIGR00100 hypA hydrogenase nic 46.7 11 0.00024 21.2 0.9 25 49-86 70-94 (115)
121 KOG2807|consensus 46.5 25 0.00054 23.9 2.6 16 13-28 292-308 (378)
122 KOG0717|consensus 46.4 11 0.00024 26.6 1.1 21 79-99 293-313 (508)
123 COG3364 Zn-ribbon containing p 45.5 10 0.00023 21.0 0.7 15 49-63 2-16 (112)
124 KOG2231|consensus 45.5 42 0.00092 25.1 3.8 52 11-82 182-240 (669)
125 PF04780 DUF629: Protein of un 44.9 18 0.00039 25.7 1.9 22 79-100 58-79 (466)
126 KOG0978|consensus 43.8 2.1 4.6E-05 31.5 -2.7 15 79-93 679-693 (698)
127 PRK12380 hydrogenase nickel in 43.8 15 0.00032 20.7 1.1 25 49-86 70-94 (113)
128 KOG2071|consensus 42.7 19 0.00041 26.3 1.7 27 76-103 417-443 (579)
129 KOG2482|consensus 42.6 16 0.00034 24.9 1.3 51 49-100 144-217 (423)
130 PF01428 zf-AN1: AN1-like Zinc 42.1 9.4 0.0002 17.3 0.2 14 48-61 12-25 (43)
131 PF04423 Rad50_zn_hook: Rad50 41.1 10 0.00022 18.0 0.2 12 80-91 22-33 (54)
132 PF15269 zf-C2H2_7: Zinc-finge 40.9 23 0.0005 16.5 1.3 16 50-65 21-36 (54)
133 PF14311 DUF4379: Domain of un 40.1 25 0.00054 16.7 1.5 14 49-62 28-41 (55)
134 PF08209 Sgf11: Sgf11 (transcr 40.1 26 0.00056 15.1 1.4 21 49-70 4-24 (33)
135 COG5236 Uncharacterized conser 39.5 33 0.00072 23.6 2.4 24 51-74 222-245 (493)
136 PF12907 zf-met2: Zinc-binding 38.7 7.9 0.00017 17.6 -0.4 24 51-74 3-29 (40)
137 PRK00564 hypA hydrogenase nick 37.1 19 0.00041 20.3 0.9 27 49-87 71-97 (117)
138 PF07282 OrfB_Zn_ribbon: Putat 36.4 20 0.00043 17.8 0.8 12 79-90 47-58 (69)
139 PF01363 FYVE: FYVE zinc finge 36.1 16 0.00034 18.2 0.5 11 50-60 10-20 (69)
140 PLN02294 cytochrome c oxidase 36.0 16 0.00035 22.2 0.5 17 46-62 138-154 (174)
141 PF11931 DUF3449: Domain of un 34.8 13 0.00028 23.2 0.0 21 76-97 100-121 (196)
142 cd00924 Cyt_c_Oxidase_Vb Cytoc 33.8 19 0.00041 19.8 0.5 15 47-61 77-91 (97)
143 KOG1842|consensus 32.8 32 0.0007 24.4 1.6 26 49-74 15-40 (505)
144 PF04780 DUF629: Protein of un 31.5 40 0.00086 24.1 1.9 26 48-73 56-81 (466)
145 PF01155 HypA: Hydrogenase exp 31.4 23 0.00049 19.8 0.6 26 49-87 70-95 (113)
146 cd00065 FYVE FYVE domain; Zinc 31.4 38 0.00082 15.9 1.4 8 52-59 5-12 (57)
147 COG4338 Uncharacterized protei 31.2 17 0.00037 17.2 0.1 12 80-91 14-25 (54)
148 PF10276 zf-CHCC: Zinc-finger 30.6 18 0.00038 16.4 0.1 10 49-58 29-38 (40)
149 PF10083 DUF2321: Uncharacteri 30.6 6.9 0.00015 23.4 -1.6 15 48-62 67-81 (158)
150 KOG3014|consensus 30.5 45 0.00097 21.7 1.9 32 43-74 28-64 (257)
151 COG1773 Rubredoxin [Energy pro 30.1 20 0.00043 17.5 0.2 14 49-62 3-16 (55)
152 PF06397 Desulfoferrod_N: Desu 29.8 26 0.00056 15.5 0.5 12 48-59 5-16 (36)
153 smart00064 FYVE Protein presen 29.7 36 0.00077 16.8 1.1 10 51-60 12-21 (68)
154 KOG1842|consensus 29.5 34 0.00074 24.3 1.3 26 79-104 16-41 (505)
155 PLN02748 tRNA dimethylallyltra 28.4 48 0.001 23.7 1.9 23 76-99 417-440 (468)
156 PHA02998 RNA polymerase subuni 27.9 20 0.00043 22.1 -0.0 38 50-91 144-184 (195)
157 PRK05452 anaerobic nitric oxid 27.8 21 0.00044 25.4 0.1 41 46-87 422-467 (479)
158 COG3677 Transposase and inacti 26.7 37 0.0008 19.6 1.0 14 75-89 51-64 (129)
159 cd00730 rubredoxin Rubredoxin; 26.6 33 0.00071 16.3 0.6 12 79-90 2-13 (50)
160 PF00301 Rubredoxin: Rubredoxi 26.2 25 0.00055 16.4 0.2 36 50-86 2-42 (47)
161 COG1675 TFA1 Transcription ini 26.2 43 0.00094 20.5 1.2 32 47-88 111-142 (176)
162 COG5188 PRP9 Splicing factor 3 26.0 41 0.0009 23.2 1.2 28 70-98 367-395 (470)
163 COG3357 Predicted transcriptio 25.1 24 0.00053 19.1 0.0 28 47-85 56-83 (97)
164 PF01215 COX5B: Cytochrome c o 25.0 29 0.00063 20.3 0.3 16 47-62 110-125 (136)
165 COG4391 Uncharacterized protei 25.0 29 0.00063 17.3 0.3 40 45-89 20-59 (62)
166 PF02891 zf-MIZ: MIZ/SP-RING z 24.7 58 0.0013 15.3 1.3 8 79-86 42-49 (50)
167 PF02591 DUF164: Putative zinc 24.7 71 0.0015 15.2 1.7 33 50-87 23-55 (56)
168 COG1326 Uncharacterized archae 24.6 66 0.0014 20.2 1.8 10 79-88 31-40 (201)
169 PRK04351 hypothetical protein; 23.8 44 0.00095 19.8 0.9 32 49-89 112-143 (149)
170 KOG1280|consensus 23.1 90 0.0019 21.5 2.3 38 48-85 78-116 (381)
171 COG2879 Uncharacterized small 23.1 1E+02 0.0022 15.6 2.0 18 90-107 24-41 (65)
172 PF03194 LUC7: LUC7 N_terminus 23.1 35 0.00076 22.2 0.5 7 50-56 191-197 (254)
173 PF12773 DZR: Double zinc ribb 22.9 60 0.0013 14.8 1.2 9 79-87 30-38 (50)
174 PTZ00448 hypothetical protein; 22.8 66 0.0014 22.3 1.7 22 79-100 315-336 (373)
175 PF07503 zf-HYPF: HypF finger; 22.6 17 0.00037 15.9 -0.7 10 79-88 22-31 (35)
176 PF10263 SprT-like: SprT-like 22.3 22 0.00047 20.8 -0.6 31 49-88 123-153 (157)
177 PRK00762 hypA hydrogenase nick 22.3 39 0.00085 19.3 0.5 31 49-86 70-100 (124)
178 PF07975 C1_4: TFIIH C1-like d 22.2 12 0.00026 18.0 -1.4 23 47-69 19-41 (51)
179 PF01286 XPA_N: XPA protein N- 21.4 42 0.0009 14.6 0.4 14 50-63 4-17 (34)
180 KOG1994|consensus 21.4 61 0.0013 20.9 1.3 23 74-97 236-258 (268)
181 COG0675 Transposase and inacti 21.2 45 0.00097 21.8 0.7 15 76-91 321-335 (364)
182 COG1198 PriA Primosomal protei 20.9 29 0.00063 26.2 -0.3 6 13-18 437-442 (730)
183 TIGR03831 YgiT_finger YgiT-typ 20.4 38 0.00082 15.0 0.2 13 49-61 32-44 (46)
184 PRK14892 putative transcriptio 20.3 35 0.00076 18.8 0.1 34 48-89 20-53 (99)
185 PF14447 Prok-RING_4: Prokaryo 20.3 60 0.0013 15.9 0.8 11 80-90 41-51 (55)
186 PRK08222 hydrogenase 4 subunit 20.1 77 0.0017 19.3 1.5 19 79-97 115-133 (181)
No 1
>KOG2462|consensus
Probab=99.76 E-value=4e-19 Score=110.28 Aligned_cols=86 Identities=24% Similarity=0.415 Sum_probs=55.6
Q ss_pred cCCCCC-CCcChhHHHHHhhhcC-------CCCCcc--------ccccCCCceeeccchhhhccCchHHHHHHhHHhcCC
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFS-------SGASFS--------SFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKE 76 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~-------~~~~~~--------~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~ 76 (108)
.|++|. .+...+.|.-|++++. .++.+. .+.++|+|||.|..|+++|...++|+.|+++|.+.+
T Consensus 163 ~C~~C~K~YvSmpALkMHirTH~l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K 242 (279)
T KOG2462|consen 163 SCKYCGKVYVSMPALKMHIRTHTLPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVK 242 (279)
T ss_pred cCCCCCceeeehHHHhhHhhccCCCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCc
Confidence 677777 6777777777766553 111211 123466777777777777777777777777777666
Q ss_pred CCccCCCCccccCCchhHHHHHH
Q psy3491 77 AGLQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 77 ~~~~c~~C~~~f~~~~~l~~H~~ 99 (108)
+ |.|..|+|.|...+.|.+|..
T Consensus 243 ~-~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 243 K-HQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred c-ccCcchhhHHHHHHHHHHhhh
Confidence 5 777777777777777777664
No 2
>KOG2462|consensus
Probab=99.75 E-value=1.3e-18 Score=107.97 Aligned_cols=97 Identities=20% Similarity=0.312 Sum_probs=81.4
Q ss_pred CccCCCCC-CCcChhHHHHHhhhcCCC------------------CCccccccCCCceeeccchhhhccCchHHHHHHhH
Q psy3491 11 NIVDPVCP-AFVDAPWLYFLLYSFSSG------------------ASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKF 71 (108)
Q Consensus 11 ~~~~~~c~-~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~ 71 (108)
.+.|+.|. .+....+|..|...+... .++.+|..++.-+++|.+||+.|.+..-|+.|+++
T Consensus 130 r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l~c~C~iCGKaFSRPWLLQGHiRT 209 (279)
T KOG2462|consen 130 RYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTLPCECGICGKAFSRPWLLQGHIRT 209 (279)
T ss_pred ceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCCCcccccccccccchHHhhccccc
Confidence 34889998 788888887775544322 23335666677799999999999999999999999
Q ss_pred HhcCCCCccCCCCccccCCchhHHHHHHHhcCCCCCC
Q psy3491 72 ECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQNGLHV 108 (108)
Q Consensus 72 ~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~~ 108 (108)
|+|+++ |.|+.|+|.|.-+++|+.|+++|.+.|.|.
T Consensus 210 HTGEKP-F~C~hC~kAFADRSNLRAHmQTHS~~K~~q 245 (279)
T KOG2462|consen 210 HTGEKP-FSCPHCGKAFADRSNLRAHMQTHSDVKKHQ 245 (279)
T ss_pred ccCCCC-ccCCcccchhcchHHHHHHHHhhcCCcccc
Confidence 999988 999999999999999999999999999874
No 3
>KOG3623|consensus
Probab=99.40 E-value=1.1e-13 Score=95.82 Aligned_cols=91 Identities=22% Similarity=0.419 Sum_probs=73.4
Q ss_pred CccCccCCCCC-CCcChhHHHHHhhhcCCCCCcccc--ccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCC
Q psy3491 8 RTVNIVDPVCP-AFVDAPWLYFLLYSFSSGASFSSF--KPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFC 84 (108)
Q Consensus 8 ~~~~~~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C 84 (108)
...|+-|+.|. .|.-...|..|+..+..+..-... ...+.+.|+|..||++|..+-.|..|+++|.|+++ |.|+.|
T Consensus 237 ne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKP-feCpnC 315 (1007)
T KOG3623|consen 237 NEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKP-FECPNC 315 (1007)
T ss_pred CCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchhhhhHHHHHhhheeecCCCC-cCCccc
Confidence 35677899998 888888998998887665433222 22356779999999999999999999999999887 999999
Q ss_pred ccccCCchhHHHHHH
Q psy3491 85 PYKSKHKSNLKTHMA 99 (108)
Q Consensus 85 ~~~f~~~~~l~~H~~ 99 (108)
+|.|....++..|+.
T Consensus 316 kKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 316 KKRFSHSGSYSSHMS 330 (1007)
T ss_pred ccccccCCccccccc
Confidence 999999888888864
No 4
>KOG3623|consensus
Probab=99.34 E-value=3.2e-13 Score=93.59 Aligned_cols=75 Identities=27% Similarity=0.483 Sum_probs=55.9
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCc
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHK 91 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~ 91 (108)
.|..|. +|...+.|..|-+-+ +|.+||+|.+|.++|..+-.|..|++.|.|+++ |.|+-|+|.|...
T Consensus 896 aCDqCDK~FqKqSSLaRHKYEH-----------sGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKP-fQCdKClKRFSHS 963 (1007)
T KOG3623|consen 896 ACDQCDKAFQKQSSLARHKYEH-----------SGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKP-FQCDKCLKRFSHS 963 (1007)
T ss_pred hHHHHHHHHHhhHHHHHhhhhh-----------cCCCCcccchhhHhhhhhhhhhhhhhhccCCCc-chhhhhhhhcccc
Confidence 666666 666666666664433 677788888888888888888888888888776 8888888888888
Q ss_pred hhHHHHHH
Q psy3491 92 SNLKTHMA 99 (108)
Q Consensus 92 ~~l~~H~~ 99 (108)
.++..|+-
T Consensus 964 GSYSQHMN 971 (1007)
T KOG3623|consen 964 GSYSQHMN 971 (1007)
T ss_pred cchHhhhc
Confidence 77777763
No 5
>KOG3576|consensus
Probab=99.33 E-value=1.7e-13 Score=82.38 Aligned_cols=81 Identities=21% Similarity=0.365 Sum_probs=66.0
Q ss_pred ccCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCC
Q psy3491 12 IVDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKH 90 (108)
Q Consensus 12 ~~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~ 90 (108)
+.|-+|. .|.-...|..|++++ ...+.+.|..||+.|....+|.+|.++|++.++ |.|..|++.|.+
T Consensus 118 ftCrvCgK~F~lQRmlnrh~kch-----------~~vkr~lct~cgkgfndtfdlkrh~rthtgvrp-ykc~~c~kaftq 185 (267)
T KOG3576|consen 118 FTCRVCGKKFGLQRMLNRHLKCH-----------SDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRP-YKCSLCEKAFTQ 185 (267)
T ss_pred eeeehhhhhhhHHHHHHHHhhhc-----------cHHHHHHHhhccCcccchhhhhhhhccccCccc-cchhhhhHHHHh
Confidence 4788888 788778888888877 345668888888888888888888888888776 888888888888
Q ss_pred chhHHHHHHHhcCC
Q psy3491 91 KSNLKTHMAIRHQN 104 (108)
Q Consensus 91 ~~~l~~H~~~h~~~ 104 (108)
+-+|..|.+..|+.
T Consensus 186 rcsleshl~kvhgv 199 (267)
T KOG3576|consen 186 RCSLESHLKKVHGV 199 (267)
T ss_pred hccHHHHHHHHcCc
Confidence 88888888776664
No 6
>KOG3576|consensus
Probab=99.08 E-value=3.5e-11 Score=72.57 Aligned_cols=62 Identities=23% Similarity=0.364 Sum_probs=57.9
Q ss_pred CCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhcCCCCCC
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQNGLHV 108 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~~ 108 (108)
+...|.|..|++.|.-...|.+|++.|...+. +.|..||+-|+-...|++|.|+|+|.+||.
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr-~lct~cgkgfndtfdlkrh~rthtgvrpyk 175 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKR-HLCTFCGKGFNDTFDLKRHTRTHTGVRPYK 175 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHH-HHHhhccCcccchhhhhhhhccccCccccc
Confidence 45679999999999999999999999999888 999999999999999999999999999983
No 7
>PHA02768 hypothetical protein; Provisional
Probab=99.03 E-value=1.1e-10 Score=56.57 Aligned_cols=44 Identities=18% Similarity=0.285 Sum_probs=39.2
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHH
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLK 95 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~ 95 (108)
.|.|+.||+.|...+.|..|++.|. ++ +.|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~-~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TN-LKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--Cc-ccCCcccceecccceeE
Confidence 3899999999999999999999988 45 99999999999877653
No 8
>PHA00733 hypothetical protein
Probab=99.03 E-value=3.3e-10 Score=64.78 Aligned_cols=55 Identities=18% Similarity=0.503 Sum_probs=49.3
Q ss_pred CCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhcC
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
+.+||.|..|++.|.....|..|++.+. .+ +.|.+|++.|.....|.+|+...|+
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h~--~~-~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRYTE--HS-KVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhcCC--cC-ccCCCCCCccCCHHHHHHHHHHhcC
Confidence 4679999999999999999999998762 44 9999999999999999999988775
No 9
>KOG1074|consensus
Probab=98.94 E-value=1.5e-10 Score=81.45 Aligned_cols=58 Identities=26% Similarity=0.510 Sum_probs=53.9
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhcCCCCC
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQNGLH 107 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~ 107 (108)
+++|..|.+.|...+.|+.|.+.|+++++ |+|.+||..|.++.+|+.|...|+...|+
T Consensus 353 khkCr~CakvfgS~SaLqiHlRSHTGERP-fqCnvCG~~FSTkGNLKvH~~rH~e~~p~ 410 (958)
T KOG1074|consen 353 KHKCRFCAKVFGSDSALQIHLRSHTGERP-FQCNVCGNRFSTKGNLKVHFQRHREKYPH 410 (958)
T ss_pred cchhhhhHhhcCchhhhhhhhhccCCCCC-eeecccccccccccceeeeeeeccccCCc
Confidence 57899999999999999999999999887 99999999999999999999988877654
No 10
>KOG3608|consensus
Probab=98.82 E-value=5.2e-09 Score=67.96 Aligned_cols=83 Identities=23% Similarity=0.441 Sum_probs=62.7
Q ss_pred CccC-ccCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCC--
Q psy3491 8 RTVN-IVDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPF-- 83 (108)
Q Consensus 8 ~~~~-~~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~-- 83 (108)
+++| +.||.|. .....+.|..|++.-+ ...+||+|..|.+.+.+.++|.+|...|. +.. |.|..
T Consensus 259 rHvn~ykCplCdmtc~~~ssL~~H~r~rH----------s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~-y~C~h~~ 326 (467)
T KOG3608|consen 259 RHVNCYKCPLCDMTCSSASSLTTHIRYRH----------SKDKPFKCDECDTRCVRESDLAKHVQVHS-KTV-YQCEHPD 326 (467)
T ss_pred HhhhcccccccccCCCChHHHHHHHHhhh----------ccCCCccccchhhhhccHHHHHHHHHhcc-ccc-eecCCCC
Confidence 3444 4889998 7888888888887553 45789999999999999999999998776 344 67754
Q ss_pred CccccCCchhHHHHHHHhc
Q psy3491 84 CPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 84 C~~~f~~~~~l~~H~~~h~ 102 (108)
|..++.+...+++|.+.++
T Consensus 327 C~~s~r~~~q~~~H~~evh 345 (467)
T KOG3608|consen 327 CHYSVRTYTQMRRHFLEVH 345 (467)
T ss_pred CcHHHHHHHHHHHHHHHhc
Confidence 6666666666777766555
No 11
>KOG1074|consensus
Probab=98.78 E-value=2.1e-09 Score=75.93 Aligned_cols=58 Identities=26% Similarity=0.499 Sum_probs=54.7
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhcCCCCC
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQNGLH 107 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~ 107 (108)
.+.|..||+.|...++|..|+++|.++++ |.|.+|++.|....+|+.|+.+|.+..+.
T Consensus 879 ~h~C~vCgk~FsSSsALqiH~rTHtg~KP-F~C~fC~~aFttrgnLKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 879 AHVCNVCGKQFSSSAALEIHMRTHTGPKP-FFCHFCEEAFTTRGNLKVHMGTHMWVQPP 936 (958)
T ss_pred hhhhccchhcccchHHHHHhhhcCCCCCC-ccchhhhhhhhhhhhhhhhhccccccCCC
Confidence 47899999999999999999999999998 99999999999999999999999998764
No 12
>PHA00616 hypothetical protein
Probab=98.75 E-value=4.9e-09 Score=48.51 Aligned_cols=32 Identities=16% Similarity=0.302 Sum_probs=15.4
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCC
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCP 82 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~ 82 (108)
|+|..||+.|...+.|..|++.+++..+ +.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~-~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNK-LTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCc-ccee
Confidence 4455555555555555555554444433 4443
No 13
>KOG3608|consensus
Probab=98.60 E-value=1.3e-08 Score=66.11 Aligned_cols=90 Identities=20% Similarity=0.365 Sum_probs=48.1
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCC-----------------CCCccccccCCCceeeccchhhhccCchHHHHHHhHHhc
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSS-----------------GASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
.||.|. .|....-+..|++.... .+.+.+|......-|.|+.|.......+.|..|++..+.
T Consensus 209 ACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs 288 (467)
T KOG3608|consen 209 ACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHS 288 (467)
T ss_pred ecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhc
Confidence 688888 67777777666553321 111112222223345555555555555555555554333
Q ss_pred CCCCccCCCCccccCCchhHHHHHHHhc
Q psy3491 75 KEAGLQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 75 ~~~~~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
...||+|+.|.+.+...+.|++|...|.
T Consensus 289 ~dkpfKCd~Cd~~c~~esdL~kH~~~HS 316 (467)
T KOG3608|consen 289 KDKPFKCDECDTRCVRESDLAKHVQVHS 316 (467)
T ss_pred cCCCccccchhhhhccHHHHHHHHHhcc
Confidence 2223666666666666666666666544
No 14
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.58 E-value=4.8e-08 Score=40.61 Aligned_cols=26 Identities=27% Similarity=0.723 Sum_probs=21.8
Q ss_pred HHHHHHhHHhcCCCCccCCCCccccCC
Q psy3491 64 SLERHKKFECGKEAGLQCPFCPYKSKH 90 (108)
Q Consensus 64 ~l~~h~~~~~~~~~~~~c~~C~~~f~~ 90 (108)
+|..|++.|.++++ |.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~-~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKP-YKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSS-EEESSSSEEESS
T ss_pred CHHHHhhhcCCCCC-CCCCCCcCeeCc
Confidence 47889999999887 999999988863
No 15
>PHA00732 hypothetical protein
Probab=98.49 E-value=6.1e-08 Score=50.89 Aligned_cols=45 Identities=24% Similarity=0.490 Sum_probs=29.8
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHH
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~ 99 (108)
||.|..|++.|.....|..|++.++. + +.|+.|++.|. .+..|.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--~-~~C~~CgKsF~---~l~~H~~ 45 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT--L-TKCPVCNKSYR---RLNQHFY 45 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC--C-CccCCCCCEeC---Chhhhhc
Confidence 46777777777777777777764222 2 57777777776 3555653
No 16
>PHA00616 hypothetical protein
Probab=98.43 E-value=1.3e-07 Score=43.81 Aligned_cols=30 Identities=20% Similarity=0.243 Sum_probs=28.4
Q ss_pred CccCCCCccccCCchhHHHHHHHhcCCCCC
Q psy3491 78 GLQCPFCPYKSKHKSNLKTHMAIRHQNGLH 107 (108)
Q Consensus 78 ~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~ 107 (108)
||+|..||+.|...++|.+|++.||+++++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~ 30 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKL 30 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcc
Confidence 499999999999999999999999999985
No 17
>KOG3993|consensus
Probab=98.40 E-value=4.2e-08 Score=65.05 Aligned_cols=80 Identities=18% Similarity=0.374 Sum_probs=60.0
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhc-----------------
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECG----------------- 74 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~----------------- 74 (108)
+|-.|. .+.+...|.+|. +. .+-.-.|.|+.|++.|....+|..|+++|-.
T Consensus 269 iCqLCK~kYeD~F~LAQHr-C~----------RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~ 337 (500)
T KOG3993|consen 269 ICQLCKEKYEDAFALAQHR-CP----------RIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVE 337 (500)
T ss_pred HHHHHHHhhhhHHHHhhcc-CC----------eeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhh
Confidence 566666 566665655542 22 1222359999999999999999999998852
Q ss_pred ----------------CCCCccCCCCccccCCchhHHHHHHHhcCC
Q psy3491 75 ----------------KEAGLQCPFCPYKSKHKSNLKTHMAIRHQN 104 (108)
Q Consensus 75 ----------------~~~~~~c~~C~~~f~~~~~l~~H~~~h~~~ 104 (108)
+.. |.|..|++.|....-|+.|+.+|+..
T Consensus 338 ~rae~~ea~rsg~dss~gi-~~C~~C~KkFrRqAYLrKHqlthq~~ 382 (500)
T KOG3993|consen 338 TRAEVQEAERSGDDSSSGI-FSCHTCGKKFRRQAYLRKHQLTHQRA 382 (500)
T ss_pred hhhhhhhccccCCcccCce-eecHHhhhhhHHHHHHHHhHHhhhcc
Confidence 122 88999999999999999999988864
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.34 E-value=3.3e-07 Score=44.46 Aligned_cols=24 Identities=21% Similarity=0.533 Sum_probs=23.6
Q ss_pred ccCCCCccccCCchhHHHHHHHhc
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
|.|+.||+.|...++|..|+++|+
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~ 29 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN 29 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC
Confidence 899999999999999999999999
No 19
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.33 E-value=9.8e-07 Score=43.08 Aligned_cols=52 Identities=29% Similarity=0.544 Sum_probs=38.6
Q ss_pred eeeccchhhhccCchHHHHHHhHHh-cCCCCccCCCCccccCCchhHHHHHHHhcC
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFEC-GKEAGLQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~-~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
.|.|+.|++ ......|..|....+ .+...+.|++|...+. .+|..|+..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 488999999 555678999966544 3333499999987544 489999988774
No 20
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.27 E-value=2e-06 Score=59.76 Aligned_cols=59 Identities=20% Similarity=0.383 Sum_probs=33.9
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
.|+.|. .| ....+..|++.++ +++.|+ ||..+ ....|..|+..+.+.++ +.|..|++.+
T Consensus 455 ~C~~Cgk~f-~~s~LekH~~~~H-------------kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kp-i~C~fC~~~v 514 (567)
T PLN03086 455 HCEKCGQAF-QQGEMEKHMKVFH-------------EPLQCP-CGVVL-EKEQMVQHQASTCPLRL-ITCRFCGDMV 514 (567)
T ss_pred cCCCCCCcc-chHHHHHHHHhcC-------------CCccCC-CCCCc-chhHHHhhhhccCCCCc-eeCCCCCCcc
Confidence 677887 44 4566777776541 345555 65433 33555566555555554 6666666555
No 21
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.24 E-value=7.5e-07 Score=35.76 Aligned_cols=22 Identities=45% Similarity=0.759 Sum_probs=14.3
Q ss_pred ccCCCCccccCCchhHHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAI 100 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~ 100 (108)
|.|+.|++.|.....|.+|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4566666666666666666665
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.09 E-value=2e-06 Score=35.61 Aligned_cols=25 Identities=16% Similarity=0.364 Sum_probs=19.9
Q ss_pred HHHHhhhcCCCCCccccccCCCceeeccchhhhccC
Q psy3491 26 LYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKA 61 (108)
Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~ 61 (108)
|..|++.+ ++++||.|+.|++.|..
T Consensus 2 l~~H~~~H-----------~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 2 LRRHMRTH-----------TGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHH-----------SSSSSEEESSSSEEESS
T ss_pred HHHHhhhc-----------CCCCCCCCCCCcCeeCc
Confidence 45666655 78999999999999863
No 23
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=98.09 E-value=3e-06 Score=34.01 Aligned_cols=24 Identities=42% Similarity=0.848 Sum_probs=15.5
Q ss_pred ccCCCCccccCCchhHHHHHHHhc
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
|.|++|++.|.+...|..|++.+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 457777777777777777777664
No 24
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.04 E-value=3.7e-06 Score=33.66 Aligned_cols=23 Identities=30% Similarity=0.766 Sum_probs=20.8
Q ss_pred eeccchhhhccCchHHHHHHhHH
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFE 72 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~ 72 (108)
|.|+.|++.|.....|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999998864
No 25
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.97 E-value=4.1e-06 Score=39.75 Aligned_cols=31 Identities=29% Similarity=0.563 Sum_probs=21.4
Q ss_pred CCccCCCCccccCCchhHHHHHHHhcCCCCC
Q psy3491 77 AGLQCPFCPYKSKHKSNLKTHMAIRHQNGLH 107 (108)
Q Consensus 77 ~~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~ 107 (108)
.|-.|++|+..+.+..+|++|+++.|+.||-
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~k~~ 53 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFKKPG 53 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTTS--
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhcccCC
Confidence 3488899998888889999999888888863
No 26
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.96 E-value=2.8e-06 Score=35.38 Aligned_cols=24 Identities=21% Similarity=0.328 Sum_probs=15.8
Q ss_pred ccCCCCccccCCchhHHHHHHHhc
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
|.|..|++.|.+...|..|++.|.
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCccCCccCChhHHHHHhHHhc
Confidence 566666666666666666666554
No 27
>PLN03086 PRLI-interacting factor K; Provisional
Probab=97.91 E-value=3.1e-05 Score=54.07 Aligned_cols=77 Identities=19% Similarity=0.318 Sum_probs=58.2
Q ss_pred ccCCCCCCCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccC----------chHHHHHHhHHhcCCCCccC
Q psy3491 12 IVDPVCPAFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKA----------KGSLERHKKFECGKEAGLQC 81 (108)
Q Consensus 12 ~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~----------~~~l~~h~~~~~~~~~~~~c 81 (108)
+.|| |........+..|+... -..+++.|..|+..+.. ...|..|.... +.++ +.|
T Consensus 479 v~Cp-Cg~~~~R~~L~~H~~th-----------Cp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt-~~C 544 (567)
T PLN03086 479 LQCP-CGVVLEKEQMVQHQAST-----------CPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRT-APC 544 (567)
T ss_pred ccCC-CCCCcchhHHHhhhhcc-----------CCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcc-eEc
Confidence 4788 88544566777776543 45689999999998852 34788998875 6677 999
Q ss_pred CCCccccCCchhHHHHHHHhcC
Q psy3491 82 PFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 82 ~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
..||+.+..+ .+..|+...|+
T Consensus 545 ~~Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 545 DSCGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred cccCCeeeeh-hHHHHHHHhhc
Confidence 9999998877 47888877665
No 28
>PHA00733 hypothetical protein
Probab=97.88 E-value=8e-06 Score=46.80 Aligned_cols=55 Identities=13% Similarity=0.182 Sum_probs=42.3
Q ss_pred CCceeeccchhhhccCchHHHHH--Hh---HHhcCCCCccCCCCccccCCchhHHHHHHHh
Q psy3491 46 QSALFMCESCHKTYKAKGSLERH--KK---FECGKEAGLQCPFCPYKSKHKSNLKTHMAIR 101 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h--~~---~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h 101 (108)
..+++.|..|...|.....|..+ .. ...+.++ |.|..|++.|.+...|..|++.+
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kP-y~C~~Cgk~Fss~s~L~~H~r~h 96 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSP-YVCPLCLMPFSSSVSLKQHIRYT 96 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCC-ccCCCCCCcCCCHHHHHHHHhcC
Confidence 45679999999888877666555 11 2223455 99999999999999999999865
No 29
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.77 E-value=1.1e-05 Score=33.48 Aligned_cols=26 Identities=35% Similarity=0.830 Sum_probs=23.2
Q ss_pred eeeccchhhhccCchHHHHHHhHHhc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
||.|..|++.|.....|..|++.|..
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 58999999999999999999987753
No 30
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.74 E-value=2.2e-05 Score=42.62 Aligned_cols=72 Identities=17% Similarity=0.262 Sum_probs=16.6
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCc
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHK 91 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~ 91 (108)
.|.+|. .|.+...+..|+...+.. .+. . ...+.....+..+...... .. +.|..|++.|.+.
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~-~~~-------------~-~~~l~~~~~~~~~~~~~~~-~~-~~C~~C~~~f~s~ 63 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGF-DIP-------------D-QKYLVDPNRLLNYLRKKVK-ES-FRCPYCNKTFRSR 63 (100)
T ss_dssp ----------------------------------------------------------------SS-EEBSSSS-EESSH
T ss_pred Ccccccccccccccccccccccccc-ccc-------------c-ccccccccccccccccccC-CC-CCCCccCCCCcCH
Confidence 478888 788888888887544321 111 0 0000111112222221111 23 7777777777777
Q ss_pred hhHHHHHHHh
Q psy3491 92 SNLKTHMAIR 101 (108)
Q Consensus 92 ~~l~~H~~~h 101 (108)
..|..|++.+
T Consensus 64 ~~l~~Hm~~~ 73 (100)
T PF12756_consen 64 EALQEHMRSK 73 (100)
T ss_dssp HHHHHHHHHT
T ss_pred HHHHHHHcCc
Confidence 7777777653
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.70 E-value=3.3e-05 Score=30.83 Aligned_cols=24 Identities=29% Similarity=0.733 Sum_probs=19.3
Q ss_pred eeccchhhhccCchHHHHHHhHHh
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
|.|+.|++.|.....|..|+..++
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999988753
No 32
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.57 E-value=9.5e-05 Score=29.85 Aligned_cols=24 Identities=42% Similarity=0.613 Sum_probs=16.0
Q ss_pred ccCCCCccccCCchhHHHHHHHhc
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 456677777777777777776543
No 33
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.41 E-value=0.00058 Score=33.18 Aligned_cols=52 Identities=17% Similarity=0.179 Sum_probs=37.7
Q ss_pred CccCCCCCCCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHh
Q psy3491 11 NIVDPVCPAFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 11 ~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
.+.||+|....+...|..|....+.. +.+...|+.|...+. .+|..|+...+
T Consensus 2 ~f~CP~C~~~~~~~~L~~H~~~~H~~---------~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGKGFSESSLVEHCEDEHRS---------ESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCCccCHHHHHHHHHhHCcC---------CCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 36799999656677888887766432 234689999998655 37888987654
No 34
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.39 E-value=4.8e-05 Score=49.38 Aligned_cols=53 Identities=25% Similarity=0.509 Sum_probs=42.6
Q ss_pred CCceeeccc--hhhhccCchHHHHHHhHHh-------------------cCCCCccCCCCccccCCchhHHHHHH
Q psy3491 46 QSALFMCES--CHKTYKAKGSLERHKKFEC-------------------GKEAGLQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 46 ~~~~~~C~~--c~~~f~~~~~l~~h~~~~~-------------------~~~~~~~c~~C~~~f~~~~~l~~H~~ 99 (108)
+++||.|+. |.+.|.+...|..|+.-.+ .+++ |.|++|+|.++....|+.|..
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KP-YrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKP-YRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCc-eeccccchhhccCccceeccc
Confidence 469999975 9999999999998875322 1354 999999999999888887754
No 35
>KOG3993|consensus
Probab=97.29 E-value=3.8e-05 Score=51.46 Aligned_cols=53 Identities=21% Similarity=0.445 Sum_probs=46.0
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhcC
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
|.|..|...|.....|.+|+-....... |.|++|+|+|+-..+|..|.|=|..
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vE-YrCPEC~KVFsCPANLASHRRWHKP 320 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVE-YRCPECDKVFSCPANLASHRRWHKP 320 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEee-ecCCcccccccCchhhhhhhcccCC
Confidence 8999999999999999999765555456 9999999999999999999886643
No 36
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.03 E-value=0.00045 Score=27.74 Aligned_cols=24 Identities=29% Similarity=0.735 Sum_probs=20.7
Q ss_pred eeccchhhhccCchHHHHHHhHHh
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
+.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 579999999999999999988653
No 37
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.93 E-value=0.00081 Score=26.94 Aligned_cols=23 Identities=48% Similarity=1.024 Sum_probs=12.6
Q ss_pred ccCCCCccccCCchhHHHHHHHhc
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
|.|+.|+.... ...|.+|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666665555 556666666655
No 38
>PHA00732 hypothetical protein
Probab=96.90 E-value=0.00077 Score=35.43 Aligned_cols=25 Identities=24% Similarity=0.525 Sum_probs=22.3
Q ss_pred CccCCCCccccCCchhHHHHHHH-hc
Q psy3491 78 GLQCPFCPYKSKHKSNLKTHMAI-RH 102 (108)
Q Consensus 78 ~~~c~~C~~~f~~~~~l~~H~~~-h~ 102 (108)
||.|..|++.|.+..+|..|++. |.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~ 26 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT 26 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC
Confidence 48999999999999999999985 44
No 39
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.85 E-value=0.0005 Score=27.79 Aligned_cols=22 Identities=18% Similarity=0.514 Sum_probs=13.9
Q ss_pred ccCCCCccccCCchhHHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAI 100 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~ 100 (108)
|.|.+|++.|.+...+..|.+.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 3566666666666666666643
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.67 E-value=0.0024 Score=30.46 Aligned_cols=32 Identities=19% Similarity=0.309 Sum_probs=22.8
Q ss_pred CCceeeccchhhhccCchHHHHHHhHHhcCCC
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKFECGKEA 77 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~ 77 (108)
.+.|..|+.|+..+....+|..|+...++.++
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 46789999999999999999999988777554
No 41
>PRK04860 hypothetical protein; Provisional
Probab=96.63 E-value=0.0021 Score=38.31 Aligned_cols=38 Identities=21% Similarity=0.538 Sum_probs=31.7
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHK 91 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~ 91 (108)
+|.|. |+. ....+..|.+++.+.++ |.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~-YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAV-YRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCcc-EECCCCCceeEEe
Confidence 78998 986 56678899999999887 9999999887644
No 42
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.49 E-value=0.0018 Score=26.11 Aligned_cols=23 Identities=30% Similarity=0.834 Sum_probs=19.5
Q ss_pred eeccchhhhccCchHHHHHHhHH
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFE 72 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~ 72 (108)
|.|..|+..|.....+..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 57899999999999999988754
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.23 E-value=0.0049 Score=24.56 Aligned_cols=23 Identities=30% Similarity=0.592 Sum_probs=17.1
Q ss_pred eeccchhhhccCchHHHHHHhHHh
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
|.|+.|..... ...|..|++.++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 67899998887 888999988765
No 44
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.07 E-value=0.0012 Score=27.26 Aligned_cols=21 Identities=29% Similarity=0.515 Sum_probs=14.8
Q ss_pred ccCCCCccccCCchhHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~ 99 (108)
|.|..|++.|.....+..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 567777777777777766654
No 45
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.05 E-value=0.0045 Score=33.36 Aligned_cols=25 Identities=28% Similarity=0.708 Sum_probs=21.8
Q ss_pred eeeccchhhhccCchHHHHHHhHHh
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
.+.|..|++.|.....|..|++.+.
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCcc
Confidence 6999999999999999999998754
No 46
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.01 E-value=0.0041 Score=40.71 Aligned_cols=59 Identities=22% Similarity=0.404 Sum_probs=41.3
Q ss_pred ccCCC--CC-CCcChhHHHHHhhhcCCCCCcccc-------c-cCCCceeeccchhhhccCchHHHHHHh
Q psy3491 12 IVDPV--CP-AFVDAPWLYFLLYSFSSGASFSSF-------K-PLQSALFMCESCHKTYKAKGSLERHKK 70 (108)
Q Consensus 12 ~~~~~--c~-~~~~~~~~~~~~~~~~~~~~~~~~-------~-~~~~~~~~C~~c~~~f~~~~~l~~h~~ 70 (108)
+.||+ |. ++.....|++|+.--++...+..- . ....|||.|..|++.|.+.-.|..|+.
T Consensus 350 ykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 350 YKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred ecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 37876 66 788888999987644433333211 1 135689999999999999888877743
No 47
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=95.89 E-value=0.0091 Score=24.25 Aligned_cols=20 Identities=30% Similarity=0.675 Sum_probs=12.1
Q ss_pred ccCCCCccccCCchhHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~ 99 (108)
..|+.||+.| ..+.|.+|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 4567777777 3345666654
No 48
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=94.80 E-value=0.021 Score=24.75 Aligned_cols=21 Identities=14% Similarity=0.521 Sum_probs=16.3
Q ss_pred ccCCCCccccCCchhHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~ 99 (108)
|.|.+|++.|.....+..|..
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred eEccccCCccCCHHHHHHHHC
Confidence 778888888887777777764
No 49
>KOG2893|consensus
Probab=93.32 E-value=0.018 Score=36.38 Aligned_cols=49 Identities=22% Similarity=0.458 Sum_probs=34.7
Q ss_pred ceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhc
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
+|| |=.|++.|.....|.+|++. +. |+|-+|.|.+-+...|..|-...|
T Consensus 10 kpw-cwycnrefddekiliqhqka----kh-fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 10 KPW-CWYCNREFDDEKILIQHQKA----KH-FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred Cce-eeecccccchhhhhhhhhhh----cc-ceeeeehhhhccCCCceeehhhhh
Confidence 454 66788899999889988875 44 899999877666555655544333
No 50
>PRK04860 hypothetical protein; Provisional
Probab=92.43 E-value=0.085 Score=31.51 Aligned_cols=27 Identities=15% Similarity=0.402 Sum_probs=22.9
Q ss_pred CccCCCCccccCCchhHHHHHHHhcCCCCCC
Q psy3491 78 GLQCPFCPYKSKHKSNLKTHMAIRHQNGLHV 108 (108)
Q Consensus 78 ~~~c~~C~~~f~~~~~l~~H~~~h~~~k~~~ 108 (108)
+|.|. |++ ....+.+|.+++.++++|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~Yr 145 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYR 145 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEE
Confidence 39998 986 7778899999999999873
No 51
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=92.10 E-value=0.12 Score=22.35 Aligned_cols=24 Identities=25% Similarity=0.783 Sum_probs=16.3
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
|.|..||..+.... . ++.|++|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~-~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------A-PWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------C-CCcCcCCCC
Confidence 67888887665432 3 488888874
No 52
>KOG2186|consensus
Probab=92.09 E-value=0.13 Score=32.77 Aligned_cols=54 Identities=24% Similarity=0.308 Sum_probs=30.2
Q ss_pred CccCCCCCCCcChhHHHHHhhhcCCCCCccccccCCCceeeccchhhhccCchHHHHHHhHHhcCCC
Q psy3491 11 NIVDPVCPAFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESCHKTYKAKGSLERHKKFECGKEA 77 (108)
Q Consensus 11 ~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~ 77 (108)
.|.|.+|.....++.+..|+..- .+ .-|.|..|++.|.. .++..|....+....
T Consensus 3 ~FtCnvCgEsvKKp~vekH~srC-----------rn-~~fSCIDC~k~F~~-~sYknH~kCITEaQK 56 (276)
T KOG2186|consen 3 FFTCNVCGESVKKPQVEKHMSRC-----------RN-AYFSCIDCGKTFER-VSYKNHTKCITEAQK 56 (276)
T ss_pred EEehhhhhhhccccchHHHHHhc-----------cC-CeeEEeeccccccc-chhhhhhhhcchHHH
Confidence 45666666555555555554322 11 34777777777766 345666655554444
No 53
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=91.41 E-value=0.029 Score=37.61 Aligned_cols=58 Identities=19% Similarity=0.449 Sum_probs=50.6
Q ss_pred ceeeccchhhhccCchHHHHHHh--HHhcC--CCCccCC--CCccccCCchhHHHHHHHhcCCCC
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKK--FECGK--EAGLQCP--FCPYKSKHKSNLKTHMAIRHQNGL 106 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~--~~~~~--~~~~~c~--~C~~~f~~~~~l~~H~~~h~~~k~ 106 (108)
.++.|..|...|.....+..|.+ .|.+. .+ +.|+ .|++.|.....+..|..+|.+-.+
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~-~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKP-FSCPYSLCGKLFSRNDALKRHILLHTSISP 351 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCc-eeeeccCCCccccccccccCCcccccCCCc
Confidence 46889999999999999999999 78887 55 9999 799999999999999988887654
No 54
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.95 E-value=0.38 Score=23.43 Aligned_cols=31 Identities=16% Similarity=0.372 Sum_probs=24.1
Q ss_pred cCCCceeeccchhhhccCchHHHHHHhHHhc
Q psy3491 44 PLQSALFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 44 ~~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
..|+.-+.|+-|+..|....+...|..-.++
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 3466678899999999988888888765444
No 55
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=88.49 E-value=0.15 Score=31.96 Aligned_cols=26 Identities=23% Similarity=0.520 Sum_probs=16.5
Q ss_pred ccCCCCccccCCchhHHHHHHHhcCC
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRHQN 104 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~~~ 104 (108)
|.|.+|+|.|.-..-+.+|+..-|.+
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred ECCCCCCcccCChHHHHHHHhhcCHH
Confidence 77777777777777777777665543
No 56
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=88.12 E-value=0.3 Score=21.66 Aligned_cols=34 Identities=21% Similarity=0.672 Sum_probs=19.3
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccC
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSK 89 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~ 89 (108)
-.|+.|+..|.-..+ +...+... ..|..|+..|.
T Consensus 3 i~CP~C~~~f~v~~~-----~l~~~~~~-vrC~~C~~~f~ 36 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDD-----KLPAGGRK-VRCPKCGHVFR 36 (37)
T ss_pred EECCCCCceEEcCHH-----HcccCCcE-EECCCCCcEee
Confidence 357777777765542 11122344 77777776653
No 57
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=87.69 E-value=0.27 Score=21.40 Aligned_cols=25 Identities=28% Similarity=0.747 Sum_probs=16.7
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|.|..||..+... .+|..|++|+.
T Consensus 2 ~~~C~~CG~i~~g~-------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEGE-------------EAPEKCPICGA 26 (34)
T ss_pred eEECCCCCCEeECC-------------cCCCcCcCCCC
Confidence 47888998665332 23478999874
No 58
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=86.24 E-value=0.37 Score=27.91 Aligned_cols=24 Identities=29% Similarity=0.380 Sum_probs=13.4
Q ss_pred ccCCCCccccCCchhHHHHHHHhcCCC
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRHQNG 105 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~~~k 105 (108)
..|-+||+.|..- ++|.+.|||-.
T Consensus 73 i~clecGk~~k~L---krHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKTL---KRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESBH---HHHHHHTT-S-
T ss_pred eEEccCCcccchH---HHHHHHccCCC
Confidence 6777788776653 78888777643
No 59
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=86.12 E-value=0.71 Score=21.80 Aligned_cols=21 Identities=33% Similarity=0.741 Sum_probs=12.1
Q ss_pred ccCCCCccccCCc-----hhHHHHHH
Q psy3491 79 LQCPFCPYKSKHK-----SNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~-----~~l~~H~~ 99 (108)
..|..|++.+... ++|.+|+.
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 4566666655332 46666666
No 60
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=85.78 E-value=0.42 Score=21.09 Aligned_cols=33 Identities=15% Similarity=0.424 Sum_probs=18.6
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
..|+.|+..|.-.... ....... ..|..|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~-----ip~~g~~-v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEK-----IPPKGRK-VRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHH-----CCCCCcE-EECCCCCCEe
Confidence 3577777777655432 1122234 7777777665
No 61
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=85.60 E-value=0.43 Score=28.44 Aligned_cols=34 Identities=18% Similarity=0.464 Sum_probs=25.6
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCC
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKH 90 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~ 90 (108)
..-|.|+.|+..|+....+. .. |.|+.||.....
T Consensus 107 ~~~Y~Cp~c~~r~tf~eA~~---------~~-F~Cp~Cg~~L~~ 140 (158)
T TIGR00373 107 NMFFICPNMCVRFTFNEAME---------LN-FTCPRCGAMLDY 140 (158)
T ss_pred CCeEECCCCCcEeeHHHHHH---------cC-CcCCCCCCEeee
Confidence 44589999998888776653 24 999999987543
No 62
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=85.26 E-value=0.46 Score=20.95 Aligned_cols=33 Identities=15% Similarity=0.406 Sum_probs=18.2
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
..|+.|+..|.-..+.. ...... ..|+.|+..|
T Consensus 3 ~~CP~C~~~~~v~~~~~-----~~~~~~-v~C~~C~~~~ 35 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQL-----GANGGK-VRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEEEeCHHHc-----CCCCCE-EECCCCCCEE
Confidence 46777777765544321 111123 6777777665
No 63
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=84.84 E-value=0.93 Score=22.41 Aligned_cols=32 Identities=22% Similarity=0.506 Sum_probs=19.5
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|.|+.||+.-..... +...-..+ |.|+.||-
T Consensus 27 ~F~CPnCGe~~I~Rc~-----~CRk~g~~-Y~Cp~CGF 58 (61)
T COG2888 27 KFPCPNCGEVEIYRCA-----KCRKLGNP-YRCPKCGF 58 (61)
T ss_pred EeeCCCCCceeeehhh-----hHHHcCCc-eECCCcCc
Confidence 5999999966544321 11111244 99999973
No 64
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=84.30 E-value=0.51 Score=28.72 Aligned_cols=34 Identities=24% Similarity=0.697 Sum_probs=25.2
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCC
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKH 90 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~ 90 (108)
..-|.|+.|+..|+....+. .. |.|+.||..+..
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~-F~Cp~Cg~~L~~ 148 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YG-FRCPQCGEMLEE 148 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cC-CcCCCCCCCCee
Confidence 34589999998887765442 24 999999987544
No 65
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=84.25 E-value=0.14 Score=30.42 Aligned_cols=15 Identities=20% Similarity=0.428 Sum_probs=10.8
Q ss_pred eeccchhhhccCchH
Q psy3491 50 FMCESCHKTYKAKGS 64 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~ 64 (108)
++|+.||..|.....
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 778888888776543
No 66
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=83.37 E-value=0.79 Score=25.61 Aligned_cols=30 Identities=23% Similarity=0.541 Sum_probs=20.2
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHK 91 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~ 91 (108)
..|+.||..|.- | + +.|..|+.||..|.-.
T Consensus 10 R~Cp~CG~kFYD---L--------n-k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYD---L--------N-KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhcc---C--------C-CCCccCCCCCCccCcc
Confidence 678888877643 2 1 2348888888877654
No 67
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=82.96 E-value=0.77 Score=27.67 Aligned_cols=24 Identities=38% Similarity=0.789 Sum_probs=18.0
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
-|.|++||..+ .+ .+|-.|++||-
T Consensus 134 ~~vC~vCGy~~-------------~g-e~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EG-EAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cC-CCCCcCCCCCC
Confidence 69999998543 34 44699999984
No 68
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=82.84 E-value=0.41 Score=22.73 Aligned_cols=30 Identities=17% Similarity=0.506 Sum_probs=19.3
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
-|.|..||..|.....+. . ... ..|+.||.
T Consensus 5 ey~C~~Cg~~fe~~~~~~------~-~~~-~~CP~Cg~ 34 (52)
T TIGR02605 5 EYRCTACGHRFEVLQKMS------D-DPL-ATCPECGG 34 (52)
T ss_pred EEEeCCCCCEeEEEEecC------C-CCC-CCCCCCCC
Confidence 378999998886543221 1 233 67999885
No 69
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=82.60 E-value=1.4 Score=20.05 Aligned_cols=22 Identities=36% Similarity=0.621 Sum_probs=12.4
Q ss_pred ccCCCCccccCC----chhHHHHHHH
Q psy3491 79 LQCPFCPYKSKH----KSNLKTHMAI 100 (108)
Q Consensus 79 ~~c~~C~~~f~~----~~~l~~H~~~ 100 (108)
..|..|++.+.. .+.|.+|++.
T Consensus 17 a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 17 AKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp EEETTTTEE-----SSTHHHHHHHHH
T ss_pred EEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 677777766554 3567777643
No 70
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=82.49 E-value=1 Score=26.49 Aligned_cols=39 Identities=18% Similarity=0.456 Sum_probs=25.5
Q ss_pred CCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccC
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSK 89 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~ 89 (108)
...-|.|+.|+..|.....+. ....... |.|+.||....
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~----~~d~~~~-f~Cp~Cg~~l~ 134 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQ----LLDMDGT-FTCPRCGEELE 134 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHH----hcCCCCc-EECCCCCCEEE
Confidence 344699999998887544322 1112344 99999997653
No 71
>KOG1146|consensus
Probab=82.46 E-value=0.93 Score=35.54 Aligned_cols=52 Identities=23% Similarity=0.472 Sum_probs=43.1
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhc-------------------------CCCCccCCCCccccCCchhHHHHHH
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECG-------------------------KEAGLQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~-------------------------~~~~~~c~~C~~~f~~~~~l~~H~~ 99 (108)
.+.+.|+.|+..|.....|..|++..+. .+ +|.|..|...+....+|..|+.
T Consensus 463 ~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~-p~~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 463 FKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGK-PYPCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred cccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCC-cccceeeeeeeecchHHHHHHH
Confidence 3679999999999999999999987221 23 4999999999999999999976
No 72
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=81.87 E-value=0.47 Score=21.17 Aligned_cols=30 Identities=23% Similarity=0.571 Sum_probs=19.6
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|.|..||..|....... . ... ..|+.||.
T Consensus 5 ~y~C~~Cg~~fe~~~~~~------~-~~~-~~CP~Cg~ 34 (41)
T smart00834 5 EYRCEDCGHTFEVLQKIS------D-DPL-ATCPECGG 34 (41)
T ss_pred EEEcCCCCCEEEEEEecC------C-CCC-CCCCCCCC
Confidence 478999998886543321 1 233 77888886
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=81.73 E-value=2 Score=24.14 Aligned_cols=84 Identities=12% Similarity=0.189 Sum_probs=46.2
Q ss_pred cCCCCC-CCcChhHHHHHhhhcCCCCCccc--cccC--CCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 13 VDPVCP-AFVDAPWLYFLLYSFSSGASFSS--FKPL--QSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
.||+|. .+...+.|... ++.-..+.. .... ....-.|-.|...|........-. ...... |.|+.|...
T Consensus 17 ~CpiCgLtLVss~HLARS---yHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~~--~~~~~~-y~C~~C~~~ 90 (112)
T TIGR00622 17 ECPICGLTLILSTHLARS---YHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFDE--LKDSHR-YVCAVCKNV 90 (112)
T ss_pred cCCcCCCEEeccchHHHh---hhccCCCcccccccccccCCCCcccCcCCCCCCcccccccc--cccccc-eeCCCCCCc
Confidence 789998 77777777541 111111110 0000 111234888888886543111000 112234 999999999
Q ss_pred cCCchhHHHHHHHhc
Q psy3491 88 SKHKSNLKTHMAIRH 102 (108)
Q Consensus 88 f~~~~~l~~H~~~h~ 102 (108)
|--.-.+-.|..+|.
T Consensus 91 FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 91 FCVDCDVFVHESLHC 105 (112)
T ss_pred cccccchhhhhhccC
Confidence 988877777776654
No 74
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=81.36 E-value=1.7 Score=17.59 Aligned_cols=20 Identities=25% Similarity=0.690 Sum_probs=13.7
Q ss_pred ccCCCCccccCCchhHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~ 99 (108)
..|++|++.+ ....+..|..
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 4678888776 5556777765
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=79.78 E-value=1.7 Score=24.00 Aligned_cols=25 Identities=28% Similarity=0.762 Sum_probs=22.6
Q ss_pred ccC----CCCccccCCchhHHHHHHHhcC
Q psy3491 79 LQC----PFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 79 ~~c----~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
|.| ..|+....+...+.+|.+.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 889 8999999999999999998875
No 76
>KOG4173|consensus
Probab=79.14 E-value=1.2 Score=27.84 Aligned_cols=53 Identities=26% Similarity=0.478 Sum_probs=37.4
Q ss_pred eeccchhhhccCchHHHHHHh-HHh---------cCCCCccC--CCCccccCCchhHHHHHHHhcC
Q psy3491 50 FMCESCHKTYKAKGSLERHKK-FEC---------GKEAGLQC--PFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~-~~~---------~~~~~~~c--~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
..|..|.+.|.+..-|..|+. +|. |... |.| ..|+..|.+...-..|+...|.
T Consensus 107 ~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dM-y~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 107 NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDM-YQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccH-HHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 478889888888877777754 222 3334 888 4688888888888888765553
No 77
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=77.95 E-value=1.8 Score=20.03 Aligned_cols=27 Identities=26% Similarity=0.901 Sum_probs=17.6
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
.|.|..||..+... ...+ ..|+.||..
T Consensus 2 ~Y~C~~Cg~~~~~~-----------~~~~-irC~~CG~r 28 (44)
T smart00659 2 IYICGECGRENEIK-----------SKDV-VRCRECGYR 28 (44)
T ss_pred EEECCCCCCEeecC-----------CCCc-eECCCCCce
Confidence 37888888766533 2233 888888853
No 78
>KOG2186|consensus
Probab=77.77 E-value=1.1 Score=28.74 Aligned_cols=46 Identities=22% Similarity=0.402 Sum_probs=36.0
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHH
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~ 99 (108)
|.|..||....-.. +..|+...++ .- |.|-.|++.|-. .++..|..
T Consensus 4 FtCnvCgEsvKKp~-vekH~srCrn-~~-fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQ-VEKHMSRCRN-AY-FSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccccc-hHHHHHhccC-Ce-eEEeeccccccc-chhhhhhh
Confidence 78999998876654 6779877766 55 999999999987 56677754
No 79
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=77.34 E-value=0.34 Score=30.32 Aligned_cols=11 Identities=36% Similarity=0.966 Sum_probs=6.6
Q ss_pred ccCCCCccccC
Q psy3491 79 LQCPFCPYKSK 89 (108)
Q Consensus 79 ~~c~~C~~~f~ 89 (108)
..|+.||-.+.
T Consensus 49 ~vCP~CgyA~~ 59 (214)
T PF09986_consen 49 WVCPHCGYAAF 59 (214)
T ss_pred EECCCCCCccc
Confidence 46677776543
No 80
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=77.22 E-value=2.2 Score=19.36 Aligned_cols=24 Identities=25% Similarity=0.491 Sum_probs=18.2
Q ss_pred eeccchhhhccC--chHHHHHHhHHh
Q psy3491 50 FMCESCHKTYKA--KGSLERHKKFEC 73 (108)
Q Consensus 50 ~~C~~c~~~f~~--~~~l~~h~~~~~ 73 (108)
-.|+.||..|.. ..+...|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 489999998875 466777877764
No 81
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=76.11 E-value=1.1 Score=20.42 Aligned_cols=30 Identities=23% Similarity=0.670 Sum_probs=20.0
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|.|..||..|.....+ .. ... ..|+.||.
T Consensus 5 ey~C~~Cg~~fe~~~~~------~~-~~~-~~CP~Cg~ 34 (42)
T PF09723_consen 5 EYRCEECGHEFEVLQSI------SE-DDP-VPCPECGS 34 (42)
T ss_pred EEEeCCCCCEEEEEEEc------CC-CCC-CcCCCCCC
Confidence 37899999888664322 12 233 78999986
No 82
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=75.74 E-value=2.6 Score=26.57 Aligned_cols=29 Identities=17% Similarity=0.310 Sum_probs=22.0
Q ss_pred CCceeeccchhhhccCchHHHHHHhHHhc
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
++..|.|..|++.|.-...+.+|+...+.
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 44579999999999999999999876554
No 83
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=75.05 E-value=0.57 Score=19.52 Aligned_cols=11 Identities=36% Similarity=1.059 Sum_probs=6.2
Q ss_pred eeccchhhhcc
Q psy3491 50 FMCESCHKTYK 60 (108)
Q Consensus 50 ~~C~~c~~~f~ 60 (108)
|.|-.|++.|.
T Consensus 1 ~sCiDC~~~F~ 11 (28)
T PF08790_consen 1 FSCIDCSKDFD 11 (28)
T ss_dssp EEETTTTEEEE
T ss_pred CeeecCCCCcC
Confidence 34566666663
No 84
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=73.85 E-value=1.6 Score=17.51 Aligned_cols=10 Identities=30% Similarity=0.703 Sum_probs=7.7
Q ss_pred ceeeccchhh
Q psy3491 48 ALFMCESCHK 57 (108)
Q Consensus 48 ~~~~C~~c~~ 57 (108)
..|.|+.||+
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 3699999874
No 85
>KOG2785|consensus
Probab=72.36 E-value=4 Score=27.89 Aligned_cols=51 Identities=24% Similarity=0.290 Sum_probs=40.9
Q ss_pred eeeccchhhhccCchHHHHHHhHHhc-----------------------CCCCccCCCCc---cccCCchhHHHHHHH
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECG-----------------------KEAGLQCPFCP---YKSKHKSNLKTHMAI 100 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~-----------------------~~~~~~c~~C~---~~f~~~~~l~~H~~~ 100 (108)
|-.|-.|+..+........||..+++ ... +.|-.|. +.|.+..+.+.|+..
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~-~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIG-FICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccC-ceEEEeccccCcccccHHHHHHHhh
Confidence 46789999999888888889887776 233 7777777 889999999999863
No 86
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=72.23 E-value=1.4 Score=20.43 Aligned_cols=29 Identities=21% Similarity=0.770 Sum_probs=17.1
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
.|.|+.||..+..... ... ..|+.||..+
T Consensus 3 ~y~C~~CG~~~~~~~~----------~~~-~~Cp~CG~~~ 31 (46)
T PRK00398 3 EYKCARCGREVELDEY----------GTG-VRCPYCGYRI 31 (46)
T ss_pred EEECCCCCCEEEECCC----------CCc-eECCCCCCeE
Confidence 4778888766644321 113 7788887543
No 87
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=71.48 E-value=2.8 Score=24.11 Aligned_cols=30 Identities=10% Similarity=0.205 Sum_probs=18.6
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHK 91 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~ 91 (108)
..|+.||..|.- | + +.|..|+.||..|...
T Consensus 10 r~Cp~cg~kFYD---L--------n-k~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYD---L--------N-RRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCccccc---c--------C-CCCccCCCcCCccCcc
Confidence 578888776633 2 1 2347788888776544
No 88
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=68.42 E-value=1.4 Score=22.80 Aligned_cols=15 Identities=20% Similarity=0.603 Sum_probs=8.7
Q ss_pred eeecc--chhhhccCch
Q psy3491 49 LFMCE--SCHKTYKAKG 63 (108)
Q Consensus 49 ~~~C~--~c~~~f~~~~ 63 (108)
=++|. .||..|....
T Consensus 27 Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 27 YHQCQNVNCSATFITYE 43 (72)
T ss_pred eeecCCCCCCCEEEEEE
Confidence 35665 6666665543
No 89
>KOG4167|consensus
Probab=68.42 E-value=2 Score=31.85 Aligned_cols=25 Identities=28% Similarity=0.469 Sum_probs=22.2
Q ss_pred eeeccchhhhccCchHHHHHHhHHh
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
-|.|..|++.|....++..||+.|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 5999999999998888889988875
No 90
>KOG1146|consensus
Probab=67.91 E-value=3.9 Score=32.44 Aligned_cols=56 Identities=18% Similarity=0.437 Sum_probs=43.1
Q ss_pred CceeeccchhhhccCchHHHHHHhHHh----------------cCCCCccCCCCccccCCchhHHHHHHHhcC
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFEC----------------GKEAGLQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~----------------~~~~~~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
..+|.|..|...|.....|..|++... .+..+| |..|...|.....|+.|+++.+.
T Consensus 1282 ~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~ 1353 (1406)
T KOG1146|consen 1282 THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAH 1353 (1406)
T ss_pred chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhh
Confidence 456889999999999998988875322 123335 99999999999999999986444
No 91
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=67.81 E-value=4.1 Score=20.15 Aligned_cols=32 Identities=19% Similarity=0.602 Sum_probs=18.4
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|.|+.||+.....-. .-+.. ..+ |.|+.||-
T Consensus 25 ~F~CPnCG~~~I~RC~---~CRk~--~~~-Y~CP~CGF 56 (59)
T PRK14890 25 KFLCPNCGEVIIYRCE---KCRKQ--SNP-YTCPKCGF 56 (59)
T ss_pred EeeCCCCCCeeEeech---hHHhc--CCc-eECCCCCC
Confidence 5999999976322211 11111 134 99999974
No 92
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=67.78 E-value=3.5 Score=19.62 Aligned_cols=19 Identities=11% Similarity=0.392 Sum_probs=13.8
Q ss_pred CceeeccchhhhccCchHH
Q psy3491 47 SALFMCESCHKTYKAKGSL 65 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l 65 (108)
.+.+.|..||..|.....-
T Consensus 2 Dk~l~C~dCg~~FvfTa~E 20 (49)
T PF13451_consen 2 DKTLTCKDCGAEFVFTAGE 20 (49)
T ss_pred CeeEEcccCCCeEEEehhH
Confidence 3568899999888765533
No 93
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=65.96 E-value=5 Score=22.14 Aligned_cols=26 Identities=23% Similarity=0.506 Sum_probs=22.2
Q ss_pred eeec----cchhhhccCchHHHHHHhHHhc
Q psy3491 49 LFMC----ESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 49 ~~~C----~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
-|.| ..|+........+..|.+.+++
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 4889 9999999999999999887653
No 94
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=65.08 E-value=4 Score=19.40 Aligned_cols=28 Identities=18% Similarity=0.689 Sum_probs=18.2
Q ss_pred ceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
..|.|..||+.+... ..... ..|+.||.
T Consensus 5 ~~Y~C~~Cg~~~~~~----------~~~~~-irCp~Cg~ 32 (49)
T COG1996 5 MEYKCARCGREVELD----------QETRG-IRCPYCGS 32 (49)
T ss_pred EEEEhhhcCCeeehh----------hccCc-eeCCCCCc
Confidence 358999999887111 11233 88999984
No 95
>PF14353 CpXC: CpXC protein
Probab=65.03 E-value=6.7 Score=22.28 Aligned_cols=17 Identities=29% Similarity=0.716 Sum_probs=11.3
Q ss_pred eeeccchhhhccCchHH
Q psy3491 49 LFMCESCHKTYKAKGSL 65 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l 65 (108)
.+.|+.||..+.-...+
T Consensus 38 ~~~CP~Cg~~~~~~~p~ 54 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPL 54 (128)
T ss_pred EEECCCCCCceecCCCE
Confidence 57788888777655443
No 96
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=64.82 E-value=2.7 Score=20.60 Aligned_cols=36 Identities=17% Similarity=0.285 Sum_probs=18.2
Q ss_pred eccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 51 MCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 51 ~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
.|..|++.+........-.+.-......|.|+.|..
T Consensus 4 kC~lCdk~~~Id~~~~~aKrLrnrPi~tYmC~eC~~ 39 (56)
T PF09963_consen 4 KCILCDKKEEIDEDTPEAKRLRNRPIHTYMCDECKE 39 (56)
T ss_pred EEEecCCEEEeccCCHHHHHhhcCCCcceeChhHHH
Confidence 466777766544322222232222222388888853
No 97
>KOG4173|consensus
Probab=64.47 E-value=4.9 Score=25.22 Aligned_cols=51 Identities=25% Similarity=0.496 Sum_probs=37.9
Q ss_pred ceeeccc--hhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHhc
Q psy3491 48 ALFMCES--CHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIRH 102 (108)
Q Consensus 48 ~~~~C~~--c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h~ 102 (108)
..+.|.. |-..|....+...|-...++ -.|.+|.+.|.+...|..|+...|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~----~sCs~C~r~~Pt~hLLd~HI~E~H 130 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG----NSCSFCKRAFPTGHLLDAHILEWH 130 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc----chhHHHHHhCCchhhhhHHHHHHH
Confidence 3477865 77778777777766655444 579999999999999998886544
No 98
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=63.57 E-value=7 Score=25.49 Aligned_cols=71 Identities=15% Similarity=0.255 Sum_probs=38.6
Q ss_pred cCCCCCCCcChhHHHHHhhhcCCCCCccccc---------cCCCceeeccchhhhccCchHHHHHHhHH--hcCCCCccC
Q psy3491 13 VDPVCPAFVDAPWLYFLLYSFSSGASFSSFK---------PLQSALFMCESCHKTYKAKGSLERHKKFE--CGKEAGLQC 81 (108)
Q Consensus 13 ~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~C~~c~~~f~~~~~l~~h~~~~--~~~~~~~~c 81 (108)
+|.+|++|..-+ +.++.++....++.... ..|.+.|.|..|...+ ...+--.|+.+- ..... |+|
T Consensus 99 iCDfCEawvCHg--rkCl~~HaC~Cpl~da~C~EC~R~vw~hGGrif~CsfC~~fl-CEDDQFEHQAsCQvLe~E~-~KC 174 (314)
T PF06524_consen 99 ICDFCEAWVCHG--RKCLSTHACTCPLQDAVCIECERGVWDHGGRIFKCSFCDNFL-CEDDQFEHQASCQVLESET-FKC 174 (314)
T ss_pred hhccchhheecc--ccccccccccCcCCCcEeeeeecccccCCCeEEEeecCCCee-eccchhhhhhhhhhhhccc-ccc
Confidence 777788776432 23444454444444221 1266789999997544 333333454421 12244 888
Q ss_pred CCCccc
Q psy3491 82 PFCPYK 87 (108)
Q Consensus 82 ~~C~~~ 87 (108)
..|.+.
T Consensus 175 ~SCNrl 180 (314)
T PF06524_consen 175 QSCNRL 180 (314)
T ss_pred cccccc
Confidence 777654
No 99
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=61.61 E-value=3.6 Score=23.94 Aligned_cols=24 Identities=25% Similarity=0.181 Sum_probs=14.8
Q ss_pred ccCCCCccccCCchhHHHHHHHhcCCC
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRHQNG 105 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~~~k 105 (108)
..|-++|+.|. +|++|..+|++.-
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCC
Confidence 45666666654 4577777776643
No 100
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=61.23 E-value=4.3 Score=28.19 Aligned_cols=15 Identities=20% Similarity=0.443 Sum_probs=11.4
Q ss_pred eeeccchhhhccCch
Q psy3491 49 LFMCESCHKTYKAKG 63 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~ 63 (108)
-|.|..||..+....
T Consensus 367 g~rC~kCg~~~~~~~ 381 (421)
T COG1571 367 GFRCKKCGTRARETL 381 (421)
T ss_pred CcccccccccCCccc
Confidence 588998988876653
No 101
>KOG2593|consensus
Probab=60.88 E-value=4.4 Score=28.15 Aligned_cols=39 Identities=21% Similarity=0.524 Sum_probs=25.6
Q ss_pred CCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 45 LQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 45 ~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
+..+-|.|+.|.+.|.....+ +..-...-. |.|..|+..
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~---~L~~~~~~~-F~C~~C~ge 162 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEAL---QLLDNETGE-FHCENCGGE 162 (436)
T ss_pred cccccccCCccccchhhhHHH---HhhcccCce-EEEecCCCc
Confidence 445679999999988776544 233332234 899888754
No 102
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=60.85 E-value=4.2 Score=27.18 Aligned_cols=48 Identities=13% Similarity=0.280 Sum_probs=27.5
Q ss_pred eccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccCCchhHHHHHHHh
Q psy3491 51 MCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSKHKSNLKTHMAIR 101 (108)
Q Consensus 51 ~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~~~~~l~~H~~~h 101 (108)
.|-.|.-.|.....-.--.. ..... |.|+.|...|...-+.-.|...|
T Consensus 364 ~Cf~CQ~~fp~~~~~~~~~~--~ss~r-Y~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 364 HCFVCQGPFPKPPVSPFDES--TSSGR-YQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceeccCCCCCCCCCccccc--ccccc-eechhhhhhhhhhhHHHHHHHHh
Confidence 36666666654321110011 11234 89999988888877777776655
No 103
>PHA00626 hypothetical protein
Probab=60.09 E-value=4.5 Score=19.80 Aligned_cols=17 Identities=18% Similarity=0.393 Sum_probs=13.2
Q ss_pred CceeeccchhhhccCch
Q psy3491 47 SALFMCESCHKTYKAKG 63 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~ 63 (108)
...|.|+.||..|+...
T Consensus 21 snrYkCkdCGY~ft~~~ 37 (59)
T PHA00626 21 SDDYVCCDCGYNDSKDA 37 (59)
T ss_pred CcceEcCCCCCeechhh
Confidence 34699999998887643
No 104
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=59.81 E-value=7 Score=26.19 Aligned_cols=58 Identities=22% Similarity=0.480 Sum_probs=45.9
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhcCCCCccCCC--CccccCCchhHHHHHHHhcCCC
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPF--CPYKSKHKSNLKTHMAIRHQNG 105 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~--C~~~f~~~~~l~~H~~~h~~~k 105 (108)
..++.|..|...|........|.+.+.+..+ ..|.. |...+.....+.+|.+.++...
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (467)
T COG5048 31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKP-SQCSYSGCDKSFSRPLELSRHLRTHHNNP 90 (467)
T ss_pred CchhhcccccccccccchhhhhcccccccCC-ccccccccccccCCcchhhhhcccccccc
Confidence 4568899999999999889999999988776 88854 5567777778888887776643
No 105
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=58.70 E-value=6 Score=16.04 Aligned_cols=11 Identities=36% Similarity=0.936 Sum_probs=6.0
Q ss_pred ccCCCCccccC
Q psy3491 79 LQCPFCPYKSK 89 (108)
Q Consensus 79 ~~c~~C~~~f~ 89 (108)
-.|+.||..|.
T Consensus 15 ~~Cp~CG~~F~ 25 (26)
T PF10571_consen 15 KFCPHCGYDFE 25 (26)
T ss_pred CcCCCCCCCCc
Confidence 34566665553
No 106
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=57.65 E-value=5.2 Score=20.87 Aligned_cols=31 Identities=13% Similarity=0.464 Sum_probs=19.0
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
.|.|..||+.| ++.+++. . .+.-.|+.|+-.
T Consensus 12 ~Y~c~~cg~~~----dvvq~~~---d-dplt~ce~c~a~ 42 (82)
T COG2331 12 SYECTECGNRF----DVVQAMT---D-DPLTTCEECGAR 42 (82)
T ss_pred EEeecccchHH----HHHHhcc---c-CccccChhhChH
Confidence 48899998776 3333332 2 333678888753
No 107
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=56.53 E-value=5 Score=19.32 Aligned_cols=39 Identities=26% Similarity=0.701 Sum_probs=21.3
Q ss_pred eeeccc--hhhhccCchHHHHHHhHHhcCCCCccCCC----CccccC
Q psy3491 49 LFMCES--CHKTYKAKGSLERHKKFECGKEAGLQCPF----CPYKSK 89 (108)
Q Consensus 49 ~~~C~~--c~~~f~~~~~l~~h~~~~~~~~~~~~c~~----C~~~f~ 89 (108)
+-.|+. |...+. ...|..|....-.... ..|.. |+..+.
T Consensus 9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~-v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 9 PVPCPNGCCNEMIP-RKELDDHLENECPKRP-VPCPYSPYGCKERVP 53 (60)
T ss_dssp EEE-TT--S-BEEE-CCCHHHHHHTTSTTSE-EE-SS----S--EEE
T ss_pred EeeCCCCCccccee-HHHHHHHHHccCCCCc-EECCCCCCCCCCccc
Confidence 567776 444344 4458888886666555 77877 766543
No 108
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=54.67 E-value=4.8 Score=23.37 Aligned_cols=15 Identities=27% Similarity=0.769 Sum_probs=11.9
Q ss_pred ceeeccchhhhccCc
Q psy3491 48 ALFMCESCHKTYKAK 62 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~ 62 (108)
..+.|..||..|...
T Consensus 69 ~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 69 AVLKCRNCGNEWSLK 83 (135)
T ss_pred eEEECCCCCCEEecc
Confidence 458999999887654
No 109
>KOG2071|consensus
Probab=54.04 E-value=7.5 Score=28.17 Aligned_cols=28 Identities=14% Similarity=0.320 Sum_probs=19.7
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhc
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
..|..|..||.+|........|+..|..
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~d 443 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHDD 443 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhhh
Confidence 4567788888888877766666666543
No 110
>KOG4167|consensus
Probab=53.98 E-value=2.6 Score=31.32 Aligned_cols=25 Identities=20% Similarity=0.274 Sum_probs=21.8
Q ss_pred ccCCCCccccCCchhHHHHHHHhcC
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
|.|.+|+|+|..-.+++.|+++|-.
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHHH
Confidence 8999999999888888899988754
No 111
>KOG2231|consensus
Probab=52.67 E-value=14 Score=27.48 Aligned_cols=48 Identities=27% Similarity=0.596 Sum_probs=32.1
Q ss_pred eccchhhhccCchHHHHHHhHHhcCCCCccCCCC------ccccCCchhHHHHHHHhcC
Q psy3491 51 MCESCHKTYKAKGSLERHKKFECGKEAGLQCPFC------PYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 51 ~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C------~~~f~~~~~l~~H~~~h~~ 103 (108)
.|..|...|.....|..|++.++ |.|..| +.-|.....|..|-+..|-
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~Hf 237 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHF 237 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcCc
Confidence 57788888888888887777544 444444 2346677777777776653
No 112
>PF10537 WAC_Acf1_DNA_bd: ATP-utilising chromatin assembly and remodelling N-terminal; InterPro: IPR013136 ACF (for ATP-utilising chromatin assembly and remodeling factor) is a chromatin-remodeling complex that catalyzes the ATP-dependent assembly of periodic nucleosome arrays. This reaction utilises the energy of ATP hydrolysis by ISWI, the smaller of the two subunits of ACF. Acf1, the large subunit of ACF, is essential for the full activity of the complex. The WAC (WSTF/Acf1/cbp146) domain is an ~110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. It is found in association with other domains such as the bromodomain, the PHD-type zinc finger, DDT or WAKS. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA. It seems probable that the WAC domain will be involved in DNA binding in other related factors [, ]. Some proteins known to contain a WAC domain are the Drosophila melanogaster (Fruit fly) ATP-dependent chromatin assembly factor large subunit Acf1, human WSTF (Williams syndrome transcription factor), mouse cbp146, yeast imitation switch two complex protein 1 (ITC1 or YGL133w), and yeast protein YPL216w.
Probab=51.75 E-value=22 Score=19.69 Aligned_cols=38 Identities=5% Similarity=0.080 Sum_probs=27.9
Q ss_pred ceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
..|.|+.-|..|.........+..... .. +.|...|+.
T Consensus 2 eVf~i~~T~EiF~dYe~Y~~R~~~y~~-~v-wtC~~TGk~ 39 (102)
T PF10537_consen 2 EVFYIPFTGEIFRDYEEYLKRMILYNQ-RV-WTCEITGKS 39 (102)
T ss_pred ceEEeCCCCcccCCHHHHHHHHHHHhC-Ce-eEEecCCCC
Confidence 357788888888888777777666654 44 888887775
No 113
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=51.09 E-value=6 Score=16.71 Aligned_cols=7 Identities=57% Similarity=1.312 Sum_probs=4.1
Q ss_pred ccCCCCc
Q psy3491 79 LQCPFCP 85 (108)
Q Consensus 79 ~~c~~C~ 85 (108)
+.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 5666654
No 114
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=50.66 E-value=7.2 Score=17.42 Aligned_cols=13 Identities=23% Similarity=0.649 Sum_probs=10.4
Q ss_pred eeeccchhhhccC
Q psy3491 49 LFMCESCHKTYKA 61 (108)
Q Consensus 49 ~~~C~~c~~~f~~ 61 (108)
|+.|..|++.|-.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 7899999888754
No 115
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=50.61 E-value=6.3 Score=22.82 Aligned_cols=15 Identities=13% Similarity=0.583 Sum_probs=12.8
Q ss_pred eeeccchhhhccCch
Q psy3491 49 LFMCESCHKTYKAKG 63 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~ 63 (108)
|+.|..||+.|...+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 689999999998765
No 116
>KOG3408|consensus
Probab=50.61 E-value=14 Score=21.12 Aligned_cols=26 Identities=23% Similarity=0.470 Sum_probs=22.4
Q ss_pred CCceeeccchhhhccCchHHHHHHhH
Q psy3491 46 QSALFMCESCHKTYKAKGSLERHKKF 71 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h~~~ 71 (108)
|.-.|-|-.|.+-|.....|..|.+.
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 55579999999999999999988764
No 117
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=47.10 E-value=12 Score=17.10 Aligned_cols=14 Identities=21% Similarity=0.524 Sum_probs=8.6
Q ss_pred ccCCCCccccCCch
Q psy3491 79 LQCPFCPYKSKHKS 92 (108)
Q Consensus 79 ~~c~~C~~~f~~~~ 92 (108)
-.|.+|++.|...-
T Consensus 9 K~C~~C~rpf~WRK 22 (42)
T PF10013_consen 9 KICPVCGRPFTWRK 22 (42)
T ss_pred CcCcccCCcchHHH
Confidence 34677777776543
No 118
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=46.79 E-value=5.1 Score=23.06 Aligned_cols=15 Identities=13% Similarity=0.459 Sum_probs=9.8
Q ss_pred ceeeccchhhhccCc
Q psy3491 48 ALFMCESCHKTYKAK 62 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~ 62 (108)
.|.-|..||+.|...
T Consensus 67 ~psfchncgs~fpwt 81 (160)
T COG4306 67 PPSFCHNCGSRFPWT 81 (160)
T ss_pred CcchhhcCCCCCCcH
Confidence 355677777777654
No 119
>KOG2636|consensus
Probab=46.70 E-value=15 Score=25.91 Aligned_cols=29 Identities=21% Similarity=0.434 Sum_probs=21.6
Q ss_pred hHHhcCCCCccCCCCc-cccCCchhHHHHHH
Q psy3491 70 KFECGKEAGLQCPFCP-YKSKHKSNLKTHMA 99 (108)
Q Consensus 70 ~~~~~~~~~~~c~~C~-~~f~~~~~l~~H~~ 99 (108)
+.|.-+.. |.|.+|| .++.-+..+.+|-.
T Consensus 394 KLHGL~~e-y~CEICGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 394 KLHGLDIE-YNCEICGNYVYKGRKAFDRHFN 423 (497)
T ss_pred hhcCCCcc-cceeeccCccccCcHHHHHHhH
Confidence 34444565 9999999 88888888888853
No 120
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=46.69 E-value=11 Score=21.20 Aligned_cols=25 Identities=24% Similarity=0.553 Sum_probs=17.8
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
...|..||..|..... . +.|+.||.
T Consensus 70 ~~~C~~Cg~~~~~~~~------------~-~~CP~Cgs 94 (115)
T TIGR00100 70 ECECEDCSEEVSPEID------------L-YRCPKCHG 94 (115)
T ss_pred EEEcccCCCEEecCCc------------C-ccCcCCcC
Confidence 4889999977755421 3 77999974
No 121
>KOG2807|consensus
Probab=46.48 E-value=25 Score=23.87 Aligned_cols=16 Identities=31% Similarity=0.636 Sum_probs=12.2
Q ss_pred cCCCCC-CCcChhHHHH
Q psy3491 13 VDPVCP-AFVDAPWLYF 28 (108)
Q Consensus 13 ~~~~c~-~~~~~~~~~~ 28 (108)
+||+|. .+..+++|..
T Consensus 292 eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 292 ECPICSLTLVSSPHLAR 308 (378)
T ss_pred cCCccceeEecchHHHH
Confidence 788888 7887777743
No 122
>KOG0717|consensus
Probab=46.37 E-value=11 Score=26.63 Aligned_cols=21 Identities=33% Similarity=0.545 Sum_probs=18.6
Q ss_pred ccCCCCccccCCchhHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMA 99 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~ 99 (108)
+.|.+|.++|.+.-.+..|..
T Consensus 293 lyC~vCnKsFKseKq~kNHEn 313 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHEN 313 (508)
T ss_pred eEEeeccccccchHHHHhhHH
Confidence 789999999999988888865
No 123
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=45.47 E-value=10 Score=20.97 Aligned_cols=15 Identities=13% Similarity=0.527 Sum_probs=12.6
Q ss_pred eeeccchhhhccCch
Q psy3491 49 LFMCESCHKTYKAKG 63 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~ 63 (108)
|+.|..||..|..-+
T Consensus 2 pH~CtrCG~vf~~g~ 16 (112)
T COG3364 2 PHQCTRCGEVFDDGS 16 (112)
T ss_pred Cceeccccccccccc
Confidence 689999999998754
No 124
>KOG2231|consensus
Probab=45.45 E-value=42 Score=25.10 Aligned_cols=52 Identities=25% Similarity=0.382 Sum_probs=32.0
Q ss_pred CccCCCCC-CCcChhHHHHHhhhcCCCCCccccccCCCceeeccch------hhhccCchHHHHHHhHHhcCCCCccCC
Q psy3491 11 NIVDPVCP-AFVDAPWLYFLLYSFSSGASFSSFKPLQSALFMCESC------HKTYKAKGSLERHKKFECGKEAGLQCP 82 (108)
Q Consensus 11 ~~~~~~c~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c------~~~f~~~~~l~~h~~~~~~~~~~~~c~ 82 (108)
.++|..|. .|.+...+..|+... -+.|..| +..|....+|..|-+..+ |.|.
T Consensus 182 hp~C~~C~~~fld~~el~rH~~~~---------------h~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE 240 (669)
T KOG2231|consen 182 HPLCKFCHERFLDDDELYRHLRFD---------------HEFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCE 240 (669)
T ss_pred CccchhhhhhhccHHHHHHhhccc---------------eeheeecCcccccchhcccchHHHHHhhhcC-----cccc
Confidence 35777777 677766666666532 1334433 456667778888876543 6665
No 125
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=44.91 E-value=18 Score=25.69 Aligned_cols=22 Identities=27% Similarity=0.471 Sum_probs=11.9
Q ss_pred ccCCCCccccCCchhHHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAI 100 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~ 100 (108)
+.|+.|.+.|.....+..|+..
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~ 79 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQ 79 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHH
Confidence 4455555555555555555543
No 126
>KOG0978|consensus
Probab=43.84 E-value=2.1 Score=31.52 Aligned_cols=15 Identities=20% Similarity=0.390 Sum_probs=9.4
Q ss_pred ccCCCCccccCCchh
Q psy3491 79 LQCPFCPYKSKHKSN 93 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~ 93 (108)
-.||.|+..|...+.
T Consensus 679 RKCP~Cn~aFganDv 693 (698)
T KOG0978|consen 679 RKCPKCNAAFGANDV 693 (698)
T ss_pred CCCCCCCCCCCcccc
Confidence 567777777665543
No 127
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=43.81 E-value=15 Score=20.66 Aligned_cols=25 Identities=24% Similarity=0.554 Sum_probs=17.3
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
...|..||..|.... .. +.|+.||.
T Consensus 70 ~~~C~~Cg~~~~~~~------------~~-~~CP~Cgs 94 (113)
T PRK12380 70 QAWCWDCSQVVEIHQ------------HD-AQCPHCHG 94 (113)
T ss_pred EEEcccCCCEEecCC------------cC-ccCcCCCC
Confidence 478999997665432 22 67999984
No 128
>KOG2071|consensus
Probab=42.68 E-value=19 Score=26.25 Aligned_cols=27 Identities=22% Similarity=0.422 Sum_probs=21.6
Q ss_pred CCCccCCCCccccCCchhHHHHHHHhcC
Q psy3491 76 EAGLQCPFCPYKSKHKSNLKTHMAIRHQ 103 (108)
Q Consensus 76 ~~~~~c~~C~~~f~~~~~l~~H~~~h~~ 103 (108)
.+ -.|..||..|.......+|+.+|-.
T Consensus 417 ~p-nqC~~CG~R~~~~ee~sk~md~H~d 443 (579)
T KOG2071|consen 417 SP-NQCKSCGLRFDDSEERSKHMDIHDD 443 (579)
T ss_pred Cc-chhcccccccccchhhhhHhhhhhh
Confidence 44 8999999999988888877776643
No 129
>KOG2482|consensus
Probab=42.62 E-value=16 Score=24.92 Aligned_cols=51 Identities=25% Similarity=0.441 Sum_probs=35.7
Q ss_pred eeeccchhhhccC-chHHHHHHhHHhc----------------------CCCCccCCCCccccCCchhHHHHHHH
Q psy3491 49 LFMCESCHKTYKA-KGSLERHKKFECG----------------------KEAGLQCPFCPYKSKHKSNLKTHMAI 100 (108)
Q Consensus 49 ~~~C~~c~~~f~~-~~~l~~h~~~~~~----------------------~~~~~~c~~C~~~f~~~~~l~~H~~~ 100 (108)
..+|-.|+..+.. .+....|+-.-++ -.. +.|-.|.+.|.-+..|+.|+|.
T Consensus 144 slqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r-~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 144 SLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLER-LRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred eeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhh-heeeeeccccCCcHHHHHHHHh
Confidence 4578888766653 4555556433222 123 8899999999999999999984
No 130
>PF01428 zf-AN1: AN1-like Zinc finger; InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include: Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=42.11 E-value=9.4 Score=17.31 Aligned_cols=14 Identities=29% Similarity=0.726 Sum_probs=8.8
Q ss_pred ceeeccchhhhccC
Q psy3491 48 ALFMCESCHKTYKA 61 (108)
Q Consensus 48 ~~~~C~~c~~~f~~ 61 (108)
-|+.|..|+..|=.
T Consensus 12 ~~~~C~~C~~~FC~ 25 (43)
T PF01428_consen 12 LPFKCKHCGKSFCL 25 (43)
T ss_dssp SHEE-TTTS-EE-T
T ss_pred CCeECCCCCcccCc
Confidence 47899999988854
No 131
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=41.09 E-value=10 Score=18.05 Aligned_cols=12 Identities=25% Similarity=0.728 Sum_probs=6.3
Q ss_pred cCCCCccccCCc
Q psy3491 80 QCPFCPYKSKHK 91 (108)
Q Consensus 80 ~c~~C~~~f~~~ 91 (108)
.|++|++.|...
T Consensus 22 ~CPlC~r~l~~e 33 (54)
T PF04423_consen 22 CCPLCGRPLDEE 33 (54)
T ss_dssp E-TTT--EE-HH
T ss_pred cCCCCCCCCCHH
Confidence 899999988643
No 132
>PF15269 zf-C2H2_7: Zinc-finger
Probab=40.89 E-value=23 Score=16.49 Aligned_cols=16 Identities=31% Similarity=0.600 Sum_probs=6.2
Q ss_pred eeccchhhhccCchHH
Q psy3491 50 FMCESCHKTYKAKGSL 65 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l 65 (108)
|.|-.|.......+.|
T Consensus 21 ykcfqcpftc~~kshl 36 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHL 36 (54)
T ss_pred ceeecCCcccchHHHH
Confidence 3444443333333333
No 133
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=40.12 E-value=25 Score=16.72 Aligned_cols=14 Identities=29% Similarity=0.786 Sum_probs=10.6
Q ss_pred eeeccchhhhccCc
Q psy3491 49 LFMCESCHKTYKAK 62 (108)
Q Consensus 49 ~~~C~~c~~~f~~~ 62 (108)
-+.|..||..|...
T Consensus 28 ~W~C~~Cgh~w~~~ 41 (55)
T PF14311_consen 28 WWKCPKCGHEWKAS 41 (55)
T ss_pred EEECCCCCCeeEcc
Confidence 48899998777654
No 134
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=40.11 E-value=26 Score=15.12 Aligned_cols=21 Identities=19% Similarity=0.475 Sum_probs=11.5
Q ss_pred eeeccchhhhccCchHHHHHHh
Q psy3491 49 LFMCESCHKTYKAKGSLERHKK 70 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~ 70 (108)
-+.|+.|++..... .+..|+.
T Consensus 4 ~~~C~nC~R~v~a~-RfA~HLe 24 (33)
T PF08209_consen 4 YVECPNCGRPVAAS-RFAPHLE 24 (33)
T ss_dssp EEE-TTTSSEEEGG-GHHHHHH
T ss_pred eEECCCCcCCcchh-hhHHHHH
Confidence 36777777655443 3555544
No 135
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=39.51 E-value=33 Score=23.55 Aligned_cols=24 Identities=21% Similarity=0.497 Sum_probs=19.1
Q ss_pred eccchhhhccCchHHHHHHhHHhc
Q psy3491 51 MCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 51 ~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
.|..|...|-....|..|.+..+.
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE 245 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE 245 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh
Confidence 488888888888888888876553
No 136
>PF12907 zf-met2: Zinc-binding
Probab=38.71 E-value=7.9 Score=17.55 Aligned_cols=24 Identities=21% Similarity=0.562 Sum_probs=11.6
Q ss_pred eccchhhhccC---chHHHHHHhHHhc
Q psy3491 51 MCESCHKTYKA---KGSLERHKKFECG 74 (108)
Q Consensus 51 ~C~~c~~~f~~---~~~l~~h~~~~~~ 74 (108)
.|..|...|.. ...|..|...-+.
T Consensus 3 ~C~iC~qtF~~t~~~~~L~eH~enKHp 29 (40)
T PF12907_consen 3 ICKICRQTFMQTTNEPQLKEHAENKHP 29 (40)
T ss_pred CcHHhhHHHHhcCCHHHHHHHHHccCC
Confidence 46666644432 3345555544433
No 137
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=37.12 E-value=19 Score=20.34 Aligned_cols=27 Identities=22% Similarity=0.477 Sum_probs=17.0
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
.+.|..||..|.... .....|+.||..
T Consensus 71 ~~~C~~Cg~~~~~~~------------~~~~~CP~Cgs~ 97 (117)
T PRK00564 71 ELECKDCSHVFKPNA------------LDYGVCEKCHSK 97 (117)
T ss_pred EEEhhhCCCccccCC------------ccCCcCcCCCCC
Confidence 488999996665432 121349989843
No 138
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=36.45 E-value=20 Score=17.83 Aligned_cols=12 Identities=25% Similarity=0.941 Sum_probs=8.2
Q ss_pred ccCCCCccccCC
Q psy3491 79 LQCPFCPYKSKH 90 (108)
Q Consensus 79 ~~c~~C~~~f~~ 90 (108)
+.|+.||.....
T Consensus 47 ~~C~~Cg~~~~r 58 (69)
T PF07282_consen 47 FTCPNCGFEMDR 58 (69)
T ss_pred EEcCCCCCEECc
Confidence 778778766543
No 139
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=36.07 E-value=16 Score=18.16 Aligned_cols=11 Identities=27% Similarity=0.646 Sum_probs=3.8
Q ss_pred eeccchhhhcc
Q psy3491 50 FMCESCHKTYK 60 (108)
Q Consensus 50 ~~C~~c~~~f~ 60 (108)
-.|..|++.|.
T Consensus 10 ~~C~~C~~~F~ 20 (69)
T PF01363_consen 10 SNCMICGKKFS 20 (69)
T ss_dssp SB-TTT--B-B
T ss_pred CcCcCcCCcCC
Confidence 35666666663
No 140
>PLN02294 cytochrome c oxidase subunit Vb
Probab=35.96 E-value=16 Score=22.24 Aligned_cols=17 Identities=18% Similarity=0.618 Sum_probs=13.2
Q ss_pred CCceeeccchhhhccCc
Q psy3491 46 QSALFMCESCHKTYKAK 62 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~ 62 (108)
..+|..|+.||..|.-.
T Consensus 138 kGkp~RCpeCG~~fkL~ 154 (174)
T PLN02294 138 KGKSFECPVCTQYFELE 154 (174)
T ss_pred CCCceeCCCCCCEEEEE
Confidence 35688999999888654
No 141
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=34.80 E-value=13 Score=23.18 Aligned_cols=21 Identities=19% Similarity=0.502 Sum_probs=0.0
Q ss_pred CCCccCCCCc-cccCCchhHHHH
Q psy3491 76 EAGLQCPFCP-YKSKHKSNLKTH 97 (108)
Q Consensus 76 ~~~~~c~~C~-~~f~~~~~l~~H 97 (108)
.. |.|.+|| .+|.-+..+.+|
T Consensus 100 ~e-y~CEICGN~~Y~GrkaFekH 121 (196)
T PF11931_consen 100 VE-YKCEICGNQSYKGRKAFEKH 121 (196)
T ss_dssp -----------------------
T ss_pred Ce-eeeEeCCCcceecHHHHHHh
Confidence 44 7777776 344445555555
No 142
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly.
Probab=33.77 E-value=19 Score=19.75 Aligned_cols=15 Identities=20% Similarity=0.483 Sum_probs=12.3
Q ss_pred CceeeccchhhhccC
Q psy3491 47 SALFMCESCHKTYKA 61 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~ 61 (108)
.++..|..||..|.-
T Consensus 77 g~~~rC~eCG~~fkL 91 (97)
T cd00924 77 GKPKRCPECGHVFKL 91 (97)
T ss_pred CCceeCCCCCcEEEE
Confidence 368999999988864
No 143
>KOG1842|consensus
Probab=32.80 E-value=32 Score=24.37 Aligned_cols=26 Identities=23% Similarity=0.345 Sum_probs=17.9
Q ss_pred eeeccchhhhccCchHHHHHHhHHhc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECG 74 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~ 74 (108)
-|.|+.|..-|...+.|..|....++
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~ 40 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHF 40 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhcc
Confidence 47777777777777777777655443
No 144
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=31.51 E-value=40 Score=24.07 Aligned_cols=26 Identities=23% Similarity=0.511 Sum_probs=21.9
Q ss_pred ceeeccchhhhccCchHHHHHHhHHh
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFEC 73 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~ 73 (108)
+=+.|+.|.+.|.....+..|+...+
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH 81 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEH 81 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhh
Confidence 35789999999999999999987543
No 145
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=31.43 E-value=23 Score=19.83 Aligned_cols=26 Identities=19% Similarity=0.570 Sum_probs=16.4
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
...|..||..|.... .. +.|+.||..
T Consensus 70 ~~~C~~Cg~~~~~~~------------~~-~~CP~Cgs~ 95 (113)
T PF01155_consen 70 RARCRDCGHEFEPDE------------FD-FSCPRCGSP 95 (113)
T ss_dssp EEEETTTS-EEECHH------------CC-HH-SSSSSS
T ss_pred cEECCCCCCEEecCC------------CC-CCCcCCcCC
Confidence 478999998886543 12 558888754
No 146
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=31.37 E-value=38 Score=15.92 Aligned_cols=8 Identities=38% Similarity=1.182 Sum_probs=3.2
Q ss_pred ccchhhhc
Q psy3491 52 CESCHKTY 59 (108)
Q Consensus 52 C~~c~~~f 59 (108)
|..|++.|
T Consensus 5 C~~C~~~F 12 (57)
T cd00065 5 CMGCGKPF 12 (57)
T ss_pred CcccCccc
Confidence 33444333
No 147
>COG4338 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.24 E-value=17 Score=17.21 Aligned_cols=12 Identities=25% Similarity=0.706 Sum_probs=7.0
Q ss_pred cCCCCccccCCc
Q psy3491 80 QCPFCPYKSKHK 91 (108)
Q Consensus 80 ~c~~C~~~f~~~ 91 (108)
-|++|++.|...
T Consensus 14 ICpvCqRPFsWR 25 (54)
T COG4338 14 ICPVCQRPFSWR 25 (54)
T ss_pred hhhhhcCchHHH
Confidence 356666666544
No 148
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=30.61 E-value=18 Score=16.38 Aligned_cols=10 Identities=20% Similarity=0.617 Sum_probs=4.9
Q ss_pred eeeccchhhh
Q psy3491 49 LFMCESCHKT 58 (108)
Q Consensus 49 ~~~C~~c~~~ 58 (108)
+-.|+.|+..
T Consensus 29 ~~~CpYCg~~ 38 (40)
T PF10276_consen 29 PVVCPYCGTR 38 (40)
T ss_dssp EEEETTTTEE
T ss_pred eEECCCCCCE
Confidence 3455555543
No 149
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=30.58 E-value=6.9 Score=23.37 Aligned_cols=15 Identities=33% Similarity=0.600 Sum_probs=10.3
Q ss_pred ceeeccchhhhccCc
Q psy3491 48 ALFMCESCHKTYKAK 62 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~ 62 (108)
.|.-|..||+.|++.
T Consensus 67 ~PsYC~~CGkpyPWt 81 (158)
T PF10083_consen 67 APSYCHNCGKPYPWT 81 (158)
T ss_pred CChhHHhCCCCCchH
Confidence 466677777777764
No 150
>KOG3014|consensus
Probab=30.50 E-value=45 Score=21.73 Aligned_cols=32 Identities=22% Similarity=0.368 Sum_probs=23.4
Q ss_pred ccCCCcee---eccchhhhccC--chHHHHHHhHHhc
Q psy3491 43 KPLQSALF---MCESCHKTYKA--KGSLERHKKFECG 74 (108)
Q Consensus 43 ~~~~~~~~---~C~~c~~~f~~--~~~l~~h~~~~~~ 74 (108)
...|.+-| .|..||..|.. ..+...|.+.|+.
T Consensus 28 ld~Gqk~fg~~~C~~Cgm~Yt~~s~EDe~~H~~fH~~ 64 (257)
T KOG3014|consen 28 LDAGQKDFGAVKCKECGMKYTVTSPEDEALHEKFHNR 64 (257)
T ss_pred eecCccccCceehhhcCceecCCCHHHHHHHHHHHHh
Confidence 34566555 79999998875 4567779888874
No 151
>COG1773 Rubredoxin [Energy production and conversion]
Probab=30.06 E-value=20 Score=17.50 Aligned_cols=14 Identities=21% Similarity=0.672 Sum_probs=11.2
Q ss_pred eeeccchhhhccCc
Q psy3491 49 LFMCESCHKTYKAK 62 (108)
Q Consensus 49 ~~~C~~c~~~f~~~ 62 (108)
.|+|..||..|...
T Consensus 3 ~~~C~~CG~vYd~e 16 (55)
T COG1773 3 RWRCSVCGYVYDPE 16 (55)
T ss_pred ceEecCCceEeccc
Confidence 48899999888654
No 152
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=29.83 E-value=26 Score=15.47 Aligned_cols=12 Identities=25% Similarity=0.827 Sum_probs=6.5
Q ss_pred ceeeccchhhhc
Q psy3491 48 ALFMCESCHKTY 59 (108)
Q Consensus 48 ~~~~C~~c~~~f 59 (108)
+-|.|..||+..
T Consensus 5 ~~YkC~~CGniV 16 (36)
T PF06397_consen 5 EFYKCEHCGNIV 16 (36)
T ss_dssp EEEE-TTT--EE
T ss_pred cEEEccCCCCEE
Confidence 358899998764
No 153
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the KOG1842|consensus
Probab=29.51 E-value=34 Score=24.27 Aligned_cols=26 Identities=27% Similarity=0.636 Sum_probs=22.9
Q ss_pred ccCCCCccccCCchhHHHHHHHhcCC
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAIRHQN 104 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~h~~~ 104 (108)
|.|+.|..-|.....|..|...-|.+
T Consensus 16 flCPiC~~dl~~~~~L~~H~d~eH~~ 41 (505)
T KOG1842|consen 16 FLCPICLLDLPNLSALNDHLDVEHFE 41 (505)
T ss_pred ccCchHhhhhhhHHHHHHHHhhhccc
Confidence 99999999999999999998765554
No 155
>PLN02748 tRNA dimethylallyltransferase
Probab=28.40 E-value=48 Score=23.68 Aligned_cols=23 Identities=13% Similarity=0.264 Sum_probs=18.7
Q ss_pred CCCccCCCCcc-ccCCchhHHHHHH
Q psy3491 76 EAGLQCPFCPY-KSKHKSNLKTHMA 99 (108)
Q Consensus 76 ~~~~~c~~C~~-~f~~~~~l~~H~~ 99 (108)
+. |.|+.|++ ++.....+..|+.
T Consensus 417 ~~-~~Ce~C~~~~~~G~~eW~~Hlk 440 (468)
T PLN02748 417 TQ-YVCEACGNKVLRGAHEWEQHKQ 440 (468)
T ss_pred cc-ccccCCCCcccCCHHHHHHHhc
Confidence 44 88999997 7888888888875
No 156
>PHA02998 RNA polymerase subunit; Provisional
Probab=27.88 E-value=20 Score=22.07 Aligned_cols=38 Identities=16% Similarity=0.238 Sum_probs=21.8
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCC---ccCCCCccccCCc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAG---LQCPFCPYKSKHK 91 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~---~~c~~C~~~f~~~ 91 (108)
-.|+.|+..=..... +++...++++ |.|..||..|..+
T Consensus 144 v~CPkCg~~~A~f~q----lQTRSADEPmT~FYkC~~CG~~wkpp 184 (195)
T PHA02998 144 TPCPNCKSKNTTPMM----IQTRAADEPPLVRHACRDCKKHFKPP 184 (195)
T ss_pred CCCCCCCCCceEEEE----EeeccCCCCceEEEEcCCCCCccCCc
Confidence 578888744333322 2233334443 7899999887654
No 157
>PRK05452 anaerobic nitric oxide reductase flavorubredoxin; Provisional
Probab=27.81 E-value=21 Score=25.39 Aligned_cols=41 Identities=15% Similarity=0.360 Sum_probs=23.0
Q ss_pred CCceeeccchhhhccCchHHHHH-Hh----HHhcCCCCccCCCCccc
Q psy3491 46 QSALFMCESCHKTYKAKGSLERH-KK----FECGKEAGLQCPFCPYK 87 (108)
Q Consensus 46 ~~~~~~C~~c~~~f~~~~~l~~h-~~----~~~~~~~~~~c~~C~~~ 87 (108)
....|.|..||..|.....-... .. ...--.. +.|++|+..
T Consensus 422 ~~~~~~c~~c~~~yd~~~g~~~~~~~~gt~~~~lp~~-~~cp~c~~~ 467 (479)
T PRK05452 422 LGPRMQCSVCQWIYDPAKGEPMQDVAPGTPWSEVPDN-FLCPECSLG 467 (479)
T ss_pred CCCeEEECCCCeEECCCCCCcccCCCCCCChhhCCCC-CcCcCCCCc
Confidence 34579999999988764211110 00 1011123 899999843
No 158
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=26.71 E-value=37 Score=19.59 Aligned_cols=14 Identities=14% Similarity=0.399 Sum_probs=11.2
Q ss_pred CCCCccCCCCccccC
Q psy3491 75 KEAGLQCPFCPYKSK 89 (108)
Q Consensus 75 ~~~~~~c~~C~~~f~ 89 (108)
... |.|..|++.|.
T Consensus 51 ~qR-yrC~~C~~tf~ 64 (129)
T COG3677 51 HQR-YKCKSCGSTFT 64 (129)
T ss_pred ccc-cccCCcCccee
Confidence 455 99999999875
No 159
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=26.63 E-value=33 Score=16.31 Aligned_cols=12 Identities=25% Similarity=0.733 Sum_probs=8.3
Q ss_pred ccCCCCccccCC
Q psy3491 79 LQCPFCPYKSKH 90 (108)
Q Consensus 79 ~~c~~C~~~f~~ 90 (108)
|.|..||..+..
T Consensus 2 y~C~~CgyiYd~ 13 (50)
T cd00730 2 YECRICGYIYDP 13 (50)
T ss_pred cCCCCCCeEECC
Confidence 677778776653
No 160
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=26.25 E-value=25 Score=16.45 Aligned_cols=36 Identities=22% Similarity=0.531 Sum_probs=17.7
Q ss_pred eeccchhhhccCchHHHHH-----HhHHhcCCCCccCCCCcc
Q psy3491 50 FMCESCHKTYKAKGSLERH-----KKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h-----~~~~~~~~~~~~c~~C~~ 86 (108)
|.|..||..+.....-..+ .....--.. +.|++|+.
T Consensus 2 y~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~-w~CP~C~a 42 (47)
T PF00301_consen 2 YQCPVCGYVYDPEKGDPENGIPPGTPFEDLPDD-WVCPVCGA 42 (47)
T ss_dssp EEETTTSBEEETTTBBGGGTB-TT--GGGS-TT--B-TTTSS
T ss_pred cCCCCCCEEEcCCcCCcccCcCCCCCHHHCCCC-CcCcCCCC
Confidence 7899999888654321100 111111233 89999974
No 161
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=26.21 E-value=43 Score=20.55 Aligned_cols=32 Identities=28% Similarity=0.767 Sum_probs=19.3
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
..-|.|+.|...++... .+..+ |.|+.||...
T Consensus 111 ~~~y~C~~~~~r~sfde------A~~~~----F~Cp~Cg~~L 142 (176)
T COG1675 111 NNYYVCPNCHVKYSFDE------AMELG----FTCPKCGEDL 142 (176)
T ss_pred CCceeCCCCCCcccHHH------HHHhC----CCCCCCCchh
Confidence 33488887776654433 22222 8899998653
No 162
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=26.01 E-value=41 Score=23.16 Aligned_cols=28 Identities=18% Similarity=0.438 Sum_probs=19.4
Q ss_pred hHHhcCCCCccCCCCc-cccCCchhHHHHH
Q psy3491 70 KFECGKEAGLQCPFCP-YKSKHKSNLKTHM 98 (108)
Q Consensus 70 ~~~~~~~~~~~c~~C~-~~f~~~~~l~~H~ 98 (108)
+.|.-++. |.|.+|| .++.-+..+.+|-
T Consensus 367 klhgLd~e-f~CEICgNyvy~GR~~FdrHF 395 (470)
T COG5188 367 KLHGLDIE-FECEICGNYVYYGRDRFDRHF 395 (470)
T ss_pred HhcCCCcc-eeeeecccccccchHHHHhhh
Confidence 34444566 9999998 7777777777664
No 163
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=25.14 E-value=24 Score=19.13 Aligned_cols=28 Identities=18% Similarity=0.497 Sum_probs=19.1
Q ss_pred CceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCc
Q psy3491 47 SALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCP 85 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~ 85 (108)
..|-.|..||..|.... .+.|-.|+.|.
T Consensus 56 v~Pa~CkkCGfef~~~~-----------ik~pSRCP~CK 83 (97)
T COG3357 56 VRPARCKKCGFEFRDDK-----------IKKPSRCPKCK 83 (97)
T ss_pred ecChhhcccCccccccc-----------cCCcccCCcch
Confidence 45789999998886621 12237888774
No 164
>PF01215 COX5B: Cytochrome c oxidase subunit Vb This family consists of chains F and S ; InterPro: IPR002124 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits, which is known as Vb in mammals, V in Dictyostelium discoideum (Slime mold) and IV in yeast, binds a zinc atom. The sequence of subunit Vb is well conserved and includes three conserved cysteines that coordinate the zinc ion [, ]. Two of these cysteines are clustered in the C-terminal section of the subunit.; GO: 0004129 cytochrome-c oxidase activity, 0005740 mitochondrial envelope; PDB: 2EIL_S 2ZXW_S 3ASN_S 1OCO_S 3AG4_S 3ABK_S 1OCZ_S 1OCC_F 3ASO_S 3ABL_S ....
Probab=25.05 E-value=29 Score=20.32 Aligned_cols=16 Identities=25% Similarity=0.553 Sum_probs=12.9
Q ss_pred CceeeccchhhhccCc
Q psy3491 47 SALFMCESCHKTYKAK 62 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~ 62 (108)
.++..|..||..|.-.
T Consensus 110 g~~~RCpeCG~~fkL~ 125 (136)
T PF01215_consen 110 GKPQRCPECGQVFKLK 125 (136)
T ss_dssp TSEEEETTTEEEEEEE
T ss_pred CCccCCCCCCeEEEEE
Confidence 4589999999988654
No 165
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.04 E-value=29 Score=17.35 Aligned_cols=40 Identities=18% Similarity=0.496 Sum_probs=23.7
Q ss_pred CCCceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccC
Q psy3491 45 LQSALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSK 89 (108)
Q Consensus 45 ~~~~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~ 89 (108)
.+..++.|..-+..+.....+ +.+ ++.....|++|+..|.
T Consensus 20 ~~~~~l~C~g~~~p~~HPrV~---L~m--g~~gev~CPYC~t~y~ 59 (62)
T COG4391 20 IGDLPLMCPGPEPPNDHPRVF---LDM--GDEGEVVCPYCSTRYR 59 (62)
T ss_pred eCCeeEEcCCCCCCCCCCEEE---EEc--CCCCcEecCccccEEE
Confidence 466788887665555443222 122 2233388999998875
No 166
>PF02891 zf-MIZ: MIZ/SP-RING zinc finger; InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=24.66 E-value=58 Score=15.27 Aligned_cols=8 Identities=38% Similarity=1.223 Sum_probs=3.5
Q ss_pred ccCCCCcc
Q psy3491 79 LQCPFCPY 86 (108)
Q Consensus 79 ~~c~~C~~ 86 (108)
+.|++|++
T Consensus 42 W~CPiC~~ 49 (50)
T PF02891_consen 42 WKCPICNK 49 (50)
T ss_dssp -B-TTT--
T ss_pred eECcCCcC
Confidence 88888865
No 167
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=24.66 E-value=71 Score=15.19 Aligned_cols=33 Identities=27% Similarity=0.590 Sum_probs=19.3
Q ss_pred eeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccc
Q psy3491 50 FMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYK 87 (108)
Q Consensus 50 ~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~ 87 (108)
-.|..|+...+... + .....++.. ..|+.||+.
T Consensus 23 ~~C~gC~~~l~~~~-~---~~i~~~~~i-~~Cp~CgRi 55 (56)
T PF02591_consen 23 GTCSGCHMELPPQE-L---NEIRKGDEI-VFCPNCGRI 55 (56)
T ss_pred CccCCCCEEcCHHH-H---HHHHcCCCe-EECcCCCcc
Confidence 36777775554432 2 222333455 889999875
No 168
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=24.61 E-value=66 Score=20.16 Aligned_cols=10 Identities=20% Similarity=0.551 Sum_probs=5.1
Q ss_pred ccCCCCcccc
Q psy3491 79 LQCPFCPYKS 88 (108)
Q Consensus 79 ~~c~~C~~~f 88 (108)
+.|..||..+
T Consensus 31 vrC~eCG~V~ 40 (201)
T COG1326 31 VRCEECGTVH 40 (201)
T ss_pred EEccCCCcEe
Confidence 5555555443
No 169
>PRK04351 hypothetical protein; Provisional
Probab=23.84 E-value=44 Score=19.83 Aligned_cols=32 Identities=31% Similarity=0.618 Sum_probs=20.9
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccC
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSK 89 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~ 89 (108)
.|.|..||..+... +.+ +... |.|..|+..+.
T Consensus 112 ~Y~C~~Cg~~~~r~-------Rr~-n~~~-yrCg~C~g~L~ 143 (149)
T PRK04351 112 LYECQSCGQQYLRK-------RRI-NTKR-YRCGKCRGKLK 143 (149)
T ss_pred EEECCCCCCEeeee-------eec-CCCc-EEeCCCCcEee
Confidence 58899898666432 222 2355 99999986654
No 170
>KOG1280|consensus
Probab=23.13 E-value=90 Score=21.51 Aligned_cols=38 Identities=18% Similarity=0.431 Sum_probs=25.3
Q ss_pred ceeeccchhhhccCchHHHHHHhHHhcCCC-CccCCCCc
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFECGKEA-GLQCPFCP 85 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~-~~~c~~C~ 85 (108)
..|.|+.|+..=.+...|..|....+.+-. ...|++|+
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 368888888877777788888665444322 25566665
No 171
>COG2879 Uncharacterized small protein [Function unknown]
Probab=23.12 E-value=1e+02 Score=15.56 Aligned_cols=18 Identities=22% Similarity=0.272 Sum_probs=12.4
Q ss_pred CchhHHHHHHHhcCCCCC
Q psy3491 90 HKSNLKTHMAIRHQNGLH 107 (108)
Q Consensus 90 ~~~~l~~H~~~h~~~k~~ 107 (108)
.-++.-.|++.+|..+|.
T Consensus 24 dYdnYVehmr~~hPd~p~ 41 (65)
T COG2879 24 DYDNYVEHMRKKHPDKPP 41 (65)
T ss_pred cHHHHHHHHHHhCcCCCc
Confidence 345667788888877763
No 172
>PF03194 LUC7: LUC7 N_terminus; InterPro: IPR004882 This family consists of several LUC7 protein homologues that are restricted to eukaryotes. LUC7 has been shown to be a U1 snRNA associated protein [] with a role in splice site recognition []. The entry contains human and mouse LUC7 like (LUC7L) proteins [] and human cisplatin resistance-associated overexpressed protein (CROP) [].
Probab=23.06 E-value=35 Score=22.15 Aligned_cols=7 Identities=43% Similarity=1.042 Sum_probs=3.3
Q ss_pred eeccchh
Q psy3491 50 FMCESCH 56 (108)
Q Consensus 50 ~~C~~c~ 56 (108)
-.|.+||
T Consensus 191 ~VCeVCG 197 (254)
T PF03194_consen 191 EVCEVCG 197 (254)
T ss_pred cchhhhh
Confidence 3444554
No 173
>PF12773 DZR: Double zinc ribbon
Probab=22.94 E-value=60 Score=14.83 Aligned_cols=9 Identities=33% Similarity=0.774 Sum_probs=5.2
Q ss_pred ccCCCCccc
Q psy3491 79 LQCPFCPYK 87 (108)
Q Consensus 79 ~~c~~C~~~ 87 (108)
..|+.|+..
T Consensus 30 ~~C~~Cg~~ 38 (50)
T PF12773_consen 30 KICPNCGAE 38 (50)
T ss_pred CCCcCCcCC
Confidence 556666554
No 174
>PTZ00448 hypothetical protein; Provisional
Probab=22.81 E-value=66 Score=22.28 Aligned_cols=22 Identities=23% Similarity=0.422 Sum_probs=17.3
Q ss_pred ccCCCCccccCCchhHHHHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTHMAI 100 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H~~~ 100 (108)
|.|..|+-.|......+.|.++
T Consensus 315 ~tC~~C~v~F~~~~~qR~H~KS 336 (373)
T PTZ00448 315 LLCRKCNIQLMDHNAFKQHYRS 336 (373)
T ss_pred ccccccccccCCHHHHHHHhhh
Confidence 7899999999877777777653
No 175
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=22.65 E-value=17 Score=15.87 Aligned_cols=10 Identities=20% Similarity=0.637 Sum_probs=4.1
Q ss_pred ccCCCCcccc
Q psy3491 79 LQCPFCPYKS 88 (108)
Q Consensus 79 ~~c~~C~~~f 88 (108)
..|..||..+
T Consensus 22 isC~~CGPr~ 31 (35)
T PF07503_consen 22 ISCTNCGPRY 31 (35)
T ss_dssp --BTTCC-SC
T ss_pred ccCCCCCCCE
Confidence 5566666543
No 176
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=22.32 E-value=22 Score=20.77 Aligned_cols=31 Identities=23% Similarity=0.627 Sum_probs=18.8
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcccc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKS 88 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f 88 (108)
.|.|..|+..+... .+. .... |.|..|+..|
T Consensus 123 ~~~C~~C~~~~~r~------~~~--~~~~-~~C~~C~~~l 153 (157)
T PF10263_consen 123 VYRCPSCGREYKRH------RRS--KRKR-YRCGRCGGPL 153 (157)
T ss_pred EEEcCCCCCEeeee------ccc--chhh-EECCCCCCEE
Confidence 57888888665332 222 2234 7888887554
No 177
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=22.29 E-value=39 Score=19.29 Aligned_cols=31 Identities=23% Similarity=0.429 Sum_probs=17.8
Q ss_pred eeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCcc
Q psy3491 49 LFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPY 86 (108)
Q Consensus 49 ~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~ 86 (108)
...| .||..|.....-..+ ..+.+.|+.||.
T Consensus 70 ~~~C-~Cg~~~~~~~~~~~~------~~~~~~CP~Cgs 100 (124)
T PRK00762 70 EIEC-ECGYEGVVDEDEIDH------YAAVIECPVCGN 100 (124)
T ss_pred eEEe-eCcCcccccccchhc------cccCCcCcCCCC
Confidence 5889 999777654210011 011267999984
No 178
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=22.23 E-value=12 Score=17.96 Aligned_cols=23 Identities=17% Similarity=0.456 Sum_probs=11.6
Q ss_pred CceeeccchhhhccCchHHHHHH
Q psy3491 47 SALFMCESCHKTYKAKGSLERHK 69 (108)
Q Consensus 47 ~~~~~C~~c~~~f~~~~~l~~h~ 69 (108)
...|.|+.|+..|=..=++-.|.
T Consensus 19 ~~~y~C~~C~~~FC~dCD~fiHE 41 (51)
T PF07975_consen 19 SSRYRCPKCKNHFCIDCDVFIHE 41 (51)
T ss_dssp -EEE--TTTT--B-HHHHHTTTT
T ss_pred CCeEECCCCCCccccCcChhhhc
Confidence 34699999998887766654443
No 179
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=21.40 E-value=42 Score=14.61 Aligned_cols=14 Identities=21% Similarity=0.527 Sum_probs=7.1
Q ss_pred eeccchhhhccCch
Q psy3491 50 FMCESCHKTYKAKG 63 (108)
Q Consensus 50 ~~C~~c~~~f~~~~ 63 (108)
-.|..|++.|..+.
T Consensus 4 ~~C~eC~~~f~dSy 17 (34)
T PF01286_consen 4 PKCDECGKPFMDSY 17 (34)
T ss_dssp EE-TTT--EES-SS
T ss_pred chHhHhCCHHHHHH
Confidence 46888888887654
No 180
>KOG1994|consensus
Probab=21.39 E-value=61 Score=20.88 Aligned_cols=23 Identities=35% Similarity=0.586 Sum_probs=18.7
Q ss_pred cCCCCccCCCCccccCCchhHHHH
Q psy3491 74 GKEAGLQCPFCPYKSKHKSNLKTH 97 (108)
Q Consensus 74 ~~~~~~~c~~C~~~f~~~~~l~~H 97 (108)
.... |-|-.||-.|.....|..|
T Consensus 236 R~eh-~YC~fCG~~y~~~edl~eh 258 (268)
T KOG1994|consen 236 RSEH-YYCFFCGIKYKDEEDLYEH 258 (268)
T ss_pred hccc-eEEEEeccccCCHHHHHHh
Confidence 3455 8999999999998888776
No 181
>COG0675 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=21.18 E-value=45 Score=21.84 Aligned_cols=15 Identities=20% Similarity=0.740 Sum_probs=9.7
Q ss_pred CCCccCCCCccccCCc
Q psy3491 76 EAGLQCPFCPYKSKHK 91 (108)
Q Consensus 76 ~~~~~c~~C~~~f~~~ 91 (108)
+. +.|+.||..+...
T Consensus 321 r~-~~C~~cg~~~~rD 335 (364)
T COG0675 321 RL-FKCPRCGFVHDRD 335 (364)
T ss_pred ee-EECCCCCCeehhh
Confidence 44 7888888765443
No 182
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=20.88 E-value=29 Score=26.23 Aligned_cols=6 Identities=17% Similarity=0.263 Sum_probs=3.1
Q ss_pred cCCCCC
Q psy3491 13 VDPVCP 18 (108)
Q Consensus 13 ~~~~c~ 18 (108)
.|..|.
T Consensus 437 ~C~~Cg 442 (730)
T COG1198 437 LCRDCG 442 (730)
T ss_pred ecccCC
Confidence 455554
No 183
>TIGR03831 YgiT_finger YgiT-type zinc finger domain. This domain model describes a small domain with two copies of a putative zinc-binding motif CXXC (usually CXXCG). Most member proteins consist largely of this domain or else carry an additional C-terminal helix-turn-helix domain, resembling that of the phage protein Cro and modeled by pfam01381.
Probab=20.36 E-value=38 Score=15.00 Aligned_cols=13 Identities=23% Similarity=0.774 Sum_probs=9.7
Q ss_pred eeeccchhhhccC
Q psy3491 49 LFMCESCHKTYKA 61 (108)
Q Consensus 49 ~~~C~~c~~~f~~ 61 (108)
-+.|+.||..+..
T Consensus 32 ~~~C~~CGE~~~~ 44 (46)
T TIGR03831 32 ALVCPQCGEEYLD 44 (46)
T ss_pred ccccccCCCEeeC
Confidence 3689999987643
No 184
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=20.32 E-value=35 Score=18.80 Aligned_cols=34 Identities=21% Similarity=0.438 Sum_probs=17.9
Q ss_pred ceeeccchhhhccCchHHHHHHhHHhcCCCCccCCCCccccC
Q psy3491 48 ALFMCESCHKTYKAKGSLERHKKFECGKEAGLQCPFCPYKSK 89 (108)
Q Consensus 48 ~~~~C~~c~~~f~~~~~l~~h~~~~~~~~~~~~c~~C~~~f~ 89 (108)
.-|.|+.|+.. ...-.+. .+ +....|..||..+.
T Consensus 20 t~f~CP~Cge~-~v~v~~~------k~-~~h~~C~~CG~y~~ 53 (99)
T PRK14892 20 KIFECPRCGKV-SISVKIK------KN-IAIITCGNCGLYTE 53 (99)
T ss_pred cEeECCCCCCe-EeeeecC------CC-cceEECCCCCCccC
Confidence 46888888832 1111111 12 22377888876543
No 185
>PF14447 Prok-RING_4: Prokaryotic RING finger family 4
Probab=20.30 E-value=60 Score=15.86 Aligned_cols=11 Identities=36% Similarity=0.932 Sum_probs=5.9
Q ss_pred cCCCCccccCC
Q psy3491 80 QCPFCPYKSKH 90 (108)
Q Consensus 80 ~c~~C~~~f~~ 90 (108)
.|+.|++.|..
T Consensus 41 gCPfC~~~~~~ 51 (55)
T PF14447_consen 41 GCPFCGTPFEF 51 (55)
T ss_pred CCCCCCCcccC
Confidence 45566655543
No 186
>PRK08222 hydrogenase 4 subunit H; Validated
Probab=20.08 E-value=77 Score=19.33 Aligned_cols=19 Identities=11% Similarity=0.252 Sum_probs=14.8
Q ss_pred ccCCCCccccCCchhHHHH
Q psy3491 79 LQCPFCPYKSKHKSNLKTH 97 (108)
Q Consensus 79 ~~c~~C~~~f~~~~~l~~H 97 (108)
..|..||+.|.....+..-
T Consensus 115 ~~C~~Cg~~f~~~k~i~~~ 133 (181)
T PRK08222 115 QRCSRCERPFAPQKTVALA 133 (181)
T ss_pred CcCcccCCccCcHhHHHHH
Confidence 8899999999876655443
Done!