Query 000554
Match_columns 1428
No_of_seqs 804 out of 4735
Neff 5.7
Searched_HMMs 46136
Date Mon Apr 1 19:04:17 2013
Command hhsearch -i /work/01045/syshi/lefta3m/000554.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/leftcdd/000554hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1082 Histone H3 (Lys9) meth 99.9 7.6E-26 1.7E-30 267.6 12.4 169 1224-1428 52-227 (364)
2 KOG1141 Predicted histone meth 99.9 3.7E-25 8E-30 263.8 2.2 239 1160-1421 569-841 (1262)
3 KOG2462 C2H2-type Zn-finger pr 99.9 2.7E-22 5.8E-27 220.7 4.5 133 882-1038 131-263 (279)
4 KOG2462 C2H2-type Zn-finger pr 99.8 1.3E-21 2.9E-26 215.1 4.2 137 915-1069 130-266 (279)
5 KOG1074 Transcriptional repres 99.8 7.4E-21 1.6E-25 230.5 6.1 173 882-1070 606-931 (958)
6 KOG1074 Transcriptional repres 99.8 3.4E-19 7.5E-24 216.2 12.5 88 846-942 604-694 (958)
7 KOG3608 Zn finger proteins [Ge 99.8 1.5E-19 3.3E-24 201.8 4.3 199 833-1059 193-399 (467)
8 PF05033 Pre-SET: Pre-SET moti 99.7 1.8E-18 3.9E-23 169.8 7.6 103 1236-1371 1-103 (103)
9 KOG3608 Zn finger proteins [Ge 99.7 4.9E-18 1.1E-22 189.9 0.5 188 851-1070 183-377 (467)
10 smart00468 PreSET N-terminal t 99.7 6.6E-17 1.4E-21 157.6 7.8 96 1234-1363 1-98 (98)
11 KOG3623 Homeobox transcription 99.6 1.4E-16 3E-21 190.4 3.3 79 983-1067 893-971 (1007)
12 KOG4442 Clathrin coat binding 99.5 4.2E-15 9.1E-20 179.6 5.1 73 1353-1425 92-166 (729)
13 KOG3576 Ovo and related transc 99.3 2E-13 4.4E-18 143.9 -0.4 118 915-1045 117-239 (267)
14 KOG3576 Ovo and related transc 99.2 1.7E-12 3.6E-17 137.1 0.4 88 843-942 113-200 (267)
15 KOG3623 Homeobox transcription 99.2 3.1E-12 6.6E-17 153.9 1.5 123 882-1042 211-333 (1007)
16 KOG1079 Transcriptional repres 99.1 1.7E-11 3.7E-16 147.9 3.9 99 1294-1428 534-650 (739)
17 KOG1141 Predicted histone meth 99.0 4.1E-10 9E-15 136.9 5.7 185 1216-1420 850-1054(1262)
18 PLN03086 PRLI-interacting fact 98.8 6.1E-09 1.3E-13 128.0 8.0 144 848-1041 408-564 (567)
19 PLN03086 PRLI-interacting fact 98.8 8.3E-09 1.8E-13 126.9 6.9 140 882-1064 408-559 (567)
20 PHA00733 hypothetical protein 98.6 3.8E-08 8.2E-13 101.0 3.5 86 914-1043 39-124 (128)
21 PF01352 KRAB: KRAB box; Inte 98.4 6.9E-08 1.5E-12 79.7 0.5 35 732-771 1-40 (41)
22 PHA00733 hypothetical protein 98.3 3E-07 6.6E-12 94.3 2.4 93 835-940 28-124 (128)
23 KOG3993 Transcription factor ( 98.2 2.5E-07 5.4E-12 107.5 0.4 39 834-873 282-320 (500)
24 PHA02768 hypothetical protein; 98.1 1.3E-06 2.8E-11 76.2 2.5 43 985-1035 6-48 (55)
25 KOG3993 Transcription factor ( 98.1 4E-07 8.7E-12 105.8 -1.2 181 847-1042 267-483 (500)
26 KOG1083 Putative transcription 98.1 4.1E-07 8.9E-12 114.5 -2.8 56 1368-1423 1166-1222(1306)
27 PHA02768 hypothetical protein; 97.9 1.9E-06 4.1E-11 75.2 0.2 45 1018-1064 5-49 (55)
28 PF13465 zf-H2C2_2: Zinc-finge 97.9 6.6E-06 1.4E-10 61.5 1.8 26 971-996 1-26 (26)
29 PHA00732 hypothetical protein 97.4 9.7E-05 2.1E-09 69.9 3.2 48 984-1043 1-49 (79)
30 PF13465 zf-H2C2_2: Zinc-finge 97.4 0.0001 2.2E-09 55.2 1.9 26 999-1030 1-26 (26)
31 PHA00616 hypothetical protein 97.2 5.9E-05 1.3E-09 63.1 -0.4 26 984-1010 1-26 (44)
32 smart00317 SET SET (Su(var)3-9 97.2 0.00044 9.4E-09 67.9 5.0 43 1380-1422 1-43 (116)
33 PHA00616 hypothetical protein 97.0 0.00027 6E-09 59.2 1.0 34 1018-1051 1-34 (44)
34 PHA00732 hypothetical protein 96.9 0.00047 1E-08 65.3 2.2 45 881-936 1-45 (79)
35 PF05605 zf-Di19: Drought indu 96.8 0.00087 1.9E-08 58.9 2.8 52 984-1042 2-53 (54)
36 KOG1085 Predicted methyltransf 96.7 0.0011 2.4E-08 74.6 3.7 53 1375-1427 252-304 (392)
37 COG5189 SFP1 Putative transcri 96.6 0.0013 2.8E-08 75.0 2.7 57 982-1038 347-418 (423)
38 KOG1080 Histone H3 (Lys4) meth 96.4 0.002 4.2E-08 85.0 3.0 45 1379-1423 866-910 (1005)
39 PF05605 zf-Di19: Drought indu 96.1 0.0021 4.6E-08 56.4 1.1 51 882-939 3-53 (54)
40 PF00096 zf-C2H2: Zinc finger, 95.6 0.0068 1.5E-07 43.6 1.6 23 985-1008 1-23 (23)
41 PF00096 zf-C2H2: Zinc finger, 95.6 0.0029 6.2E-08 45.6 -0.4 23 1019-1041 1-23 (23)
42 COG5189 SFP1 Putative transcri 95.3 0.0063 1.4E-07 69.5 1.1 71 844-935 346-418 (423)
43 PF12756 zf-C2H2_2: C2H2 type 95.3 0.0069 1.5E-07 58.3 1.1 73 849-938 1-73 (100)
44 PF12756 zf-C2H2_2: C2H2 type 95.2 0.011 2.3E-07 57.1 2.3 71 917-1005 1-71 (100)
45 COG5048 FOG: Zn-finger [Genera 94.8 0.028 6.2E-07 66.9 4.7 62 990-1055 394-455 (467)
46 cd01395 HMT_MBD Methyl-CpG bin 94.6 0.0072 1.6E-07 54.3 -0.8 37 1184-1220 1-49 (60)
47 KOG2231 Predicted E3 ubiquitin 94.4 0.031 6.8E-07 70.9 3.9 140 880-1050 114-275 (669)
48 PF13912 zf-C2H2_6: C2H2-type 94.2 0.016 3.5E-07 43.4 0.5 24 984-1008 1-24 (27)
49 PF13912 zf-C2H2_6: C2H2-type 94.0 0.033 7.2E-07 41.7 1.9 26 1018-1043 1-26 (27)
50 PF13894 zf-C2H2_4: C2H2-type 94.0 0.024 5.2E-07 40.5 1.0 22 883-904 2-23 (24)
51 KOG2231 Predicted E3 ubiquitin 93.2 0.088 1.9E-06 66.9 4.7 74 917-1003 117-201 (669)
52 PF13894 zf-C2H2_4: C2H2-type 93.1 0.07 1.5E-06 38.0 2.2 18 985-1002 1-18 (24)
53 COG5048 FOG: Zn-finger [Genera 92.7 0.1 2.2E-06 62.3 4.1 168 846-1036 288-463 (467)
54 KOG1146 Homeobox protein [Gene 92.7 0.069 1.5E-06 71.2 2.8 157 884-1067 439-639 (1406)
55 COG2940 Proteins containing SE 92.6 0.045 9.8E-07 68.4 1.0 72 1354-1425 307-378 (480)
56 KOG1146 Homeobox protein [Gene 92.4 0.04 8.7E-07 73.3 0.2 84 849-937 438-540 (1406)
57 PRK04860 hypothetical protein; 92.1 0.088 1.9E-06 56.5 2.3 38 984-1031 119-156 (160)
58 PF09237 GAGA: GAGA factor; I 91.9 0.053 1.1E-06 46.9 0.3 30 1016-1045 22-51 (54)
59 smart00355 ZnF_C2H2 zinc finge 91.4 0.072 1.6E-06 38.4 0.6 24 985-1009 1-24 (26)
60 smart00355 ZnF_C2H2 zinc finge 91.1 0.14 3.1E-06 36.8 1.9 24 1019-1042 1-24 (26)
61 smart00570 AWS associated with 90.9 0.093 2E-06 45.9 0.8 25 1353-1377 26-50 (51)
62 cd05162 PWWP The PWWP domain, 90.3 0.26 5.7E-06 47.3 3.4 60 157-220 6-66 (87)
63 PRK04860 hypothetical protein; 89.4 0.14 3E-06 55.1 0.8 39 1017-1059 118-156 (160)
64 cd05840 SPBC215_ISWI_like The 88.9 0.31 6.6E-06 47.9 2.7 59 157-216 6-65 (93)
65 PF09237 GAGA: GAGA factor; I 86.5 0.38 8.2E-06 41.9 1.5 29 880-908 23-51 (54)
66 COG5236 Uncharacterized conser 85.3 0.43 9.4E-06 55.6 1.8 103 915-1039 151-272 (493)
67 PF12874 zf-met: Zinc-finger o 84.7 0.26 5.7E-06 36.1 -0.2 21 1019-1039 1-21 (25)
68 PF11722 zf-TRM13_CCCH: CCCH z 84.0 0.35 7.5E-06 38.1 0.2 29 533-561 2-30 (31)
69 PF13909 zf-H2C2_5: C2H2-type 81.8 0.6 1.3E-05 34.1 0.7 23 882-905 1-23 (24)
70 PF12874 zf-met: Zinc-finger o 81.4 0.63 1.4E-05 34.1 0.7 21 883-903 2-22 (25)
71 cd07765 KRAB_A-box KRAB (Krupp 81.2 0.88 1.9E-05 32.7 1.5 28 732-764 1-28 (40)
72 PF12171 zf-C2H2_jaz: Zinc-fin 80.8 0.92 2E-05 34.2 1.4 22 1019-1040 2-23 (27)
73 PF13909 zf-H2C2_5: C2H2-type 77.1 1.8 3.9E-05 31.5 2.0 17 985-1002 1-17 (24)
74 PF12171 zf-C2H2_jaz: Zinc-fin 76.5 1.3 2.9E-05 33.3 1.2 21 882-902 2-22 (27)
75 COG5236 Uncharacterized conser 75.6 1.8 3.9E-05 50.7 2.4 135 848-1010 152-307 (493)
76 KOG2893 Zn finger protein [Gen 74.3 1.4 2.9E-05 49.4 1.0 46 987-1042 13-59 (341)
77 KOG4173 Alpha-SNAP protein [In 74.1 0.64 1.4E-05 51.0 -1.5 91 880-1010 78-172 (253)
78 KOG2482 Predicted C2H2-type Zn 72.8 3.7 8.1E-05 48.3 4.0 76 895-978 129-217 (423)
79 KOG2482 Predicted C2H2-type Zn 70.2 2.7 5.9E-05 49.4 2.2 78 916-1008 280-357 (423)
80 cd05837 MSH6_like The PWWP dom 67.5 5.7 0.00012 40.2 3.7 63 157-219 8-71 (110)
81 KOG2893 Zn finger protein [Gen 62.2 3.1 6.7E-05 46.7 0.6 47 884-940 13-59 (341)
82 KOG2785 C2H2-type Zn-finger pr 61.2 9.5 0.0002 46.0 4.4 55 983-1038 165-240 (390)
83 smart00391 MBD Methyl-CpG bind 56.4 4.8 0.0001 38.3 0.8 36 1184-1219 3-52 (77)
84 PF13913 zf-C2HC_2: zinc-finge 55.2 8.3 0.00018 28.9 1.7 18 985-1003 3-20 (25)
85 smart00451 ZnF_U1 U1-like zinc 54.9 4.3 9.3E-05 32.0 0.2 21 1018-1038 3-23 (35)
86 PF13913 zf-C2HC_2: zinc-finge 53.8 7 0.00015 29.3 1.1 19 883-902 4-22 (25)
87 KOG4173 Alpha-SNAP protein [In 51.6 5.5 0.00012 44.0 0.4 93 952-1048 78-177 (253)
88 smart00451 ZnF_U1 U1-like zinc 50.4 8.6 0.00019 30.3 1.2 22 915-936 3-24 (35)
89 COG4049 Uncharacterized protei 47.4 8.3 0.00018 34.4 0.7 32 978-1009 11-42 (65)
90 PF09986 DUF2225: Uncharacteri 47.2 6.1 0.00013 44.6 -0.1 20 983-1002 4-23 (214)
91 smart00293 PWWP domain with co 42.5 27 0.00059 31.7 3.3 56 157-215 6-62 (63)
92 cd00350 rubredoxin_like Rubred 41.4 18 0.0004 28.8 1.8 11 985-995 2-12 (33)
93 PF00855 PWWP: PWWP domain; I 41.1 26 0.00057 33.1 3.2 56 157-219 6-62 (86)
94 COG1997 RPL43A Ribosomal prote 39.3 13 0.00028 36.2 0.8 34 983-1032 34-67 (89)
95 PF06524 NOA36: NOA36 protein; 38.9 32 0.0007 39.6 3.8 27 1016-1042 207-233 (314)
96 TIGR02098 MJ0042_CXXC MJ0042 f 38.5 16 0.00035 29.6 1.1 34 985-1029 3-36 (38)
97 cd05838 WHSC1_related The PWWP 38.0 25 0.00055 34.8 2.6 54 158-214 7-61 (95)
98 TIGR00622 ssl1 transcription f 37.8 39 0.00085 34.6 3.9 48 883-939 57-104 (112)
99 TIGR00373 conserved hypothetic 37.5 24 0.00052 38.1 2.6 40 974-1028 99-138 (158)
100 PF14353 CpXC: CpXC protein 37.1 22 0.00049 36.6 2.2 50 986-1042 3-62 (128)
101 KOG3813 Uncharacterized conser 37.0 16 0.00035 45.4 1.2 19 1299-1318 307-325 (640)
102 PF09538 FYDLN_acid: Protein o 37.0 19 0.00041 36.6 1.6 30 985-1031 10-39 (108)
103 smart00531 TFIIE Transcription 35.9 29 0.00062 36.9 2.8 39 980-1028 95-133 (147)
104 cd01397 HAT_MBD Methyl-CpG bin 35.2 13 0.00029 35.2 0.1 25 1194-1218 23-48 (73)
105 smart00834 CxxC_CXXC_SSSS Puta 35.2 14 0.0003 30.3 0.2 12 985-996 6-17 (41)
106 PRK00464 nrdR transcriptional 33.6 18 0.00039 39.0 0.8 19 1017-1035 27-45 (154)
107 COG1198 PriA Primosomal protei 33.3 27 0.00058 46.3 2.5 43 1111-1154 602-645 (730)
108 KOG2461 Transcription factor B 32.2 90 0.002 38.7 6.5 78 971-1054 318-395 (396)
109 PRK06266 transcription initiat 32.1 30 0.00065 38.1 2.3 35 980-1029 113-147 (178)
110 PF09723 Zn-ribbon_8: Zinc rib 31.5 16 0.00034 30.8 -0.0 12 985-996 6-17 (42)
111 PHA00626 hypothetical protein 31.4 19 0.00041 32.3 0.4 13 1018-1030 23-35 (59)
112 cd00122 MBD MeCP2, MBD1, MBD2, 31.2 15 0.00033 33.3 -0.1 27 1194-1220 23-50 (62)
113 PF13891 zf-C3Hc3H: Potential 31.0 15 0.00033 33.8 -0.2 23 587-609 3-25 (65)
114 PF12013 DUF3505: Protein of u 30.8 52 0.0011 33.1 3.5 27 1017-1043 79-109 (109)
115 TIGR02605 CxxC_CxxC_SSSS putat 30.5 19 0.0004 31.3 0.3 12 985-996 6-17 (52)
116 cd05839 BR140_related The PWWP 30.1 78 0.0017 32.5 4.6 61 157-217 6-80 (111)
117 PF09986 DUF2225: Uncharacteri 30.0 28 0.0006 39.4 1.6 42 1016-1057 3-59 (214)
118 cd00729 rubredoxin_SM Rubredox 29.6 35 0.00077 27.5 1.7 10 985-994 3-12 (34)
119 PF11722 zf-TRM13_CCCH: CCCH z 29.4 31 0.00066 27.5 1.3 21 589-609 11-31 (31)
120 COG4049 Uncharacterized protei 27.8 15 0.00033 32.8 -0.7 31 841-871 11-41 (65)
121 KOG2186 Cell growth-regulating 27.8 21 0.00046 40.9 0.2 48 848-904 4-51 (276)
122 COG2888 Predicted Zn-ribbon RN 26.9 46 0.001 30.3 2.1 32 984-1026 27-58 (61)
123 PF13717 zinc_ribbon_4: zinc-r 26.1 39 0.00084 27.7 1.4 33 985-1028 3-35 (36)
124 COG1996 RPC10 DNA-directed RNA 25.9 34 0.00075 30.1 1.1 29 983-1027 5-33 (49)
125 PF09723 Zn-ribbon_8: Zinc rib 25.2 38 0.00083 28.5 1.2 13 848-860 6-18 (42)
126 PF02892 zf-BED: BED zinc fing 25.1 54 0.0012 27.4 2.1 28 981-1008 13-44 (45)
127 TIGR02300 FYDLN_acid conserved 24.9 47 0.001 34.7 2.0 34 985-1035 10-43 (129)
128 KOG2186 Cell growth-regulating 24.0 39 0.00085 38.9 1.4 47 881-936 3-49 (276)
129 cd05834 HDGF_related The PWWP 23.4 1E+02 0.0022 29.9 3.9 52 157-218 8-60 (83)
130 PRK14890 putative Zn-ribbon RN 23.1 54 0.0012 29.9 1.8 32 983-1026 24-56 (59)
131 PF09845 DUF2072: Zn-ribbon co 22.8 45 0.00096 35.0 1.4 15 984-998 1-15 (131)
132 TIGR00622 ssl1 transcription f 22.8 86 0.0019 32.2 3.4 50 848-905 56-105 (112)
133 PF03604 DNA_RNApol_7kD: DNA d 22.7 44 0.00095 26.9 1.0 11 985-995 1-11 (32)
134 PF08879 WRC: WRC; InterPro: 22.6 30 0.00065 30.0 0.1 20 589-608 13-32 (46)
135 PF12013 DUF3505: Protein of u 22.2 1E+02 0.0022 31.0 3.8 24 985-1008 81-108 (109)
136 PF13719 zinc_ribbon_5: zinc-r 21.5 61 0.0013 26.6 1.7 32 986-1028 4-35 (37)
137 PRK00464 nrdR transcriptional 21.3 46 0.001 35.9 1.2 16 882-897 29-44 (154)
138 KOG2593 Transcription initiati 21.0 63 0.0014 39.9 2.4 42 977-1027 121-162 (436)
139 PF14353 CpXC: CpXC protein 20.1 38 0.00083 34.9 0.3 15 848-862 2-16 (128)
140 PRK00398 rpoP DNA-directed RNA 20.0 46 0.001 28.3 0.8 13 984-996 3-15 (46)
No 1
>KOG1082 consensus Histone H3 (Lys9) methyltransferase SUV39H1/Clr4, required for transcriptional silencing [Chromatin structure and dynamics; Transcription]
Probab=99.93 E-value=7.6e-26 Score=267.56 Aligned_cols=169 Identities=34% Similarity=0.582 Sum_probs=137.1
Q ss_pred CCCCCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCCcccCCCCCcc
Q 000554 1224 RKPLLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDLDAESLQLGCA 1303 (1428)
Q Consensus 1224 ~~~~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~~~~~~~~gC~ 1303 (1428)
.....+...+.+||+.|.|++||+.+|++|++ .| ..|+|++..++..+. ........+|.
T Consensus 52 ~~~~~~~~~~~~d~~~~~e~~~v~~~n~id~~------------------~~-~~f~y~~~~~~~~~~-~~~~~~~~~c~ 111 (364)
T KOG1082|consen 52 DKDKLEAKSELEDIALGSENLPVPLVNRIDED------------------AP-LYFQYIATEIVDPGE-LSDCENSTGCR 111 (364)
T ss_pred cccccccccccccccCccccCceeeeeeccCC------------------cc-ccceeccccccCccc-cccCccccCCC
Confidence 34456777889999999999999999999974 12 579999999888852 22334467999
Q ss_pred cCCCCcCCCC---CCccccccccccccccccCCCCCCCcccCCCCC--eeecCCccccccCcCCCCCCCCCCceeeccce
Q 000554 1304 CANSTCFPET---CDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGR--VILEEGYLIYECNHMCSCDRTCPNRVLQNGVR 1378 (1428)
Q Consensus 1304 C~~~~C~~~~---C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~--l~~~~~~~IyECn~~C~C~~~C~NRvvQ~G~~ 1378 (1428)
|.+ .|.... |.|.. .+.+.++|..+|. .....+.+||||+..|+|+.+|.|||+|+|++
T Consensus 112 C~~-~~~~~~~~~C~C~~---------------~n~~~~~~~~~~~~~~~~~~~~~i~EC~~~C~C~~~C~nRv~q~g~~ 175 (364)
T KOG1082|consen 112 CCS-SCSSVLPLTCLCER---------------HNGGLVAYTCDGDCGTLGKFKEPVFECSVACGCHPDCANRVVQKGLQ 175 (364)
T ss_pred ccC-CCCCCCCccccChH---------------hhCCccccccCCccccccccCccccccccCCCCCCcCcchhhccccc
Confidence 986 343332 77743 2345678877763 33456679999999999999999999999999
Q ss_pred eeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhh--ccCC
Q 000554 1379 VKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSR--LLFD 1428 (1428)
Q Consensus 1379 ~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~--YlFD 1428 (1428)
.+|+||||..+|||||++++||+|+|||||+|||++..|+++|... |+||
T Consensus 176 ~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~ 227 (364)
T KOG1082|consen 176 FHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDD 227 (364)
T ss_pred cceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhcccccccccc
Confidence 9999999999999999999999999999999999999999998543 6654
No 2
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=99.90 E-value=3.7e-25 Score=263.83 Aligned_cols=239 Identities=22% Similarity=0.268 Sum_probs=181.0
Q ss_pred ccceecccccCCCcccCCC--CCCCCCCC-CCcccC-----------CcccccccCCCCCCc-cccccceeee-ccC---
Q 000554 1160 VEWHREGFLCSNGCKIFKD--PHLPPHLE-PLPSVS-----------AGIRSSDSSDFVNNQ-WEVDECHCII-DSR--- 1220 (1428)
Q Consensus 1160 ~~wh~~~~~c~~g~~~~~~--~~~~~Pl~-p~~~~~-----------~~~k~v~~~~p~~~~-w~~~e~~~~l-~~~--- 1220 (1428)
+-.|.|...|-+.-....+ +.+-.||+ |..+.| ...-.|.|.+|||.. +.|.|+.+|| +.+
T Consensus 569 y~sh~cs~acl~~~~~~~~~~~~g~npl~lp~~~~F~r~~a~~rs~~~~~fhv~yktpcg~~lr~~~el~ryL~et~c~f 648 (1262)
T KOG1141|consen 569 YFSHKCSIACLNAAQIAIMVGQPGGNPLNLPYFLTFHRIRASHRSAYIRDFHVEYKTPCGMPLRMRIELYRYLVETRCKF 648 (1262)
T ss_pred ccchhhHHHHHhccchhhhccCCCCCccccceEEEeeehhhhhhhhhhhcceeeccCCCccchHHHHHHHHHHHHhcCcE
Confidence 3467788778666555543 56778998 988888 233368899999988 8888877655 321
Q ss_pred ----cc---------CCCCCCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCC
Q 000554 1221 ----HL---------GRKPLLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLL 1287 (1428)
Q Consensus 1221 ----~~---------~~~~~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~ 1287 (1428)
.| +..++.++++.|-||++|+|.+||.++|++|.. |++.|.|-.+.|.
T Consensus 649 lf~~~f~~~~yV~~~r~~~p~kp~~~~~Di~~g~e~vpis~~neids~-------------------~lpq~ay~K~~ip 709 (1262)
T KOG1141|consen 649 LFVIGFDRAFYVVRHRAPNPLKPGNRCTDIPCGREHVPISEKNEIDSH-------------------RLPQAAYKKHMIP 709 (1262)
T ss_pred EEEeecccchheeecccCCCcCCcceeccccCCccccccceeecccCc-------------------CCccchhheeecc
Confidence 11 334578999999999999999999999999852 3468999988887
Q ss_pred CCCCCC-cccCCCCCcccCCCCcCCCCCCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCCCCC
Q 000554 1288 DQSLDL-DAESLQLGCACANSTCFPETCDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCSCDR 1366 (1428)
Q Consensus 1288 ~~~~~~-~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~C~~ 1366 (1428)
+...-. -.+.|..+|+|..||-+...|+|.++....-... .........++.|. |++......+|||+.+|+|.+
T Consensus 710 ~~~nl~n~~~~fl~scdc~~gcid~~kcachQltvk~~~t~-p~~~v~~t~gykyK---Rl~e~~ptg~yEc~k~ckc~~ 785 (1262)
T KOG1141|consen 710 TNNNLSNRRKDFLQSCDCPTGCIDSMKCACHQLTVKKKTTG-PNQNVASTNGYKYK---RLIEIRPTGPYECLKACKCCG 785 (1262)
T ss_pred CCCcccccChhhhhcCCCCcchhhhhhhhHHHHHHHhhccC-CCcccccCcchhhH---HHHHhcCCCHHHHHHhhccCc
Confidence 765312 2366789999999877778999988743211100 01111122345553 444445678999999999986
Q ss_pred -CCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHH
Q 000554 1367 -TCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKR 1421 (1428)
Q Consensus 1367 -~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R 1421 (1428)
.|.||++|+|.+++|++|+|.++|||+|++++|.+|.|||.|.|-+++++-+++-
T Consensus 786 ~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks 841 (1262)
T KOG1141|consen 786 PDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKS 841 (1262)
T ss_pred HHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhh
Confidence 6999999999999999999999999999999999999999999999998887765
No 3
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.85 E-value=2.7e-22 Score=220.67 Aligned_cols=133 Identities=20% Similarity=0.302 Sum_probs=77.0
Q ss_pred ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCC
Q 000554 882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSP 961 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~ 961 (1428)
|+|..|||.+.+.++|.+|.+.|-.-.. .+.+.|..|+|.|.+...|..|+| +|+ -++.|.+|
T Consensus 131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s---~ka~~C~~C~K~YvSmpALkMHir-TH~-----------l~c~C~iC-- 193 (279)
T KOG2462|consen 131 YKCPECGKSYSTSSNLSRHKQTHRSLDS---KKAFSCKYCGKVYVSMPALKMHIR-THT-----------LPCECGIC-- 193 (279)
T ss_pred eeccccccccccccccchhhcccccccc---cccccCCCCCceeeehHHHhhHhh-ccC-----------CCcccccc--
Confidence 3344444444444444444444332211 124455555555555555555544 443 23444444
Q ss_pred CccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc
Q 000554 962 KKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus 962 k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~ 1038 (1428)
++.|...--|+-|+|+|||||||.|+.|+|+|..+++|+. |+++|.+ .|+|+|..|+|+|..++.|.+|.
T Consensus 194 -GKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRA-HmQTHS~-----~K~~qC~~C~KsFsl~SyLnKH~ 263 (279)
T KOG2462|consen 194 -GKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRA-HMQTHSD-----VKKHQCPRCGKSFALKSYLNKHS 263 (279)
T ss_pred -cccccchHHhhcccccccCCCCccCCcccchhcchHHHHH-HHHhhcC-----CccccCcchhhHHHHHHHHHHhh
Confidence 3333333355555666777777777778888888888877 6777777 67778888888887777777776
No 4
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.83 E-value=1.3e-21 Score=215.15 Aligned_cols=137 Identities=15% Similarity=0.095 Sum_probs=126.7
Q ss_pred ccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCccc
Q 000554 915 LQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKF 994 (1428)
Q Consensus 915 pfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsF 994 (1428)
.|+|..|||.+.+.++|.+|.+ +|-.- ..++.+.|.+| ++.+.+.-.|+.|+|+|+ -+++|.+|||.|
T Consensus 130 r~~c~eCgk~ysT~snLsrHkQ-~H~~~------~s~ka~~C~~C---~K~YvSmpALkMHirTH~--l~c~C~iCGKaF 197 (279)
T KOG2462|consen 130 RYKCPECGKSYSTSSNLSRHKQ-THRSL------DSKKAFSCKYC---GKVYVSMPALKMHIRTHT--LPCECGICGKAF 197 (279)
T ss_pred ceeccccccccccccccchhhc-ccccc------cccccccCCCC---CceeeehHHHhhHhhccC--CCcccccccccc
Confidence 6899999999999999999986 77432 22577999999 888888889999999998 789999999999
Q ss_pred CChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHHHhhcCCC
Q 000554 995 DLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQTLKPL 1069 (1428)
Q Consensus 995 s~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~H~ksh 1069 (1428)
.+.--|+- |.|+||| ||||.|+.|+|+|.++++|+.||++|.+.|+|+|..|+|+|..+..|.+|..+-
T Consensus 198 SRPWLLQG-HiRTHTG-----EKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~ 266 (279)
T KOG2462|consen 198 SRPWLLQG-HIRTHTG-----EKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA 266 (279)
T ss_pred cchHHhhc-ccccccC-----CCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence 99999999 8999999 999999999999999999999999999999999999999999999999998853
No 5
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.82 E-value=7.4e-21 Score=230.52 Aligned_cols=173 Identities=13% Similarity=0.122 Sum_probs=142.6
Q ss_pred ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccC---c
Q 000554 882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVG---E 958 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~---~ 958 (1428)
-+|-+|-+...-++.|+.|.++|+|++ ||+|.+||+.|.++.+|+.|+- +|... ..-+-++.|. +
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGER------PFkCKiCgRAFtTkGNLkaH~~-vHka~-----p~~R~q~ScP~~~i 673 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGER------PFKCKICGRAFTTKGNLKAHMS-VHKAK-----PPARVQFSCPSTFI 673 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcC------ccccccccchhccccchhhccc-ccccC-----ccccccccCCchhh
Confidence 489999999999999999999999997 9999999999999999999995 88643 1222467788 7
Q ss_pred CCCCccccCChhhhhhhhhhcCCc-c------------ceecCccCcccCChhhHHHHHHhhccC---------------
Q 000554 959 DSPKKLELGYSASVENHSENLGSI-R------------KFICRFCGLKFDLLPDLGRHHQAAHMG--------------- 1010 (1428)
Q Consensus 959 C~~k~~sf~sks~L~~H~rtHtGe-K------------pykC~~CGKsFs~~s~L~rHHqrvHtg--------------- 1010 (1428)
| ...|.+.-.|.+|+++|.+. . .-+|..|.+.|.....+.. ++.-|.+
T Consensus 674 c---~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~-~~se~~~~~s~~~~~~~~~t~t 749 (958)
T KOG1074|consen 674 C---QKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQ-QISEQPSPESEPDEQMDERTET 749 (958)
T ss_pred h---cccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchh-hhhccCCcccCCcccccccccc
Confidence 8 66777777899999999842 2 2469999999988877777 5555511
Q ss_pred --------------------------------------------------------CCCC--------------------
Q 000554 1011 --------------------------------------------------------PNLV-------------------- 1014 (1428)
Q Consensus 1011 --------------------------------------------------------e~~~-------------------- 1014 (1428)
++..
T Consensus 750 ~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl 829 (958)
T KOG1074|consen 750 EELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVL 829 (958)
T ss_pred cccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchh
Confidence 0000
Q ss_pred ----------------------------------------------CCCCcccCCCCcccCCchhhhcccccccCCCccc
Q 000554 1015 ----------------------------------------------NSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVS 1048 (1428)
Q Consensus 1015 ----------------------------------------------~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~ 1048 (1428)
......|.+|++.|...+.|..|+|+|+++|||.
T Consensus 830 ~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~ 909 (958)
T KOG1074|consen 830 WNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFF 909 (958)
T ss_pred hcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCcc
Confidence 0223789999999999999999999999999999
Q ss_pred cCCCCCcCcChHHHHhhcCCCC
Q 000554 1049 YRIRNRGAAGMKKRIQTLKPLA 1070 (1428)
Q Consensus 1049 C~~C~ksf~~~~~l~~H~ksh~ 1070 (1428)
|.+|+++|..+..|..|+.+|.
T Consensus 910 C~fC~~aFttrgnLKvHMgtH~ 931 (958)
T KOG1074|consen 910 CHFCEEAFTTRGNLKVHMGTHM 931 (958)
T ss_pred chhhhhhhhhhhhhhhhhcccc
Confidence 9999999999999999999886
No 6
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.79 E-value=3.4e-19 Score=216.24 Aligned_cols=88 Identities=27% Similarity=0.498 Sum_probs=79.4
Q ss_pred CcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccc---cCC
Q 000554 846 KTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCI---PCG 922 (1428)
Q Consensus 846 kpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~---~Cg 922 (1428)
.|-.|-+|-+....++.|+.| .++|++| +||+|.+||+.|.++.+|+.|+-.|....... ..|.|+ +|-
T Consensus 604 dPNqCiiC~rVlSC~saLqmH-yrtHtGE-----RPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R--~q~ScP~~~ic~ 675 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMH-YRTHTGE-----RPFKCKICGRAFTTKGNLKAHMSVHKAKPPAR--VQFSCPSTFICQ 675 (958)
T ss_pred Cccceeeeeecccchhhhhhh-hhcccCc-----CccccccccchhccccchhhcccccccCcccc--ccccCCchhhhc
Confidence 357899999999999999999 9999999 99999999999999999999999998765332 468999 999
Q ss_pred CCCCChhhhhhhhhhccccc
Q 000554 923 SHFGNTEELWLHVQSVHAID 942 (1428)
Q Consensus 923 KsF~sks~L~~H~rsvHsgE 942 (1428)
+.|.+.-.|.+|++ +|.+.
T Consensus 676 ~kftn~V~lpQhIr-iH~~~ 694 (958)
T KOG1074|consen 676 KKFTNAVTLPQHIR-IHLGG 694 (958)
T ss_pred ccccccccccceEE-eecCC
Confidence 99999999999997 89843
No 7
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.77 E-value=1.5e-19 Score=201.77 Aligned_cols=199 Identities=17% Similarity=0.254 Sum_probs=172.1
Q ss_pred chhhhhhcccCCCCcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhccccccccc
Q 000554 833 VLPLAIAGRSEDEKTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQ 912 (1428)
Q Consensus 833 ~~L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~ 912 (1428)
..|.+|.+.|+++|...|+.||..|.++..|-.|+++ .+.-. ..+|.|..|.|.|.+...|..|+..|-.
T Consensus 193 ~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rR-qt~l~---~n~fqC~~C~KrFaTeklL~~Hv~rHvn------ 262 (467)
T KOG3608|consen 193 YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRR-QTELN---TNSFQCAQCFKRFATEKLLKSHVVRHVN------ 262 (467)
T ss_pred HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHh-hhhhc---CCchHHHHHHHHHhHHHHHHHHHHHhhh------
Confidence 3599999999999999999999999999999999543 33221 1689999999999999999999998875
Q ss_pred ccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCc--c
Q 000554 913 CMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRF--C 990 (1428)
Q Consensus 913 ~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~--C 990 (1428)
-|+|+.|..+....+.|..|++..|+.+ |||+|+.| ...+.+.+.|.+|..+|+ +-.|.|+. |
T Consensus 263 --~ykCplCdmtc~~~ssL~~H~r~rHs~d---------kpfKCd~C---d~~c~~esdL~kH~~~HS-~~~y~C~h~~C 327 (467)
T KOG3608|consen 263 --CYKCPLCDMTCSSASSLTTHIRYRHSKD---------KPFKCDEC---DTRCVRESDLAKHVQVHS-KTVYQCEHPDC 327 (467)
T ss_pred --cccccccccCCCChHHHHHHHHhhhccC---------CCccccch---hhhhccHHHHHHHHHhcc-ccceecCCCCC
Confidence 6899999999999999999999889877 99999999 777888889999999998 77899988 9
Q ss_pred CcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccc-ccc-----CCCccccCCCCCcCcCh
Q 000554 991 GLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPR-FKK-----GLGAVSYRIRNRGAAGM 1059 (1428)
Q Consensus 991 GKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r-~H~-----gekpy~C~~C~ksf~~~ 1059 (1428)
..+|.....|++|...+|.|.+ +-+|.|..|++.|++..+|..|++ .|. |-+.|.++.|..+|.++
T Consensus 328 ~~s~r~~~q~~~H~~evhEg~n---p~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~mRL 399 (467)
T KOG3608|consen 328 HYSVRTYTQMRRHFLEVHEGNN---PILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFMRL 399 (467)
T ss_pred cHHHHHHHHHHHHHHHhccCCC---CCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCceeee
Confidence 9999999999998888887854 458999999999999999999984 443 44566677777777543
No 8
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulates chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines and in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception, the HMTases belong to the SET family, which can be classified according to the sequences surrounding the SET domain. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone target and the roles of the invariant residues in the SET domain in determining the methylation specificities. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethionine binding and histone tail interactions.
The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.75 E-value=1.8e-18 Score=169.76 Aligned_cols=103 Identities=31% Similarity=0.621 Sum_probs=71.1
Q ss_pred cCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCCcccCCCCCcccCCCCcCCCCCC
Q 000554 1236 DISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDLDAESLQLGCACANSTCFPETCD 1315 (1428)
Q Consensus 1236 DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~~~~~~~~gC~C~~~~C~~~~C~ 1315 (1428)
|||.|+|++||+++|++|++ .||+.|+||+++++..++......+..||+|.++|-.+.+|.
T Consensus 1 Dis~g~e~~pI~~~N~vd~~------------------~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~C~ 62 (103)
T PF05033_consen 1 DISRGKENVPIPVVNDVDDE------------------PPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSNCE 62 (103)
T ss_dssp -TTCTSSSS-EEEEESSSS--------------------SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTTSH
T ss_pred CCCCCccCCCEEEEeCCCCC------------------CCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCCCc
Confidence 89999999999999999975 345799999999999987634466678999986433778999
Q ss_pred ccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCCCCCCCCCc
Q 000554 1316 HVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCSCDRTCPNR 1371 (1428)
Q Consensus 1316 C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~C~~~C~NR 1371 (1428)
|+.++ ++.++|+.+|+|.+....+|||||+.|.|+.+|+||
T Consensus 63 C~~~~---------------~~~~~Y~~~g~l~~~~~~~i~EC~~~C~C~~~C~NR 103 (103)
T PF05033_consen 63 CLQRN---------------GGIFAYDSNGRLRIPDKPPIFECNDNCGCSPSCRNR 103 (103)
T ss_dssp HHCCT---------------SSS-SB-TTSSBSSSSTSEEE---TTSSS-TTSTT-
T ss_pred Ccccc---------------CccccccCCCcCccCCCCeEEeCCCCCCCCCCCCCC
Confidence 97642 235799999998877889999999999999999998
No 9
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.68 E-value=4.9e-18 Score=189.87 Aligned_cols=188 Identities=19% Similarity=0.208 Sum_probs=165.6
Q ss_pred CCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhh
Q 000554 851 KICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEE 930 (1428)
Q Consensus 851 ~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~ 930 (1428)
..|.+.|.++..|+.| .+.|+++ |...|+.||.-|.++..|..|++..+.-.. .+|.|..|.|.|.+...
T Consensus 183 ~~Ct~~~~~k~~LreH-~r~Hs~e-----KvvACp~Cg~~F~~~tkl~DH~rRqt~l~~----n~fqC~~C~KrFaTekl 252 (467)
T KOG3608|consen 183 AMCTKHMGNKYRLREH-IRTHSNE-----KVVACPHCGELFRTKTKLFDHLRRQTELNT----NSFQCAQCFKRFATEKL 252 (467)
T ss_pred hhhhhhhccHHHHHHH-HHhcCCC-----eEEecchHHHHhccccHHHHHHHhhhhhcC----CchHHHHHHHHHhHHHH
Confidence 4699999999999999 8999999 999999999999999999999998765432 38999999999999999
Q ss_pred hhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhh-cCCccceecCccCcccCChhhHHHHHHhhcc
Q 000554 931 LWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSEN-LGSIRKFICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus 931 L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rt-HtGeKpykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
|..|++ .|.. -|+|+.| ..+.+..++|..|++. |...|||+|+.|++.|.+.++|.+ |..+|+
T Consensus 253 L~~Hv~-rHvn-----------~ykCplC---dmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~k-H~~~HS 316 (467)
T KOG3608|consen 253 LKSHVV-RHVN-----------CYKCPLC---DMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAK-HVQVHS 316 (467)
T ss_pred HHHHHH-Hhhh-----------ccccccc---ccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHH-HHHhcc
Confidence 999997 6753 5889998 8888888999999985 889999999999999999999999 666998
Q ss_pred CCCCCCCCCcccCC--CCcccCCchhhhcccc-cccCC--CccccCCCCCcCcChHHHHhhc-CCCC
Q 000554 1010 GPNLVNSRPHKKGI--RFYAYKLKSGRLSRPR-FKKGL--GAVSYRIRNRGAAGMKKRIQTL-KPLA 1070 (1428)
Q Consensus 1010 ge~~~~eKpykC~~--CgKsFs~ks~L~~H~r-~H~ge--kpy~C~~C~ksf~~~~~l~~H~-ksh~ 1070 (1428)
. -.|+|.. |.++|+....|++|++ +|.|. -+|.|..|.+.|.+-..|..|. |.|+
T Consensus 317 ~------~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 317 K------TVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred c------cceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhc
Confidence 6 4799999 9999999999999995 55454 6899999999999888887664 3354
No 10
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.67 E-value=6.6e-17 Score=157.60 Aligned_cols=96 Identities=34% Similarity=0.652 Sum_probs=79.5
Q ss_pred eecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCC-cccCCCCCcccCCCCcCCC
Q 000554 1234 CDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDL-DAESLQLGCACANSTCFPE 1312 (1428)
Q Consensus 1234 ~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~-~~~~~~~gC~C~~~~C~~~ 1312 (1428)
+.|||+|+|++||++||++|++ .||++|+||++++++.++.+ ....+..||+|.+ .|.+.
T Consensus 1 ~~Dis~G~E~~pI~~vN~vD~~------------------~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~-~C~~~ 61 (98)
T smart00468 1 CLDISNGKENVPVPLVNEVDED------------------PPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSG-DCSSS 61 (98)
T ss_pred CccccCCccCCCcceEecCCCC------------------CCCCCcEECcceEcCCCcccccCCCCCCCCcCCC-CCCCC
Confidence 3699999999999999999985 23479999999999998753 4467788999998 57776
Q ss_pred C-CCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCC
Q 000554 1313 T-CDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCS 1363 (1428)
Q Consensus 1313 ~-C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~ 1363 (1428)
. |.|+.+ .++.|+|+..+++++..+.+|||||+.|+
T Consensus 62 ~~C~C~~~---------------~~~~~~Y~~~~~~~~~~~~~IyECn~~C~ 98 (98)
T smart00468 62 NKCECARK---------------NGGEFAYELNGGLRLKRKPLIYECNSRCS 98 (98)
T ss_pred CcCCcHhh---------------cCCccCcccCCCEEeCCCCEEEcCCCCCC
Confidence 6 999754 24679997777788889999999999985
No 11
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.62 E-value=1.4e-16 Score=190.41 Aligned_cols=79 Identities=22% Similarity=0.255 Sum_probs=72.2
Q ss_pred cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHH
Q 000554 983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKR 1062 (1428)
Q Consensus 983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l 1062 (1428)
.+|.|+.|+|.|...+.|.| |+--|+| .|||+|.+|.|+|..+.+|..|+|.|.|+|||.|+.|+|.|+....-
T Consensus 893 gmyaCDqCDK~FqKqSSLaR-HKYEHsG-----qRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSY 966 (1007)
T KOG3623|consen 893 GMYACDQCDKAFQKQSSLAR-HKYEHSG-----QRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSY 966 (1007)
T ss_pred ccchHHHHHHHHHhhHHHHH-hhhhhcC-----CCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccch
Confidence 57999999999999999999 8999999 99999999999999999999999999999999999999999755444
Q ss_pred HhhcC
Q 000554 1063 IQTLK 1067 (1428)
Q Consensus 1063 ~~H~k 1067 (1428)
.+|+.
T Consensus 967 SQHMN 971 (1007)
T KOG3623|consen 967 SQHMN 971 (1007)
T ss_pred Hhhhc
Confidence 44444
No 12
>KOG4442 consensus Clathrin coat binding protein/Huntingtin interacting protein HIP1, involved in regulation of endocytosis [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.53 E-value=4.2e-15 Score=179.58 Aligned_cols=73 Identities=42% Similarity=0.715 Sum_probs=69.6
Q ss_pred ccccccCc-CCC-CCCCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhc
Q 000554 1353 YLIYECNH-MCS-CDRTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRL 1425 (1428)
Q Consensus 1353 ~~IyECn~-~C~-C~~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~Y 1425 (1428)
....||++ .|. |+..|.|+.+|+....+++||+|++|||||||..+||+|+||.||+||||+.+|+++|...|
T Consensus 92 ~t~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y 166 (729)
T KOG4442|consen 92 MTSIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRY 166 (729)
T ss_pred hhhcccCCccCCCccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHH
Confidence 35679988 999 99999999999999999999999999999999999999999999999999999999998865
No 13
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.32 E-value=2e-13 Score=143.90 Aligned_cols=118 Identities=14% Similarity=0.167 Sum_probs=95.4
Q ss_pred ccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCccc
Q 000554 915 LQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKF 994 (1428)
Q Consensus 915 pfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsF 994 (1428)
.|.|.+|+|.|.-...|.+|++ .|..- +.+.|..| ++.|.....|++|+|+|+|.+||+|..|+|+|
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~k-ch~~v---------kr~lct~c---gkgfndtfdlkrh~rthtgvrpykc~~c~kaf 183 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLK-CHSDV---------KRHLCTFC---GKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAF 183 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhh-hccHH---------HHHHHhhc---cCcccchhhhhhhhccccCccccchhhhhHHH
Confidence 5778888888888888888876 67654 67778877 55555556899999999999999999999999
Q ss_pred CChhhHHHHHHhhccCCCCC-----CCCCcccCCCCcccCCchhhhcccccccCCC
Q 000554 995 DLLPDLGRHHQAAHMGPNLV-----NSRPHKKGIRFYAYKLKSGRLSRPRFKKGLG 1045 (1428)
Q Consensus 995 s~~s~L~rHHqrvHtge~~~-----~eKpykC~~CgKsFs~ks~L~~H~r~H~gek 1045 (1428)
.++-.|..|.+++|.-...+ ..|.|.|..||++-.....+..|++.|+...
T Consensus 184 tqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 184 TQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 99999999888999764322 2567999999999999999999998887443
No 14
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.22 E-value=1.7e-12 Score=137.11 Aligned_cols=88 Identities=23% Similarity=0.500 Sum_probs=81.7
Q ss_pred CCCCcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCC
Q 000554 843 EDEKTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCG 922 (1428)
Q Consensus 843 ~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~Cg 922 (1428)
.+...|.|.+|+|.|....-|.+| ++.|... |.|-|..|||.|...-.|++|+++|+|.+ ||+|..|+
T Consensus 113 sd~d~ftCrvCgK~F~lQRmlnrh-~kch~~v-----kr~lct~cgkgfndtfdlkrh~rthtgvr------pykc~~c~ 180 (267)
T KOG3576|consen 113 SDQDSFTCRVCGKKFGLQRMLNRH-LKCHSDV-----KRHLCTFCGKGFNDTFDLKRHTRTHTGVR------PYKCSLCE 180 (267)
T ss_pred CCCCeeeeehhhhhhhHHHHHHHH-hhhccHH-----HHHHHhhccCcccchhhhhhhhccccCcc------ccchhhhh
Confidence 445679999999999999999999 8999988 89999999999999999999999999997 99999999
Q ss_pred CCCCChhhhhhhhhhccccc
Q 000554 923 SHFGNTEELWLHVQSVHAID 942 (1428)
Q Consensus 923 KsF~sks~L~~H~rsvHsgE 942 (1428)
|.|.+...|..|.+.+|...
T Consensus 181 kaftqrcsleshl~kvhgv~ 200 (267)
T KOG3576|consen 181 KAFTQRCSLESHLKKVHGVQ 200 (267)
T ss_pred HHHHhhccHHHHHHHHcCch
Confidence 99999999999999888643
No 15
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.21 E-value=3.1e-12 Score=153.88 Aligned_cols=123 Identities=23% Similarity=0.344 Sum_probs=100.5
Q ss_pred ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCC
Q 000554 882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSP 961 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~ 961 (1428)
..|++|.+.+.+...|+.|++..|.... ..|.|..|..+|..+..|.+|+. .|..-
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne----~nfsC~lCsytFAyRtQLErhm~-~hkpg------------------- 266 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNE----PNFSCMLCSYTFAYRTQLERHMQ-LHKPG------------------- 266 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCC----CCCcchhhhhhhhhHHHHHHHHH-hhcCC-------------------
Confidence 5799999999999999999998776542 36899999999999999999996 67421
Q ss_pred CccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccc
Q 000554 962 KKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFK 1041 (1428)
Q Consensus 962 k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H 1041 (1428)
+-. .+|+-.-.+.|.|+|.+|||+|..+.+|+. |.|+|.| +|||.|+.|+|.|+....+..||...
T Consensus 267 -~dq-------a~sltqsa~lRKFKCtECgKAFKfKHHLKE-HlRIHSG-----EKPfeCpnCkKRFSHSGSySSHmSSK 332 (1007)
T KOG3623|consen 267 -GDQ-------AISLTQSALLRKFKCTECGKAFKFKHHLKE-HLRIHSG-----EKPFECPNCKKRFSHSGSYSSHMSSK 332 (1007)
T ss_pred -Ccc-------cccccchhhhccccccccchhhhhHHHHHh-hheeecC-----CCCcCCcccccccccCCccccccccc
Confidence 000 011222234588999999999999999999 8999999 89999999999999999999999665
Q ss_pred c
Q 000554 1042 K 1042 (1428)
Q Consensus 1042 ~ 1042 (1428)
+
T Consensus 333 K 333 (1007)
T KOG3623|consen 333 K 333 (1007)
T ss_pred c
Confidence 5
No 16
>KOG1079 consensus Transcriptional repressor EZH1 [Transcription]
Probab=99.15 E-value=1.7e-11 Score=147.89 Aligned_cols=99 Identities=25% Similarity=0.625 Sum_probs=83.3
Q ss_pred cccCCCCCcccCCCCcCCCCCCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccC-cCCCC-C------
Q 000554 1294 DAESLQLGCACANSTCFPETCDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECN-HMCSC-D------ 1365 (1428)
Q Consensus 1294 ~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn-~~C~C-~------ 1365 (1428)
+-.+.+.||.| .+.|....|+|.. ...||. +.|.+ +
T Consensus 534 dC~nrF~GC~C-k~QC~tkqCpC~~-----------------------------------A~rECdPd~Cl~cg~~~~~d 577 (739)
T KOG1079|consen 534 DCRNRFPGCRC-KAQCNTKQCPCYL-----------------------------------AVRECDPDVCLMCGNVDHFD 577 (739)
T ss_pred HHHhcCCCCCc-ccccccCcCchhh-----------------------------------hccccCchHHhccCcccccc
Confidence 33556789999 4588888899842 245785 57754 2
Q ss_pred ---CCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhh-------ccCC
Q 000554 1366 ---RTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSR-------LLFD 1428 (1428)
Q Consensus 1366 ---~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~-------YlFD 1428 (1428)
-+|+|.-+|+|.+.++.|-.+.-.|||++..+++.+++||.||+||+|+++||++|+.. ||||
T Consensus 578 ~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGkiYDr~~cSflFn 650 (739)
T KOG1079|consen 578 SSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGKIYDRYMCSFLFN 650 (739)
T ss_pred cCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhhcccccccccceeeee
Confidence 27999999999999999999999999999999999999999999999999999999973 7775
No 17
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=98.97 E-value=4.1e-10 Score=136.87 Aligned_cols=185 Identities=23% Similarity=0.399 Sum_probs=121.2
Q ss_pred eeccCccCCCC---------CCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccC
Q 000554 1216 IIDSRHLGRKP---------LLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPL 1286 (1428)
Q Consensus 1216 ~l~~~~~~~~~---------~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i 1286 (1428)
+++.++|.|.. ....++-.+|.+.|.+.+|||.||.+|..+.+.-+ ++. -.|.|..+..
T Consensus 850 ~~~id~~~f~~~~dt~~~~tvD~~g~d~~d~~~g~sg~~~p~~~~~d~~~~~~c~----d~~--------~~~~~~~~~~ 917 (1262)
T KOG1141|consen 850 LLTIDCFSFDARIDTATYITVDDKGLDVADFSLGTSGIPIPLVNSVDNDEPPSCE----DSK--------RRFQYNDQVD 917 (1262)
T ss_pred hhcccccchhccccccceeeccccccchhhhhccccCCCCccccccccCCCcccc----ccc--------eeecccccch
Confidence 44456665543 23455667899999999999999999875433211 111 1233433221
Q ss_pred CCCCCCCcccCCCCCcccCCCCcCCCCCCccccccccccccc---cccCCCCCCCcccCCCCCeeecCCccccccCcCCC
Q 000554 1287 LDQSLDLDAESLQLGCACANSTCFPETCDHVYLFDNDYEDAK---DIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCS 1363 (1428)
Q Consensus 1287 ~~~~~~~~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~---~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~ 1363 (1428)
+ ......+..||.|.+++-+-+.|.|.++......... ...|...--.-+|+.+..+ ....|||++.|.
T Consensus 918 ~----s~~~~~~~~~~s~d~hp~d~~~~~~~~~~~~~~~~cpp~~s~d~~~~~~eS~~~~ns~~----~~~f~e~~~hss 989 (1262)
T KOG1141|consen 918 I----SSVSRDFCSGCSCDGHPSDASKCECQQLSIEAMKRCPPNLSFDGHDELYESSEKQNSFL----KLFFFECNDHSS 989 (1262)
T ss_pred h----hhhccccccccccCCCCcccCcccCCCCChhhhcCCCCccccCchhhhhhhhhhcchhh----hccceeccccch
Confidence 1 1123567789999876555677888765332221110 0111111111122222211 235789999999
Q ss_pred CCCCCCCceeeccceee--------EEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHH
Q 000554 1364 CDRTCPNRVLQNGVRVK--------LEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNK 1420 (1428)
Q Consensus 1364 C~~~C~NRvvQ~G~~~~--------LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~ 1420 (1428)
|...|.||++|++++++ |+||+|..-|||+|...+||.-+|||+|+|...++.-|++
T Consensus 990 ~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~~~~~~~~~edtD~~~~~~~~~~~~~ppt~~l~~~ 1054 (1262)
T KOG1141|consen 990 CHRKEYNRVVQNNIKYPMEVSSFNDLQIFKTAQSGWGVREDTDIPQSTFICTYVGAPPTDDLADE 1054 (1262)
T ss_pred hcccccchhhhcCCccceeeeecccccccccccccccccccccCCCCcccccccCCCCchhhHHH
Confidence 99999999999998876 5578888999999999999999999999999999988775
No 18
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.81 E-value=6.1e-09 Score=128.04 Aligned_cols=144 Identities=19% Similarity=0.307 Sum_probs=85.2
Q ss_pred ccCCCCCcccccccccccccccccchhhhcccCcccccc--cccccCChhhhhhhhhhcccccccccccccccccCCCCC
Q 000554 848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAI--CLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHF 925 (1428)
Q Consensus 848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~--CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF 925 (1428)
-.|+.|.+..... .|..| ...+ .- ..-.|+. |+..|. +..+..| +.|+.|++.|
T Consensus 408 V~C~NC~~~i~l~-~l~lH-e~~C-~r-----~~V~Cp~~~Cg~v~~-r~el~~H---------------~~C~~Cgk~f 463 (567)
T PLN03086 408 VECRNCKHYIPSR-SIALH-EAYC-SR-----HNVVCPHDGCGIVLR-VEEAKNH---------------VHCEKCGQAF 463 (567)
T ss_pred EECCCCCCccchh-HHHHH-HhhC-CC-----cceeCCcccccceee-ccccccC---------------ccCCCCCCcc
Confidence 3566666655433 24455 2221 11 2334663 777662 3333333 3477777777
Q ss_pred CChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccC----------
Q 000554 926 GNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFD---------- 995 (1428)
Q Consensus 926 ~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs---------- 995 (1428)
. ...|..|++. |. +++.|. | +..+ .+..|..|+++|.+++++.|++|++.|.
T Consensus 464 ~-~s~LekH~~~-~H-----------kpv~Cp-C---g~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d 525 (567)
T PLN03086 464 Q-QGEMEKHMKV-FH-----------EPLQCP-C---GVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRD 525 (567)
T ss_pred c-hHHHHHHHHh-cC-----------CCccCC-C---CCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhh
Confidence 4 4667777663 32 367776 6 3322 3457777777788888888888888774
Q ss_pred ChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-ccc
Q 000554 996 LLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFK 1041 (1428)
Q Consensus 996 ~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H 1041 (1428)
..+.|.. |..++ | .+++.|..||+.|..+ .|..|+ ..|
T Consensus 526 ~~s~Lt~-HE~~C-G-----~rt~~C~~Cgk~Vrlr-dm~~H~~~~h 564 (567)
T PLN03086 526 RLRGMSE-HESIC-G-----SRTAPCDSCGRSVMLK-EMDIHQIAVH 564 (567)
T ss_pred hhhhHHH-HHHhc-C-----CcceEccccCCeeeeh-hHHHHHHHhh
Confidence 2356777 56664 5 6788888888777654 355565 344
No 19
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.76 E-value=8.3e-09 Score=126.87 Aligned_cols=140 Identities=16% Similarity=0.159 Sum_probs=105.7
Q ss_pred ccccccccccCChhhhhhhhhhccccccccccccccccc--CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcC
Q 000554 882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIP--CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGED 959 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~--CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C 959 (1428)
-.|..|..... ...|..|....... .-.|+. ||..|. +..+..| +.|..|
T Consensus 408 V~C~NC~~~i~-l~~l~lHe~~C~r~-------~V~Cp~~~Cg~v~~-r~el~~H-------------------~~C~~C 459 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHEAYCSRH-------NVVCPHDGCGIVLR-VEEAKNH-------------------VHCEKC 459 (567)
T ss_pred EECCCCCCccc-hhHHHHHHhhCCCc-------ceeCCcccccceee-ccccccC-------------------ccCCCC
Confidence 45999987654 45566887543332 356885 999883 3333333 468888
Q ss_pred CCCccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCC---------
Q 000554 960 SPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKL--------- 1030 (1428)
Q Consensus 960 ~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~--------- 1030 (1428)
+..|. ...|..|+++|+ ++|.|+ ||+.| .+..|.. |+++|.. .+++.|++|++.|..
T Consensus 460 ---gk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~-H~~thCp-----~Kpi~C~fC~~~v~~g~~~~d~~d 525 (567)
T PLN03086 460 ---GQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQ-HQASTCP-----LRLITCRFCGDMVQAGGSAMDVRD 525 (567)
T ss_pred ---CCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHh-hhhccCC-----CCceeCCCCCCccccCccccchhh
Confidence 55554 468999999986 899999 99765 6689999 7899999 899999999999952
Q ss_pred -chhhhcccccccCCCccccCCCCCcCcChHHHHh
Q 000554 1031 -KSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQ 1064 (1428)
Q Consensus 1031 -ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~ 1064 (1428)
.+.|..|...+ |.+++.|..|++.+..+..-.|
T Consensus 526 ~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlrdm~~H 559 (567)
T PLN03086 526 RLRGMSEHESIC-GSRTAPCDSCGRSVMLKEMDIH 559 (567)
T ss_pred hhhhHHHHHHhc-CCcceEccccCCeeeehhHHHH
Confidence 35899999886 9999999999999875544433
No 20
>PHA00733 hypothetical protein
Probab=98.55 E-value=3.8e-08 Score=100.96 Aligned_cols=86 Identities=10% Similarity=0.031 Sum_probs=65.4
Q ss_pred cccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcc
Q 000554 914 MLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLK 993 (1428)
Q Consensus 914 kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKs 993 (1428)
+++.|.+|.+.|.....|..|. .|.+|+..| +.+||.|+.||+.
T Consensus 39 ~~~~~~~~~~~~~~~~~l~~~~-----------------------------------~l~~~~~~~-~~kPy~C~~Cgk~ 82 (128)
T PHA00733 39 KRLIRAVVKTLIYNPQLLDESS-----------------------------------YLYKLLTSK-AVSPYVCPLCLMP 82 (128)
T ss_pred hhHHHHHHhhhccChhhhcchH-----------------------------------HHHhhcccC-CCCCccCCCCCCc
Confidence 3677777777777665555542 355565444 4789999999999
Q ss_pred cCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccC
Q 000554 994 FDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus 994 Fs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
|.+...|.+ |++.|+. +|.|+.|++.|.....|.+|+..+++
T Consensus 83 Fss~s~L~~-H~r~h~~-------~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 83 FSSSVSLKQ-HIRYTEH-------SKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCHHHHHH-HHhcCCc-------CccCCCCCCccCCHHHHHHHHHHhcC
Confidence 999999999 6777643 68999999999999999999866553
No 21
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs). It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=98.39 E-value=6.9e-08 Score=79.71 Aligned_cols=35 Identities=23% Similarity=0.256 Sum_probs=20.5
Q ss_pred eecceeeeecccccCChhhhcccchhhhhhhhc-----hhhHhhc
Q 000554 732 IISKEVFLELLKDCCSLEQKLHLHLACELFYKL-----LKSILSL 771 (1428)
Q Consensus 732 VTFkDVAV~F~r~c~SqEEW~~LdPaCrkLYrd-----y~nLvSH 771 (1428)
|||+||||+| |+|||.+|+|+|+.+|++ |++++++
T Consensus 1 Vtf~Dvav~f-----s~eEW~~L~~~Qk~ly~dvm~Eny~~l~sl 40 (41)
T PF01352_consen 1 VTFEDVAVYF-----SQEEWELLDPAQKNLYRDVMLENYRNLVSL 40 (41)
T ss_dssp ------TT--------HHHHHTS-HHHHHHHHHHHHHTTTS---S
T ss_pred CeEEEEEEEc-----ChhhcccccceecccchhHHHHhhcccEec
Confidence 7999999999 999999999999999998 6777665
No 22
>PHA00733 hypothetical protein
Probab=98.29 E-value=3e-07 Score=94.35 Aligned_cols=93 Identities=22% Similarity=0.352 Sum_probs=73.0
Q ss_pred hhhhhcccCCCCcccCCCCCcccccccccccc--c--ccccchhhhcccCcccccccccccCChhhhhhhhhhccccccc
Q 000554 835 PLAIAGRSEDEKTHKCKICSQVFLHDQELGVH--W--MDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFV 910 (1428)
Q Consensus 835 L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H--~--~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~ 910 (1428)
|..+......++++.|.+|.+.|.....|..| + ...+.+. +||.|..|++.|.....|..|++.| +.
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~-----kPy~C~~Cgk~Fss~s~L~~H~r~h--~~-- 98 (128)
T PHA00733 28 LKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAV-----SPYVCPLCLMPFSSSVSLKQHIRYT--EH-- 98 (128)
T ss_pred hhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCC-----CCccCCCCCCcCCCHHHHHHHHhcC--Cc--
Confidence 33333444557889999999999988777665 1 1122334 8999999999999999999999976 22
Q ss_pred ccccccccccCCCCCCChhhhhhhhhhccc
Q 000554 911 EQCMLQQCIPCGSHFGNTEELWLHVQSVHA 940 (1428)
Q Consensus 911 e~~kpfkC~~CgKsF~sks~L~~H~rsvHs 940 (1428)
+|.|..|++.|.....|..|+...|.
T Consensus 99 ----~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 99 ----SKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred ----CccCCCCCCccCCHHHHHHHHHHhcC
Confidence 78999999999999999999986553
No 23
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=98.23 E-value=2.5e-07 Score=107.53 Aligned_cols=39 Identities=21% Similarity=0.206 Sum_probs=29.3
Q ss_pred hhhhhhcccCCCCcccCCCCCcccccccccccccccccch
Q 000554 834 LPLAIAGRSEDEKTHKCKICSQVFLHDQELGVHWMDNHKK 873 (1428)
Q Consensus 834 ~L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~ 873 (1428)
.|.+|.-...----|+|++|+|.|+...+|..| ++.|..
T Consensus 282 ~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASH-RRWHKP 320 (500)
T KOG3993|consen 282 ALAQHRCPRIVHVEYRCPECDKVFSCPANLASH-RRWHKP 320 (500)
T ss_pred HHhhccCCeeEEeeecCCcccccccCchhhhhh-hcccCC
Confidence 466665333333349999999999999999999 899964
No 24
>PHA02768 hypothetical protein; Provisional
Probab=98.13 E-value=1.3e-06 Score=76.25 Aligned_cols=43 Identities=14% Similarity=0.084 Sum_probs=33.5
Q ss_pred eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhh
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~ 1035 (1428)
|+|+.||+.|.+.++|.. |+++|+. +|+|..|++.|.+++.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~-H~r~H~k-------~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMIT-HLRKHNT-------NLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHH-HHHhcCC-------cccCCcccceecccceeE
Confidence 778888888888888888 6777773 678888888888777664
No 25
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=98.11 E-value=4e-07 Score=105.84 Aligned_cols=181 Identities=12% Similarity=0.077 Sum_probs=107.0
Q ss_pred cccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhccccccc----------------
Q 000554 847 THKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFV---------------- 910 (1428)
Q Consensus 847 pykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~---------------- 910 (1428)
-|.|..|...|...-.|.+| +-..--. --|+|++|+|.|.-..+|..|.|.|......
T Consensus 267 dyiCqLCK~kYeD~F~LAQH-rC~RIV~-----vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra 340 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQH-RCPRIVH-----VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA 340 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhc-cCCeeEE-----eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh
Confidence 39999999999999999999 3211111 3499999999999999999999998643211
Q ss_pred -----------ccccccccccCCCCCCChhhhhhhhhhcccccccc-------hhhhhccccccCcCCCCccccCChhhh
Q 000554 911 -----------EQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKM-------SEVAQQHNQSVGEDSPKKLELGYSASV 972 (1428)
Q Consensus 911 -----------e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~-------~s~~~~kp~~C~~C~~k~~sf~sks~L 972 (1428)
..+..|.|.+|+|.|.....|+.|+.+.|...... .+....-.+-|..| .-.+.....-
T Consensus 341 e~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~---a~h~~a~~~~ 417 (500)
T KOG3993|consen 341 EVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAV---ATHSSASDSH 417 (500)
T ss_pred hhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccccc---cccccccccc
Confidence 11246999999999999999999987444333000 00000011223333 1111110000
Q ss_pred hhhhhhcCC-ccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-cccc
Q 000554 973 ENHSENLGS-IRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFKK 1042 (1428)
Q Consensus 973 ~~H~rtHtG-eKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H~ 1042 (1428)
-.|...+.+ .....|+.||-.+..+..-.. +.+.-.. +.-|.|.+|.-+|....+|.+|+ +-|-
T Consensus 418 g~~vl~~a~sael~~pp~~~~ppsss~~sgg-~~rlg~~-----~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 418 GDEVLYVAGSAELELPPYDGSPPSSSGSSGG-YGRLGIA-----EQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred ccceeeeeccccccCCCCCCCCcccCCCCCc-cccccch-----hhccccccchHhhhcCcchHhHhhhcCh
Confidence 011111111 122346777766665554444 2222111 45677888888888888888877 3343
No 26
>KOG1083 consensus Putative transcription factor ASH1/LIN-59 [Transcription]
Probab=98.05 E-value=4.1e-07 Score=114.53 Aligned_cols=56 Identities=43% Similarity=0.732 Sum_probs=51.0
Q ss_pred CCCceeec-cceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhh
Q 000554 1368 CPNRVLQN-GVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRS 1423 (1428)
Q Consensus 1368 C~NRvvQ~-G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~ 1423 (1428)
|.|+.+|+ +.-.+|+||++..+||||++..+|++|+|||||+||||+.++++.|+.
T Consensus 1166 c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mm 1222 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMM 1222 (1306)
T ss_pred hhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhccccc
Confidence 88888876 456889999999999999999999999999999999999999998843
No 27
>PHA02768 hypothetical protein; Provisional
Probab=97.95 E-value=1.9e-06 Score=75.21 Aligned_cols=45 Identities=11% Similarity=-0.033 Sum_probs=41.1
Q ss_pred CcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHHHh
Q 000554 1018 PHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQ 1064 (1428)
Q Consensus 1018 pykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~ 1064 (1428)
-|+|+.||+.|++.++|..|+++|+ ++|+|..|++.|.....++.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~~ 49 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYIE 49 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeEE
Confidence 5899999999999999999999999 79999999999997776653
No 28
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.86 E-value=6.6e-06 Score=61.51 Aligned_cols=26 Identities=19% Similarity=0.577 Sum_probs=19.2
Q ss_pred hhhhhhhhcCCccceecCccCcccCC
Q 000554 971 SVENHSENLGSIRKFICRFCGLKFDL 996 (1428)
Q Consensus 971 ~L~~H~rtHtGeKpykC~~CGKsFs~ 996 (1428)
+|.+|+++|+|+|||+|+.|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 36777777777777777777777753
No 29
>PHA00732 hypothetical protein
Probab=97.42 E-value=9.7e-05 Score=69.87 Aligned_cols=48 Identities=21% Similarity=0.214 Sum_probs=37.8
Q ss_pred ceecCccCcccCChhhHHHHHHh-hccCCCCCCCCCcccCCCCcccCCchhhhcccccccC
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQA-AHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqr-vHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
||.|+.||+.|.+.++|.+ |++ .|++ +.|+.|+++|. .|..|++.+..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~-H~r~~H~~--------~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQ-HARRNHTL--------TKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHH-HhhcccCC--------CccCCCCCEeC---ChhhhhcccCC
Confidence 5889999999999999999 555 4654 47999999997 58888866654
No 30
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.36 E-value=0.0001 Score=55.24 Aligned_cols=26 Identities=27% Similarity=0.349 Sum_probs=20.5
Q ss_pred hHHHHHHhhccCCCCCCCCCcccCCCCcccCC
Q 000554 999 DLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKL 1030 (1428)
Q Consensus 999 ~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ 1030 (1428)
+|.+ |+++|+| ++||+|++|+++|.+
T Consensus 1 ~l~~-H~~~H~~-----~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRR-HMRTHTG-----EKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHH-HHHHHSS-----SSSEEESSSSEEESS
T ss_pred CHHH-HhhhcCC-----CCCCCCCCCcCeeCc
Confidence 4777 6778888 788888888888863
No 31
>PHA00616 hypothetical protein
Probab=97.24 E-value=5.9e-05 Score=63.14 Aligned_cols=26 Identities=19% Similarity=0.280 Sum_probs=13.9
Q ss_pred ceecCccCcccCChhhHHHHHHhhccC
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqrvHtg 1010 (1428)
||+|+.||+.|.++++|.+ |.+.|+|
T Consensus 1 pYqC~~CG~~F~~~s~l~~-H~r~~hg 26 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIE-HLLSVHK 26 (44)
T ss_pred CCccchhhHHHhhHHHHHH-HHHHhcC
Confidence 3555555555555555555 4455555
No 32
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=97.19 E-value=0.00044 Score=67.87 Aligned_cols=43 Identities=49% Similarity=0.921 Sum_probs=39.5
Q ss_pred eEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHh
Q 000554 1380 KLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRR 1422 (1428)
Q Consensus 1380 ~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~ 1422 (1428)
+++++++..+|+||+|..+|++|++|++|.|+++...++..+.
T Consensus 1 ~~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~ 43 (116)
T smart00317 1 KLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERS 43 (116)
T ss_pred CcEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHH
Confidence 4688999999999999999999999999999999998888764
No 33
>PHA00616 hypothetical protein
Probab=96.96 E-value=0.00027 Score=59.22 Aligned_cols=34 Identities=3% Similarity=-0.224 Sum_probs=31.3
Q ss_pred CcccCCCCcccCCchhhhcccccccCCCccccCC
Q 000554 1018 PHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRI 1051 (1428)
Q Consensus 1018 pykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~ 1051 (1428)
||+|+.||+.|..++.|.+|++.|+|++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 6899999999999999999999999999998864
No 34
>PHA00732 hypothetical protein
Probab=96.91 E-value=0.00047 Score=65.30 Aligned_cols=45 Identities=24% Similarity=0.487 Sum_probs=35.6
Q ss_pred cccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhh
Q 000554 881 GYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQ 936 (1428)
Q Consensus 881 pykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~r 936 (1428)
||.|..|++.|.+...|..|++.+|. ++.|+.|++.|. .|..|++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--------~~~C~~CgKsF~---~l~~H~~ 45 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT--------LTKCPVCNKSYR---RLNQHFY 45 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC--------CCccCCCCCEeC---Chhhhhc
Confidence 57899999999999999999885432 346999999987 5788875
No 35
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought, whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.79 E-value=0.00087 Score=58.86 Aligned_cols=52 Identities=17% Similarity=0.198 Sum_probs=41.0
Q ss_pred ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccccccc
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
.|.|++|++. .....|..|....|..+ .+.+.|++|...+. .+|.+|+..++
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~----~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCEDEHRSE----SKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHhHCcCC----CCCccCCCchhhhh--hHHHHHHHHhc
Confidence 4899999995 45678999888888884 45799999998655 48899986654
No 36
>KOG1085 consensus Predicted methyltransferase (contains a SET domain) [General function prediction only]
Probab=96.74 E-value=0.0011 Score=74.59 Aligned_cols=53 Identities=30% Similarity=0.429 Sum_probs=45.4
Q ss_pred ccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhccC
Q 000554 1375 NGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRLLF 1427 (1428)
Q Consensus 1375 ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~YlF 1427 (1428)
.|....|.+..-.+||-||++..++.+|+||.||.|.||.-.||..|+..|--
T Consensus 252 ~g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~ 304 (392)
T KOG1085|consen 252 KGTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYAN 304 (392)
T ss_pred hccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhcc
Confidence 45556667766677999999999999999999999999999999999986543
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.57 E-value=0.0013 Score=74.95 Aligned_cols=57 Identities=18% Similarity=0.244 Sum_probs=44.1
Q ss_pred ccceecCc--cCcccCChhhHHHHHHhhccCCC-------------CCCCCCcccCCCCcccCCchhhhccc
Q 000554 982 IRKFICRF--CGLKFDLLPDLGRHHQAAHMGPN-------------LVNSRPHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus 982 eKpykC~~--CGKsFs~~s~L~rHHqrvHtge~-------------~~~eKpykC~~CgKsFs~ks~L~~H~ 1038 (1428)
+|||+|++ |.|++.....|+.|...-|...+ ..+.|||.|++|+|.|.....|+.|.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 58899965 88999999999997555553321 11368999999999999999999986
No 38
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=96.35 E-value=0.002 Score=84.96 Aligned_cols=45 Identities=24% Similarity=0.394 Sum_probs=39.7
Q ss_pred eeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhh
Q 000554 1379 VKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRS 1423 (1428)
Q Consensus 1379 ~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~ 1423 (1428)
..|..-++.-.||||+|.++|.+|+||.||+||+|...=|+.|+.
T Consensus 866 k~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~ 910 (1005)
T KOG1080|consen 866 KYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREA 910 (1005)
T ss_pred hhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHH
Confidence 336666777899999999999999999999999999888888876
No 39
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought, whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.12 E-value=0.0021 Score=56.43 Aligned_cols=51 Identities=24% Similarity=0.440 Sum_probs=28.2
Q ss_pred ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcc
Q 000554 882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVH 939 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvH 939 (1428)
|.|+.|++ ..+...|..|....|.... +.+.|++|...+. .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~----~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSES----KNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCC----CCccCCCchhhhh--hHHHHHHHHhc
Confidence 56666666 3344566666655554321 3566666666543 26666665444
No 40
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. 
The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=95.57 E-value=0.0068 Score=43.62 Aligned_cols=23 Identities=35% Similarity=0.782 Sum_probs=14.6
Q ss_pred eecCccCcccCChhhHHHHHHhhc
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
|+|+.|++.|.+...|.+ |++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~-H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKR-HMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHH-HHHHH
T ss_pred CCCCCCCCccCCHHHHHH-HHhHC
Confidence 567777777777777777 44434
No 41
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. 
The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=95.55 E-value=0.0029 Score=45.57 Aligned_cols=23 Identities=22% Similarity=-0.042 Sum_probs=21.2
Q ss_pred cccCCCCcccCCchhhhcccccc
Q 000554 1019 HKKGIRFYAYKLKSGRLSRPRFK 1041 (1428)
Q Consensus 1019 ykC~~CgKsFs~ks~L~~H~r~H 1041 (1428)
|+|+.|++.|..+..|.+|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999765
No 42
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.33 E-value=0.0063 Score=69.52 Aligned_cols=71 Identities=20% Similarity=0.327 Sum_probs=45.6
Q ss_pred CCCcccCCC--CCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccC
Q 000554 844 DEKTHKCKI--CSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPC 921 (1428)
Q Consensus 844 gekpykC~~--CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~C 921 (1428)
++|||+|++ |.|.++....|+-|+..-|... +...-+ .-..|.-. ..+.|||.|++|
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~-----~~~~~p----------~p~~~~~F------~~~~KPYrCevC 404 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQ-----KLHENP----------SPEKMNIF------SAKDKPYRCEVC 404 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCc-----ccCCCC----------Cccccccc------cccCCceecccc
Confidence 359999987 9999999999999954444332 111111 11111111 112358888888
Q ss_pred CCCCCChhhhhhhh
Q 000554 922 GSHFGNTEELWLHV 935 (1428)
Q Consensus 922 gKsF~sks~L~~H~ 935 (1428)
+|.+++...|+-|.
T Consensus 405 ~KRYKNlNGLKYHr 418 (423)
T COG5189 405 DKRYKNLNGLKYHR 418 (423)
T ss_pred chhhccCccceecc
Confidence 88888888888885
No 43
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.27 E-value=0.0069 Score=58.34 Aligned_cols=73 Identities=19% Similarity=0.303 Sum_probs=20.9
Q ss_pred cCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCCh
Q 000554 849 KCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNT 928 (1428)
Q Consensus 849 kC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sk 928 (1428)
+|..|+..|.+...|..|+...|.-. -+ ....+.....+..+.+..... .+.|..|++.|.+.
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~-----~~-----~~~~l~~~~~~~~~~~~~~~~-------~~~C~~C~~~f~s~ 63 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFD-----IP-----DQKYLVDPNRLLNYLRKKVKE-------SFRCPYCNKTFRSR 63 (100)
T ss_dssp ----------------------------------------------------------S-------SEEBSSSS-EESSH
T ss_pred Cccccccccccccccccccccccccc-----cc-----cccccccccccccccccccCC-------CCCCCccCCCCcCH
Confidence 58999999999999999976677543 11 222233444455554432222 58999999999999
Q ss_pred hhhhhhhhhc
Q 000554 929 EELWLHVQSV 938 (1428)
Q Consensus 929 s~L~~H~rsv 938 (1428)
..|..|++..
T Consensus 64 ~~l~~Hm~~~ 73 (100)
T PF12756_consen 64 EALQEHMRSK 73 (100)
T ss_dssp HHHHHHHHHT
T ss_pred HHHHHHHcCc
Confidence 9999999854
No 44
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.24 E-value=0.011 Score=57.06 Aligned_cols=71 Identities=23% Similarity=0.440 Sum_probs=17.1
Q ss_pred ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccCC
Q 000554 917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDL 996 (1428)
Q Consensus 917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~ 996 (1428)
+|..|+..|.+...|..|+...|.-.+ + . .........+..+.+.. -...+.|..|++.|..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---------~----~----~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s 62 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDI---------P----D----QKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRS 62 (100)
T ss_dssp ------------------------------------------------------------------SSEEBSSSS-EESS
T ss_pred Ccccccccccccccccccccccccccc---------c----c----ccccccccccccccccc-cCCCCCCCccCCCCcC
Confidence 488899999999999999887775330 0 0 00111111333333221 1126888888888888
Q ss_pred hhhHHHHHH
Q 000554 997 LPDLGRHHQ 1005 (1428)
Q Consensus 997 ~s~L~rHHq 1005 (1428)
...|..|..
T Consensus 63 ~~~l~~Hm~ 71 (100)
T PF12756_consen 63 REALQEHMR 71 (100)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHc
Confidence 888888443
No 45
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.78 E-value=0.028 Score=66.92 Aligned_cols=62 Identities=11% Similarity=0.084 Sum_probs=40.4
Q ss_pred cCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCc
Q 000554 990 CGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRG 1055 (1428)
Q Consensus 990 CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ks 1055 (1428)
|-..+.....+.. |...|.... ...+.+..|.+.|.....+..|++.|....+..|..+...
T Consensus 394 ~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (467)
T COG5048 394 CIRNFKRDSNLSL-HIITHLSFR---PYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKSF 455 (467)
T ss_pred hhhhhcccccccc-ccccccccC---CcCCCCCcchhhccCcccccccccccccCCceeecccccc
Confidence 5566666666666 555555511 2256677788888888888888888877776666555443
No 46
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=94.59 E-value=0.0072 Score=54.32 Aligned_cols=37 Identities=14% Similarity=0.039 Sum_probs=31.6
Q ss_pred CCC-CCcccC----------CcccccccCCCCCCc-cccccceeeeccC
Q 000554 1184 HLE-PLPSVS----------AGIRSSDSSDFVNNQ-WEVDECHCIIDSR 1220 (1428)
Q Consensus 1184 Pl~-p~~~~~----------~~~k~v~~~~p~~~~-w~~~e~~~~l~~~ 1220 (1428)
||+ |+.+|| +.++.|+|++|||.. ++|.|++.||...
T Consensus 1 PL~~Pll~gw~R~~~~~~~~~~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t 49 (60)
T cd01395 1 PLHTPLLCGFQRMKYRARVGKVKKHVIYKAPCGRSLRNMSEVHRYLRET 49 (60)
T ss_pred CcccccccCeEEEEEeccCCCcccceEEECCcchhhhcHHHHHHHHHhc
Confidence 677 889999 257789999999999 9999999988743
No 47
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=94.40 E-value=0.031 Score=70.86 Aligned_cols=140 Identities=20% Similarity=0.247 Sum_probs=70.0
Q ss_pred CcccccccccccCChhhhhhhhhhcccccccccccccccccCC---CCC------CChhhhhhhhhhcccccccchhhhh
Q 000554 880 RGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCG---SHF------GNTEELWLHVQSVHAIDFKMSEVAQ 950 (1428)
Q Consensus 880 KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~Cg---KsF------~sks~L~~H~rsvHsgEf~~~s~~~ 950 (1428)
..-.|..| -.|.....|+.|+...|. .+.|..|- +.| -+...|.+|++. ++.-..+..+
T Consensus 114 ~~~~~~~c-~~~~s~~~Lk~H~~~~H~--------~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~---gd~d~~s~rG 181 (669)
T KOG2231|consen 114 NKKECLHC-TEFKSVENLKNHMRDQHK--------LHLCSLCLQNLKIFINERKLYTRAELNLHLMF---GDPDDESCRG 181 (669)
T ss_pred ccCCCccc-cchhHHHHHHHHHHHhhh--------hhccccccccceeeeeeeehehHHHHHHHHhc---CCCccccccC
Confidence 33456666 666677777777765554 34455442 222 234556666541 1100000000
Q ss_pred ccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccC------cccCChhhHHHHHHhhccCCCCCCCCCcccC--
Q 000554 951 QHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCG------LKFDLLPDLGRHHQAAHMGPNLVNSRPHKKG-- 1022 (1428)
Q Consensus 951 ~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CG------KsFs~~s~L~rHHqrvHtge~~~~eKpykC~-- 1022 (1428)
.-.|..| ...|-....|.+|++.++ |.|.+|. .-|.....|..|-+.-| |.|.
T Consensus 182 --hp~C~~C---~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----------flCE~~ 242 (669)
T KOG2231|consen 182 --HPLCKFC---HERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----------FLCEEE 242 (669)
T ss_pred --Cccchhh---hhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----------cccccc
Confidence 1234444 445555556666666543 5666663 34666667777433333 2343
Q ss_pred CCC-----cccCCchhhhcccccccCCCccccC
Q 000554 1023 IRF-----YAYKLKSGRLSRPRFKKGLGAVSYR 1050 (1428)
Q Consensus 1023 ~Cg-----KsFs~ks~L~~H~r~H~gekpy~C~ 1050 (1428)
.|- -.|.....|+.|.+.+.-++.|.|.
T Consensus 243 ~C~~~~f~~~~~~ei~lk~~~~~~~~e~~~~~~ 275 (669)
T KOG2231|consen 243 FCRTKKFYVAFELEIELKAHNRFIQHEKCYICR 275 (669)
T ss_pred ccccceeeehhHHHHHHHhhccccchheeccCC
Confidence 232 2334455566666655566666664
No 48
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=94.24 E-value=0.016 Score=43.36 Aligned_cols=24 Identities=38% Similarity=0.750 Sum_probs=12.0
Q ss_pred ceecCccCcccCChhhHHHHHHhhc
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
||+|..|++.|.....|.. |++.|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~-H~~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALRE-HKRSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHH-HHCTT
T ss_pred CCCCCccCCccCChhHHHH-HhHHh
Confidence 3455555555555555555 34433
No 49
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=94.03 E-value=0.033 Score=41.65 Aligned_cols=26 Identities=12% Similarity=-0.074 Sum_probs=23.5
Q ss_pred CcccCCCCcccCCchhhhcccccccC
Q 000554 1018 PHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus 1018 pykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
||+|..|++.|.....|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68999999999999999999988864
No 50
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=93.96 E-value=0.024 Score=40.46 Aligned_cols=22 Identities=36% Similarity=0.738 Sum_probs=9.8
Q ss_pred cccccccccCChhhhhhhhhhc
Q 000554 883 ACAICLDSFTNKKVLESHVQER 904 (1428)
Q Consensus 883 kC~~CgKsF~~ks~L~~H~r~H 904 (1428)
.|++|++.|.+...|..|++.|
T Consensus 2 ~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 2 QCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp E-SSTS-EESSHHHHHHHHHHH
T ss_pred CCcCCCCcCCcHHHHHHHHHhh
Confidence 4445555555555555554444
No 51
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=93.17 E-value=0.088 Score=66.94 Aligned_cols=74 Identities=22% Similarity=0.280 Sum_probs=36.1
Q ss_pred ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccC------ChhhhhhhhhhcC-Ccc----ce
Q 000554 917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELG------YSASVENHSENLG-SIR----KF 985 (1428)
Q Consensus 917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~------sks~L~~H~rtHt-GeK----py 985 (1428)
.|.+| -.|.+...|+.|+...| +.+.|..|..-.+.|. ....|..|++.-. +++ .-
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H------------~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp 183 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQH------------KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHP 183 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhh------------hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCc
Confidence 36666 55666666666665555 2344544432222111 2345555554311 111 13
Q ss_pred ecCccCcccCChhhHHHH
Q 000554 986 ICRFCGLKFDLLPDLGRH 1003 (1428)
Q Consensus 986 kC~~CGKsFs~~s~L~rH 1003 (1428)
.|..|...|-....|.+|
T Consensus 184 ~C~~C~~~fld~~el~rH 201 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRH 201 (669)
T ss_pred cchhhhhhhccHHHHHHh
Confidence 466666666666666664
No 52
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=93.06 E-value=0.07 Score=38.02 Aligned_cols=18 Identities=33% Similarity=0.794 Sum_probs=10.3
Q ss_pred eecCccCcccCChhhHHH
Q 000554 985 FICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~r 1002 (1428)
|.|+.|++.|.+...|.+
T Consensus 1 ~~C~~C~~~~~~~~~l~~ 18 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQ 18 (24)
T ss_dssp EE-SSTS-EESSHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHH
Confidence 456666666666666666
No 53
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.70 E-value=0.1 Score=62.30 Aligned_cols=168 Identities=15% Similarity=0.204 Sum_probs=109.4
Q ss_pred CcccCCCCCccccccccccccccc--ccchhhhcccCccccc--ccccccCChhhhhhhhhhccccccccccccccccc-
Q 000554 846 KTHKCKICSQVFLHDQELGVHWMD--NHKKEAQWLFRGYACA--ICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIP- 920 (1428)
Q Consensus 846 kpykC~~CgK~F~s~s~L~~H~~r--~Ht~e~~~l~KpykC~--~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~- 920 (1428)
.++.|..|...|.....|..| .+ .|..+. .+++.|+ .|++.|.+...+..|...|.+.. ++.|..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~-~~~~~h~~~~---~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~ 357 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRH-LRSVNHSGES---LKPFSCPYSLCGKLFSRNDALKRHILLHTSIS------PAKEKLL 357 (467)
T ss_pred cCCCCccccCCcccccccccc-cccccccccc---CCceeeeccCCCccccccccccCCcccccCCC------ccccccc
Confidence 578999999999999999999 66 787762 2689999 79999999999999999999876 455543
Q ss_pred -CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCcc--ceecCccCcccCCh
Q 000554 921 -CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIR--KFICRFCGLKFDLL 997 (1428)
Q Consensus 921 -CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeK--pykC~~CGKsFs~~ 997 (1428)
|.+.+.....-..+.. .+... .....+.+.+..- .+...+.....+..|...|...+ .+.|..|.+.|...
T Consensus 358 ~~~~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (467)
T COG5048 358 NSSSKFSPLLNNEPPQS-LQQYK----DLKNDKKSETLSN-SCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRH 431 (467)
T ss_pred cCccccccccCCCCccc-hhhcc----CccCCcccccccc-chhhhhccccccccccccccccCCcCCCCCcchhhccCc
Confidence 5555544433221211 11000 0011123333222 22444455556777777777665 57778999999999
Q ss_pred hhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhc
Q 000554 998 PDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLS 1036 (1428)
Q Consensus 998 s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~ 1036 (1428)
..|.. |.+.|.. ..++.|..+ +.|.....+..
T Consensus 432 ~~~~~-~~~~~~~-----~~~~~~~~~-~~~~~~~~~~~ 463 (467)
T COG5048 432 YNLIP-HKKIHTN-----HAPLLCSIL-KSFRRDLDLSN 463 (467)
T ss_pred ccccc-ccccccc-----CCceeeccc-cccchhhhhhc
Confidence 99999 7888887 455555444 34444444433
No 54
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=92.67 E-value=0.069 Score=71.18 Aligned_cols=157 Identities=15% Similarity=0.110 Sum_probs=94.5
Q ss_pred ccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCc
Q 000554 884 CAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKK 963 (1428)
Q Consensus 884 C~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~ 963 (1428)
|..|+..+.++..+..|+..-+... +.|+|+.|+..|.....|..|+|..|..- .. ..| .
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~-----kt~~cpkc~~~yk~a~~L~vhmRskhp~~---------~~---~~c---~ 498 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFF-----KTLKCPKCNWHYKLAQTLGVHMRSKHPES---------QS---AYC---K 498 (1406)
T ss_pred ccchhhhhhhhcccccceeeeeccc-----ccccCCccchhhhhHHHhhhccccccccc---------ch---hHh---H
Confidence 4556666666677777666555443 46788888888888888888887666532 00 222 0
Q ss_pred cccCChhhhhhhhhhc------CCccceecCccCcccCChhhHHHHHHhh-ccCC-------------------------
Q 000554 964 LELGYSASVENHSENL------GSIRKFICRFCGLKFDLLPDLGRHHQAA-HMGP------------------------- 1011 (1428)
Q Consensus 964 ~sf~sks~L~~H~rtH------tGeKpykC~~CGKsFs~~s~L~rHHqrv-Htge------------------------- 1011 (1428)
....|.+.- .+.++|.|..|..+|....+|.+|.+.. |..+
T Consensus 499 -------~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~ 571 (1406)
T KOG1146|consen 499 -------AGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPE 571 (1406)
T ss_pred -------hccccccccccccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcc
Confidence 111222211 2347888888888888888888864432 2110
Q ss_pred ----------C-CCCCCCcccCCCCcccCCchhhhcccc-cccCCCccccCCCCCcCcChHHHHhhcC
Q 000554 1012 ----------N-LVNSRPHKKGIRFYAYKLKSGRLSRPR-FKKGLGAVSYRIRNRGAAGMKKRIQTLK 1067 (1428)
Q Consensus 1012 ----------~-~~~eKpykC~~CgKsFs~ks~L~~H~r-~H~gekpy~C~~C~ksf~~~~~l~~H~k 1067 (1428)
. +...-.+.|.+|++--.-..+|+.||. .|+-..|.-|-.|+-.+.....+..+.+
T Consensus 572 ~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~ 639 (1406)
T KOG1146|consen 572 EAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGR 639 (1406)
T ss_pred cccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCC
Confidence 0 111335899999999999999999993 4444444556555555444333333333
No 55
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=92.60 E-value=0.045 Score=68.44 Aligned_cols=72 Identities=25% Similarity=0.215 Sum_probs=59.8
Q ss_pred cccccCcCCCCCCCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhc
Q 000554 1354 LIYECNHMCSCDRTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRL 1425 (1428)
Q Consensus 1354 ~IyECn~~C~C~~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~Y 1425 (1428)
.+.+++..+.....+.|...+.....+..+..+..+||||+++..|++|+||.+|.|+++...++..|...|
T Consensus 307 ~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~ 378 (480)
T COG2940 307 SSDFSKSNVSKLKELLNSNGCKKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENY 378 (480)
T ss_pred ccccccccCccccchhhhcccccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccc
Confidence 344455555555567777777788888888999999999999999999999999999999999999887753
No 56
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=92.37 E-value=0.04 Score=73.29 Aligned_cols=84 Identities=15% Similarity=0.200 Sum_probs=65.4
Q ss_pred cCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccc-----------------
Q 000554 849 KCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVE----------------- 911 (1428)
Q Consensus 849 kC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e----------------- 911 (1428)
.|..|+..|.....+..|+...|... +.|+|+.|+..|+....|..|||..|.+-...
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~-----kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~ 512 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFF-----KTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVY 512 (1406)
T ss_pred cccchhhhhhhhcccccceeeeeccc-----ccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccc
Confidence 35667777777777888866667665 88999999999999999999999855432111
Q ss_pred --cccccccccCCCCCCChhhhhhhhhh
Q 000554 912 --QCMLQQCIPCGSHFGNTEELWLHVQS 937 (1428)
Q Consensus 912 --~~kpfkC~~CgKsF~sks~L~~H~rs 937 (1428)
.-++|.|..|...+..+.+|.+|++.
T Consensus 513 ~~~~~p~~C~~C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 513 RCPGKPYPCRACNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred cCCCCcccceeeeeeeecchHHHHHHHH
Confidence 11679999999999999999999874
No 57
>PRK04860 hypothetical protein; Provisional
Probab=92.06 E-value=0.088 Score=56.54 Aligned_cols=38 Identities=16% Similarity=0.168 Sum_probs=23.7
Q ss_pred ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCc
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLK 1031 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~k 1031 (1428)
+|.|. |++ ....+++ |.++|++ +++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rr-H~ri~~g-----~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRR-HNRVVRG-----EAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHH-HHHHhcC-----CccEECCCCCceeEEe
Confidence 56665 665 5555666 6666666 5666666666666543
No 58
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. 
The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; PDB: 1YUI_A 1YUJ_A.
Probab=91.88 E-value=0.053 Score=46.94 Aligned_cols=30 Identities=7% Similarity=-0.115 Sum_probs=11.9
Q ss_pred CCCcccCCCCcccCCchhhhcccccccCCC
Q 000554 1016 SRPHKKGIRFYAYKLKSGRLSRPRFKKGLG 1045 (1428)
Q Consensus 1016 eKpykC~~CgKsFs~ks~L~~H~r~H~gek 1045 (1428)
+.|..|++|+..+++..+|++|+.++++.|
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 444445555555555555555544444433
No 59
>smart00355 ZnF_C2H2 zinc finger.
Probab=91.42 E-value=0.072 Score=38.37 Aligned_cols=24 Identities=29% Similarity=0.544 Sum_probs=13.8
Q ss_pred eecCccCcccCChhhHHHHHHhhcc
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
|+|..|++.|.....|.. |++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~-H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKE-HMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHH-HHHHhc
Confidence 456666666666666666 444443
No 60
>smart00355 ZnF_C2H2 zinc finger.
Probab=91.13 E-value=0.14 Score=36.76 Aligned_cols=24 Identities=17% Similarity=-0.100 Sum_probs=21.1
Q ss_pred cccCCCCcccCCchhhhccccccc
Q 000554 1019 HKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus 1019 ykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
|+|+.|+++|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 679999999999999999998775
No 61
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=90.90 E-value=0.093 Score=45.87 Aligned_cols=25 Identities=32% Similarity=0.777 Sum_probs=22.5
Q ss_pred ccccccCcCCCCCCCCCCceeeccc
Q 000554 1353 YLIYECNHMCSCDRTCPNRVLQNGV 1377 (1428)
Q Consensus 1353 ~~IyECn~~C~C~~~C~NRvvQ~G~ 1377 (1428)
.+.+||+..|+|+..|.||.+|+..
T Consensus 26 ~l~~EC~~~C~~G~~C~NqrFqk~~ 50 (51)
T smart00570 26 MLLIECSSDCPCGSYCSNQRFQKRQ 50 (51)
T ss_pred HHhhhcCCCCCCCcCccCcccccCc
Confidence 5679999999999999999999863
No 62
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, it is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=90.32 E-value=0.26 Score=47.32 Aligned_cols=60 Identities=18% Similarity=0.474 Sum_probs=47.7
Q ss_pred EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCCc
Q 000554 157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFPQ 220 (1428)
Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (1428)
-+|+|.+| -|--|+-+...+.+... .+......|.|.||+ +++|.||+---|.+..++-.
T Consensus 6 lVwaK~~g~pwWPa~V~~~~~~~~~~---~~~~~~~~~~V~Ffg-~~~~~wv~~~~l~pf~~~~~ 66 (87)
T cd05162 6 LVWAKMKGYPWWPALVVDPPKDSKKA---KKKAKEGKVLVLFFG-DKTFAWVGAERLKPFTEHKE 66 (87)
T ss_pred EEEEeCCCCCCCCEEEccccccchhh---hccCCCCEEEEEEeC-CCcEEEeCccceeeccchHH
Confidence 48999999 78888888777776543 233345789999999 99999999999988887653
No 63
>PRK04860 hypothetical protein; Provisional
Probab=89.39 E-value=0.14 Score=55.09 Aligned_cols=39 Identities=10% Similarity=-0.119 Sum_probs=34.8
Q ss_pred CCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcCh
Q 000554 1017 RPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGM 1059 (1428)
Q Consensus 1017 KpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~ 1059 (1428)
-+|.|. |++ ....+++|.++|+++++|.|..|+..+...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 379998 998 888899999999999999999999987643
No 64
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=88.90 E-value=0.31 Score=47.89 Aligned_cols=59 Identities=24% Similarity=0.430 Sum_probs=48.9
Q ss_pred EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhcccccc
Q 000554 157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSIN 216 (1428)
Q Consensus 157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (1428)
-+|.|-+|- |=-|+=|...+-|-.-|++++......|.|.||+. ++|.|++--.+.+..
T Consensus 6 lVwaK~~GyPwWPA~V~~~~~~p~~~l~~~~~~~~~~~~V~FFg~-~~~~Wv~~~~l~pl~ 65 (93)
T cd05840 6 RVLAKVKGFPAWPAIVVPEEMLPDSVLKGKKKKNKRTYPVMFFPD-GDYYWVPNKDLKPLT 65 (93)
T ss_pred EEEEeCCCCCCCCEEECChHHCCHHHHhcccCCCCCeEEEEEeCC-CcEEEEChhhcccCC
Confidence 389999994 66677777777888888888888899999999995 699999887777665
No 65
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. 
The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; PDB: 1YUI_A 1YUJ_A.
Probab=86.52 E-value=0.38 Score=41.90 Aligned_cols=29 Identities=24% Similarity=0.500 Sum_probs=17.8
Q ss_pred CcccccccccccCChhhhhhhhhhccccc
Q 000554 880 RGYACAICLDSFTNKKVLESHVQERHHVQ 908 (1428)
Q Consensus 880 KpykC~~CgKsF~~ks~L~~H~r~Hhgek 908 (1428)
.|-.|++|+..+.+..+|++|+..+|+.+
T Consensus 23 ~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 23 QPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp --EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 66777777777777777777777777655
No 66
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=85.33 E-value=0.43 Score=55.58 Aligned_cols=103 Identities=21% Similarity=0.224 Sum_probs=58.1
Q ss_pred cccccc--CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccC------ChhhhhhhhhhcCCccc--
Q 000554 915 LQQCIP--CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELG------YSASVENHSENLGSIRK-- 984 (1428)
Q Consensus 915 pfkC~~--CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~------sks~L~~H~rtHtGeKp-- 984 (1428)
.|.|+. |.........|..|.+..|. .+.|.+|......|. ++..|..|...-..+.-
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~------------~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFK 218 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG------------FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFK 218 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC------------cEEhHhhhcCcccCccceeeeecccccccccCCccccCcC
Confidence 356654 55555556667777765552 455666643333333 23456666543332222
Q ss_pred --eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc-------cCCchhhhcccc
Q 000554 985 --FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA-------YKLKSGRLSRPR 1039 (1428)
Q Consensus 985 --ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs-------Fs~ks~L~~H~r 1039 (1428)
-.|.+|.+.|-.-..|.+|.+..|.. |.+|++. |..-..|..|.+
T Consensus 219 GHP~C~FC~~~FYdDDEL~~HcR~~HE~----------ChICD~v~p~~~QYFK~Y~~Le~HF~ 272 (493)
T COG5236 219 GHPLCIFCKIYFYDDDELRRHCRLRHEA----------CHICDMVGPIRYQYFKSYEDLEAHFR 272 (493)
T ss_pred CCchhhhccceecChHHHHHHHHhhhhh----------hhhhhccCccchhhhhCHHHHHHHhh
Confidence 24788888888888888854444544 6666653 555566666653
No 67
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=84.73 E-value=0.26 Score=36.10 Aligned_cols=21 Identities=10% Similarity=-0.042 Sum_probs=11.0
Q ss_pred cccCCCCcccCCchhhhcccc
Q 000554 1019 HKKGIRFYAYKLKSGRLSRPR 1039 (1428)
Q Consensus 1019 ykC~~CgKsFs~ks~L~~H~r 1039 (1428)
|.|.+|++.|.....|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 345555555555555555554
No 68
>PF11722 zf-TRM13_CCCH: CCCH zinc finger in TRM13 protein; InterPro: IPR021721 This domain is found at the N terminus of TRM13 methyltransferase proteins. It is presumed to be a zinc binding domain. ; GO: 0008168 methyltransferase activity
Probab=83.99 E-value=0.35 Score=38.14 Aligned_cols=29 Identities=28% Similarity=0.619 Sum_probs=27.0
Q ss_pred cccchhhhhcCceeeEeecCCceEEEEec
Q 000554 533 RQCTAFIESKGRQCVRWANEGDVYCCVHL 561 (1428)
Q Consensus 533 ~~c~a~~~~kgrqc~r~a~~~~~ycc~h~ 561 (1428)
-+|.-||+.|.|.|.=.+..|..||--|+
T Consensus 2 ~~C~f~l~~K~R~C~m~~~~g~~fC~~H~ 30 (31)
T PF11722_consen 2 GRCEFFLPRKKRFCKMTRKPGSRFCGEHM 30 (31)
T ss_pred CcceEECCccccccCCeecCcCCccccCC
Confidence 37999999999999999999999999885
No 69
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=81.78 E-value=0.6 Score=34.05 Aligned_cols=23 Identities=35% Similarity=0.689 Sum_probs=11.9
Q ss_pred ccccccccccCChhhhhhhhhhcc
Q 000554 882 YACAICLDSFTNKKVLESHVQERH 905 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r~Hh 905 (1428)
|+|+.|+.... +..|..|++.||
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45566665555 555666665554
No 70
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=81.43 E-value=0.63 Score=34.07 Aligned_cols=21 Identities=33% Similarity=0.773 Sum_probs=11.1
Q ss_pred cccccccccCChhhhhhhhhh
Q 000554 883 ACAICLDSFTNKKVLESHVQE 903 (1428)
Q Consensus 883 kC~~CgKsF~~ks~L~~H~r~ 903 (1428)
.|.+|++.|.+...|..|++.
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCcCCHHHHHHHHCc
Confidence 455555555555555555543
No 71
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through
Probab=81.21 E-value=0.88 Score=32.73 Aligned_cols=28 Identities=21% Similarity=0.127 Sum_probs=25.3
Q ss_pred eecceeeeecccccCChhhhcccchhhhhhhhc
Q 000554 732 IISKEVFLELLKDCCSLEQKLHLHLACELFYKL 764 (1428)
Q Consensus 732 VTFkDVAV~F~r~c~SqEEW~~LdPaCrkLYrd 764 (1428)
++|+||++.| +.++|.++.+.++.+|.+
T Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 28 (40)
T cd07765 1 VTFEDVAVYF-----SQEEWELLDPAQRDLYRD 28 (40)
T ss_pred Ccceeeeeec-----CHHHHhcCCHHHHHHHHH
Confidence 3678999999 999999999999999886
No 72
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=80.75 E-value=0.92 Score=34.19 Aligned_cols=22 Identities=0% Similarity=-0.277 Sum_probs=17.3
Q ss_pred cccCCCCcccCCchhhhccccc
Q 000554 1019 HKKGIRFYAYKLKSGRLSRPRF 1040 (1428)
Q Consensus 1019 ykC~~CgKsFs~ks~L~~H~r~ 1040 (1428)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6788888888888888888754
No 73
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=77.11 E-value=1.8 Score=31.53 Aligned_cols=17 Identities=24% Similarity=0.626 Sum_probs=7.8
Q ss_pred eecCccCcccCChhhHHH
Q 000554 985 FICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~r 1002 (1428)
|+|+.|+.... ...|.+
T Consensus 1 y~C~~C~y~t~-~~~l~~ 17 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKR 17 (24)
T ss_dssp EE-SSSS-EES-HHHHHH
T ss_pred CCCCCCCCcCC-HHHHHH
Confidence 44555555554 555555
No 74
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=76.45 E-value=1.3 Score=33.28 Aligned_cols=21 Identities=24% Similarity=0.695 Sum_probs=10.3
Q ss_pred ccccccccccCChhhhhhhhh
Q 000554 882 YACAICLDSFTNKKVLESHVQ 902 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L~~H~r 902 (1428)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 345555555555555555544
No 75
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=75.57 E-value=1.8 Score=50.69 Aligned_cols=135 Identities=22% Similarity=0.313 Sum_probs=70.0
Q ss_pred ccCCC--CCcccccccccccccccccchhhhcccCcccccccc---cccC------Chhhhhhhhhhccccccccccccc
Q 000554 848 HKCKI--CSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICL---DSFT------NKKVLESHVQERHHVQFVEQCMLQ 916 (1428)
Q Consensus 848 ykC~~--CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~Cg---KsF~------~ks~L~~H~r~Hhgek~~e~~kpf 916 (1428)
|.|+. |.........|+.|.+..|. .+-|.+|- +.|. ++..|..|...-..+.... .-=
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~--------~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFK--GHP 221 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHG--------FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFK--GHP 221 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcC--------cEEhHhhhcCcccCccceeeeecccccccccCCccccCcC--CCc
Confidence 56654 44444445556666332332 24566663 2333 3344555544322221000 122
Q ss_pred ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCC----CCccccCChhhhhhhhhhcCCccceecCc--c
Q 000554 917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDS----PKKLELGYSASVENHSENLGSIRKFICRF--C 990 (1428)
Q Consensus 917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~----~k~~sf~sks~L~~H~rtHtGeKpykC~~--C 990 (1428)
.|..|...|-+-..|..|+|..|. .|.+|- ..-..|.+-..|..|.+.-+ |.|.+ |
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE--------------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc 283 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE--------------ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTC 283 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh--------------hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEE
Confidence 588888888888888888876553 344441 11122445556666664322 55532 3
Q ss_pred C----cccCChhhHHHHHHhhccC
Q 000554 991 G----LKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus 991 G----KsFs~~s~L~rHHqrvHtg 1010 (1428)
- ..|.....|..|..+.|..
T Consensus 284 ~~~k~~vf~~~~el~~h~~~~h~~ 307 (493)
T COG5236 284 RVGKCYVFPYHTELLEHLTRFHKV 307 (493)
T ss_pred ecCcEEEeccHHHHHHHHHHHhhc
Confidence 2 3577777777776666755
No 76
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=74.34 E-value=1.4 Score=49.37 Aligned_cols=46 Identities=24% Similarity=0.247 Sum_probs=35.9
Q ss_pred cCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-cccc
Q 000554 987 CRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFKK 1042 (1428)
Q Consensus 987 C~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H~ 1042 (1428)
|-+|.+.|....-|.+ |++ .|-|+|.+|.|...+.-.|..|- ++|+
T Consensus 13 cwycnrefddekiliq-hqk---------akhfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQ-HQK---------AKHFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeecccccchhhhhhh-hhh---------hccceeeeehhhhccCCCceeehhhhhh
Confidence 8888888888888888 443 46788888888888888888874 6665
No 77
>KOG4173 consensus Alpha-SNAP protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=74.06 E-value=0.64 Score=50.98 Aligned_cols=91 Identities=23% Similarity=0.325 Sum_probs=67.7
Q ss_pred Ccccccc--cccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccC
Q 000554 880 RGYACAI--CLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVG 957 (1428)
Q Consensus 880 KpykC~~--CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~ 957 (1428)
..|.|++ |...|........|-...|+.. |..|.+.|.+...|..|+...|..-|.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~s---------Cs~C~r~~Pt~hLLd~HI~E~HDs~Fq------------- 135 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGNS---------CSFCKRAFPTGHLLDAHILEWHDSLFQ------------- 135 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccch---------hHHHHHhCCchhhhhHHHHHHHHHHHH-------------
Confidence 3477876 7788888888888887777764 999999999999999998766631100
Q ss_pred cCCCCccccCChhhhhhhhhhcCCccceec--CccCcccCChhhHHHHHHhhccC
Q 000554 958 EDSPKKLELGYSASVENHSENLGSIRKFIC--RFCGLKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus 958 ~C~~k~~sf~sks~L~~H~rtHtGeKpykC--~~CGKsFs~~s~L~rHHqrvHtg 1010 (1428)
..+-.|.-.|+| ..|+..|.+...-+.|..+.|.=
T Consensus 136 ------------------a~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk~ 172 (253)
T KOG4173|consen 136 ------------------ALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHKY 172 (253)
T ss_pred ------------------HHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhccC
Confidence 112334556888 56999999999999988888876
No 78
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=72.78 E-value=3.7 Score=48.34 Aligned_cols=76 Identities=20% Similarity=0.286 Sum_probs=40.0
Q ss_pred hhhhhhhhhcccccccccccccccccCCCCC-CChhhhhhhhhhcccccccc----------hhhh--hccccccCcCCC
Q 000554 895 KVLESHVQERHHVQFVEQCMLQQCIPCGSHF-GNTEELWLHVQSVHAIDFKM----------SEVA--QQHNQSVGEDSP 961 (1428)
Q Consensus 895 s~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF-~sks~L~~H~rsvHsgEf~~----------~s~~--~~kp~~C~~C~~ 961 (1428)
..|..|++...+.. ...+|-.|...+ .+.+....|+-.+|.-.... .... +-..+.|-.|
T Consensus 129 eaLeqqQ~Eredt~-----fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyC-- 201 (423)
T KOG2482|consen 129 EALEQQQKEREDTI-----FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYC-- 201 (423)
T ss_pred HHHHHHHHHhcCCe-----eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeee--
Confidence 44555655554433 245677776544 34455666655455321000 0001 1124667666
Q ss_pred CccccCChhhhhhhhhh
Q 000554 962 KKLELGYSASVENHSEN 978 (1428)
Q Consensus 962 k~~sf~sks~L~~H~rt 978 (1428)
.+.|+.+..|+.|||.
T Consensus 202 -ekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 202 -EKIFRDKNTLKEHMRK 217 (423)
T ss_pred -ccccCCcHHHHHHHHh
Confidence 7777777788888853
No 79
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=70.18 E-value=2.7 Score=49.41 Aligned_cols=78 Identities=24% Similarity=0.334 Sum_probs=47.6
Q ss_pred cccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccC
Q 000554 916 QQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFD 995 (1428)
Q Consensus 916 fkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs 995 (1428)
..|-.|.....+...|..||+.+|.-++.. . +- ..+..|-..-.+....|. ..+.-.|-.|.-.|.
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~K--i------~s----d~~Ln~YqrvrviNyiRk--q~~~~~c~~cd~~F~ 345 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLK--I------QS----DYSLNFYQRVRVINYIRK--QKKKSRCAECDLSFW 345 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHh--h------cc----ccccchhhhhhHHHHHHH--Hhhcccccccccccc
Confidence 589999999999999999999999644100 0 00 111222221222222221 113356788889999
Q ss_pred ChhhHHHHHHhhc
Q 000554 996 LLPDLGRHHQAAH 1008 (1428)
Q Consensus 996 ~~s~L~rHHqrvH 1008 (1428)
....|.. |+.-|
T Consensus 346 ~e~~l~~-hm~e~ 357 (423)
T KOG2482|consen 346 KEPGLLI-HMVED 357 (423)
T ss_pred Ccchhhh-hcccc
Confidence 9999999 44433
No 80
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=67.51 E-value=5.7 Score=40.23 Aligned_cols=63 Identities=17% Similarity=0.374 Sum_probs=45.8
Q ss_pred EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCC
Q 000554 157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFP 219 (1428)
Q Consensus 157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (1428)
-+|.|=+|- |--|+-+...+=|..+.+..+....+.|.|.||..+.+|.||.---+.++.+.-
T Consensus 8 lVWaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~aWv~~~~l~pf~~~~ 71 (110)
T cd05837 8 LVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPERAWISEKSLKPFKGSK 71 (110)
T ss_pred EEEEeCCCCCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCEEEecHHHccccCCch
Confidence 479999884 666666654444444444445555689999999999999999988888877654
No 81
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=62.23 E-value=3.1 Score=46.67 Aligned_cols=47 Identities=26% Similarity=0.497 Sum_probs=35.8
Q ss_pred ccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhccc
Q 000554 884 CAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHA 940 (1428)
Q Consensus 884 C~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHs 940 (1428)
|-+|++.|....-|..|++.. -|+|.+|.|..-+--.|..|-..+|.
T Consensus 13 cwycnrefddekiliqhqkak----------hfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK----------HFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeecccccchhhhhhhhhhhc----------cceeeeehhhhccCCCceeehhhhhh
Confidence 888888888888888887753 47788888877777777777655664
No 82
>KOG2785 consensus C2H2-type Zn-finger protein [General function prediction only]
Probab=61.20 E-value=9.5 Score=45.97 Aligned_cols=55 Identities=13% Similarity=-0.028 Sum_probs=41.2
Q ss_pred cceecCccCcccCChhhHHHHHHhhccCCCCC------------------CCCCcccCCCC---cccCCchhhhccc
Q 000554 983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLV------------------NSRPHKKGIRF---YAYKLKSGRLSRP 1038 (1428)
Q Consensus 983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~------------------~eKpykC~~Cg---KsFs~ks~L~~H~ 1038 (1428)
-|-.|-+|++.|.+...-..| +..|.|.-.. ...-|.|-.|+ +.|.+-...+.||
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~H-M~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM 240 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKH-MFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHM 240 (390)
T ss_pred CCcceeecCCCcccHHHHHHH-HhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHH
Confidence 357899999999999999994 5444441100 03468888898 8999999999999
No 83
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=56.38 E-value=4.8 Score=38.32 Aligned_cols=36 Identities=19% Similarity=0.106 Sum_probs=28.5
Q ss_pred CCC-CCcccC------------CcccccccCCCCCCc-cccccceeeecc
Q 000554 1184 HLE-PLPSVS------------AGIRSSDSSDFVNNQ-WEVDECHCIIDS 1219 (1428)
Q Consensus 1184 Pl~-p~~~~~------------~~~k~v~~~~p~~~~-w~~~e~~~~l~~ 1219 (1428)
|+. |++.|| .++..|.|..|||.. +.+.|+..||..
T Consensus 3 ~~~~Plp~GW~R~~~~r~~g~~~~~~dV~Y~sP~GkklRs~~ev~~YL~~ 52 (77)
T smart00391 3 PLRLPLPCGWRRETKQRKSGRSAGKFDVYYISPCGKKLRSKSELARYLHK 52 (77)
T ss_pred cccCCCCCCcEEEEEEecCCCCCCcccEEEECCCCCeeeCHHHHHHHHHh
Confidence 444 677777 135678999999999 999999998863
No 84
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=55.24 E-value=8.3 Score=28.93 Aligned_cols=18 Identities=39% Similarity=0.755 Sum_probs=12.9
Q ss_pred eecCccCcccCChhhHHHH
Q 000554 985 FICRFCGLKFDLLPDLGRH 1003 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rH 1003 (1428)
..|+.||+.| ....|.+|
T Consensus 3 ~~C~~CgR~F-~~~~l~~H 20 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKH 20 (25)
T ss_pred CcCCCCCCEE-CHHHHHHH
Confidence 4688888888 56677773
No 85
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=54.89 E-value=4.3 Score=31.99 Aligned_cols=21 Identities=0% Similarity=-0.236 Sum_probs=13.6
Q ss_pred CcccCCCCcccCCchhhhccc
Q 000554 1018 PHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus 1018 pykC~~CgKsFs~ks~L~~H~ 1038 (1428)
+|.|.+|++.|.....+..|+
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~ 23 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHL 23 (35)
T ss_pred CeEccccCCccCCHHHHHHHH
Confidence 456666666666666666666
No 86
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=53.76 E-value=7 Score=29.34 Aligned_cols=19 Identities=42% Similarity=0.789 Sum_probs=10.0
Q ss_pred cccccccccCChhhhhhhhh
Q 000554 883 ACAICLDSFTNKKVLESHVQ 902 (1428)
Q Consensus 883 kC~~CgKsF~~ks~L~~H~r 902 (1428)
.|+.||+.| ....|.+|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 355555555 4455555543
No 87
>KOG4173 consensus Alpha-SNAP protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=51.59 E-value=5.5 Score=44.03 Aligned_cols=93 Identities=15% Similarity=0.015 Sum_probs=66.9
Q ss_pred cccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccC----CCCCCCCCcccCC--CC
Q 000554 952 HNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMG----PNLVNSRPHKKGI--RF 1025 (1428)
Q Consensus 952 kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtg----e~~~~eKpykC~~--Cg 1025 (1428)
..+.|.+- +|...+.+...+..|..+-+| -.|.+|.+.|.+..-|..|....|.. ....+.-.|+|-+ |+
T Consensus 78 ~~~~cqva-gc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVA-GCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred cccccccc-chHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 45778776 666777766666777644333 38999999999999999987666732 0001145799955 99
Q ss_pred cccCCchhhhccc-ccccCCCccc
Q 000554 1026 YAYKLKSGRLSRP-RFKKGLGAVS 1048 (1428)
Q Consensus 1026 KsFs~ks~L~~H~-r~H~gekpy~ 1048 (1428)
..|.+....+.|+ ++|.--..|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk~Pa~fr 177 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHKYPADFR 177 (253)
T ss_pred hhhhhhhhhhhHHHHhccCCccee
Confidence 9999999999999 7887544444
No 88
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=50.37 E-value=8.6 Score=30.26 Aligned_cols=22 Identities=18% Similarity=0.306 Sum_probs=19.2
Q ss_pred ccccccCCCCCCChhhhhhhhh
Q 000554 915 LQQCIPCGSHFGNTEELWLHVQ 936 (1428)
Q Consensus 915 pfkC~~CgKsF~sks~L~~H~r 936 (1428)
+|.|..|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 5789999999999889988876
No 89
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.40 E-value=8.3 Score=34.36 Aligned_cols=32 Identities=28% Similarity=0.385 Sum_probs=22.3
Q ss_pred hcCCccceecCccCcccCChhhHHHHHHhhcc
Q 000554 978 NLGSIRKFICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus 978 tHtGeKpykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
.-.||--+.|+.||+.|....+..+|.-+.|.
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 34566677788888888877777776555553
No 90
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=47.25 E-value=6.1 Score=44.60 Aligned_cols=20 Identities=25% Similarity=0.534 Sum_probs=13.5
Q ss_pred cceecCccCcccCChhhHHH
Q 000554 983 RKFICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus 983 KpykC~~CGKsFs~~s~L~r 1002 (1428)
|.++|+.|++.|....-+..
T Consensus 4 k~~~CPvC~~~F~~~~vrs~ 23 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSG 23 (214)
T ss_pred CceECCCCCCeeeeeEEEcC
Confidence 56778888888876544333
No 91
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=42.52 E-value=27 Score=31.74 Aligned_cols=56 Identities=20% Similarity=0.433 Sum_probs=38.5
Q ss_pred EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccc
Q 000554 157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSI 215 (1428)
Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (1428)
-+|.|=+| -|--|+-+...+-|...++ +.-..+.|.|.||.. .+|.|++--.+.++
T Consensus 6 lVwaK~~G~p~WPa~V~~~~~~~~~~~~--~~~~~~~~~V~Ffg~-~~~awv~~~~l~p~ 62 (63)
T smart00293 6 LVWAKMKGFPWWPALVVSPKETPDNIRK--RKRFENLYPVLFFGD-KDTAWISSSKLFPL 62 (63)
T ss_pred EEEEECCCCCCCCeEEcCcccCChhHhh--ccCCCCEEEEEEeCC-CCEEEECccceeeC
Confidence 37999999 7777777766665554332 334456788888875 55699987766654
No 92
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and is believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.44 E-value=18 Score=28.81 Aligned_cols=11 Identities=36% Similarity=1.352 Sum_probs=6.7
Q ss_pred eecCccCcccC
Q 000554 985 FICRFCGLKFD 995 (1428)
Q Consensus 985 ykC~~CGKsFs 995 (1428)
|+|..||..+.
T Consensus 2 ~~C~~CGy~y~ 12 (33)
T cd00350 2 YVCPVCGYIYD 12 (33)
T ss_pred EECCCCCCEEC
Confidence 56666666544
No 93
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhorn syndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=41.14 E-value=26 Score=33.13 Aligned_cols=56 Identities=23% Similarity=0.577 Sum_probs=38.4
Q ss_pred EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCC
Q 000554 157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFP 219 (1428)
Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (1428)
-+|+|=+| -|=-|+=|...+.+- + ......|.|.||... +|.|++.-.|.+.+++-
T Consensus 6 lVWaK~~g~pwWPa~V~~~~~~~~-----~-~~~~~~~~V~Ffg~~-~~~wv~~~~i~~f~~~~ 62 (86)
T PF00855_consen 6 LVWAKLKGYPWWPARVCDPDEKSK-----K-KRKDGHVLVRFFGDN-DYAWVKPSNIKPFSEFK 62 (86)
T ss_dssp EEEEEETTSEEEEEEEEECCHCTS-----C-SSSSTEEEEEETTTT-EEEEEEGGGEEECCHHH
T ss_pred EEEEEeCCCCCCceEEeecccccc-----c-CCCCCEEEEEecCCC-CEEEECHHHhhChhhhH
Confidence 48999987 355666666664443 1 334466777777766 99999998888877544
No 94
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=39.33 E-value=13 Score=36.16 Aligned_cols=34 Identities=21% Similarity=0.263 Sum_probs=23.2
Q ss_pred cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCch
Q 000554 983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKS 1032 (1428)
Q Consensus 983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks 1032 (1428)
.+|.|+.|++. . +.|+-+| -+.|..|++.|.-..
T Consensus 34 ~~~~Cp~C~~~-~--------VkR~a~G-------IW~C~kCg~~fAGga 67 (89)
T COG1997 34 AKHVCPFCGRT-T--------VKRIATG-------IWKCRKCGAKFAGGA 67 (89)
T ss_pred cCCcCCCCCCc-c--------eeeeccC-------eEEcCCCCCeecccc
Confidence 46788888876 1 4555555 788888888876443
No 95
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=38.94 E-value=32 Score=39.62 Aligned_cols=27 Identities=15% Similarity=-0.012 Sum_probs=21.1
Q ss_pred CCCcccCCCCcccCCchhhhccccccc
Q 000554 1016 SRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus 1016 eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
.+++.|+.|+........|..-.|.|.
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred CCCCCCCCCCCcccccccceeeeecch
Confidence 578889999888888777877666665
No 96
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=38.52 E-value=16 Score=29.64 Aligned_cols=34 Identities=12% Similarity=0.123 Sum_probs=19.9
Q ss_pred eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccC
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYK 1029 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs 1029 (1428)
++|+.|+..|.-...... .. .....|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~~------~~-----~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG------AN-----GGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHcC------CC-----CCEEECCCCCCEEE
Confidence 567777777766544322 11 22467777777663
No 97
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHSC1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=38.00 E-value=25 Score=34.75 Aligned_cols=54 Identities=26% Similarity=0.543 Sum_probs=34.1
Q ss_pred EEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhcccc
Q 000554 158 LWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRS 214 (1428)
Q Consensus 158 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (1428)
+|+|-+| -|=-|+-|-..+=|-..+..+ +....|.|.|| .+++|.|++--.|-+
T Consensus 7 VWaK~~g~pwWPa~V~~~~~~p~~~~~~~--~~~~~~~V~Ff-gs~~y~Wv~~~~l~p 61 (95)
T cd05838 7 VWAKLGNFRWWPAIICDPREVPPNIQVLR--HCIGEFCVMFF-GTHDYYWVHRGRVFP 61 (95)
T ss_pred EEEECCCCCCCCeEEcChhhcChhHhhcc--CCCCeEEEEEe-CCCCEEEeccccccc
Confidence 7999998 455666665543333222211 23356888888 589999999744443
No 98
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.85 E-value=39 Score=34.59 Aligned_cols=48 Identities=19% Similarity=0.390 Sum_probs=29.3
Q ss_pred cccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcc
Q 000554 883 ACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVH 939 (1428)
Q Consensus 883 kC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvH 939 (1428)
.|--|.+.|....... .++ ......|+|+.|...|-..-+...|.. .|
T Consensus 57 ~C~~C~~~f~~~~~~~------~~~--~~~~~~y~C~~C~~~FC~dCD~fiHe~-Lh 104 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSP------FDE--LKDSHRYVCAVCKNVFCVDCDVFVHES-LH 104 (112)
T ss_pred cccCcCCCCCCccccc------ccc--cccccceeCCCCCCccccccchhhhhh-cc
Confidence 3777888887653211 110 001136888888888888888887863 45
No 99
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=37.53 E-value=24 Score=38.09 Aligned_cols=40 Identities=13% Similarity=-0.029 Sum_probs=28.1
Q ss_pred hhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554 974 NHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus 974 ~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
.-+.......-|.|+.|+..|+....+.. -|.|+.||...
T Consensus 99 ~~l~~e~~~~~Y~Cp~c~~r~tf~eA~~~---------------~F~Cp~Cg~~L 138 (158)
T TIGR00373 99 EKLEFETNNMFFICPNMCVRFTFNEAMEL---------------NFTCPRCGAML 138 (158)
T ss_pred HHHhhccCCCeEECCCCCcEeeHHHHHHc---------------CCcCCCCCCEe
Confidence 33334455567889999988887776643 58899998653
No 100
>PF14353 CpXC: CpXC protein
Probab=37.15 E-value=22 Score=36.60 Aligned_cols=50 Identities=22% Similarity=0.264 Sum_probs=32.5
Q ss_pred ecCccCcccCC----------hhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccccccc
Q 000554 986 ICRFCGLKFDL----------LPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus 986 kC~~CGKsFs~----------~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
.|+.||..|.. ...|+. ++-.|. --.|.|+.||+.|.-...+..|-..|.
T Consensus 3 tCP~C~~~~~~~v~~~I~~~~~p~l~e---~il~g~----l~~~~CP~Cg~~~~~~~p~lY~D~~~~ 62 (128)
T PF14353_consen 3 TCPHCGHEFEFEVWTSINADEDPELKE---KILDGS----LFSFTCPSCGHKFRLEYPLLYHDPEKK 62 (128)
T ss_pred CCCCCCCeeEEEEEeEEcCcCCHHHHH---HHHcCC----cCEEECCCCCCceecCCCEEEEcCCCC
Confidence 57888877753 223332 233442 346889999999988888888765543
No 101
>KOG3813 consensus Uncharacterized conserved protein (tumor-suppressor AXUD1 in humans) [General function prediction only]
Probab=37.04 E-value=16 Score=45.37 Aligned_cols=19 Identities=42% Similarity=1.027 Sum_probs=16.5
Q ss_pred CCCcccCCCCcCCCCCCccc
Q 000554 1299 QLGCACANSTCFPETCDHVY 1318 (1428)
Q Consensus 1299 ~~gC~C~~~~C~~~~C~C~~ 1318 (1428)
.+||+|.. -|+|++|+|.+
T Consensus 307 eCGCsCr~-~CdPETCaCSq 325 (640)
T KOG3813|consen 307 ECGCSCRG-VCDPETCACSQ 325 (640)
T ss_pred hhCCcccc-eeChhhcchhc
Confidence 57999994 89999999964
No 102
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.02 E-value=19 Score=36.58 Aligned_cols=30 Identities=23% Similarity=0.222 Sum_probs=22.0
Q ss_pred eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCc
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLK 1031 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~k 1031 (1428)
..|+.||++|--. . ..|..|+.||..|.-.
T Consensus 10 R~Cp~CG~kFYDL---n--------------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL---N--------------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC---C--------------CCCccCCCCCCccCcc
Confidence 5788888888643 2 2477888888888766
No 103
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=35.90 E-value=29 Score=36.93 Aligned_cols=39 Identities=13% Similarity=0.059 Sum_probs=25.0
Q ss_pred CCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554 980 GSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus 980 tGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
....-|.|+.|++.|.....+.. .+. ...|.|+.||...
T Consensus 95 ~~~~~Y~Cp~C~~~y~~~ea~~~----~d~------~~~f~Cp~Cg~~l 133 (147)
T smart00531 95 TNNAYYKCPNCQSKYTFLEANQL----LDM------DGTFTCPRCGEEL 133 (147)
T ss_pred cCCcEEECcCCCCEeeHHHHHHh----cCC------CCcEECCCCCCEE
Confidence 34456899999988886544332 111 2348999998764
No 104
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=35.18 E-value=13 Score=35.16 Aligned_cols=25 Identities=4% Similarity=-0.186 Sum_probs=21.4
Q ss_pred cccccccCCCCCCc-cccccceeeec
Q 000554 1194 GIRSSDSSDFVNNQ-WEVDECHCIID 1218 (1428)
Q Consensus 1194 ~~k~v~~~~p~~~~-w~~~e~~~~l~ 1218 (1428)
++..|.|.+|||.. +++.|++.||.
T Consensus 23 ~~~dV~Y~aPcGKklRs~~ev~~yL~ 48 (73)
T cd01397 23 IQGEVAYYAPCGKKLRQYPEVIKYLS 48 (73)
T ss_pred ccceEEEECCCCcccccHHHHHHHHH
Confidence 34468899999999 99999998886
No 105
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=35.17 E-value=14 Score=30.27 Aligned_cols=12 Identities=33% Similarity=1.052 Sum_probs=8.4
Q ss_pred eecCccCcccCC
Q 000554 985 FICRFCGLKFDL 996 (1428)
Q Consensus 985 ykC~~CGKsFs~ 996 (1428)
|+|..||+.|..
T Consensus 6 y~C~~Cg~~fe~ 17 (41)
T smart00834 6 YRCEDCGHTFEV 17 (41)
T ss_pred EEcCCCCCEEEE
Confidence 677777777754
No 106
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=33.63 E-value=18 Score=39.02 Aligned_cols=19 Identities=5% Similarity=-0.303 Sum_probs=13.8
Q ss_pred CCcccCCCCcccCCchhhh
Q 000554 1017 RPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus 1017 KpykC~~CgKsFs~ks~L~ 1035 (1428)
+.++|+.||++|..-..+.
T Consensus 27 ~~~~c~~c~~~f~~~e~~~ 45 (154)
T PRK00464 27 RRRECLACGKRFTTFERVE 45 (154)
T ss_pred eeeeccccCCcceEeEecc
Confidence 3488888888887665544
No 107
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=33.28 E-value=27 Score=46.25 Aligned_cols=43 Identities=21% Similarity=0.178 Sum_probs=28.4
Q ss_pred CCCChhhhhhhhhhhhHHHHHHHHhhh-cCCCCcccccccccccc
Q 000554 1111 RPNSHEILSMARLACCKVSLKASLEEK-YGALPENICLKAAKLCS 1154 (1428)
Q Consensus 1111 ~P~n~diLsiars~CcK~~l~~~L~~k-~g~lpe~l~~~aakl~~ 1154 (1428)
.|.+..|..+-.. =.-.|..+.|..+ -..+||--++-+...-+
T Consensus 602 ~P~hp~i~~~~~~-dy~~F~~~El~~Rk~~~~PPf~~l~~v~~~~ 645 (730)
T COG1198 602 NPDHPAIQALKRG-DYEAFYEQELAERKELGLPPFSRLAAVIASA 645 (730)
T ss_pred CCCcHHHHHHHhc-CHHHHHHHHHHHHHhcCCCChhhheeeEecC
Confidence 3666666555554 3446777888777 68889988776655543
No 108
>KOG2461 consensus Transcription factor BLIMP-1/PRDI-BF1, contains C2H2-type Zn-finger and SET domains [Transcription]
Probab=32.17 E-value=90 Score=38.68 Aligned_cols=78 Identities=0% Similarity=-0.293 Sum_probs=53.8
Q ss_pred hhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccC
Q 000554 971 SVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYR 1050 (1428)
Q Consensus 971 ~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~ 1050 (1428)
.+..|...|++..++.+..+.+.+.....+.. +...|.+ +.++.+..+...+.....+..+..+|...+.+.+.
T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (396)
T KOG2461|consen 318 VLDQSEVPATVSVWTGETIPVRTPAGQLIYTQ-SHSMEVA-----EPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMS 391 (396)
T ss_pred ccccccccccccccCcCcccccccccccchhh-hhhcccC-----CCCcccccccccceeccccceeeeecccccccccc
Confidence 55667777888888888888888888778888 6666776 55555555555555556666666677766667666
Q ss_pred CCCC
Q 000554 1051 IRNR 1054 (1428)
Q Consensus 1051 ~C~k 1054 (1428)
.|.+
T Consensus 392 ~~~~ 395 (396)
T KOG2461|consen 392 FVCK 395 (396)
T ss_pred ccCC
Confidence 6554
No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=32.12 E-value=30 Score=38.14 Aligned_cols=35 Identities=11% Similarity=0.125 Sum_probs=24.7
Q ss_pred CCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccC
Q 000554 980 GSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYK 1029 (1428)
Q Consensus 980 tGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs 1029 (1428)
....-|.|+.|++.|+....+.. -|.|+.||....
T Consensus 113 ~~~~~Y~Cp~C~~rytf~eA~~~---------------~F~Cp~Cg~~L~ 147 (178)
T PRK06266 113 ENNMFFFCPNCHIRFTFDEAMEY---------------GFRCPQCGEMLE 147 (178)
T ss_pred cCCCEEECCCCCcEEeHHHHhhc---------------CCcCCCCCCCCe
Confidence 34456889888888887765532 588888886543
No 110
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=31.47 E-value=16 Score=30.80 Aligned_cols=12 Identities=33% Similarity=1.099 Sum_probs=8.1
Q ss_pred eecCccCcccCC
Q 000554 985 FICRFCGLKFDL 996 (1428)
Q Consensus 985 ykC~~CGKsFs~ 996 (1428)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (42)
T PF09723_consen 6 YRCEECGHEFEV 17 (42)
T ss_pred EEeCCCCCEEEE
Confidence 667777776654
No 111
>PHA00626 hypothetical protein
Probab=31.38 E-value=19 Score=32.33 Aligned_cols=13 Identities=8% Similarity=-0.455 Sum_probs=7.9
Q ss_pred CcccCCCCcccCC
Q 000554 1018 PHKKGIRFYAYKL 1030 (1428)
Q Consensus 1018 pykC~~CgKsFs~ 1030 (1428)
.|+|+.||+.|+.
T Consensus 23 rYkCkdCGY~ft~ 35 (59)
T PHA00626 23 DYVCCDCGYNDSK 35 (59)
T ss_pred ceEcCCCCCeech
Confidence 5666666666653
No 112
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=31.17 E-value=15 Score=33.32 Aligned_cols=27 Identities=7% Similarity=-0.031 Sum_probs=22.7
Q ss_pred cccccccCCCCCCc-cccccceeeeccC
Q 000554 1194 GIRSSDSSDFVNNQ-WEVDECHCIIDSR 1220 (1428)
Q Consensus 1194 ~~k~v~~~~p~~~~-w~~~e~~~~l~~~ 1220 (1428)
++..|.|..|+|.. +.+.|+..||..+
T Consensus 23 ~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~ 50 (62)
T cd00122 23 GKGDVYYYSPCGKKLRSKPEVARYLEKT 50 (62)
T ss_pred CcceEEEECCCCceecCHHHHHHHHHhC
Confidence 45578999999988 9999999988754
No 113
>PF13891 zf-C3Hc3H: Potential DNA-binding domain
Probab=31.05 E-value=15 Score=33.77 Aligned_cols=23 Identities=39% Similarity=0.686 Sum_probs=20.4
Q ss_pred eeccCcccccccCCCcccccCCC
Q 000554 587 TVLGTRCKHRALYGSSFCKKHRP 609 (1428)
Q Consensus 587 ~~~g~~ckh~~~~~~~~c~~~~~ 609 (1428)
+..|+.|+.+++||+.||-+|-.
T Consensus 3 ~~~~~~C~~~~lp~~~yC~~HIl 25 (65)
T PF13891_consen 3 TYSGRGCSQPALPGSKYCIRHIL 25 (65)
T ss_pred CCCCCCcCcccCchhhHHHHHhc
Confidence 45789999999999999999874
No 114
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=30.77 E-value=52 Score=33.06 Aligned_cols=27 Identities=15% Similarity=-0.070 Sum_probs=22.8
Q ss_pred CCccc----CCCCcccCCchhhhcccccccC
Q 000554 1017 RPHKK----GIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus 1017 KpykC----~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
.-|.| ..|++.+.+...+++|.+.++|
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 45889 9999999999999999977654
No 115
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=30.55 E-value=19 Score=31.28 Aligned_cols=12 Identities=33% Similarity=1.168 Sum_probs=7.7
Q ss_pred eecCccCcccCC
Q 000554 985 FICRFCGLKFDL 996 (1428)
Q Consensus 985 ykC~~CGKsFs~ 996 (1428)
|+|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 666666666653
No 116
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=30.07 E-value=78 Score=32.48 Aligned_cols=61 Identities=20% Similarity=0.359 Sum_probs=40.7
Q ss_pred EEEEEeccc-cccceeeeec----cCC-----Ccccc----ccccCCCccEEEEEeccCCcchhhhhhccccccC
Q 000554 157 ALWVKWRGK-WQAGIRCARA----DWP-----LPTLK----AKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINE 217 (1428)
Q Consensus 157 ~~~~~~~~~-~~~~~~~~~~----~~~-----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (1428)
-||.|-+|- |.-|+-.-.. ..+ ++-|+ .+.-.+.+.|+|-||=.+++|.|++---+.+..+
T Consensus 6 lVwaK~~g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~~Wv~~~~l~pl~~ 80 (111)
T cd05839 6 LVWAKCRGYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTWQWLPGDKLEPLGV 80 (111)
T ss_pred EeeeeecCCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcceecCHHHCccccc
Confidence 379998883 6666554422 111 12222 2334688889999999999999999887776654
No 117
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=29.98 E-value=28 Score=39.41 Aligned_cols=42 Identities=17% Similarity=0.059 Sum_probs=30.7
Q ss_pred CCCcccCCCCcccCCchhhhcccccc----------cCCCccc-----cCCCCCcCc
Q 000554 1016 SRPHKKGIRFYAYKLKSGRLSRPRFK----------KGLGAVS-----YRIRNRGAA 1057 (1428)
Q Consensus 1016 eKpykC~~CgKsFs~ks~L~~H~r~H----------~gekpy~-----C~~C~ksf~ 1057 (1428)
.+.+.||+|++.|..+.-+....++- .+..|+- |+.||.++.
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~ 59 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAF 59 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccc
Confidence 57889999999999987777666431 2233332 999998875
No 118
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4) and is believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=29.61 E-value=35 Score=27.54 Aligned_cols=10 Identities=30% Similarity=1.109 Sum_probs=6.1
Q ss_pred eecCccCccc
Q 000554 985 FICRFCGLKF 994 (1428)
Q Consensus 985 ykC~~CGKsF 994 (1428)
|+|..||..+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 5666666543
No 119
>PF11722 zf-TRM13_CCCH: CCCH zinc finger in TRM13 protein; InterPro: IPR021721 This domain is found at the N terminus of TRM13 methyltransferase proteins. It is presumed to be a zinc binding domain. ; GO: 0008168 methyltransferase activity
Probab=29.38 E-value=31 Score=27.52 Aligned_cols=21 Identities=38% Similarity=0.634 Sum_probs=18.4
Q ss_pred ccCcccccccCCCcccccCCC
Q 000554 589 LGTRCKHRALYGSSFCKKHRP 609 (1428)
Q Consensus 589 ~g~~ckh~~~~~~~~c~~~~~ 609 (1428)
-.|.|+-...+|+.||.-|.|
T Consensus 11 K~R~C~m~~~~g~~fC~~H~~ 31 (31)
T PF11722_consen 11 KKRFCKMTRKPGSRFCGEHMP 31 (31)
T ss_pred cccccCCeecCcCCccccCCC
Confidence 357899999999999999975
No 120
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=27.83 E-value=15 Score=32.80 Aligned_cols=31 Identities=23% Similarity=0.417 Sum_probs=18.5
Q ss_pred ccCCCCcccCCCCCccccccccccccccccc
Q 000554 841 RSEDEKTHKCKICSQVFLHDQELGVHWMDNH 871 (1428)
Q Consensus 841 ~H~gekpykC~~CgK~F~s~s~L~~H~~r~H 871 (1428)
...|+..++|+.|+..|.....+.+|.-+.|
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 3445556666666666666666666644444
No 121
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=27.78 E-value=21 Score=40.93 Aligned_cols=48 Identities=17% Similarity=0.463 Sum_probs=22.8
Q ss_pred ccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhc
Q 000554 848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQER 904 (1428)
Q Consensus 848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~H 904 (1428)
|.|..||.....+ .+.+| +....+ .-|.|-.|++.|.. ..+..|.+--
T Consensus 4 FtCnvCgEsvKKp-~vekH-~srCrn------~~fSCIDC~k~F~~-~sYknH~kCI 51 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKH-MSRCRN------AYFSCIDCGKTFER-VSYKNHTKCI 51 (276)
T ss_pred Eehhhhhhhcccc-chHHH-HHhccC------CeeEEeeccccccc-chhhhhhhhc
Confidence 4555555554433 24445 222222 23555556655555 4455554433
No 122
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=26.89 E-value=46 Score=30.34 Aligned_cols=32 Identities=22% Similarity=0.104 Sum_probs=19.6
Q ss_pred ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCc
Q 000554 984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFY 1026 (1428)
Q Consensus 984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgK 1026 (1428)
.|.|+.||..-..+..--+ .+. .+|.|+.||.
T Consensus 27 ~F~CPnCGe~~I~Rc~~CR----k~g-------~~Y~Cp~CGF 58 (61)
T COG2888 27 KFPCPNCGEVEIYRCAKCR----KLG-------NPYRCPKCGF 58 (61)
T ss_pred EeeCCCCCceeeehhhhHH----HcC-------CceECCCcCc
Confidence 5888888866554433222 233 4888888873
No 123
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=26.08 E-value=39 Score=27.66 Aligned_cols=33 Identities=12% Similarity=0.146 Sum_probs=19.2
Q ss_pred eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
..|+.|+..|.-...... .. .+..+|+.|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip------~~-----g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIP------PK-----GRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCC------CC-----CcEEECCCCCCEe
Confidence 457777777766554332 11 3456777777665
No 124
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=25.91 E-value=34 Score=30.06 Aligned_cols=29 Identities=14% Similarity=0.135 Sum_probs=19.8
Q ss_pred cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc
Q 000554 983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA 1027 (1428)
Q Consensus 983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs 1027 (1428)
..|+|-.||+.|. .+.. .....|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~---~~~~-------------~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVE---LDQE-------------TRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeee---hhhc-------------cCceeCCCCCcE
Confidence 4588999999882 1222 456789998854
No 125
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=25.25 E-value=38 Score=28.48 Aligned_cols=13 Identities=23% Similarity=0.511 Sum_probs=8.7
Q ss_pred ccCCCCCcccccc
Q 000554 848 HKCKICSQVFLHD 860 (1428)
Q Consensus 848 ykC~~CgK~F~s~ 860 (1428)
|+|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 6 YRCEECGHEFEVL 18 (42)
T ss_pred EEeCCCCCEEEEE
Confidence 6777777776544
No 126
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not, instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog); however, they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger, which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. 
The BED finger is a domain of about 50 to 60 amino acid residues that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=25.12 E-value=54 Score=27.44 Aligned_cols=28 Identities=29% Similarity=0.642 Sum_probs=15.4
Q ss_pred CccceecCccCcccCCh----hhHHHHHHhhc
Q 000554 981 SIRKFICRFCGLKFDLL----PDLGRHHQAAH 1008 (1428)
Q Consensus 981 GeKpykC~~CGKsFs~~----s~L~rHHqrvH 1008 (1428)
+....+|..|++.+... +.|.+|..+.|
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 34557788888777664 67777544555
No 127
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=24.87 E-value=47 Score=34.67 Aligned_cols=34 Identities=24% Similarity=0.188 Sum_probs=23.1
Q ss_pred eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhh
Q 000554 985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus 985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~ 1035 (1428)
..|+.||++|-.. . ..|..|+.||..|.....++
T Consensus 10 r~Cp~cg~kFYDL---n--------------k~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 10 RICPNTGSKFYDL---N--------------RRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred ccCCCcCcccccc---C--------------CCCccCCCcCCccCcchhhc
Confidence 5788888888642 2 35788888888876553333
No 128
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=23.99 E-value=39 Score=38.88 Aligned_cols=47 Identities=26% Similarity=0.551 Sum_probs=39.1
Q ss_pred cccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhh
Q 000554 881 GYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQ 936 (1428)
Q Consensus 881 pykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~r 936 (1428)
-|.|..||.... +..+.+|+-.-++. -|.|-.|++.|.. .....|..
T Consensus 3 ~FtCnvCgEsvK-Kp~vekH~srCrn~-------~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 3 FFTCNVCGESVK-KPQVEKHMSRCRNA-------YFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EEehhhhhhhcc-ccchHHHHHhccCC-------eeEEeeccccccc-chhhhhhh
Confidence 488999999876 45577799888885 5899999999988 78888876
No 129
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HRP1, HRP2, HRP3, HRP4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=23.41 E-value=1e+02 Score=29.86 Aligned_cols=52 Identities=23% Similarity=0.233 Sum_probs=35.6
Q ss_pred EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCC
Q 000554 157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEF 218 (1428)
Q Consensus 157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (1428)
-+|.|=+|- |=-|+=|...+. +-..++|.|.||. |..|.||..-.+.++.++
T Consensus 8 lVwaK~kGyp~WPa~I~~~~~~---------~~~~~~~~V~FfG-t~~~a~v~~~~l~pf~~~ 60 (83)
T cd05834 8 LVFAKVKGYPAWPARVDEPEDW---------KPPGKKYPVYFFG-THETAFLKPEDLFPYTEN 60 (83)
T ss_pred EEEEecCCCCCCCEEEeccccc---------CCCCCEEEEEEeC-CCCEeEECHHHceecccc
Confidence 368887773 333444444332 2235789999999 789999998888888775
No 130
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=23.12 E-value=54 Score=29.90 Aligned_cols=32 Identities=22% Similarity=0.285 Sum_probs=18.0
Q ss_pred cceecCccCcc-cCChhhHHHHHHhhccCCCCCCCCCcccCCCCc
Q 000554 983 RKFICRFCGLK-FDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFY 1026 (1428)
Q Consensus 983 KpykC~~CGKs-Fs~~s~L~rHHqrvHtge~~~~eKpykC~~CgK 1026 (1428)
-.|.|+.||+. -.+-..-++ + ..+|.|+.||.
T Consensus 24 ~~F~CPnCG~~~I~RC~~CRk-----~-------~~~Y~CP~CGF 56 (59)
T PRK14890 24 VKFLCPNCGEVIIYRCEKCRK-----Q-------SNPYTCPKCGF 56 (59)
T ss_pred CEeeCCCCCCeeEeechhHHh-----c-------CCceECCCCCC
Confidence 34777777776 333222222 2 34788888874
No 131
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 These archaeal zinc-ribbon-containing proteins have no known function.
Probab=22.83 E-value=45 Score=35.03 Aligned_cols=15 Identities=27% Similarity=0.474 Sum_probs=12.0
Q ss_pred ceecCccCcccCChh
Q 000554 984 KFICRFCGLKFDLLP 998 (1428)
Q Consensus 984 pykC~~CGKsFs~~s 998 (1428)
|++|..||+.|...+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 578888888888765
No 132
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.81 E-value=86 Score=32.22 Aligned_cols=50 Identities=20% Similarity=0.276 Sum_probs=34.9
Q ss_pred ccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcc
Q 000554 848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERH 905 (1428)
Q Consensus 848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hh 905 (1428)
..|--|.+.|........- . -... ..|.|+.|...|-..-+...|...|.
T Consensus 56 ~~C~~C~~~f~~~~~~~~~-~--~~~~-----~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 56 RFCFGCQGPFPKPPVSPFD-E--LKDS-----HRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred CcccCcCCCCCCccccccc-c--cccc-----cceeCCCCCCccccccchhhhhhccC
Confidence 3599999999865422211 0 0112 56999999999999999999976554
No 133
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=22.74 E-value=44 Score=26.86 Aligned_cols=11 Identities=36% Similarity=1.108 Sum_probs=6.9
Q ss_pred eecCccCcccC
Q 000554 985 FICRFCGLKFD 995 (1428)
Q Consensus 985 ykC~~CGKsFs 995 (1428)
|.|..||..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 56777777665
No 134
>PF08879 WRC: WRC; InterPro: IPR014977 WRC is named after the conserved Trp-Arg-Cys motif; it contains two distinctive features: a putative nuclear localisation signal and a zinc-finger motif (C3H). It is suggested that WRC functions in DNA binding.; GO: 0005515 protein binding
Probab=22.57 E-value=30 Score=30.02 Aligned_cols=20 Identities=50% Similarity=0.865 Sum_probs=18.1
Q ss_pred ccCcccccccCCCcccccCC
Q 000554 589 LGTRCKHRALYGSSFCKKHR 608 (1428)
Q Consensus 589 ~g~~ckh~~~~~~~~c~~~~ 608 (1428)
-|=||+..+++|.++|.+|.
T Consensus 13 K~WrC~~~a~~g~~~Ce~H~ 32 (46)
T PF08879_consen 13 KGWRCSRRALPGYSLCEHHL 32 (46)
T ss_pred CccccCCccCCCccHHHHHH
Confidence 45699999999999999997
No 135
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 and 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=22.22 E-value=1e+02 Score=31.01 Aligned_cols=24 Identities=21% Similarity=0.538 Sum_probs=20.2
Q ss_pred eec----CccCcccCChhhHHHHHHhhc
Q 000554 985 FIC----RFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus 985 ykC----~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
|.| ..|+..+.+...+.+|....|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 899 999999999999999544444
No 136
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=21.49 E-value=61 Score=26.61 Aligned_cols=32 Identities=19% Similarity=0.229 Sum_probs=16.6
Q ss_pred ecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554 986 ICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus 986 kC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
.|+.|+..|.-..+-.. .+ .+..+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~~l~------~~-----~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDKLP------AG-----GRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHHcc------cC-----CcEEECCCCCcEe
Confidence 56666666665443211 11 3456666666655
No 137
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=21.33 E-value=46 Score=35.93 Aligned_cols=16 Identities=25% Similarity=0.476 Sum_probs=9.4
Q ss_pred ccccccccccCChhhh
Q 000554 882 YACAICLDSFTNKKVL 897 (1428)
Q Consensus 882 ykC~~CgKsF~~ks~L 897 (1428)
++|+.||++|.+...+
T Consensus 29 ~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLACGKRFTTFERV 44 (154)
T ss_pred eeccccCCcceEeEec
Confidence 5666666666654443
No 138
>KOG2593 consensus Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=20.96 E-value=63 Score=39.93 Aligned_cols=42 Identities=12% Similarity=0.105 Sum_probs=29.1
Q ss_pred hhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc
Q 000554 977 ENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA 1027 (1428)
Q Consensus 977 rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs 1027 (1428)
+.-+...-|.|+.|.++|+....|+- +-.. ...|.|..|+--
T Consensus 121 ~d~t~~~~Y~Cp~C~kkyt~Lea~~L----~~~~-----~~~F~C~~C~ge 162 (436)
T KOG2593|consen 121 RDDTNVAGYVCPNCQKKYTSLEALQL----LDNE-----TGEFHCENCGGE 162 (436)
T ss_pred hhccccccccCCccccchhhhHHHHh----hccc-----CceEEEecCCCc
Confidence 33445567999999999988777655 2221 347999999743
No 139
>PF14353 CpXC: CpXC protein
Probab=20.12 E-value=38 Score=34.92 Aligned_cols=15 Identities=20% Similarity=0.419 Sum_probs=9.3
Q ss_pred ccCCCCCcccccccc
Q 000554 848 HKCKICSQVFLHDQE 862 (1428)
Q Consensus 848 ykC~~CgK~F~s~s~ 862 (1428)
..|+.|+..|.....
T Consensus 2 itCP~C~~~~~~~v~ 16 (128)
T PF14353_consen 2 ITCPHCGHEFEFEVW 16 (128)
T ss_pred cCCCCCCCeeEEEEE
Confidence 357777777765443
No 140
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=20.01 E-value=46 Score=28.35 Aligned_cols=13 Identities=31% Similarity=0.917 Sum_probs=7.9
Q ss_pred ceecCccCcccCC
Q 000554 984 KFICRFCGLKFDL 996 (1428)
Q Consensus 984 pykC~~CGKsFs~ 996 (1428)
.|+|+.||..|..
T Consensus 3 ~y~C~~CG~~~~~ 15 (46)
T PRK00398 3 EYKCARCGREVEL 15 (46)
T ss_pred EEECCCCCCEEEE
Confidence 4666666666543
Done!