Query 002995
Match_columns 860
No_of_seqs 699 out of 2771
Neff 7.7
Searched_HMMs 46136
Date Thu Mar 28 14:56:32 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/002995.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/002995hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1080 Histone H3 (Lys4) meth 100.0 2.2E-64 4.9E-69 599.3 25.9 506 301-836 480-1005(1005)
2 KOG0954 PHD finger protein [Ge 100.0 9.7E-40 2.1E-44 362.9 8.1 313 221-580 103-440 (893)
3 KOG4442 Clathrin coat binding 100.0 2.5E-37 5.3E-42 345.5 14.3 160 681-841 105-265 (729)
4 KOG0956 PHD finger protein AF1 100.0 3.7E-37 8E-42 339.3 5.5 167 398-578 7-182 (900)
5 COG5141 PHD zinc finger-contai 100.0 3.8E-37 8.2E-42 327.9 4.3 192 381-583 177-369 (669)
6 KOG0955 PHD finger protein BR1 100.0 5.1E-35 1.1E-39 348.5 9.0 178 395-579 218-396 (1051)
7 KOG1079 Transcriptional repres 100.0 1.8E-32 3.8E-37 304.4 9.9 179 637-819 523-714 (739)
8 KOG0957 PHD finger protein [Ge 100.0 5.9E-30 1.3E-34 273.5 3.8 171 395-580 118-302 (707)
9 PF13832 zf-HC5HC2H_2: PHD-zin 99.9 9.4E-25 2E-29 202.4 7.3 109 454-576 2-110 (110)
10 cd05834 HDGF_related The PWWP 99.9 9.2E-24 2E-28 183.3 8.1 81 81-170 2-82 (83)
11 cd05836 N_Pac_NP60 The PWWP do 99.9 8.5E-24 1.8E-28 185.0 7.4 86 82-172 1-86 (86)
12 KOG1082 Histone H3 (Lys9) meth 99.9 1.5E-23 3.2E-28 233.9 8.5 159 681-839 161-357 (364)
13 cd05838 WHSC1_related The PWWP 99.9 9.8E-23 2.1E-27 182.0 7.8 92 82-174 1-94 (95)
14 cd05840 SPBC215_ISWI_like The 99.9 6.5E-23 1.4E-27 181.8 6.5 88 82-170 1-92 (93)
15 smart00317 SET SET (Su(var)3-9 99.9 9.7E-22 2.1E-26 183.2 13.7 115 698-813 2-116 (116)
16 cd05835 Dnmt3b_related The PWW 99.9 2.5E-22 5.5E-27 176.6 6.9 84 82-171 1-86 (87)
17 cd05162 PWWP The PWWP domain, 99.9 5.5E-22 1.2E-26 175.7 8.1 86 82-170 1-86 (87)
18 cd05841 BS69_related The PWWP 99.9 9.7E-22 2.1E-26 168.2 7.0 76 81-170 6-82 (83)
19 KOG1083 Putative transcription 99.8 3.7E-22 8E-27 230.2 2.9 133 685-819 1166-1299(1306)
20 PF00855 PWWP: PWWP domain; I 99.8 2.7E-21 5.8E-26 171.2 4.1 84 82-171 1-86 (86)
21 smart00293 PWWP domain with co 99.8 2.2E-20 4.8E-25 154.2 6.4 63 82-145 1-63 (63)
22 KOG1085 Predicted methyltransf 99.8 1.3E-19 2.8E-24 183.1 10.0 124 693-817 253-380 (392)
23 cd05837 MSH6_like The PWWP dom 99.8 3.1E-19 6.8E-24 164.1 8.0 90 81-170 2-101 (110)
24 cd06080 MUM1_like Mutated mela 99.7 1.7E-17 3.8E-22 141.3 5.9 75 82-168 1-76 (80)
25 cd05839 BR140_related The PWWP 99.7 1.9E-17 4.1E-22 149.6 6.3 66 82-147 1-81 (111)
26 PF05964 FYRN: F/Y-rich N-term 99.7 5.4E-17 1.2E-21 128.9 5.3 54 230-283 1-54 (54)
27 PF13771 zf-HC5HC2H: PHD-like 99.6 3E-16 6.5E-21 140.0 4.9 88 475-577 1-90 (90)
28 COG2940 Proteins containing SE 99.6 1.3E-16 2.8E-21 185.1 1.2 141 694-836 330-480 (480)
29 PF05965 FYRC: F/Y rich C-term 99.6 5.8E-16 1.3E-20 136.6 4.5 79 290-373 3-83 (86)
30 smart00542 FYRC "FY-rich" doma 99.6 8.6E-16 1.9E-20 134.7 5.5 79 292-375 1-81 (86)
31 KOG1141 Predicted histone meth 99.6 1.4E-15 3E-20 171.9 5.6 76 760-835 1179-1261(1262)
32 KOG1904 Transcription coactiva 99.5 6.5E-14 1.4E-18 156.8 6.8 88 79-173 10-99 (496)
33 PF00856 SET: SET domain; Int 99.4 2.4E-13 5.3E-18 133.3 7.4 108 707-814 1-162 (162)
34 smart00541 FYRN "FY-rich" doma 99.3 2E-12 4.3E-17 97.5 4.5 41 241-281 2-42 (44)
35 PF13831 PHD_2: PHD-finger; PD 99.1 2.5E-11 5.4E-16 87.6 0.6 36 410-445 1-36 (36)
36 KOG1081 Transcription factor N 98.8 1.1E-09 2.4E-14 125.0 2.1 127 711-844 319-445 (463)
37 KOG2589 Histone tail methylase 98.8 5.8E-09 1.3E-13 110.2 5.4 115 705-827 136-251 (453)
38 PF00628 PHD: PHD-finger; Int 97.6 1.6E-05 3.6E-10 62.7 0.4 45 398-445 1-49 (51)
39 KOG4323 Polycomb-like PHD Zn-f 97.6 6.3E-05 1.4E-09 84.3 4.4 117 396-541 83-203 (464)
40 smart00249 PHD PHD zinc finger 97.5 9E-05 1.9E-09 56.8 3.2 45 398-444 1-47 (47)
41 PF09465 LBR_tudor: Lamin-B re 97.5 0.0001 2.3E-09 57.6 3.3 32 2-33 20-51 (55)
42 KOG2461 Transcription factor B 97.4 0.00017 3.6E-09 81.0 4.9 110 694-817 26-146 (396)
43 KOG1244 Predicted transcriptio 97.1 0.0002 4.3E-09 73.5 1.6 49 395-446 280-330 (336)
44 KOG4323 Polycomb-like PHD Zn-f 96.9 0.00045 9.8E-09 77.6 2.8 53 396-449 168-226 (464)
45 KOG1512 PHD Zn-finger protein 96.8 0.00041 9E-09 71.5 1.0 49 395-446 313-363 (381)
46 KOG1081 Transcription factor N 96.8 0.00062 1.4E-08 78.3 2.3 55 80-144 134-188 (463)
47 KOG1084 Transcription factor T 96.5 0.00087 1.9E-08 75.0 0.9 85 452-543 221-306 (375)
48 smart00508 PostSET Cysteine-ri 96.0 0.0036 7.7E-08 41.5 1.5 19 821-839 2-20 (26)
49 PF15057 DUF4537: Domain of un 95.6 0.034 7.4E-07 52.5 7.1 100 4-147 14-115 (124)
50 PF15446 zf-PHD-like: PHD/FYVE 95.4 0.015 3.2E-07 56.4 3.7 70 398-467 1-86 (175)
51 KOG0825 PHD Zn-finger protein 95.1 0.0095 2.1E-07 69.5 1.7 49 395-446 214-265 (1134)
52 KOG4299 PHD Zn-finger protein 93.8 0.021 4.5E-07 66.0 0.6 48 397-447 254-305 (613)
53 PF08169 RBB1NT: RBB1NT (NUC16 92.3 0.24 5.2E-06 44.0 4.9 85 84-178 8-94 (96)
54 KOG1141 Predicted histone meth 92.0 0.15 3.2E-06 60.3 4.0 59 681-739 784-842 (1262)
55 KOG1973 Chromatin remodeling p 91.7 0.079 1.7E-06 57.2 1.4 36 411-447 230-268 (274)
56 PF14446 Prok-RING_1: Prokaryo 91.5 0.12 2.6E-06 40.8 1.9 33 395-428 4-36 (54)
57 COG5034 TNG2 Chromatin remodel 91.1 0.13 2.9E-06 53.1 2.3 54 388-446 213-269 (271)
58 PF11717 Tudor-knot: RNA bindi 90.9 0.28 6.1E-06 39.2 3.5 52 82-142 1-54 (55)
59 smart00743 Agenet Tudor-like d 90.5 0.41 8.8E-06 39.0 4.3 53 81-145 2-57 (61)
60 smart00249 PHD PHD zinc finger 89.9 0.17 3.7E-06 38.2 1.5 30 510-541 1-32 (47)
61 smart00333 TUDOR Tudor domain. 89.3 0.29 6.3E-06 39.1 2.5 33 2-35 16-49 (57)
62 smart00333 TUDOR Tudor domain. 88.4 0.69 1.5E-05 36.9 4.1 54 81-146 2-55 (57)
63 KOG0383 Predicted helicase [Ge 87.7 0.22 4.7E-06 59.8 1.0 50 391-446 42-93 (696)
64 KOG0957 PHD finger protein [Ge 85.0 0.69 1.5E-05 52.0 3.1 80 357-444 510-595 (707)
65 PF00628 PHD: PHD-finger; Int 84.6 0.51 1.1E-05 36.9 1.4 29 511-541 2-32 (51)
66 KOG1245 Chromatin remodeling c 84.1 0.25 5.3E-06 64.2 -1.0 49 396-447 1108-1158(1404)
67 smart00743 Agenet Tudor-like d 84.1 0.78 1.7E-05 37.3 2.3 23 2-25 17-39 (61)
68 KOG1244 Predicted transcriptio 83.8 0.49 1.1E-05 49.3 1.2 75 397-479 225-311 (336)
69 PF07649 C1_3: C1-like domain; 83.0 0.54 1.2E-05 32.5 0.8 28 398-427 2-29 (30)
70 cd04508 TUDOR Tudor domains ar 81.4 1.3 2.7E-05 34.0 2.4 32 2-34 12-44 (48)
71 KOG1512 PHD Zn-finger protein 81.2 0.55 1.2E-05 49.2 0.4 49 397-445 259-315 (381)
72 PF00130 C1_1: Phorbol esters/ 80.2 1.9 4.2E-05 33.9 3.2 35 396-430 11-45 (53)
73 KOG2041 WD40 repeat protein [G 80.0 1 2.2E-05 52.9 2.1 110 343-463 1069-1185(1189)
74 PF03107 C1_2: C1 domain; Int 79.7 1.5 3.3E-05 30.3 2.1 28 398-427 2-29 (30)
75 KOG2084 Predicted histone tail 79.2 2.8 6E-05 48.8 5.5 40 773-816 208-248 (482)
76 KOG1973 Chromatin remodeling p 78.5 0.93 2E-05 49.0 1.2 29 510-538 220-250 (274)
77 cd04508 TUDOR Tudor domains ar 77.9 3 6.4E-05 31.9 3.5 47 85-143 1-48 (48)
78 PF11793 FANCL_C: FANCL C-term 77.7 1.6 3.4E-05 36.9 2.0 50 397-446 3-63 (70)
79 cd00029 C1 Protein kinase C co 75.2 1.4 3.1E-05 34.0 1.1 34 397-430 12-45 (50)
80 KOG1337 N-methyltransferase [G 73.6 2.3 5E-05 49.9 2.7 40 773-815 239-278 (472)
81 PF07039 DUF1325: SGF29 tudor- 72.8 16 0.00035 34.8 7.8 107 2-143 17-127 (130)
82 PF02318 FYVE_2: FYVE-type zin 70.9 3.5 7.7E-05 38.6 2.8 49 396-446 54-102 (118)
83 KOG4443 Putative transcription 69.8 2.1 4.5E-05 50.3 1.2 81 339-446 34-117 (694)
84 KOG1473 Nucleosome remodeling 68.5 5.4 0.00012 49.6 4.3 110 396-538 344-457 (1414)
85 cd04718 BAH_plant_2 BAH, or Br 68.3 3.2 7E-05 40.1 2.0 26 422-448 1-28 (148)
86 KOG1044 Actin-binding LIM Zn-f 68.0 2.9 6.3E-05 48.4 1.8 102 437-544 112-225 (670)
87 smart00109 C1 Protein kinase C 65.7 2.4 5.2E-05 32.3 0.5 33 396-429 11-43 (49)
88 PF13901 DUF4206: Domain of un 64.4 4.5 9.8E-05 41.7 2.3 42 397-446 153-197 (202)
89 PF14569 zf-UDP: Zinc-binding 60.7 3.9 8.5E-05 34.7 0.9 50 395-446 8-59 (80)
90 PF11764 N-SET: COMPASS (Compl 57.6 2.5 5.5E-05 42.1 -0.9 16 603-618 78-93 (167)
91 KOG4443 Putative transcription 56.3 4.1 8.8E-05 48.0 0.3 48 397-445 19-69 (694)
92 PF05502 Dynactin_p62: Dynacti 55.8 8.5 0.00018 45.1 2.9 53 412-468 4-65 (483)
93 KOG3799 Rab3 effector RIM1 and 55.0 5.5 0.00012 37.3 0.9 52 394-446 63-115 (169)
94 PF11781 RRN7: RNA polymerase 51.9 11 0.00023 27.5 1.8 26 397-422 9-34 (36)
95 COG5475 Uncharacterized small 51.1 35 0.00077 27.1 4.6 52 80-146 3-56 (60)
96 PF08746 zf-RING-like: RING-li 50.7 7.4 0.00016 29.5 0.8 30 511-540 1-30 (43)
97 COG5034 TNG2 Chromatin remodel 49.3 12 0.00026 39.3 2.3 40 499-538 211-252 (271)
98 PF07227 DUF1423: Protein of u 46.7 9.8 0.00021 43.2 1.4 14 470-483 152-165 (446)
99 PHA02862 5L protein; Provision 44.0 7.7 0.00017 37.1 0.0 50 397-462 3-52 (156)
100 PF09926 DUF2158: Uncharacteri 43.5 45 0.00097 26.6 4.2 41 82-132 1-41 (53)
101 PF13639 zf-RING_2: Ring finge 42.9 13 0.00027 28.0 1.1 31 397-429 1-31 (44)
102 PF10080 DUF2318: Predicted me 42.5 13 0.00028 33.9 1.2 34 396-429 35-68 (102)
103 PF05641 Agenet: Agenet domain 41.9 22 0.00048 29.6 2.5 26 3-29 19-44 (68)
104 cd04712 BAH_DCM_I BAH, or Brom 41.3 24 0.00052 33.6 3.0 47 79-130 3-58 (130)
105 KOG0955 PHD finger protein BR1 41.2 19 0.00042 45.6 2.8 64 82-145 943-1025(1051)
106 PF07649 C1_3: C1-like domain; 41.1 14 0.0003 25.5 1.0 27 510-538 2-30 (30)
107 PF10447 EXOSC1: Exosome compo 41.0 17 0.00037 31.7 1.7 14 79-92 66-79 (82)
108 COG5194 APC11 Component of SCF 39.0 9.6 0.00021 32.6 -0.1 31 510-540 22-65 (88)
109 PF08666 SAF: SAF domain; Int 39.0 18 0.00038 29.2 1.5 16 796-811 3-18 (63)
110 PLN00104 MYST -like histone ac 38.3 45 0.00097 38.4 4.9 61 78-143 50-113 (450)
111 PF14446 Prok-RING_1: Prokaryo 37.8 30 0.00065 27.6 2.4 37 508-546 5-44 (54)
112 PLN02436 cellulose synthase A 36.8 21 0.00046 45.0 2.3 49 396-446 36-86 (1094)
113 PF13771 zf-HC5HC2H: PHD-like 35.2 34 0.00073 30.0 2.8 34 452-485 36-72 (90)
114 PF12906 RINGv: RING-variant d 34.9 13 0.00029 28.6 0.1 30 399-429 1-33 (47)
115 PLN02400 cellulose synthase 34.7 27 0.00057 44.3 2.6 50 395-446 35-86 (1085)
116 PLN02638 cellulose synthase A 34.6 23 0.0005 44.8 2.1 50 395-446 16-67 (1079)
117 KOG0954 PHD finger protein [Ge 34.4 20 0.00043 43.0 1.4 30 510-541 325-354 (893)
118 PF08274 PhnA_Zn_Ribbon: PhnA 34.3 20 0.00044 25.0 0.9 25 397-421 3-27 (30)
119 PF12773 DZR: Double zinc ribb 33.5 32 0.00068 26.6 2.0 38 413-458 12-49 (50)
120 PF13832 zf-HC5HC2H_2: PHD-zin 33.2 24 0.00053 32.3 1.6 29 396-428 55-85 (110)
121 smart00547 ZnF_RBZ Zinc finger 32.6 25 0.00054 23.1 1.1 23 438-460 2-24 (26)
122 cd04370 BAH BAH, or Bromo Adja 31.2 53 0.0011 30.1 3.6 41 80-128 2-46 (123)
123 PLN02915 cellulose synthase A 29.9 32 0.00069 43.5 2.2 49 396-446 15-65 (1044)
124 KOG0383 Predicted helicase [Ge 29.1 29 0.00062 42.3 1.6 31 509-542 48-78 (696)
125 KOG3409 Exosomal 3'-5' exoribo 28.8 74 0.0016 31.6 4.0 16 78-93 119-134 (193)
126 PF07227 DUF1423: Protein of u 28.6 45 0.00097 38.1 2.9 31 398-428 130-161 (446)
127 PF12861 zf-Apc11: Anaphase-pr 27.8 25 0.00054 30.8 0.6 34 395-429 20-62 (85)
128 PLN02195 cellulose synthase A 27.7 36 0.00079 42.7 2.2 48 397-446 7-56 (977)
129 KOG3038 Histone acetyltransfer 27.7 2.9E+02 0.0063 29.3 8.3 43 80-132 197-240 (264)
130 PF04810 zf-Sec23_Sec24: Sec23 27.4 33 0.00071 25.5 1.1 31 413-446 2-32 (40)
131 smart00744 RINGv The RING-vari 26.9 24 0.00052 27.5 0.3 31 398-429 1-34 (49)
132 PF10497 zf-4CXXC_R1: Zinc-fin 26.8 47 0.001 30.4 2.2 61 396-464 7-82 (105)
133 PF11302 DUF3104: Protein of u 26.7 1.6E+02 0.0036 25.2 5.2 59 81-144 5-70 (75)
134 cd04721 BAH_plant_1 BAH, or Br 26.5 65 0.0014 30.7 3.2 53 78-138 4-58 (130)
135 KOG0696 Serine/threonine prote 26.3 19 0.0004 40.6 -0.5 33 397-429 57-89 (683)
136 PF07282 OrfB_Zn_ribbon: Putat 26.1 60 0.0013 26.9 2.6 35 395-429 27-62 (69)
137 PLN02189 cellulose synthase 26.0 41 0.00088 42.6 2.2 49 396-446 34-84 (1040)
138 PF00641 zf-RanBP: Zn-finger i 25.6 22 0.00048 24.4 -0.1 22 438-459 4-25 (30)
139 KOG2807 RNA polymerase II tran 25.2 48 0.001 36.1 2.2 84 434-539 272-361 (378)
140 PF09038 53-BP1_Tudor: Tumour 25.2 62 0.0013 30.3 2.7 28 5-33 20-47 (122)
141 cd05791 S1_CSL4 S1_CSL4: CSL4, 25.1 46 0.001 29.6 1.9 17 80-96 59-75 (92)
142 PF11793 FANCL_C: FANCL C-term 24.6 34 0.00074 28.8 0.8 33 509-541 3-40 (70)
143 PF08792 A2L_zn_ribbon: A2L zi 23.7 69 0.0015 22.8 2.1 27 396-422 3-30 (33)
144 KOG2752 Uncharacterized conser 23.0 89 0.0019 34.0 3.7 112 398-543 57-169 (345)
145 KOG0782 Predicted diacylglycer 22.8 23 0.00051 40.9 -0.6 20 413-432 199-218 (1004)
146 KOG0695 Serine/threonine prote 22.7 32 0.0007 37.6 0.4 35 397-431 142-176 (593)
147 KOG4299 PHD Zn-finger protein 21.8 33 0.00071 40.6 0.3 46 396-446 47-94 (613)
148 KOG1844 PHD Zn-finger proteins 21.7 56 0.0012 38.6 2.2 49 397-447 87-135 (508)
149 PF10367 Vps39_2: Vacuolar sor 21.4 76 0.0017 28.5 2.6 31 395-428 77-107 (109)
150 PF10367 Vps39_2: Vacuolar sor 21.3 39 0.00085 30.4 0.6 30 509-539 79-108 (109)
151 PF09465 LBR_tudor: Lamin-B re 21.0 1.7E+02 0.0037 23.5 4.0 50 80-141 4-54 (55)
152 PF12148 DUF3590: Protein of u 20.5 1E+02 0.0022 27.1 2.9 31 1-31 9-45 (85)
153 smart00439 BAH Bromo adjacent 20.2 1.2E+02 0.0027 27.6 3.8 39 82-128 2-43 (120)
No 1
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=100.00 E-value=2.2e-64 Score=599.30 Aligned_cols=506 Identities=35% Similarity=0.617 Sum_probs=377.4
Q ss_pred CceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCCCchhhhhhcc-cc
Q 002995 301 EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTS-KY 379 (860)
Q Consensus 301 ~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c~k~~~~k~~s-~~ 379 (860)
..+-..+++.+|....+...+.+......... ...+..++..+|++..+.....+.........+....++-.. +.
T Consensus 480 ~~~~~~~~~~~~k~~~~e~~k~~~~~~~~~k~---~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~p~s~~~~~~~s~~~~ 556 (1005)
T KOG1080|consen 480 RVFHMLRYSVKKKKFLSEWERHTGATAKIWKD---SSRVKDELLPLPKWVESRGRSIMNTYNSEKPKSIVLMGKTSVQRM 556 (1005)
T ss_pred eeecccCcchhhhhcccchhhhhccccccccc---ccccccccccchhhhhhccccccccccccCCcchhhhccchhhhh
Confidence 34778889999999988877666555433211 112334666777777776655566655555555544443322 23
Q ss_pred cCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccC
Q 002995 380 RDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCP 459 (860)
Q Consensus 380 ~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~ 459 (860)
+..+..+......|.+...|.+|.+.+.+..|.++.|++|++.+|+.|||...++.+..|+|+.|... ...+.|+||+
T Consensus 557 ~~~~~~~~~~~l~~~~t~~c~~~~~~~~~~~n~~~~~~~~~~~~~s~~~g~~~~~~~~~~~~~~~~~~--~~~r~~~l~~ 634 (1005)
T KOG1080|consen 557 LLELIEKREPRLSKWTTERCAVCRDDEDWEKNVSIICDRCTRSVHSECYGNLKSYDGTSWVCDSCETL--DIKRSCCLCP 634 (1005)
T ss_pred cCcccccchhhhcCCCcccccccccccccccceeeeeccccccCCCcccccCCCCCCCcchhhccccc--cCCchhhhcc
Confidence 44455566666666677899999999999999999999999999999999999898999999999864 4467899999
Q ss_pred CCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR 539 (860)
Q Consensus 460 ~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~ 539 (860)
.+||||+++..|+|+|+.||.|.|++.+.++..|+|..++..++...+...|.+ .|.|.||. .|.+.||.+||+
T Consensus 635 ~~g~al~p~d~gr~~~~e~a~~~~e~~~~~~~~~~p~~~~~~~p~~~~~~~~~~----~~~~~~~~--~~~~~~~~~~a~ 708 (1005)
T KOG1080|consen 635 VKGGALKPTDEGRWVHVECAWFRPEVCLASPERMEPAVGTFKIPALSFLKICFI----HGSCRQCC--KCETGSHAMCAS 708 (1005)
T ss_pred ccCcccCCCCccchhhhhchhccccccCCCccCCCCcccccccCccchhhhccc----cccccccc--hhhhcceehhhc
Confidence 999999999999999999999999999999999999999999999988888877 78899998 899999999999
Q ss_pred hcCeeEEEeccccc----------------cccccCCCccccccceeeccccCCCcccchhhhhhccccccccccCCCCC
Q 002995 540 AAGLCVELEDEDRL----------------NLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPP 603 (860)
Q Consensus 540 ~~g~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~C~~H~~~~~~~~~~~~~~~~~~~~~~~~~p~ 603 (860)
.+|+.++...-... .++.++........+..+++.+|+...... .+.. .++.+
T Consensus 709 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~~k~~~~~g~~~-~~~~---------~~~~~- 777 (1005)
T KOG1080|consen 709 RAGYIMEAVSLEEVSQQTTSYVKEDGPGPDSVLKVNTPSGKFGAENLSQNKKSRTDGVRL-VLTY---------KEYAP- 777 (1005)
T ss_pred CccChhhhhhhhhhhhhhhhhhhhccCCcccceeecCccccccccchhhhhhcccccccc-cccc---------ccccc-
Confidence 99987644311111 111121111122222333344433321111 0000 11111
Q ss_pred CCCCCcccCCcccccc--cCCCCChhHHHHhhhccccccCCCcccCCcccCCCCCCCcCcccccccccccccccCCCCcC
Q 002995 604 SNPSGCARSEPYNYFG--RRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL 681 (860)
Q Consensus 604 ~npsGcaRsE~~~~~~--Rr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 681 (860)
.+.-|+|.+|++... ...+..+..++.+..+++..+..++..++..++.+..... ....... +.-..+...
T Consensus 778 -~~~~~~~~~~~~~~~~~s~~~~~~~~r~~~~~~~~~~~s~~~~~s~~~s~~~s~~~r----l~q~rl~--a~~~~~~~~ 850 (1005)
T KOG1080|consen 778 -LREIKSRAEPLNRIDFSSEARCRSESRSDNSKSPLAEESESDITSGGSSHDLSAEER----LNQFRLS--ASFTASFIL 850 (1005)
T ss_pred -cccchhcccCCcccCccccccccchhhcccccccccccccccccccccccchhHHhh----hHHHHhh--hhccccccc
Confidence 123366667765331 1122333445566667777777788777666665432111 1011111 011223446
Q ss_pred ChhhhhHHhhhcccee-EEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcc
Q 002995 682 SMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER 760 (860)
Q Consensus 682 ~~~~~~~~l~~~~~~~-l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~ 760 (860)
..+...+.+|..++++ |.|++|.||||||||.++|.+||||+||+||+|++..||.|+.+ |......++|+|++|++.
T Consensus 851 ~~~~~~~~~~~~~rkk~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~-Y~~~gi~~sYlfrid~~~ 929 (1005)
T KOG1080|consen 851 DEAEVLRYNQLKFRKKYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREAR-YERMGIGDSYLFRIDDEV 929 (1005)
T ss_pred chHHHHHHHHHhhhhhhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHH-HhccCcccceeeecccce
Confidence 6777888888887755 99999999999999999999999999999999999999999975 877766889999999999
Q ss_pred eeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCCCCeEEeeCCCCCccccC
Q 002995 761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836 (860)
Q Consensus 761 ~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~~~~~C~Cg~~~Cr~~~~ 836 (860)
|||||++||+||||||||+|||+++++.|+|+.+|+|||.|+|.+||||||||.|+.++.+++|+||+++|||+||
T Consensus 930 ViDAtk~gniAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e~~kipClCgap~Crg~~n 1005 (1005)
T KOG1080|consen 930 VVDATKKGNIARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTEDDKIPCLCGAPNCRGFLN 1005 (1005)
T ss_pred EEeccccCchhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeeccccccccccccccCCCccccccC
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999997
No 2
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=100.00 E-value=9.7e-40 Score=362.92 Aligned_cols=313 Identities=30% Similarity=0.618 Sum_probs=226.5
Q ss_pred CcccccceeeCCEEEEeeeeeeeCCCCCCCCCcccCCCeEE-----EEEeccCCC-CCCcceEEEEEEecCCCCccceEE
Q 002995 221 GSIGISPYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTA-----VRKFTSLAD-PRVCNSYKMEVLRDTESKIRPLFR 294 (860)
Q Consensus 221 ~~~~~~P~~~g~l~i~slG~iv~~~~~fh~~~~i~P~Gy~s-----~r~~~S~~d-p~~~~~y~~~I~~~~~~~~~P~F~ 294 (860)
.+....|-++..-.+++.+++++.-+.-.+++++||.-++- .+...|..- |.-.+.|..+=+.-...-.+|.-.
T Consensus 103 ~~adkkpkel~Rqdli~amkl~dse~~~p~~y~~~~D~wk~ewekgvQvpaspd~lpqp~v~~dse~v~~~~~fs~pkky 182 (893)
T KOG0954|consen 103 RGADKKPKELYRQDLISAMKLPDSEHLNPDEYLDFADPWKQEWEKGVQVPASPDTLPQPSVRVDSEDVQPETDFSRPKKY 182 (893)
T ss_pred cccccchHHHHHhhhhhhccCccccccCccceeecCCccchhhhccccccCCCCcCCCcceeccchhcchhhhhcCCcce
Confidence 45567788887777778777765433333666776554441 222222211 233344444433222222456655
Q ss_pred EEcCCCC------------------ceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHH
Q 002995 295 VTLDNGE------------------QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMK 356 (860)
Q Consensus 295 Vt~~~~~------------------~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~ 356 (860)
|...++. -|+-+-.+.+|...++. .+..+| +|-+..-++.+
T Consensus 183 ivc~~~~~~e~~yn~~~~~lae~tcrydid~~d~awL~~~n~-------------------e~~~~G--~~~l~~~~~eR 241 (893)
T KOG0954|consen 183 IVCSDGEVPELGYNLLIKDLAESTCRYDIDDMDPAWLQLVNE-------------------ERAEMG--SLELDEGTFER 241 (893)
T ss_pred EEeCCCCCcccchhhhHHHHhhhhhhcccccccHHHHHHhcc-------------------hHHhhC--CcccchHHHHH
Confidence 5543332 13334455666655541 123355 56666777888
Q ss_pred HHhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCC
Q 002995 357 LILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNG 436 (860)
Q Consensus 357 li~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~ 436 (860)
+|++|..- |-+ +.......++-.+ +..+++..|+||...++++.|+||+||+|++.|||.|||+..+|++
T Consensus 242 iieelE~~--c~k---qi~~~l~~eeglg-----ie~dedviCDvCrspD~e~~neMVfCd~Cn~cVHqaCyGIle~p~g 311 (893)
T KOG0954|consen 242 IIEELERR--CKK---QINHALETEEGLG-----IEYDEDVICDVCRSPDSEEANEMVFCDKCNICVHQACYGILEVPEG 311 (893)
T ss_pred HHHHHHHH--HHH---HHHhhhhhcccce-----eeccccceeceecCCCccccceeEEeccchhHHHHhhhceeecCCC
Confidence 99888521 221 2222222233333 3346889999999999999999999999999999999999998987
Q ss_pred CceeeecccCCCCCCCCCccccCCCCCCceecCCC-ceeccccccccCceEEccCCCccccccchhhhccccCccceecC
Q 002995 437 VLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515 (860)
Q Consensus 437 ~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g-~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~ 515 (860)
+|+|..|..+ ..+.|+|||.+|||||++..| +|+|+.||||+||++|.++..|+||..++.|+..||.+.|.+|+
T Consensus 312 -pWlCr~Calg---~~ppCvLCPkkGGamK~~~sgT~wAHvsCALwIPEVsie~~ekmePItkfs~IpesRwslvC~LCk 387 (893)
T KOG0954|consen 312 -PWLCRTCALG---IEPPCVLCPKKGGAMKPTKSGTKWAHVSCALWIPEVSIECPEKMEPITKFSHIPESRWSLVCNLCK 387 (893)
T ss_pred -Ceeehhcccc---CCCCeeeccccCCcccccCCCCeeeEeeeeeccceeeccCHhhcCcccccCCCcHHHHHHHHHHhc
Confidence 9999999986 468999999999999999866 79999999999999999999999999999999999999999999
Q ss_pred CcceeeEEcCCCCcCccccchhhhhcCeeEEEeccccccccccCCCccccccceeeccccCCCcc
Q 002995 516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPL 580 (860)
Q Consensus 516 ~~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~ 580 (860)
.+.||||||+...|.++||+.||+.+|+.|++.... -+.+.+..||.+|+.-.
T Consensus 388 ~k~GACIqCs~k~C~t~fHv~CA~~aG~~~~~~~~~------------~D~v~~~s~c~khs~~~ 440 (893)
T KOG0954|consen 388 VKSGACIQCSNKTCRTAFHVTCAFEAGLEMKTILKE------------NDEVKFKSYCSKHSDHR 440 (893)
T ss_pred ccCcceEEecccchhhhccchhhhhcCCeeeeeecc------------CCchhheeecccccccc
Confidence 999999999999999999999999999998876432 11267889999998644
No 3
>KOG4442 consensus Clathrin coat binding protein/Huntingtin interacting protein HIP1, involved in regulation of endocytosis [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=2.5e-37 Score=345.47 Aligned_cols=160 Identities=28% Similarity=0.491 Sum_probs=151.0
Q ss_pred CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcc
Q 002995 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER 760 (860)
Q Consensus 681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~ 760 (860)
+-..|+|+..|+.-..+++|+.+..+||||.|..+|++|+||+||.||||...+++.|... |......+.|+|.+..+.
T Consensus 105 cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~-Y~~d~~kh~Yfm~L~~~e 183 (729)
T KOG4442|consen 105 CGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKR-YAKDGIKHYYFMALQGGE 183 (729)
T ss_pred ccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHH-HHhcCCceEEEEEecCCc
Confidence 3467899999998889999999999999999999999999999999999999999999986 777777889999999999
Q ss_pred eeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC-CCeEEeeCCCCCccccCCCh
Q 002995 761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID-EQLACYCGFPRCRGVVNDTE 839 (860)
Q Consensus 761 ~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~-~~~~C~Cg~~~Cr~~~~~~~ 839 (860)
+||||.+||+||||||||+|||++..|.|++..||+|||.|.|.+||||||||+|...+ ..++|+||+++|+|||++..
T Consensus 184 ~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr~AQ~CyCgeanC~G~IGgk~ 263 (729)
T KOG4442|consen 184 YIDATKKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGRDAQPCYCGEANCRGWIGGKP 263 (729)
T ss_pred eecccccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEecccccccccccccccCCcccccccCCCC
Confidence 99999999999999999999999999999999999999999999999999999998876 67999999999999999995
Q ss_pred HH
Q 002995 840 AE 841 (860)
Q Consensus 840 ~~ 841 (860)
.+
T Consensus 264 q~ 265 (729)
T KOG4442|consen 264 QT 265 (729)
T ss_pred cc
Confidence 54
No 4
>KOG0956 consensus PHD finger protein AF10 [General function prediction only]
Probab=100.00 E-value=3.7e-37 Score=339.25 Aligned_cols=167 Identities=37% Similarity=0.839 Sum_probs=150.0
Q ss_pred ccccccCCcccCCCeeEEcc--ccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceec
Q 002995 398 KCSVCHMDEEYQNNLFLQCD--KCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAH 475 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll~C~--~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH 475 (860)
-|+||-+...|.+|+||+|| +|-++|||.||||..+|.| +|||++|+..-......|.|||.++||||+|++|.|+|
T Consensus 7 GCCVCSDErGWaeNPLVYCDG~nCsVAVHQaCYGIvqVPtG-pWfCrKCesqeraarvrCeLCP~kdGALKkTDn~GWAH 85 (900)
T KOG0956|consen 7 GCCVCSDERGWAENPLVYCDGHNCSVAVHQACYGIVQVPTG-PWFCRKCESQERAARVRCELCPHKDGALKKTDNGGWAH 85 (900)
T ss_pred ceeeecCcCCCccCceeeecCCCceeeeehhcceeEecCCC-chhhhhhhhhhhhccceeecccCcccceecccCCCceE
Confidence 69999988899999999999 7999999999999999999 99999999765445678999999999999999999999
Q ss_pred cccccccCceEEccCCCccccccchhhhccccCccceecCC-------cceeeEEcCCCCcCccccchhhhhcCeeEEEe
Q 002995 476 LACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-------SYGACIQCSNTTCRVAYHPLCARAAGLCVELE 548 (860)
Q Consensus 476 ~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~-------~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~ 548 (860)
|+|||+||||.|.++..||||. +..++..|+.++|.||.. ..|||++|...+|+++||++||+.+||..|.+
T Consensus 86 VVCALYIPEVrFgNV~TMEPIi-Lq~VP~dRfnKtCYIC~E~GrpnkA~~GACMtCNKs~CkqaFHVTCAQ~~GLLCEE~ 164 (900)
T KOG0956|consen 86 VVCALYIPEVRFGNVHTMEPII-LQDVPHDRFNKTCYICNEEGRPNKAAKGACMTCNKSGCKQAFHVTCAQRAGLLCEEE 164 (900)
T ss_pred EEEEeeccceeeccccccccee-eccCchhhhcceeeeecccCCccccccccceecccccchhhhhhhHhhhhccceecc
Confidence 9999999999999999999998 999999999999999975 48999999999999999999999999988654
Q ss_pred ccccccccccCCCccccccceeeccccCCC
Q 002995 549 DEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578 (860)
Q Consensus 549 ~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~ 578 (860)
.. ...++++--||+.|-.
T Consensus 165 gn------------~~dNVKYCGYCk~Hfs 182 (900)
T KOG0956|consen 165 GN------------ISDNVKYCGYCKYHFS 182 (900)
T ss_pred cc------------ccccceechhHHHHHH
Confidence 21 1223566668888864
No 5
>COG5141 PHD zinc finger-containing protein [General function prediction only]
Probab=100.00 E-value=3.8e-37 Score=327.89 Aligned_cols=192 Identities=36% Similarity=0.714 Sum_probs=169.4
Q ss_pred CCCCC-CCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccC
Q 002995 381 DLPGG-YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCP 459 (860)
Q Consensus 381 ~~p~~-~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~ 459 (860)
.+|+. ++|++++..-+..|.+|.+.+.++.|.||+||+|.+.|||+|||+...|+| .|+|++|..+.. ....|.+||
T Consensus 177 ~lp~k~vepi~~~d~~d~~C~~c~~t~~eN~naiVfCdgC~i~VHq~CYGI~f~peG-~WlCrkCi~~~~-~i~~C~fCp 254 (669)
T COG5141 177 GLPDKHVEPIEPSDEFDDICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPEG-FWLCRKCIYGEY-QIRCCSFCP 254 (669)
T ss_pred cCccccccccCCchhhhhhhHhccccccCCcceEEEecCcchhhhhhcccceecCcc-hhhhhhhccccc-ceeEEEecc
Confidence 46644 888888877778999999999988999999999999999999999999988 999999998643 345699999
Q ss_pred CCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR 539 (860)
Q Consensus 460 ~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~ 539 (860)
.+.||+|+|.+|+|+|+.||+|+|+.+|.+...++||+||..++..||++.|.||+...|+||||++.+|.++||++||+
T Consensus 255 s~dGaFkqT~dgrW~H~iCA~~~pelsF~~l~~~dpI~~i~sVs~srwkl~C~iCk~~~GtcIqCs~~nC~~aYHVtCAr 334 (669)
T COG5141 255 SSDGAFKQTSDGRWGHVICAMFNPELSFGHLLSKDPIDNIASVSSSRWKLGCLICKEFGGTCIQCSYFNCTRAYHVTCAR 334 (669)
T ss_pred CCCCceeeccCCchHhHhHHHhcchhccccccccchhhhhcccchhhHhheeeEEcccCcceeeecccchhhhhhhhhhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCeeEEEeccccccccccCCCccccccceeeccccCCCcccch
Q 002995 540 AAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDR 583 (860)
Q Consensus 540 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~~~~ 583 (860)
++|+++.- + ..-++-.+++....||.+|.++...+
T Consensus 335 rag~f~~~-------~--~s~n~~s~~id~e~~c~kh~p~gy~~ 369 (669)
T COG5141 335 RAGYFDLN-------I--YSHNGISYCIDHEPLCRKHYPLGYGR 369 (669)
T ss_pred hcchhhhh-------h--hcccccceeecchhhhcCCCCcchhc
Confidence 99997631 0 11123355667788999999876544
No 6
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=100.00 E-value=5.1e-35 Score=348.53 Aligned_cols=178 Identities=42% Similarity=0.902 Sum_probs=158.8
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCcee
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWA 474 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wv 474 (860)
++.+|+||.+.+....|.||+||+|+++|||+|||+..+|+| .|+|.+|..+.. ....|.|||.++||+|+|.+|+|+
T Consensus 218 ~D~~C~iC~~~~~~n~n~ivfCD~Cnl~VHq~Cygi~~ipeg-~WlCr~Cl~s~~-~~v~c~~cp~~~gAFkqt~dgrw~ 295 (1051)
T KOG0955|consen 218 EDAVCCICLDGECQNSNVIVFCDGCNLAVHQECYGIPFIPEG-QWLCRRCLQSPQ-RPVRCLLCPSKGGAFKQTDDGRWA 295 (1051)
T ss_pred CCccceeecccccCCCceEEEcCCCcchhhhhccCCCCCCCC-cEeehhhccCcC-cccceEeccCCCCcceeccCCcee
Confidence 457999999999998999999999999999999998888888 999999987443 447899999999999999999999
Q ss_pred ccccccccCceEEccCCCccccccchhhhccccCccceecCCcc-eeeEEcCCCCcCccccchhhhhcCeeEEEeccccc
Q 002995 475 HLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY-GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRL 553 (860)
Q Consensus 475 H~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~-Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~~ 553 (860)
|++||+|+|++.|.+...+++|++|+.|+..||++.|.+|++.. |+||||+..+|.++||++||+++|++|.+....
T Consensus 296 Hv~caiwipev~F~nt~~~E~I~~i~~i~~aRwkL~cy~cK~~~~gaciqcs~~~c~~a~hvtca~~agl~m~~~~~~-- 373 (1051)
T KOG0955|consen 296 HVVCAIWIPEVSFANTVFLEPIDSIENIPPARWKLTCYICKQKGLGACIQCSKANCYTAFHVTCARRAGLYMKSNTVK-- 373 (1051)
T ss_pred eeehhhcccccccccchhhccccchhcCcHhhhhceeeeeccCCCCcceecchhhhhhhhhhhhHhhcCceEeecccc--
Confidence 99999999999999999999999999999999999999999988 999999999999999999999999999844221
Q ss_pred cccccCCCccccccceeeccccCCCc
Q 002995 554 NLLSLDEDDEDQCIRLLSFCKKHKQP 579 (860)
Q Consensus 554 ~~~~~~~~~~~~~~~~~~~C~~H~~~ 579 (860)
....++-.+.+...+||++|.++
T Consensus 374 ---~~s~~~~s~~v~~~syC~~H~pp 396 (1051)
T KOG0955|consen 374 ---ELSKNGTSQSVNKISYCDKHTPP 396 (1051)
T ss_pred ---cccccccccccceeeeccCCCCc
Confidence 11112223456789999999998
No 7
>KOG1079 consensus Transcriptional repressor EZH1 [Transcription]
Probab=99.97 E-value=1.8e-32 Score=304.43 Aligned_cols=179 Identities=31% Similarity=0.546 Sum_probs=154.3
Q ss_pred ccccCCCcccCCcccCCCCCCCcCcccccccccccccccCCC-CcC------------ChhhhhHHhhhccceeEEEEEe
Q 002995 637 LFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAP-NFL------------SMADKYKHMKETFRKRLAFGKS 703 (860)
Q Consensus 637 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------------~~~~~~~~l~~~~~~~l~v~~s 703 (860)
.|.+..- .|...|+++|+||.|...+..+.|.+..+.|+|. +++ ...|+|..|+++.++++.++.|
T Consensus 523 ~~CEk~C-~C~~dC~nrF~GC~Ck~QC~tkqCpC~~A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapS 601 (739)
T KOG1079|consen 523 TFCEKFC-YCSPDCRNRFPGCRCKAQCNTKQCPCYLAVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPS 601 (739)
T ss_pred cchhhcc-cCCHHHHhcCCCCCcccccccCcCchhhhccccCchHHhccCcccccccCccccccchhhhhhhcceeechh
Confidence 3444443 3778899999999997554444444444445443 222 3489999999999999999999
Q ss_pred cccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCee
Q 002995 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783 (860)
Q Consensus 704 ~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~ 783 (860)
.+.|||||+++.+.+++||.||+||+|++.|||+|.+ +|+.. ..+|+|+++.+++|||++.||.+||+|||-+|||+
T Consensus 602 dVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGk-iYDr~--~cSflFnln~dyviDs~rkGnk~rFANHS~nPNCY 678 (739)
T KOG1079|consen 602 DVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGK-IYDRY--MCSFLFNLNNDYVIDSTRKGNKIRFANHSFNPNCY 678 (739)
T ss_pred hccccceeeccccCCCceeeeecceeccchhhhhccc-ccccc--cceeeeeccccceEeeeeecchhhhccCCCCCCcE
Confidence 9999999999999999999999999999999999998 57754 47999999999999999999999999999999999
Q ss_pred EEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC
Q 002995 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID 819 (860)
Q Consensus 784 ~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~ 819 (860)
+++++|.|.+||.|||+|+|.+||||||||++..+.
T Consensus 679 Akvm~V~GdhRIGifAkRaIeagEELffDYrYs~~~ 714 (739)
T KOG1079|consen 679 AKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYSPEH 714 (739)
T ss_pred EEEEEecCCcceeeeehhhcccCceeeeeeccCccc
Confidence 999999999999999999999999999999987554
No 8
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=99.96 E-value=5.9e-30 Score=273.46 Aligned_cols=171 Identities=35% Similarity=0.732 Sum_probs=146.9
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccccccc---cCCC------CceeeecccCCCCCCCCCccccCCCCCCc
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE---PVNG------VLWLCNLCRPGAPEPPPPCCLCPVVGGAM 465 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~---~~~~------~~W~C~~C~~~~~~~~~~C~lC~~~gGal 465 (860)
+...|+||.+..+.+-+.||+||+||+.||..|||+.. .+.+ .+|||+.|..+.. .+.|-|||.++|++
T Consensus 118 k~~iCcVClg~rs~da~ei~qCd~CGi~VHEgCYGv~dn~si~s~~s~~stepWfCeaC~~Gvs--~P~CElCPn~~Gif 195 (707)
T KOG0957|consen 118 KAVICCVCLGQRSVDAGEILQCDKCGINVHEGCYGVLDNVSIPSGSSDCSTEPWFCEACLYGVS--LPHCELCPNRFGIF 195 (707)
T ss_pred cceEEEEeecCccccccceeeccccCceecccccccccccccCCCCccCCCCchhhhhHhcCCC--CCccccCCCcCCcc
Confidence 34599999998888889999999999999999999751 1211 4799999998865 48999999999999
Q ss_pred eecCCCceeccccccccCceEEccCCCccccccchhhhccccC-ccceecCC----cceeeEEcCCCCcCccccchhhhh
Q 002995 466 KPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWK-LLCSICGV----SYGACIQCSNTTCRVAYHPLCARA 540 (860)
Q Consensus 466 k~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~-~~C~~C~~----~~Ga~iqC~~~~C~~~fH~~CA~~ 540 (860)
|.|+-|+|||++|||++|+|.|.....+.+|. +.......|. ..|++|.. +.|.||.|..+.|..+||++||+.
T Consensus 196 KetDigrWvH~iCALYvpGVafg~~~~l~~Vt-l~em~ysk~Gak~Cs~Ced~~fARtGvci~CdaGMCk~YfHVTCAQk 274 (707)
T KOG0957|consen 196 KETDIGRWVHAICALYVPGVAFGQTHTLCGVT-LEEMDYSKFGAKTCSACEDKIFARTGVCIRCDAGMCKEYFHVTCAQK 274 (707)
T ss_pred cccchhhHHHHHHHhhcCcccccccccccccc-HHHhhhhhhccchhccccchhhhhcceeeeccchhhhhhhhhhHHhh
Confidence 99999999999999999999999999999887 6655555554 78999985 589999999999999999999999
Q ss_pred cCeeEEEeccccccccccCCCccccccceeeccccCCCcc
Q 002995 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPL 580 (860)
Q Consensus 541 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~ 580 (860)
+|+.++..+++ +....+.+||++|+...
T Consensus 275 ~GlLvea~~e~------------DiAdpfya~CK~Ht~r~ 302 (707)
T KOG0957|consen 275 LGLLVEATDEN------------DIADPFYAFCKKHTNRD 302 (707)
T ss_pred hcceeeccccc------------cchhhHHHHHHhhcchh
Confidence 99998877654 33457889999999754
No 9
>PF13832 zf-HC5HC2H_2: PHD-zinc-finger like domain
Probab=99.91 E-value=9.4e-25 Score=202.43 Aligned_cols=109 Identities=50% Similarity=1.122 Sum_probs=101.9
Q ss_pred CccccCCCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccc
Q 002995 454 PCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAY 533 (860)
Q Consensus 454 ~C~lC~~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~f 533 (860)
.|+|||.+|||||+|.++.|||++||+|+|++.|.+...+++++ ++.++.+++++.|.+|++..|++|+|..++|.++|
T Consensus 2 ~C~lC~~~~Galk~t~~~~WvHv~Cal~~~~~~~~~~~~~~~v~-~~~i~~~~~~~~C~iC~~~~G~~i~C~~~~C~~~f 80 (110)
T PF13832_consen 2 SCVLCPKRGGALKRTSDGQWVHVLCALWIPEVIFNNGESMEPVD-ISNIPPSRFKLKCSICGKSGGACIKCSHPGCSTAF 80 (110)
T ss_pred ccEeCCCCCCcccCccCCcEEEeEccceeCccEEeechhcCccc-ceeecchhcCCcCcCCCCCCceeEEcCCCCCCcCC
Confidence 69999999999999999999999999999999999999999999 99999999999999999999999999999999999
Q ss_pred cchhhhhcCeeEEEeccccccccccCCCccccccceeeccccC
Q 002995 534 HPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576 (860)
Q Consensus 534 H~~CA~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H 576 (860)
||+||+.+|+.+++...+. ...+.+||++|
T Consensus 81 H~~CA~~~g~~~~~~~~~~-------------~~~~~~~C~~H 110 (110)
T PF13832_consen 81 HPTCARKAGLYFEIENEED-------------NVQFIAYCPKH 110 (110)
T ss_pred CHHHHHHCCCeEEeeecCC-------------CceEEEECCCC
Confidence 9999999999998775421 35789999999
No 10
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.89 E-value=9.2e-24 Score=183.34 Aligned_cols=81 Identities=27% Similarity=0.515 Sum_probs=73.0
Q ss_pred CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHH
Q 002995 81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPR 160 (860)
Q Consensus 81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~ 160 (860)
+|++||||||||+|||||||+|++++.. .+..+.|+|+|||++++|||++++|+||+++ ..++..+++++.
T Consensus 2 ~f~~GdlVwaK~kGyp~WPa~I~~~~~~--------~~~~~~~~V~FfGt~~~a~v~~~~l~pf~~~-~~~~~~~~k~k~ 72 (83)
T cd05834 2 QFKAGDLVFAKVKGYPAWPARVDEPEDW--------KPPGKKYPVYFFGTHETAFLKPEDLFPYTEN-KKKFGKPKKRKG 72 (83)
T ss_pred CCCCCCEEEEecCCCCCCCEEEeccccc--------CCCCCEEEEEEeCCCCEeEECHHHceecccc-hhhhccccchHH
Confidence 6999999999999999999999998653 2345779999999999999999999999997 677888889999
Q ss_pred HHHHHHHHHH
Q 002995 161 FTQSLEEAKV 170 (860)
Q Consensus 161 ~~~A~~ea~~ 170 (860)
|++||+|+++
T Consensus 73 F~~Av~eie~ 82 (83)
T cd05834 73 FNEAVWEIEK 82 (83)
T ss_pred HHHHHHHHhh
Confidence 9999999975
No 11
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha. In a variety of cell lines, NP60 has been shown to localize to the nucleus. In addition to the PWWP domain, NP60 also contains an AT-hook and a C-terminal NAD-binding domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.89 E-value=8.5e-24 Score=185.03 Aligned_cols=86 Identities=24% Similarity=0.458 Sum_probs=73.3
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF 161 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~ 161 (860)
|++||||||||+|||||||+|++++.. ..+.+.+.+.++|+|||+++||||.+++|+||+++ ...+...+|.+.|
T Consensus 1 f~~GDlVwaK~~g~P~WPa~V~~~~~~----~~~~~~~~~~~~V~FFG~~~~~wv~~~~l~pF~~~-~~~~~~~~k~~~F 75 (86)
T cd05836 1 LKLGDLVWAKMKGFPPWPGRIVKPPKD----LKKPRGKAKCFFVFFFGSENHAWIKEENIKPYHEH-KEEMIKLNKGARF 75 (86)
T ss_pred CCCCCEEEEeCCCCCCCCEEEechhhh----cccccCCCCeEEEEEeCCCCEEEECHHhCeechhh-HHHHhcccchHHH
Confidence 789999999999999999999986532 22222335889999999999999999999999997 6677777888999
Q ss_pred HHHHHHHHHHH
Q 002995 162 TQSLEEAKVYL 172 (860)
Q Consensus 162 ~~A~~ea~~~~ 172 (860)
++||+|+++++
T Consensus 76 ~~Av~~ie~~~ 86 (86)
T cd05836 76 QQAVDAIEEYI 86 (86)
T ss_pred HHHHHHHHHhC
Confidence 99999999873
No 12
>KOG1082 consensus Histone H3 (Lys9) methyltransferase SUV39H1/Clr4, required for transcriptional silencing [Chromatin structure and dynamics; Transcription]
Probab=99.89 E-value=1.5e-23 Score=233.91 Aligned_cols=159 Identities=30% Similarity=0.429 Sum_probs=128.9
Q ss_pred CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHh-hhcccCCccee------
Q 002995 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI-YNSLVGAGTYM------ 753 (860)
Q Consensus 681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~-~~~~~~~~~y~------ 753 (860)
+...|.|+.+|.+.+.+|+|.+++.+||||++.+.|++|+||+||+||+++..+++.+.... |........+.
T Consensus 161 C~~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (364)
T KOG1082|consen 161 CHPDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDDDCDAYSIADREWV 240 (364)
T ss_pred CCCcCcchhhccccccceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhccccccccccccccchhhhcccc
Confidence 34578999999999999999999999999999999999999999999999999998774211 11100111111
Q ss_pred ---------------eeecCcceeeccccCCccccCCCCCCCCeeEEEEEECCe----eEEEEEEccCCCCCCeEEEecC
Q 002995 754 ---------------FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGD----EHIIIFAKRDIKQWEELTYDYR 814 (860)
Q Consensus 754 ---------------~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~i~i~A~RdI~~GEELt~dY~ 814 (860)
......+.|||...||++|||||||.||+.+..+..++. .+|+|||+++|.+||||||||+
T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg 320 (364)
T KOG1082|consen 241 DESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYG 320 (364)
T ss_pred ccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhc
Confidence 122467789999999999999999999999988877633 5899999999999999999999
Q ss_pred CCCC------------CCCeEEeeCCCCCccccCCCh
Q 002995 815 FFSI------------DEQLACYCGFPRCRGVVNDTE 839 (860)
Q Consensus 815 ~~~~------------~~~~~C~Cg~~~Cr~~~~~~~ 839 (860)
.... ..+..|.||+.+||+.+....
T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~~~ 357 (364)
T KOG1082|consen 321 KAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGSAP 357 (364)
T ss_pred ccccccccccccccccccchhhcCCCHHhCcccCCCc
Confidence 6532 245789999999999886654
No 13
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.87 E-value=9.8e-23 Score=181.95 Aligned_cols=92 Identities=29% Similarity=0.404 Sum_probs=74.7
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccccc--ccCChH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFH--LKCKKP 159 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~--~k~k~~ 159 (860)
+.+||||||||+|||||||+|++++.++. +....+++.+.++|+|||+++|+||.+++|+||.+++..... ++++.+
T Consensus 1 ~~~GdlVWaK~~g~pwWPa~V~~~~~~p~-~~~~~~~~~~~~~V~Ffgs~~y~Wv~~~~l~pf~e~~~~~~~~~~~~~~~ 79 (95)
T cd05838 1 PLYGDIVWAKLGNFRWWPAIICDPREVPP-NIQVLRHCIGEFCVMFFGTHDYYWVHRGRVFPYQEGDKGFKEQTKSYLAK 79 (95)
T ss_pred CCcCCEEEEECCCCCCCCeEEcChhhcCh-hHhhccCCCCeEEEEEeCCCCEEEeccccccchhhhhhhhhhhhhhhhHH
Confidence 35899999999999999999999887766 333334556889999999999999999999999987433222 224669
Q ss_pred HHHHHHHHHHHHHHh
Q 002995 160 RFTQSLEEAKVYLSE 174 (860)
Q Consensus 160 ~~~~A~~ea~~~~~~ 174 (860)
.|++||+||.++++.
T Consensus 80 ~f~~AleEA~~~~~~ 94 (95)
T cd05838 80 RFRKALEEASLAFKG 94 (95)
T ss_pred HHHHHHHHHHHHhhc
Confidence 999999999998753
No 14
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.87 E-value=6.5e-23 Score=181.82 Aligned_cols=88 Identities=32% Similarity=0.528 Sum_probs=76.2
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccc--cccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccC--C
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGL--NKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKC--K 157 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~--~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~--k 157 (860)
|++||||||||+|||||||+|++++.++. ..+ +.+++.+.|+|+|||+++|+||.+++|+||+++.+..+..+. |
T Consensus 1 f~~GDlVwaK~~GyPwWPA~V~~~~~~p~-~~l~~~~~~~~~~~~V~FFg~~~~~Wv~~~~l~pl~~~~~~~~l~~~~~k 79 (93)
T cd05840 1 FQPGDRVLAKVKGFPAWPAIVVPEEMLPD-SVLKGKKKKNKRTYPVMFFPDGDYYWVPNKDLKPLTEEKIAKFLKKPKRK 79 (93)
T ss_pred CCCCCEEEEeCCCCCCCCEEECChHHCCH-HHHhcccCCCCCeEEEEEeCCCcEEEEChhhcccCCHHHHHHHhhcCCCC
Confidence 78999999999999999999999988877 445 445677999999999999999999999999977677777654 5
Q ss_pred hHHHHHHHHHHHH
Q 002995 158 KPRFTQSLEEAKV 170 (860)
Q Consensus 158 ~~~~~~A~~ea~~ 170 (860)
++.+.+|++.|.+
T Consensus 80 ~k~l~~ay~~A~~ 92 (93)
T cd05840 80 DKELIKAYKAAKD 92 (93)
T ss_pred CHHHHHHHHHhcC
Confidence 6889999998853
No 15
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.87 E-value=9.7e-22 Score=183.20 Aligned_cols=115 Identities=45% Similarity=0.712 Sum_probs=100.4
Q ss_pred EEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCC
Q 002995 698 LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777 (860)
Q Consensus 698 l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHS 777 (860)
+.+.+++++|+||||+++|++|++|++|.|.++...++..+... +........|+|......+||+...||++||||||
T Consensus 2 ~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~iNHs 80 (116)
T smart00317 2 LEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKA-YDTDGADSFYLFEIDSDLCIDARRKGNIARFINHS 80 (116)
T ss_pred cEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHH-HHhcCCCCEEEEECCCCEEEeCCccCcHHHeeCCC
Confidence 56777789999999999999999999999999998887776542 22222225788998888999999999999999999
Q ss_pred CCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEec
Q 002995 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813 (860)
Q Consensus 778 C~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY 813 (860)
|.|||.......++..++.|+|+|||++|||||+||
T Consensus 81 c~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 81 CEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred CCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 999999998888777799999999999999999999
No 16
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.86 E-value=2.5e-22 Score=176.60 Aligned_cols=84 Identities=23% Similarity=0.410 Sum_probs=71.1
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccc--ccccCChH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSS--FHLKCKKP 159 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~--~~~k~k~~ 159 (860)
|.+||||||||+|||||||+|++++.... .+...+.++|+|||+++++||++++|+||.++ ... ...++|+.
T Consensus 1 f~vGDlVWaK~kg~pwWP~~V~~~~~~~~-----~~~~~~~~~V~fFGs~~~a~v~~~~l~pf~e~-~~~f~~~~~~k~~ 74 (87)
T cd05835 1 FNVGDLVWGKIKGFPWWPGRVVSITVTSK-----RPPVVGMRWVTWFGSGTFSEVSVDKLSPFSEF-FKAFSRYNRKKKG 74 (87)
T ss_pred CCCCCEEEEecCCCCCCCeEEechhhccc-----ccCCCCeEEEEEeCCCCEeEECHHHCcChhHh-HHHHhhhhhhhhH
Confidence 78999999999999999999999865443 13556889999999999999999999999997 555 34445679
Q ss_pred HHHHHHHHHHHH
Q 002995 160 RFTQSLEEAKVY 171 (860)
Q Consensus 160 ~~~~A~~ea~~~ 171 (860)
.|++||+||.+.
T Consensus 75 ~f~~Ai~eA~e~ 86 (87)
T cd05835 75 LYKKAIYEALEV 86 (87)
T ss_pred HHHHHHHHHHHc
Confidence 999999999874
No 17
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=99.86 E-value=5.5e-22 Score=175.65 Aligned_cols=86 Identities=30% Similarity=0.539 Sum_probs=74.8
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF 161 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~ 161 (860)
|++|||||||++|||||||+|+++...+... ..+...+.++|+|||+++|+||+.++|.||.++... ...+++++.|
T Consensus 1 f~~GdlVwaK~~g~pwWPa~V~~~~~~~~~~--~~~~~~~~~~V~Ffg~~~~~wv~~~~l~pf~~~~~~-~~~~~k~~~f 77 (87)
T cd05162 1 FRPGDLVWAKMKGYPWWPALVVDPPKDSKKA--KKKAKEGKVLVLFFGDKTFAWVGAERLKPFTEHKES-EAKQSKRKGF 77 (87)
T ss_pred CCCCCEEEEeCCCCCCCCEEEccccccchhh--hccCCCCEEEEEEeCCCcEEEeCccceeeccchHHh-hccCCccHHH
Confidence 7899999999999999999999998776622 334567889999999999999999999999998444 6667788999
Q ss_pred HHHHHHHHH
Q 002995 162 TQSLEEAKV 170 (860)
Q Consensus 162 ~~A~~ea~~ 170 (860)
++||++|.+
T Consensus 78 ~~A~~eA~~ 86 (87)
T cd05162 78 KKAYDEALE 86 (87)
T ss_pred HHHHHHHHh
Confidence 999999976
No 18
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.85 E-value=9.7e-22 Score=168.22 Aligned_cols=76 Identities=28% Similarity=0.562 Sum_probs=67.4
Q ss_pred CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeC-CCCEEEEeCCCcccccccccccccccCChH
Q 002995 81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG-THDFARINVKQVISFLKGLLSSFHLKCKKP 159 (860)
Q Consensus 81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg-~~~~awv~~~~l~~f~~~~~~~~~~k~k~~ 159 (860)
-++|||||||||+|||||||+|+.++ .+.|+|+||| ++++|||+.++|+||+++ +.++..++|++
T Consensus 6 c~~p~dLVwAK~kGyp~WPAkV~~~~-------------~~~~~V~FFG~t~~~a~v~~~~i~~~~~~-~~~~~~~~k~~ 71 (83)
T cd05841 6 CRPPHELVWAKLKGFPYWPAKVMRVE-------------DNQVDVRFFGGQHDRAWIPSNNIQPISTE-IPQQLVKKRSR 71 (83)
T ss_pred cCCCCCEEEEeCCCCCCCCEEEeecC-------------CCeEEEEEcCCCCCeEEEehHHeeehhhh-hhhhccccccH
Confidence 36899999999999999999998753 1459999999 999999999999999997 55677788889
Q ss_pred HHHHHHHHHHH
Q 002995 160 RFTQSLEEAKV 170 (860)
Q Consensus 160 ~~~~A~~ea~~ 170 (860)
.|++||+||+.
T Consensus 72 ~f~~A~~Eie~ 82 (83)
T cd05841 72 GFNKAMDELEL 82 (83)
T ss_pred HHHHHHHHHHh
Confidence 99999999975
No 19
>KOG1083 consensus Putative transcription factor ASH1/LIN-59 [Transcription]
Probab=99.84 E-value=3.7e-22 Score=230.17 Aligned_cols=133 Identities=35% Similarity=0.568 Sum_probs=118.3
Q ss_pred hhhHHhhh-ccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceee
Q 002995 685 DKYKHMKE-TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763 (860)
Q Consensus 685 ~~~~~l~~-~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iD 763 (860)
|+|+.|++ +....|.+.+.+..||||.|+++|++||||+||+|+||...+++.+...+|. ...+.|...++.+.+||
T Consensus 1166 c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~--~d~~~~cL~I~p~l~id 1243 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYH--NDDDHYCLVIDPGLFID 1243 (1306)
T ss_pred hhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhcccccccCC--CCCcccccccCccccCC
Confidence 66766655 5678999999999999999999999999999999999998877776332232 23467999999999999
Q ss_pred ccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC
Q 002995 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID 819 (860)
Q Consensus 764 a~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~ 819 (860)
+.++||.+|||||||.|||.++.+.|+|..||++||+|||.+||||||||+|...+
T Consensus 1244 ~~R~~n~~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ks~~ 1299 (1306)
T KOG1083|consen 1244 IPRMGNGARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFKSFN 1299 (1306)
T ss_pred hhhccccccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccccccC
Confidence 99999999999999999999999999999999999999999999999999987655
No 20
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=99.83 E-value=2.7e-21 Score=171.17 Aligned_cols=84 Identities=32% Similarity=0.579 Sum_probs=71.5
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccccccc--CChH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLK--CKKP 159 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k--~k~~ 159 (860)
|++|||||||++|||||||+|++++.... .....+.++|+|||+++++||+.++|.||.+. ...+..+ ++++
T Consensus 1 f~~GdlVWaK~~g~pwWPa~V~~~~~~~~-----~~~~~~~~~V~Ffg~~~~~wv~~~~i~~f~~~-~~~~~~~~~~k~~ 74 (86)
T PF00855_consen 1 FRPGDLVWAKLKGYPWWPARVCDPDEKSK-----KKRKDGHVLVRFFGDNDYAWVKPSNIKPFSEF-KEKLKKKKKKKRK 74 (86)
T ss_dssp -STTEEEEEEETTSEEEEEEEEECCHCTS-----CSSSSTEEEEEETTTTEEEEEEGGGEEECCHH-HHHHHHHHHHHSH
T ss_pred CCCCCEEEEEeCCCCCCceEEeecccccc-----cCCCCCEEEEEecCCCCEEEECHHHhhChhhh-HHHHHHhhccchH
Confidence 78999999999999999999999875444 24667889999999999999999999999954 5556654 5669
Q ss_pred HHHHHHHHHHHH
Q 002995 160 RFTQSLEEAKVY 171 (860)
Q Consensus 160 ~~~~A~~ea~~~ 171 (860)
.|++||+||+++
T Consensus 75 ~~~~Ai~eA~~~ 86 (86)
T PF00855_consen 75 SFRKAIEEAEEA 86 (86)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHhC
Confidence 999999999875
No 21
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=99.81 E-value=2.2e-20 Score=154.24 Aligned_cols=63 Identities=33% Similarity=0.636 Sum_probs=56.2
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccc
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFL 145 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~ 145 (860)
|++||||||||+|||||||+|++++..+. ...+.++..+.|+|+|||++++|||.+++|+||+
T Consensus 1 f~~GdlVwaK~~G~p~WPa~V~~~~~~~~-~~~~~~~~~~~~~V~Ffg~~~~awv~~~~l~p~~ 63 (63)
T smart00293 1 FKPGDLVWAKMKGFPWWPALVVSPKETPD-NIRKRKRFENLYPVLFFGDKDTAWISSSKLFPLT 63 (63)
T ss_pred CCCCCEEEEECCCCCCCCeEEcCcccCCh-hHhhccCCCCEEEEEEeCCCCEEEECccceeeCC
Confidence 78999999999999999999999987766 3444566778999999999999999999999985
No 22
>KOG1085 consensus Predicted methyltransferase (contains a SET domain) [General function prediction only]
Probab=99.80 E-value=1.3e-19 Score=183.13 Aligned_cols=124 Identities=37% Similarity=0.507 Sum_probs=110.2
Q ss_pred ccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeee---cCcceeeccccC-
Q 002995 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI---DDERVIDATRAG- 768 (860)
Q Consensus 693 ~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~---~~~~~iDa~~~g- 768 (860)
+....+.+..-.++|.||.|++.+++|+||.||.|.+|.-.++..|+.. |......++|||.+ +..++||||+--
T Consensus 253 g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~-Ya~De~~GcYMYyF~h~sk~yCiDAT~et~ 331 (392)
T KOG1085|consen 253 GTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQ-YANDEEIGCYMYYFEHNSKKYCIDATKETP 331 (392)
T ss_pred ccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHH-hccCcccceEEEeeeccCeeeeeecccccc
Confidence 3456677888888999999999999999999999999999999999986 66666778888765 467899999854
Q ss_pred CccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCC
Q 002995 769 SIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFS 817 (860)
Q Consensus 769 n~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~ 817 (860)
-+.|+||||-.+||..+++.++|.+|++++|.|||++||||+||||.-+
T Consensus 332 ~lGRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS 380 (392)
T KOG1085|consen 332 WLGRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS 380 (392)
T ss_pred cchhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence 4799999999999999999999999999999999999999999998643
No 23
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=99.78 E-value=3.1e-19 Score=164.05 Aligned_cols=90 Identities=26% Similarity=0.416 Sum_probs=65.5
Q ss_pred CCCCCCEEEEEecccCCCceEeeCCCCcCccc-cccccCCCCcEEEEEeCCC-CEEEEeCCCccccccccccc-------
Q 002995 81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYK-GLNKISGGRSIPVQFFGTH-DFARINVKQVISFLKGLLSS------- 151 (860)
Q Consensus 81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~-~~~~~~~~~~~~V~FFg~~-~~awv~~~~l~~f~~~~~~~------- 151 (860)
.|.+||||||||+|||||||+|++++..+... .....+..+.|+|+|||++ ++|||+.++|.||++.....
T Consensus 2 ~~~~GdlVWaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~aWv~~~~l~pf~~~~~~~~~~~~~~ 81 (110)
T cd05837 2 KYQVGDLVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPERAWISEKSLKPFKGSKQFESEKGEKF 81 (110)
T ss_pred CCCCCCEEEEeCCCCCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCEEEecHHHccccCCchhhhhhhhhhh
Confidence 58999999999999999999999877665522 1133456688999999985 79999999999999752111
Q ss_pred -ccccCChHHHHHHHHHHHH
Q 002995 152 -FHLKCKKPRFTQSLEEAKV 170 (860)
Q Consensus 152 -~~~k~k~~~~~~A~~ea~~ 170 (860)
..+.+.++.+++|..+++.
T Consensus 82 ~~~K~~~~~~~~~a~~~~~~ 101 (110)
T cd05837 82 KVRKPNIKKARQKADIAIMQ 101 (110)
T ss_pred hccCCcchhHHHHHHHHHHH
Confidence 1122233566666666554
No 24
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA). MUM-1 belongs to the mutated or aberrantly expressed type of MAAs, along with antigens such as CDK4, beta-catenin, gp100-in4, p15, and N-acetylglucosaminyltransferase V. It is highly expressed in several types of human cancers. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.70 E-value=1.7e-17 Score=141.30 Aligned_cols=75 Identities=25% Similarity=0.476 Sum_probs=57.3
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCC-CEEEEeCCCcccccccccccccccCChHH
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTH-DFARINVKQVISFLKGLLSSFHLKCKKPR 160 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~-~~awv~~~~l~~f~~~~~~~~~~k~k~~~ 160 (860)
|++|||||||++|||||||+|.+... +.+.++|+||||+ +++|+..++|+||.++ .....+....+.
T Consensus 1 f~~gdlVWaK~~g~P~WPa~I~~~~~-----------~~~k~~V~FfG~~~~~a~~~~~~l~p~~~~-~~~~ek~~~~~k 68 (80)
T cd06080 1 FEKNDLVWAKIQGYPWWPAVIKSISR-----------KKQKARVNFIGDNMQSEKKGIRVVKRWLKH-FDCTEKQKLTNK 68 (80)
T ss_pred CCCCCEEEEeCCCCCCCCEEEeeecC-----------CCCEEEEEEeCCCCceeccchhhccccccc-HHHHHHHHHHHH
Confidence 78999999999999999999987532 2457999999999 7999999999999997 333222223344
Q ss_pred HHHHHHHH
Q 002995 161 FTQSLEEA 168 (860)
Q Consensus 161 ~~~A~~ea 168 (860)
.++++++|
T Consensus 69 ~ke~~~~a 76 (80)
T cd06080 69 AKESYEQA 76 (80)
T ss_pred HHHHHHHH
Confidence 45555544
No 25
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.69 E-value=1.9e-17 Score=149.61 Aligned_cols=66 Identities=23% Similarity=0.394 Sum_probs=53.1
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCcc---c-----------cccccCCCCcEEEEEeCC-CCEEEEeCCCcccccc
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDY---K-----------GLNKISGGRSIPVQFFGT-HDFARINVKQVISFLK 146 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~---~-----------~~~~~~~~~~~~V~FFg~-~~~awv~~~~l~~f~~ 146 (860)
+.||||||||++|||||||+|+++...... . .++.......|+|+||++ .+++||+.++|.||..
T Consensus 1 ~~pg~lVwaK~~g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~~Wv~~~~l~pl~~ 80 (111)
T cd05839 1 LEPLTLVWAKCRGYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTWQWLPGDKLEPLGV 80 (111)
T ss_pred CCCcCEeeeeecCCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcceecCHHHCccccc
Confidence 469999999999999999999998653321 1 112345667899999998 6799999999999986
Q ss_pred c
Q 002995 147 G 147 (860)
Q Consensus 147 ~ 147 (860)
+
T Consensus 81 ~ 81 (111)
T cd05839 81 D 81 (111)
T ss_pred c
Confidence 4
No 26
>PF05964 FYRN: F/Y-rich N-terminus; InterPro: IPR003888 The "FY-rich" domain N-terminal region is sometimes closely juxtaposed with the C-terminal region (IPR003889 from INTERPRO), but sometimes is far distant. It is of unknown function, but occurs frequently in chromatin-associated proteins like trithorax and its homologues.; GO: 0005634 nucleus; PDB: 2WZO_A.
Probab=99.67 E-value=5.4e-17 Score=128.93 Aligned_cols=54 Identities=43% Similarity=0.746 Sum_probs=42.3
Q ss_pred eCCEEEEeeeeeeeCCCCCCCCCcccCCCeEEEEEeccCCCCCCcceEEEEEEe
Q 002995 230 FGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLR 283 (860)
Q Consensus 230 ~g~l~i~slG~iv~~~~~fh~~~~i~P~Gy~s~r~~~S~~dp~~~~~y~~~I~~ 283 (860)
+|+|+|+|||+|++++++||+++||||+||+++|.|+|+.||.+++.|+|+|+|
T Consensus 1 igsl~v~sLG~i~~~~~~fh~~~~IyP~Gy~s~R~y~S~~~p~~~~~Y~~~Ild 54 (54)
T PF05964_consen 1 IGSLTVHSLGKIVPDRPAFHSERYIYPVGYKSSRLYWSTVDPRRRCRYTCEILD 54 (54)
T ss_dssp -TTEEEEEEEE---SSGGGB-SS-B--EEEEEEEEEE-SS-TTSEEEEEEEEE-
T ss_pred CCceEEEECeEEeCCCCCccCCCEEeeCCEEEEEEEccccCCCCEEEEEEEEeC
Confidence 589999999999999989999999999999999999999999999999999986
No 27
>PF13771 zf-HC5HC2H: PHD-like zinc-binding domain
Probab=99.62 E-value=3e-16 Score=140.03 Aligned_cols=88 Identities=38% Similarity=0.759 Sum_probs=79.0
Q ss_pred ccccccccCceEEccCCC--ccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhhhcCeeEEEecccc
Q 002995 475 HLACAIWIPETCLTDVKR--MEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDR 552 (860)
Q Consensus 475 H~~Ca~~~p~~~~~~~~~--~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~ 552 (860)
|+.||+|+|++.+.+... +.++.+++.+..++++++|.+|+++.||+|+|..++|.+.||++||+.+|+.+++...
T Consensus 1 H~~Calwsp~v~~~~~~~~~~~~i~~v~~~~~~~~~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~~~~~~~-- 78 (90)
T PF13771_consen 1 HENCALWSPEVYFDESEDIGGFSIEDVEKEIKRRRKLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCFIEFDED-- 78 (90)
T ss_pred ChHHheecCceEEeCCCccccccHHhHHHHHHHHhCCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeEEEEccC--
Confidence 899999999999998764 6788999999999999999999998899999999999999999999999999876531
Q ss_pred ccccccCCCccccccceeeccccCC
Q 002995 553 LNLLSLDEDDEDQCIRLLSFCKKHK 577 (860)
Q Consensus 553 ~~~~~~~~~~~~~~~~~~~~C~~H~ 577 (860)
...+.+||++|+
T Consensus 79 -------------~~~~~~~C~~H~ 90 (90)
T PF13771_consen 79 -------------NGKFRIFCPKHS 90 (90)
T ss_pred -------------CCceEEEChhcC
Confidence 136889999996
No 28
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.60 E-value=1.3e-16 Score=185.14 Aligned_cols=141 Identities=40% Similarity=0.674 Sum_probs=118.7
Q ss_pred cceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecC-cceeeccccCCccc
Q 002995 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDD-ERVIDATRAGSIAH 772 (860)
Q Consensus 694 ~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~-~~~iDa~~~gn~aR 772 (860)
......+..+.++|+|+||++.|++|++|.+|.|++++..++..++... . ..+....++.+.. ..++|+...|+.+|
T Consensus 330 ~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~d~~~~g~~~r 407 (480)
T COG2940 330 RREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENY-D-LLGNEFSFGLLEDKDKVRDSQKAGDVAR 407 (480)
T ss_pred ccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccc-c-ccccccchhhccccchhhhhhhcccccc
Confidence 3556778888999999999999999999999999999999998888753 2 2233333333333 78999999999999
Q ss_pred cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC---------CCeEEeeCCCCCccccC
Q 002995 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID---------EQLACYCGFPRCRGVVN 836 (860)
Q Consensus 773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~---------~~~~C~Cg~~~Cr~~~~ 836 (860)
||||||.|||......+.|..++.++|+|||.+||||++||+...++ ..+.|.|+++.|+++|+
T Consensus 408 ~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480)
T COG2940 408 FINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTMS 480 (480)
T ss_pred eeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCCC
Confidence 99999999999998888777799999999999999999999886655 35789999999999874
No 29
>PF05965 FYRC: F/Y rich C-terminus; InterPro: IPR003889 The "FY-rich" domain C-terminal region is sometimes closely juxtaposed with the N-terminal region (IPR003888 from INTERPRO), but sometimes is far distant. It is of unknown function, but occurs frequently in chromatin-associated proteins like trithorax and its homologues.; GO: 0005634 nucleus; PDB: 2WZO_A.
Probab=99.60 E-value=5.8e-16 Score=136.62 Aligned_cols=79 Identities=37% Similarity=0.594 Sum_probs=57.7
Q ss_pred cceEEEEcCC--CCceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCC
Q 002995 290 RPLFRVTLDN--GEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPT 367 (860)
Q Consensus 290 ~P~F~Vt~~~--~~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c 367 (860)
+|+|+|+.+| +..|.|+||++||.+|+++|++.+..... .......++|++||||++|.|++|||+||+|++|
T Consensus 3 ~P~F~Vt~~d~p~~~~~g~s~~~~W~~i~~~v~~~r~~~~~-----~~~~~~~isG~~~FGls~p~V~~lie~Lp~a~~c 77 (86)
T PF05965_consen 3 GPLFEVTSEDDPGEVFEGSSPTEAWSEILERVNEARKQSGL-----LKLPPNSISGPEMFGLSNPAVQRLIESLPGADKC 77 (86)
T ss_dssp SEEEEEEETT-GGG-EEESSHHHHHHHHHHHHHHHHT------------TT----HHHHHSTTSHHHHHHHTTSTTGGG-
T ss_pred CCEEEEEECCCCCCEEEeCCHHHHHHHHHHHHHHHHhhccc-----cccCCCCCCHhHhcCCCCHHHHHHHHhCCCcchh
Confidence 3999999999 45699999999999999999987766431 1122457899999999999999999999999999
Q ss_pred Cchhhh
Q 002995 368 SKSSLC 373 (860)
Q Consensus 368 ~k~~~~ 373 (860)
.+|.++
T Consensus 78 ~~Y~f~ 83 (86)
T PF05965_consen 78 SNYKFR 83 (86)
T ss_dssp TT----
T ss_pred hcCCcc
Confidence 999654
No 30
>smart00542 FYRC "FY-rich" domain, C-terminal region. is sometimes closely juxtaposed with the N-terminal region (FYRN), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
Probab=99.60 E-value=8.6e-16 Score=134.73 Aligned_cols=79 Identities=30% Similarity=0.498 Sum_probs=66.2
Q ss_pred eEEEEcCC--CCceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCCCc
Q 002995 292 LFRVTLDN--GEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSK 369 (860)
Q Consensus 292 ~F~Vt~~~--~~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c~k 369 (860)
+|+|+.++ +..|.|+||++||.+|+++|++.+...... ......+||++||||++|+|++|||+||+|++|.+
T Consensus 1 lF~v~~~~~~~~~~~~~S~~~~W~~vl~~v~~~r~~~~~~-----~~~~~~isG~~mFGls~p~V~~lie~Lpga~~C~~ 75 (86)
T smart00542 1 LFRVEIESDPDEVFKGESPEKCWEMVLERVQEARIVARLL-----QLLPEGVSGEDMFGLSSPAVVKLIEQLPGVHQCTN 75 (86)
T ss_pred CeEEEEecCCCCeEEeCCHHHHHHHHHHHHHHHHHHcccC-----CCCCCCCCcHHHhCCCcHHHHHHHHhCCCchhhhh
Confidence 58999877 567999999999999999999887664321 11234579999999999999999999999999999
Q ss_pred hhhhhh
Q 002995 370 SSLCKL 375 (860)
Q Consensus 370 ~~~~k~ 375 (860)
|.++..
T Consensus 76 Y~~~~~ 81 (86)
T smart00542 76 YWFRYH 81 (86)
T ss_pred hhhccC
Confidence 998754
No 31
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=99.57 E-value=1.4e-15 Score=171.85 Aligned_cols=76 Identities=34% Similarity=0.711 Sum_probs=68.4
Q ss_pred ceeeccccCCccccCCCCCCCCeeEEEEEECCe----eEEEEEEccCCCCCCeEEEecCCCCC---CCCeEEeeCCCCCc
Q 002995 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGD----EHIIIFAKRDIKQWEELTYDYRFFSI---DEQLACYCGFPRCR 832 (860)
Q Consensus 760 ~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~i~i~A~RdI~~GEELt~dY~~~~~---~~~~~C~Cg~~~Cr 832 (860)
++|||+..||++||+||||.||+.++.++|+.. +-++|||.|-|+||.|||+||++.-. .....|.||+.+||
T Consensus 1179 yvIDAk~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~Cr 1258 (1262)
T KOG1141|consen 1179 YVIDAKQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCR 1258 (1262)
T ss_pred EEEecccccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccccceEEEecChhhhh
Confidence 479999999999999999999999999999865 45899999999999999999998643 25689999999999
Q ss_pred ccc
Q 002995 833 GVV 835 (860)
Q Consensus 833 ~~~ 835 (860)
|.|
T Consensus 1259 grL 1261 (1262)
T KOG1141|consen 1259 GRL 1261 (1262)
T ss_pred ccc
Confidence 976
No 32
>KOG1904 consensus Transcription coactivator [Transcription]
Probab=99.45 E-value=6.5e-14 Score=156.80 Aligned_cols=88 Identities=23% Similarity=0.374 Sum_probs=76.7
Q ss_pred CCCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCCh
Q 002995 79 CQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKK 158 (860)
Q Consensus 79 ~~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~ 158 (860)
...|++||||||||+|||.|||+|.+...... ++....|.|+|||++++|||.+++|+||..+ .+.+....++
T Consensus 10 ~~~~~~GDLV~AKlkgyp~WParI~~~~~~~~------kp~pkky~V~FfGT~e~Afl~p~dlqpy~~~-k~~~g~~~k~ 82 (496)
T KOG1904|consen 10 AGNFKCGDLVFAKLKGYPPWPARIRNGPDGAV------KPPPKKYTVFFFGTKETAFLKPKDLQPYMLN-KEKLGKPNKR 82 (496)
T ss_pred cCCCCCCceeeecccCCCCCcccccCcccccc------cCCCceeEEEEeccCcccccchhhccchhhh-hhhcccchhh
Confidence 45799999999999999999999988654322 2356779999999999999999999999998 7888888888
Q ss_pred --HHHHHHHHHHHHHHH
Q 002995 159 --PRFTQSLEEAKVYLS 173 (860)
Q Consensus 159 --~~~~~A~~ea~~~~~ 173 (860)
+.|++|++++.+++.
T Consensus 83 ~~k~F~~av~eI~~a~~ 99 (496)
T KOG1904|consen 83 VWKGFIEAVEEIREAFN 99 (496)
T ss_pred hhHHHHHHHHHHHHHhc
Confidence 999999999999873
No 33
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.42 E-value=2.4e-13 Score=133.29 Aligned_cols=108 Identities=26% Similarity=0.358 Sum_probs=74.8
Q ss_pred CeeEEeeecCCCCCEEEEecCEeeChhhHhhh-------------------H---------H-----------Hhhhc--
Q 002995 707 GFGIFAKHPHRAGDMVIEYTGELVRPSIADRR-------------------E---------H-----------FIYNS-- 745 (860)
Q Consensus 707 G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r-------------------~---------~-----------~~~~~-- 745 (860)
|+||||+++|++|++|++..+.+++....... . . .....
T Consensus 1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (162)
T PF00856_consen 1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE 80 (162)
T ss_dssp SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999886665431 0 0 00000
Q ss_pred c-------------cCCcceeeeecCcceeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEe
Q 002995 746 L-------------VGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812 (860)
Q Consensus 746 ~-------------~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~d 812 (860)
. ................++....+++.|+||||.|||.+..........+.|+|.|+|++|||||++
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~is 160 (162)
T PF00856_consen 81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFIS 160 (162)
T ss_dssp CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEE
T ss_pred ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEE
Confidence 0 000011112223334566777889999999999999998877777899999999999999999999
Q ss_pred cC
Q 002995 813 YR 814 (860)
Q Consensus 813 Y~ 814 (860)
||
T Consensus 161 YG 162 (162)
T PF00856_consen 161 YG 162 (162)
T ss_dssp ST
T ss_pred EC
Confidence 97
No 34
>smart00541 FYRN "FY-rich" domain, N-terminal region. is sometimes closely juxtaposed with the C-terminal region (FYRC), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
Probab=99.31 E-value=2e-12 Score=97.54 Aligned_cols=41 Identities=27% Similarity=0.553 Sum_probs=38.1
Q ss_pred eeeCCCCCCCCCcccCCCeEEEEEeccCCCCCCcceEEEEE
Q 002995 241 IVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEV 281 (860)
Q Consensus 241 iv~~~~~fh~~~~i~P~Gy~s~r~~~S~~dp~~~~~y~~~I 281 (860)
++.++++||++++|||+||+++|.|+|+.||.++|.|+|.|
T Consensus 2 ~~~~~~~fh~~~~IyP~Gy~s~R~y~S~~dp~~~c~Y~c~i 42 (44)
T smart00541 2 LPIQGKLFHSEDAIFPVGYKSTRKYWSVKDPNRRCNYSCVI 42 (44)
T ss_pred ccccCCCcccCCEEecCCEEEEEEEecccCCCCEEEEEEEE
Confidence 56788999999999999999999999999999999996655
No 35
>PF13831 PHD_2: PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=99.07 E-value=2.5e-11 Score=87.57 Aligned_cols=36 Identities=44% Similarity=1.013 Sum_probs=21.9
Q ss_pred CCeeEEccccCceeccccccccccCCCCceeeeccc
Q 002995 410 NNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445 (860)
Q Consensus 410 ~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~ 445 (860)
+|+||+|++|++.||+.|||+..++++..|+|++|+
T Consensus 1 ~n~ll~C~~C~v~VH~~CYGv~~~~~~~~W~C~~C~ 36 (36)
T PF13831_consen 1 TNPLLFCDNCNVAVHQSCYGVSEVPDGDDWLCDRCE 36 (36)
T ss_dssp -CEEEE-SSS--EEEHHHHT-SS--SS-----HHH-
T ss_pred CCceEEeCCCCCcCChhhCCcccCCCCCcEECCcCC
Confidence 478999999999999999999988888789999995
No 36
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription]
Probab=98.82 E-value=1.1e-09 Score=124.98 Aligned_cols=127 Identities=30% Similarity=0.481 Sum_probs=106.4
Q ss_pred EeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCeeEEEEEEC
Q 002995 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN 790 (860)
Q Consensus 711 fA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~ 790 (860)
+|..+|.+| +|+++...+...+..+.-.. .....|+..+..+..||+...||+.||+||||+|||....+.+.
T Consensus 319 ~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~-~~~~~~~~~~e~~~~id~~~~~n~sr~~nh~~~~~v~~~k~~~~ 391 (463)
T KOG1081|consen 319 TAKADIRKG------VGEVIDDKECKARLQRVKES-DLVDFYMVFIQKDRIIDAGPKGNYSRFLNHSCQPNVETEKWQVI 391 (463)
T ss_pred hhHHhhhcc------cCcccchhhheeehhhhhcc-chhhhhhhhhhcccccccccccchhhhhcccCCCceeechhhee
Confidence 888888888 89999888877776643322 23345555555555999999999999999999999999988899
Q ss_pred CeeEEEEEEccCCCCCCeEEEecCCCCCCCCeEEeeCCCCCccccCCChHHHHH
Q 002995 791 GDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQV 844 (860)
Q Consensus 791 ~~~~i~i~A~RdI~~GEELt~dY~~~~~~~~~~C~Cg~~~Cr~~~~~~~~~~~~ 844 (860)
+..++.++|.+.|.+|||||++|+.......+.|.|++.+|.++++....+...
T Consensus 392 ~~t~~~~~a~~~i~~g~e~t~~~n~~~~~~~~~~~~~~e~~~~~~~k~~~~~~~ 445 (463)
T KOG1081|consen 392 GDTRVGLFAPRQIEAGEELTFNYNGNCEGNEKRCCCGSENCTETKGKKKKKKKQ 445 (463)
T ss_pred cccccccccccccccchhhhheeeccccCCcceEeecccccccCCcccccchhh
Confidence 999999999999999999999999887778899999999999999988774443
No 37
>KOG2589 consensus Histone tail methylase [Chromatin structure and dynamics]
Probab=98.78 E-value=5.8e-09 Score=110.21 Aligned_cols=115 Identities=24% Similarity=0.260 Sum_probs=83.4
Q ss_pred ccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCeeE
Q 002995 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784 (860)
Q Consensus 705 ~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~ 784 (860)
..|--|.+++.+.+||=|--.+|-|+.-.+++++.... ..+.+ |...-+ ..--.|...=+.|+||||.|.|||.+
T Consensus 136 ~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~---~g~nD-FSvmyS-tRk~caqLwLGPaafINHDCrpnCkF 210 (453)
T KOG2589|consen 136 QNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLR---GGGND-FSVMYS-TRKRCAQLWLGPAAFINHDCRPNCKF 210 (453)
T ss_pred CCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHh---ccCCc-eeeeee-cccchhhheeccHHhhcCCCCCCcee
Confidence 45778999999999999999999998888888775321 11221 111111 01112333345799999999999988
Q ss_pred EEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC-CCeEEeeC
Q 002995 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID-EQLACYCG 827 (860)
Q Consensus 785 ~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~-~~~~C~Cg 827 (860)
.. .|..++.+.++|||++|||||.-|+..+.+ ....|.|-
T Consensus 211 vs---~g~~tacvkvlRDIePGeEITcFYgs~fFG~~N~~CeC~ 251 (453)
T KOG2589|consen 211 VS---TGRDTACVKVLRDIEPGEEITCFYGSGFFGENNEECECV 251 (453)
T ss_pred ec---CCCceeeeehhhcCCCCceeEEeecccccCCCCceeEEe
Confidence 63 466899999999999999999999877665 45678773
No 38
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=97.60 E-value=1.6e-05 Score=62.66 Aligned_cols=45 Identities=36% Similarity=0.842 Sum_probs=36.2
Q ss_pred ccccccCCcccCCCeeEEccccCceecccccccccc----CCCCceeeeccc
Q 002995 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEP----VNGVLWLCNLCR 445 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~----~~~~~W~C~~C~ 445 (860)
+|.||.. ..+.+.+|+|+.|+..+|..|++.... +.+ .|+|+.|.
T Consensus 1 ~C~vC~~--~~~~~~~i~C~~C~~~~H~~C~~~~~~~~~~~~~-~w~C~~C~ 49 (51)
T PF00628_consen 1 YCPVCGQ--SDDDGDMIQCDSCNRWYHQECVGPPEKAEEIPSG-DWYCPNCR 49 (51)
T ss_dssp EBTTTTS--SCTTSSEEEBSTTSCEEETTTSTSSHSHHSHHSS-SBSSHHHH
T ss_pred eCcCCCC--cCCCCCeEEcCCCChhhCcccCCCChhhccCCCC-cEECcCCc
Confidence 4889998 344567999999999999999997533 223 89999996
No 39
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=97.56 E-value=6.3e-05 Score=84.30 Aligned_cols=117 Identities=24% Similarity=0.345 Sum_probs=79.7
Q ss_pred CCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceec
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAH 475 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH 475 (860)
...|.||........|.+++|++|+...||.|--... +.-+.|.+..|..... .+.||++| .|.-+|
T Consensus 83 e~~~nv~~s~~~~p~~e~~~~~r~~~~~~q~~~i~~~-~~~~~~~~~~c~~~~~---------~~~g~a~K---~g~~a~ 149 (464)
T KOG4323|consen 83 ELNPNVLTSETVLPENEKVICGRCKSGYHQGCNIPRF-PSLDIGESTECVFPIF---------SQEGGALK---KGRLAR 149 (464)
T ss_pred ccCCcccccccccCchhhhhhhhhccCcccccCccCc-CcCCcccccccccccc---------cccccccc---cccccc
Confidence 4689999998877889999999999999999975433 3334788888887544 46788888 455565
Q ss_pred cccccccCceEEccCCCccccccchhhhccccCccceecCC----cceeeEEcCCCCcCccccchhhhhc
Q 002995 476 LACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV----SYGACIQCSNTTCRVAYHPLCARAA 541 (860)
Q Consensus 476 ~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~----~~Ga~iqC~~~~C~~~fH~~CA~~~ 541 (860)
|-+.+.. .+++.-...+..+.|++|.. ..-.+|||. +|.++||-.|-+-.
T Consensus 150 -------~~l~y~~-------~~l~wD~~~~~n~qc~vC~~g~~~~~NrmlqC~--~C~~~fHq~Chqp~ 203 (464)
T KOG4323|consen 150 -------PSLPYPE-------ASLDWDSGHKVNLQCSVCYCGGPGAGNRMLQCD--KCRQWYHQACHQPL 203 (464)
T ss_pred -------ccccCcc-------cccccCccccccceeeeeecCCcCccceeeeec--ccccHHHHHhccCC
Confidence 2111111 11111111222345999974 123899998 99999999997543
No 40
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the PF09465 LBR_tudor: Lamin-B receptor of TUDOR domain; InterPro: IPR019023 The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=97.47 E-value=0.0001 Score=57.61 Aligned_cols=32 Identities=44% Similarity=0.634 Sum_probs=27.1
Q ss_pred CCceeeeeeeecCCCCceEEEecCCCcccccc
Q 002995 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLIL 33 (860)
Q Consensus 2 ~~~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l 33 (860)
..||.|.|.+||+.+++|.|.|+||++.++..
T Consensus 20 ~lYYe~kV~~~d~~~~~y~V~Y~DGtel~lke 51 (55)
T PF09465_consen 20 SLYYEGKVLSYDSKSDRYTVLYEDGTELELKE 51 (55)
T ss_dssp S-EEEEEEEEEETTTTEEEEEETTS-EEEEEC
T ss_pred CcEEEEEEEEecccCceEEEEEcCCCEEEecc
Confidence 46999999999999999999999999966554
No 42
>KOG2461 consensus Transcription factor BLIMP-1/PRDI-BF1, contains C2H2-type Zn-finger and SET domains [Transcription]
Probab=97.37 E-value=0.00017 Score=81.00 Aligned_cols=110 Identities=22% Similarity=0.273 Sum_probs=81.6
Q ss_pred cceeEEEEEeccc--CeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeec----Ccceeeccc-
Q 002995 694 FRKRLAFGKSGIH--GFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRID----DERVIDATR- 766 (860)
Q Consensus 694 ~~~~l~v~~s~~~--G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~----~~~~iDa~~- 766 (860)
....|.|..|.+- |.||++...|.+|+--+=|.|+++... ... .....|++.+- .-.+||++.
T Consensus 26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~~~~----~~~------~~n~~y~W~I~~~d~~~~~iDg~d~ 95 (396)
T KOG2461|consen 26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEIIASI----DSK------SANNRYMWEIFSSDNGYEYIDGTDE 95 (396)
T ss_pred CCCceEeeccccCCccccccccccccCcccccCccCcccccc----ccc------cccCcceEEEEeCCCceEEeccCCh
Confidence 4566788887654 789999999999999999999982111 111 12234555543 235899875
Q ss_pred -cCCccccCCCCCCC---CeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCC
Q 002995 767 -AGSIAHLINHSCEP---NCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFS 817 (860)
Q Consensus 767 -~gn~aRfiNHSC~P---N~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~ 817 (860)
..|+.||+|=.++. |..+. .....|.++|+|+|.+||||-+.|+-++
T Consensus 96 ~~sNWmRYV~~Ar~~eeQNL~A~----Q~~~~Ifyrt~r~I~p~eELlVWY~~e~ 146 (396)
T KOG2461|consen 96 EHSNWMRYVNSARSEEEQNLLAF----QIGENIFYRTIRDIRPNEELLVWYGSEY 146 (396)
T ss_pred hhcceeeeecccCChhhhhHHHH----hccCceEEEecccCCCCCeEEEEeccch
Confidence 88999999988865 75543 2456899999999999999999997654
No 43
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=97.09 E-value=0.0002 Score=73.46 Aligned_cols=49 Identities=35% Similarity=0.868 Sum_probs=40.1
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccccc--cccCCCCceeeecccC
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE--LEPVNGVLWLCNLCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~ 446 (860)
+--.|+||+..+ ++++||+||.|+.++|++|+.. ..+|+| +|-|..|..
T Consensus 280 eck~csicgtse--nddqllfcddcdrgyhmyclsppm~eppeg-swsc~KOG~ 330 (336)
T KOG1244|consen 280 ECKYCSICGTSE--NDDQLLFCDDCDRGYHMYCLSPPMVEPPEG-SWSCHLCLE 330 (336)
T ss_pred ecceeccccCcC--CCceeEeecccCCceeeEecCCCcCCCCCC-chhHHHHHH
Confidence 445899998754 4578999999999999999984 456777 999999964
No 44
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=96.95 E-value=0.00045 Score=77.58 Aligned_cols=53 Identities=36% Similarity=0.698 Sum_probs=42.4
Q ss_pred CCccccccCCcccCCCeeEEccccCceecccccccc------ccCCCCceeeecccCCCC
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL------EPVNGVLWLCNLCRPGAP 449 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~------~~~~~~~W~C~~C~~~~~ 449 (860)
+..|+||........|.||+|++|+-.+|+.|.--. ..+ ...|+|..|..+..
T Consensus 168 n~qc~vC~~g~~~~~NrmlqC~~C~~~fHq~Chqp~i~~~l~~D~-~~~w~C~~C~~~~~ 226 (464)
T KOG4323|consen 168 NLQCSVCYCGGPGAGNRMLQCDKCRQWYHQACHQPLIKDELAGDP-FYEWFCDVCNRGPK 226 (464)
T ss_pred cceeeeeecCCcCccceeeeecccccHHHHHhccCCCCHhhccCc-cceEeehhhccchh
Confidence 457999998888888899999999999999998522 122 23799999997643
No 45
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=96.81 E-value=0.00041 Score=71.54 Aligned_cols=49 Identities=24% Similarity=0.529 Sum_probs=41.2
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeee--cccC
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCN--LCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~--~C~~ 446 (860)
+-..|.||+...-+ +++++||.|+.++|..|.|....|.| .|+|+ .|+.
T Consensus 313 ~C~lC~IC~~P~~E--~E~~FCD~CDRG~HT~CVGL~~lP~G-~WICD~~C~~~ 363 (381)
T KOG1512|consen 313 SCELCRICLGPVIE--SEHLFCDVCDRGPHTLCVGLQDLPRG-EWICDMRCREA 363 (381)
T ss_pred ccHhhhccCCcccc--hheeccccccCCCCccccccccccCc-cchhhhHHHHh
Confidence 45689999987654 44999999999999999999888888 99999 5543
No 46
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription]
Probab=96.78 E-value=0.00062 Score=78.35 Aligned_cols=55 Identities=29% Similarity=0.655 Sum_probs=46.1
Q ss_pred CCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccc
Q 002995 80 QELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISF 144 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f 144 (860)
..++.||+||-++..|+|||++|+.+..++ +.. . . .+|+||+. ++|+....+.++
T Consensus 134 ~~~~~~~~vw~~vg~~~~~~c~vc~~~~~~----~~~-~--~-~~~~f~~~--~~~~~~~~~~~~ 188 (463)
T KOG1081|consen 134 KKREVGDLVWSKVGEYPWWPCMVCHDPLLP----KGM-K--H-DHVNFFGC--YAWTHEKRVFPY 188 (463)
T ss_pred ccccceeEEeEEcCcccccccceecCcccc----hhh-c--c-ccceeccc--hhhHHHhhhhhc
Confidence 368999999999999999999999887665 111 1 1 18999999 999999999999
No 47
>KOG1084 consensus Transcription factor TCF20 [Transcription]
Probab=96.50 E-value=0.00087 Score=75.03 Aligned_cols=85 Identities=26% Similarity=0.440 Sum_probs=65.0
Q ss_pred CCCccccCCCCCCceec-CCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcC
Q 002995 452 PPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCR 530 (860)
Q Consensus 452 ~~~C~lC~~~gGalk~t-~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~ 530 (860)
...|+|++... .+. ....|+|+.|++|.|.+.+.....+..+. ........+.|..|.+ .|+.+.|...+|.
T Consensus 221 e~~~~l~~~~~---~~d~~~~~~~h~~c~~~~~~~~~~q~~~l~~~~---~~v~r~~~~~c~~c~k-~ga~~~c~~~~~~ 293 (375)
T KOG1084|consen 221 EFFCALSPKAT---IPDIGFELWYHRYCALWAPNVHESQGGQLTNVD---NAVIRFPSLQCILCQK-PGATLKCVQASLL 293 (375)
T ss_pred hhhhhhcCCCc---CCccchhHHHHHHHHhcCCcceeccCccccCch---hhhhcccchhcccccC-CCCchhhhhhhhh
Confidence 34788886443 333 45689999999999999988876655444 3222333379999999 8999999999999
Q ss_pred ccccchhhhhcCe
Q 002995 531 VAYHPLCARAAGL 543 (860)
Q Consensus 531 ~~fH~~CA~~~g~ 543 (860)
..+|.+|+..+-.
T Consensus 294 ~~~h~~c~~~~~~ 306 (375)
T KOG1084|consen 294 SNAHFPCARAKNG 306 (375)
T ss_pred cccCcccccCccc
Confidence 9999999987755
No 48
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.02 E-value=0.0036 Score=41.49 Aligned_cols=19 Identities=37% Similarity=1.015 Sum_probs=16.3
Q ss_pred CeEEeeCCCCCccccCCCh
Q 002995 821 QLACYCGFPRCRGVVNDTE 839 (860)
Q Consensus 821 ~~~C~Cg~~~Cr~~~~~~~ 839 (860)
.++|+||+++|||+|+...
T Consensus 2 ~~~C~CGs~~CRG~l~~~~ 20 (26)
T smart00508 2 KQPCLCGAPNCRGFLGXXX 20 (26)
T ss_pred CeeeeCCCccccceecccc
Confidence 5799999999999996643
No 49
>PF15057 DUF4537: Domain of unknown function (DUF4537)
Probab=95.63 E-value=0.034 Score=52.52 Aligned_cols=100 Identities=22% Similarity=0.335 Sum_probs=66.4
Q ss_pred ceeeeeeeecCCCCceEEEecCCCccccccccceEEEEEccchhhhhhcccCCCCCCCCCCChhhHHHhhcCccCCCCCC
Q 002995 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDGYDYDEMVVLAASLDDCQELE 83 (860)
Q Consensus 4 ~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l~~e~i~~~~s~~e~~~l~~~~~~~~~~~~~~~~de~~~~~~~~~~~~~f~ 83 (860)
+|.|.|++.- ....-.|.+++++.+.+...+ +..+.. .-...++
T Consensus 14 YY~GtV~~~~-~~~~~lV~f~~~~~~~v~~~~---------------------------------iI~~~~--~~~~~L~ 57 (124)
T PF15057_consen 14 YYPGTVKKCV-SSGQFLVEFDDGDTQEVPISD---------------------------------IIALSD--AMRHSLQ 57 (124)
T ss_pred EEeEEEEEcc-CCCEEEEEECCCCEEEeChHH---------------------------------eEEccC--cccCcCC
Confidence 6889999887 666777778777766654411 111111 1156789
Q ss_pred CCCEEEEEec--ccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccc
Q 002995 84 PGDIIWAKLT--GHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKG 147 (860)
Q Consensus 84 ~GdlVwaK~~--g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~ 147 (860)
+||-|+|+.. ++.|=||+|..-.+.. ......|.|.||.. ..+.++...+....+.
T Consensus 58 ~GD~VLA~~~~~~~~Y~Pg~V~~~~~~~-------~~~~~~~~V~f~ng-~~~~vp~~~~~~I~~~ 115 (124)
T PF15057_consen 58 VGDKVLAPWEPDDCRYGPGTVIAGPERR-------ASEDKEYTVRFYNG-KTAKVPRGEVIWISPS 115 (124)
T ss_pred CCCEEEEecCcCCCEEeCEEEEECcccc-------ccCCceEEEEEECC-CCCccchhhEEECCHH
Confidence 9999999953 7778899998632211 34556799999954 4677777776665543
No 50
>PF15446 zf-PHD-like: PHD/FYVE-zinc-finger like domain
Probab=95.41 E-value=0.015 Score=56.44 Aligned_cols=70 Identities=21% Similarity=0.485 Sum_probs=48.0
Q ss_pred ccccccC-CcccCCCeeEEccccCceeccccccccc--------cCCC-CceeeecccCCCC------CCCCCccccCCC
Q 002995 398 KCSVCHM-DEEYQNNLFLQCDKCRMMVHARCYGELE--------PVNG-VLWLCNLCRPGAP------EPPPPCCLCPVV 461 (860)
Q Consensus 398 ~C~vC~~-~~~~~~n~ll~C~~C~~~vH~~CYg~~~--------~~~~-~~W~C~~C~~~~~------~~~~~C~lC~~~ 461 (860)
.|.+|.. +.+.+..+||+|.+|..++|+.|.|... +-++ ----|.+|..... ...-.|.-|...
T Consensus 1 ~C~~C~~~g~~~~kG~Lv~CQGCs~sYHk~CLG~Rs~ReHlVTKVg~d~FVLQCr~Cig~~~kKD~~aP~~~~C~~C~~~ 80 (175)
T PF15446_consen 1 TCDTCGYEGDDRNKGPLVYCQGCSSSYHKACLGPRSQREHLVTKVGDDDFVLQCRRCIGIAHKKDPRAPHHGMCQQCKKP 80 (175)
T ss_pred CcccccCCCCCccCCCeEEcCccChHHHhhhcCCccccceeeEEEcCCceEEechhhcChhhcccCCCCCCCcccccCCC
Confidence 5999976 4566677899999999999999999432 2221 1355999975321 123469999887
Q ss_pred CCCcee
Q 002995 462 GGAMKP 467 (860)
Q Consensus 462 gGalk~ 467 (860)
|-+-++
T Consensus 81 G~~c~p 86 (175)
T PF15446_consen 81 GPSCKP 86 (175)
T ss_pred CCCCcc
Confidence 755443
No 51
>KOG0825 consensus PHD Zn-finger protein [General function prediction only]
Probab=95.11 E-value=0.0095 Score=69.49 Aligned_cols=49 Identities=27% Similarity=0.713 Sum_probs=39.1
Q ss_pred CCCccccccCCcccCCCeeEEccccCce-ecccccccc--ccCCCCceeeecccC
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMM-VHARCYGEL--EPVNGVLWLCNLCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~-vH~~CYg~~--~~~~~~~W~C~~C~~ 446 (860)
+...|.||...+.+ +.||.||.|+.+ +|.+|+... .++-+ .|+|+-|..
T Consensus 214 E~~~C~IC~~~DpE--dVLLLCDsCN~~~YH~YCLDPdl~eiP~~-eWYC~NC~d 265 (1134)
T KOG0825|consen 214 EEVKCDICTVHDPE--DVLLLCDSCNKVYYHVYCLDPDLSESPVN-EWYCTNCSL 265 (1134)
T ss_pred ccccceeeccCChH--HhheeecccccceeeccccCccccccccc-ceecCcchh
Confidence 34689999987654 569999999999 999999753 24555 899999965
No 52
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=93.79 E-value=0.021 Score=66.04 Aligned_cols=48 Identities=29% Similarity=0.684 Sum_probs=40.0
Q ss_pred CccccccCCcccCCCeeEEccccCceecccccccc----ccCCCCceeeecccCC
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCRPG 447 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~~~ 447 (860)
++|+-|.+.+.. |.||.||.|...|||.|.--+ ..|.| .|+|..|...
T Consensus 254 ~fCsaCn~~~~F--~~~i~CD~Cp~sFH~~CLePPl~~eniP~g-~W~C~ec~~k 305 (613)
T KOG4299|consen 254 DFCSACNGSGLF--NDIICCDGCPRSFHQTCLEPPLEPENIPPG-SWFCPECKIK 305 (613)
T ss_pred HHHHHhCCcccc--ccceeecCCchHHHHhhcCCCCCcccCCCC-ccccCCCeee
Confidence 599999998776 779999999999999998643 34555 8999999753
No 53
>PF08169 RBB1NT: RBB1NT (NUC162) domain; InterPro: IPR012603 This domain is found N-terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family [].; PDB: 2YRV_A.
Probab=92.32 E-value=0.24 Score=44.00 Aligned_cols=85 Identities=21% Similarity=0.289 Sum_probs=48.8
Q ss_pred CCCEEEEEe--cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995 84 PGDIIWAKL--TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF 161 (860)
Q Consensus 84 ~GdlVwaK~--~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~ 161 (860)
+|-+|-... +.-.|-||.|+.++.-.. . .-++.+++|+=|-|..|+=|+.+++..|..+ .........
T Consensus 8 lGkVV~V~~~~~k~~W~PALVVsPsc~dd--v---~VkKD~~lVRSFkD~KfysV~rkd~~e~~~~-----~~~k~e~s~ 77 (96)
T PF08169_consen 8 LGKVVCVESTKKKTSWFPALVVSPSCNDD--V---TVKKDQCLVRSFKDGKFYSVARKDVREFDID-----SLPKSESSL 77 (96)
T ss_dssp TTSEEEEE-SS-SS-EEEEEEE--SS-SS---------TT-EEEEESSS--EEEE-TTTEE---ST-----TS-HHHHHH
T ss_pred cCcEEEEEcCCCCCceeeEEEEcCCccce--e---eeccceEEEEEeccCceEEEEhhhhhhcccc-----cCCcccchh
Confidence 788887764 344699999999754333 1 1346789999999999999999999999864 111223467
Q ss_pred HHHHHHHHHHHHhCCCc
Q 002995 162 TQSLEEAKVYLSEQKLP 178 (860)
Q Consensus 162 ~~A~~ea~~~~~~~~~~ 178 (860)
+.|++.|..+++....|
T Consensus 78 k~al~~A~~Fl~~~~vP 94 (96)
T PF08169_consen 78 KPALDKASTFLKTGVVP 94 (96)
T ss_dssp -HHHHHHHHHHHS----
T ss_pred hHHHHHHHHHHhcCCCC
Confidence 89999999998887665
No 54
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=91.95 E-value=0.15 Score=60.32 Aligned_cols=59 Identities=20% Similarity=0.236 Sum_probs=53.3
Q ss_pred CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhH
Q 002995 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE 739 (860)
Q Consensus 681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~ 739 (860)
+..-|.|+..|-+.+.+|.+.+...+|||++...+|..|.||+-|.|.+++...++.-+
T Consensus 784 ~~~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks~ 842 (1262)
T KOG1141|consen 784 CGPDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKSE 842 (1262)
T ss_pred CcHHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhhh
Confidence 34467899999999999999999999999999999999999999999999888777764
No 55
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=91.66 E-value=0.079 Score=57.25 Aligned_cols=36 Identities=28% Similarity=0.751 Sum_probs=32.0
Q ss_pred CeeEEccc--cC-ceeccccccccccCCCCceeeecccCC
Q 002995 411 NLFLQCDK--CR-MMVHARCYGELEPVNGVLWLCNLCRPG 447 (860)
Q Consensus 411 n~ll~C~~--C~-~~vH~~CYg~~~~~~~~~W~C~~C~~~ 447 (860)
..||-||+ |. -=||..|.|+..+|.| .|+|..|...
T Consensus 230 g~Mi~CDn~~C~~eWFH~~CVGL~~~Pkg-kWyC~~C~~~ 268 (274)
T KOG1973|consen 230 GKMIGCDNPGCPIEWFHFTCVGLKTKPKG-KWYCPRCKAE 268 (274)
T ss_pred ccccccCCCCCCcceEEEeccccccCCCC-cccchhhhhh
Confidence 34999997 99 8899999999999988 6999999864
No 56
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=91.47 E-value=0.12 Score=40.76 Aligned_cols=33 Identities=24% Similarity=0.568 Sum_probs=26.6
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccc
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCY 428 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CY 428 (860)
...+|.+|+..-.. .+.+|+|..|+...|..||
T Consensus 4 ~~~~C~~Cg~~~~~-~dDiVvCp~CgapyHR~C~ 36 (54)
T PF14446_consen 4 EGCKCPVCGKKFKD-GDDIVVCPECGAPYHRDCW 36 (54)
T ss_pred cCccChhhCCcccC-CCCEEECCCCCCcccHHHH
Confidence 34689999986532 2349999999999999999
No 57
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=91.09 E-value=0.13 Score=53.11 Aligned_cols=54 Identities=26% Similarity=0.606 Sum_probs=40.0
Q ss_pred CccccccCCCccccccCCcccCCCeeEEcc--ccCc-eeccccccccccCCCCceeeecccC
Q 002995 388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCD--KCRM-MVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 388 ~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~--~C~~-~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
++....++...|= |...-. .+||-|| +|.. =||..|.|+..+|.| .|+|+.|+.
T Consensus 213 s~d~se~e~lYCf-CqqvSy---GqMVaCDn~nCkrEWFH~~CVGLk~pPKG-~WYC~eCk~ 269 (271)
T COG5034 213 SEDNSEGEELYCF-CQQVSY---GQMVACDNANCKREWFHLECVGLKEPPKG-KWYCPECKK 269 (271)
T ss_pred ccccccCceeEEE-eccccc---ccceecCCCCCchhheeccccccCCCCCC-cEeCHHhHh
Confidence 3343444555664 776532 3499999 6876 499999999999998 999999974
No 58
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=90.85 E-value=0.28 Score=39.25 Aligned_cols=52 Identities=15% Similarity=0.209 Sum_probs=40.7
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE--EEEeCCCcc
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF--ARINVKQVI 142 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~--awv~~~~l~ 142 (860)
|++|+.|+++.+.--|-+|.|++... ..+...|.|.|.|-+.+ -||+.++|.
T Consensus 1 ~~vG~~v~~~~~~~~~y~A~I~~~r~---------~~~~~~YyVHY~g~nkR~DeWV~~~~i~ 54 (55)
T PF11717_consen 1 FEVGEKVLCKYKDGQWYEAKILDIRE---------KNGEPEYYVHYQGWNKRLDEWVPESRIR 54 (55)
T ss_dssp --TTEEEEEEETTTEEEEEEEEEEEE---------CTTCEEEEEEETTSTGCC-EEEETTTEE
T ss_pred CCcCCEEEEEECCCcEEEEEEEEEEe---------cCCCEEEEEEcCCCCCCceeeecHHHcc
Confidence 67999999999777788999987432 23446799999999885 899999875
No 59
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=90.48 E-value=0.41 Score=39.04 Aligned_cols=53 Identities=13% Similarity=0.206 Sum_probs=38.5
Q ss_pred CCCCCCEEEEEe-cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeC--CCCEEEEeCCCccccc
Q 002995 81 ELEPGDIIWAKL-TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG--THDFARINVKQVISFL 145 (860)
Q Consensus 81 ~f~~GdlVwaK~-~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg--~~~~awv~~~~l~~f~ 145 (860)
.|++||+|-|+. ..-.||+|+|.... +.+.|.|.|.+ ....-=++.++|.|-.
T Consensus 2 ~~~~G~~Ve~~~~~~~~W~~a~V~~~~------------~~~~~~V~~~~~~~~~~e~v~~~~LRp~~ 57 (61)
T smart00743 2 DFKKGDRVEVFSKEEDSWWEAVVTKVL------------GDGKYLVRYLTESEPLKETVDWSDLRPHP 57 (61)
T ss_pred CcCCCCEEEEEECCCCEEEEEEEEEEC------------CCCEEEEEECCCCcccEEEEeHHHcccCC
Confidence 589999999998 45679999997631 13458999988 5555556666666643
No 60
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=89.34 E-value=0.29 Score=39.13 Aligned_cols=33 Identities=30% Similarity=0.582 Sum_probs=29.1
Q ss_pred CCceeeeeeeecCCCCceEEEecC-CCcccccccc
Q 002995 2 QDWYSGFVVGYDSESNRHHVKYVD-GDEEDLILSN 35 (860)
Q Consensus 2 ~~~~~g~v~~~~~~~~~h~~~y~d-~d~e~~~l~~ 35 (860)
..||.|.|+++++. +..+|.|.| |..|++...+
T Consensus 16 ~~wyra~I~~~~~~-~~~~V~f~D~G~~~~v~~~~ 49 (57)
T smart00333 16 GEWYRARIIKVDGE-QLYEVFFIDYGNEEVVPPSD 49 (57)
T ss_pred CCEEEEEEEEECCC-CEEEEEEECCCccEEEeHHH
Confidence 36999999999998 889999999 9999887643
No 62
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=88.40 E-value=0.69 Score=36.92 Aligned_cols=54 Identities=11% Similarity=0.110 Sum_probs=44.3
Q ss_pred CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccc
Q 002995 81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLK 146 (860)
Q Consensus 81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~ 146 (860)
.|++|++|-|+...=-|.+|+|..... .+.+.|.|.+.....||+..+|.++..
T Consensus 2 ~~~~G~~~~a~~~d~~wyra~I~~~~~------------~~~~~V~f~D~G~~~~v~~~~l~~l~~ 55 (57)
T smart00333 2 TFKVGDKVAARWEDGEWYRARIIKVDG------------EQLYEVFFIDYGNEEVVPPSDLRPLPE 55 (57)
T ss_pred CCCCCCEEEEEeCCCCEEEEEEEEECC------------CCEEEEEEECCCccEEEeHHHeecCCC
Confidence 478999999999666699999987421 146899999989999999999988754
No 63
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=87.68 E-value=0.22 Score=59.78 Aligned_cols=50 Identities=32% Similarity=0.684 Sum_probs=39.6
Q ss_pred ccccCCCccccccCCcccCCCeeEEccccCceecccccccc--ccCCCCceeeecccC
Q 002995 391 VDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EPVNGVLWLCNLCRP 446 (860)
Q Consensus 391 ~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~~~~~~W~C~~C~~ 446 (860)
..|.+...|.||...+. +|-|+.|-..+|..|.+.. ..+.++ |+|.+|..
T Consensus 42 ~~~~~~e~c~ic~~~g~-----~l~c~tC~~s~h~~cl~~pl~~~p~~~-~~c~Rc~~ 93 (696)
T KOG0383|consen 42 WDDAEQEACRICADGGE-----LLWCDTCPASFHASCLGPPLTPQPNGE-FICPRCFC 93 (696)
T ss_pred cchhhhhhhhhhcCCCc-----EEEeccccHHHHHHccCCCCCcCCccc-eeeeeecc
Confidence 34566789999987643 7889999999999999854 345565 99999954
No 64
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=85.02 E-value=0.69 Score=52.00 Aligned_cols=80 Identities=23% Similarity=0.351 Sum_probs=49.9
Q ss_pred HHhhcCCCCCCCchhh-hhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceecccccccc--cc
Q 002995 357 LILGLTKSRPTSKSSL-CKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EP 433 (860)
Q Consensus 357 li~~Lp~a~~c~k~~~-~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~ 433 (860)
|++.-|+...++++.+ +|..++ ..|... ....-...|.||....+ ..++++||.|.+.+|..|+.-+ ..
T Consensus 510 l~e~~pks~~p~~~~~~rk~g~~--st~~~s----~~~a~~ysCgiCkks~d--QHll~~CDtC~lhYHlGCL~PPLTR~ 581 (707)
T KOG0957|consen 510 LTELGPKSGLPLSDFNNRKSGSR--STPAIS----APKAMNYSCGICKKSTD--QHLLTQCDTCHLHYHLGCLSPPLTRL 581 (707)
T ss_pred hhhcCCccCCcCccccccccccc--CCcccc----cccccceeeeeeccchh--hHHHhhcchhhceeeccccCCccccC
Confidence 5666677777766433 222211 112111 11233458999987644 4569999999999999999732 33
Q ss_pred CC---CCceeeecc
Q 002995 434 VN---GVLWLCNLC 444 (860)
Q Consensus 434 ~~---~~~W~C~~C 444 (860)
|. ...|.|..|
T Consensus 582 Pkk~kn~gWqCsEC 595 (707)
T KOG0957|consen 582 PKKNKNFGWQCSEC 595 (707)
T ss_pred cccccCcceeeccc
Confidence 32 347999999
No 65
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=84.63 E-value=0.51 Score=36.88 Aligned_cols=29 Identities=38% Similarity=0.954 Sum_probs=24.2
Q ss_pred ceecCC--cceeeEEcCCCCcCccccchhhhhc
Q 002995 511 CSICGV--SYGACIQCSNTTCRVAYHPLCARAA 541 (860)
Q Consensus 511 C~~C~~--~~Ga~iqC~~~~C~~~fH~~CA~~~ 541 (860)
|.+|++ ..+.+|+|. .|..+||..|....
T Consensus 2 C~vC~~~~~~~~~i~C~--~C~~~~H~~C~~~~ 32 (51)
T PF00628_consen 2 CPVCGQSDDDGDMIQCD--SCNRWYHQECVGPP 32 (51)
T ss_dssp BTTTTSSCTTSSEEEBS--TTSCEEETTTSTSS
T ss_pred CcCCCCcCCCCCeEEcC--CCChhhCcccCCCC
Confidence 677876 478999998 89999999998544
No 66
>KOG1245 consensus Chromatin remodeling complex WSTF-ISWI, large subunit (contains heterochromatin localization, PHD and BROMO domains) [Chromatin structure and dynamics]
Probab=84.07 E-value=0.25 Score=64.19 Aligned_cols=49 Identities=29% Similarity=0.636 Sum_probs=40.4
Q ss_pred CCccccccCCcccCCCeeEEccccCceeccccccc--cccCCCCceeeecccCC
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE--LEPVNGVLWLCNLCRPG 447 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~~ 447 (860)
...|.||...... +.++.|+.|..++|..|... ..++.+ .|+|..|...
T Consensus 1108 ~~~c~~cr~k~~~--~~m~lc~~c~~~~h~~C~rp~~~~~~~~-dW~C~~c~~e 1158 (1404)
T KOG1245|consen 1108 NALCKVCRRKKQD--EKMLLCDECLSGFHLFCLRPALSSVPPG-DWMCPSCRKE 1158 (1404)
T ss_pred hhhhhhhhhcccc--hhhhhhHhhhhhHHHHhhhhhhccCCcC-CccCCccchh
Confidence 3589999987654 44999999999999999984 456767 8999999864
No 67
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=84.06 E-value=0.78 Score=37.34 Aligned_cols=23 Identities=17% Similarity=0.484 Sum_probs=21.0
Q ss_pred CCceeeeeeeecCCCCceEEEecC
Q 002995 2 QDWYSGFVVGYDSESNRHHVKYVD 25 (860)
Q Consensus 2 ~~~~~g~v~~~~~~~~~h~~~y~d 25 (860)
..||.|.|++++. .+++.|.|.+
T Consensus 17 ~~W~~a~V~~~~~-~~~~~V~~~~ 39 (61)
T smart00743 17 DSWWEAVVTKVLG-DGKYLVRYLT 39 (61)
T ss_pred CEEEEEEEEEECC-CCEEEEEECC
Confidence 4799999999998 7789999998
No 68
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=83.76 E-value=0.49 Score=49.32 Aligned_cols=75 Identities=21% Similarity=0.477 Sum_probs=51.9
Q ss_pred CccccccCCccc-----CCCeeEEccccCceecccccccc----ccCCCCceeeecccCCCCCCCCCccccCCCCCC--c
Q 002995 397 DKCSVCHMDEEY-----QNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGA--M 465 (860)
Q Consensus 397 ~~C~vC~~~~~~-----~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGa--l 465 (860)
..|+-|+++... -...||.|..|+..-|.+|+-.. ..+....|-|..|+. |.+|+..... |
T Consensus 225 ~YCDFclgdsr~nkkt~~peelvscsdcgrsghpsclqft~nm~~avk~yrwqcieck~--------csicgtsenddql 296 (336)
T KOG1244|consen 225 PYCDFCLGDSRENKKTGMPEELVSCSDCGRSGHPSCLQFTANMIAAVKTYRWQCIECKY--------CSICGTSENDDQL 296 (336)
T ss_pred cccceeccccccccccCCchhhcchhhcCCCCCcchhhhhHHHHHHHHhheeeeeecce--------eccccCcCCCcee
Confidence 489999876522 12459999999999999999642 234456899999984 8888765432 2
Q ss_pred eec-CCCceeccccc
Q 002995 466 KPT-TDGRWAHLACA 479 (860)
Q Consensus 466 k~t-~~g~wvH~~Ca 479 (860)
.-. ...+=.|+-|.
T Consensus 297 lfcddcdrgyhmycl 311 (336)
T KOG1244|consen 297 LFCDDCDRGYHMYCL 311 (336)
T ss_pred EeecccCCceeeEec
Confidence 222 23456787775
No 69
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=82.97 E-value=0.54 Score=32.54 Aligned_cols=28 Identities=25% Similarity=0.829 Sum_probs=12.5
Q ss_pred ccccccCCcccCCCeeEEccccCceecccc
Q 002995 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARC 427 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~C 427 (860)
.|++|...... .....|..|+..+|..|
T Consensus 2 ~C~~C~~~~~~--~~~Y~C~~Cdf~lH~~C 29 (30)
T PF07649_consen 2 RCDACGKPIDG--GWFYRCSECDFDLHEEC 29 (30)
T ss_dssp --TTTS----S----EEE-TTT-----HHH
T ss_pred cCCcCCCcCCC--CceEECccCCCccChhc
Confidence 59999876543 46899999999999988
No 70
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=81.43 E-value=1.3 Score=34.03 Aligned_cols=32 Identities=31% Similarity=0.517 Sum_probs=28.0
Q ss_pred CCceeeeeeeecCCCCceEEEecC-CCccccccc
Q 002995 2 QDWYSGFVVGYDSESNRHHVKYVD-GDEEDLILS 34 (860)
Q Consensus 2 ~~~~~g~v~~~~~~~~~h~~~y~d-~d~e~~~l~ 34 (860)
..||.|.|.+.++ .+.-.|.|.| |..|.+.+.
T Consensus 12 ~~wyra~V~~~~~-~~~~~V~f~DyG~~~~v~~~ 44 (48)
T cd04508 12 GKWYRAKITSILS-DGKVEVFFVDYGNTEVVPLS 44 (48)
T ss_pred CeEEEEEEEEECC-CCcEEEEEEcCCCcEEEeHH
Confidence 4799999999998 7789999999 999887653
No 71
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=81.16 E-value=0.55 Score=49.18 Aligned_cols=49 Identities=16% Similarity=0.455 Sum_probs=37.1
Q ss_pred CccccccCCccc----CCCeeEEccccCceecccccccc----ccCCCCceeeeccc
Q 002995 397 DKCSVCHMDEEY----QNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCR 445 (860)
Q Consensus 397 ~~C~vC~~~~~~----~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~ 445 (860)
..|.+|...... .-|-+|.|..|....|..|.... .......|-|-.|+
T Consensus 259 ~~~~~~~~~~~~~~~~r~~S~I~C~~C~~~~HP~Ci~M~~elv~~~KTY~W~C~~C~ 315 (381)
T KOG1512|consen 259 NERKHFWDIQTNIIQSRRNSWIVCKPCATRPHPYCVAMIPELVGQYKTYFWKCSSCE 315 (381)
T ss_pred hhhhhhhcchhhhhhhhhccceeecccccCCCCcchhcCHHHHhHHhhcchhhcccH
Confidence 579999875432 34669999999999999998643 22344589999997
No 72
>PF00130 C1_1: Phorbol esters/diacylglycerol binding domain (C1 domain); InterPro: IPR002219 Diacylglycerol (DAG) is an important second messenger. Phorbol esters (PE) are analogues of DAG and potent tumour promoters that cause a variety of physiological changes when administered to both cells and tissues. DAG activates a family of serine/threonine protein kinases, collectively known as protein kinase C (PKC) []. Phorbol esters can directly stimulate PKC. The N-terminal region of PKC, known as C1, has been shown [] to bind PE and DAG in a phospholipid and zinc-dependent fashion. The C1 region contains one or two copies (depending on the isozyme of PKC) of a cysteine-rich domain, which is about 50 amino-acid residues long, and which is essential for DAG/PE-binding. The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are probably the six cysteines and two histidines that are conserved in this domain.; GO: 0035556 intracellular signal transduction; PDB: 1RFH_A 2FNF_X 3PFQ_A 1PTQ_A 1PTR_A 2VRW_B 1XA6_A 2ENN_A 1TBN_A 1TBO_A ....
Probab=80.17 E-value=1.9 Score=33.90 Aligned_cols=35 Identities=20% Similarity=0.414 Sum_probs=25.6
Q ss_pred CCccccccCCcccCCCeeEEccccCceeccccccc
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE 430 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~ 430 (860)
...|++|...--.....-+.|..|++.+|..|...
T Consensus 11 ~~~C~~C~~~i~g~~~~g~~C~~C~~~~H~~C~~~ 45 (53)
T PF00130_consen 11 PTYCDVCGKFIWGLGKQGYRCSWCGLVCHKKCLSK 45 (53)
T ss_dssp TEB-TTSSSBECSSSSCEEEETTTT-EEETTGGCT
T ss_pred CCCCcccCcccCCCCCCeEEECCCCChHhhhhhhh
Confidence 36899998754224455789999999999999864
No 73
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=80.01 E-value=1 Score=52.89 Aligned_cols=110 Identities=23% Similarity=0.412 Sum_probs=63.4
Q ss_pred CccccCCCcHHHHH--HHhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccC
Q 002995 343 GSDMFGFSNPEVMK--LILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420 (860)
Q Consensus 343 G~~~FGls~p~v~~--li~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~ 420 (860)
...-||.-...+++ ..+.||.|.+ +.+-...-++-+.+.|+..+. ...-|.+|+..-+.- -++|..|+
T Consensus 1069 a~raFGtCSKAfmkLe~~e~l~~a~k------q~ye~La~~iFsk~~p~d~~~-~~vdc~~cg~~i~~~---~~~c~ec~ 1138 (1189)
T KOG2041|consen 1069 AVRAFGTCSKAFMKLEAFEELDDAEK------QEYENLAFRIFSKNPPVDPNS-AKVDCSVCGAKIDPY---DLQCSECQ 1138 (1189)
T ss_pred hhhhhhhhHHHHHHHHhhhhCCHHHH------HHHHHHHHHHhccCCCCCCCc-cceeeeecCCcCCcc---CCCChhhc
Confidence 34467777777765 3556665553 111112223344455555443 346899998753321 36899998
Q ss_pred ceecccccccccc-CCCCceeeecccCCCCC----CCCCccccCCCCC
Q 002995 421 MMVHARCYGELEP-VNGVLWLCNLCRPGAPE----PPPPCCLCPVVGG 463 (860)
Q Consensus 421 ~~vH~~CYg~~~~-~~~~~W~C~~C~~~~~~----~~~~C~lC~~~gG 463 (860)
-.+- -|-....+ .+..-|+|.+|+..... .-..|.||...-+
T Consensus 1139 ~kfP-~CiasG~pIt~~~fWlC~~CkH~a~~~EIs~y~~CPLCHs~~~ 1185 (1189)
T KOG2041|consen 1139 TKFP-VCIASGRPITDNIFWLCPRCKHRAHQHEISKYNCCPLCHSMES 1185 (1189)
T ss_pred CcCc-eeeccCCccccceEEEccccccccccccccccccCccccChhh
Confidence 7764 46543222 23347999999986543 2245888875433
No 74
>PF03107 C1_2: C1 domain; InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=79.71 E-value=1.5 Score=30.33 Aligned_cols=28 Identities=32% Similarity=0.840 Sum_probs=22.4
Q ss_pred ccccccCCcccCCCeeEEccccCceecccc
Q 002995 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARC 427 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~C 427 (860)
.|+||+..-+... ...|+.|+..+|..|
T Consensus 2 ~C~~C~~~~~~~~--~Y~C~~c~f~lh~~C 29 (30)
T PF03107_consen 2 WCDVCRRKIDGFY--FYHCSECCFTLHVRC 29 (30)
T ss_pred CCCCCCCCcCCCE--eEEeCCCCCeEcCcc
Confidence 5999976544322 899999999999988
No 75
>KOG2084 consensus Predicted histone tail methylase containing SET domain [Chromatin structure and dynamics]
Probab=79.21 E-value=2.8 Score=48.81 Aligned_cols=40 Identities=33% Similarity=0.422 Sum_probs=28.8
Q ss_pred cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCC-eEEEecCCC
Q 002995 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWE-ELTYDYRFF 816 (860)
Q Consensus 773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GE-ELt~dY~~~ 816 (860)
++||||.||+.. ..+ .....+++...+.+++ ||++.|-..
T Consensus 208 ~~~hsC~pn~~~---~~~-~~~~~~~~~~~~~~~~~~l~~~y~~~ 248 (482)
T KOG2084|consen 208 LFNHSCFPNISV---IFD-GRGLALLVPAGIDAGEEELTISYTDP 248 (482)
T ss_pred hcccCCCCCeEE---EEC-CceeEEEeecccCCCCCEEEEeeccc
Confidence 679999999872 233 3445566777777776 999999543
No 76
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=78.51 E-value=0.93 Score=49.02 Aligned_cols=29 Identities=38% Similarity=0.787 Sum_probs=24.7
Q ss_pred cceecCC-cceeeEEcCCCCcC-ccccchhh
Q 002995 510 LCSICGV-SYGACIQCSNTTCR-VAYHPLCA 538 (860)
Q Consensus 510 ~C~~C~~-~~Ga~iqC~~~~C~-~~fH~~CA 538 (860)
.-++|.+ .+|-||.|...+|. .|||..|-
T Consensus 220 ~yC~Cnqvsyg~Mi~CDn~~C~~eWFH~~CV 250 (274)
T KOG1973|consen 220 TYCICNQVSYGKMIGCDNPGCPIEWFHFTCV 250 (274)
T ss_pred EEEEecccccccccccCCCCCCcceEEEecc
Confidence 3456665 69999999999999 99999996
No 77
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=77.91 E-value=3 Score=31.92 Aligned_cols=47 Identities=15% Similarity=0.205 Sum_probs=37.3
Q ss_pred CCEEEEEecc-cCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccc
Q 002995 85 GDIIWAKLTG-HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVIS 143 (860)
Q Consensus 85 GdlVwaK~~g-~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~ 143 (860)
|++|.|+... --|.+|+|.... ..+.+.|.|.+-.....|+.++|.+
T Consensus 1 G~~c~a~~~~d~~wyra~V~~~~------------~~~~~~V~f~DyG~~~~v~~~~l~~ 48 (48)
T cd04508 1 GDLCLAKYSDDGKWYRAKITSIL------------SDGKVEVFFVDYGNTEVVPLSDLRP 48 (48)
T ss_pred CCEEEEEECCCCeEEEEEEEEEC------------CCCcEEEEEEcCCCcEEEeHHHcCC
Confidence 7899999986 779999997632 1455899999888888888877754
No 78
>PF11793 FANCL_C: FANCL C-terminal domain; PDB: 3K1L_A.
Probab=77.71 E-value=1.6 Score=36.90 Aligned_cols=50 Identities=18% Similarity=0.365 Sum_probs=19.0
Q ss_pred CccccccCCcc-cCCCeeEEcc--ccCceeccccccc-----ccc---CCCCceeeecccC
Q 002995 397 DKCSVCHMDEE-YQNNLFLQCD--KCRMMVHARCYGE-----LEP---VNGVLWLCNLCRP 446 (860)
Q Consensus 397 ~~C~vC~~~~~-~~~n~ll~C~--~C~~~vH~~CYg~-----~~~---~~~~~W~C~~C~~ 446 (860)
..|.||..... .+.-+.+.|+ .|+..+|..|.-. ... ..-..+.|+.|..
T Consensus 3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~wf~~~~~~~~~~~~~~G~CP~C~~ 63 (70)
T PF11793_consen 3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEWFLSLEKSRQSFIPIFGECPYCSS 63 (70)
T ss_dssp -S-SSS--SS-TT-----B--S-TT----B-SGGGHHHHHHHHSSS-TTT--EEE-TTT-S
T ss_pred CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHHHHHcccCCeeecccccCCcCCCC
Confidence 57999998654 3344578998 9999999999852 111 1112467888874
No 79
>cd00029 C1 Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Probab=75.21 E-value=1.4 Score=33.96 Aligned_cols=34 Identities=24% Similarity=0.480 Sum_probs=25.7
Q ss_pred CccccccCCcccCCCeeEEccccCceeccccccc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE 430 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~ 430 (860)
..|++|...--.-...-+.|+.|++.||..|...
T Consensus 12 ~~C~~C~~~i~~~~~~~~~C~~C~~~~H~~C~~~ 45 (50)
T cd00029 12 TFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADK 45 (50)
T ss_pred CChhhcchhhhccccceeEcCCCCCchhhhhhcc
Confidence 5899998643222245678999999999999863
No 80
>KOG1337 consensus N-methyltransferase [General function prediction only]
Probab=73.55 E-value=2.3 Score=49.86 Aligned_cols=40 Identities=33% Similarity=0.359 Sum_probs=30.5
Q ss_pred cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCC
Q 002995 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF 815 (860)
Q Consensus 773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~ 815 (860)
+.||++.+. ...+......+.+++.++|.+|||++++||.
T Consensus 239 ~~NH~~~~~---~~~~~~~d~~~~l~~~~~v~~geevfi~YG~ 278 (472)
T KOG1337|consen 239 LLNHSPEVI---KAGYNQEDEAVELVAERDVSAGEEVFINYGP 278 (472)
T ss_pred hhccCchhc---cccccCCCCcEEEEEeeeecCCCeEEEecCC
Confidence 569999992 1122222348999999999999999999976
No 81
>PF07039 DUF1325: SGF29 tudor-like domain; InterPro: IPR010750 SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 []. This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=72.77 E-value=16 Score=34.79 Aligned_cols=107 Identities=22% Similarity=0.351 Sum_probs=59.6
Q ss_pred CCceeeeeeeecCCCCceEEEecCCCccccccccceEEEEEccchhhhhhcccCCCCCCCCCCChhhHHHhhcCccCCCC
Q 002995 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDGYDYDEMVVLAASLDDCQE 81 (860)
Q Consensus 2 ~~~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l~~e~i~~~~s~~e~~~l~~~~~~~~~~~~~~~~de~~~~~~~~~~~~~ 81 (860)
+.|.-+.|++|+.++++..|.=.|-++ +.-++.++....--|-.+.. ........
T Consensus 17 ~~WIla~Vv~~~~~~~rYeV~D~d~~~-------~~~~~~~~~~~iIPLP~~~~------------------~~~~~~~~ 71 (130)
T PF07039_consen 17 EEWILAEVVKYNSDGNRYEVEDPDPEE-------EKKRYKLSRKQIIPLPKKAP------------------PDTDPLAE 71 (130)
T ss_dssp CEEEEEEEEEEETTTTEEEEEETTTCT-------TTEEEEEEGGGEEEE-SB--------------------TTT-GGGS
T ss_pred CCEEEEEEEEEeCCCCEEEEecCCCCC-------CCceEEeCHHHEEECCCccC------------------CCCCchhh
Confidence 469999999999999877776544443 11144444433333322111 01122457
Q ss_pred CCCCCEEEEEe-cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE---EEEeCCCccc
Q 002995 82 LEPGDIIWAKL-TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF---ARINVKQVIS 143 (860)
Q Consensus 82 f~~GdlVwaK~-~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~---awv~~~~l~~ 143 (860)
|..|..|+|.. ..--.-||.|...+ ....+.|.|.|=|+.+- --|+..-+.+
T Consensus 72 f~~g~~VLAlYP~TT~FY~A~V~~~p----------~~~~~~y~l~Fedd~~~~~~~~V~~r~Vv~ 127 (130)
T PF07039_consen 72 FPKGTKVLALYPDTTCFYPATVVSPP----------KKKSGEYKLKFEDDEDADGYREVPQRYVVA 127 (130)
T ss_dssp --TT-EEEEE-TTSSEEEEEEEEEE-----------SSTTS-EEEEECTTTSTTSBEEE-GGGEEE
T ss_pred CCCCCEEEEECCCCceEEEEEEEeCC----------CCCCCcEEEEEeCCCCcCCcEEEccceEEc
Confidence 99999999995 33346789997742 13456799999988662 3444444443
No 82
>PF02318 FYVE_2: FYVE-type zinc finger; InterPro: IPR003315 This entry represents the zinc-binding domain found in rabphilin Rab3A. The small G protein Rab3A plays an important role in the regulation of neurotransmitter release. The crystal structure of the small G protein Rab3A complexed with the effector domain of rabphilin-3A shows that the effector domain of rabphilin-3A contacts Rab3A in two distinct areas. The first interface involves the Rab3A switch I and switch II regions, which are sensitive to the nucleotide-binding state of Rab3A. The second interface consists of a deep pocket in Rab3A that interacts with a SGAWFF structural element of rabphilin-3A. Sequence and structure analysis, and biochemical data suggest that this pocket, or Rab complementarity-determining region (RabCDR), establishes a specific interaction between each Rab protein and its effectors. It has been suggested that RabCDRs could be major determinants of effector specificity during vesicle trafficking and fusion [].; GO: 0008270 zinc ion binding, 0017137 Rab GTPase binding, 0006886 intracellular protein transport; PDB: 2CSZ_A 2ZET_C 1ZBD_B 3BC1_B 2CJS_C 2A20_A.
Probab=70.86 E-value=3.5 Score=38.57 Aligned_cols=49 Identities=22% Similarity=0.510 Sum_probs=35.1
Q ss_pred CCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
...|..|......-.|.-..|..|...|=..|-.. . .....|+|..|..
T Consensus 54 ~~~C~~C~~~fg~l~~~~~~C~~C~~~VC~~C~~~-~-~~~~~WlC~vC~k 102 (118)
T PF02318_consen 54 ERHCARCGKPFGFLFNRGRVCVDCKHRVCKKCGVY-S-KKEPIWLCKVCQK 102 (118)
T ss_dssp CSB-TTTS-BCSCTSTTCEEETTTTEEEETTSEEE-T-SSSCCEEEHHHHH
T ss_pred CcchhhhCCcccccCCCCCcCCcCCccccCccCCc-C-CCCCCEEChhhHH
Confidence 35899998765444555689999999999999543 2 2334899999975
No 83
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=69.82 E-value=2.1 Score=50.31 Aligned_cols=81 Identities=21% Similarity=0.338 Sum_probs=51.8
Q ss_pred ccccCccccCCCcHHHHHH-HhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEcc
Q 002995 339 ISESGSDMFGFSNPEVMKL-ILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCD 417 (860)
Q Consensus 339 ~~~sG~~~FGls~p~v~~l-i~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~ 417 (860)
....+.+.|++.||....+ ++.+. -..+... ..-.+|..|+..++ ...+++|+
T Consensus 34 ~m~ac~~c~~~yH~~cvt~~~~~~~-------------l~~gWrC-----------~~crvCe~c~~~gD--~~kf~~Ck 87 (694)
T KOG4443|consen 34 RLLACSDCGQKYHPYCVTSWAQHAV-------------LSGGWRC-----------PSCRVCEACGTTGD--PKKFLLCK 87 (694)
T ss_pred cchhhhhhcccCCcchhhHHHhHHH-------------hcCCccc-----------CCceeeeeccccCC--cccccccc
Confidence 3468889999999986653 22110 0011111 23357888874443 45699999
Q ss_pred ccCceeccccccc--cccCCCCceeeecccC
Q 002995 418 KCRMMVHARCYGE--LEPVNGVLWLCNLCRP 446 (860)
Q Consensus 418 ~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~ 446 (860)
+|.+.+|-+|--- ..++.+ +|+|..|..
T Consensus 88 ~cDvsyh~yc~~P~~~~v~sg-~~~ckk~~~ 117 (694)
T KOG4443|consen 88 RCDVSYHCYCQKPPNDKVPSG-PWLCKKCTR 117 (694)
T ss_pred cccccccccccCCccccccCc-ccccHHHHh
Confidence 9999999999642 234555 899998853
No 84
>KOG1473 consensus Nucleosome remodeling factor, subunit NURF301/BPTF [Chromatin structure and dynamics; Transcription]
Probab=68.54 E-value=5.4 Score=49.59 Aligned_cols=110 Identities=25% Similarity=0.423 Sum_probs=70.0
Q ss_pred CCccccccCCcccCCCeeEEccccCceecccccccc--ccCCCCceeeecccCCCCCCCCCccccCCCCCCceecC-CCc
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT-DGR 472 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~-~g~ 472 (860)
++.|-+|....+ ++.|..|...||..|.--+ .++. ..|-|..|..-+.+....|+|=+.+.+...+.+ .|.
T Consensus 344 ddhcrf~~d~~~-----~lc~Et~prvvhlEcv~hP~~~~~s-~~~e~evc~~hkvngvvd~vl~~~K~~~~iR~~~iG~ 417 (1414)
T KOG1473|consen 344 DDHCRFCHDLGD-----LLCCETCPRVVHLECVFHPRFAVPS-AFWECEVCNIHKVNGVVDCVLPPSKNVDSIRHTPIGR 417 (1414)
T ss_pred cccccccCcccc-----eeecccCCceEEeeecCCccccCCC-ccchhhhhhhhccCcccccccChhhcccceeccCCCc
Confidence 468999986544 8899999999999996422 2333 389999998766555566887777766663332 111
Q ss_pred eeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccc-hhh
Q 002995 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHP-LCA 538 (860)
Q Consensus 473 wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~-~CA 538 (860)
==|-. -+| .+ .-.|.||.. .+..+ |+++.|...||. .|-
T Consensus 418 dr~gr-~yw-------------------fi-----~rrl~Ie~~-det~l-~yysT~pqly~ll~cL 457 (1414)
T KOG1473|consen 418 DRYGR-KYW-------------------FI-----SRRLRIEGM-DETLL-WYYSTCPQLYHLLRCL 457 (1414)
T ss_pred Ccccc-chh-------------------ce-----eeeeEEecC-CCcEE-EEecCcHHHHHHHHHh
Confidence 00000 000 00 126888886 44444 556789999999 664
No 85
>cd04718 BAH_plant_2 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=68.30 E-value=3.2 Score=40.14 Aligned_cols=26 Identities=27% Similarity=0.600 Sum_probs=20.6
Q ss_pred eecccccc--ccccCCCCceeeecccCCC
Q 002995 422 MVHARCYG--ELEPVNGVLWLCNLCRPGA 448 (860)
Q Consensus 422 ~vH~~CYg--~~~~~~~~~W~C~~C~~~~ 448 (860)
++|..|.. +..+|+| .|+|+.|....
T Consensus 1 g~H~~CL~Ppl~~~P~g-~W~Cp~C~~~~ 28 (148)
T cd04718 1 GFHLCCLRPPLKEVPEG-DWICPFCEVEK 28 (148)
T ss_pred CcccccCCCCCCCCCCC-CcCCCCCcCCC
Confidence 47999997 4457777 89999998653
No 86
>KOG1044 consensus Actin-binding LIM Zn-finger protein Limatin involved in axon guidance [Signal transduction mechanisms; Cytoskeleton]
Probab=68.01 E-value=2.9 Score=48.35 Aligned_cols=102 Identities=23% Similarity=0.472 Sum_probs=51.0
Q ss_pred CceeeecccCCCCC------CCCCccccC---CCCCCceecCCCceeccccccccCceEEccCCCcc--ccccchhhhcc
Q 002995 437 VLWLCNLCRPGAPE------PPPPCCLCP---VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRME--PIDGLNRVSKD 505 (860)
Q Consensus 437 ~~W~C~~C~~~~~~------~~~~C~lC~---~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~--~v~~i~~i~~~ 505 (860)
..-+|+.|....+. ....|+-|. ..|++|.-. +.+| |+.|..---...+.+.+.|. -+---..-...
T Consensus 112 k~~~c~~c~~~~~~~p~~~~~ps~cagc~~~lk~gq~llal-d~qw-hv~cfkc~~c~~vL~gey~skdg~pyce~dy~~ 189 (670)
T KOG1044|consen 112 KECLCQTCSQPMPVSPAESYGPSTCAGCGEELKNGQALLAL-DKQW-HVSCFKCKSCSAVLNGEYMSKDGVPYCEKDYQA 189 (670)
T ss_pred hhhhhhhhcCcccCCcccccCCccccchhhhhhccceeeee-ccce-eeeeeehhhhcccccceeeccCCCcchhhhhhh
Confidence 34667777654332 234677775 344554433 3456 44443221111111111111 01112233445
Q ss_pred ccCccceecCC-cceeeEEcCCCCcCccccchhhhhcCee
Q 002995 506 RWKLLCSICGV-SYGACIQCSNTTCRVAYHPLCARAAGLC 544 (860)
Q Consensus 506 ~~~~~C~~C~~-~~Ga~iqC~~~~C~~~fH~~CA~~~g~~ 544 (860)
.+..+|..|.+ ..|..+|=. . ..|||+||+=.-+.
T Consensus 190 ~fgvkc~~c~~fisgkvLqag--~--kh~HPtCARCsRCg 225 (670)
T KOG1044|consen 190 KFGVKCEECEKFISGKVLQAG--D--KHFHPTCARCSRCG 225 (670)
T ss_pred hcCeehHHhhhhhhhhhhhcc--C--cccCcchhhhhhhc
Confidence 56678888875 456666653 3 78999999776554
No 87
>smart00109 C1 Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Probab=65.67 E-value=2.4 Score=32.34 Aligned_cols=33 Identities=27% Similarity=0.522 Sum_probs=24.9
Q ss_pred CCccccccCCcccCCCeeEEccccCceecccccc
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg 429 (860)
...|++|...-.... .-+.|..|++.+|..|..
T Consensus 11 ~~~C~~C~~~i~~~~-~~~~C~~C~~~~H~~C~~ 43 (49)
T smart00109 11 PTKCCVCRKSIWGSF-QGLRCSWCKVKCHKKCAE 43 (49)
T ss_pred CCCccccccccCcCC-CCcCCCCCCchHHHHHHh
Confidence 358999987532211 367899999999999975
No 88
>PF13901 DUF4206: Domain of unknown function (DUF4206)
Probab=64.39 E-value=4.5 Score=41.67 Aligned_cols=42 Identities=26% Similarity=0.789 Sum_probs=32.4
Q ss_pred CccccccCCc---ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 397 DKCSVCHMDE---EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 397 ~~C~vC~~~~---~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
..|.+|..++ ..+.+..+.|.+|+..+|..|+.- . -|++|..
T Consensus 153 fiCe~C~~~~~IfPF~~~~~~~C~~C~~v~H~~C~~~-----~---~CpkC~R 197 (202)
T PF13901_consen 153 FICEICNSDDIIFPFQIDTTVRCPKCKSVFHKSCFRK-----K---SCPKCAR 197 (202)
T ss_pred CCCccCCCCCCCCCCCCCCeeeCCcCccccchhhcCC-----C---CCCCcHh
Confidence 5899998764 234456889999999999999972 1 2888864
No 89
>PF14569 zf-UDP: Zinc-binding RING-finger; PDB: 1WEO_A.
Probab=60.67 E-value=3.9 Score=34.73 Aligned_cols=50 Identities=24% Similarity=0.616 Sum_probs=23.2
Q ss_pred CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
....|.||+.. -..+.+.++.|..|+..|-..||--.. .+| .-.|..|..
T Consensus 8 ~~qiCqiCGD~VGl~~~Ge~FVAC~eC~fPvCr~CyEYEr-keg-~q~CpqCkt 59 (80)
T PF14569_consen 8 NGQICQICGDDVGLTENGEVFVACHECAFPVCRPCYEYER-KEG-NQVCPQCKT 59 (80)
T ss_dssp SS-B-SSS--B--B-SSSSB--S-SSS-----HHHHHHHH-HTS--SB-TTT--
T ss_pred CCcccccccCccccCCCCCEEEEEcccCCccchhHHHHHh-hcC-cccccccCC
Confidence 34699999874 245678999999999999999996322 223 567888864
No 90
>PF11764 N-SET: COMPASS (Complex proteins associated with Set1p) component N; InterPro: IPR024657 The COMPASS complex (complex proteins associated with Set1p) is conserved in yeasts and in other eukaryotes up to humans. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for the silencing of genes close to the telomeres of chromosomes []. Five domains are conserved in Saccharomyces cerevisiae Set1 and other eukaryotic Set1-related proteins: an amino-terminal RNA-recognition motif (RRM), a semi-conserved central domain, an N-SET domain, the catalytic SET domain, and the C-terminal post-SET domain. This entry represents the N-SET domain, which promotes trimethylation in conjunction with the RRM domain [] and is necessary for binding of the Spp1 component of COMPASS into the complex [].; PDB: 3UVO_B 4ES0_C 3UVN_D 4EWR_C.
Probab=57.65 E-value=2.5 Score=42.12 Aligned_cols=16 Identities=31% Similarity=0.507 Sum_probs=6.0
Q ss_pred CCCCCCcccCCccccc
Q 002995 603 PSNPSGCARSEPYNYF 618 (860)
Q Consensus 603 ~~npsGcaRsE~~~~~ 618 (860)
..|++||||+|||+++
T Consensus 78 ~~h~TGsARtEGy~KI 93 (167)
T PF11764_consen 78 VPHSTGSARTEGYYKI 93 (167)
T ss_dssp -----SSHHHH-----
T ss_pred ccCCCcCcccccceeC
Confidence 3578999999999754
No 91
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=56.27 E-value=4.1 Score=47.98 Aligned_cols=48 Identities=25% Similarity=0.637 Sum_probs=38.0
Q ss_pred CccccccCCcccCCCeeEEccccCceecccccccc---ccCCCCceeeeccc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL---EPVNGVLWLCNLCR 445 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~---~~~~~~~W~C~~C~ 445 (860)
..|-+|...+....+.|+.|..|+...|..|.... .+..+ .|.|..|.
T Consensus 19 ~mc~l~~s~G~~~ag~m~ac~~c~~~yH~~cvt~~~~~~~l~~-gWrC~~cr 69 (694)
T KOG4443|consen 19 LMCPLCGSSGKGRAGRLLACSDCGQKYHPYCVTSWAQHAVLSG-GWRCPSCR 69 (694)
T ss_pred hhhhhhccccccccCcchhhhhhcccCCcchhhHHHhHHHhcC-CcccCCce
Confidence 47888888777778889999999999999998731 22333 59999997
No 92
>PF05502 Dynactin_p62: Dynactin p62 family; InterPro: IPR008603 Dynactin is a multi-subunit complex and a required cofactor for most, or all, o f the cellular processes powered by the microtubule-based motor cytoplasmic dyn ein. p62 binds directly to the Arp1 subunit of dynactin [, ].
Probab=55.82 E-value=8.5 Score=45.10 Aligned_cols=53 Identities=21% Similarity=0.478 Sum_probs=37.4
Q ss_pred eeEEccccCceeccccccccccCCCCceeeecccCCCCC--------CC-CCccccCCCCCCceec
Q 002995 412 LFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPE--------PP-PPCCLCPVVGGAMKPT 468 (860)
Q Consensus 412 ~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~--------~~-~~C~lC~~~gGalk~t 468 (860)
.|++|..|.......|... +-..|||..|....+. .- ..|+-||.=.+.|.-+
T Consensus 4 ~L~fC~~C~~irc~~c~~~----Ei~~~yCp~CL~~~p~~e~~~~~nrC~r~Cf~CP~C~~~L~~~ 65 (483)
T PF05502_consen 4 ELYFCEHCHKIRCPRCVSE----EIDSYYCPNCLFEVPSSEARSEKNRCSRNCFDCPICFSPLSVR 65 (483)
T ss_pred cceecccccccCChhhccc----ccceeECccccccCChhhheeccceeccccccCCCCCCcceeE
Confidence 4889999999888888542 2237999999875442 12 3788998766666543
No 93
>KOG3799 consensus Rab3 effector RIM1 and related proteins, contain Rab3a binding domain [Intracellular trafficking, secretion, and vesicular transport]
Probab=54.98 E-value=5.5 Score=37.29 Aligned_cols=52 Identities=35% Similarity=0.749 Sum_probs=35.9
Q ss_pred cCCCccccccCCcccCCCeeEEccccCceecccccccc-ccCCCCceeeecccC
Q 002995 394 KDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL-EPVNGVLWLCNLCRP 446 (860)
Q Consensus 394 ~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~-~~~~~~~W~C~~C~~ 446 (860)
.++..|.||....-.+. .=-.|.-|++.+-+.|-|-. .-....-|+|..|..
T Consensus 63 ~ddatC~IC~KTKFADG-~GH~C~YCq~r~CARCGGrv~lrsNKv~wvcnlc~k 115 (169)
T KOG3799|consen 63 GDDATCGICHKTKFADG-CGHNCSYCQTRFCARCGGRVSLRSNKVMWVCNLCRK 115 (169)
T ss_pred CcCcchhhhhhcccccc-cCcccchhhhhHHHhcCCeeeeccCceEEeccCCcH
Confidence 35679999987432221 12358999999999998743 333455899999975
No 94
>PF11781 RRN7: RNA polymerase I-specific transcription initiation factor Rrn7; InterPro: IPR021752 Rrn7 is a transcription binding factor that associates strongly with both Rrn6 and Rrn11 to form a complex which itself binds the TATA-binding protein and is required for transcription by the core domain of the RNA PolI promoter [],[].
Probab=51.92 E-value=11 Score=27.45 Aligned_cols=26 Identities=31% Similarity=0.744 Sum_probs=20.0
Q ss_pred CccccccCCcccCCCeeEEccccCce
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMM 422 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~ 422 (860)
..|.+|....-..++-..+|+.|+-.
T Consensus 9 ~~C~~C~~~~~~~~dG~~yC~~cG~~ 34 (36)
T PF11781_consen 9 EPCPVCGSRWFYSDDGFYYCDRCGHQ 34 (36)
T ss_pred CcCCCCCCeEeEccCCEEEhhhCceE
Confidence 45999998765566668899999854
No 95
>COG5475 Uncharacterized small protein [Function unknown]
Probab=51.13 E-value=35 Score=27.14 Aligned_cols=52 Identities=15% Similarity=0.232 Sum_probs=40.6
Q ss_pred CCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCC--CCEEEEeCCCcccccc
Q 002995 80 QELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGT--HDFARINVKQVISFLK 146 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~--~~~awv~~~~l~~f~~ 146 (860)
..|..||+|-.|-+| |++++..- ..++.|..+||.- ..++=.+++.|+|+..
T Consensus 3 ~~FstgdvV~lKsGG----P~Mtvs~~-----------ss~Gmy~C~Wf~g~g~~~~~F~ed~Lvp~~a 56 (60)
T COG5475 3 MSFSTGDVVTLKSGG----PRMTVSGY-----------SSDGMYECRWFDGYGVKREAFHEDELVPGEA 56 (60)
T ss_pred ceeecCcEEEeecCC----ceEEEecc-----------ccCCeEEEEEecCCCcccccccccceecccc
Confidence 469999999999988 88998731 2347899999964 3477788889988764
No 96
>PF08746 zf-RING-like: RING-like domain; InterPro: IPR014857 This is a zinc finger domain that is related to the C3HC4 RING finger domain (IPR001841 from INTERPRO). ; PDB: 3NW0_A 2CT0_A.
Probab=50.70 E-value=7.4 Score=29.46 Aligned_cols=30 Identities=30% Similarity=0.639 Sum_probs=17.0
Q ss_pred ceecCCcceeeEEcCCCCcCccccchhhhh
Q 002995 511 CSICGVSYGACIQCSNTTCRVAYHPLCARA 540 (860)
Q Consensus 511 C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~ 540 (860)
|.+|+...-.-+.|....|...+|..|+..
T Consensus 1 C~~C~~iv~~G~~C~~~~C~~r~H~~C~~~ 30 (43)
T PF08746_consen 1 CEACKEIVTQGQRCSNRDCNVRLHDDCFKK 30 (43)
T ss_dssp -TTT-SB-SSSEE-SS--S--EE-HHHHHH
T ss_pred CcccchhHeeeccCCCCccCchHHHHHHHH
Confidence 667877666668999999999999999854
No 97
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=49.28 E-value=12 Score=39.27 Aligned_cols=40 Identities=25% Similarity=0.603 Sum_probs=29.9
Q ss_pred chhhhccccCccceecCC-cceeeEEcCCCCcCc-cccchhh
Q 002995 499 LNRVSKDRWKLLCSICGV-SYGACIQCSNTTCRV-AYHPLCA 538 (860)
Q Consensus 499 i~~i~~~~~~~~C~~C~~-~~Ga~iqC~~~~C~~-~fH~~CA 538 (860)
++....+-.....++|++ ++|-||.|...+|.. |||..|-
T Consensus 211 vss~d~se~e~lYCfCqqvSyGqMVaCDn~nCkrEWFH~~CV 252 (271)
T COG5034 211 VSSEDNSEGEELYCFCQQVSYGQMVACDNANCKREWFHLECV 252 (271)
T ss_pred cCccccccCceeEEEecccccccceecCCCCCchhheecccc
Confidence 333333344455678987 699999999999975 6999996
No 98
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=46.75 E-value=9.8 Score=43.24 Aligned_cols=14 Identities=43% Similarity=0.971 Sum_probs=11.8
Q ss_pred CCceeccccccccC
Q 002995 470 DGRWAHLACAIWIP 483 (860)
Q Consensus 470 ~g~wvH~~Ca~~~p 483 (860)
.|.|+|+-||+-..
T Consensus 152 CgH~cH~dCALr~~ 165 (446)
T PF07227_consen 152 CGHWCHLDCALRHE 165 (446)
T ss_pred CCceehhhhhcccc
Confidence 67999999998743
No 99
>PHA02862 5L protein; Provisional
Probab=43.98 E-value=7.7 Score=37.10 Aligned_cols=50 Identities=28% Similarity=0.606 Sum_probs=34.2
Q ss_pred CccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCC
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVG 462 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~g 462 (860)
+.|=||+..++++. .--.|.+-..-||+.|.. .|+ .......|.||....
T Consensus 3 diCWIC~~~~~e~~-~PC~C~GS~K~VHq~CL~--------~WI-------n~S~k~~CeLCkteY 52 (156)
T PHA02862 3 DICWICNDVCDERN-NFCGCNEEYKVVHIKCMQ--------LWI-------NYSKKKECNLCKTKY 52 (156)
T ss_pred CEEEEecCcCCCCc-ccccccCcchhHHHHHHH--------HHH-------hcCCCcCccCCCCeE
Confidence 58999998765443 345688889999999986 455 122344677776543
No 100
>PF09926 DUF2158: Uncharacterized small protein (DUF2158); InterPro: IPR019226 This entry represents a family of predominantly prokaryotic proteins with no known function.
Probab=43.53 E-value=45 Score=26.56 Aligned_cols=41 Identities=32% Similarity=0.570 Sum_probs=27.7
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD 132 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~ 132 (860)
|++||+|--|-+| |.|++.. +.. ..-...+.+..+||..+.
T Consensus 1 f~~GDvV~LKSGG----p~MTV~~--v~~----~~~~~~~~v~C~WFd~~~ 41 (53)
T PF09926_consen 1 FKIGDVVQLKSGG----PRMTVTE--VGP----NAGASGGWVECQWFDGHG 41 (53)
T ss_pred CCCCCEEEEccCC----CCeEEEE--ccc----cccCCCCeEEEEeCCCCC
Confidence 7899999999998 6677762 111 111234568899997654
No 101
>PF13639 zf-RING_2: Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=42.90 E-value=13 Score=27.97 Aligned_cols=31 Identities=23% Similarity=0.584 Sum_probs=22.6
Q ss_pred CccccccCCcccCCCeeEEccccCceecccccc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg 429 (860)
+.|.||...-.. .+.++... |+-.+|..|..
T Consensus 1 d~C~IC~~~~~~-~~~~~~l~-C~H~fh~~Ci~ 31 (44)
T PF13639_consen 1 DECPICLEEFED-GEKVVKLP-CGHVFHRSCIK 31 (44)
T ss_dssp -CETTTTCBHHT-TSCEEEET-TSEEEEHHHHH
T ss_pred CCCcCCChhhcC-CCeEEEcc-CCCeeCHHHHH
Confidence 379999987644 33455555 99999999975
No 102
>PF10080 DUF2318: Predicted membrane protein (DUF2318); InterPro: IPR018758 This domain of unknown function is found in hypothetical bacterial membrane proteins with no known function.
Probab=42.49 E-value=13 Score=33.86 Aligned_cols=34 Identities=26% Similarity=0.601 Sum_probs=26.3
Q ss_pred CCccccccCCcccCCCeeEEccccCceecccccc
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg 429 (860)
.+.|+||....-.....-|.|..|++.++..=.|
T Consensus 35 ~daCeiC~~~GY~q~g~~lvC~~C~~~~~~~~ig 68 (102)
T PF10080_consen 35 FDACEICGPKGYYQEGDQLVCKNCGVRFNLPTIG 68 (102)
T ss_pred EEeccccCCCceEEECCEEEEecCCCEEehhhcc
Confidence 3689999776655555577899999999987554
No 103
>PF05641 Agenet: Agenet domain; InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=41.92 E-value=22 Score=29.65 Aligned_cols=26 Identities=23% Similarity=0.500 Sum_probs=20.5
Q ss_pred CceeeeeeeecCCCCceEEEecCCCcc
Q 002995 3 DWYSGFVVGYDSESNRHHVKYVDGDEE 29 (860)
Q Consensus 3 ~~~~g~v~~~~~~~~~h~~~y~d~d~e 29 (860)
+||.|.|.+.+... +..|+|+|-..+
T Consensus 19 aWf~a~V~~~~~~~-~~~V~Y~~~~~~ 44 (68)
T PF05641_consen 19 AWFPATVLKENGDD-KYLVEYDDLPDE 44 (68)
T ss_dssp EEEEEEEEEEETT--EEEEEETT-SS-
T ss_pred EEEEEEEEEeCCCc-EEEEEECCcccc
Confidence 69999999999888 999999765444
No 104
>cd04712 BAH_DCM_I BAH, or Bromo Adjacent Homology domain, as present in DNA (Cytosine-5)-methyltransferases (DCM) 1. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the genome. These effects include transcriptional repression via inhibition of transcription factor binding, the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting, and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=41.32 E-value=24 Score=33.64 Aligned_cols=47 Identities=11% Similarity=0.191 Sum_probs=30.7
Q ss_pred CCCCCCCCEEEEEecccCC---------CceEeeCCCCcCccccccccCCCCcEEEEEeCC
Q 002995 79 CQELEPGDIIWAKLTGHAM---------WPAIVVDESLIGDYKGLNKISGGRSIPVQFFGT 130 (860)
Q Consensus 79 ~~~f~~GdlVwaK~~g~Pw---------WPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~ 130 (860)
+..+++||.|..+-.+-+- ||+.|.....+.. ...+...++|+||--
T Consensus 3 ~~~i~vGD~V~v~~d~~~~~~~~~~~~~~~~~i~~V~~~~e-----~~~g~~~~h~~W~yr 58 (130)
T cd04712 3 GLTIRVGDVVSVERDDADSTTKWNDDHRWLPLVQFVEYMKK-----GSDGSKMFHGRWLYR 58 (130)
T ss_pred CCEEeCCCEEEEcCCCCCccccccccccccceEEEEEEeee-----cCCCceEEEEEEEEc
Confidence 4568999999999876553 5777765433222 223445788998843
No 105
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=41.18 E-value=19 Score=45.62 Aligned_cols=64 Identities=22% Similarity=0.365 Sum_probs=46.3
Q ss_pred CCCCCEEEEEecccCCCceEeeCCCCcCcccccc------------------ccCCCCcEEEEEeCCCC-EEEEeCCCcc
Q 002995 82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN------------------KISGGRSIPVQFFGTHD-FARINVKQVI 142 (860)
Q Consensus 82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~------------------~~~~~~~~~V~FFg~~~-~awv~~~~l~ 142 (860)
+.+=++||||..||||-|+.++++.........+ .+.....+.|-||.+.- .-|++...+.
T Consensus 943 l~~~~~~~akc~g~~s~~~l~l~p~~~~~~~~~~g~~~~p~~dv~~l~eq~~~~~~~~l~~~L~~~n~~~~~~~~~s~~~ 1022 (1051)
T KOG0955|consen 943 LEELKLVWAKCRGYPSYPALILDPKMPREGNFHNGPDPAPPTDVLALPEQRTNKAPETLFLVLFFDNKRCWQWLPRSKVL 1022 (1051)
T ss_pred eeehhceeehhcCCccchhhhcccccccccCccCCCCCCCCcccccchHHHhcccChhheEEEeecccccccccCCCCcc
Confidence 7777899999999999999999986655433211 11122345689997755 5799999887
Q ss_pred ccc
Q 002995 143 SFL 145 (860)
Q Consensus 143 ~f~ 145 (860)
+.-
T Consensus 1023 ~l~ 1025 (1051)
T KOG0955|consen 1023 ELG 1025 (1051)
T ss_pred ccc
Confidence 764
No 106
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=41.12 E-value=14 Score=25.46 Aligned_cols=27 Identities=33% Similarity=0.775 Sum_probs=12.2
Q ss_pred cceecCCcce--eeEEcCCCCcCccccchhh
Q 002995 510 LCSICGVSYG--ACIQCSNTTCRVAYHPLCA 538 (860)
Q Consensus 510 ~C~~C~~~~G--a~iqC~~~~C~~~fH~~CA 538 (860)
.|..|+.+.. ....|. .|.-.+|..||
T Consensus 2 ~C~~C~~~~~~~~~Y~C~--~Cdf~lH~~Ca 30 (30)
T PF07649_consen 2 RCDACGKPIDGGWFYRCS--ECDFDLHEECA 30 (30)
T ss_dssp --TTTS----S--EEE-T--TT-----HHHH
T ss_pred cCCcCCCcCCCCceEECc--cCCCccChhcC
Confidence 5888988644 577897 89999999997
No 107
>PF10447 EXOSC1: Exosome component EXOSC1/CSL4; InterPro: IPR019495 The exosome mediates degradation of unstable mRNAs that contain AU-rich elements (AREs) within their 3' untranslated regions []. The proteins in this entry are components of the exosome 3'->5' exoribonuclease complex. They do not have exonuclease activity, but are required for the 3'-processing of the 7S pre-RNA to the mature 5.8S rRNA and for mRNA decay [, ].; PDB: 2NN6_I.
Probab=40.97 E-value=17 Score=31.72 Aligned_cols=14 Identities=36% Similarity=0.741 Sum_probs=10.4
Q ss_pred CCCCCCCCEEEEEe
Q 002995 79 CQELEPGDIIWAKL 92 (860)
Q Consensus 79 ~~~f~~GdlVwaK~ 92 (860)
.+-|+|||+|-|||
T Consensus 66 ~~~FrpGDIVrA~V 79 (82)
T PF10447_consen 66 YDCFRPGDIVRARV 79 (82)
T ss_dssp GGT--SSSEEEEEE
T ss_pred HhccCCCCEEEEEE
Confidence 45699999999997
No 108
>COG5194 APC11 Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning]
Probab=39.05 E-value=9.6 Score=32.58 Aligned_cols=31 Identities=32% Similarity=0.866 Sum_probs=25.5
Q ss_pred cceecCCc-ceeeEEcCC------------CCcCccccchhhhh
Q 002995 510 LCSICGVS-YGACIQCSN------------TTCRVAYHPLCARA 540 (860)
Q Consensus 510 ~C~~C~~~-~Ga~iqC~~------------~~C~~~fH~~CA~~ 540 (860)
.|.||+.. .|.|++|.. +.|..+||..|-.+
T Consensus 22 ~CaICRnhim~~C~eCq~~~~~~~eC~v~wG~CnHaFH~HCI~r 65 (88)
T COG5194 22 VCAICRNHIMGTCPECQFGMTPGDECPVVWGVCNHAFHDHCIYR 65 (88)
T ss_pred hhhhhhccccCcCcccccCCCCCCcceEEEEecchHHHHHHHHH
Confidence 69999863 789999987 35999999999854
No 109
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=38.98 E-value=18 Score=29.24 Aligned_cols=16 Identities=31% Similarity=0.376 Sum_probs=11.9
Q ss_pred EEEEccCCCCCCeEEE
Q 002995 796 IIFAKRDIKQWEELTY 811 (860)
Q Consensus 796 ~i~A~RdI~~GEELt~ 811 (860)
+++|.|||++|+.|+-
T Consensus 3 vvVA~~di~~G~~i~~ 18 (63)
T PF08666_consen 3 VVVAARDIPAGTVITA 18 (63)
T ss_dssp EEEESSTB-TT-BECT
T ss_pred EEEEeCccCCCCEEcc
Confidence 4799999999999963
No 110
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=38.31 E-value=45 Score=38.41 Aligned_cols=61 Identities=13% Similarity=0.104 Sum_probs=45.7
Q ss_pred CCCCCCCCCEEEEEecccCCC-ceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE--EEEeCCCccc
Q 002995 78 DCQELEPGDIIWAKLTGHAMW-PAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF--ARINVKQVIS 143 (860)
Q Consensus 78 ~~~~f~~GdlVwaK~~g~PwW-Pa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~--awv~~~~l~~ 143 (860)
....|.+|+.|.|+..+-+-| +|.|+.-.... ....+...|.|.|-|.+.+ -||..++|..
T Consensus 50 ~~~~~~VGekVla~~~~Dg~~~~A~VI~~R~~~-----~~~~~~~~YYVHY~g~nrRlDEWV~~~rLdl 113 (450)
T PLN00104 50 VMLPLEVGTRVMCRWRFDGKYHPVKVIERRRGG-----SGGPNDYEYYVHYTEFNRRLDEWVKLEQLDL 113 (450)
T ss_pred ccceeccCCEEEEEECCCCCEEEEEEEEEeccC-----CCCCCCceEEEEEecCCccHhhccCHhhccc
Confidence 344699999999999977877 99998854211 0012334699999999988 8999999853
No 111
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=37.76 E-value=30 Score=27.60 Aligned_cols=37 Identities=30% Similarity=0.745 Sum_probs=31.0
Q ss_pred CccceecCCc---ceeeEEcCCCCcCccccchhhhhcCeeEE
Q 002995 508 KLLCSICGVS---YGACIQCSNTTCRVAYHPLCARAAGLCVE 546 (860)
Q Consensus 508 ~~~C~~C~~~---~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~ 546 (860)
..+|.+|+.. .+-.++|- .|.+.||-.|...+|-...
T Consensus 5 ~~~C~~Cg~~~~~~dDiVvCp--~CgapyHR~C~~~~g~C~~ 44 (54)
T PF14446_consen 5 GCKCPVCGKKFKDGDDIVVCP--ECGAPYHRDCWEKAGGCIN 44 (54)
T ss_pred CccChhhCCcccCCCCEEECC--CCCCcccHHHHhhCCceEe
Confidence 3479999974 57889997 9999999999999887653
No 112
>PLN02436 cellulose synthase A
Probab=36.78 E-value=21 Score=44.99 Aligned_cols=49 Identities=24% Similarity=0.643 Sum_probs=38.0
Q ss_pred CCccccccCCc--ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 396 LDKCSVCHMDE--EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 396 ~~~C~vC~~~~--~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
..+|.||+++- ..++++.|-|.-|+..|-..||- ..-.+| .=.|+.|+.
T Consensus 36 ~~iCqICGD~Vg~t~dGe~FVACn~C~fpvCr~Cye-yer~eg-~~~Cpqckt 86 (1094)
T PLN02436 36 GQTCQICGDEIELTVDGEPFVACNECAFPVCRPCYE-YERREG-NQACPQCKT 86 (1094)
T ss_pred CccccccccccCcCCCCCEEEeeccCCCccccchhh-hhhhcC-CccCcccCC
Confidence 35999998752 55678899999999999999994 222333 678999975
No 113
>PF13771 zf-HC5HC2H: PHD-like zinc-binding domain
Probab=35.21 E-value=34 Score=29.95 Aligned_cols=34 Identities=26% Similarity=0.346 Sum_probs=27.3
Q ss_pred CCCccccCCCCCCceecC---CCceeccccccccCce
Q 002995 452 PPPCCLCPVVGGAMKPTT---DGRWAHLACAIWIPET 485 (860)
Q Consensus 452 ~~~C~lC~~~gGalk~t~---~g~wvH~~Ca~~~p~~ 485 (860)
...|.+|...+||..+.. .....|+.||+...-.
T Consensus 36 ~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~ 72 (90)
T PF13771_consen 36 KLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCF 72 (90)
T ss_pred CCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeE
Confidence 468999999988887765 4689999999886543
No 114
>PF12906 RINGv: RING-variant domain; PDB: 2D8S_A 1VYX_A.
Probab=34.92 E-value=13 Score=28.63 Aligned_cols=30 Identities=33% Similarity=0.824 Sum_probs=18.0
Q ss_pred cccccCCcccCCCeeEEccccCc---eecccccc
Q 002995 399 CSVCHMDEEYQNNLFLQCDKCRM---MVHARCYG 429 (860)
Q Consensus 399 C~vC~~~~~~~~n~ll~C~~C~~---~vH~~CYg 429 (860)
|-||+.+++.+...+.-| +|.- .||+.|+-
T Consensus 1 CrIC~~~~~~~~~li~pC-~C~Gs~~~vH~~CL~ 33 (47)
T PF12906_consen 1 CRICLEGEEEDEPLISPC-RCKGSMKYVHRSCLE 33 (47)
T ss_dssp ETTTTEE-SSSS-EE-SS-S-SSCCGSEECCHHH
T ss_pred CeEeCCcCCCCCceeccc-ccCCCcchhHHHHHH
Confidence 678988766544233344 3555 99999985
No 115
>PLN02400 cellulose synthase
Probab=34.69 E-value=27 Score=44.32 Aligned_cols=50 Identities=26% Similarity=0.688 Sum_probs=38.2
Q ss_pred CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
...+|.||+++ -..+.++.|-|..|+..|-.-||- -.-.+| .=.|+.|+.
T Consensus 35 ~gqiCqICGD~VG~t~dGe~FVAC~eCaFPVCRpCYE-YERkeG-nq~CPQCkT 86 (1085)
T PLN02400 35 NGQICQICGDDVGVTETGDVFVACNECAFPVCRPCYE-YERKDG-TQCCPQCKT 86 (1085)
T ss_pred CCceeeecccccCcCCCCCEEEEEccCCCccccchhh-eecccC-CccCcccCC
Confidence 34599999874 245677899999999999999994 222334 678999975
No 116
>PLN02638 cellulose synthase A (UDP-forming), catalytic subunit
Probab=34.55 E-value=23 Score=44.82 Aligned_cols=50 Identities=26% Similarity=0.644 Sum_probs=38.0
Q ss_pred CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
...+|.||+++ -..+.++.|-|..|+..|-..||- ..-.+| .=.|+.|+.
T Consensus 16 ~~qiCqICGD~vg~~~~Ge~FVAC~eC~FPVCrpCYE-YEr~eG-~q~CPqCkt 67 (1079)
T PLN02638 16 GGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYE-YERKDG-NQSCPQCKT 67 (1079)
T ss_pred CCceeeecccccCcCCCCCEEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence 34599999874 245677899999999999999994 222334 678999975
No 117
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=34.37 E-value=20 Score=43.00 Aligned_cols=30 Identities=23% Similarity=0.238 Sum_probs=23.0
Q ss_pred cceecCCcceeeEEcCCCCcCccccchhhhhc
Q 002995 510 LCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541 (860)
Q Consensus 510 ~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~ 541 (860)
.|.+|-..+|++-.=.. =..|-|+.||+.-
T Consensus 325 pCvLCPkkGGamK~~~s--gT~wAHvsCALwI 354 (893)
T KOG0954|consen 325 PCVLCPKKGGAMKPTKS--GTKWAHVSCALWI 354 (893)
T ss_pred CeeeccccCCcccccCC--CCeeeEeeeeecc
Confidence 69999998888876553 3488999999543
No 118
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=34.26 E-value=20 Score=24.96 Aligned_cols=25 Identities=28% Similarity=0.797 Sum_probs=13.4
Q ss_pred CccccccCCcccCCCeeEEccccCc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRM 421 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~ 421 (860)
..|-.|.+.....++.++.|..|+-
T Consensus 3 p~Cp~C~se~~y~D~~~~vCp~C~~ 27 (30)
T PF08274_consen 3 PKCPLCGSEYTYEDGELLVCPECGH 27 (30)
T ss_dssp ---TTT-----EE-SSSEEETTTTE
T ss_pred CCCCCCCCcceeccCCEEeCCcccc
Confidence 4788999888888888889998864
No 119
>PF12773 DZR: Double zinc ribbon
Probab=33.50 E-value=32 Score=26.59 Aligned_cols=38 Identities=21% Similarity=0.458 Sum_probs=23.0
Q ss_pred eEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCcccc
Q 002995 413 FLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458 (860)
Q Consensus 413 ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC 458 (860)
..+|..|+..+- ......++|..|..........|.-|
T Consensus 12 ~~fC~~CG~~l~--------~~~~~~~~C~~Cg~~~~~~~~fC~~C 49 (50)
T PF12773_consen 12 AKFCPHCGTPLP--------PPDQSKKICPNCGAENPPNAKFCPNC 49 (50)
T ss_pred ccCChhhcCChh--------hccCCCCCCcCCcCCCcCCcCccCcc
Confidence 456777776665 22334678888877655445556555
No 120
>PF13832 zf-HC5HC2H_2: PHD-zinc-finger like domain
Probab=33.15 E-value=24 Score=32.26 Aligned_cols=29 Identities=28% Similarity=0.711 Sum_probs=24.4
Q ss_pred CCccccccCCcccCCCeeEEccc--cCceeccccc
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDK--CRMMVHARCY 428 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~--C~~~vH~~CY 428 (860)
...|.+|..... ..+.|.. |...+|..|-
T Consensus 55 ~~~C~iC~~~~G----~~i~C~~~~C~~~fH~~CA 85 (110)
T PF13832_consen 55 KLKCSICGKSGG----ACIKCSHPGCSTAFHPTCA 85 (110)
T ss_pred CCcCcCCCCCCc----eeEEcCCCCCCcCCCHHHH
Confidence 469999998633 3899997 9999999996
No 121
>smart00547 ZnF_RBZ Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
Probab=32.64 E-value=25 Score=23.12 Aligned_cols=23 Identities=22% Similarity=0.502 Sum_probs=18.3
Q ss_pred ceeeecccCCCCCCCCCccccCC
Q 002995 438 LWLCNLCRPGAPEPPPPCCLCPV 460 (860)
Q Consensus 438 ~W~C~~C~~~~~~~~~~C~lC~~ 460 (860)
.|.|..|..........|..|..
T Consensus 2 ~W~C~~C~~~N~~~~~~C~~C~~ 24 (26)
T smart00547 2 DWECPACTFLNFASRSKCFACGA 24 (26)
T ss_pred cccCCCCCCcChhhhccccccCC
Confidence 79999998766666678888864
No 122
>cd04370 BAH BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.
Probab=31.23 E-value=53 Score=30.07 Aligned_cols=41 Identities=20% Similarity=0.353 Sum_probs=29.4
Q ss_pred CCCCCCCEEEEEecc----cCCCceEeeCCCCcCccccccccCCCCcEEEEEe
Q 002995 80 QELEPGDIIWAKLTG----HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFF 128 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g----~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FF 128 (860)
..|++||.|..+-.. -|.|.|+|..--. . ..+...+.|+||
T Consensus 2 ~~y~vgd~V~v~~~~~~~~~~~~i~~I~~i~~--~------~~~~~~~~v~wf 46 (123)
T cd04370 2 ITYEVGDSVYVEPDDSIKSDPPYIARIEELWE--D------TNGSKQVKVRWF 46 (123)
T ss_pred CEEecCCEEEEecCCcCCCCCCEEEEEeeeeE--C------CCCCEEEEEEEE
Confidence 468899999999766 6899999965321 1 123346788888
No 123
>PLN02915 cellulose synthase A [UDP-forming], catalytic subunit
Probab=29.90 E-value=32 Score=43.48 Aligned_cols=49 Identities=27% Similarity=0.619 Sum_probs=37.5
Q ss_pred CCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 396 LDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 396 ~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
..+|.||+++ -..+.++.|-|.-|+..|-..||- -...+| .=.|+.|+.
T Consensus 15 ~~~c~iCGd~vg~~~~Ge~FVAC~eC~fpvCr~cye-ye~~~g-~~~cp~c~t 65 (1044)
T PLN02915 15 AKTCRVCGDEVGVKEDGQPFVACHVCGFPVCKPCYE-YERSEG-NQCCPQCNT 65 (1044)
T ss_pred cchhhccccccCcCCCCCEEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence 3699999874 245667899999999999999994 222334 678888874
No 124
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=29.06 E-value=29 Score=42.26 Aligned_cols=31 Identities=29% Similarity=0.674 Sum_probs=26.0
Q ss_pred ccceecCCcceeeEEcCCCCcCccccchhhhhcC
Q 002995 509 LLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542 (860)
Q Consensus 509 ~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~g 542 (860)
..|.+|.. +|-.|-|. .|..+||..|.-.-+
T Consensus 48 e~c~ic~~-~g~~l~c~--tC~~s~h~~cl~~pl 78 (696)
T KOG0383|consen 48 EACRICAD-GGELLWCD--TCPASFHASCLGPPL 78 (696)
T ss_pred hhhhhhcC-CCcEEEec--cccHHHHHHccCCCC
Confidence 47999998 78888898 999999999984443
No 125
>KOG3409 consensus Exosomal 3'-5' exoribonuclease complex, subunit ski4 (Csl4) [Translation, ribosomal structure and biogenesis]
Probab=28.78 E-value=74 Score=31.58 Aligned_cols=16 Identities=38% Similarity=0.684 Sum_probs=13.1
Q ss_pred CCCCCCCCCEEEEEec
Q 002995 78 DCQELEPGDIIWAKLT 93 (860)
Q Consensus 78 ~~~~f~~GdlVwaK~~ 93 (860)
-...|+|||+|.|||=
T Consensus 119 v~ksFrPgDiVlAkVi 134 (193)
T KOG3409|consen 119 VYKSFRPGDIVLAKVI 134 (193)
T ss_pred hhhccCCCcEEEEEEe
Confidence 3557999999999973
No 126
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=28.64 E-value=45 Score=38.14 Aligned_cols=31 Identities=23% Similarity=0.639 Sum_probs=23.0
Q ss_pred ccccccCCc-ccCCCeeEEccccCceeccccc
Q 002995 398 KCSVCHMDE-EYQNNLFLQCDKCRMMVHARCY 428 (860)
Q Consensus 398 ~C~vC~~~~-~~~~n~ll~C~~C~~~vH~~CY 428 (860)
-|.||...+ ..+.-.-|.||-|+...|..|-
T Consensus 130 ~C~iC~kfD~~~n~~~Wi~Cd~CgH~cH~dCA 161 (446)
T PF07227_consen 130 MCCICSKFDDNKNTCSWIGCDVCGHWCHLDCA 161 (446)
T ss_pred CccccCCcccCCCCeeEEeccCCCceehhhhh
Confidence 577887643 2334457899999999999994
No 127
>PF12861 zf-Apc11: Anaphase-promoting complex subunit 11 RING-H2 finger
Probab=27.80 E-value=25 Score=30.85 Aligned_cols=34 Identities=21% Similarity=0.447 Sum_probs=23.7
Q ss_pred CCCccccccCCcc---------cCCCeeEEccccCceecccccc
Q 002995 395 DLDKCSVCHMDEE---------YQNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 395 ~~~~C~vC~~~~~---------~~~n~ll~C~~C~~~vH~~CYg 429 (860)
+++.|.||+...+ .++.+|+.+. |+-.||..|.-
T Consensus 20 ~dd~CgICr~~fdg~Cp~Ck~Pgd~Cplv~g~-C~H~FH~hCI~ 62 (85)
T PF12861_consen 20 NDDVCGICRMPFDGCCPDCKFPGDDCPLVWGK-CSHNFHMHCIL 62 (85)
T ss_pred CCCceeeEecccccCCCCccCCCCCCceeecc-CccHHHHHHHH
Confidence 4678888876443 2234566555 99999999964
No 128
>PLN02195 cellulose synthase A
Probab=27.71 E-value=36 Score=42.69 Aligned_cols=48 Identities=21% Similarity=0.549 Sum_probs=36.9
Q ss_pred CccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 397 DKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 397 ~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
..|.||++. -..+.++.|-|..|+..|-..||- ..-.+| .=-|+.|+.
T Consensus 7 ~~c~~cgd~~~~~~~g~~fvaC~eC~~pvCrpCye-yer~eg-~q~CpqCkt 56 (977)
T PLN02195 7 PICATCGEEVGVDSNGEAFVACHECSYPLCKACLE-YEIKEG-RKVCLRCGG 56 (977)
T ss_pred ccceecccccCcCCCCCeEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence 589999873 345667899999999999999994 222334 678888864
No 129
>KOG3038 consensus Histone acetyltransferase SAGA associated factor SGF29 [General function prediction only]
Probab=27.69 E-value=2.9e+02 Score=29.35 Aligned_cols=43 Identities=28% Similarity=0.461 Sum_probs=31.5
Q ss_pred CCCCCCCEEEEEecccC-CCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC
Q 002995 80 QELEPGDIIWAKLTGHA-MWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD 132 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g~P-wWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~ 132 (860)
.-|.+|.+|+|...|-- ..+|+|..++. .+...|.|.||.+..
T Consensus 197 ~~fpp~~~VLA~YP~TTcFY~aiVh~tp~----------d~s~~y~vlffD~~e 240 (264)
T KOG3038|consen 197 ALFPPGTIVLAVYPGTTCFYKAIVHSTPR----------DGSCDYYVLFFDDEE 240 (264)
T ss_pred cCCCCCCEEEEEcCCcceeeeeEeecCCC----------CCCCcceeeeecCcc
Confidence 46999999999975543 77899987532 334568999996643
No 130
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=27.41 E-value=33 Score=25.48 Aligned_cols=31 Identities=29% Similarity=0.860 Sum_probs=17.2
Q ss_pred eEEccccCceeccccccccccCCCCceeeecccC
Q 002995 413 FLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 413 ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
++.|.+|+.-+-..|- ....+..|.|..|..
T Consensus 2 p~rC~~C~aylNp~~~---~~~~~~~w~C~~C~~ 32 (40)
T PF04810_consen 2 PVRCRRCRAYLNPFCQ---FDDGGKTWICNFCGT 32 (40)
T ss_dssp S-B-TTT--BS-TTSE---EETTTTEEEETTT--
T ss_pred ccccCCCCCEECCcce---EcCCCCEEECcCCCC
Confidence 5678888887777763 223456899999975
No 131
>smart00744 RINGv The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Class
Probab=26.89 E-value=24 Score=27.50 Aligned_cols=31 Identities=23% Similarity=0.685 Sum_probs=18.9
Q ss_pred ccccccCCcccCCCeeE-Ecc--ccCceecccccc
Q 002995 398 KCSVCHMDEEYQNNLFL-QCD--KCRMMVHARCYG 429 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll-~C~--~C~~~vH~~CYg 429 (860)
.|-||++.++ ++++++ -|. +-...||+.|.-
T Consensus 1 ~CrIC~~~~~-~~~~l~~PC~C~G~~~~vH~~Cl~ 34 (49)
T smart00744 1 ICRICHDEGD-EGDPLVSPCRCKGSLKYVHQECLE 34 (49)
T ss_pred CccCCCCCCC-CCCeeEeccccCCchhHHHHHHHH
Confidence 4889998333 334443 333 223679999975
No 132
>PF10497 zf-4CXXC_R1: Zinc-finger domain of monoamine-oxidase A repressor R1; InterPro: IPR018866 R1 is a transcription factor repressor that inhibits monoamine oxidase A gene expression. This domain is a four-CXXC zinc finger putative DNA-binding domain found at the C-terminal end of R1. The domain carries 12 cysteines of which four pairs are of the CXXC type [].
Probab=26.78 E-value=47 Score=30.43 Aligned_cols=61 Identities=26% Similarity=0.510 Sum_probs=38.0
Q ss_pred CCccccccCCcccCCCeeEEc------cccCceecccccc-------c--cccCCCCceeeecccCCCCCCCCCccccCC
Q 002995 396 LDKCSVCHMDEEYQNNLFLQC------DKCRMMVHARCYG-------E--LEPVNGVLWLCNLCRPGAPEPPPPCCLCPV 460 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C------~~C~~~vH~~CYg-------~--~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~ 460 (860)
...|-.|..... +....| ..|...-=+.|++ . ..+.....|.|+.|+.. -.|..|..
T Consensus 7 g~~CHqCrqKt~---~~~~~C~~~~~~~~C~~~~~~fC~~CL~~ryge~~~ev~~~~~W~CP~Crgi-----CnCs~Crr 78 (105)
T PF10497_consen 7 GKTCHQCRQKTL---DFKTICTGHWKNSSCRGCRGKFCGGCLRNRYGENVEEVLEDPNWKCPKCRGI-----CNCSFCRR 78 (105)
T ss_pred CCCchhhcCCCC---CCceEcCCCCCCCCCccCcceehHhHHHHHHhhhHHHHhcCCceECCCCCCe-----eCCHhhhc
Confidence 357888887543 234678 7783333345654 2 12334458999999963 36889977
Q ss_pred CCCC
Q 002995 461 VGGA 464 (860)
Q Consensus 461 ~gGa 464 (860)
+.|.
T Consensus 79 k~g~ 82 (105)
T PF10497_consen 79 KRGW 82 (105)
T ss_pred cCCC
Confidence 7653
No 133
>PF11302 DUF3104: Protein of unknown function (DUF3104); InterPro: IPR021453 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=26.73 E-value=1.6e+02 Score=25.17 Aligned_cols=59 Identities=17% Similarity=0.198 Sum_probs=43.0
Q ss_pred CCCCCCEEEEEecc-------cCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccc
Q 002995 81 ELEPGDIIWAKLTG-------HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISF 144 (860)
Q Consensus 81 ~f~~GdlVwaK~~g-------~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f 144 (860)
..++||.|..+-.. --||=|.|+.-.. +.+.+.....+.|.=-.+...-||+.+.+...
T Consensus 5 ~Vk~Gd~ViV~~~~~~~~~~~~dWWmg~Vi~~~g-----gaR~P~~~tlFQVadVDtG~I~wVnaD~Vt~I 70 (75)
T PF11302_consen 5 SVKPGDTVIVQDEQEVGQKQDKDWWMGQVIHCEG-----GARDPKVPTLFQVADVDTGVIRWVNADEVTHI 70 (75)
T ss_pred ccCCCCEEEEecCccccccCCCCcEEEEEEEEec-----cccCCCCCceEEEEEccCCeEEEEEchheeee
Confidence 46899999998766 7899999987532 22333334446788888999999998766543
No 134
>cd04721 BAH_plant_1 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=26.47 E-value=65 Score=30.71 Aligned_cols=53 Identities=19% Similarity=0.266 Sum_probs=35.7
Q ss_pred CCCCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC--EEEEeC
Q 002995 78 DCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD--FARINV 138 (860)
Q Consensus 78 ~~~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~--~awv~~ 138 (860)
++..|++||.|..+-.+-..|=|+|.+-- .. ..+...+.|+||-..+ .+..++
T Consensus 4 ~~~~i~vGD~V~v~~~~~~~~va~Ie~i~--ed------~~g~~~v~v~WF~~p~E~~~~~~~ 58 (130)
T cd04721 4 NGVTISVHDFVYVLSEEEDRYVAYIEDLY--ED------KKGSKMVKVRWFHTTDEVGAALSP 58 (130)
T ss_pred CCEEEECCCEEEEeCCCCCcEEEEEEEEE--Ec------CCCCEEEEEEEecCHHHhccccCC
Confidence 34568999999999877677999997631 11 1233468899997654 344443
No 135
>KOG0696 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=26.33 E-value=19 Score=40.61 Aligned_cols=33 Identities=30% Similarity=0.444 Sum_probs=23.7
Q ss_pred CccccccCCcccCCCeeEEccccCceecccccc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg 429 (860)
..|+-|-.---.=.-+=.+|.-|...||+.|.-
T Consensus 57 TfCsHCkDFiwGfgKQGfQCqvC~fvvHkrChe 89 (683)
T KOG0696|consen 57 TFCSHCKDFIWGFGKQGFQCQVCCFVVHKRCHE 89 (683)
T ss_pred chhhhhhhheeccccCceeeeEEeehhhhhhcc
Confidence 689999643211223357999999999999974
No 136
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=26.14 E-value=60 Score=26.89 Aligned_cols=35 Identities=11% Similarity=0.400 Sum_probs=26.6
Q ss_pred CCCccccccCCccc-CCCeeEEccccCceecccccc
Q 002995 395 DLDKCSVCHMDEEY-QNNLFLQCDKCRMMVHARCYG 429 (860)
Q Consensus 395 ~~~~C~vC~~~~~~-~~n~ll~C~~C~~~vH~~CYg 429 (860)
....|..|+..... ..+....|..|+...|..--+
T Consensus 27 TSq~C~~CG~~~~~~~~~r~~~C~~Cg~~~~rD~na 62 (69)
T PF07282_consen 27 TSQTCPRCGHRNKKRRSGRVFTCPNCGFEMDRDVNA 62 (69)
T ss_pred CccCccCcccccccccccceEEcCCCCCEECcHHHH
Confidence 44689999986544 566789999999999876443
No 137
>PLN02189 cellulose synthase
Probab=25.99 E-value=41 Score=42.55 Aligned_cols=49 Identities=27% Similarity=0.679 Sum_probs=37.5
Q ss_pred CCccccccCCc--ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995 396 LDKCSVCHMDE--EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446 (860)
Q Consensus 396 ~~~C~vC~~~~--~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~ 446 (860)
...|.||++.- ..+++..+-|.-|+..|-..||- ..-.+| .=.|+.|+.
T Consensus 34 ~~~C~iCgd~vg~~~~g~~fvaC~~C~fpvCr~Cye-yer~eg-~q~CpqCkt 84 (1040)
T PLN02189 34 GQVCEICGDEIGLTVDGDLFVACNECGFPVCRPCYE-YERREG-TQNCPQCKT 84 (1040)
T ss_pred CccccccccccCcCCCCCEEEeeccCCCccccchhh-hhhhcC-CccCcccCC
Confidence 35999999753 35667899999999999999994 222334 678999975
No 138
>PF00641 zf-RanBP: Zn-finger in Ran binding protein and others; InterPro: IPR001876 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the zinc finger domain found in RanBP2 proteins. Ran is an evolutionary conserved member of the Ras superfamily that regulates all receptor-mediated transport between the nucleus and the cytoplasm. Ran binding protein 2 (RanBP2) is a 358kDa nucleoporin located on the cytoplasmic side of the nuclear pore complex which plays a role in nuclear protein import []. RanBP2 contains multiple zinc fingers which mediate binding to RanGDP []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9G_A 2EBR_A 2WX0_C 2WX1_C 2WWZ_C 3GJ6_B 2LK0_A 2LK1_A 3GJ5_B 3GJ8_B ....
Probab=25.60 E-value=22 Score=24.40 Aligned_cols=22 Identities=27% Similarity=0.718 Sum_probs=16.7
Q ss_pred ceeeecccCCCCCCCCCccccC
Q 002995 438 LWLCNLCRPGAPEPPPPCCLCP 459 (860)
Q Consensus 438 ~W~C~~C~~~~~~~~~~C~lC~ 459 (860)
.|.|..|..........|..|.
T Consensus 4 ~W~C~~C~~~N~~~~~~C~~C~ 25 (30)
T PF00641_consen 4 DWKCPSCTFMNPASRSKCVACG 25 (30)
T ss_dssp SEEETTTTEEEESSSSB-TTT-
T ss_pred CccCCCCcCCchHHhhhhhCcC
Confidence 8999999986666667899885
No 139
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=25.24 E-value=48 Score=36.11 Aligned_cols=84 Identities=15% Similarity=0.501 Sum_probs=51.0
Q ss_pred CCCCceeeecccCCCCCCCCCccccCCCCCCceecC--CCceeccccccccCceEEccCCCccccccchhhhccccC--c
Q 002995 434 VNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT--DGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWK--L 509 (860)
Q Consensus 434 ~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~--~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~--~ 509 (860)
+.+..++|+.|+...-.....|..|... |.-+. ...+-| +-|+.....++..... .
T Consensus 272 ~~~~Gy~CP~CkakvCsLP~eCpiC~lt---LVss~hLARSyhh-----------------L~PL~~F~Eip~~~~~~~~ 331 (378)
T KOG2807|consen 272 LSGGGYFCPQCKAKVCSLPIECPICSLT---LVSSPHLARSYHH-----------------LFPLKPFVEIPETEYNGSR 331 (378)
T ss_pred cccCceeCCcccCeeecCCccCCcccee---EecchHHHHHHHh-----------------hcCCcchhhccccccCCCc
Confidence 4455899999997655556788888643 22111 011222 2244445555555433 2
Q ss_pred cceecCC--cceeeEEcCCCCcCccccchhhh
Q 002995 510 LCSICGV--SYGACIQCSNTTCRVAYHPLCAR 539 (860)
Q Consensus 510 ~C~~C~~--~~Ga~iqC~~~~C~~~fH~~CA~ 539 (860)
.|..|+. ..+...+|. .|...|=.-|-.
T Consensus 332 ~Cf~C~~~~~~~~~y~C~--~Ck~~FCldCDv 361 (378)
T KOG2807|consen 332 FCFACQGELLSSGRYRCE--SCKNVFCLDCDV 361 (378)
T ss_pred ceeeeccccCCCCcEEch--hccceeeccchH
Confidence 4999943 356778997 898888877763
No 140
>PF09038 53-BP1_Tudor: Tumour suppressor p53-binding protein-1 Tudor; InterPro: IPR015125 This domain consist of ten beta-strands and a carboxy-terminal alpha-helix. The amino-terminal five beta-strands and the C-terminal five beta-strands adopt folds that are identical to each other. The domain is essential for the recruitment of proteins to double stranded breaks in DNA, which is mediated by interaction with methylated Lys 79 of histone H3 []. ; PDB: 3LGL_A 1XNI_B 3LGF_A 2G3R_A 2IG0_A 3LH0_A 1SSF_A.
Probab=25.20 E-value=62 Score=30.28 Aligned_cols=28 Identities=29% Similarity=0.436 Sum_probs=22.7
Q ss_pred eeeeeeeecCCCCceEEEecCCCcccccc
Q 002995 5 YSGFVVGYDSESNRHHVKYVDGDEEDLIL 33 (860)
Q Consensus 5 ~~g~v~~~~~~~~~h~~~y~d~d~e~~~l 33 (860)
|+|.|+. |....+..|..+||.+-.+..
T Consensus 20 Y~G~I~~-~~~~~kykv~FdDG~~~~v~~ 47 (122)
T PF09038_consen 20 YPGKITS-DKGKNKYKVLFDDGYECRVLG 47 (122)
T ss_dssp EEEEEEE-EETTTEEEEEETTS-EEEEEC
T ss_pred cCceEee-cCCCCeEEEEecCCccceecc
Confidence 7999999 578889999999999865543
No 141
>cd05791 S1_CSL4 S1_CSL4: CSL4, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. ScCSL4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In S. cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Probab=25.14 E-value=46 Score=29.58 Aligned_cols=17 Identities=35% Similarity=0.610 Sum_probs=14.3
Q ss_pred CCCCCCCEEEEEecccC
Q 002995 80 QELEPGDIIWAKLTGHA 96 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g~P 96 (860)
.-|+|||+|-||+-++-
T Consensus 59 ~~f~~GDiV~AkVis~~ 75 (92)
T cd05791 59 KCFRPGDIVRAKVISLG 75 (92)
T ss_pred hhcCCCCEEEEEEEEcC
Confidence 45999999999997664
No 142
>PF11793 FANCL_C: FANCL C-terminal domain; PDB: 3K1L_A.
Probab=24.57 E-value=34 Score=28.80 Aligned_cols=33 Identities=33% Similarity=0.699 Sum_probs=13.5
Q ss_pred ccceecCCc---ce--eeEEcCCCCcCccccchhhhhc
Q 002995 509 LLCSICGVS---YG--ACIQCSNTTCRVAYHPLCARAA 541 (860)
Q Consensus 509 ~~C~~C~~~---~G--a~iqC~~~~C~~~fH~~CA~~~ 541 (860)
..|.||... .+ .-+.|....|...||..|-...
T Consensus 3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~w 40 (70)
T PF11793_consen 3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEW 40 (70)
T ss_dssp -S-SSS--SS-TT-----B--S-TT----B-SGGGHHH
T ss_pred CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHH
Confidence 468889752 22 3579999999999999997543
No 143
>PF08792 A2L_zn_ribbon: A2L zinc ribbon domain; InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors [].
Probab=23.69 E-value=69 Score=22.83 Aligned_cols=27 Identities=26% Similarity=0.463 Sum_probs=19.0
Q ss_pred CCccccccCCccc-CCCeeEEccccCce
Q 002995 396 LDKCSVCHMDEEY-QNNLFLQCDKCRMM 422 (860)
Q Consensus 396 ~~~C~vC~~~~~~-~~n~ll~C~~C~~~ 422 (860)
+..|..|...... .++-.++|-.|+..
T Consensus 3 ~~~C~~C~~~~i~~~~~~~~~C~~Cg~~ 30 (33)
T PF08792_consen 3 LKKCSKCGGNGIVNKEDDYEVCIFCGSS 30 (33)
T ss_pred ceEcCCCCCCeEEEecCCeEEcccCCcE
Confidence 3578889876654 55557788888754
No 144
>KOG2752 consensus Uncharacterized conserved protein, contains N-recognin-type Zn-finger [General function prediction only]
Probab=23.03 E-value=89 Score=33.99 Aligned_cols=112 Identities=16% Similarity=0.288 Sum_probs=60.7
Q ss_pred ccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceeccc
Q 002995 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLA 477 (860)
Q Consensus 398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH~~ 477 (860)
.|--|--... ...+|-.|.+..|..=-++...... .|.|+.|..... .|.+=+...+.-+ -+.+.|.|-+
T Consensus 57 sClTC~P~~~----~agvC~~C~~~CH~~H~lveL~tKR-~FrCDCg~sk~g----~~sc~l~~~~~~~-n~~N~YNhNf 126 (345)
T KOG2752|consen 57 SCLTCTPAPE----MAGVCYACSLSCHDGHELVELYTKR-NFRCDCGNSKFG----RCSCNLLEDKDAE-NSENLYNHNF 126 (345)
T ss_pred EeecccCChh----hceeEEEeeeeecCCceeeeccccC-Cccccccccccc----ccccccccccccc-cchhhhhhhh
Confidence 4555643221 3678999999999876555444444 899998875332 2222222222222 3344566633
Q ss_pred cccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCcccc-chhhhhcCe
Q 002995 478 CAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYH-PLCARAAGL 543 (860)
Q Consensus 478 Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH-~~CA~~~g~ 543 (860)
=.++. .-+.....|+.. -.|.++||. -|..||| -.|-....+
T Consensus 127 qG~~C----~Cd~~Ypdp~~~------------------~e~~m~QC~--iCEDWFHce~c~~~~~~ 169 (345)
T KOG2752|consen 127 QGLFC----KCDTPYPDPVRT------------------EEGEMLQCV--ICEDWFHCEGCMQAKTF 169 (345)
T ss_pred cceeE----EecCCCCCcccc------------------ccceeeeEE--eccchhcccccCcccch
Confidence 22221 011111112211 268899998 8999999 666655544
No 145
>KOG0782 consensus Predicted diacylglycerol kinase [Signal transduction mechanisms]
Probab=22.84 E-value=23 Score=40.92 Aligned_cols=20 Identities=35% Similarity=0.913 Sum_probs=16.2
Q ss_pred eEEccccCceeccccccccc
Q 002995 413 FLQCDKCRMMVHARCYGELE 432 (860)
Q Consensus 413 ll~C~~C~~~vH~~CYg~~~ 432 (860)
--.|.-|.+.||..|.....
T Consensus 199 rRkCAaCkIVvHT~CieqLe 218 (1004)
T KOG0782|consen 199 RRKCAACKIVVHTNCIEQLE 218 (1004)
T ss_pred hccceeeeEEEechHHHHHH
Confidence 34799999999999986443
No 146
>KOG0695 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=22.68 E-value=32 Score=37.65 Aligned_cols=35 Identities=23% Similarity=0.556 Sum_probs=25.8
Q ss_pred CccccccCCcccCCCeeEEccccCceecccccccc
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL 431 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~ 431 (860)
..|.||..--=.-..+=..|-+|.+.||.+|.+..
T Consensus 142 ~~c~ic~d~iwglgrqgyrcinckl~vhkkch~~v 176 (593)
T KOG0695|consen 142 AYCGICSDRIWGLGRQGYRCINCKLLVHKKCHGLV 176 (593)
T ss_pred eeeeechhhhhhcccccceeecceeehhhhhcccc
Confidence 58999975322223345689999999999999854
No 147
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=21.84 E-value=33 Score=40.62 Aligned_cols=46 Identities=24% Similarity=0.570 Sum_probs=30.0
Q ss_pred CCccccccCCcccCCCeeEEccccCceeccccccccccC--CCCceeeecccC
Q 002995 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPV--NGVLWLCNLCRP 446 (860)
Q Consensus 396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~--~~~~W~C~~C~~ 446 (860)
...|.+|...+. ++-|+.|.-.+|+.|-+....+ .+..|.|..|..
T Consensus 47 ~ts~~~~~~~gn-----~~~~~~~~~s~h~~~~~~~~sp~~~~~~~~~~~~~~ 94 (613)
T KOG4299|consen 47 ATSCGICKSGGN-----LLCCDHCPASFHLECDKPPLSPDLKGSEINCSRCPK 94 (613)
T ss_pred hhhcchhhhcCC-----ccccccCccccchhccCcccCcccccccccccCCCc
Confidence 567888876543 5678888888888886643322 234677777754
No 148
>KOG1844 consensus PHD Zn-finger proteins [General function prediction only]
Probab=21.71 E-value=56 Score=38.62 Aligned_cols=49 Identities=27% Similarity=0.851 Sum_probs=36.1
Q ss_pred CccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCC
Q 002995 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG 447 (860)
Q Consensus 397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~ 447 (860)
..|. |...++. +..++.|+.|+.--|..|+|.....+-....|..|...
T Consensus 87 ~~c~-c~~~~~~-~g~~i~c~~c~~Wqh~~C~g~~~~~~p~~y~c~~c~~~ 135 (508)
T KOG1844|consen 87 SRCD-CGLEDDM-EGLMIQCDWCGRWQHKICCGSFKSTKPDKYVCEICTPR 135 (508)
T ss_pred cccc-cccccCC-CceeeCCcccCcccCceeeeecCCCCchhceeeeeccc
Confidence 3554 7765542 66799999999999999999654333236889999754
No 149
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=21.40 E-value=76 Score=28.49 Aligned_cols=31 Identities=32% Similarity=0.634 Sum_probs=21.2
Q ss_pred CCCccccccCCcccCCCeeEEccccCceeccccc
Q 002995 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCY 428 (860)
Q Consensus 395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CY 428 (860)
+...|.||+..-.. ..++ -.-|+..||..|.
T Consensus 77 ~~~~C~vC~k~l~~--~~f~-~~p~~~v~H~~C~ 107 (109)
T PF10367_consen 77 ESTKCSVCGKPLGN--SVFV-VFPCGHVVHYSCI 107 (109)
T ss_pred CCCCccCcCCcCCC--ceEE-EeCCCeEEecccc
Confidence 45689999875432 2333 3366799999996
No 150
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=21.27 E-value=39 Score=30.43 Aligned_cols=30 Identities=27% Similarity=0.609 Sum_probs=20.4
Q ss_pred ccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995 509 LLCSICGVSYGACIQCSNTTCRVAYHPLCAR 539 (860)
Q Consensus 509 ~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~ 539 (860)
..|.+|+++-|...--.+ -|...||..|+.
T Consensus 79 ~~C~vC~k~l~~~~f~~~-p~~~v~H~~C~~ 108 (109)
T PF10367_consen 79 TKCSVCGKPLGNSVFVVF-PCGHVVHYSCIK 108 (109)
T ss_pred CCccCcCCcCCCceEEEe-CCCeEEeccccc
Confidence 469999997553322222 456899999985
No 151
>PF09465 LBR_tudor: Lamin-B receptor of TUDOR domain; InterPro: IPR019023 The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=20.96 E-value=1.7e+02 Score=23.53 Aligned_cols=50 Identities=10% Similarity=0.258 Sum_probs=29.6
Q ss_pred CCCCCCCEEEEEecccCCC-ceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCc
Q 002995 80 QELEPGDIIWAKLTGHAMW-PAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQV 141 (860)
Q Consensus 80 ~~f~~GdlVwaK~~g~PwW-Pa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l 141 (860)
..|..|++|-+|-.|-.-| +|+|.+-+ .....|.|.|= |++--=++..+|
T Consensus 4 ~k~~~Ge~V~~rWP~s~lYYe~kV~~~d-----------~~~~~y~V~Y~-DGtel~lke~di 54 (55)
T PF09465_consen 4 RKFAIGEVVMVRWPGSSLYYEGKVLSYD-----------SKSDRYTVLYE-DGTELELKENDI 54 (55)
T ss_dssp SSS-SS-EEEEE-TTTS-EEEEEEEEEE-----------TTTTEEEEEET-TS-EEEEECCCE
T ss_pred ccccCCCEEEEECCCCCcEEEEEEEEec-----------ccCceEEEEEc-CCCEEEeccccc
Confidence 6799999999998888877 99997621 12334667764 444444444443
No 152
>PF12148 DUF3590: Protein of unknown function (DUF3590); InterPro: IPR021991 This domain is found in eukaryotes, and is typically between 83 and 97 amino acids in length. It is found in association with PF00097 from PFAM, PF02182 from PFAM, PF00628 from PFAM, PF00240 from PFAM. There are two conserved sequence motifs: RAR and NYN. The domain is part of the protein NIRF which has zinc finger and ubiquitinating domains. The function of this domain is likely to be mainly structural, however this has not been confirmed. ; PDB: 3DB4_A 3ASK_A 3DB3_A 2L3R_A.
Probab=20.48 E-value=1e+02 Score=27.10 Aligned_cols=31 Identities=26% Similarity=0.467 Sum_probs=20.0
Q ss_pred CCCceeeeeeeecCCC------CceEEEecCCCcccc
Q 002995 1 MQDWYSGFVVGYDSES------NRHHVKYVDGDEEDL 31 (860)
Q Consensus 1 ~~~~~~g~v~~~~~~~------~~h~~~y~d~d~e~~ 31 (860)
|-+||.+.|+.--... =..||||||=.++-+
T Consensus 9 ~gAWfEa~i~~i~~~~~~~~e~viYhIkyddype~gv 45 (85)
T PF12148_consen 9 MGAWFEAQIVTITKKCMSDDEDVIYHIKYDDYPENGV 45 (85)
T ss_dssp T-EEEEEEEEEEEES-SSSSTTEEEEEEETT-GGG-E
T ss_pred CcceEEEEEEEeeccCCCCCCCEEEEEEeccCCCcCc
Confidence 4579999887655433 278999998765544
No 153
>smart00439 BAH Bromo adjacent homology domain.
Probab=20.25 E-value=1.2e+02 Score=27.58 Aligned_cols=39 Identities=10% Similarity=0.250 Sum_probs=28.0
Q ss_pred CCCCCEEEEEecc--cCCCceEeeCCCCcCccccccccCCC-CcEEEEEe
Q 002995 82 LEPGDIIWAKLTG--HAMWPAIVVDESLIGDYKGLNKISGG-RSIPVQFF 128 (860)
Q Consensus 82 f~~GdlVwaK~~g--~PwWPa~V~~~~~~~~~~~~~~~~~~-~~~~V~FF 128 (860)
|++||.|..+-.. =|.|-|+|..--... .+. ..+.|+||
T Consensus 2 ~~vgd~V~v~~~~~~~~~~i~~I~~i~~~~--------~~~~~~~~v~Wf 43 (120)
T smart00439 2 IRVGDFVLVEPDDADEPYYIGRIEEIFETK--------KNSEKMVRVRWF 43 (120)
T ss_pred cccCCEEEEeCCCCCCCCEEEEEEEEEECC--------CCCEEEEEEEEE
Confidence 7899999999764 689999997632211 122 36789999
Done!