Query 001243
Match_columns 1116
No_of_seqs 427 out of 1523
Neff 4.9
Searched_HMMs 46136
Date Thu Mar 28 19:40:47 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001243.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001243hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0954 PHD finger protein [Ge 100.0 8.9E-43 1.9E-47 404.2 8.3 517 3-596 328-892 (893)
2 KOG0954 PHD finger protein [Ge 100.0 1.6E-36 3.6E-41 352.4 7.5 167 702-886 268-440 (893)
3 KOG0955 PHD finger protein BR1 100.0 3.4E-35 7.3E-40 361.1 9.5 169 702-886 216-397 (1051)
4 KOG0956 PHD finger protein AF1 100.0 3.1E-35 6.8E-40 339.1 5.4 170 702-887 2-185 (900)
5 COG5141 PHD zinc finger-contai 100.0 2.7E-35 5.9E-40 330.0 4.5 169 703-887 191-367 (669)
6 KOG0957 PHD finger protein [Ge 100.0 2E-33 4.3E-38 315.4 5.2 285 707-1030 121-431 (707)
7 PF13832 zf-HC5HC2H_2: PHD-zin 99.9 1.7E-24 3.7E-29 207.4 6.1 107 2-113 4-110 (110)
8 PF13832 zf-HC5HC2H_2: PHD-zin 99.9 3.9E-23 8.3E-28 198.0 7.3 106 776-882 2-110 (110)
9 KOG0956 PHD finger protein AF1 99.9 9.2E-24 2E-28 244.9 2.7 113 3-119 67-186 (900)
10 COG5141 PHD zinc finger-contai 99.9 2.2E-23 4.8E-28 234.9 2.6 117 3-121 252-370 (669)
11 KOG0955 PHD finger protein BR1 99.8 8.5E-22 1.8E-26 243.6 3.9 114 2-117 277-397 (1051)
12 PF13771 zf-HC5HC2H: PHD-like 99.7 5.5E-18 1.2E-22 156.7 4.7 88 23-114 1-90 (90)
13 KOG0957 PHD finger protein [Ge 99.7 5.6E-18 1.2E-22 191.9 1.1 113 3-119 187-304 (707)
14 PF13771 zf-HC5HC2H: PHD-like 99.5 5.9E-15 1.3E-19 136.6 3.0 85 797-883 1-90 (90)
15 KOG1080 Histone H3 (Lys4) meth 99.0 3.6E-10 7.9E-15 141.7 6.4 143 700-865 568-715 (1005)
16 PF13831 PHD_2: PHD-finger; PD 98.9 2.6E-10 5.6E-15 90.0 0.2 35 718-752 1-36 (36)
17 KOG1080 Histone H3 (Lys4) meth 97.7 1.5E-05 3.4E-10 101.0 2.1 86 2-95 631-716 (1005)
18 PF00628 PHD: PHD-finger; Int 97.5 4.2E-05 9E-10 64.0 1.9 46 707-753 1-50 (51)
19 smart00249 PHD PHD zinc finger 97.5 0.0001 2.2E-09 59.1 3.6 44 707-751 1-47 (47)
20 KOG1084 Transcription factor T 97.0 0.00031 6.8E-09 81.8 2.1 98 774-883 221-321 (375)
21 KOG1244 Predicted transcriptio 96.8 0.00052 1.1E-08 75.4 1.9 51 705-756 281-333 (336)
22 KOG1512 PHD Zn-finger protein 96.7 0.00072 1.6E-08 74.6 1.7 45 704-749 313-357 (381)
23 KOG1084 Transcription factor T 96.5 0.0014 3E-08 76.5 2.6 86 17-114 236-321 (375)
24 PF15446 zf-PHD-like: PHD/FYVE 96.5 0.0043 9.3E-08 64.6 5.7 127 707-858 1-140 (175)
25 KOG4323 Polycomb-like PHD Zn-f 96.3 0.0036 7.8E-08 74.2 4.5 134 706-883 84-222 (464)
26 KOG4323 Polycomb-like PHD Zn-f 96.2 0.0024 5.2E-08 75.6 2.4 51 704-754 167-224 (464)
27 COG5034 TNG2 Chromatin remodel 96.0 0.057 1.2E-06 59.7 11.4 52 699-753 215-269 (271)
28 KOG0825 PHD Zn-finger protein 95.1 0.011 2.3E-07 72.7 2.1 52 702-754 212-266 (1134)
29 KOG4299 PHD Zn-finger protein 94.6 0.015 3.2E-07 70.6 1.6 48 706-754 254-305 (613)
30 KOG1973 Chromatin remodeling p 93.6 0.039 8.4E-07 62.1 2.4 50 702-754 216-268 (274)
31 smart00249 PHD PHD zinc finger 91.3 0.17 3.7E-06 40.4 2.7 32 829-862 1-34 (47)
32 PF14446 Prok-RING_1: Prokaryo 91.2 0.15 3.2E-06 44.4 2.2 33 704-736 4-36 (54)
33 KOG0383 Predicted helicase [Ge 89.4 0.24 5.2E-06 62.1 3.0 50 700-753 42-93 (696)
34 TIGR02844 spore_III_D sporulat 87.5 0.76 1.7E-05 43.1 4.3 50 222-272 9-60 (80)
35 PF10198 Ada3: Histone acetylt 87.0 4 8.7E-05 41.7 9.4 82 531-612 14-104 (131)
36 PF09012 FeoC: FeoC like trans 83.1 0.79 1.7E-05 41.1 2.1 31 224-254 5-35 (69)
37 PF00628 PHD: PHD-finger; Int 80.4 1.1 2.3E-05 37.6 1.8 30 829-860 1-32 (51)
38 KOG1512 PHD Zn-finger protein 80.2 1 2.2E-05 50.8 2.0 50 704-753 257-316 (381)
39 PF00356 LacI: Bacterial regul 79.3 1.9 4.1E-05 36.4 2.9 44 235-278 1-45 (46)
40 PF08220 HTH_DeoR: DeoR-like h 76.0 1.9 4.1E-05 37.5 2.1 33 222-254 3-35 (57)
41 KOG4443 Putative transcription 75.9 1.3 2.9E-05 54.7 1.6 49 706-755 69-119 (694)
42 KOG1973 Chromatin remodeling p 75.7 1.4 3E-05 49.9 1.6 45 828-882 220-265 (274)
43 KOG1473 Nucleosome remodeling 74.6 2.9 6.2E-05 54.5 4.0 118 699-859 338-459 (1414)
44 KOG1044 Actin-binding LIM Zn-f 74.2 1.6 3.4E-05 53.4 1.5 35 828-866 193-228 (670)
45 KOG1245 Chromatin remodeling c 72.1 1.1 2.3E-05 60.7 -0.6 51 704-755 1107-1159(1404)
46 PF00130 C1_1: Phorbol esters/ 71.8 3.1 6.7E-05 35.1 2.4 35 705-739 11-46 (53)
47 cd00029 C1 Protein kinase C co 70.6 2.5 5.3E-05 34.8 1.5 34 705-738 11-45 (50)
48 PF02318 FYVE_2: FYVE-type zin 70.1 3 6.6E-05 41.4 2.3 48 706-754 55-103 (118)
49 smart00109 C1 Protein kinase C 67.1 2.6 5.6E-05 34.3 0.9 33 705-737 11-43 (49)
50 smart00530 HTH_XRE Helix-turn- 66.3 8.4 0.00018 30.1 3.8 48 225-272 2-50 (56)
51 TIGR02607 antidote_HigA addict 65.6 10 0.00023 34.0 4.6 54 219-272 3-58 (78)
52 PF02796 HTH_7: Helix-turn-hel 64.9 7.2 0.00016 32.3 3.1 31 223-254 12-42 (45)
53 PF07649 C1_3: C1-like domain; 64.2 3.1 6.8E-05 31.7 0.8 28 707-735 2-29 (30)
54 PF13443 HTH_26: Cro/C1-type H 62.7 8.4 0.00018 33.3 3.3 48 225-272 2-51 (63)
55 PF01381 HTH_3: Helix-turn-hel 59.2 13 0.00029 31.1 3.8 48 225-272 1-49 (55)
56 PF13404 HTH_AsnC-type: AsnC-t 57.3 9.5 0.00021 31.5 2.5 32 223-254 7-38 (42)
57 PF13901 DUF4206: Domain of un 55.8 9.9 0.00021 41.3 3.1 44 704-754 151-198 (202)
58 cd04718 BAH_plant_2 BAH, or Br 53.7 7 0.00015 40.8 1.5 27 730-756 1-29 (148)
59 PF13412 HTH_24: Winged helix- 53.1 11 0.00024 31.1 2.3 33 222-254 6-38 (48)
60 PF03107 C1_2: C1 domain; Int 52.9 11 0.00023 29.0 2.0 27 707-735 2-29 (30)
61 KOG4236 Serine/threonine prote 52.4 6.9 0.00015 48.0 1.3 34 703-736 154-188 (888)
62 cd00093 HTH_XRE Helix-turn-hel 51.2 23 0.0005 27.8 3.9 48 224-271 3-51 (58)
63 PF01978 TrmB: Sugar-specific 49.9 12 0.00026 33.2 2.2 33 222-254 11-43 (68)
64 PF14197 Cep57_CLD_2: Centroso 49.3 51 0.0011 30.3 6.1 60 545-611 2-63 (69)
65 PF10668 Phage_terminase: Phag 48.9 10 0.00022 34.0 1.5 22 231-252 20-41 (60)
66 PF07227 DUF1423: Protein of u 48.9 13 0.00028 44.8 2.9 48 707-754 130-192 (446)
67 PF13542 HTH_Tnp_ISL3: Helix-t 47.5 17 0.00037 30.3 2.6 31 222-254 18-48 (52)
68 TIGR03070 couple_hipB transcri 47.2 32 0.0007 28.5 4.3 36 220-255 2-37 (58)
69 KOG4443 Putative transcription 46.6 8.7 0.00019 47.9 1.0 49 706-754 19-71 (694)
70 COG5034 TNG2 Chromatin remodel 45.4 13 0.00029 41.8 2.1 32 826-857 219-252 (271)
71 KOG1701 Focal adhesion adaptor 43.7 9.6 0.00021 45.5 0.8 151 707-882 276-458 (468)
72 PRK09492 treR trehalose repres 43.1 21 0.00045 39.7 3.3 51 231-281 2-53 (315)
73 cd00569 HTH_Hin_like Helix-tur 43.1 34 0.00074 24.3 3.5 30 222-252 11-40 (42)
74 PHA02591 hypothetical protein; 42.9 25 0.00054 33.2 3.1 37 217-254 44-80 (83)
75 smart00550 Zalpha Z-DNA-bindin 42.8 19 0.00042 32.4 2.4 33 222-254 9-43 (68)
76 PF07649 C1_3: C1-like domain; 42.3 15 0.00033 28.0 1.4 27 829-857 2-30 (30)
77 PRK10014 DNA-binding transcrip 42.3 22 0.00047 40.1 3.3 52 231-282 4-56 (342)
78 PF13936 HTH_38: Helix-turn-he 41.9 16 0.00035 30.2 1.6 30 224-254 12-41 (44)
79 PRK14987 gluconate operon tran 40.9 22 0.00048 39.9 3.1 52 231-282 3-55 (331)
80 smart00420 HTH_DEOR helix_turn 40.5 24 0.00052 28.6 2.4 32 223-254 4-35 (53)
81 KOG3799 Rab3 effector RIM1 and 39.0 14 0.00031 37.9 1.1 50 705-754 65-116 (169)
82 KOG4362 Transcriptional regula 38.6 9.8 0.00021 48.0 -0.2 67 16-88 328-394 (684)
83 PF13518 HTH_28: Helix-turn-he 38.2 22 0.00049 29.3 2.0 28 224-253 5-32 (52)
84 smart00354 HTH_LACI helix_turn 37.7 33 0.00071 30.9 3.0 47 234-280 1-48 (70)
85 PF12324 HTH_15: Helix-turn-he 37.0 25 0.00055 33.1 2.2 34 222-255 27-60 (77)
86 PF04967 HTH_10: HTH DNA bindi 35.8 21 0.00045 31.2 1.4 22 233-254 23-44 (53)
87 PF03107 C1_2: C1 domain; Int 35.1 26 0.00056 26.9 1.7 27 58-86 2-30 (30)
88 PRK10339 DNA-binding transcrip 35.0 32 0.0007 38.6 3.2 49 233-281 1-52 (327)
89 PHA01976 helix-turn-helix prot 35.0 66 0.0014 28.0 4.5 52 220-271 2-54 (67)
90 PF12844 HTH_19: Helix-turn-he 34.7 48 0.001 28.6 3.6 48 224-271 3-51 (64)
91 PRK10681 DNA-binding transcrip 34.5 30 0.00064 38.7 2.8 34 222-255 10-43 (252)
92 PF08279 HTH_11: HTH domain; 34.5 30 0.00065 29.2 2.2 32 223-254 4-36 (55)
93 COG5194 APC11 Component of SCF 34.1 14 0.00031 34.8 0.2 32 57-88 21-65 (88)
94 COG5194 APC11 Component of SCF 33.8 16 0.00034 34.7 0.3 33 828-860 21-66 (88)
95 PF14446 Prok-RING_1: Prokaryo 33.6 33 0.00072 30.3 2.3 37 827-865 5-44 (54)
96 TIGR02405 trehalos_R_Ecol treh 33.5 35 0.00076 38.1 3.2 49 233-281 1-50 (311)
97 PF08746 zf-RING-like: RING-li 33.2 25 0.00054 29.3 1.4 30 830-859 1-30 (43)
98 PRK15431 ferrous iron transpor 33.1 27 0.00058 33.0 1.7 27 228-254 11-37 (78)
99 KOG1244 Predicted transcriptio 33.1 29 0.00064 39.4 2.4 48 706-753 225-283 (336)
100 PF01022 HTH_5: Bacterial regu 33.0 32 0.00069 28.5 2.1 31 223-254 6-36 (47)
101 PF00165 HTH_AraC: Bacterial r 32.5 23 0.0005 28.5 1.1 25 231-255 6-30 (42)
102 PRK09726 antitoxin HipB; Provi 32.2 69 0.0015 30.1 4.4 58 219-276 11-69 (88)
103 PF10367 Vps39_2: Vacuolar sor 32.0 36 0.00078 32.2 2.5 30 705-736 78-107 (109)
104 KOG4299 PHD Zn-finger protein 31.3 19 0.00041 44.9 0.7 30 56-88 253-285 (613)
105 PF14569 zf-UDP: Zinc-binding 31.1 10 0.00022 35.6 -1.3 50 704-754 8-60 (80)
106 PF11793 FANCL_C: FANCL C-term 31.0 24 0.00051 32.2 1.0 32 706-737 3-38 (70)
107 COG1349 GlpR Transcriptional r 30.7 37 0.0008 38.1 2.7 33 222-254 8-40 (253)
108 PF01325 Fe_dep_repress: Iron 29.5 50 0.0011 29.3 2.8 25 230-254 19-43 (60)
109 KOG1844 PHD Zn-finger proteins 29.1 32 0.00068 41.9 2.0 46 709-754 89-135 (508)
110 PRK06266 transcription initiat 28.4 43 0.00093 35.9 2.6 35 221-255 24-58 (178)
111 KOG0825 PHD Zn-finger protein 28.3 51 0.0011 42.3 3.5 32 827-860 215-249 (1134)
112 smart00345 HTH_GNTR helix_turn 28.3 53 0.0011 27.3 2.6 20 235-254 22-41 (60)
113 PF13764 E3_UbLigase_R4: E3 ub 28.1 1.4E+02 0.003 39.2 7.4 31 700-730 463-498 (802)
114 KOG0383 Predicted helicase [Ge 28.1 40 0.00087 43.1 2.7 28 727-754 2-31 (696)
115 PRK10703 DNA-binding transcrip 27.6 51 0.0011 37.2 3.2 50 233-282 1-51 (341)
116 PF08746 zf-RING-like: RING-li 27.5 37 0.00081 28.3 1.5 30 59-88 1-30 (43)
117 KOG0695 Serine/threonine prote 27.5 22 0.00048 41.6 0.3 35 705-739 141-176 (593)
118 COG2522 Predicted transcriptio 27.4 51 0.0011 33.4 2.8 33 222-255 12-44 (119)
119 smart00744 RINGv The RING-vari 26.8 21 0.00045 30.5 -0.1 31 707-737 1-34 (49)
120 PF08221 HTH_9: RNA polymerase 26.5 49 0.0011 29.5 2.2 35 221-255 15-49 (62)
121 TIGR00180 parB_part ParB-like 26.4 55 0.0012 34.9 3.0 52 217-268 104-155 (187)
122 PRK05472 redox-sensing transcr 26.4 42 0.00092 36.3 2.2 34 222-255 19-54 (213)
123 TIGR00373 conserved hypothetic 26.4 47 0.001 34.9 2.4 34 222-255 17-50 (158)
124 PRK10401 DNA-binding transcrip 26.3 55 0.0012 37.1 3.1 49 233-281 1-50 (346)
125 TIGR03830 CxxCG_CxxCG_HTH puta 26.2 1.1E+02 0.0024 29.9 4.9 53 222-274 67-119 (127)
126 PRK09526 lacI lac repressor; R 26.1 58 0.0013 36.7 3.3 52 231-282 3-55 (342)
127 PF07227 DUF1423: Protein of u 25.8 43 0.00094 40.6 2.3 21 705-731 113-133 (446)
128 PRK10727 DNA-binding transcrip 25.5 58 0.0013 36.9 3.2 50 233-282 1-51 (343)
129 KOG0696 Serine/threonine prote 25.4 27 0.00059 42.0 0.5 33 705-737 56-89 (683)
130 PF04760 IF2_N: Translation in 23.9 35 0.00075 29.2 0.7 23 232-254 2-24 (54)
131 PF10367 Vps39_2: Vacuolar sor 23.8 35 0.00076 32.3 0.9 30 828-858 79-108 (109)
132 TIGR02531 yecD_yerC TrpR-relat 23.7 58 0.0013 31.3 2.3 29 223-252 41-69 (88)
133 PRK11169 leucine-responsive tr 23.4 52 0.0011 34.3 2.1 33 222-254 17-49 (164)
134 PLN02638 cellulose synthase A 23.4 48 0.001 44.2 2.2 50 704-754 16-68 (1079)
135 PF13639 zf-RING_2: Ring finge 23.3 30 0.00065 28.1 0.2 30 707-737 2-31 (44)
136 PLN02400 cellulose synthase 23.1 58 0.0012 43.5 2.8 49 705-754 36-87 (1085)
137 PF08280 HTH_Mga: M protein tr 22.1 78 0.0017 27.7 2.6 33 223-255 9-41 (59)
138 smart00342 HTH_ARAC helix_turn 22.0 70 0.0015 27.9 2.4 42 233-274 1-49 (84)
139 PRK10434 srlR DNA-bindng trans 21.8 65 0.0014 36.1 2.6 33 222-254 8-40 (256)
140 PRK10072 putative transcriptio 21.8 1.1E+02 0.0024 29.8 3.8 52 224-275 37-88 (96)
141 smart00344 HTH_ASNC helix_turn 21.7 70 0.0015 30.5 2.4 32 223-254 7-38 (108)
142 PF05043 Mga: Mga helix-turn-h 21.6 87 0.0019 28.9 3.0 42 222-273 19-60 (87)
143 PF13551 HTH_29: Winged helix- 21.4 1.1E+02 0.0023 28.9 3.6 50 228-277 7-76 (112)
144 PRK04424 fatty acid biosynthes 21.1 73 0.0016 34.1 2.7 35 221-255 9-43 (185)
145 smart00342 HTH_ARAC helix_turn 20.8 66 0.0014 28.0 2.0 29 226-254 43-72 (84)
146 TIGR01481 ccpA catabolite cont 20.5 81 0.0018 35.3 3.0 48 234-281 2-50 (329)
147 PF11793 FANCL_C: FANCL C-term 20.5 55 0.0012 29.9 1.4 33 828-860 3-40 (70)
148 PF04405 ScdA_N: Domain of Unk 20.2 52 0.0011 29.0 1.1 37 220-256 14-54 (56)
149 PF10078 DUF2316: Uncharacteri 20.2 63 0.0014 31.3 1.7 42 231-272 21-62 (89)
150 PRK10141 DNA-binding transcrip 20.1 78 0.0017 31.8 2.5 33 222-254 19-51 (117)
151 PRK11179 DNA-binding transcrip 20.0 75 0.0016 32.7 2.4 33 222-254 12-44 (153)
152 PLN02436 cellulose synthase A 20.0 66 0.0014 43.0 2.4 49 705-754 36-87 (1094)
No 1
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=100.00 E-value=8.9e-43 Score=404.22 Aligned_cols=517 Identities=30% Similarity=0.393 Sum_probs=322.2
Q ss_pred CCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCcccc
Q 001243 3 SLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSFH 82 (1116)
Q Consensus 3 lCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~FH 82 (1116)
|||++||+||+|.+| ..|||+.||||||||+|+++..|+||++|+.|+..||.|.|.+|+.+.||||||+.+.|.++||
T Consensus 328 LCPkkGGamK~~~sg-T~wAHvsCALwIPEVsie~~ekmePItkfs~IpesRwslvC~LCk~k~GACIqCs~k~C~t~fH 406 (893)
T KOG0954|consen 328 LCPKKGGAMKPTKSG-TKWAHVSCALWIPEVSIECPEKMEPITKFSHIPESRWSLVCNLCKVKSGACIQCSNKTCRTAFH 406 (893)
T ss_pred eccccCCcccccCCC-CeeeEeeeeeccceeeccCHhhcCcccccCCCcHHHHHHHHHHhcccCcceEEecccchhhhcc
Confidence 799999999999987 5999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhhcCceEEEccccCCcceeEeecCCCCCCCCCCCCCCCCCCCCCCCCC--ccccccccccccccCcccceeeeccC
Q 001243 83 PICAREARHRLEVWGKYGCNNVELRAFCAKHSDIQDNSSTPRTGDPCSAIGS--ESCVSNNLHETLSMSKLHKLKFSCKN 160 (1116)
Q Consensus 83 vtCA~~aG~~~e~~~~~g~~~~~~~~fC~~Hr~~~~~~~~~~~~~~~~~d~~--~~~~~~~~~~~L~~~~l~Q~q~~~~~ 160 (1116)
|+||+.+|..|.++.+. .+.+.|+.||.+|+..+........++....... +...........+.+.++++...
T Consensus 407 v~CA~~aG~~~~~~~~~-~D~v~~~s~c~khs~~~~~~s~g~~~e~p~p~~~~p~~~~~e~~~~s~r~q~l~~~e~e--- 482 (893)
T KOG0954|consen 407 VTCAFEAGLEMKTILKE-NDEVKFKSYCSKHSDHREGKSLGNEAESPHPRCHLPEQSVGEGHRSSDRAQKLQELEGE--- 482 (893)
T ss_pred chhhhhcCCeeeeeecc-CCchhheeecccccccccccccccccCCCCccccChhhhhhhhhhhhHHHHHHhhcchh---
Confidence 99999999999999754 6788999999999988753222111111100000 00123344445555555555422
Q ss_pred CCeeeEeeecCCCCCCCCCCccccccCCCccccccccccccCCCCCCCCCCCCCCCCCcchHHHHHHHhhhCCcchhhhh
Q 001243 161 GDKIGVHTETSDANSDRSTDSEVTGFSDSRLISVPTSECTNAGKPDRSEFEDVNPSDALNFTLILKKLIDRGKVNVKDIA 240 (1116)
Q Consensus 161 Gd~~~~~~~t~~~~s~~~~~~e~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lkkli~~gkv~~~~~a 240 (1116)
.|.-.++.-.-|..++|.-.+++.- .=+. .+..+-....-.+..+.+|.+++||++|.|||+++++|
T Consensus 483 ----------f~~~v~~~diae~l~~~e~~vs~iy--nywk-lkrks~~n~~lippk~d~~~~i~kk~~~~~kv~~kl~a 549 (893)
T KOG0954|consen 483 ----------FYDIVRNEDIAELLSMPEFAVSAIY--NYWK-LKRKSRFNKELIPPKSDEVGLIAKKLEDLGKVRVKLVA 549 (893)
T ss_pred ----------HhhhhhHHHHHHHhcCchHHHHHHH--HHHH-HhhhccCCCcCCCCcchhccchhhHHHHhhhhhhHHHH
Confidence 1122222112333445554444411 0011 44455555688999999999999999999999999999
Q ss_pred hhhcCChhhhhhccccccccc-----------hhhHHHHHHhhhcccccccccceeeccccccccccc-------cccCC
Q 001243 241 SDIGISPDLLKTTLADGTFAS-----------DLQCKLVKWLSNHAYLGGLLKNVKLKIKSSISSKAD-------IKNSD 302 (1116)
Q Consensus 241 ~~~g~s~~~~~a~l~~~~~~~-----------~~~~k~~~wl~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 302 (1116)
..+ +|.+.+.+-..+.+- +-|.--..+|-.|.||++.++...++.+.++.+..- +.-.|
T Consensus 550 hlr---qdlerv~~~~~~~trrekas~s~~ki~eq~f~~ql~l~~q~~~~~~~~~n~~~n~~f~~~~r~tl~~k~~~s~~ 626 (893)
T KOG0954|consen 550 HLR---QDLERVRNLCYTKTRREKASNSYAKIDEQLFPDQLLLQHQHMGSSDKGKNLKRNTTFYSERRATLCTKGIVSLD 626 (893)
T ss_pred HHH---HHHHHhhcccchhcccchhhhhHHHHHhHHHHHHHHHHHHhhcccccchhhhhhccccCCcchhHhhhccccCC
Confidence 988 777776644222221 112222234778999999999988887665544322 22233
Q ss_pred CCCcccc---ccCCCCccccccc--------------------CCCCcccccceecccCcccccccceec-CCCcccccc
Q 001243 303 SDGLMVS---ESDVADPVAVKSV--------------------PPRRRTKSSIRILRDDKMVSSSEEIFS-GNGIAADKD 358 (1116)
Q Consensus 303 ~~~~~~~---~~~~~~~~~~~~v--------------------p~~~rt~~~~~il~dn~~~cs~e~~~~-~~g~~~~~~ 358 (1116)
+++...+ .--+..|.+++.+ +.----++|.|||+....+=+-+---+ -|-+..+
T Consensus 627 ~d~~~~a~q~lq~il~p~~~~~~~~i~n~~r~~~t~n~rkns~~~v~ak~~nnrl~~s~Shsp~~~h~~sp~~~t~s~-- 704 (893)
T KOG0954|consen 627 SDILDPAVQKLQSILRPHEINICNNITNNTRCTLTENCRKNSIVVVPAKANNNRLLKSGSHSPAPDHSPSPKNSTVSD-- 704 (893)
T ss_pred ccccCHHHHHhhcccCcchhhhhhccccCcccccChhhccCcceeeecccccCccccCCCcCCccccCCCcCCCccch--
Confidence 4443221 1112222222111 000011222222222221111100000 0000000
Q ss_pred hhhhcccCCCCCccCccccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCCcccccccccccchhhccc---cC
Q 001243 359 EVKVEQLDGEEPAIHNKVSTPDCTEKSPTDPTGSEDSLARGSPMSEGSAAKPSDCGFFESCQSEEAALPDQINLL---NV 435 (1116)
Q Consensus 359 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 435 (1116)
++. .-++-|..|+. -... ..++.++. .+-+.+|- |+
T Consensus 705 -------~~~--h~gk~g~~pr~-------d~~s----~sasss~n---------------------~ksq~~skirsn~ 743 (893)
T KOG0954|consen 705 -------QKV--HHGKSGVIPRD-------DHGS----QSASSSSN---------------------VKSQNASKIRSNS 743 (893)
T ss_pred -------hhc--CCccCCCCccc-------cccc----cccccccC---------------------cccccccccccCc
Confidence 000 00111111111 1111 11111111 11111111 33
Q ss_pred CCCCCCCCCCCCcccccccCCCCCccccchhhhhh-hccccCccCCCcccccCCcccccccccccCcCccccccCccCcc
Q 001243 436 DQENPICSSVDTLVPYFINAKPSSGFFWHPYIHKS-LQMQSGLLSGNKVHKSDGDTEISRLEASSTASVCCNHQGRHSKC 514 (1116)
Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~hp~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (1116)
.++--.-+...+.++.+++.+.+.+|..|-||++. ..+.-..+++ ...+..+.++..+..-+.=.-..++++..
T Consensus 744 s~~s~n~ni~~~~sss~~~~~~~p~fsph~~~~~s~s~s~~e~~sk-----s~~~s~~~~~kq~y~~~~~~~~~~~q~~g 818 (893)
T KOG0954|consen 744 SQNSGNGNIPNPISSSLFNQEAYPGFSPHRYIHKSLSESGKEQTSK-----SSTDSDVARMKQTYTHLAGSEEGNKQLQG 818 (893)
T ss_pred ccccCCCcCCCCcchhhhccccCCCCCcchhhhhhhhhhccccccc-----ccccCCcchhhheecccccccchhhHHHH
Confidence 33333333334677799999999999999999999 5554444443 34455555554211111111122222222
Q ss_pred CCCcccCCccchHHHHHhhhccccccCCCcchhhHHHHHHHHHhhhhhhhhhhhHHHHHHHHHHhHHHHHhhhhcchhHH
Q 001243 515 NDMSCKSDGVNLEQVFKARTRGVLELSPTDEVEGEIIYFQHRLLGNAFSRKRLADNLVCKAVKTLNQEIDVARGRRWDAV 594 (1116)
Q Consensus 515 ~~~~~~~~~~~~~q~~~~~~~~~~~~~p~de~e~E~~~~q~~ll~~~~~~r~~~~~lv~~V~k~l~~E~~~~~~r~~d~~ 594 (1116)
..++-|+++++.+|+++.+|.|+.|+|.+|.|..+++.+..+++..+++..+++++++.|++....|+||..
T Consensus 819 --------~e~~~~~s~~~p~~~~d~s~~D~e~~~~~~~q~~~~g~~r~rkqssd~~n~~~asr~~~~~~~~~g~~~~~s 890 (893)
T KOG0954|consen 819 --------AETFLQLSKARPLGILDLSPEDEEEGELLYYQLQLLGTARSRKQSSDNLNYEVASRLPLEIDEQHGRRWDDS 890 (893)
T ss_pred --------HHHHHHhhccCCcccccCCCCchhhhhHHhhhhccccceecccccccCcChhhhccCCCccccccCcCcchh
Confidence 367899999999999999999999999999999999999999999999999999999999999999999987
Q ss_pred HH
Q 001243 595 LV 596 (1116)
Q Consensus 595 ~~ 596 (1116)
++
T Consensus 891 ~~ 892 (893)
T KOG0954|consen 891 LV 892 (893)
T ss_pred hc
Confidence 64
No 2
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=100.00 E-value=1.6e-36 Score=352.37 Aligned_cols=167 Identities=32% Similarity=0.767 Sum_probs=151.7
Q ss_pred CCCCCcCcccCCCCC-CCCCEEEccccCcccccccccCccCCCCceecccccccccCCCCCCCCCCccCCCccccccccC
Q 001243 702 KEHPRSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAECSLC 780 (1116)
Q Consensus 702 ke~d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C~LC 780 (1116)
.+++..|+||..+++ ..|+|||||+|+++|||.||||..+|+++|+|+.|... + .+.|+||
T Consensus 268 ~dedviCDvCrspD~e~~neMVfCd~Cn~cVHqaCyGIle~p~gpWlCr~Calg----------------~--~ppCvLC 329 (893)
T KOG0954|consen 268 YDEDVICDVCRSPDSEEANEMVFCDKCNICVHQACYGILEVPEGPWLCRTCALG----------------I--EPPCVLC 329 (893)
T ss_pred ccccceeceecCCCccccceeEEeccchhHHHHhhhceeecCCCCeeehhcccc----------------C--CCCeeec
Confidence 447899999999887 58999999999999999999999999999999999986 1 6789999
Q ss_pred CCCCCcceeccCC-chhhhccccccccceeecC-ccccccCccccCCCC--cccccccCcCCceeecCCcCcccccchhh
Q 001243 781 GGTTGAFRKSANG-QWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKGI--DVCCICRHKHGICIKCNYGNCQTTFHPTC 856 (1116)
Q Consensus 781 p~~gGALK~T~~g-~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~r--~~C~iC~~k~GAcIqCs~~~C~~sFHvtC 856 (1116)
|.+||+||++..| .|+|++||||+|||+|.+. .|+||..+..|+..+ +.|.+|+.+.||||+|+.+.|.++||++|
T Consensus 330 PkkGGamK~~~sgT~wAHvsCALwIPEVsie~~ekmePItkfs~IpesRwslvC~LCk~k~GACIqCs~k~C~t~fHv~C 409 (893)
T KOG0954|consen 330 PKKGGAMKPTKSGTKWAHVSCALWIPEVSIECPEKMEPITKFSHIPESRWSLVCNLCKVKSGACIQCSNKTCRTAFHVTC 409 (893)
T ss_pred cccCCcccccCCCCeeeEeeeeeccceeeccCHhhcCcccccCCCcHHHHHHHHHHhcccCcceEEecccchhhhccchh
Confidence 9999999999877 6999999999999999987 799999999999875 89999999999999999999999999999
Q ss_pred hhhcCceEEEee-CCCceeeeecCCCCchhh
Q 001243 857 ARSAGFYLNVKS-TGGNFQHKAYCEKHSLEQ 886 (1116)
Q Consensus 857 A~~aG~~~~~k~-~~g~~~~~iyC~kHs~~~ 886 (1116)
|+.+|..|.+.. .++...++.||.+|+..+
T Consensus 410 A~~aG~~~~~~~~~~D~v~~~s~c~khs~~~ 440 (893)
T KOG0954|consen 410 AFEAGLEMKTILKENDEVKFKSYCSKHSDHR 440 (893)
T ss_pred hhhcCCeeeeeeccCCchhheeecccccccc
Confidence 999999997643 456678899999987654
No 3
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=100.00 E-value=3.4e-35 Score=361.10 Aligned_cols=169 Identities=37% Similarity=0.834 Sum_probs=151.2
Q ss_pred CCCCCcCcccCCCCC-CCCCEEEccccCcccccccccCccCCCCceecccccccccCCCCCCCCCCccCCCccccccccC
Q 001243 702 KEHPRSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAECSLC 780 (1116)
Q Consensus 702 ke~d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C~LC 780 (1116)
-+.|.+|+||.+.+. +.|.|||||+|+++|||+|||++.+|+|.|+|+.|... |...+.|.||
T Consensus 216 ~~~D~~C~iC~~~~~~n~n~ivfCD~Cnl~VHq~Cygi~~ipeg~WlCr~Cl~s----------------~~~~v~c~~c 279 (1051)
T KOG0955|consen 216 LEEDAVCCICLDGECQNSNVIVFCDGCNLAVHQECYGIPFIPEGQWLCRRCLQS----------------PQRPVRCLLC 279 (1051)
T ss_pred cCCCccceeecccccCCCceEEEcCCCcchhhhhccCCCCCCCCcEeehhhccC----------------cCcccceEec
Confidence 356789999999986 57999999999999999999999999999999999976 2235799999
Q ss_pred CCCCCcceeccCCchhhhccccccccceeecC-ccccccCccccCCC--CcccccccCcC-CceeecCCcCcccccchhh
Q 001243 781 GGTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG--IDVCCICRHKH-GICIKCNYGNCQTTFHPTC 856 (1116)
Q Consensus 781 p~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~--r~~C~iC~~k~-GAcIqCs~~~C~~sFHvtC 856 (1116)
|..+||||+|++|+|+|++||+|+||++|.+. .+++|+++++|+.. ++.|++|+.++ ||||||+..+|.++||++|
T Consensus 280 p~~~gAFkqt~dgrw~Hv~caiwipev~F~nt~~~E~I~~i~~i~~aRwkL~cy~cK~~~~gaciqcs~~~c~~a~hvtc 359 (1051)
T KOG0955|consen 280 PSKGGAFKQTDDGRWAHVVCAIWIPEVSFANTVFLEPIDSIENIPPARWKLTCYICKQKGLGACIQCSKANCYTAFHVTC 359 (1051)
T ss_pred cCCCCcceeccCCceeeeehhhcccccccccchhhccccchhcCcHhhhhceeeeeccCCCCcceecchhhhhhhhhhhh
Confidence 99999999999999999999999999999998 79999999999965 69999999998 9999999999999999999
Q ss_pred hhhcCceEEEeeC--C---C---ceeeeecCCCCchhh
Q 001243 857 ARSAGFYLNVKST--G---G---NFQHKAYCEKHSLEQ 886 (1116)
Q Consensus 857 A~~aG~~~~~k~~--~---g---~~~~~iyC~kHs~~~ 886 (1116)
|+++|++|..... . + .+....||.+|.+..
T Consensus 360 a~~agl~m~~~~~~~~s~~~~s~~v~~~syC~~H~pp~ 397 (1051)
T KOG0955|consen 360 ARRAGLYMKSNTVKELSKNGTSQSVNKISYCDKHTPPG 397 (1051)
T ss_pred HhhcCceEeecccccccccccccccceeeeccCCCCch
Confidence 9999999984311 1 1 246788999999863
No 4
>KOG0956 consensus PHD finger protein AF10 [General function prediction only]
Probab=100.00 E-value=3.1e-35 Score=339.08 Aligned_cols=170 Identities=35% Similarity=0.800 Sum_probs=147.0
Q ss_pred CCCCCcCcccCCCCC-CCCCEEEccc--cCcccccccccCccCCCCceecccccccccCCCCCCCCCCccCCCccccccc
Q 001243 702 KEHPRSCDICRRSET-ILNPILICSG--CKVAVHLDCYRNAKESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAECS 778 (1116)
Q Consensus 702 ke~d~~CsVC~~~E~-~~N~IL~Cd~--C~laVHq~CYGi~~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C~ 778 (1116)
|+.-.-|+||-|--. ..|+|||||+ |.++|||.||||.++|+|+|||++|+... +. ..+.|.
T Consensus 2 KEMVGGCCVCSDErGWaeNPLVYCDG~nCsVAVHQaCYGIvqVPtGpWfCrKCesqe--ra-------------arvrCe 66 (900)
T KOG0956|consen 2 KEMVGGCCVCSDERGWAENPLVYCDGHNCSVAVHQACYGIVQVPTGPWFCRKCESQE--RA-------------ARVRCE 66 (900)
T ss_pred cccccceeeecCcCCCccCceeeecCCCceeeeehhcceeEecCCCchhhhhhhhhh--hh-------------ccceee
Confidence 455677999998555 4899999996 99999999999999999999999998751 11 268999
Q ss_pred cCCCCCCcceeccCCchhhhccccccccceeecC-ccccccCccccCCC--CcccccccCc-------CCceeecCCcCc
Q 001243 779 LCGGTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG--IDVCCICRHK-------HGICIKCNYGNC 848 (1116)
Q Consensus 779 LCp~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~--r~~C~iC~~k-------~GAcIqCs~~~C 848 (1116)
|||.++||||+|++|-|+||+||||+|||.|.|- .|+||- +..|+.. ...|+||... .|||++|+..+|
T Consensus 67 LCP~kdGALKkTDn~GWAHVVCALYIPEVrFgNV~TMEPIi-Lq~VP~dRfnKtCYIC~E~GrpnkA~~GACMtCNKs~C 145 (900)
T KOG0956|consen 67 LCPHKDGALKKTDNGGWAHVVCALYIPEVRFGNVHTMEPII-LQDVPHDRFNKTCYICNEEGRPNKAAKGACMTCNKSGC 145 (900)
T ss_pred cccCcccceecccCCCceEEEEEeeccceeeccccccccee-eccCchhhhcceeeeecccCCccccccccceecccccc
Confidence 9999999999999999999999999999999997 788876 4556655 5899999973 899999999999
Q ss_pred ccccchhhhhhcCceEEEee-CCCceeeeecCCCCchhhH
Q 001243 849 QTTFHPTCARSAGFYLNVKS-TGGNFQHKAYCEKHSLEQK 887 (1116)
Q Consensus 849 ~~sFHvtCA~~aG~~~~~k~-~~g~~~~~iyC~kHs~~~k 887 (1116)
.+.|||+||+.+|+.-+... .-++++|--||+.|..+.+
T Consensus 146 kqaFHVTCAQ~~GLLCEE~gn~~dNVKYCGYCk~HfsKlk 185 (900)
T KOG0956|consen 146 KQAFHVTCAQRAGLLCEEEGNISDNVKYCGYCKYHFSKLK 185 (900)
T ss_pred hhhhhhhHhhhhccceeccccccccceechhHHHHHHHhh
Confidence 99999999999999997653 2357888899999987544
No 5
>COG5141 PHD zinc finger-containing protein [General function prediction only]
Probab=100.00 E-value=2.7e-35 Score=330.03 Aligned_cols=169 Identities=32% Similarity=0.717 Sum_probs=148.2
Q ss_pred CCCCcCcccCCCCC-CCCCEEEccccCcccccccccCccCCCCceecccccccccCCCCCCCCCCccCCCccccccccCC
Q 001243 703 EHPRSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAECSLCG 781 (1116)
Q Consensus 703 e~d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C~LCp 781 (1116)
+-+..|.+|...++ +.|.||+||+|+++|||.||||..+|+|.|||++|.+. ++...-|.+||
T Consensus 191 ~~d~~C~~c~~t~~eN~naiVfCdgC~i~VHq~CYGI~f~peG~WlCrkCi~~----------------~~~i~~C~fCp 254 (669)
T COG5141 191 EFDDICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPEGFWLCRKCIYG----------------EYQIRCCSFCP 254 (669)
T ss_pred hhhhhhHhccccccCCcceEEEecCcchhhhhhcccceecCcchhhhhhhccc----------------ccceeEEEecc
Confidence 45778999999876 57999999999999999999999999999999999987 22245699999
Q ss_pred CCCCcceeccCCchhhhccccccccceeecC-ccccccCccccCCC--CcccccccCcCCceeecCCcCcccccchhhhh
Q 001243 782 GTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG--IDVCCICRHKHGICIKCNYGNCQTTFHPTCAR 858 (1116)
Q Consensus 782 ~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~--r~~C~iC~~k~GAcIqCs~~~C~~sFHvtCA~ 858 (1116)
..+||||+|.+|.|+|++||+|+||++|.+- .++||+||.+++.. ++.|++|+..+|+||||++.+|.++||++||+
T Consensus 255 s~dGaFkqT~dgrW~H~iCA~~~pelsF~~l~~~dpI~~i~sVs~srwkl~C~iCk~~~GtcIqCs~~nC~~aYHVtCAr 334 (669)
T COG5141 255 SSDGAFKQTSDGRWGHVICAMFNPELSFGHLLSKDPIDNIASVSSSRWKLGCLICKEFGGTCIQCSYFNCTRAYHVTCAR 334 (669)
T ss_pred CCCCceeeccCCchHhHhHHHhcchhccccccccchhhhhcccchhhHhheeeEEcccCcceeeecccchhhhhhhhhhh
Confidence 9999999999999999999999999999987 79999999999987 58899999999999999999999999999999
Q ss_pred hcCceEE-EeeCCC---ceeeeecCCCCchhhH
Q 001243 859 SAGFYLN-VKSTGG---NFQHKAYCEKHSLEQK 887 (1116)
Q Consensus 859 ~aG~~~~-~k~~~g---~~~~~iyC~kHs~~~k 887 (1116)
++|+++. ....++ .+....||.+|.+..-
T Consensus 335 rag~f~~~~~s~n~~s~~id~e~~c~kh~p~gy 367 (669)
T COG5141 335 RAGYFDLNIYSHNGISYCIDHEPLCRKHYPLGY 367 (669)
T ss_pred hcchhhhhhhcccccceeecchhhhcCCCCcch
Confidence 9999885 222222 2344569999988543
No 6
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=99.97 E-value=2e-33 Score=315.40 Aligned_cols=285 Identities=26% Similarity=0.468 Sum_probs=199.1
Q ss_pred cCcccCCCCC-CCCCEEEccccCcccccccccCcc---CC-------CCceecccccccccCCCCCCCCCCccCCCcccc
Q 001243 707 SCDICRRSET-ILNPILICSGCKVAVHLDCYRNAK---ES-------TGPWYCELCEELLSSRSSGAPSVNFWEKPYFVA 775 (1116)
Q Consensus 707 ~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~---ip-------eg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~ 775 (1116)
.|+||...-. ..|+||.|++|++.||..|||+.. |+ ..+|||+.|.+..+ .+
T Consensus 121 iCcVClg~rs~da~ei~qCd~CGi~VHEgCYGv~dn~si~s~~s~~stepWfCeaC~~Gvs-----------------~P 183 (707)
T KOG0957|consen 121 ICCVCLGQRSVDAGEILQCDKCGINVHEGCYGVLDNVSIPSGSSDCSTEPWFCEACLYGVS-----------------LP 183 (707)
T ss_pred EEEEeecCccccccceeeccccCceecccccccccccccCCCCccCCCCchhhhhHhcCCC-----------------CC
Confidence 7999998654 469999999999999999999762 22 36799999999743 46
Q ss_pred ccccCCCCCCcceeccCCchhhhccccccccceeecC-ccccccC--ccccCCCCcccccccCc----CCceeecCCcCc
Q 001243 776 ECSLCGGTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAG--MEAFPKGIDVCCICRHK----HGICIKCNYGNC 848 (1116)
Q Consensus 776 ~C~LCp~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVeg--ie~I~k~r~~C~iC~~k----~GAcIqCs~~~C 848 (1116)
.|.|||.++|+||.|+-|+|||++||||+|+|.|+.. .+.+|.. +.....+...|.+|..+ .|.||.|..+.|
T Consensus 184 ~CElCPn~~GifKetDigrWvH~iCALYvpGVafg~~~~l~~Vtl~em~ysk~Gak~Cs~Ced~~fARtGvci~CdaGMC 263 (707)
T KOG0957|consen 184 HCELCPNRFGIFKETDIGRWVHAICALYVPGVAFGQTHTLCGVTLEEMDYSKFGAKTCSACEDKIFARTGVCIRCDAGMC 263 (707)
T ss_pred ccccCCCcCCcccccchhhHHHHHHHhhcCccccccccccccccHHHhhhhhhccchhccccchhhhhcceeeeccchhh
Confidence 9999999999999999999999999999999999876 5555543 23233446899999974 899999999999
Q ss_pred ccccchhhhhhcCceEEEeeCCC-ceeeeecCCCCchhhHhhhhhcccchhhhhhhhhhHHHHHHH----HHHHHHHHHH
Q 001243 849 QTTFHPTCARSAGFYLNVKSTGG-NFQHKAYCEKHSLEQKMKAETQKHGVEELKGIKQIRVELERL----RLLCERIIKR 923 (1116)
Q Consensus 849 ~~sFHvtCA~~aG~~~~~k~~~g-~~~~~iyC~kHs~~~k~k~~~q~~~~eel~s~rr~rv~lE~l----rll~eri~kR 923 (1116)
..+|||+||+..|+.++...+++ ...|.+||++|+.....|.-...|..++...|+|+|+..... ..-.+.-..|
T Consensus 264 k~YfHVTCAQk~GlLvea~~e~DiAdpfya~CK~Ht~r~~~K~~rrny~~l~~~~~~r~~~k~~L~~~e~~~~p~~~eaq 343 (707)
T KOG0957|consen 264 KEYFHVTCAQKLGLLVEATDENDIADPFYAFCKKHTNRDNLKPYRRNYDDLEKSEARRITVKRRLRSGELEKNPQKKEAQ 343 (707)
T ss_pred hhhhhhhHHhhhcceeeccccccchhhHHHHHHhhcchhhhhhhhhhhHHHHHHHHHHHHHHHHHHhcccccCCCccHHH
Confidence 99999999999999998755444 346789999999865544445556666666777776431110 0000001112
Q ss_pred HHHHHHHhhhhHHHHHhhccccccccccccCcccCCCCCccccccccccCCCCCCchhHhhccCCceeeecccccccccc
Q 001243 924 EKIKRELILCSHEILAFKRDHHAARLVHGRIPFFPPDVSSESATTSLKGHTDSFKSCSEAFQRSDDVTVDSAASVKNRIK 1003 (1116)
Q Consensus 924 Eklkrel~~~~~dil~~k~~~~~a~s~~v~sp~~~~~~s~~~atTs~~~~~~~~~s~~~~~~r~d~~~~ds~~s~k~~~r 1003 (1116)
.+++++|....+ +. ..+... -..+|||....+...|||. .|+.|...++.--+++.|+ +|
T Consensus 344 ari~~~l~kv~~----k~---~~~k~~-~p~~wvp~~K~~RlLtsSA-----------sa~rrl~~KAE~mg~s~~~-f~ 403 (707)
T KOG0957|consen 344 ARIREELDKVIE----KE---CKNKPK-GPISWVPKPKQARLLTSSA-----------SAFRRLETKAEEMGLSRKE-FR 403 (707)
T ss_pred HHHHHHHHHHHH----HH---HhccCC-CCCCCCccccccccccchH-----------HHHHHHHHHHHHhcccHhh-hc
Confidence 222233322222 11 112222 2468999998888755555 6777777777777778777 65
Q ss_pred ccccC-ccccc--cCCCCcccCCCCCCCcc
Q 001243 1004 VYVPM-DADQR--TDDSSMSQNLYPRKPSE 1030 (1116)
Q Consensus 1004 ~~~~~-d~~~~--~~~~s~s~~~~~~~~~~ 1030 (1116)
+ .. |+--+ .-.=|+.++.|+..|.+
T Consensus 404 ~--~ead~~~~id~r~k~Hv~pafs~efi~ 431 (707)
T KOG0957|consen 404 Q--READPFFNIDLRSKSHVPPAFSKEFIE 431 (707)
T ss_pred c--cccCccccccccccccCCccccHHHHH
Confidence 3 11 22222 22346888888776543
No 7
>PF13832 zf-HC5HC2H_2: PHD-zinc-finger like domain
Probab=99.90 E-value=1.7e-24 Score=207.36 Aligned_cols=107 Identities=42% Similarity=0.859 Sum_probs=98.9
Q ss_pred CCCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCccc
Q 001243 2 CSLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSF 81 (1116)
Q Consensus 2 ClCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~F 81 (1116)
.|||.+|||||+|.+| .|||++||+|+|+++|.+...++||. +..|+.+|++++|.||+++.|+||+|++++|.++|
T Consensus 4 ~lC~~~~Galk~t~~~--~WvHv~Cal~~~~~~~~~~~~~~~v~-~~~i~~~~~~~~C~iC~~~~G~~i~C~~~~C~~~f 80 (110)
T PF13832_consen 4 VLCPKRGGALKRTSDG--QWVHVLCALWIPEVIFNNGESMEPVD-ISNIPPSRFKLKCSICGKSGGACIKCSHPGCSTAF 80 (110)
T ss_pred EeCCCCCCcccCccCC--cEEEeEccceeCccEEeechhcCccc-ceeecchhcCCcCcCCCCCCceeEEcCCCCCCcCC
Confidence 4899999999999965 89999999999999999999999995 99999999999999999999999999999999999
Q ss_pred chhhhhhcCceEEEccccCCcceeEeecCCCC
Q 001243 82 HPICAREARHRLEVWGKYGCNNVELRAFCAKH 113 (1116)
Q Consensus 82 HvtCA~~aG~~~e~~~~~g~~~~~~~~fC~~H 113 (1116)
||+||+.+|+.+++...+. ...+.+||++|
T Consensus 81 H~~CA~~~g~~~~~~~~~~--~~~~~~~C~~H 110 (110)
T PF13832_consen 81 HPTCARKAGLYFEIENEED--NVQFIAYCPKH 110 (110)
T ss_pred CHHHHHHCCCeEEeeecCC--CceEEEECCCC
Confidence 9999999999998874322 56789999999
No 8
>PF13832 zf-HC5HC2H_2: PHD-zinc-finger like domain
Probab=99.88 E-value=3.9e-23 Score=198.04 Aligned_cols=106 Identities=43% Similarity=0.980 Sum_probs=97.4
Q ss_pred ccccCCCCCCcceeccCCchhhhccccccccceeecC-ccccccCccccCCC--CcccccccCcCCceeecCCcCccccc
Q 001243 776 ECSLCGGTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG--IDVCCICRHKHGICIKCNYGNCQTTF 852 (1116)
Q Consensus 776 ~C~LCp~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~--r~~C~iC~~k~GAcIqCs~~~C~~sF 852 (1116)
.|.|||..+||||+|.++.|||++||+|+|++.|.+. .+++++ ++.++.. +..|.+|++..|++|+|..++|.++|
T Consensus 2 ~C~lC~~~~Galk~t~~~~WvHv~Cal~~~~~~~~~~~~~~~v~-~~~i~~~~~~~~C~iC~~~~G~~i~C~~~~C~~~f 80 (110)
T PF13832_consen 2 SCVLCPKRGGALKRTSDGQWVHVLCALWIPEVIFNNGESMEPVD-ISNIPPSRFKLKCSICGKSGGACIKCSHPGCSTAF 80 (110)
T ss_pred ccEeCCCCCCcccCccCCcEEEeEccceeCccEEeechhcCccc-ceeecchhcCCcCcCCCCCCceeEEcCCCCCCcCC
Confidence 6999999999999999999999999999999999987 577877 7787765 69999999999999999999999999
Q ss_pred chhhhhhcCceEEEeeCCCceeeeecCCCC
Q 001243 853 HPTCARSAGFYLNVKSTGGNFQHKAYCEKH 882 (1116)
Q Consensus 853 HvtCA~~aG~~~~~k~~~g~~~~~iyC~kH 882 (1116)
||+||+.+|+++.+...+....+.+||++|
T Consensus 81 H~~CA~~~g~~~~~~~~~~~~~~~~~C~~H 110 (110)
T PF13832_consen 81 HPTCARKAGLYFEIENEEDNVQFIAYCPKH 110 (110)
T ss_pred CHHHHHHCCCeEEeeecCCCceEEEECCCC
Confidence 999999999999887655567889999999
No 9
>KOG0956 consensus PHD finger protein AF10 [General function prediction only]
Probab=99.88 E-value=9.2e-24 Score=244.90 Aligned_cols=113 Identities=31% Similarity=0.646 Sum_probs=104.3
Q ss_pred CCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccc-------cCceeeCCCC
Q 001243 3 SLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVK-------CGACVRCSHG 75 (1116)
Q Consensus 3 lCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k-------~GAcIqCs~~ 75 (1116)
|||.+.||||+|+.| .|+||+||||||||.|+++..||||+ +..||.+|++..|+||... .|||++|...
T Consensus 67 LCP~kdGALKkTDn~--GWAHVVCALYIPEVrFgNV~TMEPIi-Lq~VP~dRfnKtCYIC~E~GrpnkA~~GACMtCNKs 143 (900)
T KOG0956|consen 67 LCPHKDGALKKTDNG--GWAHVVCALYIPEVRFGNVHTMEPII-LQDVPHDRFNKTCYICNEEGRPNKAAKGACMTCNKS 143 (900)
T ss_pred cccCcccceecccCC--CceEEEEEeeccceeeccccccccee-eccCchhhhcceeeeecccCCccccccccceecccc
Confidence 799999999999966 79999999999999999999999996 9999999999999999853 8999999999
Q ss_pred CCCcccchhhhhhcCceEEEccccCCcceeEeecCCCCCCCCCC
Q 001243 76 TCRTSFHPICAREARHRLEVWGKYGCNNVELRAFCAKHSDIQDN 119 (1116)
Q Consensus 76 ~C~~~FHvtCA~~aG~~~e~~~~~g~~~~~~~~fC~~Hr~~~~~ 119 (1116)
.|..+||||||+.+|+++|..+ .+-|+|+|-.||++|-.+-.+
T Consensus 144 ~CkqaFHVTCAQ~~GLLCEE~g-n~~dNVKYCGYCk~HfsKlkk 186 (900)
T KOG0956|consen 144 GCKQAFHVTCAQRAGLLCEEEG-NISDNVKYCGYCKYHFSKLKK 186 (900)
T ss_pred cchhhhhhhHhhhhccceeccc-cccccceechhHHHHHHHhhc
Confidence 9999999999999999999875 466889999999999876554
No 10
>COG5141 PHD zinc finger-containing protein [General function prediction only]
Probab=99.87 E-value=2.2e-23 Score=234.86 Aligned_cols=117 Identities=26% Similarity=0.589 Sum_probs=105.3
Q ss_pred CCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCcccc
Q 001243 3 SLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSFH 82 (1116)
Q Consensus 3 lCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~FH 82 (1116)
+||.+.||||.|.+| +|+|++||+|+||.+|++...++||.||..|+..||+|.|.||+.+.|+||||++.+|.++||
T Consensus 252 fCps~dGaFkqT~dg--rW~H~iCA~~~pelsF~~l~~~dpI~~i~sVs~srwkl~C~iCk~~~GtcIqCs~~nC~~aYH 329 (669)
T COG5141 252 FCPSSDGAFKQTSDG--RWGHVICAMFNPELSFGHLLSKDPIDNIASVSSSRWKLGCLICKEFGGTCIQCSYFNCTRAYH 329 (669)
T ss_pred eccCCCCceeeccCC--chHhHhHHHhcchhccccccccchhhhhcccchhhHhheeeEEcccCcceeeecccchhhhhh
Confidence 689999999999988 899999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhhcCceEE-EccccCC-cceeEeecCCCCCCCCCCCC
Q 001243 83 PICAREARHRLE-VWGKYGC-NNVELRAFCAKHSDIQDNSS 121 (1116)
Q Consensus 83 vtCA~~aG~~~e-~~~~~g~-~~~~~~~fC~~Hr~~~~~~~ 121 (1116)
||||++||+++- +...+|- ..+....||.+|.|......
T Consensus 330 VtCArrag~f~~~~~s~n~~s~~id~e~~c~kh~p~gy~~~ 370 (669)
T COG5141 330 VTCARRAGYFDLNIYSHNGISYCIDHEPLCRKHYPLGYGRM 370 (669)
T ss_pred hhhhhhcchhhhhhhcccccceeecchhhhcCCCCcchhcc
Confidence 999999999875 4433342 22456789999999998743
No 11
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=99.83 E-value=8.5e-22 Score=243.63 Aligned_cols=114 Identities=36% Similarity=0.692 Sum_probs=103.8
Q ss_pred CCCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhccccccccccccc-CceeeCCCCCCCcc
Q 001243 2 CSLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKC-GACVRCSHGTCRTS 80 (1116)
Q Consensus 2 ClCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~-GAcIqCs~~~C~~~ 80 (1116)
++||+++||||+|++| +|+|++||+|+||++|.+...++||.+|+.|+..||+|.|.+|+++. ||||||+..+|.++
T Consensus 277 ~~cp~~~gAFkqt~dg--rw~Hv~caiwipev~F~nt~~~E~I~~i~~i~~aRwkL~cy~cK~~~~gaciqcs~~~c~~a 354 (1051)
T KOG0955|consen 277 LLCPSKGGAFKQTDDG--RWAHVVCAIWIPEVSFANTVFLEPIDSIENIPPARWKLTCYICKQKGLGACIQCSKANCYTA 354 (1051)
T ss_pred EeccCCCCcceeccCC--ceeeeehhhcccccccccchhhccccchhcCcHhhhhceeeeeccCCCCcceecchhhhhhh
Confidence 5899999999999987 89999999999999999999999999999999999999999999998 99999999999999
Q ss_pred cchhhhhhcCceEEEc-cccC----C-cceeEeecCCCCCCCC
Q 001243 81 FHPICAREARHRLEVW-GKYG----C-NNVELRAFCAKHSDIQ 117 (1116)
Q Consensus 81 FHvtCA~~aG~~~e~~-~~~g----~-~~~~~~~fC~~Hr~~~ 117 (1116)
||||||+++|++|... ..++ . ..+.+.+||+.|.|..
T Consensus 355 ~hvtca~~agl~m~~~~~~~~s~~~~s~~v~~~syC~~H~pp~ 397 (1051)
T KOG0955|consen 355 FHVTCARRAGLYMKSNTVKELSKNGTSQSVNKISYCDKHTPPG 397 (1051)
T ss_pred hhhhhHhhcCceEeecccccccccccccccceeeeccCCCCch
Confidence 9999999999999843 1111 1 2367899999999996
No 12
>PF13771 zf-HC5HC2H: PHD-like zinc-binding domain
Probab=99.71 E-value=5.5e-18 Score=156.71 Aligned_cols=88 Identities=42% Similarity=0.737 Sum_probs=80.5
Q ss_pred hhHhhcccCceeeccCCc--cccccccccchhhcccccccccccccCceeeCCCCCCCcccchhhhhhcCceEEEccccC
Q 001243 23 HLFCSLLMPEVYIEDTMK--VEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSFHPICAREARHRLEVWGKYG 100 (1116)
Q Consensus 23 Hv~CALw~PEv~f~~~~~--~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~FHvtCA~~aG~~~e~~~~~g 100 (1116)
|++||||+||+++.+... +.+|.+|..++.++++++|++|+++.||+|+|.+++|.+.||++||+.+|+.+++..
T Consensus 1 H~~Calwsp~v~~~~~~~~~~~~i~~v~~~~~~~~~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~~~~~~--- 77 (90)
T PF13771_consen 1 HENCALWSPEVYFDESEDIGGFSIEDVEKEIKRRRKLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCFIEFDE--- 77 (90)
T ss_pred ChHHheecCceEEeCCCccccccHHhHHHHHHHHhCCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeEEEEcc---
Confidence 899999999999988864 677889999999999999999998889999999999999999999999999998874
Q ss_pred CcceeEeecCCCCC
Q 001243 101 CNNVELRAFCAKHS 114 (1116)
Q Consensus 101 ~~~~~~~~fC~~Hr 114 (1116)
++..+.+||++|+
T Consensus 78 -~~~~~~~~C~~H~ 90 (90)
T PF13771_consen 78 -DNGKFRIFCPKHS 90 (90)
T ss_pred -CCCceEEEChhcC
Confidence 2346899999996
No 13
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=99.68 E-value=5.6e-18 Score=191.88 Aligned_cols=113 Identities=26% Similarity=0.525 Sum_probs=97.8
Q ss_pred CCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccc-cccccccc----cCceeeCCCCCC
Q 001243 3 SLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKL-VCNICRVK----CGACVRCSHGTC 77 (1116)
Q Consensus 3 lCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~L-kC~iC~~k----~GAcIqCs~~~C 77 (1116)
|||+++|+||.|+.| +|||++||||+|+|-|++...+-+|. +..+....|.. .|++|..+ .|.||.|..|.|
T Consensus 187 lCPn~~GifKetDig--rWvH~iCALYvpGVafg~~~~l~~Vt-l~em~ysk~Gak~Cs~Ced~~fARtGvci~CdaGMC 263 (707)
T KOG0957|consen 187 LCPNRFGIFKETDIG--RWVHAICALYVPGVAFGQTHTLCGVT-LEEMDYSKFGAKTCSACEDKIFARTGVCIRCDAGMC 263 (707)
T ss_pred cCCCcCCcccccchh--hHHHHHHHhhcCcccccccccccccc-HHHhhhhhhccchhccccchhhhhcceeeeccchhh
Confidence 799999999999987 89999999999999999999998884 66666666654 79999864 899999999999
Q ss_pred CcccchhhhhhcCceEEEccccCCcceeEeecCCCCCCCCCC
Q 001243 78 RTSFHPICAREARHRLEVWGKYGCNNVELRAFCAKHSDIQDN 119 (1116)
Q Consensus 78 ~~~FHvtCA~~aG~~~e~~~~~g~~~~~~~~fC~~Hr~~~~~ 119 (1116)
.++||||||+.+|++++...++ +-.+.|++||++|..+...
T Consensus 264 k~YfHVTCAQk~GlLvea~~e~-DiAdpfya~CK~Ht~r~~~ 304 (707)
T KOG0957|consen 264 KEYFHVTCAQKLGLLVEATDEN-DIADPFYAFCKKHTNRDNL 304 (707)
T ss_pred hhhhhhhHHhhhcceeeccccc-cchhhHHHHHHhhcchhhh
Confidence 9999999999999999877532 2346899999999887653
No 14
>PF13771 zf-HC5HC2H: PHD-like zinc-binding domain
Probab=99.51 E-value=5.9e-15 Score=136.58 Aligned_cols=85 Identities=34% Similarity=0.686 Sum_probs=73.2
Q ss_pred hhccccccccceeecCc---cccccCccccCCC--CcccccccCcCCceeecCCcCcccccchhhhhhcCceEEEeeCCC
Q 001243 797 HAFCAEWVFESTFRRGQ---VNPVAGMEAFPKG--IDVCCICRHKHGICIKCNYGNCQTTFHPTCARSAGFYLNVKSTGG 871 (1116)
Q Consensus 797 HV~CALW~PEv~f~n~~---lepVegie~I~k~--r~~C~iC~~k~GAcIqCs~~~C~~sFHvtCA~~aG~~~~~k~~~g 871 (1116)
|++||+|+|++++.+.. +.++.+++.+.+. +++|++|+++.||+|+|..++|...||++||+.+|+.+.+.. .
T Consensus 1 H~~Calwsp~v~~~~~~~~~~~~i~~v~~~~~~~~~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~~~~~~--~ 78 (90)
T PF13771_consen 1 HENCALWSPEVYFDESEDIGGFSIEDVEKEIKRRRKLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCFIEFDE--D 78 (90)
T ss_pred ChHHheecCceEEeCCCccccccHHhHHHHHHHHhCCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeEEEEcc--C
Confidence 89999999999998763 4567777766544 699999999989999999999999999999999999998864 2
Q ss_pred ceeeeecCCCCc
Q 001243 872 NFQHKAYCEKHS 883 (1116)
Q Consensus 872 ~~~~~iyC~kHs 883 (1116)
...+.+||++|+
T Consensus 79 ~~~~~~~C~~H~ 90 (90)
T PF13771_consen 79 NGKFRIFCPKHS 90 (90)
T ss_pred CCceEEEChhcC
Confidence 336899999996
No 15
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=98.99 E-value=3.6e-10 Score=141.66 Aligned_cols=143 Identities=34% Similarity=0.755 Sum_probs=122.6
Q ss_pred CCCCCCCcCcccCCCCC-CCCCEEEccccCcccccccccCccCC-CCceecccccccccCCCCCCCCCCccCCCcccccc
Q 001243 700 FSKEHPRSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAKES-TGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAEC 777 (1116)
Q Consensus 700 ~~ke~d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~ip-eg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C 777 (1116)
+.+.....|.+|.+.+. ..|.++.|+.|...||+.|||....+ ...|+|+.|... .....|
T Consensus 568 l~~~~t~~c~~~~~~~~~~~n~~~~~~~~~~~~~s~~~g~~~~~~~~~~~~~~~~~~-----------------~~~r~~ 630 (1005)
T KOG1080|consen 568 LSKWTTERCAVCRDDEDWEKNVSIICDRCTRSVHSECYGNLKSYDGTSWVCDSCETL-----------------DIKRSC 630 (1005)
T ss_pred hcCCCcccccccccccccccceeeeeccccccCCCcccccCCCCCCCcchhhccccc-----------------cCCchh
Confidence 55566678999998764 67999999999999999999988776 457999999874 125789
Q ss_pred ccCCCCCCcceeccCCchhhhccccccccceeecC-ccccccCccccCCC--CcccccccCcCCceeecCCcCcccccch
Q 001243 778 SLCGGTTGAFRKSANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG--IDVCCICRHKHGICIKCNYGNCQTTFHP 854 (1116)
Q Consensus 778 ~LCp~~gGALK~T~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~--r~~C~iC~~k~GAcIqCs~~~C~~sFHv 854 (1116)
++|+..+||+++++.|.|+|+-||.|.|++.+.+. .+.|..++..++.. -..|.+ .|-|.+|. .|...||.
T Consensus 631 ~l~~~~g~al~p~d~gr~~~~e~a~~~~e~~~~~~~~~~p~~~~~~~p~~~~~~~~~~----~~~~~~~~--~~~~~~~~ 704 (1005)
T KOG1080|consen 631 CLCPVKGGALKPTDEGRWVHVECAWFRPEVCLASPERMEPAVGTFKIPALSFLKICFI----HGSCRQCC--KCETGSHA 704 (1005)
T ss_pred hhccccCcccCCCCccchhhhhchhccccccCCCccCCCCcccccccCccchhhhccc----cccccccc--hhhhccee
Confidence 99999999999999999999999999999999887 78898888777765 255666 57788888 89999999
Q ss_pred hhhhhcCceEE
Q 001243 855 TCARSAGFYLN 865 (1116)
Q Consensus 855 tCA~~aG~~~~ 865 (1116)
.||..+++.+.
T Consensus 705 ~~a~~~~~~~~ 715 (1005)
T KOG1080|consen 705 MCASRAGYIME 715 (1005)
T ss_pred hhhcCccChhh
Confidence 99999998874
No 16
>PF13831 PHD_2: PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=98.90 E-value=2.6e-10 Score=89.97 Aligned_cols=35 Identities=46% Similarity=1.149 Sum_probs=21.6
Q ss_pred CCCEEEccccCcccccccccCccCCCC-ceeccccc
Q 001243 718 LNPILICSGCKVAVHLDCYRNAKESTG-PWYCELCE 752 (1116)
Q Consensus 718 ~N~IL~Cd~C~laVHq~CYGi~~ipeg-~WlCd~C~ 752 (1116)
.|+||+|++|++.||+.|||+...+++ .|+|++|+
T Consensus 1 ~n~ll~C~~C~v~VH~~CYGv~~~~~~~~W~C~~C~ 36 (36)
T PF13831_consen 1 TNPLLFCDNCNVAVHQSCYGVSEVPDGDDWLCDRCE 36 (36)
T ss_dssp -CEEEE-SSS--EEEHHHHT-SS--SS-----HHH-
T ss_pred CCceEEeCCCCCcCChhhCCcccCCCCCcEECCcCC
Confidence 388999999999999999999988865 89999995
No 17
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=97.68 E-value=1.5e-05 Score=101.05 Aligned_cols=86 Identities=34% Similarity=0.696 Sum_probs=79.0
Q ss_pred CCCCCCCCCcccccCCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCccc
Q 001243 2 CSLPKAGGALKPVNGGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSF 81 (1116)
Q Consensus 2 ClCP~~gGALK~T~~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~F 81 (1116)
|+||..||||+|++.| +|+|+-||-|.||+.+.++..|+|..++..++...+...|.+ .|-|.||. .|.+.|
T Consensus 631 ~l~~~~g~al~p~d~g--r~~~~e~a~~~~e~~~~~~~~~~p~~~~~~~p~~~~~~~~~~----~~~~~~~~--~~~~~~ 702 (1005)
T KOG1080|consen 631 CLCPVKGGALKPTDEG--RWVHVECAWFRPEVCLASPERMEPAVGTFKIPALSFLKICFI----HGSCRQCC--KCETGS 702 (1005)
T ss_pred hhccccCcccCCCCcc--chhhhhchhccccccCCCccCCCCcccccccCccchhhhccc----cccccccc--hhhhcc
Confidence 8999999999999955 999999999999999999999999999999999999998888 58888888 899999
Q ss_pred chhhhhhcCceEEE
Q 001243 82 HPICAREARHRLEV 95 (1116)
Q Consensus 82 HvtCA~~aG~~~e~ 95 (1116)
|..||..+|+.++.
T Consensus 703 ~~~~a~~~~~~~~~ 716 (1005)
T KOG1080|consen 703 HAMCASRAGYIMEA 716 (1005)
T ss_pred eehhhcCccChhhh
Confidence 99999998887654
No 18
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=97.53 E-value=4.2e-05 Score=64.01 Aligned_cols=46 Identities=26% Similarity=0.775 Sum_probs=38.3
Q ss_pred cCcccCCCCCCCCCEEEccccCcccccccccCccC----CCCceecccccc
Q 001243 707 SCDICRRSETILNPILICSGCKVAVHLDCYRNAKE----STGPWYCELCEE 753 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~~i----peg~WlCd~C~~ 753 (1116)
+|.||... ...+.+|+|+.|+..+|..|+|+... +.+.|+|..|..
T Consensus 1 ~C~vC~~~-~~~~~~i~C~~C~~~~H~~C~~~~~~~~~~~~~~w~C~~C~~ 50 (51)
T PF00628_consen 1 YCPVCGQS-DDDGDMIQCDSCNRWYHQECVGPPEKAEEIPSGDWYCPNCRP 50 (51)
T ss_dssp EBTTTTSS-CTTSSEEEBSTTSCEEETTTSTSSHSHHSHHSSSBSSHHHHH
T ss_pred eCcCCCCc-CCCCCeEEcCCCChhhCcccCCCChhhccCCCCcEECcCCcC
Confidence 48899983 34789999999999999999998743 345899999963
No 19
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the KOG1084 consensus Transcription factor TCF20 [Transcription]
Probab=96.99 E-value=0.00031 Score=81.79 Aligned_cols=98 Identities=22% Similarity=0.380 Sum_probs=74.1
Q ss_pred ccccccCCCCCCcceec-cCCchhhhccccccccceeecC-ccccccCccccCCC-CcccccccCcCCceeecCCcCccc
Q 001243 774 VAECSLCGGTTGAFRKS-ANGQWVHAFCAEWVFESTFRRG-QVNPVAGMEAFPKG-IDVCCICRHKHGICIKCNYGNCQT 850 (1116)
Q Consensus 774 ~~~C~LCp~~gGALK~T-~~g~WVHV~CALW~PEv~f~n~-~lepVegie~I~k~-r~~C~iC~~k~GAcIqCs~~~C~~ 850 (1116)
...|++++.. ..++ ....|+|+.|++|.|.+.+..+ .+..+... +.+. .+.|..|.++ |+.+.|....|..
T Consensus 221 e~~~~l~~~~---~~~d~~~~~~~h~~c~~~~~~~~~~q~~~l~~~~~~--v~r~~~~~c~~c~k~-ga~~~c~~~~~~~ 294 (375)
T KOG1084|consen 221 EFFCALSPKA---TIPDIGFELWYHRYCALWAPNVHESQGGQLTNVDNA--VIRFPSLQCILCQKP-GATLKCVQASLLS 294 (375)
T ss_pred hhhhhhcCCC---cCCccchhHHHHHHHHhcCCcceeccCccccCchhh--hhcccchhcccccCC-CCchhhhhhhhhc
Confidence 3477787643 3444 4568999999999999999876 66665542 2222 3799999997 9999999999999
Q ss_pred ccchhhhhhcCceEEEeeCCCceeeeecCCCCc
Q 001243 851 TFHPTCARSAGFYLNVKSTGGNFQHKAYCEKHS 883 (1116)
Q Consensus 851 sFHvtCA~~aG~~~~~k~~~g~~~~~iyC~kHs 883 (1116)
.+|.+|+.........+ ..+++|+.|.
T Consensus 295 ~~h~~c~~~~~~~~~~~------~r~v~~~~h~ 321 (375)
T KOG1084|consen 295 NAHFPCARAKNGIPLDY------DRKVSCPRHR 321 (375)
T ss_pred ccCcccccCcccccchh------hhhccCCCCC
Confidence 99999998876554221 3567999999
No 21
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=96.82 E-value=0.00052 Score=75.43 Aligned_cols=51 Identities=33% Similarity=0.811 Sum_probs=43.4
Q ss_pred CCcCcccCCCCCCCCCEEEccccCcccccccccCc--cCCCCceeccccccccc
Q 001243 705 PRSCDICRRSETILNPILICSGCKVAVHLDCYRNA--KESTGPWYCELCEELLS 756 (1116)
Q Consensus 705 d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~--~ipeg~WlCd~C~~~~~ 756 (1116)
--.|+||+-+|+ ++++||||.|...+|-+|...+ ..|+|.|-|..|...+.
T Consensus 281 ck~csicgtsen-ddqllfcddcdrgyhmyclsppm~eppegswsc~KOG~~~~ 333 (336)
T KOG1244|consen 281 CKYCSICGTSEN-DDQLLFCDDCDRGYHMYCLSPPMVEPPEGSWSCHLCLEELK 333 (336)
T ss_pred cceeccccCcCC-CceeEeecccCCceeeEecCCCcCCCCCCchhHHHHHHHHh
Confidence 457999998765 6899999999999999999865 45799999999987643
No 22
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=96.68 E-value=0.00072 Score=74.63 Aligned_cols=45 Identities=24% Similarity=0.540 Sum_probs=40.4
Q ss_pred CCCcCcccCCCCCCCCCEEEccccCcccccccccCccCCCCceecc
Q 001243 704 HPRSCDICRRSETILNPILICSGCKVAVHLDCYRNAKESTGPWYCE 749 (1116)
Q Consensus 704 ~d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd 749 (1116)
+-..|.||..++- ..+++|||.|...+|..|.|...+|.|.|.||
T Consensus 313 ~C~lC~IC~~P~~-E~E~~FCD~CDRG~HT~CVGL~~lP~G~WICD 357 (381)
T KOG1512|consen 313 SCELCRICLGPVI-ESEHLFCDVCDRGPHTLCVGLQDLPRGEWICD 357 (381)
T ss_pred ccHhhhccCCccc-chheeccccccCCCCccccccccccCccchhh
Confidence 3457999998654 58899999999999999999999999999999
No 23
>KOG1084 consensus Transcription factor TCF20 [Transcription]
Probab=96.49 E-value=0.0014 Score=76.49 Aligned_cols=86 Identities=21% Similarity=0.391 Sum_probs=66.9
Q ss_pred CCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCcccchhhhhhcCceEEEc
Q 001243 17 GSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSFHPICAREARHRLEVW 96 (1116)
Q Consensus 17 G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~FHvtCA~~aG~~~e~~ 96 (1116)
|...|.|+.|++|.|++.+.+...+..+ .....+-+.+.|..|.++ |+.+.|...+|...+|.+|+..+-...--.
T Consensus 236 ~~~~~~h~~c~~~~~~~~~~q~~~l~~~---~~~v~r~~~~~c~~c~k~-ga~~~c~~~~~~~~~h~~c~~~~~~~~~~~ 311 (375)
T KOG1084|consen 236 GFELWYHRYCALWAPNVHESQGGQLTNV---DNAVIRFPSLQCILCQKP-GATLKCVQASLLSNAHFPCARAKNGIPLDY 311 (375)
T ss_pred chhHHHHHHHHhcCCcceeccCccccCc---hhhhhcccchhcccccCC-CCchhhhhhhhhcccCcccccCcccccchh
Confidence 5568999999999999999988777544 333333333899999975 999999999999999999997766432111
Q ss_pred cccCCcceeEeecCCCCC
Q 001243 97 GKYGCNNVELRAFCAKHS 114 (1116)
Q Consensus 97 ~~~g~~~~~~~~fC~~Hr 114 (1116)
.-.++|..|+
T Consensus 312 --------~r~v~~~~h~ 321 (375)
T KOG1084|consen 312 --------DRKVSCPRHR 321 (375)
T ss_pred --------hhhccCCCCC
Confidence 1378999999
No 24
>PF15446 zf-PHD-like: PHD/FYVE-zinc-finger like domain
Probab=96.47 E-value=0.0043 Score=64.59 Aligned_cols=127 Identities=17% Similarity=0.405 Sum_probs=72.8
Q ss_pred cCcccCCC-C-CCCCCEEEccccCcccccccccCc--------cCCCCc--eecccccccccCCCCCCCCCCccCCCccc
Q 001243 707 SCDICRRS-E-TILNPILICSGCKVAVHLDCYRNA--------KESTGP--WYCELCEELLSSRSSGAPSVNFWEKPYFV 774 (1116)
Q Consensus 707 ~CsVC~~~-E-~~~N~IL~Cd~C~laVHq~CYGi~--------~ipeg~--WlCd~C~~~~~~~~s~~~~vn~~~~p~~~ 774 (1116)
.|++|... + ...++||+|.||..++|+.|.|.. ++.++. .-|+.|......+.. ..| ..
T Consensus 1 ~C~~C~~~g~~~~kG~Lv~CQGCs~sYHk~CLG~Rs~ReHlVTKVg~d~FVLQCr~Cig~~~kKD~--------~aP-~~ 71 (175)
T PF15446_consen 1 TCDTCGYEGDDRNKGPLVYCQGCSSSYHKACLGPRSQREHLVTKVGDDDFVLQCRRCIGIAHKKDP--------RAP-HH 71 (175)
T ss_pred CcccccCCCCCccCCCeEEcCccChHHHhhhcCCccccceeeEEEcCCceEEechhhcChhhcccC--------CCC-CC
Confidence 49999752 3 357899999999999999999964 344444 569999876443322 113 36
Q ss_pred cccccCCCCCCcceeccCCchhhhcccccc-ccceeecCccccccCccccCCCCcccccccCcCCceeecCCcCcccccc
Q 001243 775 AECSLCGGTTGAFRKSANGQWVHAFCAEWV-FESTFRRGQVNPVAGMEAFPKGIDVCCICRHKHGICIKCNYGNCQTTFH 853 (1116)
Q Consensus 775 ~~C~LCp~~gGALK~T~~g~WVHV~CALW~-PEv~f~n~~lepVegie~I~k~r~~C~iC~~k~GAcIqCs~~~C~~sFH 853 (1116)
..|.-|...|-+-++.....- -+. -.....|+-..||..+..-.-+ ...-....|. .|...||
T Consensus 72 ~~C~~C~~~G~~c~pfr~r~T------~kQEe~~ReeNgG~DPit~Vd~~lvn--------N~~nVLFRC~--~C~RawH 135 (175)
T PF15446_consen 72 GMCQQCKKPGPSCKPFRPRKT------PKQEEKLREENGGVDPITPVDPELVN--------NPDNVLFRCT--SCHRAWH 135 (175)
T ss_pred CcccccCCCCCCCcccCCCCC------cHHHHHHHHHcCCCCCCccCCHHHcc--------ChhheEEecC--Cccceee
Confidence 689999887643332211000 000 0111223334454444321111 1134566788 8988999
Q ss_pred hhhhh
Q 001243 854 PTCAR 858 (1116)
Q Consensus 854 vtCA~ 858 (1116)
..-.-
T Consensus 136 ~~HLP 140 (175)
T PF15446_consen 136 FEHLP 140 (175)
T ss_pred hhhCC
Confidence 76543
No 25
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=96.30 E-value=0.0036 Score=74.16 Aligned_cols=134 Identities=16% Similarity=0.236 Sum_probs=84.3
Q ss_pred CcCcccCCCCC-CCCCEEEccccCcccccccccCccCCCCceecccccccccCCCCCCCCCCccCCCccccccccCCCCC
Q 001243 706 RSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAECSLCGGTT 784 (1116)
Q Consensus 706 ~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~C~LCp~~g 784 (1116)
..|.||..... ..|.++.|++|+...||.|--......+.|.|..|..... ...|
T Consensus 84 ~~~nv~~s~~~~p~~e~~~~~r~~~~~~q~~~i~~~~~~~~~~~~~c~~~~~------------------------~~~g 139 (464)
T KOG4323|consen 84 LNPNVLTSETVLPENEKVICGRCKSGYHQGCNIPRFPSLDIGESTECVFPIF------------------------SQEG 139 (464)
T ss_pred cCCcccccccccCchhhhhhhhhccCcccccCccCcCcCCcccccccccccc------------------------cccc
Confidence 56999997543 4789999999999999999644444467888888876421 1346
Q ss_pred CcceeccCCchhhhccccccccceeecCccccccCccccCCCCcccccccCc----CCceeecCCcCcccccchhhhhhc
Q 001243 785 GAFRKSANGQWVHAFCAEWVFESTFRRGQVNPVAGMEAFPKGIDVCCICRHK----HGICIKCNYGNCQTTFHPTCARSA 860 (1116)
Q Consensus 785 GALK~T~~g~WVHV~CALW~PEv~f~n~~lepVegie~I~k~r~~C~iC~~k----~GAcIqCs~~~C~~sFHvtCA~~a 860 (1116)
|++|... -+| |-+.|....+.. ....+..+.|+||..- .--+|+|. .|.++||-.|-+--
T Consensus 140 ~a~K~g~---~a~-------~~l~y~~~~l~w----D~~~~~n~qc~vC~~g~~~~~NrmlqC~--~C~~~fHq~Chqp~ 203 (464)
T KOG4323|consen 140 GALKKGR---LAR-------PSLPYPEASLDW----DSGHKVNLQCSVCYCGGPGAGNRMLQCD--KCRQWYHQACHQPL 203 (464)
T ss_pred ccccccc---ccc-------ccccCccccccc----CccccccceeeeeecCCcCccceeeeec--ccccHHHHHhccCC
Confidence 7776543 344 333332111110 1122334669999852 22689999 99999999998765
Q ss_pred CceEEEeeCCCceeeeecCCCCc
Q 001243 861 GFYLNVKSTGGNFQHKAYCEKHS 883 (1116)
Q Consensus 861 G~~~~~k~~~g~~~~~iyC~kHs 883 (1116)
--.+.+ +...+..||..=.
T Consensus 204 i~~~l~----~D~~~~w~C~~C~ 222 (464)
T KOG4323|consen 204 IKDELA----GDPFYEWFCDVCN 222 (464)
T ss_pred CCHhhc----cCccceEeehhhc
Confidence 433332 2234555776443
No 26
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=96.20 E-value=0.0024 Score=75.56 Aligned_cols=51 Identities=22% Similarity=0.585 Sum_probs=41.6
Q ss_pred CCCcCcccCCCCC-CCCCEEEccccCcccccccccCcc------CCCCceeccccccc
Q 001243 704 HPRSCDICRRSET-ILNPILICSGCKVAVHLDCYRNAK------ESTGPWYCELCEEL 754 (1116)
Q Consensus 704 ~d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi~~------ipeg~WlCd~C~~~ 754 (1116)
.+..|+||..+.+ ..|.||+|++|+--+|+.|.-... .+...|+|..|.+.
T Consensus 167 ~n~qc~vC~~g~~~~~NrmlqC~~C~~~fHq~Chqp~i~~~l~~D~~~~w~C~~C~~~ 224 (464)
T KOG4323|consen 167 VNLQCSVCYCGGPGAGNRMLQCDKCRQWYHQACHQPLIKDELAGDPFYEWFCDVCNRG 224 (464)
T ss_pred ccceeeeeecCCcCccceeeeecccccHHHHHhccCCCCHhhccCccceEeehhhccc
Confidence 3566999997654 678999999999999999986432 34678999999986
No 27
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=96.00 E-value=0.057 Score=59.66 Aligned_cols=52 Identities=29% Similarity=0.789 Sum_probs=41.7
Q ss_pred cCCCCCCCcCcccCCCCCCCCCEEEccc--cCcc-cccccccCccCCCCceecccccc
Q 001243 699 DFSKEHPRSCDICRRSETILNPILICSG--CKVA-VHLDCYRNAKESTGPWYCELCEE 753 (1116)
Q Consensus 699 e~~ke~d~~CsVC~~~E~~~N~IL~Cd~--C~la-VHq~CYGi~~ipeg~WlCd~C~~ 753 (1116)
+.+.++.++| -|... ..++||-||+ |..- ||..|.|+...|.|.|+|+-|+.
T Consensus 215 d~se~e~lYC-fCqqv--SyGqMVaCDn~nCkrEWFH~~CVGLk~pPKG~WYC~eCk~ 269 (271)
T COG5034 215 DNSEGEELYC-FCQQV--SYGQMVACDNANCKREWFHLECVGLKEPPKGKWYCPECKK 269 (271)
T ss_pred ccccCceeEE-Eeccc--ccccceecCCCCCchhheeccccccCCCCCCcEeCHHhHh
Confidence 3455566665 57664 3699999997 8764 89999999999999999999975
No 28
>KOG0825 consensus PHD Zn-finger protein [General function prediction only]
Probab=95.10 E-value=0.011 Score=72.73 Aligned_cols=52 Identities=27% Similarity=0.776 Sum_probs=43.6
Q ss_pred CCCCCcCcccCCCCCCCCCEEEccccCcc-cccccccCc--cCCCCceeccccccc
Q 001243 702 KEHPRSCDICRRSETILNPILICSGCKVA-VHLDCYRNA--KESTGPWYCELCEEL 754 (1116)
Q Consensus 702 ke~d~~CsVC~~~E~~~N~IL~Cd~C~la-VHq~CYGi~--~ipeg~WlCd~C~~~ 754 (1116)
-.+...|.||...|. .+.||.|+.|+.. +|-+|...+ .+|-+.|+|+-|...
T Consensus 212 ~~E~~~C~IC~~~Dp-EdVLLLCDsCN~~~YH~YCLDPdl~eiP~~eWYC~NC~dL 266 (1134)
T KOG0825|consen 212 SQEEVKCDICTVHDP-EDVLLLCDSCNKVYYHVYCLDPDLSESPVNEWYCTNCSLL 266 (1134)
T ss_pred ccccccceeeccCCh-HHhheeecccccceeeccccCcccccccccceecCcchhh
Confidence 344567999998765 5789999999999 999999764 478899999999764
No 29
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=94.58 E-value=0.015 Score=70.60 Aligned_cols=48 Identities=29% Similarity=0.800 Sum_probs=41.9
Q ss_pred CcCcccCCCCCCCCCEEEccccCcccccccccCc----cCCCCceeccccccc
Q 001243 706 RSCDICRRSETILNPILICSGCKVAVHLDCYRNA----KESTGPWYCELCEEL 754 (1116)
Q Consensus 706 ~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~----~ipeg~WlCd~C~~~ 754 (1116)
.+|+-|...+. -|.|+.||+|..+|||.|.-.+ .+|.|.|+|.-|...
T Consensus 254 ~fCsaCn~~~~-F~~~i~CD~Cp~sFH~~CLePPl~~eniP~g~W~C~ec~~k 305 (613)
T KOG4299|consen 254 DFCSACNGSGL-FNDIICCDGCPRSFHQTCLEPPLEPENIPPGSWFCPECKIK 305 (613)
T ss_pred HHHHHhCCccc-cccceeecCCchHHHHhhcCCCCCcccCCCCccccCCCeee
Confidence 58999999765 3999999999999999999765 467899999999874
No 30
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=93.64 E-value=0.039 Score=62.15 Aligned_cols=50 Identities=22% Similarity=0.624 Sum_probs=40.7
Q ss_pred CCCCCcCcccCCCCCCCCCEEEccc--cC-cccccccccCccCCCCceeccccccc
Q 001243 702 KEHPRSCDICRRSETILNPILICSG--CK-VAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 702 ke~d~~CsVC~~~E~~~N~IL~Cd~--C~-laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
.++..+|-.+ . ...+.||-||+ |. -=||..|.|+...|.|.|||..|...
T Consensus 216 ~~e~~yC~Cn-q--vsyg~Mi~CDn~~C~~eWFH~~CVGL~~~PkgkWyC~~C~~~ 268 (274)
T KOG1973|consen 216 PDEPTYCICN-Q--VSYGKMIGCDNPGCPIEWFHFTCVGLKTKPKGKWYCPRCKAE 268 (274)
T ss_pred CCCCEEEEec-c--cccccccccCCCCCCcceEEEeccccccCCCCcccchhhhhh
Confidence 3455666444 2 23799999999 99 78999999999999999999999875
No 31
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=91.17 E-value=0.15 Score=44.44 Aligned_cols=33 Identities=24% Similarity=0.679 Sum_probs=28.5
Q ss_pred CCCcCcccCCCCCCCCCEEEccccCcccccccc
Q 001243 704 HPRSCDICRRSETILNPILICSGCKVAVHLDCY 736 (1116)
Q Consensus 704 ~d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CY 736 (1116)
....|.+|.+.=..++++|+|..|+..+|+.||
T Consensus 4 ~~~~C~~Cg~~~~~~dDiVvCp~CgapyHR~C~ 36 (54)
T PF14446_consen 4 EGCKCPVCGKKFKDGDDIVVCPECGAPYHRDCW 36 (54)
T ss_pred cCccChhhCCcccCCCCEEECCCCCCcccHHHH
Confidence 346799999854447899999999999999999
No 33
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=89.44 E-value=0.24 Score=62.06 Aligned_cols=50 Identities=22% Similarity=0.636 Sum_probs=40.6
Q ss_pred CCCCCCCcCcccCCCCCCCCCEEEccccCcccccccccCc--cCCCCceecccccc
Q 001243 700 FSKEHPRSCDICRRSETILNPILICSGCKVAVHLDCYRNA--KESTGPWYCELCEE 753 (1116)
Q Consensus 700 ~~ke~d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~--~ipeg~WlCd~C~~ 753 (1116)
++..+...|.||.+ +..++.|+.|..++|..|-+.+ .+|.+.|+|.+|..
T Consensus 42 ~~~~~~e~c~ic~~----~g~~l~c~tC~~s~h~~cl~~pl~~~p~~~~~c~Rc~~ 93 (696)
T KOG0383|consen 42 WDDAEQEACRICAD----GGELLWCDTCPASFHASCLGPPLTPQPNGEFICPRCFC 93 (696)
T ss_pred cchhhhhhhhhhcC----CCcEEEeccccHHHHHHccCCCCCcCCccceeeeeecc
Confidence 34556678999998 5788899999999999999866 45556699999943
No 34
>TIGR02844 spore_III_D sporulation transcriptional regulator SpoIIID. Members of this protein are the transcriptional regulator SpoIIID, or stage III sporulation protein D. It is present in genomes if and only if the species is capable of endospore formation as occurs in the model species Bacillus subtilis. SpoIIID is a DNA binding protein that, in B. subtilis, downregulates many genes but also turns on ten genes.
Probab=87.54 E-value=0.76 Score=43.08 Aligned_cols=50 Identities=18% Similarity=0.282 Sum_probs=45.3
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcccc--ccccchhhHHHHHHhh
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLAD--GTFASDLQCKLVKWLS 272 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~--~~~~~~~~~k~~~wl~ 272 (1116)
..|+.-|.+ |+++++|||.+.|+|..++--.|.. ..++|..+.+|..-.+
T Consensus 9 ~~I~e~l~~-~~~ti~dvA~~~gvS~~TVsr~L~~~~~~Vs~~Tr~rV~~aa~ 60 (80)
T TIGR02844 9 LEIGKYIVE-TKATVRETAKVFGVSKSTVHKDVTERLPEINPELAEEVKEVLD 60 (80)
T ss_pred HHHHHHHHH-CCCCHHHHHHHhCCCHHHHHHHhcCCCCCCCHHHHHHHHHHHc
Confidence 567888888 9999999999999999999999985 4799999999998877
No 35
>PF10198 Ada3: Histone acetyltransferases subunit 3; InterPro: IPR019340 This entry is found in Ada3 and homologous proteins which function as part of histone acetyltransferase complexes []. Ada3 is an essential component of the Ada transcriptional coactivator (alteration/deficiency in activation) complex. It plays a key role in linking histone acetyltransferase-containing complexes to p53 (tumour suppressor protein) thereby regulating p53 acetylation, stability and transcriptional activation following DNA damage [].
Probab=86.97 E-value=4 Score=41.66 Aligned_cols=82 Identities=20% Similarity=0.197 Sum_probs=55.8
Q ss_pred Hhhhcccc---------ccCCCcchhhHHHHHHHHHhhhhhhhhhhhHHHHHHHHHHhHHHHHhhhhcchhHHHHHHHHH
Q 001243 531 KARTRGVL---------ELSPTDEVEGEIIYFQHRLLGNAFSRKRLADNLVCKAVKTLNQEIDVARGRRWDAVLVNQYLC 601 (1116)
Q Consensus 531 ~~~~~~~~---------~~~p~de~e~E~~~~q~~ll~~~~~~r~~~~~lv~~V~k~l~~E~~~~~~r~~d~~~~nq~L~ 601 (1116)
-.+..||+ .-..+|||-.||-.+|.+|-.....|+.+...|+.-|..++...=-+.-..--|..+..-|++
T Consensus 14 EL~~~Gll~~~d~~d~~~~~eDDEI~aeLR~lQ~eLr~~~~~N~~rk~rL~~~~~e~ma~QE~~~~l~~lD~~V~~aY~K 93 (131)
T PF10198_consen 14 ELRYIGLLSEDDDPDWQDNREDDEISAELRRLQAELREQSAHNNARKKRLLKIAKEEMARQEYKRILDDLDKQVEQAYKK 93 (131)
T ss_pred HHHHcCCcCCCCccccccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 57888999 336679999999999999999888888887777755555553222222223344555666888
Q ss_pred HHHHHHHccCc
Q 001243 602 ELREAKKQGRK 612 (1116)
Q Consensus 602 ~vreakkq~~k 612 (1116)
.++..++..++
T Consensus 94 r~~~~~kkkk~ 104 (131)
T PF10198_consen 94 RMRARKKKKKK 104 (131)
T ss_pred HHHHhhcccCc
Confidence 87776555443
No 36
>PF09012 FeoC: FeoC like transcriptional regulator; InterPro: IPR015102 This entry contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a [Fe-S] dependent transcriptional repressor []. ; PDB: 1XN7_A 2K02_A.
Probab=83.08 E-value=0.79 Score=41.10 Aligned_cols=31 Identities=35% Similarity=0.694 Sum_probs=24.2
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
|+.-|.++|.||+.+||.++++||+.|++-|
T Consensus 5 i~~~l~~~~~~S~~eLa~~~~~s~~~ve~mL 35 (69)
T PF09012_consen 5 IRDYLRERGRVSLAELAREFGISPEAVEAML 35 (69)
T ss_dssp HHHHHHHS-SEEHHHHHHHTT--HHHHHHHH
T ss_pred HHHHHHHcCCcCHHHHHHHHCcCHHHHHHHH
Confidence 4444668999999999999999999999986
No 37
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=80.39 E-value=1.1 Score=37.57 Aligned_cols=30 Identities=27% Similarity=0.835 Sum_probs=25.4
Q ss_pred ccccccCc--CCceeecCCcCcccccchhhhhhc
Q 001243 829 VCCICRHK--HGICIKCNYGNCQTTFHPTCARSA 860 (1116)
Q Consensus 829 ~C~iC~~k--~GAcIqCs~~~C~~sFHvtCA~~a 860 (1116)
.|.+|++. .+.+|+|. .|..+||..|....
T Consensus 1 ~C~vC~~~~~~~~~i~C~--~C~~~~H~~C~~~~ 32 (51)
T PF00628_consen 1 YCPVCGQSDDDGDMIQCD--SCNRWYHQECVGPP 32 (51)
T ss_dssp EBTTTTSSCTTSSEEEBS--TTSCEEETTTSTSS
T ss_pred eCcCCCCcCCCCCeEEcC--CCChhhCcccCCCC
Confidence 47888873 68899999 99999999998654
No 38
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=80.23 E-value=1 Score=50.78 Aligned_cols=50 Identities=22% Similarity=0.345 Sum_probs=39.4
Q ss_pred CCCcCcccCCCCC-----CCCCEEEccccCcccccccccCcc-----CCCCceecccccc
Q 001243 704 HPRSCDICRRSET-----ILNPILICSGCKVAVHLDCYRNAK-----ESTGPWYCELCEE 753 (1116)
Q Consensus 704 ~d~~CsVC~~~E~-----~~N~IL~Cd~C~laVHq~CYGi~~-----ipeg~WlCd~C~~ 753 (1116)
....|.+|++... .-|-+|.|.-|..+.|..|...+. +....|.|--|..
T Consensus 257 ~~~~~~~~~~~~~~~~~~r~~S~I~C~~C~~~~HP~Ci~M~~elv~~~KTY~W~C~~C~l 316 (381)
T KOG1512|consen 257 RRNERKHFWDIQTNIIQSRRNSWIVCKPCATRPHPYCVAMIPELVGQYKTYFWKCSSCEL 316 (381)
T ss_pred chhhhhhhhcchhhhhhhhhccceeecccccCCCCcchhcCHHHHhHHhhcchhhcccHh
Confidence 3457999998642 257899999999999999997653 3467899988874
No 39
>PF00356 LacI: Bacterial regulatory proteins, lacI family; InterPro: IPR000843 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. These proteins are very diverse, but for convenience may be grouped into subfamilies on the basis of sequence similarity. One such family groups together a range of proteins, including ascG, ccpA, cytR, ebgR, fruR, galR, galS, lacI, malI, opnR, purF, rafR, rbtR and scrR [, ]. Within this family, the HTH motif is situated towards the N terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3KJX_C 1ZAY_A 1VPW_A 2PUA_A 1QQA_A 1PNR_A 1JFT_A 1QP4_A 2PUD_A 1JH9_A ....
Probab=79.30 E-value=1.9 Score=36.38 Aligned_cols=44 Identities=18% Similarity=0.317 Sum_probs=39.2
Q ss_pred chhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhccccc
Q 001243 235 NVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLG 278 (1116)
Q Consensus 235 ~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~ 278 (1116)
+++|||.+.|+|.-++--+|+ ....++..+.||.+..+..-|.+
T Consensus 1 Ti~dIA~~agvS~~TVSr~ln~~~~vs~~tr~rI~~~a~~lgY~p 45 (46)
T PF00356_consen 1 TIKDIAREAGVSKSTVSRVLNGPPRVSEETRERILEAAEELGYRP 45 (46)
T ss_dssp CHHHHHHHHTSSHHHHHHHHTTCSSSTHHHHHHHHHHHHHHTB-S
T ss_pred CHHHHHHHHCcCHHHHHHHHhCCCCCCHHHHHHHHHHHHHHCCCC
Confidence 478999999999999999999 57799999999999999888865
No 40
>PF08220 HTH_DeoR: DeoR-like helix-turn-helix domain; InterPro: IPR001034 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after the Escherichia coli protein DeoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerisation domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular
Probab=75.98 E-value=1.9 Score=37.51 Aligned_cols=33 Identities=36% Similarity=0.624 Sum_probs=29.7
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
..||+-|-+.|+++++++|..+|+|+.++.--|
T Consensus 3 ~~Il~~l~~~~~~s~~ela~~~~VS~~TiRRDl 35 (57)
T PF08220_consen 3 QQILELLKEKGKVSVKELAEEFGVSEMTIRRDL 35 (57)
T ss_pred HHHHHHHHHcCCEEHHHHHHHHCcCHHHHHHHH
Confidence 457888888999999999999999999998876
No 41
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=75.94 E-value=1.3 Score=54.67 Aligned_cols=49 Identities=31% Similarity=0.802 Sum_probs=36.2
Q ss_pred CcCcccCCCCCCCCCEEEccccCcccccccccCc--cCCCCceecccccccc
Q 001243 706 RSCDICRRSETILNPILICSGCKVAVHLDCYRNA--KESTGPWYCELCEELL 755 (1116)
Q Consensus 706 ~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~--~ipeg~WlCd~C~~~~ 755 (1116)
..|-.|.. -.+.+.+++|++|.+++|-+|--.. .++.++|+|..|....
T Consensus 69 rvCe~c~~-~gD~~kf~~Ck~cDvsyh~yc~~P~~~~v~sg~~~ckk~~~c~ 119 (694)
T KOG4443|consen 69 RVCEACGT-TGDPKKFLLCKRCDVSYHCYCQKPPNDKVPSGPWLCKKCTRCR 119 (694)
T ss_pred eeeeeccc-cCCcccccccccccccccccccCCccccccCcccccHHHHhhh
Confidence 44555542 2246889999999999998886533 5789999999997653
No 42
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=75.74 E-value=1.4 Score=49.92 Aligned_cols=45 Identities=29% Similarity=0.651 Sum_probs=33.2
Q ss_pred cccccccCcCCceeecCCcCcc-cccchhhhhhcCceEEEeeCCCceeeeecCCCC
Q 001243 828 DVCCICRHKHGICIKCNYGNCQ-TTFHPTCARSAGFYLNVKSTGGNFQHKAYCEKH 882 (1116)
Q Consensus 828 ~~C~iC~~k~GAcIqCs~~~C~-~sFHvtCA~~aG~~~~~k~~~g~~~~~iyC~kH 882 (1116)
..|.-+...+|.+|.|...+|. .|||.+|- |+.. .+.|.| ||+.=
T Consensus 220 ~yC~Cnqvsyg~Mi~CDn~~C~~eWFH~~CV---GL~~---~PkgkW----yC~~C 265 (274)
T KOG1973|consen 220 TYCICNQVSYGKMIGCDNPGCPIEWFHFTCV---GLKT---KPKGKW----YCPRC 265 (274)
T ss_pred EEEEecccccccccccCCCCCCcceEEEecc---cccc---CCCCcc----cchhh
Confidence 5555555579999999999999 99999996 4443 334556 88743
No 43
>KOG1473 consensus Nucleosome remodeling factor, subunit NURF301/BPTF [Chromatin structure and dynamics; Transcription]
Probab=74.62 E-value=2.9 Score=54.50 Aligned_cols=118 Identities=20% Similarity=0.360 Sum_probs=75.3
Q ss_pred cCCCCCCCcCcccCCCCCCCCCEEEccccCcccccccccCcc--CCCCceecccccccccCCCCCCCCCCccCCCccccc
Q 001243 699 DFSKEHPRSCDICRRSETILNPILICSGCKVAVHLDCYRNAK--ESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVAE 776 (1116)
Q Consensus 699 e~~ke~d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~~--ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~~ 776 (1116)
+++..-+..|-+|.+ .+.++.|..|...||..|.--+. .|...|-|..|..-+-+ + .+.
T Consensus 338 e~~~~~ddhcrf~~d----~~~~lc~Et~prvvhlEcv~hP~~~~~s~~~e~evc~~hkvn---------g------vvd 398 (1414)
T KOG1473|consen 338 EGEIEYDDHCRFCHD----LGDLLCCETCPRVVHLECVFHPRFAVPSAFWECEVCNIHKVN---------G------VVD 398 (1414)
T ss_pred ccceeecccccccCc----ccceeecccCCceEEeeecCCccccCCCccchhhhhhhhccC---------c------ccc
Confidence 344555678999998 68999999999999999976543 56788999999865322 1 345
Q ss_pred cccCCCCCCccee-ccCCchhhhccccccccceeecCccccccCccccCCCCcccccccCcCCceeecCCcCcccccch-
Q 001243 777 CSLCGGTTGAFRK-SANGQWVHAFCAEWVFESTFRRGQVNPVAGMEAFPKGIDVCCICRHKHGICIKCNYGNCQTTFHP- 854 (1116)
Q Consensus 777 C~LCp~~gGALK~-T~~g~WVHV~CALW~PEv~f~n~~lepVegie~I~k~r~~C~iC~~k~GAcIqCs~~~C~~sFHv- 854 (1116)
|+|=+.+.+...+ +..|.==|- +. .+.+ ...|.||+.. |. .-|+|..|...||.
T Consensus 399 ~vl~~~K~~~~iR~~~iG~dr~g------r~----------ywfi------~rrl~Ie~~d-et-~l~yysT~pqly~ll 454 (1414)
T KOG1473|consen 399 CVLPPSKNVDSIRHTPIGRDRYG------RK----------YWFI------SRRLRIEGMD-ET-LLWYYSTCPQLYHLL 454 (1414)
T ss_pred cccChhhcccceeccCCCcCccc------cc----------hhce------eeeeEEecCC-Cc-EEEEecCcHHHHHHH
Confidence 7776665544422 222110000 00 0000 2578889864 43 34666779999999
Q ss_pred hhhhh
Q 001243 855 TCARS 859 (1116)
Q Consensus 855 tCA~~ 859 (1116)
.|.-.
T Consensus 455 ~cLd~ 459 (1414)
T KOG1473|consen 455 RCLDR 459 (1414)
T ss_pred HHhch
Confidence 78653
No 44
>KOG1044 consensus Actin-binding LIM Zn-finger protein Limatin involved in axon guidance [Signal transduction mechanisms; Cytoskeleton]
Probab=74.23 E-value=1.6 Score=53.41 Aligned_cols=35 Identities=29% Similarity=0.527 Sum_probs=23.8
Q ss_pred cccccccCc-CCceeecCCcCcccccchhhhhhcCceEEE
Q 001243 828 DVCCICRHK-HGICIKCNYGNCQTTFHPTCARSAGFYLNV 866 (1116)
Q Consensus 828 ~~C~iC~~k-~GAcIqCs~~~C~~sFHvtCA~~aG~~~~~ 866 (1116)
.+|..|.+- .|..++=- + ..|||+||+-..|--.|
T Consensus 193 vkc~~c~~fisgkvLqag--~--kh~HPtCARCsRCgqmF 228 (670)
T KOG1044|consen 193 VKCEECEKFISGKVLQAG--D--KHFHPTCARCSRCGQMF 228 (670)
T ss_pred eehHHhhhhhhhhhhhcc--C--cccCcchhhhhhhcccc
Confidence 667777763 45444433 4 79999999988765544
No 45
>KOG1245 consensus Chromatin remodeling complex WSTF-ISWI, large subunit (contains heterochromatin localization, PHD and BROMO domains) [Chromatin structure and dynamics]
Probab=72.12 E-value=1.1 Score=60.69 Aligned_cols=51 Identities=31% Similarity=0.762 Sum_probs=42.5
Q ss_pred CCCcCcccCCCCCCCCCEEEccccCcccccccccCc--cCCCCceecccccccc
Q 001243 704 HPRSCDICRRSETILNPILICSGCKVAVHLDCYRNA--KESTGPWYCELCEELL 755 (1116)
Q Consensus 704 ~d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~--~ipeg~WlCd~C~~~~ 755 (1116)
....|-||+.... .+.|+.|+.|.-.+|..|.... ..|.++|+|..|....
T Consensus 1107 ~~~~c~~cr~k~~-~~~m~lc~~c~~~~h~~C~rp~~~~~~~~dW~C~~c~~e~ 1159 (1404)
T KOG1245|consen 1107 VNALCKVCRRKKQ-DEKMLLCDECLSGFHLFCLRPALSSVPPGDWMCPSCRKEH 1159 (1404)
T ss_pred chhhhhhhhhccc-chhhhhhHhhhhhHHHHhhhhhhccCCcCCccCCccchhh
Confidence 4578999998533 4789999999999999999754 5788999999998764
No 46
>PF00130 C1_1: Phorbol esters/diacylglycerol binding domain (C1 domain); InterPro: IPR002219 Diacylglycerol (DAG) is an important second messenger. Phorbol esters (PE) are analogues of DAG and potent tumour promoters that cause a variety of physiological changes when administered to both cells and tissues. DAG activates a family of serine/threonine protein kinases, collectively known as protein kinase C (PKC) []. Phorbol esters can directly stimulate PKC. The N-terminal region of PKC, known as C1, has been shown [] to bind PE and DAG in a phospholipid and zinc-dependent fashion. The C1 region contains one or two copies (depending on the isozyme of PKC) of a cysteine-rich domain, which is about 50 amino-acid residues long, and which is essential for DAG/PE-binding. The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are probably the six cysteines and two histidines that are conserved in this domain.; GO: 0035556 intracellular signal transduction; PDB: 1RFH_A 2FNF_X 3PFQ_A 1PTQ_A 1PTR_A 2VRW_B 1XA6_A 2ENN_A 1TBN_A 1TBO_A ....
Probab=71.85 E-value=3.1 Score=35.08 Aligned_cols=35 Identities=26% Similarity=0.592 Sum_probs=26.5
Q ss_pred CCcCcccCCCC-CCCCCEEEccccCcccccccccCc
Q 001243 705 PRSCDICRRSE-TILNPILICSGCKVAVHLDCYRNA 739 (1116)
Q Consensus 705 d~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CYGi~ 739 (1116)
...|++|...= +...+-+.|..|++.+|..|....
T Consensus 11 ~~~C~~C~~~i~g~~~~g~~C~~C~~~~H~~C~~~~ 46 (53)
T PF00130_consen 11 PTYCDVCGKFIWGLGKQGYRCSWCGLVCHKKCLSKV 46 (53)
T ss_dssp TEB-TTSSSBECSSSSCEEEETTTT-EEETTGGCTS
T ss_pred CCCCcccCcccCCCCCCeEEECCCCChHhhhhhhhc
Confidence 46899999853 246789999999999999998543
No 47
>cd00029 C1 Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Probab=70.57 E-value=2.5 Score=34.78 Aligned_cols=34 Identities=41% Similarity=0.812 Sum_probs=26.9
Q ss_pred CCcCcccCCCCC-CCCCEEEccccCcccccccccC
Q 001243 705 PRSCDICRRSET-ILNPILICSGCKVAVHLDCYRN 738 (1116)
Q Consensus 705 d~~CsVC~~~E~-~~N~IL~Cd~C~laVHq~CYGi 738 (1116)
..+|++|...=. ...+-+.|..|++.||..|...
T Consensus 11 ~~~C~~C~~~i~~~~~~~~~C~~C~~~~H~~C~~~ 45 (50)
T cd00029 11 PTFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADK 45 (50)
T ss_pred CCChhhcchhhhccccceeEcCCCCCchhhhhhcc
Confidence 467999997532 2357889999999999999853
No 48
>PF02318 FYVE_2: FYVE-type zinc finger; InterPro: IPR003315 This entry represents the zinc-binding domain found in rabphilin Rab3A. The small G protein Rab3A plays an important role in the regulation of neurotransmitter release. The crystal structure of the small G protein Rab3A complexed with the effector domain of rabphilin-3A shows that the effector domain of rabphilin-3A contacts Rab3A in two distinct areas. The first interface involves the Rab3A switch I and switch II regions, which are sensitive to the nucleotide-binding state of Rab3A. The second interface consists of a deep pocket in Rab3A that interacts with a SGAWFF structural element of rabphilin-3A. Sequence and structure analysis, and biochemical data suggest that this pocket, or Rab complementarity-determining region (RabCDR), establishes a specific interaction between each Rab protein and its effectors. It has been suggested that RabCDRs could be major determinants of effector specificity during vesicle trafficking and fusion [].; GO: 0008270 zinc ion binding, 0017137 Rab GTPase binding, 0006886 intracellular protein transport; PDB: 2CSZ_A 2ZET_C 1ZBD_B 3BC1_B 2CJS_C 2A20_A.
Probab=70.13 E-value=3 Score=41.41 Aligned_cols=48 Identities=25% Similarity=0.645 Sum_probs=35.8
Q ss_pred CcCcccCCCC-CCCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 706 RSCDICRRSE-TILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 706 ~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
..|..|...- ...|.-..|..|...|=+.| |+....+..|+|..|...
T Consensus 55 ~~C~~C~~~fg~l~~~~~~C~~C~~~VC~~C-~~~~~~~~~WlC~vC~k~ 103 (118)
T PF02318_consen 55 RHCARCGKPFGFLFNRGRVCVDCKHRVCKKC-GVYSKKEPIWLCKVCQKQ 103 (118)
T ss_dssp SB-TTTS-BCSCTSTTCEEETTTTEEEETTS-EEETSSSCCEEEHHHHHH
T ss_pred cchhhhCCcccccCCCCCcCCcCCccccCcc-CCcCCCCCCEEChhhHHH
Confidence 4699998743 34577799999999999999 444445778999999875
No 49
>smart00109 C1 Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Probab=67.08 E-value=2.6 Score=34.26 Aligned_cols=33 Identities=39% Similarity=0.632 Sum_probs=25.4
Q ss_pred CCcCcccCCCCCCCCCEEEccccCccccccccc
Q 001243 705 PRSCDICRRSETILNPILICSGCKVAVHLDCYR 737 (1116)
Q Consensus 705 d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYG 737 (1116)
..+|.+|...-....+-+.|..|++.||..|..
T Consensus 11 ~~~C~~C~~~i~~~~~~~~C~~C~~~~H~~C~~ 43 (49)
T smart00109 11 PTKCCVCRKSIWGSFQGLRCSWCKVKCHKKCAE 43 (49)
T ss_pred CCCccccccccCcCCCCcCCCCCCchHHHHHHh
Confidence 467999998532111578999999999999974
No 50
>smart00530 HTH_XRE Helix-turn-helix XRE-family like proteins.
Probab=66.32 E-value=8.4 Score=30.13 Aligned_cols=48 Identities=23% Similarity=0.261 Sum_probs=36.2
Q ss_pred HHHHhhhCCcchhhhhhhhcCChhhhhhccccc-cccchhhHHHHHHhh
Q 001243 225 LKKLIDRGKVNVKDIASDIGISPDLLKTTLADG-TFASDLQCKLVKWLS 272 (1116)
Q Consensus 225 lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~-~~~~~~~~k~~~wl~ 272 (1116)
++++++..+++.+++|..+|+++.+|..-+... ....+...+|..+|.
T Consensus 2 i~~~~~~~~~s~~~la~~~~i~~~~i~~~~~~~~~~~~~~~~~i~~~~~ 50 (56)
T smart00530 2 LKELREEKGLTQEELAEKLGVSRSTLSRIENGKRKPSLETLKKLAKALG 50 (56)
T ss_pred HHHHHHHcCCCHHHHHHHhCCCHHHHHHHHCCCCCCCHHHHHHHHHHhC
Confidence 456677888999999999999999998776533 335566667776663
No 51
>TIGR02607 antidote_HigA addiction module antidote protein, HigA family. Members of this family form a distinct clade within the larger family HTH_3 of helix-turn-helix proteins, described by Pfam model pfam01381. Members of this clade are strictly bacterial and nearly always shorter than 110 amino acids. This family includes the characterized member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is uncoded by the upstream member of the gene pair.
Probab=65.64 E-value=10 Score=33.98 Aligned_cols=54 Identities=19% Similarity=0.330 Sum_probs=41.5
Q ss_pred cchHHHHH-HHhhhCCcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhh
Q 001243 219 LNFTLILK-KLIDRGKVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLS 272 (1116)
Q Consensus 219 ~~~~~~lk-kli~~gkv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~ 272 (1116)
......++ .|++...++..++|..+|||..+|-.-+. ...+.++.-.+|.+.|.
T Consensus 3 ~~~g~~i~~~~~~~~~~t~~~lA~~~gis~~tis~~~~g~~~~~~~~~~~l~~~l~ 58 (78)
T TIGR02607 3 AHPGEILREEFLEPLGLSIRALAKALGVSRSTLSRIVNGRRGITADMALRLAKALG 58 (78)
T ss_pred CCHHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHcCCCCCCHHHHHHHHHHcC
Confidence 34455677 89999999999999999999999988776 33455666666666554
No 52
>PF02796 HTH_7: Helix-turn-helix domain of resolvase; InterPro: IPR006120 Site-specific recombination plays an important role in DNA rearrangement in prokaryotic organisms. Two types of site-specific recombination are known to occur: Recombination between inverted repeats resulting in the reversal of a DNA segment. Recombination between repeat sequences on two DNA molecules resulting in their cointegration, or between repeats on one DNA molecule resulting in the excision of a DNA fragment. Site-specific recombination is characterised by a strand exchange mechanism that requires no DNA synthesis or high energy cofactor; the phosphodiester bond energy is conserved in a phospho-protein linkage during strand cleavage and re-ligation. Two unrelated families of recombinases are currently known []. The first, called the 'phage integrase' family, groups a number of bacterial phage and yeast plasmid enzymes. The second [], called the 'resolvase' family, groups enzymes which share the following structural characteristics: an N-terminal catalytic and dimerization domain that contains a conserved serine residue involved in the transient covalent attachment to DNA IPR006119 from INTERPRO, and a C-terminal helix-turn-helix DNA-binding domain. ; GO: 0000150 recombinase activity, 0003677 DNA binding, 0006310 DNA recombination; PDB: 1ZR2_A 2GM4_B 1RES_A 1ZR4_A 1RET_A 1GDT_B 2R0Q_C 1JKP_C 1IJW_C 1JJ6_C ....
Probab=64.85 E-value=7.2 Score=32.28 Aligned_cols=31 Identities=26% Similarity=0.429 Sum_probs=22.8
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
--+++|.+.| .++.+||.++|||..+|---|
T Consensus 12 ~~i~~l~~~G-~si~~IA~~~gvsr~TvyR~l 42 (45)
T PF02796_consen 12 EEIKELYAEG-MSIAEIAKQFGVSRSTVYRYL 42 (45)
T ss_dssp HHHHHHHHTT---HHHHHHHTTS-HHHHHHHH
T ss_pred HHHHHHHHCC-CCHHHHHHHHCcCHHHHHHHH
Confidence 4455688899 999999999999998886554
No 53
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=64.22 E-value=3.1 Score=31.66 Aligned_cols=28 Identities=29% Similarity=0.710 Sum_probs=12.3
Q ss_pred cCcccCCCCCCCCCEEEccccCccccccc
Q 001243 707 SCDICRRSETILNPILICSGCKVAVHLDC 735 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~IL~Cd~C~laVHq~C 735 (1116)
.|++|...-. ++....|..|+..+|..|
T Consensus 2 ~C~~C~~~~~-~~~~Y~C~~Cdf~lH~~C 29 (30)
T PF07649_consen 2 RCDACGKPID-GGWFYRCSECDFDLHEEC 29 (30)
T ss_dssp --TTTS-----S--EEE-TTT-----HHH
T ss_pred cCCcCCCcCC-CCceEECccCCCccChhc
Confidence 5999998533 247889999999999988
No 54
>PF13443 HTH_26: Cro/C1-type HTH DNA-binding domain; PDB: 3TYR_A 3TYS_A 3B7H_A.
Probab=62.68 E-value=8.4 Score=33.26 Aligned_cols=48 Identities=31% Similarity=0.349 Sum_probs=31.8
Q ss_pred HHHHhhhCCcchhhhhhhhcCChhhhhhccccc--cccchhhHHHHHHhh
Q 001243 225 LKKLIDRGKVNVKDIASDIGISPDLLKTTLADG--TFASDLQCKLVKWLS 272 (1116)
Q Consensus 225 lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~--~~~~~~~~k~~~wl~ 272 (1116)
|++|+++..++..++|..+|||..+|..-+... .+.-+.-.+|-+.|.
T Consensus 2 L~~~m~~~~it~~~La~~~gis~~tl~~~~~~~~~~~~~~~l~~ia~~l~ 51 (63)
T PF13443_consen 2 LKELMAERGITQKDLARKTGISRSTLSRILNGKPSNPSLDTLEKIAKALN 51 (63)
T ss_dssp HHHHHHHTT--HHHHHHHHT--HHHHHHHHTTT-----HHHHHHHHHHHT
T ss_pred HHHHHHHcCCCHHHHHHHHCcCHHHHHHHHhcccccccHHHHHHHHHHcC
Confidence 678888888899999999999999998887743 455555556665553
No 55
>PF01381 HTH_3: Helix-turn-helix; InterPro: IPR001387 This is large family of DNA binding helix-turn helix proteins that include a bacterial plasmid copy control protein, bacterial methylases, various bacteriophage transcription control proteins and a vegetative specific protein from Dictyostelium discoideum (Slime mould).; GO: 0043565 sequence-specific DNA binding; PDB: 2AXU_A 2AWI_D 2AXV_D 2AXZ_C 2AW6_A 3KXA_C 3BS3_A 2CRO_A 1ZUG_A 3CRO_R ....
Probab=59.16 E-value=13 Score=31.07 Aligned_cols=48 Identities=23% Similarity=0.264 Sum_probs=36.9
Q ss_pred HHHHhhhCCcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhh
Q 001243 225 LKKLIDRGKVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLS 272 (1116)
Q Consensus 225 lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~ 272 (1116)
||++.++-..|.+|+|..+|+|+.+|..-+. ......+.-.+|-+-|.
T Consensus 1 ik~~r~~~gls~~~la~~~gis~~~i~~~~~g~~~~~~~~~~~ia~~l~ 49 (55)
T PF01381_consen 1 IKELRKEKGLSQKELAEKLGISRSTISRIENGKRNPSLDTLKKIAKALG 49 (55)
T ss_dssp HHHHHHHTTS-HHHHHHHHTS-HHHHHHHHTTSSTSBHHHHHHHHHHHT
T ss_pred CHHHHHHcCCCHHHHHHHhCCCcchhHHHhcCCCCCCHHHHHHHHHHHC
Confidence 5778888999999999999999999998876 45566666677766554
No 56
>PF13404 HTH_AsnC-type: AsnC-type helix-turn-helix domain; PDB: 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2VC1_B 2QZ8_A 2W29_C 2IVM_B 2VBX_B ....
Probab=57.31 E-value=9.5 Score=31.51 Aligned_cols=32 Identities=22% Similarity=0.506 Sum_probs=25.2
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
-||+-|.+-|..+..+||.++|+|+.++..-+
T Consensus 7 ~Il~~Lq~d~r~s~~~la~~lglS~~~v~~Ri 38 (42)
T PF13404_consen 7 KILRLLQEDGRRSYAELAEELGLSESTVRRRI 38 (42)
T ss_dssp HHHHHHHH-TTS-HHHHHHHHTS-HHHHHHHH
T ss_pred HHHHHHHHcCCccHHHHHHHHCcCHHHHHHHH
Confidence 47788888999999999999999999987653
No 57
>PF13901 DUF4206: Domain of unknown function (DUF4206)
Probab=55.80 E-value=9.9 Score=41.30 Aligned_cols=44 Identities=25% Similarity=0.703 Sum_probs=34.5
Q ss_pred CCCcCcccCCCCC----CCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 704 HPRSCDICRRSET----ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 704 ~d~~CsVC~~~E~----~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
.+..|-+|.+.+- ..+..+.|..|+..+|+.|+.- --|.+|.-.
T Consensus 151 kGfiCe~C~~~~~IfPF~~~~~~~C~~C~~v~H~~C~~~-------~~CpkC~R~ 198 (202)
T PF13901_consen 151 KGFICEICNSDDIIFPFQIDTTVRCPKCKSVFHKSCFRK-------KSCPKCARR 198 (202)
T ss_pred CCCCCccCCCCCCCCCCCCCCeeeCCcCccccchhhcCC-------CCCCCcHhH
Confidence 3568999998763 3457899999999999999962 129999764
No 58
>cd04718 BAH_plant_2 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=53.74 E-value=7 Score=40.77 Aligned_cols=27 Identities=37% Similarity=0.808 Sum_probs=21.9
Q ss_pred ccccccccCc--cCCCCceeccccccccc
Q 001243 730 AVHLDCYRNA--KESTGPWYCELCEELLS 756 (1116)
Q Consensus 730 aVHq~CYGi~--~ipeg~WlCd~C~~~~~ 756 (1116)
.+|..|...+ .+|+|+|+|..|.....
T Consensus 1 g~H~~CL~Ppl~~~P~g~W~Cp~C~~~~~ 29 (148)
T cd04718 1 GFHLCCLRPPLKEVPEGDWICPFCEVEKS 29 (148)
T ss_pred CcccccCCCCCCCCCCCCcCCCCCcCCCC
Confidence 3799999754 58899999999998643
No 59
>PF13412 HTH_24: Winged helix-turn-helix DNA-binding; PDB: 1I1G_B 2IA0_B 3I4P_A 2GQQ_A 2L4A_A 2CFX_B 2DBB_B 2EFO_A 2EFQ_A 2PN6_A ....
Probab=53.14 E-value=11 Score=31.10 Aligned_cols=33 Identities=27% Similarity=0.429 Sum_probs=27.1
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
--||.-|.+.|.++.++||..+|+|..++...|
T Consensus 6 ~~Il~~l~~~~~~t~~ela~~~~is~~tv~~~l 38 (48)
T PF13412_consen 6 RKILNYLRENPRITQKELAEKLGISRSTVNRYL 38 (48)
T ss_dssp HHHHHHHHHCTTS-HHHHHHHHTS-HHHHHHHH
T ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHH
Confidence 457888999999999999999999999887664
No 60
>PF03107 C1_2: C1 domain; InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=52.85 E-value=11 Score=28.96 Aligned_cols=27 Identities=37% Similarity=0.898 Sum_probs=21.5
Q ss_pred cCcccCCCCCCCCC-EEEccccCccccccc
Q 001243 707 SCDICRRSETILNP-ILICSGCKVAVHLDC 735 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~-IL~Cd~C~laVHq~C 735 (1116)
.|+||.+.-. +. ...|..|...+|..|
T Consensus 2 ~C~~C~~~~~--~~~~Y~C~~c~f~lh~~C 29 (30)
T PF03107_consen 2 WCDVCRRKID--GFYFYHCSECCFTLHVRC 29 (30)
T ss_pred CCCCCCCCcC--CCEeEEeCCCCCeEcCcc
Confidence 4999987422 33 889999999999988
No 61
>KOG4236 consensus Serine/threonine protein kinase PKC mu/PKD and related proteins [Signal transduction mechanisms]
Probab=52.35 E-value=6.9 Score=48.02 Aligned_cols=34 Identities=29% Similarity=0.705 Sum_probs=26.5
Q ss_pred CCCCcCcccCCCC-CCCCCEEEccccCcccccccc
Q 001243 703 EHPRSCDICRRSE-TILNPILICSGCKVAVHLDCY 736 (1116)
Q Consensus 703 e~d~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CY 736 (1116)
.-..+|+-|+..= +.-.+-+.|.+|++.+|+.|-
T Consensus 154 ~~PtFCD~CGEmL~GLvrQGlKC~gCglNyHKRCa 188 (888)
T KOG4236|consen 154 KAPTFCDFCGEMLFGLVRQGLKCEGCGLNYHKRCA 188 (888)
T ss_pred cCchHHHHHHHHHHHHHHccccccCCCCcHhhhhh
Confidence 3457999998642 123567899999999999996
No 62
>cd00093 HTH_XRE Helix-turn-helix XRE-family like proteins. Prokaryotic DNA binding proteins belonging to the xenobiotic response element family of transcriptional regulators.
Probab=51.17 E-value=23 Score=27.77 Aligned_cols=48 Identities=21% Similarity=0.219 Sum_probs=36.3
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhccccc-cccchhhHHHHHHh
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTTLADG-TFASDLQCKLVKWL 271 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~-~~~~~~~~k~~~wl 271 (1116)
.++..+++-+++..++|..+|+++.+|..-+... .+.++...+|...|
T Consensus 3 ~l~~~~~~~~~s~~~~a~~~~~~~~~v~~~~~g~~~~~~~~~~~i~~~~ 51 (58)
T cd00093 3 RLKELRKEKGLTQEELAEKLGVSRSTISRIENGKRNPSLETLEKLAKAL 51 (58)
T ss_pred HHHHHHHHcCCCHHHHHHHHCCCHHHHHHHHcCCCCCCHHHHHHHHHHh
Confidence 4566677788999999999999999998776533 55566666666655
No 63
>PF01978 TrmB: Sugar-specific transcriptional regulator TrmB; InterPro: IPR002831 TrmB, is a protein of 38,800 apparent molecular weight, that is involved in the maltose-specific regulation of the trehalose/maltose ABC transport operon in Thermococcus litoralis. TrmB has been shown to be a maltose-specific repressor, and this inhibition is counteracted by maltose and trehalose. TrmB binds maltose and trehalose half-maximally at 20 uM and 0.5 mM sugar concentration, respectively []. Other members of this family are annotated as either transcriptional regulators or hypothetical proteins. ; PDB: 2D1H_A 3QPH_A 1SFX_A.
Probab=49.91 E-value=12 Score=33.16 Aligned_cols=33 Identities=24% Similarity=0.452 Sum_probs=29.9
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
+-|+.-|+..|.+++.|||.++||+..++...|
T Consensus 11 ~~vy~~Ll~~~~~t~~eIa~~l~i~~~~v~~~L 43 (68)
T PF01978_consen 11 AKVYLALLKNGPATAEEIAEELGISRSTVYRAL 43 (68)
T ss_dssp HHHHHHHHHHCHEEHHHHHHHHTSSHHHHHHHH
T ss_pred HHHHHHHHHcCCCCHHHHHHHHCcCHHHHHHHH
Confidence 567888999999999999999999999988775
No 64
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=49.35 E-value=51 Score=30.29 Aligned_cols=60 Identities=32% Similarity=0.233 Sum_probs=46.8
Q ss_pred chhhHHHHHHHHH--hhhhhhhhhhhHHHHHHHHHHhHHHHHhhhhcchhHHHHHHHHHHHHHHHHccC
Q 001243 545 EVEGEIIYFQHRL--LGNAFSRKRLADNLVCKAVKTLNQEIDVARGRRWDAVLVNQYLCELREAKKQGR 611 (1116)
Q Consensus 545 e~e~E~~~~q~~l--l~~~~~~r~~~~~lv~~V~k~l~~E~~~~~~r~~d~~~~nq~L~~vreakkq~~ 611 (1116)
.+|.|+..||.+| +..-.+ .-....+.|..|-+.+-.+-.+...-+.-|++..++.++.-
T Consensus 2 ~Lea~~~~Lr~rLd~~~rk~~-------~~~~~~k~L~~ERd~~~~~l~~a~~e~~~Lk~E~e~L~~el 63 (69)
T PF14197_consen 2 KLEAEIATLRNRLDSLTRKNS-------VHEIENKRLRRERDSAERQLGDAYEENNKLKEENEALRKEL 63 (69)
T ss_pred hHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4789999999998 433332 12245588899999999998888899999999999987763
No 65
>PF10668 Phage_terminase: Phage terminase small subunit; InterPro: IPR018925 This entry describes the terminase small subunit from Enterococcus phage phiFL1A, related proteins in other bacteriophage, and prophage regions of bacterial genomes. Packaging of double-stranded viral DNA concatemers requires interaction of the prohead with virus DNA. This process is mediated by a phage-encoded DNA recognition and terminase protein. The terminase enzymes described so far, which are hetero-oligomers composed of a small and a large subunit, do not have a significant level of sequence homology. The small terminase subunit is thought to form a nucleoprotein structure that helps to position the terminase large subunit at the packaging initiation site [].
Probab=48.91 E-value=10 Score=34.00 Aligned_cols=22 Identities=36% Similarity=0.755 Sum_probs=19.8
Q ss_pred hCCcchhhhhhhhcCChhhhhh
Q 001243 231 RGKVNVKDIASDIGISPDLLKT 252 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a 252 (1116)
.|++..+|||.++|+|+.+|..
T Consensus 20 ~g~i~lkdIA~~Lgvs~~tIr~ 41 (60)
T PF10668_consen 20 NGKIKLKDIAEKLGVSESTIRK 41 (60)
T ss_pred CCCccHHHHHHHHCCCHHHHHH
Confidence 6899999999999999988764
No 66
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=48.87 E-value=13 Score=44.77 Aligned_cols=48 Identities=25% Similarity=0.504 Sum_probs=33.7
Q ss_pred cCcccCCCCCCCC--CEEEccccCcccccccc--------cCcc-----CCCCceeccccccc
Q 001243 707 SCDICRRSETILN--PILICSGCKVAVHLDCY--------RNAK-----ESTGPWYCELCEEL 754 (1116)
Q Consensus 707 ~CsVC~~~E~~~N--~IL~Cd~C~laVHq~CY--------Gi~~-----ipeg~WlCd~C~~~ 754 (1116)
.|.||...+.+.| .-|.|+-|+...|..|- |+.. ..+..|+|..|-..
T Consensus 130 ~C~iC~kfD~~~n~~~Wi~Cd~CgH~cH~dCALr~~~i~~G~s~~g~~g~~d~~f~C~~C~~~ 192 (446)
T PF07227_consen 130 MCCICSKFDDNKNTCSWIGCDVCGHWCHLDCALRHELIGTGPSVKGSIGTLDMQFHCRACGKT 192 (446)
T ss_pred CccccCCcccCCCCeeEEeccCCCceehhhhhcccccccCCccCCCCCccCceEEEccCCCCh
Confidence 5778888776444 47899999999999995 1111 12456888888653
No 67
>PF13542 HTH_Tnp_ISL3: Helix-turn-helix domain of transposase family ISL3
Probab=47.48 E-value=17 Score=30.32 Aligned_cols=31 Identities=29% Similarity=0.460 Sum_probs=25.8
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
..|++.|.+. .++++||.+.|+|+++|..-+
T Consensus 18 ~~i~~~~~~~--~s~~~vA~~~~vs~~TV~ri~ 48 (52)
T PF13542_consen 18 QYILKLLRES--RSFKDVARELGVSWSTVRRIF 48 (52)
T ss_pred HHHHHHHhhc--CCHHHHHHHHCCCHHHHHHHH
Confidence 5677777755 899999999999999997654
No 68
>TIGR03070 couple_hipB transcriptional regulator, y4mF family. Members of this family belong to a clade of helix-turn-helix DNA-binding proteins, among the larger family pfam01381 (HTH_3; Helix-turn-helix). Members are similar in sequence to the HipB protein of E. coli. Genes for members of the seed alignment for this protein family were found to be closely linked to genes encoding proteins related to HipA. The HibBA operon appears to have some features in common with toxin-antitoxin post-segregational killing systems.
Probab=47.18 E-value=32 Score=28.48 Aligned_cols=36 Identities=8% Similarity=0.169 Sum_probs=31.7
Q ss_pred chHHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 220 NFTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 220 ~~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
.|+..|+++.++-..+..++|..+|+|+.+|..-..
T Consensus 2 ~~~~~l~~~r~~~gltq~~lA~~~gvs~~~vs~~e~ 37 (58)
T TIGR03070 2 QIGMLVRARRKALGLTQADLADLAGVGLRFIRDVEN 37 (58)
T ss_pred hHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHC
Confidence 367789999999999999999999999998887764
No 69
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=46.55 E-value=8.7 Score=47.95 Aligned_cols=49 Identities=27% Similarity=0.572 Sum_probs=35.0
Q ss_pred CcCcccCCCC-CCCCCEEEccccCcccccccccCcc---CCCCceeccccccc
Q 001243 706 RSCDICRRSE-TILNPILICSGCKVAVHLDCYRNAK---ESTGPWYCELCEEL 754 (1116)
Q Consensus 706 ~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CYGi~~---ipeg~WlCd~C~~~ 754 (1116)
..|-+|.... ...+.++.|..|+...|.+|..+.. .-.+-|-|..|..-
T Consensus 19 ~mc~l~~s~G~~~ag~m~ac~~c~~~yH~~cvt~~~~~~~l~~gWrC~~crvC 71 (694)
T KOG4443|consen 19 LMCPLCGSSGKGRAGRLLACSDCGQKYHPYCVTSWAQHAVLSGGWRCPSCRVC 71 (694)
T ss_pred hhhhhhccccccccCcchhhhhhcccCCcchhhHHHhHHHhcCCcccCCceee
Confidence 4567776543 2468899999999999999987431 12344999988753
No 70
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=45.41 E-value=13 Score=41.75 Aligned_cols=32 Identities=31% Similarity=0.908 Sum_probs=26.2
Q ss_pred CCcccccccC-cCCceeecCCcCcc-cccchhhh
Q 001243 826 GIDVCCICRH-KHGICIKCNYGNCQ-TTFHPTCA 857 (1116)
Q Consensus 826 ~r~~C~iC~~-k~GAcIqCs~~~C~-~sFHvtCA 857 (1116)
+...-++|++ ..|-+|.|...+|. .|||..|.
T Consensus 219 ~e~lYCfCqqvSyGqMVaCDn~nCkrEWFH~~CV 252 (271)
T COG5034 219 GEELYCFCQQVSYGQMVACDNANCKREWFHLECV 252 (271)
T ss_pred CceeEEEecccccccceecCCCCCchhheecccc
Confidence 3445557776 69999999999998 69999996
No 71
>KOG1701 consensus Focal adhesion adaptor protein Paxillin and related LIM proteins [Signal transduction mechanisms]
Probab=43.68 E-value=9.6 Score=45.49 Aligned_cols=151 Identities=20% Similarity=0.435 Sum_probs=81.1
Q ss_pred cCcccCCCCCCCCCEEEccccCccccccccc----------Cc-cCCCCceecccccccccCCCCCCCCCCccCCCcccc
Q 001243 707 SCDICRRSETILNPILICSGCKVAVHLDCYR----------NA-KESTGPWYCELCEELLSSRSSGAPSVNFWEKPYFVA 775 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYG----------i~-~ipeg~WlCd~C~~~~~~~~s~~~~vn~~~~p~~~~ 775 (1116)
.|.-|...-. .+-.-|.-=+..+|..|+- .. ..-++.-+|+.|-... .-
T Consensus 276 iC~~C~K~V~--g~~~ac~Am~~~fHv~CFtC~~C~r~L~Gq~FY~v~~k~~CE~cyq~t------------------le 335 (468)
T KOG1701|consen 276 ICAFCHKTVS--GQGLAVEAMDQLFHVQCFTCRTCRRQLAGQSFYQVDGKPYCEGCYQDT------------------LE 335 (468)
T ss_pred hhhhcCCccc--CcchHHHHhhhhhcccceehHhhhhhhccccccccCCcccchHHHHHH------------------HH
Confidence 6888876321 2222333334455666652 11 1235667777775431 44
Q ss_pred ccccCCCCC-CcceeccCCc------hhhhccccccccceeecCccccccCccccCC-CCcccccccCc----C----Cc
Q 001243 776 ECSLCGGTT-GAFRKSANGQ------WVHAFCAEWVFESTFRRGQVNPVAGMEAFPK-GIDVCCICRHK----H----GI 839 (1116)
Q Consensus 776 ~C~LCp~~g-GALK~T~~g~------WVHV~CALW~PEv~f~n~~lepVegie~I~k-~r~~C~iC~~k----~----GA 839 (1116)
+|..|...- ..+.+ .-|+ |+=|+|+--+..+-|.-+.-+.|.=+...-+ .--+|.+|++. . -.
T Consensus 336 kC~~Cg~~I~d~iLr-A~GkayHp~CF~Cv~C~r~ldgipFtvd~~n~v~Cv~dfh~kfAPrCs~C~~PI~P~~G~~etv 414 (468)
T KOG1701|consen 336 KCNKCGEPIMDRILR-ALGKAYHPGCFTCVVCARCLDGIPFTVDSQNNVYCVPDFHKKFAPRCSVCGNPILPRDGKDETV 414 (468)
T ss_pred HHhhhhhHHHHHHHH-hcccccCCCceEEEEeccccCCccccccCCCceeeehhhhhhcCcchhhccCCccCCCCCcceE
Confidence 788886421 11111 1233 4445555566666665443344444444333 35789999984 1 22
Q ss_pred eeecCCcCcccccchhhhhhcCceEEEe-eCCCc----eeeeecCCCC
Q 001243 840 CIKCNYGNCQTTFHPTCARSAGFYLNVK-STGGN----FQHKAYCEKH 882 (1116)
Q Consensus 840 cIqCs~~~C~~sFHvtCA~~aG~~~~~k-~~~g~----~~~~iyC~kH 882 (1116)
-|-|. .+.||+.|=+-..|-|.+. ..+++ +.-.++|+.=
T Consensus 415 Rvvam----dr~fHv~CY~CEDCg~~LS~e~e~qgCyPld~HllCk~C 458 (468)
T KOG1701|consen 415 RVVAM----DRDFHVNCYKCEDCGLLLSSEEEGQGCYPLDGHLLCKTC 458 (468)
T ss_pred EEEEc----cccccccceehhhcCccccccCCCCcceeccCceeechh
Confidence 23344 4789999998887777664 22222 3446788753
No 72
>PRK09492 treR trehalose repressor; Provisional
Probab=43.12 E-value=21 Score=39.69 Aligned_cols=51 Identities=16% Similarity=0.291 Sum_probs=44.7
Q ss_pred hCCcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhcccccccc
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLL 281 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~ 281 (1116)
+.|++++|||.+.|+|.-+|--+|+ ....++..+.||.+-.+.--|.+...
T Consensus 2 ~~~~ti~dIA~~agVS~~TVSrvLn~~~~vs~~tr~rV~~~a~elgY~pn~~ 53 (315)
T PRK09492 2 QNKLTIKDIARLSGVGKSTVSRVLNNESGVSEETRERVEAVINQHGFSPSKS 53 (315)
T ss_pred CCCCcHHHHHHHhCCCHHHHhHHhCCCCCCCHHHHHHHHHHHHHHCCCcCHH
Confidence 3589999999999999999999998 46789999999999998888877543
No 73
>cd00569 HTH_Hin_like Helix-turn-helix domain of Hin and related proteins, a family of DNA-binding domains unique to bacteria and represented by the Hin protein of Salmonella. The basic HTH domain is a simple fold comprised of three core helices that form a right-handed helical bundle. The principal DNA-protein interface is formed by the third helix, the recognition helix, inserting itself into the major groove of the DNA. A diverse array of HTH domains participate in a variety of functions that depend on their DNA-binding properties. HTH_Hin represents one of the simplest versions of the HTH domains; the characterization of homologous relationships between various sequence-diverse HTH domain families remains difficult. The Hin recombinase induces the site-specific inversion of a chromosomal DNA segment containing a promoter, which controls the alternate expression of two genes by reversibly switching orientation. The Hin recombinase consists of a single polypeptide chain containing a D
Probab=43.06 E-value=34 Score=24.31 Aligned_cols=30 Identities=27% Similarity=0.434 Sum_probs=23.1
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhh
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKT 252 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a 252 (1116)
-..+.++++.|. ++++||.++|+|.-++-.
T Consensus 11 ~~~i~~~~~~~~-s~~~ia~~~~is~~tv~~ 40 (42)
T cd00569 11 IEEARRLLAAGE-SVAEIARRLGVSRSTLYR 40 (42)
T ss_pred HHHHHHHHHcCC-CHHHHHHHHCCCHHHHHH
Confidence 345556677776 999999999999887643
No 74
>PHA02591 hypothetical protein; Provisional
Probab=42.93 E-value=25 Score=33.21 Aligned_cols=37 Identities=24% Similarity=0.371 Sum_probs=31.3
Q ss_pred CCcchHHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 217 DALNFTLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 217 ~~~~~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
+-.|+--+-++|.++| .++.+||..+|++-+++..-|
T Consensus 44 ~~dd~~~vA~eL~eqG-lSqeqIA~~LGVsqetVrKYL 80 (83)
T PHA02591 44 SEDDLISVTHELARKG-FTVEKIASLLGVSVRKVRRYL 80 (83)
T ss_pred ccchHHHHHHHHHHcC-CCHHHHHHHhCCCHHHHHHHH
Confidence 3457788889999999 699999999999999887654
No 75
>smart00550 Zalpha Z-DNA-binding domain in adenosine deaminases. Helix-turn-helix-containing domain. Also known as Zab.
Probab=42.80 E-value=19 Score=32.39 Aligned_cols=33 Identities=21% Similarity=0.366 Sum_probs=29.4
Q ss_pred HHHHHHHhhhCC--cchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGK--VNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gk--v~~~~~a~~~g~s~~~~~a~l 254 (1116)
.-||.-|-++|. +++++||.++||+..++...|
T Consensus 9 ~~IL~~L~~~g~~~~ta~eLa~~lgl~~~~v~r~L 43 (68)
T smart00550 9 EKILEFLENSGDETSTALQLAKNLGLPKKEVNRVL 43 (68)
T ss_pred HHHHHHHHHCCCCCcCHHHHHHHHCCCHHHHHHHH
Confidence 568888888988 999999999999999888776
No 76
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=42.34 E-value=15 Score=27.97 Aligned_cols=27 Identities=26% Similarity=0.717 Sum_probs=12.2
Q ss_pred ccccccCcCC--ceeecCCcCcccccchhhh
Q 001243 829 VCCICRHKHG--ICIKCNYGNCQTTFHPTCA 857 (1116)
Q Consensus 829 ~C~iC~~k~G--AcIqCs~~~C~~sFHvtCA 857 (1116)
.|..|+...+ ....|. .|...+|..||
T Consensus 2 ~C~~C~~~~~~~~~Y~C~--~Cdf~lH~~Ca 30 (30)
T PF07649_consen 2 RCDACGKPIDGGWFYRCS--ECDFDLHEECA 30 (30)
T ss_dssp --TTTS----S--EEE-T--TT-----HHHH
T ss_pred cCCcCCCcCCCCceEECc--cCCCccChhcC
Confidence 5889998754 466798 99999999997
No 77
>PRK10014 DNA-binding transcriptional repressor MalI; Provisional
Probab=42.32 E-value=22 Score=40.07 Aligned_cols=52 Identities=13% Similarity=0.261 Sum_probs=45.2
Q ss_pred hCCcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhccccccccc
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLLK 282 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~~ 282 (1116)
..||+++|||.+.|+|.-+|-.+|+ ....++..+.||.+-.+..=|.+....
T Consensus 4 ~~~~Ti~dIA~~agVS~~TVSr~Ln~~~~vs~~tr~~V~~~a~elgY~p~~~a 56 (342)
T PRK10014 4 AKKITIHDVALAAGVSVSTVSLVLSGKGRISTATGERVNQAIEELGFVRNRQA 56 (342)
T ss_pred CCCCcHHHHHHHhCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHhCCCcCHHH
Confidence 3579999999999999999999999 566899999999999999888775443
No 78
>PF13936 HTH_38: Helix-turn-helix domain; PDB: 2W48_A.
Probab=41.94 E-value=16 Score=30.21 Aligned_cols=30 Identities=23% Similarity=0.552 Sum_probs=20.6
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
.+..|.++| .|+.+||..+|.|+.+|-..|
T Consensus 12 ~I~~l~~~G-~s~~~IA~~lg~s~sTV~rel 41 (44)
T PF13936_consen 12 QIEALLEQG-MSIREIAKRLGRSRSTVSREL 41 (44)
T ss_dssp HHHHHHCS----HHHHHHHTT--HHHHHHHH
T ss_pred HHHHHHHcC-CCHHHHHHHHCcCcHHHHHHH
Confidence 466788888 899999999999999986553
No 79
>PRK14987 gluconate operon transcriptional regulator; Provisional
Probab=40.87 E-value=22 Score=39.92 Aligned_cols=52 Identities=15% Similarity=0.275 Sum_probs=45.7
Q ss_pred hCCcchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHhhhccccccccc
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWLSNHAYLGGLLK 282 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl~~~~~~~~~~~ 282 (1116)
+++|+.+|||...|+|.-+|--+|+. ...++..+.||.+-.+..=|.+....
T Consensus 3 ~~~~ti~dIA~~agVS~~TVSrvLn~~~~vs~~tr~rV~~~a~elgY~pn~~a 55 (331)
T PRK14987 3 KKRPVLQDVADRVGVTKMTVSRFLRNPEQVSVALRGKIAAALDELGYIPNRAP 55 (331)
T ss_pred CCCCcHHHHHHHhCCCHHHhhhhhCCCCCCCHHHHHHHHHHHHHhCCCccHHH
Confidence 68899999999999999999999984 45899999999999999888765443
No 80
>smart00420 HTH_DEOR helix_turn_helix, Deoxyribose operon repressor.
Probab=40.45 E-value=24 Score=28.63 Aligned_cols=32 Identities=34% Similarity=0.554 Sum_probs=28.0
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
.+++.|.+.|.+++.++|..+|+|+.++...|
T Consensus 4 ~il~~l~~~~~~s~~~l~~~l~~s~~tv~~~l 35 (53)
T smart00420 4 QILELLAQQGKVSVEELAELLGVSEMTIRRDL 35 (53)
T ss_pred HHHHHHHHcCCcCHHHHHHHHCCCHHHHHHHH
Confidence 47778888889999999999999999987765
No 81
>KOG3799 consensus Rab3 effector RIM1 and related proteins, contain Rab3a binding domain [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.98 E-value=14 Score=37.86 Aligned_cols=50 Identities=22% Similarity=0.489 Sum_probs=35.3
Q ss_pred CCcCcccCCCCCCCCCEEEccccCcccccccccCccCC--CCceeccccccc
Q 001243 705 PRSCDICRRSETILNPILICSGCKVAVHLDCYRNAKES--TGPWYCELCEEL 754 (1116)
Q Consensus 705 d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~~ip--eg~WlCd~C~~~ 754 (1116)
+..|-||......++---.|.-|++.+-..|-|-.... ...|.|.+|.-.
T Consensus 65 datC~IC~KTKFADG~GH~C~YCq~r~CARCGGrv~lrsNKv~wvcnlc~k~ 116 (169)
T KOG3799|consen 65 DATCGICHKTKFADGCGHNCSYCQTRFCARCGGRVSLRSNKVMWVCNLCRKQ 116 (169)
T ss_pred CcchhhhhhcccccccCcccchhhhhHHHhcCCeeeeccCceEEeccCCcHH
Confidence 56799999865444444578888888888887755444 335999999653
No 82
>KOG4362 consensus Transcriptional regulator BRCA1 [Replication, recombination and repair; Transcription]
Probab=38.61 E-value=9.8 Score=48.04 Aligned_cols=67 Identities=24% Similarity=0.350 Sum_probs=55.5
Q ss_pred CCCCCcHhhHhhcccCceeeccCCccccccccccchhhcccccccccccccCceeeCCCCCCCcccchhhhhh
Q 001243 16 GGSMEFAHLFCSLLMPEVYIEDTMKVEPLMNVGGIKETRMKLVCNICRVKCGACVRCSHGTCRTSFHPICARE 88 (1116)
Q Consensus 16 ~G~~~WvHv~CALw~PEv~f~~~~~~epV~~V~~I~~~R~~LkC~iC~~k~GAcIqCs~~~C~~~FHvtCA~~ 88 (1116)
+++..-+|+.|.+|.++++-..... +....+.+.+...|.+|+.+ |+=.+|-.+.|...||++||+.
T Consensus 328 ~~~~~~~~v~~~~d~~~v~d~cs~~-----~~~~~l~r~~~~~~~~c~l~-~~h~~~~~~s~~~~~~~~~a~~ 394 (684)
T KOG4362|consen 328 NGNVRKPSVAVSDDDEQVLDECSTS-----GKECELGRSFPITCEDCKLK-GAHLGCLEKSCGSSEHVKCARG 394 (684)
T ss_pred CccccccccccccchHHHHHhcccc-----ccccccccCCcceeeecccc-chhhhhhhcccccceeeeeccc
Confidence 5566899999999999888755433 23457778888999999986 9999999999999999999954
No 83
>PF13518 HTH_28: Helix-turn-helix domain
Probab=38.22 E-value=22 Score=29.25 Aligned_cols=28 Identities=29% Similarity=0.508 Sum_probs=21.2
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhc
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTT 253 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~ 253 (1116)
+++... +|+ ++.++|.++|||+.+|..-
T Consensus 5 iv~~~~-~g~-s~~~~a~~~gis~~tv~~w 32 (52)
T PF13518_consen 5 IVELYL-EGE-SVREIAREFGISRSTVYRW 32 (52)
T ss_pred HHHHHH-cCC-CHHHHHHHHCCCHhHHHHH
Confidence 344444 787 9999999999999877543
No 84
>smart00354 HTH_LACI helix_turn _helix lactose operon repressor.
Probab=37.71 E-value=33 Score=30.88 Aligned_cols=47 Identities=19% Similarity=0.338 Sum_probs=40.4
Q ss_pred cchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHhhhccccccc
Q 001243 234 VNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWLSNHAYLGGL 280 (1116)
Q Consensus 234 v~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl~~~~~~~~~ 280 (1116)
++..|||...|+|..+|--.|+. ...+|....+|.+-++..-|.+..
T Consensus 1 ~t~~~iA~~~gvS~~TVSr~ln~~~~v~~~t~~~i~~~~~~~gy~~~~ 48 (70)
T smart00354 1 ATIKDVARLAGVSKATVSRVLNGNGRVSEETREKVLAAMEELGYIPNR 48 (70)
T ss_pred CCHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHhCCCCCH
Confidence 46789999999999999999985 456789999999999999996553
No 85
>PF12324 HTH_15: Helix-turn-helix domain of alkylmercury lyase; InterPro: IPR024259 Alkylmercury lyase (EC:4.99.1.2) cleaves the carbon-mercury bond of organomercurials such as phenylmercuric acetate. This entry represents the N-terminal helix-turn-helix domain.; PDB: 3FN8_B 3F2G_B 3F0P_A 3F2F_B 3F2H_A 3F0O_B 1S6L_A.
Probab=36.97 E-value=25 Score=33.07 Aligned_cols=34 Identities=24% Similarity=0.389 Sum_probs=26.6
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
-++||-|.+=+-|++.++|.++|.+-|.|+++|+
T Consensus 27 r~LLr~LA~G~PVt~~~LA~a~g~~~e~v~~~L~ 60 (77)
T PF12324_consen 27 RPLLRLLAKGQPVTVEQLAAALGWPVEEVRAALA 60 (77)
T ss_dssp HHHHHHHTTTS-B-HHHHHHHHT--HHHHHHHHH
T ss_pred HHHHHHHHcCCCcCHHHHHHHHCCCHHHHHHHHH
Confidence 5677888877779999999999999999999986
No 86
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=35.80 E-value=21 Score=31.24 Aligned_cols=22 Identities=23% Similarity=0.604 Sum_probs=19.4
Q ss_pred CcchhhhhhhhcCChhhhhhcc
Q 001243 233 KVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l 254 (1116)
+++++|||.++|||+-++.--|
T Consensus 23 ~~tl~elA~~lgis~st~~~~L 44 (53)
T PF04967_consen 23 RITLEELAEELGISKSTVSEHL 44 (53)
T ss_pred cCCHHHHHHHhCCCHHHHHHHH
Confidence 7999999999999998887655
No 87
>PF03107 C1_2: C1 domain; InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=35.15 E-value=26 Score=26.88 Aligned_cols=27 Identities=41% Similarity=0.880 Sum_probs=21.6
Q ss_pred cccccccc-cCc-eeeCCCCCCCcccchhhh
Q 001243 58 VCNICRVK-CGA-CVRCSHGTCRTSFHPICA 86 (1116)
Q Consensus 58 kC~iC~~k-~GA-cIqCs~~~C~~~FHvtCA 86 (1116)
.|.+|+++ .|- -..|. .|.-.+|+.||
T Consensus 2 ~C~~C~~~~~~~~~Y~C~--~c~f~lh~~Ca 30 (30)
T PF03107_consen 2 WCDVCRRKIDGFYFYHCS--ECCFTLHVRCA 30 (30)
T ss_pred CCCCCCCCcCCCEeEEeC--CCCCeEcCccC
Confidence 58999876 555 67895 78899999997
No 88
>PRK10339 DNA-binding transcriptional repressor EbgR; Provisional
Probab=35.00 E-value=32 Score=38.62 Aligned_cols=49 Identities=18% Similarity=0.284 Sum_probs=43.9
Q ss_pred CcchhhhhhhhcCChhhhhhcccccc---ccchhhHHHHHHhhhcccccccc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLADGT---FASDLQCKLVKWLSNHAYLGGLL 281 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~~~~---~~~~~~~k~~~wl~~~~~~~~~~ 281 (1116)
+++++|||...|+|.-++--+|+... .++..+.||.+-.+..-|.+...
T Consensus 1 ~~ti~dIA~~agVS~~TVSrvln~~~~~~vs~~tr~rV~~~a~~lgY~pn~~ 52 (327)
T PRK10339 1 MATLKDIAIEAGVSLATVSRVLNDDPTLNVKEETKHRILEIAEKLEYKTSSA 52 (327)
T ss_pred CCCHHHHHHHhCCCHHhhhhhhcCCCCCCcCHHHHHHHHHHHHHhCCCCchh
Confidence 47999999999999999999999553 88999999999999999987753
No 89
>PHA01976 helix-turn-helix protein
Probab=34.96 E-value=66 Score=28.04 Aligned_cols=52 Identities=17% Similarity=0.164 Sum_probs=37.0
Q ss_pred chHHHHHHHhhhCCcchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHh
Q 001243 220 NFTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWL 271 (1116)
Q Consensus 220 ~~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl 271 (1116)
+|+.-||+|-++...+.+++|..+|+|+.+|-.-... .....+.-.||.+.|
T Consensus 2 ~~~~rl~~~R~~~glt~~~lA~~~gvs~~~v~~~e~g~~~p~~~~l~~ia~~l 54 (67)
T PHA01976 2 SFAIQLIKARNARAWSAPELSRRAGVRHSLIYDFEADKRLPNLKTLLRLADAL 54 (67)
T ss_pred cHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHcCCCCCCHHHHHHHHHHH
Confidence 4677789999999999999999999999888775432 222333334555444
No 90
>PF12844 HTH_19: Helix-turn-helix domain; PDB: 3LIS_B 3LFP_A 2XIU_B 2GZU_B 2XJ3_A 1UTX_A 2XI8_B 3F6W_C 3EUS_B.
Probab=34.75 E-value=48 Score=28.61 Aligned_cols=48 Identities=23% Similarity=0.243 Sum_probs=31.5
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHh
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWL 271 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl 271 (1116)
-||+|.+.-..+.+++|..+|+++-.|..-.. ...+.++.-.+|.+-|
T Consensus 3 ~lk~~r~~~~lt~~~~a~~~~i~~~~i~~~e~g~~~~~~~~l~~i~~~~ 51 (64)
T PF12844_consen 3 RLKELREEKGLTQKDLAEKLGISRSTISKIENGKRKPSVSTLKKIAEAL 51 (64)
T ss_dssp HHHHHHHHCT--HHHHHHHHTS-HHHHHHHHTTSS--BHHHHHHHHHHH
T ss_pred HHHHHHHHcCCCHHHHHHHHCcCHHHHHHHHCCCcCCCHHHHHHHHHHh
Confidence 47889999999999999999999888776655 3344445555554444
No 91
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=34.54 E-value=30 Score=38.66 Aligned_cols=34 Identities=26% Similarity=0.395 Sum_probs=31.0
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
..|+..|-..|+|+|+|+|..+|+|++++..=|.
T Consensus 10 ~~I~~~l~~~~~v~v~eLa~~~~VS~~TIRRDL~ 43 (252)
T PRK10681 10 GQLLQALKRSDKLHLKDAAALLGVSEMTIRRDLN 43 (252)
T ss_pred HHHHHHHHHcCCCcHHHHHHHhCCCHHHHHHHHH
Confidence 5688888899999999999999999999998874
No 92
>PF08279 HTH_11: HTH domain; InterPro: IPR013196 Winged helix DNA-binding proteins share a related winged helix-turn-helix DNA-binding motif, where the "wings", or loops, are small beta-sheets. The winged helix motif consists of two wings (W1, W2), three alpha helices (H1, H2, H3) and three beta-sheets (S1, S2, S3) arranged in the order H1-S1-H2-H3-S2-W1-S3-W2 []. The DNA-recognition helix makes sequence-specific DNA contacts with the major groove of DNA, while the wings make different DNA contacts, often with the minor groove or the backbone of DNA. Several winged-helix proteins display an exposed patch of hydrophobic residues thought to mediate protein-protein interactions. This entry represents a subset of the winged helix domain superfamily which is predominantly found in bacterial proteins, though there are also some archaeal and eukaryotic examples. This domain is commonly found in the biotin (vitamin H) repressor protein BirA which regulates transcription of the biotin operon []. It is also found in other proteins including regulators of amino acid biosynthsis such as LysM [], and regulators of carbohydrate metabolisms such as LicR and FrvR [, ].; PDB: 1HXD_B 2EWN_B 1BIA_A 1BIB_A 1J5Y_A 3V7S_A 3V7C_A 3RKW_A 3RIR_A 3RKX_A ....
Probab=34.45 E-value=30 Score=29.18 Aligned_cols=32 Identities=25% Similarity=0.554 Sum_probs=25.3
Q ss_pred HHHHHHhhhC-CcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRG-KVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~g-kv~~~~~a~~~g~s~~~~~a~l 254 (1116)
.||+-|...+ .|+.+++|.++|+|.-++.-.|
T Consensus 4 ~il~~L~~~~~~it~~eLa~~l~vS~rTi~~~i 36 (55)
T PF08279_consen 4 QILKLLLESKEPITAKELAEELGVSRRTIRRDI 36 (55)
T ss_dssp HHHHHHHHTTTSBEHHHHHHHCTS-HHHHHHHH
T ss_pred HHHHHHHHcCCCcCHHHHHHHhCCCHHHHHHHH
Confidence 4677775554 4999999999999999998876
No 93
>COG5194 APC11 Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning]
Probab=34.08 E-value=14 Score=34.84 Aligned_cols=32 Identities=41% Similarity=0.997 Sum_probs=26.4
Q ss_pred ccccccccc-cCceeeCCC------------CCCCcccchhhhhh
Q 001243 57 LVCNICRVK-CGACVRCSH------------GTCRTSFHPICARE 88 (1116)
Q Consensus 57 LkC~iC~~k-~GAcIqCs~------------~~C~~~FHvtCA~~ 88 (1116)
-.|.||+.. .|.|++|.. |.|.-+||.-|-.+
T Consensus 21 d~CaICRnhim~~C~eCq~~~~~~~eC~v~wG~CnHaFH~HCI~r 65 (88)
T COG5194 21 DVCAICRNHIMGTCPECQFGMTPGDECPVVWGVCNHAFHDHCIYR 65 (88)
T ss_pred chhhhhhccccCcCcccccCCCCCCcceEEEEecchHHHHHHHHH
Confidence 479999865 799999987 46889999999843
No 94
>COG5194 APC11 Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning]
Probab=33.79 E-value=16 Score=34.66 Aligned_cols=33 Identities=42% Similarity=1.092 Sum_probs=26.6
Q ss_pred cccccccCc-CCceeecCC------------cCcccccchhhhhhc
Q 001243 828 DVCCICRHK-HGICIKCNY------------GNCQTTFHPTCARSA 860 (1116)
Q Consensus 828 ~~C~iC~~k-~GAcIqCs~------------~~C~~sFHvtCA~~a 860 (1116)
..|.||+.. .|.|++|.. +.|..+||..|..+-
T Consensus 21 d~CaICRnhim~~C~eCq~~~~~~~eC~v~wG~CnHaFH~HCI~rW 66 (88)
T COG5194 21 DVCAICRNHIMGTCPECQFGMTPGDECPVVWGVCNHAFHDHCIYRW 66 (88)
T ss_pred chhhhhhccccCcCcccccCCCCCCcceEEEEecchHHHHHHHHHH
Confidence 578899874 788888876 468899999998654
No 95
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=33.56 E-value=33 Score=30.29 Aligned_cols=37 Identities=27% Similarity=0.566 Sum_probs=31.7
Q ss_pred CcccccccCc---CCceeecCCcCcccccchhhhhhcCceEE
Q 001243 827 IDVCCICRHK---HGICIKCNYGNCQTTFHPTCARSAGFYLN 865 (1116)
Q Consensus 827 r~~C~iC~~k---~GAcIqCs~~~C~~sFHvtCA~~aG~~~~ 865 (1116)
..+|.+|+.+ .+..|.|- .|.+.||-.|....|-.+.
T Consensus 5 ~~~C~~Cg~~~~~~dDiVvCp--~CgapyHR~C~~~~g~C~~ 44 (54)
T PF14446_consen 5 GCKCPVCGKKFKDGDDIVVCP--ECGAPYHRDCWEKAGGCIN 44 (54)
T ss_pred CccChhhCCcccCCCCEEECC--CCCCcccHHHHhhCCceEe
Confidence 3689999985 68889999 9999999999988876664
No 96
>TIGR02405 trehalos_R_Ecol trehalose operon repressor, proteobacterial. This family consists of repressors of the LacI family typically associated with trehalose utilization operons. Trehalose is imported as trehalose-6-phosphate and then hydrolyzed by alpha,alpha-phosphotrehalase to glucose and glucose-6-P. This family includes repressors mostly from Gammaproteobacteria and does not include the GntR family TreR of Bacillus subtilis
Probab=33.47 E-value=35 Score=38.11 Aligned_cols=49 Identities=14% Similarity=0.270 Sum_probs=42.9
Q ss_pred CcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhcccccccc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLL 281 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~ 281 (1116)
||+++|||.+.|+|.-+|--+|+ ....++..+.||.+-.+..-|.+...
T Consensus 1 ~~ti~dIA~~agVS~sTVSr~Ln~~~~vs~~tr~rV~~~a~~lgY~pn~~ 50 (311)
T TIGR02405 1 KLTIKDIARLAGVGKSTVSRVLNNEPKVSIETRERVEQVIQQSGFVPSKS 50 (311)
T ss_pred CCcHHHHHHHhCCCHHHHHHHhCCCCCCCHHHHHHHHHHHHHHCCCcCHH
Confidence 78999999999999999999998 44578899999999999888876544
No 97
>PF08746 zf-RING-like: RING-like domain; InterPro: IPR014857 This is a zinc finger domain that is related to the C3HC4 RING finger domain (IPR001841 from INTERPRO). ; PDB: 3NW0_A 2CT0_A.
Probab=33.16 E-value=25 Score=29.29 Aligned_cols=30 Identities=20% Similarity=0.639 Sum_probs=16.5
Q ss_pred cccccCcCCceeecCCcCcccccchhhhhh
Q 001243 830 CCICRHKHGICIKCNYGNCQTTFHPTCARS 859 (1116)
Q Consensus 830 C~iC~~k~GAcIqCs~~~C~~sFHvtCA~~ 859 (1116)
|.+|+.-.-.-+.|...+|...+|..|+..
T Consensus 1 C~~C~~iv~~G~~C~~~~C~~r~H~~C~~~ 30 (43)
T PF08746_consen 1 CEACKEIVTQGQRCSNRDCNVRLHDDCFKK 30 (43)
T ss_dssp -TTT-SB-SSSEE-SS--S--EE-HHHHHH
T ss_pred CcccchhHeeeccCCCCccCchHHHHHHHH
Confidence 677887544556799999999999999865
No 98
>PRK15431 ferrous iron transport protein FeoC; Provisional
Probab=33.07 E-value=27 Score=33.00 Aligned_cols=27 Identities=19% Similarity=0.422 Sum_probs=25.1
Q ss_pred HhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 228 LIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 228 li~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
|-++|.++.+++|..++++++.|+|-|
T Consensus 11 l~~~gr~s~~~Ls~~~~~p~~~VeaML 37 (78)
T PRK15431 11 LALRGRMEAAQISQTLNTPQPMINAML 37 (78)
T ss_pred HHHcCcccHHHHHHHHCcCHHHHHHHH
Confidence 456999999999999999999999997
No 99
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=33.05 E-value=29 Score=39.43 Aligned_cols=48 Identities=25% Similarity=0.597 Sum_probs=37.4
Q ss_pred CcCcccCCCC--C----CCCCEEEccccCcccccccccCcc-----CCCCceecccccc
Q 001243 706 RSCDICRRSE--T----ILNPILICSGCKVAVHLDCYRNAK-----ESTGPWYCELCEE 753 (1116)
Q Consensus 706 ~~CsVC~~~E--~----~~N~IL~Cd~C~laVHq~CYGi~~-----ipeg~WlCd~C~~ 753 (1116)
.+|+.|.... . ...++|.|..|+.+-|.+|.-... +....|.|.-|++
T Consensus 225 ~YCDFclgdsr~nkkt~~peelvscsdcgrsghpsclqft~nm~~avk~yrwqcieck~ 283 (336)
T KOG1244|consen 225 PYCDFCLGDSRENKKTGMPEELVSCSDCGRSGHPSCLQFTANMIAAVKTYRWQCIECKY 283 (336)
T ss_pred cccceeccccccccccCCchhhcchhhcCCCCCcchhhhhHHHHHHHHhheeeeeecce
Confidence 5799999642 2 245799999999999999986432 3466899999986
No 100
>PF01022 HTH_5: Bacterial regulatory protein, arsR family; InterPro: IPR001845 Bacterial transcription regulatory proteins that bind DNA via a helix-turn-helix (HTH) motif can be grouped into families on the basis of sequence similarities. One such group, termed arsR, includes several proteins that appear to dissociate from DNA in the presence of metal ions: arsR, which functions as a transcriptional repressor of an arsenic resistance operon; smtB from Synechococcus sp. (strain PCC 7942), which acts as a transcriptional repressor of the smtA gene that codes for a metallothionein; cadC, a protein required for cadmium-resistance; and hypothetical protein yqcJ from Bacillus subtilis. The HTH motif is thought to be located in the central part of these proteins []. The motif is characterised by a number of well-conserved residues: at its N-terminal extremity is a cysteine residue; a second Cys is found in arsR and cadC, but not in smtA; and at the C terminus lie one or two histidines. These residues may be involved in metal-binding (Zn in smtB; metal-oxyanions such as arsenite, antimonite and arsenate for arsR; and cadmium for cadC) []. It is believed that binding of a metal ion could induce a conformational change that would prevent the protein from binding DNA []. The crystal structure of the cyanobacterial smtB shows a fold of five alpha-helices (H) and a pair of antiparallel beta-strands (B) in the topology H1-H2-H3-H4-B1-B2-H5. Helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing as in other wHTH, such as the dtxR-type or the merR-type. Helix 4 is termed the recognition helix, like in other HTHs where it binds the DNA major groove. Most arsR/smtB-like metalloregulators form homodimers []. The dimer interface is formed by helix 5 and an N-terminal part []. Two distinct metal-binding sites have been identified. The first site comprises cysteine thiolates located in the HTH in helix 3 and for some cases in the N terminus, called the alpha3(N) site []. The second metal-binding site is located in helix 5 (and C terminus) and is called the alpha5(C) site. The alpha3N site binds large thiophilic, toxic metals including Cd, Pb, and Bi, as in S. aureus cadC. ArsR lacks the N-terminal arm and its alpha3 site coordinates smaller thiophilic ions like As and Sb. The alpha5 site contains carboxylate and imidazole ligands and interacts preferentially with biologically required metal ions including Zn, Co, and Ni. ArsR-type metalloregulators contain one of these sites, both, or other potential metal-binding sites [, ]. Binding of metal ions to these sites leads to allosteric changes that can derepress the operator/promotor DNA. The metal-inducible operons contain one or two imperfect 12-2-12 inverted repeats, which can be recognised by multimeric arsR-type metalloregulators. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3CUO_A 1U2W_C 3F72_C 3F6V_A 3JTH_B 2P4W_B 1KU9_B 2LKP_B 1SMT_A 1R22_B ....
Probab=33.02 E-value=32 Score=28.54 Aligned_cols=31 Identities=32% Similarity=0.572 Sum_probs=25.1
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
-||+.|.+ |..++.|||.++|+|.-++---|
T Consensus 6 ~Il~~L~~-~~~~~~el~~~l~~s~~~vs~hL 36 (47)
T PF01022_consen 6 RILKLLSE-GPLTVSELAEELGLSQSTVSHHL 36 (47)
T ss_dssp HHHHHHTT-SSEEHHHHHHHHTS-HHHHHHHH
T ss_pred HHHHHHHh-CCCchhhHHHhccccchHHHHHH
Confidence 46777776 99999999999999998876543
No 101
>PF00165 HTH_AraC: Bacterial regulatory helix-turn-helix proteins, AraC family; PDB: 1WPK_A 1ZGW_A 1U8B_A.
Probab=32.50 E-value=23 Score=28.53 Aligned_cols=25 Identities=28% Similarity=0.541 Sum_probs=17.5
Q ss_pred hCCcchhhhhhhhcCChhhhhhccc
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
.-+.+|+|||...|+|+..+.....
T Consensus 6 ~~~~~l~~iA~~~g~S~~~f~r~Fk 30 (42)
T PF00165_consen 6 QQKLTLEDIAEQAGFSPSYFSRLFK 30 (42)
T ss_dssp -SS--HHHHHHHHTS-HHHHHHHHH
T ss_pred cCCCCHHHHHHHHCCCHHHHHHHHH
Confidence 4468999999999999988877643
No 102
>PRK09726 antitoxin HipB; Provisional
Probab=32.20 E-value=69 Score=30.05 Aligned_cols=58 Identities=12% Similarity=0.106 Sum_probs=43.6
Q ss_pred cchHHHHHHHhhhCCcchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHhhhccc
Q 001243 219 LNFTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWLSNHAY 276 (1116)
Q Consensus 219 ~~~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl~~~~~ 276 (1116)
..++--||+|..+..++.+++|..+|||+.+|..-... .....+.-.+|.+.|.=.+.
T Consensus 11 ~~l~~~lk~~R~~~gltq~elA~~~gvs~~tis~~e~g~~~ps~~~l~~ia~~lgv~~~ 69 (88)
T PRK09726 11 TQLANAMKLVRQQNGWTQSELAKKIGIKQATISNFENNPDNTTLTTFFKILQSLELSMT 69 (88)
T ss_pred HHHHHHHHHHHHHcCCCHHHHHHHHCcCHHHHHHHHCCCCCCCHHHHHHHHHHcCCCcc
Confidence 35677899999999999999999999999999876653 23444555666666654444
No 103
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=31.97 E-value=36 Score=32.18 Aligned_cols=30 Identities=23% Similarity=0.656 Sum_probs=20.5
Q ss_pred CCcCcccCCCCCCCCCEEEccccCcccccccc
Q 001243 705 PRSCDICRRSETILNPILICSGCKVAVHLDCY 736 (1116)
Q Consensus 705 d~~CsVC~~~E~~~N~IL~Cd~C~laVHq~CY 736 (1116)
+..|.||...= ++..+.---|+..||..|+
T Consensus 78 ~~~C~vC~k~l--~~~~f~~~p~~~v~H~~C~ 107 (109)
T PF10367_consen 78 STKCSVCGKPL--GNSVFVVFPCGHVVHYSCI 107 (109)
T ss_pred CCCccCcCCcC--CCceEEEeCCCeEEecccc
Confidence 45699999843 3343333356699999997
No 104
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=31.33 E-value=19 Score=44.88 Aligned_cols=30 Identities=30% Similarity=0.765 Sum_probs=25.6
Q ss_pred cccccccccccCce---eeCCCCCCCcccchhhhhh
Q 001243 56 KLVCNICRVKCGAC---VRCSHGTCRTSFHPICARE 88 (1116)
Q Consensus 56 ~LkC~iC~~k~GAc---IqCs~~~C~~~FHvtCA~~ 88 (1116)
...|+.|+++ |.. |+|. .|.++||.+|--.
T Consensus 253 ~~fCsaCn~~-~~F~~~i~CD--~Cp~sFH~~CLeP 285 (613)
T KOG4299|consen 253 EDFCSACNGS-GLFNDIICCD--GCPRSFHQTCLEP 285 (613)
T ss_pred HHHHHHhCCc-cccccceeec--CCchHHHHhhcCC
Confidence 3589999986 888 9998 4999999999754
No 105
>PF14569 zf-UDP: Zinc-binding RING-finger; PDB: 1WEO_A.
Probab=31.11 E-value=10 Score=35.64 Aligned_cols=50 Identities=24% Similarity=0.551 Sum_probs=23.8
Q ss_pred CCCcCcccCCCC--C-CCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 704 HPRSCDICRRSE--T-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 704 ~d~~CsVC~~~E--~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
+...|.||.+.- + +++..+-|..|+..|-+.||--. ..+|.-.|..|...
T Consensus 8 ~~qiCqiCGD~VGl~~~Ge~FVAC~eC~fPvCr~CyEYE-rkeg~q~CpqCkt~ 60 (80)
T PF14569_consen 8 NGQICQICGDDVGLTENGEVFVACHECAFPVCRPCYEYE-RKEGNQVCPQCKTR 60 (80)
T ss_dssp SS-B-SSS--B--B-SSSSB--S-SSS-----HHHHHHH-HHTS-SB-TTT--B
T ss_pred CCcccccccCccccCCCCCEEEEEcccCCccchhHHHHH-hhcCcccccccCCC
Confidence 356899999853 2 47789999999999999999522 34677788888753
No 106
>PF11793 FANCL_C: FANCL C-terminal domain; PDB: 3K1L_A.
Probab=30.96 E-value=24 Score=32.22 Aligned_cols=32 Identities=28% Similarity=0.593 Sum_probs=12.5
Q ss_pred CcCcccCCCCC--CCCCEEEcc--ccCccccccccc
Q 001243 706 RSCDICRRSET--ILNPILICS--GCKVAVHLDCYR 737 (1116)
Q Consensus 706 ~~CsVC~~~E~--~~N~IL~Cd--~C~laVHq~CYG 737 (1116)
..|.||..... ..-+.+.|. .|+..+|..|.-
T Consensus 3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~ 38 (70)
T PF11793_consen 3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLS 38 (70)
T ss_dssp -S-SSS--SS-TT-----B--S-TT----B-SGGGH
T ss_pred CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHH
Confidence 56999998643 223568998 899999999984
No 107
>COG1349 GlpR Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]
Probab=30.66 E-value=37 Score=38.06 Aligned_cols=33 Identities=33% Similarity=0.531 Sum_probs=30.1
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
..||+-|.++|+|+|+|+|..+|+|++++.-=|
T Consensus 8 ~~Il~~l~~~g~v~v~eLa~~~~VS~~TIRRDL 40 (253)
T COG1349 8 QKILELLKEKGKVSVEELAELFGVSEMTIRRDL 40 (253)
T ss_pred HHHHHHHHHcCcEEHHHHHHHhCCCHHHHHHhH
Confidence 568888889999999999999999999999865
No 108
>PF01325 Fe_dep_repress: Iron dependent repressor, N-terminal DNA binding domain; InterPro: IPR022687 The DtxR-type HTH domain is a DNA-binding, winged helix-turn-helix (wHTH) domain of about 65 residues present in metalloregulators of the DtxR/MntR family. The family is named after Corynebacterium diphtheriae DtxR, an iron-specific diphtheria toxin repressor, and Bacillus subtilis MntR, a manganese transport regulator. Iron-responsive metalloregulators such as DtxR and IdeR occur in Gram-positive bacteria of the high GC branch, while manganese-responsive metalloregulators like MntR are described in diverse genera of Gram-positive and Gram-negative bacteria and also in Archaea [].The metalloregulators like DtxR/MntR contain the DNA-binding DtxR-type HTH domain usually in the N-terminal part. The C-terminal part contains a dimerisation domain with two metal-binding sites, although the primary metal-binding site is less conserved in the Mn(II)-regulators. Fe(II)-regulated proteins contain an SH3-like domain as a C-terminal extension, which is absent in Mn(II)-regulated MntR [, ]. Metal-ion dependent regulators orchestrate the virulence of several important human pathogens. The DtxR protein regulates the expression of diphtheria toxinin response to environmental iron concentrations. Furthermore, DtxR and IdeR control iron uptake []. Homeostasis of manganese, which is an essential nutrient, is regulated by MntR. A typical DtxR-type metalloregulator binds two divalent metal effectors per monomer, upon which allosteric changes occur that moderate binding to the cognate DNA operators. Iron-bound DtxR homodimers bind to an interrupted palindrome of 19 bp, protecting a sequence of ~30 bp. The crystal structures of iron-regulated and manganese-regulated repressors show that the DNA binding domain contains three alpha-helices and a pair of antiparallel beta-strands. Helices 2 and 3 comprise the helix-turn-helix motif and the beta-strands are called the wing []. This wHTH topology is similar to the lysR-type HTH (see PDOC00043 from PROSITEDOC). Most DtxR-type metalloregulators bind as dimers to the DNA major groove. Several proteins are known to contain a DtxR-type HTH domain. These include- Corynebacterium diphtheriae DtxR, a diphtheria toxin repressor [], which regulates the expression of the high-affinity iron uptake system, other iron-sensitive genes, and the bacteriophage tox gene. Metal-bound DtxR represses transcription by binding the tox operator; if iron is limiting, conformational changes of the wHTH disrupt DNA-binding and the diphtheria toxin is produced. Mycobacterium tuberculosis IdeR, an iron-dependent regulator that is essential for this pathogen. The regulator represses genes for iron acquisition and activates iron storage genes, and is a positive regulator of oxidative stress responses []. Bacillus subtilis MntR, a manganese transport regulator, binds Mn2+ as an effector and is a transcriptional repressor of transporters for the import of manganese. Treponema pallidum troR, a metal-dependent transcriptional repressor. Archaeoglobus fulgidus MDR1 (troR), a metal-dependent transcriptional repressor, which negatively regulates its own transcription. This entry covers the entire DtxR-type HTH domain.; GO: 0005506 iron ion binding; PDB: 3HRT_B 3HRS_A 3HRU_B 2X4H_D 1ON1_B 2HYF_C 2F5E_A 3R60_B 1ON2_B 2F5F_A ....
Probab=29.51 E-value=50 Score=29.25 Aligned_cols=25 Identities=40% Similarity=0.682 Sum_probs=22.0
Q ss_pred hhCCcchhhhhhhhcCChhhhhhcc
Q 001243 230 DRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 230 ~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
+.+.|+.+|||..+|+||-|+...|
T Consensus 19 ~~~~v~~~~iA~~L~vs~~tvt~ml 43 (60)
T PF01325_consen 19 EGGPVRTKDIAERLGVSPPTVTEML 43 (60)
T ss_dssp CTSSBBHHHHHHHHTS-HHHHHHHH
T ss_pred CCCCccHHHHHHHHCCChHHHHHHH
Confidence 6889999999999999999998775
No 109
>KOG1844 consensus PHD Zn-finger proteins [General function prediction only]
Probab=29.11 E-value=32 Score=41.93 Aligned_cols=46 Identities=20% Similarity=0.432 Sum_probs=37.0
Q ss_pred cccCCCCCCCCCEEEccccCcccccccccCccCCC-Cceeccccccc
Q 001243 709 DICRRSETILNPILICSGCKVAVHLDCYRNAKEST-GPWYCELCEEL 754 (1116)
Q Consensus 709 sVC~~~E~~~N~IL~Cd~C~laVHq~CYGi~~ipe-g~WlCd~C~~~ 754 (1116)
++|...+...+.++.|+.|+.--|..|+|+..... ..+.|..|...
T Consensus 89 c~c~~~~~~~g~~i~c~~c~~Wqh~~C~g~~~~~~p~~y~c~~c~~~ 135 (508)
T KOG1844|consen 89 CDCGLEDDMEGLMIQCDWCGRWQHKICCGSFKSTKPDKYVCEICTPR 135 (508)
T ss_pred cccccccCCCceeeCCcccCcccCceeeeecCCCCchhceeeeeccc
Confidence 46776544478899999999999999999876554 67999999875
No 110
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=28.44 E-value=43 Score=35.92 Aligned_cols=35 Identities=31% Similarity=0.435 Sum_probs=31.4
Q ss_pred hHHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 221 FTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 221 ~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
--.||.-|+.+|-++..|+|.++||+..+|.-.|.
T Consensus 24 ~~~Vl~~L~~~g~~tdeeLA~~Lgi~~~~VRk~L~ 58 (178)
T PRK06266 24 GFEVLKALIKKGEVTDEEIAEQTGIKLNTVRKILY 58 (178)
T ss_pred HhHHHHHHHHcCCcCHHHHHHHHCCCHHHHHHHHH
Confidence 36788899999999999999999999999988763
No 111
>KOG0825 consensus PHD Zn-finger protein [General function prediction only]
Probab=28.27 E-value=51 Score=42.29 Aligned_cols=32 Identities=22% Similarity=0.647 Sum_probs=26.5
Q ss_pred CcccccccCc--CCceeecCCcCcccc-cchhhhhhc
Q 001243 827 IDVCCICRHK--HGICIKCNYGNCQTT-FHPTCARSA 860 (1116)
Q Consensus 827 r~~C~iC~~k--~GAcIqCs~~~C~~s-FHvtCA~~a 860 (1116)
.-.|.||... .-.+|.|. .|... ||..|.-..
T Consensus 215 ~~~C~IC~~~DpEdVLLLCD--sCN~~~YH~YCLDPd 249 (1134)
T KOG0825|consen 215 EVKCDICTVHDPEDVLLLCD--SCNKVYYHVYCLDPD 249 (1134)
T ss_pred cccceeeccCChHHhheeec--ccccceeeccccCcc
Confidence 4789999985 56788999 99988 999997553
No 112
>smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor.
Probab=28.26 E-value=53 Score=27.32 Aligned_cols=20 Identities=20% Similarity=0.552 Sum_probs=18.8
Q ss_pred chhhhhhhhcCChhhhhhcc
Q 001243 235 NVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 235 ~~~~~a~~~g~s~~~~~a~l 254 (1116)
++.++|..+|+|..++..+|
T Consensus 22 s~~~la~~~~vs~~tv~~~l 41 (60)
T smart00345 22 SERELAAQLGVSRTTVREAL 41 (60)
T ss_pred CHHHHHHHHCCCHHHHHHHH
Confidence 89999999999999999886
No 113
>PF13764 E3_UbLigase_R4: E3 ubiquitin-protein ligase UBR4
Probab=28.12 E-value=1.4e+02 Score=39.19 Aligned_cols=31 Identities=19% Similarity=0.346 Sum_probs=21.7
Q ss_pred CCCCCCCcCcccCCCCC-C----CCCEEEccccCcc
Q 001243 700 FSKEHPRSCDICRRSET-I----LNPILICSGCKVA 730 (1116)
Q Consensus 700 ~~ke~d~~CsVC~~~E~-~----~N~IL~Cd~C~la 730 (1116)
...+....|+||+++-. . .+...|+.+|++.
T Consensus 463 l~ee~gl~C~ICrEGy~~~p~~~lGiY~f~kr~~l~ 498 (802)
T PF13764_consen 463 LEEEDGLTCCICREGYKFRPDEVLGIYAFSKRVNLE 498 (802)
T ss_pred ccccCCCeEEEcCCccccCCccceeeEEEeecccch
Confidence 33477889999998642 2 3455688889883
No 114
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=28.12 E-value=40 Score=43.14 Aligned_cols=28 Identities=29% Similarity=0.413 Sum_probs=21.3
Q ss_pred cCcccccccccC--ccCCCCceeccccccc
Q 001243 727 CKVAVHLDCYRN--AKESTGPWYCELCEEL 754 (1116)
Q Consensus 727 C~laVHq~CYGi--~~ipeg~WlCd~C~~~ 754 (1116)
|....|..|--- ...++++|.|..|...
T Consensus 2 ~~r~~~~~~~~p~~~~~~~~~~k~~~~e~~ 31 (696)
T KOG0383|consen 2 CPRAYHRVCLDPKLKEEPEMDPKCPGCESS 31 (696)
T ss_pred CCcccCcCCCCcccccCCcCCccCcchhhc
Confidence 777889999752 2345889999999864
No 115
>PRK10703 DNA-binding transcriptional repressor PurR; Provisional
Probab=27.57 E-value=51 Score=37.16 Aligned_cols=50 Identities=14% Similarity=0.207 Sum_probs=44.3
Q ss_pred CcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhccccccccc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLLK 282 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~~ 282 (1116)
+++.+|||.+.|+|.-++--+|+ ....++..+.||.+-.+..=|.+....
T Consensus 1 ~~Ti~dIA~~agVS~~TVSrvLn~~~~vs~~tr~~V~~~a~elgY~pn~~a 51 (341)
T PRK10703 1 MATIKDVAKRAGVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVA 51 (341)
T ss_pred CCCHHHHHHHhCCCHHHHHHHHcCCCCCCHHHHHHHHHHHHHHCCCcCHHH
Confidence 47899999999999999999998 456899999999999999999886543
No 116
>PF08746 zf-RING-like: RING-like domain; InterPro: IPR014857 This is a zinc finger domain that is related to the C3HC4 RING finger domain (IPR001841 from INTERPRO). ; PDB: 3NW0_A 2CT0_A.
Probab=27.51 E-value=37 Score=28.28 Aligned_cols=30 Identities=27% Similarity=0.709 Sum_probs=16.8
Q ss_pred ccccccccCceeeCCCCCCCcccchhhhhh
Q 001243 59 CNICRVKCGACVRCSHGTCRTSFHPICARE 88 (1116)
Q Consensus 59 C~iC~~k~GAcIqCs~~~C~~~FHvtCA~~ 88 (1116)
|.+|+.-.-.-+.|....|...+|..|+..
T Consensus 1 C~~C~~iv~~G~~C~~~~C~~r~H~~C~~~ 30 (43)
T PF08746_consen 1 CEACKEIVTQGQRCSNRDCNVRLHDDCFKK 30 (43)
T ss_dssp -TTT-SB-SSSEE-SS--S--EE-HHHHHH
T ss_pred CcccchhHeeeccCCCCccCchHHHHHHHH
Confidence 677886555667899899999999999954
No 117
>KOG0695 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=27.45 E-value=22 Score=41.63 Aligned_cols=35 Identities=26% Similarity=0.471 Sum_probs=28.6
Q ss_pred CCcCcccCCCC-CCCCCEEEccccCcccccccccCc
Q 001243 705 PRSCDICRRSE-TILNPILICSGCKVAVHLDCYRNA 739 (1116)
Q Consensus 705 d~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CYGi~ 739 (1116)
...|.||.+.- ..+.+-..|-+|.+.||..|.+..
T Consensus 141 r~~c~ic~d~iwglgrqgyrcinckl~vhkkch~~v 176 (593)
T KOG0695|consen 141 RAYCGICSDRIWGLGRQGYRCINCKLLVHKKCHGLV 176 (593)
T ss_pred ceeeeechhhhhhcccccceeecceeehhhhhcccc
Confidence 46799999853 246778899999999999999754
No 118
>COG2522 Predicted transcriptional regulator [General function prediction only]
Probab=27.38 E-value=51 Score=33.42 Aligned_cols=33 Identities=24% Similarity=0.395 Sum_probs=29.2
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
+++-+.||++ ..|..+||..+|||+-+|-.-|-
T Consensus 12 a~lA~~L~ee-G~Sq~~iA~LLGltqaAVS~Yls 44 (119)
T COG2522 12 ALLAKELIEE-GLSQYRIAKLLGLTQAAVSQYLS 44 (119)
T ss_pred HHHHHHHHHc-CCcHHHHHHHhCCCHHHHHHHHc
Confidence 6788899999 79999999999999999877663
No 119
>smart00744 RINGv The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Class
Probab=26.80 E-value=21 Score=30.45 Aligned_cols=31 Identities=29% Similarity=0.685 Sum_probs=19.9
Q ss_pred cCcccCCCCCCCCCEE-Ecc--ccCccccccccc
Q 001243 707 SCDICRRSETILNPIL-ICS--GCKVAVHLDCYR 737 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~IL-~Cd--~C~laVHq~CYG 737 (1116)
+|-||++.+...+.++ -|. |--..||+.|.-
T Consensus 1 ~CrIC~~~~~~~~~l~~PC~C~G~~~~vH~~Cl~ 34 (49)
T smart00744 1 ICRICHDEGDEGDPLVSPCRCKGSLKYVHQECLE 34 (49)
T ss_pred CccCCCCCCCCCCeeEeccccCCchhHHHHHHHH
Confidence 4899998444455555 332 333679999973
No 120
>PF08221 HTH_9: RNA polymerase III subunit RPC82 helix-turn-helix domain; InterPro: IPR013197 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family consists of several DNA-directed RNA polymerase III polypeptides which are related to the Saccharomyces cerevisiae (Baker's yeast) RPC82 protein. RNA polymerase C (III) promotes the transcription of tRNA and 5S RNA genes. In S. cerevisiae, the enzyme is composed of 15 subunits, ranging from 10 kDa to about 160 kDa []. This region is probably a DNA-binding helix-turn-helix.; PDB: 2XV4_S 2XUB_A.
Probab=26.50 E-value=49 Score=29.48 Aligned_cols=35 Identities=26% Similarity=0.558 Sum_probs=29.1
Q ss_pred hHHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 221 FTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 221 ~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
++-|..-|+.+|..++.+|....++++..+..+|+
T Consensus 15 ~~~V~~~Ll~~G~ltl~~i~~~t~l~~~~Vk~~L~ 49 (62)
T PF08221_consen 15 VAKVGEVLLSRGRLTLREIVRRTGLSPKQVKKALV 49 (62)
T ss_dssp HHHHHHHHHHC-SEEHHHHHHHHT--HHHHHHHHH
T ss_pred HHHHHHHHHHcCCcCHHHHHHHhCCCHHHHHHHHH
Confidence 47788889999999999999999999999999974
No 121
>TIGR00180 parB_part ParB-like partition proteins. This model represents the most well-conserved core of a set of chromosomal and plasmid partition proteins related to ParB, including Spo0J, RepB, and SopB. Spo0J has been shown to bind a specific DNA sequence that, when introduced into a plasmid, can serve as partition site. Study of RepB, which has nicking-closing activity, suggests that it forms a transient protein-DNA covalent intermediate during the strand transfer reaction.
Probab=26.42 E-value=55 Score=34.87 Aligned_cols=52 Identities=19% Similarity=0.273 Sum_probs=42.9
Q ss_pred CCcchHHHHHHHhhhCCcchhhhhhhhcCChhhhhhccccccccchhhHHHH
Q 001243 217 DALNFTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLADGTFASDLQCKLV 268 (1116)
Q Consensus 217 ~~~~~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~~~~~~~~~k~~ 268 (1116)
...+-+...++|++.+..+.++||..+|+|...|...|.-..+.++++-.+-
T Consensus 104 t~~e~a~~~~~l~~~~g~s~~~iA~~lg~s~~~V~r~l~l~~lp~~v~~~~~ 155 (187)
T TIGR00180 104 SPIEEAQAYKRLLEKFSMTQEDLAKKIGKSRAHITNLLRLLKLPSEIQSAIP 155 (187)
T ss_pred CHHHHHHHHHHHHHHhCCCHHHHHHHHCcCHHHHHHHHHHHcCCHHHHHHHH
Confidence 4456688899999887889999999999999999999886667777766554
No 122
>PRK05472 redox-sensing transcriptional repressor Rex; Provisional
Probab=26.41 E-value=42 Score=36.32 Aligned_cols=34 Identities=26% Similarity=0.534 Sum_probs=30.6
Q ss_pred HHHHHHHhhhC--CcchhhhhhhhcCChhhhhhccc
Q 001243 222 TLILKKLIDRG--KVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 222 ~~~lkkli~~g--kv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
..+|+.|..+| .|+++++|...|+||.++..=|.
T Consensus 19 ~~il~~l~~~~~~~vs~~~L~~~~~v~~~tirrDl~ 54 (213)
T PRK05472 19 YRYLKELKEEGVERVSSKELAEALGVDSAQIRKDLS 54 (213)
T ss_pred HHHHHHHHHcCCcEEeHHHHHHHhCcCHHHHHHHHH
Confidence 56899999999 99999999999999998887654
No 123
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.38 E-value=47 Score=34.85 Aligned_cols=34 Identities=24% Similarity=0.373 Sum_probs=31.3
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
-.|+..|+..|.++..|||.++||+.-.|...|.
T Consensus 17 v~Vl~aL~~~~~~tdEeLa~~Lgi~~~~VRk~L~ 50 (158)
T TIGR00373 17 GLVLFSLGIKGEFTDEEISLELGIKLNEVRKALY 50 (158)
T ss_pred HHHHHHHhccCCCCHHHHHHHHCCCHHHHHHHHH
Confidence 6789999999999999999999999999998863
No 124
>PRK10401 DNA-binding transcriptional regulator GalS; Provisional
Probab=26.26 E-value=55 Score=37.13 Aligned_cols=49 Identities=18% Similarity=0.289 Sum_probs=43.1
Q ss_pred CcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhcccccccc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLL 281 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~ 281 (1116)
+++++|||.+.|+|.-+|--+|+ ....++..+-||++=.+..=|.+...
T Consensus 1 ~~ti~dIA~~aGVS~~TVSrvLn~~~~Vs~~tr~kV~~~a~elgY~pn~~ 50 (346)
T PRK10401 1 MITIRDVARQAGVSVATVSRVLNNSALVSADTREAVMKAVSELGYRPNAN 50 (346)
T ss_pred CCCHHHHHHHhCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCHH
Confidence 47899999999999999999998 45688999999999999988876544
No 125
>TIGR03830 CxxCG_CxxCG_HTH putative zinc finger/helix-turn-helix protein, YgiT family. This model describes a family of predicted regulatory proteins with a conserved zinc finger/HTH architecture. The amino-terminal region contains a novel domain, featuring two CXXC motifs and occuring in a number of small bacterial proteins as well as in the present family. The carboxyl-terminal region consists of a helix-turn-helix domain, modeled by pfam01381. The predicted function is DNA binding and transcriptional regulation.
Probab=26.19 E-value=1.1e+02 Score=29.95 Aligned_cols=53 Identities=13% Similarity=0.142 Sum_probs=39.4
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccccccccchhhHHHHHHhhhc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLADGTFASDLQCKLVKWLSNH 274 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~~~~~~~~~k~~~wl~~~ 274 (1116)
..-||.+..+-.++-+++|..+|+|+.+|..-.......+.-..+|+++|..+
T Consensus 67 ~~~i~~~r~~~gltq~~lA~~lg~~~~tis~~e~g~~~p~~~~~~l~~~l~~~ 119 (127)
T TIGR03830 67 PPEIRRIRKKLGLSQREAAELLGGGVNAFSRYERGEVRPSKALDKLLRLLDKH 119 (127)
T ss_pred HHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHC
Confidence 44578888888999999999999999999877654443334456677776655
No 126
>PRK09526 lacI lac repressor; Reviewed
Probab=26.11 E-value=58 Score=36.66 Aligned_cols=52 Identities=15% Similarity=0.208 Sum_probs=45.0
Q ss_pred hCCcchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHhhhccccccccc
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWLSNHAYLGGLLK 282 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl~~~~~~~~~~~ 282 (1116)
.++|+++|||.+.|+|.-+|--+|+. ...++..+.||.+-.+..=|.+....
T Consensus 3 ~~~~ti~dIA~~aGVS~~TVSrvLn~~~~vs~~tr~rV~~~a~elgY~pn~~a 55 (342)
T PRK09526 3 SKPVTLYDVARYAGVSYQTVSRVLNQASHVSAKTREKVEAAMAELNYVPNRVA 55 (342)
T ss_pred CCCCcHHHHHHHhCCCHHHHHHHhcCCCCCCHHHHHHHHHHHHHHCCCcCHHH
Confidence 36799999999999999999999984 45889999999999999888776544
No 127
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=25.81 E-value=43 Score=40.58 Aligned_cols=21 Identities=24% Similarity=0.735 Sum_probs=15.3
Q ss_pred CCcCcccCCCCCCCCCEEEccccCccc
Q 001243 705 PRSCDICRRSETILNPILICSGCKVAV 731 (1116)
Q Consensus 705 d~~CsVC~~~E~~~N~IL~Cd~C~laV 731 (1116)
+-.|.||-...+ ||..|--.+
T Consensus 113 dc~C~iC~~~~g------FC~~C~C~i 133 (446)
T PF07227_consen 113 DCDCKICCSEPG------FCRRCMCCI 133 (446)
T ss_pred ccCcchhcCCCC------ccccCCccc
Confidence 356999987555 899886554
No 128
>PRK10727 DNA-binding transcriptional regulator GalR; Provisional
Probab=25.48 E-value=58 Score=36.86 Aligned_cols=50 Identities=12% Similarity=0.123 Sum_probs=44.4
Q ss_pred CcchhhhhhhhcCChhhhhhccc-cccccchhhHHHHHHhhhccccccccc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLA-DGTFASDLQCKLVKWLSNHAYLGGLLK 282 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~-~~~~~~~~~~k~~~wl~~~~~~~~~~~ 282 (1116)
+++++|||.+.|+|.-+|--.|+ ....++....||.+-.+..=|.+....
T Consensus 1 ~~ti~dIA~~aGVS~~TVSrvLn~~~~Vs~~tr~rV~~~a~elgY~pn~~a 51 (343)
T PRK10727 1 MATIKDVARLAGVSVATVSRVINNSPKASEASRLAVHSAMESLSYHPNANA 51 (343)
T ss_pred CCCHHHHHHHhCCCHHHHHHHhCCCCCCCHHHHHHHHHHHHHHCCCCCHHH
Confidence 47899999999999999999998 456999999999999999999876543
No 129
>KOG0696 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=25.36 E-value=27 Score=42.05 Aligned_cols=33 Identities=24% Similarity=0.491 Sum_probs=27.7
Q ss_pred CCcCcccCCCC-CCCCCEEEccccCccccccccc
Q 001243 705 PRSCDICRRSE-TILNPILICSGCKVAVHLDCYR 737 (1116)
Q Consensus 705 d~~CsVC~~~E-~~~N~IL~Cd~C~laVHq~CYG 737 (1116)
..+|+-|.+.- .-+.+-++|.-|...||+.|.-
T Consensus 56 PTfCsHCkDFiwGfgKQGfQCqvC~fvvHkrChe 89 (683)
T KOG0696|consen 56 PTFCSHCKDFIWGFGKQGFQCQVCCFVVHKRCHE 89 (683)
T ss_pred CchhhhhhhheeccccCceeeeEEeehhhhhhcc
Confidence 46899999864 2467889999999999999984
No 130
>PF04760 IF2_N: Translation initiation factor IF-2, N-terminal region; InterPro: IPR006847 This region is found in the N-terminal half of translation initiation factor IF-2. It is found in two copies in IF-2 alpha isoforms, and in only one copy in the N-terminally truncated beta and gamma isoforms []. Its function is unknown.; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 1ND9_A.
Probab=23.88 E-value=35 Score=29.19 Aligned_cols=23 Identities=22% Similarity=0.521 Sum_probs=19.4
Q ss_pred CCcchhhhhhhhcCChhhhhhcc
Q 001243 232 GKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 232 gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
.+++|.|+|.++|+++..|-..|
T Consensus 2 ~~i~V~elAk~l~v~~~~ii~~l 24 (54)
T PF04760_consen 2 EKIRVSELAKELGVPSKEIIKKL 24 (54)
T ss_dssp -EE-TTHHHHHHSSSHHHHHHHH
T ss_pred CceEHHHHHHHHCcCHHHHHHHH
Confidence 46899999999999999998887
No 131
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=23.80 E-value=35 Score=32.25 Aligned_cols=30 Identities=20% Similarity=0.616 Sum_probs=18.6
Q ss_pred cccccccCcCCceeecCCcCcccccchhhhh
Q 001243 828 DVCCICRHKHGICIKCNYGNCQTTFHPTCAR 858 (1116)
Q Consensus 828 ~~C~iC~~k~GAcIqCs~~~C~~sFHvtCA~ 858 (1116)
..|.+|+++-|...---+ -|...||..|+.
T Consensus 79 ~~C~vC~k~l~~~~f~~~-p~~~v~H~~C~~ 108 (109)
T PF10367_consen 79 TKCSVCGKPLGNSVFVVF-PCGHVVHYSCIK 108 (109)
T ss_pred CCccCcCCcCCCceEEEe-CCCeEEeccccc
Confidence 568888887554322222 345788888875
No 132
>TIGR02531 yecD_yerC TrpR-related protein YerC/YecD. This model represents a protein subfamily found mostly in the Firmicutes (Bacillus and allies). This family is similar in sequence to the trp operon repressor TrpR described by TIGR01321, and represents a distinct clade within the broader family described by pfam01371. At least one species, Xylella fastidiosa, in the Proteobacteria, has a member of both this family and TIGR01321. Several genomes with a member of this family do not synthesize tryptophan, and members of this family should not be considered trp operon repressors without new evidence.
Probab=23.66 E-value=58 Score=31.25 Aligned_cols=29 Identities=28% Similarity=0.515 Sum_probs=24.5
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhh
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKT 252 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a 252 (1116)
.-+.+|+++|+ +.++||..+|||.-++.-
T Consensus 41 ~~I~~ll~~G~-S~~eIA~~LgISrsTIyR 69 (88)
T TIGR02531 41 LQVAKMLKQGK-TYSDIEAETGASTATISR 69 (88)
T ss_pred HHHHHHHHCCC-CHHHHHHHHCcCHHHHHH
Confidence 44566788886 999999999999998875
No 133
>PRK11169 leucine-responsive transcriptional regulator; Provisional
Probab=23.44 E-value=52 Score=34.30 Aligned_cols=33 Identities=21% Similarity=0.433 Sum_probs=29.5
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
--||..|.+-|..+..+||.++|+|+-++..-+
T Consensus 17 ~~IL~~Lq~d~R~s~~eiA~~lglS~~tv~~Ri 49 (164)
T PRK11169 17 RNILNELQKDGRISNVELSKRVGLSPTPCLERV 49 (164)
T ss_pred HHHHHHhccCCCCCHHHHHHHHCcCHHHHHHHH
Confidence 567889999999999999999999999987764
No 134
>PLN02638 cellulose synthase A (UDP-forming), catalytic subunit
Probab=23.36 E-value=48 Score=44.18 Aligned_cols=50 Identities=26% Similarity=0.584 Sum_probs=39.4
Q ss_pred CCCcCcccCCCC--C-CCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 704 HPRSCDICRRSE--T-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 704 ~d~~CsVC~~~E--~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
+...|.||++.- + +++..|-|.-|+..|-+.||- .+..+|.=-|..|+..
T Consensus 16 ~~qiCqICGD~vg~~~~Ge~FVAC~eC~FPVCrpCYE-YEr~eG~q~CPqCktr 68 (1079)
T PLN02638 16 GGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYE-YERKDGNQSCPQCKTK 68 (1079)
T ss_pred CCceeeecccccCcCCCCCEEEEeccCCCccccchhh-hhhhcCCccCCccCCc
Confidence 346899999852 2 467789999999999999992 3345888899999764
No 135
>PF13639 zf-RING_2: Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=23.28 E-value=30 Score=28.15 Aligned_cols=30 Identities=20% Similarity=0.458 Sum_probs=22.7
Q ss_pred cCcccCCCCCCCCCEEEccccCccccccccc
Q 001243 707 SCDICRRSETILNPILICSGCKVAVHLDCYR 737 (1116)
Q Consensus 707 ~CsVC~~~E~~~N~IL~Cd~C~laVHq~CYG 737 (1116)
.|.||.+.-...+.++... |+-.+|..|..
T Consensus 2 ~C~IC~~~~~~~~~~~~l~-C~H~fh~~Ci~ 31 (44)
T PF13639_consen 2 ECPICLEEFEDGEKVVKLP-CGHVFHRSCIK 31 (44)
T ss_dssp CETTTTCBHHTTSCEEEET-TSEEEEHHHHH
T ss_pred CCcCCChhhcCCCeEEEcc-CCCeeCHHHHH
Confidence 5999998533356666666 99999999974
No 136
>PLN02400 cellulose synthase
Probab=23.11 E-value=58 Score=43.53 Aligned_cols=49 Identities=24% Similarity=0.599 Sum_probs=39.0
Q ss_pred CCcCcccCCCC--C-CCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 705 PRSCDICRRSE--T-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 705 d~~CsVC~~~E--~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
...|.||.+.- + +++..|-|..|...|-+-||- .+..+|.=.|..|+..
T Consensus 36 gqiCqICGD~VG~t~dGe~FVAC~eCaFPVCRpCYE-YERkeGnq~CPQCkTr 87 (1085)
T PLN02400 36 GQICQICGDDVGVTETGDVFVACNECAFPVCRPCYE-YERKDGTQCCPQCKTR 87 (1085)
T ss_pred CceeeecccccCcCCCCCEEEEEccCCCccccchhh-eecccCCccCcccCCc
Confidence 45899999852 2 467889999999999999993 3345888899999764
No 137
>PF08280 HTH_Mga: M protein trans-acting positive regulator (MGA) HTH domain; InterPro: IPR013199 Mga is a DNA-binding protein that activates the expression of several important virulence genes in group A streptococcus in response to changing environmental conditions [].; PDB: 2WTE_A 3SQN_A.
Probab=22.10 E-value=78 Score=27.68 Aligned_cols=33 Identities=21% Similarity=0.407 Sum_probs=28.5
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
.+|.-|++.+..+++++|..+|+|.-+|..-+.
T Consensus 9 ~Ll~~L~~~~~~~~~ela~~l~~S~rti~~~i~ 41 (59)
T PF08280_consen 9 KLLELLLKNKWITLKELAKKLNISERTIKNDIN 41 (59)
T ss_dssp HHHHHHHHHTSBBHHHHHHHCTS-HHHHHHHHH
T ss_pred HHHHHHHcCCCCcHHHHHHHHCCCHHHHHHHHH
Confidence 467888999999999999999999999987764
No 138
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=22.02 E-value=70 Score=27.89 Aligned_cols=42 Identities=17% Similarity=0.306 Sum_probs=28.6
Q ss_pred CcchhhhhhhhcCChhhhhhccccc-cccchhh------HHHHHHhhhc
Q 001243 233 KVNVKDIASDIGISPDLLKTTLADG-TFASDLQ------CKLVKWLSNH 274 (1116)
Q Consensus 233 kv~~~~~a~~~g~s~~~~~a~l~~~-~~~~~~~------~k~~~wl~~~ 274 (1116)
++++++||.++|+|+..|...+... .++|.-- .+++.||.++
T Consensus 1 ~~~~~~la~~~~~s~~~l~~~f~~~~~~s~~~~~~~~r~~~a~~~l~~~ 49 (84)
T smart00342 1 PLTLEDLAEALGMSPRHLQRLFKKETGTTPKQYLRDRRLERARRLLRDT 49 (84)
T ss_pred CCCHHHHHHHhCCCHHHHHHHHHHHhCcCHHHHHHHHHHHHHHHHHHcC
Confidence 4789999999999999999887632 2333211 2456666655
No 139
>PRK10434 srlR DNA-bindng transcriptional repressor SrlR; Provisional
Probab=21.83 E-value=65 Score=36.14 Aligned_cols=33 Identities=24% Similarity=0.404 Sum_probs=29.4
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
..||..|-.+|+|+++|+|..+|+|++++..-|
T Consensus 8 ~~Il~~L~~~~~v~v~eLa~~l~VS~~TIRRDL 40 (256)
T PRK10434 8 AAILEYLQKQGKTSVEELAQYFDTTGTTIRKDL 40 (256)
T ss_pred HHHHHHHHHcCCEEHHHHHHHHCCCHHHHHHHH
Confidence 467777888999999999999999999998876
No 140
>PRK10072 putative transcriptional regulator; Provisional
Probab=21.82 E-value=1.1e+02 Score=29.79 Aligned_cols=52 Identities=17% Similarity=0.256 Sum_probs=40.2
Q ss_pred HHHHHhhhCCcchhhhhhhhcCChhhhhhccccccccchhhHHHHHHhhhcc
Q 001243 224 ILKKLIDRGKVNVKDIASDIGISPDLLKTTLADGTFASDLQCKLVKWLSNHA 275 (1116)
Q Consensus 224 ~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~~~~~~~~~k~~~wl~~~~ 275 (1116)
=+|.|..+-+.+-+++|..+|||.-+|..-.........--..++++|+.+.
T Consensus 37 eik~LR~~~glTQ~elA~~lGvS~~TVs~WE~G~r~P~~~~l~Ll~~L~~~P 88 (96)
T PRK10072 37 EFEQLRKGTGLKIDDFARVLGVSVAMVKEWESRRVKPSSAELKLMRLIQANP 88 (96)
T ss_pred HHHHHHHHcCCCHHHHHHHhCCCHHHHHHHHcCCCCCCHHHHHHHHHHhhCH
Confidence 3788888899999999999999998887776544443444467888887665
No 141
>smart00344 HTH_ASNC helix_turn_helix ASNC type. AsnC: an autogenously regulated activator of asparagine synthetase A transcription in Escherichia coli
Probab=21.70 E-value=70 Score=30.50 Aligned_cols=32 Identities=22% Similarity=0.560 Sum_probs=28.7
Q ss_pred HHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 223 LILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 223 ~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
-||+.|-+.|.++..++|.++|+|+.++...|
T Consensus 7 ~il~~L~~~~~~~~~~la~~l~~s~~tv~~~l 38 (108)
T smart00344 7 KILEELQKDARISLAELAKKVGLSPSTVHNRV 38 (108)
T ss_pred HHHHHHHHhCCCCHHHHHHHHCcCHHHHHHHH
Confidence 57788888899999999999999999998775
No 142
>PF05043 Mga: Mga helix-turn-helix domain; InterPro: IPR007737 Mga is a DNA-binding protein that activates the expression of several important virulence genes in group A streptococcus in response to changing environmental conditions []. The family also contains VirR like proteins which match only at the C terminus of the alignment.; PDB: 3SQN_A.
Probab=21.56 E-value=87 Score=28.94 Aligned_cols=42 Identities=29% Similarity=0.414 Sum_probs=32.6
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhccccccccchhhHHHHHHhhh
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTLADGTFASDLQCKLVKWLSN 273 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~~~~~~~~~~~k~~~wl~~ 273 (1116)
-.+++-|+..+.+++.++|.+++||.-++...+ .+|-+||+.
T Consensus 19 ~~ll~~ll~~~~~s~~~la~~~~iS~sti~~~i----------~~l~~~l~~ 60 (87)
T PF05043_consen 19 YQLLKLLLNNEYVSIEDLAEELFISRSTIYRDI----------KKLNKYLKK 60 (87)
T ss_dssp HHHHHHHHH-SEEEHHHHHHHHT--HHHHHHHH----------HHHHHHHHC
T ss_pred HHHHHHHHcCCCcCHHHHHHHHCCCHHHHHHHH----------HHHHHHHHH
Confidence 457788889999999999999999999998876 466777774
No 143
>PF13551 HTH_29: Winged helix-turn helix
Probab=21.45 E-value=1.1e+02 Score=28.92 Aligned_cols=50 Identities=20% Similarity=0.331 Sum_probs=37.5
Q ss_pred HhhhCCcchhhhhhhhcCChhhhhhccc-----------c--------cc-ccchhhHHHHHHhhhcccc
Q 001243 228 LIDRGKVNVKDIASDIGISPDLLKTTLA-----------D--------GT-FASDLQCKLVKWLSNHAYL 277 (1116)
Q Consensus 228 li~~gkv~~~~~a~~~g~s~~~~~a~l~-----------~--------~~-~~~~~~~k~~~wl~~~~~~ 277 (1116)
|+.+|.-++.+||..+|+|+.++..-+. . .. +.+.....|+.|+..+...
T Consensus 7 l~~~g~~~~~~ia~~lg~s~~Tv~r~~~~~~~~G~~~l~~~~~~~g~~~~~l~~~~~~~l~~~~~~~p~~ 76 (112)
T PF13551_consen 7 LLAEGVSTIAEIARRLGISRRTVYRWLKRYREGGIEGLLPRKPRGGRPRKRLSEEQRAQLIELLRENPPE 76 (112)
T ss_pred HHHcCCCcHHHHHHHHCcCHHHHHHHHHHHHcccHHHHHhccccCCCCCCCCCHHHHHHHHHHHHHCCCC
Confidence 4556766899999999999999865544 1 12 6677778888888887655
No 144
>PRK04424 fatty acid biosynthesis transcriptional regulator; Provisional
Probab=21.10 E-value=73 Score=34.13 Aligned_cols=35 Identities=9% Similarity=0.159 Sum_probs=31.5
Q ss_pred hHHHHHHHhhhCCcchhhhhhhhcCChhhhhhccc
Q 001243 221 FTLILKKLIDRGKVNVKDIASDIGISPDLLKTTLA 255 (1116)
Q Consensus 221 ~~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l~ 255 (1116)
...||..|-.+|.++++|+|..+|+|.+++.--|.
T Consensus 9 ~~~Il~~l~~~~~~~~~~La~~~~vS~~TiRRDl~ 43 (185)
T PRK04424 9 QKALQELIEENPFITDEELAEKFGVSIQTIRLDRM 43 (185)
T ss_pred HHHHHHHHHHCCCEEHHHHHHHHCcCHHHHHHHHH
Confidence 46788889999999999999999999999998764
No 145
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=20.75 E-value=66 Score=28.04 Aligned_cols=29 Identities=21% Similarity=0.479 Sum_probs=23.6
Q ss_pred HHHhhhCCcchhhhhhhhcC-Chhhhhhcc
Q 001243 226 KKLIDRGKVNVKDIASDIGI-SPDLLKTTL 254 (1116)
Q Consensus 226 kkli~~gkv~~~~~a~~~g~-s~~~~~a~l 254 (1116)
..+|..+..++.|||.+.|+ |+..+....
T Consensus 43 ~~~l~~~~~~~~~ia~~~g~~s~~~f~r~F 72 (84)
T smart00342 43 RRLLRDTDLSVTEIALRVGFSSQSYFSRAF 72 (84)
T ss_pred HHHHHcCCCCHHHHHHHhCCCChHHHHHHH
Confidence 45566779999999999999 888776654
No 146
>TIGR01481 ccpA catabolite control protein A. Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [ex: acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.
Probab=20.50 E-value=81 Score=35.28 Aligned_cols=48 Identities=15% Similarity=0.248 Sum_probs=42.9
Q ss_pred cchhhhhhhhcCChhhhhhcccc-ccccchhhHHHHHHhhhcccccccc
Q 001243 234 VNVKDIASDIGISPDLLKTTLAD-GTFASDLQCKLVKWLSNHAYLGGLL 281 (1116)
Q Consensus 234 v~~~~~a~~~g~s~~~~~a~l~~-~~~~~~~~~k~~~wl~~~~~~~~~~ 281 (1116)
++++|||.+.|+|.-+|--+|+. ...++....||.+=.+..-|.+...
T Consensus 2 ~ti~dIA~~agvS~~TVSrvLn~~~~vs~~tr~rV~~~a~~lgY~pn~~ 50 (329)
T TIGR01481 2 VTIYDVAREAGVSMATVSRVVNGNPNVKPATRKKVLEVIKRLDYRPNAV 50 (329)
T ss_pred CcHHHHHHHhCCCHHHHHHHhCCCCCCCHHHHHHHHHHHHHHCCCCCHH
Confidence 68999999999999999999994 5689999999999999999977554
No 147
>PF11793 FANCL_C: FANCL C-terminal domain; PDB: 3K1L_A.
Probab=20.49 E-value=55 Score=29.88 Aligned_cols=33 Identities=27% Similarity=0.535 Sum_probs=13.7
Q ss_pred cccccccCc---CC--ceeecCCcCcccccchhhhhhc
Q 001243 828 DVCCICRHK---HG--ICIKCNYGNCQTTFHPTCARSA 860 (1116)
Q Consensus 828 ~~C~iC~~k---~G--AcIqCs~~~C~~sFHvtCA~~a 860 (1116)
..|.||... .+ ..+.|....|...||..|-...
T Consensus 3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~w 40 (70)
T PF11793_consen 3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEW 40 (70)
T ss_dssp -S-SSS--SS-TT-----B--S-TT----B-SGGGHHH
T ss_pred CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHH
Confidence 468888863 22 3467999999999999997554
No 148
>PF04405 ScdA_N: Domain of Unknown function (DUF542) ; InterPro: IPR007500 This is a domain of unknown function found at the N terminus of genes involved in cell wall development and nitrous oxide protection. ScdA is required for normal cell growth and development; mutants have an increased level of peptidoglycan cross-linking and aberrant cellular morphology suggesting a role for ScdA in cell wall metabolism []. NorA1, NorA2, and YtfE are involved in the nitrous oxide response. NorA1 and NorA2, which are similar to YtfE, are co-transcribed with the membrane-bound nitrous oxide (NO) reductases. The genes appear to be involved in NO protection but their function is unknown [, ].
Probab=20.18 E-value=52 Score=29.04 Aligned_cols=37 Identities=19% Similarity=0.433 Sum_probs=30.4
Q ss_pred chHHHHHHH-hh---hCCcchhhhhhhhcCChhhhhhcccc
Q 001243 220 NFTLILKKL-ID---RGKVNVKDIASDIGISPDLLKTTLAD 256 (1116)
Q Consensus 220 ~~~~~lkkl-i~---~gkv~~~~~a~~~g~s~~~~~a~l~~ 256 (1116)
..+-|++|+ || -|+.++.+.+.+.||+++.|.+.|.+
T Consensus 14 ~~a~vf~~~gIDfCCgG~~~L~eA~~~~~ld~~~vl~~L~~ 54 (56)
T PF04405_consen 14 RAARVFRKYGIDFCCGGNRSLEEACEEKGLDPEEVLEELNA 54 (56)
T ss_pred HHHHHHHHcCCcccCCCCchHHHHHHHcCCCHHHHHHHHHH
Confidence 346677775 55 58999999999999999999999853
No 149
>PF10078 DUF2316: Uncharacterized protein conserved in bacteria (DUF2316); InterPro: IPR018757 Members of this family of hypothetical bacterial proteins have no known function.
Probab=20.18 E-value=63 Score=31.29 Aligned_cols=42 Identities=24% Similarity=0.311 Sum_probs=28.7
Q ss_pred hCCcchhhhhhhhcCChhhhhhccccccccchhhHHHHHHhh
Q 001243 231 RGKVNVKDIASDIGISPDLLKTTLADGTFASDLQCKLVKWLS 272 (1116)
Q Consensus 231 ~gkv~~~~~a~~~g~s~~~~~a~l~~~~~~~~~~~k~~~wl~ 272 (1116)
+=-++..+||.++|+|++-|+..|.-..-.|..=-.+-.+|-
T Consensus 21 ~~~ls~~~ia~dL~~s~~~le~vL~l~~~~~~~vW~lRdyL~ 62 (89)
T PF10078_consen 21 LSGLSLEQIAADLGTSPEHLEQVLNLKQPFPEDVWILRDYLN 62 (89)
T ss_pred HcCCCHHHHHHHhCCCHHHHHHHHcCCCCCcccchHHHHHHH
Confidence 445899999999999999999998732222333334444443
No 150
>PRK10141 DNA-binding transcriptional repressor ArsR; Provisional
Probab=20.07 E-value=78 Score=31.85 Aligned_cols=33 Identities=21% Similarity=0.246 Sum_probs=28.0
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
--||+.|.+.|.++|.+||..+|+++-++-.-|
T Consensus 19 l~IL~~L~~~~~~~v~ela~~l~lsqstvS~HL 51 (117)
T PRK10141 19 LGIVLLLRESGELCVCDLCTALDQSQPKISRHL 51 (117)
T ss_pred HHHHHHHHHcCCcCHHHHHHHHCcCHHHHHHHH
Confidence 457888888899999999999999999886544
No 151
>PRK11179 DNA-binding transcriptional regulator AsnC; Provisional
Probab=20.05 E-value=75 Score=32.71 Aligned_cols=33 Identities=21% Similarity=0.483 Sum_probs=29.2
Q ss_pred HHHHHHHhhhCCcchhhhhhhhcCChhhhhhcc
Q 001243 222 TLILKKLIDRGKVNVKDIASDIGISPDLLKTTL 254 (1116)
Q Consensus 222 ~~~lkkli~~gkv~~~~~a~~~g~s~~~~~a~l 254 (1116)
-.||..|-.-|..+..+||.++|+|+.++..-+
T Consensus 12 ~~Il~~Lq~d~R~s~~eiA~~lglS~~tV~~Ri 44 (153)
T PRK11179 12 RGILEALMENARTPYAELAKQFGVSPGTIHVRV 44 (153)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHCcCHHHHHHHH
Confidence 457788888999999999999999999988764
No 152
>PLN02436 cellulose synthase A
Probab=20.04 E-value=66 Score=42.98 Aligned_cols=49 Identities=27% Similarity=0.626 Sum_probs=38.5
Q ss_pred CCcCcccCCCC--C-CCCCEEEccccCcccccccccCccCCCCceeccccccc
Q 001243 705 PRSCDICRRSE--T-ILNPILICSGCKVAVHLDCYRNAKESTGPWYCELCEEL 754 (1116)
Q Consensus 705 d~~CsVC~~~E--~-~~N~IL~Cd~C~laVHq~CYGi~~ipeg~WlCd~C~~~ 754 (1116)
...|.||.+.- + ++...|-|.-|+..|-..||- .+..+|.=.|..|+..
T Consensus 36 ~~iCqICGD~Vg~t~dGe~FVACn~C~fpvCr~Cye-yer~eg~~~Cpqckt~ 87 (1094)
T PLN02436 36 GQTCQICGDEIELTVDGEPFVACNECAFPVCRPCYE-YERREGNQACPQCKTR 87 (1094)
T ss_pred CccccccccccCcCCCCCEEEeeccCCCccccchhh-hhhhcCCccCcccCCc
Confidence 35799999852 2 466789999999999999993 3345788889999754
Done!