Query psy8622
Match_columns 101
No_of_seqs 213 out of 1453
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 23:31:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8622.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8622hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13639 zf-RING_2: Ring finge 99.7 2.3E-18 5.1E-23 86.8 2.7 43 41-83 2-44 (44)
2 KOG4628|consensus 99.6 4.4E-16 9.5E-21 107.2 4.3 52 40-91 230-282 (348)
3 PF12678 zf-rbx1: RING-H2 zinc 99.5 9.9E-15 2.1E-19 80.9 4.4 45 39-83 19-73 (73)
4 KOG0317|consensus 99.5 4.9E-15 1.1E-19 99.1 3.4 59 32-93 232-290 (293)
5 PLN03208 E3 ubiquitin-protein 99.5 3.6E-14 7.8E-19 90.9 4.7 58 34-94 13-86 (193)
6 PHA02929 N1R/p28-like protein; 99.5 2.4E-14 5.2E-19 94.8 4.0 51 37-87 172-227 (238)
7 KOG0823|consensus 99.5 8.8E-14 1.9E-18 90.7 5.2 61 35-98 43-106 (230)
8 COG5243 HRD1 HRD ubiquitin lig 99.4 7.7E-14 1.7E-18 96.4 3.9 53 35-87 283-345 (491)
9 PF13920 zf-C3HC4_3: Zinc fing 99.4 1.4E-13 3E-18 71.0 3.8 46 39-87 2-48 (50)
10 COG5540 RING-finger-containing 99.4 1.5E-13 3.2E-18 92.8 4.2 53 36-88 320-373 (374)
11 PF13923 zf-C3HC4_2: Zinc fing 99.4 2.4E-13 5.3E-18 66.6 3.2 39 42-82 1-39 (39)
12 PF15227 zf-C3HC4_4: zinc fing 99.3 8.8E-13 1.9E-17 65.6 2.8 38 42-82 1-42 (42)
13 KOG0320|consensus 99.3 2.8E-12 6.1E-17 80.7 4.2 55 35-90 127-181 (187)
14 cd00162 RING RING-finger (Real 99.3 2.7E-12 5.9E-17 63.9 3.2 44 41-86 1-45 (45)
15 PF14634 zf-RING_5: zinc-RING 99.3 3.2E-12 6.9E-17 64.2 3.1 44 41-84 1-44 (44)
16 KOG0802|consensus 99.3 2.5E-12 5.4E-17 94.1 2.8 58 35-92 287-346 (543)
17 smart00504 Ubox Modified RING 99.2 1.3E-11 2.9E-16 66.2 3.8 50 40-92 2-51 (63)
18 PF00097 zf-C3HC4: Zinc finger 99.2 1.2E-11 2.7E-16 61.0 3.2 39 42-82 1-41 (41)
19 PF12861 zf-Apc11: Anaphase-pr 99.2 2E-11 4.4E-16 68.7 3.7 50 38-87 20-82 (85)
20 PHA02926 zinc finger-like prot 99.2 1.3E-11 2.8E-16 80.4 2.9 53 35-87 166-230 (242)
21 KOG2164|consensus 99.2 2.1E-11 4.6E-16 87.1 3.4 50 39-91 186-240 (513)
22 TIGR00599 rad18 DNA repair pro 99.2 3.2E-11 6.8E-16 85.1 3.9 54 33-89 20-73 (397)
23 COG5574 PEX10 RING-finger-cont 99.1 6.4E-11 1.4E-15 78.7 3.9 52 37-91 213-266 (271)
24 smart00184 RING Ring finger. E 99.1 1.3E-10 2.8E-15 55.9 3.2 38 42-82 1-39 (39)
25 KOG0287|consensus 99.0 8.4E-11 1.8E-15 80.7 1.0 57 31-90 15-71 (442)
26 PF13445 zf-RING_UBOX: RING-ty 99.0 6.6E-10 1.4E-14 55.4 2.9 34 42-76 1-35 (43)
27 COG5432 RAD18 RING-finger-cont 98.9 7.6E-10 1.6E-14 74.8 2.4 54 32-88 18-71 (391)
28 COG5194 APC11 Component of SCF 98.9 2.2E-09 4.7E-14 59.4 3.3 30 60-89 54-83 (88)
29 PF04564 U-box: U-box domain; 98.8 4.6E-09 1E-13 58.1 3.3 52 38-92 3-55 (73)
30 KOG1734|consensus 98.8 1E-09 2.2E-14 73.4 0.7 55 38-92 223-286 (328)
31 KOG1493|consensus 98.7 3E-09 6.5E-14 58.4 1.0 48 40-87 21-81 (84)
32 KOG4265|consensus 98.7 1.7E-08 3.7E-13 69.8 4.5 52 35-89 286-338 (349)
33 TIGR00570 cdk7 CDK-activating 98.7 1.1E-08 2.3E-13 70.1 3.3 52 39-90 3-57 (309)
34 KOG0828|consensus 98.7 5.3E-09 1.1E-13 75.0 1.7 51 38-88 570-635 (636)
35 smart00744 RINGv The RING-vari 98.7 1.7E-08 3.6E-13 51.7 3.0 42 41-83 1-49 (49)
36 PF11793 FANCL_C: FANCL C-term 98.7 2.1E-09 4.6E-14 59.0 -0.6 51 39-89 2-68 (70)
37 KOG2177|consensus 98.7 8.2E-09 1.8E-13 69.2 1.6 48 34-84 8-55 (386)
38 KOG0804|consensus 98.7 1.3E-08 2.9E-13 72.1 2.6 51 35-87 171-222 (493)
39 COG5219 Uncharacterized conser 98.5 2.3E-08 4.9E-13 76.4 0.8 52 36-87 1466-1523(1525)
40 KOG0311|consensus 98.5 1.8E-08 4E-13 69.6 -0.6 63 31-95 35-98 (381)
41 KOG2930|consensus 98.5 1.4E-07 3E-12 54.7 2.6 51 37-87 44-108 (114)
42 KOG0824|consensus 98.4 1.2E-07 2.5E-12 64.5 1.7 50 38-90 6-56 (324)
43 PF14835 zf-RING_6: zf-RING of 98.4 5.6E-08 1.2E-12 51.9 0.1 51 38-92 6-56 (65)
44 KOG0827|consensus 98.4 1.6E-07 3.4E-12 65.9 1.9 47 40-86 5-55 (465)
45 KOG0978|consensus 98.3 2E-07 4.3E-12 69.7 0.7 58 33-93 637-695 (698)
46 KOG4172|consensus 98.3 1.3E-07 2.8E-12 48.8 -0.2 46 39-87 7-54 (62)
47 KOG1941|consensus 98.3 4.7E-07 1E-11 63.7 2.1 52 35-86 361-415 (518)
48 KOG1039|consensus 98.2 6.7E-07 1.5E-11 62.4 1.9 51 37-87 159-221 (344)
49 KOG0825|consensus 98.1 4.6E-07 9.9E-12 68.4 -0.3 49 39-87 123-171 (1134)
50 KOG3039|consensus 98.1 4.6E-06 1E-10 55.5 3.7 64 37-100 219-283 (303)
51 KOG1785|consensus 98.1 1.6E-06 3.4E-11 61.3 1.6 51 39-92 369-421 (563)
52 KOG4159|consensus 98.1 2.3E-06 5E-11 60.8 2.1 49 37-88 82-130 (398)
53 KOG2660|consensus 98.0 8.2E-07 1.8E-11 61.0 -0.3 55 32-88 8-62 (331)
54 COG5152 Uncharacterized conser 98.0 3.8E-06 8.1E-11 54.4 1.9 49 36-87 193-241 (259)
55 KOG1645|consensus 98.0 6.7E-06 1.5E-10 58.1 3.1 49 38-86 3-55 (463)
56 KOG4445|consensus 98.0 2.1E-06 4.5E-11 58.6 0.4 52 38-89 114-188 (368)
57 PF11789 zf-Nse: Zinc-finger o 97.9 9.6E-06 2.1E-10 42.7 2.8 44 36-81 8-53 (57)
58 KOG2879|consensus 97.9 1.2E-05 2.7E-10 54.2 3.8 52 34-87 234-287 (298)
59 KOG0297|consensus 97.9 5.4E-06 1.2E-10 59.0 1.5 53 36-90 18-70 (391)
60 KOG1813|consensus 97.7 1.2E-05 2.6E-10 54.7 1.3 52 35-89 237-288 (313)
61 PF04641 Rtf2: Rtf2 RING-finge 97.7 5.4E-05 1.2E-09 51.2 4.0 63 36-99 110-173 (260)
62 KOG1428|consensus 97.7 3.3E-05 7.1E-10 62.2 2.9 53 35-87 3482-3544(3738)
63 KOG3970|consensus 97.6 4.8E-05 1E-09 50.3 2.3 60 29-89 40-107 (299)
64 KOG4692|consensus 97.6 7.8E-05 1.7E-09 52.2 3.4 51 35-88 418-468 (489)
65 PF14570 zf-RING_4: RING/Ubox 97.6 7.1E-05 1.5E-09 37.9 2.3 44 42-86 1-47 (48)
66 COG5175 MOT2 Transcriptional r 97.4 9.6E-05 2.1E-09 51.5 2.5 56 36-91 11-68 (480)
67 PF10367 Vps39_2: Vacuolar sor 97.4 6.2E-05 1.3E-09 44.0 1.1 33 37-70 76-108 (109)
68 PF12906 RINGv: RING-variant d 97.4 0.00013 2.8E-09 36.9 2.1 40 42-82 1-47 (47)
69 PF05883 Baculo_RING: Baculovi 97.4 0.00011 2.3E-09 44.9 2.0 37 39-75 26-68 (134)
70 COG5236 Uncharacterized conser 97.3 0.00046 1E-08 48.4 4.5 50 34-86 56-107 (493)
71 KOG4739|consensus 97.2 0.00015 3.2E-09 48.2 1.6 51 41-94 5-55 (233)
72 KOG1002|consensus 97.2 0.00016 3.5E-09 53.1 1.4 52 35-89 532-588 (791)
73 KOG1814|consensus 97.2 0.00021 4.6E-09 50.8 1.8 37 38-74 183-219 (445)
74 PF14447 Prok-RING_4: Prokaryo 97.1 0.00024 5.2E-09 36.9 1.3 46 40-90 8-53 (55)
75 KOG1952|consensus 97.1 0.0002 4.4E-09 54.8 1.5 49 36-84 188-244 (950)
76 KOG4275|consensus 97.1 8.3E-05 1.8E-09 50.8 -0.6 42 39-87 300-342 (350)
77 KOG4185|consensus 97.1 0.00035 7.5E-09 47.9 2.1 47 40-86 4-54 (296)
78 PHA02825 LAP/PHD finger-like p 97.1 0.00064 1.4E-08 42.7 2.9 50 36-89 5-61 (162)
79 KOG1571|consensus 97.0 0.00072 1.6E-08 47.3 3.2 49 33-87 299-347 (355)
80 PF08746 zf-RING-like: RING-li 97.0 0.001 2.2E-08 33.0 2.9 38 42-82 1-43 (43)
81 PHA02862 5L protein; Provision 96.9 0.00079 1.7E-08 41.6 2.6 46 39-88 2-54 (156)
82 PHA03096 p28-like protein; Pro 96.8 0.00065 1.4E-08 46.6 1.5 36 40-75 179-219 (284)
83 COG5222 Uncharacterized conser 96.8 0.0008 1.7E-08 46.4 1.8 43 39-84 274-318 (427)
84 KOG1940|consensus 96.7 0.00078 1.7E-08 45.9 1.4 47 38-84 157-204 (276)
85 KOG3268|consensus 96.7 0.0012 2.7E-08 42.3 2.0 30 59-88 189-229 (234)
86 KOG0826|consensus 96.7 0.0016 3.4E-08 45.3 2.6 50 36-87 297-346 (357)
87 KOG2114|consensus 96.5 0.0013 2.9E-08 50.5 1.7 43 39-86 840-882 (933)
88 KOG0801|consensus 96.5 0.00082 1.8E-08 42.4 0.3 31 36-66 174-204 (205)
89 PF07800 DUF1644: Protein of u 96.5 0.003 6.5E-08 39.6 2.7 51 39-92 2-96 (162)
90 KOG1001|consensus 96.4 0.0011 2.4E-08 50.3 0.9 51 40-94 455-507 (674)
91 PF10272 Tmpp129: Putative tra 96.2 0.0027 5.8E-08 44.9 1.7 31 60-90 311-354 (358)
92 KOG4367|consensus 96.1 0.0038 8.2E-08 45.2 2.1 35 37-74 2-36 (699)
93 PF05290 Baculo_IE-1: Baculovi 96.1 0.0061 1.3E-07 37.2 2.6 57 34-92 75-137 (140)
94 KOG2932|consensus 96.1 0.0026 5.6E-08 44.0 1.1 43 41-87 92-134 (389)
95 PF14446 Prok-RING_1: Prokaryo 96.0 0.011 2.3E-07 30.7 2.9 34 38-71 4-38 (54)
96 KOG3800|consensus 95.8 0.0096 2.1E-07 40.8 2.8 47 41-87 2-51 (300)
97 KOG2817|consensus 95.6 0.013 2.7E-07 41.8 3.0 54 37-90 332-388 (394)
98 KOG0827|consensus 95.5 0.00069 1.5E-08 48.0 -3.5 50 39-88 196-246 (465)
99 KOG2034|consensus 95.5 0.0091 2E-07 46.3 2.0 37 36-73 814-850 (911)
100 KOG1100|consensus 94.7 0.014 3.1E-07 38.4 1.1 41 42-89 161-202 (207)
101 KOG3053|consensus 94.6 0.018 3.8E-07 39.0 1.3 54 34-87 15-82 (293)
102 KOG3899|consensus 94.5 0.018 3.9E-07 39.7 1.2 31 60-90 325-368 (381)
103 PF03854 zf-P11: P-11 zinc fin 94.3 0.024 5.1E-07 28.6 1.1 31 58-88 16-47 (50)
104 COG5220 TFB3 Cdk activating ki 94.2 0.023 4.9E-07 38.2 1.1 48 38-85 9-62 (314)
105 KOG3002|consensus 94.1 0.041 8.8E-07 38.2 2.3 49 34-87 43-91 (299)
106 KOG0298|consensus 94.0 0.016 3.4E-07 46.7 0.1 50 34-85 1148-1197(1394)
107 KOG0309|consensus 93.8 0.039 8.5E-07 42.6 1.8 42 39-81 1028-1069(1081)
108 KOG3113|consensus 93.4 0.11 2.3E-06 35.3 3.1 60 37-98 109-169 (293)
109 KOG1609|consensus 93.1 0.12 2.7E-06 35.4 3.3 50 39-88 78-135 (323)
110 PF02891 zf-MIZ: MIZ/SP-RING z 93.1 0.033 7.1E-07 28.4 0.3 43 40-85 3-50 (50)
111 KOG1812|consensus 93.0 0.037 8.1E-07 39.6 0.6 38 38-75 145-183 (384)
112 KOG0802|consensus 93.0 0.1 2.3E-06 38.9 3.0 56 35-97 475-530 (543)
113 KOG4362|consensus 92.0 0.039 8.5E-07 42.0 -0.3 48 38-88 20-70 (684)
114 KOG1815|consensus 90.8 0.17 3.7E-06 36.9 2.0 38 35-74 66-103 (444)
115 KOG1812|consensus 89.2 0.22 4.9E-06 35.7 1.5 44 39-82 306-351 (384)
116 COG5183 SSM4 Protein involved 88.9 0.49 1.1E-05 37.2 3.1 53 38-91 11-70 (1175)
117 KOG3161|consensus 88.2 0.17 3.7E-06 38.6 0.4 41 38-80 10-51 (861)
118 KOG4718|consensus 88.0 0.29 6.3E-06 32.4 1.3 46 38-86 180-226 (235)
119 PF14569 zf-UDP: Zinc-binding 87.9 1.4 3.1E-05 24.5 3.7 51 38-88 8-63 (80)
120 KOG1829|consensus 87.6 0.2 4.2E-06 37.8 0.4 23 59-84 536-558 (580)
121 KOG3579|consensus 86.6 0.43 9.3E-06 33.0 1.5 40 36-76 265-306 (352)
122 PF06906 DUF1272: Protein of u 86.3 1.3 2.8E-05 23.1 2.9 47 40-89 6-54 (57)
123 COG5109 Uncharacterized conser 85.9 0.78 1.7E-05 32.3 2.5 52 36-87 333-387 (396)
124 KOG0269|consensus 85.6 0.9 1.9E-05 35.4 2.9 41 40-81 780-820 (839)
125 KOG0825|consensus 85.5 0.93 2E-05 35.6 2.9 52 38-89 95-156 (1134)
126 KOG2807|consensus 85.1 0.83 1.8E-05 32.2 2.3 47 38-84 329-375 (378)
127 PLN02400 cellulose synthase 84.7 2.7 5.8E-05 34.2 5.2 50 38-87 35-89 (1085)
128 KOG2068|consensus 84.5 1.2 2.7E-05 31.3 3.0 51 38-88 248-299 (327)
129 PF10571 UPF0547: Uncharacteri 84.5 0.5 1.1E-05 20.7 0.7 9 41-49 2-10 (26)
130 PF01363 FYVE: FYVE zinc finge 84.4 0.46 1E-05 25.4 0.7 36 37-72 7-43 (69)
131 PF13901 DUF4206: Domain of un 84.4 0.74 1.6E-05 30.1 1.8 41 38-84 151-197 (202)
132 PF07191 zinc-ribbons_6: zinc- 84.2 0.082 1.8E-06 28.9 -2.3 40 40-87 2-41 (70)
133 PF06844 DUF1244: Protein of u 82.2 0.92 2E-05 24.4 1.3 12 63-74 11-22 (68)
134 PF07975 C1_4: TFIIH C1-like d 81.9 0.86 1.9E-05 23.3 1.1 24 60-83 27-50 (51)
135 smart00132 LIM Zinc-binding do 81.5 1.4 3E-05 20.1 1.7 37 41-86 1-37 (39)
136 PF10497 zf-4CXXC_R1: Zinc-fin 81.4 2.2 4.7E-05 25.1 2.8 25 61-85 37-70 (105)
137 PF00628 PHD: PHD-finger; Int 80.8 0.78 1.7E-05 22.8 0.7 43 41-84 1-50 (51)
138 smart00249 PHD PHD zinc finger 80.1 1.2 2.7E-05 21.1 1.3 29 42-71 2-31 (47)
139 PLN02638 cellulose synthase A 79.1 3.5 7.5E-05 33.6 4.0 50 38-87 16-70 (1079)
140 PLN02189 cellulose synthase 78.6 3.7 7.9E-05 33.3 4.0 50 38-87 33-87 (1040)
141 KOG3842|consensus 77.5 3.3 7.1E-05 29.3 3.1 52 36-87 338-414 (429)
142 PLN02436 cellulose synthase A 77.1 4.2 9.1E-05 33.1 4.0 50 38-87 35-89 (1094)
143 KOG2066|consensus 76.6 1.1 2.4E-05 35.1 0.7 42 40-82 785-830 (846)
144 smart00064 FYVE Protein presen 76.4 2.7 5.8E-05 22.2 2.0 35 39-73 10-45 (68)
145 PF00412 LIM: LIM domain; Int 76.3 1.9 4.1E-05 21.8 1.4 40 42-90 1-40 (58)
146 KOG3005|consensus 76.2 4.3 9.2E-05 27.9 3.3 49 39-87 182-243 (276)
147 cd00065 FYVE FYVE domain; Zinc 75.6 2.2 4.8E-05 21.6 1.5 34 40-73 3-37 (57)
148 PF04423 Rad50_zn_hook: Rad50 75.2 0.81 1.8E-05 23.4 -0.2 13 77-89 21-33 (54)
149 TIGR00622 ssl1 transcription f 74.6 4.6 9.9E-05 24.1 2.8 45 39-83 55-110 (112)
150 PF04710 Pellino: Pellino; In 71.6 1.3 2.8E-05 32.1 0.0 47 36-85 274-337 (416)
151 COG3813 Uncharacterized protei 71.5 3.2 7E-05 22.8 1.6 46 42-90 8-55 (84)
152 KOG3039|consensus 70.8 3.7 8.1E-05 28.0 2.0 32 39-73 43-74 (303)
153 KOG3799|consensus 70.5 0.91 2E-05 28.0 -0.8 50 37-86 63-117 (169)
154 COG4847 Uncharacterized protei 67.8 7.9 0.00017 22.5 2.7 35 39-74 6-40 (103)
155 PRK11088 rrmA 23S rRNA methylt 66.9 4.5 9.7E-05 27.4 1.9 25 40-64 3-27 (272)
156 PF13719 zinc_ribbon_5: zinc-r 66.6 3.9 8.5E-05 19.2 1.1 14 40-53 3-16 (37)
157 PF05605 zf-Di19: Drought indu 66.3 2.9 6.2E-05 21.3 0.7 11 39-49 2-12 (54)
158 PLN02195 cellulose synthase A 66.2 10 0.00022 30.7 3.8 50 38-87 5-59 (977)
159 PF13832 zf-HC5HC2H_2: PHD-zin 65.8 5.1 0.00011 23.2 1.8 34 37-72 53-88 (110)
160 PF04216 FdhE: Protein involve 64.1 0.94 2E-05 31.2 -1.8 46 37-85 170-220 (290)
161 PF10146 zf-C4H2: Zinc finger- 64.1 6.5 0.00014 26.4 2.2 27 64-90 196-222 (230)
162 KOG4451|consensus 64.0 6.1 0.00013 26.7 2.0 29 63-91 250-278 (286)
163 COG3492 Uncharacterized protei 62.9 4.5 9.7E-05 23.3 1.1 12 63-74 42-53 (104)
164 PLN02915 cellulose synthase A 62.4 19 0.00041 29.5 4.7 50 38-87 14-68 (1044)
165 PF13771 zf-HC5HC2H: PHD-like 61.7 5.4 0.00012 22.2 1.3 37 35-71 32-68 (90)
166 PF09943 DUF2175: Uncharacteri 60.1 10 0.00022 22.2 2.2 33 40-73 3-35 (101)
167 PLN02248 cellulose synthase-li 59.6 10 0.00023 31.1 2.9 33 60-92 150-182 (1135)
168 PF07649 C1_3: C1-like domain; 57.1 9.3 0.0002 16.8 1.4 29 41-69 2-30 (30)
169 PF06676 DUF1178: Protein of u 55.8 3.4 7.4E-05 25.9 -0.2 24 60-88 10-44 (148)
170 PF14169 YdjO: Cold-inducible 55.3 6 0.00013 20.9 0.7 15 75-89 38-52 (59)
171 PF13717 zinc_ribbon_4: zinc-r 55.0 6.5 0.00014 18.4 0.7 13 40-52 3-15 (36)
172 KOG2113|consensus 54.2 12 0.00025 26.6 2.1 43 39-86 343-386 (394)
173 KOG0824|consensus 53.2 4.3 9.2E-05 28.4 -0.1 47 38-86 104-150 (324)
174 PF14311 DUF4379: Domain of un 53.0 8.8 0.00019 19.5 1.1 22 60-82 34-55 (55)
175 PF10235 Cript: Microtubule-as 52.8 8.9 0.00019 22.0 1.2 39 39-89 44-82 (90)
176 PRK00418 DNA gyrase inhibitor; 52.7 9.9 0.00022 20.2 1.3 9 78-86 8-16 (62)
177 PF03119 DNA_ligase_ZBD: NAD-d 52.0 7.4 0.00016 17.2 0.6 10 78-87 1-10 (28)
178 PF02318 FYVE_2: FYVE-type zin 51.8 10 0.00022 22.6 1.4 46 38-84 53-102 (118)
179 smart00647 IBR In Between Ring 51.0 4.6 9.9E-05 20.7 -0.2 18 55-72 40-58 (64)
180 PF09889 DUF2116: Uncharacteri 50.7 9 0.00019 20.2 0.9 15 75-89 2-16 (59)
181 cd04718 BAH_plant_2 BAH, or Br 50.6 4.5 9.7E-05 25.4 -0.3 29 64-92 2-34 (148)
182 KOG2231|consensus 50.4 19 0.00042 28.0 2.9 44 41-87 2-52 (669)
183 KOG4218|consensus 50.3 18 0.0004 26.0 2.6 25 35-60 11-35 (475)
184 cd00350 rubredoxin_like Rubred 50.1 13 0.00028 16.8 1.3 8 77-84 18-25 (33)
185 PRK03564 formate dehydrogenase 49.1 9.7 0.00021 26.8 1.1 47 36-84 184-234 (309)
186 PF05715 zf-piccolo: Piccolo Z 48.5 13 0.00029 19.6 1.3 9 78-86 4-12 (61)
187 PF03884 DUF329: Domain of unk 46.3 9.8 0.00021 19.9 0.6 10 78-87 4-13 (57)
188 smart00734 ZnF_Rad18 Rad18-lik 45.3 9.2 0.0002 16.5 0.4 9 78-86 3-11 (26)
189 PRK11595 DNA utilization prote 44.0 24 0.00052 23.4 2.4 39 40-86 6-44 (227)
190 PF06937 EURL: EURL protein; 43.5 28 0.0006 24.1 2.6 44 37-80 28-74 (285)
191 PF00130 C1_1: Phorbol esters/ 43.2 20 0.00043 17.7 1.5 35 37-71 9-45 (53)
192 PF14353 CpXC: CpXC protein 41.9 33 0.00071 20.5 2.6 11 78-88 40-50 (128)
193 PF09237 GAGA: GAGA factor; I 40.4 10 0.00022 19.5 0.1 10 78-87 26-35 (54)
194 TIGR01562 FdhE formate dehydro 39.1 6.5 0.00014 27.6 -0.9 44 37-85 182-233 (305)
195 KOG1729|consensus 38.6 4.7 0.0001 28.0 -1.7 36 40-75 215-250 (288)
196 KOG2979|consensus 38.5 18 0.00039 24.8 1.1 44 39-84 176-221 (262)
197 COG5319 Uncharacterized protei 38.3 11 0.00024 23.1 0.1 10 78-87 34-43 (142)
198 PF09723 Zn-ribbon_8: Zinc rib 37.8 4.6 0.0001 19.5 -1.3 9 76-84 26-34 (42)
199 PF09538 FYDLN_acid: Protein o 37.7 17 0.00037 21.5 0.8 12 78-89 28-39 (108)
200 COG4098 comFA Superfamily II D 36.8 10 0.00022 27.5 -0.3 38 32-71 32-69 (441)
201 PF13913 zf-C2HC_2: zinc-finge 36.6 11 0.00023 16.1 -0.1 11 78-88 4-14 (25)
202 PF11023 DUF2614: Protein of u 35.9 38 0.00081 20.3 2.1 20 71-90 80-99 (114)
203 PRK01343 zinc-binding protein; 35.9 18 0.0004 18.9 0.7 12 76-87 9-20 (57)
204 PF07282 OrfB_Zn_ribbon: Putat 35.5 45 0.00098 17.4 2.3 35 33-67 22-59 (69)
205 smart00109 C1 Protein kinase C 35.4 33 0.00071 16.2 1.6 34 38-71 10-44 (49)
206 KOG2071|consensus 35.0 26 0.00056 26.9 1.5 36 37-72 511-556 (579)
207 PF01485 IBR: IBR domain; Int 34.7 2.8 6E-05 21.5 -2.6 33 40-72 19-58 (64)
208 PF12660 zf-TFIIIC: Putative z 34.2 7.1 0.00015 22.6 -1.2 50 40-90 15-69 (99)
209 cd00029 C1 Protein kinase C co 34.1 26 0.00057 16.8 1.1 34 38-71 10-45 (50)
210 KOG1245|consensus 34.0 14 0.00029 31.3 0.0 51 36-86 1105-1159(1404)
211 COG4647 AcxC Acetone carboxyla 33.3 28 0.0006 21.5 1.2 26 43-72 61-86 (165)
212 PF09297 zf-NADH-PPase: NADH p 31.8 4.7 0.0001 18.2 -1.8 6 78-83 23-28 (32)
213 PF10083 DUF2321: Uncharacteri 31.7 27 0.00059 22.1 1.1 26 61-89 27-52 (158)
214 PF07227 DUF1423: Protein of u 30.8 34 0.00073 25.4 1.5 31 41-72 130-164 (446)
215 PF06677 Auto_anti-p27: Sjogre 30.7 33 0.00072 16.6 1.1 19 70-88 11-29 (41)
216 PTZ00303 phosphatidylinositol 30.3 37 0.00081 27.5 1.8 33 40-72 461-499 (1374)
217 COG3024 Uncharacterized protei 29.4 36 0.00078 18.2 1.1 10 78-87 9-18 (65)
218 KOG1842|consensus 29.3 19 0.00042 26.7 0.1 38 34-71 175-213 (505)
219 TIGR00373 conserved hypothetic 29.2 55 0.0012 20.5 2.2 17 74-90 126-142 (158)
220 KOG1512|consensus 28.7 23 0.00051 24.8 0.5 32 39-70 314-345 (381)
221 COG5627 MMS21 DNA repair prote 28.7 28 0.00061 23.7 0.8 43 37-81 187-231 (275)
222 KOG4577|consensus 28.6 16 0.00035 25.6 -0.3 31 39-70 92-122 (383)
223 PF12088 DUF3565: Protein of u 28.3 47 0.001 17.6 1.5 15 53-67 10-24 (61)
224 PF15616 TerY-C: TerY-C metal 28.2 51 0.0011 20.3 1.8 46 35-89 73-118 (131)
225 TIGR02300 FYDLN_acid conserved 28.0 56 0.0012 20.1 1.9 14 76-89 26-39 (129)
226 PF15353 HECA: Headcase protei 27.4 47 0.001 19.7 1.5 15 59-73 39-53 (107)
227 PF13894 zf-C2H2_4: C2H2-type 27.4 25 0.00054 13.6 0.3 9 79-87 3-11 (24)
228 COG4068 Uncharacterized protei 27.0 36 0.00077 18.0 0.9 16 75-90 7-22 (64)
229 COG4306 Uncharacterized protei 26.9 48 0.0011 20.4 1.5 23 63-88 29-51 (160)
230 PF06750 DiS_P_DiS: Bacterial 26.8 51 0.0011 18.7 1.6 38 39-88 33-70 (92)
231 TIGR00686 phnA alkylphosphonat 26.8 38 0.00081 20.2 1.0 11 40-50 3-13 (109)
232 TIGR02652 conserved hypothetic 26.7 23 0.00051 22.1 0.2 16 74-89 7-22 (163)
233 PRK13130 H/ACA RNA-protein com 26.5 62 0.0013 16.8 1.7 21 76-96 17-37 (56)
234 cd00729 rubredoxin_SM Rubredox 26.4 35 0.00075 15.6 0.7 7 78-84 20-26 (34)
235 COG2835 Uncharacterized conser 26.3 27 0.00059 18.5 0.3 10 78-87 10-19 (60)
236 TIGR01367 pyrE_Therm orotate p 26.1 33 0.00071 22.1 0.8 16 76-91 169-184 (187)
237 PF00096 zf-C2H2: Zinc finger, 26.1 25 0.00053 13.9 0.1 9 79-87 3-11 (23)
238 PRK06266 transcription initiat 25.8 74 0.0016 20.4 2.3 17 74-90 134-150 (178)
239 PF13453 zf-TFIIB: Transcripti 25.8 29 0.00063 16.4 0.4 11 78-88 1-11 (41)
240 PF12773 DZR: Double zinc ribb 25.1 35 0.00075 16.6 0.6 13 76-88 29-41 (50)
241 PF13764 E3_UbLigase_R4: E3 ub 25.0 20 0.00043 28.6 -0.4 15 35-49 464-478 (802)
242 PF08882 Acetone_carb_G: Aceto 24.8 48 0.001 19.8 1.2 12 55-66 24-35 (112)
243 KOG3726|consensus 23.9 42 0.0009 26.4 1.1 41 40-83 655-696 (717)
244 PF10013 DUF2256: Uncharacteri 22.6 44 0.00095 16.3 0.6 12 76-87 8-19 (42)
245 KOG1818|consensus 22.5 38 0.00081 26.3 0.6 47 36-82 162-217 (634)
246 PRK00398 rpoP DNA-directed RNA 22.4 41 0.00088 16.3 0.5 13 77-89 22-34 (46)
247 PF07503 zf-HYPF: HypF finger; 22.4 70 0.0015 14.8 1.3 6 78-83 23-28 (35)
248 PRK10220 hypothetical protein; 22.3 65 0.0014 19.2 1.4 11 40-50 4-14 (111)
249 KOG1244|consensus 21.8 21 0.00046 24.8 -0.7 47 40-86 282-332 (336)
250 KOG4021|consensus 21.4 47 0.001 22.0 0.8 20 67-86 98-118 (239)
251 COG0068 HypF Hydrogenase matur 21.3 92 0.002 24.8 2.4 18 36-53 98-115 (750)
252 COG4357 Zinc finger domain con 20.6 81 0.0018 18.4 1.6 13 76-88 80-92 (105)
253 KOG2169|consensus 20.6 64 0.0014 25.1 1.5 48 39-91 306-360 (636)
254 KOG1819|consensus 20.6 41 0.00088 25.7 0.4 35 36-70 898-933 (990)
255 PRK11827 hypothetical protein; 20.2 31 0.00066 18.2 -0.2 13 76-88 8-20 (60)
256 PF05191 ADK_lid: Adenylate ki 20.2 31 0.00066 16.1 -0.2 27 59-87 6-32 (36)
257 cd00730 rubredoxin Rubredoxin; 20.0 46 0.00099 16.8 0.4 11 40-50 2-12 (50)
No 1
>PF13639 zf-RING_2: Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=99.73 E-value=2.3e-18 Score=86.79 Aligned_cols=43 Identities=47% Similarity=1.234 Sum_probs=39.8
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICR 83 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr 83 (101)
.|+||++.+..++.++.++|+|.||.+||..|++.+.+||+||
T Consensus 2 ~C~IC~~~~~~~~~~~~l~C~H~fh~~Ci~~~~~~~~~CP~CR 44 (44)
T PF13639_consen 2 ECPICLEEFEDGEKVVKLPCGHVFHRSCIKEWLKRNNSCPVCR 44 (44)
T ss_dssp CETTTTCBHHTTSCEEEETTSEEEEHHHHHHHHHHSSB-TTTH
T ss_pred CCcCCChhhcCCCeEEEccCCCeeCHHHHHHHHHhCCcCCccC
Confidence 6999999998888889999999999999999999999999997
No 2
>KOG4628|consensus
Probab=99.62 E-value=4.4e-16 Score=107.24 Aligned_cols=52 Identities=35% Similarity=1.027 Sum_probs=46.2
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCC-cccccccccccCCc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHG-TCPICRQTLHSESA 91 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~-~CP~Cr~~~~~~~~ 91 (101)
..|+||+|+|..++.+++|||+|.||..||..|+...+ .||+|++.+.....
T Consensus 230 ~~CaIClEdY~~GdklRiLPC~H~FH~~CIDpWL~~~r~~CPvCK~di~~~~~ 282 (348)
T KOG4628|consen 230 DTCAICLEDYEKGDKLRILPCSHKFHVNCIDPWLTQTRTFCPVCKRDIRTDSG 282 (348)
T ss_pred ceEEEeecccccCCeeeEecCCCchhhccchhhHhhcCccCCCCCCcCCCCCC
Confidence 59999999999999999999999999999999998765 59999998764433
No 3
>PF12678 zf-rbx1: RING-H2 zinc finger; InterPro: IPR024766 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This domain constitutes a conserved region found in proteins that participate in diverse functions relevant to chromosome metabolism and cell cycle control [].The domain contains 8 cysteine/ histidine residues which are proposed to be the conserved residues involved in zinc binding.; PDB: 4A0C_D 4A0L_I 4A0K_B 2ECL_A 1LDK_C 3RTR_F 3DQV_Y 1U6G_B 1LDJ_B 2HYE_D ....
Probab=99.54 E-value=9.9e-15 Score=80.90 Aligned_cols=45 Identities=36% Similarity=0.928 Sum_probs=35.6
Q ss_pred ccccccccccccCC----------CceEEeCCCCcccHhHHHHHHhcCCcccccc
Q psy8622 39 NLQCSVCWEQFTLD----------EAVRKLPCDHFYHTPCIEPWLQLHGTCPICR 83 (101)
Q Consensus 39 ~~~C~IC~~~~~~~----------~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr 83 (101)
+..|+||++.+... -.+...+|+|.||..||.+|+..+.+||+||
T Consensus 19 ~d~C~IC~~~l~~~~~~~~~~~~~~~i~~~~C~H~FH~~Ci~~Wl~~~~~CP~CR 73 (73)
T PF12678_consen 19 DDNCAICREPLEDPCPECQAPQDECPIVWGPCGHIFHFHCISQWLKQNNTCPLCR 73 (73)
T ss_dssp CSBETTTTSBTTSTTCCHHHCTTTS-EEEETTSEEEEHHHHHHHHTTSSB-TTSS
T ss_pred CCcccccChhhhChhhhhcCCccccceEecccCCCEEHHHHHHHHhcCCcCCCCC
Confidence 44599999998322 2345567999999999999999999999997
No 4
>KOG0317|consensus
Probab=99.53 E-value=4.9e-15 Score=99.10 Aligned_cols=59 Identities=24% Similarity=0.646 Sum_probs=49.4
Q ss_pred cccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCCccc
Q psy8622 32 QVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAES 93 (101)
Q Consensus 32 ~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~ 93 (101)
.....+....|.+|++....+ ..+||||+||+.||..|...+..||+||..+......-
T Consensus 232 ~~~i~~a~~kC~LCLe~~~~p---SaTpCGHiFCWsCI~~w~~ek~eCPlCR~~~~pskvi~ 290 (293)
T KOG0317|consen 232 LSSIPEATRKCSLCLENRSNP---SATPCGHIFCWSCILEWCSEKAECPLCREKFQPSKVIC 290 (293)
T ss_pred CccCCCCCCceEEEecCCCCC---CcCcCcchHHHHHHHHHHccccCCCcccccCCCcceee
Confidence 334456678999999987544 88999999999999999999999999999998776543
No 5
>PLN03208 E3 ubiquitin-protein ligase RMA2; Provisional
Probab=99.49 E-value=3.6e-14 Score=90.85 Aligned_cols=58 Identities=21% Similarity=0.655 Sum_probs=45.6
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc----------------CCcccccccccccCCcccC
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL----------------HGTCPICRQTLHSESAESL 94 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~----------------~~~CP~Cr~~~~~~~~~~~ 94 (101)
....++..|+||++.+..+ ++++|||.||+.||..|+.. ...||+||..+...+....
T Consensus 13 ~~~~~~~~CpICld~~~dP---VvT~CGH~FC~~CI~~wl~~s~~s~~~~~~~~~~k~~~~CPvCR~~Is~~~LvPi 86 (193)
T PLN03208 13 VDSGGDFDCNICLDQVRDP---VVTLCGHLFCWPCIHKWTYASNNSRQRVDQYDHKREPPKCPVCKSDVSEATLVPI 86 (193)
T ss_pred ccCCCccCCccCCCcCCCc---EEcCCCchhHHHHHHHHHHhccccccccccccccCCCCcCCCCCCcCChhcEEEe
Confidence 3444578999999988544 77899999999999999742 2479999999976665543
No 6
>PHA02929 N1R/p28-like protein; Provisional
Probab=99.49 E-value=2.4e-14 Score=94.76 Aligned_cols=51 Identities=27% Similarity=0.726 Sum_probs=42.1
Q ss_pred CCccccccccccccCCCc-----eEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 37 DSNLQCSVCWEQFTLDEA-----VRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~-----~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
..+..|+||++.+..++. .+.++|+|.||..||.+|+..+.+||+||..+.
T Consensus 172 ~~~~eC~ICle~~~~~~~~~~~~~vl~~C~H~FC~~CI~~Wl~~~~tCPlCR~~~~ 227 (238)
T PHA02929 172 SKDKECAICMEKVYDKEIKNMYFGILSNCNHVFCIECIDIWKKEKNTCPVCRTPFI 227 (238)
T ss_pred CCCCCCccCCcccccCccccccceecCCCCCcccHHHHHHHHhcCCCCCCCCCEee
Confidence 446799999998764331 244579999999999999999999999999875
No 7
>KOG0823|consensus
Probab=99.47 E-value=8.8e-14 Score=90.70 Aligned_cols=61 Identities=25% Similarity=0.646 Sum_probs=50.0
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc---CCcccccccccccCCcccCCCCC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL---HGTCPICRQTLHSESAESLGPAG 98 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~~~~~~~~~~~~ 98 (101)
.+...+.|.|||+.-+++ +++.|||.||+.||.+|+.. .+.||+|+..|..+..+++.+-+
T Consensus 43 ~~~~~FdCNICLd~akdP---VvTlCGHLFCWpClyqWl~~~~~~~~cPVCK~~Vs~~~vvPlYGrG 106 (230)
T KOG0823|consen 43 RDGGFFDCNICLDLAKDP---VVTLCGHLFCWPCLYQWLQTRPNSKECPVCKAEVSIDTVVPLYGRG 106 (230)
T ss_pred CCCCceeeeeeccccCCC---EEeecccceehHHHHHHHhhcCCCeeCCccccccccceEEeeeccC
Confidence 366788999999986544 77889999999999999974 45799999999888877765543
No 8
>COG5243 HRD1 HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones]
Probab=99.44 E-value=7.7e-14 Score=96.40 Aligned_cols=53 Identities=34% Similarity=0.938 Sum_probs=44.6
Q ss_pred CCCCccccccccccc-cCC---------CceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 35 TDDSNLQCSVCWEQF-TLD---------EAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~-~~~---------~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
....+..|.||++++ ..+ ..+..|||||+||..|+..|++++++||+||.++.
T Consensus 283 l~n~D~~C~ICmde~~h~~~~~~~~~~~~~pKrLpCGHilHl~CLknW~ERqQTCPICr~p~i 345 (491)
T COG5243 283 LTNSDRTCTICMDEMFHPDHEPLPRGLDMTPKRLPCGHILHLHCLKNWLERQQTCPICRRPVI 345 (491)
T ss_pred hcCCCCeEEEecccccCCCCccCcccccCCcccccccceeeHHHHHHHHHhccCCCcccCccc
Confidence 366788999999984 333 14578999999999999999999999999999953
No 9
>PF13920 zf-C3HC4_3: Zinc finger, C3HC4 type (RING finger); PDB: 2YHN_B 2YHO_G 3T6P_A 2CSY_A 2VJE_B 2VJF_B 2HDP_B 2EA5_A 2ECG_A 3EB5_A ....
Probab=99.44 E-value=1.4e-13 Score=71.00 Aligned_cols=46 Identities=37% Similarity=0.911 Sum_probs=39.0
Q ss_pred ccccccccccccCCCceEEeCCCCc-ccHhHHHHHHhcCCcccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHF-YHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~-f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
+..|.||++... .+..+||||. ||..|+.+|+.....||+||+++.
T Consensus 2 ~~~C~iC~~~~~---~~~~~pCgH~~~C~~C~~~~~~~~~~CP~Cr~~i~ 48 (50)
T PF13920_consen 2 DEECPICFENPR---DVVLLPCGHLCFCEECAERLLKRKKKCPICRQPIE 48 (50)
T ss_dssp HSB-TTTSSSBS---SEEEETTCEEEEEHHHHHHHHHTTSBBTTTTBB-S
T ss_pred cCCCccCCccCC---ceEEeCCCChHHHHHHhHHhcccCCCCCcCChhhc
Confidence 468999999863 4588899999 999999999999999999999875
No 10
>COG5540 RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=99.42 E-value=1.5e-13 Score=92.80 Aligned_cols=53 Identities=34% Similarity=1.029 Sum_probs=47.3
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHh-cCCccccccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ-LHGTCPICRQTLHS 88 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~~~ 88 (101)
......|+||++.|...+.+++|||.|.||..|+.+|+. .+..||+||.++++
T Consensus 320 a~~GveCaICms~fiK~d~~~vlPC~H~FH~~Cv~kW~~~y~~~CPvCrt~iPP 373 (374)
T COG5540 320 ADKGVECAICMSNFIKNDRLRVLPCDHRFHVGCVDKWLLGYSNKCPVCRTAIPP 373 (374)
T ss_pred cCCCceEEEEhhhhcccceEEEeccCceechhHHHHHHhhhcccCCccCCCCCC
Confidence 445689999999998888899999999999999999997 67789999998864
No 11
>PF13923 zf-C3HC4_2: Zinc finger, C3HC4 type (RING finger); PDB: 3HCU_A 2ECI_A 2JMD_A 3HCS_B 3HCT_A 3ZTG_A 2YUR_A 3L11_A.
Probab=99.41 E-value=2.4e-13 Score=66.64 Aligned_cols=39 Identities=33% Similarity=1.014 Sum_probs=32.9
Q ss_pred cccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~C 82 (101)
|+||++.+. +.++.++|||.||..|+.+|++.+..||+|
T Consensus 1 C~iC~~~~~--~~~~~~~CGH~fC~~C~~~~~~~~~~CP~C 39 (39)
T PF13923_consen 1 CPICLDELR--DPVVVTPCGHSFCKECIEKYLEKNPKCPVC 39 (39)
T ss_dssp ETTTTSB-S--SEEEECTTSEEEEHHHHHHHHHCTSB-TTT
T ss_pred CCCCCCccc--CcCEECCCCCchhHHHHHHHHHCcCCCcCC
Confidence 899999875 345788999999999999999998899987
No 12
>PF15227 zf-C3HC4_4: zinc finger of C3HC4-type, RING; PDB: 2EGP_A 2ECV_A 2ECJ_A 2YSL_A 2YSJ_A.
Probab=99.34 E-value=8.8e-13 Score=65.62 Aligned_cols=38 Identities=32% Similarity=0.835 Sum_probs=28.8
Q ss_pred cccccccccCCCceEEeCCCCcccHhHHHHHHhcC----Cccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLH----GTCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~----~~CP~C 82 (101)
|+||++.|..+ +.|+|||+||..||.+|++.. ..||.|
T Consensus 1 CpiC~~~~~~P---v~l~CGH~FC~~Cl~~~~~~~~~~~~~CP~C 42 (42)
T PF15227_consen 1 CPICLDLFKDP---VSLPCGHSFCRSCLERLWKEPSGSGFSCPEC 42 (42)
T ss_dssp ETTTTSB-SSE---EE-SSSSEEEHHHHHHHHCCSSSST---SSS
T ss_pred CCccchhhCCc---cccCCcCHHHHHHHHHHHHccCCcCCCCcCC
Confidence 89999999655 889999999999999998653 369987
No 13
>KOG0320|consensus
Probab=99.30 E-value=2.8e-12 Score=80.75 Aligned_cols=55 Identities=20% Similarity=0.642 Sum_probs=45.1
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
..+..+.|||||+.+... .++.+.|||+||..||...+.....||+||+.+....
T Consensus 127 ~~~~~~~CPiCl~~~sek-~~vsTkCGHvFC~~Cik~alk~~~~CP~C~kkIt~k~ 181 (187)
T KOG0320|consen 127 RKEGTYKCPICLDSVSEK-VPVSTKCGHVFCSQCIKDALKNTNKCPTCRKKITHKQ 181 (187)
T ss_pred ccccccCCCceecchhhc-cccccccchhHHHHHHHHHHHhCCCCCCcccccchhh
Confidence 345568999999998633 2355789999999999999999999999999876543
No 14
>cd00162 RING RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H)
Probab=99.30 E-value=2.7e-12 Score=63.90 Aligned_cols=44 Identities=43% Similarity=1.179 Sum_probs=36.1
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhc-CCccccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTL 86 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~ 86 (101)
.|+||++.+. ......+|+|.||..|+..|+.. ...||.||..+
T Consensus 1 ~C~iC~~~~~--~~~~~~~C~H~~c~~C~~~~~~~~~~~Cp~C~~~~ 45 (45)
T cd00162 1 ECPICLEEFR--EPVVLLPCGHVFCRSCIDKWLKSGKNTCPLCRTPI 45 (45)
T ss_pred CCCcCchhhh--CceEecCCCChhcHHHHHHHHHhCcCCCCCCCCcC
Confidence 4899999883 34455569999999999999987 77899998764
No 15
>PF14634 zf-RING_5: zinc-RING finger domain
Probab=99.28 E-value=3.2e-12 Score=64.23 Aligned_cols=44 Identities=27% Similarity=0.842 Sum_probs=37.9
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
.|+||++.+.......+++|||+||..|+..+......||+||+
T Consensus 1 ~C~~C~~~~~~~~~~~l~~CgH~~C~~C~~~~~~~~~~CP~C~k 44 (44)
T PF14634_consen 1 HCNICFEKYSEERRPRLTSCGHIFCEKCLKKLKGKSVKCPICRK 44 (44)
T ss_pred CCcCcCccccCCCCeEEcccCCHHHHHHHHhhcCCCCCCcCCCC
Confidence 48999999955567888999999999999998766778999985
No 16
>KOG0802|consensus
Probab=99.26 E-value=2.5e-12 Score=94.14 Aligned_cols=58 Identities=29% Similarity=0.745 Sum_probs=47.8
Q ss_pred CCCCccccccccccccCCCc--eEEeCCCCcccHhHHHHHHhcCCcccccccccccCCcc
Q psy8622 35 TDDSNLQCSVCWEQFTLDEA--VRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAE 92 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~--~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~ 92 (101)
....+..|+||++.+..... +.+++|+|+||..|+..|+++.++||.||..+......
T Consensus 287 ~~~~~~~C~IC~e~l~~~~~~~~~rL~C~Hifh~~CL~~W~er~qtCP~CR~~~~~~~~~ 346 (543)
T KOG0802|consen 287 LALSDELCIICLEELHSGHNITPKRLPCGHIFHDSCLRSWFERQQTCPTCRTVLYDYVLW 346 (543)
T ss_pred hhhcCCeeeeechhhccccccccceeecccchHHHHHHHHHHHhCcCCcchhhhhccccc
Confidence 44557899999999875432 57899999999999999999999999999966554443
No 17
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=99.23 E-value=1.3e-11 Score=66.18 Aligned_cols=50 Identities=32% Similarity=0.516 Sum_probs=43.0
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCCcc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAE 92 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~ 92 (101)
..|+||.+.+..+ +.++|||+|+..||.+|+..+..||+|+..+..++..
T Consensus 2 ~~Cpi~~~~~~~P---v~~~~G~v~~~~~i~~~~~~~~~cP~~~~~~~~~~l~ 51 (63)
T smart00504 2 FLCPISLEVMKDP---VILPSGQTYERRAIEKWLLSHGTDPVTGQPLTHEDLI 51 (63)
T ss_pred cCCcCCCCcCCCC---EECCCCCEEeHHHHHHHHHHCCCCCCCcCCCChhhce
Confidence 5799999998654 7789999999999999998888999999988655443
No 18
>PF00097 zf-C3HC4: Zinc finger, C3HC4 type (RING finger); InterPro: IPR018957 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid []. Many proteins containing a RING finger play a key role in the ubiquitination pathway []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1CHC_A 2ECW_A 2Y43_B 1V87_A 2DJB_A 2H0D_B 3RPG_C 3KNV_A 2CKL_B 1JM7_A ....
Probab=99.22 E-value=1.2e-11 Score=61.02 Aligned_cols=39 Identities=38% Similarity=1.018 Sum_probs=32.9
Q ss_pred cccccccccCCCceEEeCCCCcccHhHHHHHHh--cCCccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ--LHGTCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~--~~~~CP~C 82 (101)
|+||++.+..+ ...++|+|.||..|+.+|+. ....||+|
T Consensus 1 C~iC~~~~~~~--~~~~~C~H~fC~~C~~~~~~~~~~~~CP~C 41 (41)
T PF00097_consen 1 CPICLEPFEDP--VILLPCGHSFCRDCLRKWLENSGSVKCPLC 41 (41)
T ss_dssp ETTTSSBCSSE--EEETTTSEEEEHHHHHHHHHHTSSSBTTTT
T ss_pred CCcCCccccCC--CEEecCCCcchHHHHHHHHHhcCCccCCcC
Confidence 89999988533 35889999999999999998 45689987
No 19
>PF12861 zf-Apc11: Anaphase-promoting complex subunit 11 RING-H2 finger
Probab=99.20 E-value=2e-11 Score=68.71 Aligned_cols=50 Identities=32% Similarity=0.768 Sum_probs=37.8
Q ss_pred CccccccccccccCC--------C--ceEEeCCCCcccHhHHHHHHhc---CCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLD--------E--AVRKLPCDHFYHTPCIEPWLQL---HGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~--------~--~~~~l~C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~ 87 (101)
.+..|+||...|... + +++.-.|+|.||..||.+|+.. +..||+||++..
T Consensus 20 ~dd~CgICr~~fdg~Cp~Ck~Pgd~Cplv~g~C~H~FH~hCI~kWl~~~~~~~~CPmCR~~w~ 82 (85)
T PF12861_consen 20 NDDVCGICRMPFDGCCPDCKFPGDDCPLVWGKCSHNFHMHCILKWLSTQSSKGQCPMCRQPWK 82 (85)
T ss_pred CCCceeeEecccccCCCCccCCCCCCceeeccCccHHHHHHHHHHHccccCCCCCCCcCCeee
Confidence 367899999887411 1 2233359999999999999975 468999999875
No 20
>PHA02926 zinc finger-like protein; Provisional
Probab=99.19 E-value=1.3e-11 Score=80.43 Aligned_cols=53 Identities=25% Similarity=0.676 Sum_probs=39.6
Q ss_pred CCCCccccccccccccCCC------ceEEeCCCCcccHhHHHHHHhcC------Ccccccccccc
Q psy8622 35 TDDSNLQCSVCWEQFTLDE------AVRKLPCDHFYHTPCIEPWLQLH------GTCPICRQTLH 87 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~------~~~~l~C~H~f~~~Ci~~w~~~~------~~CP~Cr~~~~ 87 (101)
..+.+..|+||++.+..+. .....+|+|.||..||..|...+ .+||+||..+.
T Consensus 166 ~~SkE~eCgICmE~I~eK~~~~eRrFGIL~~CnHsFCl~CIr~Wr~~r~~~~~~rsCPiCR~~f~ 230 (242)
T PHA02926 166 RVSKEKECGICYEVVYSKRLENDRYFGLLDSCNHIFCITCINIWHRTRRETGASDNCPICRTRFR 230 (242)
T ss_pred hccCCCCCccCccccccccccccccccccCCCCchHHHHHHHHHHHhccccCcCCcCCCCcceee
Confidence 3455789999999863221 12444699999999999998743 36999999874
No 21
>KOG2164|consensus
Probab=99.16 E-value=2.1e-11 Score=87.09 Aligned_cols=50 Identities=30% Similarity=0.794 Sum_probs=40.3
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcC-----CcccccccccccCCc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLH-----GTCPICRQTLHSESA 91 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~-----~~CP~Cr~~~~~~~~ 91 (101)
+..||||++... .+..+.|||+||..||.+++... ..||+||..+...+.
T Consensus 186 ~~~CPICL~~~~---~p~~t~CGHiFC~~CiLqy~~~s~~~~~~~CPiC~s~I~~kdl 240 (513)
T KOG2164|consen 186 DMQCPICLEPPS---VPVRTNCGHIFCGPCILQYWNYSAIKGPCSCPICRSTITLKDL 240 (513)
T ss_pred CCcCCcccCCCC---cccccccCceeeHHHHHHHHhhhcccCCccCCchhhhccccce
Confidence 789999999753 33667799999999999988643 479999998876443
No 22
>TIGR00599 rad18 DNA repair protein rad18. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=99.15 E-value=3.2e-11 Score=85.06 Aligned_cols=54 Identities=26% Similarity=0.639 Sum_probs=45.8
Q ss_pred ccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 33 VRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 33 ~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
....+....|+||++.+..+ ++++|+|.||..||..|+.....||+||..+...
T Consensus 20 l~~Le~~l~C~IC~d~~~~P---vitpCgH~FCs~CI~~~l~~~~~CP~Cr~~~~~~ 73 (397)
T TIGR00599 20 LYPLDTSLRCHICKDFFDVP---VLTSCSHTFCSLCIRRCLSNQPKCPLCRAEDQES 73 (397)
T ss_pred ccccccccCCCcCchhhhCc---cCCCCCCchhHHHHHHHHhCCCCCCCCCCccccc
Confidence 34566678999999998644 6789999999999999998888999999988654
No 23
>COG5574 PEX10 RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=99.12 E-value=6.4e-11 Score=78.73 Aligned_cols=52 Identities=23% Similarity=0.671 Sum_probs=43.3
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHHH-HHhcCCc-ccccccccccCCc
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEP-WLQLHGT-CPICRQTLHSESA 91 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~-w~~~~~~-CP~Cr~~~~~~~~ 91 (101)
..+..|+||++... .+..++|||+||+.||.. |-..+.. ||+||+.+....+
T Consensus 213 ~~d~kC~lC~e~~~---~ps~t~CgHlFC~~Cl~~~~t~~k~~~CplCRak~~pk~v 266 (271)
T COG5574 213 LADYKCFLCLEEPE---VPSCTPCGHLFCLSCLLISWTKKKYEFCPLCRAKVYPKKV 266 (271)
T ss_pred ccccceeeeecccC---CcccccccchhhHHHHHHHHHhhccccCchhhhhccchhh
Confidence 56889999999864 448899999999999999 8766554 9999999877654
No 24
>smart00184 RING Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s)
Probab=99.09 E-value=1.3e-10 Score=55.89 Aligned_cols=38 Identities=42% Similarity=1.128 Sum_probs=32.1
Q ss_pred cccccccccCCCceEEeCCCCcccHhHHHHHHh-cCCccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ-LHGTCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~C 82 (101)
|+||++.. ..+..++|+|.||..|+..|+. ....||+|
T Consensus 1 C~iC~~~~---~~~~~~~C~H~~c~~C~~~~~~~~~~~CP~C 39 (39)
T smart00184 1 CPICLEEL---KDPVVLPCGHTFCRSCIRKWLKSGNNTCPIC 39 (39)
T ss_pred CCcCccCC---CCcEEecCCChHHHHHHHHHHHhCcCCCCCC
Confidence 78999884 4457889999999999999998 56679987
No 25
>KOG0287|consensus
Probab=99.01 E-value=8.4e-11 Score=80.74 Aligned_cols=57 Identities=30% Similarity=0.733 Sum_probs=48.8
Q ss_pred ccccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 31 VQVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 31 ~~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
......+....|.||.+.|..+ .++||+|.||.-||..++..+..||.|+.++....
T Consensus 15 pslk~lD~lLRC~IC~eyf~ip---~itpCsHtfCSlCIR~~L~~~p~CP~C~~~~~Es~ 71 (442)
T KOG0287|consen 15 PSLKTLDDLLRCGICFEYFNIP---MITPCSHTFCSLCIRKFLSYKPQCPTCCVTVTESD 71 (442)
T ss_pred chhhhhHHHHHHhHHHHHhcCc---eeccccchHHHHHHHHHhccCCCCCceecccchhh
Confidence 3445566788999999999655 88999999999999999999999999999886544
No 26
>PF13445 zf-RING_UBOX: RING-type zinc-finger; PDB: 2CT2_A.
Probab=98.96 E-value=6.6e-10 Score=55.37 Aligned_cols=34 Identities=26% Similarity=0.753 Sum_probs=21.4
Q ss_pred cccccccccC-CCceEEeCCCCcccHhHHHHHHhcC
Q psy8622 42 CSVCWEQFTL-DEAVRKLPCDHFYHTPCIEPWLQLH 76 (101)
Q Consensus 42 C~IC~~~~~~-~~~~~~l~C~H~f~~~Ci~~w~~~~ 76 (101)
|+||.+ +.. ...+.+|+|||+|+.+|+.+++..+
T Consensus 1 CpIc~e-~~~~~n~P~~L~CGH~~c~~cl~~l~~~~ 35 (43)
T PF13445_consen 1 CPICKE-FSTEENPPMVLPCGHVFCKDCLQKLSKKS 35 (43)
T ss_dssp -TTT-----TTSS-EEE-SSS-EEEHHHHHHHHHH-
T ss_pred CCcccc-ccCCCCCCEEEeCccHHHHHHHHHHHhcC
Confidence 899999 644 3356889999999999999998743
No 27
>COG5432 RAD18 RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms]
Probab=98.90 E-value=7.6e-10 Score=74.81 Aligned_cols=54 Identities=26% Similarity=0.616 Sum_probs=46.0
Q ss_pred cccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 32 QVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 32 ~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
.....+....|.||.+.|..+ ..++|||.||.-||..++..+..||+||.+...
T Consensus 18 SL~~LDs~lrC~IC~~~i~ip---~~TtCgHtFCslCIR~hL~~qp~CP~Cr~~~~e 71 (391)
T COG5432 18 SLKGLDSMLRCRICDCRISIP---CETTCGHTFCSLCIRRHLGTQPFCPVCREDPCE 71 (391)
T ss_pred chhcchhHHHhhhhhheeecc---eecccccchhHHHHHHHhcCCCCCccccccHHh
Confidence 344556678999999998655 889999999999999999999999999987653
No 28
>COG5194 APC11 Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning]
Probab=98.88 E-value=2.2e-09 Score=59.39 Aligned_cols=30 Identities=43% Similarity=0.978 Sum_probs=27.8
Q ss_pred CCCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 60 CDHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
|.|.||..||.+|+..+..||++|+++...
T Consensus 54 CnHaFH~HCI~rWL~Tk~~CPld~q~w~~~ 83 (88)
T COG5194 54 CNHAFHDHCIYRWLDTKGVCPLDRQTWVLA 83 (88)
T ss_pred cchHHHHHHHHHHHhhCCCCCCCCceeEEe
Confidence 999999999999999999999999987644
No 29
>PF04564 U-box: U-box domain; InterPro: IPR003613 Quality control of intracellular proteins is essential for cellular homeostasis. Molecular chaperones recognise and contribute to the refolding of misfolded or unfolded proteins, whereas the ubiquitin-proteasome system mediates the degradation of such abnormal proteins. Ubiquitin-protein ligases (E3s) determine the substrate specificity for ubiquitylation and have been classified into HECT and RING-finger families. More recently, however, U-box proteins, which contain a domain (the U box) of about 70 amino acids that is conserved from yeast to humans, have been identified as a new type of E3 []. Members of the U-box family of proteins constitute a class of ubiquitin-protein ligases (E3s) distinct from the HECT-type and RING finger-containing E3 families []. Using yeast two-hybrid technology, all mammalian U-box proteins have been reported to interact with molecular chaperones or co-chaperones, including Hsp90, Hsp70, DnaJc7, EKN1, CRN, and VCP. This suggests that the function of U box-type E3s is to mediate the degradation of unfolded or misfolded proteins in conjunction with molecular chaperones as receptors that recognise such abnormal proteins [, ]. Unlike the RING finger domain, IPR001841 from INTERPRO, that is stabilised by Zn2+ ions coordinated by the cysteines and a histidine, the U-box scaffold is probably stabilised by a system of salt-bridges and hydrogen bonds. The charged and polar residues that participate in this network of bonds are more strongly conserved in the U-box proteins than in classic RING fingers, which supports their role in maintaining the stability of the U box. Thus, the U box appears to have evolved from a RING finger domain by appropriation of a new set of residues required to stabilise its structure, concomitant with the loss of the original, metal-chelating residues [].; GO: 0004842 ubiquitin-protein ligase activity, 0016567 protein ubiquitination, 0000151 ubiquitin ligase complex; PDB: 1T1H_A 2C2L_D 2C2V_V 1WGM_A 2KR4_A 3L1Z_B 3L1X_A 2KRE_A 3M63_A 2QIZ_A ....
Probab=98.81 E-value=4.6e-09 Score=58.09 Aligned_cols=52 Identities=31% Similarity=0.496 Sum_probs=40.1
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhc-CCcccccccccccCCcc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTLHSESAE 92 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~~~~~~~ 92 (101)
+.+.|+|+.+.+.++ +++++||+|...+|.+|+.. ..+||+++..+...+..
T Consensus 3 ~~f~CpIt~~lM~dP---Vi~~~G~tyer~~I~~~l~~~~~~~P~t~~~l~~~~l~ 55 (73)
T PF04564_consen 3 DEFLCPITGELMRDP---VILPSGHTYERSAIERWLEQNGGTDPFTRQPLSESDLI 55 (73)
T ss_dssp GGGB-TTTSSB-SSE---EEETTSEEEEHHHHHHHHCTTSSB-TTT-SB-SGGGSE
T ss_pred cccCCcCcCcHhhCc---eeCCcCCEEcHHHHHHHHHcCCCCCCCCCCcCCcccce
Confidence 467899999999655 88999999999999999998 88999999988765443
No 30
>KOG1734|consensus
Probab=98.81 E-value=1e-09 Score=73.42 Aligned_cols=55 Identities=31% Similarity=0.724 Sum_probs=43.6
Q ss_pred CccccccccccccCCC-------ceEEeCCCCcccHhHHHHHH--hcCCcccccccccccCCcc
Q psy8622 38 SNLQCSVCWEQFTLDE-------AVRKLPCDHFYHTPCIEPWL--QLHGTCPICRQTLHSESAE 92 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~-------~~~~l~C~H~f~~~Ci~~w~--~~~~~CP~Cr~~~~~~~~~ 92 (101)
++..|++|-..+.... ....|.|+|+||..||..|- ..+++||.|+..+..+...
T Consensus 223 ~d~vCaVCg~~~~~s~~eegvienty~LsCnHvFHEfCIrGWcivGKkqtCPYCKekVdl~rmf 286 (328)
T KOG1734|consen 223 SDSVCAVCGQQIDVSVDEEGVIENTYKLSCNHVFHEFCIRGWCIVGKKQTCPYCKEKVDLKRMF 286 (328)
T ss_pred CcchhHhhcchheeecchhhhhhhheeeecccchHHHhhhhheeecCCCCCchHHHHhhHhhhc
Confidence 4568999998774333 56778999999999999996 4678999999988765443
No 31
>KOG1493|consensus
Probab=98.74 E-value=3e-09 Score=58.38 Aligned_cols=48 Identities=35% Similarity=0.814 Sum_probs=35.4
Q ss_pred cccccccccccCC---------CceEEeC-CCCcccHhHHHHHHhc---CCcccccccccc
Q psy8622 40 LQCSVCWEQFTLD---------EAVRKLP-CDHFYHTPCIEPWLQL---HGTCPICRQTLH 87 (101)
Q Consensus 40 ~~C~IC~~~~~~~---------~~~~~l~-C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~ 87 (101)
..|.||.-.|... +.+.++. |.|.||..||.+|+.. +..||+||+.+.
T Consensus 21 e~CGiCRm~Fdg~Cp~Ck~PgDdCPLv~G~C~h~fh~hCI~~wl~~~tsq~~CPmcRq~~~ 81 (84)
T KOG1493|consen 21 ETCGICRMPFDGCCPDCKLPGDDCPLVWGYCLHAFHAHCILKWLNTPTSQGQCPMCRQTWQ 81 (84)
T ss_pred CccceEecccCCcCCCCcCCCCCCccHHHHHHHHHHHHHHHHHhcCccccccCCcchheeE
Confidence 3788888877321 2233333 9999999999999964 457999999764
No 32
>KOG4265|consensus
Probab=98.72 E-value=1.7e-08 Score=69.75 Aligned_cols=52 Identities=27% Similarity=0.645 Sum_probs=42.1
Q ss_pred CCCCccccccccccccCCCceEEeCCCC-cccHhHHHHHHhcCCcccccccccccC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
..+...+|.||+.+.. .+.+|||.| -.|..|.....-..+.||+||+.+...
T Consensus 286 ~~~~gkeCVIClse~r---dt~vLPCRHLCLCs~Ca~~Lr~q~n~CPICRqpi~~l 338 (349)
T KOG4265|consen 286 ESESGKECVICLSESR---DTVVLPCRHLCLCSGCAKSLRYQTNNCPICRQPIEEL 338 (349)
T ss_pred cccCCCeeEEEecCCc---ceEEecchhhehhHhHHHHHHHhhcCCCccccchHhh
Confidence 4455789999999864 448899999 689999988766778899999998643
No 33
>TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1. All proteins in this family for which functions are known are cyclin dependent protein kinases that are components of TFIIH, a complex that is involved in nucleotide excision repair and transcription initiation. Also known as MAT1 (menage a trois 1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=98.72 E-value=1.1e-08 Score=70.11 Aligned_cols=52 Identities=19% Similarity=0.453 Sum_probs=37.6
Q ss_pred ccccccccccc-cCCC-ceEEeCCCCcccHhHHHHHHh-cCCcccccccccccCC
Q psy8622 39 NLQCSVCWEQF-TLDE-AVRKLPCDHFYHTPCIEPWLQ-LHGTCPICRQTLHSES 90 (101)
Q Consensus 39 ~~~C~IC~~~~-~~~~-~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~~~~~ 90 (101)
+..||+|.... .+++ ...+-+|||.||..|+...+. ....||.|+..+...+
T Consensus 3 ~~~CP~Ck~~~y~np~~kl~i~~CGH~~C~sCv~~l~~~~~~~CP~C~~~lrk~~ 57 (309)
T TIGR00570 3 DQGCPRCKTTKYRNPSLKLMVNVCGHTLCESCVDLLFVRGSGSCPECDTPLRKNN 57 (309)
T ss_pred CCCCCcCCCCCccCcccccccCCCCCcccHHHHHHHhcCCCCCCCCCCCccchhh
Confidence 46899999953 2222 223337999999999999664 4568999999886554
No 34
>KOG0828|consensus
Probab=98.71 E-value=5.3e-09 Score=74.98 Aligned_cols=51 Identities=27% Similarity=0.797 Sum_probs=39.1
Q ss_pred CccccccccccccCC--------------CceEEeCCCCcccHhHHHHHHh-cCCccccccccccc
Q psy8622 38 SNLQCSVCWEQFTLD--------------EAVRKLPCDHFYHTPCIEPWLQ-LHGTCPICRQTLHS 88 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~--------------~~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~~~ 88 (101)
....|+||+..+.-. .....+||.|+||..|+.+|.. .+-.||+||.+++.
T Consensus 570 ~t~dC~ICMt~I~l~~~~s~~~~~~~~~~~nYm~tPC~HifH~~CL~~WMd~ykl~CPvCR~pLPp 635 (636)
T KOG0828|consen 570 RTNDCVICMTPIDLRSTGSDCMVASMMVRRNYMLTPCHHIFHRQCLLQWMDTYKLICPVCRCPLPP 635 (636)
T ss_pred ccccceEeccccceeeccCcchhhhhhhhccccccchHHHHHHHHHHHHHhhhcccCCccCCCCCC
Confidence 356799999876211 1134469999999999999998 55599999998853
No 35
>smart00744 RINGv The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Class
Probab=98.70 E-value=1.7e-08 Score=51.71 Aligned_cols=42 Identities=29% Similarity=0.831 Sum_probs=32.0
Q ss_pred ccccccccccCCCceEEeCCC-----CcccHhHHHHHHhc--CCcccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCD-----HFYHTPCIEPWLQL--HGTCPICR 83 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~-----H~f~~~Ci~~w~~~--~~~CP~Cr 83 (101)
.|.||++.. .++...+.||. |.+|..|+.+|+.. +.+||+|+
T Consensus 1 ~CrIC~~~~-~~~~~l~~PC~C~G~~~~vH~~Cl~~W~~~~~~~~C~iC~ 49 (49)
T smart00744 1 ICRICHDEG-DEGDPLVSPCRCKGSLKYVHQECLERWINESGNKTCEICK 49 (49)
T ss_pred CccCCCCCC-CCCCeeEeccccCCchhHHHHHHHHHHHHHcCCCcCCCCC
Confidence 488999833 33455678974 88999999999964 45899984
No 36
>PF11793 FANCL_C: FANCL C-terminal domain; PDB: 3K1L_A.
Probab=98.69 E-value=2.1e-09 Score=59.01 Aligned_cols=51 Identities=24% Similarity=0.668 Sum_probs=23.9
Q ss_pred cccccccccccc-CCCce-EEe---CCCCcccHhHHHHHHhc---C--------CcccccccccccC
Q psy8622 39 NLQCSVCWEQFT-LDEAV-RKL---PCDHFYHTPCIEPWLQL---H--------GTCPICRQTLHSE 89 (101)
Q Consensus 39 ~~~C~IC~~~~~-~~~~~-~~l---~C~H~f~~~Ci~~w~~~---~--------~~CP~Cr~~~~~~ 89 (101)
+..|+||+..+. .++.+ ..- .|+..||..||.+|+.. . ..||.|+.++..+
T Consensus 2 ~~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~wf~~~~~~~~~~~~~~G~CP~C~~~i~~~ 68 (70)
T PF11793_consen 2 ELECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEWFLSLEKSRQSFIPIFGECPYCSSPISWS 68 (70)
T ss_dssp --S-SSS--SS-TT-----B--S-TT----B-SGGGHHHHHHHHSSS-TTT--EEE-TTT-SEEEGG
T ss_pred CCCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHHHHHcccCCeeecccccCCcCCCCeeeEe
Confidence 467999999865 32222 222 48999999999999853 1 1599999988643
No 37
>KOG2177|consensus
Probab=98.67 E-value=8.2e-09 Score=69.22 Aligned_cols=48 Identities=29% Similarity=0.695 Sum_probs=40.9
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
....+...|+||++.|..+ .+++|+|.||..|+..++.....||.||.
T Consensus 8 ~~~~~~~~C~iC~~~~~~p---~~l~C~H~~c~~C~~~~~~~~~~Cp~cr~ 55 (386)
T KOG2177|consen 8 EVLQEELTCPICLEYFREP---VLLPCGHNFCRACLTRSWEGPLSCPVCRP 55 (386)
T ss_pred hhccccccChhhHHHhhcC---ccccccchHhHHHHHHhcCCCcCCcccCC
Confidence 4456678999999999755 88999999999999998875568999993
No 38
>KOG0804|consensus
Probab=98.66 E-value=1.3e-08 Score=72.07 Aligned_cols=51 Identities=25% Similarity=0.737 Sum_probs=39.7
Q ss_pred CCCCccccccccccccCCC-ceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 35 TDDSNLQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
...+..+||+|||.+.+.. .++.+.|.|.||..|+..|. ..+||+||....
T Consensus 171 ~~tELPTCpVCLERMD~s~~gi~t~~c~Hsfh~~cl~~w~--~~scpvcR~~q~ 222 (493)
T KOG0804|consen 171 GLTELPTCPVCLERMDSSTTGILTILCNHSFHCSCLMKWW--DSSCPVCRYCQS 222 (493)
T ss_pred CcccCCCcchhHhhcCccccceeeeecccccchHHHhhcc--cCcChhhhhhcC
Confidence 3445679999999986443 23445699999999999994 568999998765
No 39
>COG5219 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=98.53 E-value=2.3e-08 Score=76.45 Aligned_cols=52 Identities=23% Similarity=0.823 Sum_probs=37.8
Q ss_pred CCCcccccccccccc-CCC--ceEEe-CCCCcccHhHHHHHHhc--CCcccccccccc
Q psy8622 36 DDSNLQCSVCWEQFT-LDE--AVRKL-PCDHFYHTPCIEPWLQL--HGTCPICRQTLH 87 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~-~~~--~~~~l-~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~~ 87 (101)
.++..+|+||...+. .+. +.... .|.|.||.+|+.+|+.. +.+||+||.++.
T Consensus 1466 fsG~eECaICYsvL~~vdr~lPskrC~TCknKFH~~CLyKWf~Ss~~s~CPlCRseit 1523 (1525)
T COG5219 1466 FSGHEECAICYSVLDMVDRSLPSKRCATCKNKFHTRCLYKWFASSARSNCPLCRSEIT 1523 (1525)
T ss_pred cCCcchhhHHHHHHHHHhccCCccccchhhhhhhHHHHHHHHHhcCCCCCCccccccc
Confidence 455678999998764 110 11112 29999999999999975 568999998764
No 40
>KOG0311|consensus
Probab=98.49 E-value=1.8e-08 Score=69.57 Aligned_cols=63 Identities=24% Similarity=0.480 Sum_probs=46.3
Q ss_pred ccccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHh-cCCcccccccccccCCcccCC
Q psy8622 31 VQVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ-LHGTCPICRQTLHSESAESLG 95 (101)
Q Consensus 31 ~~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~~~~~~~~~~ 95 (101)
+.......+..|+||++.+... .....|.|.||..||...+. .+..||.||+.+.+.......
T Consensus 35 ~~l~~~~~~v~c~icl~llk~t--mttkeClhrfc~~ci~~a~r~gn~ecptcRk~l~SkrsLr~D 98 (381)
T KOG0311|consen 35 VDLAMFDIQVICPICLSLLKKT--MTTKECLHRFCFDCIWKALRSGNNECPTCRKKLVSKRSLRID 98 (381)
T ss_pred ecHHHhhhhhccHHHHHHHHhh--cccHHHHHHHHHHHHHHHHHhcCCCCchHHhhccccccCCCC
Confidence 3344456678999999998633 23334999999999988775 477999999998766444433
No 41
>KOG2930|consensus
Probab=98.46 E-value=1.4e-07 Score=54.73 Aligned_cols=51 Identities=25% Similarity=0.640 Sum_probs=37.5
Q ss_pred CCccccccccccc-------------cCCCceEEe-CCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 37 DSNLQCSVCWEQF-------------TLDEAVRKL-PCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 37 ~~~~~C~IC~~~~-------------~~~~~~~~l-~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
...+.|+||..-+ ...+..+.- .|.|.||..||.+|++.+..||+|.++..
T Consensus 44 i~vDnCAICRnHIMd~CieCQa~~~~~~~EC~VaWG~CNHaFH~hCisrWlktr~vCPLdn~eW~ 108 (114)
T KOG2930|consen 44 IVVDNCAICRNHIMDLCIECQANQSATSEECTVAWGVCNHAFHFHCISRWLKTRNVCPLDNKEWV 108 (114)
T ss_pred eeechhHHHHHHHHHHHHhhccCCCCCCCceEEEeeecchHHHHHHHHHHHhhcCcCCCcCccee
Confidence 3456899987543 112223333 39999999999999999999999987653
No 42
>KOG0824|consensus
Probab=98.41 E-value=1.2e-07 Score=64.48 Aligned_cols=50 Identities=24% Similarity=0.498 Sum_probs=40.3
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhc-CCcccccccccccCC
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTLHSES 90 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~~~~~ 90 (101)
....|+||+..... ++.|+|+|.||.-||...... ..+|++||.++.+.-
T Consensus 6 ~~~eC~IC~nt~n~---Pv~l~C~HkFCyiCiKGsy~ndk~~CavCR~pids~i 56 (324)
T KOG0824|consen 6 KKKECLICYNTGNC---PVNLYCFHKFCYICIKGSYKNDKKTCAVCRFPIDSTI 56 (324)
T ss_pred cCCcceeeeccCCc---CccccccchhhhhhhcchhhcCCCCCceecCCCCcch
Confidence 35689999998643 388999999999999877655 456999999997653
No 43
>PF14835 zf-RING_6: zf-RING of BARD1-type protein; PDB: 1JM7_B.
Probab=98.40 E-value=5.6e-08 Score=51.91 Aligned_cols=51 Identities=24% Similarity=0.686 Sum_probs=24.7
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCCcc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAE 92 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~ 92 (101)
....|++|.+.+.. ++....|.|+||..|+..-+. ..||+|+.+.-..+..
T Consensus 6 ~lLrCs~C~~~l~~--pv~l~~CeH~fCs~Ci~~~~~--~~CPvC~~Paw~qD~~ 56 (65)
T PF14835_consen 6 ELLRCSICFDILKE--PVCLGGCEHIFCSSCIRDCIG--SECPVCHTPAWIQDIQ 56 (65)
T ss_dssp HTTS-SSS-S--SS---B---SSS--B-TTTGGGGTT--TB-SSS--B-S-SS--
T ss_pred HhcCCcHHHHHhcC--CceeccCccHHHHHHhHHhcC--CCCCCcCChHHHHHHH
Confidence 35689999999863 344556999999999987433 4599999987655543
No 44
>KOG0827|consensus
Probab=98.38 E-value=1.6e-07 Score=65.86 Aligned_cols=47 Identities=30% Similarity=0.878 Sum_probs=34.8
Q ss_pred cccccccccccCCCceEEeC-CCCcccHhHHHHHHhc---CCccccccccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLP-CDHFYHTPCIEPWLQL---HGTCPICRQTL 86 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~-C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~ 86 (101)
..|.||.+.+-....+..+. |||+||..|+..|+.. ++.||.|+-.+
T Consensus 5 A~C~Ic~d~~p~~~~l~~i~~cGhifh~~cl~qwfe~~Ps~R~cpic~ik~ 55 (465)
T KOG0827|consen 5 AECHICIDGRPNDHELGPIGTCGHIFHTTCLTQWFEGDPSNRGCPICQIKL 55 (465)
T ss_pred ceeeEeccCCccccccccccchhhHHHHHHHHHHHccCCccCCCCceeecc
Confidence 47999955543333444454 9999999999999975 35899999444
No 45
>KOG0978|consensus
Probab=98.28 E-value=2e-07 Score=69.69 Aligned_cols=58 Identities=19% Similarity=0.557 Sum_probs=45.2
Q ss_pred ccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHh-cCCcccccccccccCCccc
Q psy8622 33 VRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ-LHGTCPICRQTLHSESAES 93 (101)
Q Consensus 33 ~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~~~~~~~~ 93 (101)
....-+-..|+.|-.... ..+++.|+|.||..|+...+. +.+.||.|...|...+.-.
T Consensus 637 lk~yK~~LkCs~Cn~R~K---d~vI~kC~H~FC~~Cvq~r~etRqRKCP~Cn~aFganDv~~ 695 (698)
T KOG0978|consen 637 LKEYKELLKCSVCNTRWK---DAVITKCGHVFCEECVQTRYETRQRKCPKCNAAFGANDVHR 695 (698)
T ss_pred HHHHHhceeCCCccCchh---hHHHHhcchHHHHHHHHHHHHHhcCCCCCCCCCCCcccccc
Confidence 344556789999997654 336677999999999998875 5778999999997766543
No 46
>KOG4172|consensus
Probab=98.28 E-value=1.3e-07 Score=48.78 Aligned_cols=46 Identities=30% Similarity=0.701 Sum_probs=35.1
Q ss_pred ccccccccccccCCCceEEeCCCC-cccHhHHHHHHh-cCCcccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQ-LHGTCPICRQTLH 87 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~-~~~~CP~Cr~~~~ 87 (101)
..+|.||++.-.+ .+...||| -.|..|..+.+. ....||+||+++.
T Consensus 7 ~dECTICye~pvd---sVlYtCGHMCmCy~Cg~rl~~~~~g~CPiCRapi~ 54 (62)
T KOG4172|consen 7 SDECTICYEHPVD---SVLYTCGHMCMCYACGLRLKKALHGCCPICRAPIK 54 (62)
T ss_pred ccceeeeccCcch---HHHHHcchHHhHHHHHHHHHHccCCcCcchhhHHH
Confidence 3689999996432 25567999 578899877655 5779999999885
No 47
>KOG1941|consensus
Probab=98.26 E-value=4.7e-07 Score=63.69 Aligned_cols=52 Identities=31% Similarity=0.828 Sum_probs=41.6
Q ss_pred CCCCccccccccccccCCC-ceEEeCCCCcccHhHHHHHHhc--CCccccccccc
Q psy8622 35 TDDSNLQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQTL 86 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~ 86 (101)
..+.+..|..|-+.+...+ ....|||.|+||..|+...+.+ .++||.||+-.
T Consensus 361 ~~e~~L~Cg~CGe~~Glk~e~LqALpCsHIfH~rCl~e~L~~n~~rsCP~Crklr 415 (518)
T KOG1941|consen 361 VEETELYCGLCGESIGLKNERLQALPCSHIFHLRCLQEILENNGTRSCPNCRKLR 415 (518)
T ss_pred HHHHhhhhhhhhhhhcCCcccccccchhHHHHHHHHHHHHHhCCCCCCccHHHHH
Confidence 3455788999999886443 6788999999999999999865 45899999533
No 48
>KOG1039|consensus
Probab=98.20 E-value=6.7e-07 Score=62.38 Aligned_cols=51 Identities=31% Similarity=0.779 Sum_probs=38.0
Q ss_pred CCccccccccccccCCC----ceEEeC-CCCcccHhHHHHHHh--c-----CCcccccccccc
Q psy8622 37 DSNLQCSVCWEQFTLDE----AVRKLP-CDHFYHTPCIEPWLQ--L-----HGTCPICRQTLH 87 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~----~~~~l~-C~H~f~~~Ci~~w~~--~-----~~~CP~Cr~~~~ 87 (101)
..+..|.||++.+.... ...++| |.|.||..|+..|.. + .+.||.||....
T Consensus 159 s~~k~CGICme~i~ek~~~~~rfgilpnC~H~~Cl~Cir~wr~~~q~~~~~sksCP~CRv~s~ 221 (344)
T KOG1039|consen 159 SSEKECGICMETINEKAASERRFGILPNCNHSFCLNCIRKWRQATQFESKTSKSCPFCRVPSS 221 (344)
T ss_pred cccccceehhhhccccchhhhhcccCCCcchhhhhcHhHhhhhhhccccccccCCCcccCccc
Confidence 45779999999875332 123344 999999999999983 3 468999998653
No 49
>KOG0825|consensus
Probab=98.12 E-value=4.6e-07 Score=68.35 Aligned_cols=49 Identities=20% Similarity=0.517 Sum_probs=39.6
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
...|++|+..+.........+|+|.||..|+..|-....+||+||..+.
T Consensus 123 ~~~CP~Ci~s~~DqL~~~~k~c~H~FC~~Ci~sWsR~aqTCPiDR~EF~ 171 (1134)
T KOG0825|consen 123 ENQCPNCLKSCNDQLEESEKHTAHYFCEECVGSWSRCAQTCPVDRGEFG 171 (1134)
T ss_pred hhhhhHHHHHHHHHhhccccccccccHHHHhhhhhhhcccCchhhhhhh
Confidence 4568888877754444445579999999999999999999999999774
No 50
>KOG3039|consensus
Probab=98.07 E-value=4.6e-06 Score=55.55 Aligned_cols=64 Identities=20% Similarity=0.387 Sum_probs=54.9
Q ss_pred CCccccccccccccCCCceEEe-CCCCcccHhHHHHHHhcCCcccccccccccCCcccCCCCCCC
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKL-PCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAESLGPAGAG 100 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l-~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~~~~~~~~ 100 (101)
+..+.||+|.+.+.+......| ||||+|+..|..+.+.....||+|-.++.+.+++.+..+|.|
T Consensus 219 s~ryiCpvtrd~LtNt~~ca~Lr~sg~Vv~~ecvEklir~D~v~pv~d~plkdrdiI~LqrGGTG 283 (303)
T KOG3039|consen 219 SKRYICPVTRDTLTNTTPCAVLRPSGHVVTKECVEKLIRKDMVDPVTDKPLKDRDIIGLQRGGTG 283 (303)
T ss_pred ccceecccchhhhcCccceEEeccCCcEeeHHHHHHhccccccccCCCCcCcccceEeeeccccc
Confidence 3568999999999877655555 699999999999999999999999999999999987766654
No 51
>KOG1785|consensus
Probab=98.07 E-value=1.6e-06 Score=61.30 Aligned_cols=51 Identities=29% Similarity=0.706 Sum_probs=40.6
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCcccccccccccCCcc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQTLHSESAE 92 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~~~~~~~ 92 (101)
-..|.||-+.- ..+.+-||||..|..|+..|-.. .++||.||.++...+.+
T Consensus 369 FeLCKICaend---KdvkIEPCGHLlCt~CLa~WQ~sd~gq~CPFCRcEIKGte~v 421 (563)
T KOG1785|consen 369 FELCKICAEND---KDVKIEPCGHLLCTSCLAAWQDSDEGQTCPFCRCEIKGTEPV 421 (563)
T ss_pred HHHHHHhhccC---CCcccccccchHHHHHHHhhcccCCCCCCCceeeEeccccce
Confidence 45799998852 45577799999999999999743 57999999999765544
No 52
>KOG4159|consensus
Probab=98.05 E-value=2.3e-06 Score=60.81 Aligned_cols=49 Identities=29% Similarity=0.699 Sum_probs=42.8
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
..++.|.||...+..+ +.+||||.||..||.+-+.....||.||..+..
T Consensus 82 ~sef~c~vc~~~l~~p---v~tpcghs~c~~Cl~r~ld~~~~cp~Cr~~l~e 130 (398)
T KOG4159|consen 82 RSEFECCVCSRALYPP---VVTPCGHSFCLECLDRSLDQETECPLCRDELVE 130 (398)
T ss_pred cchhhhhhhHhhcCCC---ccccccccccHHHHHHHhccCCCCccccccccc
Confidence 5678999999988554 778999999999999988888899999998864
No 53
>KOG2660|consensus
Probab=98.04 E-value=8.2e-07 Score=61.03 Aligned_cols=55 Identities=20% Similarity=0.521 Sum_probs=44.2
Q ss_pred cccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 32 QVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 32 ~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
..........|.+|-..|... ..+..|-|.||.+||.+++.....||.|...+..
T Consensus 8 k~~~~n~~itC~LC~GYliDA--TTI~eCLHTFCkSCivk~l~~~~~CP~C~i~ih~ 62 (331)
T KOG2660|consen 8 KLTELNPHITCRLCGGYLIDA--TTITECLHTFCKSCIVKYLEESKYCPTCDIVIHK 62 (331)
T ss_pred hhhhcccceehhhccceeecc--hhHHHHHHHHHHHHHHHHHHHhccCCccceeccC
Confidence 344555677999999988633 3555699999999999999999999999887754
No 54
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=97.98 E-value=3.8e-06 Score=54.38 Aligned_cols=49 Identities=16% Similarity=0.446 Sum_probs=41.5
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
..-.+.|.||-++|..+ +++.|||.||..|..+-++....|-+|.+...
T Consensus 193 e~IPF~C~iCKkdy~sp---vvt~CGH~FC~~Cai~~y~kg~~C~~Cgk~t~ 241 (259)
T COG5152 193 EKIPFLCGICKKDYESP---VVTECGHSFCSLCAIRKYQKGDECGVCGKATY 241 (259)
T ss_pred CCCceeehhchhhccch---hhhhcchhHHHHHHHHHhccCCcceecchhhc
Confidence 34467999999999665 78889999999999998888889999977654
No 55
>KOG1645|consensus
Probab=97.97 E-value=6.7e-06 Score=58.15 Aligned_cols=49 Identities=29% Similarity=0.842 Sum_probs=38.1
Q ss_pred CccccccccccccCCC--ceEEeCCCCcccHhHHHHHHhc--CCccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDE--AVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQTL 86 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~--~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~ 86 (101)
....||||++.+.-.- .+..+.|+|.|...||++|+.. ...||.|..+.
T Consensus 3 ~g~tcpiclds~~~~g~hr~vsl~cghlFgs~cie~wl~k~~~~~cp~c~~ka 55 (463)
T KOG1645|consen 3 CGTTCPICLDSYTTAGNHRIVSLQCGHLFGSQCIEKWLGKKTKMQCPLCSGKA 55 (463)
T ss_pred ccccCceeeeeeeecCceEEeeecccccccHHHHHHHHhhhhhhhCcccCChh
Confidence 3568999999875332 5566779999999999999952 34799997654
No 56
>KOG4445|consensus
Probab=97.96 E-value=2.1e-06 Score=58.61 Aligned_cols=52 Identities=33% Similarity=0.796 Sum_probs=42.4
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhc-----------------------CCcccccccccccC
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL-----------------------HGTCPICRQTLHSE 89 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~-----------------------~~~CP~Cr~~~~~~ 89 (101)
....|.|||.-|.+++...+++|.|.||..|+.+++.. ...||+||..+..+
T Consensus 114 p~gqCvICLygfa~~~~ft~T~C~Hy~H~~ClaRyl~~~~~~lrqe~q~~~~~~qh~~~~~eavcpVcre~i~~e 188 (368)
T KOG4445|consen 114 PNGQCVICLYGFASSPAFTVTACDHYMHFACLARYLTECLTGLRQEIQDAQKERQHMKEQVEAVCPVCRERIKIE 188 (368)
T ss_pred CCCceEEEEEeecCCCceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHhhhhhHhhhhcccc
Confidence 45689999999988777888999999999999887521 12599999998755
No 57
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=97.94 E-value=9.6e-06 Score=42.71 Aligned_cols=44 Identities=23% Similarity=0.633 Sum_probs=29.0
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCcccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPI 81 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~ 81 (101)
......|||.+..|. +++....|+|+|-...|..|++. ...||+
T Consensus 8 ~~~~~~CPiT~~~~~--~PV~s~~C~H~fek~aI~~~i~~~~~~~CPv 53 (57)
T PF11789_consen 8 GTISLKCPITLQPFE--DPVKSKKCGHTFEKEAILQYIQRNGSKRCPV 53 (57)
T ss_dssp SB--SB-TTTSSB-S--SEEEESSS--EEEHHHHHHHCTTTS-EE-SC
T ss_pred cEeccCCCCcCChhh--CCcCcCCCCCeecHHHHHHHHHhcCCCCCCC
Confidence 445679999999987 45566689999999999999943 447998
No 58
>KOG2879|consensus
Probab=97.93 E-value=1.2e-05 Score=54.21 Aligned_cols=52 Identities=31% Similarity=0.516 Sum_probs=40.7
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCcccccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQTLH 87 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~~ 87 (101)
.....+.+|++|.+.-. .+....+|+|+||..|+..-+.. ..+||.|...+.
T Consensus 234 s~~t~~~~C~~Cg~~Pt--iP~~~~~C~HiyCY~Ci~ts~~~~asf~Cp~Cg~~~~ 287 (298)
T KOG2879|consen 234 STGTSDTECPVCGEPPT--IPHVIGKCGHIYCYYCIATSRLWDASFTCPLCGENVE 287 (298)
T ss_pred ccccCCceeeccCCCCC--CCeeeccccceeehhhhhhhhcchhhcccCccCCCCc
Confidence 34556789999999744 45566779999999999887653 468999988776
No 59
>KOG0297|consensus
Probab=97.88 E-value=5.4e-06 Score=58.99 Aligned_cols=53 Identities=32% Similarity=0.683 Sum_probs=43.5
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
.+.+..|++|...+.++ +..+.|||.||..|+..|+..+..||.|+..+....
T Consensus 18 ~~~~l~C~~C~~vl~~p--~~~~~cgh~fC~~C~~~~~~~~~~cp~~~~~~~~~~ 70 (391)
T KOG0297|consen 18 LDENLLCPICMSVLRDP--VQTTTCGHRFCAGCLLESLSNHQKCPVCRQELTQAE 70 (391)
T ss_pred CcccccCccccccccCC--CCCCCCCCcccccccchhhccCcCCcccccccchhh
Confidence 56678999999998644 232579999999999999999999999988775443
No 60
>KOG1813|consensus
Probab=97.74 E-value=1.2e-05 Score=54.75 Aligned_cols=52 Identities=21% Similarity=0.466 Sum_probs=43.6
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
...-.+.|.||...|..+ +++.|+|.||..|...-++....|++|.+.....
T Consensus 237 ~~~~Pf~c~icr~~f~~p---Vvt~c~h~fc~~ca~~~~qk~~~c~vC~~~t~g~ 288 (313)
T KOG1813|consen 237 IELLPFKCFICRKYFYRP---VVTKCGHYFCEVCALKPYQKGEKCYVCSQQTHGS 288 (313)
T ss_pred cccCCccccccccccccc---hhhcCCceeehhhhccccccCCcceecccccccc
Confidence 344467899999999766 7889999999999999888889999998876543
No 61
>PF04641 Rtf2: Rtf2 RING-finger
Probab=97.70 E-value=5.4e-05 Score=51.20 Aligned_cols=63 Identities=17% Similarity=0.446 Sum_probs=50.7
Q ss_pred CCCccccccccccccCCCce-EEeCCCCcccHhHHHHHHhcCCcccccccccccCCcccCCCCCC
Q psy8622 36 DDSNLQCSVCWEQFTLDEAV-RKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAESLGPAGA 99 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~-~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~~~~~~~ 99 (101)
....+.|||....|...... .+.+|||+|...++...- ....||+|-.++...+++.+.|...
T Consensus 110 ~~~~~~CPvt~~~~~~~~~fv~l~~cG~V~s~~alke~k-~~~~Cp~c~~~f~~~DiI~Lnp~~e 173 (260)
T PF04641_consen 110 SEGRFICPVTGKEFNGKHKFVYLRPCGCVFSEKALKELK-KSKKCPVCGKPFTEEDIIPLNPPEE 173 (260)
T ss_pred CCceeECCCCCcccCCceeEEEEcCCCCEeeHHHHHhhc-ccccccccCCccccCCEEEecCCcc
Confidence 46678999999998544444 444799999999999863 3567999999999999998888654
No 62
>KOG1428|consensus
Probab=97.67 E-value=3.3e-05 Score=62.21 Aligned_cols=53 Identities=25% Similarity=0.686 Sum_probs=42.2
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcC----------Ccccccccccc
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLH----------GTCPICRQTLH 87 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~----------~~CP~Cr~~~~ 87 (101)
..+.++.|-||+.+--...+.+.|.|+|+||..|..+.+++. -.||+|..++.
T Consensus 3482 kQD~DDmCmICFTE~L~AAP~IqL~C~HiFHlqC~R~vLE~RW~GPRItF~FisCPiC~n~In 3544 (3738)
T KOG1428|consen 3482 KQDADDMCMICFTEALSAAPAIQLDCSHIFHLQCCRRVLENRWLGPRITFGFISCPICKNKIN 3544 (3738)
T ss_pred hcccCceEEEEehhhhCCCcceecCCccchhHHHHHHHHHhcccCCeeEEeeeecccccchhh
Confidence 456678999999876566677889999999999998766532 16999998875
No 63
>KOG3970|consensus
Probab=97.58 E-value=4.8e-05 Score=50.26 Aligned_cols=60 Identities=23% Similarity=0.541 Sum_probs=45.7
Q ss_pred ccccccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--------CCcccccccccccC
Q psy8622 29 ALVQVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--------HGTCPICRQTLHSE 89 (101)
Q Consensus 29 ~~~~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--------~~~CP~Cr~~~~~~ 89 (101)
........+....|.+|-..+..++. +.|-|-|.||++|+..|... .-.||.|..++...
T Consensus 40 YLqWL~DsDY~pNC~LC~t~La~gdt-~RLvCyhlfHW~ClneraA~lPanTAPaGyqCP~Cs~eiFPp 107 (299)
T KOG3970|consen 40 YLQWLQDSDYNPNCRLCNTPLASGDT-TRLVCYHLFHWKCLNERAANLPANTAPAGYQCPCCSQEIFPP 107 (299)
T ss_pred HHHHHhhcCCCCCCceeCCccccCcc-eeehhhhhHHHHHhhHHHhhCCCcCCCCcccCCCCCCccCCC
Confidence 33445566777889999999877655 44779999999999999643 22699999988653
No 64
>KOG4692|consensus
Probab=97.58 E-value=7.8e-05 Score=52.22 Aligned_cols=51 Identities=24% Similarity=0.475 Sum_probs=43.0
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
.+.++..|+||...- ...+..||+|.-|..||.+.+...+.|=.|+..+..
T Consensus 418 p~sEd~lCpICyA~p---i~Avf~PC~H~SC~~CI~qHlmN~k~CFfCktTv~~ 468 (489)
T KOG4692|consen 418 PDSEDNLCPICYAGP---INAVFAPCSHRSCYGCITQHLMNCKRCFFCKTTVID 468 (489)
T ss_pred CCcccccCcceeccc---chhhccCCCCchHHHHHHHHHhcCCeeeEecceeee
Confidence 456788999998853 334678999999999999999999999999998864
No 65
>PF14570 zf-RING_4: RING/Ubox like zinc-binding domain; PDB: 1E4U_A 1UR6_B.
Probab=97.56 E-value=7.1e-05 Score=37.92 Aligned_cols=44 Identities=27% Similarity=0.628 Sum_probs=22.2
Q ss_pred cccccccccCCCceEEeC--CCCcccHhHHHHHHh-cCCccccccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLP--CDHFYHTPCIEPWLQ-LHGTCPICRQTL 86 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~--C~H~f~~~Ci~~w~~-~~~~CP~Cr~~~ 86 (101)
|++|.+++...+. ..+| |+..+|..|+...+. ....||-||.+.
T Consensus 1 cp~C~e~~d~~d~-~~~PC~Cgf~IC~~C~~~i~~~~~g~CPgCr~~Y 47 (48)
T PF14570_consen 1 CPLCDEELDETDK-DFYPCECGFQICRFCYHDILENEGGRCPGCREPY 47 (48)
T ss_dssp -TTTS-B--CCCT-T--SSTTS----HHHHHHHTTSS-SB-TTT--B-
T ss_pred CCCcccccccCCC-ccccCcCCCcHHHHHHHHHHhccCCCCCCCCCCC
Confidence 7899998843332 3344 899999999999886 478999999864
No 66
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=97.45 E-value=9.6e-05 Score=51.54 Aligned_cols=56 Identities=25% Similarity=0.568 Sum_probs=40.7
Q ss_pred CCCccccccccccccCCCc-eEEeCCCCcccHhHHHHHHhc-CCcccccccccccCCc
Q psy8622 36 DDSNLQCSVCWEQFTLDEA-VRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTLHSESA 91 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~-~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~~~~~~ 91 (101)
.++++.||+|++.+...+. ..-.+||-..|.-|+....+. +..||.||+...+++.
T Consensus 11 edeed~cplcie~mditdknf~pc~cgy~ic~fc~~~irq~lngrcpacrr~y~denv 68 (480)
T COG5175 11 EDEEDYCPLCIEPMDITDKNFFPCPCGYQICQFCYNNIRQNLNGRCPACRRKYDDENV 68 (480)
T ss_pred ccccccCcccccccccccCCcccCCcccHHHHHHHHHHHhhccCCChHhhhhccccce
Confidence 3445569999999865543 344568988888898776543 6789999998876553
No 67
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=97.42 E-value=6.2e-05 Score=43.97 Aligned_cols=33 Identities=30% Similarity=0.696 Sum_probs=27.0
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHH
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIE 70 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~ 70 (101)
+....|++|...+.. ......||||.||..|+.
T Consensus 76 ~~~~~C~vC~k~l~~-~~f~~~p~~~v~H~~C~~ 108 (109)
T PF10367_consen 76 TESTKCSVCGKPLGN-SVFVVFPCGHVVHYSCIK 108 (109)
T ss_pred CCCCCccCcCCcCCC-ceEEEeCCCeEEeccccc
Confidence 345679999999865 566778999999999975
No 68
>PF12906 RINGv: RING-variant domain; PDB: 2D8S_A 1VYX_A.
Probab=97.40 E-value=0.00013 Score=36.87 Aligned_cols=40 Identities=35% Similarity=0.888 Sum_probs=25.8
Q ss_pred cccccccccCCCceEEeCCC--C---cccHhHHHHHHh--cCCccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLPCD--H---FYHTPCIEPWLQ--LHGTCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~--H---~f~~~Ci~~w~~--~~~~CP~C 82 (101)
|-||++.-.... ..+.||. - ..|..|+.+|+. .+..|++|
T Consensus 1 CrIC~~~~~~~~-~li~pC~C~Gs~~~vH~~CL~~W~~~~~~~~C~~C 47 (47)
T PF12906_consen 1 CRICLEGEEEDE-PLISPCRCKGSMKYVHRSCLERWIRESGNRKCEIC 47 (47)
T ss_dssp ETTTTEE-SSSS--EE-SSS-SSCCGSEECCHHHHHHHHHT-SB-TTT
T ss_pred CeEeCCcCCCCC-ceecccccCCCcchhHHHHHHHHHHhcCCCcCCCC
Confidence 678998764433 4567753 3 689999999997 45678877
No 69
>PF05883 Baculo_RING: Baculovirus U-box/Ring-like domain; InterPro: IPR008573 This family consists of several Baculovirus proteins of around 130 residues in length. The function of this family is unknown, but it appears to be related to the U-box and ring finger domain by profile-profile comparison.
Probab=97.39 E-value=0.00011 Score=44.94 Aligned_cols=37 Identities=14% Similarity=0.579 Sum_probs=29.9
Q ss_pred ccccccccccccCCCceEEeCCC------CcccHhHHHHHHhc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCD------HFYHTPCIEPWLQL 75 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~------H~f~~~Ci~~w~~~ 75 (101)
..+|.||++.+.....++.++|+ |.||..|+.+|-..
T Consensus 26 ~~EC~IC~~~I~~~~GvV~vt~~g~lnLEkmfc~~C~~rw~~~ 68 (134)
T PF05883_consen 26 TVECQICFDRIDNNDGVVYVTDGGTLNLEKMFCADCDKRWRRE 68 (134)
T ss_pred CeeehhhhhhhhcCCCEEEEecCCeehHHHHHHHHHHHHHHhh
Confidence 57899999999764566777776 78999999999543
No 70
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.31 E-value=0.00046 Score=48.38 Aligned_cols=50 Identities=30% Similarity=0.713 Sum_probs=39.9
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHH--HhcCCccccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPW--LQLHGTCPICRQTL 86 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w--~~~~~~CP~Cr~~~ 86 (101)
..+++...|.||.+.+. ...++||+|..|..|..+. +...+.||+||.+.
T Consensus 56 dtDEen~~C~ICA~~~T---Ys~~~PC~H~~CH~Ca~RlRALY~~K~C~~CrTE~ 107 (493)
T COG5236 56 DTDEENMNCQICAGSTT---YSARYPCGHQICHACAVRLRALYMQKGCPLCRTET 107 (493)
T ss_pred ccccccceeEEecCCce---EEEeccCCchHHHHHHHHHHHHHhccCCCcccccc
Confidence 34566778999999874 4478999999999998654 45688999999865
No 71
>KOG4739|consensus
Probab=97.25 E-value=0.00015 Score=48.19 Aligned_cols=51 Identities=18% Similarity=0.613 Sum_probs=36.3
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCCcccC
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAESL 94 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~~ 94 (101)
.|.-|...-. ++...++.|+|+||..|...- ....||+|++.+........
T Consensus 5 hCn~C~~~~~-~~~f~LTaC~HvfC~~C~k~~--~~~~C~lCkk~ir~i~l~~s 55 (233)
T KOG4739|consen 5 HCNKCFRFPS-QDPFFLTACRHVFCEPCLKAS--SPDVCPLCKKSIRIIQLNRS 55 (233)
T ss_pred EeccccccCC-CCceeeeechhhhhhhhcccC--Cccccccccceeeeeecccc
Confidence 5776766433 567788889999999998652 33389999998765444433
No 72
>KOG1002|consensus
Probab=97.19 E-value=0.00016 Score=53.06 Aligned_cols=52 Identities=23% Similarity=0.675 Sum_probs=40.1
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc-----CCcccccccccccC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL-----HGTCPICRQTLHSE 89 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~-----~~~CP~Cr~~~~~~ 89 (101)
...+...|.+|.+.- +..+...|.|.||..|+..++.. +.+||.|...+..+
T Consensus 532 enk~~~~C~lc~d~a---ed~i~s~ChH~FCrlCi~eyv~~f~~~~nvtCP~C~i~LsiD 588 (791)
T KOG1002|consen 532 ENKGEVECGLCHDPA---EDYIESSCHHKFCRLCIKEYVESFMENNNVTCPVCHIGLSID 588 (791)
T ss_pred cccCceeecccCChh---hhhHhhhhhHHHHHHHHHHHHHhhhcccCCCCcccccccccc
Confidence 344567899999875 34477889999999999888742 45899998877644
No 73
>KOG1814|consensus
Probab=97.18 E-value=0.00021 Score=50.81 Aligned_cols=37 Identities=24% Similarity=0.773 Sum_probs=32.1
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHh
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ 74 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~ 74 (101)
....|.||++..........+||+|+||..|+..++.
T Consensus 183 slf~C~ICf~e~~G~~c~~~lpC~Hv~Ck~C~kdY~~ 219 (445)
T KOG1814|consen 183 SLFDCCICFEEQMGQHCFKFLPCSHVFCKSCLKDYFT 219 (445)
T ss_pred hcccceeeehhhcCcceeeecccchHHHHHHHHHHHH
Confidence 3578999999976667888899999999999999874
No 74
>PF14447 Prok-RING_4: Prokaryotic RING finger family 4
Probab=97.14 E-value=0.00024 Score=36.86 Aligned_cols=46 Identities=22% Similarity=0.499 Sum_probs=33.1
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
..|..|...- ....++||+|..+..|+.- .+-+.||+|.+++...+
T Consensus 8 ~~~~~~~~~~---~~~~~~pCgH~I~~~~f~~--~rYngCPfC~~~~~~~~ 53 (55)
T PF14447_consen 8 QPCVFCGFVG---TKGTVLPCGHLICDNCFPG--ERYNGCPFCGTPFEFDD 53 (55)
T ss_pred eeEEEccccc---cccccccccceeeccccCh--hhccCCCCCCCcccCCC
Confidence 3455555542 3347789999999999865 35567999999887654
No 75
>KOG1952|consensus
Probab=97.14 E-value=0.0002 Score=54.81 Aligned_cols=49 Identities=27% Similarity=0.770 Sum_probs=36.4
Q ss_pred CCCccccccccccccCCCceEE-eCCCCcccHhHHHHHHhcCC-------ccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRK-LPCDHFYHTPCIEPWLQLHG-------TCPICRQ 84 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~-l~C~H~f~~~Ci~~w~~~~~-------~CP~Cr~ 84 (101)
....+.|.||.+.+.....+.. ..|-|+||..||..|..... .||.|..
T Consensus 188 ~~~~yeCmIC~e~I~~t~~~WSC~sCYhVFHl~CI~~WArs~ek~~~~~WrCP~Cqs 244 (950)
T KOG1952|consen 188 SNRKYECMICTERIKRTAPVWSCKSCYHVFHLNCIKKWARSSEKTGQDGWRCPACQS 244 (950)
T ss_pred hcCceEEEEeeeeccccCCceecchhhhhhhHHHHHHHHHHhhhccCccccCCcccc
Confidence 3456899999999865543332 24899999999999975411 5999984
No 76
>KOG4275|consensus
Probab=97.12 E-value=8.3e-05 Score=50.78 Aligned_cols=42 Identities=31% Similarity=0.664 Sum_probs=32.5
Q ss_pred ccccccccccccCCCceEEeCCCC-cccHhHHHHHHhcCCcccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
...|.||++.- ..-+.|+||| +-|..|-.+. ..||+||+.+.
T Consensus 300 ~~LC~ICmDaP---~DCvfLeCGHmVtCt~CGkrm----~eCPICRqyi~ 342 (350)
T KOG4275|consen 300 RRLCAICMDAP---RDCVFLECGHMVTCTKCGKRM----NECPICRQYIV 342 (350)
T ss_pred HHHHHHHhcCC---cceEEeecCcEEeehhhcccc----ccCchHHHHHH
Confidence 67899999964 3447799999 5578887654 37999998764
No 77
>KOG4185|consensus
Probab=97.07 E-value=0.00035 Score=47.87 Aligned_cols=47 Identities=30% Similarity=0.684 Sum_probs=38.1
Q ss_pred cccccccccccCCC---ceEEeCCCCcccHhHHHHHHhc-CCccccccccc
Q psy8622 40 LQCSVCWEQFTLDE---AVRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTL 86 (101)
Q Consensus 40 ~~C~IC~~~~~~~~---~~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~ 86 (101)
..|-||-++|...+ .++.|.|||.||..|+...+.. ...||.||...
T Consensus 4 ~~c~~c~~~~s~~~~~~~p~~l~c~h~~c~~c~~~l~~~~~i~cpfcR~~~ 54 (296)
T KOG4185|consen 4 PECEICNEDYSSEDGDHIPRVLKCGHTICQNCASKLLGNSRILCPFCRETT 54 (296)
T ss_pred CceeecCccccccCcccCCcccccCceehHhHHHHHhcCceeeccCCCCcc
Confidence 57999999986553 5677789999999999987754 44799999984
No 78
>PHA02825 LAP/PHD finger-like protein; Provisional
Probab=97.06 E-value=0.00064 Score=42.66 Aligned_cols=50 Identities=20% Similarity=0.615 Sum_probs=35.9
Q ss_pred CCCccccccccccccCCCceEEeCC--CC---cccHhHHHHHHhc--CCcccccccccccC
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPC--DH---FYHTPCIEPWLQL--HGTCPICRQTLHSE 89 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C--~H---~f~~~Ci~~w~~~--~~~CP~Cr~~~~~~ 89 (101)
...+..|-||.+... . ..-|| .. ..|.+|+.+|+.. ...|++|+.++.-.
T Consensus 5 s~~~~~CRIC~~~~~--~--~~~PC~CkGs~k~VH~sCL~rWi~~s~~~~CeiC~~~Y~i~ 61 (162)
T PHA02825 5 SLMDKCCWICKDEYD--V--VTNYCNCKNENKIVHKECLEEWINTSKNKSCKICNGPYNIK 61 (162)
T ss_pred CCCCCeeEecCCCCC--C--ccCCcccCCCchHHHHHHHHHHHhcCCCCcccccCCeEEEE
Confidence 345678999998742 1 22464 44 5699999999964 55899999987543
No 79
>KOG1571|consensus
Probab=97.02 E-value=0.00072 Score=47.34 Aligned_cols=49 Identities=27% Similarity=0.539 Sum_probs=35.0
Q ss_pred ccCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 33 VRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 33 ~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
.........|.||++...+ ...+||||+-| |..-. +....||+||+.+.
T Consensus 299 ~~~~~~p~lcVVcl~e~~~---~~fvpcGh~cc--ct~cs-~~l~~CPvCR~rI~ 347 (355)
T KOG1571|consen 299 FRELPQPDLCVVCLDEPKS---AVFVPCGHVCC--CTLCS-KHLPQCPVCRQRIR 347 (355)
T ss_pred ccccCCCCceEEecCCccc---eeeecCCcEEE--chHHH-hhCCCCchhHHHHH
Confidence 4455667889999997643 47799999866 54332 33456999998774
No 80
>PF08746 zf-RING-like: RING-like domain; InterPro: IPR014857 This is a zinc finger domain that is related to the C3HC4 RING finger domain (IPR001841 from INTERPRO). ; PDB: 3NW0_A 2CT0_A.
Probab=97.01 E-value=0.001 Score=32.98 Aligned_cols=38 Identities=26% Similarity=0.765 Sum_probs=22.2
Q ss_pred cccccccccCCCceEEeC---CCCcccHhHHHHHHhcCC--ccccc
Q psy8622 42 CSVCWEQFTLDEAVRKLP---CDHFYHTPCIEPWLQLHG--TCPIC 82 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~---C~H~f~~~Ci~~w~~~~~--~CP~C 82 (101)
|.+|.+.+..+ +.-+ |+-.+|..|+..++.... .||.|
T Consensus 1 C~~C~~iv~~G---~~C~~~~C~~r~H~~C~~~y~r~~~~~~CP~C 43 (43)
T PF08746_consen 1 CEACKEIVTQG---QRCSNRDCNVRLHDDCFKKYFRHRSNPKCPNC 43 (43)
T ss_dssp -TTT-SB-SSS---EE-SS--S--EE-HHHHHHHTTT-SS-B-TTT
T ss_pred CcccchhHeee---ccCCCCccCchHHHHHHHHHHhcCCCCCCcCC
Confidence 56777776555 3333 888999999999997655 79987
No 81
>PHA02862 5L protein; Provisional
Probab=96.95 E-value=0.00079 Score=41.63 Aligned_cols=46 Identities=17% Similarity=0.574 Sum_probs=33.9
Q ss_pred ccccccccccccCCCceEEeCCC-----CcccHhHHHHHHhc--CCccccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCD-----HFYHTPCIEPWLQL--HGTCPICRQTLHS 88 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~-----H~f~~~Ci~~w~~~--~~~CP~Cr~~~~~ 88 (101)
...|-||.+.-... .-||. ...|..|+.+|+.. +..|++|+.++.-
T Consensus 2 ~diCWIC~~~~~e~----~~PC~C~GS~K~VHq~CL~~WIn~S~k~~CeLCkteY~I 54 (156)
T PHA02862 2 SDICWICNDVCDER----NNFCGCNEEYKVVHIKCMQLWINYSKKKECNLCKTKYNI 54 (156)
T ss_pred CCEEEEecCcCCCC----cccccccCcchhHHHHHHHHHHhcCCCcCccCCCCeEEE
Confidence 35799999974322 35653 46899999999964 5589999998753
No 82
>PHA03096 p28-like protein; Provisional
Probab=96.79 E-value=0.00065 Score=46.60 Aligned_cols=36 Identities=22% Similarity=0.496 Sum_probs=26.6
Q ss_pred cccccccccccCCC----ceEEeC-CCCcccHhHHHHHHhc
Q psy8622 40 LQCSVCWEQFTLDE----AVRKLP-CDHFYHTPCIEPWLQL 75 (101)
Q Consensus 40 ~~C~IC~~~~~~~~----~~~~l~-C~H~f~~~Ci~~w~~~ 75 (101)
-.|.||++...... ....|+ |.|.||..|+..|...
T Consensus 179 k~c~ic~e~~~~k~~~~~~fgil~~c~h~fc~~ci~~wr~~ 219 (284)
T PHA03096 179 KICGICLENIKAKYIIKKYYGILSEIKHEFNIFCIKIWMTE 219 (284)
T ss_pred hhcccchhhhhhhccccccccccccCCcHHHHHHHHHHHHh
Confidence 57999999764321 223344 9999999999999753
No 83
>COG5222 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.77 E-value=0.0008 Score=46.37 Aligned_cols=43 Identities=30% Similarity=0.692 Sum_probs=34.9
Q ss_pred ccccccccccccCCCceEEeC-CCCcccHhHHHHHH-hcCCccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLP-CDHFYHTPCIEPWL-QLHGTCPICRQ 84 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~-C~H~f~~~Ci~~w~-~~~~~CP~Cr~ 84 (101)
.+.|+.|...+.++ ..++ |+|.||..||...+ .....||.|.+
T Consensus 274 ~LkCplc~~Llrnp---~kT~cC~~~fc~eci~~al~dsDf~CpnC~r 318 (427)
T COG5222 274 SLKCPLCHCLLRNP---MKTPCCGHTFCDECIGTALLDSDFKCPNCSR 318 (427)
T ss_pred cccCcchhhhhhCc---ccCccccchHHHHHHhhhhhhccccCCCccc
Confidence 48899999988655 4455 89999999999776 45778999965
No 84
>KOG1940|consensus
Probab=96.71 E-value=0.00078 Score=45.91 Aligned_cols=47 Identities=36% Similarity=0.804 Sum_probs=37.6
Q ss_pred CccccccccccccCC-CceEEeCCCCcccHhHHHHHHhcCCccccccc
Q psy8622 38 SNLQCSVCWEQFTLD-EAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~-~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
....||||.+.+... ..+..++|+|..|..|+.......-+||+|.+
T Consensus 157 ~~~ncPic~e~l~~s~~~~~~~~CgH~~h~~cf~e~~~~~y~CP~C~~ 204 (276)
T KOG1940|consen 157 SEFNCPICKEYLFLSFEDAGVLKCGHYMHSRCFEEMICEGYTCPICSK 204 (276)
T ss_pred ccCCCchhHHHhccccccCCccCcccchHHHHHHHHhccCCCCCcccc
Confidence 344599999976433 35567889999999999988776789999988
No 85
>KOG3268|consensus
Probab=96.67 E-value=0.0012 Score=42.26 Aligned_cols=30 Identities=30% Similarity=0.924 Sum_probs=24.1
Q ss_pred CCCCcccHhHHHHHHhc----C-------Cccccccccccc
Q psy8622 59 PCDHFYHTPCIEPWLQL----H-------GTCPICRQTLHS 88 (101)
Q Consensus 59 ~C~H~f~~~Ci~~w~~~----~-------~~CP~Cr~~~~~ 88 (101)
.|+..||.-|+..|++. + ..||.|..++..
T Consensus 189 qCgkpFHqiCL~dWLRgilTsRQSFdiiFGeCPYCS~Pial 229 (234)
T KOG3268|consen 189 QCGKPFHQICLTDWLRGILTSRQSFDIIFGECPYCSDPIAL 229 (234)
T ss_pred ccCCcHHHHHHHHHHHHHhhccceeeeeeccCCCCCCccee
Confidence 49999999999999863 1 159999887754
No 86
>KOG0826|consensus
Probab=96.66 E-value=0.0016 Score=45.30 Aligned_cols=50 Identities=18% Similarity=0.342 Sum_probs=38.4
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
..+...||+|+....++ .+..--|-+||..|+..++...+.||+=..++.
T Consensus 297 ~~~~~~CpvClk~r~Np--tvl~vSGyVfCY~Ci~~Yv~~~~~CPVT~~p~~ 346 (357)
T KOG0826|consen 297 PPDREVCPVCLKKRQNP--TVLEVSGYVFCYPCIFSYVVNYGHCPVTGYPAS 346 (357)
T ss_pred CCccccChhHHhccCCC--ceEEecceEEeHHHHHHHHHhcCCCCccCCcch
Confidence 44467899999987654 232335899999999999999999998766554
No 87
>KOG2114|consensus
Probab=96.52 E-value=0.0013 Score=50.52 Aligned_cols=43 Identities=30% Similarity=0.699 Sum_probs=34.2
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
...|.+|--.+. -+.+...|+|.||..|+. .....||.|+.++
T Consensus 840 ~skCs~C~~~Ld--lP~VhF~CgHsyHqhC~e---~~~~~CP~C~~e~ 882 (933)
T KOG2114|consen 840 VSKCSACEGTLD--LPFVHFLCGHSYHQHCLE---DKEDKCPKCLPEL 882 (933)
T ss_pred eeeecccCCccc--cceeeeecccHHHHHhhc---cCcccCCccchhh
Confidence 368999988875 345777899999999998 4566899998733
No 88
>KOG0801|consensus
Probab=96.48 E-value=0.00082 Score=42.41 Aligned_cols=31 Identities=29% Similarity=0.803 Sum_probs=27.1
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccH
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHT 66 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~ 66 (101)
.++..+|.||+|++..++.+..|||-.+||.
T Consensus 174 ~ddkGECvICLEdL~~GdtIARLPCLCIYHK 204 (205)
T KOG0801|consen 174 KDDKGECVICLEDLEAGDTIARLPCLCIYHK 204 (205)
T ss_pred cccCCcEEEEhhhccCCCceeccceEEEeec
Confidence 3456789999999999999999999999885
No 89
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=96.45 E-value=0.003 Score=39.65 Aligned_cols=51 Identities=24% Similarity=0.607 Sum_probs=33.4
Q ss_pred ccccccccccccCCCceEEeC------------CCCc-ccHhHHHHHHhc------------------------------
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLP------------CDHF-YHTPCIEPWLQL------------------------------ 75 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~------------C~H~-f~~~Ci~~w~~~------------------------------ 75 (101)
+..|||||+--.+. +.|- |+.. -|..||.++.+.
T Consensus 2 d~~CpICme~PHNA---VLLlCSS~~kgcRpymc~Ts~rhSNCLdqfkka~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (162)
T PF07800_consen 2 DVTCPICMEHPHNA---VLLLCSSHEKGCRPYMCDTSYRHSNCLDQFKKAYGKSSSSSSQSSSSAPSDSSSSESSESQEQ 78 (162)
T ss_pred CccCceeccCCCce---EEEEeccccCCccccccCCccchhHHHHHHHHHhcCCCCccccccccCcCCCccccccccccc
Confidence 56899999975433 4443 4432 367899887431
Q ss_pred -CCcccccccccccCCcc
Q psy8622 76 -HGTCPICRQTLHSESAE 92 (101)
Q Consensus 76 -~~~CP~Cr~~~~~~~~~ 92 (101)
...||+||..|.....+
T Consensus 79 ~~L~CPLCRG~V~GWtvv 96 (162)
T PF07800_consen 79 PELACPLCRGEVKGWTVV 96 (162)
T ss_pred ccccCccccCceeceEEc
Confidence 11599999999766544
No 90
>KOG1001|consensus
Probab=96.45 E-value=0.0011 Score=50.34 Aligned_cols=51 Identities=24% Similarity=0.634 Sum_probs=40.7
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCcccccccccccCCcccC
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQTLHSESAESL 94 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~~~~~~~~~~~ 94 (101)
..|.||++ . +..++++|+|.||..|+...+.. ...||.||..+......+.
T Consensus 455 ~~c~ic~~-~---~~~~it~c~h~~c~~c~~~~i~~~~~~~~~~cr~~l~~~~l~s~ 507 (674)
T KOG1001|consen 455 HWCHICCD-L---DSFFITRCGHDFCVECLKKSIQQSENAPCPLCRNVLKEKKLLSA 507 (674)
T ss_pred cccccccc-c---ccceeecccchHHHHHHHhccccccCCCCcHHHHHHHHHHHhhc
Confidence 78999999 2 56688899999999999987754 3369999998876655543
No 91
>PF10272 Tmpp129: Putative transmembrane protein precursor; InterPro: IPR018801 This entry consists of proteins conserved from worms to humans. They are purported to be transmembrane protein-precursors but their function is unknown.
Probab=96.21 E-value=0.0027 Score=44.91 Aligned_cols=31 Identities=23% Similarity=0.754 Sum_probs=24.3
Q ss_pred CCCcccHhHHHHHHhcCC-------------cccccccccccCC
Q psy8622 60 CDHFYHTPCIEPWLQLHG-------------TCPICRQTLHSES 90 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~~-------------~CP~Cr~~~~~~~ 90 (101)
|....|.+|+.+|+..++ .||.||+.+...+
T Consensus 311 CRPmWC~~Cm~kwFasrQd~~~~~~Wl~~~~~CPtCRa~FCilD 354 (358)
T PF10272_consen 311 CRPMWCLECMGKWFASRQDQQHPETWLSGKCPCPTCRAKFCILD 354 (358)
T ss_pred ccchHHHHHHHHHhhhcCCCCChhhhhcCCCCCCCCcccceeee
Confidence 667889999999985322 6999999987544
No 92
>KOG4367|consensus
Probab=96.13 E-value=0.0038 Score=45.21 Aligned_cols=35 Identities=29% Similarity=0.600 Sum_probs=29.9
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHHHHHh
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ 74 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~ 74 (101)
+++..|+||...|..+ ++|||+|..|..|....+.
T Consensus 2 eeelkc~vc~~f~~ep---iil~c~h~lc~~ca~~~~~ 36 (699)
T KOG4367|consen 2 EEELKCPVCGSFYREP---IILPCSHNLCQACARNILV 36 (699)
T ss_pred cccccCceehhhccCc---eEeecccHHHHHHHHhhcc
Confidence 4578999999998655 8999999999999987654
No 93
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=96.10 E-value=0.0061 Score=37.25 Aligned_cols=57 Identities=23% Similarity=0.497 Sum_probs=41.4
Q ss_pred cCCCCccccccccccccCCCceEEeC---CCCcccHhHHHHHHhc---CCcccccccccccCCcc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLP---CDHFYHTPCIEPWLQL---HGTCPICRQTLHSESAE 92 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~---C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~~~~~~ 92 (101)
-.+..-.+|.||.|... +.-..-| ||-..|..|-...|+. .+.||+|+..+.+++..
T Consensus 75 F~d~~lYeCnIC~etS~--ee~FLKPneCCgY~iCn~Cya~LWK~~~~ypvCPvCkTSFKss~~~ 137 (140)
T PF05290_consen 75 FLDPKLYECNICKETSA--EERFLKPNECCGYSICNACYANLWKFCNLYPVCPVCKTSFKSSSSA 137 (140)
T ss_pred ecCCCceeccCcccccc--hhhcCCcccccchHHHHHHHHHHHHHcccCCCCCcccccccccccc
Confidence 34446789999999754 3334344 8999999988776654 56899999999776543
No 94
>KOG2932|consensus
Probab=96.08 E-value=0.0026 Score=44.04 Aligned_cols=43 Identities=23% Similarity=0.641 Sum_probs=30.1
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
.|.-|--.+. ...+.+||.|+||.+|... ...+.||.|-..|.
T Consensus 92 fCd~Cd~PI~--IYGRmIPCkHvFCl~CAr~--~~dK~Cp~C~d~Vq 134 (389)
T KOG2932|consen 92 FCDRCDFPIA--IYGRMIPCKHVFCLECARS--DSDKICPLCDDRVQ 134 (389)
T ss_pred eecccCCcce--eeecccccchhhhhhhhhc--CccccCcCcccHHH
Confidence 4555544332 2346789999999999865 45678999977664
No 95
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=96.01 E-value=0.011 Score=30.70 Aligned_cols=34 Identities=29% Similarity=0.684 Sum_probs=27.7
Q ss_pred CccccccccccccCCCceEEeC-CCCcccHhHHHH
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLP-CDHFYHTPCIEP 71 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~-C~H~f~~~Ci~~ 71 (101)
....|++|-+.|.+++.+++-| |+-.+|+.|+..
T Consensus 4 ~~~~C~~Cg~~~~~~dDiVvCp~CgapyHR~C~~~ 38 (54)
T PF14446_consen 4 EGCKCPVCGKKFKDGDDIVVCPECGAPYHRDCWEK 38 (54)
T ss_pred cCccChhhCCcccCCCCEEECCCCCCcccHHHHhh
Confidence 4568999999997666666666 999999999764
No 96
>KOG3800|consensus
Probab=95.78 E-value=0.0096 Score=40.80 Aligned_cols=47 Identities=21% Similarity=0.495 Sum_probs=33.7
Q ss_pred ccccccccc-cCCC-ceEEeCCCCcccHhHHHHHHhc-CCcccccccccc
Q psy8622 41 QCSVCWEQF-TLDE-AVRKLPCDHFYHTPCIEPWLQL-HGTCPICRQTLH 87 (101)
Q Consensus 41 ~C~IC~~~~-~~~~-~~~~l~C~H~f~~~Ci~~w~~~-~~~CP~Cr~~~~ 87 (101)
.||+|.... .+++ ...+-+|+|..|.+|+...+.. ...||.|...+.
T Consensus 2 ~Cp~CKt~~Y~np~lk~~in~C~H~lCEsCvd~iF~~g~~~CpeC~~iLR 51 (300)
T KOG3800|consen 2 ACPKCKTDRYLNPDLKLMINECGHRLCESCVDRIFSLGPAQCPECMVILR 51 (300)
T ss_pred CCcccccceecCccceeeeccccchHHHHHHHHHHhcCCCCCCcccchhh
Confidence 588998753 2322 2223379999999999998865 558999977654
No 97
>KOG2817|consensus
Probab=95.64 E-value=0.013 Score=41.79 Aligned_cols=54 Identities=20% Similarity=0.293 Sum_probs=41.9
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcC---CcccccccccccCC
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLH---GTCPICRQTLHSES 90 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~---~~CP~Cr~~~~~~~ 90 (101)
...+.|||=.+.-...+++..|.|||+....-+.+..+.. -.||+|-......+
T Consensus 332 HSvF~CPVlKeqtsdeNPPm~L~CGHVISkdAlnrLS~ng~~sfKCPYCP~e~~~~~ 388 (394)
T KOG2817|consen 332 HSVFICPVLKEQTSDENPPMMLICGHVISKDALNRLSKNGSQSFKCPYCPVEQLASD 388 (394)
T ss_pred cceeecccchhhccCCCCCeeeeccceecHHHHHHHhhCCCeeeeCCCCCcccCHHh
Confidence 3467899988877666788999999999999999876643 37999966655443
No 98
>KOG0827|consensus
Probab=95.51 E-value=0.00069 Score=48.04 Aligned_cols=50 Identities=26% Similarity=0.642 Sum_probs=41.1
Q ss_pred ccccccccccccCC-CceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 39 NLQCSVCWEQFTLD-EAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 39 ~~~C~IC~~~~~~~-~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
...|+||.+.+... +....+.|||.+|..||.+|+.....||.|++.+..
T Consensus 196 v~sl~I~~~slK~~y~k~~~~~~g~~~~~~kL~k~L~~~~kl~~~~rel~~ 246 (465)
T KOG0827|consen 196 VGSLSICFESLKQNYDKISAIVCGHIYHHGKLSKWLATKRKLPSCRRELPK 246 (465)
T ss_pred HhhhHhhHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHhHHHHhhhhh
Confidence 45789999887543 345567799999999999999988899999998854
No 99
>KOG2034|consensus
Probab=95.48 E-value=0.0091 Score=46.34 Aligned_cols=37 Identities=19% Similarity=0.573 Sum_probs=29.2
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHH
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWL 73 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~ 73 (101)
.+....|.+|...+.. .+..+.||||.||+.|+.+..
T Consensus 814 ~ep~d~C~~C~~~ll~-~pF~vf~CgH~FH~~Cl~~~v 850 (911)
T KOG2034|consen 814 LEPQDSCDHCGRPLLI-KPFYVFPCGHCFHRDCLIRHV 850 (911)
T ss_pred ecCccchHHhcchhhc-CcceeeeccchHHHHHHHHHH
Confidence 4456689999988754 355778999999999998764
No 100
>KOG1100|consensus
Probab=94.73 E-value=0.014 Score=38.35 Aligned_cols=41 Identities=39% Similarity=0.722 Sum_probs=30.4
Q ss_pred cccccccccCCCceEEeCCCC-cccHhHHHHHHhcCCcccccccccccC
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
|-.|-+.- ..+..+||.| .+|..|-.. -..||+|+......
T Consensus 161 Cr~C~~~~---~~VlllPCrHl~lC~~C~~~----~~~CPiC~~~~~s~ 202 (207)
T KOG1100|consen 161 CRKCGERE---ATVLLLPCRHLCLCGICDES----LRICPICRSPKTSS 202 (207)
T ss_pred ceecCcCC---ceEEeecccceEeccccccc----CccCCCCcChhhce
Confidence 88887752 4578889999 678888643 45699998876543
No 101
>KOG3053|consensus
Probab=94.64 E-value=0.018 Score=38.99 Aligned_cols=54 Identities=20% Similarity=0.616 Sum_probs=36.4
Q ss_pred cCCCCccccccccccccCCC-ceEEeCC-----CCcccHhHHHHHHhcC--------Ccccccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDE-AVRKLPC-----DHFYHTPCIEPWLQLH--------GTCPICRQTLH 87 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~-~~~~l~C-----~H~f~~~Ci~~w~~~~--------~~CP~Cr~~~~ 87 (101)
...+.+..|-||+..-++.. ..-+-|| .|..|.+|+.+|+..+ -.||.|+.+..
T Consensus 15 ~~~e~eR~CWiCF~TdeDn~~a~WV~PCrCRGt~KWVHqsCL~rWiDEK~~~n~~q~V~C~QCqTEYi 82 (293)
T KOG3053|consen 15 DNQELERCCWICFATDEDNRLAAWVHPCRCRGTTKWVHQSCLSRWIDEKQRGNPLQTVSCPQCQTEYI 82 (293)
T ss_pred CccccceeEEEEeccCcccchhhhcccccccCccHHHHHHHHHHHHhHHhcCCCCceeechhhcchhe
Confidence 34555778999998643222 1133465 3789999999998532 26999998763
No 102
>KOG3899|consensus
Probab=94.52 E-value=0.018 Score=39.75 Aligned_cols=31 Identities=19% Similarity=0.644 Sum_probs=23.9
Q ss_pred CCCcccHhHHHHHHhc-------------CCcccccccccccCC
Q psy8622 60 CDHFYHTPCIEPWLQL-------------HGTCPICRQTLHSES 90 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~-------------~~~CP~Cr~~~~~~~ 90 (101)
|....|.+|+.+|+.. +-+||+||+.+...+
T Consensus 325 crp~wc~~cla~~f~~rq~~v~r~~~~~~~~~cp~cr~~fci~d 368 (381)
T KOG3899|consen 325 CRPLWCRSCLAQIFIGRQDNVYRYEYHRGSAQCPTCRKNFCIRD 368 (381)
T ss_pred cccHHHHHHHHHHHhhcccchhHHHHHhcCCCCcchhhceEEee
Confidence 6678899999999743 237999999886443
No 103
>PF03854 zf-P11: P-11 zinc finger; InterPro: IPR003224 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The RING-finger is a specialised type of Zn-finger of 40 to 60 residues that binds two atoms of zinc, and is probably involved in mediating protein-protein interactions [, , ]. There are two different variants, the C3HC4-type and a C3H2C3-type, which is clearly related despite the different cysteine/histidine pattern. The latter type is sometimes referred to as 'RING-H2 finger'. The RING domain is a protein interaction domain which has been implicated in a range of diverse biological processes. Several 3D-structures for RING-fingers are known [, ]. The 3D structure of the zinc ligation system is unique to the RING domain and is referred to as the 'cross-brace' motif. The spacing of the cysteines in such a domain is: C-x(2)-C-x(9 to 39)-C-x(1 to 3)-H-x(2 to 3)-C-x(2)-C-x(4 to 48)-C-x(2)-C Metal ligand pairs one and three co-ordinate to bind one zinc ion, whilst pairs two and four bind the second. This entry represents RING finger protein Z, a small polypeptide found in some negative-strand RNA viruses including Lassa fever virus, which plays a crucial role in virion assembly and budding. RING finger Z has been shown to interact with several host proteins, including promyelocytic leukemia protein and the eukaryotic translation initiation factor 4E [, ]. It is sufficient in the absence of any other viral proteins to release virus-like particles from the infected cell []. This protein is also responsible for arenavirus superinfection exclusion; expression of this protein in a host cell strongly and specifically inhibits areanavirus transcription and replication []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003723 RNA binding, 0008270 zinc ion binding; PDB: 2KO5_A.
Probab=94.34 E-value=0.024 Score=28.56 Aligned_cols=31 Identities=32% Similarity=0.840 Sum_probs=22.4
Q ss_pred eCC-CCcccHhHHHHHHhcCCccccccccccc
Q psy8622 58 LPC-DHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 58 l~C-~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
..| .|..|..|+...+.....||+|..+++.
T Consensus 16 i~C~dHYLCl~CLt~ml~~s~~C~iC~~~LPt 47 (50)
T PF03854_consen 16 IKCSDHYLCLNCLTLMLSRSDRCPICGKPLPT 47 (50)
T ss_dssp EE-SS-EEEHHHHHHT-SSSSEETTTTEE---
T ss_pred eeecchhHHHHHHHHHhccccCCCcccCcCcc
Confidence 446 5899999999999999999999988753
No 104
>COG5220 TFB3 Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB3 [Cell division and chromosome partitioning / Transcription / DNA replication, recombination, and repair]
Probab=94.20 E-value=0.023 Score=38.24 Aligned_cols=48 Identities=21% Similarity=0.508 Sum_probs=33.8
Q ss_pred Ccccccccccccc-CCC-ceEEeC-CCCcccHhHHHHHHhcC-Cccc--ccccc
Q psy8622 38 SNLQCSVCWEQFT-LDE-AVRKLP-CDHFYHTPCIEPWLQLH-GTCP--ICRQT 85 (101)
Q Consensus 38 ~~~~C~IC~~~~~-~~~-~~~~l~-C~H~f~~~Ci~~w~~~~-~~CP--~Cr~~ 85 (101)
.+..||||..+.. +++ ...+-| |-|..|.+|+.+.+... ..|| -|.+-
T Consensus 9 ~d~~CPvCksDrYLnPdik~linPECyHrmCESCvdRIFs~GpAqCP~~gC~kI 62 (314)
T COG5220 9 EDRRCPVCKSDRYLNPDIKILINPECYHRMCESCVDRIFSRGPAQCPYKGCGKI 62 (314)
T ss_pred hcccCCccccccccCCCeEEEECHHHHHHHHHHHHHHHhcCCCCCCCCccHHHH
Confidence 3558999998642 333 233335 99999999999998764 4798 67543
No 105
>KOG3002|consensus
Probab=94.15 E-value=0.041 Score=38.19 Aligned_cols=49 Identities=22% Similarity=0.546 Sum_probs=35.5
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
....+-..||||.+.+..+ ...-.=||..|..|-.+ ....||.||.++.
T Consensus 43 ~~~~~lleCPvC~~~l~~P--i~QC~nGHlaCssC~~~---~~~~CP~Cr~~~g 91 (299)
T KOG3002|consen 43 LLDLDLLDCPVCFNPLSPP--IFQCDNGHLACSSCRTK---VSNKCPTCRLPIG 91 (299)
T ss_pred ccchhhccCchhhccCccc--ceecCCCcEehhhhhhh---hcccCCccccccc
Confidence 3455678999999998654 22222379999998753 4667999999885
No 106
>KOG0298|consensus
Probab=93.99 E-value=0.016 Score=46.72 Aligned_cols=50 Identities=24% Similarity=0.549 Sum_probs=40.0
Q ss_pred cCCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccc
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQT 85 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~ 85 (101)
........|.||++.+.. .-.+..|+|.+|..|...|+..+..||.|...
T Consensus 1148 ~~~~~~~~c~ic~dil~~--~~~I~~cgh~~c~~c~~~~l~~~s~~~~~ksi 1197 (1394)
T KOG0298|consen 1148 MNLSGHFVCEICLDILRN--QGGIAGCGHEPCCRCDELWLYASSRCPICKSI 1197 (1394)
T ss_pred HHhhcccchHHHHHHHHh--cCCeeeechhHhhhHHHHHHHHhccCcchhhh
Confidence 334455699999998862 22556799999999999999999999999743
No 107
>KOG0309|consensus
Probab=93.78 E-value=0.039 Score=42.60 Aligned_cols=42 Identities=26% Similarity=0.631 Sum_probs=30.6
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPI 81 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~ 81 (101)
...|.||.-.+.. ...+-..|+|+.|.+|...|+.....||.
T Consensus 1028 ~~~C~~C~l~V~g-ss~~Cg~C~Hv~H~sc~~eWf~~gd~Cps 1069 (1081)
T KOG0309|consen 1028 TFQCAICHLAVRG-SSNFCGTCGHVGHTSCMMEWFRTGDVCPS 1069 (1081)
T ss_pred eeeeeeEeeEeec-cchhhccccccccHHHHHHHHhcCCcCCC
Confidence 3457777665432 23344469999999999999998888874
No 108
>KOG3113|consensus
Probab=93.38 E-value=0.11 Score=35.29 Aligned_cols=60 Identities=10% Similarity=0.223 Sum_probs=43.7
Q ss_pred CCccccccccccccCCCce-EEeCCCCcccHhHHHHHHhcCCcccccccccccCCcccCCCCC
Q psy8622 37 DSNLQCSVCWEQFTLDEAV-RKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAESLGPAG 98 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~-~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~~~~~~ 98 (101)
...+.|||=--++...-.. ...+|||+|-..-+.+. ....|++|.+.+..++.+-+.++.
T Consensus 109 ~a~fiCPvtgleMng~~~F~~l~~CGcV~SerAlKei--kas~C~~C~a~y~~~dvIvlNg~~ 169 (293)
T KOG3113|consen 109 RARFICPVTGLEMNGKYRFCALRCCGCVFSERALKEI--KASVCHVCGAAYQEDDVIVLNGTE 169 (293)
T ss_pred cceeecccccceecceEEEEEEeccceeccHHHHHHh--hhccccccCCcccccCeEeeCCCH
Confidence 3467899876666433333 33469999988887774 467899999999988888777653
No 109
>KOG1609|consensus
Probab=93.08 E-value=0.12 Score=35.36 Aligned_cols=50 Identities=26% Similarity=0.679 Sum_probs=35.8
Q ss_pred ccccccccccccCCCc-eEEeCCC-----CcccHhHHHHHHh--cCCccccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEA-VRKLPCD-----HFYHTPCIEPWLQ--LHGTCPICRQTLHS 88 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~-~~~l~C~-----H~f~~~Ci~~w~~--~~~~CP~Cr~~~~~ 88 (101)
+..|-||......... ....||. ...|..|+..|+. .+..|.+|......
T Consensus 78 ~~~cRIc~~~~~~~~~~~l~~pC~C~g~l~~vH~~cl~~W~~~~~~~~CeiC~~~~~~ 135 (323)
T KOG1609|consen 78 GPICRICHEEDEESNGLLLISPCSCKGSLAYVHRSCLEKWFSIKGNITCEICKSFFIN 135 (323)
T ss_pred CCcEEEEecccccccccccccCccccCcHHHHHHHHHHhhhccccCeeeeccccccee
Confidence 4689999997542211 3566763 4579999999997 56689999876643
No 110
>PF02891 zf-MIZ: MIZ/SP-RING zinc finger; InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=93.08 E-value=0.033 Score=28.38 Aligned_cols=43 Identities=28% Similarity=0.689 Sum_probs=19.6
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhc---C--Ccccccccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL---H--GTCPICRQT 85 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~---~--~~CP~Cr~~ 85 (101)
..|||-...+. .+++...|.|.-|.+ +..|+.. . -.||+|.++
T Consensus 3 L~CPls~~~i~--~P~Rg~~C~H~~CFD-l~~fl~~~~~~~~W~CPiC~~~ 50 (50)
T PF02891_consen 3 LRCPLSFQRIR--IPVRGKNCKHLQCFD-LESFLESNQRTPKWKCPICNKP 50 (50)
T ss_dssp SB-TTTSSB-S--SEEEETT--SS--EE-HHHHHHHHHHS---B-TTT---
T ss_pred eeCCCCCCEEE--eCccCCcCcccceEC-HHHHHHHhhccCCeECcCCcCc
Confidence 57888888775 345666799975543 3344432 2 269999763
No 111
>KOG1812|consensus
Probab=93.01 E-value=0.037 Score=39.63 Aligned_cols=38 Identities=18% Similarity=0.655 Sum_probs=27.3
Q ss_pred Ccccccccc-ccccCCCceEEeCCCCcccHhHHHHHHhc
Q psy8622 38 SNLQCSVCW-EQFTLDEAVRKLPCDHFYHTPCIEPWLQL 75 (101)
Q Consensus 38 ~~~~C~IC~-~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~ 75 (101)
....|.||. +............|+|.||..|+.+.+..
T Consensus 145 ~~~~C~iC~~e~~~~~~~f~~~~C~H~fC~~C~k~~iev 183 (384)
T KOG1812|consen 145 PKEECGICFVEDPEAEDMFSVLKCGHRFCKDCVKQHIEV 183 (384)
T ss_pred ccccCccCccccccHhhhHHHhcccchhhhHHhHHHhhh
Confidence 366899999 44333233335669999999999988763
No 112
>KOG0802|consensus
Probab=93.00 E-value=0.1 Score=38.89 Aligned_cols=56 Identities=25% Similarity=0.780 Sum_probs=42.7
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCCcccCCCC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSESAESLGPA 97 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~~~~~~ 97 (101)
.......|.||+... ..+..+|. |..|+..|+..+..||+|+..+..++.......
T Consensus 475 l~~~~~~~~~~~~~~----~~~~~~~~---~~~~l~~~~~~~~~~pl~~~~~~~~~~~~~~~~ 530 (543)
T KOG0802|consen 475 LREPNDVCAICYQEM----SARITPCS---HALCLRKWLYVQEVCPLCHTYMKEDDFLSKNSS 530 (543)
T ss_pred hhcccCcchHHHHHH----Hhcccccc---chhHHHhhhhhccccCCCchhhhcccccCcccc
Confidence 344566899999986 23566777 788999999999999999998877666554443
No 113
>KOG4362|consensus
Probab=91.96 E-value=0.039 Score=42.01 Aligned_cols=48 Identities=23% Similarity=0.693 Sum_probs=37.6
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhc---CCccccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL---HGTCPICRQTLHS 88 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~~ 88 (101)
....|+||...+..+ ..+.|.|.|+..|+..-+.. ...||+|+..+..
T Consensus 20 k~lEc~ic~~~~~~p---~~~kc~~~~l~~~~n~~f~~~~~~~~~~lc~~~~eK 70 (684)
T KOG4362|consen 20 KILECPICLEHVKEP---SLLKCDHIFLKFCLNKLFESKKGPKQCALCKSDIEK 70 (684)
T ss_pred hhccCCceeEEeecc---chhhhhHHHHhhhhhceeeccCccccchhhhhhhhh
Confidence 457899999988655 67889999999999765543 4479999876643
No 114
>KOG1815|consensus
Probab=90.85 E-value=0.17 Score=36.86 Aligned_cols=38 Identities=26% Similarity=0.693 Sum_probs=30.3
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHh
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ 74 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~ 74 (101)
.......|.||.+.+.. ....+.|+|.||..|+..++.
T Consensus 66 ~~~~~~~c~ic~~~~~~--~~~~~~c~H~~c~~cw~~yl~ 103 (444)
T KOG1815|consen 66 KKKGDVQCGICVESYDG--EIIGLGCGHPFCPPCWTGYLG 103 (444)
T ss_pred CCCccccCCcccCCCcc--hhhhcCCCcHHHHHHHHHHhh
Confidence 34556789999998753 456678999999999998875
No 115
>KOG1812|consensus
Probab=89.19 E-value=0.22 Score=35.72 Aligned_cols=44 Identities=27% Similarity=0.516 Sum_probs=30.6
Q ss_pred ccccccccccccCCC--ceEEeCCCCcccHhHHHHHHhcCCccccc
Q psy8622 39 NLQCSVCWEQFTLDE--AVRKLPCDHFYHTPCIEPWLQLHGTCPIC 82 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~--~~~~l~C~H~f~~~Ci~~w~~~~~~CP~C 82 (101)
-..|++|.-.+.-.. ....-.|+|.||..|...|...+..|..|
T Consensus 306 wr~CpkC~~~ie~~~GCnhm~CrC~~~fcy~C~~~~~~~~~~~~~~ 351 (384)
T KOG1812|consen 306 WRQCPKCKFMIELSEGCNHMTCRCGHQFCYMCGGDWKTHNGECYEC 351 (384)
T ss_pred cCcCcccceeeeecCCcceEEeeccccchhhcCcchhhCCccccCc
Confidence 456888876653222 23334599999999999998877766544
No 116
>COG5183 SSM4 Protein involved in mRNA turnover and stability [RNA processing and modification]
Probab=88.91 E-value=0.49 Score=37.23 Aligned_cols=53 Identities=21% Similarity=0.652 Sum_probs=37.2
Q ss_pred CccccccccccccCCCceEEeCCC-----CcccHhHHHHHHhc--CCcccccccccccCCc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCD-----HFYHTPCIEPWLQL--HGTCPICRQTLHSESA 91 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~-----H~f~~~Ci~~w~~~--~~~CP~Cr~~~~~~~~ 91 (101)
++..|-||..+-..+++.. -||. ...|.+|+..|+.. ...|-+|..++.=.++
T Consensus 11 d~~~CRICr~e~~~d~pLf-hPCKC~GSIkYiH~eCL~eW~~~s~~~kCdiChy~~~Fk~I 70 (1175)
T COG5183 11 DKRSCRICRTEDIRDDPLF-HPCKCSGSIKYIHRECLMEWMECSGTKKCDICHYEYKFKDI 70 (1175)
T ss_pred cchhceeecCCCCCCCcCc-ccccccchhHHHHHHHHHHHHhcCCCcceeeecceeeeeee
Confidence 3478999998754444333 4554 24799999999974 4579999988754433
No 117
>KOG3161|consensus
Probab=88.17 E-value=0.17 Score=38.56 Aligned_cols=41 Identities=24% Similarity=0.647 Sum_probs=30.1
Q ss_pred CccccccccccccCCC-ceEEeCCCCcccHhHHHHHHhcCCccc
Q psy8622 38 SNLQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIEPWLQLHGTCP 80 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~~w~~~~~~CP 80 (101)
....|+||+..|.... .++.+-|||+.|..|+... .+.+||
T Consensus 10 ~~l~c~ic~n~f~~~~~~Pvsl~cghtic~~c~~~l--yn~scp 51 (861)
T KOG3161|consen 10 LLLLCDICLNLFVVQRLEPVSLQCGHTICGHCVQLL--YNASCP 51 (861)
T ss_pred HHhhchHHHHHHHHHhcCcccccccchHHHHHHHhH--hhccCC
Confidence 3467999988774332 4466679999999999874 456788
No 118
>KOG4718|consensus
Probab=87.98 E-value=0.29 Score=32.38 Aligned_cols=46 Identities=26% Similarity=0.632 Sum_probs=34.8
Q ss_pred CccccccccccccCCCceEEe-CCCCcccHhHHHHHHhcCCccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKL-PCDHFYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l-~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
.-..|.+|...+..+ +.. .|+-.+|..|+..+++....||.|..-+
T Consensus 180 nlk~Cn~Ch~LvIqg---~rCg~c~i~~h~~c~qty~q~~~~cphc~d~w 226 (235)
T KOG4718|consen 180 NLKNCNLCHCLVIQG---IRCGSCNIQYHRGCIQTYLQRRDICPHCGDLW 226 (235)
T ss_pred HHHHHhHhHHHhhee---eccCcccchhhhHHHHHHhcccCcCCchhccc
Confidence 456899999876433 222 3677899999999999999999995433
No 119
>PF14569 zf-UDP: Zinc-binding RING-finger; PDB: 1WEO_A.
Probab=87.85 E-value=1.4 Score=24.54 Aligned_cols=51 Identities=24% Similarity=0.481 Sum_probs=19.8
Q ss_pred CccccccccccccCCC--ceEEe--CCCCcccHhHHHHH-HhcCCccccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDE--AVRKL--PCDHFYHTPCIEPW-LQLHGTCPICRQTLHS 88 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~--~~~~l--~C~H~f~~~Ci~~w-~~~~~~CP~Cr~~~~~ 88 (101)
....|-||-+.+.... .+++. .|+--.|..|..-= ...++.||.|+.....
T Consensus 8 ~~qiCqiCGD~VGl~~~Ge~FVAC~eC~fPvCr~CyEYErkeg~q~CpqCkt~ykr 63 (80)
T PF14569_consen 8 NGQICQICGDDVGLTENGEVFVACHECAFPVCRPCYEYERKEGNQVCPQCKTRYKR 63 (80)
T ss_dssp SS-B-SSS--B--B-SSSSB--S-SSS-----HHHHHHHHHTS-SB-TTT--B---
T ss_pred CCcccccccCccccCCCCCEEEEEcccCCccchhHHHHHhhcCcccccccCCCccc
Confidence 4568999999874332 23322 37777899998643 3457799999987753
No 120
>KOG1829|consensus
Probab=87.62 E-value=0.2 Score=37.80 Aligned_cols=23 Identities=26% Similarity=0.826 Sum_probs=18.1
Q ss_pred CCCCcccHhHHHHHHhcCCccccccc
Q psy8622 59 PCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 59 ~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
.|+++||..|+.+ .+..||.|-+
T Consensus 536 ~C~avfH~~C~~r---~s~~CPrC~R 558 (580)
T KOG1829|consen 536 TCLAVFHKKCLRR---KSPCCPRCER 558 (580)
T ss_pred HHHHHHHHHHHhc---cCCCCCchHH
Confidence 4999999999865 4455999944
No 121
>KOG3579|consensus
Probab=86.62 E-value=0.43 Score=33.01 Aligned_cols=40 Identities=20% Similarity=0.465 Sum_probs=28.6
Q ss_pred CCCccccccccccccCCCceEEeC--CCCcccHhHHHHHHhcC
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLP--CDHFYHTPCIEPWLQLH 76 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~--C~H~f~~~Ci~~w~~~~ 76 (101)
....+.|.+|.|.+++.. .+..| =.|.||..|-...++.+
T Consensus 265 ~~apLcCTLC~ERLEDTH-FVQCPSVp~HKFCFPCSResIK~Q 306 (352)
T KOG3579|consen 265 PSAPLCCTLCHERLEDTH-FVQCPSVPSHKFCFPCSRESIKQQ 306 (352)
T ss_pred CCCceeehhhhhhhccCc-eeecCCCcccceecccCHHHHHhh
Confidence 345689999999996442 22233 37999999999887653
No 122
>PF06906 DUF1272: Protein of unknown function (DUF1272); InterPro: IPR010696 This family consists of several hypothetical bacterial proteins of around 80 residues in length. This family contains a number of conserved cysteine residues and its function is unknown.
Probab=86.33 E-value=1.3 Score=23.08 Aligned_cols=47 Identities=26% Similarity=0.605 Sum_probs=31.4
Q ss_pred cccccccccccCCCceEEeCCCC--cccHhHHHHHHhcCCcccccccccccC
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDH--FYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H--~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
..|-.|-.++..+. .-..-|.+ .||..|....+ ...||.|...+...
T Consensus 6 pnCE~C~~dLp~~s-~~A~ICSfECTFC~~C~e~~l--~~~CPNCgGelv~R 54 (57)
T PF06906_consen 6 PNCECCDKDLPPDS-PEAYICSFECTFCADCAETML--NGVCPNCGGELVRR 54 (57)
T ss_pred CCccccCCCCCCCC-CcceEEeEeCcccHHHHHHHh--cCcCcCCCCccccC
Confidence 45677777764433 11222543 79999999865 67899998877654
No 123
>COG5109 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=85.93 E-value=0.78 Score=32.26 Aligned_cols=52 Identities=23% Similarity=0.294 Sum_probs=37.0
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc---CCcccccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL---HGTCPICRQTLH 87 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~---~~~CP~Cr~~~~ 87 (101)
...-..||+=-+.-....++..+.|||+.-..-+....+. .-.||.|-..-.
T Consensus 333 fHs~FiCPVlKe~~t~ENpP~ml~CgHVIskeal~~LS~nG~~~FKCPYCP~~~~ 387 (396)
T COG5109 333 FHSLFICPVLKELCTDENPPVMLECGHVISKEALSVLSQNGVLSFKCPYCPEMSK 387 (396)
T ss_pred ccceeeccccHhhhcccCCCeeeeccceeeHHHHHHHhhcCcEEeeCCCCCcchh
Confidence 3446789987776655567899999999988888775443 226999955433
No 124
>KOG0269|consensus
Probab=85.62 E-value=0.9 Score=35.37 Aligned_cols=41 Identities=22% Similarity=0.386 Sum_probs=28.3
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPI 81 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~ 81 (101)
..|.+|--.+.. ....---|+|.-|.+|+..|+.....||.
T Consensus 780 ~~CtVC~~vi~G-~~~~c~~C~H~gH~sh~~sw~~~~s~ca~ 820 (839)
T KOG0269|consen 780 AKCTVCDLVIRG-VDVWCQVCGHGGHDSHLKSWFFKASPCAK 820 (839)
T ss_pred cCceeecceeee-eEeecccccccccHHHHHHHHhcCCCCcc
Confidence 467777665421 11111139999999999999998887776
No 125
>KOG0825|consensus
Probab=85.51 E-value=0.93 Score=35.64 Aligned_cols=52 Identities=15% Similarity=0.203 Sum_probs=36.5
Q ss_pred CccccccccccccCCC-ceEEeC---CCCcccHhHHHHHHhc------CCcccccccccccC
Q psy8622 38 SNLQCSVCWEQFTLDE-AVRKLP---CDHFYHTPCIEPWLQL------HGTCPICRQTLHSE 89 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~-~~~~l~---C~H~f~~~Ci~~w~~~------~~~CP~Cr~~~~~~ 89 (101)
....|.+|+..+..+. ..-.+| |+|.||..||..|... .-.|++|..-|...
T Consensus 95 ~s~Ss~~C~~E~S~~~ds~~i~P~~~~~~~~CP~Ci~s~~DqL~~~~k~c~H~FC~~Ci~sW 156 (1134)
T KOG0825|consen 95 ESDTSPVCEKEHSPDVDSSNICPVQTHVENQCPNCLKSCNDQLEESEKHTAHYFCEECVGSW 156 (1134)
T ss_pred cccccchhheecCCcccccCcCchhhhhhhhhhHHHHHHHHHhhccccccccccHHHHhhhh
Confidence 3567888888776522 233445 9999999999999753 33688888777543
No 126
>KOG2807|consensus
Probab=85.07 E-value=0.83 Score=32.23 Aligned_cols=47 Identities=17% Similarity=0.349 Sum_probs=32.8
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
....|..|.+........+--.|.+.||.+|-.-.-+.-..||.|..
T Consensus 329 ~~~~Cf~C~~~~~~~~~y~C~~Ck~~FCldCDv~iHesLh~CpgCeh 375 (378)
T KOG2807|consen 329 GSRFCFACQGELLSSGRYRCESCKNVFCLDCDVFIHESLHNCPGCEH 375 (378)
T ss_pred CCcceeeeccccCCCCcEEchhccceeeccchHHHHhhhhcCCCcCC
Confidence 44569999777655544444459999999996554344557999963
No 127
>PLN02400 cellulose synthase
Probab=84.74 E-value=2.7 Score=34.18 Aligned_cols=50 Identities=22% Similarity=0.481 Sum_probs=33.8
Q ss_pred CccccccccccccCCC--ceE-Ee-CCCCcccHhHHHH-HHhcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDE--AVR-KL-PCDHFYHTPCIEP-WLQLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~--~~~-~l-~C~H~f~~~Ci~~-w~~~~~~CP~Cr~~~~ 87 (101)
....|-||-+++.... .+. .. -|+--.|+.|.+- .-..++.||.|+....
T Consensus 35 ~gqiCqICGD~VG~t~dGe~FVAC~eCaFPVCRpCYEYERkeGnq~CPQCkTrYk 89 (1085)
T PLN02400 35 NGQICQICGDDVGVTETGDVFVACNECAFPVCRPCYEYERKDGTQCCPQCKTRYR 89 (1085)
T ss_pred CCceeeecccccCcCCCCCEEEEEccCCCccccchhheecccCCccCcccCCccc
Confidence 3458999999874332 222 22 3777789999842 1234778999998886
No 128
>KOG2068|consensus
Probab=84.51 E-value=1.2 Score=31.29 Aligned_cols=51 Identities=24% Similarity=0.566 Sum_probs=36.6
Q ss_pred CccccccccccccCCCce-EEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDEAV-RKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~-~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
....|+||.+.....+.. .-.+|++..|..|+.........||.||.+...
T Consensus 248 v~~s~p~~~~~~~~~d~~~lP~~~~~~~~l~~~~t~~~~~~~~~~~rk~~~~ 299 (327)
T KOG2068|consen 248 VPPSCPICYEDLDLTDSNFLPCPCGFRLCLFCHKTISDGDGRCPGCRKPYER 299 (327)
T ss_pred cCCCCCCCCCcccccccccccccccccchhhhhhcccccCCCCCccCCcccc
Confidence 346899999976433322 223478888888888877888899999976643
No 129
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=84.46 E-value=0.5 Score=20.68 Aligned_cols=9 Identities=22% Similarity=0.715 Sum_probs=5.0
Q ss_pred ccccccccc
Q psy8622 41 QCSVCWEQF 49 (101)
Q Consensus 41 ~C~IC~~~~ 49 (101)
.||-|...+
T Consensus 2 ~CP~C~~~V 10 (26)
T PF10571_consen 2 TCPECGAEV 10 (26)
T ss_pred cCCCCcCCc
Confidence 356665554
No 130
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=84.42 E-value=0.46 Score=25.36 Aligned_cols=36 Identities=17% Similarity=0.435 Sum_probs=17.1
Q ss_pred CCccccccccccccCCCceEEe-CCCCcccHhHHHHH
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKL-PCDHFYHTPCIEPW 72 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l-~C~H~f~~~Ci~~w 72 (101)
.+...|.+|...|..-..-..- .||++||..|....
T Consensus 7 ~~~~~C~~C~~~F~~~~rrhhCr~CG~~vC~~Cs~~~ 43 (69)
T PF01363_consen 7 SEASNCMICGKKFSLFRRRHHCRNCGRVVCSSCSSQR 43 (69)
T ss_dssp GG-SB-TTT--B-BSSS-EEE-TTT--EEECCCS-EE
T ss_pred CCCCcCcCcCCcCCCceeeEccCCCCCEECCchhCCE
Confidence 4456899999998533221222 39999999998643
No 131
>PF13901 DUF4206: Domain of unknown function (DUF4206)
Probab=84.40 E-value=0.74 Score=30.13 Aligned_cols=41 Identities=22% Similarity=0.826 Sum_probs=26.1
Q ss_pred Ccccccccccc-----ccCCCceEEe-CCCCcccHhHHHHHHhcCCccccccc
Q psy8622 38 SNLQCSVCWEQ-----FTLDEAVRKL-PCDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 38 ~~~~C~IC~~~-----~~~~~~~~~l-~C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
.+..|-+|.+. |.. +.+..- .|+..||..|+.+ ..||.|.+
T Consensus 151 kGfiCe~C~~~~~IfPF~~-~~~~~C~~C~~v~H~~C~~~-----~~CpkC~R 197 (202)
T PF13901_consen 151 KGFICEICNSDDIIFPFQI-DTTVRCPKCKSVFHKSCFRK-----KSCPKCAR 197 (202)
T ss_pred CCCCCccCCCCCCCCCCCC-CCeeeCCcCccccchhhcCC-----CCCCCcHh
Confidence 35678888753 111 122222 3999999999862 67999954
No 132
>PF07191 zinc-ribbons_6: zinc-ribbons; InterPro: IPR010807 This family consists of several short, hypothetical bacterial proteins of around 70 residues in length. Members of this family 8 highly conserved cysteine residues. The function of the family is unknown.; PDB: 2JRP_A 2JNE_A.
Probab=84.24 E-value=0.082 Score=28.86 Aligned_cols=40 Identities=23% Similarity=0.582 Sum_probs=19.7
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
..||.|...+.... ++.+|..|-.. +.....||-|..++.
T Consensus 2 ~~CP~C~~~L~~~~-------~~~~C~~C~~~-~~~~a~CPdC~~~Le 41 (70)
T PF07191_consen 2 NTCPKCQQELEWQG-------GHYHCEACQKD-YKKEAFCPDCGQPLE 41 (70)
T ss_dssp -B-SSS-SBEEEET-------TEEEETTT--E-EEEEEE-TTT-SB-E
T ss_pred CcCCCCCCccEEeC-------CEEECcccccc-ceecccCCCcccHHH
Confidence 46888888764321 55566666554 344557888887763
No 133
>PF06844 DUF1244: Protein of unknown function (DUF1244); InterPro: IPR009654 This family consists of several short bacterial proteins of around 100 residues in length. The function of this family is unknown.; PDB: 2O35_A 3FYB_B.
Probab=82.25 E-value=0.92 Score=24.44 Aligned_cols=12 Identities=17% Similarity=0.753 Sum_probs=8.8
Q ss_pred cccHhHHHHHHh
Q psy8622 63 FYHTPCIEPWLQ 74 (101)
Q Consensus 63 ~f~~~Ci~~w~~ 74 (101)
.||+.||.+|..
T Consensus 11 gFCRNCLskWy~ 22 (68)
T PF06844_consen 11 GFCRNCLSKWYR 22 (68)
T ss_dssp S--HHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 599999999975
No 134
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=81.88 E-value=0.86 Score=23.33 Aligned_cols=24 Identities=21% Similarity=0.528 Sum_probs=13.1
Q ss_pred CCCcccHhHHHHHHhcCCcccccc
Q psy8622 60 CDHFYHTPCIEPWLQLHGTCPICR 83 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~~~CP~Cr 83 (101)
|++.||.+|=.-.=+.-..||-|-
T Consensus 27 C~~~FC~dCD~fiHE~LH~CPGC~ 50 (51)
T PF07975_consen 27 CKNHFCIDCDVFIHETLHNCPGCE 50 (51)
T ss_dssp TT--B-HHHHHTTTTTS-SSSTT-
T ss_pred CCCccccCcChhhhccccCCcCCC
Confidence 999999999643223444788873
No 135
>smart00132 LIM Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
Probab=81.48 E-value=1.4 Score=20.09 Aligned_cols=37 Identities=22% Similarity=0.538 Sum_probs=22.0
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
.|..|.+.+...+. ....=+..||..|+ .|..|+..+
T Consensus 1 ~C~~C~~~i~~~~~-~~~~~~~~~H~~Cf--------~C~~C~~~L 37 (39)
T smart00132 1 KCAGCGKPIRGGEL-VLRALGKVWHPECF--------KCSKCGKPL 37 (39)
T ss_pred CccccCCcccCCcE-EEEeCCccccccCC--------CCcccCCcC
Confidence 36778777654322 22223667888775 477776655
No 136
>PF10497 zf-4CXXC_R1: Zinc-finger domain of monoamine-oxidase A repressor R1; InterPro: IPR018866 R1 is a transcription factor repressor that inhibits monoamine oxidase A gene expression. This domain is a four-CXXC zinc finger putative DNA-binding domain found at the C-terminal end of R1. The domain carries 12 cysteines of which four pairs are of the CXXC type [].
Probab=81.40 E-value=2.2 Score=25.11 Aligned_cols=25 Identities=20% Similarity=0.446 Sum_probs=18.6
Q ss_pred CCcccHhHHHHHHhc---------CCcccccccc
Q psy8622 61 DHFYHTPCIEPWLQL---------HGTCPICRQT 85 (101)
Q Consensus 61 ~H~f~~~Ci~~w~~~---------~~~CP~Cr~~ 85 (101)
.-.||..||..++.. .-.||.||..
T Consensus 37 ~~~fC~~CL~~ryge~~~ev~~~~~W~CP~Crgi 70 (105)
T PF10497_consen 37 RGKFCGGCLRNRYGENVEEVLEDPNWKCPKCRGI 70 (105)
T ss_pred cceehHhHHHHHHhhhHHHHhcCCceECCCCCCe
Confidence 667999999887642 2269999873
No 137
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=80.84 E-value=0.78 Score=22.83 Aligned_cols=43 Identities=28% Similarity=0.634 Sum_probs=25.8
Q ss_pred ccccccccccCCCceEEe-CCCCcccHhHHHHHHh------cCCccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKL-PCDHFYHTPCIEPWLQ------LHGTCPICRQ 84 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l-~C~H~f~~~Ci~~w~~------~~~~CP~Cr~ 84 (101)
.|.||...... ..++.- .|+..||..|+..-.. ..=.||.|+.
T Consensus 1 ~C~vC~~~~~~-~~~i~C~~C~~~~H~~C~~~~~~~~~~~~~~w~C~~C~~ 50 (51)
T PF00628_consen 1 YCPVCGQSDDD-GDMIQCDSCNRWYHQECVGPPEKAEEIPSGDWYCPNCRP 50 (51)
T ss_dssp EBTTTTSSCTT-SSEEEBSTTSCEEETTTSTSSHSHHSHHSSSBSSHHHHH
T ss_pred eCcCCCCcCCC-CCeEEcCCCChhhCcccCCCChhhccCCCCcEECcCCcC
Confidence 47888884322 222222 4899999999864322 1236887753
No 138
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the PLN02638 cellulose synthase A (UDP-forming), catalytic subunit
Probab=79.10 E-value=3.5 Score=33.57 Aligned_cols=50 Identities=18% Similarity=0.423 Sum_probs=34.1
Q ss_pred CccccccccccccCCC---ceEEe-CCCCcccHhHHHH-HHhcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDE---AVRKL-PCDHFYHTPCIEP-WLQLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~---~~~~l-~C~H~f~~~Ci~~-w~~~~~~CP~Cr~~~~ 87 (101)
....|-||-+++.... ..+.. -|+--.|..|.+- .-+.++.||.|+....
T Consensus 16 ~~qiCqICGD~vg~~~~Ge~FVAC~eC~FPVCrpCYEYEr~eG~q~CPqCktrYk 70 (1079)
T PLN02638 16 GGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYEYERKDGNQSCPQCKTKYK 70 (1079)
T ss_pred CCceeeecccccCcCCCCCEEEEeccCCCccccchhhhhhhcCCccCCccCCchh
Confidence 3458999999874332 22222 2777789999842 2245779999998876
No 140
>PLN02189 cellulose synthase
Probab=78.58 E-value=3.7 Score=33.32 Aligned_cols=50 Identities=20% Similarity=0.474 Sum_probs=34.5
Q ss_pred CccccccccccccCC---CceEEeC-CCCcccHhHHHHHH-hcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLD---EAVRKLP-CDHFYHTPCIEPWL-QLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~---~~~~~l~-C~H~f~~~Ci~~w~-~~~~~CP~Cr~~~~ 87 (101)
....|.||-+++... +..+... |+--.|..|.+-=. ..++.||.|+....
T Consensus 33 ~~~~C~iCgd~vg~~~~g~~fvaC~~C~fpvCr~Cyeyer~eg~q~CpqCkt~Y~ 87 (1040)
T PLN02189 33 DGQVCEICGDEIGLTVDGDLFVACNECGFPVCRPCYEYERREGTQNCPQCKTRYK 87 (1040)
T ss_pred cCccccccccccCcCCCCCEEEeeccCCCccccchhhhhhhcCCccCcccCCchh
Confidence 445899999987422 2333333 77788999985322 34778999999886
No 141
>KOG3842|consensus
Probab=77.48 E-value=3.3 Score=29.32 Aligned_cols=52 Identities=23% Similarity=0.575 Sum_probs=32.1
Q ss_pred CCCccccccccccc---------------cCCC-ceEEeCCCCcccHhHHHHHHhc---------CCcccccccccc
Q psy8622 36 DDSNLQCSVCWEQF---------------TLDE-AVRKLPCDHFYHTPCIEPWLQL---------HGTCPICRQTLH 87 (101)
Q Consensus 36 ~~~~~~C~IC~~~~---------------~~~~-~~~~l~C~H~f~~~Ci~~w~~~---------~~~CP~Cr~~~~ 87 (101)
......||+|+..- ..+. ....-||||+--..=..-|.+. ...||.|-..+.
T Consensus 338 g~~~r~CPmC~~~gp~V~L~lG~E~~f~vD~G~pthaF~PCGHv~sekt~~YWs~iplPhGT~~f~a~CPFC~~~L~ 414 (429)
T KOG3842|consen 338 GQRERECPMCRVVGPYVPLWLGCEAGFYVDAGPPTHAFNPCGHVCSEKTVKYWSQIPLPHGTHAFHAACPFCATQLA 414 (429)
T ss_pred CcccCcCCeeeeecceeeeeccccceeEecCCCcccccCCcccccchhhhhHhhcCcCCCccccccccCcchhhhhc
Confidence 44467899998641 0000 1122379998777777777642 336999977664
No 142
>PLN02436 cellulose synthase A
Probab=77.14 E-value=4.2 Score=33.12 Aligned_cols=50 Identities=22% Similarity=0.474 Sum_probs=34.3
Q ss_pred Ccccccccccccc---CCCceEEeC-CCCcccHhHHHHHH-hcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFT---LDEAVRKLP-CDHFYHTPCIEPWL-QLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~---~~~~~~~l~-C~H~f~~~Ci~~w~-~~~~~CP~Cr~~~~ 87 (101)
....|.||-+++. .++..+... |+--.|..|.+-=. ..++.||.|+....
T Consensus 35 ~~~iCqICGD~Vg~t~dGe~FVACn~C~fpvCr~Cyeyer~eg~~~Cpqckt~Y~ 89 (1094)
T PLN02436 35 SGQTCQICGDEIELTVDGEPFVACNECAFPVCRPCYEYERREGNQACPQCKTRYK 89 (1094)
T ss_pred CCccccccccccCcCCCCCEEEeeccCCCccccchhhhhhhcCCccCcccCCchh
Confidence 3458999999873 223333333 77788999985322 34778999999876
No 143
>KOG2066|consensus
Probab=76.57 E-value=1.1 Score=35.06 Aligned_cols=42 Identities=29% Similarity=0.585 Sum_probs=29.2
Q ss_pred cccccccccccCC----CceEEeCCCCcccHhHHHHHHhcCCccccc
Q psy8622 40 LQCSVCWEQFTLD----EAVRKLPCDHFYHTPCIEPWLQLHGTCPIC 82 (101)
Q Consensus 40 ~~C~IC~~~~~~~----~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~C 82 (101)
..|..|.+..... ....++-|+|+||..|+..-..++. |-.|
T Consensus 785 ~rc~~c~~~~l~~~~~~~~~~v~~c~h~yhk~c~~~~~~~~~-~~~~ 830 (846)
T KOG2066|consen 785 ERCSSCFEPNLPSGAAFDSVVVFHCGHMYHKECLMMESLRNA-CNIE 830 (846)
T ss_pred hhhhhhcccccccCcccceeeEEEccchhhhcccccHHHhcc-cChh
Confidence 3788998865322 3456678999999999976554444 5554
No 144
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF00412 LIM: LIM domain; InterPro: IPR001781 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents LIM-type zinc finger (Znf) domains. LIM domains coordinate one or more zinc atoms, and are named after the three proteins (LIN-11, Isl1 and MEC-3) in which they were first found. They consist of two zinc-binding motifs that resemble GATA-like Znf's, however the residues holding the zinc atom(s) are variable, involving Cys, His, Asp or Glu residues. LIM domains are involved in proteins with differing functions, including gene expression, and cytoskeleton organisation and development [, ]. Protein containing LIM Znf domains include: Caenorhabditis elegans mec-3; a protein required for the differentiation of the set of six touch receptor neurons in this nematode. C. elegans. lin-11; a protein required for the asymmetric division of vulval blast cells. Vertebrate insulin gene enhancer binding protein isl-1. Isl-1 binds to one of the two cis-acting protein-binding domains of the insulin gene. Vertebrate homeobox proteins lim-1, lim-2 (lim-5) and lim3. Vertebrate lmx-1, which acts as a transcriptional activator by binding to the FLAT element; a beta-cell-specific transcriptional enhancer found in the insulin gene. Mammalian LH-2, a transcriptional regulatory protein involved in the control of cell differentiation in developing lymphoid and neural cell types. Drosophila melanogaster (Fruit fly) protein apterous, required for the normal development of the wing and halter imaginal discs. Vertebrate protein kinases LIMK-1 and LIMK-2. Mammalian rhombotins. Rhombotin 1 (RBTN1 or TTG-1) and rhombotin-2 (RBTN2 or TTG-2) are proteins of about 160 amino acids whose genes are disrupted by chromosomal translocations in T-cell leukemia. Mammalian and avian cysteine-rich protein (CRP), a 192 amino-acid protein of unknown function. Seems to interact with zyxin. Mammalian cysteine-rich intestinal protein (CRIP), a small protein which seems to have a role in zinc absorption and may function as an intracellular zinc transport protein. Vertebrate paxillin, a cytoskeletal focal adhesion protein. Mus musculus (Mouse) testin which should not be confused with rat testin which is a thiol protease homologue (see IPR000169 from INTERPRO). Helianthus annuus (Common sunflower) pollen specific protein SF3. Chicken zyxin. Zyxin is a low-abundance adhesion plaque protein which has been shown to interact with CRP. Yeast protein LRG1 which is involved in sporulation []. Saccharomyces cerevisiae (Baker's yeast) rho-type GTPase activating protein RGA1/DBM1. C. elegans homeobox protein ceh-14. C. elegans homeobox protein unc-97. S. cerevisiae hypothetical protein YKR090w. C. elegans hypothetical proteins C28H8.6. These proteins generally contain two tandem copies of the LIM domain in their N-terminal section. Zyxin and paxillin are exceptions in that they contain respectively three and four LIM domains at their C-terminal extremity. In apterous, isl-1, LH-2, lin-11, lim-1 to lim-3, lmx-1 and ceh-14 and mec-3 there is a homeobox domain some 50 to 95 amino acids after the LIM domains. LIM domains contain seven conserved cysteine residues and a histidine. The arrangement followed by these conserved residues is: C-x(2)-C-x(16,23)-H-x(2)-[CH]-x(2)-C-x(2)-C-x(16,21)-C-x(2,3)-[CHD] LIM domains bind two zinc ions []. LIM does not bind DNA, rather it seems to act as an interface for protein-protein interaction. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2CO8_A 2EGQ_A 2CUR_A 3IXE_B 1CTL_A 1B8T_A 1X62_A 2DFY_C 1IML_A 2CUQ_A ....
Probab=76.33 E-value=1.9 Score=21.82 Aligned_cols=40 Identities=23% Similarity=0.560 Sum_probs=22.8
Q ss_pred cccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 42 CSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
|..|...+..... .+..-+..||..|+ .|-.|+..+....
T Consensus 1 C~~C~~~I~~~~~-~~~~~~~~~H~~Cf--------~C~~C~~~l~~~~ 40 (58)
T PF00412_consen 1 CARCGKPIYGTEI-VIKAMGKFWHPECF--------KCSKCGKPLNDGD 40 (58)
T ss_dssp BTTTSSBESSSSE-EEEETTEEEETTTS--------BETTTTCBTTTSS
T ss_pred CCCCCCCccCcEE-EEEeCCcEEEcccc--------ccCCCCCccCCCe
Confidence 5566666653322 21235667777775 4777777765543
No 146
>KOG3005|consensus
Probab=76.16 E-value=4.3 Score=27.93 Aligned_cols=49 Identities=29% Similarity=0.661 Sum_probs=33.6
Q ss_pred ccccccccccccCCCceEEe----CCCCcccHhHHHHHHhc---------CCcccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKL----PCDHFYHTPCIEPWLQL---------HGTCPICRQTLH 87 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l----~C~H~f~~~Ci~~w~~~---------~~~CP~Cr~~~~ 87 (101)
...|-+|...+...+..++. .|.-.+|..|+..-+.. ...||.|++.+.
T Consensus 182 ~~~celc~~ei~e~~~~~a~c~~~~c~~~~h~~CLa~~~~~~e~g~~~p~eg~cp~C~~~~~ 243 (276)
T KOG3005|consen 182 NVECELCEKEILETDWSRATCPNPDCDSLNHLTCLAEELLEVEPGQLIPLEGMCPKCEKFLS 243 (276)
T ss_pred chhhHHHHHHhccccceeccCCCCCCCchhhhhhhhHHHhccCCCceeccCCCCCchhceee
Confidence 35899999988444433332 28889999999884321 336999988553
No 147
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=75.59 E-value=2.2 Score=21.62 Aligned_cols=34 Identities=21% Similarity=0.467 Sum_probs=22.5
Q ss_pred cccccccccccCCCc-eEEeCCCCcccHhHHHHHH
Q psy8622 40 LQCSVCWEQFTLDEA-VRKLPCDHFYHTPCIEPWL 73 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~-~~~l~C~H~f~~~Ci~~w~ 73 (101)
..|.+|...|..... ..-..||++||..|.....
T Consensus 3 ~~C~~C~~~F~~~~rk~~Cr~Cg~~~C~~C~~~~~ 37 (57)
T cd00065 3 SSCMGCGKPFTLTRRRHHCRNCGRIFCSKCSSNRI 37 (57)
T ss_pred CcCcccCccccCCccccccCcCcCCcChHHcCCee
Confidence 468888887753321 1222499999999987643
No 148
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=75.24 E-value=0.81 Score=23.38 Aligned_cols=13 Identities=46% Similarity=1.237 Sum_probs=6.7
Q ss_pred CcccccccccccC
Q psy8622 77 GTCPICRQTLHSE 89 (101)
Q Consensus 77 ~~CP~Cr~~~~~~ 89 (101)
..||+|.+++..+
T Consensus 21 ~~CPlC~r~l~~e 33 (54)
T PF04423_consen 21 GCCPLCGRPLDEE 33 (54)
T ss_dssp EE-TTT--EE-HH
T ss_pred CcCCCCCCCCCHH
Confidence 3899999988653
No 149
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=74.59 E-value=4.6 Score=24.12 Aligned_cols=45 Identities=18% Similarity=0.276 Sum_probs=31.2
Q ss_pred ccccccccccccCCC-----------ceEEeCCCCcccHhHHHHHHhcCCcccccc
Q psy8622 39 NLQCSVCWEQFTLDE-----------AVRKLPCDHFYHTPCIEPWLQLHGTCPICR 83 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~-----------~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr 83 (101)
...|.-|+..|.... ...-..|.+.||.+|-.-+-+.-..||-|.
T Consensus 55 ~~~C~~C~~~f~~~~~~~~~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~CPGC~ 110 (112)
T TIGR00622 55 SRFCFGCQGPFPKPPVSPFDELKDSHRYVCAVCKNVFCVDCDVFVHESLHCCPGCI 110 (112)
T ss_pred CCcccCcCCCCCCcccccccccccccceeCCCCCCccccccchhhhhhccCCcCCC
Confidence 346999998874321 111224999999999877766666899985
No 150
>PF04710 Pellino: Pellino; InterPro: IPR006800 Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase [, , ].; PDB: 3EGB_B 3EGA_A.
Probab=71.63 E-value=1.3 Score=32.06 Aligned_cols=47 Identities=30% Similarity=0.584 Sum_probs=0.4
Q ss_pred CCCccccccccccccCC-----------CceEEeCCCCcccHhHHHHHHh------cCCcccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLD-----------EAVRKLPCDHFYHTPCIEPWLQ------LHGTCPICRQT 85 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~-----------~~~~~l~C~H~f~~~Ci~~w~~------~~~~CP~Cr~~ 85 (101)
......||+=+..+..+ .+.+-+.|||++.. ..|-. ....||+||..
T Consensus 274 Na~rpQCPVglnTL~fp~~~~~~~~~~~qP~VYl~CGHVhG~---h~Wg~~~~~~~~~r~CPlCr~~ 337 (416)
T PF04710_consen 274 NAGRPQCPVGLNTLVFPSKSRKDVPDERQPWVYLNCGHVHGY---HNWGQDSDRDPRSRTCPLCRQV 337 (416)
T ss_dssp S------------------------------------------------------------------
T ss_pred hhcCCCCCcCCCccccccccccccccccCceeeccccceeee---cccccccccccccccCCCcccc
Confidence 34456777776655221 13344679998754 45642 25589999874
No 151
>COG3813 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=71.53 E-value=3.2 Score=22.82 Aligned_cols=46 Identities=26% Similarity=0.642 Sum_probs=28.2
Q ss_pred cccccccccCCCceEEeCC--CCcccHhHHHHHHhcCCcccccccccccCC
Q psy8622 42 CSVCWEQFTLDEAVRKLPC--DHFYHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 42 C~IC~~~~~~~~~~~~l~C--~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
|--|-.++-+ +..-.+-| .+.||.+|...- -...||.|...+....
T Consensus 8 CECCDrDLpp-~s~dA~ICtfEcTFCadCae~~--l~g~CPnCGGelv~RP 55 (84)
T COG3813 8 CECCDRDLPP-DSTDARICTFECTFCADCAENR--LHGLCPNCGGELVARP 55 (84)
T ss_pred CcccCCCCCC-CCCceeEEEEeeehhHhHHHHh--hcCcCCCCCchhhcCc
Confidence 4445555522 22222224 478999999863 4568999988775543
No 152
>KOG3039|consensus
Probab=70.79 E-value=3.7 Score=28.02 Aligned_cols=32 Identities=13% Similarity=0.131 Sum_probs=26.9
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHH
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWL 73 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~ 73 (101)
-.-|.+|+.....+ ++.+=||+|+..||..++
T Consensus 43 FdcCsLtLqPc~dP---vit~~GylfdrEaILe~i 74 (303)
T KOG3039|consen 43 FDCCSLTLQPCRDP---VITPDGYLFDREAILEYI 74 (303)
T ss_pred cceeeeecccccCC---ccCCCCeeeeHHHHHHHH
Confidence 45789999988655 788899999999998875
No 153
>KOG3799|consensus
Probab=70.47 E-value=0.91 Score=28.04 Aligned_cols=50 Identities=16% Similarity=0.393 Sum_probs=26.4
Q ss_pred CCcccccccccc-ccCCCceEEeCCCCcccHhHHHHHHhc-CC---ccccccccc
Q psy8622 37 DSNLQCSVCWEQ-FTLDEAVRKLPCDHFYHTPCIEPWLQL-HG---TCPICRQTL 86 (101)
Q Consensus 37 ~~~~~C~IC~~~-~~~~~~~~~l~C~H~f~~~Ci~~w~~~-~~---~CP~Cr~~~ 86 (101)
..+..|.||+.. |.++-...-.-|.-.||..|-.+...+ ++ .|-+|+...
T Consensus 63 ~ddatC~IC~KTKFADG~GH~C~YCq~r~CARCGGrv~lrsNKv~wvcnlc~k~q 117 (169)
T KOG3799|consen 63 GDDATCGICHKTKFADGCGHNCSYCQTRFCARCGGRVSLRSNKVMWVCNLCRKQQ 117 (169)
T ss_pred CcCcchhhhhhcccccccCcccchhhhhHHHhcCCeeeeccCceEEeccCCcHHH
Confidence 456789999974 332211111224445666665543222 22 588887643
No 154
>COG4847 Uncharacterized protein conserved in archaea [Function unknown]
Probab=67.83 E-value=7.9 Score=22.45 Aligned_cols=35 Identities=14% Similarity=0.228 Sum_probs=27.5
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHh
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQ 74 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~ 74 (101)
...|.+|-+.+..++...-++ .-..|..|+..-..
T Consensus 6 ewkC~VCg~~iieGqkFTF~~-kGsVH~eCl~~s~~ 40 (103)
T COG4847 6 EWKCYVCGGTIIEGQKFTFTK-KGSVHYECLAESKR 40 (103)
T ss_pred eeeEeeeCCEeeeccEEEEee-CCcchHHHHHHHHh
Confidence 468999999998887766666 55689999987543
No 155
>PRK11088 rrmA 23S rRNA methyltransferase A; Provisional
Probab=66.89 E-value=4.5 Score=27.38 Aligned_cols=25 Identities=32% Similarity=0.704 Sum_probs=17.8
Q ss_pred cccccccccccCCCceEEeCCCCcc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFY 64 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f 64 (101)
..||+|...+.........+.+|.|
T Consensus 3 ~~CP~C~~~l~~~~~~~~C~~~h~f 27 (272)
T PRK11088 3 YQCPLCHQPLTLEENSWICPQNHQF 27 (272)
T ss_pred ccCCCCCcchhcCCCEEEcCCCCCC
Confidence 5799999998654444444567877
No 156
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=66.57 E-value=3.9 Score=19.20 Aligned_cols=14 Identities=21% Similarity=0.793 Sum_probs=9.0
Q ss_pred cccccccccccCCC
Q psy8622 40 LQCSVCWEQFTLDE 53 (101)
Q Consensus 40 ~~C~IC~~~~~~~~ 53 (101)
..||-|...|..++
T Consensus 3 i~CP~C~~~f~v~~ 16 (37)
T PF13719_consen 3 ITCPNCQTRFRVPD 16 (37)
T ss_pred EECCCCCceEEcCH
Confidence 46788877765443
No 157
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=66.28 E-value=2.9 Score=21.27 Aligned_cols=11 Identities=27% Similarity=0.860 Sum_probs=8.3
Q ss_pred ccccccccccc
Q psy8622 39 NLQCSVCWEQF 49 (101)
Q Consensus 39 ~~~C~IC~~~~ 49 (101)
.+.||.|.+.+
T Consensus 2 ~f~CP~C~~~~ 12 (54)
T PF05605_consen 2 SFTCPYCGKGF 12 (54)
T ss_pred CcCCCCCCCcc
Confidence 46799998855
No 158
>PLN02195 cellulose synthase A
Probab=66.20 E-value=10 Score=30.73 Aligned_cols=50 Identities=20% Similarity=0.349 Sum_probs=34.1
Q ss_pred CccccccccccccCCC---ceEEe-CCCCcccHhHHHHH-HhcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDE---AVRKL-PCDHFYHTPCIEPW-LQLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~---~~~~l-~C~H~f~~~Ci~~w-~~~~~~CP~Cr~~~~ 87 (101)
....|.||-+.+..+. ..+.. .|+--.|..|.+-= -+.++.||.|+....
T Consensus 5 ~~~~c~~cgd~~~~~~~g~~fvaC~eC~~pvCrpCyeyer~eg~q~CpqCkt~Yk 59 (977)
T PLN02195 5 GAPICATCGEEVGVDSNGEAFVACHECSYPLCKACLEYEIKEGRKVCLRCGGPYD 59 (977)
T ss_pred CCccceecccccCcCCCCCeEEEeccCCCccccchhhhhhhcCCccCCccCCccc
Confidence 3458999999774332 22222 38888999998421 134778999999887
No 159
>PF13832 zf-HC5HC2H_2: PHD-zinc-finger like domain
Probab=65.76 E-value=5.1 Score=23.25 Aligned_cols=34 Identities=21% Similarity=0.503 Sum_probs=22.4
Q ss_pred CCccccccccccccCCCceEEe--CCCCcccHhHHHHH
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKL--PCDHFYHTPCIEPW 72 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l--~C~H~f~~~Ci~~w 72 (101)
.....|.||.... +..+.-. .|...||..|....
T Consensus 53 ~~~~~C~iC~~~~--G~~i~C~~~~C~~~fH~~CA~~~ 88 (110)
T PF13832_consen 53 RFKLKCSICGKSG--GACIKCSHPGCSTAFHPTCARKA 88 (110)
T ss_pred hcCCcCcCCCCCC--ceeEEcCCCCCCcCCCHHHHHHC
Confidence 3567999999872 2111111 27889999998763
No 160
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=64.13 E-value=0.94 Score=31.17 Aligned_cols=46 Identities=22% Similarity=0.314 Sum_probs=19.8
Q ss_pred CCccccccccccccCCCceEEeC-----CCCcccHhHHHHHHhcCCcccccccc
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLP-----CDHFYHTPCIEPWLQLHGTCPICRQT 85 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~-----C~H~f~~~Ci~~w~~~~~~CP~Cr~~ 85 (101)
.....||+|-..-... .+.. =.+.+|.-|-..|-.....||.|-..
T Consensus 170 w~~g~CPvCGs~P~~s---~l~~~~~~G~R~L~Cs~C~t~W~~~R~~Cp~Cg~~ 220 (290)
T PF04216_consen 170 WQRGYCPVCGSPPVLS---VLRGGEREGKRYLHCSLCGTEWRFVRIKCPYCGNT 220 (290)
T ss_dssp TT-SS-TTT---EEEE---EEE------EEEEEETTT--EEE--TTS-TTT---
T ss_pred ccCCcCCCCCCcCceE---EEecCCCCccEEEEcCCCCCeeeecCCCCcCCCCC
Confidence 3457999998753111 1111 13456777888887777789999553
No 161
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=64.12 E-value=6.5 Score=26.42 Aligned_cols=27 Identities=19% Similarity=0.468 Sum_probs=20.9
Q ss_pred ccHhHHHHHHhcCCcccccccccccCC
Q psy8622 64 YHTPCIEPWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 64 f~~~Ci~~w~~~~~~CP~Cr~~~~~~~ 90 (101)
-|.+|....=.....||+|++.-.+.+
T Consensus 196 ~C~sC~qqIHRNAPiCPlCK~KsRSrn 222 (230)
T PF10146_consen 196 TCQSCHQQIHRNAPICPLCKAKSRSRN 222 (230)
T ss_pred hhHhHHHHHhcCCCCCcccccccccCC
Confidence 477888877677889999988765554
No 162
>KOG4451|consensus
Probab=64.01 E-value=6.1 Score=26.68 Aligned_cols=29 Identities=17% Similarity=0.395 Sum_probs=21.5
Q ss_pred cccHhHHHHHHhcCCcccccccccccCCc
Q psy8622 63 FYHTPCIEPWLQLHGTCPICRQTLHSESA 91 (101)
Q Consensus 63 ~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~ 91 (101)
..|..|..+.-.....||+|+..-.+.+.
T Consensus 250 K~ClsChqqIHRNAPiCPlCKaKsRSrNP 278 (286)
T KOG4451|consen 250 KVCLSCHQQIHRNAPICPLCKAKSRSRNP 278 (286)
T ss_pred hHHHHHHHHHhcCCCCCcchhhccccCCC
Confidence 35777887776778899999887655543
No 163
>COG3492 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=62.85 E-value=4.5 Score=23.28 Aligned_cols=12 Identities=17% Similarity=0.744 Sum_probs=10.7
Q ss_pred cccHhHHHHHHh
Q psy8622 63 FYHTPCIEPWLQ 74 (101)
Q Consensus 63 ~f~~~Ci~~w~~ 74 (101)
.||..|+..|..
T Consensus 42 gFCRNCLs~Wy~ 53 (104)
T COG3492 42 GFCRNCLSNWYR 53 (104)
T ss_pred HHHHHHHHHHHH
Confidence 599999999985
No 164
>PLN02915 cellulose synthase A [UDP-forming], catalytic subunit
Probab=62.43 E-value=19 Score=29.54 Aligned_cols=50 Identities=20% Similarity=0.446 Sum_probs=33.9
Q ss_pred CccccccccccccCC---CceEEe-CCCCcccHhHHHHH-HhcCCcccccccccc
Q psy8622 38 SNLQCSVCWEQFTLD---EAVRKL-PCDHFYHTPCIEPW-LQLHGTCPICRQTLH 87 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~---~~~~~l-~C~H~f~~~Ci~~w-~~~~~~CP~Cr~~~~ 87 (101)
....|-||-+++... +..+.. -|+--.|..|.+-= -..++.||.|+....
T Consensus 14 ~~~~c~iCGd~vg~~~~Ge~FVAC~eC~fpvCr~cyeye~~~g~~~cp~c~t~y~ 68 (1044)
T PLN02915 14 DAKTCRVCGDEVGVKEDGQPFVACHVCGFPVCKPCYEYERSEGNQCCPQCNTRYK 68 (1044)
T ss_pred CcchhhccccccCcCCCCCEEEEeccCCCccccchhhhhhhcCCccCCccCCchh
Confidence 456899999987432 222222 27777999998421 234778999998876
No 165
>PF13771 zf-HC5HC2H: PHD-like zinc-binding domain
Probab=61.67 E-value=5.4 Score=22.16 Aligned_cols=37 Identities=22% Similarity=0.503 Sum_probs=23.2
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHH
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEP 71 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~ 71 (101)
.......|.+|.........-..-.|...||..|...
T Consensus 32 ~~~~~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~ 68 (90)
T PF13771_consen 32 KRRRKLKCSICKKKGGACIGCSHPGCSRSFHVPCARK 68 (90)
T ss_pred HHHhCCCCcCCCCCCCeEEEEeCCCCCcEEChHHHcc
Confidence 3445678999987632111111124889999999865
No 166
>PF09943 DUF2175: Uncharacterized protein conserved in archaea (DUF2175); InterPro: IPR018686 This family of various hypothetical archaeal proteins has no known function.
Probab=60.05 E-value=10 Score=22.24 Aligned_cols=33 Identities=12% Similarity=0.195 Sum_probs=24.1
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHH
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWL 73 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~ 73 (101)
..|.||-+.+..++....++ +-..|+.|+..-.
T Consensus 3 WkC~iCg~~I~~gqlFTF~~-kG~VH~~C~~~~~ 35 (101)
T PF09943_consen 3 WKCYICGKPIYEGQLFTFTK-KGPVHYECFREKA 35 (101)
T ss_pred eEEEecCCeeeecceEEEec-CCcEeHHHHHHHH
Confidence 47999999987766444443 4668999998754
No 167
>PLN02248 cellulose synthase-like protein
Probab=59.60 E-value=10 Score=31.12 Aligned_cols=33 Identities=18% Similarity=0.476 Sum_probs=27.4
Q ss_pred CCCcccHhHHHHHHhcCCcccccccccccCCcc
Q psy8622 60 CDHFYHTPCIEPWLQLHGTCPICRQTLHSESAE 92 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~~~~ 92 (101)
|+...|.+|...-+.....||-|+.+....+..
T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (1135)
T PLN02248 150 CGFKICRDCYIDAVKSGGICPGCKEPYKVTDLD 182 (1135)
T ss_pred ccchhHHhHhhhhhhcCCCCCCCcccccccccc
Confidence 788889999999988888999999988554433
No 168
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=57.12 E-value=9.3 Score=16.82 Aligned_cols=29 Identities=24% Similarity=0.439 Sum_probs=8.7
Q ss_pred ccccccccccCCCceEEeCCCCcccHhHH
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDHFYHTPCI 69 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci 69 (101)
.|.+|.+.........-..|.-.+|..|+
T Consensus 2 ~C~~C~~~~~~~~~Y~C~~Cdf~lH~~Ca 30 (30)
T PF07649_consen 2 RCDACGKPIDGGWFYRCSECDFDLHEECA 30 (30)
T ss_dssp --TTTS----S--EEE-TTT-----HHHH
T ss_pred cCCcCCCcCCCCceEECccCCCccChhcC
Confidence 47778777643222232347777888774
No 169
>PF06676 DUF1178: Protein of unknown function (DUF1178); InterPro: IPR009562 This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown.
Probab=55.83 E-value=3.4 Score=25.90 Aligned_cols=24 Identities=33% Similarity=0.749 Sum_probs=15.7
Q ss_pred CCCcccHhHHHHHHhcC-----------Cccccccccccc
Q psy8622 60 CDHFYHTPCIEPWLQLH-----------GTCPICRQTLHS 88 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~-----------~~CP~Cr~~~~~ 88 (101)
++|.| ..||... -+||+|...-..
T Consensus 10 ~gH~F-----EgWF~ss~~fd~Q~~~glv~CP~Cgs~~V~ 44 (148)
T PF06676_consen 10 NGHEF-----EGWFRSSAAFDRQQARGLVSCPVCGSTEVS 44 (148)
T ss_pred CCCcc-----ceecCCHHHHHHHHHcCCccCCCCCCCeEe
Confidence 57776 5687532 279999775543
No 170
>PF14169 YdjO: Cold-inducible protein YdjO
Probab=55.26 E-value=6 Score=20.86 Aligned_cols=15 Identities=27% Similarity=0.769 Sum_probs=11.2
Q ss_pred cCCcccccccccccC
Q psy8622 75 LHGTCPICRQTLHSE 89 (101)
Q Consensus 75 ~~~~CP~Cr~~~~~~ 89 (101)
..+.||+|..+....
T Consensus 38 ~~p~CPlC~s~M~~~ 52 (59)
T PF14169_consen 38 EEPVCPLCKSPMVSG 52 (59)
T ss_pred CCccCCCcCCccccc
Confidence 356899999887654
No 171
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=54.97 E-value=6.5 Score=18.38 Aligned_cols=13 Identities=23% Similarity=0.905 Sum_probs=8.4
Q ss_pred cccccccccccCC
Q psy8622 40 LQCSVCWEQFTLD 52 (101)
Q Consensus 40 ~~C~IC~~~~~~~ 52 (101)
..|+-|...|..+
T Consensus 3 i~Cp~C~~~y~i~ 15 (36)
T PF13717_consen 3 ITCPNCQAKYEID 15 (36)
T ss_pred EECCCCCCEEeCC
Confidence 4677777776543
No 172
>KOG2113|consensus
Probab=54.17 E-value=12 Score=26.64 Aligned_cols=43 Identities=7% Similarity=-0.091 Sum_probs=30.3
Q ss_pred ccccccccccccCCCceEEeCCCC-cccHhHHHHHHhcCCccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
...|..|-+.+.. .+..+|+| .|+..|.. +....+||+|...+
T Consensus 343 ~~~~~~~~~~~~s---t~~~~~~~n~~~~~~a~--~s~~~~~~~c~~~~ 386 (394)
T KOG2113|consen 343 SLKGTSAGFGLLS---TIWSGGNMNLSPGSLAS--ASASPTSSTCDHND 386 (394)
T ss_pred hcccccccCceee---eEeecCCcccChhhhhh--cccCCccccccccc
Confidence 3567777665532 25568998 78888876 46678999997644
No 173
>KOG0824|consensus
Probab=53.24 E-value=4.3 Score=28.45 Aligned_cols=47 Identities=23% Similarity=0.592 Sum_probs=35.5
Q ss_pred CccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccc
Q psy8622 38 SNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
....|.||...+..+. ..-.|.|.|+..|...|......||.|+...
T Consensus 104 ~~~~~~~~~g~l~vpt--~~qg~w~qf~~~~p~~~~~~~~~~~d~~~~~ 150 (324)
T KOG0824|consen 104 DHDICYICYGKLTVPT--RIQGCWHQFCYVCPKSNFAMGNDCPDCRGKI 150 (324)
T ss_pred CccceeeeeeeEEecc--cccCceeeeeecCCchhhhhhhccchhhcCc
Confidence 3457888888765331 2234999999999999999888899887754
No 174
>PF14311 DUF4379: Domain of unknown function (DUF4379)
Probab=52.99 E-value=8.8 Score=19.48 Aligned_cols=22 Identities=23% Similarity=0.641 Sum_probs=11.7
Q ss_pred CCCcccHhHHHHHHhcCCccccc
Q psy8622 60 CDHFYHTPCIEPWLQLHGTCPIC 82 (101)
Q Consensus 60 C~H~f~~~Ci~~w~~~~~~CP~C 82 (101)
|+|.|-..= .........||.|
T Consensus 34 Cgh~w~~~v-~~R~~~~~~CP~C 55 (55)
T PF14311_consen 34 CGHEWKASV-NDRTRRGKGCPYC 55 (55)
T ss_pred CCCeeEccH-hhhccCCCCCCCC
Confidence 666654432 2222455678887
No 175
>PF10235 Cript: Microtubule-associated protein CRIPT; InterPro: IPR019367 The CRIPT protein is a cytoskeletal protein involved in microtubule production. This C-terminal domain is essential for binding to the PDZ3 domain of the SAP90 protein, one of a super-family of PDZ-containing proteins that play an important role in coupling the membrane ion channels with their signalling partners [].
Probab=52.78 E-value=8.9 Score=22.00 Aligned_cols=39 Identities=18% Similarity=0.554 Sum_probs=28.3
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
...|-||-..+... +|.||..|.. ....|.+|-..+.+.
T Consensus 44 ~~~C~~CK~~v~q~--------g~~YCq~CAY----kkGiCamCGKki~dt 82 (90)
T PF10235_consen 44 SSKCKICKTKVHQP--------GAKYCQTCAY----KKGICAMCGKKILDT 82 (90)
T ss_pred CccccccccccccC--------CCccChhhhc----ccCcccccCCeeccc
Confidence 45799998765432 6778999964 467899998888543
No 176
>PRK00418 DNA gyrase inhibitor; Reviewed
Probab=52.73 E-value=9.9 Score=20.25 Aligned_cols=9 Identities=33% Similarity=1.217 Sum_probs=5.1
Q ss_pred ccccccccc
Q psy8622 78 TCPICRQTL 86 (101)
Q Consensus 78 ~CP~Cr~~~ 86 (101)
.||.|++.+
T Consensus 8 ~CP~C~k~~ 16 (62)
T PRK00418 8 NCPTCGKPV 16 (62)
T ss_pred cCCCCCCcc
Confidence 466665554
No 177
>PF03119 DNA_ligase_ZBD: NAD-dependent DNA ligase C4 zinc finger domain; InterPro: IPR004149 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the zinc finger domain found in NAD-dependent DNA ligases. DNA ligases catalyse the crucial step of joining the breaks in duplex DNA during DNA replication, repair and recombination, utilizing either ATP or NAD(+) as a cofactor []. This domain is a small zinc binding motif that is presumably DNA binding. It is found only in NAD-dependent DNA ligases. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair; PDB: 1DGS_A 1V9P_B 2OWO_A.
Probab=51.98 E-value=7.4 Score=17.15 Aligned_cols=10 Identities=50% Similarity=1.248 Sum_probs=5.2
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
.||.|...+.
T Consensus 1 ~CP~C~s~l~ 10 (28)
T PF03119_consen 1 TCPVCGSKLV 10 (28)
T ss_dssp B-TTT--BEE
T ss_pred CcCCCCCEeE
Confidence 4888888776
No 178
>PF02318 FYVE_2: FYVE-type zinc finger; InterPro: IPR003315 This entry represents the zinc-binding domain found in rabphilin Rab3A. The small G protein Rab3A plays an important role in the regulation of neurotransmitter release. The crystal structure of the small G protein Rab3A complexed with the effector domain of rabphilin-3A shows that the effector domain of rabphilin-3A contacts Rab3A in two distinct areas. The first interface involves the Rab3A switch I and switch II regions, which are sensitive to the nucleotide-binding state of Rab3A. The second interface consists of a deep pocket in Rab3A that interacts with a SGAWFF structural element of rabphilin-3A. Sequence and structure analysis, and biochemical data suggest that this pocket, or Rab complementarity-determining region (RabCDR), establishes a specific interaction between each Rab protein and its effectors. It has been suggested that RabCDRs could be major determinants of effector specificity during vesicle trafficking and fusion [].; GO: 0008270 zinc ion binding, 0017137 Rab GTPase binding, 0006886 intracellular protein transport; PDB: 2CSZ_A 2ZET_C 1ZBD_B 3BC1_B 2CJS_C 2A20_A.
Probab=51.76 E-value=10 Score=22.59 Aligned_cols=46 Identities=17% Similarity=0.405 Sum_probs=27.3
Q ss_pred CccccccccccccCC--CceEEeCCCCcccHhHHHHHHhcCC--ccccccc
Q psy8622 38 SNLQCSVCWEQFTLD--EAVRKLPCDHFYHTPCIEPWLQLHG--TCPICRQ 84 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~--~~~~~l~C~H~f~~~Ci~~w~~~~~--~CP~Cr~ 84 (101)
....|.+|...|..- ....-..|.|.+|..|-.. ..... .|.+|..
T Consensus 53 ~~~~C~~C~~~fg~l~~~~~~C~~C~~~VC~~C~~~-~~~~~~WlC~vC~k 102 (118)
T PF02318_consen 53 GERHCARCGKPFGFLFNRGRVCVDCKHRVCKKCGVY-SKKEPIWLCKVCQK 102 (118)
T ss_dssp CCSB-TTTS-BCSCTSTTCEEETTTTEEEETTSEEE-TSSSCCEEEHHHHH
T ss_pred CCcchhhhCCcccccCCCCCcCCcCCccccCccCCc-CCCCCCEEChhhHH
Confidence 456899999876322 2334445999999998654 11112 5888754
No 179
>smart00647 IBR In Between Ring fingers. the domains occurs between pairs og RING fingers
Probab=50.96 E-value=4.6 Score=20.71 Aligned_cols=18 Identities=22% Similarity=0.606 Sum_probs=13.5
Q ss_pred eEEe-CCCCcccHhHHHHH
Q psy8622 55 VRKL-PCDHFYHTPCIEPW 72 (101)
Q Consensus 55 ~~~l-~C~H~f~~~Ci~~w 72 (101)
.+.- .|++.||..|...|
T Consensus 40 ~v~C~~C~~~fC~~C~~~~ 58 (64)
T smart00647 40 RVTCPKCGFSFCFRCKVPW 58 (64)
T ss_pred eeECCCCCCeECCCCCCcC
Confidence 3444 48999999888776
No 180
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=50.69 E-value=9 Score=20.18 Aligned_cols=15 Identities=27% Similarity=0.919 Sum_probs=10.6
Q ss_pred cCCcccccccccccC
Q psy8622 75 LHGTCPICRQTLHSE 89 (101)
Q Consensus 75 ~~~~CP~Cr~~~~~~ 89 (101)
.++.||.|..+++++
T Consensus 2 ~HkHC~~CG~~Ip~~ 16 (59)
T PF09889_consen 2 PHKHCPVCGKPIPPD 16 (59)
T ss_pred CCCcCCcCCCcCCcc
Confidence 356788888877654
No 181
>cd04718 BAH_plant_2 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=50.64 E-value=4.5 Score=25.38 Aligned_cols=29 Identities=31% Similarity=0.636 Sum_probs=16.9
Q ss_pred ccHhHHHHHHhcCC----cccccccccccCCcc
Q psy8622 64 YHTPCIEPWLQLHG----TCPICRQTLHSESAE 92 (101)
Q Consensus 64 f~~~Ci~~w~~~~~----~CP~Cr~~~~~~~~~ 92 (101)
||..||..=+..-+ .||.|...-......
T Consensus 2 ~H~~CL~Ppl~~~P~g~W~Cp~C~~~~~~~~~~ 34 (148)
T cd04718 2 FHLCCLRPPLKEVPEGDWICPFCEVEKSGQSAM 34 (148)
T ss_pred cccccCCCCCCCCCCCCcCCCCCcCCCCCCccc
Confidence 67777765443322 689997665444333
No 182
>KOG2231|consensus
Probab=50.40 E-value=19 Score=28.01 Aligned_cols=44 Identities=23% Similarity=0.570 Sum_probs=31.6
Q ss_pred ccccccccccCCCceEEeCCCC-cccHhHHHHHHh--c----CCcccccccccc
Q psy8622 41 QCSVCWEQFTLDEAVRKLPCDH-FYHTPCIEPWLQ--L----HGTCPICRQTLH 87 (101)
Q Consensus 41 ~C~IC~~~~~~~~~~~~l~C~H-~f~~~Ci~~w~~--~----~~~CP~Cr~~~~ 87 (101)
.|+||-..+ +....-.|+| ..|..|..+... . ...||+||..+.
T Consensus 2 ~c~ic~~s~---~~~~~~s~~h~~v~~~~~~R~~~~~~~~~~~~~~~vcr~~~~ 52 (669)
T KOG2231|consen 2 SCAICAFSP---DFVGRGSCGHNEVCATCVVRLRFELNNRKCSNECPVCRREVE 52 (669)
T ss_pred CcceeecCc---cccccccccccccchhhhhhhhhhcccccccccCccccccee
Confidence 588998765 3334456999 899999987642 2 346899999764
No 183
>KOG4218|consensus
Probab=50.31 E-value=18 Score=26.03 Aligned_cols=25 Identities=24% Similarity=0.506 Sum_probs=15.4
Q ss_pred CCCCccccccccccccCCCceEEeCC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPC 60 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C 60 (101)
..+.+..||+|-+.+. +=....|.|
T Consensus 11 dedl~ElCPVCGDkVS-GYHYGLLTC 35 (475)
T KOG4218|consen 11 DEDLGELCPVCGDKVS-GYHYGLLTC 35 (475)
T ss_pred ccccccccccccCccc-cceeeeeeh
Confidence 4455678999999863 223344544
No 184
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=50.15 E-value=13 Score=16.85 Aligned_cols=8 Identities=38% Similarity=1.186 Sum_probs=5.0
Q ss_pred Cccccccc
Q psy8622 77 GTCPICRQ 84 (101)
Q Consensus 77 ~~CP~Cr~ 84 (101)
..||+|..
T Consensus 18 ~~CP~Cg~ 25 (33)
T cd00350 18 WVCPVCGA 25 (33)
T ss_pred CcCcCCCC
Confidence 36777754
No 185
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=49.12 E-value=9.7 Score=26.78 Aligned_cols=47 Identities=19% Similarity=0.401 Sum_probs=29.0
Q ss_pred CCCccccccccccccCCCceEEe--C--CCCcccHhHHHHHHhcCCccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKL--P--CDHFYHTPCIEPWLQLHGTCPICRQ 84 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l--~--C~H~f~~~Ci~~w~~~~~~CP~Cr~ 84 (101)
.+....||+|-..-... ++.+ . =.+.+|.-|-..|-.....||.|..
T Consensus 184 ~~~~~~CPvCGs~P~~s--~v~~~~~~G~RyL~CslC~teW~~~R~~C~~Cg~ 234 (309)
T PRK03564 184 GEQRQFCPVCGSMPVSS--VVQIGTTQGLRYLHCNLCESEWHVVRVKCSNCEQ 234 (309)
T ss_pred ccCCCCCCCCCCcchhh--eeeccCCCCceEEEcCCCCCcccccCccCCCCCC
Confidence 34567899998752111 1111 1 1234566688888777888999965
No 186
>PF05715 zf-piccolo: Piccolo Zn-finger; InterPro: IPR008899 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This (predicted) zinc finger is found in the bassoon and piccolo proteins, both of which are components of the presynaptic cytoskeletal matrix (PCM) assembled at the active zone of neurotransmitter release, where Piccolo plays a role in the trafficking of synaptic vesicles (SVs) [, , ]. The Piccolo zinc fingers were found to interact with the dual prenylated rab3A and VAMP2/Synaptobrevin II receptor PRA1. There are eight conserved cysteines in Piccolo-type zinc fingers, suggesting that they coordinates two zinc ligands. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding, 0045202 synapse
Probab=48.49 E-value=13 Score=19.62 Aligned_cols=9 Identities=56% Similarity=1.527 Sum_probs=4.7
Q ss_pred ccccccccc
Q psy8622 78 TCPICRQTL 86 (101)
Q Consensus 78 ~CP~Cr~~~ 86 (101)
.||+|+..+
T Consensus 4 ~CPlCkt~~ 12 (61)
T PF05715_consen 4 LCPLCKTTL 12 (61)
T ss_pred cCCcccchh
Confidence 455555544
No 187
>PF03884 DUF329: Domain of unknown function (DUF329); InterPro: IPR005584 The biological function of these short proteins is unknown, but they contain four conserved cysteines, suggesting that they all bind zinc. YacG (Q5X8H6 from SWISSPROT) from Escherichia coli has been shown to bind zinc and contains the structural motifs typical of zinc-binding proteins []. The conserved four cysteine motif in these proteins (-C-X(2)-C-X(15)-C-X(3)-C-) is not found in other zinc-binding proteins with known structures.; GO: 0008270 zinc ion binding; PDB: 1LV3_A.
Probab=46.34 E-value=9.8 Score=19.89 Aligned_cols=10 Identities=40% Similarity=1.248 Sum_probs=3.5
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
.||.|++.+.
T Consensus 4 ~CP~C~k~~~ 13 (57)
T PF03884_consen 4 KCPICGKPVE 13 (57)
T ss_dssp E-TTT--EEE
T ss_pred cCCCCCCeec
Confidence 3555555543
No 188
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=45.26 E-value=9.2 Score=16.49 Aligned_cols=9 Identities=33% Similarity=1.246 Sum_probs=6.9
Q ss_pred ccccccccc
Q psy8622 78 TCPICRQTL 86 (101)
Q Consensus 78 ~CP~Cr~~~ 86 (101)
.||+|.+.+
T Consensus 3 ~CPiC~~~v 11 (26)
T smart00734 3 QCPVCFREV 11 (26)
T ss_pred cCCCCcCcc
Confidence 588887776
No 189
>PRK11595 DNA utilization protein GntX; Provisional
Probab=43.99 E-value=24 Score=23.35 Aligned_cols=39 Identities=21% Similarity=0.398 Sum_probs=19.3
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTL 86 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~ 86 (101)
..|.+|-..+... ...+|..|...+-.....||.|-.++
T Consensus 6 ~~C~~C~~~~~~~--------~~~lC~~C~~~l~~~~~~C~~Cg~~~ 44 (227)
T PRK11595 6 GLCWLCRMPLALS--------HWGICSVCSRALRTLKTCCPQCGLPA 44 (227)
T ss_pred CcCccCCCccCCC--------CCcccHHHHhhCCcccCcCccCCCcC
Confidence 3577776654211 11245666655422234566665543
No 190
>PF06937 EURL: EURL protein; InterPro: IPR009704 This family consists of several animal EURL proteins. EURL is preferentially expressed in chick retinal precursor cells as well as in the anterior epithelial cells of the lens at early stages of development. EURL transcripts are found primarily in the peripheral dorsal retina, i.e., the most undifferentiated part of the dorsal retina. EURL transcripts are also detected in the lens at stage 18 and remain abundant in the proliferating epithelial cells of the lens until at least day 11. The distribution pattern of EURL in the developing retina and lens suggest a role before the events leading to cell determination and differentiation [].
Probab=43.45 E-value=28 Score=24.14 Aligned_cols=44 Identities=20% Similarity=0.399 Sum_probs=25.5
Q ss_pred CCccccccccccccCCCceEEeC-CCC-cccHhHHHHHH-hcCCccc
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLP-CDH-FYHTPCIEPWL-QLHGTCP 80 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~-C~H-~f~~~Ci~~w~-~~~~~CP 80 (101)
+.-..|.||++.-..+.+-..|. -.- .=|.+|+.+|= ..++.||
T Consensus 28 ~tLsfChiCfEl~iegvpks~llHtkSlRGHrdCFEK~HlIanQ~~p 74 (285)
T PF06937_consen 28 ETLSFCHICFELSIEGVPKSNLLHTKSLRGHRDCFEKYHLIANQDCP 74 (285)
T ss_pred cceeecceeeccccccCccccccccccccchHHHHHHHHHHHcCCCC
Confidence 33457889988654332211111 111 34689999984 4577888
No 191
>PF00130 C1_1: Phorbol esters/diacylglycerol binding domain (C1 domain); InterPro: IPR002219 Diacylglycerol (DAG) is an important second messenger. Phorbol esters (PE) are analogues of DAG and potent tumour promoters that cause a variety of physiological changes when administered to both cells and tissues. DAG activates a family of serine/threonine protein kinases, collectively known as protein kinase C (PKC) []. Phorbol esters can directly stimulate PKC. The N-terminal region of PKC, known as C1, has been shown [] to bind PE and DAG in a phospholipid and zinc-dependent fashion. The C1 region contains one or two copies (depending on the isozyme of PKC) of a cysteine-rich domain, which is about 50 amino-acid residues long, and which is essential for DAG/PE-binding. The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are probably the six cysteines and two histidines that are conserved in this domain.; GO: 0035556 intracellular signal transduction; PDB: 1RFH_A 2FNF_X 3PFQ_A 1PTQ_A 1PTR_A 2VRW_B 1XA6_A 2ENN_A 1TBN_A 1TBO_A ....
Probab=43.16 E-value=20 Score=17.69 Aligned_cols=35 Identities=20% Similarity=0.369 Sum_probs=22.2
Q ss_pred CCccccccccccc--cCCCceEEeCCCCcccHhHHHH
Q psy8622 37 DSNLQCSVCWEQF--TLDEAVRKLPCDHFYHTPCIEP 71 (101)
Q Consensus 37 ~~~~~C~IC~~~~--~~~~~~~~l~C~H~f~~~Ci~~ 71 (101)
.....|.+|.+.+ .......-..|+-..|.+|+..
T Consensus 9 ~~~~~C~~C~~~i~g~~~~g~~C~~C~~~~H~~C~~~ 45 (53)
T PF00130_consen 9 SKPTYCDVCGKFIWGLGKQGYRCSWCGLVCHKKCLSK 45 (53)
T ss_dssp SSTEB-TTSSSBECSSSSCEEEETTTT-EEETTGGCT
T ss_pred CCCCCCcccCcccCCCCCCeEEECCCCChHhhhhhhh
Confidence 4566899999987 2222333345899999999754
No 192
>PF14353 CpXC: CpXC protein
Probab=41.86 E-value=33 Score=20.46 Aligned_cols=11 Identities=36% Similarity=0.972 Sum_probs=8.5
Q ss_pred ccccccccccc
Q psy8622 78 TCPICRQTLHS 88 (101)
Q Consensus 78 ~CP~Cr~~~~~ 88 (101)
+||.|...+..
T Consensus 40 ~CP~Cg~~~~~ 50 (128)
T PF14353_consen 40 TCPSCGHKFRL 50 (128)
T ss_pred ECCCCCCceec
Confidence 79999887654
No 193
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=40.36 E-value=10 Score=19.52 Aligned_cols=10 Identities=50% Similarity=1.308 Sum_probs=4.5
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
+||+|.+.+.
T Consensus 26 tCP~C~a~~~ 35 (54)
T PF09237_consen 26 TCPICGAVIR 35 (54)
T ss_dssp E-TTT--EES
T ss_pred CCCcchhhcc
Confidence 5777766554
No 194
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=39.11 E-value=6.5 Score=27.55 Aligned_cols=44 Identities=18% Similarity=0.508 Sum_probs=28.2
Q ss_pred CCccccccccccccC--------CCceEEeCCCCcccHhHHHHHHhcCCcccccccc
Q psy8622 37 DSNLQCSVCWEQFTL--------DEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQT 85 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~--------~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~ 85 (101)
+....||+|-..-.. .+..+ +.+|.-|-..|-.....||.|...
T Consensus 182 ~~~~~CPvCGs~P~~s~~~~~~~~~G~R-----yL~CslC~teW~~~R~~C~~Cg~~ 233 (305)
T TIGR01562 182 ESRTLCPACGSPPVASMVRQGGKETGLR-----YLSCSLCATEWHYVRVKCSHCEES 233 (305)
T ss_pred CCCCcCCCCCChhhhhhhcccCCCCCce-----EEEcCCCCCcccccCccCCCCCCC
Confidence 345589999875311 11122 345556888887778889999753
No 195
>KOG1729|consensus
Probab=38.57 E-value=4.7 Score=28.05 Aligned_cols=36 Identities=19% Similarity=0.533 Sum_probs=26.7
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL 75 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~ 75 (101)
..|.+|++.+..+.......|...||..|+..|+..
T Consensus 215 rvC~~CF~el~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (288)
T KOG1729|consen 215 RVCDICFEELEKGARGDREDSLPVFHGKCYPNWLTT 250 (288)
T ss_pred eecHHHHHHHhcccccchhhcccccccccccccccc
Confidence 389999999964333444556668999999999764
No 196
>KOG2979|consensus
Probab=38.47 E-value=18 Score=24.82 Aligned_cols=44 Identities=18% Similarity=0.221 Sum_probs=31.6
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPICRQ 84 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~Cr~ 84 (101)
...|||=...+.. +++...|+|+|=.+-+...+.. .-.||+=..
T Consensus 176 s~rdPis~~~I~n--PviSkkC~HvydrDsI~~~l~~~~~i~CPv~gC 221 (262)
T KOG2979|consen 176 SNRDPISKKPIVN--PVISKKCGHVYDRDSIMQILCDEITIRCPVLGC 221 (262)
T ss_pred cccCchhhhhhhc--hhhhcCcCcchhhhhHHHHhccCceeecccccC
Confidence 5688887777653 3455669999999999988765 335776433
No 197
>COG5319 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.25 E-value=11 Score=23.10 Aligned_cols=10 Identities=50% Similarity=1.102 Sum_probs=7.4
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
+||+|...-.
T Consensus 34 ~CPvCgs~~V 43 (142)
T COG5319 34 TCPVCGSTEV 43 (142)
T ss_pred eCCCCCcHHH
Confidence 7999976443
No 198
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=37.84 E-value=4.6 Score=19.50 Aligned_cols=9 Identities=33% Similarity=0.844 Sum_probs=6.9
Q ss_pred CCccccccc
Q psy8622 76 HGTCPICRQ 84 (101)
Q Consensus 76 ~~~CP~Cr~ 84 (101)
...||.|..
T Consensus 26 ~~~CP~Cg~ 34 (42)
T PF09723_consen 26 PVPCPECGS 34 (42)
T ss_pred CCcCCCCCC
Confidence 457999977
No 199
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.70 E-value=17 Score=21.51 Aligned_cols=12 Identities=33% Similarity=0.897 Sum_probs=8.0
Q ss_pred cccccccccccC
Q psy8622 78 TCPICRQTLHSE 89 (101)
Q Consensus 78 ~CP~Cr~~~~~~ 89 (101)
+||.|...+...
T Consensus 28 vCP~CG~~~~~~ 39 (108)
T PF09538_consen 28 VCPKCGTEFPPE 39 (108)
T ss_pred cCCCCCCccCcc
Confidence 577777766554
No 200
>COG4098 comFA Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein) [DNA replication, recombination, and repair]
Probab=36.84 E-value=10 Score=27.51 Aligned_cols=38 Identities=24% Similarity=0.589 Sum_probs=26.4
Q ss_pred cccCCCCccccccccccccCCCceEEeCCCCcccHhHHHH
Q psy8622 32 QVRTDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEP 71 (101)
Q Consensus 32 ~~~~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~ 71 (101)
..........|.-|-..- ......+||+-.||..|+.-
T Consensus 32 s~~~~~gk~~C~RC~~~~--~~~~~~lp~~~~YCr~Cl~m 69 (441)
T COG4098 32 SIIIENGKYRCNRCGNTH--IELFAKLPCGCLYCRNCLMM 69 (441)
T ss_pred ceecccCcEEehhcCCcc--hhhhcccccceEeehhhhhc
Confidence 344455677899988431 13346789999999999964
No 201
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=36.56 E-value=11 Score=16.06 Aligned_cols=11 Identities=36% Similarity=1.177 Sum_probs=6.8
Q ss_pred ccccccccccc
Q psy8622 78 TCPICRQTLHS 88 (101)
Q Consensus 78 ~CP~Cr~~~~~ 88 (101)
.||.|.+.+..
T Consensus 4 ~C~~CgR~F~~ 14 (25)
T PF13913_consen 4 PCPICGRKFNP 14 (25)
T ss_pred cCCCCCCEECH
Confidence 46777666644
No 202
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=35.91 E-value=38 Score=20.31 Aligned_cols=20 Identities=20% Similarity=0.426 Sum_probs=14.4
Q ss_pred HHHhcCCcccccccccccCC
Q psy8622 71 PWLQLHGTCPICRQTLHSES 90 (101)
Q Consensus 71 ~w~~~~~~CP~Cr~~~~~~~ 90 (101)
+.+.+...|+.|++++..+.
T Consensus 80 KmLGr~D~CM~C~~pLTLd~ 99 (114)
T PF11023_consen 80 KMLGRVDACMHCKEPLTLDP 99 (114)
T ss_pred hhhchhhccCcCCCcCccCc
Confidence 34455568999999997653
No 203
>PRK01343 zinc-binding protein; Provisional
Probab=35.88 E-value=18 Score=18.89 Aligned_cols=12 Identities=25% Similarity=0.692 Sum_probs=9.2
Q ss_pred CCcccccccccc
Q psy8622 76 HGTCPICRQTLH 87 (101)
Q Consensus 76 ~~~CP~Cr~~~~ 87 (101)
...||+|++++.
T Consensus 9 ~~~CP~C~k~~~ 20 (57)
T PRK01343 9 TRPCPECGKPST 20 (57)
T ss_pred CCcCCCCCCcCc
Confidence 457999998764
No 204
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=35.46 E-value=45 Score=17.41 Aligned_cols=35 Identities=17% Similarity=0.262 Sum_probs=18.7
Q ss_pred ccCCCCccccccccccccC--CCceEEeC-CCCcccHh
Q psy8622 33 VRTDDSNLQCSVCWEQFTL--DEAVRKLP-CDHFYHTP 67 (101)
Q Consensus 33 ~~~~~~~~~C~IC~~~~~~--~~~~~~l~-C~H~f~~~ 67 (101)
+.+......|+.|-..... .......+ ||+.++.+
T Consensus 22 v~~~~TSq~C~~CG~~~~~~~~~r~~~C~~Cg~~~~rD 59 (69)
T PF07282_consen 22 VDEAYTSQTCPRCGHRNKKRRSGRVFTCPNCGFEMDRD 59 (69)
T ss_pred ECCCCCccCccCcccccccccccceEEcCCCCCEECcH
Confidence 3344456678888876543 12223333 66666554
No 205
>smart00109 C1 Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Probab=35.44 E-value=33 Score=16.18 Aligned_cols=34 Identities=24% Similarity=0.373 Sum_probs=22.1
Q ss_pred CccccccccccccCCC-ceEEeCCCCcccHhHHHH
Q psy8622 38 SNLQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIEP 71 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~~ 71 (101)
....|.+|.+.+.... ...-..|+-..|..|...
T Consensus 10 ~~~~C~~C~~~i~~~~~~~~C~~C~~~~H~~C~~~ 44 (49)
T smart00109 10 KPTKCCVCRKSIWGSFQGLRCSWCKVKCHKKCAEK 44 (49)
T ss_pred CCCCccccccccCcCCCCcCCCCCCchHHHHHHhh
Confidence 3457999998875322 122223777889999875
No 206
>KOG2071|consensus
Probab=34.98 E-value=26 Score=26.87 Aligned_cols=36 Identities=22% Similarity=0.585 Sum_probs=24.0
Q ss_pred CCccccccccccccCC----C------ceEEeCCCCcccHhHHHHH
Q psy8622 37 DSNLQCSVCWEQFTLD----E------AVRKLPCDHFYHTPCIEPW 72 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~----~------~~~~l~C~H~f~~~Ci~~w 72 (101)
+....|+||.|.|..- + ..+.+.=|.+||..|+..-
T Consensus 511 e~~~~C~IC~EkFe~v~d~e~~~Wm~kdaV~le~G~ifH~~Cl~e~ 556 (579)
T KOG2071|consen 511 ERQASCPICQEKFEVVFDQEEDLWMYKDAVYLEFGRIFHSKCLSEK 556 (579)
T ss_pred ccccCCcccccccceeecchhhheeecceeeeccCceeeccccchH
Confidence 6678999999988311 1 1122224889999999764
No 207
>PF01485 IBR: IBR domain; InterPro: IPR002867 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a cysteine-rich (C6HC) zinc finger domain that is present in Triad1, and which is conserved in other proteins encoded by various eukaryotes. The C6HC consensus pattern is: C-x(4)-C-x(14-30)-C-x(1-4)-C-x(4)-C-x(2)-C-x(4)-H-x(4)-C The C6HC zinc finger motif is the fourth family member of the zinc-binding RING, LIM, and LAP/PHD fingers. Strikingly, in most of the proteins the C6HC domain is flanked by two RING finger structures IPR001841 from INTERPRO. The novel C6HC motif has been called DRIL (double RING finger linked). The strong conservation of the larger tripartite TRIAD (twoRING fingers and DRIL) structure indicates that the three subdomains are functionally linked and identifies a novel class of proteins []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2CT7_A 1WD2_A 2JMO_A 1WIM_A.
Probab=34.72 E-value=2.8 Score=21.53 Aligned_cols=33 Identities=27% Similarity=0.579 Sum_probs=16.1
Q ss_pred ccccc--ccccccCCC--ce--EEeC-CCCcccHhHHHHH
Q psy8622 40 LQCSV--CWEQFTLDE--AV--RKLP-CDHFYHTPCIEPW 72 (101)
Q Consensus 40 ~~C~I--C~~~~~~~~--~~--~~l~-C~H~f~~~Ci~~w 72 (101)
..||- |-..+.... .. +.-+ |++.||..|...|
T Consensus 19 ~~Cp~~~C~~~~~~~~~~~~~~~~C~~C~~~fC~~C~~~~ 58 (64)
T PF01485_consen 19 RWCPNPDCEYIIEKDDGCNSPIVTCPSCGTEFCFKCGEPW 58 (64)
T ss_dssp C--TTSST---ECS-SSTTS--CCTTSCCSEECSSSTSES
T ss_pred cCCCCCCCcccEEecCCCCCCeeECCCCCCcCccccCccc
Confidence 36765 666553332 11 3334 8899988887665
No 208
>PF12660 zf-TFIIIC: Putative zinc-finger of transcription factor IIIC complex; InterPro: IPR024764 This zinc-finger domain is at the very C terminus of a number of different TFIIIC subunit proteins. This domain might be involved in protein-DNA and/or protein-protein interactions [].; PDB: 2J04_C.
Probab=34.15 E-value=7.1 Score=22.63 Aligned_cols=50 Identities=18% Similarity=0.549 Sum_probs=15.1
Q ss_pred cccccccccccCCC-ceEEeCCCCcccHhHHHHHHhc----CCcccccccccccCC
Q psy8622 40 LQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIEPWLQL----HGTCPICRQTLHSES 90 (101)
Q Consensus 40 ~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~~w~~~----~~~CP~Cr~~~~~~~ 90 (101)
..|+||.+.+.-.+ ....-.=||.| ..|....+.- .+.|++|...+....
T Consensus 15 E~C~~C~~~i~~~~~~~~~C~~GH~w-~RC~lT~l~i~~~~~r~C~~C~~~~l~~~ 69 (99)
T PF12660_consen 15 EKCPICGAPIPFDDLDEAQCENGHVW-PRCALTFLPIQTPGVRVCPVCGRRALDPE 69 (99)
T ss_dssp --------------SSEEE-TTS-EE-EB-SSS-SBS-SS-EEE-TTT--EEE-GG
T ss_pred ccccccccccccCCcCEeECCCCCEE-eeeeeeeeeeccCCeeEcCCCCCEEecCc
Confidence 58999998774333 22333347877 3565554421 257999988775543
No 209
>cd00029 C1 Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Probab=34.10 E-value=26 Score=16.75 Aligned_cols=34 Identities=21% Similarity=0.339 Sum_probs=21.6
Q ss_pred CccccccccccccCC--CceEEeCCCCcccHhHHHH
Q psy8622 38 SNLQCSVCWEQFTLD--EAVRKLPCDHFYHTPCIEP 71 (101)
Q Consensus 38 ~~~~C~IC~~~~~~~--~~~~~l~C~H~f~~~Ci~~ 71 (101)
....|.+|.+.+... ....-..|+-..|..|...
T Consensus 10 ~~~~C~~C~~~i~~~~~~~~~C~~C~~~~H~~C~~~ 45 (50)
T cd00029 10 KPTFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADK 45 (50)
T ss_pred CCCChhhcchhhhccccceeEcCCCCCchhhhhhcc
Confidence 345699998887532 2222234788888888764
No 210
>KOG1245|consensus
Probab=34.00 E-value=14 Score=31.33 Aligned_cols=51 Identities=25% Similarity=0.477 Sum_probs=34.6
Q ss_pred CCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCC----ccccccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHG----TCPICRQTL 86 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~----~CP~Cr~~~ 86 (101)
......|.+|.........+.-.-|.-.||..|+..-+...+ .||-||..-
T Consensus 1105 s~~~~~c~~cr~k~~~~~m~lc~~c~~~~h~~C~rp~~~~~~~~dW~C~~c~~e~ 1159 (1404)
T KOG1245|consen 1105 SAVNALCKVCRRKKQDEKMLLCDECLSGFHLFCLRPALSSVPPGDWMCPSCRKEH 1159 (1404)
T ss_pred ccchhhhhhhhhcccchhhhhhHhhhhhHHHHhhhhhhccCCcCCccCCccchhh
Confidence 334568999998764322222223888999999988775433 699998765
No 211
>COG4647 AcxC Acetone carboxylase, gamma subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=33.26 E-value=28 Score=21.51 Aligned_cols=26 Identities=23% Similarity=0.542 Sum_probs=15.8
Q ss_pred ccccccccCCCceEEeCCCCcccHhHHHHH
Q psy8622 43 SVCWEQFTLDEAVRKLPCDHFYHTPCIEPW 72 (101)
Q Consensus 43 ~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w 72 (101)
-||... ...+....|||.|+.. -..|
T Consensus 61 fi~qs~---~~rv~rcecghsf~d~-r~nw 86 (165)
T COG4647 61 FICQSA---QKRVIRCECGHSFGDY-RENW 86 (165)
T ss_pred EEEecc---cccEEEEeccccccCh-hhCc
Confidence 356553 1345667799999753 3444
No 212
>PF09297 zf-NADH-PPase: NADH pyrophosphatase zinc ribbon domain; InterPro: IPR015376 This domain has a zinc ribbon structure and is often found between two NUDIX domains.; GO: 0016787 hydrolase activity, 0046872 metal ion binding; PDB: 1VK6_A 2GB5_A.
Probab=31.80 E-value=4.7 Score=18.17 Aligned_cols=6 Identities=50% Similarity=1.531 Sum_probs=2.9
Q ss_pred cccccc
Q psy8622 78 TCPICR 83 (101)
Q Consensus 78 ~CP~Cr 83 (101)
.||.|.
T Consensus 23 ~C~~Cg 28 (32)
T PF09297_consen 23 RCPSCG 28 (32)
T ss_dssp EESSSS
T ss_pred ECCCCc
Confidence 455554
No 213
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=31.71 E-value=27 Score=22.13 Aligned_cols=26 Identities=15% Similarity=0.551 Sum_probs=19.1
Q ss_pred CCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 61 DHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 61 ~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
.+.||..|-.+-.. .||.|..++.-+
T Consensus 27 ~~~fC~kCG~~tI~---~Cp~C~~~IrG~ 52 (158)
T PF10083_consen 27 REKFCSKCGAKTIT---SCPNCSTPIRGD 52 (158)
T ss_pred HHHHHHHhhHHHHH---HCcCCCCCCCCc
Confidence 45799999877543 599998887543
No 214
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=30.76 E-value=34 Score=25.41 Aligned_cols=31 Identities=23% Similarity=0.624 Sum_probs=18.9
Q ss_pred ccccccccccCC-CceEEe---CCCCcccHhHHHHH
Q psy8622 41 QCSVCWEQFTLD-EAVRKL---PCDHFYHTPCIEPW 72 (101)
Q Consensus 41 ~C~IC~~~~~~~-~~~~~l---~C~H~f~~~Ci~~w 72 (101)
.|.||.. |... +...-+ -|+|.-|..|..+-
T Consensus 130 ~C~iC~k-fD~~~n~~~Wi~Cd~CgH~cH~dCALr~ 164 (446)
T PF07227_consen 130 MCCICSK-FDDNKNTCSWIGCDVCGHWCHLDCALRH 164 (446)
T ss_pred CccccCC-cccCCCCeeEEeccCCCceehhhhhccc
Confidence 5667755 3222 222222 28999999998764
No 215
>PF06677 Auto_anti-p27: Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27); InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=30.70 E-value=33 Score=16.56 Aligned_cols=19 Identities=32% Similarity=0.767 Sum_probs=12.8
Q ss_pred HHHHhcCCccccccccccc
Q psy8622 70 EPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 70 ~~w~~~~~~CP~Cr~~~~~ 88 (101)
.-|.--..+||.|..++-.
T Consensus 11 ~G~~ML~~~Cp~C~~PL~~ 29 (41)
T PF06677_consen 11 QGWTMLDEHCPDCGTPLMR 29 (41)
T ss_pred HhHhHhcCccCCCCCeeEE
Confidence 4455556789999776644
No 216
>PTZ00303 phosphatidylinositol kinase; Provisional
Probab=30.27 E-value=37 Score=27.46 Aligned_cols=33 Identities=21% Similarity=0.295 Sum_probs=22.5
Q ss_pred cccccccccccCCCc---eE---EeCCCCcccHhHHHHH
Q psy8622 40 LQCSVCWEQFTLDEA---VR---KLPCDHFYHTPCIEPW 72 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~---~~---~l~C~H~f~~~Ci~~w 72 (101)
..|..|...|..-.. .+ --.||.+||..|-...
T Consensus 461 dtC~~C~kkFfSlsK~L~~RKHHCRkCGrVFC~~CSSnR 499 (1374)
T PTZ00303 461 DSCPSCGRAFISLSRPLGTRAHHCRSCGIRLCVFCITKR 499 (1374)
T ss_pred CcccCcCCcccccccccccccccccCCccccCccccCCc
Confidence 579999999853211 01 1249999999998654
No 217
>COG3024 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.40 E-value=36 Score=18.25 Aligned_cols=10 Identities=30% Similarity=1.016 Sum_probs=4.8
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
.||.|.+++.
T Consensus 9 ~CP~Cgkpv~ 18 (65)
T COG3024 9 PCPTCGKPVV 18 (65)
T ss_pred cCCCCCCccc
Confidence 3555544443
No 218
>KOG1842|consensus
Probab=29.26 E-value=19 Score=26.73 Aligned_cols=38 Identities=21% Similarity=0.364 Sum_probs=25.8
Q ss_pred cCCCCccccccccccccCCCceEEeC-CCCcccHhHHHH
Q psy8622 34 RTDDSNLQCSVCWEQFTLDEAVRKLP-CDHFYHTPCIEP 71 (101)
Q Consensus 34 ~~~~~~~~C~IC~~~~~~~~~~~~l~-C~H~f~~~Ci~~ 71 (101)
..+.....||+|-+.|.....--... ||-+.|.+|...
T Consensus 175 ~DDs~V~~CP~Ca~~F~l~rRrHHCRLCG~VmC~~C~k~ 213 (505)
T KOG1842|consen 175 LDDSSVQFCPECANSFGLTRRRHHCRLCGRVMCRDCSKF 213 (505)
T ss_pred cCCCcccccccccchhhhHHHhhhhhhcchHHHHHHHHh
Confidence 34556788999999985432111112 999999999865
No 219
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=29.23 E-value=55 Score=20.53 Aligned_cols=17 Identities=35% Similarity=0.604 Sum_probs=12.4
Q ss_pred hcCCcccccccccccCC
Q psy8622 74 QLHGTCPICRQTLHSES 90 (101)
Q Consensus 74 ~~~~~CP~Cr~~~~~~~ 90 (101)
.....||.|...+...+
T Consensus 126 ~~~F~Cp~Cg~~L~~~d 142 (158)
T TIGR00373 126 ELNFTCPRCGAMLDYLD 142 (158)
T ss_pred HcCCcCCCCCCEeeecc
Confidence 34678999998886543
No 220
>KOG1512|consensus
Probab=28.69 E-value=23 Score=24.83 Aligned_cols=32 Identities=25% Similarity=0.381 Sum_probs=21.4
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHH
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIE 70 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~ 70 (101)
-..|.||.......+.++--.|...||.-|+.
T Consensus 314 C~lC~IC~~P~~E~E~~FCD~CDRG~HT~CVG 345 (381)
T KOG1512|consen 314 CELCRICLGPVIESEHLFCDVCDRGPHTLCVG 345 (381)
T ss_pred cHhhhccCCcccchheeccccccCCCCccccc
Confidence 35688888876555444444488888888874
No 221
>COG5627 MMS21 DNA repair protein MMS21 [DNA replication, recombination, and repair]
Probab=28.68 E-value=28 Score=23.73 Aligned_cols=43 Identities=19% Similarity=0.275 Sum_probs=30.3
Q ss_pred CCccccccccccccCCCceEEeCCCCcccHhHHHHHHhc--CCcccc
Q psy8622 37 DSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL--HGTCPI 81 (101)
Q Consensus 37 ~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~--~~~CP~ 81 (101)
.-+..|||-+..+.. +.....|.|.|-.+-|...++. ...||.
T Consensus 187 ~~~nrCpitl~p~~~--pils~kcnh~~e~D~I~~~lq~~~trvcp~ 231 (275)
T COG5627 187 LLSNRCPITLNPDFY--PILSSKCNHKPEMDLINKKLQVECTRVCPR 231 (275)
T ss_pred hhcccCCcccCcchh--HHHHhhhcccccHHHHHHHhcCCceeecch
Confidence 346789998876542 2344459999999999998874 445663
No 222
>KOG4577|consensus
Probab=28.58 E-value=16 Score=25.62 Aligned_cols=31 Identities=29% Similarity=0.569 Sum_probs=16.4
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHH
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIE 70 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~ 70 (101)
...|..|.+.+-+. .++.-.=.|+||..|+.
T Consensus 92 GTKCsaC~~GIpPt-qVVRkAqd~VYHl~CF~ 122 (383)
T KOG4577|consen 92 GTKCSACQEGIPPT-QVVRKAQDFVYHLHCFA 122 (383)
T ss_pred CCcchhhcCCCChH-HHHHHhhcceeehhhhh
Confidence 34666776665322 22222346677777763
No 223
>PF12088 DUF3565: Protein of unknown function (DUF3565); InterPro: IPR021948 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 30 to 78 amino acids in length. This protein has two conserved sequence motifs: WVA and CGH.
Probab=28.31 E-value=47 Score=17.56 Aligned_cols=15 Identities=33% Similarity=0.534 Sum_probs=11.6
Q ss_pred CceEEeCCCCcccHh
Q psy8622 53 EAVRKLPCDHFYHTP 67 (101)
Q Consensus 53 ~~~~~l~C~H~f~~~ 67 (101)
.-+..|.|||.-|..
T Consensus 10 hWVA~L~CGH~QHvR 24 (61)
T PF12088_consen 10 HWVAELSCGHTQHVR 24 (61)
T ss_pred CEEEEeccccccccc
Confidence 467889999987754
No 224
>PF15616 TerY-C: TerY-C metal binding domain
Probab=28.16 E-value=51 Score=20.28 Aligned_cols=46 Identities=20% Similarity=0.322 Sum_probs=31.1
Q ss_pred CCCCccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCcccccccccccC
Q psy8622 35 TDDSNLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHSE 89 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~~ 89 (101)
.......||-|-..+ ..+.-.|+++||.. . ....+||-|.....-.
T Consensus 73 eL~g~PgCP~CGn~~----~fa~C~CGkl~Ci~---g--~~~~~CPwCg~~g~~~ 118 (131)
T PF15616_consen 73 ELIGAPGCPHCGNQY----AFAVCGCGKLFCID---G--EGEVTCPWCGNEGSFG 118 (131)
T ss_pred HhcCCCCCCCCcChh----cEEEecCCCEEEeC---C--CCCEECCCCCCeeeec
Confidence 344568999998864 23445799998743 1 2356899998877543
No 225
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=28.02 E-value=56 Score=20.05 Aligned_cols=14 Identities=14% Similarity=0.275 Sum_probs=10.4
Q ss_pred CCcccccccccccC
Q psy8622 76 HGTCPICRQTLHSE 89 (101)
Q Consensus 76 ~~~CP~Cr~~~~~~ 89 (101)
...||.|...+...
T Consensus 26 p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 26 PAVSPYTGEQFPPE 39 (129)
T ss_pred CccCCCcCCccCcc
Confidence 34799998887554
No 226
>PF15353 HECA: Headcase protein family homologue
Probab=27.42 E-value=47 Score=19.68 Aligned_cols=15 Identities=33% Similarity=1.003 Sum_probs=12.5
Q ss_pred CCCCcccHhHHHHHH
Q psy8622 59 PCDHFYHTPCIEPWL 73 (101)
Q Consensus 59 ~C~H~f~~~Ci~~w~ 73 (101)
|.++..|..|+..|-
T Consensus 39 p~~~~MH~~CF~~wE 53 (107)
T PF15353_consen 39 PFGQYMHRECFEKWE 53 (107)
T ss_pred CCCCchHHHHHHHHH
Confidence 457889999999983
No 227
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=27.36 E-value=25 Score=13.62 Aligned_cols=9 Identities=44% Similarity=1.449 Sum_probs=3.8
Q ss_pred ccccccccc
Q psy8622 79 CPICRQTLH 87 (101)
Q Consensus 79 CP~Cr~~~~ 87 (101)
|++|...+.
T Consensus 3 C~~C~~~~~ 11 (24)
T PF13894_consen 3 CPICGKSFR 11 (24)
T ss_dssp -SSTS-EES
T ss_pred CcCCCCcCC
Confidence 666655543
No 228
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=27.00 E-value=36 Score=17.98 Aligned_cols=16 Identities=19% Similarity=0.684 Sum_probs=11.3
Q ss_pred cCCcccccccccccCC
Q psy8622 75 LHGTCPICRQTLHSES 90 (101)
Q Consensus 75 ~~~~CP~Cr~~~~~~~ 90 (101)
.++.|++|.+.++.+.
T Consensus 7 PH~HC~VCg~aIp~de 22 (64)
T COG4068 7 PHRHCVVCGKAIPPDE 22 (64)
T ss_pred CCccccccCCcCCCcc
Confidence 3557899988876553
No 229
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.90 E-value=48 Score=20.36 Aligned_cols=23 Identities=26% Similarity=0.739 Sum_probs=11.6
Q ss_pred cccHhHHHHHHhcCCccccccccccc
Q psy8622 63 FYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 63 ~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
.||..|-..-. ..||.|..++.-
T Consensus 29 afcskcgeati---~qcp~csasirg 51 (160)
T COG4306 29 AFCSKCGEATI---TQCPICSASIRG 51 (160)
T ss_pred HHHhhhchHHH---hcCCccCCcccc
Confidence 35555544322 246777666543
No 230
>PF06750 DiS_P_DiS: Bacterial Peptidase A24 N-terminal domain; InterPro: IPR010627 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This domain is found at the N terminus of bacterial aspartic peptidases belonging to MEROPS peptidase family A24 (clan AD), subfamily A24A (type IV prepilin peptidase, IPR000045 from INTERPRO). It's function has not been specifically determined; however some of the family have been characterised as bifunctional [], and this domain may contain the N-methylation activity. The domain consists of an intracellular region between a pair of transmembrane domains. This intracellular region contains an invariant proline and four conserved cysteines. These Cys residues are arranged in a two-pair motif, with the Cys residues of a pair separated (usually) by 2 aa and with each pair separated by 21 largely hydrophilic residues (C-X-X-C...X21...C-X-X-C); they have been shown to be essential to the overall function of the enzyme [, ]. The bifunctional enzyme prepilin peptidase (PilD) from Pseudomonas aeruginosa is a key determinant in both type-IV pilus biogenesis and extracellular protein secretion, in its roles as a leader peptidase and methyl transferase (MTase). It is responsible for endopeptidic cleavage of the unique leader peptides that characterise type-IV pilin precursors, as well as proteins with homologous leader sequences that are essential components of the general secretion pathway found in a variety of Gram-negative pathogens. Following removal of the leader peptides, the same enzyme is responsible for the second posttranslational modification that characterises the type-IV pilins and their homologues, namely N-methylation of the newly exposed N-terminal amino acid residue [].
Probab=26.82 E-value=51 Score=18.74 Aligned_cols=38 Identities=24% Similarity=0.589 Sum_probs=25.8
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHHHHhcCCccccccccccc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQLHGTCPICRQTLHS 88 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~~ 88 (101)
...|+-|...+.--+ .+| +..|+..+..|..|++++..
T Consensus 33 rS~C~~C~~~L~~~~---lIP---------i~S~l~lrGrCr~C~~~I~~ 70 (92)
T PF06750_consen 33 RSHCPHCGHPLSWWD---LIP---------ILSYLLLRGRCRYCGAPIPP 70 (92)
T ss_pred CCcCcCCCCcCcccc---cch---------HHHHHHhCCCCcccCCCCCh
Confidence 456777777664322 222 56788888999999998854
No 231
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=26.80 E-value=38 Score=20.15 Aligned_cols=11 Identities=27% Similarity=0.761 Sum_probs=8.1
Q ss_pred ccccccccccc
Q psy8622 40 LQCSVCWEQFT 50 (101)
Q Consensus 40 ~~C~IC~~~~~ 50 (101)
..||-|..+|.
T Consensus 3 p~CP~C~seyt 13 (109)
T TIGR00686 3 PPCPKCNSEYT 13 (109)
T ss_pred CcCCcCCCcce
Confidence 46888888764
No 232
>TIGR02652 conserved hypothetical protein TIGR02652, cyanobacterial. Members of this family of conserved hypothetical proteins are found, so far, only in the Cyanobacteria. Members are about 170 amino acids long and share a motif CxxCx(14)CxxH near the amino end.
Probab=26.69 E-value=23 Score=22.12 Aligned_cols=16 Identities=31% Similarity=0.598 Sum_probs=11.3
Q ss_pred hcCCcccccccccccC
Q psy8622 74 QLHGTCPICRQTLHSE 89 (101)
Q Consensus 74 ~~~~~CP~Cr~~~~~~ 89 (101)
...-.||.||+.+..-
T Consensus 7 Gpei~CPhCRQ~ipAL 22 (163)
T TIGR02652 7 GPEIRCPHCRQNIPAL 22 (163)
T ss_pred CCcCcCchhhcccchh
Confidence 3455799999987643
No 233
>PRK13130 H/ACA RNA-protein complex component Nop10p; Reviewed
Probab=26.53 E-value=62 Score=16.81 Aligned_cols=21 Identities=19% Similarity=0.637 Sum_probs=12.9
Q ss_pred CCcccccccccccCCcccCCC
Q psy8622 76 HGTCPICRQTLHSESAESLGP 96 (101)
Q Consensus 76 ~~~CP~Cr~~~~~~~~~~~~~ 96 (101)
+..||.|............+|
T Consensus 17 k~~CP~CG~~t~~~~P~rfSp 37 (56)
T PRK13130 17 KEICPVCGGKTKNPHPPRFSP 37 (56)
T ss_pred cccCcCCCCCCCCCCCCCCCC
Confidence 456888887766554444443
No 234
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=26.42 E-value=35 Score=15.62 Aligned_cols=7 Identities=57% Similarity=1.592 Sum_probs=4.3
Q ss_pred ccccccc
Q psy8622 78 TCPICRQ 84 (101)
Q Consensus 78 ~CP~Cr~ 84 (101)
.||+|.+
T Consensus 20 ~CP~Cg~ 26 (34)
T cd00729 20 KCPICGA 26 (34)
T ss_pred cCcCCCC
Confidence 5666654
No 235
>COG2835 Uncharacterized conserved protein [Function unknown]
Probab=26.27 E-value=27 Score=18.45 Aligned_cols=10 Identities=40% Similarity=1.268 Sum_probs=8.0
Q ss_pred cccccccccc
Q psy8622 78 TCPICRQTLH 87 (101)
Q Consensus 78 ~CP~Cr~~~~ 87 (101)
.||+||.++.
T Consensus 10 aCP~~kg~L~ 19 (60)
T COG2835 10 ACPVCKGPLV 19 (60)
T ss_pred eccCcCCcce
Confidence 5999998864
No 236
>TIGR01367 pyrE_Therm orotate phosphoribosyltransferase, Thermus family. This model represents a distinct clade of orotate phosphoribosyltransferases. Members include the experimentally determined example from Thermus aquaticus and additional examples from Caulobacter crescentus, Helicobacter pylori, Mesorhizobium loti, and related species.
Probab=26.08 E-value=33 Score=22.12 Aligned_cols=16 Identities=25% Similarity=0.729 Sum_probs=12.1
Q ss_pred CCcccccccccccCCc
Q psy8622 76 HGTCPICRQTLHSESA 91 (101)
Q Consensus 76 ~~~CP~Cr~~~~~~~~ 91 (101)
...||+|++.++.++.
T Consensus 169 ~~~c~~~~~~~~~~~~ 184 (187)
T TIGR01367 169 SHECPLCLAGIPAEKP 184 (187)
T ss_pred cccCChhhcCCCCcCC
Confidence 3469999998876654
No 237
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=26.08 E-value=25 Score=13.91 Aligned_cols=9 Identities=44% Similarity=1.420 Sum_probs=5.3
Q ss_pred ccccccccc
Q psy8622 79 CPICRQTLH 87 (101)
Q Consensus 79 CP~Cr~~~~ 87 (101)
||.|...+.
T Consensus 3 C~~C~~~f~ 11 (23)
T PF00096_consen 3 CPICGKSFS 11 (23)
T ss_dssp ETTTTEEES
T ss_pred CCCCCCccC
Confidence 666665554
No 238
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=25.83 E-value=74 Score=20.45 Aligned_cols=17 Identities=24% Similarity=0.512 Sum_probs=12.4
Q ss_pred hcCCcccccccccccCC
Q psy8622 74 QLHGTCPICRQTLHSES 90 (101)
Q Consensus 74 ~~~~~CP~Cr~~~~~~~ 90 (101)
.....||.|...+...+
T Consensus 134 ~~~F~Cp~Cg~~L~~~d 150 (178)
T PRK06266 134 EYGFRCPQCGEMLEEYD 150 (178)
T ss_pred hcCCcCCCCCCCCeecc
Confidence 34678999998886543
No 239
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=25.80 E-value=29 Score=16.39 Aligned_cols=11 Identities=36% Similarity=0.969 Sum_probs=7.7
Q ss_pred ccccccccccc
Q psy8622 78 TCPICRQTLHS 88 (101)
Q Consensus 78 ~CP~Cr~~~~~ 88 (101)
.||.|+..+..
T Consensus 1 ~CP~C~~~l~~ 11 (41)
T PF13453_consen 1 KCPRCGTELEP 11 (41)
T ss_pred CcCCCCcccce
Confidence 38899876643
No 240
>PF12773 DZR: Double zinc ribbon
Probab=25.09 E-value=35 Score=16.64 Aligned_cols=13 Identities=23% Similarity=0.557 Sum_probs=7.9
Q ss_pred CCccccccccccc
Q psy8622 76 HGTCPICRQTLHS 88 (101)
Q Consensus 76 ~~~CP~Cr~~~~~ 88 (101)
...||.|...+..
T Consensus 29 ~~~C~~Cg~~~~~ 41 (50)
T PF12773_consen 29 KKICPNCGAENPP 41 (50)
T ss_pred CCCCcCCcCCCcC
Confidence 3467777776543
No 241
>PF13764 E3_UbLigase_R4: E3 ubiquitin-protein ligase UBR4
Probab=24.99 E-value=20 Score=28.58 Aligned_cols=15 Identities=27% Similarity=0.944 Sum_probs=12.7
Q ss_pred CCCCccccccccccc
Q psy8622 35 TDDSNLQCSVCWEQF 49 (101)
Q Consensus 35 ~~~~~~~C~IC~~~~ 49 (101)
.++....|.||.|.|
T Consensus 464 ~ee~gl~C~ICrEGy 478 (802)
T PF13764_consen 464 EEEDGLTCCICREGY 478 (802)
T ss_pred cccCCCeEEEcCCcc
Confidence 357788999999987
No 242
>PF08882 Acetone_carb_G: Acetone carboxylase gamma subunit; InterPro: IPR014979 Acetone carboxylase is the key enzyme of bacterial acetone metabolism, catalysing the condensation of acetone and CO2 to form acetoacetate [] according to the following reaction: CH3COCH3 + CO2 + ATP = CH3COCH2COO- + AMP + 2P(i) + H+ It has the subunit composition: (alpha(2)beta(2)gamma(2) multimers of 85kDa, 78kDa, and 20kDa subunits). It is expressed to high levels (17 to 25% of soluble protein) in cells grown with acetone as the carbon source but are not present at detectable levels in cells grown with other carbon sources []. Acetone carboxylase may enable Helicobacter pylori to survive off acetone in the stomach of humans and other mammals where it is the etiological agent of peptic ulcer disease []. This entry represents the family of gamma subunit-related acetone carboxylase proteins.
Probab=24.78 E-value=48 Score=19.80 Aligned_cols=12 Identities=33% Similarity=0.553 Sum_probs=9.1
Q ss_pred eEEeCCCCcccH
Q psy8622 55 VRKLPCDHFYHT 66 (101)
Q Consensus 55 ~~~l~C~H~f~~ 66 (101)
.+...|||.|+.
T Consensus 24 ~vkc~CGh~f~d 35 (112)
T PF08882_consen 24 VVKCDCGHEFCD 35 (112)
T ss_pred eeeccCCCeecC
Confidence 455579999975
No 243
>KOG3726|consensus
Probab=23.95 E-value=42 Score=26.37 Aligned_cols=41 Identities=20% Similarity=0.388 Sum_probs=25.5
Q ss_pred cccccccccccCC-CceEEeCCCCcccHhHHHHHHhcCCcccccc
Q psy8622 40 LQCSVCWEQFTLD-EAVRKLPCDHFYHTPCIEPWLQLHGTCPICR 83 (101)
Q Consensus 40 ~~C~IC~~~~~~~-~~~~~l~C~H~f~~~Ci~~w~~~~~~CP~Cr 83 (101)
..|-+|...-.+. .....+.|+-.||.+|... -++.||+|-
T Consensus 655 r~C~vcq~pedse~~v~rt~~C~~~~C~~c~~~---~~~~~~vC~ 696 (717)
T KOG3726|consen 655 RTCKVCQLPEDSETDVCRTTFCYTPYCVACSLD---YASISEVCG 696 (717)
T ss_pred HHHHHhcCCcCccccccCccccCCcchHhhhhh---hhccCcccC
Confidence 4577777642211 1234456999998888554 456789994
No 244
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=22.58 E-value=44 Score=16.34 Aligned_cols=12 Identities=25% Similarity=0.946 Sum_probs=8.6
Q ss_pred CCcccccccccc
Q psy8622 76 HGTCPICRQTLH 87 (101)
Q Consensus 76 ~~~CP~Cr~~~~ 87 (101)
.+.||+|..++.
T Consensus 8 ~K~C~~C~rpf~ 19 (42)
T PF10013_consen 8 SKICPVCGRPFT 19 (42)
T ss_pred CCcCcccCCcch
Confidence 457888877764
No 245
>KOG1818|consensus
Probab=22.54 E-value=38 Score=26.34 Aligned_cols=47 Identities=19% Similarity=0.368 Sum_probs=31.0
Q ss_pred CCCccccccccccccCCCc-eEEeCCCCcccHhHHHHHHh--------cCCccccc
Q psy8622 36 DDSNLQCSVCWEQFTLDEA-VRKLPCDHFYHTPCIEPWLQ--------LHGTCPIC 82 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~~-~~~l~C~H~f~~~Ci~~w~~--------~~~~CP~C 82 (101)
..+...|..|...|..... .-...||-+||..|...-+. ..++|-.|
T Consensus 162 W~D~~~C~rCr~~F~~~~rkHHCr~CG~vFC~qcss~s~~lP~~Gi~~~VRVCd~C 217 (634)
T KOG1818|consen 162 WIDSEECLRCRVKFGLTNRKHHCRNCGQVFCGQCSSKSLTLPKLGIEKPVRVCDSC 217 (634)
T ss_pred cccccccceeeeeeeeccccccccccchhhccCccccccCcccccccccceehhhh
Confidence 3445789999998854332 22234999999999876542 13367777
No 246
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=22.44 E-value=41 Score=16.28 Aligned_cols=13 Identities=23% Similarity=0.585 Sum_probs=8.7
Q ss_pred CcccccccccccC
Q psy8622 77 GTCPICRQTLHSE 89 (101)
Q Consensus 77 ~~CP~Cr~~~~~~ 89 (101)
..||.|...+...
T Consensus 22 ~~Cp~CG~~~~~~ 34 (46)
T PRK00398 22 VRCPYCGYRILFK 34 (46)
T ss_pred eECCCCCCeEEEc
Confidence 3688887766543
No 247
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=22.42 E-value=70 Score=14.83 Aligned_cols=6 Identities=33% Similarity=1.154 Sum_probs=3.4
Q ss_pred cccccc
Q psy8622 78 TCPICR 83 (101)
Q Consensus 78 ~CP~Cr 83 (101)
+|+.|-
T Consensus 23 sC~~CG 28 (35)
T PF07503_consen 23 SCTNCG 28 (35)
T ss_dssp -BTTCC
T ss_pred cCCCCC
Confidence 577774
No 248
>PRK10220 hypothetical protein; Provisional
Probab=22.26 E-value=65 Score=19.20 Aligned_cols=11 Identities=27% Similarity=0.803 Sum_probs=7.8
Q ss_pred ccccccccccc
Q psy8622 40 LQCSVCWEQFT 50 (101)
Q Consensus 40 ~~C~IC~~~~~ 50 (101)
..||-|..+|.
T Consensus 4 P~CP~C~seyt 14 (111)
T PRK10220 4 PHCPKCNSEYT 14 (111)
T ss_pred CcCCCCCCcce
Confidence 45888887764
No 249
>KOG1244|consensus
Probab=21.82 E-value=21 Score=24.83 Aligned_cols=47 Identities=28% Similarity=0.513 Sum_probs=27.7
Q ss_pred cccccccccccCCCceEEeCCCCcccHhHHHHHHhc----CCccccccccc
Q psy8622 40 LQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEPWLQL----HGTCPICRQTL 86 (101)
Q Consensus 40 ~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~w~~~----~~~CP~Cr~~~ 86 (101)
-.|.||-..-..+.-++--.|...||..|+..-+.. .-+|-+|-..+
T Consensus 282 k~csicgtsenddqllfcddcdrgyhmyclsppm~eppegswsc~KOG~~~ 332 (336)
T KOG1244|consen 282 KYCSICGTSENDDQLLFCDDCDRGYHMYCLSPPMVEPPEGSWSCHLCLEEL 332 (336)
T ss_pred ceeccccCcCCCceeEeecccCCceeeEecCCCcCCCCCCchhHHHHHHHH
Confidence 457788765332211222248889999998765432 22688885554
No 250
>KOG4021|consensus
Probab=21.38 E-value=47 Score=22.01 Aligned_cols=20 Identities=40% Similarity=0.868 Sum_probs=12.7
Q ss_pred hHHHHHHh-cCCccccccccc
Q psy8622 67 PCIEPWLQ-LHGTCPICRQTL 86 (101)
Q Consensus 67 ~Ci~~w~~-~~~~CP~Cr~~~ 86 (101)
.||.+--. ...-||+||...
T Consensus 98 tCIrkn~~~~gnpCPICRDey 118 (239)
T KOG4021|consen 98 TCIRKNGRFLGNPCPICRDEY 118 (239)
T ss_pred HHHhhcCeecCCCCCccccce
Confidence 46665332 244699999864
No 251
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=21.32 E-value=92 Score=24.77 Aligned_cols=18 Identities=17% Similarity=0.326 Sum_probs=12.7
Q ss_pred CCCccccccccccccCCC
Q psy8622 36 DDSNLQCSVCWEQFTLDE 53 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~ 53 (101)
..+-..|.-|++++..+.
T Consensus 98 ~pD~a~C~~Cl~Ei~dp~ 115 (750)
T COG0068 98 PPDAATCEDCLEEIFDPN 115 (750)
T ss_pred CCchhhhHHHHHHhcCCC
Confidence 344568999999875444
No 252
>COG4357 Zinc finger domain containing protein (CHY type) [Function unknown]
Probab=20.61 E-value=81 Score=18.42 Aligned_cols=13 Identities=31% Similarity=1.078 Sum_probs=10.0
Q ss_pred CCccccccccccc
Q psy8622 76 HGTCPICRQTLHS 88 (101)
Q Consensus 76 ~~~CP~Cr~~~~~ 88 (101)
...||.|+.++++
T Consensus 80 ~~~Cp~C~spFNp 92 (105)
T COG4357 80 CGSCPYCQSPFNP 92 (105)
T ss_pred cCCCCCcCCCCCc
Confidence 4469999988864
No 253
>KOG2169|consensus
Probab=20.60 E-value=64 Score=25.06 Aligned_cols=48 Identities=25% Similarity=0.650 Sum_probs=25.0
Q ss_pred ccccccccccccCCCceEEeCCCCcccHhHHHH-HH-hc---CC--cccccccccccCCc
Q psy8622 39 NLQCSVCWEQFTLDEAVRKLPCDHFYHTPCIEP-WL-QL---HG--TCPICRQTLHSESA 91 (101)
Q Consensus 39 ~~~C~IC~~~~~~~~~~~~l~C~H~f~~~Ci~~-w~-~~---~~--~CP~Cr~~~~~~~~ 91 (101)
.+.|+|....+.-+ .+...|.|.= |+.. |+ .. .. .||+|.+...-++.
T Consensus 306 SL~CPl~~~Rm~~P--~r~~~CkHlQ---cFD~~~~lq~n~~~pTW~CPVC~~~~~~e~l 360 (636)
T KOG2169|consen 306 SLNCPLSKMRMSLP--ARGHTCKHLQ---CFDALSYLQMNEQKPTWRCPVCQKAAPFEGL 360 (636)
T ss_pred EecCCcccceeecC--Ccccccccce---ecchhhhHHhccCCCeeeCccCCccccccch
Confidence 35677776654322 2333466654 4432 22 21 11 69999887765443
No 254
>KOG1819|consensus
Probab=20.57 E-value=41 Score=25.70 Aligned_cols=35 Identities=20% Similarity=0.369 Sum_probs=20.2
Q ss_pred CCCccccccccccccCCC-ceEEeCCCCcccHhHHH
Q psy8622 36 DDSNLQCSVCWEQFTLDE-AVRKLPCDHFYHTPCIE 70 (101)
Q Consensus 36 ~~~~~~C~IC~~~~~~~~-~~~~l~C~H~f~~~Ci~ 70 (101)
+.....|-.|...|..-. ..-...||-+||..|--
T Consensus 898 d~~a~~cmacq~pf~afrrrhhcrncggifcg~cs~ 933 (990)
T KOG1819|consen 898 DEDAEQCMACQMPFNAFRRRHHCRNCGGIFCGKCSC 933 (990)
T ss_pred CCcchhhhhccCcHHHHHHhhhhcccCceeeccccc
Confidence 334456777777663111 11123499999998853
No 255
>PRK11827 hypothetical protein; Provisional
Probab=20.20 E-value=31 Score=18.23 Aligned_cols=13 Identities=31% Similarity=0.726 Sum_probs=8.2
Q ss_pred CCccccccccccc
Q psy8622 76 HGTCPICRQTLHS 88 (101)
Q Consensus 76 ~~~CP~Cr~~~~~ 88 (101)
--.||.|+.++..
T Consensus 8 ILaCP~ckg~L~~ 20 (60)
T PRK11827 8 IIACPVCNGKLWY 20 (60)
T ss_pred heECCCCCCcCeE
Confidence 3357777776654
No 256
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=20.18 E-value=31 Score=16.11 Aligned_cols=27 Identities=30% Similarity=0.722 Sum_probs=14.1
Q ss_pred CCCCcccHhHHHHHHhcCCcccccccccc
Q psy8622 59 PCDHFYHTPCIEPWLQLHGTCPICRQTLH 87 (101)
Q Consensus 59 ~C~H~f~~~Ci~~w~~~~~~CP~Cr~~~~ 87 (101)
.||.+||..-.-. .....|..|...+.
T Consensus 6 ~Cg~~Yh~~~~pP--~~~~~Cd~cg~~L~ 32 (36)
T PF05191_consen 6 KCGRIYHIEFNPP--KVEGVCDNCGGELV 32 (36)
T ss_dssp TTTEEEETTTB----SSTTBCTTTTEBEB
T ss_pred CCCCccccccCCC--CCCCccCCCCCeeE
Confidence 3777777432111 23446777766553
No 257
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=20.01 E-value=46 Score=16.80 Aligned_cols=11 Identities=18% Similarity=0.730 Sum_probs=7.2
Q ss_pred ccccccccccc
Q psy8622 40 LQCSVCWEQFT 50 (101)
Q Consensus 40 ~~C~IC~~~~~ 50 (101)
..|.+|.-.+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 35777776664
Done!