Query psy12165
Match_columns 537
No_of_seqs 273 out of 3007
Neff 10.8
Searched_HMMs 46136
Date Fri Aug 16 20:21:55 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12165.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12165hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3608|consensus 100.0 7.7E-29 1.7E-33 217.2 15.7 282 208-516 69-380 (467)
2 KOG3608|consensus 100.0 1.3E-28 2.9E-33 215.7 15.3 298 167-502 69-400 (467)
3 KOG1074|consensus 99.9 3.5E-26 7.5E-31 222.6 9.1 241 263-516 603-935 (958)
4 KOG2462|consensus 99.9 2.1E-26 4.6E-31 197.5 6.3 133 357-509 128-264 (279)
5 KOG2462|consensus 99.9 8.9E-26 1.9E-30 193.7 5.5 138 298-477 129-266 (279)
6 KOG1074|consensus 99.9 5.8E-25 1.3E-29 214.1 6.2 124 97-231 606-732 (958)
7 KOG3623|consensus 99.9 2.1E-23 4.6E-28 199.6 6.2 120 96-229 210-330 (1007)
8 KOG3623|consensus 99.9 5.4E-23 1.2E-27 196.9 4.5 108 265-380 210-330 (1007)
9 KOG3576|consensus 99.7 4.2E-17 9E-22 132.6 3.0 112 357-479 115-237 (267)
10 KOG3576|consensus 99.6 4.9E-17 1.1E-21 132.2 2.4 120 383-514 114-238 (267)
11 PLN03086 PRLI-interacting fact 99.2 4.5E-11 9.8E-16 117.3 9.5 145 329-512 408-564 (567)
12 PLN03086 PRLI-interacting fact 99.2 1E-10 2.2E-15 114.8 9.4 146 300-479 408-565 (567)
13 PHA00733 hypothetical protein 99.0 1.5E-09 3.3E-14 86.5 6.5 80 357-449 38-123 (128)
14 PHA00733 hypothetical protein 98.9 8.7E-10 1.9E-14 87.8 4.7 55 452-514 71-125 (128)
15 KOG3993|consensus 98.9 5.2E-10 1.1E-14 102.2 1.2 179 125-325 267-484 (500)
16 KOG1146|consensus 98.9 3.9E-10 8.4E-15 117.6 -0.2 115 389-516 1231-1356(1406)
17 KOG3993|consensus 98.7 3.2E-09 6.9E-14 97.2 0.3 149 359-513 267-483 (500)
18 PHA02768 hypothetical protein; 98.6 1.7E-08 3.7E-13 65.1 2.2 42 387-441 6-47 (55)
19 PHA02768 hypothetical protein; 98.6 2.7E-08 5.9E-13 64.2 1.7 43 359-402 5-47 (55)
20 PF05605 zf-Di19: Drought indu 98.3 9.8E-07 2.1E-11 58.8 4.6 51 455-512 3-53 (54)
21 PF13465 zf-H2C2_2: Zinc-finge 98.2 6.7E-07 1.5E-11 49.1 1.9 24 402-436 2-25 (26)
22 PF13465 zf-H2C2_2: Zinc-finge 98.2 5.1E-07 1.1E-11 49.6 1.3 24 441-465 2-25 (26)
23 PHA00616 hypothetical protein 98.2 7.5E-07 1.6E-11 54.5 1.7 29 488-516 1-29 (44)
24 PF05605 zf-Di19: Drought indu 98.1 4.8E-06 1E-10 55.4 4.7 51 425-478 2-53 (54)
25 PHA00732 hypothetical protein 98.1 2.3E-06 5E-11 61.4 3.3 37 426-465 2-38 (79)
26 PHA00732 hypothetical protein 98.1 3E-06 6.4E-11 60.8 3.2 44 386-445 1-44 (79)
27 PHA00616 hypothetical protein 98.1 1.9E-06 4.1E-11 52.8 1.5 25 387-411 2-26 (44)
28 PF00096 zf-C2H2: Zinc finger, 97.9 6.8E-06 1.5E-10 43.9 1.5 23 489-511 1-23 (23)
29 PF00096 zf-C2H2: Zinc finger, 97.7 2.3E-05 4.9E-10 41.8 1.7 23 11-33 1-23 (23)
30 PF13894 zf-C2H2_4: C2H2-type 97.6 3.1E-05 6.8E-10 41.8 1.7 24 489-512 1-24 (24)
31 PF12756 zf-C2H2_2: C2H2 type 97.6 3.6E-05 7.9E-10 59.5 2.0 73 427-511 1-73 (100)
32 PF13894 zf-C2H2_4: C2H2-type 97.6 5E-05 1.1E-09 41.0 1.9 24 11-34 1-24 (24)
33 PF13912 zf-C2H2_6: C2H2-type 97.5 6.8E-05 1.5E-09 41.8 1.6 25 10-34 1-25 (27)
34 PF12756 zf-C2H2_2: C2H2 type 97.4 9.3E-05 2E-09 57.2 2.7 75 98-192 1-75 (100)
35 PF13912 zf-C2H2_6: C2H2-type 97.4 6.1E-05 1.3E-09 42.0 1.2 26 488-513 1-26 (27)
36 KOG2231|consensus 97.3 0.0004 8.7E-09 70.0 6.6 22 22-47 126-147 (669)
37 COG5189 SFP1 Putative transcri 97.3 0.00014 3E-09 64.8 2.7 67 164-230 346-420 (423)
38 PF13909 zf-H2C2_5: C2H2-type 97.3 0.00016 3.5E-09 38.9 1.7 24 489-513 1-24 (24)
39 COG5189 SFP1 Putative transcri 97.3 0.00011 2.3E-09 65.4 1.4 63 383-445 346-418 (423)
40 KOG1146|consensus 97.3 0.00011 2.3E-09 78.3 1.5 59 455-513 735-794 (1406)
41 COG5236 Uncharacterized conser 97.2 0.00069 1.5E-08 61.1 5.9 139 208-382 151-305 (493)
42 KOG2231|consensus 97.1 0.00089 1.9E-08 67.6 6.3 114 209-352 116-236 (669)
43 COG5236 Uncharacterized conser 97.1 0.0011 2.3E-08 60.0 5.8 146 299-489 151-316 (493)
44 smart00355 ZnF_C2H2 zinc finge 96.9 0.00076 1.7E-08 36.9 2.3 24 489-512 1-24 (26)
45 KOG2785|consensus 96.8 0.004 8.7E-08 57.7 7.4 87 9-117 2-89 (390)
46 PF09237 GAGA: GAGA factor; I 96.6 0.0013 2.8E-08 41.3 1.9 31 486-516 22-52 (54)
47 PF13909 zf-H2C2_5: C2H2-type 96.6 0.0011 2.4E-08 35.5 1.3 24 11-35 1-24 (24)
48 KOG2482|consensus 96.5 0.0095 2.1E-07 54.0 7.3 192 312-514 128-362 (423)
49 PF12874 zf-met: Zinc-finger o 96.5 0.0016 3.4E-08 35.4 1.5 23 11-33 1-23 (25)
50 PF12874 zf-met: Zinc-finger o 96.4 0.0016 3.5E-08 35.3 1.2 23 489-511 1-23 (25)
51 PF09237 GAGA: GAGA factor; I 96.3 0.002 4.4E-08 40.5 1.5 29 383-411 21-49 (54)
52 smart00355 ZnF_C2H2 zinc finge 96.2 0.0044 9.5E-08 33.8 2.4 22 388-409 2-23 (26)
53 PF12171 zf-C2H2_jaz: Zinc-fin 96.2 0.0022 4.7E-08 35.6 1.1 22 11-32 2-23 (27)
54 KOG2482|consensus 95.9 0.024 5.3E-07 51.5 6.6 25 166-190 194-218 (423)
55 KOG4173|consensus 95.6 0.0076 1.6E-07 50.4 2.2 93 164-291 76-170 (253)
56 KOG2785|consensus 95.6 0.02 4.3E-07 53.3 5.0 28 163-190 64-91 (390)
57 PF12171 zf-C2H2_jaz: Zinc-fin 95.4 0.0056 1.2E-07 33.9 0.5 22 489-510 2-23 (27)
58 PRK04860 hypothetical protein; 94.2 0.03 6.4E-07 46.6 2.1 13 423-435 141-153 (160)
59 PRK04860 hypothetical protein; 93.3 0.035 7.6E-07 46.2 0.9 39 424-467 118-156 (160)
60 COG4049 Uncharacterized protei 93.2 0.041 9E-07 35.2 1.0 33 3-35 10-42 (65)
61 PF13913 zf-C2HC_2: zinc-finge 93.0 0.076 1.6E-06 28.6 1.7 21 11-32 3-23 (25)
62 PF13913 zf-C2HC_2: zinc-finge 93.0 0.095 2.1E-06 28.2 2.1 21 489-510 3-23 (25)
63 smart00451 ZnF_U1 U1-like zinc 92.7 0.07 1.5E-06 31.6 1.5 25 9-33 2-26 (35)
64 smart00451 ZnF_U1 U1-like zinc 92.2 0.094 2E-06 31.0 1.6 22 488-509 3-24 (35)
65 KOG4173|consensus 92.1 0.087 1.9E-06 44.3 1.8 91 263-382 77-170 (253)
66 PF12013 DUF3505: Protein of u 90.0 0.56 1.2E-05 36.6 4.4 25 489-513 81-109 (109)
67 KOG2893|consensus 89.7 0.12 2.5E-06 44.6 0.4 46 428-478 13-58 (341)
68 cd00350 rubredoxin_like Rubred 88.9 0.25 5.4E-06 28.7 1.3 25 96-133 1-25 (33)
69 PF12013 DUF3505: Protein of u 88.0 0.8 1.7E-05 35.7 4.0 27 207-233 79-109 (109)
70 KOG2893|consensus 87.3 0.15 3.2E-06 44.0 -0.5 48 169-233 12-59 (341)
71 COG5048 FOG: Zn-finger [Genera 86.3 0.17 3.7E-06 51.1 -0.8 144 358-516 288-446 (467)
72 COG4049 Uncharacterized protei 86.2 0.47 1E-05 30.5 1.4 30 163-192 13-42 (65)
73 TIGR00622 ssl1 transcription f 83.8 3.6 7.7E-05 31.7 5.5 21 454-474 81-101 (112)
74 TIGR00622 ssl1 transcription f 83.4 2.6 5.7E-05 32.4 4.6 21 425-445 81-101 (112)
75 COG5048 FOG: Zn-finger [Genera 80.6 0.45 9.8E-06 48.0 -0.4 61 265-328 386-446 (467)
76 PF13719 zinc_ribbon_5: zinc-r 79.2 1.3 2.9E-05 26.5 1.5 34 97-136 3-36 (37)
77 PF13717 zinc_ribbon_4: zinc-r 77.6 1.8 3.9E-05 25.8 1.7 34 97-136 3-36 (36)
78 PF06524 NOA36: NOA36 protein; 77.3 0.82 1.8E-05 40.3 0.2 48 358-410 181-233 (314)
79 cd00729 rubredoxin_SM Rubredox 77.1 1.4 3E-05 25.8 1.1 9 360-368 3-11 (34)
80 PF02892 zf-BED: BED zinc fing 75.7 2 4.4E-05 27.0 1.7 28 7-34 13-44 (45)
81 TIGR02098 MJ0042_CXXC MJ0042 f 74.1 1.9 4E-05 26.0 1.2 34 97-136 3-36 (38)
82 COG2888 Predicted Zn-ribbon RN 73.9 2.9 6.2E-05 27.7 2.0 9 425-433 50-58 (61)
83 PF09538 FYDLN_acid: Protein o 72.6 2.2 4.7E-05 32.9 1.4 12 425-436 26-37 (108)
84 PF09538 FYDLN_acid: Protein o 71.7 2 4.4E-05 33.1 1.1 30 126-180 10-39 (108)
85 PF09986 DUF2225: Uncharacteri 71.7 1.1 2.4E-05 39.7 -0.3 14 167-180 48-61 (214)
86 PF12907 zf-met2: Zinc-binding 71.5 2.1 4.5E-05 26.1 0.9 32 97-128 2-36 (40)
87 PF09986 DUF2225: Uncharacteri 71.2 1.2 2.6E-05 39.6 -0.3 23 385-407 4-26 (214)
88 smart00734 ZnF_Rad18 Rad18-lik 69.5 4 8.6E-05 22.2 1.6 21 11-32 2-22 (26)
89 PF10571 UPF0547: Uncharacteri 68.7 3 6.5E-05 22.7 1.1 9 361-369 16-24 (26)
90 COG1592 Rubrerythrin [Energy p 68.5 2.8 6E-05 35.1 1.3 11 359-369 134-144 (166)
91 KOG2186|consensus 68.4 2.9 6.2E-05 37.0 1.5 46 426-475 4-49 (276)
92 PRK00398 rpoP DNA-directed RNA 67.5 2.9 6.4E-05 26.5 1.0 11 425-435 3-13 (46)
93 smart00614 ZnF_BED BED zinc fi 67.3 3.8 8.2E-05 26.5 1.5 24 489-512 19-48 (50)
94 KOG2186|consensus 66.7 3.1 6.8E-05 36.7 1.4 46 455-509 4-49 (276)
95 PRK14890 putative Zn-ribbon RN 65.3 3.6 7.8E-05 27.4 1.1 10 453-462 47-56 (59)
96 smart00659 RPOLCX RNA polymera 63.4 4.8 0.0001 25.2 1.4 27 96-134 2-28 (44)
97 PF12907 zf-met2: Zinc-binding 63.4 4.1 8.8E-05 24.8 1.0 13 467-479 17-29 (40)
98 PHA00626 hypothetical protein 63.0 2 4.3E-05 27.9 -0.4 11 425-435 23-33 (59)
99 COG1996 RPC10 DNA-directed RNA 62.6 3.4 7.3E-05 26.4 0.6 10 387-396 7-16 (49)
100 COG1592 Rubrerythrin [Energy p 62.0 4.1 8.9E-05 34.1 1.2 24 125-175 134-157 (166)
101 TIGR00373 conserved hypothetic 60.0 4.9 0.00011 33.7 1.3 31 95-135 108-138 (158)
102 PF09723 Zn-ribbon_8: Zinc rib 59.6 3.2 7E-05 25.7 0.1 30 96-133 5-34 (42)
103 KOG2807|consensus 59.2 17 0.00036 33.7 4.5 35 357-404 274-308 (378)
104 smart00531 TFIIE Transcription 56.6 7.3 0.00016 32.2 1.8 36 95-135 98-133 (147)
105 PRK06266 transcription initiat 56.5 5.3 0.00011 34.2 1.0 32 95-136 116-147 (178)
106 PF03604 DNA_RNApol_7kD: DNA d 55.6 6.5 0.00014 22.6 0.9 9 425-433 17-25 (32)
107 TIGR02300 FYDLN_acid conserved 54.9 7.7 0.00017 30.5 1.5 11 425-435 26-36 (129)
108 PF06524 NOA36: NOA36 protein; 54.7 5 0.00011 35.6 0.5 48 96-148 182-232 (314)
109 KOG1280|consensus 54.4 12 0.00025 35.0 2.8 37 299-335 79-116 (381)
110 COG3357 Predicted transcriptio 51.5 7.8 0.00017 28.2 1.0 14 385-398 57-70 (97)
111 smart00531 TFIIE Transcription 50.9 9.6 0.00021 31.5 1.6 37 384-435 97-133 (147)
112 TIGR02300 FYDLN_acid conserved 50.9 8.4 0.00018 30.3 1.2 33 126-183 10-42 (129)
113 KOG4377|consensus 50.7 6.6 0.00014 37.4 0.7 142 126-293 272-429 (480)
114 PRK06266 transcription initiat 50.5 8.7 0.00019 32.9 1.3 17 386-402 117-133 (178)
115 PF04959 ARS2: Arsenite-resist 50.4 11 0.00024 33.2 2.0 25 455-479 78-102 (214)
116 COG5151 SSL1 RNA polymerase II 50.4 8 0.00017 35.3 1.1 47 426-475 363-409 (421)
117 smart00834 CxxC_CXXC_SSSS Puta 50.2 3.7 8E-05 25.1 -0.7 10 387-396 6-15 (41)
118 PF15269 zf-C2H2_7: Zinc-finge 50.0 17 0.00036 22.4 2.1 22 488-509 20-41 (54)
119 TIGR02605 CxxC_CxxC_SSSS putat 49.6 4.7 0.0001 26.3 -0.3 11 360-370 6-16 (52)
120 TIGR00373 conserved hypothetic 47.1 19 0.00041 30.2 2.8 17 386-402 109-125 (158)
121 COG1997 RPL43A Ribosomal prote 46.3 11 0.00024 27.3 1.1 16 164-179 50-65 (89)
122 KOG1280|consensus 45.7 18 0.00039 33.8 2.6 37 96-132 79-116 (381)
123 PF04959 ARS2: Arsenite-resist 43.9 13 0.00028 32.8 1.4 29 423-451 75-103 (214)
124 PF05443 ROS_MUCR: ROS/MUCR tr 43.8 12 0.00027 30.0 1.2 33 7-42 69-101 (132)
125 PRK00464 nrdR transcriptional 43.5 5.4 0.00012 33.1 -0.9 12 426-437 29-40 (154)
126 PF04780 DUF629: Protein of un 43.4 15 0.00032 36.6 1.8 28 10-37 57-84 (466)
127 PF05443 ROS_MUCR: ROS/MUCR tr 42.9 15 0.00033 29.5 1.5 27 487-516 71-97 (132)
128 KOG2593|consensus 42.3 14 0.00031 35.8 1.5 36 95-134 127-162 (436)
129 KOG2807|consensus 39.9 39 0.00084 31.5 3.7 23 165-187 288-310 (378)
130 PF04780 DUF629: Protein of un 39.5 27 0.00059 34.8 3.0 28 453-480 56-83 (466)
131 PF07754 DUF1610: Domain of un 39.2 14 0.00031 19.6 0.6 9 425-433 16-24 (24)
132 COG1198 PriA Primosomal protei 38.7 21 0.00046 38.0 2.2 12 423-434 473-484 (730)
133 COG5151 SSL1 RNA polymerase II 38.5 23 0.00049 32.5 2.1 23 423-445 386-408 (421)
134 PF07649 C1_3: C1-like domain; 37.9 20 0.00044 20.1 1.2 13 165-177 13-25 (30)
135 KOG4167|consensus 37.4 18 0.0004 37.4 1.5 33 201-233 785-817 (907)
136 PF07800 DUF1644: Protein of u 36.5 1.6E+02 0.0034 24.5 6.3 54 424-480 79-134 (162)
137 PF14353 CpXC: CpXC protein 36.2 25 0.00055 28.2 1.9 18 387-404 39-56 (128)
138 PF09845 DUF2072: Zn-ribbon co 35.6 20 0.00044 28.5 1.2 26 359-394 1-27 (131)
139 TIGR00595 priA primosomal prot 35.1 24 0.00052 36.2 2.0 9 425-433 253-261 (505)
140 PF13451 zf-trcl: Probable zin 34.4 25 0.00054 22.6 1.2 37 424-460 3-39 (49)
141 KOG4167|consensus 34.2 13 0.00029 38.4 -0.0 28 7-34 789-816 (907)
142 TIGR01384 TFS_arch transcripti 34.0 4.2 9.1E-05 31.3 -2.9 12 208-219 90-101 (104)
143 PF06220 zf-U1: U1 zinc finger 33.9 29 0.00062 20.9 1.4 23 298-320 2-26 (38)
144 COG5188 PRP9 Splicing factor 3 33.7 63 0.0014 30.4 4.1 22 299-320 238-259 (470)
145 COG1198 PriA Primosomal protei 33.4 26 0.00057 37.3 1.9 40 300-368 445-484 (730)
146 PRK09678 DNA-binding transcrip 32.4 13 0.00027 26.3 -0.4 20 164-183 24-45 (72)
147 PRK04023 DNA polymerase II lar 32.0 47 0.001 36.3 3.5 11 425-435 663-673 (1121)
148 PRK03824 hypA hydrogenase nick 30.9 23 0.0005 28.7 0.9 13 164-176 104-116 (135)
149 PF01780 Ribosomal_L37ae: Ribo 30.7 15 0.00033 27.1 -0.2 15 164-178 50-64 (90)
150 KOG2593|consensus 30.2 40 0.00086 32.9 2.4 33 425-461 128-160 (436)
151 KOG3408|consensus 30.0 40 0.00086 26.3 1.9 27 164-190 54-80 (129)
152 PF08274 PhnA_Zn_Ribbon: PhnA 29.3 19 0.00041 20.4 0.1 8 425-432 19-26 (30)
153 KOG2272|consensus 29.1 28 0.0006 30.9 1.1 14 329-342 100-113 (332)
154 PRK04023 DNA polymerase II lar 28.8 47 0.001 36.3 2.9 24 358-396 650-673 (1121)
155 PRK14873 primosome assembly pr 27.4 35 0.00076 36.3 1.7 10 425-434 422-431 (665)
156 PF14446 Prok-RING_1: Prokaryo 26.8 46 0.001 21.9 1.5 27 125-177 5-31 (54)
157 PTZ00255 60S ribosomal protein 26.7 31 0.00066 25.5 0.8 16 164-179 51-66 (90)
158 KOG0717|consensus 26.7 35 0.00076 33.5 1.4 23 11-33 293-315 (508)
159 PF13240 zinc_ribbon_2: zinc-r 26.0 35 0.00075 17.9 0.7 6 128-133 16-21 (23)
160 COG3364 Zn-ribbon containing p 25.9 43 0.00093 25.1 1.4 27 359-394 2-28 (112)
161 KOG3408|consensus 25.6 68 0.0015 25.0 2.5 28 202-229 51-78 (129)
162 TIGR00280 L37a ribosomal prote 25.4 29 0.00064 25.7 0.5 16 164-179 50-65 (91)
163 PRK14873 primosome assembly pr 25.3 40 0.00088 35.8 1.7 28 322-368 404-431 (665)
164 TIGR00595 priA primosomal prot 25.0 47 0.001 34.1 2.1 39 300-367 223-261 (505)
165 TIGR01206 lysW lysine biosynth 24.6 20 0.00044 23.6 -0.4 32 96-136 2-33 (54)
166 COG1594 RPB9 DNA-directed RNA 24.5 9.9 0.00022 29.7 -2.2 40 425-465 72-111 (113)
167 COG1998 RPS31 Ribosomal protei 24.2 39 0.00084 21.6 0.8 26 426-463 20-46 (51)
168 COG1773 Rubredoxin [Energy pro 23.7 30 0.00064 22.9 0.2 14 487-500 2-15 (55)
169 PRK14714 DNA polymerase II lar 22.9 64 0.0014 36.4 2.6 10 386-395 692-701 (1337)
170 PRK14714 DNA polymerase II lar 22.9 78 0.0017 35.8 3.2 11 425-435 692-702 (1337)
171 KOG4124|consensus 22.7 14 0.00031 34.4 -1.8 26 263-288 347-372 (442)
172 PRK05580 primosome assembly pr 22.6 51 0.0011 35.4 1.8 46 329-432 382-428 (679)
173 KOG1842|consensus 22.4 48 0.001 32.4 1.4 29 9-37 14-42 (505)
174 PF08790 zf-LYAR: LYAR-type C2 22.1 20 0.00042 19.9 -0.7 17 361-378 2-18 (28)
175 smart00154 ZnF_AN1 AN1-like Zi 21.9 47 0.001 20.1 0.8 13 425-437 12-24 (39)
176 PF13821 DUF4187: Domain of un 21.8 1E+02 0.0022 20.5 2.4 18 455-472 28-45 (55)
177 PF05290 Baculo_IE-1: Baculovi 21.5 54 0.0012 26.2 1.3 55 423-502 78-135 (140)
178 PRK00432 30S ribosomal protein 21.1 38 0.00082 21.9 0.3 12 453-464 36-47 (50)
179 PF13878 zf-C2H2_3: zinc-finge 20.6 97 0.0021 19.0 2.0 25 488-512 13-39 (41)
180 COG4530 Uncharacterized protei 20.5 54 0.0012 25.0 1.0 12 424-435 25-36 (129)
181 PRK03976 rpl37ae 50S ribosomal 20.5 40 0.00086 25.0 0.4 16 164-179 51-66 (90)
182 KOG1842|consensus 20.2 53 0.0011 32.1 1.2 29 424-452 14-42 (505)
No 1
>KOG3608|consensus
Probab=99.96 E-value=7.7e-29 Score=217.20 Aligned_cols=282 Identities=19% Similarity=0.389 Sum_probs=227.4
Q ss_pred cccccc--chhccCC-HHHHHHHHHhccCcc--------------ccccccCCCCcccHHHHhhhhhhccCCCCCccccc
Q psy12165 208 IFRCKK--CVKVFVN-PDIYKKHLKHSHTKA--------------YCEVCITAPGYETKVELLHHMKTEHKDNEEYKCLI 270 (537)
Q Consensus 208 ~~~C~~--C~~~f~~-~~~l~~H~~~~h~~~--------------~c~~c~~~~~~~~~~~l~~H~~~~h~~~~~~~C~~ 270 (537)
.+.|.. |++...+ ..+|.+|.-.|--.. .-..|... |.....+ -.....|.|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~--f~~~~~i-------p~~g~~f~C~W 139 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAP--FRLMEKI-------PALGQNFRCGW 139 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCC--cchhhcc-------ccchhhhccCh
Confidence 577754 8888777 588999875442100 01112111 2221111 11234489999
Q ss_pred ccccccccCHHHHHHHHHhcccc--------CC-CCCCcccc--cchhcccCcHHHHHHHhhcCCCCCccccCCcccccC
Q psy12165 271 PGCKKYFRTKLDCQKHCREHKTV--------TS-RCPTNYCN--FCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVED 339 (537)
Q Consensus 271 ~~C~~~f~~~~~l~~H~~~h~~~--------~~-~~~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~ 339 (537)
+.|+..|.+..++..|+..|..- .. ..+.+.|. .|...+.++++|++|++ +|++++...|+.|+..|+
T Consensus 140 edCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r-~Hs~eKvvACp~Cg~~F~ 218 (467)
T KOG3608|consen 140 EDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR-THSNEKVVACPHCGELFR 218 (467)
T ss_pred hhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH-hcCCCeEEecchHHHHhc
Confidence 99999999999999998877422 01 11456785 69999999999999999 899999999999999999
Q ss_pred CHHHHHHHHHhhccccCCCccccccccccccChhHHHHhHhhhcCCccccccccccCCChHHHHHHHHhccCCCCCccCC
Q psy12165 340 NLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSEHIKLHGEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQK 419 (537)
Q Consensus 340 ~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~ 419 (537)
+...|..|+++.... ...+|.|..|.+.|+++..|..|+..|-. -|+|+.|+......++|.+||+.-+..
T Consensus 219 ~~tkl~DH~rRqt~l-~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-~ykCplCdmtc~~~ssL~~H~r~rHs~------- 289 (467)
T KOG3608|consen 219 TKTKLFDHLRRQTEL-NTNSFQCAQCFKRFATEKLLKSHVVRHVN-CYKCPLCDMTCSSASSLTTHIRYRHSK------- 289 (467)
T ss_pred cccHHHHHHHhhhhh-cCCchHHHHHHHHHhHHHHHHHHHHHhhh-cccccccccCCCChHHHHHHHHhhhcc-------
Confidence 999999999975543 35799999999999999999999999832 589999999999999999999965443
Q ss_pred CCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCcccccc--cccccCCHHHHHHHHHhhCCCCcccccCCccccccccc
Q psy12165 420 PRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGV--CLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRY 497 (537)
Q Consensus 420 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~ 497 (537)
.+||+|+.|++.|.+.++|.+|+ .+|+ +..|+|.. |.+++.+...|++|++.+|.| ...-+|.|-.|++.
T Consensus 290 ---dkpfKCd~Cd~~c~~esdL~kH~-~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg---~np~~Y~CH~Cdr~ 361 (467)
T KOG3608|consen 290 ---DKPFKCDECDTRCVRESDLAKHV-QVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEG---NNPILYACHCCDRF 361 (467)
T ss_pred ---CCCccccchhhhhccHHHHHHHH-Hhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccC---CCCCceeeecchhh
Confidence 78999999999999999999998 6998 77899999 999999999999999999976 23467999999999
Q ss_pred ccChhHHHHHHhhhhCCCc
Q psy12165 498 HNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 498 f~~~~~l~~H~~~h~g~~~ 516 (537)
|++..+|..|++.-+|.+.
T Consensus 362 ft~G~~L~~HL~kkH~f~~ 380 (467)
T KOG3608|consen 362 FTSGKSLSAHLMKKHGFRL 380 (467)
T ss_pred hccchhHHHHHHHhhcccC
Confidence 9999999999877776543
No 2
>KOG3608|consensus
Probab=99.96 E-value=1.3e-28 Score=215.71 Aligned_cols=298 Identities=21% Similarity=0.360 Sum_probs=240.2
Q ss_pred eeeccc--chhhccC-hHHHHHHHhhcCCCC----------------CCC-----CCCCCCCCCCccccc--cchhccCC
Q psy12165 167 VHICHL--CQKQFKQ-NWLLQKHKKTCKGPK----------------RPT-----SPLPEPDEDGIFRCK--KCVKVFVN 220 (537)
Q Consensus 167 ~~~C~~--C~~~f~~-~~~l~~H~~~h~~~~----------------~~~-----~~~~~~~~~~~~~C~--~C~~~f~~ 220 (537)
.++|.. |++...+ ...|.+|...|.-.. .+- .-...+.....|.|. .|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 577864 8887777 589999988873110 000 001123444567885 59999999
Q ss_pred HHHHHHHHHhccCccccccccCCCCcccHHHHhhhhhhccCCC-CCcccccccccccccCHHHHHHHHHhccccCCCCCC
Q psy12165 221 PDIYKKHLKHSHTKAYCEVCITAPGYETKVELLHHMKTEHKDN-EEYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPT 299 (537)
Q Consensus 221 ~~~l~~H~~~~h~~~~c~~c~~~~~~~~~~~l~~H~~~~h~~~-~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~ 299 (537)
..++..|+-.|... |++ + +...-..+ ..+.|.+..|-..+.++..|+.|++.|++++ .
T Consensus 149 ~~ef~dHV~~H~l~--cey-----------d----~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eK----v 207 (467)
T KOG3608|consen 149 IVEFQDHVVKHALF--CEY-----------D----IQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEK----V 207 (467)
T ss_pred HHHHHHHHHHhhhh--hhh-----------h----hhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCe----E
Confidence 99999998755421 100 0 11111222 3578999999999999999999999999886 8
Q ss_pred cccccchhcccCcHHHHHHHhhcC-CCCCccccCCcccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHHHh
Q psy12165 300 NYCNFCADLFTNRKKLWLHLKSTH-KDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSEH 378 (537)
Q Consensus 300 ~~C~~C~~~f~~~~~l~~H~~~~H-~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H 378 (537)
..|+.|+..|.++..|.-|++..- .+..+|.|..|.+.|.+...|..|+..+- .-|+|+.|..+...+++|..|
T Consensus 208 vACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv-----n~ykCplCdmtc~~~ssL~~H 282 (467)
T KOG3608|consen 208 VACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV-----NCYKCPLCDMTCSSASSLTTH 282 (467)
T ss_pred EecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh-----hcccccccccCCCChHHHHHH
Confidence 889999999999999999998433 24558999999999999999999999754 559999999999999999999
Q ss_pred Hh-hh-cCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCC--CCCCCCChHHHHHHHHhhcCCC--
Q psy12165 379 IK-LH-GEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDL--CSKEWPSRKFMWQHLIRSHKAV-- 452 (537)
Q Consensus 379 ~~-~H-~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~-- 452 (537)
++ .| ..+||+|+.|++.|.+.+.|.+|+..|.. ..|+|+. |.+++.+...|++|++.+|.|.
T Consensus 283 ~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS~------------~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np 350 (467)
T KOG3608|consen 283 IRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHSK------------TVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNP 350 (467)
T ss_pred HHhhhccCCCccccchhhhhccHHHHHHHHHhccc------------cceecCCCCCcHHHHHHHHHHHHHHHhccCCCC
Confidence 99 67 88999999999999999999999998885 4799999 9999999999999999999654
Q ss_pred CcccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChh
Q psy12165 453 AGLACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARP 502 (537)
Q Consensus 453 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~ 502 (537)
.+|.|..|++.|++..+|.+|+++.|.-..+.+-+.|.=.+|.-.|.+..
T Consensus 351 ~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~mRLq 400 (467)
T KOG3608|consen 351 ILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFMRLQ 400 (467)
T ss_pred CceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCceeeee
Confidence 45999999999999999999999999887777777888778877776543
No 3
>KOG1074|consensus
Probab=99.93 E-value=3.5e-26 Score=222.60 Aligned_cols=241 Identities=21% Similarity=0.349 Sum_probs=156.6
Q ss_pred CCCcccccccccccccCHHHHHHHHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCCCC----ccccC---Ccc
Q psy12165 263 NEEYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKDLT----NCMCS---ACF 335 (537)
Q Consensus 263 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~----~~~C~---~C~ 335 (537)
..+.+|-+ |.++.+-.+.|+.|.+.|.|+. +|+|.+|++.|.++.+|+.||- +|.... .+.|+ +|.
T Consensus 603 TdPNqCii--C~rVlSC~saLqmHyrtHtGER----PFkCKiCgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~ 675 (958)
T KOG1074|consen 603 TDPNQCII--CLRVLSCPSALQMHYRTHTGER----PFKCKICGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQ 675 (958)
T ss_pred CCccceee--eeecccchhhhhhhhhcccCcC----ccccccccchhccccchhhccc-ccccCccccccccCCchhhhc
Confidence 34677877 8888888888888888888875 8888888888888888888887 665543 35677 777
Q ss_pred cccCCHHHHHHHHHhhccccC----------CCccccccccccccChhHHHHhHhhh-c----------------CC---
Q psy12165 336 VVEDNLSALEAHVAAKHKAII----------KKPYACRICARLLGNGVKVSEHIKLH-G----------------EH--- 385 (537)
Q Consensus 336 ~~~~~~~~l~~H~~~~h~~~~----------~~~~~C~~C~~~f~~~~~l~~H~~~H-~----------------~~--- 385 (537)
..|.+.-.|..|++.|-...+ ...-+|..|.+.|.....+..++..| + +.
T Consensus 676 ~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~t 755 (958)
T KOG1074|consen 676 KKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVT 755 (958)
T ss_pred ccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccC
Confidence 777777777777776442211 11245777777776666666665544 1 11
Q ss_pred -ccccccccccCCChHHHHHHHHhccCCCC------------CccCCCCCCCCc-ccCCCCCCCCChHHHHHHHHh----
Q psy12165 386 -LVNCRVCGSIFHSEPELKVHLEEHTDVPE------------TKEQKPRPELPV-TCDLCSKEWPSRKFMWQHLIR---- 447 (537)
Q Consensus 386 -~~~C~~C~~~f~~~~~L~~H~~~h~~~~~------------~~~~~~~~~~~~-~C~~C~~~f~~~~~l~~H~~~---- 447 (537)
+..+..|+..+.....+..+-..+..... ........+++. .+..++......-.+.--+..
T Consensus 756 p~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~ 835 (958)
T KOG1074|consen 756 PPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETS 835 (958)
T ss_pred CCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccc
Confidence 34566677776666666665332210000 011112334444 555555433222111100000
Q ss_pred -------------hcCC------------------------CCcccccccccccCCHHHHHHHHHhhCCCCcccccCCcc
Q psy12165 448 -------------SHKA------------------------VAGLACGVCLKISETYEALALHLDQAHPGYFAEEVDNPT 490 (537)
Q Consensus 448 -------------~H~~------------------------~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~ 490 (537)
.+.+ .....|.+|++.|.+..+|+.|++ .|++ ++||.
T Consensus 836 ~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~r-THtg-----~KPF~ 909 (958)
T KOG1074|consen 836 MLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMR-THTG-----PKPFF 909 (958)
T ss_pred ccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhh-cCCC-----CCCcc
Confidence 0000 012679999999999999999999 8988 89999
Q ss_pred cccccccccChhHHHHHHhhhhCCCc
Q psy12165 491 CEVCGRYHNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 491 C~~C~~~f~~~~~l~~H~~~h~g~~~ 516 (537)
|.+|++.|+.+.+|+.||.+|....+
T Consensus 910 C~fC~~aFttrgnLKvHMgtH~w~q~ 935 (958)
T KOG1074|consen 910 CHFCEEAFTTRGNLKVHMGTHMWVQP 935 (958)
T ss_pred chhhhhhhhhhhhhhhhhccccccCC
Confidence 99999999999999999999998654
No 4
>KOG2462|consensus
Probab=99.93 E-value=2.1e-26 Score=197.48 Aligned_cols=133 Identities=26% Similarity=0.607 Sum_probs=113.4
Q ss_pred CCccccccccccccChhHHHHhHhhh----cCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCC
Q psy12165 357 KKPYACRICARLLGNGVKVSEHIKLH----GEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCS 432 (537)
Q Consensus 357 ~~~~~C~~C~~~f~~~~~l~~H~~~H----~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~ 432 (537)
...|+|+.|++.+.+.++|.+|.++| ..+.+.|.+||+.|.+...|+.|+++|+- +++|.+||
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l-------------~c~C~iCG 194 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTL-------------PCECGICG 194 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCC-------------Cccccccc
Confidence 45688888888888888888888888 56678888888888888888888888886 78888888
Q ss_pred CCCCChHHHHHHHHhhcCCCCcccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHh
Q psy12165 433 KEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLA 509 (537)
Q Consensus 433 ~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 509 (537)
+.|.....|+-|+ +.|+||+||.|+.|++.|..+.+|+.||. .|++ .++|+|..|++.|..++-|..|..
T Consensus 195 KaFSRPWLLQGHi-RTHTGEKPF~C~hC~kAFADRSNLRAHmQ-THS~-----~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 195 KAFSRPWLLQGHI-RTHTGEKPFSCPHCGKAFADRSNLRAHMQ-THSD-----VKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred ccccchHHhhccc-ccccCCCCccCCcccchhcchHHHHHHHH-hhcC-----CccccCcchhhHHHHHHHHHHhhh
Confidence 8888888888887 88888888888888888888888888887 7777 788888888888888888888864
No 5
>KOG2462|consensus
Probab=99.92 E-value=8.9e-26 Score=193.67 Aligned_cols=138 Identities=27% Similarity=0.515 Sum_probs=106.5
Q ss_pred CCcccccchhcccCcHHHHHHHhhcCCCCCccccCCcccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHHH
Q psy12165 298 PTNYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSE 377 (537)
Q Consensus 298 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~ 377 (537)
..|+|+.|++.+.+.++|.+|.+ .|.. ....+.+.|++|++++.+-..|..
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ-~H~~----------------------------~~s~ka~~C~~C~K~YvSmpALkM 179 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQ-THRS----------------------------LDSKKAFSCKYCGKVYVSMPALKM 179 (279)
T ss_pred Cceeccccccccccccccchhhc-cccc----------------------------ccccccccCCCCCceeeehHHHhh
Confidence 36777777777777777777776 4433 222577788888888888888888
Q ss_pred hHhhhcCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCcccc
Q psy12165 378 HIKLHGEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLAC 457 (537)
Q Consensus 378 H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C 457 (537)
|+++|. -+.+|.+||+.|.....|+-|+|+|+| |+||.|+.|+++|.++++|+.|+ .+|.+.++|.|
T Consensus 180 HirTH~-l~c~C~iCGKaFSRPWLLQGHiRTHTG-----------EKPF~C~hC~kAFADRSNLRAHm-QTHS~~K~~qC 246 (279)
T KOG2462|consen 180 HIRTHT-LPCECGICGKAFSRPWLLQGHIRTHTG-----------EKPFSCPHCGKAFADRSNLRAHM-QTHSDVKKHQC 246 (279)
T ss_pred HhhccC-CCcccccccccccchHHhhcccccccC-----------CCCccCCcccchhcchHHHHHHH-HhhcCCccccC
Confidence 888875 467788888888888888888888888 77888888888888888888887 78888888888
Q ss_pred cccccccCCHHHHHHHHHhh
Q psy12165 458 GVCLKISETYEALALHLDQA 477 (537)
Q Consensus 458 ~~C~~~f~~~~~l~~H~~~~ 477 (537)
..|+++|..+.-|.+|...-
T Consensus 247 ~~C~KsFsl~SyLnKH~ES~ 266 (279)
T KOG2462|consen 247 PRCGKSFALKSYLNKHSESA 266 (279)
T ss_pred cchhhHHHHHHHHHHhhhhc
Confidence 88888888888888887643
No 6
>KOG1074|consensus
Probab=99.91 E-value=5.8e-25 Score=214.12 Aligned_cols=124 Identities=22% Similarity=0.464 Sum_probs=102.2
Q ss_pred cccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecc---cc
Q psy12165 97 YVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICH---LC 173 (537)
Q Consensus 97 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~---~C 173 (537)
-.|-+|.++..-.++|+.|+ +.|+|++||+|.+|++.|.++.+|+.||-.|.... .-+..+.|+ +|
T Consensus 606 NqCiiC~rVlSC~saLqmHy-rtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p----------~~R~q~ScP~~~ic 674 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHY-RTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKP----------PARVQFSCPSTFIC 674 (958)
T ss_pred cceeeeeecccchhhhhhhh-hcccCcCccccccccchhccccchhhcccccccCc----------cccccccCCchhhh
Confidence 58999999999999999999 78999999999999999999999999999986321 134679999 99
Q ss_pred hhhccChHHHHHHHhhcCCCCCCCCCCCCCCCCCccccccchhccCCHHHHHHHHHhc
Q psy12165 174 QKQFKQNWLLQKHKKTCKGPKRPTSPLPEPDEDGIFRCKKCVKVFVNPDIYKKHLKHS 231 (537)
Q Consensus 174 ~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~ 231 (537)
-+.|.+...|..|+++|.+...+....+.......-+|..|.+.|.+...+..++-.+
T Consensus 675 ~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~ 732 (958)
T KOG1074|consen 675 QKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQ 732 (958)
T ss_pred cccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhcc
Confidence 9999999999999999996554433332222222357999999999999888887654
No 7
>KOG3623|consensus
Probab=99.88 E-value=2.1e-23 Score=199.62 Aligned_cols=120 Identities=28% Similarity=0.530 Sum_probs=102.0
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCC-CCCccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccch
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKG-HPDVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQ 174 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~ 174 (537)
..+|++|+..+.....|+.|++-.|-. +..|.|..|.+.|..+..|.+||.+|....... .........+.|+|..|+
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa-~sltqsa~lRKFKCtECg 288 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQA-ISLTQSALLRKFKCTECG 288 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCccc-ccccchhhhccccccccc
Confidence 368999999999999999999877754 567999999999999999999999996432211 111122356789999999
Q ss_pred hhccChHHHHHHHhhcCCCCCCCCCCCCCCCCCccccccchhccCCHHHHHHHHH
Q psy12165 175 KQFKQNWLLQKHKKTCKGPKRPTSPLPEPDEDGIFRCKKCVKVFVNPDIYKKHLK 229 (537)
Q Consensus 175 ~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 229 (537)
+.|+.+..|+.|+++|.|++ ||.|+.|++.|+....+-.||-
T Consensus 289 KAFKfKHHLKEHlRIHSGEK-------------PfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 289 KAFKFKHHLKEHLRIHSGEK-------------PFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhhHHHHHhhheeecCCC-------------CcCCcccccccccCCccccccc
Confidence 99999999999999999999 8999999999999999999974
No 8
>KOG3623|consensus
Probab=99.87 E-value=5.4e-23 Score=196.90 Aligned_cols=108 Identities=21% Similarity=0.453 Sum_probs=85.8
Q ss_pred CcccccccccccccCHHHHHHHHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCC-------------CCcccc
Q psy12165 265 EYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKD-------------LTNCMC 331 (537)
Q Consensus 265 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~-------------~~~~~C 331 (537)
..+|.+ |++.+.....|..|+...+.... +.|-|.+|.++|..+..|.+||. .|.. .+.|+|
T Consensus 210 lltcpy--cdrgykrltslkeHikyrhekne--~nfsC~lCsytFAyRtQLErhm~-~hkpg~dqa~sltqsa~lRKFKC 284 (1007)
T KOG3623|consen 210 LLTCPY--CDRGYKRLTSLKEHIKYRHEKNE--PNFSCMLCSYTFAYRTQLERHMQ-LHKPGGDQAISLTQSALLRKFKC 284 (1007)
T ss_pred hhcchh--HHHHHHHHHHHHHHHHHHHhhCC--CCCcchhhhhhhhhHHHHHHHHH-hhcCCCcccccccchhhhccccc
Confidence 467865 99999999999999877654432 47889999999999999999998 4431 345889
Q ss_pred CCcccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHHHhHh
Q psy12165 332 SACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSEHIK 380 (537)
Q Consensus 332 ~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 380 (537)
.+|++.|+.+-.|++|+++|.+ +|||.|+.|++.|....++..||.
T Consensus 285 tECgKAFKfKHHLKEHlRIHSG---EKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 285 TECGKAFKFKHHLKEHLRIHSG---EKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred cccchhhhhHHHHHhhheeecC---CCCcCCcccccccccCCccccccc
Confidence 9999999988889999887554 688999999999988888888863
No 9
>KOG3576|consensus
Probab=99.65 E-value=4.2e-17 Score=132.58 Aligned_cols=112 Identities=31% Similarity=0.564 Sum_probs=75.0
Q ss_pred CCccccccccccccChhHHHHhHhhh-cCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCC
Q psy12165 357 KKPYACRICARLLGNGVKVSEHIKLH-GEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEW 435 (537)
Q Consensus 357 ~~~~~C~~C~~~f~~~~~l~~H~~~H-~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f 435 (537)
...|.|.+|++.|.....|.+|++.| ..+.+-|..||+.|.+.-.|++|+++|++ .+||+|..|+++|
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtg-----------vrpykc~~c~kaf 183 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTG-----------VRPYKCSLCEKAF 183 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccC-----------ccccchhhhhHHH
Confidence 35577777777777777777777777 55667777777777777777777777777 5677777777777
Q ss_pred CChHHHHHHHHhhcCCC----------CcccccccccccCCHHHHHHHHHhhCC
Q psy12165 436 PSRKFMWQHLIRSHKAV----------AGLACGVCLKISETYEALALHLDQAHP 479 (537)
Q Consensus 436 ~~~~~l~~H~~~~H~~~----------~~~~C~~C~~~f~~~~~l~~H~~~~H~ 479 (537)
+.+..|..|++++|... +.|.|+.||+.-.....+..|+..+|+
T Consensus 184 tqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 184 TQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred HhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 77777777777777532 234455555555555555555554443
No 10
>KOG3576|consensus
Probab=99.64 E-value=4.9e-17 Score=132.15 Aligned_cols=120 Identities=27% Similarity=0.438 Sum_probs=109.0
Q ss_pred cCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCccccccccc
Q psy12165 383 GEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLK 462 (537)
Q Consensus 383 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 462 (537)
+...|.|.+|++.|.....|.+|++-|.+ .+.+-|..||+.|...-+|.+|+ +.|+|.+||+|..|++
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~-----------vkr~lct~cgkgfndtfdlkrh~-rthtgvrpykc~~c~k 181 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSD-----------VKRHLCTFCGKGFNDTFDLKRHT-RTHTGVRPYKCSLCEK 181 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccH-----------HHHHHHhhccCcccchhhhhhhh-ccccCccccchhhhhH
Confidence 45579999999999999999999999998 67899999999999999999998 9999999999999999
Q ss_pred ccCCHHHHHHHHHhhCCCCc-----ccccCCcccccccccccChhHHHHHHhhhhCC
Q psy12165 463 ISETYEALALHLDQAHPGYF-----AEEVDNPTCEVCGRYHNARPKLESHLAVHVGL 514 (537)
Q Consensus 463 ~f~~~~~l~~H~~~~H~~~~-----~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~g~ 514 (537)
.|+..-.|..|++.+|+... .-..+.|.|+.||+.-.....+..|++.|+-.
T Consensus 182 aftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 182 AFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 99999999999999998621 11347799999999999999999999999864
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.21 E-value=4.5e-11 Score=117.28 Aligned_cols=145 Identities=21% Similarity=0.444 Sum_probs=95.4
Q ss_pred cccCCcccccCCHHHHHHHHHhhccccCCCcccccc--ccccccChhHHHHhHhhhcCCccccccccccCCChHHHHHHH
Q psy12165 329 CMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRI--CARLLGNGVKVSEHIKLHGEHLVNCRVCGSIFHSEPELKVHL 406 (537)
Q Consensus 329 ~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~~f~~~~~L~~H~ 406 (537)
..|+.|..... ...|..|....-. ..-.|+. |+..|. +..+..| +.|+.|+..|. ...|..|+
T Consensus 408 V~C~NC~~~i~-l~~l~lHe~~C~r----~~V~Cp~~~Cg~v~~-r~el~~H--------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHEAYCSR----HNVVCPHDGCGIVLR-VEEAKNH--------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred EECCCCCCccc-hhHHHHHHhhCCC----cceeCCcccccceee-ccccccC--------ccCCCCCCccc-hHHHHHHH
Confidence 34555554433 2333355443221 2335763 777773 3334444 57888888885 57788888
Q ss_pred HhccCCCCCccCCCCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccC----------CHHHHHHHHHh
Q psy12165 407 EEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISE----------TYEALALHLDQ 476 (537)
Q Consensus 407 ~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~----------~~~~l~~H~~~ 476 (537)
++++. ++.|+ |++.+ .+..|..|+ ..|...+++.|.+|++.+. ....|..|..
T Consensus 473 ~~~Hk-------------pv~Cp-Cg~~~-~R~~L~~H~-~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~- 535 (567)
T PLN03086 473 KVFHE-------------PLQCP-CGVVL-EKEQMVQHQ-ASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHES- 535 (567)
T ss_pred HhcCC-------------CccCC-CCCCc-chhHHHhhh-hccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHH-
Confidence 88764 68888 88655 668888886 7788888888888888874 2347888877
Q ss_pred hCCCCcccccCCcccccccccccChhHHHHHHhhhh
Q psy12165 477 AHPGYFAEEVDNPTCEVCGRYHNARPKLESHLAVHV 512 (537)
Q Consensus 477 ~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 512 (537)
.++ .+++.|..|++.+..+ .|..|+...|
T Consensus 536 ~CG------~rt~~C~~Cgk~Vrlr-dm~~H~~~~h 564 (567)
T PLN03086 536 ICG------SRTAPCDSCGRSVMLK-EMDIHQIAVH 564 (567)
T ss_pred hcC------CcceEccccCCeeeeh-hHHHHHHHhh
Confidence 343 5888888888887544 4888875544
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.16 E-value=1e-10 Score=114.84 Aligned_cols=146 Identities=18% Similarity=0.368 Sum_probs=113.1
Q ss_pred cccccchhcccCcHHHHHHHhhcCCCCCccccCC--cccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHHH
Q psy12165 300 NYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSA--CFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSE 377 (537)
Q Consensus 300 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~ 377 (537)
-.|+.|...... ..|..|.. +..-....|+. |+..|.. ..+ ++.+.|+.|++.|. ...|..
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~--~C~r~~V~Cp~~~Cg~v~~r-~el------------~~H~~C~~Cgk~f~-~s~Lek 470 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEA--YCSRHNVVCPHDGCGIVLRV-EEA------------KNHVHCEKCGQAFQ-QGEMEK 470 (567)
T ss_pred EECCCCCCccch-hHHHHHHh--hCCCcceeCCcccccceeec-ccc------------ccCccCCCCCCccc-hHHHHH
Confidence 369999888774 45668874 45556667884 8888732 222 45678999999996 678999
Q ss_pred hHhhhcCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCC----------ChHHHHHHHHh
Q psy12165 378 HIKLHGEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWP----------SRKFMWQHLIR 447 (537)
Q Consensus 378 H~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~----------~~~~l~~H~~~ 447 (537)
|+.++ ..++.|+ ||..+ .+..|..|+.+|.+ ++++.|+.|++.|. ....|..|. .
T Consensus 471 H~~~~-Hkpv~Cp-Cg~~~-~R~~L~~H~~thCp-----------~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE-~ 535 (567)
T PLN03086 471 HMKVF-HEPLQCP-CGVVL-EKEQMVQHQASTCP-----------LRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHE-S 535 (567)
T ss_pred HHHhc-CCCccCC-CCCCc-chhHHHhhhhccCC-----------CCceeCCCCCCccccCccccchhhhhhhHHHHH-H
Confidence 99987 3789999 99754 67899999988888 78999999999885 245799997 4
Q ss_pred hcCCCCcccccccccccCCHHHHHHHHHhhCC
Q psy12165 448 SHKAVAGLACGVCLKISETYEALALHLDQAHP 479 (537)
Q Consensus 448 ~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 479 (537)
.. +.+++.|..|++.+. ..+|..|+..+|.
T Consensus 536 ~C-G~rt~~C~~Cgk~Vr-lrdm~~H~~~~h~ 565 (567)
T PLN03086 536 IC-GSRTAPCDSCGRSVM-LKEMDIHQIAVHQ 565 (567)
T ss_pred hc-CCcceEccccCCeee-ehhHHHHHHHhhc
Confidence 44 889999999998876 5678899988884
No 13
>PHA00733 hypothetical protein
Probab=98.96 E-value=1.5e-09 Score=86.47 Aligned_cols=80 Identities=21% Similarity=0.358 Sum_probs=49.3
Q ss_pred CCccccccccccccChhHHHHh------HhhhcCCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCC
Q psy12165 357 KKPYACRICARLLGNGVKVSEH------IKLHGEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDL 430 (537)
Q Consensus 357 ~~~~~C~~C~~~f~~~~~l~~H------~~~H~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~ 430 (537)
.+++.|.+|...|.....|..+ +..++.+||.|+.||+.|.....|..|++.+.. +|.|+.
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~~-------------~~~C~~ 104 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEH-------------SKVCPV 104 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCCc-------------CccCCC
Confidence 4566677766666655444443 223356667777777777777777777665422 466777
Q ss_pred CCCCCCChHHHHHHHHhhc
Q psy12165 431 CSKEWPSRKFMWQHLIRSH 449 (537)
Q Consensus 431 C~~~f~~~~~l~~H~~~~H 449 (537)
|++.|.....|..|+...|
T Consensus 105 CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 105 CGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred CCCccCCHHHHHHHHHHhc
Confidence 7777766666666665555
No 14
>PHA00733 hypothetical protein
Probab=98.94 E-value=8.7e-10 Score=87.83 Aligned_cols=55 Identities=20% Similarity=0.399 Sum_probs=26.7
Q ss_pred CCcccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHhhhhCC
Q psy12165 452 VAGLACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLAVHVGL 514 (537)
Q Consensus 452 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~g~ 514 (537)
.+||.|+.|++.|.+...|..|++ .| ..+|.|+.|+++|.....|..|++..+|+
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r-~h-------~~~~~C~~CgK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIR-YT-------EHSKVCPVCGKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHh-cC-------CcCccCCCCCCccCCHHHHHHHHHHhcCc
Confidence 444555555555555555555544 22 12355555555555555555555544443
No 15
>KOG3993|consensus
Probab=98.87 E-value=5.2e-10 Score=102.18 Aligned_cols=179 Identities=18% Similarity=0.238 Sum_probs=108.7
Q ss_pred CccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchhhccChHHHHHHHhhcCCCCCCC--CCCC-
Q psy12165 125 DVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQKQFKQNWLLQKHKKTCKGPKRPT--SPLP- 201 (537)
Q Consensus 125 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~--~~~~- 201 (537)
.|.|..|...|.+.-.|.+|.-..- ....|+|+.|++.|+...+|..|.+-|.....+. ...+
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RI--------------V~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~ 332 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRI--------------VHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPP 332 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCee--------------EEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCCh
Confidence 4889999999999888988864221 2345999999999999999999999987654332 1111
Q ss_pred -----------------CCCCCCccccccchhccCCHHHHHHHHHhccCccccccccCCCCcccH---------HHHhhh
Q psy12165 202 -----------------EPDEDGIFRCKKCVKVFVNPDIYKKHLKHSHTKAYCEVCITAPGYETK---------VELLHH 255 (537)
Q Consensus 202 -----------------~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~---------~~l~~H 255 (537)
...+++.|.|..|++.|.....|+.|+..|+....-..- ...|... ..+..|
T Consensus 333 k~~~~~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~--a~~f~~s~~~~l~~~~~~~a~h 410 (500)
T KOG3993|consen 333 KQAVETRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK--APKFLLSRVIPLMHFNQAVATH 410 (500)
T ss_pred hhhhhhhhhhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc--ccCcchhhcccccccccccccc
Confidence 114567899999999999999999998877752111000 0001111 011111
Q ss_pred hhhccC----------CCCCcccccccccccccCHHHHHHHHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCC
Q psy12165 256 MKTEHK----------DNEEYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKD 325 (537)
Q Consensus 256 ~~~~h~----------~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 325 (537)
....+. ....-.+.+ ++..+++...-..+.+.-... ..|.|.+|..+|.+...|.+|+.+.|..
T Consensus 411 ~~a~~~~g~~vl~~a~sael~~pp~--~~~ppsss~~sgg~~rlg~~~----q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 411 SSASDSHGDEVLYVAGSAELELPPY--DGSPPSSSGSSGGYGRLGIAE----QGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred cccccccccceeeeeccccccCCCC--CCCCcccCCCCCccccccchh----hccccccchHhhhcCcchHhHhhhcChH
Confidence 111110 111223333 555555444333333222222 3677888888888888888888766653
No 16
>KOG1146|consensus
Probab=98.86 E-value=3.9e-10 Score=117.62 Aligned_cols=115 Identities=14% Similarity=0.125 Sum_probs=82.9
Q ss_pred cccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCHH
Q psy12165 389 CRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYE 468 (537)
Q Consensus 389 C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~ 468 (537)
|..|..-|.....+..|-.---. .......+.|..|++.|...-.+. |+ -...+|.|..|...|....
T Consensus 1231 ~~~~e~~f~~~~~~~~~a~~~~~-------~~~~sGe~~c~~~~~~~~~~~~~~-~l----~~~~~~~~~~~~~~~~~~~ 1298 (1406)
T KOG1146|consen 1231 PNALEQPFPQEPEPTATAPPKPP-------ELPASGEGECGAVDELLTPSFGIS-TL----DVTHRYLCRQCKMAFDGEA 1298 (1406)
T ss_pred HHhhhcCccCcccccccCCCCCC-------cCcCCCcchhhhccccccCcccee-ec----ccchhHHHHHHHhhhcchh
Confidence 77788888877776664321100 011134689999999998888877 76 2346799999999999999
Q ss_pred HHHHHHHhhCCC-----------CcccccCCcccccccccccChhHHHHHHhhhhCCCc
Q psy12165 469 ALALHLDQAHPG-----------YFAEEVDNPTCEVCGRYHNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 469 ~l~~H~~~~H~~-----------~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~g~~~ 516 (537)
.|..|.++.-.- .......+| |..|...|.....|..||++-.+..+
T Consensus 1299 ~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~~k 1356 (1406)
T KOG1146|consen 1299 PLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHRRK 1356 (1406)
T ss_pred HHHHHHHHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhccc
Confidence 999998643211 111223556 99999999999999999998877654
No 17
>KOG3993|consensus
Probab=98.69 E-value=3.2e-09 Score=97.16 Aligned_cols=149 Identities=15% Similarity=0.253 Sum_probs=95.8
Q ss_pred ccccccccccccChhHHHHhHhhh-cCCccccccccccCCChHHHHHHHHhccCCCCCc---------c-----------
Q psy12165 359 PYACRICARLLGNGVKVSEHIKLH-GEHLVNCRVCGSIFHSEPELKVHLEEHTDVPETK---------E----------- 417 (537)
Q Consensus 359 ~~~C~~C~~~f~~~~~l~~H~~~H-~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~---------~----------- 417 (537)
.|.|..|...|.+...|.+|.-.- -.-.|+|+.|++.|.-..+|..|.+.|...+... .
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 478888888888888888885321 1123788888888888888888888886543221 0
Q ss_pred --CCCCCCCCcccCCCCCCCCChHHHHHHHHhhcCCCCc-----------------------------------------
Q psy12165 418 --QKPRPELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAG----------------------------------------- 454 (537)
Q Consensus 418 --~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~----------------------------------------- 454 (537)
.....+..|.|.+|+++|.....|++|....|....+
T Consensus 347 rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~ 426 (500)
T KOG3993|consen 347 RSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAG 426 (500)
T ss_pred ccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeec
Confidence 0012345799999999999999999996333322110
Q ss_pred ----ccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHhhhhC
Q psy12165 455 ----LACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLAVHVG 513 (537)
Q Consensus 455 ----~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~g 513 (537)
..|.+|+..+.+++.--.+.+--+ ....|.|.+|.-.|.+...|.+|+..-|-
T Consensus 427 sael~~pp~~~~ppsss~~sgg~~rlg~------~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 427 SAELELPPYDGSPPSSSGSSGGYGRLGI------AEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred cccccCCCCCCCCcccCCCCCccccccc------hhhccccccchHhhhcCcchHhHhhhcCh
Confidence 224444444443333322222111 13458999999999999999999876553
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.63 E-value=1.7e-08 Score=65.10 Aligned_cols=42 Identities=19% Similarity=0.437 Sum_probs=26.0
Q ss_pred cccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCCChHHH
Q psy12165 387 VNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFM 441 (537)
Q Consensus 387 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l 441 (537)
|.|+.||+.|....+|..|+++|+. +|+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~k-------------~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHNT-------------NLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcCC-------------cccCCcccceeccccee
Confidence 5666666666666666666666663 46666666666555444
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.56 E-value=2.7e-08 Score=64.16 Aligned_cols=43 Identities=21% Similarity=0.403 Sum_probs=35.1
Q ss_pred ccccccccccccChhHHHHhHhhhcCCccccccccccCCChHHH
Q psy12165 359 PYACRICARLLGNGVKVSEHIKLHGEHLVNCRVCGSIFHSEPEL 402 (537)
Q Consensus 359 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~~f~~~~~L 402 (537)
.|.|+.||+.|...++|..|+++|. ++|+|..|++.|...+.|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~-k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN-TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC-CcccCCcccceeccccee
Confidence 4788888888888888888888886 688888888888776654
No 20
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.32 E-value=9.8e-07 Score=58.75 Aligned_cols=51 Identities=25% Similarity=0.458 Sum_probs=25.7
Q ss_pred ccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHhhhh
Q psy12165 455 LACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLAVHV 512 (537)
Q Consensus 455 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 512 (537)
|.|++|++ .-+...|..|+...|.. +.+.+.|++|...++ .+|..||.+++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~----~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRS----ESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcC----CCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 45555555 23345555555555543 123455555555433 25555555544
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.23 E-value=6.7e-07 Score=49.12 Aligned_cols=24 Identities=38% Similarity=0.841 Sum_probs=12.9
Q ss_pred HHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCC
Q psy12165 402 LKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWP 436 (537)
Q Consensus 402 L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~ 436 (537)
|.+|+++|++ ++||.|+.|+++|.
T Consensus 2 l~~H~~~H~~-----------~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTG-----------EKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSS-----------SSSEEESSSSEEES
T ss_pred HHHHhhhcCC-----------CCCCCCCCCcCeeC
Confidence 4555555555 45555555555553
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.22 E-value=5.1e-07 Score=49.58 Aligned_cols=24 Identities=25% Similarity=0.483 Sum_probs=13.2
Q ss_pred HHHHHHhhcCCCCcccccccccccC
Q psy12165 441 MWQHLIRSHKAVAGLACGVCLKISE 465 (537)
Q Consensus 441 l~~H~~~~H~~~~~~~C~~C~~~f~ 465 (537)
|..|+ +.|++++||.|+.|++.|.
T Consensus 2 l~~H~-~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHM-RTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHH-HHHSSSSSEEESSSSEEES
T ss_pred HHHHh-hhcCCCCCCCCCCCcCeeC
Confidence 45555 3555555555555555554
No 23
>PHA00616 hypothetical protein
Probab=98.19 E-value=7.5e-07 Score=54.55 Aligned_cols=29 Identities=21% Similarity=0.308 Sum_probs=24.6
Q ss_pred CcccccccccccChhHHHHHHhhhhCCCc
Q psy12165 488 NPTCEVCGRYHNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 488 ~~~C~~C~~~f~~~~~l~~H~~~h~g~~~ 516 (537)
||+|..||+.|..++.|+.|++.|+|.++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~ 29 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNK 29 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCc
Confidence 58888888888888888888888888876
No 24
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.13 E-value=4.8e-06 Score=55.42 Aligned_cols=51 Identities=25% Similarity=0.512 Sum_probs=38.3
Q ss_pred CcccCCCCCCCCChHHHHHHHHhhcCCC-CcccccccccccCCHHHHHHHHHhhC
Q psy12165 425 PVTCDLCSKEWPSRKFMWQHLIRSHKAV-AGLACGVCLKISETYEALALHLDQAH 478 (537)
Q Consensus 425 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~H 478 (537)
.|.||+|++. -+...|..|+...|..+ +.+.|++|...+. .+|..|+...|
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 4788888884 46778888888888765 4578888887655 48888888766
No 25
>PHA00732 hypothetical protein
Probab=98.12 E-value=2.3e-06 Score=61.39 Aligned_cols=37 Identities=27% Similarity=0.383 Sum_probs=17.5
Q ss_pred cccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccC
Q psy12165 426 VTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISE 465 (537)
Q Consensus 426 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 465 (537)
|.|+.|++.|.+...|..|++..|. ++.|+.|++.|.
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~ 38 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR 38 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC
Confidence 4455555555555555555422332 234555555554
No 26
>PHA00732 hypothetical protein
Probab=98.08 E-value=3e-06 Score=60.83 Aligned_cols=44 Identities=25% Similarity=0.539 Sum_probs=34.3
Q ss_pred ccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCCCChHHHHHHH
Q psy12165 386 LVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEWPSRKFMWQHL 445 (537)
Q Consensus 386 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~ 445 (537)
||.|+.||+.|.+...|..|++.++. ++.|+.|++.|. .|..|+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-------------~~~C~~CgKsF~---~l~~H~ 44 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT-------------LTKCPVCNKSYR---RLNQHF 44 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC-------------CCccCCCCCEeC---Chhhhh
Confidence 57888888888888888888875333 257888888886 577887
No 27
>PHA00616 hypothetical protein
Probab=98.06 E-value=1.9e-06 Score=52.82 Aligned_cols=25 Identities=32% Similarity=0.596 Sum_probs=11.1
Q ss_pred cccccccccCCChHHHHHHHHhccC
Q psy12165 387 VNCRVCGSIFHSEPELKVHLEEHTD 411 (537)
Q Consensus 387 ~~C~~C~~~f~~~~~L~~H~~~h~~ 411 (537)
|+|+.||+.|...++|..|++.|++
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg 26 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHK 26 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcC
Confidence 4444444444444444444444444
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.88 E-value=6.8e-06 Score=43.89 Aligned_cols=23 Identities=26% Similarity=0.768 Sum_probs=17.0
Q ss_pred cccccccccccChhHHHHHHhhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAVH 511 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~h 511 (537)
|+|+.|++.|.++..|..||+.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56777777777777777777764
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.69 E-value=2.3e-05 Score=41.81 Aligned_cols=23 Identities=26% Similarity=0.534 Sum_probs=21.3
Q ss_pred cccccccccccchHHHHHhHHhh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEEE 33 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~~ 33 (537)
|.|++|++.|.++..|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999999863
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.63 E-value=3.1e-05 Score=41.81 Aligned_cols=24 Identities=25% Similarity=0.717 Sum_probs=14.3
Q ss_pred cccccccccccChhHHHHHHhhhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAVHV 512 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~h~ 512 (537)
|.|++|++.|.+...|+.|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666653
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.57 E-value=3.6e-05 Score=59.53 Aligned_cols=73 Identities=19% Similarity=0.294 Sum_probs=15.8
Q ss_pred ccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHH
Q psy12165 427 TCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLES 506 (537)
Q Consensus 427 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~ 506 (537)
+|..|+..|++...|..|+...|.-..+ ....+.....+..++... . ...+.|.+|++.|.+...|..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~--~-----~~~~~C~~C~~~f~s~~~l~~ 68 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKK--V-----KESFRCPYCNKTFRSREALQE 68 (100)
T ss_dssp ------------------------------------------------------------SSEEBSSSS-EESSHHHHHH
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccccc--c-----CCCCCCCccCCCCcCHHHHHH
Confidence 3666666666666666666555543322 111111233333333211 0 124667777777777777777
Q ss_pred HHhhh
Q psy12165 507 HLAVH 511 (537)
Q Consensus 507 H~~~h 511 (537)
||+.+
T Consensus 69 Hm~~~ 73 (100)
T PF12756_consen 69 HMRSK 73 (100)
T ss_dssp HHHHT
T ss_pred HHcCc
Confidence 76654
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.56 E-value=5e-05 Score=40.99 Aligned_cols=24 Identities=33% Similarity=0.657 Sum_probs=20.5
Q ss_pred cccccccccccchHHHHHhHHhhC
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEEEH 34 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~~h 34 (537)
|.|++|++.|.+..+|..|++.+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999999875
No 33
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.45 E-value=6.8e-05 Score=41.78 Aligned_cols=25 Identities=24% Similarity=0.476 Sum_probs=23.4
Q ss_pred ccccccccccccchHHHHHhHHhhC
Q psy12165 10 ALSCGNCLKVFRDNLSLLKHVEEEH 34 (537)
Q Consensus 10 ~~~C~~C~~~f~~~~~L~~H~~~~h 34 (537)
||.|..|++.|.+..+|.+|++.|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 6899999999999999999998875
No 34
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.43 E-value=9.3e-05 Score=57.21 Aligned_cols=75 Identities=21% Similarity=0.321 Sum_probs=22.8
Q ss_pred ccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchhhc
Q psy12165 98 VCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQKQF 177 (537)
Q Consensus 98 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f 177 (537)
.|..|+..|.+...|..|+...|.-..+ ....+.....+..++..- ....+.|.+|++.|
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~---------------~~~~~~C~~C~~~f 60 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKK---------------VKESFRCPYCNKTF 60 (100)
T ss_dssp --------------------------------------------------------------------SSEEBSSSS-EE
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccccc---------------cCCCCCCCccCCCC
Confidence 3777888888888888887666643211 111222333333333321 23369999999999
Q ss_pred cChHHHHHHHhhcCC
Q psy12165 178 KQNWLLQKHKKTCKG 192 (537)
Q Consensus 178 ~~~~~l~~H~~~h~~ 192 (537)
.+...|..|+..+..
T Consensus 61 ~s~~~l~~Hm~~~~H 75 (100)
T PF12756_consen 61 RSREALQEHMRSKHH 75 (100)
T ss_dssp SSHHHHHHHHHHTTT
T ss_pred cCHHHHHHHHcCccC
Confidence 999999999987543
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.43 E-value=6.1e-05 Score=41.97 Aligned_cols=26 Identities=23% Similarity=0.506 Sum_probs=18.4
Q ss_pred CcccccccccccChhHHHHHHhhhhC
Q psy12165 488 NPTCEVCGRYHNARPKLESHLAVHVG 513 (537)
Q Consensus 488 ~~~C~~C~~~f~~~~~l~~H~~~h~g 513 (537)
||+|..|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 46777777777777777777776653
No 36
>KOG2231|consensus
Probab=97.35 E-value=0.0004 Score=69.96 Aligned_cols=22 Identities=18% Similarity=0.436 Sum_probs=11.7
Q ss_pred chHHHHHhHHhhCcccccccccccce
Q psy12165 22 DNLSLLKHVEEEHPTMQRRYSCQVHI 47 (537)
Q Consensus 22 ~~~~L~~H~~~~h~~~~~~~~C~~c~ 47 (537)
+...|..||...| +-+.|..|.
T Consensus 126 s~~~Lk~H~~~~H----~~~~c~lC~ 147 (669)
T KOG2231|consen 126 SVENLKNHMRDQH----KLHLCSLCL 147 (669)
T ss_pred HHHHHHHHHHHhh----hhhcccccc
Confidence 5556666664443 345555554
No 37
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.32 E-value=0.00014 Score=64.77 Aligned_cols=67 Identities=30% Similarity=0.469 Sum_probs=50.2
Q ss_pred CCceeeccc--chhhccChHHHHHHHhhcCC-CCCCCCCCCC-----CCCCCccccccchhccCCHHHHHHHHHh
Q psy12165 164 GESVHICHL--CQKQFKQNWLLQKHKKTCKG-PKRPTSPLPE-----PDEDGIFRCKKCVKVFVNPDIYKKHLKH 230 (537)
Q Consensus 164 ~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~-~~~~~~~~~~-----~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 230 (537)
+++||+|++ |++.+.++..|+.|+..-|. .+-.+.+.++ ...+.||+|++|++.|.+...|+.|...
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 459999997 99999999999999875442 2211111111 2356799999999999999999999764
No 38
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.27 E-value=0.00016 Score=38.87 Aligned_cols=24 Identities=25% Similarity=0.469 Sum_probs=16.5
Q ss_pred cccccccccccChhHHHHHHhhhhC
Q psy12165 489 PTCEVCGRYHNARPKLESHLAVHVG 513 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~h~g 513 (537)
|+|+.|+++.+ +..|.+|+++|+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 57788887777 7778888777664
No 39
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.27 E-value=0.00011 Score=65.45 Aligned_cols=63 Identities=17% Similarity=0.377 Sum_probs=41.2
Q ss_pred cCCcccccc--ccccCCChHHHHHHHHhccCCCCCc-------cCC-CCCCCCcccCCCCCCCCChHHHHHHH
Q psy12165 383 GEHLVNCRV--CGSIFHSEPELKVHLEEHTDVPETK-------EQK-PRPELPVTCDLCSKEWPSRKFMWQHL 445 (537)
Q Consensus 383 ~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~-------~~~-~~~~~~~~C~~C~~~f~~~~~l~~H~ 445 (537)
+++||+|++ |.+.++....|+.|+.--+..+..- .+. ...++||+|++|++++.....|+-|.
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 568999988 9999999999999986433221111 010 12346777777777777777776664
No 40
>KOG1146|consensus
Probab=97.26 E-value=0.00011 Score=78.30 Aligned_cols=59 Identities=19% Similarity=0.274 Sum_probs=40.3
Q ss_pred cccccccccc-CCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHhhhhC
Q psy12165 455 LACGVCLKIS-ETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLAVHVG 513 (537)
Q Consensus 455 ~~C~~C~~~f-~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~g 513 (537)
|.|-+|.... .+...|..|+..-=......-...|.|.+|.+....+.++..|.+..--
T Consensus 735 ~~civcd~~st~~l~~l~~h~~~~rs~ke~v~g~~~~c~l~~y~t~~kanfqlh~Ktdkh 794 (1406)
T KOG1146|consen 735 FDCIVCDVFSTDRLDQLWFHNTRERSRKEQVPGDVPSCKLKPYATNTKANFQLHNKTDKH 794 (1406)
T ss_pred HHHhhhhhhhhhhHHHHhhcchhhhhhhhcccCCCCcceecccccccchhhhhhcccchh
Confidence 7799997644 4556677775422211112234679999999999999999999876543
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.21 E-value=0.00069 Score=61.13 Aligned_cols=139 Identities=23% Similarity=0.423 Sum_probs=94.9
Q ss_pred cccccc--chhccCCHHHHHHHHHhccCccccccccCCCCcccHHHHhhhhhhccCCCCCcccccccccccccCHHHHHH
Q psy12165 208 IFRCKK--CVKVFVNPDIYKKHLKHSHTKAYCEVCITAPGYETKVELLHHMKTEHKDNEEYKCLIPGCKKYFRTKLDCQK 285 (537)
Q Consensus 208 ~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~~~l~~H~~~~h~~~~~~~C~~~~C~~~f~~~~~l~~ 285 (537)
.|.|+. |.........|..|.+..|+.+.|..|... .+.|.| .....+...|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~~~C~~C~~n-------------------Kk~F~~-----E~~lF~~~~Lr~ 206 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVLCSECIGN-------------------KKDFWN-----EIRLFRSSTLRD 206 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCcEEhHhhhcC-------------------cccCcc-----ceeeeecccccc
Confidence 378876 888888899999999999999999999432 223433 233344555666
Q ss_pred HHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCCCCccccCCccc-------ccCCHHHHHHHHHhhccccCCC
Q psy12165 286 HCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSACFV-------VEDNLSALEAHVAAKHKAIIKK 358 (537)
Q Consensus 286 H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~-------~~~~~~~l~~H~~~~h~~~~~~ 358 (537)
|...-.....-..-..|..|..-|.+-..|..|++..|.. |-+|+. -|.+-..|..|.+..
T Consensus 207 H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~HE~-----ChICD~v~p~~~QYFK~Y~~Le~HF~~~------- 274 (493)
T COG5236 207 HKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHEA-----CHICDMVGPIRYQYFKSYEDLEAHFRNA------- 274 (493)
T ss_pred cccCCccccCcCCCchhhhccceecChHHHHHHHHhhhhh-----hhhhhccCccchhhhhCHHHHHHHhhcC-------
Confidence 6543322211112346999999999999999999876754 667754 366677888888753
Q ss_pred cccccc--cc----ccccChhHHHHhHh-hh
Q psy12165 359 PYACRI--CA----RLLGNGVKVSEHIK-LH 382 (537)
Q Consensus 359 ~~~C~~--C~----~~f~~~~~l~~H~~-~H 382 (537)
-|.|.. |. .+|.....|..|+. .|
T Consensus 275 hy~ct~qtc~~~k~~vf~~~~el~~h~~~~h 305 (493)
T COG5236 275 HYCCTFQTCRVGKCYVFPYHTELLEHLTRFH 305 (493)
T ss_pred ceEEEEEEEecCcEEEeccHHHHHHHHHHHh
Confidence 355544 42 47888888999876 55
No 42
>KOG2231|consensus
Probab=97.12 E-value=0.00089 Score=67.55 Aligned_cols=114 Identities=23% Similarity=0.468 Sum_probs=70.1
Q ss_pred cccccchhccCCHHHHHHHHHhccCccccccccCCCCcccHHHHhhhhhhccCCCCCcccccccccccccCHHHHHHHHH
Q psy12165 209 FRCKKCVKVFVNPDIYKKHLKHSHTKAYCEVCITAPGYETKVELLHHMKTEHKDNEEYKCLIPGCKKYFRTKLDCQKHCR 288 (537)
Q Consensus 209 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~~~~~~~~~~~l~~H~~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~ 288 (537)
-.|..| ..|.+...|+.|+...|..+.|..|-.. .+.|.+ +.+ .-+..+|..|+.
T Consensus 116 ~~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~-------------------~kif~~----e~k-~Yt~~el~~h~~ 170 (669)
T KOG2231|consen 116 KECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQN-------------------LKIFIN----ERK-LYTRAELNLHLM 170 (669)
T ss_pred CCCccc-cchhHHHHHHHHHHHhhhhhcccccccc-------------------ceeeee----eee-hehHHHHHHHHh
Confidence 358888 8889999999999999999999999543 222222 222 223445555544
Q ss_pred hccc-cCCCCCCcccccchhcccCcHHHHHHHhhcCCCCCccccCCc------ccccCCHHHHHHHHHhhc
Q psy12165 289 EHKT-VTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSAC------FVVEDNLSALEAHVAAKH 352 (537)
Q Consensus 289 ~h~~-~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C------~~~~~~~~~l~~H~~~~h 352 (537)
.-.. ..+-.....|..|...|.....|..|++..|- .|..| ..-|..-..|..|.+..|
T Consensus 171 ~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h~-----~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 171 FGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDHE-----FCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cCCCccccccCCccchhhhhhhccHHHHHHhhcccee-----heeecCcccccchhcccchHHHHHhhhcC
Confidence 3322 11111124577777777777777777765443 34444 344566677777777555
No 43
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.09 E-value=0.0011 Score=59.99 Aligned_cols=146 Identities=23% Similarity=0.307 Sum_probs=95.5
Q ss_pred Ccccc--cchhcccCcHHHHHHHhhcCCCCCccccCCcccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHH
Q psy12165 299 TNYCN--FCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVS 376 (537)
Q Consensus 299 ~~~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~ 376 (537)
.|.|+ .|.........|..|.+..|.. +.|.+|-.. ++.|.|.+ ..-+...|.
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~---~~C~~C~~n-------------------Kk~F~~E~---~lF~~~~Lr 205 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF---VLCSECIGN-------------------KKDFWNEI---RLFRSSTLR 205 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc---EEhHhhhcC-------------------cccCccce---eeeeccccc
Confidence 56775 4777777778888888876654 556666421 24444432 222345677
Q ss_pred HhHhhh-cCCcc----ccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCC-------CCCChHHHHHH
Q psy12165 377 EHIKLH-GEHLV----NCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSK-------EWPSRKFMWQH 444 (537)
Q Consensus 377 ~H~~~H-~~~~~----~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~-------~f~~~~~l~~H 444 (537)
.|...- .+..| .|..|...|.+-..|..|++.-+. +|-+|++ -|.+...|..|
T Consensus 206 ~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~HE---------------~ChICD~v~p~~~QYFK~Y~~Le~H 270 (493)
T COG5236 206 DHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE---------------ACHICDMVGPIRYQYFKSYEDLEAH 270 (493)
T ss_pred ccccCCccccCcCCCchhhhccceecChHHHHHHHHhhhh---------------hhhhhhccCccchhhhhCHHHHHHH
Confidence 776543 22122 499999999999999999986553 3555553 46777888888
Q ss_pred HHhhcCCCCcccccc--cc----cccCCHHHHHHHHHhhCCCCcccccCCc
Q psy12165 445 LIRSHKAVAGLACGV--CL----KISETYEALALHLDQAHPGYFAEEVDNP 489 (537)
Q Consensus 445 ~~~~H~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~H~~~~~~~~~~~ 489 (537)
.+..| |.|.+ |- ..|.....|..|+.+-|.....+.+.++
T Consensus 271 F~~~h-----y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~~~~~~~~~ 316 (493)
T COG5236 271 FRNAH-----YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVNARLSEIPR 316 (493)
T ss_pred hhcCc-----eEEEEEEEecCcEEEeccHHHHHHHHHHHhhcccccCcCCC
Confidence 86555 55544 31 5788999999999999987555554444
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.93 E-value=0.00076 Score=36.95 Aligned_cols=24 Identities=25% Similarity=0.692 Sum_probs=17.1
Q ss_pred cccccccccccChhHHHHHHhhhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAVHV 512 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~h~ 512 (537)
|.|+.|++.|.+...|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 467777777777777777777654
No 45
>KOG2785|consensus
Probab=96.84 E-value=0.004 Score=57.68 Aligned_cols=87 Identities=17% Similarity=0.350 Sum_probs=53.7
Q ss_pred cccccccccccccchHHHHHhHHhhCcccccccccccceeeehhhcccchhhhhhhhhhcCcccccccccccccHHHHHH
Q psy12165 9 AALSCGNCLKVFRDNLSLLKHVEEEHPTMQRRYSCQVHIILVLNTLLDDNLSLLKHVEEEHPTMQRRYSCQVDTMEAIKE 88 (537)
Q Consensus 9 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~C~~c~~~~~~~~~~~~~~l~~h~~~~h~~~~~~~~c~~~~~~~~~~ 88 (537)
..|.|..|...|.+...-+.|+++. | -+++|.+-+...-|.....|.-...+...-..
T Consensus 2 t~ftC~tC~v~F~~ad~Qr~HyKSd-----------------W-----HRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~ 59 (390)
T KOG2785|consen 2 TGFTCNTCNVEFDDADEQRAHYKSD-----------------W-----HRYNLKRKVASLPPITAEEFNEKVLSDDSEKE 59 (390)
T ss_pred CcceeeceeeeeccHHHHHHHhhhh-----------------H-----HHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhh
Confidence 3589999999999999999999863 1 34455555444332222222221111111111
Q ss_pred h-hhcCCCccccCCCcCcCCChHHHHHHHH
Q psy12165 89 H-RKTEHAMYVCDLCPKYYSRSTHLWKHVN 117 (537)
Q Consensus 89 h-~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 117 (537)
- ......++.|.+|.+.|.+..+...|+.
T Consensus 60 ~~~e~~~~~~~c~~c~k~~~s~~a~~~hl~ 89 (390)
T KOG2785|consen 60 ENLEEAESVVYCEACNKSFASPKAHENHLK 89 (390)
T ss_pred hhhhhcccceehHHhhccccChhhHHHHHH
Confidence 1 2233457889999999999999888885
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.63 E-value=0.0013 Score=41.28 Aligned_cols=31 Identities=23% Similarity=0.404 Sum_probs=17.5
Q ss_pred cCCcccccccccccChhHHHHHHhhhhCCCc
Q psy12165 486 VDNPTCEVCGRYHNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 486 ~~~~~C~~C~~~f~~~~~l~~H~~~h~g~~~ 516 (537)
+.|-.|++|+..+.+..+|++||.++++.++
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 4566677777777777777777777766654
No 47
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.60 E-value=0.0011 Score=35.53 Aligned_cols=24 Identities=25% Similarity=0.593 Sum_probs=19.7
Q ss_pred cccccccccccchHHHHHhHHhhCc
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEEEHP 35 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~~h~ 35 (537)
|+|+.|+.... +..|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 68999999998 9999999998763
No 48
>KOG2482|consensus
Probab=96.51 E-value=0.0095 Score=54.05 Aligned_cols=192 Identities=19% Similarity=0.250 Sum_probs=107.1
Q ss_pred cHHHHHHHhhcCCCCCccccCCcccc-cCCHHHHHHHHHhhccccCCCccccccccccccChhHHHHhHhhhcCCccccc
Q psy12165 312 RKKLWLHLKSTHKDLTNCMCSACFVV-EDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSEHIKLHGEHLVNCR 390 (537)
Q Consensus 312 ~~~l~~H~~~~H~~~~~~~C~~C~~~-~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~ 390 (537)
+..|..|++..-.......|-.|... ..+++.+..|+...|.-.+..|- ....-..|..|++.- -..+.|-
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpD-------niVyvnelLehLkek-L~r~~CL 199 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPD-------NIVYVNELLEHLKEK-LERLRCL 199 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCc-------ceeeHHHHHHHHHHH-Hhhheee
Confidence 44566666544333444567777654 34566677777777654333331 112234566666533 1246899
Q ss_pred cccccCCChHHHHHHHHh--ccCCCCCccCCCCCCCCcccCC--CCCCC-CChHHH-----HHHH--------HhhcCCC
Q psy12165 391 VCGSIFHSEPELKVHLEE--HTDVPETKEQKPRPELPVTCDL--CSKEW-PSRKFM-----WQHL--------IRSHKAV 452 (537)
Q Consensus 391 ~C~~~f~~~~~L~~H~~~--h~~~~~~~~~~~~~~~~~~C~~--C~~~f-~~~~~l-----~~H~--------~~~H~~~ 452 (537)
.|.+.|..+..|+.||+. |..+.+...+- .+-|-=++ =|++. ...+.+ ..+- ...+..+
T Consensus 200 yCekifrdkntLkeHMrkK~HrrinPknreY---DkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~ 276 (423)
T KOG2482|consen 200 YCEKIFRDKNTLKEHMRKKRHRRINPKNREY---DKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAE 276 (423)
T ss_pred eeccccCCcHHHHHHHHhccCcccCCCcccc---ceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCC
Confidence 999999999999999984 54443332211 00000000 00000 000000 0000 0011111
Q ss_pred C-cccccccccccCCHHHHHHHHHhhCCCCc---------------------ccccCCcccccccccccChhHHHHHHh-
Q psy12165 453 A-GLACGVCLKISETYEALALHLDQAHPGYF---------------------AEEVDNPTCEVCGRYHNARPKLESHLA- 509 (537)
Q Consensus 453 ~-~~~C~~C~~~f~~~~~l~~H~~~~H~~~~---------------------~~~~~~~~C~~C~~~f~~~~~l~~H~~- 509 (537)
. ...|-+|.....+...|..||..+|.-.. .-....-.|-.|+.+|.....|..||.
T Consensus 277 a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 277 ALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred ccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 1 15899999999999999999999885310 001122479999999999999999996
Q ss_pred -hhhCC
Q psy12165 510 -VHVGL 514 (537)
Q Consensus 510 -~h~g~ 514 (537)
.|+++
T Consensus 357 ~k~l~i 362 (423)
T KOG2482|consen 357 DKHLSI 362 (423)
T ss_pred cccccc
Confidence 44554
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.49 E-value=0.0016 Score=35.39 Aligned_cols=23 Identities=30% Similarity=0.579 Sum_probs=21.0
Q ss_pred cccccccccccchHHHHHhHHhh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEEE 33 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~~ 33 (537)
|.|.+|+..|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67999999999999999999754
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.41 E-value=0.0016 Score=35.32 Aligned_cols=23 Identities=22% Similarity=0.571 Sum_probs=16.3
Q ss_pred cccccccccccChhHHHHHHhhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAVH 511 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~h 511 (537)
|.|++|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 45777777777777777777644
No 51
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.35 E-value=0.002 Score=40.46 Aligned_cols=29 Identities=28% Similarity=0.608 Sum_probs=17.0
Q ss_pred cCCccccccccccCCChHHHHHHHHhccC
Q psy12165 383 GEHLVNCRVCGSIFHSEPELKVHLEEHTD 411 (537)
Q Consensus 383 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 411 (537)
.+.|-.|++|+..+....+|++|+.+.++
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhc
Confidence 56667777777777777777777766655
No 52
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.23 E-value=0.0044 Score=33.78 Aligned_cols=22 Identities=41% Similarity=0.940 Sum_probs=10.8
Q ss_pred ccccccccCCChHHHHHHHHhc
Q psy12165 388 NCRVCGSIFHSEPELKVHLEEH 409 (537)
Q Consensus 388 ~C~~C~~~f~~~~~L~~H~~~h 409 (537)
.|+.|++.|.....|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 4455555555555555554433
No 53
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.23 E-value=0.0022 Score=35.56 Aligned_cols=22 Identities=27% Similarity=0.588 Sum_probs=20.7
Q ss_pred cccccccccccchHHHHHhHHh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEE 32 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~ 32 (537)
|.|.+|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999975
No 54
>KOG2482|consensus
Probab=95.89 E-value=0.024 Score=51.53 Aligned_cols=25 Identities=24% Similarity=0.478 Sum_probs=22.4
Q ss_pred ceeecccchhhccChHHHHHHHhhc
Q psy12165 166 SVHICHLCQKQFKQNWLLQKHKKTC 190 (537)
Q Consensus 166 ~~~~C~~C~~~f~~~~~l~~H~~~h 190 (537)
..+.|-+|.+.|+.+..|+.||+.-
T Consensus 194 ~r~~CLyCekifrdkntLkeHMrkK 218 (423)
T KOG2482|consen 194 ERLRCLYCEKIFRDKNTLKEHMRKK 218 (423)
T ss_pred hhheeeeeccccCCcHHHHHHHHhc
Confidence 4589999999999999999999753
No 55
>KOG4173|consensus
Probab=95.62 E-value=0.0076 Score=50.40 Aligned_cols=93 Identities=22% Similarity=0.488 Sum_probs=73.2
Q ss_pred CCceeeccc--chhhccChHHHHHHHhhcCCCCCCCCCCCCCCCCCccccccchhccCCHHHHHHHHHhccCcccccccc
Q psy12165 164 GESVHICHL--CQKQFKQNWLLQKHKKTCKGPKRPTSPLPEPDEDGIFRCKKCVKVFVNPDIYKKHLKHSHTKAYCEVCI 241 (537)
Q Consensus 164 ~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~c~~c~ 241 (537)
....+.|+. |...|.+...+..|...-|+. .|.+|.+.|++...|..|+...|..+
T Consensus 76 ~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~~----------------sCs~C~r~~Pt~hLLd~HI~E~HDs~------ 133 (253)
T KOG4173|consen 76 RVPAFACQVAGCCQVFDALDDYEHHYHTLHGN----------------SCSFCKRAFPTGHLLDAHILEWHDSL------ 133 (253)
T ss_pred ccccccccccchHHHHhhhhhHHHhhhhcccc----------------hhHHHHHhCCchhhhhHHHHHHHHHH------
Confidence 455688987 999999999999998777764 59999999999999999998666532
Q ss_pred CCCCcccHHHHhhhhhhccCCCCCcccccccccccccCHHHHHHHHHhcc
Q psy12165 242 TAPGYETKVELLHHMKTEHKDNEEYKCLIPGCKKYFRTKLDCQKHCREHK 291 (537)
Q Consensus 242 ~~~~~~~~~~l~~H~~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~ 291 (537)
|... ...+...|+|-+++|+..|.+...-..|+..-+
T Consensus 134 ----Fqa~---------veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 134 ----FQAL---------VERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMH 170 (253)
T ss_pred ----HHHH---------HHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhc
Confidence 2111 123466799999999999999999888876544
No 56
>KOG2785|consensus
Probab=95.59 E-value=0.02 Score=53.26 Aligned_cols=28 Identities=32% Similarity=0.442 Sum_probs=24.6
Q ss_pred CCCceeecccchhhccChHHHHHHHhhc
Q psy12165 163 FGESVHICHLCQKQFKQNWLLQKHKKTC 190 (537)
Q Consensus 163 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h 190 (537)
.+..++.|..|.+.|.+......|+..-
T Consensus 64 ~~~~~~~c~~c~k~~~s~~a~~~hl~Sk 91 (390)
T KOG2785|consen 64 EAESVVYCEACNKSFASPKAHENHLKSK 91 (390)
T ss_pred hcccceehHHhhccccChhhHHHHHHHh
Confidence 3677899999999999999999999764
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.39 E-value=0.0056 Score=33.88 Aligned_cols=22 Identities=23% Similarity=0.703 Sum_probs=16.1
Q ss_pred cccccccccccChhHHHHHHhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAV 510 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~ 510 (537)
|-|..|++.|.+...|.+|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777754
No 58
>PRK04860 hypothetical protein; Provisional
Probab=94.21 E-value=0.03 Score=46.65 Aligned_cols=13 Identities=23% Similarity=0.529 Sum_probs=5.9
Q ss_pred CCCcccCCCCCCC
Q psy12165 423 ELPVTCDLCSKEW 435 (537)
Q Consensus 423 ~~~~~C~~C~~~f 435 (537)
+++|.|..|+..|
T Consensus 141 ~~~YrC~~C~~~l 153 (160)
T PRK04860 141 EAVYRCRRCGETL 153 (160)
T ss_pred CccEECCCCCcee
Confidence 3344444444444
No 59
>PRK04860 hypothetical protein; Provisional
Probab=93.25 E-value=0.035 Score=46.22 Aligned_cols=39 Identities=21% Similarity=0.378 Sum_probs=26.2
Q ss_pred CCcccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCH
Q psy12165 424 LPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETY 467 (537)
Q Consensus 424 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~ 467 (537)
-+|.|. |+. ....+..|. +.|+++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~-ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHN-RVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHH-HHhcCCccEECCCCCceeEEe
Confidence 357776 765 556667775 677777777777777666543
No 60
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.25 E-value=0.041 Score=35.18 Aligned_cols=33 Identities=30% Similarity=0.575 Sum_probs=29.4
Q ss_pred CCCccccccccccccccccchHHHHHhHHhhCc
Q psy12165 3 HGHKEEAALSCGNCLKVFRDNLSLLKHVEEEHP 35 (537)
Q Consensus 3 ~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~ 35 (537)
.++.+|.-++||-|+..|.+..++.+|+...|.
T Consensus 10 ~~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 10 RDRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred eccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 457789999999999999999999999987763
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.03 E-value=0.076 Score=28.62 Aligned_cols=21 Identities=29% Similarity=0.453 Sum_probs=18.1
Q ss_pred cccccccccccchHHHHHhHHh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEE 32 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~ 32 (537)
..|++|++.| ....|.+|+..
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 5799999999 67999999864
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.03 E-value=0.095 Score=28.24 Aligned_cols=21 Identities=33% Similarity=0.752 Sum_probs=14.1
Q ss_pred cccccccccccChhHHHHHHhh
Q psy12165 489 PTCEVCGRYHNARPKLESHLAV 510 (537)
Q Consensus 489 ~~C~~C~~~f~~~~~l~~H~~~ 510 (537)
..|+.|++.| ....|..|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4577777777 56667777654
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.74 E-value=0.07 Score=31.58 Aligned_cols=25 Identities=24% Similarity=0.543 Sum_probs=22.1
Q ss_pred cccccccccccccchHHHHHhHHhh
Q psy12165 9 AALSCGNCLKVFRDNLSLLKHVEEE 33 (537)
Q Consensus 9 ~~~~C~~C~~~f~~~~~L~~H~~~~ 33 (537)
.+|.|.+|+..|.+...+..|++..
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~gk 26 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLKGK 26 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHChH
Confidence 4689999999999999999998753
No 64
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.22 E-value=0.094 Score=31.00 Aligned_cols=22 Identities=23% Similarity=0.475 Sum_probs=19.1
Q ss_pred CcccccccccccChhHHHHHHh
Q psy12165 488 NPTCEVCGRYHNARPKLESHLA 509 (537)
Q Consensus 488 ~~~C~~C~~~f~~~~~l~~H~~ 509 (537)
+|.|++|+..|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 5889999999999999999975
No 65
>KOG4173|consensus
Probab=92.15 E-value=0.087 Score=44.30 Aligned_cols=91 Identities=22% Similarity=0.496 Sum_probs=63.0
Q ss_pred CCCcccccccccccccCHHHHHHHHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCCCCccccCCcccccCCHH
Q psy12165 263 NEEYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVEDNLS 342 (537)
Q Consensus 263 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~~~~ 342 (537)
...|.|.+.+|-..|.+......|....++.. |..|.+.|++...|-.|+...|..-
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~~s-------Cs~C~r~~Pt~hLLd~HI~E~HDs~---------------- 133 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHGNS-------CSFCKRAFPTGHLLDAHILEWHDSL---------------- 133 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhcccch-------hHHHHHhCCchhhhhHHHHHHHHHH----------------
Confidence 34588998999999999988888887766553 9999999999999999998666420
Q ss_pred HHHHHHHhhccccCCCcccccc--ccccccChhHHHHhHh-hh
Q psy12165 343 ALEAHVAAKHKAIIKKPYACRI--CARLLGNGVKVSEHIK-LH 382 (537)
Q Consensus 343 ~l~~H~~~~h~~~~~~~~~C~~--C~~~f~~~~~l~~H~~-~H 382 (537)
++.- +.. +...|.|-+ |+..|.+...-..|+- .|
T Consensus 134 -Fqa~---veR--G~dMy~ClvEgCt~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 134 -FQAL---VER--GQDMYQCLVEGCTEKFKTSRDRKDHMIRMH 170 (253)
T ss_pred -HHHH---HHc--CccHHHHHHHhhhhhhhhhhhhhhHHHHhc
Confidence 0000 011 135566643 7777776666666654 44
No 66
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=89.99 E-value=0.56 Score=36.58 Aligned_cols=25 Identities=20% Similarity=0.479 Sum_probs=20.1
Q ss_pred ccc----ccccccccChhHHHHHHhhhhC
Q psy12165 489 PTC----EVCGRYHNARPKLESHLAVHVG 513 (537)
Q Consensus 489 ~~C----~~C~~~f~~~~~l~~H~~~h~g 513 (537)
|.| ..|++.+.+...+.+|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 778 8888888888888888887775
No 67
>KOG2893|consensus
Probab=89.74 E-value=0.12 Score=44.59 Aligned_cols=46 Identities=30% Similarity=0.578 Sum_probs=24.2
Q ss_pred cCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCHHHHHHHHHhhC
Q psy12165 428 CDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYEALALHLDQAH 478 (537)
Q Consensus 428 C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 478 (537)
|=+|++.|.....|.+|.+..| |+|.+|.+..-+--.|.-|..++|
T Consensus 13 cwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceeehhhhh
Confidence 5555555555555555553333 555555555444444555544444
No 68
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.86 E-value=0.25 Score=28.75 Aligned_cols=25 Identities=28% Similarity=0.788 Sum_probs=19.1
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCC
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSK 133 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 133 (537)
.|+|.+||+.+.... .++.|++|+.
T Consensus 1 ~~~C~~CGy~y~~~~-------------~~~~CP~Cg~ 25 (33)
T cd00350 1 KYVCPVCGYIYDGEE-------------APWVCPVCGA 25 (33)
T ss_pred CEECCCCCCEECCCc-------------CCCcCcCCCC
Confidence 378999998775442 5789999976
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=87.95 E-value=0.8 Score=35.69 Aligned_cols=27 Identities=22% Similarity=0.513 Sum_probs=24.3
Q ss_pred Ccccc----ccchhccCCHHHHHHHHHhccC
Q psy12165 207 GIFRC----KKCVKVFVNPDIYKKHLKHSHT 233 (537)
Q Consensus 207 ~~~~C----~~C~~~f~~~~~l~~H~~~~h~ 233 (537)
..|.| ..|+..+.+...+..|++..|+
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 46999 9999999999999999998774
No 70
>KOG2893|consensus
Probab=87.27 E-value=0.15 Score=43.98 Aligned_cols=48 Identities=25% Similarity=0.564 Sum_probs=41.7
Q ss_pred ecccchhhccChHHHHHHHhhcCCCCCCCCCCCCCCCCCccccccchhccCCHHHHHHHHHhccC
Q psy12165 169 ICHLCQKQFKQNWLLQKHKKTCKGPKRPTSPLPEPDEDGIFRCKKCVKVFVNPDIYKKHLKHSHT 233 (537)
Q Consensus 169 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 233 (537)
.|.+|+..|.....|.+|++..| |+|-+|.+...+.-.|..|...+|.
T Consensus 12 wcwycnrefddekiliqhqkakh-----------------fkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH-----------------FKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc-----------------ceeeeehhhhccCCCceeehhhhhh
Confidence 59999999999999999987654 9999999998888889888776664
No 71
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=86.35 E-value=0.17 Score=51.12 Aligned_cols=144 Identities=21% Similarity=0.322 Sum_probs=89.8
Q ss_pred CccccccccccccChhHHHHhHh--hh-cC--Cccccc--cccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCC
Q psy12165 358 KPYACRICARLLGNGVKVSEHIK--LH-GE--HLVNCR--VCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDL 430 (537)
Q Consensus 358 ~~~~C~~C~~~f~~~~~l~~H~~--~H-~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~ 430 (537)
.++.|..|...|.....|..|.+ .| ++ +++.|+ .|++.|.....+..|..+|.+.. ++.+..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~ 356 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSIS-----------PAKEKL 356 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCC-----------cccccc
Confidence 46889999999999999999998 68 77 899999 79999999999999999988853 233332
Q ss_pred --CCCCCCChHHH----HHHHHhhcCCCCcccccc--cccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChh
Q psy12165 431 --CSKEWPSRKFM----WQHLIRSHKAVAGLACGV--CLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARP 502 (537)
Q Consensus 431 --C~~~f~~~~~l----~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~ 502 (537)
+...+.....- .......-.....+.+.. |...+.....+..|.. .|.. .....+.+..|...|...-
T Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~ 432 (467)
T COG5048 357 LNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHII-THLS---FRPYNCKNPPCSKSFNRHY 432 (467)
T ss_pred ccCccccccccCCCCccchhhccCccCCccccccccchhhhhccccccccccc-cccc---cCCcCCCCCcchhhccCcc
Confidence 22222211110 001001111222333322 5555555666666654 3433 1123566777788888777
Q ss_pred HHHHHHhhhhCCCc
Q psy12165 503 KLESHLAVHVGLDK 516 (537)
Q Consensus 503 ~l~~H~~~h~g~~~ 516 (537)
.|..|++.|.....
T Consensus 433 ~~~~~~~~~~~~~~ 446 (467)
T COG5048 433 NLIPHKKIHTNHAP 446 (467)
T ss_pred cccccccccccCCc
Confidence 77777777776554
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=86.17 E-value=0.47 Score=30.54 Aligned_cols=30 Identities=23% Similarity=0.392 Sum_probs=26.6
Q ss_pred CCCceeecccchhhccChHHHHHHHhhcCC
Q psy12165 163 FGESVHICHLCQKQFKQNWLLQKHKKTCKG 192 (537)
Q Consensus 163 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 192 (537)
.|+..+.|+-|+..|.....+.+|+...|+
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 378889999999999999999999976655
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=83.79 E-value=3.6 Score=31.73 Aligned_cols=21 Identities=19% Similarity=0.325 Sum_probs=11.0
Q ss_pred cccccccccccCCHHHHHHHH
Q psy12165 454 GLACGVCLKISETYEALALHL 474 (537)
Q Consensus 454 ~~~C~~C~~~f~~~~~l~~H~ 474 (537)
.|.|+.|...|--.=++-.|.
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe 101 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHE 101 (112)
T ss_pred ceeCCCCCCccccccchhhhh
Confidence 455555555555555555554
No 74
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=83.38 E-value=2.6 Score=32.42 Aligned_cols=21 Identities=14% Similarity=0.324 Sum_probs=13.4
Q ss_pred CcccCCCCCCCCChHHHHHHH
Q psy12165 425 PVTCDLCSKEWPSRKFMWQHL 445 (537)
Q Consensus 425 ~~~C~~C~~~f~~~~~l~~H~ 445 (537)
.|+|+.|...|-..-++..|.
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe 101 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHE 101 (112)
T ss_pred ceeCCCCCCccccccchhhhh
Confidence 466666666666666666664
No 75
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=80.57 E-value=0.45 Score=48.00 Aligned_cols=61 Identities=18% Similarity=0.246 Sum_probs=34.0
Q ss_pred CcccccccccccccCHHHHHHHHHhccccCCCCCCcccccchhcccCcHHHHHHHhhcCCCCCc
Q psy12165 265 EYKCLIPGCKKYFRTKLDCQKHCREHKTVTSRCPTNYCNFCADLFTNRKKLWLHLKSTHKDLTN 328 (537)
Q Consensus 265 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~ 328 (537)
.+.+....|...+.....+..|...+..... ..+.+..|...+.....+..|++ .|....+
T Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 446 (467)
T COG5048 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRP--YNCKNPPCSKSFNRHYNLIPHKK-IHTNHAP 446 (467)
T ss_pred cccccccchhhhhccccccccccccccccCC--cCCCCCcchhhccCccccccccc-ccccCCc
Confidence 3444444455556555555555554443321 14566777777777777777776 4444333
No 76
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=79.20 E-value=1.3 Score=26.48 Aligned_cols=34 Identities=21% Similarity=0.518 Sum_probs=24.5
Q ss_pred cccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCccc
Q psy12165 97 YVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSS 136 (537)
Q Consensus 97 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 136 (537)
..||.|+..|.-..+ +.-.+....+|+.|+..|.
T Consensus 3 i~CP~C~~~f~v~~~------~l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDD------KLPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred EECCCCCceEEcCHH------HcccCCcEEECCCCCcEee
Confidence 469999999876653 2234455789999998874
No 77
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=77.61 E-value=1.8 Score=25.76 Aligned_cols=34 Identities=21% Similarity=0.455 Sum_probs=24.6
Q ss_pred cccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCccc
Q psy12165 97 YVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSS 136 (537)
Q Consensus 97 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 136 (537)
..|+.|+..|.-... .+-.+....+|+.|+..|.
T Consensus 3 i~Cp~C~~~y~i~d~------~ip~~g~~v~C~~C~~~f~ 36 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDE------KIPPKGRKVRCSKCGHVFF 36 (36)
T ss_pred EECCCCCCEEeCCHH------HCCCCCcEEECCCCCCEeC
Confidence 469999999876653 2334455789999998773
No 78
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=77.35 E-value=0.82 Score=40.29 Aligned_cols=48 Identities=19% Similarity=0.486 Sum_probs=27.4
Q ss_pred CccccccccccccChhHHHHhHhhh-----cCCccccccccccCCChHHHHHHHHhcc
Q psy12165 358 KPYACRICARLLGNGVKVSEHIKLH-----GEHLVNCRVCGSIFHSEPELKVHLEEHT 410 (537)
Q Consensus 358 ~~~~C~~C~~~f~~~~~l~~H~~~H-----~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 410 (537)
..|.|-.|...|-. .|++.- ..++++|+.||+-......|..-.++|.
T Consensus 181 Gq~sCLRCK~cfCd-----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 181 GQYSCLRCKICFCD-----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred cchhhhheeeeehh-----hhhhhcccccccCCCCCCCCCCCcccccccceeeeecch
Confidence 34555555555532 343322 3467777777777766666666555553
No 79
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=77.07 E-value=1.4 Score=25.83 Aligned_cols=9 Identities=22% Similarity=1.143 Sum_probs=3.6
Q ss_pred ccccccccc
Q psy12165 360 YACRICARL 368 (537)
Q Consensus 360 ~~C~~C~~~ 368 (537)
|.|.+||..
T Consensus 3 ~~C~~CG~i 11 (34)
T cd00729 3 WVCPVCGYI 11 (34)
T ss_pred EECCCCCCE
Confidence 334444433
No 80
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=75.70 E-value=2 Score=26.99 Aligned_cols=28 Identities=25% Similarity=0.564 Sum_probs=16.7
Q ss_pred cccccccccccccccch----HHHHHhHHhhC
Q psy12165 7 EEAALSCGNCLKVFRDN----LSLLKHVEEEH 34 (537)
Q Consensus 7 ~~~~~~C~~C~~~f~~~----~~L~~H~~~~h 34 (537)
++...+|.+|++.+... ..|.+|++..|
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 35567888888887764 77888885544
No 81
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=74.13 E-value=1.9 Score=25.98 Aligned_cols=34 Identities=18% Similarity=0.344 Sum_probs=23.5
Q ss_pred cccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCccc
Q psy12165 97 YVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSS 136 (537)
Q Consensus 97 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 136 (537)
+.|+.|+..|.-.... .......+.|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~------~~~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQ------LGANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHH------cCCCCCEEECCCCCCEEE
Confidence 5799999988766532 222233689999998775
No 82
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=73.89 E-value=2.9 Score=27.69 Aligned_cols=9 Identities=33% Similarity=0.873 Sum_probs=5.6
Q ss_pred CcccCCCCC
Q psy12165 425 PVTCDLCSK 433 (537)
Q Consensus 425 ~~~C~~C~~ 433 (537)
+|+|+.||+
T Consensus 50 ~Y~Cp~CGF 58 (61)
T COG2888 50 PYRCPKCGF 58 (61)
T ss_pred ceECCCcCc
Confidence 566666664
No 83
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=72.55 E-value=2.2 Score=32.92 Aligned_cols=12 Identities=42% Similarity=1.281 Sum_probs=6.2
Q ss_pred CcccCCCCCCCC
Q psy12165 425 PVTCDLCSKEWP 436 (537)
Q Consensus 425 ~~~C~~C~~~f~ 436 (537)
|-.|++||..|.
T Consensus 26 PivCP~CG~~~~ 37 (108)
T PF09538_consen 26 PIVCPKCGTEFP 37 (108)
T ss_pred CccCCCCCCccC
Confidence 445555555553
No 84
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=71.70 E-value=2 Score=33.09 Aligned_cols=30 Identities=20% Similarity=0.463 Sum_probs=25.7
Q ss_pred ccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchhhccCh
Q psy12165 126 VTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQKQFKQN 180 (537)
Q Consensus 126 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~ 180 (537)
..|+.||.+|... +..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-------------------------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-------------------------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-------------------------CCCCccCCCCCCccCcc
Confidence 5799999999864 56789999999999776
No 85
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=71.70 E-value=1.1 Score=39.71 Aligned_cols=14 Identities=21% Similarity=0.463 Sum_probs=11.0
Q ss_pred eeecccchhhccCh
Q psy12165 167 VHICHLCQKQFKQN 180 (537)
Q Consensus 167 ~~~C~~C~~~f~~~ 180 (537)
...|+.||++|...
T Consensus 48 V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 48 VWVCPHCGYAAFEE 61 (214)
T ss_pred EEECCCCCCccccc
Confidence 36899999987654
No 86
>PF12907 zf-met2: Zinc-binding
Probab=71.48 E-value=2.1 Score=26.07 Aligned_cols=32 Identities=19% Similarity=0.560 Sum_probs=22.9
Q ss_pred cccCCCcCcC---CChHHHHHHHHhhcCCCCCccc
Q psy12165 97 YVCDLCPKYY---SRSTHLWKHVNRVHKGHPDVTC 128 (537)
Q Consensus 97 ~~C~~C~~~f---~~~~~l~~H~~~~H~~~~~~~C 128 (537)
++|.+|..+| .+...|..|....|++...-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 5788898554 5667899998888887544344
No 87
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=71.19 E-value=1.2 Score=39.57 Aligned_cols=23 Identities=22% Similarity=0.548 Sum_probs=12.6
Q ss_pred CccccccccccCCChHHHHHHHH
Q psy12165 385 HLVNCRVCGSIFHSEPELKVHLE 407 (537)
Q Consensus 385 ~~~~C~~C~~~f~~~~~L~~H~~ 407 (537)
+.+.|++|+..|.+...+....+
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCce
Confidence 34556666666666554444443
No 88
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=69.46 E-value=4 Score=22.20 Aligned_cols=21 Identities=14% Similarity=0.474 Sum_probs=17.2
Q ss_pred cccccccccccchHHHHHhHHh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEE 32 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~ 32 (537)
..||+|++.+ ....+.+|+..
T Consensus 2 v~CPiC~~~v-~~~~in~HLD~ 22 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLDS 22 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHHH
Confidence 3699999999 66888899864
No 89
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=68.72 E-value=3 Score=22.67 Aligned_cols=9 Identities=22% Similarity=0.530 Sum_probs=4.4
Q ss_pred ccccccccc
Q psy12165 361 ACRICARLL 369 (537)
Q Consensus 361 ~C~~C~~~f 369 (537)
.|+.||..|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 455555444
No 90
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=68.54 E-value=2.8 Score=35.07 Aligned_cols=11 Identities=18% Similarity=0.700 Sum_probs=5.6
Q ss_pred ccccccccccc
Q psy12165 359 PYACRICARLL 369 (537)
Q Consensus 359 ~~~C~~C~~~f 369 (537)
.|.|++||.++
T Consensus 134 ~~vC~vCGy~~ 144 (166)
T COG1592 134 VWVCPVCGYTH 144 (166)
T ss_pred EEEcCCCCCcc
Confidence 45555555444
No 91
>KOG2186|consensus
Probab=68.42 E-value=2.9 Score=36.98 Aligned_cols=46 Identities=24% Similarity=0.416 Sum_probs=22.7
Q ss_pred cccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCHHHHHHHHH
Q psy12165 426 VTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYEALALHLD 475 (537)
Q Consensus 426 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 475 (537)
|.|..||... -+..|.+|+.+-+. .-|.|--|+..|.. .++..|..
T Consensus 4 FtCnvCgEsv-KKp~vekH~srCrn--~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESV-KKPQVEKHMSRCRN--AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhc-cccchHHHHHhccC--CeeEEeeccccccc-chhhhhhh
Confidence 5555555544 34444455533332 44555555555554 45555543
No 92
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=67.54 E-value=2.9 Score=26.46 Aligned_cols=11 Identities=27% Similarity=0.700 Sum_probs=6.6
Q ss_pred CcccCCCCCCC
Q psy12165 425 PVTCDLCSKEW 435 (537)
Q Consensus 425 ~~~C~~C~~~f 435 (537)
.|+|+.||..|
T Consensus 3 ~y~C~~CG~~~ 13 (46)
T PRK00398 3 EYKCARCGREV 13 (46)
T ss_pred EEECCCCCCEE
Confidence 36666666655
No 93
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=67.34 E-value=3.8 Score=26.53 Aligned_cols=24 Identities=29% Similarity=0.642 Sum_probs=14.4
Q ss_pred cccccccccccCh-----hHHHHHHh-hhh
Q psy12165 489 PTCEVCGRYHNAR-----PKLESHLA-VHV 512 (537)
Q Consensus 489 ~~C~~C~~~f~~~-----~~l~~H~~-~h~ 512 (537)
-.|.+|++.+... +.|.+|++ .|.
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 3566677666554 46666666 443
No 94
>KOG2186|consensus
Probab=66.74 E-value=3.1 Score=36.75 Aligned_cols=46 Identities=17% Similarity=0.413 Sum_probs=38.9
Q ss_pred ccccccccccCCHHHHHHHHHhhCCCCcccccCCcccccccccccChhHHHHHHh
Q psy12165 455 LACGVCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHNARPKLESHLA 509 (537)
Q Consensus 455 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 509 (537)
|.|..||.+.. +..|.+|+.+-+ ...|.|.-|+..|.. ..+..|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCr-------n~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCR-------NAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhcc-------CCeeEEeeccccccc-chhhhhhh
Confidence 78999999876 677888998655 268999999999988 88999965
No 95
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=65.32 E-value=3.6 Score=27.36 Aligned_cols=10 Identities=20% Similarity=0.358 Sum_probs=4.8
Q ss_pred Cccccccccc
Q psy12165 453 AGLACGVCLK 462 (537)
Q Consensus 453 ~~~~C~~C~~ 462 (537)
.+|.|+.||+
T Consensus 47 ~~Y~CP~CGF 56 (59)
T PRK14890 47 NPYTCPKCGF 56 (59)
T ss_pred CceECCCCCC
Confidence 4455555543
No 96
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=63.41 E-value=4.8 Score=25.23 Aligned_cols=27 Identities=26% Similarity=0.624 Sum_probs=20.2
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCc
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKV 134 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 134 (537)
.|.|..||..|... ...+.+|+.||..
T Consensus 2 ~Y~C~~Cg~~~~~~------------~~~~irC~~CG~r 28 (44)
T smart00659 2 IYICGECGRENEIK------------SKDVVRCRECGYR 28 (44)
T ss_pred EEECCCCCCEeecC------------CCCceECCCCCce
Confidence 48899999887633 2356899999864
No 97
>PF12907 zf-met2: Zinc-binding
Probab=63.35 E-value=4.1 Score=24.83 Aligned_cols=13 Identities=31% Similarity=0.483 Sum_probs=5.6
Q ss_pred HHHHHHHHHhhCC
Q psy12165 467 YEALALHLDQAHP 479 (537)
Q Consensus 467 ~~~l~~H~~~~H~ 479 (537)
...|..|....|+
T Consensus 17 ~~~L~eH~enKHp 29 (40)
T PF12907_consen 17 EPQLKEHAENKHP 29 (40)
T ss_pred HHHHHHHHHccCC
Confidence 3444444444443
No 98
>PHA00626 hypothetical protein
Probab=62.98 E-value=2 Score=27.94 Aligned_cols=11 Identities=18% Similarity=0.244 Sum_probs=5.3
Q ss_pred CcccCCCCCCC
Q psy12165 425 PVTCDLCSKEW 435 (537)
Q Consensus 425 ~~~C~~C~~~f 435 (537)
.|.|+.||+.|
T Consensus 23 rYkCkdCGY~f 33 (59)
T PHA00626 23 DYVCCDCGYND 33 (59)
T ss_pred ceEcCCCCCee
Confidence 34444444444
No 99
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=62.57 E-value=3.4 Score=26.42 Aligned_cols=10 Identities=30% Similarity=0.796 Sum_probs=5.4
Q ss_pred cccccccccC
Q psy12165 387 VNCRVCGSIF 396 (537)
Q Consensus 387 ~~C~~C~~~f 396 (537)
|.|..||..|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 5555555554
No 100
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.97 E-value=4.1 Score=34.09 Aligned_cols=24 Identities=25% Similarity=0.754 Sum_probs=21.0
Q ss_pred CccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchh
Q psy12165 125 DVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQK 175 (537)
Q Consensus 125 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~ 175 (537)
.|.|++||+.+. ++.|-+||+|+.
T Consensus 134 ~~vC~vCGy~~~---------------------------ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTHE---------------------------GEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCccc---------------------------CCCCCcCCCCCC
Confidence 699999998876 688999999984
No 101
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=60.01 E-value=4.9 Score=33.69 Aligned_cols=31 Identities=13% Similarity=0.356 Sum_probs=25.5
Q ss_pred CccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCcc
Q psy12165 95 AMYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVS 135 (537)
Q Consensus 95 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 135 (537)
.-|.|+.|+..|+...++. ..|.|+.||...
T Consensus 108 ~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 108 MFFICPNMCVRFTFNEAME----------LNFTCPRCGAML 138 (158)
T ss_pred CeEECCCCCcEeeHHHHHH----------cCCcCCCCCCEe
Confidence 4699999999999888774 269999998753
No 102
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=59.57 E-value=3.2 Score=25.69 Aligned_cols=30 Identities=20% Similarity=0.562 Sum_probs=21.3
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCC
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSK 133 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 133 (537)
.|+|..||..|..... ... .....|+.|+.
T Consensus 5 ey~C~~Cg~~fe~~~~-------~~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 5 EYRCEECGHEFEVLQS-------ISE-DDPVPCPECGS 34 (42)
T ss_pred EEEeCCCCCEEEEEEE-------cCC-CCCCcCCCCCC
Confidence 4789999998865532 222 45678999987
No 103
>KOG2807|consensus
Probab=59.17 E-value=17 Score=33.70 Aligned_cols=35 Identities=23% Similarity=0.510 Sum_probs=19.9
Q ss_pred CCccccccccccccChhHHHHhHhhhcCCccccccccccCCChHHHHH
Q psy12165 357 KKPYACRICARLLGNGVKVSEHIKLHGEHLVNCRVCGSIFHSEPELKV 404 (537)
Q Consensus 357 ~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~~f~~~~~L~~ 404 (537)
..-|-|+.|....- .-|..|++|+........|.+
T Consensus 274 ~~Gy~CP~CkakvC-------------sLP~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 274 GGGYFCPQCKAKVC-------------SLPIECPICSLTLVSSPHLAR 308 (378)
T ss_pred cCceeCCcccCeee-------------cCCccCCccceeEecchHHHH
Confidence 34566666654432 225567777766666665555
No 104
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=56.56 E-value=7.3 Score=32.24 Aligned_cols=36 Identities=22% Similarity=0.373 Sum_probs=25.6
Q ss_pred CccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCcc
Q psy12165 95 AMYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVS 135 (537)
Q Consensus 95 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f 135 (537)
..|.||.|+..|.....+.. ... ...|.|+.||...
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~----~d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQL----LDM-DGTFTCPRCGEEL 133 (147)
T ss_pred cEEECcCCCCEeeHHHHHHh----cCC-CCcEECCCCCCEE
Confidence 47999999999986554422 122 3459999999865
No 105
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=56.49 E-value=5.3 Score=34.23 Aligned_cols=32 Identities=13% Similarity=0.421 Sum_probs=25.4
Q ss_pred CccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCccc
Q psy12165 95 AMYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSS 136 (537)
Q Consensus 95 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 136 (537)
.-|.|+.|+..|+...++. ..|.|+.||....
T Consensus 116 ~~Y~Cp~C~~rytf~eA~~----------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAME----------YGFRCPQCGEMLE 147 (178)
T ss_pred CEEECCCCCcEEeHHHHhh----------cCCcCCCCCCCCe
Confidence 4799999999998877652 2699999987544
No 106
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=55.63 E-value=6.5 Score=22.63 Aligned_cols=9 Identities=33% Similarity=1.147 Sum_probs=4.5
Q ss_pred CcccCCCCC
Q psy12165 425 PVTCDLCSK 433 (537)
Q Consensus 425 ~~~C~~C~~ 433 (537)
+.+|+.||.
T Consensus 17 ~irC~~CG~ 25 (32)
T PF03604_consen 17 PIRCPECGH 25 (32)
T ss_dssp TSSBSSSS-
T ss_pred cEECCcCCC
Confidence 455666654
No 107
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=54.85 E-value=7.7 Score=30.49 Aligned_cols=11 Identities=9% Similarity=0.410 Sum_probs=5.3
Q ss_pred CcccCCCCCCC
Q psy12165 425 PVTCDLCSKEW 435 (537)
Q Consensus 425 ~~~C~~C~~~f 435 (537)
|..|++||..|
T Consensus 26 p~vcP~cg~~~ 36 (129)
T TIGR02300 26 PAVSPYTGEQF 36 (129)
T ss_pred CccCCCcCCcc
Confidence 44444454444
No 108
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=54.73 E-value=5 Score=35.62 Aligned_cols=48 Identities=21% Similarity=0.413 Sum_probs=28.8
Q ss_pred ccccCCCcCcCCChHHHHHHHHhh---cCCCCCccccccCCcccChHHHHHHHHhc
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRV---HKGHPDVTCRICSKVSSSRGHLEKHLIAH 148 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~---H~~~~~~~C~~C~~~f~~~~~l~~H~~~h 148 (537)
.|.|--|.-.|-. .|+++. ....+++.|+.|++.......|..-.+.|
T Consensus 182 q~sCLRCK~cfCd-----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 182 QYSCLRCKICFCD-----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred chhhhheeeeehh-----hhhhhcccccccCCCCCCCCCCCcccccccceeeeecc
Confidence 4555555555533 344332 23347899999999888777665554444
No 109
>KOG1280|consensus
Probab=54.42 E-value=12 Score=35.01 Aligned_cols=37 Identities=19% Similarity=0.516 Sum_probs=29.0
Q ss_pred CcccccchhcccCcHHHHHHHhhcCCCCC-ccccCCcc
Q psy12165 299 TNYCNFCADLFTNRKKLWLHLKSTHKDLT-NCMCSACF 335 (537)
Q Consensus 299 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~-~~~C~~C~ 335 (537)
.|.|++|+.+=.+...|..|+...|.+.. ...|++|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 78899999988888899999888887644 34566664
No 110
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=51.48 E-value=7.8 Score=28.21 Aligned_cols=14 Identities=29% Similarity=0.759 Sum_probs=9.1
Q ss_pred CccccccccccCCC
Q psy12165 385 HLVNCRVCGSIFHS 398 (537)
Q Consensus 385 ~~~~C~~C~~~f~~ 398 (537)
.|-.|..||+.|..
T Consensus 57 ~Pa~CkkCGfef~~ 70 (97)
T COG3357 57 RPARCKKCGFEFRD 70 (97)
T ss_pred cChhhcccCccccc
Confidence 35567777777754
No 111
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=50.89 E-value=9.6 Score=31.54 Aligned_cols=37 Identities=22% Similarity=0.425 Sum_probs=21.2
Q ss_pred CCccccccccccCCChHHHHHHHHhccCCCCCccCCCCCCCCcccCCCCCCC
Q psy12165 384 EHLVNCRVCGSIFHSEPELKVHLEEHTDVPETKEQKPRPELPVTCDLCSKEW 435 (537)
Q Consensus 384 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~C~~C~~~f 435 (537)
..-|.|+.|+..|.....+.. .. . ...|.||.||...
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea~~~-~d---~-----------~~~f~Cp~Cg~~l 133 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEANQL-LD---M-----------DGTFTCPRCGEEL 133 (147)
T ss_pred CcEEECcCCCCEeeHHHHHHh-cC---C-----------CCcEECCCCCCEE
Confidence 345778888877764433222 00 0 1248888888664
No 112
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=50.86 E-value=8.4 Score=30.29 Aligned_cols=33 Identities=12% Similarity=0.055 Sum_probs=26.5
Q ss_pred ccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchhhccChHHH
Q psy12165 126 VTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQKQFKQNWLL 183 (537)
Q Consensus 126 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l 183 (537)
..|+.|+++|... +..|..|++||..|.-...+
T Consensus 10 r~Cp~cg~kFYDL-------------------------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 10 RICPNTGSKFYDL-------------------------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred ccCCCcCcccccc-------------------------CCCCccCCCcCCccCcchhh
Confidence 5799999999853 57889999999998766433
No 113
>KOG4377|consensus
Probab=50.69 E-value=6.6 Score=37.42 Aligned_cols=142 Identities=15% Similarity=0.249 Sum_probs=78.3
Q ss_pred ccc--cccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeeccc--chhhccChHHHHHHHhhcCCCCCCCCCCC
Q psy12165 126 VTC--RICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHL--CQKQFKQNWLLQKHKKTCKGPKRPTSPLP 201 (537)
Q Consensus 126 ~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~ 201 (537)
|.| +.|++.+-.+..+.+|+..|...+..+....... ...|-|.. |++ +-+....|-..|......
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rf--s~syhC~~~~C~k---sTsdV~~h~nFht~~~n~----- 341 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRF--SNSYHCTGQICEK---STSDVLLHDNFHTDKRNN----- 341 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhc--CccchhhhcccCc---ccccccccCccccccccC-----
Confidence 555 4588877779999999999875544433332221 22377764 888 555666676666322211
Q ss_pred CCCCCCccccccch--hccCCHHHHHHHHHhccC----------ccccccccCCCCcccHHHHhhhhhhccCCCCCcccc
Q psy12165 202 EPDEDGIFRCKKCV--KVFVNPDIYKKHLKHSHT----------KAYCEVCITAPGYETKVELLHHMKTEHKDNEEYKCL 269 (537)
Q Consensus 202 ~~~~~~~~~C~~C~--~~f~~~~~l~~H~~~~h~----------~~~c~~c~~~~~~~~~~~l~~H~~~~h~~~~~~~C~ 269 (537)
.....-|.|..|+ ..|.....-..|.+.+.. ...|.+-++. +. ..-.-+-|.
T Consensus 342 -GfrrthfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCk--ys-------------~~cnhfhc~ 405 (480)
T KOG4377|consen 342 -GFRRTHFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCK--YS-------------GICNHFHCD 405 (480)
T ss_pred -ceecceeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCccCcc--cc-------------cceeeeeec
Confidence 0111247787766 555532222222221111 1123332222 11 111236788
Q ss_pred cccccccccCHHHHHHHHHhcccc
Q psy12165 270 IPGCKKYFRTKLDCQKHCREHKTV 293 (537)
Q Consensus 270 ~~~C~~~f~~~~~l~~H~~~h~~~ 293 (537)
.++|+..+.+.+.+..|.+.|...
T Consensus 406 r~Gc~~tl~s~sqm~shkrkheRq 429 (480)
T KOG4377|consen 406 RLGCEATLYSVSQMASHKRKHERQ 429 (480)
T ss_pred ccCCceEEEehhhhhhhhhhhhhh
Confidence 889999999999999988877544
No 114
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=50.46 E-value=8.7 Score=32.91 Aligned_cols=17 Identities=18% Similarity=0.436 Sum_probs=9.8
Q ss_pred ccccccccccCCChHHH
Q psy12165 386 LVNCRVCGSIFHSEPEL 402 (537)
Q Consensus 386 ~~~C~~C~~~f~~~~~L 402 (537)
-|.|+.|+..|+.-..+
T Consensus 117 ~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAM 133 (178)
T ss_pred EEECCCCCcEEeHHHHh
Confidence 46666666666555443
No 115
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=50.39 E-value=11 Score=33.21 Aligned_cols=25 Identities=28% Similarity=0.485 Sum_probs=12.1
Q ss_pred ccccccccccCCHHHHHHHHHhhCC
Q psy12165 455 LACGVCLKISETYEALALHLDQAHP 479 (537)
Q Consensus 455 ~~C~~C~~~f~~~~~l~~H~~~~H~ 479 (537)
|.|.+|++.|....-++.|+...|.
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred ECCCCCCcccCChHHHHHHHhhcCH
Confidence 5555555555555555555555554
No 116
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=50.38 E-value=8 Score=35.27 Aligned_cols=47 Identities=13% Similarity=0.304 Sum_probs=24.1
Q ss_pred cccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccCCHHHHHHHHH
Q psy12165 426 VTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISETYEALALHLD 475 (537)
Q Consensus 426 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 475 (537)
--|-.|.-.|+.... |-...-+....|+|+.|...|-..-+...|..
T Consensus 363 ~~Cf~CQ~~fp~~~~---~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~ 409 (421)
T COG5151 363 THCFVCQGPFPKPPV---SPFDESTSSGRYQCELCKSTFCSDCDVFIHET 409 (421)
T ss_pred ccceeccCCCCCCCC---CcccccccccceechhhhhhhhhhhHHHHHHH
Confidence 346666666643311 11111122345777777777766666666654
No 117
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=50.20 E-value=3.7 Score=25.08 Aligned_cols=10 Identities=40% Similarity=1.082 Sum_probs=4.9
Q ss_pred cccccccccC
Q psy12165 387 VNCRVCGSIF 396 (537)
Q Consensus 387 ~~C~~C~~~f 396 (537)
|.|..||..|
T Consensus 6 y~C~~Cg~~f 15 (41)
T smart00834 6 YRCEDCGHTF 15 (41)
T ss_pred EEcCCCCCEE
Confidence 4455555444
No 118
>PF15269 zf-C2H2_7: Zinc-finger
Probab=49.99 E-value=17 Score=22.42 Aligned_cols=22 Identities=23% Similarity=0.369 Sum_probs=18.1
Q ss_pred CcccccccccccChhHHHHHHh
Q psy12165 488 NPTCEVCGRYHNARPKLESHLA 509 (537)
Q Consensus 488 ~~~C~~C~~~f~~~~~l~~H~~ 509 (537)
.|+|-.|......++.|..||+
T Consensus 20 ~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred cceeecCCcccchHHHHHHHHH
Confidence 3688888888888888888886
No 119
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=49.61 E-value=4.7 Score=26.28 Aligned_cols=11 Identities=27% Similarity=0.887 Sum_probs=4.7
Q ss_pred ccccccccccc
Q psy12165 360 YACRICARLLG 370 (537)
Q Consensus 360 ~~C~~C~~~f~ 370 (537)
|.|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 34444444443
No 120
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=47.09 E-value=19 Score=30.19 Aligned_cols=17 Identities=12% Similarity=0.168 Sum_probs=9.1
Q ss_pred ccccccccccCCChHHH
Q psy12165 386 LVNCRVCGSIFHSEPEL 402 (537)
Q Consensus 386 ~~~C~~C~~~f~~~~~L 402 (537)
-|.|+.|+..|+....+
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 35555555555555444
No 121
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=46.34 E-value=11 Score=27.29 Aligned_cols=16 Identities=19% Similarity=0.588 Sum_probs=11.5
Q ss_pred CCceeecccchhhccC
Q psy12165 164 GESVHICHLCQKQFKQ 179 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~~ 179 (537)
+...++|..|+..|.-
T Consensus 50 a~GIW~C~kCg~~fAG 65 (89)
T COG1997 50 ATGIWKCRKCGAKFAG 65 (89)
T ss_pred ccCeEEcCCCCCeecc
Confidence 4666888888877753
No 122
>KOG1280|consensus
Probab=45.74 E-value=18 Score=33.81 Aligned_cols=37 Identities=22% Similarity=0.470 Sum_probs=30.2
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCCCCC-ccccccC
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKGHPD-VTCRICS 132 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~-~~C~~C~ 132 (537)
.|.|++|+..=-+...|..|+...|+...+ ..|++|.
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 699999999988899999999999986532 4566664
No 123
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=43.91 E-value=13 Score=32.83 Aligned_cols=29 Identities=28% Similarity=0.588 Sum_probs=22.9
Q ss_pred CCCcccCCCCCCCCChHHHHHHHHhhcCC
Q psy12165 423 ELPVTCDLCSKEWPSRKFMWQHLIRSHKA 451 (537)
Q Consensus 423 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 451 (537)
+-.|.|+.|++.|......++||...|..
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 44699999999999999999999999964
No 124
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=43.85 E-value=12 Score=30.00 Aligned_cols=33 Identities=24% Similarity=0.486 Sum_probs=20.5
Q ss_pred cccccccccccccccchHHHHHhHHhhCcccccccc
Q psy12165 7 EEAALSCGNCLKVFRDNLSLLKHVEEEHPTMQRRYS 42 (537)
Q Consensus 7 ~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~ 42 (537)
.+.-..|-+||+.|.. |.+|++.||+.....|.
T Consensus 69 ~~d~i~clecGk~~k~---LkrHL~~~~gltp~eYR 101 (132)
T PF05443_consen 69 TPDYIICLECGKKFKT---LKRHLRTHHGLTPEEYR 101 (132)
T ss_dssp -SS-EE-TBT--EESB---HHHHHHHTT-S-HHHHH
T ss_pred ccCeeEEccCCcccch---HHHHHHHccCCCHHHHH
Confidence 4556789999999876 59999999876555554
No 125
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=43.50 E-value=5.4 Score=33.07 Aligned_cols=12 Identities=25% Similarity=0.692 Sum_probs=6.1
Q ss_pred cccCCCCCCCCC
Q psy12165 426 VTCDLCSKEWPS 437 (537)
Q Consensus 426 ~~C~~C~~~f~~ 437 (537)
++|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 555555555543
No 126
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=43.43 E-value=15 Score=36.59 Aligned_cols=28 Identities=43% Similarity=0.675 Sum_probs=25.1
Q ss_pred ccccccccccccchHHHHHhHHhhCccc
Q psy12165 10 ALSCGNCLKVFRDNLSLLKHVEEEHPTM 37 (537)
Q Consensus 10 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~ 37 (537)
-+.|+.|++.|.+..++..|+...|+.+
T Consensus 57 FWiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 57 FWICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence 5789999999999999999999988654
No 127
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=42.94 E-value=15 Score=29.52 Aligned_cols=27 Identities=33% Similarity=0.607 Sum_probs=15.2
Q ss_pred CCcccccccccccChhHHHHHHhhhhCCCc
Q psy12165 487 DNPTCEVCGRYHNARPKLESHLAVHVGLDK 516 (537)
Q Consensus 487 ~~~~C~~C~~~f~~~~~l~~H~~~h~g~~~ 516 (537)
.-..|=.||+.|... ++|++.|+|..+
T Consensus 71 d~i~clecGk~~k~L---krHL~~~~gltp 97 (132)
T PF05443_consen 71 DYIICLECGKKFKTL---KRHLRTHHGLTP 97 (132)
T ss_dssp S-EE-TBT--EESBH---HHHHHHTT-S-H
T ss_pred CeeEEccCCcccchH---HHHHHHccCCCH
Confidence 345788888888653 888888887544
No 128
>KOG2593|consensus
Probab=42.30 E-value=14 Score=35.78 Aligned_cols=36 Identities=28% Similarity=0.522 Sum_probs=27.0
Q ss_pred CccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCc
Q psy12165 95 AMYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKV 134 (537)
Q Consensus 95 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 134 (537)
.-|.||.|.+.|+....++ ..-.....|.|..|+-.
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~----L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQ----LLDNETGEFHCENCGGE 162 (436)
T ss_pred ccccCCccccchhhhHHHH----hhcccCceEEEecCCCc
Confidence 4699999999998887763 33333457999999753
No 129
>KOG2807|consensus
Probab=39.86 E-value=39 Score=31.46 Aligned_cols=23 Identities=13% Similarity=0.189 Sum_probs=18.0
Q ss_pred CceeecccchhhccChHHHHHHH
Q psy12165 165 ESVHICHLCQKQFKQNWLLQKHK 187 (537)
Q Consensus 165 ~~~~~C~~C~~~f~~~~~l~~H~ 187 (537)
.-|..|+.|+.+......|.+-.
T Consensus 288 sLP~eCpiC~ltLVss~hLARSy 310 (378)
T KOG2807|consen 288 SLPIECPICSLTLVSSPHLARSY 310 (378)
T ss_pred cCCccCCccceeEecchHHHHHH
Confidence 45678999999998888887643
No 130
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=39.45 E-value=27 Score=34.77 Aligned_cols=28 Identities=29% Similarity=0.412 Sum_probs=24.6
Q ss_pred CcccccccccccCCHHHHHHHHHhhCCC
Q psy12165 453 AGLACGVCLKISETYEALALHLDQAHPG 480 (537)
Q Consensus 453 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 480 (537)
+-+.|+.|.+.|.....+..|+...|.+
T Consensus 56 rFWiCp~CskkF~d~~~~~~H~~~eH~~ 83 (466)
T PF04780_consen 56 RFWICPRCSKKFSDAESCLSHMEQEHPA 83 (466)
T ss_pred eEeeCCcccceeCCHHHHHHHHHHhhhh
Confidence 3478999999999999999999999976
No 131
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=39.19 E-value=14 Score=19.60 Aligned_cols=9 Identities=33% Similarity=0.896 Sum_probs=6.3
Q ss_pred CcccCCCCC
Q psy12165 425 PVTCDLCSK 433 (537)
Q Consensus 425 ~~~C~~C~~ 433 (537)
+|.|+.||+
T Consensus 16 ~f~CPnCG~ 24 (24)
T PF07754_consen 16 PFPCPNCGF 24 (24)
T ss_pred eEeCCCCCC
Confidence 577777764
No 132
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=38.70 E-value=21 Score=38.02 Aligned_cols=12 Identities=33% Similarity=0.930 Sum_probs=7.4
Q ss_pred CCCcccCCCCCC
Q psy12165 423 ELPVTCDLCSKE 434 (537)
Q Consensus 423 ~~~~~C~~C~~~ 434 (537)
..|..|+.||-.
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 346777777643
No 133
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=38.46 E-value=23 Score=32.53 Aligned_cols=23 Identities=22% Similarity=0.366 Sum_probs=17.6
Q ss_pred CCCcccCCCCCCCCChHHHHHHH
Q psy12165 423 ELPVTCDLCSKEWPSRKFMWQHL 445 (537)
Q Consensus 423 ~~~~~C~~C~~~f~~~~~l~~H~ 445 (537)
...|+|+.|...|-.--+...|.
T Consensus 386 s~rY~Ce~CK~~FC~dCdvfiHe 408 (421)
T COG5151 386 SGRYQCELCKSTFCSDCDVFIHE 408 (421)
T ss_pred ccceechhhhhhhhhhhHHHHHH
Confidence 34688888888888777777775
No 134
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=37.86 E-value=20 Score=20.06 Aligned_cols=13 Identities=15% Similarity=0.306 Sum_probs=5.4
Q ss_pred Cceeecccchhhc
Q psy12165 165 ESVHICHLCQKQF 177 (537)
Q Consensus 165 ~~~~~C~~C~~~f 177 (537)
...|.|..|+..+
T Consensus 13 ~~~Y~C~~Cdf~l 25 (30)
T PF07649_consen 13 GWFYRCSECDFDL 25 (30)
T ss_dssp --EEE-TTT----
T ss_pred CceEECccCCCcc
Confidence 3679999998764
No 135
>KOG4167|consensus
Probab=37.43 E-value=18 Score=37.45 Aligned_cols=33 Identities=36% Similarity=0.686 Sum_probs=28.1
Q ss_pred CCCCCCCccccccchhccCCHHHHHHHHHhccC
Q psy12165 201 PEPDEDGIFRCKKCVKVFVNPDIYKKHLKHSHT 233 (537)
Q Consensus 201 ~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 233 (537)
.+++.++.|.|.+|++.|.....+..||+.|..
T Consensus 785 ~~~~~~giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 785 GETDPTGIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred CCCCCCceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 345567899999999999999999999987763
No 136
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=36.50 E-value=1.6e+02 Score=24.51 Aligned_cols=54 Identities=24% Similarity=0.429 Sum_probs=34.1
Q ss_pred CCcccCCCCCCCCChHHHHHHHHhhcCCCCcccccc--cccccCCHHHHHHHHHhhCCC
Q psy12165 424 LPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGV--CLKISETYEALALHLDQAHPG 480 (537)
Q Consensus 424 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~~H~~ 480 (537)
....||.|--........ .- .+.+.+.++-.|.. |.++ .+..+|+.|.+..|+.
T Consensus 79 ~~L~CPLCRG~V~GWtvv-e~-AR~~LN~K~RsC~~e~C~F~-GtY~eLrKHar~~HP~ 134 (162)
T PF07800_consen 79 PELACPLCRGEVKGWTVV-EP-ARRFLNAKKRSCSQESCSFS-GTYSELRKHARSEHPS 134 (162)
T ss_pred ccccCccccCceeceEEc-hH-HHHHhccCCccCcccccccc-cCHHHHHHHHHhhCCC
Confidence 457899995433222222 22 24444566655644 7665 5788999999999987
No 137
>PF14353 CpXC: CpXC protein
Probab=36.21 E-value=25 Score=28.20 Aligned_cols=18 Identities=33% Similarity=0.598 Sum_probs=8.4
Q ss_pred cccccccccCCChHHHHH
Q psy12165 387 VNCRVCGSIFHSEPELKV 404 (537)
Q Consensus 387 ~~C~~C~~~f~~~~~L~~ 404 (537)
|.|+.||..|.-...+..
T Consensus 39 ~~CP~Cg~~~~~~~p~lY 56 (128)
T PF14353_consen 39 FTCPSCGHKFRLEYPLLY 56 (128)
T ss_pred EECCCCCCceecCCCEEE
Confidence 455555555544333333
No 138
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=35.58 E-value=20 Score=28.47 Aligned_cols=26 Identities=35% Similarity=0.931 Sum_probs=17.7
Q ss_pred ccccccccccccChhH-HHHhHhhhcCCccccccccc
Q psy12165 359 PYACRICARLLGNGVK-VSEHIKLHGEHLVNCRVCGS 394 (537)
Q Consensus 359 ~~~C~~C~~~f~~~~~-l~~H~~~H~~~~~~C~~C~~ 394 (537)
|++|..||..|.+.+. |. --|+.||-
T Consensus 1 PH~Ct~Cg~~f~dgs~eil----------~GCP~CGg 27 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGSKEIL----------SGCPECGG 27 (131)
T ss_pred CcccCcCCCCcCCCcHHHH----------ccCcccCC
Confidence 4678888888887552 22 24888874
No 139
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=35.14 E-value=24 Score=36.21 Aligned_cols=9 Identities=44% Similarity=1.095 Sum_probs=4.9
Q ss_pred CcccCCCCC
Q psy12165 425 PVTCDLCSK 433 (537)
Q Consensus 425 ~~~C~~C~~ 433 (537)
|..|+.|+.
T Consensus 253 ~~~Cp~C~s 261 (505)
T TIGR00595 253 PKTCPQCGS 261 (505)
T ss_pred CCCCCCCCC
Confidence 455666653
No 140
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=34.39 E-value=25 Score=22.59 Aligned_cols=37 Identities=16% Similarity=0.217 Sum_probs=17.9
Q ss_pred CCcccCCCCCCCCChHHHHHHHHhhcCCCCccccccc
Q psy12165 424 LPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVC 460 (537)
Q Consensus 424 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C 460 (537)
+.+.|..||..|.-...=+......=-...|-+|.-|
T Consensus 3 k~l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~C 39 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSC 39 (49)
T ss_pred eeEEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHH
Confidence 4566667766665444444443221112234555555
No 141
>KOG4167|consensus
Probab=34.21 E-value=13 Score=38.39 Aligned_cols=28 Identities=25% Similarity=0.386 Sum_probs=25.6
Q ss_pred cccccccccccccccchHHHHHhHHhhC
Q psy12165 7 EEAALSCGNCLKVFRDNLSLLKHVEEEH 34 (537)
Q Consensus 7 ~~~~~~C~~C~~~f~~~~~L~~H~~~~h 34 (537)
.+..|-|.+|++.|.-..++.+||++|.
T Consensus 789 ~~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 789 PTGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CCceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 4678999999999999999999999974
No 142
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=33.96 E-value=4.2 Score=31.31 Aligned_cols=12 Identities=33% Similarity=0.963 Sum_probs=9.8
Q ss_pred ccccccchhccC
Q psy12165 208 IFRCKKCVKVFV 219 (537)
Q Consensus 208 ~~~C~~C~~~f~ 219 (537)
-|.|..|+..+.
T Consensus 90 fy~C~~C~~~w~ 101 (104)
T TIGR01384 90 FYKCTKCGYVWR 101 (104)
T ss_pred EEEeCCCCCeeE
Confidence 599999998764
No 143
>PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=33.91 E-value=29 Score=20.93 Aligned_cols=23 Identities=26% Similarity=0.686 Sum_probs=9.5
Q ss_pred CCcccccchhccc-Cc-HHHHHHHh
Q psy12165 298 PTNYCNFCADLFT-NR-KKLWLHLK 320 (537)
Q Consensus 298 ~~~~C~~C~~~f~-~~-~~l~~H~~ 320 (537)
+.|.|.+|+..|. +. +.-+.|..
T Consensus 2 ~ryyCdyC~~~~~~d~~~~Rk~H~~ 26 (38)
T PF06220_consen 2 PRYYCDYCKKYLTHDSPSIRKQHER 26 (38)
T ss_dssp -S-B-TTT--B-S--SHHHHHHHT-
T ss_pred cCeecccccceecCCChHHHHHhhc
Confidence 4688999999983 44 33356654
No 144
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=33.74 E-value=63 Score=30.39 Aligned_cols=22 Identities=32% Similarity=0.600 Sum_probs=13.3
Q ss_pred CcccccchhcccCcHHHHHHHh
Q psy12165 299 TNYCNFCADLFTNRKKLWLHLK 320 (537)
Q Consensus 299 ~~~C~~C~~~f~~~~~l~~H~~ 320 (537)
.+.|+.|++.|.+...+..|+.
T Consensus 238 ~~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 238 KVYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred ceeeHhhhhHhhhhHHHHHHHh
Confidence 4556666666666666666654
No 145
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=33.36 E-value=26 Score=37.35 Aligned_cols=40 Identities=28% Similarity=0.490 Sum_probs=27.9
Q ss_pred cccccchhcccCcHHHHHHHhhcCCCCCccccCCcccccCCHHHHHHHHHhhccccCCCcccccccccc
Q psy12165 300 NYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARL 368 (537)
Q Consensus 300 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~ 368 (537)
++|+.|+..+. .|.......|.+|+..- ..|..|+.||..
T Consensus 445 ~~Cp~Cd~~lt-----------~H~~~~~L~CH~Cg~~~------------------~~p~~Cp~Cgs~ 484 (730)
T COG1198 445 AECPNCDSPLT-----------LHKATGQLRCHYCGYQE------------------PIPQSCPECGSE 484 (730)
T ss_pred ccCCCCCcceE-----------EecCCCeeEeCCCCCCC------------------CCCCCCCCCCCC
Confidence 34666665543 56667788899888642 477889999865
No 146
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=32.36 E-value=13 Score=26.26 Aligned_cols=20 Identities=25% Similarity=0.421 Sum_probs=16.0
Q ss_pred CCceeecc--cchhhccChHHH
Q psy12165 164 GESVHICH--LCQKQFKQNWLL 183 (537)
Q Consensus 164 ~~~~~~C~--~C~~~f~~~~~l 183 (537)
.+..++|. .||.+|.+...+
T Consensus 24 ~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 24 KERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred heeeeecCCCCCCCEEEEEEEE
Confidence 57789998 899999876543
No 147
>PRK04023 DNA polymerase II large subunit; Validated
Probab=32.02 E-value=47 Score=36.32 Aligned_cols=11 Identities=27% Similarity=0.661 Sum_probs=6.1
Q ss_pred CcccCCCCCCC
Q psy12165 425 PVTCDLCSKEW 435 (537)
Q Consensus 425 ~~~C~~C~~~f 435 (537)
++.|+.|+..-
T Consensus 663 ~y~CPKCG~El 673 (1121)
T PRK04023 663 EDECEKCGREP 673 (1121)
T ss_pred CCcCCCCCCCC
Confidence 35566666544
No 148
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=30.89 E-value=23 Score=28.74 Aligned_cols=13 Identities=15% Similarity=0.378 Sum_probs=8.9
Q ss_pred CCceeecccchhh
Q psy12165 164 GESVHICHLCQKQ 176 (537)
Q Consensus 164 ~~~~~~C~~C~~~ 176 (537)
....+.||.||..
T Consensus 104 ~~~~~~CP~Cgs~ 116 (135)
T PRK03824 104 VHAFLKCPKCGSR 116 (135)
T ss_pred cccCcCCcCCCCC
Confidence 3455779999853
No 149
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=30.69 E-value=15 Score=27.09 Aligned_cols=15 Identities=27% Similarity=0.729 Sum_probs=10.9
Q ss_pred CCceeecccchhhcc
Q psy12165 164 GESVHICHLCQKQFK 178 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~ 178 (537)
....++|..|++.|.
T Consensus 50 a~GIW~C~~C~~~~A 64 (90)
T PF01780_consen 50 ATGIWKCKKCGKKFA 64 (90)
T ss_dssp ETTEEEETTTTEEEE
T ss_pred eeEEeecCCCCCEEe
Confidence 355688888888775
No 150
>KOG2593|consensus
Probab=30.20 E-value=40 Score=32.87 Aligned_cols=33 Identities=18% Similarity=0.389 Sum_probs=15.9
Q ss_pred CcccCCCCCCCCChHHHHHHHHhhcCCCCcccccccc
Q psy12165 425 PVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCL 461 (537)
Q Consensus 425 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~ 461 (537)
.|.|+.|+++|+..-.++. .-...-.|.|..|+
T Consensus 128 ~Y~Cp~C~kkyt~Lea~~L----~~~~~~~F~C~~C~ 160 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEALQL----LDNETGEFHCENCG 160 (436)
T ss_pred cccCCccccchhhhHHHHh----hcccCceEEEecCC
Confidence 4666666666654444422 11222345566664
No 151
>KOG3408|consensus
Probab=30.00 E-value=40 Score=26.29 Aligned_cols=27 Identities=30% Similarity=0.429 Sum_probs=24.4
Q ss_pred CCceeecccchhhccChHHHHHHHhhc
Q psy12165 164 GESVHICHLCQKQFKQNWLLQKHKKTC 190 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~~~~~l~~H~~~h 190 (537)
|...|.|-.|.+.|.+...|..|.+.-
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~ktK 80 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKTK 80 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhcc
Confidence 667799999999999999999999764
No 152
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=29.29 E-value=19 Score=20.39 Aligned_cols=8 Identities=25% Similarity=0.808 Sum_probs=3.6
Q ss_pred CcccCCCC
Q psy12165 425 PVTCDLCS 432 (537)
Q Consensus 425 ~~~C~~C~ 432 (537)
.|.|+.|+
T Consensus 19 ~~vCp~C~ 26 (30)
T PF08274_consen 19 LLVCPECG 26 (30)
T ss_dssp SEEETTTT
T ss_pred EEeCCccc
Confidence 34444444
No 153
>KOG2272|consensus
Probab=29.14 E-value=28 Score=30.90 Aligned_cols=14 Identities=14% Similarity=0.261 Sum_probs=7.0
Q ss_pred cccCCcccccCCHH
Q psy12165 329 CMCSACFVVEDNLS 342 (537)
Q Consensus 329 ~~C~~C~~~~~~~~ 342 (537)
|.|.+|.+...+..
T Consensus 100 F~Cd~Cn~~Lad~g 113 (332)
T KOG2272|consen 100 FRCDLCNKHLADQG 113 (332)
T ss_pred chhHHHHHHHhhhh
Confidence 45555555444433
No 154
>PRK04023 DNA polymerase II large subunit; Validated
Probab=28.77 E-value=47 Score=36.32 Aligned_cols=24 Identities=25% Similarity=0.523 Sum_probs=15.0
Q ss_pred CccccccccccccChhHHHHhHhhhcCCccccccccccC
Q psy12165 358 KPYACRICARLLGNGVKVSEHIKLHGEHLVNCRVCGSIF 396 (537)
Q Consensus 358 ~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~~f 396 (537)
..+.|+.|+.... ++.|+.||..-
T Consensus 650 ~i~fCP~CG~~~~---------------~y~CPKCG~El 673 (1121)
T PRK04023 650 PVYRCPRCGIEVE---------------EDECEKCGREP 673 (1121)
T ss_pred cceeCccccCcCC---------------CCcCCCCCCCC
Confidence 5567777754432 46688887543
No 155
>PRK14873 primosome assembly protein PriA; Provisional
Probab=27.42 E-value=35 Score=36.25 Aligned_cols=10 Identities=20% Similarity=0.660 Sum_probs=6.4
Q ss_pred CcccCCCCCC
Q psy12165 425 PVTCDLCSKE 434 (537)
Q Consensus 425 ~~~C~~C~~~ 434 (537)
++.|+.|+..
T Consensus 422 p~~Cp~Cgs~ 431 (665)
T PRK14873 422 DWRCPRCGSD 431 (665)
T ss_pred CccCCCCcCC
Confidence 5677777643
No 156
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=26.82 E-value=46 Score=21.90 Aligned_cols=27 Identities=22% Similarity=0.762 Sum_probs=20.6
Q ss_pred CccccccCCcccChHHHHHHHHhcCCCCCCCCccccccCCCceeecccchhhc
Q psy12165 125 DVTCRICSKVSSSRGHLEKHLIAHDNPNTEVDEPQMNEFGESVHICHLCQKQF 177 (537)
Q Consensus 125 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f 177 (537)
.-+|+.|+..|.. ++....|+.|+..+
T Consensus 5 ~~~C~~Cg~~~~~--------------------------~dDiVvCp~Cgapy 31 (54)
T PF14446_consen 5 GCKCPVCGKKFKD--------------------------GDDIVVCPECGAPY 31 (54)
T ss_pred CccChhhCCcccC--------------------------CCCEEECCCCCCcc
Confidence 3578889888873 46678899998765
No 157
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=26.73 E-value=31 Score=25.52 Aligned_cols=16 Identities=19% Similarity=0.449 Sum_probs=11.8
Q ss_pred CCceeecccchhhccC
Q psy12165 164 GESVHICHLCQKQFKQ 179 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~~ 179 (537)
....+.|..|+++|.-
T Consensus 51 a~GIW~C~~C~~~~AG 66 (90)
T PTZ00255 51 AVGIWRCKGCKKTVAG 66 (90)
T ss_pred eeEEEEcCCCCCEEeC
Confidence 4567888888887754
No 158
>KOG0717|consensus
Probab=26.68 E-value=35 Score=33.53 Aligned_cols=23 Identities=30% Similarity=0.538 Sum_probs=21.4
Q ss_pred cccccccccccchHHHHHhHHhh
Q psy12165 11 LSCGNCLKVFRDNLSLLKHVEEE 33 (537)
Q Consensus 11 ~~C~~C~~~f~~~~~L~~H~~~~ 33 (537)
+.|..|+++|.+...|..|..+.
T Consensus 293 lyC~vCnKsFKseKq~kNHEnSK 315 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENSK 315 (508)
T ss_pred eEEeeccccccchHHHHhhHHHH
Confidence 89999999999999999998764
No 159
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=25.99 E-value=35 Score=17.89 Aligned_cols=6 Identities=33% Similarity=1.099 Sum_probs=3.1
Q ss_pred ccccCC
Q psy12165 128 CRICSK 133 (537)
Q Consensus 128 C~~C~~ 133 (537)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 555554
No 160
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=25.95 E-value=43 Score=25.14 Aligned_cols=27 Identities=26% Similarity=0.733 Sum_probs=16.8
Q ss_pred ccccccccccccChhHHHHhHhhhcCCccccccccc
Q psy12165 359 PYACRICARLLGNGVKVSEHIKLHGEHLVNCRVCGS 394 (537)
Q Consensus 359 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~C~~C~~ 394 (537)
|..|..||..|.+.+.... --|+.||-
T Consensus 2 pH~CtrCG~vf~~g~~~il---------~GCp~CG~ 28 (112)
T COG3364 2 PHQCTRCGEVFDDGSEEIL---------SGCPKCGC 28 (112)
T ss_pred CceecccccccccccHHHH---------ccCccccc
Confidence 4577888888877543322 24777764
No 161
>KOG3408|consensus
Probab=25.57 E-value=68 Score=25.04 Aligned_cols=28 Identities=29% Similarity=0.708 Sum_probs=23.8
Q ss_pred CCCCCCccccccchhccCCHHHHHHHHH
Q psy12165 202 EPDEDGIFRCKKCVKVFVNPDIYKKHLK 229 (537)
Q Consensus 202 ~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 229 (537)
+-+..+.|.|-.|.+-|.+...|..|.+
T Consensus 51 dlPG~GqfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 51 DLPGGGQFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred CCCCCceeehhhhhhhhcchHHHHHHHh
Confidence 3345567999999999999999999976
No 162
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=25.45 E-value=29 Score=25.66 Aligned_cols=16 Identities=19% Similarity=0.617 Sum_probs=11.7
Q ss_pred CCceeecccchhhccC
Q psy12165 164 GESVHICHLCQKQFKQ 179 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~~ 179 (537)
+...+.|..|+++|.-
T Consensus 50 a~GIW~C~~C~~~~AG 65 (91)
T TIGR00280 50 STGIWTCRKCGAKFAG 65 (91)
T ss_pred eeEEEEcCCCCCEEeC
Confidence 4667888888887753
No 163
>PRK14873 primosome assembly protein PriA; Provisional
Probab=25.26 E-value=40 Score=35.81 Aligned_cols=28 Identities=14% Similarity=0.210 Sum_probs=18.2
Q ss_pred cCCCCCccccCCcccccCCHHHHHHHHHhhccccCCCcccccccccc
Q psy12165 322 THKDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARL 368 (537)
Q Consensus 322 ~H~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~ 368 (537)
.|.......|.+|++. ..++.|+.|+..
T Consensus 404 ~h~~~~~l~Ch~CG~~-------------------~~p~~Cp~Cgs~ 431 (665)
T PRK14873 404 LPSAGGTPRCRWCGRA-------------------APDWRCPRCGSD 431 (665)
T ss_pred EecCCCeeECCCCcCC-------------------CcCccCCCCcCC
Confidence 3445556778888753 246788888754
No 164
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=25.03 E-value=47 Score=34.09 Aligned_cols=39 Identities=28% Similarity=0.484 Sum_probs=25.0
Q ss_pred cccccchhcccCcHHHHHHHhhcCCCCCccccCCcccccCCHHHHHHHHHhhccccCCCccccccccc
Q psy12165 300 NYCNFCADLFTNRKKLWLHLKSTHKDLTNCMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICAR 367 (537)
Q Consensus 300 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~ 367 (537)
.+|+.|+-... .|.......|.+|++.. .-+..|+.|+.
T Consensus 223 ~~C~~C~~~l~-----------~h~~~~~l~Ch~Cg~~~------------------~~~~~Cp~C~s 261 (505)
T TIGR00595 223 LCCPNCDVSLT-----------YHKKEGKLRCHYCGYQE------------------PIPKTCPQCGS 261 (505)
T ss_pred cCCCCCCCceE-----------EecCCCeEEcCCCcCcC------------------CCCCCCCCCCC
Confidence 34677765543 45556677788887653 35677888865
No 165
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=24.63 E-value=20 Score=23.62 Aligned_cols=32 Identities=19% Similarity=0.303 Sum_probs=21.1
Q ss_pred ccccCCCcCcCCChHHHHHHHHhhcCCCCCccccccCCccc
Q psy12165 96 MYVCDLCPKYYSRSTHLWKHVNRVHKGHPDVTCRICSKVSS 136 (537)
Q Consensus 96 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 136 (537)
.+.|+.|+..+.-.... .+ ....|+.|+..+-
T Consensus 2 ~~~CP~CG~~iev~~~~--------~G-eiV~Cp~CGaele 33 (54)
T TIGR01206 2 QFECPDCGAEIELENPE--------LG-ELVICDECGAELE 33 (54)
T ss_pred ccCCCCCCCEEecCCCc--------cC-CEEeCCCCCCEEE
Confidence 46899999877443211 12 2578999998765
No 166
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=24.53 E-value=9.9 Score=29.71 Aligned_cols=40 Identities=10% Similarity=0.057 Sum_probs=21.1
Q ss_pred CcccCCCCCCCCChHHHHHHHHhhcCCCCcccccccccccC
Q psy12165 425 PVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLACGVCLKISE 465 (537)
Q Consensus 425 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~ 465 (537)
.+.|+.||..=.....|+.-- ..-....-|.|..||..+.
T Consensus 72 ~~~CpkCg~~ea~y~~~QtRs-aDEp~T~Fy~C~~Cg~~wr 111 (113)
T COG1594 72 KEKCPKCGNKEAYYWQLQTRS-ADEPETRFYKCTRCGYRWR 111 (113)
T ss_pred cccCCCCCCceeEEEeeehhc-cCCCceEEEEecccCCEee
Confidence 477888886433332222221 1111223388988888765
No 167
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=24.15 E-value=39 Score=21.60 Aligned_cols=26 Identities=35% Similarity=0.634 Sum_probs=15.2
Q ss_pred cccCCCCC-CCCChHHHHHHHHhhcCCCCcccccccccc
Q psy12165 426 VTCDLCSK-EWPSRKFMWQHLIRSHKAVAGLACGVCLKI 463 (537)
Q Consensus 426 ~~C~~C~~-~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~ 463 (537)
-.||.||- +| |..| .+ .|.|..|++.
T Consensus 20 ~~CPrCG~gvf-----mA~H-----~d--R~~CGkCgyT 46 (51)
T COG1998 20 RFCPRCGPGVF-----MADH-----KD--RWACGKCGYT 46 (51)
T ss_pred ccCCCCCCcch-----hhhc-----Cc--eeEeccccce
Confidence 45777775 34 3333 22 5778888764
No 168
>COG1773 Rubredoxin [Energy production and conversion]
Probab=23.71 E-value=30 Score=22.86 Aligned_cols=14 Identities=29% Similarity=0.624 Sum_probs=10.8
Q ss_pred CCcccccccccccC
Q psy12165 487 DNPTCEVCGRYHNA 500 (537)
Q Consensus 487 ~~~~C~~C~~~f~~ 500 (537)
+.|+|.+||+.|.-
T Consensus 2 ~~~~C~~CG~vYd~ 15 (55)
T COG1773 2 KRWRCSVCGYVYDP 15 (55)
T ss_pred CceEecCCceEecc
Confidence 36889999988854
No 169
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=22.88 E-value=64 Score=36.42 Aligned_cols=10 Identities=30% Similarity=0.757 Sum_probs=7.2
Q ss_pred cccccccccc
Q psy12165 386 LVNCRVCGSI 395 (537)
Q Consensus 386 ~~~C~~C~~~ 395 (537)
+|.|+.||..
T Consensus 692 vy~CPsCGae 701 (1337)
T PRK14714 692 VYVCPDCGAE 701 (1337)
T ss_pred ceeCccCCCc
Confidence 5678888765
No 170
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=22.88 E-value=78 Score=35.77 Aligned_cols=11 Identities=27% Similarity=0.498 Sum_probs=7.6
Q ss_pred CcccCCCCCCC
Q psy12165 425 PVTCDLCSKEW 435 (537)
Q Consensus 425 ~~~C~~C~~~f 435 (537)
+|.|+.|+...
T Consensus 692 vy~CPsCGaev 702 (1337)
T PRK14714 692 VYVCPDCGAEV 702 (1337)
T ss_pred ceeCccCCCcc
Confidence 57777777754
No 171
>KOG4124|consensus
Probab=22.69 E-value=14 Score=34.37 Aligned_cols=26 Identities=27% Similarity=0.680 Sum_probs=22.0
Q ss_pred CCCcccccccccccccCHHHHHHHHH
Q psy12165 263 NEEYKCLIPGCKKYFRTKLDCQKHCR 288 (537)
Q Consensus 263 ~~~~~C~~~~C~~~f~~~~~l~~H~~ 288 (537)
.++|+|.+|.|.+.+.....|..|..
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~ 372 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKL 372 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccc
Confidence 57899999999999998888877754
No 172
>PRK05580 primosome assembly protein PriA; Validated
Probab=22.60 E-value=51 Score=35.38 Aligned_cols=46 Identities=28% Similarity=0.631 Sum_probs=0.0
Q ss_pred cccCCcccccCCHHHHHHHHHhhccccCCCccccccccccccChhHHHHhHhhh-cCCccccccccccCCChHHHHHHHH
Q psy12165 329 CMCSACFVVEDNLSALEAHVAAKHKAIIKKPYACRICARLLGNGVKVSEHIKLH-GEHLVNCRVCGSIFHSEPELKVHLE 407 (537)
Q Consensus 329 ~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H-~~~~~~C~~C~~~f~~~~~L~~H~~ 407 (537)
..|..|+ ....|+.|+.... .| ......|..||+..
T Consensus 382 ~~C~~Cg----------------------~~~~C~~C~~~l~----------~h~~~~~l~Ch~Cg~~~----------- 418 (679)
T PRK05580 382 LLCRDCG----------------------WVAECPHCDASLT----------LHRFQRRLRCHHCGYQE----------- 418 (679)
T ss_pred eEhhhCc----------------------CccCCCCCCCcee----------EECCCCeEECCCCcCCC-----------
Q ss_pred hccCCCCCccCCCCCCCCcccCCCC
Q psy12165 408 EHTDVPETKEQKPRPELPVTCDLCS 432 (537)
Q Consensus 408 ~h~~~~~~~~~~~~~~~~~~C~~C~ 432 (537)
..|..|+.|+
T Consensus 419 ---------------~~~~~Cp~Cg 428 (679)
T PRK05580 419 ---------------PIPKACPECG 428 (679)
T ss_pred ---------------CCCCCCCCCc
No 173
>KOG1842|consensus
Probab=22.39 E-value=48 Score=32.39 Aligned_cols=29 Identities=24% Similarity=0.359 Sum_probs=26.3
Q ss_pred cccccccccccccchHHHHHhHHhhCccc
Q psy12165 9 AALSCGNCLKVFRDNLSLLKHVEEEHPTM 37 (537)
Q Consensus 9 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~ 37 (537)
..|.||+|...|.+...|..|+...|+..
T Consensus 14 egflCPiC~~dl~~~~~L~~H~d~eH~~e 42 (505)
T KOG1842|consen 14 EGFLCPICLLDLPNLSALNDHLDVEHFEE 42 (505)
T ss_pred hcccCchHhhhhhhHHHHHHHHhhhcccc
Confidence 57999999999999999999999998643
No 174
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=22.11 E-value=20 Score=19.93 Aligned_cols=17 Identities=24% Similarity=0.606 Sum_probs=7.8
Q ss_pred ccccccccccChhHHHHh
Q psy12165 361 ACRICARLLGNGVKVSEH 378 (537)
Q Consensus 361 ~C~~C~~~f~~~~~l~~H 378 (537)
.|-.|++.| .......|
T Consensus 2 sCiDC~~~F-~~~~y~~H 18 (28)
T PF08790_consen 2 SCIDCSKDF-DGDSYKSH 18 (28)
T ss_dssp EETTTTEEE-EGGGTTT-
T ss_pred eeecCCCCc-CcCCcCCC
Confidence 455555555 33344444
No 175
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=21.92 E-value=47 Score=20.15 Aligned_cols=13 Identities=15% Similarity=0.480 Sum_probs=10.9
Q ss_pred CcccCCCCCCCCC
Q psy12165 425 PVTCDLCSKEWPS 437 (537)
Q Consensus 425 ~~~C~~C~~~f~~ 437 (537)
||.|+.|+..|-.
T Consensus 12 ~f~C~~C~~~FC~ 24 (39)
T smart00154 12 GFKCRHCGNLFCG 24 (39)
T ss_pred CeECCccCCcccc
Confidence 7999999998844
No 176
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=21.77 E-value=1e+02 Score=20.48 Aligned_cols=18 Identities=22% Similarity=0.128 Sum_probs=11.4
Q ss_pred ccccccccccCCHHHHHH
Q psy12165 455 LACGVCLKISETYEALAL 472 (537)
Q Consensus 455 ~~C~~C~~~f~~~~~l~~ 472 (537)
+.|-+|+..|.+...|..
T Consensus 28 ~YC~~Cg~~Y~d~~dL~~ 45 (55)
T PF13821_consen 28 NYCFWCGTKYDDEEDLER 45 (55)
T ss_pred ceeeeeCCccCCHHHHHh
Confidence 556666666666666654
No 177
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=21.51 E-value=54 Score=26.17 Aligned_cols=55 Identities=22% Similarity=0.556 Sum_probs=31.9
Q ss_pred CCCcccCCCCCCCCChHHHHHHHHhhcCCCCcccc-c--ccccccCCHHHHHHHHHhhCCCCcccccCCccccccccccc
Q psy12165 423 ELPVTCDLCSKEWPSRKFMWQHLIRSHKAVAGLAC-G--VCLKISETYEALALHLDQAHPGYFAEEVDNPTCEVCGRYHN 499 (537)
Q Consensus 423 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C-~--~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~C~~C~~~f~ 499 (537)
.+.|+|++|..+.....-| +|=.| + .|+.- ...|+.|-. .|+ .|++|.-+|.
T Consensus 78 ~~lYeCnIC~etS~ee~FL-----------KPneCCgY~iCn~C---ya~LWK~~~-~yp----------vCPvCkTSFK 132 (140)
T PF05290_consen 78 PKLYECNICKETSAEERFL-----------KPNECCGYSICNAC---YANLWKFCN-LYP----------VCPVCKTSFK 132 (140)
T ss_pred CCceeccCcccccchhhcC-----------CcccccchHHHHHH---HHHHHHHcc-cCC----------CCCccccccc
Confidence 3568888888776443222 33222 1 13222 356777765 554 3888888887
Q ss_pred Chh
Q psy12165 500 ARP 502 (537)
Q Consensus 500 ~~~ 502 (537)
+..
T Consensus 133 ss~ 135 (140)
T PF05290_consen 133 SSS 135 (140)
T ss_pred ccc
Confidence 754
No 178
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=21.10 E-value=38 Score=21.93 Aligned_cols=12 Identities=25% Similarity=0.506 Sum_probs=7.3
Q ss_pred Cccccccccccc
Q psy12165 453 AGLACGVCLKIS 464 (537)
Q Consensus 453 ~~~~C~~C~~~f 464 (537)
..+.|..|++.+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 456677776643
No 179
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=20.57 E-value=97 Score=19.02 Aligned_cols=25 Identities=24% Similarity=0.396 Sum_probs=16.3
Q ss_pred CcccccccccccC--hhHHHHHHhhhh
Q psy12165 488 NPTCEVCGRYHNA--RPKLESHLAVHV 512 (537)
Q Consensus 488 ~~~C~~C~~~f~~--~~~l~~H~~~h~ 512 (537)
.-.|+.||-.|+. ..+-..|.+-|.
T Consensus 13 ~~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 13 ATTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred CcCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 3578888866654 556666766664
No 180
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.48 E-value=54 Score=24.95 Aligned_cols=12 Identities=17% Similarity=0.376 Sum_probs=7.3
Q ss_pred CCcccCCCCCCC
Q psy12165 424 LPVTCDLCSKEW 435 (537)
Q Consensus 424 ~~~~C~~C~~~f 435 (537)
.|..|++||++|
T Consensus 25 dPiVsPytG~s~ 36 (129)
T COG4530 25 DPIVSPYTGKSY 36 (129)
T ss_pred CccccCcccccc
Confidence 356666666666
No 181
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=20.48 E-value=40 Score=24.96 Aligned_cols=16 Identities=25% Similarity=0.702 Sum_probs=11.5
Q ss_pred CCceeecccchhhccC
Q psy12165 164 GESVHICHLCQKQFKQ 179 (537)
Q Consensus 164 ~~~~~~C~~C~~~f~~ 179 (537)
+...+.|..|++.|.-
T Consensus 51 a~GIW~C~~C~~~~AG 66 (90)
T PRK03976 51 GTGIWECRKCGAKFAG 66 (90)
T ss_pred EEEEEEcCCCCCEEeC
Confidence 4566888888887753
No 182
>KOG1842|consensus
Probab=20.16 E-value=53 Score=32.11 Aligned_cols=29 Identities=21% Similarity=0.308 Sum_probs=24.1
Q ss_pred CCcccCCCCCCCCChHHHHHHHHhhcCCC
Q psy12165 424 LPVTCDLCSKEWPSRKFMWQHLIRSHKAV 452 (537)
Q Consensus 424 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 452 (537)
..|-||+|...|.+...|..|+...|.++
T Consensus 14 egflCPiC~~dl~~~~~L~~H~d~eH~~e 42 (505)
T KOG1842|consen 14 EGFLCPICLLDLPNLSALNDHLDVEHFEE 42 (505)
T ss_pred hcccCchHhhhhhhHHHHHHHHhhhcccc
Confidence 35889999999999999999988888654
Done!